On the feasibility of incremental checkpointing for scienti .c computing
- 10 June 2004
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
No abstract availableThis publication has 15 references indexed in Scilit:
- ReVive: cost-effective architectural support for rollback recovery in shared-memory multiprocessorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Starfish: fault-tolerant dynamic MPI programs on clusters of workstationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- CATCH-compiler-assisted techniques for checkpointingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fault-tolerance for off-the-shelf applications and hardwarePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Automatic computing: the next era of computingElectronics & Communication Engineering Journal, 2002
- SafetyNet: improving the availability of shared memory multiprocessors with global checkpoint/recoveryPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Memory exclusion: optimizing the performance of checkpointing systemsSoftware: Practice and Experience, 1999
- Diskless checkpointingIEEE Transactions on Parallel and Distributed Systems, 1998
- Managing checkpoints for parallel programsPublished by Springer Nature ,1996
- ickp: a consistent checkpointer for multicomputersIEEE Parallel & Distributed Technology: Systems & Applications, 1994