Fault propagation analysis based variable length checkpoint placement for fault-tolerant parallel and distributed systems
- 23 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
No abstract availableKeywords
This publication has 8 references indexed in Scilit:
- Application-transparent process-level error recovery for multicomputersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Independent checkpointing and concurrent rollback for recovery in distributed systems-an optimistic approachPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Analysis of checkpointing schemes for multiprocessor systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fault-Tolerant Matrix Operations for Networks of Workstations Using Diskless CheckpointingJournal of Parallel and Distributed Computing, 1997
- Roll-forward checkpointing scheme: a novel fault-tolerant architectureIEEE Transactions on Computers, 1994
- Use of common time base for checkpointing and rollback recovery in a distributed systemIEEE Transactions on Software Engineering, 1993
- Rollback recovery in distributed systems using loosely synchronized clocksIEEE Transactions on Parallel and Distributed Systems, 1992
- Selection of a checkpoint interval in a critical-task environmentIEEE Transactions on Reliability, 1988