Proactive Fault Tolerance in MPI Applications Via Task Migration
- 1 January 2006
- book chapter
- Published by Springer Nature
- p. 485-496
- https://doi.org/10.1007/11945918_47
Abstract
No abstract available
This publication has 19 references indexed in Scilit:
- Scalable Cosmological Simulations on Parallel Machines. Published by Springer Nature, 2007
- Fault Tolerance in Message Passing Interface Programs. The International Journal of High Performance Computing Applications, 2004
- Adaptive MPI. Published by Springer Nature, 2004
- Critical event prediction for proactive management in large-scale computer clusters. Published by Association for Computing Machinery (ACM), 2003
- Scaling Molecular Dynamics to 3000 Processors with Projections: A Performance Analysis Case Study. Published by Springer Nature, 2003
- Supporting dynamic parallel object arrays. Concurrency and Computation: Practice and Experience, 2003
- Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations. Cluster Computing, 2003
- CoCheck: checkpointing and process migration for MPI. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- An efficient and transparent thread migration scheme in the PM2 runtime system. Published by Springer Nature, 1999
- Using MPI. Published by MIT Press, 1999