Operating system issues for petascale systems
- 1 April 2006
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGOPS Operating Systems Review
- Vol. 40 (2) , 29-33
- https://doi.org/10.1145/1131322.1131332
Abstract
Petascale supercomputers will be available by 2008. The largest machine of these complex leadership-class machines will probably have nearly 250K CPUs. These massively parallel systems have a number of challenging operating system issues. In this paper, we focus on the issues most important for the system that will first breach the petaflop barrier: synchronization and collective operations, parallel I/O, and fault tolerance.Keywords
This publication has 4 references indexed in Scilit:
- The Soft Error Problem: An Architectural PerspectivePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- The Impact of Noise on the Scaling of Collectives: A Theoretical ApproachPublished by Springer Nature ,2005
- The Case of the Missing Supercomputer PerformancePublished by Association for Computing Machinery (ACM) ,2003
- Improving the Scalability of Parallel Jobs by adding Parallel Awareness to the Operating SystemPublished by Association for Computing Machinery (ACM) ,2003