The Case of the Missing Supercomputer Performance
Top Cited Papers
- 15 November 2003
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
In this paper we describe how we improved the effective performance of ASCI Q, the world's second-fastest supercomputer, to meet our expectations. Using an arsenal of performance-analysis techniques including analytical models, custom microbenchmarks, full applications, and simulators, we succeeded in observing a serious-but previously undetected-performance problem. We identified the source of the problem, eliminated the problem, and "closed the loop" by demonstrating up to a factor of 2 improvement in application performance. We present our methodology and provide insight into performance analysis that is immediately applicable to other large-scale supercomputers.Keywords
This publication has 4 references indexed in Scilit:
- BCS-MPIPublished by Association for Computing Machinery (ACM) ,2003
- The Quadrics network: high-performance clustering technologyIEEE Micro, 2002
- Predictive performance and scalability modeling of a large-scale applicationPublished by Association for Computing Machinery (ACM) ,2001
- IMPROVED RESOURCE UTILIZATION WITH BUFFERED COSCHEDULINGParallel Algorithms and Applications, 2001