Communication characteristics of large-scale scientific applications for contemporary cluster architectures
- 1 January 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
This paper examines the explicit communication characteristics of several sophisticated scientific applications, which, by themselves, constitute a representative suite of publicly available benchmarks for large cluster architectures. By focusing on the Message Passing Interface (MPI) and by using hardware counters on the microprocessor, we observe each application's inherent behavioral characteristics: point-to-point and collective communication, and floating point operations. Furthermore, we explore the sensitivities of these characteristics to both problem size and number of processors. Our analysis reveals several striking similarities across our diverse set of applications including the use of collective operations, especially those collectives with very small data payloads. We also highlight a trend of novel applications parting with regimented, static communication patterns in favor of dynamically evolving patterns, as evidenced by our experiments on applications that use implicit linear solvers and adaptive mesh refinement. Overall, our study contributes a better understanding or the requirements of current and emerging paradigms of scientific computing in terms of their computation and communication demands.Keywords
This publication has 13 references indexed in Scilit:
- The SPLASH-2 programs: characterization and methodological considerationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A general predictive performance model for wavefront algorithms on clusters of SMPsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Demonstrating the scalability of a molecular dynamics application on a Petaflop computerPublished by Association for Computing Machinery (ACM) ,2001
- Performance evaluation of the IBM SP and the Compaq AlphaServer SCPublished by Association for Computing Machinery (ACM) ,2000
- Performance analysis of the Alpha 21264-based Compaq ES40 systemPublished by Association for Computing Machinery (ACM) ,2000
- MPI versus MPI+OpenMP on the IBM SP for the NAS BenchmarksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Performance Modeling and Tuning of an Unstructured Mesh CFD ApplicationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Semicoarsening Multigrid on Distributed Memory MachinesSIAM Journal on Scientific Computing, 2000
- Portable profiling and tracing for parallel, scientific applications using C++Published by Association for Computing Machinery (ACM) ,1998
- NAS parallel benchmark resultsIEEE Parallel & Distributed Technology: Systems & Applications, 1993