Application-bypass reduction for large-scale clusters

1 January 2003

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 404-411
https://doi.org/10.1109/clustr.2003.1253340

Abstract

Process skew is an important factor in the performance of parallel applications, especially in large-scale clusters. Reduction is a common collective operation which, by its nature, introduces implicit synchronization between the processes involved in the communication and is therefore highly susceptible to performance degradation due to process skew. A collective operation with application-bypass does not require the application to block in order for the operation to make progress. Application-bypass collective operations are therefore highly tolerant of skew. In this paper we describe the design and implementation of an application-bypass version of the reduction operation in MPICH over GM. We evaluate our implementation on a 16-node cluster. Under conditions of process skew we find a factor of improvement of up to 3.3 for our application-bypass reduction versus the default MPICH implementation. In addition, we see that this factor of improvement increases with system size, indicating that the application-bypass implementation is more scalable and skew-tolerant than the default non-application-bypass version. This framework promises design and development of high-performance and scalable collective communication libraries for next-generation large-scale clusters.

Keywords

This publication has 6 references indexed in Scilit:

Performance benefits of NIC-based barrier on myrinet/GM
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Application-bypass broadcast in MPICH over GM
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Fast NIC-based barrier over Myrinet/GM
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Portals 3.0: protocol building blocks for low overhead communication
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
A high-performance, portable implementation of the MPI message passing interface standard
Parallel Computing, 1996
Myrinet: a gigabit-per-second local area network
IEEE Micro, 1995