Application-bypass broadcast in MPICH over GM
- 1 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Processes of a parallel program can become unsynchronized, or skewed, during the course of running an application. Processes can become skewed as a result of unbalanced or asymmetric rode, or through the use of heterogeneous systems, where nodes in the system have different performance characteristics, as well as random, unpredictable effects such as the processes not being started at exactly the same time, or processors receiving interrupts during computation. Geographically distributed systems may have more severe skew because of variable communication times. Such skew can have a significant impact on the performance of collective communication operations which impose an implicit synchronization. The broadcast operation in MPICH is one such operation. An application-bypass broadcast operation is one which does not depend on the application running at a process to make progress. Such an operation would not be as sensitive to process skew. This paper describes the design and implementation of an application-bypass broadcast operation. We evaluated the implementation and find a factor of improvement of up to 16 for application-bypass broadcast compared to non-application-bypass broadcast when processes are skewed. Furthermore we see that as the system size increases, the effects of skew on non-application-bypass broadcast also increase. The application-bypass broadcast is much less sensitive to process skew which makes it more scalable than the non-application-bypass broadcast operation.Keywords
This publication has 8 references indexed in Scilit:
- Efficient reliable multicast on MyrinetPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Where to provide support for efficient multicasting in irregular networks: network interface or switch?Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Efficient multicast on Myrinet using link-level flow controlPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Optimal multicast with packetization and network interface supportPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fast NIC-based barrier over Myrinet/GMPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination MessagesPublished by Springer Nature ,2000
- A high-performance, portable implementation of the MPI message passing interface standardParallel Computing, 1996
- Myrinet: a gigabit-per-second local area networkIEEE Micro, 1995