Efficient collective communication in distributed heterogeneous systems
- 20 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The Information Power Grid (IPG) is emerging as an in- frastructure that will enable distributed applications - s uch as video conferencing and distributed interactive simula- tion - to seamlessly integrate collections of heterogeneou s workstations, multiprocessors, and mobile nodes, over het - erogeneous wide-area networks. This paper introduces a framework for developing efficient collective communica- tion schedules in such systems. Our framework consists of analytical models of the heterogeneous system, scheduling algorithms for the collective communication pattern, and performance evaluation mechanisms. We show that previ- ous models, which considered node heterogeneity but ig- nored network heterogeneity, can lead to solutions which are worse than the optimal by an unbounded factor. We then introduce an enhanced communication model, and de- velop three heuristic algorithms for the broadcast and mul- ticast patterns. The completion time of the schedule is cho- sen as the performance metric. The heuristic algorithms are FEF (Fastest Edge First), ECEF (Earliest Completing Edge First), and ECEF with look-ahead. For small system sizes, we find the optimal solution using exhaustive search. Our simulation experiments indicate that the performance of ou r heuristic algorithms is close to optimal. For performance evaluation of larger systems, we have also developed a sim- ple lower bound on the completion time. Our heuristic algo- rithms achieve significant performance improvements over previous approaches.Keywords
This publication has 16 references indexed in Scilit:
- ECO: Efficient Collective Operations for communication on heterogeneous networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- All-to-all communication on meshes with wormhole routingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A mathematical model, heuristic, and simulation study for a basic data staging problem in a heterogeneous networking environmentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The delay-constrained minimum spanning tree problemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Globus: a Metacomputing Infrastructure ToolkitThe International Journal of Supercomputer Applications and High Performance Computing, 1997
- Efficient Message Passing Interface (MPI) for Parallel Computing on Clusters of WorkstationsJournal of Parallel and Distributed Computing, 1997
- High-performance computing for visionProceedings of the IEEE, 1996
- CCL: a portable and tunable collective communication library for scalable parallel computersIEEE Transactions on Parallel and Distributed Systems, 1995
- Core based trees (CBT)Published by Association for Computing Machinery (ACM) ,1993
- Efficient algorithms for finding minimum spanning trees in undirected and directed graphsCombinatorica, 1986