Efficient communication primitives on hypercubes
- 1 September 1992
- journal article
- research article
- Published by Wiley in Concurrency: Practice and Experience
- Vol. 4 (6) , 427-457
- https://doi.org/10.1002/cpe.4330040603
Abstract
We give practical algorithms, complexity analysis and implementation for one‐to‐all broadcasting, all‐to‐all personalized communication and matrix transpose (with two‐dimensional partitioning of the matrix) on hypercubes. We assume the following communication characteristics: circuit‐switched, e‐cube routing and one‐port communication model.For one‐to‐all broadcasting, we give an algorithm that combines the well‐known recursive doubling algorithm[1] and the algorithm based on edgedisjoint spanning trees[2]. The measured times of the combined algorithm are always superior to those of the edge‐disjoint spanning tree algorithm and outperform the recursive doubling algorithm. For all‐to‐all personalized communication we propose a hybrid algorithm that combines the well‐known recursive doubling algorithm[3,4] and the recently proposed direct‐route algorithm[5,6] Our hybrid algorithm balances between data transfer time and start‐up time of these two algorithms, and its communication complexity is estimated to be better than the two previous algorithms for a range of machine parameters. For matrix transpose with two‐dimensional partitioning of the matrix, we relate a two‐phase algorithm to the previous result in Reference 7. The algorithm is predicted to be better than the recursive transpose algorithm[8] by n nearest‐neighbor communications[4]. It takes advantage of circuit‐switched routing and is congestion‐free within each phase. We also suggest a way of storing the matrix such that the transpose operation can be realized in one phase without congestion.Keywords
This publication has 23 references indexed in Scilit:
- Optimal matrix transposition and bit reversal on hypercubes: All-to-all personalized communicationJournal of Parallel and Distributed Computing, 1991
- Optimal communication algorithms for hypercubesJournal of Parallel and Distributed Computing, 1991
- Intensive hypercube communication Prearranged communication in link-bound machinesJournal of Parallel and Distributed Computing, 1990
- Benchmarking the iPSC/2 hypercube multiprocessorConcurrency: Practice and Experience, 1989
- Gaussian elimination on a hypercube automatonJournal of Parallel and Distributed Computing, 1987
- Communication efficient basic linear algebra computations on hypercube architecturesJournal of Parallel and Distributed Computing, 1987
- Hypercube Algorithms and ImplementationsSIAM Journal on Scientific and Statistical Computing, 1987
- An optimal routing algorithm for mesh-connected Parallel computersJournal of the ACM, 1980
- Sorting on a mesh-connected parallel computerCommunications of the ACM, 1977
- A large scale, homogeneous, fully distributed parallel machine, IACM SIGARCH Computer Architecture News, 1977