The Parallel Multipole Method on the Connection Machine
- 1 November 1991
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Scientific and Statistical Computing
- Vol. 12 (6) , 1420-1437
- https://doi.org/10.1137/0912077
Abstract
This paper reports on a fast implementation of the three-dimensional nonadaptive Parallel Multipole Method (PMM) on the Connection Machine system model CM–2. The data interactions within the decomposition tree are modeled by a hierarchy of three-dimensional grids forming a pyramid in which parent nodes have degree eight. The base of the pyramid is embedded in the Connection Machine as a three-dimensional grid. The standard grid embedding feature is used. For 10 or more particles per processor the communication time is insignificant. The evaluation of the potential field for a system with 128k particles takes 5 seconds, and a system of a million particles about 3 minutes. The maximum number of particles that can be represented in 2G bytes of primary storage is $ \sim 50$ million. The execution rate of this implementation of the PMM is at about 1.7 Gflops/sec for a particle-processor-ratio of 10 or greater. A further speed improvement is possible by an improved use of the memory hierarchy associated with each floating-point unit in the system.
Keywords
This publication has 8 references indexed in Scilit:
- Optimizing Tridiagonal Solvers for Alternating Direction Methods on Boolean Cube MultiprocessorsSIAM Journal on Scientific and Statistical Computing, 1990
- Embedding meshes in Boolean cubes by graph decompositionJournal of Parallel and Distributed Computing, 1990
- Computational Structure of the N-Body ProblemSIAM Journal on Scientific and Statistical Computing, 1989
- A Fast Adaptive Multipole Algorithm for Particle SimulationsSIAM Journal on Scientific and Statistical Computing, 1988
- A fast algorithm for particle simulationsJournal of Computational Physics, 1987
- Communication efficient basic linear algebra computations on hypercube architecturesJournal of Parallel and Distributed Computing, 1987
- A hierarchical O(N log N) force-calculation algorithmNature, 1986
- $B$-valuations of graphsCzechoslovak Mathematical Journal, 1972