Two approaches to the concurrent implementation of the prime factor algorithm on a hypercube
- 27 October 1991
- journal article
- research article
- Published by Wiley in Concurrency: Practice and Experience
- Vol. 3 (5) , 483-495
- https://doi.org/10.1002/cpe.4330030503
Abstract
On sequential computers, the prime factor algorithm (PFA) allows the Computation of the discrete Fourier transform (DFT) with a higher efficiency than the traditional Cooley‐Tukey FFT algorithm (CTA). However, the PFA requires substantial data movement, which poses a challenging problem for distributed‐memory multi‐processor systems. In this paper, two approaches for a concurrent implementation of the PFA on these structures are presented. In the first approach, the concurrent PFA runs on all nodes of the multi‐processor system, which is inefficient on large configurations due to the large communication overhead. A second approach developed to reduce this bottleneck is also presented. These solutions have been benchmarked on Caltech hypercubes, and the performances achieved are reported. In both approaches, thecrystal_routeralgorithm was exploited as a concurrent technique for communicating data among nodes.Keywords
This publication has 6 references indexed in Scilit:
- A new method for computing DFTPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A concurrent implementation of the prime factor algorithm on hypercubeIEEE Transactions on Signal Processing, 1991
- Discrete Fourier transforms when the number of data samples is primeProceedings of the IEEE, 1968
- An algorithm for the machine calculation of complex Fourier seriesMathematics of Computation, 1965
- The Interaction Algorithm and Practical Fourier Analysis: An AddendumJournal of the Royal Statistical Society Series B: Statistical Methodology, 1960
- The Interaction Algorithm and Practical Fourier AnalysisJournal of the Royal Statistical Society Series B: Statistical Methodology, 1958