Two approaches to the concurrent implementation of the prime factor algorithm on a hypercube

27 October 1991

journal article
research article
Published by Wiley in Concurrency: Practice and Experience

Vol. 3 (5) , 483-495
https://doi.org/10.1002/cpe.4330030503

Abstract

On sequential computers, the prime factor algorithm (PFA) allows the Computation of the discrete Fourier transform (DFT) with a higher efficiency than the traditional Cooley‐Tukey FFT algorithm (CTA). However, the PFA requires substantial data movement, which poses a challenging problem for distributed‐memory multi‐processor systems. In this paper, two approaches for a concurrent implementation of the PFA on these structures are presented. In the first approach, the concurrent PFA runs on all nodes of the multi‐processor system, which is inefficient on large configurations due to the large communication overhead. A second approach developed to reduce this bottleneck is also presented. These solutions have been benchmarked on Caltech hypercubes, and the performances achieved are reported. In both approaches, thecrystal_routeralgorithm was exploited as a concurrent technique for communicating data among nodes.

Keywords

This publication has 6 references indexed in Scilit:

A new method for computing DFT
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
A concurrent implementation of the prime factor algorithm on hypercube
IEEE Transactions on Signal Processing, 1991
Discrete Fourier transforms when the number of data samples is prime
Proceedings of the IEEE, 1968
An algorithm for the machine calculation of complex Fourier series
Mathematics of Computation, 1965
The Interaction Algorithm and Practical Fourier Analysis: An Addendum
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1960
The Interaction Algorithm and Practical Fourier Analysis
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1958