Abstract
This chapter reviews parallel fast Fourier transform (FFT) algorithms. The sequence of length N to be transformed is distributed across P processors, and in each stage every processor performs independent computations on its respective subsequences in the local memory. The operation count of O(N log2 N/P) is similar to that of the serial algorithm, but is inversely proportional to the number of processors. For the most part, the sequences have complex elements, but recently developed parallel algorithms for sequences of real and conjugate symmetric elements are also reviewed. A new parallel algorithm for the discrete cosine transform (DCT), or sequences of real symmetric elements, is presented. We discuss radix-2 algorithms for the most part, but extensions to nonradix-2 transforms and the Bluestein FFT for arbitrary length sequences are also covered. The communication schemes used, are transposes and i-cycles. For simplification of the implementation, the multidimensional transform is presented first as an extension of the one-dimensional transform. Intrinsically multidimensional transforms are presented as a further extension.
This research was supported by the Office of Naval Research under grant N00014-89-J 1320.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bergland, G. D., 1967. “The Fast Fourier Transform Recursion Equations for Arbitrary-Length Records,” Math. Comp. 21, pp. 236–238.
Bergland, G. D., 1968. “A Fast Fourier Transform for Real-Valued Series,” Comm. ACM 11, pp. 703–710.
Bluestein, S., 1969. “A Linear Filtering Approach to the Computation of the discrete Fourier Transform,” in Papers on Digital Signal Processing, A. V. Oppenheim, ed., MIT Press, Cambridge, pp. 171–172.
Bokhari, S. H., 1991. “Complete Exchange on the iPSC 860,” ICASE Report 91-4.
Briggs, W. L., 1987. “Further Symmetries of In-Place FFTs,” SIAM J. Sci. Stat. Comp. 8, pp. 644–654.
Cooley, J. W., Lewis, P. A. W. and Welch, P. D.1970. “The Fast Fourier Transform Algorithm: Programming Considerations in the Calculation of Sine, Cosine and Laplace Transforms,” J. Sound Vib. 12, pp. 315–337.
Cooley, J. W. and Tukey, J. W., 1965. “An Algorithm for the Machine Calculation of Complex Fourier Series,” Math. Comp. 19, pp. 297–301.
Edelman, A., 1991. “Optimal Matrix Transposition and Bit Reversal on Hypercubes: All-to-All Personalized Communication,” J. Parallel Comput. 11, pp. 328–331.
Fraser, D., 1976. “Array Permutation by Index-Digit Permutation,” J. ACM 22, pp. 298–308.
Gentleman, W. M. and Sande, G., 1966. “Fast Fourier Transforms for Fun and Profit,” Proc. AFIPS Joint Computer Conf. 29, pp. 563–578.
Jackson, E., She, Z. and Orszag, S. A., 1991. “A Case Study in Parallel Computing: I. Homogeneous, Isotropic Turbulence on a Hypercube,” J. Sci. Comp. 6, pp. 27–45.
Johnsson, S. L. and Ho, C.-T., 1988, “Algorithms for Matrix Transposition on Boolean N-Cube Configured Ensemble Architectures,” SIAM J. Matrix Anal. Appl. 93, pp. 419–454.
Johnsson, S. L. and Ho, C.-T., 1991. “Optimal All-to-All Personalized Communication with Minimum Span on Boolean Cubes,” Harvard University TR-18-91.
Pelz, R. B., 1988. “Large-Scale Spectral Simulations of the Navier-Stokes Equations on a Hypercube Computer,” AIAA Paper 88-3642.
Pelz, R. B., 1989a. “Hypercube Algorithm for Turbulence Simulation,” in 11th Int. Conf. on Numerical Methods in Fluid Mechanics, D. L. Dwoyer, M. Y. Hussaini, R. G. Voigt, eds., Lecture Notes in Physics 323, Springer, Berlin, pp. 462–468.
Pelz, R. B., 1989b. “The parallel Chebyshev pseudospectral method: cosine transform and derivative recursion,” Proc. Fourth Conf. on Hypercubes, Concurrent Computers, and Applications, pp. 433–439.
Pelz, R. B., 1991a. “The parallel Fourier pseudospectral method,” J. Comp. Phys., 92, pp. 296–312.
Pelz, R. B., 1991b. “Parallel Fourier spectra methods on ensemble architectures,” Comp. Meths. Appl. Mech. Eng. 89, pp. 529–542.
Pelz, R. B., 1993. “Parallel compact FFTs for real sequences,” SIAM J. Sci. Comp. 14, pp. 914–935.
Pelz, R.B., 1995. “A parallel, compact FFT for real symmetric sequences,” Proc. Seventh SIAM Conf. on Parallel Processing for Scientific Computing, D.H. Bailey, et al., eds., pp. 27–32, SIAM, Philadelphia.
Swarztrauber, P. N., 1986. “Symmetric FFTs,” Math. Comp. 47, pp. 323–346.
Swarztrauber, P. N., 1987. “Multiprocessor FFTs,” Parallel Compitting 5, pp. 197–210.
Swarztrauber, P. N., Sweet, R. A., Briggs, W. L., Henson, V. E., Otto, J., 1991. “A Parallel Algorithm for Bluestein’s FFT,” Parallel Computing 17, pp. 607–617.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Pelz, R.B. (1997). Parallel FFTs. In: Keyes, D.E., Sameh, A., Venkatakrishnan, V. (eds) Parallel Numerical Algorithms. ICASE/LaRC Interdisciplinary Series in Science and Engineering, vol 4. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-5412-3_9
Download citation
DOI: https://doi.org/10.1007/978-94-011-5412-3_9
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-6277-0
Online ISBN: 978-94-011-5412-3
eBook Packages: Springer Book Archive