A very high performance algorithm for NAS EP Benchmark

  • Performance Evaluation and Benchmarking
  • Conference paper
High-Performance Computing and Networking (HPCN-Europe 1994)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 797)

Abstract

The NAS (Numerical Aerodynamic Simulation) Parallel Benchmarks have been developed at NASA Ames Research Center to study the performance of parallel supercomputers. Major algorithmic improvements to the Embarrassingly Parallel (EP) Benchmark are described. Using IBM RS/6000 workstations and IBM SP-1 scalable parallel machines as examples, we also describe tuning techniques to obtain very high performance on this benchmark. Compared to the generic EP code, various algorithmic and tuning techniques have resulted in a performance improvement by nearly a factor of 18. The techniques described are generally applicable to many numerical algorithms on most RISC machines.
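For readers unfamiliar with the benchmark, the following is a minimal sketch of the generic EP kernel as it is defined in the public NAS specification: a linear congruential generator (multiplier 5^13, modulus 2^46, default seed 271828183) produces uniform pairs, the Marsaglia polar method turns accepted pairs into Gaussian deviates, and the deviates are tallied in ten square annuli. It is not the improved algorithm this paper describes, whose details are not given in the abstract. The function name `ep_kernel` and its parameters are illustrative assumptions, and the bounds check on the annulus index is a defensive addition.

```python
import math

A = 5 ** 13          # LCG multiplier from the NAS EP specification
MOD = 1 << 46        # LCG modulus, 2^46
SEED = 271828183     # default seed from the NAS EP specification


def ep_kernel(n_pairs, seed=SEED):
    """Generic EP kernel sketch (hypothetical helper, not the paper's code).

    Returns (counts, sum_x, sum_y): the number of Gaussian pairs falling in
    each square annulus l <= max(|X|, |Y|) < l + 1 for l = 0..9, and the
    sums of the X and Y deviates.
    """
    x = seed
    counts = [0] * 10
    sum_x = 0.0
    sum_y = 0.0
    for _ in range(n_pairs):
        # Two LCG steps give one uniform pair mapped into (-1, 1).
        x = (A * x) % MOD
        u = 2.0 * (x / MOD) - 1.0
        x = (A * x) % MOD
        v = 2.0 * (x / MOD) - 1.0
        t = u * u + v * v
        # Polar-method acceptance test: keep points inside the unit circle.
        if 0.0 < t <= 1.0:
            f = math.sqrt(-2.0 * math.log(t) / t)
            gx, gy = u * f, v * f
            l = int(max(abs(gx), abs(gy)))
            if l < 10:  # defensive guard, an addition in this sketch
                counts[l] += 1
            sum_x += gx
            sum_y += gy
    return counts, sum_x, sum_y


if __name__ == "__main__":
    counts, sx, sy = ep_kernel(20000)
    print(counts, sx, sy)
```

Each pair is independent of every other, so the loop can be split across processors with no communication beyond a final reduction of the counts and sums, which is why the benchmark is called embarrassingly parallel. The logarithm and square root in the inner loop dominate the cost of this generic form.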

Editor information

Wolfgang Gentzsch, Uwe Harms

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Agarwal, R.C., Gustavson, F.G., Zubair, M. (1994). A very high performance algorithm for NAS EP Benchmark. In: Gentzsch, W., Harms, U. (eds) High-Performance Computing and Networking. HPCN-Europe 1994. Lecture Notes in Computer Science, vol 797. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57981-8_110

  • DOI: https://doi.org/10.1007/3-540-57981-8_110

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-57981-6

  • Online ISBN: 978-3-540-48408-0

  • eBook Packages: Springer Book Archive
