Skip to main content

Quantifying Differences between OpenMP and MPI Using a Large-Scale Application Suite

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1940))

Abstract

In this paper we provide quantitative information about the performance differences between the OpenMP and the MPI version of a large-scale application benchmark suite, SPECseis. We have gathered extensive performance data using hardware counters on a 4-processor Sun Enterprise system. For the presentation of this information we use a Speedup Component Model, which is able to precisely show the impact of various overheads on the program speedup. We have found that overall, the performance figures of both program versions match closely. However, our analysis also shows interesting differences in individual program phases and in overhead categories incurred. Our work gives initial answers to a largely unanswered research question: what are the sources of inefficiencies of OpenMP programs relative to other programming paradigms on large, realistic applications. Our results indicate that the OpenMP and MPI models are basically performance-equivalent on shared-memory architectures. However, we also found interesting differences in behavioral details, such as the number of instructions executed, and the incurred memory latencies and processor stalls.

This work was supported i part by NSF grants #9703180-CCR and #9872516-EIA. This work is to necessarily representative of the positions or policies of the U.S. Government.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rudolf Eigenmann and Siamak Hassanzadeh. Benchmarking with real industrial applications: The SPEC High-Performance Group. IEEE Computational Science & Engineering, III(1):18–23, Spring 1996. 483

    Google Scholar 

  2. Seon Wook Kim and Rudolf Eigenmann. Detailed, quantitative analysis of sharedmemory parallel programs. Technical Report ECE-HPCLab-00204, HPCLAB, Purdue University, School of Electrical and Computer Engineering, 2000. 483, 486

    Google Scholar 

  3. Bill Pottenger and Rudolf Eigenmann. Targeting a Shared-Address-Space version of the seismic benchmark Seis1.1. Technical Report 1456, Univ. of Illinois at Urbana-Champaign, Cntr. for Supercomputing Res. & Dev., September 1995. 483au]4._Jay Hoeffinger, Prasad Alavilli, Thomas Jackson, and Bob Kuhn. Producing scalable performance with OpenMP: Experiments with two CFD applications. Technical report, Univ. of Illinois at Urbana-Champaign, 2000. 483

    Google Scholar 

  4. Abdul Wahed and Jerry Yan. Code generator for openmp. http://www.nas.nasa.gov/Groups/Tools/Projects/LCM/demos/openmp frame/target.html, October 1999. 483

  5. Abdul Waheed and Jerry Yan. Parallelization of nas benchmarks for shared memory multiprocessors. In Proceedings of High Performance Computing and Networking (HPCN Europe’ 98), Amsterdam, The Netherlands, apr 21–23 1998. 483

    Google Scholar 

  6. C. C. Mosher and S. Hassanzadeh. ARCO seismic processing performance evaluation suite, user’s guide. Technical report, ARCO, Plano, TX, 1993. 484

    Google Scholar 

  7. Bob Cmelik and David Keppel. Shade: A fast instruction-set simulator for execution profiling. Proceedings of the 1994 ACM SIGMETRICS Conference on the Measurement and Modeling of Computer Systems, pages 128–137, May 1994. 485

    Google Scholar 

  8. David L. Weaver and Tom Germond. The SPARC Architecture Manual, Version 9. SPARC International, Inc., PTR Prentice Hall, Englewood Cliffs, NJ 07632, 1994. 4

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Armstrong, B., Kim, S.W., Eigenmann, R. (2000). Quantifying Differences between OpenMP and MPI Using a Large-Scale Application Suite. In: Valero, M., Joe, K., Kitsuregawa, M., Tanaka, H. (eds) High Performance Computing. ISHPC 2000. Lecture Notes in Computer Science, vol 1940. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-39999-2_45

Download citation

  • DOI: https://doi.org/10.1007/3-540-39999-2_45

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41128-4

  • Online ISBN: 978-3-540-39999-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics