Scalable Parallel Trace-Based Performance Analysis

  • Conference paper
Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI 2006)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 4192)

Abstract

Automatic trace analysis is an effective method for identifying complex performance phenomena in parallel applications. However, as the size of parallel systems and the number of processors used by individual applications continue to grow, the traditional approach of analyzing a single global trace file, as done by KOJAK’s EXPERT trace analyzer, becomes increasingly constrained by the large number of events. In this article, we present a scalable version of the EXPERT analysis that analyzes separate local trace files with a parallel tool which ‘replays’ the target application’s communication behavior. We describe the new parallel analyzer architecture and discuss first empirical results.
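The replay idea in the abstract can be illustrated with a small single-process sketch. It is not the analyzer's actual implementation: the event layout, the `replay` function, and the choice of a receiver waiting on a late sender as the detected pattern are all assumptions made for illustration. Each rank walks only its own local trace. Where the original program sent a message, the replay forwards the send timestamp to the receiver, which compares it with its own receive timestamp.

```python
from collections import defaultdict, deque

# Hypothetical local traces: one event list per rank.
# Each event is (kind, peer_rank, enter_time); the layout is an assumption.
local_traces = {
    0: [("send", 1, 5.0), ("recv", 1, 7.0)],
    1: [("recv", 0, 2.0), ("send", 0, 6.0)],
}

def replay(traces):
    """Re-enact the recorded communication and sum receiver waiting time."""
    # (src, dst) -> send timestamps in message order. This stands in for the
    # real messages a parallel analyzer would exchange between its processes.
    channels = defaultdict(deque)

    # Step 1: every rank re-enacts its sends, forwarding the send timestamp.
    for rank, events in traces.items():
        for kind, peer, enter_time in events:
            if kind == "send":
                channels[(rank, peer)].append(enter_time)

    # Step 2: every rank re-enacts its receives and compares timestamps.
    waits = defaultdict(float)
    for rank, events in traces.items():
        for kind, peer, enter_time in events:
            if kind == "recv":
                send_time = channels[(peer, rank)].popleft()
                # The receiver waited only if it entered before the sender.
                waits[rank] += max(0.0, send_time - enter_time)
    return dict(waits)

print(replay(local_traces))
```

In this sketch no rank ever reads another rank's trace. Only the timestamps attached to replayed messages cross rank boundaries, which is what lets the work be split across as many analysis processes as the original application used.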

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Geimer, M., Wolf, F., Wylie, B.J.N., Mohr, B. (2006). Scalable Parallel Trace-Based Performance Analysis. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2006. Lecture Notes in Computer Science, vol 4192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846802_43

  • DOI: https://doi.org/10.1007/11846802_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-39110-4

  • Online ISBN: 978-3-540-39112-8

  • eBook Packages: Computer Science (R0)
