Automatic Search for Performance Problems in Parallel and Distributed Programs by Using Multi-experiment Analysis

Fahringer, Thomas; Seragiotto, Clovis

doi:10.1007/3-540-36265-7_15

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2552)

Included in the following conference series:

International Conference on High-Performance Computing

Abstract

We introduce Aksum, a novel system for performance analysis that helps programmers to locate and to understand performance problems in message passing, shared memory and mixed parallel programs. The user must provide the set of problem and machine sizes for which performance analysis should be conducted. The search for performance problems (properties) is user-controllable by restricting the performance analysis to specific code regions, by creating new or customizing existing property specifications and property hierarchies, by indicating the maximum search time and maximum time a single experiment may take, by providing thresholds that define whether or not a property is critical, and by indicating conditions under which the search for properties stops. Aksum automatically selects and instruments code regions for collecting raw performance data based on which performance properties are computed. Heuristics are incorporated to prune the search for performance properties. We have implemented Aksum as a portable Java-based distributed system which displays all properties detected during the search process together with the code regions that cause them. A filtering mechanism allows the examination of properties at various levels of detail. We present an experiment with a financial modeling application to demonstrate the usefulness and effectiveness of our approach.
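The search described in the abstract can be illustrated with a minimal sketch. It is not Aksum's implementation or API: every name here (`Experiment`, `Finding`, `search`, `comm_overhead`, `toy_measure`) is hypothetical, the measurement function returns synthetic values rather than real timings, and a budget on the number of experiments stands in for the maximum search time. The sketch assumes that a property yields a severity between 0 and 1 for a code region in one experiment, and that a property is critical when its severity exceeds a user-supplied threshold.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Experiment:
    """One run of the program (hypothetical type, not Aksum's)."""
    problem_size: int
    machine_size: int  # e.g. number of processors


@dataclass(frozen=True)
class Finding:
    """A critical property instance together with the region that causes it."""
    prop: str
    region: str
    experiment: Experiment
    severity: float


def search(
    problem_sizes: List[int],
    machine_sizes: List[int],
    regions: List[str],
    properties: Dict[str, Callable[[Dict[str, float]], float]],
    measure: Callable[[Experiment, str], Dict[str, float]],
    threshold: float,
    max_experiments: int,
    stop_after: Optional[int] = None,
) -> List[Finding]:
    """Evaluate every property for every selected region and experiment.

    max_experiments is an assumed, deterministic stand-in for a time budget;
    stop_after is an assumed stop condition (number of critical findings).
    """
    findings: List[Finding] = []
    experiments = [Experiment(p, m) for p, m in product(problem_sizes, machine_sizes)]
    for exp in experiments[:max_experiments]:
        for region in regions:
            data = measure(exp, region)  # raw performance data for this region
            for name, prop in properties.items():
                severity = prop(data)
                if severity > threshold:  # the property is critical
                    findings.append(Finding(name, region, exp, severity))
                    if stop_after is not None and len(findings) >= stop_after:
                        return findings
    return findings


def comm_overhead(data: Dict[str, float]) -> float:
    """Example property: fraction of region time spent communicating."""
    return data["comm"] / data["total"] if data["total"] > 0 else 0.0


def toy_measure(exp: Experiment, region: str) -> Dict[str, float]:
    """Synthetic timings for illustration only; not measured data."""
    comm = 0.5 * exp.machine_size if region == "exchange" else 0.0
    return {"comm": comm, "total": exp.problem_size / exp.machine_size + comm}


findings = search(
    problem_sizes=[100],
    machine_sizes=[2, 8],
    regions=["compute", "exchange"],
    properties={"comm_overhead": comm_overhead},
    measure=toy_measure,
    threshold=0.1,
    max_experiments=10,
)
for f in findings:
    print(f.prop, f.region, f.experiment.machine_size, round(f.severity, 3))
```

With these synthetic inputs, only the `exchange` region at machine size 8 exceeds the 0.1 threshold. This mirrors the multi-experiment idea: a property that is harmless at one machine size can become critical at another.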

This research is partially supported by the Austrian Science Fund as part of the Aurora Project under SFBF1104.

Author information

Authors and Affiliations

Institute for Software Science, University of Vienna, Vienna
Thomas Fahringer & Clovis Seragiotto Jr

Editor information

Editors and Affiliations

CISE Department, University of Florida, 32611, FL, Gainesville, USA
Sartaj Sahni
Department of Electrical Engineering, EEB 200C, University of Southern California, 3740 McClintok Ave., 90089-2562, CA, Los Angeles, USA
Viktor K. Prasanna
India Software Lab, IBM Global Services India Exports, Golden Enclave, TISL Tower, Airport Road, 560 017, Bangalore, India
Uday Shukla

About this paper

Cite this paper

Fahringer, T., Seragiotto, C. (2002). Automatic Search for Performance Problems in Parallel and Distributed Programs by Using Multi-experiment Analysis. In: Sahni, S., Prasanna, V.K., Shukla, U. (eds) High Performance Computing — HiPC 2002. HiPC 2002. Lecture Notes in Computer Science, vol 2552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36265-7_15

DOI: https://doi.org/10.1007/3-540-36265-7_15
Published: 18 December 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00303-8
Online ISBN: 978-3-540-36265-4
eBook Packages: Springer Book Archive
