On Using Incremental Profiling for the Performance Analysis of Shared Memory Parallel Applications
Profiling is often the method of choice for performance analysis of parallel applications due to its low overhead and easily comprehensible results. However, a disadvantage of profiling is the loss of temporal information that makes it impossible to causally relate performance phenomena to events that happened prior or later during execution. We investigate techniques to add temporal dimension to profiling data by incrementally capturing profiles during the runtime of the application and discuss the insights that can be gained from this type of performance data. The context in which we explore these ideas is an existing profiling tool for OpenMP applications.
KeywordsCritical Section Parallel Application Benchmark Suite Parallel Region Performance Counter
Unable to display preview. Download preview PDF.
- 1.Fuerlinger, K., Gerndt, M.: ompP: A profiling tool for OpenMP. In: Proceedings of the First International Workshop on OpenMP (IWOMP 2005), Eugene, Oregon, USA (May 2005) (Accepted for publication)Google Scholar
- 3.Mohr, B., Malony, A.D., Shende, S.S., Wolf, F.: Towards a performance tool interface for OpenMP: An approach based on directive rewriting. In: Proceedings of the Third Workshop on OpenMP (EWOMP 2001) (September 2001)Google Scholar
- 4.Itzkowitz, M., Mazurov, O., Copty, N., Lin, Y.: An OpenMP runtime API for profiling Accepted by the OpenMP ARB as an official ARB White Paper, available online at http://www.compunity.org/futures/omp-api.html
- 6.Gerndt, M., Fürlinger, K.: Specification and detection of performance problems with ASL. Concurrency and Computation: Practice and Experience, 2006 (to appear)Google Scholar
- 8.Shende, S.S., Malony, A.D.: The TAU parallel performance system. International Journal of High Performance Computing Applications (ACTS Collection Special Issue) (2005)Google Scholar
- 9.Malony, A.D., Shende, S.S., Bell, R., Li, K., Li, L., Trebon, N.: Advances in the TAU performance analysis system. In: Getov, V., Gerndt, M., Hoisie, A., Malony, A., Miller, B. (eds.) Performance Analysis and Grid Computing, pp. 129–144. Kluwer, Dordrecht (2003)Google Scholar