Abstract
The xns computational fluid dynamics code was successfully running on Blue Gene/L, however, its scalability was unsatisfactory until the first Jülich Blue Gene/L Scaling Workshop provided an opportunity for the application developers and performance analysts to start working together. Investigation of solver performance pin-pointed a communication bottleneck that appeared with approximately 900 processes, and subsequent remediation allowed the application to continue scaling with a four-fold simulation performance improvement at 4,096 processes. This experience also validated the scalasca performance analysis toolset, when working with a complex application at large scale, and helped direct the development of more comprehensive analyses. Performance properties have now been incorporated to automatically quantify point-to-point synchronisation time and wait states in scan operations, both of which were significant for xns on Blue Gene/L.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Behr, M., Arora, D., Coronado, O., Pasquali, M.: Models and finite element techniques for blood flow simulation. Int’l J. Computational Fluid Dynamics 20, 175–181 (2006)
Frings, W., Hermanns, M.-A., Mohr, B., Orth, B. (eds): Jülich Blue Gene/L Scaling Workshop (December 2006) Forschungszentrum Jülich ZAM-IB-2007-01, http://www.fz-juelich.de/zam/bgl-sws06/
IBM Advanced Computing Technology Center: High Performance Computing Toolkit, http://www.research.ibm.com/actc/
Vetter, J., Chambreau, C.: mpiP — lightweight, scalable MPI profiling (2005), http://www.llnl.gov/CASC/mpip/
Geimer, M., Wolf, F., Wylie, B.J.N., Mohr, B.: Scalable parallel trace-based performance analysis. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds.) Recent Advances in Parallel Virtual Machine and Message Passing Interface. LNCS, vol. 4192, pp. 303–312. Springer, Heidelberg (2006)
Jones, T., et al.: MPI \(\textsc{PERUSE}\): A performance revealing extensions interface to MPI. http://www.mpi-peruse.org/
Forschungszentrum Jülich GmbH: SCALASCA: Scalable performance analysis of large-scale parallel applications. http://www.scalasca.org/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wylie, B.J.N., Geimer, M., Nicolai, M., Probst, M. (2007). Performance Analysis and Tuning of the XNS CFD Solver on Blue Gene/L. In: Cappello, F., Herault, T., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2007. Lecture Notes in Computer Science, vol 4757. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75416-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-75416-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75415-2
Online ISBN: 978-3-540-75416-9
eBook Packages: Computer ScienceComputer Science (R0)