Abstract
In this paper, we identify the scalability bottlenecks of an unstructured grid CFD code (PETSc-FUN3D) by studying the impact of several algorithmic and architectural parameters and by examining different programming models. We discuss the basic performance characteristics of this PDE code with the help of simple performance models developed in our earlier work, presenting primarily experimental results. In addition to achieving good per-processor performance (which has been addressed in our cited work and without which scalability claims are suspect), we strive to improve the implementation and convergence scalability of PETSc-FUN3D on thousands of processors.
Keywords
- Execution Time
- Message Passing Interface
- Memory Bandwidth
- Lawrence Livermore National Laboratory
- Message Passing Interface Process
References
W. K. Anderson and D. L. Bonhaus. An implicit upwind algorithm for computing turbulent flows on unstructured grids. Computers and Fluids, 23:1–21, 1994.
W. K. Anderson, W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Achieving high sustained performance in an unstructured mesh CFD application. In Proceedings of SC’99. IEEE Computer Society, 1999. Gordon Bell Prize Award Paper in Special Category.
W. K. Anderson, R. D. Rausch, and D. L. Bonhaus. Implicit/multigrid algorithms for incompressible turbulent flows on unstructured grids. J. Computational Physics, 128:391–408, 1996.
S. Balay, W. D. Gropp, L. C. McInnes, and B. F. Smith. The Portable Extensible Toolkit for Scientific Computing (PETSc) version 28. http://www.mcs.anl.gov/petsc/petsc.html, 2000.
S. W. Bova, C. P. Breshears, C. E. Cuicchi, Z. Demirbilek, and H. A. Gabb. Dual-level parallel analysis of harbor wave response using MPI and OpenMP. Int. J. High Performance Computing Applications, 14:49–64, 2000.
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Toward realistic performance bounds for implicit CFD codes. In D. Keyes, A. Ecer, J. Periaux, N. Satofuka, and P. Fox, editors, Proceedings of Parallel CFD’99, pages 233–240. Elsevier, 1999.
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Performance modeling and tuning of an unstructured mesh CFD application. In Proceedings of SC2000. IEEE Computer Society, 2000.
W. D. Gropp, L. C. McInnes, M. D. Tidriri, and D. E. Keyes. Globalized Newton-Krylov-Schwarz algorithms and software for parallel implicit CFD. Int. J. High Performance Computing Applications, 14:102–136, 2000.
William Gropp, Ewing Lusk, and Anthony Skjellum. Using MPI: Portable Parallel Programming with the Message Passing Interface, 2nd edition. MIT Press, Cambridge, MA, 1999.
William D. Gropp and Ewing Lusk. Reproducible measurements of MPI performance characteristics. In Jack Dongarra, Emilio Luque, and Tomàs Margalef, editors, Recent Advances in Parallel Virtual Machine and Message Passing Interface, volume 1697 of Lecture Notes in Computer Science, pages 11–18. Springer Verlag, 1999. 6th European PVM/MPI Users’ Group Meeting, Barcelona, Spain, September 1999.
P. D. Hough, T. G. Kolda, and V. J. Torczon. Asynchronous parallel pattern search for nonlinear optimization. Technical Report SAND2000-8213, Sandia National Laboratories, Livermore, January 2000. Submitted to SIAM J. Scientific Computation.
G. Karypis and V. Kumar. A fast and high quality scheme for partitioning irregular graphs. SIAM J. Scientific Computing, 20:359–392, 1999.
D. J. Mavriplis. Parallel unstructured mesh analysis of high-lift configurations. Technical Report 2000-0923, AIAA, 2000.
J. D. McCalpin. STREAM: Sustainable memory bandwidth in high performance computers. Technical report, University of Virginia, 1995. http://www.cs.virginia.edu/stream.
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gropp, W.D., Kaushik, D.K., Keyes, D.E., Smith, B.F. (2000). Analyzing the Parallel Scalability of an Implicit Unstructured Mesh CFD Code. In: Valero, M., Prasanna, V.K., Vajapeyam, S. (eds) High Performance Computing — HiPC 2000. HiPC 2000. Lecture Notes in Computer Science, vol 1970. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44467-X_36
DOI: https://doi.org/10.1007/3-540-44467-X_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41429-2
Online ISBN: 978-3-540-44467-1
eBook Packages: Springer Book Archive