Skip to main content

Performance Modeling of Deterministic Transport Computations

  • Chapter
Performance Analysis and Grid Computing

Abstract

In this work we present a performance model that encompasses the key characteristics of a Sn transport application using unstructured meshes. Sn transport is an important part of the ASCI workload. This builds on previous analysis which has been done for the case of structured meshes. The performance modeling of an unstructured grid application presents a number of complexities and subtleties that do not arise for structured grids. The resulting analytical model is parametric using basic system performance characteristics (latency, bandwidth, MFLOPS rate etc), and application characteristics (mesh size etc). It is validated on a large HP AlphaServer system showing high accuracy. The model compares favorably to a trace based modeling approach which is specific to a single mesh/processor mapping situation. The model is used to give insight into the achievable performance on possible future processing systems containing thousands of processors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. W.D. Gropp, D.K. Kaushik, D.E. Keyes and B.F.Smith. Performance Modeling and Tuning of an Unstructures Mesh CFD Application. In Proc. SC2000, Dallas, 2000.

    Google Scholar 

  2. S. Girona, J. Labarta and R.M. Badia. Validation of Dimemas communication model for MPI collective operations. In Proc. EuroPVM/MPI’2000, LNCS, 1908:39–46, Springer-Verlag, 2000.

    Google Scholar 

  3. A. Hoisie, O. Lubeck and H. Wasserman. Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications. Int. J. of High Performance Computing Applications, 14:330–346, 2000.

    Article  Google Scholar 

  4. A. Hoisie, O. Lubeck, H.J. Wasserman, F. Petrini, H.J. Alme. A General Predictive Performance Model for Wavefront Algorithms on Clusters of SMPs. In Proc. ICPP 2000, August 20–25, 2000. Toronto, Canada.

    Google Scholar 

  5. G. Karypis, V. Kumar. METIS 4.0: Unstructured Graph Partitioning and Sparse Matrix Ordering System. Technical Report, Department of Computer Science, University of Minnesota, 1998.

    Google Scholar 

  6. D.J. Kerbyson, H.J. Alme, A. Hoisie, F. Petrini, H.J. Wasserman, M.L. Gittings. Predictive Performance and Scalability Modeling of a Large-scale Application. In Proc SC2001, Denver, November 2001.

    Google Scholar 

  7. D.J. Kerbyson, E. Papaefstathiou, J.S. Harper, S.C. Perry, G.R. Nudd. Is Predictive Tracing Too late for HPC Users? In R.J. Allan, A. Simpson, and D.A. Nicole (Eds), High Performance Computing, Plenum Press, pages 57–67, March 1999.

    Chapter  Google Scholar 

  8. K.R. Koch, R.S. Baker, R.E. Alcouffe. A Parallel Algorithm for 3D Sn Transport Sweeps. LA-CP-92–406, Los Alamos National Laboratory, 1992.

    Google Scholar 

  9. M.M. Mathis, D.J. Kerbyson, A. Hoisie. A Performance Model of Non-deterministic Particle Transport on Large-Scale Systems, In Proc. Computational Science — ICCS 2003, LNCS, 2659:905–915, Springer-Verlag, 2003.

    Chapter  Google Scholar 

  10. G.R. Nudd, D.J. Kerbyson et.al. PACE: A Toolset for the Performance Prediction of Parallel and Distributed Systems. Int. J. of High Performance Computing Applications, 14:228–251,2000.

    Article  Google Scholar 

  11. E. Papaefstathiou, D.J., Kerbyson. Predicting Communication Delays of Detailed Application Workloads. In Proc of 13th ISCA Int. Conf on Parallel and Distributed Computing Systems (PDCS), Las Vegas, August 2000.

    Google Scholar 

  12. S.D. Pautz. An Algorithm for Parallel Sn Sweeps on Unstructures Meshes. J. Nuclear Science and Engineering, Vol. 140, pages 111–136, 2002.

    Google Scholar 

  13. F. Petrini, W.C. Feng, A. Hoisie, S. Coll, E. Frachtenberg. The Quadrics Network: High-Performance Clustering Technology. IEEE Micro, 22(l):46–57, 2002

    Article  Google Scholar 

  14. S. Plimpton, B. Hendrickson, S. Burns, W. McLendon. Parallel Algorithms for Radiation Transport on Unstructured Grids. In Proc. SC2000, Dallas, November 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer Science+Business Media New York

About this chapter

Cite this chapter

Kerbyson, D.J., Hoisie, A., Pautz, S.D. (2004). Performance Modeling of Deterministic Transport Computations. In: Getov, V., Gerndt, M., Hoisie, A., Malony, A., Miller, B. (eds) Performance Analysis and Grid Computing. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0361-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-0361-3_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-5038-5

  • Online ISBN: 978-1-4615-0361-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics