Tuning HipGISAXS on Multi and Many Core Supercomputers

Sarje, Abhinav; Li, Xiaoye S.; Hexemer, Alexander

doi:10.1007/978-3-319-10214-6_11

Abhinav Sarje¹⁶,
Xiaoye S. Li¹⁶ &
Alexander Hexemer¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8551))

Included in the following conference series:

International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems

810 Accesses
2 Citations

Abstract

With the continual development of multi and many-core architectures, there is a constant need for architecture-specific tuning of application-codes in order to realize high computational performance and energy efficiency, closer to the theoretical peaks of these architectures. In this paper, we present optimization and tuning of HipGISAXS, a parallel X-ray scattering simulation code [9], on various massively-parallel state-of-the-art supercomputers based on multi and many-core processors. In particular, we target clusters of general-purpose multi-cores such as Intel Sandy Bridge and AMD Magny Cours, and many-core accelerators like Nvidia Kepler GPUs and Intel Xeon Phi coprocessors. We present both high-level algorithmic and low-level architecture-aware optimization and tuning methodologies on these platforms. We cover a detailed performance study of our codes on single and multiple nodes of several current top-ranking supercomputers. Additionally, we implement autotuning of many of the algorithmic and optimization parameters for dynamic selection of their optimal values to ensure high-performance and high-efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Tesla Kepler GPU Accelerators. Datasheet (2012)
Google Scholar
Intel Xeon Phi Coprocessor. Developer’s Quick Start Guide. Version 1.5. White Paper (2013)
Google Scholar
Performance Application Programming Interface (PAPI) (2013), http://icl.cs.utk.edu/papi
Top500 Supercomputers (June 2013), http://www.top500.org
Chourou, S., Sarje, A., Li, X., Chan, E., Hexemer, A.: HipGISAXS: A High Performance Computing Code for Simulating Grazing Incidence X-Ray Scattering Data. Submitted to the Journal of Applied Crystallography (2013)
Google Scholar
Intel Corp.: Intel Xeon Phi Coprocessor Instruction Set Architecture Reference Manual (September 2012)
Google Scholar
Kim, C., Satish, N., Chhugani, J., et al.: Closing the Ninja Performance Gap through Traditional Programming and Compiler Technology. Tech. Rep. (2011)
Google Scholar
Pommier, J.: SIMD implementation of sin, cos, exp and log. Tech. Rep. (2007), http://gruntthepeon.free.fr/ssemath
Sarje, A., Li, X., Chourou, S., Chan, E., Hexemer, A.: Massively Parallel X-ray Scattering Simulations. In: Supercomputing (SC 2012) (2012)
Google Scholar
Satish, N., Kim, C., Chhugani, J., et al.: Can traditional programming bridge the Ninja performance gap for parallel computing applications? SIGARCH Computer Architecture News 40(3), 440–451 (2012). http://doi.acm.org/10.1145/2366231.2337210
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Abhinav Sarje & Xiaoye S. Li
Advanced Light Source, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Alexander Hexemer

Authors

Abhinav Sarje
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoye S. Li
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Hexemer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abhinav Sarje .

Editor information

Editors and Affiliations

University of Warwick Coventry, West Midlands, United Kingdom
Stephen A. Jarvis
University of Warwick Coventry, West Midlands, United Kingdom
Steven A. Wright
Sandia National Laboratories CSRI, Albuquerque, New Mexico, USA
Simon D. Hammond

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sarje, A., Li, X.S., Hexemer, A. (2014). Tuning HipGISAXS on Multi and Many Core Supercomputers. In: Jarvis, S., Wright, S., Hammond, S. (eds) High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation. PMBS 2013. Lecture Notes in Computer Science(), vol 8551. Springer, Cham. https://doi.org/10.1007/978-3-319-10214-6_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-10214-6_11
Published: 01 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10213-9
Online ISBN: 978-3-319-10214-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics