Design and Performance of the OP2 Library for Unstructured Mesh Applications

Bertolli, Carlo; Betts, Adam; Mudalige, Gihan; Giles, Mike; Kelly, Paul

doi:10.1007/978-3-642-29737-3_22

Carlo Bertolli³⁰,
Adam Betts³⁰,
Gihan Mudalige³¹,
Mike Giles³¹ &
…
Paul Kelly³⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7155))

Included in the following conference series:

European Conference on Parallel Processing

1472 Accesses
8 Citations

Abstract

OP2 is an “active” library framework for the solution of unstructured mesh applications. It aims to decouple the scientific specification of an application from its parallel implementation to achieve code longevity and near-optimal performance by re-targeting the back-end to different multi-core/many-core hardware. This paper presents the design of the OP2 code generation and compiler framework which, given an application written using the OP2 API, generates efficient code for state-of-the-art hardware (e.g. GPUs and multi-core CPUs). Through a representative unstructured mesh application we demonstrate the capabilities of the compiler framework to utilize the same OP2 hardware specific run-time support functionalities. Performance results show that the impact due to this sharing of basic functionalities is negligible.

This research is partly funded by EPSRC (grant reference numbers EP/I00677X/1, EP/I006079/1), the UK Technology Strategy Board, and Rolls- Royce plc through the SILOET programme.

Download to read the full chapter text

Chapter PDF

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs

Under the Hood of SYCL – An Initial Performance Analysis with An Unstructured-Mesh CFD Application

Source-to-Source Parallelization Compilers for Scientific Shared-Memory Multi-core and Accelerated Multiprocessing: Analysis, Pitfalls, Enhancement and Potential

Article 08 August 2019

Re’em Harel, Idan Mosseri, … Gal Oren

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Pgi cuda fortran (2011), http://www.pgroup.com/resources/cudafortran.htm/
The amd fusion family of apus (2011), http://sites.amd.com/us/fusion/apu/Pages/fusion.aspx
Burgess, D.A., Crumpton, P.I., Giles, M.B.: A parallel framework for unstructured grid solvers. In: Proc. of the 2nd European Computational Fluid Dynamics Conf., pp. 391–396. John Wiley and Sons, Germany (1994)
Google Scholar
Chafi, H., DeVito, Z., Moors, A., Rompf, T., Sujeeth, A.K., Hanrahan, P., Odersky, M., Olukotun, K.: Language virtualization for heterogeneous parallel computing. In: Proc. of the OOPSLA 2010 Applications, pp. 835–847. ACM, USA (2010)
Google Scholar
DeVito, Z., Joubert, N., Medina, M., Barrientos, M., Oakley, S., Alonso, J., Darve, E., Ham, F., Hanrahan, P.: Liszt: Programming mesh based pdes on heterogeneous parallel platforms (October 2010), http://psaap.stanford.edu
Giles, M.B.: Hydra (1998-2002), http://people.maths.ox.ac.uk/gilesm/hydra.html
Giles, M.B., Ghate, D., Duta, M.C.: Using automatic differentiation for adjoint CFD code development. Computational Fluid Dynamics J. 16(4), 434–443 (2008)
Google Scholar
Giles, M.B., Mudalige, G.R., Sharif, Z., Markall, G., Kelly, P.H.J.: Performance analysis of the op2 framework on many-core architectures. SIGMETRICS Perform. Eval. Rev. 38, 9–15 (2011)
Article Google Scholar
Howes, L.W., Lokhmotov, A., Donaldson, A.F., Kelly, P.H.J.: Deriving Efficient Data Movement from Decoupled Access/Execute Specifications. In: Seznec, A., Emer, J., O’Boyle, M., Martonosi, M., Ungerer, T. (eds.) HiPEAC 2009. LNCS, vol. 5409, pp. 168–182. Springer, Heidelberg (2009)
Chapter Google Scholar
Schordan, M., Quinlan, D.: A Source-To-Source Architecture for User-Defined Optimizations. In: Böszörményi, L., Schojer, P. (eds.) JMLC 2003. LNCS, vol. 2789, pp. 214–223. Springer, Heidelberg (2003)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, Imperial College, London, UK
Carlo Bertolli, Adam Betts & Paul Kelly
Oxford e-Research Centre, University of Oxford, UK
Gihan Mudalige & Mike Giles

Authors

Carlo Bertolli
View author publications
You can also search for this author in PubMed Google Scholar
Adam Betts
View author publications
You can also search for this author in PubMed Google Scholar
Gihan Mudalige
View author publications
You can also search for this author in PubMed Google Scholar
Mike Giles
View author publications
You can also search for this author in PubMed Google Scholar
Paul Kelly
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Scilytics, Koellnerhofgasse 3/15A, 1010, Vienna, Austria
Michael Alexander
ICAR-CNR, Via P. Castellino, 111, 80131, Napoli, Italy
Pasqua D’Ambra
University of Amsterdam, 1090, Amsterdam, Netherlands
Adam Belloum
Innovative Computing Laboratory, The University of Tennessee, USA
George Bosilca
Department of Experimental Medicine and Clinic, University Magna Græcia, 88100, Catanzaro, Italy
Mario Cannataro
Computer Science Department, University of Pisa, Italy
Marco Danelutto
Second University of Naples, Italy
Beniamino Di Martino
TU München, Boltzmannstr. 3, 85748, Garching, Germany
Michael Gerndt
Equipe Runtime, INRIA Bordeaux Sud-Ouest, 33405, Talence Cedex, France
Emmanuel Jeannot & Raymond Namyst &
Equipe HIEPACS, INRIA Bordeaux Sud-Ouest, 33405, Talence Cedex, France
Jean Roman
Oak Ridge National Laboratory, Computer Science and Mathematics Division, 37831-6164, Oak Ridge, TN, USA
Stephen L. Scott
Department of Scientific Computing, University of Vienna, Nordbergstr. 15/3C, 1090, Vienna, Austrial
Jesper Larsson Traff
Computer Science and Mathematics Division, Oak Ridge National Laboratory, 37831, Oak Ridge, TN, USA
Geoffroy Vallée
Technische Universität München, Germany
Josef Weidendorfer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bertolli, C., Betts, A., Mudalige, G., Giles, M., Kelly, P. (2012). Design and Performance of the OP2 Library for Unstructured Mesh Applications. In: Alexander, M., et al. Euro-Par 2011: Parallel Processing Workshops. Euro-Par 2011. Lecture Notes in Computer Science, vol 7155. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29737-3_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-29737-3_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29736-6
Online ISBN: 978-3-642-29737-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Design and Performance of the OP2 Library for Unstructured Mesh Applications

Abstract

Chapter PDF

Similar content being viewed by others

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs

Under the Hood of SYCL – An Initial Performance Analysis with An Unstructured-Mesh CFD Application

Source-to-Source Parallelization Compilers for Scientific Shared-Memory Multi-core and Accelerated Multiprocessing: Analysis, Pitfalls, Enhancement and Potential

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Design and Performance of the OP2 Library for Unstructured Mesh Applications

Abstract

Chapter PDF

Similar content being viewed by others

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs

Under the Hood of SYCL – An Initial Performance Analysis with An Unstructured-Mesh CFD Application

Source-to-Source Parallelization Compilers for Scientific Shared-Memory Multi-core and Accelerated Multiprocessing: Analysis, Pitfalls, Enhancement and Potential

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation