First Experiences with ab initio Molecular Dynamics on OpenPOWER: The Case of CPMD
In this article, we present the algorithmic adaptation and code re-engineering required to port highly successful and popular planewave codes to next-generation heterogeneous OpenPOWER architectures, which foster acceleration and provide high-bandwidth links to GPUs. We focus on CPMD as the most representative software for ab initio molecular dynamics simulations. We have ported three steps to the GPU: the construction of the electronic density, the application of the potential to the wavefunctions, and the orthogonalization procedure. The resulting GPU kernels consist mainly of fast Fourier transforms (FFT) and basic linear algebra operations (BLAS). We discuss the performance of the new implementation on Firestone (POWER8/Tesla) and show that communication between the host and the GPU accounts for a large fraction of the total run time. We expect this communication bottleneck to be strongly attenuated once the NVLink high-speed interconnect becomes available.
Keywords: CPMD, POWER8, CUDA, NVLink, FFT, Gram–Schmidt
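To make the orthogonalization step named in the abstract and keywords concrete, the following is a minimal CPU-side sketch of modified Gram–Schmidt on complex vectors, written in plain Python. It is an illustration only and not CPMD's implementation: the function names `inner`, `gram_schmidt`, and `overlap_matrix` are hypothetical, and the small vectors stand in for wavefunction coefficient arrays. In a GPU port, the inner products and vector updates would presumably map onto the BLAS-type operations the abstract mentions.

```python
import math


def inner(u, v):
    """Hermitian inner product <u|v> = sum(conj(u_i) * v_i)."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def gram_schmidt(vectors, tol=1e-12):
    """Orthonormalize a list of complex vectors (modified Gram-Schmidt).

    Each vector is projected against the already accepted basis vectors
    one at a time, then normalized. Raises ValueError if a vector is
    (numerically) linearly dependent on the previous ones.
    """
    basis = []
    for v in vectors:
        w = [complex(x) for x in v]
        for q in basis:
            c = inner(q, w)  # overlap with an accepted basis vector
            w = [wi - c * qi for wi, qi in zip(w, q)]
        norm = math.sqrt(inner(w, w).real)
        if norm < tol:
            raise ValueError("vectors are linearly dependent")
        basis.append([wi / norm for wi in w])
    return basis


def overlap_matrix(basis):
    """Return S with S[i][j] = <basis_i|basis_j>; identity if orthonormal."""
    return [[inner(p, q) for q in basis] for p in basis]


# Illustrative input: three linearly independent complex vectors.
vecs = [[1, 1j, 0], [1, 0, 1], [0, 1, 1]]
ortho = gram_schmidt(vecs)
S = overlap_matrix(ortho)
```

After the call, `S` should equal the identity matrix up to rounding error, which is the orthonormality condition the procedure is meant to enforce.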