A Bucket Sort Algorithm for the Particle-In-Cell Method on Manycore Architectures

Jocksch, Andreas; Hariri, Farah; Tran, Trach-Minh; Brunner, Stephan; Gheller, Claudio; Villard, Laurent

doi:10.1007/978-3-319-32149-3_5

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9573)

Included in the following conference series:

International Conference on Parallel Processing and Applied Mathematics

Abstract

The Particle-In-Cell (PIC) method is used effectively in many scientific simulation codes. Good performance of the PIC approach requires data locality, which in turn relies on efficient sorting algorithms. We present a bucket sort algorithm with a small memory footprint for the PIC method, targeting Graphics Processing Units (GPUs). The performance of our sorting algorithm increases with the amount of storage provided and with the orderliness of the particles. For our application, where particles are presorted, it performs better and requires less memory than other sorting algorithms in the literature. The overall PIC algorithm performs best when the sorting is applied.
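
To illustrate the general idea of sorting particles into per-cell buckets, here is a minimal sequential sketch in Python. It is not the algorithm from the paper: it is a textbook counting-based bucket sort (histogram, prefix sum, scatter) that writes into a second buffer, whereas the paper targets a small memory footprint on GPUs. The function name `bucket_sort_particles`, the 1D grid, and the parameters `cell_width` and `num_cells` are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def bucket_sort_particles(
    positions: Sequence[float], cell_width: float, num_cells: int
) -> Tuple[List[float], List[int]]:
    """Reorder 1D particle positions so particles in the same cell are contiguous.

    Assumes every position lies in [0, num_cells * cell_width).
    Returns the reordered positions and the start offset of each cell's bucket
    (offsets has num_cells + 1 entries; bucket c is out[offsets[c]:offsets[c + 1]]).
    """
    # Pass 1: compute each particle's cell and count particles per cell.
    counts = [0] * num_cells
    cells = []
    for x in positions:
        c = min(int(x / cell_width), num_cells - 1)  # clamp rounding at the upper edge
        cells.append(c)
        counts[c] += 1

    # Exclusive prefix sum turns the counts into bucket start offsets.
    offsets = [0] * (num_cells + 1)
    for c in range(num_cells):
        offsets[c + 1] = offsets[c] + counts[c]

    # Pass 2: scatter each particle to the next free slot of its bucket.
    cursor = offsets[:-1]  # slicing copies, so offsets itself is not modified
    out = [0.0] * len(positions)
    for x, c in zip(positions, cells):
        out[cursor[c]] = x
        cursor[c] += 1
    return out, offsets


sorted_positions, offsets = bucket_sort_particles(
    [2.5, 0.1, 1.7, 0.9, 2.1, 1.2], cell_width=1.0, num_cells=3
)
```

In this sketch particles keep their original relative order within each cell, and the work is linear in the number of particles plus the number of cells. The extra output buffer is exactly the kind of memory cost that an in-place or small-footprint variant aims to avoid.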

Acknowledgments

The authors wish to thank Peter Messmer and Jakob Progsch from NVIDIA for helpful discussions.

Author information

Authors and Affiliations

CSCS, Swiss National Supercomputing Centre, Via Trevano 131, 6900, Lugano, Switzerland
Andreas Jocksch & Claudio Gheller
Swiss Plasma Center, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015, Lausanne, Switzerland
Farah Hariri, Trach-Minh Tran, Stephan Brunner & Laurent Villard

Corresponding author

Correspondence to Andreas Jocksch.

Editor information

Editors and Affiliations

Czestochowa University of Technology, Czestochowa, Poland
Roman Wyrzykowski
Department of Computer Science, University of Southern California, Marina Del Rey, California, USA
Ewa Deelman
Electrical Engineering & Comput. Science, University of Tennessee, Knoxville, Tennessee, USA
Jack Dongarra
Czestochowa University of Technology, Institute of Computer & Information Sci., Czestochowa, Poland
Konrad Karczewski
Department of Computer Science, AGH University of Science and Technology, Krakow, Poland
Jacek Kitowski
Systèmes d’informations, Big Data et Rec, AGH University of Science and Technology, Krakow, Poland
Kazimierz Wiatr

About this paper

Cite this paper

Jocksch, A., Hariri, F., Tran, TM., Brunner, S., Gheller, C., Villard, L. (2016). A Bucket Sort Algorithm for the Particle-In-Cell Method on Manycore Architectures. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Kitowski, J., Wiatr, K. (eds) Parallel Processing and Applied Mathematics. PPAM 2015. Lecture Notes in Computer Science, vol 9573. Springer, Cham. https://doi.org/10.1007/978-3-319-32149-3_5

DOI: https://doi.org/10.1007/978-3-319-32149-3_5
Published: 02 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32148-6
Online ISBN: 978-3-319-32149-3
eBook Packages: Computer Science, Computer Science (R0)
