Encyclopedia of Big Data Technologies

2019 Edition
| Editors: Sherif Sakr, Albert Y. Zomaya

GPU-Based Hardware Platforms

  • Johns PaulEmail author
  • Bingsheng He
  • Chiew Tong Lau
Reference work entry
DOI: https://doi.org/10.1007/978-3-319-77525-8_172

Definitions

GPU, SIMD, single-package platforms, multi-package platforms GPU-based hardware platforms are platforms that usually use GPUs in conjunction with CPUs to accelerate specific tasks. GPUs have evolved as a powerful accelerator for processing huge amounts of data in parallel.

Overview

This chapter details the architectural design and internal working of GPU-based hardware platforms. For this, we broadly classify GPU-based hardware platforms into two categories: single-package and multi-package. We then detail the architectural differences as well as the advantages and limitations of each category.

GPU

Graphics processing units (GPUs) were initially designed to accelerate the rendering of images for video editing and gaming. However, more recently, GPUs have evolved as a powerful accelerator for processing huge amounts of data in parallel. Similar to CPUs, GPUs also contains processing cores, registers, multiple levels of cache, and a global memory unit. However, GPUs differ...

This is a preview of subscription content, log in to check access.

References

  1. Bergstra J, Bastien F, Breuleux O, Lamblin P, Pascanu R, Delalleau O, Desjardins G, Warde-Farley D, Goodfellow IJ, Bergeron A, Bengio Y (2011) Theano: deep learning on GPUs with python. In: Big learn workshop, NIPS’11Google Scholar
  2. Cates JE, Lefohn AE, Whitaker RT (2018) GIST: an interactive, GPU-based level set segmentation tool for 3D medical images. Med Image Anal 8(3):217–231. https://doi.org/10.1016/j.media.2004.06.022CrossRefGoogle Scholar
  3. Chetlur S, Woolley C, Vandermersch P, Cohen J, Tran J, Catanzaro B, Shelhamer E (2014) cuDNN: efficient primitives for deep learning. CoRR. http://arxiv.org/abs/1410.0759. https://dblp.org/rec/bib/journals/corr/ChetlurWVCTCS14
  4. Heimel M, Saecker M, Pirk H, Manegold S, Markl V (2013) Hardware-oblivious parallelism for in-memory column-stores. Proc VLDB Endow 6(9):709–720. https://doi.org/10.14778/2536360.2536370CrossRefGoogle Scholar
  5. Kruger J, Westermann R (2003) Acceleration techniques for GPU-based volume rendering. In: Proceedings of the 14th IEEE visualization 2003 (VIS’03), IEEE computer society, Washington, DC, p 38.  https://doi.org/10.1109/VIS.2003.10001
  6. Liu Y, Liu X, Wu E (2004) Real-time 3d fluid simulation on GPU with complex obstacles. In: Proceedings of the 12th Pacific conference on computer graphics and applications, PG 2004, pp 247–256.  https://doi.org/10.1109/PCCGA.2004.1348355
  7. Mielikainen J, Huang B, Huang HLA, Goldberg MD (2012) Improved GPU/CUDA based parallel weather and research forecast (WRF) single moment 5-class (WSM5) cloud microphysics. IEEE J Sel Top Appl Earth Obs Remote Sens 5(4):1256–1265.  https://doi.org/10.1109/JSTARS.2012.2188780CrossRefGoogle Scholar
  8. Nickolls J, Dally WJ (2010) The GPU computing era. IEEE Micro 30(2):56–69. https://doi.org/10.1109/MM.2010.41CrossRefGoogle Scholar
  9. Owens JD, Luebke D, Govindaraju N, Harris M, Krüger J, Lefohn AE, Purcell TJ (2007) A survey of general-purpose computation on graphics hardware. Comput Graphics Forum 26(1):80–113. https://doi.org/10.1111/j.1467-8659.2007.01012.xCrossRefGoogle Scholar
  10. Owens JD, Houston M, Luebke D, Green S, Stone JE, Phillips JC (2008) GPU computing. Proc IEEE 96(5):879–899.  https://doi.org/10.1109/JPROC.2008.917757CrossRefGoogle Scholar
  11. Paul J, He J, He B (2016) GPL: a GPU-based pipelined query processing engine. In: Proceedings of the 2016 international conference on management of data (SIGMOD’16). ACM, New York, pp 1935–1950. https://doi.org/10.1145/2882903.2915224CrossRefGoogle Scholar
  12. Qualcomm (2017) Snapdragon 835 Mobile Platform. https://www.qualcomm.com/products/snapdragon/proc essors/835
  13. Shimokawabe T, Aoki T, Muroi C, Ishida J, Kawano K, Endo T, Nukada A, Maruyama N, Matsuoka S (2010) An 80-fold speedup, 15.0 tflops full GPU acceleration of non-hydrostatic weather model asuca production code. In: Proceedings of the 2010 ACM/IEEE international conference for high performance computing, networking, storage and analysis (SC’10). IEEE computer society, Washington, DC, pp 1–11. https://doi.org/10.1109/SC.2010.9Google Scholar
  14. Wikipedia (2011) Semiconductor package. https://en.wikipedia.org/wiki/Semiconductor_package
  15. Wikepedia (2017) Apple mobile application processors. https://en.wikipedia.org/wiki/Apple_mobile_application_processors?oldid=752966303

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Nanyang Technological UniversitySingaporeSingapore
  2. 2.National University of SingaporeSingaporeSingapore