Abstract
The rate of scientific discovery depends on the speed at which accurate results and analysis can be obtained. The use of parallel co-processors such as Graphical Processing Units (GPUs) is becoming more and more important in meeting this demand as improvements in serial data processing speed become increasingly difficult to sustain. However, parallel data processing requires more complex programming compared to serial processing. Here we present our methods for parallelising two pieces of scientific software, leveraging multiple GPUs to achieve up to thirty times speed up.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
EPSRC (2014)
EPSRC: Software for the future ii (2014)
Lau, L., Griffiths, M., Holmes, V., Ward, R., Jay, C., Dibsdale, C., Venters, C., Xu, J.: The blind men and the elephant: towards an empirical evaluation framework for software sustainability journal of open research software. J. Open Res. Softw. 2, e8 (2014)
Top500: Titan - cray xk7, opteron 6274 16c 2.200ghz, cray gemini interconnect, nvidia k20x (2013)
Nickolls, J., Dally, W.J.: The GPU computing era. Micro IEEE 30(2), 56–69 (2010)
McKenney, P.E.: Is parallel programming hard, and, if so, what can you do about it? (2011)
Tarditi, D., Puri, S., Oglesby, J.: Accelerator: using data parallelism to program GPUs for general-purpose uses. In: Proceedings of the 12th International Conference on Architectural, pp 325–335 (2006)
Harris, M.: Mapping computational concepts to GPUs. In: ACM SIGGRAPH 2005 Courses, SIGGRAPH ’05, ACM, NY, USA (2005)
Takizawa, H., Kobayashi, H.: Hierarchical parallel processing of large scale data clustering on a pc cluster with GPU co-processing. J. Supercomput. 36(3), 219–234 (2006)
Lindholm, E., Nickolls, J., Oberman, S., Montrym, J.: Nvidia tesla: a unified graphics and computing architecture. Micro IEEE 28(2), 39–55 (2008)
Newall, M., Holmes, V., Venters, C., Lunn, P.: GPU cluster for accelerated processing and visualisation of scientific data (2014)
Nvidia: Opencl (2013)
Nvidia: Introduction to cuda (2008)
Nvidia: The cuda parallel computing platform (2013)
Bonebright, T., Cook, P., Flowers, J., Miner, N., Neuhoff, J., Bargar, R., Barrass, S., Berger, J., Evreinov, G., Tecumseh Fitch, W., et al.: Sonification report: status of the field and research agenda (1997)
Nvidia: The allen telescope array (2013)
Nvidia: Nvidia cuda zone (2013)
Muhamedsalih, H., Jiang, X., Gao, F.: Accelerated surface measurement using wavelength scanning interferometer with compensation of environmental noise. In: Procedia Engineering: 12th CIRP Conference on Computer Aided Tolerancing, Apr 2012
Eager, D.L., Zahorjan, J., Lozowska, E.D.: Speedup versus efficiency in parallel systems. IEEE Trans. Comput. 38(3), 408–423 (1989)
Oxford University: Emerald: e-infrastructure south GPU supercomputer (2013)
Innovate UK (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Newall, M., Holmes, V., Venters, C., Lunn, P. (2015). Delivering Faster Results Through Parallelisation and GPU Acceleration. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems in Science and Information 2014. SAI 2014. Studies in Computational Intelligence, vol 591. Springer, Cham. https://doi.org/10.1007/978-3-319-14654-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-14654-6_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14653-9
Online ISBN: 978-3-319-14654-6
eBook Packages: EngineeringEngineering (R0)