A Hybrid Parallel Computing Model to Support Scalable Processing of Big Oceanographic Spatial Data

Song, Miaomiao; Li, Wenwen; Li, Wenqing; Liu, Enxiao; Yu, Dingfeng

doi:10.1007/978-981-10-3969-0_32

A Hybrid Parallel Computing Model to Support Scalable Processing of Big Oceanographic Spatial Data

Miaomiao Song¹³,
Wenwen Li¹⁴,
Wenqing Li¹³,
Enxiao Liu¹³ &
…
Dingfeng Yu¹³

Conference paper
First Online: 03 March 2017

1038 Accesses
1 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 699))

Abstract

Oceanographic sciences are facing big challenges due to the deluge of big data. As of 2010, the amount of new data stored in the world main countries, led by the US, has grown over 7 exabytes. Although the computer hardware is quickly evolving, with faster processor frequency, multi-core technology, and larger memory, traditional reprocessing paradigm on a single-desktop basis still suffers from significant limitations in its low computational efficiency and scalability. In this paper, we report our effort in developing a hybrid parallel computing model which utilizes Graphic Processing Unit (GPU) to accelerate Hadoop Map Reduce system. In each computing node, the actual reprocessing is offloaded from a CPU to a GPU to further boost up the system performance. We describe the architecture design of the proposed model and the automated task/data assignment on each GPU-enabled compute node. Electronic Navigational Charts in ocean fields involves a huge amount of spatio-temporal data. Reprojection of these data between different coordinate reference systems, which is a computation-intensive task, is selected as the use case. Systematic experiments were conducted to demonstrate the good performance of the proposed model.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Mitchell, A.E., et al.: NASA’s earth observing data and information system-supporting interoperability through a scalable architecture. AGU Fall Meet. Abstr. 1 (2013)
Google Scholar
Shekhar, S., Gunturi, V., Evans, M.R., Yang, K.: Spatial big-data challenges intersecting mobility and cloud computing. In: Proceedings of the Eleventh ACM International Workshop on Data Engineering for Wireless and Mobile Access. ACM, Scottsdale, pp. 1–6 (2012)
Google Scholar
Miao, X., Hao, L.: An implementation of GPU accelerated MapReduce: using Hadoop with OpenCL for data- and compute-intensive jobs. In: 2012 International Joint Conference on Service Sciences (IJCSS), pp. 6–11 (2012)
Google Scholar
Aji, A., Wang, F., Vo, H., Lee, R., Liu, Q., Zhang, X., Saltz, J.: Hadoop GIS: a high performance spatial data warehousing system over MapReduce. Proc. VLDB Endow. 6, 1009–1020 (2013)
Article Google Scholar
Hecht, H., Berking, B., Buttgenbach, G., et al.: The Electronic Chart: Functions, Potential, and Limitations of a New Marine Navigation System. GITC bv, Lemmer (2006)
Google Scholar
Shvachko, K., Kuang, H., Radia, S., et al.: The Hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1–10. IEEE (2010)
Google Scholar
Shao, G., Berman, F., Wolski, R.: Master/slave computing on the grid. In: Proceedings of the 9th Heterogeneous Computing Workshop (HCW 2000), pp. 3–16. IEEE (2000)
Google Scholar
Bell, N., Hoberock, J.: Thrust: a productivity-oriented library for CUDA. In: GPU Computing Gems Jade Edition, vol. 2, pp. 359–371 (2011)
Google Scholar

Download references

Acknowledgment

The research work report in this paper was mainly supported by the Young Scientists Funds (Grant No. 2015QN027) from Shandong Academy of Sciences. It was partially sponsored by the Youth Fund of Natural Science of China (Grant No. 41401435).

Author information

Authors and Affiliations

Institute of Oceanographic Instrument Shandong Academy of Sciences, Qingdao, China
Miaomiao Song, Wenqing Li, Enxiao Liu & Dingfeng Yu
GeoDa Center for Geospatial Analysis and Computation School of Geographical Sciences and Urban Planning, Arizona State University, Tempe, USA
Wenwen Li

Authors

Miaomiao Song
View author publications
You can also search for this author in PubMed Google Scholar
Wenwen Li
View author publications
You can also search for this author in PubMed Google Scholar
Wenqing Li
View author publications
You can also search for this author in PubMed Google Scholar
Enxiao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Dingfeng Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Miaomiao Song .

Editor information

Editors and Affiliations

Beijing Institute of Technology, Beijing, China
Hanning Yuan
Beijing Institute of Technology, Beijing, China
Jing Geng
Wuhan University, Wuhan, China
Fuling Bian

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, M., Li, W., Li, W., Liu, E., Yu, D. (2017). A Hybrid Parallel Computing Model to Support Scalable Processing of Big Oceanographic Spatial Data. In: Yuan, H., Geng, J., Bian, F. (eds) Geo-Spatial Knowledge and Intelligence. GRMSE 2016. Communications in Computer and Information Science, vol 699. Springer, Singapore. https://doi.org/10.1007/978-981-10-3969-0_32

Download citation

DOI: https://doi.org/10.1007/978-981-10-3969-0_32
Published: 03 March 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3968-3
Online ISBN: 978-981-10-3969-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics