Abstract
Ocean observation plays an essential role in ocean exploration. Ocean science is entering into big data era with the exponentially growth of information technology and advances in ocean observatories. Ocean observatories are collections of platforms capable of carrying sensors to sample the ocean over appropriate spatio-temporal scales. Data collected by these platforms help answer a range of fundamental and applied research questions. Given the huge volume, diverse types, sustained measurement and potential uses of ocean observing data, it is a typical kind of big data, namely marine big data. The traditional data-centric infrastructure is insufficient to deal with new challenges arising in ocean science. This paper discusses some possible new strategies to solve marine big data challenges in the phases of data storage, data computing and analysis. A geological example illustrates the significant use of marine big data. Finally, we highlight some challenges and key issues in marine big data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hole Oceanographic Institution. http://www.whoi.edu/Woods
Schofield, O., et al.: Automated sensor network to advance ocean science. EOS Trans. Am. Geophys. Union 91(39), 345–346 (2010)
Chave, A.D., et al.: Cyberinfrastructure for the US Ocean Observatories Initiative: enabling interactive observation in the ocean. In: OCEANS 2009 – EUROPE, Bremen. IEEE (2009)
Beyer, M.A., Laney, D.: The Importance of ‘Big Data’: A Definition. Gartner Inc., Stamford (2012)
Farcas, C., et al.: Ocean Observatories Initiative scientific data model. In: OCEANS 2011, Waikoloa, HI. IEEE (2011)
Park, K., Nguyen, M.C., Won, H.: Web-based collaborative big data analytics on big data as a service platform. In: 2015 17th International Conference on Advanced Communication Technology (ICACT), Seoul. IEEE (2015)
Bellatreche, L., Furtado, P., Mohania, M.K.: Guest editorial: a special issue in physical design for big data warehousing and mining. Distrib. Parallel Databases 34(3), 289–292 (2015)
Demchenko, Y., Laat, C., Membrey, P.: Defining architecture components of the Big Data Ecosystem. In: 2014 International Conference on Collaboration Technologies and Systems (CTS), Minneapolis, MN. IEEE (2014)
Du, Y., et al.: Study of migration model based on the massive marine data hybrid cloud storage. In: 2012 First International Conference on Agro-Geoinformatics (Agro-Geoinformatics), Shanghai. IEEE (2012)
Huang, D., et al.: Modeling and analysis in marine big data: advances and challenges. Math. Probl. Eng. 2015, 1–13 (2015)
Yang, K., et al.: Enabling efficient access control with dynamic policy updating for big data in the cloud. In: 2014 Proceedings IEEE INFOCOM, Toronto, ON. IEEE (2014)
Schofield, O., et al.: Ocean observatories and information: building a global ocean observing network. In: Orcutt, J. (ed.) Earth System Monitoring: Selected Entries from the Encyclopedia of Sustainability Science and Technology, pp. 319–336. Springer, New York (2013)
Argo. http://www.argo.net/
Ocean Networks Canada. http://www.oceannetworks.ca/
Ocean Observatories Initiative. http://oceanobservatories.org/
Siriweera, T.H.A.S., et al.: Intelligent big data analysis architecture based on automatic service composition. In: 2015 IEEE International Congress on Big Data (BigData Congress), New York. IEEE (2015)
Antonia, C., Andrei, N., MarÃa-Jesús, G.: DAMAR: information management system for marine data. In: 2011 IEEE – Spain OCEANS, Santander. IEEE (2011)
Ghemawat, S., Gobioff, H., Leung, S.-T.: The Google file system. SIGOPS Oper. Syst. Rev. 37(5), 29–43 (2003)
Shvachko, K., et al.: The Hadoop distributed file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), Incline Village, NV. IEEE (2010)
Chaiken, R., et al.: SCOPE: easy and efficient parallel processing of massive data sets. Proc. VLDB Endow. 1(2), 1265–1276 (2008)
Beaver, D., et al.: Finding a needle in Haystack: Facebook’s photo storage. In: Proceedings of the 9th USENIX Conference on Operating Systems Design and Implementation, Vancouver, BC, Canada, pp. 1–8. USENIX Association (2010)
Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mob. Netw. Appl. 19(2), 171–209 (2014)
DeCandia, G., et al.: Dynamo: amazon’s highly available key-value store. SIGOPS Oper. Syst. Rev. 41(6), 205–220 (2007)
Chang, F., et al.: Bigtable: a distributed storage system for structured data. ACM Trans. Comput. Syst. 26(2), 1–26 (2008)
Chodorow, K.: MongoDB: The Definitive Guide, 2nd edn. O’Reilly Media, Sebastopol (2013)
Murty, J.: Programming Amazon Web Services: S3, EC2, SQS, FPS, and SimpleDB. O’Reilly Media, Sebastopol (2008)
Anderson, J.C., Lehnardt, J., Slater, N.: CouchDB: The Definitive Guide. O’Reilly Media, Sebastopol (2010)
Cho, S.: Fast memory and storage architectures for the big data era. In: 2015 IEEE Asian Solid-State Circuits Conference (A-SSCC), Xiamen. IEEE (2015)
Mühlbauer, T., et al.: Instant loading for main memory databases. Proc. VLDB Endow. 6(14), 1702–1713 (2013)
Raynaud, T., Haque, R., Aït-kaci, H.: CedCom: a high-performance architecture for Big Data applications. In: 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), Doha. IEEE (2014)
Ousterhout, J., et al.: The case for RAMClouds: scalable high-performance storage entirely in DRAM. SIGOPS Oper. Syst. Rev. 43(4), 92–105 (2010)
Wu, X., et al.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 97–107 (2014)
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)
Isard, M., et al.: Dryad: distributed data-parallel programs from sequential building blocks. In: Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, Lisbon, Portugal, pp. 59–72. ACM (2007)
Malewicz, G., et al.: Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, Indianapolis, Indiana, USA, pp. 135–146. ACM (2010)
Wu, X.D., et al.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)
Leung, C.K.S., MacKinnon, R.K., Jiang, F.: Reducing the search space for big data mining for interesting patterns from uncertain data. In: 2014 IEEE International Congress on Big Data (BigData Congress), Anchorage, AK. IEEE (2014)
Wu, X., Zhang, S.: Synthesizing high-frequency rules from different data sources. IEEE Trans. Knowl. Data Eng. 15(2), 353–367 (2003)
Zhu, W., et al.: Multimedia big data computing. IEEE Multimedia 22(3), 96-c3 (2015)
Kantere, V.: A holistic framework for big scientific data management. In: 2014 IEEE International Congress on Big Data (BigData Congress), Anchorage, AK. IEEE (2014)
Olston, C., et al.: Pig latin: a not-so-foreign language for data processing. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, Vancouver, Canada, pp. 1099–1110. ACM (2008)
Thusoo, A., et al.: Hive - a petabyte scale data warehouse using Hadoop. In: 2010 IEEE 26th International Conference on Data Engineering (ICDE), Long Beach, CA. IEEE (2010)
Das, S., et al.: Ricardo: integrating R and Hadoop. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, Indianapolis, Indiana, USA, pp. 987–998. ACM (2010)
Wegener, D., et al.: Toolkit-based high-performance data mining of large data on MapReduce clusters. In: IEEE International Conference on Data Mining Workshops. ICDMW 2009, Miami, FL. IEEE (2009)
Lin, Y.C., Wu, C.-W., Tseng, V.S.: Mining high utility itemsets in big data. In: Cao, T., Lim, E.-P., Zhou, Z.-H., Ho, T.-B., Cheung, D., Motoda, H. (eds.) PAKDD 2015. LNCS, vol. 9078, pp. 649–661. Springer, Heidelberg (2015)
Acknowledgements
This work was supported by the National Natural Science Foundation of China (NSFC) under Grant No. 61572448, No. 61379127, and by the Shandong Provincial Natural Science Foundation, China under Grant No. ZR2014JL043.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, Y., Qiu, M., Liu, C., Guo, Z. (2016). Big Data in Ocean Observation: Opportunities and Challenges. In: Wang, Y., Yu, G., Zhang, Y., Han, Z., Wang, G. (eds) Big Data Computing and Communications. BigCom 2016. Lecture Notes in Computer Science(), vol 9784. Springer, Cham. https://doi.org/10.1007/978-3-319-42553-5_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-42553-5_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42552-8
Online ISBN: 978-3-319-42553-5
eBook Packages: Computer ScienceComputer Science (R0)