SQL or NoSQL? Which Is the Best Choice for Storing Big Spatio-Temporal Climate Data?
Management of big spatio-temporal data such as the results from large scale global climate models has long been a challenge because of the sheer vastness of the dataset. Although different data management systems like that incorporate a relational database management system have been proposed and widely used in prior studies, solutions that are particularly designed for big spatio-temporal data management have not been studied well. In this paper, we propose a general data management platform for high-dimensional spatio-temporal datasets like those found in the climate domain, where different database systems can be applied. Through this platform, we compare and evaluate several database systems including SQL database and NoSQL database from various aspects and explore the key impact factors for system performance. Our experimental results indicate advantages and disadvantages of each database system and give insight into the best system to use for big spatio-temporal data applications. Our analysis provides important insights into the understanding of performance of different data management systems, which is very useful for designing high dimensional big data applications.
KeywordsSpatio-temporal database NoSQL Big spatio-temporal data Performance
- 1.Unidata: NetCDF. http://www.unidata.ucar.edu/software/netcdf/
- 2.Apache: Hadoop (2011). http://hadoop.apache.org/
- 3.MongoDB: Mongodb. http://www.mongodb.org/
- 4.Cuzzocrea, A., Moussa, R.: A cloud-based framework for supporting effective and efficient OLAP in big data environments. In: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 680–684 (2014)Google Scholar
- 5.Brezany, P., Yan, Z., Janciak, I., Chen, P., Ye, S.: An elastic OLAP cloud platform. In: IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing, pp. 356–363 (2011)Google Scholar
- 8.Ameri, P., Grabowski, U., Meyer, J., Streit, A.: On the application and performance of MongoDB for climate satellite data. In: IEEE International Conference on Trust, Security and Privacy in Computing and Communications, pp. 652–659 (2014)Google Scholar
- 9.Jern, M., Franzen, J.: “GeoAnalytics” - exploring spatio-temporal and multivariate data. In: Tenth International Conference on Information Visualization, pp. 25–31 (2006)Google Scholar
- 14.Tang, W., Feng, W.: Parallel map projection of vector-based big spatial data: coupling cloud computing with graphics processing units. Comput. Environ. Urban Syst. 61(11), 187–197 (2014)Google Scholar