Storing and Querying Semi-structured Spatio-Temporal Data in HBase

  • Chong ZhangEmail author
  • Xiaoying Chen
  • Xiaosheng Feng
  • Bin Ge
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9998)


With the development of remote sensing, positioning and other technology, a large amount of spatio-temporal data require effective management. In the current research status, a lot of works have focused on how to effectively use HBase to store and quickly find structured spatio-temporal data. However, some spatio-temporal data exists in the semi-structured documents, such as metadata that describes the remote sensing products, under such context, the query is changed to spatio-temporal query + semi-structured query (XPath), which is less studies in previous works. In this paper, we focus on how to efficiently and economically achieve semi-structured spatio-temporal data storage and query in HBase. Firstly, the formal description of the problem is presented. Secondly, we propose HSSST storage model using a semi-structured approach TwigStack. On this basis, semi-structured spatio-temporal range query and kNN queries are carried out. Experiments are conducted on real dataset, comparing with MongoDB which need higher hardware configuration, the results show that in moderate configuration of machines, the performance of semi-structured spatio-temporal query algorithms are superior to MongoDB, thus it has advantage in real application.


Spatio-temporal Semi-structured HBase Range query KNN Query 


  1. 1.
    Chodorow, K.: MongoDB: The Definitive Guide. O’Reilly Media, Inc., Sebastopol (2013)Google Scholar
  2. 2.
    Faloutsos, C., Roseman, S.: Fractals for secondary key retrieval. In: Proceedings of the Eighth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 247–252. ACM (1989)Google Scholar
  3. 3.
    Han, D., Stroulia, E.: Hgrid: a data model for large geospatial data sets in hbase. In: 2013 IEEE Sixth International Conference on Cloud Computing (CLOUD), pp. 910–917. IEEE (2013)Google Scholar
  4. 4.
    Hsu, Y.T., Pan, Y.C., Wei, L.Y., Peng, W.C., Lee, W.C.: Key formulation schemes for spatial index in cloud data managements. In: 2012 IEEE 13th International Conference on Mobile Data Management (MDM), pp. 21–26. IEEE (2012)Google Scholar
  5. 5.
    Nishimura, S., Das, S., Agrawal, D., Abbadi, A.E.: Md-hbase: a scalable multi-dimensional data infrastructure for location aware services. In: 2011 12th IEEE International Conference on Mobile Data Management (MDM), vol. 1, pp. 7–16. IEEE (2011)Google Scholar
  6. 6.
    Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. In: ACM Sigmod Record, vol. 24, pp. 71–79. ACM (1995)Google Scholar
  7. 7.
    Vahdati, S., Karim, F., Huang, J.-Y., Lange, C.: Mapping large scale research metadata to linked data: a performance comparison of HBase, CSV and XML. In: Garoufallou, E., Hartley, R.J., Gaitanou, P. (eds.) MTSR 2015. CCIS, vol. 544, pp. 261–273. Springer, Heidelberg (2015). doi: 10.1007/978-3-319-24129-6_23 CrossRefGoogle Scholar
  8. 8.
    Zhang, N., Zheng, G., Chen, H., Chen, J., Chen, X.: Hbasespatial: a scalable spatial data storage based on hbase. In: 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), pp. 644–651. IEEE (2014)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Chong Zhang
    • 1
    • 2
    Email author
  • Xiaoying Chen
    • 1
    • 2
  • Xiaosheng Feng
    • 1
    • 2
  • Bin Ge
    • 1
    • 2
  1. 1.Science and Technology on Information Systems Engineering LaboratoryNational University of Defense TechnologyChangshaChina
  2. 2.Collaborative Innovation Center of Geospatial TechnologyWuhanChina

Personalised recommendations