Advertisement

Cluster Computing

, Volume 22, Supplement 3, pp 6483–6497 | Cite as

An extra spatial hierarchical schema in key-value store

  • Kun Zheng
  • Kang ZhengEmail author
  • Falin Fang
  • Miao Zhang
  • Qi Li
  • Yanghui Wang
  • Wenyu Zhao
Article

Abstract

The rapid growth of positioning technologies has resulted in an explosion of spatial data, and how to manage and retrieve such data has become a challenge. To solve this problem, many researchers pay attention to build spatial index in key-value store. Nevertheless, this will generate questions regarding the spatial index update and management. Furthermore, the efficiency of spatial query operations would decrease because it will generate much more request on network. Besides some scholars adopt space filling curve to carry out spatial query with primary key index, however it can bring out the questions of “Edge-Case Problem” and “Z-Order Problem” which is caused by space filling curve. To solve these questions, scholars resort to spatial index again. Nevertheless, we deem that the questions can be resolved without building spatial index. So this paper advocates an extra spatial hierarchical schema inspired by geohash, and design spatial query method based on primary keys index. Finally, to test the query accuracy and efficiency of the spatial hierarchical schema, we adopt Z-ordering, Hilbert, Row and Gray into the process of primary key encoding and conduct range query and k-NN queries in HBase. Experiment evaluation shows that the efficiency of the spatial queries good based on this schema even without the help of a spatial index.

Keywords

Primary key Key-value store Spatial hierarchical schema Space filling curve 

Notes

Acknowledgements

The authors would like to thank the National Science and Technology Major Project (No. 2017ZX05036-001-010), the National Key Research Program Plan of China (No. 2016YFB0502603), the Natural Science Foundation of Hubei Province of China (No. 2015CFB400) and Fundamental Research Founds for National University, China University of Geosciences (Wuhan) (1610491B20).

References

  1. 1.
    Le, H.V., Takasu, A.: An efficient distributed index for geospatial databases, pp. 28–42. Springer International Publishing, Basel (2015)Google Scholar
  2. 2.
    Li, F., et al.: R-Store: a scalable distributed system for supporting real-time analytics. In: Proceeding of the IEEE International Conference on Data Engineering (2014)Google Scholar
  3. 3.
    Zhou, X., et al.: ABR-Tree: an efficient distributed multidimensional indexing approach for massive data. In: Proceeding of the ICA3PP International Workshops and Symposiums on Algorithms and Architectures for Parallel Processing (2015)CrossRefGoogle Scholar
  4. 4.
    Han, J., et al.: Survey on NoSQL database. In: Proceeding of the International Conference on Pervasive Computing and Applications (2011)Google Scholar
  5. 5.
    George, L.: HBase schema design—things you need to know. Oreilly Community. http://www.oreilly.com/pub/e/2058 (2011)
  6. 6.
  7. 7.
    Zhong, Y., et al.: Towards parallel spatial query processing for big spatial data. In: Proceeding of the Parallel and Distributed Processing Symposium Workshops & Phd Forum (2012)Google Scholar
  8. 8.
    Chen, X.Y., et al.: Efficient historical query in HBase for spatio-temporal decision support. Int. J. Comput. Commun. Control 11(5), 613–630 (2016)CrossRefGoogle Scholar
  9. 9.
    Han, D., E. Stroulia: HGrid: a data model for large geospatial data sets in HBase. In: Proceeding of the IEEE Sixth International Conference on Cloud Computing (2013)Google Scholar
  10. 10.
    Zhang, N., et al.: HBaseSpatial: a scalable spatial data storage based on HBase. In: Proceeding of the IEEE International Conference on Trust, Security and Privacy in Computing and Communications (2014)Google Scholar
  11. 11.
    Aji, A., et al.: Hadoop-GIS: a high performance spatial data warehousing system over MapReduce. Proc. Vldb Endow. 6(11), 1009–1020 (2013)CrossRefGoogle Scholar
  12. 12.
    Zhang, X., et al.: An efficient multi-dimensional index for cloud data management. In: Proceeding of the International CIKM Workshop on Cloud Data Management, Clouddb 2009, Hong Kong, China, November (2009)Google Scholar
  13. 13.
    Lee, K., et al.: Efficient spatial query processing for big data. In: Proceeding of the ACM Sigspatial International Conference (2014)Google Scholar
  14. 14.
    Nishimura, S., et al.: MD-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services. Distrib Parallel Databases 31(2SI), 289–319 (2013)CrossRefGoogle Scholar
  15. 15.
    Hong, V.L., Distributed Moving Objects Database Based on Key–Value Stores. 2016Google Scholar
  16. 16.
  17. 17.
    Papadopoulos, A., Katsaros, D.: A-Tree: distributed indexing of multidimensional data for cloud computing environments. In: Proceeding of the IEEE Third International Conference on Cloud Computing Technology and Science (2011)Google Scholar
  18. 18.
    Zhang, Y., et al. A Large-scale Online Search System of High-Dimensional Vectors Based on Key-Value Store. In: Proceeding of the Eighth International Conference on Semantics, Knowledge and Grids (2012)Google Scholar
  19. 19.
    Chen, X., Zhang, J., Xu, Z., Liu, J.: HIB-tree: an efficient index method for the big data analytics of large-scale human activity trajectories. Future Gener. Comput. Syst. (2018).  https://doi.org/10.1016/j.future.2018.01.004 CrossRefGoogle Scholar
  20. 20.
    Fox, A., Eichelberger, C., Hughes,  J., Lyon, S.: Spatio-temporal indexing in non-relational distributed databases. In: Proceedings of IEEE International Conference on Big Data, pp. 291–299, Santa Clara Marriott, CA, USA (2013)Google Scholar
  21. 21.
    Hughes, J.N., et al.: A survey of techniques and open-source tools for processing streams of spatio-temporal events. In: Proceedings of the 7th ACM Sigspatial International Workshop on Geostreaming (IWGS), pp. 39–42 (2016)Google Scholar
  22. 22.
    Zheng, K., Gu, D., Fang, F., et al.: Data storage optimization strategy in distributed column-oriented database by considering spatial adjacency. Clust. Comput. 12, 1–12 (2017)Google Scholar
  23. 23.
  24. 24.
    Abel, D.J., Mark, D.M.: A comparative analysis of some two-dimensional orderings. Int. J. Geogr. Inf. Sci. 4(1), 21–31 (1990)CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Kun Zheng
    • 1
  • Kang Zheng
    • 1
    Email author
  • Falin Fang
    • 1
  • Miao Zhang
    • 1
  • Qi Li
    • 1
  • Yanghui Wang
    • 1
  • Wenyu Zhao
    • 1
  1. 1.Faculty of Information EngineeringChina University of Geosciences (Wuhan)WuhanChina

Personalised recommendations