Advertisement

Evaluation of Performance Metrics in GeoRediSpark Framework for GeoSpatial Query Processing

  • G. V. N. Priyanka
  • S. VasaviEmail author
  • A. Anu Gokhale
Conference paper
Part of the Learning and Analytics in Intelligent Systems book series (LAIS, volume 3)

Abstract

Now-a-days we are moving towards digitization and making all our devices producing bigdata. This bigdata has variety of data and has paved the way to the emergence of NoSQL databases, like Cassandra, MongoDB, Redis. Bigdata such as geospatial data requires geospatial analytics in applications such as tourism, marketing, rural development. Spark framework provides operators for storing and processing distributed data. Our earlier work proposed “GeoRediSpark” to integrate Redis with Spark. Redis is a key-value store that uses in-memory store, hence integrating Redis with Spark can extend the real-time processing of geospatial data. The paper investigated on storage and retrieval of Redis built in geospatial queries and added two new geospatial operators namely GeoWithin and GeoIntersect to enhance the capabilities of Redis. Hashed indexing is used to improve the processing performance. Comparison on Redis metrics on three benchmark datasets is made in this paper. Hashset is used to display geographic data. Output of geospatial queries is visualized in specific to type of place and nature of query using Tableau.

Keywords

Geospatial data Consistent hashing Location-based query Master-executor daemon REmote DIctionary Server (Redis) metrics Hashset GeoHash Resilient Distributed Datasets (RDD) 

References

  1. 1.
    Sukumar P (2017) How Spark and Redis help derive geographical insights about customers. https://build.hoteltonight.com/how-spark-and-redis-help-derive-geographical-insights-about-customers-be7e32c1f479
  2. 2.
    SriHarsha R (2017) Magellan: Geospatial processing made easy. http://magellan.ghost.io/magellan-geospatial-processing-made-easy/
  3. 3.
    Nativ S (2017) Building a large scale recommendation engine with Spark and Redis-ML. https://databricks.com/session/building-a-large-scale-recommendation-engine-with-spark-and-redis-ml
  4. 4.
    Cihan B (2016) Machine learning on steroids with the new Redis-ML module. https://redislabs.com/blog/machine-learning-steroids-new-redis-ml-module/
  5. 5.
    Hagedorn S, Götze P, Sattler K-U (2017) The Stark framework for spatial temporal data analytics on Spark. In: Proceedings of 20th international conference on extending database technology (EDBT), pp 123–142Google Scholar
  6. 6.
    Tang M, Yu Y, Aref WG, Mahmood AR, Malluhi QM, Ouzzani M (2016) In-memory distributed spatial query processing and optimization, pp 1–15. http://merlintang.github.io/paper/memory-distributed-spatial.pdf
  7. 7.
    Tang M, Yu Y, Malluhi QM, Ouzzani M, Aref WG (2016) Location Spark: a distributed in memory data management system for big spatial data. Proc VLDB Endowment 9(13):1565–1568CrossRefGoogle Scholar
  8. 8.
    Hendawi AM, Ali M, Mokbel MF (2017) Panda∗: a generic and scalable framework for predictive spatio-temporal queries. GeoInformatica 21(2):175–208CrossRefGoogle Scholar
  9. 9.
    Putri FK, Song G, Kwon J, Rao P (2017) DISPAQ: distributed profitable-area query from big taxi trip data. Sensors 17(10):2201, 1–42Google Scholar
  10. 10.
    Hegde V, Aswathi TS, Sidharth R (2016) Student residential distance calculation using Haversine formulation and visualization through Googlemap for admission analysis. In: Proceedings of IEEE international conference on computational intelligence and computing research (ICCIC), pp 1–5Google Scholar
  11. 11.
    Li L, Taniar D, Indrawan-Santiago M, Shao Z (2017) Surrounding join query processing in spatial databases. Proceedings of ADC 2017, pp 17–28. Springer International PublishingGoogle Scholar
  12. 12.
    Vasavi S, Priyanka GVN, Anu Gokhale A (2019) Framework for visualization of geospatial query processing by integrating Redis with Spark, pp 1–19, IJMSTR, vol 6, issue 1 (in press)Google Scholar
  13. 13.
    Places in India (2018) http://www.latlong.net/country/india-102.html. Accessed 1 June 2018
  14. 14.
    Geospatial Analytics in Magellan (2018) https://raw.githubusercontent.com/dima42/uber-gps-analysis/master/gpsdata/all.tsv. Accessed 1 June 2018
  15. 15.
    Spatial dataset (2018) http://www.cs.utah.edu/~lifeifei/research/tpq/cal.cnode. Accessed 1 June 2018
  16. 16.
    Branagan C, Crosby P (2013) Understanding the top 5 Redis performance metrics. Datadog Inc, pp 1–22Google Scholar
  17. 17.
    Wang Y (2018) Vecstra: an efficient and scalable geo-spatial in-memory cache. In: Proceedings of the VLDB 2018Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • G. V. N. Priyanka
    • 1
  • S. Vasavi
    • 1
    Email author
  • A. Anu Gokhale
    • 2
  1. 1.VR Siddhartha Engineering CollegeVijayawadaIndia
  2. 2.Illinois State UniversityNormalUSA

Personalised recommendations