Skip to main content

Progressive Genetic Evolutions-Based Join Cost Optimization (PGE-JCO) for Distributed RDF Chain Queries

  • Conference paper
  • First Online:
Proceedings of the First International Conference on Computational Intelligence and Informatics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 507))

  • 1048 Accesses

Abstract

The finest way of semantic Web representation for further indexing and querying becomes robust due to the RDF structure. The magnified growth in semantic Web data, RDF query processing, emerged as complex due to numerous joins. Henceforth, to achieve scalability and robustness toward search space and time, the query joins must be optimized. The majority of existing benchmarking models have been evaluated on a single source. These methods utterly failed to optimize the queries with nested loop join, bind join, and AGJoin, which is due to the search cost only being considered as the optimization factor by all of the existing models. Hence to optimize the distribute chain queries, here in this paper we propose a novel evolutionary approach, which is based on progressive genetic evolutions to identify the optimized chain queries even for distributed triple stores. The experimental results show that the significance of the proposed model over the existing evolutionary approaches is optimal. Also, it is obvious to confirm that the metrics and evolution process introduced here in this paper motivates future research to identify the new dimensions of chain query optimization to search in distributed triple stores.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Berners-Lee, T. H. (2001). The semantic web. Scientific american, 28–37.

    Google Scholar 

  2. Simmhan, Y. L. (2005). A survey of data provenance in e-science. ACM Sigmod Record, 31–36.

    Google Scholar 

  3. David, C. O. (2012). A survey of RDF storage approaches. ARIMA Journal, 11–35.

    Google Scholar 

  4. Quilitz, B. (2008). Querying distributed RDF data sources with SPARQL. Springer Berlin Heidelberg, 524–538.

    Google Scholar 

  5. Schmidt, M. M. (2010). Foundations of SPARQL query optimization. 13th International Conference on Database Theory (pp. 4–33). ACM.

    Google Scholar 

  6. Gonçalves, F. A. (2014). Query join ordering optimization with evolutionary multi-agent systems. Expert Systems with Applications, 6934–6944.

    Google Scholar 

  7. Whitley, D. (2014). An executable model of a simple genetic algorithm. In Foundations of genetic algorithms (pp. 45–62).

    Google Scholar 

  8. Jain, A. W. (2013). Learning trajectory preferences for manipulators via iterative improvement. Advances in Neural Information Processing Systems, 575–583.

    Google Scholar 

  9. Otten, R. H. (2012). The annealing algorithm. Springer Science & Business Media.

    Google Scholar 

  10. Hogenboom, A. N. (2012). RCQ-ACS: RDF Chain Query Optimization Using an Ant Colony System. International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) (pp. 74–81). IEEE/WIC/ACM.

    Google Scholar 

  11. Hogenboom, A. M. (2009). RCQ-GA: RDF chain query optimization using genetic algorithm. Springer Berlin Heidelberg, 181–192.

    Google Scholar 

  12. H. Stuckenschmidt, R. V. (2005). Towards Distributed Processing of RDF Path Queries. International Journal of Web Engineering and Technology, 207–230.

    Google Scholar 

  13. Andrey Gubichev, T. N. (2014). Exploiting the query structure for efficient join ordering in SPARQL queries. 17th international conference on Extending Database Technology.

    Google Scholar 

  14. TanselDokeroglu, U. T. (2012). Particle Swarm Intelligence as a New Heuristic for the Optimization of Distributed Database Queries. 6th International Conference on Application of Information and Communication Technologies, (pp. 1–7).

    Google Scholar 

  15. Senn, J. (2010). Parallel Join Processing on Graphics Processors for the Resource Description Framework. 23rd International Conference on Architecture of Computing Systems, (pp. 1–8).

    Google Scholar 

  16. Sherifsakr, S. E. (2014). Hybrid query execution engine for large attributed graphs. Journal of Information Systems, 45–73.

    Google Scholar 

  17. C. Liu, H. (2010). Towards efficient SPARQL query processing on RDF data. Tsinghua science and technology, 613–622.

    Google Scholar 

  18. X. Zhang, L. C. (2012). Towards efficient join processing over large RDF graph using map reduce. Scientific and Statistical Database Management, 250–259.

    Google Scholar 

  19. Layzer, D. (1980). Genetic variation and progressive evolution. American Naturalist, 809–826.

    Google Scholar 

  20. Ihaka, R. (1996). R: a language for data analysis and graphics. Journal of computational and graphical statistics, 299–314.

    Google Scholar 

  21. The Friend of a Friend (FOAF) project. (2000). Retrieved from foaf-project.org: http://www.foaf-project.org/.

  22. Agency, C. I. (2014). The CIA World Factbook 2015. New York: Skyhorse Publishing Inc.

    Google Scholar 

  23. Y. Guo, Z. P. (2005). Lubm: A benchmark for owl knowledge base systems. J. Web Sem, 158–182.

    Google Scholar 

  24. Wing, J. M. (2006). Computational thinking. Communications of the ACM, 33–35.

    Google Scholar 

  25. Chong, E. I. (2005). An efficient SQL-based RDF querying scheme. 31st international conference on Very large data bases (pp. 1216–1227). VLDB Endowment.

    Google Scholar 

  26. Carmines, E. G. (1979). Reliability and validity assessment. Sage publications.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Shailaja .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media Singapore

About this paper

Cite this paper

Shailaja, K., Kumar, P.V., Durga Bhavani, S. (2017). Progressive Genetic Evolutions-Based Join Cost Optimization (PGE-JCO) for Distributed RDF Chain Queries. In: Satapathy, S., Prasad, V., Rani, B., Udgata, S., Raju, K. (eds) Proceedings of the First International Conference on Computational Intelligence and Informatics . Advances in Intelligent Systems and Computing, vol 507. Springer, Singapore. https://doi.org/10.1007/978-981-10-2471-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-2471-9_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-2470-2

  • Online ISBN: 978-981-10-2471-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics