Skip to main content

Analytics over RDF Graphs

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1197))

Abstract

The continuous accumulation of multi-dimensional data and the development of Semantic Web and Linked Data published in RDF bring new requirements for data analytics tools. Such tools should take into account the special features of RDF graphs, exploit the semantics of RDF and support flexible aggregate queries. In this paper, we present an approach for applying analytics to RDF data, based on a high-level functional query language called HIFUN. According to that language, each analytical query is considered as a well-formed expression of a functional algebra and its definition is independent of the nature and structure of the data. In this work, we detail the required transformations, as well as the translation of HIFUN queries to SPARQL and we introduce the primary implementation of a tool, developed for these purposes.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://lod-cloud.net/.

  2. 2.

    https://www.w3.org/RDF/.

  3. 3.

    https://www.w3.org/TR/rdf-sparql-query/.

  4. 4.

    https://www.wikidata.org.

  5. 5.

    https://www.w3.org/TR/vocab-data-cube/.

  6. 6.

    https://team.inria.fr/oak/projects/warg/.

  7. 7.

    https://virtuoso.openlinksw.com/.

  8. 8.

    http://www.ics.forth.gr/isl/3DLod/.

References

  1. Abelló, A., et al.: Fusion cubes: towards self-service business intelligence. Int. J. Data Warehous. Min. (IJDWM) 9, 66–88 (2013)

    Article  Google Scholar 

  2. Antoniou, G., Van Harmelen, F.: A Semantic Web Primer. MIT Press, Cambridge (2004)

    Google Scholar 

  3. Beheshti, S.-M.-R., Benatallah, B., Motahari-Nezhad, H.R.: Scalable graph-based OLAP analytics over process execution data. Distrib. Parallel Databases 34(3), 379–423 (2014). https://doi.org/10.1007/s10619-014-7171-9

    Article  Google Scholar 

  4. Colazzo, D., Goasdoué, F., Manolescu, I., Roatiş, A.: RDF analytics: lenses over semantic graphs. In: Proceedings of the 23rd International Conference on World Wide Web (2014)

    Google Scholar 

  5. Etcheverry, L., Vaisman, A.A.: Enhancing OLAP analysis with web cubes. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 469–483. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_38

    Chapter  Google Scholar 

  6. Etcheverry, L., Vaisman, A.A.: QB4OLAP: a new vocabulary for OLAP cubes on the semantic web. In: Proceedings of the Third International Conference on Consuming Linked Data (2012)

    Google Scholar 

  7. Etcheverry, L., Vaisman, A.A.: Querying semantic web data cubes. In: AMW (2016)

    Google Scholar 

  8. Etcheverry, L., Vaisman, A.A.: Efficient analytical queries on semantic web data cubes. J. Data Semant. 6(4), 199–219 (2017). https://doi.org/10.1007/s13740-017-0082-y

    Article  Google Scholar 

  9. Inoue, H., Amagasa, T., Kitagawa, H.: An ETL framework for online analytical processing of linked open data. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 111–117. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38562-9_12

    Chapter  Google Scholar 

  10. Isaac, A., Haslhofer, B.: Europeana linked open data-data. europeana. eu. Semant. Web 4, 291–297 (2013)

    Article  Google Scholar 

  11. Kämpgen, B., Harth, A.: Transforming statistical linked data for use in OLAP systems. In: Proceedings of the 7th International Conference on Semantic Systems (2011)

    Google Scholar 

  12. Kämpgen, B., O’Riain, S., Harth, A.: Interacting with statistical linked data via OLAP operations. In: Simperl, E., et al. (eds.) ESWC 2012. LNCS, vol. 7540, pp. 87–101. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-46641-4_7

    Chapter  Google Scholar 

  13. Kokolaki, A., Tzitzikas, Y.: Facetize: an interactive tool for cleaning and transforming datasets for facilitating exploratory search. arXiv preprint arXiv:1812.10734 (2018)

  14. Mountantonakis, M., Tzitzikas, Y.: On measuring the lattice of commonalities among several linked datasets. Proc. VLDB Endow. 9, 1101–1112 (2016)

    Article  Google Scholar 

  15. Mountantonakis, M., Tzitzikas, Y.: How linked data can aid machine learning-based tasks. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds.) TPDL 2017. LNCS, vol. 10450, pp. 155–168. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67008-9_13

    Chapter  Google Scholar 

  16. Mountantonakis, M., Tzitzikas, Y.: LODsyndesis: global scale knowledge services. Heritage 1, 335–348 (2018)

    Article  Google Scholar 

  17. Mountantonakis, M., Tzitzikas, Y.: Scalable methods for measuring the connectivity and quality of large numbers of linked datasets. J. Data Inf. Qual. (JDIQ) 9, 1–49 (2018)

    Article  Google Scholar 

  18. Mountantonakis, M., Tzitzikas, Y.: Large scale semantic integration of linked data: a survey. ACM Comput. Surv. (CSUR) 52, 1–40 (2019)

    Article  Google Scholar 

  19. Nebot, V., Berlanga, R.: Building data warehouses with semantic web data. Decis. Support Syst. 52, 853–868 (2012)

    Article  Google Scholar 

  20. Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: a not-so-foreign language for data processing. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (2008)

    Google Scholar 

  21. Papadaki, M.-E., Papadakos, P., Mountantonakis, M., Tzitzikas, Y.: An interactive 3D visualization for the LOD cloud. In: EDBT/ICDT Workshops (2018)

    Google Scholar 

  22. Spyratos, N.: A functional model for data analysis. In: Larsen, H.L., Pasi, G., Ortiz-Arroyo, D., Andreasen, T., Christiansen, H. (eds.) FQAS 2006. LNCS (LNAI), vol. 4027, pp. 51–64. Springer, Heidelberg (2006). https://doi.org/10.1007/11766254_5

    Chapter  Google Scholar 

  23. Spyratos, N., Sugibuchi, T.: HIFUN - a high level functional query language for big data analytics. J. Intell. Inf. Syst. 51(3), 529–555 (2018). https://doi.org/10.1007/s10844-018-0495-6

    Article  Google Scholar 

  24. Spyratos, N., Sugibuchi, T.: Data exploration in the HIFUN language. In: Cuzzocrea, A., Greco, S., Larsen, H.L., Saccà, D., Andreasen, T., Christiansen, H. (eds.) FQAS 2019. LNCS (LNAI), vol. 11529, pp. 176–187. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27629-4_18

    Chapter  Google Scholar 

  25. Thusoo, A., et al.: Hive-a petabyte scale data warehouse using hadoop. In: 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) (2010)

    Google Scholar 

  26. Tzitzikas, Y., et al.: Integrating heterogeneous and distributed information about marine species through a top level ontology. In: Garoufallou, E., Greenberg, J. (eds.) MTSR 2013. CCIS, vol. 390, pp. 289–301. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-03437-9_29

    Chapter  Google Scholar 

  27. Wang, K., Xu, G., Su, Z., Liu, Y.D.: GraphQ: graph query processing with abstraction refinement-scalable and programmable analytics over very large graphs on a single \(\{\)PC\(\}\). In: 2015 Annual Technical Conference 2015 (2015)

    Google Scholar 

  28. Zapilko, B., Mathiak, B.: Performing statistical methods on linked data. In: International Conference on Dublin Core and Metadata Applications (2011)

    Google Scholar 

  29. Zhao, P., Li, X., Xin, D., Han, J.: Graph cube: on warehousing and OLAP multidimensional networks. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maria-Evangelia Papadaki .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Papadaki, ME., Tzitzikas, Y., Spyratos, N. (2020). Analytics over RDF Graphs. In: Flouris, G., Laurent, D., Plexousakis, D., Spyratos, N., Tanaka, Y. (eds) Information Search, Integration, and Personalization. ISIP 2019. Communications in Computer and Information Science, vol 1197. Springer, Cham. https://doi.org/10.1007/978-3-030-44900-1_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-44900-1_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-44899-8

  • Online ISBN: 978-3-030-44900-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics