Abstract
The continuous accumulation of multi-dimensional data and the development of Semantic Web and Linked Data published in RDF bring new requirements for data analytics tools. Such tools should take into account the special features of RDF graphs, exploit the semantics of RDF and support flexible aggregate queries. In this paper, we present an approach for applying analytics to RDF data, based on a high-level functional query language called HIFUN. According to that language, each analytical query is considered as a well-formed expression of a functional algebra and its definition is independent of the nature and structure of the data. In this work, we detail the required transformations, as well as the translation of HIFUN queries to SPARQL and we introduce the primary implementation of a tool, developed for these purposes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Abelló, A., et al.: Fusion cubes: towards self-service business intelligence. Int. J. Data Warehous. Min. (IJDWM) 9, 66–88 (2013)
Antoniou, G., Van Harmelen, F.: A Semantic Web Primer. MIT Press, Cambridge (2004)
Beheshti, S.-M.-R., Benatallah, B., Motahari-Nezhad, H.R.: Scalable graph-based OLAP analytics over process execution data. Distrib. Parallel Databases 34(3), 379–423 (2014). https://doi.org/10.1007/s10619-014-7171-9
Colazzo, D., Goasdoué, F., Manolescu, I., Roatiş, A.: RDF analytics: lenses over semantic graphs. In: Proceedings of the 23rd International Conference on World Wide Web (2014)
Etcheverry, L., Vaisman, A.A.: Enhancing OLAP analysis with web cubes. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 469–483. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_38
Etcheverry, L., Vaisman, A.A.: QB4OLAP: a new vocabulary for OLAP cubes on the semantic web. In: Proceedings of the Third International Conference on Consuming Linked Data (2012)
Etcheverry, L., Vaisman, A.A.: Querying semantic web data cubes. In: AMW (2016)
Etcheverry, L., Vaisman, A.A.: Efficient analytical queries on semantic web data cubes. J. Data Semant. 6(4), 199–219 (2017). https://doi.org/10.1007/s13740-017-0082-y
Inoue, H., Amagasa, T., Kitagawa, H.: An ETL framework for online analytical processing of linked open data. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 111–117. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38562-9_12
Isaac, A., Haslhofer, B.: Europeana linked open data-data. europeana. eu. Semant. Web 4, 291–297 (2013)
Kämpgen, B., Harth, A.: Transforming statistical linked data for use in OLAP systems. In: Proceedings of the 7th International Conference on Semantic Systems (2011)
Kämpgen, B., O’Riain, S., Harth, A.: Interacting with statistical linked data via OLAP operations. In: Simperl, E., et al. (eds.) ESWC 2012. LNCS, vol. 7540, pp. 87–101. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-46641-4_7
Kokolaki, A., Tzitzikas, Y.: Facetize: an interactive tool for cleaning and transforming datasets for facilitating exploratory search. arXiv preprint arXiv:1812.10734 (2018)
Mountantonakis, M., Tzitzikas, Y.: On measuring the lattice of commonalities among several linked datasets. Proc. VLDB Endow. 9, 1101–1112 (2016)
Mountantonakis, M., Tzitzikas, Y.: How linked data can aid machine learning-based tasks. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds.) TPDL 2017. LNCS, vol. 10450, pp. 155–168. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67008-9_13
Mountantonakis, M., Tzitzikas, Y.: LODsyndesis: global scale knowledge services. Heritage 1, 335–348 (2018)
Mountantonakis, M., Tzitzikas, Y.: Scalable methods for measuring the connectivity and quality of large numbers of linked datasets. J. Data Inf. Qual. (JDIQ) 9, 1–49 (2018)
Mountantonakis, M., Tzitzikas, Y.: Large scale semantic integration of linked data: a survey. ACM Comput. Surv. (CSUR) 52, 1–40 (2019)
Nebot, V., Berlanga, R.: Building data warehouses with semantic web data. Decis. Support Syst. 52, 853–868 (2012)
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: a not-so-foreign language for data processing. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (2008)
Papadaki, M.-E., Papadakos, P., Mountantonakis, M., Tzitzikas, Y.: An interactive 3D visualization for the LOD cloud. In: EDBT/ICDT Workshops (2018)
Spyratos, N.: A functional model for data analysis. In: Larsen, H.L., Pasi, G., Ortiz-Arroyo, D., Andreasen, T., Christiansen, H. (eds.) FQAS 2006. LNCS (LNAI), vol. 4027, pp. 51–64. Springer, Heidelberg (2006). https://doi.org/10.1007/11766254_5
Spyratos, N., Sugibuchi, T.: HIFUN - a high level functional query language for big data analytics. J. Intell. Inf. Syst. 51(3), 529–555 (2018). https://doi.org/10.1007/s10844-018-0495-6
Spyratos, N., Sugibuchi, T.: Data exploration in the HIFUN language. In: Cuzzocrea, A., Greco, S., Larsen, H.L., Saccà, D., Andreasen, T., Christiansen, H. (eds.) FQAS 2019. LNCS (LNAI), vol. 11529, pp. 176–187. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27629-4_18
Thusoo, A., et al.: Hive-a petabyte scale data warehouse using hadoop. In: 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) (2010)
Tzitzikas, Y., et al.: Integrating heterogeneous and distributed information about marine species through a top level ontology. In: Garoufallou, E., Greenberg, J. (eds.) MTSR 2013. CCIS, vol. 390, pp. 289–301. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-03437-9_29
Wang, K., Xu, G., Su, Z., Liu, Y.D.: GraphQ: graph query processing with abstraction refinement-scalable and programmable analytics over very large graphs on a single \(\{\)PC\(\}\). In: 2015 Annual Technical Conference 2015 (2015)
Zapilko, B., Mathiak, B.: Performing statistical methods on linked data. In: International Conference on Dublin Core and Metadata Applications (2011)
Zhao, P., Li, X., Xin, D., Han, J.: Graph cube: on warehousing and OLAP multidimensional networks. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Papadaki, ME., Tzitzikas, Y., Spyratos, N. (2020). Analytics over RDF Graphs. In: Flouris, G., Laurent, D., Plexousakis, D., Spyratos, N., Tanaka, Y. (eds) Information Search, Integration, and Personalization. ISIP 2019. Communications in Computer and Information Science, vol 1197. Springer, Cham. https://doi.org/10.1007/978-3-030-44900-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-44900-1_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-44899-8
Online ISBN: 978-3-030-44900-1
eBook Packages: Computer ScienceComputer Science (R0)