Learning Commonalities in RDF

  • Sara El Hassad
  • François GoasdouéEmail author
  • Hélène Jaudoin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10249)


Finding the commonalities between descriptions of data or knowledge is a foundational reasoning problem of Machine Learning introduced in the 70’s, which amounts to computing a least general generalization (\(\mathtt {lgg}\)) of such descriptions. It has also started receiving consideration in Knowledge Representation from the 90’s, and recently in the Semantic Web field. We revisit this problem in the popular Resource Description Framework (RDF) of W3C, where descriptions are RDF graphs, i.e., a mix of data and knowledge. Notably, and in contrast to the literature, our solution to this problem holds for the entire RDF standard, i.e., we do not restrict RDF graphs in any way (neither their structure nor their semantics based on RDF entailment, i.e., inference) and, further, our algorithms can compute \(\mathtt {lgg}\)s of small-to-huge RDF graphs.


RDF RDFS RDF entailment Least general generalization 


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
    Baader, F., Sertkaya, B., Turhan, A.Y.: Computing the least common subsumer w.r.t. a background terminology. J. Appl. Logic 5(3), 392–420 (2007)MathSciNetCrossRefGoogle Scholar
  10. 10.
    Baget, J., Croitoru, M., Gutierrez, A., Leclère, M., Mugnier, M.: Translations between RDF(S) and conceptual graphs. In: ICCS (2010)Google Scholar
  11. 11.
    Chein, M., Mugnier, M.: Graph-Based Knowledge Representation - Computational Foundations of Conceptual Graphs. Springer, London (2009)zbMATHGoogle Scholar
  12. 12.
    Cohen, W.W., Borgida, A., Hirsh, H.: Computing least common subsumers in description logics. In: AAAI (1992)Google Scholar
  13. 13.
    Colucci, S., Donini, F., Giannini, S., Sciascio, E.D.: Defining and computing least common subsumers in RDF. J. Web Semant. 39, 62–80 (2016)CrossRefGoogle Scholar
  14. 14.
    Colucci, S., Donini, F.M., Sciascio, E.D.: Common subsumbers in RDF. In: AI*IA (2013)CrossRefGoogle Scholar
  15. 15.
    Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: OSDI (2004)Google Scholar
  16. 16.
    El Hassad, S., Goasdoué, F., Jaudoin, H.: Learning commonalities in RDF and SPARQL (research report) (2016).
  17. 17.
    Garcia-Molina, H., Ullman, J.D., Widom, J.: Database Systems - The Complete Book. Pearson Education, Harlow (2009)Google Scholar
  18. 18.
    Goasdoué, F., Kaoudi, Z., Manolescu, I., Quiané-Ruiz, J., Zampetakis, S.: Cliquesquare: flat plans for massively parallel RDF queries. In: ICDE (2015)Google Scholar
  19. 19.
    Küsters, R.: Non-standard Inferences in Description Logics. LNCS, vol. 2100. Springer, Heidelberg (2001)zbMATHGoogle Scholar
  20. 20.
    Lehmann, J., Bühmann, L.: AutoSPARQL: let users query your knowledge base. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6643, pp. 63–79. Springer, Heidelberg (2011). doi: 10.1007/978-3-642-21034-1_5CrossRefGoogle Scholar
  21. 21.
    Meier, M.: Towards rule-based minimization of RDF graphs under constraints. In: Calvanese, D., Lausen, G. (eds.) RR 2008. LNCS, vol. 5341, pp. 89–103. Springer, Heidelberg (2008). doi: 10.1007/978-3-540-88737-9_8CrossRefGoogle Scholar
  22. 22.
    Neumann, T., Weikum, G.: The RDF-3X engine for scalable management of RDF data. VLDB J. 19(1), 91–113 (2010)CrossRefGoogle Scholar
  23. 23.
    Papailiou, N., Tsoumakos, D., Konstantinou, I., Karras, P., Koziris, N.: H\({}_{\text{2}}\)rdf+: an efficient data management system for big RDF graphs. In: SIGMOD (2014)Google Scholar
  24. 24.
    Pichler, R., Polleres, A., Skritek, S., Woltran, S.: Complexity of redundancy detection on RDF graphs in the presence of rules, constraints, and queries. Semant. Web 4(4), 351–393 (2013)Google Scholar
  25. 25.
    Plotkin, G.D.: A note on inductive generalization. Mach. Intell. 5, 153–163 (1970)MathSciNetzbMATHGoogle Scholar
  26. 26.
    Plotkin, G.D.: A further note on inductive generalization. Mach. Intell. 6, 101–124 (1971)MathSciNetzbMATHGoogle Scholar
  27. 27.
    Ramakrishnan, R., Gehrke, J.: Database Management Systems. McGraw-Hill, New York (2003)zbMATHGoogle Scholar
  28. 28.
    Robinson, J.A.: A machine-oriented logic based on the resolution principle. J. ACM 12(1), 23–41 (1965)MathSciNetCrossRefGoogle Scholar
  29. 29.
    Robinson, J.A., Voronkov, A. (eds.): Handbook of Automated Reasoning. Elsevier and MIT Press, Weidenbach (2001)zbMATHGoogle Scholar
  30. 30.
    Urbani, J., Kotoulas, S., Maassen, J., van Harmelen, F., Bal, H.E.: WebPIE: a web-scale parallel inference engine using MapReduce. J. Web Semant. 10, 59–75 (2012)CrossRefGoogle Scholar
  31. 31.
    Resource description framework 1.1.
  32. 32.
  33. 33.
    Zarrieß, B., Turhan, A.: Most specific generalizations w.r.t. general EL-TBoxes. In: IJCAI (2013)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Sara El Hassad
    • 1
  • François Goasdoué
    • 1
    Email author
  • Hélène Jaudoin
    • 1
  1. 1.IRISA, Univ. Rennes 1LannionFrance

Personalised recommendations