Skip to main content

Semantic Knowledge Discovery and Data-Driven Logical Reasoning from Heterogeneous Data Sources

  • Conference paper
  • First Online:
Uncertainty Reasoning for the Semantic Web III (URSW 2012, URSW 2011, URSW 2013)

Abstract

Available domain ontologies are increasing over the time. However there is still a huge amount of data stored and managed with RDBMS. This complementarity could be exploited both for discovering knowledge patterns that are not formalized within the ontology but that are learnable from the data, and for enhancing reasoning on ontologies by relying on the combination of formal domain models and the evidence coming from data. We propose a method for learning association rules from both ontologies and RDBMS in an integrated way. The extracted patterns can be used for enriching the available knowledge (in both format) and for refining existing ontologies. We also propose a method for automated reasoning on grounded knowledge bases (i.e. knowledge bases linked to RDBMS data) based on the standard Tableaux algorithm which combines logical reasoning and statistical inference thus making sense of the heterogeneous data sources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The concepts “male”, “parent”, “big city”, “medium-sized town”, and the relations “lives_in” are used in the ontology.

  2. 2.

    We call patterns like that in (1) a hybrid pattern since it is composed by elements (that is concept names, role names and attributes) coming from different data sources.

  3. 3.

    An example of random choice within Tableaux algorithm occurs when processing a concepts disjunction, i.e. \(C \sqcup D\), that is when it has to be decided whether an individual \(x\) belongs to concept the \(C\) or \(D\).

  4. 4.

    The restriction of \(g\) to the subsets of \(\mathbf {D}\) and \(\varSigma _I\) can be considered if a mapping for all objects does not exist.

  5. 5.

    The case of a negated concept within the disjunction is treated similarily.

  6. 6.

    If a most conservative behavior of the heuristic has to be considered, only the assertion concerning the disjunct \(C\) (resp. \(D\)) is added to \(\mathcal {I}_r\) while the additional items in the right hand side of the selected rule are not taken into account.

  7. 7.

    We are working for building an heterogeneous dataset for empirically evaluating the proposed approach.

  8. 8.

    Concept names of the considered ontology could be considered as well as new query concepts that are built starting (by using the constructors of the chosen DL language [2]) from the concept names that are formalized in the ontology.

References

  1. Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: Buneman, P., Jajodia, S. (eds.) Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp. 207–216. ACM Press (1993)

    Google Scholar 

  2. Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F. (eds.): The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, New York (2003)

    Google Scholar 

  3. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American, Singapore (2001)

    Google Scholar 

  4. Calvanese, D., De Giacomo, G., Lembo, D., Lenzerini, M., Poggi, A., Rodriguez-Muro, M., Rosati, R., Ruzzi, M., Savo, D.F.: The mastro system for ontology-based data access. Semant. Web J. 2(1), 43–53 (2011)

    Google Scholar 

  5. d’Amato, C.: Similarity-based learning methods for the semantic web. Ph.D. Thesis (2007)

    Google Scholar 

  6. d’Amato, C., Bryl, V., Serafini, L.: Data-driven logical reasoning. In: Bobillo, F., et al. (eds.) Proceedings of International Workshop on Uncertainty Reasoning for the Semantic Web (URSW 2012). CEUR-Workshop Proceedings, pp. 51–62. CEUR (2012)

    Google Scholar 

  7. d’Amato, C., Bryl, V., Serafini, L.: Semantic knowledge discovery from heterogeneous data sources. In: ten Teije, A., Völker, J., Handschuh, S., Stuckenschmidt, H., d’Acquin, M., Nikolov, A., Aussenac-Gilles, N., Hernandez, N. (eds.) EKAW 2012. LNCS, vol. 7603, pp. 26–31. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  8. d’Amato, C., Fanizzi, N., Esposito, F.: Query answering and ontology population: an inductive approach. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 288–302. Springer, Heidelberg (2008)

    Google Scholar 

  9. Donini, F.M., Lenzerini, M., Nardi, D., Schaerf, A.: \({\cal AL}\)-log: integrating datalog and description logics. J. Intell. Inf. Syst. 10, 227–252 (1998)

    Article  Google Scholar 

  10. Galárraga, L., Teflioudi, C., Suchanek, F., Hose, K.: Amie: association rule mining under incomplete evidence in ontological knowledge bases. In: Proceedings of the 20th International World Wide Web Conference (WWW 2013). ACM (2013)

    Google Scholar 

  11. Grimm, S., Motik, B., Preist, C.: Variance in e-business service discovery. In: Proceedings of the ISWC Workshop on Semantic Web Services (2004)

    Google Scholar 

  12. Hand, D.J., Smyth, P., Mannila, H.: Principles of Data Mining. MIT Press, Cambridge (2001)

    Google Scholar 

  13. Józefowska, J., Lawrynowicz, A., Lukaszewski, T.: The role of semantics in mining frequent patterns from knowledge bases in description logics with rules. Theory Pract. Logic Program. 10(3), 251–289 (2010)

    Article  MATH  Google Scholar 

  14. Lisi, F.A.: Al-quin: an onto-relational learning system for semantic web mining. Int. J. Semant. Web Inf. Syst. 7, 1–22 (2011)

    Article  Google Scholar 

  15. Rettinger, A., Lösch, U., Tresp, V., d’Amato, C., Fanizzi, N.: Mining the semantic web: statistical learning for next generation knowledge bases. Data Min. Knowl. Discov. 24(3), 613–662 (2012)

    Article  MATH  MathSciNet  Google Scholar 

  16. Spaccapietra, S., Parent, C.: View integration: a step forward in solving structural conflicts. IEEE Trans. Knowl. Data Eng. 6(2), 258–274 (1994)

    Article  Google Scholar 

  17. Völker, J., Niepert, M.: Statistical schema induction. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part I. LNCS, vol. 6643, pp. 124–138. Springer, Heidelberg (2011)

    Google Scholar 

  18. Witten, I., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufman, San Francisco (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Volha Bryl .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

d’Amato, C., Bryl, V., Serafini, L. (2014). Semantic Knowledge Discovery and Data-Driven Logical Reasoning from Heterogeneous Data Sources. In: Bobillo, F., et al. Uncertainty Reasoning for the Semantic Web III. URSW URSW URSW 2012 2011 2013. Lecture Notes in Computer Science(), vol 8816. Springer, Cham. https://doi.org/10.1007/978-3-319-13413-0_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13413-0_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13412-3

  • Online ISBN: 978-3-319-13413-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics