Abstract
With the digitalization of many industrial processes and the increasing interconnection of devices, the number of data sources and associated data sets is constantly increasing. Due to the heterogeneity of these large amounts of data sources, finding, accessing and understanding them is a major challenge for data consumers who want to work with the data. In order to make these data sources searchable and understandable, the paradigms of Ontology-Based Data Access (OBDA) or Ontology-Based Data Integration (OBDI) are used today. An important part of these paradigms is the creation of a mapping, such as a semantic model, between a previously defined ontology and the existing data sources. Although there are already many approaches that automate the creation of this mapping by using data-driven or data-structure-driven approaches, none of them focuses on the fact that the underlying ontology evolves over time. However, this is essential in today’s age of large amounts of data and ever-growing number of data sources.
In this paper, we propose an approach that allows the recommendation of semantic concepts for data attributes based on a constantly evolving knowledge graph. The approach allows the knowledge graph to learn data-driven representations for any concept that is available in the knowledge graph and that is already mapped to at least one data attribute. Instead of supporting a single method for recommending semantic concepts, we design the approach to be able to learn multiple data representatives per semantic concept, with each representative being trained on a different method, such as machine learning classifiers, rules, or statistical methods. In this way, for example, we are able to distinguish between different data types and data distributions. In order to evaluate our approach, we have trained it on several different publicly available data sets. In comparison to existing approaches, our evaluation shows that the accuracy of the recommendation improves through the use of our flexible and dedicated classification approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, J., Jiménez-Ruiz, E., Horrocks, I., Sutton, C.: ColNet: embeddingthe semantics of web tables for column type prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, pp. 29–36 (2019). https://doi.org/10.1609/aaai.v33i01.330129, https://aaai.org/ojs/index.php/AAAI/article/view/3765
Chen, J., Jiménez-Ruiz, E., Horrocks, I., Sutton, C.: Learning semantic annotations for tabular data. CoRR abs/1906.00781 (2019)
Craswell, N.: Mean reciprocal rank. In: Liu, L., Özsu, M.T. (eds.) Encyclopedia of Database Systems, p. 1703. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_488
Goel, A., Knoblock, C.A., Lerman, K.: Using conditional random fields to exploit token structure and labels for accurate semantic annotation. In: Proceedings of the 25th National Conference on Artificial Intelligence (AAAI 2011), San Francisco, CA (2011)
Goel, A., Knoblock, C.A., Lerman, K.: Exploiting structure within data for accurate labeling using conditional random fields. In: Proceedings of the 14th International Conference on Artificial Intelligence (ICAI) (2012)
Hulsebos, M., et al.: Sherlock. In: Teredesai, A., Kumar, V., Li, Y., Rosales, R., Terzi, E., Karypis, G. (eds.) Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining - KDD 2019, pp. 1500–1508. ACM Press, New York (2019). https://doi.org/10.1145/3292500.3330993
Li, X., Tu, Z., Jia, Q., Man, X., Wang, H., Zhang, X.: Deep-level quality management based on big data analytics with case study. In: Proceedings 2017 Chinese Automation Congress (CAC), Piscataway, NJ, pp. 4921–4926. IEEE (2017). https://doi.org/10.1109/CAC.2017.8243651
Paulus, A., Pomp, A., Poth, L., Lipp, J., Meisen, T.: Gathering and combining semantic concepts from multiple knowledge bases. In: Proceedings of the 20th International Conference on Enterprise Information Systems, pp. 69–80. SCITEPRESS - Science and Technology Publications (2018). https://doi.org/10.5220/0006800700690080
Pham, M., Alse, S., Knoblock, C.A., Szekely, P.: Semantic labeling: a domain-independent approach. In: Groth, P., et al. (eds.) ISWC 2016. LNCS, vol. 9981, pp. 446–462. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46523-4_27
Pinkel, C., Binnig, C., Kharlamov, E., Haase, P.: IncMap: pay as you go matching of relational schemata to OWL ontologies. In: OM, pp. 37–48 (2013)
Polfliet, S., Ichise, R.: Automated mapping generation for converting databases into linked data. In: Proceedings of the 2010 International Conference on Posters & Demonstrations Track, ISWC-PD 2010, Aachen, Germany, Germany, vol. 658. pp. 173–176. CEUR-WS.org (2010). http://dl.acm.org/citation.cfm?id=2878399.2878443
Pomp, A., Paulus, A., Jeschke, S., Meisen, T.: ESKAPE: information platform for enabling semantic data processing. In: Proceedings of the 19th International Conference on Enterprise Information. SCITEPRESS - Science and Technology Publications, April 2017
Pomp, A., Paulus, A., Klischies, D., Schwier, C., Meisen, T.: A web-based UI to enable semantic modeling for everyone. In: SEMANTiCS 2018–14th International Conference on Semantic Systems (2018)
Pomp, A., Paulus, A., Kirmse, A., Kraus, V., Meisen, T.: Applying semantics to reduce the time to analytics within complex heterogeneous infrastructures. Technologies 6(3), 86 (2018)
Pomp, A., Poth, L., Kraus, V., Meisen, T.: Enhancing knowledge graphs with data representatives. In: Proceedings of the 21st International Conference on Enterprise Information Systems, pp. 49–60. SCITEPRESS - Science and Technology Publications, 5 March–5 May 2019. https://doi.org/10.5220/0007677400490060
Ramnandan, S.K., Mittal, A., Knoblock, C.A., Szekely, P.: Assigning semantic labels to data sources. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 403–417. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18818-8_25
Rümmele, N., Tyshetskiy, Y., Collins, A.: Evaluating approaches for supervised semantic labeling. CoRR abs/1801.09788 (2018)
Syed, Z., Finin, T., Mulwad, V., Joshi, A.: Exploiting a web of semantic data for interpreting tables. In: Proceedings of the Second Web Science Conference, vol. 5 (2010)
Terrizzano, I.G., Schwarz, P.M., Roth, M., Colino, J.E.: Data wrangling: the challenging yourney from the wild to the Lake. In: CIDR (2015)
Wang, J., Wang, H., Wang, Z., Zhu, K.Q.: Understanding tables on the web. In: Atzeni, P., Cheung, D., Ram, S. (eds.) ER 2012. LNCS, vol. 7532, pp. 141–155. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34002-4_11
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Pomp, A., Kraus, V., Poth, L., Meisen, T. (2020). Semantic Concept Recommendation for Continuously Evolving Knowledge Graphs. In: Filipe, J., Śmiałek, M., Brodsky, A., Hammoudi, S. (eds) Enterprise Information Systems. ICEIS 2019. Lecture Notes in Business Information Processing, vol 378. Springer, Cham. https://doi.org/10.1007/978-3-030-40783-4_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-40783-4_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-40782-7
Online ISBN: 978-3-030-40783-4
eBook Packages: Computer ScienceComputer Science (R0)