Abstract
Describing relational data sources (i.e. databases) by means of ontologies constitutes the foundation of most of the semantic based approaches to data access and integration. In spite of the importance of the task this is mostly carried out manually and, to the best of our knowledge, not much research has been devoted to its automatisation. In this paper we introduce an automatic procedure for building ontologies starting from the integrity constraints present in the relational sources.
Our work builds upon the wide literature on database schema reverse engineering; however, we adapt these techniques to the specific purpose of reusing the extracted schemata (or ontologies) in the context of semantic data access. In particular, we ensure that the underlying data sources can be queried through the ontologies and the extracted ontologies can be used for semantic integration using recently developed techniques in this area.
In order to represent the extracted ontology we adopt a variant of the DLR-Lite description logic because of its ability to express the mostly used modelling constraints, and its nice computational properties. The connection with the relational data sources is captured by means of sound views. Moreover, the adoption of this formal language enables us to prove that the extracted ontologies preserve the semantics of the integrity constraints in the relational sources. Therefore, there is no data loss, and the extracted ontology constitutes a faithful wrapper of the relational sources.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sheth, A.P., Larson, J.A.: Federated database systems for managing distributed, heterogeneous and autonomous databases. ACM Computing Surveys 22(3), 183–236 (1990)
Calvanese, D., Giacomo, G.D., Lenzerini, M., Nardi, D., Rosati, R.: Data integration in data warehousing 10(3), 237–271 (2001)
Lenzerini, M.: Data integration: A theoretical perspective. In: Proc. of PODS 2002, pp. 233–346 (2002)
Heflin, J., Hendler, J.: A portrait of the semantic web in action. IEEE Intelligent Systems 16(2), 54–59 (2001)
Wache, H., Vogele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., Hubner, S.: Ontology-based integration of information - a survey of existing approaches. In: Proc. of IJCAI 2001 Workshop: Ontologies and Information Sharing, pp. 108–117 (2001)
Calvanese, D., Giacomo, G.D., Lembo, D., Lenzerini, M., Rosati, R.: Tractable reasoning and efficient query answering in description logics: The dl-lite family. J. of Automated Reasoning 39(3), 385–429 (2007)
Chen, P.: The entity-relationship model: Toward a unified view of data. ACM Transactions on Database Systems (TODS) 1(1), 9–36 (1976)
Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Schneider, P.F. (eds.): The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press, Cambridge (2003)
Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley, Reading (1995)
Levesque, H.J., Lakemeyer, G.: The Logic of Knowledge Bases. MIT Press, Cambridge (2001)
Lubyte, L., Tessaris, S.: Extracting ontologies from relational databases. Technical report, KRDB group – Free University of Bozen-Bolzano (2007), http://www.inf.unibz.it/krdb/pub/TR/KRDB07-4.pdf
Berardi, D., Calvanese, D., De Giacomo, G.: Reasoning on uml class diagrams. Artificial Intelligence 168(1), 70–118 (2005)
Calvanese, D., De Giacomo, G., Lenzerini, M.: Identification constraints and functional dependencies in description logics. In: Proc. of the 17th Int. Joint Conf. on Artificial Intelligence (IJCAI 2001), pp. 155–160 (2001)
Lembo, D., Lutz, C., Suntisrivaraporn, B.: Tasks for ontology design and maintenance. Deliverable D05, TONES EU-IST STREP FP6-7603 (2006)
Elmasri, R., Navathe, S.B.: Fundamentals of Database Systems, 4th edn. Addison Wesley Publ. Co., Reading (2004)
Miller, R.J., Ioannidis, Y.E., Ramakrishnan, R.: The use of information capacity in schema integration and translation. In: Proc. of VLDB 1993, pp. 120–133. Morgan Kaufmann Publishers Inc., San Francisco (1993)
Fillottrani, P.R., Franconi, E., Tessaris, S.: The new icom ontology editor. In: Proc. of the 19th Int. Workshop on Description Logics, DL 2006 (2006)
Poggi, A., Lembo, D., Calvanese, D., Giacomo, G.D., Lenzerini, M., Rosati, R.: Linking data to ontologies. J. on Data Semantics X, 133–173 (2008)
Rodriguez-Muro, M., Lubyte, L., Calvanese, D.: Realizing ontology based data access: A plug-in for protégé. In: Proc. of the Workshop on Information Integration Methods, Architectures, and Systems (IIMAS 2008), pp. 286–289 (2008)
Calvanese, D., Giacomo, G.D., Horridge, M., et al.: Software tools for ontology interoperation. Deliverable D25, TONES EU-IST STREP FP6-7603 (2008)
Noy, N., Rector, A.: Defining n-ary relations on the semantic web. Technical report, W3C Recommendation (2006), http://www.w3.org/TR/swbp-n-aryRelations/
Hainaut, J.L.: Database reverse engineering: models, techniques and strategies. In: Proc. of the 10th Conference on ER Approach (1998)
Markowitz, V.M., Makowsky, J.A.: Identifying extended entity-relationship object structures in relational schemas. IEEE Transactions on Software Engineering 16(8), 777–790 (1990)
Chiang, R.H.L., Barron, T.M., Storey, V.C.: Reverse engineering of relational databases: extraction of an eer model from a relational database. Data and Knowledge Engineering 12(2), 107–142 (1994)
Andersson, M.: Extracting an entity-relationship schema from a relational database through reverse engineering. In: Loucopoulos, P. (ed.) ER 1994. LNCS, vol. 881, pp. 403–419. Springer, Heidelberg (1994)
Alhajj, R.: Extracting an extended entity-relationship model from a legacy relational database. Information Systems 26(6), 597–618 (2003)
Johannesson, P.: A method for transforming relational schemas into conceptual schemas. In: Proc. of the Int. Conf. on Data Engineering (ICDE 1994), pp. 190–201 (1994)
Astrova, I.: Reverse engineering of relational databases to ontologies. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 327–341. Springer, Heidelberg (2004)
Volz, R., Handschuh, S., Staab, S., Stojanovic, L., Stojanovic, N.: Unveiling the hidden bride: deep annotation for mapping and migrating legacy data to the semantic web. Web Semantics 2(1), 187–206 (2004)
Volz, R., Handschuh, S., Staab, S., Studer, R.: Ontolift demonstrator. Deliverable Del 12, WonderWeb IST-2001-33052 (2004)
Petrini, J., Risch, T.: Processing queries over RDF views of wrapped relational databases. In: Proc. of the 1st Int. Workshop on Wrapper Techniques for Legacy Systems, WRAP 2004 (2004)
Bizer, C.: D2R MAP - a database to RDF mapping language. In: Int. World Wide Web Conference, WWW 2003 (2003)
Barrasa, J., Corcho, O., Gomez-Perez, A.: An extensible and semantically based database-to-ontology mapping language. In: Bussler, C.J., Tannen, V., Fundulaki, I. (eds.) SWDB 2004. LNCS, vol. 3372. Springer, Heidelberg (2005)
de Laborda, C.P., Conrad, S.: Database to semantic web mapping using RDF query languages. In: Embley, D.W., Olivé, A., Ram, S. (eds.) ER 2006. LNCS, vol. 4215, pp. 241–254. Springer, Heidelberg (2006)
Prud’hommeaux, E., Seaborne, A.: SPARQL query language for RDF. Technical report, W3C Recommendation (2008), http://www.w3.org/TR/rdf-sparql-query/
An, Y., Borgida, A., Mylopoulos, J.: Inferring complex semantic mappings between relational tables and ontologies from simple correspondences. In: Int. Conf. on Ontologies, Databases and Applications of Semantics (ODBASE 2005), pp. 1152–1169 (2005)
Blaha, M.R., Premerlani, W.J.: Observed idiosyncracies of relational database designs. In: Proc. of the Working Conf. on Reverse Engineering (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lubyte, L., Tessaris, S. (2009). Automatic Extraction of Ontologies Wrapping Relational Data Sources. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2009. Lecture Notes in Computer Science, vol 5690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03573-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-03573-9_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03572-2
Online ISBN: 978-3-642-03573-9
eBook Packages: Computer ScienceComputer Science (R0)