Abstract
While providing a uniform syntax and a semistructured data model, XML does not express semantics but only structure such as nesting information. In this paper, we consider the problem of data integration and interoperation of heterogeneous XML sources and use an ontology-based framework to address this problem at a semantic level. Ontologies are extensively used for domain knowledge representation, by virtue of their conceptualization of the domain, which carries explicit semantics. In our approach, the global ontology is expressed in RDF Schema (RDFS) and constructed using the global-as-view approach by merging individual local ontologies, which represent XML source schemas. We provide a formal model for the mappings between XML schemas and local RDFS ontologies and those between local ontologies and the global RDFS ontology. We consider two cases of query processing, specifically for data integration and for data interoperation. In the first case, the user poses an RDF query on the global ontology, which is answered using all the mapped XML sources. In the second case, a query is posed on a single source and then is mapped to the XML sources that are connected to that source. For each case, we discuss the problem of query containment and present an equivalent query rewriting algorithm for queries expressed in two languages: conjunctive RDQL and conjunctive XQuery.
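As a rough illustration of the data integration case described above, the following minimal sketch unfolds a conjunctive query over global ontology properties into paths on the mapped XML sources. It is not the authors' rewriting algorithm: the mapping table, the property names (`Book.title`, `Book.author`), the source names (`S1`, `S2`), the XML paths, and the function `unfold` are all hypothetical and chosen only to make the idea concrete.

```python
from typing import Dict, List

# Hypothetical mapping table: global RDFS property -> {source name: XML path}.
# Property names, source names, and paths are invented for illustration.
GLOBAL_TO_SOURCES: Dict[str, Dict[str, str]] = {
    "Book.title": {"S1": "/books/book/title", "S2": "/authors/author/book/@title"},
    "Book.author": {"S1": "/books/book/author", "S2": "/authors/author/name"},
}


def unfold(query_properties: List[str]) -> Dict[str, List[str]]:
    """Rewrite a conjunctive query over global properties into per-source paths.

    A source is kept only if it has a mapping for every property in the query,
    because a conjunctive query needs all of its atoms to be answerable there.
    """
    # Collect every source that maps at least one queried property.
    sources = set()
    for prop in query_properties:
        sources.update(GLOBAL_TO_SOURCES.get(prop, {}))

    rewritten: Dict[str, List[str]] = {}
    for source in sorted(sources):
        paths = [GLOBAL_TO_SOURCES.get(prop, {}).get(source) for prop in query_properties]
        # Drop sources that cannot answer one of the conjuncts.
        if all(path is not None for path in paths):
            rewritten[source] = paths
    return rewritten
```

Calling `unfold(["Book.title", "Book.author"])` would return one list of paths per source that covers both properties. A real system would also have to handle joins, structural constraints, and containment checks, none of which this sketch attempts.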
A preliminary version of this paper was presented at the 8th International Database Engineering & Applications Symposium (Isabel F. Cruz, Huiyong Xiao, Feihong Hsu: An Ontology-Based Framework for XML Semantic Integration. IDEAS 2004: 217-226). This research was partially supported by the National Science Foundation under Awards ITR IIS-0326284 and IIS-0513553.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xiao, H., Cruz, I.F. (2006). Integrating and Exchanging XML Data Using Ontologies. In: Spaccapietra, S., Aberer, K., Cudré-Mauroux, P. (eds) Journal on Data Semantics VI. Lecture Notes in Computer Science, vol 4090. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11803034_4
DOI: https://doi.org/10.1007/11803034_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36712-3
Online ISBN: 978-3-540-36871-7
eBook Packages: Computer Science, Computer Science (R0)