Skip to main content

Integrating and Exchanging XML Data Using Ontologies

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((JODS,volume 4090))

Abstract

While providing a uniform syntax and a semistructured data model, XML does not express semantics but only structure such as nesting information. In this paper, we consider the problem of data integration and interoperation of heterogeneous XML sources and use an ontology-based framework to address this problem at a semantic level. Ontologies are extensively used for domain knowledge representation, by virtue of their conceptualization of the domain, which carries explicit semantics. In our approach, the global ontology is expressed in RDF Schema (RDFS) and constructed using the global-as-view approach by merging individual local ontologies, which represent XML source schemas. We provide a formal model for the mappings between XML schemas and local RDFS ontologies and those between local ontologies and the global RDFS ontology. We consider two cases of query processing, specifically for data integration and for data interoperation. In the first case, the user poses an RDF query on the global ontology, which is answered using all the mapped XML sources. In the second case, a query is posed on a single source and then is mapped to the XML sources that are connected to that source. For each case, we discuss the problem of query containment and present an equivalent query rewriting algorithm for queries expressed in two languages: conjunctive RDQL and conjunctive XQuery.

A preliminary version of this paper was presented at the 8th International Database Engineering & Applications Symposium (Isabel F. Cruz, Huiyong Xiao, Feihong Hsu: An Ontology-Based Framework for XML Semantic Integration. IDEAS 2004: 217-226). This research was partially supported by the National Science Foundation under Awards ITR IIS-0326284 and IIS-0513553.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Duschka, O.M.: Complexity of Answering Queries Using Materialized Views. In: Proceedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 1998), pp. 254–263 (1998)

    Google Scholar 

  2. Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley, Reading (1995)

    MATH  Google Scholar 

  3. Amann, B., Beeri, C., Fundulaki, I., Scholl, M.: Ontology-Based Integration of XML Web Resources. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 117–131. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  4. Amann, B., Fundulaki, I., Scholl, M., Beeri, C., Vercoustre, A.-M.: Mapping XML Fragments to Community Web Ontologies. In: Proceedings of the 4th International Workshop on the Web and Databases (WebDB 2001), pp. 97–102 (2001)

    Google Scholar 

  5. Arens, Y., Knoblock, C.A., Hsu, C.: Query Processing in the SIMS Information Mediator. AAAI Press, Menlo Park (1996)

    Google Scholar 

  6. Bishr, Y.A.: Overcoming the semantic and other barriers to GIS interoperability. International Journal of Geographical Information Science 12(4), 229–314 (1998)

    Article  Google Scholar 

  7. Boag, S., Chamberlin, D., Fernández, M.F., Daniela Florescu, J.R., Siméon, J.: XQuery 1.0: An XML Query Language, W3C Working Draft (April 2005), http://www.w3.org/TR/xquery

  8. Bourret, R.: XML and Databases (December 2004), http://www.rpbourret.com/xml/XMLAndDatabases.htm

  9. Brickley, D., Guha, R.: RDF Vocabulary Description Language 1.0: RDF Schema. In: W3C Working Draft (February 2004), http://www.w3.org/TR/rdf-schema

  10. Calì, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: On the Expressive Power of Data Integration Systems. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds.) ER 2002. LNCS, vol. 2503, pp. 338–350. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Calì, A., Calvanese, D., Giacomo, G.D., Lenzerini, M., Naggar, P., Vernacotola, F.: IBIS: Semantic Data Integration at Work. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, pp. 79–94. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  12. Calvanese, D., Giacomo, G.D., Lenzerini, M., Vardi, M.Y.: View-Based Query Processing and Constraint Satisfaction. In: The 15th Annual IEEE Symposium on Logic in Computer Science (LICS 2000), pp. 361–371 (2000)

    Google Scholar 

  13. Calvanese, D., Giacomo, G.D., Lenzerini, M., Vardi, M.Y.: View-based Query Containment. In: Proceedings of the 22rd ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2003), pp. 56–67 (2003)

    Google Scholar 

  14. dos Santos Mello, R., Heuser, C.A., Camillo, S.D.: Querying Heterogeneous XML Sources through a Conceptual Schema. In: Song, I.-Y., Liddle, S.W., Ling, T.-W., Scheuermann, P. (eds.) ER 2003. LNCS, vol. 2813, pp. 186–199. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  15. Chen, Y., Revesz, P.: CXQuery: A Novel XML Query Language. In: Proceedings of International Conference on Advances in Infrastructure for Electronic Business, Science, and Medicine on the Internet (SSGRR 2002w) (2002)

    Google Scholar 

  16. Cruz, I.F., Xiao, H.: Using a Layered Approach for Interoperability on the Semantic Web. In: Proceedings of the 4th International Conference on Web Information Systems Engineering (WISE 2003), Rome, Italy, pp. 221–232 (December 2003)

    Google Scholar 

  17. Gruber, T.R.: A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition 5(2), 199–220 (1993)

    Article  Google Scholar 

  18. Halevy, A.Y.: Answering Queries Using Views: A Survey. VLDB Jounal 10(4), 270–294 (2001)

    Article  MATH  Google Scholar 

  19. Halevy, A.Y., Ives, Z.G., Mork, P., Tatarinov, I.: Piazza: Data Management Infrastructure for Semantic Web Applications. In: Proceedings of the 12th International World Wide Web Conference (WWW 2003), pp. 556–567 (2003)

    Google Scholar 

  20. HP Labs. RDQL - RDF Data Query Language (2005), http://www.hpl.hp.com/semweb/rdql.htm

  21. Kalfoglou, Y., Schorlemmer, M.: Ontology Mapping: the State of the Art. The Knowledge Engineering Review 18(1), 1–31 (2003)

    Article  Google Scholar 

  22. Klein, M.C.A.: Interpreting XML Documents via an RDF Schema Ontology. In: Hameurlain, A., Cicchetti, R., Traunmüller, R. (eds.) DEXA 2002. LNCS, vol. 2453, pp. 889–894. Springer, Heidelberg (2002)

    Google Scholar 

  23. Lakshmanan, L.V.S., Sadri, F.: Interoperability on XML Data. In: Proceedings of the 2nd International Semantic Web Conference (ICSW 2003), pp. 146–163 (2003)

    Google Scholar 

  24. Lehti, P., Fankhauser, P.: XML Data Integration with OWL: Experiences and Challenges. In: 2004 Symposium on Applications and the Internet (SAINT 2004), pp. 160–170 (2004)

    Google Scholar 

  25. Lenzerini, M.: Data Integration: A Theoretical Perspective. In: Proceedings of the 21st ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2002), Madison, Wisconsin, June 2002, pp. 233–246. ACM, New York (2002)

    Chapter  Google Scholar 

  26. Manola, F., Miller, E.: RDF Primer. W3C Working Draft (February 2004), http://www.w3.org/TR/rdf-primer

  27. Mena, E., Kashyap, V., Sheth, A.P., Illarramendi, A.: OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies. In: Proceedings of the 1st IFCIS International Conference on Cooperative Information Systems (CoopIS 1996), pp. 14–25 (1996)

    Google Scholar 

  28. Millstein, T.D., Halevy, A.Y., Friedman, M.: Query Containment for Data Integration Systems. Journal of Computer and System Sciences 66(1), 20–39 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  29. Noy, N.F., Musen, M.A.: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment. In: Proceedings of the 17th National Conference on Artificial Intelligence and 12th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAI 2000), pp. 450–455 (2000)

    Google Scholar 

  30. Papakonstantinou, Y., Garcia-Molina, H., Widom, J.: Object Exchange Across Heterogeneous Information Sources. In: Proceedings of the 11th International Conference on Data Engineering (ICDE 1995), pp. 251–260 (1995)

    Google Scholar 

  31. Patel-Schneider, P.F., Siméon, J.: The Yin/Yang Web: XML Syntax and RDF Semantics. In: Proceedings of the 11th International World Wide Web Conference (WWW 2002), pp. 443–453 (July 2002)

    Google Scholar 

  32. Popa, L., Velegrakis, Y., Miller, R.J., Hernández, M.A., Fagin, R.: Translating Web Data. In: Bressan, S., Chaudhri, A.B., Li Lee, M., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, pp. 598–609. Springer, Heidelberg (2003)

    Google Scholar 

  33. Sahin, O.D., Gupta, A., Agrawal, D., Abbadi, A.E.: Query Processing Over Peer-To-Peer Data Sharing Systems. Technical Report CSD-2002-28, University of California at Santa Barbara (2002)

    Google Scholar 

  34. Shklar, L.A., Sheth, A.P., Kashyap, V., Shah, K.: InfoHarness: Use of Automatically Generated Metadata for Search and Retrieval of Heterogeneous Information. In: Iivari, J., Rossi, M., Lyytinen, K. (eds.) CAiSE 1995. LNCS, vol. 932, pp. 217–230. Springer, Heidelberg (1995)

    Google Scholar 

  35. Stumme, G., Maedche, A.: Ontology Merging for Federated Ontologies for the Semantic Web. In: Proceedings of the International Workshop on Foundations of Models for Information Integration (FMII 2001), pp. 16–18 (2001)

    Google Scholar 

  36. Ullman, J.D.: Information Integration Using Logical Views. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 19–40. Springer, Heidelberg (1996)

    Google Scholar 

  37. van der Meyden, R.: Logical Approaches to Incomplete Information: A Survey. In: Logics for Databases and Information Systems, pp. 307–356 (1998)

    Google Scholar 

  38. Xiao, H., Cruz, I.F., Hsu, F.: Semantic Mappings for the Integration of XML and RDF Sources. In: Proceedings of the VLDB Workshop on Information Integration on the Web (VLDB-IIWeb 2004) (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xiao, H., Cruz, I.F. (2006). Integrating and Exchanging XML Data Using Ontologies. In: Spaccapietra, S., Aberer, K., Cudré-Mauroux, P. (eds) Journal on Data Semantics VI. Lecture Notes in Computer Science, vol 4090. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11803034_4

Download citation

  • DOI: https://doi.org/10.1007/11803034_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-36712-3

  • Online ISBN: 978-3-540-36871-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics