Advertisement

Accessing Data Integration Systems through Conceptual Schemas

  • Andrea Calì
  • Diego Calvanese
  • Giuseppe De Giacomo
  • Maurizio Lenzerini
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2224)

Abstract

Data integration systems provide access to a set of heterogeneous, autonomous data sources through a so-called global, or mediated view. There is a general consensus that the best way to describe the global view is through a conceptual data model, and that there are basically two approaches for designing a data integration system. In the global-as-view approach, one defines the concepts in the global schema as views over the sources, whereas in the local-as-view approach, one characterizes the sources as views over the global schema. It is well known that processing queries in the latter approach is similar to query answering with incomplete information, and, therefore, is a complex task. On the other hand, it is a common opinion that query processing is much easier in the former approach. In this paper we show the surprising result that, when the global schema is expressed in terms of a conceptual data model, even a very simple one, query processing becomes difficult in the global-as-view approach also. We demonstrate that the problem of incomplete information arises in this case too, and we illustrate some basic techniques for effectively answering queries posed to the global schema of the data integration system.

Keywords

Query Processing Data Integration Conceptual Schema Global Schema Semantic Condition 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    S. Abiteboul and O. Duschka. Complexity of answering queries using materialized views. In Proc. of PODS’98, pages 254–265, 1998.Google Scholar
  2. [2]
    S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison Wesley Publ. Co., Reading, Massachussetts, 1995.zbMATHGoogle Scholar
  3. [3]
    S. Bergamaschi, S. Castano, M. Vincini, and D. Beneventano. Intelligent techniques for the extraction and integration of heterogeneous information. In Proc. of the IJCAI’99 Workshop on Intelligent Information Integration, 1999.Google Scholar
  4. [4]
    M. Bouzeghoub and M. Lenzerini. Special issue on data extraction, cleaning, and reconciliation. Information Systems, 2001. To appear.Google Scholar
  5. [5]
    D. Calvanese, G. De Giacomo, M. Lenzerini, D. Nardi, and R. Rosati. Information integration: Conceptual modeling and reasoning support. In Proc. of CoopIS’98, pages 280–291, 1998.Google Scholar
  6. [6]
    D. Calvanese, G. De Giacomo, M. Lenzerini, D. Nardi, and R. Rosati. Data integration in data warehousing. Int. J. of Cooperative Information Systems, 2001. To appear.Google Scholar
  7. [7]
    D. Calvanese, G. De Giacomo, M. Lenzerini, and M. Y. Vardi. Answering regular path queries using views. In Proc. of ICDE 2000, pages 389–398, 2000.Google Scholar
  8. [8]
    D. Calvanese, G. De Giacomo, M. Lenzerini, and M. Y. Vardi. View-based query processing and constraint satisfaction. In Proc. of LICS 2000, pages 361–371, 2000.Google Scholar
  9. [9 ]
    M. J. Carey, L. M. Haas, P. M. Schwarz, M. Arya, W. F. Cody, R. Fagin, M. Flickner, A. Luniewski, W. Niblack, D. Petkovic, J. Thomas, J. H. Williams, and E. L. Wimmers. Towards heterogeneous multimedia information systems: The Garlic approach. In RIDE-DOM, pages 124–131, 1995.Google Scholar
  10. [10]
    P. P. Chen. The Entity-Relationship model: Toward a unified view of data. ACM Trans. on Database Systems, 1(1):9–36, Mar. 1976.CrossRefGoogle Scholar
  11. [11]
    G. De Giacomo. Intensional query answering by partial evaluation. J. of Intelligent Information Systems, 7(3):205–233, 1996.CrossRefGoogle Scholar
  12. [12]
    H. Galhardas, D. Florescu, D. Shasha, and E. Simon. An extensible framework for data cleaning. Technical Report 3742, INRIA, Rocquencourt, 1999.Google Scholar
  13. [13]
    G. Grahne and A. O. Mendelzon. Tableau techniques for querying information sources through global schemas. In Proc. of ICDT’99, volume 1540 of LNCS, pages 332–347. Springer-Verlag, 1999.Google Scholar
  14. [14]
    J. Gryz. Query folding with inclusion dependencies. In Proc. of ICDE’98, pages 126–133, 1998.Google Scholar
  15. [15]
    A. Y. Halevy. Theory of answering queries using views. SIGMOD Record, 29(4):40–47, 2000.CrossRefGoogle Scholar
  16. [16]
    R. Hull. Managing semantic heterogeneity in databases: A theoretical perspective. In Proc. of PODS’97, 1997.Google Scholar
  17. [17]
    A. Y. Levy. Logic-based techniques in data integration. In J. Minker, editor, Logic Based Artificial Intelligence. Kluwer Publishers, 2000.Google Scholar
  18. [18]
    C. Li and E. Chang. Query planning with limited source capabilities. In Proc. of ICDE 2000, pages 401–412, 2000.Google Scholar
  19. [19]
    C. Li, R. Yerneni, V. Vassalos, H. Garcia-Molina, Y. Papakonstantinou, J. D. Ullman, and M. Valiveti. Capability based mediation in TSIMMIS. In Proc. of ACM SIGMOD, pages 564–566, 1998.Google Scholar
  20. [20]
    J. W. Lloyd. Foundations of Logic Programming (Second, Extended Edition). Springer-Verlag, Berlin, Heidelberg, 1987.Google Scholar
  21. [21]
    B. Ludascher, A. Gupta, and M. E. Martone. Model-based mediation with domain maps. In Proc. of ICDE 2001, pages 81–90, 2001.Google Scholar
  22. [22]
    L. Palopoli, L. Pontieri, G. Terracina, and D. Ursino. Intensional and extensional integration and abstraction of heterogeneous databases. Data and Knowledge Engineering, 35(3):201–237, 2000.zbMATHCrossRefGoogle Scholar
  23. [23]
    X. Qian. Query folding. In Proc. of ICDE’96, pages 48–55, 1996.Google Scholar
  24. [24]
    J. D. Ullman. Information integration using logical views. In Proc. of ICDT’97, volume 1186 of LNCS, pages 19–40. Springer-Verlag, 1997.Google Scholar
  25. [25]
    R. van der Meyden. Logical approaches to incomplete information. In J. Chomicki and G. Saake, editors, Logics for Databases and Information Systems, pages 307–356. Kluwer Academic Publisher, 1998.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Andrea Calì
    • 1
  • Diego Calvanese
    • 1
  • Giuseppe De Giacomo
    • 1
  • Maurizio Lenzerini
    • 1
  1. 1.Dipartimento di Informatica e SistemisticaUniversità di Roma “La Sapienza”RomaItaly

Personalised recommendations