Advertisement

XML Schemata Inference and Evolution

  • Ismael Sanz
  • Juan Manuel Pérez
  • Rafael Berlanga
  • María José Aramburu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2736)

Abstract

This work addresses the automatic generation of conceptual models for XML-oriented databases, which in many cases have little or no support for schemata. Our techniques are based on both an incremental clustering algorithm, which groups together the incoming XML documents according to their structural similarities, and a schema inference method, which maintains dynamically the schema of each detected document cluster. Our proposal takes into consideration the schema evolution. For this purpose, we have adapted the Toodor document model that describes the temporal properties of the XML document types.

Keywords

Schema Inference Document Clustering XML Databases 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Aramburu, M.J., Berlanga, R.: A temporal object-oriented model for digital libraries of documents. Concurrency: Practice and Experience 13(11) (2001)Google Scholar
  2. 2.
    Chamberlin, D., Robie, J., Florescu, D.: Quilt: An XML query language for heterogeneous data sources. In: Suciu, D., Vossen, G. (eds.) WebDB 2000. LNCS, vol. 1997, pp. 53–62. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  3. 3.
    Cluet, S., Veltri, P., Vodislav, D.: Views in a large scale XML repository. In: VLDB 2001, pp. 271–280 (2001)Google Scholar
  4. 4.
    Hélide.: The G Web Applications Platform (2002), http://www.helide.com
  5. 5.
    Mena, E., Illarramendi, A., Kashyap, V., Sheth, A.P.: OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distributed and Parallel Databases 8(2), 223–271 (2000)CrossRefGoogle Scholar
  6. 6.
    W3C Consortium. XML schema (2002), http://www.w3.org/XML/Schema
  7. 7.
    W3C Consortium. XQuery 1.0: An XML Query Language (2002), http://www.w3.org/xquery
  8. 8.
    Zhang, K., Shasha, D.: Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal of Computing 18(6), 1245–1262 (1989)zbMATHCrossRefMathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Ismael Sanz
    • 1
  • Juan Manuel Pérez
    • 1
  • Rafael Berlanga
    • 1
  • María José Aramburu
    • 1
  1. 1.Departament de Llenguatges i Sistemes InformàticsDepartament de Ingenieria y Ciencia de los Computadores, Universitat Jaume ICastellónSpain

Personalised recommendations