Abstract
XML document may contain inconsistencies that violate predefined integrity constraints, and there are two basic concepts for this problem: Repair is the data consistent with the integrity constraints, and also minimally differs from the original one. Consistent data is the data common for every possible repair. In this paper, first we give a general constraint model for XML, which can express functional dependencies, keys and multivalued dependencies. Next we provide a repair framework for inconsistent XML document with three basic update operations: node insertion, node deletion and value modification. Following this approach, we introduce the concept of repair for inconsistent XML document, discuss the chase process to generate repairs, and prove some important properties of the chase process. Finally we give a method to obtain the greatest lower bound of all possible repairs, which is sufficient for consistent data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abiteboul, S., Segoufin, L., Vianu, V.: Representing and Querying XML with Incomplete Information. In: PODS, pp. 35–47 (2001)
Arenas, M., Bertossi, L.E., Chomick, J.: Consistent Query Answers in Inconsistent Databases. In: PODS, pp. 68–79 (1999)
Arenas, M., Libkin, L.: A Normal Form for XML Documents. TODS 29(1), 195–232 (2004)
Arenas, M., Libkin, L.: XML Data Exchange: Consistency and Query Answering. In: PODS, pp. 13–24 (2005)
Bohannon, P., Fan, W.F., Flaster, M., Rastogi, R.: A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification. In: SIGMOD, pp. 143–154 (2005)
Buneman, P., Davidson, S., Fan, W., Hara, C., Tan, W.: Reasoning about Keys for XML. In: Database Programming Languages, pp. 133–148 (2002)
Bravo, L., Bertossi, L.: Logic programs for consistently querying data integration systems. In: IJCAI, pp. 10–15 (2003)
Chomicki, J., Marcinkowski, J.: Minimal-Change Integrity Maintenance Using Tuple Deletions. Information and Computation 197(1-2), 90–121 (2005)
Greco, G., Greco, S., Zumpano, E.: A logical framework for querying and repairing inconsistent databases. IEEE Transaction on Knowledge and Data Engineering 15(6), 1389–1408 (2003)
Ng, W.: Repairing Inconsistent Merged XML Data. In: Mařík, V., Štěpánková, O., Retschitzegger, W. (eds.) DEXA 2003. LNCS, vol. 2736, pp. 244–255. Springer, Heidelberg (2003)
Vincent, M.W., Liu, J.: Multivalued Dependencies and a 4NF for XML. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, pp. 14–29. Springer, Heidelberg (2003)
Wijsen, J.: Database Repairing Using Updates. TODS 30(3), 722–768 (2005)
Extensible Markup Language (XML) 1.0, 2nd edn. W3C Recommendation (October 2000), http://www.w3.org/TR/REC-xml
XML Schema Part 1: Structures. W3C Recommendation (May 2001), http://www.w3.org/TR/xmlschema-1/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tan, Z., Wang, W., Xu, J., Shi, B. (2006). Repairing Inconsistent XML Documents. In: Lang, J., Lin, F., Wang, J. (eds) Knowledge Science, Engineering and Management. KSEM 2006. Lecture Notes in Computer Science(), vol 4092. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811220_32
Download citation
DOI: https://doi.org/10.1007/11811220_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37033-8
Online ISBN: 978-3-540-37035-2
eBook Packages: Computer ScienceComputer Science (R0)