Encyclopedia of Database Systems

2018 Edition
Editors: Ling Liu, M. Tamer Özsu

Constraint-Driven Database Repair

  • Wenfei Fan
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_599

Synonyms

Data reconciliation; Data standardization; Minimal-change integrity maintenance

Definition

Given a set Σ of integrity constraints and a database instance D of a schema R, the problem of constraint-driven database repair is to find an instance D′ of the same schema R such that (i) D′ is consistent, i.e., D′ satisfies Σ, and (ii) D′ minimally differs from the original database D, i.e., it takes a minimal number of repair operations or incurs minimal cost to obtain D′ by updating D.
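As an illustration only, the two conditions of the definition can be sketched in Python for one narrow setting. The sketch assumes that Σ consists of a single functional dependency (zip → city), that the only repair operation is tuple deletion, and that the cost is the number of deleted tuples. The relation, the attribute names, and the helpers `satisfies_fd` and `minimal_deletion_repairs` are hypothetical and are not taken from the entry. The search is brute force, so it is suitable only for tiny instances.

```python
from itertools import combinations

# Hypothetical constraint set Σ: one functional dependency zip -> city.
LHS, RHS = "zip", "city"


def satisfies_fd(instance):
    """Return True if no two tuples agree on LHS but differ on RHS."""
    seen = {}
    for row in instance:
        # setdefault records the first RHS value seen for this LHS value
        # and returns the recorded value on later encounters.
        if seen.setdefault(row[LHS], row[RHS]) != row[RHS]:
            return False
    return True


def minimal_deletion_repairs(instance):
    """Try deleting 0, 1, 2, ... tuples; return the smallest deletion
    count that yields a consistent instance, with all instances found
    at that count. Exponential in the instance size."""
    n = len(instance)
    for k in range(n + 1):
        repairs = []
        for drop in combinations(range(n), k):
            candidate = [row for i, row in enumerate(instance) if i not in drop]
            if satisfies_fd(candidate):
                repairs.append(candidate)
        if repairs:
            return k, repairs
    # Not reached: deleting every tuple leaves the empty instance,
    # which trivially satisfies the dependency.
    return n, [[]]


# Hypothetical dirty instance D: the third tuple conflicts with the first two.
D = [
    {"name": "Ann", "zip": "EH8", "city": "Edinburgh"},
    {"name": "Bob", "zip": "EH8", "city": "Edinburgh"},
    {"name": "Cat", "zip": "EH8", "city": "London"},
    {"name": "Dan", "zip": "NW1", "city": "London"},
]

cost, repairs = minimal_deletion_repairs(D)
print(cost)
print([row["name"] for row in repairs[0]])
```

In this toy instance, deleting the single conflicting tuple restores consistency, so the repaired instance (written D′ in the definition) differs from D by one operation. Other repair models, such as value modification under a cost function, change what "minimal" means and are not covered by this sketch.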

Historical Background

Real-life data is often dirty, i.e., inconsistent, inaccurate, stale, or deliberately falsified. While the prevalent use of the Web has made it possible, on an unprecedented scale, to extract and integrate data from diverse sources, it has also increased the risks of creating and propagating dirty data. Dirty data routinely leads to misleading or biased analytical results and decisions and incurs loss of revenue, credibility, and customers. With this comes the need for...


Recommended Reading

  1. Arenas M, Bertossi LE, Chomicki J. Consistent query answers in inconsistent databases. In: Proceedings of the 18th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems; 1999. p. 68–79.
  2. Bohannon P, Fan W, Flaster M, Rastogi R. A cost-based model and effective heuristic for repairing constraints by value modification. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2005. p. 143–54.
  3. Bravo L, Fan W, Ma S. Extending dependencies with conditions. In: Proceedings of the 33rd International Conference on Very Large Data Bases; 2007. p. 243–54.
  4. Calì A, Lembo D, Rosati R. On the decidability and complexity of query answering over inconsistent and incomplete databases. In: Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems; 2003. p. 260–71.
  5. Cao Y, Fan W, Yu W. Determining the relative accuracy of attributes. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2013. p. 565–76.
  6. Chomicki J. Consistent query answering: five easy pieces. In: Proceedings of the 11th International Conference on Database Theory; 2007. p. 1–17.
  7. Chomicki J, Marcinkowski J. Minimal-change integrity maintenance using tuple deletions. Inf Comput. 2005;197(1–2):90–121.
  8. Chomicki J, Marcinkowski J. On the computational complexity of minimal-change integrity maintenance in relational databases. Inconsistency Tolerance. 2005. Lecture Notes in Computer Science 3300:119–150.
  9. Cong G, Fan W, Geerts F, Jia X, Ma S. Improving data quality: consistency and accuracy. In: Proceedings of the 33rd International Conference on Very Large Data Bases; 2007. p. 315–26.
  10. Fan W, Geerts F. Foundations of data quality management. Synthesis lectures on data management. Morgan & Claypool Publishers; 2012.
  11. Fan W, Geerts F, Jia X, Kementsietsidis A. Conditional functional dependencies for capturing data inconsistencies. ACM Trans Database Syst. 2008;33(2):1–48.
  12. Fan W, Li J, Ma S, Tang N, Yu W. Interaction between record matching and data repairing. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2011. p. 469–80.
  13. Fan W, Li J, Ma S, Tang N, Yu W. Towards certain fixes with editing rules and master data. VLDB J. 2012;21(2):213–38.
  14. Fellegi I, Holt D. A systematic approach to automatic edit and imputation. J Am Stat Assoc. 1976;71(353):17–35.
  15. Lopatenko A, Bertossi LE. Complexity of consistent query answering in databases under cardinality-based and incremental repair semantics. In: Proceedings of the 11th International Conference on Database Theory; 2007. p. 179–93.
  16. Wijsen J. Database repairing using updates. ACM Trans Database Syst. 2005;30(3):722–68.

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. University of Edinburgh, Edinburgh, UK
  2. Beihang University, Beijing, China