Skip to main content

Data Quality Is Context Dependent

  • Conference paper
Enabling Real-Time Business Intelligence (BIRTE 2010)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 84))

Abstract

We motivate, formalize and investigate the notions of data quality assessment and data quality query answering as context dependent activities. Contexts for the assessment and usage of a data source at hand are modeled as collections of external databases, that can be materialized or virtual, and mappings within the collections and with the data source at hand. In this way, the context becomes “the complement” of the data source wrt a data integration system. The proposed model allows for natural extensions, like considering data quality predicates, and even more expressive ontologies for data quality assessment.

Research funded by the NSERC Strategic Network on BI (BIN, ADC05) and NSERC/IBM CRDPJ/371084-2008.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley, Reading (1995)

    MATH  Google Scholar 

  2. Ballou, D., Wang, R., Pazer, H., Tayi, G.: Modeling Information Manufacturing Systems to Determine Information Product Quality. Management Science 44(4), 462–484 (1998)

    Article  MATH  Google Scholar 

  3. Batini, C., Scannapieco, M.: Data Quality: Concepts, Methodologies and Techniques. Springer, Heidelberg (2006)

    MATH  Google Scholar 

  4. Bertossi, L., Bravo, L.: Consistent Query Answers in Virtual Data Integration Systems. In: Bertossi, L., Hunter, A., Schaub, T. (eds.) Inconsistency Tolerance. LNCS, vol. 3300, pp. 42–83. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  5. Bertossi, L., Bravo, L.: Query Answering in Peer-to-Peer Data Exchange Systems. In: Lindner, W., Fischer, F., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 476–485. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  6. Bertossi, L., Bravo, L.: The Semantics of Consistency and Trust in Peer Data Exchange Systems. In: Dershowitz, N., Voronkov, A. (eds.) LPAR 2007. LNCS (LNAI), vol. 4790, pp. 107–122. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  7. Bleiholder, J., Naumann, F.: Data Fusion. ACM Computing Surveys 41(1), 1–41 (2008)

    Article  Google Scholar 

  8. Bolchini, C., Curino, C., Orsi, G., Quintarelli, E., Rossato, R., Schreiber, F., Tanca, L.: And What Can Context Do for Data? Communications of the ACM 52(11), 136–140 (2009)

    Article  Google Scholar 

  9. Bolchini, C., Curino, C., Quintarelli, E., Schreiber, F., Tanca, L.: A Data-Oriented Survey of Context Models. SIGMOD Record 36(4), 19–26 (2007)

    Article  Google Scholar 

  10. Bolchini, C., Quintarelli, E., Rossato, R.: Relational Data Tailoring Through View Composition. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 149–164. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  11. Bravo, L., Bertossi, L.: Logic Programs for Consistently Querying Data Integration Systems. In: Proc. International Joint Conference on Artificial Intelligence (IJCAI 2003), pp. 10–15. Morgan Kaufmann, San Francisco (2003)

    Google Scholar 

  12. Brewka, G., Eiter, T.: Equilibria in Heterogeneous Nonmonotonic Multi-Context Systems. In: Proc. AAAI 2007, pp. 385–390 (2007)

    Google Scholar 

  13. De Giacomo, G., Lembo, D., Lenzerini, M., Rosati, R.: On Reconciling Data Exchange, Data Integration, and Peer Data Management. In: Proc. PODS 2007, pp. 133–142 (2007)

    Google Scholar 

  14. Duschka, O., Genesereth, M., Levy, A.: Recursive Query Plans for Data Integration. Journal of Logic Programming 43(1), 49–73 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  15. Giunchiglia, F., Serafini, L.: Multilanguage Hierarchical Logics. Artificial Intelligence 65, 29–70 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  16. Grahne, G., Mendelzon, A.O.: Tableau Techniques for Querying Information Sources through Global Schemas. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 332–347. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  17. Halevy, A.: Answering Queries Using Views: A Survey. VLDB Journal 10(4), 270–294 (2001)

    Article  MATH  Google Scholar 

  18. Homola, M., Serafini, L.: Towards Formal Comparison of Ontology Linking, Mapping and Importing. In: Proc. DL 2010. CEUR-WS 573, pp. 291–302 (2010)

    Google Scholar 

  19. Jiang, L., Borgida, A., Mylopoulos, J.: Towards a Compositional Semantic Account of Data Quality Attributes. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds.) ER 2008. LNCS, vol. 5231, pp. 55–68. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  20. Kivinen, J., Mannila, H.: Approximate Inference of Functional Dependencies from Relations. Theoretical Computer Science 149, 129–149 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  21. Kolaitis, P.: Schema Mappings, Data Exchange, and Metadata Management. In: Proc. PODS 2005, pp. 61–75 (2005)

    Google Scholar 

  22. Lenzerini, M.: Data Integration: A Theoretical Perspective. In: Proc. PODS 2002, pp. 233–246 (2002)

    Google Scholar 

  23. Maier, D., Ullman, J., Vardi, M.: On the Foundations of the Universal Relation Model. ACM Transactions on Database Systems 9(2), 283–308 (1984)

    Article  MathSciNet  MATH  Google Scholar 

  24. Naumann, F.: Quality-Driven Query Answering for Integrated Information Systems. Springer, Heidelberg (2002)

    Book  MATH  Google Scholar 

  25. Stanford Center for Biomedical Informatics Research. The Protégé knowledge-base framework (2010), http://protege.stanford.edu/

  26. Wang, R., Strong, D.: Beyond Accuracy: What Data Quality Means to Data Consumers. J. Management and Information Systems 12(4), 5–33 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bertossi, L., Rizzolo, F., Jiang, L. (2011). Data Quality Is Context Dependent. In: Castellanos, M., Dayal, U., Markl, V. (eds) Enabling Real-Time Business Intelligence. BIRTE 2010. Lecture Notes in Business Information Processing, vol 84. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22970-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22970-1_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22969-5

  • Online ISBN: 978-3-642-22970-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics