Abstract
We motivate, formalize, and investigate the notions of data quality assessment and data quality query answering as context-dependent activities. Contexts for the assessment and usage of a data source at hand are modeled as collections of external databases, which can be materialized or virtual, together with mappings within the collections and between the collections and the data source at hand. In this way, the context becomes “the complement” of the data source with respect to a data integration system. The proposed model allows for natural extensions, such as data quality predicates and even more expressive ontologies for data quality assessment.
Research funded by the NSERC Strategic Network on BI (BIN, ADC05) and NSERC/IBM CRDPJ/371084-2008.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bertossi, L., Rizzolo, F., Jiang, L. (2011). Data Quality Is Context Dependent. In: Castellanos, M., Dayal, U., Markl, V. (eds) Enabling Real-Time Business Intelligence. BIRTE 2010. Lecture Notes in Business Information Processing, vol 84. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22970-1_5
DOI: https://doi.org/10.1007/978-3-642-22970-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22969-5
Online ISBN: 978-3-642-22970-1
eBook Packages: Computer Science (R0)