Skip to main content

Extending the Relational Model to Capture Data Quality Attributes

  • Chapter
Data Quality

Part of the book series: Advances in Database Systems ((ADBS,volume 23))

  • 467 Accesses

Conclusion

Both of the models presented in this chapter define data quality by extending the relational model. The Polygen Model resolves the data source tagging and intermediate source tagging problems. It addresses issues in heterogeneous distributed database systems from the “where” perspective and thus enables us to interpret data from different sources more accurately. Furthermore, it follows the relational model by specifying the data structure and data manipulation components of the data model. However, it does not include the data integrity component. The Attribute-based Model, on the other hand, allows for the structure, storage, and processing of quality relations and quality indicator relations through a quality indicator algebra. In addition, it includes a description of its data structure, a set of data integrity constraints for the model, and a quality indicator algebra.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ballou, D., I. Chengalur-Smith and R. Y. Wang, A Sampling Procedure for Data Quality Auditing in the Relational Environment (No. TDQM-00-02). Massachusetts Institute of Technology, 2000.

    Google Scholar 

  2. Ballou, D.P. and H. L. Pazer, “Designing Information Systems to Optimize the Accuracy-Timeliness Tradeoff,” Information Systems Research, 6(1), 1995, pp.51–72.

    Google Scholar 

  3. Ballou, D.P., R. Y. Wang, H. Pazer and G. K. Tayi, “Modeling Information Manufacturing Systems to Determine Information Product Quality,” Management Science, 44(4), 1998, pp. 462–484.

    Article  MATH  Google Scholar 

  4. Batini, C., M. Lenzirini and S. Navathe, “A comparative analysis of methodologies for database schema integration,” ACM Computing Survey, 18(4), 1986, pp. 323–364.

    Google Scholar 

  5. Codd, E. F., “A Relational Model of Data for Large Shared Data Banks,” Communications of the ACM, 13(6), 1970, pp. 377–387.

    Article  MATH  Google Scholar 

  6. Codd, E. F., “Relational completeness of data base sublanguages,” in Data Base Systems, R. Rustin, Editor 1972, Prentice Hall, 1972.

    Google Scholar 

  7. Codd, E. F., “Extending the relational database model to capture more meaning,” ACM Transactions on Database Systems, 4(4), 1979, pp. 397–434.

    Article  Google Scholar 

  8. Codd, E. F., “Relational database: A Practical Foundation for Productivity, the 1981 ACM Turing Award Lecture,” Communications of the ACM, 25(2), 1982, pp. 109–117.

    Article  Google Scholar 

  9. Codd, E. F. “An evaluation scheme for database management systems that are claimed to be relational,” in Proceedings of the Second International Conference on Data Engineering. Los Angeles, CA: pp. 720–729, 1986.

    Google Scholar 

  10. Date, C. J. “The outer join,” in Proceedings of The 2cd International Conference on Databases. Cambridge, England: pp. 76–106, 1983.

    Google Scholar 

  11. Date, C. J., An Introduction to Database Systems. 5th ed. Addison-Wesley Systems Programming Series, Addison-Wesley, Reading, 1990.

    Google Scholar 

  12. DeMichiel, L. G. Performing operations over mismatched domains. In pp. 36–45. Los Angeles, CA: 1989.

    Google Scholar 

  13. Elmasri, R., J. Larson and S. Navathe, Schema integration algorithms for federated databases and logical database design (No. 1987)

    Google Scholar 

  14. Huang, K., Y. Lee and R. Wang, Quality Information and Knowledge. Prentice Hall, Upper Saddle River: N.J., 1999.

    Google Scholar 

  15. Kahn, B. K., D. M. Strong and R. Y. Wang. “Information Quality Benchmarks: Product and Service Performance,” Communications of the ACM, 1999.

    Google Scholar 

  16. Khoshafian, S. N. and G. P. Copeland, “Object Identity,” in The Morgan Kaufmann Series in Data Management Systems, S. B. Zdonik and D. Maier, Ed. 1990, Morgan Kaufmann, San Mateo, CA, 1990.

    Google Scholar 

  17. Klug, A., “Equivalence of relational algebra and relational calculus query languages having aggregate functions,” The Journal of ACM, 29, 1982, pp. 699–717.

    MATH  Google Scholar 

  18. Lee, T. and S. Bressen. “Multimodal Integration of Disparate Information Sources with Attribution,” in Proceedings of Entity Relationship Workshop on Information Retrieval and Conceptual Modeling, 1997.

    Google Scholar 

  19. Lee, T., S. Bressen and S. Madnick. “Source Attribution for Querying Against Semi-structured Documents,”. in Proceedings of Workshop on Web Information and Data Management, ACM Conference on Information and Knowledge Management, 1998.

    Google Scholar 

  20. Lee, T. and L. McKnight. “Internet Data Management: Policy Barriers to an Intermediated Electronic Market in Data,” in Proceedings of 27th Annual Telecommunications Policy Research Conference 1999.

    Google Scholar 

  21. Madnick, S. E., “Database in the Internet Age,” Database Programming and Design, 1997, pp. 28–33.

    Google Scholar 

  22. Reddy, M. P. and R. Y. Wang. “Estimating Data Accuracy in a Federated Database Environment,” in Proceedings of 6th International Conference, CISMOD (Also in Lecture Notes in Computer Science). Bombay, India: pp. 115–134, 1995.

    Google Scholar 

  23. Rob, P. and C. Coronel, Database Systems: Design, Implementation, and Management. 3rd ed. Course Technology, Boston, 1997.

    Google Scholar 

  24. Strong, D. M., Y. W. Lee and R. Y. Wang, “Data Quality in Context,” Communications of the ACM, 40(5), 1997, pp. 103–110.

    Article  Google Scholar 

  25. Wang, R. Y., M. P. Reddy and H. B. Kon, “Toward quality data: An attribute-based approach,” Decision Support Systems (DSS), 13, 1995, pp. 349–372.

    Google Scholar 

  26. Wang, R. Y., V. C. Storey and C. P. Firth, “A Framework for Analysis of Data Quality Research,” IEEE Transactions on Knowledge and Data Engineering, 7(4), 1995, pp. 623–640.

    Article  Google Scholar 

  27. Wang, R. Y. and D. M. Strong, “Beyond Accuracy: What Data Quality Means to Data Consumers,” Journal of Management Information Systems (MIS), 12(4), 1996, pp. 5–34.

    MATH  Google Scholar 

  28. Wang, Y. R. and S. E. Madnick, Connectivity among information systems. Vol. 1. Cornposite Information Systems (CIS) Project, MIT Sloan School of Management, Cambridge, MA, 1988.

    Google Scholar 

  29. Wang, Y. R. and S. E. Madnick, “Facilitating connectivity in composite information systems,” ACM Data Base, 20(3), 1989, pp. 38–46.

    Google Scholar 

  30. Wang, Y. R. and S. E. Madnick. “The inter-database instance identification problem in integrating autonomous systems,” in Proceedings of the Fifth International Conference on Data Engineering. Los Angeles, CA: pp. 46–55, 1989.

    Google Scholar 

  31. Wang, Y. R. and S. E. Madnick. “A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective,”. in Proceedings of the 16th International Conference on Very Large Data bases (VLDB). Brisbane, Australia: pp. 519–538, 1990.

    Google Scholar 

  32. Wang, Y. R. and S. E. Madnick. “A Source Tagging Theory for Heterogeneous Database Systems,” in Proceedings of International Conference on Information Systems. Copenhagen, Denmark: pp. 243–256, 1990.

    Google Scholar 

  33. Yuan, Y., The design and implementation of system P: A polygen database management system (No. CIS-90-07). MIT Sloan School of Management, Cambridge, MA 02139, 1990

    Google Scholar 

Download references

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Kluwer Academic Publishers

About this chapter

Cite this chapter

(2002). Extending the Relational Model to Capture Data Quality Attributes. In: Data Quality. Advances in Database Systems, vol 23. Springer, Boston, MA. https://doi.org/10.1007/0-306-46987-1_2

Download citation

  • DOI: https://doi.org/10.1007/0-306-46987-1_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-7923-7215-8

  • Online ISBN: 978-0-306-46987-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics