Conclusion
Both of the models presented in this chapter define data quality by extending the relational model. The Polygen Model resolves the data source tagging and intermediate source tagging problems. It addresses issues in heterogeneous distributed database systems from the “where” perspective and thus enables us to interpret data from different sources more accurately. Furthermore, it follows the relational model by specifying the data structure and data manipulation components of the data model. However, it does not include the data integrity component. The Attribute-based Model, on the other hand, allows for the structure, storage, and processing of quality relations and quality indicator relations through a quality indicator algebra. In addition, it includes a description of its data structure, a set of data integrity constraints for the model, and a quality indicator algebra.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ballou, D., I. Chengalur-Smith and R. Y. Wang, A Sampling Procedure for Data Quality Auditing in the Relational Environment (No. TDQM-00-02). Massachusetts Institute of Technology, 2000.
Ballou, D.P. and H. L. Pazer, “Designing Information Systems to Optimize the Accuracy-Timeliness Tradeoff,” Information Systems Research, 6(1), 1995, pp.51–72.
Ballou, D.P., R. Y. Wang, H. Pazer and G. K. Tayi, “Modeling Information Manufacturing Systems to Determine Information Product Quality,” Management Science, 44(4), 1998, pp. 462–484.
Batini, C., M. Lenzirini and S. Navathe, “A comparative analysis of methodologies for database schema integration,” ACM Computing Survey, 18(4), 1986, pp. 323–364.
Codd, E. F., “A Relational Model of Data for Large Shared Data Banks,” Communications of the ACM, 13(6), 1970, pp. 377–387.
Codd, E. F., “Relational completeness of data base sublanguages,” in Data Base Systems, R. Rustin, Editor 1972, Prentice Hall, 1972.
Codd, E. F., “Extending the relational database model to capture more meaning,” ACM Transactions on Database Systems, 4(4), 1979, pp. 397–434.
Codd, E. F., “Relational database: A Practical Foundation for Productivity, the 1981 ACM Turing Award Lecture,” Communications of the ACM, 25(2), 1982, pp. 109–117.
Codd, E. F. “An evaluation scheme for database management systems that are claimed to be relational,” in Proceedings of the Second International Conference on Data Engineering. Los Angeles, CA: pp. 720–729, 1986.
Date, C. J. “The outer join,” in Proceedings of The 2cd International Conference on Databases. Cambridge, England: pp. 76–106, 1983.
Date, C. J., An Introduction to Database Systems. 5th ed. Addison-Wesley Systems Programming Series, Addison-Wesley, Reading, 1990.
DeMichiel, L. G. Performing operations over mismatched domains. In pp. 36–45. Los Angeles, CA: 1989.
Elmasri, R., J. Larson and S. Navathe, Schema integration algorithms for federated databases and logical database design (No. 1987)
Huang, K., Y. Lee and R. Wang, Quality Information and Knowledge. Prentice Hall, Upper Saddle River: N.J., 1999.
Kahn, B. K., D. M. Strong and R. Y. Wang. “Information Quality Benchmarks: Product and Service Performance,” Communications of the ACM, 1999.
Khoshafian, S. N. and G. P. Copeland, “Object Identity,” in The Morgan Kaufmann Series in Data Management Systems, S. B. Zdonik and D. Maier, Ed. 1990, Morgan Kaufmann, San Mateo, CA, 1990.
Klug, A., “Equivalence of relational algebra and relational calculus query languages having aggregate functions,” The Journal of ACM, 29, 1982, pp. 699–717.
Lee, T. and S. Bressen. “Multimodal Integration of Disparate Information Sources with Attribution,” in Proceedings of Entity Relationship Workshop on Information Retrieval and Conceptual Modeling, 1997.
Lee, T., S. Bressen and S. Madnick. “Source Attribution for Querying Against Semi-structured Documents,”. in Proceedings of Workshop on Web Information and Data Management, ACM Conference on Information and Knowledge Management, 1998.
Lee, T. and L. McKnight. “Internet Data Management: Policy Barriers to an Intermediated Electronic Market in Data,” in Proceedings of 27th Annual Telecommunications Policy Research Conference 1999.
Madnick, S. E., “Database in the Internet Age,” Database Programming and Design, 1997, pp. 28–33.
Reddy, M. P. and R. Y. Wang. “Estimating Data Accuracy in a Federated Database Environment,” in Proceedings of 6th International Conference, CISMOD (Also in Lecture Notes in Computer Science). Bombay, India: pp. 115–134, 1995.
Rob, P. and C. Coronel, Database Systems: Design, Implementation, and Management. 3rd ed. Course Technology, Boston, 1997.
Strong, D. M., Y. W. Lee and R. Y. Wang, “Data Quality in Context,” Communications of the ACM, 40(5), 1997, pp. 103–110.
Wang, R. Y., M. P. Reddy and H. B. Kon, “Toward quality data: An attribute-based approach,” Decision Support Systems (DSS), 13, 1995, pp. 349–372.
Wang, R. Y., V. C. Storey and C. P. Firth, “A Framework for Analysis of Data Quality Research,” IEEE Transactions on Knowledge and Data Engineering, 7(4), 1995, pp. 623–640.
Wang, R. Y. and D. M. Strong, “Beyond Accuracy: What Data Quality Means to Data Consumers,” Journal of Management Information Systems (MIS), 12(4), 1996, pp. 5–34.
Wang, Y. R. and S. E. Madnick, Connectivity among information systems. Vol. 1. Cornposite Information Systems (CIS) Project, MIT Sloan School of Management, Cambridge, MA, 1988.
Wang, Y. R. and S. E. Madnick, “Facilitating connectivity in composite information systems,” ACM Data Base, 20(3), 1989, pp. 38–46.
Wang, Y. R. and S. E. Madnick. “The inter-database instance identification problem in integrating autonomous systems,” in Proceedings of the Fifth International Conference on Data Engineering. Los Angeles, CA: pp. 46–55, 1989.
Wang, Y. R. and S. E. Madnick. “A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective,”. in Proceedings of the 16th International Conference on Very Large Data bases (VLDB). Brisbane, Australia: pp. 519–538, 1990.
Wang, Y. R. and S. E. Madnick. “A Source Tagging Theory for Heterogeneous Database Systems,” in Proceedings of International Conference on Information Systems. Copenhagen, Denmark: pp. 243–256, 1990.
Yuan, Y., The design and implementation of system P: A polygen database management system (No. CIS-90-07). MIT Sloan School of Management, Cambridge, MA 02139, 1990
Rights and permissions
Copyright information
© 2002 Kluwer Academic Publishers
About this chapter
Cite this chapter
(2002). Extending the Relational Model to Capture Data Quality Attributes. In: Data Quality. Advances in Database Systems, vol 23. Springer, Boston, MA. https://doi.org/10.1007/0-306-46987-1_2
Download citation
DOI: https://doi.org/10.1007/0-306-46987-1_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-7215-8
Online ISBN: 978-0-306-46987-9
eBook Packages: Springer Book Archive