Introduction
With the increasing availability of Big Data and their attendant analytics, the importance of data quality management has increased. Poor data quality represents one of the greatest hurdles to effective data analytics, computational linguistics, machine learning, and artificial intelligence. If the data are inaccurate, incomprehensible, or unusable, it does not matter how sophisticated our algorithms and paradigms are, or how intelligent our “machines.”
J. M. Juran provides a definition of data quality that is applicable to current Big Data environments: “Data are of high quality if they are fit for their intended use in operations, decision making, and planning” (Juran and Godfrey 1999, p. 34.9). In this context, quality means that Big Data are relevant to their intended uses and are of sufficient detail and quantity, with a high degree of accuracy and completeness, of known provenance, consistent with their metadata, and presented in appropriate ways.
Big Data provide...
Further Readings
Acock, A. C. (2005). Working with missing values. Journal of Marriage and Family, 67, 1012–1028.
Allison, P. A. (2002). Missing data. Thousand Oaks: Sage Publications.
Juran, J. M., & Godfrey, A. B. (1999). Juran’s quality handbook (Fifth ed.). New York: McGraw-Hill.
Labouseur, A. G., & Matheus, C. (2017). An introduction to dynamic data quality challenges. ACM Journal of Data and Information Quality, 8(2), 1–3.
Little, R. J. A., & Rubin, D. B. (1997). Statistical analysis with missing data. New York: Wiley.
Pipino, L. L. Y. W. L., & Wang, R. Y. (2002). Data quality assessment. Communications of the ACM, 45(4), 211–218.
Saunders, J. A., Morrow-Howell, N., Spitznagel, E., Dore, P., Proctor, E. K., & Pescarino, R. (2006). Imputing missing data: A comparison of methods for social workers. Social Work Research, 30(1), 19–30.
Strong, D. M., Lee, Y. W., & Wang, R. Y. (1997). Data quality in context. Communications of the ACM, 40(5), 103–110.
Truong, H.-L., Murguzur, A., & Yang, E. (2018). Challenges in enabling quality of analytics in the cloud. Journal of Data and Information Quality, 9(2), 1–4.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this entry
Cite this entry
Kuiler, E.W. (2019). Data Quality Management. In: Schintler, L., McNeely, C. (eds) Encyclopedia of Big Data. Springer, Cham. https://doi.org/10.1007/978-3-319-32001-4_317-1
Download citation
DOI: https://doi.org/10.1007/978-3-319-32001-4_317-1
Received:
Accepted:
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32001-4
Online ISBN: 978-3-319-32001-4
eBook Packages: Springer Reference Business and ManagementReference Module Humanities and Social SciencesReference Module Business, Economics and Social Sciences