Abstract
Data with good quality has precedence when analyzing and using big data to deduce value from such tremendous volume of data in today’s business environments. Decisions and insights derived from poor data has a negative and unpredictable consequences to organizations. At present, due to the lack of comprehensive and intensive research in the field of data quality, especially large data, there is an urgent need to address this issue by researchers to reach the optimal way to estimate and evaluate the quality of large data. Thus, enabling institutions to make rational decisions based on evaluation outputs. In this paper, the current research on the quality of large data was reviewed and summarized by exploring the basic characteristics of large data. The main challenges facing the quality of information were also discussed in the context of large data. Some of the initiatives suggested by the researchers to evaluate the quality of the data have been highlighted. Finally, we believe that the results of these reviews will enhance the conceptual measurements of the large data quality and produce a concrete groundwork for the future by creating an integrated data quality assessment and evaluation models using the suitable algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Meng, X.F., Ci, X.: Big data management: concepts, techniques and challenges. J. Comput. Res Dev. 50(1), 146–169 (2013)
Gantz, J., Reinsel, D.: The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east (2012)
Douglas, L.: 3D data management: controlling data volume, velocity and variety. Gart. Retriev. 6(2001), 6 (2001)
Brown, B., Chui, M., Manyika, J.: Are you ready for the era of ‘big data’. McKinsey Q. 4(1), 24–35 (2011)
Wang, J., Li, H., Wang, Q.: Research on ISO 8000 series standards for data quality. Stand. Sci. 12, 44–46 (2010)
Eckerson, W.W.: Data quality and the bottom line: achieving business success through a commitment to high quality data. The Data Warehousing Institute, pp. 1–36 (2002)
Kaisler, S. et al.: Advanced analytics for big data. In: Encyclopedia of Information Science and Technology, 3rd edn. IGI Global, pp. 7584–7593 (2015)
Russom, P.: Big data analytics. TDWI Best Pract. Rep. Fourth Quart. 19(4), 1–34 (2011)
Chen, M. et al.: Survey on data quality. In 2012 World Congress on Information and Communication Technologies (WICT). IEEE (2012)
Laranjeiro, N., Soydemir, S.N. , Bernardino, J.: A survey on data quality: classifying poor data. In 2015 IEEE 21st Pacific Rim International Symposium on Dependable Computing (PRDC). IEEE (2015)
Firmani, D., et al.: On the meaningfulness of “big data quality”. Data Sci. Eng. 1(1), 6–20 (2016)
Soares, S.: Big Data Governance: An Emerging Imperative. Mc Press, London (2012)
Caballero, I., Serrano, M., Piattini, M.: A data quality in use model for big data. In: International Conference on Conceptual Modeling. Springer (2014)
Juddoo, S.: Overview of data quality challenges in the context of big data. In: 2015 International Conference on Computing, Communication and Security (ICCCS). IEEE (2015)
Pipino, L.L., Lee, Y.W., Wang, R.Y.: Data quality assessment. Commun. ACM 45(4), 211–218 (2002)
Batini, C., Scannapieco, M.: Data and Information Quality: Dimensions, Principles and Techniques. Springer, Cham (2016)
Cai, L., Zhu, Y.: The challenges of data quality and data quality assessment in the big data era. Data Sci. J. 14 (2015)
Catarci, T. et al.: My (fair) big data. In: 2017 IEEE International Conference on Big Data (Big Data). IEEE (2017)
Bertino, E.: Big data-opportunities and challenges. IEEE, pp. 479–480 (2013)
Taleb, I. et al.: Big data quality: a quality dimensions evaluation. In: Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld), 2016 International IEEE Conferences. IEEE (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Salih, F.I., Ismail, S.A., Hamed, M.M., Mohd Yusop, O., Azmi, A., Mohd Azmi, N.F. (2019). Data Quality Issues in Big Data: A Review. In: Saeed, F., Gazem, N., Mohammed, F., Busalim, A. (eds) Recent Trends in Data Science and Soft Computing. IRICT 2018. Advances in Intelligent Systems and Computing, vol 843. Springer, Cham. https://doi.org/10.1007/978-3-319-99007-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-99007-1_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99006-4
Online ISBN: 978-3-319-99007-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)