Advertisement

A Data Quality in Use Model for Big Data

(Position Paper)
  • Ismael Caballero
  • Manuel Serrano
  • Mario Piattini
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8823)

Abstract

Organizations are nowadays immersed in the Big Data Era. Beyond the hype of the concept of Big Data, it is true that something in the way of doing business is really changing. Although some challenges keep being the same as for regular data, with big data, the focus has changed. The reason is due to Big Data is not only data, but also a complete framework including data themselves, storage, formats, and ways of provisioning, processing and analytics. A challenge that becomes even trickier is the one concerning to the management of the quality of big data. More than ever the need for assessing the quality-in-use of big datasets gains importance since the real contribution – business value- of a dataset to a business can be only estimated in its context of use. Although there exists different data quality models to assess the quality of data there still lacks of a quality-in-use model adapted to big data. To fill this gap, and based on ISO 25012 and ISO 25024, we propose the 3Cs model, which is composed of three data quality dimensions for assessing the quality-in-use of big datasets: Contextual Consistency, Operational Consistency and Temporal Consistency.

Keywords

Big Data Data Quality Quality-in-use model 3Cs Model 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Loshin, D.: Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools, Techniques, NoSQL, and Graph. Elsevier, Walthman (2013)Google Scholar
  2. 2.
    Mantha, B.: Five Guiding Principles for Realizing the Promise of Big Data. Business Intelligence Journal 19, 8–11 (2014)Google Scholar
  3. 3.
    Kambatla, K., Kollias, G., Kumar, V., Grama, A.: Trends in big data analytics. Journal of Parallel and Distributed Computing - In Press - Corrected Proof (2014)Google Scholar
  4. 4.
    Redman, T.C.: Data’s Credibility Problem. Harvard Business Review 91, 84–88 (2013)Google Scholar
  5. 5.
    Quality in ProgressGoogle Scholar
  6. 6.
    McAfee, A., Brynjolfsson, E.: Big data: The management revolution. Harvard Business Review 90, 60–68 (2012)Google Scholar
  7. 7.
    CIO INSIGHTGoogle Scholar
  8. 8.
    Deutsch, T.: Putting big data myths to rest. IBM Data Management Magazine (2013)Google Scholar
  9. 9.
    Howles, T.: Data, Data Quality, and Ethical Use. Software Quality Professional 16, 4–12 (2014)Google Scholar
  10. 10.
    ISO: ISO/IEC 25010, Systems and software engineering - Systems and software Quality Requirements and Evaluation (SQuaRE) - System and software quality models. International Organization for Standardization, Ginebra, Suiza (2011) Google Scholar
  11. 11.
    ISO: ISO/IEC 25012:2008 - Software engineering. Software product quality requirements and evaluation (SQuaRE). Data quality model International Organization for Standarization (2009) Google Scholar
  12. 12.
    Greenberg, P.: Big Data, Big Deal (2012), www.destinationCRM.com
  13. 13.
  14. 14.
    Buyya, R., Yeo, C.S., Venugopal, S., Broberg, J., Brandic, I.: Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems 25, 599–616 (2009)CrossRefGoogle Scholar
  15. 15.
    Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. Commun. ACM 53, 50–58 (2010)CrossRefGoogle Scholar
  16. 16.
    Strong, D., Lee, Y., Wang, R.: Ten Potholes in the Road to Information Quality. IEEE Computer, 38–46 (1997)Google Scholar
  17. 17.
    Howard, P.: Market update- Data Quality - Market trends. Bloor (2013)Google Scholar
  18. 18.
    Lundquist, E.: Data Quality Is First Step Toward Reliable Data Analysis. p. 5. QuinStreet, Inc. (2013)Google Scholar
  19. 19.
    Becla, J., Wang, D.L., Lim, K.T.: Report from the 5th workshop on extremely large databases. Data Science Journal 11, 37–45 (2012)CrossRefGoogle Scholar
  20. 20.
    Kwon, O., Lee, N., Shin, B.: Data quality management, data usage experience and acquisition intention of big data analytics. International Journal of Information Management (2014)Google Scholar
  21. 21.
  22. 22.
    Lukoianova, T., Rubin, V.L.: Veracity roadmap: Is big data objective, truthful and credible? Advances in Classification Research Online 24 (2013)Google Scholar
  23. 23.
    ISO: ISO/IEC CD 25024 - Systems and software engineering – Systems and software Quality Requirements and Evaluation (SQuaRE) – Measurement of data quality Google Scholar
  24. 24.
    Lee, Y., Madnick, S., Wang, R., Wang, F., Hongyun, Z.: A Cubic Framework for the Chief Data Officer: Succeeding in a World of Big Data. MIS Quarterly Executive 13, 1–13 (2014)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Ismael Caballero
    • 1
  • Manuel Serrano
    • 1
  • Mario Piattini
    • 1
  1. 1.Paseo de la Universidad 4Ciudad RealSpain

Personalised recommendations