Skip to main content

Towards Big Data Quality Framework for Malaysia’s Public Sector Open Data Initiative

  • Conference paper
  • First Online:
Advances in Visual Informatics (IVIC 2017)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10645))

Included in the following conference series:

Abstract

This paper is about the conceptual development of the Big Data Quality Framework for Malaysia’s Public Sector Open Data Initiative (My-PSODI). At the moment, there is a lack of Big Data Quality Framework in existence particularly that is focusing on the specific context and needs of Malaysia’s Public Sector Open Data initiative. Most of existing data quality frameworks are catering the needs of traditional data types (i.e., structured data) and are very generic in nature. Due to the explosion of big data which consists mostly of unstructured data and structured data, and Malaysia’s vision of leveraging data in modernizing its service delivery, a new framework addressing the needs of Big Data for Malaysia is needed. Based on an extensive literature review, we develop a conceptual framework and systematic methodologies of how to construct the said framework to its fruition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Open Data Barometer (2017). http://opendatabarometer.org/

  2. MAMPU Analitis Data Raya Sektor Awam (DRSA): Strategi, Cabaran dan Halatuju (2013). http://www.mainpp.gov.my/index.php/nota-kursus-latihan/category/3-it?download=10:drsa-penang-anis-suhailis-mampu-latest

  3. MAMPU. Garis Panduan Analitis Data Raya Sektor Awam – Program Kesedaran Dasar dan Garis Panduan ICT Sektor Awam (2016). http://www.mampu.gov.my/ms/penerbitan-mampu/send/89-program-kesedaran-dasar-dan-garis-panduan-ict-sektor-awam/215-7-taklimat-7-gp-drsa

  4. Laranjeiro, N., Soydemir, S.N., Bernardino, J.: A survey on data quality: classifying poor data. In: IEEE 21st Pacific Rim International Symposium on Dependable Computing, 18–20 November, Zhangjiajie, China (2015)

    Google Scholar 

  5. Khoury, M.J., Ioannidis, J.P.A.: Big data meets public health. Science 346(6213), 1054–1055 (2014). doi:10.1126/science.aaa2709

    Article  Google Scholar 

  6. Gartner, Big Data Definition (2012). http://www.gartner.com/it-glossary/big-data/

  7. Abdel Hafez, H.A.: Mining big data in telecommunications industry: challenges, techniques, and revenue opportunity. Int. J. Comput. Electr. Autom. Control Inf. Eng. 10(1), 183–190 (2016)

    Google Scholar 

  8. Cai, L., Zhu, Y.: The challenges of data quality and data quality assessment in the big data era. Data Sci. J. 14(2), 1–10 (2015)

    Google Scholar 

  9. Saha, B., Srivastava, D.: Data quality: the other face of big data. In: IEEE 30th International Conference on Data Engineering (ICDE), 31 March–4 April, Chicago, IL (2014)

    Google Scholar 

  10. Chen, M., Song, M., Han, J., Haihong, E.: Survey on data quality. In: 2012 World Congress on Information and Communication Technologies (WICT), 30 October–2 November, Trivandrum, India (2012)

    Google Scholar 

  11. NIST: NIST Big Data Interoperability Framework, vol. 1, Definitions (2015). http://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.1500-1.pdf

  12. Lucas, A.: Corporate data quality management: from theory to practice. In: 5th Iberian Conference on Information Systems and Technologies (CISTI), 16–19 June, Santiago, Spain (2010)

    Google Scholar 

  13. Abdullah, N., Ismail, S.A., Sophiayati, S., Mohd Sam, S.: Data quality in big data: a review. Int. J. Adv. Soft Comput. Appl. 7(3), 16–27 (2015)

    Google Scholar 

  14. Fan, W., Geerts, F.: Foundations of data management. Synth. Lect. Data Manag. 4(5), 1–217 (2012). Morgan & Claypool

    Google Scholar 

  15. General Administration of Quality Supervision, Inspection and Quarantine of the People’s Republic of China. Quality management systems-fundamentals and vocabulary (GB/T19000—2008/ISO9000:2005), Beijing (2008)

    Google Scholar 

  16. Wang, R.Y., Strong, D.M.: Beyond accuracy: what data quality means to data consumers. J. Manag. Inf. Syst. 12(4), 5–33 (1996)

    Article  Google Scholar 

  17. Crosby, P.B.: Quality is Free: The Art of Making Quality Certain. McGraw-Hill, New York (1988)

    Google Scholar 

  18. Juran, J.M.: Juran on Leadership for Quality: An Executive Handbook. The Free Press, New York (1989)

    Google Scholar 

  19. Alexander, J.E., Tate, M.A.: Web Wisdom: How to Evaluate and Create Information on the Web. Erlbaum, Mahwah (1999)

    Google Scholar 

  20. Shanks, G., Corbitt, B.: Understanding data quality: social and cultural aspects. In: Proceedings of the 10th Australasian Conference on Information Systems, pp. 785–797. MCB University Press Ltd., Wellington (1999)

    Google Scholar 

  21. Zhu, X., Gauch, S.: Incorporating quality metrics in centralised/distributed information retrieval on the world wide web. In: SIGIR 2000 Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 24–28 July, Athens, Greece (2000)

    Google Scholar 

  22. Batini, C., Scannapeico, M.: Data Quality: Concepts, Methodologies and Techniques. Springer, Berlin (2006)

    MATH  Google Scholar 

  23. Krogstie, J.: Capturing enterprise data integration challenges using a semiotic data quality framework. Bus. Inf. Syst. Eng. 57(1), 27–36 (2015)

    Article  Google Scholar 

  24. Taleb, I., Dssouli, R., Serhani, M.A.: Big data pre-processing: a quality framework. In: 4th IEEE International Congress on Big Data, Santa Clara, CA 29 October–1 November (2015)

    Google Scholar 

  25. Juddoo, S.: Overview of data quality challenges in the context of big data. In: International Conference on Computing, Communication and Security (ICCCS), Mauritius, 4–5 December (2015)

    Google Scholar 

  26. Batini, C., Rula, A., Scannapieco, M., Viscusi, G.: From Data Quality to Big Data Quality, Big Data Concepts, Methodologies, Tools, and Applications, pp. 1934–1956. IGI Global, Hershey (2016)

    Book  Google Scholar 

  27. Ijab, M.T., Ahmad, A., Abdul Kadir, R.: Challenge of data quality: towards a big data quality framework. In: IMPACT: Technologies for Society’s Well-Being, Universiti Kebangsaan Malaysia (UKM), p. 44 (2016)

    Google Scholar 

  28. Economic Planning Unit - EPU Malaysia, 11th Malaysia Plan (2017). http://rmk11.epu.gov.my/index.php/en/muat-turun-dokumen

  29. Gurin, J.: Big Data and Open Data: What’s What and Why Does It Matter? The Guardian (2014). https://www.theguardian.com/public-leaders-network/2014/apr/15/big-data-open-data-transform-government

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamad Taha Ijab .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ijab, M.T., Ahmad, A., Kadir, R.A., Hamid, S. (2017). Towards Big Data Quality Framework for Malaysia’s Public Sector Open Data Initiative. In: Badioze Zaman, H., et al. Advances in Visual Informatics. IVIC 2017. Lecture Notes in Computer Science(), vol 10645. Springer, Cham. https://doi.org/10.1007/978-3-319-70010-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70010-6_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70009-0

  • Online ISBN: 978-3-319-70010-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics