Skip to main content

Towards Development of National Health Data Warehouse for Knowledge Discovery

  • Conference paper
  • First Online:
Intelligent Systems Technologies and Applications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 385))

Abstract

Availability of accurate data on time is essential for medical decision making. Healthcare organizations own a large amount of data in various systems. Researchers, health care providers and patients will not be able to utilize the knowledge in different stores unless integration of the information from disparate sources is completed. Developing health data warehouse is a complex process and also consumes a significant amount of time but it is essential to deliver quality health services. In this paper the architecture of a data warehouse model and the development process suitable for integrating data from different healthcare sources have been presented. We have developed a Star schema suitable for large data warehouse. Integrating health data requires a rigorous preprocessing and we have completed the preprocessing of national health data by applying efficient transformation techniques. Finally the knowledge discovery potentials from the data warehouse are also presented with relevant examples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Roddick, J.F., Fule, P., Graco, W.J.: Exploratory medical knowledge discovery: experiences and issues. SIGKDD Explor. Newsletter 5(1), 94–99 (2003)

    Article  Google Scholar 

  2. Cios, K.: Uniqueness of medical data mining. Artificial intelligence in medicine. 26, 1–24 (2002)

    Article  Google Scholar 

  3. Fayyad, U.M., Shapiro, G.P., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. Advances in Knowledge Discovery and Data Mining, 1–36 (1996)

    Google Scholar 

  4. Khosla, R., Dillon, T.: Knowledge discovery, data mining and hybrid systems. In: Engineering Intelligent Hybrid Multi-Agent Systems, pp. 143–177. Kluwer Academic Publishers (1997)

    Google Scholar 

  5. Inmon, W.H.: EIS and the data warehouse: a simple approach to building an effective foundation for EIS. Database Programming and Design 5(11), 70–73 (1992)

    Google Scholar 

  6. Stolba, N., Banek, M., Tjoa, A.M.: The security issue of federated data warehouses in the area of evidence-based medicine. In: First International Conference on Availability, Reliability and Security, ARES 2006. IEEE (2006)

    Google Scholar 

  7. Sahama, T.R., Croll, P.R.: A data warehouse architecture for clinical data warehousing. In: Australasian Workshop on Health Knowledge Management and Discovery, HKMD 2007 (2007)

    Google Scholar 

  8. Lyman, J.A., Scully, K., Harrison, J.H.: The development of health care data warehouses to support data mining. Clin. Lab. Med. 28(1), 55–71 (2008)

    Article  Google Scholar 

  9. Nugawela, S.: Data Warehousing Model For Integrating Fragmented Electronic Health Records From Disparate And Heterogeneous Clinical Data Stores, M.Sc. Thesis, Queensland University of Technology (2013)

    Google Scholar 

  10. Inmon, W.: Building the Data Warehouse, 4th edn., Wiley, New York (2005)

    Google Scholar 

  11. Jiawei, H., Micheline, K., Jian, P.: Data Mining Concepts and Techniques, 3rd edn., Elsevier (2012)

    Google Scholar 

  12. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd edn., Wiley (2013)

    Google Scholar 

  13. Mullins, M., Siadaty, M.S., Lyman, J., et al.: Data mining and clinical data repositories: Insights from a 667,000 patient data set. Comput. Biol. Med. 36, 1351–1377 (2006)

    Article  Google Scholar 

  14. Zhu, X., Khoshgoftaar, T., Davidson, I., Zhang, S.: Special issue on mining low-quality data. Knowledge and Information Systems 11, 131–136 (2007)

    Article  Google Scholar 

  15. Brown, M.L., Kros, J.F.: Data mining and the impact of missing data. Industrial Management & Data Systems 103, 611–621 (2003)

    Article  Google Scholar 

  16. Lavrač, N.: Selected techniques for data mining in medicine. Artificial intelligence in medicine 16(1), 3–23 (1999)

    Article  Google Scholar 

  17. Lee, I.N., Liao, S.C., Embrechts, M.: Data mining techniques applied to medical information. Medical Informatics & the Internet in Medicine 25(2), 81–102 (2000)

    Google Scholar 

  18. Obenshain, M.K.: Application of Data Mining Techniques to Healthcare Data. Infection Control and Hospital Epidemiology 25(8), 690–695 (2004)

    Article  Google Scholar 

  19. Wang, H., Wang, S.: Medical knowledge acquisition through data mining. In: IEEE International Symposium ITME (2008)

    Google Scholar 

  20. Faisal, S.: Missing Data in Pathology Databases. MSc Thesis, Australian National University (2011)

    Google Scholar 

  21. Partington, S.N., Papakroni, V., Menzies, T.: Optimizing data collection for public health decisions: a data mining approach. BMC Public Health 14, 593–598 (2014)

    Article  Google Scholar 

  22. Cubillas, J.J., Ramos, M.I., Feito, F.R., Ureña, T.: An improvement in the appointment scheduling in primary health care centers using data mining. J. Med. Syst., 38, 89 (2014)

    Google Scholar 

  23. Hoque, A.S.M.L., Galib, S., Tasnim, M.: Mining pathological data to support medical diagnostics. In: Workshop on Advances on Data Management: Applications and Algorithms. Department of Computer Science and Engineering, BUET, Dhaka, pp. 71–74 (2013)

    Google Scholar 

  24. Kumari, S., Singh, A.: A data mining approach for the diagnosis of diabetes mellitus. In: IEEE 7th International Conference on Intelligent Systems and Control (2013)

    Google Scholar 

  25. Yilmaz, N., Inan, O., Uzer, M.S.: A New Data Preparation Method Based on Clustering Algorithms for Diagnosis Systems of Heart and Diabetes Diseases. J. Med. Syst. 38 (2013)

    Google Scholar 

  26. Herland, M., Khoshgoftaar, T.M., Wald, R.: A review of data mining using big data in health informatics. J. Big Data 1, 2 (2014)

    Google Scholar 

  27. Khan, S.I., Hoque, A.S.M.L.: Towards development of health data warehouse: bangladesh perspective. In: Proc. 2nd International Conference on Electrical Engineering and Information & Communication Technology (2015)

    Google Scholar 

  28. HEALTH BULLETIN, 2nd edn., DGHS, Ministry of Health and Family Welfare, Government of the People’s Republic of Bangladesh (2014)

    Google Scholar 

  29. http://www.dghs.gov.bd/index.php/en/health-program-progress/hpnsdp-2011-16/84-english-root/ehealth-eservice/497-hpnsdp-2011-16-brief (Accessed February 20, 2015)

  30. http://www.bpcdoa.com/clinics_and_diagnostics.html (Accessed February 22, 2015)

  31. http://www.thefinancialexpress-bd.com/2014/12/15/71077/print (Accessed February 22, 2015)

  32. Liang, Z., Sherif, S., Anna, L., Athman, B.: Cloud Data Management. Springer, Switzerland (2014)

    Google Scholar 

  33. Khan, S.I., Hoque, A.S.M.L.: A New Technique for Database Fragmentation in Distributed Systems. International Journal of Computer Applications 5(9), 20–24 (2010)

    Article  Google Scholar 

  34. Raouf, A.E., Badr, N.L., Tolba, M.F.: Dynamic distributed database over cloud environment. In: Hassanien, A.E., Tolba, M.F., Taher Azar, A. (eds.) AMLTA 2014. CCIS, vol. 488, pp. 67–76. Springer, Heidelberg (2014)

    Google Scholar 

  35. Harikumar, S., Ramachandran, R.: Hybridized fragmentation of very large databases using clustering. In: IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) (2015)

    Google Scholar 

  36. Triglycerides: Why do they matter? http://www.mayoclinic.org/diseases-conditions/high-blood-cholesterol/in-depth/triglycerides/art-20048186 (Accessed June 07, 2015)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shahidul Islam Khan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Khan, S.I., Hoque, A.S.M.L. (2016). Towards Development of National Health Data Warehouse for Knowledge Discovery. In: Berretti, S., Thampi, S., Dasgupta, S. (eds) Intelligent Systems Technologies and Applications. Advances in Intelligent Systems and Computing, vol 385. Springer, Cham. https://doi.org/10.1007/978-3-319-23258-4_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23258-4_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23257-7

  • Online ISBN: 978-3-319-23258-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics