Abstract
Timely availability of accurate data is essential for medical decision making. Healthcare organizations hold large amounts of data spread across various systems. Researchers, healthcare providers, and patients cannot use the knowledge held in these separate stores until the information from the disparate sources is integrated. Developing a health data warehouse is a complex and time-consuming process, but it is essential for delivering quality health services. This paper presents the architecture of a data warehouse model and a development process suitable for integrating data from different healthcare sources. We have developed a star schema suitable for a large data warehouse. Integrating health data requires rigorous preprocessing, and we have preprocessed national health data by applying efficient transformation techniques. Finally, the knowledge discovery potential of the data warehouse is presented with relevant examples.
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Khan, S.I., Hoque, A.S.M.L. (2016). Towards Development of National Health Data Warehouse for Knowledge Discovery. In: Berretti, S., Thampi, S., Dasgupta, S. (eds) Intelligent Systems Technologies and Applications. Advances in Intelligent Systems and Computing, vol 385. Springer, Cham. https://doi.org/10.1007/978-3-319-23258-4_36
DOI: https://doi.org/10.1007/978-3-319-23258-4_36
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23257-7
Online ISBN: 978-3-319-23258-4
eBook Packages: Engineering, Engineering (R0)