Advertisement

Applied Intelligence

, Volume 48, Issue 5, pp 1233–1250 | Cite as

A fuzzy classification model for myocardial infarction risk assessment

  • Sid Ahmed Mokeddem
Article

Abstract

The use of data mining approaches for analyzing patients trace in different medical databases has become an important research field especially with the evolution of these methods and their contributions in medical decision support. In this paper, we develop a new clinical decision support system (CDSS) to diagnose Coronary Artery Diseases (CAD). According to CAD experts, Angiography is most accurate CAD diagnosis technique. However, it has many aftereffects and is very costly. Existing studies showed that CAD diagnosis requires heterogeneous patients traces from medical history while applying data mining techniques to achieve high accuracy. In this paper, an automatic approach to design CDSS for CAD assessment is proposed. The proposed diagnosis model is based on Random Forest algorithm, C5.0 decision tree algorithm and Fuzzy modeling. It consists of two stages: first, Random Forest algorithm is used to rank the features and a C5.0 decision tree based approach for crisp rule generation is developed. Then, we created the fuzzy inference system. The generation of fuzzy weighted rules is carried out automatically from the previous crisp rules. Moreover, a critical issue about the CDSS is that some values of the features are missing in most cases. A new method to deal with the problem of missing data, which allows evaluating the similarity despite the missing information, was proposed. Finally, experimental results underscore very promising classification accuracy of 90.50% while optimizing training time using UCI (the University of California at Irvine) heart diseases datasets compared to the previously reported results.

Keywords

C5.0 CAD CDSS Machine learning Random forest Fuzzy logic Fuzzy expert system Neural network 

References

  1. 1.
    Aaronson KD, Schwartz JS, Chen TM, Wong KL, Goin JE, Mancini DM (1997) Development and prospective validation of a clinical index to predict survival in ambulatory patients referred for cardiac transplant evaluation. Circulation 95(12):2660–2667CrossRefGoogle Scholar
  2. 2.
    Abbasi M, Kashiyarndi S (2006) Clinical decision support systems: A discussion on different methodologies used in health care. Marlaedalen University Sweden. Available at: http://www.idt.mdh.se/kurser/ct3340/ht10/FinalPapers/15-AbbasiKashiyarndi.pdf. [Accessed on: 25 Fev 2014]
  3. 3.
    Abidin B, Dom RM, Rahman ARA, Bakar RA, Demiralp M, Baykara N, Mastorakis N (2009) Use of fuzzy neural network to predict coronary heart disease in a malaysian sample. In: WSEAS International Conference. Proceedings. Mathematics and Computers in Science and Engineering, 8. World Scientific and Engineering Academy and SocietyGoogle Scholar
  4. 4.
    Alizadehsani R, Habibi J, Hosseini MJ, Mashayekhi H, Boghrati R, Ghandeharioun A, Bahadorian B, Sani ZA (2013) A data mining approach for diagnosis of coronary artery disease. Comput Methods Program Biomed 111(1):52–61CrossRefGoogle Scholar
  5. 5.
    Bashir S, Qamar U, Khan FH (2015) Bagmoov: A novel ensemble for heart disease prediction bootstrap aggregation with multi-objective optimized voting. Australas Phys Eng Sci Med:1–19Google Scholar
  6. 6.
    Blake CL, Merz CJ (1998) Uci Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California. Department of Information and Computer Science:55
  7. 7.
    Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140zbMATHGoogle Scholar
  8. 8.
    Breiman L (2001) Random forests. Mach Learn 45(1):5–32CrossRefzbMATHGoogle Scholar
  9. 9.
    Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. CRC press, USAzbMATHGoogle Scholar
  10. 10.
    Bujlow T, Riaz T, Pedersen JM (2012) A method for classification of network traffic based on c5. 0 machine learning algorithm. In: 2012 International Conference on Computing, Networking and Communications (ICNC). IEEE, pp 237–241Google Scholar
  11. 11.
    Ceylan R, Koyuncu H (2016) A new breakpoint in hybrid particle swarm-neural network architecture: Individual boundary adjustment. Int J Inf Technol Decis Making 15(6):1313–1344.  https://doi.org/10.1142/S0219622016500395 CrossRefGoogle Scholar
  12. 12.
    Chauraisa V, Pal S (2013) Early prediction of heart diseases using data mining techniques. Carib j SciTech 1:208–217Google Scholar
  13. 13.
    Chen AH, Huang SY, Hong PS, Cheng CH, Lin EJ (2011) Hdps: heart disease prediction system. In: Computing in Cardiology. IEEE, pp 557–560Google Scholar
  14. 14.
    Chitra R, Seenivasagam V (2013) Heart disease prediction system using supervised learning classifier. Bonfring Int J Softw Eng Soft Comput 3(1):01–07CrossRefGoogle Scholar
  15. 15.
    Das R, Turkoglu I, Sengur A (2009) Effective diagnosis of heart disease through neural networks ensembles. Expert Syst Appl 36(4):7675–7680CrossRefGoogle Scholar
  16. 16.
    Detrano R, Janosi A, Steinbrunn W, Pfisterer M, Schmid JJ, Sandhu S, Guppy KH, Lee S, Froelicher V (1989) International application of a new probability algorithm for the diagnosis of coronary artery disease. Amer J Cardiol 64(5):304–310CrossRefGoogle Scholar
  17. 17.
    Dudani SA (1976) The distance-weighted k-nearest-neighbor rule. IEEE Trans Syst Man Cybern 6(4):325–327.  https://doi.org/10.1109/TSMC.1976.5408784 CrossRefGoogle Scholar
  18. 18.
    Esposito F, Malerba D, Semeraro G, Kay J et al (1997) A comparative analysis of methods for pruning decision trees. IEEE Trans Pattern Anal Mach Intell 19(5):476–491CrossRefGoogle Scholar
  19. 19.
    Fidele B, Cheeneebash J, Gopaul A, Goorah SS (2009) Artificial neural network as a clinical decision-supporting tool to predict cardiovascular disease. Trends Appl Sci Res 4(1):36–46CrossRefGoogle Scholar
  20. 20.
    Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: Computational Learning Theory. Springer, pp 23–37Google Scholar
  21. 21.
    Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182zbMATHGoogle Scholar
  22. 22.
    Hastie T, Tibshirani R, Friedman J (2009) Unsupervised learning. In: The Elements of Statistical Learning. Springer, pp 485–585Google Scholar
  23. 23.
    Hedeshi NG, Abadeh MS (2014) Coronary artery disease detection using a fuzzy-boosting pso approach. Comput Intell Neurosci 2014:6Google Scholar
  24. 24.
    Hirao T, Isozaki H, Maeda E, Matsumoto Y (2002) Extracting important sentences with support vector machines. In: Proceedings of the 19th international conference on Computational linguistics-Volume 1. Association for Computational Linguistics, pp 1–7Google Scholar
  25. 25.
    Jayadeva, Khemchandani R, Chandra S (2004) Fast and robust learning through fuzzy linear proximal support vector machines. Neurocomputing 61:401–411.  https://doi.org/10.1016/j.neucom.2004.02.004 CrossRefGoogle Scholar
  26. 26.
    Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1):82–89CrossRefGoogle Scholar
  27. 27.
    Karaolis M, Moutiris J, Hadjipanayi D, Pattichis CS et al (2010) Assessment of the risk factors of coronary heart events based on data mining with decision trees. IEEE Trans Inf Technol Biomed 14(3):559–566CrossRefGoogle Scholar
  28. 28.
    Karaolis M, Moutiris JA, Pattichis CS (2008) Assessment of the risk of coronary heart event based on data mining. In: 2008. BIBE 2008. 8th IEEE International Conference on BioInformatics and BioEngineering. IEEE, pp 1–5Google Scholar
  29. 29.
    Kim YS, Cho YE (1995) Correlation of pain severity with thermography. In: 1995., IEEE 17th Annual Conference on Engineering in Medicine and Biology Society, vol 2. IEEE, pp 1699– 1700Google Scholar
  30. 30.
    Kunjunninair AP (2012) Erratum to: clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules and decision tree rules. Cent Europ J Comput Sci 2(1):86.  https://doi.org/10.2478/s13537-012-0007-7 Google Scholar
  31. 31.
    Kurt I, Ture M, Kurum AT (2008) Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease. Expert Syst Appl 34(1):366–374CrossRefGoogle Scholar
  32. 32.
    Mackay J, Mensah GA, Mendis S, Greenlund K (2004) The atlas of heart disease and stroke. World Health Organization, GenevaGoogle Scholar
  33. 33.
    Marateb HR, Goudarzi S (2015) A noninvasive method for coronary artery diseases diagnosis using a clinically-interpretable fuzzy rule-based system. J Res Med Sci 20(3):214–23Google Scholar
  34. 34.
    Mendis S, Puska P, Norrving B et al. (2011) Global atlas on cardiovascular disease prevention and control. World Health Organization, GenevaGoogle Scholar
  35. 35.
    Merrad K (2012) les facteurs de risque lies aux maladies cardiovasculaires, journee commemorative du cinquantenaire de la clinique de cardiologie du chu mustapha pacha. alger L’expression, DZGoogle Scholar
  36. 36.
    Mingers J (1989) An empirical comparison of selection measures for decision-tree induction. Mach Learn 3 (4):319–342Google Scholar
  37. 37.
    Mokeddem S, Atmani B (2017) Assessment of clinical decision support systems for predicting coronary heart disease. In: Fuzzy Systems: Concepts, Methodologies, Tools, and Applications. IGI Global, Hershey, pp 184–201Google Scholar
  38. 38.
    Mokeddem S, Atmani B, Mokaddem M (2014) A new approach for coronary artery diseases diagnosis based on genetic algorithm. Int J Decis Support Syst Technol (IJDSST) 6(4):1–15CrossRefGoogle Scholar
  39. 39.
    Ordonez C (2006) Association rule discovery with the train and test approach for heart disease prediction. IEEE Trans Inf Technol Biomed 10(2):334–343.  https://doi.org/10.1109/TITB.2006.864475 CrossRefGoogle Scholar
  40. 40.
    Pang Sl, GONG Jz (2009) C5. 0 classification algorithm and application on individual credit evaluation of banks. Syst Eng-Theory Pract 29(12):94–104CrossRefGoogle Scholar
  41. 41.
    Polat K, Günes S, Tosun S (2006) Diagnosis of heart disease using artificial immune recognition system and fuzzy weighted pre-processing. Pattern Recogn 39(11):2186–2193.  https://doi.org/10.1016/j.patcog.2006.05.028 CrossRefGoogle Scholar
  42. 42.
    Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106Google Scholar
  43. 43.
    Quinlan JR (1996) Bagging, boosting, and c4. 5. In: AAAI/IAAI, vol 1, pp 725–730Google Scholar
  44. 44.
    Quinlan R (2004) Data mining tools see5 and c5. 0, 2004 Google ScholarGoogle Scholar
  45. 45.
    Rani KU (2011) Analysis of heart diseases dataset using neural network approach. arXiv:1110.2626
  46. 46.
    Rätsch G, Onoda T, Müller KR (2001) Soft margins for adaboost. Mach Learn 42(3):287–320CrossRefzbMATHGoogle Scholar
  47. 47.
    Ravindranath KR (2015) Clinical decision support system for heart diseases using extended sub tree. In: 2015 International Conference on Pervasive Computing (ICPC). IEEE, pp 1–5Google Scholar
  48. 48.
    Seewald AK (2003) Towards understanding stacking: studies of a general ensemble learning scheme. naGoogle Scholar
  49. 49.
    Shantakumar B, Kumaraswamy Y (2009) Intelligent and effective heart attack prediction system using data mining and artificial neural network. Eur J Sci Res 31(4):642–656. Cited By 47Google Scholar
  50. 50.
    Shouman M, Turner T, Stocker R (2012) Integrating naive bayes and k-means clustering with different initial centroid selection methods in the diagnosis of heart disease patients. Glob J Comput Sci Technol 2:125–137Google Scholar
  51. 51.
    Sundar NA, Latha PP, Chandra MR (2012) Performance analysis of classification data mining techniques over heart disease database. IJESAT International Journal of engineering science & advanced technology ISSN, pp 2250–3676Google Scholar
  52. 52.
    Tomar D, Agarwal S (2014) Feature selection based least square twin support vector machine for diagnosis of heart disease. Int J Bio-Sci Bio-Technol 6(2):69–82CrossRefGoogle Scholar
  53. 53.
    Tsien C, Fraser H, Long W, Kennedy R (1998) Using classification tree and logistic regression methods to diagnose myocardial infarction. Stud Health Technol Inf 52:493Google Scholar
  54. 54.
    Tsipouras MG, Exarchos TP, Fotiadis DI, Kotsia AP, Vakalis KV, Naka KK, Michalis LK (2008) Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling. IEEE Trans Inf Technol Biomed 12(4):447–458.  https://doi.org/10.1109/TITB.2007.907985 CrossRefGoogle Scholar
  55. 55.
    Warren J, Beliakov G, Van der Zwaag B (2000) Fuzzy logic in clinical practice decision support systems. In: 2000. Proceedings of the 33rd Annual Hawaii International Conference on System Sciences. IEEE, pp 10–ppGoogle Scholar
  56. 56.
    Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353CrossRefzbMATHGoogle Scholar
  57. 57.
    Zhou ZH, Chen S (2002) Neural network ensemble. Chin J Comput-Chin Ed 25(1):1–8MathSciNetGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2017

Authors and Affiliations

  1. 1.University of MostaganemMostaganemAlgeria

Personalised recommendations