Skip to main content
Log in

A fuzzy classification model for myocardial infarction risk assessment

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The use of data mining approaches for analyzing patients trace in different medical databases has become an important research field especially with the evolution of these methods and their contributions in medical decision support. In this paper, we develop a new clinical decision support system (CDSS) to diagnose Coronary Artery Diseases (CAD). According to CAD experts, Angiography is most accurate CAD diagnosis technique. However, it has many aftereffects and is very costly. Existing studies showed that CAD diagnosis requires heterogeneous patients traces from medical history while applying data mining techniques to achieve high accuracy. In this paper, an automatic approach to design CDSS for CAD assessment is proposed. The proposed diagnosis model is based on Random Forest algorithm, C5.0 decision tree algorithm and Fuzzy modeling. It consists of two stages: first, Random Forest algorithm is used to rank the features and a C5.0 decision tree based approach for crisp rule generation is developed. Then, we created the fuzzy inference system. The generation of fuzzy weighted rules is carried out automatically from the previous crisp rules. Moreover, a critical issue about the CDSS is that some values of the features are missing in most cases. A new method to deal with the problem of missing data, which allows evaluating the similarity despite the missing information, was proposed. Finally, experimental results underscore very promising classification accuracy of 90.50% while optimizing training time using UCI (the University of California at Irvine) heart diseases datasets compared to the previously reported results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Aaronson KD, Schwartz JS, Chen TM, Wong KL, Goin JE, Mancini DM (1997) Development and prospective validation of a clinical index to predict survival in ambulatory patients referred for cardiac transplant evaluation. Circulation 95(12):2660–2667

    Article  Google Scholar 

  2. Abbasi M, Kashiyarndi S (2006) Clinical decision support systems: A discussion on different methodologies used in health care. Marlaedalen University Sweden. Available at: http://www.idt.mdh.se/kurser/ct3340/ht10/FinalPapers/15-AbbasiKashiyarndi.pdf. [Accessed on: 25 Fev 2014]

  3. Abidin B, Dom RM, Rahman ARA, Bakar RA, Demiralp M, Baykara N, Mastorakis N (2009) Use of fuzzy neural network to predict coronary heart disease in a malaysian sample. In: WSEAS International Conference. Proceedings. Mathematics and Computers in Science and Engineering, 8. World Scientific and Engineering Academy and Society

  4. Alizadehsani R, Habibi J, Hosseini MJ, Mashayekhi H, Boghrati R, Ghandeharioun A, Bahadorian B, Sani ZA (2013) A data mining approach for diagnosis of coronary artery disease. Comput Methods Program Biomed 111(1):52–61

    Article  Google Scholar 

  5. Bashir S, Qamar U, Khan FH (2015) Bagmoov: A novel ensemble for heart disease prediction bootstrap aggregation with multi-objective optimized voting. Australas Phys Eng Sci Med:1–19

  6. Blake CL, Merz CJ (1998) Uci Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California. Department of Information and Computer Science:55

  7. Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140

    MATH  Google Scholar 

  8. Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  MATH  Google Scholar 

  9. Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. CRC press, USA

    MATH  Google Scholar 

  10. Bujlow T, Riaz T, Pedersen JM (2012) A method for classification of network traffic based on c5. 0 machine learning algorithm. In: 2012 International Conference on Computing, Networking and Communications (ICNC). IEEE, pp 237–241

  11. Ceylan R, Koyuncu H (2016) A new breakpoint in hybrid particle swarm-neural network architecture: Individual boundary adjustment. Int J Inf Technol Decis Making 15(6):1313–1344. https://doi.org/10.1142/S0219622016500395

    Article  Google Scholar 

  12. Chauraisa V, Pal S (2013) Early prediction of heart diseases using data mining techniques. Carib j SciTech 1:208–217

    Google Scholar 

  13. Chen AH, Huang SY, Hong PS, Cheng CH, Lin EJ (2011) Hdps: heart disease prediction system. In: Computing in Cardiology. IEEE, pp 557–560

  14. Chitra R, Seenivasagam V (2013) Heart disease prediction system using supervised learning classifier. Bonfring Int J Softw Eng Soft Comput 3(1):01–07

    Article  Google Scholar 

  15. Das R, Turkoglu I, Sengur A (2009) Effective diagnosis of heart disease through neural networks ensembles. Expert Syst Appl 36(4):7675–7680

    Article  Google Scholar 

  16. Detrano R, Janosi A, Steinbrunn W, Pfisterer M, Schmid JJ, Sandhu S, Guppy KH, Lee S, Froelicher V (1989) International application of a new probability algorithm for the diagnosis of coronary artery disease. Amer J Cardiol 64(5):304–310

    Article  Google Scholar 

  17. Dudani SA (1976) The distance-weighted k-nearest-neighbor rule. IEEE Trans Syst Man Cybern 6(4):325–327. https://doi.org/10.1109/TSMC.1976.5408784

    Article  Google Scholar 

  18. Esposito F, Malerba D, Semeraro G, Kay J et al (1997) A comparative analysis of methods for pruning decision trees. IEEE Trans Pattern Anal Mach Intell 19(5):476–491

    Article  Google Scholar 

  19. Fidele B, Cheeneebash J, Gopaul A, Goorah SS (2009) Artificial neural network as a clinical decision-supporting tool to predict cardiovascular disease. Trends Appl Sci Res 4(1):36–46

    Article  Google Scholar 

  20. Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: Computational Learning Theory. Springer, pp 23–37

  21. Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182

    MATH  Google Scholar 

  22. Hastie T, Tibshirani R, Friedman J (2009) Unsupervised learning. In: The Elements of Statistical Learning. Springer, pp 485–585

  23. Hedeshi NG, Abadeh MS (2014) Coronary artery disease detection using a fuzzy-boosting pso approach. Comput Intell Neurosci 2014:6

    Google Scholar 

  24. Hirao T, Isozaki H, Maeda E, Matsumoto Y (2002) Extracting important sentences with support vector machines. In: Proceedings of the 19th international conference on Computational linguistics-Volume 1. Association for Computational Linguistics, pp 1–7

  25. Jayadeva, Khemchandani R, Chandra S (2004) Fast and robust learning through fuzzy linear proximal support vector machines. Neurocomputing 61:401–411. https://doi.org/10.1016/j.neucom.2004.02.004

    Article  Google Scholar 

  26. Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1):82–89

    Article  Google Scholar 

  27. Karaolis M, Moutiris J, Hadjipanayi D, Pattichis CS et al (2010) Assessment of the risk factors of coronary heart events based on data mining with decision trees. IEEE Trans Inf Technol Biomed 14(3):559–566

    Article  Google Scholar 

  28. Karaolis M, Moutiris JA, Pattichis CS (2008) Assessment of the risk of coronary heart event based on data mining. In: 2008. BIBE 2008. 8th IEEE International Conference on BioInformatics and BioEngineering. IEEE, pp 1–5

  29. Kim YS, Cho YE (1995) Correlation of pain severity with thermography. In: 1995., IEEE 17th Annual Conference on Engineering in Medicine and Biology Society, vol 2. IEEE, pp 1699– 1700

  30. Kunjunninair AP (2012) Erratum to: clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules and decision tree rules. Cent Europ J Comput Sci 2(1):86. https://doi.org/10.2478/s13537-012-0007-7

    Google Scholar 

  31. Kurt I, Ture M, Kurum AT (2008) Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease. Expert Syst Appl 34(1):366–374

    Article  Google Scholar 

  32. Mackay J, Mensah GA, Mendis S, Greenlund K (2004) The atlas of heart disease and stroke. World Health Organization, Geneva

    Google Scholar 

  33. Marateb HR, Goudarzi S (2015) A noninvasive method for coronary artery diseases diagnosis using a clinically-interpretable fuzzy rule-based system. J Res Med Sci 20(3):214–23

    Google Scholar 

  34. Mendis S, Puska P, Norrving B et al. (2011) Global atlas on cardiovascular disease prevention and control. World Health Organization, Geneva

    Google Scholar 

  35. Merrad K (2012) les facteurs de risque lies aux maladies cardiovasculaires, journee commemorative du cinquantenaire de la clinique de cardiologie du chu mustapha pacha. alger L’expression, DZ

    Google Scholar 

  36. Mingers J (1989) An empirical comparison of selection measures for decision-tree induction. Mach Learn 3 (4):319–342

    Google Scholar 

  37. Mokeddem S, Atmani B (2017) Assessment of clinical decision support systems for predicting coronary heart disease. In: Fuzzy Systems: Concepts, Methodologies, Tools, and Applications. IGI Global, Hershey, pp 184–201

  38. Mokeddem S, Atmani B, Mokaddem M (2014) A new approach for coronary artery diseases diagnosis based on genetic algorithm. Int J Decis Support Syst Technol (IJDSST) 6(4):1–15

    Article  Google Scholar 

  39. Ordonez C (2006) Association rule discovery with the train and test approach for heart disease prediction. IEEE Trans Inf Technol Biomed 10(2):334–343. https://doi.org/10.1109/TITB.2006.864475

    Article  Google Scholar 

  40. Pang Sl, GONG Jz (2009) C5. 0 classification algorithm and application on individual credit evaluation of banks. Syst Eng-Theory Pract 29(12):94–104

    Article  Google Scholar 

  41. Polat K, Günes S, Tosun S (2006) Diagnosis of heart disease using artificial immune recognition system and fuzzy weighted pre-processing. Pattern Recogn 39(11):2186–2193. https://doi.org/10.1016/j.patcog.2006.05.028

    Article  Google Scholar 

  42. Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106

    Google Scholar 

  43. Quinlan JR (1996) Bagging, boosting, and c4. 5. In: AAAI/IAAI, vol 1, pp 725–730

  44. Quinlan R (2004) Data mining tools see5 and c5. 0, 2004 Google Scholar

  45. Rani KU (2011) Analysis of heart diseases dataset using neural network approach. arXiv:1110.2626

  46. Rätsch G, Onoda T, Müller KR (2001) Soft margins for adaboost. Mach Learn 42(3):287–320

    Article  MATH  Google Scholar 

  47. Ravindranath KR (2015) Clinical decision support system for heart diseases using extended sub tree. In: 2015 International Conference on Pervasive Computing (ICPC). IEEE, pp 1–5

  48. Seewald AK (2003) Towards understanding stacking: studies of a general ensemble learning scheme. na

  49. Shantakumar B, Kumaraswamy Y (2009) Intelligent and effective heart attack prediction system using data mining and artificial neural network. Eur J Sci Res 31(4):642–656. Cited By 47

    Google Scholar 

  50. Shouman M, Turner T, Stocker R (2012) Integrating naive bayes and k-means clustering with different initial centroid selection methods in the diagnosis of heart disease patients. Glob J Comput Sci Technol 2:125–137

  51. Sundar NA, Latha PP, Chandra MR (2012) Performance analysis of classification data mining techniques over heart disease database. IJESAT International Journal of engineering science & advanced technology ISSN, pp 2250–3676

  52. Tomar D, Agarwal S (2014) Feature selection based least square twin support vector machine for diagnosis of heart disease. Int J Bio-Sci Bio-Technol 6(2):69–82

    Article  Google Scholar 

  53. Tsien C, Fraser H, Long W, Kennedy R (1998) Using classification tree and logistic regression methods to diagnose myocardial infarction. Stud Health Technol Inf 52:493

    Google Scholar 

  54. Tsipouras MG, Exarchos TP, Fotiadis DI, Kotsia AP, Vakalis KV, Naka KK, Michalis LK (2008) Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling. IEEE Trans Inf Technol Biomed 12(4):447–458. https://doi.org/10.1109/TITB.2007.907985

    Article  Google Scholar 

  55. Warren J, Beliakov G, Van der Zwaag B (2000) Fuzzy logic in clinical practice decision support systems. In: 2000. Proceedings of the 33rd Annual Hawaii International Conference on System Sciences. IEEE, pp 10–pp

  56. Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353

    Article  MATH  Google Scholar 

  57. Zhou ZH, Chen S (2002) Neural network ensemble. Chin J Comput-Chin Ed 25(1):1–8

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sid Ahmed Mokeddem.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mokeddem, S.A. A fuzzy classification model for myocardial infarction risk assessment. Appl Intell 48, 1233–1250 (2018). https://doi.org/10.1007/s10489-017-1102-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-017-1102-1

Keywords

Navigation