Construct Left Ventricular Hypertrophy Prediction Model Based on Random Forest

  • Jimmy Ming-Tai Wu
  • Meng-Hsiun TsaiEmail author
  • Sheng-Han Xiao
  • Tsu-Yang Wu
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 109)


Heart disease ranks second in Taiwan’s top ten cause of death in 2016 and the number of deaths in heart disease increases by about 700 people each year. Left ventricular hypertrophy (LVH) has a significant impact on increasing the morbidity of coronary disease and stroke. Therefore, how to improve the accuracy of heart disease diagnosis is urgent. This study suggests a better method that used K-Nearest Neighbor (KNN) to impute missing values of ECG data and Z-score to standardize ECG data for the requirement of the random forest. This study combined the random forest and ECG data to develop an ECG left ventricular hypertrophy classifier. The experimental results show that the accuracy of the prediction model is 66.1%, the sensitivity is 58%, and the specificity is 70.9%.


Electrocardiogram Left ventricular hypertrophy Random forest Machine learning 


  1. 1.
    Taiwan national death cause statistics in 2016Google Scholar
  2. 2.
    Altman, N.S.: An introduction to kernel and nearest-neighbor nonparametric regression. Am. Stat. 46(3), 175–185 (1992)MathSciNetGoogle Scholar
  3. 3.
    Dangare, C.S., Apte, S.S.: Improved study of heart disease prediction system using data mining classification techniques. Int. J. Comput. Appl. 47(10), 44–48 (2012)Google Scholar
  4. 4.
    Devereux, R.B., Alonso, D.R., Lutas, E.M., Gottlieb, G.J., Campo, E., Sachs, I., Reichek, N.: Echocardiographic assessment of left ventricular hypertrophy: comparison to necropsy findings. Am. J. Cardiol. 57(6), 450–458 (1986)CrossRefGoogle Scholar
  5. 5.
    Dietterich, T.G.: Ensemble learning. In: The Handbook of Brain Theory and Neural Networks, vol. 2, pp. 110–125 (2002)Google Scholar
  6. 6.
    Janos, A., Steinbrunn, W.: Heart disease data setGoogle Scholar
  7. 7.
    Kannel, W.B.: Left ventricular hypertrophy as a risk factor: the Framingham experience. J. Hypertens. Suppl. Official J. Int. Soc. Hypertens. 9(2), S3–8 (1991). Discussion S8–9Google Scholar
  8. 8.
    Klabunde, R.E.: Electrocardiogram graphGoogle Scholar
  9. 9.
    Liaw, A., Wiener, M.: Classification and regression by randomforest. R News 2(3), 18–22 (2002)Google Scholar
  10. 10.
    Mosteller, R.D.: Simplified calculation of body-surface area. New Engl. J. Med. 317(17), 1098 (1987)Google Scholar
  11. 11.
    UC Irvine Machine Learning Repository: Statlog (heart) data setGoogle Scholar
  12. 12.
    Schiffrin, E.L., Pu, Q., Park, J.B.: Effect of amlodipine compared to atenolol on small arteries of previously untreated essential hypertensive patients. Am. J. Hypertens. 15(2), 105–110 (2002)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Jimmy Ming-Tai Wu
    • 1
  • Meng-Hsiun Tsai
    • 2
    Email author
  • Sheng-Han Xiao
    • 2
  • Tsu-Yang Wu
    • 1
  1. 1.College of Computer Science and EngineeringShandong University of Science and TechnologyQingdaoChina
  2. 2.Department of Management Information SystemsNational Chung Hsing UniversityTaichungTaiwan

Personalised recommendations