Imbalance Reduction Techniques Applied to ECG Classification Problem
In this work we explored whether reducing dataset imbalance can improve the performance of deep learning models. Our experiments used MIT-BIH, a highly imbalanced ECG dataset. Multiple approaches were considered. First, we introduced multiclass UMCE, an ensemble designed to deal with imbalanced datasets. Second, we studied the impact of applying oversampling techniques to the training set. One of the methods was SMOTE without prior undersampling of the majority class. Another was SMOTE with noise added to the synthetic training examples. The baseline for our study was a single ResNet network trained on an undersampled training set. Multiclass UMCE outperformed the baseline model but failed to beat the results obtained by a single model with SMOTE applied to the training set. Introducing perturbations to the signals generated by SMOTE did not bring a significant improvement. Future work may consider combining multiclass UMCE with SMOTE.
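To make the oversampling variants mentioned in the abstract concrete, the following is a minimal sketch of SMOTE-style interpolation with an optional noise perturbation. It is not the implementation used in this work: the function name `smote_with_noise`, the parameters `k` and `noise_std`, the choice of Gaussian noise, and the assumption that each heartbeat is a fixed-length feature vector are all illustrative.

```python
import numpy as np

def smote_with_noise(minority, n_new, k=5, noise_std=0.0, seed=0):
    """Create n_new synthetic samples for one minority class.

    minority  : array of shape (n_samples, n_features), e.g. fixed-length beats
                (assumed representation, not taken from the paper).
    k         : number of nearest neighbours considered for interpolation.
    noise_std : std of Gaussian noise added to each synthetic sample
                (0 gives plain SMOTE-style interpolation).
    """
    rng = np.random.default_rng(seed)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    if n < 2:
        raise ValueError("need at least two minority samples to interpolate")
    k = min(k, n - 1)

    # Pairwise squared Euclidean distances; a sample is never its own neighbour.
    diffs = minority[:, None, :] - minority[None, :, :]
    dist = (diffs ** 2).sum(axis=-1)
    np.fill_diagonal(dist, np.inf)
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Pick a base sample and one of its k neighbours for every new example.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Interpolate at a random position on the segment base -> neighbour.
    gap = rng.random((n_new, 1))
    synthetic = minority[base] + gap * (minority[picked] - minority[base])

    # Optional perturbation of the synthetic examples.
    if noise_std > 0:
        synthetic = synthetic + rng.normal(0.0, noise_std, size=synthetic.shape)
    return synthetic
```

Calling the function with `noise_std=0.0` corresponds to plain interpolation between a minority sample and one of its neighbours, while a positive `noise_std` perturbs the generated examples. Brute-force neighbour search keeps the sketch short but scales quadratically with the class size.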
Keywords: Machine learning, ECG classification, Imbalanced data
This work is supported by the Polish National Science Center under Grant no. UMO-2015/19/B/ST6/01597 as well as the statutory funds of the Department of Systems and Computer Networks, Faculty of Electronics, Wrocław University of Science and Technology.
We would also like to thank Michał Leś for lending his computing resources. Thanks to him, these results could be collected and presented.