Abstract
In medical intelligent diagnosis, most of the real-world datasets have the class-imbalance problem and some strong correlation features. In this paper, a novel classification model with hierarchical feature representation is proposed to tackle small and imbalanced biomedicine datasets. The main idea of the proposed method is to integrate extreme learning machine-autoencoder (ELM-AE) into the weighted ELM (W-ELM) model. ELM-AE with norm optimization is utilized to extract more effective information from raw data, thereby forming a hierarchical and compact feature representation. Afterwards, random projections of learned feature results view as inputs of the W-ELM. An adaptive weighting scheme is designed to reduce the misclassified rate of the minority class by assigning a larger weight to minority samples. The classification performance of the proposed method is evaluated on two biomedical datasets from the UCI repository. The experimental results show that the proposed method cannot only effectively solve the class-imbalanced problem with small biomedical datasets, but also obtain a higher and more stable performance than other state-of-the-art classification methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Krawczyk, B.: Learning from imbalanced data: open challenges and future directions. Prog. Artif. Intell. 5(4), 221–232 (2016)
Rahman, M.M., Davis, D.N.: Addressing the class imbalance problem in medical datasets. Int. J. Mach. Learn. Comput. 3(2), 224–228 (2013)
Krawczyk, B., Galar, M., Jelen, Ł., Herrera, F.: Evolutionary undersampling boosting for imbalanced classification of breast cancer malignancy. Appl. Soft Comput. 38, 714–726 (2016)
Ali, S., Majid, A., Javed, S.G., et al.: Can-CSC-GBE: developing cost-sensitive classifier with gentleboost ensemble for breast cancer classification using protein amino acids and imbalanced data. Comput. Biol. Med. 73, 38–46 (2016)
Ren, F., Cao, P., Li, W., et al.: Ensemble based adaptive over-sampling method for imbalanced data learning in computer aided detection of microaneurysm. Comput. Med. Imaging Graph. 55, 54–67 (2017)
Yap, B.W., Rani, K.A., Rahman, H.A.A., Fong, S., Khairudin, Z., Abdullah, N.N.: An application of oversampling, undersampling, bagging and boosting in handling imbalanced datasets. In: Herawan, T., Deris, M.M., Abawajy, J. (eds.) Proceedings of the First International Conference on Advanced Data and Information Engineering (DaEng-2013). LNEE, vol. 285, pp. 13–22. Springer, Singapore (2014). https://doi.org/10.1007/978-981-4585-18-7_2
Gong, C.L., Gu, L.X.: A novel SMOTE-based classification approach to online data imbalance problem. Math. Probl. Eng., 1–14 (2016)
Zong, W., Huang, G.B., Chen, Y.: Weighted extreme learning machine for imbalance learning. Neurocomputing 101, 229–242 (2013)
Sani, S., Massie, S., Wiratunga, N., Cooper, K.: Learning deep and shallow features for human activity recognition. In: Li, G., Ge, Y., Zhang, Z., Jin, Z., Blumenstein, M. (eds.) KSEM 2017. LNCS (LNAI), vol. 10412, pp. 469–482. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-63558-3_40
Huang, G., Huang, G.B., Song, S., et al.: Trends in extreme learning machines: a review. Neural Netw. 61, 32–48 (2015)
Tang, J.X., Deng, C.W., Huang, G.B.: Extreme learning machine for multilayer perceptron. IEEE Trans. Neural Netw. Learn. Syst. 27(4), 809–821 (2016)
Beck, A., Teboulle, M.: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2(1), 183–202 (2009)
UCI Machine Learning Repository. http://archive.ics.uci.edu/ml/datasets
Acknowledgments
This work is supported by the Science & Technology Development Program of Jilin Province, China (Nos. 20150307030GX, 2015Y059 and 20160204048GX), and by the International Science and Technology Cooperation Program of China under Grant (No. 2015DFA11180), National Key Research and Development Program of China (No. 2017YFC0108303), and Science Foundation for Young Scholars of Changchun University of Science and Technology (No. XQNJJ-2016-08).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, L., Zhao, J., Yang, H., Jiang, Z., Shi, W. (2018). An Improved Weighted ELM with Hierarchical Feature Representation for Imbalanced Biomedical Datasets. In: Liu, W., Giunchiglia, F., Yang, B. (eds) Knowledge Science, Engineering and Management. KSEM 2018. Lecture Notes in Computer Science(), vol 11061. Springer, Cham. https://doi.org/10.1007/978-3-319-99365-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-99365-2_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99364-5
Online ISBN: 978-3-319-99365-2
eBook Packages: Computer ScienceComputer Science (R0)