Abstract
Imbalanced multi-classification problem is an important research hotspot in machine learning. For multiple categories of data, an important method is divide-and-conquer, which covers the decomposition strategy and ensemble rule. However, with the increase of category, the number of poor binary-classifiers increases, which greatly reduces the classification accuracy. In this paper, a novel classification framework to solve current problem of multi-classification method for imbalanced data was proposed, which including a decomposition strategy based on maximum spanning tree, an ensemble rule based on node degree to find the optimal combination of binary-classifiers and a specific KNN algorithm which can dynamically adjust K neighborhood to increase probability of minority class. Finally we experimented on nine public data sets in different fields and compared our proposed classification framework with the most popular framework. The experimental results show that the proposed classification framework can greatly improve the classification accuracy of minority class.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sun, J., Lee, Y.C., Li, H., Huang, Q.H.: Combining B&B-based hybrid feature selection and the imbalance-oriented multiple-classifier ensemble for imbalanced credit risk assessment. Technol. Econ. Dev. Econ. 21, 351 (2015)
DÃez-Pastor, J.F., RodrÃguez, J.J., GarcÃa-Osorio, C., Kuncheva, L.I.: Random balance: ensembles of variable priors classifiers for imbalanced data. Knowl. Based Syst. 85, 96 (2015)
Li, Y., Guo, H., Liu, X., Li, Y., Li, J.: Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl. Based Syst. 94, 88 (2015)
Santhanam, V., Morariu, V.I., Harwood, D., Davis, L.S.: A non-parametric approach to extending generic binary classifiers for multi-classification. Patt. Recogn. 58, 149 (2016)
Fernández, A., López, V., Galar, M., Jesus, M.J.D., Herrera, F.: Analysing the classification of imbalanced data-sets with multiple classes: binarization techniques and ad-hoc approaches. Knowl. Based Syst. 42, 97 (2013)
Hüllermeier, E., Vanderlooy, S.: Combining predictions in pairwise classification: an optimal adaptive voting strategy and its relation to weighted voting. Patt. Recogn. 43, 128 (2010)
Galar, M., Fernndez, A., Barrenechea, E.: An overview of ensemble methods for binary classifiers in multi-class problems: experimental study on one-vs-one and one-vs-all schemes. Patt. Recogn. 44, 1761 (2011)
Zhang, Z., Krawczyk, B., Garca, S., Rosales-Prez, A., Herrera, F.: Empowering one-vs-one decomposition with ensemble learning for multi-class imbalanced data. Knowl. Based Syst. 106, 251 (2016)
Acknowledgments
This work is funded by Natural Science Foundation of China (41571401), Chongqing Natural Science Foundation (cstc2014kjrc-qnrc40002), Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJ1500431), and WenFeng Creative Foundation of CQUPT (WF2014-05).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Xia, Y., Peng, Y., Zhang, X., Bae, H. (2017). DEMST-KNN: A Novel Classification Framework to Solve Imbalanced Multi-class Problem. In: Silhavy, R., Senkerik, R., Kominkova Oplatkova, Z., Prokopova, Z., Silhavy, P. (eds) Artificial Intelligence Trends in Intelligent Systems. CSOC 2017. Advances in Intelligent Systems and Computing, vol 573. Springer, Cham. https://doi.org/10.1007/978-3-319-57261-1_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-57261-1_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57260-4
Online ISBN: 978-3-319-57261-1
eBook Packages: EngineeringEngineering (R0)