Abstract
Hybrid classification approaches are widely used in the credit domain to obtain valuable information about customer behaviour. Single classification algorithms such as neural networks, support vector machines (SVM), and regression analysis have been used in this area for years. In this paper, we propose hybrid classification approaches that combine several classifiers and ensemble learners to boost classification accuracy. We worked with two credit datasets: the German dataset, which is public, and a Turkish Corporate Bank dataset. The goal of using such diverse datasets is to assess the generalization ability of the proposed model. Results show that feature selection plays a vital role in classification accuracy, that hybrid approaches built with ensemble learners outperform single classification techniques, and that hybrid approaches that include an SVM achieve better accuracy than other hybrid approaches.
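As a rough illustration of what combining several classifiers can look like, the sketch below puts three base learners behind a majority vote. It is not the model evaluated in the paper: the base learners are simple threshold rules (decision stumps) standing in for classifiers such as SVMs or neural networks, and the function names, thresholds, and toy data are all invented for this example.

```python
from collections import Counter
from typing import Callable, List, Sequence

# A classifier is any function mapping a feature vector to a label
# (1 = good credit, 0 = bad credit). This convention is an assumption.
Classifier = Callable[[Sequence[float]], int]


def make_stump(feature: int, threshold: float) -> Classifier:
    """Hypothetical base learner: predicts 1 when one feature exceeds a threshold."""
    def predict(x: Sequence[float]) -> int:
        return 1 if x[feature] > threshold else 0
    return predict


def majority_vote(classifiers: List[Classifier], x: Sequence[float]) -> int:
    """Combine base predictions by simple majority.

    With an odd number of binary voters, as used here, ties cannot occur.
    """
    votes = Counter(clf(x) for clf in classifiers)
    return votes.most_common(1)[0][0]


def accuracy(predict: Classifier, X: List[List[float]], y: List[int]) -> float:
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for xi, yi in zip(X, y) if predict(xi) == yi)
    return correct / len(y)


# Made-up toy data; columns are [income, years_employed, savings].
X = [
    [55, 5, 20],
    [30, 1, 5],
    [45, 2, 15],
    [35, 6, 12],
    [50, 1, 2],
    [20, 4, 3],
    [60, 8, 4],
]
y = [1, 0, 1, 1, 0, 0, 1]

# Three hand-picked stumps, one per feature (thresholds are arbitrary).
stumps = [make_stump(0, 40), make_stump(1, 3), make_stump(2, 10)]


def ensemble(x: Sequence[float]) -> int:
    return majority_vote(stumps, x)


for i, stump in enumerate(stumps):
    print(f"stump {i} accuracy: {accuracy(stump, X, y):.2f}")
print(f"ensemble accuracy: {accuracy(ensemble, X, y):.2f}")
```

On this contrived data each stump misclassifies at least one sample while the vote does not. That is a property of the hand-built example only and says nothing about the results reported in the paper.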
Acknowledgement
The work of V.C. Gungor was supported by the Turkish National Academy of Sciences Distinguished Young Scientist Award Program (TUBA-GEBIP) under Grant no. V.G./TUBA-GEBIP/2013-14.
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Cetiner, E., Gungor, V.C., Kocak, T. (2018). Evaluation of Hybrid Classification Approaches: Case Studies on Credit Datasets. In: Perner, P. (ed.) Machine Learning and Data Mining in Pattern Recognition. MLDM 2018. Lecture Notes in Computer Science, vol 10935. Springer, Cham. https://doi.org/10.1007/978-3-319-96133-0_6
DOI: https://doi.org/10.1007/978-3-319-96133-0_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96132-3
Online ISBN: 978-3-319-96133-0
eBook Packages: Computer Science, Computer Science (R0)