Enhancing Prediction Accuracy of Default of Credit Using Ensemble Techniques

Emil Richard Singh, B.; Sivasankar, E.

doi:10.1007/978-981-13-1580-0_41

B. Emil Richard Singh¹⁷ &
E. Sivasankar¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 815))

919 Accesses
2 Citations

Abstract

Credit rating of an institution or individual provides a suggestive financial picture and strength of the individual or the institution. It gives the lender the ability to visualize the potentiality to the extent to which credit could be availed by the institution or the individual. Default prediction on the sum of all attributes such as payment history is a common instrument used for generation of credit rating. This research is aimed at comparing the predictive accuracy of ensemble of base classifiers using techniques of bagging, boosting, and random forest in the prediction of default of credit card clients and suggesting the technique with the highest accuracy. Customers’ default payment in Taiwan dataset is used to build the model. ML classification algorithms such as K-nearest neighbor, Naive Bayesian, decision tree, and support vector machines are applied to create the base model on the dataset. Bagging, boosting, and random forest are applied on the dataset to generate model for prediction. The accuracy of each of the models for various degrees is tabulated. Information gain feature filter method is used to identify features with maximum entropy. The features with high entropy suggested by information gain together with ensemble techniques are used to build the new model. The accuracy of the new model is then tabulated. Boosting ensemble technique is found to have the best accuracy of prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Leo Breiman. Bagging predictors. Machine learning, 24(2):123–140, 1996.
MathSciNet MATH Google Scholar
Hamed R Bonab and Fazli Can. A theoretical framework on the ideal number of classifiers for online ensembles in data streams. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pages 2053–2056. ACM, 2016.
Google Scholar
Nan-Chen Hsieh and LunPing Hung. A data driven ensemble classifier for credit scoring analysis. Expert systems with Applications, 37(1):534–545, 2010.
Article MathSciNet Google Scholar
Steven Finlay. Multiple classifier architectures and their application to credit risk assessment. European Journal of Operational Research, 210(2):368–378, 2011.
Article Google Scholar
Gang Wang, Jinxing Hao, Jian Ma, and Hongbing Jiang. A comparative assessment of ensemble learning for credit scoring. Expert systems with applications, 38(1):223–230, 2011.
Article Google Scholar
Gang Wang, Jian Ma, Lihua Huang, and Kaiquan Xu. Two credit scoring models based on dual strategy ensemble trees. Knowledge-Based Systems, 26:61–68, 2012.
Article Google Scholar
AI Marqu´es, Vicente Garc´ıa, and Javier Salvador Sanchez. Exploring the behaviour of base classifiers in credit scoring ensembles. Expert Systems with Applications, 39(11):10244–10250, 2012.
Article Google Scholar
AI Marqu´es, Vicente Garc´ıa, and Javier Salvador Sanchez. Two-level classifier ensembles for credit risk assessment. Expert Systems with Applications, 39(12):10916–10922, 2012.
Article Google Scholar
Thomas G Dietterich. An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and random- ization. Machine learning, 40(2):139–157, 2000.
Article Google Scholar
Jiawei Han, Jian Pei, and Micheline Kamber. Data mining: concepts and techniques. Elsevier, 2011.
Google Scholar
Keinosuke Fukunaga. Introduction to statistical pattern recognition. Academic press, 2013.
Google Scholar
Harry Zhang. The optimality of naive bayes. AA, 1(2):3, 2004.
Google Scholar
S Rasoul Safavian and David Landgrebe. A survey of decision tree classifier methodology. IEEE transactions on systems, man, and cybernetics, 21(3):660–674, 1991.
Article MathSciNet Google Scholar
Nello Cristianini and John Shawe-Taylor. An introduction to support vector machines and other kernel-based learning methods. Cambridge university press, 2000.
Google Scholar
default of credit card clients data set. https://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients.

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, National Institute of Technology, Tiruchirappalli, Tamil Nadu, India
B. Emil Richard Singh & E. Sivasankar

Authors

B. Emil Richard Singh
View author publications
You can also search for this author in PubMed Google Scholar
E. Sivasankar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to B. Emil Richard Singh .

Editor information

Editors and Affiliations

School of Computer and Information Sciences, University of Hyderabad, Hyderabad, Telangana, India
Raju Surampudi Bapi
Department Computer Science and Engineering, MLR Institute of Technology, Hyderabad, Telangana, India
Koppula Srinivas Rao
IDRBT, Hyderabad, Telangana, India
Munaga V. N. K. Prasad

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Emil Richard Singh, B., Sivasankar, E. (2019). Enhancing Prediction Accuracy of Default of Credit Using Ensemble Techniques. In: Bapi, R., Rao, K., Prasad, M. (eds) First International Conference on Artificial Intelligence and Cognitive Computing . Advances in Intelligent Systems and Computing, vol 815. Springer, Singapore. https://doi.org/10.1007/978-981-13-1580-0_41

Download citation

DOI: https://doi.org/10.1007/978-981-13-1580-0_41
Published: 05 November 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1579-4
Online ISBN: 978-981-13-1580-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics