Abstract
Lending plays a key role in economy from early civilization. One of the most important issue in lending business is to measure the risk that the borrower will default or delay in loan payment. This is called credit risk. After Lehman shock in 2008–2009, big banks increased verification for lending operation to reduce risk. As borrowing from established financial institutions is getting harder, social lending also called Peer-to-Peer (P2P) lending, is becoming the popular trend. Because the client information at P2P lending is not sufficient as in traditional financial system, big data and machine learning become the default methods for analyzing credit risk. However, cost of computation and the problem of training the classifier with imbalance data affect the quality of result. This paper proposes a machine learning model with feature selection to measure credit risk of individual borrower on P2P lending. Based on our experimental results, we showed that the credit risk prediction for P2P lending can be improved using Logistic Regression in addition to proper feature selection.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
McWaters, J., et al.: The Future of Financial Services: How Disruptive Innovations are Reshaping the Way Financial Services are Structured. Provisioned and Consumed. World Economic Forum (2015)
Thomas, L.C.: A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers. Int. J. Forecast. 16(2), 149–172 (2000)
Sandberg, M.: Credit Risk Evaluation using Machine Learning (2017)
Birla, S., Kohli, K., Dutta, A.: Machine learning on imbalanced data in credit risk. In: 2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), pp. 1–6. IEEE, Vancouver (2016). https://doi.org/10.1109/iemcon.2016.7746326
Brown, I., Mues, C.: An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Syst. Appl. 39(3), 3446–3453 (2012)
Ashcraft, A.B., Schuermann, T.: Understanding the securitization of subprime mortgage credit. Found. Trends Finan. 2(3), 191–309 (2008)
Board, Financial Stability, FinTech Credit: Market Structure, Business Models and Financial Stability Implications. Financial Stability Board, Basel (2017)
John, C.W.: A note on the comparison of logit and discriminant models of consumer credit behavior. J. Finan. Quant. Anal. 15(3), 757–770 (1980)
Dong, G., Lai, K.K., Yen, J.: Credit scorecard based on logistic regression with random coefficients. Procedia Comput. Sci. 1(1), 2463–2468 (2010)
Malekipirbazari, M., Aksakalli, V.: Risk assessment in social lending via random forests. Expert Syst. Appl. 42(10), 4621–4631 (2015)
LendingClub, June 2018. https://www.lendingclub.com/info/download-data.action
Peng, H., Fuhui, L., Chris, D.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1226–1238 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, SF., Chakraborty, G., Li, LH. (2019). Feature Selection on Credit Risk Prediction for Peer-to-Peer Lending. In: Kojima, K., Sakamoto, M., Mineshima, K., Satoh, K. (eds) New Frontiers in Artificial Intelligence. JSAI-isAI 2018. Lecture Notes in Computer Science(), vol 11717. Springer, Cham. https://doi.org/10.1007/978-3-030-31605-1_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-31605-1_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31604-4
Online ISBN: 978-3-030-31605-1
eBook Packages: Computer ScienceComputer Science (R0)