A Study of Dimensionality Reduction Techniques with Machine Learning Methods for Credit Risk Prediction

  • E. SivasankarEmail author
  • C. Selvi
  • C. Mala
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 556)


With the huge advancement of financial institution, credit risk prediction assumes a critical part to grant a loan to the customer and helps the financial institution to minimize their misfortunes. Despite the fact that there are different statistical and artificial intelligent methods available, there is no single best strategy for credit risk prediction. In our work, we have used feature selection and feature extraction methods as preprocessing techniques before building a classifier model. To validate the feasibility and effectiveness of our models, three credit data sets are picked namely Australia, German, and Japanese. Experimental results demonstrates that the SVM classifier performs better among several classifier methods, i.e., NB, LogR, DT, and KNN with LDA feature extraction technique. Test result demonstrates that the feature extraction preprocessing technique with base classifiers are the best suited for credit risk prediction.


Feature selection Feature extraction Machine learning Credit risk data set 


  1. 1.
    Marqués Marzal, A.I., García Jiménez, V., Sánchez Garreta, J.S.: Exploring the behaviour of base classifiers in credit scoring ensembles. (2012)Google Scholar
  2. 2.
    Xie, J., Wu, J., Qian, Q.: Feature selection algorithm based on association rules mining method. In: Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on, IEEE (2009) 357–362Google Scholar
  3. 3.
    Wang, G., Ma, J., Yang, S.: An improved boosting based on feature selection for corporate bankruptcy prediction. Expert Systems with Applications 41(5) (2014) 2353–2361Google Scholar
  4. 4.
    Jin, C., Jin, S.W., Qin, L.N.: Attribute selection method based on a hybrid bpnn and pso algorithms. Applied Soft Computing 12(8) (2012) 2147–2155CrossRefGoogle Scholar
  5. 5.
    Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2(4) (2010) 433–459CrossRefGoogle Scholar
  6. 6.
    McLachlan, G.: Discriminant analysis and statistical pattern recognition. Volume 544. John Wiley & Sons (2004)Google Scholar
  7. 7.
    Durand, D., et al.: Risk elements in consumer instalment financing. NBER Books (1941)Google Scholar
  8. 8.
    Karels, G.V., Prakash, A.J.: Multivariate normality and forecasting of business bankruptcy. Journal of Business Finance & Accounting 14(4) (1987) 573–593CrossRefGoogle Scholar
  9. 9.
    Reichert, A.K., Cho, C.C., Wagner, G.M.: An examination of the conceptual issues involved in developing credit-scoring models. Journal of Business & Economic Statistics 1(2) (1983) 101–114Google Scholar
  10. 10.
    West, D.: Neural network credit scoring models. Computers & Operations Research 27(11) (2000) 1131–1152CrossRefzbMATHGoogle Scholar
  11. 11.
    Desai, V.S., Crook, J.N., Overstreet, G.A.: A comparison of neural networks and linear scoring models in the credit union environment. European Journal of Operational Research 95(1) (1996) 24–37CrossRefzbMATHGoogle Scholar
  12. 12.
    Shin, K.s., Han, I.: A case-based approach using inductive indexing for corporate bond rating. Decision Support Systems 32(1) (2001) 41–52Google Scholar
  13. 13.
    Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., Vanthienen, J.: Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society 54(6) (2003) 627–635CrossRefzbMATHGoogle Scholar
  14. 14.
    Tam, K.Y., Kiang, M.Y.: Managerial applications of neural networks: the case of bank failure predictions. Management science 38(7) (1992) 926–947CrossRefzbMATHGoogle Scholar
  15. 15.
    Wang, G., Ma, J.: Study of corporate credit risk prediction based on integrating boosting and random subspace. Expert Systems with Applications 38(11) (2011) 13871–13878Google Scholar
  16. 16.
    Tsai, C.F., Hsu, Y.F., Yen, D.C.: A comparative study of classifier ensembles for bankruptcy prediction. Applied Soft Computing 24 (2014) 977–984CrossRefGoogle Scholar
  17. 17.
    Dash, M., Liu, H.: Feature selection for classification. Intelligent data analysis 1(3) (1997) 131–156CrossRefGoogle Scholar
  18. 18.
    Zhu, Z., Ong, Y.S., Dash, M.: Wrapper–filter feature selection algorithm using a memetic framework. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on 37(1) (2007) 70–76CrossRefGoogle Scholar
  19. 19.
    Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems. (2001) 556–562Google Scholar
  20. 20.
    Han, J., Kamber, M., Pei, J.: Data mining: concepts and techniques. Elsevier (2011)Google Scholar
  21. 21.

Copyright information

© Springer Nature Singapore Pte Ltd. 2017

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringNational Institute of TechnologyTiruchirappalliIndia

Personalised recommendations