An Efficient Method for Detecting Fraudulent Transactions Using Classification Algorithms on an Anonymized Credit Card Data Set

Manlangit, Sylvester; Azam, Sami; Shanmugam, Bharanidharan; Kannoorpatti, Krishnan; Jonkman, Mirjam; Balasubramaniam, Arasu

doi:10.1007/978-3-319-76348-4_41

An Efficient Method for Detecting Fraudulent Transactions Using Classification Algorithms on an Anonymized Credit Card Data Set

Sylvester Manlangit¹⁸,
Sami Azam¹⁸,
Bharanidharan Shanmugam¹⁸,
Krishnan Kannoorpatti¹⁸,
Mirjam Jonkman¹⁸ &
…
Arasu Balasubramaniam¹⁹

Conference paper
First Online: 22 March 2018

1860 Accesses
3 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 736))

Abstract

Credit card fraudulent transactions are causing businesses and banks to lose time and money. Detecting fraudulent transactions before a transaction is finalized will help businesses and banks to save resources. This research aims to compare the fraud detection accuracy of different sampling techniques and classification algorithms. An efficient method of detecting fraud using machine learning is proposed. Anonymized data set from Kaggle was used for detecting fraudulent transactions. Each transaction has been labeled as either a fraudulent transaction or not. The severe imbalance between fraud and non-fraudulent data caused the algorithms to under-perform. This was addressed with the application of sampling techniques. The combination of undersampling and SMOTE raised the recall accuracy of the classification algorithm. k-NN algorithm showed the highest recall accuracy compared to the other algorithms.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Jha, S., Westland, J.C.: A descriptive study of credit card fraud pattern. Glob. Bus. Rev. 14, 373–384 (2013)
Article Google Scholar
Liñares-Zegarra, J., Wilson, J.O.S.: Credit card interest rates and risk: new evidence from US survey data. Eur. J. Financ. 20, 892–914 (2014)
Article Google Scholar
Lepoivre, M.R., Avanzini, C.O., Bignon, G., Legendre, L., Piwele, A.K.: Credit card fraud detection with unsupervised algorithms (Report). J. Adv. Inf. Technol. 7, 34 (2016)
Article Google Scholar
Bhattacharyya, S., Jha, S., Tharakunnel, K., Westland, J.C.: Data mining for credit card fraud: a comparative study. Dec. Support Syst. 50, 602–613 (2011)
Article Google Scholar
Prakash, C.: A parameter optimized approach for improving credit card fraud detection. Int. J. Comput. Sci. Issues 10, 360–366 (2013)
Google Scholar
Venkata Ratnam, G., Siva Naga Prasad, M.: Credit card fraud detection using anti-k nearest neighbor algorithm. Int. J. Comput. Sci. Eng. 4, 1035–1039 (2012)
Google Scholar
Correa Bahnsen, A., Aouada, D., Stojanovic, A., Ottersten, B.: Feature engineering strategies for credit card fraud detection. Exp. Syst. Appl. 51, 134–142 (2016)
Article Google Scholar
Dal Pozzolo, A., Caelen, O., Le Borgne, Y.-A., Waterschoot, S., Bontempi, G.: Learned lessons in credit card fraud detection from a practitioner perspective. Exp. Syst. Appl. 41, 4915–4928 (2014)
Article Google Scholar
Lee, Y.J., Yeh, Y.R., Wang, Y.C.F.: Anomaly detection via online oversampling principal component analysis. IEEE Trans. Knowl. Data Eng. 25, 1460–1470 (2013)
Article Google Scholar
http://setosa.io/ev/principal-component-analysis/. Accessed 11 Nov 2017
Dal Pozzolo, A., Caelen, O., Johnson, R.A., Bontempi, G.: Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE Symposium Series on Computational Intelligence, pp. 159–166. IEEE (2015)
Google Scholar
Liu, X.-Y., Wu, J., Zhou, Z.-H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 39, 539–550 (2009)
Article Google Scholar
Akbani, R., Kwek, S., Japkowicz, N.: Applying support vector machines to imbalanced datasets. In: European Conference on Machine Learning, pp. 39–50. Springer, Heidelberg (2004)
Google Scholar
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
MATH Google Scholar
Khyati, C., Bhawna, M.: Exploration of Data mining techniques in fraud detection: credit card. Int. J. Electron. Comput. Sci. Eng. 1, 1765–1771 (2012)
Google Scholar
Chawla, N.V.: Data mining for imbalanced datasets: an overview. In: Data Mining and Knowledge Discovery Handbook, pp. 853–867. Springer, Boston (2005)
Google Scholar
Nadarajan, S., Ramanujam, B.: Encountering imbalance in credit card fraud detection with metaheuristics. Adv. Nat. Appl. Sci. 10, 33–41 (2016)
Google Scholar
Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2, 18–22 (2002)
Google Scholar
Le Cessie, S., Van Houwelingen, J.C.: Ridge estimators in logistic regression. Appl. Stat. 41, 191–201 (1992)
Article MATH Google Scholar
Excel Master Series. http://blog.excelmasterseries.com/2014/06/logistic-regression-performed-in-excel.html. Accessed 13 Nov 2017
MedCalc. https://www.medcalc.org/manual/logistic_regression.php. Accessed 13 Nov 2017
Analytics Vidhya. https://www.analyticsvidhya.com/blog/2015/10/basics-logistic-regression/. Accessed 15 Nov 2017
Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Cambridge (2001)
Google Scholar
Analytics Vidhya. https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/. Accessed 15 Nov 2017

Download references

Acknowledgements

We would like to thank School of Engineering and IT, Charles Darwin University for providing funding and assistance for this research.

Author information

Authors and Affiliations

School of Engineering and IT, Charles Darwin University, Darwin, 0909, Australia
Sylvester Manlangit, Sami Azam, Bharanidharan Shanmugam, Krishnan Kannoorpatti & Mirjam Jonkman
Cookie Analytix Pvt. Ltd., Bengaluru, India
Arasu Balasubramaniam

Authors

Sylvester Manlangit
View author publications
You can also search for this author in PubMed Google Scholar
Sami Azam
View author publications
You can also search for this author in PubMed Google Scholar
Bharanidharan Shanmugam
View author publications
You can also search for this author in PubMed Google Scholar
Krishnan Kannoorpatti
View author publications
You can also search for this author in PubMed Google Scholar
Mirjam Jonkman
View author publications
You can also search for this author in PubMed Google Scholar
Arasu Balasubramaniam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sami Azam .

Editor information

Editors and Affiliations

Machine Intelligence Research Labs , Auburn, Washington, USA
Ajith Abraham
Department of Computer Science, South Asian University, Chanakyapuri, Delhi, India
Pranab Kr. Muhuri
Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka , Durian Tunggal, Melaka, Malaysia
Azah Kamilah Muda
Machine Intelligence Research Labs , Auburn, Washington, USA
Niketa Gandhi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Manlangit, S., Azam, S., Shanmugam, B., Kannoorpatti, K., Jonkman, M., Balasubramaniam, A. (2018). An Efficient Method for Detecting Fraudulent Transactions Using Classification Algorithms on an Anonymized Credit Card Data Set. In: Abraham, A., Muhuri, P., Muda, A., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2017. Advances in Intelligent Systems and Computing, vol 736. Springer, Cham. https://doi.org/10.1007/978-3-319-76348-4_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-76348-4_41
Published: 22 March 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76347-7
Online ISBN: 978-3-319-76348-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics