Ecommerce Fraud Detection Through Fraud Islands and Multi-layer Machine Learning Model

  • Jay Nanduri
  • Yung-Wen Liu
  • Kiyoung Yang
  • Yuting JiaEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1130)


Main challenge for e-commerce transaction fraud prevention is that fraud patterns are rather dynamic and diverse. This paper introduces two innovative methods, fraud islands (link analysis) and multi-layer machine learning model, which can effectively tackle the challenge of detecting diverse fraud patterns. Fraud Islands are formed using link analysis to investigate the relationships between different fraudulent entities and to uncover the hidden complex fraud patterns through the formed network. Multi-layer model is used to deal with the largely diverse nature of fraud patterns. Currently, the fraud labels are determined through different channels which are banks’ declination decision, manual review agents’ rejection decisions, banks’ fraud alert and customers’ chargeback requests. It can be reasonably assumed that different fraud patterns could be caught though different fraud risk prevention forces (i.e. bank, manual review team and fraud machine learning model). The experiments showed that by integrating few different machine learning models which were trained using different types of fraud labels, the accuracy of fraud decisions can be significantly improved.


Ecommerce fraud detection Multi-layer machine learning model Link analysis Chargebacks Fraud islands 


  1. 1.
    Kou, Y., Lu, C.T., Sirwongwattana, S., Huang, Y.P.: Survey of fraud detection techniques. In: IEEE International Conference on Networking, Sensing and Control, vol. 2, pp. 749–754. IEEE (2004)Google Scholar
  2. 2.
    Wang, S., Liu, C., Gao, X., Qu, H., Xu, W.: Session-based fraud detection in online e-commerce transactions using recurrent neural networks. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 241–252. Springer, Cham, September 2017Google Scholar
  3. 3.
    Şahin, Y.G., Duman, E.: Detecting credit card fraud by decision trees and support vector machines (2011)Google Scholar
  4. 4.
    Minastireanu, E.A., Mesnita, G.: An analysis of the most used machine learning algorithms for online fraud detection. Inform. Econ. 23(1), 5–16 (2019)Google Scholar
  5. 5.
  6. 6.
    Shah, N., Lamba, H., Beutel, A., Faloutsos, C.: The many faces of link fraud. In: 2017 IEEE International Conference on Data Mining (ICDM), pp. 1069–1074. IEEE, November 2017Google Scholar
  7. 7.
    Sadowski, G., Rathle, P.: Fraud detection: discovering connections with graph databases. White Paper-Neo Technology-Graphs are Everywhere (2014)Google Scholar
  8. 8.
    Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM (JACM) 46(5), 604–632 (1999)MathSciNetCrossRefGoogle Scholar
  9. 9.
    Jurczyk, P., Agichtein, E.: Discovering authorities in question answer communities by using link analysis. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, pp. 919–922. ACM, November 2007Google Scholar
  10. 10.
    Vanneschi, L., Castelli, M.: Multilayer perceptrons. Encycl. Bioinform. Comput. Biol. 1, 612–620 (2019)Google Scholar
  11. 11.
    Svozila, D., Kvasnickab, V., Pospichalb, J.: Introduction to multi-layer feed-forward neural networks. Chemometr. Intell. Lab. Syst. 39(1), 43–62 (1997)CrossRefGoogle Scholar
  12. 12.
    Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)CrossRefGoogle Scholar
  13. 13.
    Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: 22nd SIGKDD Conference on Knowledge Discovery and Data Mining (2016)Google Scholar
  14. 14.
    Volkovs, M., Yu, G., Poutanen, T.: Content-based neighbor models for cold start in recommender systems. In: Proceeding of RecSys Challenge 2017, Proceedings of the Recommender Systems Challenge 2017, Article no. 7 (2017)Google Scholar
  15. 15.
    “Machine Learning Challenge Winning Solutions” in “Awesome XGBoost”. Accessed 14 June 2019
  16. 16.
    Xia, Y., Liu, C., Da, B., Xie, F.: A novel heterogeneous ensemble credit scoring model based on bstacking approach. Expert. Syst. Appl. 93, 182–199 (2017)CrossRefGoogle Scholar
  17. 17.
    Apache Spark Homepage. Accessed 14 June 2019
  18. 18.
    Azure Machine Learning Service Homepage. Accessed 14 June 2019

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Jay Nanduri
    • 1
  • Yung-Wen Liu
    • 1
  • Kiyoung Yang
    • 1
  • Yuting Jia
    • 1
    Email author
  1. 1.Dynamics 365 Fraud Protection, MicrosoftRedmondUSA

Personalised recommendations