Mining Customer Behavior in Trial Period of a Web Application Usage—Case Study

  • Goran Matošević
  • Vanja Bevanda
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 464)


This paper proposes models for predicting whether a customer will convert from a trial account to a full paid account of a web application. Two models are proposed: one focused on the content of the application and one focused on time. To predict customer behavior, data is extracted from the web application's usage log for the trial period and processed with data mining techniques. Both the content-based and the time-based model use the same selected classification algorithms: decision trees, Naïve Bayes, k-Nearest Neighbors, and One Rule classification. Additionally, the k-means clustering algorithm is used to test whether two clusters (converted and not-converted users) can be formed and used for classification. The results show high accuracy of the classification algorithms at an early stage of the trial period, which can serve as a basis for identifying users who are likely to abandon the application and not convert.
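As an illustration of the simplest of the listed classifiers, the following is a minimal sketch of One Rule classification on trial usage data. It is not the authors' implementation: the feature names (used_reports, logins_first_week), the data rows, and the function names train_one_rule and predict are all hypothetical and chosen only to show the idea. One Rule picks the single feature whose value-to-majority-class mapping makes the fewest errors on the training data.

```python
from collections import Counter, defaultdict

def train_one_rule(rows, labels):
    """Return (feature_index, rule, default_label) for the single feature
    whose value -> majority-class rule makes the fewest training errors."""
    default = Counter(labels).most_common(1)[0][0]
    best = None
    for f in range(len(rows[0])):
        # Count class labels for every value of feature f.
        counts = defaultdict(Counter)
        for row, label in zip(rows, labels):
            counts[row[f]][label] += 1
        # Map each feature value to its majority class.
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        errors = sum(rule[row[f]] != label for row, label in zip(rows, labels))
        if best is None or errors < best[0]:
            best = (errors, f, rule)
    _, feature, rule = best
    return feature, rule, default

def predict(model, row):
    """Apply the learned rule; fall back to the overall majority class
    when the feature value was never seen during training."""
    feature, rule, default = model
    return rule.get(row[feature], default)

# Hypothetical trial-period data: (used_reports, logins_first_week).
rows = [
    ("yes", "high"), ("yes", "low"), ("yes", "high"), ("no", "high"),
    ("no", "low"), ("no", "low"), ("yes", "low"), ("no", "high"),
]
labels = [
    "converted", "converted", "converted", "not_converted",
    "not_converted", "not_converted", "converted", "converted",
]

model = train_one_rule(rows, labels)
print(model[0], model[1])
print(predict(model, ("no", "high")))
```

On this made-up data the rule is built on the first feature, because it misclassifies only one training row while the second feature misclassifies more. A real study would evaluate the rule on held-out users rather than on the training rows.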


Keywords: Web usage mining; Customer conversions; Web application usage; Trial conversion



Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. Faculty of Economics and Tourism “Dr. Mijo Mirković”, Juraj Dobrila University of Pula, Pula, Croatia
