An Efficient Web Server Log Analysis Using Genetic Algorithm-Based Preprocessing

  • Naresh Kumar Kar
  • Megha Mishra
  • Subhash Chandra Shrivastava
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 103)


Popularity of web increases day by day, everything associated with daily life of human being all are connected to web and most of the people spending their time over the web through social networking web apps for their purchasing. Web server stores every activity of users in the form of logs, which contain very useful patterns, henceforth web server log analysis is the vital research area. Web log data analysis has primary step is preprocessing which is meant for dimensionality reduction because web log data is hefty in size and need to normalize the data for further cognitive analysis or other data analysis. Bulky data size degrades the performance of data analytic algorithm so there is necessity of an efficient algorithm for preprocessing over web server log data. In this paper, we put emphasis on data preprocessing. We have proposed the use of genetic algorithm for dimension reduction and normalization of input web server log data. Experimental result shows the data preprocessed data produces higher precision value, precision calculated using MATLAB 2016 classification learner tool.


Genetic ROC TPR FNR Confusion matrix WM URL 


  1. 1.
    Rao RJ, Stewart C, Perez A, Renganathan SM, Assessing learning behavior and cognitive bias from web logs. In: 2018 IEEE Frontiers in education conference (FIE).
  2. 2.
    Vinay V, Wood K, Milic-Frayling N (2005) A comparison of dimensionality reduction techniques for text retrieval. In: Proceedings of the fourth international conference on machine learning and applications (ICMLA’05). 0-7695-2495-8/05 $20.00 ©Google Scholar
  3. 3.
    Mehra J, Thakur RS (2018) An Effective method for web log preprocessing and page access frequency using web usage mining. Int J Appl Eng Res 13(2):1227-1232. ISSN 0973-4562 © Research India Publications.
  4. 4.
    Xiang S, Zhong Z, Ding K (2015) Multicluster spatial—spectral unsupervised feature selection for hyperspectral image classification. IEEEGoogle Scholar
  5. 5.
    Miruthula P, Roopa SN (2015) Unsupervised feature selection algorithms: a survey. Int J Sci Res (IJSR)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  • Naresh Kumar Kar
    • 1
  • Megha Mishra
    • 1
  • Subhash Chandra Shrivastava
    • 1
  1. 1.Department of Computer Science and EngineeringRungta College of Engineering and TechnologyBhilaiIndia

Personalised recommendations