Skip to main content

Efficient Techniques for Clustering of Users on Web Log Data

  • Conference paper
  • First Online:
Computational Intelligence in Data Mining

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 556))

Abstract

Web usage mining is one of the essential framework to find domain knowledge from interaction of users with the web. This domain knowledge is used for effective management of predictive websites, creation of adaptive websites, enhancing business and web services, personalization, and so on. In nonprofitable organization’s website it is difficult to identify who are users, what information they need, and their interests change with time. Web usage mining based on log data provides a solution to this problem. The proposed work focuses on web log data preprocessing, sparse matrix construction based on web navigation of each user and clustering the users of similar interests. The performance of web usage mining is also compared based on k-means, X-means and farthest first clustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. S. Jagan, Dr. S.P. Rajagopalan. A Survey on Web Personalization of Web Usage Mining, International Research Journal of Engineering and Technology (IRJET), Volume: 02 Issue: 01 | March-2015. pp. 6–12.

    Google Scholar 

  2. Rachit Adhvaryu, A Review Paper on Web Usage Mining and Pattern Discovery, Journal Of Information, Knowledge And Research In Computer Engineering, Volume – 02, Issue – 02nov 12 To Oct 13, pp. 279–284.

    Google Scholar 

  3. V. Vidyapriya, S. Kalaivani, An Efficient Clustering Technique for Weblogs, IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 2 Issue 7, July 2015. pp. 516–525.

    Google Scholar 

  4. B. Uma Maheswari, Dr. P. Sumathi, A New Clustering and Pre-processing for Web Log Mining, 2014 World Congress on Computing and Communication Technologies, IEEE, pp. 25–29.

    Google Scholar 

  5. Anupama D. S. & Sahana D. Gowda, Clustering Of Web User Sessions To Maintain Occurrence Of Sequence In Navigation Pattern, Second International Symposium on Computer Vision and the Internet (VisionNet’15), Elsevier 2015, pp. 558–564.

    Google Scholar 

  6. V. Chitraa, Antony Selvadoss Thanamani. Web Log Data Analysis by Enhanced Fuzzy C Means Clustering, International Journal on Computational Sciences & Applications (IJCSA), Vol. 4 Issue No. 2, April 2014, pp. 81–95.

    Google Scholar 

  7. J. HuaXu, H. Liu, Web User Clustering Analysis based on KMeans Algorithm, 2010 International Conference on Information Networking and Automation (ICINA) IEEE, pp. 6–9.

    Google Scholar 

  8. K. Santhisree, Dr A. Damodaram, S. Appaji, D. Nagarjuna Devi, Web Usage Data Clustering using Dbscan algorithm and Set similarities, 2010 International Conference on Data Storage and Data Engineering, IEEE, pp. 220–224.

    Google Scholar 

  9. Xidong Wang, Yiming Ouyang, Xuegang Hu, Yan Zhang, Discovery of User Frequent Access Patterns on Web Usage Mining, The 8th International Conference on Computer Supported Cooperative Work in Design Proceedings, 2003 IEEE, pp. 765–769.

    Google Scholar 

  10. LI Wei, ZHU Yu-quan, CHEN Geng, YANG Zhong, Clustering Of Web Users Based on Competitive Agglomeration, 2008 International Symposium on Computational Intelligence and Design, IEEE, pp. 515–519.

    Google Scholar 

  11. Xinran Yu & Turgay Korkmaz, Finding the Most Evident Co-Clusters on Web Log Dataset Using Frequent Super Sequence Mining., 2014 August, 13–15, San Francisco, California, USA. pp. 529–536.

    Google Scholar 

  12. T. Nadana Ravishankar & Dr. R. Shriram, Mining Web Log Files Using Self-Organizing Map and K-Means Clustering Methods, ICAREM.

    Google Scholar 

  13. Mohammed Hamed Ahmed Elhiber & Ajith Abraham., Access Patterns in Web Log Data: A Review. Journal of Network and Innovative Computing, ISSN 2160-2174, Volume 1 (2013), pp. 348–355.

    Google Scholar 

  14. J. Xiao, Y. Zhang, X. Jia & T. Li, Measuring Similarity of Interests for Clustering Web-Users, Proc. of the 12th Australian Database Conference 2001 (ADC’20OI) IEEE, Australia, 29 January - 2 February, 2001, pp. 107–114.

    Google Scholar 

  15. V. Sujatha & Punitha Valli, Improved User Navigation Pattern Prediction Technique From Web Log Data, International Conference on Communication Technology and System Design, 2011 Published by Elsevier Ltd, pp. 92–99.

    Google Scholar 

  16. J. Xiao, Y. Zhang, Clustering of Web Users Using Session-based Similarity Measures, Proc. of the 12th Australian Database Conference 2001 (ADC’20OI) IEEE, Gold Coast, Australia, 29 January - 2 February, 2001, pp. 223–228.

    Google Scholar 

Download references

Acknowledgements

We would like to thank our college Sree Vidyanikethan Engineering college, Tirupathi for providing valuable resources and encouragement to do research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. Dhana Lakshmi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Dhana Lakshmi, P., Ramani, K., Eswara Reddy, B. (2017). Efficient Techniques for Clustering of Users on Web Log Data. In: Behera, H., Mohapatra, D. (eds) Computational Intelligence in Data Mining. Advances in Intelligent Systems and Computing, vol 556. Springer, Singapore. https://doi.org/10.1007/978-981-10-3874-7_35

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3874-7_35

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3873-0

  • Online ISBN: 978-981-10-3874-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics