Efficient Techniques for Clustering of Users on Web Log Data

Dhana Lakshmi, P.; Ramani, K.; Eswara Reddy, B.

doi:10.1007/978-981-10-3874-7_35

P. Dhana Lakshmi¹⁶,
K. Ramani¹⁶ &
B. Eswara Reddy¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 556))

1259 Accesses
2 Citations

Abstract

Web usage mining is one of the essential framework to find domain knowledge from interaction of users with the web. This domain knowledge is used for effective management of predictive websites, creation of adaptive websites, enhancing business and web services, personalization, and so on. In nonprofitable organization’s website it is difficult to identify who are users, what information they need, and their interests change with time. Web usage mining based on log data provides a solution to this problem. The proposed work focuses on web log data preprocessing, sparse matrix construction based on web navigation of each user and clustering the users of similar interests. The performance of web usage mining is also compared based on k-means, X-means and farthest first clustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

S. Jagan, Dr. S.P. Rajagopalan. A Survey on Web Personalization of Web Usage Mining, International Research Journal of Engineering and Technology (IRJET), Volume: 02 Issue: 01 | March-2015. pp. 6–12.
Google Scholar
Rachit Adhvaryu, A Review Paper on Web Usage Mining and Pattern Discovery, Journal Of Information, Knowledge And Research In Computer Engineering, Volume – 02, Issue – 02nov 12 To Oct 13, pp. 279–284.
Google Scholar
V. Vidyapriya, S. Kalaivani, An Efficient Clustering Technique for Weblogs, IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 2 Issue 7, July 2015. pp. 516–525.
Google Scholar
B. Uma Maheswari, Dr. P. Sumathi, A New Clustering and Pre-processing for Web Log Mining, 2014 World Congress on Computing and Communication Technologies, IEEE, pp. 25–29.
Google Scholar
Anupama D. S. & Sahana D. Gowda, Clustering Of Web User Sessions To Maintain Occurrence Of Sequence In Navigation Pattern, Second International Symposium on Computer Vision and the Internet (VisionNet’15), Elsevier 2015, pp. 558–564.
Google Scholar
V. Chitraa, Antony Selvadoss Thanamani. Web Log Data Analysis by Enhanced Fuzzy C Means Clustering, International Journal on Computational Sciences & Applications (IJCSA), Vol. 4 Issue No. 2, April 2014, pp. 81–95.
Google Scholar
J. HuaXu, H. Liu, Web User Clustering Analysis based on KMeans Algorithm, 2010 International Conference on Information Networking and Automation (ICINA) IEEE, pp. 6–9.
Google Scholar
K. Santhisree, Dr A. Damodaram, S. Appaji, D. Nagarjuna Devi, Web Usage Data Clustering using Dbscan algorithm and Set similarities, 2010 International Conference on Data Storage and Data Engineering, IEEE, pp. 220–224.
Google Scholar
Xidong Wang, Yiming Ouyang, Xuegang Hu, Yan Zhang, Discovery of User Frequent Access Patterns on Web Usage Mining, The 8th International Conference on Computer Supported Cooperative Work in Design Proceedings, 2003 IEEE, pp. 765–769.
Google Scholar
LI Wei, ZHU Yu-quan, CHEN Geng, YANG Zhong, Clustering Of Web Users Based on Competitive Agglomeration, 2008 International Symposium on Computational Intelligence and Design, IEEE, pp. 515–519.
Google Scholar
Xinran Yu & Turgay Korkmaz, Finding the Most Evident Co-Clusters on Web Log Dataset Using Frequent Super Sequence Mining., 2014 August, 13–15, San Francisco, California, USA. pp. 529–536.
Google Scholar
T. Nadana Ravishankar & Dr. R. Shriram, Mining Web Log Files Using Self-Organizing Map and K-Means Clustering Methods, ICAREM.
Google Scholar
Mohammed Hamed Ahmed Elhiber & Ajith Abraham., Access Patterns in Web Log Data: A Review. Journal of Network and Innovative Computing, ISSN 2160-2174, Volume 1 (2013), pp. 348–355.
Google Scholar
J. Xiao, Y. Zhang, X. Jia & T. Li, Measuring Similarity of Interests for Clustering Web-Users, Proc. of the 12th Australian Database Conference 2001 (ADC’20OI) IEEE, Australia, 29 January - 2 February, 2001, pp. 107–114.
Google Scholar
V. Sujatha & Punitha Valli, Improved User Navigation Pattern Prediction Technique From Web Log Data, International Conference on Communication Technology and System Design, 2011 Published by Elsevier Ltd, pp. 92–99.
Google Scholar
J. Xiao, Y. Zhang, Clustering of Web Users Using Session-based Similarity Measures, Proc. of the 12th Australian Database Conference 2001 (ADC’20OI) IEEE, Gold Coast, Australia, 29 January - 2 February, 2001, pp. 223–228.
Google Scholar

Download references

Acknowledgements

We would like to thank our college Sree Vidyanikethan Engineering college, Tirupathi for providing valuable resources and encouragement to do research.

Author information

Authors and Affiliations

Sree Vidyanikethan Engineering College, Tirupathi, India
P. Dhana Lakshmi & K. Ramani
JNTUA College of Engineering Kalikiri, Kalikiri, India
B. Eswara Reddy

Authors

P. Dhana Lakshmi
View author publications
You can also search for this author in PubMed Google Scholar
K. Ramani
View author publications
You can also search for this author in PubMed Google Scholar
B. Eswara Reddy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Dhana Lakshmi .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering & Information Technology, Veer Surendra Sai University of Technology, Sambalpur, Odisha, India
Himansu Sekhar Behera
Department of CSE, National Institute of Technology (NIT), Rourkela, Odisha, India
Durga Prasad Mohapatra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dhana Lakshmi, P., Ramani, K., Eswara Reddy, B. (2017). Efficient Techniques for Clustering of Users on Web Log Data. In: Behera, H., Mohapatra, D. (eds) Computational Intelligence in Data Mining. Advances in Intelligent Systems and Computing, vol 556. Springer, Singapore. https://doi.org/10.1007/978-981-10-3874-7_35

Download citation

DOI: https://doi.org/10.1007/978-981-10-3874-7_35
Published: 20 May 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3873-0
Online ISBN: 978-981-10-3874-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics