Skip to main content

Analysis of Web Usage Data for Clustering Based Recommender System

  • Conference paper
Trends in Practical Applications of Agents and Multiagent Systems

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 221))

Abstract

Implicit web usage data is sparse and noisy and cannot be used for usage clustering unless passed through a sophisticated pre-processing phase. In this paper we propose a systematic way to analyze and preprocess the web usage data so that data clustering can be applied effectively to extract similar groups of user. We split the entire process into analysis, preprocessing and outlier detection and show the effect of each phase on Java Application Programming Interface (API) documentation usage data that is collected from our server logs. We use the extracted clusters for web based recommender systems and present the accuracy of the recommendations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alam, S.: Intelligent web usage clustering based recommender system. In: Proceedings of the Fifth ACM Conference on Recommender Systems, pp. 367–370. ACM (2011)

    Google Scholar 

  2. Alam, S., Dobbie, G., Riddle, P.: Particle swarm optimization based clustering of web usage data. In: Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 451–454. IEEE Computer Society (2008)

    Google Scholar 

  3. Alam, S., Dobbie, G., Riddle, P.: Exploiting swarm behaviour of simple agents for clustering web users’ session data. In: Cao, L. (ed.) Data Mining and Multi-agent Integration, pp. 61–75. Springer US (2009)

    Google Scholar 

  4. Alam, S., Dobbie, G., Riddle, P., Koh, Y.: Hierarchical pso clustering based recommender system. In: 2012 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2012)

    Google Scholar 

  5. Alam, S., Dobbie, G., Riddle, P., Naeem, M.: A swarm intelligence based clustering approach for outlier detection. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1–7 (2010), doi:10.1109/CEC.2010.5586152

    Google Scholar 

  6. Castellano, G., Fanelli, A.M., Torsello, M.A.: Lodap: a log data preprocessor for mining web browsing patterns. In: Proceedings of the 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and Data Bases, vol. 6, pp. 12–17. World Scientific and Engineering Academy and Society (WSEAS), Stevens Point (2007)

    Google Scholar 

  7. Castro-Herrera, C., Duan, C., Cleland-Huang, J., Mobasher, B.: Using data mining and recommender systems to facilitate large-scale, open, and inclusive requirements elicitation processes. In: Proceedings of the 2008 16th IEEE International Requirements Engineering Conference, pp. 165–168. IEEE Computer Society, Washington, DC (2008), doi:10.1109/RE.2008.47

    Chapter  Google Scholar 

  8. Cooley, R., Mobasher, B., Srivastava, J., et al.: Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems 1(1), 5–32 (1999)

    Article  Google Scholar 

  9. Gemmell, J., Shepitsen, A., Mobasher, M., Burke, R.: Personalization in Folksonomies Based on Tag Clustering. In: Proceedings of the 6th Workshop on Intelligent Techniques for Web Personalization and Recommender Systems (2008)

    Google Scholar 

  10. Mobasher, B.: Web mining overview. In: Encyclopedia of Data Warehousing and Mining, pp. 2085–2089. IGI Global (2009)

    Google Scholar 

  11. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Recommender systems for large-scale e-commerce: Scalable neighborhood formation using clustering. In: Proceedings of the Fifth International Conference on Computer and Information Technology (2002)

    Google Scholar 

  12. Tanasa, D.: Web usage mining: Contributions to intersites logs preprocessing and sequential pattern extraction with low support. PhD thesis (2005)

    Google Scholar 

  13. Tanasa, D., Trousse, B.: Advanced data preprocessing for intersites web usage mining. IEEE Intelligent Systems 19, 59–65 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shafiq Alam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Alam, S., Dobbie, G., Riddle, P., Koh, Y.S. (2013). Analysis of Web Usage Data for Clustering Based Recommender System. In: Pérez, J., et al. Trends in Practical Applications of Agents and Multiagent Systems. Advances in Intelligent Systems and Computing, vol 221. Springer, Cham. https://doi.org/10.1007/978-3-319-00563-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-00563-8_21

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-00562-1

  • Online ISBN: 978-3-319-00563-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics