Abstract
Implicit web usage data is sparse and noisy and cannot be used for usage clustering unless passed through a sophisticated pre-processing phase. In this paper we propose a systematic way to analyze and preprocess the web usage data so that data clustering can be applied effectively to extract similar groups of user. We split the entire process into analysis, preprocessing and outlier detection and show the effect of each phase on Java Application Programming Interface (API) documentation usage data that is collected from our server logs. We use the extracted clusters for web based recommender systems and present the accuracy of the recommendations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alam, S.: Intelligent web usage clustering based recommender system. In: Proceedings of the Fifth ACM Conference on Recommender Systems, pp. 367–370. ACM (2011)
Alam, S., Dobbie, G., Riddle, P.: Particle swarm optimization based clustering of web usage data. In: Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 451–454. IEEE Computer Society (2008)
Alam, S., Dobbie, G., Riddle, P.: Exploiting swarm behaviour of simple agents for clustering web users’ session data. In: Cao, L. (ed.) Data Mining and Multi-agent Integration, pp. 61–75. Springer US (2009)
Alam, S., Dobbie, G., Riddle, P., Koh, Y.: Hierarchical pso clustering based recommender system. In: 2012 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2012)
Alam, S., Dobbie, G., Riddle, P., Naeem, M.: A swarm intelligence based clustering approach for outlier detection. In: IEEE Congress on Evolutionary Computation (CEC), pp. 1–7 (2010), doi:10.1109/CEC.2010.5586152
Castellano, G., Fanelli, A.M., Torsello, M.A.: Lodap: a log data preprocessor for mining web browsing patterns. In: Proceedings of the 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and Data Bases, vol. 6, pp. 12–17. World Scientific and Engineering Academy and Society (WSEAS), Stevens Point (2007)
Castro-Herrera, C., Duan, C., Cleland-Huang, J., Mobasher, B.: Using data mining and recommender systems to facilitate large-scale, open, and inclusive requirements elicitation processes. In: Proceedings of the 2008 16th IEEE International Requirements Engineering Conference, pp. 165–168. IEEE Computer Society, Washington, DC (2008), doi:10.1109/RE.2008.47
Cooley, R., Mobasher, B., Srivastava, J., et al.: Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems 1(1), 5–32 (1999)
Gemmell, J., Shepitsen, A., Mobasher, M., Burke, R.: Personalization in Folksonomies Based on Tag Clustering. In: Proceedings of the 6th Workshop on Intelligent Techniques for Web Personalization and Recommender Systems (2008)
Mobasher, B.: Web mining overview. In: Encyclopedia of Data Warehousing and Mining, pp. 2085–2089. IGI Global (2009)
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Recommender systems for large-scale e-commerce: Scalable neighborhood formation using clustering. In: Proceedings of the Fifth International Conference on Computer and Information Technology (2002)
Tanasa, D.: Web usage mining: Contributions to intersites logs preprocessing and sequential pattern extraction with low support. PhD thesis (2005)
Tanasa, D., Trousse, B.: Advanced data preprocessing for intersites web usage mining. IEEE Intelligent Systems 19, 59–65 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Alam, S., Dobbie, G., Riddle, P., Koh, Y.S. (2013). Analysis of Web Usage Data for Clustering Based Recommender System. In: Pérez, J., et al. Trends in Practical Applications of Agents and Multiagent Systems. Advances in Intelligent Systems and Computing, vol 221. Springer, Cham. https://doi.org/10.1007/978-3-319-00563-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-00563-8_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-00562-1
Online ISBN: 978-3-319-00563-8
eBook Packages: EngineeringEngineering (R0)