Abstract
Many systems attempt to forecast user navigation in the Internet through the use of past behavior, preferences and environmental factors. Most of these models overlook the possibility that users may have many diverse sets of preferences. For example, the same person may search for information in different ways at night (when they are pursuing their hobbies and interests) as opposed to during the day (when they are at work). Thus, most users may well have different sets of preferences at different times of the day and behave differently in accordance with those preferences. In this paper, we present clustering methods for creating time dependent models to predict user navigation patterns; these methods allow us to segment log files into appropriate groups of navigation behaviour. The benefits of these methods over more established methods are highlighted. An empirical analysis is carried out on a sample of usage logs for Wireless Application Protocol (WAP) browsing as empirical support for the technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderson, C.R., Domingos, P., Weld, D.S.: Adaptive Web Navigation for Wireless Devices. In: Proceedings of Seventeenth International Joint Conference on Artificial Intelligence, IJCAI 2001 (2001)
Begole, J., Tang, J.C., Hill, B.: Rhythm Modeling, Visualizations and Applications. In: Proceedings of the 2003 Symposium on User Interface Software and Technology (UIST 2003), pp. 11–20 (2003)
Beitzel, S., Jensen, E., Chowdhury, A., Grossman, D., Frieder, O.: HoURLy analysis of a very large topically categorized web query log. In: Proceedings of the 27th annual international conference on Research and development in information retrieval, pp. 321–328 (2004)
Billsus, D., Brunk, C., Evans, C., Gladish, B., Pazzani, M.: Adaptive Interfaces for Ubiquitous Web Access. Communications of the ACM 45(5) (2002)
Cooley, R., Mobasher, B., Srivatava, J.: Web Mining: Information and Pattern Discovery on the World Wide Web. In: Proceedings of the IEEE International Conference on Tools with Artificial Intelligence (ICTAI 1997), Newport Beach, CA (November 1997)
Cotter, P., Smyth, B.: PTV: Intelligent Personalised TV Guides. In: Proceedings of the 12th Innovative Applications of Artificial Intelligence (IAAI 2000) Conference. AAAI Press, Menlo Park (2000)
Etzioni, O.: The World-Wide Web: quagmire or gold mine? Communications of the ACM 39(11), 65–68
Halvey, M., Keane, M.T., Smyth, B.: Mobile Web Surfing is the same as Web Surfing. Communications of the ACM (2005) (Accepted; In Press)
Halvey, M., Keane, M.T., Smyth, B.: Predicting Navigation Patterns on the Mobile-Internet using Time of the Week. In: World Wide Web 2005 (2005) (Accepted; In Press)
Horvitz, E., Koch, P., Kadie, C.M., Jacobs, A.: Coordinate: Probabilistic Forecasting of Presence and Availability. In: Proceedings of the Eighteenth Conference on Uncertainty and Artificial Intelligence, Edmonton, Alberta, pp. 224–233. Morgan Kaufmann Publishers, San Francisco (2002)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall advanced reference series. Prentice-Hall, Inc., Upper Saddle River (1988)
Jain, A., Murty, M.N., Flynn, P.: Data clustering: A review. ACM Computing Surveys 31(3), 264–323 (1999)
Lau, T., Horvitz, E.: Patterns of Search: Analyzing and Modeling Web Query Refinement. In: Proceedings of the Seventh International Conference on User Modeling (1999)
Letizia, L.H.: An Agent That Assists Web Browsing. In: International Joint Conference on Artificial Intelligence, Montreal (August 1995)
Pirolli, P.: Distributions of Surfers Paths through the World Wide Web: Empirical Characterizations. The Web Journal 2, 29–45 (1998)
Ramsay, M., Nielsen, J.: Nielsen Report. WAP Usability Deja Vu: 1994 All Over Again (2000)
Silverman, J.F., Cooper, D.B.: Bayesian clustering for unsupervised estimation of surface and texture models. IEEE Trans. Pattern Anal. Mach. Intell. 10, 482–495 (1998)
Smyth, B., Cotter, P.: The Plight of the Navigator: Solving the Navigation Problem for Wireless Portals. In: De Bra, P., Brusilovsky, P., Conejo, R. (eds.) AH 2002. LNCS, vol. 2347, pp. 328–337. Springer, Heidelberg (2002)
Spiliopoulou, M.: The laborious way from data mining to Web log mining. International Journal of Computer Systems Science and Engineering 14(2), 113–125 (1999)
Wu, Z., Leahy, R.: An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 15, 1101–1113 (1993)
Zhu, J., Hong, J., Hughes, J.G.: Using Markov models for web site link prediction. In: ACM Conference on Hypertext/Hypermedia (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Halvey, M., Keane, M.T., Smyth, B. (2005). Birds of a Feather Surf Together: Using Clustering Methods to Improve Navigation Prediction from Internet Log Files. In: Perner, P., Imiya, A. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2005. Lecture Notes in Computer Science(), vol 3587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11510888_18
Download citation
DOI: https://doi.org/10.1007/11510888_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26923-6
Online ISBN: 978-3-540-31891-0
eBook Packages: Computer ScienceComputer Science (R0)