Skip to main content

Birds of a Feather Surf Together: Using Clustering Methods to Improve Navigation Prediction from Internet Log Files

  • Conference paper
Book cover Machine Learning and Data Mining in Pattern Recognition (MLDM 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3587))

Abstract

Many systems attempt to forecast user navigation in the Internet through the use of past behavior, preferences and environmental factors. Most of these models overlook the possibility that users may have many diverse sets of preferences. For example, the same person may search for information in different ways at night (when they are pursuing their hobbies and interests) as opposed to during the day (when they are at work). Thus, most users may well have different sets of preferences at different times of the day and behave differently in accordance with those preferences. In this paper, we present clustering methods for creating time dependent models to predict user navigation patterns; these methods allow us to segment log files into appropriate groups of navigation behaviour. The benefits of these methods over more established methods are highlighted. An empirical analysis is carried out on a sample of usage logs for Wireless Application Protocol (WAP) browsing as empirical support for the technique.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anderson, C.R., Domingos, P., Weld, D.S.: Adaptive Web Navigation for Wireless Devices. In: Proceedings of Seventeenth International Joint Conference on Artificial Intelligence, IJCAI 2001 (2001)

    Google Scholar 

  2. Begole, J., Tang, J.C., Hill, B.: Rhythm Modeling, Visualizations and Applications. In: Proceedings of the 2003 Symposium on User Interface Software and Technology (UIST 2003), pp. 11–20 (2003)

    Google Scholar 

  3. Beitzel, S., Jensen, E., Chowdhury, A., Grossman, D., Frieder, O.: HoURLy analysis of a very large topically categorized web query log. In: Proceedings of the 27th annual international conference on Research and development in information retrieval, pp. 321–328 (2004)

    Google Scholar 

  4. Billsus, D., Brunk, C., Evans, C., Gladish, B., Pazzani, M.: Adaptive Interfaces for Ubiquitous Web Access. Communications of the ACM 45(5) (2002)

    Google Scholar 

  5. Cooley, R., Mobasher, B., Srivatava, J.: Web Mining: Information and Pattern Discovery on the World Wide Web. In: Proceedings of the IEEE International Conference on Tools with Artificial Intelligence (ICTAI 1997), Newport Beach, CA (November 1997)

    Google Scholar 

  6. Cotter, P., Smyth, B.: PTV: Intelligent Personalised TV Guides. In: Proceedings of the 12th Innovative Applications of Artificial Intelligence (IAAI 2000) Conference. AAAI Press, Menlo Park (2000)

    Google Scholar 

  7. Etzioni, O.: The World-Wide Web: quagmire or gold mine? Communications of the ACM 39(11), 65–68

    Google Scholar 

  8. Halvey, M., Keane, M.T., Smyth, B.: Mobile Web Surfing is the same as Web Surfing. Communications of the ACM (2005) (Accepted; In Press)

    Google Scholar 

  9. Halvey, M., Keane, M.T., Smyth, B.: Predicting Navigation Patterns on the Mobile-Internet using Time of the Week. In: World Wide Web 2005 (2005) (Accepted; In Press)

    Google Scholar 

  10. Horvitz, E., Koch, P., Kadie, C.M., Jacobs, A.: Coordinate: Probabilistic Forecasting of Presence and Availability. In: Proceedings of the Eighteenth Conference on Uncertainty and Artificial Intelligence, Edmonton, Alberta, pp. 224–233. Morgan Kaufmann Publishers, San Francisco (2002)

    Google Scholar 

  11. Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall advanced reference series. Prentice-Hall, Inc., Upper Saddle River (1988)

    Google Scholar 

  12. Jain, A., Murty, M.N., Flynn, P.: Data clustering: A review. ACM Computing Surveys 31(3), 264–323 (1999)

    Article  Google Scholar 

  13. Lau, T., Horvitz, E.: Patterns of Search: Analyzing and Modeling Web Query Refinement. In: Proceedings of the Seventh International Conference on User Modeling (1999)

    Google Scholar 

  14. Letizia, L.H.: An Agent That Assists Web Browsing. In: International Joint Conference on Artificial Intelligence, Montreal (August 1995)

    Google Scholar 

  15. Pirolli, P.: Distributions of Surfers Paths through the World Wide Web: Empirical Characterizations. The Web Journal 2, 29–45 (1998)

    Google Scholar 

  16. Ramsay, M., Nielsen, J.: Nielsen Report. WAP Usability Deja Vu: 1994 All Over Again (2000)

    Google Scholar 

  17. Silverman, J.F., Cooper, D.B.: Bayesian clustering for unsupervised estimation of surface and texture models. IEEE Trans. Pattern Anal. Mach. Intell. 10, 482–495 (1998)

    Article  Google Scholar 

  18. Smyth, B., Cotter, P.: The Plight of the Navigator: Solving the Navigation Problem for Wireless Portals. In: De Bra, P., Brusilovsky, P., Conejo, R. (eds.) AH 2002. LNCS, vol. 2347, pp. 328–337. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  19. Spiliopoulou, M.: The laborious way from data mining to Web log mining. International Journal of Computer Systems Science and Engineering 14(2), 113–125 (1999)

    Google Scholar 

  20. Wu, Z., Leahy, R.: An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 15, 1101–1113 (1993)

    Article  Google Scholar 

  21. Zhu, J., Hong, J., Hughes, J.G.: Using Markov models for web site link prediction. In: ACM Conference on Hypertext/Hypermedia (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Halvey, M., Keane, M.T., Smyth, B. (2005). Birds of a Feather Surf Together: Using Clustering Methods to Improve Navigation Prediction from Internet Log Files. In: Perner, P., Imiya, A. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2005. Lecture Notes in Computer Science(), vol 3587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11510888_18

Download citation

  • DOI: https://doi.org/10.1007/11510888_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26923-6

  • Online ISBN: 978-3-540-31891-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics