Combining Browsing Behaviors and Page Contents for Finding User Interests

  • Fang Li
  • Yihong Li
  • Yanchen Wu
  • Kai Zhou
  • Feng Li
  • Xingguang Wang
  • Benjamin Liu*

Abstract This paper proposes a system for finding a user's interests based on his browsing behaviors and the contents of his visited pages. An advanced client browser plug-in is implemented to track the user browsing behaviors and collect the information about the web pages that he has viewed. We develop a user-interest model in which user interests can be inferred by clustering and summarization the viewed page contents. The corresponding degree of his interest can be calculated based on his browsing behaviors and histories. The calculation for the interested degree is based on Gaussian process regression model which captures the relationship between a user's browsing behaviors and his interest to a web page. Experiments show that the system can find the user interests automatically and dynamically.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    A Library for Support Vector Machines (LIBSVM). http://www.csie.ntu.edu.tw/∼cjlin/ libsvm/
  2. 2.
    Atterer R, Wnuk M, and Schmidt A (2006) Knowing the User’s Every Move: User Activity Tracking for Website Usability Evaluation and Implicit Interaction. In Proceeding of the 15th International Conference on World Wide Web (Edinburgh Scotland, May 23-26, 2006). WWW’06, ACM Press, New York, pp 203-212Google Scholar
  3. 3.
    Cai D, Yu SP, Wen JR and Ma WY (2003) VIPS: a Vision-based Page Segmentation Algo- rithm. Microsoft Technical Report (MSR-TR-2003-79), November, 2003Google Scholar
  4. 4.
    S.M. Wild (2003) Seeding non-negative matrix factorizations with the spherical K-Means clustering. MS Thesis for the Department of Applied Mathematics, University of Colorado, April 2003Google Scholar
  5. 5.
    Lozano JA, Pena JM and Larranage, P (1999) An empirical comparison of four initialization methods for the k-means algorithm. Pattern Recognition Letters, 20: 1027-1040, 1999Google Scholar
  6. 6.
    Rasmussen CE and Williams CKI (2006) Gaussian Processes for Machine Learning, MIT Press, 2006Google Scholar
  7. 7.
    Weinreich H, Obendorf H, Herder E, and Mayer M (2006) Off the Beaten Tracks: Exploring Three Aspects of Web Navigation. In Proceeding of the 15th International Conference on World Wide Web (Edinburgh Scotland, May 23-26, 2006). WWW’06, ACM Press, New York, pp 133-142Google Scholar
  8. 8.
    White RW, and Drucker SM (2007). Investigating Behavioral Variability in Web Search. In Proceeding of the 16th International Conference on World Wide Web (Alberta Canada, May 8-12, 2007). WWW’07, ACM Press, New York, pp 21-30Google Scholar

Copyright information

© Springer Science + Business Media B.V 2008

Authors and Affiliations

  • Fang Li
    • 1
  • Yihong Li
    • 1
  • Yanchen Wu
    • 1
  • Kai Zhou
    • 1
  • Feng Li
    • 1
  • Xingguang Wang
    • 1
  • Benjamin Liu*
    • 1
  1. 1.Department of Computer Science and EngineeringShanghai Jiao Tong UniversityChina

Personalised recommendations