Skip to main content

Extracting User Behavior by Web Communities Technology on Global Web Logs

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3180))

Abstract

A lot of work has been done on extracting the model of web user behavior. Most of them target server-side logs that cannot track user behavior outside of the server. Recently, a novel way has been developed to collect web browsing histories, using the same method for determining TV audience ratings; i.e., by collecting data from randomly selected users called panels. The logs collected from panels(called panel logs) cover an extremely broad URL-space, and it is difficult to capture the global behaviors of the users. Here we utilize mining results of web community to group those URLs into easily understandable topics. We also use search keywords in search engine sites because user behavior is deeply related to search keyword according to preliminary experiments on panel logs. We develop a prototype system to extract user access patterns from the panel logs and to capture the global behavior based on web communities.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (1998)

    Google Scholar 

  2. Shahabi, C., Zarkesh, A., Adibi, J., Shah, V.: Knowledge discovery from users web-page navigation. In: Proceedings of the IEEE RIDE 1997 Workshop (1997)

    Google Scholar 

  3. Batista, P., Silva, M.: Mining on-line newspaper web access logs. In: 12th International Meeting of the EuroWorking Group on Decision Support Systems, EWGDSS 2001 (2001)

    Google Scholar 

  4. Fu, Y., Sandhu, K., Shih, M.: Clustering of web users based on access patterns. In: Masand, B., Spiliopoulou, M. (eds.) WebKDD 1999. LNCS (LNAI), vol. 1836, pp. 21–38. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  5. Ungar, L., Foster, D.: Clustering methods for collaborative filtering. In: AAAI Workshop on Recommendation Systems (1998)

    Google Scholar 

  6. Su, Z., Yang, Q., Zhang, H., Xu, X., Hu, Y.: Correlation-based document clustering using web logs. In: 34th Hawaii International Conference on System Sciences, HICSS-34 (2001)

    Google Scholar 

  7. Tan, P., Kumar, V.: Mining association patterns in web usage data. In: International Conference on Advances in Infrastructure for e-Business, e-Education, e-Science, and e-Medicine on the Internet (2002)

    Google Scholar 

  8. Zaiane, O., Xin, M., Han, J.: Discovering web access patterns and trends by applying olap and data mining technology on web logs. In: Proc. Advances in Digital Libraries, ADL 1998 (1998)

    Google Scholar 

  9. Beeferman, D., Berger, A.: Agglomerative clustering of s earch engine query log. In: The 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2000 (2000)

    Google Scholar 

  10. Wen, J., Nie, J., Zhang, H.: Query clustering using user logs. ACM Transactions on Information Systems (ACM TOIS) 20, 59–81 (2002)

    Article  Google Scholar 

  11. Ohura, Y., Takahashi, K., Pramudiono, I., Kitsuregawa, M.: Experiments on query expansion for internet yellow page services using web log mining. In: The 28th International Conference on Very Large Data Bases, VLDB 2002 (2002)

    Google Scholar 

  12. Koutsoupias, N.: Exploring web access logs with correspondence analysis. Methods and Applications of Artificial Intelligence, Second Hellenic (2002)

    Google Scholar 

  13. Prasetyo, B., Pramudiono, I., Takahashi, K., Kitsuregawa, M.: Naviz: Website navigational behavior visualizer. In: Chen, M.-S., Yu, P.S., Liu, B. (eds.) PAKDD 2002. LNCS (LNAI), vol. 2336, p. 276. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  14. Zeng, H., Chen, Z., Ma, W.: A unified framework for clustering heterogeneous web objects. In: The Third International Conference on Web Information Systems Engineering, WISE 2002 (2002)

    Google Scholar 

  15. Nanopoulos, A., Manolopoulos, Y., Zakrzewicz, M., Morzy, T.: Indexing web access-logs for pattern queries. In: 4th ACM CIKM Nternational Workshop on Web Information and Data Management (WIDM 2002), pp. 63–68 (2002)

    Google Scholar 

  16. Pramudiono, I., Shintani, T., Takahashi, K., Kitsuregawa, M.: User behavior analysis of location aware search engine. In: Proceedings of International Conference On Mobile Data Management (MDM 2002), pp. 139–145 (2002); [17] Catledge, L., Pitkow, J.: Characterizing browsing behaviors on the world-wide web. Computer Networks and ISDN Systems (1995)

    Google Scholar 

  17. Catledge, L., Pitkow, J.: Characterizing browsing behaviors on the world-wide web. Computer Networks and ISDN Systems (1995)

    Google Scholar 

  18. Murata, T.: Web community. IPSJ Magazine 44, 702–706 (2003)

    MathSciNet  Google Scholar 

  19. Flake, G., Lawrence, S., Giles, C.L., Coetzee, F.: Self-organization and identification of web communities. IEEE Computer 35, 66–71 (2002)

    Google Scholar 

  20. Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: Trawling the web for emerging cyber-communities. In: Proc. of the 8th WWW conference, pp. 403–416 (1999)

    Google Scholar 

  21. Toyoda, M., Kitsuregawa, M.: Creating a web community chart for navigating related communities. In: Conference Proceedings of Hypertext 2001, pp. 103–112 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Otsuka, S., Toyoda, M., Hirai, J., Kitsuregawa, M. (2004). Extracting User Behavior by Web Communities Technology on Global Web Logs. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_92

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30075-5_92

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22936-0

  • Online ISBN: 978-3-540-30075-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics