Abstract
Web usage models and profiles capture significant interests and trends from past accesses. They are used to improve the user experience, for example through page recommendation and pre-fetching. Although browsing behavior changes over time, many web usage modeling techniques are static, because model compilation times are prohibitive and fast incremental update mechanisms are lacking. Profiles nevertheless have to be maintained so that they adapt to new interests and trends; otherwise their use can lead to poor, irrelevant, and mis-targeted recommendations in personalization systems. We present a new profile maintenance scheme that extends the Relational Fuzzy Subtractive Clustering (RFSC) technique and enables efficient incremental update of usage profiles. We define an impact factor whose value can be used to decide whether recompilation is needed. Results from extensive experiments on a large real dataset of web logs show that the proposed maintenance technique is almost as good as complete remodeling, at considerably reduced computational cost.
This work is an extended version of our earlier work presented at WebKDD 2005 [29].
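For orientation only, the following is a minimal sketch of subtractive clustering in the style of Chiu (listed in the references), applied directly to a pairwise dissimilarity matrix. It is not the authors' RFSC algorithm or their maintenance scheme, neither of which is specified in this abstract. The function names, the single rejection threshold, and the parameter defaults (`r_a`, `squash`, `reject`) are assumptions made for illustration.

```python
import math

def potentials(dissim, alpha):
    """Potential of each object: sum over all objects of exp(-alpha * d^2)."""
    n = len(dissim)
    return [
        sum(math.exp(-alpha * dissim[i][j] ** 2) for j in range(n))
        for i in range(n)
    ]

def subtractive_centers(dissim, r_a=0.5, squash=1.5, reject=0.15):
    """Pick cluster centers from a symmetric dissimilarity matrix.

    Assumed simplification: a single rejection ratio stops the search,
    instead of a separate accept/reject pair with a gray zone.
    """
    alpha = 4.0 / r_a ** 2              # neighbourhood width for potentials
    beta = 4.0 / (squash * r_a) ** 2    # wider width used when subtracting
    pot = potentials(dissim, alpha)
    first_peak = max(pot)
    centers = []
    while True:
        # The object with the highest remaining potential is the candidate.
        c = max(range(len(pot)), key=pot.__getitem__)
        if pot[c] < reject * first_peak:
            break
        centers.append(c)
        peak = pot[c]
        # Reduce potential near the new center so that the next center
        # is found in a different dense region.
        pot = [
            p - peak * math.exp(-beta * dissim[i][c] ** 2)
            for i, p in enumerate(pot)
        ]
    return centers

# Hypothetical toy data: sessions 0-2 are mutually close, sessions 3-4 are
# close to each other, and the two groups are far apart.
toy = [
    [0.0, 0.1, 0.2, 1.0, 1.0],
    [0.1, 0.0, 0.1, 1.0, 1.0],
    [0.2, 0.1, 0.0, 1.0, 1.0],
    [1.0, 1.0, 1.0, 0.0, 0.1],
    [1.0, 1.0, 1.0, 0.1, 0.0],
]
print(subtractive_centers(toy))
```

In this sketch each selected center is an actual object (here, a session index), which is why the method needs only pairwise dissimilarities and not feature vectors.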
References
Abraham, A.: Business Intelligence from Web Usage Mining. J. of Information and Knowledge Management (JIKM) 2(4), 375–390 (2003)
Baraglia, R., Silvestri, F.: An Online Recommender System for Large Web Sites. In: Proc. IEEE/WIC/ACM Int’l. Conference on Web Intelligence, Beijing, China (September 2004)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)
Breese, J., Heckerman, D., Kadie, C.: Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In: Proc. of UAI 1998, pp. 43–52 (1998)
Can, F., Ozkarahan, E.A.: A Dynamic Cluster Maintenance System for Information Retrieval. In: Proc. 10th Annual International ACM-SIGIR Conference, pp. 123–131 (1987)
Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. SIAM Journal on Computing 33(6), 1417–1440 (2004)
Chiu, S.L.: Fuzzy model identification based on cluster estimation. J. of Intelligent and Fuzzy Systems 2(3) (1994)
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. J. of Knowledge and Information Systems 1, 1–27 (1999)
Corsini, P., Lazzerini, B., Marcelloni, F.: A New Fuzzy Relational Clustering Algorithm Based on Fuzzy C-means Algorithm. Soft Computing. Springer, Heidelberg (2004)
Ester, M., Kriegel, H., Sander, J., Wimmer, M., Xu, X.: Incremental Clustering for Mining in a Data Warehousing Environment. In: Proc. of VLDB 1998, pp. 323–333. Morgan Kaufmann Publishers Inc., San Francisco (1998)
Fu, Y., Sandhu, K., Shih, M.-Y.: A Generalization-Based Approach to Clustering of Web Usage Sessions. In: Masand, B., Spiliopoulou, M. (eds.) WebKDD 1999. LNCS (LNAI), vol. 1836, pp. 21–38. Springer, Heidelberg (2000)
Huang, J.Z., Ng, M.K., Ching, W.-K., Ng, J., Cheung, D.W.-L.: A cube model and cluster analysis for web access sessions. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, pp. 48–67. Springer, Heidelberg (2002)
Hathaway, R.J., Bezdek, J.C., Davenport, J.W.: On relational data versions of c-means algorithms. Pattern Recognition Letters 17, 607–612 (1996)
Hubert, L., Arabie, P.: Comparing partitions. J. of Classification 2, 193–198 (1985)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, Englewood Cliffs (1988)
Mobasher, B.: Web Usage Mining and Personalization. In: Singh, M.P. (ed.) Practical Handbook of Internet Computing. CRC Press, Boca Raton (2004)
Mobasher, B., Cooley, R., Srivastava, J.: Automatic personalization based on web usage mining. Comm. ACM 43(8), 142–151 (2000)
Nasraoui, O.: World Wide Web Personalization. In: Wang, J. (ed.) Encyclopedia of Data Mining and Data Warehousing. Idea Group, USA (2005)
Nasraoui, O., Cardona, C., Rojas, C., Gonzalez, F.: Mining Evolving User Profiles in Noisy Web Clickstream Data with a Scalable Immune System Clustering Algorithm. In: Proc. WebKDD 2003, Washington DC (August 2003)
Nasraoui, O., Krishnapuram, R., Joshi, A., Kamdar, T.: Automatic Web User Profiling and Personalization using Robust Fuzzy Relational Clustering. In: E-Commerce and Intelligent Methods. Springer, Heidelberg (2002)
Nasraoui, O., Frigui, H., Krishnapuram, R., Joshi, A.: Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering. International Journal on Artificial Intelligence Tools 9(4), 509–526 (2000)
Nasraoui, O., Krishnapuram, R.: One Step Evolutionary Mining of Context Sensitive Associations and Web Navigation Patterns. In: Proc. SIAM conference on Data Mining, Arlington, VA, April 2002, pp. 531–547 (2002)
Pal, K., Pal, N., Keller, J.M., Bezdek, J.: Relational mountain (density) clustering method and web log analysis. Int’l. J. of Intelligent Systems 20(3), 375–392 (2005)
Pennock, D.M., Horvitz, E., Lawrence, S., Giles, C.L.: Collaborative filtering by personality diagnosis: A hybrid memory- and model-based approach. In: Proc. of UAI 2000, Stanford, CA, pp. 473–480 (2000)
Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.: Analysis of recommender algorithms for e-commerce. In: Proc. 2nd ACM E-commerce Conference, Minnesota, USA (2000)
Shahabi, C., Banaei-Kashani, F.: A Framework for Efficient and Anonymous Web Usage Mining Based on Client-Side Tracking. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, p. 113. Springer, Heidelberg (2002)
Suryavanshi, B.S., Shiri, N., Mudur, S.P.: An Efficient Technique for Mining Usage Profiles using Relational Fuzzy Subtractive Clustering. In: Proc. of IEEE Int’l. Workshop on Challenges in Web Information Retrieval and Integration (WIRI 2005), Tokyo, Japan, April 8-9 (2005)
Suryavanshi, B.S., Shiri, N., Mudur, S.P.: A Fuzzy Hybrid Collaborative Filtering Technique for Web Personalization. In: Proc. of 3rd Workshop on Intelligent Techniques for Web Personalization (ITWP 2005), Edinburgh, Scotland (August 2005)
Suryavanshi, B.S., Shiri, N., Mudur, S.P.: Incremental Relational Fuzzy Subtractive Clustering for Dynamic Web Usage Profiling. In: Nasraoui, O., Zaïane, O.R., Spiliopoulou, M., Mobasher, B., Masand, B., Yu, P.S. (eds.) WebKDD 2005. LNCS (LNAI), vol. 4198. Springer, Heidelberg (2006)
Tasoulis, D., Vrahatis, M.: Unsupervised Clustering on Dynamic Databases. Pattern Recognition Letters (to appear, 2005)
Van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979)
Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. on PAMI 13(8), 841–847 (1991)
Xie, Y., Phoha, V.V.: Web User Clustering from Access Log Using Belief Function. In: Proc. 1st International Conference on Knowledge Capture (K-CAP 2001), pp. 202–208. ACM Press, New York (2001)
Yan, T.W., Jacobsen, M., Garcia-Molina, H., Dayal, U.: From User Access Patterns to Dynamic Hypertext Linking. In: Proc. 5th International World Wide Web Conf. (1996)
Yager, R.R., Filev, D.P.: Approximate clustering via the mountain method. IEEE Transactions on Systems, Man, and Cybernetics 24(8), 1279–1284 (1994)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Proc. 1996 ACM SIGMOD Int. Conf. Management of Data, Montreal, Canada, pp. 103–114 (June 1996)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Suryavanshi, B.S., Shiri, N., Mudur, S.P. (2006). Adaptive Web Usage Profiling. In: Nasraoui, O., Zaïane, O., Spiliopoulou, M., Mobasher, B., Masand, B., Yu, P.S. (eds) Advances in Web Mining and Web Usage Analysis. WebKDD 2005. Lecture Notes in Computer Science, vol 4198. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11891321_7
DOI: https://doi.org/10.1007/11891321_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46346-7
Online ISBN: 978-3-540-46348-1
eBook Packages: Computer Science (R0)