Abstract
This article presents the Multidimensional Sequence Alignment Method (MDSAM), a new algorithm for mining navigation patterns on a web site. MDSAM examines sequences composed of several information types, such as the pages visited and the time spent on each page. It also handles large databases and uses heuristics to compute a multidimensional cost from one-dimensional optimal trajectories. Empirical results show that MDSAM identifies profiles that capture the pages visited, the time spent on them, and the order in which they are visited on a web site.
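To make the idea of a cost built from one-dimensional alignments concrete, the following is a minimal sketch and not the authors' MDSAM. It assumes each session is stored as one sequence per dimension (here the hypothetical keys "page" and "time"), aligns each dimension separately with a standard edit distance, and simply sums the per-dimension costs. The function names, the cost weights and the binned time labels are all illustrative assumptions; the abstract does not specify how MDSAM's heuristics combine the one-dimensional trajectories, so the plain sum should be read only as a naive baseline.

```python
from typing import Dict, Hashable, Sequence


def edit_distance(a: Sequence[Hashable], b: Sequence[Hashable],
                  indel: float = 1.0, subst: float = 2.0) -> float:
    """One-dimensional alignment cost via dynamic programming.

    indel and subst are assumed weights for insertion/deletion
    and substitution; they are not taken from the paper.
    """
    # prev[j] holds the cost of aligning the processed prefix of a with b[:j].
    prev = [j * indel for j in range(len(b) + 1)]
    for i, x in enumerate(a, start=1):
        curr = [i * indel]
        for j, y in enumerate(b, start=1):
            keep_or_substitute = prev[j - 1] + (0.0 if x == y else subst)
            delete_from_a = prev[j] + indel
            insert_from_b = curr[j - 1] + indel
            curr.append(min(keep_or_substitute, delete_from_a, insert_from_b))
        prev = curr
    return prev[-1]


def summed_dimension_cost(s1: Dict[str, Sequence[Hashable]],
                          s2: Dict[str, Sequence[Hashable]]) -> float:
    """Naive multidimensional cost: sum of independent per-dimension costs.

    This ignores operations that could be shared across dimensions,
    which is the part a multidimensional method would have to address.
    """
    return sum(edit_distance(s1[dim], s2[dim]) for dim in s1)


# Hypothetical sessions: visited pages plus a coarse time-on-page label.
session_a = {"page": ["home", "products", "cart"],
             "time": ["short", "long", "short"]}
session_b = {"page": ["home", "cart"],
             "time": ["short", "short"]}

cost = summed_dimension_cost(session_a, session_b)
print(cost)
```

In this example the two sessions differ by one skipped page, yet the independent sum charges one deletion in each dimension. That double counting is a reason to expect a multidimensional method to treat aligned operations across dimensions jointly rather than add the one-dimensional costs directly.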
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hay, B., Wets, G., Vanhoof, K. (2003). Web Usage Mining by Means of Multidimensional Sequence Alignment Methods. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds) WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles. WebKDD 2002. Lecture Notes in Computer Science, vol 2703. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39663-5_4
DOI: https://doi.org/10.1007/978-3-540-39663-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20304-9
Online ISBN: 978-3-540-39663-5
eBook Packages: Springer Book Archive