Skip to main content

Web Usage Mining by Means of Multidimensional Sequence Alignment Methods

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2703))

Abstract

In this article, a new algorithm called Multidimensional Sequence Alignment Method (MDSAM) is illustrated for mining navigation patterns on a web site. MDSAM examines sequences composed of several information types, such as visited pages and visiting time spent on pages. Besides, MDSAM handles large databases and uses heuristics to compute a multidimensional cost based on one-dimensional optimal trajectories. Empirical results show that MDSAM identifies profiles showing visited pages, visiting time spent on pages and the order in which pages are visited on a web site.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic subspace clustering of high dimensional data for data mining applications. In: Proceedings Conference ACM SIGMOD on Management of Data ACM SIGMOD CMD 1998, Seattle, WA, June 1998, pp. 94–104 (1998)

    Google Scholar 

  2. Büchner, A.G., Baumgarten, M., Anand, S.S., Mulvenna, M.D., Highes, J.G.: Navigation Pattern Discovery from Internet Data. In: Proceedings Workshop ACM on Web Usage Analysis and User Profiling (ACM WEBKDD 1999), San Diego, CA, August 1999, pp. 25–30 (1999)

    Google Scholar 

  3. Cadez, I., Heckerman, D., Meek, C., Smyth, P., White, S.: Visualization of Navigation Patterns on a Web Site Using Model Based Clustering. Technical Report MSR-TR-2000- 18, Microsoft Research (2000)

    Google Scholar 

  4. Capri: Generic sequence discovery product (2001), http://www.mineit.com/products

  5. Cooley, R.: Web Usage Mining: Discovery and Application of Interesting Patterns from Web Data. Ph. D. Thesis, University of Minnesota (2000), http://www.users.cs.umn.edu/~cooley/pubs.html

  6. Cooley, R., Mobasher, B., Srivastava, J.: Web Mining: Information and Pattern Discovery on the World Wide Web. A survey paper. In: Proceedings 9th IEEE Conference on Tools with Artificial Intelligence (ICTAI 1997), Newport Beach, CA (November 1997)

    Google Scholar 

  7. Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Knowledge and Information Systems 1(1), 5–32 (1999)

    Google Scholar 

  8. Foss, A., Weinan, W., Zaïane, O.R.: A Non-Parametric Approach to Web Log Analysis. In: Proceedings of Workshop on Web Mining in First International SIAM Conference on Data Mining (SDM 2001), Chicago, IL, April 2001, pp. 41–50 (2001)

    Google Scholar 

  9. Goldberg, D.E.: Genetic algorithms in search, optimization and machine learning. Addison-Wesley, Reading (1989)

    MATH  Google Scholar 

  10. Hair, J., Andersen, R., Tatham, R., Black, W.: Multivariate Data Analysis. Prentice Hall, New Jersey (1998)

    Google Scholar 

  11. Hay, B., Wets, G., Vanhoof, K.: Clustering Navigation Patterns on a Website Using a Sequence Alignment Method. In: Proceedings 17th Conference on Artificial Intelligence, Intelligent Techniques for Web Personalization (IJCAI 2001), Seattle, WA, August 2001, pp. 1–6 (2001)

    Google Scholar 

  12. Joh, C.H., Arentze, T.A., Timmermans, H.J.P.: A position-sensitive sequence alignment method illustrated for space-time activity-diary data. Environment and Planning A 33(2), 313–338 (2001)

    Article  Google Scholar 

  13. Joh, C.H., Arentze, T.A., Timmermans, H.J.P.: Multidimensional Sequence Alignment Methods for Activity-Travel Pattern Analysis: a comparison of Dynamic Programming and Genetic Algorithms. Geographical Analysis 33(3), 247–270 (2001)

    Article  Google Scholar 

  14. Mannila, H., Ronkainen, P.: Similarity of event sequences. In: Proceedings 4th Workshop on Temporal Representation and Reasoning (TIME 1997), Daytona Beach, Florida, May 1997, pp. 136–139 (1997)

    Google Scholar 

  15. Masand, B., Spiliopoulou, M.: Advances in Web Usage Mining and User Profiling. In: Proceedings Workshop ACM on Web Usage Analysis and User Profiling (ACM WEBKDD 1999). LNCS (LNAI), vol. 1836. Springer, Heidelberg (2000)

    Google Scholar 

  16. Mena, J.: Data Mining Your Website. Digital Press, Boston (1999)

    Google Scholar 

  17. Mobasher, B., Jain, N., Han, E., Srivastava, J.: Web Mining: Pattern discovery from World Wide Web transactions. Technical Report TR 96–050, University of Minnesota (1996)

    Google Scholar 

  18. Mulvenna, M.D., Anand, S.S., Büchner, A.G.: Personalization on the Net using Web mining: introduction. Communications of the ACM 43(8), 122–125 (2000)

    Article  Google Scholar 

  19. Nasraoui, O., Krishnapuram, R., Anupam, J.: Mining Web Access Logs Using a Fuzzy Relational Clustering Algorithm Based on a Robust Estimator. In: Proceedings 8th World Wide Web Conference (WWW8 1999), Toronto, Canada (May 1999)

    Google Scholar 

  20. Piatetsky-Shapiro, G., Fayyad, U., Smith, P.: From data mining to knowledge discovery: An overview. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smith, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–35. AAAI/MIT Press (1996)

    Google Scholar 

  21. Sankoff, D., Kruskal, J.B.: Time warps, string edits and macromolecules: the theory and practice of sequence comparison. Addison Wesley, Reading (1983)

    Google Scholar 

  22. Shahabi, C., Faisal, A., Kashani, F.B., Faruque, J.: INSITE: A Tool for interpreting Users? Interaction with a Web Space. In: Proceedings 26th Conference on Very Large Databases (VLDB 2000), Egypt, Caïro, September 2000, pp. 635–638 (2000)

    Google Scholar 

  23. Shahabi, C., Zarkesh, A., Adibi, J., Shah, V.: Knowledge discovery from users Web-page navigation. In: Proceedings 7th Workshop IEEE on Research Issues in Data Engineering, UK, Birmingham, April 1997, pp. 20–31 (1997)

    Google Scholar 

  24. Spiliopoulou, M.: Web Usage Mining: Data Mining über die Nutzung des Web. In: Küsters, U., Meyer, M., Wilde, K.D. (eds.) Handbuch Data Mining im Marketing. Vieweg, ch. 13 (2000)

    Google Scholar 

  25. Spiliopoulou, M., Faulstich, L.: WUM: a Tool for Web Utilization Analysis. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 184–203. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  26. Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web usage mining: discovery and applications of usage patterns from web data. ACM SIGKDD Explorations 1(2), 12–23 (2000)

    Article  Google Scholar 

  27. Wang, W., Zaïane, O.R.: Clustering Web Sessions by Sequence Alignment. In: Proceedings 3rd Workshop on Management of Information on the Web in conjunction with 13th International Conference on Database and Expert Systems Applications DEXA, Aix en Provence, France (September 2002)

    Google Scholar 

  28. Wilson, W.C.: Activity pattern analysis by means of Sequence Alignment Methods. Environment and Planning A(30), 1017–1038 (1998)

    Article  Google Scholar 

  29. Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Fransisco (2000)

    Google Scholar 

  30. Zaïane, O.R.: Conference Tutorial Notes: Web Mining: Concepts, Practices and Research. In: Proceedings XIV Brazilian Symposium on Databases (SDBD 2000), Promoted by the Brazilian Computer Society in cooperation with ACM SIGMOD, Brazil, João Pessoa, Paraïba, October 2001, pp. 410–474 (2001)

    Google Scholar 

  31. Zaïane, O.R., Xin, M., Han, J.: Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs. In: Proceedings of Advances in Digital Libraries, Santa Barbara, April 1998, pp. 19–29. IEEE, Los Alamitos (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hay, B., Wets, G., Vanhoof, K. (2003). Web Usage Mining by Means of Multidimensional Sequence Alignment Methods. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds) WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles. WebKDD 2002. Lecture Notes in Computer Science(), vol 2703. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39663-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39663-5_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20304-9

  • Online ISBN: 978-3-540-39663-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics