Abstract
In this paper, we describe a method for discovering knowledge about the users of a web site from data that combines demographic descriptions with site navigations. The goal is to obtain knowledge that helps answer two questions: (1) How do users visit the site? (2) Who are these users? Our approach rests on the following idea: the set of all site users can be divided into several coherent subgroups, each of which shows both distinct personal characteristics and distinct browsing behaviour. We aim to obtain associations between site usage patterns and personal user descriptions. We call this combined knowledge "rich navigation patterns". This knowledge characterizes web site usage precisely and can be used in several applications: prediction of site navigation, recommendation, or improvement of site design.
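The idea of pairing a user subgroup with its typical navigation can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes the subgroups are given by a single demographic attribute rather than discovered, and it treats a "usage pattern" as a page-to-page transition that appears in a large enough share of the subgroup's sessions. The names `rich_patterns`, `age_band`, and `min_support`, and all of the sample data, are hypothetical.

```python
from collections import Counter, defaultdict


def rich_patterns(users, sessions, attribute, min_support=0.5):
    """Associate each value of one demographic attribute with frequent transitions.

    Assumption: subgroups are defined by `attribute` alone; the paper's
    subgroups are discovered from the data, which this sketch does not do.
    """
    # Group navigation sessions by the demographic value of their user.
    sessions_by_group = defaultdict(list)
    for user_id, pages in sessions:
        sessions_by_group[users[user_id][attribute]].append(pages)

    patterns = {}
    for group, group_sessions in sessions_by_group.items():
        counts = Counter()
        for pages in group_sessions:
            # Count each page-to-page transition at most once per session.
            counts.update(set(zip(pages, pages[1:])))
        total = len(group_sessions)
        # Keep transitions whose share of sessions reaches min_support.
        patterns[group] = {
            transition: count / total
            for transition, count in counts.items()
            if count >= min_support * total
        }
    return patterns


# Hypothetical sample data, for illustration only.
users = {
    "u1": {"age_band": "18-25"},
    "u2": {"age_band": "18-25"},
    "u3": {"age_band": "40-60"},
}
sessions = [
    ("u1", ["home", "games", "forum"]),
    ("u2", ["home", "games", "shop"]),
    ("u3", ["home", "news", "weather"]),
]

patterns = rich_patterns(users, sessions, "age_band", min_support=1.0)
for group, transitions in sorted(patterns.items()):
    print(group, transitions)
```

Each entry of the result links a personal description (here, an age band) to the transitions that characterize that subgroup, which is the kind of association the abstract describes, reduced to its simplest form.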
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chevalier, K., Bothorel, C., Corruble, V. (2003). Discovering Rich Navigation Patterns on a Web Site. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science(), vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_7
DOI: https://doi.org/10.1007/978-3-540-39644-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive