Abstract
The information left behind by users who have visited a web site is recorded in the related web server log files. From analysing the data contained in such files, a web designer is able to understand the interaction between the users and a web site, and then to improve the web topology. We assume that the information of web usage can be generated from log files via a cleaning process, from which we identify a set of navigation sessions that represent the trails formed by users during the navigation process. The trails are modelled as a weighted directed graph, called a transition graph, and then a corresponding navigation matrix is computed with respect to the underlying web topology. The main contribution in this paper is that we formally define a minimal set of binary operators on navigation matrices, which consists of the sum, union, intersection and difference operators. These operations afford us the ability to analyse users navigation from the contents of two given navigation matrices.
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-0-387-35658-7_21
Chapter PDF
Similar content being viewed by others
References
J. Borges and M. Levene. Mining association rules in hypertext databases. In Proc. of the 4th Int. Conf. on Knowledge Discovery and Data Mining, pp. 149 - 153, (1998).
A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins and J. Wiener. Graph structure in the web. In Proc. of the 9th WWW Conf., (2000).
M.S. Chen, J. S. Park and P. S. Yu. Efficient data mining for traversal patterns. IEEE Transactions on Knowledge and Data Engineering, 10 (2) pp. 209 - 221, (1998).
R. Cooley, B. Mobasher and J. Srivastava. Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems, 1(1) pp. 5 - 32, (1999).
L. D. Catledge, and J. E. Pitkow. Characterizing browsing strategies in the world wide web. Computer Networks and ISDN Systems, 27 (6) pp. 1065 - 1073, (1995).
J. Mena. Data mining your website. Digital Press, (1999).
W. Ng. Evaluating the client side approach and the server side approach to the WWW and DBMSs integration. In Proc. of the 9th Int. Database Workshop. pp. 72 - 82, (1999).
W. Ng. and C. Chan. WHAT: A web hypertext associated trail mining system. In Proc. of the 9th IFIP 2.6 Working Conf. on Database Semantics, pp. 205 - 220, (2001).
J. Pei, J. Han, B. Mortazavi-asl and H. Zhu. Mining access patterns efficiently from web logs. In Proc. of PAKDD Cont., pp. 396-407, Japan. (2000).
M. Perkowitz and O. Etzioni. Towards adaptive web sites: conceptual framework and case study. Artificial Intelligence, 118 (2000) pp. 245 - 275. (2000).
M. Shaw. Handbook on electronic commerce. Springer-Verlag, (1999).
M. Spiliopoulou and L. Faulstich. WUM: a tool for web utilization analysis. In Proc. of the International Workshop on the Web and Databases, pp. 184 - 203, (1998).
N. Zin and M. Levene. Constructing web-views from automated navigation sessions. In Proc. of the ACM Digital Lib. Workshop on Organizing Web Space, pp. 54 - 58, (1999).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 IFIP International Federation for Information Processing
About this chapter
Cite this chapter
Ng, W. (2003). Capturing the Semantics of Web Log Data by Navigation Matrices. In: Meersman, R., Aberer, K., Dillon, T. (eds) Semantic Issues in E-Commerce Systems. IFIP - The International Federation for Information Processing, vol 111. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35658-7_10
Download citation
DOI: https://doi.org/10.1007/978-0-387-35658-7_10
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-1035-9
Online ISBN: 978-0-387-35658-7
eBook Packages: Springer Book Archive