Advertisement

Monitoring the Evolution of Web Usage Patterns

  • Steffan Baron
  • Myra Spiliopoulou
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3209)

Abstract

With the ongoing shift from off-line to on-line business processes, the Web has become an important business platform, and for most companies it is crucial to have an on-line presence which can be used to gather information about their products and/or services. However, in many cases there is a difference between the intended and the effective usage of a web site and, presently, many web site operators analyse the usage of their sites to improve their usability. But particularly in the context of the Internet, content and structure change rather quickly, and the way a web site is used may change often, either due to changing information needs of its visitors, or due to an evolving user group. Therefore, the discovered usage patterns need to be updated continuously to always reflect the actual behaviour of the visitors.

In this article, we introduce PAM, an automated Pattern Monitor, which can be used to observe changes to the behaviour of a web site’s visitors. It is based on a temporal representation of rules in which both the content of the rule and its statistical properties are modelled. It observes pattern change as evolution of the statistical measurements captured for a rule throughout its entire lifetime and notifies the user about interesting changes within the rule base. We present PAM in a case study on the evolution of web usage patterns. In particular, we discovered association rules from a web-server log that show which pages tend to be visited within the same user session. These patterns have been imported into the monitor, and their evolution throughout a period of 8 months has been analysed. Our results show that PAM is particularly suitable to gain insights into the changes of a rule base over time.

Keywords

Change Detector Association Rule Training Phase Rule Base Domain Expert 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agrawal, R., Psaila, G., Wimmers, E.L., Zaït, M.: Querying shapes of histories. In: VLDB, Zürich, Switzerland (1995)Google Scholar
  2. 2.
    Ayan, N.F., Tansel, A.U., Arkun, E.: An Efficient Algorithm To Update Large Itemsets With Early Pruning. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 1999, pp. 287–291. ACM, New York (1999)CrossRefGoogle Scholar
  3. 3.
    Baron, S., Spiliopoulou, M.: Monitoring change in mining results. In: 3rd Int. Conf. on Data Warehousing and Knowledge Discovery - DaWaK 2001, Munich, Germany (September 2001)Google Scholar
  4. 4.
    Baron, S., Spiliopoulou, M.: Monitoring the Results of the KDD Process: An Overview of Pattern Evolution. In: Meij, J. M. (ed.) Dealing with the Data Flood: Mining data, text and+multimedia, ch. 6, pp. 845–863. STT Netherlands Study Center for Technology Trends, The Hague, Netherlands (April 2002)Google Scholar
  5. 5.
    Baron, S., Spiliopoulou, M., Günther, O.: Efficient Monitoring of Patterns in Data Mining Environments. In: Kalinichenko, L.A., Manthey, R., Thalheim, B., Wloka, U. (eds.) ADBIS 2003. LNCS, vol. 2798, Springer, Heidelberg (2003) (to appear)CrossRefGoogle Scholar
  6. 6.
    Berry, M.J., Linoff, G.: Data Mining Techniques: For Marketing, Sales and Customer Support. John Wiley & Sons, Inc., Chichester (1997)Google Scholar
  7. 7.
    Bing Liu, Y.M., Lee, R.: Analyzing the interestingness of association rules from the temporal dimension. In: IEEE International Conference on Data Mining (ICDM 2001), Silicon Valley, USA, pp. 377–384 (November 2001)Google Scholar
  8. 8.
    Chakrabarti, S., Sarawagi, S., Dom, B.: Mining surprising patterns using temporal description length. In: Gupta, A., Shmueli, O., Widom, J. (eds.) VLDB 1998, New York City, NY, August 1998, pp. 606–617. Morgan Kaufmann, San Francisco (1998)Google Scholar
  9. 9.
    Chen, X., Petrounias, I.: Mining Temporal Features in Association Rules. In: Proceedings of the 3rd European Conference on Principles of Data Mining and Knowledge Discovery, Prague, Czech Republic, September 1999. LNCS, pp. 295–300. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  10. 10.
    Cheung, D.W., Lee, S., Kao, B.: A general incremental technique for maintaining discovered association rules. In: DASFAA 1997, Melbourne, Australia (April 1997)Google Scholar
  11. 11.
    Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., Xu, X.: Incremental Clustering for Mining in a Data Warehousing Environment. In: Proceedings of the 24th International Conference on Very Large Data Bases, New York City New York, USA, August 1998, pp. 323–333. Morgan Kaufmann, San Francisco (1998)Google Scholar
  12. 12.
    Freitas, A.: On objective measures of rule surprisingness. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS, vol. 1510, pp. 1–9. Springer, Heidelberg (1998)CrossRefGoogle Scholar
  13. 13.
    Gago, P., Bento, C.: A Metric for Selection of the Most Promising Rules. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS, vol. 1510, Springer, Heidelberg (1998)CrossRefGoogle Scholar
  14. 14.
    Ganti, V., Gehrke, J., Ramakrishnan, R.: A Framework for Measuring Changes in Data Characteristics. In: Proceedings of the Eighteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, Philadelphia, Pennsylvania, May 1999, pp. 126–137. ACM Press, New York (1999)CrossRefGoogle Scholar
  15. 15.
    Ganti, V., Gehrke, J., Ramakrishnan, R.: DEMON: Mining and Monitoring Evolving Data. In: Proceedings of the 15th International Conference on Data Engineering, San Diego, California, USA, February 2000, pp. 439–448. IEEE Computer Society, Los Alamitos (2000)Google Scholar
  16. 16.
    Lee, S., Cheung, D., Kao, B.: Is Sampling Useful in Data Mining?A Case in the Maintenance of Discovered Association Rules. Data Mining and Knowledge Discovery 2(3), 233–262 (1998)CrossRefGoogle Scholar
  17. 17.
    Lee, S.D., Cheung, D.W.-L.: Maintenance of Discovered Association Rules: When to update. In: ACM-SIGMODWorkshop on Data Mining and Knowledge Discovery (DMKD 1997), Tucson, Arizona (May 1997)Google Scholar
  18. 18.
    Liu, B., Hsu, W., Ma, Y.: Discovering the set of fundamental rule changes. In: 7th ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining (KDD 2001), San Francisco, USA, pp. 335–340 (August 2001)Google Scholar
  19. 19.
    Mark, N.M.A., Kelly, G.: David J. Hand. The Impact of Changing Populations on Classifier Performance. In: Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, August 1999, pp. 367–371. ACM, New York (1999)Google Scholar
  20. 20.
    Omiecinski, E., Savasere, A.: Efficient Mining of Association Rules in Large Databases. In: Proceedings of the British National Conference on Databases, pp. 49–63 (1998)Google Scholar
  21. 21.
    Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: Proc. of the Eighth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (2002)Google Scholar
  22. 22.
    Thomas, S., Bodagala, S., Alsabti, K., Ranka, S.: An Efficient Algorithm for the Incremental Updation of Association Rules in Large Databases. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining (KDD 1997), Newport Beach, California, USA, pp. 263–266 (August 1997)Google Scholar
  23. 23.
    Wang, K.: Discovering patterns from large and dynamic sequential data. Intelligent Information Systems 9, 8–33 (1997)Google Scholar
  24. 24.
    Witten, H., Frank, E.: Data Mining: Practical machine learning tools with Java implementations. Morgan Kaufmann, San Francisco (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Steffan Baron
    • 1
  • Myra Spiliopoulou
    • 2
  1. 1.Institute of Information SystemsHumboldt-Universität zu BerlinBerlinGermany
  2. 2.Institute of Technical and Business Information SystemsOtto-von-Guericke-Universität MagdeburgMagdeburgGermany

Personalised recommendations