Trend and Behavior Detection from Web Queries

  • Peiling Wang
  • Jennifer Bownas
  • Michael W. Berry


In this chapter, we demonstrate the type and nature of query characteristics that can be mined from web server logs. Based on a study of over half a million queries (spanning four academic years) to a university’s website, it is shown that the vocabulary (terms) generated from these queries do not have a well-defined Zipf distribution. However, some regularities in term frequency and ranking correlations suggest that piecewise polynomial data fits are reasonable for trend representations.


Search Engine Word Pair Word Association Query Statement Behavior Detection 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [BYRN99]
    R. Baeza-Yates and B. Ribeiro-Neto.Modern Information Retrieval.AddisonWesley, Boston, 1999.Google Scholar
  2. [JP01]
    B.J. Jansen and U. Pooch.A review of Web searching studies and a framework for future research. Journal of the American Society for Information Science and Technology, 52 (3): 235–246, 2001.CrossRefGoogle Scholar
  3. [JSS00]
    B.J. Jansen, A. Spink, and T. Saracevic.Real life, real users, and real needs: A study and analysis of user queries on the Web.Information Processing and Management, 36 (2): 207–227, 2000.CrossRefGoogle Scholar
  4. [Kor77]
    R.R. Korfhage.Information Storage and Retrieval.Wiley,New York, 1977.Google Scholar
  5. [RW00]
    N. Ross and D. Wolfram.End user searching on the Internet: An analysis of term pair topics submitted to the Excite Search Engine.Journal of the American Society for Information Science and Technology, 51 (10): 949–958, 2000.CrossRefGoogle Scholar
  6. [SBC97]
    B. Shneiderman, D. Byrd, and W.B. Croft.Clarifying search: A user-interface framework for text searches.D-Lib Magazine, 1:1–18, 1997.Google Scholar
  7. [SHMM99]
    C. Silverstein, M. Henzinger, H. Marais, and M. Moricz.Analysis of a very large Web search engine query log.SIGIR Forum, 33 (1): 6–12, 1999.CrossRefGoogle Scholar
  8. [SWJS01]
    A. Spink, D. Wolfram, B. Jansen, and T. Saracevic.Searching the Web: The public and their queries. Journal of the American Society for Information Science and Technology, 52 (3): 226–234, 2001.CrossRefGoogle Scholar
  9. [Wo199]
    D. Wolfram.Term co-occurrence in Internet search engine queries: An analysis of the Excite data set.Canadian Journal of Information and Library Science, 24 (2/3): 12–33, 1999.Google Scholar
  10. [WP97]
    P. Wand and L. Pouchard.End-user searching of Web resources: Problems and implications.In Proceedings of the Eighth ASIS SIG/CR Workshop, Washington DC, pages 73–85, 1997.Google Scholar

Copyright information

© Springer Science+Business Media New York 2004

Authors and Affiliations

  • Peiling Wang
  • Jennifer Bownas
  • Michael W. Berry

There are no affiliations available

Personalised recommendations