Patterns of Search: Analyzing and Modeling Web Query Refinement
We discuss the construction of probabilistic models centering on temporal patterns of query refinement. Our analyses are derived from a large corpus of Web search queries extracted from server logs recorded by a popular Internet search service. We frame the modeling task in terms of pursuing an understanding of probabilistic relationships among temporal patterns of activity, informational goals, and classes of query refinement. We construct Bayesian networks that predict search behavior, with a focus on the progression of queries over time. We review a methodology for abstracting and tagging user queries. After presenting key statistics on query length, query frequency, and informational goals, we describe user models that capture the dynamics of query refinement.
Keywords: Bayesian Network, User Model, Bayesian Network Model, Search Service, Query Length