Abstract
In this paper we present and empirically evaluate a ‘continuity-cost model’ for the Internet query sessions of users. We study how different ‘cost factors’ of a query session relate to the user's continuity in that session and to the position of each query within it. We define cost indicators from the available query log data and study them in relation to continuity and to the order of the query (1st, 2nd, 3rd, ...). One of our hypotheses is that cost-related factors reflect the step-by-step nature of the query session process. We use descriptive statistics together with rule induction to identify the most relevant factors and observable trends, and we produce three classifier data models, one for each ‘query number’, using the ‘continuity flag’ as the class label. Using the cost factors, we identify trends relating continuity and query number to user behavior; this information can be used, for example, to make decisions about caching and query recommendation.
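As a rough illustration of the data preparation the abstract implies, the sketch below labels each query in a log with its query number and a continuity flag, then splits the rows into one dataset per query number. It is a minimal sketch under stated assumptions, not the authors' procedure: the record fields (`session_id`, `timestamp`, `clicks`), the use of click count as a cost indicator, and the definition of the flag as "the user issued another query in the same session" are all hypothetical choices made for the example.

```python
from collections import defaultdict
from typing import NamedTuple


class QueryRecord(NamedTuple):
    # All fields are hypothetical; a real query log would differ.
    session_id: str
    timestamp: float  # seconds since the start of the session
    query: str
    clicks: int       # assumed cost indicator: number of results clicked


def label_continuity(records):
    """Return (session_id, query_number, clicks, continuity_flag) rows.

    Assumption: continuity_flag is 1 when the user issued a further
    query in the same session, and 0 for the last query of a session.
    """
    sessions = defaultdict(list)
    for record in records:
        sessions[record.session_id].append(record)

    rows = []
    for session_id, recs in sessions.items():
        recs.sort(key=lambda r: r.timestamp)  # order queries within the session
        for index, rec in enumerate(recs):
            flag = 1 if index < len(recs) - 1 else 0
            rows.append((session_id, index + 1, rec.clicks, flag))
    return rows


def split_by_query_number(rows, max_number=3):
    """Group rows into one dataset per query number (1st, 2nd, 3rd)."""
    datasets = {n: [] for n in range(1, max_number + 1)}
    for row in rows:
        if row[1] in datasets:
            datasets[row[1]].append(row)
    return datasets


records = [
    QueryRecord("a", 0.0, "hotels santiago", 2),
    QueryRecord("a", 10.0, "cheap hotels santiago", 0),
    QueryRecord("b", 0.0, "weather", 1),
]
rows = label_continuity(records)
datasets = split_by_query_number(rows)
```

Each of the three datasets could then be passed to a separate rule-induction or decision-tree learner with the flag as the class label, which mirrors the "one model per query number" setup described in the abstract.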
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nettleton, D.F., Codina, J. (2010). A Cost-Continuity Model for Web Search. In: Torra, V., Narukawa, Y., Daumas, M. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2010. Lecture Notes in Computer Science, vol 6408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16292-3_22
DOI: https://doi.org/10.1007/978-3-642-16292-3_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16291-6
Online ISBN: 978-3-642-16292-3
eBook Packages: Computer Science (R0)