Abstract
At the Web Intelligence conference in 2009, Jiang, Zilles, and Holte introduced a novel approach to query suggestion based on query search (QSQS), as well as a system-centered evaluation method. For each potentially relevant document, QSQS creates a complex query—called a lexical alias for the document—that ranks the document in its top 20. A technique called Query Search then builds query suggestions by simplifying the lexical alias.
The present paper improves the state of the art by proposing two new query suggestion systems, IQSQS and GQSQS. Both replace the generation of lexical aliases by a simpler and more effective term selection process. They differ in their control structure: IQSQS builds query suggestions separately for each potentially relevant document, GQSQS builds them for a set of documents at once.
Both our new systems substantially outperform QSQS in the measures introduced by Jiang et al. to evaluate QSQS; we achieve improvements of up to 30 percent in these measures for short user queries and up to 100 percent for long user queries. We show empirically that query expansion, which forces the user’s query to be included in each suggested query, is significantly superior to allowing the system the freedom to include or exclude terms from the user’s query at its discretion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Carpineto, C., Mori, R., Romano, G., Bigi, B.: An information-theoretic approach to automatic query expansion. ACM Transactions on Information Systems (TOIS) 19, 1–27 (2001)
Chen, J., Zaïane, O.R., Goebel, R.: An unsupervised approach to cluster web search results based on word sense communities. In: WI 2008, pp. 725–729 (2008)
Cutting, D., Karger, D., Pederson, J., Tukey, J.: Scatter/gather: a cluster-based approach to browsing large document collections. In: ACM SIGIR 1992, pp. 318–329 (1992)
Geraci, F., Pellegrini, M., Maggini, M., Sebastiani, F.: Cluster generation and labeling for web snippets: A fast, accurate hierarchical solution. Internet Mathematics 3, 413–443 (2006)
Jansen, B., Spink, A.: How are we searching the world wide web?: a comparison of nine search engine transaction logs. Inf. Process. Manage. 42(1), 248–263 (2006)
Jansen, B., Spink, A., Saracevic, T.: Real life, real users, and real needs: a study and analysis of user queries on the web. Inf. Process. Manage. 36(2), 207–227 (2000)
Jiang, S.: Searching for queries to improve document retrieval in web search. Master’s thesis, University of Alberta (2009)
Jiang, S., Zilles, S., Holte, R.: Empirical analysis of the rank distribution of relevant documents in web search. In: WI 2008, pp. 208–213 (2008)
Jiang, S., Zilles, S., Holte, R.: Query suggestion by query search: a new approach to user support in web search. In: WI 2009, pp. 679–684 (2009)
Lin, C.-Y., Hovy, E.: From single to multi-document summarization. In: ACL, pp. 457–464 (2002)
Manning, C., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
Martin, J., Holte, R.: Searching for content-based addresses on the world-wide web. In: Proceedings of the 3rd ACM Conference on Digital Libraries, pp. 299–300 (1998)
Mitra, M., Singhal, A., Buckley, C.: Improving automatic query expansion. In: ACM SIGIR 1998, pp. 206–214 (1998)
Radev, D., Jing, H., Sty, M., Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manage. 40(6), 919–938 (2004)
Silverstein, C., Rauch Henzinger, M., Marais, H., Moricz, M.: Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6–12 (1999)
Stein, B., Zu Eissen, S.M.: Topic identification: framework and application. In: Proceedings of the International Conference on Knowledge Management, pp. 522–531 (2004)
Treeratpituk, P., Callan, J.: Automatically labeling hierarchical clusters. In: Proceedings of the 2006 International Conference on Digital Government Research, pp. 167–176 (2006)
Voorhees, E.M.: Query expansion using lexical-semantic relations. In: ACM SIGIR 1994, pp. 61–69 (1994)
Wang, X., Zhai, C.: Mining term association patterns from search logs for effective query reformulation. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 479–488 (2008)
White, R., Clarke, C., Cucerzan, S.: Comparing query logs and pseudo-relevance feedback for web-search query refinement. In: ACM SIGIR 2007, pp. 831–832 (2007)
Xu, J., Croft, W.: Query expansion using local and global document analysis. In: ACM SIGIR 1996, pp. 4–11 (1996)
Zhang, X.: Search term selection and document clustering for query suggestion. Master’s thesis, University of Alberta (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Zilles, S., Holte, R.C. (2012). Improved Query Suggestion by Query Search. In: Glimm, B., Krüger, A. (eds) KI 2012: Advances in Artificial Intelligence. KI 2012. Lecture Notes in Computer Science(), vol 7526. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33347-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-33347-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33346-0
Online ISBN: 978-3-642-33347-7
eBook Packages: Computer ScienceComputer Science (R0)