Skip to main content

Part of the book series: The Information Retrieval Series ((INRE,volume 27))

  • 1066 Accesses

Abstract

This chapter extends the basic MRF model by automatically learning query-dependent concept weights. The extension is a generic framework for learning the importance of query term concepts in a way that directly optimizes an underlying retrieval metric. By implementing concept weighting directly into the underlying retrieval model it avoids the issue of metric divergence. The chapter concludes with a rigorous experimental evaluation that demonstrates this weighting strategy is capable of yielding strong gains in retrieval effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Available from the Linguistic Data Consortium catalog.

  2. 2.

    Available as a part of Microsoft 2006 RFP dataset.

  3. 3.

    Available at: http://download.wikimedia.org/enwiki/.

References

  • Bai, J., Chang, Y., Cui, H., Zheng, Z., Sun, G., & Li, X. (2008). Investigation of partial query proximity in web search. In Proc. 17th international conference on World Wide Web (pp. 1183–1184).

    Chapter  Google Scholar 

  • Bendersky, M., & Croft, W. B. (2008). Discovering key concepts in verbose queries. In Proc. 31st ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Bendersky, M., Croft, W. B., & Smith, D. A. (2009). Two-stage query segmentation for information retrieval. In Proc. 32nd ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Bendersky, M., Metzler, D., & Croft, W. B. (2010). Learning concept importance using a weighted dependence model. In Proceedings of the third ACM international conference on Web search and data mining (WSDM 2010) (pp. 31–40), New York.

    Chapter  Google Scholar 

  • Bergsma, S., & Wang, Q. I. (2007). Learning noun phrase query segmentation. In Proc. of EMNLP-CoNLL.

    Google Scholar 

  • Cao, G., Nie, J.-Y., Gao, J., & Robertson, S. (2008). Selecting good expansion terms for pseudo-relevance feedback. In Proc. 31st ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Cummins, R., & O’Riordan, C. (2009). Learning in a pairwise term-term proximity framework for information retrieval. In Proc. 32nd ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Gey, F. (1994). Inferring probability of relevance using the method of logistic regression. In Proc. 17th ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Guo, J., Xu, G., Li, H., & Cheng, X. (2008). A unified and discriminative model for query refinement. In Proc. of 31st ann. intl. ACM SIGIR conf. on research and development in information retrieval. New York: ACM.

    Google Scholar 

  • Järvelin, K., & Kekäläinen, J. (2002). Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 20(4), 422–446.

    Article  Google Scholar 

  • Kumaran, G., & Carvalho, V. R. (2009). Reducing long queries using query quality predictors. In Proc. 32nd ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Lavrenko, V., & Croft, W. B. (2001). Relevance-based language models. In Proc. 24th ann. intl. ACM SIGIR conf. on research and development in information retrieval (pp. 120–127).

    Chapter  Google Scholar 

  • Lease, M. (2009). An improved Markov random field model for supporting verbose queries. In Proc. 32nd ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Lease, M., Croft, W. B., & Allan, J. (2009). Regression rank: Learning to meet the opportunity of descriptive queries. In Proc. 31st European conf. on information retrieval.

    Google Scholar 

  • Liu, T.-Y. (2009). Learning to rank for information retrieval. Foundations and Trends in Information Retrieval, 3(3).

    Google Scholar 

  • Metzler, D., & Croft, W. B. (2005). A Markov random field model for term dependencies. In Proc. 28th ann. intl. ACM SIGIR conf. on research and development in information retrieval (pp. 472–479).

    Chapter  Google Scholar 

  • Metzler, D., & Croft, W. B. (2007). Latent concept expansion using Markov random fields. In Proc. 30th ann. intl. ACM SIGIR conf. on research and development in information retrieval.

    Google Scholar 

  • Mishne, G., & de Rijke, M. (2005). Boosting web retrieval through query operations. In Proc. 27th European conf. on information retrieval (pp. 502–516).

    Google Scholar 

  • Morgan, W., Greiff, W., & Henderson, J. (2004). Direct maximization of average precision by hill-climbing with a comparison to a maximum entropy approach (Technical report). MITRE.

    Google Scholar 

  • Pickens, J., & Croft, W. B. (1999). An exploratory analysis of phrases in text retrieval. In Proc. of RIAO 2000.

    Google Scholar 

  • Ponte, J., & Croft, W. B. (1998). A language modeling approach to information retrieval. In Proc. 21st ann. intl. ACM SIGIR conf. on research and development in information retrieval (pp. 275–281).

    Chapter  Google Scholar 

  • Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.

    Article  Google Scholar 

  • Tan, B., & Peng, F. (2008). Unsupervised query segmentation using generative language models and wikipedia. In Proc. 17th intl. World Wide Web conf. New York: ACM.

    Google Scholar 

  • Tao, T. & Zhai, C. X. (2007). An exploration of proximity measures in information retrieval. In Proc. 30th ann. intl. ACM SIGIR conf. on research and development in information retrieval (pp. 295–302). New York: ACM.

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Donald Metzler .

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Metzler, D. (2011). Query-Dependent Feature Weighting. In: A Feature-Centric View of Information Retrieval. The Information Retrieval Series, vol 27. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22898-8_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22898-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22897-1

  • Online ISBN: 978-3-642-22898-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics