Language Models for Relevance Feedback

Ponte, Jay M.

doi:10.1007/0-306-47019-5_3

Jay M. Ponte³

Part of the book series: The Information Retrieval Series ((INRE,volume 7))

283 Accesses
2 Citations

Abstract

The language modeling approach to Information Retrieval (IR) is a conceptually simple model of IR originally developed by Ponte and Croft (1998). In this approach, the query is treated as a random event and documents are ranked according to the likelihood that the query would be generated via a language model estimated for each document. The intuition behind this approach is that users have a prototypical document in mind and will choose query terms accordingly. The intuitive appeal of this method is that inferences about the semantic content of documents do not need to be made resulting in a conceptually simple model. In this paper, techniques for relevance feedback and routing are derived from the language modeling approach in a straightforward manner and their effectiveness is demonstrated empirically. These experiments demonstrate further proof of concept for the language modeling approach to retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Beeferman, D., Berger, A., and Lafferty, J. (1997). Text segmentation using exponential models. In Proceedings of Empirical Methods in Natural Language Processing.
Google Scholar
Ghosh, M. J., Hwang, T., and Tsui, K. W. (1983). Construction of improved estimators in multiparameter estimation for discrete exponential families. Annals of Statistics, 11:351–367.
MathSciNet Google Scholar
Haines, D. (1996). Adaptive query modification in a probabilistic information retrieval model. PhD thesis, Computer Science Department, University of Massachusetts.
Google Scholar
Harman, D. (1996). Routing results. In Proceedings of the 4th Text Retrieval Conference (TREC-4), pages A53–A81.
Google Scholar
Harper, D. J. and van Rijsbergen, C. J. (1978). An Evaluation of Feedback in Document Retrieval Using Co-occurrence Data. Journal of Documentation, 34(3):189–216.
Google Scholar
Ponte, J. and Croft, W. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st ACM SIGIR Conference on Research and Development in Information Retrieval, pages 275–281.
Google Scholar
Rocchio, J. J. (1971). Relevance Feedback in Information Retrieval, chapter 14, pages 313–323. Prentice-Hall Inc.
Google Scholar
Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. Chapman and Hall.
Google Scholar
Turtle, H. R. (1991). Inference networks for document retrieval. Technical report, University of Massachusetts Ph.D. dissertation.
Google Scholar
van Rijsbergen, C. J. (1977). A theoretical basis for the use of co-occurrence data in information retrieval. Journal of Documentation, pages 106–119.
Google Scholar

Download references

Author information

Authors and Affiliations

GTE Laboratories, 40 Sylvan Rd, Waltham, MA, 02451, USA
Jay M. Ponte

Authors

Jay M. Ponte
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Massachusetts, Amherst
W. Bruce Croft

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ponte, J.M. (2002). Language Models for Relevance Feedback. In: Croft, W.B. (eds) Advances in Information Retrieval. The Information Retrieval Series, vol 7. Springer, Boston, MA. https://doi.org/10.1007/0-306-47019-5_3

Download citation

DOI: https://doi.org/10.1007/0-306-47019-5_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-7812-9
Online ISBN: 978-0-306-47019-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics