Skip to main content

Query Model Estimations for Relevance Feedback in Language Modeling Approach

  • Conference paper
Information Retrieval Technology (AIRS 2004)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3411))

Included in the following conference series:

  • 409 Accesses

Abstract

Recently, researchers have successfully augmented the language modeling approach with a well-founded framework in order to incorporate relevance feedback. A critical problem in this framework is to estimate a query language model that encodes detailed knowledge about a user’s information need. This paper explores several methods for query model estimation, motivated by Zhai’s generative model. The generative model is an estimation method that maximizes the generative likelihood of feedback documents according to the estimated query language model. Focusing on some limitations of the original generative model, we propose several estimation methods to resolve these limitations: 1) three-component mixture model, 2) re-sampling feedback documents with document language models, and 3) sampling a relevance document from a relevance document language model. In addition, several hybrid methods are also examined, which combine the query specific smoothing method and the estimated query language model. In experiments, our estimation methods outperform a simple generative model, showing a significant improvement over an initial retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berger, A., Lafferty, J.: Information Retrieval as Statistical Translation. In: Proceedings of 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 222–229 (1999)

    Google Scholar 

  2. Dempster, A.: Maximum Likelihood from Incomplete Data via the EM algorithm. Journal of Royal Statistical Society 39(1), 1–39 (1977)

    MATH  MathSciNet  Google Scholar 

  3. Hiemstra, D.: Term Specific Smoothing for Language Modeling Approach to Information Retrieval: The Importance of a Query Term. In: Proceedings of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 35–41 (2002)

    Google Scholar 

  4. Hiemstra, D.: Using Language Models for Information Retrieval. In PhD Thesis, University of Twente (2001)

    Google Scholar 

  5. Lafferty, J., Zhai, C.: Document Language Models, Query Models, and Risk Minimization for Information Retrieval. In: Proceedings of 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 111–119 (2001)

    Google Scholar 

  6. Lam-Adesina, A., Jones, G.: Applying Summarization Techniques for Term Selection in Relevance Feedback. In: Proceedings of 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1–9 (2001)

    Google Scholar 

  7. Lavrenko, V., Croft, B.: Relevance-based language models. In: Proceedings of 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 120–127 (2001)

    Google Scholar 

  8. Miller, D., Leek, T., Schwartz, R.: A Hidden Markov Model Information Retrieval System. In: Proceedings of 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 214–221 (1999)

    Google Scholar 

  9. Ng, K.: A Maximum Likelihood Ratio Information Retrieval Model. In: TREC-8 Workshop notebook (1999)

    Google Scholar 

  10. Ponte, A.: A Language Modeling Approach to Information Retrieval. In: PhD thesis, Dept., o Computer Science, Univercity of Massachusetts (1998)

    Google Scholar 

  11. Robertson, S., Hiemstra, D.: Language Models and Probability of Relevance. In: Proceedings of the Workshop on Language Modeling and Information Retrieval (2001)

    Google Scholar 

  12. Song, F., Croft, W.: A General Language Model for Information Retrieval. In: Proceedings of 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 279–280 (1999)

    Google Scholar 

  13. Srikanth, M., Srihari, R.: Biterm Language Models for Document Retrieval. In: Proceedings of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 425–426 (2002)

    Google Scholar 

  14. Xu, J., Croft, W.: Query Expansion using Local and Global Document Analysis. In: Proceedings of 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 4–11 (1996)

    Google Scholar 

  15. Zaragoza, H., Hiemstra, D.: Bayesian Extension to the Language Model for Ad Hoc Information Retrieval. In: Proceedings of 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 4–9 (2003)

    Google Scholar 

  16. Zhai, C., Lafferty, J.: Model-based Feedback in the Language Modeling Approach to Information Retrieval. In: Proceedings of the 10th international conference on Information and knowledge management, pp. 430–410 (2001)

    Google Scholar 

  17. Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In: Proceedings of 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 334–342 (2001)

    Google Scholar 

  18. Srikanth, M., Srihari, R.: Biterm Language Models for Document Retrieval. In: Proceedings of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 425–426 (2002)

    Google Scholar 

  19. Zaragoza, H., Hiemstra, D.: Bayesian Extension to the Language Model for Ad Hoc Information Retrieval. In: Proceedings of 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 4–9 (2003)

    Google Scholar 

  20. Zhai, C., Lafferty, J.: Model-based Feedback in the Language Modeling Approach to Information Retrieval. In: Proceedings of the 10th international conference on Information and knowledge management, pp. 430–410 (2002)

    Google Scholar 

  21. Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In: Proceedings of 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 334–342 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Na, SH., Kang, IS., Moon, K., Lee, JH. (2005). Query Model Estimations for Relevance Feedback in Language Modeling Approach. In: Myaeng, S.H., Zhou, M., Wong, KF., Zhang, HJ. (eds) Information Retrieval Technology. AIRS 2004. Lecture Notes in Computer Science, vol 3411. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31871-2_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-31871-2_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25065-4

  • Online ISBN: 978-3-540-31871-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics