Probabilistic Relevance Models Based on Document and Query Generation
We give a unified account of the probabilistic semantics underlying the language modeling approach and the traditional probabilistic model for information retrieval, showing that the two approaches can be viewed as being equivalent probabilistically, since they are based on different factorizations of the same generative relevance model. We also discuss how the two approaches lead to different retrieval frameworks in practice, since they involve component models that are estimated quite differently.
KeywordsLanguage models relevance models generative models
Unable to display preview. Download preview PDF.
- Berger, A. and Lafferty, J. (1999). Information retrieval as statistical transla tion. In Proceedings of the 22nd International Conference on Research and Development in Information Retrieval (SIGIR’99), pages 222–229.Google Scholar
- Brin, S. and Page, L. (1998). Anatomy of a large-scale hypertextual web search engine. In Proceedings of the 7th International World Wide Web Conference. Google Scholar
- Brown, P. F., Cocke, J., Della Pietra, S. A., Della Pietra, V. J., Jelinek, F., Lafferty, J. D., Mercer, R. L., and Roossin, P. S. (1990). A statistical approach to machine translation. Computational Linguistics, 16 (2): 79–85.Google Scholar
- Croft, W. B. and Harper, D. (1979). Using probabilistic models of document retrieval without relevance information. Journal of Documentation, 35: 285295.Google Scholar
- Lavrenko, V. and Croft, W. B. (2001). Relevance-based language models. In 24th ACM SIGIR Conference on Research and Development in Information Retrieval. Google Scholar
- Miller, D., Leek, T., and Schwartz, R. (1999). A hidden Markov model information retrieval system. In Proceedings of the 22nd International Conference on Research and Development in Information Retrieval (SIGIR’99), pages 214–221.Google Scholar
- Ponte, J. and Croft, W. B. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st International Conference on Research and Development in Information Retrieval (SIGIR’98), pages 275–281Google Scholar
- Robertson, S. and Walker, S. (1997). On relevance weights with little relevance information. In Proceedings of SIGIR’97, pages 16–24.Google Scholar
- Robertson, S. E., Maron, M. E., and Cooper, W. S. (1982). Probability of relevance: a unification of two competing models for information retrieval. Information Technology - Research and Development, 1:1–21.Google Scholar