Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Information Retrieval Models

  • Giambattista Amati
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_916

Synonyms

Ad hoc retrieval models; Document term weighting; Term-document matching function

Definition

A model of information retrieval (IR) selects and ranks the relevant documents with respect to a user’s query. The texts of the documents and the queries are represented in the same way, so that document selection and ranking can be formalized by a matching function that returns a retrieval status value (RSV) for each document in the collection. Most of the IR systems represent document contents by a set of descriptors, called terms, belonging to a vocabulary V.

An IR model defines the query-document matching function according to four main approaches:
  • The estimation of the probability of user’s relevance rel for each document d and query q with respect to a set R q of training documents
    $$ \mathrm{Prob}\kern0.24em \left( rel|\mathbf{d},\mathbf{q},{R}_{\mathbf{q}}\right) $$
This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Robertson SE, Walker S. Some simple approximations to the 2-Poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1994. p. 232–41.Google Scholar
  2. 2.
    Salton G, McGill MJ. Introduction to modern information retrieval. New York: McGraw-Hill; 1983.zbMATHGoogle Scholar
  3. 3.
    Ponte J, Croft BA. Language modeling approach in information retrieval. In: Croft B, Moffat A, Van Rijsbergen CJ, editors. Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1998. p. 275–81.Google Scholar
  4. 4.
    Amati G, Van Rijsbergen CJ. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans Inform Syst. 2002;20(4):357–89.CrossRefGoogle Scholar
  5. 5.
    Clinchant S, Gaussier E. Information-based models for ad hoc IR. In: Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2010. p. 234–41.Google Scholar
  6. 6.
    Harter SP. A probabilistic approach to automatic keyword indexing. Part I: on the distribution of specialty words in a technical literature. J ASIS. 1975;26(4):197–216.Google Scholar
  7. 7.
    Berger A, Lafferty J. Information retrieval as statistical translation. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1999. p. 222–9.Google Scholar
  8. 8.
    Croft WB, Lafferty J (Eds). Language modeling for information retrieval. Boston: Kluwer; 2003.zbMATHGoogle Scholar
  9. 9.
    Turtle H, Bruce Croft W. Evaluation of an inference network-based retrieval model. ACM Trans Inform Syst. 1991;9(3):187–222.CrossRefGoogle Scholar
  10. 10.
    Salton G, Fox EA, Wu H. Extended boolean information retrieval. Commun ACM. 1983;26(11):1022–36.MathSciNetzbMATHCrossRefGoogle Scholar
  11. 11.
    Van Rijsbergen CJA. New theorethical framework for information retrieval. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1986. p. 194–200.Google Scholar
  12. 12.
    Crestani F, Van Rijsbergen CJ. Probability kinematics in information retrieval. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1995. p. 291–9.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Fondazione Ugo BordoniRomeItaly

Section editors and affiliations

  • Giambattista Amati
    • 1
  1. 1.Fondazione Ugo BordoniRomeItaly