Skip to main content

Information Retrieval Models

  • Reference work entry
  • First Online:

Synonyms

Ad hoc retrieval models; Document term weighting; Term-document matching function

Definition

A model of information retrieval (IR) selects and ranks the relevant documents with respect to a user’s query. The texts of the documents and the queries are represented in the same way, so that document selection and ranking can be formalized by a matching function that returns a retrieval status value (RSV) for each document in the collection. Most of the IR systems represent document contents by a set of descriptors, called terms, belonging to a vocabulary V.

An IR model defines the query-document matching function according to four main approaches:

  • The estimation of the probability of user’s relevance rel for each document d and query q with respect to a set R q of training documents

    $$ \mathrm{Prob}\kern0.24em \left( rel|\mathbf{d},\mathbf{q},{R}_{\mathbf{q}}\right) $$
  • The computation of a similarity function between queries and documents in a vector space

    $$...

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Recommended Reading

  1. Robertson SE, Walker S. Some simple approximations to the 2-Poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1994. p. 232–41.

    Google Scholar 

  2. Salton G, McGill MJ. Introduction to modern information retrieval. New York: McGraw-Hill; 1983.

    MATH  Google Scholar 

  3. Ponte J, Croft BA. Language modeling approach in information retrieval. In: Croft B, Moffat A, Van Rijsbergen CJ, editors. Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1998. p. 275–81.

    Google Scholar 

  4. Amati G, Van Rijsbergen CJ. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans Inform Syst. 2002;20(4):357–89.

    Article  Google Scholar 

  5. Clinchant S, Gaussier E. Information-based models for ad hoc IR. In: Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2010. p. 234–41.

    Google Scholar 

  6. Harter SP. A probabilistic approach to automatic keyword indexing. Part I: on the distribution of specialty words in a technical literature. J ASIS. 1975;26(4):197–216.

    Google Scholar 

  7. Berger A, Lafferty J. Information retrieval as statistical translation. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1999. p. 222–9.

    Google Scholar 

  8. Croft WB, Lafferty J (Eds). Language modeling for information retrieval. Boston: Kluwer; 2003.

    MATH  Google Scholar 

  9. Turtle H, Bruce Croft W. Evaluation of an inference network-based retrieval model. ACM Trans Inform Syst. 1991;9(3):187–222.

    Article  Google Scholar 

  10. Salton G, Fox EA, Wu H. Extended boolean information retrieval. Commun ACM. 1983;26(11):1022–36.

    Article  MathSciNet  MATH  Google Scholar 

  11. Van Rijsbergen CJA. New theorethical framework for information retrieval. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1986. p. 194–200.

    Google Scholar 

  12. Crestani F, Van Rijsbergen CJ. Probability kinematics in information retrieval. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1995. p. 291–9.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giambattista Amati .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Amati, G. (2018). Information Retrieval Models. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_916

Download citation

Publish with us

Policies and ethics