Skip to main content

Information Retrieval Models

  • Living reference work entry
  • First Online:
Encyclopedia of Database Systems
  • 68 Accesses

Synonyms

Ad hoc retrieval models; Document term weighting; Term-document matching function

Definition

A model of information retrieval (IR) selects and ranks the relevant documents with respect to a user’s query. The texts of the documents and the queries are represented in the same way, so that document selection and ranking can be formalized by a matching function that returns a retrieval status value (RSV) for each document in the collection. Most of the IR systems represent document contents by a set of descriptors, called terms, belonging to a vocabulary V.

An IR model defines the query-document matching function according to four main approaches:

  • The estimation of the probability of user’s relevance rel for each document d and query q with respect to a set Rq of training documents

    $$ \mathrm{Prob}0.24em \left( rel|\mathbf{d},\mathbf{q},{R}_{\mathbf{q}}\right) $$
  • The computation of a similarity function between queries and documents in a vector space

    $$...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Recommended Reading

  1. Robertson SE, Walker S. Some simple approximations to the 2-Poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th annual international ACM SIGIR conference on research and development in information retrieval. Dublin: Springer; 1994. p. 232–41.

    Google Scholar 

  2. Salton G, McGill MJ. Introduction to modern information retrieval. New York: McGraw-Hill; 1983.

    MATH  Google Scholar 

  3. Ponte J, Croft BA. Language modeling approach in information retrieval. In: Croft B, Moffat A, Van Rijsbergen CJ, editors. Proceedings of the 21st ACM SIGIR conference on research and development in information retrieval. Melbourne: ACM; 1998. p. 275–81.

    Google Scholar 

  4. Amati G, Van Rijsbergen CJ. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans Inform Syst. 2002;20(4):357–89.

    Article  Google Scholar 

  5. Clinchant S, Gaussier E. Information-based models for ad hoc IR. In: Proceedings of the 33rd ACM SIGIR conference on research and development in information retrieval. New York: ACM; 2010. p. 234–41.

    Google Scholar 

  6. Harter SP. A probabilistic approach to automatic keyword indexing. Part I: on the distribution of specialty words in a technical literature. J ASIS. 1975;26:197–216.

    Google Scholar 

  7. Berger A, Lafferty J. Information retrieval as statistical translation. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval; 1999. p. 222–9.

    Google Scholar 

  8. Croft WB, Lafferty J (Eds). Language modeling for information retrieval. Boston: Kluwer; 2003.

    MATH  Google Scholar 

  9. Turtle H, Bruce Croft W. Evaluation of an inference network-based retrieval model. ACM Trans Inform Syst. 1991;9(3):187–222.

    Article  Google Scholar 

  10. Salton G, Fox EA, Wu H. Extended boolean information retrieval. Commun ACM. 1983;26(11):1022–36.

    Article  MathSciNet  Google Scholar 

  11. Van Rijsbergen CJA. New theorethical framework for information retrieval. In: Proceedings of the 9th annual international ACM SIGIR conference on research and development in information retrieval; 1986. p. 194–200.

    Google Scholar 

  12. Crestani F, Van Rijsbergen CJ. Probability kinematics in information retrieval. In: Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval; 1995. p. 291–9.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giambattista Amati .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media LLC

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Amati, G. (2017). Information Retrieval Models. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_916-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_916-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4899-7993-3

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics