Skip to main content

BM25

  • Reference work entry
  • First Online:
  • 44 Accesses

Synonyms

OKAPI retrieval function; Probabilistic model

Definition

BM25 is a ranking function that ranks a set of documents based on the query terms appearing in each document, regardless of the interrelationship between the query terms within a document (e.g., their relative proximity). It is not a single function, but actually a whole family of scoring functions, with slightly different components and parameters. It is used by search engines to rank matching documents according to their relevance to a given search query and is often referred to as “Okapi BM25,” since the Okapi information retrieval system was the first system implementing this function. The BM25 retrieval formula belongs to the BM family of retrieval models (BM stands for Best Match) that is the weight of a term t in a document d is

$$ \frac{\mathrm{t}\mathrm{f}}{k+\mathrm{tf}}\; \ln\;\frac{\left({r}_{\mathrm{t}}+0.5\right)\cdotp...

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Recommended Reading

  1. Robertson SE, Walker S, Beaulieu MM, Gatford M, Payne A. Okapi at trec-4. In: Harman DK, editor. NIST special publication 500-236. Proceedings of the 4th Text Retrieval Conference; 1996.

    Google Scholar 

  2. Robertson SE, Walker S. Some simple approximations to the 2-poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1994. p. 232–41.

    Google Scholar 

  3. Robertson SE, Van Rijsbergen CJ, Porter M. Probabilistic models of indexing and searching, Chapter 4. In: Robertson SE, Van Rijsbergen CJ, Williams PW, editors. Information retrieval research. London: Butterworths; 1981. p. 35–56.

    Google Scholar 

  4. Robertson SE, Sparck-Jones K. Relevance weighting of search terms. J Am Soc Inf Sci. 1976;27(3):129–46.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giambattista Amati .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Amati, G. (2018). BM25. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_921

Download citation

Publish with us

Policies and ethics