Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Vector-Space Model

  • Massimo MelucciEmail author
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_918




In the Vector-Space Model (VSM) for Information Retrieval (IR), every informative object (e.g., document, query, fragment, cluster, collection) can be described as a vector of a vector space defined over the real field. In most applications, the VSM for IR represents documents and queries as vectors of weights (i.e., coordinates of a vector space). Each weight is a measure of the importance of an index term in a document or a query. The index term weights are computed on the basis of the frequency of the index terms in the document, the query, or the collection. At retrieval time, the documents are ranked by a function of the inner product between the document vectors and the query vector; for example, the retrieval function can be the cosine of the angle between a document vector and a query vector. If x is the vector of the n-dimensional real field which represents an informative object to be ranked against another informative object, which is represented by...
This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Deerwester S, Dumais S, Furnas G, Landauer T, Harshman R. Indexing by latent semantic analysis. J Am Soc Inform Sci. 1990;41(6):391–407.CrossRefGoogle Scholar
  2. 2.
    Dubin D. The most influential paper Gerard Salton never wrote. Libr Trends. 2004;52(4):748–64.Google Scholar
  3. 3.
    Halmos P. Finite-dimensional vector spaces. Undergraduate texts in mathematics. New York: Springer; 1987.Google Scholar
  4. 4.
    Melucci M. Context modeling and discovery using vector space bases. In: Proceedings of the Annual ACM Conference on Information and Knowledge Management; 2005. p. 808–15.Google Scholar
  5. 5.
    Melucci M. A basis for information retrieval in context. ACM Trans Inform Syst. 2008;26(3):1–41.CrossRefGoogle Scholar
  6. 6.
    Melucci M. Introduction to information retrieval and quantum mechanics. Berlin: Springer; 2015.zbMATHCrossRefGoogle Scholar
  7. 7.
    Paik JH. A novel TF-IDF weighting scheme for effective ranking. In: Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2013. p. 343–52.Google Scholar
  8. 8.
    Salton G. Associative document retrieval techniques using bibliographic information. J ACM. 1963;10(4):440–57.zbMATHCrossRefGoogle Scholar
  9. 9.
    Salton G. Mathematics and information retrieval. J Doc. 1979;35(1):1–29.CrossRefGoogle Scholar
  10. 10.
    Salton G. Automatic text processing. Reading: Addison-Wesley; 1989.Google Scholar
  11. 11.
    Salton G, Buckley C. Term weighting approaches in automatic text retrieval. Inform Process Manage. 1988;24(5):513–23.CrossRefGoogle Scholar
  12. 12.
    Salton G, Wong A, Yang C. A vector space model for automatic indexing. Commun ACM. 1975;18(11):613–20zbMATHCrossRefGoogle Scholar
  13. 13.
    Salton G, Yang C, Yu C. A theory of term importance in automatic text analysis. J Am Soc Inform Sci. 1975;26(1):33–44.CrossRefGoogle Scholar
  14. 14.
    Salton G, Allan J, Buckley C. Approaches to passage retrieval in full text information systems. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1993. p. 49–58.Google Scholar
  15. 15.
    Salton G, Singhal A, Buckley C, Mitra M. Automatic text decomposition using text segments and text themes. In: Proceedings of the ACM Hypertext Conference; 1996. p. 53–65.Google Scholar
  16. 16.
    Salton G, Singhal A, Mitra M, Buckley C. Automatic text structuring and summarization. Inf Proc Manag. 1997;33(2):193–207.CrossRefGoogle Scholar
  17. 17.
    Singhal A, Buckley C, Mitra M. Pivoted document length normalization. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1996. p. 21–29.Google Scholar
  18. 18.
    van Rijsbergen C. The geometry of information retrieval. Cambridge: Cambridge University Press; 2004.zbMATHCrossRefGoogle Scholar
  19. 19.
    Wong S, Raghavan V. Vector space model of information retrieval – a reevaluation. In: Proceedings of the 7th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1984. p. 167–85.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.University of PaduaPaduaItaly