Synonyms
VSM
Definition
In the Vector-Space Model (VSM) for Information Retrieval (IR), every informative object (e.g., document, query, fragment, cluster, collection) can be described as a vector of a vector space defined over the real field. In most applications, the VSM for IR represents documents and queries as vectors of weights (i.e., coordinates of a vector space). Each weight is a measure of the importance of an index term in a document or a query. The index term weights are computed on the basis of the frequency of the index terms in the document, the query, or the collection. At retrieval time, the documents are ranked by a function of the inner product between the document vectors and the query vector; for example, the retrieval function can be the cosine of the angle between a document vector and a query vector. If x is the vector of the n-dimensional real field which represents an informative object to be ranked against another informative object, which is represented by...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsRecommended Reading
Deerwester S, Dumais S, Furnas G, Landauer T, Harshman R. Indexing by latent semantic analysis. J Am Soc Inform Sci. 1990;41(6):391–407.
Dubin D. The most influential paper Gerard Salton never wrote. Libr Trends. 2004;52(4):748–64.
Halmos P. Finite-dimensional vector spaces. Undergraduate texts in mathematics. New York: Springer; 1987.
Melucci M. Context modeling and discovery using vector space bases. In: Proceedings of the Annual ACM Conference on Information and Knowledge Management; 2005. p. 808–15.
Melucci M. A basis for information retrieval in context. ACM Trans Inform Syst. 2008;26(3):1–41.
Melucci M. Introduction to information retrieval and quantum mechanics. Berlin: Springer; 2015.
Paik JH. A novel TF-IDF weighting scheme for effective ranking. In: Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2013. p. 343–52.
Salton G. Associative document retrieval techniques using bibliographic information. J ACM. 1963;10(4):440–57.
Salton G. Mathematics and information retrieval. J Doc. 1979;35(1):1–29.
Salton G. Automatic text processing. Reading: Addison-Wesley; 1989.
Salton G, Buckley C. Term weighting approaches in automatic text retrieval. Inform Process Manage. 1988;24(5):513–23.
Salton G, Wong A, Yang C. A vector space model for automatic indexing. Commun ACM. 1975;18(11):613–20
Salton G, Yang C, Yu C. A theory of term importance in automatic text analysis. J Am Soc Inform Sci. 1975;26(1):33–44.
Salton G, Allan J, Buckley C. Approaches to passage retrieval in full text information systems. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1993. p. 49–58.
Salton G, Singhal A, Buckley C, Mitra M. Automatic text decomposition using text segments and text themes. In: Proceedings of the ACM Hypertext Conference; 1996. p. 53–65.
Salton G, Singhal A, Mitra M, Buckley C. Automatic text structuring and summarization. Inf Proc Manag. 1997;33(2):193–207.
Singhal A, Buckley C, Mitra M. Pivoted document length normalization. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1996. p. 21–29.
van Rijsbergen C. The geometry of information retrieval. Cambridge: Cambridge University Press; 2004.
Wong S, Raghavan V. Vector space model of information retrieval – a reevaluation. In: Proceedings of the 7th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1984. p. 167–85.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Melucci, M. (2018). Vector-Space Model. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_918
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_918
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering