Abstract
Empirical distributional methods account for the meaning of syntactic structures by combining word vectors according to algebraic operators. In this paper, a novel approach for semantic composition based on space projection techniques over lexical vector representations is proposed. In line with the principle of compositionality, the meaning of a phrase is modeled in terms of the subset of properties shared by co-occurring words. Syntactic bi-grams are thus projected in the so called Support Subspace, corresponding to such properties. State-of-the-art results are achieved in a well known phrase similarity task, used as a benchmark for this class of methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Montague, R.: Formal Philosophy: Selected Papers of Richard Montague. Yale University Press (1974)
Coecke, B., Ssdrzaden, M., Clark, S.: Mathematical foundations for a compositional distributed model of meaning. Lambek Festschrift, Linguistic Analysis 36 (2010)
Firth, J.: A synopsis of linguistic theory 1930-1955. In: Studies in Linguistic Analysis. Philological Society, Oxford (1957); reprinted in Palmer, F. (ed.) Selected Papers of J. R. Firth. Longman, Harlow (1968)
Schütze, H.: Automatic Word Sense Discrimination. Computational Linguistics 24, 97–124 (1998)
Wittgenstein, L.: Philosophical Investigations. Blackwells, Oxford (1953)
Schütze, H.: Word space. In: Hanson, S.J., Cowan, J.D., Giles, C.L. (eds.) NIPS 5, pp. 895–902. Morgan Kaufmann Publishers, San Mateo (1993)
Turney, P.D., Pantel, P.: From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 37, 141 (2010)
Mitchell, J., Lapata, M.: Vector-based models of semantic composition. In: Proceedings of ACL/HLT 2008, pp. 236–244 (2008)
Baroni, M., Zamparelli, R.: Nouns are vectors, adjectives are matrices: representing adjective-noun constructions in semantic space. In: EMNLP 2010, pp. 1183–1193. Association for Computational Linguistics, Stroudsburg (2010)
Grefenstette, E., Sadrzadeh, M.: Experimental support for a categorical compositional distributional model of meaning. CoRR abs/1106.4058 (2011)
Salton, G., Wong, A., Yang, C.: A vector space model for automatic indexing. Communications of the ACM 18, 613–620 (1975)
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of The American Society For Information Science 41, 391–407 (1990)
Landauer, T.K., Dutnais, S.T.: A solution to plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 211–240 (1997)
Harris, Z.S.: Mathematical Structures of Language. Wiley, NY (1968)
Lin, D.: Automatic retrieval and clustering of similar word. In: Proceedings of COLING-ACL, Montreal, Canada (1998)
Pantel, P., Lin, D.: Document clustering with committees. In: Proceedigs of SIGIR 2002, Montreal, Canada, pp. 199–206 (2002)
Pennacchiotti, M., Cao, D.D., Basili, R., Croce, D., Roth, M.: Automatic induction of framenet lexical units. In: EMNLP, pp. 457–465 (2008)
Croce, D., Giannone, C., Annesi, P., Basili, R.: Towards open-domain semantic role labeling. In: Proceedings of ACL, pp. 237–246 (2010)
Foltz, P.W., Kintsch, W., Landauer, T.K.: The measurement of textual coherence with latent semantic analysis. Discourse Processes 25, 285–307 (1998)
Erk, K., Pad, S.: A structured vector space model for word meaning in context. In: EMNLP 2008: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 897–906. ACL (2008)
Mitchell, J., Lapata, M.: Composition in distributional models of semantics. Cognitive Science 34, 1388–1429 (2010)
Baroni, M., Bernardini, S., Ferraresi, A., Zanchetta, E.: The wacky wide web: a collection of very large linguistically processed web-crawled corpora. Language Resources And Evaluation 43, 209–226 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Annesi, P., Storch, V., Basili, R. (2012). Space Projections as Distributional Models for Semantic Composition. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28604-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-28604-9_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28603-2
Online ISBN: 978-3-642-28604-9
eBook Packages: Computer ScienceComputer Science (R0)