Abstract
Noun compounds are a frequently occurring yet highly ambiguous construction in natural language; their interpretation relies on extra-syntactic information. Several statistical methods for compound disambiguation have been reported in the literature; however, a striking feature of all these approaches is that disambiguation relies on statistics derived from unambiguous compounds in training, meaning they are prone to the problem of sparse data. Other researchers have overcome this difficulty somewhat by using manually crafted knowledge resources to collect statistics on “concepts” rather than noun tokens, but have sacrificed domain-independence by doing so. We report here on work investigating the application of Latent Semantic Indexing [4], an Information Retrieval technique, to the task of noun compound disambiguation. We achieved an accuracy of 84%, indicating the potential of applying vectorbased distributional information measures to syntactic disambiguation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barker, K.: A trainable bracketer for noun modifiers. In Proceedings of the Twelfth Canadian Conference on Artificial Intelligence, pages 196–210, Vancouver, 1998.
Berry, M. W., Dumais, S. T., Letsche, T. A.: Computational methods for intelligent information access. In Proceedings of Supercomputing’ 95, San Diego, CA, 1995.
Burgess, C., Lund, K.: The dynamics of meaning in memory. In E. Dietrich and A. Markman, editors, Cognitive Dynamics: Conceptual and Representational Change in Humans and Machines, pages 17–56. Lawrence Erlbaum Associates Inc., Hillsdale, NJ, 1999.
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41:391–407, 1990.
Evans, D. A., Zhai, C.: Noun-phrase analysis in unrestricted text for information retrieval. In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, pages 17–24, Santa-Cruz, CA, June 1996.
Landauer, T. K., Dumais, S. T.: A solution to Plato’s problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104:211–240, 1997.
Lauer, M.: Designing Statistical Language Learners: Experiments on Noun Compounds. PhD thesis, Macquarie University, Sydney, Australia, 1995.
Levy, J.P., Bullinaria, J. A.: Learning lexical properties from word usage patterns: Which context words should be used? In Proceedings of the Sixth Neural Computation and Psychology Workshop, pages 273–282. London: Springer, 2001.
Lowe, W., McDonald, S.: The direct route: Mediated priming in semantic space. In Proceedings of the 22nd Annual Meeting of the Cognitive Science Society, pages 806–811. Lawrence Erlbaum Associates, 2000.
Marcus, M.: A Theory of Syntactic Recognition for Natural Language. MIT Press, Cambridge, MA, 1980.
Mason, O.: http://www.english.bham.ac.uk/staff/oliver/software/tagger/.
McDonald, S., Ramscar, M.: Testing the distributional hypothesis: The influence of context on judgements of semantic similarity. In Proceedings of the 23rd Annual Conference of the Cognitive Science Society, 2001.
Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Five papers on WordNet. Technical Report 43, Cognitive Science Laboratory, Princeton University, July 1990.
Pustejovsky, J., Bergler, S., Anick, P.: Lexical semantic techniques for corpus analysis. Computational Linguistics, 19(2):331–358, 1993.
Resnik, P. S.: Selection and Information: A Class-Based Approach to Lexical Relationships. PhD thesis, University of Pennsylvania, 1993.
Strzalkowski, T., Vauthey, B.: Information retrieval using robust natural language processing. In Proceedings of the 30th Annual Meeting of the Association for Computational Linguistics, pages 104–111, Newark, Delaware, USA, 1992.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Buckeridge, A.M., Sutcliffe, R.F. (2002). Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun Compound Disambiguation. In: O’Neill, M., Sutcliffe, R.F.E., Ryan, C., Eaton, M., Griffith, N.J.L. (eds) Artificial Intelligence and Cognitive Science. AICS 2002. Lecture Notes in Computer Science(), vol 2464. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45750-X_2
Download citation
DOI: https://doi.org/10.1007/3-540-45750-X_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44184-7
Online ISBN: 978-3-540-45750-3
eBook Packages: Springer Book Archive