Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun Compound Disambiguation

  • Conference paper
Artificial Intelligence and Cognitive Science (AICS 2002)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2464)

Abstract

Noun compounds are a frequently occurring yet highly ambiguous construction in natural language; their interpretation relies on extra-syntactic information. Several statistical methods for compound disambiguation have been reported in the literature; however, a striking feature of all these approaches is that disambiguation relies on statistics derived from unambiguous compounds in training, meaning they are prone to the problem of sparse data. Other researchers have overcome this difficulty somewhat by using manually crafted knowledge resources to collect statistics on “concepts” rather than noun tokens, but have sacrificed domain-independence by doing so. We report here on work investigating the application of Latent Semantic Indexing [4], an Information Retrieval technique, to the task of noun compound disambiguation. We achieved an accuracy of 84%, indicating the potential of applying vector-based distributional information measures to syntactic disambiguation.
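
The abstract does not describe the corpus, term weighting, or bracketing rule used, so the following is only a minimal sketch of the general idea under stated assumptions: Latent Semantic Indexing is approximated by a truncated SVD of a small, invented term-by-document count matrix, and a three-noun compound is bracketed with an assumed adjacency-style comparison of cosine similarities. The names `lsi_term_vectors`, `cosine`, and `bracket`, the toy vocabulary, and the counts are all hypothetical and are not taken from the paper.

```python
import numpy as np

# Toy term-by-document counts (rows: terms, columns: documents).
# These numbers are invented purely for illustration.
VOCAB = {"computer": 0, "science": 1, "department": 2, "store": 3}
COUNTS = np.array([
    [2.0, 1.0, 0.0, 0.0],  # computer
    [1.0, 2.0, 0.0, 0.0],  # science
    [0.0, 0.0, 3.0, 1.0],  # department
    [0.0, 0.0, 1.0, 3.0],  # store
])


def lsi_term_vectors(counts, k):
    """Return k-dimensional term vectors from a truncated SVD of `counts`."""
    # counts ~= U_k diag(s_k) V_k^T; each row of U_k * s_k represents one term.
    u, s, _ = np.linalg.svd(counts, full_matrices=False)
    return u[:, :k] * s[:k]


def cosine(a, b):
    """Cosine similarity of two vectors, defined as 0.0 if either is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b) / denom if denom > 0 else 0.0


def bracket(compound, vocab, vectors):
    """Bracket a three-noun compound (n1, n2, n3) as 'left' or 'right'.

    Assumed rule (not confirmed by the abstract): choose [[n1 n2] n3] when
    n1 is at least as strongly associated with n2 as n2 is with n3,
    otherwise choose [n1 [n2 n3]].
    """
    n1, n2, n3 = (vectors[vocab[word]] for word in compound)
    return "left" if cosine(n1, n2) >= cosine(n2, n3) else "right"


VECTORS = lsi_term_vectors(COUNTS, k=2)
print(bracket(("computer", "science", "department"), VOCAB, VECTORS))
print(bracket(("computer", "department", "store"), VOCAB, VECTORS))
```

On this toy matrix the first compound comes out left-branching and the second right-branching, because the reduced space places "computer" with "science" and "department" with "store". A realistic setup would need a large corpus, a weighting scheme, and a tuned number of dimensions, none of which are specified here.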

References

  1. Barker, K.: A trainable bracketer for noun modifiers. In Proceedings of the Twelfth Canadian Conference on Artificial Intelligence, pages 196–210, Vancouver, 1998.

  2. Berry, M. W., Dumais, S. T., Letsche, T. A.: Computational methods for intelligent information access. In Proceedings of Supercomputing ’95, San Diego, CA, 1995.

  3. Burgess, C., Lund, K.: The dynamics of meaning in memory. In E. Dietrich and A. Markman, editors, Cognitive Dynamics: Conceptual and Representational Change in Humans and Machines, pages 17–56. Lawrence Erlbaum Associates Inc., Hillsdale, NJ, 1999.

  4. Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41:391–407, 1990.

  5. Evans, D. A., Zhai, C.: Noun-phrase analysis in unrestricted text for information retrieval. In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, pages 17–24, Santa Cruz, CA, June 1996.

  6. Landauer, T. K., Dumais, S. T.: A solution to Plato’s problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104:211–240, 1997.

  7. Lauer, M.: Designing Statistical Language Learners: Experiments on Noun Compounds. PhD thesis, Macquarie University, Sydney, Australia, 1995.

  8. Levy, J. P., Bullinaria, J. A.: Learning lexical properties from word usage patterns: Which context words should be used? In Proceedings of the Sixth Neural Computation and Psychology Workshop, pages 273–282. London: Springer, 2001.

  9. Lowe, W., McDonald, S.: The direct route: Mediated priming in semantic space. In Proceedings of the 22nd Annual Meeting of the Cognitive Science Society, pages 806–811. Lawrence Erlbaum Associates, 2000.

  10. Marcus, M.: A Theory of Syntactic Recognition for Natural Language. MIT Press, Cambridge, MA, 1980.

  11. Mason, O.: http://www.english.bham.ac.uk/staff/oliver/software/tagger/.

  12. McDonald, S., Ramscar, M.: Testing the distributional hypothesis: The influence of context on judgements of semantic similarity. In Proceedings of the 23rd Annual Conference of the Cognitive Science Society, 2001.

  13. Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Five papers on WordNet. Technical Report 43, Cognitive Science Laboratory, Princeton University, July 1990.

  14. Pustejovsky, J., Bergler, S., Anick, P.: Lexical semantic techniques for corpus analysis. Computational Linguistics, 19(2):331–358, 1993.

  15. Resnik, P. S.: Selection and Information: A Class-Based Approach to Lexical Relationships. PhD thesis, University of Pennsylvania, 1993.

  16. Strzalkowski, T., Vauthey, B.: Information retrieval using robust natural language processing. In Proceedings of the 30th Annual Meeting of the Association for Computational Linguistics, pages 104–111, Newark, Delaware, USA, 1992.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Buckeridge, A.M., Sutcliffe, R.F. (2002). Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun Compound Disambiguation. In: O’Neill, M., Sutcliffe, R.F.E., Ryan, C., Eaton, M., Griffith, N.J.L. (eds) Artificial Intelligence and Cognitive Science. AICS 2002. Lecture Notes in Computer Science, vol 2464. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45750-X_2

  • DOI: https://doi.org/10.1007/3-540-45750-X_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44184-7

  • Online ISBN: 978-3-540-45750-3

  • eBook Packages: Springer Book Archive