Combining Supervised-Unsupervised Methods for Word Sense Disambiguation

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2002)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2276)

Abstract

This paper presents a method that combines two unsupervised methods (Specification Marks and Conceptual Density) with one supervised method (Maximum Entropy) for the automatic resolution of lexical ambiguity of nouns in English texts. The main objective is to improve the accuracy of the knowledge-based methods using statistical information supplied by the corpus-based method. We explore one way of combining the classification results of the three methods: voting, which merges their outputs into a single decision.

The three methods have been applied both individually and in combination to disambiguate a set of polysemous words. Our results show that combining different knowledge-based methods and adding statistical information from a corpus-based method may improve the accuracy of the knowledge-based methods.
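The voting step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how abstentions or ties are handled, so the function name `vote`, the use of `None` for a method that proposes no sense, and the `fallback_index` tie-breaking rule are all assumptions made for illustration.

```python
from collections import Counter
from typing import Optional, Sequence


def vote(predictions: Sequence[Optional[str]],
         fallback_index: int = 0) -> Optional[str]:
    """Combine the sense labels proposed by several WSD methods.

    predictions: one proposed sense per method; None means the method
        abstained (an assumption, not stated in the abstract).
    fallback_index: position of the method trusted on ties
        (hypothetical tie-breaking rule).
    """
    cast = [p for p in predictions if p is not None]
    if not cast:
        return None  # no method proposed a sense

    counts = Counter(cast)
    top = max(counts.values())
    winners = [sense for sense, n in counts.items() if n == top]
    if len(winners) == 1:
        return winners[0]  # a single sense has the most votes

    # Tie: prefer the designated method's answer if it is among the winners,
    # otherwise take the tied sense that was proposed first.
    preferred = predictions[fallback_index]
    return preferred if preferred in winners else winners[0]
```

With three methods, `vote(["plant#1", "plant#2", "plant#1"])` selects `"plant#1"` because two of the three agree. When all three disagree, the outcome depends entirely on the assumed fallback rule, which is where a real system would need a policy chosen from the individual methods' measured accuracy.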

This paper has been partially supported by the Spanish Government (CICYT) project number TIC2000-0664-C02-02.


Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Montoyo, A., Suárez, A., Palomar, M. (2002). Combining Supervised-Unsupervised Methods for Word Sense Disambiguation. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2002. Lecture Notes in Computer Science, vol 2276. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45715-1_13

  • DOI: https://doi.org/10.1007/3-540-45715-1_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43219-7

  • Online ISBN: 978-3-540-45715-2

  • eBook Packages: Springer Book Archive