Skip to main content

Improving Feature Selection for Maximum Entropy-Based Word Sense Disambiguation

  • Conference paper
  • First Online:
Advances in Natural Language Processing (PorTAL 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2389))

Included in the following conference series:

  • 480 Accesses

Abstract

In this paper, an evaluation of several feature selections for word sense disambiguation is presented. The method used to classify linguistic contexts in its correct sense is based on maximum entropy probability models. In order to study their relevance for each word, several types of features have been analyzed for a few words selected from the DSO corpus. An improved definition of features in order to increase efficiency is presented as well.

This paper has been partially supported by the Spanish Government (CICYT) under project number TIC2000-0664-C02-02.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. SENSEVAL-2: Second international workshop on evaluating word sense disambiguation systems: system descriptions. http://www.sle.sharp.co.uk/senseval2/ (2001)

  2. Yarowsky, D.: Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34 (2000)

    Google Scholar 

  3. Brill, E.: Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computational Linguistics 21 (1995) 543–565

    Google Scholar 

  4. Florian, R., Ngai, G.: Multidimensional transformation-based learning. In Daelemans, W., Zajac, R., eds.: Proceedings of CoNLL-2001, Toulouse, France (2001) 1–8

    Google Scholar 

  5. Mihalcea, R., Moldovan, D.: An iterative approach to word sens disambiguation. In: Proceedings of FLAIRS-2000, Orlando, FL (2000) 219–223

    Google Scholar 

  6. Seo, H.C., Lee, S.Z., Rim, H.C.: Classification information model. http://nlp.korea.ac.kr/ hcseo/senseval2/cim.htm (2001)

  7. Pedersen, T.: A baseline methodology for word sense disambiguation. In [17] 126–135

    Chapter  Google Scholar 

  8. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts (1999)

    MATH  Google Scholar 

  9. Ratnaparkhi, A.: Maximum Entropy Models for Natural Language Ambiguity Resolution. PhD thesis, University of Pennsylvania (1998)

    Google Scholar 

  10. Lin, D.: Dependency-based evaluation of minipar. In: Proceedings of the Workshop on the Evaluation of Parsing Systems, First International Conference on Language Resources and Evaluation, Granada, Spain (1998)

    Google Scholar 

  11. Ng, H.T., Lee, H.B.: Integrating multiple knowledge sources to disambiguate word senses: An exemplar-based approach. In Joshi, A., Palmer, M., eds.: Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics, San Francisco, Morgan Kaufmann Publishers (1996)

    Google Scholar 

  12. Escudero, G., Màrquez, L., Rigau, G.: Boosting applied to word sense disambiguation. In: Proceedings of the 12th Conference on Machine Learning ECML2000, Barcelona, Spain (2000)

    Google Scholar 

  13. Suárez, A., Palomar, M.: Feature selection analysis for maximum entropy-based wsd. In [17] 146–155

    Chapter  Google Scholar 

  14. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Five Papers on WordNet. Special Issue of the International journal of lexicography 3 (1993)

    Google Scholar 

  15. Tapanainen, P., Järvinen, T.: A non-projective dependency parser. In: Proceedings of the Fifth Conference on Applied Natural Language Processing. (1997) 64–71

    Google Scholar 

  16. Magnini, B., Strapparava, C.: Experiments in Word Domain Disambiguation for Parallel Texts. In: Proceedings of the ACL Workshop on Word Senses and Multilinguality, Hong Kong, China (2000)

    Google Scholar 

  17. Gelbukh, A., ed.: Proceedings of 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2002). In Gelbukh, A., ed.: Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science, Mexico City, Springer-Verlag (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Suárez, A., Palomar, M. (2002). Improving Feature Selection for Maximum Entropy-Based Word Sense Disambiguation. In: Ranchhod, E., Mamede, N.J. (eds) Advances in Natural Language Processing. PorTAL 2002. Lecture Notes in Computer Science(), vol 2389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45433-0_4

Download citation

  • DOI: https://doi.org/10.1007/3-540-45433-0_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43829-8

  • Online ISBN: 978-3-540-45433-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics