Improving Feature Selection for Maximum Entropy-Based Word Sense Disambiguation

Suárez, Armando; Palomar, Manuel

doi:10.1007/3-540-45433-0_4

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2389)

Included in the following conference series:

International Conference for Natural Language Processing in Portugal

Abstract

This paper presents an evaluation of several feature selections for word sense disambiguation. The method used to classify each linguistic context into its correct sense is based on maximum entropy probability models. Several types of features have been analyzed for a few words selected from the DSO corpus in order to study the relevance of each feature type for each word. An improved definition of features, intended to increase efficiency, is presented as well.
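
As an illustration only, the following minimal sketch shows the general form of a conditional maximum entropy classifier for word senses, p(sense | context) = exp(sum of weights of active features) / Z(context). It is not the authors' implementation: the feature definition (context word paired with sense), the gradient-ascent training loop, the function names, and the toy contexts for the word "interest" are all assumptions made for this example and are not taken from the paper or from the DSO corpus.

```python
import math
from collections import defaultdict

def features(context, sense):
    """Hypothetical binary features: each context word paired with a candidate sense."""
    return [(word, sense) for word in set(context)]

def probabilities(weights, context, senses):
    """p(sense | context) = exp(sum of active feature weights) / Z(context)."""
    scores = {s: math.exp(sum(weights[f] for f in features(context, s)))
              for s in senses}
    z = sum(scores.values())
    return {s: v / z for s, v in scores.items()}

def train(examples, senses, epochs=200, rate=0.5):
    """Plain gradient ascent on the conditional log-likelihood (no smoothing)."""
    weights = defaultdict(float)
    for _ in range(epochs):
        gradient = defaultdict(float)
        for context, gold in examples:
            p = probabilities(weights, context, senses)
            for s in senses:
                for f in features(context, s):
                    # Observed feature count minus expected count under the model.
                    gradient[f] += (1.0 if s == gold else 0.0) - p[s]
        for f, g in gradient.items():
            weights[f] += rate * g / len(examples)
    return weights

# Toy, invented training contexts (assumption: not real corpus data).
EXAMPLES = [
    (["bank", "rate", "loan"], "money"),
    (["rate", "percent", "pay"], "money"),
    (["hobby", "music", "great"], "attention"),
    (["great", "topic", "music"], "attention"),
]
SENSES = ["money", "attention"]

weights = train(EXAMPLES, SENSES)
p = probabilities(weights, ["loan", "percent"], SENSES)
best = max(p, key=p.get)
print(best, p)
```

In a sketch like this, changing what `features` returns (for example, adding collocations or part-of-speech tags) is the point at which different feature selections would be compared.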

This paper has been partially supported by the Spanish Government (CICYT) under project number TIC2000-0664-C02-02.

Author information

Authors and Affiliations

Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Alicante, Spain
Armando Suárez & Manuel Palomar

Editor information

Editors and Affiliations

Universidade de Lisboa e CAUTL (IST), Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Elisabete Ranchhod
L2F/INESC ID Lisboa, Technical University of Lisbon, Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Nuno J. Mamede

About this paper

Cite this paper

Suárez, A., Palomar, M. (2002). Improving Feature Selection for Maximum Entropy-Based Word Sense Disambiguation. In: Ranchhod, E., Mamede, N.J. (eds) Advances in Natural Language Processing. PorTAL 2002. Lecture Notes in Computer Science, vol 2389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45433-0_4

DOI: https://doi.org/10.1007/3-540-45433-0_4
Published: 21 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43829-8
Online ISBN: 978-3-540-45433-5
eBook Packages: Springer Book Archive
