Abstract
In this paper, an evaluation of several feature selections for word sense disambiguation is presented. The method used to classify linguistic contexts in its correct sense is based on maximum entropy probability models. In order to study their relevance for each word, several types of features have been analyzed for a few words selected from the DSO corpus. An improved definition of features in order to increase efficiency is presented as well.
This paper has been partially supported by the Spanish Government (CICYT) under project number TIC2000-0664-C02-02.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
SENSEVAL-2: Second international workshop on evaluating word sense disambiguation systems: system descriptions. http://www.sle.sharp.co.uk/senseval2/ (2001)
Yarowsky, D.: Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34 (2000)
Brill, E.: Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computational Linguistics 21 (1995) 543–565
Florian, R., Ngai, G.: Multidimensional transformation-based learning. In Daelemans, W., Zajac, R., eds.: Proceedings of CoNLL-2001, Toulouse, France (2001) 1–8
Mihalcea, R., Moldovan, D.: An iterative approach to word sens disambiguation. In: Proceedings of FLAIRS-2000, Orlando, FL (2000) 219–223
Seo, H.C., Lee, S.Z., Rim, H.C.: Classification information model. http://nlp.korea.ac.kr/ hcseo/senseval2/cim.htm (2001)
Pedersen, T.: A baseline methodology for word sense disambiguation. In [17] 126–135
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts (1999)
Ratnaparkhi, A.: Maximum Entropy Models for Natural Language Ambiguity Resolution. PhD thesis, University of Pennsylvania (1998)
Lin, D.: Dependency-based evaluation of minipar. In: Proceedings of the Workshop on the Evaluation of Parsing Systems, First International Conference on Language Resources and Evaluation, Granada, Spain (1998)
Ng, H.T., Lee, H.B.: Integrating multiple knowledge sources to disambiguate word senses: An exemplar-based approach. In Joshi, A., Palmer, M., eds.: Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics, San Francisco, Morgan Kaufmann Publishers (1996)
Escudero, G., Màrquez, L., Rigau, G.: Boosting applied to word sense disambiguation. In: Proceedings of the 12th Conference on Machine Learning ECML2000, Barcelona, Spain (2000)
Suárez, A., Palomar, M.: Feature selection analysis for maximum entropy-based wsd. In [17] 146–155
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Five Papers on WordNet. Special Issue of the International journal of lexicography 3 (1993)
Tapanainen, P., Järvinen, T.: A non-projective dependency parser. In: Proceedings of the Fifth Conference on Applied Natural Language Processing. (1997) 64–71
Magnini, B., Strapparava, C.: Experiments in Word Domain Disambiguation for Parallel Texts. In: Proceedings of the ACL Workshop on Word Senses and Multilinguality, Hong Kong, China (2000)
Gelbukh, A., ed.: Proceedings of 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2002). In Gelbukh, A., ed.: Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science, Mexico City, Springer-Verlag (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Suárez, A., Palomar, M. (2002). Improving Feature Selection for Maximum Entropy-Based Word Sense Disambiguation. In: Ranchhod, E., Mamede, N.J. (eds) Advances in Natural Language Processing. PorTAL 2002. Lecture Notes in Computer Science(), vol 2389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45433-0_4
Download citation
DOI: https://doi.org/10.1007/3-540-45433-0_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43829-8
Online ISBN: 978-3-540-45433-5
eBook Packages: Springer Book Archive