Skip to main content

Word Sense vs. Word Domain Disambiguation: A Maximum Entropy Approach

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Abstract

In this paper, a supervised learning system of word sense disambiguation is presented. It is based on conditional maximum entropy models. This system acquires the linguistic knowledge from an annotated corpus and this knowledge is represented in the form of features. The system were evaluated both using WordNet’s senses and domains as the sets of classes of each word. Domain labels are obtained from the enrichment ofWordNet with subject field codes which produces a polysemy reduction. Several types of features has been analyzed for a few words selected from the DSO corpus. Using the domain enrichment of WordNet, a 7% of accuracy improvement is achieved.

This paper has been partially supported by the Spanish Government (CICYT) under project number TIC2000-0664-C02-02.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pedersen, T.: A decision tree of bigrams is an accurate predictor of word sense. In: Proceedings of the Second Annual Meeting of the North American Chapter of the Association for Computational Linguistics, Pittsburgh (2001) 79–86.

    Google Scholar 

  2. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts (1999).

    MATH  Google Scholar 

  3. Preiss, J., Yarowsky, D., eds.: Proceedings of SENSEVAL-2. In Preiss, J., Yarowsky, D., eds.: Proceedings of the 2nd International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, ACL-SIGLEX (2001).

    Google Scholar 

  4. Yarowsky, D.: Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34 (2000) 179–186.

    Article  Google Scholar 

  5. Escudero, G., Màrquez, L., Rigau, G.: Boosting applied to word sense disambiguation. In: Proceedings of the 12th Conference on Machine Learning ECML2000, Barcelona, Spain (2000).

    Google Scholar 

  6. Pedersen, T.: A baseline methodology for word sense disambiguation. [18] 126–135.

    Google Scholar 

  7. García-Varea, I., Och, F.J., Ney, H., Casacuberta, F.: Refined lexicon models for statistical machine translation using a maximum entropy approach. In: Proceedings of 39th Annual Meeting of the Association for Computational Linguistics. (2001) 204–211.

    Google Scholar 

  8. Magnini, B., Strapparava, C.: Experiments in Word Domain Disambiguation for Parallel Texts. In: Proceedings of the ACL Workshop on Word Senses and Multilinguality, Hong Kong, China (2000).

    Google Scholar 

  9. Magnini, B., Strapparava, C., Pezzulo, G., Gliozzo, A.: Using Domain Information forWord Sense Disambiguation. [3] 111–114.

    Google Scholar 

  10. Montoyo, A., Palomar, M., Rigau, G.: WordNet Enrichment with Classification Systems. In Preiss, J., Yarowsky, D., eds.: Proceedings of NAACL Workshop WordNet and Other Lexical Resources: Applications, Extensions and Customizations, Pittsburgh, PA, USA (2001).

    Google Scholar 

  11. Ratnaparkhi, A.: Maximum Entropy Models for Natural Language Ambiguity Resolution. Ph.D. thesis, University of Pennsylvania (1998).

    Google Scholar 

  12. Suárez, A., Palomar, M.: Feature selection analysis for maximum entropy-based wsd. [18] 146–155.

    Google Scholar 

  13. Ng, H.T., Lee, H.B.: Integrating multiple knowledge sources to disambiguate word senses: An exemplar-based approach. In Joshi, A., Palmer, M., (Eds.): Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics, San Francisco, Morgan Kaufmann Publishers (1996).

    Google Scholar 

  14. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Five Papers onWordNet. Special Issue of the International journal of lexicography 3 (1993).

    Google Scholar 

  15. Daude, J., Padro, L., Rigau, G.: Mapping wordnets using structural information. In: Proceedings of the 38th Anual Meeting of the Association for Computational Linguistics (ACL 2000), Hong Kong (2000).

    Google Scholar 

  16. Lin, D.: Dependency-based evaluation of minipar. In: Proceedings of the Workshop on the Evaluation of Parsing Systems, First International Conference on Language Resources and Evaluation, Granada, Spain (1998).

    Google Scholar 

  17. Dietterich, T.G.: Approximate statistical test for comparing supervised classification learning algorithms. Neural Computation 10 (1998) 1895–1923.

    Article  Google Scholar 

  18. Gelbukh, A.F.: Computational Linguistics and Intelligent Text Processing, Third International Conference, CICLing 2002, Mexico City, Mexico, February 17–23, 2002, Proceedings. In: Gelbukh, A.F., (Ed.): CICLing. Volume 2276 of Lecture Notes in Computer Science, Springer (2002).

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Suárez, A., Palomar, M. (2002). Word Sense vs. Word Domain Disambiguation: A Maximum Entropy Approach. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_17

Download citation

  • DOI: https://doi.org/10.1007/3-540-46154-X_17

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics