Abstract
Nowadays, the need of advanced free text filtering is increasing. Therefore, when searching for specific keywords, it is desirable to eliminate occurrences where the word or words are used in an inappropriate sense. This task could be exploited in internet browsers, and resource discovery systems, relational databases containing free text fields, electronic document management systems, data warehouse and data mining systems, etc. In order to resolve this problem in this paper a method for the automatic disambiguating of nouns, using the notion of Specification Marks and the noun taxonomy of the WordNet lexical knowledge base [8] is presented. This method is applied to a Natural Language Processing System (NLP). The method resolves the lexical ambiguity of nouns in any sort of text, and although it relies on the semantics relations (Hypernymy/Hyponymy) and the hierarchic organization of WordNet. However, it does not require any sort of training process, no hand-coding of lexical entries, nor the hand-tagging of texts. An evaluation of the method was done on both the Semantic Concordance Corpus (Semcor)[9], and on Microsoft’s electronic encyclopaedia („Microsoft 98 Encarta Encyclopaedia Deluxe“). The percentage of correct resolutions achieved with these two corpora were: Semcor 65.8% and Microsoft 65.6%. This percentages show that successful results with different domain corpus have been obtained, so our proposed method can be applied successfully on any corpus.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre E. and Rigau G. (1996) Word Sense Disambiguation using Conceptual Density. Proc. 16th International Conference on COLING. Copenhagen.
Cowie J., Guthrie J. and Guthrie L. (1992) Lexical disambiguation using simulated annealing. Proc. DARPA Workshop on Speech and Natural Language. 238–242. New York.
Hale, Michael L. Mc. A comparison of WordNet and Roget’s taxonomy for measuring semantic similarity.
Ide N. and Véronis J. (1998) Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics. 24 (1), 1–40.
Lesk, M. (1986) Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proc. 1986 SIGDOC Conference, ACM 24–26, New York.
McRoy S. (1992) Using Multiple Knowledge Sources for Word Sense Discrimination. Computational Linguistics 18 (1).
Mihalcea R. and Moldovan D. (1999) A Method for word sense disambiguation of unrestricted text. Proc. 37th Annual Meeting of the ACL 152–158, Maryland, USA.
Miller G. A., Beckwith R., Fellbaum C., Gross D., and Miller K. J. (1990) WordNet: An online lexical database. International Journal of Lexicography, 3(4): 235–244.
Miller G., Leacock C., Randee T. and Bunker R. (1993) A Semantic Concordance. Proc. 3rd DARPA Workshop on Human Language Tecnology, 303–308, Plainsboro, New Jersey.
Resnik P. (1995) Disambiguating noun groupings with respect to WordNet senses. Proc. Third Workshop on Very Large Corpora. 54–68.Cambridge, MA.
Resnik P. and Yarowsky D. (1997) A perspective on word sense disambiguation methods and their evaluation. Proc. ACL Siglex Wordshop on Tagging Text with Lexical Semantics, why, what and how?, Washington DC.
Resnik P. (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural lenguage. In Journal of Artificial Intelligence Research 11. 95–130.
Rigau G., Atserias J. and Agirre E. (1997) Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation. Proc. 35th Annual Meeting of the ACL, 48–55, Madrid, Spain.
Slator B. and Wilks Y. (1987) Towards semantic structures from dictionary entries. Proc. 2nd Annual Rocky Mountain Conference on Artificial Inteligence, 85–96. Boulder, CO.
Stetina J., Kurohashi S. and Nagao M. (1998) General word sense disambiguation method based on full sentencial context. In Usage of WordNet in Natural Language Processing. COLING-ACL Workshop, Montreal, Canada.
Sussna M. (1993) Word sense disambiguation for free-text indexing using a massive semantic network. Proc. Second International CIKM, 67-74, Airlington, VA.
Voorhees E. (1993) Using WordNet to disambiguation word senses for text retrieval. Proc. 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 171–180, Pittsburgh, PA.
Wilks Y., Fass D., Guo C., McDonal J., Plate T. and Slator B. (1993) Providing Machine Tractablle Dictionary Tools. In Semantics and the lexicon (Pustejowsky J. Ed.) 341–401.
Wilks Y. And Stevenson M. (1996) The grammar of sense: Is word sense tagging much more than part-of-speech tagging? Technical Report CS-96-05, University of Sheffield, UK.
Yarowsky D. (1992) Word Sense disambiguation using statistical models of Roget’s categories trainined on large corpora. Proc. 14th COLING, 454–460, Nantes, France.
Yarowsky, D. (1995) Unsupervised word Sense disambiguation rivaling supervised methods. Proc. 32nd Annual Meeting of the ACL.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Montoyo, A., Palomar, M. (2001). WSD Algorithm Applied to a NLP System. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2000. Lecture Notes in Computer Science, vol 1959. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45399-7_5
Download citation
DOI: https://doi.org/10.1007/3-540-45399-7_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41943-3
Online ISBN: 978-3-540-45399-4
eBook Packages: Springer Book Archive