Making Sense About Sense

Ide, Nancy; Wilks, Yorick

doi:10.1007/978-1-4020-4809-8_3

Nancy Ide⁵ &
Yorick Wilks⁶

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 33))

943 Accesses
26 Citations

We suggest that the standard fine-grained division of senses and (larger) homographs by a lexicographer for use by a human reader may not be an appropriate goal for the computational WSD task. We argue that the level of sense-discrimination that natural language processing (NLP) needs corresponds roughly to homographs, though we discuss psycholinguistic evidence that there are broad sense divisions with some etymological derivation (i.e., non-homographic) that are as distinct for humans as homographic ones and they may be part of the broad class of sense-divisions we seek to identify here. We link this discussion to the observation that major NLP tasks like machine translation (MT) and information retrieval (IR) seem not to need independent WSD modules of the sort produced in the Research field, even though they are undoubtedly doing WSD by other means. Our conclusion is that WSD should continue to focus on these broad discriminations, at which it can do very well, thereby possibly offering the close-to-100% success that most NLP seemingly requires, with the possible exception of very fine questions of target word choice in MT. This proposal can be seen as reorienting WSD to what it can actually perform at the standard success levels, but we argue that this, rather than some more idealized vision of sense inherited from lexicography, is what humans and machines can reliably discriminate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Anderson, Richard C. & Andrew Ortony. 1975. On putting apples into bottles - A problem of polysemy. Cognitive Psychology, 7: 167-180.
Article Google Scholar
Antal, László. 1965. Content, Meaning and Understanding. The Hague: Mouton.
Google Scholar
Bird, Steven, David Day, John Garofolo, John Henderson, Christophe Laprun & Mark Liberman. 2000. ATLAS: A flexible and extensible architecture for linguistic annotation. Proceedings of the Second International Language Resources and Evaluation Conference (LREC), May, 2000, Athens, Greece, 1699-1706.
Google Scholar
Brown, Peter F., John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Frederick Jelinek, John Lafferty, Robert Mercer & Paul Roosin. 1990. A statistical approach to machine translation. Computational Linguistics, 16(2): 79-85.
Google Scholar
Calzolari, Nicoletta, Claudia Soria, Francesca Bertagna, & Francesco Barsotti. 2002. Evaluating lexical resources using Senseval. Natural Language Engineering, 8(4): 375-390.
Article Google Scholar
Caramazza, Alfonso & Ellen Grober. 1976. Polysemy and the structure of the subjective lexicon. Semantics: Theory and application (27th Georgetown University Round Table on Languages and Linguistics), ed. by Clea Rameh, 181-206. Washington, DC: Georgetown University Press.
Google Scholar
Chen, Jen Nan & Jason S. Chang. 1998. Topical clustering of MRD senses based on information retrieval techniques. Computational Linguistics, 24(1): 61-95.
Google Scholar
Church, Kenneth W. & Patrick Hanks. 1990. Word association norms, mutual in- formation, and lexicography. Computational Linguistics, 16(1): 22-29.
Google Scholar
Cowie, James, Joe Guthrie & Louise Guthrie. 1992. Lexical disambiguation using simulated annealing. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 359-365.
Chapter Google Scholar
Cruse, David. 1986. Lexical semantics. Cambridge: Cambridge University Press. Cunningham, Hamish. 2002. GATE, A general architecture for text engineering. Computers and the Humanities, 36(2): 223-254.
Google Scholar
Dagan, Ido & Alon Itai. 1994. Word sense disambiguation using a second language monolingual corpus. Computational Linguistics, 20(4): 563-596.
Google Scholar
Diab, Mona & Philip Resnik. 2002. An unsupervised method for word sense tagging using parallel corpora. Proceedings of the 40th Meeting of the Association for Computational Linguistics (ACL), Philadelphia, U.S.A., 255-262.
Google Scholar
Dolan, William. 1994. Word sense ambiguation: Clustering related senses. Proceedings of the 14th International Conference on Computational Linguistics (COLING-94), Kyoto, Japan, 712-716.
Google Scholar
Durkin, Kevin & Jocelyn Manning. 1989. Polysemy and the subjective lexicon: Semantic relatedness and the salience of intraword senses. Journal of Psycholinguistic Research, 18: 577-612.
Article Google Scholar
Dyvik, Helge. 1998. Translations as semantic mirrors. Proceedings of the ECAI Workshop on Multilinguality in the Lexicon II, Brighton, U.K., 24-44.
Google Scholar
Dyvik, Helge. 2004. Translations as semantic mirrors: From parallel corpus to Wordnet. Language and Computers, 1: 311-326.
Google Scholar
Edmonds, Philip & Graeme Hirst. 2002. Near-synonymy and lexical choice. Computational Linguistics, 28(2): 105-144.
Article Google Scholar
Edmonds, Philip & Adam Kilgarriff. 2002. Introduction to the special issue on evaluating word sense disambiguation systems. Journal of Natural Language Engineering, 8(4): 279-291.
Article Google Scholar
Hanks, Patrick. 1994. personal communication.
Google Scholar
Hanks, Patrick. 2000. Do word meanings exist? Computers and the Humanities, 34(2): 205-215.
Article Google Scholar
Hanks, Patrick. 2003. WordNet: What is to be done? Panel presentation at Prague Workshop on Lexico-Semantic Classification and Tagging Linguistic and Knowledge-Based Foundations, Existing Schemes and Taxonomies, and Possible Applications, Prague, Czech.
Google Scholar
Hanks, Patrick & James Pustejovsky. 2005. A pattern dictionary for natural language processing. Revue Française de Linguistique Appliquée, 10(2).
Google Scholar
Heine, Bernd. 1992. Grammaticalization chains. Studies in Language, 16: 335-368.
Google Scholar
Ide, Nancy. 1998. Cross-lingual sense determination: Can it work?. Computers and the Humanities, 34(1-2): 223-34.
Google Scholar
Ide, Nancy, Tomaz Erjavec & Dan Tufiú. 2001. Automatic sense tagging using parallel corpora. Proceedings of the 6th Natural Language Processing Pacific Rim Symposium, Tokyo, Japan, 212-219.
Google Scholar
Ide, Nancy, Tomaz Erjavec & Dan Tufiú. 2002. Sense discrimination with parallel corpora. Proceedings of the ACL SIGLEX Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, U.S.A., 56-60.
Google Scholar
Ide, Nancy & Jean Véronis. 1990. Mapping dictionaries: A spreading activation approach. Proceedings of the 6th Annual Conference of the Centre for the New Oxford English Dictionary, Waterloo, Canada, 52-64.
Google Scholar
Ide, Nancy & Jean Véronis. 1993. Extracting knowledge bases from machinereadable dictionaries: Have we wasted our time? Proceedings of the First International Conference on Building and Sharing of Very Large-Scale Knowledge Bases (KB&KS), Tokyo, Japan, 257-266.
Google Scholar
Ide, Nancy & Jean Véronis. 1994. MULTEXT: Multilingual text tools and corpora. Proceedings of the 15th International Conference on Computational Linguistics (COLING), Kyoto, Japan, 588-592.
Google Scholar
Ide, Nancy & Jean Véronis. 1998. Word sense disambiguation: The state of the art. Computational Linguistics, 24(1): 1-40.
Google Scholar
Kilgarriff, Adam. 1993. Dictionary word sense distinctions: An enquiry into their nature. Computers and the Humanities, 26: 356-387
Google Scholar
Kilgarriff, Adam. 1997. “I don’t believe in word senses”. Computers and the Humanities, 31(2): 91-113.
Article Google Scholar
Kilgarriff, Adam & David Tugwell. 2001. WASP-Bench: An MT lexicographers’ workstation supporting state-of-the-art lexical disambiguation. Proceedings of 7 ^th Machine Translation Summit, Santiago de Compostela, Spain, 187-190.
Google Scholar
Klein, Devorah & Gregory Murphy. 2001. The representation of polysemous words. Journal of Memory and Language, 45: 259-82.
Article Google Scholar
Klein, Devorah & Gregory Murphy. 2002. Paper has been my ruin: Conceptual reations of polysemous senses. Journal of Memory and Language, 47: 548-70.
Article Google Scholar
Krovetz, Robert & Bruce Croft. 1992. Lexical ambiguity and information retrieval. ACM Transactions on Information Systems (TOIS), 10(2): 115-141.
Article Google Scholar
Lakoff, George. 1987. Women, Fire, and Dangerous Things. Chicago: University of Chicago Press.
Google Scholar
Malt, Barbara C., Steven A. Sloman, Silvia Gennari, Meiyi Shi & Yuan Wang. 1999. Knowing vs. naming: Similarity and the linguistic categorization of artifacts. Journal of Memory and Language, 40: 230-262.
Article Google Scholar
McKelvie, David, Chris Brew & Henry Thompson. 1998. Using SGML as a basis for data-intensive natural language processing. Computers and the Humanities, 31(5): 367-388.
Google Scholar
Ng, Hwee Tou, Bin Wang & Yee Seng Chan. 2003. Exploiting parallel texts for word sense disambiguation: An empirical study. Proceedings of the 41 ^st Annual Meeting of the Association for Computational Linguistics (ACL), Sapporo, Japan, 455-462.
Chapter Google Scholar
Nunberg, Geoffrey. 1979. The non-uniqueness of semantic solutions: Polysemy. Linguistics and Philosophy, 3: 143-184.
Article Google Scholar
Olney, John, Carter Revard & Panl Ziff. 1966. Some Monsters in Noah’s Ark. Research Memorandum, Systems Development Corp., Santa Monica, U.S.A.
Google Scholar
Palmer, Martha, Christiane Fellbaum & Hoa Trang Dang. 2006. Making finegrained and coarse-grained sense distinctions, both manually and automatically. Natural Language Engineering, 12(3).
Google Scholar
Peters, Wim, Piek Vossen, Pedro Diez-Orzas & Geert Adrians. 1998. Crosslinguistic alignment of wordnets with an inter-lingual index. Computers and the Humanities, 32(2-3): 221-51.
Article Google Scholar
Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. Harlow, UK: Longman Group.
Google Scholar
Pustejovsky, James. 1995. The Generative Lexicon. Cambridge, U.S.A.: MIT Press.
Google Scholar
Resnik, Philip & David Yarowsky. 1997a. Distinguishing systems and distinguishing senses: New evaluation methods for word sense disambiguation. Natural Language Engineering, 5(2): 113-133.
Google Scholar
Resnik, Philip & David Yarowsky. 1997b. A perspective on word sense disambiguation methods and their evaluation. Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What, and How?, Washington, U.S.A, 79-86.
Google Scholar
Resnik, Philip & David Yarowsky. 2000. Distinguishing systems and distinguishing senses: New evaluation methods for word sense disambiguation”. Natural Language Engineering, 5(2): 113-133.
Google Scholar
Rodd, Jennifer, M. Gareth Gaskell & William Marslen-Wilson. 2002. Making sense of semantic ambiguity: Semantic competition in lexical access. Journal of Memory and Language, 46: 245-266.
Article Google Scholar
Rodd, Jennifer, M. Gareth Gaskell & William Marslen-Wilson. 2004. Modelling the effects of semantic ambiguity in word recognition. Cognitive Science, 28: 89-104.
Article Google Scholar
Ruhl, Charles. 1989. On Monosemy: A Study in Linguistic Semantics. Albany: State University of New York Press.
Google Scholar
Sanderson, Mark. 1994. Word sense disambiguation and information retrieval. Proceedings of the 17th ACM Special Interest Group on Information Retrieval Conference (SIGIR), 142-151.
Google Scholar
Schütze. Hinrich. 1998. Automatic word sense discrimination. Computational Linguistics, 24(1): 97-124.
Google Scholar
Sparck Jones, Karen. 1986/1964. Synonymy and Semantic Classification. Edinburgh: Edinburgh University Press.
Google Scholar
Stevenson, Mark & Paul Clough. 2004. EuroWordNet as a resource for crosslanguage information retrieval. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC), Lisbon, Portugal, 777-780.
Google Scholar
Stevenson, Mark & Yorick Wilks. 1999. Combining weak knowledge sources for sense disambiguation. Proceedings of the International Joint Conference for Artificial Intelligence (IJCAI), Stockholm, Sweden, 884-889.
Google Scholar
Tufiú, Dan, Radu Ion & Nancy Ide. 2004. Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering, and aligned WordNets. Proceedings of the 20 ^th International Conference on Computational Linguistics (COLING), Geneva, Switzerland, 1312-1318.
Google Scholar
Voorhees, Ellen. 1999. Natural language processing and information retrieval. Information Extraction: Towards Scalable, Adaptable Systems, ed. by Maria Teresa Pazienza, 32-48. Germany: Springer.
Google Scholar
Vossen, Piek. 2001. Extending, trimming and fusing WordNet for technical documents. Proceedings of the NAACL Workshop on WordNet and Other Lexical Resources Applications, Extensions and Customizations, Pittsburgh, U.S.A.
Google Scholar
Vossen, Piek, ed. 1998. EuroWordNet: A Multilingual Database With Lexical Semantic Networks. Amsterdam: Kluwer.
Google Scholar
Wierzbicka, Anna. 1989. Semantic primitives and lexical universals. Quaderni di Semantica, X(1): 103-121.
Google Scholar
Wilks, Yorick. 1972. Grammar, Meaning and the Machine Analysis of Language. London and Boston: Routledge.
Google Scholar
Wilks, Yorick & Roberta Catizone. 2002. What is lexical tuning?. Journal of Semantics, 19(2): 167-190.
Article Google Scholar
Wilks, Yorick & Mark Stevenson. 1998. The grammar of sense: Using partof-speech tags as a first step in semantic disambiguation. Natural Language Engineering, 4(2): 74-87.
Google Scholar
Wilks, Yorick, Brian Slator & Louise Guthrie. 1996. Electric Words: Dictionaries, Computers and Meanings. Cambridge, U.S.A.: MIT Press.
Google Scholar
Yarowsky, David. 2000. Hierarchical decision lists for word sense disambiguation. Computers and the Humanities, 34(2): 179-186.
Article Google Scholar
Zgusta, Ladislav. 1971. Manual of Lexicography. The Hague: Mouton.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Vassar College, 124 Raymond Avenue, 12604-0520, Poughkeepsie, New York, USA
Professor Nancy Ide
Department of Computer Science, University of Sheffield, S1 4DP, Sheffield, UK
Professor Yorick Wilks

Authors

Professor Nancy Ide
View author publications
You can also search for this author in PubMed Google Scholar
Professor Yorick Wilks
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of the Basque Country, Manuel de Lardizabal 1, E-20018, Donostia, Basque Country, Spain
Eneko Agirre
Sharp Laboratories of Europe Limited, Oxford Science Park, OX4 4GB, Oxford, UK
Philip Edmonds

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ide, N., Wilks, Y. (2007). Making Sense About Sense. In: Agirre, E., Edmonds, P. (eds) Word Sense Disambiguation. Text, Speech and Language Technology, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4809-8_3

Download citation

DOI: https://doi.org/10.1007/978-1-4020-4809-8_3
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4808-1
Online ISBN: 978-1-4020-4809-8
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)

Publish with us

Policies and ethics