Abstract
This paper documents research toward building a complete lexicon containing all the words found in general newspaper text. It is intended to give the reader an understanding of the inherent limitations of existing vocabulary collection methods and of the need for greater attention to multi-word phrases as the building blocks of text. In addition, although traditional reference books define many proper nouns, they appear to be very limited in their coverage of the new proper nouns that appear daily in newspapers. Proper nouns appear to require a grammar and a lexicon of components, in much the same way that general parsing of text requires syntactic rules and a lexicon of common nouns.
This paper is an expanded version of a paper that appeared in the Proceedings of the 1989 SIGIR Conference, Cambridge, MA, 25–28 June 1989.
© 1994 Springer Science+Business Media Dordrecht
Amsler, R.A. (1994). Research Toward the Development of a Lexical Knowledge Base for Natural Language Processing. In: Zampolli, A., Calzolari, N., Palmer, M. (eds) Current Issues in Computational Linguistics: In Honour of Don Walker. Linguistica Computazionale, vol 9. Springer, Dordrecht. https://doi.org/10.1007/978-0-585-35958-8_8
DOI: https://doi.org/10.1007/978-0-585-35958-8_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-2998-5
Online ISBN: 978-0-585-35958-8