Abstract
Spell Checking is an integral part of modern word-processing applications. Current spellcheckers can only detect and correct non-word errors. They cannot effectively deal with real-word errors; misspelled words that result in valid English words. Current techniques for detecting real-word errors require huge volume of training corpus and the learned knowledge is represented by opaque set of features that are not apparent. This paper proposes a new method for dealing with real-word errors using selectional preferences of predicates for arguments in a case slot. The method requires very little in terms of resources and can use existing lexicons slightly modified to suit the above task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wagner, R.A.: Order–n correction for regular languages. Communications of ACM 17(5), 265–268 (1974)
Kukich, K.: Techniques for automatically correcting words in text. ACM Computing Surveys 24(4), 377–439 (1992)
Atwell, E., Elliott, S.: Dealing with ill formed English text. In: Garside, R., Leach, G., Sampson, G. (eds.) The Computational Analysis of English: A Corpus–Based approach, Longman, Inc., New York (1987)
Church, K.W., Gale, W.A.: Probability scoring for spelling correction. Stat. Comp. 1, 93–103 (1991)
Mays, E., Damerau, F.J., Mercer, R.L.: Context based spelling correction. Information Processing and Management 27(5), 517–522 (1991)
Golding, A.: A Bayesian hybrid method for context-sensitive spelling correction. In: Proceedings of Third Workshop on Very Large Corpora, Boston, MA, pp. 39–53 (1995)
Golding, A., Roth, D.: Applying Winnow to Context-Sensitive Spelling Correction. In: Machine Learning: Proceedings of the 13th International Conference, San Fransico, pp. 182–190 (1996)
Golding, A., Schabes, Y.: Combining Trigram-based and Feature-based Methods for context-Sensitive Spelling Correction. In: Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, Santa Cruz, CA, pp. 71–78 (1996)
Brill, E., Mangu, L.: Automatic Rule Acquisition for Spelling Correction. In: Proceedings of the 14th International Conference on Machine Learning (ICML 1997), Nashville, TN, pp. 187–194. Morgan Kaufmann, San Francisco (1997)
Budanitsky, A.: Lexical semantic Relatedness and its Applications in NLP. TR CSRG –390, Department of Computer Science, University of Toronto (1999)
Hirst, G., Onge, D.S.: Lexical Chains as representations of context for detection and correction of malapropism. In: Fellbaum, pp. 305–332 (1998)
Resnik, P.: Selectional preferences and Sense disambiguation. In: Proceedings of ANLP 1997 workshop: tagging text with lexical semantics,Washington DC, pp. 52–56 (1997)
Miller, G.: WordNet: An online lexical database. International Journal of Lexicography 3(4) (1990)
Li, H., Abe, N.: Generalizing case frames using a thesaurus and the MDL principle. Computational Linguistics 24(2), 217–244 (1998)
Clark, S., Weir, D.: An iterative approach to estimating frequencies over a semantic hierarchy, In: Proceedings of 1999 Joint SIGDAT Conference on EMNLP and very large corpora (1999)
Abney, S., Light, M.: Hiding a semantic hierarchy in a Markov model. In: Proceedings of the ACL Workshop on Unsupervised Learning in Natural Language Processing (1999)
Ciaramita, M., Johnson, M.: Explaining away ambiguity: Learning verb selectional preferences with Bayesian Networks. In: Proceedings of the 18th International Conference on Computational Linguistics, COLING 2000 (2000)
Rissanen, J.: Modeling by shortest data description. Automatic 14, 37–38 (1978)
Lin, D.: MINIPAR – a broad coverage parser for English. Nalante Inc, Alberta (1998)
Rennie, J.: WordNet: Query Data: a Perl module for accessing the WordNet database (2000), http://www.ai.mit.edu/people/jrennie/WordNet
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Banu, R.S.D.W., Kumar, R.S. (2004). Using Selectional Restrictions for Real Word Error Correction. In: Manandhar, S., Austin, J., Desai, U., Oyanagi, Y., Talukder, A.K. (eds) Applied Computing. AACC 2004. Lecture Notes in Computer Science, vol 3285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30176-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-30176-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23659-7
Online ISBN: 978-3-540-30176-9
eBook Packages: Springer Book Archive