Abstract
Disambiguating the polysemous word is one of the major issues in the process of Machine Translation. The word may have many senses, selecting the most appropriate sense for an ambiguous word in a sentence is a central problem in Machine Translation. Because, each sense of a word in a source language sentence may generate different target language sentences. Knowledge and corpus based methods are usually applied for disambiguation task. In the present paper, we propose an algorithm to disambiguate Kannada polysemous words using association rules. We built Kannada corpora using web resources. The corpora are divided in to training and testing corpora. The association rules required for disambiguation tasks are extracted from training corpora. The example sentences needs to be disambiguated are stored in testing corpora. The proposed algorithm attempts to disambiguate all the content words such as nouns, verbs, adverbs, adjectives in an unrestricted text using association rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ng, H.T., Lee, H.B.: Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach. In: 34th Annual Meeting of the ACL, pp. 40–47. Morgan Kaufmann Publication, San Francisco (1996)
Yarowsky, D.: Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34(1-2), 179–186 (2000)
Martinez, D., Agirre, E., Marquez, L.: Syntactic features for high precision word sense disambiguation. In: COLING (2002)
Mihalcea, R., Moldovan, D.: A method for word sense disambiguation of unrestricted text. In: 37th Annual Meeting of the ACL, Maryland, pp. 152–158 (1999)
Wilks, Y., Stevenson, M.: Word Sense Disambiguation using optimized combination of knowledge sources. In: 36th Annual Meeting of the ACL, Canada, pp. 1398–1402 (1998)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from a ice cream cone. In: SIGDOC Conference, Toronto Canada, pp. 24–26 (1986)
Walker, D., Amsler, R.: The Use of Machine Readable Dictionaries in Sublanguage Analysis. In: Analyzing Language in Restricted Domains, pp. 69–83. LEA Press (1986)
Eneko, A., Rigau, G.: Word sense disambiguation using conceptual density. In: 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark (1996)
Rada, M.: Large vocabulary unsupervised word sense disambiguation with graph based algorithms for sequence data labeling. In: Joint Human Language Technology and Empirical Methods in Natural Language Processing Conference (HLT/EMNLP), Vancouver, Canada, pp. 411–418 (2005)
David, Y.: Decision lists for lexical ambiguity resolution: Application to accent restoration in Spanish and French. In: 32nd Annual Meeting of the Association for Computational Linguistics (ACL), Las Cruces, USA, pp. 88–95 (1994)
Gale, W.A., Church, K.W., Yarowsky, D.: Estimating upper and lower bounds on the performance of word sense disambiguation programs. In: 30th Annual Meeting of the Association for Computational Linguistics, pp. 249–256. University of Delaware, Newark (1992)
Fakhrahmad, S.M., Rezapour, A.R., Zolghadri Jahromi, M., Sadreddini, M.H.: A New Word Sense Disambiguation System Based on Deduction. In: World Congress on Engineering, London, U.K, vol. II (2011)
Firth, R.: Man and Culture, An evaluation of the work of bronislaw malinowski, pp. 167–169. The Humanities Press (1957)
Schutze, H.: Dimensions of meaning. In: Supercomputing 1992, Minneapolis, pp. 787–796 (1992)
Miller, G., Charles, W.: Contextual correlates of semantic similarity. Language and Cognitive Proceesses 6, 1–28 (1991)
Navigli, R., Velardi, P.: Structural Semantic Interconnections: A Knowledge- Based Approach to Word Sense Disambiguation. IEEE Transaction on Pattern Analysis and Machine Intelligence (2005)
Fellbaum, C.: WordNet: An Electronic Lexical Database. The MIT Press (1998)
Sridhar, S.N.: Modern Kannada Grammar. Manohar Publishers & Distributors, New Delhi (2007)
Kushaalappa Gowda, K.: Kannada Bhashe & VyakaranagaLa Ondu samiikshe (Kannada Language and its Grammar). Karthik enterprises, Bangalore, India (2008)
Narayana Murthy, K.: Computer Processing of Kannada Language. In: Workshop at Kannada University, Hampi, University of Hyderabad, Hyderabad (2001)
Agrawal, R., Imielinski, T., Swami, A.N.: Mining Association rules between sets of items in large databases. In: ACM SIGMOD International Conference on Management of Data, Washington D.C, pp. 207–216 (1993)
Sinclair, J.: Corpus, Concordance, Collocation. Oxford University Press, Oxford (1991)
Barlow, M.: Corpora for theory and Practice. J. Corpus Linguistics 1(1), 1–38 (1996)
Dash, N.S., Chaudhuri, B.B.: Relevance of Corpus in Language research and applications. J. Dravidian Linguistics 32(2), 101–122 (2002)
Kannada Shallow Parser. IIIT, Hyderabad, http://ltrc.iiit.ac.in/analyzer/Kannada
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Parameswarappa, S., Narayana, V.N. (2012). Kannada Word Sense Disambiguation Using Association Rules. In: Krishna, P.V., Babu, M.R., Ariwa, E. (eds) Global Trends in Information Systems and Software Applications. ObCom 2011. Communications in Computer and Information Science, vol 270. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29216-3_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-29216-3_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29215-6
Online ISBN: 978-3-642-29216-3
eBook Packages: Computer ScienceComputer Science (R0)