Abstract
In this paper, the concept of Relative Frequency Ratio (RFR) is presented to evaluate the strength of collocation. Based on RFR, a WSD Model RFR-SUM is put forward to disambiguate polysemous Chinese word sense. It selects 9 frequently used polysemous words as examples, and achieves the average precision up to 92:50% in open test. It has compared the model with Naïve Bayesian Model and Maximum Entropy Model. The results show that the precision by RFR-SUM Model is 5:95% and 4:48% higher than that of Naïve Bayesian Model and Max- imum Entropy Model respectively. It also tries to prune RFR lists. The results reveal that leaving only 5% important collocation information can keep almost the same precision. At the same time, the speed is 20 times higher.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhang, Y., Gong, L., Wang, Y.: Chinese Word Sense Disambiguation Using HowNet. In: Wang, L., Chen, K., Ong, Y.S. (eds.) ICNC 2005. LNCS, vol. 3610, pp. 925–932. Springer, Heidelberg (2005)
Ide, N., Veronis, J.: Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics 24(1), 1–40 (1998)
Ng, H.T., Wang, B., Chan, Y.S.: Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study. In: Dignum, F.P.M. (ed.) ACL 2003. LNCS (LNAI), vol. 2922, pp. 455–462. Springer, Heidelberg (2004)
Li, H., Li, C.: Word Translation Disambiguation Using Bilingual Bootstrapping. Computational Linguistics 30(1), 1–22 (2004)
Changqin, Q., Tingting, H., et al.: Chinese WSD based on Selecting the Best Seeds from Collocations. Journal of Chinese Information Processing (in Chinese) 19(1), 30–35 (2005)
Dang, H.T., Chia, C.-y., et al.: Simple Features for Chinese Word Sense Disambiguation. In: Proc. Of COLING-2002, Philadelphia, USA, pp. 769–772 (2002)
Li, W., Lu, Q., Li, W.: Integrating Collocation Features in Chinese Word Sense Disambiguation. In: Proceedings of the Fourth Sighan Workshop on Chinese Language Processing, Jeju, Korea pp. 87–94 (2005)
Smadja, F.: Retrieving Collocations from Text: Xtract. Computational Linguistics 19(1), 143–177 (1993)
Lin, D.: Extracting Collocations from Text Corpora. In: Proceedings of COLLING/ACL-98 Workshop on Computational Terminology, Montreal, Canada. pp. 57–63 (1998)
Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
Qu, W.: Generalized Collocation and Context-based Computational Model, Ph. D Dissertation, Nanjing Normal University (2005)
Qu, W.: A Frame-based Approach to Chinese Collocation Automatic Extracting. Computer Engineering (in Chinese) 30(23), 22–24 (2004)
Wang, Z., Wang, H., Duan, H., Han, S., Yu, S.: Chinese Noun Phrase Metaphor Recognition with Maximum Entropy Approach. In: Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics, Mexico, pp. 235–244 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Qu, W., Sui, Z., Ji, G., Yu, S., Zhou, J. (2007). A Collocation-Based WSD Model: RFR-SUM. In: Okuno, H.G., Ali, M. (eds) New Trends in Applied Artificial Intelligence. IEA/AIE 2007. Lecture Notes in Computer Science(), vol 4570. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73325-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-73325-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73322-5
Online ISBN: 978-3-540-73325-6
eBook Packages: Computer ScienceComputer Science (R0)