A Collocation-Based WSD Model: RFR-SUM

  • Conference paper
New Trends in Applied Artificial Intelligence (IEA/AIE 2007)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4570)

Abstract

This paper presents the Relative Frequency Ratio (RFR), a measure of collocation strength. Based on RFR, a word sense disambiguation (WSD) model, RFR-SUM, is proposed to disambiguate the senses of polysemous Chinese words. Nine frequently used polysemous words are selected as examples, and the model achieves an average precision of up to 92.50% in open tests. The model is compared with a Naïve Bayesian model and a Maximum Entropy model: the precision of RFR-SUM is 5.95% and 4.48% higher, respectively. Pruning of the RFR lists is also explored. The results show that keeping only the most important 5% of the collocation information preserves almost the same precision while making the model 20 times faster.
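The abstract does not give the RFR formula or the RFR-SUM decision rule, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes that RFR is the relative frequency of a context word among the contexts of one sense divided by its relative frequency over all contexts, that RFR-SUM picks the sense with the largest sum of RFR values over the context words, and that pruning keeps the top-scoring fraction of each sense's list. The function names `build_rfr_lists`, `prune`, and `disambiguate`, and the toy English data, are hypothetical.

```python
from collections import Counter, defaultdict


def build_rfr_lists(tagged_contexts):
    """Build one RFR list per sense from (sense, context_words) pairs.

    ASSUMPTION: RFR(word, sense) = relative frequency of the word in the
    contexts of that sense / relative frequency of the word in all contexts.
    The paper's actual definition may differ.
    """
    sense_counts = defaultdict(Counter)
    global_counts = Counter()
    for sense, words in tagged_contexts:
        sense_counts[sense].update(words)
        global_counts.update(words)
    total = sum(global_counts.values())

    rfr = {}
    for sense, counts in sense_counts.items():
        sense_total = sum(counts.values())
        rfr[sense] = {
            word: (count / sense_total) / (global_counts[word] / total)
            for word, count in counts.items()
        }
    return rfr


def prune(rfr, keep_fraction):
    """Keep only the highest-scoring fraction of each sense's RFR list
    (at least one entry per sense). ASSUMPTION: pruning is by RFR rank."""
    pruned = {}
    for sense, scores in rfr.items():
        k = max(1, int(len(scores) * keep_fraction))
        ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
        pruned[sense] = dict(ranked[:k])
    return pruned


def disambiguate(rfr, context):
    """Return the sense whose RFR values, summed over the context words,
    are largest. Words missing from a sense's list contribute 0."""
    totals = {
        sense: sum(scores.get(word, 0.0) for word in context)
        for sense, scores in rfr.items()
    }
    return max(totals, key=totals.get)


# Toy sense-tagged contexts for a hypothetical ambiguous word.
training = [
    ("finance", ["money", "loan"]),
    ("finance", ["money", "deposit"]),
    ("river", ["water", "shore"]),
    ("river", ["water", "fish"]),
]
rfr_lists = build_rfr_lists(training)
predicted = disambiguate(rfr_lists, ["money", "water", "loan"])
small_lists = prune(rfr_lists, 0.34)
```

With lists of this shape, pruning shrinks the number of lookups per sense, which is one plausible reason a pruned model could run faster at nearly the same precision.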



Editor information

Hiroshi G. Okuno, Moonis Ali

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Qu, W., Sui, Z., Ji, G., Yu, S., Zhou, J. (2007). A Collocation-Based WSD Model: RFR-SUM. In: Okuno, H.G., Ali, M. (eds) New Trends in Applied Artificial Intelligence. IEA/AIE 2007. Lecture Notes in Computer Science, vol 4570. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73325-6_3

  • DOI: https://doi.org/10.1007/978-3-540-73325-6_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73322-5

  • Online ISBN: 978-3-540-73325-6

  • eBook Packages: Computer Science, Computer Science (R0)
