Skip to main content

On Detection of Synonyms Between Simplified Chinese of Mainland China and Traditional Chinese of Taiwan: A Semantic Similarity Method

  • Conference paper
  • First Online:
Book cover Chinese Lexical Semantics (CLSW 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9332))

Included in the following conference series:

Abstract

We present an approach for automatically detecting synonyms between simplified Chinese used in mainland China and traditional Chinese used in Taiwan from large scale corpus. After pre-processing step (including doing segmentation and POS tagging on our corpora), all words are classified into 3 categories according to their frequency: words exclusively used in mainland China, words exclusively used in Taiwan, and words commonly used in both sides. We use word vectors to represent meanings of words, calculate semantic similarities between words of both sides, and extract synonyms. The experiment shows that our approach can find synonyms that are not present in handcrafted dictionary.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Su, J.: Research on Homographs across the Straits. Studies of the Chinese Language 1995(2), 107–117 (1995). (苏金智: 海峡两岸同形异义词研究. 中国语文. 1995(2), 107–117 (1995)). (in Chinese)

    Google Scholar 

  2. Li, X., Qiu, Z.: Determination and Treatment of Diverse Words in the Cross-Straits Dictionary. Applied Linguistics 2012(4), 74–81 (2012). (in Chinese)

    Google Scholar 

  3. The Common Words Dictionary of the Cross-Straits. http://www.zhonghuayuwen.org/PageInfo.aspx?Id=375. (in Chinese)

  4. Richardson, R., Smeaton, A., Murphy, J.: Using WordNet as a knowledge base for measuring semantic similarity between words. In: Proceedings of AICS Conference (1994)

    Google Scholar 

  5. Liu, Q., Li, S.: Word Similarity Computing Based on How-net. Computational Linguistics and Chinese Language Processing 7(2), 59–76 (2002). (in Chinese)

    Google Scholar 

  6. Chen, Y., Shi, X., Zhou, C.: A simplified-traditional chinese character conversion model based on log-linear models. In: Proceedings of International Conference on Asian Language Processing (2011)

    Google Scholar 

  7. Wang, S., Cao, C., Pei, Y., Xia, F.: A Collocation-based Method for Semantic Similarity Measure for Chinese Words. Journal of Chinese Information Processing. 27(1), 7–14 (2013). (in Chinese)

    Google Scholar 

  8. Shi, J., Wu, Y., Qiu, L., Lv, X.: Chinese Lexical Semantic Similarity Computing Based on Large-scale Corpus. Journal of Chinese Information Processing 27(1), 1–6+80 (2013). (in Chinese)

    Google Scholar 

  9. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of Workshop at ICLR (2013)

    Google Scholar 

  10. Swain, M.J., Ballard, D.H.: Color Indexing. IJCV 7(1), 11–32 (1991)

    Article  Google Scholar 

  11. Salton, G., Buckley, C.: Term-weighting Approaches in Automatic Text Retrieval. Information Processing & Management 24(5), 513–523 (1988)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaodong Shi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, B., Shi, X. (2015). On Detection of Synonyms Between Simplified Chinese of Mainland China and Traditional Chinese of Taiwan: A Semantic Similarity Method. In: Lu, Q., Gao, H. (eds) Chinese Lexical Semantics. CLSW 2015. Lecture Notes in Computer Science(), vol 9332. Springer, Cham. https://doi.org/10.1007/978-3-319-27194-1_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27194-1_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27193-4

  • Online ISBN: 978-3-319-27194-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics