A Unified Framework for Text Analysis in Chinese TTS

Fu, Guohong; Zhang, Min; Zhou, GuoDong; Luke, Kang-Kuong

doi:10.1007/11939993_24

Guohong Fu^22,23,
Min Zhang²⁴,
GuoDong Zhou^24,25 &
…
Kang-Kuong Luke²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

1583 Accesses
1 Citations

Abstract

This paper presents a robust text analysis system for Chinese text-to-speech synthesis. In this study, a lexicon word or a continuum of non-hanzi characters with the same category (e.g. a digit string) are defined as a morpheme, which is the basic unit forming a Chinese word. Based on this definition, the three key issues concerning the interpretation of real Chinese text, namely lexical disambiguation, unknown word resolution and non-standard word (NSW) normalization can be unified in a single framework and reformulated as a two-pass tagging task on a sequence of morphemes. Our system consists of four main components: (1) a pre-segmenter for sentence segmentation and morpheme segmentation; and (2) a lexicalized HMM-based chunker for identifying unknown words and guessing their part-of-speech categories; and (3) a HMM-based tagger for converting orthographic morphemes to their Chinese phonetic representation (viz. pinyin), given their word-formation patterns and part-of-speech information; (4) a post-processing for interpreting phonetic tags and fine-tuning pronunciation order for some special NSWs if necessary. The evaluation on a pinyin-notated corpus built from the Peking University corpus shows that our system can achieve correct interpretation for most words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shih, C., Sproat, R.: Issues in text-to-speech conversion for Mandarin. Computational Linguistics and Chinese Language Processing 1(1), 37–86 (1996)
Google Scholar
Xu, J., Fu, G., Li, H.: Grapheme-to-Pinyin for Chinese text-to-speech system. In: Proceedings of the 8th International Conference on Spoken Language Processing (INTERSPEECH 2004 - ICSLP), Jeju Island, Korea, pp. 1885–1888 (2004)
Google Scholar
Lemmetty, S.: Review of speech synthesis technology, Master’s Thesis, Helsinki University of Technology, Finland (1999)
Google Scholar
Sproat, R., Black, A., Chen, S., Kumar, S., Ostendorf, M., Richards, C.: Normalization of non-standard words. Computer Speech and Language 15(3), 287–333 (2001)
Article Google Scholar
Yu, S., Duan, H., Zhu, X., Swen, B., Chang, B.: Specification for corpus processing at Peking University: Word segmentation, POS tagging and phonetic notation. Journal of Chinese Language and Computing 13(2), 121–158 (2003)
Google Scholar
Fu, G., Luke, K.-K.: Chinese unknown word identification using classbased LM. In: Su, K.-Y., Tsujii, J., Lee, J.-H., Kwong, O.Y. (eds.) IJCNLP 2004. LNCS (LNAI), vol. 3248, pp. 704–713. Springer, Heidelberg (2005)
Chapter Google Scholar
Fu, G., Luke, K.-K.: Chinese named entity recognition using lexicalized HMMs. ACM SIGKDD Explorations Newsletter 7(1), 19–25 (2005)
Article Google Scholar
Fu, G.: User rule specification for text normalization. InfoTalk Technical Report, InfoTalk -R&D -2002-001
Google Scholar
Zhang, Z., Chu, M., Chang, E.: An efficient way to learn rules for grapheme- ophoneme conversion in Chinese. In: Proceedings of 2002 International Symposium on Chinese Spoken Language Processing (ISCSLP 2002), Taipei, Taiwan, pp. 59–63 (2002)
Google Scholar
Zheng, M., Shi, Q., Zhang, W., Cai, L.: Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS system. In: Proceedings of ITERSPEECH 2005, Lisbon, Portugal, pp. 1897–1900 (2005)
Google Scholar
Sproat, R., Emerson, T.: The first international Chinese word segmentation bakeoff. In: Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan, pp. 133–143 (2003)
Google Scholar
Emerson, T.: The second international Chinese word segmentation bakeoff. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, Jeju Island, Korea, pp. 123–133 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept of Chinese, Translation and Linguistics, City University of Hong Kong, Hong Kong
Guohong Fu
Departmant of Linguistics, The University of Hong Kong, Hong Kong
Guohong Fu & Kang-Kuong Luke
Institute for Infocomm Research, 119613, Singapore
Min Zhang & GuoDong Zhou
School of Computer Science and Technology, Suzhou University, 215006, Suzhou
GuoDong Zhou

Authors

Guohong Fu
View author publications
You can also search for this author in PubMed Google Scholar
Min Zhang
View author publications
You can also search for this author in PubMed Google Scholar
GuoDong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Kang-Kuong Luke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fu, G., Zhang, M., Zhou, G., Luke, KK. (2006). A Unified Framework for Text Analysis in Chinese TTS. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_24

Download citation

DOI: https://doi.org/10.1007/11939993_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics