Abstract
In order to improve unlimited TTS, a framework to organize the multiple perceived units into discourse is proposed in [1]. To make an unlimited TTS system, we must transform the original text to the text with corresponding boundary breaks. So we describe how we predicate prosody from Text in this paper. We use the corpora with boundary breaks which follow the prosody framework. Then we use the lexical and syntactic information to predict prosody from text. The result shows that the weighted precision in our model is better than some speakers. We have shown our model can predict a reasonable prosody form text.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Tseng, C.-y., Pin, S.-h., Lee, Y.-l., Wang, H.-m., Chen, Y.-c.: Fluent speech prosody: framework and modeling. Speech Communication 46(3-4) (July 2005); Special Issue on Quantitative Prosody Modelling for Natural Speech Description and Generation, 284–309
Tseng, C.-y., Cheng, Y.-C., Chang, C.-H.: Sinica COSPRO and Toolkit—Corpora and Platform of Mandarin Chinese Fluent Speech. In: Proceedings of Oriental COCOSDA 2005, Jakarata, Indonesia, December 6-8, pp. 23–28 (2005)
Sinica COSPRO and Toolkit: http://www.myet.com/COSPRO/
Peng, H., Chen, C., Tseng, C., Chen, K.: Predicting Prosodic Words From Lexical Words–A First Step Towards Predicting Prosody From Text. In: International Symposium on Chinese Spoken Language Processing, ISCSLP (2004)
Hsieh, Y.-M., Yang, D.-C., Chen, K.-J.: Linguistically-motivated grammar extraction, generalization and adaptation. In: Proceedings of the Second International Join Conference on Natural Language Processing, Jeju Island, Republic of Korea, pp. 177–187 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, KJ., Tseng, Cy., Tai, Ch. (2006). Predicting Prosody from Text. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_22
Download citation
DOI: https://doi.org/10.1007/11939993_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)