Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification

Zhang, Xiaonan; Xu, Jun; Cai, Lianhong

doi:10.1007/11939993_19

Xiaonan Zhang²²,
Jun Xu²² &
Lianhong Cai²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

1575 Accesses
5 Citations

Abstract

Prosodic boundary prediction is the key to improving the intelligibility and naturalness of synthetic speech for a TTS system. This paper investigated the problem of automatic segmentation of prosodic word and prosodic phrase, which are two fundamental layers in the hierarchical prosodic structure of Mandarin Chinese. Maximum Entropy (ME) Model was used at the front end for both prosodic word and prosodic phrase prediction, but with different feature selection schemes. A multi-pass prediction approach was adopted. Besides, an error-driven rule-based modification module was introduced into the back end to amend the initial prediction. Experiments showed that this combined approach outperformed many other methods like C4.5 and TBL.

This work is supported by China National Natural Science Foundation(60433030, 60418012).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cao, J.: Prediction of Prosodic Organization Based on Grammatical Information. Journal of Chinese Information Processing 17, 41–46 (2003)
Google Scholar
Gee, J.P., Grosjean, F.: Performance structures: A psycholinguistic and Linguistic Appraisal. Cognitive Psychology 15, 411–458 (1983)
Article Google Scholar
Cao, J., Zhu, W.: Syntactic and Lexical Constraint in Prosodic Segmentation and Grouping. In: Proceedings of Speech Prosody 2002, Aix-en-Provence, France (2002)
Google Scholar
Wang, H.: Prosodic words and prosodic phrases in Chinese. Chinese Language 6, 525–536 (2000)
Google Scholar
Wang, M., Hirschberg, J.: Predicting Intonational Boundaries Automatically from Text. In: The ATIS Domain Proceedings of the DARPA Speech and Natural Language Workshop, pp. 378–383 (1991)
Google Scholar
Taylor, P., Black, A.W.: Assigning phrase breaks from part-of speech sequences. Computer Speech and Language 12(4), 99–117 (1998)
Article Google Scholar
Sheng, Z., Jianhua, T., Lianhong, C.: Learning rules for Chinese prosodic phrase prediction. In: International Conference on Computational Linguistics, Proceeding of the first SIGHAN workshop on Chinese language processing, vol. 18 (2002)
Google Scholar
Li, J.-F., Hu, G.-P., Wang, R.: Chinese prosody phrase break prediction based on maximum entropy model. In: Interspeech 2004, Jeju Island, Korea, 729–732 (2004)
Google Scholar
Berger, A.L., Stephen, A., Sa, D.P., et al.: A maximum entropy approach to natural language processing. Computational Linguistics 22(1), 39–71 (1996)
Google Scholar
Zheng, M., Cai, L.: Prosodic Constituents Segmentation and Syntax of Chinese. In: Proceeding of 5th Chinese Lexical Semantics Workshop, Singapore (2004)
Google Scholar
Brill, E.: Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Computational Linguistics 21(4), 543–565
Google Scholar
Chu, M.: The Uncertainty in Prosody of Natural Speech and Its Application in Speech Synthesis. Journal of Chinese Information Processing 18(4), 66–71 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Pervasive Computing (Tsinghua University), Ministry of Education, 100084, Beijing
Xiaonan Zhang, Jun Xu & Lianhong Cai

Authors

Xiaonan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Xu
View author publications
You can also search for this author in PubMed Google Scholar
Lianhong Cai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, X., Xu, J., Cai, L. (2006). Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification . In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_19

Download citation

DOI: https://doi.org/10.1007/11939993_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics