Skip to main content

Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification

  • Conference paper
Book cover Chinese Spoken Language Processing (ISCSLP 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

Abstract

Prosodic boundary prediction is the key to improving the intelligibility and naturalness of synthetic speech for a TTS system. This paper investigated the problem of automatic segmentation of prosodic word and prosodic phrase, which are two fundamental layers in the hierarchical prosodic structure of Mandarin Chinese. Maximum Entropy (ME) Model was used at the front end for both prosodic word and prosodic phrase prediction, but with different feature selection schemes. A multi-pass prediction approach was adopted. Besides, an error-driven rule-based modification module was introduced into the back end to amend the initial prediction. Experiments showed that this combined approach outperformed many other methods like C4.5 and TBL.

This work is supported by China National Natural Science Foundation(60433030, 60418012).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cao, J.: Prediction of Prosodic Organization Based on Grammatical Information. Journal of Chinese Information Processing 17, 41–46 (2003)

    Google Scholar 

  2. Gee, J.P., Grosjean, F.: Performance structures: A psycholinguistic and Linguistic Appraisal. Cognitive Psychology 15, 411–458 (1983)

    Article  Google Scholar 

  3. Cao, J., Zhu, W.: Syntactic and Lexical Constraint in Prosodic Segmentation and Grouping. In: Proceedings of Speech Prosody 2002, Aix-en-Provence, France (2002)

    Google Scholar 

  4. Wang, H.: Prosodic words and prosodic phrases in Chinese. Chinese Language 6, 525–536 (2000)

    Google Scholar 

  5. Wang, M., Hirschberg, J.: Predicting Intonational Boundaries Automatically from Text. In: The ATIS Domain Proceedings of the DARPA Speech and Natural Language Workshop, pp. 378–383 (1991)

    Google Scholar 

  6. Taylor, P., Black, A.W.: Assigning phrase breaks from part-of speech sequences. Computer Speech and Language 12(4), 99–117 (1998)

    Article  Google Scholar 

  7. Sheng, Z., Jianhua, T., Lianhong, C.: Learning rules for Chinese prosodic phrase prediction. In: International Conference on Computational Linguistics, Proceeding of the first SIGHAN workshop on Chinese language processing, vol. 18 (2002)

    Google Scholar 

  8. Li, J.-F., Hu, G.-P., Wang, R.: Chinese prosody phrase break prediction based on maximum entropy model. In: Interspeech 2004, Jeju Island, Korea, 729–732 (2004)

    Google Scholar 

  9. Berger, A.L., Stephen, A., Sa, D.P., et al.: A maximum entropy approach to natural language processing. Computational Linguistics 22(1), 39–71 (1996)

    Google Scholar 

  10. Zheng, M., Cai, L.: Prosodic Constituents Segmentation and Syntax of Chinese. In: Proceeding of 5th Chinese Lexical Semantics Workshop, Singapore (2004)

    Google Scholar 

  11. Brill, E.: Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Computational Linguistics 21(4), 543–565

    Google Scholar 

  12. Chu, M.: The Uncertainty in Prosody of Natural Speech and Its Application in Speech Synthesis. Journal of Chinese Information Processing 18(4), 66–71 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, X., Xu, J., Cai, L. (2006). Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification . In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_19

Download citation

  • DOI: https://doi.org/10.1007/11939993_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49665-6

  • Online ISBN: 978-3-540-49666-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics