Abstract
For providing naturalness in synthesized speech it is imperative to give appropriate intonation on the synthesized sentences. The problem is not with synthesis engines but with the fact that comprehensive intonation rules of natural intonation are not available for any of the major spoken language of India. The knowledge available in this area is primarily subjective with the risk of unintentional personal bias. It lacks plurality in the sense that these do not reflect the natural intonation of common people. It is imperative to derive intonation rules through analysis of large amount of sentences spoken by common people. Manual processing is time consuming and extremely cumbrous. The present paper describes briefly an automated approach for such a task. A pilot study on about 1000 complex and interrogative sentences spoken by five female and four male native speakers is presented. 93% accuracy is obtained for the desired objective.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fujisaki, H.: Prosody, Models, and Spontaneous Speech. In: Sagisaka, Y., Campbell, N., Higuchi, N. (eds.) Computing Prosody, pp. 27–42. Springer, New York (1996)
Chowdhury, S., Datta, A.K., Chaudhuri, B.B.: Intonation Patterns for Text Reading in Standard Colloquial Bengali. Journal of the Acoustical Society of India 30, 160–163 (2002)
http://www.cdackolkata.in/html/txttospeeh/corpora/corpora_main/MainB.html
Chowdhury, S., Datta, A.K., Choudhury, B.B.: Pitch detection Algorithm using State Phase Analysis. J. Acous. Ind. 28, 247–250 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Datta, A.K., Saha, A. (2012). A System for Analysis of Large Scale Speech Data for the Development of Rules of Intonation for Speech Synthesis. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR FRSM 2011 2011. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-31980-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31979-2
Online ISBN: 978-3-642-31980-8
eBook Packages: Computer ScienceComputer Science (R0)