Abstract
Speech synthesis converts text into speech. Speech recognition and speech synthesis play a vital role in human-machine interaction. Synthesized speech is produced by concatenating pieces of pre-recorded speech utterances stored in a database. The proposed work converts written text into syllables (a syllable text representation) using a rule-based approach, and then converts the syllable representation into modified syllable waveform clips that are combined to produce sound. Syllabic transcription attempts to describe the individual variations that occur between speakers of a dialect or language. Syllable-based concatenative synthesis aims to record the syllables that a speaker uses rather than the actual spoken variants of those syllables that are produced when the speaker utters a word. Concatenative speech synthesis methods produce highly intelligible speech.
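The two stages described in the abstract (rule-based conversion of text to syllables, then concatenation of per-syllable waveform clips) can be illustrated with a minimal Python sketch. The abstract does not state the chapter's actual syllabification rules or its clip database, so everything here is an assumption made for illustration: the toy rule for English spelling (a single consonant between vowels starts the next syllable; a consonant cluster is split after its first consonant), the names `syllabify`, `text_to_syllables`, `placeholder_clip`, and `synthesize`, the 16 kHz sampling rate, and the sine tones standing in for recorded syllable clips.

```python
import math
import re

SAMPLE_RATE = 16000  # assumed sampling rate in Hz (not taken from the chapter)


def syllabify(word):
    """Split a word into syllables with a toy orthographic rule (an assumption)."""
    word = word.lower()
    # Each run of vowel letters is treated as one syllable nucleus.
    nuclei = [(m.start(), m.end()) for m in re.finditer(r"[aeiou]+", word)]
    if not nuclei:
        return [word] if word else []
    cuts = []
    for (_, prev_end), (next_start, _) in zip(nuclei, nuclei[1:]):
        cluster_len = next_start - prev_end
        # One consonant goes to the next syllable; a cluster is split after
        # its first consonant.
        cuts.append(prev_end if cluster_len <= 1 else prev_end + 1)
    pieces, start = [], 0
    for cut in cuts:
        pieces.append(word[start:cut])
        start = cut
    pieces.append(word[start:])
    return pieces


def text_to_syllables(text):
    """Convert text into a flat list of syllables (the syllable text representation)."""
    words = re.findall(r"[a-z]+", text.lower())
    return [syl for word in words for syl in syllabify(word)]


def placeholder_clip(syllable, duration_ms=100):
    """Hypothetical stand-in for a recorded clip: a sine tone keyed to the syllable."""
    freq = 200 + 20 * (sum(map(ord, syllable)) % 10)
    n_samples = SAMPLE_RATE * duration_ms // 1000
    return [math.sin(2 * math.pi * freq * i / SAMPLE_RATE) for i in range(n_samples)]


def synthesize(text, database):
    """Concatenate the stored clip of each syllable into one list of samples."""
    samples = []
    for syl in text_to_syllables(text):
        if syl not in database:
            raise KeyError(f"no clip recorded for syllable {syl!r}")
        samples.extend(database[syl])
    return samples


# Build a tiny placeholder database and synthesize a phrase from it.
syllables = text_to_syllables("banana computer")
database = {syl: placeholder_clip(syl) for syl in syllables}
waveform = synthesize("banana computer", database)
```

In a real system the database would hold recorded (and possibly modified) syllable clips rather than tones, and the joins between clips would typically be smoothed; the sketch only shows the data flow from text to syllables to a concatenated waveform.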
Copyright information
© 2015 Springer India
About this paper
Cite this paper
Ananthi, S., Dhanalakshmi, P. (2015). Syllable Based Concatenative Synthesis for Text to Speech Conversion. In: Jain, L., Behera, H., Mandal, J., Mohapatra, D. (eds) Computational Intelligence in Data Mining - Volume 3. Smart Innovation, Systems and Technologies, vol 33. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2202-6_6
DOI: https://doi.org/10.1007/978-81-322-2202-6_6
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2201-9
Online ISBN: 978-81-322-2202-6
eBook Packages: Engineering, Engineering (R0)