Syllable Based Concatenative Synthesis for Text to Speech Conversion

Ananthi, S.; Dhanalakshmi, P.

doi:10.1007/978-81-322-2202-6_6

S. Ananthi⁷ &
P. Dhanalakshmi⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 33))

1404 Accesses
1 Citations

Abstract

Speech Synthesis functions as a medium which converts text into speech. Speech Recognition and Speech Synthesis plays a vital role in Human-Machine Interaction. Synthesized speeches are extracted from concatenating the pieces of pre-recorded speech utterances from the database. The proposed work converts the written text into a syllables (syllable text representation) using rule based approach and subsequently it converts the syllable representation to modified syllable waveform clips that can be combined together to produce as sound. Syllabic transcription attempts to describe the individual variations that occur between speakers of a dialect or language. Syllable based concatenative synthesis aims to record the syllables that a speaker uses rather than the actual spoken variants of those syllables that are produced when a speaker converse a word. The Concatenative Speech Synthesis methods provide highly understandable speech utterance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Schuller, B., Zhang, Z., Weninger, F., Burkhardt, F.: Synthesized speech for model training in cross-corpus recognition of human emotion. Int. J. Speech Technol. 15, 313–323 (2012)
Article Google Scholar
Campbell, N.: Developments in corpus-based speech synthesis: approaching natural conversational speech. IEICE Trans. 87, 497–500 (2004)
Google Scholar
Campbell, N.: Conversational speech synthesis and the need for some laughter. IEEE Trans. Audio Speech Lang. Process. 17(4), 1171–1179 (2006)
Article Google Scholar
Sreenivasa Rao, K., Yegnanarayana, B.: Intonation modeling for Indian languages. Comput. Speech Lang. 23, 240–256 (2009)
Article Google Scholar
Vowel: Online etymology dictionary. http://www.etymonline.com/index.php?allowed_in_frame=0&search=vowel&searchmode=nl. Accessed 21 Nov 2013
Atal, B.S., Hanauer, S.L.: Speech analysis and synthesis by linear prediction of the speech wave. J. Acoust. Soc. Am. 50, 637–655 (1971)
Article Google Scholar
Badin, P., Fant, G.: Notes on vocal tract computation. Techical Report, STL-QPSR (1984)
Google Scholar
Carlson, R., Sigvardson, T., Sjolander, A.: Data-driven formant synthesis. Technical Report, TMH-QPSR (2008)
Google Scholar
Banks, G.F., Hoaglin, L. W.: An experimental study of duration characteristics of voice during the expression of emotion. Speech Monogr. 8, 85–90 (1941)
Google Scholar
Clark, R.A.J., Richmond, K., King, S.: Multisyn: opendomain unit selection for the festival speech synthesis system. Speech Commun. 49, 317–330 (2007)
Google Scholar
Courbon, J.L., Emerald, F.: A text to speech machine by synthesis from diphones. In: Proceeding of ICASSP. PTR, Upper Saddle River (2002)
Google Scholar
Kim, J.K., Hahn, H.S., Bae, M.J.: On a speech multiple system implementation for speech synthesis. Wireless Pers. Commun. 49, 533–543 (2009)
Google Scholar
Saraswathi, S., Vishalakshy, R.: Design of multilingual speech synthesis system. Intell. Inform. Manage. 2, 58–64 (2010)
Google Scholar
Ahmed, M., Nisar, S.: Text-to-speech synthesis using phoneme concatenation. Int. J. Sci. Eng. Technol. 3(2), 193–197 (2014)
Google Scholar
Campell, N., Hamza, W., Hog, H., Tao, J.: Editorial special section on expressive speech synthesis. IEEE Trans. Audio Speech Lang. Process. 14, 1097–1098 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Annamalai University, Chidambaram, Tamil Nadu, India
S. Ananthi & P. Dhanalakshmi

Authors

S. Ananthi
View author publications
You can also search for this author in PubMed Google Scholar
P. Dhanalakshmi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Ananthi .

Editor information

Editors and Affiliations

School of Electrical and Information Engineering, University of South Australia, South Australia, Australia
Lakhmi C. Jain
Computer Science and Engineering, Veer Surendra Sai University of Technolo, Sambalpur, Odisha, India
Himansu Sekhar Behera
Computer Science & Engineering, Kalyani University, Nadia, West Bengal, India
Jyotsna Kumar Mandal
Dept. of Computer Science and Eng., National Institute of Technology Rourkela, Rourkela, India
Durga Prasad Mohapatra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ananthi, S., Dhanalakshmi, P. (2015). Syllable Based Concatenative Synthesis for Text to Speech Conversion. In: Jain, L., Behera, H., Mandal, J., Mohapatra, D. (eds) Computational Intelligence in Data Mining - Volume 3. Smart Innovation, Systems and Technologies, vol 33. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2202-6_6

Download citation

DOI: https://doi.org/10.1007/978-81-322-2202-6_6
Published: 12 December 2014
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2201-9
Online ISBN: 978-81-322-2202-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics