English Text to Speech Synthesizer Using Concatenation Technique

  • Conference paper

Part of the book series: Communications in Computer and Information Science (CCIS, volume 905)

Abstract

A text-to-speech (TTS) synthesis system produces artificial human speech from input text. Text in any language can be converted into a speech signal using a TTS system. This paper presents a method for designing a TTS system for the English language. A container map data structure is used to design the system, and phoneme concatenation is performed to obtain the speech signal for the input text. Forty-two phonetically rich English words are recorded, and phonemes are then extracted from these recordings using the PRAAT tool. The extracted phonemes are compared with the phonemes of the input text and concatenated sequentially to reconstruct the desired words. The method is simple to implement and requires less memory.
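The map-based concatenation step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: a Python dict stands in for the container map, short sine bursts stand in for the phoneme segments that the paper extracts from recorded words, and the names SAMPLE_RATE, UNIT_LENGTH, PHONEME_MAP, LEXICON, placeholder_unit, and synthesize_word are all hypothetical.

```python
import math

SAMPLE_RATE = 16000  # assumed sampling rate in Hz (not stated in the abstract)
UNIT_LENGTH = 800    # assumed samples per placeholder unit (50 ms at 16 kHz)


def placeholder_unit(freq_hz):
    """Return a short sine burst that stands in for a recorded phoneme.

    Real units would be phoneme segments cut from recorded words;
    these tones are placeholders only and do not sound like speech.
    """
    return [
        math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE)
        for i in range(UNIT_LENGTH)
    ]


# Map from phoneme label to waveform samples (the "container map" role).
# The labels and frequencies below are illustrative assumptions.
PHONEME_MAP = {
    "k": placeholder_unit(300),
    "ae": placeholder_unit(500),
    "t": placeholder_unit(700),
    "b": placeholder_unit(200),
}

# Hypothetical word-to-phoneme lexicon; the abstract does not say how
# input text is converted to a phoneme sequence.
LEXICON = {
    "cat": ["k", "ae", "t"],
    "bat": ["b", "ae", "t"],
}


def synthesize_word(word):
    """Look up each phoneme of the word and concatenate the stored units in order."""
    samples = []
    for phoneme in LEXICON[word.lower()]:
        samples.extend(PHONEME_MAP[phoneme])  # KeyError if the unit is missing
    return samples


if __name__ == "__main__":
    signal = synthesize_word("cat")
    print(len(signal), "samples")
```

Storing each phoneme once and reusing it across words is what keeps the stored inventory small in this sketch: "cat" and "bat" share the units for "ae" and "t".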

Author information

Corresponding author

Correspondence to Sai Sawant.

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sawant, S., Deshpande, M. (2018). English Text to Speech Synthesizer Using Concatenation Technique. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T. (eds) Advances in Computing and Data Sciences. ICACDS 2018. Communications in Computer and Information Science, vol 905. Springer, Singapore. https://doi.org/10.1007/978-981-13-1810-8_47

  • DOI: https://doi.org/10.1007/978-981-13-1810-8_47

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-1809-2

  • Online ISBN: 978-981-13-1810-8

  • eBook Packages: Computer Science, Computer Science (R0)
