A Study on Vocal Tract Shape Estimation and Modelling of Vocal Tract

  • Vikas
  • Deepak
  • P. K. Verma
  • R. K. Sharma
Conference paper
Part of the Algorithms for Intelligent Systems book series (AIS)


Nowadays, electronics is widespread and seeking its scope in medical and health sector enabling different tasks for persons with disability either partial or full. Diseases like asthma, Parkinson’s, etc., make the patient unable to speak. The person suffering from asthma losses muscle functionality, also unable to speak properly in the severe cases. In case of Parkinson’s diseases (PD) affected person, the voice is very vibrating in nature which is difficult to understand. From the ancient times till day, many researchers have provided different types of speech production systems that reproduce the speech very similar to the natural one. This paper reviews different speech models including some of the first mechanical model of speech synthesizer that produce five vowels, the first electrical model of vocal tract that produces continuous speech, an analogue integrated circuit vocal tract chip and many more.


Vocal tract Estimation Modelling Speech Integrated circuit 


  1. 1.
    von Kempelen W (1791) Le Mechanisme de la Parole Suivid’une Description d’une Machine Parlante, ViennaGoogle Scholar
  2. 2.
    Willis R (1829) On vowel sounds and on reed organ pipes. Trans Camb Phil Soc 3:231–268Google Scholar
  3. 3.
    Dudley H (1950) The speaking machine of Wolfgang von Kempelen. J Acoust Soc Am 22(2):151–166CrossRefGoogle Scholar
  4. 4.
    Stewart JQ (1922) An electrical analog of the vocal organs. Nature 110:311–312CrossRefGoogle Scholar
  5. 5.
    Dunn HK (1950) The calculation of vowel resonances, and an electrical vocal tract. J Acoust Soc Am 22(6):740–753CrossRefGoogle Scholar
  6. 6.
    Stevens KN, Kasowski S (1953) An electrical analog of the vocal tract. J Acoust Soc Am 25(4):734–742CrossRefGoogle Scholar
  7. 7.
    Rosen G (1960) Dynamic analog speech synthesizer. Technical report 353, 10 Feb 1960Google Scholar
  8. 8.
    Dang J (2001) Estimation of vocal tract shapes from speech sounds with a physiological articulatory model. J PhoneticsGoogle Scholar
  9. 9.
    Mermelstein P (1966) Determination of the vocal-tract shape from measured formant frequencies. J Acoust Soc Am 7Google Scholar
  10. 10.
    Wakita H (1973) Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms. IEEE Trans Audio Electroacoust Au-21(5):417–427CrossRefGoogle Scholar
  11. 11.
    Schroeder MR. Determination of the geometry of the human vocal tract by acoustic measurementsGoogle Scholar
  12. 12.
    Burrows TL, Niranjan M (1995) Vocal tract modelling with recurrent neural networks. 0-7803-2431 4/95 $4.00 0 1995. IEEEGoogle Scholar
  13. 13.
    Shadle CH, Barney A, Davies POAL (1999) Fluid flow in a dynamic mechanical model of the vocal folds and tract. II. Implications for speech production studies. J Acoust Soc Am 105(1):444–455CrossRefGoogle Scholar
  14. 14.
    Ruty N, van Hirtum A, Pelorson X, Lopez I, Hirschberg A (2005) A mechanical experimental setup to simulate vocal folds vibrations Preliminary results. ZAS Pap Linguist 40:161–175Google Scholar
  15. 15.
    Mullen J, Howard DM, Murphy DT (2006) Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality. IEEE Trans Audio Speech Lang Process 14(3):964–971CrossRefGoogle Scholar
  16. 16.
    Wee KH, Turicchia L, Sarpeshkar R (2008) An analog integrated-circuit vocal tract. IEEE Trans Biomed Circ Syst 2(4):316–327CrossRefGoogle Scholar
  17. 17.
    Barney HL, Haworth FE, Dunn HK (1959) An experimental transistorized artificial larynx. Bell Syst Tech J XXXVIII(6):1337–1356CrossRefGoogle Scholar
  18. 18.
    Kuc R, Tuteur F, Vaisnys J (1985) Determining vocal tract shape by applying dynamic constraints. In: ICASSP ‘85 IEEE international conference on acoustics, speech, and signal processing, pp 1101–1104Google Scholar
  19. 19.
    Sondhi M, Schroeter J (1987) A hybrid time-frequency domain articulatory speech synthesizer. IEEE Trans Acoust Speech Sig Process 35(7):955–967CrossRefGoogle Scholar
  20. 20.
    Dang J, Honda K (2002) Estimation of vocal tract shapes from speech sounds with a physiological articulatory model. J PhoneticsGoogle Scholar
  21. 21.
    Wee KH, Turicchia L, Sarpeshkar R (2010) An articulatory speech-prosthesis system. In: 2010 International conference on body sensor networks, Singapore, pp 133–138Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.Institute of Electronics EngineeringNational Tsing Hua UniversityHsinchuTaiwan
  2. 2.Department of ECEPDM UniversityBahadurgarhIndia
  3. 3.Electronics Engineering DepartmentRajkiya Engineering CollegeSonbhadraIndia
  4. 4.School of VLSI Design & Embedded SystemsNIT KurukshetraKurukshetraIndia

Personalised recommendations