Modeling of the Speech Process Including Anatomical Structure of the Vocal Tract
The most important features of voice processing have been presented. The properties of glottal waves have been extracted using recorded microphone signals of the speech. Therefore, it was necessary to solve the inverse problem, of finding the glottis input of the whole vocal tract, having the resulting output waves of the speech process. The frequency parameters of glottal waves have been extracted using a vocal tract model. The autocorrelation and cepstrum methods are also helpful in such extraction. The results are important not only for speaker identification and emotion recognition, but can also be helpful for glottis malfunction diagnosis.
KeywordsSpeech Signal Emotion Recognition Vocal Tract Speaker Verification Speaker Identification
- 3.Ciota Z (2004) Speaker verification for multimedia application. In: Proceedings of the IEEE international conference on systems, man and cybernetics. The Hague, The Netherlands, pp 2752–2756Google Scholar
- 4.Ciota Z (2005) Emotion recognition on the basis of human speech. In: Proceedings of the international conference on applied electromagnetics and communications. Dubrovnik, Croatia, pp 467–470Google Scholar
- 7.Mozzafry B, Tinati MA, Aghagolzadeh A, Erfanian A (2006) An adaptive algorithm for speech source separation in overcomplete cases using wavelet packets. In: Proceedings of the 5th WSEAS international conference on signal processing. Istanbul, Turkey, pp 140–144Google Scholar
- 8.Santon J (ed) (1996) Progress in speech synthesis. Springer, New YorkGoogle Scholar