Summary
This paper deals with esophageal speech, which is a voice of substitution used by alaryngeal persons in order to be able to communicate with others. This voice, characterized by a low intensity and poor intelligibility, is hard to understand.In this paper, we propose ideas to enhance this kind of voice. More precisely, we enhance the source excitation signal and the formant structure of the speech vocal track. We modify pitch values by those of natural speech and we replace the source by a synthetic one based on LF model. We also enhance the formant structure by enlarging formant bandwidth and we amplify their amplitudes without increasing the background noise. Then, we englobe all modification in the same scheme including all improvements.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Gualberto AGUILAR, Mariko Nakano-Miyatake, Hector Perez-Meana: Alaryngeal Speech Enhancement Using Pattern Recognition Techniques. IEICE TRANS. INF. &SYST. (2005)E88-D
A Hisada, H Sawada: Real-time clarification of esophageal speech using a comb filter. International Conference Series On Disability, Virtual Reality And Associated Technologies,
A Hisada, N Takeuchi, H Sawada: Real-time clarification filter of a dysphonic speech and its evaluation by listening experiments. Intl Conf. Disability. Virtual Reality & Assoc. Tech.(2004)
Gunnar Fant, Johan Liljencrants, Qi-guang Lin: A four parameter model of glottal flow. French-Sweden symposium, Grenoble,
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Ali, R.H., Jebara, S.B. (2008). Esophageal speech enhancement using source synthesis and formant patterns modification. In: Damiani, E., Yétongnon, K., Schelkens, P., Dipanda, A., Legrand, L., Chbeir, R. (eds) Signal Processing for Image Enhancement and Multimedia Processing. Multimedia Systems and Applications Series, vol 31. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-72500-0_24
Download citation
DOI: https://doi.org/10.1007/978-0-387-72500-0_24
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-72499-7
Online ISBN: 978-0-387-72500-0
eBook Packages: Computer ScienceComputer Science (R0)