Esophageal speech enhancement using source synthesis and formant patterns modification

Ali, Rym Haj; Jebara, Sofia Ben

doi:10.1007/978-0-387-72500-0_24

Part of the book series: Multimedia Systems and Applications Series (MMSA, volume 31)

Summary

This paper deals with esophageal speech, a substitution voice used by alaryngeal speakers to communicate with others. Characterized by low intensity and poor intelligibility, this voice is hard to understand. In this paper, we propose ways to enhance it. More precisely, we enhance the source excitation signal and the formant structure of the vocal tract. We replace the pitch values with those of natural speech, and we replace the source with a synthetic one based on the LF model. We also enhance the formant structure by enlarging the formant bandwidths and amplifying the formant amplitudes without increasing the background noise. Finally, we combine all modifications in a single scheme.
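As an illustration of what enlarging formant bandwidths can mean in practice, the following is a minimal sketch of one standard technique, LPC bandwidth expansion, in which the coefficients of an all-pole vocal tract filter A(z) are replaced by those of A(z/gamma). The summary does not state how the authors widen the formants, so this is an assumption and not their method; the function names, the factor `gamma`, the sampling rate and the single 700 Hz resonance are all hypothetical.

```python
import numpy as np


def expand_formant_bandwidths(lpc, gamma):
    """Return the coefficients of A(z/gamma), where A(z) = sum_k lpc[k] * z**-k.

    Scaling coefficient k by gamma**k multiplies every pole radius by gamma,
    which widens each resonance while leaving its center frequency unchanged.
    """
    lpc = np.asarray(lpc, dtype=float)
    if not 0.0 < gamma <= 1.0:
        raise ValueError("gamma must lie in (0, 1]")
    return lpc * gamma ** np.arange(lpc.size)


def pole_bandwidths_hz(lpc, fs):
    """Approximate 3 dB bandwidth B = -ln(r) * fs / pi for each pole of radius r."""
    radii = np.abs(np.roots(lpc))
    return -np.log(radii) * fs / np.pi


# Hypothetical two-pole resonator: one formant at 700 Hz, pole radius 0.98.
fs = 8000.0
radius, formant_hz = 0.98, 700.0
theta = 2.0 * np.pi * formant_hz / fs
lpc = np.array([1.0, -2.0 * radius * np.cos(theta), radius ** 2])

# Assumed expansion factor; values closer to 1 widen the formant less.
wide = expand_formant_bandwidths(lpc, gamma=0.95)

print("bandwidths before (Hz):", pole_bandwidths_hz(lpc, fs))
print("bandwidths after  (Hz):", pole_bandwidths_hz(wide, fs))
```

In a complete system the coefficients would come from frame-by-frame linear prediction analysis of the esophageal speech, and the modified filter would be driven by the synthetic excitation; both steps are outside the scope of this sketch.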

References

Gualberto Aguilar, Mariko Nakano-Miyatake, Hector Perez-Meana: Alaryngeal Speech Enhancement Using Pattern Recognition Techniques. IEICE Trans. Inf. & Syst. E88-D (2005)
A. Hisada, H. Sawada: Real-time clarification of esophageal speech using a comb filter. International Conference Series on Disability, Virtual Reality and Associated Technologies,
A. Hisada, N. Takeuchi, H. Sawada: Real-time clarification filter of a dysphonic speech and its evaluation by listening experiments. Intl. Conf. Disability, Virtual Reality & Assoc. Tech. (2004)
Gunnar Fant, Johan Liljencrants, Qi-guang Lin: A four parameter model of glottal flow. French-Sweden symposium, Grenoble,

Author information

Authors and Affiliations

Ecole Superieure des Communications de Tunis, Tunis, Tunisia
Rym Haj Ali
Ecole Superieure des Communications de Tunis, Tunis, Tunisia
Sofia Ben Jebara

Editor information

Editors and Affiliations

Dipto. Tecnologie dell'Informazione, Università Milano-Bicocca, via Festa del Perdono, 7, 20122 Milano, Italy
Ernesto Damiani
Université de Bourgogne LE2I-CNRS, Aile de l'ingénieur, 21000 Dijon, France
Kokou Yétongnon, Albert Dipanda & Richard Chbeir
Dept. Electronics and Info. Processing (ETRO), Vrije Universiteit Brussels, Pleinlaan 2, 1050 BRUXELLES, Belgium
Peter Schelkens
Université de Bourgogne LE2I-CNRS, 21000 Dijon, France
Louis Legrand

About this chapter

Cite this chapter

Ali, R.H., Jebara, S.B. (2008). Esophageal speech enhancement using source synthesis and formant patterns modification. In: Damiani, E., Yétongnon, K., Schelkens, P., Dipanda, A., Legrand, L., Chbeir, R. (eds) Signal Processing for Image Enhancement and Multimedia Processing. Multimedia Systems and Applications Series, vol 31. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-72500-0_24

DOI: https://doi.org/10.1007/978-0-387-72500-0_24
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-72499-7
Online ISBN: 978-0-387-72500-0
eBook Packages: Computer Science, Computer Science (R0)
