Skip to main content

Esophageal speech enhancement using source synthesis and formant patterns modification

  • Chapter

Part of the book series: Multimedia Systems and Applications Series ((MMSA,volume 31))

Summary

This paper deals with esophageal speech, which is a voice of substitution used by alaryngeal persons in order to be able to communicate with others. This voice, characterized by a low intensity and poor intelligibility, is hard to understand.In this paper, we propose ideas to enhance this kind of voice. More precisely, we enhance the source excitation signal and the formant structure of the speech vocal track. We modify pitch values by those of natural speech and we replace the source by a synthetic one based on LF model. We also enhance the formant structure by enlarging formant bandwidth and we amplify their amplitudes without increasing the background noise. Then, we englobe all modification in the same scheme including all improvements.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   89.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gualberto AGUILAR, Mariko Nakano-Miyatake, Hector Perez-Meana: Alaryngeal Speech Enhancement Using Pattern Recognition Techniques. IEICE TRANS. INF. &SYST. (2005)E88-D

    Google Scholar 

  2. A Hisada, H Sawada: Real-time clarification of esophageal speech using a comb filter. International Conference Series On Disability, Virtual Reality And Associated Technologies,

    Google Scholar 

  3. A Hisada, N Takeuchi, H Sawada: Real-time clarification filter of a dysphonic speech and its evaluation by listening experiments. Intl Conf. Disability. Virtual Reality & Assoc. Tech.(2004)

    Google Scholar 

  4. Gunnar Fant, Johan Liljencrants, Qi-guang Lin: A four parameter model of glottal flow. French-Sweden symposium, Grenoble,

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Ali, R.H., Jebara, S.B. (2008). Esophageal speech enhancement using source synthesis and formant patterns modification. In: Damiani, E., Yétongnon, K., Schelkens, P., Dipanda, A., Legrand, L., Chbeir, R. (eds) Signal Processing for Image Enhancement and Multimedia Processing. Multimedia Systems and Applications Series, vol 31. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-72500-0_24

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-72500-0_24

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-72499-7

  • Online ISBN: 978-0-387-72500-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics