Skip to main content

Modeling Fluctuations of Voiced Excitation for Speech Generation Based on Recursive Volterra Systems

  • Conference paper
  • 687 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3817))

Abstract

For the modeling of the speech production system linear models are widely used. However, not all features of speech can be covered by linear systems. Therefore nonlinear systems are interesting in speech processing. In this contribution a time variable recursive Volterra system is used to model the fluctuations of the voiced excitation while a linear system models the resonances of the speech production system. The estimation of the Volterra system is performed by a prediction algorithm. The prediction problem is solved with the aid of an approximation by series expansion. Speech examples show that the use of a time variable Volterra system improves the naturalness of the synthetic speech.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. McLaughlin, S.: Nonlinear Speech Synthesis. In: Proc. EUSIPCO-2002, Toulouse France, pp. 211–218 (2002)

    Google Scholar 

  2. Bavegard, B., Fant, G.: Notes on glottal source interaction ripple. STL-QPSR (4). Royal Institute of Technology Stockholm, 63–77 (1994)

    Google Scholar 

  3. Sambur, M.R., Rosenberg, A.E., Rabiner, L.R., McGonegal, C.A.: On reducing the buzz in LPC synthesis. J. Acoust. Soc. Am. 63, 918–924 (1978)

    Article  Google Scholar 

  4. Schnell, K., Lacroix, A.: Speech Production Based on Lossy Tube Models: Unit Concatenation and Sound Transitions. In: Proc. INTERSPEECH-2004 ICSLP, Jeju-Island Korea, vol. I, pp. 505–508 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schnell, K., Lacroix, A. (2006). Modeling Fluctuations of Voiced Excitation for Speech Generation Based on Recursive Volterra Systems. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science(), vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_30

Download citation

  • DOI: https://doi.org/10.1007/11613107_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31257-4

  • Online ISBN: 978-3-540-32586-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics