Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2013)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7911)

Abstract

This paper presents a preliminary study on voice excitation modeling by selection of a single glottal shape parameter. A strategy for direct model selection is explored in which derivative glottal source estimates are matched against LF-based candidates driven by the Rd parameter, using two state-of-the-art similarity measures and a novel one that considers spectral envelope information. An experimental study on synthetic singing-voice was carried out to compare the performance of the different measures and to observe potential relations with voice characteristics (e.g. vocal effort, pitch range, amount of aperiodicities and aspiration noise). The results indicate competitive performance of the proposed strategy and suggest preferable source modeling conditions for stationary singing-voice.
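The selection strategy described in the abstract (generate one candidate per Rd value, score each against the source estimate, keep the best match) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: `toy_pulse` is a hypothetical stand-in that is NOT the LF model, the Rd grid is arbitrary, and `envelope_distance` is a simple gain-normalized log-magnitude distance over the lowest DFT bins rather than the envelope-based measure the paper proposes.

```python
import cmath
import math


def toy_pulse(rd, n=64):
    """Hypothetical stand-in for an Rd-driven candidate (NOT the LF model).

    A decaying exponential whose spectral tilt changes with rd.
    """
    return [math.exp(-k / (4.0 * rd)) for k in range(n)]


def low_band_log_spectrum(x, n_bins=8):
    """Log-magnitude of DFT bins 1..n_bins, with the mean removed.

    Removing the mean makes the result insensitive to overall gain.
    """
    n = len(x)
    spec = []
    for b in range(1, n_bins + 1):
        c = sum(x[k] * cmath.exp(-2j * math.pi * b * k / n) for k in range(n))
        spec.append(math.log(abs(c) + 1e-12))
    mean = sum(spec) / len(spec)
    return [s - mean for s in spec]


def envelope_distance(x, y, n_bins=8):
    """RMS difference between two low-band log spectra (assumed measure)."""
    sx = low_band_log_spectrum(x, n_bins)
    sy = low_band_log_spectrum(y, n_bins)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(sx, sy)) / n_bins)


def select_rd(estimate, rd_grid, make_candidate=toy_pulse):
    """Return the grid value whose candidate is closest to the estimate."""
    return min(
        rd_grid,
        key=lambda rd: envelope_distance(
            estimate, make_candidate(rd, len(estimate))
        ),
    )


if __name__ == "__main__":
    grid = [0.5, 1.0, 1.5, 2.0, 2.5]   # arbitrary illustrative grid
    estimate = toy_pulse(1.5)          # pretend this came from inverse filtering
    print(select_rd(estimate, grid))
```

In a real setting the estimate would come from a glottal inverse-filtering method and the candidates from an LF-model implementation; only the grid-search structure carries over from this sketch.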

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Villavicencio, F. (2013). Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science, vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_6

  • DOI: https://doi.org/10.1007/978-3-642-38847-7_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38846-0

  • Online ISBN: 978-3-642-38847-7

  • eBook Packages: Computer Science, Computer Science (R0)
