Improvements to the HNR Estimation Based-on Generalized Variogram

  • Diana Torres-Boza
  • Carlos A. Ferrer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8258)


The presence of an unusual high level of turbulent noise in voice signals is related to air leakage in the glottis as a result of incomplete closure of the vocal cords. Harmonics to Noise Ratio (HNR) is an acoustic measure that intends to appraise the amount of that turbulent noise. Several algorithms have been proposed in both time and frequency domain to estimate HNR. The Generalized Variogram (GV) is a time-domain technique proposed for HNR estimation based on a similitude function between two speech windows. The drawbacks of the GV are related to the biased estimation of the amplitude ratio and the final HNR value. The present work deals with these limitations and proposes unbiased estimators. The experimental results show that the described improvements outperform the original GV.


harmonics to noise ratio additive noise shimmer variogram 


  1. 1.
    Hillenbrand, J.A.: Methothological Study of Perturbations and Additive Noise in Synthetically Generated Voice Signals. Journal of Speech and Hearing Research 30, 448–461 (1987)Google Scholar
  2. 2.
    Hillenbrand, J.A.: Perception of aperiodicities in synthetically generated signals. Journal of Speech and Hearing Research 83, 2361–2371 (1988)Google Scholar
  3. 3.
    Hillenbrand, J., Cleveland, R., Erickson, R.: Acoustic Correlates of Breathy Vocal Quality. Journal of Speech and Hearing Research 30, 448–461 (1994)Google Scholar
  4. 4.
    Yumoto, E., Gould, J.W., Baer, T.: Harmonics to Noise ratio as an index of the degree of hoarseness 71, 1544–1550 (1982)Google Scholar
  5. 5.
    Schoengten, J.: Spectral Models of Additive and Modulation Noise in Speech and Phonatory Excitation Signals. Journal of the Acoustical Society of America 113, 553–562 (2003)CrossRefGoogle Scholar
  6. 6.
    Qi, Y., Hillman, R.E.: Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. Journal of the Acoustical Society of America 102, 537–543 (1997)CrossRefGoogle Scholar
  7. 7.
    Qi, Y., Hillman, R.E., Milstein, C.: The estimation of signal-to-noise ratio in continuous speech for disordered voices. JASA 105, 2532–2535 (1999)CrossRefGoogle Scholar
  8. 8.
    Bettens, F., Grenez, F., Schoengten, J.: Estimation of vocal dysperiodicities in disordered speech by means of distant-sample bidirectional linear predictive analysis. Journal of the Acoustical Society of America 117, 328–334 (2005)CrossRefGoogle Scholar
  9. 9.
    Kacha, A., Grenez, F., Schoentgen, J., Benmahammend, K.: Dysphonic Speech Analysis Using Generalized Variogram. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 917–920. IEEE Press, Philadelphia (2005)Google Scholar
  10. 10.
    Haslett, J.: On the sample variogram and sample autocovariance for non-stationary time series. The Statistician 46(4), 475–485 (1997)Google Scholar
  11. 11.
    Murphy, P., Akande, O.: Quantification of glottal and voiced speech harmonics to noise ratio using cepstral-based estimation. In: ITRW on Nonlinear Speech Processing, pp. 19–22 (2005)Google Scholar
  12. 12.
    Mienkovic, P.: Least mean Squared Error Measures of Voice Perturbations. Journal of Speech and Hearing Research 30, 529–538 (1987)Google Scholar
  13. 13.
    Medan, Y.: Super resolution pitch determination of speech signals. IEEE Transaction on Signal Processing 39, 40–48 (1991)CrossRefGoogle Scholar
  14. 14.
    Parsa, V., Jamieson, D.G.: A comparison of high precision F0 extraction algorithms for sustained vowels. Journal of Speech, Language and Hearing Research 42, 112–126 (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Diana Torres-Boza
    • 1
  • Carlos A. Ferrer
    • 1
  1. 1.Center for Studies on Electronic and Information TechnologiesCentral University Marta Abreu of Las VillasSanta ClaraCuba

Personalised recommendations