Skip to main content

Stochastic Feature Compensation for Robust Speaker Verification

  • Chapter
  • First Online:
Robust Speaker Recognition in Noisy Environments

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

Abstract

This chapter explores the impact of standard stereo-based stochastic feature compensation (SFC) methods for robust speaker verification in uniform noisy environments. In this work, SFC using independent as well as joint probability models are explored for compensating the effect of noise. Integration of a SFC stage in the GMM-UBM framework is proposed for speaker verification evaluation under mismatched conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. T. Kinnunen, Spectral features for automatic text-independent speaker recognition. PhD thesis, Department of Computer Science, University of Joensuu, 2004

    Google Scholar 

  2. D.A. Reynolds, Experimental evaluation of features for robust speaker identification. IEEE Trans. Speech Audio Process. 2(4), 639–643 (1994)

    Article  Google Scholar 

  3. R. Mammone, X. Zhang, R. Ramachandran, Robust speaker recognition: a feature-based approach. IEEE Signal Process. Mag. 13(5), 58–71 (1996)

    Article  Google Scholar 

  4. S. Furui, Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech Signal Process. 29(2), 254–272 (1981)

    Article  Google Scholar 

  5. H. Hermansky, N. Morgan, RASTA processing of speech. IEEE Trans. Speech Audio Process. 2(4), 578–589 (1994)

    Article  Google Scholar 

  6. S. Boll, Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27(2), 113–120 (1979)

    Article  Google Scholar 

  7. A. Acero, R.M. Stern, Environmental robustness in automatic speech recognition, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’90), Albuquerque, 1990, vol. 2, pp. 849–852

    Google Scholar 

  8. S. Sarkar, K.S. Rao, Stochastic feature compensation methods for speaker verification in noisy environments. Appl. Soft Comput. 19, 198–214 (2014). Elsevier

    Google Scholar 

  9. P.J. Moreno, B. Raj, R.M. Stern, Data-driven environmental compensation for speech recognition: a unified approach. Speech Commun. 24(4), 267–285 (1998)

    Article  Google Scholar 

  10. L. Deng, A. Acero, L. Jiang, J. Droppo, X. Huang, High-performance robust speech recognition using stereo training data, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, 2001, vol. 1, pp. 301–304

    Google Scholar 

  11. M.J.F. Gales, P.C. Woodland, Mean and variance adaptation within the MLLR framework. Comput. Speech Lang. 10, 249–264 (1996)

    Article  Google Scholar 

  12. L. Buera, E. Lleida, A. Miguel, A. Ortega, Multi-environment models based linear normalization for speech recognition in car conditions, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’04), Montreal, 2004

    Google Scholar 

  13. M. Afify, X. Cui, Y. Gao, Stereo-based stochastic mapping for robust speech recognition. IEEE Trans. Audio Speech Lang. Process. 17(7), 1325–1334 (2009)

    Article  Google Scholar 

  14. C.M. Bishop, Pattern Recognition and Machine Learning (Springer, New York, 2006)

    MATH  Google Scholar 

  15. V. Digalakis, D. Rtischev, L. Neumeyer, E. Sa, Speaker adaptation using constrained estimation of Gaussian mixtures. IEEE Trans. Speech Audio Process. 3(5), 357–366 (1995)

    Article  Google Scholar 

  16. Y. Stylianou, O. Cappe, E. Moulines, Continuous probabilistic transform for voice conversion. IEEE Trans. Speech Audio Process. 6(2), 131–142 (1998)

    Article  Google Scholar 

  17. T. Toda, A.W. Black, K. Tokuda, Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Trans. Audio Speech Lang. Process. 15(8), 2222–2235 (2007)

    Article  Google Scholar 

  18. H. Zen, Y. Nankaku, K. Tokuda, Stereo-based stochastic noise compensation based on trajectory GMMs, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’09), Taipei, 2009

    Google Scholar 

  19. K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, S. Imai, An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features, in Proceedings of the European Conference of Speech Communication Technology (EUROSPEECH ’95), Madrid, Sept 1995, pp. 757–760

    Google Scholar 

  20. NIST-speaker recognition evaluations (1995) http://www.itl.nist.gov/iad/mig/tests/spk/

  21. H. Hirsch, D. Pearce, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, in Proceedings of the International Conference of Spoken Language Processing (ICSLP ’00), Beijing, 2000

    Google Scholar 

  22. D. Reynolds, T. Quatieri, R. Dunn, Speaker verification using adapted Gaussian mixture models. Digit. Signal Process. 10(1), 19–41 (2000)

    Article  Google Scholar 

  23. A. Varga, H.J. Steeneken, Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems. Speech Commun. 12, 247–251 (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 The Author(s)

About this chapter

Cite this chapter

Rao, K.S., Sarkar, S. (2014). Stochastic Feature Compensation for Robust Speaker Verification. In: Robust Speaker Recognition in Noisy Environments. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-07130-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07130-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07129-9

  • Online ISBN: 978-3-319-07130-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics