Advertisement

Single channel speech enhancement using an MVDR filter in the frequency domain

  • Sonay KammiEmail author
Article
  • 18 Downloads

Abstract

Single channel speech enhancement is typically referred to the methods in which a filter is applied to the noisy speech to recover enhanced speech signal. In these methods, noise reduction causes speech distortion. So, it is a key concern to control the tradeoff between noise reduction and speech distortion in designing speech enhancement algorithms. This paper deals with the frequency domain single channel speech enhancement performed via short time furrier transform (STFT). Conventional frequency domain methods treat the STFT coefficients independently ignoring neighboring correlations. In this paper, we take into account neighboring correlations and derive a minimum variance distortionless response filter in the frequency domain. Experimental results show merits of our proposed method.

Keywords

Single channel speech enhancement MVDR filter Noise reduction Speech distortion 

Notes

References

  1. Benesty, J., & Chen, J. (2011). Optimal time-domain noise reduction filters—A theoretical study. Springer briefs in electrical and computer engineering. New York: Springer.CrossRefGoogle Scholar
  2. Benesty, J., Chen, J., Huang, Y., & Cohen, I. (2009). Noise reduction in speech processing. New York: Springer.Google Scholar
  3. Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech and Signal Processing, 27, 113–120.CrossRefGoogle Scholar
  4. Chen, B., & Loizou, C. (2007). A Laplacian-based MMSE estimator for speech enhancement. Speech Communication, 49, 134–143.CrossRefGoogle Scholar
  5. Chen, I., & Benesty, J. (2012). Single-channel noise reduction in the STFT domain based on the bifrequency spectrum. In Proceeding of IEEE international conference on acoustics, speech and signal processing, pp. 97–100.Google Scholar
  6. Cohen, I. (2003). Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Transactions on Speech and Audio Processing, 11, 466–475.CrossRefGoogle Scholar
  7. Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum mean square error short time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech and Signal Processing, 32, 1109–1130.CrossRefGoogle Scholar
  8. Hermus, K., Wambacq, P., & Hamme, H. V. (2007). A review of signal subspace speech enhancement and its application to noise robust speech recognition. EURASIP Journal on Advances in Signal Processing.  https://doi.org/10.1155/2007/45821.MathSciNetzbMATHGoogle Scholar
  9. Hu, Y., & Loizou, P. C. (2006). Evaluation of objective measures for speech enhancement. In Proceedings of interspeech, pp. 1447–1450.Google Scholar
  10. Hu, Y., & Loizou, P. C. (2007). Subjective comparison and evaluation of speech enhancement algorithms. Speech Communication, 49, 588–601.CrossRefGoogle Scholar
  11. Huang, H., Zhao, L., Chen, J., & Benesty, J. (2014). A minimum variance distortionless response filter based on the bifrequency spectrum for single-channel noise reduction. Digital Signal Processing, 33, 169–179.CrossRefGoogle Scholar
  12. Huang, Y. A., & Benesty, J. (2012). A multi-frame approach to the frequency-domain single-channel noise reduction problem. IEEE Transactions on Audio, Speech, and Language Processing, 20, 1256–1269.CrossRefGoogle Scholar
  13. ITU. (2000). Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs. In ITU-T recommendation 862.Google Scholar
  14. Jensen, J. R., Benesty, J., Christensen, M. G., & Chen, J. (2013). A class of optimal rectangular filtering matrices for single-channel signal enhancement in the time domain. IEEE Transactions on Audio, Speech, and Language Processing, 21, 2595–2606.CrossRefGoogle Scholar
  15. Loizou, P. C. (2007). Speech enhancement: Theory and practice. Boca Raton: CRC Press.CrossRefGoogle Scholar
  16. Lu, Y., & Loizou, P. C. (2008). A geometric approach to spectral subtraction. Speech Communication, 50, 453–466.CrossRefGoogle Scholar
  17. Pearce, D., & Hirsch, H. G. (2000). The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In Proceeding of international conference on spoken language processing.Google Scholar
  18. Saadoune, A., Amrouche, A., & Selouani, S. A. (2014). Perceptual subspace speech enhancement using variance of the reconstruction error. Digital Signal Processing, 24, 187–196.CrossRefGoogle Scholar
  19. Trawicki, M. B., & Johnson, M. T. (2014). Speech enhancement using Bayesian estimators of the perceptually-motivated short-time spectral amplitude (STSA) with Chi speech priors. Speech Communication, 57, 101–113.CrossRefGoogle Scholar
  20. Upadhyay, N., & Karmakar, A. (2015). Speech enhancement using spectral subtraction-type algorithms: A comparison and simulation study. Procedia Computer Science, 54, 574–584.CrossRefGoogle Scholar
  21. Varga, A., & Steeneken, H. J. M. (1993). NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12, 247–251.CrossRefGoogle Scholar
  22. Vincent, E., Fevotte, C., & Gribonval, R. (2006). Performance measurement in blind audio source separation. IEEE Transactions on Audio, Speech and Language Processing, 14, 1462–1469.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.Faculty of Electrical and Computer EngineeringBabol Noshirvani University of TechnologyBabolIran

Personalised recommendations