Improving Performance of a Noise Reduction Algorithm by Switching the Analysis Filter Bank

  • Hamid Sepehr
  • Amir Y. Nooralahiyan
  • Paul V. Brennan
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6134)

Abstract

This paper proposes a new approach to preserving the transient parts of speech in a noise reduction system. Transient components of speech, such as vowel onsets and the beginnings of some consonants (for example, stop sounds), are important for speech intelligibility. Noise reduction algorithms usually attenuate these components because block-based noise reduction techniques have low temporal resolution. The proposed method detects the transient components of speech and then dynamically switches the analysis filter bank at the front end of the noise reduction system to provide higher resolution in the time domain. The optimal spectral gain values are transformed into the time domain to form a linear filter that performs the noise reduction, and only group delay equalisation is performed to avoid discontinuities. Our objective evaluation shows that the proposed method outperforms noise reduction with analysis filter banks of fixed time/frequency resolution.
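The step of turning spectral gains into a time-domain linear filter, followed by group delay equalisation, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the filter design, window, or frame lengths, so the inverse-FFT design, the Hann taper, and the names `gains_to_fir` and `filter_with_common_delay` are all assumptions made for illustration.

```python
import numpy as np


def gains_to_fir(gains):
    """Convert real spectral gains (rfft bins 0..N/2) into a causal FIR filter.

    Hypothetical helper: returns the filter taps and their group delay in samples.
    """
    gains = np.asarray(gains, dtype=float)
    n_fft = 2 * (len(gains) - 1)
    # Real (zero-phase) gains give an impulse response symmetric about n = 0.
    h_zero_phase = np.fft.irfft(gains, n=n_fft)
    # A circular shift by N/2 makes the filter causal with constant group delay.
    h = np.roll(h_zero_phase, n_fft // 2)
    # Taper the taps to reduce truncation ripple (window choice is an assumption).
    h = h * np.hanning(n_fft + 1)[:n_fft]
    return h, n_fft // 2


def filter_with_common_delay(x, gains, target_delay):
    """Filter x and pad the output so its total delay equals target_delay samples."""
    h, delay = gains_to_fir(gains)
    if delay > target_delay:
        raise ValueError("target_delay must be at least the filter's group delay")
    y = np.convolve(x, h)[: len(x)]
    # Extra delay so that long and short filters line up at the output.
    pad = target_delay - delay
    return np.concatenate([np.zeros(pad), y])[: len(x)]


# Usage: a long (N = 64) and a short (N = 16) filter share a 32-sample delay.
x = np.zeros(64)
x[0] = 1.0
y_long = filter_with_common_delay(x, np.ones(33), target_delay=32)
y_short = filter_with_common_delay(x, np.ones(9), target_delay=32)
```

Because both outputs carry the same overall delay, a system could in principle switch between the long and short filters from one block to the next without a time misalignment at the switching point. How the paper itself handles the transition is not described in the abstract.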

Keywords

  • Speech Signal
  • Noise Reduction
  • Filter Bank
  • Audio Signal
  • Speech Enhancement
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Hamid Sepehr (1, 2)
  • Amir Y. Nooralahiyan (1)
  • Paul V. Brennan (2)
  1. ElaraTek LTD
  2. University College London
