Abstract
This paper proposes a speech enhancement system based on an auditory system for noise reduction in speech that is degraded by background noises. Accordingly, the proposed system adjusts frame by frame the coefficients for both lateral inhibition and amplitude component according to the detected sections for each input frame, then reduces the noise signal using a time-delay neural network. Based on measuring signal-to-noise ratios, experiments confirm that the proposed system is effective for speech that is degraded by various noises.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chien, J.T., Lee, L.M., Wang, H.C.: Noisy speech recognition by using variance adapted hidden Markov models. IEE Electronics Letters 31(18), 1555–1556 (1995)
Sreenivas, T.V., Kirnapure, P.: Codebook constrained wiener filtering for speech enhancement. IEEE Transactions on Speech and Audio Processing 4(5), 383–389 (1996)
Boll, S.F.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, Signal Processing 27(2), 113–120 (1979)
Shamma, S.A.: Speech Processing in the Auditory System II: Lateral Inhibition and the Central Processing of Speech Evoked Activity in the Auditory Nerve. The Journal of the Acoustical Society of America 78(7), 1622–1632 (1985)
Cheng, Y.M., O’Shaughnessy, D.: Speech enhancement based conceptually on auditory evidence. IEEE Trans. Signal Processing. 39(9), 1943–1954 (1991)
Hansen, J.H.L., Nandkumar, S.: Robust Estimation of Speech in Noisy Backgrounds Based on Aspects of the Auditory Process. The Journal of the Acoustical Society of America 97(6), 3833–3849 (1995)
Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K.J.: Phoneme Recognition using Time-delay Neural Networks. IEEE Transactions on Acoustics, Speech, and Signal Processing 37(3), 328–339 (1989)
Wu, Y., Li, Y.: Robust speech/non-speech detection in adverse conditions using the fuzzy polarity correlation method. In: IEEE International Conference on Systems, Man, and Cybernetics, vol. 4, pp. 2935–2939 (2000)
Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum Mean-Square Error Log-Spectral Amplitude Estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing 33(2), 443–445 (1985)
Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum-Mean Square Error Short-Time Spectral Amplitude Estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing 32(6), 1109–1121 (1984)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Choi, JS., Park, SJ. (2007). Speech Enhancement System Based on Auditory System and Time-Delay Neural Network. In: Beliczynski, B., Dzielinski, A., Iwanowski, M., Ribeiro, B. (eds) Adaptive and Natural Computing Algorithms. ICANNGA 2007. Lecture Notes in Computer Science, vol 4432. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71629-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-71629-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71590-0
Online ISBN: 978-3-540-71629-7
eBook Packages: Computer ScienceComputer Science (R0)