Speech Enhancement System Based on Auditory System and Time-Delay Neural Network

Choi, Jae-Seung; Park, Seung-Jin

doi:10.1007/978-3-540-71629-7_18

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4432)

Included in the following conference series:

International Conference on Adaptive and Natural Computing Algorithms

Abstract

This paper proposes a speech enhancement system, based on the auditory system, for reducing noise in speech degraded by background noise. For each input frame, the proposed system adjusts the coefficients for both lateral inhibition and the amplitude component according to the section detected for that frame, then reduces the noise signal using a time-delay neural network. Experiments measuring signal-to-noise ratios confirm that the proposed system is effective for speech degraded by various noises.
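As a rough illustration of two ideas named in the abstract, the sketch below shows a generic lateral-inhibition step on an amplitude spectrum and a signal-to-noise ratio measurement. It is not the authors' implementation: the three-tap kernel, the `center` and `surround` values, the clipping at zero, and the function names `lateral_inhibition` and `snr_db` are all assumptions made here for illustration. The paper adjusts its coefficients frame by frame, which this fixed-coefficient sketch does not attempt.

```python
import math


def lateral_inhibition(amplitudes, center=1.0, surround=0.25):
    """Sharpen spectral peaks with a three-tap kernel (-surround, center, -surround).

    Hypothetical coefficients: each bin is excited by its own amplitude and
    inhibited by its two neighbours. Edge bins reuse their own value for the
    missing neighbour, and negative results are clipped to zero.
    """
    n = len(amplitudes)
    out = []
    for k in range(n):
        left = amplitudes[max(k - 1, 0)]
        right = amplitudes[min(k + 1, n - 1)]
        y = center * amplitudes[k] - surround * (left + right)
        out.append(max(y, 0.0))
    return out


def snr_db(clean, processed):
    """SNR in dB, treating (processed - clean) as the residual noise."""
    signal_power = sum(s * s for s in clean)
    noise_power = sum((p - s) ** 2 for s, p in zip(clean, processed))
    if noise_power == 0:
        return math.inf  # identical signals: no residual noise
    return 10.0 * math.log10(signal_power / noise_power)


if __name__ == "__main__":
    # A single spectral peak keeps its centre bin; its flanks are suppressed.
    print(lateral_inhibition([0, 1, 4, 1, 0]))

    # A constant 0.1 offset on a unit-amplitude signal gives roughly 20 dB.
    clean = [1.0, -1.0, 1.0, -1.0]
    noisy = [1.1, -0.9, 1.1, -0.9]
    print(round(snr_db(clean, noisy), 3))
```

In an evaluation of the kind the abstract describes, a measure like `snr_db` would be computed before and after enhancement so the two values can be compared.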

Author information

Authors and Affiliations

Department of Electronics Engineering, Silla University, San 1-1, Gwaebop-dong, Sasang-gu, Busan, Korea
Jae-Seung Choi
Department of Biomedical Engineering, Chonnam National University Hospital & Medical School, Gwangju, Korea
Seung-Jin Park

Editor information

Bartlomiej Beliczynski, Andrzej Dzielinski, Marcin Iwanowski, Bernardete Ribeiro

About this paper

Cite this paper

Choi, JS., Park, SJ. (2007). Speech Enhancement System Based on Auditory System and Time-Delay Neural Network. In: Beliczynski, B., Dzielinski, A., Iwanowski, M., Ribeiro, B. (eds) Adaptive and Natural Computing Algorithms. ICANNGA 2007. Lecture Notes in Computer Science, vol 4432. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71629-7_18

DOI: https://doi.org/10.1007/978-3-540-71629-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71590-0
Online ISBN: 978-3-540-71629-7
eBook Packages: Computer Science, Computer Science (R0)
