Deep Neural Network Based Speech Enhancement

Ram, Rashmirekha; Mohanty, Mihir Narayan

doi:10.1007/978-981-13-0617-4_27

Conference paper
First Online: 12 August 2018

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 768)

Abstract

Enhancing speech signals is an essential task in adverse acoustic environments. Many algorithms have been designed over the years to improve speech quality. Neural networks and their variants have mostly been used for classification. This paper presents a speech enhancement method based on a Deep Neural Network (DNN) that improves speech quality and increases the Signal-to-Noise Ratio of the speech signal. Different hidden-layer configurations are tested. Audio features are extracted using the short-time Fourier transform, and these features improve the enhancement performance of the DNN. Segmental Signal-to-Noise Ratio (SegSNR) and Perceptual Evaluation of Speech Quality (PESQ) are used to evaluate the results.
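As an illustration of the SegSNR measure named in the abstract, the following is a minimal sketch in plain Python, not the authors' implementation. The abstract does not give the evaluation settings, so the frame length, hop size, and the clamping range of -10 dB to 35 dB (a common convention for this metric) are assumptions, as is the function name `segmental_snr`.

```python
import math


def segmental_snr(clean, enhanced, frame_len=256, hop=128,
                  snr_min=-10.0, snr_max=35.0, eps=1e-10):
    """Mean per-frame SNR in dB, each frame clamped to [snr_min, snr_max].

    frame_len, hop, snr_min and snr_max are assumed defaults; the paper's
    abstract does not state the values used.
    """
    if len(clean) != len(enhanced):
        raise ValueError("clean and enhanced must have the same length")
    if len(clean) < frame_len:
        raise ValueError("signal is shorter than one frame")

    frame_snrs = []
    for start in range(0, len(clean) - frame_len + 1, hop):
        c = clean[start:start + frame_len]
        e = enhanced[start:start + frame_len]
        signal_energy = sum(s * s for s in c)
        # Residual between the clean reference and the processed signal.
        error_energy = sum((s - y) ** 2 for s, y in zip(c, e))
        snr_db = 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))
        # Clamp so silent or perfectly matched frames do not dominate the mean.
        frame_snrs.append(min(max(snr_db, snr_min), snr_max))
    return sum(frame_snrs) / len(frame_snrs)


# Usage: a 440 Hz tone corrupted by a weaker 3 kHz interfering tone.
fs = 16000
clean = [math.sin(2 * math.pi * 440.0 * n / fs) for n in range(fs)]
noisy = [s + 0.1 * math.cos(2 * math.pi * 3000.0 * n / fs)
         for n, s in enumerate(clean)]
print(round(segmental_snr(clean, noisy), 2))
```

Averaging the SNR frame by frame, rather than computing one ratio over the whole utterance, keeps loud segments from masking poor performance in quiet ones. PESQ is a standardized perceptual model and is not reproduced here.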

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, India
Rashmirekha Ram & Mihir Narayan Mohanty

Corresponding author

Correspondence to Mihir Narayan Mohanty.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Vignana Bharathi Institute of Technology, Hyderabad, Telangana, India
Pradeep Kumar Mallick
Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Department of Electrical and Electronics Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Rangpo, India
Akash Kumar Bhoi
Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, Middlesex, United Kingdom
Ahmed F. Zobaa

About this paper

Cite this paper

Ram, R., Mohanty, M.N. (2019). Deep Neural Network Based Speech Enhancement. In: Mallick, P., Balas, V., Bhoi, A., Zobaa, A. (eds) Cognitive Informatics and Soft Computing. Advances in Intelligent Systems and Computing, vol 768. Springer, Singapore. https://doi.org/10.1007/978-981-13-0617-4_27

DOI: https://doi.org/10.1007/978-981-13-0617-4_27
Published: 12 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0616-7
Online ISBN: 978-981-13-0617-4
eBook Packages: Intelligent Technologies and Robotics (R0)
