Performance Evaluation of STSA Based Speech Enhancement Techniques for Speech Communication System

Ramprasad, Boriwal Poojakumari; Jain, Naveen; Sabir, Mohammad; Maurya, Vijendra

doi:10.1007/978-3-030-24322-7_68

Performance Evaluation of STSA Based Speech Enhancement Techniques for Speech Communication System

Boriwal Poojakumari Ramprasad⁹,
Naveen Jain¹⁰,
Mohammad Sabir¹⁰ &
…
Vijendra Maurya¹⁰

Conference paper
First Online: 13 July 2019

777 Accesses

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 3))

Abstract

Researchers present noise suppression model for reducing the spectral effects of acoustically added noise in speech. Background noise which is acoustically added to speech may decrease the performance of digital voice processors that are used for applications such as speech compression, recognition, and authentication. [6, 7] In proposed paper different types of Short Time Spectral Amplitude (STSA) [1, 17] based methods are explained to decrease the noise. Spectral subtraction gives a computationally efficient, processor- independent approach to effective digital speech analysis. But as a result of artifact, another synthetic noise may be produced by algorithm that is called musical noise. In spectral subtraction methods, there is shown less trade-off between residual and musical noise so the quality and intelligibility of signal is not maximized at required level. [8] To overcome from the problem of musical noise, wiener filter and statistical based model methods are discovered and some proposed modifications [7,8,9,10,11] are suggested in every methods to make it more effective.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Li J, Deng L, Haeb-Umbach R, Gong Y (2016) Robust automatic speech recognition, Chapter 6.3: Sampling-based methods, ISBN: 978-0-12802398-3. Elsevier
Google Scholar
Boll SF (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. ASSP-27:113–120, April
Google Scholar
Wang Y, Brookes M (2016) Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior. In: Proc. IEEE intl. conf. on Acoustics, Speech and Signal Processing (ICASSP), pp 5225–5229, March
Google Scholar
Scalart P, Filho JV (1996) Speech enhancement based on a priori signal to noise ratio estimation. In: Proc. IEEE international conference on Acoustics, Speech and Signal Processing ICASSP 96, pp 629–632, May
Google Scholar
Ephrahim Y, Malah D (1985) Speech enhancement using a minimum mean square error log spectral amplitude estimator. IEEE trans. on Acoustics, Speech and Signal Processing, vol. ASSP-33, no. 2, pp 443–445, April
Google Scholar
Xie D, Zhang W (2014) Estimating speech spectral amplitude based on the Nakagami approximation. IEEE Signal Process. Lett. 21(11):1375–1379
Article Google Scholar
Doire CSJ (2016) Single-channel enhancement of speech corrupted by reverberation and noise. Ph.D. dissertation, Imperial College London
Google Scholar
Liang D, Hoffman MD, Mysore GJ (2015) Speech dereverberation using a learned speech model. IEEE international conference on Acoustic, Speech and Signal Processing (ICASSP)
Google Scholar
Wang Y, Brookes M (2016) Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior. IEEE international conference on Acoustics, Speech and Signal Processing (ICASSP)
Google Scholar
Duchi J, Hazan E, Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12(Jul):2121–2159
Google Scholar
Erkelens JS, Hendriks RC, Heusdens R, Jensen J (2007) Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors. IEEE Trans. Speech Audio Process. 15(6):1741–1752
Article Google Scholar
Navya Sri. Ramakrishna Murty MM, et al (2017) Robust features from speech by using Gaussian mixture model classification. International conference and published proceeding in SIST series, vol 2. Springer, pp 437–444, August
Google Scholar
Yu Wang, Mike Beookes (2018) IEEE members. IEEE/ACM transactions on audio, speech and language processing, vol 26, no 3, March
Google Scholar
Dionelis N, Brookes M (2017) 25th European Signal Processing Conference (EUSIPCO)
Google Scholar
Brookes M (1998–2016) VOICEBOX: A speech processing toolbox for MATLAB. http://www.ee.imperial.ac.uk/hp/staff/dmb/voicebox/voiceboxhtml
Wang Y, Narayanan A, Wang D (2014) On training targets for supervised speech separation. IEEE/ACM trans. on audio, speech and language processing 22(12):1849–1858
Google Scholar
Boulanger-Lewandowski N, Mysore GJ, Hoffman M (2014) Exploiting long-term temporal dependencies in NMF using recurrent neural networks with application to source separation. IEEE international conference on Acoustic, Speech and Signal Processing (ICASSP)
Google Scholar

Download references

Author information

Authors and Affiliations

GITS, Udaipur, India
Boriwal Poojakumari Ramprasad
ECE Department, GITS, Udaipur, India
Naveen Jain, Mohammad Sabir & Vijendra Maurya

Authors

Boriwal Poojakumari Ramprasad
View author publications
You can also search for this author in PubMed Google Scholar
Naveen Jain
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Sabir
View author publications
You can also search for this author in PubMed Google Scholar
Vijendra Maurya
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Naveen Jain , Mohammad Sabir or Vijendra Maurya .

Editor information

Editors and Affiliations

School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
Suresh Chandra Satapathy
Department of CSE, CMR Technical Campus, Hyderabad, Telangana, India
K. Srujan Raju
Department of CSE, University College of Engineering, Osmania University, Hyderabad, Telangana, India
K. Shyamala
Department of ECE, University College of Engineering, Osmania University, Hyderabad, Telangana, India
D. Rama Krishna
Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, Krasnoyarsk, Russia
Margarita N. Favorskaya

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ramprasad, B.P., Jain, N., Sabir, M., Maurya, V. (2020). Performance Evaluation of STSA Based Speech Enhancement Techniques for Speech Communication System. In: Satapathy, S.C., Raju, K.S., Shyamala, K., Krishna, D.R., Favorskaya, M.N. (eds) Advances in Decision Sciences, Image Processing, Security and Computer Vision. ICETE 2019. Learning and Analytics in Intelligent Systems, vol 3. Springer, Cham. https://doi.org/10.1007/978-3-030-24322-7_68

Download citation

DOI: https://doi.org/10.1007/978-3-030-24322-7_68
Published: 13 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24321-0
Online ISBN: 978-3-030-24322-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics