Skip to main content

Performance Evaluation of STSA Based Speech Enhancement Techniques for Speech Communication System

  • Conference paper
  • First Online:
  • 777 Accesses

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 3))

Abstract

Researchers present noise suppression model for reducing the spectral effects of acoustically added noise in speech. Background noise which is acoustically added to speech may decrease the performance of digital voice processors that are used for applications such as speech compression, recognition, and authentication. [6, 7] In proposed paper different types of Short Time Spectral Amplitude (STSA) [1, 17] based methods are explained to decrease the noise. Spectral subtraction gives a computationally efficient, processor- independent approach to effective digital speech analysis. But as a result of artifact, another synthetic noise may be produced by algorithm that is called musical noise. In spectral subtraction methods, there is shown less trade-off between residual and musical noise so the quality and intelligibility of signal is not maximized at required level. [8] To overcome from the problem of musical noise, wiener filter and statistical based model methods are discovered and some proposed modifications [7,8,9,10,11] are suggested in every methods to make it more effective.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Li J, Deng L, Haeb-Umbach R, Gong Y (2016) Robust automatic speech recognition, Chapter 6.3: Sampling-based methods, ISBN: 978-0-12802398-3. Elsevier

    Google Scholar 

  2. Boll SF (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. ASSP-27:113–120, April

    Google Scholar 

  3. Wang Y, Brookes M (2016) Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior. In: Proc. IEEE intl. conf. on Acoustics, Speech and Signal Processing (ICASSP), pp 5225–5229, March

    Google Scholar 

  4. Scalart P, Filho JV (1996) Speech enhancement based on a priori signal to noise ratio estimation. In: Proc. IEEE international conference on Acoustics, Speech and Signal Processing ICASSP 96, pp 629–632, May

    Google Scholar 

  5. Ephrahim Y, Malah D (1985) Speech enhancement using a minimum mean square error log spectral amplitude estimator. IEEE trans. on Acoustics, Speech and Signal Processing, vol. ASSP-33, no. 2, pp 443–445, April

    Google Scholar 

  6. Xie D, Zhang W (2014) Estimating speech spectral amplitude based on the Nakagami approximation. IEEE Signal Process. Lett. 21(11):1375–1379

    Article  Google Scholar 

  7. Doire CSJ (2016) Single-channel enhancement of speech corrupted by reverberation and noise. Ph.D. dissertation, Imperial College London

    Google Scholar 

  8. Liang D, Hoffman MD, Mysore GJ (2015) Speech dereverberation using a learned speech model. IEEE international conference on Acoustic, Speech and Signal Processing (ICASSP)

    Google Scholar 

  9. Wang Y, Brookes M (2016) Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior. IEEE international conference on Acoustics, Speech and Signal Processing (ICASSP)

    Google Scholar 

  10. Duchi J, Hazan E, Singer Y (2011) Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12(Jul):2121–2159

    Google Scholar 

  11. Erkelens JS, Hendriks RC, Heusdens R, Jensen J (2007) Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors. IEEE Trans. Speech Audio Process. 15(6):1741–1752

    Article  Google Scholar 

  12. Navya Sri. Ramakrishna Murty MM, et al (2017) Robust features from speech by using Gaussian mixture model classification. International conference and published proceeding in SIST series, vol 2. Springer, pp 437–444, August

    Google Scholar 

  13. Yu Wang, Mike Beookes (2018) IEEE members. IEEE/ACM transactions on audio, speech and language processing, vol 26, no 3, March

    Google Scholar 

  14. Dionelis N, Brookes M (2017) 25th European Signal Processing Conference (EUSIPCO)

    Google Scholar 

  15. Brookes M (1998–2016) VOICEBOX: A speech processing toolbox for MATLAB. http://www.ee.imperial.ac.uk/hp/staff/dmb/voicebox/voiceboxhtml

  16. Wang Y, Narayanan A, Wang D (2014) On training targets for supervised speech separation. IEEE/ACM trans. on audio, speech and language processing 22(12):1849–1858

    Google Scholar 

  17. Boulanger-Lewandowski N, Mysore GJ, Hoffman M (2014) Exploiting long-term temporal dependencies in NMF using recurrent neural networks with application to source separation. IEEE international conference on Acoustic, Speech and Signal Processing (ICASSP)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Naveen Jain , Mohammad Sabir or Vijendra Maurya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ramprasad, B.P., Jain, N., Sabir, M., Maurya, V. (2020). Performance Evaluation of STSA Based Speech Enhancement Techniques for Speech Communication System. In: Satapathy, S.C., Raju, K.S., Shyamala, K., Krishna, D.R., Favorskaya, M.N. (eds) Advances in Decision Sciences, Image Processing, Security and Computer Vision. ICETE 2019. Learning and Analytics in Intelligent Systems, vol 3. Springer, Cham. https://doi.org/10.1007/978-3-030-24322-7_68

Download citation

Publish with us

Policies and ethics