A wavelet based method for removal of highly non-stationary noises from single-channel hindi speech patterns of low input SNR

Singh, Sachin; Tripathy, Manoj; Anand, R. S.

doi:10.1007/s10772-014-9255-3

A wavelet based method for removal of highly non-stationary noises from single-channel hindi speech patterns of low input SNR

Published: 14 October 2014

Volume 18, pages 157–166, (2015)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Sachin Singh¹,
Manoj Tripathy¹ &
R. S. Anand¹

247 Accesses
5 Citations
Explore all metrics

Abstract

This paper presents a binary mask thresholding function in Doubachies10 wavelet transform for enhancement of highly non-stationary noise mixed single-channel Hindi speech patterns of low (negative) SNR. In the wavelet transform, a five level of decomposition is used and detailed coefficients of all five levels are given to binary mask thresholding function for removing noise and enhancing the speech patterns. The robustness of the proposed method is compared with the wildly popular methods such as log-mmse, test-psc, Wiener, IdBM, and spectral-subtraction on the basis of performance measure parameters viz SNR, PSNR, PESQ, and Cepstrum distance. The algorithms were implemented in MATLAB 7.1.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Speech Intelligibility Quality in Telugu Speech Patterns Using a Wavelet-Based Hybrid Threshold Transform Method

A Wavelet-Based De-Noising Speech Signal Performance with Objective Measures

Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments

References

Bahoura, M., & Rouat, J. (2006). Wavelet speech enhancement based on time-scale adaptation. Speech Communication, 48(12), 1620–1637.
Article Google Scholar
Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics Speech and Signal Processing, 27(2), 113–120.
Article Google Scholar
Dendrinos, M., Bakamidis, S., & Carayannis, G. (1991). Speech enhancement from noise: A regenerative approach. Speech Communication, 10(1), 45–57.
Article Google Scholar
Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean square error log-spectral amplitude estimator. IEEE Transactions on Audio, Speech and Language Processing, 33, 443–445.
Google Scholar
Ephraim, Y. (1992). Statistical-model-based speech enhancement systems. Proceedings of the IEEE, 80, 1526–1555.
Article Google Scholar
Ephraim, Y., & Van Trees, H. L. (1995). A signal subspace approach for speech enhancement. IEEE Transactions on Acoustics Speech and Signal Processing, 3(4), 251–266.
Article Google Scholar
Gabor, D. (1946). Theory of communication. The Journal of Electrical Engineering, 93, 429–457.
Google Scholar
Goupillaud, P., Grossmann, A., & Morlet, J. (1984). Cycle-octave and related transforms in seismic analysis. Journal of Applied Geophysics, 23(1), 85–102.
Google Scholar
Hu, Y., & Loizou, P. C. (2007). A comparative intelligibility study of single-microphone noise reduction algorithms. Journal Acoustic Socity of America, 122, 1777–1786.
Article Google Scholar
Jensen, S. H., & Hansen, P. C. (1995). Reduction of broad-band noise in speech by truncated QSVD. IEEE Transactions on Acoustics Speech and Signal Processing, 3(6), 439–448.
Article Google Scholar
Johnson, M. T., Yuan, X., & Ren, Y. (2007). Speech signal enhancement through adaptive wavelet thresholding. Speech Communication, 2(49), 123–133.
Article Google Scholar
Kitawaki, N., & Nagabuchi, H. (1988). Objective quality evaluation for low bit-rate speech coding systems. IEEE Journal on Selected Areas in Communications, 6, 262–273.
Article Google Scholar
Li, J., & Liu, H. (2012). New wavelet packet transform algorithm based on critical bandwidth. Computer Engineering and Applications, 14(48), 5–7.
Google Scholar
McAulay, R., & Malpass, M. (1980). Speech enhancement using a soft-decision noise suppression filter. IEEE Transactions on Acoustics Speech and Signal Processing, 28(2), 137–145.
Article Google Scholar
Pearce, D., & Hirsch, H. G. (2000). The aurora experimental framework for the performance evaluation of speech recognition system under noisy conditions. International conference on spoken language processing, Beijing, 16–20 Oct 2000.
Perceptual evaluation of speech quality (PESQ) An objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs. ITU-T Recommendation P.862.1 (2003).
Prahallad, K., Elluru, N. K., Keri, V., Rajendran, S., Black, A. W. (2012). The IIIT-H indic speech databases. In Proceedings of Interspeech, Portland, Oregon, USA (2012). http://speech.iiit.ac.in/index.php/research-svl/69.html.
Rangachari, S., & Loizou, P. C. (2006). A noise-estimation algorithm for highly non-stationary environments. Speech Communication, 48, 220–231.
Article Google Scholar
Sanam, T. F. (2012). Enhancement of noisy speech based on a custom thresholding function with a statistically determined threshold. The International Journal of Speech Technology, 15(4), 463–475.
Scalart, P., & Filho, J. (1996). Speech enhancement based on a priori signal to noise estimation. In Proceedings of IEEE International conference on acoust speech, signal processing (pp. 629–632).
Singh, S., Tripathy, M., & Anand, R. S. (2013). Noise removal in single channel Hindi speech patterns by using binary mask thresholding function in various mother wavelets. IEEE International Conference on Signal Processing, Computing and Control (ISPCC), Shimla, India, 26–28 Sept 2013.
Singh, S.,Tripathy, M., & Anand, R. S. (2014). Wavelet packet based multiple noises suppression in single channel speech using binary mask threshold. IEEE international conference on signal propagation and computer technology (ICSPCT), Ajmer, India, 12–13 July 2014.
Singh, S., Tripathy, M., & Anand, R. S. (2014). “Subjective and objective analysis of speech enhancement algorithms for single channel speech patterns of Indian and english languages”, Taylor & Francis. IETE Technical Review, 31(1), 34–46.
Article Google Scholar
Stark, A. P., et al. (2008). Noise driven short-time phase spectrum compensation procedure for speech enhancement. In Proceedings of Interspeech, Brisbane, Australia.
Tao, H., & Qin, H. (2008). Chengbo Research of signal denoising method based on an improved wavelet thresholding. Piezoelectronics & Acoustooptics, 1(30), 93–95.
Google Scholar
Wojcicki, K., & Loizou, P. C. (2012). Channel selection in the modulation domain for improved speech intelligibility in noise. The Journal of the Acoustical Society of America, 131(4), 2904–2913.
Yi, H., & Loizou, P. C. (2004). Speech enhancement based on wavelet thresholding the multitaper Spectrum. IEEE Signal Processing Letters, 12, 59–67.
Google Scholar
Zhang, X. (2010). Digital : Speech signal processing and MATLAB simulation. Beijing: Publishing House of Electronics Industry.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
Sachin Singh, Manoj Tripathy & R. S. Anand

Authors

Sachin Singh
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Tripathy
View author publications
You can also search for this author in PubMed Google Scholar
R. S. Anand
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sachin Singh.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Singh, S., Tripathy, M. & Anand, R.S. A wavelet based method for removal of highly non-stationary noises from single-channel hindi speech patterns of low input SNR. Int J Speech Technol 18, 157–166 (2015). https://doi.org/10.1007/s10772-014-9255-3

Download citation

Received: 06 May 2013
Accepted: 30 September 2014
Published: 14 October 2014
Issue Date: June 2015
DOI: https://doi.org/10.1007/s10772-014-9255-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A wavelet based method for removal of highly non-stationary noises from single-channel hindi speech patterns of low input SNR

Abstract

Access this article

Similar content being viewed by others

Speech Intelligibility Quality in Telugu Speech Patterns Using a Wavelet-Based Hybrid Threshold Transform Method

A Wavelet-Based De-Noising Speech Signal Performance with Objective Measures

Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A wavelet based method for removal of highly non-stationary noises from single-channel hindi speech patterns of low input SNR

Abstract

Access this article

Similar content being viewed by others

Speech Intelligibility Quality in Telugu Speech Patterns Using a Wavelet-Based Hybrid Threshold Transform Method

A Wavelet-Based De-Noising Speech Signal Performance with Objective Measures

Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation