Speech Detection and Separation

Farouk, Mohamed Hesham

doi:10.1007/978-3-319-69002-5_5

Mohamed Hesham Farouk⁶

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

820 Accesses

Abstract

Several methods which are used for speech detection usually fail when SNR is low. The wavelet analysis has properties which can help in separating the speech from other signals. Many works report better detection and separation performance using wavelet analysis than using other techniques. On another level, as segmentation of speech into many classes is so hard, WT is well localized in time-frequency domain, and boundaries of speech segments can be willingly detected.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

C. Juang, C. Cheng, T. Chen, Speech detection in noisy environments by wavelet energy-based recurrent neural fuzzy network. Expert Syst. Appl. 36(1), 321–332 (2009)
Article Google Scholar
S.M. Joseph, A.P. Babu, Wavelet energy based voice activity detection and adaptive thresholding for efficient speech coding. Int. J. Speech Technol. 19(3), 537–550 (2016)
Article Google Scholar
C.C. Tu, C. Juang, Recurrent type-2 fuzzy neural network using Haar wavelet energy and entropy features for speech detection in noisy environments. Expert Syst. Appl. 39(3), 2479–2488 (2012)
Article Google Scholar
S.H. Chen, R. Guido, T.K. Truong, Y. Chang, Improved voice activity detection algorithm using wavelet and support vector machine. Comput. Speech Lang. 24(3), 531–543 (2010)
Article Google Scholar
W. Xue, S. Du, C. Fang, Y. Ye, Voice activity detection using wavelet-based multiresolution Spectrum and support vector machines and audio mixing algorithm, computer vision in human-computer interaction, lecture notes in computer science. Spring 3979, 78–88 (2006)
Google Scholar
M. Eshaghi, M. Mollaei, Voice activity detection based on using wavelet packet. Digit.Signal Process. 20(4), 1102–1115 (2010)
Article Google Scholar
B. Tan, R. Lang, H. Schroder, A. Spray, P. Dermody. Applying wavelet analysis to speech segmentation and classification. In H. H. Szu, Wavelet Appl. Proc. SPIE 2242, 750–761, (1994)
Google Scholar
M. Ziolko, J. Galka, B. Ziolko, T. Drwiega, Perceptual wavelet decomposition for speech segmentation, in 11th Annual Conference of the International Speech Communication Association 2010 (INTERSPEECH 2010), Vols. 3 and 4, 2234–2237
Google Scholar
M. Sarma, K.K. Sarma, Segmentation and classification of vowel phonemes of assamese speech using a hybrid neural framework. Appl. Comput. Intell. Soft Comput. 2012, 8 (2012)
Google Scholar
Α. Koutras, E. Dermatas, G. Kokkinakis, Blind speech separation using wavelet decomposition, in 6th International Workshop on Speech and Computers, Moscow, Russia, Oct 2001, pp. 146–149
Google Scholar
B. Mozaffari, M.A. Tinati, Blind source separation of speech sources in wavelet packet domains using Laplacian mixture model expectation maximization estimation in over-complete- cases. J. Stat. Mech. Theory Exp. An IOP and SISSA J. 1–31 (2007)
Google Scholar
X. Wu, J. He, S. Jin, A. Xu, W. Wang, Blind separation of speech signals based on wavelet transform and independent component analysis. Trans. Tianjin University 16(2), 123–128 (2010)
Article Google Scholar
I. Missaoui, Z. Lachiri, Undecimated wavelet packet for blind speech separation using independent component analysis, in Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, ed. by A. Abraham, J. L. Mauri, J. F. Buford, J. Suzuki, S. M. Thampi, vol. 193, (Springer, Berlin, Heidelberg, 2011), pp. 318–328
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Engineering, Math and Physics, Cairo University, Faculty of Engineering, Giza, Egypt
Mohamed Hesham Farouk

Authors

Mohamed Hesham Farouk
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Farouk, M.H. (2018). Speech Detection and Separation. In: Application of Wavelets in Speech Processing. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-69002-5_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-69002-5_5
Published: 30 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69001-8
Online ISBN: 978-3-319-69002-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics