Abstract
Several methods which are used for speech detection usually fail when SNR is low. The wavelet analysis has properties which can help in separating the speech from other signals. Many works report better detection and separation performance using wavelet analysis than using other techniques. On another level, as segmentation of speech into many classes is so hard, WT is well localized in time-frequency domain, and boundaries of speech segments can be willingly detected.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
C. Juang, C. Cheng, T. Chen, Speech detection in noisy environments by wavelet energy-based recurrent neural fuzzy network. Expert Syst. Appl. 36(1), 321–332 (2009)
S.M. Joseph, A.P. Babu, Wavelet energy based voice activity detection and adaptive thresholding for efficient speech coding. Int. J. Speech Technol. 19(3), 537–550 (2016)
C.C. Tu, C. Juang, Recurrent type-2 fuzzy neural network using Haar wavelet energy and entropy features for speech detection in noisy environments. Expert Syst. Appl. 39(3), 2479–2488 (2012)
S.H. Chen, R. Guido, T.K. Truong, Y. Chang, Improved voice activity detection algorithm using wavelet and support vector machine. Comput. Speech Lang. 24(3), 531–543 (2010)
W. Xue, S. Du, C. Fang, Y. Ye, Voice activity detection using wavelet-based multiresolution Spectrum and support vector machines and audio mixing algorithm, computer vision in human-computer interaction, lecture notes in computer science. Spring 3979, 78–88 (2006)
M. Eshaghi, M. Mollaei, Voice activity detection based on using wavelet packet. Digit.Signal Process. 20(4), 1102–1115 (2010)
B. Tan, R. Lang, H. Schroder, A. Spray, P. Dermody. Applying wavelet analysis to speech segmentation and classification. In H. H. Szu, Wavelet Appl. Proc. SPIE 2242, 750–761, (1994)
M. Ziolko, J. Galka, B. Ziolko, T. Drwiega, Perceptual wavelet decomposition for speech segmentation, in 11th Annual Conference of the International Speech Communication Association 2010 (INTERSPEECH 2010), Vols. 3 and 4, 2234–2237
M. Sarma, K.K. Sarma, Segmentation and classification of vowel phonemes of assamese speech using a hybrid neural framework. Appl. Comput. Intell. Soft Comput. 2012, 8 (2012)
Α. Koutras, E. Dermatas, G. Kokkinakis, Blind speech separation using wavelet decomposition, in 6th International Workshop on Speech and Computers, Moscow, Russia, Oct 2001, pp. 146–149
B. Mozaffari, M.A. Tinati, Blind source separation of speech sources in wavelet packet domains using Laplacian mixture model expectation maximization estimation in over-complete- cases. J. Stat. Mech. Theory Exp. An IOP and SISSA J. 1–31 (2007)
X. Wu, J. He, S. Jin, A. Xu, W. Wang, Blind separation of speech signals based on wavelet transform and independent component analysis. Trans. Tianjin University 16(2), 123–128 (2010)
I. Missaoui, Z. Lachiri, Undecimated wavelet packet for blind speech separation using independent component analysis, in Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, ed. by A. Abraham, J. L. Mauri, J. F. Buford, J. Suzuki, S. M. Thampi, vol. 193, (Springer, Berlin, Heidelberg, 2011), pp. 318–328
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 The Author(s)
About this chapter
Cite this chapter
Farouk, M.H. (2018). Speech Detection and Separation. In: Application of Wavelets in Speech Processing. SpringerBriefs in Electrical and Computer Engineering(). Springer, Cham. https://doi.org/10.1007/978-3-319-69002-5_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-69002-5_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69001-8
Online ISBN: 978-3-319-69002-5
eBook Packages: EngineeringEngineering (R0)