Abstract
In this chapter, we present a method for the real-time blind source separation (BSS) of moving speech signals in a room. The method employs frequen-cydomain independent component analysis (ICA) using a blockwise batch algorithm in the first stage, and the separated signals are refined by postprocessing using crosstalk component estimation and non-stationary crosstalk cancellation in the second stage. The blockwise batch algorithm achieves better performance than an online algorithm when sources are stationary, and the postprocessing compensates for performance degradation caused by source movement. Experimental results using speech signals recorded in a real room show that our method realizes robust real-time separation for moving sources.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
A. J. Bell and T. J. Sejnowski, “An information-maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, pp. 1129–1159, 1995.
S. Haykin, ed., Unsupervised Adaptive Filtering. John Wiley & Sons, 2000.
T. W. Lee, Independent Component Analysis. Kluwer Academic Publishers, 1998.
A. Cichocki and S. Amari, Adaptive Blind Signal and Image Processing. John Wiley & Sons, 2002.
A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis. John Wiley & Sons, 2001.
R. Mukai, H. Sawada, S. Araki, and S. Makino, “Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,” IEICE Trans. Fundamentals, vol. E87-A, pp. 1941–1948, 2004.
J. Anemüller and T. Gramss, “On-line blind separation of moving sound sources,” in Proc. ICA, 1999, pp. 331–334.
A. Koutras, E. Dermatas, and G. Kokkinakis, “Blind speech separation of moving speakers in real reverberant environment,” in Proc. IEEE ICASSP, 2000, pp. 1133–1136.
I. Kopriva, Z. Devcic, and H. Szu, “An adaptive short-time frequency domain algorithm for blind separation of non-stationary convolved mixtures,” in Proc. IJCNN, 2001, pp. 424–429.
K. E. Hild II, D. Erdogmus, and J. C. Principe, “Blind source separation of time-varying, instantaneous mixtures using an on-line algorithm,” in Proc. IEEE ICASSP, 2002, pp. 993–996.
R. Aichner, H. Buchner, F. Yan, and W. Kellermann, “Real-time convolutive blind source separation based on a broadband approach,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 840–848.
D. W. E. Schobben, Real-Time Adaptive Concepts in Acoustics. Kluwer Academic Publishers, 2001.
H. Sawada, R. Mukai, and S. Makino, “Direction of arrival estimation for multiple source signals using independent component analysis,” in Proc. ISSPA, vol. 2, 2003, pp. 411–414.
S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa, and H. Saruwatari, “Equivalence between frequency domain blind source separation and frequency domain adaptive beamforming for convolutive mixtures,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157–1166, 2003.
S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, and Signal Processing, vol. ASSP-27, pp. 113–120, Apr. 1979.
R. Mukai, S. Araki, H. Sawada and S. Makino, “Removal of residual cross-talk components in blind source separation using time-delayed spectral subtraction,” in Proc. IEEE ICASSP, 2002, pp. 1789–1792.
R. Mukai, S. Araki, H. Sawada, and S. Makino, “Removal of residual crosstalk components in blind source separation using LMS filters,” in Proc. NNSP, 2002, pp. 435–444.
S. Y. Low, S. Nordholm, and R. Togneri, “Convolutive blind signal separation with post-processing,” IEEE Trans. Speech Audio Processing, vol. 12, pp. 539–548, Sept. 2004.
D. Kolossa, and R. Orglmeister, “Nonlinear postprocessing for blind speech separation,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 832–839.
C. Choi, G. Jang, Y. Lee, and S. R. Kim, “Adaptive cross-channel interference cancellation on blind source separation outputs,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 857–864.
S. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Advances in Neural Information Processing Systems 8, pp. 757–763, The MIT Press, 1996.
H. Sawada, R. Mukai, S. Araki, and S. Makino, “Polar coordinate based nonlinear function for frequency-domain blind source separation,” IEICE Trans. Fundamentals, vol. E86-A, pp. 590–596, Mar. 2003.
K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” in Proc. ICA, 2001, pp. 722–727.
R. Mukai, S. Araki, and S. Makino, “Separation and dereverberation performance of frequency domain blind source separation,” in Proc. ICA, 2001, pp. 230–235.
S. Haykin, Adaptive Filter Theory. Fourth Edition. Prentice Hall, 2002.
M. Aoki, M. Okamoto, S. Aoki, H. Matsui, T. Sakurai, and Y. Kaneda, “Sound source segregation based on estimating incident angle of each frequency component of input signals acquired by multiple microphones,” Acoust. Sci. & Tech., vol. 22, pp. 149–157, Feb. 2001.
Ö. Yilmaz and S. Rickard, “Blind separation of speech mixtures via time-frequency masking,” IEEE Trans. Signal Processing, vol. 52, pp. 1830–1847, 2004.
http://www.kecl.ntt.co.jp/icl/signal/mukai/demo/ieice2004/
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Mukai, R., Sawada, H., Araki, S., Makino, S. (2005). Real-Time Blind Source Separation for Moving Speech Signals. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_15
Download citation
DOI: https://doi.org/10.1007/3-540-27489-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: EngineeringEngineering (R0)