Real-Time Blind Source Separation for Moving Speech Signals

Mukai, Ryo; Sawada, Hiroshi; Araki, Shoko; Makino, Shoji

doi:10.1007/3-540-27489-8_15

Real-Time Blind Source Separation for Moving Speech Signals

Ryo Mukai⁴,
Hiroshi Sawada⁴,
Shoko Araki⁴ &
…
Shoji Makino⁴

Chapter

2422 Accesses
2 Citations

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

In this chapter, we present a method for the real-time blind source separation (BSS) of moving speech signals in a room. The method employs frequen-cydomain independent component analysis (ICA) using a blockwise batch algorithm in the first stage, and the separated signals are refined by postprocessing using crosstalk component estimation and non-stationary crosstalk cancellation in the second stage. The blockwise batch algorithm achieves better performance than an online algorithm when sources are stationary, and the postprocessing compensates for performance degradation caused by source movement. Experimental results using speech signals recorded in a real room show that our method realizes robust real-time separation for moving sources.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. J. Bell and T. J. Sejnowski, “An information-maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, pp. 1129–1159, 1995.
Google Scholar
S. Haykin, ed., Unsupervised Adaptive Filtering. John Wiley & Sons, 2000.
Google Scholar
T. W. Lee, Independent Component Analysis. Kluwer Academic Publishers, 1998.
Google Scholar
A. Cichocki and S. Amari, Adaptive Blind Signal and Image Processing. John Wiley & Sons, 2002.
Google Scholar
A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis. John Wiley & Sons, 2001.
Google Scholar
R. Mukai, H. Sawada, S. Araki, and S. Makino, “Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,” IEICE Trans. Fundamentals, vol. E87-A, pp. 1941–1948, 2004.
Google Scholar
J. Anemüller and T. Gramss, “On-line blind separation of moving sound sources,” in Proc. ICA, 1999, pp. 331–334.
Google Scholar
A. Koutras, E. Dermatas, and G. Kokkinakis, “Blind speech separation of moving speakers in real reverberant environment,” in Proc. IEEE ICASSP, 2000, pp. 1133–1136.
Google Scholar
I. Kopriva, Z. Devcic, and H. Szu, “An adaptive short-time frequency domain algorithm for blind separation of non-stationary convolved mixtures,” in Proc. IJCNN, 2001, pp. 424–429.
Google Scholar
K. E. Hild II, D. Erdogmus, and J. C. Principe, “Blind source separation of time-varying, instantaneous mixtures using an on-line algorithm,” in Proc. IEEE ICASSP, 2002, pp. 993–996.
Google Scholar
R. Aichner, H. Buchner, F. Yan, and W. Kellermann, “Real-time convolutive blind source separation based on a broadband approach,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 840–848.
Google Scholar
D. W. E. Schobben, Real-Time Adaptive Concepts in Acoustics. Kluwer Academic Publishers, 2001.
Google Scholar
H. Sawada, R. Mukai, and S. Makino, “Direction of arrival estimation for multiple source signals using independent component analysis,” in Proc. ISSPA, vol. 2, 2003, pp. 411–414.
Google Scholar
S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa, and H. Saruwatari, “Equivalence between frequency domain blind source separation and frequency domain adaptive beamforming for convolutive mixtures,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157–1166, 2003.
Article Google Scholar
S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, and Signal Processing, vol. ASSP-27, pp. 113–120, Apr. 1979.
Article Google Scholar
R. Mukai, S. Araki, H. Sawada and S. Makino, “Removal of residual cross-talk components in blind source separation using time-delayed spectral subtraction,” in Proc. IEEE ICASSP, 2002, pp. 1789–1792.
Google Scholar
R. Mukai, S. Araki, H. Sawada, and S. Makino, “Removal of residual crosstalk components in blind source separation using LMS filters,” in Proc. NNSP, 2002, pp. 435–444.
Google Scholar
S. Y. Low, S. Nordholm, and R. Togneri, “Convolutive blind signal separation with post-processing,” IEEE Trans. Speech Audio Processing, vol. 12, pp. 539–548, Sept. 2004.
Article Google Scholar
D. Kolossa, and R. Orglmeister, “Nonlinear postprocessing for blind speech separation,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 832–839.
Google Scholar
C. Choi, G. Jang, Y. Lee, and S. R. Kim, “Adaptive cross-channel interference cancellation on blind source separation outputs,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 857–864.
Google Scholar
S. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Advances in Neural Information Processing Systems 8, pp. 757–763, The MIT Press, 1996.
Google Scholar
H. Sawada, R. Mukai, S. Araki, and S. Makino, “Polar coordinate based nonlinear function for frequency-domain blind source separation,” IEICE Trans. Fundamentals, vol. E86-A, pp. 590–596, Mar. 2003.
Google Scholar
K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” in Proc. ICA, 2001, pp. 722–727.
Google Scholar
R. Mukai, S. Araki, and S. Makino, “Separation and dereverberation performance of frequency domain blind source separation,” in Proc. ICA, 2001, pp. 230–235.
Google Scholar
S. Haykin, Adaptive Filter Theory. Fourth Edition. Prentice Hall, 2002.
Google Scholar
M. Aoki, M. Okamoto, S. Aoki, H. Matsui, T. Sakurai, and Y. Kaneda, “Sound source segregation based on estimating incident angle of each frequency component of input signals acquired by multiple microphones,” Acoust. Sci. & Tech., vol. 22, pp. 149–157, Feb. 2001.
Article Google Scholar
Ö. Yilmaz and S. Rickard, “Blind separation of speech mixtures via time-frequency masking,” IEEE Trans. Signal Processing, vol. 52, pp. 1830–1847, 2004.
Article MathSciNet Google Scholar
http://www.kecl.ntt.co.jp/icl/signal/mukai/demo/ieice2004/
Google Scholar

Download references

Author information

Authors and Affiliations

NTT Communication Science Laboratories, Soraku-gun, Kyoto, 619-0237, Japan
Ryo Mukai, Hiroshi Sawada, Shoko Araki & Shoji Makino

Authors

Ryo Mukai
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Sawada
View author publications
You can also search for this author in PubMed Google Scholar
Shoko Araki
View author publications
You can also search for this author in PubMed Google Scholar
Shoji Makino
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mukai, R., Sawada, H., Araki, S., Makino, S. (2005). Real-Time Blind Source Separation for Moving Speech Signals. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_15

Download citation

DOI: https://doi.org/10.1007/3-540-27489-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics