Skip to main content

Real-Time Blind Source Separation for Moving Speech Signals

  • Chapter

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

In this chapter, we present a method for the real-time blind source separation (BSS) of moving speech signals in a room. The method employs frequen-cydomain independent component analysis (ICA) using a blockwise batch algorithm in the first stage, and the separated signals are refined by postprocessing using crosstalk component estimation and non-stationary crosstalk cancellation in the second stage. The blockwise batch algorithm achieves better performance than an online algorithm when sources are stationary, and the postprocessing compensates for performance degradation caused by source movement. Experimental results using speech signals recorded in a real room show that our method realizes robust real-time separation for moving sources.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. J. Bell and T. J. Sejnowski, “An information-maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, pp. 1129–1159, 1995.

    Google Scholar 

  2. S. Haykin, ed., Unsupervised Adaptive Filtering. John Wiley & Sons, 2000.

    Google Scholar 

  3. T. W. Lee, Independent Component Analysis. Kluwer Academic Publishers, 1998.

    Google Scholar 

  4. A. Cichocki and S. Amari, Adaptive Blind Signal and Image Processing. John Wiley & Sons, 2002.

    Google Scholar 

  5. A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis. John Wiley & Sons, 2001.

    Google Scholar 

  6. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Blind Source Separation for Moving Speech Signals using Blockwise ICA and Residual Crosstalk Subtraction,” IEICE Trans. Fundamentals, vol. E87-A, pp. 1941–1948, 2004.

    Google Scholar 

  7. J. Anemüller and T. Gramss, “On-line blind separation of moving sound sources,” in Proc. ICA, 1999, pp. 331–334.

    Google Scholar 

  8. A. Koutras, E. Dermatas, and G. Kokkinakis, “Blind speech separation of moving speakers in real reverberant environment,” in Proc. IEEE ICASSP, 2000, pp. 1133–1136.

    Google Scholar 

  9. I. Kopriva, Z. Devcic, and H. Szu, “An adaptive short-time frequency domain algorithm for blind separation of non-stationary convolved mixtures,” in Proc. IJCNN, 2001, pp. 424–429.

    Google Scholar 

  10. K. E. Hild II, D. Erdogmus, and J. C. Principe, “Blind source separation of time-varying, instantaneous mixtures using an on-line algorithm,” in Proc. IEEE ICASSP, 2002, pp. 993–996.

    Google Scholar 

  11. R. Aichner, H. Buchner, F. Yan, and W. Kellermann, “Real-time convolutive blind source separation based on a broadband approach,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 840–848.

    Google Scholar 

  12. D. W. E. Schobben, Real-Time Adaptive Concepts in Acoustics. Kluwer Academic Publishers, 2001.

    Google Scholar 

  13. H. Sawada, R. Mukai, and S. Makino, “Direction of arrival estimation for multiple source signals using independent component analysis,” in Proc. ISSPA, vol. 2, 2003, pp. 411–414.

    Google Scholar 

  14. S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa, and H. Saruwatari, “Equivalence between frequency domain blind source separation and frequency domain adaptive beamforming for convolutive mixtures,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157–1166, 2003.

    Article  Google Scholar 

  15. S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, and Signal Processing, vol. ASSP-27, pp. 113–120, Apr. 1979.

    Article  Google Scholar 

  16. R. Mukai, S. Araki, H. Sawada and S. Makino, “Removal of residual cross-talk components in blind source separation using time-delayed spectral subtraction,” in Proc. IEEE ICASSP, 2002, pp. 1789–1792.

    Google Scholar 

  17. R. Mukai, S. Araki, H. Sawada, and S. Makino, “Removal of residual crosstalk components in blind source separation using LMS filters,” in Proc. NNSP, 2002, pp. 435–444.

    Google Scholar 

  18. S. Y. Low, S. Nordholm, and R. Togneri, “Convolutive blind signal separation with post-processing,” IEEE Trans. Speech Audio Processing, vol. 12, pp. 539–548, Sept. 2004.

    Article  Google Scholar 

  19. D. Kolossa, and R. Orglmeister, “Nonlinear postprocessing for blind speech separation,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 832–839.

    Google Scholar 

  20. C. Choi, G. Jang, Y. Lee, and S. R. Kim, “Adaptive cross-channel interference cancellation on blind source separation outputs,” in Proc. ICA (Lecture Notes in Computer Science 3195), Springer-Verlag, 2004, pp. 857–864.

    Google Scholar 

  21. S. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Advances in Neural Information Processing Systems 8, pp. 757–763, The MIT Press, 1996.

    Google Scholar 

  22. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Polar coordinate based nonlinear function for frequency-domain blind source separation,” IEICE Trans. Fundamentals, vol. E86-A, pp. 590–596, Mar. 2003.

    Google Scholar 

  23. K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” in Proc. ICA, 2001, pp. 722–727.

    Google Scholar 

  24. R. Mukai, S. Araki, and S. Makino, “Separation and dereverberation performance of frequency domain blind source separation,” in Proc. ICA, 2001, pp. 230–235.

    Google Scholar 

  25. S. Haykin, Adaptive Filter Theory. Fourth Edition. Prentice Hall, 2002.

    Google Scholar 

  26. M. Aoki, M. Okamoto, S. Aoki, H. Matsui, T. Sakurai, and Y. Kaneda, “Sound source segregation based on estimating incident angle of each frequency component of input signals acquired by multiple microphones,” Acoust. Sci. & Tech., vol. 22, pp. 149–157, Feb. 2001.

    Article  Google Scholar 

  27. Ö. Yilmaz and S. Rickard, “Blind separation of speech mixtures via time-frequency masking,” IEEE Trans. Signal Processing, vol. 52, pp. 1830–1847, 2004.

    Article  MathSciNet  Google Scholar 

  28. http://www.kecl.ntt.co.jp/icl/signal/mukai/demo/ieice2004/

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Mukai, R., Sawada, H., Araki, S., Makino, S. (2005). Real-Time Blind Source Separation for Moving Speech Signals. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_15

Download citation

  • DOI: https://doi.org/10.1007/3-540-27489-8_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24039-6

  • Online ISBN: 978-3-540-27489-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics