Skip to main content
Log in

Beamspace blind signal separation for speech enhancement

  • Published:
Optimization and Engineering Aims and scope Submit manuscript

Abstract

Signal processing methods for speech enhancement are of vital interest for communications equipments. In particular, multichannel algorithms, which perform spatial filtering to separate signals that have overlapping frequency content but different spatial origins, are important for a wide range of applications. Two of the most popular multichannel methods are blind signal separation (BSS) and beamforming. Briefly, (BSS) separates mixed sources by optimizing the statistical independence among the outputs whilst beamforming optimizes the look direction of the desired source(s). However, both methods have separation limitations, in that BSS succumbs to reverberant environments and beamforming is very sensitive to array model mismatch. In this paper, we propose a novel hybrid scheme, called beamspace BSS, which is intended to compensate the aforementioned separation weaknesses by jointly optimizing the spatial selectivity and statistical independence of the sources. We show that beamspace BSS outperforms the separation performance of the conventional sensor space BSS significantly, particularly in reverberant room environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aichner R, Araki S, Makino S, Nishikawa T, Saruwatari H (2002) Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming. In: PIEEE workshop on neural networks for signal processing, pp 445–454, September

  • Amin MG, Bhalla N (1998) Minimum bias spatial filters for beamspace direction-of-arrival estimation. J Franklin Inst 335(1):35–52

    Article  Google Scholar 

  • Araki S, Mukai R, Makino S, Nishikawa T, Saruwatari H (2003) The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech. IEEE Trans Speech Audio Process 11(2):109–116

    Article  Google Scholar 

  • Benesty J, Makino S, Chen J (2005) Speech enhancement. Springer, Berlin

    Google Scholar 

  • Brandstein M, Ward D (2001) Microphone arrays: Signal processing techniques and applications. Springer, Berlin

    Google Scholar 

  • Buckley K, Xu XL (1990) Spatial spectrum estimation in a location sector. IEEE Trans Acoust Speech Signal Process 38(11):1842–1852

    Article  Google Scholar 

  • Cardoso JF (1998) Blind signal separation: Statistical principles. Proc IEEE 86(10):2009–2025

    Article  Google Scholar 

  • Cherry EC (1953) Some experiments on the recognition of speech, with one and with two ears. J Acoust Soc Am 25(5):975–979

    Article  Google Scholar 

  • Chin DN (2003) Blind source separation of convolutive mixtures of speech. In: Kobsa A, Wahlster W (eds) Adaptive signal processing: Applications to the real world. Signals and communications technology. Springer, Berlin

    Google Scholar 

  • Cichocki A, Amari S (2002) Adaptive blind signal and image processing. Wiley, West Sussex

    Book  Google Scholar 

  • Douglas SC, Cichocki A (1997) Neural networks for blind decorrelation of signals. IEEE Trans Signal Process 45(11):2829–2842

    Article  Google Scholar 

  • Fancourt C, Parra LC (2001) The generalized sidelobe decorrelator. In: IEEE workshop on the apps of signal processing to audio and acoustics, pp 167–170, October

  • Grbić N, Nordholm S (2002) Soft constrained subband beamforming for handsfree speech enhancement. In: IEEE int conf on acoustics, speech and signal processing, pp 885–888, May

  • Griffiths LJ, Jim CW (1982) An alternative approach to linearly constrained adaptive beamforming. IEEE Trans Antennas Propag 30(1):27–34

    Article  Google Scholar 

  • Hassanien A, Elkader SA, Gershman AB, Wong KM (2004) Beamspace preprocessing with an improved robustness against out-of-sector sources using second-order cone programming. In: IEEE workshop on sensor array and multichannel signal processing, pp 347–351, July

  • Haykin S (2000) Unsupervised adaptive filtering: Blind source separation. Wiley, New York

    Google Scholar 

  • Horn R, Johnson C (1985) Matrix analysis. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  • Ikram MZ, Morgan DR (2002) A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation. In: IEEE int conf on acoustics, speech and signal processing, pp 881–884, May

  • Lee H, Wengrovitz M (1990) Resolution threshold of beamspace MUSIC for two closely spaced emitters. IEEE Trans Acoust Speech Signal Process 38(9):1545–1559

    Article  MATH  Google Scholar 

  • Linebarger DA, DeGroat RD, Dowling EM, Stoica P, Fudge GL (1995) Incorporating a priori information into MUSIC algorithm and analysis. Signal Process 46(1):85–104

    Article  MATH  Google Scholar 

  • Nordholm S, Claesson I, Dahl M (1999) Adaptive microphone array employing calibration signals: Analytical evaluation. IEEE Trans Speech Audio Process 7(3):241–252

    Article  Google Scholar 

  • Parra LC, Alvino CV (2002) Geometric source separation: Merging convolutive source separation with geometric beamforming. IEEE Trans Speech Audio Process 6(10):352–362

    Article  Google Scholar 

  • Parra LC, Alvino CV (2000) Convolutive blind separation of non-stationary sources. IEEE Trans Speech Audio Process 8(3):320–327

    Article  Google Scholar 

  • Peterson PM (1986) Simulating the response of multiple microphones to a single acoustic source in a reverberant room. J Acoust Soc Am 80(5):1527–1529

    Article  Google Scholar 

  • Rodriguez A, Baryshnikov BV, Van Veen BD, Wakai RT (2006) MEG and EEG source localization in beamspace. IEEE Trans Biomed Eng 53(3):430–441

    Article  Google Scholar 

  • Sawada H, Mukai R, Araki S, Makino S (2004) A robust and precise method for solving the permutation problem of frequency domain blind source separation. IEEE Trans Speech Audio Process 12(5):530–538

    Article  Google Scholar 

  • Tian Z, Van Trees HL (2001) Beamspace MODE. In: Asilomar conf on signals, systems and computers, pp 926–930, November

  • Weinstein E, Feder M, Oppenheim AV (1993) Multi-channel signal separation by decorrelation. IEEE Trans Speech Audio Process 1(4):405–413

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ka-Fai Cedric Yiu.

Additional information

K.F.C. Yiu is supported by RGC Grant PolyU. 7191/06E and the research committee of the Hong Kong Polytechnic University.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Low, S.Y., Yiu, KF.C. & Nordholm, S. Beamspace blind signal separation for speech enhancement. Optim Eng 10, 313–330 (2009). https://doi.org/10.1007/s11081-008-9060-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11081-008-9060-4

Keywords

Navigation