Beamspace blind signal separation for speech enhancement

Low, Siow Yong; Yiu, Ka-Fai Cedric; Nordholm, Sven

doi:10.1007/s11081-008-9060-4

Published: 06 November 2008

Volume 10, pages 313–330 (2009)
Optimization and Engineering

Abstract

Signal processing methods for speech enhancement are of vital interest for communications equipment. In particular, multichannel algorithms, which perform spatial filtering to separate signals that have overlapping frequency content but different spatial origins, are important for a wide range of applications. Two of the most popular multichannel methods are blind signal separation (BSS) and beamforming. Briefly, BSS separates mixed sources by optimizing the statistical independence among the outputs, whilst beamforming optimizes the look direction of the desired source(s). However, both methods have separation limitations: BSS degrades in reverberant environments, and beamforming is very sensitive to array model mismatch. In this paper, we propose a novel hybrid scheme, called beamspace BSS, which is intended to compensate for these weaknesses by jointly optimizing the spatial selectivity and statistical independence of the sources. We show that beamspace BSS significantly outperforms conventional sensor-space BSS in separation performance, particularly in reverberant room environments.

Author information

Authors and Affiliations

Western Australian Telecommunications Research Institute, Crawley, WA, 6009, Australia
Siow Yong Low & Sven Nordholm
Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong, People’s Republic of China
Ka-Fai Cedric Yiu

Corresponding author

Correspondence to Ka-Fai Cedric Yiu.

Additional information

K.F.C. Yiu is supported by RGC Grant PolyU. 7191/06E and the research committee of the Hong Kong Polytechnic University.

About this article

Cite this article

Low, S.Y., Yiu, KF.C. & Nordholm, S. Beamspace blind signal separation for speech enhancement. Optim Eng 10, 313–330 (2009). https://doi.org/10.1007/s11081-008-9060-4

Received: 04 October 2008
Accepted: 20 October 2008
Published: 06 November 2008
Issue Date: June 2009
DOI: https://doi.org/10.1007/s11081-008-9060-4
