Signal Subspace Techniques for Speech Enhancement

Jabloun, Firas; Champagne, Benoit

doi:10.1007/3-540-27489-8_7

Part of the book series: Signals and Communication Technology (SCT)

Abstract

In this chapter, we present the signal subspace approach (SSA) for speech enhancement. The SSA is becoming a serious competitor to the widely used frequency-domain methods because it appears to offer a better compromise between signal distortion and the level of residual noise. We describe the technique in detail, covering both its underlying theory and its implementation issues. We also discuss the methods proposed in the literature to handle colored noise and to reduce the computational complexity usually associated with the SSA. In addition, we provide a filterbank interpretation of the SSA, which allows it to be viewed from a frequency-domain perspective, a more intuitive domain for speech signals. Finally, we present some of the latest variations and extensions of the SSA found in the literature, which also serve as suggestions for further research in this area.
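
As a rough illustration of the kind of processing the abstract refers to, the following is a minimal sketch of a white-noise signal subspace filter. It is not taken from the chapter. It assumes the noise is white with a known variance, uses non-overlapping signal vectors, and applies a gain of the form λ_x / (λ_x + μσ²) to each eigen-component, which is one common form of the time-domain constrained estimator in the SSA literature. The names `ssa_enhance`, `noise_var`, `dim`, and `mu` are hypothetical and chosen only for this example.

```python
import numpy as np


def ssa_enhance(noisy, noise_var, dim=20, mu=1.0):
    """Illustrative white-noise signal subspace filter (a sketch, not the chapter's code).

    noisy     : 1-D array of noisy samples
    noise_var : assumed known variance of the additive white noise (must be > 0)
    dim       : length of each signal vector (hypothetical default)
    mu        : trade-off between residual noise and signal distortion
    """
    if noise_var <= 0:
        raise ValueError("noise_var must be positive")
    noisy = np.asarray(noisy, dtype=float)
    n_frames = len(noisy) // dim
    # Split the signal into non-overlapping vectors, one per row.
    frames = noisy[: n_frames * dim].reshape(n_frames, dim)
    # Sample covariance matrix of the noisy vectors.
    cov = frames.T @ frames / n_frames
    # Eigendecomposition of the symmetric covariance matrix.
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Estimated clean-signal eigenvalues; components at or below the noise
    # floor are treated as noise-only and receive zero gain.
    clean_eig = np.maximum(eigvals - noise_var, 0.0)
    gains = clean_eig / (clean_eig + mu * noise_var)
    # Linear estimator H = U diag(gains) U^T.
    H = eigvecs @ np.diag(gains) @ eigvecs.T
    # Apply H to every vector (H is symmetric, so H.T equals H).
    enhanced = frames @ H.T
    return enhanced.reshape(-1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = np.arange(4000)
    clean = np.sin(2 * np.pi * 0.03 * n)
    noisy = clean + rng.normal(scale=0.5, size=n.size)
    out = ssa_enhance(noisy, noise_var=0.25, dim=20)
    print("input MSE :", np.mean((noisy - clean) ** 2))
    print("output MSE:", np.mean((out - clean) ** 2))
```

In this sketch, a larger `mu` suppresses more residual noise at the cost of more signal distortion, which is the compromise the abstract mentions. Colored noise and practical framing (overlap, windowing) are deliberately left out.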

Author information

Authors and Affiliations

Toshiba Research, Cambridge, UK
Firas Jabloun
McGill University, Montreal, Canada
Benoit Champagne

About this chapter

Cite this chapter

Jabloun, F., Champagne, B. (2005). Signal Subspace Techniques for Speech Enhancement. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_7

DOI: https://doi.org/10.1007/3-540-27489-8_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: Engineering, Engineering (R0)
