Abstract
In this chapter, we present the signal subspace approach (SSA) for speech enhancement. The SSA is becoming a serious competitor to its already widely used frequency-domain counterparts since it seems to offer a better compromise between signal distortion and the level of the residual noise. We provide a detailed description of the technique in terms of its underlying theory as well as the implementation issues. We also discuss the methods, proposed in the literature, to deal with the colored noise case and to cope with the complexity concerns usually associated with the SSA. In addition to that, we provide a filterbank interpretation to the SSA which allows it to be viewed from a frequency-domain perspective which is a more intuitive domain as far as speech signals are concerned. Finally, we present some of the latest variations and extensions to the SSA found in the literature which also serve as suggestions to further research in this area.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R. Badeau, K. Abed-Meraim, G. Richard, and B. David, “Sliding window orthonormal past algorithm,” in Proc. IEEE ICASSP, vol. 5, 2003, pp. 261–264.
S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 27, pp. 113–120, Apr. 1979.
Y. Bresler and A. Macovski, “Exact maximum likelihood parameter estimation of superimposed exponential signals in noise,” IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 34, pp. 1081–1089, Oct. 1986.
T. Chonavel, B. Champagne, and C. Riou, “Fast adaptive eigenvalue decomposition: a maximum likelihood approach,” IEEE Trans. on Signal Processing, vol. 83, pp. 317–324, Feb. 2003.
J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals. Prentice-Hall, Englewood Cliffs, NJ, 1993.
M. Dendrinos, S. Bakamidis, and G. Carayannis, “Speech enhancement from noise: a regenerative approach,” Speech Communications, vol. 10, pp. 45–57, Feb. 1991.
S. Doclo and M. Moonen, “GSVD-based optimal filtering for single and multimicrophone speech enhancement,” IEEE Trans. on Signal Processing, vol. 50, pp. 2230–2244, Sept. 2002.
Y. Ephraim and H. L. Van Trees, “A signal subspace approach for speech enhancement,” IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 251–266, July 1995.
G. H. Golub and C. F. Van Loan, Matrix Computations. Johns Hopkins University Press, Baltimore, MD, 2nd edition, 1989.
P. C. Hansen and S. H. Jensen, “FIR filter representation of reduced-rank noise reduction,” IEEE Trans. on Signal Processing, vol. 46, pp. 737–1741, June 1998.
P. S. K. Hansen, P. C. Hansen, S. D. Hansen, and J. A. Sorensen, “Noise reduction of speech signals using the rank-revealing ULLV decomposition,” in Proc. EUSIPCO, 1996, pp. 182–185.
M. H. Hayes, Statistical Digital Signal Processing and Modeling. John Wiley & Sons, Inc., New York, 1996.
S. Haykin, Adaptive Filter Theory. Prentice-Hall, Englewood Cliffs, NJ, 4th edition, 2002.
K. Hermus and P. Wambacq, “Assessment of signal subspace based speech enhancement for noise robust speech recognition,” in Proc. IEEE ICASSP, vol. 1, 2004, pp. 954–948.
Y. Hu and C. Loizou, “A subspace approach for enhancing speech corrupted by colored noise,” in Proc. IEEE ICASSP, vol. 1, 2002, pp. 573–576.
J. Huang and Y. Zhao, “An energy-constrained signal subspace method for speech enhancement and recognition in colored noise,” Speech Communications, vol. 1, pp. 165–181, 1998.
J. Huang and Y. Zhao, “A DCT-based fast signal subspace technique for robust speech recognition,” IEEE Trans. on Speech and Audio Processing, vol. 8, pp. 747–751, Nov. 2000.
F. Jabloun, Perceptual and Multi-Microphone Signal Subspace Techniques for Speech Enhancement. Ph.D. thesis, McGill University, Montreal, Canada, 2004.
F. Jabloun and B. Champagne, “A multi-microphone signal subspace approach for speech enhancement,” in Proc. IEEE ICASSP, vol. 1, 2001, pp. 205–208.
F. Jabloun and B. Champagne, “A perceptual signal subspace approach for speech enhancement in colored noise,” in Proc. IEEE ICASSP, vol. 1, 2002, pp. 569–572.
F. Jabloun and B. Champagne, “Incorporating the human hearing properties in the signal subspace approach for speech enhancement,” IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 11, pp. 700–708, Nov. 2003.
J. Jensen and J. H. L. Hansen, “Speech enhancement using a constrained iterative sinusoidal model,” IEEE Trans. on Speech and Audio Processing, vol. 9, pp. 731–740, Oct. 2001.
S. H. Jensen, P. C. Hansen, S. D. Hansen, and J. A. Sorensen, “Reduction of broad-band noise in speech by truncated QSVD,” IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 439–448, Nov. 1995.
M. Jeppesen, C. A. Rodbro, and S. H. Jensen, “Recursively updated eigenfilterbank for speech enhancement,” in Proc. IEEE ICASSP, vol. 1, 2001, pp. 653–656.
J. U. Kim, S. G. Kim, and C. D. Yoo, “The incorporation of masking threshold to subspace speech enhancement,” in Proc. IEEE ICASSP, vol. 1, 2003, pp. 76–79.
M. Klein and P. Kabal, “Signal subspace speech enhancement with perceptual post-filtering,” in Proc. IEEE ICASSP, vol. 1, 2002, pp. 537–540.
H. Krim and M. Viberg, “Two decades of array signal proceesing research: the parametric approach,” IEEE Signal Processing Magazine, pp. 67–94, July 1996.
D. G. Luenberger, Linear and Nonlinear Programming. Addison-Wesley, Reading, MA, 1984.
U. Mittal and N. Phamdo, “Signal/noise KLT based approach for enhancing speech degraded by colored noise,” IEEE Trans. on Speech and Audio Processing, vol. 8, pp. 159–167, Mar. 2000.
T. F. Quatieri and R. J. McAulay, “Noise reduction using a soft-decision sinewave vector quantizer,” in Proc. IEEE ICASSP, 1990, pp. 821–824.
A. Rezayee and S. Gazor, “An adaptive KLT approach for speech enhancement,” IEEE Trans. on Speech and Audio Processing, vol. 9, pp. 87–95, Feb. 2001.
R. O. Schmidt, “Multiple emitter location and signal parameter estimation,” IEEE Trans. on Antennas and Propagation, vol. 34, pp. 276–280, Mar. 1986.
G. W. Stewart, “An updating algorithm for subspace tracking,” IEEE Trans. on Signal Processing, vol. 40, pp. 1535–1541, June 1992.
P. Strobach, “Low-rank adaptive filters,” IEEE Trans. on Signal Processing, vol. 44, pp. 2932–2947, Dec. 1996.
D. E. Tsoukalas, J. N. Mourjopoulos, and G. Kokkinakis, “Speech enhancement based on audible noise suppression,” IEEE Trans. on Speech and Audio Processing, vol. 5, pp. 479–514, Nov. 1997.
R. Vetter, “Single channel speech enhancement using MDL-based subspace approach in bark domain,” in Proc. IEEE ICASSP, vol. 1, 2001, pp. 641–644.
N. Virag, “Single channel speech enhancement based on masking properties of the human auditory system,” IEEE Trans. on Speech and Audio Processing, vol. 7, pp. 126–137, Mar. 1999.
J. F. Wang, C. H. Yang, and K. H. Chang, “Subsapce tracking for speech enhancement in car noise environments,” in Proc. IEEE ICASSP, vol. 2, 2004, pp. 789–792.
G. Xu and T. Kailath, “Fast subspace decomposition,” IEEE Trans. on Signal Processing, vol. 42, pp. 539–551, Mar. 1994.
B. Yang, “Projection approximation subspace tracking,” IEEE Trans. on Signal Processing, vol. 43, pp. 95–107, Jan. 1995.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Jabloun, F., Champagne, B. (2005). Signal Subspace Techniques for Speech Enhancement. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_7
Download citation
DOI: https://doi.org/10.1007/3-540-27489-8_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: EngineeringEngineering (R0)