Abstract
In this paper, we use kernel principal component analysis (kPCA) for speech enhancement. To synthesize the de-noised audio signal we rely on an iterative pre-image method. In order to gain better understanding about the pre-image step we performed experiments with different pre-image methods, first on synthetic data and then on audio data. The results of these experiments led to a reduction of artifacts in the original speech enhancement method, tested on speech corrupted by additive white Gaussian noise at several SNR levels. The evaluation with perceptually motivated quality measures confirms the improvement.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abrahamsen, T.J., Hansen, L.K.: Input Space Regularization Stabilizes Pre-Images for Kernel PCA De-Noising. In: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (2009)
Berouti, M., Schwartz, M., Makhoul, J.: Enhancement of Speech Corrupted by Acoustic Noise. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 1979), pp. 208–211 (1979)
Griffin, D., Lim, J.: Signal Estimation from Modified Short-Time Fourier Transform. IEEE Transactions on Acoustics, Speech and Signal Processing 32(2), 236–243 (1984)
Honeine, P., Richard, C.: Solving the Pre-Image Problem in Kernel Machines: A Direct Method. In: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (2009)
Hu, Y., Loizou, P.: Evaluation of Objective Quality Measures for Speech Enhancement. IEEE Transactions on Audio, Speech, and Language Processing 16(1), 229–238 (2008)
Hu, Y., Loizou, P.C.: A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise. IEEE Transactions on Speech and Audio Processing 11, 334–341 (2003)
Kwok, J.T., Tsang, I.W.: The Pre-Image Problem in Kernel Methods. IEEE Transactions on Neural Networks 15, 408–415 (2004)
Leitner, C., Pernkopf, F., Kubin, G.: Kernel PCA for Speech Enhancement. In: Interspeech 2011 (accepted, 2011)
Loizou, P.C.: Speech Enhancement: Theory and Practice. CRC (2007)
Mika, S., Schölkopf, B., Smola, A., Müller, K.R., Scholz, M., Rätsch, G.: Kernel PCA and De-Noising in Feature Spaces. In: Advances in Neural Information Processing Systems, vol. 11, pp. 536–542 (1999)
Schölkopf, B., Smola, A., Müller, K.R.: Nonlinear Component Analysis as a Kernel Eigenvalue Problem. Tech. rep., Max Planck Institute for Biological Cybernetics (1996)
Zölzer, U. (ed.): DAFX - Digital Audio Effects. John Wiley & Sons (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Leitner, C., Pernkopf, F. (2011). The Pre-image Problem and Kernel PCA for Speech Enhancement. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science(), vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-25020-0_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25019-4
Online ISBN: 978-3-642-25020-0
eBook Packages: Computer ScienceComputer Science (R0)