Skip to main content

The Pre-image Problem and Kernel PCA for Speech Enhancement

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7015))

Abstract

In this paper, we use kernel principal component analysis (kPCA) for speech enhancement. To synthesize the de-noised audio signal we rely on an iterative pre-image method. In order to gain better understanding about the pre-image step we performed experiments with different pre-image methods, first on synthetic data and then on audio data. The results of these experiments led to a reduction of artifacts in the original speech enhancement method, tested on speech corrupted by additive white Gaussian noise at several SNR levels. The evaluation with perceptually motivated quality measures confirms the improvement.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abrahamsen, T.J., Hansen, L.K.: Input Space Regularization Stabilizes Pre-Images for Kernel PCA De-Noising. In: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (2009)

    Google Scholar 

  2. Berouti, M., Schwartz, M., Makhoul, J.: Enhancement of Speech Corrupted by Acoustic Noise. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 1979), pp. 208–211 (1979)

    Google Scholar 

  3. Griffin, D., Lim, J.: Signal Estimation from Modified Short-Time Fourier Transform. IEEE Transactions on Acoustics, Speech and Signal Processing 32(2), 236–243 (1984)

    Article  Google Scholar 

  4. Honeine, P., Richard, C.: Solving the Pre-Image Problem in Kernel Machines: A Direct Method. In: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) (2009)

    Google Scholar 

  5. Hu, Y., Loizou, P.: Evaluation of Objective Quality Measures for Speech Enhancement. IEEE Transactions on Audio, Speech, and Language Processing 16(1), 229–238 (2008)

    Article  Google Scholar 

  6. Hu, Y., Loizou, P.C.: A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise. IEEE Transactions on Speech and Audio Processing 11, 334–341 (2003)

    Article  Google Scholar 

  7. Kwok, J.T., Tsang, I.W.: The Pre-Image Problem in Kernel Methods. IEEE Transactions on Neural Networks 15, 408–415 (2004)

    Article  Google Scholar 

  8. Leitner, C., Pernkopf, F., Kubin, G.: Kernel PCA for Speech Enhancement. In: Interspeech 2011 (accepted, 2011)

    Google Scholar 

  9. Loizou, P.C.: Speech Enhancement: Theory and Practice. CRC (2007)

    Google Scholar 

  10. Mika, S., Schölkopf, B., Smola, A., Müller, K.R., Scholz, M., Rätsch, G.: Kernel PCA and De-Noising in Feature Spaces. In: Advances in Neural Information Processing Systems, vol. 11, pp. 536–542 (1999)

    Google Scholar 

  11. Schölkopf, B., Smola, A., Müller, K.R.: Nonlinear Component Analysis as a Kernel Eigenvalue Problem. Tech. rep., Max Planck Institute for Biological Cybernetics (1996)

    Google Scholar 

  12. Zölzer, U. (ed.): DAFX - Digital Audio Effects. John Wiley & Sons (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leitner, C., Pernkopf, F. (2011). The Pre-image Problem and Kernel PCA for Speech Enhancement. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science(), vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25020-0_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25019-4

  • Online ISBN: 978-3-642-25020-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics