Abstract
We address classification problems in which the number of input components (variables, features) is very large compared to the number of training samples. Such problems arise in Internet applications such as text filtering, in biomedical applications such as medical diagnosis from genomic or proteomic data, and in drug screening from combinatorial chemistry data. In this setting it is often desirable to perform feature selection to reduce the number of inputs, whether for efficiency, for performance, or to gain understanding of the data and the classifiers. We compare a number of methods on mass-spectrometric data of serum proteins from asymptomatic patients and prostate cancer patients. We show empirical evidence that, in spite of the high risk of overfitting, non-linear methods can outperform linear methods, both in predictive performance and in the number of features selected.
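To make the comparison concrete, the following is a minimal sketch of the kind of experiment described above: a linear feature-selection pipeline (recursive feature elimination driven by a linear SVM) contrasted with a non-linear pipeline (a univariate filter followed by an RBF-kernel SVM) on high-dimensional, small-sample data. It is not the chapter's exact protocol; scikit-learn, the synthetic data, and all parameter values are assumptions introduced for illustration only.

# Hypothetical sketch; library choices and parameters are assumptions,
# not taken from the chapter.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Many input components, few training samples: the regime discussed above.
X, y = make_classification(n_samples=100, n_features=1000, n_informative=10,
                           random_state=0)

# Linear baseline: recursive feature elimination driven by a linear SVM.
linear_pipe = Pipeline([
    ("scale", StandardScaler()),
    ("rfe", RFE(SVC(kernel="linear", C=1.0), n_features_to_select=20, step=0.1)),
    ("clf", SVC(kernel="linear", C=1.0)),
])

# Non-linear alternative: univariate filter followed by an RBF-kernel SVM.
nonlinear_pipe = Pipeline([
    ("scale", StandardScaler()),
    ("filter", SelectKBest(f_classif, k=20)),
    ("clf", SVC(kernel="rbf", C=1.0, gamma="scale")),
])

# Cross-validated accuracy for both pipelines.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for name, pipe in [("linear + RFE", linear_pipe), ("RBF + filter", nonlinear_pipe)]:
    scores = cross_val_score(pipe, X, y, cv=cv)
    print(f"{name}: accuracy {scores.mean():.3f} +/- {scores.std():.3f}")

With so few samples relative to features, the cross-validation folds themselves are noisy, which is why any claim that one family of methods outperforms the other needs careful statistical treatment of the kind the chapter undertakes.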
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Guyon, I., Bitter, HM., Ahmed, Z., Brown, M., Heller, J. (2005). Multivariate Non-Linear Feature Selection with Kernel Methods. In: Nikravesh, M., Zadeh, L.A., Kacprzyk, J. (eds) Soft Computing for Information Processing and Analysis. Studies in Fuzziness and Soft Computing, vol 164. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32365-1_12
DOI: https://doi.org/10.1007/3-540-32365-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22930-8
Online ISBN: 978-3-540-32365-5
eBook Packages: Engineering, Engineering (R0)