
Learning Convex Combinations of Continuously Parameterized Basic Kernels

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3559)

Abstract

We study the problem of learning a kernel which minimizes a regularization error functional such as that used in regularization networks or support vector machines. We consider this problem when the kernel is in the convex hull of basic kernels, for example, Gaussian kernels which are continuously parameterized by a compact set. We show that there always exists an optimal kernel which is the convex combination of at most m+1 basic kernels, where m is the sample size, and provide a necessary and sufficient condition for a kernel to be optimal. The proof of our results is constructive and leads to a greedy algorithm for learning the kernel. We discuss the properties of this algorithm and present some preliminary numerical simulations.

This work was supported by EPSRC Grant GR/T18707/01, NSF Grant ITR-0312113 and the PASCAL European Network of Excellence.
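To make the greedy procedure described in the abstract concrete, here is a minimal sketch, not the authors' algorithm: it assumes kernel ridge regression as the regularization functional, Gaussian basic kernels K_sigma(x, y) = exp(-sigma * ||x - y||^2) with sigma drawn from a finite grid standing in for the compact parameter set, and a crude grid line search over the mixing parameter. The names gaussian_kernel, greedy_kernel_learning, mu, and n_steps are all hypothetical.

    import numpy as np

    def gaussian_kernel(X, sigma):
        # Gram matrix of the basic kernel K_sigma(x, y) = exp(-sigma * ||x - y||^2).
        sq = np.sum(X ** 2, axis=1)
        d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)
        return np.exp(-sigma * d2)

    def objective(K, y, mu):
        # Regularized empirical error for kernel ridge regression:
        # min_c ||y - K c||^2 + mu * c' K c  =  mu * y' (K + mu I)^{-1} y.
        m = len(y)
        return mu * y @ np.linalg.solve(K + mu * np.eye(m), y)

    def greedy_kernel_learning(X, y, sigma_grid, mu=0.1, n_steps=20):
        # Greedily build a convex combination of Gaussian basic kernels.
        m = len(y)
        weights = {sigma_grid[0]: 1.0}           # start from a single basic kernel
        K = gaussian_kernel(X, sigma_grid[0])
        for _ in range(n_steps):
            # Solve the regularization problem for the current kernel.
            c = np.linalg.solve(K + mu * np.eye(m), y)
            # The gradient of the objective with respect to K is -mu * c c',
            # so the steepest-descent basic kernel maximizes c' K_sigma c.
            s_best = max(sigma_grid, key=lambda s: c @ gaussian_kernel(X, s) @ c)
            K_best = gaussian_kernel(X, s_best)
            # Line search over the mixing parameter lambda in [0, 1].
            lams = np.linspace(0.0, 1.0, 51)
            vals = [objective((1 - lam) * K + lam * K_best, y, mu) for lam in lams]
            lam = lams[int(np.argmin(vals))]
            if lam == 0.0:
                break   # no basic kernel on the grid improves the objective
            K = (1 - lam) * K + lam * K_best
            weights = {s: (1 - lam) * w for s, w in weights.items()}
            weights[s_best] = weights.get(s_best, 0.0) + lam
        return K, weights

Usage would look like K, w = greedy_kernel_learning(X, y, np.linspace(0.01, 10.0, 100)). The stopping test loosely mirrors the optimality condition mentioned in the abstract: on the grid, the current combination is optimal when no basic kernel K_sigma yields a descent direction, that is, when mixing in any K_sigma fails to lower the regularized error.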




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Argyriou, A., Micchelli, C.A., Pontil, M. (2005). Learning Convex Combinations of Continuously Parameterized Basic Kernels. In: Auer, P., Meir, R. (eds) Learning Theory. COLT 2005. Lecture Notes in Computer Science (LNAI), vol 3559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11503415_23


  • DOI: https://doi.org/10.1007/11503415_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26556-6

  • Online ISBN: 978-3-540-31892-7

  • eBook Packages: Computer Science (R0)
