Aggregation by Exponential Weighting and Sharp Oracle Inequalities

Dalalyan, Arnak S.; Tsybakov, Alexandre B.

doi:10.1007/978-3-540-72927-3_9


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4539)

Included in the following conference series:

International Conference on Computational Learning Theory


Abstract

In the present paper, we study the problem of aggregation under the squared loss in the model of regression with deterministic design. We obtain sharp oracle inequalities for convex aggregates defined via exponential weights, under general assumptions on the distribution of errors and on the functions to aggregate. We show how these results can be applied to derive a sparsity oracle inequality.
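As a rough illustration of what a convex aggregate defined via exponential weights can look like in regression with deterministic design, the sketch below weights each candidate function by a prior mass multiplied by exp(-RSS/beta), where RSS is the candidate's residual sum of squares on the observed responses. This is not taken from the paper: the function names, the uniform default prior, and the temperature parameter `beta` are assumptions made for illustration, and the abstract does not state the exact formula or how the temperature should be chosen.

```python
import math


def exponential_weights(y, candidates, beta, prior=None):
    """Return convex weights over candidate predictions (illustrative sketch).

    y          : list of n observed responses at the fixed design points
    candidates : list of M lists, candidates[j][i] = value of candidate j at point i
    beta       : temperature (> 0); a hypothetical tuning constant
    prior      : optional list of M strictly positive prior weights (uniform if None)
    """
    m = len(candidates)
    if prior is None:
        prior = [1.0 / m] * m
    # Residual sum of squares of each candidate on the observed data.
    rss = [sum((yi - fi) ** 2 for yi, fi in zip(y, f)) for f in candidates]
    # Log of the unnormalised weights: log(prior) - RSS / beta.
    log_w = [math.log(p) - r / beta for p, r in zip(prior, rss)]
    # Subtract the maximum before exponentiating to avoid underflow.
    top = max(log_w)
    w = [math.exp(v - top) for v in log_w]
    total = sum(w)
    return [wi / total for wi in w]


def aggregate(candidates, weights):
    """Convex combination of the candidates, evaluated at each design point."""
    n = len(candidates[0])
    return [sum(w * f[i] for w, f in zip(weights, candidates)) for i in range(n)]


# Toy usage: two candidates, the first of which fits the data exactly.
y = [1.0, 2.0, 3.0]
candidates = [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]]
weights = exponential_weights(y, candidates, beta=4.0)
fitted = aggregate(candidates, weights)
```

Because the weights are nonnegative and sum to one, the result is always a convex combination of the candidates. A smaller `beta` concentrates the weight on the best-fitting candidate, while a larger `beta` keeps the weights closer to the prior.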



Author information

Authors and Affiliations

University of Paris 6, 4, Place Jussieu, 75252 Paris cedex 05, France
Arnak S. Dalalyan & Alexandre B. Tsybakov


Editor information

Nader H. Bshouty, Claudio Gentile


About this paper

Cite this paper

Dalalyan, A.S., Tsybakov, A.B. (2007). Aggregation by Exponential Weighting and Sharp Oracle Inequalities. In: Bshouty, N.H., Gentile, C. (eds) Learning Theory. COLT 2007. Lecture Notes in Computer Science, vol 4539. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72927-3_9


DOI: https://doi.org/10.1007/978-3-540-72927-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72925-9
Online ISBN: 978-3-540-72927-3
eBook Packages: Computer Science, Computer Science (R0)
