Prediction with Expert Advice under Discounted Loss

Chernov, Alexey; Zhdanov, Fedor

doi:10.1007/978-3-642-16108-7_22

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6331)

Abstract

We study prediction with expert advice in the setting where losses are accumulated with discounting, so that the impact of old losses may gradually vanish. We generalize the Aggregating Algorithm and the Aggregating Algorithm for Regression, propose a new variant of the exponentially weighted average algorithm, and prove bounds on the cumulative discounted loss.
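
To make the notion of discounted loss concrete, here is a minimal sketch of a generic exponentially weighted average forecaster whose weights depend on discounted cumulative losses. It is an illustration only and not the algorithms of the paper: the constant discount factor `alpha`, the learning rate `eta`, the squared loss, the recursion L_t = alpha * L_{t-1} + loss_t, and all function names are assumptions made for this example.

```python
import math


def discounted_loss(losses, alpha):
    """Cumulative discounted loss via the recursion L_t = alpha * L_{t-1} + loss_t.

    Assumes a constant discount factor alpha in (0, 1]; alpha = 1 gives the
    ordinary (undiscounted) cumulative loss.
    """
    total = 0.0
    for loss in losses:
        total = alpha * total + loss
    return total


def ewa_predict(expert_preds, expert_losses, eta):
    """Average the experts' predictions with weights proportional to
    exp(-eta * discounted loss)."""
    # Subtracting the smallest loss keeps the exponentials numerically stable
    # and does not change the normalized weights.
    min_loss = min(expert_losses)
    weights = [math.exp(-eta * (loss - min_loss)) for loss in expert_losses]
    norm = sum(weights)
    return sum(w * p for w, p in zip(weights, expert_preds)) / norm


def run(expert_preds_seq, outcomes, alpha=0.9, eta=1.0):
    """Play the prediction game under squared loss (an assumption of this sketch).

    Returns the learner's discounted loss and the list of the experts'
    discounted losses after the last round.
    """
    n_experts = len(expert_preds_seq[0])
    expert_totals = [0.0] * n_experts
    learner_total = 0.0
    for preds, outcome in zip(expert_preds_seq, outcomes):
        prediction = ewa_predict(preds, expert_totals, eta)
        # Old losses are shrunk by alpha before the new loss is added.
        learner_total = alpha * learner_total + (prediction - outcome) ** 2
        expert_totals = [
            alpha * total + (p - outcome) ** 2
            for total, p in zip(expert_totals, preds)
        ]
    return learner_total, expert_totals
```

With `alpha = 1` the sketch reduces to the usual undiscounted forecaster; smaller values of `alpha` make the weights track recent performance more closely.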

Author information

Authors and Affiliations

Computer Learning Research Centre and Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, TW20 0EX, UK
Alexey Chernov & Fedor Zhdanov

Editor information

Editors and Affiliations

Research School of Information Sciences and Engineering, Australian National University and NICTA, 0200, Canberra, ACT, Australia
Marcus Hutter
Department of Mathematics, National University of Singapore, Block S17, 10 Lower Kent Ridge Road, 119076, Singapore, Republic of Singapore
Frank Stephan
Department of Computer Science, University of London, Royal Holloway, TW20 0EX, Egham, Surrey, UK
Vladimir Vovk
Division of Computer Science, Hokkaido University, N-14, W-9, Sapporo, 060-0814, Japan
Thomas Zeugmann

About this paper

Cite this paper

Chernov, A., Zhdanov, F. (2010). Prediction with Expert Advice under Discounted Loss. In: Hutter, M., Stephan, F., Vovk, V., Zeugmann, T. (eds) Algorithmic Learning Theory. ALT 2010. Lecture Notes in Computer Science, vol 6331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16108-7_22

DOI: https://doi.org/10.1007/978-3-642-16108-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16107-0
Online ISBN: 978-3-642-16108-7
eBook Packages: Computer Science, Computer Science (R0)
