Skip to main content

Online Prediction Problems with Variation

  • Conference paper
Computing and Combinatorics (COCOON 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8591))

Included in the following conference series:

Abstract

We study the prediction with expert advice problem, where in each round, the player selects one of N actions and incurs the corresponding loss according to an N-dimensional linear loss vector, and aim to minimize the regret. In this paper, we consider a new measure of the loss functions, which we call L  ∞  -variation. Consider the loss functions with small L  ∞ -variation, if the player is allowed to have some information related to the variation in each round, we can obtain an online bandit algorithm for the problem without using the self-concordance methodology, which conditionally answers an open problem in [8]. Another related problem is the combinatorial prediction game, in which the set of actions is a subset of {0,1}d, and the loss function is in [–1,1]d. We provide an online algorithm in the semi-bandit setting when the loss functions have small L  ∞ -variation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abernethy, J., Hazan, E., Rakhlin, A.: Competing in the dark: An efficient algorithm for bandit linear optimization. In: COLT, pp. 263–274 (2008)

    Google Scholar 

  2. Audibert, J.-Y., Bubeck, S.: Regret Bounds and Minimax Policies under Partial Monitoring. Journal of Machine Learning Research 11, 2635–2686 (2010)

    MathSciNet  Google Scholar 

  3. Audibert, J.-Y., Bubeck, S., Lugosi, G.: Minimax Policies for Combinatorial Prediction Games. In: COLT, pp. 107–132 (2011)

    Google Scholar 

  4. Chiang, C.-K., Yang, T., Lee, C.-J., Mahdavi, M., Lu, C.-J., Jin, R., Zhu, S.: Online optimization with gradual variations. In: COLT, pp. 6.1–6.20 (2012)

    Google Scholar 

  5. Dani, V., Hayes, T., Kakade, S.M.: The Price of Bandit Information for Online Optimization. In: NIPS, pp. 345–352 (2008)

    Google Scholar 

  6. Freund, Y., Schapire, R.E.: A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)

    Article  MATH  MathSciNet  Google Scholar 

  7. Hazan, E., Kale, S.: Extracting certainty from uncertainty: Regret bounded by variation in costs. Machine Learning 80(2-3), 165–188 (2010)

    Article  MathSciNet  Google Scholar 

  8. Hazan, E., Kale, S.: Better Algorithms for Benign Bandits. Journal of Machine Learning Research 12, 1287–1311 (2011)

    MATH  MathSciNet  Google Scholar 

  9. Littlestone, N., Warmuth, M.K.: The Weighted Majority Algorithm. Inf. Comput. 108(2), 212–261 (1994)

    Article  MATH  MathSciNet  Google Scholar 

  10. Vitter, J.S.: Random sampling with a reservoir. ACM Trans. Math. Softw. 11(1), 37–57 (1985)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lee, CJ., Tsai, SC., Yang, MC. (2014). Online Prediction Problems with Variation. In: Cai, Z., Zelikovsky, A., Bourgeois, A. (eds) Computing and Combinatorics. COCOON 2014. Lecture Notes in Computer Science, vol 8591. Springer, Cham. https://doi.org/10.1007/978-3-319-08783-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08783-2_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08782-5

  • Online ISBN: 978-3-319-08783-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics