Exploration and Exploitation in Online Learning

Auer, Peter

doi:10.1007/978-3-642-23857-4_2

Peter Auer²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6943))

Included in the following conference series:

International Conference on Adaptive and Intelligent Systems

2001 Accesses

Abstract

Online learning does not distinguish between a training and an evaluation phase of learning, but considers learning as an ongoing process, such that learning algorithms need to perform and make predictions while they learn. After reviewing the online learning model and some algorithms, I will consider variants of the model where only partial information is revealed to the learner, in particular the bandit problem and reinforcement learning. The uncertainty of the learner caused by receiving only partial information, leads to an exploration-exploitation dilemma: is further information needed, or can the available information already be exploited? I will discuss how optimism in the face of uncertainty can address this dilemma in many cases.

Download to read the full chapter text

Chapter PDF

Reinforcement Learning Algorithms: Categorization and Structural Properties

A Survey of Preference-Based Online Learning with Bandit Algorithms

Reinforcement Learning

Author information

Authors and Affiliations

Chair for Information Technology, University of Leoben, Austria
Peter Auer

Authors

Peter Auer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics-Systems, University of Klagenfurt, Universitätsstr. 65-67, 9020, Klagenfurt, Austria
Abdelhamid Bouchachia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Auer, P. (2011). Exploration and Exploitation in Online Learning. In: Bouchachia, A. (eds) Adaptive and Intelligent Systems. ICAIS 2011. Lecture Notes in Computer Science(), vol 6943. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23857-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-23857-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23856-7
Online ISBN: 978-3-642-23857-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Exploration and Exploitation in Online Learning

Abstract

Chapter PDF

Similar content being viewed by others

Reinforcement Learning Algorithms: Categorization and Structural Properties

A Survey of Preference-Based Online Learning with Bandit Algorithms

Reinforcement Learning

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Exploration and Exploitation in Online Learning

Abstract

Chapter PDF

Similar content being viewed by others

Reinforcement Learning Algorithms: Categorization and Structural Properties

A Survey of Preference-Based Online Learning with Bandit Algorithms

Reinforcement Learning

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation