Inverse Reinforcement Learning

Abbeel, Pieter; Ng, Andrew Y.

doi:10.1007/978-1-4899-7502-7_142-1

Pieter Abbeel³ &
Andrew Y. Ng⁴

618 Accesses
2 Citations

Synonyms

Intent recognition; Inverse optimal control; Plan recognition

Definition

Inverse reinforcement learning (inverse RL) considers the problem of extracting a reward function from observed (nearly) optimal behavior of an expert acting in an environment.

Motivation and Background

The motivation for inverse RL is twofold:

For many RL applications, it is difficult to write down an explicit reward function specifying how different desiderata should be traded off exactly. In fact, engineers often spend significant effort tweaking the reward function such that the optimal policy corresponds to performing the task they have in mind. For example, consider the task of driving a car well. Various desiderata have to be traded off, such as speed, following distance, lane preference, frequency of lane changes, distance from the curb, etc. Specifying the reward function for the task of driving requires explicitly writing down the trade-off between these features.
Inverse RL algorithms provide...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Author information

Authors and Affiliations

EECS Department, UC Berkeley, Stanford, CA, USA
Pieter Abbeel
CS Department, Stanford University, Stanford, CA, USA
Andrew Y. Ng

Authors

Pieter Abbeel
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Y. Ng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Pieter Abbeel or Andrew Y. Ng .

Editor information

Editors and Affiliations

Engineering (CSE), University of New South Wales School of Computer Science &, Sydney, New South Wales, Australia
Claude Sammut
Software Engineering, Monash University School of Computer Science &, Melbourne, Victoria, Australia
Geoffrey I. Webb

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Abbeel, P., Ng, A.Y. (2016). Inverse Reinforcement Learning. In: Sammut, C., Webb, G. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7502-7_142-1

Download citation

DOI: https://doi.org/10.1007/978-1-4899-7502-7_142-1
Received: 02 September 2014
Accepted: 21 June 2016
Published: 05 August 2016
Publisher Name: Springer, Boston, MA
Online ISBN: 978-1-4899-7502-7
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics

Inverse Reinforcement Learning

Synonyms

Definition

Motivation and Background

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Navigation

Inverse Reinforcement Learning

Synonyms

Definition

Motivation and Background

Access this chapter

Recommended Reading

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this entry

Cite this entry

Download citation

Publish with us

Search

Navigation