Deep Reinforcement Learning pp 127-140 | Cite as
Policy-Based Reinforcement Learning Approaches
Stochastic Policy Gradient and the REINFORCE Algorithm
Chapter
First Online:
- 3.7k Downloads
Abstract
In this chapter, we will cover the basics of the policy-based approaches especially the policy gradient-based approaches. We will understand why policy-based approaches are superior to that of value-based approaches under some circumstances and why they are also tough to implement. We will subsequently cover some simplifications that will help make policy-based approaches practical to implement and also cover the REINFORCE algorithm.
Copyright information
© Springer Nature Singapore Pte Ltd. 2019