Mathematical and Algorithmic Understanding of Reinforcement Learning

Sewak, Mohit

doi:10.1007/978-981-13-8285-7_2

Abstract

In this chapter, we discuss the Bellman equation and the Markov Decision Process (MDP), which form the basis for almost all of the approaches that we discuss later. We then discuss some of the non-model-based approaches to Reinforcement Learning, such as Dynamic Programming. It is imperative to understand these concepts before moving on to the more advanced topics ahead. Finally, we cover algorithms such as value iteration and policy iteration for solving the MDP.

Author information

Authors and Affiliations

Pune, Maharashtra, India
Mohit Sewak

Corresponding author

Correspondence to Mohit Sewak.

About this chapter

Cite this chapter

Sewak, M. (2019). Mathematical and Algorithmic Understanding of Reinforcement Learning. In: Deep Reinforcement Learning. Springer, Singapore. https://doi.org/10.1007/978-981-13-8285-7_2

DOI: https://doi.org/10.1007/978-981-13-8285-7_2
Published: 28 June 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8284-0
Online ISBN: 978-981-13-8285-7
eBook Packages: Computer Science, Computer Science (R0)
