Q-Learning in Code

Coding the Off-Policy Q-Learning Agent and Behavior Policy
  • Mohit SewakEmail author


In this chapter, we would put what we have learnt on Q-Learning in the last chapter in code. We would implement a Q-Table-based Off-Policy Q-Learning agent class, and to complement with a behavior policy, we would implement another class on Behavior Policy with an implementation of the epsilon-greedy algorithm.

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.PuneIndia

Personalised recommendations