Deep Reinforcement Learning pp 109-126 | Cite as
Double DQN in Code
Coding the DDQN with Epsilon-Decay Behavior Policy
Chapter
First Online:
- 3.8k Downloads
Abstract
In this chapter, we will implement the Double DQN (DDQN) agent in code. As compared to a conventional DQN, the DDQN agent is more stable as it uses a dedicated target network which remains relatively stable. We also put into practice the concepts of MLP-DNN we learnt in Chap. 6 and have used Keras and TensorFlow for our deep learning models. We have also used the OpenAI gym for instantiating standardized environments to train and test out agents. We use the CartPole environment from the gym for training our model.
Copyright information
© Springer Nature Singapore Pte Ltd. 2019