Despite the distinct challenges in the last chapter, the experimental studies confirmed that ACS2 is normally able to evolve a compact, complete, and accurate model of an environment. While the experiments in chapter 3 mainly showed the performance of ACS2 in terms of its model and reinforcement learning capabilities, this chapter exhibits experiments in which performance is increased by exploiting the environmental model. Particularly, promising results of two model-exploitation types are provided. (1) A complete and accurate model is evolved faster and more reliable. (2) A further adaptivity beyond the usual reinforcement learning capabilities is realized.
KeywordsReinforcement Learning Model Exploitation Environmental Model Reward Prediction Unknown Region
Unable to display preview. Download preview PDF.