Solving Safety Problems with Ensemble Reinforcement Learning

Ferreira, Leonardo A.; dos Santos, Thiago F.; Bianchi, Reinaldo A. C.; Santos, Paulo E.

doi:10.1007/978-3-030-35288-2_17


Conference paper
First Online: 25 November 2019

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11919)

Abstract

An agent that learns by interacting with an environment may find unexpected solutions to decision-making problems. Such a solution can be an improvement over well-known ones, such as a new strategy for a game, but in some cases the unexpected solution is unwanted and should be avoided for reasons such as safety. This paper proposes a Reinforcement Learning Ensemble Framework called ReLeEF. The framework combines decision-making methods to provide finer-grained control of the agent’s behaviour while still letting it learn by interacting with the environment. It was tested in the safety gridworlds, and the results show that it can find optimal solutions while satisfying the safety concerns described for each domain, something that state-of-the-art Deep Reinforcement Learning methods were unable to do.

L. A. Ferreira—Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001.

T. F. dos Santos and P. E. Santos—FAPESP-IBM Process number 17/07833-9.

R. A. C. Bianchi—FAPESP Process number 2019/07665-4.



Author information

Authors and Affiliations

Centro Universitário FEI, São Bernardo do Campo, SP, Brazil
Leonardo A. Ferreira, Thiago F. dos Santos, Reinaldo A. C. Bianchi & Paulo E. Santos
College of Science and Engineering, Flinders University, Adelaide, Australia
Paulo E. Santos

Corresponding author

Correspondence to Leonardo A. Ferreira.

Editor information

Editors and Affiliations

University of South Australia, Adelaide, SA, Australia
Jixue Liu
The University of Melbourne, Melbourne, VIC, Australia
James Bailey

About this paper

Cite this paper

Ferreira, L.A., dos Santos, T.F., Bianchi, R.A.C., Santos, P.E. (2019). Solving Safety Problems with Ensemble Reinforcement Learning. In: Liu, J., Bailey, J. (eds) AI 2019: Advances in Artificial Intelligence. AI 2019. Lecture Notes in Computer Science, vol 11919. Springer, Cham. https://doi.org/10.1007/978-3-030-35288-2_17

DOI: https://doi.org/10.1007/978-3-030-35288-2_17
Published: 25 November 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35287-5
Online ISBN: 978-3-030-35288-2
eBook Packages: Computer Science, Computer Science (R0)
