Safe Robot Learning by Energy Limitation

Albrektsen, Sigurd Mørkved; Fjerdingen, Sigurd Aksnes

doi:10.1007/978-3-642-33503-7_22

Sigurd Mørkved Albrektsen²² &
Sigurd Aksnes Fjerdingen²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7508))

Included in the following conference series:

International Conference on Intelligent Robotics and Applications

3568 Accesses

Abstract

Online robot learning has been a goal for researchers for several decades. A problem arises when learning algorithms need to explore the environment as actions cannot easily be anticipated. Because of this, safety is a major issue when using learning algorithms.

This paper presents a framework for safe robot learning by the use of region-classification and energy limitation. The main task of the framework is to ensure safety regardless of a learning algorithm’s input to a system. This is necessary to allow a learning robot to explore environments without damaging itself or its surroundings. To ensure safety, the state-space is divided into fatal, supercritical, critical and safe regions, depending on the energy of the system.

To show the adaptability of the framework it is used on two different systems; an actuated swinging pendulum and a mobile platform. In both cases obstacles with unknown locations must are avoided successfully.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Connell, J.H., Mahadevan, S.: Introduction to Robot Learning. Springer (1993)
Google Scholar
Olivier Chapelle, A.Z., Schölkopf, B.: Semi-Supervised Learning. The MIT Press (2006)
Google Scholar
Gelly, S., Silver, D.: Combining online and offline knowledge in uct. In: Proceedings of the 24th International Conference on Machine Learning, ICML 2007, pp. 273–280. ACM, New York (2007)
Chapter Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement learning. Journal of Cognitive Neuroscience 11(1), 126–130 (1999)
Article Google Scholar
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Gillula, J.H., Tomlin, C.J.: Guaranteed safe online learning of a bounded system. In: IROS 2011, pp. 2979–2984 (September 2011)
Google Scholar
Hans, A., Schneegaß, D., Schäfer, A.M., Udluft, S.: Safe exploration for reinforcement learning. In: European Symposium on Artificial Neural Network, pp. 143–148 (April 2008)
Google Scholar
Fjerdingen, S.A., Kyrkjebø, E.: Safe reinforcement learning for continuous spaces through Lyapunov-constrained behavior. In: Frontiers in Artificial Intelligence and Applications, pp. 70–79 (May 2011)
Google Scholar
Perkins, T.J., Barto, A.G.: Lyapunov design for safe reinforcement learning. J. Mach. Learn. Res. 3, 803–832 (2003)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Applied Cybernetics, SINTEF ICT, Norway
Sigurd Mørkved Albrektsen & Sigurd Aksnes Fjerdingen

Authors

Sigurd Mørkved Albrektsen
View author publications
You can also search for this author in PubMed Google Scholar
Sigurd Aksnes Fjerdingen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mechanical and Industrial Engineering, Concordia University, H3G 1M8, Montreal, Quebec, Canada
Chun-Yi Su
Department of Mechanical and Industrial Engineering, Concordia University, H3G 1M8, Montral, Quebec, Canada
Subhash Rakheja
School of Creative Technologies, The University of Portsmouth, PO1 2DJ, Portsmouth, UK
Honghai Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Albrektsen, S.M., Fjerdingen, S.A. (2012). Safe Robot Learning by Energy Limitation. In: Su, CY., Rakheja, S., Liu, H. (eds) Intelligent Robotics and Applications. ICIRA 2012. Lecture Notes in Computer Science(), vol 7508. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33503-7_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-33503-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33502-0
Online ISBN: 978-3-642-33503-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics