Abstract
Service robots are becoming increasingly available, and they are expected to take part in many human activities in the near future. It is desirable for these robots to adapt to the user’s needs, so non-expert users will have to teach them how to perform new tasks in natural ways. This paper describes a new teaching-by-demonstration algorithm. It uses a Kinect® sensor to track the user’s movements, eliminating the need for special sensors or environment conditions; it represents tasks relationally, easing the correspondence problem between the user and the robot arm and allowing tasks to be learned at a more general level of description; it uses reinforcement learning to improve on the initial sequences provided by the user; and it incorporates on-line feedback from the user during learning, creating a novel dynamic reward-shaping mechanism that converges faster to an optimal policy. We demonstrate the approach on simple manipulation tasks with a robot arm and show its superiority over more traditional reinforcement learning algorithms.
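The reward-shaping idea in the abstract can be illustrated with a minimal sketch. This is not the authors’ implementation: the environment (a one-dimensional corridor), the simulated `user_feedback` function, and all hyperparameters are hypothetical. It shows the general mechanism of folding on-line user feedback into the reward signal of tabular Q-learning as a shaping term.

```python
import random

# Hypothetical toy task: a corridor of states 0..5, goal at state 5.
N_STATES = 6
ACTIONS = [-1, +1]          # move left / move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1

def user_feedback(state, action):
    """Simulated on-line feedback: the 'user' approves moves toward the goal.
    In the paper's setting this would come from a human during learning."""
    return 0.5 if action == +1 else -0.5

def train(episodes=200, seed=0):
    rng = random.Random(seed)
    Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while s != N_STATES - 1:
            # epsilon-greedy action selection
            if rng.random() < EPSILON:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda a: Q[(s, a)])
            s2 = min(max(s + a, 0), N_STATES - 1)
            r = 1.0 if s2 == N_STATES - 1 else 0.0
            # Dynamic reward shaping: add the user's feedback to the
            # environment reward before the standard Q-learning update.
            shaped = r + user_feedback(s, a)
            Q[(s, a)] += ALPHA * (
                shaped + GAMMA * max(Q[(s2, b)] for b in ACTIONS) - Q[(s, a)]
            )
            s = s2
    return Q

Q = train()
policy = [max(ACTIONS, key=lambda a: Q[(s, a)]) for s in range(N_STATES - 1)]
print(policy)
```

Because the shaping term rewards goal-directed actions immediately, the greedy policy moves right in every non-terminal state well before the delayed environment reward alone would have propagated back, which is the intuition behind faster convergence under feedback-based shaping.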
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
Cite this paper
León, A., Morales, E.F., Altamirano, L., Ruiz, J.R. (2011). Teaching a Robot to Perform Task through Imitation and On-line Feedback. In: San Martin, C., Kim, SW. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2011. Lecture Notes in Computer Science, vol 7042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25085-9_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25084-2
Online ISBN: 978-3-642-25085-9