Autonomous Design of Experiments for Learning by Experimentation

Prassler, Erwin; Kahl, Björn; Henne, Timo; Juarez, Alex; Reggianni, Monica

doi:10.1007/978-3-7908-2127-7_13

Erwin Prassler²,
Björn Kahl²,
Timo Henne²,
Alex Juarez² &
…
Monica Reggianni³

1928 Accesses
1 Citations

Abstract

In Artificial Intelligence, numerous learning paradigms have been developed over the past decades. In most cases of embodied and situated agents, the learning goal for the artificial agent is to „map“ or classify the environment and the objects therein [1, 2], in order to improve navigation or the execution of some other domain-specific task. Dynamic environments and changing tasks still pose a major challenge for robotic learning in real-world domains. In order to intelligently adapt its task strategies, the agent needs cognitive abilities to more deeply understand its environment and the effects of its actions. In order to approach this challenge within an open-ended learning loop, the XPERO project (http://www.xpero.org) explores the paradigm of Learning by Experimentation to increase the robot's conceptual world knowledge autonomously. In this setting, tasks which are selected by an actionselection mechanism are interrupted by a learning loop in those cases where the robot identifies learning as necessary for solving a task or for explaining observations. It is important to note that our approach targets unsupervised learning, since there is no oracle available to the agent, nor does it have access to a reward function providing direct feedback on the quality of its learned model, as e.g. in reinforcement learning approaches. In the following sections we present our framework for integrating autonomous robotic experimentation into such a learning loop. In section 1 we explain the different modules for stimulation and design of experiments and their interaction. In section 2 we describe our implementation of these modules and how we applied them to a real world scenario to gather target-oriented data for learning conceptual knowledge. There we also indicate how the goaloriented data generation enables machine learning algorithms to revise the failed prediction model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Hardcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

D.F. Wolf and G.S. Sukhatme, ‘Mobile robot simultaneous localization and mapping in dynamic environments’, Autonomous Robots, 19(1), 53–65, (2005).
Article Google Scholar
S. Petti and T. Fraichard, ‘Safe motion planning in dynamic environments’, Intelligent Robots and Systems (IROS). August, 2005.
Google Scholar
W.-M. Shen, ‘The process of discovery’, Foundations of Science, 1(2), 233–251, (1995).
Google Scholar
P.C.-H. Cheng, ‘Modelling experiments in scientific discovery’, in International Joint Conferences on Artificial Intelligence, IJCAI, pp. 739–745, (1991).
Google Scholar
W.-M. Shen, ‘Discovery as autonomous learning from the environment’, Machine Learning, 12(1), 143–165, (1993).
Google Scholar
M. Peters, ‘Towards artificial forms of intelligence, creativity, and surprise.’, in Proceedings of the 20 th Meeting of the Cognitive Science Society, pp. 836–841, (1998).
Google Scholar
M. Peters and A. Sowmya, ‘Active vision and adaptive learning’, in Proceedings of the 15th. Conference on Intelligent Robots and Computer Vision, volume 2904, pp. 413–424, (1996).
Google Scholar
L. Itti and P. Baldi, ‘A principled approach to detecting surprising events in video’, in Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 631–637, San Siego, CA, (June 2005).
Google Scholar
L. Itti and P. Baldi, ‘Bayesian surprise attracts human attention’, in Advances in Neural Information Processing Systems, Vol. 19 (NIPS*2005), pp. 1–8, Cambridge, MA, (2006). MIT Press.
Google Scholar
L. Macedo and A. Cardoso, ‘Exploration of unknown environments with motivational agents’, in Third International Joint Conference on Autonomous Agents and Multiagent Systems, (2004).
Google Scholar
L. Macedo, A. Cardoso, and R. Reisenzein, ‘A surprise-based agent architecture’, in Cybernetics and Systems, ed., R. Trappl, volume 2. Austrian Society for Cybernetics Studies, (2006).
Google Scholar
F. Kaplan and P-Y. Oudeyer, ‘Curiosity-driven development.’, in Proceedings of the International Workshop on Synergistic Intelligence Dynamics, (2006).
Google Scholar
P-Y. Oudeyer, F. Kaplan, and V. Hafner, ‘Intrinsic motivation systems for autonomous mental development’, IEEE Transactions on Evolutionary Computation, 11(2), 265–286, (2007).
Article Google Scholar
M.R. Dogar, M. Cakmak, E. Ugur, and E. Sahin, ‘From primitive behaviors to goal-directed behavior using affordances’, in IEEE/RSJ Intern. Conf. on Intelligent Robots and Systems, (2007).
Google Scholar
J. Mugan and B. Kuipers, ‘Learning distinctions and rules in a continuous world through active exploration’, in 7th International Conference on Epigenetic Robotics, (2007).
Google Scholar
S. Thrun, ‘The role of exploration in learning control’, in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, eds., D.A. White and D.A. Sofge, Van Nostrand Reinhold, Florence, Kentucky 41022, (1992).
Google Scholar
L. Bertelli, F. Bovo, L. Grespan, S. Galvan, and P. Fiorini, ‘Eddy: an open hardware robot for education’, in 4th International Symposium on Autonomous Minirobots for Research and Edutainment (AMIRE), Buenos Aires, Argentina, (October 2007).
Google Scholar
F. Di Palma, M. Reggiani, and P. Fiorini, ‘Design of experiment for qualitative equation discovery: a comparison’, in Eurosim Congress on Modeling and Simulation, Ljubljana, Slovenia, (September 2007).
Google Scholar
I. Bratko, Prolog Programming for Artificial Intelligence, Addison Wesley Publishing Company, 2001.
Google Scholar
G. Leban and I. Bratko, ‘Discovering notions using hyper’, Technical report, University of Ljubljana, Artificial Intelligence Laboratory, (February 2008).
Google Scholar
M. Henning, ‘A new approach to object-oriented middleware’, IEEE Internet Computing, 8(1), 66–75, (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, FH Bonn-Rhein-Sieg, Sankt Augustin, Germany
Erwin Prassler, Björn Kahl, Timo Henne & Alex Juarez
Dipartimento di Ingegneria e Gestione dei Sistemi Industriali, University of Padua, Padua, Italy
Monica Reggianni

Authors

Erwin Prassler
View author publications
You can also search for this author in PubMed Google Scholar
Björn Kahl
View author publications
You can also search for this author in PubMed Google Scholar
Timo Henne
View author publications
You can also search for this author in PubMed Google Scholar
Alex Juarez
View author publications
You can also search for this author in PubMed Google Scholar
Monica Reggianni
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fachhochschule Bonn-Rhein-Sieg, 53754, Sankt Augustin, Deutschland
Christoph Zacharias , Klaus W. ter Horst , Kurt-Ulrich Witt , Volker Sommer , Marc Ant , Ulrich Essmann & Laurenz Mülheims , , , , , &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Prassler, E., Kahl, B., Henne, T., Juarez, A., Reggianni, M. (2009). Autonomous Design of Experiments for Learning by Experimentation. In: Zacharias, C., et al. Forschungsspitzen und Spitzenforschung. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2127-7_13

Download citation

DOI: https://doi.org/10.1007/978-3-7908-2127-7_13
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2126-0
Online ISBN: 978-3-7908-2127-7
eBook Packages: Humanities, Social Science (German Language)

Publish with us

Policies and ethics