Skip to main content

Autonomous Design of Experiments for Learning by Experimentation

  • Chapter
Forschungsspitzen und Spitzenforschung

Abstract

In Artificial Intelligence, numerous learning paradigms have been developed over the past decades. In most cases of embodied and situated agents, the learning goal for the artificial agent is to „map“ or classify the environment and the objects therein [1, 2], in order to improve navigation or the execution of some other domain-specific task. Dynamic environments and changing tasks still pose a major challenge for robotic learning in real-world domains. In order to intelligently adapt its task strategies, the agent needs cognitive abilities to more deeply understand its environment and the effects of its actions. In order to approach this challenge within an open-ended learning loop, the XPERO project (http://www.xpero.org) explores the paradigm of Learning by Experimentation to increase the robot's conceptual world knowledge autonomously. In this setting, tasks which are selected by an actionselection mechanism are interrupted by a learning loop in those cases where the robot identifies learning as necessary for solving a task or for explaining observations. It is important to note that our approach targets unsupervised learning, since there is no oracle available to the agent, nor does it have access to a reward function providing direct feedback on the quality of its learned model, as e.g. in reinforcement learning approaches. In the following sections we present our framework for integrating autonomous robotic experimentation into such a learning loop. In section 1 we explain the different modules for stimulation and design of experiments and their interaction. In section 2 we describe our implementation of these modules and how we applied them to a real world scenario to gather target-oriented data for learning conceptual knowledge. There we also indicate how the goaloriented data generation enables machine learning algorithms to revise the failed prediction model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D.F. Wolf and G.S. Sukhatme, ‘Mobile robot simultaneous localization and mapping in dynamic environments’, Autonomous Robots, 19(1), 53–65, (2005).

    Article  Google Scholar 

  2. S. Petti and T. Fraichard, ‘Safe motion planning in dynamic environments’, Intelligent Robots and Systems (IROS). August, 2005.

    Google Scholar 

  3. W.-M. Shen, ‘The process of discovery’, Foundations of Science, 1(2), 233–251, (1995).

    Google Scholar 

  4. P.C.-H. Cheng, ‘Modelling experiments in scientific discovery’, in International Joint Conferences on Artificial Intelligence, IJCAI, pp. 739–745, (1991).

    Google Scholar 

  5. W.-M. Shen, ‘Discovery as autonomous learning from the environment’, Machine Learning, 12(1), 143–165, (1993).

    Google Scholar 

  6. M. Peters, ‘Towards artificial forms of intelligence, creativity, and surprise.’, in Proceedings of the 20 th Meeting of the Cognitive Science Society, pp. 836–841, (1998).

    Google Scholar 

  7. M. Peters and A. Sowmya, ‘Active vision and adaptive learning’, in Proceedings of the 15th. Conference on Intelligent Robots and Computer Vision, volume 2904, pp. 413–424, (1996).

    Google Scholar 

  8. L. Itti and P. Baldi, ‘A principled approach to detecting surprising events in video’, in Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 631–637, San Siego, CA, (June 2005).

    Google Scholar 

  9. L. Itti and P. Baldi, ‘Bayesian surprise attracts human attention’, in Advances in Neural Information Processing Systems, Vol. 19 (NIPS*2005), pp. 1–8, Cambridge, MA, (2006). MIT Press.

    Google Scholar 

  10. L. Macedo and A. Cardoso, ‘Exploration of unknown environments with motivational agents’, in Third International Joint Conference on Autonomous Agents and Multiagent Systems, (2004).

    Google Scholar 

  11. L. Macedo, A. Cardoso, and R. Reisenzein, ‘A surprise-based agent architecture’, in Cybernetics and Systems, ed., R. Trappl, volume 2. Austrian Society for Cybernetics Studies, (2006).

    Google Scholar 

  12. F. Kaplan and P-Y. Oudeyer, ‘Curiosity-driven development.’, in Proceedings of the International Workshop on Synergistic Intelligence Dynamics, (2006).

    Google Scholar 

  13. P-Y. Oudeyer, F. Kaplan, and V. Hafner, ‘Intrinsic motivation systems for autonomous mental development’, IEEE Transactions on Evolutionary Computation, 11(2), 265–286, (2007).

    Article  Google Scholar 

  14. M.R. Dogar, M. Cakmak, E. Ugur, and E. Sahin, ‘From primitive behaviors to goal-directed behavior using affordances’, in IEEE/RSJ Intern. Conf. on Intelligent Robots and Systems, (2007).

    Google Scholar 

  15. J. Mugan and B. Kuipers, ‘Learning distinctions and rules in a continuous world through active exploration’, in 7th International Conference on Epigenetic Robotics, (2007).

    Google Scholar 

  16. S. Thrun, ‘The role of exploration in learning control’, in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, eds., D.A. White and D.A. Sofge, Van Nostrand Reinhold, Florence, Kentucky 41022, (1992).

    Google Scholar 

  17. L. Bertelli, F. Bovo, L. Grespan, S. Galvan, and P. Fiorini, ‘Eddy: an open hardware robot for education’, in 4th International Symposium on Autonomous Minirobots for Research and Edutainment (AMIRE), Buenos Aires, Argentina, (October 2007).

    Google Scholar 

  18. F. Di Palma, M. Reggiani, and P. Fiorini, ‘Design of experiment for qualitative equation discovery: a comparison’, in Eurosim Congress on Modeling and Simulation, Ljubljana, Slovenia, (September 2007).

    Google Scholar 

  19. I. Bratko, Prolog Programming for Artificial Intelligence, Addison Wesley Publishing Company, 2001.

    Google Scholar 

  20. G. Leban and I. Bratko, ‘Discovering notions using hyper’, Technical report, University of Ljubljana, Artificial Intelligence Laboratory, (February 2008).

    Google Scholar 

  21. M. Henning, ‘A new approach to object-oriented middleware’, IEEE Internet Computing, 8(1), 66–75, (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Physica-Verlag Heidelberg

About this chapter

Cite this chapter

Prassler, E., Kahl, B., Henne, T., Juarez, A., Reggianni, M. (2009). Autonomous Design of Experiments for Learning by Experimentation. In: Zacharias, C., et al. Forschungsspitzen und Spitzenforschung. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2127-7_13

Download citation

Publish with us

Policies and ethics