Learning to Execute Navigation Plans

Belker, Thorsten; Beetz, Michael

doi:10.1007/3-540-45422-5_30


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2174)

Included in the following conference series:

Annual Conference on Artificial Intelligence

Abstract

Most state-of-the-art navigation systems for autonomous service robots decompose navigation into global navigation planning and local reactive navigation. While the methods for navigation planning and local navigation are well understood, the plan execution problem, that is, how to generate and parameterize local navigation tasks from a given navigation plan, is largely unsolved. This article describes how a robot can autonomously learn to execute navigation plans. We formalize the problem as a Markov Decision Problem (MDP), discuss how it can be simplified to make its solution feasible, and describe how the robot can acquire the necessary action models. We show, both in simulation and on an RWI B21 mobile robot, that the learned models are able to produce competent navigation behavior.

The research reported in this paper is partly funded by the Deutsche Forschungsgemeinschaft (DFG) under contract number BE 2200/3-1.

Editor information

Editors and Affiliations

Franz Baader, RWTH Aachen, Theoretical Computer Science, Ahornstrasse 55, 52074 Aachen, Germany
Gerhard Brewka, Intelligent Systems Department, University of Leipzig, Computer Science Institute, Augustusplatz 10-11, 04109 Leipzig, Germany
Thomas Eiter, Institute of Information Systems, Knowledge-Based Systems Group, Vienna University of Technology, Favoritenstrasse 11, 1040 Wien, Austria

About this paper

Cite this paper

Belker, T., Beetz, M. (2001). Learning to Execute Navigation Plans. In: Baader, F., Brewka, G., Eiter, T. (eds) KI 2001: Advances in Artificial Intelligence. KI 2001. Lecture Notes in Computer Science, vol 2174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45422-5_30

DOI: https://doi.org/10.1007/3-540-45422-5_30
Published: 03 September 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42612-7
Online ISBN: 978-3-540-45422-9
eBook Packages: Springer Book Archive
