Could Active Perception Aid Navigation of Partially Observable Grid Worlds?
Because a robot's sensors are inevitably limited in some way, it may find itself unable to distinguish between differing states of the world: the world is, in effect, partially observable. If reinforcement learning is used to train the robot, this confounding of states can seriously impair its ability to learn optimal and stable policies. Good results have been achieved by augmenting reinforcement learning algorithms with memory or internal models. In our work we take a different approach and consider whether active perception could be used instead. We test this using omniscient oracles, which play the role of a robot's active perceptual system, in a simple grid-world navigation problem. Our results indicate that simple reinforcement learning algorithms can learn when to consult these oracles and, as a result, learn optimal policies.
Keywords: Physical Action · Active Perception · Reinforcement Learning Algorithm · Global Impairment · Total Step
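The idea described above can be illustrated with a minimal sketch: tabular Q-learning in a tiny corridor world where two cells produce identical observations (perceptual aliasing), plus an extra "consult the oracle" action that, at a small cost, reveals the agent's true cell. The environment, reward values, and state encoding below are illustrative assumptions, not the paper's actual experimental setup.

```python
import random

# Corridor of cells 0..4 with the goal at cell 2. Cells 1 and 3 are
# perceptually aliased: both yield the observation ("corridor",), yet
# the optimal action differs (RIGHT from 1, LEFT from 3). A third
# action, CONSULT, queries a hypothetical omniscient oracle that
# returns the true cell index at a small cost; the agent must learn
# when this is worth paying. (All rewards/costs here are assumptions.)

LEFT, RIGHT, CONSULT = 0, 1, 2
GOAL = 2

def observe(cell, oracle_reply):
    if oracle_reply is not None:
        return ("cell", oracle_reply)   # disambiguated by the oracle
    if cell in (1, 3):
        return ("corridor",)            # the aliased observation
    return ("distinct", cell)           # remaining cells look distinct

def step(cell, action):
    """Return (next_cell, reward, oracle_reply)."""
    if action == CONSULT:
        return cell, -0.05, cell        # small cost, reveals true cell
    cell = max(0, min(4, cell + (1 if action == RIGHT else -1)))
    return cell, (1.0 if cell == GOAL else -0.2), None

def train(episodes=5000, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    rng = random.Random(seed)
    Q = {}
    for _ in range(episodes):
        cell = rng.choice([0, 1, 3, 4])
        obs = observe(cell, None)
        for _ in range(20):
            Q.setdefault(obs, [0.0, 0.0, 0.0])
            a = (rng.randrange(3) if rng.random() < eps
                 else max(range(3), key=lambda i: Q[obs][i]))
            cell, r, reply = step(cell, a)
            nxt = observe(cell, reply)
            Q.setdefault(nxt, [0.0, 0.0, 0.0])
            Q[obs][a] += alpha * (r + gamma * max(Q[nxt]) - Q[obs][a])
            obs = nxt
            if cell == GOAL:
                break
    return Q

Q = train()
best = max(range(3), key=lambda a: Q[("corridor",)][a])
print(best == CONSULT)  # the aliased observation triggers a consult
```

Acting blindly from the aliased observation succeeds only half the time, so the expected return of guessing falls below that of paying the oracle's cost and then moving straight to the goal; Q-learning over observations picks this up without any added memory.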