Learning subjective “cognitive maps” in the presence of sensory-motor errors

Pipe, A. G.; Carse, B.; Fogarty, T. C.; Winfield, A.

doi:10.1007/3-540-59496-5_318

A. G. Pipe¹,
B. Carse¹,
T. C. Fogarty² &
…
A. Winfield¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 929))

Included in the following conference series:

European Conference on Artificial Life

222 Accesses
3 Citations

Abstract

In this paper we present a new version of our previous work on a maze learning animat. Its sensory/motor capabilities have been extended and modified so that they are more biologically plausible than before. The animat's learning architecture is based around a hybrid RBF Neural Network/Evolutionary Strategy implementation of an Adaptive Heuristic Critic. We conduct experiments in which the animat either acquires persistent but undetectable internal errors in its sensory equipment, or operates in an environment where undetectable factors influence motor actions. We also observe the effects of random sensory errors on the usefulness of the information which the animat acquires. Through interactions with its environment the animat learns a subjective “cognitive map” which is a fusion of the features in its surroundings, the path to a goal state, and the errors/environmental influences which it cannot directly detect. We find that despite the subjective nature of the map it remains useful under quite high levels of error/distortion in our experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barto A. G., Bradtke S. J., Singh S. P., 1991, ‘Real-Time Learning and Control using Asynchronous Dynamic Programming', Dept. of Computer Science, University of Massachusetts, USA, Tech. Report 91-57
Google Scholar
Barto A. G., Sutton R. S., Watkins C. J. C. H., 1989, ‘Learning and Sequential Decision Making', COINS Technical Report 89-95
Google Scholar
Booker L. B., Goldberg D. E., Holland J. H., 1989, ‘Classifier Systems and Genetic Algorithms', Artificial Intelligence 40, pp.235–282
Google Scholar
Cliff D., Harvey I., Husbands P., 1993, ‘Explorations in Evolutionary Robotics', Journal of Adaptive Behaviour, 2(1), pp.71–104
Google Scholar
ECAL I, 1991, ‘Towards a Practice of Autonomous Systems', Proceedings of the First & Second European Conference on Artificial Life, Eds. Varela F. J., Bourgine P., MIT Press
Google Scholar
ECAL II, 1993, Proceedings of the Second European Conference on Artificial Life, MIT Press
Google Scholar
Gallistel C. R., 1990, ‘The Organization of Learning', MIT Press
Google Scholar
Grefenstette J. J., 1991, ‘Lamarckian Learning in Multi-agent Environments', Proceedings of the Fourth International Conference on Genetic Algorithms, Morgan-Kaufmann, pp.303–310
Google Scholar
Lin L., PhD thesis, 1993, ‘Reinforcement Learning for Robots using Neural Networks', School of Computer Science, Carnegie Mellon University Pittsburgh, USA
Google Scholar
Pipe A. G. 1, Carse B., 1994, 'A Comparison between Two Architectures for Searching and Learning in Maze Problems', Selected papers from AISB Workshop in Evolutionary Computation, Springer-Verlag Lecture Notes in Computer Science #865, pp.238–249
Google Scholar
Pipe A. G. 2, Fogarty T. C., Winfield A., 1994, ‘A Hybrid Architecture for Learning Continuous Environmental Models in Maze Problems', From Animals to Animats 3, Proceedings of third International Conference on Simulation of Adaptive Behaviour, Eds. Cliff D., Husbands P., Meyer J-A., Wilson S. W., MIT Press, pp. 198–205
Google Scholar
Pipe A. G. 3, Fogarty T. C., Winfield A., 1994, ‘Hybrid Adaptive Heuristic Critic Architectures for Learning in Mazes with Continuous Search Spaces', Parallel Problem Solving from Nature (PPSNIII), Proceedings of the third International Conference on Evolutionary Computation, Springer-Verlag Lecture Notes in Computer Science #866, pp.482–491
Google Scholar
Roitblat H. L., 1994, ‘Mechanism and Process in Animal Behaviour: Models of Animals, Animals as Models', From Animals to Animats 3, Proceedings of third International Conference on Simulation of Adaptive Behaviour, Eds. Cliff D., Husbands P., Meyer J-A., Wilson S. W., MIT Press, pp. 12–21
Google Scholar
Roberts G., 1993, ‘Dynamic Planning for Classifier Systems', Proceedings of the 5th International Conference on Genetic Algorithms, pp.231–237
Google Scholar
SAB92, From Animals to Animats 2, Proceedings of the Seconds International Conference on Simulation of Adaptive Behaviour, Eds. Meyer J-A., Roitblat H. L., Wilson S. W., MIT Press
Google Scholar
SAB94, From Animals to Animats 3, Proceedings of third International Conference on Simulation of Adaptive Behaviour, Eds. Cliff D., Husbands P., Meyer J-A., Wilson S. W., MIT Press
Google Scholar
Sutton R. S., 1984, PhD thesis ‘Temporal Credit Assignment in Reinforcement Learning', University of Massachusetts, Dept. of computer and Information Science
Google Scholar
Sutton R. S., 1991, ‘Reinforcement Learning Architectures for Animats', From Animals to Animats, pp288–296, Editors Meyer, J., Wilson, S., MIT Press
Google Scholar
Tolman E. C., Ritchie B. F., Kalish D., 1946, ‘Studies in Spatial Learning I. Orientation and the Short-Cut', Journal of Experimental Psychology #36, pp. 13–24
Google Scholar
Watkins C. J. C. H., 1989, PhD thesis ‘Learning from Delayed Rewards', King's College, Cambridge.
Google Scholar
Werbos, P. J., 1992, ‘Approximate Dynamic Programming for Real-Time Control and Neural Modelling', Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, Van Nostrand Reinhold, Ed. White D. A., Sofge D. A.
Google Scholar

Download references

Author information

Authors and Affiliations

Intelligent Autonomous Systems Laboratory, Faculty of Engineering, University of the West of England, Coldharbour Lane, BS16 1QY, Frenchay, Bristol, UK
A. G. Pipe, B. Carse & A. Winfield
Faculty of Computer Science & Mathematics, University of the West of England, Coldharbour Lane, BS16 1QY, Frenchay, Bristol, UK
T. C. Fogarty

Authors

A. G. Pipe
View author publications
You can also search for this author in PubMed Google Scholar
B. Carse
View author publications
You can also search for this author in PubMed Google Scholar
T. C. Fogarty
View author publications
You can also search for this author in PubMed Google Scholar
A. Winfield
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Federico Morán Alvaro Moreno Juan Julián Merelo Pablo Chacón

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pipe, A.G., Carse, B., Fogarty, T.C., Winfield, A. (1995). Learning subjective “cognitive maps” in the presence of sensory-motor errors. In: Morán, F., Moreno, A., Merelo, J.J., Chacón, P. (eds) Advances in Artificial Life. ECAL 1995. Lecture Notes in Computer Science, vol 929. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-59496-5_318

Download citation

DOI: https://doi.org/10.1007/3-540-59496-5_318
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-59496-3
Online ISBN: 978-3-540-49286-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics