Abstract
In this work we propose the Ant-ViBRA system, which uses a Swarm Intelligence Algorithm that combines a Reinforcement Learning (RL) approach with Heuristic Search in order to coordinate agent actions in a Multi Agent System. The goal of Ant-ViBRA is to create plans that minimize the execution time of assembly tasks. To achieve this goal, a swarm algorithm called the Ant Colony System algorithm (ACS) was modified to be able to cope with planning when several agents are involved in a combinatorial optimization problem where interleaved execution is needed. Aiming at the reduction of the learning time, Ant-ViBRA uses a priori domain knowledge to decompose the assembly problem into subtasks and to define the relationship between actions and states based on the interactions among subtasks. Ant-ViBRA was applied to the domain of visually guided assembly tasks performed by a manipulator working in an assembly cell. Results acquired using Ant-ViBRA are encouraging and show that the combination of RL, Heuristic Search and the use of explicit domain knowledge presents better results than any of the techniques alone.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
E. Bonabeau, M. Dorigo, and G. Theraulaz. Swarm Intelligence: From Natural to Artificial Systems. Oxford University Press, New York, 1999.
E. Bonabeau, M. Dorigo, and G. Theraulaz. Inspiration for optimization from social insect behaviour. Nature 406[6791], 2000.
M. Dorigo. Ant algorithms and swarm intelligence. Proceedings of the Seventeen International Joint Conference on Artificial Intelligence, Tutorial MP-1, 2001.
M. Dorigo and L. M. Gambardella. Ant colony system: A cooperative learning approach to the traveling salesman problem. IEEE Transactions on Evolutionary Computation, 1(1), 1997.
C.R. Kube and H. Zhang. Collective robotics: from social insects to robots. Adaptive Behavior, 2:189–218, 1994.
C. R. Kube and H. Zhang. Task modelling in collective robotics. Autonomous Robots, 4:53–72, 1997.
V. Maniezzo, M. Dorigo, and A. Colorni. Algodesk: an experimental comparison of eight evolutionary heuristics applied to the qap problem. European Journal of Operational Research, 81:188–204, 1995.
A.H. Reali-Costa, L.N. Barros, and R. A. C. Bianchi. Integrating purposive vision with deliberative and reactive planning: An engineering support on robotics applications. Journal of the Brazilian Computer Society, 4(3):52–60, April 1998.
A. H. Reali-Costa and R. A. C. Bianchi. L-vibra: Learning in the vibra architecture. Lecture Notes in Artificial Intelligence, 1952:280–289, 2000.
R. Schoonderwoerd, O. Holland, J. Bruten, and L. Rothkrantz. Ant-based load balancing in telecommunications networks. Adapt. Behav., 5:169–207, 1997.
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD Thesis, University of Cambridge, 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bianchi, R.A.C., Costa, A.H.R. (2002). Ant-ViBRA: A Swarm Intelligence Approach to Learn Task Coordination. In: Bittencourt, G., Ramalho, G.L. (eds) Advances in Artificial Intelligence. SBIA 2002. Lecture Notes in Computer Science(), vol 2507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36127-8_19
Download citation
DOI: https://doi.org/10.1007/3-540-36127-8_19
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00124-9
Online ISBN: 978-3-540-36127-5
eBook Packages: Springer Book Archive