Abstract
We present a distributed mechanism for automatically allocating tasks to robots in a manner sensitive to each robot’s performance level without handcoding these levels in advance. This mechanism is an important part of improving multi-robot task allocation (MRTA) in systems where communication is restricted or where the complexity of the group dynamics makes it necessary to make allocation decisions locally. The general mechanism is demonstrated as an improvement on our previously published task allocation through vacancy chains (TAVC) algorithm for distributed MRTA. The TAVC algorithm uses individual reinforcement learning of task utilities and relies on the specializing abilities of the members of the group to produce dedicated optimal allocations. Through experiments with realistic simulator we evaluate the improved algorithm by comparing it to random allocation. We conclude that using softmax action selection functions on task utility values makes algorithms responsive to different performance levels in a group of heterogeneous robots.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
T. R. Balch. Reward and Diversity in Multirobot Foraging. In Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI’99) Workshop: Learning About, From and With other Agents, Stockholm, Sweden, July 31–August 6 1999.
Tucker R. Balch. The impact of diversity on performance in multi-robot foraging. In Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw, editors, The proceedings of the Third International Conference on Autonomous Agents (Agents’99), pages 92–99, Seattle, Washington, May 1–5 1999. ACM Press.
Wilfried Brauer and Gerhard Weiß. Multi-machine scheduling — a multi-agent learning approach. In Proceedings of the 3rd International Conference on Multi-Agent Systems (ICMAS’98), pages 42–48, Paris, Prance, July 4–7 1998. IEEE Press.
Ivan D. Chase, Marc Weissburg, and Theodore H. Dewitt. The vacancy chain process: a new mechanism of resource distribution in animals with application to hermit crabs. Animal Behavior, 36:1265–1274, 1988.
Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to algorithms. MIT Press, Cambridge, Massachusetts, second edition, 2001.
Torbjørn S. Dahl, Maja J Matarić, and Gaurav S. Sukhatme. Scheduling with group dynamics: A multi-robot task allocation algorithm based on vacancy chains. Technical Report CRES-002-07, Center for Robotics and Embedded Systems, University of Southern California, Los angeles, CA, 2002.
Torbjørn S. Dahl, Maja J. Matarić, and Gaurav S. Sukhatme. Multi-robot task-allocation through vacancy chains. In Proceedings of the 2003 IEEE International Conference on Robotics and Automation (ICRA’ 03), pages 2293–2298, Taipei, Taiwan, September 9–14 2003. IEEE Press.
Brian P. Gerkey and Maja J Matarić. Sold!: Auction methods for multi-robot coordination. IEEE Transactions on Robotics and Automation, 18(5):758–768, October 2002.
Dani Goldberg and Maja J Matarić. Learning multiple models for reward maximization. In Pat Langley, editor, Proceedings of the 17th International Conference on Machine Learning (ICML’00), pages 319–326, Stanford, California, June 29–July 2 2000. Morgan Kaufmann.
Kristina Lerman, Asram. Galstyan, Alcherio Martinoli, and Auke J. Ijspeert. A macroscopic analytical model of collaboration in distributed robotic systems. Artificial Life, 7(4):375–393, 2001.
Maja J. Matarić. Behavior-based control: Examples from navigation, learning, and group behavior. Journal of Experimental and Theoretical Artificial Intelligence, special issue on Software Architectures for Physical Agents, 9(2–3):323–336, 1997.
Lynne E. Parker. L-ALLIANCE: Task-Oriented Multi-Robot Learning in Behaviour-Based Systems. Advanced Robotics, Special Issue on Selected Papers from IROS’96, 11(4):305–322, 1997.
Richard S. Sutton and Andrew G. Barto. Reinforcement learning: an introduction. MIT Press, Cambridge, Massachusetts, 1998.
Helen Yan and Maja J Matarić. General spatial features for analysis of multirobot and human activities from raw position data. In Proceedings of the 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS’02), pages 2770–2775, Lausanne, Switzerland, September 30–October 4 2002. IEEE Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this paper
Cite this paper
Dahl, T.S., Matarić, M.J., Sukhatme, G.S. (2007). Emergent Robot Differentiation for Distributed Multi-Robot Task Allocation. In: Alami, R., Chatila, R., Asama, H. (eds) Distributed Autonomous Robotic Systems 6. Springer, Tokyo. https://doi.org/10.1007/978-4-431-35873-2_20
Download citation
DOI: https://doi.org/10.1007/978-4-431-35873-2_20
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-35869-5
Online ISBN: 978-4-431-35873-2
eBook Packages: EngineeringEngineering (R0)