Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction

Sartoretti, Guillaume; Wu, Yue; Paivine, William; Kumar, T. K. Satish; Koenig, Sven; Choset, Howie

doi:10.1007/978-3-030-05816-6_3

Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction

Guillaume Sartoretti¹³,
Yue Wu¹³,
William Paivine¹³,
T. K. Satish Kumar¹⁴,
Sven Koenig¹⁴ &
…
Howie Choset¹³

Conference paper
First Online: 30 January 2019

2304 Accesses
40 Citations

Part of the book series: Springer Proceedings in Advanced Robotics ((SPAR,volume 9))

Abstract

Inspired by recent advances in single agent reinforcement learning, this paper extends the single-agent asynchronous advantage actor-critic (A3C) algorithm to enable multiple agents to learn a homogeneous, distributed policy, where agents work together toward a common goal without explicitly interacting. Our approach relies on centralized policy and critic learning, but decentralized policy execution, in a fully-observable system. We show that the sum of experience of all agents can be leveraged to quickly train a collaborative policy that naturally scales to smaller and larger swarms. We demonstrate the applicability of our method on a multi-robot construction problem, where agents need to arrange simple block elements to build a user-specified structure. We present simulation results where swarms of various sizes successfully construct different test structures without the need for additional training.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Buoniu, L., Babuška, R., De Schutter, B., Srinivasan, D., Jain, L.C., Buoniu, L., Babuška, R., De Schutter, B.: Multi-agent reinforcement learning: an overview. Stud. Comput. Intell. 310, 183–221 (2010)
Google Scholar
Cai, T., Zhang, D., Kumar, S., Koenig, S., Ayanian, N.: Local search on trees and a framework for automated construction using multiple identical robots [Short paper]. In: International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), pp. 1301–1302 (2016)
Google Scholar
Foerster, J., Assael, I.A., de Freitas, N., Whiteson, S.: Learning to communicate with deep multi-agent reinforcement learning. In: Advances in Neural Information Processing Systems, pp. 2137–2145 (2016)
Google Scholar
Foerster, J., Farquhar, G., Afouras, T., Nardelli, N., Whiteson, S.: Counterfactual multi-agent Policy Gradients (2017). arXiv:1705.08926
Guestrin, C., Lagoudakis, M., Parr, R.: Coordinated reinforcement learning. In: International Conference on Machine Learning, pp. 227–234 (2002)
Google Scholar
Gupta, J.K., Egorov, M., Kochenderfer, M.: Cooperative multi-agent control using deep reinforcement learning. In: International Conference on Autonomous Agents and Multiagent Systems, pp. 66–83. Springer (2017)
Google Scholar
Hausknecht, M., Stone, P.: Deep Recurrent Q-Learning for Partially Observable MDPs, vol. 7, no. 1 (2015). arxiv:1507.06527
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
Google Scholar
Koenig, S., Kumar, S.: A case for collaborative construction as testbed for cooperative multi-agent planning. In: ICAPS-17 Scheduling and Planning Applications Workshop (2017)
Google Scholar
Kumar, S., Jung, S., Koenig, S.: A tree-based algorithm for construction robots. In: International Conference on Automated Planning and Scheduling (2014)
Google Scholar
Kumar, T.S., Jung, S.J., Koenig, S.: A tree-based algorithm for construction robots. In: International Conference on Automated Planning and Scheduling (2014)
Google Scholar
Lample, G., Chaplot, D.S.: Playing FPS games with deep reinforcement learning. In: AAAI Conference on Artificial Intelligence, pp. 2140–2146 (2017)
Google Scholar
Lau, Q.P., Lee, M.L., Hsu, W.: Coordination guided reinforcement learning. In: International Conference on Autonomous Agents and Multiagent Systems, pp. 215–222 (2012)
Google Scholar
Le, H.M., Yue, Y., Carr, P.: Coordinated Multi-agent Imitation Learning (2017). arXiv:1703.03121
Lowe, R., Wu, Y., Tamar, A., Harb, J., Abbeel, O.P., Mordatch, I.: Multi-agent actor-critic for mixed cooperative-competitive environments. In: Advances in Neural Information Processing Systems, pp. 6382–6393 (2017)
Google Scholar
Mnih, V., Badia, A.P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., Silver, D., Kavukcuoglu, K.: Asynchronous methods for deep reinforcement learning. In: Proceedings of the International Conference on Machine Learning, pp. 1928–1937 (2016)
Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., Riedmiller, M.: Playing Atari with Deep Reinforcement Learning (2013). arXiv:1312.5602
Neto, G.: From single-agent to multi-agent reinforcement learning: foundational concepts and methods. In: Learning Theory Course (2005)
Google Scholar
Oh, J., Guo, X., Lee, H., Lewis, R.L., Singh, S.: Action-conditional video prediction using deep networks in Atari games. In: Advances in Neural Information Processing Systems, pp. 2863–2871 (2015)
Google Scholar
Petersen, K., Nagpal, R., Werfel, J.: TERMES: an autonomous robotic system for three-dimensional collective construction. In: International Conference on Robotics: Science and Systems (2011)
Google Scholar
Sartoretti, G., Shi, Y., Paivine, W., Travers, M., Choset, H.: Distributed learning for the decentralized control of articulated mobile robots. In: Accepted to ICRA 2018, Brisbane, Australia, 21–25 May 2018
Google Scholar
Schaul, T., Quan, J., Antonoglou, I., Silver, D.: Prioritized Experience Replay (2015). arXiv:1511.05952
Schwartz, H.M.: Multi-agent Machine Learning: A Reinforcement Approach. Wiley (2014)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. 1998. In: A Bradford Book (1998)
Google Scholar
Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double Q-learning. In: AAAI Conference on Artificial Intelligence, pp. 2094–2100 (2016)
Google Scholar

Download references

Acknowledgements

The research at the University of Southern California was supported by the National Science Foundation (NSF) under grant numbers 1409987, 1724392 and 1817189. Detailed comments from anonymous referees contributed to the presentation and quality of this paper.

Author information

Authors and Affiliations

Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, 15203, USA
Guillaume Sartoretti, Yue Wu, William Paivine & Howie Choset
University of Southern California, Los Angeles, CA, 90007, USA
T. K. Satish Kumar & Sven Koenig

Authors

Guillaume Sartoretti
View author publications
You can also search for this author in PubMed Google Scholar
Yue Wu
View author publications
You can also search for this author in PubMed Google Scholar
William Paivine
View author publications
You can also search for this author in PubMed Google Scholar
T. K. Satish Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Sven Koenig
View author publications
You can also search for this author in PubMed Google Scholar
Howie Choset
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guillaume Sartoretti .

Editor information

Editors and Affiliations

Department of Computer Science, University of Colorado Boulder, Boulder, CO, USA
Nikolaus Correll
Department of Aeronautics and Astronautics, Stanford University, Stanford, CA, USA
Mac Schwager
Department of Aerospace Engineering, University of Maryland, College Park, MD, USA
Michael Otte

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sartoretti, G., Wu, Y., Paivine, W., Kumar, T.K.S., Koenig, S., Choset, H. (2019). Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction. In: Correll, N., Schwager, M., Otte, M. (eds) Distributed Autonomous Robotic Systems. Springer Proceedings in Advanced Robotics, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-030-05816-6_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-05816-6_3
Published: 30 January 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05815-9
Online ISBN: 978-3-030-05816-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics