Coordinating Randomized Policies for Increasing Security in Multiagent Systems

Paruchuri, Praveen; Tambe, Milind; Ordóñez, Fernando; Kraus, Sarit

doi:10.1007/978-3-642-04879-1_14

Coordinating Randomized Policies for Increasing Security in Multiagent Systems

Praveen Paruchuri²⁵,
Milind Tambe²⁵,
Fernando Ordóñez²⁵ &
…
Sarit Kraus²⁶

Conference paper

474 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4324))

Abstract

Despite significant recent advances in decision theoretic frameworks for reasoning about multiagent teams, little attention has been paid to applying such frameworks in adversarial domains, where the agent team may face security threats from other agents. This paper focuses on domains where such threats are caused by unseen adversaries whose actions or payoffs are unknown. In such domains, action randomization is recognized as a key technique to deteriorate an adversary’s capability to predict and exploit an agent/agent team’s actions. Unfortunately, there are two key challenges in such randomization. First, randomization can reduce the expected reward (quality) of the agent team’s plans, and thus we must provide some guarantees on such rewards. Second, randomization results in miscoordination in teams. While communication within an agent team can help in alleviating the miscoordination problem, communication is unavailable in many real domains or sometimes scarcely available. To address these challenges, this paper provides the following contributions. First, we recall the Multiagent Constrained MDP (MCMDP) framework that enables policy generation for a team of agents where each agent may have a limited or no(communication) resource. Second, since randomized policies generated directly for MCMDPs lead to miscoordination, we introduce a transformation algorithm that converts the MCMDP into a transformed MCMDP incorporating explicit communication and no communication actions. Third, we show that incorporating randomization results in a non-linear program and the unavailability/limited availability of communication results in addition of non-convex constraints to the non-linear program. Finally, we experimentally illustrate the benefits of our work.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burstein, M.H., Mulvehill, A.M., Deutsch, S.: An approach to mixed-initiative management of heterogeneous software agent teams. In: HICSS, p. 8055. IEEE Computer Society, Los Alamitos (1999)
Google Scholar
Boutilier, C.: Sequential Optimality and Coordination in Multiagent Systems. In: IJCAI (1999)
Google Scholar
Becker, R., Zilberstein, S., Lesser, V., Goldman, C.V.: Transition-Independent Decentralized Markov Decision Processes. In: AAMAS (2003)
Google Scholar
Nair, R., Pynadath, D., Yokoo, M., Tambe, M., Marsella, S.: Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings. In: IJCAI (2003)
Google Scholar
Paruchuri, P., Tambe, M., Ordonez, F., Kraus, S.: Security in Multiagent Systems by Policy Randomization. In: AAMAS (2006)
Google Scholar
Carroll, D., Mikell, K., Denewiler, T.: Unmanned Ground Vehicles for Integrated Force Protection. In: SPIE Proc., vol. 5422 (2004)
Google Scholar
Lewis, P.J., Torrie, M.R., Omilon, P.M.: Applications suitable for unmanned and autonomous missions utilizing the Tactical Amphibious Ground Support (TAGS) platform (2005), http://www.autonomoussolutions.com/Press/SPIE%20TAGS.html
Call for Papers: Safety and Security in Multiagent Systems, http://www.multiagent.com/dailist/msg00129.html
Beard, R., McLain, T.: Multiple UAV Cooperative Search under Collision Avoidance and Limited Range Communication Constraints. In: IEEE CDC (2003)
Google Scholar
Serjantov, A.: On the Anonymity of Anonymity Systems. PhD Dissertation, University of Cambridge (2004)
Google Scholar
Paruchuri, P., Tambe, M., Ordonez, F., Kraus, S.: Towards a Formalization of Teamwork With Resource Constraints. In: AAMAS (2004)
Google Scholar
Rahimi, M.H., Shah, H., Sukhatme, G.S., Heidemann, J., Estrin, D.: Studying the Feasibility of Energy Harvesting in a Mobile Sensor Network. In: ICRA (2003)
Google Scholar
Dolgov, D., Durfee, E.: Approximating Optimal Policies for Agents with Limited Execution Resources. In: IJCAI (2003)
Google Scholar
Altman, E.: Constrained Markov Decision Process. Chapman and Hall, Boca Raton (1999)
MATH Google Scholar
Littman, M.: Markov Games as a Framework for Multi-Agent Reinforcement Learning (1994), http://citeseer.ist.psu.edu/littman94markov.html
Dolgov, D., Durfee, E.: Resource Allocation and Policy Formulation for Multiple Resource-Limited Agents Under Uncertainty. In: ICAPS (2004)
Google Scholar
Shannon, C.: A Mathematical Theory of Communication. The Bell Labs Technical Journal (1948)
Google Scholar
Pynadath, D., Tambe, M.: The communicative multiagent team decision problem: analyzing teamwork theories and models. In: JAIR (2002)
Google Scholar
Goldman, C.V., Zilberstein, S.: Optimizing Information Exchange in Cooperative Multi-agent Systems. In: AAMAS (2003)
Google Scholar
Jaakkola, T., Singh, S., Jordan, M.: Reinforcement learning algorithm for partially observable markov decision problems. In: Advances in NIPS (1994)
Google Scholar
Parr, R., Russel, S.: Approximating Optimal Policies for partially observable stochastic domains. In: IJCAI (1995)
Google Scholar
Kaelbling, L., Littman, M., Cassandra, A.: Planning and Acting in Partially Observable Stochastic Domains. In: Technical Report, Brown University (1995)
Google Scholar
Poupart, P., Boutilier, C.: Bounded finite state controllers. In: NIPS (2003)
Google Scholar
Bernstein, D.S., Hansen, E.A., Zilberstein, S.: Bounded Policy Iteration for Decentralized POMDPs. In: IJCAI (2005)
Google Scholar
Xuan, P., Lesser, V.: Multi-Agent Policies: From Centralized Ones to Decentralized Ones. In: AAMAS (2002)
Google Scholar
Becker, R., Lesser, V., Zilberstein, S.: Analyzing Myopic Approaches for Multi-Agent Communication. In: Proceedings of IAT (2005)
Google Scholar
Ghavamzadeh, M., Mahadevan, S.: Learning to Communicate and Act in Cooperative Multiagent Systems using Hierarchical Reinforcement Learning. In: AAMAS (2004)
Google Scholar
Nair, R., Roth, M., Yokoo, M., Tambe, M.: Communication for Improving Policy Computation in Distributed POMDPs. In: AAMAS (2004)
Google Scholar
Hu, J., Wellman, P.: Multiagent reinforcement learning: theoretical framework and an algorithm. In: ICML (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Southern California, Los Angeles, CA, 90089
Praveen Paruchuri, Milind Tambe & Fernando Ordóñez
Bar-Ilan University, Ramat-Gan, 52900, Israel
Sarit Kraus

Authors

Praveen Paruchuri
View author publications
You can also search for this author in PubMed Google Scholar
Milind Tambe
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Ordóñez
View author publications
You can also search for this author in PubMed Google Scholar
Sarit Kraus
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Auckland, P.O. Box, New Zealand
Mike Barley
School of Computing and Technology, University of East London, UK
Haralambos Mouratidis
Dept. of Computer Science and Software Engineering, The University of Melbourne, P.O. Box, VIC 3010, Australia
Amy Unruh
Computer Science Department, University of Wyoming, 82070, Laramie, WY, USA
Diana Spears
Robotics Institute, Carnegie Mellon University, 5000 Forbes Avenue, PA 15213, Pittsburgh, USA
Paul Scerri
University of Trento, Povo (Trento), Italy
Fabio Massacci

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Paruchuri, P., Tambe, M., Ordóñez, F., Kraus, S. (2009). Coordinating Randomized Policies for Increasing Security in Multiagent Systems. In: Barley, M., Mouratidis, H., Unruh, A., Spears, D., Scerri, P., Massacci, F. (eds) Safety and Security in Multiagent Systems. Lecture Notes in Computer Science(), vol 4324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04879-1_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-04879-1_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04878-4
Online ISBN: 978-3-642-04879-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics