A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty

Palomares, Iván; Bauters, Kim; Liu, Weiru; Hong, Jun

doi:10.1007/978-3-319-45856-4_15

Iván Palomares¹⁵,
Kim Bauters¹⁵,
Weiru Liu¹⁵ &
…
Jun Hong¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9858))

Included in the following conference series:

International Conference on Scalable Uncertainty Management

826 Accesses
1 Citations

Abstract

In a team of multiple agents, the pursuance of a common goal is a defining characteristic. Since agents may have different capabilities, and effects of actions may be uncertain, a common goal can generally only be achieved through a careful cooperation between the different agents. In this work, we propose a novel two-stage planner that combines online planning at both team level and individual level through a subgoal delegation scheme. The proposal brings the advantages of online planning approaches to the multi-agent setting. A number of modifications are made to a classical UCT approximate algorithm to (i) adapt it to the application domains considered, (ii) reduce the branching factor in the underlying search process, and (iii) effectively manage uncertain information of action effects by using information fusion mechanisms. The proposed online multi-agent planner reduces the cost of planning and decreases the temporal cost of reaching a goal, while significantly increasing the chance of success of achieving the common goal.

The original version of this chapter has been revised: In an older version Fig. 6 was represented incorrectly. An erratum to this chapter is available at 10.1007/978-3-319-45856-4_27

An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-319-45856-4_27

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Neighbouring PoIs are those which can be reached from the current agent position without getting through any other PoI.
2.
Undesired outcomes are considered as terminal states: if an unexpected situation is encountered, the remaining agents start another planning process upon the resulting environment state.
3.
In the country park scenario, primitive actions have at most one non-terminal outcome, but this could not be the case in other different scenarios with multiple stochastic action outcomes.

References

Brafman, R., Domshlak, C.: From one to many: planning for loosely coupled multi-agent systems. In: Proceedings of ICAPS 2008 (2008)
Google Scholar
Brafman, R.I.: A privacy preserving algorithm for multi-agent planning and search. In: Proceedings of IJCAI 2015, pp. 1530–1536 (2015)
Google Scholar
Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P.I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of Monte Carlo tree search methods. IEEE Trans. Comput. Intell. AI Games 4(1), 1–43 (2012)
Article Google Scholar
Durfee, E.: Distributed problem solving and planning. In: Weiss, G. (ed.): A Modern Approach to Distributed Artificial Intelligence (1999)
Google Scholar
Keller, T., Eyerich, P.: PROST: probabilistic planning based on UCT. In: Proceedings of ICAPS 2012 (2012)
Google Scholar
Killough, R., Bauters, K., McAreavey, K., Liu, W., Hong, J.: Risk-aware planning in BDI agents. In: Proceedings of the 8th International Conference on Agents and Artificial Intelligence (ICAART 2016) (2016)
Google Scholar
Klement, E., Mesiar, R., Pap, E.: On the relationship of associative compensatory operators to triangular norms and conorms. Int. J. Uncertainty Fuzziness Knowl.-Based Syst. 04(02), 129–144 (1996)
Article MathSciNet MATH Google Scholar
Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Chapter Google Scholar
Marcolino, L.S., Matsubara, H.: Multi-agent Monte Carlo Go. In: Proceedings of AAMAS 2011, pp. 21–28. International Foundation for Autonomous Agents and Multiagent Systems (2011)
Google Scholar
Melo, F.S., Sardinha, A.: Ad hoc teamwork by learning teammates’ task. Auton. Agent. Multi-Agent Syst. 30(2), 175–219 (2015)
Article Google Scholar
Paquet, S., Tobin, L., Chaib-Draa, B.: An online POMDP algorithm for complex multiagent environments. In: Proceedings of AAMAS 2005, pp. 970–977 (2005)
Google Scholar
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach. Pearson Education, Upper Saddle River (2009)
MATH Google Scholar
Semsar-Kazerooni, E., Khorasani, K.: Multi-agent team cooperation: a game theory approach. Automatica 45(10), 2205–2213 (2009)
Article MathSciNet MATH Google Scholar
Smith, R.: Coordination of temporal plans in dynamic environments for mobile agents. Laboratoire d’Informatique de Paris VI (2012)
Google Scholar
Torreño, A., Onaindia, E., Sapena, O.: FMAP: distributed cooperative multi-agent planning. Appl. Intell. 41(2), 606–626 (2014)
Article Google Scholar
Torreño, A., Sapena, O., Onaindia, E.: Global heuristics for distributed cooperative multi-agent planning. In: 25th International Conference on Automated Planning and Scheduling, ICAPS 2015, pp. 225–233. AAAI Press (2015)
Google Scholar
Wang, Y., Yang, J., Xu, D.: A preference aggregation method through the estimation of utility intervals. Comput. Oper. Res. 32, 2027–2049 (2005)
Article MATH Google Scholar
de Weerdt, M., Clement, B.: Introduction to planning in multiagent systems. Multiagent Grid Syst. 5(4), 345–355 (2009)
Article Google Scholar
Weld, D.: Recent advances in AI planning. AI Mag. 20, 93–123 (1999)
Google Scholar
Wu, F., Zilberstein, S., Chen, X.: Online planning for ad hoc autonomous agent teams. In: Proceedings of IJCAI 2011, pp. 439–445 (2011)
Google Scholar
Wu, F., Zilberstein, S., Chen, X.: Online planning for multi-agent systems with bounded communication. Artif. Intell. 175(2), 487–511 (2011)
Article MathSciNet MATH Google Scholar
Yager, R., Rybalov, A.: Uninorm aggregation operators. Fuzzy Sets Syst. 80, 111–120 (1996)
Article MathSciNet MATH Google Scholar
Younes, H., Littman, M.: PPDDL1.0: an extension to PDDL for expressing planning domains with probabilistic effects. In: Proceedings of ICAPS 2003 (2003)
Google Scholar

Download references

Acknowledgments

This work has been funded by EPSRC PACES project (Ref: EP/J012149/1).

Author information

Authors and Affiliations

School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast, Northern Ireland
Iván Palomares, Kim Bauters, Weiru Liu & Jun Hong

Authors

Iván Palomares
View author publications
You can also search for this author in PubMed Google Scholar
Kim Bauters
View author publications
You can also search for this author in PubMed Google Scholar
Weiru Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jun Hong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Iván Palomares .

Editor information

Editors and Affiliations

Cardiff University , Cardiff, United Kingdom
Steven Schockaert
Télécom ParisTech , Paris, Paris, France
Pierre Senellart

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Palomares, I., Bauters, K., Liu, W., Hong, J. (2016). A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty. In: Schockaert, S., Senellart, P. (eds) Scalable Uncertainty Management. SUM 2016. Lecture Notes in Computer Science(), vol 9858. Springer, Cham. https://doi.org/10.1007/978-3-319-45856-4_15

Download citation

DOI: https://doi.org/10.1007/978-3-319-45856-4_15
Published: 30 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45855-7
Online ISBN: 978-3-319-45856-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics