Abstract
In a team of multiple agents, the pursuance of a common goal is a defining characteristic. Since agents may have different capabilities, and effects of actions may be uncertain, a common goal can generally only be achieved through a careful cooperation between the different agents. In this work, we propose a novel two-stage planner that combines online planning at both team level and individual level through a subgoal delegation scheme. The proposal brings the advantages of online planning approaches to the multi-agent setting. A number of modifications are made to a classical UCT approximate algorithm to (i) adapt it to the application domains considered, (ii) reduce the branching factor in the underlying search process, and (iii) effectively manage uncertain information of action effects by using information fusion mechanisms. The proposed online multi-agent planner reduces the cost of planning and decreases the temporal cost of reaching a goal, while significantly increasing the chance of success of achieving the common goal.
The original version of this chapter has been revised: In an older version Fig. 6 was represented incorrectly. An erratum to this chapter is available at 10.1007/978-3-319-45856-4_27
An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-319-45856-4_27
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Neighbouring PoIs are those which can be reached from the current agent position without getting through any other PoI.
- 2.
Undesired outcomes are considered as terminal states: if an unexpected situation is encountered, the remaining agents start another planning process upon the resulting environment state.
- 3.
In the country park scenario, primitive actions have at most one non-terminal outcome, but this could not be the case in other different scenarios with multiple stochastic action outcomes.
References
Brafman, R., Domshlak, C.: From one to many: planning for loosely coupled multi-agent systems. In: Proceedings of ICAPS 2008 (2008)
Brafman, R.I.: A privacy preserving algorithm for multi-agent planning and search. In: Proceedings of IJCAI 2015, pp. 1530–1536 (2015)
Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P.I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of Monte Carlo tree search methods. IEEE Trans. Comput. Intell. AI Games 4(1), 1–43 (2012)
Durfee, E.: Distributed problem solving and planning. In: Weiss, G. (ed.): A Modern Approach to Distributed Artificial Intelligence (1999)
Keller, T., Eyerich, P.: PROST: probabilistic planning based on UCT. In: Proceedings of ICAPS 2012 (2012)
Killough, R., Bauters, K., McAreavey, K., Liu, W., Hong, J.: Risk-aware planning in BDI agents. In: Proceedings of the 8th International Conference on Agents and Artificial Intelligence (ICAART 2016) (2016)
Klement, E., Mesiar, R., Pap, E.: On the relationship of associative compensatory operators to triangular norms and conorms. Int. J. Uncertainty Fuzziness Knowl.-Based Syst. 04(02), 129–144 (1996)
Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Marcolino, L.S., Matsubara, H.: Multi-agent Monte Carlo Go. In: Proceedings of AAMAS 2011, pp. 21–28. International Foundation for Autonomous Agents and Multiagent Systems (2011)
Melo, F.S., Sardinha, A.: Ad hoc teamwork by learning teammates’ task. Auton. Agent. Multi-Agent Syst. 30(2), 175–219 (2015)
Paquet, S., Tobin, L., Chaib-Draa, B.: An online POMDP algorithm for complex multiagent environments. In: Proceedings of AAMAS 2005, pp. 970–977 (2005)
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach. Pearson Education, Upper Saddle River (2009)
Semsar-Kazerooni, E., Khorasani, K.: Multi-agent team cooperation: a game theory approach. Automatica 45(10), 2205–2213 (2009)
Smith, R.: Coordination of temporal plans in dynamic environments for mobile agents. Laboratoire d’Informatique de Paris VI (2012)
Torreño, A., Onaindia, E., Sapena, O.: FMAP: distributed cooperative multi-agent planning. Appl. Intell. 41(2), 606–626 (2014)
Torreño, A., Sapena, O., Onaindia, E.: Global heuristics for distributed cooperative multi-agent planning. In: 25th International Conference on Automated Planning and Scheduling, ICAPS 2015, pp. 225–233. AAAI Press (2015)
Wang, Y., Yang, J., Xu, D.: A preference aggregation method through the estimation of utility intervals. Comput. Oper. Res. 32, 2027–2049 (2005)
de Weerdt, M., Clement, B.: Introduction to planning in multiagent systems. Multiagent Grid Syst. 5(4), 345–355 (2009)
Weld, D.: Recent advances in AI planning. AI Mag. 20, 93–123 (1999)
Wu, F., Zilberstein, S., Chen, X.: Online planning for ad hoc autonomous agent teams. In: Proceedings of IJCAI 2011, pp. 439–445 (2011)
Wu, F., Zilberstein, S., Chen, X.: Online planning for multi-agent systems with bounded communication. Artif. Intell. 175(2), 487–511 (2011)
Yager, R., Rybalov, A.: Uninorm aggregation operators. Fuzzy Sets Syst. 80, 111–120 (1996)
Younes, H., Littman, M.: PPDDL1.0: an extension to PDDL for expressing planning domains with probabilistic effects. In: Proceedings of ICAPS 2003 (2003)
Acknowledgments
This work has been funded by EPSRC PACES project (Ref: EP/J012149/1).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Palomares, I., Bauters, K., Liu, W., Hong, J. (2016). A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty. In: Schockaert, S., Senellart, P. (eds) Scalable Uncertainty Management. SUM 2016. Lecture Notes in Computer Science(), vol 9858. Springer, Cham. https://doi.org/10.1007/978-3-319-45856-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-45856-4_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45855-7
Online ISBN: 978-3-319-45856-4
eBook Packages: Computer ScienceComputer Science (R0)