Skip to main content

A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty

  • Conference paper
  • First Online:
Scalable Uncertainty Management (SUM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9858))

Included in the following conference series:

Abstract

In a team of multiple agents, the pursuance of a common goal is a defining characteristic. Since agents may have different capabilities, and effects of actions may be uncertain, a common goal can generally only be achieved through a careful cooperation between the different agents. In this work, we propose a novel two-stage planner that combines online planning at both team level and individual level through a subgoal delegation scheme. The proposal brings the advantages of online planning approaches to the multi-agent setting. A number of modifications are made to a classical UCT approximate algorithm to (i) adapt it to the application domains considered, (ii) reduce the branching factor in the underlying search process, and (iii) effectively manage uncertain information of action effects by using information fusion mechanisms. The proposed online multi-agent planner reduces the cost of planning and decreases the temporal cost of reaching a goal, while significantly increasing the chance of success of achieving the common goal.

The original version of this chapter has been revised: In an older version Fig. 6 was represented incorrectly. An erratum to this chapter is available at 10.1007/978-3-319-45856-4_27

An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-319-45856-4_27

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Neighbouring PoIs are those which can be reached from the current agent position without getting through any other PoI.

  2. 2.

    Undesired outcomes are considered as terminal states: if an unexpected situation is encountered, the remaining agents start another planning process upon the resulting environment state.

  3. 3.

    In the country park scenario, primitive actions have at most one non-terminal outcome, but this could not be the case in other different scenarios with multiple stochastic action outcomes.

References

  1. Brafman, R., Domshlak, C.: From one to many: planning for loosely coupled multi-agent systems. In: Proceedings of ICAPS 2008 (2008)

    Google Scholar 

  2. Brafman, R.I.: A privacy preserving algorithm for multi-agent planning and search. In: Proceedings of IJCAI 2015, pp. 1530–1536 (2015)

    Google Scholar 

  3. Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P.I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of Monte Carlo tree search methods. IEEE Trans. Comput. Intell. AI Games 4(1), 1–43 (2012)

    Article  Google Scholar 

  4. Durfee, E.: Distributed problem solving and planning. In: Weiss, G. (ed.): A Modern Approach to Distributed Artificial Intelligence (1999)

    Google Scholar 

  5. Keller, T., Eyerich, P.: PROST: probabilistic planning based on UCT. In: Proceedings of ICAPS 2012 (2012)

    Google Scholar 

  6. Killough, R., Bauters, K., McAreavey, K., Liu, W., Hong, J.: Risk-aware planning in BDI agents. In: Proceedings of the 8th International Conference on Agents and Artificial Intelligence (ICAART 2016) (2016)

    Google Scholar 

  7. Klement, E., Mesiar, R., Pap, E.: On the relationship of associative compensatory operators to triangular norms and conorms. Int. J. Uncertainty Fuzziness Knowl.-Based Syst. 04(02), 129–144 (1996)

    Article  MathSciNet  MATH  Google Scholar 

  8. Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  9. Marcolino, L.S., Matsubara, H.: Multi-agent Monte Carlo Go. In: Proceedings of AAMAS 2011, pp. 21–28. International Foundation for Autonomous Agents and Multiagent Systems (2011)

    Google Scholar 

  10. Melo, F.S., Sardinha, A.: Ad hoc teamwork by learning teammates’ task. Auton. Agent. Multi-Agent Syst. 30(2), 175–219 (2015)

    Article  Google Scholar 

  11. Paquet, S., Tobin, L., Chaib-Draa, B.: An online POMDP algorithm for complex multiagent environments. In: Proceedings of AAMAS 2005, pp. 970–977 (2005)

    Google Scholar 

  12. Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach. Pearson Education, Upper Saddle River (2009)

    MATH  Google Scholar 

  13. Semsar-Kazerooni, E., Khorasani, K.: Multi-agent team cooperation: a game theory approach. Automatica 45(10), 2205–2213 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  14. Smith, R.: Coordination of temporal plans in dynamic environments for mobile agents. Laboratoire d’Informatique de Paris VI (2012)

    Google Scholar 

  15. Torreño, A., Onaindia, E., Sapena, O.: FMAP: distributed cooperative multi-agent planning. Appl. Intell. 41(2), 606–626 (2014)

    Article  Google Scholar 

  16. Torreño, A., Sapena, O., Onaindia, E.: Global heuristics for distributed cooperative multi-agent planning. In: 25th International Conference on Automated Planning and Scheduling, ICAPS 2015, pp. 225–233. AAAI Press (2015)

    Google Scholar 

  17. Wang, Y., Yang, J., Xu, D.: A preference aggregation method through the estimation of utility intervals. Comput. Oper. Res. 32, 2027–2049 (2005)

    Article  MATH  Google Scholar 

  18. de Weerdt, M., Clement, B.: Introduction to planning in multiagent systems. Multiagent Grid Syst. 5(4), 345–355 (2009)

    Article  Google Scholar 

  19. Weld, D.: Recent advances in AI planning. AI Mag. 20, 93–123 (1999)

    Google Scholar 

  20. Wu, F., Zilberstein, S., Chen, X.: Online planning for ad hoc autonomous agent teams. In: Proceedings of IJCAI 2011, pp. 439–445 (2011)

    Google Scholar 

  21. Wu, F., Zilberstein, S., Chen, X.: Online planning for multi-agent systems with bounded communication. Artif. Intell. 175(2), 487–511 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  22. Yager, R., Rybalov, A.: Uninorm aggregation operators. Fuzzy Sets Syst. 80, 111–120 (1996)

    Article  MathSciNet  MATH  Google Scholar 

  23. Younes, H., Littman, M.: PPDDL1.0: an extension to PDDL for expressing planning domains with probabilistic effects. In: Proceedings of ICAPS 2003 (2003)

    Google Scholar 

Download references

Acknowledgments

This work has been funded by EPSRC PACES project (Ref: EP/J012149/1).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Iván Palomares .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Palomares, I., Bauters, K., Liu, W., Hong, J. (2016). A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty. In: Schockaert, S., Senellart, P. (eds) Scalable Uncertainty Management. SUM 2016. Lecture Notes in Computer Science(), vol 9858. Springer, Cham. https://doi.org/10.1007/978-3-319-45856-4_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45856-4_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45855-7

  • Online ISBN: 978-3-319-45856-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics