Skip to main content

An UCT Approach for Anytime Agent-Based Planning

  • Conference paper

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 70))

Abstract

In this paper, we introduce a new heuristic search algorithm based on mean values for anytime planning, called MHSP. It consists in associating the principles of UCT, a bandit-based algorithm which gave very good results in computer games, and especially in Computer Go, with heuristic search in order to obtain an anytime planner that provides partial plans before finding a solution plan, and furthermore finding an optimal plan. The algorithm is evaluated in different classical planning problems and compared to some major planning algorithms. Finally, our results highlight the capacity of MHSP to return partial plans which tend to an optimal plan over the time.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Auer, P., Cesa-Bianchi, N., Fisher, P.: Finite-time Analysis of the Multiarmed Bandit Problem. Machine Learning 47(2–3), 235–256 (2002)

    Article  MATH  Google Scholar 

  2. Bonet, B., Geffner, H.: Planning as Heuristic Search. Artificial Intelligence 129(1–2), 5–33 (2001)

    Article  MATH  MathSciNet  Google Scholar 

  3. Chaslot, G., Winands, M., van den Herik, H., Uiterwijk, J., Bouzy, B.: Progressive Strategies for Monte-Carlo Tree Search. New Mathematics and Natural Computation 4(3), 343–357 (2008)

    Article  MATH  MathSciNet  Google Scholar 

  4. Chen, Y., Huang, R., Zhang, W.: Fast Planning by Search in Domain Transition Graphs. In: Proc. AAAI, pp. 886–891 (2008)

    Google Scholar 

  5. Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with Patterns in Monte-Carlo Go. Tech. Rep. RR-6062, INRIA (2006)

    Google Scholar 

  6. Gerevini, A., Serina, I.: LPG: A Planner Based on Local Search for Planning Graphs with Action Costs. In: Proc. ICAPS, pp. 13–22 (2002)

    Google Scholar 

  7. Grandcolas, S., Pain-Barre, C.: Filtering, Decomposition and Search Space Reduction for Optimal Sequential Planning. In: Proc. AAAI (2007)

    Google Scholar 

  8. Hansen, E.A., Zhou, R.: Anytime Heuristic Search. JAIR 28(1), 267–297 (2007)

    MATH  MathSciNet  Google Scholar 

  9. Hayes-Roth, B.: An architecture for adaptive intelligent systems. Artificial Intelligence 72, 329–365 (1995)

    Article  Google Scholar 

  10. Hoffmann, J., Nebel, B.: The FF Planning System: Fast Plan Generation Through Heuristic Search. JAIR 14(1), 253–302 (2001)

    MATH  Google Scholar 

  11. Hsu, C.W., Wah, B., Huang, R., Chen, Y.: Handling Soft Constraints and Goals Preferences in SGPlan. In: Proc. of the ICAPS Workshop on Preferences and Soft Constraints in Planning (2006)

    Google Scholar 

  12. Kautz, H.A., Selman, B.: Unifying SAT-based and Graph-based Planning. In: Proc. IJCAI, pp. 318–325 (1999)

    Google Scholar 

  13. Kocsis, L., Szepesvari, C.: Bandit-based Monte-Carlo Planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  14. Koehler, J., Nebel, B., Hoffmann, J., Dimopoulos, Y.: Extending planning graphs to an ADL subset. In: Steel, S. (ed.) ECP 1997. LNCS, vol. 1348, pp. 273–285. Springer, Heidelberg (1997)

    Chapter  Google Scholar 

  15. Korf, R.: Real-Time Heuristic Search. Artificial Intelligence 42(2-3), 189–211 (1990)

    Article  MATH  Google Scholar 

  16. Musliner, D., Goldman, R., Krebsbach, K.: Deliberation scheduling strategies for adaptive mission planning in real-time environments. In: Proceedings of the Third International Workshop on Self Adaptive Software (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pellier, D., Bouzy, B., Métivier, M. (2010). An UCT Approach for Anytime Agent-Based Planning. In: Demazeau, Y., Dignum, F., Corchado, J.M., Pérez, J.B. (eds) Advances in Practical Applications of Agents and Multiagent Systems. Advances in Intelligent and Soft Computing, vol 70. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12384-9_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12384-9_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12383-2

  • Online ISBN: 978-3-642-12384-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics