An UCT Approach for Anytime Agent-Based Planning

Pellier, Damien; Bouzy, Bruno; Métivier, Marc

doi:10.1007/978-3-642-12384-9_26

An UCT Approach for Anytime Agent-Based Planning

Damien Pellier⁶,
Bruno Bouzy⁶ &
Marc Métivier⁶

Conference paper

610 Accesses
3 Citations

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 70))

Abstract

In this paper, we introduce a new heuristic search algorithm based on mean values for anytime planning, called MHSP. It consists in associating the principles of UCT, a bandit-based algorithm which gave very good results in computer games, and especially in Computer Go, with heuristic search in order to obtain an anytime planner that provides partial plans before finding a solution plan, and furthermore finding an optimal plan. The algorithm is evaluated in different classical planning problems and compared to some major planning algorithms. Finally, our results highlight the capacity of MHSP to return partial plans which tend to an optimal plan over the time.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Auer, P., Cesa-Bianchi, N., Fisher, P.: Finite-time Analysis of the Multiarmed Bandit Problem. Machine Learning 47(2–3), 235–256 (2002)
Article MATH Google Scholar
Bonet, B., Geffner, H.: Planning as Heuristic Search. Artificial Intelligence 129(1–2), 5–33 (2001)
Article MATH MathSciNet Google Scholar
Chaslot, G., Winands, M., van den Herik, H., Uiterwijk, J., Bouzy, B.: Progressive Strategies for Monte-Carlo Tree Search. New Mathematics and Natural Computation 4(3), 343–357 (2008)
Article MATH MathSciNet Google Scholar
Chen, Y., Huang, R., Zhang, W.: Fast Planning by Search in Domain Transition Graphs. In: Proc. AAAI, pp. 886–891 (2008)
Google Scholar
Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with Patterns in Monte-Carlo Go. Tech. Rep. RR-6062, INRIA (2006)
Google Scholar
Gerevini, A., Serina, I.: LPG: A Planner Based on Local Search for Planning Graphs with Action Costs. In: Proc. ICAPS, pp. 13–22 (2002)
Google Scholar
Grandcolas, S., Pain-Barre, C.: Filtering, Decomposition and Search Space Reduction for Optimal Sequential Planning. In: Proc. AAAI (2007)
Google Scholar
Hansen, E.A., Zhou, R.: Anytime Heuristic Search. JAIR 28(1), 267–297 (2007)
MATH MathSciNet Google Scholar
Hayes-Roth, B.: An architecture for adaptive intelligent systems. Artificial Intelligence 72, 329–365 (1995)
Article Google Scholar
Hoffmann, J., Nebel, B.: The FF Planning System: Fast Plan Generation Through Heuristic Search. JAIR 14(1), 253–302 (2001)
MATH Google Scholar
Hsu, C.W., Wah, B., Huang, R., Chen, Y.: Handling Soft Constraints and Goals Preferences in SGPlan. In: Proc. of the ICAPS Workshop on Preferences and Soft Constraints in Planning (2006)
Google Scholar
Kautz, H.A., Selman, B.: Unifying SAT-based and Graph-based Planning. In: Proc. IJCAI, pp. 318–325 (1999)
Google Scholar
Kocsis, L., Szepesvari, C.: Bandit-based Monte-Carlo Planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Chapter Google Scholar
Koehler, J., Nebel, B., Hoffmann, J., Dimopoulos, Y.: Extending planning graphs to an ADL subset. In: Steel, S. (ed.) ECP 1997. LNCS, vol. 1348, pp. 273–285. Springer, Heidelberg (1997)
Chapter Google Scholar
Korf, R.: Real-Time Heuristic Search. Artificial Intelligence 42(2-3), 189–211 (1990)
Article MATH Google Scholar
Musliner, D., Goldman, R., Krebsbach, K.: Deliberation scheduling strategies for adaptive mission planning in real-time environments. In: Proceedings of the Third International Workshop on Self Adaptive Software (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratoire d’Informatique de Paris Descartes 45, rue des Saints Pères, 75006, France
Damien Pellier, Bruno Bouzy & Marc Métivier

Authors

Damien Pellier
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Bouzy
View author publications
You can also search for this author in PubMed Google Scholar
Marc Métivier
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Laboratoire d’Informatique de Grenoble, Centre National de la, Recherche Scientifique,, Maison Jean Kuntzmann, 110 av. de la Chimie, F-38041, Grenoble, France
Yves Demazeau
Department of Information and Computing Sciences, Universiteit Utrecht, Centrumgebouw Noord, office A117, Padualaan 14, De Uitho, 3584CH, Utrecht, The Netherlands
Frank Dignum
Departamento de Informática, y Automática, Facultad de Ciencias, Universidad de Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Juan M. Corchado
Escuela Universitaria de Informática, Universidad Pontificia de Salamanca, Compañía 5, 37002, Salamanca, Spain
Javier Bajo Pérez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pellier, D., Bouzy, B., Métivier, M. (2010). An UCT Approach for Anytime Agent-Based Planning. In: Demazeau, Y., Dignum, F., Corchado, J.M., Pérez, J.B. (eds) Advances in Practical Applications of Agents and Multiagent Systems. Advances in Intelligent and Soft Computing, vol 70. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12384-9_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-12384-9_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12383-2
Online ISBN: 978-3-642-12384-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics