Abstract
This chapter introduces a novel Estimation of Distribution Algorithm for solving reinforcement learning problems, termed EDA-RL. As its probabilistic model, EDA-RL employs the Conditional Random Fields proposed by Lafferty et al., which estimate conditional probability distributions using a Markov network. In addition, structural search of the probabilistic model by means of the χ²-test and a data-correction method are examined. A primary feature of EDA-RL is that it directly estimates the policies of reinforcement learning agents with Conditional Random Fields; another is its use of an undirected graphical probabilistic model. Experimental results on probabilistic transition problems and maze problems demonstrate the effectiveness of EDA-RL.
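To make the abstract's central idea concrete, the following is a minimal sketch (not the chapter's implementation) of the log-linear conditional model p(action | state) that underlies a Conditional Random Field. The feature function `phi` and the weight dictionary are hypothetical illustrations; in EDA-RL the weights would be fitted from episodes collected by the agent population.

```python
import math

def phi(state, action):
    # Hypothetical binary feature map: one indicator per (state, action) pair.
    return {(state, action): 1.0}

def policy(state, actions, weights):
    # Log-linear conditional model:
    #   p(a | s) = exp(sum_k w_k * phi_k(s, a)) / Z(s)
    scores = {}
    for a in actions:
        scores[a] = math.exp(sum(weights.get(k, 0.0) * v
                                 for k, v in phi(state, a).items()))
    z = sum(scores.values())  # partition function Z(s), normalizes over actions
    return {a: sc / z for a, sc in scores.items()}

# With all-zero weights every action gets the same score, so the
# resulting policy is uniform over the available actions.
probs = policy("s0", ["left", "right"], weights={})
```

Fitting the weights by maximum likelihood on high-reward episodes, and searching over which feature cliques to include, corresponds to the estimation and structural-search steps sketched in the abstract.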
References
Butz, M.V., Pelikan, M.: Studying XCS/BOA learning in boolean functions: structure encoding and random boolean functions. In: Proc. of the 2006 Genetic and Evol. Comput. Conf., pp. 1449–1456 (2006)
Butz, M.V., Pelikan, M., Llorà, X., Goldberg, D.E.: Automated global structure extraction for effective local building block processing in XCS. Evolutionary Computation 14(3), 345–380 (2006)
Handa, H.: EDA-RL: estimation of distribution algorithms for reinforcement learning problems. In: GECCO 2009: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, pp. 405–412 (2009)
Handa, H., Isozaki, M.: Evolutionary fuzzy systems for generating better Ms.PacMan players. In: 2008 IEEE International Conference on Fuzzy Systems, pp. 2182–2185 (2008)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann (2001)
Miyazaki, K., Yamamura, M., Kobayashi, S.: On the rationality of profit sharing in reinforcement learning. In: Proc. 3rd International Conference on Fuzzy Logic, Neural Nets and Soft Computing (IIZUKA 1994), pp. 285–288 (1994)
Ono, I., Nijo, T., Ono, N.: A genetic algorithm for automatically designing modular reinforcement learning agents. In: Proc. of the Genetic and Evol. Comput. Conf., pp. 203–210 (2000)
Sutton, C., McCallum, A.: An introduction to conditional random fields for relational learning. In: Getoor, L., Taskar, B. (eds.) Introduction to Statistical Relational Learning, ch. 4, pp. 93–128. MIT Press, Cambridge (2007)
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (1998)
Watkins, C., Dayan, P.: Technical note: Q-learning. Machine Learning 8, 279–292 (1992)
Yamazaki, A., Shibuya, T., Hamagami, T.: Complex-valued reinforcement learning with hierarchical architecture. In: Proc. of IEEE International Conference on Systems, Man, and Cybernetics, pp. 1925–1931 (2010)
© 2012 Springer Berlin Heidelberg
Cite this chapter
Handa, H. (2012). EDA-RL: EDA with Conditional Random Fields for Solving Reinforcement Learning Problems. In: Shakya, S., Santana, R. (eds) Markov Networks in Evolutionary Computation. Adaptation, Learning, and Optimization, vol 14. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28900-2_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28899-9
Online ISBN: 978-3-642-28900-2
eBook Packages: Engineering (R0)