Abstract
This paper proposed a robot reinforcement learning method based on learning classifier system. A Learning Classifier System is a accuracy-based machine learning system with gradient descent that combines reinforcement learning and rule discovery system. The genetic algorithm and the covering operator act as innovation discovery components which are responsible for discovering new better reinforcement learning rules. The reinforcement learning component is responsible for adjusting the fitness of rules in the system according to some reward obtained from the environment. The advantage of this approach is its accuracy-based representation, which can easily reduce learning space, improve online learning ability and robot robustness.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barid, L.C.: Residual algorithms: Reinforcement Learning with function approximation. In: Proc. 12th Int. Conf. Mach. Learn., pp. 30–37 (July 1995)
Glorennec, P.Y.: Reinforcement learning: An overview. In: Eur. Symp. Intell. Tech., Aachen, Germany, pp. 17–35 (2000)
Wiering, M.: Multi-agent reinforcement learning for traffic light control. In: Proc. 17th Int. Conf. Mach. Learn. (ICML 2000), June 29-July 2, pp. 1151–1158. Stanford Univ., Stanford (2000)
Dixon, P.W., Corne, D.W., Oates, M.J.: A Preliminary Investigation of Modified XCS as a Generic Data Mining Tool. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2001. LNCS (LNAI), vol. 2321, pp. 133–150. Springer, Heidelberg (2002)
Kovacs, T., Kerber, M.: Some dimensions of problem complexity for XCS. In: Wu, A.S. (ed.) Proc. 2000 Genetic and Evolutionary Computation Conf. Workshop Program, pp. 289–292 (2000)
Butz, M.V., Goldberg, D.E., Lanzi, P.L.: Gradient descent methods in learning classifier systems: Improving XCS performance in multistep problems. IEEE Trans. Evol. Comput. 9(5), 452–473 (2005)
Bernadó-Mansilla, E., Garrell, J.: Accuracy-based Learning Classifier Systems: Models, analysis and applications to classification tasks. Evolutionary Computation 11(3), 209–238 (2003)
Hung, K.-T., Liu, J.-S., Chang, Y.-Z.: Smooth path planning for a mobile robot by evolutionary multiobjective optimization. In: IEEE Int. Symposium on Computational Intelligence in Robotics and Automation, Jacksonville, Florida (June 2007)
Butz, M.V., Lanzi, P.L., Wilson, S.W.: Function approximation with XCS: Hyperellipsoidal conditions, recursive least squares, and compaction. IEEE Trans. Evol. Comput. 12(3), 355–376 (2008)
Bagnall, A.J., Cawley, G.C.: Learning classifier systems for data mining: A comparison of XCS with other classifiers for the Forest Cover dataset. In: Proc. IEEE/INNS Int. Joint Conf. Artificial Neural Netw., Portland, OR, July 20-24, vol. 3, pp. 1802–1807 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Shao, J., Chen, S., Zhao, C. (2012). Robot Reinforcement Learning Methods Based on XCSG. In: Jin, D., Lin, S. (eds) Advances in Computer Science and Information Engineering. Advances in Intelligent and Soft Computing, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30223-7_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-30223-7_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30222-0
Online ISBN: 978-3-642-30223-7
eBook Packages: EngineeringEngineering (R0)