Co-evolving a Neural-Net Evaluation Function for Othello by Combining Genetic Algorithms and Reinforcement Learning

Singer, _ Joshua A.

doi:10.1007/3-540-45718-6_42

Co-evolving a Neural-Net Evaluation Function for Othello by Combining Genetic Algorithms and Reinforcement Learning

_ Joshua A. Singer

Conference paper
First Online: 01 January 2001

1266 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2074))

Abstract

The neural network has been used extensively as a vehicle for both genetic algorithms and reinforcement learning. This paper shows a natural way to combine the two methods and suggests that reinforcement learning may be superior to random mutation as an engine for the discovery of useful substructures. The paper also describes a software experiment that applies this technique to produce an Othello-playing computer program. The experiment subjects a pool of Othello-playing programs to a regime of successive adaptation cycles, where each cycle consists of an evolutionary phase, based on the genetic algorithm, followed by a learning phase, based on reinforcement learning. A key idea of the genetic implementation is the concept of feature-level crossover. The regime was run for three months through 900,000 individual matches of Othello. It ultimately yielded a program that is competitive with a human-designed Othello-program that plays at roughly intermediate level.

Download to read the full chapter text

Chapter PDF

References

G. Tesauro, “Temporal Difference Learning and TD-Gammon”. Communications of the ACM, vol. 38, no. 3, pp. 58–68, 1995
Article Google Scholar
K. Chellapilla and D. B. Fogel, “Evolution, Neural Networks, Games, And Intelligence”. Proceedings of the IEEE, vol. 87,no. 9, pp. 1471–96, 1999
Article Google Scholar
D. E. Moriarty and R. Miikkulainen, “Discovering Complex Othello Strategies Through Evolutionary Neural Networks”. Connection Science, vol. 7, no.3-4, pp. 195–209, 1995
Article Google Scholar
Terry Jones, “Crossover, Macromutation, and Population-Based Search”, Proceedings of the Sixth International Conference on Genetic Algorithms
Google Scholar
Dimitri Bertsekas and John Tsitsiklis, Neuro-Dynamic Programming. Belmont, Massachusetts: Athena Scientific, 1996.
MATH Google Scholar
John Koza, Genetic Programming. MIT, 1996
Google Scholar
J. B. Pollack and A. D. Blair, “Co-Evolution in the Successful Learning of Backgammon Strategy”, Machine Learning, vol. 32,no. 3, pp. 225–40, 1998
Article MATH Google Scholar
Brian D. Ripley, Pattern Recognition And Neural Networks. Cambridge University Press, 1996.
Google Scholar
Richard Sutton and Andrew Bartow, Reinforcement Learning. Windfall Software, 1999.
Google Scholar

Download references

Authors

_ Joshua A. Singer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Cybernetics and Electronic Engineering, University of Reading, Whiteknights, PO Box 225, Reading, RG 6 6AY, UK
Vassil N. Alexandrov
Innovative Computing Lab, Computer Sciences Department, University of Tennessee, 1122 Volunteer Blvd, Knoxville, TN, 37996-3450, USA
Jack J. Dongarra
Computer Science Department, California State University, Chico, CA, 95929-0410, USA
Benjoe A. Juliano & René S. Renner &
The Queen’s University of Belfast, School of Computer Science, Belfast BT7 1NN, Northern Ireland, UK
C. J. Kenneth Tan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singer, _.J.A. (2001). Co-evolving a Neural-Net Evaluation Function for Othello by Combining Genetic Algorithms and Reinforcement Learning. In: Alexandrov, V.N., Dongarra, J.J., Juliano, B.A., Renner, R.S., Tan, C.J.K. (eds) Computational Science - ICCS 2001. ICCS 2001. Lecture Notes in Computer Science, vol 2074. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45718-6_42

Download citation

DOI: https://doi.org/10.1007/3-540-45718-6_42
Published: 17 July 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42233-4
Online ISBN: 978-3-540-45718-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics