Technical reports
Exploratory learning in the game of GO
Barney Pell
18 pages
DOI: 10.48456/tr-275
Abstract
This paper considers the importance of exploration to game-playing programs which learn by playing against opponents. The central question is whether a learning program should play the move which offers the best chance of winning the present game, or if it should play the move which has the best chance of providing useful information for future games. An approach to addressing this question is developed using probability theory, and then implemented in two different learning methods. Initial experiments in the game of Go suggest that a program which takes exploration into account can learn better against a knowledgeable opponent than a program which does not.
Full text
PS (0.0 MB)
BibTeX record
@TechReport{UCAM-CL-TR-275, author = {Pell, Barney}, title = {{Exploratory learning in the game of GO}}, url = {https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-275.ps.gz}, institution = {University of Cambridge, Computer Laboratory}, doi = {10.48456/tr-275}, number = {UCAM-CL-TR-275} }