Evolving the MCTS Upper Confidence Bounds for Trees Using a Semantic-inspired Evolutionary Algorithm in the Game of Carcassonne
Created by W.Langdon from
gp-bibliography.bib Revision:1.7906
- @Article{Galvan:2022:games,
-
author = "Edgar Galvan and Gavin Simpson and
Fred Valdez Ameneyro",
-
journal = "IEEE Transactions on Games",
-
title = "Evolving the {MCTS} Upper Confidence Bounds for Trees
Using a Semantic-inspired Evolutionary Algorithm in the
Game of {Carcassonne}",
-
year = "2023",
-
volume = "15",
-
number = "3",
-
pages = "420--429",
-
month = sep,
-
keywords = "genetic algorithms, genetic programming, Carcassonne,
MonteCarlo tree search (MCTS), semantics",
-
ISSN = "2475-1510",
-
DOI = "doi:10.1109/TG.2022.3203232",
-
size = "10 pages",
-
abstract = "Monte Carlo Tree Search (MCTS) is a sampling
best-first method to search for optimal decisions. The
success of MCTS depends heavily on how the tree is
built and the selection process plays a fundamental
role in this. One particular selection mechanism that
has proved to be reliable is based on the Upper
Confidence Bounds for Trees (UCT). The UCT attempts to
balance exploration and exploitation by considering the
values stored in the statistical tree of the MCTS.
However, some tuning of the MCTS UCT is necessary for
this to work well. In this work, we use Evolutionary
Algorithms (EAs) to evolve mathematical expressions
with the goal to substitute the UCT formula and use the
evolved expressions in MCTS. More specifically, we
evolve expressions by means of our proposed
Semantic-inspired Evolutionary Algorithm in MCTS
approach (SIEA-MCTS). This is inspired by semantics in
Genetic Programming (GP), where the use of fitness
cases is seen as a requirement to be adopted in GP.
Fitness cases are normally used to determine the
fitness of individuals and can be used to compute the
semantic similarity (or dissimilarity) of individuals.
However, fitness cases are not available in MCTS. We
extend this notion by using multiple reward values from
MCTS that allow us to determine both the fitness of an
individual and its semantics. By doing so, we show how
SIEA-MCTS is able to successfully evolve mathematical
expressions that yield better or competitive results
compared to UCT without the need of tuning these
evolved expressions. We compare the performance of the
proposed SIEA-MCTS against MCTS algorithms, MCTS Rapid
Action Value Estimation algorithms, three variants of
the *-minimax family of algorithms, a random controller
and two more EA approaches. We consistently show how
SIEA-MCTS outperforms most of these intelligent
controllers in the challenging game of Carcassonne,
whose state-space complexity is, approx., 1e40.",
-
notes = "Also known as \cite{9872022}",
- }
Genetic Programming entries for
Edgar Galvan Lopez
Gavin Simpson
Fred Valdez Ameneyro
Citations