Interpretable policies for reinforcement learning by genetic programming
Created by W.Langdon from
gp-bibliography.bib Revision:1.7917
- @Article{HEIN2018158,
-
author = "Daniel Hein and Steffen Udluft and Thomas A. Runkler",
-
title = "Interpretable policies for reinforcement learning by
genetic programming",
-
journal = "Engineering Applications of Artificial Intelligence",
-
year = "2018",
-
volume = "76",
-
pages = "158--169",
-
month = nov,
-
keywords = "genetic algorithms, genetic programming,
Interpretable, Reinforcement learning, Model-based,
Symbolic regression, Industrial benchmark",
-
ISSN = "0952-1976",
-
URL = "http://www.sciencedirect.com/science/article/pii/S0952197618301933",
-
DOI = "doi:10.1016/j.engappai.2018.09.007",
-
abstract = "The search for interpretable reinforcement learning
policies is of high academic and industrial interest.
Especially for industrial systems, domain experts are
more likely to deploy autonomously learned controllers
if they are understandable and convenient to evaluate.
Basic algebraic equations are supposed to meet these
requirements, as long as they are restricted to an
adequate complexity. Here we introduce the genetic
programming for reinforcement learning (GPRL) approach
based on model-based batch reinforcement learning and
genetic programming, which autonomously learns policy
equations from pre-existing default state-action
trajectory samples. GPRL is compared to a
straightforward method which uses genetic programming
for symbolic regression, yielding policies imitating an
existing well-performing, but non-interpretable policy.
Experiments on three reinforcement learning benchmarks,
i.e., mountain car, cart-pole balancing, and industrial
benchmark, demonstrate the superiority of our GPRL
approach compared to the symbolic regression method.
GPRL is capable of producing well-performing
interpretable reinforcement learning policies from
pre-existing default trajectory data.",
- }
Genetic Programming entries for
Daniel Hein
Steffen Udluft
Thomas A Runkler
Citations