Zobrist hash-based duplicate detection in symbolic regression
Created by W.Langdon from
gp-bibliography.bib Revision:1.8880
- @Article{Burlacu:2026:RSTA,
-
author = "Bogdan Burlacu",
-
title = "Zobrist hash-based duplicate detection in symbolic
regression",
-
journal = "Philosophical Transactions of the Royal Society A:
Mathematical, Physical and Engineering Sciences",
-
year = "2026",
-
volume = "384",
-
number = "2317",
-
pages = "20240594",
-
month = "9 " # apr,
-
keywords = "genetic algorithms, genetic programming, Operon,
symbolic regression, Zobrist hash, fitness cache, tree
hash, genetic diversity, artificial intelligence, AI,
computer modelling and simulation, software",
-
ISSN = "1364-503X",
-
URL = "
https://royalsocietypublishing.org/rsta/article-pdf/doi/10.1098/rsta.2024.0594/6131762/rsta.2024.0594.pdf",
-
DOI = "
10.1098/rsta.2024.0594",
-
size = "15 pages",
-
abstract = "Symbolic regression encompasses a family of search
algorithms that aim to discover the best-fitting
function for a set of data without requiring an a
priori specification of the model structure. The most
successful and commonly used technique for symbolic
regression is genetic programming (GP), an evolutionary
search method that evolves a population of mathematical
expressions through the mechanism of natural selection.
we analyse the efficiency of the evolutionary search in
GP and show that many points in the search space are
revisited and re-evaluated multiple times by the
algorithm, leading to wasted computational effort. We
address this issue by introducing a caching mechanism
based on the Zobrist hash, a type of hashing frequently
used in abstract board games for the efficient
construction and subsequent update of transposition
tables. We implement our caching approach using the
open-source framework Operon and demonstrate its
performance on a selection of real-world regression
problems, where we observe up to 34 percent speedups
without any detrimental effects on search quality. The
hashing approach represents a straightforward way of
improving runtime performance while also offering some
interesting possibilities for adjusting the search
strategy based on cached information.",
-
notes = "part of the discussion meeting issue Symbolic
regression in the physical sciences
\cite{Bartlett:2026:RSTAintro}",
- }
Genetic Programming entries for
Bogdan Burlacu
Citations