Surrogate Genetic Programming: A semantic aware evolutionary search

doi:10.1016/j.ins.2014.10.053

Information Sciences

Volume 296, 1 March 2015, Pages 345-359

https://doi.org/10.1016/j.ins.2014.10.053 Get rights and content

Abstract

Many semantic search based on Genetic Programming (GP) use a trial-and-error scheme to attain semantically diverse offspring in the evolutionary search. This results in significant impediments on the success of semantic-based GP in solving real world problems, due to the additional computational overheads incurred. This paper proposes a surrogate Genetic Programming (or sGP in short) to retain the appeal of semantic-based evolutionary search for handling challenging problems with enhanced efficiency. The proposed sGP divides the population into two parts (μ and λ) then it evolves μ percentage of the population using standard GP search operators, while the remaining λ percentage of the population are evolved with the aid of meta-models (or approximation models) that serve as surrogate to the original objective function evaluation (which is computationally intensive). In contrast to previous works, two forms of meta-models are introduced in this study to make the idea of using surrogate in GP search feasible and successful. The first denotes a “Semantic-model” for prototyping the semantic representation space of the GP trees (genotype/syntactic-space). The second is a “Fitness-model”, which maps solutions in the semantic space to the objective or fitness space. By exploiting the two meta-models collectively in serving as a surrogate that replaces the original problem landscape of the GP search process, more cost-effective generation of offspring that guides the search in exploring regions where high quality solutions resides can then be attained. Experimental studies covering three separate GP domains, namely, (1) Symbolic regression, (2) Even n-parity bit, and (3) a real-world Time-series forecasting problem domain involving three datasets, demonstrate that sGP is capable of attaining reliable, high quality, and efficient performance under a limited computational budget. Results also showed that sGP outperformed the standard GP, GP based on random training-set technique, and GP based on conventional data-centric objectives as surrogate.

Introduction

Evolutionary algorithms (EAs) are approaches that take their inspirations from the principles of natural selection and survival of the fittest in the biological kingdom. Among the many variants of EAs, Genetic Programming (GP) is among one of those that have withstood the realms of time with success stories reported in a plethora of real-world applications. In particular, GP has been deemed as capable of providing transparency into how decisions or solutions are made. While a technique to evolve trees was suggested by Cramer in 1985 [6], the field of GP was founded by John Koza in 1992 [16]. GP is a powerful learning engine inspired by biology and natural evolution, for automatically generating working computer programs. Based on Darwins theory of evolution, computer programs are measured against the task they are intended to do and then receive scores accordingly. The programs with the highest scores are considered the fittest. The fittest programs are selected to join the evolutionary process via three standard genetic operators: crossover, mutation and reproduction. These operators aim to amend the programs structure and create new offspring, which will hopefully be better. Computer programs are treated as a collection of functions and inputs for these functions; which can be seen as their genetic material. In the standard representations of GP, programs are represented as trees where the functions appear in the internal nodes of the tree and the inputs for the functions appear on the leaves. This representation flexibility adds an extra advantage to the GP since it can solve complex problems and even comes up with solutions beyond human thoughts in some cases.

In GP, the search space can be viewed from multiple facets; (A) Structural space or Syntactic space or Genotype in which most GP systems operate on, in attempting to evolve/alter the trees, via the genetic operators, hoping to find better ones that translate to desirable behaviours, (B) Phenotype or Semantic space where GP systems try to alter the behaviours of the programs directly, and finally, and (C) Fitness space where individuals are evaluated into numerics that quantify the quality in relation to solving the given problem. In general, it is common for the semantic space of GP to be represented in the form of real number vectors and are defined by the outputs of GP trees when their instructions (or functions) are executed.

In traditional GP, although remarkable success on different real-world problems have been achieved (e.g., [36], [14]), existing systems operate fundamentally within the syntactic space, ignoring the semantic information of the candidate GP trees or programs. Advancing studies on semantic GP have highlighted the potentials of using the semantic information within the search. In particular, incorporations on semantic information of candidate solutions into the GP evolutionary process have recently been reported to generate significant enhancements in the search performances [33], [4], [26]. Generally, the term semantic refers to the chromosomes or individuals’ behaviour (for example, represented in the form of a real-valued vector), while syntactic refers to the structure of the evolved program. In most real-world applications, if not all, there is no obvious relationship between the syntactic space and semantic space. Thus, small changes in the shape of a tree can result in a significant departure in the resultant behaviour and vice versa, any small deviations requested in the program behaviour (or semantic) would need major variations in the tree structure. Due to the complex structure of GP in the syntactic space, i.e., tree-like structure representation, it is hard to identify a suitable syntactic distance measure that correlates well with the fitness landscape. In other words, it is hard to define a distance (or similarity measure) that quantifies the structural similarities between two trees and at the same time quantifies the similarities between their fitness values. Even if such distance measure is available, it is mostly problem-dependent. Generally, it is often deemed easier to determine the distance that correlates fitness landscape between individuals’ semantics (where semantics represented as vectors of real numbers) than between individuals’ syntactic (i.e., tree-like structure representation). This is because it is easier to use a generic distance metrics that applies across problem domains between vectors than trees. For these reasons, it is often argued that the semantic space is easier to search upon, than the syntactic space.

Despite the increasing research interests and potentials of semantic GP, one of the main criticism has been on the excessive slow nature of the approach, attributed not only to the natural mechanisms of evolution which involves an iterative process of candidate solutions that evolves across many generations, but more importantly, the trial-and-error scheme used to facilitate semantically diverse offspring in the search have led to significant increase in the computational resources needed before convergence to credible and reliable results can be attained. In this work, our interest is to retain the appeal of GP algorithms, especially semantic GP, that can handle challenging problems with high quality designs at enhanced computational efficiency.

We present a study on surrogate Genetic Programming or sGP in short. The core characteristics and motivations for proposing the sGP can be summarised as follows: (1) Present Semantic GPs heavily follow a trial-and-error scheme [25], [34], [11], which led to highly computationally intensive search. (2) It is desirable to conduct a search using Semantic GPs for high quality solutions under a limited computational budget. (3) It is non-trivial to map from syntactic to the semantic space [25]. Particularly, in contrast to previous studies on semantic based operators, where efforts have been placed on exploiting individuals’ behaviour to maintain semantic diversity in the search population, our study on sGP in this paper differs in the use of meta-models for exploring the semantic space, so as to enhance search efficiency through eliminating the need to evaluate each computationally intensive candidate solutions completely.

The proposed model of sGP divides the population into two parts μ and λ then it evolves μ portion of the population using standard search operators and the original objective function, while the remaining λ population via the surrogate. On the λ portion of the population, sGP uses two forms of meta-models, namely the “semantic-model” and “fitness-model” as the surrogate that aids in the GP search. The semantic-model prototypes the semantic space of the GP trees (genotype/syntactic-space), while the fitness-model maps the semantic space to the fitness space (more details are given in Section 3). In order to construct the semantic space, it is generally necessary to evaluate the GP programs completely. But this is often deemed as computationally intensive and impractical for many real-world complex applications. By using the semantic-model, on the other hand, solution individuals only need to be partially evaluated and the complete behaviours of the GP trees (i.e., fitness value) is then predicted via the fitness-model. The semantic and fitness-models thus serve as a partial replacement or surrogate of the computationally expensive objective function in semantic GP, leading to enhanced search efficiency. As will be shown in the results section, sGP is capable of achieving good solutions with smaller numbers of function evaluations.

In summary, the contributions of this paper is fourfold: (1) to the best of our knowledge, this is the first successful attempt to apply surrogate model to Semantic GP with tree-like representation, (2) we introduce the notion of semantic-model and fitness-model as surrogate, (3) the use of surrogate model to approximate the semantic search space allows generalisation to a metric space that is non problem-dependent, and last but not least (4) an efficient form of Semantic-aware GP is presented, labelled here as Surrogate GP (or sGP).

The remainder of this papers is organised as follows: Section 2 presents some related works, while Section 3 describes sGP in details, Section 4 presents the results of empirical studies on the proposed framework, and finally Section 5 gives some conclusive remarks and discusses potential directions of future work.

Section snippets

Related work

In this section, the related works on semantic aware operator in GP and a brief review of surrogate modelling in evolutionary computation are presented.

sGP

As mentioned previously, sGP divides the population into two parts. We use the notation μ to denote the percentage of the population that will evolve using standard GP search (i.e., based on the original objective function evaluation) and λ to denote the remaining percentage of the population that will evolve using the surrogate. To this end, in sGP, $μ + λ = 1$ .

Similar to standard GP systems, sGP begins with the initialisation of a random population and evaluates the μ part of the population using

Experimental settings

The aim of the reported experiments is to show that sGP is able utilises the surrogate model to explore the search space effectively under limited small number of evaluations (assuming that the given problem is computationally expensive) and achieve superior solutions in comparison to other versions of GP systems when given exactly the same number of evaluations. Experiments are made to cover three different classic GP problems; (1) six different symbolic regression problems to test sGP on

Conclusions

In this work we propose a new form of GP called sGP. The proposed framework evolves μ percentage of the population using standard search operators and the remaining λ percentage using surrogate models. For the λ part of the population, sGP uses surrogate model as a search operator to produce new offspring. sGP uses two specialised models, namely, semantic-model to map the syntactic space of the problem into semantic space and fitness-model to map the semantic space into fitness space. The

References (40)

K.C. Giannakoglou
Design of optimal aerodynamic shapes using stochastic optimization methods and computational intelligence
Prog. Aerospace Sci.
(2002)
L.E. Zerpa et al.
An optimization methodology of alkaline–surfactant–polymer flooding processes using field scale numerical simulation and multiple surrogates
J. Petrol. Sci. Eng.
(2005)
E. Acar et al.
Ensemble of metamodels with optimized weight factors
Struct. Multidiscip. Optim.
(2009)
J.R. Anderson et al.
(1986)
L. Beadle et al.
Semantically driven crossover in genetic programming
L. Beadle et al.
Semantically driven mutation in genetic programming
A.G. Bors, Introduction of the Radial Basis Function (rbf) Networks, Technical Report, Department of Computer Science,...
N.L. Cramer
A representation for the adaptive generation of simple sequential programs
D.L. Donoho, High-dimensional data analysis: the curses and blessings of dimensionality, in: American Mathematical...
M. Emmerich et al.
Metamodel-assisted evolution strategies
Parallel Problem Solving NaturePPSN VII
(2005)

T. Goel et al.

Ensemble of surrogates

Struct. Multidiscip. Optim.

(2007)

D. Jackson

Promoting phenotypic diversity in genetic programming

Y. Jin

A comprehensive survey of fitness approximation in evolutionary computation

Soft Comput.

(2005)

A. Kattan et al.

Evolving radial basis function networks via gp for estimating fitness values using surrogate models

A. Kattan, R. Poli, Genetic programming as a predictor of data compression saving, in: Evolution Artificielle, 9th...

A. Keane, N. Petruzzelli, Aircraft wing design using GA-based multi-level strategies, in: AIAA/USAF/NASA/ISSMO...

J.R. Koza

Genetic Programming II: Automatic Discovery of Reusable Programs

(1994)

F.H. Lesh

Muti-dimensional least-squares polynomial curve fitting

Commun. ACM

(1959)

D. Lim et al.

Generalizing surrogate-assisted evolutionary computation

IEEE Trans. Evol. Comput.

(2010)

D. Lim, Y.S. Ong, Y. Jin, B. Sendhoff, A study on metamodelling techniques, ensembles, and muti-surrogates in...

Cited by (27)

A structural reanalysis assisted harmony search for the optimal design of structures
2022, Computers and Structures
Structural optimization usually requires a large amount of time-consuming structural analyses to evaluate the feasibility of the designs, which obstructs its application in large-scale problems. This study proposed a structural reanalysis-assisted harmony search (RAHS) algorithm for structural optimization. The RAHS utilizes the combined approximation (CA) technique instead of direct finite element analysis to evaluate the performances of the modified structures generated in iterations. Different from existing approaches, the proposed RAHS will update the reanalysis design point set and select the best reanalysis design point based on the similarity of the design vectors to improve the accuracy of the CA in a large search space. The RAHS also develops a diversity-based error criterion to accommodate the accuracy requirements of CA in different search phases of the harmony search. Five benchmark examples are utilized to examine the performances of the proposed RAHS. The results demonstrate that the Cosine distance outperforms the other four design point selection approaches, and the diversity-based error criterion can significantly reduce the computational cost. Compared with the penalty-based method and the improved Deb rule, the number of FE-based structural analyses can be lessened by about 90% and 50%, respectively, while yielding similar optimal designs.
A surrogate-ensemble assisted expensive many-objective optimization
2021, Knowledge-Based Systems
Citation Excerpt :
The performances of all these proposed approaches are good for solving the many-objective problems, however, a large number of objective evaluations are required before locating at the Pareto optimal front, which makes these algorithms impractical to be applied for solving computationally expensive many-objective problems. Surrogate models, also called meta-models or approximation model, such as polynomial response surface methodology (PR) [17], radial basis function (RBF) [18], Gaussian process model (GP) [19] (also known as Kriging model), artificial neural networks (ANN) [20], and support vector machines (SVM) [21], are commonly used to assist evolutionary algorithms to solve computationally expensive problems. Literature reviews show that most surrogate-assisted evolutionary algorithms have been proposed for solving time-consuming single-objective problems, in which the models can be classified into two categories based on the contributions they play in the algorithms, i.e., global and local surrogate models.
The surrogate ensemble has been shown to have a good performance for assisting the evolutionary algorithms to solve computationally expensive single-objective problems. However, it has not been paid much attention to assist in multi-/many-objective optimization. In this paper, we propose to train multiple surrogate models to assist many-objective optimization algorithm for solving expensive many-objective problems. In the proposed method, some non-dominated solutions, which have been evaluated using the expensive objectives, will be utilized together with the current population to generate the offspring in order to prevent the population from moving forward to a bad region. In the model management, the way to determine the method used for selecting individuals to be evaluated using the exact expensive objectives depends on the convergence degree of the current population. A new method to measure the uncertainty is also proposed in our method by considering the distance to the samples in the decision space and the approximation variance in the objective space. The experimental results on a number of benchmark problems show that our proposed method is competitive to two state-of-the-art surrogate-assisted evolutionary algorithms for solving expensive many-objective problems in a limited computational budget.
Truncation-learning-driven surrogate assisted social learning particle swarm optimization for computationally expensive problem
2020, Applied Soft Computing Journal
Surrogate-assisted evolutionary optimization greatly slashes the computational burden of evolutionary algorithms for computationally expensive problems. However, new issues arise concerning the compatibility and fault tolerance of surrogates, evolutionary learning operators, and problem property. To this end, this paper proposes a truncation-learning-driven surrogate assisted social learning particle swarm optimizer (TL-SSLPSO) to coordinate these three ingredients. For avoiding and correcting the deceptions induced by the low confidence exemplars due to the surrogate in behavior learning, TL-SSLPSO equally segments the iterative population into multiple sub-populations with different fitness levels and selects exemplars from the randomly selected high-level sub-populations for the behavior learning of low-level sub-population, while truncating the behavior learning of the highest-level sub-population composed of some of the best approximated or real evaluated particles and retaining the sub-population directly to the next generation. Besides, a greedy sampling strategy is employed to find promising solutions with better fitness versus the global best to complement the truncation learning. Extensive experiments on twenty-four widely used benchmark problems and a stepped cantilever beam design problem with 17 steps are conducted to assess the effectiveness of cooperation between truncation learning and greedy sampling and comparisons with several state-of-the-art algorithms demonstrate the superiority of the proposed method.
Boosting evolutionary optimization via fuzzy-classification-assisted selection
2020, Information Sciences
Citation Excerpt :
In order to boost the efficiency of EAs by reducing the number of FEs, model-assisted selection strategies are of interest [44]. Among them, the surrogate (meta) model-based approaches [12,16,17,34,36] are the most popular, where the surrogate (meta) models are built to approximate the original problems to guide the optimization. However, solution selection can be treated as a classification problem in EAs, which means the selected solutions and discarded solutions belong to different classes.
In evolutionary optimization, solution selection is an important operator since it will be normally used to decide the optimization direction via determining new solutions. Most selection methods are objective fitness-based approaches which will lead to a waste of fitness evaluations. This is because some evaluated but unpromising solutions are discarded without contributing useful search information. We are, thus, motivated to treat the solution selection as a classification procedure, where the selected solutions and discarded solutions belong to different classes. However, another problem is that the difference between ‘promising’ and ‘unpromising’ solutions becomes fuzzy when iterations go on. Therefore, we employ fuzzy classification to predict the categories of solutions by the fuzzy membership function. And then the predicted results are used to assist solution selection to reduce the number of fitness evaluations. Finally, we propose a fuzzy-classification-assisted selection (FCAS) strategy to boost evolutionary optimization. FCAS is experimentally integrated into two state-of-the-art algorithms and studied on three test suites. The results reveal the efficiency of FCAS for boosting evolutionary optimization.
A generation-based optimal restart strategy for surrogate-assisted social learning particle swarm optimization
2019, Knowledge-Based Systems
Evolutionary algorithm provides a powerful tool to the solution of modern complex engineering optimization problems. In general, a great deal of evaluation effort often requires to be made in evolutionary optimization to locate a reasonable optimum. This poses a serious challenge to extend its application to computationally expensive problems. To alleviate this difficulty, surrogate-assisted evolutionary algorithms (SAEAs) have drawn great attention over the past decades. However, in order to ensure the performance of SAEAs, the use of appropriate model management is indispensable. This paper proposes a generation-based optimal restart strategy for a surrogate-assisted social learning particle swarm optimization (SL-PSO). In the proposed method, the SL-PSO restarts every few generations in the global radial-basis-function model landscape, and the best sample points archived in the database are employed to reinitialize the swarm at each restart. Promising individual with the best estimated fitness value is chosen for exact evaluation before each restart of the SL-PSO. The proposed method skillfully integrates the restart strategy, generation-based and individual-based model managements into a whole, whilst those three ingredients coordinate with each other, thus offering a powerful optimizer for the computationally expensive problems. To assess the performance of the proposed method, comprehensive experiments are conducted on a benchmark test suit of dimensions ranging from 10 to 100. Experimental results demonstrate that the proposed method shows superior performance in comparison with four state-of-the-art algorithms in a majority of benchmarks when only a limited computational budget is available.
A complete expected improvement criterion for Gaussian process assisted highly constrained expensive optimization
2019, Information Sciences
Citation Excerpt :
A lot of efforts and progress have been made in developing the surrogate-based EAs. Many machine learning models can be utilized to build surrogates, including: Gaussian process (GP) [22], multivariate polynomials (particularly quadratic models) [13], artificial neural networks [17,33], radial basis function (RBF) [23,41,45], support vector machines (SVMs) [16] and hybrid models [5,24,27]. A comprehensive survey with detailed discussions of the existing techniques for SAEAs can be found in [20].
Expected improvement (EI) is a popular infill criterion in Gaussian process assisted optimization of expensive problems for determining which candidate solution is to be assessed using the expensive evaluation method. An EI criterion for constrained expensive optimization (constrained EI) has also been suggested, which requires that feasible solutions exist in the candidate solutions. However, the constrained EI criterion will fail to work in case there are no feasible solutions. To address the above issue, this paper proposes a new EI criterion for highly constrained optimization that can work properly even when no feasible solution is available in the current population. The proposed constrained EI criterion can not only exploit local feasible regions, but also explore infeasible yet promising regions, making it a complete constrained EI criterion. The complete constrained EI is theoretically validated and empirically verified. Simulation results demonstrate that the proposed complete constrained EI is better than or comparable to five existing infill criteria.

View all citing articles on Scopus

View full text

Surrogate Genetic Programming: A semantic aware evolutionary search

Abstract

Introduction

Section snippets

Related work

sGP

Experimental settings

Conclusions

Prog. Aerospace Sci.

J. Petrol. Sci. Eng.

Ensemble of metamodels with optimized weight factors

Struct. Multidiscip. Optim.

Semantically driven crossover in genetic programming

Semantically driven mutation in genetic programming

A representation for the adaptive generation of simple sequential programs

Metamodel-assisted evolution strategies

Parallel Problem Solving NaturePPSN VII

Ensemble of surrogates

Struct. Multidiscip. Optim.

Promoting phenotypic diversity in genetic programming

A comprehensive survey of fitness approximation in evolutionary computation

Soft Comput.

Evolving radial basis function networks via gp for estimating fitness values using surrogate models

Genetic Programming II: Automatic Discovery of Reusable Programs

Muti-dimensional least-squares polynomial curve fitting

Commun. ACM

Generalizing surrogate-assisted evolutionary computation

IEEE Trans. Evol. Comput.