Predicting the Compression Index of Clayey Soils Using a Hybrid Genetic Programming and XGBoost Model
Created by W.Langdon from
gp-bibliography.bib Revision:1.8528
- @Article{baghbani:2025:AS,
-
author = "Abolfazl Baghbani and Katayoon Kiany and
Hossam Abuel-Naga and Yi Lu",
-
title = "Predicting the Compression Index of Clayey Soils Using
a Hybrid Genetic Programming and {XGBoost} Model",
-
journal = "Applied Sciences",
-
year = "2025",
-
volume = "15",
-
number = "4",
-
pages = "Article No. 1926",
-
keywords = "genetic algorithms, genetic programming",
-
ISSN = "2076-3417",
-
URL = "
https://www.mdpi.com/2076-3417/15/4/1926",
-
DOI = "
doi:10.3390/app15041926",
-
abstract = "The accurate prediction of the compression index (Cc)
is crucial for understanding the settlement behaviour
of clayey soils, which is a key factor in geotechnical
design. Traditional empirical models, while widely
used, often fail to generalise across diverse soil
conditions due to their reliance on simplified
assumptions and regional dependencies. This study
proposed a novel hybrid method combining Genetic
Programming (GP) and XGBoost methods. A large database
(including 385 datasets) of geotechnical properties,
including the liquid limit (LL), the plasticity index
(PI), the initial void ratio (e0), and the water
content (w), was used. The hybrid GP-XGBoost model
achieved remarkable predictive performance, with an R2
of 0.966 and 0.927 and mean squared error (MSE) values
of 0.001 and 0.001 for training and testing datasets,
respectively. The mean absolute error (MAE) was also
exceptionally low at 0.030 for training and 0.028 for
testing datasets. Comparative analysis showed that the
hybrid model outperformed the standalone GP (R2 =
0.934, MSE = 0.003) and XGBoost (R2 = 0.939, MSE =
0.002) models, as well as traditional empirical methods
such as Terzaghi and Peck (R2 = 0.149, MSE = 0.090).
Key findings highlighted that the initial void ratio
and water content are the most influential predictors
of Cc, with feature importance scores of 0.55 and 0.27,
respectively. The novelty of the proposed method lies
in its ability to combine the interpretability of GP
with the computational efficiency of XGBoost and
results in a robust and adaptable predictive tool. This
hybrid approach has the potential to advance
geotechnical engineering practices by providing
accurate and interpretable models for diverse soil
profiles and complex site conditions.",
-
notes = "also known as \cite{app15041926}",
- }
Genetic Programming entries for
Abolfazl Baghbani
Katayoon Kiany
Hossam Abuel-Naga
Yi Lu
Citations