Cooling load prediction of a double-story terrace house using ensemble learning techniques and genetic programming with SHAP approach
Created by W.Langdon from
gp-bibliography.bib Revision:1.8414
- @Article{Cakiroglu:2024:enbuild,
-
author = "Celal Cakiroglu and Yaren Aydin and Gebrail Bekdas and
Umit Isikdag and Aidin Nobahar Sadeghifam and
Laith Abualigah",
-
title = "Cooling load prediction of a double-story terrace
house using ensemble learning techniques and genetic
programming with {SHAP} approach",
-
journal = "Energy and Buildings",
-
year = "2024",
-
volume = "313",
-
pages = "114254",
-
keywords = "genetic algorithms, genetic programming, Cooling load,
BIM, Energy efficiency, Predictive modeling",
-
ISSN = "0378-7788",
-
URL = "
https://www.sciencedirect.com/science/article/pii/S0378778824003700",
-
DOI = "
doi:10.1016/j.enbuild.2024.114254",
-
abstract = "Since the cooling systems used in buildings in hot
climates account for a significant portion of the
energy consumption, it is very important for both
economy and environment to accurately predict the
cooling load and consider it in building designs. This
study aimed to maximize energy efficiency by
appropriately selecting the features of a building that
affect its cooling load. To this end, data-driven,
accurate, and accessible tools were developed that
enable the prediction of the cooling load of a building
by practitioners. The study involves simulating the
energy consumption of a mid-rise, double-story terrace
house in Malaysia using building information modelling
(BIM) and estimating the cooling load using ensemble
machine learning models and genetic programming.
Categorical Boosting (CatBoost), eXtreme Gradient
Boosting (XGBoost), Light Gradient Boosting Machine
(LightGBM), and Random Forest (RF) models have been
developed and made available as an online interactive
graphical user interface on the Streamlit platform.
Furthermore, the symbolic regression technique has been
used to obtain a closed-form equation that predicts the
cooling load. The dataset used for training the
predictive models comprised 94,310 data points with 10
input variables and the cooling load as the output
variable. Performance metrics such as the coefficient
of determination (R2), root mean squared error (RMSE),
and mean absolute error (MAE) were used to measure the
predictive model performances. The results of the
machine learning models indicated successful
prediction, with the CatBoost model achieving the
highest score (R2 = 0.9990) among the four ensemble
models and the predictive equation. The SHAP analysis
determined the aspect ratio of the building as the most
impactful feature of the building",
-
notes = "See also \cite{Cakiroglu:2024:enbuild_Corrigendum}",
- }
Genetic Programming entries for
Celal Cakiroglu
Yaren Aydin
Gebrail Bekdas
Umit Isikdag
Aidin Nobahar Sadeghifam
Laith Abualigah
Citations