Evaluation of water quality indexes with novel machine learning and SHapley Additive ExPlanation (SHAP) approaches
Created by W.Langdon from
gp-bibliography.bib Revision:1.8129
- @Article{ALDREES:2024:jwpe,
-
author = "Ali Aldrees and Majid Khan and
Abubakr Taha Bakheit Taha and Mujahid Ali",
-
title = "Evaluation of water quality indexes with novel machine
learning and {SHapley} Additive {ExPlanation} ({SHAP)}
approaches",
-
journal = "Journal of Water Process Engineering",
-
volume = "58",
-
pages = "104789",
-
year = "2024",
-
ISSN = "2214-7144",
-
DOI = "doi:10.1016/j.jwpe.2024.104789",
-
URL = "https://www.sciencedirect.com/science/article/pii/S2214714424000199",
-
keywords = "genetic algorithms, genetic programming, Gene
expression programming, Water quality indexes, ANN,
Deep neural networks, Optimizable Gaussian process
regressor, SHAP",
-
abstract = "Water quality indexes (WQI) are pivotal in assessing
aquatic systems. Conventional modeling approaches rely
on extensive datasets with numerous unspecified inputs,
leading to time-consuming WQI assessment procedures.
Numerous studies have used machine learning (ML)
methods for WQI analysis but often lack model
interpretability. To address this issue, this study
developed five interpretable predictive models,
including two gene expression programming (GEP) models,
two deep neural networks (DNN) models, and one
optimizable Gaussian process regressor (OGPR) model for
estimating electrical conductivity (EC) and total
dissolved solids (TDS). For the model development, a
total of 372 records on a monthly basis were collected
in the Upper Indus River at two outlet stations. The
efficacy and accuracy of the models were assessed using
various statistical measures, such as correlation (R),
mean square error (MAE), root mean square error (RMSE),
and 5-fold cross-validation. The DNN2 model
demonstrated outstanding performance compared to the
other five models, exhibiting R-values closer to 1.0
for both EC and TDS. However, the genetic
programming-based models, GEP1 and GEP2, exhibited
comparatively lower accuracy in predicting the water
quality indexes. The SHapely Additive exPlanation
(SHAP) analysis revealed that bicarbonate, calcium, and
sulphate jointly contribute approximately 78 percent to
EC, while the combined presence of sodium, bicarbonate,
calcium, and magnesium accounts for around 87 percent
of TDS in water. Notably, the influence of pH and
chloride was minimal on both water quality indexes. In
conclusion, the study highlights the cost-effective and
practical potential of predictive models for EC and TDS
in assessing and monitoring river water quality",
- }
Genetic Programming entries for
Ali Aldrees
Majid Khan
Abubakr Taha Bakheit Taha
Mujahid Ali
Citations