A comprehensive study on symbolic expressions for fault detection-classification in photovoltaic farms
Created by W.Langdon from
gp-bibliography.bib Revision:1.8414
- @Article{Andelic:2025:apenergy,
-
author = "Nikola Andelic and Sandi {Baressi Segota} and
Vedran Mrzljak",
-
title = "A comprehensive study on symbolic expressions for
fault detection-classification in photovoltaic farms",
-
journal = "Applied Energy",
-
year = "2025",
-
volume = "383",
-
pages = "125370",
-
keywords = "genetic algorithms, genetic programming, Data
preprocessing and oversampling, Genetic programming
symbolic classifier, Random hyperparameter value search
method, Photovoltaic farms fault detection and
classification, Threshold based voting ensemble",
-
ISSN = "0306-2619",
-
URL = "
https://www.sciencedirect.com/science/article/pii/S030626192500100X",
-
DOI = "
doi:10.1016/j.apenergy.2025.125370",
-
abstract = "Large-scale photovoltaic (solar) farms play a crucial
role in harnessing solar energy for electricity
generation through photovoltaic (PV) technology.
However, the control and management of such systems
pose significant challenges, particularly in fault
detection. This paper introduces the application of a
genetic programming symbolic classifier (GPSC) to a
publicly available dataset for fault detection in
photovoltaic farms. Given the imbalanced nature of the
original dataset, the study necessitated the
application of oversampling techniques to achieve a
balanced representation of class samples. Additionally,
the impact of scaling and normalizing techniques on the
performance of the GPSC was thoroughly investigated.
The GPSC was systematically applied to each scaled or
normalised balanced dataset variation, and its
hyperparameters were fine-tuned using a random
hyperparameter values search (RHVS) method. The
algorithm underwent training, via a 5-fold
cross-validation (5FCV) process, and the best symbolic
expressions (SEs) were determined based on accuracy,
area under the receiver operating characteristics
curve, precision, recall, and F1-score. The research
yielded many SEs, which were used to develop a
threshold-based voting ensemble (TBVE). The TBVE for
each class was tested on the initial dataset and the
threshold was finely tuned to achieve even higher
classification performance in photovoltaic
detection/classification. Results demonstrated that
this approach produced highly accurate TBVE for each
class (accuracy in the majority of cases equal to 1.0),
showcasing the effectiveness of the GPSC and TBVE in
fault detection/classification for photovoltaic farms",
- }
Genetic Programming entries for
Nikola Andelic
Sandi Baressi Segota
Vedran Mrzljak
Citations