Hyperparameter optimized classification pipeline for handling unbalanced urban and rural energy consumption patterns
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @Article{KUMARPANDA:2023:eswa,
-
author = "Deepak {Kumar Panda} and Saptarshi Das and
Stuart Townley",
-
title = "Hyperparameter optimized classification pipeline for
handling unbalanced urban and rural energy consumption
patterns",
-
journal = "Expert Systems with Applications",
-
volume = "214",
-
pages = "119127",
-
year = "2023",
-
ISSN = "0957-4174",
-
DOI = "doi:10.1016/j.eswa.2022.119127",
-
URL = "https://www.sciencedirect.com/science/article/pii/S0957417422021455",
-
keywords = "genetic algorithms, genetic programming, Residential
energy consumption, Unbalanced data classification, ROC
curve",
-
abstract = "Energy consumer locations are required for framing
effective energy policies. However, due to privacy
concerns, it is becoming increasingly difficult to
obtain the locational data of the consumers. Machine
learning (ML) based classification strategies can be
used to find the locational information of the
consumers based on their historical energy consumption
patterns. The ML methods in this paper are applied to
the Residential Energy Consumption Survey 2009 dataset.
In this dataset, the number of consumers in the urban
area is higher than the rural area, thus making the
classification problem unbalanced. The unbalanced
classification problem has been solved in original and
transformed or reduced feature space using Monte Carlo
based under-sampling of the majority class datapoints.
The hyperparameters for each classification algorithm
family is represented as an optimized pipeline,
obtained using the genetic programming (GP) optimizer.
The classification performance metrics are then
obtained for different algorithm families on the
original and transformed feature spaces. Performance
comparisons have been reported using univariate and
bivariate distributions of the classification metrics
viz. accuracy, geometric mean score (GMS), F1 score,
precision, area under the curve (AUC) of receiver
operator characteristics (ROC). The energy policy
aspects for the urban and rural residential consumers
based on the classification results have also been
discussed",
- }
Genetic Programming entries for
Deepak Kumar Panda
Saptarshi Das
Stuart Townley
Citations