Gene expression programming as a basis for new generation of electricity demand prediction models

https://doi.org/10.1016/j.cie.2014.05.010Get rights and content

Highlights

  • Gene expression programming is presented for the estimation of the electricity demand.

  • Annual data collected through years 1986–2009 in Thailand are used to develop the model.

  • GEP significantly outperform the existing electricity demand prediction methods.

Abstract

This study proposes a new gene expression programming (GEP) approach for the prediction of electricity demand. The annual population, gross domestic product, stock index, and total revenue from exporting industrial products were used to predict the electricity demand of the same year in Thailand. Several statistical criteria were used to verify the validity of the model. Further, the contributions of the influencing variables to the prediction of the electricity demand were analyzed. Correlation coefficient, root mean squared error and mean absolute percent error were used to evaluate the performance of the model. In addition to its high accuracy, the derived model outperforms regression and other soft computing-based models.

Introduction

Providing precise estimation of the electricity demand has been always an important concern for energy management. Several methods have been developed to cope with this issue such as autoregressive integrated moving average (ARIMA), multiple linear regression (MLR) and machine learning techniques (Azadeh et al., 2008b, Kandananond, 2011, Mostafavi, Mostafavi et al., 2013). ARIMA is mostly employed for the modeling of stochastic disturbances in time series analysis (Box & Jenkins, 1970). This technique has been applied to different energy problem such as prediction of the customer short term load and domestic electric energy consumption (Abdel-Aal and Al-Garni, 1997, Cho et al., 1995, Ediger and Akar, 2007). Moreover, several researchers used double seasonal ARIMA for electricity demand forecasting (Mohamed et al., 2010, Suhartono and Endharta, 2009, Taylor, 2003, Taylor, 2010). Besides, conventional approaches such as MLR are widely used for the energy demand estimation (Kandananond, 2011). However, ARIMA, MLR and other statistical analyses are based on defining the linear or nonlinear structure of the model in advance, which is not always true (Alavi and Gandomi, 2011b, Mostafavi, Mostafavi et al., 2013).

Soft computing techniques are considered as alternative to traditional methods for tackling real world problems. They automatically learn from experience and extract various discriminators (Mitchell, 1997). Artificial neural networks (ANNs) are one of the widely used branches of soft computing. ANNs and other soft computing techniques have been successfully applied to different real world problems including industrial engineering and energy management (e.g., Abdel-Aal et al., 1997, Miranda et al., 1998, Pino et al., 2000, Tien Pao, 2007, Pino et al., 2008, Abdel-Aal, 2008, Alavi and Gandomi, 2011a, Chiang and Roy, 2012, Currie, 1992, Daim et al., 2010, Moghrabi and Eid, 1998, Sharma and Srinivasan, 2013, Srinivasan, 2008, Tsujimura et al., 1997). ANNs have been used to predict the electricity demand in different countries such as Taiwan (Hsu & Chen, 2003), Ireland (Ringwood, Bofelli, & Murray, 2001), Spain (Catalao et al., 2007, González-Romera et al., 2007), Saudi Arabia (Abdel-Aal, 2008), and Iran (Azadeh, Ghaderi et al., 2007, Azadeh et al., 2008a, Azadeh et al., 2008b, Azadeh, Ghaderi et al., 2007, Azadeh, Saberi et al., 2008c, Kheirkhah et al., 2013). Support vector machine (SVM) is another soft computing technique that has been successfully applied to predict the electric consumption (Fan and Chen, 2006, Hong, 2009). Despite their good performance, ANNs and SVM are considered as black-box models. That is, they are not capable of generating practical prediction equations. The structure of ANNs should be defined in advance, which limits their practicability (Alavi and Gandomi, 2011b, Gandomi and Alavi, 2012, Mostafavi, Mostafavi et al., 2013, Mostafavi, Saeedi, et al., 2013, Yang et al., 2012).

In order to cope with the limitations of the existing methods, a robust soft computing approach, namely genetic programming (GP) is introduced (Koza, 1992). In fact, GP uses the principle of Darwinian natural selection to generate computer programs for solving a problem. GP has several advantages over the conventional and ANN techniques. A notable feature of GP is that it can produce practical prediction equations without a need to pre-define the form of the existing relationship (Alavi, Ameri, et al., 2011, Alavi and Gandomi, 2012, Alavi, Gandomi et al., 2011, Can and Heavey, 2011, Can and Heavey, 2012, Gandomi and Alavi, 2011, Gandomi and Alavi, 2013a, Mostafavi, Saeedi, et al., 2013, Tay and Ho, 2008). GP and its variants have been shown to be powerful tools for the electricity demand prediction (Bhattacharya et al., 2001, Lee et al., 1997). Gene expression programming (GEP) (Ferreira, 2001) is a recent extension to GP. GEP creates computer programs of different sizes and shapes encoded in linear chromosomes of fixed length. The numerical experiments proved the superiority of GEP to similar techniques (Gandomi and Alavi, 2013b, Oltean and Grosşan, 2003). In contrast with classical GP and ANNs, application of GEP in the field of energy conversion and management is new and original.

This study presents GEP as a new approach to develop a new generation of electricity demand prediction models. The data for total electricity demand in Thailand from 1986 to 2009 were used for the model development. The results provided by the developed GEP model were further compared with those obtained by other existing methods. The paper is organized as follows: Section 2 presents brief descriptions of the GEP technique. Section 3 outlines the model development using GEP and reviews the results. The detailed performance analysis of the proposed model is discussed in Section 4. The results of the sensitivity analysis are given in Section 5. Finally, concluding remarks are outlined in Section 6.

Section snippets

Gene expression programming

GP is an extension of genetic algorithms (GA) where the solutions are computer programs rather than fixed length binary strings (Mostafavi, Saeedi, et al., 2013). GP automatically generates computer models based on the rules of natural genetic evolution. While GA creates a string of numbers to represent the solution, the GP solutions are computer programs represented as tree structures and expressed in a functional programming language. GP optimizes a population of computer programs according

Methodology

The steps followed by the soft computing techniques to find optimal models are generally similar. A methodology similar to that successfully used in previously published studies was considered to derive a precise GEP-based prediction model for the electricity demand (Azadeh et al., 2008b, Mostafavi, Mostafavi et al., 2013, Mostafavi, Saeedi, et al., 2013). The steps followed to derive the model were as follows:

  • I.

    The input variables affecting the electricity demand were selected.

  • II.

    Annual energy data

Performance analysis

Smith (1986) suggested the following criteria for judging performance of a model:

  • if a model gives |R| > 0.8, a strong correlation exists between the predicted and measured values.

In all cases, the error values (e.g., RMSE, MAPE) should be at the minimum. It can be observed from Fig. 6 that the GEP model provides very good predictions both for the training (RTraining = 0.998, RMSETraining = 703.461, MAPETraining = 2.30) and testing (Rtesting = 0.997, RMSEtesting = 360.761, MAPEtesting = 0.50) data. Besides,

Sensitivity analysis

In order to evaluate the importance of the input parameters to the prediction of the the electricity demand, their frequency values (Gandomi et al., 2011) were obtained. A frequency value equal to 100 for an input indicates that this input variable has been appeared in 100% of the best thirty programs evolved by GEP (Alavi, Ameri, et al., 2011, Alavi, Gandomi et al., 2011). The frequency values of the predictor variables are presented in Fig. 8. According to this figure, the electricity demand

Conclusion

Modeling of the electricity demand is often a complex task. To cope with this difficulty, alternative methods such as soft computing techniques can be used. This study presents a novel application of GEP for the empirical modeling of the electricity demand in Thailand based on historical data from 1986 to 2009. This case study illustrated the success of the GEP technique for the prediction of the electricity demand. The validation of the derived model was verified using different criteria.

References (68)

  • A. Azadeh et al.

    Improved estimation of electricity demand function by integration of fuzzy system and data mining approach

    Energy Conversion and Management

    (2008)
  • B. Can et al.

    Comparison of experimental designs for simulation-based symbolic regression of manufacturing systems

    Computers & Industrial Engineering

    (2011)
  • B. Can et al.

    A comparison of genetic programming and artificial neural networks in metamodeling of discrete-event simulation models

    Computers & Operations Research

    (2012)
  • J.P.S. Catalao et al.

    Short-term electricity prices forecasting in a competitive market: A neural network approach

    Electric Power Systems Research

    (2007)
  • T.A. Chiang et al.

    An intelligent benchmark-based design for environment system for derivative electronic product development

    Computers in Industry

    (2012)
  • K.R. Currie

    An intelligent grouping algorithm for cellular manufacturing

    Computers & Industrial Engineering

    (1992)
  • T.U. Daim et al.

    Developing Oregon’s renewable energy portfolio using fuzzy goal programming model

    Computers & Industrial Engineering

    (2010)
  • V.S. Ediger et al.

    ARIMA forecasting of primary energy demand by fuel in Turkey

    Energy Policy

    (2007)
  • A.H. Gandomi et al.

    Multi-stage genetic programming: A new strategy to nonlinear system modeling

    Information Sciences

    (2011)
  • A.H. Gandomi et al.

    A new prediction model for load capacity of castellated steel beams

    Journal of Constructional Steel Research

    (2011)
  • A. Golbraikh et al.

    Beware of q2

    Journal of Molecular Graphics and Modelling

    (2002)
  • E. González-Romera et al.

    Forecasting of the electric energy demand trend and monthly fluctuation with neural networks

    Computers & Industrial Engineering

    (2007)
  • W.C. Hong

    Electric load forecasting by support vector model

    Applied Mathematical Modelling

    (2009)
  • C.C. Hsu et al.

    Regional load forecasting in Taiwan–Applications of artificial neural networks

    Energy Conversion and Management

    (2003)
  • A. Kheirkhah et al.

    Improved estimation of electricity demand function by using of artificial neural network, principal component analysis and data envelopment analysis

    Computers & Industrial Engineering

    (2013)
  • D.G. Lee et al.

    Genetic programming model for long-term forecasting of electric power demand

    Electric Power Systems Research

    (1997)
  • V. Miranda et al.

    Evolutionary computation in power systems

    International Journal of Power & Energy Systems

    (1998)
  • C. Moghrabi et al.

    Modeling users through an expert system and a neural network

    Computers & Industrial Engineering

    (1998)
  • E.S. Mostafavi et al.

    A novel machine learning approach for the estimation of electricity demand

    Energy Conversion and Management

    (2013)
  • E.S. Mostafavi et al.

    A hybrid computational approach to estimate solar global radiation: An empirical evidence from Iran

    Energy

    (2013)
  • R. Pino et al.

    Forecasting next-day price of electricity in the Spanish energy market using artificial neural networks

    Engineering Applications of Artificial Intelligence

    (2008)
  • V. Sharma et al.

    A hybrid intelligent model based on recurrent neural networks and excitable dynamics for price prediction in deregulated electricity market

    Engineering Applications of Artificial Intelligence

    (2013)
  • D. Srinivasan

    Energy demand prediction using GMDH networks

    Neurocomputing

    (2008)
  • J.C. Tay et al.

    Evolving dispatching rules using genetic programming for solving multi-objective flexible job-shop problems

    Computers & Industrial Engineering

    (2008)
  • Cited by (55)

    • Real-time CO<inf>2</inf> emissions estimation in Spain and application to the COVID-19 pandemic

      2021, Journal of Cleaner Production
      Citation Excerpt :

      A useful review can be found in Table 1 from reference Hong et al. (2018). Most of those techniques are based on Evolutionary Algorithms (Karabulut et al., 2008; Mousavi et al., 2014; Fang et al., 2018), although Artificial Networks are also very popular on this domain (Behrang et al., 2011; Kankal et al., 2011; Ardakani and Ardehali, 2014; Guo et al., 2018; Heydari et al., 2019). Fewer references can be found on Support Vector Machines (Sun and Liu, 2016; Saleh et al., 2016; Ahmadi et al., 2019) or Regressions (Köne and Büke, 2010; Azadeh et al., 2017; Hosseini et al., 2019).

    View all citing articles on Scopus

    This manuscript was processed by Area Editor Mitsuo Gen.

    View full text