Seismic indicators based earthquake predictor system using Genetic Programming and AdaBoost classification

https://doi.org/10.1016/j.soildyn.2018.04.020Get rights and content

Highlights

  • Three seismic regions are considered for earthquake prediction research.

  • Seismic indicators, representing the geophysical state of the regions are computed.

  • Genetic Programming coupled with AdaBoost is proposed to model indicators with earthquakes.

  • Prediction results are computed and compared with the available models.

Abstract

In this study an earthquake predictor system is proposed by combining seismic indicators along with Genetic Programming (GP) and AdaBoost (GP-AdaBoost) based ensemble method. Seismic indicators are computed through a novel methodology in which, the indicators are computed to obtain maximum information regarding seismic state of the region. The computed seismic indicators are used with GP-AdaBoost algorithm to develop an Earthquake Predictor system (EP-GPBoost). The setup has been arranged to provide predictions for earthquakes of magnitude 5.0 and above, fifteen days prior to the earthquake. The regions of Hindukush, Chile and Southern California are considered for experimentation. The EP-GPBoost has produced noticeable improvement in earthquake prediction due to collaboration of strong searching and boosting capabilities of GP and AdaBoost, respectively. The earthquake predictor system shows enhanced results in terms of accuracy, precision and Matthews Correlation Coefficient for the three considered regions in comparison to contemporary results.

Introduction

Earthquakes are highly feared natural catastrophic events that pose threat to human lives and cause economic damages. The early predictions of such damaging events have a potential for saving human lives and diminish financial losses. Earthquake Predictor System (EPS) aims to generate an alarm about the occurrence of earthquakes [1]. The seismic indicators based EPS aims to predict earthquakes fifteen days prior to earthquake. The seismic indicators are computed using the temporal sequence of past earthquakes, recorded in earthquake catalog. These seismic indicators are provided to computationally intelligent algorithms to generate earthquake predictions, which eventually lead to creation of EPS regarding forthcoming earthquakes. Moreover, the contemporary literature sites numerous studies, which have employed seismic indicators in collaboration with machine learning based methods, to predict earthquake occurrences, thus findings of such studies are equally significant for the development of an earthquake predictor system.

Earthquake prediction is a challenging topic [2] and endeavors have been made to predict earthquakes for over a century [3]. Earthquake prediction approaches can be categorized into three types [4]: a) mathematical and statistical methods [5], [6], b) precursor investigations [7], [8], [9], c) machine learning methodologies [10], [11], [12]. The recent encouraging results obtained in this field of research are the outcome of interdisciplinary interaction mainly, between seismology and Computational Intelligence (CI).

Machine learning based methodologies for earthquake prediction use seismic indicators in order to develop a correlation between the indicators and subsequent earthquakes. Thus, the study relies on the temporal seismic behavior of a region. The computation of seismic indicators is an effort to express the renowned principles of Gutenberg-Richter's law, seismic energy release, foreshock frequency and seismic rate changes, in numeric form. Machine learning based earthquake prediction methodologies can also be classified into two categories, depending upon the calculation approach of seismic indicators:

  • a)

    Computation of seismic indicators after a fixed duration of time [11], [12], [13].

  • b)

    Computation of seismic indicators after every earthquake, inclusive of the recent earthquake [10], [14], [15].

The former approach is designed to consider a fixed duration, such as 1 month, 2 weeks, so forth, for single prediction period. It does not address the issue, if multiple earthquakes strike the same region within the single prearranged prediction period. However, this issue of making multiple predictions for a single duration can be addressed through the latter approach. A seismic event occurrence may change the internal seismic state of the region. Therefore, fresh seismic indicators are essentially computed if an earthquake strikes the region during a prediction period. A new earthquake prediction is obtained based upon latest indicators, without waiting for the end of a prearranged prediction period. The latter approach is also advantageous in terms of number of feature instances to be used for developing prediction model. The greater is the number of feature instances, the better a model is trained. Since seismic indicators are computed for every recorded earthquake, therefore greater number of feature instances are available for training of model.

EPS is a field of science that has a potential for advancements. With the advent of computer based techniques, rapid progress has been observed in research and technology. CI and machine learning techniques have been used vastly for classification and, regression to obtain solution for many problems. For example, diagnosis through medical images [16], churn prediction through customer profiling [17], automatic surveillance in videos [18], differentiation between micro seismic events and quarry blasts [19], geological interpretation of structures [20] and so forth.

In this study a novel idea of ensemble classification where Genetic Programming (GP) is evolved using boosting (GP-Adaboost), has been applied for earthquake prediction (EP-GPBoost). Seismically active regions of Hindukush, Chile and Southern California are considered in this study for modelling earthquakes and seismic indicators through GP-AdaBoost. GP-AdaBoost is a unique ensemble classifier, where searching capabilities of GP and boosting of AdaBoost are combined to develop a strong classifier. The GP's evolution is supported through boosting, where multiple GP strings are evolved per class, which act as single class classifier.

In rest of the manuscript, Section 2 contains details of the related literature. Section 3 encompasses the employed methodology, including the computation of seismic indicators along with details of GP and AdaBoost based methodology. Results and discussions can be found in Section 4.

Section snippets

Related work

This section provides the overview of the research methodologies offering the use of varied seismic indicators and precursors along with various machine learning techniques. Seismic precursory analysis has been carried out to develop earthquake prediction model through detecting anomalous patterns in these signals. It is presumed that unusual variations in seismic precursors are observed during earthquake preparation process caused by tectonic movements beneath earth surface [21]. The some of

Data and methods

The regions considered for performing research on seismic indicators based EPS are Hindukush, Chile and Southern California. A large number of earthquakes have occurred in aforementioned regions which make them interesting for earthquake prediction. In seismic indicators based EPS, the required raw dataset is temporal sequence of past seismicity for the selected regions. The past seismicity is available in the form of a catalog, which is publicly available from the United States Geological

Results and discussion

In this research EPS is modelled as a binary classification task with aim to generate prediction for earthquakes of magnitude 5.0 and greater 15 days prior to an earthquake. The results of the proposed methodology are evaluated through parameters given below.

Conclusion

In this research, seismic indicators based EP-GPBoost has been proposed. A unique methodology is devised, which encompasses the maximum information of a region through the computation of available seismic indicators. These indicators are fed to a Genetic Programming (GP) and AdaBoost (GP-AdaBoost) based ensemble classification methodology. GP-AdaBoost is a unique combination of strong searching and boosting capabilities of GP and AdaBoost, respectively. The GP-AdaBoost based model has been

References (37)

  • G. Asencio-Cortés et al.

    A sensitivity study of seismicity indicators in supervised learning to improve earthquake prediction

    Knowl-Based Syst

    (2016)
  • A. Ikram et al.

    Developing an expert system based on association rules and predicate logic for earthquake prediction

    Knowl-Based Syst

    (2015)
  • A. Idris et al.

    Churn prediction in telecom using random forest and PSO based data balancing in combination with various feature selection strategies

    Comput Electr Eng

    (2012)
  • C.R. Allen

    Responsibilities in earthquake prediction: to the seismological Society of America, delivered in Edmonton, Alberta, May 12, 1976

    Bull Seismol Soc Am

    (1976)
  • R.J. Geller et al.

    Enhanced: earthquakes cannot be predicted

    Science

    (1997)
  • R.J. Geller

    Earthquake prediction: a critical review

    Geophys J Int

    (1997)
  • Q. Wang et al.

    Earthquake prediction based on spatio-temporal data mining: an LSTM network approach

    IEEE Trans Emerg Top Comput

    (2017)
  • A. Sil et al.

    Probabilistic models for forecasting earthquakes in the northeast region of India

    Bull Seismol Soc Am

    (2015)
  • Cited by (63)

    • EPT: A data-driven transformer model for earthquake prediction

      2023, Engineering Applications of Artificial Intelligence
    • Towards advancing the earthquake forecasting by machine learning of satellite data

      2021, Science of the Total Environment
      Citation Excerpt :

      Moreover, the applications of anomaly-evaluation applied to specific earthquakes often are with lack of understanding of the underlying physics, which may cause various or even contradictory conclusions for the same earthquake (Blackett et al., 2011a, 2011b). With the rapid development of artificial intelligence and machine learning, researchers have made progress in the domain of earth sciences (Sarkar and Mishra, 2018), especially in earthquake forecasting (Asencio-Cortés et al., 2018; Asim et al., 2018a, 2018b; Bergen et al., 2019; Hulbert et al., 2018; Lubbers et al., 2018; Rafiei and Adeli, 2017; Reyes et al., 2013; Rouet-Leduc et al., 2018). At the same time, machine learning are also very effective for spatial remote sensing data handling (Du et al., 2020).

    • Spatiotemporally explicit earthquake prediction using deep neural network

      2021, Soil Dynamics and Earthquake Engineering
      Citation Excerpt :

      Ultimately, after training and determining the optimal hyper-parameters for the four models based on SNN, SVM, DT, and DNN using the validation score, each trained model predicted the test data that had not been fed to the models during training and validation. In Equations (4)–(8), TP, TN, FP, and FN are defined based on the confusion table as follows [102]: TP (true positive): An earthquake occurred and predicted by the model.

    View all citing articles on Scopus
    View full text