1 Introduction

The determination of seismic attenuation (s) (dB/cm) is an imperative task in earthquake science. s depends on different rock properties, for example, porosity (n) (%), permeability (k) (milidarcy), grain size (d) (μm), and clay content (c) (%). Researchers gave different attenuation mechanisms (Biot 1956a, b; Walsh 1966, 1969; Stoll and Bryan 1970; Solomon 1973; Kuster and Toksoz 1974; Mavko and Nur 1975, 1979). Klimentos and Mccann (1990) used regression model for prediction of s. Brzostowski and McMechan (1992), and Leggett et al. (1992) developed the model based on the change in seismic amplitude as observed data for prediction of s. Researchers also used the rise time associated with the broadening effect for determination of s (Kjartansson 1979; Zucca et al. 1994). Boadu (1997) successfully used artificial neural network (ANN) for prediction of s. However, ANN has various limitations such as black-box approach, low generalization capability, arriving at local minima, overtraining, etc. (Park and Rilett 1999; Kecman 2001). Quan and Harris (1997) successfully adopted frequency shift method to predict s. Roth et al. (2000) developed an empirical equation for prediction of s.

This article employs genetic programming (GP) and minimax probability machine regression (MPMR) for prediction of s. GP is developed based on genetic algorithm (GA) (Koza 1992). It has been successfully applied for modeling different problems in engineering (Rezapour et al. 2010; Guven and Kisi 2011; Azamathulla and Zahiri 2012). MPMR is developed based on Minimax Probability Machine (MPM) (Lanckriet et al. 2002, 2003). Many applications of MPMR are available in literatures (Yang et al. 2010; Zhou et al. 2011, 2013). GP and MPMR have been developed based on the database collected from the work of Boadu (1997). The developed GP and MPMR have been compared with the ANN model. The paper is organized as follows. The details of GP are given in Sect. 2. Section 3 describes the details of MPMR. Section 4 gives the details of results and discussion. Major conclusions have been drawn in Sect. 5.

2 Details of GP

GP predicts output based on the dataset. In GP, a random population of equation is created. In next step, fitness of each equation is determined. “Parents” are selected out of these individuals through tournament. “Offsprings” are created from “Parents” through the process of reproduction, mutation, and crossover. The details of reproduction, mutation, and crossover are given by Koza (1992). The best equation that produced any generation is the solution of the problem. n, k, d, and c are considered as inputs of the GP. The output of GP is s. Training and testing dataset has been used to develop the GP model. Training dataset is used to construct the GP model. Testing dataset is adopted to verify the constructed GP model. Table 1 shows the statistical parameters of the dataset. This study uses 34 (see Table 2) datasets as training dataset. The remaining eight (see Table 3) datasets have been used as testing dataset. The program of GP has been implemented by MATLAB.

Table 1 Statistical parameters of the dataset
Table 2 Training dataset
Table 3 Testing dataset

3 Details of MPMR

MPMR is developed by constructing a dichotomy classifier (Strohmann and Grudic 2002). MPMR adopts the following regression model for prediction of output (y).

y = i = 1 N β i K x i , x + b ,
(1)

where y is output, x is input, K(xi, x) is kernel function, and β, b are outputs of the MPMR algorithm. This study uses c, n, k, and d as inputs. The output of MPMR is s. So,

x = c , n , k , d and y = s .

MPMR creates two classes (ui and vi) of data. The expression of ui and vi is given below.

u i = y i + ε , x i 1 , x i 2 , , x i d ,
v i = y i - ε , x i 1 , x i 2 , , x i d ,
(2)

where i = 1,…,N.

The classification boundary between ui and vi is a regression surface. MPMR uses the same training and testing dataset as used by the GP. Radial basis function ( K x i , x = exp - x i - x x i - x T 2 σ 2 where σ is width of radial basis function) has been used as kernel function. The data is normalized between 0 and 1. The program of MPMR has been constructed by using MATLAB.

4 Results and discussion

For GP, the number of population is set to 800. The number of generation is kept to 100. The mutation frequency and crossover frequency is kept to 70 and 50, respectively. Figures 1 and 2 illustrates the performance of training and testing, respectively. The performance of GP has been assessed in terms of coefficient of correlation (R) value. For a good model, the value of R is close to one. The value of R has been shown in Figs. 1 and 2. The developed GP gives the following expression for determination of s.

Fig. 1
figure 1

Performance of training dataset

Fig. 2
figure 2

Performance of testing dataset

s = 5.89 c - 10.61 n + 0.23 exp ( n - d ) + 4.26 exp ( n - c ) + 48.19 exp ( exp ( n ) ) - 61.18 exp ( n ) 2 - 10.4 ( n ) 4 - 39.24 exp ( n ) 2 2 + 61.81 c 2 - 34.52 .
(3)

Figures 1 and 2 shows that the value of R is close to one. So, the developed GP predicts s reasonably well.

For MPMR, the design values of σ and ε have been determined by trial and error approach. The developed MPMR gives the best performance at σ = 0.5 and ε = 0.003. So, the design values of σ and ε is 0.5 and 0.003, respectively. The performance of training and testing has been shown in Figs. 1 and 2. The value of R is close to one for training as well as testing dataset. Therefore, the developed MPMR proves his capability for prediction.

The developed GP and MPMR have been compared with the ANN model developed by Boadu (1997). Comparison has been done in terms of standard error. Figure 3 shows the bar chart of standard error of the GP, MPMR, and ANN models. The developed GP and MPMR outperform the ANN model. The performance of GP and MPMR is almost same. The developed MPMR has control over future prediction. However, ANN and GP have no control over future prediction.

Fig. 3
figure 3

Bar chart of standard error for ANN, GP, and MPMR models

5 Conclusions

This article describes GP and MPMR for prediction of seismic attenuation (s). The developed GP and MPMR uses n, c, k, and d as inputs. Training and testing datasets have been used to develop the GP and MPMR models. The predictive capability of GP and MPMR has been assessed in terms of R value. The performance of GP and MPMR is encouraging. The developed GP gives an equation for prediction of s. The performance of GP and MPMR is better than the ANN model. The results confirm that the developed GP and MPMR show excellent predictive accuracy.