1 Introduction

The determination of friction capacity (fs) of driven pile in clay is a challenging task for geotechnical engineers. The different methods are available for determination of fs of driven pile in clay (Chandler 1968; Tomlinson 1971; McClelland 1972; Burland 1973; Meyerhof 1976; Parry and Swain 1977a, b). The available methods are not reliable (Randolph et al. 1979). Randolph et al. (1979) has nicely explained the mechanism of driven pile in clay. Goh (1995) successfully adopted Artificial Neural Network (ANN) for determination of fs of driven pile in clay. However, ANN has some limitations such as black box approach, arriving at local minima, low generalization capability, absence of probabilistic output, etc. (Park and Rilett 1999; Kecman 2001).

The article uses Gaussian Process Regression (GPR) and Minimac Probability Machine Regression (MPMR) for determination of fs of driven pile in clay. GPR is a probabilistic non-parametric modeling approach. It estimates distributions over functions from training data. Researchers have successfully used GPR for solving different problems in engineering (Pal and Deswal 2010; Xia and Tang 2011; Ni et al. 2012). MPMR is developed based on Minimax Probability Machine Classification (MPMC) by constructing a dichotomy classifier (Strohmann and Grudic 2002). MPMR has been successfully used in the various filed of engineering (Yang et al. 2010; Zhou and Xia 2011; Zhou et al. 2013). This study uses the database collected by Goh (1995) (see Table 1). The dataset contains information about pile length (L), pile diameter (D), effective vertical stress (s’v), undrained shear strength (Su) and fs. The developed GPR and MPMR have been compared with the ANN (Goh 1995). This article is organized as follows. The details of GPR for prediction of fs of driven piles in clay are described in Sect. 2. Section 3 describes MPMR model. The results and discussion are provided in Sect. 4. In Sect. 5, the major conclusions are drawn.

Table 1 Dataset used in this study

2 Details of GPR

In GPR, the problem is to determine a function \(y = f\left( x \right)\) from the following dataset

$$D = \left\{ {x_{i} ,y_{i} } \right\}_{i = 1}^{N} x \in R^{d} ,\quad y \in R$$
(1)

where x is input, y is output, Rd is d-dimensional vector space and R is one dimensional vector space. In this study, L, D, σ’v, and Su are used as input variable. fs is output of the GPR. So, \(x = \left[ {L,D,\sigma^{\prime}_{v} ,S_{u} } \right]\) and \(y = \left[ {f_{s} } \right]\).

The joint distribution of y is given by

$$p\left( y \right) = N\left( {0,K\left( {x,x} \right) + \sigma^{2} I} \right)$$
(2)

where \(K\left( {x_{i} ,x} \right)\) is kernel function and I is identity matrix.

For a new input xN+1, the distribution of yN+1 is Gaussian with mean and variance:

$$\mu = K\left( {x_{N + 1} ,x} \right)\left( {K\left( {x,x} \right) + \sigma^{2} I} \right)^{ - 1} y$$
(3)
$$\sigma = k\left( {x_{N + 1} ,x} \right) - \sigma^{2} I - K\left( {x_{N + 1} ,x} \right)\left( {K\left( {x,x} \right) + \sigma^{2} I} \right)^{ - 1} K\left( {x,x_{N + 1} } \right)$$
(4)

The optimum value of hyperparameters of the GPR for a particular data set can be derived by maximizing the log marginal likelihood using common optimization procedures.

In carrying out the formulation, the data has been divided into two sub-sets; such as:

  1. (a)

    A training dataset: This is required to construct the model. In this study, 45 out of the 65 cases of pile load test are considered for training dataset.

  2. (b)

    A testing dataset: This is required to estimate the model performance. In this study, the remaining 20 data is considered as testing dataset.

The data is normalized between 0 and 1. The normalization has been done by using the following equation.

$$d_{normalized} = \frac{{\left( {d - d_{\hbox{min} } } \right)}}{{\left( {d_{\hbox{max} } - d_{\hbox{min} } } \right)}}$$
(5)

where d = any data (input or output), dmin = minimum value of the entire dataset, dmax = maximum value of the entire dataset, and dnormalized = normalized value of the data.

Radial basis function (\(K\left( {x_{i} ,x} \right) = \exp \left\{ { - \frac{{\left( {x_{i} - x} \right)\left( {x_{i} - x} \right)^{T} }}{{2s^{2} }}} \right\}\), s is width of radial basis function) has been used as a kernel function. The program of GPR has been developed by using MATLAB.

3 Details of MPMR

This section will describe briefly MPMR for prediction of fs of driven piles in clay. MPMR has the following form.

$$y = \sum\limits_{i = 1}^{N} {\beta_{i} K\left( {x_{i} ,x} \right) + b}$$
(6)

where K(xi,x) is kernel function, x is input, y is output, N is number of data, βi and b are output of the MPMR algorithm.

In this study, \(x = \left[ {L,D,\sigma^{\prime}_{v} ,S_{u} } \right]\) and \(y = \left[ {f_{s} } \right]\).

One dataset is obtained by shifting all of the regression data + ε along the output variable axis. The other dataset is obtained by shifting all of the regression data − ε along the output variable axis. A regression surface is the classification boundary between these two classes. More details about MPMR are given by Strohmann and Grudic (2002). MPMR uses the same training dataset, testing dataset, and normalization technique as used by GPR. Radial basis function has been used as kernel function. MATLAB has been used to develop the MPMR

4 Results and Discussion

The performance of GPR model depends on the value of Gaussian Noise (ε) and s. The design values of ε and s have been determined by trial and error approach. The developed GPR gives best performance at ε = 0.01 and s = 0.3. Figures 1 and 2 shows the performance of training and testing dataset respectively. The performance of GPR has been assessed in terms of Coefficient of Correlation (R) and variance account for (VAF). The value of R is calculated from the following equation.

Fig. 1
figure 1

Performance of training dataset

Fig. 2
figure 2

Performance of teasing dataset

$$R = \frac{{\sum\nolimits_{i = 1}^{n} {\left( {f_{sai} - \overline{f}_{sa} } \right)\left( {f_{spi} - \overline{f}_{sp} } \right)} }}{{\sqrt {\sum\nolimits_{i = 1}^{n} {\left( {f_{sai} - \overline{f}_{sa} } \right)} } \sqrt {\sum\nolimits_{i = 1}^{n} {\left( {f_{spi} - \overline{f}_{sp} } \right)} } }}$$
(7)

where fsai and fspi are the actual and predicted fs values, respectively, \(\overline{f}_{sa}\) and \(\overline{f}_{sp}\) are mean of actual and predicted fs values corresponding to n patterns. For a good model, the value of R should be close to one.

The following equation has been used to determine VAF

$$VAF = \left[ {1 - \frac{{\text{var} \left( {f_{sa} - f_{sp} } \right)}}{{\text{var} (f_{sa} )}}} \right] \times 100$$
(8)

For a good model, the value of VAF should be close to 100 (Erzin and Cetin 2013). It is observed from Figs. 1 and 2 that the value of R and VAF is close to 1 and 100 respectively. So, the developed GPR predicts fs reasonable well. The variance is also obtained from the developed GPR. Figures 3 and 4 illustrates the variance of training and testing dataset respectively. These figures can be used to determine uncertainty. It can be also used to determine probability of failure.

Fig. 3
figure 3

Variance of training dataset

Fig. 4
figure 4

Variance of testing dataset

The design values of σ and ε have been determined by trial and error approach for developing the MPMR model. The developed MPMR gives best performance at s = 0.8 and ε = 0.007. Figures 1 and 2 shows the performance of training and testing dataset respectively. The value of R and VAF is close to one and 100 respectively for training as well as testing datasets. Therefore, the developed MPMR proves his capability for prediction of fc of driven piles in clay. The developed GPR and MPMR have been compared with the ANN model developed by Goh (1995). The comparison has been carried out in terms of Root Mean Square Error (RMSE) and Mean Absolute Error (MAE). The values of RMSE and MAE have been determined by using the following relation.

$$RMSE = \sqrt {\frac{{\sum\nolimits_{i = 1}^{n} {\left( {f_{sai} - f_{spi} } \right)^{2} } }}{n}}$$
(9)
$$MAE = \frac{{\sum\nolimits_{i = 1}^{n} {\left| {f_{sai} - f_{spi} } \right|} }}{n}$$
(10)

where fsa and fsp are actual and predicted fs values respectively and n is the number of data. Figure 5 shows the bar chart of RMSE and MAE values of the ANN, GPR and MPMR models. It is observed from Fig. 5 that the developed GPR and MPMR outperform the ANN model. The performance of GPR and MPMR is almost same. The developed GPR produces variance of the predicted fs. However, the ANN and MPMR models do not give variance of the predicted fs. GPR assumes data distribution for developing the model. However, ANN and MPMR do not assume any data distribution.

Fig. 5
figure 5

Bar chart of RMSE and MAE values of the GPR, ANN and MPMR models

5 Conclusion

This article describes GPR and MPMR for prediction of fs of driven piles in clay. 65 datasets have used to develop the GPR and MPMR models. The data division and normalization technique have been described in the manuscript. The developed GPR and MPMR give reasonable performance. The predicted variance is useful for determination risk. The performance of GPR and MPMR is better than the ANN model. This study gives excellent tools based on the developed GPR and MPMR for prediction of fs of driven piles in clay.