1 Introduction

Strength parameters of rock are the most basic and important mechanical parameters for engineering geologists, geotechnical engineers and mining engineers. These parameters have great importance in rock engineering such as tunnel and dam design, rock blasting and drilling, mechanical rock excavation and slope stability (Ceryan et al. 2013a). There are two methods for assessing the strength properties of rocks. First is the direct method by conducting laboratory tests on crafted specimens; the other, known as the indirect method, uses the previously derived empirical equations or models from literatures (Baykasoglu et al. 2008; Madhubabu et al. 2016). The direct method by performing rock mechanical tests in the laboratory should follow the testing procedures standardized by the International Society for Rock Mechanics (ISRM) (Ulusay and Hudson 2007). Although the method is relatively simple, it is time-consuming and expensive; also, it requires well prepared and high-quality core specimens (Heidari et al. 2012), and this cannot always be extracted from weak, highly fractured, weathered and thinly bedded rocks (Ceryan et al. 2013a). For these reasons, indirect methods from empirical equations with parameters such as sonic velocity (Sharma and Singh 2008; Kahraman 2001; Moradian and Behnia 2009), Schmidt rebound number (Sachpazis 1990; Yagiz 2009), point load strength (Basu and Aydin 2006; Mishara and Basu 2012; Singh et al. 2012) have been proposed for predicting rock strengths (Fener et al. 2005). These indirect parameters are all easy to obtain because of low requirements for rock cores and simple testing equipment. And these tests can be performed at the engineering field. Compared to the traditional rock mechanical tests, the indirect prediction method can make the acquisition of rock strength parameters much easier, faster and more economical.

At present, there are two commonly used mathematical methods for performing prediction problems (Singh et al. 2016), which are statistics and soft computing technique. Using the statistical method, statistical regression models between rock strength indices and indirect parameters (e.g. point load strength, P-wave velocity, Schmidt rebound number) could be built with linear (Sachpazis 1990; Basu and Aydin 2006; Sharma and Singh 2008), power (Kahraman 2001; Yagiz 2009) and exponential (Moradian and Behnia 2009) functions. Except that simple prediction models with single one indirect parameter have been proposed (Fattahi 2016), some studies have dealt with models relating all determined indices simultaneously with rock strength parameters (i.e. multiple regression analysis) (Mishara and Basu 2013). As the complexity of rock types, these empirical equations have limitations, notably being site and type specific (Sarkar et al. 2010). The statistical regression method will not be applicable when many rock types exist in a rock data base for prediction, hence new technologies should be proposed for the estimation of rock strength parameters.

With the development of intelligent computing, methods such as neural networks (Dehghan et al. 2010; Singh and Verma 2012; Torabi et al. 2014; Mert 2014; Mohamad et al. 2015), genetic algorithm (Beiki et al. 2013), fuzzy inference (Yilmaz and Yuksek 2009; Rezaei et al. 2012), and support vector machine (Ceryan et al. 2013b) have been put forward and are widely used in rock mechanics and rock engineering with great superiority. Least squares support vector machine (LS-SVM) is a pattern recognition and regression analysis structure based on statistics theory and structure risk minimum criterion. It integrates square error variable into traditional support vector machine that substitutes least square linear system for quadratic programming to resolve function estimating problems, which shows good generalization performance and calculation speed in dealing with multivariate nonlinear problems by small samples (Tan et al. 2014; Xu et al. 2015).

The aim of this study is to build prediction models for estimation of rock strength properties. For this purpose, the relations between indirect parameters (rock density, point load strength, P-wave velocity and slake durability index) and rock strengths (uniaxial compressive and shear strength) were analyzed with rock physical and mechanical test results of four rock types. The prediction models were built based on LS-SVM, and statistical performances were tested.

2 Rock Physical and Mechanical Test Results

The physical and mechanical test results of four rock types, which were limestone, slate, quartzite and quartz mica schist, were used in this research with 10 data sets for each rock type. Rock samples were collected from different locations in Luhri region, Himachal Pradesh and rock parameters were determined according to the standard testing methods of ISRM (1981) (Sarkar et al. 2010). The relations between rock indirect parameters (rock density, P-wave velocity, point load strength and slake durability index) and rock strengths (UCS and SS) were shown in Figs. 1 and 2, respectively. With regard to a specified rock type, statistical regression models can be obtained after performing regression analysis to the data points, and the relations and correlation coefficients (R) were listed in Table 1.

Fig. 1
figure 1

Relations of rock indirect parameters with rock uniaxial compressive strength

Fig. 2
figure 2

Relations of rock indirect parameters with rock shear strength

Table 1 Model of rock strength and indirect parameter

Seen from the Table 1, it shows that the relations between indirect parameters and rock strengths all have a high correlation coefficient, except one which is less than 0.85. Although linear models with high correlation coefficients exist for each rock type to predict its rock UCS and SS based on the indirect parameters, these models are rock type specific and with little applicability when the rock type of a different investigation site is unknown and the strengths need to be predicted. A model which considers rock types as many as possible may have a probability to predict an unknown rock type, that is to say, a prediction model considering these four rock types simultaneously will be a probable prediction model in this research. While the scatter plots in Figs. 1 and 2 show that there is no such a statistical prediction model that could express the relations between the indirect parameters (rock density, P-wave velocity and slake durability index) and rock strengths for these four rock types, that is to say, just given one parameter, the rock strength parameters cannot be predicted. Only the point load strength of these rocks may have a linear relation with rock UCS and SS for the whole data points, which can be expressed as followed:

$$\sigma_{\text{c}} = 21.9483I_{{{\text{s}}(50)}} - 1.13531\quad R^{2} = 0.98174$$
(1)
$$\tau = 4.13715I_{s(50)} + 0.19087\quad R^{2} = 0.98623$$
(2)

3 Prediction Model of Rock Strength Using Least Squares Support Vector Machine

3.1 Foundation of LS-SVM Technique

LS-SVM is now widely used and can achieve acceptable results. The two problem types for which LS-SVM is used are regression and classification problems. Prediction belongs to the regression problem (Xu et al. 2015).

For regression problems, suppose T is a training set and n is a sample number. In this case:

$${\mathbf{T}} = \left\{ {(x_{1} ,y_{1} ),(x_{2} ,y_{2} ), \ldots ,(x_{n} ,y_{n} )} \right\}$$
(3)

where x i \(\in\) R n is the input vector and y i \(\in\) R is the output variable that corresponds to x i.

The optimization problem of LS-SVM can be described as follows:

$$\left\{ \begin{aligned} \mathop {\hbox{min} }\limits_{w.b,e} J(\varvec{w},{\mathbf{b}},{\mathbf{e}}) = \frac{1}{2}{\mathbf{w}}^{T} {\mathbf{w}} + \frac{\mu }{2}\sum\limits_{i = 1}^{n} {{\mathbf{e}}_{i}^{2} } \hfill \\ s.t.\quad {\mathbf{y}}_{i} = {\mathbf{w}}^{T} \varphi ({\mathbf{x}}_{i} ) + {\mathbf{b}} + {\mathbf{e}}_{i} \quad i = 1,2, \ldots ,n \hfill \\ \end{aligned} \right.$$
(4)

where w is the weight vector, μ is the regularization parameter (also called the penalty parameter), e i is the error variance, φ(·) denotes nonlinear mapping from the input space to high-dimensional feature space, and b is a partial vector.

The function Lagrange of the optimization problem (4) is as follows:

$$L(\varvec{w},\varvec{b},\varvec{e},\varvec{\alpha}) = J(\varvec{w},\varvec{b},\varvec{e}) - \sum\limits_{i = 1}^{n} {\varvec{\alpha}\left\{ {\varvec{w}\varphi (\varvec{x}) + \varvec{b} + \varvec{e} - \varvec{y}} \right\}}$$
(5)

where α i is the Lagrange multiplier and sample (α i ≠ 0) is the support vector.

The following equations are obtained based on the Karush–Kuhn–Tucker (KKT) condition, which is a necessary and sufficient condition for the optimal solution of the object function in a nonlinear optimization problem:

$$\frac{\partial L}{{\partial \varvec{w}}} = 0\quad \frac{\partial L}{{\partial \varvec{b}}} = 0\quad \frac{\partial L}{{\partial \varvec{e}_{i} }} = 0\quad \frac{\partial L}{{\partial\varvec{\alpha}_{i} }} = 0 \Rightarrow \left\{ \begin{aligned} \varvec{w} = \sum\limits_{i = 1}^{n} {\varvec{\alpha}_{i} \varphi (x_{i} )} \hfill \\ \sum\limits_{i = 1}^{n} {\varvec{\alpha}_{i} = 0} \hfill \\\varvec{\alpha}_{i} = \mu e_{i} \quad i = 1,2, \ldots ,n \hfill \\ \varvec{w}^{T} \varphi (x_{i} ) + \varvec{b} + \varvec{e}_{i} - \varvec{y}_{i} = 0\quad i = 1,2, \ldots ,n \hfill \\ \end{aligned} \right.$$
(6)

where α and b are obtained by solving the right part of Eq. (6). The output value y(x) of the new input vector x can be calculated through the following formula:

$$\varvec{y}(\varvec{x}) = \sum\limits_{i = 1}^{n} {\varvec{\alpha}_{i} K(\varvec{x},\varvec{x}_{i} ) + \varvec{b}}$$
(7)

where K(x,x i) = φ(x)T φ(x i) is called the kernel function. The radial basis function (RBF) is one of the most popular kernel functions for SVM. The RBF can be described in the following way:

$$K({\mathbf{x}},{\mathbf{x}}_{i} ) = e^{{ - \frac{{\left\| {{\mathbf{x}}_{i} - {\mathbf{x}}} \right\|^{2} }}{{2\sigma^{2} }}}}$$
(8)

where σ 2 is the squared bandwidth, which is optimized through an external optimization technique during the training process (Cao et al. 2008; Xu et al. 2015).

3.2 Establishment of Prediction Model and Results Analysis

Based on the above fundamental theory of LS-SVM, an input matrix including rock density, point load strength, P-wave velocity and slake durability index and two output matrixes including rock UCS and SS were separately built in the MATLAB environment. The radial basis function (RBF) was used as the kernel functions for LS-SVM. 80% (32 datasets) of the total samples were used for training and the other 20% (8 datasets) samples were used for testing. The training and testing data used in LS-SVM were listed in Tables 2 and 3. The raw data of rock UCS show that the values range from 20.32 to 112.25 MPa, which means that the prediction models built with LS-SVM are suitable for rocks with these similar strengths. The prediction results based on LS-SVM were shown in Figs. 3 and 4, and the determination coefficients (R 2) for training and testing results were calculated, respectively. The results show that with regard to rock UCS prediction, the R 2 values for training and testing results are 0.9997 and 0.9995, respectively. And with regard to rock SS prediction, the R 2 values for training and testing results are 0.9990 and 0.9964, respectively. It can be concluded that the accuracy of rock strength prediction models based on LS-SVM is extremely high and the models could be accepted for prediction.

Table 2 Training data used in LS-SVM
Table 3 Testing data used in LS-SVM
Fig. 3
figure 3

Comparison of Rock measured and predicted UCS

Fig. 4
figure 4

Comparison of Rock measured and predicted SS

3.3 Performance Validation of LS-SVM Prediction Models

In order to check the validation of prediction models based LS-SVM, the relations of predicted values versus measured values were plotted in Figs. 5 and 6. The error in the predicted value is represented by the distance that each data point plots from the 1:1 diagonal line (Kahraman et al. 2016). It can be seen that the predicted values for both models are almost lying on the diagonal line.

Fig. 5
figure 5

Relation between predicted and measured UCS values

Fig. 6
figure 6

Relation between predicted and measured SS values

To verify the performance of the models, four statistical criteria viz. squared correlation coefficient (R 2), variance account for (VAF), root mean squared error (RMSE), and mean absolute percentage error (MAPE) were chosen to be the measure of accuracy. Let y i be the actual value and \(\hat{y}_{i}\) be the predicted value of the ith observation, \(\bar{y}\) be the mean value of the observations and N be the number of observations, then R 2, VAF, RMSE and MAPE could be defined below (Fattahi 2016), and the results were shown in Table 4. The performance indices show that the rock strength models based on LS-SVM are excellent because the values are all extremely close to the best condition at which the R 2 is 1, VAF is 100, RMSE and MAPE is 0 (Armaghani et al. 2016a, b). The prediction models built based on LS-SVM are applicable for the prediction of strength parameters of rock.

$$R^{2} = 1 - \frac{{\sum\nolimits_{i = 1}^{N} {(y_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{i} )^{2} } }}{{\sum\nolimits_{i = 1}^{N} {\left( {y_{i} - \bar{y}} \right)^{2} } }}$$
(9)
$$VAF = \left( {1 - \frac{{\text{var} (y_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{i} )^{2} }}{{\text{var} (y_{i} )}}} \right) \times 100$$
(10)
$$RMSE = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {\left( {y_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{i} } \right)}^{2} }$$
(11)
$$MAPE = \frac{1}{N}\sum\limits_{i = 1}^{N} {\left| {\frac{{y_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{i} }}{{y_{i} }}} \right|} \times 100$$
(12)
Table 4 Validation results of rock strength prediction models

4 Discussion

ANN prediction models of rock strengths have been developed by Sarkar et al. (2010) based on these raw data. The results of ANN models show that the predicted and determined values indicate a very good correlation (R = 0.99) and confirm the applicability of ANN for estimation of rock strength parameters (Sarkar et al. 2010; Singh and Verma 2012; Mert 2014; Mohamad et al. 2015). By comparing the models built using LS-SVM and ANN, it shows that these two soft computing techniques can both used for prediction and have an equivalent prediction performances. All these support the applicability of soft computing techniques for the estimation of strength parameters of rock.

5 Conclusion

Aimed at estimation of strength parameters of rock, a soft computing technique, which is least squares support vector machine (LS-SVM), is used for the establishment of rock strength prediction models. The prediction results based on LS-SVM prediction models of rock UCS and SS are both extremely close to the measured values, and the squared correlation coefficients (R 2) of rock UCS and SS prediction models are 0.9997 and 0.9985, respectively, which indicates the prediction models based on LS-SVM could be accepted for prediction. By introducing the prediction results based on another soft computing technique (i.e. ANN), a conclusion that the applicability of soft computing techniques for estimation of strength parameters of rock is further supported.