Introduction

Natural rock masses consist of intact rock blocks separated by discontinuities. Therefore, the mechanical properties of intact rock, the geotechnical properties of discontinuities, and the rock mass structure are the most important parameters affecting the engineering deformation properties of rock masses. Weathering is also known to significantly affect these engineering properties of rocks (Ceryan 2015). One of the most important intact rock properties affecting the engineering behavior of a rock mass is the elastic modulus (\({E}_{\text{s}}\)), alongside the uniaxial compressive strength (UCS) and Poisson's ratio (\(\nu\)). Therefore, the \({E}_{\text{s}}\) of intact rock is generally used as an input both in numerical models and in empirical relationships to evaluate the engineering behavior of rock masses (Kayabasi et al. 2003; Sonmez et al. 2006; Hoek and Diederichs 2006; Bidgoli et al. 2013; Zhang 2017; Saedi et al. 2019; Alemdag et al. 2016).

UCS tests, standardized by the International Society for Rock Mechanics (ISRM), are utilized to measure the \({E}_{\text{s}}\) and UCS of rock materials directly. The tests require well-prepared rock specimens that cannot always be extracted from weak, thinly bedded, stratified, highly fractured, highly weathered, high-porosity, clay-containing, coarse-grained, and block-in-matrix rocks. Furthermore, the tests are expensive, complicated, time-consuming, and require sophisticated instruments (Gokceoglu and Zorlu 2004; Xia et al. 2014; Ko et al. 2016), making it difficult to evaluate \({E}_{\text{s}}\) using standard laboratory tests. To overcome these difficulties in directly measuring \({E}_{\text{s}}\), regression models have been developed (Nefeslioglu 2013; Ozkat et al. 2017a,b; Aboutaleb et al. 2018; Mashayekhi et al. 2020). The independent variables (i.e., the inputs) used in these models are commonly the physical properties of rock samples that can be measured easily and cheaply. These simple and non-destructive index properties include the elastic wave velocity (Kurtuluş et al. 2010; Brotons et al. 2016), porosity, effective porosity, void ratio, unit weight, density, and water saturation (Tugrul 2004; Yilmaz and Yuksek 2008; Erguler and Ulusay 2009; Marques et al. 2010; Wang et al. 2014; Kim et al. 2017). Other inputs derived from the index tests are the slake durability index (Yagiz et al. 2012; Ceryan 2014; Ghasemi et al. 2018), weathering indices (Ceryan 2015, 2016), the texture coefficient, and the mineral content, particularly the quartz, plagioclase, and clay content (Shakoor and Bonelli 1991; Singh and Verma 2012; Pan et al. 2013; Diamantis et al. 2014; Heap et al. 2014; Undul and Florian 2015; Ajalloeian et al. 2017).
Simple mechanical test results, including the Schmidt hammer rebound hardness, the point load index, tensile strength, the block punch index, and the cylindrical punch index, are also employed in regression models (Dinçer et al. 2004; Karakus et al. 2005; Yilmaz and Yuksek 2009; Khandelwal and Singh 2011; Singh and Verma 2012; Singh et al. 2012; Alikarami et al. 2013; Armaghani et al. 2015; Saedi et al. 2018; Mahdiabadi and Khanlari 2019).

Conventional regression methods, such as simple linear regression and multiple linear regression, are widely used in the literature to estimate \({E}_{\text{s}}\). In general, the equations obtained with conventional regression methods are recommended only for specific rock types (Fener et al. 2005; Beiki et al. 2013). If new data differ substantially from the original data, the form of the obtained equations needs to be updated. Moreover, in some cases, the prediction results are inadequate (Sonmez et al. 2006; Yilmaz and Yuksek 2009; Beiki et al. 2013; Rezaei et al. 2014). Considering these difficulties in the prediction of \({E}_{\text{s}}\) using conventional regression methods, many researchers have employed soft computing methods (Table 1).

Table 1 Soft computing models proposed to estimate the modulus of elasticity of rock materials

The main purpose of this study is to examine the applicability and capability of Support Vector Regression (SVR), Gaussian Process Regression (GPR), and Artificial Neural Network (ANN) models in the \({E}_{\text{s}}\) prediction of magmatic rocks with different degrees of weathering. The inputs, porosity (n), P-wave velocity (\({V}_{p}\)), and slake durability index (\({I}_{d}\)), used in the proposed models were determined by the best subset regression method, and the performances of these models were evaluated with the determination coefficient (R2), Root Mean Square Error (RMSE), normalized RMSE (NRMSE), Mean Absolute Error (MAE), normalized MAE (NMAE), Mean Squared Error (MSE), Nash–Sutcliffe coefficient (NS), Variance Account Factor (VAF), Performance Index (PI), Regression Error Characteristic (REC) curves, and Taylor diagrams.

The remainder of the paper is organized as follows: “Materials and experimental details” describes the materials and experimental procedures; “Problem formulation” defines the inputs and output and explains the proposed regression models. The results and discussions are introduced in “Results” and “Discussion”, respectively. The conclusions are presented in “Conclusion”.

Materials and experimental details

The sample blocks in the present study were gathered from three different formations/lithodemes exposed in the Eastern Pontides, NE Turkey. They mainly consisted of (1) Late Cretaceous volcanic rocks, (2) Eocene volcano-sedimentary rocks, and (3) granitic rocks (Fig. 1).

Fig. 1
figure 1

Geology map of Eastern Pontides, NE Turkey, and location of the study area (Guven 1993; Hippolyte et al. 2017)

The tests for Schmidt hardness, specific gravity, slake durability, P-wave velocity, and UCS were conducted according to standards defined by ISRM (2007). The core samples used in these tests were extracted from block samples that had different degrees of weathering (Fig. 2). The index properties and modulus of elasticity of the samples used in this study are presented in Table 2.

Fig. 2
figure 2

Core samples investigated

Table 2 Lithology, index properties, and the modulus of elasticity (Es) of the samples

The N-type rebound hammer was used in the Schmidt hammer test to determine the hardness value (SHH). The \({\text{SHH}} \times \gamma\) term (Deere and Miller 1966; Aufmuth 1974) was obtained by multiplying the measured hardness values (SHH) by the unit weight (\(\gamma\)). The grain density \(\left({\rho }_{\text{s}}\right)\) was determined by employing the specific gravity test, and the dry density \(\left({\rho }_{d}\right)\) was obtained experimentally. Subsequently, the porosity value (\(n\)) was calculated using Eq. (1).

$$n = 1 - \frac{{\rho _{d} }}{{\rho _{{\text{s}}} }}$$
(1)
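Equation (1) translates directly into code; a minimal sketch (Python used here for illustration; the density values are made-up examples):

```python
def porosity(rho_dry, rho_grain):
    """Porosity from Eq. (1): n = 1 - rho_d / rho_s (densities in the same units)."""
    return 1.0 - rho_dry / rho_grain

# hypothetical sample: dry density 2.40 g/cm^3, grain density 2.65 g/cm^3
n = porosity(2.40, 2.65)  # about 0.094, i.e. roughly 9.4% porosity
```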

The samples were subjected to a four-cycle slake durability test to determine the slake durability index (Gokceoglu et al. 2000). This test was performed three times for each sample. The ultrasonic pulse velocity (UPV) test was conducted using the Portable Ultrasonic Non-destructive Digital Indicating Tester (PUNDIT®) without applying any pressure to the samples. The equipment generates an ultrasonic pulse with a frequency of 400 kHz and measures the transit time from the transmitter transducer through the sample to the receiving transducer. The transit time of the ultrasonic pulses was read with an accuracy of 0.1 μs. The P-wave velocity of the rock samples without pores and fissures (\({V}_{\text{m}}\)) was calculated using Eq. (2) (Barton 2007), and the P-durability index (\(V_{{{\text{id}}}}\)) is defined in Eq. (3) (Ceryan 2016):

$$\frac{1}{{V_{p} }} = \frac{\phi }{{V_{{{\text{fl}}}} }} + \frac{{1 - \phi }}{{V_{{\text{m}}} }}$$
(2)
$$V_{{{\text{id}}}} = 0.01\;V_{{\text{m}}} I_{d}$$
(3)

where \(V_{{{\text{fl}}}}\) is the velocity in the fluid, \(\phi\) is the ratio of the path length in the fluid to the total path length (the porosity), and \({I}_{d}\) is the slake durability index.
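Rearranging Eq. (2) for \(V_{\text{m}}\) and applying Eq. (3) can be sketched as follows (a Python illustration; the fluid velocity of 1500 m/s assumes water, and the sample values are hypothetical):

```python
def v_matrix(vp, phi, v_fluid=1500.0):
    """Solve the time-average relation, Eq. (2), for the matrix velocity Vm:
    1/Vp = phi/Vfl + (1 - phi)/Vm  =>  Vm = (1 - phi) / (1/Vp - phi/Vfl)."""
    return (1.0 - phi) / (1.0 / vp - phi / v_fluid)

def v_id(vm, i_d):
    """P-durability index, Eq. (3): Vid = 0.01 * Vm * Id, with Id in percent."""
    return 0.01 * vm * i_d

vm = v_matrix(vp=4500.0, phi=0.05)  # hypothetical P-wave velocity (m/s) and porosity
vid = v_id(vm, i_d=98.0)            # hypothetical slake durability index (%)
```

As expected, \(V_{\text{m}}\) exceeds the measured \(V_{p}\), since the fluid-filled pores slow the pulse.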

To determine \({E}_{\text{s}}\), representing the sample stiffness under uniaxial loading, the stress–strain curve for axial deformation was first obtained from the UCS test. Then, \({E}_{\text{s}}\) was defined as the slope of the line tangent to the stress–strain curve at a fixed percentage, 50%, of the ultimate strength for the test sample (Table 2).
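Numerically, this tangent-modulus definition amounts to evaluating the derivative of the stress–strain curve at the point closest to half the peak stress. A minimal sketch (Python; the linear synthetic curve in the check is only a demonstration case):

```python
import numpy as np

def tangent_modulus_50(strain, stress):
    """Tangent Es at 50% of the ultimate strength: slope of the
    stress-strain curve at the point closest to 0.5 * max(stress)."""
    strain = np.asarray(strain, dtype=float)
    stress = np.asarray(stress, dtype=float)
    slope = np.gradient(stress, strain)                 # pointwise d(stress)/d(strain)
    i = np.argmin(np.abs(stress - 0.5 * stress.max()))  # sample nearest 50% of peak
    return slope[i]
```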

Problem formulation

Methodology

The methodology for predicting the elastic modulus (\({E}_{\text{s}}\)) is illustrated in Fig. 3. The flow chart consists of three main steps: (1) selection of inputs, (2) model development, and (3) model performance evaluation. In Step 1, the best subset regression approach was employed to determine the inputs from the experimental study. The dataset was divided into two parts: the training set (75% of the original data) and the test set (25% of the original data) for the regression models. In Step 2, the proposed regression models, namely SVR, GPR, and ANN, were developed using the training dataset. In Step 3, the developed regression models were validated using the test dataset. The statistical key performance indicators (KPIs), namely RMSE, NRMSE, R2, MAE, NMAE, NS, VAF, the area over the REC curve (AOC), the Taylor diagram, the performance index with RMSE (\({\text{PI}}_{\text{RMSE}}\)), and the performance index with MAE (\({\text{PI}}_{\text{MAE}}\)), were applied in this step to evaluate the performances of the models.

Fig. 3
figure 3

The methodology of the present study for predicting the elastic modulus

Selection of inputs

The accuracy of a regression model largely depends on the type of function used, as well as the quality and quantity of the observed data. When more candidate inputs are available than necessary, a suitable subset must be selected to reduce the noise introduced by unnecessary input data. With suitable inputs, the interpretability of the model improves and its predictive ability increases (Omoruyi et al. 2019).

The input selection methods for regression models fall into three main categories: (1) filter methods (for example, Pearson's correlation, the Akaike information criterion, the Bayesian information criterion, linear discriminant analysis, principal component analysis, analysis of variance, and chi-square), (2) embedded methods (for example, the least absolute shrinkage and selection operator (LASSO) and ridge regression), and (3) wrapper methods (for example, forward selection, backward elimination, recursive feature elimination, the stepwise method, and best subset regression) (Huang et al. 2010; Desboulets 2018; Haque et al. 2018; Park and Klabjan 2020).

Here, the best subset regression approach was employed to determine the inputs. First, all possible regression models derived from all possible combinations of the inputs were defined. Then, the best model was determined according to KPIs, including Mean Square Error (MSE), RMSE, R2, adjusted determination coefficient (adjR2), and Mallow’s \({C}_{p}\) (Mallows and Sloane 1973), given in Eqs. (4)–(8).

$${\text{MSE}} = \frac{1}{{n - p}}\mathop \sum \limits_{{i = 1}}^{n} \left( {y_{i} - \hat{y}_{i} } \right)^{2}$$
(4)
$${\text{RMSE}} = \sqrt {{\text{MSE}}}$$
(5)
$$R^{2} = 1 - \frac{{{\text{MSE}}_{i} }}{{\sigma ^{2} }}$$
(6)
$${\text{Adj}}R^{2} = 1 - \frac{{n - 1}}{{n - i - 1}}\left( {1 - R^{2} } \right)$$
(7)
$$C_{p} = \left( {n - p} \right)\frac{{{\text{MSE}}_{i} }}{{{\text{MSE}}_{F} }} - \left( {n - 2i - 1} \right)$$
(8)

where \({y}_{i}\) and \({\widehat{y}}_{i}\) are the measured and predicted values, respectively. \({\text{MSE}}_{i}\) is the mean of residual squares in the model with \(i\) parameters, \({\text{MSE}}_{F}\) is the mean of residual squares in the full model with \(p\) parameters, \({\sigma }^{2}\) is the variance of the dependent variable, \(n\) is the number of data, \(i\) is the number of inputs in the model, and \(p\) is the number of parameters in the aforementioned model.

R2 increases with an increasing number of inputs and therefore does not, by itself, indicate the correct regression model. Instead, adjR2 is generally considered a more accurate goodness-of-fit measure than R2 because it penalizes the model for each additional input. The RMSE is frequently used as a measure of the differences between the predicted and measured values; the performance of the model improves as the RMSE decreases. The goal of Mallow's \({C}_{p}\) is to achieve a balanced number of inputs in the model. It compares the precision and bias of the full model, generated using all inputs, to those of models generated using subsets of inputs. The full model always yields a Mallow's \({C}_{p}\) value equal to the number of parameters in the regression model (\(p\)), so the full model should not be selected on the basis of \({C}_{p}\) alone. Furthermore, a Mallow's \({C}_{p}\) value below \(p\) indicates sampling error. If several models have a Mallow's \({C}_{p}\) near \(p\), the model with the smallest difference between Mallow's \({C}_{p}\) and \(p\) can be chosen as the best model.

The index properties measured experimentally in this study are: (1) Schmidt hammer hardness (SHH), (2) the \({\text{SHH}} \times \gamma\) index, (3) slake durability index (\({I}_{d}\)), (4) porosity (\(n\)), (5) P-wave velocity (\({V}_{p}\)), (6) P-velocity in the solid part of the samples (\({V}_{\text{m}}\)), and (7) P-durability index (\(V_{{{\text{id}}}}\)). These properties are also among the most frequently used inputs in \({E}_{\text{s}}\) prediction. The \({V}_{p}\) and \({I}_{d}\) values were measured experimentally, whereas the \({V}_{\text{m}}\) and \(V_{{{\text{id}}}}\) values were calculated empirically from Eqs. (2) and (3). Therefore, the prediction models, namely SVR, GPR, and ANN, should be created by selecting only one of \({V}_{\text{m}}\), \(V_{{{\text{id}}}}\), and \({V}_{p}\). Similarly, \(V_{{{\text{id}}}}\) and \({I}_{d}\) cannot be used as inputs in the same prediction model at the same time. According to these conditions, the seven inputs were divided into three groups as follows: (1) \(n\), \({V}_{p}\), \({I}_{d}\), SHH, and \({\text{SHH}} \times \gamma\); (2) \(n\), \(V_{{{\text{id}}}}\), SHH, and \({\text{SHH}} \times \gamma\); and (3) \(n\), \({V}_{\text{m}}\), SHH, and \({\text{SHH}} \times \gamma\). Then, the best subset regression analysis was performed for each group to determine the inputs for the prediction models.

According to the results of the best subset regression analysis, the highest R2 and adjR2 values, as well as the lowest RMSE value, for the linear regression models using the inputs in the second and third groups were 0.611, 0.575, and 4.930 MPa, respectively. However, the R2, adjR2, and RMSE values for the linear regression model using the inputs of the first group were 0.774, 0.753, and 3.7635 MPa, respectively. Considering these values, the inputs in the first group, namely \(n\), \({V}_{p}\), \({I}_{d}\), SHH, and \({\text{SHH}} \times \gamma\), were selected to determine the optimum inputs to be employed during the development of the prediction models.

Table 3 presents the results of the best subset regression analysis performed for different combinations of inputs from the first group and illustrates the two best-fitting models for each combination. The differences between Mallow's \({C}_{p}\) and the p value (the number of parameters in the full model) obtained for the 1st, 2nd, and 4th rows are larger than the others. Although the smallest differences between Mallow's \({C}_{p}\) and the number of parameters in the full model are obtained in the 7th and 9th rows, the input parameters in these rows cannot be used because SHH and \({\text{SHH}} \times \gamma\) appear together in the regressions (Table 3); SHH and \({\text{SHH}} \times \gamma\) are not independent of each other, as \({\text{SHH}} \times \gamma\) is a function of SHH. The performance of the regression model in the 3rd row is slightly better than that of the model in the 8th row; in addition, the 3rd row uses two inputs, whereas the 8th row uses four. Although the Mallow's \({C}_{p}\) value obtained in the 3rd row is smaller than that in the 5th row, the R2, adjR2, RMSE, and Mallow's \({C}_{p}\) − p values in the 5th row are better than those in the 3rd row (Table 3).

Table 3 Best subset regression; Es versus \(n\), \({V}_{p}\), \({I}_{d}\), SHH and \({\text{SHH}} \times \gamma\)

As a result of the best subset regression analysis, the regression in which porosity (\(n\)), P-wave velocity (\({V}_{p}\)) and slake durability index (\({I}_{d}\)) are used together yields the best performance. Thus, \(n\), \({V}_{p}\), and \({I}_{d}\) were employed as inputs for the regression models developed for the modulus of elasticity of the investigated samples.
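The exhaustive search described above can be sketched as follows (a Python illustration rather than the study's own implementation; each subset of candidate inputs is fit by ordinary least squares and scored with R2, adjR2, and the standard form of Mallow's \({C}_{p}\)):

```python
import numpy as np
from itertools import combinations

def _ols_predict(X, y):
    """Least-squares fit with an intercept; returns the fitted values."""
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return A @ beta

def best_subset(X, y, names):
    """Score every combination of candidate inputs and sort by adjusted R^2."""
    n, k = X.shape
    sse_full = np.sum((y - _ols_predict(X, y)) ** 2)
    mse_full = sse_full / (n - (k + 1))               # full model, p = k + 1 parameters
    sst = np.sum((y - y.mean()) ** 2)
    rows = []
    for r in range(1, k + 1):
        for cols in combinations(range(k), r):
            sse = np.sum((y - _ols_predict(X[:, cols], y)) ** 2)
            r2 = 1.0 - sse / sst
            adj_r2 = 1.0 - (n - 1) / (n - r - 1) * (1.0 - r2)
            cp = sse / mse_full - (n - 2 * (r + 1))   # Mallow's Cp (standard form)
            rows.append((tuple(names[c] for c in cols), r2, adj_r2, cp))
    return sorted(rows, key=lambda row: -row[2])
```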

Figure 4 illustrates the resulting histograms, cumulative distribution functions (CDFs), and additional statistical information, namely the number of data (N), the maximum and minimum values, and the means and standard deviations of the inputs (porosity, P-wave velocity, and slake durability index) and the output (elastic modulus).

Fig. 4
figure 4

Histograms and statistical summaries of porosity, P-wave velocity, slake durability index, and elastic modulus in the dataset

Most machine learning algorithms, including regression models, rely on separate training and test sets. These are often selected by simple random sampling with a prescribed ratio, such that the training set is larger than the test set. Here, the experimental results (i.e., the original dataset) were randomly divided into a training dataset containing 75% of the original data and a test dataset containing the remaining 25%. The training dataset was employed to fit the machine learning models, and the test dataset was used to evaluate the fitted models based on the defined KPIs.
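Such a simple random 75/25 split can be sketched as follows (Python; the seed is an arbitrary choice made here for reproducibility):

```python
import numpy as np

def split_dataset(X, y, train_ratio=0.75, seed=42):
    """Randomly partition (X, y) into training and test subsets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))                 # shuffle the sample indices
    n_train = int(round(train_ratio * len(y)))
    train, test = idx[:n_train], idx[n_train:]
    return X[train], y[train], X[test], y[test]
```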

Regression model development

This study aims to develop reliable predictive models to determine the elastic modulus of magmatic rocks using porosity (\(n\)), P-wave velocity (\({V}_{p}\)), and slake durability index (\({I}_{d}\)), and then to compare the developed models utilizing the KPIs, the Performance Index, the REC curves, and the Taylor diagram. Let us gather the three inputs, porosity \(\left(n\right)\) (i.e., x1), P-wave velocity \(\left({V}_{p}\right)\) (i.e., x2), and slake durability index \(\left({I}_{d}\right)\) (i.e., x3), in Eq. (9), where \(i\) and \(j\) are the indices of experiments and inputs, and \({N}_{i}\) and \({N}_{j}\) are the numbers of experiments and inputs, respectively.

$${\mathbf{X}} = \left\{ {{\mathbf{x}}_{{\mathbf{i}}} } \right\} = \left\{ {x_{{i,j}} } \right\}~~~\forall i = 1, \ldots ,Ni~~,~~\forall j = 1, \ldots ,Nj$$
(9)

Similarly, let us gather the dependent variable Es (i.e. y1) in a set given in Eq. (10).

$${\mathbf{Y}} = \left\{ {y_{i} } \right\}~~~\forall i = 1, \ldots ,Ni~~$$
(10)

In general, the regression models \(\left( {f\left( {{\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{\beta }}} \right)} \right)\), between independent variables (inputs) and the dependent variables (targets/outputs), can be represented as follows:

$$\begin{gathered} y_{i} = f\left( {{\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{\beta }}} \right) + \epsilon _{i} \hfill \\ {\mathbf{\beta }} = \left\{ {\beta _{0} ,\beta _{1} , \ldots ,\beta _{j} , \ldots ,\beta _{{Nj}} } \right\} \hfill \\ {\mathbf{x}}_{{\mathbf{i}}} = \left\{ {x_{{i,1}} ,x_{{i,2}} , \ldots ,x_{{i,j}} , \ldots ,x_{{i,Nj}} } \right\} \hfill \\ \end{gathered}$$
(11)

where \(\boldsymbol{\beta }\) is the set of unknown parameters, and \(\epsilon\) is the error term. One of the aims of this research is to develop regression models that most closely fit the experimental results, and then to evaluate and compare the performance of the developed models using the defined KPIs. The backgrounds of the SVR, GPR, and ANN models are briefly explained in the following subsections.

Support vector regression

The SVR model, derived from statistical learning theory (Vapnik 1995), uses a kernel function; with the sigmoid kernel, it is equivalent to a two-layer perceptron neural network. SVRs are alternative training methods for polynomial, radial basis function, and multilayer perceptron classifiers in which the weights of the network are found by solving a quadratic programming problem with linear constraints, rather than by solving a non-convex, unconstrained minimization problem as in standard ANN training (Huang et al. 2010).

The goal in linear regression is to minimize the error term between the actual and the predicted values of the dependent variables; whereas, the goal in SVR is to make sure that the errors do not exceed the threshold value (i.e. \(\varepsilon\)) (Suykens and Vandewalle 1999). Suppose that the empirical risk (ER) value is minimized, as follows:

$${\text{ER}} = \frac{1}{{Ni}}\sum\limits_{{i = 1}}^{{Ni}} | y_{i} - \hat{y}_{i} |$$
(12)

where \(|y_{i} - \hat{y}_{i} |\) is \(\varepsilon\)-insensitive loss function written as:

$$\left| {y_{i} - \hat{y}_{i} } \right| = \left\{ {\begin{array}{ll} 0 & {{\text{if}}\;\left| {y_{i} - \hat{y}_{i} } \right| < \varepsilon } \\ {\left| {y_{i} - f\left( {{\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{\beta }}} \right)} \right| - \varepsilon } & {{\text{otherwise}}} \\ \end{array} } \right.$$
(13)

The proposed solution to this minimization problem is presented in Eq. (14).

$$\begin{gathered} \begin{array}{*{20}c} {{\text{minimize}}} & {\frac{1}{2}{\mathbf{\beta }}^{T} {\mathbf{\beta }} + C\sum\limits_{{i = 1}}^{{Ni}} {\left( {\xi _{i} + \xi _{i}^{*} } \right)} } \\ \end{array} \hfill \\ \begin{array}{*{20}c} {{\text{subject to}}} & {\begin{array}{*{20}c} {y_{i} - f({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{\beta }}) \le \varepsilon + \xi _{i} } \\ {f({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{\beta }}) - y_{i} \le \varepsilon + \xi _{i}^{*} } \\ {\xi _{i} ,\xi _{i}^{*} \ge 0} \\ \end{array} } \\ \end{array} \hfill \\ \end{gathered}$$
(14)

where \(\xi _{i} ,\xi _{i}^{*}\) are the positive and negative slack variables measured from the threshold value \((\varepsilon )\), respectively, and \(C\) is the meta-parameter that controls the trade-off between the model complexity (i.e., flatness) and the deviations beyond \((\varepsilon )\). If C is too large, the optimization minimizes only the empirical risk, without considering model complexity. Additionally, the value of \((\varepsilon )\) influences the number of support vectors used for constructing the regression function: the larger \((\varepsilon )\), the fewer support vectors are selected.
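The ε-insensitive loss in Eq. (13) can be sketched as follows (Python; the ε value in the check is arbitrary):

```python
import numpy as np

def eps_insensitive_loss(y, y_hat, eps):
    """Vapnik's eps-insensitive loss, Eq. (13): residuals inside the eps tube
    cost nothing; outside it the cost grows linearly with the excess."""
    r = np.abs(np.asarray(y, dtype=float) - np.asarray(y_hat, dtype=float))
    return np.where(r < eps, 0.0, r - eps)
```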

Gaussian process regression

GPR is a non-parametric kernel-based probabilistic regression model. Although GPR is a powerful modeling tool, it has been applied to only a few problems in rock mechanics (Momeni et al. 2020; Kumar et al. 2013, 2014; Huang et al. 2017). The basic idea behind GPR is to predict the value of a function at a given point by computing a weighted average of the known values of the function in the neighborhood of that point; it combines a global model with local deviations (Rasmussen 2004; Hong et al. 2014). Now consider an unseen observation \({\mathbf{x}}_{*} = \{ x_{{*,j}} \} \;\;\forall j = 1, \ldots ,Nj\) at which the prediction is to be made. The GPR model is of the form (Eq. 15)

$$\hat{y}({\mathbf{x}}_{*} ) = f({\mathbf{x}}_{*} ,\beta ) + Z({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}} )$$
(15)

where \(f({\mathbf{x}}_{*} ,{\mathbf{\beta }})\) is the unknown polynomial function, and \(Z({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}} )\) is the realization of a stochastic process with mean zero and nonzero covariance, which is given in Eq. (16).

$${\text{cov}}\left( {Z\left( {{\mathbf{x}}_{{\mathbf{i}}} } \right),Z\left( {{\mathbf{x}}_{{\mathbf{k}}} } \right)} \right) = \sigma ^{2} {\mathbf{R}}_{{ik}} = \sigma ^{2} R\left( {{\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}} } \right)$$
(16)

where \({\mathbf{R}}\) is the correlation matrix, and \(R({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}} )\) is the correlation function between two observations (i.e. \({\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}}\)). When the Gaussian correlation function is employed, the correlation function is expressed as:

$$R\left( {{\mathbf{x}}_{{\mathbf{i}}} ,{\mathbf{x}}_{{\mathbf{k}}} } \right) = \exp \left[ { - \sum\limits_{{j = 1}}^{{Nj}} {\theta _{j} } \left( {x_{{i,j}} - x_{{k,j}} } \right)^{2} } \right]\quad \forall i,k = 1, \ldots ,Ni$$
(17)

where \(\theta _{j}\) is the unknown correlation parameter to be determined, and \({N}_{j}\) is the number of inputs.

The resulting predictor at the unseen observation is given in Eq. (18)

$$\hat{y}\left( {{\mathbf{x}}_{*} } \right) = \widehat{{\mathbf{\beta }}} + {\mathbf{r}}^{{\mathbf{T}}} \left( {{\mathbf{x}}_{*} } \right){\mathbf{R}}^{{ - 1}} \left( {{\mathbf{y}} - {\mathbf{f}}\widehat{\beta }} \right)$$
(18)

where \({\mathbf{y}}\) is the column vector of length \({N}_{i}\) that contains the responses of the experimental data and \({\mathbf{f}}\) is a column vector of length \({N}_{i}\) that is filled with ones when \({\mathbf{f}}({\mathbf{x}})\) is taken as a constant. \({\mathbf{r}}^{{\mathbf{T}}} ({\mathbf{x}}_{*} )\) is the correlation vector between the unseen observation \(x_{*}\) and the whole seen observation (i.e.\({\mathbf{X}} = \{ {\mathbf{x}}_{{\mathbf{i}}} \} \quad \forall i = 1, \cdots ,Ni\)), which is defined as:

$${\mathbf{r}}^{{\mathbf{T}}} ({\mathbf{x}}_{*} ) = \left[ {R\left( {{\mathbf{x}}_{*} ,{\mathbf{x}}_{{\mathbf{1}}} } \right), \ldots ,R\left( {{\mathbf{x}}_{*} ,{\mathbf{x}}_{{\mathbf{i}}} } \right), \ldots ,R\left( {{\mathbf{x}}_{*} ,{\mathbf{x}}_{{{\mathbf{Ni}}}} } \right)} \right]^{T}$$
(19)

The parameter \(\widehat{\beta }\) is calculated in Eq. (20).

$$\hat{\beta } = \left( {{\mathbf{f}}^{{\mathbf{T}}} {\mathbf{R}}^{{ - {\mathbf{1}}}} {\mathbf{f}}} \right)^{{ - {\mathbf{1}}}} {\mathbf{f}}^{{\mathbf{T}}} {\mathbf{R}}^{{ - {\mathbf{1}}}} {\mathbf{y}}$$
(20)

The estimated variance is calculated as:

$$\hat{\sigma }^{2} = \frac{{({\mathbf{y}} - {\mathbf{f}}\hat{\beta })^{T} {\mathbf{R}}^{{ - 1}} ({\mathbf{y}} - {\mathbf{f}}\hat{\beta })}}{{N_{i} }}$$
(21)

The unknown parameters \(\theta _{j}\) in Eq. (17) are obtained by employing the maximum likelihood approach, defined as:

$${\text{max}}~\phi \left( {\theta _{j} } \right) = - \frac{{N_{i} \ln \left( {\hat{\sigma }^{2} } \right) + \ln (|{\mathbf{R}}|)}}{2}$$
(22)
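Equations (17)–(20) can be assembled into a compact predictor (a Python sketch under simplifying assumptions: a constant trend, user-supplied correlation parameters \(\theta_j\) rather than the maximum-likelihood estimates of Eq. (22), and a small nugget added to the diagonal for numerical stability):

```python
import numpy as np

def gauss_corr(A, B, theta):
    """Gaussian correlation, Eq. (17): exp(-sum_j theta_j (x_ij - x_kj)^2)."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2 * theta).sum(axis=2)
    return np.exp(-d2)

def krige_fit(X, y, theta, nugget=1e-10):
    """Estimate beta-hat (Eq. 20) and precompute the weight vector."""
    R = gauss_corr(X, X, theta) + nugget * np.eye(len(y))
    Ri = np.linalg.inv(R)
    f = np.ones(len(y))                       # constant-trend regressor
    beta = (f @ Ri @ y) / (f @ Ri @ f)        # Eq. (20) for a constant trend
    return {"X": X, "theta": theta, "beta": beta, "w": Ri @ (y - f * beta)}

def krige_predict(model, Xs):
    """Predictor of Eq. (18): beta-hat + r^T(x*) R^-1 (y - f beta-hat)."""
    r = gauss_corr(Xs, model["X"], model["theta"])   # correlation vector, Eq. (19)
    return model["beta"] + r @ model["w"]
```

With a negligible nugget, the predictor interpolates the training data, a characteristic property of this formulation.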

Artificial neural network

The ANN model is an alternative fitting method for mapping a relation between inputs and outputs. An ANN is composed of an input layer, one or more hidden layers, and an output layer, with the neuron as its building block. Every neuron is connected, through weights, to every neuron in both the previous and the following layer. Moreover, each neuron has a bias that determines whether it activates depending on the level of the input signal (Agatonovic-Kustrin and Beresford 2000; Bektas et al. 2019a). The inter-connectivity between neurons generally follows either a feed-forward (FFNN) or a recurrent (RNN) architecture. In the present work, a feed-forward hierarchical topology is used, and the neuron's output \(z_{j}\) is expressed in Eq. (23).

$$z_{j} = \sum\limits_{{i = 1}}^{n} {w_{i} x_{i} } + \beta$$
(23)

where \(x_{i}\) is the input, \(w_{i}\) is the weight of the neuron, \(\beta\) is the bias value, and \(n\) is the number of inputs. The relationship between the output and the inputs is formulated as:

$$y_{j} = f_{h} \left( {z_{j} } \right)$$
(24)

where \({f}_{h}\) is the activation function, which is generally linear or sigmoid. In the present work, the sigmoid activation function was employed, as defined in Eq. (25) (Ham and Kostanic 2000; Bektas et al. 2019b)

$$f_{h} \left( x \right) = \frac{1}{{1 + e^{{ - x}} }}$$
(25)

The general network structure for a two-layer feedforward network with a sigmoid transfer function (\(f_{h}\)) in the hidden layer and a linear transfer function (\(f_{o}\)) in the output layer is written in Eq. (26).

$$y_{j} = f_{o} \left( {b + \sum\limits_{{h = 1}}^{{n_{h} }} {w_{h} \cdot f_{h} \left( {b_{h} + \sum\limits_{{i = 1}}^{n} {w_{{ih}} \cdot x_{i} } } \right)} } \right)$$
(26)

Depending on the techniques used to train the feed-forward neural network models, different back-propagation algorithms have been developed. In this study, the Levenberg–Marquardt back-propagation algorithm was used for training. In the back-propagation phase, the performance index E(W) to be minimized was defined as the sum of the squared errors between the target and network output as defined in Eq. (27).

$$E(W) = e^{T} e$$
(27)

where W consists of all weights in the network, and e is the error vector comprising the errors for all the training examples. When training with the Levenberg–Marquardt algorithm, the weight change ΔW can be computed as follows:

$$\Delta W_{k} = - \left[ {J_{k}^{T} J_{k} + \mu _{k} I} \right]^{{ - 1}} J_{k}^{T} e_{k}$$
(28)

Then, the update of the weights can be adjusted according to Eq. (29)

$$W_{{k + 1}} = W_{k} + \Delta W_{k}$$
(29)

where J is the Jacobian matrix, I is the identity matrix, and µ is the Marquardt parameter, updated using the decay rate β depending on the outcome. In particular, µ is multiplied by the decay rate β (0 < β < 1) when E(W) decreases, and divided by β when E(W) increases in a new k-step. After the construction of the overall network, the weights and biases are iteratively adjusted to meet the predefined error criteria. Iteration by iteration, the weights and biases are updated in proportion to the mean squared error between the calculated output and the desired targets.
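The update of Eqs. (28)–(29) and the adaptive µ rule can be sketched on a deliberately simple problem, fitting a straight line by least squares so that the Jacobian is constant (a Python illustration, not the MATLAB training routine used in the study):

```python
import numpy as np

def lm_step(J, e, mu):
    """Levenberg-Marquardt update, Eq. (28): dW = -(J^T J + mu I)^-1 J^T e."""
    JtJ = J.T @ J
    return -np.linalg.solve(JtJ + mu * np.eye(JtJ.shape[0]), J.T @ e)

def lm_fit_line(x, y, mu=1e-3, decay=0.5, iters=20):
    """Fit y = w0 + w1*x; mu shrinks when the error drops and grows otherwise."""
    w = np.zeros(2)
    J = np.column_stack([np.ones_like(x), x])  # Jacobian of the residuals w.r.t. w
    for _ in range(iters):
        e = J @ w - y                          # residual vector
        w_new = w + lm_step(J, e, mu)          # apply Eq. (29)
        e_new = J @ w_new - y
        if e_new @ e_new < e @ e:              # E(W) decreased: accept, shrink mu
            w, mu = w_new, mu * decay
        else:                                  # E(W) increased: reject, grow mu
            mu = mu / decay
    return w
```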

Criteria for the performance evaluation and data normalization

To assess the accuracies of the SVR, GPR, and ANN models, the following KPIs were computed for each model: MSE, RMSE, R2, adjR2, MAE, the Nash–Sutcliffe coefficient (NS), which evaluates the capability of the model at simulating the output data relative to the mean statistics, the variance account factor (VAF), which represents the ratio of the error variance to the measured data variance, and the Performance Index (PI). MSE, RMSE, R2, and adjR2 were calculated using Eqs. (4)–(8). MAE, NS, and VAF were calculated using Eqs. (30)–(32), where \({y}_{i}\) and \({\widehat{y}}_{i}\) are the measured and predicted values. The PI index is calculated separately with the normalized RMSE and the normalized MAE, as written in Eqs. (33) and (34). The NRMSE and NMAE values were obtained by dividing the RMSE and MAE values by the standard deviation of the measured Es values.

$${\text{MAE}} = \frac{1}{{N_{i} }}\sum\limits_{{t = 1}}^{{N_{i} }} {\left| {y_{t} - \hat{y}_{t} } \right|}$$
(30)
$${\text{NS}} = 1 - \frac{{\sum\limits_{{t = 1}}^{{N_{i} }} {\left( {y_{t} - \hat{y}_{t} } \right)^{2} } }}{{\sum\limits_{{t = 1}}^{{N_{i} }} {\left( {y_{t} - \bar{y}} \right)^{2} } }}$$
(31)
$${\text{VAF}}_{i} = \left( {1 - \frac{{{\text{var}}\left( {y_{i} - \hat{y}_{i} } \right)}}{{{\text{var}}\left( {y_{i} } \right)}}} \right) \cdot 100\%$$
(32)
$${\text{PI}}_{{{\text{RMSE}}}} = {\text{adj}}R^{2} + 0.01\;{\text{VAF}} - {\text{NRMSE}}$$
(33)
$${\text{PI}}_{{{\text{MAE}}}} = {\text{adj}}R^{2} + 0.01\;{\text{VAF}} - {\text{NMAE}}$$
(34)
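Equations (30)–(34) translate directly into code; a sketch (Python):

```python
import numpy as np

def error_kpis(y, y_hat):
    """MAE, NS, VAF (Eqs. 30-32) and the normalized errors used in the PIs."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    mae = np.mean(np.abs(y - y_hat))                                   # Eq. (30)
    ns = 1.0 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)  # Eq. (31)
    vaf = (1.0 - np.var(y - y_hat) / np.var(y)) * 100.0                # Eq. (32)
    rmse = np.sqrt(np.mean((y - y_hat) ** 2))
    sd = np.std(y)
    return {"MAE": mae, "NS": ns, "VAF": vaf, "NRMSE": rmse / sd, "NMAE": mae / sd}
```

A perfect model yields MAE = 0, NS = 1, and VAF = 100%; the PIs then follow by adding adjR2 and 0.01 VAF and subtracting the normalized error.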

The Taylor diagram is also used to assess the performance of the regression models (Taylor 2001). It is a two-dimensional plot showing four statistical quantities: (1) the standard deviation of the observed data (\({\sigma }_{o}\)), (2) the standard deviation of the predicted data (\({\sigma }_{p}\)), (3) the correlation coefficient (R), and (4) the centered RMSE, which is the key to constructing the Taylor diagram and is defined in Eq. (35).

$${\text{RMSE}}_{{{\text{centered}}}}^{2} = \sigma _{o}^{2} + \sigma _{p}^{2} - 2\sigma _{o} \sigma _{p} R$$
(35)

The standard deviation shown in Fig. 5 is denoted by the radial distance from the origin. When the standard deviation of the predicted value (\({\sigma }_{p}\)) is closer to the standard deviation of the observed (\({\sigma }_{o}\)), the performance of the model is higher. R is represented by the azimuthal angle (Fig. 5). The centered RMSE is related to the distance between the observed (OBS) and developed model (MDL), assessed in units identical to those of the standard deviation. While the performance of the model increases with increasing R, it decreases with increasing centered RMSE value.
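The four quantities placed on the diagram can be computed as follows (a Python sketch; note that the centered RMSE equals the standard deviation of the mean-removed residuals, consistent with Eq. (35)):

```python
import numpy as np

def taylor_stats(obs, pred):
    """Standard deviations, correlation, and centered RMSE for a Taylor diagram."""
    obs = np.asarray(obs, dtype=float)
    pred = np.asarray(pred, dtype=float)
    s_o, s_p = np.std(obs), np.std(pred)
    r = np.corrcoef(obs, pred)[0, 1]
    e2 = s_o ** 2 + s_p ** 2 - 2.0 * s_o * s_p * r   # Eq. (35), squared form
    return s_o, s_p, r, np.sqrt(e2)
```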

Fig. 5

Drawing of Taylor diagram (reproduced from Taylor 2001)

Another graphical assessment of the performance of the models was conducted with the REC curve, which is a generalization of the receiver operating characteristic (ROC) curve for regression (Bi and Bennett 2003). The REC curve plots the error tolerance on the x-axis versus the percentage of points predicted within that tolerance on the y-axis. The resulting curve estimates the cumulative distribution function of the error. The area over the REC curve (AOC) provides an approximation of the expected error and thus reveals additional information for assessing a model: the smaller the AOC, the better the model performs (Bi and Bennett 2003).
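A REC curve and its AOC can be sketched as follows (an illustrative NumPy implementation; the resolution of the tolerance grid is an assumption):

```python
import numpy as np

def rec_curve(y, y_hat, n_steps=100):
    """REC curve: fraction of points with |error| <= tolerance, vs tolerance."""
    err = np.sort(np.abs(np.asarray(y, float) - np.asarray(y_hat, float)))
    tol = np.linspace(0, err[-1], n_steps)
    # searchsorted(..., side="right") counts errors <= each tolerance value
    acc = np.searchsorted(err, tol, side="right") / err.size
    return tol, acc

def aoc(tol, acc):
    """Area over the REC curve (trapezoidal rule): approximates the expected error."""
    over = 1 - acc
    return float(np.sum((over[1:] + over[:-1]) / 2 * np.diff(tol)))
```

For a perfect model the curve jumps to 100% at zero tolerance and the AOC is 0; as errors grow, the curve shifts right and the AOC approaches the mean absolute error.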

The experimental results differ in scale. Therefore, an adjustment was made before employing the suggested methods to prevent the models from being dominated by variables with large values. Z-score normalization is widely used in this field, and its formulation is defined in Eq. (36).

$$x_{i,j}^{{{\text{norm}}}} = \frac{{x_{i,j} - \bar{x}_{i} }}{{\sqrt {\frac{1}{{N_{j} }}\sum\limits_{{j = 1}}^{{N_{j} }} {\left( {x_{i,j} - \bar{x}_{i} } \right)^{2} } } }}\quad \forall j = 1, \ldots ,N_{j} ,\;\forall i = 1, \ldots ,N_{i}$$
(36)
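Eq. (36) amounts to column-wise standardization of the data matrix. A minimal NumPy equivalent (using the population standard deviation, matching the \(1/N_j\) term in the denominator):

```python
import numpy as np

def zscore(X):
    """Column-wise z-score normalization (Eq. 36): zero mean, unit std per variable."""
    X = np.asarray(X, float)
    # axis=0: statistics per variable (column); np.std uses ddof=0 by default,
    # matching the 1/N_j population variance of Eq. (36)
    return (X - X.mean(axis=0)) / X.std(axis=0)
```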

Results

Application and prediction

In this study, the SVR, GPR, and ANN models developed for the prediction of \({E}_{\text{s}}\) were implemented in the MATLAB 2019a software environment. Considering the results of best subset regression, porosity (\(n\)), P-wave velocity (\({V}_{p}\)) and slake durability index (\({I}_{d}\)) were taken into account as input parameters in the models. The experimental results, that is, the original dataset, were randomly divided into two datasets, which were (1) the training dataset that contained 75% of the original data and (2) the test dataset that contained 25% of the original data.
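The random 75/25 partition can be sketched as follows (the seed and function name are illustrative; the study's actual partition was a random draw):

```python
import numpy as np

def split_75_25(X, y, seed=0):
    """Randomly split a dataset into 75% training and 25% test subsets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))       # shuffle sample indices
    cut = int(0.75 * len(y))            # 75% boundary
    tr, te = idx[:cut], idx[cut:]
    return X[tr], y[tr], X[te], y[te]
```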

In supervised machine learning, the covariance (kernel) function expresses the statistical relationship between two data points (\({x}_{i}\), \({x}_{j}\)). The kernel function k(\({x}_{i}\), \({x}_{j}\)), which is specified by hyperparameters (\(\theta\)), can be defined in various forms, such as Gaussian, exponential Gaussian, rational quadratic, Matern 5/2, and radial basis function. In this study, the MATLAB hyperparameter optimization method was employed to obtain the optimum hyperparameters for each regression model. For the SVR model, the Gaussian function was selected as the kernel and its hyperparameters were optimized by the kernel scale method. The kernel scale, input means (µ), input standard deviations (σ), and bias term (β) were found to be 6.9, (4.9, 3.5, 90.73), (3.14, 0.46, 8.68), and 19.44, respectively. For the GPR model, the exponential Gaussian function was chosen as the kernel and its hyperparameters were optimized by the sigma method. The characteristic length scale, input standard deviation, and bias term were computed as 9.29, 10.03, and 18.75, respectively. The Levenberg–Marquardt back-propagation approach was used to train a feedforward neural network using supervised learning. The constructed network was composed of one input layer with three neurons, two hidden layers with ten neurons in the first and six in the second, and an output layer with one neuron. The tansig function, which is suitable for smooth prediction, was used as the activation function in the ANN model. The constructed ANN model was run for 300 iterations, and the best model was selected according to the lowest MSE. Thirty-nine training epochs were used, and the 29th epoch was found to be the best, with an MSE of 1.678.
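To make the role of the GPR covariance function concrete, the following minimal NumPy sketch implements the posterior mean of a zero-mean Gaussian process with a squared-exponential (Gaussian) kernel. The length scale, signal variance, and noise values are illustrative defaults, not the hyperparameters reported above, and the function names are assumptions:

```python
import numpy as np

def rbf_kernel(A, B, length_scale=1.0, sigma_f=1.0):
    """Squared-exponential covariance k(x_i, x_j) between rows of A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)   # pairwise squared distances
    return sigma_f ** 2 * np.exp(-0.5 * d2 / length_scale ** 2)

def gpr_predict(X_train, y_train, X_test, length_scale=1.0, sigma_f=1.0, noise=1e-6):
    """Posterior mean of a zero-mean Gaussian process regressor."""
    # Train-train covariance, regularized by the noise term on the diagonal
    K = rbf_kernel(X_train, X_train, length_scale, sigma_f) + noise * np.eye(len(X_train))
    # Test-train covariance projects the training targets to the test points
    k_star = rbf_kernel(X_test, X_train, length_scale, sigma_f)
    return k_star @ np.linalg.solve(K, y_train)
```

With a near-zero noise term the posterior mean interpolates the training data, which shows how the kernel hyperparameters, rather than explicit weights, control the fitted function.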

A comparison of the estimated values with the measured true values is shown in Figs. 6 and 7. Figure 6 indicates that the Es values obtained from the ANN and GPR models are located closer to the measured values, compared to the values obtained for the SVR model. When considering the outputs of the regression line drawn according to the measured and predicted \({E}_{\text{s}}\) values, and the line (1:1 line) where the measured and predicted values are equal (Fig. 7), the ANN and GPR model outputs are more successful than the other models.

Fig. 6

Comparison of the estimated and measured \({E}_{\text{s}}\) values

Fig. 7

Scatter plots for the SVR, GPR, and ANN models developed showing the training and test periods

Response surface

The training dataset was used to develop the regression models, and the unseen values required for the response surface plots were estimated using the developed models. The impacts of the P-wave velocity and slake durability index on \({E}_{\text{s}}\) at constant porosity are depicted in Figs. 8, 9 and 10. According to all the regression models developed, \({E}_{\text{s}}\) tends to increase with decreasing porosity and with increasing P-wave velocity and slake durability index. When the \(n\) value is kept constant, the \({E}_{\text{s}}\) value increases as the \({V}_{p}\) value increases; however, the change of \({E}_{\text{s}}\) with \({I}_{d}\) is not very clear. Moreover, the plotted response surfaces show that the SVR model behaves linearly, the GPR model quasi-linearly, and the ANN model completely nonlinearly. Owing to the calculation algorithm of the SVR model, its response surfaces exhibit more linear behavior than those of the GPR and ANN models. As shown in Fig. 9, the \({E}_{\text{s}}\) value reaches its maximum at the maximum \({V}_{p}\) value, regardless of the \({I}_{d}\) value, because \({V}_{p}\) plays an important role in estimating the \({E}_{\text{s}}\) value in the ANN model. The response surfaces of the training and test data illustrate the same trends for each regression model.

Fig. 8

Surface and contour plots for the SVR model showing the effect of \({I}_{d}\) and \({V}_{p}\) for selected \(n\) values on \({E}_{\text{s}}\); a training dataset and b test dataset

Fig. 9

Surface and contour plots for the GPR model showing the effect of \({I}_{d}\) and \({V}_{p}\) for selected \(n\) values on \({E}_{\text{s}}\); a training dataset and b test dataset

Fig. 10

Surface and contour plots for the ANN model showing the effect of \({I}_{d}\) and \({V}_{p}\) for selected \(n\) values on \({E}_{\text{s}}\); a training dataset and b test dataset

Performance evaluation

In the training and test periods, the performance of the ANN model is very close to that of the GPR model, whereas both differ significantly from the SVR model (Table 4, Figs. 11 and 12).

Table 4 Performance evaluation of the developed models using the defined KPIs
Fig. 11

Performance evaluation by Taylor diagram of the models developed

Fig. 12

REC curves of the models developed

During the training period, the RMSE, NRMSE, MAE, NMAE, R2, NS, and VAF values for the SVR model were 3.244 GPa, 0.459, 2.558 GPa, 0.362, 0.787, 0.782, and 78.26%, respectively (Table 4). When considering the RMSE, the standard deviation of the estimated \({E}_{\text{s}}\) values, and the R values in the Taylor diagram, the performance of the SVR model is significantly worse than those of the ANN and GPR models (Fig. 11). The AOC value of the SVR model is 2.521 GPa (Fig. 12). The PIRMSE and PIMAE values obtained for the SVR model are 1.101 and 1.198, respectively (Table 4). From these performance indicators, it can be concluded that the learning ability of the SVR model is insufficient.

During the training period, the RMSE, NRMSE, R2, NS, and VAF values of the GPR model were 2.066 GPa, 0.292, 0.912, 0.908, and 91.17%, respectively, while those for the ANN model were 2.486 GPa, 0.352, 0.870, 0.872, and 87.24%, respectively (Table 4). Additionally, the AOC, MAE, and NMAE values of the ANN model were 1.6846 GPa, 1.493 GPa, and 0.211, respectively, and those of the GPR model were 2.377 GPa, 1.532 GPa, and 0.217, respectively (Table 4 and Fig. 12). The standard deviation of the measured \({E}_{\text{s}}\) was 7.0707 GPa, that of the \({E}_{\text{s}}\) predicted using the ANN model was 7.0673 GPa, and that of the \({E}_{\text{s}}\) predicted using the GPR model was 6.3447 GPa. Therefore, the variability of the measured \({E}_{\text{s}}\) values and that estimated by the ANN model are similar, and the ANN model is more successful in terms of proximity to the measured \({E}_{\text{s}}\) (Fig. 11). In addition, the best result in approaching the maximum and minimum values of \({E}_{\text{s}}\) is obtained with the ANN model. However, as shown in the Taylor diagram (Fig. 11), the GPR model is more successful than the ANN model in terms of the RMSE and R values. In terms of the PIRMSE and PIMAE values, the GPR model also performed better: the PIRMSE and PIMAE values of the GPR model are 1.5251 and 1.6006, while those of the ANN model are 1.3844 and 1.5249, respectively.

For the test period, the RMSE, MAE, NRMSE, NMAE, R2, NS, VAF, and AOC values of the SVR model are 5.662 GPa, 5.196 GPa, 0.619, 0.568, 0.695, 0.615, 62.7%, and 4.198 GPa, respectively (Table 4 and Fig. 12). The PIRMSE and PIMAE values obtained with the SVR model for the test data are 0.6519 and 0.7029, respectively. When considering the values of the key performance indices, the PIRMSE, and PIMAE, the SVR model is not successful in predicting the \({E}_{\text{s}}\) of the samples investigated.

For the test data, the ANN and GPR models are better than the SVR model in terms of R, RMSE, proximity to the standard deviation of the measured \({E}_{\text{s}}\), and AOC values (Figs. 11 and 12).

The performance of the GPR model on the test data in terms of R2, RMSE, NRMSE, NS, and VAF is higher than that of the ANN model (Table 4). In the test period, the RMSE, NRMSE, R2, NS, and VAF values obtained for the GPR model are 3.365 GPa, 0.368, 0.898, 0.864, and 86.5%, respectively, and those for the ANN model are 3.767 GPa, 0.412, 0.859, 0.835, and 83.4%, respectively (Table 4). When considering only the MAE, NMAE, and AOC values, the ANN model is more successful than the GPR model in predicting \({E}_{\text{s}}\). For the test period, the MAE, NMAE, and AOC values for the ANN model are 2.337 GPa, 0.255, and 1.3822 GPa, respectively, while those for the GPR model are 3.043 GPa, 0.332, and 1.5598 GPa, respectively (Table 4 and Fig. 12).

As illustrated in the Taylor diagram for the test data, the slope of the line connecting the point representing the GPR model with the origin is lower than that of the line for the ANN model (Fig. 11). This indicates that the GPR model has a higher R value and a lower RMSE value than the ANN model in the test period. However, the point representing the ANN model is closer to the circle passing through the OBS (observed value) than the point representing the GPR model (Fig. 11). In the test period, the standard deviation of the measured \({E}_{\text{s}}\) is 9.0517 GPa, while the standard deviations of the \({E}_{\text{s}}\) predicted using the ANN and GPR models are 7.637 GPa and 7.157 GPa, respectively. Therefore, for the test data, as for the training data, the variability of the measured \({E}_{\text{s}}\) values is significantly closer to that of the \({E}_{\text{s}}\) values estimated by the ANN model than to that of the GPR model (Fig. 11). In the test period, the PIRMSE and PIMAE values obtained for the ANN model are 1.2567 and 1.4139, respectively, while those obtained for the GPR model are 1.3779 and 1.4142, respectively. Considering that PIRMSE and PIMAE combine more than one KPI, the GPR model performed better than the ANN model.

Discussion

To compare the performance of the models developed in this study with the prediction models given in the literature, studies that aimed to predict the \({E}_{\text{s}}\) of magmatic and metamorphic rocks and that include prediction models based on ANN, SVR, and GPR were examined. As shown in Table 1, the intact rock properties commonly used in the prediction models suggested for the \({E}_{\text{s}}\) of magmatic and metamorphic rocks are the Schmidt hammer hardness (SHH), shore hardness (SH), UCS, tensile strength (TS), point load index (PLI), and block punch index (BPI). Although the measurement of SHH is rapid, simple, portable, and easily executed, it has limitations, such as the anisotropy and heterogeneity of the rocks, a very small test area, and sensitivity to surface roughness (Yilmaz 2009). UCS, TS, and BPI tests have two main limitations: (1) a standard sample cannot always be obtained, and (2) the tests are not repeatable. The BPI test is only valid for very thin, small discs, and irregular failures create the need for a substantial number of rock specimens (Yilmaz 2009). These limitations and problems related to the UCS, TS, and BPI tests are more concerning for weathered rocks. The disadvantages of the PLI test, which does not need a standard sample, are as follows: (1) tests are applied over very small areas, (2) invalid test results frequently occur, (3) the specimen may move during loading, and (4) micro-fissures may cross the conical platens (Yilmaz 2009). In addition, the force measurement accuracy in PLI testing may not be sufficient for weathered rock samples, in which the conical platens may penetrate the specimen. The main shortcomings of the SH test are that the measurements may be obtained from a random mineral and that they are affected by the anisotropy and/or heterogeneity of the rocks.
Taking these limitations and problems into account, it can be acknowledged that there may be difficulties in using the results of such hardness and strength tests as inputs in regression models.

The performances of the soft computing models to estimate the \({E}_{\text{s}}\) of magmatic and metamorphic rock samples reported in Khandelwal and Singh (2011), Saedi et al. (2018), Tian et al. (2019), Armaghani et al. (2020), and Acar and Kaya (2020) are higher than those of the GPR model proposed in this study when comparing their R2 values (Table 1). However, these prediction models use one or more of the SHH, SH, UCS, TS, PLI, and BPI as inputs. Other prediction models developed for the \({E}_{\text{s}}\) of magmatic and metamorphic rock samples reported by Sonmez et al. (2006), Manouchehrian et al. (2013), Armaghani et al. (2016), Atici (2016), and Behzadafshar et al. (2019) also used hardness and strength properties as inputs, even though they had limitations and problems (Table 1). Furthermore, the performances of the regression models were not better than those of the GPR and ANN models developed in this study.

Kumar et al. (2013) adopted a relevance vector machine (RVM), GPR, and minimax probability machine regression (MPMR) for the prediction of UCS and \({E}_{\text{s}}\) of travertine samples. The GPR model in their study had a higher R2 value than the GPR and ANN models developed in this study, but UCS and PLI were used as inputs in their study. The ANFIS models used by Armaghani et al. (2015) and Singh et al. (2017) performed better than the GPR models developed in this study but here, the GPR models were developed with different magmatic rock types with different degrees of weathering, while the ANFIS models were developed with only one rock type (Table 1). The input parameters of the LS-SVM models for predicting the \({E}_{\text{s}}\) of weathered rock samples given in Ceryan (2016) are the effective porosity and the P-durability index. The P-durability index is based on the slake durability index and P-wave velocity. Although the LS-SVM model successfully predicted \({E}_{\text{s}}\), it had a lower R2 value than the GPR and ANN models developed in this study. In addition, the P-durability index could not be selected as a suitable input parameter while selecting input parameters using the best subset regression analysis applied in this study. Behnia et al. (2017) developed a gene expression programming (GEP) model to predict the \({E}_{\text{s}}\) and UCS of different rocks. The model used quartz content, porosity, and density as input parameters and performed successfully, with an R2 value of 0.927. It can be concluded that using the GEP model to predict the \({E}_{\text{s}}\) of weathered magmatic and metamorphic rock is useful.

When selecting the inputs used in the models suggested in this study, the following factors were taken into account: (1) the inputs should characterize the intrinsic properties of the rock material and its state of weathering, (2) their measurement should be easy, rapid, practical, and economical, and (3) they should provide high performance in the prediction models in which they are used. The simple and non-destructive properties (\(n\), \({V}_{p}\), \({V}_{\text{m}}\), \(V_{{{\text{id}}}}\), \({I}_{d}\), SHH, and \({\text{SHH}} \times \gamma\)) were obtained experimentally for use in the \({E}_{\text{s}}\) estimation of the studied samples. As indicated in Table 1, these properties are the most frequently used inputs in prediction models. However, not all of them have been used in the same prediction models because of the practical and technical difficulties discussed here. For this reason, the best subset regression approach was employed to determine the inputs. As a result, the porosity (\(n\)), P-wave velocity (\({V}_{p}\)), and slake durability index (\({I}_{d}\)) were employed as inputs for the proposed SVR, GPR, and ANN models.

There are good relationships between the elastic wave velocity and the chemical and mineralogical composition of rocks (Ceryan 2015). This is also valid for weathered rock because the fresh mineral content decreases, while micro-fracture voids increase with weathering. Therefore, \({V}_{p}\) decreases with increasing weathering (Ceryan et al. 2008a; Wyering et al. 2014; Momeni et al. 2017; de Vilder et al. 2019), and it is possible to characterize weathered magmatic and metamorphic rock material properties by \({V}_{p}\) measurements (Ceryan et al. 2008b). Furthermore, it was also proposed that a non-destructive measurement of \({V}_{p}\) offers an alternative input for \({E}_{\text{s}}\) estimation with relative ease and at a low operational cost (Yasar and Erdogan 2004). The pore characteristics and fundamental microstructural parameters are important physical properties that govern the physical attributes of rocks, for example, strength, deformability, and hydraulic conductivity (Tugrul 2004). There are difficulties in determining pore size distribution, pore geometry, pore infilling, and pore connectivity, and therefore, porosity and effective porosity are commonly used to define the pores of rock materials (Ceryan 2014, 2015). The total porosity and the number of connected pores generally characterize the weathering state and these properties increase with weathering (Ceryan et al. 2008a). For these reasons, porosity or effective porosity is commonly used for estimating the \({E}_{\text{s}}\) of rock materials (Table 1). The slake durability test is an inexpensive and easy test to conduct and requires very little sample preparation. Therefore, it is a good index for representing weathering processes (Lee and De Freitas 1989; Cargill and Shakoor 1990; Ceryan et al. 2008b; Sharma et al. 2008; Ceryan 2015). The slake durability index can be used inexpensively and easily to estimate the deformation modulus of weathered or soft rocks (Table 1). 
Considering the results of the best subset regression analysis performed and the characterization of the weathering process in magmatic and metamorphic rocks, it is apparent that using the porosity, slake durability index, and P-wave velocity as inputs in prediction models developed for rock materials is a practical and economical approach.

In all regression models, the combination of \({V}_{p}\) and \({I}_{d}\) captures the behavior of \({E}_{\text{s}}\), as shown by the response surfaces illustrated in Figs. 8, 9 and 10. Furthermore, the maximum \({E}_{\text{s}}\) value is obtained at higher \({V}_{p}\) and \({I}_{d}\) values and lower \(n\) values, as expected.

When the \(n\) value is kept constant, the \({E}_{\text{s}}\) value increases as the \({V}_{p}\) value increases. On the other hand, the increase in the \({E}_{\text{s}}\) value with increasing \({I}_{d}\), when the other inputs are held constant, is very subtle. For example, at the same data point [\({V}_{p}\): 4.16 km/s, \({I}_{d}\): 99.5%], the \({E}_{\text{s}}\) values estimated by the SVR and GPR models are around 27 GPa for \(n\): 2.52%, whereas they are around 21 GPa for \(n\): 11.057%. For the ANN model, the \({E}_{\text{s}}\) value estimated at this data point for the lower porosity was also around 27 GPa, but it decreased to around 12 GPa as the porosity increased. As a result, the ANN model is more sensitive to changes in \({V}_{p}\) and \(n\), whereas the SVR and GPR models are more sensitive to changes in \({V}_{p}\) alone. As seen in the response surfaces, the outputs of the models are most sensitive to changes in \({V}_{p}\) and least sensitive to changes in \({I}_{d}\). The sensitivity of the model outputs to the input parameters is related both to the effects of the weathering processes on the rock material and to the characteristics of the soft computing techniques. Because the weathering product content, which has a lower \({V}_{p}\) than fresh minerals, and the micro-fracture voids increase with weathering, \({V}_{p}\) decreases with weathering more regularly and faster than \({I}_{d}\) and \(n\).

KPIs such as VAF, RMSE, MAE, and R2 can be used separately to examine model accuracy, but none is superior on its own; therefore, the PI, which combines these KPIs, was suggested (Yagiz et al. 2012). Subsequently, adjR2 was employed in computing the PI value instead of R2 because R2 is a statistic with systematic error that depends on the number of independent variables in the equation, the sample size, and the coefficient of variation (Ceryan 2014). However, the PI value given in Ceryan (2014) depends only on the RMSE. The RMSE and MAE metrics can range from 0 to ∞ and are indifferent to the direction of errors. Because the errors are squared before they are averaged, the RMSE gives a relatively high weight to large errors; owing to this characteristic, it usually discriminates differences in model performance better than other indices (Chai and Draxler 2014). However, RMSE has important disadvantages because it is a function of three characteristics of a set of errors rather than of one (the average error): it varies with the variability of the distribution of error magnitudes and with the square root of the sample number, as well as with the average error magnitude (MAE) (Willmott and Matsuura 2005). Without the benefit of other information, for example, the MAE, it is impossible to discern the extent to which RMSE reflects the central tendency (average error) or the variability of the distribution of squared errors (Mielke and Berry 2001; Willmott and Matsuura 2005; Willmott et al. 2009). Given its definition, the MAE, unlike the RMSE, is an unambiguous measure of the average error magnitude (Willmott and Matsuura 2005). However, the MAE might be dominated by a large number of average-sized errors without adequately reflecting some large errors (Chai and Draxler 2014). For these reasons, it is more accurate to use KPIs that consider both the errors themselves and the squares of the errors.
With this in mind, a new PI, PIMAE, is proposed here, using the normalized MAE instead of the normalized RMSE. Both PIRMSE and PIMAE were used in this study. When the value of these performance indices is less than 1, the prediction model fails, and the success of the model increases as the value approaches 2. A prediction model is more successful than another if it has larger PIRMSE and PIMAE values. If the PIRMSE is large but the PIMAE is small, other performance criteria should also be considered.

In this study, the REC diagram is drawn using the absolute values of the errors (as in MAE). In contrast, the Taylor diagram is based on RMSE and does not directly reflect the differences between the measured and predicted values. To overcome this deficiency, the REC curve was also used to evaluate the performance of the models developed in this study.

Based on the performance evaluation according to the performance indices and the Taylor and REC diagrams, it is clear that the SVR model developed in this study is the worst model for estimating the \({E}_{\text{s}}\) of the investigated samples. In the training and test periods, the GPR model performs better than the ANN model in terms of the R and centered RMSE components of the Taylor diagram, while the variability (standard deviation) of the \({E}_{\text{s}}\) values obtained with the ANN model is closer to that of the observed \({E}_{\text{s}}\) values. The AOC value, obtained from the absolute values of the errors (as in the MAE), is lower for the ANN model. However, the GPR model performs better than the ANN model in terms of the PIRMSE and PIMAE values.

The SVR, GPR, and ANN models can approximate almost all types of non-linear functions, including quadratic functions. The soft computing models use a “black box” approach and have some difficulties in sharing the methodology with other researchers (Suykens and Vandewalle 1999; Agatonovic-Kustrin and Beresford 2000; Rasmussen 2004; Desai et al. 2008; Ahmadi and Rodehutscord 2017; Ozkat et al. 2017c). The SVR and GPR models are based on the same probabilistic regressive model, while ANN is not a probabilistic model. The GPR and SVR models developed here assume a Gaussian data distribution, but the ANN model does not assume any data distribution. The ANN and GPR models do not have a sparse solution. They use all sample/feature information to perform the prediction. The SVR model has the noteworthy advantage of frequently yielding sparse solutions. It minimizes reconstruction errors through convex optimization, ensuring that the optimal estimate is found, but it is not a unique solution. In the ANN model, the optimization is not always convex, and therefore, the solution is not always a global minimum (Suykens and Vandewalle 1999; Agatonovic-Kustrin and Beresford 2000; Rasmussen 2004; Khandelwal and Singh 2009; Kumar et al. 2013; Samui et al. 2019).

Parametric approaches distill knowledge about the training data into a set of numbers. They require large amounts of data, especially for architectures with many layers, because of the vast number of weights and connections in ANN models. In contrast, Gaussian processes are non-parametric methods. A Gaussian process kernel allows a prior to be specified over the function space, which can be extremely useful, especially when data are scant (Bijl et al. 2017). However, because Gaussian processes are non-parametric, they must take all the training data into account each time they make a prediction, which means that the computational cost of prediction grows with the number of training samples (Agatonovic-Kustrin and Beresford 2000; Rasmussen 2004; Bijl et al. 2017).

The originality and limitations of this study are as follows:

The best subset regression approach was employed to determine the inputs in the proposed prediction model developed. As a result of this analysis, the most suitable parameters were found to be porosity (\(n\)), P-wave velocity (\({V}_{p}\)), and the slake durability index (\({I}_{d}\)). They are frequently used inputs in prediction models (Table 1) and are commonly used to define the state of weathering and in predicting the UCS and \({E}_{\text{s}}\) of weathered magmatic and metamorphic rocks (Ceryan 2018). The measurements of \({V}_{p}\) and \(n\) are non-destructive, repeatable, easy, and economical. The slake durability test is an inexpensive and easy test to conduct and requires very little sample preparation.

The ANN and GPR models developed in this study successfully predicted the \({E}_{\text{s}}\) of the samples investigated. Although there are many ANN models to predict the \({E}_{\text{s}}\) of rock materials, the number of GPR methods is quite low (Table 1). Moreover, according to the literature, there are very few soft computing models developed to assess the \({E}_{\text{s}}\) of magmatic rock material with different degrees of weathering (Table 1). This study provides data and approaches to overcome this deficiency.

This study demonstrates that it is more useful to use criteria based on the square of the errors (e.g., RMSE and the Taylor diagram) together with criteria based on the absolute value of the errors (e.g., MAE and the REC curve, which is drawn from the absolute values of the errors). In this study, a new performance index, PIMAE, which takes MAE into account, was created, and it is beneficial to use this index together with the performance index based on RMSE. The ANN and GPR models given in this study were developed to estimate the \({E}_{\text{s}}\) of magmatic and metamorphic rock samples with different degrees of weathering. These models can be applied to magmatic and metamorphic rock samples containing at least three different degrees of weathering.

Conclusion

This study examined the applicability and capability of the SVR, GPR, and ANN models in predicting the \({E}_{\text{s}}\) of magmatic rocks from NE Turkey with different degrees of weathering. The selection of the inputs used in the models was performed using the best subset regression approach. As a result of these analyses, porosity, P-wave velocity, and the slake durability index, which are commonly used to define the weathering state and to predict the \({E}_{\text{s}}\) of weathered rocks, were selected as inputs for the prediction models developed in this study. Here, the effect of weathering on the engineering behavior of rock material is considered, and it is shown that using porosity, the slake durability index, and P-wave velocity together is a very powerful tool for estimating the elastic modulus of weathered magmatic and metamorphic rocks.

Given the difficulties of RMSE and MAE in expressing the error alone, it is useful to use both the index based on the absolute value of errors and the index based on the square value of error in evaluating the performance of the prediction models. For this, a new PI, PIMAE, is proposed here, using normalized MAE instead of normalized RMSE.

According to the computed KPIs, the Taylor diagram, and the REC curves, it is concluded that the SVR model is insufficient for predicting the elastic modulus of the weathered magmatic rock samples. In the test period, the R2, RMSE, and AOC values obtained for the SVR model were 0.695, 5.662 GPa, and 4.198 GPa, respectively. In addition, the PIRMSE and PIMAE values obtained for the SVR model were lower than 1.0, being 0.6519 and 0.7029, respectively.

During the training and test periods, the results of the performance analysis of the ANN and GPR models using the KPIs, the Taylor diagram, and the REC curves were excellent. In both periods, when the performance metrics were considered one by one, the ANN model was more successful than the GPR model according to the criteria based on the absolute value of the error (e.g., MAE, NMAE, and AOC), in approaching the extreme measured values, and in reproducing the standard deviation of the \({E}_{\text{s}}\) values. The MAE, NMAE, and AOC values of the ANN model for the test data were 2.337 GPa, 0.255, and 1.3822 GPa, respectively, while those of the GPR model were 3.043 GPa, 0.332, and 1.5598 GPa, respectively. Conversely, in terms of R2, RMSE, NRMSE, NS, and VAF, the GPR model performed better than the ANN model; in the test period, the R2 value obtained for the GPR model was 0.898, while that for the ANN model was 0.859. For PIRMSE and PIMAE, which were created by combining multiple KPIs, the performance of the GPR model was better than that of the ANN model: the PIRMSE and PIMAE values of the GPR model for the test data were 1.3779 and 1.4142, respectively, while those for the ANN model were 1.2567 and 1.4139, respectively. In the Taylor diagram, the GPR model also performed better than the ANN model. Moreover, because of its probabilistic and non-parametric nature, the GPR model can be easily simulated and projected.

The performance of the GPR model is slightly better than that of the ANN model, although both the models are successful in predicting the \({E}_{\text{s}}\) of the magmatic rock samples with different degrees of weathering. It would be useful to develop GPR and ANN models with porosity, P-wave velocity, and the slake durability index to predict the \({E}_{\text{s}}\) and UCS of other weathered rock samples, including samples with at least three different degrees of weathering.