Introduction

The deformation modulus of a rock mass is an important parameter for the design and successful execution of rock engineering structures, because the modulus of deformation is the best representative parameter of the pre-failure mechanical behavior of the rock mass (Fattahi 2016; Gholamnejad et al. 2013). There are several approaches to determine the deformation modulus of rock mass directly, by in-situ tests; plate loading test (PLT), like a pressure meter (Chun et al. 2009), plate jacking, cable jack, flat jack, radial jacking and geophysical methods. Although in-situ techniques are the best methods to determine the deformability modulus of rock masses, they are expensive, time-consuming and can only be performed when the exploration spaces are excavated (Gholamnejad et al. 2013). In recent years, the number of empirical approaches used for estimating the deformation modulus of rock masses has increased. The first empirical equation, which considers only rock mass rating (RMR) as an input parameter, was proposed by Bieniawski (1973). The main limitation of Bieniawski’s approach is that it has to be used for rock masses with RMR >50. Serafim and Pereira (1983) proposed an equation for rock masses with RMR <50 to overcome the limitation of Bieniawski’s equation. Hoek and Brown (1997) proposed an empirical equation based on the geology strength index (GSI) and uniaxial compressive strength (UCS) of intact rock. Mitri et al. (1995) and Nicholson and Bieniawski (1990) used two empirical equations to estimate the deformation modulus of a rock mass by reducing the elasticity modulus of the intact rock (E i ) based on the RMR value. Barton (2002) obtained a formula including tunneling quality index (Q) system and UCS. Gokceoglu et al. (2003) introduced an empirical equation based on UCS, rock quality designation (RQD) and weathering degree (WD) of rock. Kayabasi et al. (2003) presented the relation based on WD, E i and RQD. Zhang and Einstein (2004) presented an empirical equation based on E i and RQD. Palmström and Singh (2001) proposed relations based on the rock mass index (RMI) classification system. Hoek and Diederichs (2006) proposed formulas based on GSI and D (factor of disturbance). Sonmez et al. (2004) presented formulas based on Ei, GSI and D.

Also, some research works were carried out using computational intelligence methods to predict rock mass deformation modulus. For example, Sonmez et al. (2006) used artificial neural networks (ANN) for estimation of rock modulus. Majdi and Beiki (2010) used hybrid ANN and genetic algorithms for predicting the deformation modulus of rock masses. Gokceoglu et al. (2004) presented an adaptive network-based fuzzy inference system (ANFIS) model for prediction of deformation modulus of jointed rock masses.

The empirical and computational intelligence methods that are based upon the survey data from various in-situ test methods, in a certain range of rock types, cannot be generalized for various ground conditions. Furthermore, all of above models do not simultaneously consider all the pertinent parameters in the modeling. Under such limitations or constraints, the estimation of the rock mass deformation modulus needs new innovative methods such as the rock engineering systems (RES)-based model, capable of accounting for unlimited parameters in the model. The RES concept has been applied to a number of rock engineering fields, for example, evaluation of the stability of underground excavations (Ping and Hudson 1993), hazard and risk assessment of rockfall (Cancelli and Crosta 1993), evaluation and classification of the spontaneous coal combustion potential (Saffari et al. 2013), development of an assessment system for the blast ability of rock masses (Lu and Latham 1994a), prediction of backbreak in bench blasting (Faramarzi et al. 2013a), rock mass characterization for indicating natural slope instability (Mazzoccola and Hudson 1996), prediction of flyrock distance in surface blasting (Faramarzi et al. 2014), prediction of out-of-seam dilution in longwall mining (Bahri Najafi et al. 2014), assessing geotechnical hazards for tunnel boring machine (TBM) tunneling (Benardos and Kaliampakos 2004), prediction of rock fragmentation by blasting (Faramarzi et al. 2013b), prediction of the advance rate in rock TBM tunneling (Moradi and Farsangi 2014) and quantitative hazard assessment for tunnel collapses (Shin et al. 2009). It has also been widely used for different engineering problems such as forest ecosystems (Avila and Moberg 1999; Velasco et al. 2006), environmental studies on the disposal of spent fuel (Skagius et al. 1997), radioactive waste management (Agüero et al. 2008; van Dorp et al. 1998), river catchment pollution (Matthews and Lloyd 1998), risk of reservoir pollution (Condor and Asghari 2009) and traffic-induced air pollution (Mavroulidou et al. 2004).

In this paper, the RES-based model, capable of accounting for many parameters in the model, was used to estimate the rock mass deformation modulus. To validate the performance of the model proposed, it is applied to field data from road and railway construction sites in Korea. Furthermore, the results obtained are compared with multiple regression models, which are carried out for the same data.

Properties of the database

To establish an RES-based method for prediction of rock mass deformation modulus, providing a dataset which includes a wide geographic distribution is the most important requirement. To achieve this, datasets given in a previous paper are borrowed (Chun et al. 2009). The collected datasets used to construct the database are from road and railway construction sites in Korea. A total of 60 datasets were collected. Each dataset contains the parameters of the RMR system (Bienniawski 1989) [rock quality designation (RQD), the discontinuity density (D d), the condition of discontinuities (D c), the groundwater condition (DW), an adjustment for the orientation of discontinuities (D o)], the measured uniaxial compressive strength (UCS; MPa), the measured deformation modulus (E m; GPa) and the depth (m) of the measurement (Table 1). The E m values were measured using pressure meter tests in most cases and performed at eight field sites which included six rock types such as granite, gneiss, andesite, tuff, sandstone and shale. The primary advantage of the pressure meter test is a relatively low cost compared to other types of tests (Chun et al. 2009). All data were randomly divided into two subsets: 80% of the total data (60 data points) was allotted to training data of RES model construction and 20% of the total data (12 data points) was allocated for test data used to assess the reliability of the developed RES model. For comparisons between RES and regression models, the same training and testing datasets were then used in regression models. the partial datasets used in this study are presented in Table 1. Also, the histograms and statistical evaluations of the data used in this research are shown in Fig. 1.

Table 1 Partial datasets used for constructing the models (Chun et al. 2009)
Fig. 1
figure 1figure 1

The histograms and statistical evaluations of the data used in this research

Statistical modeling

In reviewing the literatures published (Barton 2002; Bieniawski 1973; Gokceoglu et al. 2003; Hoek and Brown 1997; Hoek and Diederichs 2006; Kayabasi et al. 2003; Lagina Serafim and Pereira 1983; Mitri et al. 1995; Nicholson and Bieniawski 1990; Palmström and Singh 2001; Sonmez et al. 2004; Zhang and Einstein 2004), it is clear that many parameters can influence the deformation modulus. However, the most important parameters (D d, RQD, D c, GW, D o, UCS, D), which are easily obtainable, are shown in Table 2.

Table 2 Statistical description of the dataset utilized for construction of the models

At the first stage of analysis, the significance of parameters in the modeling was investigated based upon correlations between the individual independent variables and the actual measured rock mass deformation modulus. Coefficient of determination (R 2) was used as an indicator of correlation strength. R 2 values for independent variables versus rock mass deformation modulus are presented in Table 3. It can be concluded that RQD, D d, D c, UCS and D have negligible effects on rock mass deformation modulus and should be excluded in the regression modeling. Therefore, for further statistical analysis and development of a prediction model, five independent variables were selected (RQD, D d, D c, UCS and D).

Table 3 Relations between individual independent variables and E m (GPa)

Multiple linear regression analysis

A multiple linear regression analysis was carried out between RQD, D d, D c, UCS and D as independent variables and rock mass deformation modulus as a dependent variable, using the commercial software packages for standard statistical analysis (SPSS). Based on the statistical analysis, the predictive model is as follows:

$$ E_{\text{m}} ({\text{GPa}}) = - 6.6787 - 0.2758{\text{RQD}} + 1.0347D_{\text{d}} - 0.2749D_{\text{c}} + 0.0592{\text{UCS}} ({\text{MPa}}) + 0.08938D ({\text{m}}) $$
(1)

A multicollinearity analysis was carried out to check whether two or more independent variables are highly correlated. In the case of multicollinearity occurring, redundancy of the independent variables could be expected, which can lead to erroneous results. One of the most common tools for finding the degree of multicollinearity is the variance inflation factor (VIF). It has a range of 1 to infinity. Generally, if the calculated VIF is greater than 10, there may be a problem with multicollinearity (Jammalamadaka 2003). The VIF values of the independent variables in Eq. (1) were calculated, as shown in Table 4.

Table 4 Multiple linear regression coefficients and collinearity statistics for Eq. (1)

Also, the regression statistics, analysis of variance (ANOVA), for Eq. (1) are shown in Table 5. The model statistic value F and significance (Sig.) are used to provide enough evidence to reject the hypothesis of ‘‘no effect’’. From Table 5, an F of 32.238 and a Sig. of 0.000 (less than 0.05) were obtained, which show that the null hypothesis can be rejected.

Table 5 Analysis of variance (ANOVA) for Eq. (1)

Multiple non-linear regression analysis

Power, logarithmic, polynomial and exponential models with the same independent variables and rock mass deformation modulus as a dependent variable and also the same sets of data were used to carry out non-linear regression modeling. The mathematical equation obtained for the power model with R 2 = 0.8878 is

$$ E_{\text{m}} = 10^{{(0.34122 + 0.014836RQD + 0.023019D_{\text{d}} - 0.0025D_{\text{c}} + 0.00184{\text{UCS}}\,({\text{MPa}}) + 0.001515D\,({\text{m}}))}} $$
(2)

For logarithmic regression modeling, R 2 = 0.6397 is obtained and the relation is

$$ E_{\text{m}} \,({\text{GPa}}) = - 33.4265 + 3.8028\ln ({\text{RQD}}) + 13.9788\ln (D_{\text{d}} ) - 8.5885\ln (D_{\text{c}} ) + 4.2875\ln ({\text{UCS}}\,({\text{MPa}})) + 3.3874\ln (D\,({\text{m}})) $$
(3)

Furthermore, for the polynomial model relation with R 2 = 0.8824 is

$$ E_{\text{m}} ({\text{GPa}}) = 1.9131 + 0.2011{\text{RQD}} + 0.03858D_{\text{d}}^{2} + 0.00005D_{\text{c}}^{3} + 0.0000002{\text{UCS}}^{4} ({\text{MPa}}) + 0.0000003D^{5} ({\text{m}}) $$
(4)

Finally, the exponential relation for regression modeling with R 2 = 0.8878 is

$$ E_{\text{m}} ({\text{GPa}}) = \exp (0.7857 + 0.03416{\text{RQD}} + 0.053002D_{\text{d}} - 0.005746D_{\text{c}} + 0.004227{\text{UCS}}\,({\text{MPa}}) + 0.003488D\,({\text{m}})) $$
(5)

Rock engineering systems (RES)

The RES was introduced by Hudson (1992). It is a method which has the capability of simultaneous analysis of relations among effective parameters of a rock mass, site or structure, and discusses their interactions. For rock mechanics modeling and rock engineering design for a specific project, it is needed to be able to identify the relevant physical variables and the linking mechanisms, and then consider their combined operation. It is important to ensure that all the relevant factors and their interactions will be taken into account (Hudson 1992).

In the RES application, the interaction matrix device is the basic analytical tool and presentational technique for characterizing the important parameters and the interaction mechanisms in an RES. In the interaction matrix for a given RES, all parameters influencing the system are arranged along the leading diagonal of the matrix, called the diagonal terms. The influence of each individual factor on any other factors is accounted for at the corresponding off-diagonal position, named the off-diagonal terms. The off-diagonal terms are assigned numerical values which describe the influence degree of one factor on the other factors. Assigning these values is called coding the matrix.

Several coding methods have been developed for this purpose, such as the 0–1 binary method, the expert semi-quantitative (ESQ) method (Hudson 1992), the explicit method, continuous quantitative coding (CQC; Lu and Latham 1994b) and the probabilistic expert semi-quantitative (PESQ) method were proposed for numerically coding the interaction matrix. The most common coding method is the “expert semi-quantitative” (ESQ) method. ESQ coding has been used in nearly all previous works cited above. In this method, one unique code is deterministically assigned to each interaction, thereby expressing the effect of a parameter on another in the matrix. Typically, coding values vary between 0 and 4, with 0 indicating no interaction and 4 indicating the hyper level of interaction or “critical interaction” (Table 6). The general concept of the influences in a system is described by the interaction matrix, which is shown in Fig. 2. Here, the influence of ‘‘A’’ on ‘‘B’’ is not the same as that of ‘‘B’’ on ‘‘A’’, which means the matrix is asymmetric. Thus, it is important to put the parameter interactions in clockwise direction in the matrix.

Table 6 Expert semi-quantitative (ESQ) method for interaction matrix coding (Hudson 1992)
Fig. 2
figure 2

)modified after Hudson 1992)

Illustration of the interaction matrix: a interaction matrix of two factors, b general illustration of the coding of interaction matrix and the set-up of the cause and effect coordinates

In the interaction matrix, the sum of a row is called the ‘‘cause’’ value and the sum of a column is the ‘‘effect’’ value, denoted as coordinates (C,E, respectively) for a particular parameter. The coordinate values for each parameter can be plotted in cause and effect space, forming the so-called CE plot. The interactive intensity value of each parameter is denoted as the sum of the C and E values (C + E) and it can be used as an indicator of parameters’ significance in the system. That is, the weight for parameter i, indicated by a i , is given by its ‘‘parameter interaction intensity’’ (C i  + E i ) divided by the (total) sum of interaction intensities of all parameters in the system (Hudson 1992).

$$ a_{i} = \frac{{(C_{i} + E_{i} )}}{{\left( {\sum\nolimits_{i = 1}^{n} {C_{i} } +\sum\nolimits_{i = 1}^{n} {E_{i} } } \right)}} \times 100 $$
(6)

An RES-based model to predict rock mass deformation modulus

The principles of RES were used in the vulnerability index (VI) methodology concept, first introduced by Benardos and Kaliampakos (2004), to identify the vulnerable areas that may pose a threat to the TBM tunneling operation. As there is an obvious relation between advance rate and the associated risk encountered, this concept was also used to predict the advance rate in TBM tunneling.

In this research, a similar methodology, inspired by previous works carried out (Faramarzi et al. 2013a, b; Benardos and Kaliampakos 2004) is adopted to define a model and predict the rock mass deformation modulus. In defining the new model, three main steps must be taken into account. The first steps are to identify the parameters that are responsible for the occurrence of risk, analyze their behavior and evaluate their significance (weight) in the overall risk conditions. In this step, the RES principles can be used to assess the weighting of the parameters involved.

In the second step, the VI can be determined (Benardos and Kaliampakos 2004):

$$ {\text{VI}} = 100 - \sum\limits_{i = 1} {a_{i} \frac{{Q_{i} }}{{Q_{\hbox{max} } }}} , $$
(7)

where a i is the weighting of the ith parameter, Q i is the value (rating) of the ith parameter, and Q max is the maximum value assigned for ith parameter (normalization factor).

Based upon the estimated VI and the classification of the VI, which is divided into three main categories with different severity of the normalized scale of 0–100 (Table 7; Benardos and Kaliampakos 2004). In category I, small-scale problems are expected that cannot significantly affect the results. In category II, which must be taken into account. In category III, which might cause several difficulties during the loading and unloading must be considered.

Table 7 Classification of the VI (Benardos and Kaliampakos 2004)

In the third step, a relation between rock mass deformation modulus and VI can be determined. Based upon this new relation, the rock mass deformation modulus for every in-situ test can be obtained having VI.

The most important parameters in the RES-based model

Parameters in Table 8 as well as three descriptive parameters, discontinuity condition, rock properties and groundwater condition, were used to define the RES-based model.

Table 8 The most important parameters used to define the RES-based model

Interaction matrix and rating of parameters

Interaction matrix

The seven principal parameters affecting on the rock mass deformation modulus are located along the leading diagonal of the matrix and the effects of each individual parameter on any other parameter (interactions) are placed on the off-diagonal cells. The assigning of values to off-diagonal cells and coding the matrix were carried out using the ESQ coding method as proposed by Hudson (1992). Based upon the views of three experts, working in the field of rock engineering for many years, the interaction matrix for the parameters affecting the rock mass deformation modulus is established as presented in Table 9. As it can be seen in Tables 3 and 9, the views of three experts (Table 9) have good compliance with individual relations in Table 3.

Table 9 Interaction matrix for the parameters affecting the rock mass deformation modulus

Table 10 gives cause (C), effect (E), interactive intensity (C + E), dominance (CE) and weight of each parameter (a i ). As it can be seen in Table 10, burden has the highest weight in the system, and highly controls other elements. The EC histogram and C + E for each parameter are illustrated in Figs. 3 and 4, respectively. The points below the C = E line are called dominant and the points above the C = E line are called subordinate.

Table 10 Weighting of the principal parameters in the rock mass deformation modulus
Fig. 3
figure 3

CE plot for principal parameters of the rock mass deformation modulus

Fig. 4
figure 4

The C + E values for principal parameters of the rock mass deformation modulus

Rating of parameters

The rating of the parameter’s values was carried out based upon their effect on the rock mass deformation modulus. In total, five classes of rating, from 0 to 4, were considered, where 0 denotes the worst case (most unfavorable condition) and 4 the best (most favorable condition). The rating of each parameter is presented in Table 11. The ranges of parameters in Table 11 were proposed based on the judgments of three experienced experts in the field of rock engineering.

Table 11 Proposed ranges for the parameters impacting the rock mass deformation modulus

Risk analysis and rock mass deformation modulus prediction

A dataset that includes 60 data points was employed in the current study, while 48 data points (80%) were applied to determine the associated VI for each data point using Eq. (7) and the remaining data points (12 data points) were utilized for assessing the degree of accuracy and robustness.

To make the methodology more understandable, an example of determining VI for data point no. 1 is shown in Table 12. Variations in the VI for the 48 data points are shown in Fig. 5. As it can be seen, the mean of VI is close to 40, showing that the level of risk is in the second category (medium–high).

Table 12 Parameter values and the corresponding VI for data point no. 1
Fig. 5
figure 5

VI for 48 data points

Based on the calculated VI and measured rock mass deformation modulus for 48 data points, a logarithmic regression analysis was carried out [Fig. 6 and Eq. (8)] with a coefficient of determination (R 2) of 0.81 being obtained. This relation can be used as a predictive model to predict the rock mass deformation modulus based on VI.

$$ E_{\text{m}} ({\text{GPa}}) = - 17.9\,\ln ({\text{VI}}) + 79.326 $$
(8)
Fig. 6
figure 6

E m–VI predictive model

Evaluation of models performance

12 data points (out of 60 data points) were used and the results obtained are shown in Table 13. Also, for 12 data points, a comparison was made between the predicted and the measured rock mass deformation modulus for different models as shown in Fig. 7.

Table 13 Rock mass deformation modulus predicted by different models for 12 data points
Fig. 7
figure 7

Comparison between the measured and predicted rock mass deformation modulus, a RES-based model, b polynomial model, c multiple linear regression model, d power model, e logarithmic model, f exponential model

To verify the performance of the models, two statistical criteria viz. mean squared error (MSE) and squared correlation coefficient (R 2) were chosen to be the measures of accuracy. Let t k be the actual value and \( \hat{t}_{k} \) be the predicted value of the kth observation and n be the number of observations; then, MSE and R 2 could be defined, respectively, as follows:

$$ {\text{MSE}} = \frac{1}{n}\sum\limits_{k = 1}^{n} {(t_{k} - \hat{t}_{k} )^{2} } $$
(9)
$$ R^{2} = 1 - \frac{{\sum\nolimits_{k = 1}^{n} {(t_{k} - \hat{t}_{k} )^{2} } }}{{\sum\nolimits_{k = 1}^{n} {t_{k}^{2} } - \frac{{\sum\nolimits_{i = 1}^{n} {\hat{t}^{2} } }}{n}}} $$
(10)

The results of performance analysis of different models are shown in Table 14.

Table 14 The results of performance analysis of different models

As it can be seen from the performance indices (Table 14), the RES-based model with R 2 = 0.9245 and MSE = 0.0175 shows the best agreement with the measured rock mass deformation modulus and works better in comparison with other models.

Conclusions

In this paper, a new approach, namely an RES-based model, is proposed for predicting the rock mass deformation modulus. The RES-based model is an expert-based model, which can deal with the inherent uncertainties in the geological systems. Also, it has the advantage of considering unlimited input parameters, which may effect the system. Moreover, it has the merit of considering descriptive input parameters which are not applicable in statistical modeling.

It is concluded that the RES-based model with performance indices R 2 = 0.9245 and MSE = 0.0175 performs better than linear, polynomial, power, logarithmic and exponential models. This study shows that the RES-based model can be used as a powerful tool for modeling of some problems involved in rock engineering.