Introduction

For the design of structures that are built into rock masses (tunnels, dams, underground structures, etc.), it is necessary to know the mechanical and deformation properties of the rock mass. One of the most significant deformation properties is the deformation modulus of the rock mass. The best methods for determining the deformation modulus are in situ experiments: the plate load test and flat jack test (Hoek and Diederichs 2006) and dilatometer test (Wittke 2014). Also, the Institute for the Development of Water Resources Jaroslav Černi developed two more in situ tests for determining deformation modulus within the rock masses: a hydraulic jack method (Kujundzic 1970, 1977) and radial press method (Kujundzic 1970). These in situ tests are expensive, difficult to conduct, and time consuming (Aksoy et al. 2012). Therefore, in recent years many researchers have developed indirect methods and models for the estimation of deformation modulus. Some authors (Bieniawski 1978; Serafim and Pereira 1983; Nicholson and Bieniawski 1990; Hoek and Brown 1997; Kayabasi et al. 2003; Gokceoglu et al. 2003; Chun et al. 2009; Alemdag et al. 2015b; Ajalloeian and Mohammadi 2014; Nejati et al. 2014; Shen et al. 2012) established indirect equations for determining modulus of deformation. These equations are based on statistical linear regression, using the relation between deformation modulus and geotechnical parameters (such as RMR, GSI, UCS, E i ). These correlations are useful for practical application. Geotechnical parameters (i.e. RMR, GSI, UCS, E i ) are usually determined during field investigations in rock masses and are, therefore, available for the estimation of deformation modulus. However, the classification parameters are subjective indicators of the rock mass, as they are obtained on the basis of geological description and not on exact measurements.

With development of soft computing methods such as fuzzy inference system, artificial neural networks (ANNs) and adaptive neuro–fuzzy inference systems further progress of models for estimation of deformation modulus and other geotechnical parameters was made possible. ANNs have shown to be robust technique for predicting many geotechnical engineering problems (Sonmez et al. 2006; Monjezi and Dehghani 2008; Majdi and Beiki 2010; Banimahd et al. 2005; Nejati et al. 2014; Alemdag et al. 2015a). Available literature suggests that soft computing techniques are superior in establishing predictive models in the area of rock mechanics.

Furthermore, some authors (Pappalardo 2015; Brotons et al. 2014; Khandelwal 2013; Azimian and Ajalloeian 2014) have noticed correlations between the velocity of P-waves and other geotechnical parameters (such as UCS, E i , density, porosity) based on laboratory tests. These relations are not easily applicable due to scale effects in rock masses. For example, deformation modulus of rock samples can be more than two times higher than deformation modulus of the rock mass (Chun et al. 2009). The velocity of P-waves in rock masses can also be smaller than the ones obtained in tested samples (Shen et al. 2016). Authors (Kujundzic and Grujic 1966; Song et al. 2011; Shen et al. 2016) have noticed the dependence between deformation modulus and velocity of elastic waves based on in situ measurements. Models suggested by these authors have an advantage over the models based on laboratory performed tests due to the scale effect. Some of the correlations mentioned in the works above are given in Table 1.

Table 1 Correlations between deformation modulus and other geotechnical parameters of rock masses

The aim of the research presented in this paper is to establish new models for the estimation of deformation modulus of rock masses based on correlations between modulus of deformation (\(D\)) and velocity of longitudinal waves (\(V_{p}\)) and the pressure in the rock mass (\(p\)) obtained from in situ measurements. The input parameters used in this study were measured during the construction of the dam Iron Gate 1 HPP. In order to determine more precise method, this study applied two different methods for estimation of deformation modulus: multiple linear regression (MLR) and artificial neural networks (ANNs).

Dataset

The results used for the estimation of deformation modulus in this paper were obtained from the construction of dam Iron Gate 1 HPP. For the analysis, 47 measurements of deformation modulus, velocity of waves, and pressures in the rock mass were obtained. For the modelling, 38 values were used, and for the testing, nine values of measured parameters were used.

This dam is located on the Danube River at the chainage 943 km of the Danube river flow. The dam construction began in 1964, and the first generator units were commissioned in 1970. The dam was built in accordance with the agreement between the former SFR of Yugoslavia and Romania. Both sides, Yugoslav and Romanian, have six generator units each of the individual installed power 1200 MW.

The dam at the Iron Gate 1 HPP is mainly located in metamorphic rocks, such as: biotite gneiss, chlorite gneiss, granogneiss, and chlorite-sericite schists. For defining the deformation properties of the rock masses, in situ tests were conducted using hydraulic jack test (Kujundzic 1970, 1977) which is used for determining the deformation modulus and elasticity modulus of the rock masses. Figure 1 shows the scheme of test site for hydraulic jack with all the elements. Since there is no photo of hydraulic jack test on dam Iron Gate 1, Fig. 2 displays a photo of a sheet metal jack located at the Komarnica Dam, Montenegro. Figure 3 is a photo of a slot in the rock mass where hydraulic jack and a concrete fill were installed, also located at the Komarnica dam. Hydraulic jack tests (Kujundzic 1970, 1977) were conducted in adits on both riverbanks of the Danube.

Fig. 1
figure 1

Hydraulic jack method: 1 slot in the rock mass; 2 flexible flat jack ø 2 m; 3 concrete fill; 4 hydraulic reservoir tube and indicator tube for measurement of average deformations; 5 hydraulic pump; 6 deflection gages for measurement of boundary deformations

Fig. 2
figure 2

Sheet metal jack (example from the investigation works at the Komarnica Dam, Montenegro)

Fig. 3
figure 3

The slot in the rock mass where a hydraulic jack was installed and concrete fill placed (example from the investigation works at the Komarnica Dam, Montenegro)

By applying the pressure in hydraulic jack the deformation in the rock mass is obtained. Mean deformation in rock mass (Kujundzic 1970, 1977) is calculated according to Eq. (1).

$$u_{s} = \frac{a\Delta h}{2A} - \frac{pd}{{2E_{c} }},$$
(1)

where \(u_{s}\) is mean deformation of the rock mass \(a\) is a cross sectional area of hydraulic reservoir tube, \(A\) is area of sheet metal jack and \(\Delta h\) is difference in levels in the hydraulic reservoir tube, \(p\) is the applied pressure in the hydraulic jack, \(d\) is the thickness of the concrete infill, \(E_{c}\) is modulus of elasticity of the concrete. The first member in the Eq. 1 represents the deformation resulting from the change in volume of the jack and the second member represents a correction due to deformation of the concrete.

The test was repeated with several cycles of loading and unloading, and the typical pressure-deformation diagram in rock masses was obtained and is shown in Fig. 4.

Fig. 4
figure 4

Typical diagram of pressure-deformation in rock masses

Based on the pressure-deformation diagram, the values of the deformation modulus can be determined in the rock mass for each cycle of loading. The deformation modulus value is calculated according to Eq. (2) (Kujundzic 1970).

$$D_{i} = 0.54\frac{{A(1 - \nu^{2} )}}{r}\frac{{p_{i} - p_{0} }}{{u_{i} - u_{0} }},$$
(2)

where \(D_{i}\) is deformation modulus of the ith cycle, \(p_{i}\) is pressure at the end of the ith cycle, \(p_{0}\) is initial pressure, \(u_{0}\) is initial deformation, \(u_{i}\) is the mean deformation in the rock mass at the end of the ith cycle, \(A\) is area of sheet metal jack, \(r\) is radius of sheet metal jack and ν is Poisson’s ratio of rock mass.

The diagram given in Fig. 5 is the result of measuring of deformations at the measuring point EGD-2 on the right riverbank of Danube.

Fig. 5
figure 5

Diagram pressure-deformation at the measuring point EGD-2

Loading of the hydraulic jack started with the initial pressure of 0 MPa and it was increased to the determined level, when unloading began down to the pressure \(p_{0}\)(0.2 MPa in this case). Then, the next cycle of loading began at other pressure level and unloaded to the pressure \(p_{0}\). This procedure was repeated through several cycles of loading and unloading until the required maximum stress was reached or until hydraulic jack had cracked.

Measurement of velocity of longitudinal seismic waves can be conducted at the same place where hydraulic jack test is performed by applying a polar method (Kujundzic and Grujic 1966; Kujundzic 1970). Figure 6 displays a diagram of a hydraulic jack and the principle of measuring P-wave velocities around the hydraulic jack. In the borehole located at a distance of 4 m from the slot the explosion caused seismic waves. The waves were measured using geophones placed in the slot where the jack was located. Thus, the polar diagram of velocities around the hydraulic jack was obtained. For the analysis of data, a mean value of spreading velocities of seismic longitudinal waves was used. This method is suitable because it is cost effective, easy to perform and does not require special technical support, and most importantly it is applied in the undisturbed rock mass (Kujundzic and Grujic 1966).

Fig. 6
figure 6

Microseismic method with polar procedure: 1 slot for hydraulic jack, 2 sheet-metal jack, 3 hole, 4 aftershock, 5 geophones, 6 polar velocity graph

Gneisses and schists, the most common rock masses at the site of the dam, can have a very pronounced anisotropy; therefore, the level of deformation characteristics of the rock mass will depend on the direction of loading, normal or parallel to schistosity. Pinto da Cunha and Muralha (1990) presented the research results of the deformation modulus obtained using the flat jack test perpendicular to the schistosity and parallel to the direction of schistosity. Pinto da Cunha and Muralha 1990 also gave the results of dynamic modulus in the rock mass, which were calculated based on the measurements of velocity of longitudinal waves around the flat jack. Based on these results it can be concluded that the average deformation modulus parallel to schistosity is about 20% higher than the modulus perpendicular to schistosity and that the mean dynamic modulus parallel to schistosity is 100% higher than the dynamic modulus perpendicular to schistosity.

As far as dam Iron Gate 1 is concerned, tests of the velocity of propagation of longitudinal waves parallel and perpendicular to the schistosity showed that there is no significant difference in the results (velocities parallel to the schistosity are 6–7% higher than the velocities perpendicular to the schistosity) which indicates that anisotropy at the site of the dam is not significantly present.

In this research, values of deformation moduli and velocity of longitudinal seismic waves were determined from eight measuring points: five measuring points were at the exploratory adits on the right bank of the Danube, and three measuring points were at the exploratory adits on the left bank.

Development of a model for prediction of deformation modulus

Multiple regression analysis

Modelling implies making correlations between the deformation modulus and pressure and wave velocity. The analysis showed that Eq. (3) best describes the dependence between the deformation modulus and velocity of elastic waves under various pressures.

$$D = a\ln (Vp) + b.$$
(3)

Figure 7 displays the results of the dependence of the deformation modulus from velocity under various pressures applied during the test. There is a good correlation between the deformation modulus and velocity of longitudinal waves. The coefficient of determination is in most cases over 0.75. For the purpose of introducing the influence of pressure during testing into the regression model, variables \(a = f(p)\) and \(b = f(p)\) were proposed, where \(a\) and \(b\) are the unknown coefficients in Eq. (3). Equations (4) and  (5) are assumed for calculation of the \(a\) and \(b\) variables:

Fig. 7
figure 7

Relations between the velocity of P-wave and deformation modulus under various pressures in the rock mass

$$a = a_{1} p + a_{2}$$
(4)
$$b = b_{1} p^{2} + b_{2} p + b_{3} .$$
(5)

When Eq. (4) and Eq. (5) are inserted into Eq. (3), Eq. (6) is obtained.

$$D = a_{{_{1} }} p\ln (V_{p} ) + a_{{_{2} }} \ln (V_{p} ) + b_{{_{1} }} p^{{^{2} }} + b_{{_{2} }} p + b_{{_{3} }} .$$
(6)

The given equation can be transformed into Eq. (7):

$$D = a_{o} + a_{1} x_{1} + a_{2} x_{2} + a_{3} x_{3} + a_{4} x_{4} .$$
(7)

The regressive functions are: \(x_{1} = p\ln (V_{p} )\), \(x_{2} = \ln (V_{p} )\), \(x_{3} = p^{2}\) and \(x_{4} = p\), and the Eq. (7) is a multiple linear regression model (MLR).

The main part of defining the model for prediction of the deformation modulus is determination of the coefficients:\(a_{0}\), \(a_{1}\), \(a_{2}\), \(a_{3}\), \(a_{4}\) using regression functions. Determining the coefficients using regressors was conducted in MATLAB software.

The prediction performance of each model is evaluated by calculating the root mean square error (RMSE), mean absolute error (MAE) and the Pearson correlation coefficient (r):

$${\text{RMSE}} = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {(y_{mi} - y_{i} )^{2} } }$$
(8)
$${\text{MAE}} = \frac{1}{N}\sum\limits_{k = 1}^{N} {\left| {y_{mi} - y_{i} } \right|}$$
(9)
$$r = \frac{{\sum\nolimits_{i = 1}^{N} {(y_{mi} - \bar{y}_{m} )(y_{i} - \bar{y})} }}{{\sqrt {\sum\nolimits_{i = 1}^{N} {(y_{mi} - \bar{y}_{m} )^{2} } \sum\nolimits_{k = 4}^{N} {(y_{i} - \bar{y})^{2} } } }},$$
(10)

where \(y_{mi}\) and \(y_{i}\) denote the measured value and the multi linear regression output, respectively; \(\bar{y}_{m}\) and \(\bar{y}\) denote their average values respectively, and \(N\) represents the number of available samples.

Based on the Eq. (7) using the MATLAB software package multiple linear regression analysis was conducted and the coefficients were obtained using predictive functions. The Eq. (11) is a prediction model.

$$D = - 40194 + 700p\ln (V_{p} ) + 5168\ln (V_{p} ) - 30p^{2} - 5765p.$$
(11)

The Eq. (11) is valid in cases where the velocity of longitudinal waves is greater than 2500 m/s. The velocity in Eq. (11) is in units of m/s and pressure is in units of MPa. Deformation modulus is obtained in MPa.

Using the regression model given in the Eq. (11), the values for deformation modulus were calculated for the values of pressure and velocities. The Fig. 8 shows the values of the deformation modulus calculated from the proposed model (Eq. 11) and the values of deformation modulus obtained experimentally which were used for modeling.

Fig. 8
figure 8

Comparison of calculated values of deformation modulus on the basis of regression model (Eq. 11) and experimental values of deformation modulus for the dataset which was used for modeling

The Fig. 9 is a scatter diagram of the calculated values of the deformation modulus from the proposed model (Eq. 11) and experimental values of deformation modulus for the dataset which was used for testing.

Fig. 9
figure 9

Comparison of values of deformation modulus calculated on the basis of regression model (Eq. 11) and the values of deformation modulus obtained experimentally for the dataset which was used for testing

The Table 2 contains the values of RMSE, MAE, and r for the case of training and testing of multiple linear regression model.

Table 2 Values of RMSE, MAE, and r

Artificial neural network modelling

Neural network used in this study is feedforward neural network (FNN), which is widely used as a soft computing method in statistical models because of its good ability of nonlinear mapping and generalization of the functional relationships between input and output variables. FNN has been implemented and designed using Neural Network Toolbox of MATLAB.

An example of a FNN with R input variables, a single hidden layer with \(n\) neurons and one output variable is shown in Fig. 10.

Fig. 10
figure 10

Feedforward Neural Network with one hidden layer and one neuron in the output layer

The output of the hidden neurons of the FNN can be expressed by Eq. (12).

$$v_{i\left( h \right)} = f_{h} \left( {\sum\limits_{j = 1}^{R} {\omega_{ij(h)} p_{j} + b_{i(h)} } } \right),\;\quad j = 1, \, 2, \ldots ,n,$$
(12)

where \(v_{i\left( h \right)}\) is the output of the ith neuron of the hidden layer, \(f_{h}\) is the activation function of the hidden neurons, \(\omega_{ij\left( h \right)}\) is the weight value between the ith neuron in the hidden layer and jth input variable in a neural network model, \(p_{j}\) is the jth input variable, \(b_{i\left( h \right)}\) is the bias value of the ith neurons in the hidden layer, \(n\) is the number of the neurons in the hidden layer.

The most commonly used activation functions of the hidden neurons in the neural networks are logistic sigmoid function and hyperbolic tangent sigmoid function, which can be expressed by Eq. (13).

$$f_{h} (x) = \frac{1}{{1 + e^{ - x} }}.$$
(13)

and Eq. (14).

$$f_{h} (x) = \frac{2}{{1 + e^{ - 2x} }} - 1.$$
(14)

The output of the neural network when the activation function for the output layer is linear can be obtained according to the Eq. (15).

$$y = \sum\limits_{j = 1}^{n} {\omega_{1j(o)} v_{j(h)} + b_{(o)} }$$
(15)

where \(\omega_{1j\left( o \right)}\) is the weight of the jth neuron in the hidden layer to the output neuron, \(b_{(o)}\) is the bias of the output layer.

The most widely used steepest descent algorithm, also known as the backpropagation algorithm (Rumelhart et al. 1986) and the different versions of it can be used to find the optimal values of the network weights and biases (i.e. train the network) in order to minimize an error criterion. Also, the Gauss–Newton algorithm (Osborne 1992) and the Levenberg–Marquardt algorithm (Hagan and Menhaj 1994) are often used. The Levenberg–Marquardt (LM) algorithm essentially combines the steepest descent method and the Gauss–Newton algorithm and is suitable for training small- and medium-sized data modelling problems (Yu and Wilamowski 2011).

There is no general rule for determining the optimum number of neurons in hidden layer and usually it is determined through the trial-and-error method. Different network architectures were developed by varying the number of neurons in the hidden layer and using the logistic sigmoid (logsig) and the hyperbolic tangent sigmoid (tansig) activation functions in the hidden layer.

To select the optimal number of neurons in the hidden layer, networks were trained with 4, 6, 7, and 8 hidden neurons. Also, the efficiency of the models developed using the logistic sigmoid (logsig) function and the hyperbolic tangent sigmoid (tansig) activation function has been analysed based on the performance parameters (RMSE, MAE, r), Table 3.

Table 3 Comparison of different FNN network structure performances

The neural network with the highest r and the lowest MAE and RMSE was chosen (FNN 5 from Table 3). It can be observed that the best FNN contains 7 neurons in the hidden layers.

The weights and biases necessary to implement the adopted optimal FNN are presented in Table 4.

Table 4 The weights and biases necessary to implement the adopted FNN

Adopted ANN model has 2 neurons in input layer, 7 neurons u hidden layer and 1 neuron in output layer.

The Fig. 11 is a comparison of values of deformation moduli calculated with selected neural network and experimental values of deformation modulus for the data set which was used for modeling.

Fig. 11
figure 11

Comparison of calculated values of deformation modulus obtained on the basis of predictive model using FNN and experimental values of deformation modulus for the dataset, which was used for modeling

The Fig. 12 is a scatter diagram of the calculated values of the deformation modulus with selected neural network and experimental values of deformation modulus for the dataset, which was used for testing.

Fig. 12
figure 12

Comparison of calculated values of deformation modulus obtained on the basis of predictive model using FNN and experimental values of deformation modulus for the dataset, which was used for testing

Prediction of deformation modulus using feedforward neural networks (FNN) shows better results than the multiple linear regression models. The Pearson’s ratio is greater when calculated using FNN than using MLR. For the values of RMSE, MAE are smaller for the model based on the FNN than for the one based on MLR. These observations are given in the Tables 2 and  3. Better performance of the dataset for training and testing can also be noticed in the case of FNN based model.

Results and discussion

The results obtained in this study were compared with the results obtained by other authors shown in Table 5 since all empirical models depend on the quality and quantity of data.

Table 5 Estimation model properties proposed by other authors

The values of correlation coefficients in the previously described studies in Table 5 are close to the values obtained from the models described in this paper for both ANN and MLR. In all mentioned research, RMSE values are greater than the value of RMSE in this study. The correlation coefficients between the estimated and measured deformation modulus for MLR were 0.93 for training and 0.83 for testing. The correlation coefficients for ANN models were 0.99 for training and 0.97 for testing. The values of RMSE for training and testing for the MLR model were 0.549 and 0.72, respectively. The values of RMSE for training and testing for the ANN model were 0.258 and 0.394, respectively. Presented results prove that the accuracy of the model established in this study is within the range of accuracy of the models published by other authors. Also, it is noted that there is an intensive use of ANN models for deformation modulus estimation and for the estimation of other geotechnical parameters. Application of ANN model shows greater accuracy than the application of MLR model, which is also the case in this study. Number of datasets used for modelling in this study corresponds well to the number of datasets used by other authors.

As for the number of input variables researchers (Nicholson and Bieniawski 1990; Hoek and Brown 1997) had two input variables in the models, (Pappalardo 2015; Azimian and Ajalloeian 2014; Kujundzic and Grujic 1966) used one input variable for estimation of the modulus, while Kayabasi et al. (2003) used 3 input variables and Gokceoglu et al. (2003) used 4 input variables. Number of variables can affect the accuracy of the model. For example Nejati et al. (2014) analysed the correlation between the deformation modulus and RMR, as well as the correlation between the deformation modulus and parameters: UCS ratings, RQD rating, JSR, JCR, GWR. The correlation coefficient in the first case was 0.67 and in the second case was 0.84. Obviously, the reason for deviation is the number of variables.

In this paper, the authors adopted two input variables: velocity of longitudinal waves and the pressure in rock mass. Compared to the works of Kujundzic and Grujic (1966) and Shen et al. (2016) that dealt with similar issues to this study, another input variable was introduced—the pressure in the rock mass. The pressure is important for the estimation of deformation modulus, because the modulus depends on the level of loading in the rock mass. In relation to the works of Kujundzic and Grujic (1966) and Shen et al. (2016) who used only MLR, the authors of this paper performed analysis using ANNs and MLR.

Suggested models have the disadvantage that they cannot be applied for the estimation of deformation modulus for every rock type. The model from this paper can be upgraded in two ways. The first way is expansion of the input data to different types of rock masses and the second way is increasing the number of different input parameters. In this study, the input parameters were P-wave velocity and pressure; other input parameters that can be introduced for example are UCS or degree of cracks in the rock and the effect of anisotropy where applicable. The authors of this paper did not take into account the effect of anisotropy because it is not significantly demonstrated at the site of the dam as described in Sect. 2.

Conclusions

In this paper the models for estimating the modulus of deformation in the rock mass at the site of the Iron Gate 1 HPP are presented. The estimation was based on the measurements of velocity of longitudinal waves in the same places where hydraulic jack tests were performed. In addition to wave velocities, the values of pressure at which the hydraulic jack tests were executed, were also used. Models described in this study have an acceptable accuracy within the range obtained by other authors in models of a similar type. Models based on the FNN gave better accuracy than the model based on the MLR.

The advantage of the proposed model is primarily that it used in situ measurements of moduli of deformation, velocity of waves and pressures during the experiment. In this way, the effect of scale that exists in the rock masses is taken into account.

Modelling in this study was conducted on a limited number of data. Therefore, this research cannot be generalized for all rock masses, but solely for the area around the dam and the rock masses present in that area.

Measuring deformation modulus in the rock mass is an expensive process, but determining the velocity of waves in boreholes is cheaper, and it is common in geological surveys for construction of dams, tunnels, and other structures. This means that if the speed of longitudinal waves in the relevant cross sections in the rock mass (obtained by geophysical methods) is available, deformation modulus can be estimated based on the developed models for the expected level of stresses in the rock mass.

The polar method used for determining the velocity of elastic waves can be applied with some modifications at measuring points using plate loading test. The plate loading test is a cheaper and simpler test in comparison to the hydraulic jack and it has wider application than hydraulic jack. Hence, using the same principle, the results obtained by plate loading test can be also modelled using relevant measuring of velocity of waves.

The proposed models can be improved and applied on a number of rock masses and they can be used for more general purposes when defining deformation properties of rocks for different structures both during design and construction and during exploitation of structures. Authors have the intention to further the research by adapting the model for application in other rock masses and eventually increasing the number of input parameters.