1 Introduction

One of the important features in the rock mass study of civil and mining project sites is the permeability which is determined using the Lugeon test and is used to assess the groutability of the dam sites. In the construction of dams, water escape occurs more than through the joints and cracks of rocks due to hydraulic pressures of water behind the dam. Lack of attention to the permeability of the sites in recent decades has caused the failure of a number of dams or the main purpose of the dam in terms of water storage and safety of the dam structure has been questioned. In this regard, the assessment of permeability and leakage in the dam site is one of the necessary items in the initial studies of dams (Nonveiller 1989; Houlsby 1990; Fatahi Nafchi et al. 2021; Yin et al. 2022; Yang et al. 2014).

Barton (2004) investigated the relationship between Lugeon, hydraulic joint aperture, and joint spacing. Barton also introduced the Q logging system (i.e., determination of parameters of this classification system in boreholes) to determine the rock mass quality of the dam sites (Barton 2002). Many studies have been conducted to estimate Lugeon based on rock mass characteristics and have been stated that the characteristics of discontinuities have a great effect on rock mass permeability (Table 1). Qureshi et al. (2022) assessed the relationship between rock quality designation (RQD) and Lugeon number. Piscopo et al. (2018) and Chen et al. (2018) developed some experimental equations for estimating Lugeon number based on depth. Shahbazi et al. (2020) summarized the different methods for determining permeability and stated that rock mass quality and joint characteristics are the best parameters to estimate permeability. Morshedy et al. (2019) indicated that with increasing depth, the amount of Lugeon decreases and the quality of the rock mass increases. Kayabasi et al. (2015) used ANFIS and non-linear multiple regression to estimate Lugeon number. Ma et al. (2021) stated that cracks and joints angle is the most important factor affecting permeability and SVR method can accurately predict the rock mass permeability. Results of Matinkia et al. (2022) study, revealed that soft computing approaches such as multilayer perceptron (MLP) and MLP-PSO (particle swarm optimization) are capable to estimate rock permeability with determination coefficient more than 99%. Jamshidi Gohari et al. (2021) used ANN (artificial neural network) to estimate permeability of the carbonate rocks. Shi and Jian (2018) sated that Elman neural network could predict permeability with average relative error lees than 6.25%. Adegbite et al. (2021) used ANN, ANFIS, multiple linear regression (MLR) to predict permeability of the carbonate rocks. Li et al. (2019) estimated curtain grouting efficiency by ANFIS method. Akbarimehr and Aflaki (2019) used ANN and statistical methods to predict rock mass permeability. Zadhesh et al. (2015) estimated permeability at Cheraghvays dam site using ANN and multivariate linear regression (MVLR) approaches. Chen et al. (2021a, b) assessed Lugeon correlation with depth and rock mass classification system. Hariri-Ardebili and Salazar (2020) used RF, SVR, and ANN to assess engineering problems of the dam sites. Huang et al. (2020) estimated concrete permeability using RF method. Rosid et al. (2019) stated that RF is more accurate than Naive Bayes in estimating the permeability of intact rock. Zhang and Cai (2021) predicted carbonate rock permeability using RF algorithm.

Table 1 Proposed empirical relationships to predict permeability

As mentioned above, many studies have been conducted to estimate permeability using some intelligent methods. In previous researches, comprehensive studies with the aim of comparing intelligent methods in estimating rock permeability (especially rock mass) and at the site of dams (especially large dams) have not been used. In this research, six statistical and intelligent methods have been compared to estimate the permeability of rock mass at the site of large dams in Khersan II and Bazoft dam sites. The Q classification system, the joint spacing and the Lugeon number were calculated in 5 m intervals of the boreholes. For this purpose, 175 data related to the results of Lugeon test, depth, Q classification system and joint spacing in 5-m sections of trial and control boreholes dam sites were used. Statistical analysis was performed for assessing the effect of independent variables on the dependent variable. Finally, the performance of SR, MVLR, ANN, SVR, RF and ANFIS methods were evaluated using statistical indicators in estimating Lugeon number.

2 Study areas

The studied dam sites are located in the southwest of Iran in the Karun and Dez catchments and are of arched concrete type. Khersan II dam, with a proposed height of 240 m and a reservoir volume of 2142 million cubic meters, is located in Chahar-Mahal and Bakhtiari province, Iran. This dam is in the design stage. The dam site is situated on the Khersan River with coordinates of 31/25 degrees of north latitude and 50/36 degrees of east longitude in the southwestern region of Iran, in the Zagros Mountains. Asmari and Gachsaran formations form the abutments of the site (Figs. 1, 2). Data from four boreholes used in the analyses were taken from the Khersan II dam site. Bazoft dam with a proposed height of 211 m is also in the investigation stage. The geological formations of the Bazoft dam site include Jahrom and Asmari formations (Figs. 1, 2). Data from eight boreholes used in the analyses were taken from the Bazoft dam site.

Fig. 1
figure 1

Geological map of dam sites of Khersan II (lower) and Bazoft (upper), (Mahab Ghods 2009; Niru 2011)

Fig. 2
figure 2

Geological section of Bazoft dam site (A) and Khersan II dam site (B) (Mahab Ghods 2009; Niru 2011)

3 Methodology

3.1 Water pressure test (WPT)

The WPT was conducted at different sections of trial and grouting boreholes. The test pressures are increased in steps to the maximum pressure and then reduced to the initial pressure. The amount of effective pressure (Pe) and flow value (q) recorded in each pressure step. Then, Pe-q curves were plotted and the flow behavior was analyzed. The Lugeon number (Lu) is calculated using Eq. 1 (Nonveiller 1989).

$$ {\text{Lu}} = \frac{10q}{{P_{{\text{e}}} L}} $$
(1)

In Eq. 1, q is in Li/min; L is section length in m; and Pe is in atmosphere.

According to the flow behavior during the experimental pressures in the WPT, five behavior types are determined for the water flow in the rock mass (Fig. 3), (Kutzner 1985; Shroff and Shah 1999; Xu et al. 2022; Ewert 1997).

  • Laminar flow: The permeability values at five steps are approximately the same regardless of the test pressure. The equivalent permeability for this case is an average of five Lugeon values.

  • Turbulent flow: In this case, the amount of Lugeon decreases with increasing pressure, and the amount of Lugeon at the same pressures is almost the same. The permeability for this condition is the amount of recorded Lugeon by the maximum pressure.

  • Dilation flow: In this group, the amount of Lugeon resulting from the maximum pressure is greater than the amount of Lugeon obtained at medium and low pressures, and the Lugeon at similar pressures are approximately equal. In this case, temporary expansion of the rock joints occurred. High pressures can temporarily open the fractures or compress the joint fillers during the test (Kutzner 1985).

  • Washout flow: Consecutive increases in the amount of Lugeon during the test are a sign of permanent leaching of the joint fillers. Most of the time, this indicates very high test pressures. The representative permeability for this group is the amount of Lugeon obtained in the final step.

  • Void filling: Successive reduction of Lugeon amounts during the test process indicates that the water flow gradually fills the empty cavities and joints and forms them as closed networks. The permeability for this case is the amount of Lugeon obtained from the final step (Fig. 3).

Fig. 3
figure 3

Determination of hydromechanical behavior and Lugeon number (Kutzner 1985; Shroff and Shah 1999)

3.2 Q classification system

In this study, the Q classification system was determined in five-meter intervals of control and trial grouting boreholes using the relevant tables and graphs (Barton 2002; Barton et al. 1974). The Q classification system values are computed using Eq. 2.

$$ Q = \frac{{{\text{RQD}}}}{{J_{n} }}*\frac{{J_{{\text{r}}} }}{{J_{{\text{a}}} }}*\frac{{J_{{\text{w}}} }}{{{\text{SRF}}}} $$
(2)

where RQD is the percentage of cores with a length more than 100 mm in each drilling run (Deere 1989), Jn is the joint number score, Jr is joint roughness score, Ja is joint alteration score, Jw is water flow or pressure score, and stress reduction factor (SRF) is score of strength to stress ratio of the solid rock, swelling or crushing.

The permeability description and Q values based on the classification provided by Evert (1985) and Barton et al. (1974) are given in Table 2.

Table 2 Description of Q and Lugeon values (Ewert 1985; Barton et al. 1974)

3.3 Multilayer perceptron neural network (MPANN)

In terms of learning algorithm and topology, ANN is divided into two classes based on learning algorithm and topology: feed-back neural network (FBNN) and feed–forward neural network (FFNN), (Idrisovich Ismagilov et al. 2020; Sun et al. 2020; Sharifi et al. 2016). In FFNN, the signal is transmitted only in the forward direction. These types of ANNs are simple and common networks and are commonly used for data mining. Perceptron neural networks are a type of FFNN (Rustamovich Sultanbekov et al. 2020; Zhao and Wang 2022).

In multilayer networks, there is an input layer that receives information. Input layer has a number of neurons. In principle, the existence of a hidden layer is useful when it is a nonlinear activation function, and finally, there is an output layer that results from calculations entering it and network output is reached (Abraham 2005; Ostad-Ali-Askari et al. 2017; Suthar 2020). In these networks, after comparing the outputs with real values, the error values are determined. The computed error is then circulated to adjust the weights and values of the network bias, a method called error propagation (Dianati Tilaki et al. 2020; Ghadimi and Ebrahimian 2015; Abraham 2005).

In current study, using MATLAB software, multilayer feed-forward Perceptron neural network was used to estimate Lugeon number.

3.4 The ANFIS

The ANFIS method is very suitable for modeling complex problems that have unknown variables. In classical logic, the membership function value of each member is 1 if it is in the set and 0 if it is not (Jalili et al. 2015; Mahdavi et al. 2015). In contrast, each member of the fuzzy set can have a membership function value between 0 and 1. According to mathematical laws, this is generally referred to as Eq. 3:

$$ A = \left\{ {x;\,\,\mu_{A} (x)} \right\}\left| {x \in x} \right| $$
(3)

Membership function degree indicates the value of the level of dependence of the member on the fuzzy set. Gaussian membership function (GMFs) was used in this study. In various researches, two types of fuzzy systems are commonly used, which include Mamdani and Sugeno algorithms (Sobhani and Safarianzengir, 2020). Fuzzy inference systems (FIS) introduced as basic rule systems, which are made up of a set of linguistic rules and are able to denote any system with high precision which operates like an all-purpose predictor. Rule systems based on fuzzy logic (FL) theory use linguistic variables such as results and rules, where rules are expressed as inference or inequality (Sobhani and Safarianzengir 2020). The rule system based on FL is the if–then base rule system that is denoted by the if rule and the then result. ANFIS is a neuro-fuzzy system that permits fuzzy systems to learn variables by a back-propagation algorithm (Jang 1993; Rashidi Tazhan et al. 2019; Dorfan et al. 2020). In this study, Sugeno-type FIS was used in which each rule is specified as a linear grouping of inputs. The final output of the FIS is a simplification of the given mean weight of each output rule. A Sugeno FIS is a combination of inputs x, y. Hence, an output variable f is driven by two fuzzy rules (Gholami et al. 2020):

$$ \begin{aligned} & {\text{Rule}}\,{1}:\,\,{\text{If}}\,x\,{\text{is}}\,A_{1} ,\quad y\,{\text{is}}\,B_{1} \quad {\text{then}}\quad f_{1} = p_{1} x + q_{1} y + r_{1} \\ & {\text{Rule}}\,{2:}\quad {\text{If}}\,x\,{\text{is}}\,A_{2} ,\,\,y\,{\text{is}}\,B_{2} \quad {\text{then}}\quad f_{2} = p_{2} x + q_{2} y + r_{2} \\ \end{aligned} $$
(4)

The summary of working with ANFIS system in MATLAB software environment is that first the inputs and outputs are introduced to the system and the system performs the learning stage. Then, to validate the model, a system test is performed.

3.5 The SVR approach

Support vector machine (SVM) theory was developed based on Vapnik's theory of statistical learning (Vapnik 1995). The support vector regression (SVR) fits the curve with ε thickness into the data to minimize error of test data. In this model, a set of functions (for example: f(x) = w.x + b) is used to estimate. Where w is the weight vector and x and b are the bias values. The weight vector must be minimized for minimizing test error. SVR uses a new error function to overlook errors that are at a certain distance from the actual data (Jiang et al. 2022; Fallah et al. 2021; Qasem et al. 2019; Tekin 2014; Maleki and Emami 2019). Hence, some deviation from ε must be overlooked. The deviation is described as Eq. 5 and is involved in Eq. 6 with considering \(\xi_{i}^{ + }\) and \(\xi_{i}^{ - }\) deficiency variables. Finally, using the structural error minimization(SEM), the error value is optimized by using Eq. 6.

$$ \left| \xi \right|_{\varepsilon } = \left[ {\begin{array}{*{20}l} 0 \hfill & {\quad {\text{if}}\quad{\mkern 1mu} \left| \xi \right| \le \varepsilon } \hfill \\ {\left| \xi \right| - \varepsilon } \hfill & {\quad {\text{otherwise}} } \hfill \\ \end{array} } \right] $$
(5)
$$ \begin{aligned} & {\text{Minimize:}}\,\frac{1}{2}\left\| w \right\|^{2} + C\sum\limits_{i = 1}^{N} {(\xi_{i}^{ + } } + \xi_{i}^{ - } ) \\ & \varepsilon_{{{\text{Constrains}}}} {:}\,\left[ {\begin{array}{*{20}l} {w \cdot x_{i} + b - y_{i} \le \varepsilon + \xi_{i}^{ + } \quad i = 1,2,. \ldots ,N} \hfill \\ {y_{i} - \left( {w \cdot x_{i} + b} \right) \le \varepsilon + \xi_{i}^{ - } \quad i = 1,2,. \ldots ,N} \hfill \\ {\xi_{i}^{ + } \ge 0,\quad \xi_{i}^{ - } \ge 0\quad i = 1,2,. \ldots ,N} \hfill \\ \end{array} } \right] \\ \end{aligned} $$
(6)

In Eq. 6, \(\frac{1}{2}\left\| w \right\|^{2}\) is the regulatory elements, \(\xi_{i}^{ + }\) and \(\xi_{i}^{ - }\) are variables to make boundaries flexible, N is number of sample, ε is allowable error, C is the complexity balance coefficient to equilibrium the empirical risk with the regulatory elements, and the ε is acceptable error limit. Different kernel functions such as linear, quadratic, radial and polynomial are used in the SVR (Xie et al. 2021a; b; Yang et al. 2020). Usually, the radial kernel function has better performance for predicting. The main reason for choosing this function in the present study was its high generalizability in studies related to rock mechanics and geotechnics (Jiang et al. 2022; Kookalani and Cheng 2021; Mahmoodzadeh et al. 2021; Zhou et al. 2016). The equation of this kernel function is as follows.

$$ k\left( {x_{i} , \, x_{j} } \right) = \exp \left( { - \frac{{\left\| {x_{i} - x_{j} } \right\|^{2} }}{{2\sigma^{2} }}} \right),\quad \sigma > 0 $$
(7)

where σ is radial kernel function width and k(xi, xj) is internal multiplication of variables. The prediction precision using SVR by the radial basis function (RBF) kernel depends on the choice of ε, γ and C. In current research, the SVR based on the RBF was used to estimate Lugeon number of the rock mass using MATLAB software.

3.6 Data normalization

One of the advantages of data normalization is the improvement of gradient descent performance on normalized data compared to abnormal data. Also, the input values are normalized to avoid a very large or very small effect on the network weight (Seyfi 2017; Zhao et al. 2022; Kalteh 2008). In this study, the data were normalized between − 1 and 1 based on Eq. 8.

$$ X_{i} = 2 \, \left( {\frac{{X \, - \, X_{\min } }}{{X_{\max } - \, X_{\min } }}} \right) - 1 $$
(8)

where x is the experimental value, xmin is the minimum data between whole data, and xmax is the maximum data.

3.7 Performance evaluation of models

Correlation coefficient, the mean absolute percentage error (MAPE) (Eq. 9), root mean square error (RMSE) (Eq. 10), and variance account for (VAF) (Eq. 11) were used for evaluating the methods. These criteria have been widely used to evaluate the models and relationships obtained from soft computing approaches (Du et al. 2022; Dong et al. 2021; Zhu et al. 2022; Zhou et al. 2021a, 2021b).

$$ {\text{MAPE\% }} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {\frac{{y - y^{{\prime }} }}{y}} \right|} \times 100 $$
(9)
$$ {\text{RMSE }} = \frac{1}{{s^{2} n}}\sum\limits_{i = 1}^{n} {(y - y^{{\prime }} )^{2} } $$
(10)
$$ {\text{VAF\% }} = 100\left[ {1 - \frac{{{\text{var}} (y - y^{{\prime }} )}}{{{\text{var}} (y)}}} \right] $$
(11)

In relationships 9–11, y is the measured value, y′ is the variable estimated by the relationship, n is total data and s2 is the variance of the samples.

4 Results and discussions

Due to the importance of recognizing the discontinuities in the dam sites, in addition to studying the discontinuities at the ground level, the joints were studied on the cores obtained from the boreholes. The study of exploratory galleries indicates the existence of three major categories of discontinuities (two sets of joints and a bedding system) at the Bazoft dam site and three categories of joints at the Khersan II dam site (Table 3).

Table 3 Identified discontinuities at studied dam sites

4.1 Trial and control boreholes in the dam sites

In the Bazoft dam site, trial grouting operations have been conducted on the abutments (Fig. 4).

Fig. 4
figure 4

Overview of the position of trial grouting panels at and left abutment (A) right abutment (B) of the Bazoft dam site

The trial grouting boreholes at the Bazoft dam site are located above the water table and have been drilled vertically to a depth of 75 m. Simultaneously with the drilling operation in all boreholes, WPTs were conducted at intervals of 5 m to determine the Lugeon and to control the impact of the grouting. At the end of the cement grouting operation, for evaluating the trial grouting effect, in the center of each triangular a check borehole was drilled and Lugeon test was performed in 5-m sections. The depth of control boreholes in the right abutment is 90 m and in the left abutment is 75.20 m. A total of 8 boreholes (6 grouting boreholes and 2 control boreholes) were drilled at the Bazoft dam site.

In Khersan II dam site, according to the topographic conditions of the right abutment, trial grouting has been done on the left abutment. The proposed arrangement of the grouting boreholes is in the form of an equilateral triangle with sides of 3 m (similar to the arrangement of the trial grouting boreholes at Bazoft site). The boreholes of the trial grouting panel were located beyond the water table and were drilled vertically to a depth of 80 m. Simultaneously with the drilling operations by a thin-wall core samples in all boreholes, water pressure WPTs tests were performed at intervals of 5 m. At the end of the grouting operations, a check borehole was drilled at triangle center with a depth of 80 m to evaluate the effect of the grouting, and permeability was tested in 5-m sections. A total of 4 control and grouting boreholes were drilled in Khersan II dam site.

4.2 Rock quality designation (RQD)

Based on the average RQD, all sections of the left, right and riverbeds in the Bazoft dam site are classified in the excellent category. Also in Khersan II dam site, the rock mass is categorized in excellent condition in terms of RQD. In the Bazoft dam site, the average RQD of the left side is more than the right side.

4.3 Permeability and hydromechanical behavior of the sites

Figure 5 shows two examples of rock mass behavior at the depth of 10–15 m of the borehole GSR3 (Bazoft dam site) and CH1 borehole (Khersan II dam site). In these stages, the type of behavior is turbulent and dilation.

Fig. 5
figure 5

Examples of Lugeon test results: Bazoft dam site (top), Khersan II site (bottom)

In the Bazoft dam site, the permeability value more than 60% in the left abutment was 1.5%, while this permeability value was 29% in the right abutment. Lugeon higher than 100 on the left abutment are due to karst development. In this construction, the percentage of impermeable class (0–3) is the highest. In the site of Khersan II dam, frequency percentage of the Lugeon in the range of 0–3 is the maximum value (Fig. 6). This indicates low permeability of the site.

Fig. 6
figure 6

Lugeon distribution in the studied sites

Normally, permeability is expected to be low in sections with high RQD and high permeability in sections with low RQD. In nature, given the complexities involved, other relationships can prevail. Sometimes with increasing RQD, no decrease in permeability is observed. In some sections of the borehole, even with decreasing RQD, the permeability decreases. In general, the following conditions exist between RQD and Lugeon value (Ewert 1997):

  1. (A)

    High RQD and low Lugeon: In such sections, the rock mass has less joints and cracks, or the joints may be filled with fine materials.

  2. (B)

    High RQD and high Lugeon: This condition occurs mostly due to the phenomena of hydraulic failure, elastic opening of joints, crushing and roughness of their surfaces. In this case, high borehole permeability is not due to crushing of the rock mass and it can be related to the presence of a karst channel.

  3. (C)

    Low RQD and high Lugeon: In such cases, the rock mass is full of joints and cracks and the openings of the joints are relatively high and their filling is low.

  4. (D)

    Low RQD and low Lugeon: This condition indicates the filling of joints or lack of hydraulic connection of rock masses.

Lugeon has a direct corelation with RQD in karst sections (Assari et al. 2016; Hiller et al. 2011; Rastegarnia et al. 2017, 2019; White 2002).

Figure 7 shows the types of hydromechanical behavior in the sites of Khersan II and Bazoft dams. In the Bazoft dam site, the predominant flow type is laminar flow with 34% and the lowest percentage is related to turbulent flow with 4%. Also, the dilation flow is about 27%, the washout flow is about 5.5%, the void filling flow is about 17% and 13% is impermeable.

Fig. 7
figure 7

Percentage of the rock mass hydromechanical behavior of the sites

As shown in Fig. 7, the percentage of turbulent and washout behaviors in Khersan II dam site is zero. In this site, laminar flow is 20%, dilation is about 15%, void filling is about 12% and 54% is impermeable. The absence of turbulent behavior indicates the low groutability of the Khersan II dam site. Turbulence and washout flow types indicate that the condition of the rock mass is not suitable in terms of permeability and arrangements should be made for its sealing.

4.4 Statistical properties of variables

The used data for modeling were obtained from 12 trial and control grouting boreholes at the Bazoft and Khersan II dam sites. Table 4 shows the statistical characteristics of the variables used in the analysis. The histogram of the variables is also presented in Fig. 8. The average of Q and Lugeon in these two sites are 31.72 and 15.73, respectively. Based on the average of Lugeon (15.73), it can be said that the rock mass has moderate permeability.

Table 4 Statistical characteristics of the variables used in the analysis
Fig. 8
figure 8

Histogram of variables

In general, if the amount of skewness and kurtosis of the data is outside the distance of 3 to − 3, the data is not normally distributed and the data should be normalized. According to the results, it can be said that the data are normal (Table 4; Fig. 8). On the other hand, since the number of data is more than 25 (175 data), the data can be considered normal according to the Pearson suggestion (Bai et al. 2019; Bai et al. 2022; Wu et al. 2021; Zhou et al. 2021c). Some outlier data were deleted using box plot diagrams and engineering judgments. For example, in sections that Lugeon and RQD are both ascending (for example, both are equal to 100) are considered as karst areas. Most of these sections were corroborated by evidence in engineering geological reports such as drilling rod drop and no core recovery.

4.5 Lugeon relationship with Q classification system, depth and joint spacing

Characteristics of joints such as opening, roughness, continuity, orientation and slope of joints are effective parameters in the degree of permeability of the rock masses (Bell 2000; Liu et al. 2020; Yang et al. 2018; Zhang et al. 2021a, 2021b, 2021c, 2022b; Li et al. 2022a). Therefore, by assessing the relationship between Lugeon and the joint characteristics (i.e., joint aperture, number of joint sets, roughness, and persistence), the permeability of a site can be determined. Relationships of the Q classification system, depth and joint spacing (Js) with Lugeon number in 12 trial grouting and control boreholes in Bazoft and Khersan II dam sites (on 175 data) are presented in Fig. 9. The most accurate relationship is logarithmic function (Fig. 9). Most previous studies have reported logarithmic and exponential relationships between Lugeon and joint properties (Qureshi et al. 2014; Kayabasi et al. 2015; Jiang et al. 2009; Farid and Rizwan 2017; El-Naqa 2001).

Fig. 9
figure 9

Lugeon relationship with Q classification system, joint spacing (Js) and depth

Table 5 shows the evaluation results of bivariate relationships between Lugeon and rock mass characteristics based on different criteria. According to this table, joint spacing has the greatest effect on the Lugeon number.

Table 5 Evaluation of bivariate relationships between Lugeon and rock mass characteristics

4.6 Multivariate linear regression (MVLR) analysis

Table 4 shows developed equations for predicting Lugeon using the MVLR method. Durbin-Watson (DW), variance inflation factor (VIF), MAPE, and determination coefficient were used to appraise the relationships (Table 6). The results of analysis of variance (P value = 0.00) show that the models were appropriately developed. The suitability of the constant values and coefficients was evaluated in more detail by the T-test. The results of this test are presented in Table 7. The DW test is used to examine the independence of errors from each other. The DW value must be between 1.5 and 2. Based on results, the errors are independent from each other and it is possible to use the developed models. Because the DW values are placed between 1.5 and 2.5 (Table 6). The VIF criterion is also used to evaluate the correlation of independent variables. Results of the MVLR shows that with removing the depth (D) variable does not significantly change the accuracy of the model 2. Therefore, model 2 is recommended for estimating Lugeon. As well as Fig. 10 shows that depth has a small effect on the Lugeon value.

Table 6 Developed equations and model summary
Table 7 Coefficients and criteria
Fig. 10
figure 10

The relationship of measured Lugeon with forecasted values

4.7 Evaluation and comparison with previous studies

Several empirical relationships have been developed to predict Lugeon number (Table 1). In this study, using the relationships of previous researchers, Lugeon was estimated. Then, the relationship of estimated Lugeon with the measured Lugeon was investigated (Fig. 10). Results show that there is a good to moderate correlation between measured Lugeon with predicted ones based on previous studies (Fig. 10). This figure also shows the performance of the proposed MVLR model of the present study. The MVLR model with 79% is able to predict Lugeon number of Asmari limestone rocks at Bazoft and Khersan II dam sites. The determination coefficients for each of the evaluated relationships of previous researchers to estimate Lugeon values are presented in Fig. 10. Coefficient of determination from 0.32 for Oge and Çırak (2019) relationship to 0.71 for Jiang et al. (2009) relationship are variable.

Due to the various joint properties of the rocks, there isn’t a simple relationship with very high accuracy (R2 > 0.90) to estimate Lugeon number based on rock properties. Similar results have been presented by other researchers (Kutzner 1996; Ewert 1997; Nia et al. 2017). For this reason, in this study, intelligent methods for estimating Lugeon number were used. Many problems in geology are so complex that their separation and study are possible only by soft computational methods. Because these methods use technology and knowledge to build an intelligent system to solve complex problems (Davis 2002; Kayabasi et al. 2015; Li et al. 2022b). Previous studies showed that soft computing methods have higher accuracy than statistical methods for predicting rock mass properties (Kayabasi et al. 2015; Rahimi et al. 2019; Zadhesh et al. 2015; Shahbazi et al. 2020).

4.8 Multilayer Perceptron Neural Network (MPNN)

Today in the field of engineering, the neural networks have a good performance for predicting and modeling (Golmohammadi et al. 2014; Gholami et al. 2015; Mohan et al. 2021; Tabatabaei and Salehpour Jam 2017). The MPANN contains of three layers such as input layer, hidden layer and output layer. The number of neurons in the input layer depends on the number of independent variables (Alizadeh et al. 2022; Sun et al. 2021; Liu et al. 2021a). The output of the layer performances as a dependent variable and its number. The hidden layer as the interface leads to the process of computing the output (Rath et al. 2021; Li et al. 2017; Liu et al. 2021b; Çevik and Tabaru-Örnek 2020). In current study, different neurons were tested to achieve optimal results. The optimal MPNN contains of ten neurons in a hidden layer. The Q classification system and joint spacing (Js) were considered as inputs and Lugeon number was output. The whole data was categorized as three sets including training (70%), testing (15%) and validation (15%). For teaching the algorithm and obtaining the weights for the preferred outcomes the training group was used (Kavyanifar et al. 2020; Ghalandari et al. 2019; Zhang et al. 2022a; Meng et al. 2022). The validation set is used to ensure that the network does not depend on the training data set (Al-Masaeed et al. 2021; Ansari and Hashemi 2017; Shamsashtiany and Ameri 2018). The test set is used to test the network in predicting new data (Sanaei et al. 2015; Liu et al. 2021c; Zheng et al. 2021). The trained model should be verified by an independent set of experimental data (Moshahedi and Mehranfar 2021; Rastegarnia et al. 2021; Wang et al. 2022; Yang et al. 2022; Zhang et al. 2020). The performance of the Levenberg–Marquardt (LM) training algorithm for predicting output using inputs was examined by MATLAB software. Sigmoid and Purelin transfer functions were chosen for hidden and output layers, respectively. Figure 11 shows the optimum MPNN structure used in this study.

Fig. 11
figure 11

The optimum MPNN structure used in this research (with ten neurons)

Figure 12 shows the mean square error (MSE) variations by the LM algorithm in the optimum results. The lowest MSE was obtained at the first epoch for predicting Lugeon (Fig. 12). Also, Fig. 12 shows the correlation coefficient between Lugeon and inputs using the optimal MPNN model (Fig. 13).

Fig. 12
figure 12

The error trend using MPNN in the optimum model (in the tenth neurons)

Fig. 13
figure 13

The correlation coefficients using MPNN in the optimum model

4.9 ANFIS results for predicting Lugeon number

In this study, using ANFIS method as a combination of fuzzy logic with neural networks, an attempt was made to estimate Lugeon number. Prior to modeling using ANFIS, the data were categorized in training and testing sets with 75% and 25% of the whole data, respectively. To train the model in ANFIS method, the combined method of recursive error propagation with the least squares was used. In this research, obtained ANFIS models are based on the Sugeno method. The GENFIS2 command was used to form the ANFIS model using a differential clustering method. The input data for modelling include Q-system and joint spacing (Js) and the output parameter is Lugeon (Fig. 14). In this model, the membership functions (MFs) of input data for each of the variables are 4 and the MFs of the output data whose output is Lugeon, are shown in Fig. 14. In the input membership function (inputmf) layer, inputs pass through membership functions. The membership function degree indicates the membership level of the member to the fuzzy set (Suthar 2020).

Fig. 14
figure 14

ANFIS model structure, number of inputs, outputs and rules used

Rule viewer inference diagram is a MATLAB technical computing environment used to display fuzzy inference diagrams. This tool is used to identify and display the rules and how the functions affect the final results (Moshahedi and Mehranfar 2021) (Fig. 15). Each row corresponds to a rule, the number of each rule is shown on the left side of the charts, and each column corresponds to a variable.

Fig. 15
figure 15

Rule displayer (there are 4 rules for predicting data relationships)

The best performance of the ANFIS model was obtained after 3500 training sessions with clustering radius of 0.30 (Table 8). Figure 16 shows GMFs for Lugeon using ANFIS model.

Table 8 Modeling features using ANFIS and SVR
Fig. 16
figure 16

Generated GMFs by Sugeno-FIS method for ANFIS model input variables

The ANFIS model led to the formation of 4 rules for Lugeon, which has the best answer among the ANFIS models with a differential clustering method. Figure 17 shows the correlation of the ANFIS model in the training stages. This figure shows that the ANFIS model can explain 87% of the Lugeon's changes by the Q classification system and joint spacing. A coefficient of determination greater than 60% indicates that the independent variables have largely been able to explain the changes in the dependent variable (Taylor 1990).

Fig. 17
figure 17

Correlation between real and predicted Lugeon by ANFIS model

The error histogram using ANFIS method is shown in Fig. 18. This shows the difference between the forecasted Lugeon using ANFIS method and actual value of Lugeon.

Fig. 18
figure 18

The error histogram using ANFIS method

4.10 The SVR Results

In this study, to train and test the models using radial basis function (RBF), 75% and 25% of the total data were used, respectively. The obtained optimal parameters of ε, γ and C to estimate Lugeon were 0.01, 92 and 80, respectively. Error histogram and relationships between measured and predicted Lugeon using SVR method for all data are demonstrated in Fig. 19. By comparing the results of the SVR method with other methods used in the present study, it is observed that SVR shows higher accuracy than other methods. Numerous studies have shown that on a small amount of data the SVR model has a high accuracy in predicting the dependent variable due to the use of structural risk optimization principle (SROP), (Al-Anazi and Gates 2012; Moghaddam et al. 2020). The SROP seeks to minimize the high limit of generalization error. The SVR solution can also be turned into a global optimum, while MPNN approaches tend to be a local optimal solution. Therefore, over fitting rarely occurs in the SVR method (Kim 2003; Luo et al. 2022a, b; Zhan et al. 2022; Chen et al. 2021a, b).

Fig. 19
figure 19

SVR results for predicting Lugeon number

4.11 The RF Results

The RF algorithm, which is a learning method based on a group of decision trees, was used to estimate the permeability of the dam sites. In this algorithm to form each tree, a different set of existing patterns are selected considering the replacement of each selected pattern. The size of the selected category will be equal to the total number of available patterns (Breiman 2001). RF was presented by Breiman (2001) as a method of a new development of decision trees, which combines the prediction of several individual algorithms together using rules-based. The general principles of group training techniques are based on the assumption that their accuracy is higher than other training algorithms (Kotsiantis and Pintelas 2004; Luo et al. 2022a, b).

In the RF method, bagging is used to create training data. This is done through random resampling of the original data set with replacement. In this step, none of the data selected from the input samples is removed to produce the next subset, and thus the variance is also reduced. Therefore, some data may be used more than once in training branches, while some other data that are not effective in modeling were never used. Therefore, more stability is obtained for the model and it makes the model more reliable against slight changes in the input data and increases its prediction accuracy (Breiman 2001). The set of samples that are not selected in the bagging process in the training of trees is included in a subset called Out-of-Bag (OOB) patterns. This part of the RF can be used to evaluate the performance of the model (Peters et al. 2007). In this way, RF can calculate an internal uncorrelated estimate of the generalization error without using external data subsets. The general trend of the RF algorithm is mentioned in various sources (Koohestani et al. 2022; Soleimannejad et al. 2018; Parsakhoo et al. 2016) In this research, the modeling process was done using the R software package (R 4.2.1), (Liaw and Wiener 2002). 75% of the samples (131 cases) were used for training the model and 25% of the samples (44 cases) were used to evaluate the constructed models. In order to determine the number of selected variables in each tree node as well as the number of trees, the tenfold cross-validation method was used. According to this method, 400 trees and two variables in each node have provided the most favorable conditions for the model. After modeling, the model was evaluated by experimental data. The importance of the input parameters is obtained based on the Gini importance index and permutation importance (Mantas et al. 2019). Accordingly, joint spacing is more important than the Q classification system. The RF model makes predictions using the OOB error value, and in this method, data is not used for testing (Chehata et al. 2009). Figure 20A shows the prediction results of the RF model against the observational data for permeability, which shows a high correlation between the estimated and observed permeability. Also, the distribution of the error values resulting from the RF model is shown in Fig. 20B. It can be seen that the accuracy of this model is the highest after the SVR method.

Fig. 20
figure 20

RF results for predicting Lugeon number

4.12 Comparison of results

Comparison of results based on various criteria shows that correlation coefficients of the used methods and previous works are higher than 50% (Table 9). Various criterial have been used by researchers to check the validity of the relationships and models (Fayaz et al. 2022; Srinivasareddy et al. 2021; Hassanzadeh et al. 2021; Wu et al. 2018; Saghi et al. 2019; Sui et al. 2020). Correlation coefficient higher than 70% and between 70 and 50% are considered as strong and moderate correlation, respectively (Taylor 1990). Precision of the used methods in current study is as following: MVLR < MPNN < ANFIS < RF < SVR. Also, results of most of the previous works show strong correlation.

Table 9 Comparison of the used methods and previous works

5 Conclusions

In this study, the relationships of the Q classification system, joint spacing and depth with Lugeon number were investigated in five-meter sections of the boreholes in Bazoft and Khersan II dam sites, west of Iran. Based on the average Lugeon and Q classification system of both sites, the rock mass was classified as moderate permeability category in terms of Lugeon and good category in terms of strength (Q value). The SVR, MPNN, ANFIS, RF, MVLR, and simple regression methods were used to predict Lugeon based on Q-system and joint spacing (Js). The performance of the empirical relationships and models was evaluated using various criteria. Due to the use of structural risk minimization principles, the SVR (with R = 0.97) using radial basis function showed higher accuracy than other methods to estimate the Lugeon value. Precision of the used methods is as MVLR < MPNN < ANFIS < RF < SVR. Comparison of results with previous researches showed that there is a good to moderate correlation between measured Lugeon with predicted Lugeon based on previous studies. Analysis of all model criteria (R2, RMSE, MAPE, DW, analysis of variance, t-test, normality, VIF and VAF) using simple and multiple regression disclosed that it is possible to forecast the Lugeon number using the Q classification system and joint spacing.