Introduction

Hydraulic conductivity is one of the most important hydrogeological variables through which many characteristics of water flow in porous media can be evaluated. The movement of water inside the soil is affected by initial humidity, degree of saturation and physical properties, including soil grain size distribution. It is very important for modeling water flow in the soil and is an important parameter for drain design in the construction of earth dams and embankments (Odong 2007). In addition, hydraulic conductivity is very important in some geotechnical problems such as seepage, settlement calculation and stability analysis (Boadu 2000).

With the increasing development of large cities, the need to build massive engineering projects such as the Urban Train increases. Proper management and control of groundwater at the site of such large structures is essential. Therefore, evaluation of parameters such as hydraulic conductivity that affect the flow of water in porous media is important for the optimal and reliable design of such structures, and its determination is one of the main objectives in geotechnical studies. Many parameters affect the hydraulic conductivity, such as soil texture properties including pore size, grain size distribution, grain shape, grain density, etc. (Engler 2010). Accordingly, this research focuses on grain size distribution.

Different techniques have been proposed for determining the hydraulic conductivity including field and laboratorial methods as well as calculations based on experimental formulas (Chapuis 1990; Ankeny et al. 1991; Chandel and Shankar 2021). Pumping test, slug test and packer test are among the field methods. So far, three different methods have been presented for determining this variable in the field for the slug test method (Cooper et al. 1967; Bouwer 1989; Hvorslev 1951) and several common methods have been proposed for the pumping test (Theis 1935; Cooper and Jacob 1946; Chow 1952; Neuman 1975). Also, five well-known methods including Fair and Hatch (1933), Carman (1956), Shepherd (1989), Alyamani and Şen (1993) and Hazen (1892) have been presented to determine this variable in the laboratory. Sperry and Peirce (1995) after evaluating the capability of the presented laboratorial methods and experimental formulas, it was concluded that the best method in this context is the experimental formula Hazen (1892) except for environments having particles with very irregular shapes and, in general, the experimental formula Alyamani and Şen (1993) is closer to the real results. Although these methods have various advantages such as recognizing subsurface conditions by drilling, they are time consuming and costly and require more manpower. Also, the variables measured by these methods have inherent uncertainty due to the nonlinear behavior and heterogeneous and anisotropic conditions in hydrogeological environments. In addition, all these recent methods were the result of solving the equations governing groundwater flow by different methods and with boundary conditions and constraints or experimental formulas. With the development of science and technology in the field of hydrogeology, AI models for determining hydrogeological variables have also expanded. Among the various AI methods, FL, SVM and WANN methods can be mentioned. In the last few years, various studies have been performed to estimate the hydrogeological variables using FL (Morankar et al. 2013; Tayfur et al. 2014; Nadiri et al. 2017; Khatibi and Nadiri 2021) and ANN (Schaap and Leij 1998; Chitsazan et al. 2015; Andalib and Nourani 2019; Faloye et al. 2022) methods. Also, review studies conducted on the estimation and prediction of hydrological and hydrogeological variables such as runoff, water level changes, storage coefficient, porosity, transmissibility, surface water and groundwater quality, river sediment transport, etc. have confirmed the success of AI methods such as SVM (Nourani and Andalib 2015; Hosseini and Mahjouri 2016; Eini et al. 2020; Gharekhani et al. 2022; Nadiri et al. 2022) and WANN (Nourani and Andalib 2015; Sharghi et al. 2019; Alizadeh et al. 2017; Li et al. 2022).

Hydrogeological variables such as hydraulic conductivity do not have constant and certain values ​​and are associated with uncertainty in most cases (Tayfur et al. 2014; Javadi et al. 2021; Singh et al. 2022). However, FL (Table 1) (Tayfur et al. 2014; Sihag 2018) ANN (Sun et al. 2011; Sihag 2018; Faloye et al. 2022) and Neuro Fuzzy (NF) (Hurtado et al. 2009; Tayfur et al. 2014; Sezer et al. 2010; Faloye et al. 2022) methods have been used to estimate the hydraulic conductivity.

Table 1 Summary of previous studies on hydraulic conductivity

Ross et al. (2007) estimated the hydraulic conductivity via fuzzy logic analysis of grain size data with high accuracy. They defined the membership degree for soil samples based on the expert opinion, which may be another source of the subjectivity. In recent years, hydraulic conductivity has been estimated based on grain size data by using individual AI models that apply optimization algorithms (Table 1) (Erzin et al. 2009; Sedaghat et al. 2016).

Considering the inherent, unique and different abilities of each of the individual AI models, multiple models have been considered. Nadiri et al. (2013) presented the combined SCMAI method for the prediction of fluoride concentration. Tayfur et al. (2014) conducted a study to estimate the hydraulic conductivity using SICM model. Using electrical conductivity and saturation thickness, these researchers estimated the hydraulic conductivity and transmissibility of Tasuj plain through the use of FL, ANN and NF. Then they optimized the output values ​​of each individual model, using ANN. Saemi and Ahmadi (2008) applied a composite model that was a combination of NF and Genetic Algorithm models to predict permeability. They used borehole data in their study. Their methodology was as follows: after running the NF model, they used the genetic algorithm to optimize the parameters and reduce the error containing data in NF model.

Hydraulic conductivity has been estimated based on grain size data by using composite artificial intelligence. Rogiers et al. (2012) estimated the hydraulic conductivity in the Neogene aquifer, northern Belgium, using the GLUE-ANN composite method based on grain size data. Kashani et al. (2020) predicted hydraulic conductivity in Tabriz plain using several AI models. These models including Multivariate Adaptive Regression Splines (MARS), M5 model Tree (M5Tree), Support Vector Machine (SVM) and Extreme Learning Machine (ELM) were. Then the results of these models were integrated and optimized with ANN model for more accuracy. Silt, clay, bulk density and electrical conductivity were used as model inputs to predict of hydraulic conductivity. The previous studies are summarized in Table 1.

In this study, SCMAI method was used to estimate the hydraulic conductivity in Tabriz Urban Train. Considering that the SCMAI model used in this study applies the output of individual AI models, LSSVM, LFL and WANN, the advantages of each of these methods are used to estimate the desired variable, which is hydraulic conductivity. ANN method is applied to combine the output of AI model as a nonlinear combiner method. Therefore, this method is the accumulation of the advantages of methods that are performed individually. Because each of the individual models has a unique strength, the estimated values ​​of the individual models are combined in the SCMAI model. To the best of our knowledge, SCMAI method has not been adopted to estimate the hydraulic conductivity by using the grain sizes of soil samples and this research is the first try to predict hydraulic conductivity based on the soil grain size using SCMAI method.

Materials and methods

Study area

The city of Tabriz is located in East Azerbaijan province in northwestern Iran (Fig. 1). Tabriz Urban Train consists of 5 lines (4 main lines and one suburban line) measuring 100 km long. Line 1 is in use and line 2 is under construction. The other 3 lines are being studied and researched. In the current research, lines 1 and 2 were studied. Line 1 of Tabriz Urban Train starts from the southeast of Tabriz and then leads from the central part of the city to the southwest. Line 2 of Tabriz Urban Train starts from Gharamalek area in the west of Tabriz and reaches Baghmisheh neighborhood after passing through Ghods, Jomhouri, Daneshsara, and Abbasi streets. It then extends to the east and ends after passing through Marzdaran district near Tabriz International Exhibition Center (TURO 2019). The information used in this research has been obtained from Tabriz Urban Train Organization.

Fig. 1
figure 1

City of Tabriz and the position of the boreholes in Tabriz Urban Train

Data analysis

The study area is located on line 1 of Urban Train in the central part of Tabriz. Five boreholes are located along Timsar Mohagheghi Street from Amin Forked Road to the former Terminal Square, and 15 boreholes are located on Imam Street from Daneshgah Square to Saat Square. After studying and thoroughly examining the borehole data, from among these 20 boreholes, we used the data from 15 boreholes on line 1 of Urban Train for this study. The length of the studied area along ​​line 2 of Tabriz Urban Train, which includes almost the entire length of line 2, is about 22 km. Geotechnical studies of this line included 133 boreholes, from among which the data of 34 boreholes were used in this study after a complete review of the information of all boreholes. From 15 boreholes of Line 1 and 34 boreholes of Line 2, 51 and 43 soil samples were selected at different depths, respectively. A total of 94 soil samples were selected from 49 boreholes and used. The position of the boreholes in Tabriz Urban Train is shown in Fig. 1.

The soil samples were selected from depths at 3m intervals. Depths of the boreholes along line 1 are at least 30 and at most 41 meters, and these depths along line 2 are 20 and 45 meters, respectively.

The purpose of this study was to predict the hydraulic conductivity using multiple artificial intelligence based on grain size distribution. In this research, the grain size data was the input of the individual models, and hydraulic conductivity was the output. Grain size data were obtained from hydrometer analysis, and grain size distribution tests were performed on the samples taken from different depths of the boreholes according to AASHTO T27 standard. D30, D60 and D80 were determined according to the grain size distribution curves. Hydraulic conductivity determination tests were carried out by LEFRANC test with constant and step drawdown pumping in the boreholes. 94 values ​​of horizontal hydraulic conductivity were used in this research.

After examining the different modes for the number of data in the training and testing steps (trial and error), in the best case, 70% of the data was allocated for the training step and the remaining 30% for the testing step. The maximum and minimum hydraulic conductivity values are 0.00284 and 0.000000108 cm/sec, respectively. The average hydraulic conductivity value is 0.000146 cm/sec. The maximum values of D30, D60 and D80 are 0.94, 9.38 and 27.5 mm, respectively, their minimum values are 0.006, 0.076 and 0.152 mm, respectively, and their average values are 0.123, 0.845 and 3.142 mm, respectively (Table 2) (TURO (2019)).

Table 2 Statistical specifications of input and output parameters

The data selection method for training and testing steps was random so that the statistical characteristics of both clusters would be almost similar. In addition, the maximum and minimum values ​​of the data were included in the training step data, because artificial intelligence models have strong interpolator and weak extrapolator capabilities. LFL, LSSVM and WANN models were used in this study to estimate the hydraulic conductivity. The output of these three individual models were then introduced into the multilayer perceptron neural network as input, and the neural network was used as the combiner of the new inputs.

Artificial intelligence methods

Fuzzy Logic (FL)

The basis of fuzzy theory was first introduced by Zadeh (1965), which in recent years has shown a high ability in reducing the estimation error compared to the probabilistic method. Fuzzy Logic has a multi-valued view of events, which is contrary to explicit or divalent logic, ​​in which everything is either right or wrong. In fact, the fuzzy method is appropriate for reducing the estimation and human error compared to other reliable theories (Chiu 1994; Nikravesh et al. 2003).

Hydrogeological data usually have an inherent estimation error and are not considered to be explicit and error-free data. For example, obtaining hydraulic conductivity from the LEFRANC test (pumping) has a noteworthy error. The membership function is a function by which input data are fuzzified. That is, each input to the fuzzy system is converted to a number at an interval of zero to one. Membership functions are defined for both output and input data. There are different types of membership functions, including Sigmoidal, Gaussian, Triangular, Trapezoidal, etc. In general, there are two discussions of modeling and fuzzy clustering in fuzzy logic. Fuzzy modeling is used to estimate the hydrogeological parameters such as porosity, hydraulic conductivity and transmissivity. Fuzzy modeling can be implemented in three fuzzy methods of MFL (Mamdani 1976), SFL (Sugeno 1985) and LFL (Larsen 1980). The difference between the SFL method and the other two methods is in their output. The membership function of the fuzzy system output data in the SFL method is a linear or constant relationship obtained by clustering method. The MFL output membership function can also vary depending on the type and nature of the data and the study, e.g., the Gaussian membership function. The first step in creating a fuzzy model is to classify the data so that different clustering methods can be used in terms of the type of the fuzzy model used. The subtractive clustering method is used for the SFL model, and the Fuzzy C-Means (FCM) method is used for the MFL and LFL methods.

Each fuzzy model consists of three main steps: (i) data fuzzification, which is done by defining the membership function; (ii) creating a connection between input and output, which is also done by a series of rules such as if-then; and (iii) the final step, which is the step of system review, the result aggregation and defuzzification. This is done by fuzzy operators including “AND”, “OR”, and “NOT”. The AND operator also behaves as minimization (min) and weighting factor (prod), and the OR operator as maximization factor. The LFL method was used in the present study.

Support Vector Machine (SVM)

Support Vector Machines (SVM) were introduced in 1995 by Vapnik and Cortes. SVM is an algorithm that finds a specific type of linear model that produces the maximum hyperplane margin. Maximizing the hyperplane margin leads to maximizing the decomposition between the clusters. The nearest training points to the maximum hyperplane margin are called support vectors, and only these vectors are used to define the boundary between the clusters. The decision-making rules that are defined and decompose the binary decision classes by an optimal plane are according to the Eq. (1) (Cortes and Vapnik 1995):

$$y=sign \left(\sum\nolimits_{i=1}^{n}{y}_{i}{a}_{i} (x.{x}_{i})+b\right)$$
(1)

Where y is the output of the equation, yi is the value of the training sample class Xi and “.” represents the internal multiplication.

The vectors \(x=[ {x}_{1},{x}_{2},\dots ,{x}_{n}]\) represent an input data and the Xi vectors are the support vectors. Also, unlike neural networks, SVM does not have the problem of getting stuck in the local minima of the error function (Hong 2011).

The problem-solving steps in the support vector machines algorithm, like the neural network’s algorithm, are divided into two stages of training and testing. Types of SVM models include Support Vector Classification, Support Vector Regression, Least Squares Support Vector Machine, Linear Programming Support Vector Machine, and Nu-Support Vector Machines. The Least Squares Support Vector Machine (LSSVM) was used in this study. LSSVM is a modified SVM model and is an implementable machine learning method for classification and regression (Suykens et al. 2002). The LSSVM method uses linear Karush-Kuhn-Tucker (KKT) equations instead of the SVM quadratic programming problem. SVM uses a quadratic loss function without any rules, which leads to poorer estimation; therefore, weighted LSSVM is taken in order to prevent this, and in cases where the small weights are assigned to the data, the training method is presented in two stages (Shabri and Suhartono 2012). A training set of N data \({\left\{{x}_{i}, {y}_{i}\right\}}_{i=1}^{N}\) is considered where \({X}_{t}\in {R}^{d}\) is the input data and \({y}_{t}\in\) R is the output data.

Different kernel functions can be used according to the type of studies. There are different types of kernel functions such as Linear, Sigmoidal, Radial Basis and Polynomial. RBF kernel has been used in this study because this type of kernel has the highest efficiency in hydrological models (Suykens and Vandewalle 1999).

Wavelet-ANN Model (WANN)

Artificial Neural Network (ANN)

Neural networks have the ability to recognize a dynamic nonlinear system without any presuppositions in the modeling process. Neural network architecture in most predictive engineering problems consists of a three-layer system that includes the input layer, the hidden layer, and the output layer. In this structure, the input layer first processes the input data for transfer to the hidden layer. Then the hidden layer calculates the appropriate weight coefficients using transfer functions such as the Hyperbolic tangent or logic function before sending the information to the output layer. Three-layered network is based on a linear combination of the input variables, which is transformed by a non-linear activation function. This structure of the network is called the feed forward network because the communication between the neurons is such that they connect from the input layer to the hidden layer and from this layer to the output layer, but they do not communicate with each other within one layer of the neurons (Nourani and Andalib 2015).

Wavelet function

Wavelet Function is a function that has two important features: oscillatory and short-term; ψ(x) is the wavelet function if and only if the Fourier Transform ψ(ω) satisfies the following condition (Mallat 1998):

$${\int }_{-{\infty }}^{+{\infty }}\frac{\left|\left({\upomega }\right)\right|}{{\left|{\upomega }\right|}^{2}}d{\upomega }<+{\infty }$$
(2)

This condition is known as the Admissibility condition for the ψ(x) wavelet. Eq. (2) can be considered equivalent to the following equation:

$$\left(0\right)={\textstyle\int_{-\infty}^{+\infty}\left(\mathrm X\right)dx=0}$$
(3)

This property of the function with mean of zero is not very restrictive, and many functions can be called wavelet function based on it. ψ(x) is the mother wavelet function in which the size and position of the functions used in the analysis change with the two mathematical operations of Translation and Dilation during the analyzed signal (Mallat 1998).

$${}_{a,b}\left(x\right)=\frac{1}{\surd \text{a}} \left(\frac{x-b}{a}\right)$$
(4)

Finally, the wavelet coefficients can be calculated at any point in the signal (b) and for any value of the dilation (a) with Eq. (5) (Mallat 1998).

$$\text{C}\text{W}\text{T} (\text{a}, \text{b}) = \text{W} \text{f} (\text{a}, \text{b}) = \frac{1}{\sqrt{a}} {\int }_{-{\infty }}^{+{\infty }}f\left(x\right)\left(\frac{x-b}{a}\right)dx={\int }_{-{\infty }}^{+{\infty }}f\left(x\right){}_{a,b}\left(x\right)dx$$
(5)

There are many types of wavelet functions, the most important and widely used of which are the Daubechies, Haar, Symlet, Morlet, Mexican hat, Coiflet and Meyer Wavelet Functions (Mallat 1998). The different types of these functions have been described in the MATLAB software toolbox.

In general, WANN method is used in the simulation and prediction of signals. That is, the signal is converted into several sub-series using wavelet decomposition, one of which is the estimate or background of the main series, and the rest of the sub-series are the details. Then, by considering these sub-series as the input of the neural network, the signals are predicted and analyzed.

In this study, the data were decomposed according to their number in two and three levels. The results showed that with the decomposition level 2, more desirable results are obtained. Accordingly, each of the input signals were decomposed into two sub-signals using discrete wavelet transform. Also in this study, db2, db3, db4 and db5 were used to analyze the signals, and db4 was selected due to the better results obtained. In addition, the mathematical relationships related to hybrid wavelet-artificial intelligence models have been explained in detail in the research works of Nourani et al. (2014).

Supervised Committee Machine Artificial Intelligence (SCMAI)

Individual LSSVM, WANN and LFL models were used to implement this model. Each of these models was implemented individually; then their output was optimized using artificial neural network. That is, the output of individual models, which was the same as hydraulic conductivity, was used as the input of the Artificial Neural Network. To implement the individual models, grain size data (D30, D60 and D80), which are effective factors in the hydraulic conductivity, was used as input for individual models. In this study, the Supervised Committee Machine Artificial Intelligence (SCMAI) model, which uses an Artificial Neural Network has been presented, as a nonlinear combiner instead of simple averaging and weighted averaging methods. The mathematical expression of the SCMAI model is:

$${\widehat{k}}_{i}=A {I}_{i} \left( {D}_{30}, {D}_{60}, {D}_{80}\right)$$
(6)
$${O}_{j}={f}_{1}({b}_{j}+ {\sum }_{i}{W}_{ji}{\widehat{k}}_{i})$$
(7)
$${O}_{k}= {\widehat{k}}_{SCMAI}={f}_{2} ({b}_{k}+{\sum }_{J}{W}_{kj} {O}_{j})$$
(8)

Where \({\widehat{k}}_{i}\) represents the output of individual AI models, which is considered as the i-th input for Artificial Neural Network. f1 and f2 represent the Activation Functions for the hidden layer and the output layer, respectively. oj is the j-th output of the nodes in the hidden layer. bk, bj, and wkj, wji are the biases and weights of the output layer and the hidden layer, respectively. ok is the final output of the SCMAI ​​model. Weights and Biases are optimized with the Levenberg-Marquardt (LM) Training Algorithm. Figure 2 shows the general framework of the SCMAI model.

Fig. 2
figure 2

General framework of the SCMAI model

Evaluation criteria

In order to measure the accuracy of the models in making predictions, the validation of the models is performed. Various criteria are used for this purpose, two of the most widely used of which were applied in this study. DC or R2 (Determination Coefficient) is calculated according to Eq. (9) and RMSE (Root Mean Squared Error) according to Eq. (10) (Legates and McCabe 1999):

$$DC={R}^{2}=1-\frac{\sum _{i=1}^{n}({Q}_{i}-{{\widehat{Q}}_{i })}^{2}}{\sum _{i=1}^{n}({Q}_{i}-\stackrel{-}{Q}{)}^{2}}$$
(9)
$$RMSE=\sqrt{\frac{1}{n}\sum\nolimits _{i=1}^{n}({Q}_{i}-{\widehat{Q}}_{i }{)}^{2}}$$
(10)

Where DC (R2), RMSE,\({Q}_{i},\) \({\widehat{Q}}_{i , }\stackrel{-}{Q }\) and n are determination coefficient, root mean squared error, observed data, computed values, mean of observed data and number of observations, respectively. The best result obtained when the value of Eq. (9) gets closer to one, and the value of Eq. (10) get closer to zero.

Results

LFL

To implement the LFL model, first the MFL model was created based on the inputs and outputs. The FCM clustering method was used in the MFL model. The optimal number of clusters was determined by trial and error. In this method, the input and output membership functions were Gaussian. The LFL model was implemented after applying the MFL model based on its results. The LFL method is similar to the MFL method with the main difference of using the product operator for the fuzzy implication, which scales the output fuzzy set. To achieve the best result, the LFL model was implemented with the same clusters after implementing the MFL model with the clusters 3, 6, 9, 12, 14, 15 and 18. The best results for training and testing steps were obtained with 15 clusters. Figure 3 shows the relationship between clustering and RMSE of the testing step, where the lowest value for RMSE is obtained in clustering 15. This value for RMSE is 0.000236 cm/sec. In the LFL model, R2 and RMSE were 0.85 and 0.000144 cm/sec in the training step and 0.64 and 0.000236 cm/sec in the testing step, respectively (Table 3 and Fig. 5a).

Fig. 3
figure 3

Relationship between clustering and RMSE of the testing step

Table 3 Results of artificial intelligence models

LSSVM

As mentioned earlier, in SVM implementation, first the SVM type was determined (LSSVM). Next, the regulator values (γ) ​​and the Kernel variable (σ) were determined. These values ​​were determined by coding in MATLAB software and then optimized using trial and error method. In this study, it was found that the optimal determination of γ and σ values ​​is very effective on the results of the SVM model. The best values ​​for γ and σ variables were 2.8 and 7.9, respectively, which led to the optimal result for the SVM model. From among the types of Kernel Functions, namely RBF, Poly, Lin and MLP functions, RBF type kernel was selected in this research. In this model, R2 and RMSE values ​​in the training step were 0.88 and 0.000129 cm/sec, respectively, and in the testing step they were 0.64 and 0.000238 cm/sec, respectively (Table 3 and Fig. 5b).

WANN

In WANN model, data preprocessing is performed by wavelet transform and then the results of this processing are used as the input of ANN model. The efficiency of the WANN model depends on the variables such as the type of Wavelet Function, the decomposition level, and the number of hidden neurons. In this study, trial and error method was used to determine the type of Wavelet Function for decomposition input signals, and it was observed that among the wavelets, the db4 Wavelet Function as the mother wavelet was more accurate than the other functions. The mother Wavelet Functions studied in this study were db2, db3, db4 and db5. The reason that the db4 mother wavelet performs better than the other db’s may be due to its wavelet form, which is in good agreement with the data series analyzed in the study. In this study, the data were decomposed at 2 and 3 levels. The results showed that better outcomes can be achieved at decomposition level 2 due to the number of data (Fig. 4). After decomposing the data with db4 and at decomposition level 2, the decomposed data are entered into the Multilayer Perceptron Neural Network. Trial and error method was used to select the number of the hidden neurons, and it was observed that the best results were obtained when the number of the hidden neurons was considered to be 3. When the sub-series enter the neural network as input, the neural network assigns a specific weight to each of the decomposition sub-series. For example, the neural network assigns a higher weight to d2 at decomposition level 2, because d2 has the highest dependence on the main signal of hydraulic conductivity compared to other sub-signals and has a significant role in estimating the hydraulic conductivity. In this model, R2 and RMSE of the training step were 0.88 and 0.000128 cm/sec, respectively, and in the testing step, 0.69 and 0.000222 cm/sec, respectively (Table 3 and Fig. 5c).

Fig. 4
figure 4

Evaluation of mother wavelet functions for decomposition levels 2 and 3

SCMAI

After implementing three individual models, the output of these individual models, which is the hydraulic conductivity, was used as the input of the SCMAI model such that each of these models was implemented individually, and their output was combined and optimized using ANN. The LM Training Algorithm was used in ANN, which was used as a nonlinear combiner of the individual models. The reason for this selection was its acceptable accuracy and speed. TANSIG and PURELIN were selected as the Transfer Functions for the hidden layer and output layer, respectively. Epoch type was used for the training process. After implementing the SCMAI model, it was observed that the results of estimating the hydraulic conductivity became more desirable than those of the individual models and this model had a much higher efficiency. In SCMAI model, R2 and RMSE of the training step were 0.92 and 0.000103 cm/sec, respectively, and in the testing step they were 0.87 and 0.000141 cm/sec, respectively (Table 3 and Fig. 5d).

Fig. 5
figure 5

Comparison of computed and observed (K) using a LFL; b LSSVM; c WANN and d SCMAI

Discussion

In this study, it was observed that the SCMAI model performed the hydraulic conductivity estimation with high accuracy and that better results were obtained compared to the results of the individual models. For example, the maximum R2 and the minimum RMSE of the testing step are related to the individual models of WANN model, which are equal to 0.69 and 0.000222 cm/sec, respectively. These values for the SCMAI model were 0.87 and 0.000141 cm/sec, respectively, which indicate the significant promotion of the estimation of the hydraulic conductivity compared to those of the individual models (Table 3). Also, comparison of the results of this study showed that the two individual models LFL and LSSVM show approximately similar results in predicting hydraulic conductivity in the testing step (Table 3). In the training step, among the three individual models, the WANN model, having R2 = 0.88 and RMSE = 0.000128 cm/sec, has provided better results than the other two individual models. At this step, among the two individual models LFL and LSSVM, the LSSVM model performed better than the LFL model. In LSSVM and LFL models, R2 and RMSE of the training step were 0.88 and 0.000129 cm/sec and 0.85 and 0.000144 cm/sec, respectively (Table 3).

For visual comparison, proximity of the computational values of all the four models (three individual and one SCMAI) to the observational values of the testing step was plotted in curves (Figs. 6a-e). According to Figs. 6(a and e), it was observed that the degree of proximity of the observed and computed values ​​of the hydraulic conductivity had increased in the SCMAI model, and the measurement error had decreased as a result. This important result showed that the SCMAI model can significantly improve the computed values ​​obtained from individual models. As a result, instead of using an individual model, several individual models can be used simultaneously in the studies related to estimating geotechnical variables such as hydraulic conductivity, and more desirable results can be obtained by combining their outputs. Also, by carefully examining Figs. 6(a and e), it was concluded that the SCMAI model was able to bring 13 of the 28 computed values ​​of the testing step closer to the observed data (Points 2, 3, 4, 5, 6, 7, 9, 11, 17, 20, 21, 22 and 28) (46% of the values). It means that in these 13 points, the SCMAI model has been able to perform better than all three individual models in bringing the computed values closer to the observed data. That the proximity of the observational and computational values ​​was better observed at 2 points. The observational and computational values at point 2 of the curve ​​were 0.00150 and 0.00153 cm/sec, respectively, and at point 28 of the curve they were 0.00143 and 0.00139 cm/sec, respectively. In 7 of the remaining 15 points, the SCMAI model has performed better than the two individual models (points 1, 12, 13, 14, 15, 19 and 24) (25% of the values). In 5 of these 15 points, this model has performed better than an individual model (points 8, 10, 16, 26 and 27) (18% of the values). Only in 3 of these 15 points, all three individual models have performed better than the SCMAI model (points 18, 23 and 25) (11% of the values). In general, it can be said that the SCMAI model has been effective in bringing about 89% of the computed values closer to the observed data.

Fig. 6
figure 6

Comparison of curves of computed and observed K (cm/sec); a observed with LFL, LSSVM, WANN and SCMAI; b observed with LFL, c observed with LSSVM, d observed with WANN and e observed with SCMAI

According to Fig. 6a, it was observed that although the SCMAI model provides better results in estimating the hydraulic conductivity, each of the individual models used in this study have certain strengths and advantages over the others. For example, the computed values ​​of the LFL model are closer to the observed data ​​than those obtained from the other models at 4 points (12, 13, 19 and 27). Also, this number in LSSVM model is 4 points (8, 10, 14 and 16) and in WANN model, which renders better results than the other individual models, this number is 7 points (1, 15, 18, 23, 24, 25 and 26) (Figs. 6a-d).

Using the SCMAI model, Nadiri et al. (2013) predicted the groundwater fluoride concentration in Mako Plain. For this purpose, they used NF, MFL, SFL and ANN models and used ANN as a nonlinear combiner. The results obtained by these researchers showed that among the individual models, the MFL model has lower accuracy than the other models. In contrast, ANN model has a higher efficiency. These results were optimized by SCMAI and better results were obtained compared to each model. In another study conducted by Tayfur et al. (2014), the SICM model was implemented to estimate the hydraulic conductivity in Tasuj Plain. In this study, NF, MFL, SFL and ANN models were used as individual models and ANN was used as a nonlinear combiner.

The difference between the present study and the previous studies is that LFL, WANN and LSSVM models were used in this study as the latest methods in estimating hydraulic conductivity, and ANN model was used as a nonlinear combiner. The estimation values by these models were optimized by SCMAI model, and better results were obtained (Table 1). Another difference between the present study and the study of Tasuj plain (SICM model) is that in this study, hydraulic conductivity was estimated based on grain size distribution, which is one of the important parameters in permeability, while this variable was estimated from geophysical data such as electrical conductivity in Tasuj plain (Table 1).

Conclusion

The SCMAI model was used in this study. This model combined the output of individual AI models to estimate the hydraulic conductivity in Tabriz Urban Train (lines 1 and 2) such that the outputs of LSSVM, WANN and LFL models were entered into ANN, and a new hydraulic conductivity was estimated as the output. This study showed that by using this model, more desirable results can be achieved in estimating the hydraulic conductivity than the results of the individual models used in this study. SCMAI model led to a significant improvement in the results due to using the advantages of all the individual models so that the R2 testing step of the SCMAI model was promoted by about 26% compared to the R2 value of the testing step of WANN model. Also, the RMSE of this model decreased by about 36% compared to that of WANN model. Also, in the testing step, the SCMAI model has an increase of about 36% in the R2 value compared to both LSSVM and LFL models. And the RMSE of this model has decreased by about 41% compared to LSSVM model and by about 40% compared to LFL model in the testing step. The WANN model provides better results among the three individual models. Also, LFL and LSSVM models provide almost similar results in estimating the hydraulic conductivity and proximity of observational and computational values in the testing step.

Another result of this study was that although the individual models provided less R2 and more RMSE than SCMAI model, each of them was closer to the observed data ​​due to their inherent abilities in some parts of the curve (compared to SCMAI model).

This study showed that SCMAI model, with the ability to use the capabilities and combine the results of different models, can be used to estimate other hydrogeological variables such as storage coefficient, transmissibility, etc. In addition, for other studies, it is suggested that in addition to grain size data, other parameters affecting hydraulic conductivity can be used to achieve hydraulic conductivity in Tabriz Urban Train. It is also possible to use other individual AI models along with the three individual models used in this study and to use a nonlinear combiner other than ANN for combining the individual models.