Introduction

A significant portion of the precipitation that falls onto the earth's surface enters the subsurface through infiltration. The infiltrated water passes through the unsaturated zone before reaching the groundwater table. The movement of water through the unsaturated–saturated zone is highly complex since the moisture content of the soil changes within this zone. In order to study how water moves from the ground surface to the groundwater aquifers, it is necessary to develop a model that replicates the flow phenomena in unsaturated–saturated zones. Using numerical models to study groundwater flow, solute transport, and groundwater management has become essential over the past few decades. With the increased use of groundwater for irrigation and domestic purposes, the importance of such models has increased drastically. As such, it is necessary to incorporate the soil and hydraulic parameters to develop an accurate numerical simulation model along with natural boundary conditions at the field scale. The hydraulic parameters are those that define the relationship between hydraulic conductivity (K), volumetric water content (ϴ), and pressure head (h). Such parameters are measured or estimated based on different experimental and empirical relations. It is, however, difficult to measure some of these parameters at the desired field or laboratory scale. In practice, if the hydraulic properties of the aquifer are unknown, these must be estimated using hydrogeologic data by the model calibration process.

The model calibration process has recently gained significant attention (McLaughlin and Townley 1996). However, hydraulic parameter identification or inverse problem involves using a mathematical or numerical model to identify hydraulic parameters from field or laboratory observations (Hyun and Lee 1998). In the subsequent step, the soil and hydraulic parameters are estimated by clubbing the numerical and optimization models. The parameters are estimated by satisfying the objective function, which minimizes the error function between the observed and predicted hydraulic heads (Dane and Hruska 1983; Kool et al. 1987; Yeh 1986). The observed hydraulic head was obtained from the field study, while the simulated head was obtained by running the numerical simulation model. The optimization model uses various algorithms to provide new solutions to attain the objective function. Estimating parameters in unsaturated flow studies have traditionally been carried out using gradient-based classical optimization methods (Eching and Hopmans1993; Kool and Parker 1988; Šimůnek and Van Genuchten 1996). However, due to the nonlinear behavior of the response function, they sometimes fail to find the optimal global solution to the problem. Woodbury and Ulrych (2000); Woodbury and Rubin (2000) applied a full-Bayesian approach using both Bayesian and maximum entropy to estimate transmissivity from the hydrostatic head and transmissivity measurements viewpoints. A simulation–optimization-based model was developed using a meshless local Petrov–Galerkin method and particle swarm algorithm to estimate saturated flow parameters (Swathi and Eldho 2018). This model predicted only one or two parameters at a time among hydraulic conductivity or transmissivity and specific storage. The model, however, could not provide conclusions about its suitability for different groundwater systems. Another model was developed to estimate the storage coefficient, transmissivity, and leakage factor by using pumping test data in one-dimensional confined and leaky confined aquifers (Ayvaz and Gurarslan 2019).In many groundwater studies, stochastic optimization techniques, such as Pattern Search, Genetic Algorithms, or Simulated Annealing, have been used to reach the optimal global solution. These models were developed to estimate parameters in groundwater aquifers (Huang et al. 2008; Şahin 2018; Samuel and Jha 2003).All such models independently estimated the soil and hydraulic parameters for the unsaturated zone or the saturated zone. Thus, an effective parameter estimation model is yet to develop to estimate the unsaturated and saturated flow parameter together in a single model.

This study proposes a methodology to estimate the unsaturated and saturated flow parameters together in a single inverse optimization model. As such, the numerical simulation model needs to be developed by considering both the unsaturated and saturated zone. Due to the presence of an unsaturated zone in the study domain, the groundwater flow model becomes highly nonlinear. Thus, it becomes computationally expensive to combine this simulation model with the optimization algorithm. This is because the simulation model will be called as many times as the number of population sizes, leading to time-consuming computations. In order to overcome this limitation, an alternate simulator should be used in conjunction with the optimization model to estimate the flow parameters. In the field of civil and environmental engineering, artificial neural networks (ANNs) have shown successful results in mapping complex nonlinear relations (Flood and Kartam 1994). The groundwater flow model developed by Balkhair (2002) could estimate transmission coefficients and storage coefficients using trained neural networks. Also, as a result of back propagation, training of multilayer perceptrons, complex relationships, such as rainfall-runoff processes, have been successfully modeled in hydrology and water resources (Smith and Eli 1995), and water quality parameters have also been forecasted (Maier and Dandy 1996).

There are many problems associated with parameter estimation models, including nonlinearity, non-uniqueness, and instability (Carrera and Neuman 1986). Non-identifiability of solutions occurs when a solution cannot be found with the proposed technique. Whereas multiple solutions that satisfy imposed constraints are indicative of the problem of non-uniqueness of solutions. Such types of problems can be solved using meta-heuristic algorithms, and those algorithms are effective for solving inverse optimization problems as well. One such efficient meta-heuristic algorithm is the Shuffled Frog Leaping Algorithm (SFLA). This algorithm solves highly nonlinear non-convex problems using a population-based metaheuristic and a memetic approach. It was designed the way that an army of frogs searched for food in a swamp. For a better search, they leap onto the nearest possible rock and communicate with each other. Consequently, they develop a strategy that allows them to gather the most food in the least amount of time. An optimization algorithm designed to replicate this process is called the Shuffled Frog Leaping Algorithm (SFLA). A combination of Particle Swarm Optimization (PSO) and Shuffled Complex Evolution (SCE) are the principles behind this algorithm. This algorithm is relatively very fast compared to the traditional meta-heuristic evolutionary genetic algorithm (Gandhi and Bhattacharjya 2020).

All the optimization models available in the literature estimated the flow parameters for unsaturated and saturated zone separately, whereas, in real field problems, there may be situations where both the unsaturated and saturated flow parameters have to be considered together in modeling. Thus, to overcome this limitation, this paper proposes an effective parameter estimation model to estimate both the unsaturated and saturated parameters together using Shuffled Frog Leaping Algorithm in conjunction with the simulation model. However, coupling the flow simulation model with the optimization algorithm for the entire computational domain requires more time. As such, an alternate simulator developed by using Artificial Neural Networks (ANN) that replicates the groundwater simulation model is used to reduce the computational time. In addition, it was found that the input values significantly affect the model’s outputs. Therefore, Sobol’s global sensitivity analysis based on variance decomposition is used to determine the most relevant flow parameters associated with the groundwater flow model.

Materials and methods

Estimation is performed by minimizing the error function between the observed and simulated hydraulic heads. The observed hydraulic head is obtained from the field study, and the simulated head is obtained from the groundwater simulation model. Initially, the numerical simulation model is developed to study the groundwater flow considering both the unsaturated and saturated zone. The governing equation that is used to develop the groundwater flow model is discussed below.

Flow equation

The three-dimensional unsaturated and saturated groundwater flow equation is the modified form of Richards’ equation given by Dogan and Motz (2005).

$$\frac{\partial }{\partial x}\left( {K_{xx} \left( h \right)\frac{\partial h}{{\partial x}}} \right) + \frac{\partial }{\partial y}\left( {K_{yy} \left( h \right)\frac{\partial h}{{\partial y}}} \right) + \frac{\partial }{\partial z}\left( {K_{zz} \left( h \right)\frac{\partial h}{{\partial z}} + K_{zz} \left( h \right)} \right) + q_{e} = C(h) + S_{w} S_{s} \frac{\partial h}{{\partial t}}$$
(1)

where, θ is the water content; h is the pressure head [L]; Kxx, Kyy, and Kzz are the hydraulic conductivity along x, y, and z directions, considering the coordinate system as the principal directions of the hydraulic conductivity tensor [L T−1]; qe represents pumping or recharge rate [L1 T−1]; C(h) is the specific moisture capacity (L−1), Sw is the saturation ratio, Ss is the specific storage [L−1]; and t represents the time.

Constitutive relationship:

From the above equations, it is observed that the specific moisture content C(h), hydraulic conductivity K(h), and ϴ(h) are nonlinear, which makes the equation more complex. To overcome this nonlinearity, the model uses the constitutive relationship given by Van Genuchten and Nielsen (1985).

Constitutive relation for K(h):

For h < 0

$$K_{r} = \frac{K\left( h \right)}{{K_{s} }} = \left( {1 + \beta } \right)^{{ - \frac{5}{2}\left( {1 - 1/n} \right)}} \left[ {\left( {1 + \beta } \right)^{(1 - 1/n)} - \beta^{(1 - 1/n)} } \right]^{2}$$
(2)

For \(h\ge 0\)

$$K_{r} = \frac{K\left( h \right)}{{K_{s} }} = 1$$
(3)

Constitutive relation for C(h):

When \(h\le {h}_{0}\)

$$C(h) = \frac{{(n - 1)\left( {\theta _{s} - \theta _{r} } \right)\left| h \right|^{{n - 1}} }}{{\left| {h_{a} } \right|^{n} \left( {1 + \beta } \right)^{{2 - 1/n}} }}$$
(4)

When \(h>{h}_{0}\)

$$C(h) = 0$$
(5)

Constitutive relation for θ(h):

When \(h\le {h}_{0}\)

$$\theta \left( h \right) = \theta_{r} + \left( {\theta_{s} - \theta_{r} } \right)\left( {1 + \beta } \right)^{{\left( {{1/n} - 1} \right)}}$$
(6)

\(h>{h}_{0}\)

$$\theta \left( h \right) = \theta _{r} + \left( {\theta _{s} - \theta _{r} } \right)\left( {1 + \beta _{0} } \right)^{{1/n - 1}} + S_{S} \left( {h - h_{0} } \right)$$
(7)

where, \(\beta = \left| {\frac{h}{{h_{a} }}} \right|^{n}\), ha is the air entry pressure [L], n is the fitting parameter in the moisture retention curve, or,\(\beta_{0} = \left| {\frac{{h_{0} }}{{h_{a} }}} \right|^{n}\), h0 is a parameter depending upon the Specific storage (SS). When, h ≥ ha, the Eq. (1) solves for the saturated flow condition, i.e., C(h) = 0, K(h) = Ks, Sw = 1, and when h < ha, then the Eq. (1) solves for the unsaturated flow condition. Then C(h) ≠ 0, K(h) is the function of pressure head, Sw < 1 and Ss = 0.

This study uses the block-centered finite difference form to solve Eq. (1). In order to develop the model, the sum of inflows into and out of a unit volume of aquifer must be equal to the rate of change in the volume of storage within the cell. Since the modified form of Richard’s equation is highly nonlinear, Picard iteration method is adopted at each time step to overcome the nonlinearity. Using the numerical scheme and applying the necessary boundary condition, a linear system of equations is developed at every modified Picard iteration level. This set of equations can be solved using the preconditioned conjugate gradient method (PCGM), which is more memory-efficient than other iterative methods and has a faster convergence rate (Celia et al. 1990; Clement et al. 1994).

Development of ANN model

The artificial neural network (ANN) model is a very effective and popular substitute for numerical aquifer simulations (Afzaal et al. 2020; Chang and Zhang 2019; Mohanty et al. 2013; Shen et al. 2018; Zhang et al. 2018, 2020). In this proposed methodology, the ANN model acts as the surrogate model of the groundwater flow model. A three-dimensional unsaturated–saturated groundwater flow model developed using Eq. (1) is used to generate data for training the ANN model. To develop the ANN model, the input parameters are the flow parameters (θs, θr, α, Ks, n, and Ss) to be estimated, and the output of the ANN model are the hydraulic head at different observation well location for different time steps. The developed ANN model can further predict the hydraulic heads without evoking the numerical simulation model, thereby reducing the computational time considerably. In this study, six observation wells are considered, and as such, six ANN models are developed. A feed-forward neural network is used to generate the ANN pattern (Fig. 1), which features one hidden layer with 40 neurons and 1000 input–output patterns, which are generated using the groundwater simulation model. In total, 60% of the generated data is utilized for training ANN models, and 40% is used for testing and validating the ANN model. Training of the ANN model is carried out using the Levenberg–Marquardt (LM) algorithm. A unipolar sigmoidal transfer function and a purely linear transfer function are used for the hidden layer and the output layer of the network.

Fig. 1
figure 1

ANN model network

Parameter estimation model

In this study, the unsaturated hydraulic properties—water content (ϴ), hydraulic conductivity (K), and pressure head (h) are related using Van Genuchten and Nielsen's (1985) constitutive relationship. The five hydraulic parameters (θs, θr, α, Ks and n) need to be estimated in order to get the constitutive relations. On the other hand, the specific storage (Ss) is an important parameter in a saturated zone that also needs to be estimated. Henceforth, this optimization model considers six decision variables (θs, θr, α, Ks, n and Ss) that need to be estimated. In this inverse optimization technique, the numerical groundwater simulation model initially uses the candidate solutions generated by the optimization algorithm as input parameters. After that, using this simulation model, the spatial and temporal hydraulic head is generated and matched with the measured hydraulic head at the different observation well. In the optimization model, the objective function value is determined by the difference between the simulated and observed hydraulic heads. The candidate solution is modified based on the objective function value, and the process is repeated until the optimal solution is obtained. The objective function used to estimate all flow parameters is given by Eq. (8). However, this combination took around one day, 5 h, 45 min, and 10 s to estimate all the parameters, which is very time-consuming. To overcome this disadvantage, the numerical simulation model is replaced with an alternate simulator using an Artificial Neural Network (ANN). This ANN model is linked externally with the optimization model, and the whole methodology is represented in a flowchart, as given in Fig. 2.

Fig. 2
figure 2

Flow chart showing the ANN-SFLA based parameter estimation model

$${\text{Minimize}} \quad f_{x} = \sum\limits_{j = 1}^{M} {\sum\limits_{i = 1}^{N} {\left| {OH_{i}^{j} - SH_{i}^{j} } \right|} }$$
(8)
$${\text{Subject to}} \quad H = f\left( {\theta_{s} ,\theta_{r} ,\alpha ,n,K_{s} ,S_{s} } \right)$$
$$\theta_{s}^{\min } \le \theta_{s} \le \theta_{s}^{\max }$$
$$\theta_{r}^{\min } \le \theta_{r} \le \theta_{r}^{\max }$$
$$\alpha^{\min } \le \alpha \le \alpha^{\max }$$
$$n^{\min } \le n \le n^{\max }$$
$$K_{s}^{\min } \le K_{s} \le K_{s}^{\max }$$
$$S_{s}^{\min } \le S_{s} \le S_{s}^{\max }$$

where fx is the objective function for the present optimization model with x number of parameters; \(OH_{i}^{j}\) is the observed hydraulic head at ith time step for jth well location; \(SH_{i}^{j}\) is the simulated hydraulic head at ith time step for jth well location obtained from ANN model or numerical simulation model; M is the total number of observation wells and; N is the total number of time steps; \(\theta_{r}^{\min }\) and \(\theta_{r}^{\max }\) are the lower and upper bound of θr; \(\theta_{s}^{\min }\) and \(\theta_{s}^{\max }\) are the lower and upper bound of θs;\(\alpha^{\min }\) and \(\alpha^{\max }\) are the lower and upper bound of α; \(K_{S}^{\min }\) and \(K_{S}^{\max }\) are the lower and upper bound of Ks;\(n^{\min }\) and \(n^{\max }\) are the lower and upper bound of n and \(S_{s}^{\min }\) and \(S_{s}^{\max }\) are the lower and upper bound of Ss.

Shuffled frog leaping algorithm

In this study, Shuffled Frog Leaping Algorithm (SFLA) is used as the optimization algorithm for estimating the flow parameters. SFLA is a metaheuristic optimization algorithm that solves nonlinear non-convex optimization problems. As the flow parameter estimation is a non-convex problem having multiple local optima, the SFLA algorithm is suitably employed. The model developed for the ANN-SFLA study, as depicted in Fig. 2, is coded in MATLAB environment. The process of Shuffled Frog Leaping Algorithm (SFLA) is shown in Fig. 3. To begin the problem, the first step is to select the number of memeplexes (nm) and virtual frogs within each memeplex (nf).This gives the total number of frogs as \({n}_{m}\times {n}_{f}\). The algorithm continues by assigning a random position to all the frogs and calculating the corresponding fitness. The best frog is then marked as the global best, and then the frogs are sorted into memeplexes. This portion of the flowchart is presented in green colour. The next portion is local evolution in each memeplex, represented in grey colour. The frogs are distributed using the triangular distribution, as shown in Eq. (9), where \({p}_{i}\) represents the probability for triangular distribution, and from them, \(q\) frogs are selected.

$$p_{j} = \tfrac{{2\left( {n_{f} + 1 - j} \right)}}{{n_{f} \left( {n_{f} + 1} \right)}},\;\;{\text{where}}\;j = \, 1 \ldots n_{f} ,$$
(9)
Fig. 3
figure 3

Flowchart showing the process of shuffled frog leaping algorithm (SFLA)

The best and the worst frog are then marked. The position of the worst frog is improved by choosing an appropriate step size based on the best of the memeplex. Then the condition of whether this frog is better than the previous worst and within the feasible space is checked. If the condition is satisfied, then the next step is to update the memeplex and shuffle the frogs. If the condition is not satisfied, then the worst position is improved based on the global best. This set of conditions is represented with blue connectors and arrows with the decision matrix in the flowchart. The improved position is again checked with the same condition as the previous one. If the condition is satisfied, then updating the memeplex and shuffling is continued. If the condition is not satisfied, the new position is improved randomly, and then the memeplex is updated. The flowchart represents this set of conditions with red connectors and arrows. The termination criteria are then checked after updating the global best, and reshuffling into the memeplexes is done. The global optimal solution is reached if the termination criteria are satisfied. If the termination criteria are not satisfied, then the algorithm resumes the step of evolution from each memeplex.

Results and discussion

Validation of the numerical flow model with one-dimensional infiltration problem

To validate the groundwater flow model considering both the unsaturated and saturated zone, a one-dimensional groundwater flow model is selected with transient infiltration towards the groundwater table (Paniconi et al. 1991). The model is simulated for 32 h with the same boundary conditions and input parameters. Figure 4 compares the solutions obtained from the numerical simulation model developed using the code written in MATLAB with the solution obtained by Paniconi et al. (1991). The scatter plots (Fig. 4b) correlate the pressure head (m) obtained from Paniconi et al. (1991) and the numerical simulation model. The regression coefficient is 0.9981, which ascertains that the numerical simulation groundwater flow model could provide an accurate solution as obtained by Paniconi et al. (1991).

Fig. 4
figure 4

Comparison of the pressure head distribution obtained from Paniconi et al. (1991) and numerical simulation model

Three-dimensional hypothetical groundwater flow model considering both unsaturated and saturated zone

In this study, a three-dimensional hypothetical numerical flow model is developed for a homogeneous medium considering both unsaturated–saturated zones. This hypothetical groundwater flow model is developed by solving Eq. (1) using MATLAB. The graphical representation of the groundwater flow model used in this study showing the location of the injection well, observation well, and pumping well is given in Fig. 5. The parameters used to develop this model are listed in Table 1. Further, it is assumed that two injection wells are applied at the ground surface, and two pumping wells are located to pump out water from the saturated zone.

Fig. 5
figure 5

Hypothetical study area showing a dimensions and boundary conditions, b injection wells, observation wells and pumping well location

Table 1 Parameters used to develop the groundwater flow model as considered by Dogan and Motz (2005)

The hydraulic head obtained at time steps 10 h, and 15 h is presented (Fig. 6) for both the unsaturated and saturated zones. For the unsaturated zone, the contour plots of the hydraulic head are shown at the top layer of the flow domain, whereas for the saturated zone, the hydraulic head is shown at a depth of 1.625 m from the datum (Initial position of the water table). The x-axis and y-axis are the dimensions of the flow domain in the XY plane. A total of (20 × 20 × 40) grids are considered for the analysis. However, the solutions are refined for grid size (200 × 200) along the XY plane by interpolating and smoothening the results of the head obtained. Figure 6 shows an increase in hydraulic head in the location of injection wells, whereas a cone of depletion is seen in the pumping well location.

Fig. 6
figure 6

Contour plots showing the hydraulic head (m) at different time steps in a unsaturated zone and b saturated zone

Performance of the ANN models

The numerical simulation model developed for the three dimensional groundwater aquifer is used to generate 1000 input–output data to develop the ANN model. The model uses six parameters as input data and the hydraulic head at different time steps as the output. Using the generated data, the ANN model is trained, tested, and validated. The performance of the developed ANN simulator is shown in Figs. 7 and 8. For each observation well, scatter plots at twenty-four-time steps are plotted. The groundwater numerical flow model provides the observed hydraulic head (OH), while the ANN model provides the simulated hydraulic head (SH). Among all observation wells, the best and the worst coefficient of correlation (R2) values are 0.9999 and 0.9375, respectively. It can be seen that R2 is very close to 1, which implies a strong correlation between the actual hydraulic head and the predicted pressure head. Figure 8 shows the performance of all 6 ANN models for training, testing, and validating the data using Mean square error. This graph shows MSE for training, testing, and validating batches as it converges toward the best with each Epoch. The calculated error terms are found to be very negligible as it ranges from 2.3 × 10–2 to 4.07 × 10–6. Therefore, we can conclude that the developed ANN model can serve as an approximate simulator for simulating the hydraulic head for the proposed study area.

Fig. 7
figure 7

Regression plot of the numerical simulation model with the six ANN models

Fig. 8
figure 8

Training, testing, and validation plots of the six ANN models

Performance of the parameter estimation model

ANN-shuffled frog leaping algorithm

The ANN model was developed for six observation wells and is coupled with the optimization model (SFLA) to minimize the objective function with six input variables (i.e., θs, θr, α, Ks, n, and Ss) as the decision variable. Since this study considers a hypothetical problem, the observed hydraulic heads (OH) for the optimization's objective function are the values taken from the numerical simulation model, and simulated heads (SH) for the same objective function are taken from ANN models. The lower and upper limits for these parameters are decided based on the previous experimental evidence, as listed in Table 2. The number of memeplexes is selected as 7, and the number of virtual frogs is chosen as the maximum number of variables plus one and is equal to 7. Therefore, 7 virtual frogs for each memeplex were selected, comprising a total of 49 frogs. The maximum step size was taken as 1 unit. The maximum number of evolutions in each memeplex is 6, the step length coefficient is 2, and the maximum number of iterations is restricted to 200. The predicted hydraulic parameters using the ANN-SFLA-based parameter estimation model are listed in Table 2.

Table 2 Optimization results 1: ANN-shuffled frog leaping algorithm

Table 2 provides the predicted values obtained from the SFLA-ANN-based parameter estimation model. Here, the unsaturated and saturated parameters are estimated in a single model. The relative efficiency of the ANN-SFLA model in predicting the flow parameters is also checked by evaluating the relative error concerning the actual flow parameters. The model observations indicate that the model could predict all the parameters up to a fair degree of accuracy. Considering the relative error among the actual and predicted values, it ranges from 0.03 to 1.00%. But these values are subsequently low and considered within the acceptable accuracy range. The model converges toward the optimal solution when the objective function value of 6.3084E-05 is reached. It is further compared with another established ANN-Genetic Algorithm-based parameter estimation tool to illustrate the performance of our ANN-SFLA model.

ANN-genetic algorithm

The genetic algorithm (GA) available in the MATLAB toolbox compares the results with SFLA. Genetic algorithms search for optimal solutions through natural selection and genetic evolution (Abdel and El-Hadi 2009; Cavazzuti 2012; Holland 1992). Due to the non-gradient-based search method of GA, it typically produces nearly global optimal solutions instead of true solutions. Thus, the solution obtained by using ANN-GA is presented in Table 3. In this study, GA uses a population size of 50, a maximum generation of 100, a function tolerance of 1 × 10–5, and a crossover probability of 0.8. Mutation functions are constraint-dependent, and the number of stall generations is 60.

Table 3 Optimization results II: ANN-genetic algorithm

When the relative error is calculated, it is observed that ANN-GA could correctly predict three parameters while the remaining three parameters showed errors. For a number of trials considering both the ANN-SFLA and ANN-Genetic algorithms, a number of solutions are generated to verify the accuracy of the proposed algorithms. Figure 9 shows a box plot representing the estimated parameters after 20 trials from both models. The plots show that the average value of α, Ks, and n for both models is very close to the optimal solution. In the ANN-GA model, the estimated value of θs, θr, and SS varies with a wide range of values as compared to ANN-SFLA. The median value obtained for θs is 0.34, θr is 0.0135, and SS = 0.0018 (m−1) using the ANN-GA model, while for the ANN-SFLA model, the median values for θs are 0.315, θr is 0.011, and SS = 0.0012 (m−1). From the investigation, it is clear that the solution obtained after 20 trials shows better performance in the ANN-SFLA model than in the ANN-GA model.

Fig. 9
figure 9

Box plot representing the estimated value of the flow parameters using ANN-Genetic algorithm and ANN-SFLA

The variation of the objective function with the iteration for the Genetic Algorithm (GA) and Shuffled Frog Leaping Algorithm (SFLA) is plotted in Fig. 10 to study the reason for this observation. A total number of 200 generations are taken for both GA and SFLA. The population size is also relatively similar − 50 for GA and 49 for SFLA. Therefore, the total number of function evaluations is almost identical for both algorithms. As observed, SFLA gets convergence faster and yields better results (Fig. 10). This may be because the memetic evolution is faster and consists of different sets of evolution happening at the same time. Genetic evolution consists of the population (a set of solutions) and evolves altogether. On the other hand, memetic evolution follows a different approach where the population is divided into different memeplexes, and each memeplex evolves independently on a population basis. The population is mixed again to communicate so that the global best is updated, and reshuffling is done again to continue the evolution into the memeplexes. It may be noted that the problem considered in the study has multiple local optimal solutions. As such, a large population size must be taken to obtain the optimal global solution. The population size of 50 is considered in GA just to compare the result with SFLA. The GA may yield a better solution if we increase the population size by more than 50.

Fig. 10
figure 10

Variation of the objective function with iteration for genetic algorithm (GA) and shuffled frog leaping algorithm (SFLA)

Sobol’s sensitivity analysis

The Sobol's global sensitivity method is used to analyze the most influential flow parameters in the unsaturated–saturated flow model. With the use of variance decomposition, one can determine the effect of each parameter on the output, and the interactions between them, based on a large sample of input variables.

Using this method, we can deal with nonlinear and non-monotonic models due to its variance decomposition approach. Using functional representations, the models can be expressed as follows:

$$y = f\left( x \right) = f\left( {x_{1} ,x_{2}, \ldots, x_{p} } \right)$$
(10)

where y is the goodness of fit metric for the model output, and x is the set of input parameters: (x1, x2,….,xp). Sobol's method is a variance decomposition approach. D(y) represents the total variance of the function f. Depending on individual parameters and interactions, D(y) is subdivided into different components.

$$D\left( y \right) = \sum\limits_{i} {D_{i} } + \sum\limits_{i < j} {D_{ij} } + \sum\limits_{i < j < k} {D_{ijk} } + \cdots+ D_{12 \ldots p}$$
(11)

By considering the percentage contribution of the total variance D, Sobol's sensitivity indices are derived for different orders.

The first order sensitivity indices (Si) on y is then defined as:

$${\text{First-order indices}} \quad S_{i} = \frac{{D_{i} }}{D}$$
(12)

The second order indices (Sij) on y due to the direct effect between the two parameters xiand xjis given by:

$${\text{Second-order indices}} \quad S_{ij} = \frac{{D_{ij} }}{D}$$
(13)

The total-order indices (STi) account for the direct effects between one parameter xi with the other parameters and are given by Eq. (14):

$${\text{Total order indices}} \quad S_{Ti} = 1 - \frac{{D_{\sim i} }}{D}$$
(14)

The variance due to the ith parameter is represented by Di, while the variance between the two parameters is represented by Dij. D~i represent the total variance relating to all parameters except the one for which total order indices are being calculated. In Eq. 19, the variance can be found by using Monte Carlo approximations based on these equations (Hall et al. 2005; Sobol 1993, 2001).

$$\hat{f}_{0} = \frac{1}{n}\sum\limits_{{S = 1}}^{n} {f\left( {x_{s} } \right)}$$
(15)
$$\hat{D} = \frac{1}{n}\sum\limits_{S = 1}^{n} {f^{2} \left( {x_{s} } \right)} - \hat{f}_{0}^{2}$$
(16)
$$\hat{D}D_{i} = \frac{1}{n}\sum\limits_{S = 1}^{n} {f\left( {x_{s}^{\left( a \right)} } \right)} f\left( {x_{{\left( {\sim i} \right)s}}^{\left( b \right)} - x_{is}^{\left( a \right)} } \right) - \hat{f}_{0}^{2}$$
(17)
$$\hat{D}D_{ij}^{c} = \frac{1}{n}\sum\limits_{S = 1}^{n} {f\left( {x_{s}^{\left( a \right)} } \right)} f\left( {x_{{\left( {\sim i,\sim j} \right)s}}^{\left( b \right)} - x_{{\left( {i,j} \right)s}}^{\left( a \right)} } \right) - \hat{f}_{0}^{2}$$
(18)
$$\hat{D}D_{ij} = \hat{D}D_{ij}^{c} - \hat{D}D_{i} - \hat{D}D_{j}$$
(19)
$$\hat{D}_{\sim i} = \frac{1}{n}\sum\limits_{S = 1}^{n} {f\left( {x_{s}^{\left( a \right)} } \right)} f\left( {x_{{\left( {\sim i} \right)s}}^{\left( a \right)} ,x_{is}^{\left( b \right)} } \right) - \hat{f}_{0}^{2}$$
(20)

where, Superscripts (a) and (b) represent different samples in the sampled unit hypercube, where n represents sample size, and xs represents the sampled individual. Parameters that take their values from a sample (a) are represented by \(x_{s}^{\left( a \right)}\). The variables \(x_{is}^{\left( a \right)}\) and \(x_{is}^{\left( b \right)}\) are variables that denote parameter xis using sampled values from samples (a) and (b). The \(x_{\sim is}^{\left( a \right)}\) and \(x_{\sim is}^{\left( b \right)}\) symbols represent cases where all parameters, except xis, are based on sampled values from samples (a) and (b). Parameters xis and xjs are represented by \(x_{{\left( {ij} \right)s}}^{\left( a \right)}\) in sample (a) with sampled values. Finally, \(x_{{\left( {\sim i\sim j} \right)s}}^{\left( a \right)}\) illustrates the case when all parameters except xis and xjs are based on sampled values from sample (b).

Selection of the sample size is one of the most significant steps while carrying out Sobol's sensitivity analysis. The sensitivity indices (total order effect and first-order effect) are calculated with the decision variable as input and hydraulic head as output with different sample sizes. The most suitable sample sizes are selected accordingly. As the sample size increases beyond 10,000, the values of Sobol's indices do not change. This means that at least 10,000 samples should be considered while performing the sensitivity analysis in this study. Using Eq. (12) to (14), the effect of all the parameters on the model output using Sobol's First Order indices (FOI), Second Order indices (SOI), and Total Order indices (TOI) is calculated. The FOI, SOI, and TOI values are shown in Fig. 15 for all the six parameters used in the groundwater flow model, considering a sample size of 15,000.

As per the SOBOL analysis, when the value of the FOI and TOI approaches to 1, this means that the parameter is highly sensitive. On the other hand, the value of FOI should always be less than TOI. In this study, it was observed that a high value of FOI and TOI (> 0.9) is observed for Van Genuchten's fitting parameter (α), which means that the parameter (α) is a highly sensitive input parameter. The second-order Sobol indices are also determined to understand the influence of two parameters to the model output. In this flow domain, the highest value of SOI is obtained for (α–n), followed by (α–θs), (α–θr), (α–Ss), and (α–Ks). This result indicates that α is the most sensitive input parameter. When interacting with the other parameters, it shows the highest value. These findings indicated that the hydraulic head obtained from the model output had a synchronized effect when the parameter α interacted with the other flow parameters, which was impossible to observe during the FOI calculation.

Conclusion

This study proposes an effective methodology to estimate the unsaturated and saturated flow parameters together in a single inverse optimization model. As such three-dimensional hypothetical groundwater flow model is developed considering both the saturated and unsaturated zone. The parameters that need to be estimated are the hydraulic parameters given by Van Genuchten and Nielsen (1985) that are θs, θr, α, Ks, n, and specific storage (Ss), an essential parameter in the saturated zone. This parameter estimation model is developed using Artificial Neural Network (ANN) and Shuffled Frog Leaping Algorithm to achieve efficiency in computation time and predicting performance. The ANN model is trained using the data generated from the three-dimensional groundwater flow model considering both unsaturated and saturated zones. The result indicates that the ANN-SFLA-based parameter estimation model can predict all six flow parameters well with a minimum relative error and less computational time. Due to its faster convergence and better results, SFLA has shown competitive results when compared to Genetic Algorithm. This may be because memetic evolution in SFLA occurs more rapidly and consists of different evolution sets occurring simultaneously. On the other hand, in GA, genetic evolution is composed of a population (a set of solutions) and evolves as a whole. Therefore, we conclude that ANN-SFLA-based parameter estimation models are a better alternative to solve this parameter estimation problem. The sensitivity study shows that the fitting parameter (α) is a highly sensitive input parameter in the developed groundwater flow model. When analyzing the Sobol indices, it is observed that when α associates with other parameters, it provides high sensitivity values, as shown in Fig. 11. Thus, the Van Genuchten Parameter (α) is considered to be the most sensitive input parameter when developing a groundwater flow model considering both the unsaturated and saturated zones.

Fig. 11
figure 11

Sensitivity analysis of a first-order indices and total indices and b second-order indices for groundwater flow model