Introduction

Coastal groundwater plays a vital role in maintaining freshwater reserves essential for fulfilling the agricultural, domestic, and industrial water requirements of populations residing near coastal regions. With over 50% of the world’s population residing in these regions and the potential for this number to increase to 75% in the next century (Neumann et al. 2015), the demand for freshwater resources is increasing. This growing trend in coastal settlements inevitably leads to the excessive utilization of valuable groundwater reserves. Overexploiting coastal groundwater can lead to saline water intrusion, which, in turn, can have severe consequences on natural equilibrium. Addressing the complex and interconnected threats of salinization is crucial for the well-being of populations living in coastal regions (Feist et al. 2023). To prevent seawater intrusion, it is imperative to adopt a prudent and sustainable groundwater pumping plan in coastal aquifers. Effective management of groundwater resources serves as a pivotal factor in fostering sustainable development within any given region, notably contributing to agricultural, residential, industrial, and economic stability (Masoud 2020). A solution to sustainable groundwater resources management lies in the implementation of multiple objective management models that can prescribe an optimum groundwater pumping plan whilst keeping the aquifer’s saltwater levels within allowable bounds. These models are developed using the concept of a combined simulation–optimization (S–O) methodology, wherein the computationally efficient surrogate models replace the simulation model (Dhar and Datta 2009; Sreekanth and Datta 2010). To support the development of an integrated S–O-based seawater intrusion management model, this study proposes an automatic model selection (AutoML)-based machine learning (ML) algorithms to develop surrogate models for predicting salinity intrusion in a coastal aquifer located in Bangladesh.

Recently, there has been an increasing attention among researchers in developing computationally efficient surrogate models to predict seawater intrusion mechanisms in coastal aquifers. These surrogates have been employed as computationally feasible alternatives to the intricate numerical models in the combined S–O-based seawater intrusion management models (Dhar and Datta 2009; Roy and Datta 2018a). Prediction accuracies of the surrogate models are of utmost importance as they largely determine the overall precision and dependability of the developed management model. In the last few years, different ML-based algorithms have been employed to construct the surrogate models for seawater intrusion problems with different performance abilities. The ML-based methodologies involve the utilization of various cutting-edge ML algorithms, which have been utilized to construct models for managing saltwater intrusion. A synopsis of recent investigations concerning the prediction of saltwater intrusion through ML algorithms for the development of management models in coastal aquifers is outlined in Table 1. Overall, the prediction capabilities of these prediction models have achieved satisfactory results. However, selecting the most appropriate model from a multitude of algorithms designed for estimating seawater intrusion, each with varying performance levels, can be a formidable task. Hence, there is a need to seek an approach that simplifies the model selection task by automating the process. This automated model selection process has not been previously applied to choose the most suitable saltwater intrusion prediction models. To address this research gap, this study introduces an AutoML approach to select the best saltwater intrusion prediction models from a number of different alternative ML-based algorithms. These prediction models are known as the surrogate or meta-models in the context of a combined S–O approach to construct a management model to control seawater intrusion.

Table 1 Summary of recent studies on predicting saltwater intrusion using ML algorithms for developing saltwater intrusion management models in coastal aquifers

Implementing new techniques of seawater intrusion prediction is crucial to lowering modelling errors and model parameter uncertainties (Moravej et al. 2020). However, it is still a complex scientific problem for seawater intrusion prediction to reach the requirement of higher prediction accuracy with the desired model. Previous studies of seawater intrusion prediction compared a few ML-based modelling approaches and proposed the best predictive model based on the comparison results. This approach is constrained by the selection of appropriate candidate models for comparison and finding the top performing model. In other words, traditional ML-based approaches to seawater intrusion prediction involve manual model selection, which can be time-consuming, and subjective. To address these limitations, it is often beneficial to compare multiple approaches through optimizing their optimizable hyperparameters using optimization algorithms to arrive at the most appropriate prediction model for a given data set. This AutoML refers to the process of automatically choosing the most suitable regression model for a given data set. The objective is to determine the model that offers the best fit to the data and provides the most accurate predictions. To this end, the present study proposes an AutoML technique to select the top-performing model for enhancing prediction accuracy and streamlining the modelling process. The asynchronous successive halving algorithm (ASHA) (Li et al. 2020) was used in the AutoML technique to search for the top-performing model through hyperparameters tuning.

Selection of models for a particular problem is quite difficult given that a vast majority of ML-based models are available. Training numerous models and optimizing their hyperparameters can frequently span days or even weeks. Consequently, employing an automated model optimization technique for the development and comparison of various models can significantly expedite the process. For model hyperparameters tuning, Bayesian and ASHA optimization can be used to speed up the process. This research paper focuses on the application of the ASHA, for AutoML in seawater intrusion prediction tasks. The ASHA algorithm is a state-of-the-art model selection method that balances exploration and exploitation through adaptive resource allocation. Rather than individually training each model with distinct hyperparameter configurations, a selection of different models can be chosen, and their default hyperparameters can be fine-tuned using the ASHA optimization method. Optimization algorithms aim to discover the ideal combination of hyperparameters for a specific model by minimizing the model’s objective function. These optimization algorithms select new hyperparameters strategically in each iteration and generally reach the optimal hyperparameter set more efficiently than a straightforward grid search. Consequently, this study aims to employ ASHA optimization to train several regression models on a given training data set and determine which model performs most effectively on a test data set. Our goal is to look into the effectiveness of the AutoML-based surrogate model selection using the ASHA optimization algorithm to predict seawater intrusion processes within coastal aquifers. By automating the model selection process, this research intends to overcome the limitations of manual model selection process, which often involves subjectivity and leads to suboptimal choices.

Our proposed approach automates and excludes the manual phases needed to transition from a data set to a predictive model. We present a novel approach that automates the model selection process for seawater intrusion prediction, providing a more objective and efficient alternative to manual selection. Therefore, the contribution of this study includes (a) building multiple regression models for a given training data set of salinity intrusion through optimizing their hyperparameters using the ASHA optimization algorithm, and (b) selecting the top-performing models that can be used as surrogate models in a combined S–O-based management model for controlling seawater intrusion.

Methodology

The study seeks to introduce an AutoML-based approach for model selection in predicting seawater intrusion processes within coastal aquifer systems subject to spatiotemporal groundwater withdrawal. This AutoML approach aims to replace manual surrogate model selection for a combined S–O framework to construct seawater intrusion management models. The methodology involves several key steps: simulating groundwater flow and solute transport phenomena through a 3D simulation model, generating spatiotemporal groundwater extraction values within practical limits through a sampling strategy, creating predictor–response data set for training with the assistance of a calibrated and validated simulation model, and developing surrogate models using the AutoML approach. This research is based on data produced using a 3D seawater intrusion simulation model that has been applied to an actual coastal aquifer in southern Bangladesh, as initially proposed in Roy and Datta (2020b). To enhance readability, the methodologies are summarized briefly in the subsequent paragraphs.

Simulation of aquifer processes

Aquifer dynamics were modeled in the context of transient flow and salt transport using a 3D density-driven combined flow and solute conveyance numerical code known as FEMWATER (Lin et al. 1997). FEMWATER is a robust and widely adopted finite-element-based modelling approach that has demonstrated its effectiveness in numerous prior investigations, specifically in addressing density-driven combined flow and salinity transport phenomena in coastal aquifers (Sreekanth and Datta 2011b; Roy and Datta 2017a). The controlling equation for 3-D flow within FEMWATER is expressed through the modified Richards equation (Lin et al. 1997):

$$\frac{\rho }{{\rho }_{0}}F\frac{\delta h}{\delta t}=\nabla .\left[K.\left(\nabla h+\frac{\rho }{{\rho }_{0}}\nabla z\right)\right]+\frac{\rho }{{\rho }^{*}}q,$$
(1)

where \(F\) denotes storage coefficient, \(h\) represents pressure head, \(K\) refers to hydraulic conductivity tensor, \(z\) is the potential head, \(q\) represents a source or a sink, \(\rho\) denotes water density at chemical concentration \(C\), \({\rho }_{0}\) symbolizes referenced water density at zero chemical concentration, \({\rho }^{*}\) is the density of injection fluid or that of the withdrawn water.

The hydraulic conductivity, denoted as \(K\), can be represented as

$$K=\frac{\rho g}{\mu }k=\frac{\left(\rho /{\rho }_{0}\right)}{\mu /{\mu }_{0}}\frac{{\rho }_{0}g}{{\mu }_{0}}{k}_{s}{k}_{r}=\frac{\rho /{\rho }_{0}}{\mu /{\mu }_{0}}{k}_{so}{k}_{r},$$
(2)

where \(\mu\) represents the dynamic viscosity of water at a given chemical concentration denoted as \(C\), \({\mu }_{0}\) denotes reference dynamic viscosity at zero chemical concentration, \({k}_{s}\) is the saturated permeability tensor, \({k}_{r}\) is the relative permeability or relative hydraulic conductivity, \({k}_{so}\) refers to referenced saturated conductivity tensor.

In the context of the seawater intrusion problem, the connection between concentrations of fluid density and saltwater is described by the following equation:

$$\frac{\rho }{{\rho }_{0}}=1+\varepsilon \frac{C}{{C}_{{\text{max}}}},$$
(3)

where \(\varepsilon\) represents the density reference ratio (dimensionless), \(C\) denotes the material concentration in the aqueous phase, and \({C}_{{\text{max}}}\) is the highest material concentration.

The 3D transport equation is expressed through the following equation:

$$\begin{aligned}&\theta \frac{\delta C}{\delta t}+{\rho }_{b}\frac{\delta S}{\delta t}+V.\nabla {\text{C}}-\nabla .\left(\theta D.\nabla {\text{C}}\right)\\ &\quad=-\left({\alpha }^{\prime}\frac{\delta h}{\delta t}+\lambda \right)\left(\theta C+{\rho }_{b}S\right)-\left(\theta {K}_{w}C+{\rho }_{b}{K}_{s}S\right)\\ & \qquad +m-\frac{{\rho }^{*}}{\rho }qC+\left(F\frac{\delta h}{\delta t}+\frac{{\rho }_{0}}{\rho }V.\nabla \left(\frac{\rho }{{\rho }_{0}}\right)-\frac{\delta C}{\delta t}\right)C,\end{aligned}$$
(4)

where \({\rho }_{b}\) is the bulk density of the medium, \(C\) refers to material concentration in aqueous phase, \(S\) is the material concentration in adsorbed phase, \(t\) denotes time, \(V\) represents discharge, \(\nabla\) is the del operator, \(D\) denotes dispersion coefficient tensor, \(\lambda\) represents decay constant, \(M=q{C}_{m}\) = artificial mass rate, \(q\) is the source rate of water, \({C}_{m}\) is the material concentration in the source, \({K}_{w}\) represents the first order biodegradation rate constant through dissolved phase, \({K}_{s}\) is the first order biodegradation rate through adsorbed phase, \({K}_{d}\) refers to the distribution coefficient, \(\theta\) refers to moisture content, \({\alpha }^{\prime}\) is the modified compressibility of water, \({\beta }{\prime}\) is the modified compressibility of the medium, \(n\) refers to porosity, \(S\) is the saturation. Equations (1) and (4) describe, respectively, the flow and salinity transport phenomena. The salinity intrusion phenomenon becomes highly nonlinear due to the interconnection between the density coupling coefficient and Darcy velocities in these two equations. Consequently, the equations for flow and salinity transport processes were concurrently resolved using a numerical simulation model grounded on finite elements.

Multiple simulation to generate predictor–response data set for training

The calibrated and validated seawater intrusion model was employed to produce sets of predictor–response arrays to train AutoML-based surrogate models, using randomized inputs representing spatiotemporal pumping conditions. The spatiotemporal groundwater abstraction stress imposed on the aquifer was linked to the extraction of water from a series of pumping and barrier abstraction wells situated at distinct positions and time intervals. These randomized inputs were derived from a uniform sampling distribution, specifically employing Halton Sequences (HA) (Halton 1960). This selection of sampling method was adopted in the present study due to its demonstrated advantages over more frequently employed methods (Loyola et al. 2016). These sampled inputs were then fed into the simulation model to compute the resultant salinity values at designated monitoring locations (MLs). Each run of the simulation model, with inputs and outputs, constitutes a single input–output pattern. Executing the simulation model multiple times produces a collection of such input–output patterns. The aquifer properties, along with the simulation conditions, stayed consistent across various simulation runs. The sole variable that changed in successive simulations was the transient groundwater abstraction values. This variation allowed for the acquisition of distinct estimations of salinity concentrations, which were exclusively influenced by the applied groundwater abstraction stress on the aquifer system.

The simulation was conducted over a 3-year management period utilizing 43 production and 13 barrier wells. As a result, the study accounted for 168 decision variables [(43 + 13) × 3 = 168], representing spatiotemporal groundwater extraction values. To adequately capture the input–output training patterns, which encompassed numerous input variables (168 in this study), an adequately sufficient number of input–output training patterns were generated. The input pumping patterns were generated using Halton sequences, with specified upper and lower bounds for pumping values. These bounds were set at 0 and 6000 m3/day, respectively. The selection of the upper bounds of pumping was based on actual groundwater withdrawal values recorded in the study area in April 2017 (Roy and Datta 2020b). A three-dimensional visualization of the spatiotemporal groundwater extraction values has been presented in Fig. 1 depicting the groundwater abstraction patterns over time.

Fig. 1
figure 1

Three-dimensional view of the transient groundwater pumping values

The necessary input–output training patterns for developing a coupled S–O-based saltwater intrusion management model vary based on system complexity and decision variable count. Complex problems may demand numerous patterns for a reliable surrogate model. Typically, a large data set is generated, and training starts with a subset, incrementally adding patterns until no significant improvement is observed. Despite its time demand, this method yields surrogate models capturing true input–output trends across the decision space. Coupled with optimization algorithms, these models provide optimal groundwater extraction values. However, generating training patterns through multiple calibrated model simulations is time-intensive. An alternative is adaptive training, which uses fewer patterns to develop reasonably accurate emulators of saltwater intrusion processes (Sreekanth and Datta 2010; Christelis and Mantoglou 2016; Christelis et al. 2018).

To develop adaptive surrogate models, Sreekanth and Datta (2010) employed an expanding set method, progressively adding training patterns generated by an optimization strategy based on the search direction. Initially, limited training patterns were used with surrogate models alongside an optimization algorithm to find near-optimal solutions. Once an approximate optimal solution was found, additional training data near this solution were generated to improve accuracy. The surrogate model was then retrained, and the optimization routine rerun to achieve a more accurate optimal solution. This process iterated until the desired level of surrogate model accuracy was attained.

Christelis and Mantoglou (2016) employed an adaptive approach to train surrogate models for optimizing pumping in coastal aquifers, dealing with ten wells and ten corresponding constraints per well. They devised an online Radial Basis Function (RBF) surrogate model training scheme integrated with an optimization algorithm. Their method involved adding infill points to the initial sampling plan using the best solutions found by the RBF model during optimization, prioritizing local exploitation over global improvement. While providing accurate predictions in limited regions of the decision space, this approach may overlook the global optimum (Forrester et al. 2008). It also integrates feedback from numerical simulations to assess current best solutions. Christelis et al. (2018) extended this method to larger problems within computational constraints, evaluating its performance.

In an alternative approach, Song et al. (2018) adaptively trained a surrogate model during evolutionary search to meet a desired fidelity level, preventing error accumulation in forecasting and ensuring convergence to the true Pareto-optimal front. Initially, a set number of training samples spanning the feasible variable space established the preliminary surrogate model. Offspring individuals were then evaluated using this model, integrated with the current population, and subjected to fast non-domination sorting. About 20% of the population size was selected based on convergence and diversity criteria as promising individuals at intervals of several generations. In addition, a modified local search operator enhanced local optimality. Selected local optima were reevaluated using a numerical simulation model and merged with approximate Pareto optimal solutions. Solutions evaluated by the simulation model were archived to improve surrogate model accuracy. This approach focuses on local optimal solutions, offering more accurate predictions within limited regions of the input variable decision space.

However, in its pursuit of computational efficiency in generating necessary input–output patterns for surrogate model training, this approach overlooks the accuracy and uncertainty of surrogate model predictions. Furthermore, these adaptive methods necessitate evaluating the generated training patterns (obtained through optimization algorithms) using numerical simulation models, inevitably demanding additional computational time and effort compared to the initial fewer input–output training patterns. Relying on optimal solutions from limited decision space regions and local optimality, the adaptive approach may miss the global optimum while generating additional training patterns from existing ones. Considering these limitations, our study aims to prioritize the development of highly accurate surrogate models using the requisite number of input–output training patterns.

Surrogate model development

Surrogate models play a pivotal role in a combined S–O method, which is the central component for developing a management model for seawater intrusion control that prescribes ideal water abstraction rates from coastal aquifers. To aid in constructing a combined S–O-based management model for seawater intrusion, this study proposes automatic selection of ML-based surrogate models instead of manually selecting a few of them and determining the top-performing one. The AutoML process was built upon the data set used for model development. The spatiotemporal pumping rates, originating from 43 production wells and 13 barrier wells, spanning a 3-year timeframe, served as input data for the surrogate models. Consequently, the surrogate model’s input comprised a total of 168 (56 wells × 3 time steps). On the other hand, the surrogate model's output represents the concentrations recorded at designated MLs throughout all time steps. Salinity concentrations were monitored at 16 locations. Consequently, 16 models were developed representing 1 model at each monitoring location. This input–output configuration for the surrogate models can be expressed as follows:

$$\text{Salinity concentration at a certain monitoring location}=f\left(\text{Spatiotemporal groundwater extraction from the production and barrier wells}\right).$$
(5)

Equation 5 can be mathematically represented by

$$\left\{{C}_{i}^{j},i=ML1,\dots ,ML16,j=1,\dots ,3\right\}=f\left\{\begin{array}{c}\left({PW}_{k}^{j},k=1,\dots ,43,j=1,\dots ,3\right)\\ \left({BW}_{l}^{j},l=1,\dots ,13, j=1,\dots ,3\right)\end{array}\right\},$$
(6)

where \(i\) and \(j\) represent indexes for monitoring places and management time periods, accordingly; \(k\) and \(l\) indicate indices for the production and barrier well locations, respectively; \({C}_{i}^{j}\) denotes saltwater concentration levels at the ith monitoring places for the jth management time steps; \({PW}_{k}^{j}\) symbolizes water abstraction from the \(k{\text{th}}\) production well places for the jth management time steps; and \({BW}_{l}^{j}\) represents water abstraction from the lth barrier well places for the \(j{\text{th}}\) management time steps.

The proposed surrogate model selection using AutoML approach is presented in Fig. 2.

Fig. 2
figure 2

Machine learning-based modelling workflows with automated machine learning (AutoML) approach. The steps where AutoML is applicable are displayed in a light green shade

Selection of candidate machine learning algorithms

The precision of a surrogate model primarily hinges on the judicious choice of models and their prediction accuracy. To predict salinity intrusion in coastal aquifer systems, surrogate models were created through the training and testing of ML algorithms. Nonetheless, training multiple ML algorithms and identifying their optimal parameter configurations presents a time-intensive endeavor. To streamline and simplify this procedure, a set of commonly employed ML algorithms with distinct hyperparameters was chosen as the candidate algorithms. Table 2 presents these candidate ML algorithms alongside their adjustable hyperparameters.

Table 2 Machine learning algorithms and their tunable hyperparameters

In the process of model development at a specific monitoring location, the hyperparameters of the seven ML algorithms listed in Table 1 were fine-tuned utilizing the ASHA (Li et al. 2020) optimization algorithm. The ASHA optimization served a dual purpose: not only did it fine-tune the parameters of the seven ML algorithms, but it also determined the best-performing ML algorithm among the seven options.

Selection of the best models through hyperparameters optimization

It is of crucial importance to discern which model, from the multitude of options available, excels in achieving the desired task. Subsequently, fine-tuning its hyperparameters becomes paramount for optimal performance. AutoML streamlines this by optimizing both the model and its related hyperparameters in one operation. The majority of ML algorithms require careful selection of hyperparameters (Probst et al. 2019). The choice of hyperparameters significantly influences the performance of ML models (Zhang et al. 2016), and haphazard selection can yield subpar results. Many studies utilized trial-and-error hyperparameter selection (Zhang et al. 2016; Jeong and Park 2019), grid and/or random search (Bergstra and Bengio 2012) to develop ML-based models. Some even employ heuristic optimization methods like genetic algorithm and particle swarm optimization (Bergstra and Bengio 2012). Therefore, a precise and dependable automated hyperparameter optimization approach is of great desirability (Zhang et al. 2016) and proves valuable for ensuring an impartial comparative evaluation among various ML alternatives. Moreover, when evaluating various ML models, fair assessments can only be made if all models are similarly optimized (or receive equivalent consideration) for the specific problem under consideration.

ASHA (Li et al. 2020) optimization algorithm is an upgraded variant of Successive Halving algorithm (SHA) (Karnin et al. 2013; Jamieson and Talwalkar 2015). It is an easy-to-use hyperparameter optimization technique that takes advantage of aggressive early halting and is appropriate for large-scale hyperparameter optimization problems (Li et al. 2020). As demonstrated on a workload with 500 workers, ASHA outperforms current advanced hyperparameter tuning techniques, scales in a linearly manner with the number of workers in distributed environments, and is appropriate for huge parallelism (Li et al. 2020). The user does not need to indicate in advance how many configurations to evaluate, because it is asynchronous, but it still needs the same inputs as SHA. A thorough description of ASHA can be found in Li et al. (2020), and is not duplicated in this work. As far as the authors are aware, this is the first effort that ASHA optimization algorithm is used to tune the hyperparameters of several ML algorithms to automatically select the best proxy model to predict salinity intrusion in coastal aquifer systems.

In this research, surrogate models were developed by tuning hyperparameters of seven commonly used ML algorithms and finding the best model for the seawater intrusion data set at 16 MLs. In this approach, rather than utilizing distinct combinations of hyperparameters to train every model, seven ML algorithms were selected and their default hyperparameters were optimized using ASHA optimization algorithm. This optimization algorithm searches for an optimal set of hyperparameters for a particular model through optimizing the cost function of the model (minimization of the Mean Squared Error (MSE)). The optimization algorithm deliberately selects new hyperparameters in each iteration and produces an optimal set of hyperparameters for a given training data set and identifies the model with the highest performance on a test data set. The ASHA optimization algorithm used seven ML algorithms with various hyperparameters values and trained them using a tiny portion of the learning data set. If the log(1 + valLoss) value for a particular model is promising, where valLoss is the cross-validation MSE, the model is retained for the next phase and retrained using more training data. Repetitively, this process utilizes increasingly greater volumes of data for training efficient models. The selected best model was one that produced the lowest training and test errors. Table 3 presents the selected best models and their optimized hyperparameters at different MLs.

Table 3 Selected best models and their optimal parameters at different monitoring locations

Evaluation of surrogate model performance

The effectiveness of the proposed AutoML-based surrogate models was assessed using a range of statistical evaluation metrics, including Accuracy (A), Correlation Coefficient (R), Nash–Sutcliffe Efficiency Coefficient (NSE), Willmott’s Index of Agreement (IOA), Kling–Gupta Efficiency (KGE), Root Mean Squared Error (RMSE), Normalized RMSE (NRMSE), Mean Absolute Error (MAE), Median Absolute Deviation (MAD), Mean Bias Error (MBE), and Percentage Bias (PBIAS). The mathematical expressions defining these performance indices are provided below:

$$A=1-abs\left({\text{mean}}\frac{{C}_{i,P}-{C}_{i,S}}{{C}_{i,S}}\right),$$
(7)
$$R=\frac{\sum_{i=1}^{n}\left({C}_{i,S}-\overline{{C }_{S}}\right)\left({C}_{i,S}-\overline{{C }_{P}}\right)}{\sqrt{\sum_{i=1}^{n}{\left({C}_{i,S}-\overline{{C }_{S}}\right)}^{2}}\sqrt{\sum_{i=1}^{n}{\left({C}_{i,P}-\overline{{C }_{P}}\right)}^{2}}},$$
(8)
$${\text{NSE}}=1-\frac{\sum_{i=1}^{n}{\left({C}_{i,S}-{C}_{i,P}\right)}^{2}}{\sum_{i=1}^{n}{\left({C}_{i,S}-\overline{{C }_{S}}\right)}^{2}},$$
(9)
$${\text{IOA}}=1-\frac{\sum_{i=1}^{n}{\left({C}_{i,S}-{C}_{i,P}\right)}^{2}}{\sum_{i=1}^{n}{\left(\left|{C}_{i,P}-\overline{{C }_{S}}\right|+\left|{C}_{i,S}-\overline{{C }_{S}}\right|\right)}^{2}},$$
(10)
$${\text{KGE}}=1-ED=1-\sqrt{{\left(R-1\right)}^{2}+{\left(\propto -1\right)}^{2}+{\left(\beta -1\right)}^{2}},$$
(11)
$$\propto = \frac{\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({C}_{i,P}-\overline{{C }_{P}}\right)}^{2}}}{\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({C}_{i,S}-\overline{{C }_{S}}\right)}^{2}}},$$
(12)
$$\beta =\frac{\frac{1}{n}\sum_{i=1}^{n}{C}_{i,P}}{\frac{1}{n}\sum_{i=1}^{n}{C}_{i,S}},$$
(13)
$${\text{RMSE}}=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({C}_{i,S}-{C}_{i,P}\right)}^{2}},$$
(14)
$${\text{NRMSE}}=\frac{{\text{RMSE}}}{\overline{{C }_{S}}}\times 100,$$
(15)
$${\text{MAE}}={\text{mean}}\left[\left|{C}_{i,S}-{C}_{i,P}\right|\right],$$
(16)
$${\text{MAD}}\left({C}_{S},{C}_{P}\right)={\text{ median }}\left(\left|{C}_{S, i=1}-{C}_{P, i=1}\right|,\left|{C}_{S, i=2}-C\right|,\dots ,\left|{C}_{S, i=n}-{C}_{P, i=n}\right| \right) \quad {\text{for }} i=\mathrm{1,2},\dots ,n,$$
(17)
$${\text{MBE}}=\frac{1}{n}\sum_{i=1}^{n}\left({C}_{i,P}-{C}_{i,S}\right),$$
(18)
$${\text{PBIAS}}=\frac{\sum_{i-1}^{n}\left({C}_{i,P}-{C}_{i,S}\right)}{\sum_{i=1}^{n}{C}_{i,S}}\times 100,$$
(19)

where \({C}_{i,S}\) represents the simulated salinity concentrations derived from the simulation model and is measured in milligrams per liter (mg/L), \({C}_{i,P}\) represents the AutoML-based surrogate model predicted salinity concentrations (mg/L), \(\overline{{C }_{S}}\) denotes the average or mean simulated salinity concentrations as calculated by the simulation model (mg/L), \(\overline{{C }_{P}}\) symbolizes the mean AutoML-based surrogate model predicted salinity concentrations (mg/L), \(n\) represents the total number of input–output data sets, \(ED\) indicates Euclidian distance from the ideal data points, \(\propto\) signifies the corresponding differences between the simulated and model predicted salinity concentrations, \(\beta\) represents the ratio of the average predicted and average simulated salt concentration levels, reflecting the bias in the predictions.

Application of the proposed methodology

The proposed methodology was assessed using the simulation model and data presented in Roy and Datta (2020b). For the ease of readership, brief descriptions of the study area, data, and model calibration process are presented in the following sections.

Study area and data

The research domain is situated within the Barguna district, which is part of the coastal regions of Bangladesh. It is positioned in the southern region of the country, spanning the geographical coordinates of 21º 48ʹ to 22º 29ʹ north latitudes and 89º 52ʹ to 90º 22ʹ east longitudes. In this study area, the coastal aquifer is experiencing contamination due to salinity intrusion caused by extensive groundwater extraction. The primary source of water for agricultural purposes in this region is groundwater, used for cultivating major crops, including rice, wheat, and potatoes. The study area encompasses two administrative units, known as upazillas, within the Barguna district: Patharghata covering an area of 258.63 square kilometers and Barguna Sadar spanning 339.54 square kilometers. To the south, this region is bordered by the Bay of Bengal. Figure 3 presents a visual representation of the research domain.

Fig. 3
figure 3

Location and aerial map of the study area

Aquifer material layers were selected based on lithological data from the study area. Up to a depth of 300 m, the area consists mainly of alluvium soil types, including clay, silt, sand, and occasional gravel (Faneca Sànchez et al. 2015). Given that most physical processes occur within the first few meters of the aquifer, a thickness of 150 m was chosen. This thickness was divided into four layers: sandy silt (40 m), sandy loam (50 m), sand (40 m), and sandy clay (20 m). Each layer was assumed to have homogeneous aquifer material, with only vertical heterogeneity in terms of hydraulic conductivity considered. These values align with previous studies in the Bengal Delta (Faneca Sànchez et al. 2015; Michael and Voss 2009). An anisotropy ratio (kx/ky) of 2.0 was applied, with kx representing horizontal hydraulic conductivity in the X-direction, and ky representing horizontal hydraulic conductivity in the Y-direction. The vertical hydraulic conductivity (kz) was set as one-tenth of the values in the X-direction. Figure 4 illustrates a 3-D view of the model domain with finite-element meshes.

Fig. 4
figure 4

Three-dimensional view of the study area (after Roy and Datta 2020b)

The Northern boundary, defined as the administrative boundary, was designated as a no-flow boundary. While conclusively determining it as such is challenging, this condition was assumed due to the negligible hydraulic gradient (approximately 1: 20,000) near this boundary in the study area. Consequently, lateral groundwater movement across this boundary (illustrated in Fig. 3) was considered insignificant (Faneca Sànchez et al. 2015), leading to its designation as a no-flow boundary for modeling purposes. The sea face boundary was set as both a constant head and constant concentration boundary, with values of zero meters above sea level (MSL) and 35,000 mg/l, respectively. Upstream ends of rivers were assigned specified head values that linearly varied along the streams, reaching zero meters at the sea face boundary. Specific head values of 0.8 m, 0.86 m, and 0.70 m were designated at the upstream ends of the Haringhata, Bishkhali, and Burishwar Rivers, respectively. Tidal river salinity concentrations remained constant at 10,000 mg/l throughout the simulation period.

Figure 5 depicts a plan view of the study area, delineating boundaries and wells. In this figure, production wells, barrier wells, and monitoring locations are denoted by P1–P43, B1–B13, and M1–M16, respectively. During calibration and validation, pumping from barrier wells was disregarded, and hydraulic heads were monitored at M1 and M2. Following successful calibration and validation, barrier extraction wells were implemented as hydraulic control measures for saltwater intrusion processes. In addition, 14 additional monitoring locations were employed to track salinity concentrations for developing the saltwater intrusion management model in the study area.

Fig. 5
figure 5

Plan view of the study area showing the boundaries and wells (after Roy and Datta 2020b)

The calibration process began by establishing steady-state hydraulic head conditions across the finite-element nodes of the model domain. This involved running the transient simulation model over an 80-year period (from April 1930 to April 2009) in stages of 10 years each, using average pumping rates. The outputs at the end of each 10-year interval served as initial conditions for the subsequent stages until a stable hydraulic head condition was reached by April 2009. These stable hydraulic head values across different nodes were then utilized as initial conditions for the calibration process. The calibrated model was subsequently validated over a 3-year period from April 2015 to April 2017, using outputs from April 2014 as initial conditions for the validation period.

The process of model calibration spanned five years, covering the period from April 2010 to April 2014. During this time frame, hydraulic head measurements were consistently recorded at the specified MLs in April of each year (2010, 2011, 2012, 2013, and 2014). This calibration effort involved the refinement of hydraulic conductivity and recharge values to bring the simulated hydraulic heads into closer alignment with the observed hydraulic head values. Following the calibration phase, the model was authenticated throughout the subsequent three years, spanning from 2015 to 2017. The hydraulic head values recorded in 2014 were employed as the starting settings for the simulation during the validation phase. During this validation stage, the boundary conditions of the model were kept consistent with those established during the calibration phase.

The calibration and validation processes utilized a uniform time step of 5 days to analyze hydraulic heads. Computational efficiency is crucial when conducting numerous simulations for training a meta-model in an integrated S–O approach. Hence, a sensitivity analysis was conducted to assess the impact of simulation time steps on computed hydraulic heads. Time steps of 1 day, 5 days, 10 days, and 73 days were employed during the calibration periods from April 2010 to April 2014. Results revealed negligible differences in hydraulic head estimates across varying time steps during calibration. However, significant computational efficiency was observed with a 73-day time step, reducing simulation times considerably. Specifically, simulation times for time steps of 1 day, 5 days, 10 days, and 73 days were 14.61 min, 4.82 min, 3.17 min, and 0.95 min, respectively. Consequently, a time step of 73 days was adopted for multiple simulations with different transient pumping values, facilitating the generation of input–output training patterns for surrogate model training.

The calibrated and validated model was subjected to multiple simulations with varying spatiotemporal groundwater pumping values. This was done to capture a diverse range of predictor–response data sets needed to train the surrogate models. The stress imposed on the coastal aquifer due to spatiotemporal groundwater withdrawal resulted through the usage of both pumping and barrier wells situated at specified places and time intervals. The spatiotemporal water pumping amounts served as inputs to the numerical model to predict salinity concentrations at indicated MLs. This process generated a series of paired data sets, consisting of the spatiotemporal groundwater pumping values as predictors and the corresponding salinity concentration amounts as responses. Salinity measurements were collected from 16 MLs, and multiple such data sets were created by running the aquifer simulations numerous times, each time using diverse groups of spatiotemporal water pumping values derived from both pumping and barrier wells. Throughout these simulations, the aquifer properties, along with the other simulation conditions, stayed consistent. The only variable factor was the varying transient groundwater extraction values. This approach allowed us to capture various scenarios representing the obtained salinity concentrations exclusively owing to the applied spatiotemporal groundwater abstraction stress on the aquifer. Table 4 displays the descriptive statistics of the salinity concentrations produced at the 16 MLs in response to the spatiotemporal groundwater abstraction stress imposed to the aquifer.

Table 4 Statistics of the obtained salinity concentrations at the monitoring locations

Evaluation of the developed surrogate models

To assess the effectiveness of surrogate models in mimicking the aquifer processes within the study area, various statistical metrics were employed. A data set comprising 5000 predictor–response arrays was utilized for both training and validation purposes. Of these predictor–response arrays, 80% were allocated to train the surrogate models, while the other 20% were set aside to assess the effectiveness of the models that were generated. The RMSE and MAE criteria during the training and testing phase were employed to assess the efficiencies of the selected best surrogate models at different MLs. The 12 statistical performance indicators were computed using the test data set, which included 1000 predictor–response arrays. These indices were categorized into two groups: benefit indices (where higher values are favorable) and cost indices (where lower values are favorable). The benefit performance indices include accuracy, R, NS, IOA, and KGE, whereas the cost indices used for the evaluation were the RMSE, NRMSE, MAPRE, MAE, MAD, MBE, and PBIAS. In general, the surrogate models yielded greater values for the benefit indices and smaller values for the cost indices at the designated 16 MPs. These outcomes indicate the effectiveness of the proposed surrogate models in capturing the predictor–response relationships between spatiotemporal water abstractions and the resulting salinity concentration values at the designated MPs.

Training performance of the selected best models

The errors during the training and testing phases were computed and compared to assess the training performance and guarantee that the models developed at various MPs were neither over fit nor under fit. Table 5 provides training and test errors noticed during the training of selected models at specific MPs, as demonstrated by the RMSE and MAE values. The training phase error reflects the degree to which the models match or represent the training data, while the testing phase error provides insights into their generalization to new, unseen data. Smaller disparities between the errors in the training and testing phases indicate better model performance. Table 5 shows that all models exhibit very low errors during the phases of testing and training, suggesting excellent predictive performance. Moreover, the disparities between the errors in the training and testing phases are minimal, indicating no overfitting during model development. This holds true for models developed at all MLs, except for ML5, ML6, ML8, ML11, ML12, and ML14, where the differences between the training and testing phase errors were slightly higher. However, it is worth noting that the testing errors at these MLs were relatively smaller, indicating the models' good generalization capability. Overall, Table 5 serves as a valuable reference for assessing the efficacy of the selected models in terms of training performance, focusing on error metrics for both training and testing phases.

Table 5 Training and test errors during training of the selected models

The training time requirements for developing ML-based surrogate models are of significant importance when considering applicability and scalability. Scalability becomes more accessible with models that train quickly, enabling them to handle large data sets and real-time demands efficiently. In this study, we developed seawater intrusion prediction models to support the creation of a combined S–O-based management model for controlling seawater intrusion in coastal aquifers. These prediction models serve as effective surrogates for the complicated seawater intrusion simulation model. Therefore, it is imperative that the prediction models remain as simple as possible, with minimal training time requirements. Figure 6 offers a visual depiction of the training time requirements for models developed at various MLs. This figure offers valuable insights into the computational resources and time investments needed for effective model training. It serves as a visual aid for understanding how training time varies across different locations and can inform decision-making regarding model development strategies. Observations from Fig. 6 reveal that training times vary for models developed at different MLs. The time required for training these models ranged from as low as 643 s at ML1 to as high as 1824s at ML13, depending on the numerical values of the training data set at these MLs. Considering a substantial training data set of 5000 input–output patterns with 168 input variables, these training times remained relatively short when using a moderately configured computer (Intel(R) Core (TM) i7-8700 CPU @ 3.20 GHz 3.19 GHz & 16.0 GB RAM). Following successful training, the trained models demonstrated the ability to produce predictions for a new, unseen data set within a fraction of a second. This capability of these developed models enables their utilization as effective substitutes for the intricate simulation model inside the framework of a combined S–O-assisted management model for controlling seawater intrusion.

Fig. 6
figure 6

Training time requirements for models developed at different monitoring locations

Performance assessment of the selected best models on an unidentified data

After appropriately training the selected best models at various MLs, these models underwent testing with a new, unseen test data set, employing 12 statistical performance evaluation indices categorized into benefit and cost indices. The outcomes are presented in Table 5 (benefit indices) and Table 6 (cost indices). Table 6 provides a detailed assessment of the selected best models’ performance on an unseen test data set across a range of MLs with respect to accuracy, R, NS, IOA, and KGE criteria. The automatically selected models developed at the designated MLs exhibited high performances with respect to the computed benefit indices. The benefit indices attained similar performances while different benefit indices varied in magnitude among different MLs. All models exhibited superior performances based on accuracy (= 1), R (~ 0.99), NS (~ 0.99), IOA (~ 0.99) and KGE (~ 0.99) criteria. These findings align well with those presented in Roy and Datta (2020b), who reported a similar performance using a MARS-based seawater intrusion prediction model. An ideal accuracy value is 1.0, and any value close to 1.0 indicates excellent model performance (Elbeltagi et al. 2020). The R and IOA values exceeding 0.8 suggest very good performance of the model (Willmott 1981; Kirch 2008), while NS values exceeding 0.8 also suggest excellent model performance (Nash and Sutcliffe 1970). The model performances were deemed excellent with according to the KGE criterion with KGE values very close to 1 (Gupta et al. 2009; Kling et al. 2012). It can be argued that the selected models can effectively represent the numerical simulation model as the benefit indices showed indications of high model performance. Consequently, these models can function effectively as suitable surrogates for the intricate simulation model within the context of the combined S–O-based management model for controlling seawater intrusion.

Table 6 Performance assessment of the selected best models on an unused test data set based on benefit indices

Table 7 presents a comprehensive evaluation of the effectiveness of the top-selected surrogates on an unobserved data set, encompassing various MLs, in relation to the computed cost evaluation indices. Models developed at different MLs exhibited equivalent performance when evaluated using the specified cost evaluation indices. In general, the numerical values of all cost indices were remarkably small, indicating the exceptional performance of the surrogate models (Heinemann et al. 2012; Hyndman and Koehler 2006; Legates and McCabe 1999; Li et al. 2013; Pal 2017; Pham-Gia and Hung 2001; Yapo et al. 1996). The computed cost evaluation indices, including RMSE, NRMSE, MAPRE, MAE, MAD, MBE, and PBIAS, strongly suggest that the developed surrogate models authentically represent the numerical simulation model from which they were derived. Consequently, these models emerge as the prime candidates for serving as computationally efficient alternatives to the simulation model in the context of the framework of a combined S–O approach for the creation of a management model for controlling seawater intrusion

Table 7 Performance assessment of the selected best models on an unused test data set based on cost indices

The prediction accuracies of the selected top-performing models at different MLs, as presented in the absolute error boxplots (Fig. 7), reveal that all models generated lower absolute errors between the simulation model outputs and surrogate predicted saltwater concentrations. However, it is noteworthy that models M1, M2, M3, M4, M9, M10, M12, and M14 exhibited greater efficiency in comparison with other surrogate models. Furthermore, Fig. 7 illustrates that the prediction accuracies of models M1 and M2 surpassed those of the other models, while the performance of model M5 was the poorest among all the models. These performance variations may be attributed to the dependence of model performance on the numerical values and data properties used for training. In addition, optimal parameter sets for the selected best-performing models, determined by the ASHA optimization algorithm, can impact performance. However, it is important to emphasize that each iteration of the ASHA optimization used similar optimization parameters and the same random seeds in the implementation engine (MATLAB) to minimize model selection errors. Therefore, it becomes apparent that the training data (input–output training patterns) at various MLs considerably affect the effectiveness of the selected best-performing models.

Fig. 7
figure 7

Absolute error boxplots of the models selected at different monitoring locations. M1, M2, M3,…, M16 represent the top-performing models selected for the monitoring locations ML1, ML2, ML3,…, ML16. A horizontal line inside each box indicates the median absolute error between the actual and predicted salinity concentrations for the respective prediction model. The black circle denotes the mean absolute error, while the ‘ × ’symbols represent the outliers

Performance comparison of the proposed AutoML approach with ANN, GPR, and SVR models

Figure 8 presents a comparison of the performance of the best-selected models against the subjectively chosen ANN, GPR, and SVR models using the same unseen test data set across three monitoring locations (ML6, ML11, and ML16) with respect to R, NS, and RMSE criteria. The results reveal that the proposed AutoML approach outperforms the subjectively selected ANN, GPR, and SVR models based on the computed performance evaluation criteria. This performance-centric analysis highlights the superiority or parity of the proposed AutoML approach over subjectively chosen models, offering valuable insights into the evolving landscape of machine learning model selection and optimization. These findings have the potential to redefine best practices in predictive modeling.

Fig. 8
figure 8

Performance comparison of the proposed AutoML approach with ANN, GPR, and SVR models

Practical implications

The practical importance of this research resides in its capability to demonstrate the capability of an AutoML-based ML approach to predict seawater intrusion in coastal aquifer systems. The proposed AutoML-based prediction models (surrogates) can be used as computationally viable proxies of the complex simulation models in a combined S–O method to create management plans for regional-scale coastal aquifers within intricate aquifer systems. To demonstrate this, a management model aimed at controlling seawater intrusion was developed by integrating the selected best performing models at the 16 MLs with a controlled elitist multiple objective genetic algorithm (CEMOGA) (Deb and Goel 2001). The suggested management model takes into account 168 decision variables that encompass spatiotemporal water extraction rates. Specifically, variables X-1 to X-129 denote groundwater extraction from pumping wells, while variables X-130 to X-168 pertain to barrier extraction bores. For instance, variables X-1 to X-3 indicate groundwater extraction from the pumping well P-1 during the first, second, and third time periods, respectively. Similarly, variables X-4 to X-6 represent groundwater extraction from the pumping well P-2, and so forth, spanning three designated time periods. On the other hand, variables X-130 to X-132, X-133 to X-135, X-136 to X-138, and so forth, up to X-166 to X-168, correspond to groundwater withdrawal from barrier extraction wells B-1, B-2, B-3, and so on, throughout the initial, second, and third time periods, in that order.

The 16 selected models, which predict salinity at specified MLs, were integrated into CEMOGA to forecast salinity within the combined S–O framework for the development of the seawater intrusion management model. In addition, these externally linked surrogate models were employed to verify the constraint satisfactions concerning the highest salinity levels that are permitted at particular MLs. With CEMOGA, the highest generation size was 16,000, the crossover percentage was 0.92, the population size was 2000, and the Pareto front population fraction was 0.70. The value of function tolerance was configured at 1 × 10–5, while the constraint tolerance configured at 1 × 10–4. The best parameters for CEMOGA were selected after several trials, even though an exhaustive sensitivity assessment was not performed. To find the optimum solution, the optimization procedure examined 116,001 functions across 116 generations. It took 142 min for the optimization routine to reach to the optimal solutions. The management model offered optimal solutions for the seawater intrusion management issue using a Pareto optimum front comprising various feasible options that demonstrate the trade-off between opposing goals. Figure 9’s Pareto optimum front illustrates that increasing water extraction from barrier wells can lead to higher water withdrawal from pumping wells.

Fig. 9
figure 9

Pareto optimal front

The results obtained from our solution indicate that this approach is not only feasible but also reasonably accurate. Importantly, this methodology can potentially be adapted for use in other aquifer systems along the coast. However, it is essential to note that for every new application, the seawater intrusion models must undergo calibration and validation specific to the research site. In summary, even though initially constructed for a particular aquifer system, our suggested approach holds promise for broader application across different coastal aquifer contexts.

The findings presented in the manuscript hold broader implications that extend beyond the immediate scope of saltwater intrusion prediction. The AutoML methodology developed in this study demonstrates its adaptability and potential applicability to various fields within ML-based and environmental modeling. With the proposed modeling approach's enhanced predictive accuracy and efficiency, it could be employed in other environmental studies, hydrogeological assessments, and water resource management. The insights gained from this research contribute to the advancement of ML-based techniques, fostering their integration into diverse scientific domains for more effective and informed decision-making processes. Its adaptability allows for application in various environmental prediction scenarios, such as groundwater contamination in different geological settings or the forecasting of other complex hydrological phenomena. This cross-disciplinary applicability underscores the broader impact and significance of the research findings, opening avenues for advancements in predictive modeling and decision support systems across diverse scientific and environmental disciplines.

In addition, the proposed computationally effective proxies (AutoML-based prediction models) of the numerical simulation code, FEMWATER can be further employed to develop management strategies in other related fields of hydrology and water resources management. Furthermore, prediction models developed using the proposed AutoML technique can be integrated with various multiple objective optimization algorithms to obtain Pareto optimal solutions. These solutions will aid decision-makers in selecting the best alternatives from a range of feasible options. The proposed methodology can be extended to other conditions by employing the trained and validated prediction models on new, unseen data sets representing those conditions. This process will also confirm the generalization capability of the developed AutoML-based saltwater prediction models. This has been performed by testing the model’s generalization capability on an unseen test data set for this case study.

Validation of the surrogate-based coupled S–O approach

The accuracy of the surrogate-based saltwater intrusion management model was evaluated by monitoring the adherence to defined constraints. It was observed that the saltwater concentrations derived from the optimization model’s solution, facilitated by surrogate models within the optimization framework, consistently remained below the predetermined maximum allowable levels at all monitoring sites. This indicates the fulfillment of imposed constraints, with no violations detected throughout the optimization process. Furthermore, the obtained saltwater concentrations closely aligned with the specified values, suggesting convergence of the optimization model towards the upper bounds of the imposed constraints.

In the second stage, the efficacy of the optimal groundwater abstraction strategies derived from the optimization model was validated through comparison with results obtained from FEMWATER. Five solutions were randomly selected from various regions of the Pareto optimal front for this analysis. Table 8 presents a comparison between the saltwater concentration values predicted by the surrogate model and those simulated by FEMWATER, utilizing the optimal groundwater abstraction strategies prescribed by the saltwater intrusion management model. The results for 5 monitoring locations are displayed in Table 8, with similar trends observed at other monitoring locations. It is evident from Table 8 that the solutions yielded by the FEMWATER model closely aligned with the predictions of the surrogate model.

Table 8 Surrogate model predicted and FEMWATER simulated salinity concentrations calculated from optimal solutions of groundwater abstraction

The percentage relative errors between the surrogate predicted and FEMWATER simulated salinity concentrations using five randomly selected optimal groundwater abstractions is presented in Fig. 10. The relative percentage errors, consistently below 3% in all instances, indicate a highly satisfactory constraint satisfaction outcome and a strong correlation between the simulated and computed groundwater head values. This finding aligns well with the results reported by Roy and Datta (2017b), who suggested that a relative percentage error of less than 5% is deemed suitable and efficient for achieving an optimal management strategy using an integrated S–O approach.

Fig. 10
figure 10

Percentage relative errors between the surrogate predicted and FEMWATER simulated salinity concentrations using optimal groundwater abstractions

Conclusions

Utilizing a combined S–O approach to devise an optimum groundwater withdrawal strategy has proven effective in mitigating seawater intrusion within coastal aquifers. While various ML-based surrogate models have recently emerged as computationally efficient alternatives to simulation models for addressing seawater intrusion issues, there has been a notable absence of studies focusing on the automatic selection of these models through optimization algorithms. This study fills this gap by demonstrating the applicability of the ASHA optimization algorithm to automatically select the surrogate models that can aid in developing surrogate assisted combined S–O method to solve complex coastal aquifer management issues. In our investigation, we employed FEMWATER, a numerical code that relies on finite-element methods and is designed for three-dimensional simulations of integrated flow and salt transport. This code was employed to model and analyze the mechanisms behind seawater intrusion within an aquifer system situated in the coastal areas of Bangladesh. To support our analysis, a diverse set of input data was gathered from various sources, encompassing a study area of approximately 598 square kilometers.

To address the challenges of selecting the most appropriate prediction models, this study proposes an innovative approach for selecting the best surrogate models using ASHA hyperparameter tuning method to perform automated model selection. Notably, this study is the first to utilize the ASHA algorithm for automating the model selection process to provide accurate predictions of salinity intrusion. Top-performing models at the 16 MLs were identified. The best models chosen for different monitoring locations demonstrated exceptional performance, demonstrating high accuracy with metrics, such as accuracy (= 1), R (~ 0.99), NS (~ 0.99), IOA (~ 0.99), and KGE (~ 0.99), all close to 1 and indicative of excellent model accuracy. In addition, the models exhibited a narrow range of RMSE values, ranging from 0.0003 to 1.4987 mg/L. This limited range is considered favorable for any predictive modeling approach. Moreover, the identified top-performing models at diverse monitoring locations can be utilized to formulate effective management strategies for determining optimal groundwater pumping rates to mitigate saltwater intrusion. In addition, the prediction models identified through the proposed AutoML technique can be seamlessly integrated with various multiple objective optimization algorithms, resulting in the derivation of Pareto optimal solutions. These solutions, characterized by a balance of trade-offs, will assist decision-makers in the selection of optimal alternatives from a spectrum of feasible options. The evaluation findings show that the ASHA optimization algorithm can be effectively employed to create accurate and robust seawater intrusion prediction models, making them suitable for developing a surrogate-assisted seawater intrusion management plan applicable to complex coastal aquifer study areas.

This study makes a significant contribution to the field of hydrogeology and environmental modeling. Unlike existing research, this study introduces a novel AutoML methodology within a machine learning framework for predicting seawater intrusion into coastal aquifers. The approach not only enhances predictive accuracy and efficiency but also demonstrates adaptability for broader applications in machine learning and environmental modeling. Compared to similar research endeavors, this study distinguishes itself by its focus on an AutoML strategy, which streamlines the modeling process and ensures the identification of the most suitable model for the given data set. This innovative methodology contributes to the efficiency and reliability of the predictive models, setting it apart from traditional approaches. Furthermore, the study provides insights into the potential applicability of the developed AutoML technique in diverse fields, fostering advancements in decision-making processes for water resource management.