Introduction

One of the most vital management tools in water resources field is the ability to determine and track the spatial and temporal variations trends of groundwater quality parameters with a high level of confidence. To do this, it is essential that the spatial and temporal distribution of physical and chemical aquifer characteristics are well understood in an aquifer for contamination risk assessment, an issue which does not seem possible without consideration of the spatial relationships of data points (Lee et al. 2007). To this end, geostatistical techniques can be used for this purpose which take the spatial relationships of the distances of the points and also the orientation and values of the variables into consideration. These techniques offer advantages such: an increase in the measurement accuracy of key parameters; and an ability to reduce the sampling density and frequency, which can decrease the cost of sampling programs (Bryan 1988). The geostatistical approach has provided useful techniques for investigating the spatial distribution of data such as groundwater contamination (Arslan 2012; Nas and Berktay 2010). By identifying spatial patterns and undertaking interpolation in locations that lack sufficient information, interpolation analysis can estimate the input parameters of random points in network grid cells using the observed data and, therefore, play an important role in sustainable groundwater management (Arslan 2012; Shan et al. 2013). A paucity of groundwater data results in significant uncertainty (Liu et al. 2004). Geostatistical techniques have been used in various scientific studies in fields such as groundwater assessment, geochemistry, water resources assessment and soil science as a powerful technique to determine the level of uncertainty that is associated with data collection and assessment (Pyrcz and Deutsch 2014; Chilès and Delfiner 2012; Arslan 2012).

The interpolation methods in geographic information system (GIS) are powerful tools for data interpolation and the prediction of values (Gong et al. 2014). Linear geostatistical methods such as conventional kriging methods are not appropriate due to the smoothing effect which resulted from equation exponents and they also require the calculation of a variogram. Their use, therefore, is often not practical (Chilès and Delfiner 2012; Costa 2003). Nonlinear geostatistical methods, which do not contain the weaknesses of requiring equation exponents and smoothing effects are recommended for the estimation of renewable resources and mapping their spatial characteristics (Emery 2008; Mare'chal 1984).

For example, Martinez-Cob (1996) utilized the three geostatistical interpolation methods of ordinary kriging (OK), co-kriging (Co-K) and modified residual kriging (MRK) for the interpolation of the long-term average of annual reference evapotranspiration and precipitation and found that none of these methods outperform others. Khattak et al. (2014) utilized ordinary and Bayesian kriging (BK) methods to determine the salinity levels of shallow wells in Pakistan and concluded that the second method shows more accuracy. Yao et al. (2014) studied eight interpolation methods to identify the spatial distribution of groundwater levels and showed that the kriging method yielded the best results. Afzal et al. (2015) used ordinary multi-Gaussian kriging (OMK) and simple multi-Gaussian kriging (SMK) for modeling the spatial distribution of iron based on subsurface data in the iron ore deposit. The results indicated that the SMK method was more accurate in moderately and weakly iron mineralized zones, and that the OMK method was more accurate in enriched, highly iron mineralized zones.

Arslan and Turan (2015) compared the interpolation methods of inverse distance weighting (IDW), radial basis function (RBF) and OK to estimate the spatial distribution of17 heavy metals of groundwater. They found that all of these methods were suitable for estimating the distribution of one or several specific metals. Mirzaei and Sakizadeh (2015) reported the use of IDW, OK, and empirical Bayesian kriging (EBK) interpolation methods in the estimation of 17 groundwater contaminant variables. The results of this investigation indicated that EBK possessed a slightly higher accuracy compared with the two other methods. Plouffe et al. (2015) used IDW, thin plate spline (TPS), OK, and BK methods for the measurement of monthly rainfall in the low and heavy rain months for a 2-year period for minimum and maximum annual rainfall values. They found that BK and TPS methods showed a higher accuracy in the estimation of minimum and maximum rainfall values, respectively. Babu (2016) employed the trend surface (TS), IDW, OK, universal kriging (UK), and Spline methods to assess the spatial and temporal patterns of rainfall. The results of this investigation showed that the TS method demonstrated more accuracy due to the minimum error between the observed and estimated rainfall. However, the OK and UK methods were identified as being the best methods, despite the higher root mean square error (RMSE) values, since they produced a smooth and more realistic interpolated surface. Gol et al. (2017) compared the block kriging, Co-K, and IDW methods with the power of 1, 2, and 4 to estimate the spatial distribution of soil carbon and demonstrated that the Co-K method yielded better results. To estimate electromagnetic levels, Rufo et al. (2018) utilized the spline, IDW, and OK methods after optimizing the characteristic parameters of each method. They reported a higher accuracy using the IDW method. Njeban (2018) investigated the use of the RBF, simple kriging (SK), UK, OK, and IDW methods to measure the spatial distribution of groundwater levels and to produce groundwater level and prediction standard error maps, and found that the UK methods was the method that produced the best results.

However, to date, no study has provided a precise evaluation of how interpolation methods are affected by the characteristics of aquifers that are under investigation. Consequently, this study was undertaken to compare linear and nonlinear interpolation methods to provide a cohesive instruction and guideline for the achievement of the most accurate method based on aquifer type and its characteristics. In this study, the total dissolved solids (TDS) parameter, which is the most widely used parameters for assessing groundwater quality, was measured in two shallow aquifers with different hydrogeological characteristics which are located in two important geographical regions in Iran. The aquifers that were assessed using interpolation techniques were the shallow aquifer that underlies the Lenjanat plain of Isfahan, and the shallow aquifer beneath the Babol–Amol plain of Mazandaran area. The study was undertaken using a combination of 24 different interpolation methods including the inverse distance weighted (IDW), OK, lognormal ordinary kriging (Log_OK), universal kriging (UK), disjunctive kriging (DK), EBK, simple kriging (SK), natural neighbor (NN), trend surface (TS) and Spline interpolation methods with seven error parameters in cross-validation of all sampling wells. In the end, the best interpolation method was introduced based on the aquifer characteristics.

Methodology and materials

Study area

In this research, two shallow aquifers have been studied: one is located in the center of Iran and another one located in the north of Iran near the Caspian Sea shore. A location map of the study areas is illustrated in Fig. 1.

Fig. 1
figure 1

Location of the study area with spatial distribution of sampling wells: a Lenjanat b Babol–Amol

Lenjanat aquifer

Area description

This aquifer, has an area of 1189 km2 and is located between longitudes 51° 2′ to 51° 47′ and latitudes 31° 53′ to 32° 31′, in the southwestern part of Isfahan province in the center of Iran. The average annual precipitation and temperatures in the area are equal to 209 mm and 12.46 °C, respectively.

One of the most important rivers in Iran, the Zayandehrud River, in an easterly direction through the plain and has an average discharge rate of 42 m3/s. Furthermore, the Salt River is located in the southern half of the region with an average discharge of 12.2 million cubic meters (MCM) per year that is used for agriculture.

Based on groundwater potentiometric head contours and the topography of the area, the groundwater discharges into the Zayandehrud River in a 26 km reach from the entrance to the Lenjanat aquifer. On the other hand, this river recharges the aquifer in the remaining 17 km due to the high rate of groundwater withdrawal in the area.

Two of the most important industrial facilities in the country are located in this area. These are Isfahan and Mobarekeh Steel Co. which has 28,000 employers). The main and traditional activity of the population is agriculture and consequently 400,000 ha of the study area is used for the irrigated cultivation of wheat, barley and rice.

The water demand of the plain is supplied by groundwater abstraction from the underlying aquifer, from rivers, from karst springs, and from Qanats (sloping underground channels in an aquifer).

The Lenjanat aquifer is the most significant water source as groundwater pumped from this aquifer supplies 70% of the water demand for the industrial, agricultural and domestic sectors in the study area. A map of the study area is illustrated in Fig. 1a.

Hydrogeological characteristics

The Lenjanat aquifer consists of alluvial sediments and is unconfined. The main source of its recharge is precipitation. The bedrock of the Lenjanat aquifer is Jurassic shale and, in some areas, Cretaceous limestone. The deepest alluvial layers that are deposited at the top of the bedrock are fine clay-marl sediments, which have a higher percentage of salt than other sediments. This layer is highly impermeable due to its degree of compaction.

Evaporite minerals have developed in the form of gypsum and salt layers and crystals within these sediments. The percentage of salt and clay in the sediments increases in the lower layers. The thickness of young alluvial sediments gradually increases at the intersection of flood channels towards Zayandehrud. In the middle areas of the plain, the thickness of sediments reaches more than 150 m. Based on geophysical studies, the average saturated thickness of the aquifer is about 50 m.

Despite the anisotropy of alluvial sediments beneath the plain, impermeable horizons are not observed among discontinuous sediments and subsurface information indicates the existence of an unconfined aquifer in the Lenjanat plain. The geological map of this area is shown in Fig. 2a.

Fig. 2
figure 2

Geology maps of: a Lenjanat aquifer b Babol–Amol aquifer

The transmissivity of aquifer varies from 600 to 2000 m2/day and average specific yield is 2%. The spatial variation of transmissivity is illustrated in Fig. 3a.

Fig. 3
figure 3

Transmissivity spatial distribution maps of: a Lenjanat aquifer b Babol–Amol aquifer

The depth of groundwater level varies from 2 m near the Zayandehrud River to about 50 m in the center of the plain. The hydraulic gradient varies between 3 and 21 m/1000 m. The groundwater level map and streamlines are shown in Fig. 4a.

Fig. 4
figure 4

Groundwater-level maps and streamlines of: a Lenjanat aquifer b Babol–Amol aquifer

In this study area, the aquifer output from the pumping wells, spring, and Qanat for industrial, agricultural and domestic water demands are 132, 1.4, and 57 MCM, respectively. A 6 MCM negative groundwater budget was calculated in this aquifer for the study year.

Based on geochemical evidence, the variation of EC depends on partial dissolution of evaporite minerals in the aquifer sediments.

Due to the presence of calcareous formations in sediments that enclose the plain, groundwater in the aquifer is dominantly a calcium bicarbonate composition type.

The dissolution of halite in aquifer sediments leads to the evolution of the anionic composition of groundwater along flowpaths, with a progressive change from bicarbonate dominance to sulfate and then to chloride dominance with the formation of Cl–Na facies groundwater. The release of sodium ions into groundwater due to the dissolution of halite and the contact of groundwater with alluvial marl sediments causes Na absorption and the release of Ca into groundwater and the formation of Cl–Ca facies in the middle of the plain. Recent geochemical changes in this part of the aquifer have caused the groundwater electrical conductivity (EC) to increase by more than 11,000 µmoh/cm.in a few points.

The EC of groundwater in this area varies from 700 to 8500 µmoh/cm. It usually decreases during spring rainfalls. The total dissolved solids (TDS) data were calculated from EC measurements using the following equation (Rhoades et al. 1999):

$$ {\text{TDS}}\left( {{\text{ppm}}} \right) = 0.64 \times {\text{EC}}\left( {{{\upmu moh/cm}}} \right), $$
(1)

The TDS level map (ppm) including main rivers of aquifer is shown in Fig. 5a.

Fig. 5
figure 5

TDS-level maps with main rivers of: a Lenjanat aquifer b Babol–Amol aquifer

The Babol–Amol aquifer

Area description

This aquifer has an area of 1445 km2 and lies between longitudes 52°11′ to 52° 52′ and latitudes f 36° 18′ to 36° 44′ It is located in a coastal region (Caspian Sea Coast) of the Mazandaran province in the north of Iran. The average annual rainfall and temperatures of the region are 870 mm and 17.9 °C, respectively.

Three main rivers, the Babol, Haraz, and Talar rivers, flow from the south to the north of the plain with average discharge rates of 493, 940, and 311 MCM/year, respectively. The main sources of these rivers, are snowmelt and precipitation. After the irrigation of a large area of agricultural land, these rivers enter the Caspian Sea. The main and traditional activity of the local people is agriculture and horticulture with a 101,000 ha irrigated area that is cultivated for rice and oranges. Both groundwater and water from the rivers are used as sources of irrigation water. Rivers in the area are both sources of groundwater recharge and receive groundwater discharge.

The Babol–Amol aquifer plays an important role in this area as the supplier of 63% water demand in the agricultural–horticulture and domestic sectors. The general map of the study area is illustrated in Fig. 1b.

Hydrogeological characteristics

The Babol–Amol aquifer is alluvial and unconfined. The main source of its recharge is precipitation and leakage through riverbeds. The bedrock of this aquifer is clay.

The alluvial fan deposits are usually coarse-grained and include rubble, sand, which are alternated with fine-grained sediments such as clays and silts. The granulation and material of discontinuous sediments in lateral expansion and depth are very different in this plain. The western regions of the Babol–Amol aquifer are affected by the Haraz River and have created a deep basin with a large thickness of alluvial sediments, which has been extended to the Caspian Sea. The sediment-resistant zones in the central and eastern part of the area have been formed under the influence of Sajadrud alluvial fans and the Babol and Talar Rivers. This basin has a limited expansion compared to the Haraz sedimentary basin. The geological map of the area is shown in Fig. 2b.

According to the results of pumping tests, the transmissivity of the Babol–Amol aquifer varies from 146 to 1512 m2/day and the average specific yield is 6%. The western part of the plain has a higher transmissivity compared to the eastern part due to the expansion of alluvial of the Haraz River. The spatial variation of the transmissivity of the aquifer in the area is shown in Fig. 3b.

The depth of the groundwater level varies from 0.5 to 37 m and the hydraulic gradient variation in this aquifer is between 2.4 and 5.5 m/1000 m. Based on the geophysical survey results, the average saturated thickness of the Babol–Amol aquifer is about 80 m. The groundwater levels and streamlines are illustrated in Fig. 4b.

In the study area, aquifer output from pumping wells for agricultural and domestic water demands are 323 MCM and a there is a 69 MCM negative average groundwater budget during a period of 12 years with a groundwater level decline of 0.8 m.

In terms of hydrogeochemistry, the EC of groundwater in the Babol–Amol aquifer varies from 500 to 3000 µmoh/cm, which was transformed into TDS. The TDS level map (ppm) including main rivers of the aquifer is shown in Fig. 5b.

Data sources

The sampling wells are homogenously distributed in the region so that they represent the characteristics of the whole aquifer. The quality sampling well network is suitable and covers the all of aquifer zones. The sampling wells in saturated layer of aquifer are all screened.

The TDS samples which are collected from 72 observation wells of Lenjanat aquifer and 50 observation wells of the Babol–Amol aquifer were obtained from the Isfahan Regional Water Organization, Isfahan Water and Wastewater company, and the Department of Environment of Tehran for a period of 14 water years between 1995 and 2009, respectively. The locations of the observation wells, which were uniformly distributed across the aquifer to indicate the whole area hydrogeological characteristics, are illustrated in Fig. 1.

The statistical and mathematical basics

This study made use of primary concepts of geostatistics such as the definitions of isotropy and anisotropy and different interpolation methods which were used in many studies. For more information refer to Mastroianni and Milovanovic (2008) and Duffy and Germani (2013). As depicted in Table 1, the methods used in this study including IDW, OK, Log_OK, UK, DK, EBK, SK, NN, TS, and Spline were identified as the superior methods in prior studies. This section only provides an explanation of the Log_OK method which is the most accurate interpolation method of this study.

Table 1 Superior methods of the previous studies

Lognormal ordinary kriging (Log_OK)

Despite the nonlinear interpolation methods, skewed data cannot be used in linear interpolation, because there would be incompatibility of variance with the average. The solution of this problem is a proper approach based on transformation data (Roth 1998). It is better to use one of the linear interpolation methods such as OK to estimate the data after normalization using the appropriate transformation method. In next step, the estimated values are transformed into real values using the back-transformation approach (Isaaks and Srivastava 1989).

If Z(x) represents the value of regional variable at a point with the location of x at unobserved points in a region D, so that the normal distribution of Z(x) is \(Y\left(x\right)=\mathrm{ln}Z\left(x\right), x\in D\), the objective is to estimate the value of the regional variable of Z(x) (Yamamoto 2000). The main step is the transformation of the problem from Z to stationary normal distribution, Y. The value of \(Y\left({x}_{0}\right)\) is estimated as follows:

$$\widehat{Y}\left({x}_{0}\right)=\sum_{i=1}^{n}{\lambda }_{i} \mathrm{ln}Z\left({x}_{i}\right)=\sum_{i=1}^{n}{\lambda }_{i} Y\left({x}_{i}\right),$$
(2)

where \(\widehat{Y}\left({x}_{0}\right)\) is the estimated value of \(Y\left({x}_{0}\right)\), \({\lambda }_{i}\) is the weight of kriging, and n is the total observed data.

Back-transformation of \(\widehat{Y}\left({x}_{0}\right)\) to \(\widehat{Z}\left({x}_{0}\right)\) is skewed based on the following equation:

$$\widehat{Z}\left({x}_{0}\right)=\mathrm{exp}\left(\widehat{Y}\left({x}_{0}\right)+\frac{{\sigma }_{Y,K}^{2}}{2}-{m}_{Y}\right),$$
(3)

where \(\widehat{Z}\left({x}_{0}\right)\) is the estimated value of \(Z\left({x}_{0}\right)\), \({\sigma }_{Y,K}^{2}\) is the kriging variance of Y, and \({m}_{Y}\) is the Lagrange multiplier value (Cressie 1993). A detailed explanation of this method is provided in the reference books of Journel and Huijbregts (1978) and Dowd (1982).

Accuracy analysis of different interpolation methods

Cross validation is one of the most widely used methods to compare the performance of interpolation methods (Yao et al. 2014). In this method, one of the observed points \(\left(Z\left({x}_{i}\right)\right)\) from the set of input data \(\left\{Z\left({x}_{1}\right),\dots .,Z\left({x}_{n}\right)\right\}\) is eliminated. Then, the value of the variable at location \({x}_{i}\) \(\left({Z}^{*}\left({x}_{i}\right)\right)\) is estimated using other data with one of the interpolation methods. This process must be repeated until all input data is estimated (Kitanidis 1993).

In this study, the cross-validation of all sampling wells was used to estimate the best interpolation method. Then seven error criteria were calculated using mean error (ME), mean bias error (MBE), mean absolute error (MAE), mean relative error (MRE), mean squared error (MSE), root mean squared error (RMSE), Nash–Sutcliffe efficiency (NSE) and percentage BIAS (PBIAS):

$$\mathrm{ME or MBE}=\frac{1}{n}{\sum }_{i=1}^{n}\left({Z}^{*}\left({x}_{i}\right)-Z\left({x}_{i}\right)\right),$$
(4)
$$\mathrm{MAE}=\frac{1}{n}{\sum }_{i=1}^{n}\left|{Z}^{*}\left({x}_{i}\right)-Z\left({x}_{i}\right)\right|,$$
(5)
$$\mathrm{MRE}=\frac{1}{n}{\sum }_{i=1}^{n}\left|\frac{{Z}^{*}\left({x}_{i}\right)-Z\left({x}_{i}\right)}{Z\left({x}_{i}\right)}\right|,$$
(6)
$$\mathrm{MSE}=\frac{1}{n}{\sum }_{i=1}^{n}{\left[{Z}^{*}\left({x}_{i}\right)-Z\left({x}_{i}\right)\right]}^{2},$$
(7)
$$ {\text{RMSE}} = \left[ {\frac{1}{n}\mathop \sum \limits_{{i = 1}}^{n} \left[ {Z^{*} \left( {x_{i} } \right) - Z\left( {x_{i} } \right)} \right]^{2} } \right]^{{1/2}}, $$
(8)
$$\mathrm{NSE}=1-\frac{{\sum }_{i=1}^{n}{\left[Z\left({x}_{i}\right)-{Z}^{*}\left({x}_{i}\right)\right]}^{2}}{{\sum }_{i=1}^{n}{\left[Z\left({x}_{i}\right)-O\right]}^{2}},$$
(9)
$$\mathrm{PBIAS}=\frac{{\sum }_{i=1}^{n}\left[Z\left({x}_{i}\right)-{Z}^{*}\left({x}_{i}\right)\right]}{{\sum }_{i=1}^{n}Z\left({x}_{i}\right)}*100,$$
(10)

where \({Z}^{*}\left({x}_{i}\right)\) is the interpolated value for \(i\mathrm{th}\) well, \({Z}^{ }\left({x}_{i}\right)\) is the observed value for \(i\mathrm{th}\) well, \(n\) is the number of all wells, and O is the average value for all the observation wells (Xie et al. 2011).

Due to the dependence of these values to data scale, the use of standardized error is recommended (Krivoruchko 2011). The best fit model is the one in which the standard error of ME, MSE, and PBIAS is close to zero, NSE is close to 1, and contain the lowest values in RMSE, MAE, MRE.

Study procedure

This study made use of ArcGIS 10.5 software in the following processes:

Data analysis

Spatial autocorrelation of data

Data semi-variograms were produced by the GIS software using the OK method. If the semi-variogram of the spatial data increases regularly with the increase in the distance until it reaches the sill, the data will have an appropriate spatial correlation.

Removing local and global outliers

Usually, in the presence of global outliers, the semi-variogram is divided into two completely separate parts consisting of upper sparse and dense points. To detect global outliers, first the sparse points at the upper layer of semi-variogram were selected and their locations were marked on the map. Then, one or two specific points, in which all the selected points were joined, were recognized. These specific points were global outliers which were removed due to the lack of logical reasoning and the measurement error (Krivoruchko 2011).

Removal of the local outliers was performed by the clustering and Voronoi section of the GIS software. A Voronoi polygon was made around each of the TDS observation wells. Considering the range of data variability, the polygons were subdivided into several clusters. In the absence of common cluster between each Voronoi cell and neighbor polygons, that cell is potentially an outlier and is shown in grey. After the analysis of the hydrogeological characteristics of observation wells located in the grey Voronoi polygon, in case of the absence of logical reasoning, these points were considered as local outliers and were removed from the network.

Data trend identification

The identification of the data trend is important for the selection of an interpolation method. Therefore, the trend analysis of the data in two north–south and east–west directions was performed.

Interpolation and providing spatial distribution maps of groundwater contamination

In addition to the type of interpolation method, the method parameters including the number of effective neighboring points for estimating unknown points, the type of semi-variogram model, the size of the lags of the semi-variogram model, the number of lags, the overlaying factor, the number of simulations, etc. are influential in determining the accuracy of the results. To this end, after processing the data and optimizing the parameters related to each method, TDS interpolation was performed with cross-validation of the different interpolation methods. Subsequently, TDS spatial distribution maps were generated to compare the difference between the contamination estimations in different interpolation methods, and also to select the best method. Eventually, the superior method was selected based on error criteria and was used for generating the TDS spatial distribution maps after determining its compatibility with the hydrogeological characteristics of the aquifer. Based on the standards of the World Health Organization (WHO), the TDS threshold selected was 1200 parts per millions (ppm). The flowchart of the study methodology is provided in Fig. 6.

Fig. 6
figure 6

Flowchart of the proposed methodology for groundwater quality assessment

Results and discussion

The produced maps using 24 different linear and nonlinear interpolation methods are demonstrated in Fig. 7. These methods were used to assess TDS levels in a total of 122 observation wells in two different characteristics shallow aquifers of Lenjanat and Babol–Amol, so that the obtained results would be able to demonstrate the different conditions of the aquifers in general. The produced maps show a high concentration of TDS in the northern, eastern, and central boundaries of the Lenjanat aquifer, which was justifiable regarding the location of the industrial sites, factories, cemeteries and wastewater plants of Zarinshahr and Safaeiyeh which are generally in the east-northern and central parts of the Lenjanat plain. Spatial distribution maps indicated TDS concentrations in the northern, northeastern, and central parts of the Babol–Amol aquifer. The results made sense with the regional geological formations and the land uses in the Qaemshahr city neighborhood in the south of Babol. As predicted, the quality of groundwater decreased with the flow of water into downstream of the aquifer which was mainly the result of the leach-scour of the municipal and agricultural sewage which conformed to the TDS spatial distribution maps of Fig. 7. Besides, the reduction of groundwater quality downstream of the studied areas was congruent with distribution of the population which is greater in downstream areas than elsewhere in the study areas.

Fig. 7
figure 7figure 7figure 7figure 7figure 7figure 7

Interpolated maps for TDS using different interpolation methods of: a Lenjanat aquifer b Babol–Amol aquifer (the legend indicates the predicted TDS concentration)

Table 2 illustrates the criteria of standard cross validation of the TDS parameter related to different interpolation methods in two shallow aquifers of Lenjanat and Babol–Amol with different hydrogeological characteristics. In the Lenjanat shallow aquifer, Log_OK had the best results concerning the RMSE value. Next comes the IDW methods with the respective power of 4 and 3 given the lowest RMSE values compared to the other methods. \(\left|\mathrm{ME}\right|\) near to zero demonstrated the less skewedness of the data. Spline-t, SK, and Log_OK with the lowest \(\left|\mathrm{ME}\right|\) values yielded better results, respectively. Log_OK produced the best result with the lowest values of MAE and MSE. Log_OK, NN, and IDW_4, with NSE close to 1, contained the best prediction accuracy, respectively. IDW_4, Log_OK, and IDW_3, with the lowest MRE values, were identified as the best methods, respectively. Spline-t with the lowest \(\left|\mathrm{PBIAS}\right|\) yielded the best results. After that, minimal \(\left|\mathrm{PBIAS}\right|\) was obtained by SK and Log_OK, respectively. The weight parameter had a significant effect on the accuracy of interpolation so that the more IDW power leads to lower values of RMSE, \(\left|\mathrm{ME}\right|,\) MAE, MRE, MSE and PBIAS and greater NSE.

Table 2 Cross-validation performance and different interpolation methods ranking

In the Babol–Amol shallow aquifer, Log_OK produced the best result with the lowest amount of RMSE, MAE, MRE, and MSE. Trend-1 yielded the most accurate result with the lowest values of ME. EBK-emp and EBK-logemp came in the next rank, respectively. Log_OK and EBK-logemp, with NSE closer to 1, demonstrated the best results. The lowest \(\left|\mathrm{PBIAS}\right|\) belonged to Trend-1. With the increase in the IDW power, there was a decrease in the values of MAE and MRE and an increase in NSE.

As a result, the Log_OK method provided more accurate results with 57% of the error criteria in the Lenjanat shallow aquifer and 71% in the Babol–Amol shallow aquifer. However, the IDW method also had a reliable performance.

Conclusions

The present research, for the first time, compared the accuracy of 24 interpolation methods in estimating the spatial variation of groundwater TDS concentration in two shallow aquifers with different hydrogeological characteristics. The main contributions of this study were as follows:

  1. 1.

    In the Lenjanat shallow aquifer, Log_OK produced the best result with the lowest values of MAE and MSE. Log_OK, NN, and IDW_4, with the NSE closer to 1, contained the best prediction accuracy, respectively. IDW_4, Log_OK, and IDW_3, with the lowest MRE values, were identified as the best methods, respectively. Spline-t with the lowest \(\left|\mathrm{PBIAS}\right|\) yielded the best results. After that, minimal \(\left|\mathrm{PBIAS}\right|\) is obtained by SK and Log_OK, respectively.

  2. 2.

    In the Babol–Amol shallow aquifer, Log_OK produced the best result with the lowest amount of RMSE, MAE, MRE, and MSE. Trend-1 yielded the most accurate result with the lowest values of ME. EBK-emp and EBK-logemp came in the next ranks, respectively. Log_OK and EBK-logemp, with NSE of almost 1, demonstrated the best results. The lowest \(\left|\mathrm{PBIAS}\right|\) belonged to Trend-1. With the increase in the IDW power, there was a decrease in the values of MAE and MRE and an increase in NSE.

  3. 3.

    As a result, Log_OK method provided more accurate results with 57% of error criteria in Lenjanat and 71% in Babol–Amol aquifers. However, IDW method has also a good performance.

  4. 4.

    Furthermore, we can conclude that the interpolation pattern of the TDS concentration in shallow aquifers was not highly affected by different hydrogeological characterization. It could be due to some common points of view in these two shallow aquifers such as: existence of a Salt River in the Lenjanat plain vs. the Babol–Amol Coastal plain, shallow groundwater depth, almost the same groundwater transmissivity, agricultural activities, and a closed fresh river–aquifer interaction.