Introduction

The knowledge of the spatial variability of the water table level in aquifers with limited monitoring provides information to understand the aquifer behaviour at different locations of the basin. This information becomes more important in basins that are under the threat of over-pumping where the water table has dropped significantly. The spatial distribution feedback gives the potential to identify vulnerable locations. The accurate mapping of groundwater levels in an aquifer is important for effective management and monitoring decisions. However, the number and spatial distribution of hydraulic head measurements are not always sufficient to accurately represent the groundwater level in a given aquifer. Estimates at unsampled locations can be obtained by applying geostatistical methods to the available data in order to map the spatial distribution of an aquifer level. In sparsely monitored basins, accurate mapping of the spatial variability of groundwater level requires the interpolation of scattered data.

Ordinary Kriging (OK) bases its estimates at unsampled locations only on the sampled primary variable. OK interpolation is widely used to determine the spatial variability of groundwater levels in hydrological basins e.g., (Ahmadi and Sedghamiz 2007; Buchanan and Triantafilis 2009; Chung and Rogers 2012; Nikroo et al. 2009; Sun et al. 2009; Theodossiou and Latinopoulos 2006; Varouchakis and Hristopulos 2013a). Alternatively, Residual Kriging (RK) and Kriging with External Drift (KED), embody secondary information in a drift term. KED and RK are practically equivalent but differ in the methodological steps used (Hengl 2007; Hengl et al. 2003). RK has been applied to the interpolation of water table elevation using deterministic trend models that include e.g. space polynomials (Neuman and Jacobson 1984), topographic metrics such as surface elevation and topographic index, and distance from riverbed (Desbarats et al. 2002; Varouchakis and Hristopulos 2013b), numerical solutions for the hydraulic head field (Rivest et al. 2008; Varouchakis and Hristopulos 2013b) and rainfall data (Moukana and Koike 2008).

Indicator Kriging (IK) has been widely used mainly for risk assessment of pollutant concentrations in ground and surface waters leading to significant decision support regarding the prevention and/or remediation of certain sites (Anane et al. 2014; Arslan 2012; Chica-Olmo et al. 2014; Liu et al. 2004; Neshat et al. 2015). However, it can be also applied for the risk assessment of groundwater level spatial distribution in arid areas or in those with high aquifer pumping. Demir et al. (2009) used variable groundwater level thresholds to create indicators under rain and irrigation periods presenting the aquifer response. However, the groundwater level scale was different compared to the scale of this work. In addition, the number of thresholds is important if one aims to assess different scenarios and also to calculate the cumulative effect. This work on the other hand, used a single groundwater level threshold that was calculated from the physical characteristics of the basin, which is under water scarcity threat. The role of this limit is significant as it sets the threshold of balance between pumping and groundwater level necessary for the aquifer sustainability. Other similar works applied Indicator Kriging to address soil saturation probability regarding depth to water table (Lyon et al. 2006a), or to produce probability maps a shallow water table to exceed certain level thresholds per rainfall event (Lyon et al. 2006b). However, most of the works used groundwater level as one of many parameters to perform assessment regarding pollution risk. This work differs from all those, as to the authors knowledge there is not any similar work to address aquifer depletion risk.

The current project presents the application of Ordinary Kriging, Residual Kriging (e.g. surface elevation), and Indicator Kriging to predict the groundwater level spatial variability as well as the associated risk considering an aquifer level threshold respectively in a sparsely gauged basin. In this work the hole-effect property of the Spartan variogram model is presented for the first time in measurement data. The specific shape is obtained when its shape parameter receives negative values. This is an important evidence for the functionality of the Spartan variogram model. Furthermore, the set of a sustainable groundwater level limit based on the physical characteristics of a basin is discussed, risk assessment using IK regarding a groundwater level threshold is interpreted as not many similar works exist and a new dataset is used that has not been published before.

Area of Study

The present research focuses on Mires basin of the Messara Valley (Fig. 1) at the island of Crete (Greece). The study basin consists of an unconfined aquifer, is sparsely sampled and has limited groundwater resources which are vital for the area’s ecosystem and agriculture. The mean annual rainfall in Mires basin has been lately estimated around 625 mm. Approximately 65 % of the rainfall is lost to evapotranspiration and 10 % is lost as runoff to the sea, leaving only 25 % to recharge the groundwater store. A detailed hydrogeological and hydrological description of the basin is available in Varouchakis (2016). Knowledge of the spatial variations of the groundwater level is important for developing sound management and monitoring strategies. Over-exploitation during the past 30 years has led to a dramatic decrease, in excess of 35 m, of the groundwater level.

Fig. 1
figure 1

Map of Greece presenting the Messara valley location (red rectangle) in Crete and a topographic map of Mires basin showing the locations of groundwater head measurements along with the corresponding surface elevation and the temporary river path. The black triangles denote the 43 monitoring locations at the basin for the year 2007–08 (modified after Varouchakis and Hristopulos (2013b))

Efficient groundwater management in the basin is crucial in light of predictions based on regional climate change models that show a substantial risk of desertification for Crete. In this work accurate spatial models of the basin’s groundwater level are generated that help to identify the susceptible locations and to provide input for potential groundwater resources management plans. The data used in this research consist of hydraulic head measurements (wet period of 2007–2008 hydrological year) from the 43 monitoring locations that operate in the basin which are unevenly distributed and mostly concentrated along a temporary river. The range of hydraulic heads varies from an extremely low value of 11.45 to 72.93 masl. An initial statistical analysis shows that the head data have skewness and kurtosis coefficients equal to ŝ z  = 0.76 and \( {\widehat{k}}_z=2.80 \) respectively, implying a mild deviation from Gaussian statistics (ŝ z  = 0 and \( {\widehat{k}}_z=3 \) respectively). The data analysis is performed by codes developed and run in the Matlab® programming environment (Matlab v.7.10). Standardized spatial coordinates in the interval [0, 1] are used to avoid numerical instabilities.

Methodology

Skewed or erratic data can often be made more suitable for geostatistical modeling by appropriate transformation. A normal distribution for the variable under study is desirable in linear geostatistics (Clark and Harper 2000). Even though normality may not be strictly required, serious violation of normality, such as too high skewness and outliers, can impair the variogram structure and the Kriging results (Gringarten and Deutsch 2001; Ouyang et al. 2006). Ordinary Kriging is well-known to be optimal when the data have a multivariate normal distribution. Transformation of data therefore may be required before Kriging to normalize the data distribution, suppress outliers and improve data stationarity (Armstrong 1998; Deutsch and Journel 1992). The estimation then is performed in the Gaussian domain before back-transforming the estimates to the original domain. An advantage of the Gaussian distribution is that spatial variability is easier to be modelled because it reduces effects of extreme values providing more stable variograms (Armstrong 1998; Goovaerts 1997; Pardo-Iguzquiza and Dowd 2005). Kriging represents variability only up to the second order moment (covariance), so the random field of the transformed variable should be Gaussian to derive unbiased estimates at unsampled locations (Deutsch and Journel 1992; Goovaerts et al. 2005).

A non-linear normalizing data transformation is applied in conjunction with Kriging for the accurate prediction of groundwater level spatial variability. The Modified Box-Cox (MBC) transformation method has recently been proposed and applied successfully to normalize hydraulic heads and residuals (Varouchakis et al. 2012). This recently proposed method focuses on normalizing the skewness and kurtosis coefficients of the data, but it neglects higher-order moments (Varouchakis and Hristopulos 2013b). It is defined by the following function,

$$ \begin{array}{cc}\hfill y:={g}_{MBC}\left(z;\boldsymbol{\upkappa} \right)=\frac{{\left(z-{z}_{\min }+{k_2}^2\right)}^{k_1}-1}{k_1},\hfill & \hfill {\boldsymbol{\upkappa}}^T=\left({k}_1,{k}_2\right),\hfill \end{array} $$
(1)

where k 1 is the power exponent and k 2 is an offset parameter. Use of the latter allows negative z values and so the transformation can be applied to fluctuations as well. Parameters (k 1,k 2) are estimated from the numerical solution of the equations \( {\widehat{s}}_z=0,\kern0.83em {\widehat{k}}_z=3 \) in the form below, where ŝ z and \( {\widehat{k}}_z \) are the sample skewness and kurtosis coefficients respectively,

$$ {\left(\frac{{\widehat{m}}_z-{\tilde{m}}_z}{\sigma_z}\right)}^2+{\left({\widehat{k}}_z-3\right)}^2\simeq 0. $$
(2)

In the above equation \( {\widehat{m}}_z \) is the sample mean, \( {\tilde{m}}_z \) is the sample’s median and σ z the standard deviation. The minimization is performed using the Nelder-Mead simplex optimization method (Nelder and Mead 1965; Press et al. 1992).

Besides the classical OK interpolation, the prediction of the hydraulic head spatial variability is also performed using RK by incorporating local geographic features, such as the ground surface elevation in the trend function. Previous studies have shown that incorporating such auxiliary information in the trend function improves the accuracy of the spatial interpolation (Varouchakis 2012). Herein, there are two reasons for employing ground surface elevation as an auxiliary variable. The first is the important correlation between surface elevation and groundwater level; at high elevation the groundwater level is also high following a decreasing trend towards lower elevations. The second reason is the application of a tool by Haitjema and Mitchell-Bruker (2005). This tool examines if the aquifer is recharge or topography controlled. The equation that describes the tool involves the average annual recharge rate R [m/d], the average distance between surface waters, L [m], a factor m between 8 and 16 for aquifers that are strip like or circular in shape respectively, the (horizontal) aquifer hydraulic conductivity, k [m/d], the aquifer thickness, H [m] and the maximum distance between the average surface water levels and the terrain elevation, d [m].

$$ \frac{R{L}^2}{mkHd}\left\{\begin{array}{l}>1,\kern1em \mathrm{topography}\ \mathrm{controlled}\\ {}<1,\kern1em \mathrm{recharge}\ \mathrm{controlled}\end{array}\right. $$
(3)

Although Mires aquifer does not comply exactly with the conditions applied to produce this inequality, an estimate can be obtained. The surface waters in the basin are limited and there is only a main river crossing the plain. Therefore, a distance between surface waters cannot be exactly defined. However, the eastern and western parts of the main river are connected with two tributaries that their in between distance can define the variable L (approximately 15,000 m). On the other hand, the aquifer is partly circular in shape at the up-stream and stripe like at the down-stream. Thus m on average would be set equal to 12. The other variables are based on average aquifer values: k = 50 m/d. R = 0.0004 m/d, H = 21.5 m and d = 10 m (Varouchakis 2015). Solving Eq. (3) the calculated ratio is equal to 0.7 and thus less than 1. This result means that the aquifer is recharge controlled. However, the result is quite close to 1 and one can assume that topography has also a vital role. Thus, considering also the important relation between the elevation and the groundwater level trend (65 %) and based on the absence of information on the spatial distribution of recharge in the basin, one can use the ground surface elevation as an auxiliary variable. Its significance and usefulness will be assessed based on the derived results.

In the following, it is assumed that the water table level is represented by a spatial random field (SRF), Z(s ∈ S), where S is the set of sampling points with cardinal number N. The values of the SRF in a given state will be denoted by lower-case letters. The target is to derive estimates, (s ∈ P) of the water table elevation at the prediction set points, P that lie on a rectangular grid that covers the basin. Therefore, s i , i = 1, …, N denote the sampling points, z(s i ) are the head values (in masl) at these points, and s 0 denotes an estimation point, which is assumed to lie inside the convex hull of the sampling network. For mapping purposes, it is assumed that s 0 moves sequentially through all the nodes of the mapping grid.

Spatial interpolation of the groundwater levels was initially applied by means of OK and RK in combination with MBC normalizing transformation, while IK based on developed indicators from the data was then implemented. In the first approach, a normalizing transformation g(·) is applied to the data. Then, OK is used to predict the transformed field Y(s) = g(Z(s)), and the predictions are back-transformed to obtain head estimates. In the second approach, a trend model m Z (s) is introduced that captures local features. Since the fluctuation SRF, Z′(s) = Z(s) − m Z (s), is non-Gaussian, a transformation g(·) is applied to obtain a normalized SRF, Y(s) = g(Z′(s)). The experimental variogram is then estimated and is fitted to theoretical models. Next, the Gaussian field Ŷ(s ∈ P) is estimated at the prediction points using OK. Finally, head estimates are retrieved from Ŷ(s ∈ P) by applying the back-transformation and adding the trend. Leave-one-out cross-validation analysis was used to determine the optimal spatial model applied to water table level data and to assess the accuracy of the interpolated head field.

Spatial dependence

The variogram is commonly used in geostatistical analysis to measure the spatial dependence between neighboring observations. The omnidirectional empirical (experimental) variogram of the hydraulic head and of the residuals was determined using the method of moments. The empirical variogram was fitted with isotropic classical models (Gaussian, Exponential, Linear, Spherical, and Power-law), the Matérn model (Goovaerts 1997), and the new family of Spartan variograms (3D, 2D models) (Hristopulos 2003; Hristopulos and Elogne 2007).

Spartan Spatial Random Fields (SSRFs) are a geostatistical model (Hristopulos 2002, 2003) inspired from statistical field theory with applications in environmental risk assessment and environmental monitoring (Elogne et al. 2008; Elogne and Hristopulos 2008; Hristopulos and Elogne 2009). SSRFs are generalized Gibbs random fields with an energy functional that is based on local interactions between the field values. The term Spartan indicates parametrically compact model families that involve a small number of parameters. SSRFs provide a new class of generalized covariance functions. The SSRFs covariance functions can be used for spatial interpolation with the classical Kriging estimators. Spartan covariance and variogram functions have been applied to various environmental data sets (Elogne et al. 2008; Varouchakis and Hristopulos 2013b). Herein, the Spartan covariance derived for d = 3 dimensions is applied.

The model parameters are obtained by fitting the SSRFs function (4) to the experimental variogram (Hristopulos and Elogne 2007). The exponential covariance is formed for η 1 = 2, while for |η 1| < 2 the product of the exponential and hole-effect model is obtained. According to Christakos (1991) a covariance function that is permissible in three spatial dimensions is also permissible in two dimensions. The components of the equation are presented and explained in Table 1.

Table 1 Interpretation of the Spartan covariance function components
$$ {C}_z\left(\mathbf{h}\right)=\left\{\begin{array}{l}\begin{array}{lll}\frac{\eta_0{e}^{-h{\beta}_2}}{2\pi \sqrt{\left|{\eta}_1^2-4\right|}}\left[\frac{ \sin \left(h{\beta}_1\right)}{h{\beta}_1}\right],\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\left|{\eta}_1\right|<2,\hfill & {\sigma}_z^2=\frac{\eta_0}{2\pi \sqrt{\left|{\eta}_1^2-4\right|}}\hfill \end{array}\hfill \\ {}\begin{array}{lll}\frac{\eta_0{e}^{-h}}{8\pi },\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\kern0.5em {\eta}_1=2,\hfill & {\sigma}_z^2=\frac{\eta_0}{8\pi}\hfill \end{array}\hfill \\ {}\begin{array}{lll}\frac{\eta_0\left({e}^{-h{\omega}_1}-{e}^{-h{\omega}_2}\right)}{4\pi \left({\omega}_2-{\omega}_1\right)h\sqrt{\left|{\eta}_1^2-4\right|}},\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\kern0.5em {\eta}_1>2,\hfill & {\sigma}_z^2=\frac{\eta_0}{4\pi \sqrt{\left|{\eta}_1^2-4\right|}}\hfill \end{array}\hfill \end{array}\right.. $$
(4)

Spatial models

In spatial linear interpolation methods such as OK and RK, it generally holds that,

$$ \widehat{z}\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i}z\left({\mathbf{s}}_i\right), $$
(5)

where \( {\mathbb{S}}_0 \) is the set of sampling points in the search neighborhood of s 0. The neighborhood is empirically chosen so as to optimize the cross validation measures. The weights λ i are obtained by minimizing the mean square estimation error conditionally on the zero-bias constraint (Cressie 1993) and they depend on the variogram model γ z (r), where r are the spatial lags of the experimental variogram (Deutsch and Journel 1992). The OK estimation variance is given by the following equation, with the Lagrange coefficient μ compensating for the uncertainty of the mean value:

$$ {\sigma}_E^2\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i{\gamma}_z\left({\mathbf{s}}_i,{\mathbf{s}}_0\right)}+\mu . $$
(6)

Overall OK variance is termed as the weighted average of variograms from the new point s 0 to all calibration points s j , plus the Lagrange multiplier.

RK combines a trend function with interpolation of the residuals. Herein, a deterministic trend is applied based on the basin topography. In RK the estimate is expressed as:

$$ \widehat{z}\left({\mathbf{s}}_0\right)={m}_z\left({\mathbf{s}}_0\right)+\widehat{z}^{\prime}\left({\mathbf{s}}_0\right), $$
(7)

where m z (s 0) is the trend function, and ′(s 0) is the interpolated residual by means of OK (Rivoirard 2002). Typically, the trend function is modeled as:

$$ \begin{array}{cc}\hfill {m}_z\left({\mathbf{s}}_0\right)={\displaystyle \sum_{k=0}^p{\beta}_k}{q}_k\left({\mathbf{s}}_0\right);\hfill & \hfill {q}_k\left({\mathbf{s}}_0\right)\equiv 1,\hfill \end{array} $$
(8)

where q k (s 0) are the values of auxiliary variables at s 0, β k are the estimated regression coefficients and p is the number of auxiliary variables (Draper and Smith 1981; Hengl 2007; Hengl et al. 2007). The regression coefficients are estimated from the sample using ordinary least squares (OLS) (Kitanidis 1993). The variance of the estimates follows from the equation (Hengl 2007; Hengl et al. 2003):

$$ {\sigma}^2\left({\mathbf{s}}_0\right)={\sigma}_d^2\left({\mathbf{s}}_0\right)+{\sigma}_f^2\left({\mathbf{s}}_0\right), $$
(9)
$$ {\sigma}_d^2\left({\mathbf{s}}_0\right)={\mathbf{q}}_0^T{\left({\mathbf{q}}^T{\boldsymbol{\gamma}}_{z\prime}^{-1}\mathbf{q}\right)}^{-1}{\mathbf{q}}_0, $$
(10)
$$ {\sigma}_f^2\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i{\gamma}_{z\prime}\left({\mathbf{s}}_i,{\mathbf{s}}_0\right)+\mu }, $$
(11)

where σ 2 d  (s 0) is the drift prediction variance, q 0 is the vector of (p + 1) × 1 predictors at the unvisited location, q is the matrix of (N 0 + 1) × (p + 1) predictors at the sampling points in the search neighborhood (N 0 is the number of points within the search neighborhood of s 0), γ z ' is the variogram matrix of the (N 0 + 1) × (N 0 + 1) residuals at the measured locations (neighborhood) and σ 2 f (s 0) is the kriging (OK) variance of residuals.

Herein a trend model for Mires basin groundwater level data is presented. Following other studies, based on the fact that usually the water table level of an unconfined aquifer follows the elevation trend, secondary information in the trend is considered from a Digital Elevation Model (DEM) of the area (Desbarats et al. 2002; Deutsch and Journel 1992; Goovaerts 1997; Hoeksema et al. 1989; Nikroo et al. 2009; Rivest et al. 2008). This choice was supported by the interpretation provided earlier in the Methodology section, and primarily by the important correlation coefficient between the groundwater level and the ground surface elevation in the basin (R = 0.65, is characterized as important correlation (Tichy 1993)). A scatter plot of ground surface elevation vs groundwater level is presented in Fig. 2.

Fig. 2
figure 2

Scatter plot of groundwater level (m) vs surface elevation (m)

The following expression for the trend of the hydraulic head (in masl) is applied (T-DEM):

$$ {m}_z\left(\mathbf{s}\right)=f\kern0.5em DEM\left(\mathbf{s}\right)+c, $$
(12)

where f, c are linear coefficients and DEM(s) is the local DEM value.

Another interpolation method applied is IK. This is a non-parametric geostatistical method for estimating the probability of a variable to exceed or lie below a specific threshold value at a given location (Goovaerts 1997). In this work, IK is applied for mapping the risk associated with a specified groundwater level limit that could lead to significant problem of groundwater availability. A critical aquifer level limit for the basin’s groundwater resources availability can be set in terms of a statistical and a physical based approach. The first involves the 25th, lower, percentile of the available data values which is equal to 25 m above sea level (m.a.s.l). This value was validated by a physically based approach that involves physical characteristics of the basin. The aquifer capacity has been recently estimated equal to 55 Mm3, the aquifer area equal to 26.1 km2 and the porosity equal to 0.085. Dividing the first two figures and then their result with the porosity the aquifer level is calculated. This is equal to 24.7 m.a.s.l, similar to the lower percentile of the available data. Thus, 25 m.a.s.l can be set as the aquifer level threshold for sustainable groundwater resources management at the basin.

IK is applied to determine the conditional probability at unsampled points based on the spatial dependence structure of indicator-transformed data points with a binary distribution (e.g. 0 and 1). IK proceeds as the classical Ordinary Kriging (the main change is the choice of a cutoff value) with the difference that results is now maps with values between 0 and 1 expressing probability a condition to apply (Chica-Olmo et al. 2014; Deutsch and Journel 1992). Indicator variogram analysis is also performed using the models and the procedure previously stated for OK and RK (Isaaks and Srivastava 1989).

This method does not make assumptions regarding the variables distribution and has the ability to take into account, to a large extent, the uncertainty of the data. The IK is based on the conversion of all data from continuous to a binary form according to a specific threshold value. Thus, is robust to outliers handling. This value can be either a percentile of our data or a default value of marginal importance for the system under study. Subsequently, data with values below the threshold take a value of 1, while the remaining taking a value of 0.

$$ I\left(z\left(\mathbf{s}\right)\right) = \left\{\begin{array}{l}\begin{array}{ll}1,\hfill & z\left(\mathbf{s}\right)\le z^{\prime}\hfill \end{array}\hfill \\ {}\begin{array}{ll}0,\hfill & otherwise\hfill \end{array}\hfill \end{array}\right., $$
(13)

where, I(z(s)) is a binary variable, z(s) is the measured value and z is the cut-off (threshold) value.

Indicator Kriging is a geostatistical method best suited for issues that involve a threshold value (Goovaerts 1999; Isaaks and Srivastava 1989; Webster and Oliver 2001). However, most practical problems that require indicator techniques require well-chosen threshold which have a special significance to the problem being addressed. Probability maps delineate suitable and unsuitable sites regarding the examined issue, while help to take decisions to prevent and/or remediate a site compared to locations with reduced or no risk.

The method proceeds as follows: a) convert the given values to indicators: divide the range evenly or based on different quintiles (q0.25; q0.50; q0.75), b) estimate the indicator variogram, c) apply Kriging using the usual equations and obtain predictions. On the other hand, the methodology has a set of disadvantages such as it will not necessarily provide probabilities to add up to 1 and sometimes the prediction may end up beyond the zero to one interval, e.g. Kriging occasionally provides negative weights-screening effect (Goovaerts 1997; Papadopoulou et al. 2009).

Results and Discussion

The performance of the Kriging-based geostatistical models is evaluated by using the leave one out cross validation technique that is usually applied in small datasets (Witten et al. 2011). A series of well-known statistical measures is employed to compare the true and estimated values of the cross-validation procedure, such as the correlation coefficient R, the Mean Absolute Error (MAE), the Root Mean Square Error (RMSE) and the Mean Absolute Relative Error (MARE).

Anisotropy was investigated by comparing directional variograms in four main directions (Goovaerts 1997) using an angle tolerance of 30°. Smaller tolerance values (15°) do not permit a sufficient number of data pairs (i.e., at least 30) at each lag. According to Journel and Huijbregts ( 1978 ), in order to acquire a good variogram, there should be approximately 30 pairs of distances and values for every lag. In addition the number of lags was reduced in order to achieve the required pair number for directional variograms. As shown in Fig. 3, there is no clear difference among the directional variograms for the original data. In addition a test of geometric anisotropy was performed based on the method of Covariance Hessian Identity (Chorti and Hristopulos 2008; Hristopulos 2002). This method is non-parametric, in the sense that it provides an estimate of the aspect ratio (i.e. the ratio of the two principal correlation lengths) and the orientation of the principal axes, without requiring variogram estimation and modeling. The aspect ratio is thus estimated at 0.75, while the short principal axis is rotated by 8° with respect to the E-W direction. The value of 0.75 does not differ significantly from unity. Indeed, the isotropic hypothesis cannot be rejected with 95 % confidence for ratios in the range [0.70 – 1.07] using the test given (Spiliopoulos et al. 2011). In light of the above analysis, the variogram function of the groundwater level is considered to be isotropic (Ahmadi and Sedghamiz 2007).

Fig. 3
figure 3

Experimental directional variograms of groundwater level fluctuations with 30° angle tolerance along the four main geographical directions, E–W, N–S, NE and NW

The general approach that is used for interpolation applies a normalizing transformation followed by OK on the transformed variables, and it finally back-transforms the predictions. The application of MBC methodology to the initial head dataset improves their normality (Table 2). The normality improvement is also supported by histograms of the data before and after the transformation (Fig. 4).

Table 2 Normalization results using Modified Box-Cox (MBC) transformation: skewness coefficient ŝ z ; kurtosis coefficient \( {\widehat{k}}_z \)
Fig. 4
figure 4

Histograms of the groundwater level data before (a) and after (b) the transformation with MBC

In terms of the spatial model that considers the head data the parameters of the theoretical variogram models tested are obtained by least squares fitting to the experimental omnidirectional variogram of the transformed hydraulic head. The 3D Spartan model gives the best fit in terms of cross validation results (Table 3) while the Spherical and the Matérn variogram come close. The 2D Spartan function did not provide a good fit for this dataset.

Table 3 Cross validation measures for spatial MBC-OK model with optimal variograms: MBC & OK: Ordinary Kriging with modified Box-Cox transformation of data and back-transformation

In the case of spatial model with trend component, RK is applied. RK combines a trend function with interpolation of the residuals. The residuals of the trend model also display deviations from normality that are reduced by means of the MBC transformation (Table 4). Similarly, histograms (Fig. 5) present the normality improvement.

Table 4 Skewness ŝ z and kurtosis \( {\widehat{k}}_z \) coefficients of trend models residuals following modified Box-Cox (MBC) normalization
Fig. 5
figure 5

Histograms of the trend model residuals before (a) and after (b) the transformation with MBC

The omnidirectional experimental variogram is calculated by applying the method of moments to the transformed residuals of the T-DEM model. The Spartan variogram model (Fig. 6) again provides the best fit in terms of cross validation results (Table 5). The Spherical variogram provides similar results to the Spartan model while third best is the Matérn model.

Fig. 6
figure 6

Plot of omnidirectional variogram of residuals (stars) after applying MBC normalization and the best-fitted variogram models. The residuals are derived by applying to the trend the ground surface elevation (T-DEM). The number of pairs used at each lag distance (17 lags) are also shown on the plot

Table 5 Cross validation measures for spatial MBC-RK-T-DEM model with optimal variograms: T-DEM trend using DEM surface elevation

Another method to test data normality improvement, is the non-parametric Kolmogorov-Smirnov test. The test is applied to examine if a sample comes from a reference probability Distribution (Massey 1951). The test was implemented in Matlab® environment using the function «kstest» for both the transformed datasets. The null hypothesis for the Kolmogorov-Smirnov test is that data follows the standard normal distribution. Therefore, the null hypothesis was not rejected for the transformed datasets at significant levels 5 % and of 10 %.

The MBC-RK approach improves significantly the mean absolute prediction error (4.27 masl) by over 1 m compared to the MBC-OK (5.32 masl) approach. In addition the other estimation measures are at least similar (BIAS) but mostly improved (RMSE, R, MARE). Considering overall the cross validation measures the estimates based on the Spartan model prevails compared to the other two optimal models.

The least squares sum for each fitted variogram model is considered, which is an index of optimal fitting, for selecting the optimal variogram model with Indicator Kriging interpolation. Spartan model achieves the best fit (Fig. 7) over the range of lags considered providing a value of 0.023 compared to 0.029 for the spherical and 0.031 for the Matérn models.

Fig. 7
figure 7

Indicator omnidirectional variogram and the best-fitted variogram models. The number of pairs used at each lag distance (18 lags) are also shown on the plot

The T-DEM trend model with RK and the IK methodology are applied to estimate the groundwater level and the probabilities of groundwater level to lie below a threshold value on a 100 × 100 grid defined in normalized coordinate space (actual cell size: 114 × 47 m). In addition the uncertainties of the estimations are also determined on a same grid size. Estimates are obtained only at points that lie inside the convex hull of the measurement locations (7317 grid points). The contour maps in physical space are shown in Figs. 8, 9, 10, and 11. The residuals of the T-DEM model are interpolated using the Spartan variogram model (Fig. 6) with the following optimal parameter values: σ 2 = 17.77, ξ = 0.27 (in normalized units), η 1 = −1.99 while the indicators applying the Spartan variogram model (Fig. 7) with optimal parameter values: σ 2 = 0.25, ξ = 0.26 (in normalized units), η 1 = −1.90. The optimum search radius used with the Spartan model (determined by the leave-one-out cross validation test) is equal to 0.38 (normalized units) for both models. Near the origin and at intermediate distances, which are crucial for the interpolation, the Spartan model fitting is very good and overall follows closely the trend of the experimental variogram. The negative values of η 1 causes a negative hole effect in the Spartan correlation (Žukovič and Hristopulos 2008) that can be observed in both variogram figures (Figs. 6 and 7).

Fig. 8
figure 8

Map of estimated groundwater level in the Mires basin using MBC-RK-T-DEM spatial model, adapted on the real basin coordinates and location in the valley (circles denote the monitoring locations and solid black line the temporary river path)

Fig. 9
figure 9

Map of estimated groundwater level standard deviation in the Mires basin using MBC-RK-T-DEM spatial model, adapted on the real basin coordinates and location in the valley (circles denote the monitoring locations and solid black line the temporary river path)

Fig. 10
figure 10

Map of estimated probabilities the aquifer level to decline below the set 25 m.a.s.l threshold value in Mires basin using IK, adapted on the real basin coordinates and location in the valley (circles denote the monitoring locations and solid black line the temporary river path)

Fig. 11
figure 11

Map of estimated probabilities variance in the Mires basin using IK, adapted on the real basin coordinates and location in the valley (circles denote the monitoring locations and solid black line the temporary river path)

The groundwater level map of the basin (Fig. 8) presents the spatial variability of the groundwater level that change from East towards West direction following the ground surface elevation trend (Fig. 1). The higher levels are met at the East of the basin while the lowest towards the West. The error map (Fig. 9) identifies the locations of the Mires Basin with the largest Kriging standard deviation. Hence, the borders of the basin can benefit from further sampling according to RK standard deviation results.

Indicator Kriging predictions (Fig. 10) shows that in the center and towards the West borders of the basin the risk of the aquifer level to decline below the set 25 m.a.s.l threshold is significant. Probabilities are increased closer to the river path than higher away. The dependence is reasonable considering that the agricultural activity in the area is concentrated along the temporary river.

Estimation variance calculated through IK is usually highest where wells distribution density is poor and variability among neighbouring observations is large, while lowest where wells distribution is good and variability is low (Hohn 1999). The variance range though depends on the quality of the fitted theoretical variogram model. The accurate knowledge of the correlation between point measurements at different locations produces estimates of the prediction variance that are minimal. This is succeeded when the fit of the model variogram to the experimental is the optimum, as occurs herein (Fig. 7). Thus, the properties of the variogram model occur through the whole area of interest leading to accurate estimates with low variance even for regions of poor monitoring density.

Significantly low IK variance values (maximum variance is equal to 0.0525) are obtained herein due to the optimum spatial dependence inference that is provided by Spartan variogram model (Fig. 7). According to Chiles and Delfiner (1999) the computed kriging variance is directly affected by the variogram fit. In this work IK variance (Fig. 11) is very low even at ungauged locations of 0 or close to 1 probability the groundwater level to lie below the set threshold value. Therefore, no further variance analysis is required such as conditional simulations to calculate the cumulative distribution function of the predictions (Deutsch and Journel 1992; Goovaerts 1997; Kanevski et al. 2009; Olea 1999).

The final risk (probability) map (Fig. 10), considering the calculated variance of the estimates (Fig. 11), shows that around 25 % of the aquifer’s surface present significant probability the aquifer level to lie below the 25 m.a.s.l. This area corresponds to almost 40 % of the productive agricultural land of the basin.

A very interesting characteristic that is identified in this work is the shape of the modeled Spartan variogram. According to a previous work (Varouchakis and Hristopulos 2013b) this shape can be explained with respect to the pumping activity of the basin. The average distance between the increment and the decrement in the variograms is equal to 150 m that lies between the range 105 to 160 m, which correspond to the radii of influence range of the pumping wells in Mires basin (Varouchakis and Hristopulos 2013b). Therefore, the trend of the experimental and of the model variogram expresses the aquifer behavior under pumping activity. As it has been stated in a previous work more than 200 wells operate in the basin affecting the measurements at the monitoring locations (Varouchakis and Hristopulos 2013b).

Conclusions

The optimal spatial interpolation approach for the spatial variability of the groundwater level in Mires basin is based on Residual Kriging with the Spartan variogram model applied to the normalized (MBC) fluctuations. The present findings are supported by the results of cross validation analysis. In addition, risk maps based on IK identify the vulnerable areas of the basin that require intense monitoring and remedial actions to avoid further decline of the aquifer. These are located at the west part of the basin mainly along the river path. The recently developed MBC transformation method shows an excellent behaviour transforming both data and residuals closer to normal distribution. In addition the Spartan variogram model has an excellent fit to the experimental variogram of the data, residuals and indicators following closely their trend. Thus, it constitutes a reliable alternative to assess the spatial dependence of groundwater level data in interpolation studies.