Spatial variability estimation and risk assessment of the aquifer level at sparsely gauged basins using geostatistical methodologies

Varouchakis, Emmanouil A.; Kolosionis, Kostantinos; Karatzas, George P.

doi:10.1007/s12145-016-0265-3

Spatial variability estimation and risk assessment of the aquifer level at sparsely gauged basins using geostatistical methodologies

Research Article
Published: 06 May 2016

Volume 9, pages 437–448, (2016)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Earth Science Informatics Aims and scope Submit manuscript

Spatial variability estimation and risk assessment of the aquifer level at sparsely gauged basins using geostatistical methodologies

Download PDF

Emmanouil A. Varouchakis¹,
Kostantinos Kolosionis¹ &
George P. Karatzas¹

403 Accesses
13 Citations
Explore all metrics

Abstract

The spatial variability evaluation of the water table level of an aquifer provides useful information in water resources management plans. Three different approaches are applied to estimate the spatial variability of the water table in the study basin. All of them are based on the Kriging methodology. The first is the classical Ordinary Kriging approach, while the second involves information from a secondary variable (surface elevation) and the application of Residual Kriging. The third calculates the probability to lie below a certain groundwater level limit that could cause significant problems in groundwater resources availability. The latter is achieved by means of Indicator Kriging. A recently developed non-linear normalization method is used to transform both data and residuals closer to normal distribution for improved prediction results. In addition, the recently developed Spartan variogram model is applied to determine the spatial dependence of the measurements. The latter proves to be the optimal model, compared to a series of models tested, which provides in combination with the Kriging methodologies the most accurate cross validation estimations. The variogram form is explained with respect to the radius of influence of the pumping wells representing the spatial impact of the pumping activity. Groundwater level and probability maps are developed providing the ability to assess the spatial variability of the groundwater level in the basin and the risk that certain locations have in terms of a safe groundwater level limit that has been set for the sustainability of the groundwater resources of the basin.

Fundamentals of Geostatistics for Assessing Spatial Variation of Groundwater Resources

Spatial Interpolation for the Distribution of Groundwater Level in an Area of Complex Geology Using Widely Available GIS Tools

Article 14 July 2021

Assessment of changes in regional groundwater levels through spatio-temporal kriging: application to the southern Basin of Mexico aquifer system

Article Open access 01 September 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The knowledge of the spatial variability of the water table level in aquifers with limited monitoring provides information to understand the aquifer behaviour at different locations of the basin. This information becomes more important in basins that are under the threat of over-pumping where the water table has dropped significantly. The spatial distribution feedback gives the potential to identify vulnerable locations. The accurate mapping of groundwater levels in an aquifer is important for effective management and monitoring decisions. However, the number and spatial distribution of hydraulic head measurements are not always sufficient to accurately represent the groundwater level in a given aquifer. Estimates at unsampled locations can be obtained by applying geostatistical methods to the available data in order to map the spatial distribution of an aquifer level. In sparsely monitored basins, accurate mapping of the spatial variability of groundwater level requires the interpolation of scattered data.

Ordinary Kriging (OK) bases its estimates at unsampled locations only on the sampled primary variable. OK interpolation is widely used to determine the spatial variability of groundwater levels in hydrological basins e.g., (Ahmadi and Sedghamiz 2007; Buchanan and Triantafilis 2009; Chung and Rogers 2012; Nikroo et al. 2009; Sun et al. 2009; Theodossiou and Latinopoulos 2006; Varouchakis and Hristopulos 2013a). Alternatively, Residual Kriging (RK) and Kriging with External Drift (KED), embody secondary information in a drift term. KED and RK are practically equivalent but differ in the methodological steps used (Hengl 2007; Hengl et al. 2003). RK has been applied to the interpolation of water table elevation using deterministic trend models that include e.g. space polynomials (Neuman and Jacobson 1984), topographic metrics such as surface elevation and topographic index, and distance from riverbed (Desbarats et al. 2002; Varouchakis and Hristopulos 2013b), numerical solutions for the hydraulic head field (Rivest et al. 2008; Varouchakis and Hristopulos 2013b) and rainfall data (Moukana and Koike 2008).

Indicator Kriging (IK) has been widely used mainly for risk assessment of pollutant concentrations in ground and surface waters leading to significant decision support regarding the prevention and/or remediation of certain sites (Anane et al. 2014; Arslan 2012; Chica-Olmo et al. 2014; Liu et al. 2004; Neshat et al. 2015). However, it can be also applied for the risk assessment of groundwater level spatial distribution in arid areas or in those with high aquifer pumping. Demir et al. (2009) used variable groundwater level thresholds to create indicators under rain and irrigation periods presenting the aquifer response. However, the groundwater level scale was different compared to the scale of this work. In addition, the number of thresholds is important if one aims to assess different scenarios and also to calculate the cumulative effect. This work on the other hand, used a single groundwater level threshold that was calculated from the physical characteristics of the basin, which is under water scarcity threat. The role of this limit is significant as it sets the threshold of balance between pumping and groundwater level necessary for the aquifer sustainability. Other similar works applied Indicator Kriging to address soil saturation probability regarding depth to water table (Lyon et al. 2006a), or to produce probability maps a shallow water table to exceed certain level thresholds per rainfall event (Lyon et al. 2006b). However, most of the works used groundwater level as one of many parameters to perform assessment regarding pollution risk. This work differs from all those, as to the authors knowledge there is not any similar work to address aquifer depletion risk.

The current project presents the application of Ordinary Kriging, Residual Kriging (e.g. surface elevation), and Indicator Kriging to predict the groundwater level spatial variability as well as the associated risk considering an aquifer level threshold respectively in a sparsely gauged basin. In this work the hole-effect property of the Spartan variogram model is presented for the first time in measurement data. The specific shape is obtained when its shape parameter receives negative values. This is an important evidence for the functionality of the Spartan variogram model. Furthermore, the set of a sustainable groundwater level limit based on the physical characteristics of a basin is discussed, risk assessment using IK regarding a groundwater level threshold is interpreted as not many similar works exist and a new dataset is used that has not been published before.

Area of Study

The present research focuses on Mires basin of the Messara Valley (Fig. 1) at the island of Crete (Greece). The study basin consists of an unconfined aquifer, is sparsely sampled and has limited groundwater resources which are vital for the area’s ecosystem and agriculture. The mean annual rainfall in Mires basin has been lately estimated around 625 mm. Approximately 65 % of the rainfall is lost to evapotranspiration and 10 % is lost as runoff to the sea, leaving only 25 % to recharge the groundwater store. A detailed hydrogeological and hydrological description of the basin is available in Varouchakis (2016). Knowledge of the spatial variations of the groundwater level is important for developing sound management and monitoring strategies. Over-exploitation during the past 30 years has led to a dramatic decrease, in excess of 35 m, of the groundwater level.

Efficient groundwater management in the basin is crucial in light of predictions based on regional climate change models that show a substantial risk of desertification for Crete. In this work accurate spatial models of the basin’s groundwater level are generated that help to identify the susceptible locations and to provide input for potential groundwater resources management plans. The data used in this research consist of hydraulic head measurements (wet period of 2007–2008 hydrological year) from the 43 monitoring locations that operate in the basin which are unevenly distributed and mostly concentrated along a temporary river. The range of hydraulic heads varies from an extremely low value of 11.45 to 72.93 masl. An initial statistical analysis shows that the head data have skewness and kurtosis coefficients equal to ŝ _z = 0.76 and $ {\widehat{k}}_z=2.80 $ respectively, implying a mild deviation from Gaussian statistics (ŝ _z = 0 and $ {\widehat{k}}_z=3 $ respectively). The data analysis is performed by codes developed and run in the Matlab® programming environment (Matlab v.7.10). Standardized spatial coordinates in the interval [0, 1] are used to avoid numerical instabilities.

Methodology

Skewed or erratic data can often be made more suitable for geostatistical modeling by appropriate transformation. A normal distribution for the variable under study is desirable in linear geostatistics (Clark and Harper 2000). Even though normality may not be strictly required, serious violation of normality, such as too high skewness and outliers, can impair the variogram structure and the Kriging results (Gringarten and Deutsch 2001; Ouyang et al. 2006). Ordinary Kriging is well-known to be optimal when the data have a multivariate normal distribution. Transformation of data therefore may be required before Kriging to normalize the data distribution, suppress outliers and improve data stationarity (Armstrong 1998; Deutsch and Journel 1992). The estimation then is performed in the Gaussian domain before back-transforming the estimates to the original domain. An advantage of the Gaussian distribution is that spatial variability is easier to be modelled because it reduces effects of extreme values providing more stable variograms (Armstrong 1998; Goovaerts 1997; Pardo-Iguzquiza and Dowd 2005). Kriging represents variability only up to the second order moment (covariance), so the random field of the transformed variable should be Gaussian to derive unbiased estimates at unsampled locations (Deutsch and Journel 1992; Goovaerts et al. 2005).

A non-linear normalizing data transformation is applied in conjunction with Kriging for the accurate prediction of groundwater level spatial variability. The Modified Box-Cox (MBC) transformation method has recently been proposed and applied successfully to normalize hydraulic heads and residuals (Varouchakis et al. 2012). This recently proposed method focuses on normalizing the skewness and kurtosis coefficients of the data, but it neglects higher-order moments (Varouchakis and Hristopulos 2013b). It is defined by the following function,

$$ \begin{array}{cc}\hfill y:={g}_{MBC}\left(z;\boldsymbol{\upkappa} \right)=\frac{{\left(z-{z}_{\min }+{k_2}^2\right)}^{k_1}-1}{k_1},\hfill & \hfill {\boldsymbol{\upkappa}}^T=\left({k}_1,{k}_2\right),\hfill \end{array} $$

(1)

where k ₁ is the power exponent and k ₂ is an offset parameter. Use of the latter allows negative z values and so the transformation can be applied to fluctuations as well. Parameters (k ₁,k ₂) are estimated from the numerical solution of the equations $ {\widehat{s}}_z=0,\kern0.83em {\widehat{k}}_z=3 $ in the form below, where ŝ _z and $ {\widehat{k}}_z $ are the sample skewness and kurtosis coefficients respectively,

$$ {\left(\frac{{\widehat{m}}_z-{\tilde{m}}_z}{\sigma_z}\right)}^2+{\left({\widehat{k}}_z-3\right)}^2\simeq 0. $$

(2)

In the above equation $ {\widehat{m}}_z $ is the sample mean, $ {\tilde{m}}_z $ is the sample’s median and σ _z the standard deviation. The minimization is performed using the Nelder-Mead simplex optimization method (Nelder and Mead 1965; Press et al. 1992).

Besides the classical OK interpolation, the prediction of the hydraulic head spatial variability is also performed using RK by incorporating local geographic features, such as the ground surface elevation in the trend function. Previous studies have shown that incorporating such auxiliary information in the trend function improves the accuracy of the spatial interpolation (Varouchakis 2012). Herein, there are two reasons for employing ground surface elevation as an auxiliary variable. The first is the important correlation between surface elevation and groundwater level; at high elevation the groundwater level is also high following a decreasing trend towards lower elevations. The second reason is the application of a tool by Haitjema and Mitchell-Bruker (2005). This tool examines if the aquifer is recharge or topography controlled. The equation that describes the tool involves the average annual recharge rate R [m/d], the average distance between surface waters, L [m], a factor m between 8 and 16 for aquifers that are strip like or circular in shape respectively, the (horizontal) aquifer hydraulic conductivity, k [m/d], the aquifer thickness, H [m] and the maximum distance between the average surface water levels and the terrain elevation, d [m].

$$ \frac{R{L}^2}{mkHd}\left\{\begin{array}{l}>1,\kern1em \mathrm{topography}\ \mathrm{controlled}\\ {}<1,\kern1em \mathrm{recharge}\ \mathrm{controlled}\end{array}\right. $$

(3)

Although Mires aquifer does not comply exactly with the conditions applied to produce this inequality, an estimate can be obtained. The surface waters in the basin are limited and there is only a main river crossing the plain. Therefore, a distance between surface waters cannot be exactly defined. However, the eastern and western parts of the main river are connected with two tributaries that their in between distance can define the variable L (approximately 15,000 m). On the other hand, the aquifer is partly circular in shape at the up-stream and stripe like at the down-stream. Thus m on average would be set equal to 12. The other variables are based on average aquifer values: k = 50 m/d. R = 0.0004 m/d, H = 21.5 m and d = 10 m (Varouchakis 2015). Solving Eq. (3) the calculated ratio is equal to 0.7 and thus less than 1. This result means that the aquifer is recharge controlled. However, the result is quite close to 1 and one can assume that topography has also a vital role. Thus, considering also the important relation between the elevation and the groundwater level trend (65 %) and based on the absence of information on the spatial distribution of recharge in the basin, one can use the ground surface elevation as an auxiliary variable. Its significance and usefulness will be assessed based on the derived results.

In the following, it is assumed that the water table level is represented by a spatial random field (SRF), Z(s ∈ S), where S is the set of sampling points with cardinal number N. The values of the SRF in a given state will be denoted by lower-case letters. The target is to derive estimates, Ẑ(s ∈ P) of the water table elevation at the prediction set points, P that lie on a rectangular grid that covers the basin. Therefore, s _i, i = 1, …, N denote the sampling points, z(s _i) are the head values (in masl) at these points, and s ₀ denotes an estimation point, which is assumed to lie inside the convex hull of the sampling network. For mapping purposes, it is assumed that s ₀ moves sequentially through all the nodes of the mapping grid.

Spatial interpolation of the groundwater levels was initially applied by means of OK and RK in combination with MBC normalizing transformation, while IK based on developed indicators from the data was then implemented. In the first approach, a normalizing transformation g(·) is applied to the data. Then, OK is used to predict the transformed field Y(s) = g(Z(s)), and the predictions are back-transformed to obtain head estimates. In the second approach, a trend model m _Z(s) is introduced that captures local features. Since the fluctuation SRF, Z′(s) = Z(s) − m _Z(s), is non-Gaussian, a transformation g(·) is applied to obtain a normalized SRF, Y(s) = g(Z′(s)). The experimental variogram is then estimated and is fitted to theoretical models. Next, the Gaussian field Ŷ(s ∈ P) is estimated at the prediction points using OK. Finally, head estimates are retrieved from Ŷ(s ∈ P) by applying the back-transformation and adding the trend. Leave-one-out cross-validation analysis was used to determine the optimal spatial model applied to water table level data and to assess the accuracy of the interpolated head field.

Spatial dependence

The variogram is commonly used in geostatistical analysis to measure the spatial dependence between neighboring observations. The omnidirectional empirical (experimental) variogram of the hydraulic head and of the residuals was determined using the method of moments. The empirical variogram was fitted with isotropic classical models (Gaussian, Exponential, Linear, Spherical, and Power-law), the Matérn model (Goovaerts 1997), and the new family of Spartan variograms (3D, 2D models) (Hristopulos 2003; Hristopulos and Elogne 2007).

Spartan Spatial Random Fields (SSRFs) are a geostatistical model (Hristopulos 2002, 2003) inspired from statistical field theory with applications in environmental risk assessment and environmental monitoring (Elogne et al. 2008; Elogne and Hristopulos 2008; Hristopulos and Elogne 2009). SSRFs are generalized Gibbs random fields with an energy functional that is based on local interactions between the field values. The term Spartan indicates parametrically compact model families that involve a small number of parameters. SSRFs provide a new class of generalized covariance functions. The SSRFs covariance functions can be used for spatial interpolation with the classical Kriging estimators. Spartan covariance and variogram functions have been applied to various environmental data sets (Elogne et al. 2008; Varouchakis and Hristopulos 2013b). Herein, the Spartan covariance derived for d = 3 dimensions is applied.

The model parameters are obtained by fitting the SSRFs function (4) to the experimental variogram (Hristopulos and Elogne 2007). The exponential covariance is formed for η ₁ = 2, while for |η ₁| < 2 the product of the exponential and hole-effect model is obtained. According to Christakos (1991) a covariance function that is permissible in three spatial dimensions is also permissible in two dimensions. The components of the equation are presented and explained in Table 1.

Table 1 Interpretation of the Spartan covariance function components

Full size table

$$ {C}_z\left(\mathbf{h}\right)=\left\{\begin{array}{l}\begin{array}{lll}\frac{\eta_0{e}^{-h{\beta}_2}}{2\pi \sqrt{\left|{\eta}_1^2-4\right|}}\left[\frac{ \sin \left(h{\beta}_1\right)}{h{\beta}_1}\right],\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\left|{\eta}_1\right|<2,\hfill & {\sigma}_z^2=\frac{\eta_0}{2\pi \sqrt{\left|{\eta}_1^2-4\right|}}\hfill \end{array}\hfill \\ {}\begin{array}{lll}\frac{\eta_0{e}^{-h}}{8\pi },\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\kern0.5em {\eta}_1=2,\hfill & {\sigma}_z^2=\frac{\eta_0}{8\pi}\hfill \end{array}\hfill \\ {}\begin{array}{lll}\frac{\eta_0\left({e}^{-h{\omega}_1}-{e}^{-h{\omega}_2}\right)}{4\pi \left({\omega}_2-{\omega}_1\right)h\sqrt{\left|{\eta}_1^2-4\right|}},\hfill & \mathrm{f}\mathrm{o}\mathrm{r}\kern0.5em {\eta}_1>2,\hfill & {\sigma}_z^2=\frac{\eta_0}{4\pi \sqrt{\left|{\eta}_1^2-4\right|}}\hfill \end{array}\hfill \end{array}\right.. $$

(4)

Spatial models

In spatial linear interpolation methods such as OK and RK, it generally holds that,

$$ \widehat{z}\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i}z\left({\mathbf{s}}_i\right), $$

(5)

where $ {\mathbb{S}}_0 $ is the set of sampling points in the search neighborhood of s ₀. The neighborhood is empirically chosen so as to optimize the cross validation measures. The weights λ _i are obtained by minimizing the mean square estimation error conditionally on the zero-bias constraint (Cressie 1993) and they depend on the variogram model γ _z(r), where r are the spatial lags of the experimental variogram (Deutsch and Journel 1992). The OK estimation variance is given by the following equation, with the Lagrange coefficient μ compensating for the uncertainty of the mean value:

$$ {\sigma}_E^2\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i{\gamma}_z\left({\mathbf{s}}_i,{\mathbf{s}}_0\right)}+\mu . $$

(6)

Overall OK variance is termed as the weighted average of variograms from the new point s ₀ to all calibration points s _j, plus the Lagrange multiplier.

RK combines a trend function with interpolation of the residuals. Herein, a deterministic trend is applied based on the basin topography. In RK the estimate is expressed as:

$$ \widehat{z}\left({\mathbf{s}}_0\right)={m}_z\left({\mathbf{s}}_0\right)+\widehat{z}^{\prime}\left({\mathbf{s}}_0\right), $$

(7)

where m _z(s ₀) is the trend function, and ẑ′(s ₀) is the interpolated residual by means of OK (Rivoirard 2002). Typically, the trend function is modeled as:

$$ \begin{array}{cc}\hfill {m}_z\left({\mathbf{s}}_0\right)={\displaystyle \sum_{k=0}^p{\beta}_k}{q}_k\left({\mathbf{s}}_0\right);\hfill & \hfill {q}_k\left({\mathbf{s}}_0\right)\equiv 1,\hfill \end{array} $$

(8)

where q _k(s ₀) are the values of auxiliary variables at s ₀, β _k are the estimated regression coefficients and p is the number of auxiliary variables (Draper and Smith 1981; Hengl 2007; Hengl et al. 2007). The regression coefficients are estimated from the sample using ordinary least squares (OLS) (Kitanidis 1993). The variance of the estimates follows from the equation (Hengl 2007; Hengl et al. 2003):

$$ {\sigma}^2\left({\mathbf{s}}_0\right)={\sigma}_d^2\left({\mathbf{s}}_0\right)+{\sigma}_f^2\left({\mathbf{s}}_0\right), $$

(9)

$$ {\sigma}_d^2\left({\mathbf{s}}_0\right)={\mathbf{q}}_0^T{\left({\mathbf{q}}^T{\boldsymbol{\gamma}}_{z\prime}^{-1}\mathbf{q}\right)}^{-1}{\mathbf{q}}_0, $$

(10)

$$ {\sigma}_f^2\left({\mathbf{s}}_0\right)={\displaystyle {\sum}_{\left\{i:{\mathbf{s}}_i\in {\mathbb{S}}_0\right\}}{\lambda}_i{\gamma}_{z\prime}\left({\mathbf{s}}_i,{\mathbf{s}}_0\right)+\mu }, $$

(11)

where σ ²_d (s ₀) is the drift prediction variance, q ₀ is the vector of (p + 1) × 1 predictors at the unvisited location, q is the matrix of (N ₀ + 1) × (p + 1) predictors at the sampling points in the search neighborhood (N ₀ is the number of points within the search neighborhood of s ₀), γ _z ' is the variogram matrix of the (N ₀ + 1) × (N ₀ + 1) residuals at the measured locations (neighborhood) and σ ²_f (s ₀) is the kriging (OK) variance of residuals.

Herein a trend model for Mires basin groundwater level data is presented. Following other studies, based on the fact that usually the water table level of an unconfined aquifer follows the elevation trend, secondary information in the trend is considered from a Digital Elevation Model (DEM) of the area (Desbarats et al. 2002; Deutsch and Journel 1992; Goovaerts 1997; Hoeksema et al. 1989; Nikroo et al. 2009; Rivest et al. 2008). This choice was supported by the interpretation provided earlier in the Methodology section, and primarily by the important correlation coefficient between the groundwater level and the ground surface elevation in the basin (R = 0.65, is characterized as important correlation (Tichy 1993)). A scatter plot of ground surface elevation vs groundwater level is presented in Fig. 2.

The following expression for the trend of the hydraulic head (in masl) is applied (T-DEM):

$$ {m}_z\left(\mathbf{s}\right)=f\kern0.5em DEM\left(\mathbf{s}\right)+c, $$

(12)

where f, c are linear coefficients and DEM(s) is the local DEM value.

Another interpolation method applied is IK. This is a non-parametric geostatistical method for estimating the probability of a variable to exceed or lie below a specific threshold value at a given location (Goovaerts 1997). In this work, IK is applied for mapping the risk associated with a specified groundwater level limit that could lead to significant problem of groundwater availability. A critical aquifer level limit for the basin’s groundwater resources availability can be set in terms of a statistical and a physical based approach. The first involves the 25th, lower, percentile of the available data values which is equal to 25 m above sea level (m.a.s.l). This value was validated by a physically based approach that involves physical characteristics of the basin. The aquifer capacity has been recently estimated equal to 55 Mm³, the aquifer area equal to 26.1 km² and the porosity equal to 0.085. Dividing the first two figures and then their result with the porosity the aquifer level is calculated. This is equal to 24.7 m.a.s.l, similar to the lower percentile of the available data. Thus, 25 m.a.s.l can be set as the aquifer level threshold for sustainable groundwater resources management at the basin.

IK is applied to determine the conditional probability at unsampled points based on the spatial dependence structure of indicator-transformed data points with a binary distribution (e.g. 0 and 1). IK proceeds as the classical Ordinary Kriging (the main change is the choice of a cutoff value) with the difference that results is now maps with values between 0 and 1 expressing probability a condition to apply (Chica-Olmo et al. 2014; Deutsch and Journel 1992). Indicator variogram analysis is also performed using the models and the procedure previously stated for OK and RK (Isaaks and Srivastava 1989).

This method does not make assumptions regarding the variables distribution and has the ability to take into account, to a large extent, the uncertainty of the data. The IK is based on the conversion of all data from continuous to a binary form according to a specific threshold value. Thus, is robust to outliers handling. This value can be either a percentile of our data or a default value of marginal importance for the system under study. Subsequently, data with values below the threshold take a value of 1, while the remaining taking a value of 0.

$$ I\left(z\left(\mathbf{s}\right)\right) = \left\{\begin{array}{l}\begin{array}{ll}1,\hfill & z\left(\mathbf{s}\right)\le z^{\prime}\hfill \end{array}\hfill \\ {}\begin{array}{ll}0,\hfill & otherwise\hfill \end{array}\hfill \end{array}\right., $$

(13)

where, I(z(s)) is a binary variable, z(s) is the measured value and z ^′ is the cut-off (threshold) value.

Indicator Kriging is a geostatistical method best suited for issues that involve a threshold value (Goovaerts 1999; Isaaks and Srivastava 1989; Webster and Oliver 2001). However, most practical problems that require indicator techniques require well-chosen threshold which have a special significance to the problem being addressed. Probability maps delineate suitable and unsuitable sites regarding the examined issue, while help to take decisions to prevent and/or remediate a site compared to locations with reduced or no risk.

The method proceeds as follows: a) convert the given values to indicators: divide the range evenly or based on different quintiles (q_0.25; q_0.50; q_0.75), b) estimate the indicator variogram, c) apply Kriging using the usual equations and obtain predictions. On the other hand, the methodology has a set of disadvantages such as it will not necessarily provide probabilities to add up to 1 and sometimes the prediction may end up beyond the zero to one interval, e.g. Kriging occasionally provides negative weights-screening effect (Goovaerts 1997; Papadopoulou et al. 2009).

Results and Discussion

The performance of the Kriging-based geostatistical models is evaluated by using the leave one out cross validation technique that is usually applied in small datasets (Witten et al. 2011). A series of well-known statistical measures is employed to compare the true and estimated values of the cross-validation procedure, such as the correlation coefficient R, the Mean Absolute Error (MAE), the Root Mean Square Error (RMSE) and the Mean Absolute Relative Error (MARE).

Anisotropy was investigated by comparing directional variograms in four main directions (Goovaerts 1997) using an angle tolerance of 30°. Smaller tolerance values (15°) do not permit a sufficient number of data pairs (i.e., at least 30) at each lag. According to Journel and Huijbregts ( 1978 ), in order to acquire a good variogram, there should be approximately 30 pairs of distances and values for every lag. In addition the number of lags was reduced in order to achieve the required pair number for directional variograms. As shown in Fig. 3, there is no clear difference among the directional variograms for the original data. In addition a test of geometric anisotropy was performed based on the method of Covariance Hessian Identity (Chorti and Hristopulos 2008; Hristopulos 2002). This method is non-parametric, in the sense that it provides an estimate of the aspect ratio (i.e. the ratio of the two principal correlation lengths) and the orientation of the principal axes, without requiring variogram estimation and modeling. The aspect ratio is thus estimated at 0.75, while the short principal axis is rotated by 8° with respect to the E-W direction. The value of 0.75 does not differ significantly from unity. Indeed, the isotropic hypothesis cannot be rejected with 95 % confidence for ratios in the range [0.70 – 1.07] using the test given (Spiliopoulos et al. 2011). In light of the above analysis, the variogram function of the groundwater level is considered to be isotropic (Ahmadi and Sedghamiz 2007).

The general approach that is used for interpolation applies a normalizing transformation followed by OK on the transformed variables, and it finally back-transforms the predictions. The application of MBC methodology to the initial head dataset improves their normality (Table 2). The normality improvement is also supported by histograms of the data before and after the transformation (Fig. 4).

Table 2 Normalization results using Modified Box-Cox (MBC) transformation: skewness coefficient ŝ _z; kurtosis coefficient $ {\widehat{k}}_z $

Full size table

In terms of the spatial model that considers the head data the parameters of the theoretical variogram models tested are obtained by least squares fitting to the experimental omnidirectional variogram of the transformed hydraulic head. The 3D Spartan model gives the best fit in terms of cross validation results (Table 3) while the Spherical and the Matérn variogram come close. The 2D Spartan function did not provide a good fit for this dataset.

Table 3 Cross validation measures for spatial MBC-OK model with optimal variograms: MBC & OK: Ordinary Kriging with modified Box-Cox transformation of data and back-transformation

Full size table

In the case of spatial model with trend component, RK is applied. RK combines a trend function with interpolation of the residuals. The residuals of the trend model also display deviations from normality that are reduced by means of the MBC transformation (Table 4). Similarly, histograms (Fig. 5) present the normality improvement.

Table 4 Skewness ŝ _z and kurtosis $ {\widehat{k}}_z $ coefficients of trend models residuals following modified Box-Cox (MBC) normalization

Full size table

The omnidirectional experimental variogram is calculated by applying the method of moments to the transformed residuals of the T-DEM model. The Spartan variogram model (Fig. 6) again provides the best fit in terms of cross validation results (Table 5). The Spherical variogram provides similar results to the Spartan model while third best is the Matérn model.

Table 5 Cross validation measures for spatial MBC-RK-T-DEM model with optimal variograms: T-DEM trend using DEM surface elevation

Full size table

Another method to test data normality improvement, is the non-parametric Kolmogorov-Smirnov test. The test is applied to examine if a sample comes from a reference probability Distribution (Massey 1951). The test was implemented in Matlab® environment using the function «kstest» for both the transformed datasets. The null hypothesis for the Kolmogorov-Smirnov test is that data follows the standard normal distribution. Therefore, the null hypothesis was not rejected for the transformed datasets at significant levels 5 % and of 10 %.

The MBC-RK approach improves significantly the mean absolute prediction error (4.27 masl) by over 1 m compared to the MBC-OK (5.32 masl) approach. In addition the other estimation measures are at least similar (BIAS) but mostly improved (RMSE, R, MARE). Considering overall the cross validation measures the estimates based on the Spartan model prevails compared to the other two optimal models.

The least squares sum for each fitted variogram model is considered, which is an index of optimal fitting, for selecting the optimal variogram model with Indicator Kriging interpolation. Spartan model achieves the best fit (Fig. 7) over the range of lags considered providing a value of 0.023 compared to 0.029 for the spherical and 0.031 for the Matérn models.

The T-DEM trend model with RK and the IK methodology are applied to estimate the groundwater level and the probabilities of groundwater level to lie below a threshold value on a 100 × 100 grid defined in normalized coordinate space (actual cell size: 114 × 47 m). In addition the uncertainties of the estimations are also determined on a same grid size. Estimates are obtained only at points that lie inside the convex hull of the measurement locations (7317 grid points). The contour maps in physical space are shown in Figs. 8, 9, 10, and 11. The residuals of the T-DEM model are interpolated using the Spartan variogram model (Fig. 6) with the following optimal parameter values: σ ² = 17.77, ξ = 0.27 (in normalized units), η ₁ = −1.99 while the indicators applying the Spartan variogram model (Fig. 7) with optimal parameter values: σ ² = 0.25, ξ = 0.26 (in normalized units), η ₁ = −1.90. The optimum search radius used with the Spartan model (determined by the leave-one-out cross validation test) is equal to 0.38 (normalized units) for both models. Near the origin and at intermediate distances, which are crucial for the interpolation, the Spartan model fitting is very good and overall follows closely the trend of the experimental variogram. The negative values of η ₁ causes a negative hole effect in the Spartan correlation (Žukovič and Hristopulos 2008) that can be observed in both variogram figures (Figs. 6 and 7).

The groundwater level map of the basin (Fig. 8) presents the spatial variability of the groundwater level that change from East towards West direction following the ground surface elevation trend (Fig. 1). The higher levels are met at the East of the basin while the lowest towards the West. The error map (Fig. 9) identifies the locations of the Mires Basin with the largest Kriging standard deviation. Hence, the borders of the basin can benefit from further sampling according to RK standard deviation results.

Indicator Kriging predictions (Fig. 10) shows that in the center and towards the West borders of the basin the risk of the aquifer level to decline below the set 25 m.a.s.l threshold is significant. Probabilities are increased closer to the river path than higher away. The dependence is reasonable considering that the agricultural activity in the area is concentrated along the temporary river.

Estimation variance calculated through IK is usually highest where wells distribution density is poor and variability among neighbouring observations is large, while lowest where wells distribution is good and variability is low (Hohn 1999). The variance range though depends on the quality of the fitted theoretical variogram model. The accurate knowledge of the correlation between point measurements at different locations produces estimates of the prediction variance that are minimal. This is succeeded when the fit of the model variogram to the experimental is the optimum, as occurs herein (Fig. 7). Thus, the properties of the variogram model occur through the whole area of interest leading to accurate estimates with low variance even for regions of poor monitoring density.

Significantly low IK variance values (maximum variance is equal to 0.0525) are obtained herein due to the optimum spatial dependence inference that is provided by Spartan variogram model (Fig. 7). According to Chiles and Delfiner (1999) the computed kriging variance is directly affected by the variogram fit. In this work IK variance (Fig. 11) is very low even at ungauged locations of 0 or close to 1 probability the groundwater level to lie below the set threshold value. Therefore, no further variance analysis is required such as conditional simulations to calculate the cumulative distribution function of the predictions (Deutsch and Journel 1992; Goovaerts 1997; Kanevski et al. 2009; Olea 1999).

The final risk (probability) map (Fig. 10), considering the calculated variance of the estimates (Fig. 11), shows that around 25 % of the aquifer’s surface present significant probability the aquifer level to lie below the 25 m.a.s.l. This area corresponds to almost 40 % of the productive agricultural land of the basin.

A very interesting characteristic that is identified in this work is the shape of the modeled Spartan variogram. According to a previous work (Varouchakis and Hristopulos 2013b) this shape can be explained with respect to the pumping activity of the basin. The average distance between the increment and the decrement in the variograms is equal to 150 m that lies between the range 105 to 160 m, which correspond to the radii of influence range of the pumping wells in Mires basin (Varouchakis and Hristopulos 2013b). Therefore, the trend of the experimental and of the model variogram expresses the aquifer behavior under pumping activity. As it has been stated in a previous work more than 200 wells operate in the basin affecting the measurements at the monitoring locations (Varouchakis and Hristopulos 2013b).

Conclusions

The optimal spatial interpolation approach for the spatial variability of the groundwater level in Mires basin is based on Residual Kriging with the Spartan variogram model applied to the normalized (MBC) fluctuations. The present findings are supported by the results of cross validation analysis. In addition, risk maps based on IK identify the vulnerable areas of the basin that require intense monitoring and remedial actions to avoid further decline of the aquifer. These are located at the west part of the basin mainly along the river path. The recently developed MBC transformation method shows an excellent behaviour transforming both data and residuals closer to normal distribution. In addition the Spartan variogram model has an excellent fit to the experimental variogram of the data, residuals and indicators following closely their trend. Thus, it constitutes a reliable alternative to assess the spatial dependence of groundwater level data in interpolation studies.

References

Ahmadi S, Sedghamiz A (2007) Geostatistical analysis of spatial and temporal variations of groundwater level. Environ Monit Assess 129(1):277–294
Article Google Scholar
Anane M, Selmi Y, Limam A, Jedidi N, Jellali S (2014) Does irrigation with reclaimed water significantly pollute shallow aquifer with nitrate and salinity? An assay in a perurban area in North Tunisia. Environ Monit Assess 186(7):4367–4390
Article Google Scholar
Armstrong M (1998) Basic linear geostatistics. Springer Verlag, Berlin
Book Google Scholar
Arslan H (2012) Spatial and temporal mapping of groundwater salinity using ordinary kriging and indicator kriging: the case of Bafra Plain, Turkey. Agric Water Manag 113:57–63
Article Google Scholar
Buchanan S, Triantafilis J (2009) Mapping water table depth using geophysical and environmental variables. Ground Water 47(1):80–96
Article Google Scholar
Chica-Olmo M, Luque-Espinar JA, Rodriguez-Galiano V, Pardo-Igúzquiza E, Chica-Rivas L (2014) Categorical Indicator Kriging for assessing the risk of groundwater nitrate pollution: the case of Vega de Granada aquifer (SE Spain). Sci Total Environ 470–471:229–239
Article Google Scholar
Chiles JP, Delfiner A (1999) Geostatistics (modeling spatial uncertainty). Wiley, New York
Google Scholar
Chorti A, Hristopulos DT (2008) Non-parametric identification of anisotropic (elliptic) correlations in spatially distributed data sets. IEEE Trans Signal Process 56(10):4738–4751
Article Google Scholar
Christakos G (1991) Random field models in earth sciences. Academic, San Diego
Google Scholar
Chung J-W, Rogers JD (2012) Interpolations of groundwater table elevation in dissected uplands. Ground Water 50(4):598–607
Article Google Scholar
Clark I, Harper WV (2000) Practical geostatistics 2000. Ecosse North America Llc., Columbus
Google Scholar
Cressie N (1993) Statistics for spatial data (revised ed.). Wiley, New York
Google Scholar
Demir Y, Erşahin S, Güler M, Cemek B, Günal H, Arslan H (2009) Spatial variability of depth and salinity of groundwater under irrigated ustifluvents in the Middle Black Sea Region of Turkey. Environ Monit Assess 158(1–4):279–294
Article Google Scholar
Desbarats AJ, Logan CE, Hinton MJ, Sharpe DR (2002) On the kriging of water table elevations using collateral information from a digital elevation model. J Hydrol 255(1–4):25–38
Article Google Scholar
Deutsch CV, Journel AG (1992) GSLIB. Geostatistical software library and user’s guide. . Oxford University Press, New York
Google Scholar
Draper N, Smith H (1981) Applied regression analysis, 2nd edn. Wiley, New York
Google Scholar
Elogne SN, Hristopulos DT (2008) Geostatistical applications of Spartan spatial random fields. In: Soares A, Pereira MJ, Dimitrakopoulos R (eds) geoENV VI—geostatistics for environmental applications in series: quantitative geology and geostatistics, vol 15. Springer, Berlin, pp 477–488
Chapter Google Scholar
Elogne S, Hristopulos D, Varouchakis E (2008) An application of Spartan spatial random fields in environmental mapping: focus on automatic mapping capabilities. Stoch Env Res Risk A 22(5):633–646
Article Google Scholar
Goovaerts P (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York
Google Scholar
Goovaerts P (1999) Geostatistics in soil science: state-of-the-art and perspectives. Geoderma 89(1–2):1–45
Article Google Scholar
Goovaerts P, AvRuskin G, Meliker J, Slotnick M, Jacquez G, Nriagu J (2005) Geostatistical modeling of the spatial variability of arsenic in groundwater of southeast Michigan. Water Resour Res 41, W07013. doi:10.1029/2004WR003705
Article Google Scholar
Gringarten E, Deutsch CV (2001) Teacher’s aide: variogram interpretation and modeling. Math Geol 33(2001):507–534
Article Google Scholar
Haitjema HM, Mitchell-Bruker S (2005) Are water tables a subdued replica of the topography? Ground Water 43(6):781–786
Google Scholar
Hengl T (2007) A practical guide to geostatistical mapping of environmental variables. Office for Official Publications of the European Communities EUR 22904 EN-Scientific and Technical Research series 143
Hengl T, Heuvelink GBM, Stein A (2003) Comparison of kriging with external drift and regression-kriging. International Institute for Geo-information Science and Earth Observation (ITC) Technical note 17
Hengl T, Heuvelink GBM, Rossiter DG (2007) About regression-kriging: from equations to case studies. Comput Geosci 33(10):1301–1315
Article Google Scholar
Hoeksema RJ, Clapp RB, Thomas AL, Hunley AE, Farrow ND, Dearstone KC (1989) Cokriging model for estimation of water table elevation. Water Resour Res 25(3):429–438
Article Google Scholar
Hohn ME (1999) Geostatistics and petroleum geology. Springer, Dordrecht
Book Google Scholar
Hristopulos DT (2002) New anisotropic covariance models and estimation of anisotropic parameters based on the covariance tensor identity. Stoch Env Res Risk A 16(1):43–62
Article Google Scholar
Hristopulos DT (2003) Spartan Gibbs random field models for geostatistical applications. SIAM J Sci Comput 24(6):2125–2162
Article Google Scholar
Hristopulos DT, Elogne SN (2007) Analytic properties and covariance functions for a new class of generalized Gibbs random fields. IΕΕΕ Trans Inform Theory 53(12):4667–4679
Google Scholar
Hristopulos DT, Elogne SN (2009) Computationally efficient spatial interpolators based on Spartan spatial random fields. IEEE Trans Signal Process 57(9):3475–3487
Article Google Scholar
Isaaks EH, Srivastava RM (1989) An introduction to applied geostatisics. Oxford University Press, New York
Google Scholar
Journel AG, Huijbregts C (1978) Mining geostatistics. Academic, New York
Google Scholar
Kanevski M, Pozdnoukhov A, Timonin V (2009) Machine learning for spatial environmental data: theory, applications, and software. EPFL press, Lausanne
Book Google Scholar
Kitanidis P (1993) Generalized covariance functions in estimation. Math Geol 25(5):525–540
Article Google Scholar
Liu C-W, Jang C-S, Liao C-M (2004) Evaluation of arsenic contamination potential using indicator kriging in the Yun-Lin aquifer (Taiwan). Sci Total Environ 321(1–3):173–188
Article Google Scholar
Lyon SW, Lembo AJ Jr, Walter MT, Steenhuis TS (2006a) Defining probability of saturation with indicator kriging on hard and soft data. Adv Water Resour 29(2):181–193
Article Google Scholar
Lyon SW, Seibert J, Lembo AJ, Walter MT, Steenhuis TS (2006b) Geostatistical investigation into the temporal evolution of spatial structure in a shallow water table. Hydrol Earth Syst Sci 10(1):113–125
Article Google Scholar
Massey FJ (1951) The Kolmogorov-Smirnov test for goodness of fit. J Am Stat Assoc 46(253):68–78
Article Google Scholar
Moukana JA, Koike K (2008) Geostatistical model for correlating declining groundwater levels with changes in land cover detected from analyses of satellite images. Comput Geosci 34(11):1527–1540
Article Google Scholar
Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313
Article Google Scholar
Neshat A, Pradhan B, Javadi S (2015) Risk assessment of groundwater pollution using Monte Carlo approach in an agricultural region: an example from Kerman Plain, Iran. Comput Environ Urban Syst 50:66–73
Article Google Scholar
Neuman S, Jacobson E (1984) Analysis of nonintrinsic spatial variability by residual kriging with application to regional groundwater levels. Math Geol 16(5):499–521
Article Google Scholar
Nikroo L, Kompani-Zare M, Sepaskhah A, Fallah Shamsi S (2009) Groundwater depth and elevation interpolation by kriging methods in Mohr Basin of Fars province in Iran. Environ Monit Assess 166(1–4):387–407
Google Scholar
Olea RA (1999) Geostatistics for engineers and earth scientists. Kluwer Academic Publishers, New York
Book Google Scholar
Ouyang Y, Zhang JE, Ou LT (2006) Temporal and spatial distribution of sediment total organic carbon in an estuary river. J Environ Qual 35(1):93–100
Article Google Scholar
Papadopoulou MP, Varouchakis EA, Karatzas GP (2009) Simulation of complex aquifer behavior using numerical and geostatistical methodologies. Desalination 237(1–3):42–53
Article Google Scholar
Pardo-Iguzquiza E, Dowd P (2005) Empirical maximum likelihood Kriging: the general case. Math Geol 37(5):477–492
Article Google Scholar
Press WH, Teukolsky SA, Vettering WT, Flannery BP (1992) Numerical recipes in fortran, 2nd edn. Cambridge University Press, New York
Google Scholar
Rivest M, Marcotte D, Pasquier P (2008) Hydraulic head field estimation using kriging with an external drift: a way to consider conceptual model information. J Hydrol 361(3–4):349–361
Article Google Scholar
Rivoirard J (2002) On the structural link between variables in Kriging with external drift. Math Geol 34(7):797–808
Article Google Scholar
Spiliopoulos I, Hristopulos DT, Petrakis E, Chorti A (2011) A Multigrid method for the estimation of geometric anisotropy in environmental data from sensor networks. Comput Geosci 37(3):320–330
Article Google Scholar
Sun Y, Kang S, Li F, Zhang L (2009) Comparison of interpolation methods for depth to groundwater and its temporal and spatial variations in the Minqin oasis of northwest China. Environ Model Softw 24(10):1163–1170
Article Google Scholar
Theodossiou N, Latinopoulos P (2006) Evaluation and optimisation of groundwater observation networks using the kriging methodology. Environ Model Softw 21(7):991–1000
Article Google Scholar
Tichy M (1993) Applied methods of structural reliability. Springer, Dordrecht
Book Google Scholar
Varouchakis EA (2012) Geostatistical analysis and space-time models of aquifer levels: application to mires hydrological basin in the prefecture of crete. PhD Thesis, Mineral Resources Engineering, Technical University of Crete, Chania
Varouchakis EA (2016) Integrated water resources analysis at basin scale: a case study in Greece. J Irrig Drain. E-ASCE 142(3). doi:10.1061/(ASCE)IR.1943-4774.0000966
Varouchakis EA, Hristopulos DT (2013a) Comparison of stochastic and deterministic methods for mapping groundwater level spatial variability in sparsely monitored basins. Environ Monit Assess 185(1):1–19
Article Google Scholar
Varouchakis EA, Hristopulos DT (2013b) Improvement of groundwater level prediction in sparsely gauged basins using physical laws and local geographic features as auxiliary variables. Adv Water Resour 52(2013):34–49
Article Google Scholar
Varouchakis EA, Hristopulos DT, Karatzas GP (2012) Improving kriging of groundwater level data using nonlinear normalizing transformations-a field application. Hydrol Sci J 57(7):1404–1419
Article Google Scholar
Webster R, Oliver M (2001) Geostatistics for environmental scientists: statistics in practice. Wiley, Chichester
Google Scholar
Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques: practical machine learning tools and techniques. Elsevier, San Francisco
Google Scholar
Žukovič M, Hristopulos DT (2008) Environmental time series interpolation based on Spartan random processes. Atmos Environ 42(33):7669–7678
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Environmental Engineering, Technical University of Crete, Chania, Greece
Emmanouil A. Varouchakis, Kostantinos Kolosionis & George P. Karatzas

Authors

Emmanouil A. Varouchakis
View author publications
You can also search for this author in PubMed Google Scholar
Kostantinos Kolosionis
View author publications
You can also search for this author in PubMed Google Scholar
George P. Karatzas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Emmanouil A. Varouchakis.

Additional information

Communicated by: H. A. Babaie

Rights and permissions

Reprints and permissions

About this article

Cite this article

Varouchakis, E.A., Kolosionis, K. & Karatzas, G.P. Spatial variability estimation and risk assessment of the aquifer level at sparsely gauged basins using geostatistical methodologies. Earth Sci Inform 9, 437–448 (2016). https://doi.org/10.1007/s12145-016-0265-3

Download citation

Received: 23 September 2015
Accepted: 02 May 2016
Published: 06 May 2016
Issue Date: November 2016
DOI: https://doi.org/10.1007/s12145-016-0265-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Spatial variability estimation and risk assessment of the aquifer level at sparsely gauged basins using geostatistical methodologies

Abstract

Similar content being viewed by others

Fundamentals of Geostatistics for Assessing Spatial Variation of Groundwater Resources

Spatial Interpolation for the Distribution of Groundwater Level in an Area of Complex Geology Using Widely Available GIS Tools

Assessment of changes in regional groundwater levels through spatio-temporal kriging: application to the southern Basin of Mexico aquifer system

Introduction

Area of Study