Abstract
Proper knowledge of spatio-temporal variation of groundwater level are very important for efficient planning and management of groundwater level. In sparsely monitored basins, levels of groundwater are generally monitored at random points which may be far away from each other. Interpolation of groundwater level over a region in an accurate manner is a pre-requisite for modeling as well as management of water resources that can be achieved by adopting accurate and reliable techniques of interpolation of scattered data over a region. In the present study, two deterministic interpolation methods, i.e., inverse distance weighted (IDW), radial basis functions (RBFs), and three geostatistical interpolation methods, i.e., ordinary kriging (OK), simple kriging (SK), and ordinary cokriging (OCK) are evaluated to predict the spatial and temporal variation of groundwater levels. Cross-validation procedure is adopted to evaluate the prediction performance of adopted interpolation methods. Groundwater level data for the year 2019 from 31 sparsely located monitored observation wells over Sagar district of Madhya Pradesh, India are used to evaluate the performance of interpolation methods with respect to different statistical indicators of cross-validation. Interpolation maps of estimate groundwater level are produced using all five spatial interpolation techniques. Results of the analysis are presented and discussed.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Keywords
1 Introduction
Groundwater is one of the most important water sources. Groundwater management is important for their sustainable development. Therefore, we need appropriate information about the spatio-temporal behavior of the water table in a region. However, monitoring groundwater levels is inherently costly and time consuming, especially during installation stages that require the drilling of wells and piezometers. As a result, the number of surveillance sites available in a particular region is relatively small and often does not reflect the actual range of variation that may exist. Therefore, accurate groundwater level spatial interpolation at unsampled sites is required for better groundwater management.
Among various interpolation methods, there is no clear optimal method, thus results need to be compared depending on specific situation [10]. This study presents the use of GIS tools to generate the groundwater level surface for a sparsely monitored region from groundwater levels monitored at random locations. Geostatistical analyst tool in ArcGIS 10.8 is used to explore the spatial variability in groundwater levels for the Sagar district region located in Madhya Pradesh, India.
2 Study Area and Data Source
2.1 Sagar District
The Sagar district is located in the north central part of Madhya Pradesh, India, and occupies an area of 10,252 km2. The district extends between the latitude of 23° 10′ and 24° 27′N, longitude of 78° 04′ and 79° 21′E. Figure 1 shows the index map for the Sagar district.
2.2 Data Used
The pre (April–June) and post-monsoon (Oct–Dec) seasonal groundwater level data of 31 central ground water board (CGWB) monitoring wells for 2019 are used. The location and groundwater level of observation wells are collected directly from the India Water Resources Information System (WRIS). Traditionally, geostatistical studies are performed on at least 100 samples [11]. However, the sample size is small in this study.
3 Materials and Method
3.1 Exploratory Analysis
The datasets are initially visualized in order to identify incorrect coordinate information and illogical data points. The screened datasets are then subjected to exploratory information evaluation to identify the outliers which can be unfavorable to spatial prediction. The variogram, in particular, may be very touchy to outliers due to the fact it's far primarily based totally at the squared differences among information [8]. Description of the information values is done through fundamental precis of statistics, inclusive of means, medians, variances, and skewness.
The geostatistical method mainly kriging is taken into consideration the high-quality unbiased linear prediction (BLUP) if the information meets the situations of normality, variance uniformity, and stationarity [7]. However, spatial information, specifically weather information, violates those situations. High asymmetries and outliers have undesired consequences on variogram shape and kriging estimates [6]. For spatio-temporal data that follow a Gaussian distribution, the effects of extremes are reduced, and more stable variograms are obtained, making it easier to model spatial variability [5]. Data transformations is required prior to kriging to standardize data distribution, eliminate outliers, and improvise data stationarity [4]. In this study, the normality of groundwater level data is visualized using tools such as histograms and boxplots, and the mean and median, symmetry (skewness) and flattening (kurtosis) coefficients. It is checked numerically by comparing it with the normal distribution and also according to the formal statistical Shapiro–Wilk test.
3.2 Interpolation Methods
Inverse distance weighting directly works on the assumption that the point closest to the predicted position is weighted more heavily, and the weight is reduced as a function of distance, hence the name inverse distance weighting. The exact formula for this interpolator is
where \(z\left( {s_{0} } \right)\) is predicted value of unsampled point, n is total no. of sampling points, di is separation distance between unsampled point and ith sampled point, and p denotes the weighing power.
Radial basis functions (RBFs) are an exact set of interpolation methods. That is, the surface must pass through each measurement sample value. RBF is used to generate smooth surfaces from a large number of data points. There are five different functions: thin-plate spline (TPS), spline with tension (SPT) [2], fully regularized spline (CRS) [2], multiquadric function (MQ), and inverse multiquadric function (IMQ) [12]. Each basis function has a different form and has a different interpolation plane.
Kriging belongs to a family of generalized least squares regression methods in geostatistics which uses observations sampled in a particular search environment as a linear combination to estimate values in unsampled locations [5, 7].
\(\widehat{Z}\left( {s_{o} } \right)\) is the estimated variable of interest (groundwater level) at the unsampled location \(s_{o,}\) and \(Z\left( {s_{\alpha } } \right)\) is the observed values at the sampled locations in the vicinity of \(s_{o}\).
OK is most commonly used form of kriging, in which mean is considered unknown and fluctuates locally, which makes it possible to maintain stationarity in the local neighborhood.
Simple kriging is mathematically the simplest but the least common. It is assumed that the expected value (mean) of the random field is known and depends on the covariance function. SK assumes second-order stationary that is constant mean, variance, and covariance across the domain or the region of interest [11].
The OCK method is a modification of the OK method. The main advantage of OCK is that it uses multiple variables in the estimation process. The OCK method is used to improve the predictability of the primary variable by using the auxiliary variable, assuming that both primary and auxiliary variables are in good correlation [7]. This method is especially suitable when the main attributes of interest are sparse, but the relevant secondary information is abundant.
3.3 Variograms (Semivariogram)
The existence of spatial structures (spatial autocorrelation) in which nearby observations are more similar than distant observations is a pre-requisite for the application of geostatistics [5]. Experimental variograms measure the average dissimilarity between unsampled values and nearby data values [4] and can therefore represent autocorrelation at various distances. The kriging method requires a model of functions that characterize spatial variability, variograms, and key characteristic parameters such as nugget effects, thresholds, and ranges [5]. The experimental variogram is calculated based on the following formula:
with \(Z\left( {s_{\alpha } } \right)\) and \(Z\left( {s_{\alpha } + d} \right)\) being the values observed at the locations \(u_{\alpha }\) and \(u_{\alpha }\) + d separated by the distance d and N(d) being the number of such pairs. If the value at \(Z\left( {s_{\alpha } } \right)\) and \(Z\left( {s_{\alpha } + d} \right)\) is autocorrelated, the result of Eq. (3) will be small, relative to an uncorrelated pair of points. From an experimental variogram analysis, the appropriate model (Gaussian, spherical, etc.) is usually fitted by the weighted least squares method, and the parameters (range, threshold, and nugget) are used in kriging. Exponential, Gaussian, and spherical are the most commonly used (theoretical) variogram models in hydrological kriging applications [1] and are also used to model experimental variograms.
3.4 Cross-Validation
The performance of the various interpolation methods (IDW, RBF, SK, OK, and OCK) is evaluated through a cross-validation process. Cross-validation is a validation technique that removes observations one by one from the dataset and re-estimates from the remaining sampled data using the selected model. If the sample size of the data is very small, such as when there are only 31 observations, the method comparison is done by cross-validation [7]. This is a common way to verify the accuracy of the interpolation method [1].
The overall performance of the interpolation methods for groundwater level estimation is conducted using correlation and error-based measures. The correlation includes the coefficient of determination (R2), Nash–Sutcliffe efficiency coefficient (E), and Willmott agreement index (d), whereas the error measures include the mean relative error (MRE), the root mean square error (RMSE), and the mean error (ME).
4 Results and Discussion
4.1 Exploratory Data Analysis
Seasonal groundwater data from 31 observation wells are examined to understand the pattern in the data. Histogram (Fig. 2) and standard statistics (Table 1) are used to describe the data.
The histogram shows symmetry indicating normality of distribution. In fact, seasonal groundwater elevation data have mean and median close enough and skewness value close to zero, thus suggesting this distribution as symmetrical distribution. Finally, the Shapiro–Wilk test confirms the normality of the original data (p = 0.83 > 0.05, p = 0.71 > 0.05). Since original data follow a normal distribution hence, it is decided to work on original data without transformation.
4.2 Spatial Analysis of Groundwater Level Data
The spatial variation of seasonal groundwater elevation is considered to be isotropic ignoring the separation direction because the size of the sample (31) is limited and would not possibly detect anisotropy [7].
Table 3 and Figs. 3 and 4 show the variogram model and its parameters tuned for seasonal groundwater levels. The nugget effect to nugget to threshold ratio is used to classify the spatial dependence of the variable [3]. Variables have strong spatial dependence when the nugget-threshold ratio is less than 0.25, and moderate spatial dependence when the ratio is 0.25 to 0.75 [8]. Otherwise, the variables are less spatially dependent. Therefore, in our case, the groundwater level was strongly spatially correlated (Table 2).
Elevation, as auxiliary information, has decreased semi-variances. It may be visible that the sill is higher for OK and SK (4507.4 m2 and 4,295.7 m2) than for OCK (3713.8 m2 and 3577 m2). This is predicted due to the fact the covariate, elevation which become taken into consideration for OCK, however, now no longer for OK and SK variogram, in part explains the variability of the groundwater level data.
For IDW method, an optimal power value (p) as well as the number of the closest neighbors to include are determined, whereas for RBF method, the choice of radial basis function, their kernel parameter, and number of the closest neighbors to include are determined by minimizing the root mean square error (RMSE) statistics obtained from a cross-validation procedure. In this study, for IDW method, the optimizing parameter of the weight function (p) is taken as 2.0, whereas for RBF method, multiquadric radial basis function is found to be optimal among all the functions.
4.3 Groundwater Mapping
Tables 3 and 4 show various descriptive statistical parameters for the measured seasons (before and after the monsoon) and the parameters predicted by two deterministic and three geostatistical interpolation methods.
One of the hallmarks of geostatistical methods is smoothing, as predicted values are less variable than measured values. In other words, the minimum expected value is greater than the measured value, and the maximum expected value is less than the measured value [9]. This smoothing phenomenon is least for OCK while it is the most accentuated for SK which has 373.19, 376.34 m as minimal and 526.9, 533.14 m as maximal values compared to 348.93, 368.43 m as minimal and 586.18, 592.98 m as maximal for the measured groundwater values during pre- and post-monsoon periods, respectively.
This phenomenon is confirmed by the standard deviation, especially the decrease in the estimated variance for the measured data variance of 43, 39.7% of SK; 40.6, 39.2% for OK; 18.4, 15.6% for OCK during pre- and post-monsoon periods, respectively, and also by the coefficient of variation which is minimal for SK (9.62, 9.42%) followed by OK (9.64,9.47) and OCK (11.31,11.16%) compared to measured values (12.53, 12.17%) for pre- and post-monsoon periods.
Smoothing phenomenon among deterministic methods is least for RBF and most accentuated for IDW which has 389.37, 390.41 m as minimal and 524.98, 531.62 m as maximal values compared to 348.93, 368.43 m as minimal and 586.18, 592.98 m as maximal for the measured groundwater values during pre- and post-monsoon periods, respectively.
In geostatistical methods, estimates of SK are higher than those of OCK and OK with SK, OK, and OCK mean values of 456.34, 462.51 m; 456.16, 462.21 m; 456.10, 462.14 m for pre- and post-monsoon periods, respectively. Similar results can be seen in minimal values and for different percentiles. Similar analysis is carried out for deterministic interpolation methods, and overall estimates of IDW are found to be higher among all five interpolation methods used in this study.
Figures 5 and 6 represent the groundwater maps obtained from five methods of spatial interpolation (IDW, RBF, SK OK, and OCK) for pre- and post-monsoon periods for the year 2019.
4.4 Performance Evaluation Study
To deepen the comparative study of spatial interpolation, the performance indicators for cross-validation are shown in Figs. 7 and 5. The boxplot of groundwater level prediction error (Fig. 7) shows that interpolation generally corrects the predicted groundwater level of 31 observation wells. The perfect match between the predicted and measured values is represented by 0 in the Fig. 7. Comparing the five methods, the residual (error) between the measured and predicted groundwater levels of OCK is significantly reduced. This shows that OCK is the best interpolator, though there exists small underestimation and overestimation.
Because aim of this study was to compare various methods, first the impact of various interpolation method on accuracy was considered. The 3 kriging methods performed better than the deterministic approach to estimate groundwater levels for each season. Performance measures of interpolation methods are summarized in Table 5. High values of coefficients of determination, Nash–Sutcliffe efficiency, and Willmott agreement index suggested an amazing fit among measured and predicted water level depth. Of the five interpolation methods, OCK had the best overall performance, with OK, SK, and RBF significantly superior to IDW. Not only the performance indicators of the model, but also the errors confirmed the above facts. The low RMSE and ME of all interpolation methods showed applicability to the prediction of groundwater level, and the superiority of OCK over all other methods was fully demonstrated by its minimum error value.
The second approach to assess the accuracy of a method is done by adopting regression coefficients (intercept and slope). The best model performance is represented by small intercept and large gradient. Among the five interpolation methods, the OCK, method which considered elevation as an auxiliary variable for predicting groundwater levels, showed best results.
5 Conclusions
The following conclusions are derived from the foregoing study:
-
Introduction of elevation information improved the performance of covariate kriging method, OCK in particular, in sparsely monitored region.
-
Smoothing phenomenon in geostatistical method is least for OCK while it is the most accentuated for SK whereas smoothing phenomenon among deterministic methods is least for RBF and most accentuated for IDW.
-
In geostatistical methods, estimates of SK were higher than OCK and OK, whereas overall estimates of IDW are found to be higher among all five interpolation methods.
-
Geostatistical methods performed better than the deterministic methods for predicting groundwater levels.
References
Adhikary PP, Dash CJ (2017) Comparison of deterministic and stochastic methods to predict spatial variation of groundwater depth. Appl Water Sci 7:339–348
Arslan H (2014) Estimation of spatial distrubition of groundwater level and risky areas of seawater intrusion on the coastal region in Carsamba Plain, Turkey, using different interpolation methods. Environ Monit Assess 186:5123–5134
Cambardella CA, Moorman TB, Novak JM, Parkin TB, Karlen DL, Turco RF, Konopka AE (1994) Field-scale variability of soil properties in Central Iowa Soils. Soil Sci Soc Am J 58:1501–1511
Deutsch CV, Journel AG (1998) GSLIB: Geostatistical Software Library and user’s guide, 2nd edn. Oxford University Press, New York
Goovaerts P (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York
Gringarten E, Deutsch CV (2001) Teacher’s aide: variogram interpretation and modeling. Math Geol 33:507–534
Isaaks EH, Srivastava RM (1989) An introduction to applied geostatistics. Oxford University, New York
Liu D, Wang Z, Zhang B, Song K, Li X, Li J, Li F, Duan H (2006) Spatial distribution of soil organic carbon and analysis of related factors in croplands of the black soil region, Northeast China. Agric, Ecosyst Environ 113(1–4):73–81
Rata M, Douaoui A, Larid M (2020) Comparison of geostatistical interpolation methods to map annual rainfall in the Chéliff watershed, Algeria. Theoret Appl Climatol 141:1009–1024
Sun Y, Kang S, Li F, Zhang L (2009) Comparision of interpolation methods for depth to groundwater and its temporal and spatial variation in the Minqin oasis of Northwest China. Environ Model Softw 24:1163–1170
Webster R, Oliver MA (2007) Geostatistics for environmental scientists, 2nd edn. Wiley, Chichester
Xie Y, Chen T, Lei M, Yang J (2011) Spatial distribution of soil heavy metal pollution estimated by different interpolation methods: accuracy and uncertainty analysis. Chemosphere 82:468–476
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Additional information
Disclaimer: The presentation of material and details in maps used in this chapter does not imply the expression of any opinion whatsoever on the part of the Publisher or Author concerning the legal status of any country, area or territory or of its authorities, or concerning the delimitation of its borders. The depiction and use of boundaries, geographic names and related data shown on maps and included in lists, tables, documents, and databases in this chapter are not warranted to be error free nor do they necessarily imply official endorsement or acceptance by the Publisher or Author.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Raghuvanshi, A.S., Tiwari, H.L. (2023). Comparison of Spatial Interpolation Methods for Mapping Seasonal Groundwater Levels. In: Timbadiya, P.V., Patel, P.L., Singh, V.P., Mirajkar, A.B. (eds) Geospatial and Soft Computing Techniques. HYDRO 2021. Lecture Notes in Civil Engineering, vol 339. Springer, Singapore. https://doi.org/10.1007/978-981-99-1901-7_19
Download citation
DOI: https://doi.org/10.1007/978-981-99-1901-7_19
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1900-0
Online ISBN: 978-981-99-1901-7
eBook Packages: EngineeringEngineering (R0)