Introduction

Safer and accessible water is essential for human beings to survive. Water intended for any use must comply within the standards proposed for the purpose. Varieties of challenges, both natural and anthropogenic, have appeared as threats to maintain the water quality. Generally, groundwater is considered less contaminated compared to surface water due to its limited contact with the external environment and groundwater undergo natural filtration during percolation through the soil zone. Largest consumer of freshwater is agriculture accounting to 70% followed by industries (20%) and domestic use (10%) (Chiras 2001). Of these sectors, industries take large care in meeting the water standards failing which can cost the investments in machineries. The case may not be the same for wastewater resulting from various processes that is disposed-off by these industries into drains or oceans, except in countries that enforce stringent laws for not complying with the wastewater disposal quality (Brindha and Elango 2012a). Farmers use groundwater for irrigation as it is available throughout the year. But, use of water with enriched salts can also reduce soil fertility and hinder crop yield. In developing countries where treated and tested water is not available through piped networks, surface and groundwater aid as a domestic water source (Brindha et al. 2014, 2016). Indicators that determine the water suitability have been classified based on the physical, chemical and biological properties. Depending on the occurrence of water either in the surface or sub-surface, the main indicators vary.

Methods for determining water quality have evolved from simple comparison of water chemistry with the standards to the use of sophisticated models that are data intensive. Water quality index (WQI) developed by Horton (1965) is one of the established and common method used by applying weights to the various ions in water and determining the suitability index. This weighting and analysis method have improved by the inclusion of artificial neural networks, fuzzy theory, etc. Li et al. (2010, 2011, 2016) proposed an entropy-based weight assignment in WQI considering uncertainties associated with the classification which is superior to the traditional WQI. A new coupled groundwater quality assessment model based on rough set attribute reduction and technique for order preference by similarity to ideal solution (TOPSIS) was proposed by Li et al. (2012). Fuzzy approach has been widely used in WQI in recent times (Gorai et al. 2014; Li et al. 2014). Surface interpolation techniques help to predict the concentrations at locations where samples could not be collected. Statistical models have also played their role in narrowing down the pollution sources in many studies (Machiwal and Jha 2015; Venkatramanan et al. 2016). All these methods work in their own way, and the results from each of these methods may not always be similar. Many researches have used these methods in combination to arrive at meaningful interpretations. Inverse distance weighted (IDW) method was more accurate than the kriging (Gaussian, spherical and cokriging) approach in predicting some pollutant levels in groundwater (Gong et al. 2014). Study by Mirzaei and Sakizadeh (2016) compared ordinary kriging (OK), empirical Bayesian kriging and IDW using the WQI approach. OK and IDW predicted similar results of groundwater levels except in shallow water level areas around reservoirs (Buchanan and Triantafilis 2009). Superiority of IDW above kriging was also reported in a landfill site by Spokas et al. (2003). Venkatramanan et al. (2016) have used statistical methods with kriging interpolation to understand the hydrochemical characteristics. Gundogdu and Guney (2007) applied different semivariogram models to analyse depth to water level in an irrigation area.

Mapping areas vulnerable to pollution and its possible health implications is largely adopted by spatial interpolation using the following steps: identification of key parameters of study, applying relative weights for these parameters, assigning sub-weights for variables within the parameter and calculation of an aggregate index. DRASTIC (D is depth to groundwater, R is net recharge due to rainfall, A is aquifer media, S is soil media, T is topography, I is impact of vadose zone, C is hydraulic conductivity of the aquifer) is one of the largely adopted vulnerability mapping model (Aller et al. 1985). Qian et al. (2012) proposed a revised DRASTIC model which can be used in plains where the geomorphological variations are not large. These spatial interpolation methods give varied results while applied to geographically distinct regions (Stigter et al. 2006; Anane et al. 2013; Brindha and Elango 2015).

Application of multivariate statistical analyses is vast including evaluating hydrochemical characteristics of groundwater (Wu et al. 2014; Chung et al. 2015; Venkatramanan et al. 20152016),  identifying the groundwater contamination from landfills (Mouser et al. 2005; Srivastava and Ramanathan 2008), understanding groundwater dynamics of an area (Chen and Feng 2013; Machiwal and Singh 2015), identifying the occurrence of seawater intrusion (Kim et al. 2005; Arslan 2013), differentiating natural/geogenic process from anthropogenic sources (Machiwal and Jha 2015), etc. These models perform differently when applied to datasets from different regions. Hence, the performance of one model over other cannot be considered as universally applicable.

In recent times, use of one method for decision making is not sufficient due to the variety of emerging contamination sources as well as variability in the results from different methods. There is a need to rely on multiple methods and compare the results to arrive at more meaningful decisions. This study compares OK and IDW interpolation methods in mapping potential areas of risk to groundwater contamination. Multivariate statistics are also used to identify the source of chemical contents in water and to compare the results with interpolation methods. Groundwater chemical characteristics of Richards Bay, South Africa (Fig. 1) was used in this study as the published information were scarce for this area, and this research will serve as an insight into the status of water quality in this region. This area is bounded by the coast and has the largest coal export facility in South Africa. Thus, it is vulnerable to contamination from multiple sources. The aim of this study is not to check how well these methods can predict the concentrations of ions in groundwater, but, to compare the reliability of interpolation methods in using them as a tool by decision makers.

Fig. 1
figure 1

Study area with land use and monitoring locations

Methodology

Description of study area

Coastal city of Richards bay and the nearby town of Empangeni in the Mhlathuze catchment (582 km2) are located about 180 km north east of Durban, capital of KwaZulu-Natal province in South Africa (Fig. 1). Humid summers and mild winters are experienced with temperature range of 21–29 °C during January–March and 12–23 °C during June–August. The KwaZulu-Natal region is one of the highest rainfall regions in South Africa (Schulze 1982). Annual rainfall in the study area ranges from 850 mm in the west to over 1200 mm along the coast (DWAF 2000). Most of the rainfall occurs from October to March. This area is characterized by flat to undulating landform with low hills and flat bottomed drainage features. Mhlathuze River, the largest in the region, originates from the Babanago Mountains and flows through Empangeni and Richards Bay and drains into the Indian Ocean. Monthly evaporation from surface water bodies during summer ranges from about 150 to 250 mm and winter evaporation is generally 100 mm/month (Kelbe et al. 2001). The number of industries that function in the Richards Bay area includes aluminium smelters, fertilizer plants and mining of mineral ores. It has the largest coal export facility in South Africa. Agriculture in the region is inland and includes sugarcane, citrus, vegetables, maize, etc. apart from commercial products such as pine and eucalyptus. There is also a landfill site located in the centre of the study area.

Granite of Precambrian age forms the basement in this region (Germishuyse 1999). It is overlaid by sedimentary formations of Triassic. This is followed by the basaltic rocks of Jurassic. Sedimentary formation from Cretaceous to Holocene overlays the basalts. In some places, the Cretaceous sediments directly overlay the granitic basement rocks. Cretaceous sediments are of marine origin with sandstone, mudstone and shale. Marine fossils are also found. Palaeocene period has similar lithological units as Cretaceous. Most of the study area is covered by Pleistocene to Holocene sands. These sands are fine grained, with clay and silt formed by fluvial and aeolian processes. The formation at the ground surface is reasonably permeable and facilitates infiltration of rainwater. Groundwater occurs in these unconsolidated formations in unconfined condition in the upper part and confined condition in the lower part with a think clay layer in between. In this study, only the upper aquifer is considered. Rainfall is the major source of groundwater recharge in this area.

Sampling and analysis

Surface and groundwater samples were collected from Richards Bay and Empangeni areas in September 2015. Care was taken such that the groundwater sampling locations (N = 35) were distributed over the area (Fig. 1). Surface water samples include three samples from the river and one sample from a dam that stores water for irrigation use. Electrical conductivity (EC) and pH were measured in the field using portable meters that was pre-calibrated using 84 and 1413 μS/cm conductivity solutions for EC and 4.01, 7 and 10.1 for pH. Calcium, magnesium, sodium, potassium, chloride, sulphate and nitrate were analysed using inductively coupled plasma optical emission spectrometry (ICP-OES). Carbonate and bicarbonate were measured in the field by titration (APHA 1998). Standards and blanks were run frequently for ensuring the accurate determination of the ionic concentration. Analytical precision was determined by calculating the ion balance error that was within ±8%.

Total dissolved solids (TDS) and total hardness (TH) of water was calculated using the formulae proposed by Lloyd and Heathcote (1985) and Sawyer and McCarty (1978), respectively.

$$ \mathrm{TDS}\ \left(\mathrm{mg}/\mathrm{l}\right)=\mathrm{EC}\ \left(\upmu \mathrm{S}/\mathrm{cm}\right)\times 0.64 $$
(1)
$$ \mathrm{TH}\ \left(\mathrm{mg}/\mathrm{l}\right)=2.497\ {\mathrm{Ca}}^{2+}\ \left(\mathrm{mg}/\mathrm{l}\right)+4.115\ {\mathrm{Mg}}^{2+}\ \left(\mathrm{mg}/\mathrm{l}\right) $$
(2)

Irrigation water quality was determined from various indices—magnesium hazard (MH) (Szabolcs and Darab 1964), residual sodium carbonate (RSC) (Eaton 1950), sodium percent (Na%) (Wilcox 1955), sodium adsorption ratio (SAR) (Richards 1954), Kelly’s ratio (Kelly 1957) and permeability index (PI) (Doneen 1964).

$$ \mathrm{MH}={\mathrm{Mg}}^{2+}/\left({\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}\right)\times 100 $$
(3)
$$ \mathrm{RSC}=\left({{\mathrm{CO}}_3}^{2-}+{{\mathrm{HCO}}_3}^{-}\right)-\left({\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}\right) $$
(4)
$$ \mathrm{Na}\%=\frac{\left({\mathrm{Na}}^{+}+{\mathrm{K}}^{+}\right)\mathrm{X}100}{\left({\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}+{\mathrm{Na}}^{+}+{\mathrm{K}}^{+}\right)} $$
(5)
$$ \mathrm{SAR}=\frac{{\mathrm{Na}}^{+}}{\sqrt{\frac{{\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}}{2}}} $$
(6)
$$ \mathrm{PI}=\frac{\Big({\mathrm{Na}}^{+}+\sqrt{{\mathrm{HCO}}_3^{-}\Big)}}{\left({\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}+{\mathrm{Na}}^{+}\right)}\times 100 $$
(7)
$$ \mathrm{Kell}{\mathrm{y}}^{\prime}\mathrm{s}\ \mathrm{ratio}=\frac{{\mathrm{Na}}^{+}}{{\mathrm{Ca}}^{2+}+{\mathrm{Mg}}^{2+}} $$
(8)

All concentrations from Eqs. 38 are expressed in milliequivalents per litre.

Seawater mixing index

Seawater mixing index (SMI) was developed by Park et al. (2005) to identify seawater intrusion based on magnesium, sodium, chloride and sulphate concentration. This was modified by Mondal and Singh (2011) to understand the salinisation process due to tannery effluents and was called saline water mixing index (SWMI). Thus, this index can be used for studying the seawater intrusion as well as salinisation due to pollution using the following equation (Park et al. 2005).

$$ \mathrm{SMI}=\left( a\times \frac{C_{\mathrm{Na}}}{T_{\mathrm{Na}}}\right)+\left( b\times \frac{C_{\mathrm{Mg}}}{T_{\mathrm{Mg}}}\right)+\left( c\times \frac{C_{\mathrm{Cl}}}{T_{\mathrm{Cl}}}\right)+\left( d\times \frac{C_{{\mathrm{SO}}_4}}{T_{\mathrm{SO}4}}\right) $$
(9)

where a, b, c and d are constants that denote the relative concentration proportion of sodium, magnesium, chloride and sulphate, respectively, in seawater (i.e. a = 0.31, b = 0.04, c = 0.57 and d = 0.08), C is the measured concentration of ions in milligrammes per litre and T is the regional threshold values of the ions estimated from the interpretation of the cumulative probability curves plotted against log concentration of the ions. When applied to areas other than to identify seawater mixing, the values of constants will vary.

Factor analysis

Statistical analysis was performed using IBM SPSS (2013) version 22.0. Correlation matrix was used to identify the positive or negative relationship among ions. Principle component analysis (PCA) was carried out first in which the original variables are transformed into new uncorrelated variables. These variables are called principle components (PC) and are linear combinations of original variables. With minimum loss of original information, PC informs on the most meaningful variable for the whole dataset (Helena et al. 2000). Output from PCA is further simplified in factor analysis through reducing the contribution of less significant variables. This is done by rotating the axis defined by PCA (Shrestha and Kazama 2007). Varimax is the most commonly used rotation in factor analysis and is adopted in this study. The rotation results in smaller number of factors for the same variables as with the larger set of original variables. PCA and factor analysis are expressed as in Eqs. 10 and 11, respectively.

$$ {z}_{i j}={a}_{i1}{x}_{1 j}+{a}_{i2}{x}_{2 j}+{a}_{i3}{x}_{3 j}+\dots +{a}_{i m}{x}_{mj} $$
(10)

where z is the component score, i and j are the component and sample number, respectively, a is the component loading, x is the measured variable and m is the total number of variables.

$$ {z}_i={a}_{f1}{f}_{1 i}+{a}_{f2}{x}_{2 i}+{a}_{f3}{x}_{3 i}+\dots +{a}_{f m}{x}_{mi}+{e}_{f i} $$
(11)

where z is the measured variable, i is the sample number, a is the factor loading, f is the factor score, e the residual term accounting for errors or other source of variation and m is the total number of factors.

Spatial interpolation methods

ArcMap version 10.1 was used to predict values in unsampled locations. The basis  of interpolation methods is that it assumes that values that are closer to one another are more alike than those that are farther away. Measured values surrounding an unmeasured location are used for prediction. Most popularly used interpolation methods, i.e. OK and IDW are compared in this study for mapping groundwater contamination zones.

Ordinary kriging

This is a geostatistical method based on statistical models that calculates relationship among the measured points. This multistep model assumes that the distance or direction between sample points reflects a spatial correlation that can be used to explain variation in the surface. Computing steps include exploratory statistical analysis of the data, variogram modelling, creating the surface and/or exploring a variance surface (ESRI 2012). Kriging has many types—simple kriging, OK and universal kriging of which OK is the most widely used. Kriging is also based on the application of weights to measured values. Unlike IDW that depends only on the distance between the sample points, kriging depends on the distance and the spatial orientation of the measured points. Weights in kriging are derived from semivariogram models that are of various types—spherical, circular, exponential, Gaussian and linear. Spherical semivariogram model is the most commonly adopted and is used in this study. The semivariogram function is calculated as half the average squared difference between the paired data values given as

$$ \gamma (h)=\frac{1}{2 N(h)}{\sum}_{i=1}^{N(h)}{\left[ z\left({x}_i\right)- z\left({x}_i+ h\right)\right]}^2 $$
(12)

where γ(h) is the semivariance value at h which is the distance interval, N(h) is the number of sample pairs within h and z(x i ) and z(x i  + h) are the sample values at two points located at distance h (Journel and Huijbregts 1978).

Inverse distance weighted

Predictions by this deterministic method are based on a linear combination of closely located values. Shepard (1968) defines IDW as

$$ z=\frac{\sum_{i=1}^n{w}_i{z}_i}{\sum_{i=1}^n{w}_i} $$
(13)
$$ {w}_i=\frac{1}{d_i^p} $$
(14)

where z is the interpolated value, z i is the known value, n is the total number of known values used in interpolation, d i is the distance between known and interpolated values and p is the power parameter where weight decreases as distance increases from the interpolated values, hence, the name IDW. Interpolation results are highly influenced by this weighting power (Mueller et al. 2001). Most commonly used and default power in ArcGIS, i.e. inverse of the distance raised to the 2nd power (Gong et al. 2014) is used in this study.

Overlay analysis

To map groundwater contamination zones, overlay analysis was carried out for eight variables. First, these parameters were subjected to spatial interpolation using OK and IDW and then classified as desirable and undesirable areas for drinking purpose by comparing the concentrations with the World Health Organization (WHO) standards. The desirable areas are given a rank of ‘1’ and undesirable areas are ranked ‘2’. Usually after ranking, relative weights are assigned to the different parameters such as in WQI (Mirzaei and Sakizadeh 2016) and DRASTIC (Brindha and Elango 2015) methods. Brindha and Elango (2012b) have applied equal weightage for all parameters in groundwater suitability mapping based on the argument that even if one parameter exceeds the limit, it is unsuitable for the intended use. This method (Brindha and Elango 2012b) was followed without applying weights to the spatially interpolated layers for overlay analysis. The various spatial layers are overlaid using the ‘intersect’ tool in ArcMap 10.1. A suitability index is arrived by calculating the sum of ranks. A total index of 8 indicates that all parameters are within limits, 9 indicates that one parameter exceeds the drinking water limits and so on. Highest suitability index of 16 indicates that all the samples have exceeded limits for all parameters.

Result and discussion

Groundwater level ranged from 1 to 46.5 m bgl with an average of 12.2 m bgl. Spatial variation of depth to groundwater level indicates deeper water levels in the industrial areas located near the coast and in agricultural areas inland. Groundwater flows towards the sea (Fig. 2).

Fig. 2
figure 2

Spatial variation in groundwater level

Drinking and irrigation water quality

Statistical summary of various ions in groundwater and measured concentrations of surface water are given in Tables 1 and 2. Target water quality proposed by Department of Water Affairs and Forestry (DWAF), South Africa is the range that has no negative effect on human health. It denotes good or ideal water quality rather than being merely acceptable. This also means that concentrations of ions beyond this range can still be suitable for drinking under certain circumstances (DWAF 1996). Seventeen samples were within the target water quality range (<700 μS/cm) for EC, and 16 samples were permissible, i.e. drinking water ranging between 700 and 4500 μS/cm does not cause adverse health effects when consumed for a short time (DWAF 1996). River and dam water were also within the suitable to tolerable range for drinking and irrigation. TDS and TH were grouped based on the classification by Freeze and Cherry (1979) and Sawyer and McCarty (1978). Most of the area had hard water (N = 28) except for few locations (N = 7) based on TH (Fig. 3). TDS indicates that large part of the samples was fresh (N = 29) except for few samples being brackish (N = 5) and saline (N = 1) (Fig. 3). Brackish and saline water is unsuitable for irrigation, and if used for agricultural purpose over a long-term, it may decrease the soil fertility due to increase in salinity.

Table 1 Statistical summary of various parameters in groundwater (N = 35)
Table 2 Chemical constituents in surface water
Fig. 3
figure 3

Groundwater classification based on TH and TDS

Water quality requirements in South Africa are much more stringent than the values recommended by WHO. A comparison of these standards for various chemical ions is given in Table 3 along with the natural concentration of these ions in fresh water and seawater. Groundwater exceeded DWAF (1996) limits for pH, calcium, magnesium, sodium, potassium, chloride and sulphate in 3, 31, 17, 26, 14, 54 and 3% of sampling locations, respectively. Similarly, groundwater exceeds the highest desirable limit (WHO 1993) for pH, calcium, magnesium, chloride and sulphate in 20, 14, 9, 17 and 3% of the samples, respectively, and exceeds the maximum permissible limits of WHO (1993) in 14, 3, 3, 6 and 0% samples, respectively. Sodium and potassium do not have highest desirable limit, but, has maximum permissible limits which exceeded in 14 and 23% of the locations. Limits for bicarbonate are not given by DWAF and WHO. It is suggested that bicarbonate levels less than 200 mg/l is safe for human health through the drinking water pathway (Bhardwaj and Singh 2011). Surface water was within this limit, but, it exceeds in 10 groundwater sampling locations. Surface water is safe based on WHO standards for all ions except for pH (Table 2). DWAF standards also indicate that surface water is suitable for drinking. Mhlathuze River water exceeded the pH range at one location. Dam water exceeded limits for sodium and chloride concentrations.

Table 3 Permissible limits of chemical ions in drinking water and composition of ions in fresh and seawater

Water quality evaluation for irrigation is based on many indices (Table 4). MH and Kelly’s ratio had unsafe water for irrigation at more than 50% of the area. This shows that hardness is an issue for irrigation. Sodium and EC play an essential role in determining the irrigation suitability. In most locations, water was suitable or permissible for irrigation for sodium-based indices, i.e. RSC, PI, Na% and SAR. Sodium plotted against EC (Fig. 4a) indicates most area has high salinity and low sodium hazard. Two groundwater samples were undesirable for irrigation (C1S1 and C2S1) with high salinity and high sodium. River water had comparatively low salinity as well as low sodium hazard and fit for agricultural use. Dam water had high salinity and low sodium hazard (C3S1) like the nearby groundwater.

Table 4 Irrigation water quality
Fig. 4
figure 4

a Sodium and salinity hazard. b Water type

Geochemical processes

Sodium chloride was the dominant water type followed by Ca–Mg–Cl (Fig. 4b). Order of dominance of cations was Na > K > Ca > Mg, and that of anions was Cl > HCO3 > SO4 for groundwater and for surface water were Na > Mg > Ca > K and HCO3 > Cl > SO4. General processes controlling salinisation of aquifers are identified as oxidation–reduction, ion exchange, precipitation and dissolution and mineral weathering (Appelo and Postma 1993; Richter and Kreitler 1993; Jones et al. 1999). To narrow down to the processes controlling the groundwater characteristics, Gibbs plot was used. This plot indicated rock–water interaction as the dominant process with few cases of evaporation and precipitation. Rock–water interaction includes many processes and needs further refinement to identify the process that plays a major role in this area. Change in Na–Cl dominant water to Ca–Mg–Cl is suggestive of cation exchange reaction (Kumar et al. 2014). Plot of (Ca + Mg) − (HCO3 + SO4) versus Na–Cl determines the gain or loss of Ca and Mg due to carbonates and gypsum dissolution and gain or loss of Na due to halite dissolution (Bouzourra et al. 2015). A linear relationship with a slope of −1 indicates cation exchange and samples that plot close to 0 are not influenced by ion exchange (Jankowski et al. 1998). Figure 5 indicates that reverse ion exchange with a slope of −0.7 occurs at many cases. Sodium normalized with calcium, magnesium and bicarbonate indicate silicate weathering as the dominant process (Fig. 6). Silicate minerals such as quartz, amphibole and pyroxene are present in the aquifer materials of this area. This is likely to be the source for calcium, magnesium, sodium and potassium as reported by Rajesh et al. (2012) and Brindha et al. (2016) in similar situations. Carbonic acid causing weathering of silicate minerals is denoted as

$$ \left(\mathrm{Na},\mathrm{K},\mathrm{Ca},\mathrm{Mg}\right)\ \mathrm{silicate}+{\mathrm{H}}_2{\mathrm{CO}}_3\to {\mathrm{H}}_4{\mathrm{SiO}}_4+{\mathrm{H}\mathrm{CO}}_3+\mathrm{Na}+\mathrm{K}+\mathrm{Mg}+\mathrm{Ca}+\mathrm{solid}\ \mathrm{products} $$
(15)
Fig. 5
figure 5

Ion exchange process in groundwater

Fig. 6
figure 6

Weathering of minerals in groundwater

Points below the 1:1 line in the plot between sodium and chloride also indicate silicate weathering (Stallard and Edmond 1983) (Fig. 7). A linear increment of sodium and chloride suggests seawater intrusion (Vetrimurugan et al. 2013), i.e. the samples that plot near the fresh-saline water mixing line. Halite dissolution in samples also plots on the 1:1 line. This denotes interactive process between groundwater and the aquifer clay matrix (Bouzourra et al. 2015). However, there is no likely geological source for halite in this aquifer. Hence, mixing with seawater from the rivers as well as directly from the sea is the likely cause for this relationship (Fig. 7).

Fig. 7
figure 7

Sodium versus chloride indicating various geochemical processes

Threshold values for SMI were calculated by plotting the cumulative probability percentages of the parameters against the measured concentration of corresponding parameters. The inflexion point for sodium and chloride were at 300 and 550 mg/l (Fig. 8). Two inflexion points were noticed for magnesium and sulphate (Fig. 8). The threshold values represent the natural background levels of ions in groundwater, and the upper and lower limits for magnesium and sulphate were both within the drinking water standards limits (WHO 1993). If the lowest threshold values are considered, all samples will have SMI greater than 1 and will be classified as a record of seawater mixing (Park et al. 2005). In this area, seawater intrusion along with other sources is the cause for contamination. Hence, to differentiate these contamination zones, the upper limits for magnesium and sulphate, i.e. 40 and 65 mg/l, respectively, were chosen. Sodium and chloride, the significant ions representing seawater intrusion, have only one threshold limit that is higher than the maximum permissible limits for drinking. So, using the highest threshold values for magnesium and sulphate is considered suitable. SMI was higher at two locations (Fig. 9), i.e. the landfill and industrial sites. This indicates that seawater intrusion is not a major threat, but it could have contributed to the chemical composition of groundwater to some extent especially for sodium and chloride. Saline intrusion in the Lake Mzingazi (located north of the current study area) was reported by Simmonds (1990).

Fig. 8
figure 8

Cumulative probability distribution of ions for determining the regional threshold values

Fig. 9
figure 9

Spatial distribution in SMI in groundwater

Statistical analysis

Inter-ion relationship (Table 5) was strong among calcium, magnesium, sodium, potassium, chloride and sulphate. Bicarbonate correlates strongly with potassium and moderately with magnesium (Table 5). To further ascertain and group the sources identified for the presence of ions in groundwater, multivariate statistical analysis was performed for the hydrochemical data. Eigen values above 1 were exhibited by two factors with a cumulative variance of 87% (Table 6). First factor with the highest variance of 77.6% shows positive relation among calcium, magnesium, sodium, chloride and sulphate. Comparing the measured concentrations of ions in this study (Table 1) with the natural concentrations in fresh and seawater (Table 3) indicates that calcium, magnesium and sulphate were in the fresh water range. Measured concentration of sodium and chloride was in the fresh water and seawater range. Rock–water interaction processes including silicate weathering, reverse ion exchange and mixing of seawater are the major sources of these ions in groundwater. Second factor shows stronger relation for bicarbonate and potassium due to alkaline nature of groundwater. This is because factor analysis was carried out with groundwater samples collected from the entire region including the landfill and industrial locations. In regions other than the landfill and industries, the relationship is negative indicating the influence of seawater mixing with groundwater (Fig. 10a). Only in landfill and industrial areas, a strong relation between potassium and bicarbonate was observed (Fig. 10b) indicting a common source of contamination.

Table 5 Correlation among variables in groundwater
Table 6 Factor scores of groundwater samples
Fig. 10
figure 10

Relationship between bicarbonate and potassium in groundwater (a) except in landfill and industrial areas (b) at landfill and industrial areas

Groundwater contamination mapping

Due to the different permissible limits suggested by different organisations (DWAF and WHO), there are varied number of samples that exceed the limits for the same ion. In such case, it is a challenge to choose the standards to adopt for decision making. Nature of these studies only informs us on the quality of water at the sampled location and not for areas between these locations. Also, one location can be suitable based on most ions but unsuitable for few. Spatial tools assist in the interpolation of values in areas where samples are not collected, and overlay analysis can help to arrive at an integrated groundwater suitability or contamination map.

Accuracy of these methods in groundwater contamination mapping was compared using OK and IDW interpolations. Each parameter was interpolated with these methods and differentiated into desirable and undesirable areas based on WHO (1993) drinking water standards (Figs. 11 and 12). In this study, the internationally recognized WHO standards are used as the main aim is to test the capability of these methods in using them as a management tool that can be adopted to any part of the world. The standards used for differentiating desirable and undesirable zones as per WHO standards for pH, calcium, magnesium, sodium and potassium were 6.5 to 8.5, 75, 50, 200 and 12 mg/l, respectively. Limit adopted for bicarbonate, chloride and sulphate was 200 mg/l. EC represents the total ionic concentration, and it will be double accounted if included in the analysis. Carbonate was not present in groundwater. TDS and TH were calculated from other parameters. To avoid errors, these four parameters were not included in the overlay analysis.

Fig. 11
figure 11

Spatial interpolation of parameters by OK

Fig. 12
figure 12

Spatial interpolation of parameters by IDW

Comparing the interpolation among these methods indicates similar spatial variation in pH, calcium, magnesium and sulphate. Sodium, potassium and chloride highly differ in their suitable areas and bicarbonate shows moderate variation (Figs. 11 and 12). Area classified as suitable and unsuitable by OK and IDW is given in Table 7. These eight variables were subjected to overlay analysis without assigning any weights (i.e. all parameters are considered equally important). Larger variation in sodium, potassium and chloride by the two interpolation methods is because of two samples, one located in the centre and one near the coast. These two locations have high concentration of most ions than the other samples. IDW exhibits that most of the area is contaminated (70%) compared to OK (28%) (Fig. 13). Three zones within the study area exhibiting high concentration of most ions were correctly mapped by OK than by IDW. Similar results were also reported between these methods by Mirzaei and Sakizadeh (2016). Areas exceeding the limits by one, two or more parameters by OK and IDW are given in Table 8. High discrepancy in the results is found in areas where groundwater exceeds the limits in 0, 1 and 2 parameters.

Table 7 Suitable and unsuitable areas for various parameters and interpolation methods
Fig. 13
figure 13

Groundwater contamination maps

Table 8 Groundwater contamination areas by OK and IDW

OK performs better than IDW for the dataset of this study. Mirzaei and Sakizadeh (2016) indicate that the spatial distribution of wells plays an important role in the interpolation analysis. It is argued by Falivene et al. (2007, 2010) that kriging provides better results than IDW in regularly spaced datasets as the former considers the relative position of the data from the interpolated point. Kriging was better than IDW for many studies, and the uncertainty of the interpolation methods associated with the results has been questioned by many researchers (Xie et al. 2011; Gong et al. 2014; Mirzaei and Sakizadeh 2016). Number of wells, their location, distribution, depth, distance between the samples and choice of interpolation techniques influence the decision-making process. Unusual values at one location also can influence the interpolation of results. Hence, using one of the spatial methods alone is not sufficient to determine and delineate the areas of groundwater pollution. It should be used along with several other statistical analyses. From field visits and multiple methods used in this study, three contamination zones were identified in Richards Bay. They are (1) agricultural areas in the north-west, (2) landfill site in the centre and (3) effluent from industrial operations near the harbour/sea in the eastern part of the study area (Fig. 13).

Conclusion

This study demonstrates the use of spatial interpolation and statistical methods in mapping groundwater contamination in Richards Bay, South Africa. Groundwater quality exceeded limits for drinking use for most ions at certain locations. Irrigation water quality was unsuitable in most locations based on MH, Kelly’s ratio and Na%. Sodium and chloride were the abundant ions in groundwater because of seawater intrusion. Concentration of calcium, magnesium and sulphate was in the fresh water range. Sodium and chloride were in the fresh water and seawater range. Dominant geochemical processes controlling the presence of these ions in groundwater include silicate weathering, ion exchange and mixing of fresh water with seawater. SMI was higher in two clusters indicating contamination from landfill and industrial sources. Water quality in this area was poor, and management measures are needed especially at the landfill site and industrial areas to prevent percolation of leachate and wastewater.

OK and IDW showed diverse results with contaminated zones at 28 and 70% of the areas, respectively. Highly contaminated areas surrounded by less contaminated areas are mapped better by OK. The three contaminated zones by landfill, industries and agriculture were identified precisely by OK while IDW connected all the contaminated zones indicating most of the study area as contaminated. Discrepancy in results predicted by OK and IDW was greater when (1) one or two parameters exceeded the prescribed limits for water quality and (2) when the water quality was within limits for all eight parameters. Less variation in results was observed when four to eight parameters exceed the water quality criteria. Because of the unclear nature of outcome from different interpolation methods, the results and maps obtained by various techniques should be compared with one another along with field information to arrive at more meaningful understanding of the situation. This study sets as an example for such a multi-method analysis.