Introduction

Groundwater in the Algerian Sahara represents the main source of water for domestic, agricultural and industrial uses. The Hassi Messaoud oil field is well known in Algeria by its intense industrial activity of hydrocarbon upstream and downstream. Nearby, the city of Hassi Messaoud which is situated 650 km southeastern Algiers, dependents on groundwater supply especially from the Terminal Complex (CT) aquifer. The demand for water increases at a quick rate as a result of the development of needs by habitation and to satisfy the industrial request. Therefore, ensuring the quantity and the quality of water supply can often appear a real challenge.

The CT aquifer in this area is mainly formed by sandy and gavels deposits of Mio-Pliocene and carbonates of Senonian rocks (UNESCO 1972), and is a part of a large aquifer system called Western Sahara Aquifer System (NWSAS). This system is shared by three countries (Algeria, Tunisia, and Libya) with over a one million km2 of lateral extension and a notable variation of the thickness from one region to another (UNESCO 1972). In this aquifer, waters often display discharge temperatures around ~ 25 °C and are highly mineralized thus it needs to be treated before its uses. In addition, water as everywhere in the Algerian Sahara, is characterized by high concentrations of fluoride which exceed the standards of World Health Organization (WHO) (≥ 1.5 mg/l) for drinking water as reported in many studies (e.g. Youcef and Achour 2001, 2004; Guendouz et al. 2003; Djellouli et al. 2005; Messaitfa 2008; Nezli et al. 2009). The fluoride (F) has an atomic number of 9 with a molecular mass of 19 g/mol and is characterized by high reactivity. The inorganic fluoride occurrence is usually free in fluid solutions (Wedepohl 1974). According to the WHO (2017), fluoride has a protective effect which increases with content up to about 2 mg/l in drinking water while the minimum concentration is around 0.5 mg/l. On the contrary, high fluoride content (≥ 1.5 mg/l) has an undesirable effect and it accelerates dental and skeletal fluorosis. Furthermore, this risk may increase depending on drinking water intake which is the most likely high in warmer and arid areas such as Algerian Sahara. Therfore, good knowledge of its spatial distribution into the aquifer is required.

This study is performed on groundwater from the CT aquifer below the Hassi Messaoud city and is aimed at (1) investigating fluoride contents from CT wells that are used for water supply (2) studying the distribution of fluoride in the aquifer using indicator kriging (IK). IK will be applied on a set of available data and it helps to perform a probability map of not exceeding a threshold of fluoride. The selected threshold here is represented by the median value of fluoride content in investigated samples (Me = 2.1 mg/l) corresponding to a critical fluoride content in drinking water which accelerates significantly dental and skeletal fluorosis. IK method which is a nonlinear geostatistical interpolation is more suitable in our case study than ordinary kriging (OK) which has some limitations such as requiring an assumption of a normal distribution of data (Krige 1951; Matheron 1963; Lloyd and Atkinson 2001). IK can provide a solution to that required assumption by transforming continuous to indictor data based on a threshold that can be defined by a median value (e.g. Lloyd and Atkinson 2001; Mohammadpour et al. 2019). Note that IK is applied by Journel (1983) to carry out spatial distribution for interpolation and it is favoured here, for studying fluoride distribution regarding 2.1 mg/l without any precondition of the distribution of data (Jang et al. 2008). In the literature, IK is largely used for mapping of heavy metals in groundwater such as arsenic (Lee et al. 2007) and water salinities (Belkesier et al. 2018). Fluoride data will be transformed into binary coding (0, 1) regarding the 2.1 mg/l value. In other words, all interpolations will be between 0 and 1, and the obtained map will show the probability of not exceeding the indicator value of fluoride in water. The obtained map allows classifying different zones in the aquifer regarding their probabilities of not exceeding 2.1 mg/l. If a low probability of not exceeding 2.1 mg/l threshold is recorded, water can therefore represent a risk to health and needs defluoridation before its consumption. The obtained map can act as a help of decision-making for water management and for selecting the ideal position to drill new wells for drinking water in the future.

Study area

Geology

The Hassi Messaoud city is located 650 km southeastern of Algiers, about 86 km from Ouargla City and 350 km from the Algerian–Tunisian border (Fig. 1). This area is well known in Algeria and in worldwide by its large oil and gas industry. Therefore, the population has concentrated in this city nearby working sites and is estimated at around 45,000 inhabitants in 2008. The studied area lies between latitudes 31° 39′ 16.47″ N and 31°45′ 12.47″ N, and longitudes from 6° 2′ 11.47″ E to 6° 4′ 41.30″ E. The climate is arid with low precipitations (average monthly: 0.1–7 mm), and high temperatures particularly in summer which may exceed 45 °C.

Fig. 1
figure 1

Location map of the study area and sampled wells from the Complex Terminal aquifer in the Hassi Messaoud city

The geology of Hassi Messaoud area is well known because of many geological works that are conducted during exploration activity by oil companies such as SONATRACH and its partners. A synthesis of the petroleum geology in Algeria which includes detailed geological background of Hassi Messaoud and other oil–gas basins is published by Sonatrach-Schlumberger (WEC 2007). According to this report and other unpublished documents of drilling logs (Sonatrach 2002), the Hassi Messaoud area is considered to belong within to intracratonic basin of Oued Mya. Lithologies are composed of a succession of about 4393 m thickness. The sedimentary layers are aged from Cambrian to Quaternary, and are characterized by the absence of Silurian, Devonian, Carboniferous and Permian due to the Hercynian unconformity (WEC 2007).

Hydrogeology

The Hassi Messaoud area as the Northern Sahara of Algeria contains a system of two important aquifers which extends to Tunisia and Libya hosting considerable quantities of paleowater according to isotopic data (e.g. Edmunds and Gaye 1997; Guendouz et al. 1997; Edmunds et al. 2003). In this system, two main aquifers can be distinguished: (1) the confined and discharged Continental Intercalary (CI) represented by a multi-layer aquifer including Albian, Barremian and Neocomian formations. The Albian aquifer is the most exploited representing the main target of drilling wells for water in Northern Sahara of Algeria. (2) The Complex Terminal (CT) overlays the Continental Intercalaire (CI) and is formed mainly by Mio-Pliocene sands and Senonian carbonates. In the study area, the Senonian carbonates have a reduced thickness as great Oriental Erg where the Mio-Pliocene reservoir is exploited for water. The flow direction of water in this aquifer is from Tinrhert Plateau in the South towards the Chotts region in North (Fig. 2) (Guendouz et al. 2003).

Fig. 2
figure 2

Hydrogeological cross section of the Complex Terminal aquifer in Algeria (after UNSCO 1972; Guendouz et al. 2003)

Data and methods

Samples and laboratory analysis

Although the Hassi Messaoud area represents an industrial zone that attracts the population, only a few investigations were conducted on hydrochemical issues (e.g. Bouselsal 2017; Sahri et al. 2017). The high intake of groundwater from CT aquifer to human consumption especially in the summer period may increase health problems due to the concentration of some elements such as fluoride. Therefore, this study aims to discuss fluoride concentration together with major physic-chemical parameters. The wells that are used for water supply in the city were sampled and sixteen water samples were collected from the CT aquifer displaying relatively good spatial distribution for a nonparametric geostatistical study (see Fig. 1). Physicochemical analyses were performed in laboratories of the national agency of water resources (ANRH) and the Algerian waters (ADE) of Ouargla following Rodier (1996) standards. The concentrations of Na+ and K+ were determined using a flame spectrophotometer (PFP 7, JENWAY). F, SO42− and Cl were recorded by a spectrophotometer machine (DR2000, HACH) and HCO3 was determined by titration. Mg2+ and Ca2+ were analysed by complexometric titration using Ethylenediaminetetraacetic acid (EDTA) protocol (Mg2+ = TH-Ca2+). Other parameters were measured “in situ” directly after sampling such as Electrical Conductivity (EC) using a multiparameter. Based on the percentage ion-balance, the overall test of water analysis accuracy indicates that samples have charge imbalances less than ± 6% except one sample (F19 = ± 6.69%). The Durov diagram was used to determine water facies and the chemistry origin of water. Note that the Durov diagram helps to carry out a graphical representation of the water chemistry by plotting cations and anions into separate ternary diagrams (Durov 1948; Lloyd and Heathcote 1985; Belkhiri et al. 2010).

Saturation index

In natural solutions, water mineral interactions represent the main control of water alkalinity (e.g. Morel 1983; Drever 1988). The carbonate alkalinity (Alcc) is a part of total alkalinity represented by HCO3 and CO32− anions where calcite is the main mineral affected. Saturation state regarding preponderant minerals can be recognized by thermodynamic models and computed by ‹‹ Phreeqci.v.3, Parkhurst and Appelo 2013›› software using its database (Phreeqci.dat). The calculation of the saturation index as SI = log (Q)/log (Kps) of dissolved minerals at temperature of 25 °C in studied water samples based on the law of extended Debye–Hückel. A thermodynamic equilibrium state is considered when SI ranges between − 0.5 and + 0.5. Lower SI values than − 0.5 indicate unsaturation, whereas those higher than + 0.5 reflect oversaturation. The saturation indices were used to clarify possible origins of fluoride in studied samples.

Statistical analysis

Data were statistically analysed using the following methods: (1) descriptive statistics to determine data characteristics such as central tendency and dispersion parameters. (2) Bivariate statistics were used to perform a correlation matrix with the help of calculating Pearson ratios of correlation. (3) Multivariate statistical analysis was performed using Principal Component Analysis (PCA), which is considered an appropriate method related to factor analysis. It used on exploratory data analysis to carry out predictive models according to the correlation between variables in a multidimensional space. PCA consists in data projection into n dimensional space (scatter plot), i.e. a dimensionality-reduction technique that transforms high-dimensional datasets into smaller-dimensional subspace. This simplification helps the interpretation of different relations between the studied variables and individuals. The results of a PCA are usually discussed in terms of component scores (factor scores) and loadings (Shaw 2003). In this study, PCA was applied to evidence possible associations of physicochemical parameters which may have the proxy to recognize different origins in acquiring water chemistry.

Geostatistics

Geostatistical methods have been used in different scientific disciplines, especially in earth sciences, where first applications were carried out in mining sites (e.g. Krige 1951; Matheron 1963, 1971). The fundaments basis of geostatistics has been wholly given by many authors (e.g. Matheron 1971; Chauvet 1999). Geostatistics aims at studying and characterizing spatial data where sampling coordinates are coupled with the studied variables to carry out mathematical functions that interpret the spatial variability (regionalized variables). Geostatistical interpolation methods give an ideal estimation at an un-sampled location. We define here the following vocabulary and geostatistical methods that are used in this study:

The variogram is the basic tool in the geostatistical analysis that describes the degree of spatial correlation between pairs of observations separated by a certain distance (h). In other words, the variogram \(\gamma (h)\) consists in one-half of the variance of the difference between the attribute values at all points \(z\left( x \right)\) and \(\, z\left( {x + h} \right)\) separated by a distance (h) (Fig. 3a). The experimental variogram is calculated by the following formula:

Fig. 3
figure 3

a Graphic representation of two points for variogram calculation and, b Graphic representation of variogram showing the fitting parameters (Matheron 1971)

$$\gamma (h) = \tfrac{1}{2N(h)}\left\{ {\sum\limits_{i = 1}^{N(h)} {\mathop {\left[ {z(x) - z(x + h)} \right]}\nolimits^{2} } } \right\}$$
(1)

where \(\gamma (h)\) is the semi-variogram for the distance \((h)\); \(N(h)\) is the number of pairs for observed data in each lag \((h)\); \(z(x)\) and \(z(x + h)\) represent the values of studied regionalized variable in the location \((x)\) and \((x + h)\), respectively.

The experimental variograms are after fitted by theoretical models including the following parameters: nugget effect (c0), Sill (c) and range (a). Note that each parameter has a significant role in the spatial study (Fig. 3b). The nugget effect (c0) represents the variogram value at zero lag distance \(\gamma (h = 0)\) indicating the variability in very small scale than the sampling distance. Large nugget effect can be interpreted as measurement errors. The sill (c) represents the magnitude of variogram at the distance when reaching the range (a), which indicates the influence radius and the distance which measured observations are no longer correlated spatially (Fig. 3b). We denote that several mathematical models are available for fitting experimental variograms such as spherical, exponential, gaussian and linear models are founded in the literature (see Xavier 2001).

Indicator kriging (IK)

Indicator kriging (IK) is a nonparametric geostatistical method which is based on transforming measurements to binary values (0, 1) depending on if a threshold (Zk) is exceeded or not (Goovaerts 2000). In this study, the transformation was performed according to fluoride threshold (2.1 mg/l) which represents the median value. The 2.1 mg/l fluoride threshold represents a cut-off where fluoride concentration becomes considerably harmful to human health. Indictor data were transformed for each sampling well where continuous data were converted to discrete indicator variables by giving the probability of “0” for the fluoride values exceeding the selected threshold and the probability of “1” for fluoride values that are equal or below the same selected threshold.

$$I(xi; \, Z_{k} ) = \left\{ {\begin{array}{*{20}l} 0 \hfill & {{\text{if }}Z(xi) \, > \, Z_{k} } \hfill \\ 1 \hfill & {{\text{if }}Z(xi) \, \le \, Z_{k} } \hfill \\ \end{array} } \right.\quad Z_{k} = 2.1\,{\text{mg/l}}$$
(2)

The variogram will be established and fitted on the newly transformed data while IK is used to create a probability map of not exceeding fluoride threshold. The mapping will be performed on a regular grid 250 m × 250 m size.

Cross-validation

The cross-validation is a procedure used to evaluate the performance of IK estimation and the fitting quality of variogram. In cross-validation, each measured data value will be removed individually and re-estimated using neighbour data. Measured and re-estimated data are after compared according to mean error (ME), average standard error (ASE), mean standard error (MSE), root square error (RMSE), mean square standard error (MSSE) and root-mean-square standardized error (RMSSE) using the following formulas from the literature (e.g. Chilès and Delfiner 1999; ESRI 2008; Ashrafzadeh et al. 2016):

$${\text{ME}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {(Zx - Zx*)}$$
(3)
$${\text{ASE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\sigma^{2} } }$$
(4)
$${\text{MSE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\frac{{(Zx* - Zx)^{2} }}{{\sigma_{k} }}}$$
(5)
$${\text{RMSE}} = \sqrt {\frac{{\sum\nolimits_{i = 1}^{n} {(Zx - Zx*)^{2} } }}{n}}$$
(6)
$${\text{MSSE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\frac{{(Zx* - Zx)^{2} }}{{\sigma_{k}^{2} }}}$$
(7)
$${\text{RMSSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\frac{{(Zx* - Zx)^{2} }}{{\sigma_{k}^{2} }}} }$$
(8)

Note that Z(x) is the observed value, Z(x)* is the re-estimated value and n represents the number of observation used. For good estimation performance, the ME and MSE should be close to zero (0) together with smaller RMSE.

Results and discussion

Elementary statistics

Main elementary statistics of physico-chemical parameters of analysed samples are given in Table 1. The pH varies between 6.6 and 7.8. Water samples display high electrical conductivities (EC from 1900 to 4890 µS/cm; average = 2758 µS/cm) associated with high standard deviation (σ = 892 µS/cm) indicating substantial variability from one sample to another. Samples show high concentrations of Ca2+, Mg2+, Na+, SO42− and Cl (average = 199 ± 44 mg/l, 92 ± 33 mg/l, 312 ± 205 mg/l, 660 ± 336 mg/l and 529 ± 258 mg/l, respectively (Table 1).

Table 1 Physico-chemical parameters and descriptive statistics (all values in mg/l except electrical conductivity EC in µS/cm)

Major chemical composition reflects the origin of mineralization from hosting rocks in water reservoirs that are represented mainly by evaporite minerals, and minor carbonate minerals. Fluoride content varies between 1.6 and 2.9 mg/l (average = 2.1 ± 0.4 mg/l) exceeding the WHO standards for drinkable water. The obtained Fcontents are quite similar to those reported in other location from the Complex Terminal aquifer of northern Algerian Sahara (e.g. Messaitfa 2008; Nezli et al. 2009).

Hydrochemical characterization

Based on the Durov diagram, the studied samples from the Complex Terminal aquifer in the Hassi Messaoud are mainly plotted into the field 5 and 8 indicating that they are acquiring mineralization from the dissolution of hosting formation (Fig. 4). According to Lloyd and Heathcoat (1985), the field “5” shows that there is no dominance any cation or anion suggesting probable freshwater recharge (40% of analysed samples). The field “8” exhibits the dominance of Na+ and Cl indicating the influence of basic ions exchanges on water. The field “9” represents water samples that display high saturation of Na+ and Cl which is linked to halite dissolution. We denote that the last two fields represent 60% of analysed samples and it reflects the role of geological formation in acquiring mineralization of the Complex Terminal aquifer and is represented by sand and evaporites (Mio-Pliocene). The direct contact with the hosting formation conducts to the dissolution of several minerals in water (Fig. 4).

Fig. 4
figure 4

The Durov plot (Lloyd and Heathcote 1985) depicting hydrochemical processes involved in the Complex Terminal of Hassi Messaoud aquifer

Correlation matrix

Pearson correlation ratios between physic-chemical parameters are given in Table 2. It shows high and positive correlation (r) between EC and the following elements: TDS (0.99) K+ (0.91), Na+ (0.98), SO42− (0.83), Cl (0.90), Ca2+ (0.77), Mg2+ (0.66) (r significant at p < 0.01).

Table 2 Pearson’s correlations between physico-chemical parameters in CT groundwater at Hassi Messaoud (significant correlations at p  <  0.05 are marked in bold)

The positive correlation between Ca2+ and SO42− (0.51, r significant at p < 0.06) is higher than that between Ca2+ and Mg2+ (r = 0.24) suggesting that the Ca2+ in CT water is derived mainly from evaporites (dissolution of gypsum and anhydrite). The low and non-significant correlations between F versus other elements indicates that fluoride may be originated from other mineral phases and is related to different thermodynamic equilibrium of fluoride minerals as suggested by Nezli et al. (2009) and is discussed below.

Principal component analysis (PCA)

Principal Component Analysis is used here to identify the origin of water mineralization. Selected physico-chemical parameters are plotted on main components PC1, PC2 and PC3. PC1 versus PC2 and PC1 versus PC3 gather a cumulative variance of 72.06% and 65.39%, respectively (Fig. 5). Both the two last projections show one cluster of physico-chemical parameters representing the mineralization pole which consists of EC, K+, Na+, SO42−, Cl, Ca2+ and Mg2+. This cluster reflects the origin of water mineralization that is represented mainly by leaching of evaporites.

Fig. 5
figure 5

Loading the factors from the PCA showing the mineralization association a PC1 versus PC2, b PC1 versus PC3

The F position against the mineralization cluster suggests that fluoridation is relatively unrelated with the dissolution of evaporites and carbonates but with other minerals containing fluoride (Fig. 5). According to Pearson’s correlations and PCA analysis, the fluoride presence may be linked to other elements such as fluorine (CaF2) and Fluor-apatite (Ca5 (PO4)3F) minerals as suggested by Nezli et al. (2009) in the CT groundwater from the Ouargla region.

Saturation indices and fluoride origin insights

The saturation indices (SI) of some common minerals are given in Table 3. It shows that waters are undersaturated with respect to evaporite minerals (gypsum, anhydrite, halite, and sylvite), indicating that these minerals are undergoing dissolution in the studied waters. However, waters are slightly saturated to equilibrium regarding calcite, aragonite and dolomite (Table 3). These observations reflect water facies which is represented mainly by Cl, Na-SO4 and less Ca water-types. It argued furthermore that water mineralization is originated mainly from the dissolution of evaporite-bearing Mio-Pliocene hosting formation and, secondly from carbonaceous rocks of Senonian rocks.

Table 3 Saturation indices with respect to main minerals and partial pressure of CO2 in the studied water wells

On other hand, positive Pearson’s correlations that have been recorded between Ca and saturation indices of dolomite (r = 0.52), fluorite (r = 0.42), calcite (r = 0.40) and, in a lesser degree with respect to gypsum (r = 0.33) indicating that Ca is chemically controlled by carbonates and evaporite minerals. The presence of Na–SO4 water type indicates a Ca2+ replacement from water by Na+ of clayey formation in the aquifer. Furthermore, the Ca water type that is hosted especially in carbonates of Senonian rocks shows CO2 pressure above than atmospheric pressure (Log10pCO2 = − 3.5) and is undersaturated with respect to dolomite, calcite and aragonite excepting some samples (F04, F06, F08) which show an oversaturation regarding to the same minerals together with lower CO2 pressure to similar to atmospheric. This may be interpreted by water degassing when collecting samples (Fig. 6).

Fig. 6
figure 6

Evolution of the saturation index with respect to Ca-minerals and partial pressure of CO2 (pCO2)

Saturation indices are also used to forecast reactive minerals in subsurface based on groundwater chemistry and to clarify water mineralization origin without investigating solid phase (Deutsch 1997). Saturation index appears to be a solution to track the fluoridation origin in studied groundwater. Fluorite (CaF2) is often considered to be a source of fluoride in waters. Considering the equilibrium equation is written as the following:

$$\begin{aligned} & {\text{Ca}}^{2 + } + 2{\text{F}}^{ - } \leftrightarrow {\text{CaF}}_{2} \\ & {\text{LogK}}_{\text{Fluorite}} = - 10, 60\left( {{\text{Phreeqc}}.{\text{v}}. 3,{\text{ Parkhurst and Appelo 2}}0 1 3} \right). \\ \end{aligned}$$

Note that thermodynamic studies indicate that F concentration in waters depends upon the state of equilibrium with regards to fluorite (Travi 1993). If water is undersaturated with respect to fluorite, the F content in the water therefore reflects its concentration in leaching rocks. Whereas if the water is saturated with respect to fluorite, the concentration of F would be lower and is governed by the solubility constant value of fluorite dissolution (KFluorite). In the studied samples, the precipitation of calcite and dolomite increase F activity when pH is somewhat alkaline that decreases consequently Ca2+ in solution. Consequently, Fluorite is leached together with gypsum and anhydrite leading to compensate Ca2+ as suggested by (Subba Rao et al. 2013). The precipitation of calcite and dolomite enhance dissolution Ca-bearing minerals such as fluorite that consequently increase fluoride concentration in water. In addition, the chemical activity evolution of fluoride is not only depending on the equilibrium of water with respect to carbonate minerals (Fig. 7). For instance, the speciation of fluoride in the studied water shows that the total concentration of fluoride reveals a dominance of F associated with MgF+ and NaF complexes. We denote low correlation ration between Log10MgF+ and pH (r = 0.33) but more significant than that between Log10F and pH. To sum up, these results indicate the influence of carbonate (calcic and magnesian) on calcium origin reflecting the geological source of fluoride.

Fig. 7
figure 7

Speciation of fluoride species and their evolution regarding pH in the studied water

Spatial distribution using indicator kriging

  1. (a)

    Indicator transformation and variography

Indicator kriging (IK) is highly recommended to study the fluoride distribution in CT aquifer than other interpolation methods that require a large database, good spatial distribution and some precondition of the distribution of data (e.g. ordinary kriging, Jang et al. 2008). Note that IK is largely used to carry out probability maps for pollution elements such as nitrates and other elements (e.g. Bettahar et al. 2010; Arslan 2012). It is also used to map the spatial distribution of health risk for metallic elements in aquifers (e.g. Belkhiri et al. 2017) where authors emphasize the use of IK to highlight risk probability regarding selected thresholds. For our investigation; we suggest a mapping for fluoride distribution using a fluoride threshold of 2.1 m/l. This selected value represents the median value of fluoride content in CT aquifer. Note that the median is commonly used to set the threshold by many authors (e.g. Isaaks and Srivastava 1989; Badel et al. 2011; Mohammadpour et al. 2019). This selected threshold value for this study represents a critical content of fluoride in drinking water in which dental and skeletal fluorosis may substantially accelerate.

The transformation of fluoride data to indicator values according to the median threshold (Me = 2.1 mg/l) is performed. The Omnidirectional experimental variogram of the new indicator value for the selected threshold is calculated and fitted using Variowin 2.1 Software that helps in choosing the best fit found regarding goodness parameters. The PREVAR2D. VARIO2D and MODEL are the main programs used in this variability study (Pannatier 1996). Experimental variogram was fitted by spherical model with the following parameters (range = 1794 m; sill = 0.168; nugget effect = 0.111) (Fig. 8). Nugget-sill ratio for the selected threshold is 0.66 indicating strong spatial dependence (Cambardella et al. 1994; Belkhiri et al. 2017).

Fig. 8
figure 8

Omnidirectional experimental and theoretical variogram for indicator values of fluoride (2.1 mg/l)

  1. (b)

    Fluoride probability map

Using variography results, the probability map for not exceeding the selected threshold of fluoride is generated by IK procedure. We denote that the performance of this estimation method was examined by several cross-validation parameters that are given with the obtained map together with IK variance (Fig. 9). These parameters are much lower such as ME and RMSE which are found close to zero (ME = 0.0546; RMSE = 0.2). They show an accuracy of prediction as well as indicating the excellent quality of variogram fitting and good quality of estimation (Sun et al. 2009; Arslan 2012). The spatial distribution map using IK results was performed to generate a probability map of not exceeding the selected threshold, and is presented in two dimensions showing a local estimation of square mesh gridding of (250 m * 250 m). Computed probabilities of not exceeding fluoride threshold (2.1 mg/l) were grouped into four classes: (0–0.25; 0.25–0.5; 0.5–0.75 and 0.75–1) allowing the classification of fluoride risk in investigated aquifer (Fig. 9a). The variance of IK estimation is lower showing values between zero and 0.2 (Fig. 9b). The mapping of these results shows a strong probability (≥ 0.75) of not exceeding the selected threshold value of fluoride in a limited area to the southwest part of Hassi Messaoud city representing only 10% of the investigated area. Moderate to strong probability (0.25–0.75) is observed in the central part of the study area, and low probability (0–0.25) is located in the northeastern part of Hassi Messaoud CT aquifer. In sum, moderate and low probability of not exceeding 2.1 mg/l represents 90% of the studied aquifer. Therefore a defluoridation of groundwater is highly recommended for drinking water. This map can further be used in the management of water supply to reduce the fluoride excess in drinking water on the one hand and helps decision-making for selecting the ideal position to drill new wells for water supply on the other one hand.

Fig. 9
figure 9

a Spatial variability of the probability of not exceeding fluoride threshold (2.1 m/l) and cross-validation parameters in the studied aquifer, b map of IK variance showing lower values

Conclusion

This study discusses fluoride-bearing groundwater in an arid region (Hassi Messaoud, southern Algeria) where the daily intake of water is important especially in the summer period. The hydrochemical investigation was conducted to identify the chemical characteristics of Complex Terminal groundwater and fluoride origin, whereas its spatial distribution in the aquifer has been established based on a nonlinear geostatistical method of indicator kriging (IK). The main conclusions of this piece of work are the following:

  1. (1)

    Fluoride concentration in the studied aquifer from the Hassi Messaoud Complex Terminal aquifer is relatively high and is largely exceeding the OMS standards for drinking water (from 1.6 to 2.9 mg/l; average = 2.1 ± 0.4 mg/l).

  2. (2)

    PCA helps in identifying chemical associations of investigated water indicating high mineralization of groundwater that is originated mainly by leaching of evaporite minerals.

  3. (3)

    Saturation indices (SI) of main minerals show that waters are undersaturated with respect to evaporite minerals (anhydrite, gypsum and halite) indicating their dissolution, whereas SI of carbonates minerals (calcite and dolomite) are saturated to oversaturated reflecting their precipitation. The saturation of water regarding carbonate minerals in slightly alkaline pH enhances the dissolution of fluorite minerals to compensate Ca2+ in water.

  4. (4)

    The use of the nonlinear geostatistical method of IK allows computing a probability map of not exceeding a selected threshold in which the median value of fluoride content (2.1 mg/l) was taken as a threshold for this study. The method consists in transforming fluoride data into a binary coding (0.1) according to the selected threshold.

  5. (5)

    IK estimation was examined by cross-validation showing a high performance of the interpolation process and good quality of variogram fitting. IK estimation in square mesh gridding of (250 m * 250 m) indicates a low probability of not exceeding (2.1 mg/l) in the whole studied aquifer excepting a limited zone which represents only 10% of the studied aquifer. The obtained map can provide help for decision-making in water management and for selecting the ideal position to drill new wells for supplying drinking water.