Introduction

Groundwater is considered as an indispensable water source for potable, agricultural, and industrial purposes in arid and semi-arid areas. The increased population and industrialization over the recent years have intensified the water extraction from aquifers, deteriorating the quality of groundwater resources (Amer et al. 2012). Groundwater resources in arid and semi-arid areas are typically located in the vicinity of salt domes. Dissolution of minerals from the salt domes into the groundwater can significantly decrease the quality of these limited freshwater (Islam et al. 2017). Hence, monitoring the groundwater quality variations in both spatial and temporal scales is of great importance in water-scarce regions.

In recent years, various data-driven models including artificial neural networks (ANNs), adaptive-network-based fuzzy inference system (ANFIS), support vector regressions (SVRs), locally weighted projection regressions (LWPRs), and relevance vector machines (LVMs) have been employed to simulate the groundwater quality (Bieroza et al. 2018; Diamantini et al. 2018; Rowles et al. 2018; Todorov et al. 2018; Villa-Achupallas et al. 2018; Wu et al. 2018). Among them, ANFIS and ANN systems have been proven to be promising in water resources quality modeling (Cordoba et al. 2014; Gazzaz et al. 2012; Lallahem and Hani 2017; Nadiri et al. 2017; Najah et al. 2013; Sarkar and Pandey 2015). Kuo et al. (2004) used three different ANN systems to model the groundwater quality variations in Taiwan. Their results indicated that including the data set’s maximum and minimum values in the training process could increase the forecast accuracy of the model. Khalil et al. (2005) used ANN, SVM, LWPR, and RVM systems to predict groundwater quality in Sumas-Blaine aquifer, north-western Washington State. They reported that the learning algorithm in RVM, SVM, and LWPR could filter the noise from the inputs, whereas insufficient training of ANNs could lead to overfitting. Khashei-Siuki and Sarbazi (2015) used ANFIS, ANN, inverse distance weighted (IDW), kriging, and co-kriging geostatistical models to predict groundwater quality in Mashhad plain, Iran. They reported that compared with other methods, the ANN system could more accurately predict the spatial groundwater quality variations.

Rahimi and Mokarram (2012) used a linear fuzzy method coupled with GIS to generate fuzzy groundwater quality maps for an aquifer in southwestern Iran. Their results indicated that the coupled system could accurately map the aquifer’s spatial water quality variations. Mokarram and Sathyamoorthy (2016) investigated the quality of groundwaters in western Iran, based on both concentrations of inorganic constituents and landform classes. Their results indicated that a trained fuzzy system based on the analytic hierarchy process (AHP) could successfully predict water quality of water resources based on the study site’s digital elevation model (DEM).

Discrete water quality measurements might follow a specific statistical distribution system. Hence, an appropriate uncertainty model that best suits the system’s statistical distribution should be used for uncertainty manage purposes. Non-definite model components, stemmed from either the nature of the data or model principles, can decrease the reliability of the model’s outputs, increasing the uncertainty in the decision-making process. Different methods including the Bayesian theory (Gershman and Blei 2012), fuzzy theory (Zadeh 1965), and Dempster–Shafer (Shafer 1976) have been employed to incorporate the uncertainty in the decision-making process. Fuzzy systems are able to consider different principle and categorizations in the decision-making process (Venkatramanan et al. 2015).

The Dempster–Shafer theory (DST) is particularly of interest as it can consider both observational and epistemological uncertainties. The mathematical theory of evidence was first introduced by Dempster in 1976 (Dempster 1968; Dempster 2008) and it was developed by Shafer in 1976 (Shafer 1976). This theory is the generalized form of the Bayesian theory that simultaneously considers both uncertainty and inaccuracy in the system (Chaabane et al. 2008). This theory is of significant importance as it is capable of considering conditions or events with different probabilities. This theory uses a probability interval, ranging from the lowest probability, known as belief, to the highest probability (plausibility) for considering uncertainty in a system. The DST can be used as a tool to analyze uncertainty in the imprecise probability theory (Helton 1997). The theory of interval statistical models (Cumming and Fidler 2005) and the interval probability theory (Dempster 2008) are two methods providing upper and lower probability bounds for imprecise probabilities. Only a few studies have used the DST in the areas of water resources and groundwater management. Neshat and Pradhan (2015) used the DST and GIS for risk assessment of groundwater pollution. They were able to present groundwater pollution risk maps based on the DST. Rahmati and Melesse (2016) also used the DST for predicting groundwater quality in Khuzestan province, Iran. The results showed that the DST model is suitable for the determination of groundwater quality.

Extensive fieldwork and the high cost required for regular water quality monitoring of large aquifers can limit the availability of high-quality data from the entire extents of the aquifer. Hence, the use of flexible methods based on DST or fuzzy theory can substantially befit the groundwater management practices to handle the uncertainties associated with non-inclusive, patchy but still precious water quality data, and to reliably estimate the groundwater quality of the entire area.

However, the use of differences in principles and methodologies used in Bayesian or Dempster–Shafer or fuzzy theories might result in different water quality predictions and thus different water quality maps, particularly when predictions with higher confidence levels are demanded.

Only a few studies have attempted to characterize the differences in water quality maps obtained from different theories. The current study compares the DST and fuzzy AHP methods for predicting water quality in Firoozabad aquifer, southern Iran. A prolonged drought in this region, combined with over-extraction from groundwater wells, and salinization from different geological units including salt dome have significantly influenced the groundwater quality of this aquifer. In addition, different geological formations including salt domes have significantly influenced the groundwater quality of this aquifer. This study compares the water quality maps with different confidence levels generated based on the DST and fuzzy AHP methods to investigate the effects of different geological units in the region on salinity and overall water quality index of the aquifer.

To date, no specific functional toolbox has been developed for incorporating DST in GIS and remote sensing packages. Hence, this study employed the imprecise probability propagation (IPP) Toolbox in MATLAB R2017a for applying the DST to raster image data sets.

Methodology

Study site

This study focuses on Firoozabad aquifer with a surface area of 566.25 km2, located in a semi-arid climate (between 28° 36′–28° 57′ N and 52° 16′–52° 46′ E) in southern Fars Province, Iran (Fig. 1). This aquifer is an important freshwater source for drinking water and agriculture applications in the region.

Fig. 1
figure 1

Location and digital elevation model of Firozabad aquifer (study area), and location of the salt dome in the south of the aquifer (source: http://earthexplorer.usgs.gov)

The lowest and highest elevations in the study site are 1134 m and 2885 m, respectively. The water quality of this aquifer is influenced by salt domes covering an area of 2.43 km2 in the south and east of aquifer (Fig. 2a). These domes with an average height of 220 m (1320 m above sea level) are results of Hormuz series evapotranspiration during late-Precambrian to the middle-Cambrian period which are now exposed to the Earth surface. Saline springs can also be found in the vicinity of the salt dome. Figure 2a shows different geological formations in the study area. Table 1 summarizes the surface area of different landforms in Firoozabad aquifer. As shown in Fig. 2a and Table 1, the quaternary (alluvial terraces) and OMas (Asmari) geological units with areas of 281.62 km2 and 119.82 km2, respectively cover a majority of surface area of the study site.

Fig. 2
figure 2

a Geological formations in the study area and b location of monitoring wells used for water quality analysis

Table 1 Surface area of different landforms in Firoozabad aquifer

In order to evaluate the groundwater quality over the study period (23 September 2017–21 March 2017), water samples from 150 different monitoring wells (Fig. 2b) were obtained in a monthly manner and were subjected to water quality analysis to measure Cl, Na+, total dissolved solids (TDS), electrical conductivity (EC), Ca2+, Mg2+, SO42−, and Th concentrations in samples (Fars Regional Water Authority, 2016). Three samples were collected from each sampling point, and the average of the three concentrations for each water quality constituent was reported.

Geostatistical analyses

Interpolations for different water quality data, i.e., Cl, Na+, TDS, EC, Ca2+, Mg2+, SO42−, and Th concentrations, were performed based on the ordinary Kriging method using ArcGIS v.10.5 (ESRI 2018), which subsequently were used as the input data for DST and fuzzy-AHP methods. The ordinary kriging is a promising interpolation technique for data sets with special autocorrelation or directional bias. The ordinary kinging method interpolates the sample values in unmeasured locations based on the influence of surrounding measured values. In the kriging method, the influence of surrounding measured values can be determined based on the semivariance of data’s spatial distribution. The semivariance (γ) for water quality parameter Z measured at different locations can be determined as (Oliver and Webster 1990):

$$ \gamma (h)=\frac{1}{2n}{\sum}_{i=1}^n{\left[Z\left({x}_i\right)-Z\left({x}_i+h\right)\right]}^2 $$
(1)

where h is the distance between the sampling locations of xi and xi + h, Z(xi) and Z(xi + h) are the measured values of variable Z at the corresponding locations, and n is the number of pairs of sample measurements.

Basic spatial parameters including nugget and partial sill for different semivariogram models including circular, spherical, exponential, and Gaussian were calculated. Nugget defines the micro-scale variability and measurement error of collected data points, whereas partial sill indicates the amount of variation in the data (López-Granados et al. 2002).

Preliminary evaluations showed that among different models (circular, spherical, exponential, and Gaussian models), the Gaussian model could best describe the distribution of data points; hence, semivariance model based on the Gaussian model was used for interpolation purposes.

Fuzzy analytic hierarchy process (AHP)

Membership functions (MFs) were used to prepare fuzzy maps for water quality parameters. In general, a fuzzy set of Z in B can be defined as:

$$ Z=\left\{b,\mu A(b)\ |\ b\in B\right\} $$
(2)

where μA(b) is called the membership function of b in B. The MF linearly maps each element of b to a membership value between 0 and 1, indicating the lowest and the highest degrees for a membership, respectively (Mokarram and Sathyamoorthy 2016; Shobha et al. 2013). Fuzzy maps for each input data in the study area were prepared based on the World Health Organization’s (WHO) drinking water quality standard (WHO, 2017). According to WHO, an elevated level for each water quality parameter indicates a decreased water quality (Shobha et al. 2013). The fuzzy rule equation for each parameter was defined as:

$$ \left\{\begin{array}{c}1,\kern0.5em x\le v\\ {}Q-x/Q-i\\ {}0,\kern0.5em x>Q\end{array}\right.\ v<x\le Q $$
(3)

In Eq. (3), Q was determined based on the permissible limit for each parameter in the WHO’s drinking water quality standard (WHO 2017). Table 2 summarizes the limits used based on WHO’s water quality standard for each water quality parameter.

Table 2 Fuzzy limits considered for each water quality parameter based on WHO’s drinking water quality standard

An analytical hierarchy process (AHP) was used in order to overly the fuzzy maps and subsequently preparing the overall drinking water quality index map (Fig. 3). AHP is a multi-criteria decision-making method based on a pair-wise comparison matrix (Saaty 1980). The matrix is called consistent if the transitivity (Eq. (4)) and reciprocity (Eq. (5)) rules are respected:

$$ {b}_{ij}={b}_{im}.{b}_{kj} $$
(4)
$$ {b}_{ij}=\frac{1}{b_{ji}} $$
(5)

where i, j, and m are all of the alternatives of the matrix.

Fig. 3
figure 3

Uncertainty in the belief space in Dempster–Shafer theory

In a consistent matrix (Eq. (5)), all the comparisons bij obey the equality aij = pi/pj, where pi is the priority of the alternative i. A detailed description of the fuzzy AHP method can be found in Bellman and Zadeh (1970).

Dempster–Shafer

The detection framework

Dempster-Shafer theory (DST) was used to prepare water quality maps with different confidence levels. The DST is based on the beliefs that are concluded from evidence such that the belief structure of the control theory is related to the classical probability model (Shafer 1976). Suppose, θ is a finite set of elements, where an element can be a hypothesis, an objective, or a system status. θ set is called the detection framework that can be determined by Ω(θ):

$$ \theta =\left\{a,b,c\right\}\ and\ \varOmega \left(\theta \right)=\left\{\phi \left\{a\right\},\left\{b\right\},\left\{c\right\},\left\{a,b\right\},\left\{a,c\right\},\left\{b,c\right\},\left\{a,b,c\right\}\right\} $$
(6)

Φ is an empty set that denotes the perfect status of the system. A = {a, b} is the subset of θ, indicating A ⊂ θ. Therefore, A presents a system malfunction in a or b, and θ represents the system malfunction in a, b, or c (Shafer 1976).

Mass function, focal elements, and core elements

In order to show the reliability of any subset of the decision-making framework, the mass function is defined. The mass function is shown by m and is defined as Eqs. (7), (8), and (9) (Shafer 1976):

$$ m:\Omega \left(\theta \right)\to \left[0,1\right] $$
(7)
$$ \Omega \left(\theta \right)=0 $$
(8)
$$ {\sum}_{A\subset \Omega \left(\theta \right)}m(A)=1 $$
(9)

where m mass function is called a function of the basic probability assignment (BPA).

Equation (8) means that the scope of this function is on the entire set of the detection framework that is (Ω2) and its range is [0 1]. The function m(A) represents the proportion of the share owned by of the set A from all the relevant and available evidence and support for the claim about a particular element of θ.

In evaluating a condition in which the system suffers from a defect, m(A) can be represented as a degree of belief that has been obtained as a result of the observations related to a particular defect. Different evidence and information create various degrees of belief with respect to the manifested defect. Each subset A from θ is called a focal element such that m(A) > 0. Moreover, c is defined as a core element of the mass function in θ as represented in Eq. (10) (Shafer 1976).

$$ c={U}_{m(A)\ne 0}A $$
(10)

The belief function is defined as Eq. (11):

$$ Bel(A)={\sum}_{B\subset A}m(B)\kern0.24em \mathrm{and}\kern0.24em Bel:\Omega \left(\theta \right)\to \left[0,1\right] $$
(11)

The plausibility function is determined as Eq. (12):

$$ Pl(A)=1- Bel\left(\overline{A}\right)={\sum}_{B\cap A\ne \Phi}m(B)\kern0.24em \mathrm{where}\kern0.24em Pl:\Omega \left(\theta \right)\to \left[0,1\right] $$
(12)

where Bel(A) function measures the total amount of probability that should be among the set A elements. As shown in Fig. 3, Bel(A) denotes the significant certainty of the belief A, or in other words, it shows the lower limit for probability A. The function PI(A) measures the maximum amount of probability that can be distributed among the elements of A. PI(A) describes the degree of the general belief related to A and is regarded as the upper limit function for probability A (Shafer 1976). Belief Interval [Bel(A) PI(A)] reflects the uncertainty belief space and the space size PI(A) − Bel(A) describes the unbeknownst related to set A (Fig. 3). Table 3 summarizes the meaning of different belief intervals in DST (Shafer 1976).

Table 3 Meaning of different belief intervals in the DST model

The plausibility function is related to the belief function by a mediator function called doubt function as shown in Eq. (13) (Shafer 1976):

$$ \left\{\begin{array}{c} Pl(A)=1- Bel\left(\overline{A}\right)\\ {} Pl(A)=1- Doubt(A)\end{array}\right. $$
(13)

where \( \overline{A} \) is the complement of A and Doubt(A) represent the doubt function for set A.

The methodology used in this research is summarized in Fig. 3.

To date, there is no specific functional toolbox for implementing the DST in GIS and remote sensing packages. In this study, a model based on DST was implemented using imprecise probability propagation (IPP) Toolbox in MATLAB R2017a. As the design of this toolbox is based on linear matrices, some modifications were performed to be able to process raster image data sets. As all input image data needed to be of the same size, hence images with a pixel size of 1 km × 1 km were prepared. In the next step, in order to increase a high computation speed, windows of 4 × 4 pixels were selected and subjected to the DST model analysis. This loop was continued until the DST model was implemented on entire pixels.

Model comparison

In order to evaluate the effects of salt domes on aquifer’s water, groundwater quality maps based on both fuzzy AHP and DST models were prepared. In both models, the overall water quality index for models was defined based on WHO’s drinking water quality standard (Table 2). Figure 4 represents the flowchart used for comparing the water quality maps generated by the fuzzy AHP and DST models. The DST model could generate water quality maps with three confidence levels of 99.5%, 99%, and 95%.

Fig. 4
figure 4

Flowchart used for generating the groundwater quality maps based on fuzzy-AHP and DST methods

Water quality data from 150 monitoring wells were used in this study. The data from 110 monitoring wells were used for generating the water quality maps based on both the DST and fuzzy AHP methods (train point in Fig. 2b). Water quality data from 40 different monitoring wells (test point in Fig. 2b) were used to evaluate the water quality maps by the two models.

Root mean squared error (RMSE) was used to evaluate the performance of different interpolation models. RMSE can be defined as:

$$ \mathrm{RMSE}=\sqrt{\frac{1}{N}{\sum}_{i=1}^N{\left[S\left({x}_i\right)-\widehat{S}\left({x}_i\right)\right]}^2} $$
(14)

where \( \widehat{S} \)(xi) is the estimated value, S(xi) is the observed value, and N is the number of values in the dataset.

Results and discussion

Geostatistical analysis

Table 4 summarizes the range of concentrations observed for different water quality parameters in 150 monitoring wells during September 23, 2017–March 21, 2017.

Table 4 Concentrations of different quality parameters obtained from 150 monitoring wells during for the study period

High spatial variations were observed for most of water quality parameters. This suggests the effects of specific geological units on water quality of the aquifer. The ordinary kriging interpolation method was used to evaluate the spatial variations of water quality parameters across the aquifer. Table 5 represents the RMSE between the measured and fitted semivariogram models in 40 test points for each water parameter. Among different theoretical models, the Gaussian model was found as the best fit. Table 5 also represents the calculated semivariogram parameters for each water parameter. The nugget values indicate the magnitude of spatial variability (including measurement errors) in data at scales smaller than sampling distances, and the sill values reflect to the total variance observed in data. The difference between the sill and the nugget (i.e., partial sill) for each parameter indicates the amount of variance that could be explained by the observations obtained from the sampling points with specific distances from each other (Karl and Maurer 2010; Oliver and Webster 1990). The nugget to sill ratio indicates the fraction of the total variation that cannot be described by observed spatial dependence of the variable (Karl and Maurer 2010). Figure 5 shows an example Gaussian semivariogram model fitted to the observed TDS data.

Table 5 Parameters in different semivariogram models employed for generating interpolation maps using the kriging method for different water quality constituents
Fig. 5
figure 5

Example semivariogram with a Gaussian model fitted to the TDS observations in the study area

As shown in Table 5, the Gaussian model typically exhibited smaller nugget to sill rations and yielded relatively lower RMSE values for all water quality constituents compared with other models. This means that the Gaussian exhibited higher spatial correlations at longer distances and less interpolation errors for all water quality constituents. Therefore, the ordinary kriging method based on a Gaussian semivariogram model was used for analyzing the spatial variations of different water quality constituents in the study area (Fig. 6).

Fig. 6
figure 6

Maps of different water quality constituents including EC (a), Mg2+ (b), Na (c), Cl (d), SO42− (e), TDS (f), Th (g), and Ca2+ (h). The maps were generated by interpolating the measured values across the aquifer based on the ordinary kriging interpolation method with a Gaussian semivariance model

Figures 6a and 4b show the distribution of EC (raging between 0.39 and 1.7 ds m−1) and Cl (ranging between 25.12 and 437 mg L−1) in the study area, respectively. The lowest concentrations of these water quality constituents occurred in northern and northwestern areas. Concentration Mg2+ ranged between 3.2 and 569 mg L−1 where the highest and the lowest concentrations occurred in southern and western parts of the aquifer, respectively (Fig. 6c). Na+ in the entire study area except for small parts of western parts was almost less than 10 mg L−1 (Fig. 6d). The concentration of SO42− was between 0.12 and 589 mg L−1 with the lowest concentrations in western and northern areas (Fig. 6e). Nearly similar spatial variations were for TDS and Th concentrations for which the lowest concentrations occurred in western areas (Fig. 6f and g). The spatial variation for Ca2+ was slightly different as its highest concentrations (more than 500 mg L−1) occurred in northern and northeastern parts (Fig. 6h).

Comparing the overall water quality index based on fuzzy-AHP and DST methods

Equation (3) was used to define MF of between [0, 1] for each water quality parameter. In each MF, the very low and very high concentrations for each constituent were close to 1 and 0, respectively. Hence, for Ca2+, Cl, Mg2+, Th, Na+, EC, SO42−, and TDS concentrations of greater than 200, 200, 150, 500, 200, 3000, 200, and 500 mg L−1, respectively, MFs = 0 were considered. Figure 7 shows the weight assigned to each water quality parameter for generating the overall water quality index based on the fuzzy AHP method (Mokarram and Sathyamoorthy 2016).

Fig. 7
figure 7

Weights assigned to each water quality constituent for creating the overall water quality index in the fuzzy AHP model

Figure 8 represents the overall water quality map generated based on the fuzzy AHP system. The map shows the average index during for the period of September 23, 2017 to March 21, 2017.

Fig. 8
figure 8

The average water quality index map generated based on the fuzzy AHP method for the period of September 23, 2017–March 21, 2017. A low water quality index (red) indicates a poor groundwater quality and a high water quality index (blue) indicates a high groundwater quality

As seen, the fuzzy-AHP water quality index ranged between 0 and 0.94. In Fig. 8 the blue color (high water quality index) indicates the areas with a high groundwater quality; the red color (low water quality index) indicates areas with a low groundwater quality. As seen, central areas towards the southern parts of the aquifer showed the lowest drinking water quality index, whereas small areas in the eastern, western, and northern parts showed a high water quality index.

Figure 9 shows the water quality maps with different confidence levels generated based on the DST model.

Fig. 9
figure 9

The overall water quality index based on the Dempster–Shafer model with 99.5% (a) confidence level, 99% confidence level (b), and 95% confidence level (c) for the period of September 23, 2017–March 21, 2017

Generally five water classes of poor, low, moderate, good, and high with the overall water quality index ranges of (0, 0.2), (0.2, 0.4), (0.4, 0.6), (0.6, 0.8), and (0.8, 1), respectively, were defined in both DST and fuzzy AHP models (Table 6). Table 6 represents the surface area of the study site covered by each water quality class based on the fuzzy AHP and DST models.

Table 6 The percentage of the area of each class of water quality for each method

Based on fuzzy AHP model, the majority of the study area (34%) was ranked with as moderate (0.4–0.6) water quality. In Dempster–Shafer models with different confidence levels, the majority of the study area was ranked as moderate or low (Table 6). Both fuzzy AHP and DST models recognized the western areas of the aquifer with a quaternary soil type as the regions with the highest water quality index.

In general, the results of the DST model showed that (Fig. 9) by decreasing the confidence level, the areas with higher water quality indices tended to increase. In comparison, the fuzzy model had a faster computation speed. However, the DST model was capable of generating water quality maps with different confidence levels. Therefore, the DST model can be used for generating management and planning water quality zoning maps with preferred confidence levels based on the economic conditions and importance of the study area.

Formation type could influence the water quality of an aquifer. Figure 2a was used to evaluate the effects of geological formation type in the region on water quality of the aquifer. As seen from Fig. 2a, quaternary sediments cover a majority of the study area. The coarse-grained sediments have resulted in a high porosity in and thus a high permeability in these areas (Panno and Hackley 2010). Hence, relatively higher water quality indices could be observed in areas with quaternary sediments. In contrast, the areas with an Asmari (OMas) formation composed of limestone, salt domes, dolomitic limestone, and marl (Fig. 2a) contain elevated levels of calcium, sodium, magnesium, and chloride ions (Panno and Hackley 2010). Razak formation composed of mainly conglomerates, marls, and shale with interactions of limestone (Ghazavi and Emami 2017) can also contribute to the elevated concentrations of minerals in central and southern areas. Hence, a poor water quality index could be observed in these areas. In areas with a Bangestan formation, water quality is better than the Asmari formation, which could be due to less salt content of the Bangestan (Panno and Hackley 2010).

Comparison of the geological maps and the water quality maps prepared based on the DST and fuzzy-AHP models indicates that in areas with Kb (Bangestan) and Mm (Mishan) formations, both methods yielded moderate water quality indices. In southeastern, and central towards the southern areas, the OMas formation could affect the quality of groundwater, and therefore both fuzzy-AHP and DST models showed low water quality indices in these regions. The elevation of OMas formation (Asmari) in the eastern and northeastern areas is about 1625 m higher than the central areas, resulting in a much higher depth of groundwater table in these areas. This would reduce the influence of OMas formation on groundwater quality in eastern and northeastern areas compared with central and southern areas as the steep slope of the ground results in a decreased percolation of surface runoff and soil layers beneath the OMas and could also decrease the percolation rate of minerals to the groundwater.

In order to compare the Fuzzy-AHP and DST models, 14 monitoring wells among the 40 test wells (test points in Fig. 2b) were randomly selected and the predicted water quality indices by fuzzy-AHP and DST models at this points were compared with measured EC values. The EC values and water quality indices were determined based on the two models which are shown in Fig. 10 and Table 7.

Fig. 10
figure 10

EC values and their corresponding water quality indices determined based on fuzzy-AHP and Dempster–Shafer theory with confidence levels of 95%, 99%, and 99.5% in 14 randomly select groundwater quality stations

Table 7 Comparison of water quality index in DST and fuzzy AHP models based on EC values in randomly selected 14 observation points

According to Fig. 10 and Table 5, by increasing EC water quality parameters, both methods showed decreased water quality indices. Moreover, a comparison of the two methods showed that the DST model was better for water quality and had better integration. Also, a comparison of EC values in Fig. 10 indicates that the results of the DST model were more accurate than the fuzzy-AHP method in determining the water quality index.

Conclusions

In this study, fuzzy AHP and Dempster Shafer theory were used to determine the drinking water quality maps for an aquifer. The results of fuzzy-AHP method showed that most parts of the study areas had a moderate water quality index, and eastern and western parts had the best drinking water quality index. Compared with the fuzzy AHP model, the Dempster Shafer model had the capability of generating maps with various confidence levels. However, in comparison with the Dempster–Shafer model, the Fuzzy AHP model had faster computation speed. The results of the DST model showed that by decreasing the confidence level, the areas with higher water quality indices increased. Comparison of the two models showed that the DST model could more effectively handle the uncertainty in the data and thus generated more accurate water quality maps. One of the advantages of the DST model is the capability of computing the outputs with different confidence levels. Therefore, this model can be used for generating water quality zoning maps with confidence levels that best suits the economic conditions and importance of the study area.