1 Introduction

Groundwater is defined as the main source of water demand in arid and semiarid areas. In comparison with surface water, the advantages of groundwater are having almost constant temperature, being less affected by drought, having better water quality and being tapped in when necessary, to mention but a few (Manap et al. 2013). Groundwater is always exploited when there is inadequate surface water. Nowadays, the use of groundwater is increasing rapidly. Therefore, the identification of a proper method for the management and prediction of groundwater at the national, regional and local levels is necessary (Vaux 2011; Le Page et al. 2012). In Iran, as a semi-arid country of which two-third is desert land without any green pasture, the water demand has been satisfied using groundwater (Bastani et al. 2010; Mehrdadi 2010; Nosrati and Van Den Eeckhaut 2012). The life of more than 70% of the rural population in Iran, for drinking and domestic requirements, depends on groundwater (Rahmati et al. 2015). Thus, groundwater potential mapping (GPM) in Iran is really necessary. Complex and costly instruments and methodology are required to gain information regarding groundwater resources. GPM is useful for the management of spatial groundwater resources, especially in regions with little data. The probability of groundwater occurrence in an area is defined as the groundwater potential (Jha et al. 2010). Traditional methods for groundwater potential identification and exploitation were accomplished through drilling. Moreover, geophysical methods involve more cost and time (Todd and Mays 1980; Roscoe 1990; Israil et al. 2006; Jha et al. 2010).

Nowadays, remote sensing and geographical information systems (GIS) are frequently used in studies related to water resources. Thus, many researchers have performed studies by considering some conditioning factors such as topography and geomorphology; drainage pattern; lineaments such as fault, lithology and geology and soil (Prasad et al. 2008; Chowdhury et al. 2009; Jasmin and Mallikarjuna 2011; Deepika et al. 2013; Chen et al. 2019). GIS is a useful tool with some advantages to manage huge amounts of data (Khosravi et al. 2016). Many studies have been carried out using the frequency ratio (FR) (Oh et al. 2011; Ozdemir 2011a; Manap et al. 2014; Davoodi et al. 2015; Naghibi et al. 2015), weights of evidence (Corsini et al. 2009; Lee et al. 2012; Pourtaghi and Pourghasemi 2014), logistic regression (Ozdemir 2011b), analytical hierarchy process (Awawdeh et al. 2014; Kaliraj et al. 2014) and evidential belief function (EBF) (Nampak et al. 2014). The results of all studies indicated that data-driven models have reasonable prediction ability and can be used in other areas to solve problems. In data-driven models, expert opinion does not have any impact on the results. Therefore, the results are more reliable.

The main aims of the current study are as follows: (i) spatial prediction of regions with high groundwater potential and (ii) identification and comparison of prediction abilities associated with the FR, Shannon entropy (SE) and EBF models in GPM. Evaluation of the GPM would be practical to the decision makers in groundwater management and restoration, e.g., the Regional Water Authority of Urmia (RWAU), and in determining suitable locations for future drilling of productive wells. However, such studies have not been carried out in the entire province up to now. Thus, the present research is a leading work in this research area and plays a significant role in quick groundwater assessment.

2 Study area description

The Sero plain is located in the west Azerbaijan province in Iran. The plain is considered as one of the most important ones in the province upon which people are dependent for groundwater supply for both drinking and agricultural activities. It lies between \(44^{\circ }37'30''\)\(44^{\circ }43'30''\hbox {E}\) and \(37^{\circ }41'\)\(37^{\circ }47'\hbox {N}\), and covers about \(52\hbox { km}^{2}\) (see figure 1). Topographically, the altitude varies between 1490 and 1697 m above the sea level and the highest and lowest slopes are 26 and 0, respectively. Based on the reports of the RWAU, the mean annual precipitation in the region is about 440 mm. The most dominant soil is inceptisol soil and agricultural land use covers about 94% of the study area. Geologically, the Quaternary and Permian cover about 58% and 18% of the study area, respectively. Lithologically, most of the area is covered by K2l (light blue to pink, thick-bedded to massive, piagic limestone with calcite veins), OPm (ophiolite melange of ultrabasic rocks, serpentinite, meta gabbro, berreciated volcanic, grey to red shale and plagic limestone), Pd (sandstone and conglomerate), CM (tectonic mixture of serpentinised ultramafic rocks, pillow lavas, gabbros, radiolarite and pelagic limestone), CMV (tectonic mixture mostly composed of pillow lava and sediments) and K (alternation of sandstone, shale and limestone at base followed by well grey limestone at the top (probably cretaceous). Also, a fence diagram of the study areas is shown in figure 2. Moreover, the distribution of major ions in the groundwater is as follows: \(\hbox {Ca}^{2+}> \hbox {Mg}^{2+}>\hbox {Na}^{+} > \hbox {K}^{+}\) and \(\hbox {HCO}^{3-}>\hbox {SO}_{4}^{2-} >\hbox {Cl}^{-}\).

Fig. 1
figure 1

Well location map with the DEM of the Sero plain, Iran.

Fig. 2
figure 2

Fence diagram of the study area.

3 Methodology

3.1 Groundwater well inventory

Groundwater well inventory mapping as the first step in any spatial prediction modelling is necessary to identify the correlation of the distribution of groundwater well locations with the conditioning factors. Groundwater with higher potential was identified through the predictions made for the best groundwater potential (Nampak et al. 2014). According to reports of UWRA and literature review, a threshold of \(11\hbox { m}^{3}\)/hr was considered for groundwater yield (Nampak et al. 2014). The groundwater yield was identified according to the actual pumping test. In the study area, only 75 well locations had groundwater yields \({>}11\hbox { m}^{3}\)/h, which were randomly divided into two groups of sizes of 70% (53 well locations), as the training dataset used to develop the model, and 30% (22 well locations), as the testing dataset used for model validation (Nampak et al. 2014; Khosravi et al. 2016). An extensive field survey was carried out to identify the locations of wells with a GPS hand hole.

3.2 Groundwater conditioning factors

In the present research, 10 conditioning factors, namely, ground slope or slope degree, aspect, altitude, curvature, distance from the fault and the river, lithology, land use, rainfall and soil media, were taken into consideration. The conditioning factor selection was through reviewing the literature and using the data available (Mukherjee 1996; Oh et al. 2011; Ozdemir 2011a; Nampak et al. 2014).

At first, the digital elevation model (DEM) for the study area was downloaded from the ASTER Global DEM (spatial resolution of 30 m). Using DEM, four input variables, namely, slope degree, slope aspect, altitude and curvature maps, were fabricated. Slope degree had a high impact on groundwater occurrence. This factor was created and classified into five categories, namely, 0–3.1, 3.2–5.7, 5.8–8.7, 8.8–12.8 and 12.9–26.7, using quantile scheme classification (figure 3a). The slope aspects were classified after preparation into nine classes comprising the flat, north, northeast, east, southeast, south, southwest, west and the northwest (figure 3b). The third conditioning factor was altitude, which, after preparation, was divided into five classes, namely, 1491–1548, 1548–1573, 1573–1599, 1599–1633 and 1633–1697 m, using the quantile method (figure 3c). The curvature map shows the topography of the earth’s surface and divides it into three classes, i.e., \(-\)0.05 or concave, from −0.05 to 0.05 or flat, and >0.05 or convex (figure 3d) (Pham et al. 2017).

Fig. 3
figure 3

Conditioning factors of groundwater occurrences: (a) altitude, (b) slope angle, (c) curvature, (d) aspect, (e) rainfall, (f) soil, (g) land-use, (h) geology, (i) distance from the fault and (j) distance from the river.

The fault in the study area was extracted from the fault of Iran, which was prepared using the Geological Survey of Iran (GSI) on a scale of 1:100,000. The factors of distance from the fault and the river were produced using the fault and the river of the study area in ArcGIS 10.2 software, and subsequently divided into five groups, namely, 0–100, 100–200, 200–500, 500–1000 and >1000 m (figure 3e and f) (Khosravi et al. 2018b).

Various lithologies have various infiltration rates; thus, lithology plays a crucial role in the identification of the groundwater potential occurrences (Pradhan 2009; Adiat et al. 2012; Nampak et al. 2014). This factor was prepared through the GSI. The formation ages of lithology were the Eocene, Middle-Eocene, Permian, Pre-Cambrian and the Quaternary (figure 3g). Land-use map was produced using supervised image classification techniques by Landsat 7 (ETM+), for which images were downloaded from the US Geological Survey (USGS). The images were divided into three classes, namely, irrigated land and agriculture, dry farming and moderate rangeland (figure 3h). The mean annual precipitation of four rain gauges in 20 yr was used to prepare the rainfall map which is a major conditioning factor in groundwater occurrences. The rainfall map was prepared using the Kriging method, due to its low RMSE, and divided into five classes, namely, 400–424, 425–446, 447–467, 468–488 and 489–510 mm (figure 3i). The characteristics of soil have a direct impact on water infiltration. Therefore, the soil map of the west Azerbaijan province with a scale of 1:50,000 was used in the analysis. The soil map was provided by UWRA, which was originally prepared by the Iranian Water Resources Department (IWRD). The dominant soil form in the area under study was inceptisols, followed by rock outcrop/inceptisols and rock outcrop/entisols (figure 3j).

3.3 Modelling of the groundwater potential assessment

Three models, namely, FR, SE and EBF, were applied for groundwater potential assessment and comparison at the Sero plain.

3.3.1 FR model

FR is one of the well-known and simple geospatial bivariate models for determining the probabilistic correlation between the dependent (groundwater) and independent (conditioning) factors. The FR method was previously utilised in various applications, such as flood susceptibility mapping (Tehrany et al. 2013; Khosravi et al. 2016, 2018b), groundwater qanat potential mapping (Pourtaghi and Pourghasemi 2014) and landslide assessment (Youssef et al. 2015). The most important benefit of the current model is its simple execution and easy-to-understand results (Gogu et al. 2001; Yalcin 2011). In the present study, the FR model was applied to evaluate the impact of each conditioning factor used for groundwater occurrences (Khosravi et al. 2018b). Equations (4) and (5) were used to calculate the FR model and the groundwater well potential index (GWPI), respectively (Khosravi et al. 2018b):

$$\begin{aligned} \hbox {FR}= & {} \frac{\big [N_{\mathrm{pix}} (SX_i )\Big /\sum \nolimits _{i=1}^m {SX_i } \big ]}{\big [N_{\mathrm{pix}} (X_j )\Big /\sum \nolimits _{j=1}^n {N_{\mathrm{pix}} (X_j )} \big ],} \end{aligned}$$
(1)
$$\begin{aligned} \hbox {FSI}= & {} \sum \limits _{j=1}^n {\hbox {FR}} , \end{aligned}$$
(2)

where \(N_{\mathrm{pix}}({ SX}_{i})\) is the amount of pixels with groundwater well within the factor variable X of class i, \(N_{\mathrm{pix}}(X_{j})\) denotes the amount of pixels within the factor variable \(X_{j}\), m represents the number of classes in the parameter variable \(X_{i}\) and n refers to the number of factors in the whole case study (Jaafari et al. 2014; Regmi et al. 2014).

3.3.2 Shannon entropy (SE) model

Overall, entropy shows the quantity of abnormality between events and results or decisions on various subjects under discussion (Wan 2009), and the entropy index is defined as the mean difference of the unit group to the whole system ratios (Theil 1972). SE, modified by the Boltzmann method, was used as the information theory (Pourghasemi et al. 2012). The SE model has been applied to the mapping of flood susceptibility (Khosravi et al. 2018a) and landslide susceptibility (Sharma et al. 2013) and has led to reasonable results. The following equations express the calculation of the weight of the incorporated information. \(V_j\) is the value of the parameter from the total figure (Bednarik et al. 2010), which is determined based on the following equation:

$$\begin{aligned} E_{ij} =\frac{\hbox {FR}}{\sum \nolimits _{j=1}^{M_j } {\hbox {FR}} }, \end{aligned}$$
(3)

where FR represents the frequency ratio and \(E_{ij}\) denotes the probability density:

$$\begin{aligned}&\displaystyle H_j =-\sum \limits _{i=1}^{M_j } {E_{ij} } \log _2 E_{ij} ,j=1,\ldots ,n, \end{aligned}$$
(4)
$$\begin{aligned}&\displaystyle H_{j\max } =\log _2 M_j ,M_{j-\mathrm{number~of~classes}} , \end{aligned}$$
(5)
$$\begin{aligned}&\displaystyle I_j =\left( {H_{j\max } -H_j /H_{j\max } } \right) ,\quad \nonumber \\&\displaystyle I=(0,1),\;j=1,\ldots , \end{aligned}$$
(6)
$$\begin{aligned}&\displaystyle V_j = I_j \hbox {FR}, \end{aligned}$$
(7)

where \(H_{j}\) and \(H_{j\mathrm{max}}\) denote the entropy values, \(I_{j}\) the information coefficient and M denotes the number of classes. Moreover, \(V_{j}\) represents the attained total weight value of the factor ranging between 0 and 1, where closer values to 1 indicate a higher disorder and imbalance (Khosravi et al. 2018a).

3.3.3 EBF model

The EBF model is developed based on the Dempster–Shafer theory of evidence. In the EBF model, evidential data estimation relates to a proposition (Shafer 1976; Dempster 2008). The EBF model consists of four parts, i.e., degrees of belief (Bel), disbelief (Dis), uncertainty (Unc) and plausibility (Pls), ranging between 0 and 1 (Carranza et al. 2005). The mapping of groundwater potential is performed based on the EBF model through the following equation (Shafer 1976; Dempster 2008):

$$\begin{aligned} {\uplambda }\left( {Tp} \right) E_{ij}= & {} N/D=\left[ {N\left( {L\cap E_{ij}} \right) /N\left( L \right) } \right] \nonumber \\&/\left[ N\left( {E_{ij}}\right) -N\left( {L\cap E_{ij}} \right) \right. \nonumber \\&\left. /\left( {N\left( A\right) -N\left( L \right) } \right) \right] , \end{aligned}$$
(8)
$$\begin{aligned} \hbox {Bel}= & {} {\uplambda }\left( {Tp} \right) E_{ij}/\left( {\sum \limits {\uplambda }\left( Tp \right) E_{ij}} \right) . \end{aligned}$$
(9)

In this equation, \(N(L\cap E_{ij})\) represents the number of groundwater well pixels in each class, N(L) the total number of groundwater wells, \(N(E_{ij})\) the number of pixels of each class, N(A) the total number of pixels and N and D represent the ratio of well location areas and the proportion of non-well areas, respectively. Similarly, Dis values are obtained by equations (10 and 11):

$$\begin{aligned} {\uplambda }\left( {T\bar{p} } \right) E_{ij}= & {} \frac{K}{H}\nonumber \\= & {} \left[ {\left( {N\left( L \right) -N\left( {L\cap E_{ij}} \right) } \right) / N\left( L \right) } \right] \nonumber \\&/[N\left( A \right) -N\left( L \right) -N\left( {E_{ij}} \right) \nonumber \\&+N\left( {L\cap E_{ij}} \right) /\left( {N\left( A \right) -N\left( L \right) } \right) ], \nonumber \\\end{aligned}$$
(10)
$$\begin{aligned} \hbox {Dis}= & {} {\uplambda }\left( {T\bar{p} } \right) E_{ij}/\left( {\sum \limits {\uplambda }\left( {T\bar{p} } \right) E_{ij}} \right) , \end{aligned}$$
(11)

where K denotes the non-occurring well locations ratio and H the non-occurring well areas proportion in the attributes of the class.

Uncertainty and plausibility are calculated as follows:

$$\begin{aligned}&\displaystyle \hbox {Unc}=(1-\hbox {Bel}-\hbox {Dis}), \end{aligned}$$
(12)
$$\begin{aligned}&\displaystyle \hbox {Pls}=(1-\hbox {Dis}). \end{aligned}$$
(13)

4 Results

4.1 Results of the FR model

In the FR model, a higher frequency ratio shows the significant spatial relationship of conditioning factors with the occurrences of groundwater (Tehrany et al. 2013); the values >1 show a strong relationship and those smaller than 1 represent a weak relationship. This model was utilised to identify the relationship between each class of conditioning factors and groundwater occurrences and the results are presented in table 1.

Table 1 Spatial relationship between well conditioning factors and well locations using the FR and SE models.

The results indicate that in the study area, a lower altitude has a higher impact on groundwater occurrence. Therefore, the classes of 1491–1548 and 1548–1573 m with FRs of 2.2 and 1, respectively, have the highest impacts on the occurrences of groundwater. Moreover, the slopes of 0–3.1 and 8.6–12.7 with FRs of 1.49 and 1.45, respectively, have the highest impacts on groundwater occurrences. Thus, the results show that there is no specific pattern for the relationship between groundwater occurrences and the slope factor. With regard to curvature, in the area under study, the flat curvatures (2.59) have a strong impact on groundwater occurrence, which is in accordance with the nature of groundwater occurrences. Therefore, in a flat area, the runoff infiltrates more than other areas do. In the case of slope aspects, the north has a high influence on groundwater occurrences (1.9) as it receives sunlight less than other slope aspects do; it is followed by the southwest (1.44) and the east (1.04). The result of the relationship between rainfall and groundwater by the FR model shows that a rainfall amount of 446–467 mm has a high impact on groundwater occurrences (1.28). In the case of soil type, the results of the FR model show that inceptisols have high impacts on groundwater occurrences, as the total wells are located in the inceptisols. It is worth mentioning that the area is mostly covered by the aforementioned soil. For land-use, results show that agricultural land-use has a high impact on groundwater occurrences (1.06), as the whole wells are located in the agricultural land and this land-use covers about 94% of the study area (the major crops are both irrigated- and dry farming wheat alongside with a School and Community Assistance for Recycling and Composting Education garden). For the conditioning factor of geology (age), the Permian age has a high impact on groundwater occurrences (1.72), as 52% of the wells are located in the Quaternary age. However, a large part of the area does not have enough FR. With regard to the distance from the fault, the FR results show that only the classes of >1000 m do not affect groundwater occurrences and the distances between 0 and 1000 m have a high impact on groundwater occurrences; the first class, i.e., 0–100 m (3.24), has the most significant effect on the occurrences of groundwater and the more the distance from fault, the lower will be the impact on groundwater occurrences. The results of distance from the fault show that the first class, i.e., 0–100 m, has the highest effect on the occurrences of groundwater (2.39), followed by the class of 200–500 m (1.17).

4.2 Results of the SE model

The SE model has been applied to determine the correlation between each conditioning factor and groundwater well location (table 1). The weights of the conditioning factors of plan aspect, slope angle, altitude and curvature are equal to 0.09, 0.16, 0.23 and 0.23, respectively. This shows that altitude and curvature, among all topographical factors, have the highest impact, and the next are slope angle and slope aspect. The weight of rainfall, as a hydrological factor, is 0.01. The estimated weights of SE for other conditioning factors are as follows: soil (0.35), land-use (0.48), geology (0.24), distance from the fault (0.17) and distance from the river (0.09). Generally, the most and the least important factors for groundwater occurrences in the study area are altitude, curvature and rainfall, respectively.

4.3 Results of the EBF model

The correlation results for the well locations and each conditioning factor were achieved through the EBF model (Bel, Dis, Unc and Pls) and are presented in table 2. Nampak et al. (2014) stated that an important weakness of the EBF model was that if there was no value for Bel in a given class, there would be no well occurrence in the same class.

Table 2 Spatial relationship between well conditioning factors and well locations using the EBF model.
Fig. 4
figure 4

Integrated EBF model parameter: (a) Bel, (b) Dis, (c) Unc and (d) Pls.

The spatial correlation between the location of the well and altitude with the EBF model demonstrates that the highest Bel values (0.515 and 0.226) belong to two classes of 1491–1548 m and 1458–1573 m, implying that the groundwater occurrence probability for these classes of altitude is higher than that for the other classes. The 1599–1633 m class has the lowest Bel (0). That is, there is no well location in this class of altitudes. In the case of slope degree, the slopes of 0–3.1 have the highest impact (0.346), followed by 8.6–12.7 (0.337). In contrast, the last slope class, i.e., 12.7–26.6, has the lowest impact on groundwater occurrences (\(\hbox {Bel} = 0\)). Regarding curvature, the flat regions have higher Bel (0.635) than other curvatures. In terms of slope aspect, the north aspect has the highest value of Bel (0.241) and the lowest Bel belongs to flat (0) areas. Results of the relationship between rainfall and groundwater well location show that the 446–467 mm class has the highest Bel (0.271). The highest (Bel) and lowest (Dis) probabilities of the occurrence of groundwater well are for the regions covered by inceptisols with values of 1 and 0, respectively. The highest Bel (1) belongs to the areas with agricultural land-use, and the dry-farming and moderate rangelands have the lowest Bel (0). In terms of geological (age) conditions, the highest (0.512) and the lowest (0) values of Bel belong to the Permian, and the Eocene and middle Eocene, respectively. Considering the distance from the fault, the highest Bel value as well as the lowest Dis value is in the distance between 0 and 100 m. It reveals that the probability of groundwater well occurrences decreases with the increase in the distance from the fault. In terms of the distance from the river, the highest Bel belongs to the 0–100 m class, i.e., the lower the distance from the river, the higher will be the Bel value and the more the impact on groundwater occurrences.

The integrated results of the EBF model are depicted in figure 4(a–d). The Bel (figure 4a) and Dis (figure 4b) maps have been compared with each other. The results of the comparison show high Bel values in regions with low Dis and vice versa. Hence, these two maps are in contrast to each other. Overall, high groundwater well potential occurrences belong to the areas with high Bel degrees and low Dis degrees. Moreover, low Unc values belong to the areas with high Bel degrees. As a result, the areas with high potential of groundwater occurrence have a low uncertainty. The Pls map results show that the values of Pls are high in regions with high Bel values and low Unc values. The results of the current research are in accordance with those of Nampak et al. (2014) and Pourghasemi and Beheshtirad (2015). According to Althuwaynee et al. (2014), the main benefit of the EBF mode is the modelling of the uncertainty degree (Dis) along with the predictive mapping of zones.

The calculated GPI indices were classified into five different classes of low, moderate, high and very high potential based on a natural break classification scheme (Zare et al. 2013). The results are shown in figure 5(a–c).

Fig. 5
figure 5

GPM developed by the (a) FR, (b) SE and (c) EBF models.

Fig. 6
figure 6

Validation of the GPM in the Sero plain: (a) the success rate curve and (b) the prediction rate curve.

5 Validation of the achieved maps for groundwater potential

Validation of the achieved maps by different models is a very important phase in any modelling, without which the results of modelling lose their scientific significance (Chung and Fabbri 2003). The training dataset was overlaid with final maps and, finally, the receiver operating characteristic curves (ROC curves) were plotted as the success-rate curves. The area under the ROC curve (AUC) was quantitatively calculated as the result of the prediction capability of the model in the SPSS 18 software (Akgun 2012; Mohammady et al. 2012; Pourghasemi et al. 2012). The ROC curve is a graphical plot demonstrating the diagnostic capability of a binary classifier system. In the ROC plot, X-axis and Y-axis are false positive rate and true positive rate, respectively. The range of AUC is between 0.5 and 1, and higher AUC values exhibit more prediction ability of the model. This range was classified as: 0.5–0.6, poor; 0.61–0.7, average; 0.71–0.8, good; 0.81–0.9, very good and finally, 0.9–1, excellent (Yesilnacar 2005). As the success rate curve was calculated with the training dataset, which was used for modelling, it could not be utilised as the result of model validation and it only showed how much the model built was useful in GPM. The prediction rate curve, which was attained using the final groundwater potential maps and testing dataset, revealed how efficient the built model was for GPM. Thus, its AUC was used as the model’s prediction capability. The AUC success rate values of FR, SE and EBF models are equal to 0.888, 0.847 and 0.897, respectively, which correspond to 88, 84 and 89% modelling accuracy rates (figure 6a and table 3). The AUC values of the prediction rate for the FR, SE and EBF models are 0.84, 0.81 and 0.85, which correspond to 0.84, 81 and 85% of the prediction accuracy rates (figure 6b and table 3).

Based on the results, the FR, SE and EBF bivariate models can be applied as a practical and simple tool to evaluate and manage groundwater. The advantages of the bivariate models are (i) simple implementation, (ii) reasonable accuracy in spatial prediction, and (iii) the ability to identify the factors or combinations of factors in the assessment (Van Westen et al. 2003).

Table 3 AUC results of the used models.

6 Conclusion

As groundwater resources have an increasingly important role in water supplies all over the world, the evaluation of groundwater potential zonation has turned into a hot topic for those holding responsible positions in authority in governments, groundwater resources managers and private and land-use planners. In the present paper, three data-driven models, namely, FR, SE and EBF, were successfully applied to delineate the groundwater potential of the Sero plain in west Azerbaijan province, Iran, as one of the principal regions in the province for agricultural activities. In the first step, 75 well locations with groundwater yields \({>}11\,\hbox {m}^{3}\)/h were identified and divided into two categories with the sizes of 70% (53 locations) to develop the model and 30% (22 locations) to validate the model. Afterwards, 10 groundwater conditioning factors were selected for modelling. Finally, using the above-mentioned three models, three groundwater potential maps were produced. The results of model validation showed that the AU-success rate and AU-prediction rate curves of FR, SE and EBF were equal to 88, 84 and 89% and 0.84, 81 and 85%, respectively, and, according to ROC classification, their accuracy was very good. The results revealed that the EBF model was the best model for groundwater assessment of the Sero plain in both training and validation phases; the next models were FR and SE. One of the major benefits of EBF was that it allowed for (i) the predictive mapping of favourable zones and (ii) modelling of the uncertainty degree in prediction. Based on the results of the SE model, the most and the least important factors of groundwater occurrences in the study area were altitude and curvature and rainfall, respectively.

Overall, the achievements of the current study revealed that the models investigated could be applied successfully in spatial prediction modelling. The results of this study are helpful for related agencies in Iran, e.g., the RWAU and the Bureau of Soils and Water Management, and decision makers in comprehensive assessment of the development of groundwater exploration and environmental management in future planning. The bivariate models provided rapid, accurate and cost-effective results. In future work, the water quality of the case study, especially in areas with high groundwater occurrence potential, may be investigated and classified according to its usability in drinking and agriculture for better management of the groundwater resources.