1 Introduction

Average annual precipitation in Iran has been measured as 270 mm, which is less than one third of the world’s annual average (Mahdavi 2004; Chezgi et al., 2015). Seventy percent of the precipitation falls on northern parts of the country, while other parts receive inadequate amount (Ahmadi et al. 2010). According to Mahdavi (2004), in the case of temporal distribution of the precipitation, a low share of the precipitation occurs during plan growing season. The rapid increase in the population of Iran in the last decades has caused an increasing demand for fresh water, especially for groundwater resources. Considering the mentioned facts, efficient management, conservation plans, and sustainable use of groundwater resources are necessary activities needed.

Qanat is an underground construction that collects and delivers groundwater from highland mountainous areas to lowland residential areas for different purposes especially drinking water and farming (Perrier and Salkini 1991; Nazari Samani and Farzadmehr 2006) (Fig. 1). Qanat technology was developed by the Persian people and spread from there to more than 34 countries in Asia, Africa, and Europe (Naghibi et al. 2015). There are about 32,164 active qanat systems in Iran with approximately 9 billion cubic meters (bcm) of total discharge (Naghibi et al. 2015). More description on qanat and its structure can be found in Nazari Samani and Farzadmehr (2006) and Naghibi et al. (2015).

Fig. 1
figure 1

A profile of qanat and its components (Nazari Samani and Farzadmehr 2006)

One new method which has provided much useful information for water resource managers is groundwater potential mapping (GPM). Different researchers have used different groundwater indicators for groundwater potential mapping such as spring, qanat, and well. In some studies, spring locations and different models were used for producing GPMs such as frequency ratio (FR) (Oh et al. 2011; Pourtaghi and Pourghasemi 2014; Davoodi Moghaddam et al. 2015); weights-of-evidence (WE) (Ozdemir 2011a); logistic regression (LR) (Ozdemir 2011a; Pourtaghi and Pourghasemi 2014); index of entropy (IE) (Naghibi et al. 2015); artificial neural network (ANN) (Lee et al. 2012a, b); analytical hierarchy process (AHP) (Rahmati et al. 2014; Razandi et al. 2015); evidential belief function (EBF) (Pourghasemi and Beheshtirad 2014); and data mining models such as random forest, classification and regression trees, and boosted regression tree (Naghibi and Pourghasemi 2015; Naghibi et al. 2016; Zabihi et al. 2016).

In the last decade, many predictive models have been used in other fields of sciences and engineering such as soil, spectroscopy, landslide susceptibility mapping, food quality, air quality, land use, and flood susceptibility mapping, including flexible discriminant analysis (FDA) (Tebaldi et al. 2002), k-nearest neighbor classification (KNN) (Paraskevas et al. 2015), linear discriminant analysis (LDA) (Ramos-Canon et al., 2015; Eker et al. 2015), multivariate adaptive regression spline (MARS) (Felicisimo et al. 2012; Samui and Kurup 2012; Tayyebi and Pijanowski 2014), penalized discriminant analysis (PDA) (Granitto et al. 2008; Zhu and Tan 2016) quadratic discriminant analysis (QDA) (Eker et al. 2015), and SVM (Ballabio and Sterlacchini 2012; Kavzoglu et al. 2014; Hong et al. 2015; Marjanović et al. 2011; Pourghasemi et al. 2013; Tehrany et al. 2014; Tehrany et al. 2015; Tien Bui et al. 2015; Karami et al. 2015), and the results were reported satisfactory.

Considering the aforementioned literature, there are two major novelties in current study: (i) the use of the ten advanced and soft computing models for groundwater qanat potential mapping and (ii) consideration of qanats as indicator for groundwater potential.

2 Study area

The study area lies between 31°50′36″N and 32°34′16″N latitude and 51°26′57″E and 59°21′51″E longitude with an area of 2321 km2. Beheshtabad is located in Chaharmahal-e-Bakhtiari Province, Iran. The elevation of the study area differs from 1660 to 3560 m with an average of 2301 m. The mean yearly rainfall in Beheshtabad is 618.8 mm (Mojiri and Zarei 2006). Based on the Geological Survey of Iran (GSI 1997), the most part of the area is covered by the lithology as A in Table 1. The study area is consisted of main land use classes of agricultural (29.83%), orchard (1.37%), rangeland (66.25%), and residential (2.54%). There are 1425 springs and 228 qanats in the study area. People in the study area have a high dependency to groundwater resources for water supply and other usages.

Table 1 Lithological characteristics of Beheshtabad Watershed, Iran

3 Methodology

The overall methodology (Fig. 2) includes the following: (1) preparing a qanat location map in the study area; (2) feature selection using frequency ratio model; (3) running ten models for groundwater qanat potential mapping; (4) validating the models; and (5) comparing different models and selection of the best models using ROC curve, Cohen’s kappa, specificity, and sensitivity indices. Figure 1 lists the factors used and the processes applied in the analysis.

Fig. 2
figure 2

Overall methodological flow chart adopted in this study

3.1 Qanat dataset

In the current study, the location of qanats was detected using extensive field surveys and national reports. In total, 228 qanats were detected in the Beheshtabad Watershed and mapped at 1: 50,000 scale (Fig. 3). Water resources provided by qanat structures in the Beheshtabad area are used for different sections such as drinking water purposes, farming, wild life, etc. Qanats were divided into two groups: training dataset (70%, 160 qanats) and validation dataset (30%, 68 qanats) (Oh et al. 2011; Ozdemir 2011a, b; Lee et al. 2012a, b). For this, Hawth extension (Gutiérrez et al. 2009) was used to randomly classify qanat locations into two groups. Hawth extension provides tools for sampling such as generating random points and creating random selections. In addition, Hawth tools can be used to calculate and add some indices and add to the table of layers such as area, perimeter, length, etc.

Fig. 3
figure 3

Qanat locations with digital elevation model (DEM) map of the study area

3.2 Groundwater influence factors

To assess groundwater potentiality in this study, it is vital to consider several qanat-effective factors. Fourteen influencing groundwater factors were selected based on literature review (Naghibi et al. 2015) and data availability. These factors contain three primary topographical attributes (i.e., altitude, slope angle, slope aspect), five secondary topographical attributes (i.e., plan curvature, profile curvature, topographic wetness index, slope length, stream power index), four river and fault maps (i.e., distance from rivers and faults, rivers and faults density), and finally categorical factors (i.e., lithology and land use). These factors were then classified according to literature review (Oh et al. 2011a, b; Ozdemir, 2011a, b; Naghibi and Pourghasemi 2015).

3.2.1 Primary topographical attributes maps

First, the digital elevation model (DEM) was extracted using the 1:50,000 scale topographic maps (contour lines and elevation points) in a 20-m cell size. Then, three primary factors including altitude, slope angle, and slope degree were calculated and classified. Altitude was divided into five classes (<2000, 2000–2400, 2400–2800, 2800–3200, and >3200 m) based on equal classification scheme (Fig. 4a). The slope angle map was prepared by dividing it into four classes: (0°–5°, 5°–15°, 15°–30°, and >30°) (Fig. 4b). Slope aspect was categorized into north, northeast, northwest, east, west, southeast, southwest, south, and flat based on normal or common standard classification (Fig. 4c).

Fig. 4
figure 4figure 4

Topographical parameter maps of the study area: a altitude, b slope angle, c slope aspect, d plan curvature, e profile curvature, f slope length, g stream power index, h topographic wetness index, i distance from rivers, j distance from faults, k drainage density, l fault density, m land use, n lithology

3.3 Secondary topographical attributes maps

Five secondary topographical attributes were used in the analysis: plan curvature, profile curvature, slope length (LS), stream power index (SPI), and topographic wetness index (TWI) (Fig. 3d–h). Plan curvature and profile curvature were prepared by using SAGA-GIS 2.8 (Fig. 4e). The plan curvature map was classified into three classes of convex, concave, and flat representing positive, negative, and zero values, respectively (Fig. 4d). Profile curvature map was prepared and then categorized into <−0.001, −0.001 to 0.001, and >0.001 groups. LS was classified into four classes (0–20, 20–40, 40–60, and >60) (Fig. 4f). SPI was classified into four classes of (0–200, 200–400, 400–600, and >600) (Fig. 4g). TWI is a topographical index which shows the aptitude of water to gather at each point in the watershed (Moore et al. 1991). This factor was classified into (<8, 8–12, and >12) (Fig. 4h).

3.3.1 River and fault maps

Four maps were extracted from river and fault layers including drainage density, distance from rivers, fault density, and distance from faults. Distance from river and drainage density was created using topographic maps of Beheshtabad watershed. In order to produce fault-related groundwater factors, distance from fault and fault density maps were calculated implementing a geological map of Beheshtabad watershed. Distance from river map was classified into (<100, 100–200, 200–300, 300–400, and >400 m) (Fig. 4i). Distance from fault map was classified into <250, 250–500, 500–750, 750–1000, and >1000 m (Fig. 4j). For drainage density and fault density, natural breaks were used for classification (Fig. 4k, l).

3.3.2 Categorical factors (land use and lithology)

Using Landsat images, the land use map was created. Supervised classification and maximum likelihood algorithms were used to produce land use map of the study area. Four land use classes were identified including agricultural, residential, orchard, and rangeland (Fig. 4m).

The lithology map of the study area that has 13 lithological classes was created from a geological map obtained from Geology Survey of Iran (GSI) (1997). In the study area, there are different lithology classes which were classified from A to M (Table l and Fig. 4n).

3.4 Feature selection

FR is defined as the probability of happening of an exact event (Bonham-Carter, 1994). An FR value of greater than 1 for an influence factor indicates a high correlation (Oh et al. 2011). FR also shows the relationship between qanat occurrence and groundwater influence factors. Although it ignores the interactions between influence factors, it was used as an indicator to show whether the factors are useful alone or not. In this step, FR values were calculated to determine whether the factors are influence or not. For this, FR was calculated for each of the factors’ classes by \( FR=\left(\frac{q}{Q}/\frac{p}{P}\right) \). In this equation, q is the number of pixels with a qanat for each factor, Q shows the number of total qanats in study area, p is the number of pixels in the class area of the factor, and P is the total number of pixels in the area.

3.5 Soft computing models for groundwater qanat potential assessment

In this study, four discriminant analysis methods were implemented for modeling groundwater potential including LDA, QDA, FDA, and PDA models.

Fisher (1936) introduced a linear discriminant analysis model which can separate two classes an object by seeking for a linear combination of variables (Eker et al. 2015). In the LDA, the estimated values (i.e., N) are determined using a linear combination of a set of explanatory variables (i.e., effective factors) such as N = fX + q (q = constant), which best differentiates the group of a case by finding the f coefficients (Eker et al. 2015). LDA has two assumptions including the existence of normal distribution in effective factors and close covariance values. For running LDA model, MASS package was used in R 3.0.2.

In quadratic discriminant analysis approach, a quadratic surface will be determined which could separate the group of a case. QDA searches for a group membership consisting of a matrix (m × m) (m = number of effective factors) and a linear combination of these factors such that Q = x T Vx + K T x + f (Eker et al. 2015), where V shows the m × m matrix of coefficients, K shows the linear combination coefficient, and f is a constant (Eker et al. 2015). In the QDA, the assumption of the normal distribution still exists, but there is no need for covariance values to be close. The QDA was analyzed using “MASS” package in R 3.0.2 as well.

FDA was developed to combine non-parametric regression models with LDA to get more flexibility in the decision boundaries (Mallet et al. 1996). This goal could be achieved by casting regression methods and classification methods into one framework (Mallet et al. 1996). On the other hand, FDA is the application of LDA on the matrix achieved with regression methods and on the transformed class matrix (Reyn et al. 2006).

The penalized discriminant analysis developed by Hastie et al. (2001) is a regularized version of the traditional Fisher’s LDA (Ripley 1996). PDA is suggested to be more appropriate for problems in which LDA has overfitting problem (Mallet et al. 1996). PDA, first, recasts LDA classification problem as a regularized linear regression problem. Then, it applies one of the many famous techniques available (Granitto et al. 2008).

Boosted regression tree uses both statistical and machine learning (SL and ML) techniques (Youssef et al. 2015). BRT has a different approach from traditional regression methods that produce a single best model (Elith et al. 2006; Leathwick et al. 2006). BRT combines a large number of simple tree models to improve the performance of prediction using boosting technique (Elith et al. 2006; Leathwick et al. 2006). In addition to boosting, the BRT also implements regression trees in the modeling process. Regression trees are categorized from the classification and regression tree approaches which are from decision tree group of models. An R script (gbm and dismo) was used to run BRT.

Random forest is a new non-parametric method and a very accurate classifier and robust against noise (Breiman, 2001). The algorithm extracts binary trees selected randomly that implement a sub-dataset of the feature observations through bootstrapping method. Then, RF selects a sub-dataset from the whole dataset to build the model (Zabihi et al. 2016). The data which is not included in the sub-dataset are called out-of-bag (OOB) (Breiman 2001; Catani et al. 2013). RF grows multiple decision trees on random sub-datasets from training dataset and related variables. Small changes in the training data cause a high variance in single classification trees and often results in rather low prediction accuracies (Breiman 1996). RF was fitted in R (R Development Core Team 2005) version 3.0.2, implementing the randomForest package (Ridgeway 2006). Artificial neural network comprises several layers of nodes (neurons) (Lee et al. 2012a, b). These layers of nodes exchange messages with each other (Lee et al. 2012a, b). Every node is connected to the other nodes in the next layer. Input layer includes influence factors and the output layer delivers one or more predictive values for the response variable(s), which in this case is the probability of qanat occurrence (Aertsen et al. 2010). Between them, there are one or more hidden layers and the network is trained by implementing an iterative method for determining the weights of the connections between the layers (Aertsen et al. 2010). In the current study, the back propagation (BP) algorithm with “mlpe” package was tested for groundwater qanat potential mapping and the results were compared with other soft computing models.

KNN classification is a non-parametric model for classification and regression problems (Chirici et al. 2015). Rote classifier, first, memorizes the whole training data. Then, the mentioned Rote classifier classifies only when the features of the test object (i.e., qanat) match with one of the training objects (Wu et al. 2008). However, there is a drawback in this approach. Since many test objects do not match with any of the training records, they cannot be categorized (Wu et al. 2008). The best selection of the k value depends on the data. Everitt et al. (2011) mentioned that larger values of k results in a reduction in the effect of noise on the classification. However, it makes boundaries among classes less clear (Everitt et al. 2011). The KNN model fit in R 3.0.2 was evaluated using the “rminer” package.

MARS is a method that is implemented in order to fit the relationship between input (in this case, groundwater influence factors) and output variables (in this case, qanat occurrence) (Friedman, 1991). MARS combines three techniques to build a new model. These techniques are (1) constructing splines mathematically, (2) binary recursive partitioning (BRP), and (3) linear regression (LR) (Friedman 1991). This model determines the type of relationship between the response factor (qanat occurrence) and the groundwater influence factors which could be linear or non-linear (Hastie et al. 2001). The MARS model was fitted using the “earth” package in R 3.0.2.

Support vector machines (SVMs) are a set of machine learning techniques based on the concept of optimal separating hyperplane which are developed by Vapnik (1995). SVM can be thought as non-linear classifiers which aim to find the most extensive margin between two classes in feature space (Ballabio and Sterlacchini 2012). The SVM approach aims to (1) reduce the error test and (2) reduce the model complexity (Ballabio and Sterlacchini 2012). In other words, SVM depends on data which means that the model capacity is calibrated to match data complexity which is called structural risk minimization, a paradigm on which SVM is based (Vapnik 1995; Cherkassky and Mulier 2007). In this study, SVM was fit in R 3.0.2 using the rminer package.

3.6 Data-based sensitivity analysis

Data-based sensitivity analysis (DSA) was used in order to define the importance of the groundwater influence factors in the ANN, QDA, and SVM models. The sensitivity analysis (SA) dataset is consisted of Ns random sub-datasets which are selected from the initial dataset. In the SA dataset, all xz values are altered by xak and the responses are gathered (Cortez and Embrechts 2013), where x z  : a∈ (1, 2, 3, …, B) and xak is the first input level, and B represents input variables. In the next step, the previous function is repeated implementing a different j value (j € {1, 2, 3, … , L}, L = level) (Cortez and Embrechts 2013). The mentioned process will be repeated for all groundwater influence factors and results in a complexity of the order ϑ(M * L * N s  * P), where Ns is the length of the training samples (Cortez and Embrechts 2013).

3.7 Validation and comparison of the groundwater potential maps

For evaluating the models, ROC, sensitivity, specificity, kappa, and qanat density indices were used. Sensitivity is proportion of qanats (in this study) which are correctly estimated as qanat (Negnevitsky 2002). On the other hand, specificity is called to the proportion of the non-qanats (in this study) that are correctly predicted as non-qanat (Negnevitsky 2002). Tradeoff between these indices is called ROC curve (Negnevitsky 2002; Hong et al. 2016; Karimi et al. 2016) which was also calculated and used in this study. Kappa index was also calculated as below:

$$ k=\frac{P_o-{P}_e}{1-{P}_o} $$
(1)

where, P o is TP + TN/n, and P e  = (TP + FN)(TP + FP) + (FP + TN)(FN + TN)/\( \sqrt{N} \). In these equations, TP is true positive, TN is called the true negative, FP is false positive, FN shows false negative, and N represents the total number of observations (Moosavi and Niazi 2015; Naghibi and Moradi Dashtpagerdi 2016).

Qanat density (QD) was also computed for the implemented models as below:

$$ QD=\frac{P_Q}{P_P} $$
(2)

where QD represents qanat density, P Q depicts percentage of the qanats in each class of the GPMs, and P P percentage of the pixels in each class of the GPMs.

4 Results

4.1 Results of the feature selection

According to Table 2, for slope angle, lower slopes had larger correlation with qanat occurrence as the class of 0–5 had the highest FR with a value of 1.71. For slope aspect, flat and southeast aspects had the highest FR values (1.87 and 1.34, respectively). For altitude, the class of 2000–2400 has the highest FR value (1.27). For plan curvature, the flat class had the highest FR with a value of 1.43, indicating a high probability qanat occurrence in this class. For the profile curvature class of −0.001 to 0.001 had the highest FR value (1.20). Qanats are concentrated in areas with LS <20 (FR value of 1.44), in areas with an SPI (<200) (1.40), and in areas with TWI >12 (1.69). For distance from rivers, the class of <100 had the highest FR value (2.63), while for distance from faults, the class of >1000 m had the highest FR value (1.11). The results of river density showed that the 0.31–0.78 class had a high density of qanats with FR value of 1.30. In the case of fault density, the <2.72 and 8.37–15.70 classes had the highest values (FR = 1.13, and 1.12, respectively). In respect of land use factor, orchard and agriculture had the highest qanat concentration with frequency values of 1.82 and 1.44, respectively. Finally, in the case of lithology, F and I had the highest FR values (2.72 and 1.30, respectively).

Table 2 Spatial relationship between each effective factor and qanat locations using frequency ratio (FR) model

Considering the FR values, it can be concluded that all of the factors, in one or some classes, have higher FR values than 1 which implies that the factors can be used as influence factors on groundwater potential.

4.2 Results of the models

The results of the LDA indicated that GPM could be calculated as below:

$$ {\mathrm{GQPI}}_{LDA}=-0.139*\mathrm{slope}\ \mathrm{degree}-0.071*\mathrm{slope}\ \mathrm{aspect}+0.002*\mathrm{altitude}-32.299*\mathrm{plan}\ \mathrm{curvature}-22.828*\mathrm{profile}\ \mathrm{curvature}+0.011*\mathrm{slope}\ \mathrm{length}-0.0000006*\mathrm{stream}\ \mathrm{power}\ \mathrm{index}-0.0004*\mathrm{topographic}\ \mathrm{wetness}\ \mathrm{index} - {0.0004}^{*}\ \mathrm{distance}\ \mathrm{from}\ \mathrm{rivers}+0.00002*\mathrm{distance}\ \mathrm{from}\ \mathrm{faults}-1.164*\mathrm{river}\ \mathrm{density}-0.015*\mathrm{fault}\ \mathrm{density}-0.031*\mathrm{land}\ \mathrm{use}-0.025*\mathrm{lithology} $$
(3)

In the GPM obtained by using LDA, low potential, moderate potential, high potential, and very high potential classes cover 21.29, 25.56, 33.58, and 19.57% of the study area, respectively (Fig. 5a).

Fig. 5
figure 5

Groundwater potential maps produced by a LDA, b QDA, c FDA, d PDA, e BRT, f RF, g ANN, h KNN, i MARS, j SVM

In the case of QDA, the GPM was grouped into four classes of low potential, moderate potential, high potential, and very high potential (Fig. 5b). The moderate potential GPM class derived using the QDA model covers 25.27% of the area; 24.69, 17.22, and 32.80% of the area are assigned to low potential, high potential, and very high potential GPM zones, respectively (Table 3). In this method, altitude, profile curvature, and plan curvature had the highest importance (Fig. 6a).

Table 3 Distribution of the qanat potential values and areas with respect to the groundwater qanat occurrence potential zones
Fig. 6
figure 6

The importance of influence factors in QDA (a), BRT (b), RF (c), ANN (d), and SVM (e) models

FDA resulted in a final model included degree value of 1 and nprune of 10. In this case, nprune depicts number of terms and degree represents product degree. In the GPM obtained by using the FDA model, high and very high classes covered 23.12 and 17.12% of the study area, while low potential and moderate potential classes of potentiality covered 32.25 and 24.52% of the area (Fig. 5c).

Penalized discriminant analysis was fitted and the final model had shrinkage penalty coefficient (lambda) value of 0.1 with accuracy value of 0.61 and kappa value of .023. In the case of GPM obtained by using PDA, moderate potential class covered 26.11% of the study area and low potential, high potential, and very high potential classes contained 21.43, 33.01, and 19.45% of the area (Fig. 4d).

In the case of BRT, the five most influential variables were altitude (18.42%), TWI (13.91%), distance from faults (12.52%), slope aspect (9.99%), and slope degree (9.70%), respectively (Fig. 6b). GPM obtained by using BRT model is illustrated in Fig. 5b. We found that low potential, moderate potential, high potential, and very high potential classes cover 24.44, 25.44, 25.06, and 25.05% of the study area, respectively (Fig. 4e).

RF provides two indices to determine the importance of the input variables, i.e., mean decrease in accuracy and mean decrease in Gini (Immitzer et al. 2012). Altitude, TWI, slope, and distance from faults had the highest importance between effective factors, respectively (Fig. 6c). On the other hand, fault density, land use, and aspect had the lowest power. Low potential, moderate potential, high potential, and very high potential classes cover 31.27, 30.74, 23.10, and 14.89% of the study area, respectively (Fig. 5f).

In the case of ANN, the final multilayer perceptron (MLP) was applied with 14 input layers, 7 hidden layers, and 1 output layer as a 14-7-1 network. A groundwater qanat potential map was calculated using artificial neural network (Fig. 5g). The range of classes and percentage of each class are presented in Table 3. According to the results, low potential, moderate potential, high potential, and very high potential classes cover 13.00, 34.87, 32.57, and 19.54% of the study area, respectively. According to Fig. 6d, TWI, slope degree, and river density had the most important influence factors, while land use, slope aspect, and fault density had the lowest importance.

Groundwater qanat potential map was calculated using KNN by a k = 3 in the study area (Fig. 5h). Based on this method, low potential, moderate potential, high potential, and very high potential classes cover 21.72, 34.57, 31.05, and 12.66% of the study area, respectively.

The main equation calculated by multivariate adaptive regression spline model is represented as

$$ \mathrm{MARS}=0.928\hbox{--} 0.018* \max \left(0,\mathrm{Slope}-8.324\right)-0.00004* \max \left(0,\mathrm{Fault}\ Dis-6596.97\right)+67.984* \max \left(0,\mathrm{Profile}--0.0009\right)*\mathrm{Landuse}2\hbox{--} 0.061* \max \left(0,14.489-TWI\right)*\mathrm{Landuse}2+0.005* \max \left(0,2280-\mathrm{Altitude}\right)* \max \left(0,0.353\hbox{--} \mathrm{River}\ \mathrm{Dens}\right)-0.0002* \max \left(0,2280-\mathrm{Altitude}\right)* \max \left(0,8.797\hbox{--} \mathrm{Fault}\ \mathrm{Dens}\right)-0.014* \max \left(0,\mathrm{Profile}--0.004\right)* \max \left(0,6596.969-\mathrm{Fault}\ Dis\right) $$
(4)

The groundwater potential map produced by MARS is shown in Fig. 5i. In this method, low potential, moderate potential, high potential, and very high potential classes cover 16.21, 23.88, 27.22, and 32.67% of the study area, respectively.

In the case of SVM, C-SVM (SVM type 1) with a Gaussian radial basic kernel function, hyperparameter sigma = 0.076, number of support vectors = 249, objective function value = −189.32, and training error = 0.2 was fitted. Finally, groundwater potential map produced by using SVM model is shown in Fig. 5j. The range and classes are represented in Table 3. According to the results, low potential, moderate potential, high potential, and very high potential classes cover 36.34, 29.49, 20.58, and 13.57% of the study area, respectively. According to Fig. 6e, altitude, slope degree, and plan curvature had the highest importance between effective factors, while lithology, land use, and slope aspect had the lowest importance.

4.3 Validation of qanat potential maps

The results of the sensitivity showed that QDA and ANN were the best models for predicting qanat locations (Table 4). On the other hand, MARS and KNN were the weakest models in this case. The results of the specificity depicted good performance of BRT and RF models and weak performance of QDA and ANN models (Table 4). Results of the AUC-ROCs indicate that RF, SVM, and BRT had the highest ROC values (0.846, 0.83, and 0.829, respectively) indicating better performance by these models in groundwater potential mapping, while ANN and KNN had the lowest values of ROC (0.632, and 0.703, respectively, indicating weak performances; Table 4). In the case of Kappa index, RF, BRT, FDA, and SVM showed better results, while ANN, KNN, and MARS showed weak performances (Table 4). In addition, QD was also calculated and represented in Table 5. According to the results, PDA, SVM, BRT, and RF had the highest values of 18.15, 2.76, 1.98, and 1.93, respectively. Overall, it can be concluded that RF, SVM, and BRT were the most successful method for groundwater modeling in this study.

Table 4 Prediction rate curve for the qanat potential maps produced by different models implemented in the current study
Table 5 Qanat density (QD) on groundwater potential maps developed from different machine learning models

5 Discussion

In this section, the results were discussed in three parts including the performance of models and their comparison, the comparison between qanat and spring as indicators of groundwater potential, and the importance of variables in groundwater modeling.

5.1 The performance of models and their comparison

According to the results, RF had the best performance in groundwater modeling, followed by SVM and BRT models. The better performance of RF can be due to its ability to run on large databases and capability to handle thousands of input variables without variable deletion. Stumpf and Kerle (2011) mentioned that random forests take advantage of the high variance among individual trees which lets each tree vote for the class membership. Then, RF determines the respective class based on the most number of votes. In addition, RF is able to deal with interactions and non-linearities between effective factors (Catani et al. 2013). Also, RF showed suitable performance in different fields of study including wildfire, ecology, groundwater spring potential mapping, and landslide susceptibility mapping (Peters et al. 2007; Oliveira et al. 2012; Vorpahl et al. 2012; Naghibi and Pourghasemi 2015). The SVM model has the advantage of handling complex, non-linear relationships, and is very robust to noise (Ballabio and Sterlacchini 2012; Tien Bui et al. 2016). It is shown that the SVM model performs well in different fields of study including flood susceptibility mapping (Tehrany et al. 2014; Tehrany et al. 2015) and landslide susceptibility mapping (Ballabio and Sterlacchini 2012; Pourghasemi et al. 2013). It was seen that BRT model had suitable performance and attained third rank in groundwater potential mapping among the implemented models. BRT model includes strong features of tree-based models; it can model different types of factors and cope with missing data. BRT is also able to handle interaction effects among effective factors (inputs) (Elith et al. 2008). In addition, BRT showed good performance in different fields of study such as ecology (Abeare 2009; Aertsen et al. 2011), groundwater spring potential mapping (Naghibi and Pourghasemi 2015), and landslide susceptibility mapping (Youssef et al. 2015). Overall, tree-based models have some advantages such as feature selection and pruning. Feature selection leads to selection of the most important factors which can be used for splitting and making decision. On the other hand, TB models employ pruning features which makes them more general and makes the results more acceptable. Among discriminant analysis models, it was seen that FDA had the best performance, followed by QDA, LDA, and PDA models. In the case of LDA, it is very sensitive to outliers, and no dependent factor may be definitely correlated to a linear combination of other variables. ANN is prone to overfitting and KNN performs poorly on high-dimensionality datasets (Tien Bui et al. 2012).

5.2 The comparison between qanat and spring as indicators of groundwater potential

Using qanat as an indicator for groundwater potential was a novelty in the current study. Naghibi and Pourghasemi (2015) used spring data to map groundwater potential by five models of BRT, RF, CART, EBF, and GLM in the Beheshtabad watershed, Iran. According to their results, BRT and RF models had ROC values of 86.12 and 86.05, respectively. In the current study, BRT and RF models showed ROC values of 84.90 and 86.31, respectively. The results obtained by spring and qanat as groundwater potential indicators are very similar. So, it can be concluded that qanat is also a good indicator for groundwater potential.

5.3 The importance of variables in groundwater modeling

Altitude, slope degree, plan curvature, and profile curvature were found to be more significant factors compared to others. On the other hand, lithology, land use, and slope aspect were the least important influence factors. Therefore, it can be concluded that two primary and two secondary parameters had the highest importance and contribution in the modeling process. The four mentioned primary and secondary topographical parameters affect water flow concentration as well as its infiltration in any part of the watershed. As slope degree increases, flow speed increases and a decrease in infiltration can be observed. Lower altitudes contain more developed drainage system which leads to more water flow and subsequently higher available water. Considering the topographical condition of the study area which is mountainous, higher importance of the primary and secondary topographical factors could be justified. In addition, the quality of the input factors could influence the results of the models and subsequently the factors’ importance. Thus, it can be concluded that the importance of the factors could be affected by general topographical condition of the area and its quality. In another study, Naghibi and Pourghasemi (2015) found altitude, distance from faults, SPI, and fault density to be more important for groundwater modeling. Hence, there are some differences in the sensitivity of different factors in the use of qanat versus spring for groundwater modeling.

6 Conclusion

This study evaluated the performance of ten models in the groundwater potential mapping using qanat locations as indicator in the Beheshtabad Watershed, Iran. For this purpose, 14 influence factors were prepared and used in the modeling. The ROC, sensitivity, specificity, kappa, and qanat density indices were used to evaluate the performance of the models. The results showed that the RF, SVM, and BRT models were more suitable for groundwater potential mapping using qanat location. Overall, it was seen that tree-based models (BRT and RF) and SVM had better performance than discriminant models, ANN, MARS, and KNN. In addition, among discriminant analysis models, FDA had the best performance. Furthermore, the suitability of qanat locations as groundwater potential indicator was verified. Therefore, in many countries, these constructions can be used as groundwater indicator for mapping groundwater potential. In addition, it was concluded that altitude, slope degree, plan curvature, and profile curvature were more effective compared to other factors. The results of the current study can be used by land use planners and water resource managers in order to reduce the costs of groundwater resource discovery and exploitation.