Introduction

Groundwater is one of the most precious natural resources, which supports human civilization (Bera and Bandyopadhyay 2012). Its essential qualities make it an immensely important and dependable source of water supplies in all climatic regions including both urban and rural areas of developed and developing countries (Waikar and Nilawar 2014). Geological strata act both as conduits for transmission of and reservoirs for groundwater. The suitability for exploitation of groundwater in a geological formation primarily depends on storage and transmissivity of the formation. High relief and downhill slopes impart higher runoff, while topographical depressions enhance groundwater recharge (Waikar and Nilawar 2014). Areas of high drainage density also increase surface runoff. Surface water bodies like rivers and ponds can operate as recharge zones (Murugesan et al. 2012; Waikar and Nilawar 2014).

Groundwater is not an unlimited resource so its use should be properly planned based on the understanding of the groundwater systems behavior in order to ensure its sustainable use (Bera and Bandyopadhyay 2012). Assessing the potential zone of groundwater recharge is therefore important to protect water quality and manage groundwater use. Groundwater recharge zones can be demarcated with the help of remote sensing (RS) and GIS techniques (Waikar and Nilawar 2014). One key advantage of RS data for hydrological investigations and monitoring is its capability to generate information in spatial and temporal domains, which is valuable for analysis, prediction, and validation (Waikar and Nilawar 2014). In addition, GIS technology provides suitable alternatives for efficient management of large and complex geospatial databases (Waikar and Nilawar 2014). Several studies have been conducted on groundwater evaluation using GIS and RS techniques (Jaiswal et al. 2003; Solomon and Quiel 2006; Jha et al. 2007; Ganapuram et al. 2009; Saha et al. 2010; Pourtaghi and Pourghasemi 2014; Naghibi et al. 2014; Davoodi Moghaddam et al. 2013; Rahmati et al. 2015). For example, Oh et al. (2011), Ozdemir (2011), Kaliraj et al. (2013) and Pourtaghi and Pourghasemi (2014) published various studies that have applied RS and GIS to groundwater spring potential mapping. Extending these techniques, numerous statistical modeling techniques are able to predict the potential distribution of a phenomenon from a set of independent variables: such as logistic multiple regression (LMR: Mair and El-Kadi 2013), generalized additive model (GAM: Sorichetta et al. 2013), random forest (RF: Rodriguez-Galiano et al. 2014; Naghibi and Pourghasemi 2015; Naghibi et al. 2016), and multivariate adaptive regression splines (MARS: Gutiérrez et al. 2009). In recent years, with the rapid development of information technology and database technology, data mining algorithms have seen applications beyond information technology into other societal applications (Yao et al. 2013). Data mining is a process of extracting potentially helpful information and knowledge, unknown in advance, from a large, incomplete, and noisy, fuzzy and random practical dataset (Yao et al. 2013). Although the MARS (multivariate adaptive regression spline) and RF (random forest) methods have been applied for landslide susceptibility mapping (Youssef et al. 2015), gully erosion modeling (Gutiérrez et al. 2009), and regional or local assessments of nitrate and pesticide contamination (Rodriguez-Galiano et al. 2014); this approach (MARS) and its comparison with a RF has not yet been used for groundwater spring potential mapping.

This study evaluates the GIS-based MARS and RF models for groundwater spring potential mapping at the Bojnurd Township in northern Khorasan Province, Iran (Fig. 1). The main objective of the study is to contribute towards systematic groundwater studies utilizing RF and MARS models to delineate groundwater spring potential areas which could be applied in other similar areas.

Fig. 1
figure 1

Location of the study area; a Iran map, b North Khorasan Province map, c spring location map of study area

Study area

The study area, as shown in Fig. 1, lies in the southern region of Bojnourd Township in North-Khorasan Province, Iran, known as the Bojnourd Plain. This 1243 km2 area is located between 55°44 and 56°18 longitude, and 38°17′ to 37°13′ latitude. The area’s elevation varies from 887 to 2967 m above mean sea level (m.s.l.) and the annual rainfall in the area is approximately 266.4 mm. The study area slopes gently from south to north and forms foothills for mountains to the north. The Bojnourd Plain is located in Kopet-Dagh geological formation that mainly covered by Quaternary sediments. The groundwater elevation in wells in the study area is between 1025 and 1074 m m.s.l. The groundwater depth varies between 4 and 80 m, and groundwater flows from the south and southwest to the north and northeast.

Methodology

Figure 2 shows a flowchart of the methodology applied in the current study. This figure demonstrates the explanatory variables used in the analysis and the processes applied according to the models. In the first step, the dataset for model development and application were assembled. Next, a certainty factor (CF) model was applied to determine spatial relationships among spring occurrence and explanatory variables. Then, MARS and RF models were applied to map groundwater spring potential. Finally, constructed models were validated and tested using the receiver operating characteristic (ROC) curve (success rate and prediction rate curves).

Fig. 2
figure 2

Flow chart of methodology used in spring potential mapping

Dataset for models development and application

Dataset and construction of a spatial database of explanatory variables are important parts of any research (Pourghasemi et al. 2013; Davoodi Moghaddam et al. 2013). At first, the spring locations were compiled from Iranian Department Water Resources Management (http://www.wrm.ir/index.php?l=EN) and extensive field surveys. In total, 234 springs were detected in Boujnurd watershed, North Khorasan, Iran (Fig. 1). 176 (70 %) of the spring locations were used for groundwater spring potential mapping and 58 (30 %) were set aside for validation. For conducting a spring potential map (SPM), it is necessary to evaluate mappable explanatory variables with the spring inventory map (Davoodi Moghaddam et al. 2013). In this study, 13 such explanatory variables were considered. These were; altitude, slope aspect, slope degree, slope-length (LS), topographic wetness index (TWI), plan curvature, profile curvature, land use, lithology, distance to rivers, drainage density, distance to faults, and fault density. A digital elevation model (DEM) was created using topographical maps at 1:50,000 scale. The DEM has a cell size of 30 m with 1284 rows and 1768 columns. The DEM was used to derive the altitude, slope aspect, slope degree, LS, TWI, plan curvature, and profile curvature values. The altitude map for the study area with cell size 30 m × 30 m was produced from the DEM and classified into five classes (Fig. 3a). Slope aspect strongly affects hydrologic processes via evapotranspiration (Sidle and Ochiai 2006) and has been categorized into nine classes (Fig. 3b). The slope map of the study area is obtained from the DEM using the slope function in ILWIS-GIS (http://www.ilwis.org/). These slope values (in degree) are divided into four classes (Fig. 3c). Slope-length (LS) is the combination of slope steepness (S) and slope length (L) which is implemented to represent soil loss potential from the combined slope properties (Fig. 3d). The LS factor was calculated according to Eq. 1 (Moore and Burch 1986) and classified into four categories.

$${\text{LS}} = \left( {\frac{{B_{\text{s}} }}{22.13}} \right)^{0.6} \times \left( {\frac{{{ \sin }_{\beta } }}{0.0896}} \right)^{1.3} ,$$
(1)

where B s = specific catchment’s area, β = slope angle.

Fig. 3
figure 3

Maps of explanatory variables in the study area; a altitude (m), b slope aspect, c slope degree, d slope length (LS), e topographic wetness index (TWI), f plan curvature (100/m), g profile curvature (100/m)

Another topographic factor is TWI which is defined in Eq. 2 (Beven and Kirkby 1979; Moore et al. 1991):

$${\text{TWI}} = { \ln }\left( {\frac{\alpha }{{{ \tan }\beta }}} \right)$$
(2)

where α = is the cumulative up slope area from a point (per unit contour length) and β = is the slope angle at the point (Fig. 3e). The plan curvature demonstrates the morphology of the topography. A positive curvature represents that the surface is upwards convex at that cell, and a negative curvature shows that the surface is upwards concave at that cell. A value of zero indicates a flat surface (Oh and Lee 2010) (Fig. 3f). The profile curvature shows the flow acceleration, erosion (negative values)/deposition (positive values) rate and it controls the change of speed of mass flowing down the slope (Yesilnacar 2005; Talebi et al. 2007). In this study, the profile curvature was prepared and classified into three groups based on common standard classification scheme (Pourghasemi et al. 2013) (Fig. 3g).

A land use layer was produced from Landsat-7/ETM+ satellite images using a supervised classification and maximum likelihood algorithm (Rahmati et al. 2016). The area is covered by six land use types; forest, rangeland, dray farming, irrigation farming, residential area, and bare land. The details of land use type are shown in Fig. 4 and summarized in Table 1. Lithological features of study region are represented in the geologic map (Fig. 5), which is derived from the geologic map at 1:100,000-scale prepared by Geological Survey of Iran (Geology Survey of Iran (GSI) 1997), digitized in ILWIS-GIS (version 3.8), and divided into 12 classes (Table 1).

Fig. 4
figure 4

Land use map of study area

Table 1 Lithology of the study area
Fig. 5
figure 5

Lithology map of study area

The distance to rivers was calculated using the vector river lines by manually applying the distance function in ArcGIS (version 9.3). Five classes corresponding to distance to rivers were calculated at 200-m intervals (Fig. 6a). The drainage density exhibits the flow of water through the study area and is defined as the ratio of sum of the drainage lengths in the cell and the area of the corresponding cell (Sarkar and Kanungo 2004; Pourghasemi et al. 2013). The drainage density was computed for each 30 × 30 m grid cell which ranges from 1.81 to 7.99 km/km2 and is classified into four classes (Fig. 6b). The distance to faults map was extracted from geologic maps at 1:100,000 scale, and then the buffer categories were defined (Fig. 7a). Finally, the fault density map was produced. The length of the faults from geological maps at 1:100,000 scale of the study area were extracted and divided by area for each 30 × 30 m grid cell with results ranging from 1.81 to 15.94 km/km2. The results were classified into four classes (Fig. 7b).

Fig. 6
figure 6

a Distance to rivers (m), b drainage density (Km/Km2) in the study area

Fig. 7
figure 7

a Distance to faults (m), b fault density (Km/Km2) in the study area

Models

Certainty factor (CF) model

In this study, the CF model was implemented to demonstrate the spatial link joining spring occurrence and explanatory variables. The CF (an approach that has seen widespread use in rule-based expert systems), is based on probabilistic reasoning (Chung and Leclerc 1994). This is one strategy to handle the problem of blending of different data layers and the heterogeneity and unreliability of the input data. The CF, defined as a function of probability, was originally suggested by Shortliffe and Buchanan (1975) and later modified by Heckerman (1986) (Kanungo et al. 2011):

$${\text{CF}} = \left\{ {\begin{array}{*{20}l} {\frac{{pp_{m} - pp_{n} }}{{pp_{m} (1 - pp_{n} )}}\quad {\text{if}}\; pp_{m} \ge pp_{n} } \\ {\frac{{pp_{m} - pp_{n} }}{{pp_{n} (1 - pp_{m} )}}\quad {\text{if}}\; pp_{m} < pp_{n} } \\ \end{array} } \right.$$
(3)

where, pp m is the conditional probability of having a number of spring events occurring in category m and pp n is the prior probability of having the total number of spring events occurring in the study area. The range of variation of the CF is [−1.0 to 1.0], where a positive value means an increasing certainty in spring occurrence, while a negative value corresponds to a decreasing certainty in spring occurrence. A value close to 0 means that the prior probability is very similar to the conditional one, so it is difficult to give any indication about the certainty of the spring occurrence. The favorability values (pp m , pp n ) are derived from overlaying each data layer with the existing spring distribution layer in GIS environment and calculating the spring occurrence frequency. CF values are then calculated for each layer and their sub-classes in Microsoft Excel 2010 (Kanungo et al. 2011).

Random forest (RF) model

“Random forest is an ensemble method which compounds multiple decision tree algorithms to produce repeated predictions of the same phenomenon. Random forests (RF) are very flexible ensemble classifiers based on decision trees, first developed by Breiman (2001)” (Breiman 2001; Catani et al. 2013; Micheletti et al. 2014). Decision trees can be separated to classification trees and regression trees (Rodriguez-Galiano et al. 2014). A regression tree (RT) indicates a set of restrictions or conditions which are hierarchically structured, and which are successively applied from a root to a terminal node or leaf of the tree (Breiman et al. 1984; Quinlan 1993). In order to derive the RT, recursive partitioning and multiple regressions are carried out from the dataset. From the root node, the data splitting process in each internal node of a rule of the tree is consecutive until a stop condition previously specified is reached. Each of the terminal nodes, or leaves, has joined to it a simple regression model which applies in that node only. Once the tree’s exaction process is finished, pruning can be applied with the aim of improving the tree’s generalization capacity by reducing its structural complexity. The number of cases in nodes can be derived as pruning criteria (Rodriguez-Galiano et al. 2014).

The RF algorithm handles random binary trees which use a subset of the observations through bootstrapping techniques: from the original data set a random choice of the training data is sampled and used to build the model, the data not included are referred to as “out-of-bag” (OOB) (Breiman 2001; Catani et al. 2013). Furthermore, a random selection of predictor variables is applied to split each node of the trees. Each tree is expanded to minimize classification errors, but the random selection influences the results, thus making a single-tree classification very unstable. The RF algorithm estimates the importance of a variable by looking for how much the prediction error increases when OOB data for that variable is permuted while all others are left unchanged (Liaw and Wiener 2002; Catani et al. 2013). This capability can be profitably applied to study the relative importance of the different explanatory variables, a critically important but often neglected aspect of SPM (spring potential mapping). In the R statistical package application of RF used in this work (the “randomForest” package in R 2.0.3 (Breiman and Cutler 2006)), the model output is a membership probability to one of the two possible classes “Spring” and “No spring”. Random forests need two parameters to be tuned by the user: (1) the number of trees T, (2) the number of variables m to be stochastically chosen from the available set of features. It is suggested (Breiman 2001; Micheletti et al. 2014) to pick a large number of trees and the square root of the dimensionality of the input space for m (Micheletti et al. 2014). Based on two parameters, the number of trees in RF has been fixed to 1000 after an introductory analysis and the number m of variables sampled at each node has been selected to be three to analyze the conjunct contribution of subsets of features while maintaining fast convergence during iterations. Moreover, two types of error were calculated: mean decrease in accuracy and mean decrease in node impurity (mean decrease Gini). These importance measures can be used for ranking variables and for variable selection (Calle and Urrea 2010).

Multivariate adaptive regression spline (MARS) model

The MARS models (implemented in this work using the “earth” package in R 3.0.2 (Milborrow 2012). Use a nonparametric modeling approach that does not require assumptions about the form of the relationship between the independent and dependent variables (Friedman 1991; Balashi et al. 2009). The MARS algorithm works by division the ranges of the explanatory variables into regions and by producing, for each of these regions, a linear regression equation. Breaks values between regions are called “knots”, while the term “basis function” (BF) is used to demonstrate each distinct interval of the predictors. BFs are functions of the following form (Eq. 4):

$${ \hbox{max} }\left( {0,x{-}k} \right){\text{ or max }}\left( {0,k{-}x} \right)$$
(4)

where x is an independent variable and k is a constant corresponding to a knot. The general formulation of MARS is:

$$\hat{y} = \hat{f}\left( x \right) = \beta + \mathop \sum \limits_{m = 1}^{M} \alpha_{m} H_{m} (x)$$
(5)

where, y is the dependent variable predicted by the function f(x), β is a constant, and M is the number of terms, each of them formed by a coefficient α m and H m (x) is an individual basis function or a product of two or more BFs (Conoscenti et al. 2014). The MARS models were developed in two steps. In the first step—the forward algorith—basis functions are presented to define Eq. 5. Many basis functions are added in Eq. 5 to get better performance. The developed MARS can experience overfitting due to large a number of basic functions. To mitigate this problem, the second step—the backward algorithm—prevents over fitting by removing redundant basis functions from Eq. 5. MARS adopts Generalized Cross-Validation (GCV) to delete the redundant basis functions (Craven and Wahba 1979; Samui and Kothari 2012). The expression of GCV is written as follows (Eq. 6):

$${\text{GCV}} = \frac{{\frac{1}{N}\mathop \sum \nolimits_{i = 1}^{N} \left[ {y_{i} - \hat{f}(x_{i} )} \right]^{2} }}{{\left[ {1 - \frac{C(H)}{N}} \right]^{2} }}$$
(6)

where N is the number of data and C(K) is a complexity penalty that increases with the number of basis function (BF) in the model and which is defined as (Eq. 7):

$$C\left( H \right) = \left( {{\text{H}} + 1} \right) + dH$$
(7)

where d is a penalty for each BF included into the model and H is number of basic functions in Eq. 5 (Friedman 1991; Samui and Kothari 2012).

Results

Application of certainty factor model

The results of spatial relationship between spring occurrence and explanatory variables using the CF technique are shown in Table 2. Based on Table 2, for altitude, for example, the 1575–1881 m class has the highest CF value (0.33). CF values generally increased with increasing altitude in the study area, and then spring occurrence probability decreases at altitudes above 2189 m. For slope aspect, most of the springs occurred in south and north-west facing slopes, while north east-facing slopes have the lowest abundance. In the study area, the drainage density, rivers, and faults are in the south and north-west facing parts of the study area, so these sites are considered as likely zones of groundwater recharge. The north and west regions have lower drainage density and fault values. These areas fall under low suitability for infiltration (Davoodi Moghaddam et al. 2013). For slope angles, the 5°–15° and 15°–30° classes have the highest CF values (0.57 and 0.20, respectively). Slope always plays a very important role in groundwater potential mapping and at the same time the slope increases, the runoff increases as well (Israil et al. 2006) leading to less infiltration (Jaiswal et al. 2003; Davoodi Moghaddam et al. 2013). For slope length index, the 3.32–8.39 m class has the highest CF value (0.12). The CF value for TWI clearly showed that class of 16.13–25.65 has the most effect on spring locations. The TWI factor illustrates the effect of topography on the location and size of saturated source areas of runoff generation under the steady-state assumption and uniform soil properties (i.e., transmissivity is constant throughout the catchments and equal to unity) (Pourghasemi et al. 2013; Davoodi Moghaddam et al. 2013). The relation between plan curvature and spring locations showed that concave class has the highest value of CF (0.22), and for profile curvature, the >0.001 class shows a high CF value (0.20). A concave slope contains more water and holds this water for a longer period especially during heavy rainfall (Lee and Pradhan 2006; Davoodi Moghaddam et al. 2013). In the case of land use, the highest CF value was for the forest land use type (0.76). When comparing the relationship between spring location and lithology, the CF values were positive for the classes of JKsi (Pale red argillaceous limestone, marl, Gypsiferous marl, sandstone and conglomerate) and Jl (Light grey, thin-bedded to massive limestone). In the case of distance to rivers, the 0 and 200 m class has a CF score of 0.21. The drainage density class <1.81 km/km2 has a CF value of 0.10. In general, we observed that as the drainage density increases, the spring frequency decreases. The drainage density depends on the slope, nature, and attitude of bedrock and the existing regional and local fracture templates. It is a reflection of the lithology and structure of a given area and can be of great value for groundwater resources evaluation (Godebo 2005; Davoodi Moghaddam et al. 2013). Assessment of distance to faults showed that the <2509 m class has high correlation with spring occurrence. Lineaments are linearly fractured zones in the geological structure of an area, such as faults and dykes, and they can control the exchange of water between surface and subsurface (Davoodi Moghaddam et al. 2013). Finally, for fault density, the 1.87–5.91 km/km2 class has a CF value of 0.47.

Table 2 Spatial relationship between each spring explanatory variable and springs by certainty factor (CF) model

Application of random forest model

The results of spatial relationship between springs and explanatory variables using the RF model are shown in Tables 3, 4 and Figs. 8, 9. The aggregate OOB predictions are presented in Fig. 8 and Table 3 (confusion matrix). The OOB results indicate a prediction error rate of about 30.11 %. In other words, the model can be considered 69.89 % accurate. Overall measure of accuracy is then followed by a confusion matrix that records the conflict between that final model’s predictions and the present outcomes of the training observations. The present observations are the rows (Table 3), whilst the columns correspond to the model predictions collocated with the observations: the number reflect the counts in each box (Williams 2011). The model incorrectly predicted springs where they were actually absent in 56 cases (type I error) and the absence of springs when they were actually present in 50 cases (Type II error). The model correctly predicted the absence of springs for 120 cases and the presence of springs for 126 cases. Results from variable selection for the RF model are presnted in Fig. 9. This shows the 13 variables ordered by two specific importance measures (mean decrease accuracy and mean decrease Gini). Based on Fig. 9 and Table 4, the higher values indicate that the variable is relatively more important (Williams 2011). The accuracy measure (mean decrease) lists distance to rivers (32.75), altitude (22.29), and land use (18.49) as the most important. The distance to rivers (14.78), altitude (14.13) and distance to faults (10.89) have the highest importance according to the Gini measure. Aside from the first two most important measures, the rankings are different according to the Gini measure relative to the Mean Decrease Accuracy measure.

Table 3 Confusion matrix for the RF model (0 = spring absent, 1 = spring present)
Table 4 Relative influence of explanatory variables in the RF model (0 = spring absent, 1 = spring present)
Fig. 8
figure 8

The error rate of the overall RF model (OOB out-of-bag (black line), 0 spring absent (red dash line), and 1 spring present (green dash line)

Fig. 9
figure 9

The error rate of the overall RF model (OOB out-of-bag, 0 spring absent and 1 spring present)

A full Spring Potential Map for the area was created using the RF model in ArcGIS 9.3 and categorize based on a natural break classification plan (Ozdemir 2011; Pourghasemi et al. 2013; Zare et al. 2013; Pourtaghi and Pourghasemi 2014) into low, moderate, high and very high potential categories. These results were represented in Fig. 10 and Table 6.

Fig. 10
figure 10

Spring potential map produced by the random forest (RF) model

Application of multivariate adaptive regression spline model

The optimal MARS model presents 20 terms and includes 20 BF (the term created during the forward pass were 96), with a GCV of 0.18. Only ten of the 13 independent variables were used in the optimal model (Table 5), because MARS only uses the necessary independent variables (Gutiérrez et al. 2009). In Table 5, nsubset is an index vector specifying which cases to use, i.e., which rows in x to use (default is NULL, meaning all), gcv is generalized cross validation (GCV) of the model (aggregated over all responses) (the GCV is calculated using the penalty argument) and rss is residual sum-of-squares (RSS) of the model. So, based on Table 5, the most important variable is distance to rivers. Other important variables to explain the spatial distribution of springs in the study area are altitude, land use, slope aspect, distance to faults and TWI. In this kind of model the importance of the independent variables should be interpreted with caution (Donati and Turrini 2002; Gutiérrez et al. 2009). The groundwater spring potential map produced by the MARS model was created according to Eq. (8), and is presented in Fig. 11, and Table 6.

Table 5 The distribution of the explanatory variables and areas with respect to the spring occurrence potential zones
Fig. 11
figure 11

Spring potential map produced by the multivariate adaptive regression spline (MARS) model

Table 6 The relative distribution of spring occurrence categories obtained by the two modeling approaches
$$\begin{aligned} {\mathbf{Spm}}_{{{\mathbf{MARS}}}} &= [\left( { + 1.0 3 9 5 4 8} \right) - 0. 2 6 8\, {*}\,{ \hbox{max} }\left( {0,{\text{ Landuse-5}}} \right) \, + \, 0. 1 2 9 { }*{ \hbox{max} }\left( {0,{\text{ FaultDens-6}}.0 9 8 4 4 3} \right) \\ & \quad - 0.0000 7 { }*{ \hbox{max} }\left( {0,{ 14689}. 2 5 {\text{-FaultDis}}} \right) - 0.0 2 9 { }*{ \hbox{max} }\left( {0,{\text{ 5-Landuse}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ 6-Aspect}}} \right) \\ &\quad - 0.0 4 7 { }*{ \hbox{max} }\left( {0,{\text{ 5-Landuse}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ FaultDens-2}}. 2 6 6 5 4 3} \right) \\ &\quad + 0.00000 9 { }*{ \hbox{max} }\left( {0,{\text{ Landuse-2}}} \right) \, *{ \hbox{max} }(0,{ 14689}. 2 5 {\text{ -FaultDis}}) \\ &\quad - 0.0000 9 { }*{ \hbox{max} }\left( {0,{\text{ 5-Landuse}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ FaultDis-1}}0 6 4 2. 1 8} \right) \\ &\quad \, + \, 0.00 4 { }*{ \hbox{max} }\left( {0,{\text{ Aspect-6}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ RiverDis-323}}. 10 9 9} \right) \\ &\quad - 0.0 1 5 { }*{ \hbox{max} }\left( {0,{\text{ Aspect-6}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ Slope-14}}. 2 2 20 5} \right) \\ &\quad + 0.00 1 { }*{ \hbox{max} }\left( {0,{\text{ 1754-Dem}}} \right) \, *{ \hbox{max} }(0,{ 2}.0 9 6 5 8 {\text{-RiverDens}}) \\ &\quad - 0.000 5 { }*{ \hbox{max} }\left( {0,{\text{ 1754-Dem}}} \right) \, *{ \hbox{max} }\left( {0,{ 3}. 10 9 6 3 3 {\text{-Slope}}} \right) \\ &\quad - 0.0000 9 { }*{ \hbox{max} }\left( {0,{\text{ 1754-Dem}}} \right) \, *{ \hbox{max} }\left( {0,{ 17}. 7 1 7 5 {\text{-TWI}}} \right) \\ &\quad + 0.0000 5 { }*{ \hbox{max} }\left( {0,{ 6}.0 9 8 4 4 3 {\text{-FaultDens}}} \right) \, *{ \hbox{max} }(0,{\text{FaultDis-13788}}. 5 8) \\ &\quad - 0.000000 3 { }*{ \hbox{max} }\left( {0,{ 14689}. 2 5 {\text{-FaultDis}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ RiverDis-161}}. 5 5 4 9} \right) \\ &\quad + 0.0000 1 { }*{ \hbox{max} }\left( {0,{ 8813}. 5 1 8 {\text{-FaultDis}}} \right) \, *{ \hbox{max} }\left( {0,{ 13}. 5 4 9 1 7 {\text{-LS}}} \right) \\ &\quad + 0.000000 4 { }*{ \hbox{max} }\left( {0,{ 14689}. 2 5 {\text{-FaultDis}}} \right) \, *{ \hbox{max} }\left( {0,{\text{ LS-3}}. 20 8 2} \right) \\ &\quad - 0.0 1 3 { }*{ \hbox{max} }(0,{\text{RiverDens-2}}. 1 2 3 9 4 7) \, *{ \hbox{max} }\left( {0,{ 13}. 5 4 9 1 7 {\text{-LS}}} \right) \\ &\quad-0.0 4 7 { }*{ \hbox{max} }\left( {0,{ 2}. 1 2 3 9 4 7 {\text{-RiverDens}}} \right) \, *{ \hbox{max} }\left( {0,{ 13}. 5 4 9 1 7 {\text{-LS}}} \right) \\ &\quad + 0.00 2 { }*{ \hbox{max} }\left( {0,{ 6}0{\text{-RiverDis}}} \right) \, *{ \hbox{max} }\left( {0,{ 13}. 2 8 7 5 {\text{ TWI}}} \right). \\ \end{aligned}$$
(8)

Validation of groundwater spring potential models

A key step in statistical modeling is an assessment of its quality is validation (Chung-Jo and Fabbri 2003). In this study, the spring potential model quality was validated using an independent dataset that was not used for constructing and building of model. From the 234 springs identified, 176 (70 %) locations were employed to produce spring potential maps, while the remaining 58 (30 %) cases were withheld for model validation. To determine the accuracy of applied models (MARS and RF), two verification methods—success rate and prediction rate curves—were used by comparing the existing spring locations with the two spring potential maps (Figs. 12, 13).

Fig. 12
figure 12

Receiver operating characteristic (ROC) curve for the spring potential maps produced by a RF and b MARS model

Fig. 13
figure 13

Prediction Receiver operating characteristic (ROC) curve for the spring potential maps produced by a RF and b MARS model

One method to represent the quality of deterministic and probabilistic models is the receiver operating characteristic (ROC) curve (Swets 1988). The area under the ROC curve (AUC) shows the forecast model quality by describing the model’s capability to forecast correctly the occurrence or non‐occurrence of pre‐defined “events” (Negnevitsky 2002). The ROC curve draws the false positive rate on the X axis and the true positive rate on the Y axis and evaluates the trade‐off between the two rates (Negnevitsky 2002). To obtain values for each prediction pattern, the calculated index values of all cells in the study area were sorted in descending order (Pradhan et al. 2010a, b). If the area under the ROC curve (AUC) is close to 1.0, the result of the test would be excellent. On the contrary, AUC of 0.5 indicates performance equivalent to random chance. Using the spring potential map grid cells in the training dataset, the success-rate results were calculated. The success rate curves were gained using the 70 % training dataset (176 spring locations). Figure 12a, b illustrates the ROCs for the two spring potential maps in this study. The FR and MARS models have nearly the same area under the curve (AUC) values (0. 79). The success rate alone is not a suitable technique for judging the models prediction power (Tien Bui et al. 2012), however, because the success rate technique utilize the training spring pixels that have already been employed for constructing the spring models. However, the prediction rate method may help to understand how well the resulting spring potential maps have classified the areas of existing spring (Tien Bui et al. 2012). The prediction rate describes how well the model and predictor variables predict the spring (Lee and Pradhan 2007; Tien Bui et al. 2012; Pourghasemi et al. 2012). The results of the ROC curve test or prediction rate are shown in Fig. 13a, b. These curves show that the MARS model has relatively higher prediction performance (AUC = 0.7326) than the RF model (AUC = 0.7098).

Discussion and conclusion

Groundwater occurrence and movement are most basically controlled by the aquifer’s permeability and the lithology of the underlying strata (Shahid et al. 2000; Ozdemir 2011). Especially in a fractured bedrock aquifer, movement of groundwater is governed by many other factors including topography, lithology, geological structures, fractures (density, aperture and connectivity), secondary porosity, groundwater recharge, drainage pattern, land-forms, land cover, and climatic conditions (Oh et al. 2011). Assessment of spring occurrence potential has become a valuable subject for water resource management authorities, and for regional land-use planning and environmental preservation. In the past, various methods have been applied to this task. In this study, groundwater potential maps were identified using MARS and RF models, predicting spring occurrence based on mappable explanatory variables. At first, using compiled information of Iranian Department Water Resources Management and extensive field investigations, a spring inventory map was prepared. Then, 13 data layers (altitude, slope aspect, slope degree, slope length, TWI, plan curvature, profile curvature, land use, lithology, distance to rivers, drainage density, distance to faults and fault density) were derived from the spatial database for use as explanatory variables. Using these explanatory variables, groundwater spring potential maps were produced using statistical modeling techniques random forest (RF) and multivariate adaptive regression spline (MARS). Carranza and Hale (2002) noted that expert knowledge is required to divide the dataset into training and validation data. For this reason, of 234 observed spring locations, 176 (70 %) cases were used as training data and the remaining 58 (30 %) was used for validation. AUC curves were prepared for the two models to test their accuracy. The validation results indicated that the MARS model has rather better predication accuracy (73.26 %) than the RF (70.98 %) model. The RF technique has several the advantages that growing large numbers of trees does not overfit the data, and random predictor selection keeps bias low, providing better models for prediction (Prasad et al. 2006). However, the RF model is prone to over fitting for some very noisy datasets and it do perform well when a majority of input variables are irrelevant (Breiman 2001). The MARS technique has advantages over traditional regression-based analyses. MARS picks only the most important explanatory variables from a user specified order. That is, the user may choose to include multiple variables at the beginning of the analysis and MARS will select out only the most important ones to include in the final result. This pruning process omits variables that have limited efficacy in the prediction of the outcome measure (Kennison and Cox 2013).

These techniques can be used in other areas, but they must be tuned to regions with similar characteristics to reflect the diversity of settings in which spring occur. However, other statistical modeling techniques may be suitable and more comparison would help guide selection of the best technique for a given. As a final conclusion, groundwater spring potential maps can be useful for planners and engineers in water-resource management and land-use planning. These spring potential maps can be applied to groundwater management and groundwater resource exploration, and the ability to create the maps using statistical modeling techniques shows great promise in wider application of spring potential mapping.