Introduction

The Earth system has been impacted by mankind abuse of resources, lack of management of land uses and planning, and climate changes that imposed changes to the natural environment such as the water, soil, and carbon or nitrogen cycles, which is resulting in land degradation (Keesstra et al. 2018). Thus, the United Nation has defined the Sustainable Development Goals (SDGs) (Griggs et al. 2013). Many of the defined goals have a strong relevance to water and soil management. Soil is one of the most important components of hydrological, biological, and geological cycles and, therefore, is in a state of permanent change and evolution (Kirchhoff et al. 2017). European Commission defined seven functions for soil including: (1) biomass production including forestry and agriculture; (2) storing, filtering, and transforming nutrients, substances, and water; (3) biodiversity pool such as habitats, species, and genes; (4) physical and cultural environment for humans and human activities; (5) source of raw material; (6) acting as carbon pool; and (7) archive of geological and archaeological heritage (EC, 2006). In the twenty-first century, soil protection depends not only on political decisions in the rules and regulations, but also on decisions and actions of land planners, foresters, and farmers (Cerdà et al. 2017a, b). Soil erosion is a universally threat to environment that leads to a various range of problems, such as land degradation in agriculture lands, soil fertility reduction, and sediment deposition in reservoirs, that must be solved by means of nature-based strategies (Cerdà et al. 2016; Keesstra et al. 2016; Mekonnen et al. 2017). High erosion rates in underdeveloped countries are mainly due to lack of vegetation cover, excessive grazing, deforestation, and intense plowing (Ligonja and Shrestha 2015), while in the developing countries is due to the heavy machinery (Cerdà et al. 2009). Intense erosion rates result in the loss of fertile soil and also change on the biological, hydrological, and geochemical cycles (Cerdà et al. 2015). Gully erosion is one of the most devastating forms of soil erosion that causes different types of damage to natural resources, agriculture, and infrastructures (Rahmati et al. 2016). A gully is defined as a profound channel created by concentrated flow of water, removing surface soil, and parent material, that is really big to wipe out by normal tillage operations (USDA-SCS 1966). When the geomorphologic threshold exceeds due to increase in flow water erosivity or sediment erodibility, gully erosion is occurred. In general, prediction of gully erosion is difficult and requires monitoring and mapping (Rahmati et al. 2017). Preparation of gully erosion susceptibility map (GESM) is necessary for better realization of the gully erosion mechanism and identifying areas that are prone to erosion. For this target, gully inventory map (GIM) and GEFs are required. Meanwhile, the results of a GESM strongly depend on the model hypothesis, parameter values, and parameter estimation methods. From the point of view of reducing the occurrence of gully erosion and sustainable development, there are many models for establishing a statistical relationship among GEFs and GIM and subsequently identifying sensitive areas to gully erosion. Among these are the following data-driven and knowledge-based models: logistic regression (Conoscenti et al. 2014; Kornejady et al. 2015), weights-of-evidence (Dube et al. 2014; Rahmati et al. 2016), frequency ratio (Rahmati et al. 2016), conditional probability (Rahmati et al. 2017); AHP (Svoray et al. 2012), maximum entropy (Pourghasemi et al. 2017; Kornejady et al. 2017), and information value (Conforti et al. 2011). In addition, computational intelligence methods including classification and regression trees (Märker et al. 2011), multivariate adaptive regression splines (Gómez-Gutiérrez et al. 2015), random forest (Kuhnert et al. 2010), artificial neural network (Pourghasemi et al. 2017), and support vector machine (Pourghasemi et al. 2017) were applied for preparing GESM in different countries.

In the study area, due to dry climatic conditions, extreme rainfall, favorable erosion geology, overgrazing, and vegetation destruction, gully erosion is the most important cause of soil degradation and soil loss. Therefore, gully erosion each year imposes damages on the inhabitants and agricultural lands. Despite the fact that gully erosion is one of the environmental problems in the Toroud Watershed, so far, no studies have been carried out in this area to assess and prepare a gully erosion susceptibility map for it. The main purpose of this research is identification of prone areas to gully erosion and gully erosion susceptibility mapping using data-driven and knowledge-based techniques in Toroud Watershed, Iran.

Materials and methods

Study area

The Toroud Watershed with area of 416 km2 is situated in Semnan Province, Iran. This area is located between 35°25′36″ and 35°37′9″ north latitudes, and 54°49′55″ and 55°11′30″ east longitudes (Fig. 1). According to the climatic classification in Iran (IDWRM 2013), the study area is classified as arid zone (mean annual rainfall of 85 mm). The absolute elevation in the study area ranges from about 759 to 1923 m, with a mean elevation of 1037 m a.s.l. It receives approximately 80% of its annual rainfall from November to April (WRCS 2015). In winter, the temperature ranges from − 5 to 12.7 °C. The main LC in the study area is irrigation agriculture, salt pan, and range lands. Most gullies in the study area are located on the southern parts due to erosion-sensitive formations and human activities such as agriculture, overgrazing, and development of roads. Eight photographs of the gullies identified in the study area are shown in Fig. 2.

Fig. 1
figure 1

Study area

Fig. 2
figure 2

Field photographs of some gullies identified in the study area

Methodology

The methodological work flow used in the current study is presented in Fig. 3. Therefore, the steps of this research are as follows: (1) preparation of GIM; (2) selection of GEFs using VIF and tolerance indices; (3) preparation of gully erosion susceptibility maps using data-driven techniques (FR, IOE, and WofE), and their comparison to knowledge-based model, namely, AHP; (4) validation of models using ROC and SCAI indicators; and (5) selection of the best model in the study area.

Fig. 3
figure 3

Methodological work flow used in this study

Data preparation

Gully inventory map (GIM)

The detailed and accurate GIM was prepared using extensive field surveys with a GPS (Global Positioning System) device and Google Earth imagery digitization. Identified gullies in the study area are either V-shaped or U-shaped. In total, 80 gullies were identified summing an area of about 49.23 ha and include 53 V-shaped and 27 U-shaped gullies. Maximum and minimum depth and width of the identified gullies in the study area are 13.5 and 1.5 and 21 and 2.10, respectively. Among all the 80 identified gully locations, 70% (56 gully locations) were randomly selected for modeling, while the remaining 30% (24 gully location) were employed for validation of models. In the next step, gullies in the polygon format were converted to point format and were used for modeling and validation aims. The locations of training and validating gullies are shown in Fig. 1.

Gullies in the study area are mainly created in low plains and slopes with high drainage density. The most important reason for occurring piping and gully in the study area is severe and flood rainfall, the existence of gypsum and salt minerals due to high evaporation, and the lack of vegetation and organic matter. Therefore, it can be admitted that superficial flows and the phenomenon of piping are the dominant processes of gully formation in the study area.

Gully erosion-conditioning factors

According to literature review (Conforti et al. 2011; Conoscenti et al. 2013; Zakerinejad and Maerker 2015; Rahmati et al. 2016, 2017; Pourghasemi et al. 2017; Zabihi et al. 2018), data availability, and field surveys, 18 effective factors were identified for considering gully erosion in the study area. After multi-collinearity analysis, 13 GEFs including elevation, slope degree, slope aspect, plan curvature, distance from river, drainage density, distance from road, lithology, LC, TWI, SPI, NDVI, and slope–length (LS) selected for modeling process (Fig. 4a–m).

Fig. 4
figure 4figure 4

Gully geo-environmental factors (GEFs). a Elevation, b slope degree, c slope aspect, d plan curvature, e distance from river, f drainage density, g distance from road, h lithology, i land cover, j topography wetness index (TWI), k stream power index (SPI), l normalized difference vegetation Index (NDVI), and m slope–length (LS)

Topographic factors are one of the most important geomorphological parameters that influence soil erosion (Achour et al. 2017; Rahmati et al. 2017). Topographic features such as elevation affect vegetation type and rainfall characteristics and, thus, can control the gully erosion process (Gómez-Gutiérrez et al. 2015).

In this research, to provide a digital elevation model, InSAR (Interferometric Synthetic Aperture Radar) technique and ALOS PALSAR data were used. For preparing DEM using InSAR, there are two phases including measurement phase and the conversion of measurement phase to height (Zhou et al. 2005). InSAR data-processing procedure is shown in Fig. 5.

Fig. 5
figure 5

InSAR data-processing procedure for DEM production

Elevation obtained from the ALOS PALSAR Digital Elevation Model with a spatial resolution of 12.5 m × 12.5 m and was divided into five categories (Rahmati et al. 2016): <916 m, 916–1065 m, 1065–1229 m, 1229–1473 m, and > 1473 m.

Slope factor is appropriate for the accumulation of flow surface and gully erosion (Rahmati et al. 2016). The slope angle map of the study area was obtained from the ALOS PALSAR-DEM and was divided into six categories (Conforti et al. 2011): <5°, 5°–10°, 10°–15°, 15°–20°, 20°–30°, and > 30°.

Slope aspect has an important effect on vegetation type, so the control duration of sunlight, moisture, evaporation and transpiration, and the distribution of vegetation on the slopes, and indirectly affect the erosion process (Jaafari et al. 2014). The slope aspect map was also prepared from ALOS PALSAR-DEM under nine directional classes (Rahmati et al. 2016): flat (− 1°), north (337.5°–360°, 0°–22.5°), northeast (22.5°–67.5°), east (67.5°–112.5°), southeast (112.5°–157.5°), south (157.5°–202.5°), southwest (202.5°–247.5°), west (247.5°–292.5°), and northwest (292.5°–337.5°).

Plan curvature is considered as the earth surface geometry and characterizes slope changes (Nefeslioglu et al. 2008). Plan curvature is effect on convergence or divergence of flow water during downhill flow (Yilmaz et al. 2012). Plan curvature was extracted from the ALOS PALSAR-DEM using ArcGIS 10.5 and classified into three categories including concave, flat, and convex.

In many cases, gullies are associated with stream network (Conoscenti et al. 2014). Using the quantile classification (Jiuchun et al. 2017), five different buffer zones are created within the study area to determine the effect of streams on gullies including: 0–100 m, 100–200 m, 200–300 m, 300–400 m, and > 400 m.

The drainage pattern of an area is affected by several factors such as the nature and structure of geological formations, soil characteristics, vegetation conditions, penetration rates, and degrees of gravity (Manap et al. 2014). High drainage density increases surface runoff rate and thus increases gully erosion. Drainage density map generated by Line Density Tool in ArcGIS 10.5 and its values classified into four categories (Conoscenti et al. 2014): 0.6–1.7, 1.7–2.1, 2.1–2.4, and > 2.4 km/km2.

According to literature review, the distance from road is important for the occurrence of gully erosion (Conoscenti et al. 2014; Pourghasemi et al. 2017). Roads will enhance the gully erosion process using interrupt and concentrate overland flow and draining it on downstream slopes (Conoscenti et al. 2014). The distance from road map was divided into five different buffer zones (Fig. 2h) using quantile classification (Conoscenti et al. 2014) in ArcGIS 10.5: 0–300 m, 300–600 m, 600–900 m, 900–1200 m, and > 1200 m.

The lithology layer was generated by digitizing the geological map prepared by GSDI (1997) (Geological Survey Department of Iran, Toroud Sheet at 1:100,000-scale). The lithology map according to susceptibility to soil erosion was then classified into 12 classes (Table 1).

Table 1 Lithology of the study area (GSI 1997)

Land cover (LC) management has a great influence on the geomorphological stability of the slope and the occurrence of gully erosion (Zakerinejad and Maerker 2015). In general, bare areas and areas with scattered vegetation are more susceptible to erosion compared to forest areas, because vegetation cover significantly reduces erosive power of surface flow (Hayas et al. 2017). The LC map of the study area was generated using Landsat 8 images from 31/10/2017 covering path and row 162 and 36, respectively. To create the LC map, a supervised classification using the maximum likelihood algorithm was applied. Four LC types were extracted such as irrigation agriculture, desert, range lands, and bare rock outcrops. The produced LC was validated using 265 ground control point (GCP) in the field. Kappa coefficient for the final map was estimated by Eq. 1 (Lo and Yeung 2002):

$$K={\raise0.7ex\hbox{${\left\{ {N\mathop \sum \nolimits_{{i=1}}^{r} \left( {{X_{ii}}} \right) - N\mathop \sum \nolimits_{{i=1}}^{r} \left( {{X_{i+}} \cdot {X_{+i}}} \right)} \right\}}$} \!\mathord{\left/ {\vphantom {{\left\{ {N\mathop \sum \nolimits_{{i=1}}^{r} \left( {{X_{ii}}} \right) - N\mathop \sum \nolimits_{{i=1}}^{r} \left( {{X_{i+}} \cdot {X_{+i}}} \right)} \right\}} {{N^2}}}}\right.\kern-0pt}\!\lower0.7ex\hbox{${{N^2}}$}} - \mathop \sum \nolimits_{{i=1}}^{r} \left( {{X_{i+}} \cdot {X_{+i}}} \right),$$
(1)

where r is number of rows in error matrix; Xii is number of observations in row i and column i; Xi + is total of observations in row i; X+i is total of observations in column i; and N is total number of observations included in the matrix. Kappa coefficient of the generated map was obtained 97.65%.

The TWI has been used widely to describe the effect of topography on the location and size of saturated areas of flow surface generation (Moore et al. 1991). TWI estimate the probability of water accumulation in soil due to slope and upstream catchment area, therefore, is an important factor for assessing prone areas to gully erosion (Gómez-Gutiérrez et al. 2015). The runoff erosion power, discharge potential, and carrying capacity are modeled using the mentioned factor (Tahmassebipoor et al. 2016). TWI computed using Eq. 2 and classified into four categories (Conforti et al. 2011): < 6, 6–10, 10–13, and > 13.

The SPI is a measure of the erosive power of water flow based on the assumption that discharge is commensurate to particular catchment area (Conforti et al. 2011). Stream power index is one of the most important factors controlling slope erosion processes. Regions with high stream power have high erosion potential (Gómez-Gutiérrez et al. 2015). SPI calculated using Eq. 3 and classified into five categories using natural break scheme (Conforti et al. 2011): < 300, 300–900, 900–1200, 1200–1500, and > 1500:

$${\text{TWI}}=In~\left( {{A_S}/\tan \beta } \right)$$
(2)
$${\text{SPI}}={A_{S~}} \times {\text{ta}}{{\text{n}}_\beta },$$
(3)

where AS is the specific catchment’s area (m2/m), and \(\beta\) is slope gradient (in degrees).

The NDVI is a measure of surface reflectance and gives a quantitative estimation of the vegetation growth and was calculated based on Landsat 8 images, which reflects the relation between the vegetation condition and gullies (Xie et al. 2017). The NDVI value is calculated by the following equation:

$${\text{NDVI}}= {{\text{IR}} - R} /{{\text{IR}}+R},$$
(4)

where IR is the infrared portion of the electromagnetic spectrum and R is the red portion of the electromagnetic spectrum. In general, in areas with intensive vegetation, the likelihood of gully erosion is low to very low. The NDVI map of the study area classified to three classes: <− 0.039, − 0.039 to 0.13, and > 0.13 (Rahmati et al. 2016).

The length–slope parameter (LS) is a factor used in the RUSLE equation to consider the effect of topography on erosion (Renard et al. 1997). The topographical factor depends on the slope–length (L) and the slope steepness (S). This parameter is considered as a sediment transport capacity index and computed by Eq. 5 (Moore and Burch 1986):

$${\text{LS}}={\left( {{f_{\text{a}}} \times {\text{cellsize}}/22.13} \right)^{0.4}} \times {\left( {\sin \sigma /0.0896} \right)^{1.3}},$$
(5)

where fa is flow accumulation and \(\sigma ~\) is slope in degrees. The LS map was prepared by Raster Calculator Tool in the ArcGIS10.5 and classified to five categories (Conforti et al. 2011): < 36 m, 36–70 m, 70–110 m, 110–140 m, and > 140 m.

Analytic hierarchy process (AHP) knowledge-based technique

Because decision making according to multiple criteria is a challenging issue, the application of Multi-criteria Decision-Making Models (MCDM) was offered in different cases (Yacov and Haimes 2011). Decision-making processes are often complicated according to multiple inconsistent criteria; nevertheless, the MCDM methods have been successfully engaged to identify favorite policy alternatives (Kim and Chung 2013). The application of MCDM methods in various fields of natural hazards has been illustrated in a number of international literatures (Chitsaz and Banihabib 2015; Chen et al. 2016; Wen et al. 2017). AHP is a multi-criteria, multi-objective, and semi-quantitative method that were introduced by Saaty (1980). In this method, decision-making weights are assigned based on expert knowledge and their experience in a form of pairwise relative comparison (Bathrellos et al. 2017; Papadakis and Karimalis 2017). Over the past two decades, the practical nature of AHP has led to the use of it on the large and complex decision-making problems (Bathrellos et al. 2012, 2016). In general, AHP consists of the following steps:

Step 1: structuring of the decision problem into a hierarchical model. A simple AHP model has three levels such as goal, criteria, and alternatives.

Step 2: making pairwise comparisons and obtaining the judgmental matrix. Bogdanovic et al. (2012) stated that a pair comparison has to start with a question on behalf decision makers. For example, depending on the purpose of the decision, the importance of the criteria to each other must assign with scales 1–9. Vidal et al. (2010) also stated that the mean numbers 2, 4, 6, and 8 have to use to correct the comparison (Table 2).

Table 2 Scale of preference between two factors in AHP (Saaty 1980)

Step 3: calculation of the weight of each criterion and alternative (local priority). Local weights of criteria and alternatives were obtained according to Eqs. 6 and 7 (Macharis et al. 2004):

$${a_{ij}}={a_j}/\mathop \sum \limits_{{i=1}}^{n} {a_j}\quad aj,\;j=1,2,3 \ldots ,n{\text{~}}$$
(6)
$${w_{ij}}=\mathop \sum \limits_{{i=1}}^{n} {a_{ij}}/n,\quad \;{a_{ij}}=1,2, \ldots ,n{\text{~,}}$$
(7)

where \(aj\) is the values of each inline of column, \({a_{ij}}~~\) is the normalized values of each of the in lines, and n is number of criteria.

Step 4: computation of the overall priority (final weight) by the following equation:

$${\text{AH}}{{\text{P}}_{{\text{Score}}}}=\mathop \sum \limits_{{j=1}}^{n} {w_{ij}} \times {W_j},$$
(8)

where wij is local priority for ith alternatives in jth criteria and Wj is local priority for jth criteria.

Step 5: computation of the inconsistency index by the following equation:

$${\text{I.I}}=\frac{{{\lambda _{{\text{max}}}} - n}}{{n - 1}},$$
(9)

where λmax is maximal eigen value and n is number of alternatives.

Step 6: calculation of the inconsistency ratio by the following equation:

$${\text{IR}\,{=}\,}\frac{{{\text{II}}}}{{{\text{RII}}}},$$
(10)

where RII is called random index and is obtained from Table 3. A value of IR less than 0.1 is considered acceptable (Saaty 1980), because human judgments are not always indistinct and there may be because of the nature of scale used, would have existed inconsistencies. The ability to identify inconsistent judgments through the calculation of consistency ratio is considered as one of the strong points of AHP (Table 3).

Table 3 Random index (Saaty and Vargas 2001)
Table 4 Multi-collinearity Analysis among gully erosion-conditioning factors

Data-driven models

Frequency ratio (FR)

Frequency ratio is a bivariate statistical method and a simple geospatial assessment tool (Wang and Li 2017) for computing the probabilistic relationship between dependent (gully inventory map) and independent variables (geo-environmental factors) and is very useful and efficient (Rahmati et al. 2016). The FR can be described as Eq. 11 (Oh et al. 2017):

$${\text{FR}}={\text{ }}\left( {\left( {A/B} \right)/\left( {C/D} \right)} \right),$$
(11)

where A is the number of pixels with gully erosion for each class of geo-environmental factors, B is the number of total gullies in study area, C is the number of pixels in each class of the geo-environmental factors, and D is the number of total pixels in the study area.

Index of entropy (IOE)

Theory of entropy expresses the extent of the disorder, instability, uncertainty, and imbalance of a system (Al-Abadi 2017). This theory was introduced by Boltzmann and quantitatively presented by Shannon (1948) (Pourghasemi et al. 2012). Actually, theory of entropy expresses a way to approximate main parameters among efficient parameters of a goal; in the other words, this theory characterizes variables that are more effective in phenomenon occurrence. Details of the mentioned theory and its equations are given in Pourghasemi et al. (2012), Jaafari et al. (2014), Al-Abadi (2017), and Hong et al. (2017).

After computation of the final weight of each factor and multiplying it in categories of that parameter and values related to each parameter, weighted maps were added up and the final map of gully erosion susceptibility was obtained using Eq. 12 (Haghizadeh et al. 2017):

$${\text{GESM}}=\mathop \sum \limits_{{I=1}}^{n} \left( {{W_J} \times {P_j}} \right),$$
(12)

where Wj and Pj are the final weight and the probability density for the jth feature.

Weights-of-evidence (WofE)

The WofE is one of the bivariate and statistical approaches that employed the log-linear form of the Bayesian probability method to estimate the relative importance of effective factors by statistical means (Xie et al. 2017). The WofE model calculates the relationship of between gully erosion-conditioning factors and gully occurrence according to the presence or the absence of the gullies in the study area (Razavizadeh et al. 2017) as follows:

$$W_{i}^{+}={\text{lo}}{{\text{g}}_{e~}}\left[ {\left( {{N_{{\text{pix1}}}}/\left( {{N_{{\text{pix1~}}}}+~{N_{{\text{pix2}}}}} \right)} \right)/\left( {{N_{{\text{pix3}}}}/\left( {{N_{{\text{pix3~}}}}+~{N_{{\text{pix4}}}}} \right)} \right)} \right]$$
(13)
$$W_{i}^{ - }=lo{g_{e~}}\left[ {\left( {{N_{pix2}}/\left( {{N_{pix1~}}+~{N_{pix2}}} \right)} \right)/\left( {{N_{pix4}}/\left( {{N_{pix3~}}+~{N_{pix4}}} \right)} \right)} \right],$$
(14)

where \({N_{{\text{pix1}}}}\) is number of gully erosion pixels in a specific class, Npix2 is (total number of gully erosion pixels in a map) − (number of gully erosion pixels in a class), Npix3 is (number of pixels in specific class) − (number of gully erosion pixels in a specific class), and Npix4 is (total number of pixels in a map) − (total number of gully erosion pixels in map) − (number of pixels in specific class). A positive weight \(W_{i}^{+}\) indicates that the gully erosion-conditioning factor exists at the locations of gully erosion and there is a positive correlation between the presence of the gully erosion-conditioning factor and the gully locations and vice versa. Final weight in the weights-of-evidence model obtained by Eq. 15 (Rahmati et al. 2016; Razavizadeh et al. 2017):

$$W=\left( {\frac{C}{{S(C)}}} \right)$$
(15)
$$C={W^+}+{W^ - }$$
(16)
$$S\left( C \right)=\sqrt {{S^2}} \left( {{W^+}} \right)+{\text{~}}{S^2}\left( {{W^ - }} \right)$$
(17)
$${S^2}\left( {{W^+}} \right)=\frac{1}{{N\left\{ {B \cap L} \right\}}}+\frac{1}{{\left\{ {B \cap L} \right\}}}$$
(18)
$${S^2}\left( {{W^ - }} \right)=\frac{1}{{\left\{ {\bar {B} \cap L} \right\}}}+\frac{1}{{\left\{ {\bar {B} \cap \bar {L}} \right\}}},$$
(19)

where W is final weight, and C is difference between positive and negative weights. C is negative for a negative correlation between GEFs and GESM and positive for a positive relationship (Pourghasemi et al. 2013). S (C) is the standard deviation of the contrast. \({S^2}\left( {{W^+}} \right)\) is the variance of \({W^+}\) and \({S^2}\left( {{W^ - }} \right)\) is the variance of \({W^ - }\). B and \(\bar {B}\) indicate the presence and absence of the GEFs, respectively. L is the presence of gully, and \(\bar {L}\) is the absence of a gully. After calculation the weight of each factor by WofE model, gully erosion susceptibility map (GESM) obtained by the following equation:

$${\text{GESM}}=\mathop \sum \limits_{{R=1}}^{{R=N}} W.$$
(20)

Results

Multi-collinearity analysis

In general, for GESM, considering of multi-collinearity of conditioning factors (GEFs) is essential. Multi-collinearity indicates a linear correlation that exists among factors. In this research, for checking multi-collinearity, the tolerance (TOL) and variance inflation factor (VIF) indices are used when values of TOL and VIF are ≤ 0.1 and ≥ 5 or 10 indicate multi-collinearity among independent variables, respectively (Park et al. 2017). The results of the multi-collinearity analysis among 18 gully erosion-conditioning factors used in this study are presented in Table 4. This analysis showed that the TOL and VIF of 13 variables including elevation, slope degree, slope aspect, plan curvature, distance from river, drainage density, distance from road, lithology, LC, TWI, SPI, NDVI, and slope–length (LS) were ≥ 0.1 and ≤ 10, respectively. As a result, these parameters are used for preparing the final GESMs by four models.

Applying AHP model

The relative weight of gully erosion-conditioning factors obtained from pairwise comparison matrix is shown in Table 5. Based on Table 5, it can be seen that the lithology, slope, and NDVI with a values of 0.222, 0.176, and 0.137 are the most important factors, respectively. This result is in line with Golestani et al. (2014), Gomez-Gutierrez et al. (2009), and Rahmati et al. (2016). Gómez-Gutiérrez et al. (2015) stated that vegetation cover strongly reduces the erosive of flow surface and as a result have a high effect in gully erosion occurrence. Golestani et al. (2014) proved that the areas with gentle slope are susceptible for surface flow accumulation and gully erosion. Rahmati et al. (2016) indicated that the lithology properties are very important in gully erosion. In contrast, factors of aspect, plan curvature, and TWI with score values of 0.019, 0.015, and 0.012 had the lowest impact on the gully erosion, respectively. Factors of LC, elevation, distance from road, drainage density, SPI, LS, and distance from stream are located in the ranks of 4 to 10. The inconsistency ratio (IR) for pairwise comparison matrix of criteria is obtained 0.032; the ratio indicates an appropriate degree of inconsistency for this approach (modeling by AHP). In addition, the relationship among the gully erosion locations and the classes of conditioning factors are presented in Table 6. By the way, the inconsistency ratio (IR) for each pairwise comparison matrix is shown in Table 6. According to Table 5, all IR values are less than 0.1. After computation of weights of criteria and their classes, to GESM using AHP, Eq. 21 is used in ArcGIS10.5 by Weighted Tool as follows:

Table 5 Relationship between gully erosion and gully conditioning factors using IOE and AHP models
Table 6 Area percentage of the susceptibility classes in different models. Gully cells training set and gully cells testing set falling in each susceptibility class and SCAI values
$$GES{M_{AHP}}=\left[ {\left( {elevatio{n_w} \times 0.085} \right)+\left( {slop{e_w} \times 0.176} \right)+\left( {aspe{t_w} \times 0.019} \right)+~\left( {plan~curvatur{e_w} \times 0.015} \right)+\left( {dis~strea{m_w} \times 0.024} \right)+\left( {drinage~densit{y_w} \times 0.052} \right)+\left( {dis~roa{d_w} \times 0.067} \right)+\left( {litholog{y_w} \times 0.226} \right)+\left( {L{C_w} \times 0.085} \right)+\left( {TW{I_w} \times 0.012} \right)+\left( {SP{I_w} \times 0.04} \right)+\left( {NDV{I_w} \times 0.137} \right)+\left( {L{S_w} \times 0.031} \right)} \right]$$
(21)

The results of gully erosion susceptibility map produced by AHP method are shown in Fig. 6a. It has reclassified into five susceptibility classes: very low, low, moderate, high, and very high based on natural break classification method (Xie et al. 2017). According to Fig. 6a, the very high susceptibility level is located in the south part of the study area. This area covered with Qft1 and Qft2 that have sensitive lithological units to erosion. In contrast, very low gully erosion susceptible areas widely distributed in the northern parts of area due to the Outburst of igneous rocks and low depth of soil. Area of the susceptibility classes are shown in Table 6. Based on Table 6, from total of the study area (416.82 km2), 17.5% (72.88 km2) is located in very low susceptibility class, 22.86% (95.20 km2) in low susceptibility, 25.49% (106.16 km2) in moderate susceptibility, 28.31% (117.9 km2) in high, and 5.85% (24.36 km2) in very high susceptibility class.

Fig. 6
figure 6

Gully erosion susceptibility maps a AHP, b FR, c WofE, and d IOE

Applying FR model

The FR values were estimated based on spatial relationship of gully erosion locations and 13 conditioning factors. As shown in Table 7, when a class of a factor has the FR value higher than 1, it may be assumed that the class is susceptible to gully erosion. Application of the frequency ratio model showed that most gully erosion locations are located at elevations of < 916 m. Elevation class < 916 m has the highest FR value of 2.95, followed by 916–1065 m. Elevations higher than 1065 m had the lowest frequency ratio (0.00). Rahmati et al. (2016) stated that gully distribution is mainly controlled by topographic factors such as elevation. In the case of slope degree, it could be seen that the classes of < 5° had higher FR values (1.47), followed by 5°–10°. In Slope degree higher than 10°, gully erosion has not observed. This result is in line with Ghorbani Nejad et al. (2016) that proved light slopes because of surface flow accumulation, increases the likelihood of gully erosion occurrence. The FR trend of the slope aspect factor showed that classes of flat, east, northeast, and southeast with FR higher than 1 have a clear positive correlation with gully erosion and flat faces with higher FR value (3.08) have the most correlation with gully occurrence. This result is in line with Rahmati et al. (2016). For plan curvature factor, flat areas proved to be the most susceptible to gully erosion with the highest FR value of 1.36. The concave and convex classes had the lowest FR values (1 and 0.68), respectively. The relationship between distance from river and gullies revealed that there is a strong and indirect correlation between them. Class of 0–100 m had the highest FR value (1.19), followed by 200–300 m, 100–200 m, 300–400 m, and > 400 m. The spatial relationship between gullies with drainage density revealed that high FR value is related to high and very high density areas and gully erosion was not occur at drainage density lower than 1.7 km/km2. In addition, results showed that class > 2.4 km/km2 with the highest density, had the highest FR value (2.08) and had the strongest relationship with gully erosion susceptibility. The correlation between the gully locations and the distance from road showed that the class less than 300 m had the highest FR weight (7.45), expresses high gully erosion susceptibility in this range of distance from road. In contrast, classes > 1200 m with the lowest FR value (0.39) had a weak relation with gully erosion. For the geology factor, the high level piedmont fan and valley terrace deposits class had the highest FR value (7.32), indicating high susceptibility in this area, followed by light-red-to-brown marl and Gypsiferous marl with sandstone intercalations and fluvial conglomerate, piedmont conglomerate, and sandstone with FR values of 3.27 and 2.80, respectively. For the LC, it can be seen that the salt pan class has FR value of 13.35, showed that the gully erosion susceptibility in this LC is high. The analysis of FR model for the relationship between gully erosion locations and TWI showed that class of 10–13 with the highest FR (1.14) had strong influences on gully erosion occurrence. For the SPI factor, the class 300–900 had the highest FR value (1.07), followed by classes 900–1200 with FR (0.83). In the case of NDVI, class of − 0.039 to 0.13 with highest FR (1.44) had strong relation with gully erosion. For the LS factor, class of 70–110 m, 110–144 m, and < 36 m with FR values of higher than 1, represent strong relation with gully erosion. Finally, In order to generate a gully erosion susceptibility map, the GESM was calculated by summing each weighted factor using the following equation:

Table 7 Relationship between gully erosion and 13 conditioning factors using FR and WofE models
$${\text{GES}}{{\text{M}}_{{\text{FR}}}}=\left[ {\left( {{\text{elevatio}}{{\text{n}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{slop}}{{\text{e}}_{{\text{FR}}}}} \right){\text{+}}\left( {{\text{aspe}}{{\text{t}}_{{\text{FR}}}}} \right){\text{+~}}\left( {{\text{plan~curvatur}}{{\text{e}}_{{\text{FR}}}}} \right){\text{+}}\left( {{\text{dis~strea}}{{\text{m}}_{{\text{FR}}}}} \right){\text{+}}\left( {{\text{drinage~densit}}{{\text{y}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{dis~roa}}{{\text{d}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{litholog}}{{\text{y}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{LU/L}}{{\text{C}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{TW}}{{\text{I}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{SP}}{{\text{I}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{NDV}}{{\text{I}}_{{\text{FR}}}}} \right)\, +\, \left( {{\text{L}}{{\text{S}}_{{\text{FR}}}}} \right)} \right].$$
(22)

FR model provides values of GEMS ranging from 3.32 to 115.12. The gully erosion map was divided into five susceptibility classes include very low, low, moderate, high, and very high (Fig. 6b) using the natural breaks method (Cao et al. 2016). Each class in the study was consisted of 25.82%, 10.62%, 35.94%, 26.38%, and 1.24% of total study area, respectively (Table 6). The mean and standard deviation of this model are 6.12 and 7.21. Most of the high and very high susceptibility classes are located in the south part of study area because of the high lineament density, lower slope, and prone nature of surface and subsurface materials for gully erosion.

Applying weight of evidence model

To apply the WoE modeling, map of every factor is crossed with the GIM using the ArcGIS 10.5, and the density of the gullies calculated in each class. The resultant weights and the spatial relationship between the gully erosion occurrence and classes of each conditioning factor based on WofE model are shown in Table 7. Negative weights indicate negative spatial relationship and positive weights indicate a positive relationship between gully erosion locations and conditioning factor. Classes with positive weights are prone area for gully erosion and vice versa. To generate the final GESM map, weight of classes of each conditioning factor is summed using Eq. 23 in the weighted sum option by Spatial Analyst Tool in ArcGIS 10.5 (Fig. 6c). The final produced GESM by WofE model ranges from about − 98.32 to 199.17. Obviously, larger GESM values indicate a higher susceptibility to gully erosion. The produced map based on these values was classified into five classes including very low, low, moderate, high, and very high using the natural break method (Xie et al. 2017). The areas in the very low, low, moderate, high, and very high gully erosion susceptibility classes were 5.10% (21.24 km2), 30.40% (126.61 km2), 43.67% (181.89 km2), 17.13% (71.35 km2), and 3.70% (15.40 km2), respectively:

$${\text{GES}}{{\text{M}}_{{\text{WofE}}}} \,=\, \left[ {\left( {{\text{elevatio}}{{\text{n}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{slop}}{{\text{e}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{aspe}}{{\text{t}}_{{\text{WofE}}}}} \right){\text{+~}}\left( {{\text{plan~curvatur}}{{\text{e}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{dis~strea}}{{\text{m}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{drinage~densit}}{{\text{y}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{dis~roa}}{{\text{d}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{litholog}}{{\text{y}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{LU/L}}{{\text{C}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{TW}}{{\text{I}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{SP}}{{\text{I}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{NDV}}{{\text{I}}_{{\text{WofE}}}}} \right)\, +\, \left( {{\text{L}}{{\text{S}}_{{\text{WofE}}}}} \right)} \right].$$
(23)

Applying index of entropy (IOE)

The results of index of entropy model for GESM are shown in Table 5. According to it, factors of LC, lithology, and elevation with weights of 3.86, 1.79, and 0.96 have had the greatest impact on the gully erosion. In contrast, factors of distance to stream, stream power index, and NDVI with 0.04, 0.107, and 0.304 scores had the lowest impact on gully erosion, respectively. The parameters of stream length (LS), distance from road, TWI, drainage density, slope aspect, slope degree, and plan curvature are located in the next ranks. The relation between classes of parameters and gully erosion locations by index of entropy model is shown in Table 5.

The final GESM using index of entropy model was generated using the following equation:

$${\text{GES}}{{\text{M}}_{{\text{shanoon}}}} \,=\, \left[ {\left( {{\text{elevatio}}{{\text{n}}_{\text{w}}} \times {\text{0.71}}} \right)\, +\, \left( {{\text{slop}}{{\text{e}}_{\text{w}}} \times {\text{0.23}}} \right)\, +\, \left( {{\text{aspe}}{{\text{t}}_{\text{w}}} \times {\text{0.38}}} \right)\, +\, \left( {{\text{plan~curvatur}}{{\text{e}}_{\text{w}}} \times {\text{0.31}}} \right)\, +\, \left( {{\text{dis~strea}}{{\text{m}}_{\text{w}}} \times {\text{0.04}}} \right)\, +\, \left( {{\text{drinage~densit}}{{\text{y}}_{\text{w}}} \times {\text{0.39}}} \right)\, +\, \left( {{\text{dis~roa}}{{\text{d}}_{\text{w}}} \times {\text{0.60}}} \right)\, +\, \left( {{\text{litholog}}{{\text{y}}_{\text{w}}} \times {\text{1.79}}} \right)\, +\, \left( {{\text{LU/L}}{{\text{C}}_{\text{w}}} \times {\text{3.86}}} \right)\, +\, \left( {{\text{TW}}{{\text{I}}_{\text{w}}} \times {\text{0.79}}} \right)\, +\, \left( {{\text{SP}}{{\text{I}}_{\text{w}}} \times {\text{0.25}}} \right)\, +\, \left( {{\text{NDV}}{{\text{I}}_{\text{w}}} \times {\text{22.79}}} \right)\, +\, \left( {{\text{L}}{{\text{S}}_{\text{w}}} \times {\text{0.07}}} \right)} \right].$$
(24)

The result of this summation is a continuous interval of values from 0.96 to 8.22, which represents the gully erosion susceptibility index. A natural break classification scheme was used to divide the values into five classes (Fig. 6d) and a susceptibility map was prepared. According to the gully erosion susceptibility map generated with the IOE model, it was found that 16.75%, 18.48%, 32.51%, 29.76%, and 2.51% of the total area falls in the very low, low, moderate, high, and very high susceptible classes, respectively.

Validation and sensitivity analysis

The success rate and prediction rate curves are used to verify the accurate of the gully erosion susceptibility results (Guo-liang et al. 2017). Validation was performed by comparison of gully locations and the produced gully erosion susceptibility maps. Both success rate and prediction rate curves of the gully erosion susceptibility maps were verified using training and validating data set, respectively. These rates can assess using AUC that ranges from 0.5 to 1.0. The AUC values can be classified as follows: 0.5–0.6, poor; 0.6–0.7, average; 0.7–0.8, good; 0.8–0.9, very good; and 0.9–1, excellent (Rahmati et al. 2016). Results of success rate and prediction rate are shown in Table 8 and Fig. 7. The results showed that value of success rate is 0.973 (97.3%) for AHP model that are better than success rates of 0.917 (91.7%) FR, 0.925 (92.5%) IOE, and 0.921 (92.1%) WofE models. In addition, prediction rate of 0.954 (95.4%) using AHP model is better than prediction rates of 0.912 (92.1%) FR, 0.939 (93.9.7%) IOE, and 0.926 (92.6%) WofE models. Results indicated that four model had excellent performance in prediction prone areas to gully erosion and can be used as an effective method to perform gully erosion susceptibility maps. In addition, the seed cell area index (SCAI) validation technique proposed by Süzen and Doyuran (2004) was applied in this study. SCAI shows the density of gullies among the landslide susceptibility zones. In general, high and very high susceptibility classes have very small SCAI values (Pawluszek and Borkowski 2017). Results of SCAI indicator are shown in Table 6. According to Table 6, SCAI values of the four models are desirable in the high and very high susceptibility classes.

Fig. 7
figure 7

Prediction and success rates a AHP, b FR, c WofE, and d IOE

Table 8 Area under the curve (AUC)

Discussion

Data-driven and knowledge-based models reflect two different perspectives in spatial modeling. A knowledge-based approach is based on field campaigns, expert knowledge and experience, evidence of varying quality, and guidelines, while a data-driven approach is based on the observational data. The advantages of AHP approach are (1) flexibility, intuitive appeal to the decision makers, and its ability to check inconsistencies, (2) the AHP method break down a decision problem into its constituent sections and builds hierarchies of criteria and calculate the importance of each criterion, (3) AHP reduces bias in decision making by providing a useful mechanism for checking consistency of the evaluation measures and alternatives, and (4) the AHP method supports group decision making. The main disadvantages of AHP are (1) the number of pairwise comparisons may become very large in this model, and, therefore, become a difficult task, (2) the artificial limitation of the use of the 9 point scale is another disadvantage and may difficult to distinguish for decision maker that one alternative is 3 or 5 times more important than another, and (3) in AHP model, compensation between good scores on some criteria and bad scores on other criteria can occur. Therefore, important information may be missing (Macharis et al. 2004). The results of the comparison performed in this research stated that knowledge-based models such as AHP and data-driven classification methods such as FR, WofE, and IOE had the excellent (AUC = 0.9–1) prediction capabilities for identifying prone areas to soil erosion that these results are in agreement with Dube et al. (2014), Wang et al. (2015), Rahmati et al. (2016), Pawluszek and Borkowski (2016), Wu et al. (2016); and Xie et al. (2017). Dube et al. (2014) applied weights-of-evidence model for gully erosion hazard assessment in Mbire District—Zimbabwe and indicated that this model had high accuracy in prediction of prone areas to gully erosion. Wang et al. (2015) used certainty factor and index of entropy models for assessment of landslide susceptibility in the Qianyang County of Baoji City, China. Their results showed that IOE model with prediction rate of 80.88% had good accuracy in identification of susceptible areas to landslide. Rahmati et al. (2016) investigated GESM using bivariate statistical models including weights-of-evidence and frequency ratio and stated that frequency ratio model with (AUC = 78.11%) had better accuracy in comparison to WofE model (AUC = 70.07%). Pawluszek and Borkowski (2016) considered impact of DEM-derived factors and AHP model for landslide susceptibility mapping in the region of Roznow Lake, Poland. Their results indicated that AHP model had high capability in prediction of susceptible areas to landslide. Wu et al. (2016) prepared a landslide susceptibility map using statistical index (SI), frequency ratio (FR), and certainty factor (CF) models in a Geographic Information System (GIS) for the Gangu County, Gansu Province, China and stated that FR model with prediction accuracy of 75.62% had performed better than CF model. Xie et al. (2017) compared weights-of-evidence, logistic regression, and support vector machine models and evaluated their results by SBAS-InSAR monitoring for landslide susceptibility mapping in China and showed that WofE model with prediction rate of 0.812 had good prediction capability in identifying the prone areas to landslide.

Conclusions

Gully erosion is one of the most hazards in the Toroud Watershed that it causes loss of soil and the destruction of infrastructure. Therefore, an accurate assessment of the probability of gully erosion occurrence is required for conservation of natural resources such as soil and reducing its potential risks. For this purpose, four GIS-based models such as AHP, FR, WofE, and IOE models as expert knowledge and data-driven techniques were used to generate the gully erosion susceptibility maps for the study area in Semnan Province, Iran. Performance of four models was compared by AUC and SCAI methods. To produce gully erosion susceptibility maps in the study area, among 18 geo-environmental factors selected based on literature review, data accessible, and field surveys, 13 gully erosion-conditioning factors were used for modeling purposes. The final gully susceptibility maps were classified based on natural break algorithm. Results of validation showed that the gully erosion susceptibility maps generated by four models show good prediction efficiency and the four models have been used successfully to produce gully erosion susceptibility maps for the study area. Therefore, the gully erosion susceptibility maps generated in this research are an important tool for planners, decision makers, and engineers. They can make affordable, rapid, and well-grounded decisions to minimize and avoid the damage and losses caused by existing and future gullies, or avoid the high and very high susceptible zones, by appropriate preventive measures and mitigation procedures.