1 Introduction

Landslides, responsible for considerable loss of property and lives every year, are dangerous natural hazards. In many countries, the annual loss of lives and property due to landslides are greater than any other natural disasters, including earthquakes, floods, and windstorms. Globally, landslides cause thousands of deaths and injuries each year and property damage of about billions of dollars (Lee and Pradhan 2007; Solaimani et al. 2013). In China, more than 10,000 hazardous landslides occurred in 2014, which caused deaths and injuries to a total number of 618 people and a direct economic loss of 5.41 billion CNY (http://www.cigem.gov.cn). In addition, currently tens of millions of people still live under the high-risk threat of landslides (Liu et al. 2013).

The main goal of landslide susceptibility analysis is to identify dangerous and high risk areas and thus landslide damage can be reduced through suitable mitigation measures (Kannan et al. 2013; Solaimani et al. 2013). Over the decades, number of approaches have been proposed to assess landslide susceptibility based on geographic information systems (GIS) using different models. Many of these studies have applied probabilistic models such as frequency ratio (FR), weight of evidence (WOE), etc. (Akgun et al. 2008; Vijith and Madhu 2008; Oh et al. 2009; Yilmaz and Keskin 2009; Youssef et al. 2009, 2012; Pradhan and Youssef 2010; Pradhan et al. 2011; Akgun et al 2012; Neuhäuser et al. 2012; Saponaro et al. 2014; Sujatha et al. 2014). Statistical models such as logistic regression (Akgun 2012; Devkota et al. 2013; Grozavu et al. 2013; Ozdemir and Altural 2013; Solaimani et al. 2013), bivariate (Bednarik et al. 2010; Pareek et al. 2010; Pradhan and Lee 2010; Pradhan and Youssef 2010; Pourghasemi et al. 2013a) and multivariate (Pradhan 2010a, 2010b; Choi et al. 2012) have been used in landslide susceptibility mapping. Other different methods such as spatial multicriteria decision analysis (MCDA) approach (Akgun and Turk 2010; Akgun 2012), analytical hierarchy process (AHP) (Yalcin et al. 2011; Pourghasemi et al. 2012a; Park et al. 2013), certainty factor (CF) model (Devkota et al. 2013; Pourghasemi et al. 2013c), artificial neural network model (Chauhan et al. 2010; Pouydal et al. 2010; Pradhan and Buchroithner 2010; Park et al. 2013), index of entropy (IOE) model (Mihaela et al. 2011; Devkota et al. 2013), decision-tree methods (Nefeslioglu et al. 2010; Pradhan 2013), fuzzy logic (Pourghasemi et al. 2012a, 2012b, 2012c; Guettouche 2013; Sharma et al. 2013), support vector machine (SVM) (Yilmaz 2010; Pradhan 2013) and neuro-fuzzy (Vahidnia et al. 2010; Sezer et al. 2011) have also been employed for the purpose of landslide susceptibility mapping. Nevertheless, few studies have so far been carried out on landslide susceptibility analysis in this area. The main difference between the present study and the approaches described in the aforementioned literature is that certainty factor and index of entropy models were applied and compared for landslide susceptibility mapping in this study area.

This study aims to apply both certainty factor (CF) and index of entropy (IOE) models to develop reliable landslide susceptibility maps of the Qianyang County of Baoji, China and to find a better model that is more accurate in landslide susceptibility mapping in the study area. In order to achieve this, both certainty factor and index of entropy analysis methodology, to obtain landslide susceptibility maps using the geographic information system were developed, applied, and verified in the study area.

2 Study area

Geographically, the study area is located in central-western part of China, and lies between latitudes 34 34\(^{\prime }\) 34\(^{\prime \prime }\)–34 56\(^{\prime } 56^{\prime \prime }\)N and longitudes 106 56\(^{\prime } 15^{\prime \prime }\)–107 22\(^{\prime } 31^{\prime \prime }\)E (figure 1). The study area covers roughly a surface area of 996.46 km 2 and belongs to the Qianyang County of Baoji city, China. The temperature of the area varies between −20.6 C in winter and 40.5 C in summer, with an yearly average of 11.8 C. The mean annual rainfall according to local weather station in a period of 40 years is around 627.4 mm, and the maximum rainfall appears in August (C.H. of China Meteorological Administration (CMA), 2014). Main streams in the study area are the Wei and Jing Rivers. The landform of the area can be classified into mountain, hill, and plain, which result in large elevation difference. On the whole, the northern regions are relatively higher with the highest elevation 1560 m, and the southern regions are relatively lower with the lowest elevation 752 m. The slope angle values vary between 0 and 38 . The main commercial agricultural products in the region are wheat and Sorghum. Apart from the agricultural areas, the other mainland cover types are pasture and forest. In this area, the traffic is convenient. Up to now, the County road mileage is about 389.8 km. The population of the County was about 132,000 in the year 2010. There is a wide gap in the population density between a plain terrain and a mountainous area. The average density of population in plain terrain is approximately 163 people per square kilometer, while in mountainous area it is about 28 people per square kilometer. Major settlements are shown in figure 1 (Yang 2009). The area is mainly distributed by loess. In total, 81 landslides were mapped in the study area.

Figure 1
figure 1

Location of the study area.

3 Data preparation

3.1 Landslide inventory map

The landslide inventorymap, providing information for the assessment of the influence of different causative factors on landslide occurrence, is the backbone of landslide susceptibility studies (Van Westen et al. 2006; Youssef et al. 2014a, 2014b). The reliability and accuracy of the collected data related to landslides affect the success of landslide susceptibility analysis. Landslide inventory maps can be prepared either by collecting the information related with landslides or by analyzing satellite imagery and aerial photographs coupled with field surveys using GPS (Pradhan and Kim 2014). In order to produce a detailed and reliable landslide inventory map, extensive field surveys and observationswere performed during 2009 in the study area. A total number of 81 landslides were identified and mapped by evaluating aerial photos in 1:50,000 scale, with well supported field surveys and subsequently digitized for further analysis. The locations (centroid) of 81 landslides are mapped in figure 1. From these landslides, randomly 56 (70%) locations were chosen for the preparation of landslide susceptibility maps, while the remaining 25 (30%) cases were used for testingthe model.

3.2 Thematic layers

In order to apply certainty factor and index of entropy models, a spatial database that considers 15 factors, including slope angle, slope aspect, general curvature, plan curvature, profile curvature, altitude, distance to faults, distance to rivers, distance to roads, STI, SPI, TWI, geomorphology, lithology and rainfall, was designed and constructed. Data of these factors, such as slope angle, slope aspect, general curvature, plan curvature, profile curvature, altitude, STI, SPI, TWI, were mainly produced from the DEM. Other parameters were mainly collected from available resources (geological structure map, environment geology map, road map and drainage map, etc.). All of these data were produced in raster format with a pixel size of 30 × 30 m 2 to be compatible with the spatial resolution. A brief description of each data layer preparation is given here.

Slope degree, with direct effect on landslide formation, is a very important parameter in the slope stability analysis, and it is frequently used in preparing landslide susceptibility maps (Lee and Min 2001; Saha et al. 2005; Gorsevski et al. 2012). Slope configuration and steepness play an important role in conjunction with lithology (Pourghasemi et al. 2012c). In this study, the slope degree map was prepared from the 30 ×30 m 2 digital elevation model (DEM) collected from the advanced space-borne thermal emission and reflection radiometer (ASTER) using ArcGIS 10.0 (ESRI Inc., Redlands, CA, USA) and was classified into five classes considering the steepness of the terrain (Kayastha et al. 2013; Liu et al. 2014), i.e., 0–7 , 7 –14 , 14 –21 , 21 –28 , and 28 –38 (figure 2a). ArcGIS 10.0 analysis was performed to discover in which slope group the landslide happened and the rate of occurrence was also observed. The landslide percentage in each slope group class is determined as a percentage of slopes. The result indicates that about 94.6% of the landslides occurred on slopes ranging from 0 to 21 (table 2).

Figure 2
figure 2figure 2figure 2

(a–f). (a) Slope angle map, (b) slope aspect map, (c) general curvature map, (d) plan curvature map, (e) profile curvature map, and (f) altitude map of the study area. (g–i). (g) Distance to faults map, (h) distance to rivers map, (i) distance to roads map, (j) STI map, (k) SPI map, and (l) TWI map of the study area. (m–o). (m) Geomorphology map, (n) lithology map, and (o) rainfall map of the study area.

Slope aspect describes the direction of slope. The slope aspect controls the formation of the landslide such as lineaments, rainfalls, wind effects, and exposure to sunshine (Yalcin and Bulut 2007; Pourghasemi et al. 2012a). In this study, the slope aspect map of the study area was produced to show the relationship between aspect and landslide. Aspect regions were divided into nine directional classes as flat (−1), north (337.5 –360 , 0 –22.5 ), northeast (22.5 –67.5 ), east (67.5 –112.5 ), southeast (112.5 –157.5 ), south (157.5 –202.5 ), southwest (202.5 –247.5 ), west (247.5 –292.5 ), and northwest (292.5 –337.5 ) (figure 2b).

Commonly, general curvature, defined as the rate of change of slope degree or aspect, has been argued to affect slope stability. The characterization of slope morphology and flow can be analyzed with the help of the general curvature map (Nefeslioglu et al. 2008). Plan curvature is described as the curvature of a contour line formed by intersection of a horizontal plane with the surface. It influences the convergence and divergence of flow across a surface. The profile curvature, the vertical plane parallel to the slope direction, affects the acceleration and deceleration of downslope flows and, as a result, influences erosion and deposition (Kannan et al. 2013; Kritikos and Davies 2014). In this study, the general curvature, plan curvature and profile curvature were derived from the DEM in ArcGIS 10.0, and they were divided into three classes: <–0.05, 0.05 to 0.05, and >0.05, respectively (figure 2c–e).

Altitude is another frequently conditioning factor for landslide susceptibility analysis because it is controlled by several geologic and geomorphological processes (Gorsevski et al. 2012; Pourghasemi et al. 2012b; Pradhan and Kim 2014). Landslides usually occur at intermediate elevations since slopes tend to be covered by a layer of thin colluvium that is prone to landslides (Dai and Lee 2002; Dragićević et al. 2015). In the study area, the elevation of the study area ranged from 720 to 1560 m, and five categories of elevations were identified, i.e., 720–850, 850–1000, 1000–1150, 1150–1300, and 1300–1560 m (figure 2f).

Geological fault, responsible for triggering a large number of landslides due to the tectonic breaks that usually decrease the rock strength, was also a necessary parameter in the susceptibility analysis. In study area, five different buffer zones of the distance to faults were prepared from geological maps using the Euclidean distance interpolation method (figure 2g).

Distance to rivers, an important parameter controlling the stability of a slope, is the saturation degree of the material on the slope. The closeness of the slope to drainage structures is another important factor for landslide susceptibility analysis (Pourghasemi et al. 2012b; Park et al. 2013). To determine the degree to which the streams affected the slopes, five different buffer categories were created using Euclidean distance interpolation method (figure 2h).

Similar to the distance of slopes to rivers, the distance to roads has been considered as one of the most important anthropogenic factors influencing landslide occurrences that can be the cause of cut slope creations through construction of roads that disturbs the natural topography and affects the stability of the slope (Pourghasemi et al. 2012b; Demir et al. 2013). The distance to roads was calculated by Euclidean distance tool of the GIS application and it reclassified the resultant map into five classes: 0–1000, 1000–2000, 2000–3000, 3000–4000, and >4000 m (figure 2i).

The sediment transport index (STI), reflecting the erosive power of the overland flow, was derived by considering the transport capacity limiting sediment flux and catchment evolution erosion theories (Devkota et al. 2013; Pradhan and Kim 2014). In this study, STI was divided into four classes <3, 3–9, 9–15, and >15 (figure 2j).

The stream power index (SPI), a measure of the erosion power of the stream, is also considered as a factor contributing towards stability within the study area (Conforti et al. 2011; Regmi et al. 2014). The SPI is expressed as SPI = A S × tan(β), where A S is the specific catchment’s area and β is the local slope gradient measured in degrees (Moore et al. 1988; Park et al. 2013). The SPI map of the study area was classified into four classes: <5, 5–10, 10–40, and >40 (figure 2k).

The topographic wetness index (TWI) describes the effect of topography on the location and size of saturated source areas of runoff generation, and it is another topographic factor within the runoff model (Pourghasemi et al. 2013b; Pradhan and Kim 2014). TWI is calculated as: Ln[ A S /tan(β)], where A S is the specific catchment area of each cell and β represents the slope gradient (in degrees) of the topographic heights (Moore et al. 1988; Saadatkhah et al. 2014). In the present study, the TWI values were arranged in four classes: <7, 7–10, 10–13, and >13 (figure 2l).

Geomorphology is considered as an important factor closely related to landslide occurrence. The detailed geomorphology map, derived from the geological maps with limited field check (Kannan et al. 2013), consists of five prominent units such as mountain areas, loess ridge and hill areas, loess tableland areas and plain areas (figure 2m).

Lithology is one of the most common determinant factors in landslide stability studies because different lithological units have different landslide susceptibility values, which can provide important information regarding landslide susceptibility of a region (Yalcin and Bulut 2007; Pourghasemi et al. 2012a). In this study, the lithology map of the study area is compiled from existing geological maps (1/100,000 scale) in ArcGIS 10.0. The study area is covered with various types of lithological units. Their names, lithologic characteristics, and ages of the geological units are provided in table 1, and the general geological setting of the area is shown on the source map (figure 2n).

Table 1 Description of geological units of the study area (Wang 2008).

The rainfall was taken into account as a triggering factor for landslide initiation, thus it is another one of the main parameters in landslide susceptibility mapping. For this study, the annual rainfall map was produced, using the data of the weather stations of eight towns in the area and applying the inverse distance weighted (IDW) interpolation method. This map was reclassified into five classes: <600, 600–650, 650–700, 700–750, and >750 mm/yr (figure 2o).

4 Modelling approach

4.1 Certainty factor model

Certainty factor approach, one of the proposed favorability functions to deal with the problem of combination of heterogeneous data, has been widely used for mapping landslide susceptibility (Kanungo et al. 2011; Sujatha et al. 2012; Pourghasemi et al. 2013c; Liu et al. 2014). The function of probability (certainty factor) was proposed by Shortliffe and Buchanan (1975) in the beginning, and later improved by Heckeman (1986), and the expression is as:

$$ CF\,=\,\left\{ \!\!\!\!{{\begin{array}{*{20}c} {{(PP_{a} \,-\,PP_{s}})/({PP_{a} (1\,-\,PP_{s} ))} ~\text{ if } PP_{a} \ge PP_{s}} \hfill \\ ({{PP_{a} \,-\,PP_{s}})/({PP_{s} (1\,-\,PP_{a} ))~}\text{ if } PP_{a} <PP_{s}} \end{array}} } \right. $$
(1)

where P P a is the conditional probability of landslide event occurring in category a. P P s is theprior probability of total number of landslide events in the study area a. The certainty factor ranges between −1 and 1; positive values means an increasing certainty in landslide occurrence, and negative values correspond to a decreasing certainty in landslide occurrence. A value close to zero means that there is not enough information about the variable and thus, it is difficult to give any indication about the certainty of the landslide occurrence (Sujatha et al. 2012; Dou et al. 2014).

The CF values of all the thematic layers used in the present study were calculated in ArcGIS 10.0 and Microsoft Excel based on equation (1), and the result is given in table 2. Then, the CF values of the causative factor were pair-wise based upon the combination rule given in the following equation (Pourghasemi et al. 2013c; Dou et al. 2014; Ilia et al. 2015):

$$ Z=\left\{ \!\!\!{{\begin{array}{*{20}l} {X+Y-XY ~~~~~~~~~~~~~~~~~~~~~~\,~~X,Y\ge 0} \\ {{(X+Y})/({1-\min (\left| X \right|,\left| Y \right|))} ~~~~X\ast Y<0} \\ {X+Y+XY ~~~~~~~~~~~~~~~~~~~~~~~~X,Y<0} \end{array}} } \right. $$
(2)

By using the integration rule of equation (2), the pair-wise combination is repeatedly performed until all the CF layers are combined to generate the landslide susceptibility.

Table 2 Spatial relationship between each landslide conditioning factor and landslide by CF and IOE models.

4.2 Index of entropy model

Index of entropy model, based on the principle of bivariate analysis, was used for evaluating the landslide susceptibility in this study. This approach allows calculation of the weight for each input variable. In this model, the weighing process is based on the methodology proposed by Vlcko et al. (1980). The weight value for each parameter taken separately is expressed as an entropy index. In the present study, the weight parameter was obtained from the defined level of entropy representing the extent that various factors influence the development of a landslide. The information coefficient W j representing the weight value for the parameter as a whole was calculated using the following equations (Constantin et al. 2011; Devkota et al. 2013; Jaafari et al. 2014; Youssef et al. 2014a, 2014b):

$$ P_{ij} =\frac{b}{a}, $$
(3)
$$ \left( {P_{ij}} \right)=\frac{P_{ij}} {{\sum}^{S_{j}}_{j=1} P_{ij} } , $$
(4)
$$ H_{j} =-\sum\limits_{i=1}^{S_{j}} {\left( {P_{ij}} \right)} \log_{2} \left( {P_{ij}} \right), ~~~ j=1, 2 , \mathellipsis ,n, $$
(5)
$$ H_{j\max} =\log_{2} S_{j} ,~~ S_{j} \text{is the number of classes,} $$
(6)
$$ I_{j} =\frac{H_{j\max} -H_{j}} {H_{j\max} } , ~~~I=(0, 1),~~~ j=1, \mathellipsis , n, $$
(7)
$$ W_{j} =I_{j} \times P_{j} , $$
(8)

where a and b are the domain and landslide percentages, respectively, (P i j ) is the probability density, H j and \(H_{j\!\max }\) represent entropy values, I j is the information coefficient and W j represents the resultant weight value for the parameter as a whole. The final landslide susceptibility map is developed based on equation (9) using the ArcGIS 10.0 software.

$$ Y_{\text{IOE}} ={\sum\limits_{i}^{n}} {\frac{z}{m_{i}} } \times C\times W_{j} , $$
(9)

where Y IOE is the sum of all the classes; i is the number of particular parametric map (1,2, …,n); z is the number of classes within parametric map with the greatest number of classes; m i is the number of classes within particular parametric map; C is the value of the class after secondary classification and W j is the weight of a parameter (Bednarik et al. 2010; Devkota et al. 2013; Jaafari et al. 2014).

5 Results and discussion

5.1 Certainty factor (CF) model

The certainty factor rating for different classes of the causative factors show the importance of the respective classes in the slope instability. It can be observed from table 2 that the slope angle class 14 –21 has the highest value of CF (0.14) and the lowest value of CF (−1.00) is for slope class 28 –38 . From this, it is clear that the landslide occurrence increases by the increase in slope gradient up to a certain extent, and then, it decreases (Kanungo et al. 2011). Few landslides occur on a very gentle slope and the landslide occurrence decreases as the slope becomes higher than 28 . A positive correlation between landslide occurrence and slope was also reported by Sun (2009). Within the slope aspect categories, the CF value is positive for north-facing, east-facing, southwest-facing and northwest-facing, with the maximum value (0.40) at north-facing followed by east-facing (0.22) slope. The west-facing slopes are less prone to landslides as they have negative CF value. The CF values of general curvature show that the CF value is positive (0.21) only on class <−0.05, indicating a high probability of landslide occurrence in this area. In the case of plan curvature, few landslides occur on >0.05 class. In the study area, class >0.05 of profile curvature has higher CF value (0.3), this means that the landslide probability is higher in this class. The relationship between landslide occurrence and altitude reflects that the elevations between 720 and 850 m, and 850 and 1000 m have positive CF value (0.71 and 0.46), indicating that the probability of occurrence of landslide in these altitudes is high. Meanwhile, altitude >1300 m has a low CF value (−0.88). The reason is mainly that the higher topographical elevation is formed by the lithological units resistant to landslide. In case of distance to faults, the closer the fault, the greater is the landslide probability. At a distance of <1000 m, the CF value is 0.31, showing a high probability of a landslide. The CF value is lower than zero at a distance >8000 m and this indicates a low probability. In the case of relationship between landslide occurrence and distance to rivers, as the distance from a drainage increases, the landslide probability generally decreases. At a distance of about <200 m, the CF value is positive (0.48), indicating a high probability of landslide occurrence, and at distances about >200 m, the CF value was <0, indicating a low probability. This can be attributed to the fact that terrain modification caused by gully erosion and undercutting may influence the initiation of landslides. For distance from roads, a positive association with landslides is obtained for classes <1 and 2–3 km and the areas with more than 4 km distances from roads have low correlations with landslide occurrence in this study. A similar trend was also reported by Youssef et al. (2014a). The relation between STI landslide probabilities showed that >15 class has the highest value of CF (0.33), and for SPI, the class of >40 shows a high CF value (0.30). Similarly, for TWI, the highest CF value (0.41) was obtained for the interval of 10–13. In the study area, landslides were commonly observed on and close to relatively flat valley bottoms with a huge contributing area, where TWI, SPI, and STI expose higher values. A similar pattern was also observed by Jaafari et al. (2014). Geomorphology yields terrain information, which is a vital pace in learning the process of landslide initiation (Kannan et al. 2013). It is noticed that loess tableland areas have the highest influence in triggering landslides. For the lithology, it can be seen that the lithology class 5 has highest CF values (0.82). This indicates that lithological unit of the glutenite, sandstone and siltstone has the highest influence in triggering landslides. For the rainfall, the results show that the CF values in class of 650–700, >750 mm/year are very high (0.51) than that of the other three classes. The final result of certainty factor (CF) model is a landslide susceptibility index (LSI) map, in which the LSI values vary from −8.27 to 5.66.

5.2 Index of entropy (IOE) model

To produce the landslide susceptibility map using the IOE models, every parameter map was crossed with the landslide inventory map using the ArcGIS 10.0 software, and the density (P i j ) of the landslide occurrence in each class is calculated. The resultant weights for each thematic map for the IOE model are given in table 2. From the W j value, it is seen that the altitude has the highest impact in the landslide susceptibility, followed by lithology, rainfall, and geomorphology, while others are less significant in the landslide susceptibility of the region. From the result (P i j ), it is seen that slope interval of 14 –21 is highly prone to landslide followed by the slope class 21 –28 . In the case of aspect, north-facing slopes followed by northwest-facing, east-facing, northeast-facing, and southwest-facing slopes are susceptible to landsliding. In the case of general curvature, most landslides occurred <−0.05. In terms of plan curvature and profile curvature, most of the existing landslides are distributed between the <−0.05 and −0.05–0.05 classes, and >0.05 class, respectively. The P i j value for altitude clearly showed that ranges 720–850 m, and 850–1000 m have the most effect on the occurrence of landslides with high values of 0.59 and 0.32, respectively. The other classes have low values. In the case of distance to faults, <2000 m range has the highest P i j value (0.34) followed by 4000–6000 m (0.23), 2000–4000 m (0.20), 6000–8000 m (0.20), and >8000 m (0.04). The distance to rivers shows that the P i j value decreases as the distance to river increases. From this, it is clear that the bank erosion is one of the main triggering factors. Assessment of distance to roads showed that distance of 0–1000 and 2000–3000 m has high correlation with landslide occurrence. Most of the landslides are located at >15 class for STI, as the values of P i j is highest (0.39) here. Relation between SPI, and TWI and landslide probability showed that >40 and 10–13 classes have highest value of P i j , respectively. The P i j value for geomorphology showed that loess tableland areas have the most effect on the occurrence of landslides with high value of 0.43. In the case of the relationship between landslide occurrence and lithology, the value of P i j was higher in glutenite, sandstone and siltstone. In rainfall, the highest P i j value (0.34) was located in the rainfall classes of 650–700 and >750 mm/year. The findings based on entropy approach show that altitude, lithology, and rainfall are the most important factors which explain better the landslide occurrence and distribution in the study area. It should be noted that the landslide contributing factors may vary from region to region, such that the rating scheme followed in this study area may not be suitable anywhere else (Bijukchhen et al. 2013). The final result of index of entropy (IOE) model is an LSI map, in which the LSI values vary from 0.50 to 5.42.

5.3 Landslide susceptibility maps

For visual interpretation of LSI maps, the data need to be classified into categorical susceptibility classes (Jaafari et al. 2014). In this study, using the natural breaks method in ArcGIS 10.0, the landslide susceptibility map generated with the certainty factor model was reclassified into five classes: very low, low, moderate, high, and very high (figure 3). From the output of analysis carried out using the ArcGIS 10.0 (table 3), it was found that 18.4% of the study area was placed in the group with very low susceptibility. Low, moderate and high susceptibility landslide classes comprised of 27.40, 20.49, and 19.52% of the area, respectively. In all, 14.19% of the region was placed in the class with very high landslide susceptibility. It can be observed from table 3 that 3.70 and 6.17% of the total landslides fall in the very low and low susceptibility zones respectively. Moderate, high, and very high susceptible zones represent 11.11, 4.98, and 37.04% of the landslides, respectively. In landslide susceptibility map produced from index of entropy model (figure 4), the very low susceptible zone covers 49.15% of the total study area, whereas low, moderate, high, and very high susceptible zones cover 17.51, 14.59, 13.57 and 5.18% of the total area, respectively. Meanwhile, the results show that the percentages of the total landslides in very low, low, moderate, high, and very high susceptibility classes are 11.11, 4.94, 20.98, 46.91, and 16.05%, respectively (table 3).

Figure 3
figure 3

Landslide susceptibility map derived from the CF model.

Figure 4
figure 4

Landslide susceptibility map derived from the IOE model.

Table 3 Comparison of predicted landslide hazard zones and observed landslides.

5.4 Validation of the landslide susceptibility maps

Validation of predictive models is an essential requirement to check the accuracy of the landslide susceptibility map produced (Gorsevski et al. 2000; Chung and Fabbri 2003). The validity of the landslide susceptibility maps can be graphically ascertained with the area under curve (AUC) method (Chung and Fabbri 2003; Van Westen et al. 2003; Intarawichian and Dasananda 2011; Kayastha et al. 2013). This method works by creating specific rate curve which explains percentage of known landslides that fall into each defined level of susceptibility rank and displays as the cumulative frequency diagram (Intarawichian and Dasananda 2011). Chung and Fabbri (2003) distinguished between success- and prediction-rate curves. A success rate curve is based on a comparison of the susceptibility map with the landslides used in modelling (i.e., the training set). This is considered as a degree of fit measure (Chung and Fabbri 2003; Pradhan and Kim 2014). Apart from the success rate curve, the prediction rate curve is a good indicator of the predictive power of the susceptibility map. The prediction rate curve can be created by the validation landslide inventory (Pradhan and Kim 2014). In rate curve, the y axis is normally considered as the cumulative percentage of observed landslide occurrences in different susceptibility classes and the x axis corresponds to the cumulative percentage of the area of the susceptibility classes. Total area under a rate curve (AUC) can be used to determine prediction accuracy of the susceptibility map qualitatively in which larger area means higher accuracy achieved (Lee 2005; Mathew et al. 2009; Intarawichian and Dasananda 2011; Pourghasemi et al. 2013b). In this study, at first, the total landslides observed in the study area were split into two parts, 56 (70%) landslides were randomly selected from the total 81 landslides as the training data and the remaining 25 (30%) landslides were kept for validation propose. Then the rate curves were obtained by comparing the landslide training data and validation data with the susceptibility maps and the areas under the curve were calculated.

The success rate curves of CF and IOE models shown in figure 5(a), indicate that the first 10% of the area of the susceptibility classes can explain about 52%, 48% of all used landslides (out of 56 events), respectively. The first about 72% for CF and IOE models can accommodate all known landslides. Similarly, the prediction rate curves (figure 5b) indicate that the first 10% of the area of the susceptibility classes can explain about 37% and 32% of all used landslides (out of 25 events), respectively. The first, about 93% for CF and IOE models can accommodate all known landslides. From calculation of the AUC for CF and IOE models, it was found that AUC for the success rate curves are 0.8433 and 0.8227, respectively. Namely, the training accuracy of the susceptibility maps is 84.33% and 82.27%, respectively. The areas of prediction rate curves are 0.8232 and 0.8088, which means that the overall prediction rates are 82.32% and 80.88% for the CF and IOE models, respectively. These results show that the CF and IOE models are successful estimators, and the two models employed in this study have reasonably good accuracy in predicting the landslide susceptibility of the study area. The map produced by CF model exhibited the better result for landslide susceptibility mapping in the study area.

Figure 5
figure 5

AUC representing quality model (a) success rate curve and (b) prediction rate curve.

6 Conclusions

In this study, both certainty factor and index of entropy models to estimate areas susceptible to landslides for the Qianyang County of Baoji city, China, using GIS has been presented. The relationship between a landslide occurrence and the identified 15 causative factors such as slope angle, slope aspect, general curvature, plan curvature, profile curvature, altitude, distance to faults, distance to rivers, distance to roads, STI, SPI, TWI, geomorphology, lithology, and rainfall was evaluated using CF and IOE methods. The selection of the 15 conditioning landslide factors, based on consideration of relevance, availability, and scale of data that was available for the study area, is relative and subjective, and can be improved in future research. Before susceptibility analysis, a total of 81 landslides observed in the area were separated into two groups, 56 (70%) landslides were randomly selected for generating a model and the remaining 25 (30%) were used for validation proposes. The susceptibility maps produced by CF and IOE models were divided into five different susceptibility classes such as very low, low, moderate, high, and very high. The area under the rate curve quantitatively indicates the performance of the susceptibility maps. The results show that the CF model with a success rate of 84.33% and predictive accuracy of 82.32% performs better than IOE (success rate, 82.27%; predictive accuracy, 80.88%) models. Overall, both models showed almost similar results and were reasonable models for the landslide susceptibility mapping of the study area. In addition, it should be noted that both the models were developed on some basic assumptions such as topography, geology, andstream, etc. If data on factors causing the landslides, such as extreme rainfall, earthquake shaking, exist, then a more accurate analysis could be done.

The output results of the present study can help the decision makers, managers, urban planners, engineers, and land-use developers to manage slopes and choose susceptible locations to accomplish development in wild land as well as urban areas. Also, it is worth mentioning that similar method can be used elsewhere where the same geological and topographical feature prevails.