Keywords

1 Introduction

Landslides are one of many types of natural processes and when threaten mankind they will represent as hazard (Glade et al. 2005). Globally, landslides cause thousands of deaths and injuries, and the direct and indirect costs of landslide damages go up to many billions of USD annually (Roberds 2005). Climate changes and its anticipated consequences are expected to lead to an increase in natural hazards including landslides, resulting in loss of lives and infrastructure damages (Korup et al. 2012).

Landslide damages can be reduced if we understand the mechanisms of occurrence, prediction, hazard assessment, early warning, and risk management (Sassa and Canuti 2008). Landslide hazard assessment can help authorities to reduce landslide damages through proper land use management for infrastructural development and for environmental protection (Tien Bui et al. 2013a). The spatial prediction of landslides is considered as one of the most difficult aspects in the assessment of landslide hazard. For this reason, various methods and techniques have been proposed and they range from simple qualitative techniques to sophisticated mathematical models (Chung and Fabbri 2008). Good overview of these methods including their disadvantages and advantages can be seen in Guzzetti et al. (1999) and Chacon et al. (2006).

In recent years, with the development of geographical information systems (GIS) and computer sciences, some new methods such as neural networks, fuzzy logic, and neuro-fuzzy have become new solutions for landslide modelling with good prediction capabilities (Pradhan et al. 2010; Sezer et al. 2011; Pourghasemi et al. 2012; Tien Bui et al. 2012c; Akgun et al. 2012; Althuwaynee et al. 2014). Although a series of methods and techniques have been proposed and implemented, no agreement has been reached so far on which method is the best one for landslide susceptibility mapping. It is clear that the quality of landslide susceptibility models is influenced both by the methods used and the sampling strategies employed. In more recent years, data mining and ensemble-based approaches have received much attention in many fields including landslide studies (Tien Bui et al. 2013b). They are reported having an improvement of the prediction performance of models (Rokach 2010; Tien Bui et al. 2013c).

The main objective of this study is to investigate potential application of the Fuzzy Unordered Rules Induction Algorithm (FURIA) with Bagging (an ensemble technique) in comparison with the Decision Tree model, for spatial prediction of shallow landslides at Lang Son city area (Vietnam). FURIA is a fuzzy rule based classification system that combines advantages of RIPPER (Cohen 1995) and fuzzy logic. FURIA and its ensemble have not been used in landslide modelling. The computation process was carried out using WEKA ver.3.6.6, MATLAB 7.11, and ArcGIS 10.

2 Study Area and Spatial Database

2.1 Study Area Characteristics

The study area that includes the Lang Son city and the Dong Dang town (Fig. 1) is located in the northeast mountainous province of Lang Son (Vietnam). It covers an area of about 168 km2 and lies between longitudes 106°41′34′′E and 106°48′32′′E, and latitudes 21°49′43′′N and 21°57′13′′N. Slopes in the study area are from 0° to 84°, around 66 % of the study area has slopes steeper than 15°. The elevation ranges from 194 to 800 m a.s.l with a mean of 328 m.

Fig. 1
figure 1

Study area location map showing the landslide inventory

The study area is comprised of approximately 45.2 % forest land, 21.5 % paddy land, 20.4 % barren land, and 5.7 % crop land, whereas settlement areas cover about 6.9 %. The soil types are mainly ferralic acrisols, dystric gleysols, rhodic ferralsols, and eutric fluvisols that account for 95.2 % of the total study area. Eleven lithologic formations are recognized in the region and six of them account for 80 % of the study area. They are Na Khuat, Tam Lung, Khon Lang, Lang Son, Tam Danh, and Mau Son formations. The main lithologies are marl, siltstone, tuffaceous conglomerate, griltstone, sandstone, basalt, and clay shale. Approximately 16 % of the study area is covered by Quaternary deposits that mainly contain granule, grit, breccia, boulder, sand, and clay.

Landslides in the study area mainly occurred during extreme rainfall events and tropical rainstorms. With the rapid development of economics in the province for the last two decades, the expansions of the infrastructures and the settlements which are shifted into the mountainous regions, have increased slope disturbance. In addition, the deforestation is still continuing leading to potential increase of landslides.

2.2 Spatial Database

2.2.1 Landslide Inventory

In the study area, the landslides were mainly rainfall-triggered shallow soil slides and debris flows. Rock fall was reported in some very few cases and is not included in this study. No information on earthquake-induced landslides has been reported so far. The landslide inventory map for this study was constructed from several sources: (1) Landslides that occurred before the year of 2003 detected by the interpretation of aerial photographs and field survey data. The aerial photographs have a resolution of about 1 m. The aerial photographs were acquired by the Aerial Photo—Topography Company 2003; (2) the landslide inventory map of 2006 (Tam et al. 2006); (3) the landslide inventory map of 2009 (Truong et al. 2009); (4) Some recent landslides were identified during field works. A total of 172 landslides depicted by polygons (Fig. 1) were identified and registered in the inventory map, including 86 rotational slides, 52 translational slides, and 34 debris flows.

2.2.2 Digital Elevation Model and Derivatives

In this study area, the digital elevation model (DEM) was generated from National Topographic Maps at scales 1:5,000 for the Lang Son city and 1:10,000 for the surrounding areas. The DEM has 5 m resolution. Slope and aspect were extracted from the DEM. In the case of the slope map, six categories were constructed (Fig. 2a), whereas nine layer classes were constructed for the aspect map (Fig. 2b).

Fig. 2
figure 2

a Slope; b Aspect; c Lithology; d Land use (ACL annual crop land; BL barren land; GL grass land; Pl paddy land; PCL perennial crop land; PA populated area; PDL productive forest land; PTL protective forest land; WSL water surface land)

2.3 Lithology and Distance to Faults

The lithological map was constructed with seven groups: conglomerate, basalt, quaternary, siltstone, limestone, sandstone, and tuff (Fig. 2c). The distance-to-faults map (Fig. 3b) was constructed by buffering the fault lines. Five fault buffer categories were constructed: 0–100, 100–200, 200–300, 300–400, and >400 m. The lithology and fault lines were extracted from four tiles of the Geological and Mineral Resources Map of Vietnam at 1:50,000 scale (Quoc et al. 1992; Truong et al. 2009).

Fig. 3
figure 3

a Soil type; b Distance to faults; (DF dystric fluvisols; DG dystric gleysols; EF eutric fluvisols; FA ferralic acrisols; PA plinthic acrisols; RF rhodic ferralsols; RM rocky mountain; WA water area)

2.4 Land Use and Soil Type

Land use was extracted from the land use status map from 2010 of the Lang Son province. The scale of the land use status map is 1:50,000 and this map is a result of the Status Land Use Project of the National Land Use Survey in Vietnam. A total of nine classes were constructed for the land use map (Fig. 2d). Regarding the soil type map, a total of eight layers were constructed for analysis (Fig. 3a). The soil types were extracted from the national pedology map at scale 1:100,000.

3 Methodology

3.1 Training and Validation Dataset

The landslide inventory and six conditioning factor maps (slope, aspect, lithology, landuse, soil type, and distance to faults) were converted to a grid cell format with spatial resolution of 5 m. Assuming N(LS) is the total number of grid cells in the study area and the training dataset D has N(D) total number of landslide grid cells. We define F ij as the j-th layer class of the landslide conditioning factors F i and N(F ij ) is the total number of grid cells in the class F ij . By overlaying the landslide grid cells in the training dataset on each of the six landslide conditioning maps, the number of grid cells in F ij overlapping with the landslide grid cells \({\text{N}}\left( {{\text{T}} \cap {\text{F}}_{\text{ij}} } \right)\) was determined. Then, each category of the six maps was assigned to an attribute value that was calculated using the following equation

$$Attribute\left( {F_{ij} } \right) = \frac{{W_{{F_{ij} }} }}{{\sum\nolimits_{j = 1}^{n} {W_{{F_{ij} }} } }}$$
(1)

where

$$W_{{F_{ij} }} = \frac{{N(D \cap F_{ij} )/N(D) \, }}{{[N(F_{ij} ) - N(D \cap F_{ij} )]/[N(LS) - N(D)]}}$$
(2)

The numerator in Eq. (2) is the proportion of landslide pixels that occur in the factor class, whereas the denominator is the proportion of non-landslide pixels in the factor class.

In landslide susceptibility modeling, a landslide inventory is suggested to be partitioned into two subsets (Chung and Fabbri 2003), one subset will be used for building the landslide models whereas the other will be used for model validation. In general, the partition of landslide inventories using temporal distribution is considered to be the best method (Chung and Fabbri 2008). However, the dates for the past landslide are unknown; therefore we randomly split the landslide inventory map in a 70/30 ratio for building and validation of the model, respectively. Resulting in a training dataset that contains 117 landslide locations (3,793 landslide grid cells), that was used for building models, and a validation dataset with 55 landslide locations (1,664 landslide grid cells). Landslide pixels were assigned a value of 1.

The same number of grid cells was randomly sampled from the no landslide areas and were assigned a value of 0. A total of 3,793 no-landslide grid cells were generated for the training data and 1,664 no-landslide grid cells for the validation data. At the final step, the values of the six landslide conditioning factors were extracted to build the training and validation datasets. The training and validation datasets contain 7,586 and 3,328 observations, one dependent variable, and six independent variables (the six landslide conditioning factors) (Table 1).

Table 1 Attribute classes of landslide conditioning factors used in the FURIA, FURIA with bagging, decision tree, decision tree with bagging

3.2 Fuzzy Unordered Rules Induction Algorithm

FURIA is a fuzzy rule based classification system proposed by Huhn and Hullermeier (2009). This algorithm is an extension of a state-of-the-art rule learner called RIPPER (Cohen 1995) in which fuzzy and unordered rules are to be used instead of conventional rules and rule lists, respectively.

Suppose that we have a training dataset D that have instance-label pairs (x i , y i ) where i is the i-th training instance, \(\varvec{x}_{\varvec{i}} \in R^{n}\), and \(y_{\varvec{i}} \in \left\{ {\text{1, 0}} \right\}.\) In the current context, x i is the vector of input of the six landslide conditioning factors: slope, aspect, lithology, land use, soil type, and distance to faults. The two classes of {1, 0} denote landslide and no-landslide pixels. RIPPER divides the training dataset into two subsets a growing set and a pruning set. The first one will be used for growing the rules whereas the second one is used for pruning. At the first step, rule sets will be generated and learned using the growing set. Each rule to be grown by greedily adding antecedents until the rule is satisfied. All possible combinations of landslide conditioning factors were tested and the final one with the highest value of FOIL’s Information Gain (IG) (Quinlan and Cameron-Jones 1993) was selected.

$$IG_{r} = p_{r} [\log_{2} (p_{r} /(p_{r} + n_{r} )) - \log_{2} (p/(p + n))]$$
(3)

where p r and n r are the number of positive and negative instances cover by the rule, whereas p and n are the number of positive and negative instances cover by the default rule.

For avoiding over-fitting, the rule pruning process was carried out by simplifying the rules. All of the learned antecedents will be pruned if the antecedents maximizing V r . Finally, the rule optimization process was carried out.

$$V_{r} = p_{r} /(p_{r} + n_{r} )$$
(4)

FURIA combines advantages of RIPPER and fuzzy logic, and the rule order in the rule list is not important and there is no default rule (Trawinski et al. 2011). Rules for each label class were induced separately using one-versus-the rest strategy. FURIA transforms the crisp rules of RIPPER into fuzzy rules using the trapezoidal membership function (Huhn and Hullermeier 2009). In this function, each fuzzy interval is specified by four parameters and is written as I = (T 1T 2T 3T 4).

$$I(x) = \left\{ {\begin{array}{*{20}c} 1 & {T_{2} \le v \le T_{3} } \\ {\frac{{v - T_{1} }}{{T_{2} - T_{1} }}} & {T_{1} \le v \le T_{2} } \\ {\frac{{T_{4} - v}}{{T_{4} - T_{2} }}} & {T_{3} \le v \le T_{4} } \\ 0 & \hbox{else} \\ \end{array} } \right.$$
(5)

For an instance v i  = (x i1, …, x i6), the fuzzy membership function can be expressed as

$$\mu (v_{i} ) = \,\prod\limits_{j = 1}^{6} {I_{j} (x_{j} )}$$
(6)

The fuzzification of a single antecedent of a rule is only relevant to a subset \({\text{D}}_{k} \in D\), and then Dk is divided into two subsets \({\text{D}}_{k + }\) and \({\text{D}}_{k - }\). The quality of the fuzzification is checked to choose the best one using the purity rule criteria as mentioned in Eq. (7)

$$pur = \frac{{p_{i} }}{{p_{i} + n_{i} }}{;}\quad p_{i} = \sum\limits_{{v \in D_{k + } }} \mu A_{i} (v);\quad n_{i} = \sum\limits_{{v \in D_{k - } }} \mu A_{i} (v)\,;\,A_{i} \in I(x)$$
(7)

Fuzzy rules were constructed for class y i and a certainty degree CDi for the consequence. The final decision for output is based on the largest V value as

$$V = \sum\limits_{i = 1}^{m} {\mu_{rule(i)} (v)\,*CD_{i} }$$
(8)

Finally, the rule generalization procedure is carried out to obtain the final fuzzy rule list. A detailed explanation can be seen in Huhn and Hullermeier (2009, 2010).

FURIA was training using stratified 10-fold cross-validation. First, the training dataset was randomly partitioned into 10-folds of equal size. Then, in each run, 9-folds were used for fitting the model whereas the remaining fold was used to assess the performance. The procedure is repeated ten times and results are averaged. In this study, the fuzzy aggregation operator of the product T-Norm (used as fuzzy AND) was selected to combine rule antecedents. This is because FURIA product was reported significant better than FURIA-min (the minimum of T-Norm) (Huhn and Hullermeier 2010). Since the selection of number folds for the training data used for pruning is significant affecting the model accuracy, a test was therefore performed with different folds versus classification accuracy. The result shows that 4-folds used for pruning and the rest for growing the fuzzy rules have the highest classification accuracy. Other parameters were set as default in WEKA. Finally, the FURIA model with 45 rules was constructed for landslide susceptibility in this study. The overall accuracy was 84.84 %. The details for the accuracy by class and performance by the FURIA model are shown in Tables 4 and 5.

3.3 Decision Tree

Decision tree classifiers are hierarchical models composed of a root, internal nodes, leaf nodes, and branches, and have been considered one of the most popular classification methods in data mining. The goal of decision tree modeling is to generate a tree structure that contains a set of rules using the training dataset. The tree structure has the capability to predict the output for a new similar dataset with good accuracy. Once a decision tree model is constructed; it can process new data by following a path from the root node to the leaves and values for the new data will be obtained. Since the output for pixels in landslide susceptibility modeling present continuous values, the decision trees are called regression trees. The key advantage of decision trees is that they are easy to construct. In addition, the results from decision trees are readily interpretable with clear information of the contribution of the variables on the model results. However, decision trees do not allow for multiple outputs and are susceptible to noisy data (Zhao and Zhang 2008).

Various algorithms for constructing decision trees have been successfully developed such as classification and regression tree (CART) (Breiman et al. 1984), Chi-square Automatic Interaction Detector decision tree (CHAID) (Michael and Gordon 1997), ID3 (Quinlan 1986), C4.5 (Quinlan 1993), and J48 (Witten and Frank 2005). However the C.45 algorithm has been considered as the fastest algorithm for machine learning with good classification accuracy (Lim et al. 2000). In this study, the J48 algorithm, which is a Java re-implementation of the C4.5 algorithm, was used. The detailed description of the C4.5 algorithm can be seen in Quinlan (1993). Only a short description of decision tree is discussed here. There are two steps in the decision tree construction, the first one is the tree building and the second one is the tree pruning. The first step of the tree building process is to find the input landslide conditioning factor with the highest gain ratio using the training data set, and then select as the first internal node called root node. In the next step, the training dataset was split based on the root values, and sub-notes were created. Then, the gain ratio was estimated for each sub-node. The variable with the highest gain ratio is selected, and the recursive partitioning of the training data set is continued until all instances in the training dataset are assigned to leaf nodes or no remaining variables in which the training data can be further split. In some cases, the resulting tree may be obtained with a large number of branches, and thus the tree may over-fit the training dataset with a perfect classification, but the model has a poor classification performance for a new dataset. Therefore, the tree pruning was carried out by removing unessential nodes but with the classification accuracy still remaining (Breiman et al. 1984).

In this study, the first step in constructing decision tree models is to determine the parameters that influencing the classification accuracy of the resulting tree. The type of pruning is based on sub-tree rising. Laplace smoothing was used here to improve probabilistic estimates at leaves (Tien Bui et al. 2012a, 2013b; Tehrany et al. 2013). A test was carried out to find the most suitable parameters for the study area based on the classification accuracy. The results are shown in Tables 2, 3. The results show that the best values for minimum number of instances per leaf and the confident factor are 14 and 0.15 respectively. The selection number of fold of training data used for reduce-error pruning does not affect the accuracy of the model.

Table 2 Minimum number of instances per leaf
Table 3 Confidence factor used for pruning

Using the training data set and the above mentioned parameters, decision tree model was trained using with stratified 10-fold cross-validation. The 10-fold cross-validation was preferred to be used in order to ensure that the decision trees generalize beyond the training data (Breiman et al. 1984). Finally, the decision tree model was constructed for landslide susceptibility. The size of the tree is 133. The tree has the root node, 65 internal nodes, and 67 leaves. The classification accuracy is 86.82 %. The more detail of accuracy by class and performance of the decision tree model is shown in Tables 4, 5.

Table 4 Performance of the FURIA, the FURIA with Bagging, the decision tree, and the decision tree with bagging
Table 5 Accuracy assessments by classes of the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

3.4 Bagging

Bagging known as bootstrap aggregation, is one of the earliest ensemble algorithms proposed by Breiman (1996). Bagging is a method that uses bootstrap sampling to generate multiple subsets from the training dataset. Each subset is called a bootstrap sample created by sampling the training dataset of the same size with replacement. In the next step, each of the subset will be used to construct a classifier based model. Then, the final model is determined by aggregating all the based classifiers (Fig. 4).

Using the training data set, the FURIA with Bagging and the Decision tree with Bagging models were trained. The parameters setting for the above two models are remaining the same as in Sects. 3.2 and 3.3. The models were trained and the final results were obtained. The classification accuracy is 86.38 % and 87.50 % for the FURIA with Bagging and for the Decision Tree with Bagging, respectively. The results from the trained models are shown in Tables 4 and 5.

Fig. 4
figure 4

General framework of the bagged FURIA and the bagged decision tree in this study

3.5 Generation of Landslide Susceptibility Maps

The successfully trained models were then applied to calculate landslide susceptibility indexes for all the pixels in the study area. The obtained results were converted into a GIS format and loaded in ArcGIS.10. The landslide susceptibility maps were visualized by mean of four susceptibility classes based on the percentage of area (Pradhan and Lee 2010a, b): high (10 %), moderate (10 %), low (20 %), very low (60 %). For the purpose of visualization, only two landslide susceptibility maps that were produced from the FURIA with Bagging and Decision Tree with Bagging models are shown (Figs. 5, 6).

Fig. 5
figure 5

Landslide susceptibility map of the Lang Son city areas using the FURIA with bagging model

Fig. 6
figure 6

Landslide susceptibility map of the Lang Son city area using the decision tree with bagging model

The landslide densities (Kanungo et al. 2008) analysis was carried out for the landslide susceptibility maps by overlaying the four susceptibility zones with the landslide inventory map. Ideally, the density value should increase from very low to high susceptibility zones. The graph of the density analysis for the two models in this study is shown in Fig. 7. The result shows that that there is a gradual increase in landslide density from the very low susceptible zone to the high susceptible zone.

Fig. 7
figure 7

Density plots of four landslide susceptibility classes of the FURIA with bagging and the decision tree with bagging models

4 Validation and Comparison of Landslide Susceptibility Models

4.1 Model Performance and Evaluation

The performance measurement of four landslide susceptibility models (FURIA, FURIA with Bagging, Decision tree, Decision tree with Bagging) were assessed using several statistical evaluation criteria (Tien Bui et al. 2012a) as follows:

$$Sensitivity = TP/\left( {TP + FN} \right)$$
(9)
$$Specificity = TN/\left( {TN + FP} \right)$$
(10)
$$Accuracy = \left( {TP + TN} \right)/\left( {TP + TN + FN + FP} \right)$$
(11)
$$F-measure = 2*Sensitivity*Specificity/\left( {Sensitivity + Specificity} \right)$$
(12)
$$Root\, mean\, squared \,error\, \text{(}RMSE\text{)} = \text{Sqrt}\left(\frac{{(pred_{1} - act_{1} )^{2} + \cdots + (pred_{n} - act_{n} )^{2} }}{n}\right)$$
(13)
$$Mean\, absolute\, error \,\text{(}MAE\text{)}\, = \,\frac{{|pred_{1} - act_{1} | + \cdots + |pred_{n} - act_{n} |}}{n}$$
(14)
$$\begin{aligned} Kappa\,index\,(\kappa ) & = \frac{{P_{C} - P_{exp} }}{{1 - P_{exp} }} \\ \text{where}\,P_{C} & = \left( {TP + TN} \right)/\left( {TP + TN + FN + FP} \right) \\ P_{exp} & = \text{[}\left( {TP + FN} \right)\left( {TP + FP} \right) + \left( {FP + TN} \right)\left( {FN + TN} \right)\text{]}/Sqrt\left( {TP + TN + FN + FP} \right) \\ \end{aligned}$$
(15)

True positive (TP) rate measures the proportion of number of pixels that are correctly classified as landslides. True negative (TN) rate measures the proportion of number of pixels that are correctly classified as non-landslide. False negatives (FN) are the number of landslide pixels classified as non-landslide pixel. True negatives (FN) are the number of non-landslide pixels classified as landslide pixels. Precision measures the proportion of the number of pixels that are correctly classified as landslide occurrences. F-measure combines precision and sensitivity into their harmonic mean. Act is the actual target value whereas pred is the predicted value. P C is the proportion of number of pixels that are correctly classified as landslide or non-landslide. P exp is the expected agreements.

It could be observed that there is a high and almost equal in term of classification accuracy for the three models, FURIA with Bagging, the Decision Tree, and the Decision Tree with Bagging (Table 4). Accuracy assessment by classes (Table 5) shows that the rate of correctly classified landslide pixels is higher than those for non-landslide pixels for all models.

The reliability of the susceptibility models was measured using the Kappa index (Guzzetti et al. 2006). Kappa indexes for the FURIA, FURIA with Bagging, the Decision Tree, and the Decision Tree with Bagging are 0.697, 0.728, 0.736, and 0.750 respectively. It indicates a substantial agreement (Table 6) between the observed and the predicted values. The reliability analysis results are satisfying compared with other works such as Saito et al. (2009) and Tien Bui et al. (2012a).

Table 6 The range of the kappa index and the corresponding agreement between the model and reality (Cohen 1960)

4.2 Model Validation

The prediction capability of the susceptibility models were evaluated using ROC curves. A ROC curve is used to plot sensitivity/1-specificity with different thresholds. Comparied to the to success and prediction rate curves (Chung and Fabbri 2003), ROC curves are considered not sensitive, by keeping in mind of the considerable difference between landslide and non-landslide pixels. Therefore ROC curves are considered as more appropriate evaluation and validation tool for landslide models (Van Den Eeckhaut et al. 2009).

The area under the ROC curve (AUC) is used as an important measurement of the landslide model performance. A landslide model will be considered a preferred model if it has a larger AUC value than other models. A perfect model will have an AUC of 1 whereas a random model has an AUC of approximately 0.5.

In this study, ROC curves and AUCs were prepared for each landslide model in two cases: the first one used the training dataset and the second one used the validation dataset. Since in the first case the same landslide pixels that have already been used to construct the landslide models, therefore, the ROC curve and AUC is only measured the degree of model fit of the model with the training dataset. The result (Fig. 8 and Table 7) shows that the highest degree of fit has the Decision Tree with Bagging (AUC = 0.925), followed by the FURIA with Bagging (AUC = 0.913), the Decision Tree (AUC = 0.908), and FURIA (AUC = 0.878). The prediction capabilities of the landslide models were obtained in the second case. This case uses the validation dataset that has not been used in the training phase and can provide the validation and explain how well the model and the conditioning factors predict the existing landslides (Pradhan and Lee 2010c). The result (Fig. 9 and Table 8) shows that the FURIA with Bagging has the highest prediction capability (AUC = 0.802). The remaining models have almost equal prediction capability (AUC from 0.773 to 0.783).

Fig. 8
figure 8

ROC curves based on the training dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

Table 7 Area under the curves (AUC) based on the training dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging
Fig. 9
figure 9

ROC curves based on the validation dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

Table 8 Area under the curves (AUC) based on the validation dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

4.3 Relative Contribution of the Conditioning Factors

The relative contribution of each conditioning factor on the susceptibility models can be estimated by excluding the factor in the models and then the classification accuracy was estimated. It is clear that the highest accuracy was obtained when all of the six factors are used (Table 9). Aspect has the highest contribution to the models whereas soil type has the lowest contribution. More details are shown in Table 9.

Table 9 Relative contribution of the conditioning factors

5 Conclusion

Over the last two decades, various methods and techniques for the landslide modeling have been used and discussed, however, the FURIA model and Bagging technique are seldom been applied and a comparison between FURIA with Decision Tree and their Bagging has not been carried out so far. Decision Tree models have only been applied in a limited number of studies. The recent development in geographic information systems (GIS) and computer science allows users to apply these techniques with huge GIS data (Pradhan 2013).

In general, there are three main steps used for the landslide susceptibility modeling in this study, data preparation, susceptibility analyses, and validation and comparison. In the first step, the landslide inventory map with 172 landslide polygons was constructed. Among them, approximately 70 % (117 cases) was selected for the training models, whereas the remaining 30 % (55 cases) were used for model validation. And then, landslide conditioning factors were determined. All maps were prepared with a spatial resolution of 5 m. In the next step, a total of four models were constructed. The validation result show that the FURIA with Bagging (AUC = 0.913) and the Decision Tree with Bagging (0.925) have the highest degree of fit with the training data. They are followed by the Decision Tree (AUC = 0.908), and FURIA (AUC = 0.878). Regarding the prediction capability, the FURIA with Bagging has the highest value (AUC = 0.802), the other models have almost equal prediction capability (AUC is around 0.77).

It is well known that the selection of sampling strategy influences the prediction capability of landslide models (Yilmaz 2010). As shown in Chung and Fabbri (2008), the temporal partitioning of landslides is considered to be the best method. However, the temporal partitioning method is not suitable for this study due to unknown dates of landslide occurrence. Therefore the randomly split method was used. The main disadvantage of this method is that it may cause an overestimated of prediction capability of future landslides if spatial separation between training and validation landslides are small (Brenning 2005).

The selection of conditioning factors are an important task for the assessment of landslide susceptibility and may impact on the overall prediction performance for landslide susceptibility models (Pradhan 2013). Although no agreement on universal guidelines has been reached for the selection of conditioning factors (Tien Bui et al. 2012b), the factors related to topography, geology, soil types, hydrology, geomorphology, and land use are considered to be the most commonly used in landslide analyses (Van Westen et al. 2008). Therefore six landslide conditioning factors (slope, aspect, lithology, distance to faults, landuse, and soil type) were selected for this study.

As a final conclusion, all the models exhibit reasonably satisfactory performance. However, we may conclude that the FURIA with Bagging is considered to be the best one from this study. And it is important to note that the performance of these landslide models depends not only on the methods but also on sampling strategy followed, as well as the quality of the data used. Therefore, the quality of the susceptibility maps produced by the four models can be improved if the quality of the data used increases. The analyzed result obtained from this study is valid for shallow landslides. These maps may be useful for natural hazard management policy, planning and decision-making in landslide prone areas.