A Comparative Assessment Between the Application of Fuzzy Unordered Rules Induction Algorithm and J48 Decision Tree Models in Spatial Prediction of Shallow Landslides at Lang Son City, Vietnam

Tien Bui, Dieu; Pradhan, Biswajeet; Revhaug, Inge; Trung Tran, Chuyen

doi:10.1007/978-3-319-05906-8_6

Part of the book series: Society of Earth Scientists Series (SESS)

Abstract

The main objective of this study is to investigate the potential application of the Fuzzy Unordered Rules Induction Algorithm (FURIA) and Bagging (an ensemble technique), in comparison with the Decision Tree model, for spatial prediction of shallow landslides in the Lang Son city area (Vietnam). First, a landslide inventory map was constructed from various sources, and the inventory was randomly partitioned into 70 % for training the models and 30 % for model validation. Second, six landslide conditioning factors (slope, aspect, lithology, land use, soil type, and distance to faults) were prepared. Using these factors and the training dataset, landslide susceptibility indexes were calculated using the FURIA, the FURIA with Bagging, the Decision Tree, and the Decision Tree with Bagging. Finally, the prediction performance of the resulting susceptibility maps was assessed using the Receiver Operating Characteristic (ROC) technique. The results show that the area under the ROC curve (AUC) on the training dataset is largest for the Decision Tree with Bagging (0.925), followed by the FURIA with Bagging (0.913), the Decision Tree (0.908), and the FURIA (0.878). The prediction capability of these models was estimated using the validation dataset. The highest prediction capability was achieved by the FURIA with Bagging (AUC = 0.802), followed by the Decision Tree (AUC = 0.783), the Decision Tree with Bagging (AUC = 0.777), and the FURIA (AUC = 0.773). We conclude that the FURIA with Bagging is the best model in this study.

1 Introduction

Landslides are one of many types of natural processes, and when they threaten mankind they represent a hazard (Glade et al. 2005). Globally, landslides cause thousands of deaths and injuries, and the direct and indirect costs of landslide damage amount to many billions of USD annually (Roberds 2005). Climate change and its anticipated consequences are expected to lead to an increase in natural hazards including landslides, resulting in loss of lives and infrastructure damage (Korup et al. 2012).

Landslide damage can be reduced if we understand the mechanisms of occurrence, prediction, hazard assessment, early warning, and risk management (Sassa and Canuti 2008). Landslide hazard assessment can help authorities to reduce landslide damage through proper land use management for infrastructural development and for environmental protection (Tien Bui et al. 2013a). The spatial prediction of landslides is considered one of the most difficult aspects of landslide hazard assessment. For this reason, various methods and techniques have been proposed, ranging from simple qualitative techniques to sophisticated mathematical models (Chung and Fabbri 2008). A good overview of these methods, including their advantages and disadvantages, can be found in Guzzetti et al. (1999) and Chacon et al. (2006).

In recent years, with the development of geographical information systems (GIS) and computer science, new methods such as neural networks, fuzzy logic, and neuro-fuzzy systems have become new solutions for landslide modelling with good prediction capabilities (Pradhan et al. 2010; Sezer et al. 2011; Pourghasemi et al. 2012; Tien Bui et al. 2012c; Akgun et al. 2012; Althuwaynee et al. 2014). Although a series of methods and techniques have been proposed and implemented, no agreement has been reached so far on which method is best for landslide susceptibility mapping. It is clear that the quality of landslide susceptibility models is influenced both by the methods used and by the sampling strategies employed. More recently, data mining and ensemble-based approaches have received much attention in many fields including landslide studies (Tien Bui et al. 2013b). They are reported to improve the prediction performance of models (Rokach 2010; Tien Bui et al. 2013c).

The main objective of this study is to investigate the potential application of the Fuzzy Unordered Rules Induction Algorithm (FURIA) with Bagging (an ensemble technique), in comparison with the Decision Tree model, for spatial prediction of shallow landslides in the Lang Son city area (Vietnam). FURIA is a fuzzy rule-based classification system that combines the advantages of RIPPER (Cohen 1995) and fuzzy logic. FURIA and its ensemble have not previously been used in landslide modelling. The computation was carried out using WEKA ver. 3.6.6, MATLAB 7.11, and ArcGIS 10.

2 Study Area and Spatial Database

2.1 Study Area Characteristics

The study area, which includes Lang Son city and Dong Dang town (Fig. 1), is located in the northeastern mountainous province of Lang Son (Vietnam). It covers an area of about 168 km² and lies between longitudes 106°41′34′′E and 106°48′32′′E, and latitudes 21°49′43′′N and 21°57′13′′N. Slopes in the study area range from 0° to 84°, and around 66 % of the study area has slopes steeper than 15°. The elevation ranges from 194 to 800 m a.s.l. with a mean of 328 m.

The study area comprises approximately 45.2 % forest land, 21.5 % paddy land, 20.4 % barren land, and 5.7 % crop land, whereas settlement areas cover about 6.9 %. The soil types are mainly ferralic acrisols, dystric gleysols, rhodic ferralsols, and eutric fluvisols, which together account for 95.2 % of the total study area. Eleven lithologic formations are recognized in the region and six of them account for 80 % of the study area. They are the Na Khuat, Tam Lung, Khon Lang, Lang Son, Tam Danh, and Mau Son formations. The main lithologies are marl, siltstone, tuffaceous conglomerate, gritstone, sandstone, basalt, and clay shale. Approximately 16 % of the study area is covered by Quaternary deposits that mainly contain granule, grit, breccia, boulder, sand, and clay.

Landslides in the study area mainly occurred during extreme rainfall events and tropical rainstorms. With the rapid economic development of the province over the last two decades, the expansion of infrastructure and settlements into the mountainous regions has increased slope disturbance. In addition, deforestation is still continuing, which may lead to a further increase in landslides.

2.2 Spatial Database

2.2.1 Landslide Inventory

In the study area, the landslides were mainly rainfall-triggered shallow soil slides and debris flows. Rock fall was reported in very few cases and is not included in this study. No information on earthquake-induced landslides has been reported so far. The landslide inventory map for this study was constructed from several sources: (1) landslides that occurred before 2003, detected by the interpretation of aerial photographs and field survey data (the aerial photographs have a resolution of about 1 m and were acquired by the Aerial Photo-Topography Company, 2003); (2) the landslide inventory map of 2006 (Tam et al. 2006); (3) the landslide inventory map of 2009 (Truong et al. 2009); and (4) some recent landslides identified during field work. A total of 172 landslides depicted by polygons (Fig. 1) were identified and registered in the inventory map, including 86 rotational slides, 52 translational slides, and 34 debris flows.

2.2.2 Digital Elevation Model and Derivatives

In this study area, the digital elevation model (DEM) was generated from National Topographic Maps at scales 1:5,000 for the Lang Son city and 1:10,000 for the surrounding areas. The DEM has 5 m resolution. Slope and aspect were extracted from the DEM. In the case of the slope map, six categories were constructed (Fig. 2a), whereas nine layer classes were constructed for the aspect map (Fig. 2b).

2.3 Lithology and Distance to Faults

The lithological map was constructed with seven groups: conglomerate, basalt, quaternary, siltstone, limestone, sandstone, and tuff (Fig. 2c). The distance-to-faults map (Fig. 3b) was constructed by buffering the fault lines. Five fault buffer categories were constructed: 0–100, 100–200, 200–300, 300–400, and >400 m. The lithology and fault lines were extracted from four tiles of the Geological and Mineral Resources Map of Vietnam at 1:50,000 scale (Quoc et al. 1992; Truong et al. 2009).

2.4 Land Use and Soil Type

Land use was extracted from the land use status map from 2010 of the Lang Son province. The scale of the land use status map is 1:50,000 and this map is a result of the Status Land Use Project of the National Land Use Survey in Vietnam. A total of nine classes were constructed for the land use map (Fig. 2d). Regarding the soil type map, a total of eight layers were constructed for analysis (Fig. 3a). The soil types were extracted from the national pedology map at scale 1:100,000.

3 Methodology

3.1 Training and Validation Dataset

The landslide inventory and the six conditioning factor maps (slope, aspect, lithology, land use, soil type, and distance to faults) were converted to a grid cell format with a spatial resolution of 5 m. Let $N(LS)$ be the total number of grid cells in the study area and let $N(D)$ be the total number of landslide grid cells in the training dataset $D$. We define $F_{ij}$ as the j-th layer class of the landslide conditioning factor $F_{i}$, and $N(F_{ij})$ as the total number of grid cells in the class $F_{ij}$. By overlaying the landslide grid cells in the training dataset on each of the six landslide conditioning maps, the number of grid cells in $F_{ij}$ overlapping with the landslide grid cells, $N(D \cap F_{ij})$, was determined. Then, each category of the six maps was assigned an attribute value calculated using the following equation

$$Attribute\left( {F_{ij} } \right) = \frac{{W_{{F_{ij} }} }}{{\sum\nolimits_{j = 1}^{n} {W_{{F_{ij} }} } }}$$

(1)

where

$$W_{{F_{ij} }} = \frac{{N(D \cap F_{ij} )/N(D) \, }}{{[N(F_{ij} ) - N(D \cap F_{ij} )]/[N(LS) - N(D)]}}$$

(2)

The numerator in Eq. (2) is the proportion of landslide pixels that occur in the factor class, whereas the denominator is the proportion of non-landslide pixels in the factor class.
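As an illustration of Eqs. (1) and (2), the following minimal Python sketch computes the class weights and the normalised attribute values for a single conditioning factor. The function names and the example counts are hypothetical and are not taken from the study; the sketch also assumes every class contains at least one non-landslide cell, otherwise the denominator of Eq. (2) is zero.

```python
def class_weights(n_class, n_ls_class, n_total, n_ls_total):
    """Eq. (2) for every class j of one conditioning factor.

    n_class[j]    : N(F_ij), grid cells in class j
    n_ls_class[j] : N(D ∩ F_ij), training landslide cells in class j
    n_total       : N(LS), grid cells in the study area
    n_ls_total    : N(D), training landslide cells
    """
    weights = []
    for n_f, n_df in zip(n_class, n_ls_class):
        landslide_share = n_df / n_ls_total
        # Assumes n_f > n_df, i.e. the class has non-landslide cells.
        non_landslide_share = (n_f - n_df) / (n_total - n_ls_total)
        weights.append(landslide_share / non_landslide_share)
    return weights


def attribute_values(weights):
    """Eq. (1): normalise the class weights of one factor so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical counts for a factor with three classes (not from the study).
n_class = [6000, 3000, 1000]
n_ls_class = [20, 30, 50]
w = class_weights(n_class, n_ls_class, n_total=10000, n_ls_total=100)
attrs = attribute_values(w)
```

In this made-up example the third class holds half of the landslide cells but only a tenth of the area, so it receives the largest attribute value.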

In landslide susceptibility modeling, it is suggested that a landslide inventory be partitioned into two subsets (Chung and Fabbri 2003): one subset is used for building the landslide models whereas the other is used for model validation. In general, partitioning the landslide inventory by temporal distribution is considered to be the best method (Chung and Fabbri 2008). However, the dates of the past landslides are unknown; therefore we randomly split the landslide inventory map in a 70/30 ratio for building and validating the models, respectively. This resulted in a training dataset containing 117 landslide locations (3,793 landslide grid cells), which was used for building the models, and a validation dataset with 55 landslide locations (1,664 landslide grid cells). Landslide pixels were assigned a value of 1.

The same number of grid cells was randomly sampled from the no-landslide areas and was assigned a value of 0. A total of 3,793 no-landslide grid cells were generated for the training data and 1,664 no-landslide grid cells for the validation data. In the final step, the values of the six landslide conditioning factors were extracted to build the training and validation datasets. The training and validation datasets contain 7,586 and 3,328 observations, respectively, each with one dependent variable and six independent variables (the six landslide conditioning factors) (Table 1).
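The partitioning and balanced sampling described above can be sketched as follows. This is only an illustrative sketch: the function names, the fixed random seed, and the use of plain identifiers for landslides and grid cells are assumptions, and the rounding rule is not necessarily the one used in the study.

```python
import random


def split_inventory(landslide_ids, train_fraction=0.7, seed=0):
    """Randomly split landslide identifiers into training and validation lists."""
    rng = random.Random(seed)
    shuffled = list(landslide_ids)
    rng.shuffle(shuffled)
    n_train = round(train_fraction * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]


def balanced_samples(landslide_cells, candidate_cells, seed=0):
    """Label landslide cells 1 and draw an equal number of no-landslide cells labelled 0.

    candidate_cells must be a list of cells lying outside the landslide polygons.
    """
    rng = random.Random(seed)
    negatives = rng.sample(candidate_cells, len(landslide_cells))
    return [(c, 1) for c in landslide_cells] + [(c, 0) for c in negatives]


# Hypothetical identifiers: 172 landslides, and a small set of grid cells.
train_ids, valid_ids = split_inventory(range(172))
samples = balanced_samples(["a", "b", "c"], list(range(100)))
```

With 172 landslides this sketch yields a 120/52 split rather than the 117/55 split reported above, because only the approximate 70/30 ratio of the study is known.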

Table 1 Attribute classes of landslide conditioning factors used in the FURIA, FURIA with bagging, decision tree, decision tree with bagging

3.2 Fuzzy Unordered Rules Induction Algorithm

FURIA is a fuzzy rule-based classification system proposed by Huhn and Hullermeier (2009). This algorithm is an extension of a state-of-the-art rule learner called RIPPER (Cohen 1995) in which fuzzy rules and unordered rule sets are used instead of conventional rules and rule lists, respectively.

Suppose that we have a training dataset $D$ of instance-label pairs $(x_{i}, y_{i})$, where $i$ indexes the training instances, $x_{i} \in R^{n}$, and $y_{i} \in \{1, 0\}$. In the current context, $x_{i}$ is the input vector of the six landslide conditioning factors: slope, aspect, lithology, land use, soil type, and distance to faults. The two classes {1, 0} denote landslide and no-landslide pixels. RIPPER divides the training dataset into two subsets, a growing set and a pruning set. The first is used for growing the rules whereas the second is used for pruning them. In the first step, rule sets are generated and learned using the growing set. Each rule is grown by greedily adding antecedents until the rule covers no negative instances. All candidate combinations of landslide conditioning factors were tested, and the one with the highest value of FOIL’s Information Gain (IG) (Quinlan and Cameron-Jones 1993) was selected.

$$IG_{r} = p_{r} [\log_{2} (p_{r} /(p_{r} + n_{r} )) - \log_{2} (p/(p + n))]$$

(3)

where $p_{r}$ and $n_{r}$ are the numbers of positive and negative instances covered by the rule, whereas $p$ and $n$ are the numbers of positive and negative instances covered by the default rule.
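A minimal Python sketch of Eq. (3) is given below for illustration. The function name and the example counts are hypothetical; returning zero for a rule that covers no positive instances is an assumption made here to avoid taking the logarithm of zero.

```python
from math import log2


def foil_gain(p_r, n_r, p, n):
    """FOIL's information gain of a rule, following Eq. (3).

    p_r, n_r : positive / negative instances covered by the rule
    p, n     : positive / negative instances covered by the default rule
    """
    if p_r == 0:
        # Assumption: a rule covering no positives contributes no gain.
        return 0.0
    return p_r * (log2(p_r / (p_r + n_r)) - log2(p / (p + n)))


# Hypothetical counts: the default rule covers 100 positives and 100 negatives.
gain_pure = foil_gain(40, 0, 100, 100)    # rule covers 40 positives only
gain_none = foil_gain(50, 50, 100, 100)   # rule is no purer than the default
```

A rule that keeps many positives while excluding negatives scores highly, whereas a rule with the same class mix as the default rule scores zero.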

To avoid over-fitting, rule pruning was carried out to simplify the rules: learned antecedents are removed from a rule, and the pruned version that maximizes the rule-value metric $V_{r}$ in Eq. (4) is kept. Finally, the rule optimization process was carried out.

$$V_{r} = p_{r} /(p_{r} + n_{r} )$$

(4)

FURIA combines the advantages of RIPPER and fuzzy logic; the order of the rules in the rule set is not important and there is no default rule (Trawinski et al. 2011). Rules for each class were induced separately using a one-versus-rest strategy. FURIA transforms the crisp rules of RIPPER into fuzzy rules using the trapezoidal membership function (Huhn and Hullermeier 2009). In this function, each fuzzy interval is specified by four parameters and is written as $I = (T_{1}, T_{2}, T_{3}, T_{4})$.

$$I(x) = \left\{ {\begin{array}{*{20}c} 1 & {T_{2} \le x \le T_{3} } \\ {\frac{{x - T_{1} }}{{T_{2} - T_{1} }}} & {T_{1} < x < T_{2} } \\ {\frac{{T_{4} - x}}{{T_{4} - T_{3} }}} & {T_{3} < x < T_{4} } \\ 0 & \hbox{else} \\ \end{array} } \right.$$

(5)

For an instance $v_{i} = (x_{i1}, \ldots, x_{i6})$, the fuzzy membership function can be expressed as

$$\mu (v_{i} ) = \prod\limits_{j = 1}^{6} {I_{j} (x_{ij} )}$$

(6)
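The trapezoidal membership of Eq. (5) and the product aggregation of Eq. (6) can be illustrated with the short Python sketch below. It uses the standard trapezoid, in which the falling edge is normalised by $T_{4} - T_{3}$; the function names and the example interval are hypothetical.

```python
def trapezoid(x, t1, t2, t3, t4):
    """Membership of x in the fuzzy interval (t1, t2, t3, t4)."""
    if t2 <= x <= t3:
        return 1.0                       # core of the fuzzy set
    if t1 < x < t2:
        return (x - t1) / (t2 - t1)      # rising edge
    if t3 < x < t4:
        return (t4 - x) / (t4 - t3)      # falling edge
    return 0.0                           # outside the support


def rule_membership(instance, intervals):
    """Product of the antecedent memberships over all factors of one instance."""
    mu = 1.0
    for x, (t1, t2, t3, t4) in zip(instance, intervals):
        mu *= trapezoid(x, t1, t2, t3, t4)
    return mu


# Hypothetical interval applied to two factors of one instance.
interval = (0.0, 10.0, 20.0, 30.0)
mu = rule_membership([5.0, 25.0], [interval, interval])
```

Here each factor value lies halfway along an edge of its trapezoid, so each antecedent contributes 0.5 and the rule membership is their product.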

The fuzzification of a single antecedent of a rule is only relevant to a subset $D_{k} \subseteq D$, and $D_{k}$ is then divided into two subsets, $D_{k+}$ and $D_{k-}$. The quality of each candidate fuzzification is checked, and the best one is chosen using the rule purity criterion given in Eq. (7)

$$pur = \frac{{p_{i} }}{{p_{i} + n_{i} }};\quad p_{i} = \sum\limits_{{v \in D_{k + } }} \mu_{A_{i}} (v);\quad n_{i} = \sum\limits_{{v \in D_{k - } }} \mu_{A_{i}} (v);\quad A_{i} \in I(x)$$

(7)

Fuzzy rules were constructed for each class $y_{i}$, with a certainty degree $CD_{i}$ attached to the consequent of each rule. The final output is the class with the largest value of $V$, computed as

$$V = \sum\limits_{i = 1}^{m} {\mu_{rule(i)} (v)\,*CD_{i} }$$

(8)

Finally, the rule generalization procedure is carried out to obtain the final fuzzy rule list. A detailed explanation can be seen in Huhn and Hullermeier (2009, 2010).
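The decision step of Eq. (8) can be sketched in Python as follows. This is a simplified illustration, assuming that the membership degree and certainty degree of every rule are already available; the data layout and the numbers are hypothetical.

```python
def classify(rule_outputs):
    """Pick the class with the largest support V, following Eq. (8).

    rule_outputs maps a class label to a list of
    (rule membership, certainty degree) pairs, one pair per rule of that class.
    """
    scores = {
        label: sum(mu * cd for mu, cd in pairs)
        for label, pairs in rule_outputs.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical rule activations for one pixel.
rule_outputs = {
    1: [(0.8, 0.9), (0.2, 0.7)],   # two landslide rules
    0: [(0.5, 0.8)],               # one no-landslide rule
}
label, scores = classify(rule_outputs)
```

In this example the landslide class collects more weighted support than the no-landslide class, so the pixel is labelled 1.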

FURIA was trained using stratified 10-fold cross-validation. First, the training dataset was randomly partitioned into 10 folds of equal size. Then, in each run, 9 folds were used for fitting the model whereas the remaining fold was used to assess the performance. The procedure was repeated ten times and the results were averaged. In this study, the product T-norm (used as fuzzy AND) was selected as the fuzzy aggregation operator to combine rule antecedents, because FURIA with the product T-norm was reported to perform significantly better than FURIA-min (the minimum T-norm) (Huhn and Hullermeier 2010). Since the number of folds of the training data used for pruning significantly affects the model accuracy, a test was performed of classification accuracy against different numbers of folds. The result shows that using 4 folds for pruning and the rest for growing the fuzzy rules gives the highest classification accuracy. Other parameters were set to their defaults in WEKA. Finally, the FURIA model with 45 rules was constructed for landslide susceptibility in this study. The overall accuracy was 84.84 %. The details of the accuracy by class and the performance of the FURIA model are shown in Tables 4 and 5.
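The stratified 10-fold partitioning described above can be sketched in plain Python as shown below. This is not the WEKA implementation; the function name, the seed, and the round-robin assignment are assumptions chosen to keep the class proportions equal across folds.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k=10, seed=0):
    """Assign instance indices to k folds while preserving class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the shuffled indices of each class round-robin over the folds.
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    return folds


# Same class sizes as the training dataset: 3,793 cells per class.
labels = [1] * 3793 + [0] * 3793
folds = stratified_folds(labels)
```

In each of the ten runs, one fold would be held out for assessment and the remaining nine used for fitting, with the ten results averaged.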

3.3 Decision Tree

Decision tree classifiers are hierarchical models composed of a root, internal nodes, leaf nodes, and branches, and are considered one of the most popular classification methods in data mining. The goal of decision tree modeling is to generate a tree structure that contains a set of rules using the training dataset. The tree structure can then predict the output for a new, similar dataset with good accuracy. Once a decision tree model is constructed, it can process new data by following a path from the root node to the leaves, where values for the new data are obtained. Since the output for pixels in landslide susceptibility modeling takes continuous values, the decision trees are called regression trees. The key advantage of decision trees is that they are easy to construct. In addition, the results from decision trees are readily interpretable, with clear information on the contribution of the variables to the model results. However, decision trees do not allow for multiple outputs and are susceptible to noisy data (Zhao and Zhang 2008).

Various algorithms for constructing decision trees have been successfully developed, such as classification and regression tree (CART) (Breiman et al. 1984), Chi-square Automatic Interaction Detector decision tree (CHAID) (Michael and Gordon 1997), ID3 (Quinlan 1986), C4.5 (Quinlan 1993), and J48 (Witten and Frank 2005). Among them, the C4.5 algorithm has been considered the fastest machine learning algorithm with good classification accuracy (Lim et al. 2000). In this study, the J48 algorithm, which is a Java re-implementation of the C4.5 algorithm, was used. A detailed description of the C4.5 algorithm can be found in Quinlan (1993); only a short description is given here. There are two steps in decision tree construction: tree building and tree pruning. The tree building process first finds the input landslide conditioning factor with the highest gain ratio using the training dataset and selects it as the first internal node, called the root node. Next, the training dataset is split based on the root values, and sub-nodes are created. The gain ratio is then estimated for each sub-node. The variable with the highest gain ratio is selected, and the recursive partitioning of the training dataset continues until all instances in the training dataset are assigned to leaf nodes or no variables remain on which the training data can be further split. In some cases, the resulting tree may have a large number of branches and may over-fit the training dataset, classifying it perfectly while performing poorly on a new dataset. Therefore, tree pruning was carried out by removing unessential nodes while maintaining the classification accuracy (Breiman et al. 1984).
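The gain ratio used to choose the splitting factor can be illustrated with the following Python sketch. It is a simplified version that assumes a categorical factor with one branch per class value, and it is not the J48 code; the function names and the toy data are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(items):
    """Shannon entropy (in bits) of the value distribution in items."""
    n = len(items)
    return -sum((c / n) * log2(c / n) for c in Counter(items).values())


def gain_ratio(values, labels):
    """Information gain of splitting on values, divided by the split information."""
    n = len(labels)
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy(values)
    # A factor with a single value cannot split the data.
    return info_gain / split_info if split_info > 0 else 0.0


# Toy data: four pixels, label 1 = landslide, 0 = no landslide.
labels = [1, 1, 0, 0]
ratio_slope = gain_ratio(["steep", "steep", "flat", "flat"], labels)
ratio_soil = gain_ratio(["a", "b", "a", "b"], labels)
```

In this toy case the slope classes separate the two labels perfectly while the soil classes carry no information, so slope would be chosen as the root node.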

In this study, the first step in constructing the decision tree models was to determine the parameters that influence the classification accuracy of the resulting tree. The type of pruning is based on subtree raising. Laplace smoothing was used to improve the probability estimates at the leaves (Tien Bui et al. 2012a, 2013b; Tehrany et al. 2013). A test was carried out to find the most suitable parameters for the study area based on the classification accuracy. The results are shown in Tables 2 and 3. They show that the best values for the minimum number of instances per leaf and the confidence factor are 14 and 0.15, respectively. The number of folds of the training data used for reduced-error pruning does not affect the accuracy of the model.

Table 2 Minimum number of instances per leaf

Table 3 Confidence factor used for pruning

Using the training dataset and the above-mentioned parameters, the decision tree model was trained using stratified 10-fold cross-validation. The 10-fold cross-validation was used to ensure that the decision trees generalize beyond the training data (Breiman et al. 1984). Finally, the decision tree model was constructed for landslide susceptibility. The size of the tree is 133: it has the root node, 65 internal nodes, and 67 leaves. The classification accuracy is 86.82 %. More details on the accuracy by class and the performance of the decision tree model are shown in Tables 4 and 5.

Table 4 Performance of the FURIA, the FURIA with Bagging, the decision tree, and the decision tree with bagging

Table 5 Accuracy assessments by classes of the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

3.4 Bagging

Bagging, also known as bootstrap aggregating, is one of the earliest ensemble algorithms and was proposed by Breiman (1996). Bagging uses bootstrap sampling to generate multiple subsets from the training dataset. Each subset, called a bootstrap sample, is created by sampling the training dataset with replacement until it has the same size as the training dataset. In the next step, each subset is used to construct a base classifier. Then, the final model is determined by aggregating all the base classifiers (Fig. 4).
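The bagging procedure described above can be sketched in Python as follows. This is a generic illustration and not the WEKA implementation: the base learner `fit_mean_threshold` is a hypothetical one-variable threshold classifier standing in for FURIA or the Decision Tree, and the number of models and the seed are assumed values.

```python
import random


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement."""
    return [rng.choice(rows) for _ in rows]


def bagging_fit(rows, fit_base, n_models=10, seed=0):
    """Train one base model per bootstrap sample."""
    rng = random.Random(seed)
    return [fit_base(bootstrap_sample(rows, rng)) for _ in range(n_models)]


def bagging_predict(models, x):
    """Aggregate the base models by averaging their outputs."""
    return sum(model(x) for model in models) / len(models)


def fit_mean_threshold(rows):
    """Hypothetical base learner: threshold halfway between the class means."""
    pos = [x for x, y in rows if y == 1]
    neg = [x for x, y in rows if y == 0]
    if not pos or not neg:               # degenerate bootstrap sample
        label = 1.0 if pos else 0.0
        return lambda x: label
    cut = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return lambda x: 1.0 if x >= cut else 0.0


# Toy one-factor dataset: low values are class 0, high values are class 1.
rows = [(v, 0) for v in range(0, 10)] + [(v, 1) for v in range(20, 30)]
models = bagging_fit(rows, fit_mean_threshold)
high = bagging_predict(models, 25)
low = bagging_predict(models, 5)
```

The averaged output can be read as a susceptibility index between 0 and 1, which is how an ensemble smooths the decisions of its individual base classifiers.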

Using the training dataset, the FURIA with Bagging and the Decision Tree with Bagging models were trained. The parameter settings for these two models are the same as in Sects. 3.2 and 3.3. The classification accuracy is 86.38 % for the FURIA with Bagging and 87.50 % for the Decision Tree with Bagging. The results from the trained models are shown in Tables 4 and 5.

3.5 Generation of Landslide Susceptibility Maps

The successfully trained models were then applied to calculate landslide susceptibility indexes for all the pixels in the study area. The results were converted into a GIS format and loaded into ArcGIS 10. The landslide susceptibility maps were visualized by means of four susceptibility classes based on the percentage of area (Pradhan and Lee 2010a, b): high (10 %), moderate (10 %), low (20 %), and very low (60 %). For the purpose of visualization, only the two landslide susceptibility maps produced by the FURIA with Bagging and the Decision Tree with Bagging models are shown (Figs. 5, 6).
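The area-based classification into four susceptibility classes can be sketched as follows. The sketch ranks the pixels by susceptibility index and assigns the top 10 % to the high class, the next 10 % to moderate, the next 20 % to low, and the remaining 60 % to very low; the function name is hypothetical, and pixels with tied indexes are split arbitrarily by their sort order.

```python
AREA_SHARES = (("high", 10), ("moderate", 10), ("low", 20), ("very low", 60))


def classify_by_area(indexes):
    """Assign each susceptibility index to a class by its rank-based area share."""
    n = len(indexes)
    order = sorted(range(n), key=lambda i: indexes[i], reverse=True)
    labels = [None] * n
    for rank, cell in enumerate(order):
        cumulative = 0
        for name, percent in AREA_SHARES:
            cumulative += percent
            # Integer comparison avoids floating-point issues at class borders.
            if rank * 100 < cumulative * n:
                labels[cell] = name
                break
    return labels


# Hypothetical susceptibility indexes for 100 pixels.
indexes = [i / 100 for i in range(100)]
labels = classify_by_area(indexes)
```

Because the classes are defined by area share rather than by fixed index thresholds, maps from different models cover the same proportion of the study area in each class and can be compared directly.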

Landslide density analysis (Kanungo et al. 2008) was carried out for the landslide susceptibility maps by overlaying the four susceptibility zones on the landslide inventory map. Ideally, the density value should increase from the very low to the high susceptibility zone. The density graph for the two models in this study is shown in Fig. 7. The result shows a gradual increase in landslide density from the very low susceptibility zone to the high susceptibility zone.

4 Validation and Comparison of Landslide Susceptibility Models

4.1 Model Performance and Evaluation

The performance of the four landslide susceptibility models (FURIA, FURIA with Bagging, Decision Tree, Decision Tree with Bagging) was assessed using several statistical evaluation criteria (Tien Bui et al. 2012a) as follows:

$$Sensitivity = TP/\left( {TP + FN} \right)$$

(9)

$$Specificity = TN/\left( {TN + FP} \right)$$

(10)

$$Accuracy = \left( {TP + TN} \right)/\left( {TP + TN + FN + FP} \right)$$

(11)

$$F\text{-}measure = 2*Precision*Sensitivity/\left( {Precision + Sensitivity} \right)$$

(12)

$$Root\, mean\, squared \,error\, \text{(}RMSE\text{)} = \sqrt {\frac{{(pred_{1} - act_{1} )^{2} + \cdots + (pred_{n} - act_{n} )^{2} }}{n}}$$

(13)

$$Mean\, absolute\, error \,\text{(}MAE\text{)}\, = \,\frac{{|pred_{1} - act_{1} | + \cdots + |pred_{n} - act_{n} |}}{n}$$

(14)

$$\begin{aligned} Kappa\,index\,(\kappa ) & = \frac{{P_{C} - P_{exp} }}{{1 - P_{exp} }} \\ \text{where}\,P_{C} & = \left( {TP + TN} \right)/\left( {TP + TN + FN + FP} \right) \\ P_{exp} & = \left[ {\left( {TP + FN} \right)\left( {TP + FP} \right) + \left( {FP + TN} \right)\left( {FN + TN} \right)} \right]/\left( {TP + TN + FN + FP} \right)^{2} \\ \end{aligned}$$

(15)

True positives (TP) are the number of landslide pixels correctly classified as landslide, and true negatives (TN) are the number of non-landslide pixels correctly classified as non-landslide. False negatives (FN) are the number of landslide pixels classified as non-landslide pixels. False positives (FP) are the number of non-landslide pixels classified as landslide pixels. Precision, TP/(TP + FP), measures the proportion of pixels classified as landslide that are actual landslide occurrences. The F-measure combines precision and sensitivity into their harmonic mean. Act is the actual target value whereas pred is the predicted value. $P_{C}$ is the proportion of pixels that are correctly classified as landslide or non-landslide. $P_{exp}$ is the expected agreement.
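The evaluation criteria can be computed from the four confusion-matrix counts as in the Python sketch below. It takes the F-measure as the harmonic mean of precision and sensitivity, and it uses the standard Cohen's kappa in which the expected agreement is normalised by the squared number of pixels. The function names and the example counts are hypothetical.

```python
from math import sqrt


def confusion_metrics(tp, tn, fp, fn):
    """Sensitivity, specificity, precision, accuracy, F-measure and kappa."""
    n = tp + tn + fp + fn
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / n
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    # Expected chance agreement from the row and column totals.
    p_exp = ((tp + fn) * (tp + fp) + (fp + tn) * (fn + tn)) / n ** 2
    kappa = (accuracy - p_exp) / (1 - p_exp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "accuracy": accuracy,
        "f_measure": f_measure,
        "kappa": kappa,
    }


def rmse(pred, act):
    """Root mean squared error between predicted and actual values."""
    return sqrt(sum((p - a) ** 2 for p, a in zip(pred, act)) / len(act))


def mae(pred, act):
    """Mean absolute error between predicted and actual values."""
    return sum(abs(p - a) for p, a in zip(pred, act)) / len(act)


# Hypothetical counts for a balanced set of 100 landslide and 100 other pixels.
m = confusion_metrics(tp=85, tn=80, fp=20, fn=15)
```

For a dataset with equal numbers of landslide and non-landslide pixels, the expected agreement is 0.5, so kappa reduces to 2 × accuracy − 1. This is consistent with the values reported in this study, for example an accuracy of 84.84 % and a kappa index of 0.697 for FURIA.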

It can be observed that the classification accuracy is high and almost equal for three of the models: the FURIA with Bagging, the Decision Tree, and the Decision Tree with Bagging (Table 4). Accuracy assessment by classes (Table 5) shows that the rate of correctly classified landslide pixels is higher than that of non-landslide pixels for all models.

The reliability of the susceptibility models was measured using the Kappa index (Guzzetti et al. 2006). The Kappa indexes for the FURIA, the FURIA with Bagging, the Decision Tree, and the Decision Tree with Bagging are 0.697, 0.728, 0.736, and 0.750, respectively. These values indicate substantial agreement (Table 6) between the observed and the predicted values. The reliability results are satisfactory compared with other works such as Saito et al. (2009) and Tien Bui et al. (2012a).

Table 6 The range of the kappa index and the corresponding agreement between the model and reality (Cohen 1960)

4.2 Model Validation

The prediction capability of the susceptibility models was evaluated using ROC curves. A ROC curve plots sensitivity against 1 − specificity at different thresholds. Compared with the success-rate and prediction-rate curves (Chung and Fabbri 2003), ROC curves are considered insensitive to the considerable difference between the numbers of landslide and non-landslide pixels. Therefore, ROC curves are considered a more appropriate evaluation and validation tool for landslide models (Van Den Eeckhaut et al. 2009).

The area under the ROC curve (AUC) is used as an important measurement of the landslide model performance. A landslide model will be considered a preferred model if it has a larger AUC value than other models. A perfect model will have an AUC of 1 whereas a random model has an AUC of approximately 0.5.
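One way to compute the AUC without drawing the curve is to use its rank interpretation: the AUC equals the probability that a randomly chosen landslide pixel receives a higher susceptibility index than a randomly chosen non-landslide pixel, with ties counted as one half. The following Python sketch illustrates this; it is a simple pairwise version with a hypothetical function name and toy data, not the procedure used in the study.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of the two classes."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0      # landslide pixel ranked above non-landslide
            elif p == q:
                wins += 0.5      # ties count as half
    return wins / (len(pos) * len(neg))


# Toy susceptibility indexes: label 1 = landslide, 0 = no landslide.
perfect = auc([0.9, 0.8, 0.2, 0.1], [1, 1, 0, 0])
mixed = auc([0.9, 0.4, 0.6, 0.1], [1, 1, 0, 0])
uninformative = auc([0.5, 0.5, 0.5, 0.5], [1, 1, 0, 0])
```

The pairwise loop is quadratic in the number of pixels, so for datasets of the size used here a sort-based implementation would be preferable in practice.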

In this study, ROC curves and AUCs were prepared for each landslide model in two cases: the first used the training dataset and the second used the validation dataset. Since the first case uses the same landslide pixels that were already used to construct the landslide models, its ROC curve and AUC only measure the degree of fit of the model to the training dataset. The result (Fig. 8 and Table 7) shows that the Decision Tree with Bagging has the highest degree of fit (AUC = 0.925), followed by the FURIA with Bagging (AUC = 0.913), the Decision Tree (AUC = 0.908), and the FURIA (AUC = 0.878). The prediction capabilities of the landslide models were obtained in the second case. This case uses the validation dataset, which was not used in the training phase, and therefore shows how well the model and the conditioning factors predict the existing landslides (Pradhan and Lee 2010c). The result (Fig. 9 and Table 8) shows that the FURIA with Bagging has the highest prediction capability (AUC = 0.802). The remaining models have almost equal prediction capability (AUC from 0.773 to 0.783).

Table 7 Area under the curves (AUC) based on the training dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

Table 8 Area under the curves (AUC) based on the validation dataset for the FURIA, the FURIA with bagging, the decision tree, and the decision tree with bagging

4.3 Relative Contribution of the Conditioning Factors

The relative contribution of each conditioning factor to the susceptibility models was estimated by excluding that factor from the models and re-estimating the classification accuracy. The highest accuracy was obtained when all six factors were used (Table 9). Aspect has the highest contribution to the models whereas soil type has the lowest contribution. More details are shown in Table 9.

Table 9 Relative contribution of the conditioning factors

5 Conclusion

Over the last two decades, various methods and techniques for landslide modeling have been used and discussed. However, the FURIA model and the Bagging technique have seldom been applied, and a comparison of FURIA and Decision Tree models and their Bagging ensembles has not been carried out so far. Decision Tree models have only been applied in a limited number of studies. Recent developments in geographic information systems (GIS) and computer science allow users to apply these techniques to very large GIS datasets (Pradhan 2013).

In general, three main steps were used for landslide susceptibility modeling in this study: data preparation, susceptibility analysis, and validation and comparison. In the first step, the landslide inventory map with 172 landslide polygons was constructed. Among them, approximately 70 % (117 cases) were selected for training the models, whereas the remaining 30 % (55 cases) were used for model validation. Landslide conditioning factors were then determined. All maps were prepared with a spatial resolution of 5 m. In the next step, a total of four models were constructed. The results show that the Decision Tree with Bagging (AUC = 0.925) and the FURIA with Bagging (AUC = 0.913) have the highest degree of fit with the training data. They are followed by the Decision Tree (AUC = 0.908) and the FURIA (AUC = 0.878). Regarding the prediction capability, the FURIA with Bagging has the highest value (AUC = 0.802), whereas the other models have almost equal prediction capability (AUC from 0.773 to 0.783).

It is well known that the choice of sampling strategy influences the prediction capability of landslide models (Yilmaz 2010). As shown in Chung and Fabbri (2008), temporal partitioning of landslides is considered to be the best method. However, temporal partitioning is not suitable for this study because the dates of landslide occurrence are unknown. Therefore, the random split method was used. The main disadvantage of this method is that it may overestimate the capability to predict future landslides if the spatial separation between training and validation landslides is small (Brenning 2005).

The selection of conditioning factors is an important task in the assessment of landslide susceptibility and may affect the overall prediction performance of landslide susceptibility models (Pradhan 2013). Although no agreement on universal guidelines has been reached for the selection of conditioning factors (Tien Bui et al. 2012b), factors related to topography, geology, soil types, hydrology, geomorphology, and land use are the most commonly used in landslide analyses (Van Westen et al. 2008). Therefore, six landslide conditioning factors (slope, aspect, lithology, distance to faults, land use, and soil type) were selected for this study.

In conclusion, all four models exhibit reasonably satisfactory performance, and the FURIA with Bagging is the best model in this study. It is important to note that the performance of these landslide models depends not only on the methods but also on the sampling strategy followed and on the quality of the data used. Therefore, the quality of the susceptibility maps produced by the four models can be improved if the quality of the input data increases. The results obtained from this study are valid for shallow landslides. These maps may be useful for natural hazard management policy, planning, and decision-making in landslide-prone areas.

References

Akgun A, Sezer EA, Nefeslioglu HA, Gokceoglu C, Pradhan B (2012) An easy-to-use MATLAB program (MamLand) for the assessment of landslide susceptibility using a Mamdani fuzzy algorithm. Comput Geosci 38:23–34
Althuwaynee OF, Pradhan B, Park HJ, Lee JH (2014) A novel ensemble decision-tree based chi-squared automatic interaction detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping. Landslides (Article online first available). http://dx.doi.org/10.1007/s10346-014-0466-0
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, Belmont
Brenning A (2005) Spatial prediction models for landslide hazards: review, comparison and evaluation. Nat Hazards Earth Syst Sci 5:853–862
Chacon J, Irigaray C, Fernandez T, El Hamdouni R (2006) Engineering geology maps: landslides and geographical information systems. Bull Eng Geol Environ 65:341–411
Chung C-J, Fabbri AG (2008) Predicting landslides for risk analysis—spatial models tested by a cross-validation technique. Geomorphology 94:438–452
Chung CJF, Fabbri AG (2003) Validation of spatial prediction models for landslide hazard mapping. Nat Hazards 30:451–472
Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Measur 20:37–46
Cohen WW (1995) Fast effective rule induction. In: Machine learning: proceedings of the twelfth international conference. Morgan Kaufmann, Lake Tahoe
Glade T, Anderson M, Crozier MJ (2005) Landslide hazard and risk. Wiley, London
Guzzetti F, Carrara A, Cardinali M, Reichenbach P (1999) Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 31:181–216
Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81:166–184
Huhn J, Hullermeier E (2010) An analysis of the FURIA algorithm for fuzzy rule induction. In: Koronacki J, Raś Z, Wierzchoń S, Kacprzyk J (eds) Advances in machine learning I, vol 262. Springer, Berlin, pp 321–344
Huhn J, Hullermeier E (2009) FURIA: an algorithm for unordered fuzzy rule induction. Data Min Knowl Disc 19:293–319
Kanungo D, Arora M, Gupta R, Sarkar S (2008) Landslide risk assessment using concepts of danger pixels and fuzzy set theory in Darjeeling Himalayas. Landslides 5:407–416
Korup O, Gorum T, Hayakawa Y (2012) Without power? Landslide inventories in the face of climate change. Earth Surf Proc Land 37:92–99
Lim TS, Loh WY, Shih YS (2000) A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach Learn 40:203–228
Michael JA, Gordon SL (1997) Data mining technique: for marketing, sales and customer support. Wiley, New York
Pourghasemi H, Pradhan B, Gokceoglu C (2012) Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed, Iran. Nat Hazards 63:965–996
Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365
Pradhan B, Lee S (2010a) Delineation of landslide hazard areas on Penang Island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models. Environ Earth Sci 60:1037–1054
Pradhan B, Lee S (2010b) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Model Softw 25:747–759
Pradhan B, Lee S (2010c) Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia. Landslides 7:13–30
Pradhan B, Sezer EA, Gokceoglu C, Buchroithner MF (2010) Landslide susceptibility mapping by neuro-fuzzy approach in a landslide-prone area (Cameron Highlands, Malaysia). IEEE Trans Geosci Remote Sens 48:4164–4177
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Mateo
Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106
Quinlan JR, Cameron-Jones RM (1993) FOIL: a midterm report. In: European conference on machine learning. Springer, Berlin
Quoc NK, Dan TH, Hung L, Huyen DT (1992) Geological map. In: Binh Gia group (ed), Vietnam Institute of Geosciences and Mineral Resources, Hanoi
Roberds W (2005) Estimating temporal and spatial variability and vulnerability. In: Hungr O, Fell R, Couture R, Eberhardt E (eds) Landslide risk management. Taylor and Francis, London
Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev 33:1–39
Saito H, Nakayama D, Matsuyama H (2009) Comparison of landslide susceptibility based on a decision-tree model and actual landslide occurrence: the Akaishi Mountains, Japan. Geomorphology 109:108–121
Sassa K, Canuti P (2008) Landslides-disaster risk reduction. Springer, Berlin, p 650
Sezer EA, Pradhan B, Gokceoglu C (2011) Manifestation of an adaptive neuro-fuzzy model on landslide susceptibility mapping: Klang valley, Malaysia. Expert Syst Appl 38:8208–8219
Tam VT, Tuy PK, Nam NX, Tuan LC, Tuan ND, Trung ND et al (2006) Geohazard investigation in some key areas of the northern mountainous area of Vietnam for the planning of socio-economic development. Vietnam Institute of Geosciences and Mineral Resources, Hanoi, p 83
Tehrany MS, Pradhan B, Jebur MN (2013) Spatial prediction of flood susceptible areas using rule based decision tree (DT) and ensemble bivariate and multivariate statistical models. J Hydrol 504:69–79. http://dx.doi.org/10.1016/j.jhydrol.2013.09.034
Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012a) Landslide susceptibility assessment in Vietnam using support vector machines, decision tree and Naïve Bayes models. Math Prob Eng. doi:10.1155/2012/974638
Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick O (2013a) Regional prediction of landslide hazard using probability analysis of intense rainfall in the Hoa Binh province, Vietnam. Nat Hazards 2:707–730
Tien Bui D, Ho TC, Revhaug I, Pradhan B, Nguyen DB (2013b) Landslide susceptibility mapping along the National Road 32 of Vietnam using GIS-based J48 decision tree classifier and its ensembles. In: Buchroithner M, Prechtel N, Burghardt D (eds) Cartography from pole to pole. Springer, Berlin, pp 303–317
Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012b) Landslide susceptibility assessment in the Hoa Binh Province of Vietnam: a comparison of the Levenberg–Marquardt and Bayesian regularized neural networks. Geomorphology 171–172:12–29
Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012c) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211
Trawinski K, Cordon O, Quirin A (2011) On designing fuzzy rule-based multiclassification systems by combining FURIA with bagging and feature selection. Int J Uncertainty Fuzziness Knowl Based Syst 19:589–633
Truong PD, Nghi TH, Phuc PN, Quyet HB, The NV (2009) Geological mapping and mineral resource investigation at 1:50 000 scale for Lang Son area. Northern Geological Mapping Division, Hanoi
Van Den Eeckhaut M, Reichenbach P, Guzzetti F, Rossi M, Poesen J (2009) Combined landslide inventory and susceptibility assessment based on different mapping units: an example from the Flemish Ardennes, Belgium. Nat Hazards Earth Syst Sci 9:507–521
Van Westen CJ, Castellanos E, Kuriakose SL (2008) Spatial data for landslide susceptibility, hazard, and vulnerability assessment: an overview. Eng Geol 102:112–131
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, Los Altos
Yilmaz I (2010) The effect of the sampling strategies on the landslide susceptibility mapping by conditional probability and artificial neural networks. Environ Earth Sci 60:505–519
Zhao Y, Zhang Y (2008) Comparison of decision tree methods for finding active objects. Adv Space Res 41:1955–1959

Acknowledgement

This research was supported by the Geomatics Section, Department of Mathematical Sciences and Technology, Norwegian University of Life Sciences, Norway.

Author information

Authors and Affiliations

Department of Mathematical Sciences and Technology, Norwegian University of Life Sciences, P.O. Box 5003IMT, N-1432, Aas, Norway
Dieu Tien Bui & Inge Revhaug
Faculty of Surveying and Mapping, Hanoi University of Mining and Geology, Dong Ngac, Tu Liem, Hanoi, Vietnam
Dieu Tien Bui
Faculty of Information Technology, Hanoi University of Mining and Geology, Dong Ngac, Tu Liem, Hanoi, Vietnam
Chuyen Trung Tran
Faculty of Engineering, Department of Civil Engineering, University Putra Malaysia, 43400, Serdang, Selangor Darul Ehsan, Malaysia
Biswajeet Pradhan

Corresponding author

Correspondence to Dieu Tien Bui.

Editor information

Editors and Affiliations

Hydrological Sciences, ESSIC/NASA GSFC, Greenbelt, Maryland, USA
Prashant K. Srivastava
School of Environmental Sciences, Jawaharlal Nehru University, New Delhi, India
Saumitra Mukherjee
Civil Engineering, Indian Institute of Technology (IIT), New Delhi, India
Manika Gupta
Atmospheric Sciences, CIRA/NOAA, College Park, Maryland, USA
Tanvir Islam

About this chapter

Cite this chapter

Tien Bui, D., Pradhan, B., Revhaug, I., Trung Tran, C. (2014). A Comparative Assessment Between the Application of Fuzzy Unordered Rules Induction Algorithm and J48 Decision Tree Models in Spatial Prediction of Shallow Landslides at Lang Son City, Vietnam. In: Srivastava, P., Mukherjee, S., Gupta, M., Islam, T. (eds) Remote Sensing Applications in Environmental Research. Society of Earth Scientists Series. Springer, Cham. https://doi.org/10.1007/978-3-319-05906-8_6

DOI: https://doi.org/10.1007/978-3-319-05906-8_6
Published: 26 April 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05905-1
Online ISBN: 978-3-319-05906-8
eBook Packages: Earth and Environmental Science, Earth and Environmental Science (R0)

1 Introduction