Comparison of GA-BP and PSO-BP neural network models with initial BP model for rainfall-induced landslides risk assessment in regional scale: a case study in Sichuan, China

Zhu, Chonghao; Zhang, Jianjing; Liu, Yang; Ma, Donghua; Li, Mengfang; Xiang, Bo

doi:10.1007/s11069-019-03806-x

Comparison of GA-BP and PSO-BP neural network models with initial BP model for rainfall-induced landslides risk assessment in regional scale: a case study in Sichuan, China

Original Paper
Published: 03 October 2019

Volume 100, pages 173–204, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Natural Hazards Aims and scope Submit manuscript

Comparison of GA-BP and PSO-BP neural network models with initial BP model for rainfall-induced landslides risk assessment in regional scale: a case study in Sichuan, China

Download PDF

Chonghao Zhu¹,
Jianjing Zhang ORCID: orcid.org/0000-0002-3864-4341¹,
Yang Liu¹,
Donghua Ma¹,
Mengfang Li^1,2 &
…
Bo Xiang²

1799 Accesses
61 Citations
Explore all metrics

Abstract

With the increase in inclement weather conditions, many countries would experience more and more landslide hazards in the process of planning, designing and construction for engineering projects, especially in the mountainous regions. How to quickly and accurately assess potential landslide risk in a large region (> 10,000 km²) is facing challenge due to its complex geological conditions and large amount of landslides in the region. To optimize the accuracy of the existing models for a large region, in this study, the genetic algorithm (GA) and particle swarm optimization (PSO) are, respectively, coupled with the backpropagation (BP) neural network to determine the initial weights and thresholds in the BP neural network, which can be called GA-BP model and PSO-BP model. To show the reliability and accuracy of the new models in large region, the BP, GA-BP and PSO-BP models are evaluated based on root mean square error (RMSE), coefficient of determination (R²), Kappa coefficient (k), receiver operating characteristic (ROC), training time and condition factor weights by using 100 landslide samples from Sichuan Province, China. Results show that the RMSE values of the GA-BP model and the PSO model are, respectively, 22.6% and 5.1% lower than those of the BP model; the R² values of the GA-BP model and the PSO model are, respectively, 24.9% and 6.2% higher than those of the BP model; the k values of the GA-BP model and the PSO model are, respectively, 44.3% and 15.4% higher than those of the BP model, and the areas under ROC of the GA-BP model and the PSO model are, respectively, 32.4% and 9.6% larger than those of the BP model. The GA-BP model and the PSO-BP model have better accuracy in the assessment of the overall risk value and the risk-level classification. The difference of the training time is small, and the sequences of condition factor weights given by the three models are consistent. In general, the GA-BP model is more effective for landslide risk assessment in large region. At last, this study gives proposed models under different engineering conditions, which can increase efficiency of the risk assessment for landslides.

Prediction of flooding in the downstream of the Three Gorges Reservoir based on a back propagation neural network optimized using the AdaBoost algorithm

Article 01 March 2021

Prediction of Landslide Risk Based on Modified Generalized Regression Neural Network Algorithm

A Back-Propagation Neural Network Model Based on Genetic Algorithm for Prediction of Build-Up Rate in Drilling Process

Article 17 April 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Landslides are one of the most serious natural hazards that kill lots of people each year (Kirschbaum et al. 2015; Petley 2011). In order to accurately and quickly assess the landslide risk in a region, regional landslide risk assessment methods are often applied (Wu and Sidle 1995). Different from the risk assessment to a single landslide, which is often based on physical models and needs some specific physical parameters, such as geometry, shear strength (c, ϕ) and moisture content, depending on the model or software used (e.g., Duc 2013; Montgomery and Dietrich 1994; Thanh and De Smedt 2014; Van Westen and Terlien 1996; Gokceoglu and Aksoy 1996), the regional landslide risk assessment method is often based on the statistical models (including machine learning). Because lots of landslides may occur in a region, it is almost impossible to get the detailed physical parameters for each landslide (Van Westen and Terlien 1996). This is why the physical models of landslides are rarely used in the regional landslide risk assessment.

The statistical model is a kind of method based on statistical analysis of existing landslides, and then predicts future landslide risk (Bui et al. 2016). The basic assumption of this method is that the geographical and geological conditions of occurring landslide failures are more likely to occur in future landslides. Therefore, statistical methods usually require a large number of historical landslide data (i.e., landslide samples) to figure out harmful or triggering conditions for landslides (Bui et al. 2012c). The quality of landslide samples, the scale of maps and the features of statistical models together determine the accuracy of the outcomes. The common statistical models and machine learning adopted in landslides risk analysis include support vector machines (SVM) (Kavzoglu et al. 2014b), logistic regression (Atkinson and Massari 1998; Costanzo et al. 2014; Felicisimo et al. 2013; Kavzoglu et al. 2014a; Lee 2005; Pradhan and Lee 2010; Bui et al. 2011; Tunusluoglu et al. 2008), fuzzy logic analysis (Akgun et al. 2012; Ercanoglu and Gokceoglu 2002; Lee 2007; Pourghasemi et al. 2012; Pradhan 2011; Bui et al. 2012b); decision tree (Nefeslioglu et al. 2010; Pradhan 2012; Bui et al. 2012a, 2013a) and BP neural network (Lee et al. 2003a, b, 2004; Lu and Rosenbaum, 2003; Ermini et al. 2005; Gomez and Kavzoglu 2005). The above methods in the studies can get a good accuracy in the region with area of hundreds or thousands km², but how these models perform (such as calculation speed and accuracy) in the region with area of more than tens of thousands km² needs to be clarified. To meet the needs for large regional landslide risk analysis, it is essential to establish landslide risk assessment models which are suitable and effective for large regions, and some tests are also necessary.

Here, the large regional landslide risk analysis model means that the model is suitable for assessing an area over tens of thousands km² and can ensure the accuracy and reduce the time cost. However, most models would face the problems such as accuracy decreasing and computation speed slowing with the assessment region getting larger (Cascini 2008). To solve the above problems, we compared all models discussed above and found that the backpropagation (BP) neural network shows better applicability of the landslides risk assessment in different regions and relies less on the scale of maps. But it is noted that the BP neural network used for landslide risk assessment needs more nodes in input layer and hidden layers. With the number of nodes increasing, the main advantages are: The BP neural network would get better accuracy and be effective for more complex problems. However, main disadvantages also appear: The initial weights and thresholds that are generated randomly between nodes in the neural network may reduce its accuracy or cause unreliability to the assessment results.

The objectives of this work are: (1) Use existing algorithms to improve the BP neural network for landslide risk assessment and clarify their applicability to the large and common region (more than tens of thousands km²) and (2) do a comprehensive landslide risk assessment of Sichuan Province, China, and draw landslide risk zoning maps. To achieve these objectives, this paper adopts genetic algorithm (GA) (Belew et al. 1992) and particle swarm optimization (PSO) (Changuhan et al. 2015; Aydln et al. 2013) for optimizing the initial weights and thresholds determination in the BP neural network, called GA-BP model and PSO-BP model for landslides risk analysis in the large region. Afterward, based on 100 typical historical landslides in Sichuan Province, China, this paper compares the accuracy of the BP, the GA-BP and the PSO-BP neural network models in the assessment of the landslide risk in Sichuan Province by using root mean square error (RMSE), coefficient of determination (R²), Kappa coefficient, receiver operating characteristic (ROC), training time and weights of condition factors. And then, according to the risk value from the three models, the risk maps of Sichuan Province are performed in the geographic information system (GIS), which can provide the fundamental maps of landslide risk for the engineering planning and construction of mountainous regions in Sichuan Province. At last, this study gives proposed models under different engineering needs, which increase the efficiency of the risk assessment for landslides in the large region.

2 Methodology

2.1 Landslide risk assessment model

2.1.1 BP artificial neural network model

Backpropagation (BP) neural network is a multilayer feedforward network which is trained by the error inverse propagation algorithm (i.e., BP algorithm), and it is one of the most widely adopted neural networks. The basic idea of the BP algorithm is that the learning process consists of both the forward propagation of signals and the reverse propagation of errors. The BP neural network has three geometric topologies: input layer, hidden layer and output layer (see Fig. 1a). Our landslide risk analysis models in this paper are all based on the BP neural network. The condition factors in landslide risk assessment determine the number of nodes of the inputs layers in the BP neural network, and the risk assessment values determine the number of nodes of the output layers (see Fig. 1b). The weights of the BP neural network can be divided into two parts, one being the weights from input layer to hidden layer (w_ik) and the other being the weights from the hidden layer to the output layer (w_kj), and the thresholds are same as the weights, called threshold₁ and threshold₂, respectively. The weights and thresholds are updated over and over again during the training to fit the complex nonlinear relationships between condition factors and the risk, in which the initial weights and thresholds are important. However, the initial weights and thresholds are usually randomly generated and this can lead to unreliability of assessment results. To overcome this drawback, in this paper, the initial weights and thresholds are decided by the optimization algorithms, which are introduced as follows.

The calculation process of the BP neural network is drawn in Fig. 1c, and the other necessary parameters used in the BP neural network are shown in Table 1.

Table 1 The main parameters of the BP neural network

Comparison of GA-BP and PSO-BP neural network models with initial BP model for rainfall-induced landslides risk assessment in regional scale: a case study in Sichuan, China

Abstract

Similar content being viewed by others

Prediction of flooding in the downstream of the Three Gorges Reservoir based on a back propagation neural network optimized using the AdaBoost algorithm

Prediction of Landslide Risk Based on Modified Generalized Regression Neural Network Algorithm

A Back-Propagation Neural Network Model Based on Genetic Algorithm for Prediction of Build-Up Rate in Drilling Process

Explore related subjects

1 Introduction

2 Methodology

2.1 Landslide risk assessment model

2.1.1 BP artificial neural network model

2.1.2 GA-BP neural network model

2.1.3 PSO-BP neural network model

2.2 Indices of evaluating the proposed models

2.2.1 Indices of evaluating the overall risk accuracy: root mean square error (RMSE) and decision coefficient (R2)

2.2.2 Indices of evaluating accuracy of risk-level classification: Kappa coefficient

2.2.3 Receiver operating characteristic curve

2.2.4 Weights computation of condition factors

2.2.5 Training time

3 Study region and spatial database

3.1 Topography and climate conditions in the study region

3.2 Spatial database used in the study

3.2.1 Collection of landslide samples

3.2.2 Condition factor selection and fundamental maps preparation

3.2.3 Data pre-processing

3.2.4 Correlation of six condition factors

3.2.5 Preparation of training samples and validating samples

4 Results and analysis

4.1 Risk assessments of BP, GA-BP and PSO-BP models

4.1.1 The evaluation of the overall risk values from three models

4.1.2 The evaluation of risk levels from three models

4.2 Landslide risk zoning maps

4.3 The validation of proposed models in other region

4.4 Applicability of three models with more hidden layers

5 Discussion

5.1 Reliability about weights of condition factors

5.2 Errors

5.3 The performance of the GA-BP and PSO-BP model

5.4 Limitations of the research

6 Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

2.2.1 Indices of evaluating the overall risk accuracy: root mean square error (RMSE) and decision coefficient (R²)