Comparative analysis of multiple conventional neural networks for landslide susceptibility mapping

Aslam, Bilal; Zafar, Adeel; Khalil, Umer

doi:10.1007/s11069-022-05570-x

Comparative analysis of multiple conventional neural networks for landslide susceptibility mapping

Original Paper
Published: 29 August 2022

Volume 115, pages 673–707, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Natural Hazards Aims and scope Submit manuscript

Comparative analysis of multiple conventional neural networks for landslide susceptibility mapping

Download PDF

1450 Accesses
33 Citations
Explore all metrics

Abstract

In landslide susceptible mountainous regions, the precondition for avoiding and alleviating perilous dangers is the susceptibility mapping of the landslide. In northern Pakistan, landslides due to vigorous seismic zones, monsoon rainfall, extremely sheer slopes, and unfavorable geological conditions present a considerable threat to the mountain areas. This study targets and advances the research in mapping landslide susceptibility in northern Pakistan (Mansehra and Muzaffarabad districts). The central objective of the analysis is to analyze different convolutional neural network (CNN) frameworks and residual network (ResNet) that were constructed by developing distinct data representation algorithms for landslide susceptibility assessment and compare the results. This study considers sixteen landslide conditioning factors related to the incident of landslides centered on the literature review and geologic attributes of the pondered area. The marked historical landslide positions in the deliberated area were arbitrarily split into training and testing datasets, with the earlier containing 70% and the former having 30% of the total datasets. Several commonly exploited measures were used to validate the CNN architectures and ResNet by comparing them with the most prevalent machine learning (ML) and deep learning (DL) techniques. The outcomes of this study revealed that the proportions of regions having very high susceptibility in all the landslide susceptibility maps of the ResNet model and CNN models are considerably alike and less than 20%, which implies that the CNN models are significantly helpful in managing and preventing landslides as to the orthodox techniques. Moreover, the suggested CNN architectures and ResNet attained greater or similar prediction accuracy than other orthodox ML and DL techniques. The values of OA (overall accuracy) and MCC (Matthew’s correlation coefficient) of proposed CNNs and ResNet were greater than those of the optimized SVM (support vector machine) and DNN (deep neural network).

Comparison of optimized data-driven models for landslide susceptibility mapping

Article 12 April 2023

Development of integrated deep learning and machine learning algorithm for the assessment of landslide hazard potential

Article 13 August 2021

Deep learning-based landslide susceptibility mapping

Article Open access 16 December 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Landslides are one of those devastating natural disasters that cause property and human life loss in many countries worldwide (Maqsoom et al. 2021). It is critical to assess landslide-prone areas to manage and mitigate landslide disasters (Hong et al. 2016a). Assessing landslide susceptibility is a vital way to identify the most vulnerable regions to landslides and the critical areas for preventing and managing landslides (Guzzetti et al. 2006, Flentje et al. 2007); thus, it is a valuable method and way to avoid landslides (Guzzetti et al. 2012).

Different statistical-based methods were frequently employed for mapping landslide susceptibility in areas around the world. The existing studies on landslide susceptibility analysis primarily utilized the following conventional regression analysis methods: logistical regression (LR) (Martinović et al. 2016, Tanyas et al. 2019, Nefeslioglu and Gorum 2020), fisher discriminant analysis (FDA) (He et al. 2012, Gupta et al. 2018, Pham and Prakash 2019a, Wang et al. 2020), index of entropy models (Youssef et al. 2015, Wang et al. 2016b, Mondal and Mandal 2019), frequency ratio (Lee et al. 2015, Li et al. 2017, Mandal et al. 2018b), weights of evidence (WOE) (Wang et al. 2016a, Tsangaratos et al. 2017, Bacha et al. 2018, Pamela et al. 2018).

In 25 earthquake-induced landslide (EQIL) areas, Tanyas et al. (2019) studied 64 EQIL inventories and implemented LR along with a slope unit for assessing landslide susceptibility. In one earthquake event area (Wenchuan event), the model obtained the area under curve (AUC) accuracy of 0.88. Nefeslioglu and Gorum (2020) used LR to reveal the landslide hazards through landslide susceptibility assessment of Melen Dam reservoir and achieved better results of landslide spatial probability. Gupta et al. (2018) employed an information value technique for computing the weights of different influencing factors and afterward utilized the weights in FDA and binary LR to assess landslide susceptibility. The prediction precision of the practiced methods was evaluated through Heidke skill score, and the obtained scores of FDA and LR were 0.89 and 0.90, correspondingly.

Furthermore, for evaluating landslide susceptibility, Barella et al. (2019) contrasted seven statistical techniques: likelihood ratio, Bayesian model, landslide density, discriminant analysis, information value, WOE, and LR, and in the accuracy assessment, the WOE technique outperformed the other methods. A landslide inventory is typically utilized as the predictive variable by the orthodox regression analysis techniques, and statistical regression models are established to project the possibility of landslide incidence. Nevertheless, the factor selection and weigh assignment processes are partial to some extent, so these techniques partly depend on expert experience (Kanungo et al. 2011, Gupta et al. 2018).

Moreover, conventional machine learning (ML) and ensemble techniques have also been used to evaluate landslide susceptibility. These techniques include support vector machine (SVM) (Tien Bui et al. 2019, Yu et al. 2019, Pham et al. 2019b), artificial neural network (ANN) (Chen et al. 2017a, Polykretis and Chalkias 2018, Sevgen et al. 2019, Aslam et al. 2022), genetic algorithm-SVM (Ramachandra et al. 2013, Niu et al. 2014) and random forest (RF) (Kim et al. 2018, Sevgen et al. 2019, Aslam et al. 2022). Dou et al. (2020) used four techniques, namely SVM, SVM-Bagging, SVM-Stacking, and SVM-Boosting, for evaluating landslide susceptibility. The SVM-Stacking model showed the poorest functioning while the SVM-boosting model topped all the models. Aslam et al. 2022 assessed and compared the proficiencies of six advanced ML techniques for mapping landslide susceptibility, including linear discriminant analysis, quadratic discriminant analysis, ANN, naive Bayes, multivariate adaptive regression spline, and RF, and discovered that ANN outperformed other techniques.

Zhao and Chen (2020) implemented four ensemble approaches for modeling landslide susceptibility: bagging-functional trees, functional trees, dagging-functional trees, and rotation forest-functional trees. They observed that the bagging-functional trees with an AUC accuracy of 0.804 outperformed the other techniques. Tien Bui et al. (2019) utilized an SVM classifier in combination with four ensemble techniques, namely rotation forest (RF), bagging (BA), random subspace (RS), and AdaBoost (AB), to formulate novel ensemble models for evaluating landslide susceptibility and discovered that the RS-SVM model is superior in the prediction accuracy as compared to the models of SVM, RF-SVM, AB-SVM, and BA-SVM. The orthodox ML techniques can assist the comprehensive analysis of numerous conditioning factors and comparatively illustrate the nonlinear correlation among the conditioning factors and landslide susceptibility satisfactorily; hence, they can obtain high accuracies (Bui et al. 2016).

More recently, various deep learning (DL) techniques have been practiced for evaluating landslide susceptibility. However, this research area is still going through many developments and advancements as presently, more research is focused on DL techniques to determine landslide susceptibility. Wang et al. (2020) assessed landslide susceptibility using recurrent neural networks (RNNs) in Yongxin County, China, and indicated that the RNN system is helpful for mapping landslide susceptibility. Wang et al. 2019 utilized convolutional neural networks (CNNs) for mapping landslide susceptibility in Yanshan County, China, showing that CNNs are better than conventional ML methods.

Compared to traditional ML techniques, DL techniques have more sophisticated structures (Ronoud and Asadi 2019). They thus are more capable of explaining a complex nonlinear problem, such as a landslide system (Cao et al. 2019). Moreover, due to the competent study approach, a DL technique can achieve superior generalization capabilities to an orthodox ML technique (Duo et al. 2019, Liu et al. 2019). Therefore, several researchers have integrated the DL techniques with ML techniques to enhance the performance of ML techniques. For instance, Aslam et al. 2021 recently combined a CNN with three traditional ML classifiers, SVM, LR, and RF, to assess landslide susceptibility of Mansehra and Muzaffarabad districts in northern Pakistan. The authors established that the integration of CNN with RF, SVM, and LR could efficiently enhance the prediction performances of the ML classifiers. Nevertheless, the accuracy of a technique for evaluating landslide susceptibility is highly dependent on a specific area, and the accuracy of a technique may differ when used for various areas, which still needs to be illustrated (Lagomarsino et al. 2017).

Moreover, several studies (Wang et al. 2019, Wang et al. 2020a, b, Yi et al. 2020, Youssef and Pourghasemi 2021, Aslam et al. 2021, Aslam et al. 2022) have established that performing feature selection plays a vital role in landslide susceptibility assessment as it helps to remove redundant features and only to retain the valuable features. Yi et al. (2020) conducted a study for mapping landslide susceptibility utilizing multiscale sampling strategy and CNN in Jiuzhaigou region, China, and used information gain ratio and multicollinearity analysis for shortlisting useful features for the modeling process. Youssef and Pourghasemi (2021) used RF techniques to assess the importance of individual conditioning factors while mapping landslide susceptibility for Abha Basin, Asir Region, Saudi Arabia using ML algorithms. Aslam et al. (2022) have also used RF for feature selection when mapping the landslide susceptibility in Pakistan. Wang et al. 2019 have used gain ratio (GR) and multicollinearity for selecting useful features when performing landslide susceptibility mapping in China. Moreover, CNN has also been used for choosing useful features by Wang et al. (2020a, b) and Aslam et al. (2021) during the landslide susceptibility mapping in China and Pakistan, respectively.

The geographical attributes, monsoon rains, and earthquakes (Dai et al. 2002, Crosta 2004) triggered by the active faults vigorously endanger the mountainous regions of Pakistan to landslide problems. This fact was truly witnessed during the October 8, 2005, Kashmir earthquake that jolted the region of Kashmir and northern areas of Pakistan. The region affected by this earthquake was more than 30,000 km², and it caused thousands of landslides, resulting in 25,500 fatalities (Kamp et al. 2008, Owen et al. 2008). Various researchers have carried out different studies evaluating the induced landslides due to this earthquake (Owen et al. 2008, Saba et al. 2010, Shafique, van der Meijde et al. 2016) and to map the landslide susceptibility of the October 8, 2005, earthquake-affected or adjacent areas (Kamp et al. 2008, Basharat et al. 2016, Bibi et al. 2016, Torizin et al. 2017, Khan et al. 2019). However, these studies, which targeted October 8, 2005, earthquake-affected or adjacent areas, have some limitations as they have primarily used conventional decision-making and quantitative techniques, which are not as accurate as orthodox ML or novel DL techniques. Recently, Aslam et al. (2021) and Aslam et al. (2022) made an effort to map landslide susceptibility in the Mansehra and Muzaffarabad districts in northern Pakistan and fill the existing research gap. In contrast to the previous studies, Aslam et al. (2021), and Aslam et al. (2022) used conventional ML and novel DL techniques and achieved promising results. Moreover, Aslam et al. (2021), and Aslam et al. (2022) used a broad range of landslide conditioning factors and well-known feature selection techniques to select useful features for assessing landslide susceptibility. However, several researchers are presently exploring multiple DL techniques to assess landslide susceptibility; as this field is progressing day by day, the authors efforted to extend the research in this direction further.

The present study aims to further explore the potential of different DL techniques by performing landslide susceptibility mapping for the Mansehra and Muzaffarabad districts in northern Pakistan using multiple CNN frameworks and residual network (ResNet). The current work has subsequent developments to the existing work on landslide susceptibility mapping in the area. First, it uses different advanced DL techniques to obtain high precision in landslide susceptibility assessment. It compares these techniques with orthodox DL and ML techniques to establish the most superior model for the desired objective. The used novel DL techniques were ResNet and different architectures of CNN. In contrast, for comparison purposes, the used orthodox DL technique was DNN, and conventional ML techniques were SVM and LR. Second, it uses two feature selection techniques to select the more valuable features and eliminate redundant features. All these developments are significant contributions of the present study to the current work in the proposed area.

2 Study area

The study area is enclosed by the administrative borders of the Mansehra and Muzaffarabad districts and encompasses an area of around 6220 km². The Mansehra district is situated in Khyber Pakhtunkhwa province, whereas the Muzaffarabad district is located in Pakistan’s administrated territory of Azad Jammu and Kashmir. The epicenter of the October 8, 2005, earthquake was detected in the vicinity of Muzaffarabad city, the regional capital of Azad Jammu and Kashmir. On the southern side, the area is characterized by hilly relief. On the southwestern side, primarily by plane to mountainous areas and by an elevated mountain range area on the eastern side. Most prominently, the mountainous areas around the Kaghan and Naran valleys in the Mansehra district are incredibly elevated. The area’s elevation ranges from around 360 m in the plains to approximately 5276 m in the hills. The map of the study area showing the elevation of the area and the past landslide points is presented in Fig. 1.

The Indus River, with its two major tributaries, the Kunhar River and the Siran River, is the primary drainage of the Mansehra district. The Indus River crosses the Mansehra district from north to south along the western edge. In contrast, the Jhelum River is the primary drainage of the Muzaffarabad district and has two tributaries, specifically the Kunhar River and the Neelum River. Creating deep antecedent valleys, these rivers flow westward before gushing southward beside wider valleys to the Indo-Gangetic Plain. During the monsoon spell in the summer, the temporary variations in the river discharge regime often trigger sporadic instant floods in the area.

The area’s weather conditions are pretty variable, owing to its topography. The area has a subtropical highland climate. During the summer, the weather is warm, while it is cold in the winter. Nevertheless, the northeastern side of the Mansehra district, primarily the Kaghan valley, has cold weather in summer and extremely cold in winter due to heavy snowfall in the mountains. In this area, the warmest month of June is the highest in the Muzaffarabad district, with average maximum and minimum conditions of 37.6 and 22.1 and 35 and 21 °C, respectively, in the Mansehra district. For the county of Muzaffarabad and Mansehra, January is the coolest month, with a high and a minimum mean temperature of 15.9 and 3.2 °C, respectively.

The rainfall in the area is significant; what is more, it receives rainfall even through the driest month. The average yearly rainfall in the Muzaffarabad district is about 1500 mm, whereas the Mansehra district receives between 1400 and 1800 mm rainfall on average per year. Out of the average yearly rainfall, almost one-third happens through the monsoon season, which lasts from the end of June until late August. This extensive rainfall often causes severe flooding and landslides, particularly debris flows. Through the winter, the elevated mountainous areas receive rainfall as snow. The area receives little rain during the spring, but the slopes get plentiful surface water through snowmelt. This infiltrating water increases the groundwater table height, thus resulting in erosion.

Geologically this region contains the Hazara-Kashmir Syntax, which borders on the Main Thrust and is an area of significant reduction and elevation of crustaceans (Kazmi and Jan 1997; Hussain and Yeats 2009). Overall, the terrain steepness, along with extreme rainfall through monsoon spells, constant swift river slitting, intermittent earthquake jolting, and anthropogenic impacts as destabilizing of slopes during the construction process, make this area extremely prone to slope failures.

3 Materials and methods

The detailed methodology is presented with the help of a flow chart in Fig. 2 and is explained in the following sections.

4 Landslide inventory mapping

A landslide inventory map comprises past and previously occurred landslide locations (Pham et al. 2016). It contains information regarding landslides’ type, locality, movement, and physical attributes (Rosi et al. 2018). Moreover, it is an important data source for spatial prediction of landslides (Pham et al. 2016) and for understanding the association between landslide incidence and the association with influencing factors (Jiao et al. 2019). High-resolution remotely sensed images could be visually inspected for interpreting landslide inventory mapping (Tsangaratos et al. 2017, Pham et al. 2017b, Pham et al. 2017c). Therefore, for this analysis, LANSAT-8 images were taken to prepare the region’s landslide inventory map with 3251 previous locations. All of these landslides represent the geo-environmental situation of landslide zones. 2276 locations (70%), including the outstanding 975 (30%) locations, were arbitrarily selected for training and testing purposes. The splitting was based on previous studies from the literature (Wang et al. 2019, Wang et al. 2020a, b, Yi et al. 2020, Youssef and Pourghasemi 2021, Aslam et al. 2021, Aslam et al. 2022).

Moreover, 3251 non-landslide locations were randomly selected to maintain the class balance and were split into the same proportions (70 and 30%) for particular purposes. The polygon outlines were drawn for clearly visible landslides on the satellite imagery. These polygons were then used for the measurement of the space and the validation of the final susceptibility maps. The polygons were transformed and displayed as points on the study area map shown in Fig. 1 because they had unusually low visibility due to the limited size of the inventory map.

5 Landslide conditioning factors

In order to map the susceptibility to landslides, it is typically presumed that future landslides would take into account the same environmental factors which have caused previous landslides. These landslide environmental factors are essential to mapping the susceptibility to landslides. This study uses thematic maps of several landslide factors that influence the susceptibility of landslides in the considered region. In addition to the environmental conditions and available data for the area under discussion, the conditioning factors in the current analysis have been based on the literature review and existing landslide susceptibility studies in the area under consideration (Kamp et al. 2008, Owen et al. 2008, Basharat et al. 2016, Bibi et al. 2016, Shafique et al. 2016, Torizin et al. 2017, Tsangaratos et al. 2017, Pham et al. 2017b, Khan et al. 2019, Fang et al. 2020a, b, Wang et al. 2020). The selected landslide conditioning factors include road density, stream power index (SPI), topographic wetness index (TWI), slope, fault density, elevation, lithology, sediment transport index (STI), aspect, profile curvature, earthquake, normalized difference vegetation index (NDVI), land cover classification system (LCCS), soil data, and plan curvature.

ASTER GDEM (DEM) data with a spatial resolution of 30 m were used to obtain geomorphological factors of elevation, slope, SPI, TWI, aspect, profile curvature, STI, and plan curvature. Data from LANDSAT-8 satellite images with a spatial resolution of 30 m were used to obtain the maps of NDVI and LCCS. The factor of NDVI was obtained using the Infrared (IR) and Red (R) bands in the following formula: NDVI = (IR−R)/(IR + R), (Hong, Pourghasemi et al. 2016b). The annual average rainfall data for the last decade was acquired from the Pakistan Meteorological Department (PMD) for preparing the rainfall map. The seismicity map of Pakistan was used for the earthquake map.

The geological and topographical factors were obtained from the geological and topographical maps of Pakistan correspondingly. The fault density and lithology factors were obtained using the scale of 1:500,000 from the geological maps. The road density factor was defined using the scale of 1:200,000 from the topographical maps. The soil data were acquired from the Soil Survey of Pakistan. All the prepared maps of the considered factors were adapted to the same scale as the obtained DEM (30 m of spatial resolution) for further assessment.

Moreover, we have tried our utmost to eliminate all the possible error-causing factors by taking protective measures such as for the datasets which are geo-temporal and have a seasonal effect, such as rainfall (rainfall data for a decade) and NDVI (NDVI data for one year); we have taken the mean to reduce the ambiguity. Additionally, for those datasets which are standard, we have used the best available datasets to minimize the error. All these efforts were in the wake of obtaining efficient prediction results and producing reliable susceptibility maps.

6 Selection of conditioning factors

The selection of conditioning factors as various conditioning factors affect landslides is one of the main steps in landslide susceptibility mapping. Therefore, it is important to choose suitable features in order to yield a comprehensible landslide susceptibility map. The modeling process might become intricate owing to the extreme dimensionality linked with the training and testing datasets. In the meantime, poor modeling precisions always occur due to the curse of dimensionality. The process of selecting features is useful as it uses recognized selection techniques and provides high-quality information. In addition, the unnecessary and surplus characteristics of the selected features can be eliminated during this stage. In the current study, two feature selection techniques, namely GR and multicollinearity analysis, are employed. The details about these techniques are provided next.

6.1 GR

For selecting an ideal subset to enhance the prediction performance for mapping the landslide susceptibility, the feature selection technique of GR (Dash and Liu 1997) was utilized. A feature at every node of the decision tree is selected by using the information gain measure. Subsequently, to avoid bias, GR, which is an augmentation of information gain, is anticipated. For better understanding, the GR method is presented followingly. Assuming that S is a training data plus n is the overall dataset, then the projected statistics are offered by the following comparison.

$$H\left(S\right)=-\sum _{i=1}^{n}{p}_{i}{\text{log}}_{2}\left({p}_{i}\right)$$

where p_i signifies the possibility that class Ci holds a sample, the feature F has m values, and the mean entropy related to it is provided by the following equation.

$$E\left(F\right)=-\sum _{i=1}^{n}{p}_{i}H\left(S\right)$$

then the information gain on feature F is

$$ {\text{Gain}}\left( F \right) = H\left( S \right) - E\left( F \right) $$

The prospective information attained by distributing S into m parts relating to m outcomes on feature F is represented by the split information, and it can be obtained by using the following formula.

$$ {\text{Split}}\;{\text{Info}}_{A} \left( S \right) = - \mathop \sum \limits_{{i = 1}}^{m} X_{i} \left( {\frac{{S_{i} }}{S}} \right)\log _{2} \left( {\frac{{S_{i} }}{S}} \right) $$

Lastly, the GR is characterized by the following equation:

$$ {\text{Gain}}\;{\text{Ratio}} = \frac{{{\text{Gain}}\left( F \right)}}{{{\text{Split}}\;{\text{Info}}_{A} \left( S \right)}} $$

The important association of conditioning factors with landslide incidence is revealed by the average merit (AM) derived from this technique. A factor is regarded as an irrelevant feature if the AM value is equal to or less than 0, and it should be excluded from the further modeling process prediction. On the contrary, the remaining factors are suitable to be used for further assessment.

7 Multicollinearity analysis

In order to measure the relationship between the landslide conditioning factors, multicollinearity analysis was employed. Multicollinearity is a statistical phenomenon that exhibits an excessive correlation among two or more predictor variables in a multiple regression model (O’brien 2007). This research uses the multicollinearity of conditioning factors to classify resistance (TL) and inflation variance factor (VIF). Assume that the x = (x₁, x₂…., x_n) symbolizes a particular separate variable set and that the ${R}_{j}^{2}$ is the code for the independent jth variable x_j that is reverted to all of the remaining variables of the model. The VIF value is computed by utilizing the following relation:

$$\text{V}\text{I}\text{F}=\frac{1}{1-{R}_{j}^{2}}$$

The extent of linear relation among independent variables is represented by the TL value, which is the multiplicative inverse of the VIF value. If a factor exhibits a VIF value of more than 10 or a TL value of less than 0.1, it exhibits multicollinearity and ought to be eliminated from the further modeling process.

8 Methods

Prior to employing DL methods for landslide susceptibility mapping, all the conditioning factors were initially assembled collectively, so the whole considered region can be regarded as a “multi-channel image.” Every single factor signifies a particular channel, and the final objective is to categorize every pixel in this “image.” The primary stage is to build and train an original DL model. The input layer of an original DL model comprises numerous neurons. Each neuron indicates a landslide conditioning factor. A DL can automatically extract valuable features from the original data, and then these futures are used to get the final modeling output. It is a two-stage process of feature extraction and classification. The structure of the model is first built and is trained by exploiting the initial training dataset. The trained model is then used for modeling purposes.

9 ResNet

Convolutional networks utilized by the computer vision fraternity have been increasing deeper every year since Krizhevsky et al. (2012) suggested AlexNet in 2012. Here, depth describes the sum of layers in a network, while width describes the sum of kernels of each layer. Recently proposed ResNets get state-of-the-art performance and permit training of exceedingly deep networks up to over 1000 layers (He et al. 2016). ResNet is the deepest network in the literature, with 1202 trainable layers (He, Zhang et al. 2016). Tiny images in the CIFAR-10 dataset were used to train this 1202-layer ResNet (Krizhevsky and Hinton 2009). The image size here is vital, as it indicates that the size of subsequent feature maps is comparatively small, which is crucial in practice to train exceptionally deep models. Like highway networks, ResNets make use of identity shortcut connections that facilitate the movement of information through layers devoid of attenuation that would be instigated by various stacked nonlinear transformations, producing enhanced optimization (Srivastava et al. 2015). Shortcut connections in ResNets are not gated, and untransformed input is constantly spread. He et al. 2016 have established an awe-inspiring empirical performance of ResNets.

9.1 CNN

CNNs are proficient in visual recognition and have many coalescents, maximum pooling, and fully linked layers. A CNN model is a class of feedforward NN (artificial neurons of this NN respond to a fraction of the adjacent elements), demonstrating vigorous execution in visual image assessment (Girshick 2015). This indicates that a CNN is a variant of a multilayer perceptron containing one or more convolution, max pooling, and fully connected layers (Shin et al. 2016). The structure of a conventional CNN is presented in Fig. 3. A conventional CNN always contains input, convolutional, max pooling, fully connected, and output layers. The input layer is basically an m × n matrix, and each element in this matrix has a feature value.

Consequently, the input parameter can be characterized as a 2D plot. There are several convolutionary units in every convolutionary layer. In order to optimize the parameters of every unit, a backpropagation algorithm is used. Overall usage aims to attain various input layer characteristics (Sharif Razavian et al. 2014). The initial convolution layer could give only some low-level characteristics such as corners, edges, and lines. Other convolutionary layers can learn more complex representations from these low-level characteristics. Pooling is a grave operation in the CNN strategy (Szegedy et al. 2015). The dimensionality of feature maps is reduced without modifying their depth. This reduces sampling. Max pooling is the most common manipulation in different approaches to pooling. It seeks to separate the feature maps and generate the maximum value for each region into several quadrangular areas. In addition, the dimensionality of the data can be continuously reduced, thereby reducing calculation cost and the number of parameters. This avoids the problem of overfitting. The fully connected layer arranges the achieved representations, and the output layer produces classification results to reduce the loss of feature information. The position of detected characteristics to other characteristics should be noted to be more significant than the actual position, which is unique to CNN.

10 The proposed CNN architectures

This work seeks to create a CNN system to map landslide susceptibility. The present landslide data may not be well adapted to the CNN architectures because of the various representations that may influence the susceptibility outcomes. Consequently, different architectures of CNN were designed to match multiple data representations. The following subparagraphs describe the four different ways of data representation for constructing CNN architectures. The CNN and the data representation algorithms 1D, 2D, and 3D are referred to as CNN-1D, CNN-2D, and CNN-3D.

11 LeNet-5

LeNet-5 is a CNN structure that can be used to recognize a handwritten digit and can solve several visual problems effectively (LeCun et al. 1995). However, LeNet-5 cannot be used directly to measure landslide susceptibility. In this study, we, therefore, proposed multiple CNN architectures that would evaluate landslide susceptibility and afterward compare it to the landslide susceptibility from the orthodox LeNet-5 CNN structure.

A commonly employed LeNet-5 consists of eight layers (LeCun et al. 1998). Given an input data as n × n, followed by a convolutional layer C1 with six (n – 4) × (n – 4) feature maps and a 5 × 5 neighborhood in the input map is linked to every element in the feature maps. Having six $\frac{{\left(n-4\right)}^{2}}{{2}^{2}}$ feature maps, a max pooling layer S2 is for subsampling. The kernel size of max pooling is 2 × 2, and every element in the feature maps is linked to the preceding layer. With sixteen $\frac{{\left(n-12\right)}^{2}}{{2}^{2}}$ feature maps, C3 is another convolutional layer, and each element of the layer is linked to a 5 × 5 neighborhood in the former layer. With sixteen $\frac{{\left(n-12\right)}^{2}}{{4}^{2}}$ feature maps, S4 is another layer that performs the max pooling process. F5 and F6 are fully connected layers with correspondingly 120 and 84 neural units. Lastly, two neural units are produced in the output layer to show the binary classification outcomes as landslide and non-landslide. The architecture of LeNet-5 for n = 24 is shown in Fig. 4.

11.1 CNN-1D

The input data can be considered an example of mapping landslide susceptibility such that each pixel possesses many landslide conditions. As a consequence, each input data grid cell with a length defined by the number of conditioning factors is described by a column vector. Each element of this vector also refers to a landslide conditioning function. A 1D CNN structure has been created to analyze landslide susceptibility based directly on landslide conditioning factors’ knowledge. One of any convolution, max pooling, and completely linked layers includes the 1D CNN architecture. When n landslide conditioning factors are present in the data input, the convoluted layer of m to 1 size N kernels filters the data input, then N feature vectors of length (n – m + 1). This layer is often used. Each element in the vector function is connected in the input vector to an m of the 1 neighborhood. The size of the max pooling layer is a × 1, and the consequent outcomes are composed of N vectors having a length of $\left[\frac{n-m+1}{a}\right]$. To represent the extracted features, the fully connected layer having k neural units comprehends the preceding layer. Lastly, two neural units are produced in the output layer to solve a binary classification problem. For n = 15, N = 20, m=3, a= 2 and k=50, the architecture of CNN-1D is illustrated in Fig. 5.

11.2 CNN-2D

The CNN technique was efficient in the processing of images. Still, the start should be by transforming a 1D input grid cell (vector) composed of several features into a 2D matrix in order to use this technique for mapping landslide susceptibility. The number of landslide conditioning factors was compared in the present study with characteristic values of each factor, and the highest of both was the 2D matrix dimension. For example, neither factor exceeds the overall number of factors by its characteristic values (16). Consequently, for each grid cell, we generated a matrix of 16 to 16. Figures 6 and 7 show the conversion handling of a 1D grid (vector) into a 2D matrix. Explicitly the item value in this matrix is assigned a value of 1 for each column vector. The rest of the vector unit values are given 0 for the associated characteristic value.

The 2D CNN structure proposed consists of one convolutionary and one max pooling layer. A dropout layer follows every layer of convolution. A transformation from n to n matrix is supposed to take place from every cell unit in the grid landslide. In the initial convolutionary layer, the input data are tested using N kernels m to m, and thus this layer includes 20 (n – m + 1) corresponding maps. (n – m + 1) function maps. Each grid cell in the function maps is associated with the m to the area around the input map. The dropout manipulation briefly leaves the NN groups, depending on a particular probability, through the training of CNN. This process resolves the over-fitting problem and enhances classification precisions. A drop manipulation is employed following every convolutional phase. The max pooling layer has a size of a × a. Consequently, the outcomes of this layer contain N matrices with a size of $\left[\frac{n-m+1}{a}\right]\times \left[\frac{n-m+1}{a}\right]$, followed by a dropout manipulation, were subsequently assigned to the second convolutional layer having M kernels with a size of m × m. The outcomes of this convolutional layer contain M matrices having a size of $\left[\frac{n-\left(a+1\right)\left(m-1\right)}{a}\right]\times \left[\frac{n-\left(a+1\right)\left(m-1\right)}{a}\right]$. These outcomes were later uninterruptedly assigned to the second max pooling layer with M kernels having a size of a × a. This max pooling layer produces resultant M feature maps having a size of $\left[\frac{n-\left(a+1\right)\left(m-1\right)}{{a}^{2}}\right]\times \left[\frac{n-\left(a+1\right)\left(m-1\right)}{{a}^{2}}\right]$. The extracted features were reorganized by a fully connected layer having k neural units following the previous layer. Ultimately, the binary classification outcomes in the form of “landslide” and “non-landslide” are indicated by the produced two neural units by the output layer. The architecture of CNN-2D when n = 24, m = 3, N = 20, M = 15, a = 2 and k = 78 is presented in Fig. 8.

11.3 CNN-3D

The input data for the pondered region can be represented by a size 3D matrix (c = n + n), where n means the row and column of every data layer and c refers to the number of landslide conditioning factors. Figure 9, for example, shows the 3D data representation of each grid cell neighborhood with a scale of 7 × 7.

We designed a 3D CNN architecture under these conditions to obtain details and the spatial relation of the conditions of landslide incidence. The 3D CNN network contained one convolutional layer of N kernels with a size of m × m × m, one max-pooling layer, and one fully connected layer. With a size of (c – m + 1) × (n – m + 1) × (n – m + 1), the convolutional layer has N feature maps, given the c × n × n input data. Every grid cell is linked to an m × m × m neighborhood in the input data. The subsequent hidden layer with a size of a × a is a max-pooling layer. Consequently, the outcomes from the max-pooling layer have N feature maps with a size of $\left[\frac{c-m+1}{a}\right]\times \left[\frac{n-m+1}{a}\right]\times \left[\frac{n-m+1}{a}\right]$. After the max-pooling layer, there is a fully connected layer with k neural units, and it pursues the former layer to learn the extracted features. Ultimately, two neural units are positioned in the output layer to signify “landslide” and “non-landslide” prediction. The architecture of CNN-3D for c = 15, n = 7, N = 20, m = 3, a = 2, and k = 78 is presented in Fig. 10.

12 The related parameters

The parameter settings greatly influence the performance of prediction/classification techniques. In the present study, we also incorporated related parameters in the proposed CNN architectures to improve the performance. This effort was made to enhance the prediction performance of the models and achieve superior susceptibility maps. The effects of loss and activation functions are highly important for CNN architecture. The effects of any successor layer are a linear feature of its predecessor layer as far as the ANN techniques are concerned. But the actual scenario through this linear association is challenging to embody (Huang and Babri 1998). The activation function approach has been used to overcome the appropriate problem of the output data. It can effectively form nonlinear ties from linear ties using predefined (nonlinear) activation functions (Dahl et al. 2011).

The current research incorporated the CNN architectures with the rectified linear unit (ReLU) function (Dahl et al. 2013). The ReLU function, with two major benefits, is among CNN’s most popular and effective activation functions. The first benefit is that this function facilitates overcoming the dilemma of gradient disappearance. Secondly, it is more economical as well as useful for training prediction techniques compared to the additional activation functions (Maas et al. 2013). The loss function was a categorical cross-entropy function, and its optimizer was an advanced adaptive gradient (AdaGrad) algorithm (Anthimopoulos, Christodoulidis et al. 2016). The AdaGrad technique can limit the learning rate and use various learning rates per iteration for individual learning parameters (Duchi et al. 2011). The adaptive moment estimation (Adam) (Kingma and Ba 2014) was used as an optimizer for ResNet. It is a first-order gradient-based algorithm of stochastic objective functions based on adaptive estimates of lower-order moments. The posterior probability was produced for each grid cell using the softmax function (Lawrence, Giles et al. 1997). The prediction output is greatly improved by dropping manipulation since the neural network units corresponding to a given possibility can be momentarily abandoned during training (Hinton et al. 2012). Explicitly, the dropout manipulation obliges a nerve unit, in addition to overfitting between secret units, to work with other arbitrary neural units (Srivastava et al. 2014). Furthermore, this manipulation will improve the generalization of prediction techniques (Dahl et al. 2013).

13 Model evaluation methods

The performance of the suggested methodology was evaluated using the measurements of OA and ROC (Tsangaratos and Ilia 2016, Pham et al. 2017b, Chen et al. 2017c). The proportion of the number of grid cells, either landslide or non-landslide that were classified accurately (represented by a) to the entire grid cells (represented by b) gives the OA value, which is calculated using the following formula:

$$ {\text{OA}} = \frac{a}{b} \times 100\% $$

A higher OA value embodies superior classification precision. For evaluating the functioning of landslide prediction techniques, the ROC curve is a standard practiced approach (Bradley 1997). The TP (true positive) rate, which is described as “sensitivity,” is plotted against the FP (false positive) rate, which is described as “100-specificity,” at different threshold values to produce the curve. Furthermore, for quantitatively evaluating the performance of landslide susceptibility mapping techniques, the measurement of AUC has been applied broadly (Wang et al. 2017; Mandal and Mandal 2018a). In particular, if the AUC value is in the proximity of 1, the prediction approach is deemed outstanding (Tsangaratos et al. 2017, Pham et al. 2017a, Zhu et al. 2018).

The Matthews correlation coefficient (MCC) (Matthews 1975) has also been practiced in various ML techniques, even though the two assortments are of very distinct volumes. The following expression characterizes the MCC:

$$ {\text{MCC}} = \frac{{{\text{TP}} \times {\text{TN}} - {\text{FP}} \times {\text{FN}}}}{{\sqrt {\left( {{\text{TP}} + {\text{FP}}} \right)\left( {{\text{TP}} + {\text{FN}}} \right)\left( {{\text{TN}} + {\text{FP}}} \right)\left( {{\text{TN}} + {\text{FN}}} \right)} }} $$

where the number of non-landslide and landslide locations classified precisely is represented by TN (true negative) and TP, and the number of non-landslide and landslide locations classified wrongly are represented by FN (false negative) and FP, respectively. Additionally, this measure is a correlation coefficient among the observed and predicted classes. Usually, the MCC value equal to 1 portrays a perfect prediction for the final result. On the other hand, the MCC value of 0 and − 1 signifies an arbitrary prediction and a complete disparity between the prediction and observation.

Furthermore, the significant difference among suggested techniques was evaluated using the Chi-square test (Kuncheva 2004). It is based on a former proposition that the techniques used to map landslide susceptibility have no substantial change (Tallarida and Murray 1987). For the validation purpose, the Chi-square and p-values were designated and computed. Generally, a p-value less than 0.05 and a Chi-square value greater than 3.841 indicates that between the two techniques, there exists a significant difference (Pham et al. 2017a).

14 Results and analysis

The training set designated was used to determine the predictive potential of the landslide conditioning factors by analyzing multicollinearity and using the GR method. The results of the landslide conditioning factor multicollinearity study are summarized in Table 1. The VIF value of SPI and STI is observed to be 9.772 and 9.43, which are very near the threshold value of 10. Thus, their contribution to the model is quite less. As far as the GR method is concerned, the factors with higher weights are additionally meaningful to the practiced techniques, while factors with weights equivalent to zero cannot contribute to the modeling process; therefore, they should be omitted from further analysis. Figure 11 shows the AM value of the respectively conditioning factor. The land use factor has the highest AM value of 0.0525 compared to the other factors, which means that it is comparatively significant as the other factors.

Table 1 Outcomes of multicollinearity analysis

Full size table

The AM values of aspect, lithology, elevation, slope, rainfall, soil, NDVI, and SPI are between 0.0474 and 0.0151. Furthermore, the AM values of plan curvature, fault density, road density, profile curvature, and TWI are positive but less than 0.01, which implies that their contribution to the models is very little. All the outstanding landslide conditioning factors with more than zero AM values participate in the models. Still, their contribution is very small, and they must be considered for the modeling process because they still have impact on landslides have high frequency ration as far as landslide is concern (Table 2). The reclassified maps of all the landslide conditioning factors are presented in Fig. 12a–p.

Table 2 Spatial relationship between past landslides and different classes of landslide conditioning factors using the FR model

Full size table

15 Model validation and comparison

The trial-and-error method was used to optimize all the parameter settings to develop the CNNs. Table 3 displays the parameter settings for all CNNs and ResNet. Every grid cell in the zone in question was designated a susceptibility index after the formulation of the models through training datasets. The weights obtained were then allocated for each factor class, and the final susceptibility maps were drawn up in an ArcGIS setting for each model. The natural breaking technique has been used to reclassify the higher vision indices in five very low, medium, medium, and very high classes. The obtained susceptibility maps for landslides from different CNN frameworks and ResNet are shown in Fig. 13.

Table 3 Parameter settings of the CNNs

Full size table

In contrast, for each of the models, the distribution of percentages for each of the susceptibility classes is shown in Fig. 14. All the maps show that the northeast portion of both districts was classified as high and very highly susceptible areas. The outcomes of ResNet, CNN-1D, and CNN-2D were very much alike. In Fig. 13b, the high and very high-class concentration is more intense than the concentration in other obtained maps. The high and very high classes in Fig. 13a and c, and 13d were quite similar and more intense than the SVM model in Fig. 13 f. However, in the outcome of CNN-3D in Fig. 13e, the high and very class in the deliberated area was more than the outcome of the SVM model in Fig. 13 f. The result of DNN in Fig. 13 g demonstrated that the very high and high susceptible zones in the study area are less than those in the maps obtained through ResNet, LeNet-5, CNN-1D, CNN-2D, CNN-3D, and SVM but are more than LR. The map obtained through LR has the least proportion of high and very high class, as seen from Fig. 13 h.

For the proposed multiple CNN frameworks and the ResNet technique, the OA and MCC values are listed in Table 4. The OA values of ResNet were higher than that of the suggested CNN frameworks. In particular, the OA value (83.52%) of ResNet was higher than other methods, which is almost 1% greater than that of CNN-3D (82.54%), which has the second-highest OA value. However, CNN-2D obtained an OA value of 81.71%, followed by CNN-1D with an OA value of 79.32%. Moreover, ResNet also obtained the highest MCC value (0.596) compared to other techniques. Secondly, it was CNN-3D which obtained the highest MCC value (0.582), followed by CNN-1D, LeNet-5 plus CNN-2D with MCC values of 0.535, 0.526, and 0.514, correspondingly.

Table 4 Performance comparison of different methods

Full size table

The obtained ROC curves using the testing set for all the used CNN frameworks are presented in Fig. 15. It can be witnessed from the figure that the CNN-3D method has the maximum AUC value of 0.874, which specifies that it has a superior predictive capacity to the other CNN methods. Moreover, the CNN-1D and CNN-2D methods attained comparable AUC values of 0.871 and 0.872, respectively, whereas the LeNet-5 technique attained the AUC value of 0.864. The significant difference between the proposed prediction techniques was evaluated using a Chi-square test. A Chi-square value greater than 3.841, whereas a significant level value (p) less than 0.05 indicates a significant difference among the prediction techniques. For the different proposed CNNs and ResNet, the Chi-square values plus the significant levels are listed in Table 5. It shows that ResNet and all the CNNs are quite distinct as their Chi-square and the significant level values comply with the threshold conditions.

Table 5 Chi-square values and significant levels of different CNNs

Full size table

For further validation of the efficacy of the proposed models, the best performing model (ResNet) in the preceding experiments was chosen to be compared with different common ML and DL techniques. DNNs are typically feedforward networks, where data flow from the input layer to the output layer without looping back (CireşAn et al. 2012). First, a simulated neural unit map is generated by DNN, then links these neural units by assessing their weights. Subsequently, a probability between 1 and 0 is produced by multiplying the input data with the weights. The chosen DNN has a network architecture composed of five layers containing four hidden, fully connected layers. The number of neural units in the four hidden layers is 50, 30, 20, and 10. The prediction outcomes in the output layer are obtained with two neural units, corresponding to landslide and non-landslide units. For comparison with an ML technique, the SVM classifier with a radial basis function kernel was utilized. For SVM, the γ (2^− 9) and optimal C (2⁷) were obtained through five-fold cross-validation varying from 2^− 5 and 2^− 15 to 2¹⁵ and 2⁵, correspondingly. The landslide susceptibility maps obtained from SVM and DNN methods are presented in Fig. 12f g, respectively.

The OA as well as the MCC values of the orthodox DL and ML techniques are listed in Table 6. ResNet achieved the highest OA value of 83.52%. However, CNN-3D achieved an OA value of 82.54%, almost 4% more than the optimized SVM (78.43%). Followingly, it is LeNet-5 plus DNN, which have quite comparable OA values of 78.12 and 77.3%, correspondingly. Moreover, ResNet also accomplished the highest MCC value (0.596). However, as compared to the optimized SVM, CNN-3D also accomplished a considerably greater MCC value. The obtained MCC value of CNN-3D is 0.514, followed by the LeNet-5 (0.526), DNN (0.511), and SVM (0.427).

Table 6 Performance comparison of different methods

Full size table

For comparison, the ROC curves using the validation set for ResNet, CNN-3D, LeNet-5, DNN, and SVM are shown in Fig. 16. The predictive power as per the AUC of ResNet was observed to be better than the optimized SVM and DNN. However, CNN-3D obtained an AUC value greater than SVM and DNN but less than ResNet. Moreover, the CNN-3D, and LeNet-5, correspondingly attained AUC values of 0.881, 0.874, and 0.864, whereas DNN and SVM likewise achieved AUC values of 0.849 and 0.825.

16 Discussion

The susceptibility mapping of landslides is of immense importance as it assists in visually examining landslide-prone areas. Previously, various models have been established for mapping the landslide susceptibility of a specific region, and their functioning has been contrasted. Yet, the prediction precision of these developed techniques is still under discussion (Akgun 2012, Martinović et al. 2016, Chen et al. 2017b). Hence, it is essential to explore novel techniques and approaches for assessing landslide susceptibility (Bui et al. 2016). In landslide susceptibility evaluation studies, significant progress has been accomplished using orthodox regression analysis or ML techniques. Comparatively, simple structure prediction models are constructed through these techniques.

Lately, numerous ML techniques have been utilized for the spatial prediction of landslides and compared with each other for a given area. These techniques include SVM (Chen et al. 2018), LR (Tsangaratos and Ilia 2016), decision tree (Chen et al. 2017c), and ANN (Chen et al. 2017a). Moreover, various ensemble techniques have been built and used for the same purpose. These include rotation forest (Pham, Shirzadi et al. 2018), bagging (Pham et al. 2017a), and AdaBoost (Hong, Liu et al. 2018). The raw data are not processed outstandingly by the orthodox ML methods because they have a limited ability to do so (LeCun, Bengio et al. 2015).

To some extent, these models have flaws in defining the intricate nonlinear landslide systems and in avoiding the overfitting problem. Modern DL techniques such as ResNet, CNNs, and DNN have essential improvements to the above problems compared to the orthodox methods. Two significant characteristics of DL techniques are their nonlinear and multilayer structures, which enable them to depict an intricate nonlinear landslide system influenced by several conditioning factors. Nevertheless, the DL techniques have a robust enhancement over ML techniques and play a progressively significant role in natural language processing, image processing, and computer vision. The DL techniques are smart enough to automatically study the representation from the raw data required for prediction. Thus, it is favorable to discover the possibility of using potent DL techniques for assessing landslide susceptibility (Wang, Fang et al. 2019).

Furthermore, valid algorithms, such as “batch” or “dropout,” are incorporated in DL techniques to prevent overfitting effectively. Thus, DL techniques are likely to enhance the assessment precision of landslide susceptibility. Nevertheless, comparatively limited studies performed landslide susceptibility assessments using DL techniques.

The pondered area (Mansehra and Muzaffarabad districts) for this study is in the northern part of Pakistan. As this region has suffered numerous landslides due to seismic activities and extensive rainfall and is expected to experience these types of events in the future; thus, it is vital to map landslide susceptibility for preventive measures. In the current study, an effort has been made to address this issue with the assessment and comparison of ResNet and multiple CNN architectures. The DL techniques: ResNet and CNN have superior fitting power, and they are comparatively effective in feature extraction, which enhances the prediction ability of the models, thus generating efficient susceptibility maps. Moreover, we used multiple activation functions and optimizers to enhance the prediction results and found that the models showed superior performance with ReLU (activation function) and AdaGrad (optimizer). Thus, we incorporated these functions into the models for better prediction results. These techniques were also compared with DNN, LR, and SVM classifiers to compare and show the performance differences.

The purpose of comparing DL and ML techniques was to establish the best-performing model among the multiple proposed conventional and modern models. Therefore, the best-performing CNN and the ResNet models were compared with the state-of-the-art ML technique (SVM) and DL technique (DNN) to establish the superiority of the best-performing model. Among the two used ML techniques, SVM was selected for comparison purposes because several studies have established its superiority over other conventional ML techniques for landslide susceptibility mapping (Wu et al. 2016, Tien Bui et al. 2019, Yu et al. 2019, Pham et al. 2019b).

Landslides are highly complex and organized progressions through many environmental and topographical variables known as landslide conditioners. Sixteen conditioning factors were analyzed based on previous research, available evidence, and environmental conditions in the region to map landslide susceptibility in the current investigation. Each considered factor was translated into spatially delimited layers or maps of 30 m to 30 m grid size, corresponding to the DEM data obtained within an ArcGIS setting. In order to map the landslide susceptibility, the choice of an effective terrain mapping unit is also essential. A standard grid cell model was used to assess landslide susceptibility because it is the most common approach to spatially represent this form of a dataset (Tsangaratos and Ilia 2016).

The predictive ability of all the conditioning factors must be assessed before investigating landslide susceptibility. For this purpose, multicollinearity analysis was used to assess correlations among the factors considered. Moreover, the GR technique ranked these factors based on their importance. The outcomes of the multicollinearity analysis disclosed that the STI has intense multicollinearity and ought to be eliminated from the additional process. On the contrary, the outcomes of the GR technique revealed that land use and NDVI have greater AM values as compared to the other factors, demonstrating that these two factors are additionally crucial for landslide incidence. NDVI can precisely display the surface vegetation coverage. It can be seen from Fig. 12j that those regions where the NDVI value is less have a higher potential for landslides and fall under the class of high and very high susceptibility in produced susceptibility maps (Fig. 13). Furthermore, it must be noted that in mountainous areas like the one under consideration, landslides, in addition to external forces, occur continuously because of precipitation, even on slopes overlaid with considerable vegetation.

After the feature selection process, the selected features were then used to build the multiple CNN architectures and ResNet model. Moreover, various data representations transformed from initial landslide data are introduced as well as incorporated in the suggested CNN architectures. The spatial information can be efficiently extracted by CNN through local connections and can considerably decrease the required network parameters by distributing weights. The local link can easily be used by the architecture of CNN-1D, which enables this CNN to learn the more convoluted representations from factor vectors progressively. As CNN-2D at first demonstrated outstanding performance in visual image analysis, we transformed every 1D factor vector into a 2D matrix to adequately obtain the important hidden features. Although the CNN-1D and CNN-2D can be considered the same regarding the convolution operation, and their outcomes are also very similar, the point of considering the two different designs and calling them two different models is to have more results for comparison purposes. Moreover, we intended to maintain the order rather than jumping directly to CNN-3D after CNN-1D. The only difference lies mainly in the data representation instead of the mechanism of landslide modeling. Besides learning factor representations, the CNN-3D also gets local spatial information.

The results disclosed that the total percentages of high and very high classes are approximately comparable for CNN-1D, CNN-2D, and ResNet. Moreover, the high and very high classes of the ResNet, CNN-1D, CNN-2D, as well as CNN-3D are comparatively less than the LeNet-5 method but greater than SVM, DNN, and LR. ResNet accomplished higher corresponding values than the OA plus the MCC values of the projected CNNs. Furthermore, the ResNet and CNN-3D obtained superior prediction performance in the succeeding experiments compared to the well-known DNN and SVM classifiers. Lastly, using the testing dataset, the AUC value (0.881) turned out to be the highest for ResNet, followed by CNN-3D (0.874), which shows that ResNet and the proposed 3D structure can effectively improve the prediction performance, and they may prove to be a good technique for upcoming investigations. Therefore, it can be asserted that the suggested techniques are more feasible for managing and preventing landslides.

The outcomes of this study exhibited that ResNet is better than the proposed CNN frameworks. However, CNN-3D is better than the conventional DL technique, namely DNN, and the orthodox ML techniques, namely LR and SVM, which indicates that the suggested data representation forms of CNNs might be favorable vigorous approaches for mapping the landslide susceptibility. Different CNN frameworks for mapping the susceptibility to landslides have also been used in Yanshan County, China, by Wang, Fang et al. 2019. The results showed that the CNN frameworks performed better as compared to orthodox ML and DL techniques. Finally, the proposed CNN architectures offer a new way of handling raw landslide data in the present analysis. Moreover, the produced landslide susceptibility maps through the employed techniques can essentially be used as a guide by the planners and policymakers to avert and alleviate the landslide risk by positing future land use zones appropriately and ascertaining and establishing alleviation urgencies for the endangered areas.

17 Conclusion

The core objective of this work was to examine the application of different convolutional neural network (CNN) frameworks and residual network (ResNet) for mapping landslide susceptibility in northern Pakistan (Mansehra and Muzaffarabad districts). The considered area is among those regions in the world which are severely prone to landslides due to the geological and environmental settings of the region. The practiced frameworks turned out to be a valuable approach, and they can be utilized for other regions around the globe with comparable attributes. The ResNet and intended CNNs were validated centered on the evaluation of sixteen conditioning factors that were obtained from several supplementary sources. The ResNet and the proposed CNNs were employed to produce landslide susceptibility maps of the deliberated area and were contrasted to the orthodox deep learning (DL) and machine learning (ML) techniques of DNN and SVM, and LR, respectively. Different objective measures such as OA, MCC, ROC, and AUC were utilized to confirm the outcomes. The obtained landslide susceptibility maps using ResNet and the suggested CNNs are more effective for managing and preventing landslides as compared to those from orthodox techniques, as confirmed by the investigational outcomes. The prediction results of the ResNet model outperformed all the other used models as per OA, MCC, and AUC values. However, the CNN-3D model was found to be better among the proposed CNNs. Thus, these models can be applied to generate definitive susceptibility maps. Lastly, when using a DL technique such as ResNet or the multiple CNN frameworks in the present study, the prediction precisions of susceptibility maps can be efficiently enhanced by following two strategies during the construction of the model’s architecture by including dropout manipulation and by selecting an activation function. In conclusion, ResNet and CNNs are very favorable for the spatial prediction of landslides and can be used for future research. Moreover, these techniques can also be compared with more efficient DL techniques in the future for mapping landslide susceptibility.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Akgun A (2012) A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir. Turk Landslides 9(1):93–106
Google Scholar
Anthimopoulos M et al (2016) Lung pattern classification for interstitial lung diseases using a deep convolutional neural network. IEEE Trans Med Imaging 35(5):1207–1216
Google Scholar
Aslam B et al (2021) Development of integrated deep learning and machine learning algorithm for the assessment of landslide hazard potential. Soft Comput 25(21):13493–13512
Google Scholar
Aslam B et al (2022) Comparison of multiple conventional and unconventional machine learning models for landslide susceptibility mapping of Northern part of Pakistan. Environ, Develop Sustain: 1–28
Bacha AS et al (2018) Landslide inventory and susceptibility modelling using geospatial tools, in Hunza-Nagar valley, northern Pakistan. J Mt Sci 15(6):1354–1370
Google Scholar
Ballabio C, Sterlacchini S (2012) Support vector machines for landslide susceptibility mapping: the Staffora River Basin case study Italy. Math Geosci 44(1):47–70
Google Scholar
Barella CF et al (2019) A comparative analysis of statistical landslide susceptibility mapping in the southeast region of Minas Gerais state. Brazil Bull Eng Geol Environ 78(5):3205–3221
Google Scholar
Basharat M et al (2016) Landslide susceptibility mapping using GIS and weighted overlay method: a case study from NW Himalayas Pakistan. Arabian Journal of Geosciences 9(4):1–19
Google Scholar
Bibi T et al (2016) Landslide susceptibility assessment through fuzzy logic inference system (FLIS). The International Archives of Photogrammetry. Remote Sens Spat Inform Sci 42:355
Google Scholar
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159
Google Scholar
Bui DT et al (2016) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13(2):361–378
Google Scholar
Cao J et al (2019) Urban noise recognition with convolutional neural network. Multimedia Tools and Applications 78(20):29021–29041
Google Scholar
Chen W et al (2017a) A GIS-based comparative study of Dempster-Shafer, logistic regression and artificial neural network models for landslide susceptibility mapping. Geocarto Int 32(4):367–385
Google Scholar
Chen W et al (2017b) GIS-based landslide susceptibility modelling: a comparative assessment of kernel logistic regression, Naïve-Bayes tree, and alternating decision tree models. Geomatics Nat Hazards Risk 8(2):950–973
Google Scholar
Chen W et al (2017c) A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. Catena 151:147–160
Google Scholar
Chen W et al (2018) A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China. Bull Eng Geol Environ 77(2):647–664
Google Scholar
CireşAn D et al (2012) Multi-column deep neural network for traffic sign classification. Neural Netw 32:333–338
Google Scholar
Crosta GB (2004) Introduction to the special issue on rainfall-triggered landslides and debris flows. Eng Geol 3(73):191–192
Google Scholar
Dahl GE et al (2011) Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Trans Audio Speech Lang Process 20(1):30–42
Google Scholar
Dahl GE et al (2013) Improving deep neural networks for LVCSR using rectified linear units and dropout. In: 2013 IEEE international conference on acoustics, speech and signal processing, IEEE
Dai F et al (2002) Landslide risk assessment and management: an overview. Eng Geol 64(1):65–87
Google Scholar
Dash M, Liu H (1997) Feature selection for classification. Intell data Anal 1(3):131–156
Google Scholar
Demir G (2019) GIS-based landslide susceptibility mapping for a part of the North Anatolian Fault Zone between Reşadiye and Koyulhisar (Turkey). Catena 183:104211
Google Scholar
Dou J et al. (2020) Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed. Japan Landslides 17(3):641–658
Duchi J et al (2011) Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res 12(7)
Duo Z et al (2019) Oceanic mesoscale eddy detection method based on deep learning. Remote Sensing 11(16): 1921
Fang Z et al (2020) Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping. Computers & Geosciences: 104470
Flentje PN et al (2007) Guidelines for landslide susceptibility, hazard and risk zoning for land use planning.
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision
Gupta SK et al (2018) Selection of weightages for causative factors used in preparation of landslide susceptibility zonation (LSZ). Geomatics Nat Hazards Risk 9(1):471–487
Google Scholar
Guzzetti F et al (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81(1–2):166–184
Google Scholar
Guzzetti F et al (2012) Landslide inventory maps: New tools for an old problem. Earth Sci Rev 112(1–2):42–66
Google Scholar
He S et al (2012) “Application of kernel-based Fisher discriminant analysis to map landslide susceptibility in the Qinggan River delta, Three Gorges China. Geomorphology 171:30–41
Google Scholar
He K et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Hinton GE et al (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580
Hong H et al (2016a) “GIS-based landslide spatial modeling in Ganzhou City China. Arab JGeosci 9(2):112
Google Scholar
Hong H et al (2016b) Landslide susceptibility assessment in Lianhua County (China): a comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 259:105–118
Google Scholar
Hong H et al (2017) A hybrid fuzzy weight of evidence method in landslide susceptibility analysis on the Wuyuan area China. Geomorphology 290:1–16
Google Scholar
Hong H et al (2018a) “Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). Catena 163:399–413
Google Scholar
Hong H et al (2018b) Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). Catena 163:399–413
Google Scholar
Huang G-B, Babri HA (1998) Upper bounds on the number of hidden neurons in feedforward networks with arbitrary bounded non-linear activation functions. IEEE Trans Neural Networks 9(1):224–229
Google Scholar
Hussain A, Yeats RS (2009) Geological setting of the October 8 2005 Kashmir earthquake. J Seismolog 13(3):315–325
Google Scholar
Jiao Y et al (2019) Performance evaluation for four GIS-based models purposed to predict and map landslide susceptibility: a case study at a World Heritage site in Southwest China. Catena 183:104221
Google Scholar
Kamp U et al (2008) GIS-based landslide susceptibility mapping for the 2005 Kashmir earthquake region. Geomorphology 101(4):631–642
Google Scholar
Kanungo D et al (2011) Combining neural network with fuzzy, certainty factor and likelihood ratio concepts for spatial prediction of landslides. Nat Hazards 59(3):1491
Google Scholar
Kazmi AH, Jan MQ (1997) Geology and tectonics of Pakistan. Graphic publishers
Khan H et al (2019) Landslide susceptibility assessment using Frequency Ratio, a case study of northern Pakistan. Egypt J Remote Sens Space Sci 22(1):11–24
Google Scholar
Kim J-C et al (2018) Landslide susceptibility mapping using random forest and boosted tree models in Pyeong-Chang Korea. Geocarto Int 33(9):1000–1015
Google Scholar
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980
Krizhevsky A et al (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Google Scholar
Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images
Kuncheva L (2004) Combining pattern classifiers methods and algorithms. John Wiley and Sons. Hoboken
Google Scholar
Lawrence S et al (1997) Face recognition: a convolutional neural-network approach. IEEE Trans Neural Networks 8(1):98–113
Google Scholar
LeCun Y et al (2015) Deep Learn. Nat 521(7553):436–444
Google Scholar
LeCun Y et al (1995) Comparison of learning algorithms for handwritten digit recognition. In: International conference on artificial neural networks, Perth, Australia
LeCun Y et al (1998) Gradient-based learning applied to document recognition. In: Proceedings of the IEEE 86(11): 2278–2324
Lee M-J et al (2015) Forecasting and validation of landslide susceptibility using an integration of frequency ratio and neuro-fuzzy models: a case study of Seorak mountain area in Korea”. Environ Earth Sci 74(1):413–429
Google Scholar
Li L et al (2017) A modified frequency ratio method for landslide susceptibility assessment. Landslides 14(2):727–741
Google Scholar
Liu Y et al (2019) Intelligent wind turbine blade icing detection using supervisory control and data acquisition data and ensemble deep learning Energy Sci Eng 7(6): 2633–2645
Maas AL et al (2013) Rectifier nonlinearities improve neural network acoustic models. Proc. icml
Mandal SP et al (2018b) Comparative evaluation of information value and frequency ratio in landslide susceptibility analysis along national highways of Sikkim Himalaya. Spat Inform Res 26(2):127–141
Google Scholar
Mandal S, Mandal K (2018a) Modeling and mapping landslide susceptibility zones using GIS based multivariate binary logistic regression (LR) model in the Rorachu river basin of eastern Sikkim Himalaya India. Model Earth Syst Environ 4(1):69–88
Google Scholar
Maqsoom A et al (2021) Landslide susceptibility mapping along the China Pakistan Economic Corridor (CPEC) route using multi-criteria decision-making method. Model Earth Syst Environ:1–15
Martinović K et al (2016) Development of a landslide susceptibility assessment for a rail network. Eng Geol 215:1–9
Google Scholar
Matthews BW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim et Biophys Acta (BBA)-Protein Struct 405(2):442–451
Google Scholar
Mondal S, Mandal S (2019) Landslide susceptibility mapping of Darjeeling Himalaya, India using index of entropy (IOE) model. Appl Geomatics 11(2):129–146
Google Scholar
Nefeslioglu HA, Gorum T (2020) The use of landslide hazard maps to determine mitigation priorities in a dam reservoir and its protection area. Land Use Policy 91:104363
Google Scholar
Niu R et al (2014) Susceptibility assessment of landslides triggered by the Lushan earthquake, April 20, 2013, China. IEEE J Sel Top Appl Earth Observations Remote Sens 7(9):3979–3992
Google Scholar
Owen LA et al (2008) Landslides triggered by the October 8 2005 Kashmir earthquake. Geomorphology 94(1–2):1–9
Google Scholar
O’brien RM (2007) A caution regarding rules of thumb for variance inflation factors. Qual Quant 41(5):673–690
Google Scholar
Pamela P et al (2018) Weights of evidence method for landslide susceptibility mapping in Takengon, Central Aceh, Indonesia. In: Proceedings of the IOP conference series: Earth & Environmental Science, Prague, Czech Republic
Pavelsky TM, Smith LC (2008) RivWidth: A software tool for the calculation of river widths from remotely sensed imagery. IEEE Geosci Remote Sens Lett 5(1):70–73
Google Scholar
Pham BT et al (2016) “Evaluation of predictive ability of support vector machines and naive Bayes trees methods for spatial prediction of landslides in Uttarakhand state (India) using GIS. J Geomat 10:71–79
Google Scholar
Pham BT et al (2017a) A novel ensemble classifier of rotation forest and Naïve Bayer for landslide susceptibility assessment at the Luc Yen district, Yen Bai Province (Viet Nam) using GIS. Geomatics. Nat Hazards Risk 8(2):649–671
Google Scholar
Pham BT et al (2017b) Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods. Theoret Appl Climatol 128(1–2):255–273
Google Scholar
Pham BT et al (2017c) “Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. Catena 149:52–63
Google Scholar
Pham BT et al (2018) A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation forest for landslide susceptibility modeling: a case study in the Himalayan area, India. Int J Sedim Res 33(2):157–170
Google Scholar
Pham BT et al (2019b) A novel intelligence approach of a sequential minimal optimization-based support vector machine for landslide susceptibility mapping. Sustainability 11(22):6323
Google Scholar
Pham BT, Prakash I (2019a) Evaluation and comparison of LogitBoost Ensemble, Fisher’s Linear Discriminant Analysis, logistic regression and support vector machines methods for landslide susceptibility mapping. Geocarto Int 34(3):316–333
Google Scholar
Polykretis C, Chalkias C (2018) Comparison and evaluation of landslide susceptibility maps obtained from weight of evidence, logistic regression, and artificial neural network models. Nat Hazards 93(1):249–274
Google Scholar
Ramachandra T et al (2013) Prediction of shallow landslide prone regions in undulating terrains. Disaster Adv 6(1):54–64
Google Scholar
Ronoud S, Asadi S (2019) An evolutionary deep belief network extreme learning-based for breast cancer diagnosis. Soft Comput 23(24):13139–13159
Google Scholar
Rosi A et al (2018) The new landslide inventory of Tuscany (Italy) updated with PS-InSAR: geomorphological features and landslide distribution. Landslides 15(1):5–19
Google Scholar
Saba SB et al (2010a) Spatiotemporal landslide detection for the 2005 Kashmir earthquake region. Geomorphology 124(1–2):17–25
Google Scholar
Saba SB et al (2010b) Spatiotemporal landslide detection for the 2005 Kashmir earthquake region. Geomorphology 124(1–2):17–25
Google Scholar
Sevgen E et al (2019a) A novel performance assessment approach using photogrammetric techniques for landslide susceptibility mapping with logistic regression. ANN and random forest. Sensors 19(18):3940
Google Scholar
Sevgen E et al (2019b) A novel performance assessment approach using photogrammetric techniques for landslide susceptibility mapping with logistic regression ANN and random forest. Sensors 19(18):3940
Google Scholar
Shafique M et al (2016) A review of the 2005 Kashmir earthquake-induced landslides; from a remote sensing prospective. J Asian Earth Sci 118:68–80
Google Scholar
Sharif Razavian A et al (2014) CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops
Shin H-C et al (2016) Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging 35(5):1285–1298
Google Scholar
Srivastava N et al (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958
Google Scholar
Srivastava RK et al (2015) “Training very deep networks.“ arXiv preprint arXiv:1507.06228
Szegedy C et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Tallarida RJ, Murray RB (1987) Chi-square test. Manual of pharmacologic calculations. Springer, pp 140–142
Tanyas H et al (2019) A global slope unit-based method for the near real-time prediction of earthquake-induced landslides. Geomorphology 327:126–146
Google Scholar
Tien Bui D et al (2019) New ensemble models for shallow landslide susceptibility modeling in a semi-arid watershed. Forests 10(9):743
Google Scholar
Torizin J et al (2017) Statistical landslide susceptibility assessment of the Mansehra and Torghar districts, Khyber Pakhtunkhwa Province Pakistan. Natural hazards 89(2):757–784
Google Scholar
Tsangaratos P et al (2017) Applying information theory and GIS-based quantitative methods to produce landslide susceptibility maps in Nancheng County China. Landslides 14(3):1091–1111
Google Scholar
Tsangaratos P, Ilia I (2016) Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: the influence of models complexity and training dataset size. CATENA 145:164–179
Google Scholar
Wang L-J et al (2016a) A comparative study of landslide susceptibility maps using logistic regression, frequency ratio, decision tree, weights of evidence and artificial neural network. Geosci J 20(1):117–136
Google Scholar
Wang Q et al (2016b) Application of statistical index and index of entropy methods to landslide susceptibility assessment in Gongliu (Xinjiang, China). Environ Earth Sci 75(7):599
Google Scholar
Wang Q et al (2017) Integration of information theory, K-means cluster analysis and the logistic regression model for landslide susceptibility mapping in the Three Gorges Area China. Remote Sensing 9(9):938
Google Scholar
Wang Y et al (2019) Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County China. Sci Total Environ 666:975–993
Google Scholar
Wang G et al (2020a) Spatial prediction of landslide susceptibility based on GIS and discriminant functions. ISPRS Int J Geo-Information 9(3):144
Google Scholar
Wang Y et al (2020b) Comparative study of landslide susceptibility mapping with different recurrent neural networks. Comput Geosci 138:104445
Google Scholar
Wu Y, Li W, Wang Q, Liu Q, Yang D, Xing M et al (2016) Landslide susceptibility assessment using frequency ratio, statistical index and certainty factor models for the Gangu County, China. Arab J Geosci 9(2):1–16
Google Scholar
Yi Y et al (2020) Landslide susceptibility mapping using multiscale sampling strategy and convolutional neural network: a case study in Jiuzhaigou region. Catena 195:104851
Google Scholar
Youssef AM et al (2015) Landslide susceptibility mapping at Al-Hasher area, Jizan (Saudi Arabia) using GIS-based frequency ratio and index of entropy models. Geosci J 19(1):113–134
Google Scholar
Youssef AM, Pourghasemi HR (2021) “Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at Abha Basin, Asir Region, Saudi Arabia. Geosci Front 12(2):639–655
Google Scholar
Yu L et al (2019) Landslide susceptibility mapping combining information gain ratio and support vector machines: a case study from Wushan segment in the three gorges reservoir area China. Appl Sci 9(22):4756
Google Scholar
Zhao X, Chen W (2020) GIS-based evaluation of landslide susceptibility models using certainty factors and functional trees-based ensemble techniques. Appl Sci 10(1):16
Google Scholar
Zhu A-X et al (2018) A comparative study of an expert knowledge-based model and two data-driven models for landslide susceptibility mapping. CATENA 166:317–327
Google Scholar

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Department of Cyber Security and Data Science, Riphah International University, Islamabad Campus, Islamabad, Pakistan
Bilal Aslam & Adeel Zafar
ITC Faculty of Geo-information Science and Earth Observation, University of Twente, Enschede, The Netherlands
Umer Khalil
School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, United States
Bilal Aslam

Authors

Bilal Aslam
View author publications
You can also search for this author in PubMed Google Scholar
Adeel Zafar
View author publications
You can also search for this author in PubMed Google Scholar
Umer Khalil
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Adeel Zafar.

Ethics declarations

Conflict of interest

The authors declare no competing interests pertinent to the context of this work.

Ethical approval

The study does not include any human subjects. Hence, no ethical approval was required for this study.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Aslam, B., Zafar, A. & Khalil, U. Comparative analysis of multiple conventional neural networks for landslide susceptibility mapping. Nat Hazards 115, 673–707 (2023). https://doi.org/10.1007/s11069-022-05570-x

Download citation

Received: 02 December 2021
Accepted: 14 August 2022
Published: 29 August 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s11069-022-05570-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Comparative analysis of multiple conventional neural networks for landslide susceptibility mapping

Abstract

Similar content being viewed by others

Comparison of optimized data-driven models for landslide susceptibility mapping

Development of integrated deep learning and machine learning algorithm for the assessment of landslide hazard potential

Deep learning-based landslide susceptibility mapping

Explore related subjects

1 Introduction

2 Study area

3 Materials and methods

4 Landslide inventory mapping

5 Landslide conditioning factors

6 Selection of conditioning factors

6.1 GR

7 Multicollinearity analysis

8 Methods

9 ResNet

9.1 CNN

10 The proposed CNN architectures

11 LeNet-5

11.1 CNN-1D

11.2 CNN-2D

11.3 CNN-3D

12 The related parameters

13 Model evaluation methods

14 Results and analysis

15 Model validation and comparison

16 Discussion

17 Conclusion

Data Availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation