Keywords

1 Introduction

Due to the force of gravity [1,2,3,4], soil, debris, and rocks slide down a slope, causing landslides. Multiple fatalities and large monetary losses are very common in the highlands and hills. There are also long-term consequences to its secondary risks. Landslide formation is influenced by a variety of environmental factors, including topography, lithology, land cover, and hydrology, according to some studies [5,6,7,8]. In addition to gravity, landslides may be triggered by rain, earthquakes, and human activity. With that being the case, it’s vital to detect and anticipate landfalls’ locations in order to avoid or minimize potential losses [9,10,11,12].

Quantitative predictive techniques for regional landslide spatial prediction have recently been published. Detailed geological field surveys may offer very accurate results when employing slope stability and landslide models. The issue is that effective physical modelling requires a wide variety of components, which is difficult to accomplish. This series has [13,14,15,16]. A new study has employed landslide susceptibility mapping (LSM), GIS, and remote sensing (RS) to evaluate the possible links between landslide conditions and the likelihood of occurrence. LSM can identify landslide-prone areas and build disaster mitigation and prevention strategies [17,18,19].

To assist LSM, many quantitative methods have been developed. Machine learning (ML) is increasingly being used in LSM because of its capacity to efficiently capture the correlations between landslides and environmental factors, such as logistic regression (LGR), support vector machines (SVMs), and decision trees (DTs), and random forests (RFs). However, classification by most methodologies does not reveal many characteristics of the causes of landslides. Deep learning methods such as DBN, RNN, and CNN, as well as convolutional neural networks, have become increasingly popular because of their ability to extract features (CNNs). In LSM’s research, CNNs performed better in extracting spatial features than the others [24]. Twenty-five and a half. Landslide sampling, on the other hand, restricts the use of certain designs for feature extraction because of its data expression. There has been a lot of interest in deep learning approaches, which are capable of extracting features and capturing deep representations from large datasets. But, only a few hybrid models were used for the thorough usage of characteristics [26,27,28,29]. However, since most techniques only consider one feature dimension, they have poor generalizability in more complex situations. Hybrid methods are now in use that makes use of each approach’s advantages for optimal feature utilization because of the complex nonlinear relationship of components and over-fitting.

2 Literature Review

To quantify landslide risk in Zichang City, China, Chen et al. [1] developed the bivariate statistical kernel logistic regression models PLKLR, PUKLR, and RBFKLR in 2021. It is now possible to create landslide susceptibility maps by comparing three landslide susceptibility maps and examining geographical trends. To begin, a 263 site historical landslide inventory was established. 263 landslide sites were used to train and test model assumptions and hypotheses. Second, 14 landslide conditioning variables were derived from the geographic data. Then, using frequency ratios, we investigated the relationship between the conditioning elements and landslide incidence. Then maps of landslide susceptibility were created using the normalized frequency ratios of the three models. Using correlation statistics, we looked at multi-collinearity. Researchers employed AUC comparison and validation to assess a model’s predictive ability. As a result, quantitative comparisons of susceptibility maps are required to prevent over or underestimating factors (distance to the river and slope). The PUKLR model has AUC values of 0.884 and 0.766 for the training and validation datasets, respectively. The datasets were trained using RBFKLR and PLKLR models with AUCs of 0.879 and 0.797. These models were used to verify and train datasets (AUC values of 0.758 and 0.752, respectively). The landslide susceptibility map may assist Zichang’s decision-makers avoid future natural disasters.

In 2021, Zhang et al. [2] developed a deep learning system using spatial response characteristics and machine learning classifiers (SR-ML). There are three stages to the process. DSC collects geographical features to avoid confusing multi-factor data. Second, spatial pyramid pooling is used to obtain response characteristics of varied sizes (SPP). 3. High-level features are integrated with ML classifiers to enhance feature categorization. Examples of meaningful feature categorization using machine learning classifiers are provided in this framework. The Yarlung Zangbo Grand Canyon area gathered data on 203 landslides and 11 conditioning variables. The AUC for the suggested SR and SR-ML was 0.920 and 0.910, greater than the random forest (RF, with the largest AUC in ML group). Bigger landslide samples exhibited the lowest mean error (0.01), suggesting that LSM might benefit from utilizing larger landslide samples.

There will be an increased danger of earthquakes in the Zagros Mountains in Iran by 2021, according to Paryani et al. [3]. They used a combination of machine learning and metaheuristic algorithms, including the adaptive neuro-fuzzy inference system and the Harris hawks optimization (BA). Landslide data was divided in half using a 70/30 ratio for training and testing purposes. There were 14 landslide-related variables examined, and the stepwise weight assessment ratio (SWARA) was used to find the relationship between landslides and components. It was then used to create landslide susceptibility maps (LSMs) based on the hybrid models of ANFIS, HHO, SVR, SVR-HHO, and SVR-BA (LSMs). Lastly, two indices, namely MSE and AUROC, were utilized to compare and validate the models employed in the study. The AUROC values for the ANFIS-HHO, ANFIS-BA, SVR-HHO, and SVR-BA were 0.849, 0.82, 0.895, and 0.865, respectively, according to the validation results. With an AUROC of 0.895 and an MSE of 0.147, SVR-HHO was the most accurate while ANFIS-BA was the least accurate, both based on an AUROC value of 0.82. Based on the data, the SVR model is superior than the ANFIS model, and the HHO algorithm has beaten the bat approach in terms of performance. Property use planners may utilize the map created in this study to better manage their land.

Using computer-based sophisticated machine learning methodologies in the year 2021, Mandal et al. [4] built LSMs and compared the models’ performance. A total of twenty factors, including both starting and contributing components, were examined in order to properly appreciate the landslide’s spatial connection. One of the most popular machine learning techniques, convolutional neural network (CNN), was utilized to develop LSMs. Random forest (RF), artificial neural network model (ANN), and the bagging model were all used in conjunction with it. Landslide and non-landslide locations were randomly selected for training and validation datasets. The training and validation locations were selected in a ratio of 70:30. Multi-collinearity was assessed using the tolerance and variance inflation factor, while the information gain ratio was utilized to assess the significance of certain conditioning factors. Studies have shown a low degree of multi-collinearity when it comes to landslide conditioning factors, with rainfall being the most significant contributor. Using each model’s final prediction results, LSM was then split into five distinct categories, such as “very low,” low, medium, high, and very high susceptibility. Based on the landslide susceptibility class distribution, more than 90% of the landslide area is vulnerable to landslides. According to the area under the receiver operating characteristics curve (ROC) curve and statistical methodologies such as RMSE and MAE were used to evaluate the models’ accuracy (MAE). The CNN model achieved the highest AUC values in both datasets (training and validation): 0.903 and 0.939, respectively. As can be seen by the lower RMSE and MAE values, the CNN model performs better than other models. As a result, all models have performed well, but the CNN model has outperformed the other models in terms of accuracy.

With the use of sentinel-1D InSAR and MODIS data, Al-Najjar [5] hopes to demonstrate a method for predicting and mapping long-term and seasonal land surface deformation (subsidence/uplifting) and permafrost active layer thickness in Alaska’s Donnelly Training Area (DTA) by the year 2020 (ALT). SAR images were compared for coherence (or resemblance) to see whether they could be used together. They were tested for their overall quality by conducting sensitivity analyses and an accuracy review. Seasonal subsidence in June and July was forecast to be in the range of 0–0.43%, whereas the predicted uplifting from September to May of the next year was predicted to be in the range of 0.34 m. DTA’s southern and northern areas were expected to experience the majority of the long-term subsidence and rising (from 2015 to 2018). In the east river, west, and south, ALT estimates were greater, while those in the north were lower as a result of spatially variable time delays. For example, coherence estimates were considerably different from zero, and the average residual from ALT estimations was statistically indistinguishable from zero when compared to the referenced forecasts from a commonly used yearly prediction model. For seasonal surface deformation estimates, regional distributions of the uncertainties in model coefficient estimates, phase change estimates, modeling error estimates, and image pairs were similar. With the use of the InSAR pictures and MODIS data, this approach has been able to map and monitor permafrost changes in regions like DTA, where collecting field observations is difficult and costly. We also discovered that the DTA’s permafrost deformation is very variable in terms of location and time, which enabled us to develop a near-real-time monitoring system for the permafrost environment.

3 Problem Statement

LSM depends on aerial picture interpretation and field verification, but gathering aerial photos is difficult [7, 8]. Researchers have gradually applied LDM to environmental monitoring using remote sensing (RS) [9,10,11,12,13]. Initially, high-resolution laser images were used to locate large-scale landslides [14, 15]. In modest shallow landslides, laser scans cannot detect A landslide’s texture, color, and other features were compared to the surrounding ground objects in optical photographs. WorldView [17], QuickBird [18], GaoFen-2 [19], IKONOS [20], and InSAR technologies are increasingly employed to LDM [21]. Both pixel-based and object-oriented techniques are used [22]. In this method, high-resolution photographs are used to identify landslides. These and other stages of landslide detection generate error accumulation and fluctuation in accuracy (ACC) [5, 23]. However, pixel-based LDM may solve these difficulties by simply identifying single-pixel [24, 25]. In addition, support vector machines (SVM), random forests (RF), decision trees (DT), and artificial neural networks (ANN) are used (ANN). Landslides may also be retrieved from optical images [30]. Others exploited the study area’s attributes for LDM such as hydrology or evening light. However, most of the literature using RS images for LDM disregards landslide debris’ morphological, geological, and other features. So we are worried about landslides. Convolutional neural networks (CNNs) can train a large number of parameters efficiently by utilizing weight sharing and other qualities. LeNet, AlexNet, GoogleNet, VGGNet, ResNet, and DenseNet are all deep neural network models based on the basic CNN model. Ces models excel in image classification. The CNN model is often utilized in landslide research. In landslide detection, its upgraded approach residual neural networks (ResNets) has demonstrated promising results. Due to the large amount of RS data employed, the preceding strategies need long model training durations and a large number of training parameters. DenseNets are narrow networks with few parameters, reducing model training time. DenseNet excels in medical image segmentation [16]. In landslide detection, DenseNet has limited applicability. An ensemble of deep learning classifiers is proposed.

4 Methodology

In this research work, a novel landslide detection model for GIS images will be introduced by following their major phases: (i) pre-processing, (ii) feature extraction (iii) feature selection, and (iv) classification. The captured images will be pre-processed via Gabor filtering. Subsequently, the features like GLCM based texture features, temperature-vegetative index-based characteristics, Brightness Index (BR), Normalized Difference Vegetation Index (NDVI) and Green Normalized Difference Vegetation Index (GNDVI), Red-over-Green difference (RGD),Vegetation index difference (VID), Brightness difference (BRD), NDI based features and coloration index features are extracted from the pre-processed data. The extracted features will be fused together, and among those features, the optimal ones will be selected via a new hybrid optimization model. The new hybrid optimization model will be the conceptual blend of the standard Teamwork Optimization Algorithm (TOA) [31] and Poor and rich optimization algorithm (PRO) [32]. Finally, the selected optimal features are subjected to the newly constructed ensemble-of-classifiers model. The ensemble-of-classifiers model is constructed with Recurrent Neural Network (RNN), Bi-LSTM, and Bi-GRU. All these classifiers are trained with the selected optimal features acquired with the new hybrid optimization model. The ultimate outcome regarding the landslide forecasting will be acquired by fusing the outcomes acquired from RNN, Bi-LSTM and Bi-GRU as depicted in Fig. 1.

4.1 RNN

RNN is a type of neural network that has cyclic connections between its own nodes and is designed to mimic problems that a fully connected network cannot. Most neural networks, such as ANN and CNN, only build weight connections between layers; nodes between layers are disconnected, and nodes must be self-contained. In real life, however, many data are related to one another, necessitating a network model that can incorporate both prior and subsequent information, as well as handle data of any length at the same time. RNN was created to solve difficulties like these.

Fig. 1
figure 1

The architecture of the projected model

4.2 Bi-LSTM

The Bi-LSTM architecture consists of a forward LSTM and a reverse LSTM. The forward and backward layers both perform the usual LSTM function. The forward layer will compute a positive input sequence, whereas the backward layer will compute a reverse time sequence. The output of Bi-LSTMs can be described as a summation function of two hidden layer function outputs.

4.3 Bi-GRU

LSTM is proposed to tackle the “long dependencies” difficulties of regular RNNs, but it also fails to deal with very long-term and many dependencies. As a result, Bi-GRU is better suited to processing very lengthy dependencies since it may use both prior and subsequent data.

5 Results & Discussion

The proposed model has been tested in Python. The proposed model’s performance has been compared to other existing models using Type I and Type II metrics. Negative predictive value (NPV), F1-Score, and Mathews correlation coefficient (MCC) are positive measurements whereas false-positive rate (FPR), false-negative rate (FNR), and false discovery rate (FDR) are negative measures.

The ultimate outcome regarding the landslide forecasting will be acquired by fusing the outcomes acquired from RNN, Bi-LSTM and Bi-GRU. The outcome acquired from RNN is outRNN . The outcome acquired from Bi-LSTM is outBi − LSTM . The outcome acquired from Bi-GRU is outBi − GRU the final outcome is

$$ \textrm{out}=\frac{{\textrm{out}}^{\textrm{RNN}}+{\textrm{out}}^{\textrm{Bi}-\textrm{LSTM}}+{\textrm{out}}^{\textrm{Bi}-\textrm{GRU}}}{3}. $$

This outcome tells about the presence/absence of landslide in input GIS images.

An indication for evaluating the proposed prediction model’s performance is introduced to compare the the mean square error (MSE), root mean square error (RMSE), standard deviation (SD) and mean error (ME) of the simulation results. Equations (1)–(4) illustrate RMSE, MSE, SD, and ME:

$$ RMSE=\sqrt{\frac{\sum_{k=1}^n{\left({y}_k-\hat{y_k}\right)}^2}{N}} $$
(1)
$$ MSE=\frac{\sum_{k=1}^n{\left({y}_k-\hat{y_k}\right)}^2}{N} $$
(2)
$$ SD=\sqrt{\frac{1}{N}{\sum}_{k=1}^n{\left({y}_k-\mu \right)}^2} $$
(3)
$$ ME=\frac{1}{N}{\sum}_{k=1}^n{\left({y}_k-\hat{y_k}\right)}^2 $$
(4)

Where n implies a number of sample data, μ implies the observed data’s arithmetic mean, y k represents \( \hat{y_k} \) denotes the observation and prediction values.

6 Conclusion

We compared the ensemble-of-classifiers model to existing landslide detection methods using the same training and simulation parameters. The effectiveness of the ensemble of classifiers (RNN + Bi-LSTM + Bi-GRU) landslide detection system is compared with vanilla RNN + Bi-LSTM and RNN + Bi-GRU. The experiments proved that the proposed method has outperformed the RNN + Bi-LSTM and RNN + Bi-GRU both in terms of performance measures and robustness. With the hybrid optimization model, our network performs better, which achieves 87% Training Accuracy. The main problem that we faced is training images, which is a very time-consuming task. Our future work will focus on reducing the computational complexities of this landslide detection system.