Deformation prediction of reservoir landslides based on a Bayesian optimized random forest-combined Kalman filter

Zhang, Nanfang; Zhang, Wei; Liao, Ke; Zhu, Hong-hu; Li, Qi; Wang, Jingtao

doi:10.1007/s12665-022-10317-9


Original Article
Published: 23 March 2022

Volume 81, article number 197, (2022)

Environmental Earth Sciences


Nanfang Zhang¹,
Wei Zhang ORCID: orcid.org/0000-0002-6853-9518¹,
Ke Liao¹,
Hong-hu Zhu¹,
Qi Li¹ &
…
Jingtao Wang¹


Abstract

Prediction models play an important role in the early warning of reservoir landslides. This paper proposes a novel synthetic prediction model, the Bayesian optimized random forest-combined Kalman filter (BORF-KF), in which the Kalman filter and the random forest algorithm are used to predict the trend and periodic components of the cumulative landslide displacement, respectively. To improve the accuracy of the model, the Bayesian algorithm is used to optimize the hyperparameters, and periodic changes in rainfall and reservoir water level are considered. The applicability, efficiency, and accuracy of the proposed prediction model are verified against monitoring time series obtained from the Outang landslide, a giant resurrected ancient landslide in the Three Gorges Reservoir area of China. The results show that the ground deformation of the reservoir landslide exhibits notable step-like characteristics, which are strongly correlated with concentrated rainfall and decreases in the reservoir water level. Moreover, the error of the predicted cumulative displacement is less than 2%, suggesting that the BORF-KF model attains a high prediction accuracy and can be applied to the prediction of reservoir landslides with abrupt step-like sections.


Introduction

Landslides are a common geological hazard that causes heavy casualties and property losses (Chae et al. 2017). In particular, the Three Gorges area of the Yangtze River in China has experienced a great number of landslides since the impoundment of the Three Gorges Reservoir in 2003 (Tang et al. 2019; Huang et al. 2016). Developing monitoring and early warning systems is a vital measure against this threat, and landslide deformation prediction plays an important role in making such systems function (Kirschbaum et al. 2010; Zhou et al. 2016).

Considerable progress in landslide deformation predictions has been made since Saito (1965) proposed an empirical model based on the creeping nature of soils. To date, four main types of prediction models for landslide deformation have been developed; the first three are empirical, statistical, and nonlinear models (Miao et al. 2018). Empirical models are based on creep experiments and monitoring landslide data (Saito 1965; Fukuzono 1985; Crosta and Agliardi 2012); however, they have strict application conditions that often restrict their practical use. Statistical models are based on mathematical statistics and their derivatives (Deng 1988; Yin and Yan 1996; Wang et al. 2019). They are used to determine the landslide trends when the physical mechanism of the landslides is too complicated to be understood using empirical models. However, the ability of statistical models to predict landslide trends determined by multiple impact factors is still insufficient (He et al. 2008; Ren et al. 2015). Nonlinear models are based on traditional nonlinear theories (Dong et al. 2011) and mainly include intelligent models, such as neural networks (Yang et al. 2019), support vector machines (Han et al. 2020), and extreme learning machines (Cao et al. 2016). Nevertheless, for complex landslides influenced by multiple factors, nonlinear models cannot always satisfactorily forecast the landslide deformation trend (Li et al. 2012). The fourth model is the synthetic model combining the multiple prediction models mentioned above; the synthetic model can integrate the impacts of multiple factors into one model. Thus, more satisfactory landslide deformation prediction results can be obtained even though the landslide mechanism is not fully understood (Ma et al. 2017; Cai et al. 2016). In particular, prediction models combining time series with machine learning models have exhibited high accuracy concerning complex landslides (Zhou et al. 2016; Zhu et al. 2018b; Zhang et al. 2021).

Notably, the cumulative displacement of landslides is caused by both internal geological conditions and external environmental factors (Wen et al. 2017). The former include the lithology, geological structure, and progressive weathering, and the latter include rainfall and variations in the reservoir water level. Therefore, landslide displacement can be decomposed mainly into a trend component and a periodic component (Zhang et al. 2021). The trend component can be represented by a monotonic function of time, which is generally approximated using a polynomial function. The periodic component can be obtained by machine learning models, such as support vector machines (SVM), artificial neural networks (ANN), and extreme learning machines (ELM) (Zhu et al. 2018a; Guo et al. 2020; Zhou et al. 2018). However, overfitting often occurs in the training process of machine learning models and may notably lower the prediction reliability (Liang et al. 2020).

Ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any individual constituent learning algorithm; thus, these methods can lower the probability of overfitting. Moreover, hyperparameter tuning is essential for performance improvement when training machine learning models. As one of the most representative ensemble learning models, the random forest algorithm has the advantages of simplicity, high precision, strong anti-noise ability, and strong anti-overfitting ability, and has been used to predict landslide velocities (Krkač et al. 2020). Bayesian optimization is a simple but efficient optimization algorithm for hyperparameter tuning (Nguyen et al. 2018). Bayesian optimization can quickly find the global optimal solution, which may be expected to further improve the accuracy of the random forest model. However, few works have focused on predicting landslide deformation using the optimized random forest model.

In this paper, through time series analysis, the accumulated displacement was decomposed into a displacement trend and periodic displacement. The Bayesian optimized random forest (BORF) model was developed to predict the periodic displacement of the landslide deformation. Moreover, the Kalman filter, a simple but efficient statistical algorithm, was used to predict the displacement trend. The cumulative deformation of landslides can be predicted by the BORF combined Kalman filter (BORF-KF) model. The applicability, efficiency and accuracy of the novel synthetic model were verified against monitoring data obtained from the Outang landslide in Chongqing City, China.

Methodology

Deformation decomposition

Time series analysis methods are used to decompose cumulative landslide displacements into trends and periodic displacements; among them, the moving average method is the most widely used. When the time series data are influenced by the periodic variation and irregular fluctuation and the development trend is not obvious, the use of the moving average method helps to eliminate the influence of unfavorable factors and implement the analyses and prediction of long-term time series trends.

The time series of cumulative landslide displacement ${y}_{t}$ yields the following equation:

$$y_{t} = S_{t} + C_{t} ,$$

(1)

where ${S}_{t}$ and ${C}_{t}$ are the trend and periodic terms, respectively.

At moment t, the trend displacement ${S}_{t}$ yields the following equation:

$$S_{t} = \frac{{D_{t} + D_{t - 1} + \cdots + D_{t - k + 1} }}{k}\quad \left( {t = k,k + 1, \ldots ,n} \right),$$

(2)

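where ${D}_{t}$ is the cumulative displacement at moment t, k is the time interval of the moving average, i.e., the number of consecutive observations averaged, and n is the total number of observations.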
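The decomposition in Eqs. (1) and (2) can be illustrated with a minimal Python sketch. The function name `decompose` and the plain-list data layout are assumptions made here for illustration; they are not taken from the paper.

```python
def decompose(cumulative, k):
    """Split cumulative displacement D_t into a trend S_t and a periodic term C_t.

    S_t is the mean of the last k observations (Eq. 2); C_t = D_t - S_t (Eq. 1).
    The outputs start at the k-th observation, because earlier moments do not
    have a full window of k values.
    """
    if not 1 <= k <= len(cumulative):
        raise ValueError("k must be between 1 and the series length")
    trend, periodic = [], []
    for t in range(k - 1, len(cumulative)):
        window = cumulative[t - k + 1 : t + 1]  # D_{t-k+1}, ..., D_t
        s_t = sum(window) / k
        trend.append(s_t)
        periodic.append(cumulative[t] - s_t)
    return trend, periodic
```

Because the window only looks backward, the trend at moment t uses no future observations, which matters when the decomposition feeds a forecasting model.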

Kalman filter

Essentially, the Kalman filter is a type of optimal estimator, i.e., it estimates the parameters describing the underlying problem from noisy observations (Brown and Hwang 1997). It has the advantages of high accuracy and robustness (Kalman 1960). The basic principle is shown in Fig. 1. As shown in the figure, the Kalman filter estimates the state of the process at a certain point in time by feedback control, obtaining feedback in the form of measurements. Notably, the Kalman filter equations comprise two groups: the time update equations and the measurement update equations. The former project the current state and the error covariance forward in time to obtain the a priori estimate for the next time step. The latter provide the feedback, incorporating new measurements into the a priori estimate to obtain an improved a posteriori estimate. The estimation algorithm of the Kalman filter is similar to the predictor-corrector algorithms used to solve numerical problems (Maklouf et al. 2009).

The displacement trend of the cumulative landslide deformation represents its long-term trend and can be considered a function of time (Lacroix et al. 2020). Given a small observation interval, the landslide deformation is assumed to be relatively small; thus, the displacement trend can be expanded in a Taylor series at moment ${t}_{k}$, which yields the following equation:

$$x_{k} = x_{k - 1} + v_{k - 1} \left( {t_{k} - t_{k - 1} } \right) + \frac{1}{2}a_{k - 1} \left( {t_{k} - t_{k - 1} } \right)^{2} + \frac{1}{6}s_{k - 1} \left( {t_{k} - t_{k - 1} } \right)^{3} + g_{k - 1} ,$$

(3)

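where ${x}_{k}$ is the trend displacement of the cumulative displacement at moment ${t}_{k}$, and ${x}_{k-1}$, ${v}_{k-1}$ and ${a}_{k-1}$ are the trend displacement, velocity and acceleration, respectively, at moment ${t}_{k-1}$. Additionally, ${s}_{k-1}$ is the third-order time derivative of the displacement (the rate of change of the acceleration), which accounts for the effect of the third power of the time change on the deformation; ${g}_{k-1}$ is the residual term of the Taylor series.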
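A minimal sketch of how Eq. (3) can drive a one-step Kalman prediction is given here, assuming a state vector of displacement, velocity, acceleration and the third-order term, with only the displacement being measured. The noise variances `q` and `r`, the identity initial covariance and the function names are illustrative assumptions; the paper does not report these settings.

```python
def transition_matrix(dt):
    """Transition matrix for the state [x, v, a, s] implied by Eq. (3), without g."""
    return [
        [1.0, dt, dt**2 / 2.0, dt**3 / 6.0],
        [0.0, 1.0, dt, dt**2 / 2.0],
        [0.0, 0.0, 1.0, dt],
        [0.0, 0.0, 0.0, 1.0],
    ]

def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def kalman_one_step(trend, dt=1.0, q=1e-4, r=1e-2):
    """Return a priori (one-step-ahead) predictions of a trend series.

    predictions[i] is computed before trend[i] is used; predictions[0] is
    simply the initial guess. q and r are assumed process and measurement
    noise variances, not values from the paper.
    """
    n = 4
    f = transition_matrix(dt)
    f_t = [list(row) for row in zip(*f)]
    x = [trend[0], 0.0, 0.0, 0.0]
    p = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    predictions = []
    for i, z in enumerate(trend):
        if i > 0:
            # Time update: project the state and error covariance forward.
            x = [sum(f[a][b] * x[b] for b in range(n)) for a in range(n)]
            p = matmul(matmul(f, p), f_t)
            for a in range(n):
                p[a][a] += q
        predictions.append(x[0])
        # Measurement update with H = [1, 0, 0, 0] (displacement only).
        s = p[0][0] + r
        gain = [p[a][0] / s for a in range(n)]
        innovation = z - x[0]
        x = [x[a] + gain[a] * innovation for a in range(n)]
        row0 = p[0][:]
        p = [[p[a][b] - gain[a] * row0[b] for b in range(n)] for a in range(n)]
    return predictions
```

In this sketch the residual term of Eq. (3) is absorbed into the process noise, which is a common modeling choice but not necessarily the one made by the authors.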

Random forest model

Ensemble learning accomplishes the target task by building and integrating multiple base learners. Multiple base learners combined by a suitable strategy can achieve better generalization performance than any single learner. A decision tree is a common base learner for ensemble learning that follows a divide-and-conquer strategy. Bagging is the most prominent representative of parallel ensemble learning methods and is based on bootstrap sampling; in this method, the learning process generates multiple training sets by bootstrap sampling and develops a base learner for each training set.

A random forest is an ensemble learning model based on the bagging method that uses decision trees as the base learners (Breiman 2001). The random forest is trained by performing feature selection over randomly selected attributes. Notably, the feature selection uses relevant feature subsets to develop a learning method with strong robustness (Blum and Langley 1997). Each node of a base decision tree is split on the optimal attribute within a random subset of attributes (Norouzi and Shahmohammadi-Kalalagh 2019). The random forest enriches the diversity of the base learners through sample and attribute perturbation, reducing the instability, improving the generalization performance, and enhancing the robustness of the model (Belgiu and Drăguţ 2016; Prasad et al. 2006). In addition, bagging reduces overfitting, which allows the random forest to exhibit high accuracy and strong robustness when handling data with missing values, outliers, and noise.

The flowchart of the random forest is shown in Fig. 2. As shown in the figure, n samples are selected by bootstrap sampling as a sample subset, and m attributes are randomly selected for feature selection to construct a single decision tree. The process is repeated n times to construct n decision trees, and finally, the results of each decision tree are combined as the final output of the model.
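The sample and attribute perturbation described above can be sketched as follows. This is only an illustration of the sampling and averaging steps, with hypothetical function names; it does not grow the decision trees themselves. It follows the per-tree attribute selection described in the text, whereas Breiman's formulation draws a fresh attribute subset at every node split.

```python
import random

def draw_tree_inputs(n_samples, n_features, m, n_trees, seed=0):
    """Draw, for each tree, a bootstrap sample of rows and m random attributes."""
    rng = random.Random(seed)
    plans = []
    for _ in range(n_trees):
        # Bootstrap sampling: n_samples row indices drawn with replacement.
        rows = [rng.randrange(n_samples) for _ in range(n_samples)]
        # Attribute perturbation: m distinct attribute indices.
        cols = sorted(rng.sample(range(n_features), m))
        plans.append((rows, cols))
    return plans

def aggregate(tree_predictions):
    """Combine the outputs of the individual trees by averaging (regression)."""
    return sum(tree_predictions) / len(tree_predictions)
```

Each `(rows, cols)` pair would be passed to a decision tree learner, and `aggregate` would then combine the per-tree predictions into the model output.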

Bayesian optimization

Bayesian optimization (BO) (Ghahramani 2015) is an approximation method that is effective for global optimization. Based on Bayes' theorem, BO estimates the posterior distribution of the unknown objective function from the results of previous evaluations and constructs a surrogate function from it. Thus, the next hyperparameter combination that is most likely to approach the optimal solution can be promptly found (Donald et al. 1998). BO makes full use of the information from previous sampling points to find the parameter combination that maximizes the improvement of the objective function, such that the search efficiency is enhanced by learning the shape of the objective function. The optimization problem solved by BO is as follows:

$$x^{*} = \mathop {\arg \min }\limits_{x \in X} f\left( x \right),$$

(4)

where $x$ denotes the parameter combination to be optimized; $X$ denotes the set of candidate parameter combinations; $f\left(x\right)$ denotes the objective function; and $\mathrm{arg\,min}$ returns the value of $x$ at which $f\left(x\right)$ attains its minimum.

The core components of BO are the probabilistic surrogate model and the acquisition function. The probabilistic surrogate model stands in for the unknown objective function, and its accuracy is improved by continuously correcting the prior probabilities as evaluation data accumulate over the iterations. In accordance with the posterior distribution, the acquisition function samples both from regions where the global optimum is most likely to lie and from regions that have not yet been sampled, searching the candidate set for the solution that minimizes the loss function. The probabilistic model of BO uses a Gaussian process as the surrogate for the complex black-box function (Sano et al. 2020) and introduces prior knowledge of the objective function to be optimized, such that sampling redundancy is reduced. Additionally, local neighborhood information can be used for effective inference to select the potential point matching the optimal solution more accurately (Cui and Yang 2018). Compared with other optimization methods, such as maximum likelihood estimation, Bayesian optimization is less likely to become trapped in a local optimum.
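For illustration, one widely used acquisition function is the expected improvement (EI). The paper does not state which acquisition function was used, so this choice is an assumption. Assuming a Gaussian-process surrogate with posterior mean $\mu(x)$ and standard deviation $\sigma(x) > 0$, and writing $f_{\min}$ for the smallest objective value observed so far (all three symbols are introduced here), EI for the minimization problem in Eq. (4) can be written as:

$$\mathrm{EI}\left( x \right) = \left( {f_{\min } - \mu \left( x \right)} \right)\Phi \left( z \right) + \sigma \left( x \right)\varphi \left( z \right),\quad z = \frac{{f_{\min } - \mu \left( x \right)}}{{\sigma \left( x \right)}},$$

where $\Phi$ and $\varphi$ are the standard normal cumulative distribution and density functions. The first term favors points whose predicted value is already low, and the second favors points where the surrogate is uncertain; the next hyperparameter combination to evaluate is the one that maximizes $\mathrm{EI}\left( x \right)$.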

Modeling procedure

The whole modeling process of the BORF-KF model for landslide deformation prediction is shown in Fig. 3. As seen from the figure, the landslide cumulative displacement is decomposed into the trend and periodic displacements. The displacement value of the trend term is predicted using the Kalman filter, while that of the periodic term is predicted using the random forest model. In particular, the hyperparameters of random forest are tuned by Bayesian optimization. Finally, the predicted values of the trend and periodic displacements are superposed to obtain the predicted values of the final cumulative displacement.

Case study

Geological conditions

The Outang landslide is an ancient landslide located in Fengjie County, Chongqing City, China, within the region bounded by 30° 56′ 57″–30° 58′ 1″ N and 109° 20′ 41″–109° 21′ 14″ E. It is approximately 12 km upstream of Fengjie County and 177 km downstream from the Three Gorges Reservoir, as shown in Fig. 4a and b. Figure 4c shows a panoramic view of the frontal landform of the Outang landslide. As seen from the figure, the leading edge of the Outang landslide has thrust into the Yangtze River, the details of which are shown in Fig. 5. The landslide area has a subtropical monsoon climate, with an average annual temperature of 16.3 °C and an average annual rainfall of 1147.9 mm. Rainfall is abundant and is concentrated in the flood season, from May to September each year.

The topography of the landslide is a shallow- to medium-cut monoclinic low mountain valley, with the tendency of the rock layer nearly consistent with the slope direction. The Yangtze River flows from the north side to the northeast from the west, with an angle of 10°–15° to the rock layer direction. As shown in Fig. 4c, the main slide direction of the landslide is 345° north. Overall, the landslide is shaped like an inverted ancient bell, wide at the front and narrow at the back, with an area of 171 × 10⁴ m² and a total volume of 7510 × 10⁴ m³. The elevation of the front edge of the landslide ranges from 90 to 102 m; the elevation of the back edge is approximately 705 m; the north–south length of the slide body ranges from 1640 to 2230 m; and the east–west width ranges from 550 to 1300 m. Additionally, the average thickness of the slide body is approximately 44 m, and the maximum thickness is 114 m.

The Outang landslide is located in a secondary tectonic unit of the Yangzi para-tectonic platform at the intersection of the Sichuan platform syncline and the Upper Yangzi platform fold. Locally, the landslide area is also located in the southeast wing of the syncline end of Gulling town, without any regional fracture. The Outang landslide is an exceptionally large and deep compliant rocky landslide, with the topography being high in the south and low in the north and a steep top and a gentle bottom.

The landslide area is located on the south bank of the Yangtze River, where a seasonal ditch is developed with a cutting depth of 3–6 m. The groundwater at the front edge of the landslide is closely linked to the Yangtze River and the Three Gorges Reservoir area with good connectivity; the groundwater level is mainly affected by the regulation of the reservoir water level.

The groundwater in the middle and back edge of the landslide has insufficient recharge and poor groundwater storage conditions. The overall groundwater level of the west side is higher than that of the east side, owing to the groundwater runoff from the southeast to the northwest. Being blocked by the rock ridge in the northeast, the groundwater runoff turns to the north side of the Yangtze River for discharge and finally gathers in the western part of the region.

Data acquisition

The Outang landslide comprises three blocks, as shown in Fig. 6. Block 1 is located at the bottom, exhibiting an inverted ball shape, with an elevation of 90–370 m, an area of 92.2 × 10⁴ m², and a volume of 6480 × 10⁴ m³. In particular, two intensive deformation zones, located in the western and eastern regions, exist in Block 1. Notably, the leading edge of Block 1 is seasonally submerged and exposed as the reservoir water level varies. Block 2, irregularly shaped and extending its front to the top of Block 1, is located in the middle, with an elevation of 250–530 m, an area of 31.6 × 10⁴ m², and a volume of 1020 × 10⁴ m³. Block 3, which is shaped like an inverted ancient clock, is located at the top, with an elevation of 400–705 m, an area of 54.3 × 10⁴ m², and a volume of 1450 × 10⁴ m³.

As an ancient landslide, the Outang landslide was resurrected after the experimental impoundment of the Three Gorges Reservoir in June 2008. Deformation has continued over a long period without ceasing, resulting in building damage and ground fissures over a wide area of 171 × 10⁴ m². Professional monitoring was initiated by the government of Fengjie County in December 2010, mainly by means of GNSS for ground movement, borehole inclinometers for deep displacement, sensors for rainfall and reservoir water level, and apparent inspection. Typical GNSS monitoring points are selected to verify the prediction performance of the proposed BORF-KF model using the measured time series data of the surface deformation. As noted in Fig. 6, three selected monitoring points, MJ01, MJ08, and MJ20, are located in Blocks 1, 2, and 3, respectively. In particular, MJ01 is within the intensive deformation area. The engineering geology profile of section A–A′ in Fig. 6 is illustrated in Fig. 7.

Landslide deformation characteristics

The measured data were fitted using the least squares interpolation method and then transformed into an equal-interval time series with an interval of 15 days. Figure 8 shows the time series of the monitoring data of the Outang landslide from December 6, 2010, to April 8, 2016. The measurements of the monitoring points in Fig. 6 comprise three types, namely, rainfall, the reservoir water level, and the cumulative ground deformation. As shown in the figure, the general ground deformation trend is strongly correlated with those of the rainfall and reservoir water level. Additionally, the cumulative ground deformation of the landslide increases during the flood season and slows during the rest of the year. During the monitoring period, the cumulative ground deformation did not exhibit an obvious convergence trend.

Rainfall is an external driver of reservoir landslides (Tomas et al. 2014; Li et al. 2010). Rainfall infiltration not only increases the hydrostatic pressure but also generates dynamic water pressure, which reduces the effective stress inside the soil and leads to slope instability. Previous studies on the relationship between landslides and rainfall show that the cumulative rainfall in the 30 days before the landslide has a remarkable effect on the deformation of the landslide (Du et al. 2013; Roering et al. 2015). Additionally, the periodic variation in the reservoir water level is the main factor that induces the step-like deformation of the landslide (Jiao et al. 2014; Guo et al. 2017). Moreover, the variation in the reservoir water level is influenced by rainfall. Herein, the rainfall over the past 15 and 30 days and the variation in the reservoir water level over 15 and 30 days are depicted in Fig. 8. As seen from the figure, the cumulative landslide displacement is positively correlated with rainfall; in addition, the decrease in the reservoir water level is positively correlated with the step-like deformation events of the landslide.

The deformation characteristics of landslides induced by external factors vary at different stages of landslide evolution (Glade et al. 2005). Given the evolution stage of the Outang landslide, the displacements measured 15 and 30 days before the current moment are taken as the state factors in the prediction model of periodic displacement (Zhou and Yin 2014). With the identification coefficient set to 0.5 (Zhang et al. 2019), gray correlation theory is used to quantitatively characterize the correlation of each factor. When the correlation coefficient of a factor exceeds 0.6, the factor is considered strongly correlated with the periodic displacement of the Outang landslide. Table 1 lists the gray correlation coefficients for the three monitoring points, all of which are greater than 0.6, indicating that the six factors, i.e., the rainfall over the past 15 days and 30 days, the variation in the reservoir water level over the past 15 days and 30 days, and the displacement variation over the past 15 days and 30 days, are all strongly correlated with the periodic displacement of the Outang landslide.
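As a sketch of how such a gray correlation coefficient can be computed for one factor, the following Python function applies the usual gray relational formula with the identification coefficient `rho` set to 0.5. The min-max normalization and the function name are assumptions made here; the paper does not describe its preprocessing. When several factors are compared, the minimum and maximum differences are usually taken over all factors together rather than per factor as in this simplified version.

```python
def gray_relational_grade(reference, factor, rho=0.5):
    """Gray relational grade between a reference series and one factor series."""
    def normalize(series):
        # Min-max scaling so that series with different units are comparable.
        lo, hi = min(series), max(series)
        if hi == lo:
            return [0.0] * len(series)
        return [(v - lo) / (hi - lo) for v in series]

    ref = normalize(reference)
    fac = normalize(factor)
    deltas = [abs(a - b) for a, b in zip(ref, fac)]
    d_min, d_max = min(deltas), max(deltas)
    if d_max == 0.0:
        return 1.0  # identical series after normalization
    # Point-wise relational coefficients, then their mean.
    coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in deltas]
    return sum(coeffs) / len(coeffs)
```

A factor would then be retained as an input when its grade exceeds the 0.6 threshold used in the text.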

Table 1 Gray correlation coefficients


Results

Cumulative displacement decomposition

Based on the measurements obtained at the three monitoring points, a prediction model of the ground deformation of the Outang landslide is developed. Given the three monitoring points, the whole dataset is divided into a training set and a test set.

Consider a hydrological year in Fig. 8 as a study period containing 24 sets of monitoring data. The moving average method is used to extract the trend term from the cumulative displacement, and then the trend displacement is subtracted from the cumulative displacement to obtain the periodic displacement.

Trend displacement prediction

Based on the extracted trend displacement data, the Kalman filter model was used for one-step prediction, and the results were compared with those of the polynomial function, as shown in Fig. 9. As seen from the figure, at all three monitoring points, the trend displacement predicted by the Kalman filter agrees very well with the measured trend displacement, whereas the trend displacement predicted by the polynomial function diverges slightly from it. Moreover, the errors of the prediction results obtained using both models are compared in Table 2, where the mean absolute percentage error (MAPE) is used for the error analysis. The results show that the prediction accuracy of the Kalman filter is higher than that of the polynomial function, with a relative reduction in MAPE of at least 81%.
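For reference, the MAPE and the relative improvement between two models can be computed as in the following sketch. The function names are chosen here for illustration, and the relative-improvement definition is an assumption about how the reported percentages were obtained.

```python
def mape(measured, predicted):
    """Mean absolute percentage error, in percent.

    Measured values must be nonzero, since each error is divided by the
    corresponding measured value.
    """
    ratios = [abs((m - p) / m) for m, p in zip(measured, predicted)]
    return 100.0 * sum(ratios) / len(ratios)

def relative_improvement(mape_baseline, mape_model):
    """Percentage by which mape_model is lower than mape_baseline."""
    return 100.0 * (mape_baseline - mape_model) / mape_baseline
```

For example, `mape([100.0, 200.0], [110.0, 180.0])` averages two 10% errors, and `relative_improvement` would be called with the polynomial MAPE as the baseline and the Kalman filter MAPE as the model value.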

Table 2 Error analysis


Periodic displacement prediction

The BO and the random search algorithm (RSO) were used to optimize the four main hyperparameters of the random forest model: the number of decision trees, the minimum number of samples required to split a node, the maximum number of features, and the maximum depth of a single decision tree. The values of the hyperparameters optimized by both algorithms are listed in Table 3. Both hyperparameter combinations were used in the random forest model to predict the periodic displacement of the Outang landslide.
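The random search baseline can be sketched as follows. The search ranges are hypothetical and are not the ones used in the paper; the hyperparameter names are borrowed from the scikit-learn random forest parameters for readability, which is an assumption about tooling. The `objective` argument stands for any user-supplied function that trains a random forest with the given hyperparameters and returns a validation error.

```python
import random

# Hypothetical integer search ranges for the four hyperparameters.
SEARCH_SPACE = {
    "n_estimators": (10, 500),      # number of decision trees
    "min_samples_split": (2, 20),   # minimum samples required to split a node
    "max_features": (1, 6),         # at most the six input factors
    "max_depth": (2, 30),           # maximum depth of a single tree
}

def random_search(objective, space, n_iter=50, seed=0):
    """Evaluate n_iter random combinations and keep the one with the lowest error."""
    rng = random.Random(seed)
    best_params, best_score = None, float("inf")
    for _ in range(n_iter):
        params = {name: rng.randint(lo, hi) for name, (lo, hi) in space.items()}
        score = objective(params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score
```

Unlike BO, each draw here ignores the results of earlier evaluations, which is the main difference between the two tuning strategies compared in Table 3.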

Table 3 Values of optimized hyperparameters


For the three monitoring points, Fig. 10 shows the periodic displacement prediction results obtained using the BORF and the random search-optimized RF, i.e., RSORF. As seen from the figure, the curves of both models are in general agreement with that of the measured periodic displacement. Table 4 compares the accuracy of both models in detail. The MAPE values of the BORF model are all less than 15%, within the acceptable error range, while some MAPE values of the RSORF model exceed 15%, which is outside the acceptable range. The prediction accuracy of the BORF model is notably higher than that of the RSORF model, with a relative reduction in MAPE of at least 21%, suggesting that the BORF model can obtain satisfactory accuracy in predicting the periodic displacement.

Table 4 Periodic displacement prediction


Cumulative displacement prediction

The predicted trend displacement was superposed on the predicted periodic displacement in accordance with the time series, giving the predicted cumulative displacement of the Outang landslide, as shown in Fig. 11. As shown in the figure, at all three monitoring points, the predicted cumulative displacements of the landslide are in good agreement with the measured values. Table 5 gives the MAPE values of the predicted cumulative displacement, of which the maximum is 1.89%, suggesting that the proposed BORF-KF model has a high prediction accuracy.

Table 5 Cumulative displacement error


Discussion

Although the implementation of the Three Gorges Dam was a great engineering achievement, the construction and operation of the reservoir inevitably caused potential negative impacts on the hydrological environment, aquatic and terrestrial ecosystems, regional climate, and soil conservation in the middle and lower reaches of the Yangtze River (An et al. 2009; Gao et al. 2010; Yu et al. 2014). Reservoir impoundment also varied the regional engineering geological and hydrogeological conditions, leading to frequent reservoir landslides, among which the Outang landslide is a typical representative.

Since the experimental impoundment of the Three Gorges Reservoir in 2008, the periodic rise and fall of the reservoir water level has resulted in the rise of the water-level fluctuation zone, within which the seepage pressure varies (Jiang et al. 2011). Additionally, the leading edge of the sliding mass is partially immersed in the reservoir water, which reduces the sliding resistance and local stability and leads to local collapse. More importantly, the infiltration of rainfall increased the sliding force and weakened the slip resistance. Eventually, all of the abovementioned factors jointly induced the resurrection of the Outang landslide. Ground deformation has occurred continuously since the resurrection of the Outang landslide, causing building damage and fissures within the landslide area of 171 × 10⁴ m². In June 2013, the government of Fengjie County enacted emergency reinforcement measures, including backfilling, toe compression and the placement of lattice revetments on the east and west sides of Block 1, which significantly slowed the ground deformation rate of the landslide region. Nevertheless, the measurements in Fig. 8 show that the ground deformation did not exhibit a significant convergence trend during the monitoring period, which is mainly attributed to soil creep.

The Kalman filter is a one-step predictive model that can be used to track continuously changing systems. Its distinctive advantage is the ability to correct estimates based on the latest measurements through dynamically weighted correction. The random forest algorithm, as a typical representative of ensemble learning, can effectively mitigate the overfitting problem to obtain high prediction accuracy. In addition, compared with regular hyperparameter tuning measures, the Bayesian optimization algorithm can implement global optimization such that the main hyperparameters of the random forest are optimized to further improve the prediction accuracy. A synthetic model, BORF-KF, is developed based on these three techniques to predict the ground deformation of reservoir landslides. In this study, the prediction results using the BORF-KF model agree well with the measured data. In particular, satisfactory prediction accuracy is obtained even in the step-like periods, highlighting the application potential of the BORF-KF model. However, the proposed method cannot be applied in regions where soils may exhibit considerable strain softening, which may trigger sudden rapid dynamic landslide movement such as that which occurred during the Vaiont landslide (Ciabati 1964; Tika and Hutchinson 1999; Stamatopoulos and Di 2015).

Conclusion

The Outang landslide is an ancient giant rocky landslide resurrected after the experimental impoundment of the Three Gorges Reservoir in 2008. In this paper, a synthetic model based on a Kalman filter and Bayesian optimized random forest is proposed to predict the displacement of the Outang landslide. The following conclusions are drawn from this study:

1. The impoundment of the Three Gorges Reservoir changed the regional engineering geological and hydrogeological conditions, leading to frequent reservoir landslides, among which the Outang landslide is a typical representative. The resurrection of the Outang landslide was jointly induced by rainfall, variations in the reservoir water level, and soil creep.
2. The ground deformation of the Outang landslide exhibits notable step-like characteristics, which are positively correlated with concentrated rainfall and decreases in the reservoir water level. The ground deformation did not exhibit convergence during the monitoring period from December 2010 to April 2016.
3. The comparison of the predicted and monitored time series indicates that the BORF-KF model attains a high prediction accuracy (cumulative deformation error less than 2%) from the initial to the step-like stages of the landslide, suggesting that such a synthetic prediction model holds much potential for predicting the ground deformation of reservoir landslides.

References

An Q, Wu YQ, Taylor S, Zhao B (2009) Influence of the Three Gorges Project on saltwater intrusion in the Yangtze River Estuary. Environ Geol 56:1679–1686
Belgiu M, Drăguţ L (2016) Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogramm Remote Sens 114:24–31
Blum AL, Langley P (1997) Selection of relevant features and examples in machine learning. J Artif Intell 97(1–2):245–271
Breiman L (2001) Random Forests. Mach Learn 45(1):5–32
Brown RG, Hwang PYC (1997) Introduction to random signals and applied Kalman filtering, 3rd edn. Wiley, New York
Cai Z, Xu W, Meng Y, Shi C, Wang R (2016) Prediction of landslide displacement based on GA-LSSVM with multiple factors. Bull Eng Geol Environ 75(2):637–646
Cao Y, Yin K, Alexander DE, Zhou C (2016) Using an extreme learning machine to predict the displacement of step-like landslides in relation to controlling factors. Landslides 13(4):725–736
Chae B, Park H, Catani F, Simoni A, Berti M (2017) Landslide prediction, monitoring and early warning: a concise review of state-of-the-art. Geosci J 21(6):1033–1070
Ciabati M (1964) La dinamica della frana del Vaiont. G Geol 32:139–154
Crosta GB, Agliardi F (2012) How to obtain alert velocity thresholds for large rockslides. Phys Chem Earth 27(36):1557–1565
Cui J, Yang B (2018) Overview of Bayesian optimization methods and applications. J Softw 29(10):3068–3090 (In Chinese)
Deng J (1988) Grey forecasting and decision making. Huazhong University of Science and Technology Press, Wuhan, pp 86–128
Donald RJ, Matthias S, William JW (1998) Efficient global optimization of expensive black-box functions. J Global Optim 13(4):455–492
Dong J, Tung Y, Chen C, Liao J, Pan Y (2011) Logistic regression model for predicting the failure probability of a landslide dam. Eng Geol 117(1):52–61
Du J, Yin K, Lacasse S (2013) Displacement prediction in colluvial landslides, Three Gorges Reservoir, China. Landslides 10(2):203–218
Fukuzono T (1985) A new method for predicting the failure time of a slope. In: Proceedings of the 4th international conference and field workshop on landslides, Tokyo. University Press, Tokyo, pp 145–150
Gao X, Zeng Y, Wang J, Liu H (2010) Immediate impacts of the second impoundment on fish communities in the Three Gorges Reservoir. Environ Biol Fish 87:163–173
Ghahramani Z (2015) Probabilistic machine learning and artificial intelligence. Nature 521(7553):452–459
Glade T, Anderson M, Crozier MJ (2005) Landslide hazard and risk: issues, concepts and approach. In: Crozier MJ, Chichester GT (eds) Landslide hazard and risk. Wiley, pp 1–40
Guo Z, Chen L, Gui L, Du J, Yin K, Do HM (2020) Landslide displacement prediction based on variational mode decomposition and WA-GWO-BP model. Landslides 17:567–658
Guo Z, Yin K, Tang Y, Huang F, Fu X (2017) Stability evaluation and prediction of Maliulin landslide under reservoir water level decline and rainfall. Geol Sci Technol Inform 36(4):260–265 (In Chinese)
Han H, Shi B, Zhang L (2020) Prediction of landslide sharp increase displacement by SVM with considering hysteresis of groundwater change. Eng Geol 280:105876
He K, Li X, Yan X, Guo D (2008) The landslides in the Three Gorges Reservoir Region, China and the effects of water storage and rain on their stability. Environ Geol 55:55–63
Huang F, Yin K, Zhang G, Gui L, Yang B, Liu L (2016) Landslide displacement prediction using discrete wavelet transform and extreme learning machine based on chaos theory. Environ Earth Sci 75(20):1376
Jiang JW, Ehret D, Xiang W, Rohn J, Huang L, Yan SJ, Bi RN (2011) Numerical simulation of Qiaotou Landslide deformation caused by drawdown of the Three Gorges Reservoir, China. Environ Earth Sci 62:411–419
Jiao Y, Zhang H, Tang H, Zhang X, Adoko AC, Tian H (2014) Simulating the process of reservoir-impoundment-induced landslide using the extended DDA method. Eng Geol 182:37–48
Kalman RE (1960) A new approach to linear filtering and prediction problems. Trans Asme J Basic Eng 82:35–45
Kirschbaum DB, Adler R, Hong Y, Hill S, Lerner-Lam A (2010) A global landslide catalog for hazard applications: method, results, and limitations. Nat Hazards 52(3):561–575
Krkač M, Špoljarić D, Bernat S, Arbanas SM (2017) Method for prediction of landslide movements based on random forests. Landslides 14(3):947–960
Lacroix P, Handwerger AL, Bièvre G (2020) Life and death of slow-moving landslides. Nat Rev Earth Environ 1:404–419
Li D, Yin K, Leo C (2010) Analysis of Baishuihe landslide influenced by the effects of reservoir water and rainfall. Environ Earth Sci 60:677–687
Li X, Kong J, Wang Z (2012) Landslide displacement prediction based on combining method with optimal weight. Nat Hazards 61:635–646
Liang W, Sari A, Zhao G, Mckinnon S, Wu H (2020) Short-term rockburst risk prediction using ensemble learning methods. Nat Hazards 104:1923–1946
Ma J, Tang H, Liu X, Hu X, Sun M, Song Y (2017) Establishment of a deformation forecasting model for a step-like landslide based on decision tree C5.0 and two-step cluster algorithms: a case study in the three Gorges Reservoir area, China. Landslides 14:1275
Maklouf OM, Halwagy YE, Beumi M, Hassan SD (2009) Cascade Kalman filter application in GPS/INS integrated navigation for car like robot. In: 2009 Radio science conference IEEE, New Cairo, pp 1–15
Miao F, Wu Y, Xie Y, Li Y (2018) Prediction of landslide displacement with step-like behavior based on multialgorithm optimization and a support vector regression model. Landslides 15:475–488
Nguyen TD, Gupta S, Rana S, Venkatesh S (2018) Stable Bayesian optimization. Int J Data Sci Anal 6(4):327–339
Norouzi H, Shahmohammadi-Kalalagh S (2019) Locating groundwater artificial recharge sites using random forest: a case study of Shabestar region, Iran. Environ Earth Sci 78(13):380.1–380.11
Prasad AM, Iverson LR, Liaw A (2006) Newer classification and regression tree techniques: bagging and random forests for ecological prediction. Ecosystems 9:181–199
Ren F, Wu X, Zhang K, Niu R (2015) Application of wavelet analysis and a particle swarm-optimized support vector machine to predict the displacement of the Shuping landslide in the Three Gorges, China. Environ Earth Sci 73(8):4791–4804
Roering JJ, Mackey BH, Handwerger AL, Booth AM, Schmidt DA, Bennett GL, Cerovski-Darriau C (2015) Beyond the angle of repose: a review and synthesis of landslide processes in response to rapid uplift, Eel River, Northern California. Geomorphology 236:109–131
Saito M (1965) Forecasting the time of occurrence of a slope failure. In: Proceedings of the 6th international mechanics and foundation engineering, Montreal, Que. Pergamon Press, Oxford, pp 537–541
Sano S, Kadowaki T, Tsuda K, Kimura S (2020) Application of Bayesian optimization for pharmaceutical product development. J Pharm Innov 15:333–343
Stamatopoulos CA, Di B (2015) Analytical and approximate expressions predicting post-failure landslide displacement using the multi-block model and energy methods. Landslides 12:1207–1213
Tang H, Wasowski J, Jung C (2019) Geohazards in the three Gorges Reservoir Area, China-Lessons learned from decades of research. Eng Geol 261(9):105267
Tika TE, Hutchinson JN (1999) Ring shear tests on soil from the Vaiont landslide slip surface. Geotechnique 49(1):59–74
Tomas R, Li Z, Liu P, Singleton A, Hoey T, Cheng X (2014) Spatiotemporal characteristics of the Huangtupo landslide in the Three Gorges region (China) constrained by radar interferometry. Geophys J Int 197(1):213–232
Wang W, Li J, Qu X, Han Z, Liu P (2019) Prediction on landslide displacement using a new combination model: a case study of Qinglong landslide in China. Nat Hazards 96:1121–1139
Wen T, Tang H, Wang Y, Lin C, Xiong C (2017) Landslide displacement prediction using the GA-LSSVM model and time series analysis: a case study of Three Gorges Reservoir, China. Nat Hazards Earth Syst Sci 17(12):2181–2198
Yang B, Yin K, Lacasse S, Liu Z (2019) Time series analysis and long short-term memory neural network to predict landslide displacement. Landslides 16(4):677–694
Yin K, Yan T (1996) Landslide prediction and related models. Chin J Rock Mech Eng 01:1–8
Yu S, Yang J, Liu G (2014) Impact assessment of Three Gorges Dam’s impoundment on river dynamics in the north branch of Yangtze River estuary, China. Environ Earth Sci 72:499–509
Zhang L, Shi B, Zhu H, Yu X, Han H, Fan X (2021) PSO-SVM-based deep displacement prediction of Majiagou landslide considering the deformation hysteresis effect. Landslides 18:179–193
Zhang W, Xiao R, Shi B, Zhu H, Sun Y (2019) Forecasting slope deformation field using correlation grey model updated with time correction factor and background value optimization. Eng Geol 260:105215
Zhou C, Yin K (2014) Landslide displacement prediction of WA-SVM coupling model based on chaotic sequence. Electron J Geotech Eng 19:2973–2987
Zhou C, Yin K, Cao Y, Ahmed B (2016) Application of time series analysis and PSO–SVM model in predicting the Bazimen landslide in the Three Gorges Reservoir, China. Eng Geol 204:108–120
Zhou C, Yin K, Cao Y, Intrieri E, Ahmed B, Catani F (2018) Displacement prediction of step-like landslide by applying a novel kernel extreme learning machine method. Landslides 15:2211–2225
Zhu X, Ma S, Xu Q, Liu W (2018a) A WD-GA-LSSVM model for rainfall-triggered landslide displacement prediction. J Mt Sci 15:156–166
Zhu X, Xu Q, Tang M, Li H, Liu F (2018b) A hybrid machine learning and computing model for forecasting displacement of multifactor-induced landslides. Neural Comput Appl 30(12):3825–3835

Acknowledgements

The authors would like to thank the editors and anonymous reviewers for their helpful comments and suggestions.

Funding

This study was financially supported by the National Key R & D Program of China (Grant no. 2018YFC1505104), the National Science Foundation of China (Grant nos. 42077232, 42077235), and the Science and Technology Foundation of Suzhou City (Grant no. SYG202132).

Author information

Authors and Affiliations

School of Earth Sciences and Engineering, Nanjing University, Nanjing, 210023, China
Nanfang Zhang, Wei Zhang, Ke Liao, Hong-hu Zhu, Qi Li & Jingtao Wang

Corresponding author

Correspondence to Wei Zhang.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Zhang, N., Zhang, W., Liao, K. et al. Deformation prediction of reservoir landslides based on a Bayesian optimized random forest-combined Kalman filter. Environ Earth Sci 81, 197 (2022). https://doi.org/10.1007/s12665-022-10317-9

Received: 25 March 2021
Accepted: 06 March 2022
Published: 23 March 2022
DOI: https://doi.org/10.1007/s12665-022-10317-9
