1 Introduction

Landslide is one of the most common natural disasters worldwide, particularly in reservoir banks and mountainous areas, posing a significant threat to human life and loss of their properties [10, 30]. With the rapid economic development in China, the construction of hydropower projects has become a top priority. The Three Gorges Reservoir (TGR) project in China is a prominent example of such a project. Due to the complex geological environment, the TGR is susceptible to numerous rock mass movements. Based on an incomplete survey, approximately 5000 landslides have occurred since the initial impoundment of the TGR Dam in 2003, with around 3000 of them being wading rock landslides [13, 33, 34]. Landslides are more prone to creep and sudden change with RWL variation or rainfall [29, 37, 44].

In recent years, many scholars have analyzed and predicted the occurrence of landslides caused by both rainfall and RWL [19, 24]. They have also proposed some effective models for landslides prediction [28, 32]. Currently, these methods employed for analyzing landslide displacement mainly involve laboratory physical model tests and time series machine learning model [26]. The laboratory physical model is derived from a wide range of actual observed creep tests. It has a solid physical foundation and can be used for effective prediction of landslides. However, this model has strict limitations [11, 23, 33]. With the development of computer technology, nonlinear machine learning models based on nonlinear theory have been gradually developed, including traditional nonlinear models [17], neural networks [4, 14], XGboost [42, 43], SVR [3, 22, 31, 45] and extreme learning machines [1]. As science and technology gradually improved, artificial intelligence scientific algorithm began to flourish, which in turn led to the fruitful results obtained from the prediction of landslide displacement. Zhang [39] applied an advanced deep machine learning method-gated recurrent unit (GRU) to predict the displacement of Jiuxianping landslide in Yunyang County, Chongqing. Zhou and Yin [45] used the displacement of Bazimen landslide, in TGR, as an example to study typical step-like landslides displacement. Based on the time series theory, the average movement method was used to divide the total displacement of Bazimen landslide into trend items and periodic items. Polynomial fitting method and PSO-SVM model were used for its prediction. Han and Shi [7] used Support Vector Classification (SVC) to classify and identify the displacement of Majiagou landslide and used PSO-SVR to predict its abrupt displacement section. Single intelligent optimization algorithm used for data processing leads to incomplete identification, easy to local optimization, slow convergence and stagnation, which result in large errors. Multiple hybrid models have stronger prediction and improved generalization ability [38, 40, 46]. Single optimization algorithm needs to be improved, so it is necessary to apply hybrid intelligent optimization algorithm model [43].

From the displacement monitoring curve, landslides can be divided into four categories: stable-type landslides, exponential-type landslides, step-like landslides and convergence-type landslides [22]. For the displacement of step-like deformation of wading rock landslides, there are more unpredictable disasters, and it has been a hot issue [16, 20]. Landslide displacement prediction is mainly divided into three stages: empirical prediction stage, statistical prediction stage and nonlinear prediction stage [38]. In the nonlinear prediction stage, the total displacement of landslides accumulates periodic displacement by trend displacement. The trend term is always controlled by the internal evolution of the landslide and its own conditions, and the periodic term is induced by external factors, such as rainfall, RWL, etc. [35]. The trend displacement is predicted by function fitting method, and the periodic displacement can be predicted by machine learning methods [12]. Choosing a reasonable and fast method for decomposing the total displacement is the primary factor in verifying its reliability [9]. Displacement time series decomposition methods include moving average method, double exponential smoothing (DES), discrete wavelet transform (DWT), empirical mode decomposition (EMD), etc. [15]. Filter method is simple, but it needs to determine the trend item type. Wavelet method does not need to determine the trend, but it needs to estimate the basic function in advance. The process of smoothing method is simple, but the parameter selection is complex, and the displacement of trend term is step-like. Empirical mode method can easily be used to mix different modal signals [3, 21]. SSA is a method that deals with nonlinear time series data. It has no model, does not require stationary time series and does not assume a parametric model. It can be widely used in various time series. The time series to be studied are decomposed and reconstructed to identify different signals in the original sequence, so as to extract the trend term, periodic term, etc. [25]. However, field monitoring data sets are often contaminated and highly nonlinear. Golyandina [6] used a case to demonstrate SSA combined with R for analysis, prediction and parameter estimation, proving that SSA is a powerful tool for analyzing and predicting time series. In this context, Zhang and Wang [36] combined SSA with support vector machine (SVM) and cuckoo search (CS) algorithm. Compared with other methods (SVM, CS-SVM, SSA-SVM, SARIMA and BPNN), they found that the model can significantly improve the accuracy of forecasting short-term power load. Liu [18] proposed a novel multistep wind speed prediction model by combining Variational Mode Decomposition (VMD), SSA, Extreme Learning Machine (ELM) and a forecasting model. The wind speed is decomposed into several sub-layers, and it is found that the prediction speed of the sub-layer is fast and the generalization performance is good. Based on the aforementioned findings, this study combines SSA method with intelligent optimization algorithm, and achieves remarkable results in the field of short-term power load forecasting, medical forecasting and weather forecasting. However, it is less used in landslide disaster forecasting and has application prospects.

In summary, this paper uses Jiuxianping wading rock landslide as an example to analyze the displacement of step-like landslides. Firstly, the deformation characteristics and failure modes of the landslide are analyzed for the monitoring data set. Then, the SSA algorithm is used to decompose the landslide monitoring displacement, remove the random signal and extract the trend displacement and periodic displacement. After the SSA algorithm was reconstructed, it was evaluated to prove that the trend term and periodic term are reasonable enough after they were randomly decomposed. For the evaluation of the trend term using power function polynomial fitting prediction, the periodic displacement is trained and tested by the improved GSA-PSO and SVR model. Finally, the trend term and the periodic displacement are accumulated and compared with the actual monitoring displacement. The mean absolute error (MAE), mean square error (MSE) and correlation coefficient (R) are used as evaluation indexes to prove the feasibility of the method. This idea provides a new intelligent algorithm for predicting landslide displacement and it can be used as a new reference for predicting and providing early warnings about the occurrence of landslides.

2 Research areas and data

2.1 Jiuxianping landslide and geological environment

Jiuxianping landslide is located on the left bank of the Yangtze River in the lower reaches of Yunyang County, Chongqing. It is found in the TGR area, 239 km away from the dam site. It is a typical wading rock landslide. The coordinates of the center point are 108° 46′ 57.35″ E, 30° 56′ 27.63″ N. The main sliding direction is 144°, the length is about 1200 m, the middle width is about 850 m, the average thickness is 40 m, the area is 144 × 104 m2 and the volume is about 5700 × 104 m3. The boundary is roughly fan-shaped, and can be seen in the UAV orthographic projection (Fig. 1).

Fig. 1
figure 1

Location of Jiuxianping landslide and UAV orthophoto

The bedrock orientation at the trailing edge of the Jiuxianping landslide is 137°∠24°, while the leading edge of the Jiangbian landslide exhibits an orientation of 326°∠9°. This region features an erosion-prone low mountain valley landform, with higher elevations in the northeast gradually declining towards the southwest. The ground elevation ranges from 134 to 587 m, with a relative height difference of 453 m. The angle between the flow direction of the Yangtze River and the strike of the underlying bedrock strata ranges from 40° to 60°, resulting in an oblique slope shape. Generally, the longitudinal terrain is characterized by gentle slopes or platform-like features. The leading edge of the Jiuxianping landslide extends 145 m below the water level of the Yangtze River, indicating a wading landslide. The annual RWL before dam fluctuations ranges between 145, 175 and 145 m, significantly impacting the stability of the landslide due to RWL changes.

2.2 Deformation characteristics and failure modes

According to the description from site, many cracks appeared during this period, mainly tensile cracks, followed by shear cracks. Chongqing Institute of Geology and Mineral Resources conducted a detailed investigation of the area, and observed surface cracks on the landslide, up to 1 m deep and 0.5 m wide (Fig. 2c), tensile cracks clearly visible on the cement floor of residential courtyard and Concrete pavement (Fig. 2a and b), shear cracks and other macroscopic cracks on the wall of the house (Fig. 2a and d), and the bare rock of the wading part, which shows bedding rock (Fig. 2e). For wading rock landslides, in addition to the sliding body, sliding zone, free deformation space, internal and external forces, the primary condition for sliding deformation is to have lateral boundary and trailing edge boundary conditions. Under the action of structure, creep or slip deformation, the trailing edge boundary condition of tension crack is formed. The front edge of landslide is free, and the trailing edge is bounded by tension crack groove and slope top.

Fig. 2
figure 2

Jiuxianping landslide deformation characteristics: a house deformation characteristics, b road deformation characteristics, c landslide trailing edge tensile cracks, d wall deformation characteristics, e landslide front edge bare rock

Based on the above investigation and analysis, the landslide failure mode is slip-bending, which mainly occurs in the bedding slope. When the dip angle of the rock layer in the bank slope is steeper than the slope of the bank itself, indicating a steep dip in the same direction, and the dip angle of the sliding surface exceeds the comprehensive friction angle of the surface, the overlying rock mass possesses the conditions necessary to slide along the sliding surface. However, the sliding surface is not free, preventing its sliding. Under certain conditions, there could be bending deformation of the middle and lower rock mass until there is buckling failure. Generally, it can be divided into three stages: bending stage, uplift stage and penetration stage. During the deformation process, a certain amount of elastic strain energy is stored in the bending part. Therefore, once the lower sliding body starts, the release of elastic strain energy can accelerate the additional thrust of the sliding body.

2.3 Monitoring scheme and result analysis

The conditioning factor of the landslide is the early tectonic movement, which leads to the development of sandstone joints during the formation of syncline. The weak surface (belt) and main sliding surface (belt) are produced in the contact surface of sand mudstone and mudstone strata. The new tectonic movement, crustal uplift, river incision, the leading edge of the free landslide along the sliding deformation provide deformation space. The triggering factors of landslides are rainstorm or extreme rainfall and RWL fluctuation. The primary factor that causes the instability of landslides is RWL fluctuation.

The landslides displacement in the TGR area have step-like characteristics, and the short-term deformation of landslides is affected by RWL and rainfall. In the stage of landslide mutation, the displacement–time curve of landslide deformation evolution stage has three obvious processes: initial deformation stage, constant deformation stage and accelerated deformation stage [22]. Therefore, it is particularly important to identify the stages of landslide displacement to warn against disaster. To understand landslide deformation in the area, Jiuxianping landslide monitoring network is arranged for real-time and accurate monitoring (Fig. 3). In this paper, the II–II′ section is taken as the research object (Fig. 4), and the monitoring data and geological data of the section are selected correspondingly.

Fig. 3
figure 3

Jiuxianping landslide monitoring network

Fig. 4
figure 4

II-II′ profile and displacement monitoring layout

To facilitate the study of step-like displacement in the main sliding direction, the landslide surface displacement data collected by YY208, YY209 and YY210 displacement monitoring stations are selected for research. From Fig. 5, the displacement of the three monitoring points showed step-like-change, the periodic law is obvious, and it is easy to study the relationship between displacement and rainfall and RWL.

Fig. 5
figure 5

RWL (m), Rainfall (mm) and Displacement (mm) of II-II′ landslide profile

The rainfall of Jiuxianping landslide is most vigorous in summer and autumn, and the regulation of RWL shows obvious periodicity (Fig. 5). Under the combined action of RWL drop and rainstorm, the displacement of landslide has obvious step-like phenomenon (Fig. 5). The rise of RWL leads to stable deformation of the landslide. The decrease of RWL has sudden change on the landslide, and there is an obvious lag effect. When RWL rises, the RWL between 160 and 175 m points to the hydrodynamic pressure inside the landslide, which enhances the stability of the landslide. When RWL drops, the RWL is between 160 and 145 m and produces hydrodynamic pressure pointing out of the slope and downward. This accelerates the deformation of the landslide. The time point at which mutation of the landslide displacement occurs each year is the low RWL point, and each may belong to the accelerated deformation stage (Fig. 6).

Fig. 6
figure 6

A, B, C and D detail of regional monitoring data

Rainfall is also one of the factors that lead to landslide displacement. September to October 2017 were a dense rainfall period during autumn; there was relatively heavy rainfall, and the RWL increased (the dynamic water pressure pointed to the landslide interior to stabilize the landslide). However, the monitoring point YY210 near the water end of the RWL has a large mutation displacement. The sudden displacement that occurred in November 2017 occurred during a period of high RWL and without heavy rainfall. This indicates a lag effect in rainfall, with a relatively long delay. Furthermore, it can be observed that the infiltration of rainfall (due to material properties and slow seepage) leads to much slower changes in pore water pressure compared to the decrease in the RWL (Fig. 5). Based on the above, the deformation of landslide is affected by RWL level and rainfall. RWL plays a major role in the displacement and deformation of landslide, but the influence of rainfall cannot be ignored. Based on this, RWL and rainfall are selected as influencing factors.

The triggering of landslides (Segments A, B, C and D) is induced by rainfall. The cumulative displacement of the transitional segment exhibits an overall convex deformation. Under the interaction of RWL and rainfall, low RWL correspond to convex deformation displacement, while heavy rainfall corresponds to concave deformation displacement (Fig. 6).

3 Method and model developing

3.1 SSA

The steps are as follows[8]:

  1. (1)

    Embed

Assume that the one-dimensional displacement data time series is X = (x1, x2,, xN), 1 < L < N. The positive integer L is the length of the sliding window, where sampling is equally spaced and the length is N. One-dimensional displacement data is mapped into a multi-dimensional trajectory matrix, which is a Hankel matrix and can be expressed:

$$X = \left[ {\begin{array}{*{20}l} {x_{1} } \hfill & {x_{2} } \hfill & {x_{3} } \hfill & \cdots \hfill & {x_{K} } \hfill \\ {x_{2} } \hfill & {x_{3} } \hfill & {x_{4} } \hfill & \cdots \hfill & {x_{K + 1} } \hfill \\ \vdots \hfill & \vdots \hfill & \vdots \hfill & {} \hfill & \vdots \hfill \\ {x_{L} } \hfill & {x_{L + 1} } \hfill & {x_{L + 2} } \hfill & \cdots \hfill & {x_{N} } \hfill \\ \end{array} } \right]$$
(1)

where L is the embedding dimension, andXi = [xi, xi+1,…, xi+L−1]. Each element of the matrix is equal diagonally.

  1. (2)

    Decomposition

Calculate XXT and obtain L eigenvalues λ1, λ2,…, λL; U1, U2,…, UL is the orthogonal eigenvectors corresponding to the eigenvalues. L = max(I, λi > 0) = R(A), then the following equation is obtained:

$$\left. {\begin{array}{*{20}l} {X \;= X_{1} + X_{2} + \cdots + X_{L} } \hfill \\ {X_{i} \,= \sqrt {\lambda_{i} } U_{i} V_{i}^{T} } \hfill \\ {V_{i} \,\;= {{X^{T} U_{i} } \mathord{\left/ {\vphantom {{X^{T} U_{i} } {\sqrt {\lambda_{i} } \left( {i = 1,2, \ldots ,L^{\prime } } \right)}}} \right. \kern-0pt} {\sqrt {\lambda_{i} } \left( {i = 1,2, \ldots ,L^{\prime } } \right)}}} \hfill \\ \end{array} } \right\}$$
(2)

where Xi is the elementary matrix, \(\sqrt {\lambda_{i} }\) is the singular value of matrix X, \(U_{i}\) is the empirical orthogonal function of matrix X, \(V_{i}\) is the main component of matrix X, and \(\sqrt {\lambda_{i} }\), \(U_{i}\) and \(V_{i}\) is the i th triple eigenvector of matrix X.

  1. (3)

    Grouping

The matrices contained in each group are added, as shown in the following:

$$\left. \begin{array}{ll}{X} \;= X_{{I_{1} }} + X_{{I_{2} }} + \cdots + X_{{I_{m} }} \\ X_{I} = X_{{i_{1} }} + X_{{i_{2} }} + \cdots + X_{{i_{p} }} \end{array} \right\}$$
(3)

where the contribution rate of Xi is \(\alpha = \lambda_{i} /\sum\limits_{i = 1}^{L} {\lambda_{i} } .\)

  1. (4)

    Reconfiguration

The reconstructed sequence can be obtained by the following:

$$\left. {\begin{array}{*{20}l} {y_{{rc_{k} }} = \frac{1}{k}\sum\limits_{m = 1}^{k} {y_{m,k - m + 1}^{*} } } \hfill & {1 \le k < L^{*} } \hfill \\ {y_{{rc_{k} }} = 1/L^{*} \sum\limits_{m = 1}^{{L^{*} }} {y_{m,k - m + 1}^{*} } } \hfill & {L^{*} \le k \le K^{*} } \hfill \\ {y_{{rc_{k} }} = \sum\limits_{{m = k - K^{*} + 1}}^{{N - K^{*} + 1}} {y_{m,k - m + 1}^{*} } /N - k + 1} \hfill & {K^{*} < k \le N} \hfill \\ \end{array} } \right\}$$
(4)

3.2 PSO-GSA

In GSA, the definition of the gravitational mass of the particle and the updating formula of the inertial mass are obtained from the fitness function in the following equation [5]:

$$\left. \begin{aligned} M_{i} (t) = & \frac{{m_{i} (t)}}{{\sum\limits_{j = 1}^{N} {m_{j} } (t)}} \\ m_{i} (t) = & \frac{{f_{i} t_{i} (t) - f_{{{\text{worst}}}} (t)}}{{ \, f_{{{\text{best}}}} \, (t) - f_{{{\text{worst}}}} (t)}} \\ \end{aligned} \right\}$$
(5)

where, \(f_{i} t_{i} (t)\) is the moderate function value of particle i at time t, and the MSE of SVR is taken as the fitness function; \(\, f_{{{\text{best}}}} \, (t)\) and \(f_{{{\text{worst}}}} (t)\) are the best and worst fitness function values of the group at time t. Gravity mass is generally expressed in units of \(M_{i} (t)\).

After using the PSO-GSA, the calculation formulas of the resultant force \(F_{i}^{k} (t)\), acceleration \(\alpha_{i}^{k} (t)\), position \(X_{i}^{k} (t)\) and velocity \(V_{i}^{k} (t)\) of the k-dimensional particle i are updated as follows [27]:

$$\left. {\begin{array}{*{20}l} {\quad F_{i}^{k} (t) = \sum\limits_{j = 1,j \ne i}^{N} \lambda F_{ij}^{k} (t)} \hfill \\ {\quad \alpha_{i}^{k} (t) = \frac{{F_{i}^{k} (t)}}{{M_{i} (t)}}} \hfill \\ \begin{gathered} V_{i} (t + 1) = \omega V_{i} (t) + c_{1} r_{1} ac_{i} (t) \hfill \\ \quad \quad \quad \quad + c_{2} r_{2} \left( {P_{{\text{best }}} - X_{i} (t)} \right) \hfill \\ \end{gathered} \hfill \\ {X_{i} (t + 1) = X_{i} (t) + V_{i} (t + 1)} \hfill \\ \end{array} } \right\}$$
(6)

where λ is a random number in [0, 1], β is a uniform random distribution in [0, 1], and k is the dimension space. c1 and c2 are constants in [0, 1], r1and r2 are random numbers between [0, 1], Pbest is the local optimal value and ω is the inertia weight.

3.3 SVR

The optimal regression function is given in the following equation [41]:

$$f(x) = \omega^{T} x + b$$
(7)

where \(\omega\) is the weight vector, b is the bias vector. The prediction data \(f(x_{i} )\) is obtained according to the input influencing factor xi. By introducing the relaxation factor, the objective optimization function of SVR is given in the following equation:

$$\left. {\begin{array}{*{20}l} {\min \frac{1}{2}\left\| w \right\|^{2} + C\sum\limits_{i = 1}^{m} {\left( {\xi_{i} + \xi_{i}^{*} } \right)} } \hfill \\ {{\text{s.t}}.\quad f\left( {x_{i} } \right) - y_{i} \le \varepsilon + \xi_{i} } \hfill \\ {y_{i} - f\left( {x_{i} } \right) \le \varepsilon + \xi_{i}^{*} } \hfill \\ {\xi_{i} \ge 0,\xi_{i}^{*} \ge 0,i = 1,2, \ldots ,m} \hfill \\ \end{array} } \right\}$$
(8)

where C is the penalty function, \(\varepsilon\) is the maximum regression error, \(\xi_{i}\) and \(\xi_{i}^{*}\) is the slack variable. By introducing a Gaussian radial basis (RBF) kernel function:\(K\left( {x_{i} ,x_{j} } \right) = \exp \left( {{{ - \left\| {x - x_{i} } \right\|^{2} } \mathord{\left/ {\vphantom {{ - \left\| {x - x_{i} } \right\|^{2} } {2g^{2} }}} \right. \kern-0pt} {2g^{2} }}} \right)\) and Lagrange multiplier \(\alpha_{i}^{*}\) and \(\alpha_{i}\), the prediction regression equation is written again in the following equation:

$$f(x) = \sum\limits_{i = 1}^{m} {\left( {\alpha_{i}^{*} - \alpha_{i} } \right)} K\left( {x_{i} ,x_{j} } \right) + b$$
(9)

3.4 Model developing and validation

The calculation flow chart of the SSA-PSO-GSA-SVR model is shown in Fig. 7. Each monitoring displacement in time series is composed of several factors. After removing the influencing random factors, Jiuxianping landslide displacement is decomposed into trend term and periodic displacement by SSA algorithm. Therefore, the cumulative displacement is expressed as follows:

$$s_{{\text{t}}} = u_{t} + v_{t}$$
(10)

where St is the cumulative displacement time series of landslide displacement, ut is the trend displacement series and vt is the periodic displacement series. The cumulative displacement of the landslide can be obtained by superimposing the trend displacement and the periodic displacement.

Fig. 7
figure 7

Flowchart of the prediction method

The trend term is always controlled by the internal evolution of the landslide and its own conditions, and the periodic term is induced rainfall and RWL. The trend term is fitted and predicted by multiple polynomials, and the periodic term is trained and predicted by PSO-GSA-SVR algorithm. Finally, the cumulative displacement is evaluated.

In this paper, mean absolute error (MAE), mean square error (MSE) and correlation coefficient (R) are used to evaluate the error of the model in the following:

$$\left. {\begin{array}{*{20}l} {{\text{MAE}} = {{\sum\limits_{i = 1}^{n} {\left( {x - x_{i} } \right)} } \mathord{\left/ {\vphantom {{\sum\limits_{i = 1}^{n} {\left( {x - x_{i} } \right)} } n}} \right. \kern-0pt} n}} \hfill \\ {{\text{MSE}} = {{\sum\limits_{i = 1}^{n} {\left( {x - x_{i} } \right)^{2} } } \mathord{\left/ {\vphantom {{\sum\limits_{i = 1}^{n} {\left( {x - x_{i} } \right)^{2} } } n}} \right. \kern-0pt} n}} \hfill \\ \end{array} } \right\}$$
(11)
$$R = \frac{{\sum\limits_{i = 1}^{N} {\left( {x_{i} - \overline{x}} \right)} \left( {\hat{x}_{i} - \overline{\hat{x}}} \right)}}{{\sqrt {\sum\limits_{i = 1}^{N} {\left( {x_{i} - \overline{x}} \right)^{2} } } \sqrt {\sum\limits_{i = 1}^{N} {\left( {\hat{x}_{i} - \overline{\hat{x}}} \right)^{2} } } }}$$
(12)

where x is the measured value, xi is the predicted value and n is the sample size. The lower the MAE and MSE, the higher the predictive intensive reading; the closer R is to 1, the stronger the correlation.

4 Results and discussion

4.1 Decomposition and reconstruction of trend and periodic terms

In order to simultaneously verify the feasibility of the proposed method, three monitoring points are selected. The time series data of YY208, YY209 and YY210 displacement monitoring points are decomposed by SSA. SSA is used to analyze the data for classification and dimension transformation. There are 3 monitoring point data (each monitoring point for 60 months) for trend term displacement from June 2016 to June 2021. Through the decomposition and reconstruction process, it was found that the optimal choice for L is 9, as it achieves the best results. This selection of L strikes a balance between capturing the trend and periodic components, making it the most ideal configuration. After the decomposition and reconstruction of the trend term and periodic term, it is the most ideal. When L selects 28, the reconstruction error is the smallest, but it is not conducive for the decomposition of reasonable trends and periodic terms, and needs to be debugged back and forth. The small L often causes the sequence trend term and the periodic term to be mixed together. Selecting a larger L can often minimize random components hidden in the original signal, and very large L causes the singular value decomposition time to increase [2]. The number of singular values is determined to be 5 by infinite approximation of Fourier transform iterative experiment, and the first 4 groups are selected for reconstruction. When they are reconstructed, the fifth group of data are messy and non-periodic and considered to be a random sequence. Thus, they are eliminated. Finally, the trend term and the periodic term are obtained (Fig. 8).

Fig. 8
figure 8

Trend and periodic terms after SSA decomposition results: a YY208, b YY209, c YY210

SSA removes random parts, decomposes them into trend term and periodic term and then performs cumulative comparisons (Fig. 9). The trend term and periodic term reconstructed are evaluated with the actual displacement time series, and the reconstruction sequence is evaluated by MAE, MSE and R. The evaluation indicators are shown in Table 1.

Fig. 9
figure 9

Comparison of time series of actual displacement and reconstructed displacement results: a YY208, b YY209, c Y210

Table 1 Evaluation of reconstructed sequence and actual displacement fitting results

4.2 Trend displacement prediction

From the SSA of Sect. 4.1, all monitoring points are selected for trend displacement extraction. The trend training and test are also programmed in pycharm, and then the polynomial expansion coefficient and power order are fitted and solved. Training data are taken from June 2016 to June 2020 (80%), and test data are taken from July 2020 to June 2021 (20%). At present, the trend term of landslide prediction is generally predicted by time step, because the trend term is affected by the internal potential factor, and the influencing factor is difficult to quantify. In previous landslide displacement predictions, most monitoring periods can last 5–10 years, and the monitoring interval is typically 1 month. That is, limited monitoring data is distributed over a long time span. In all the fitting methods, the decomposition of the sequence in Sect. 4.1 can be seen to increase the trend more stably. In extracting the trend term displacement using multiple fitting is better. Iterative approximation fitting helps to select the most reasonable power and number of items. The polynomial fitting is the best after repeated attempts. The polynomial coefficients are shown in Table 2, and the fitting formula is shown in Eq. (13). The extracted trend items are fitted and trained, and the prediction results are obtained by the test data (Fig. 10). MAE, MSE and R evaluation indicators were used for the evaluation, as shown in the following equation and Table 3.

$$f(x) = a_{0} + a_{i} \sum\limits_{1}^{i} {x^{i} }$$
(13)

where \(a_{0}\) and \(a_{i}\) is the expansion coefficient.

Table 2 Polynomial coefficient
Fig. 10
figure 10

The displacement–time curve of the three monitoring points trend prediction results: a YY208, b YY209, c YY210

Table 3 Evaluation of trend displacement prediction results

4.3 Periodic displacement prediction

4.3.1 Factor selection and evaluation

Rainfall: Rainfall is one of the triggering factors that leads to landslide deformation. During the infiltration-seepage process of rainfall, the stability of the landslide is destroyed by changing the soil pressure, matrix suction and hydrodynamic pressure of the soil. This results in the deformation of the landslide. On the other hand, infiltration of rainfall in soil causes chemical reaction, and soil–water reaction leads to the reaction of mudding, softening and chemical dissolution of hydrophilic substances. Finally, it leads to the change of internal cohesion and internal friction angle of soil, which indirectly changes the stress–strain state. This is because rainfall infiltration-seepage is a slow process, including runoff rainwater. Therefore, rainfall during this month, rainfall during the last 1-month and rainfall during the last 2-months are the rainfall factors selected for evaluation (Fig. 11a).

Fig. 11
figure 11

a Periodic displacement. b Relationship between rainfall and periodic displacement. c Relationship between RWL, RWLchange and periodic displacement. d Relationship between annual displacement, monthly displacement and periodic displacement

RWL: RWL is the major triggering factor that leads to landslide deformation. The fluctuation of RWL affects the distribution of underground hydrodynamic stress field, which affects the reduction of soil strength and the weakening of permeability. The rise of RWL leads to the dynamic water pressure pointing to the landslide, and the dynamic water pressure caused by the rapid decline of water level changes sharply. The landslide is most vulnerable to instability at this moment. The influence of RWL depends on its fluctuation. The step-like mutation displacement of the wading landslide in the TGR is affected by the decline of RWL every year. At this moment, RWL elevation, RWL change in the last 1-month and RWL change in the last 2-months are the RWL factors selected for evaluation (Fig. 11b).

Displacement increase: Due to the annual stepwise change of the landslide, it has periodic change. Displacement during the last 1-year and displacement during the last 1-month are the influencing shadows selected for evaluation (Fig. 11c).

In order to verify the rationality of factor selection, the degree of geometric similarity of curves composed of two or more sequences (sequences can be understood as factors or indicators in the system) is studied by using grey correlation degree. Table 4 shows the relationship between the displacement of the three monitoring points YY208, YY209, YY210 and the influencing factors. Resolution coefficient greater than 0.5 is considered to have a correlation, and that greater than 0.6 is considered to have a close relationship. It indicates that the selected factors are reasonable enough. Using grey correlation analysis (YY208, YY209 and YY210), the spatial location plays a significant role in the correlation with rainfall, RWL and displacement. Specifically, for the rainfall category, the highest correlation was observed with the rainfall from the previous month. In the case of RWL category, it was associated with the elevation of the RWL. As for the displacement category, it was primarily correlated with the displacement from the previous month. However, the magnitude of correlation varied depending on the spatial location.

Table 4 Relationship between periodic displacement and eight influence factors

4.3.2 Parameter optimization and displacement prediction

As a single optimization algorithm cannot solve the optimization problem of high-dimensional search space, an improved GSA is proposed, which combines the GSA with PSO algorithm. By manipulating the boundary and activating the stagnated particles in the new search space, the particles jump out of the local area to find the optimal solution. The training data are taken from June 2016 to June 2020 (80%), and the test data are taken from July 2020 to June 2021 (20%) (Fig. 11a). In this paper, the improved GSA-PSO-SVR model is used to develop the periodic displacement model. The influencing factor and the sample data are read together, the parameters are selected according to the fitness in the iterative process, and SVR is selected to train and test the model. MSE is used as the fitness function return value of SVR. The specific settings are as follows.

  1. (1)

    Optimal settings for PSO-GSA algorithm: the particle swarm population is 20, the maximum iteration is 100, the gravitational constant is 3, the reference function is [1, 10] learning factors C1 = 0.5, C2 = 0.5 and the optimal penalty function is found in Table 5

  2. (2)

    Optimal settings for PSO algorithm: the number of population is 20, the maximum iteration is 100, inertia weight W = 1, learning factor C1 = 0.5, C2 = 0.5, the optimal penalty function is found in Table 5

  3. (3)

    Optimal settings for GSA algorithm: the number of population is 20, the maximum iteration is 100, the gravitational constant is 3, the optimal penalty factor and kernel function parameters are shown in Table 5. The optimal penalty factor C and kernel function g are used to train and predict the samples

Table 5 Optimal SVR model parameters

Compared with the prediction results of PSO, GSA and PSO-GSA (Fig. 12), the accuracy and error of each model are shown in Table 6. In terms of the goodness of fit, employing a hybrid optimized SVR approach resulted in an average improvement of 5% in the R. Additionally, both MAE and MSE were reduced to some extent. This indicates that by sufficiently optimizing the model, it is possible to enhance its predictive performance. Further analysis shows that the PSO-GSA-SVR prediction model works best as the final periodic displacement prediction model.

Fig. 12
figure 12

Comparison of actual–predicted periodic displacement by different optimized models: a YY208, b YY209, c YY210

Table 6 Evaluation and error comparison of all optimized SVR models

4.4 Cumulative displacement prediction and evaluation

Based on the time cumulative sequence observed in Eq. (10), the total predicted displacement is to add the trend term and the periodic term using PSO-GSA-SVR model. From Fig. 10 and 12, the corresponding predicted displacements of the three monitoring points from June 2016 to June 2021 can be obtained, and the predicted displacements are compared with the actual monitoring displacement values (Fig. 13). According to the error calculation and accuracy evaluation, it is concluded that the change trend is consistent with the actual situation. MSE and MAE are smaller, and the correlation coefficient R is relatively larger (in Table 7). It has engineering application prospects and predicts short-term displacement changes.

Fig. 13
figure 13

Comparison of predicted–actual total displacement of each monitoring point: a YY208, b YY209, c YY210

Table 7 cumulative displacement and actual monitoring displacement accuracy and error

5 Conclusion

This paper proposed the hybrid optimization algorithm-based displacement prediction model. Taking Jiuxianping wading rock landslide with obvious step-like displacement as an example, three monitoring displacement points are selected for modeling and validation. The major contributions of this study are as follows:

  1. (1)

    The periodicity of RWL affects the step-like distribution of landslide displacement, and the rise of RWL makes the landslide displacement to change steadily. Under the combined action of low RWL and rainstorm, the landslide displacement shows obvious step-like characteristic. RWL is the major factor that induced the landslide deformation. RWL below 160 m is prone to cause landslide disaster, followed by rainfall, which accelerates landslide deformation tendency

  2. (2)

    Due to the influence of SSA in short-term power load and medical results, the introduction of SSA is applicable. SSA in time series decomposition is model-free, parameter-free and does not require stationary time series. It has a good performance in decomposing the nonlinear step-like landslide displacement. The reconstructed sequence error is small after removing the random term. The MSE and MAE of the prediction model of the decomposed periodic term are small, and the correlation coefficient is high

  3. (3)

    The selected three monitoring points YY208, YY209 and YY210 have obvious step-like state, and the small sample data are collected once a month. Using the hybrid optimization coupled SVR model, the predicted total displacement of the three monitoring points has small MSE and MAE. It shows that the improved PSO-GSA has good optimization characteristics and can activate the stagnated particles to jump out of the local area. It also shows that SVR has enough advantages in solving the problem of small sample and high dimension nonlinear regression

As the displacement of Jiuxianping landslide is analyzed once in a month as well as its sample size, the improved GSA-PSO-SVR model is applied to predict the displacement of Jiuxianping landslide in the TGR area. It is reliable and has great engineering referencing significance.