Introduction

Landslide or slope failure may happen even it is designed with high safety factors because geomaterials are mostly dominated by uncertainties (He et al. 2010; Johari and Lari 2016). Hence, the performance of landslides determined by a factor of safety cannot characterize uncertainties explicitly and sufficiently (Pourghasemi et al. 2014; Wang et al. 2016) and may sometimes be inclined to yield misleading results (Duncan and Buchignani 1973). Fortunately, reliability analyses, such as the first-order second-moment method (FOSM) (Low 2013), Monte Carlo simulations (MC) (Wang et al. 2010; Wang 2012) and response surface methodology(RSM) (Li et al. 2015), were applied to quantify these uncertainties in recent years (Phoon and Kulhawy 1999; Phoon 2008; Li et al. 2009, 2011; Ching and Phoon 2012; Zhang et al.2012), which emerge as a more reasonable and rigorous way to handle uncertainties (Ji et al. 2012; Tang et al. 2013; Hicks et al. 2014).

RSM is used to determine how a response is affected by a set of quantitative factors over some specified region. This information can be used to optimize the settings of a process to give a maximum or minimum response. For a given number of variables, response surface analysis techniques require more trials than the two-level fractional factorial design techniques. In order to calculate simply, most cases do not include cross-terms and second-order terms in response function. Although this greatly simplifies the number of sampling points, it may not be appropriate in all cases (Box and Draper 2006). The results obtained from a first-order response surface may be local optimal solutions (Myers and Raymond 2012). Hence, it is necessary to find a globe optimal solution by using a high-order response surface.

In the conventional RSM, the first-order response surface cannot fit well the curvature of a true underlying surface, solutions may be wrong and the selection of the grid of experimental design points has no precise guidelines or theory in RSM (Guan and Melchers 2001). In SOED, the cross-correlations of uncertain variables are assumed to be 0, and the orthogonality of design matrix containing high-order terms is satisfied by changing the length of star points so that the variance of the predicated values and target values is minimized. In order to overcome the shortcomings of the conventional RSM, RSM is combined with SOED, which is called the SOED-based RSM. In the SOED-based RSM, not only the response surface is more precise, but also the significance of regression coefficients is easy to test because the sum of error between the fitting equation and the regression equation is equal to 0. Moreover, the orthogonality of the first-order terms, cross-terms and quadratic terms in design matrix is satisfied. The design matrix obtained by the proposed method can contribute to the selection of sampling points. Example of reliability analysis of a large-scale landslide is taken to prove the validity and capability of the proposed method. Firstly, the values of uncertain variables, such as the friction angle and cohesion, are transformed from encoded variables in SOED design matrix. Then, the response value, the factor of safety (FS), is calculated by 3D rigorous limit equilibrium method. Finally, the reliability index can be obtained by MATLAB optimization toolbox.

Construction of second-order orthogonal experimental design (SOED)

SOED is based on orthogonal Latin squares and group theory (Hall 1959; Johnson and Johnson 1991). The main characteristic of SOED is that the design matrix is diagonal; meanwhile, the sum of error between the theoretical response values by fitting equation and the actual response values is equal to 0.

Sampling points

The sampling points in an orthogonal experimental design are consisted of corner points, star points and mean points. The corner points are located at the corner of distribution of experimental points, the star points are located in the axis and the mean points are located at origin, as shown in Figs. 1 and 2. The length from the star points to the mean points is γ, which is introduced in the next section. It can be obviously found from Figs. 1 and 2 that the run by mean points is 1, the run by star points is 2m (m is the total number of variables) and the run by corner points is 2m. Hence, the total number of SOED experimental times is:

$$ N = 2^{m} + 2m + 1 $$
(1)
Fig. 1
figure 1

Distribution of design points in SOED with two variables

Fig. 2
figure 2

Distribution of design points of the SOED with three variables

It should be pointed out that the corner points can be taken by factorial design (2k design), which requires a large number of design points, as listed in Table 1 (four variables). In general, a 2pth fraction of a 2k design consists of 2kp points from a full 2k design (p < k). It is assumed that the number of runs is m c, for a whole fraction design, m c is 2m, a one-half fraction design m c is 2m−1 and a one-fourth fraction design m c is 2m−2.

Table 1 Encoded design variables (or matrix) by factorial design with four variables

It can be found from Table 1 that \( \sum\nolimits_{j = 1}^{N} {z_{j} } = 0 \) and \( \sum\nolimits_{i \ne j} {z_{i} z_{j} } = 0 \), i.e., the matrix is an orthogonal one. However, the orthogonality of the matrix cannot be satisfied when the second-order terms are considered. For instance, \( \sum\nolimits_{i \ne j} {z_{i}^{2} z_{j} } \ne 0 \), or \( \sum\nolimits_{i \ne j} {z_{i} z_{j}^{2} } \ne 0 \).

The length of star points

In order to construct a second-order orthogonal matrix, the length of star points γ should be carefully taken. Firstly, the quadratic terms should be centralized, and it can be written in the following form:

$$ z^{\prime}_{ij} = z_{ij}^{2} - \frac{1}{N}\sum\limits_{i = 1}^{N} {z_{ij}^{2} } = z_{ij}^{2} - {{\left( {m_{c} + 2\gamma^{2} } \right)} \mathord{\left/ {\vphantom {{\left( {m_{c} + 2\gamma^{2} } \right)} N}} \right. \kern-0pt} N} $$
(2)

where \( z^{\prime}_{ij} \) is the centralized quadratic terms, \( z_{ij} \) is the first-order terms and N is the total run of experimental design. Since any two second-order terms should satisfy the orthogonality, the following expression can be obtained:

$$ \sum\limits_{i = 1}^{N} {z^{\prime}_{i} z^{\prime}_{j} } = 0 $$
(3)

Substituting (2) into (3), we have

$$ m_{c} - {{(m_{c} + 2\gamma^{2} )^{2} } \mathord{\left/ {\vphantom {{(m_{c} + 2\gamma^{2} )^{2} } N}} \right. \kern-0pt} N} = - \frac{4}{N}\left[ {\gamma^{4} + m_{c} \gamma^{2} - \frac{1}{2}m_{c} \left( {m + \frac{1}{2}m_{0} } \right)} \right] = 0 $$
(4)

where m 0 is the run by the mean points, and it is equal to 1 in most case. Simplifying Eq. (4), the length of star points γ in SOED can be obtained as:

$$ \gamma = \sqrt {\frac{{\sqrt {(m_{c} + 2m + m_{0} )m_{c} } - m_{c} }}{2}} $$
(5)

Consider m = 4 and the one-half fraction of a 2k design is chosen, then m c is 24−1 = 8. From Eq. (1), N is 17. From (5), γ is 1.353. The design matrix with four variables is listed in Table 2.

Table 2 Design matrix in SOED with four variables (m = 4)

SOED-based RSM

A second-order RSM can be written as:

$$ \hat{y} = \alpha + \sum\limits_{i = 1}^{m} {\beta_{i} z_{i} } + \sum\limits_{j < i} {\beta_{ij} z_{i} z_{j} } + \sum\limits_{i = 1}^{m} {\beta_{ii} z_{i}^{2} } + \varepsilon ,\quad j = 1,2, \ldots n - 1(j \ne i) $$
(6)

where α is the constant terms,{β i },{β ij } and {β ii } are, respectively, the regression coefficient of first-order terms, cross-terms and second-order terms, ɛ is the error, z i and z j are the uncertain variables.

It can be found from Eq. (6) that there is (m + 1)(m + 2)/2 unknowns. Therefore, the total number of experiments should be larger than (m + 1)(m + 2)/2.

In the conventional RSM, the response surface is approximated using a conventional regression technique. Therefore, the selection of appropriate sampling points is important. Unfortunately, the selection of the grid of experimental design points has no precise guidelines or theory (Guan and Melchers 2001). In order to calculate simply, Eq. (6), which does not include cross-terms and quadratic terms, is always used as the approximation. This greatly simplifies the number of sampling points, but it may not be appropriate in all cases. However, the high order is the true one, which was also mentioned by Myers and Raymond (2012). Fortunately, the proposed SOED-based RSM takes the second-order terms into consideration, and it can also provide a guideline for sampling strategy in RSM.

The procedure of SOED is the same as the conventional RSM; the sampling points (input variables) should firstly be selected, e.g., the stability of landslide is influenced by the parameters, such as friction angle and cohesion. Then, the values of input variables should be determined. Next, the experiment table is selected, from which the response variables are obtained. Finally, a polynomial function can be determined.

Reliability index β RI for SOED

The reliability index is the shortest distance from the limit state function to the origin of the transformed space of random variables. It can be defined by (US Army Corps of Engineers 1995)

$$ \left. \begin{aligned} &\hbox{min} \beta_{{\text{RI}}} = \left\| {\mathbf{Y}} \right\| = \sqrt {{\mathbf{Y}}^{{\mathbf{T}}} {\mathbf{Y}}} = \sqrt {\sum\limits_{j = 1}^{n} {\left[ {\left( {x_{j} - \mu_{j} } \right)/\sigma_{j} } \right]^{2} } } , \\ & s.t. \, \left\{ {\begin{array}{*{20}l} {G^{\prime}\left( x \right) = 0} \\ {x_{\text{min} } \le x_{j} \le x_{\text{max} } {\kern 1pt} {\kern 1pt} } \\ \end{array} } \right.j = 1,2, \ldots n \\ \end{aligned} \right\} $$
(7)

where x j is the random variable, μ and σ are, respectively, the vector of mean values and standard deviation of random variables, Y is a vector of normalized (transformed) variables and G’(x) is limit state function.

Solving the minimum β RI ellipse, which is tangent to the limit state surface, can be iteratively performed using optimization tools, such as Excel Solver or MATLAB optimization toolbox. Once the reliability index β RI is determined, the probability of failure can be found. The reliability index β RI, the corresponding probability of failure P f and an auxiliary terminology regarding expected performance points are listed in Table 3.

Table 3 Target reliability indices (US Army Corps of Engineers 1995)

Case study: a large-scale landslide

In order to demonstrate the efficiency and accuracy of the proposed method, a practical landslide, Xiangjiashan landslide, is taken as an example, and the arbitrary slip surface for this landslide is shown in Fig. 3.

Fig. 3
figure 3

Photograph of the reinforced Xiangjiashan landslide

Xiangjiashan landside is located at Chongqing in China. It is observed from Fig. 3 that bird’s-eye view of the landslide is an irregular horseshoe shape with the width of 200 ~ 360 m and longitudinal length of 230 m. The slope angle of the landslide is about 70° ~ 80°. The area of the sliding region is approximately 70,000 m2, and the volume of the sliding body is estimated to be 1,400,000 m3. Therefore, Xiangjiashan landslide is a large-scale one (Zhou and Cheng 2015). The landslide was reinforced with the high safety factor during excavation since 1998, but great displacements occurred on June 1, 2004. Therefore, it is indicated that a landslide reinforced with high FS cannot be considered as ‘safe’ one. To study this landslide, the probabilistic way is more reasonable.

According to the geological exploration of Xiangjiashan landslide, there exist two sliding bodies (upper sliding body and lower sliding body), which are observed in Figs. 3 and 4. The upper sliding body contains sandstone, weathered mudstone, and so on. The lower sliding body mainly contains sandstone and weathered mudstone (Zhou and Cheng 2015). It is assumed that unit weight, cohesion and friction angle of geomaterials are in normal distributions, and they are assumed to be uncorrelated. The mechanical parameters are listed in Table 4.

Fig. 4
figure 4

The 3-3′ geological cross section of Xiangjiashan landslides (Zhou and Cheng 2015)

Table 4 Mechanical parameters of the Xiangjiashan landslide

The upper sliding mass

There are three uncertain variables in this study, i.e., from Eq. (1), N is 15 when m is 3. From Eq. (5), the length of star points γ is 1.215. In Table 5, the uncertain variables are changed from encoded variables in SOED. For instance, ‘0’ listed in the column z 1 is corresponding to mean value μ (‘24.92’) of uncertain variable η, ‘± 1’ represents ‘μ ± σ’, ‘± 1.215’ represents ‘μ ± 1.215σ’, etc.

Table 5 Design matrix in SOED with three variables (m = 3)

Sets of response value, FS, are obtained by uncertain variables in Table 5 using 3D rigorous limit equilibrium method. 3D rigorous limit equilibrium method is a more reasonable and rigorous way to calculate FS because six equilibrium equations are satisfied, while parts of equilibrium equations are satisfied in simplified Bishop method and Janbu method. For simplicity, 3D rigorous limit equilibrium method is illustrated in “Appendix.”

From Eq. (2), we have \( z^{\prime}_{1} = \left( {\frac{\eta - 24.92}{2}} \right)^{2} - 0.73 \), \( z^{\prime}_{2} = \left( {\frac{c - 55}{5}} \right)^{2} - 0.73 \), \( z^{\prime}_{3} = \left( {\frac{\varphi - 26}{2}} \right)^{2} - 0.73 \).

From Eq. (6), the second-order regression equation can be obtained as follows:

$$ \begin{aligned} \hat{y}_{\text{up - soed}} = & 1.1143 - 0.0216z_{1} - 0.0114z_{2} + 0.0476z_{3} + 0.002z_{1} z_{2} - 0.0375z_{1} z_{3} - 0.0375z_{2} z_{3} \\ & \quad + \,0.0024z^{\prime}_{1} + 0.001z^{\prime}_{2} + 0.002z^{\prime}_{3} \\ \end{aligned} $$
(8)

where \( z_{1} { = }\frac{{x_{1} - \mu_{\eta } }}{{\sigma_{\eta } }} = \frac{\eta - 24.92}{2} \), \( z_{2} { = }\frac{c - 55}{5} \),\( z_{3} { = }\frac{\varphi - 26}{2} \).

The encoded variables in Eq. (8) should be translated into ‘natural’ uncertain variables; a translated equation can be written as:

$$ \begin{aligned} \hat{y}_{{{\text{up}} - {\text{soed}}}} = & - 10.215 + 0.0006\eta^{2} + 0.00004c^{2} + \eta (0.192 + 0.0002c - 0.0094\varphi ) \\ & \quad + \,c(0.0858 - 0.00375\varphi ) + (0.4377 + 0.0005\varphi )\varphi \\ \end{aligned} $$
(9)

The performance function with respect to a limiting FS(FS min = 1) is:

$$ G^{\prime}\left( x \right)_{\text{up - soed}} = \hat{y}_{\text{up - soed}} - FS_{\hbox{min} }^{{}} = \hat{y}_{\text{up - soed}} - 1 $$
(10)

In order to make a comparison, the regression equation is also determined by the conventional RSM, in which the design matrix is based on central composite designs. FS is listed in Table 6. It can be noticed that the products by z11′ and z22′ are not equal to 0, i.e., the design matrix by a second-order forms does not satisfy the orthogonality in the conventional RSM.

Table 6 Design matrix in the conventional RSM with three variables (m = 3)

The regression equation by the conventional RSM is:

$$ \begin{aligned} \hat{y}_{\text{up - RSM}} = & 1.115 - 0.0217z_{1} - 0.0425z_{2} + 0.0530z_{3} + 0.002z_{1} z_{2} - 0.0375z_{1} z_{3} - 0.0375z_{2} z_{3} \\ & \quad + \,0.0018z^{\prime}_{11} + 0.0006z^{\prime}_{22} + 0.0013z^{\prime}_{33} \\ \end{aligned} $$
(11)

A translated equation can be written as:

$$ \begin{aligned} \hat{y}_{\text{up - RSM}} = & - 9.448 + 0.00045\eta^{2} + 0.000026c^{2} + \eta (0.1995 + 0.0002c - 0.0094\varphi ) \\ & \quad + \,c(0.08112 - 0.00375\varphi ) + (0.4494 + 0.0003\varphi )\varphi \\ \end{aligned} $$
(12)

Similarly, the performance function with respect to a limiting FS(FS min = 1) is:

$$ G^{\prime}\left( x \right)_{\text{up - RSM}} = \hat{y}_{\text{up - RSM}} - 1 $$
(13)

For comparison, the errors between FS obtained by 3D rigorous limit equilibrium method and the proposed method are plotted in Fig. 5, and the errors between FS obtained by 3D rigorous limit equilibrium method and the conventional RSM are plotted in Fig. 6. It can be found that the standard deviation of errors in Fig. 5 is less than that in Fig. 6. Meanwhile, the mean value of errors for the proposed method is equal to 0. Such a phenomenon appears because the variance of the error is minimized in the proposed method.

Fig. 5
figure 5

Errors between FS obtained by 3D rigorous limit equilibrium method and SOED-based RSM

Fig. 6
figure 6

Errors between FS obtained by 3D rigorous limit equilibrium method and the conventional RSM

For SOED-based RSM, the reliability index β RI and the probability of failure of the upper sliding mass can be obtained by Eqs. (7) and (10) with the help of MATLAB optimization toolbox. Similarly, for the conventional RSM, the reliability index β RI and the probability of failure of the upper sliding mass can be determined by Eqs. (10) and (13). The results are listed in Table 7. It can be found from Table 7 that the probability of failure of the upper sliding mass by the proposed method is in good agreement with that by Monte Carlo simulations (MC), which is larger than that by the conventional RSM. The main reason of such a phenomenon is that the high-order terms in the conventional RSM do not satisfy the orthogonality. That is to say, the second-order response surface for SOED-based RSM is more rigorous than that for the conventional RSM because the orthogonality of variables in high-order design matrix is ignored in the conventional RSM.

Table 7 Comparison of β RI and the probability of failure P f using the different methods

The lower sliding mass

Similarly, the response values of FS of the lower sliding mass are calculated using 3D rigorous limit equilibrium method, and uncertain variables are also transformed from Table 5. The regression equation by the proposed method can be obtained as follows:

$$ \hat{y}_{\text{low SOED}} =\, 1.6550 - 0.0681z_{1} - 0.0327z_{2} + 0.0737z_{3} + 0.0056z_{1} z_{2} + 0.0055z_{1}^{2} + 0.0012z_{3}^{2} $$
(14)

The translated equation for the lower sliding mass can be written as:

$$ \hat{y}_{\text{up - RSM}} =\, 30848 + \eta ( - 0.1055 + 0.0014\eta + 0.00056c) - 0.205c + (0.0219 + 0.0003\varphi )\varphi $$
(15)

FS obtained by 3D rigorous limit equilibrium method is compared with those obtained by SOED-based RSM, as shown in Figs. 7 and 8. It can be found from Figs. 7 and 8 that FS obtained by the proposed approach is in good agreement with that obtained by 3D rigorous limit equilibrium method. In order to prove the accuracy of SOED-based RSM, a well-accepted simulation method, Monte Carlo simulations, is employed herein.

Fig. 7
figure 7

FS obtained by 3D rigorous limit equilibrium method and SOED-based RSM

Fig. 8
figure 8

Errors between FS obtained by 3D rigorous limit equilibrium method and the proposed method

The reliability index β RI obtained by SOED-based RSM with the help of MATLAB optimization toolbox is 2.62, and the probability of failure of the lower sliding mass is 4.4%. The probability of failure obtained by Monte Carlo simulations with 1012 sampling points is 4.2%. The probability of failure obtained by SOED-based RSM is in good agreement with that obtained by Monte Carlo simulations. It can be found from Table 3 that the lower sliding mass is between ‘unsatisfactory’ and ‘poor,’ which means the landslide may fail. That is to say, it should be reinforced as soon as possible. In fact, Xiangjiashan landslide was reinforced.

Summary and conclusions

Rocks or soils in landslides are highly variable natural materials. In order to deal with uncertainties, reliability analysis approach, which is a reasonable and rigorous way, is applied in landslides or the other engineering. A SOED-based RSM is proposed to study the stability of landslides. Firstly, a second-order design matrix is built by changing the length of star points. Secondly, the unit weight, cohesion and friction angle of geomaterials are considered as random variables, and their values are transformed from the encoded variables in the proposed method. Thirdly, the response value, the safety factor, is calculated by 3D rigorous limit equilibrium method. Finally, an example is taken to verify the validity of the proposed approach. Compared with the conventional RSM, it can be found that the results obtained by the proposed method are more accurate. Therefore, the proposed method provides a new idea to analyze the stability of landslides.