1 Introduction

The utilization of high-speed railway tunnels is increasing in China to meet the growing infrastructure needs of residents. By the end of 2015, the total length of high-speed railway tunnels opened to traffic in China was approximately 3200 km, and the number of high-speed railway tunnels exceeded 2200. China has the most high-speed railway tunnels in the world (Zhao et al. 2017a, b). However, high-speed railway tunnel accidents occur all the time, so the safety of tunnel construction has become the top priority of high-speed railway tunnel engineering. Surrounding rock stability is important to guarantee safe and efficient construction. However, strict monitoring and accurate prediction of surrounding rock deformation (Zhu and Su 2015) should be performed in any project.

There are various studies on tunnel displacement (Sun et al. 2018; Kong et al. 2019), and Li proposed a three-dimensional analytical model of blowout for a shallow tunnel in sand (Li et al. 2019). Fu used the modified Loganathan’s formula, and a displacement field distribution prediction model was established (Fu et al. 2017). Wan proposed a new analytical model of loosening earth pressure in front of tunnel face (Wan et al. 2019). Zhang described the key influences of yaw excavation loadings on ground displacement and segmental stress for a curved shield tunnel (Zhang et al. 2020). At present, a variety of related research methods, mainly based on support vector machine (SVM), gray system, neural network and other theories, are applied to establish surrounding rock deformation prediction models. SVMs have the advantage of utilizing small sample sizes and nonlinear and high-dimensional pattern recognition. Yao et al. (2010) proposed a multistep advance prediction model for the displacement of rock surrounding a tunnel based on SVM. Hu et al. (2016) established a SVM model based on time series for the prediction of the deformation of rock surrounding a tunnel. However, the classic SVM algorithm gives only the binary classification, while in practical applications, solving the problem of multiclass classification is generally necessary. The gray system (GM) is suitable for situations in which few scattered samples are available and the data quality is poor. Lu adopted mathematical methods to improve the gray model and applied the model to the prediction of the deformation of rocks surrounding tunnels (Lu 2017). Neural networks are robust, and such predictive models can approach nonlinear problems. Li established a particle swarm optimization BP (PSO-BP) neural network by debugging and improving the prediction of surrounding rock deformation under complex geological conditions (Li et al. 2017). However, the neural network prediction results mainly depend on the accuracy of the samples. In addition, studies have been based on other methods (Weng et al. 2015; Zhang et al. 2019). Qiang proposed a multiscale nuclear limit learning machine prediction model for the deformation of rocks surrounding tunnels (Qiang et al. 2017). At present, many scholars have proposed a variety of surrounding rock deformation prediction methods; considering the diversity of high-speed railway tunnel construction and the complexity of geological conditions, each method has its limitations. In addition, most of the current studies focus on the prediction of surrounding rock deformation, while the prediction of the grade of tunnel deformation is less common.

In view of this situation, this paper studies the actual project of the Zheng-Wan high-speed railway tunnel in China. First, the deformation grade of the rock surrounding the tunnel is classified, and a standard of surrounding rock deformation grade suitable for high-speed railway tunnels is established. Then, the key factors affecting the deformation grade of the surrounding rock are analyzed, and the weight of each factor is obtained by using rough set theory. Finally, the model of surrounding rock deformation grade prediction is established by using cloud model theory and is applied to engineering practice, and some useful conclusions are obtained.

2 Project Summary

2.1 Engineering Cases

The Zheng-Wan high-speed railway is an important part of the Zhengzhou-Chongqing high-speed railway. In this paper, 11 tunnels in section zwzq-6 (D1K461 + 845.429–DK497 + 658.55) of the Zhengzhou-Wan high-speed railway are studied, in addition to 1 horizontal hole in Huang Jia Ditch and 1 inclined shaft in the Hou Ping tunnel. The studied tunnels are mainly distributed in Xiang Yang city, Nan Zhang County and Bao Kang County, as shown in Fig. 1.

Fig. 1
figure 1

Zheng-Wan high-speed railway route map

2.2 Geology

The study area contains the Huang Ling anticline, the Jindou-Anzi village reversed anticline and the Shenlongjia and Jing Men platform fold belt of the upper Yangtze continental fold belt, with complex regional geological structures, including the Tian Jia ditch reverse fault, Kong Jia Fan fault and Xiang Er mountain branch fault. The tunnel is constructed by a bench cut method and had a vault collapse as shown in Fig. 2. The geological conditions of Zheng-Wan high-speed railway tunnel are complicated; in order to ensure the efficient and safe construction, it is necessary to predict the deformation grade of surrounding rocks in front of the palm face.

Fig. 2
figure 2

Tunnel construction and vault collapse in the tunnel

3 Theoretical Method

3.1 Rough Set Theory

Rough set theory is a nonlinear mathematical method that was proposed by Polish scholar Pawlak (1982) and is used to process imprecise and incomplete information to discover hidden knowledge and reveal its potential rules (Zhang et al. 2011, 2009; Zhang et al. 2016a, b).

3.1.1 Decision Table and Dependence

If the decision table S = (U, A) is an information system, then C and D included in A are two attribute subsets, which are called the conditional attribute and decision attribute, respectively. If these attributes meet the conditions

$$U/C = \left\{ {X_{1} ,X_{2} , \ldots ,X_{n} } \right\}$$
(1)
$$U/D = \left\{ {Y_{1} ,Y_{2} , \ldots ,Y_{m} } \right\},$$
(2)

then the dependency of decision attribute D on conditional attribute C is defined as

$$K = \gamma_{{\text{C}}} \left( D \right) = \frac{1}{U}\mathop \sum \limits_{i = 1}^{m} \left| {\gamma_{{\text{C}}} \left( {D_{1} } \right)} \right|,\quad Y_{i} \in U/D$$
(3)

where |γC (D1)| is the sample compatible quantity. Clearly, 0 < K < 1, and the closer the K is to 1, the higher the dependence of decision attribute D on conditional attribute C.

3.1.2 Weight Calculation Method

The rough set theory determines the weight calculation method as follows:

$$\gamma_{{\text{C}}} \left( D \right) = \frac{1}{U}\mathop \sum \limits_{i = 1}^{m} \left| {\gamma_{{\text{C}}} \left( {D_{1} } \right)} \right|$$
(4)
$$\gamma_{c - ci} \left( D \right) = \frac{1}{\left| U \right|}\mathop \sum \limits_{i = 1}^{m} \left| {\gamma_{{c - c_{i} }} \left( {D_{i} } \right)} \right|$$
(5)
$$\omega_{i} = \frac{{\gamma_{{\text{c}}} \left( D \right) - \gamma_{{c - c_{i} }} \left( D \right)}}{{\mathop \sum \nolimits_{i = 1}^{m} \left[ {\gamma c\left( D \right) - \gamma_{{c - c_{i} }} \left( D \right)} \right]}}$$
(6)

where γC(D) and γc-ci (D) are the dependence of decision attribute D on conditional attribute C and the dependence of decision attribute D on conditional attribute C–Ci after the evaluation index i is deleted, respectively; |U| is the sample size; C (D1)| and c–ci)(Di)| are the number of compatible samples in the decision table and the sample size in the decision table after the evaluation index i is deleted, respectively; and \(w_{i}\) is the index weight.

3.1.3 Applicability of the Rough Set Method

In this paper, the rough set method is used to calculate the weights of the influencing factors. Commonly used weight analysis methods include the analytic hierarchy process (AHP) and entropy weight method. The AHP is based on an in-depth analysis of nature, influencing factors and internal relations of complex decision-making problems, using less quantitative information to make the process of decision-making mathematical. (Xu et al. 2011; Zhao et al. 2017a, b) Thus, a simple decision method is provided for complex decision problems with multiple objectives, multiple criteria or no structural characteristics. The entropy weight method is an objective weighting method (Gao et al. 2018). In the specific application process, the entropy weight method is based on the variation degree of each index, and the entropy weight of each indicator is calculated by information entropy. Then, the weight of each index is modified by the entropy weight to obtain a more objective index weight. The data in this paper are actual engineering samples from the Zheng-Wan high-speed railway. The AHP is suitable for models and methods of making decisions in complex systems that are difficult to fully quantify. The sample values are taken from different sections, so the changes in each corresponding influencing factor are not uniform, and sudden increases and decreases occur; the entropy weight method has some limitations for this kind of sample. The rough set method can determine rules for incomplete, uncertain and small sample data sets, and the weight of each influencing factor can be obtained through the analysis of attribute importance. Therefore, this paper appropriately adopts the rough set method to calculate the weights. The calculation results suggest that the largest weight corresponds to the structure surface behavior, followed by the groundwater content, which is in line with observations from actual engineering applications.

3.2 Cloud Model Theory

Cloud model theory was proposed by Li Deyi, an academician of the Chinese Academy of Engineering, and addresses the uncertain transformation model from a qualitative concept to a quantitative description (Li 2018; Zhang et al. 2016a, b).

3.2.1 Definition of Cloud

Definition of cloud: X = {x} is a common set, and X is called the domain. The fuzzy set A exists in the domain X, the random number μA(x) with stable tendency exists for any element x, and x is called the membership degree of A.

Definition of normal cloud: Let X be the theoretical domain and C be the qualitative concept of the theoretical domain X supposing that the quantitative value x ∈ X and x is a random implementation of the qualitative concept C; if x ~ N (Ex, En2) is met, then En ~ N (En, E2 e), where the determination of C is

$$\mu \left( x \right) = {\text{e}}^{{ - \frac{{\left( {x - E_{x} } \right)^{2} }}{{2E_{n}^{{^{{\prime}{2}} }} }}}}$$
(7)

The distribution over the domain U is called the normal cloud.

3.2.2 Normal Cloud Model

The normal cloud model is the most basic and applicable cloud model (Mou and Cai 2018; Wang et al. 2015, 2017, 2018). The digital characteristics of the normal cloud are mainly represented by the expectation Ex, entropy En and hyper-entropy He.

3.2.3 Forward Cloud Generator and Computing

Forward normal cloud generator: If I defines the point x in the domain X, the cloud generator can generate the certainty distribution that x belongs to the qualitative concept C, and the cloud generator at this time is called the forward cloud generator.

The forward cloud generator algorithm and calculation steps are as follows:

  1. 1.

    Calculate entropy En and hyper-entropy He according to the relevant data.

  2. 2.

    Based on the calculated entropy En and the hyper-entropy He, generate a normally distributed random number \(E_{n}^{^{\prime}} \sim N\left( {E_{n} ,E_{{\text{e}}}^{2} } \right)\).

  3. 3.

    Given the specific input value x and expected value Ex, use Eq. (1) to calculate the certainty:

    $${\upmu }\left( {\text{x}} \right) = e^{{ - \frac{{\left( {x - E_{x} } \right)^{2} }}{{2E_{n}^{{^{{\prime}{2}} }} }}}}$$
    (8)

Determine the characteristics of the forward cloud model: According to the definition of the normal cloud model, the evaluation object factor can be used to calculate the cloud digital characteristics of the standard of a certain level according to the following equations:

$$E_{x} = \frac{{B_{\min } + B_{\max } }}{2}$$
(9)
$$E_{ni} = \frac{{E_{{x_{i + 1} }} - E_{{x_{i} }} }}{3}$$
(10)
$$H_{{\text{e}}} = k$$
(11)

where \(k\) generally takes an empirical value and is 0.01 in this paper.

3.2.4 Normal Cloud Multi-index Comprehensive Probability Analysis

Normalize the certainty degree μ(x) of a single indicator at each level, and obtain a normalized certainty degree μ' (x). The comprehensive normal cloud certainty degree μk (x) for multiple indicators at each evaluation level can be determined by Eq. (12).

$$\mu_{k} \left( x \right) = \mathop \sum \limits_{j = }^{m} \mu^{^{\prime}} \left( x \right)w_{{\dot{J}}}$$
(12)

where wj is the weight of the jth indicator and satisfies 0 < wj ≤ 1, ∑m j = 1wj = 1.

3.3 Delphi Method

The Delphi method (Wang et al. 2010), also known as the expert evaluation method, is a subjective prediction method. It asks for and aggregates expert opinions in written form; the opinions of experts in the subject are collected and summarized and then shared back with them, in a method of clarifying and selecting the judgment given by an expert group on a problem for which accurate information is not readily available. The steps are as follows:

  1. 1.

    Determine the purpose of the investigation and select experts.

  2. 2.

    Design the evaluation consultation table.

  3. 3.

    Seek advice from experts and gather feedback.

  4. 4.

    Have the experts give a second judgment after reviewing the opinions.

  5. 5.

    Repeat steps (3) and (4) until broadly consistent results are achieved.

  6. 6.

    Obtain the results of data processing and weight determination.

4 Establishment of Rough Set and Cloud Model Prediction Model

4.1 Processes and Methods

4.1.1 Establishment Process

Based on the actual project of the Zheng-Wan high-speed railway tunnel in China, rough set theory is used to obtain the weight of each influencing factor. Finally, a model for predicting the deformation grade of surrounding rock is established based on the cloud model theory. The specific process is shown in Fig. 3.

Fig. 3
figure 3

Calculation flowchart

4.1.2 Method Applicability

First of all, the rough set theory is applicable to tunnel engineering because its mathematical foundation is mature and no prior knowledge is required. Then, rough set theory is easy to use and it can deal with the inaccuracy of data. Rough set theory can process qualitative and quantitative data. The original data of tunnel engineering are complex and diverse, so it is reasonable to adopt the rough set theory in this article. The cloud model theory reflects the uncertainty between qualitative and quantitative, especially the randomness and fuzziness. There are qualitative data in the original data collected in this article, and the cloud model can well realize the conversion between qualitative and quantitative. Therefore, it is reasonable to choose the cloud model theory in this article.

4.2 Classification of Surrounding Rock Deformation

According to the monitoring range of in situ vault settlement, the upper limit convergence and lower limit convergence were collected for the Zheng-Wan high-speed railway tunnel, and the degree of damage to the cavern caused by the deformation degree of the surrounding rock was observed. The classification criteria of this paper and the classification criteria of engineering specifications are different. This paper grades the surrounding rock deformation, not the surrounding rock. In addition, this article combines the single-factor method to grade the surrounding rock deformation according to the monitoring data of surrounding rock settlement and surrounding rock convergence of the Zheng-Wan high-speed railway tunnel. The monitoring values were divided into five ranges; the deformation degrees of surrounding rock corresponding to these five ranges were also used. Combined with the single-factor method, the surrounding rock deformation was divided into five grades: I, very little deformation; II, little deformation; III, moderate deformation; IV, major deformation; and V, intense deformation. The grade division of rock deformation around the high-speed railway tunnel and the range of the corresponding evaluation indexes are shown in Table 1.

Table 1 Ranges of the single-factor evaluation index grades

4.3 Calculation of Surrounding Rock Deformation Grade

These three deformation indexes are obtained from the long-term deformation monitoring data from 20 sections along the Zheng-Wan high-speed railway tunnel. In the evaluation index system, each corresponding index has a rank interval, and the grade interval of the three deformation indexes is divided according to engineering practice and related deformation; the results are shown in Table 2. Dimensionless processing of the original data is conducted by considering the magnitude range of each evaluation index grade to normalize the index between 0 and 1. Specifically, the ratio of the index value to the whole interval value is calculated, and the dimensionless results are shown in Table 3. According to the cloud model calculation, since the magnitude ranges of the three indexes are the same, the index evaluation model for the three evaluation indexes is shown in Fig. 4.

Table 2 Dimension ranges of the single-factor evaluation index grades (dimensionless)
Table 3 Tunnel deformation data samples from high-speed railway tunnels (dimensionless)
Fig. 4
figure 4

Index evaluation results of the cloud model

The surrounding rock deformation is classified into three classification indexes, and the weights of the three indexes on the stability of the surrounding rock are obtained by the Delphi method. In this paper, the results calculated by Xue are applied (Xue et al. 2018). The calculation results are shown in Table 4.

Table 4 Weight of each evaluation factor

Based on the normal cloud model, according to the sample data and Eqs. (1)–(3), entropy En and hyper-entropy He are calculated, with K equal to 0.01. Based on the calculated entropy En and the hyper-entropy He, a normally distributed random number E n ~ N (En, E2 e) is generated with the particular input value x and the expected value Ex; Eq. (12) is used to calculate certainty. The level with the highest determination is the deformation grade of the surrounding rock of the tunnel section. The results of the surrounding rock deformation grades obtained are shown in Table 5.

Table 5 Calculation results

4.4 Rough Sets are Used to Calculate Weights

4.4.1 Select Influencing Factors

The research object of this paper is a high-speed railway tunnel; according to the field investigation, part of the Zheng-Wan high-speed railway tunnel is affected by faults, and joints and fractures are extensively developed. Affected by groundwater, the self-stability of the surrounding rock is poor; therefore, the surrounding rock deformation is mainly affected by the surrounding rock strength and integrity and groundwater development. At the same time, considering the difficulty of data acquisition and the feasibility of practical engineering application (Panthee et al. 2016), when constructing the index evaluation system, more evaluation indexes are selected to reflect the above three aspects. The compressive strength of the rock reflects the general strength of the surrounding rock, the integrity coefficient and structure surface behavior of the rock reflect the integrity of the surrounding rock, and the groundwater content reflects the development of groundwater. Therefore, the selected evaluation indexes for the deformation of the rock surrounding the tunnel are the rock compressive strength, rock integrity coefficient, structure surface behavior and groundwater content.

  1. 1.

    Uniaxial compressive strength of rock: Rock is subjected to uniaxial compression until failure; the compressive strength of the rock is the load that a unit area of rock can withstand. The unit of compressive strength (Rc) is MPa.

  2. 2.

    Rock integrity coefficient: The rock integrity coefficient reflects the damage degree of the rock mass relative to intact rock and is used to determine an important indicator of rock mass quality. At present, the integrity of a rock mass is often measured by the integrity coefficient (Kv) of the rock mass under the domestic grading standards for surrounding rock.

  3. 3.

    The occurrence of geological discontinuity (S1): This index includes the roughness of the geological discontinuity and the continuity and morphology of the rock wall.

  4. 4.

    Groundwater content: This index reflects the development of groundwater (W) in the surrounding rock. The unit of W is (l·(min m)−1).

4.4.2 Constitute the Decision Table

Because the rough set program of MATLAB software can deal with only discrete data, the data obtained in the actual project should be discretized. According to the magnitude range or value of each conditional attribute, each single-factor index is classified, as shown in Table 6.

Table 6 Single-factor indicators

According to Table 6, the original data are discretized, taking the influence factors after discretization as conditional attributes, as shown in Table 7.

Table 7 Decision table of factors influencing high-speed railway tunnel deformation

4.4.3 Weight of Influencing Factors

According to the decision table of factors influencing tunnel deformation, the rough set program of MATLAB software is used to calculate the dependence, importance and weight of each influencing factor on the deformation grade of the rock surrounding the tunnel, as shown in Table 8.

Table 8 Dependence, importance and weight of each influencing factor

4.5 Model Establishment

Using the cloud model program of MATLAB software, the cloud model method is adopted to establish the prediction model for the surrounding rock deformation grade. The established model uses only the engineering data obtained from the actual project through advanced detection as input; that is, the four influencing factors can determine the prediction of the surrounding rock deformation grade. Ten other sections are selected to predict the deformation grade of the surrounding rock and compare the results with the actual situation.

According to the actual monitoring of the Zheng-Wan high-speed railway tunnel, the sample data from 10 tunnel sections are collected. In the evaluation system, each influencing factor or an evaluation index has a quantitative value range for each grade, as shown in Table 6. Dimensionless processing is carried out for the data shown in Table 6, and the results are given in Table 9. The sample data are nondimensionalized by considering the magnitude range of the single-factor evaluation index grade, as shown in Table 9, and the results are shown in Table 10. According to the cloud model theory, the index evaluation models of the four influencing factors are shown in Fig. 5.

Table 9 Scale ranges of single-factor evaluation indexes (dimensionless)
Table 10 Monitoring data samples from the high-speed railway tunnel (dimensionless)
Fig. 5
figure 5

a The cloud model of Rc. b The cloud model of Kv. c The cloud model of S1. d The cloud model of W

4.6 Model Inspection

Based on the cloud model theory and the weights calculated by the sample monitoring values and the rough set method, the determination degree of each grade in the 10 sections is calculated, and the predicted surrounding rock deformation grades of the 10 sections are obtained and compared with the actual surrounding rock deformation grade. The results are shown in Table 11.

Table 11 Comparison table of model test

According to the above prediction results, the predicted values of 8 out of the 10 sections are consistent with the actual values. No. 4 tunnel section and No. 6 tunnel section have made support in advance in order to ensure the stability of surrounding rocks, so the predicted results of the No. 4 tunnel section and No. 6 tunnel section are lower than the actual deformation grades, but the predicted values are close to the actual value. Therefore, this model is feasible for predicting the deformation grade of rock surrounding a high-speed railway tunnel, which also confirms the reliability of the weight of each influencing factor obtained above. In addition, the deformation anomaly of a tunnel section was discovered in time during the excavation process through on-site monitoring and measurement; if the deformation situation and surrounding rock deformation level can be predicted before excavation to provide an early warning, measures will be taken in advance; meanwhile, the stability of the tunnel will be better controlled. Such a prediction can also save time, improve efficiency and reduce cost. Therefore, the establishment of a prediction model for the surrounding rock deformation grade of high-speed railway tunnels has great engineering application value.

5 Discussion

5.1 Comparison of Cloud Model Theory and BP Neural Network Method

To verify the applicability of the cloud model theory, the sample data are taken as the training sample. The neural network method (Shi et al. 2016) and SVM method (Chen 2015) are used to teach this sample, and the prediction model for surrounding rock deformation grade is established and verified (Table 11). A comparison between the results of the two methods and the cloud model is shown in Table 12.

Table 12 Comparison of calculation results

The calculation results of the prediction model established by the BP neural network are different from the actual results in 4 tunnel sections. These differences show that the learning accuracy of the neural network is dependent on sample size and that small samples lead to some limitations in learning. The SVM method resulted in 7 tunnel sections that deviate from the actual results, and its prediction accuracy is only 30%, indicating that the computational accuracy of the SVM for multiclass classification problems needs to be improved. The cloud model exhibits better accuracy and higher engineering application value.

5.2 Discussion on Portability of Models

In this paper, based on the Zheng-Wan high-speed railway tunnel, the prediction model of surrounding rock deformation grade of high-speed railway tunnel is established. The model established in this paper is only applicable to Zheng-Wan high-speed railway, for the following reasons. First of all, this article selects the influence factors of surrounding rock deformation which are Zheng-Wan high-speed railway tunnel influential factors, these factors may not apply to other tunnels, and then, this article is based on Zheng-Wan settlement of high-speed railway tunnel surrounding rock and convergence data of surrounding rock deformation of surrounding rock classification, so the model for the other tunnel is not feasible. However, the methods, ideas and conclusions of this paper can provide some references for other tunnel projects.

6 Conclusions

  1. 1.

    This paper takes the Zheng-Wan high-speed railway tunnel as the research object. Based on rough set theory and cloud model theory, a deformation grade prediction model for high-speed railway tunnels is established. First, the tunnel deformation grade is classified and studied, and the deformation grade standard for the high-speed railway tunnel is established. The key factors affecting the deformation grade of the surrounding rock are analyzed, and the correlation between the deformation grade and influencing factors is established. Finally, cloud model theory is introduced to predict the deformation grades of tunnel sections. The application results show that the predictions of the model are consistent with the actual situation. This method has important value for engineering applications, can provide a certain reference for the stability analysis of surrounding rock and can guarantee the safe construction of a tunnel.

  2. 2.

    Rough set theory is adopted to determine the weight of each evaluation index, and the weight problem is transformed into an important evaluation problem regarding attributes in the rough set. This method requires no prior experience; the solution process is completely driven by the actual sample and has effective operability. Considering the randomness of the cloud model theory, when processing small sample data sets, some test samples should be retained, and repeated tests should be conducted to obtain the prediction model with the highest accuracy.