Introduction

During tunnel design and construction, it is crucial to determine the initial state of stress and underground material properties, for both understanding the structural behavior and modeling the construction area numerically (Sakurai and Takeuchi 1983). However, owing to the nonhomogeneous characteristics of the ground and uncertainties in exploration, these rock or soil parameters can only be identified within limited scope, and the values obtained from field or laboratory tests cannot be used directly for further analysis. Accidents have occurred occasionally because of the use of inappropriate parameters in tunnel design (Hoek and Diederichs 2006; Cai 2011; Zhang et al. 2020b).

The reliability of tunnel design parameters can be verified through field monitoring (Liu and Liu 2019). Further, the displacement, as the most common monitoring index in tunnel construction, can be a good base for the back analysis of rock or soil parameters. Ever since the first paper on displacement-based back analysis was published (Kavanagh and Clough 1971), this method has been used to estimate the following:

  • Basic mechanical rock or soil parameters (Feng et al. 2000; Fakhimi et al. 2004; Yazdani et al. 2012; Shreedharan and Kulatilake 2015; Kang et al. 2017)

  • Rheological parameters (Sharifzadeh et al. 2013; Qi and Fourie 2018; Zhang et al. 2020b)

  • Seepage parameters (Wu et al. 2019)

  • Parameters of rock or soil constitutive model (Rechea et al. 2008; Gao et al. 2020)

  • In situ stress (Kaiser et al. 1990; Gao and Ge 2016; Bertuzzi 2017; Xu and Ni 2019)

Generally, back analysis techniques based on displacement can be classified into three categories (Qi and Fourie 2018): (1) analytical or semi-analytical methods (Bertuzzi 2017; Zhang et al. 2017); (2) numerical modeling (Zhang et al. 2006; Hisatake and Hieda 2008; Luo et al. 2018), and (3) statistical learning (Huang et al. 2011; Song et al. 2015; Gao and Ge 2016; Liu and Liu 2019; Zhang et al. 2020b). In analytical or semi-analytical methods, assumptions and simplifications are often made to obtain feasible solutions, resulting in the restricted applicability of such methods in practical engineering cases. Numerical methods are widely utilized to calculate the displacement, and the input parameters for numerical models can be estimated through trial-and-error or other optimization algorithms (Hisatake and Hieda 2008; Manzanal et al. 2016). However, many numerical models have to be created and solved in the process of back analysis each time. Finally, the statistical learning method, with an artificial intelligent algorithm as the kernel, can well address this issue and improve the computational efficiency. With the rapid development of big data science in recent years, intelligent algorithms, including the artificial neural network (ANN), support vector machine (SVM), Gaussian process (GP), and extreme learning machine (ELM), have been successfully applied in geotechnical engineering (Zhou et al. 2016; Xue and Xiao 2017; Xue et al. 2020; Zhang et al. 2020a, b). These intelligent data-driven models can map any nonlinear relationship without prior assumptions relating to the data (Feng et al. 2004) and may become a substitute for numerical analysis through sample training. Thus, the problem of numerical back analysis can be transformed into an optimization problem without the excessive computational burden.

In earlier studies, when performing displacement-based back analysis, researchers often neglected two main points:

  • The first is displacement loss. In a rock tunnel, the displacement monitors are generally assembled after the completion of the first lining as the excavation work continues. Therefore, there would always be a distance d between the monitoring section and the tunnel face, as depicted in Fig. 1a. The rock displacement that occurs before the installation of instruments is not recorded. The monitoring curve is only a part of the total displacement curve (shown with a solid line in Fig. 1b), and the rest of the curve is termed the displacement loss (represented with the dashed line in Fig. 1b), which can be subdivided into three parts: the displacement occurs before the excavation of the tunnel face, the displacement occurs after the excavation and before the construction of first lining, and the displacement occurs after the first lining construction and before the installation of monitors. However, in numerical simulations, displacement can be recorded as soon as the rock excavation. In the process of displacement-based back analysis, if simulation displacement is calculated to approach the monitoring displacement without subtracting the displacement loss, a large negative effect would be caused on the precision of prediction. For several years, displacement loss has been considered based on the load release coefficient (Yang et al. 1983; Lu et al. 2014; Gao and Ge 2016), which is uncertain because it is determined by engineering experience. Recently, some researchers have noticed this issue (Zhang et al. 2020b), and more effective methods are in need to be proposed to consider displacement loss.

  • The second is the space effect. When a tunnel is excavated, the displacement of the rock varies with the excavation process. This space effect is sufficiently reflected in the displacement curve. However, in many back analysis studies (Song et al. 2015; Wu et al. 2019; Xu and Ni 2019), researchers often have neglected the process of displacement variation and have used the final value of displacement, which would affect the accuracy and applicability of back analysis method. Reasons are that the vault settlement or horizontal convergence may be the same for two tunnel excavation cases, whereas the displacement curves cannot be identical because of different rock categories, initial stress states, and construction methods. Therefore, it is necessary to consider the space effect when performing displacement-based back analysis.

Fig. 1
figure 1

a Positions of initial monitoring section and tunnel face in practical excavation. b Profiles of monitoring displacement and displacement loss

This paper proposes a displacement-based back analysis approach that considers the displacement loss and space effect, to predict the parameters of surrounding rocks during tunnel excavation. The core algorithm is referred to as PSO-ELM-PSO, in which the ELM model optimized by the PSO algorithm works as a substitution for the numerical analysis, and the PSO algorithm is utilized again to search for the optimal rock parameters. The process and detailed considerations of the proposed method are introduced in the “Methodology” section. A case study for validating the method and investigating the impact of the monitoring parameters on the accuracy of the algorithm is presented in the “Case study” section. In the “Engineering application” section, the application of the method to the excavation of a tunnel in Shenzhen, China, is described. Thereafter, the PSO-ELM-PSO and PSO-ELM algorithms are compared, and the limitations and prospects of the proposed method are discussed in the “Discussion” section. Finally, the principal conclusions of this study are summarized in the “Conclusions” section. The primary novelty of this study has two aspects. (1) An integrated algorithm named PSO-ELM-PSO is applied to back analyze rock parameters in the tunnel construction, in which the displacement loss and space effect are both considered. (2) The variation law of prediction error is revealed on the selection of the rock parameters and the change of the monitoring parameters. Measures are also provided to improve the quality of training samples and the precision of the algorithm.

Methodology

The proposed displacement-based back analysis method comprises three key parts:

Consideration of displacement loss

As discussed above, displacement loss cannot be avoided in field monitoring, and it is relevant to the initial monitoring time or distance. In numerical simulations, the displacement values are generally considered functions of the computation time, and the resultant curve is the time–displacement curve. Because the actual construction time is different from the computation time and the initial monitoring time is difficult to determine from the time–displacement curve, it is hard to correlate the monitoring curve with the simulation curve and perform displacement-based back analysis. Nevertheless, the distance between the monitoring section and the tunnel face can easily be measured, and the initial monitoring distance remains stable (usually equal to the length of one construction footage) for a specific tunnel section.

In this study, both the simulation and monitoring displacements were recorded according to the monitoring distance. The steps of a simple procedure used to predict the rock parameters and displacement loss are listed as follows: (1) conduct numerical analyses with different rock parameters, (2) select parts of the total distance–displacement curves as the simulation monitoring curves, (3) map the simulation monitoring curves to the rock parameters, (4) predict the actual rock parameters based on the actual monitoring curve, (5) compute the total displacement curve using the predicted rock parameters, and (6) predict the actual displacement loss. It should be noted that the displacement loss is considered the summation. The three parts of displacement loss are not studied in detail in this research.

Consideration of space effect

As mentioned in the “Introduction,” most previous studies have performed back analysis based on the final value of the monitoring displacement, regardless of the variation process. In the present study, the space effect is considered by introducing parameters representing the rock deformation process, together with the final values of displacement, as the output parameters of the algorithm. That is, the monitoring displacement curve is fitted with an empirical equation, and the fitting parameters are considered the process-representing parameters of rock deformation. This study employs the equation in the natural logarithmic form (Zhang et al. 2009):

$$ U=a+b\times \log \left(1+d\right) $$
(1)

where U is the rock displacement, a and b are the fitting parameters, and d is the monitoring distance.

In addition, the displacement curves for different monitoring positions in the tunnel cross-section are different, resulting in many sets of fitting parameters. To reduce the number of calculations, displacement values, normalized using Eq. (2), at one typical position (e.g., the tunnel vault) are chosen to calculate the process-representing parameters (a and b in Eq. (1)). The final values of vault settlement, horizontal convergence, and so on are the result-representing parameters. These two types of parameters constitute the output parameters of the algorithm.

$$ \overline{u_i}=\left(\left|{u}_i\right|-{\left|u\right|}_{\mathrm{min}}\right)/\left({\left|u\right|}_{\mathrm{max}}-{\left|u\right|}_{\mathrm{min}}\right) $$
(2)

where ui is a series of displacement values, \( \overline{u_i} \)is the normalized displacement values, i is the series number, and |u|max and |u|min are the maximum and minimum of the absolute displacement values.

Establishment of training set

Before initiating the primary algorithm, the training samples should be set as the objects for machine learning; the procedure is presented in Fig. 2(a). To ensure the quality of training samples and the accuracy of the algorithm, a relatively strong correlation is required between the input and output parameters. Sensitivity analysis (Sharifzadeh et al. 2013) can be performed to identify the input parameters most relevant to output parameters.

Fig. 2
figure 2

Flowchart of proposed back analysis method: processes of (a) setting train samples, (b) PSO algorithm, and (c) PSO-ELM-PSO algorithm

In this study, the outputs of the system are the displacement parameters (vault settlement, horizontal convergence, fitting parameters of displacement curve, and so on), which are affected by numerous factors (geostatic stress, rock mechanical parameters, and so on). These n factors, which can also be seen as potential inputs, are written as {α1, α2, α3, …, αn}, and the relevant output parameters are defined as f{α1, α2, α3, …, αn}. During the sensitivity analysis, the parameter αi varies within a specific range (±Δαi) whereas all the other parameters remain constant. The sensitivity of αi can be calculated using the sensitivity function, as follows:

$$ S\left({\alpha}_i\right)=\sum \limits_{j=1}^k{S}_j=\sum \limits_{j=1}^k\left|\frac{f\left({\alpha}_i+\varDelta {\alpha}_i\right)-f\left({\alpha}_i-\varDelta {\alpha}_i\right)}{f\left({\alpha}_i\right)}\right|/\left(\frac{2\varDelta {\alpha}_i}{\alpha_i}\right) $$
(3)

where S(αi) is the sensitivity of αi and k is the number of output parameters.

After the sensitivity analysis, parameters with large sensitivities would be selected as the inputs. Thereafter, numerical models with different sets of selected input parameters can be built and calculated. Subsequently, the computational deformation data would be recorded and processed as the outputs. By integrating the values of the input and output parameters, the training set could finally be established. Detailed procedures of setting the training samples for a specific case could be seen in the “Model settings and operation” section.

Process of back analysis algorithm

Particle swarm optimization

Inspired by the behavior of bird foraging, Eberhart and Kennedy (1995) proposed an innovative global optimization algorithm named particle swarm optimization. The PSO algorithm is initialized with a “swarm” of random particles. Each particle indicates a potential solution to the problem, and has three characteristic indices: position, velocity, and fitness value. During optimization, particles tend to move toward their individual best positions (Pbest) and the global best position (Gbest), based on the velocity and distance from the best positions (Jahed Armaghani et al. 2020). The values of Pbest and Gbest are determined using the particle fitness value, which is calculated using the predefined fitness function. In this study, the fitness function of PSO can be expressed as

$$ fitness=\sum \limits_{i=1}^n\sum \limits_{j=1}^m{\left({T}_{ij}-T{\prime}_{ij}\right)}^2/ mn $$
(4)

where m and n are the numbers of samples and output parameters, respectively, and T and T′ denote the actual and simulation values, respectively. The process of PSO can be seen in Fig. 2(b). In every iteration, the particle velocity and position are updated using the following equations:

$$ \left\{\begin{array}{c}{X}_i^{m+1}={V}_i^{m+1}+{X}_i^m\\ {}{V}_i^{m+1}={\omega}^m\times {V}_i^{m+1}+{c}_1\times {r}_1\times \left({P}_{\mathrm{best}}^m-{X}_i^m\right)+{c}_2\times {r}_2\times \left({G}_{\mathrm{best}}^m-{X}_i^m\right)\end{array}\right. $$
(5)

where m is the current iteration number; Vi and Xi are the velocity and position vectors, respectively, of particle i; c1 and c2 represent the acceleration constants and are set as 1.2 in this study; r1 and r2 are two random numbers uniformly distributed in [0 1]; and ω is the inertia weight factor with a linear decreasing form and can be calculated as follows:

$$ {\omega}^m={\omega}_{\mathrm{start}}-\left({\omega}_{\mathrm{start}}-{\omega}_{\mathrm{end}}\right)\times m/{m}_{\mathrm{max}} $$
(6)

where mmax is the maximum of the iteration number; ωstart and ωend correspond to the initial and final inertia weights and are usually set as 0.4 and 0.9, respectively (Xue and Xiao 2017; Zhang et al. 2020a, b).

Extreme learning machine

The extreme learning machine, proposed by Huang et al. (2004), is a modification of the feedforward neural network with a single hidden layer. It features a high learning speed and exhibits remarkable generalization performance, and has been widely applied in recent years (Song et al. 2015; Xue et al. 2020).

The typical structure of the ELM model, composed of the input layer (with m nodes), output layer (with n nodes), and hidden layer (with l nodes), can be seen in Fig. 3. We assume that there are Q different training samples (Xi, Yi) (i = 1, 2, …, Q) for the ELM model; Xi = [x1, x2, …, xm]T and Yi = [y1, y2, …, yn]T are the input and output parameters. αl × m and βn × l are the connection weights from the input layer to the hidden layer and the hidden layer to the output layer, respectively, and B = [b1, b2, …, bl]T is the bias of the hidden layer. Employing the active function g(x), the goal of the ELM training can be expressed as

$$ \sum \limits_{i=1}^Q\left\Vert {Y}_i-{\beta}_{n\times l}g\left({\alpha}_{l\times m}{X}_i+{B}_{l\times 1}\right)\right\Vert \to 0 $$
(7)
Fig. 3
figure 3

Structure of ELM algorithm

The input connection weight αl × m and hidden layer bias B are assigned randomly, and the output connection weight βn × l can be calculated using the least squares method (Huang et al. 2006).

Generally, in modeling data using ELM, the standard training algorithm includes three steps (Song et al. 2015): (1) determine the node number of the hidden layer and set the input connection weight αl × m and hidden layer bias B, (2) select an infinite differentiable function as the activation function and calculate the output array of the hidden layer, and (3) calculate the output connection weight βn × l. However, recent literature has revealed that the solution models of basic ELM might get trapped in local minima owing to the stochastic initialization of the input weights and hidden biases (Cao et al. 2012). Optimization algorithms, e.g., the cuckoo search algorithm (Mohapatra et al. 2015) and firefly algorithm (Satapathy et al. 2017), can be used to improve the generalization capacity of the ELM.

PSO-ELM-PSO

Table 1 shows the results obtained with several typical machine learning methods to fit the nonlinear functiony = x + x2 + 2 sin(πx),   x ∈ [−5, 5]. Two hundred data samples uniformly distributed in the domain are selected as the learning set, and 10 data samples are randomly chosen as the testing set. The performance of the methods are evaluated by the mean square error of the testing data and the computing time. It can be seen that the PSO-ELM method features high precision and relatively short computing time.

Table 1 Comparison of the results obtained with different machine learning methods

In this study, the process of the integrated use of PSO and ELM is presented in Fig. 2(c). First, PSO is employed to optimize the input weights and hidden biases of the ELM. Subsequently, the PSO-ELM model is trained to substitute for the numerical analysis of tunnel excavation. After the preparation of the testing samples, PSO is again used to determine the best rock parameters, based on which the displacement variation closest to the monitoring curve can be calculated. Finally, numerical models are solved using the optimal rock parameters, and the displacement loss of the surrounding rock can be predicted.

In addition, a slight fluctuation has been observed in the results of PSO-ELM-PSO owing to different sets of weights and thresholds in every searching process. To avoid the accidental error, the principal program is run 20 times, and the average value is taken as the final result. This has been proven to be a feasible way to improve the precision of the algorithm (Zhang et al. 2020a, b).

Case study

To verify the performance of the proposed back analysis method, simulations were conducted for the excavation of a deep buried tunnel.

Model settings and operation

A three-dimensional model (Fig. 4) was established using the finite element software ABAQUS 6.13 for predicting the parameters of rock mass in a tunnel under construction. This circular tunnel, with a diameter of 12 m, was buried 80–120 m beneath the ground and fully excavated. The area of the surrounding rock in the xz plane was 50 m × 50 m. The total excavation length was 36 m in the model and the excavation footage was 3 m. The horizontal displacement was constrained at the left, right, front, and back boundaries, and the vertical displacement was constrained at the bottom boundary of the model. Pressure was applied on the top boundary to simulate vertical load of rock. The widely used Mohr–Coulomb model was adopted as the constitutive model, while the elastic model was adopted for the tunnel lining, which was composed of C25 shotcrete. The values of the rock parameters that varied within certain ranges are listed in Table 2.

Fig. 4
figure 4

Numerical model of deep buried circular tunnel

Table 2 Parameter settings of rock and lining for the deep buried circular tunnel

According to the procedures described in the “Methodology” section and presented in Fig. 2, the following steps were adopted to execute the back analysis algorithm.

  1. STEP 1:

    The final values of vault settlement (UA) and horizontal convergence (UBB’), along with the fitting parameters (a, b) calculated using Eqs. (1) and (2), were selected as the output parameters of the algorithm. The monitoring positions are marked as A, B, and B′ in Fig. 4.

  2. STEP 2:

    The sensitivity analysis was conducted using Eq. (3); the sensitivity of the rock parameters can be determined from Table 3. Four parameters with relatively large sensitivities (elasticity modulus, Poisson's ratio, burial depth, and lateral pressure coefficient in the x-direction) were chosen as the input parameters. The other parameters were set constant in the numerical models.

  3. STEP 3:

    Based on the ranges of the rock parameters presented in Table 2, each selected input parameter was divided into five levels, as listed in Table 4. By employing the orthogonal table of six factors and five levels, 25 numerical models were built and calculated. The inputs were obtained by the orthogonal design table, and the outputs were acquired from the computational displacement data of the 25 numerical models. By integrating all the values of the input and output parameters, the training set was established.

  4. STEP 4:

    Six numerical models with the predefined rock parameters within their ranges were calculated and regarded as the actual situations. Referring to common engineering practice, the rock displacements began to be recorded after the construction of the first lining. The values of input and output parameters were combined as the testing set.

  5. STEP 5:

    Based on the training samples, the PSO-ELM-PSO algorithm, programmed in MATLAB R2013a, was applied, and the rock parameters of the three models were predicted.

  6. STEP 6:

    Numerical models were solved using the predicted rock parameters, and the total displacement curves and the displacement loss were finally computed.

Table 3 The sensitivity of rock parameters
Table 4 Orthogonal design levels of the rock parameters

Results of prediction

Table 5 presents the results of the prediction and the relative errors for three of the testing samples in details. The relative errors of all the parameters are acceptable, of which the maximum is below 5%. Relatively speaking, the errors of the parameters with large sensitivities are small. For example, the sensitivity of the lateral pressure coefficient in the x-direction (λx) is 2.27, which is the peak, and the error can be as low as 0.48%, which is the lowest among all the results. Nevertheless, the Poisson's ratio (μ) seems to be the exception. Although the sensitivity of the Poisson's ratio (μ) is only 0.59, the relative error is slight. This is because the variation range of μ is small, resulting in a relatively accurate prediction. The comparisons between the simulated and monitoring displacement curves are presented in Fig. 5. It can be seen that all the simulation curves agree well with the monitoring curves. The displacement loss occurring before the construction of the lining accounts for 20–30% of the total displacement, which is in line with engineering experiences and the results of previous studies (Zhang et al. 2009). This analysis of the results of the predictions indicates that the proposed back analysis method can provide reliable results and is feasible for engineering applications.

Table 5 Prediction results and relative errors obtained using PSO-ELM-PSO method for different testing samples.
Fig. 5
figure 5

Comparisons of monitoring and simulating displacement curves for a sample 1, b sample 2, and c sample 3

The prediction results with and without considering displacement loss and space effect can be seen in Table 6. To avoid the influence caused by the difference of testing samples, the relative errors of all the testing samples for the four rock parameters are averaged out, and the average error of the four rock parameters is also calculated. When the displacement loss and space effect are both considered in back analysis, the relative errors of the four rock parameters are low, and the average error is 2.76%. In the case without considering the space effect, the relative error in elastic modulus increases threefold, and the average error rises to 6.59%. When displacement loss is not considered, the errors of all the parameters increase significantly, and the average error is 4.6 times of that in the original case. Therefore, it is important and necessary to consider the displacement loss and space effect in back analysis of rock parameters.

Table 6 Prediction errors obtained using PSO-ELM-PSO method with and without considering displacement loss and space effect

Analysis of rock parameters

In previous literature on displacement-based back analysis, some researchers selected elasticity modulus and Poisson's ratio as the target parameters, while others selected three or more rock parameters based on different considerations. In this study, four rock parameters with relatively large sensitivities were chosen as the inputs. To study the effects of rock parameter selection, the simulations were performed as follows:

  • Case 1: Two rock parameters (elasticity modulus and Poisson's ratio) were selected as the input parameters.

  • Case 2: Four parameters (elasticity modulus, Poisson's ratio, burial depth, and lateral pressure coefficient in the x-direction) were selected as the input parameters.

  • Case 3: Six parameters (elasticity modulus, Poisson's ratio, burial depth, lateral pressure coefficient in the x-direction, cohesive force, and internal friction angle) were selected as the output parameters.

The results of the prediction are provided in Fig. 6. The errors of parameters in the case of 2 inputs are close to those in the case of 4 inputs. However, when input number increases to 6, a large growth is witnessed in errors of all the parameters. The average error rises to 7.74%, nearly three times of that in the case of 4 inputs. It can be explained that the sensitivity value is relatively low for cohesive force and internal friction angle (Table 3). Introducing these two parameters to the inputs would reduce the quality of training samples, causing the decrease of prediction accuracy. Therefore, when establishing the training set, it is suggested to choose parameters with high correlations as the inputs and outputs.

Fig. 6
figure 6

Error variations in rock parameters for different inputs

Analysis of monitoring parameters

During tunnel construction, some monitoring parameters, such as monitoring lines, initial monitoring distance, and monitoring frequency, may change because of different rock grades, construction programming, or some other factors. To test the adaptability of the proposed method to variations in the monitoring parameters, researches on monitoring parameters were performed as in Fig. 7.

Fig. 7
figure 7

Simulation arrangement for analysis of monitoring parameters

Effect of monitoring lines

When studying the effects of monitoring lines, the simulations were performed as follows:

  • Case 1: Lines A and BB′ were monitored. Two parameters (UA, UBB′) and four parameters (UA, UBB′, a, b) were selected as the output parameters.

  • Case 2: Lines A, BB′, and AB (equal to line AB′) were monitored. Five parameters (UA, UBB′, UAB, a, b) were selected as the output parameters.

  • Case 3: Lines A, BB′, AB (equal to line AB′), and CC′ were monitored. Six parameters (UA, UBB′, UAB, UCC′, a, b) were selected as the output parameters.

The results of the prediction are provided in Fig. 8a. In the different cases of monitoring lines, the errors in λx remain low, whereas those in E fluctuate sharply. This may be attributed to the relatively low sensitivity and a wide range of variation in E. With the increase of monitoring lines, the average error also grows. It is because the increase of monitoring lines does not mean the improvement of the quality of the training samples. In fact, adding displacement data with low correlation with rock parameters to the training set would reduce the prediction accuracy. As a consequence, it is preferred to use displacement data at typical positions (vault settlement and horizontal convergence) when performing displacement-based back analysis.

Fig. 8
figure 8

Error variations in rock parameters for a monitoring line, b initial monitoring distance, and c monitoring frequency

Effect of initial monitoring distance

In studying the effects of the initial monitoring distance, the simulations were arranged as follows:

  • Case 1: The distance between the initial monitoring section and tunnel face was 3 m.

  • Case 2: The distance between the initial monitoring section and tunnel face was 15 m.

  • Case 3: The distance between the initial monitoring section and tunnel face was 27 m.

The results of the prediction are provided in Fig. 8b. The relative errors vary a little with the changes in the initial monitoring distance. It can be inferred that the changes in the initial monitoring distance result in minor fluctuations in the results of the prediction, and that a certain delay in monitoring would not affect the accuracy of the algorithm. It should be noted that with the increase of the initial monitoring distance, the error in λx increases and that in H decreases. In the case that the initial monitoring distance is 27 m, the error in λx is close to that in E, and the error in H becomes the lowest. Possible reasons are given that the sensitivity value of rock parameter may change for different parts of displacement curve. The displacement variations are not so sensitive to the changes of E when the rock deformation becomes gradually stable, causing the relatively larger prediction error in E.

Effect of monitoring frequency

In studying the effects of monitoring frequency, the simulations were performed as follows:

  • Case 1: Monitoring data were recorded every 3 m of excavation length.

  • Case 2: Monitoring data were recorded every 6 m of excavation length.

  • Case 3: Monitoring data were recorded every 9 m of excavation length.

The results of the prediction are listed in Fig. 8c. For different monitoring frequencies, the order sorted by relative error of the rock parameter is unchanged. Although no apparent rule is found in the error variations of the rock parameters, the average error declines with the decrease in the monitoring frequency. It can be inferred that a low monitoring frequency, or generally, selecting typical monitoring values to set the training samples, would assist in the improvement of the accuracy of the algorithm.

Engineering application

The Jigongshan Mountain Tunnel, part of the Banyin Expressway connecting the Luohu District and the Longgang District in Shenzhen, China, is designed to pass underneath the Xiaping Municipal Solid Waste (MSW) Landfill (Feng et al. 2020). Figure 9a depicts the position and route of the tunnel. It is a separated tunnel with a length of 4.6 km and a six-lane expressway in both directions. The particular form of the three-center circle was adopted for the inner contour of the tunnel owing to its good mechanical performance and the tunnel net width was approximately 15 m. A new Austrian method and the benching tunneling method were employed for tunnel construction, and the vault settlement and horizontal convergence were monitored during tunnel excavation (Fig. 9b). In the preliminary geological investigation, the geostress was measured in the geological borehole SD-X24 using the hydro fracturing method. The position of the borehole is marked in Fig. 9a, and the results are presented in Table 7. Considering the orientation of the tunnel axis near SD-X24, the lateral pressure coefficient in the direction of the tunnel cross-section was calculated to be approximately 1.8.

Fig. 9
figure 9

a Location and route of tunnel in case study. b Tunnel cross-section of LK2+820 and monitoring lines

Table 7 Results of the geostress measurement using hydro fracturing method for borehole SD-X24

The proposed back analysis method was applied to the LK2+820 tunnel section, which is near the borehole SD-X24, to estimate the surrounding rock parameters and displacement loss. The numerical model is depicted in Fig. 10, and the parameter ranges are listed in Table 8. Because the burial depth of LK2+820 was ascertained to be 200 m via the preliminary geological investigation, three other parameters (elasticity modulus, Poisson's ratio, and lateral pressure coefficient in the x-direction) were chosen as the input parameters, based on the sensitivity analysis. All the training samples were set following orthogonal design and learned by the proposed algorithm.

Fig. 10
figure 10

a Numerical model of LK2+820 tunnel section. b Support system. c Mesh of model

Table 8 Parameter settings of rock and support system for the tunnel section of LK2+820

The total displacement curve and rock parameters were predicted based on the monitoring data recorded during the construction process. The monitoring and simulation displacement curves are exhibited in Fig. 11. Although the construction footage varies within 2–4 m in the practical excavation process, the simulation curves could still match the monitoring data well. Further, the displacement loss occurring before the construction of the lining accounts for 30% of the total displacement. The values of the elasticity modulus, Poisson's ratio, and lateral pressure coefficient were estimated to be 9.38 GPa, 0.266, and 1.83, respectively. The predicted lateral pressure coefficient is in accordance with the results of the geostress measurement, indicating that the proposed method provides a suitable alternative approach for estimating ground stress and, thus, can be studied further.

Fig. 11
figure 11

Comparison of monitoring and simulation displacement curves for LK2+820 tunnel section

Discussion

Comparison with PSO-ELM method

In the PSO-ELM-PSO method, the PSO-ELM algorithm was employed as a substitution for the numerical analysis, and the PSO algorithm was used to search for the best solution. Subsequently, in the PSO-ELM model proposed herein, the parameters to be predicted were set as the input parameters, and the known parameters were set as the outputs. However, several researchers have considered the opposite case (Song et al. 2015; Gao and Ge 2016; Zhang et al. 2017). Thus, the target parameters can be predicted as long as the monitoring data are inputted. In this section, we present a comparison between the two abovementioned methods based on the data provided in the case study.

The prediction results of PSO-ELM are presented in Table 9. The relative errors for some parameters are small, whereas others are significantly higher compared to those for PSO-ELM-PSO, as presented in Table 5. Figure 12 presents the convergence of PSO-ELM-PSO and PSO-ELM in the sample-learning procedure. It can be seen that the fitness value of PSO-ELM-PSO is much lower, and the convergence rate of PSO-ELM-PSO is significantly faster than that of PSO-ELM. This is because when the training samples of PSO-ELM-PSO are arranged, the input rock parameters can be set based on the orthogonal design. Moreover, the input parameters of PSO-ELM are displacement values, which are calculated using numerical models and are difficult to arrange. Therefore, the quality of the PSO-ELM training samples is worse, resulting in a higher fitness value and higher errors in prediction.

Table 9 Prediction results and relative errors obtained using PSO-ELM method for different testing samples
Fig. 12
figure 12

Convergence of PSO-ELM-PSO and PSO-ELM algorithms in sample-learning procedure

Limitations and prospects

Based on the comparison presented in the “Comparison with PSO-ELM method” section, it can be inferred that the quality of the training set, which is determined by the arrangement and number of samples, plays a vital role in the accuracy of the algorithm. In many studies, the high precision of the method is supported by a large number of training samples—up to hundreds or even thousands (Xue and Xiao 2017; Qi and Fourie 2018). One of the advantages of the proposed method is that although trained using relatively few samples (25 samples in this study), the error in prediction is still acceptable within the range of the parameters. However, the error can be large if some of the parameters required for the prediction lie outside the predefined range. Extending the parameter range may improve the compatibility. However, more samples have to be set to ensure precision, necessitating further simulation and prolonging the calculations. In addition, the stability of the proposed method must be improved because obtaining a stable average through multiple calculations is also time-consuming.

For predicting rock parameters using the intelligent algorithm for the excavation of tunnels, further studies will be required to focus on the following:

  • Improving the prediction accuracy using few training samples—with the rapid development of the intelligent algorithm, new machine learning and optimization methods can be introduced in this field.

  • Improving the stability of the algorithm—improved optimization algorithm could be employed to restrain the fluctuations in the results of the prediction.

  • Verifying the applicability of the method in other cases of tunnel excavation—the excavation methods, the support system, and some other factors could also affect the displacement curve, leading to further discussions on parameter selection and data processing.

Conclusions

A displacement-based back analysis method characterized by the PSO-ELM-PSO algorithm is presented herein for estimating the rock parameters during tunneling. The considerations of displacement loss and space effect are reflected in the process of setting the input and output parameters. The case study and engineering application presented prove that this method can be well generalized, offer high precision, and be applied to other underground parameter estimations. The other primary conclusions drawn from this study are as follows:

  1. (1)

    Owing to convenience in measurement, the monitoring distance is proven to be a suitable substitute for the monitoring time when recording the displacement data, both in simulation and practice. The loss in monitoring displacement is then correlated with the initial monitoring distance. Further, the space effect, which is usually reflected in the variations in displacement of the surrounding rock, is described by the key parameters of the curve-fitting function. Therefore, by selecting and processing the output parameters of the proposed algorithm, the displacement loss and space effect could be considered.

  2. (2)

    The case study illustrates that the errors in the prediction of the rock parameters are all below 5% for the different testing samples, and that the simulation displacement curves match the real curves well. The displacement loss occurring before the construction of the lining accounts for 20–30% of the total displacement. The relative errors in the rock parameters with relatively larger sensitivities usually remain small and stable with the changes in the monitoring parameters. Further, to improve the prediction accuracy, it is suggested to select the rock parameters with large sensitivities as the inputs and typical monitoring values as the outputs when setting the training samples for displacement-based back analysis.

  3. (3)

    The prediction results for the LK2+820 section in the Jigongshan Tunnel indicate that the proposed back analysis method, along with the PSO-ELM-PSO algorithm, could be suitably applied to engineering applications to predict the rock parameters and displacement loss. Moreover, as verified by the results of the field measurement, this method could offer an alternative approach for the estimation of ground stress.

  4. (4)

    Comparisons between the PSO-ELM-PSO and PSO-ELM algorithms demonstrate that the quality of training samples is significant for the accuracy of the algorithm. Conducting orthogonal design when arranging the training set could help improve the quality of the learning samples and the precision of the algorithm. Further research is required to improve the stability and applicability of the method using few training samples.