1 Introduction

In recent years, there has been an increasing interest in the tunnel construction [1,2,3,4]. Ground surface settlement due to tunneling is one of the inevitable problems in tunnel engineering. The long-term settlement and differential settlement of tunnels have led to serious longitudinal deformation. The deformation pattern of tunnels is the step between rings rather than by the beam/cylindrical shell bending. Most of the lining rings distort into the shape of a horizontal ellipse (tunnel squat), but a few acquire the shape of a vertical ellipse. Large tunnel deformation has caused groundwater infiltration and the separation of ballastless track bed and lining. Further analysis shows that long-term tunnel settlement is mainly due to urbanization induced land settlements in Shanghai. The magnitude of tunnel settlement is correlated with sublayer settlement rather than ground surface settlement. In the early operational years, post-construction settlement induced by tunneling and the cyclic loading of trains may contribute significantly to the tunnel settlement. However, at some special places, nearby construction and groundwater infiltration are responsible for the long-term settlement of the tunnel [5].

In the past few decades, as a new tool for analysis of the tough geotechnical problems, artificial neural networks (ANNs) have been successfully applied to address a number of engineering problems, including deformation due to tunneling in various types of rock mass. Unlike the classical regression methods in which a certain form for the approximation function must be presumed, ANNs do not require the complex constitutive models. Additionally, it is traced that the ANN prediction system is one of the most effective ways to predict the rock mass deformation. Furthermore, it could be envisaged that ANNs would be more feasible for the dynamic prediction of displacements in tunneling in the future, especially if ANN models are combined with other research methods [6].

In this study, the ground surface settlement has been investigated by tunnel depth, diameter, internal friction angle, adhesion, and soil permeability coefficients. Then, by transferring the data obtained from ABAQUS, as input data to MATLAB and by creating a multilayer perceptron network, the ground surface settlement prediction model is designed. First, in 1958, Martos, and Peck in 1969 carried out their studies to model, calculate, and predict ground surface settlement during tunneling [7, 8]. Pack developed his model with respect to the inverse normal distribution model. According to this theory, the ground surface settlement is obtained from Fig. 1 and the following equation.

$$S_{v} = S_{\max } e^{{\frac{{ - x^{2} }}{{2i^{2} }}}}$$
(1)

where s is the vertical ground surface settlement at any point, Smax is the maximum settlement of ground surface on the tunnel axis, i is the horizontal distance of the tunnel axis from the turning point of the settlement curve, and x is the horizontal distance from the tunnel axis.

Fig. 1
figure 1

Ground surface settlement above single tunnel described [8]

Thereafter, many researchers investigated and predicted the ground surface settlement and displacement, by modifying the parameters introduced by Pack et al. Research in this field can be divided into four categories. The first category is analytical studies that utilize relationships in soil and rock mechanics and combine them with mathematical foundations to predict the amount of settlement in single and twin circular tunnels [9,10,11,12,13,14,15,16]. Experimental and laboratory research falls into the second category. Most studies in this field are carried out by creating a tunnel element and its surroundings at a very smaller scale in the laboratory environment than the actual element and utilizing a centrifuge system [17,18,19,20]. The next category of research is numerical studies. Most numerical studies using finite element methods and finite difference methods have investigated surface settlement and subterraneous displacement due to tunneling [21,22,23,24,25,26,27,28,29]. Finally, artificial intelligence research and neural networks are used in most studies today. The high accuracy and speed of computation in predicting different parameters, taking into account the maximum of independent variables affecting the parameter in question, have led to the increasing use of neural networks in different researches [30,31,32,33,34,35]. In this study, it has been tried to predict the ground surface settlement due to tunneling by combining two numerical methods and using artificial neural networks.

2 Problem definition

The two-dimensional model created using ABAQUS finite element software, which is highly capable of analyzing soil environments, is 40 × 80. The type of analysis used in this model is standard analysis. The general form of the model considered in this study is given in Fig. 2. This model includes a frictional–adhesive homogeneous soil. The parameters investigated in this study are listed in Table 1. Other geometrical and mechanical parameters of the tunnel, such as tunnel diameter and modulus of elasticity, are assumed to be constant. In fact, this study analyzed three separate situations in accordance with Fig. 1 and that presented in Table 1. The behavioral model considered for the soil is the Mohr–Coulomb model.

Fig. 2
figure 2

Scheme of tunnel drilled in three depth modes

Table 1 Soil properties and used parameters [36]

The mesh created for the value \(H/D = 1\) is shown in Fig. 3. The model is divided into 5700 square corners. However, at points close to the tunnel section, the elements are rectangular in order to provide a more accurate analysis of the area. In both \(H/D = 1\) and \(H/D = 2\) models, in addition to the element geometry, the number of elements is equal to or near the number of elements in the first case. The finer elements are used in the surroundings of the tunnel section to increase the accuracy of the analysis. Also, the overall dimensions of the model are chosen to minimize the impact of lateral and lower boundaries. The groundwater level is selected 7 m below ground surface in all scenarios. As a result, in all the analyses, the tunnel drilling section is saturated. The mechanical properties of the soil with respect to φ and C and in situ stresses are assumed to be effective, given the saturation of the environment. Also, the relation 2 was used to calculate the lateral coefficient in the soil.

$$k = 1 - \sin \varphi$$
(2)

It should be noted that the amount of ground surface settlement is measured immediately after drilling. The analysis and measurement of settlements have been examined without considering the tools for tunnel stability. In fact, all the settlement values are obtained immediately after drilling.

Fig. 3
figure 3

Generally designed mesh in ABAQUS

In 2011, Yamamoto, in order to analyze the tunnel stability, using the upper and lower bound limit analysis method, which is considered one of the plastic analyses, obtained numbers as the stability number [36] and then similarly analyzed the tunneling settlements that used the Yamamoto method to utilize the upper and lower bound limit analysis [37]. The difference between the present study and the mentioned methods is considered the pore water pressure and soil saturation at the time of analysis.

Since the geometry of the model is complete, the finite element model, or mesh, is produced in Fig. 3.

(The program automatically generates the mesh and divides the geometric model according to the model structure into basic elements, according to which it fits the best base triangles in the model. Due to the fact that the analyses are performed in different phases, In all phases, an attempt has been made to select the coarseness mesh option and place the distribution element option on the fine every mode, and by reselecting the mass inside the tunnel and selecting a smaller mesh, the movement around the tunnel is calculated and observed more accurately. Specifications for the main model mesh are given in Table 2 (shown).

Table 2 Modeling mesh specifications

In general, all boundaries in each direction must have boundary conditions applied to them. If no boundary conditions are assigned to the model (free boundary), natural conditions apply. That is, the prescribed forces are equal to zero and the boundaries move freely. The software automatically sets a standard boundary to avoid the situation of indeterminate displacements, so that by designating reference points, the software completely rigs the horizontal boundary and by creating rolling boundaries in a vertical direction, it allows the soil to move and settle.

In this study, the urban tunnel with no overburden is considered at ground surface and the porosity ratio for the whole model is considered to be 0.58 with respect to its specific gravity and weight density of solid grains. The analysis is done in two steps. The first step includes entering the necessary information into the software, creating appropriate boundary conditions, and applying pore water pressures and effective in situ stresses in the soil model, and then, performing drilling operations, in which, immediately after drilling, ground surface settlement values are measured.

3 Results

Artificial neural network (ANN) is one of the scientific methods for predicting a variable through one or more other variables. Among the different methods of artificial neural networks is multilayer perceptron neural network, due to the use of post-diffusion algorithm. The method used in this study was used to calculate the error and reduce the error rate as much as possible and also to correct the error values by repeating the computational process.

In the study, after the set data required for network use were calculated and collected through finite element method software in 119 modes. A multilayered artificial neural network with a single input layer, the output layer and a hidden layer consisting of seven neurons or nodes was designed and built. Finally, the network is created by receiving three input variables, which include the diameter of the tunnel, the distance from the center to the center of the tunnel, and the depth of the center of the tunnel to the ground level, calculating the maximum subsidence value after passing through the network. The results indicate that the creation of a neural network, due to its very high speed and cost-effectiveness, is able to predict the surface subsidence resulting from the tunneling of twin tunnels.

The effect of the soil permeability coefficient and pore water pressure on the amount of ground surface settlement and soil displacement around the tunnel was investigated. As the permeability coefficient in the soil increased, the surface settlement due to tunneling decreased. According to the results obtained from H, as the ground depth increased, the higher the permeability coefficient in soil, the higher amount of surface settlement. However, as the angle of internal friction and adhesion in the soil increases, the intensity of this process is reduced, approaching the same trend for the other depths. Lowering the permeability beyond the values of a and b would have little effect on the surface settlement. Increasing the amount of adhesion in all the states and analyses has reduced the amount of ground surface settlement. Increasing the angle of internal friction of the soil, in most cases, has reduced the ground surface settlement. However, as soil adhesion increased simultaneously, this decreasing trend was less observed. In some cases, for larger adhesions up to 40 kPa, with increasing internal friction angle, the surface settlement has increased or been constant, which this trend has been observed more for tunnels closer to the surface. Ground surface settlement at different depths is completely dependent on the shear strength parameters of the soil. When the permeability coefficient is highest, and the internal friction angle of the soil was at its lowest, the surface settlement is a function of depth, which increased significantly with increasing depth. With increasing shear strength parameters, the dependence of ground surface settlement on depth is seen to be lower, so that when these parameters reach their maximum, it can be said that the surface settlement is independent of the depth of drilling.

4 Discussion

4.1 Investigation of the influence of permeability coefficient

One of the main objectives of this study is to investigate the effect of the soil permeability coefficient and pore water pressure on the amount of ground surface settlement and soil displacement around the tunnel environment. By investigating the changes in soil permeability, according to Table 3 and Fig. 4, with increasing soil permeability coefficient, the amount of ground surface settlement due to tunneling has decreased. This trend is clearly seen in the values obtained for \(H/D = 1\). The results for \(H/D = 2\) and \(H/D = 3\) in most cases illustrate this trend. With increasing depth, the angle of internal friction and adhesion in soil increased, and the inverse relationship between soil permeability coefficient and ground surface settlement was more pronounced.

Table 3 Ground surface settlement in various modes
Fig. 4
figure 4

Ground surface settlement curve for different permeability coefficients in three depths: a H/D = 1, b H/D = 2, c H/D = 3

It should be noted that, at less internal friction angles, this relationship is not true. Hence, for different values of the settlement with the permeability coefficient \(H/D = 3\), the ground surface settlement was higher than the values for the settling with the other values with less permeability coefficient. Also, according to the results obtained from A, with increasing penetration in the soil, the higher the permeability coefficient in the soil, the higher the saturation value of the soil, approaching the same trend for the other depths. Another point is that, with respect to the values in Table 3, the amount of settlement for the values of Η/D and k, especially for the values obtained in \(H/D = 2\) and \(H/D = 3\), are almost equal in all cases and follow a similar trend. It seems that more decreasing permeability values will not have a significant effect on the ground settlement.

4.2 Investigation of the effect of adhesion changes in soil

By increasing the values obtained from Table 3 and Fig. 5, the increase in adhesion in all scenarios and analyses has reduced the amount of ground settlement. However, as illustrated in Fig. 5, as the internal friction angle of the soil increases, their corresponding points get closer to the center of the diagram, indicating a decrease in settlement at the surface as the soil adhesion increases. It can also be said that the trend of subsidence difference is more pronounced with the simultaneous change of soil adhesion and internal friction angle at lower depths, for values of \(H/D = 1\) and \(H/D = 2\) than in the other case.

Fig. 5
figure 5

Ground surface settlement curve for different Cohesion strength in three depths: a H/D = 1, b H/D = 2, c H/D = 3

In this section, the simultaneous influence of the geometrical parameters of the tunnel depth and diameter is simultaneously observed and investigated. The ground surface behavior due to tunneling at different depths is not the same. As shown in Table 3 and the diagrams in Fig. 6, the influence of tunneling depth on the ground surface settlement cannot be conclusively commented. This means that the ground surface settlement at different depths is completely dependent on the shear strength parameters of the soil.

Fig. 6
figure 6

Maximum ground surface settlement curve for three depths with different Cohesion strengths in three permeability coefficients: a \(k = 1e - 2\), b \(k = 1e - 5\), c \(k = 1e - 8\)

When the permeability coefficient is highest and the internal friction angle of the soil is at its lowest, the ground surface settlement is a function of depth that has increased markedly with increasing subsurface depth. However, with increasing soil shear strength parameters, the dependence of the ground surface settlement on the depth is seen to be less so that when these parameters reach their maximum, it can be said that the ground settlement is independent of the depth of drilling. However, by decreasing the permeability coefficient in the soil, in almost all cases, when the internal friction angle is at its maximum, the ground settlement is independent of depth and converges to a constant settlement value at all depths. Increasing the angle of internal friction at these depths seems to have little effect on the settlement. But when soil adhesion and soil permeability coefficients are at their lowest value, the amount of ground settlement obtained increases with the approaching ground surface and decreases with increasing depth.

4.3 Investigation of the effect of internal friction angle of soil

It was expected that as the angle of internal friction increased and the density of soil particles increased, the amount of ground surface settlement caused by tunneling would decrease. According to the results of Table 3 and Fig. 6, with increasing soil internal friction angle, in most cases this decreasing trend for surface settlement is obtained. However, as the amount of soil adhesion increased simultaneously, this trend decreased, and in the analyses with greater adhesion, the difference between the maximum and minimum of settlement values for the different values of the internal friction angle was less.

Therefore, in some cases for adhesions larger than 40 kPa, as the internal friction angle of the soil increased, the surface settlement increased or was constant. This trend has been observed for tunnels closer to ground level. In general, the decrease in ground settlement due to the increase in the internal friction angle of the soil is evident for all three values of soil permeability coefficient and for tunnels with greater depth. Figure 7 shows the maximum settlement value for all three depths and in the three permeability coefficients, and as the angle of internal friction increases, the downward trend of the ground settlement was observed.

Fig. 7
figure 7

Ground surface settlement curve for different cohesion strengths in three depths: a \(k = 1e - 8\), b \(k = 1e - 5\), c \(k = 1e - 2\)

5 Artificial neural networks

The application of artificial neural networks and their advantages in numerical modeling has been discussed by many researchers [38, 39]. As regards, the purpose of the study was to predict the maximum of ground surface settlement. The neural network was used because of the high capability of the prediction model. Also, the maximum variation affecting soil behavior during tunneling, which included geometrical and soil mechanical variables, was used as input variables in the model. In general, it can be said that the artificial neural network is a model of the human brain, except that its speed is much higher than that of the human brain. Neural networks first process data by analyzing the relationship between data, and after the learning process, with high power and speed. Artificial neural networks that are used for prediction usually include an input layer, an output layer, and one or more middle layer. The network used in this study is a powerful multilayer perceptron network that has high power in predicting the desired variable. Each layer consists of nodes or nodes, also known as neurons. In the input layer, each node represents an independent variable, and in the output layer of each node, it represents a variable or dependent variable. The master brain of any neural network is the middle layer of that network. In fact, all math operations are done on this layer or layers. That is, each neuron in the input layer connects to all the middle layer (hidden layer) neurons through a connector. Each neuron in the middle layer stores the weighted variables of the input layer and adds them together. Then, a predetermined function designed by the user and known as the activating function calculates the desired output variable by applying the sum of the neurons. During the design of the network, the mean square error estimation criterion was used to calculate the error generated in the network and evaluate its optimal performance in predicting the dependent variable. Another criterion is also called the root mean square error. This criterion more clearly represents the amount of errors. In fact, the lower this value and the closer to zero, the neural network predicts the value of the dependent variable more accurately. The relationship between mean square error and root mean square error is given in Eqs. 3 and 4 [40].

In recent years, outcome prediction models using artificial neural network and multivariable logistic regression analysis have been developed in many areas of engineering research. Both these methods have advantages and disadvantages. Artificial neural networks are algorithms that can be used to perform nonlinear statistical modeling and provide a new alternative to logistic regression, the most commonly used method for developing predictive models for engineering purposes. Neural networks offer a number of advantages, including requiring less formal statistical training, ability to implicitly detect complex nonlinear relationships between dependent and independent variables, ability to detect all possible interactions between predictor variables, and the availability of multiple training algorithms. Disadvantages include its “black box” nature, greater computational burden, proneness to overfitting, and the empirical nature of model development. An overview of the features of neural networks and logistic regression is presented, and the advantages and disadvantages of using this modeling technique are discussed [41, 42].

$${\text{MSE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {Y_{i} - \hat{Y}_{i} } \right)}^{2}$$
(3)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {Y_{i} - \hat{Y}_{i} } \right)}^{2} }$$
(4)

where n is the number of data, b is the value of calculating dependent variable and b is the value of dependent variable predicted by the network. When the measured error reaches its minimum value, the network will show better performance. The network error is calculated based on a set of algorithms that are created by the user when designing.

A common way to improve the network and reduce the error in the MLP network is to use a feedforward algorithm. In this way, the data series are divided into three categories: training data, evaluation data, and test data.

The value of the allowed error (minimum computational error) is also determined by the user. In the training data series, after the output of this data is obtained, this value is compared with the actual value and its difference is determined as a network error. Then, this error value is distributed over the weighted connections between the input and the middle layer to reduce the error and this process continues until the network reaches the minimum possible error. In Table 4 the input layer variables are shown as independent variables. Also, the output layer variable (ground surface settlement) is defined as the dependent variable. In this research, after data handling in ABAQUS, ground surface settlement prediction model using an MLP artificial neural network has been created, in which, according to Table 5, 70% of the data series are considered as training data, 15% for the evaluation data, and 15% for the testing data.

Table 4 Number of data and variables
Table 5 Input variables of ANN

The Levenberg–Marquardt algorithm is also used to train the network and reduce the computational error. The Levenberg–Marquardt algorithm is used to solve nonlinear least-squares problems. The schematic diagram of the MLP artificial neural network is illustrated in Fig. 8. Among the different ways for artificial neural network training in feedforward backpropagation methods, the Levenberg–Marquardt algorithm has been chosen for use in the present study because of its faster convergence in the training of medium-sized networks.

Fig. 8
figure 8

General scheme of multilayer perceptron artificial neural network

In this division, the parameter x1 represents the diameter of the tunnel, x2 represents the distance from the center to the center of the tunnels, x3 shows the depth of the center of the second tunnel from the ground, and Y represents the maximum settlement of the earth's surface. Figure 8 shows the general structure of the network designed in this study, which uses the post-diffusion algorithm.

The design network consists of an input layer, an output layer, and a hidden layer with 7 nodes or neurons. The w11 represents the weighted connection of the first input to the first layer of the hidden layer. The w11 also indicates the weighted connection of the node or node of the output layer to the first hidden node. The w73 connection indicates the weighted connection of the third node or the neuron of the input layer to the seventh node or the hidden layer of the neuron. The w17 connection also shows the connection of the seventh neuron or the node of the output layer to the neuron or the node of the output layer. The grid shown in Fig. 8 is an artificial multilayer artificial neural network that uses a forward-looking post-propagation algorithm to reduce its error rate. The network uses the Marquardt–Levenberg law for its training phase. Also, the error calculation method is the MSE method, which calculates the error of each step of this method and is the main criterion for continuing the network training process. The data selection method is random in all three stages of network formation, which include training, evaluation, and network testing stages.

In each neural network, the number of middle layers and its neurons is usually determined and calculated according to the user experience as well as the number of input layer variables. This network contains seven neurons for the middle layer. The middle layer activator function is used in accordance with relation 5, which is called the sigmoid function. A linear function is also used to transfer the output from the middle layer to the last layer.

$$F\left( n \right) = \frac{1}{{1 + e^{ - n} }}$$
(5)

The data and neurons of the multilayer perceptron neural network are presented in Table 5, which separately represents the number of data used in the different categories, the number of independent and dependent variables, and the number of neurons in the different layers.

Table 6 presents the statistical results of the artificial neural network, which shows the accuracy of the prediction of the dependent variable, which is ground surface settlement. The closer correlation coefficient (R) value to 1 indicates the linear relationship between the independent variables of the input layer and the dependent variable of the output layer.

Table 6 Statistical results of ANN

Figure 9 shows the regression diagrams obtained for the training data sets, evaluation data sets, test data sets, and finally all data. The concentration of regression data on the x = y line indicates the highest power and accuracy of the network in predicting real values.

Fig. 9
figure 9

Regression curve for different data in the MLP artificial neural network

Table 7, in addition to the statistical results obtained from this study, compares the accuracy of the multilayer perceptron neural network with other intelligent networks and regression methods, where MLP is multilayer perceptron, ICA is an imperialist competitive algorithm, ANN is a new hybrid model of artificial neural network, MLR is multiple linear regression, and ABC is artificial bee colony.

Table 7 Comparison of the performance of different intelligent networks

6 Conclusion

In this study, parametric finite element analysis in ABAQUS software is carried out for investigating the ground surface settlement of a circular cross section single tunnel, with changes in geometrical characteristics such as diameter and depth of the tunnel (depth-to-diameter ratio) and changes in soil mechanical properties such as adhesion, internal friction angle, and soil permeability performed. Then, the values obtained for the surface settlement in different modes as dependent variables (output data) and four variables of adhesion, internal friction angle, permeability coefficient, and the depth-to-diameter ratio of the tunnel as independent variables (input data) are used to produce multilayer perceptron artificial neural network in MATLAB software. Multilayer perceptron artificial neural network, using the Levenberg–Marquardt error correction algorithm and using the feedforward backpropagation model, has demonstrated its ability to predict the surface settlement.

  1. (1)

    The correlation and determination coefficients indicate the direct and linear relationship between the results obtained in the ABAQUS and the results of ANN. A comparison between this predictive model based on correlation coefficient (R) indices shows that the MLP-ANN with R = 0.99097 for training data and R = 0.98409 for all data is more accurate than the other intelligent network and regression models.

  2. (2)

    All boundaries in each direction must have boundary conditions applied to them. If no boundary conditions are assigned to the model (free boundary), natural conditions apply. That is, the prescribed forces are equal to zero and the boundaries move freely. The software automatically sets a standard boundary to avoid the situation of indeterminate displacements, so that by designating reference points, the software completely rigs the horizontal boundary and by creating rolling boundaries in a vertical direction, it allows the soil to move and settle.

  3. (3)

    The urban tunnel with no overburden is considered at ground surface, and the porosity ratio for the whole model is considered to be 0.58 with respect to its specific gravity and weight density of solid grains. The analysis is done in two steps. The first step includes entering the necessary information into the software, creating appropriate boundary conditions and applying pore water pressures and effective in situ stresses in the soil model, and then, performing drilling operations, in which, immediately after drilling, ground surface settlement values are measured

  4. (4)

    Surface settlement and crown of the tunnel are directly related to the simultaneous increase in the diameter of tunnels. This means that increasing the diameter of the tunnels will increase the surface settlement and crown of the tunnel.

  5. (5)

    When the tunnels are located closer to the ground, we see more settlements, both on the surface and in the crown of the tunnels; while increasing the depth, we saw a decrease in the settlement. This reduction in settlement continued until the depth-to-diameter ratio of the tunnel reached a certain value, and since then we have again seen an increase in the surface settlement and crown of the tunnels.