1 Introduction

Cost overruns and delays often encounter tunnel construction projects. Delays may negatively affect the scope of tunneling projects, which leads to severe cost overruns. Applying contingencies and estimating risks at the project level often do not capture the multiple uncertainties in the construction process of tunnel projects. An example of cost and duration underestimation in subsurface projects is the Channel Tunnel between England and France. The construction of the Channel Tunnel started in 1988, and the project took approximately 20% longer than planned (6 years instead of the scheduled 5) and 80% over the budget (costs increased £2 billion, from £2.6 billion to £4.6 billion). The Channel Tunnel is just one among many examples of subsurface projects completed with high cost and duration overruns. While these projects are very large, the phenomenon of cost overruns and delays in subsurface projects is widespread, as described by Flyvbjerg et al. [1]. Thus, there is a necessity for innovative methods and tools to eschew significant construction cost and duration overruns. In this paper, two artificial intelligence methods of Gaussian Process Regression (GPR) and Support Vector Regression (SVR) will be described with which cost and duration underestimation in road tunnel projects can be avoided. The models will be applied to a road tunnel in Iran.

Insurers have reported major losses and delays in tunnel construction projects. The necessity for analyzing the uncertainty and risks of tunnel construction has been recognized by the tunneling community [2,3,4,5]. At present, construction time and costs of engineering projects were commonly assessed on a deterministic basis. The deterministic tool, however, does not appropriately reflect the uncertain reality. Recently, many innovative studies have been developed to minimize uncertainties regarding construction time and costs [6, 7]. Flyvbjerg [8] proposed utilizing reference class forecasting for transportation construction projects. For example, this tool estimates the cost or duration performance of a project based on statistical analyses of the previously constructed projects rather than on the specifics of the project itself. Different risk factors causing project cost underestimation have been described in detail in the transportation construction literature [9,10,11,12].

A variety approaches and tools have been used to estimate the tunnel construction time and costs, such as Markov chains [7, 13], Monte Carlo simulation [8, 14], dynamic Bayesian networks [15], decision aids for tunneling (DAT) developed at MIT in collaboration with Ecole Polytechnique Fédérale de Lausanne—EPFL (Moret and Einstein, 2016), Bayesian analysis and artificial neural networks [16, 17], and analytical solution [18]. In the more recent researches, different machine learning techniques have been applied in different civil engineering problems [19], for example optimization of geotechnical problems [20], project control forecasting [21], construction management [22], construction injury prediction [23], slope collapse prediction [24], prediction of TBM operating parameters [25], tunnels’ convergence rate forecasting [26], landslide displacement [27], rockburst prediction [28], manufacturing cost estimation [29], uniaxial compressive strength prediction [30], ground surface settlement [31], determination of earth pressure [32], risk assessment and costs estimation [33, 34], rock fragmentation forecasting [35], and other engineering problems.

Although many studies have been conducted by researchers to solve various engineering problems by the machine learning tools, so far, these methods have not been provided to predict a tunnel path geology and time and costs required to its construction. Therefore, a novel approach has been implemented for this research which intends to use three machine learning tools widely used in solving complex engineering problems, Gaussian Process Regression (GPR), Support Vector Regression (SVR), and Decision Tree (DT) for prediction of geology conditions, construction time, and construction costs in road tunnels construction. The paper also aims to determine the relationship between the tunnel geology parameters with the time and costs required for construction, which in this paper, rock mass rating (RMR) parameter was considered. In order to train the GPR, SVR, and DT tools, datasets from the previously constructed road tunnels were gathered. Also, another interesting work in this paper is to update the previously predicted results by applying the more datasets obtained during the tunnel construction in the constructed parts. To verify the feasibility of these tools, as well as the other machine learning tools, they were applied to the Garan tunnel on Sanandaj–Marivan road in Iran. Next, to assess the performances of the applied machine learning tools, fivefold cross-validation was adopted using MATLAB 2018 tool through the regression learner app, and the accuracy of the GPR, SVR, and DT predictions was investigated through different statistical evaluation criteria. Also, in each step of the predictions, to predict the tunnel geology, construction time, and construction costs along the Garan tunnel route, the obtained models of the prediction tools through the regression learner app were exported. By applying the datasets of the Garan tunnel route as the test data in the exported models, all the parameters were predicted along the tunnel route. Finally, the accuracy of the predictions made by each tool, as well as the impact of the updates on the predicted results, was investigated.

The rest of this paper is organized as follows: In the next section, the work methodology is described step by step. In Sect. 3, the application of the methodology in a case study is briefly outlined. Results analysis and validations are presented in Sect. 4. Discussion and conclusions are described in Sects. 5 and 6, respectively.

2 Methodology

2.1 Geology parameter selection

In the first phase of the work, a geology parameter should be considered that can describe the tunnel’s ground conditions and affect tunnel construction time and costs. Rock mass classification systems are a crucial part of underground projects and can well describe tunnel geology conditions. RMR parameter is one of the most commonly applied classification systems in numerous civil and mining projects [36]. Therefore, in this paper, it is considered as an effective parameter on the tunnel geology and the tunnel construction time and costs.

RMR system is a geomechanical classification system for rocks, developed by Bieniawski [37]. It combines the most significant geologic parameters of influence and represents them with one overall comprehensive index of rock mass quality, which is used for the design and construction of excavations in rock, such as tunnels, mines, slopes, and foundations. Over the years, the RMR system has been successively refined as more case records have been examined, and the reader should be aware that Bieniawski has made significant changes in the rating assigned to different parameters. The discussion which follows is based upon the 1989 version of the classification [38]. The following six parameters are estimating the strength of rock mass using the RMR system:

  • Uniaxial compressive strength of rock material

  • Rock quality designation (RQD)

  • Spacing of discontinuities

  • Condition of discontinuities

  • Groundwater conditions

  • Orientation of discontinuities

Each of the six parameters is assigned a value corresponding to the characteristics of the rock. These values are derived from field surveys and laboratory tests. The sum of the six parameters is the RMR value, which lies between 0 and 100. The classification of the RMR system is provided in Table 1.

Table 1 The classification for the RMR system [38]

2.2 Machine learning tools of GPR, SVR, and DT

2.2.1 GPR

GPR models as nonparametric kernel-based probabilistic models can be trained using a training set \(\left\{ {\left( {x_{i} ,y_{i} } \right);\;\;i = 1,2, \ldots ,n} \right\}\), \(x_{i} \in R^{d}\), \(y_{i} \in R\), drawn from an unknown distribution. A GPR model estimates the value of a response variable \(y_{\text{new}}\), given the new input vector \(x_{\text{new}}\), and the training data. A linear regressor can be modeled as \(y = x^{\text{T}} \beta + \varepsilon\), where \(\varepsilon \sim N\left( {0,\sigma^{2} } \right)\). The error variance \(\sigma^{2}\) and the coefficients \(\beta\) are estimated from the data. A GPR model describes the response by introducing latent variables, \(f\left( {x_{i} } \right),\;\;i = 1,2, \ldots ,n\), from a Gaussian process (GP), and explicit basis functions, \(h\). The covariance function of the latent variables captures the smoothness of the response, and basis functions map the inputs \(x\) into a \(p\)-dimensional feature space [39,40,41].

A GP is a set of random variables with a joint Gaussian distribution for any finite number of them. If \(\left\{ {f\left( x \right),x \in R^{d} } \right\}\) is a GP, then given \(n\) observations of \(x_{1} ,x_{2} , \ldots ,x_{n}\), the joint distribution of the random variables \(f\left( {x_{1} } \right),f\left( {x_{2} } \right), \ldots ,f\left( {x_{n} } \right)\) is Gaussian. A GP is given by its mean function \(m\left( x \right)\) and kernel function, \(k\left( {x,x'} \right)\). That is, if \(\left\{ {f\left( x \right),x \in R^{d} } \right\}\) is a GP, then:

$$E\left( {f\left( x \right)} \right) = m\left( x \right)$$
(1)
$${\text{Cov}}\left[ {f\left( x \right),f\left( {x'} \right)} \right] = E\left[ {\left\{ {f\left( x \right) - m\left( x \right)} \right\}\left\{ {f\left( {x'} \right) - m\left( {x'} \right)} \right\}} \right] = k\left( {x,x'} \right)$$
(2)

Let us consider the model of \(h\left( x \right)^{\text{T}} \beta + f\left( x \right)\), where \(f\left( x \right)\sim{\text{GP}}\left( {0,k\left( {x,x'} \right)} \right)\), that is, \(f\left( x \right)\) are from a zero-mean GP with covariance function, \(k\left( {x,x'} \right)\). \(h\left( x \right)\) are a set of basis functions that transform the original feature vector \(x\) in \(R^{d}\) into a new feature vector \(h\left( x \right)\) in \(R^{p}\). \(\beta\) is a \(p\)-by-1 vector of basis function coefficients. This model indicates a GPR model. An instance of response \(y\) can be modeled as [39,40,41]:

$$P(y_{i} |f\left( {x_{i} } \right),x_{i} ) \sim\,N(y_{i} |h\left( {x_{i} } \right)T\beta + f\left( {x_{i} } \right),\sigma^{2} )$$
(3)

Thus, a GPR model is a probabilistic model. There is a latent variable \(f\left( {x_{i} } \right)\) introduced for each observation \(x_{i}\), which makes the GPR model nonparametric. In vector form, this model is equivalent to:

$$P(y|f,X)\sim\,N(y|H\beta + f,\sigma^{2} I)$$
(4)

where \(X = \left( {\begin{array}{*{20}c} {x_{1}^{\text{T}} } \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {x_{2}^{\text{T}} } \\ \vdots \\ \end{array} } \\ {x_{n}^{\text{T}} } \\ \end{array} } \\ \end{array} } \right)\), \(y = \left( {\begin{array}{*{20}c} {y_{1} } \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {y_{2} } \\ \vdots \\ \end{array} } \\ {y_{n} } \\ \end{array} } \\ \end{array} } \right)\), \(H = \left( {\begin{array}{*{20}c} {h(x_{1}^{\text{T}} )} \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {h\left( {x_{2}^{\text{T}} } \right)} \\ \vdots \\ \end{array} } \\ {h\left( {x_{n}^{\text{T}} } \right)} \\ \end{array} } \\ \end{array} } \right)\), \(f = \left( {\begin{array}{*{20}c} {f(x_{1} )} \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {f(x_{2} )} \\ \vdots \\ \end{array} } \\ {f(x_{n} )} \\ \end{array} } \\ \end{array} } \right)\).

The joint distribution of latent variables \(f\left( {x_{1} } \right),f\left( {x_{2} } \right), \ldots ,f\left( {x_{n} } \right)\) in the GPR model is as follows:

$$P(f|X)\sim\,N(f|0,K\left( {X,X} \right))$$
(5)

close to a linear regression model, where \(K\left( {X,X} \right)\) is given as follows:

$$K\left( {X,X} \right) = \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\begin{array}{*{20}c} {k\left( {x_{1} ,x_{1} } \right)} \\ {k\left( {x_{2} ,x_{1} } \right)} \\ \end{array} } & {\begin{array}{*{20}c} {k\left( {x_{1} ,x_{2} } \right)} \\ {k\left( {x_{2} ,x_{2} } \right)} \\ \end{array} } \\ \end{array} } & {\begin{array}{*{20}c} \cdots \\ \cdots \\ \end{array} } & {\begin{array}{*{20}c} {k\left( {x_{1} ,x_{n} } \right)} \\ {k\left( {x_{2} ,x_{n} } \right)} \\ \end{array} } \\ {\begin{array}{*{20}c} { \vdots } & { \vdots } \\ \end{array} } & \vdots & \vdots \\ {\begin{array}{*{20}c} {k\left( {x_{n} ,x_{1} } \right)} & {k\left( {x_{n} ,x_{2} } \right)} \\ \end{array} } & \cdots & {k\left( {x_{n} ,x_{n} } \right)} \\ \end{array} } \right]$$

The covariance function \(k\left( {x,x'} \right)\) is usually parameterized by a set of kernel parameters or hyperparameters, \(\theta\). Often, \(k\left( {x,x'} \right)\) is represented as \(k(x,x'|\theta )\) to explicitly show the dependence on \(\theta\) [39,40,41].

2.2.2 SVR

In SVR, \(\left\{ {x_{i} ,y_{i} } \right\}_{i = 1}^{l}\) is considered as a training set, in which \(x_{i} \in R^{p}\) represents p-dimensional input vector and \(y_{i} \in R\) is a scalar measured output that indicates the system output. The goal is to construct a function \(y = f\left( x \right)\) that shows the dependency of the output \(y_{i}\) on the input \(x_{i}\) [42]. The function is expressed as:

$$y = w.\emptyset \left( x \right) + b$$
(6)

where w is the weight vector, and b is the bias.

The following convex optimization problem (Eq. 7) [42, 43] can be used to express the regression problem. Figure 1 presents the concept of nonlinear SVR, corresponding to Eq. 7.

Fig. 1
figure 1

Nonlinear SVR with Vapnik’s ɛ-insensitive loss function [43]

Minimize (w, b, \(\xi_{i} ,\xi_{i}^{*}\)):

$$\frac{1}{2}w^{2} + C\mathop \sum \limits_{i = 1}^{l} \left( {\xi_{i} + \xi_{i}^{*} } \right)$$
(7)

Subject to:

$$y_{i} - w.\emptyset \left( x \right) + b) \le \varepsilon + \xi_{i}^{*}$$
$$\left( {w.\emptyset \left( x \right) + b} \right) - y_{i} \le \varepsilon + \xi_{i}^{*}$$
$$\xi_{i} ,\xi_{i}^{*} \ge 1, \ldots ,l$$

where \(\xi_{i}\) and \(\xi_{i}^{*}\) are slack variables that specify upper and lower training errors that are subject to error tolerance, and \(C\) is a positive constant that determines the degree of penalized loss when an error occurs.

The dual form of the nonlinear SVR can be described as [43]:

Minimize (\(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} , \bar{\alpha }_{i}\)):

$$\frac{1}{2}\mathop \sum \limits_{i,j = 1}^{l} \left( {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} - \bar{\alpha }_{i} } \right)\left( {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{j} - \bar{\alpha }_{j} } \right)\emptyset \left( {x_{i} } \right) .\emptyset \left( {x_{j} } \right) + \varepsilon \mathop \sum \limits_{i = 1}^{l} \left( {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} + \bar{\alpha }_{i} } \right) - \mathop \sum \limits_{i = 1}^{l} y_{i} \left( {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} - \bar{\alpha }_{i} } \right)$$
(8)

Subject to:

$$\mathop \sum \limits_{i = 1}^{l} y_{i} \left( {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} - \bar{\alpha }_{i} } \right) = 0$$
$$0 \le \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{i} \le C, \quad i = 1,2, \ldots ,l$$
$$0 \le \bar{\alpha }_{i} \le C,\quad i = 1,2, \ldots ,l$$

Selection of a suitable nonlinear function \(\emptyset \left( {x_{i} } \right)\) and the computation of \(\emptyset \left( {x_{i} } \right) .\emptyset \left( {x_{j} } \right)\) in the feature space could be a difficult task. The input space computation can be performed using a kernel function \(K\left( {x_{i} ,x_{j} } \right) = \emptyset \left( {x_{i} } \right) .\emptyset \left( {x_{j} } \right)\) to yield the inner products in feature space, circumventing the problems intrinsic in determining the feature space. Functions that meet the condition of Mercer in feature space can be proven to match dot products. Any functions that fulfill the theorem of Mercer can, therefore, be used as a kernel. The following are some commonly used kernels in SVM [43]:

$$K\left( {x_{i} ,x_{j} } \right) = x_{i} . x_{j} \quad \quad {\text{Linear}}\;{\text{kernel}}$$
(9)
$$K\left( {x_{i} ,x_{j} } \right) = \left[ {\gamma \left( {x_{i} ,x_{j} } \right) + c} \right]^{d} \quad \quad {\text{Polynomial}}\;{\text{kernel}}$$
(10)
$$K\left( {x_{i} ,x_{j} } \right) = \tanh \left[ {\gamma \left( {x_{i} ,x_{j} } \right) + c} \right]\quad \quad {\text{Sigmoid}}\;{\text{kernel}}$$
(11)
$$K\left( {x_{i} ,x_{j} } \right) = \exp \left( { - \gamma \left| {x_{i} - x_{j} } \right|^{2} } \right)\quad \quad {\text{Radial}}\;{\text{basis}}\;{\text{function}}\;{\text{kernel }}$$
(12)

Finally, the feature of the kernel allows the nonlinear SVR decision function to be expressed as Eq. 13:

$$f(x_{i} ) = \mathop \sum \limits_{i = 1}^{l} ( - \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\alpha }_{k} + \bar{\alpha }_{k} )K\left( {x_{i} ,x_{k} } \right) + b$$
(13)

2.2.3 DT

Decision Tree (DT) as a nonparametric data mining technique is an efficient statistical tool to develop prediction algorithms for a response variable. DT comes with several advantages; for example, no assumption is needed about the distribution of explanatory variables, it is not affected by high correlations among independent variables, it can be applied to many types of dependent variables including categorical, numerical, and survival data, the most important variables explaining the dependent variable are included in decision trees, and the insignificant variables are excluded. Although DT has initially been proposed for large datasets, it also provides accurate estimates for small datasets.

Classification and Regression Tree (CART) is a specific type of DT algorithms. It covers both classification and regression trees. In the regression tree, the predicted outcome is considered a real number. In this case, variance reduction is used for splitting the nodes.

The steps for the algorithm are as follows:

  1. 1.

    First, the variance of the target is calculated

  2. 2.

    The datasets are divided into different attributes, and the variance of each branch is subtracted from the variance before the split. This is known as variance reduction.

    The variance reduction of a node \(N\) is defined as:

    $$I_{\text{V}} \left( N \right) = \frac{1}{{\left| S \right|^{2} }}\mathop \sum \limits_{i \in S} \mathop \sum \limits_{j \in S} \frac{1}{2}\left( {x_{i} - x_{j} } \right)^{2} - \left( {\frac{1}{{\left| {S_{\text{t}} } \right|^{2} }}\mathop \sum \limits_{{i \in S_{\text{t}} }} \mathop \sum \limits_{{j \in S_{\text{t}} }} \frac{1}{2}\left( {x_{i} - x_{j} } \right)^{2} + \frac{1}{{\left| {S_{\text{f}} } \right|^{2} }}\mathop \sum \limits_{{i \in S_{\text{f}} }} \mathop \sum \limits_{{j \in S_{\text{f}} }} \frac{1}{2}\left( {x_{i} - x_{j} } \right)^{2} } \right)$$
    (14)

    where \(S\), \(S_{\text{t}}\), and \(S_{\text{f}}\) are the set of presplit sample indices, set of sample indices for which the split test is true, and set of sample indices for which the split test is false, respectively. Each of the above summands is indeed variance estimates, though, written in a form without directly referring to the mean.

    The attribute with the highest VR will be the decision node.

  3. 3.

    The datasets are divided based on the values of selected attributes. If the variance of a branch is more than zero, it divided again.

  4. 4.

    Continue the process until all the data are processed.

2.3 Automatic resource forecasting model

Initial observations are crucial for training the machine learning tools, and depending on the type of projects and problems can be different. More observations for the training result in more precise results. Some of the most important observations in the road tunnels before construction are the inlet and outlet portals and location of boreholes, and perhaps these observations are often the only observations available for training at this stage.

In this work, a geological or geotechnical parameter (for example, parameter \(X\)) in the tunnel path is considered first, so that there are preliminary observations about it in the tunnel path. These observations are then used to train the GPR, SVR, and DT tools. After the training process, parameter \(X\) state can be forecasted along the entire path of the tunnel. In the next stage, in order to predict the construction time and costs of the tunnel, the previously constructed road tunnels are applied to train the GPR, SVR, and DT tools. In this way, by studying the previously constructed road tunnels, the different time and costs per each meter of construction based on the different states of the parameter \(X\) are applied to train the prediction tools. After training, according to the forecasted states of the parameter \(X\) along the tunnel path using the GPR, SVR, and DT tools, the construction time and costs of the tunnel can be forecasted. It can be noted that the selected previously constructed tunnels should be similar to the tunnel under investigation in terms of the maintenance system, drilling method, tunnel cross section, which affects the tunnel construction time and costs.

Since the initial data are either low or not accurate before the construction stage of the tunnel, these can affect the predictions made on the ground conditions and the time and costs of the tunnel construction. To cope this problem, it is possible to update the previous prediction results during tunnel construction by using the actual data on parameter \(X\), construction time, and construction costs obtained in the constructed parts as the new train datasets. Therefore, in order to update the previous prediction results during tunnel construction, in addition to the initial datasets, the field-observed datasets also can be utilized to train the GPR, SVR, and DT tools. In Fig. 2, all the prediction stages of this article are presented in a schematically way.

Fig. 2
figure 2

The prediction stages of parameter X and the time and costs of tunnel construction by using the GPR, SVR, and DT tools

In the next section, an engineering application of the GPR, SVR, and DT tools is presented on Garan road tunnel to predict the RMR parameter state along the tunnel route and the construction time and costs of the tunnel before and during construction.

3 Engineering application

3.1 Engineering background

The new road among Sanandaj and Marivan cities is an under-development venture in the northwest of Iran. In light of the section through the Zagros Mountains, a few passages extend on this course, including the Garan, Hamru, Baghan, and Gezerdareh, which have been actualized or are under development. The Sanandaj–Marivan old road is a standout among the most hazardous roads in Iran, where numerous mishaps happen each year, prompting money-related misfortunes and passing of individuals. The length of the old Sanandaj–Marivan road is 126 km. On the new road, the length will be decreased to 105 km, which can lessen the traveling time and mishaps.

The present examination is directed on Garan tunnel with a length of 1900 m and a cross-segment region of 97 m2 as a major aspect of the under-development Sanandaj–Marivan road. The channel of Garan burrow is viewed as the eastern mouth (to the Sanandaj-S), and its outlet is viewed as the western (to the Marivan-M). The area of Garan tunnel is shown in Fig. 3.

Fig. 3
figure 3

Project location of Garan tunnel

The inlet and outlet portals of Garan tunnel are situated on the SM39 + 490 and SM41 + 390 (km + m), individually. The topographical profile of the Garan tunnel with the situation of the four boreholes is shown in Fig. 4a. As per the accessible information, a sum of four kinds of designing geography in the passage course is recognizable and isolated from one another: limestone (Li), shale (Sh), sand shales with shale–limestone arrangement (ShL), and limestone–shale succession (LSh).

Fig. 4
figure 4

a Geological map of Garan tunnel, b Garan tunnel cross section associated with its projected support system

Garan tunnel was excavated using the top heading and benching method. The support system used in Garan tunnel construction is as follows:

  • IPE 180: spacing 0.75–1.5 m

  • Rock bolts: fully grouted, φ25 mm, L: 4–6 m

  • Shotcrete: 22 cm reinforced by two-layer mesh φ6@100 × 100 mm.

In Fig. 4b, Garan tunnel cross section associated with projected/applied support system is presented.

3.2 Sample selection and model establishment

In this article, fivefold cross-validation is adopted at each stage of the predictions using the MATLAB tool, 2018, through the regression learner app. In this application, there are several different model types of GPR, SVR, and DT tools. At the forecasting stage, all the model types are used individually, and the results of the model with the highest accuracy were considered. In the MATLAB application, the GPR tool consists of four model types named rational quadratic, squared exponential, Matern 5/2, and exponential. The SVR tool consists of six model types called linear, quadratic, cubic, fine Gaussian, medium Gaussian, and coarse Gaussian. Also, the DT tool consists of three models of a fine tree, medium tree, and coarse tree.

Fitting a GPR model involves estimating the following model parameters from the data:

  • Covariance function \(k\left( {x_{i} ,x_{j} \left| \theta \right.} \right)\) parameterized in terms of kernel parameters in vector \(\theta\).

  • Noise variance (\(\sigma^{2}\)).

  • Coefficient vector of fixed basis functions (\(\beta\)).

The value of the ‘KernelParameters’ name–value pair argument is a vector that consists of initial values for the signal standard deviation (\(\sigma_{f}\)) and the characteristic length scales (\(\sigma_{l}\)). The ‘fitrgp’ function uses these values to determine the kernel parameters. Similarly, the ‘Sigma’ name–value pair argument contains the initial value for the noise standard deviation \(\sigma\). During optimization, ‘fitrgp’ creates a vector of unconstrained initial parameter values \(\eta_{0}\) by using the noise standard deviation and the kernel parameters. ‘fitrgp’ analytically determines the explicit basis coefficient \(\beta\), specified by the ‘Beta’ name–value pair argument, from estimated values of \(\theta\) and \(\sigma^{2}\). Therefore, \(\beta\) does not appear in the \(\eta_{0}\) vector when ‘fitrgp’ initializes numerical optimization.

For SVR, the regularization and portion width parameters were tuned utilizing programmed hyper-parameter improvement technique given in MATLAB with the ‘fitrsvm’ work. ‘fitrsvm’ trains a Support Vector Machine (SVM) regression model on a low- through moderate-dimensional predictor dataset. It supports mapping the predictor data using kernel functions and supports sequential minimal optimization (SMO), iterative single data algorithm (ISDA), or L1 soft-margin minimization via quadratic programming (L1QP) for objective function minimization.

DT has a plethora of hyperparameters that require fine-tuning in order to derive the best possible model that reduces the generalization error as much as possible. Usually, the tree complexity is measured by one of the following metrics: the total number of nodes, the total number of leaves, tree depth, and a number of attributes used. max_depth, min_samples_split, and min_samples_leaf are all stopping criteria, whereas min_weight_fraction_leaf and min_impurity_decrease are pruning methods.

In each prediction step, to assess the performances of the results, seven statistical evaluation criteria are used. These criteria are the coefficient of determination (R2), mean absolute error (MAE), mean square error (MSE), root mean square error (RMSE), relative RMSE (RRMSE), mean absolute percentage error (MAPE), and mean relative error (MRE), respectively, given in Eqs. 1521. All of these criteria were applied to validate the prediction results obtained by the intelligence approached in the previous researches [44,45,46,47,48,49]:

$$R^{2} = \left( {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {f\left( {(x_{i} } \right) - \bar{f}\left( x \right)} \right)\left( {f^{*} \left( {x_{i} } \right) - \bar{f}^{*} \left( x \right)} \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {f(x_{i} ) - \bar{f}\left( x \right)} \right)^{2} \mathop \sum \nolimits_{i = 1}^{n} \left( {f^{*} \left( {x_{i} } \right) - \bar{f}^{*} \left( x \right)} \right)^{2} } }}} \right)^{2}$$
(15)
$${\text{MAE}} = \left( {\frac{1}{n}} \right)\mathop \sum \limits_{i = 1}^{n} \left| {f(x_{i} ) - f^{*} \left( {x_{i} } \right)} \right|$$
(16)
$${\text{MSE}} = \frac{1}{n}\mathop \sum \limits_{1}^{n} \left( {f(x_{i} ) - f^{*} \left( {x_{i} } \right)} \right)^{2}$$
(17)
$${\text{RMSE}} = \sqrt {\left( {\frac{1}{n}} \right)\mathop \sum \limits_{i = 1}^{n} \left( {f(x_{i} ) - f^{*} \left( {x_{i} } \right)} \right)^{2} }$$
(18)
$${\text{RRMSE}} = \sqrt {\left( {\frac{1}{n}} \right)\mathop \sum \limits_{i = 1}^{n} \left( {\frac{{f(x_{i} ) - f^{*} \left( {x_{i} } \right)}}{{f(x_{i} )}}} \right)^{2} }$$
(19)
$${\text{MAPE}} = \frac{100\% }{n}\mathop \sum \limits_{i = 1}^{n} \left| {\frac{{f(x_{i} ) - f^{*} \left( {x_{i} } \right)}}{{f(x_{i} )}}} \right|$$
(20)
$${\text{MRE}} = \left( {\frac{1}{n}} \right)\mathop \sum \limits_{i = 1}^{n} \frac{{\left| {f(x_{i} ) - f^{*} \left( {x_{i} } \right)} \right|}}{{\left| {f(x_{i} )} \right|}}$$
(21)

where \(f\left( x \right)\) is the actual value and \(f^{*} \left( x \right)\) is the predicted value, \(\bar{f}\left( x \right)\) are the means of actual and predicted values, and \(n\) is the number of datasets.

Because the discontinuities, water conditions, and uniaxial compressive strength of intact rock are very effective in determining the ground conditions of the tunnel route, and since the geomechanical parameter of RMR considers all of the aforementioned properties, the RMR parameter is used to define the geology conditions of the tunnel route. It should be noted that in this study, the RMR values are expressed as an interval, but in the GPR, SVR, and DT tools, the mean of these two numbers is used. For example, if the RMR parameter is 5–10, the value in the tools is considered equal to 7.5.

4 Results analysis and comparison

In this paper, predictions are made in two steps of pre-updating and post-updating. In the following subsections, the results of each step will be discussed in more detail.

4.1 Pre-updating results

Before starting a tunneling project, a geological map is prepared by the engineers (Fig. 4a). This map is obtained from preliminary investigations of the study area through different surveys of geotechnical, geophysical, etc.

In this map, the status of important geological parameters that influence the tunnel construction is specified in the tunnel path. One of these parameters is the RMR parameter. In this paper, these data are used as the preliminary training data of the forecasting models. Note that these data are not very accurate due to the unknown subsurface conditions before the tunnel construction, but at the first stage of forecasts, there is no alternative. The initial training data of the RMR parameter available for the Garan tunnel route are presented in Table 2.

Table 2 The pre-construction training datasets of the RMR along Garan tunnel rout

For this step of predictions (pre-updating prediction step for RMR parameter), the initial values for the hyperparameters of the GPR, SVR, and DT tools are presented in Table 3. The validation criteria results of fivefold cross-validation are presented in Table 4. According to the results, all the machine learning tools of GPR, SVR, and DT have presented a good accuracy in the predictions. All the predictions are close together. However, the GPR provides more accurate results compared to the SVR and DT models.

Table 3 Pre-updating initial values of the GPR, SVR, and DT parameters for the RMR parameter prediction
Table 4 The pre-updating results for RMR parameter

To obtain the RMR status along the Garan tunnel route, after the fivefold cross-validation, the obtained models of the GPR, SVR, and DT tools were exported, and the datasets available along Garan tunnel route were applied as the test datasets in the exported models. Finally, the RMR parameter status was predicted by the GPR, SVR, and DT tools, as shown in Fig. 5. Also, to validate the prediction results accuracy of the GPR, SVR, and DT tools along the Garan tunnel route, the actual RMR status of the tunnel route is shown in Fig. 5. Looking at Fig. 5, it can be seen that all the methods have presented close results together. Also, a comparison of the prediction results with the actual model of the Garan tunnel route shows the high accuracy of the GPR, SVR, and DT tools in predicting the RMR parameter.

Fig. 5
figure 5

The pre-updating predicted results of GPR, SVR, and DT tools for RMR parameter along Garan tunnel route

The prediction errors of the GPR, SVR, and DT tools in predicting the RMR parameter along the Garan tunnel route are shown in Fig. 6. The mean error values for the GPR, SVR, and DT tools are less than 2.2518, 2.9303, and 3.2037, respectively. In Fig. 7, the R2 values of the regression results by the GPR, SVR, and DT tools versus the field-observed RMR parameter are shown. The R2 values for the GPR, SVR, and DT tools are evaluated by 0.9347, 0.239, and 0.8541, respectively.

Fig. 6
figure 6

The pre-updating results of prediction errors of GPR, SVR, and DT tools for RMR parameter along Garan tunnel route

Fig. 7
figure 7

The pre-updating results of prediction performance of GPR, SVR, and DT tools on the RMR parameter

Through comparisons made between the prediction results of the RMR parameter along the Garan tunnel route with the actual mode, Table 5 was obtained. In Table 5, the results of the different validation criteria for this step of prediction are presented. From all of these results obtained for the Garan tunnel route, it can be concluded that the GPR, SVR, and DT results are close together and to the actual mode, but still, GPR performs better prediction results than SVR, and SVR results are more accurate than DT.

Table 5 The pre-updating prediction results for the RMR parameter along the Garan tunnel route

In the next step, to predict the time and cost of constructing the tunnel, several input data presented in Table 6 were used as the train datasets of the GPR, SVR, and DT. These data were collected from the previously constructed road tunnels, which were similar to the Garan tunnel in terms of drilling, support system, area, and cross-sectional shape of the tunnel. The data of Table 6 explain the relationship between the RMR parameter and the time and costs of tunnel construction. For this step of prediction, the initial values used for the parameters of the GPR, SVR, and DT are presented in Table 7.

Table 6 Training datasets of GPR, SVR, and DT gathered from the previously constructed tunnels
Table 7 Pre-updating values of the GPR, SVR, and DT parameters for predicting the construction time and costs

After applying the datasets presented in Table 6 as the training datasets of the GPR, SVR, and DT tools, the different evaluation statistical criteria resulting from the fivefold cross-validation were obtained as shown in Table 8. Looking at Table 8, still, like the previous prediction step (for RMR parameter), the GPR results have more accuracy than the SVR and DT results. Also, SVR provides higher accuracy compared to the DT.

Table 8 The pre-updating prediction results of construction time and construction costs parameters

Then, the fivefold cross-validation models obtained by the regression learner app of the MATLAB 2018 tool were exported and the construction time and construction costs of Garan tunnel route were applied in the exported models as the test datasets. Finally, according to Figs. 8 and 9, respectively, the construction time and construction costs of the Garan tunnel were predicted in the whole tunnel route. Also, to validate the prediction results along the Garan tunnel route, the actual construction time and costs are shown in Figs. 8 and 9, respectively. Considering Figs. 8 and 9, it can be concluded that all the GPR, SVR, and DT tools have presented good results, and they all are close together.

Fig. 8
figure 8

The pre-updating prediction results of construction time obtained by GPR, SVR and DT along Garan tunnel route

Fig. 9
figure 9

The pre-updating prediction results of construction costs obtained by GPR, SVR, and DT along the Garan tunnel route

The prediction errors of the GPR, SVR, and DT for the two parameters of construction time and construction costs along the Garan road tunnel are shown in Figs. 10 and 11, respectively. For the construction time, the mean error values of the GPR, SVR, and DT are less than 0.0143 days, 0.0170 days, and 0.0175 days, respectively. These values for the construction costs are less than US$20.6073, US$23.3594, and US$30.4157, respectively. In Figs. 12 and 13, the R2 values of the regression results by the GPR, SVR, and DT tools versus the field-observed construction time and costs along the tunnel route are shown. For the construction time (Fig. 12), the R2 values of the GPR, SVR, and DT are equal to 0.9645, 0.9424, and 0.9369, respectively. Also, for the construction costs (Fig. 13), the R2 values of the GPR, SVR, and DT are equal to 0.9412, 0.9362, and 0.9141, respectively.

Fig. 10
figure 10

The pre-updating prediction errors of construction time obtained by GPR, SVR and DT along Garan tunnel route

Fig. 11
figure 11

The pre-updating prediction errors of construction costs obtained by GPR, SVR and DT along garan tunnel route

Fig. 12
figure 12

The pre-updating prediction performance of GPR, SVR, and DT tools on construction time

Fig. 13
figure 13

The pre-updating prediction performance of GPR, SVR, and DT tools on construction costs

Through comparison of the construction time and cost results predicted by the GPR, SVR, and DT tools along the Garan tunnel route with the actual mode, the statistical evaluation criteria applied in this article were estimated (Table 9). From all of these results, it can be seen that the GPR, SVR, and DT tools have presented good results. Still, the GPR results are better than the SVR and DT. Also, SVR has presented more accurate results than DT.

Table 9 The pre-updating prediction results of construction time and costs parameters along the Garan tunnel route

Until now, the status of the RMR, construction time, and construction costs were predicted by the GPR, SVR, and DT tools before Garan tunnel construction (pre-updating step). All the results predicted by the GPR, SVR, and DT tools were compared together through the different statistical evaluation criteria in the two modes of fivefold cross-validation and along Garan tunnel route. Comparing the predicted results with the field-observed mode, it was concluded that the GPR tool provides more accurate results than the SVR and DT. Also, the SVR performs better results than DT. All of these predictions were related to the construction of the Garan tunnel, in which datasets were gathered from the pre-existing tunnel and the previously constructed road tunnels. In the next step, in addition to the initial datasets, new input datasets obtained during the tunnel construction are used to train the GPR, SVR, and DT tools, and the previously predicted results will be updated. Also, the impact of these updates on the predicted results will be investigated.

4.2 Post-updating results

To update the previously forecasted results, about 200 m from both inlet and outlet portals of the tunnel was assumed as the constructed parts, and the data obtained in these locations were used as the new train datasets. Also, for the post-updating stage, the initial values used for the parameters of the GPR, SVR, and DT tools for each of the three parameters of RMR, construction time, and construction costs are presented in Table 10.

Table 10 The post-updating values of the GPR, SVR, and DT parameters for the prediction of RMR, construction time, and construction costs

To update the previously predicted results of the RMR parameter, in addition to the pre-updating training datasets (Table 2), the more training datasets were obtained from the constructed parts of the tunnel and the number of the dataset was raised. Therefore, machine learning tools are applicable to the broader range [50]. The post-updating results of the RMR parameter are presented in Table 11 through the statistical evaluation criteria. For this prediction step, the GPR results still have more accuracy than the SVR and DT. Also, SVR still provides better results than DT.

Table 11 The post-updating results of the RMR parameter

After applying the actual RMR parameter of the Garan tunnel route in the fivefold cross-validation exported models, the RMR parameter was predicted along the whole tunnel route by all of the GPR, SVR, and DT tools (Fig. 14). According to Fig. 14, all the results predicted by the GPR, SVR, and DT tools are very good and close together.

Fig. 14
figure 14

The post-updating prediction results of GPR, SVR, and DT for the RMR parameter along the Garan tunnel route

In Fig. 15, the error values of the GPR, SVR, and DT tools in predicting the RMR parameter are shown along the Garan tunnel route for the post-updating step. The mean error values for the GPR, SVR, and DT tools are less than 0.6049, 1.1269, and 2.0827, respectively. In comparison with the pre-updating results, the average values of errors in predicting the status of the RMR parameter are reduced by 1.647, 1.8034, and 1.121, for GPR, SVR, and DT, respectively. Also, according to Fig. 16, in predicting the status of the RMR parameter, the R2 values for the GPR, SVR, and DT increased by 0.059, 0.0582, and 0.0883, respectively.

Fig. 15
figure 15

The post-updating prediction error results of GPR, SVR, and DT tools for RMR parameter along Garan tunnel route

Fig. 16
figure 16

The updated prediction performance of GPR, SVR, and DT on the RMR parameter

Comparing the updated predictions of the RMR parameter along the Garan tunnel route with the actual RMR, the different statistical validation criteria are evaluated as shown in Table 12. From all of these results, it can be concluded that the GPR, SVR, and DT results are close together and to the actual mode, but GPR performs better than SVR, and SVR performs better than DT. Also, there is a significant upgrade potential while constructing the tunnel in predicting the RMR parameter.

Table 12 The post-updating prediction results of the RMR parameter along the Garan tunnel route

The actual datasets of the RMR, construction time, and construction costs parameters that were obtained during Garan tunnel construction (200 m from both inlet and outlet portals) are given in Table 13. In order to update the previous prediction results of the construction time and costs parameters along the Garan tunnel route, in addition to the preliminary datasets obtained from the previously constructed road tunnels (Table 6), the new obtained datasets presented in Table 13 were also used to train the forecasting tools.

Table 13 The more training datasets obtained from the constructed parts of Garan tunnel

The post-updating results for the construction time and costs obtained by the GPR, SVR, and DT tools are compared together in Table 14 through the different statistical evaluation criteria. As the pre-updating step, GPR presented more accurate results than the SVR and DT tools. Also, the SVR tool still has shown a better potential ability in the predictions than the DT.

Table 14 The post-updating prediction results of the construction time and construction costs parameters

Using the construction time and costs datasets of the Garan tunnel route as the input datasets in the exported models of the fivefold cross-validation, the GPR, SVR, and DT tools have predicted the construction time and costs along Garan tunnel route as shown in Figs. 17 and 18, respectively. In Figs. 17 and 18, the updated forecasts of the GPR, SVR, and DT tools on the construction time and costs parameters are compared with the actual parameter one in each location along the tunnel route. According to Figs. 17 and 18, the more proximity between the predicted results and the actual state of the construction time and costs can be seen compared to the pre-updating step.

Fig. 17
figure 17

The post-updating prediction results of construction time obtained by GPR, SVR, and DT along Garan tunnel route

Fig. 18
figure 18

The post-updating prediction results of construction costs obtained by GPR, SVR, and DT along the Garan tunnel route

In Fig. 19, the post-updating error values of the GPR, SVR, and DT tools on the construction time are shown along the tunnel route. According to Fig. 19, the mean error values of the GPR, SVR, and DT tools are less than 0.0073 days, 0.0138 days, and 0.0142, respectively. These values for the construction costs predicted by the GPR, SVR, and DT (Fig. 20) are less than 8.7907 US$, 10.3969 US$, and 10.7844 US$, respectively. Therefore, in comparison with the pre-updating step, the mean error values of the GPR, SVR, and DT tools for the construction time of the tunnel route are reduced by 0.007 days, 0.0033 days, and 0.0033 days, respectively. The mean error values of the GPR, SVR, and DT tools for the predicted construction costs along the tunnel route compared to the pre-updating levels are reduced by 11.8166 US$, 12.9625 US$, and 19.6312 US$, respectively.

Fig. 19
figure 19

The post-updating prediction error results of GPR, SVR, and DT tools for the construction time along Garan tunnel route

Fig. 20
figure 20

The post-updating prediction error results of GPR, SVR, and DT tools for the construction costs along Garan tunnel route

As shown in Fig. 21, in comparison with the pre-updating step, the R2 values of the construction time predicted by the GPR, SVR, and DT tools for the Garan tunnel route have increased by 0.0266, 0.0201, and 0.0129, respectively. Also, according to Fig. 22, comparing with the pre-updating results, the R2 values of the tunnel route construction costs predicted by the GPR, SVR, and DT tools have increased by 0.0449, 0.0464, and 0.0653, respectively.

Fig. 21
figure 21

The updated prediction performance of GPR, SVR, and DT tools on construction time

Fig. 22
figure 22

The updated prediction performance of GPR, SVR, and DT tools on construction costs

Also, to further support the post-updating prediction results of the GPR, SVR, and DT tools on the construction time and costs parameters along Garan tunnel route, the more statistical validation criteria are evaluated by applying the Garan tunnel route datasets in the models exported from the post-updating results of the construction time and costs (Table 15). By studying Table 15, it is clear that all three GPR, SVR, and DT tools provided high accuracy in the predictions, but at all stages, the accuracy of the GPR tool is higher than the SVR; also, SVR performs more accurate than the DT. Furthermore, from these results, the effect of the updating process is objective, and the prediction accuracy is increased.

Table 15 The post-updating prediction results of the construction time and costs parameters along the Garan tunnel route

According to the predicted results before and after the updating step and comparing them with the actual state, it can be said that GPR, SVR, and DT are the three potential prediction tools that can be used to predict the tunnel geology parameters, as well as the time and costs of tunnel construction. However, the GPR performs better than SVR, and SVR performs better than DT. In addition, the updating procedure can significantly reduce uncertainties regarding the unknown tunnel geology conditions and tunnel construction time and costs.

5 Discussion

Undoubtedly, reducing the uncertainties of the underground conditions in tunneling projects is one of the most serious issues. The most important of these uncertainties is the geological and geotechnical conditions of the tunnel route. The uncertainty of these conditions, in turn, increases the risk of the time and costs of the tunnel construction. Therefore, it is necessary to reduce the uncertainty about the geological and geotechnical conditions of the tunnel route in order to reduce risk about the time and costs of constructing the tunnel. Usually, in road tunnels, limited data are available regarding the geological and geotechnical conditions of the tunnel route. Under such circumstances, it is not possible to estimate the time and costs of constructing the tunnel at an acceptable level. To reduce these uncertainties, the GPR, SVR, and DT tools were used to predict tunnel geology and time and costs of its construction. To illustrate the correct functioning of these tools, they were used in a case study of Garan road tunnel. To predict the tunnel route geology, the RMR parameter was considered. Initially, before the construction of the tunnel, the data about the RMR parameter were available only in the inlet and outlet portals and borehole locations. In this stage, only these data were applied to train the prediction tools. Then, the status of the RMR parameter was predicted in each location in the tunnel path by each of the tools. Since no data were available on the construction time and costs before construction of the tunnel, so to train the GPR, SVR, and DT tools, the previously constructed road tunnels were used. With the help of these old tunnels, for different values of the RMR parameter, different construction time and costs were obtained and used as the train datasets of the prediction tools. Then, according to the predetermined RMR values, the time and costs estimations for each meter of construction in any position in the tunnel path were predicted by the GPR, SVR, and DT tools.

In the next step, to show the impact of upgrades during the tunnel construction on the previous predictions, it was assumed that only 200 m was constructed on the inlet and outlet of the tunnel route. Using actual data on the RMR parameter and construction time and costs in the constructed parts as the new train datasets, the previous forecasted results were updated.

To validate the prediction results of the GPR, SVR, and DT tools at each step, the fivefold cross-validation was adopted using the MATLAB tool 2018 through the regression learner app. In this application, there were several different model types of GPR, SVR, and DT tools. At the forecasting stage, all the model types were used individually, and the results of that model were taken into account that was more accurate. To assess the performances of selected models, seven statistical evaluation criteria of R2, MAE, MSE, RMSE, RRMSE, MAPE, and MRE were used. All of these fivefold cross-validation results are summarized in Table 16. Also, in each step of the predictions, the fivefold cross-validation models were exported through MATLAB 2018 tool, and the datasets available along Garan tunnel route were applied in the models to predict the RMR, construction time, and construction costs along Garan tunnel path. For the Garan tunnel route predictions, the predicted results were compared with the actual mode, and like the fivefold cross-validation results, also, the different statistical evaluation criteria were evaluated for these results. All of these results are summarized in Table 17.

Table 16 A summary of the all fivefold cross-validation results predicted by the GPR, SVR, and DT tools
Table 17 A summary of all results along the Garan tunnel route obtained through the exported models of the fivefold cross-validation results

6 Conclusions

In this article, the effects of geological uncertainties in tunnel construction time and costs were tried to be reduced through the machine learning tools of GPR, SVR, and DT. To train the prediction tools, data were gathered from the under-studying tunnel observations and the previously constructed tunnels. To assess the effect of updated outcomes during the tunnel construction, the tools used were updated for one time after constructing 200 m from both inlet and outlet portals of the tunnel. Also, in each stage of the predictions, the fivefold cross-validation was adopted. Finally, all the predicted results were validated through seven validation criteria of MAE, MSE, RMSE, RRMSE, MAPE, MRE, and R2. The validations showed that the GPR results were more accurate than the SVR results, and SVR has more accuracy than the DT. Also, during the tunnel construction, updating the previously predicted results by applying the actual data obtained in the constructed parts can significantly increase the accuracy of the predictions comparing to the pre-updating mode. In conclusion, according to the results of this study, machine learning methods are useful tools for solving problems with complex mechanisms such as geology, construction time, and construction costs of a road tunnel project. Furthermore, the existing models are suitable to use as predictors for tunnel future levels.

In this study, only one feature (RMR) was considered to predict the tunnel path geology, and multiple features can be considered to have a better description of tunnel geology.