Introduction

Blasting is a key technique adopted mostly in the civil and mining engineering fields for rock fragmentation purposes. The challenging issue is that, in each blasting event, only around 20% of the generated energy is applied to rock fragmentation and the rest of the energy brings about different adverse impacts on surrounding environment and structures, for instance, ground vibration, flyrock and airblast (e.g., Hajihassani et al. 2014, 2018a, b; Matidza et al. 2020; Chen et al. 2019). The phenomena induced by blasting are illustrated in Figure 1. Among the induced adverse effects of blasting, ground vibration is recognized as the most destructive impact because it typically causes structural vibrations, demolition of buildings, instability of bench and slope, and in some cases, significant damage to underground water (e.g., Monjezi et al. 2010, 2011, 2013; Khandelwal et al. 2011; Ghasemi et al. 2013; Saadat et al. 2014; Hajihassani et al. 2015; Abbas and Asheghi 2018). Ground vibration is generally measured based on peak particle velocity (PPV). Accordingly, to minimize the impact of blasting events on the environment and structures, there is a need to accurately estimate blast-induced PPV.

Figure 1
figure 1

Blasting-induced phenomenon

Different experiments and techniques have been proposed by different researchers to estimate PPV induced by blasting. The experimental studies were aimed generally at establishing empirical equations based on relationships between distance of PPV measurement (D) and explosive charge per blasting delay (W) (Ghosh and Daemen 1983; Roy 1991). However, these empirical equations have provided low-quality prediction accuracy in several cases; therefore, artificial intelligence (AI)-based techniques have received more attention in recent years. Numerous researchers (e.g., Monjezi et al. 2009; Zhou et al. 2015; Nikafshan Rad et al. 2018; Zhou et al. 2019a, b, c) have applied different AI methods to solve different problems in engineering fields, and they have become popular in the prediction of PPV.

Artificial neural network (ANN) was employed by Monjezi et al. (2013) to estimate PPV. The results obtained by ANN were compared with those of empirical models, which revealed that ANN was more efficient than the rivals regarding the task defined. In another study, Hasanipanah et al. (2017) presented the classification and regression tree (CART) for predicting PPV. They also developed multiple regression (MR) and several empirical models for the same purpose. Their findings confirmed the superiority of CART over MR and empirical models regarding the PPV estimation. With the same objective, Zhang et al. (2019) made use of an optimized XGBoost to estimate PPV. They used particle swarm optimization (PSO) to optimize XGBoost parameters. To check the acceptability of the model, they also employed some empirical models. Their results revealed that PSO-XGBoost outperformed the empirical models in regard to PPV estimation. Bui et al. (2020) integrated the quantile regression neural network (QRNN) and the fuzzy C-means clustering (FCM) for predicting PPV. Their results obtained by the FCM-QRNN model were compared to those of the random forest (RF) and ANN. The comparative results confirmed the superiority of FCM-QRNN over the others in estimating PPV.

Jiang et al. (2019) examined the capacity of a neuro-fuzzy inference system for prediction of PPV and made a comparison with the results of the MR model. The neuro-fuzzy inference system delivered more accurate results than MR. Nguyen et al. (2020) tested the capability of a hybridized model integrating the ANN and k-means clustering algorithm (HKM) in PPV estimation. They also used classical ANN, support vector regression (SVR), HKM and hybridized form of SVR and empirical models in their study. Their findings revealed that the proposed HKM-ANN achieved performed better in forecasting PPV compared to the other models noted above. Fang et al. (2020a) hybridized the imperialist competitive algorithm (ICA) with M5Rules to estimate PPV. Their results confirmed higher efficiency of ICA-M5Rules in comparison with other models in PPV prediction. For the same goal, Ding et al. (2020) offered the ICA to optimize XGBoost parameters. They also made use of SVR, ANN and gradient boosting machine (GBM). Their results showed that ICA-XGBoost outperformed the ANN, SVR and GBM methods in terms of PPV estimation. Yang et al. (2020b) hybridized SVR with optimization algorithms such as the genetic and firefly algorithms. Based on their results, the firefly-SVR model provided more acceptable predictions for PPV. In a study by Li et al. (2020), a biogeography-based optimization algorithm was combined with ANN. They showed that the proposed model outperformed the extreme learning machine and ANN. Amiri et al. (2020) used another strategy for optimizing ANN. In this regard, they used the itemset mining algorithm and demonstrated its superiority in this field. Yang et al. (2020a) predicted PPV using ANFIS combined with genetic algorithm (GA) and PSO. According to their results, both GA and PSO were useful algorithms for improving the ANFIS performance. Shang et al. (2020) combined the ANN and firefly algorithm (FA) to predict PPV. They indicated the effectiveness of ANN-FA model in the field. A combination of cubist algorithm (CA) and GA was proposed by Fang et al. (2020b) to predict PPV. They compared the performance of the proposed model with several machine learning methods. Their results showed the superiority of CA-GA model over other models. Yu et al. (2020c) offered an advanced relevance vector machine method for predicting PPV and concluded the acceptability of the proposed method. In the same purpose, a modified PSO algorithm was combined with extreme learning machine by Jahed Armaghani et al. (2020). Their results revealed the modified PSO-extreme learning machine method was perfectly able in predicting the PPV.

A review of literature shows that optimization algorithms, especially the particle swarm optimization algorithm, are becoming increasingly popular for PPV prediction. These algorithms have demonstrated high capacity in improving the effectiveness of predictive models. This has been considered only for ANN and XGBoost models. However, there is a need for innovative hybridized models in the field of engineering to mitigate the destructive impacts that blasting operations in a mine may exert on surrounding environment. The present study attempted to expand the body of knowledge by proposing the relevance vector regression (RVR) optimized by grey wolf optimization (GWO) (the RVR-GWO model) and by bat-inspired algorithm (BA) (the RVR-BA model) for predicting blast-induced PPV.

Field Investigation

A comprehensive research was carried out in the Harapan Ramai granite quarry, located in Johor, Malaysia, to measure and predict PPV. Geographically, the quarry is situated at latitude \(1^\circ 30^{'} 42^{''} {\text{N}}\) and longitude \(103^\circ 50^{'} 54^{''} {\text{S}}\). This quarry has the capacity of producing almost 40,000 tons of granite monthly. In the excavation operations, the drilling-and-blasting method is generally used to displace and fragment rock mass. In the Harapan Ramai project, dynamite and ANFO are used as the main explosives. Blast holes are drilled usually with a diameter of 150 mm. After charging the blast holes with explosive material, fine gravels were used as stemming material. In each blasting operation, values of W, burden-to-spacing ratio (B/S), stemming length (SL) and D are measured. Additionally, values of PPV in each blasting event are measured and recorded using the VibraZEB seismograph. To measure D, the GPS (global positioning system) is used; with this instrument, distances between blast-points and the VibraZEB seismograph are carefully measured. More details regarding the datasets used in this study are mentioned in the next sections.

Models Explanation

In this part, the review of literature related to the RVR is presented; then, the GWO and BA are explained. Optimization improves the performance of the RVR model by selecting the optimal value of its parameters.

Relevance Vector Regression

The RVR proposed by Tipping (2001) is a probabilistic method that works based on the Bayesian approach. It does not need to predict the error/margin tradeoff parameter C, which can decrease the time and the kernel function, and it does not need to satisfy the Mercer condition. Due to the RVR advantages over the SVR approach, RVR has been applied increasingly to regression prediction problems in recent years (Fang et al. 2015; Fang and Su 2020). With RVR, which assumes that the model is single-output (t) multiple-input (x), \(\left\{ {x_{n} ,t_{n} } \right\}_{n = 1}^{N} ,\) the output \(t = \left( {t_{1} , \ldots ,t_{N} } \right)^{T}\) can be represented as the sum of a vector \(y = \left( {y(x_{1} ), \ldots ,y(x_{N} )} \right)^{T}\). The target output is defined as:

$$t_{n} = y\left( {x_{n} ,w} \right) + e$$
(1)

where e signifies random noise and w is a weight vector. The y(x) function is defined as:

$$y\left( {x,w} \right) = \sum\limits_{i = 1}^{N} {w_{i} K\left( {x,x_{i} } \right)} + w_{0} = \sum\limits_{i = 1}^{N} {w_{i} \varPhi \left( x \right)}$$
(2)

where \(\varPhi \left( x \right) = \left[ {1,K\left( {x,x_{1} } \right),K\left( {x,x_{2} } \right), \ldots ,K\left( {x,x_{N} } \right)} \right]\). The target output can then be written as \(p\left( {t_{n} \left| {x_{n} } \right.} \right) = N\left( {t\left| {y(x_{n} ),\sigma^{2} } \right.} \right)\). The likelihood can be given as:

$$p\left( {t\left| {w,\sigma^{2} } \right.} \right) = \frac{1}{{2\pi \sigma^{2} }}\exp \left\{ { - \frac{1}{{2\sigma^{2} }}\left\| {t - \varPhi \left( x \right)\left. w \right\|} \right.} \right\}$$
(3)

where \(w = \left( {w_{0} ,w_{1} , \ldots ,w_{N} } \right)\,\), \(\,t = \left( {t_{1} ,t_{2} , \ldots ,t_{N} } \right)\) and \(\varPhi\) is the \(N \times \left( {N + 1} \right)\) design matrix. Thus, the RVR method adopts a Bayesian perspective and constrains (w and \(\sigma^{2}\)), thus:

$$p\left( {w\left| \alpha \right.} \right) = \prod\limits_{i = 1}^{N} {N\left( {w_{i} \left| {0,\alpha_{i}^{ - 1} } \right.} \right)} = \frac{1}{{2\pi^{{{{(N + 1)} \mathord{\left/ {\vphantom {{(N + 1)} 2}} \right. \kern-0pt} 2}}} }}\prod\limits_{i = 1}^{N} {\alpha_{i}^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0pt} 2}}} } \exp \left( { - \frac{{\alpha_{i} w_{i}^{2} }}{2}} \right)$$
(4)
$$p(\alpha ) = \prod\limits_{i = 1}^{N} {{\text{gamma}}\left( {\alpha_{i} \left| {a,b} \right.} \right)}$$
(5)
$$p(\beta ) = {\text{gamma}}\left( {\beta \left| {a,b} \right.} \right)$$
(6)

where a is a N + 1 hyper-parameter, b = \(\sigma^{2}\), and gamma \(\left( {\alpha \left| {a,b} \right.} \right)\) is defined as

$${\text{gamma}}\left( {\alpha \left| {a,b} \right.} \right) = 1\varGamma (a)^{ - 1} b^{a} \alpha^{a - 1} e^{ - b\alpha } \varGamma (a) = \int\limits_{0}^{\infty } {t^{a - 1} e^{ - t} dt}$$
(7)

In addition, the posterior over weights can be given as:

$$p(w\left| {t,\alpha ,\sigma^{2} } \right.) = \frac{{p(t\left| {w,\sigma^{2} } \right.)p(w\left| \alpha \right.)}}{{p(t\left| {\alpha ,\sigma^{2} } \right.)}} = \frac{1}{{2\pi^{{{{(N + 1)} \mathord{\left/ {\vphantom {{(N + 1)} 2}} \right. \kern-0pt} 2}}} }}\left| {\sum {} } \right|^{{{{ - 1} \mathord{\left/ {\vphantom {{ - 1} 2}} \right. \kern-0pt} 2}}} \exp \left\{ { - \frac{1}{2}\left( {w - \mu } \right)^{T} \sum\nolimits_{{}}^{ - 1} {\left( {w - \mu } \right)} } \right\}$$
(8)
$$\sum { = \left( {\sigma^{ - 2} 2\varPhi^{T} 2\varPhi + A} \right)}^{ - 1}$$
(9)
$$\mu = \sigma^{ - 2} \sum {\varPhi^{T} t}$$
(10)

where \(A = {\text{diag}}\left( {\alpha_{1} ,\alpha_{2} , \ldots ,\alpha_{N} } \right)\) and the likelihood distribution can be given as:

$$p\left( {t\left| {\alpha ,\sigma^{2} } \right.} \right) = \int {p(t\left| {w,\sigma^{2} } \right.)p(w\left| \alpha \right.)dw = (2\pi )^{{{{ - N} \mathord{\left/ {\vphantom {{ - N} 2}} \right. \kern-0pt} 2}}} } \left| C \right|^{{ - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0pt} 2}}} \exp \left\{ {\frac{1}{2}t^{T} C^{ - 1} t} \right\}$$
(11)

where the covariance is given by \(C = \sigma^{ - 2} I + \varPhi A^{ - 1} \varPhi^{T}\). Detailed description of the RVR method has been provided in the literature (e.g., Geem et al. 2001; Fang et al. 2019).

Various fields of study make use of the RVR model for prediction purposes. In order to examine rock mass boreability, the RVR model was developed by Fattahi (2020a). In other studies, Fattahi (2020b, c) used the RVR model to predict the unconfined compressive strength and penetration rate of tunnel boring machines and found the effectiveness of RVR for prediction purposes especially in the mining and geotechnical fields.

Bat-Inspired Algorithm

The BA is an optimization algorithm suggested by Yang (2010). It is inspired by the echo-location behaviors of microbats. The ith bat flies randomly with position xi at velocity νi with a fixed frequency fi. It varies its loudness A0 and wavelength λ to search for prey. The frequency, positions and velocity of the bats are updated (Ansari and Gholami 2015). The best selected current solution can be given as:

$$x_{\text{new}} = x_{\text{old}} + \rho A^{t}$$
(12)

where ρ∈[− 1,1] and At denote the average loudness. In addition,

$$A_{i}^{t + 1} = \alpha A_{i}^{t} \,\,\,\,\,\,\,0 \le \alpha \le 1$$
(13)
$$r_{i}^{t + 1} = r_{i}^{0} \left[ {1 - \exp \left( { - \gamma t} \right)} \right]$$
(14)

Note that ri and Ai are updated during the algorithm operation procedure. A detailed description of the BA can be found in Yang (2010). Moreover, Figure 2 presents the flowchart of the BA. In this study, we adopted the BA for selection of appropriate variables RVR in order to increase the runtime efficiency of RVR-BA.

Figure 2
figure 2

Flowchart of BA

The acceptability and reliability of the BA have been investigated by many researchers. For instance, Saba et al. (2017) predicted time and intensity of future earthquakes using a combination of ANN and BA. Additionally, hybridizing ANN with BA was used to predict air travel demand in a study carried out by Mostafaeipour et al. (2018). In another research, Chen et al. (2019) used the BA in data-driven mineral prospectivity mapping. The findings of the afore-mentioned studies indicate effectiveness of BA for optimization purposes.

Grey Wolf Optimization Algorithm

The GWO is a new population-based algorithm proposed by Mirjalili et al. (2014). It is inspired by grey wolves’ behavior in nature. In this algorithm, four groups are defined: omega, alpha, delta and beta. Moreover, the three hunting steps (i.e., encircling prey, attacking prey and searching for prey) are simulated. In the GWO, some parameters need to be set in numbers, namely delta, the number of sites selected for neighborhood search, the stopping criterion, the maximum number of iterations, beta, initialization of alpha and the number of search agents. A detailed description of the GWO has been provided in the literature by Mirjalili et al. (2014, 2016). Figure 3 presents a flowchart of the GWO. In the current study, the GWO was used to select the appropriate parameters of RVR model.

Figure 3
figure 3

Flowchart of GWO

The potential of the GWO has been highlighted in many studies. Xu et al. (2020) offered it for optimizing SVR in approximating shear strength and unconfined compressive strength of rock. In another study, Gao et al. (2020) developed the GWO to predict peak shear strength of rock. The application of GWO was also investigated by Yu et al. (2020b) for optimizing the SVR parameters for evaluation of rock movement induced by blasting events in mines. Recently, Shariati et al. (2020) predicted compressive strength of concrete using a hybrid of the GWO and extreme learning machine. The afore-mentioned studies confirmed that GWO can be used as a powerful algorithm for optimizing purposes.

RVR Optimized by BA and GWO

An epsilon RVR model with the radial basis kernel function is defined with some parameters, on which its performance depends greatly. In this paper, the GWO and BA are applied as optimizers for the hyper-parameters of RVR. Typically, RVR is hybridized separately with GWO and BA, and the prediction result achieved by a GWO- or BA-hybridized RVR acts as a fitness function evaluation. The optimized hyper-parameters of the RVR can be obtained after the specified maximum iteration number has been reached. Regulated parameters for running the GWO and BA are presented in Tables 1 and 2, respectively.

Table 1 Parameters of GWO
Table 2 Parameters of BA

In this paper, the objective function was served by the root mean squared error (RMSE); the lower the RMSE, the higher the estimation accuracy. The RMSE can be defined as:

$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum \left( {a - p} \right)^{2} }$$
(15)

where a and p denote actual and predicted values. The procedure of optimizing the RVR variables with GWO and BA is presented in Figure 4.

Figure 4
figure 4

Optimization of RVR parameters using GWO and BA

Prediction of ppv using RVR-GWO and RVR-BA

In this section, the implementations of the RVR-GWO and RVR-BA models based on the prepared database are explained briefly.

Database

For RVR-GWO and RVR-BA modeling, 95 datasets were collected and divided into two subsets: 75 datasets were allotted for training the models and 20 datasets for testing and verifying the constructed models. The input parameters were W, B/S, SL, and D parameters, and the output parameter was PPV. The elementary statistics of datasets used in this study are presented in Table 3. In addition, the ranges of the parameters implemented in the modeling processes are shown in Figure 5 and summarized as follows:

Table 3 Statistical details of the datasets used
Figure 5
figure 5

Information about the variables in the database

  • W parameter: 33%, 16%, 23% and 28% of the whole data were between 0–350, 350–450, 450–550 and 550–650 kg, respectively.

  • B/S parameter: 22%, 34%, 29% and 15% of the whole data were between 0–0.8, 0.8–0.83, 0.83–0.86 and 0.86–0.9, respectively.

  • SL parameter: 13%, 21%, 34% and 32% of the whole data were between 0–2.5, 2.5–4.5, 4.5–6.5 and 6.5–8 m, respectively.

  • D parameter: 16%, 11%, 39% and 34% of the whole data were between 0–150, 150–250, 250–350 and 350–450 m, respectively.

  • PPV parameter: 19%, 37%, 35% and 9% of the whole data were between 0–8, 8–14, 14–25 and 25–34 mm/s, respectively.

Pre-processing of Data and Evaluation of Model Performance

To improve the stability of training of the RVR-BA and RVR-GWO models, both output and input data need to be normalized. In this study, all data were converted into values in the range [0, 1] using the following equation:

$$x_{M} = \frac{{x - x_{\hbox{min} } }}{{x_{\hbox{max} } - x_{\hbox{min} } }}$$
(16)

where x, xminxmax and xM are values to be normalized, minimum value, maximum value and normalized values, respectively.

To evaluate model performance, the correlation coefficient (R), mean squared error (MSE), mean absolute percentage error (MAPE) and variance account for (VAF) were used as measures of accuracy. MSE, R, MAPE and VAF could be defined, respectively, as follows (e.g., Rezaei et al. 2011; Fattahi 2015a, b; Mostafaeipour et al. 2018; Mehrdanesh et al. 2019; Hasanipanah et al. 2018a, b, 2020a, b, c, d; Gao et al. 2020; Jing et al. 2020; Ramezanalizadeh et al. 2020a, b; Yu et al. 2020a; Zhou et al. 2020):

$${\text{MSE}} = \frac{1}{n}\sum \left( {a - p} \right)^{2}$$
(17)
$$R = \frac{n(\sum ap) - (\sum a)(\sum p)}{{\sqrt {\left[ {n\sum a^{2} - (\sum a)^{2} } \right]\left[ {n\sum p^{2} - (\sum p)^{2} } \right]} }}$$
(18)
$${\text{MAPE}} = \left[ {\frac{1}{n}\sum \frac{{\left| {a - p} \right|}}{{a_{m} }}} \right] \times 100$$
(19)
$${\text{VAF}} = \left[ {1 - \frac{{{\text{var}} \left( {a - p} \right)}}{{{\text{var}}a_{m} }}} \right] \times 100$$
(20)

where a, p and n are actual value, predicted value and observations number, respectively.

Results and Discussion

In this study, the RVR-GWO and RVR-BA models were proposed to predict PPV. The results obtained from the comparative experiments (MSE, R, MAPE and VAF) on these two hybrid intelligence models are listed in Table 4. It is worth mentioning that the lowest MSE and MAPE, and the highest R and VAF are the most ideal results. Table 4 shows that, in the testing phase, the RVR-GWO model (with R = 0.915, MSE = 7.920, MAPE = 15.343 and VAF = 83.476%) is more accurate than the RVR-BA model (with R = 0.867, MSE = 8.551, MAPE = 16.109 and VAF = 75.012%) in predicting PPV. In addition, a correlation between the estimated and measured PPV for all 76 data points is depicted in Figure 6. This figure indicates the RVR-GWO model possessed a superior predictive capacity over the RVR-BA model for predicting PPV because a very close agreement was obtained between the measured and predicted PPV. Moreover, comparison of PPV estimated by the RVR-BA and RVR-GWO models and the measured PPV for all 76 data points is depicted in Figures 7 and 8, respectively. Furthermore, the Taylor diagram related to predictive models related to all data points is shown in Figure 9. These figures and Table 4 show that the RVR-GWO and RVR-BA models were able to successfully estimate PPV; however, the RVR-GWO performed better than the RVR-BA in both training and testing datasets.

Table 4 Performance indices for the two hybrid intelligent models
Figure 6
figure 6

Plots of estimated PPV versus measured PPV: (a) RVR-BA; (b) RVR-GWO

Figure 7
figure 7

Errors of PPV estimation for training datasets using: (a) RVR-BA; (b) RVR-GWO

Figure 8
figure 8

Errors of PPV estimation for testing datasets using: (a) RVR-BA; (b) RVR-GWO

Figure 9
figure 9

Taylor diagram for all data points

Conclusions

Ground vibration induced by blasting is a major concern in surface mines because it can cause damage to nearby structures. Therefore, predicting ground vibration is a practical need, especially for safety issues. The objective of this study was the development of two advanced machine learning models to predict ground vibration in the Harapan Ramai granite quarry located in Malaysia. To do so, the RVR-BA and RVR-GWO models were developed. Totally, 95 sets of data were used to develop the two models. Then, the performances of the developed models were checked through comparative experiments using MSE, R, MAPE and VAF%. The main conclusions of the research are the following:

  • The proposed RVR-GWO model was more effective and robust than the proposed RVR-BA model in predicting ground vibration. This confirms the effectiveness of the GWO in optimizing the RVR performance and its capacity for generalization.

  • The proposed RVR-GWO model can be addressed as a powerful tool for the prediction of other phenomena induced by mine blasting such as flyrock and air-overpressure.

  • For future research, it is interesting to develop the RVR model optimized with other metaheuristic algorithms such as water wave optimization, wind-driven optimization, simplified swarm optimization, shark smell optimization, sine cosine algorithm, locust swarm algorithm, krill herd algorithm, etc.