Introduction

Several mining companies encounter blast-induced ground vibration problems in the form of both social and environmental impacts on nearby residential areas. Blasting is an essential fragmentation and displacement process of rock to enable more efficient excavation in mining operations (Afeni & Osasan, 2009). Blast vibration levels need to be accurately monitored and reduced to a level that is suitable for both the local community and the mining company.

Blasting is a complex phenomenon that is controlled by a multitude of variables. The difficulty in predicting and controlling the vibration levels is influenced largely by the nature of rock at different geological conditions. Several methods are used to assess the influences of ground vibrations, but peak particle velocity (PPV) is the most preferred kinematic descriptor of ground motion (Saadat et al., 2014). The determination of PPV is therefore of particular importance to mining engineers when designing blasting parameters (Khandelwal, 2012). It was suggested that the distance between the blast location and the monitoring point, coupled with the maximum charge of explosives, are the two main factors that influence PPV primarily, where the level of effect can be influenced directly by the quantity of explosives used (Khandelwal & Singh, 2007).

There are three factors that influence predominantly the degree of shaking, namely PPV, its frequency and its duration (Saadat et al., 2015). Mining operations near residential areas are bound by rules and regulations that prohibit excessive vibration levels (Chen et al., 2015). Altering blast timing sequences and parameters, such as size, depth, orientation, layout, location and number of holes, can potentially play a major role in controlling PPV levels. The geology of the surrounding rock also needs to be taken into account to strengthen conclusions made (Simangunsong & Wahyudi, 2015). However, these geological features constantly change due to the blast locations, meaning that a model suited to a general scenario lacks credibility. Studies undertaken on the predictive capabilities of various models are required for each new mine location. It is very challenging to predict the PPV levels precisely due to the uncertainty of various factors that affect vibration force.

Several benefits will arise if accurate predictions of PPV can be made. The effectiveness of blasting operations will increase as the safety factors set for predicted vibrations levels can be lowered. This can increase profit margins for mining companies by allowing engineers to optimize blasting parameters to achieve more effective and efficient outcomes, and at the same time reduce social and environmental impacts (Mohamed, 2009; Khandelwal, 2011; Khandelwal et al., 2011; Monjezi et al., 2013).

Several models can be used to predict vibration levels, and the accuracy of each model varies for different geological conditions. Many conventional predicting equations have been developed over the years based on explosive charge used per delay and distance between blast face to monitoring point, but in recent times improved models have been used to reduce the variance in the predicted results. One of the main reasons for this improvement is largely due to the increased number of parameters used to provide a more accurate description of the wave propagation effect. Many researchers (Monjezi et al., 2006; Khandelwal & Singh, 2006, 2007, 2009, 2013; Khandelwal, 2010, 2012; Khandelwal et al., 2010, 2017; Fisne et al., 2011; Álvarez-Vigil et al., 2012; Bakhshandeh et al., 2012; Mohamadnejad et al., 2012; Armaghani et al., 2015; Bakhtavar et al., 2017a, 2017b; Bakhtavar & Yousefi, 2019; Ding et al., 2020; Fang et al., 2020; Rezaeineshat et al., 2020; Zhang et al., 2020; Bayat et al., 2021; Bui et al., 2021; Fattahi et al., 2021; He et al., 2021; Qiu et al., 2021) have applied various artificial intelligence techniques, such as artificial neural networks (ANNs) and support vector machines (SVMs), to study PPV. It was found that the maximum charge per delay and distance between the blast face to the monitoring point are important, but other parameters, such as stemming, hole depth, physical and mechanical properties of rock mass, burden and spacing, can also alter the intensity of ground vibration. It is very strenuous to obtain a PPV equation based on all the affected parameters using an ANN, despite its capability in function approximation and prediction of nonlinear relationships among various parameters.

Dehghani and Ataee-pour (2011) developed a PPV predictor equation using dimensional analysis (DA). It was proposed that empirical methods are not suitable due to the large number of important parameters that can affect vibration levels. They selected various input model parameters based on an artificial neural network (ANN) approach. The study established that the effectiveness of the DA model decreased when parameters have limited correlations with vibration levels. Therefore, choosing variables with high correlation effect is particularly important to the success of any model. It was found that when ANN was used to remove all unnecessary variables, the DA model produced much higher levels of correlation compared to using any of the conventional predictor equations. DA was applied successfully by Khandelwal and Saadat (2015) to approximate the effects of blast-induced ground vibrations. The Buckingham Pi theorem was employed to create dimensionless groups. Both linear and nonlinear regression analysis were used to relate these groups, but the nonlinear method was more accurate compared to the linear approach. They also compared the proposed model to other conventional models and found that the DA approach can be suitable for predicting PPV. A similar DA approach was undertaken by Cheng and Chau (2015), who established the approach is feasible. Their aim was to construct a new empirical formula for a standardized PPV, which is dimensionless. The used 500 blast vibration data measurements from the Anderson Road Quarry in Hong Kong to compare the accuracy of the results using conventional and new dimensionless formulas. The parameters considered were explosive energy, duration times and densities of rock. It was realized that the proposed approach obtained better PPV estimates with a correlation coefficient of 0.9509 as opposed to 0.7607 for the conventional methods. Bakhtavar et al., (2017a, 2017b) applied a hybrid DA fuzzy inference system to study and predict blast-induced flyrock in a copper mine based on 320 data sets and found that flyrock predictions by the system were in close agreement with real measurements. Sanchidrián and Ouchterlony (2017) developed a DA approach to study rock fragmentation in bench blasting. Their model could be used from 5 to 100 percentile sizes range with an expected error of < 25% at any percentile. Bakhtavar et al. (2015) applied a DA approach along with multivariate nonlinear regression analysis to study and predict rock fragmentation of Sungun surface mine. They developed a rock fragmentation equation by incorporating various independent dimensionless products derived from DA and compared their fragmentation prediction results with sieve analysis, image processing technique as well as the Kuz-Ram model. They found that their prediction outcome was closer to actual fragmentation results obtained from sieve analysis compared to the other models. Thus, DA has been utilized extensively in the past to analyze complex relationships among various physics-based quantities. The DA method identifies the base quantities of the different variables, where different procedures can be applied to combine certain variables into a form that has dimensionless units in order to facilitate the interpretation and extend the range of application of experimental data (Alhama & Madrid, 2007).

In the present study, a DA approach was exercised to study and assess PPV in an underground gold mine in Australia using various blast design parameters. The novelty of this research work is that both linear and nonlinear DA approaches have been applied to predict and assess blast vibrations in an underground gold mine. Then, a simple and easy-to-use blast vibration equation is proposed based on DA. A comparative analysis of the newly proposed equation and conventional empirical models was performed to obtain the prediction capability of each model.

Dimensional Analysis

DA is an engineering problem solving method that examines complex relationships among a set of physical quantities (Sharp et al., 1992). The procedure involves manipulating the individual variables involved in the problem, whereby the relationships are simplified into an equation that involves a smaller number of dimensionless parameters. The Buckingham Pi theorem is a method that is used when a large number of independent variables is present (Reddy & Reddy, 2014). This theorem is one of the many methods that can be used to develop dimensionless variables. In mathematical terms, the Buckingham Pi theorem states that if a number of measurable quantities (or variables) form a complete functional relationship shown in Eq. (1), then the solution has the form of Eq. (2) (Dehghani & Ataee-pour, 2011), thus:

$$\varphi \left( {A,B,C, \ldots } \right) = 0$$
(1)
$$f\left( {\pi_{1} ,\pi_{2} ,\pi_{3} , \ldots } \right) = 0$$
(2)

where the terms A, B and C are, in this study, the various variables that affect vibration levels. The π terms in Eq. (2) are the dimensionless groups that represent the products of the A, B and C terms. If there are enough fundamental units to describe the magnitude of the quantities considered, then Eq. (2) can represent a complete relationship.

Dimensional quantities employed in DA are represented by m and the variables associated with these quantities are denoted by n. Once the dimensional quantities have been established, they need to be transformed into a combination of either force, length or time. The number of π terms is equal to the difference between the number of measurable variables and fundamental units. Equation (3) is a mathematical representation of this definition, where “r” is the number of π dimensionless terms, thus:

$$r = n - m$$
(3)

Once the desired variables have been chosen, they are placed in matrix form where the fundamental quantities make up the rows and each variable forms the columns. The powers of the fundamental dimensions m corresponding to each variable n are used to construct the matrix (Eq. (3)).

Among the n variables, a number of so-called “repeating variables” are chosen. The number of repeating variables chosen is equivalent to the number of fundamental quantities m. These repeating variables will appear in all the π terms and must be chosen based on the following two restricting rules. First, all repeating variables must have dimensions; they cannot be dimensionless. Second, the overall dimensions of each repeating variable must have different combinations of dimensions compared to all other variables used in the procedure. Once the repeating variables have been selected, the determinant of their fundamental dimensions can be calculated. If the determinant of the square matrix is not equal to zero, then the rows of the matrix are linearly independent. If the determinant is found to be zero, then a different set of repeating variables needs to be considered.

Once all the prerequisite information has been identified, the dimensionless π variables can be calculated. By selecting one of the non-repeating variables (D) and assuming there are three repeating variables (A, B, C), the first π dimensionless group can be calculated as:

$$\pi_{1} = \left[ A \right]^{{a_{1} }} \cdot \left[ B \right]^{{b_{1} }} \cdot \left[ C \right]^{{c_{1} }} \cdot \left[ D \right]$$
(4)

The second dimensionless group can be calculated by once again multiplying the three repeating variables by the next non-repeating variable (E), thus:

$$\pi_{2} = \left[ A \right]^{{a_{2} }} \cdot \left[ B \right]^{{b_{2} }} \cdot \left[ C \right]^{{c_{2} }} \cdot \left[ E \right]$$
(5)

Note that the subscript for the a, b, c constants relates to the subscript for the π groups. The procedure for Eqs. (4)–(5) is continued for all the remaining variables. For the π variables to be dimensionless, the summation of the powers represented by “a, b, c” in the equations needs to be eliminated. By doing this, the “a, b, c” constant will multiply each variable in the group in such a way that the π group will be dimensionless.

Once each π group has been made dimensionless, they need to be manipulated to form an equation. The method chosen to relate these variables is multiple regression analysis. A typical linear regression equation has the form:

$$y = k + x_{1} \cdot (\pi_{1} ) + x_{2} \cdot (\pi_{2} ) + \cdots + x_{r} \cdot (\pi_{r} )$$
(6)

where the r subscript is the number of dimensionless groups determined from Eq. (3) and the x variables are the site constants. Nonlinear regression analysis can also be used, thus:

$$\ln\, \left( y \right) = k + x_{1} \cdot \ln\, \left( {\pi _{1} } \right) + x_{2} \cdot \ln\, \left( {\pi _{2} } \right) + \cdots + x_{r} \cdot \ln\, \left( {\pi _{r} } \right)$$
(7)

Linear Regression Analysis

To determine the site constants, a regression analysis toolpack of Microsoft Excel 2016 was applied. The recorded data were entered into Excel, as per the format shown in Table 1. The columns are all the variables considered and the rows are the iterations of each test.

Table 1 Recorded data used to determine the site constants for the DA equation

For linear regression analysis, a new set of columns needs to be created to the right of the measured variables. Each new column is a dimensionless π group. For example, if the first π group was found to have a dimensionless combination of A × B × C, then the new column would have this combination as a result. The formula in Excel used for this example is shown in Table 2. This process is completed for all dimensionless groups.

Table 2 Dimensionless variables added to the Excel sheet

Once Table 2 is completed, the data analysis tool can be used as it is able to compute a regression analysis on the dimensionless groups. To run this regression analysis, two sets of variables are required. The Y input range comprises the dependent variables and the X input range comprises all the other variables that are independent of the Y values.

The problem with Buckingham Pi theorem is that the dependent variable is typically one of the variables within one of the dimensionless groups. If it is assumed that variable B is the dependent variable, then it means that the \({\pi }_{1}\) column needs to be adjusted to make the B value by itself. This means that, before the regression can be run, the independent dimensionless groups must be modified. If \(\pi_{1} = A \times B \times C\), where B is the dependent variable, then the operation to get B by itself is:

$$f\left( {\pi_{1} \div \left( {A \times C} \right),\pi_{2} \div \left( {A \times C} \right), \ldots ,\pi_{r} \div \left( {A \times C} \right)} \right) = 0$$
(8)

This means that the same operation performed on \(\pi_{1}\) to get B by itself must also be applied to the other dimensionless groups. The Excel spreadsheet can be updated using Eq. (8) (Table 3).

Table 3 Modified Excel spreadsheet to allow linear regression analysis to be run

Running the regression analysis produces the same number of coefficients as the number of modified independent variables, plus an additional intercept coefficient k. Using the above example, Eq. (6) becomes:

$$\begin{aligned} y = & k + x_{2} \cdot \left[ {\frac{{\pi _{2} }}{{\left( {A \times C} \right)}}} \right] + x_{3} \cdot \left[ {\frac{{\pi _{3} }}{{\left( {A \times C} \right)}}} \right] + \cdots \\ & + x_{r} \cdot \left[ {\frac{{\pi _{r} }}{{\left( {A \times C} \right)}}} \right] \\ \end{aligned}$$
(9)

where y is the predicted variable, k is the intercept coefficient generated by Excel and the x coefficients are the site constants that relate to each independent variable selected when running the regression. The terms in brackets in Eq. (9) are no longer the original dimensionless groups. They are now the modified dimensionless groups. The site constants k and x are related to the modified groups, not the original groups. This is true for the predicted variable as well. Y is not the same as \(\pi_{1}\), because replacing \(\pi_{1}\) means that all the site constants are affected as well.

Nonlinear Regression Analysis

Nonlinear regression analysis follows the same process as the linear one, but the former takes the natural logarithm of each dimensionless group, which alters Eq. (2) into:

$$f\left( {\ln (\pi_{1} ),\ln \left( {\pi_{2} } \right), \ldots ,\ln \left( {\pi_{r} } \right)} \right) = 0$$
(10)

Using Eq. (10), Table 2 can be modified into Table 4.

Table 4 Natural logarithms of the dimensionless groups

Again, assuming \(\pi_{1} = A \times B \times C\) and B is the dependent variable, Eq. (10) is modified to get ln(B) by itself as:

$$\ln (\pi_{1} ) = \ln \left( {A \times B \times C} \right)$$
$$\ln \left( {A \times B \times C} \right) = \ln \left( A \right) + \ln \left( B \right) + \ln \left( c \right)$$
$$\ln \left( B \right) = \ln (\pi_{1} ) - \ln \left( A \right) - \ln \left( C \right)$$
$$\therefore \,f\left( {\left[ {\ln (\pi_{1} ) - \ln \left( A \right) - \ln \left( C \right)} \right],\left[ {\ln \left( {\pi_{2} } \right) - \ln \left( A \right) - \ln \left( C \right)} \right], \ldots ,\left[ {\ln \left( {\pi_{r} } \right) - \ln \left( A \right) - \ln \left( C \right)} \right]} \right) = 0$$

Thus, the appropriate columns in Table 4 are modified as shown in Table 5.

Table 5 Modified dimensionless groups to allow nonlinear regression analysis to be performed

The predicted variable y can be calculated using Eq. (7), thus:

$$\begin{aligned} \ln \left( y \right) = & k + x_{2} \cdot \left[ {\ln (\pi _{2} ) - \ln \left( A \right) - \ln \left( C \right)} \right] + \cdots \\ & + x_{r} \cdot \left[ {\ln (\pi _{r} ) - \ln \left( A \right) - \ln \left( C \right)} \right] \\ \end{aligned}$$

To get the predicted variable by itself, the exponential of both sides can be taken as:

$$\begin{aligned} y = & \exp \left( {k + x_{2} \cdot \left[ {\ln (\pi _{2} ) - \ln \left( A \right) - \ln \left( C \right)} \right] + \cdots } \right. \\ & \left. { + x_{r} \cdot \left[ {\ln (\pi _{r} ) - \ln \left( A \right) - \ln \left( C \right)} \right]} \right) \\ \end{aligned}$$

where the y value is the predicted variable, k is the intercept coefficient generated by Excel, and the x coefficients are the site constants that relate to each independent variable selected when running the regression.

Conventional Predictors

Conventional predictor equations are all in the form of a basic power function, thus:

$$y = a\cdot(x)^{b}$$
(11)

where the terms a and b are site constants and x represents various scaled distance combinations with respect to total explosives used per delay. Table 6 highlights some of the common conventional predictor equations proposed by various researchers.

Table 6 Conventional predictor equations used for this study

Field Study

The study was conducted at one of the underground gold mines in Australia. The mineralization transpires within the lower Ordovician mudstones, sandstones and siltstones, which have been metamorphosed and tightly folded around a north-dipping axis. Of all the known western segments of the anticlines, the dip is roughly 70°W; the dip in the eastern sectors range from 85°W to 85°E, and the folded axial planes dip approximately 80°W. The regional strike of the bedding is in a northerly direction. The mineable quartz veins occurring lodes and stockworks typically located within folded sections. The lodes and stockworks exist in west-dipping fault zones. The mineralization is characterized by notable coarse gold contained in the quartz veins. In some sections, a high nugget effect is found with grades reaching 50 g/t or higher over a few meters.

Due to the proximity of the mine to the residential area, the vibration levels and noise produced from the mine operations have strict licensing conditions. At the higher end of the scale are the blast vibration levels, where current restrictions prevent vibration levels from exceeding 10 mm/s at any time and no more than 5% of blasts are permitted to exceed 5 mm/s over a 12-month period. In addition, the maximum blast vibration levels between the hours of 10:00 pm and 7:00 am are 3 mm/s.

At the mine, the production blasting method is an open stoping technique that follows the veins of the gold ore. Once the ore has been mined out the open stope is backfilled, allowing for the next section of the vein to be blasted and mined out. The hole lengths generally range from 10 to 25 m and depend on local ground conditions and orientations of the ore lodes. The diameters of the blast holes are 76 mm. Production blast holes use programmable electronic detonators inserted into a cast primer. Most of the hole is then filled with either ANFO or an impact 50/50 blend depending on the required output. Due to the large sections of rock requiring fragmentation, production blasts typically produce much higher PPV than ordinary development blasts.

Development blast holes are typically 3.7 m in length with 45 mm diameter. These blast holes use non-electronic development detonators connected to an electric detonating cord, which sets off each hole based on the preset delay sequence. Due to the low quantity of explosives used, development blasting practices rarely raise cause for concern and thus are not analyzed for predictions of the PPV.

At the mine, approximately two development blasts per shift are completed, whereas the number of production blasts is approximately two per week. In various locations around the neighboring houses, monitoring instruments are used to measure blast-induced ground vibrations. These instruments are highly sensitive and can on occasion record vibration levels that are not related to blasting activities such as a vehicle driving past. For each production blast, a plethora of variables are recorded, and some of the important variables are listed in Table 7. In conjunction with these variables, the locations within the mine and the monitoring site are also recorded. The blasting parameters for production are highly specialized and are designed to allow for appropriate fragmentation of the surrounding rock. This means that there is no set design for a production blast. These designs can incorporate the number of holes, the diameter of these holes and their lengths, as well as the burden and spacing dimensions for these holes.

Table 7 Variables used to perform DA

The geological characteristics of the surrounding rock between the blast location and the monitoring site are extremely difficult to analyze. What can be examined and approximately measured is the powder factor, which is the amount of force required to adequately fracture the rock. The total tonnage of rock that needs to be fragmented can also be recorded.

The properties associated with the explosives can be evaluated, where the total amount of explosives can play an important role in affecting the peak vibration levels. If all the explosive content used was detonated at the same time, then the peak vibration level can be large. For this reason, there is a delay between sets of holes to prevent excessive vibration levels. The maximum instantaneous charge is recorded, which is the highest charge that can detonate at any given time. The explosives used are both ANFO and impact 50/50, which is an explosive mixture that contains some inert material. The 50/50 impact explosive has a lower explosive force than ANFO; the latter is employed if a more controlled fragmentation is required, which is generally within the perimeters of the blast pattern. This ultimately reduces unnecessary fragmentation.

For the present study, 419 blast vibration data sets were acquired from December 31, 2014, to June 5, 2017, of which 360 data sets were examined to determine the site constants for the various models analyzed. The DA model was specifically designed for the mine; therefore, the variables chosen to develop the equation were based on the data that were previously been recorded (Table 7). To validate the accuracy of the DA model, another ten sets of data acquired from June 6, 2017, to August 29, 2017, were used to predict PPV. These new sets of data were also examined for the most effective conventional predictor equations, whereby the results of the two models were compared to one another.

Error Analysis for Comparing Predictor Models

There are many error indices that can be applied to analyze the effectiveness of a prediction equation. As mentioned by Cheng et al. (2014), the coefficient of determination (R2) is an error analysis tool for judging how effective a regression line predicts the actual results. It aims to show the correlation of predicted value. s against actual measured values. The value of R2 typically ranges between 0 and 1, where 1 indicates a perfect relationship and 0 no correlation. To calculate R2, the first step is to determine the correlation coefficient (r):

$$r = \frac{{n\cdot\sum \left( {x \cdot y} \right) - \sum \left( x \right) \cdot \sum \left( y \right)}}{{\sqrt {\left[ {n\sum \left( {x^{2} } \right) - \left( {\sum x} \right)^{2} } \right] \cdot \left[ {n\sum \left( {y^{2} } \right) - \left( {\sum y} \right)^{2} } \right]} }}$$
(12)

where n is the number of input/output variables, x represent predicted values and y represent actual measured values. Then, R2 is computed as:

$$R^{2} = \left( {\frac{{n\cdot\sum \left( {x \cdot y} \right) - \sum \left( x \right) \cdot \sum \left( y \right)}}{{\sqrt {\left[ {n\sum \left( {x^{2} } \right) - \left( {\sum x} \right)^{2} } \right] \cdot \left[ {n\sum \left( {y^{2} } \right) - \left( {\sum y} \right)^{2} } \right]} }}} \right)^{2}$$
(13)

Expressing R2 as a percentage shows what percentage of data points fall on the regression line. This value is an indicator of the likelihood a predicted value will fall on the regression line. However, the R2 does not necessarily determine causality, even if the relationship is high. For this reason, another error analysis measurements can be applied. The mean absolute error (MAE) and root mean squared error (RMSE) can both be used in conjunction with the R2 to provide a more accurate interpretation of the relationship between the predicted and actual variables. MAE and RMSE are calculated, respectively, as:

$${\text{MAE}} = \frac{1}{n}\sum \left| {y_{j} - \hat{y}_{j} } \right|$$
(14)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum \left( {y_{j} - \hat{y}_{j} } \right)^{2} }$$
(15)

where n is the number of data points analyzed, yj represents actual values and \(\hat{y}_{j}\) represents predicted values. The MAE and RMSE are two of the most commonly considered accuracy measures for continuous variables. The MAE indicates the average magnitude of errors between the predictions made and the actual results. The RMSE also measures the average magnitude of the error. However, but the benefit of RMSE is that it squares the errors before they are averaged, which allows for the RMSE to assign a high weighting to large errors. This is highly desirable for this study as large errors need to be negated as much as possible. Because the MAE and RMSE are both negatively orientated, the lower their values, the better. In other words, if RMSE or MAE is zero, it means a perfect relationship.

Results

Dimensional Analysis

The choice of variables used to develop the dimensionless groups was based on variables the gold mine had previously measured. These variables, with their common dimensions as well as their basic fundamental units, are shown in Table 7. Using Eq. (1), the functional relationships of these variables must therefore satisfy the following relationship:

$$\varphi \left( {{\text{PPV}},\theta ,\delta ,{\text{PF}},{\text{ch}},I,{\text{De}},{\text{MIC}},D,R} \right) = 0$$

The matrix of these variables is presented in Table 8.

Table 8 Matrix of fundamental dimensions of the chosen variables for DA

From the matrix in Table 8, there are three basic fundamental units, namely force, length and time. From Eq. (3), the total number of dimensionless π groups can be calculated as:

$$r = n - m$$
$$\therefore r = 10 - 3 = 7$$

where n is number of variables and m is number of basic fundamental units. Therefore, Eq. (2) is modified as:

$$f\left( {\pi_{1} ,\pi_{2} ,\pi_{3} ,\pi_{4} ,\pi_{5} ,\pi_{6} ,\pi_{7} } \right) = 0$$
(16)

Only three of the variables in Table 8 satisfy the rules for choosing the repeating variables that will appear in each dimensionless group. These variables are powder factor (PF), distance (D) and duration (De). The determinant of the matrix for these variables was found to be 1, thus:

$$\det \left| {\begin{array}{*{20}c} 1 & 0 & 0 \\ { - 4} & 1 & 0 \\ 2 & 0 & 1 \\ \end{array} } \right| = 1$$

Therefore, the rows of the matrix are linearly independent. Developing the dimensionless groups was thus undertaken as displayed in Table 9:

Table 9 Dimensionless groups with their respective dimensional quantities

From Table 9, the dimensions for \(\pi_{2} ,\pi_{3} ,\pi_{4} ,\pi_{5}\) are all equal. This means that all the power constants a, b and c will be the same. \(\pi_{6}\) and \(\pi_{7}\) are dimensionless; therefore, the number of dimensionless groups is equal to the number of variables.

The power constants for \(\pi_{1}\) is calculated as follows:

For force (F): \(a_{1} = 0\).

For time (T): \(2 \cdot a_{1} + c_{1} - 1 = 0\).

For length (L): \(- 4 \cdot a_{1} + b_{1} + 1 = 0\).

By solving these system of equations, the power constants were found to be:

\(a_{1} = 0, b_{1} = - 1, c_{1} = 1\).

Substituting these constant into the equation yields:

$$\pi_{1} = \left[ {{\text{PF}}} \right]^{0} \cdot \left[ D \right]^{ - 1} \cdot \left[ {{\text{De}}} \right]^{1} \cdot \left[ {{\text{PPV}}} \right]$$
$$\therefore \pi_{1} = {\text{PPV}} \cdot \frac{{{\text{De}}}}{D}$$

The power constants for \(\pi_{2}\) is calculated as follows:

For force (F): \(a_{2} + 1 = 0\).

For time (T): \(2 \cdot a_{2} + c_{2} + 2 = 0\).

For length (L): \(- 4 \cdot a_{2} + b_{2} - 1 = 0\).

By solving these system of equations, the power constants were found to be:

\(a_{2} = - 1, b_{2} = - 3, c_{2} = 0\).

Substituting these constants back into the equation yields:

$$\pi_{2} = \left[ {{\text{PF}}} \right]^{ - 1} \cdot \left[ D \right]^{ - 3} \cdot \left[ {{\text{De}}} \right]^{0} \cdot \left[ {{\text{MIC}}} \right]$$
$$\therefore \pi_{2} = \frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{3} }}$$

Therefore, the dimensionless groups were:

$$\begin{aligned} \pi_{1} = & {\text{PPV}} \cdot \frac{{{\text{De}}}}{D} \\ \pi_{2} = & \frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{3} }} \\ \pi_{3} = & \frac{R}{{{\text{PF}} \cdot D^{3} }} \\ \pi_{4} = & \frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{3} }} \\ \pi_{5} = & \frac{I}{{{\text{PF}} \cdot D^{3} }} \\ \pi_{6} = & \theta \\ \pi_{7} = & \delta \\ \end{aligned}$$

From Eq. (6), the linear regression equation used to relate the dimensionless groups was:

$$\begin{aligned} {\text{PPV}} \cdot \frac{{{\text{De}}}}{D} = & k + \left( {x_{2} \cdot \frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{3} }}} \right) + \left( {x_{3} \cdot\frac{R}{{{\text{PF}} \cdot D^{3} }}} \right) \\ & + \left( {x_{4} \cdot \frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{3} }}} \right) + \left( {x_{5} \cdot \frac{I}{{{\text{PF}} \cdot D^{3} }}} \right)~ \\ & + \left( {x_{6} \cdot \theta } \right) + \left( {x_{7} \cdot \delta } \right) \\ \end{aligned}$$
(17)

Equation (17) is the linear DA equation.

From Eq. (7), the nonlinear regression equation used to relate the dimensionless groups was:

$$\begin{aligned} \ln \left( {{\text{PPV}} \cdot \frac{{{\text{De}}}}{D}} \right) = & k + x_{2} \cdot \ln \left[ {\frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{3} }}} \right] + x_{3} \cdot \ln \left[ {\frac{R}{{{\text{PF}} \cdot D^{3} }}} \right] \\ & + x_{4} \cdot \ln \left[ {\frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{3} }}} \right] \\ & + x_{5} \cdot \ln \left[ {\frac{I}{{{\text{PF}} \cdot D^{3} }}} \right] + x_{6} \cdot \ln \left[ \theta \right] + x_{7} \cdot \ln \left[ \delta \right] \\ \end{aligned}$$
(18)

Equation 18 is the nonlinear DA equation.

Equations (17) and (18) were modified to Eqs. (19) and (20), respectively, for PPV determination; thus:

$$\begin{aligned} {\text{PPV}} = & k - x_{2} \cdot \left[ {\frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] - x_{3} \cdot \left[ {\frac{R}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] \\ & + x_{4} \cdot \left[ {\frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] + x_{5} \cdot \left[ {\frac{I}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] - x_{6} \cdot \left[ {\frac{{\theta \cdot D}}{{{\text{De}}}}} \right] \\ & + x_{7} \cdot \left[ {\frac{{\delta \cdot D}}{{{\text{De}}}}} \right] \\ \end{aligned}$$
(19)
$$\begin{aligned} \ln \left( {{\text{PPV}}} \right) = & k - x_{2} \cdot \ln \left( {\frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) + x_{3} \cdot \ln \left( {\frac{R}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) \\ & + x_{4} \cdot \ln \left( {\frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) + x_{5} \cdot \ln \left( {\frac{I}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) \\ & - x_{6} \cdot \ln \left( {\frac{{\theta \cdot D}}{{{\text{De}}}}} \right) + x_{7} \cdot \ln \left( {\frac{{\delta \cdot D}}{{{\text{De}}}}} \right) \\ \end{aligned}$$
(20)

Linear and Nonlinear Regression Analyses Considering All Blasting Data sets

Multiple regression analysis in Microsoft Excel 2016 was used to develop the site constants for Eqs. (19) and (20) and is given in Table 10.

Table 10 Site constants for the linear and nonlinear equations when all the data sets are considered

Therefore, the DA equations using the site constants generated in Excel (Table 10) are:

$$\begin{aligned} {\text{PPV}} = & 1.02 - 1158.2 \cdot \left[ {\frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] - 67.64 \cdot \left[ {\frac{R}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] \\ & + 455.36 \cdot \left[ {\frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] + 401.5 \cdot \left[ {\frac{I}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right] \\ & - 6.99 \times 10^{{ - 5}} \cdot \left[ {\frac{{\theta \cdot D}}{{{\text{De}}}}} \right] + 5.95 \times 10^{{ - 5}} \cdot \left[ {\frac{{\delta \cdot D}}{{{\text{De}}}}} \right] \\ \end{aligned}$$
(21)
$$\begin{aligned} \ln \left( {{\text{PPV}}} \right) = & 2.33 - 0.18 \cdot \ln \left( {\frac{{{\text{MIC}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) + 0.003 \cdot \ln \left( {\frac{R}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) \\ & + 0.47 \cdot \ln \left( {\frac{{{\text{ch}}}}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) \\ & + 0.007 \cdot \ln \left( {\frac{I}{{{\text{PF}} \cdot D^{2} \cdot {\text{De}}}}} \right) - 0.59 \cdot \ln \left( {\frac{{\theta \cdot D}}{{{\text{De}}}}} \right) \\ & + 0.53 \cdot \ln \left( {\frac{{\delta \cdot D}}{{{\text{De}}}}} \right) \\ \end{aligned}$$
(22)

The predicted values PPV, by using Eqs. (21) and (22), were then compared to the actual measured variables (Figs. 1, 2).

Figure 1
figure 1

Actual PPV versus predicted PPV by DA using linear regression analysis

Figure 2
figure 2

Actual PPV versus predicted PPV by DA using nonlinear regression analysis

The R2 for the linear DA regression was 0.2542, whereas for the nonlinear DA regression it was 0.2915. The linear DA regression showed higher MAE and RMSE compared to non-linear DA regression. The errors associated with the linear and nonlinear predictions are reported in Table 11.

Table 11 Errors between predicted and actual PPV values when a single set of site constants is used

DA—Segregation of Site Constants for Comparable Conditions

The gold mine has four different sections within the mine. It also has nine different monitoring locations. Different combinations of monitoring locations with different mine sections were observed in the data. Therefore, the data were segregated to compare constant characteristics relating to distance between the blast location and the monitoring site. This generated a set of site constants for each comparison, thus, creating a variety of predictor equations. Comparisons of the predicted PPV results against the actual PPV results using these site constants are shown in Figures 3 and 4. From these figures, it can be said that the nonlinear DA regression showed higher R2 compared to the linear DA regression. The MAE and RMSE were also lower for the nonlinear DA regression compared to the linear DA regression. The errors associated with the linear and nonlinear predictions are listed in Table 12.

Figure 3
figure 3

Actual PPV versus predicted PPV by DA using linear regression considering location characteristics

Figure 4
figure 4

Actual PPV versus predicted PPV by DA using nonlinear regression considering location characteristics

Table 12 Comparison of actual with predicted PPV values considering location characteristics

Conventional Predictors

The blast monitoring data from the gold mine were used to determine site constants for the four conventional predictor models depicted in Table 6. The site constants of these four conventional predictor equations are shown in Table 13.

Table 13 Site constants for the conventional predictor equations

Figures 5, 6, 7, 8 illustrate the relationships between predicted PPV values and actual PPV values for each of the equations in Table 13. Table 14 shows the errors in terms of R2, MAE and RMSE between the predicted and actual PPV levels for the four conventional equations. It can be seen that the R2ranged from 0.1116 to 0.1611, and the Ambraseys–Hendron predictor equation showed the highest R2and lowest errors among all the four conventional predictor equations.

Figure 5
figure 5

Actual PPV versus predicted PPV by the USBM equation

Figure 6
figure 6

Actual PPV versus predicted PPV by the Langefors-Kihlström equation

Figure 7
figure 7

Actual PPV versus predicted PPV by the Ambraseys–Hendron equation

Figure 8
figure 8

Actual PPV versus predicted PPV by the BIS equation

Table 14 Comparison of actual PPV and predicted PPV using the conventional predictor equations

Conventional Predictors—Segregation of Site Constants for Comparable Conditions

Because, among the four conventional predictor equations, the Ambraseys–Hendron predictor equation got the highest R2 and with least MAPE and RMSE for the predicted PPV compared to actual PPV, it was employed further using segregated data to compare constant characteristics relating to distance between the blast location and the monitoring site. Figure 9 shows a plot of actual PPV versus predicted PPV by the Ambraseys–Hendron equation considering location characteristics. The R2 was 0.4098, MAE 0.4145 and RMSE 0.5762 (Table 15).

Figure 9
figure 9

Actual PPV versus predicted PPV using the Ambraseys–Hendron equation considering location characteristics

Table 15 Comparison between the predicted and actual PPV’s for the Ambraseys–Hendron equation when site constants based on location characteristics are considered

Validation of Effectiveness of DA vs Conventional Models

A new set of ten data points was used to compare the effectiveness of the segregated nonlinear DA equation and the Ambraseys–Hendron equation (Tables 16, 17, 18, 19).

Table 16 Set of results used to compare the accuracy of the segregated DA model against the Ambraseys–Hendron model
Table 17 DA by using the log of modified dimensionless variables in Eq. 20
Table 18 DA by using site constants relating to relevant field in Eq. 20
Table 19 Relevant variable results required to predict PPV using the Ambraseys–Hendron equation

Figure 10 depicts the comparison of the nonlinear DA model and the Ambraseys–Hendron model. Table 20 shows the errors in terms of R2, MAE and RMSE between the predicted and actual PPV levels for the nonlinear DA equation and the Ambraseys–Hendron equation. From Figure 10, it can be said that nonlinear DA model yielded higher R2 and less errors.

Figure 10
figure 10

Plots to compare the nonlinear DA model and the Ambraseys–Hendron model. Dotted line represents the perfect relationship between actual and predicted values

Table 20 Comparison of the nonlinear DA and the Ambraseys–Hendron models

Discussion

Effectiveness of Conventional Predictors

The test conducted at the gold mine supports the claim that no single conventional predictor equation is accurate to predict PPV. If conventional predictor equations are used at a mine location, all of them need to be analyzed and compared to one another to determine the most appropriate model.

On examining the recorded data at the mine, it was found that, of the four conventional predictor equations analyzed, the Ambraseys–Hendron equation was the most accurate. It had a 16.11% R2 whereas the USBM equation had a lower R2of 14.88% (Table 14). This means that, compared to the USBM equation, the Ambraseys–Hendron equation has a slightly higher chance of predicted PPV values falling on the regression line. The Ambraseys–Hendron equation also had slightly lower RMSE and MAE compared to the USBM equation, meaning that the average errors between the actual PPV and the predicted PPV using the Ambraseys–Hendron equation were lower than those using the USBM equation.

Effectiveness of DA Predictors

It was theorized that, to increase the accuracy of predictions, a variety of variables that influence predictions need to be incorporated into the prediction model. Dropping a pebble into a pond creates waves that can be measured accurately. The reason for this is that enough information to fully describe the phenomenon is known. Water as a medium is uniform and the characteristics of the force impacting this medium can be calculated. Blasting can be compared to this analogy, where the medium is the surrounding rock, which is highly variable making vibration waves challenging to measure. The location of the blast is never in the same spot, meaning that relationships between variables are problematic to analyze. However, including more factors that describe wave propagation will increase the accuracy of future predictions. For this reason, a DA approach was used to relate the influencing factors to increase the accuracy of the predictions made at the mine. The equation developed also needed to be usable, whereby the variables chosen to construct the equation are based on variables that the mine already measured and recorded. This means that if a model was to replace the current system, no additional measurements that were not already recorded should be used.

In this study, all the variables were used except for the diameters and lengths of blast holes. The diameters of the blast holes are the same for all blasts, which effectively means that it will not affect the results, and so adding this variable serves no real purpose. There is no consistency to the lengths of the blast holes, whereby most of them have differing lengths for each blast. If the relationship on how a set of blast hole lengths affects PPV were to be analyzed, all holes for a single blast must the same. Another approach to analyzing this relationship is to use the longest blast hole for a given blast, because the longer the hole, the more explosive will be set off when the hole is detonated. Instead, the MIC can be employed. It can be suggested that the greater the hole lengths, the higher the MIC, because there will be more explosives in a given hole. Therefore, the MIC makes the effects of altering the hole lengths redundant.

To inspect the dimensionless groups, both linear and nonlinear multiple regression analyses was used. When comparing the predicted PPV values against the actual PPV values, it was found that the nonlinear regression model proved to be dominant. It had a stronger relationship with an R2 of 29.15% compared to linear model’s R2 of 25.42% (Table 11). Additionally, the nonlinear regression model had a lower RMSE of 0.632 compared to linear regression model’s 0.635. Moreover, the MAE of nonlinear regression model was lower than the linear regression model’s 0.480. For the data collected, it was found that some predicted PPV values of the linear regression model were negative, which is theoretically impossible. To negate this issue, an absolute function on the predicted PPV was taken.

Segregating Site Constants Based on Location Characteristics

The distance used in the analysis using both the conventional predictors and the DA models is the vertical depth of the blast. If the monitoring site was directly above the blast location, then this distance would be an accurate representation of the distance traveled by the blast vibrations. The straight-line diagonal distance between the blast location and the monitoring site (Fig. 11) would be a more accurate representation of the distance. This diagonal distance is not recorded, however, which would mean that the horizontal distance between the monitoring site and the blast location is required to work out this diagonal distance. The gold mine records the section of the mine where the blast is undertaken, as well as the monitoring site used to measure the PPV. However, not all blasts for a section of the mine are recorded at the same monitoring station. Within each mine section, the vertical depth is recorded as depicted in Figure 11. Section A for example, can use any of the monitoring stations, where it is clear that the distance between section A and M2 is less than the distance between A and M4. The issue is that the results are being analyzed on the assumption that differences in horizontal distance between mine sections and monitoring stations do not affect the distance traveled by the blast waves. However, this is an important factor that needs to be addressed.

Figure 11
figure 11

Representation of the monitoring site locations in respect to the mine sections

To overcome the above problems, the results were grouped into their respective mine sections and then again for each monitoring station. This is allowed for the vertical depth to provide a more accurate distance measure, as it can essentially negate the effect of the horizontal distance. The vertical distances would then become beneficial for comparing how the depth within each mine section would affect the PPV at the same monitoring stations. Analysis of the blast monitoring data showed that there were 15 combinations between the various sections of the mines and the monitoring sites. A set of site constants was generated for each of these combinations for the most accurate DA model, which was by nonlinear regression, and the most accurate conventional predictor, which was the Ambraseys–Hendron equation. From Figure 4, which presents the plots of the predicted PPV versus actual PPV, it was found that, by segregating the data into comparable conditions, the MAE was reduced from 0.480 to 0.311 and the RMSE was reduced from 0.632 to 0.436. In addition, the R2 increased from 29.15% to 65.69%. For the Ambraseys–Hendron equation, similar outcomes were achieved when the adjusted distance traveled by the vibrations was considered (Fig. 9), improved accuracy scores of MAE = 0.415, RMSE = 0.576 and R2 = 40.98% (Table 15).

Comparison between DA and Conventional Predictors

By analyzing the accuracy scores, it was determined that the DA approach was more accurate than the conventional predictors, but the latter were easier to use. The most effective conventional predictor had a R2 of 16.11% whereas the nonlinear DA equation had a R2 of 29.15%. The RMSE and MAE for the nonlinear DA equation were also lower than the most effective conventional predictor. These results support the hypothesis that more variables that affect PPV are required to improve the accuracy of the predictions made. While it is important to incorporate a range of parameters, not all parameters necessarily improve the quality of the DA model. Many researchers (Dehghani & Ataee-pour, 2011; Khandelwal & Saadat, 2015) came up with the same conclusions. Only using quantity of explosives and distance does not incorporate enough variables to model PPV, whereas the only benefit to using the conventional predictors is that they can be used as a quick measure to predict PPV roughly.

To further demonstrate the concept of including more variables that affect PPV, the vertical distance measure was modified to incorporate variations in horizontal distance. This was done for the most effective DA model as well as the most effective conventional predictor. A new set of data was retrieved to analyze the effectiveness of each of these two models. Comparing the regression lines to the perfect regression line in figure, it is evident that the DA model was more effective. It is clear that, compared to the Ambraseys–Hendron equation with R2 of 50.87%, the nonlinear DA equation with R2 of 62.47% resembles more closely the perfect regression line. It is evident further that the average errors of regression lines were higher for the Ambraseys–Hendron equation than for the nonlinear DA equation.

The model for predicting PPV in the gold mine depends, therefore, on what the users of the model wish to achieve. If a quick and rough estimate is required, the Ambraseys–Hendron equation with one set of site constants can be used, thus:

$${\text{PPV}} = 7512.323\left( {\frac{{\text{D}}}{{{\text{MIC}}^{\frac{1}{3}} }}} \right)^{ - 1.6761}$$

Using this equation is immensely simple once a computing program has been set up to calculate PPV automatically. This model can be the suggested method for many mining operations located near residential areas, because to be able to predict PPV effectively can improve business performance. For example, increasing the PPV prediction capability reduces the need to incorporate unnecessarily high safety factors.

Conclusions

Dimensional analysis (DA) and conventional predictors were used to study the blast vibration levels at one of the underground gold mines in Australia. For the study, a DA approach was employed to develop an equation that can increase PPV prediction accuracy at the mine site. The current model used at the mine is the United States Bureau of Mines (USBM) equation, which is one of the many conventional predictors that have been developed in the past. Conventional predictors incorporate the maximum explosive charge at any given time and the distance from the monitoring point to the blast location. However, DA includes other parameters that provide additional effect on wave propagation.

Most of the conventional predictors are based on only two to three parameters (i.e., explosive charge per delay, distance between blast face to monitoring point, attenuation factor), whereas it is a well-known fact that blast vibrations are influenced by various blast designs, explosives and rock parameters. Thus, here, a greater number of blast design parameters were taken into consideration with the help of DA to assess and evaluate blast vibrations due to underground mining and a new, simple and easy-to-use blast vibrations equation was proposed to evaluate and predict underground blast vibrations in an effective and efficient manner.

It was found that the DA equation obtained by nonlinear multiple regression had the highest accuracy scores. The data also showed that of the four conventional predictors, the Ambraseys–Hendron equation was the most effective at the gold mine. These two models were compared directly to one another to validate the effectiveness of the DA model. Before the models were compared, the vertical depth used as the distance for the two equations was modified by grouping data with similar horizontal distances between blasts and monitoring sites. This process reduced the effect of the horizontal variance, which proved to greatly increase the accuracy of prediction. To validate the results, ten new sets of blast data were obtained and used in where equations predict PPV. The R2 of the nonlinear DA equation was found to be 0.62 as opposed to 0.51 for the Ambraseys–Hendron equation (Table 20), and the RMSE for the DA model was 0.28 compared to 0.32 for the Ambraseys–Hendron equation.

By analyzing all the data, it was suggested that the DA model is used when a higher degree of accuracy between predicted and actual PPV is required. The advantage of a DA model is that there are little restrictions on the selection of important variables, and its predictive capability increases when more of these important variables are added to the model. The only main drawback of the DA approach compared to the conventional predictors is that it can take longer to set up. However, adding irrelevant parameters that show no relationship can taint the results. In future, a greater number of appropriate blast design, explosive and rock parameters can be taken into consideration to develop a more generalized blast vibration equation.