1 Introduction

Modern cities are usually congested and the supply of space becomes an emerging issue. The use of underground spaces, rock caverns, or partial removal of rocky hills becomes attractive to society. The drill-and-blast method is commonly adopted to break the rock mass, turn it into fragments of handleable size, and, therefore, to create spaces. Records showed that the discovery of explosives, also known as “gunpowder” or “black powder”, could be traced back to the 9th century in China (Needham 1987). Despite its long history, the well-documented use of explosives to replace mechanical digging in mining engineering can only be traced back to about 400 years ago (Persson et al. 1993). Nowadays, a wide variety of explosives having different compositions and performance characteristics are used for rock blasting, e.g., ammonium nitrate/fuel oil (ANFO), dynamite, emulsion, etc. With the advances in science and technology, the efficiency and safety of explosives no longer remains the sole concern. On the other hand, blast-induced vibrations and, thus, damage, particularly in densely populated urban areas, arouses more public attention. Excessive vibration does not only cause human annoyance but also leads to cracks or even the collapse of nearby structures. A huge amount of insurance claims could be a big concern.

In blast engineering practice, vibration is usually described simply by the peak particle velocity (PPV). Based on the consideration of energy decay, the empirical relationship among the PPV, the charge weight per delay, and the distance from the blast source to the receiver can be formulated. Engineers use the established relation to make a decision on the amount of explosives subject to a given PPV limit, which is often specified either by the client or governing bodies. In a blasting project, site-specific fitting coefficients would be preferable, as they include more precise site-specific geological and blasting information (such as rock quality, type of explosive, delay time, etc.). However, they can only be obtained in a progressive manner, since vibration records are gradually collected at different stages of the project. How to use limited site-specific data to obtain reliable PPV prediction becomes a burning issue. In recent years, probabilistic analysis has aroused more and more attention in geotechnical engineering in order to deal with various types of modeling error and uncertainty (Griffiths and Fenton 1998; Zhang et al. 2004, 2009; Fenton and Griffiths 2005, 2008; Phoon and Kulhawy 2005; Yuen and Katafygiotis 2005; Griffiths et al. 2009; Huang et al. 2010; Ching et al. 2010, 2012; Phoon et al. 2010; Wang 2011; Wang et al. 2010; Yuen 2010a, b; Blair 2011a, b; Yuen and Mu 2011; Ching and Phoon 2012; Chiu et al. 2012a, b; Yan et al. 2012, 2013; Cao and Wang 2012, 2014; etc.). In this study, a methodology which employs the Bayesian probabilistic approach to shed light on the prediction of the PPV is presented.

The main objective of this paper is to demonstrate the process of deriving a probabilistic-based blast design formula using site-specific vibration records. A reliable confidence level of the PPV can be constructed by the proposed techniques with different computational efforts. It will be shown that a simple spreadsheet can be developed by the users to perform the calculation, without the need for any add-on tools. The proposed method is extremely useful and friendly to engineers to better predict the magnitude of blast-induced vibration when site-specific data are particularly limited at an early stage of a project.

2 Empirical Design Formula

Blair (2011a) has comprehensively reviewed two broad categories of vibration prediction methods being employed in blast engineering: the waveform superposition technique and the charge weight scaling technique, respectively. The study concluded that the charge weight scaling technique remains the most widely used approach to predict vibrations due to rock blasting. Over the past several decades, many different empirical formulas have been proposed to estimate the PPV from the amount of charge weight per delay W and the distance between the blasting source and the receiver d (Duvall and Petkof 1959; Langefors and Kihlström 1963; Davies et al. 1964; Ambraseys and Hendron 1968; Bureau of Indian Standards, BIS 1973; Ghosh and Daemen 1983; Pal Roy 1993). Nevertheless, basic flaws in the charge weight scaling laws have been discussed by Blair (2004) in detail. Among these formulas, two of them proposed by Duvall and Petkof (1959) and Ambraseys and Hendron (1968), respectively, are the most common ones. They are often referred to as the square- and cube-root attenuation formula, respectively. The functional forms of these formulas are obtained from the consideration of energy decay:

$$ v = a_{2} \left( {{\text{SD}}_{2} } \right)^{{ - b_{2} }}\, = a_{2} \left( {\frac{d}{\sqrt W }} \right)^{{ - b_{2} }} $$
(1)
$$ v = a_{3} \left( {{\text{SD}}_{3} } \right)^{{ - b_{3} }}\, = a_{3} \left( {\frac{d}{{\sqrt[3]{W}}}} \right)^{{ - b_{3} }} $$
(2)

where v is the PPV value in mm/s; d and W are the distance in m and charge weight per delay in kg; \( SD_{k} = d/\sqrt[k]{W} \) (\( k = 2,3 \)) is called the scaled distance; and \( a_{k} \) and \( b_{k} \) (\( k = 2,3 \)) are two positive fitting constants calibrated from a given database of blast records. Traditionally, the cube-root scaling law is used for airblast and is rarely used to describe ground vibration. It is worth noting that the formulas are empirical. Therefore, the coefficient values cannot be derived in an analytical manner.

To estimate the regression coefficients, the vibration records of a project are collected and plotted in the \( \ln v - \ln {\text{SD}} \) space. Data points are always scattering severely due to the fact that each blast is carried out under different conditions. In other words, there are other influencing factors not being incorporated into the formula. They include the difference in geology, the type of explosives, the geometry of the explosions, the blast timing, etc. A simple linear regression technique has been used to derive the “best-fitted” line for the collected data. Various statistical approaches can then be undertaken to derive the confidence level of the blast-induced vibration. By assuming a log-normal occurrence of the PPV at any particular scaled distance, Dowding (1985, 1996) presented an approximate statistical method to derive the 84 and 95 % confidence lines of the PPV using the standard error of the prediction (Benjamin and Cornell 1970). The 95 % confidence line refers to the PPV value that will not be exceeded in 95 out of 100 blasts. In other words, the 95 % confidence line can be viewed as the 95th percentile line. Equations (3a) and (3b) summarize the respective confidence lines:

$$ v_{84\,\%} = 10^{{\hat{v}\left( {\hat{a},\hat{b};{\text{SD}}} \right) + {\text{SE}}\left( {\log \left( v \right),\log \left( {\text{SD}} \right)} \right)}} $$
(3a)
$$ v_{95\,\% } = 1.645 \times 10^{{\hat{v}\left( {\hat{a},\hat{b};{\text{SD}}} \right) + {\text{SE}}\left( {\log \left( v \right),\log \left( {\text{SD}} \right)} \right)}} $$
(3b)

where \( v_{84\,\% } \) and \( v_{95\,\% } \) are the 84 and 95 % confidence lines, respectively; \( \hat{v}\left( {\hat{a},\hat{b};{\text{SD}}} \right) \) is the predicted \( v \) at \( {\text{SD}} \) based on the best-fitted parameters \( \hat{a} \) and \( \hat{b} \) derived from regression analysis; and \( {\text{SE}}\left( {\log \left( v \right),\log \left( {SD} \right)} \right) \) is the standard error of the predicted \( \log \left( v \right) , \) which is given by:

$$ {\text{SE}}\left( {\log \left( v \right),\log \left( {\text{SD}} \right)} \right) = {\text{SE}}\left( {Y,X} \right) = \sqrt {\frac{1}{N - 2}\left[ {\sum {\left( {Y - \bar{Y}} \right)^{2} } - \frac{{\left[ {\sum {\left( {X - \bar{X}} \right)\left( {Y - \bar{Y}} \right)} } \right]^{2} }}{{\sum {\left( {X - \bar{X}} \right)^{2} } }}} \right]} $$
(4)

in which \( Y = \log \left( v \right) \) and \( X = \log \left( {\text{SD}} \right) \) for brevity, and N is the total number of datasets. This is referred to as Method I in this investigation. It is worth noting that a set of commonly used empirical fitting coefficients in Hong Kong was derived based on this approach (Li and Ng 1992).

Statistically, a confidence interval for a new observation \( Y_{\alpha } \) can be evaluated by the following equation (Draper and Smith 1981):

$$ Y_{\alpha } = \hat{Y} \pm t\left( {0.5 + \frac{\alpha }{2},{\text{dof}}} \right) \times {\text{SE}}\left( {Y,X} \right) \times \sqrt {1 + \frac{1}{N} + \frac{{\left( {X - \bar{X}} \right)^{2} }}{{\sum {\left( {X - \bar{X}} \right)^{2} } }}} $$
(5)

where Y and X is assumed to be linearly related; \( \hat{Y} \) is the prediction based on the best-fitted parameters resulting from linear regression analysis; \( \alpha \) denotes the confidence interval, say 0.9; \( t \) represents the Student’s t distribution; dof denotes the degrees of freedom, which is N − 2 in this case; and \( {\text{SE}}\left( {Y,X} \right) \) is the standard error of the prediction referring to Eq. (4). This is referred to as Method II in this study.

Finally, engineers can use the confidence line as a design curve to determine the appropriate amount of explosives for each blast operation, as illustrated in the following. A horizontal line at a prescribed PPV limit cuts the design curve at a scaled distance. This scaled distance can be used to calculate the charge weight per delay of the blasts when the distance between the blast source and the sensitive receiver is known. As a result, the amount of charge weight per delay can be determined.

Recently, Blair (2011a) presented a probabilistic analysis in which the probability of exceeding a prescribed PPV limit can be directly computed using a simple spreadsheet when the set of site-specific vibration data is given. The strength of this method is that only the recorded data are needed and the only assumption involved is that the scatter in data in forthcoming blast activities will be similar to that of the previously recorded blasts. The method offers a simple yet robust way to plan a blast.

3 Blasting Projects and Vibration Limits in Hong Kong

In Hong Kong, the use of explosives is controlled by the Hong Kong Ordinance Chapter 295: Dangerous Goods Ordinance. The regulations are monitored and controlled by the Mines Division of the Civil Engineering and Development Department (CEDD). A blasting assessment report should be prepared in accordance to the guidelines/requirements given by the Geotechnical Engineering Office (GEO 2006) and Buildings Department (2007). Various guidelines have been provided in Hong Kong when blasting is adopted for site formation/foundation works (Wong and Pang 1992; Buildings Department 2004, 2007, 2009; Hong Kong Special Administrative Region, HKSAR 2006; etc.). In general, a prescribed PPV limit of 25 mm/s is adopted regardless of the vibration frequency. However, this limit could become more stringent when other stakeholders are involved. For instance, the PPV limit for water-retaining structures, electric power stations, and historical monuments could drop to 13, 11, and 5 mm/s, respectively, as specified by different parties. Furthermore, a no-blast zone can be set up depending on the nature of the sensitive receivers. For example, blasting is generally not permitted to be carried out within 3 m of a tunnel owned by the Mass Transit Railway Corporation Limited or within the protective zones of the Water Supplies Department unless prior approval is obtained.

In recent years, there has been a sharp increase in the number of infrastructure projects in Hong Kong. These projects can be broadly divided into three main categories according to their functions: (1) transportation systems, mainly due to the extension lines of the underground transits and some other road tunnels; (2) drainage systems, which were designed to relieve the flooding risk of some low-lying urban areas in Hong Kong; and (3) site formation, the restoration of quarry or hilly terrain for land development. Furthermore, the HKSAR Government has been promoting the development of rock caverns as an option with great potential to increase the land supply in Hong Kong, whereas other options like reclamation, redevelopment, and land rezoning and resumption are more controversial and difficult to be implemented. Therefore, increasing rock drill-and-blast activities have taken place in the past few years and this trend is expected to continue in the coming years or even decades. In Hong Kong, at the early stage of a blasting project when insufficient field monitoring records are available to establish a more precise site-specific fitting formula, the fitting coefficients (and, thus, formula) suggested by Li and Ng (1992) are often referred. Li and Ng (1992) summarized 520 sets of blasting data which were collected between 1984 and 1991. The database contained blast records taken from various types of projects, including site formation, quarrying, tunneling, caisson sinking, excavation, and submarine blasting. Blasting can be categorized into confined and free-face blasts. Confined blasting includes most of the tunnel blasts and sink blasts, while free-face blasting includes site formation and excavation.

Linear regression analysis of the records was undertaken to determine the best-fitted coefficients \( a_{2} \) and \( b_{2} \) in Eq. (1) and, thus, the best-fitted line. As mentioned in the previous section, Li and Ng (1992) then constructed the 84 and 95 % confidence lines following the method suggested by Dowding (1996), as shown in Eqs. (3a) and (3b). Table 1 summarizes the fitting constants from their study. Before sufficient site-specific vibration records are available, the Hong Kong Government accepts the 84 % confidence line as proposed by Li and Ng (1992) to be adopted in the blast design of a different project. As presented earlier, the amount of explosive charge per delay can then be determined based on the known physical distance between the blast source and the receiver, and the prescribed PPV limit.

Table 1 Fitting coefficients suggested by Li and Ng (1992)

Nevertheless, the use of the fitting coefficients provided by Li and Ng (1992) in a different project is definitely not ideal. The coefficients simply miss out any site-specific characteristics of the project, such as the geological setting and material properties. It is, therefore, highly desirable to obtain site-specific fitting coefficients and, thus, to use the design formula containing these coefficients for subsequent blasting activities. However, site-specific measurements can only be obtained in a progressive manner. The reliability of a PPV-scaled distance relation often becomes questionable when very few data are used to calibrate the fitting coefficients. In Hong Kong, the number of minimum blast records required to derive the site-specific fitting constants is advised by the Mines Division on a case-by-case basis.

More importantly, the modeling uncertainty is not explicitly taken into account in the construction of the confidence line in either Methods I or II. The modeling uncertainty becomes more critical when site-specific fitting coefficients are derived at the early stage of a project where only very limited monitored data are available. In this case, the uncertainty of the model parameters, and, hence, the model output, is so large such that negligence of the modeling uncertainty induces substantial error in establishing the confidence interval. A Bayesian probabilistic method is presented in this study to resolve this problem. Details of the method are presented next.

4 Bayesian Framework

Consider a database D with N blast records that comprise the measurements of v, d, and W. The scaled distance \( {\text{SD}}_{2} \) can then be computed from d and W. With the Bayesian probabilistic approach, the following objectives can be achieved: (a) to estimate the optimal model parameters \( a_{2} \) and \( b_{2} \) in the empirical relation; (b) to quantify their associated uncertainty by evaluating their posterior distribution; and (c) to quantify the uncertainty of the empirical formula output for the construction of the confidence interval of any prescribed reliability levels.

The square-root formula shown in Eq. (1) is used here to illustrate the computational concept of the proposed method. It should be noted that the same procedure is also valid for Eq. (2) when airblast is concerned. First, Eq. (1) is rewritten as a linear regression problem:

$$ \ln v_{m} = \ln v + \varepsilon = \ln a_{2} - b_{2} \ln {\text{SD}}_{2} + \varepsilon $$
(6)

where \( v_{m} \) is the measured PPV value and \( \varepsilon \) represents the model output error in the logarithmic scale. In the \( \ln v - \ln {\text{SD}}_{2} \) space, the term \( \ln a_{2} \) represents the vertical intercept of the fitting line at \( {\text{SD}}_{2} = 1 , \) while \( - b_{2} \) gives its slope. The model output error, which is a collective quantity of the measurement noise and the modeling error, is assumed to be Gaussian with zero mean and variance \( \sigma_{\varepsilon }^{2} . \) In this case, the modeling error includes the deficiency of this simple empirical formula in considering the underlying problem. This includes the negligence of the characteristics of wave-transmitting media, the geometric setting of the blast area, the blast-hole configurations, the type of explosives, etc. The variable \( \sigma_{\varepsilon }^{2} \) represents the variance of the model output error and it is an unknown parameter to be estimated using the measurements. The updated/posterior probability density function (PDF) of the model parameters given database D can be expressed as (Yan et al. 2009):

$$ p(\ln a_{2} ,b_{2} ,\sigma_{\varepsilon }^{{}} |D) = c_{0} p(\ln a_{2} ,b_{2} ,\sigma_{\varepsilon }^{{}} )(2\pi )^{ - N/2} \sigma_{\varepsilon }^{ - N} \exp \left[ { - \frac{N}{{2\sigma_{\varepsilon }^{2} }}J_{g} (\ln a_{2} ,b_{2} |D)} \right] $$
(7)

where c 0 is the normalizing constant such that the volume under the updated PDF is unity and \( p(\ln a_{2} ,b_{2} ,\sigma_{\varepsilon }^{{}} ) \) is the prior PDF of the uncertain parameters to express the user’s judgment about the relative plausibility of the values of these parameters before the data are obtained. In this study, a non-informative prior distribution is employed so that the prior PDF can be absorbed into the normalizing constant. As a result, the Bayesian inference will rely solely on the likelihood of the measurements. The function \( J_{\text{g}} (\ln a_{2} ,b_{2} |D) \) is the goodness-of-fit function which indicates how well the model fits the data (Yuen and Kuok 2011) and it takes the following form:

$$ J_{\text{g}} (\ln a_{2} ,b_{2} |D) = \frac{1}{N}\sum\limits_{i = 1}^{N} {\left[ {\ln v\left( {a_{2} ,b_{2} ;{\text{SD}}_{2(i)} } \right) - \ln v_{m(i)} } \right]}^{2} $$
(8)

where \( v(a_{2} ,b_{2} ;{\text{SD}}_{2(i)} ) \) is the model output of the ith record with parameters \( \ln a_{2} \) and \( b_{2} , \) and \( v_{m\left( i \right)} \) is the corresponding measured value. This function gives the variance of the fitting error between the measurements and the corresponding model outputs. The optimal model parameters \( \ln \hat{a}_{2} \) and \( \hat{b}_{2} \) can be obtained by maximizing the updated PDF \( p(\ln a_{2} ,b_{2} ,\sigma_{\varepsilon }^{{}} |D) \) in Eq. (7) (the symbol \( \hat{x} \) is used to denote the optimal value of the variable \( x \)).

There are three unknowns (\( \ln a_{2} , \) \( b_{2} , \) and \( \sigma_{\varepsilon }^{2} \)) for the empirical model considered in Eq. (1). Nevertheless, with a non-informative prior distribution, the optimization problem can be decomposed into two sub-problems: (i) to minimize the objective function \( J_{g} \) with respect to the model parameters \( \ln a_{2} \) and \( b_{2} , \) and (ii) to maximize the updated PDF in Eq. (7) with respect to \( \sigma_{\varepsilon }^{2} \) given the model parameters from step (i). The first optimization problem can be tackled by solving simultaneously \( \partial J_{\text{g}} /\partial \ln a_{2} = 0 \) and \( \partial J_{\text{g}} /\partial b_{2} = 0 , \) and this yields the following closed-form solution of the optimal parameters (Yuen 2010a):

$$ \left[ {\begin{array}{*{20}c} {\ln \hat{a}_{2} } \\ {\hat{b}_{2} } \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} N & { - \sum\limits_{i = 1}^{N} {\ln {\text{SD}}_{2\left( i \right)} } } \\ { - \sum\limits_{i = 1}^{N} {\ln {\text{SD}}_{2\left( i \right)} } } & {\sum\limits_{i = 1}^{N} {\left[ {\ln {\text{SD}}_{2\left( i \right)} } \right]^{2} } } \\ \end{array} } \right]^{ - 1} \left[ {\begin{array}{*{20}c} {\sum\limits_{i = 1}^{N} {\ln v_{m\left( i \right)} } } \\ { - \sum\limits_{i = 1}^{N} {\ln {\text{SD}}_{2\left( i \right)} \ln v_{m\left( i \right)} } } \\ \end{array} } \right] $$
(9)

After obtaining the optimal model parameters \( \ln \hat{a}_{2} \) and \( \hat{b}_{2} , \) the fitting error variance \( \hat{\sigma }_{\varepsilon }^{2} \) can be found by solving \( \partial p(\ln a_{2} ,b_{2} |D)/\partial \sigma_{\varepsilon } = 0 . \) By Beck and Katafygiotis (1998), it is equal to the minimum of the goodness-of-fit function:

$$ \hat{\sigma }_{\varepsilon }^{2} = \mathop {\hbox{min} }\limits_{{\ln a_{2} ,b_{2} }} J_{g} (\ln a_{2} ,b_{2} |D) = J_{\text{g}} (\ln \hat{a}_{2} ,\hat{b}_{2} |D) $$
(10)

Furthermore, the posterior covariance matrix \( \varSigma_{{\ln a_{2} ,b_{2} }} \) for the model parameters can be computed as follows (Yuen 2010a):

$$ \varSigma_{{\ln a_{2} ,b_{2} }} = \hat{\sigma }_{\varepsilon }^{2} \left[ {\begin{array}{*{20}c} N & { - \sum\limits_{i = 1}^{N} {\ln {\text{SD}}_{2(i)} } } \\ { - \sum\limits_{i = 1}^{N} {\ln {\text{SD}}_{2(i)} } } & {\sum\limits_{i = 1}^{N} {\left[ {\ln {\text{SD}}_{2(i)} } \right]^{2} } } \\ \end{array} } \right]^{ - 1} $$
(11)

This allows for the computation of not only the posterior variance of \( \ln a_{2} \) and \( b_{2} , \) but also the correlation coefficient between them. In the following sections, two different approaches, Monte Carlo simulation and direct computation, respectively, involving different degrees of computational effort, to derive the confidence intervals of the vibration formula will be presented.

5 Reliability Analysis

5.1 Monte Carlo Simulation

Instead of obtaining a single empirical relationship using the optimal values for \( \ln a_{2} \) and \( b_{2} , \) the posterior PDF \( p(\ln a_{2} ,b_{2} ,\sigma_{\varepsilon } |D) \) in Eq. (7) is used to generate numeric samples of the parameters for the quantification of the modeling uncertainty. Specifically, a large number of samples \( N_{\text{s}} \) of these parameters (\( \ln a_{2(j)} , \) \( b_{2(j)} , \) \( \sigma_{\varepsilon (j)} ; \) \( j = 1,2, \ldots ,N_{\text{s}} \)) can be simulated efficiently by the Markov chain Monte Carlo simulation (Metropolis et al. 1953; Hastings 1970). Then, the jth PPV sample can be estimated by Eq. (6), as shown below:

$$ \ln v_{(j)} = \ln a_{2(j)} - b_{2(j)} \ln {\text{SD}}_{2} + \varepsilon_{(j)} $$
(12)

where the measurement noise \( \varepsilon_{(j)} \) can be generated by the Gaussian generator with zero mean and variance \( \hat{\sigma }_{\varepsilon (j)}^{2} . \) By doing so, a large collection of \( N_{\text{s}} \) PPV samples at any values of scaled distance can be generated. Then, at any particular scaled distance, the PPV values are sorted to identify the corresponding value at a prescribed percentile level. For example, the 5th and 95th percentile levels are used in this study. Therefore, the lower and upper bounds of the PPV for the desired confidence levels can be obtained under such a scaled distance (e.g., the 5th and 95th percentiles give a 90 % confidence interval). These steps are repeated for different scaled distances and, thus, a confidence region for different scaled distance values can be obtained. It is referred to as Method III in this study.

5.2 Direct Computation

Alternatively, a direct approach can be used with less computation. First, Eq. (6) can be rewritten as follows:

$$ \ln v_{m} = \left[ {\begin{array}{*{20}c} 1 & { - \ln {\text{SD}}_{2} } \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} {\ln a_{2} } \\ {b_{2} } \\ \end{array} } \right] + \hat{\sigma }_{\varepsilon } \bar{\varepsilon } = {\mathbf{X}}^{\text{T}} {\mathbf{C}} + \hat{\sigma }_{\varepsilon } \bar{\varepsilon } $$
(13)

where \( {\mathbf{X}} = \left[ {\begin{array}{*{20}c} 1 & { - \ln SD_{2} } \\ \end{array} } \right]^{\text{T}} , \) \( {\mathbf{C}} = \left[ {\begin{array}{*{20}c} {\ln a_{2} } & {b_{2} } \\ \end{array} } \right]^{\text{T}} , \) and \( \bar{\varepsilon } \) follows the standard Gaussian distribution (denoted by \( N\left( {0,1} \right) \)). The variance of \( \varepsilon \) is taken as the optimal value \( \hat{\sigma }_{\varepsilon }^{2} \) in Eq. (10) to simplify the calculation. The variance of the first term on the right-hand side of Eq. (13) can be written as:

$$ {\text{Var}}\left( {\ln v} \right) = {\mathbf{X}}^{\text{T}} {\text{E}}\left[ {\left( {{\mathbf{C}} - {\hat{\mathbf{C}}}} \right)\left( {{\mathbf{C}} - {\hat{\mathbf{C}}}} \right)^{\text{T}} } \right]{\mathbf{X}} $$
(14)

where \( {\hat{\mathbf{C}}} \) is the optimal parameter vector. By definition, the term \( {\text{E}}[ {( {{\mathbf{C}} - {\hat{\mathbf{C}}}})( {{\mathbf{C}} - {\hat{\mathbf{C}}}})^{\text{T}} }] \) is the posterior covariance matrix \( \varSigma_{{\ln a_{2} ,b_{2} }} \) as shown in Eq. (11) (Yuen 2010a). Therefore, the variance of \( \ln v_{m} \) can be evaluated as follows.

$$ {\text{Var}}\left( {\ln v_{m} } \right) = {\mathbf{X}}^{\text{T}} \varSigma_{{\ln a_{2} ,b_{2} }} {\mathbf{X}} + \hat{\sigma }_{\varepsilon }^{2} $$
(15)

Therefore, it can be seen clearly that the confidence interval bounds are in general parabolic curves instead of straight lines.

The standard deviation \( \sigma_{{\ln v_{m} }} \) of \( \ln v_{m} \) can then be quantified (i.e., \( \sigma_{{\ln v_{m} }} = \sqrt {{\text{Var}}\left( {\ln v_{m} } \right)} \)). The upper and lower bounds of \( \ln v_{m} \) (\( \ln v^{ \pm } \)) with various degrees of confidence levels can be formulated by:

$$ \ln v^{ \pm } = \ln v\left( {\ln \hat{a}_{2} ,\hat{b}_{2} } \right) \pm \chi \sigma_{{\ln v_{m} }} $$
(16)

where \( \chi \) is associated with the confidence level specified by the user. For a confidence level of α, it is given by \( \chi = \varPhi^{ - 1} (0.5 + \alpha /2) , \) where \( \varPhi^{ - 1} \) is the quantile function [i.e., the inverse function of the cumulative distribution function (CDF) of the standard Gaussian random variable]. For instance, \( \chi \) = 1.645 and 1.96 give curves enclosing 90 and 95 % of the probability for the prediction of \( \ln v_{m} , \) respectively. This is referred to as Method IV. Both Methods III and IV originate from the same Bayesian approach, but Method IV is much simpler than Method III (Monte Carlo simulation), as less computational effort is required. It is favorable to practicing engineers in comparison with Method III since the analysis can be done simply by a standard package of spreadsheets without the need for any add-on tools. The “Appendix” shows the template of such a spreadsheet for readers’ reference. Detailed steps of the procedure are also given.

6 Case Study

Blast-induced vibration data collected from a site formation project in Hong Kong are used to demonstrate the performance of the proposed methods. The project comprises the formation of a ~20-ha platform and the construction of infrastructure facilities, including roads, bridges, retaining structures, and drainage works for residential and community uses. The site is a rocky rugged area developed on granite. The site works commenced on Jan 31, 2008 and the tasks are expected to be completed between 2011 and 2014 in stages. A large amount of rock blast activities have been carried out in the three major blasting zones inside the site. Both cartridge and bulk emulsion are used as explosives and the blasts are initiated by the Austin non-electric detonator. A total of 194 sensitive receivers have been identified in the vicinity of the site, including high-rise residential blocks, slopes, roads, schools, transformer room, water reservoir, etc.

Ground vibration is monitored using a NOMIS 7000 seismograph. The sensor is able to record vibration in the range 0.127 and 254 mm/s (0.005 and 10 in/s). The vibration data recorded on slopes (identified as one of the sensitive receivers) are used for the analyses presented in this paper. Note that the same procedure is applicable to other sensitive receivers. In a blast record, the maximum vibration among the vibrations in three orthogonal directions is used as the PPV in the analysis. A total of 957 vibration records of bench blasting using cartridge emulsion were collected for blasts that had taken place from November 2008 to February 2011. In this study, the records are first separated into two groups. The first group contains only the blast records collected at the early stage of the project (N 1 data points), while the second group contains the remaining records (N 2 data points). As a result, N 1 + N 2 = 957. Later in this paper, the effect of N 1 on the prediction of vibration will be examined.

The records in the first group are used to estimate the fitting coefficients of the empirical formula and, thus, to provide an estimation of the PPV with the scaled distance according to the methods shown in Eqs. (3a), (3b), and (5), Methods I and II, respectively. Furthermore, the same N 1 data points are fed into the proposed Bayesian methods and the confidence interval can be constructed based on the aforementioned reliability analyses. The effect of N 1 on the prediction performance of the formula can be studied. The entire database (957 data) is then used to assess the prediction performance of these approaches. Table 2 summarizes the number of blast records and ranges of parameters at different stages of the project. It is noted that the database for N 1 = 119 and N 1 = 153 contain the same ranges of PPV, W, d, and SD2. However, the data distributions are slightly different, as the latter contains more data points. The analyses will be carried out for the cases of N 1 = 40, 82, 119, 153, and 196, corresponding to the first few stages of the project.

Table 2 Summary of data records

7 Results

7.1 Fitting Coefficients for N 1 = 82

In this section, the results obtained by analyzing the database with N 1 = 82 (records collected up to Feb 27, 2009; 3 months of blasting) are used to demonstrate the proposed Bayesian methods. Note that similar analyses are performed also for other values of N 1, namely, 40, 119, 153, and 196 (see Table 2) to investigate the effect of N 1 and the results will be shown in a later section. Figure 1 shows the 82 PPV records plotted against the corresponding scaled distance. Figure 2 shows the conditional posterior PDF \( p(\ln a_{2} ,b_{2} |D,\hat{\sigma }_{\varepsilon }^{{}} ) \) computed using Eq. (7) and its contours. The optimal coefficients are found to be \( \ln \hat{a}_{2} = 8.216 \) and \( \hat{b}_{2} = 1.884 \) (for N 1 = 82), at which the goodness-of-fit function \( J_{\text{g}} (\ln a_{2} ,b_{2} |D) \) is minimized. The solid line shown in Fig. 1 represents this “best-fitted” line which is associated with the optimal coefficients. Table 3 summarizes the optimal values (^), and covariance (\( \sigma^{2} \)) of the estimated parameters \( \ln a_{2} , \) \( b_{2} , \) and \( \varepsilon \) for different N 1. Furthermore, the column \( \rho \left( {\ln a_{2} ,b_{2} } \right) \) shows the correlation coefficients between \( \ln a_{2} \) and \( b_{2} \) for different cases. One can see that the two model parameters are highly correlated (\( \rho > 0.99 \)).

Fig. 1
figure 1

Vibration data collected at the site (N 1 = 82)

Fig. 2
figure 2

Posterior probability density function (PDF) of ln a 2 and b 2 (N 1 = 82)

Table 3 Statistics of the obtained fitting coefficients for different N 1

7.2 Confidence Interval for N 1 = 82

The confidence line (CL) or confidence interval (CI) is constructed by the four different approaches previously mentioned: (Method I) approximate method by Dowding (1996) [see Eqs. (3a) and (3b)], which derives CL; (Method II) statistical approach shown in Eq. (5), which gives CI; (Method III) Monte Carlo simulation for CI; and (Method IV) direct computation for the CI. Details of each method are shown as follows.

In Method I, the best-fitted line is first obtained by linear regression analysis with N 1 data points (82 in this case, represented by diamonds in Fig. 3). It gives \( a_{2} = 3699, \, b_{2} = 1.884{\text{ , and }}R^{2} = 0.677 . \) The 95 % CL, which is represented by the dotted line in Fig. 3, is then constructed using Eq. (3b). All 957 blast data points (denoted by the triangles) are then plotted in the same figure. Except for two data points, they appear to show a minimum PPV of about 1.1 mm/s (1.143 mm/s by reading from the monitoring records). This is actually not the minimum detectable vibration as specified by the sensor, which is 0.127 mm/s, as mentioned earlier. The vibration magnitude of 1.143 mm/s is simply the maximum among the vibrations in three orthogonal directions recorded in the studied slopes, and this maximum is adopted for the analysis. When other sensitive receivers are studied (for instance, transformer rooms), smaller maximum PPV (<1.1 mm/s) can be found. However, the reason why many blasts tend to give a maximum PPV of about 1.1 mm/s on the studied slopes in an individual blast remains unclear. The performance of Method I is evaluated when the dotted line in Fig. 3 is compared to all the collected data. Among the total of 957 records, it is found that 51 points fall above this 95 % CL. It may give an impression that the result is, indeed, very good (51/957 = 5.3 % = 5 %) and Method I performs well. This issue will be revisited later when the results from all four methods of analysis with different N 1 are ready.

Fig. 3
figure 3

Vibration data collected at the site. The best-fitted line (solid line) is obtained by linear regression, whereas the 95 % confidence line (dotted line) is derived according to Eq. (3b) (Method I)

The performance of Method II is then investigated. Figure 4 shows the best fitted line (solid line) and the 5th and 95th percentile lines (dotted lines), which offer a 90 % confidence interval, derived according to Eq. (5) based on N 1 = 82. It is found that 81 (81/957 = 8.5 %) and 87 (9.1 %) data points, among the 957 data records, fall below and above the two percentile lines, respectively.

Fig. 4
figure 4

Vibration data collected at the site. The dotted lines represent the 90 % confidence interval evaluated by Eq. (5) based on the first 82 data records (Method II)

Next, the results obtained using the proposed Monte Carlo approach is presented. First, note that the best-fitted line of the proposed approach is identical to conventional linear regression and the difference is in the confidence interval. According to the posterior PDF of the model parameters obtained from the Bayesian analysis, Markov chain Monte Carlo simulation is carried out to generate 500,000 samples of the uncertain parameters (\( N_{\text{s}} = 500{,}000 \)). In other words, 500,000 sets of \( \ln a_{2} , \) \( b_{2} , \) and \( \varepsilon \) can be obtained. For any scaled distance value, the corresponding values of \( \ln v \) (and, thus, PPV) can be computed by using Eq. (12). In other words, 500,000 samples of the PPV value are obtained for any particular scaled distance value. The PPV values are then sorted to determine the percentile positions (i.e., the PPV values for the 5th and 95th percentiles in this application). Finally, by drawing the two curves to connect these percentile positions at different scaled distances, the confidence interval can be obtained. In this problem, it is found that the bounds of the CI are slightly curved in the \( \ln v - \ln {\text{SD}}_{2} \) space, and they are depicted in Fig. 5. The solid line represents the 50th percentile line of the 500,000 simulated PPV values, while the dotted lines (5th and 95th percentiles, respectively) enclose 90 % of the simulated vibration records. It is found that 83 and 96 records (i.e., 8.6 % and 9.8 % of the total of 957 records) fall below and above the 5th and 95th percentile lines, respectively.

Fig. 5
figure 5

Vibration data collected at the site. The dotted lines represent bound curves enclosing 90 % of the data from the Monte Carlo simulation (N s = 500,000) based on the first 82 data records (Method III)

Lastly, results of the direct computation method are presented. Equation (15) is used to quantify the variance of \( \ln v_{m} , \) which is summarized in Table 4. The term \( {\mathbf{X}}^{\text{T}} \varSigma_{{\ln a_{2} ,b_{2} }} {\mathbf{X}} , \) which quantifies the uncertainty of the model parameters, decreases with increasing N 1 at any scaled distance. It is expected because the uncertainty of the model parameters should decrease when more data are collected for the analysis. As shown in the table, \( \sigma_{{\ln v_{m} }}^{2} \) decreases noticeably with increasing N 1 at extreme SD2 (\( {\text{SD}}_{2} = 1{\text{ or 1000 m/}}\sqrt {kg} \)). At intermediate values of SD2, the change of \( \sigma_{{\ln v_{m} }}^{2} \) with N 1 is minor. In fact, most of the collected data fall into the range \( 10 \le {\text{SD}}_{2} \le 100{\text{ m/}}\sqrt {kg} . \) With a choice of \( \chi = 1.645 , \) the 5th and 95th percentiles of \( \ln v_{m} \) (i.e., enclosing 90 % of the data) can be computed. It is worth noting that a different choice of \( \chi \) will give bounds with different confidence intervals. Figure 6 shows the results of direct computation. The obtained percentile lines locate in essentially the same position as those derived from the Monte Carlo simulation. The results should be expected, as the two methods are essentially equivalent. It is found that 8.6 and 9.7 % of the data lie below and above the 5th and 95th percentile lines, respectively, when all blast records are used to quantify the performance of this direct computation method. The “Appendix” illustrates how a spreadsheet can be developed to carry out the study easily.

Table 4 Variance of the peak particle velocity (PPV) derived from different N 1 records
Fig. 6
figure 6

Vibration data collected at the site. The dotted lines represent bound curves enclosing 90 % of the data from direct computation based on the first 82 data records (Method IV)

7.3 Effect of the N 1 Value

It is expected that the reliability of the confidence interval can be enhanced by using more data points for the Bayesian analysis to update the posterior PDF of the fitting coefficients. This is because more information about the model can be reflected by a larger number of data points N 1. The effect of N 1 on the reliability of the model prediction is studied and summarized as follows. As shown in Table 3, the number of data points, N 1, used to evaluate the confidence intervals varies from 40 to 196, corresponding to about 4 to 20 % of the entire dataset. Note that, in a real project, the data are collected in a progressive manner. Increasing N 1 results in the reduction of the standard deviation of both fitting parameters (\( \ln a_{2} \) and \( b_{2} \)). The trend is expected, as more measurements in a database give more information about the model and can, thus, reduce the uncertainty of the model prediction. Table 5 summarizes the effect of N 1 on the performance of the reliability of CL or CI. Method III (Monte Carlo simulation) and Method IV (direct computation) give essentially the same results. For brevity, only the results of Method IV are shown in the table. The following observations can be made:

  1. 1.

    At N 1 = 40, all methods give rather poor predictions. Yet, Method I, among all, appears to give the best performance: 10.9 % PPV of the 957 blasts fall above the 95 % CL, in comparison with 18.4 and 19.9 % for Methods II and IV, respectively.

  2. 2.

    By increasing N 1, all methods show a decreasing amount of PPV falling above the 95 % CL and the 95th percentile line. In particular, Method I shows only 2.4 % of data lying above the 95 % CL when N 1 becomes 196, in comparison with 10.9 % at N 1 = 40.

  3. 3.

    Methods II and IV give very similar results, particularly when N 1 becomes large. The amount of data lying above the 95th percentile line decreases from about 19 to 5 % when N 1 increases from 40 to 196.

Table 5 Effect of N 1 on prediction reliability

7.4 Performance of Various Methods

As shown in Table 5, it can be seen that there are always more than 5 % PPV measurements falling below the 5th percentile of both Methods II and IV. The number has an increasing trend with N 1. It is due to the fact that more conservative blast design is made occasionally in a blasting project, where less explosive is used in some planned blasts. As a result, only small vibration is then recorded in some blasts, which explains the data scattering below the 5th percentile line. In practice, engineers are more concerned with the amount of vibration records which show the PPV exceeding the design limit, rather than those falling below a certain percentile line. When one examines the amount of PPV records lying above the 95 % CL or 95th percentile line predicted by N 1 = 82, Method I gives a close to 5 % prediction, which appears to outperform the other methods. However, it does not imply that Method I is a robust approach. The overconservatism of this method can be readily seen when N 1 increases (less than 3 % of data falling above the 95 % CL at N 1 = 153 and 196). An increase in N 1, yet, does not improve the performance of this method. In other words, the method cannot learn from additional data. On the contrary, Methods II and IV (including Method III) show a clear improvement when an increasing amount of data are used to compute the prediction (i.e., close to 5 % of data is predicted to fall above the 95th percentile line with increasing N 1). Despite the fact that Method II gives very similar results to Method IV, the Bayesian-based Method IV is recommended. This is because, in this newly proposed method, the contribution of uncertainty from fitting parameters and modeling error can be clearly quantified, which offers better understanding of the model prediction.

8 Conclusion

Blast-induced vibration has received much attention over the last several decades. Determination of the explosive charge weight subject to a prescribed vibration limit is an important task. In this paper, two existing methods for the construction of confidence intervals of the blast design formula are reviewed. The problem is then revisited from a Bayesian probabilistic perspective, where uncertainty of the formula output can be explicitly quantified. By using the proposed methods, blast design formula with different confidence intervals can be obtained. Detailed computation of the methods and their performance are illustrated by a recent site formation project in Hong Kong where many rock blasting activities have been undertaken. First, both proposed methods are useful at the early stages of a blasting project when only limited site-specific data are available. They provide reliable confidence intervals of blast-induced vibrations and their results are essentially the same. Furthermore, the direct computation approach is extremely useful, as simple spreadsheet calculation can be developed by the users to conduct the analysis. Neither add-on tools nor complicated iterations are required for the analysis, and this would be very favorable to practicing engineers.