1 Introduction

Blasting is a dominant practice for fragmenting rocks in mining and civil projects. In this operation, only a small portion of the explosives’ energy is really consumed in the process of rock fragmentation [1, 2] and the rest of it is exhausted in the form of unwanted events such as ground vibration, air overpressure, fly rock and back break [3,4,5,6]. Since, in the open pit mines, destination of the blasted rock is the primary crusher for which size distribution of the feed is very important, therefore, blast design should be managed in such a way that crusher performance be reasonable to maintain the whole process economical from mine to mill [7, 8]. However, it should be mentioned that getting a specific size distribution normally is not an easy task because there are some effective factors that not in the hand of blast engineer. Broadly, the most relevant factors affecting the result of a blast can be divided in two categories: uncontrollable (rock mass properties) and controllable (blast geometry and explosive specifications) [9, 10]. Nowadays, Artificial Intelligence (AI) such as artificial neural network (ANN) is utilized for solving complicated problems in various fields of science and engineering [11,12,13,14]. Specifically, many research works are available regarding implementation of AI in prediction of rock fragmentation [15]. In this paper, it was tried to recognize the most effective parameters on rock fragmentation by using various approaches of conventional (regression analysis) and machine learning (ANN and decision tree) methods.

2 Artificial neural network

Artificial neural network is actually an imitation of the human brain [16]. It contains several interconnected layers. In each layer there exist computational components known as neurons. ANN can be applied for solving problems with high non-linearity. Robustness of the ANN can be highlighted in its capability of function approximation and feature selection [17,18,19,20,21,22]. The first step in applying ANN is training which require datasets including inputs and outputs. There are several tactics that can be considered in the training process of the multi-layer perception (MLP), however, back propagation algorithm has more benefits comparing to the other available approaches. MLP network contains at least three main parts known as input, transitional and output layers. Number of the neurons in the transitional layer is determined according to the nature of the problem in hand. In the training, a weight is primarily given to each of the connections between the existing nodes in each of the layers. This initial weight should be modified to examine the efficiency of the network [23,24,25].

The model accuracy is studied by considering the model outputs and the actual measured outputs. Coefficient of determination (R 2), root-mean-square error (RMSE), mean absolute error (MAE) and variance account for (VAF) (Eqs. 14), are normally used to observe the model performance [26]:

$${R^2}=1 - \frac{{\mathop \sum \nolimits_{{i=1}}^{N} {{\left( {y - y^{\prime}} \right)}^2}}}{{\mathop \sum \nolimits_{{i=1}}^{N} {{\left( {y - \tilde {y}} \right)}^2}}},$$
(1)
$${\text{VAF}}=\left[ {1 - \frac{{{\text{VAR}}\left( {y - y^{\prime}} \right)}}{{{\text{VAR}}\left( y \right)}}} \right] \times 100,$$
(2)
$${\text{RMSE}}=\sqrt {\frac{1}{N}\mathop \sum \limits_{{i=1}}^{N} {{\left( {y - y^{\prime}} \right)}^2}} ,$$
(3)
$${\text{MAE}}=\frac{1}{N}\mathop \sum \limits_{{i=1}}^{N} \left( {y - y^{\prime}} \right),$$
(4)

where \(y\), \({y}^{\prime}\)and \(\stackrel{~}{y }\)are the measured, predicted and mean of the y values, respectively, and N is the total number of data.

3 Case studies

In this research, the blast database is taken from previous study results collected in various mines and rock formation in the world [27,28,29,30,31,32,33,34,35,36,37,38] that were combined with blast data collected from the open pit mines of Iran to create the blast database as given in Table 1.

Table 1 Various mines and rock formation of case studies

4 Collection of datasets

Descriptive statistics of the input and output variables are given in Table 2. Parameters such as burden, spacing, stemming, height of bench, hole diameter, powder factor, UCS, UTS, brittleness, Is50, Density, Young’s Modulus, P-wave velocity, Schmidt hardness value, Poisson’s ratio, RQD, Cohesion and friction angles were used as inputs and X50 was selected as output.

Table 2 Variables used for developing models

5 ANN architecture

In this study, a total of 432 datasets were randomly split into training and testing groups. Training of the model was accomplished by back propagation procedure using 342 datasets. The entire datasets were normalized to values between −  1 and 1 to improve the efficiency of the training process. Afterwards, several models with different network elements (number of neurons in hidden layer, transfer function, etc.) were constructed to find out the most appropriate configuration with lowest error. MAE, RMSE, VAF and R2 were determined for the various network structures (Table 3). From this table, it is seen that the best case is a back propagation network with an architecture 19-28-1 having the hyperbolic-tangent transfer function in hidden layer and exponential transfer function in output layer (No.12). Figure 1 shows the optimum architecture of ANN model. An illustrative plot of measured versus predicted fragmentation using the ANN model is reported in Fig. 2. The result showed that the training R2 was 0.99, which indicates that the designed ANN was capable to predict the fragmentation with the least error.

Table 3 MAE, RMSE, VAF and R 2 for some of the models
Fig. 1
figure 1

Architecture of ANN model

Fig. 2
figure 2

Correlation between measured and predicted X50 in ANN model (Training accuracy)

6 Multivariate linear regression (MLR)

Multivariate linear regression analysis was used to assess the mapping between the input and output parameters. MLR is widely used in various branches of science and technology [39, 40]. Equation 5 shows the results obtained from the regression analysis. Correlation between measured and predicted fragmentation using the MLR model is shown in Fig. 3.

Fig. 3
figure 3

Correlation between measured and predicted X 50 in MLR model

$$\begin{aligned} {X_{50}} & =0.009(B)+0.004(S) - 0.003(H) - 0.0002(D) \hfill \\ &\quad - 0.001({\text{ST)}} - 0.33(PF)+0.165({X_B}) - 0.002(I{S_{50}}) \hfill \\ & \quad +0.004({\text{UCS}}) - 0.014({\text{UTS}}) - 0.006({\text{BT}}) \hfill \\ & \quad - 0.024(\rho ) - 0.0008(E) - 0.12({V_p})+0.006({\text{SHV}}) \hfill \\ & \quad +0.035(\vartheta ) - 0.003({\text{RQD}})+0.68(C)+0.005(\varphi ). \hfill \\ \end{aligned}$$
(5)

As shown in Eq. 5, parameters including burden, spacing, mean in-situ block size (X B), uniaxial compressive strength (UCS), Schmidt hardness value (SHV), Poisson’s ration (υ), cohesion (C) and friction angle (φ) have a direct relationship with X 50. Whereas, bench height, hole diameter, stemming, powder factor, UTS, brittleness, Is50, density, Young’s modulus, P-wave velocity and RQD have an inverse relationship with X 50.

7 Classification and regression tree (CART)

The decision tree is one of the hierarchical techniques extensively used for classification and regression because of its interpretability and efficacy [41]. There are several decision tree algorithms that can be applied to regression problems, however, CART (classification and regression tree) has substantial advantages comparing to the other existing approaches [42,43,44,45]. In this paper, MatLab software was used to predict rock fragmentation using CART model. Figure 4 shows the appropriate tree built for predicting the X50. The correlation between measured and predicted X50 using CART model is shown in Fig. 5.

Fig. 4
figure 4

CART model developed for predicting the X 50

Fig. 5
figure 5

Correlation between measured and predicted X 50 in CART model

8 Performance evaluation of the models

Model evaluation of the obtained MLR, CART and ANN models was performed applying 90 test datasets which were not used in the model development. Table 4 shows the calculated values of validation indexes for all three models. According to Table 4, it can be seen that the developed ANN model with the obtained values of 0.00009, 0.0095, 98.6% and 0.986 in the validation phase for MAE, RMSE, VAF and R 2, respectively, is superior compared to MLR model with these values of 0.001, 0.033, 83.41% and 0.836, respectively. Furthermore, comparing the obtained results from CART model showed the low competence of it to predict rock fragmentation precisely. The correlation between predicted and measured X50 using all three models are shown in Figs. 6, 7, 8, 9, 10 and 11. Altogether, these figures demonstrate that the ANN model has the best performance in prediction of X 50 in comparison to the other models.

Table 4 Calculated validation indices for the ANN, MLR and DT models
Fig. 6
figure 6

Correlation between measured and predicted X 50 in MLR model

Fig. 7
figure 7

Comparison of predicted and measured rock fragmentation in MLR model

Fig. 8
figure 8

Correlation between measured and predicted X 50 in CART model

Fig. 9
figure 9

Comparison of predicted and measured rock fragmentation in CART model

Fig. 10
figure 10

Correlation between measured and predicted X 50 in ANN model

Fig. 11
figure 11

Comparison of predicted and measured rock fragmentation in ANN model

9 Sensitivity analysis

To evaluate the influence degrees of uncontrollable and controllable parameters on rock fragmentation, a sensitivity analysis using the ANN model based on the relevancy factor (RF) was carried out [46]. The RF values can be calculated by Eq. 6:

$${\text{RF}}=\left| {~\frac{{\mathop \sum \nolimits_{{i=1}}^{n} \left( {{x_{l,i}} - {{\bar {x}}_l}} \right)({y_i} - \bar {y})}}{{\sqrt {\mathop \sum \nolimits_{{i=1}}^{n} {{\left( {{x_{l,i}} - {{\bar {x}}_l}} \right)}^2}\mathop \sum \nolimits_{{i=1}}^{n} {{\left( {{y_i} - \bar {y}} \right)}^2}} }}~} \right|,$$
(6)

Where \({x}_{l,i}\) and \({\stackrel{-}{x}}_{l}\) are the ith value and the average value of the lth input variable, respectively, \({y}_{i}\) and \(\stackrel{-}{y}\) are the ith value and the average value of the predicted output, respectively.

As it is observed in the Fig. 12, it was concluded that in comparison of controllable parameters, uncontrollable parameters are more effective on rock fragmentation. In this regard, from the prior group, Point Load Index, uniaxial compressive strength, Poisson’s ratio, cohesion and rock quality designation, respectively, are the most important parameters on rock fragmentation and from the second group, stemming, spacing and hole diameter are the least important parameters on the quality of rock fragmentation.

Fig. 12
figure 12

Sensitivity analysis of the input parameters

10 Conclusions

In this paper, artificial neural network, decision tree and regression analysis was implemented to investigate the effect of uncontrollable and controllable parameters on the fragmentation quality in the blasting operation. For this study, a database was prepared from several mines situated in different parts of the world. In the first step, superiority of the different models was inspected from which competence of the neural network modeling was explored. The values of MAE, RMSE, VAF and R2 for ANN model were 0.00009, 0.0095, 98.6% and 0.986, respectively. According to outcomes of the network modeling, it was generally concluded that compared to controllable parameters, uncontrollable parameters are more effective regarding fragmentation. In this respect, from the uncontrollable parameters, Point Load Index, uniaxial compressive strength, Poisson’s ratio, cohesion and rock quality designation, respectively, are the most effective factors on fragmentation quality and from the controllable parameters, stemming, spacing and hole diameter are the least effective factors in this regard.