1 Introduction

The construction of underground infrastructure and the exploitation of underground resources require venturing further deep into the earth. With the increasing complexity of deep geological environments and higher geo-stress levels, deep underground engineering projects have seen rockbursts that seriously impact construction and production safety (Li et al. 2019a, b; Feng et al. 2019, 2021; Liu et al. 2019; Zhang et al. 2020a, b, c, d). The microseismic monitoring technology used in this field can help continuously monitor and obtain the elastic waves of surrounding rock fractures for 24 h. The changes in the surrounding rock mechanics and fracture behavior can be described using the quantitative source theory, i.e., the time series law of microseismic parameters, so as to judge the risk of rockbursts (Liu et al. 2017a; De Santis et al. 2019; Li et al. 2019a, b; Kumar et al. 2019; Xiao et al. 2019; Dip et al. 2021). Thus, the microseismic monitoring technology can be used to reveal the mechanism and characteristics of rockbursts, providing an effective tool for their interpretation and evaluation in the field of deep underground engineering. The microseismic multi-parameters (such as the microseismic event count, seismic energy, apparent volume, energy index, apparent stress, and their cumulative values) exhibit unique evolution trends and characteristics prior to the occurrence of a rockburst (Srinivasan et al. 1999; Dou et al. 2018; Liu et al. 2017b; Feng et al. 2015; Ma et al. 2020). Some of the characteristics include strong microseismic activity in rockburst areas, a large amount of energy released due to microseismicity, significant increase in the cumulative apparent volume, and prolonged decrease in the energy index (Tan et al. 2015; Li et al. 2016; Hu et al. 2017). With the expansion of the rockburst database and the rapid development of machine learning methods, the feasibility, accuracy, and timeliness of rockburst prediction and early warning are expected to be further improved through the time series prediction of the microseismic parameters based on deep learning. This type of prediction will play an important role in ensuring the economic benefits of projects and the safety of construction personnel.

Time series data reflect the state or law of factors varying with time. Time series prediction based on deep learning has been developed and widely employed for trend prediction in the stock market, finance, transportation, and other industries (Polson et al. 2017; Thomas et al. 2018; Hou et al. 2019; Anita et al. 2020; Bashar et al. 2020; Zhang et al. 2020a, b, c, d). Rockburst development in deep underground areas is accompanied by multiple microseismic events over a certain period in an enclosed rock area, and there is an evident abnormal variation law in the time series data of the microseismic parameters prior to the occurrence of a rockburst. The time series prediction of the microseismic parameters involves predicting the future trend and evaluating the rockburst risk by thoroughly learning existing time series rules. Rockburst prediction based on machine/deep learning being rapidly developed (Pu et al. 2019). Zhou et al. (2016) used a stochastic gradient boosting model to analyze five potentially related indicators that can help predict rockbursts. Liang et al. (2020) used ensemble learning methods, such as the random forest (RF), adaptive boosting, gradient boosted decision tree, extreme gradient boosting, and light gradient boosting machine, and selected three microseismic parameters as evaluation indicators for rockburst prediction. Ji et al. (2020) used a combination of SVM and genetic algorithm (GA) to study the roles of microseismic waveform data and microseismic energy data in rockburst risk prediction. Moreover, some basic studies on the use of machine/deep learning for classifying microseismic events, denoising of microseismic waveforms, and arrival-time selection of the s/p waves have contributed to improving the accuracy and efficiency of rockburst prediction. Dong et al. (2016) established a classification model for microseismic and blasting events based on the Fisher classifier, naive Bayes classifier, and logistic regression. Shang et al. (2017) provided an idea for the feature extraction and pattern recognition of rock rupture and blasting signals based on a series of models. Andy et al. (2020) found that a convolutional neural network (CNN) trained on a large amount of microseismic data can effectively detect microseismic events. Jiang et al. (2020) proposed an automatic classification method for microseismic events and explosions using an improved complete ensemble empirical mode decomposition with adaptive noise, singular value decomposition, and k-nearest neighbor algorithm. Pu et al. (2020) compared ten machine learning methods, including SVM, BP neural network, naive Bayes classifier, and RF method, in terms of their performance in identifying microseismic events in underground mining. Zhang et al. (2020a, b, c, d) proposed a denoising method based on a fully convolutional encoder–decoder neural network that simultaneously learns the sparse features in the frequency domain and a mask-related mapping function for signal separation. Based on the residual link nested U-Net network (RLU-Net) and Wasserstein generative adversarial network, Zhang et al. (2020a, b, c, d) realized the automatic collection of microseismic waveforms.

In summary, significant advancements have been made in rockburst prediction and microseismic research based on machine/deep learning models. In particular, rockbursts can be prevented effectively and accurately through short-term prediction in the construction stage. However, this type of prediction is mainly based on an on-site monitoring index that helps predict the grade of current rockburst risk; on the other hand, microseismic research based on machine learning mainly uses signal processing methods such as classification, denoising, and detection (arrival-time picking). To improve timeliness, effectiveness, and accuracy of rockburst prediction and provide early warning, it is necessary to study the time series prediction of microseismic parameters based on deep learning.

To overcome the current problems in microseismic and rockburst studies, we performed a time series prediction of the microseismic parameters based on a deep learning method by proposing univariate, multivariate, and multivariate multi-step prediction models. A large amount of time series data pertaining to different rockburst cases are used to train and test these models, and the rockburst risk is interpreted. In addition, two different methods, namely the long short-term memory network (LSTM) and CNN, are used compare the performance of the prediction models. The results show that our method can accurately predict the evolution of the microseismic multi-parameters from the time series data of the complete rockburst process (including in the precursory stage of the rockburst), thus providing a feasible and effective technical method for the prediction and early warning of rockbursts in deep-buried tunnel engineering applications.

2 Time Series Prediction Model of Microseismic Multi-parameter

2.1 Methodology

Generally, a complete rockburst can be broadly divided into three stages, including rockburst incubation stage, development stage, and rockburst stage. The multiple parameters associated with microseismic events in each stage have their own performances (Li et al. 2016; Liu et al. 2019; Xue et al. 2020). Figure 1 shows the evolution of multiple microseismic parameters during a typical rockburst. It can be found that three kinds of rockburst indicators (f (t), ∑g (t), and h (t)) or their combinations are conducive to evaluate the rockburst risk and further predict rockburst. For example, the cumulative apparent volume increases rapidly, the energy index decreases largely, and the rockburst probability continues to increase in rockburst development stage, indicating that the fracture of surrounding rock expands and approaches to the failure state of rockburst. This stage contains abundant precursor information related to rockburst hazards, which can be used for rockburst prediction. Moreover, the evolution trend of the microseismic parameters over a period of time contributes to the rockburst prediction and avoids the impact of the parameter outliers on the prediction at some time. Therefore, it is important to ascertain which stage the rockburst belongs, that is, to predict the evolution trend of microseismic parameters. To this end, the different indicators and their combinations require various prediction models, and the time series prediction model for the multiple microseismic parameters is constructed in this study.

Fig. 1
figure 1

Rockburst stage division based on the different indicators. (1) f (t) (blue line) represents the indicator of real-time fluctuations, including energy index, stress drop, apparent stress, etc. (2) ∑g (t) (yellow line) represents the cumulative indicator, including cumulative seismic energy, cumulative apparent volume, cumulative event count, etc. (3) h (t) (red line) represents other indicator constructed from the microseismic parameters, such as rockburst probability

2.2 Time Series Prediction Model

2.2.1 Univariate Prediction Model

We call the prediction model for the evolution of the univariate microseismic parameters from time series data as the “univariate prediction model”. Figure 2a shows the time series curve of a single variable whose transverse coordinate is calibrated with the equal time distance; the blue and yellow lines indicate the observed and predicted time series, respectively. We need to build a map or model to predict future observations from past observations, and the implementation can be described as follows (see Fig. 2b). In calculation step 1, the input time window of the model is the observation sequence of certain time steps (i.e., the length of the time window), and the output is the predicted value of the next time step. Thus, the predicted value is inferred from the observed values. In calculation step 2, the input time window is slid forward, and the input to the model should contain the predicted one-step value from the previous calculation step. In calculation step n, both the input and output of the model may be predicted values, i.e., the predicted value is inferred from the previous predicted values. The above implementation process requires sliding the input window forward step by step to obtain the prediction curve, which we call the moving-window method.

Fig. 2
figure 2

Univariate prediction model for microseismic parameters. a Time series curve of univariate parameter; b implementation of univariate prediction; c architecture of the convolution neural network

We need to train the model, e.g., a CNN, that takes a series of past observations as input and outputs the predicted value of the next time step. As such, the sequence of observations must be transformed into multiple examples from which the model can learn. We can divide the sequence into multiple input/output patterns called samples, where certain time steps are used as input, and the next one step is used as the output for the one-step prediction being learned.

Figure 2c shows the architecture of the CNN model. The model input is a 1D vector (n × 1 size) of the observed time series and is followed by a convolutional layer, a max pooling layer, a flattening layer, and a densely connected layer. A padding operation is first performed, and then, the convolution operation expands the depth of the feature while extracting the feature (e.g., the feature expands from 1 layer to m layers, and the feature size becomes n × m). In some cases, the first convolution layer may be followed by a second convolution layer and then connected to the max pooling layer, the task of which is to extract the most significant features from the output of the convolution layer. The feature map is simplified as a 1D vector using the flattening layer between the max pooling and densely connected layers. Finally, the densely connected layer is used to explain the features and output the one-step prediction.

2.2.2 Multivariate Prediction Model

  1. 1.

    Multiple Parallel Series Model

    We often analyze rockburst evolution and judge the rockburst risk by combining multiple microseismic parameters (e.g., energy index and cumulative apparent volume) that are typically interrelated. Figure 3a shows the sample curve of the multivariate time series. We need to build a model based on the observations of the multivariate time series to predict the multivariate values of the next time step, which is called the multivariate prediction model of the parallel time series (or multiple parallel series model). Figure 3b illustrates the implementation of the model. In calculation step 1, the model input is the observed multivariate sequence (e.g., two variables) with certain time steps, and the output is the multivariate prediction value of the next time step. In calculation step 2, the input to the model should contain the multivariate prediction from the previous calculation step. In calculation step n, both the input and output of the model can be multivariate predicted values.

    The network structure of the multiple parallel series model is adjusted on the basis of the univariate prediction model (see Fig. 3c). First, the input data dimension (e.g., n×2 represents n time steps and two layers) is put forward for the two time series. At the model output, the features connect multiple independent densely connected layers, i.e., the multivariate prediction values of the next time step are outputted separately. This output strategy may offer more flexibility or better performance depending on the specifics of the problem being modeled. The feature extraction in the middle process of the model is similar to that of the univariate prediction model.

  2. 2.

    Multiple Input Series Model

Fig. 3
figure 3

Multivariate prediction model for parallel time series. a Time series of multivariate parameter; b implementation process of prediction; c architecture of the convolution neural network

In judging the evolution of a rockburst, it is sometimes necessary to predict the evolution of a new variable based on the existing multivariable observation sequence. We call this the multiple input series prediction model. The new variable and existing multiple variables satisfy certain functional relationships; for example, a functional relationship can be established between a new variable (rockburst probability) and multiple microseismic parameters (cumulative event number, cumulative seismic energy, and cumulative apparent volume). Figure 4a shows the sample curve of the multivariate input series; the input multivariate time series includes observation and prediction, and the output time series is the prediction of the new variable (indicated by the green line). The implementation process of the prediction model is described in Fig. 4b: in calculation step 1, the model input is the observed multivariate sequence with certain time steps (e.g., three variables), and the output is the predicted new variable at the current time step; in calculation step 2, the input should contain the one-step prediction of the multiple variables, and the output is the prediction of the new variable; in calculation step n, all the inputs may be predicted multivariate values, and the output is also the prediction of the new variable at the current time step.

Fig. 4
figure 4

Multivariate prediction model for multiple input series. a Time series of multivariate parameter; b implementation process of prediction; c architecture of the convolution neural network

The network structure of the multivariate input series model is established, as shown in Fig. 4c, after some special considerations. First, the input multivariate sequence is no longer processed by a single convolutional module but by multiple convolutional modules (e.g., three modules), which we call a multi-headed CNN model. This structure may offer more flexibility or better model performance; for example, it allows to configure each module and its parameter settings differently for each input sequence, such as the number of filter maps and the kernel size. After a series of operations of convolution, max pooling, and flattening, the extracted features of the different sequences are concatenated. Finally, the resolution of the feature is reduced by the densely connected layer, and the single-step prediction of the new variable is outputted.

2.2.3 Multivariate Multi-step CNN Model

For the prediction of the above univariate or multivariate time series, the output of the model is the prediction of a single time step; however, some problems may require multi-time step prediction, i.e., a multivariate multi-step prediction model. Figure 5a and b shows the multi-step output of the multivariate time series. In calculation step 1, the observed multivariate sequences with certain time steps (e.g., two variables) serve as the model input, and the predicted multivariate values of the next two steps are outputted. In calculation step 2, the model input should contain the predicted two-step multivariate values from the previous calculation step. In calculation step n, both the input and output of the model may be predicted multivariate values.

Fig. 5
figure 5

Multivariate multi-step prediction model for parallel time series. a Time series of multivariate parameter; b implementation process of prediction; c architecture of the convolution neural network

Figure 5c shows the network structure of the multivariate multi-step output model, whose input and feature extraction layers are similar to those of the above models. By inputting the parallel time series of the two variables, the feature is extracted and outputted as a 2 × 2 matrix after a series of operations in the convolution layer, max pooling layer, flattening layer, and densely connected layer. This output matrix represents the predicted two-step values of the two variables.

3 Case Study

3.1 Geological Conditions and Microseismic Monitoring System

We performed a time series prediction of multiple microseismic parameters associated with the development of a rockburst in the Micangshan extra-long tunnel of the Sichuan–Shaanxi expressway; this tunnel is the second longest (13.8 km) expressway tunnel in China and the third longest in the world. In the excavation of the K46 + 170 − K45 + 800 mileage section of the tunnel, rockburst hazards due to the high ground stress occurred frequently (Fig. 6). The rockburst areas have the following conditions: the buried depth is in the range of 530–760 m, and the lithology is mainly gabbro and tectonic granulite; the classification of the surrounding rock quality is mainly grade III (medium), indicating a certain number of joints or fissures in the rock mass; the strata of the gabbro and tectonic granulite are interlaced, and multiple rockburst hazards occurred at the section of the tectonic granulite. Based on the identification, the original rock of the tectonic granulite should be gabbro, which undergoes compressive rupture and recrystallizes to form granulite. The rock mass has many microfractures but still has a high residual geo-stress, which is likely to induce rockburst.

Fig. 6
figure 6

Geological cross-section along the Micangshan tunnel, China (Ma et al. 2019)

Figure 7 shows a microseismic monitoring system and a sensor array layout. Both the tunnels were monitored with three sensors as one group; two sensor groups were distributed at the leading tunnel and one at the following tunnel; three sensors in each group were located at the top and two sides of the tunnel, respectively. The sensor group at the leading tunnel covered the monitoring range of the following tunnel and improved the accuracy of the microseismic source location in this range. A data acquisition station was installed at the lining support area, located in a crossing tunnel between two tubes. The sampling frequency and sampling window were set to 20 kHz and 1.5 s, respectively. The recorded data were collected from 30,000 sampling points with the values in voltage.

Fig. 7
figure 7

Layout of the microseismic monitoring system installed in the Micangshan tunnel

To ensure the calculation accuracy of the microseismic parameters, the follow steps were taken: (1) moving invalid sensors to avoid interfering with parameter calculation; (2) distinguishing microseismic data from noise for an effective signal extraction (Zhang et al. 2020a, b, c, d); (3) denoising the microseismic data to obtain clean data (Zhang et al. 2020a, b, c, d); (4) selecting an appropriate index based on the geological conditions of tunnel for microseismic parameter calculation. Based on the above steps, the microseismic parameters of the recorded data were processed using a set of standards, ensuring the normalization of the parameter calculation and the accuracy of the subsequent analysis. Table 1 presents the definitions of the microseismic parameters.

Table 1 Definitions of microseismic parameters (Zhao et al. 2018; Dai et al. 2017; Liu et al. 2018)

3.2 Process and Description of Rockbursts

Rockbursts in the Micangshan tunnel originate from the excavation of the tectonic granulite section and often occur on the roof or excavation face of the tunnel. The rockbursts occur with a thunder-like bang and subsequently exhibit a splitting sound; the broken rocks are ejected or they collapse. The rockburst zones are mostly controlled by a dominant structural plane, or present a wedge or fornix shape controlled by two or more structural planes, with a maximum depth of up to 3 m (Fig. 8a). These rockbursts exhibit characteristics such as a high-energy release and a wide influence range. The maximum energy of a microseismic event can reach 1 million joules, which is even comparable to that reported during tunnel blasting. The high energy released affects the tunnel equipment and surrounding rock, resulting in shaking or overturning of the equipment, cracking of the tunnel floor (floor heave), cracking of the tunnel sidewall (sidewall heave), and initial support failure (Fig. 8b). For example, the high energy released from a rockburst may cause cracks at the same mileage in the adjacent tunnel (Fig. 8c).

  1. 1.

    Microseismic Event Count and Seismic Energy

    The evolution of the microseismic parameters during rockburst is significant. Figure 9 shows the variations in the daily microseismic event count and microseismic activity index (S value, proposed by Liu et al. 2018) during a typical rockburst. The activity and frequency of the microseismic events before the rockburst are high, and the overall trend is gradually rising (except for the “1.14” rockburst, where the trend first increases and then decreases). The microseismic event count and S value are still high on the day of the rockburst; however, both decrease significantly after the rockburst.

    Microseismic events can be classified into three categories based on the amount of energy released: a low-energy event (≤104 J), a medium-energy event (104–105 J), and a high-energy event (≥105 J). Figure 10 shows that the daily cumulative energy and medium- and high-energy incidences of the microseismic events reach maximum in 1–2 days before the rockburst. They remain high on the day of the rockburst but decrease significantly thereafter.

  2. 2.

    Energy Index and Cumulative Apparent Volume

    Figure 11 shows the evolution of the energy index and cumulative apparent volume during representative rockbursts. The energy index fluctuates, and the cumulative apparent volume continues to increase. The closer the occurrence of rockburst, the lower the energy index is in the oscillation, and the cumulative apparent volume increases sharply. Notably, this precursor characteristic of rockburst development also appears during the strong stress adjustment activity after tunnel blasting; however, in this case, the energy index curve recovers quickly and exhibits a significant fluctuation.

  3. 3.

    Rockburst Probability

Fig. 8
figure 8

Rockburst and damage characteristics of the Micangshan tunnel

Fig. 9
figure 9

Number of daily microseismic events and S value before and after rockburst occurrence. The processes of four typical rockbursts, namely “12.17,” “12.29,” “1.14,” and “3.12,” are illustrated. “4BR” denotes 4 days before the rockburst; “R” denotes the day of rockburst occurrence; “1AR” denotes 1 day after rockburst

Fig. 10
figure 10

Daily cumulative seismic energy and incidences of high- and medium-energy events during the rockburst process

Fig. 11
figure 11

Time series of cumulative apparent volume and energy index during rockbursts

We establish an evaluation index of the rockburst probability using the similarity measure method, which is based on three parameters: the microseismic event count, seismic energy, and apparent volume (referring to the Chinese technical specification for rockburst risk assessment in hydropower engineering). The similarity measure (or distance measure) method involves measuring the relationship between different objects in the mining of time series data. The Euclidean distance is the most widely used similarity measure method. The Euclidean distance D between data points A and B can be defined as

$$D(A,B) = \sqrt {\sum\limits_{i = 1}^{n} {(a_{i} - b_{i} )}^{2} } .$$
(1)

Here, n denotes the number of components of the data point; where n equals 3, which refers to the above three microseismic parameters. Because the microseismic event count, seismic energy, and apparent volume have different dimensions, it is better to normalize the three parameters and ensure that their values are distributed between 0 and 1. Subsequently, each Euclidean distance between the parameter values in the rockburst development process and that at the rockburst occurrence is calculated; the result is also distributed between 0 and 1. The lower the value, the higher the similarity between the microseismic parameters at the current time and that at rockburst occurrence, and the greater the possibility of a rockburst. Therefore, the probability of rockburst occurrence is defined as (1-normalized Euclidean distance) × 100%, i.e., the probability at the rockburst occurrence time is 100%. Figure 12 shows the probability evolution of the “1.5” rockburst in the Micangshan tunnel. The rockburst probability fluctuates and increases continuously to 1.0 (i.e., 100%).

Fig. 12
figure 12

Rockburst probability evolution during “1.5” rockburst

3.3 Training and Testing of Time Series Prediction Model

  1. 1.

    Model Construction and Training

    Because the index magnitude of the cumulative event number, cumulative seismic energy, and cumulative apparent volume are not uniform (ranging from 102 to 108), it is necessary to standardize the data before model training. The following formula is used to scale the index value between 0 and 1:

    $$\overline{X} = \log_{10}^{X} /10.$$
    (2)

    Here, X denotes the adopted microseismic parameters. Standardization can not only reduce the numerical difference but also accelerate the convergence of the model. Moreover, the original value can be easily recovered by anti-standardization. Figure 13 shows the image before and after the data standardization of a complete rockburst in the Micangshan tunnel (taking the cumulative event number, cumulative energy, and cumulative apparent volume as examples).

    Through the data preprocessing (standardization and isochronization), the time series curves of the microseismic multi-parameters were established, and the following sample databases were established for model training:

  2. i.

    The database of the cumulative event number, cumulative apparent volume, and cumulative energy was established for training the univariate prediction model.

  3. ii.

    The database was established on the basis of the cumulative event number, cumulative energy, cumulative apparent volume, and rockburst probability. It is mainly used for training the multiple input series model, i.e., the inputs are multiple sequences of the cumulative event number, cumulative energy, and cumulative apparent volume, and the output is the constructed rockburst probability.

  4. iii.

    The database was constructed with the cumulative apparent volume and energy index for training the multiple parallel series model (including the single-step and multi-step output models), i.e., the input is the parallel time series of the cumulative apparent volume and energy index, and the output is the single-step or multi-step predictive values of these two variables.

    The data pertaining to the “12.29,” “1.5,” “1.14,” “3.2,” and “3.12” rockbursts were used to construct a database for model training, which is different from general database integration construction. The data of these rockbursts were randomly selected and inputted to the model for training in a successive manner; that is, after completing the training with one set of rockburst data, the model was trained with the next set of rockburst data.

    In the model training, the selected length of the time window (time steps) for the input sample will affect the training result. When the time window is set too small, it is difficult for the model to learn the features of the microseismic parameters in the time dimension, and when the time window is too large, it may contain too many features of the time dimension, thereby affecting the prediction performance of the model. Therefore, a suitable time window length can make the model obtain a good fitting effect (Song et al. 2017; Wollmer et al. 2010). Through a comprehensive consideration, the input length of the time window is set to 12 for the training model. Thus, the input sample dimensions of the univariate prediction model, multiple input series model, multiple parallel series model, and multivariate multi-step prediction model are 12×1, 12×3, 12×2, and 12×2, respectively, where the first number (i.e., 12) is the length of the time window and the second number is the channel. The Batch size is set to 16 in the process of model training; the learning rate is set to 0.001 considering the prediction effect and training speed; the activation functions are set to Elu and Sigmoid; the loss function is selected as the MSE, and it is optimized using the Adam algorithm. Moreover, the Earlystopping operation is used to avoid over-fitting during model training. Table 2 presents the structure and parameter setting of the established prediction models.

  5. 2.

    Testing of Univariate Prediction Model

    We test the data pertaining to the “12.17” rockburst, which was not involved in the training. The test was implemented on the basis of the observation data of the complete rockburst process, i.e., all the input samples are from the observation sequence and do not include the predicted value of the model output. The indicators (Table 3) are used to evaluate the test results of each model. The MAE and RMSE are indicators used to evaluate the error between the prediction and the observation; MAPE not only considers the error between the prediction and the observation but also considers the ratio of the error to the true value, which is a relative value; R2 is in the range of 0–1, and the closer the value is to 1, the closer the predicted value of the model is to the true value.

    Figure 14 shows the test results of the univariate prediction model; the yellow line represents the observation curve, and the red line represents the test curve obtained based on the observation curve as the input sample. The fitting between the observed and predicted values of the cumulative energy, cumulative apparent volume, and cumulative event number is good. Because of the large dimensional difference between each microseismic parameter, the determination coefficient R2 is selected to evaluate the model performance quantitatively. The R2 values of the univariate prediction model for the cumulative energy, cumulative apparent volume, and cumulative event number are 0.978, 0.962, and 0.982, respectively, indicating that the test performance of the model is ideal.

  6. 3.

    Testing of Multiple Input Series Model

    Figure 15 shows the test results obtained using the multiple input series model. The test curve of the rockburst probability is obtained on the basis of the time series of the cumulative energy, cumulative apparent volume, and cumulative event number as input samples. The fitting between the true and test values is good, and the R2 coefficient is 0.847, which also proves the good prediction performance of the model.

  7. 4.

    Testing of Multiple Parallel Series Model

    Figure 16 shows the test results of the multiple parallel sequence models. Clearly, the fitting between the observation and test curves of the cumulative apparent volume is good, and the R2 coefficient is 0.968. The fitting between the observed and predicted values of the energy index is moderate, and the R2 coefficient is only 0.684, which may be due to the large fluctuation in the energy index curve. However, the test curve of the energy index is consistent with the trend in the observation curve, indicating that the multiple parallel series model can yield an accurate prediction.

  8. 5.

    Testing of Multivariate Multi-step Prediction Model

    Figure 17 shows the test results of the multivariate multi-step output model. The fitting between the observed and predicted values of the cumulative apparent volume and energy index is worse than that of the multiple parallel series model, and the determination coefficient R2 is also reduced to 0.910 and 0.617, respectively. Therefore, the prediction performance of the multi-step output model is lower than that of the single-step output model.

  9. 6.

    Robustness Test of Time Series Prediction Models

    To further test the performance of the time series prediction models, the disturbances were introduced in the data to evaluate the robustness and stability of the models. The red line in Figure 18 represents the test curve based on the observation curve corresponding to the disturbances as the input (blue dotted line). The models have a good time series prediction performance, even when the observed data contain disturbances. Moreover, there is a good fitting between the observation and test curves of the cumulative energy, cumulative apparent volume, cumulative event number, and rockburst probability, with their R2 values being 0.974, 0.959, 0.979, and 0.801, respectively. This demonstrates the robustness and stability of the univariate prediction model and multiple input series model. For the multiple parallel series model and multivariate multi-step prediction model, the observation and test curves of the cumulative apparent volume have a good fit, with R2 values of 0.945 and 0.939, respectively. However, the fitting of the energy index is not ideal, with R2 values of 0.454 and 0.437, respectively. Nevertheless, the trend in test curve of the energy index is consistent with the observation curve trend, which is similar to the test results of the observations (Figures 16 and 17). Therefore, the time series prediction models have good stability and robustness under the influence of disturbances.

Fig. 13
figure 13

Data image before and after processing of rockburst

Table 2 Structure and parameter setting of different prediction models
Table 3 Evaluation indicators of model performance
Fig. 14
figure 14

Test results of the univariate prediction model

Fig. 15
figure 15

Test result of multiple input series model

Fig. 16
figure 16

Test result of multiple parallel series model

Fig. 17
figure 17

Test result of the multivariate multi-step prediction model

Fig. 18
figure 18

Test result of the robustness of time series prediction models

4 Discussion

4.1 Comparative Studies

The classical LSTM is selected to compare and verify the result of the time series prediction models of the microseismic multi-parameters. The LSTM model has significant advantages in time series modeling owing to its unique structural characteristics, comprising an input gate, an output gate, and a forgetting gate, as well as a memory unit. A three-layer LSTM network structure was adopted in this study. The numbers of hidden units in the first, second, and third layers were 64, 128, and 256, respectively. Before the comparative analysis, the parameters and structure of the two networks were adjusted, and the optimal performance was achieved.

  1. 1.

    Comparison of Univariate Prediction Model

    Figure 19 shows the test curve of the univariate prediction model based on the two networks for a complete rockburst process. The MAE, RMSE, MAPE, and R2 evaluation indices of the model are recorded in Table 4. The R2 coefficients of the LSTM and CNN models are greater than 0.96 for the accumulative energy and accumulative apparent volume, indicating a good fit between the observation and test curves; the performance of the LSTM model is slightly better as the MAE, RMSE, and MAPE indicators decrease. The test performance of the CNN model is slightly better for the cumulative event number, and its MAE, RMSE, and MAPE indices are reduced by 6.6%, 6.6%, and 10.5%, respectively, compared with that of the LSTM model. In summary, both the LSTM and CNN models can achieve good univariate prediction results.

  2. 2.

    Comparison of Multiple Input Series Model

    Figure 20 shows the test curve generated by the multiple input series prediction model based on the two networks, and the evaluation results in terms of the MAE, RMSE, MAPE, and R2 are recorded in Table 3. The CNN model performs slightly better than the LSTM model for the constructed index of the rockburst probability. The R2 coefficient of 0.847 indicates that the observed values fit better with the test values; the MAE, RMSE, and MAPE indices are reduced by 28.3%, 20.9%, and 36.2%, respectively, indicating that the multiple input series model based on the CNN is more suitable for predicting the constructed index of the rockburst probability.

  3. 3.

    Comparison of Multiple Parallel Series Model

Fig. 19
figure 19

Comparison of univariate prediction results with CNN and LSTM networks

Table 4 Performance comparison of different prediction models by CNN and LSTM methods
Fig. 20
figure 20

Comparison of multiple input series models with CNN and LSTM networks

Figure 21 shows the test curve generated by the multiple parallel series model based on the two networks. The test results of the CNN and LSTM models for the cumulative apparent volume are good; the CNN slightly outperforms the LSTM model, the R2 coefficient is higher, and the MAE, RMSE, and MAPE indices are reduced. The performance of the two models is relatively poor in terms of the energy index, as the R2 coefficient of the CNN model is only 0.644, which is due to the large fluctuation in the energy index curve. However, the test curves of the energy index of the two models are consistent with the trend in the observed curve, indicating that the evolution of the energy index curve can be predicted by the models.

Fig. 21
figure 21

Comparison of multiple parallel series models with CNN and LSTM networks

4.2 Rockburst Risk Interpretation Based on Microseismic Time Series Prediction

In the above section, we used a trained model to test the microseismic data collected in the complete process of rockbursts, and the model performance was validated by comparing the test curve with the observation curve. Here, we make some new attempts to apply the prediction model of the microseismic multivariate time series. In this case, the prediction curve of the microseismic multi-parameters is inferred from the observation curve. The input samples for the models come from the values observed at the beginning. As the prediction curve extends, more predictive values are gradually added to the input sample until the predictive values fill the input sample. This is consistent with the application scenario of the time series prediction model presented in Sect. 2.

  1. 1.

    Prediction Made by Univariate Time Series Model and Interpretation of Rockburst Risk

    The untrained data pertaining to the “12.17” rockburst were selected for the prediction using the univariate time series model. As shown in Fig. 22, the predicted time period lasts 11 h, which is from the 25th hour before the rockburst to the 36th hour after the rockburst (rockburst occurred at the 30th hour). In the prediction of the microseismic values at the 25th hour, the input sample is from the observation sequence. For the subsequent microseismic parameter prediction at each hour, the input sample should gradually include more previous predictive values. Although there are some differences between the prediction and observation curves, their evolution trends are the same. The cumulative energy, cumulative event number, and cumulative apparent volume before rockburst occurrence increase quickly with a steep-shaped curve, indicating that the risk of a rockburst increases gradually. After the rockburst, the growth of each index becomes slower, and a gentle step appears in the curve.

  2. 2.

    Prediction Made by the Multiple Input Series Model and Interpretation of Rockburst Risk

    Figure 23 shows the prediction result of the evolution of the “12.17” rockburst, obtained using the multiple input series model. The input to the model comes from the observed and predicted values of the three variables, namely the cumulative apparent volume, cumulative event number, and cumulative energy, and the output is the index evolution of the rockburst probability in the future time period. The diagram shows that the prediction curve of the rockburst probability fluctuates significantly, mainly because the training sample of this index also exhibits a significant fluctuation. Although the predicted curve of the index is different from the real value, their evolution trends are in good agreement, indicating that the index continues to increase and reaches the highest fluctuation point before rockburst occurrence.

  3. 3.

    Prediction Made by Multiple Parallel Series Model and Interpretation of Rockburst Risk

Fig. 22
figure 22

Prediction result of univariate model and interpretation of rockburst risk

Fig. 23
figure 23

Prediction result of multiple input series model and interpretation of rockburst risk

Figure 24 shows the prediction result of the “12.17” rockburst evolution, obtained using the multiple parallel series model. The input to the model comes from the observed and predicted values of the cumulative apparent volume and energy index. The output is the index evolution of the future time period corresponding to the two variables. The diagram shows that the prediction curves of the cumulative apparent volume and energy index are slightly different from their observation curves. The prediction curve of the cumulative apparent volume growth is ahead of its observation curve, and the decrease in the energy index in the prediction curve lags behind that in the observation curve. The predicted value is in good agreement with the evolution trend in the real value, indicating that the cumulative apparent volume increases significantly and that the energy index decreases rapidly before rockburst occurrence. The model can predict and infer the rockburst risk in advance.

Fig. 24
figure 24

Prediction result of multiple parallel series model and interpretation of rockburst risk

Unlike conventional methods that focus on predicting the potential rockburst grade, the proposed method can theoretically provide a time tag for rockburst risk assessment. The specific implementation process involves training the model with isochronized microseismic data obtained during a rockburst, and their evolution trends in the future are obtained using the moving-window method. Some microseismic indices (e.g., rockburst probability and energy index) have evident characteristics during a rockburst. Therefore, the time series prediction method proposed in this paper combines the future evolution trends of these indices to better interpret rockburst risk (and not by simply using the current state or values of the indices, in which case the result may be one-sided due to curve fluctuation). Moreover, using the deep convolution neural network method for the time series prediction of the microseismic multi-parameters is simple a choice preference. The LSTM neural network was used for a comparative study. In fact, any deep learning method that can reflect the mapping relationship between the existing observation sequence and the future prediction sequence is feasible. Thus, this paper provides an effective strategy for rockburst prediction based on data related to microseismic activity.

5 Conclusion

In this study, the evolution prediction of microseismic parameters based on a deep learning method was explored to evaluate the risk of rockbursts in deep underground engineering construction. Various prediction models of the microseismic univariable and multivariable based on different microseismic parameters are proposed, including a univariate prediction model, multiple parallel series model, multiple input series model, and multivariate multi-step prediction model. The results of model tests and comparative analyses show that the CNN-based models can accurately predict evolution trends in microseismic parameters and are generally better than the LSTM-based models. By deducing the prediction curves of the microseismic parameters from the predicted values as the input sample, we found the prediction curve to be in good agreement with the real curve. This proves that the selected microseismic parameters, including the cumulative energy, cumulative apparent volume, cumulative event number, rockburst probability, and energy index, are effective and reasonable for time series prediction. The proposed method has the potential to predict and infer rockburst risk in advance. In addition, the time series prediction model of the microseismic multi-parameters can be combined with a quantitative early warning model to form an integrated model for rockburst prediction and early warning. The proposed method can help reveal the evolution trends in the key characteristics associated with future rockburst development and provide an effective technical means for the prediction and early warning of rockbursts in the field of deep underground and mining engineering. The proposed method can also be applied to the prediction of other types of geological disasters using other appropriate indicators.