t-SNE and variational auto-encoder with a bi-LSTM neural network-based model for prediction of gas concentration in a sealed-off area of underground coal mines

Dey, Prasanjit; Saurabh, K.; Kumar, C.; Pandit, D.; Chaulya, S. K.; Ray, S. K.; Prasad, G. M.; Mandal, S. K.

doi:10.1007/s00500-021-06261-8

t-SNE and variational auto-encoder with a bi-LSTM neural network-based model for prediction of gas concentration in a sealed-off area of underground coal mines

Data analytics and machine learning
Published: 05 October 2021

Volume 25, pages 14183–14207, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Soft Computing Aims and scope Submit manuscript

t-SNE and variational auto-encoder with a bi-LSTM neural network-based model for prediction of gas concentration in a sealed-off area of underground coal mines

Download PDF

Prasanjit Dey¹,
K. Saurabh¹,
C. Kumar¹,
D. Pandit¹,
S. K. Chaulya ORCID: orcid.org/0000-0002-5396-0086¹,
S. K. Ray¹,
G. M. Prasad¹ &
…
S. K. Mandal¹

811 Accesses
16 Citations
Explore all metrics

Abstract

A deep learning network is introduced to predict concentrations of gases in the underground coal mine enclosed region using various IoT-enabled gas sensors installed in a metallic gas chamber. The air is sucked automatically at specific intervals from the sealed-off site utilizing a solenoid valve, suction pump, and programmed microprocessor. The gas sensors monitor the gas content in the underground coal mine and communicate gas concentration to the surface server room through a wireless network and cloud storage media. The t-SNE_VAE_bi-LSTM model is proposed in this study as a prediction model that combines the t-SNE, VAE, and bi-LSTM networks. The proposed model's t-SNE method aims to minimize the dimensionality of the recorded gas concentration; and VAE layer intends to retrieve the inner characteristics of low-dimensional gas concentration. Finally, the given model's Bi-LSTM layer tries to forecast the concentrations of CH₄, CO₂, CO, O₂, and H₂ gases. The proposed model's prediction accuracy is compared with the existing two models, namely auto-regressive integrated average moving (ARIMA) and chaos time series (CHAOS). The experiment findings demonstrate that the t-SNE_VAE_bi-LSTM model forecasted mean square error (MSE) is more accurate, and it has lesser MSE value of 0.029 and 0.069 for CH₄; 0.037 and 0.019 for CO₂; 0.092 and 0.92 for CO; 1.881 and 1.892 for O₂; and 1.235 and 1.200 for H₂ than the ARIMA and CHAOS models, respectively.

Gas concentration prediction based on ED-SLSTM model under the framework of Trend Prediction-Time Point Prediction

Article 25 May 2024

Short-term natural gas load forecasting based on EL-VMD-Transformer-ResLSTM

Article Open access 02 September 2024

Data-Driven Modeling for the Prediction of Stack Gas Concentration in a Coal-Fired Power Plant in Türkiye

Article Open access 29 April 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Due to the harsh and unpredicted working conditions in the underground coal mine, many accidents occurred (Qiao and Zeng 2011; Wang et al. 2013). Furthermore, as per Mine Safety and Health Administration (MSHA), the explosion of flammable gases was occurred in underground mines (MSHA 2018). As a result, it had reduced the production in the affected mines. Therefore, there is a necessity for designing a system that continually monitors and forecasts gas concentrations in underground coal mine enclosed regions for the safety of workers and the early opening of the enclosed site to begin production. Some traditional sensing and monitoring technologies are available for monitoring gas concentrations (Kumar et al. 2013; Mandal et al. 2013; Chaulya and Prasad 2016). However, these traditional methods are incapable of processing massive amounts of multidimensional data provided by various sensors.

In this study, the IoT-enabled gas sensors (CH₄, CO₂, CO, O₂, and H₂) are deployed in a gas collecting chamber installed outside the enclosed site wall of an underground coal mine. The inlet of the metallic gas chamber is connected to a solenoid valve followed by a metallic pipe inserted deep into the fireside through a sealed-off wall. The solenoid valve automatically opens in the fixed time interval for collecting different gas concentrations of a sealed-off area. The five gas sensors gather the appropriate gas concentrations and transmit them to the prediction system via a wireless local area network (WLAN). After collecting 2-min sampling, the solenoid valve closes automatically and opens after a predefined programmable interval. The multidimensional data are generated from IoT-enabled gas sensors. Then generated data send to the prediction model, where the model is trained. After completing the training and validation processes, the prediction model predicts the respective gas concentration of a sealed-off area and sends the prediction result to the cloud storage. Finally, the mine management accesses the cloud storage from the surface control room and monitors the sealed-off area's environmental condition.

Recently, Huang and Kuo (2018) have developed a deep convolutional neural network (CNN)-long short-term memory (LSTM) method for weather forecasting in a smart city. On the other hand, deep learning networks have been broadly used in natural language processing (NLP), computer vision, and object recognition (He et al. 2016; Collobert and Weston 2008). However, deep learning networks effectively process complex multidimensional data (Rashid and Rehmani 2016; Jo and Khan 2018; Saeed et al. 2019). But this method does not correlate the gas concentration, air velocity, and temperature parameters of the underground mine. Hence, this method cannot accurately predict the concentrations of gases in the enclosed region. Therefore, the combination of a deep learning network and IoT technologies has been adopted to monitor the underground mine environment (Rashid and Rehmani 2016; Jo and Khan 2018; Muduli et al. 2019; Saeed et al. 2019).

To achieve this goal, a reliable and efficient deep learning model has been developed that can predict the error-free gas concentration in real time for an underground mine's sealed-off area. Thus, a deep learning method is proposed as a prediction model to combine t-distributed stochastic neighbor embedding (t-SNE), variational auto-encoder (VAE), and bidirectional LSTM (bi-LSTM) network, which is named as a t-SNE_VAE_bi-LSTM model. The t-SNE algorithm preprocesses multidimensional time series data from the CH₄, CO₂, CO, O₂, and H₂ gases data and reduces the dimension. Subsequently, the model combines VAE and bi-LSTM layers, extracting useful information and predicting the concentration of gases. Traditionally, several statistical models proposed inefficient methods for extracting the data's implicit feature during the analysis phase (Liu et al. 2004; McKeen et al. 2007). The VAE can extract potential information and reduce data volume, improving the prediction accuracy of the gas concentration. The LSTM is based on a recursive neural network (RNN) to predict the incident over time interval value (Sundermeyer et al. 2012). The bi-LSTM is connected to the hidden layer in both orders between forward and backward. The bi-LSTM receives input from the VAE layer's output, which generates the essential feature of five gas sensors data. The bidirectional LSTM model is trained using the present and past gas sensor data feature, improving prediction accuracy. Thus, the VAE layer extracts the import feature, and bi-LSTM efficiently predicts the gas concentration in the enclosed region of an underground mine.

Therefore, this study aims to develop a unique model for predicting CH₄, CO₂, CO, O₂, and H₂ concentrations, known as the t-SNE_VAE_bi-LSTM model, by learning the in-depth characteristics of these gases. Furthermore, as a result of this study, the proposed model can predict other mine hazards, such as roof fall in underground mines and slope failure in opencast mines with minor modifications.

The following are the paper's main contributions:

CH₄, CO₂, CO, O₂, and H₂ concentration prediction helps to improve mine safety and early reopening of the sealed-off area to start production.
The state-of-the-art approach, namely deep learning algorithms, has been used to build a prediction model for CH₄, CO₂, CO, O₂, and H₂ concentrations.
An automated metallic gas chamber has been designed that regularly collects gas samples from an underground coal mine sealed-off area at fixed time intervals.
In the enclosed region of coal mines, a real-time system for monitoring and forecasting CH₄, CO₂, CO, O₂, and H₂ gases concentration has been designed.
The CH₄, CO₂, CO, O₂, and H₂ gas concentration prediction model has been proposed by combining t-SNE, VAE, and bi-LSTM models named the t-SNE_VAE_bi-LSTM model.
The proposed model's t-SNE method aims to lower the dimension of the collected gas concentration; the proposed model's VAE layer seeks to retrieve the inner characteristics of low-dimension gas concentration. Finally, the proposed model's Bi-LSTM layer tries to forecast the concentrations of CH₄, CO₂, CO, O₂, and H₂ gases at regular intervals.
The developed system and prediction model have been validated by deploying the complete unit in a coal mine.

The rest of this paper has been arranged as follows. Section 2 presents a recent survey of the literature on the prediction of gas concentrations. Section 3 provides an overview of the automated and intelligent prediction system. The problem statement for the concentration of gases in the deep coal mine sealed-off region is presented in Sect. 4. Section 5 presents the proposed methodology for predicting real-time gas concentrations. Section 6 presents the detailed dataset, results, and comparative analysis of the different prediction models for gas concentration prediction. Finally, Sect. 7 brings the findings to a conclusion. Figure 1 depicts the paper's structure.

2 Related works

Mine accidents have increased yearly due to abnormally increased gas concentration levels (Qiao and Zeng 2011; Wang et al. 2013) as underground coal mines extend up to deeper depth. Xia et al. (2016) have mentioned the sudden increase in gas levels and controlling disaster in the coal mine. However, the gas concentration level was changed abnormally before the underground mine accident occurred. Rodriguez et al. (2014) and Song et al. (2019) have also described such a situation and tried to determine the gas concentration of the underground mine but were unable to predict the gas concentration. Different machine learning (ML) methods were also applied by various researchers (Karaca et al. 2006; Xi et al. 2015) to predict gas concentration. There are multiple traditional methods available for predicting the concentration of gases in the underground coal mine. The existing prediction methods can be divided into different categories, like the machine learning forecasting approach (Karaca et al. 2006; Xi et al. 2015), statically prediction approach (Brooks et al. 2016), and mathematical prediction approach (Sundermeyer et al. 2012). Said approaches are not capable of processing vast amounts of data generated from five gas sensors in real time. It is challenging to employ stated techniques in the actual forecasting status of a sealed-off area's gas level. To predict gas emission of the working face, Ye et al. (2006), Yang et al. (2009), Chen et al. (2016), and Guo et al. (2018) have divided the different gas emission sources. Their plan came from a distinct source and forecasting method. They formed equations for gas emission of other emission areas of a mine, which delivered a significant vision and assistance for gas extraction and ventilation design. However, numerous variables, such as the rate of coal descending per unit of time and the amount of coal that remained in the goaf, were present in this approach (Zheng et al. 2019). Thus, applying the above techniques to forecast future gas levels in the underground coal mine is not easy. To decide the rule of gas concentration trek in the surface of a mine where working is still progress, the researchers observed the equation of gas flow, equation of gas dispersal, and trek equations by a bulky number of mathematical simulations (Cao and Li 2017; Xia et al. 2017). But, mesh separation of the mathematical model was an essential consequence of this experimentation. The simulation result of gas extraction in a mine's working face depends on a basic model with the perfect margin environment. A mathematical model demonstrated the rule of gas extraction, but it is problematic to accomplish actual forecasting of different gas concentrations.

Many researchers have studied on-time serial sensor data and developed several prediction models. For example, the chaotic time series (CHAOS) model (Zhang et al. 2007; Cheng et al. 2008; Liu 2010), auto-regressive integrated average moving (ARIMA) model (Rekhi et al. 2020), and support vector regression (SVR) model (Kun et al. 2016) proposed for predicting the concentration of gases in underground coal mines. However, as mentioned above, the models' prediction speed and accuracy were insufficient for actual gas level monitoring. The LSTM neural network is a popular recurrent method for time series data prediction (Lyu et al. 2020). It can remember long-term historical data, which can be used in various applications such as speech recognition, emotion detection, and forecasting detection.

Many researchers have recently developed optimization algorithms. These algorithms described how vital features could be extracted from the gas concentration in underground mines. For example, Abualigah (2019) has proposed a novel classifier for classifying text documents. Further, particle swarm optimization algorithm and dimension reduction technique has been used to get the novel feature in low-dimensional space. This unique characteristic is then employed to boost productivity while lowering the computational cost of the text clustering (TC) technique.

Furthermore, Abualigah et al. (2021a) have proposed a mathematical optimizer to find a solution in an ample search space. It used the distribution behavior of the four basic mathematical operations. Similarly, Abualigah et al. (2021b) have developed a lightweight optimizer motivated by Aquila's behavior. The optimization procedures of the proposed algorithm are divided into four methodologies for selecting and discovering separate search spaces.

Altabeeb et al. (2021) have proposed a collaborative modified firefly algorithm to cooperate with the capacitated vehicle routing problem (CVRP). This proposed method efforts to find transport routes with the shortest distance traveled. Abd Elaziz et al. (2021) have proposed modified artificial ecosystem-based optimization (AEO) to solve the task scheduling in the cloud environment. Task scheduling is critical, and optimizing scheduling for IoT task requests can enhance organizational quality and profitability. Hassan et al. (2021) have presented an improved slime mold algorithm (ISMA). They used it to effectively resolve single- and bi-objective financial and emission dispatch (FED) problems while considering valve-point effects. Eid et al. (2021) have developed an improved marine predators algorithm (IMPA) to extend the previous marine predator's algorithm (MPA). The proposed enhancements result provides faster convergence and avoids possible minima instability for the previous MPA. In addition, IMPA regulates the voltage and current injected into the distributed generation to minimize overall system losses and total voltage deviations. Şahin and Abualigah (2021) have proposed a unique deep neural network-based intrusion prevention technique for determining features using the grouping system. A deep neural network is also utilized to store the time sequence characteristic mapped from actual past data. The proposed approach used an impervious features extraction model to improve the recognition skills of static analyses. Hati et al. (2019) have proposed an intelligent wireless framework to manage the network.

Dey et al. (2021a) and Kumari et al. (2021) have proposed a deep neural network to forecast mining risks and explosive states in the underground mining site. The proposed system indicates the safe condition of the working zone of the underground mines by correlating several hazards parameters. In addition, Dey et al. (2021b) have presented a safe architecture for training the model securely. Similarly, Dey et al. (2021c) have introduced a deep network-based secure communication channel in the mining site to secure communication in underground mines. Muduli et al. (2019) have conducted a comprehensive survey on deploying wireless sensor network technology in the enclosed region of the mine site. It understood the coal mine limits as well as other aspects of the operating zone. Jiang et al. (2018) have used a hazard adjustment approach based on a machine-learning algorithm to anticipate rock bolt incompetence in underground coal mines.

In the predictive training phase, the support vector machine (SVM) algorithm was used for various mine gas concentrations. Zhang et al. (2016) have optimized the weight of the artificial neural of the machine learning model and predicted gas level using the old and disorder principles. Deng et al. (2018) have developed a combined architecture of regression and swarm optimization to estimate the atmospheric pressure of unexpected combustion processes in the goaf region. Qiang and Pu (2018) have presented a technique for predicting short-term electricity supply. The preceding predictions are based on machine learning algorithms and swarm optimization. Zhao et al. (2020) have recommended a wastewater treatment plant based on artificial intelligence (AI). This framework quickly separated the toxic substances from the wastewater. In addition, AI was used to improve the efficiency and data processing in sewerage systems. Osarogiagbon et al. (2020) have presented a trained machine learning approach for the milling process. This approach smoothly recognized the numerous hazardous events that occurred, mainly during the milling process. Finally, Sharafati et al. (2020) have developed an enhanced data mining technique for forecasting effluent sewage's mean values and predictability.

Deep learning has mostly overtaken standard machine learning models in recent times. The deep learning method automatically extracts the vital feature from the sequence of data. Moreover, it compresses the input data using multi-layer applications. Thus, it reduces the over-fitting problem during the model learning time. However, the performance accuracy of the deep neural network is not efficient for complex time series data and is unable to processed massive amounts of data in real time. Hence, the t-SNE_VAE_bi-LSTM model has been proposed in this paper for the accurate and efficient prediction of gases present in underground coal mines in real time. Some of the recent research for predicting hazards and different optimization techniques is summarized in Table 1.

Table 1 Summary of recent prediction models developed for forecasting or optimization of process

Full size table

3 System description

An automated and intelligent system is designed to monitor the condition of a blaze inside an enclosed region of the underground mining site in real time. The tracking of fire status is achieved by predicting concentrations of gases inside the enclosed region. The sealed-off area is defined as a part of the underground mine where the fire has occurred. The area has been sealed by constructing a wall to control fire by cutting off the oxygen supply and isolate the area from other working faces of the underground mining site. The fire intensity of the isolated area is gradually decreased by cutoff the oxygen flow in this area. Mine fires and explosions take many lives and cause much property damage every year. However, current methods of analyzing sealed-off areas in mines involve slow, cumbersome manual processes and are prone to error. Thus, an automated gas sampling and predicting system are developed for the sealed-off area to track the status of the fire. The details of the system are depicted in Fig. 2. The system consists of a data acquisition system, WLAN, five gas sensors (CH₄, CO₂, CO, O₂, and H₂) fitted inside a box/chamber, two solenoid valves, a suction pump, and a prediction model for collecting gas concentration from the sealed-off area at a fixed time interval. The detailed specification of gas sensors is given in Table 2. The system is connected with a pipe fitting in a fire-stopping brick wall for air sampling purposes from a sealed-off area. After each predefined time interval, the solenoid valve near the pipe opens, and the suction pump starts sucking air from the sealed-off area. The rear solenoid valve also opens subsequently. The sucked air passes through the box/chamber, and gas concentration is measured by different sensors fitted inside the room. It measures the gas concentration and sends it to the prediction model using WLAN. The other gas concentration is predicted using t-SNE and VAE with the bi-LSTM model in the prediction model. After the prediction process, the prediction result is uploaded to the cloud storage using WLAN. The concerned mine management of an underground mine accesses the prediction result from the cloud storage. It continuously monitors the gas level in the enclosed region of the underground mining site. The measurement process continues for 2 min. Then the system stops operation till the next cycle starts.

Table 2 Permissible limit in underground coal mine and specification of different gas sensors

Full size table

4 Problem scenario

The previously defined models are unable to process the complex and multidimensional time series sensor data effectively. As a result, the models cannot efficiently predict the gas concentration in the enclosed region of the underground mining site. The five fixed time series gas sensors data are collected from the designed metallic gas chamber and uploaded to the prediction model to predict the gas concentration accurately. The accumulated time series data includes 150 days of CH₄, CO₂, CO, O₂, and H₂ concentrations, including complex, multidimensional, and noisy data. Hence, there is a requirement for reducing dimension and noise from the collected data. The extracted important feature from collected data has been utilized for efficiently predicting gas concentration. Appropriate deep learning is employed to reduce dimension and noise and extract the vital feature of the multidimensional time series data. Therefore, t-SNE, VAE, and bi-LSTM neural networks have been utilized to predict gas concentration efficiently.

In this case, we explore a 2D matrix $A(i,j)$ where the ith row is denoted as a group of different gases, and the jth column is indicated as a group of timestamps $T = (t_{1} ,t_{2} , \ldots ,t_{d} )$ where $t_{1} < t_{2} < \cdots < t_{d}$. For example, the matrix $a_{{t_{1} ,1}}$ is marked as the value of CH₄ concentration at the timestamp t₁.

$$ A(i,j) = \begin{array}{*{20}c} {CH}_{4}\;\; \cdots\;\; {H}_{2}\\ \left({\begin{array}{*{20}c} {a_{{t_{1} ,1}} } & \ldots & {a_{{t_{1} n}} } \\ \vdots & \ddots & \vdots \\ {a_{{t_{d} 1}} } & \cdots & {a_{{t_{d} n}} } \end{array} } \right) \end{array}$$

(1)

The first objective is to minimize the dimension $A(i,j)$ and mapping $B(i,j)$ using the t-SNE method. The second objective is to extracts the important feature from $B(i,j)$ and mapping to $C(i,j)$ using the VAE method. Finally, the bi-LSTM layer predicts the future gas sensor value input as a $C(i,j)$.

The main problem scenario of the underground coal mines are as follows:

There is no real-time CH₄, CO₂, CO, O₂, and H₂ concentration prediction system available for the enclosed region of the underground mining site to improve mine safety.
There is no automated system available to collect gas concentrations from an enclosed region of the underground mining site. Traditionally, the gas concentration was manually collected from the seal-off site using a sampling bag.
Previously defined models are incapable of processing complex and multidimensional gas concentrations in real-time.
Earlier described models cannot reduce dimension, and noise as well as cannot extract the vital feature from the multidimensional gas concentration.

Therefore, an automated metallic gas chamber that collects different gases at a fixed time interval has been designed. Section 3 contains a detailed description of the automated system. In addition, a prediction model has been developed based on t-SNE, VAE, and bi-LSTM neural network techniques. The model can process complex and multidimensional gas concentration data in real time and extract vital features that improve mine safety by reducing the noise and dimension of the collected gas concentration.

5 The proposed method for real-time gas concentration prediction

The real-time gas level prediction method is depicted in Fig. 3. It is divided into three parts. The first part of Fig. 3 describes the dimension reduction process of gas sensor data. The second part is the VAE layer, where data are de-noised and extracted from the critical feature. The last part is the bi-LSTM-based prediction model, trained, and validated using past and future features input from the VAE layer.

The VAE is a type of deep learning for de-noising the sensor data and extracting the import feature. Various researchers have already applied the VAE for video anomaly detection (Fan et al. 2020), pattern recognition (Ma et al. 2019), and feature learning (Zhang et al. 2019), and it produced a good result. Hence, the present study has employed the VAE layer to extract the potential feature from multiple sensor data. The bi-LSTM is a deep learning model where data processing, classification, and prediction process are performed. The historical and forecasting sensor values are playing an essential role in efficiently predicting gas concentration. The Bi-LSTM model knows short and long-term dependencies without holding duplicate data from both historical and forecast values. Many researchers have already applied bi-LSTM for speech identification (Ogawa and Hori 2017), classification (Zhao et al. 2018), biomedicine (Tutubalina et al. 2018), and sentimental analysis (Chen et al. 2017), which efficiently generated prediction results from time series data. Therefore, the bi-LSTM method is employed for the prediction process. Also, t-SNE, VAE, and bi-LSTM techniques enhanced the prediction accuracy and decreased model training time.

5.1 Preprocessing of input data

Due to the complex, multidimensional, and noisy data sample, it is difficult to directly train the model in the prediction process, generating inaccurate prediction results. As a result, the data preprocessing approach is critical in decreasing data imbalance and improving prediction outcomes.

In this paper, the t-SNE method is adopted nonlinearly by reducing the dimension of gas sensor data. It shrinks time series data by translating the multidimensional spatially neighborhood's Gaussian distribution to the low-dimensional space. As a result, the t-SNE technique can successfully capture a considerable fraction of local and global structures on a wide scale (Maaten and Hinton 2008). Furthermore, the similarity between multidimensional sensor data points and low-dimensional space is maintained by measuring the Gaussian joint probabilities between two data points (Fooladgar and Duwig 2018).

Here, we consider multidimensional gas concentration as a two-dimensional matrix $A(1,j) = (a_{{t_{1} }} ,a_{{t_{2} }} , \cdots ,a_{{t_{d} }} ) \in {\mathbb{R}}^{\mathbb{Z}}$ where $t_{1} < t_{2} < \cdots < t_{d}$. In $A(1,j)$ a matrix, first row represents the CH₄ gas concentration, and the jth column is denoted as the timestamp of the CH₄ gas concentration. The detailed descriptions of the matrix are given in Eq. 1. Similarly, we have been represented the dimensionality of the remaining gas concentration. The conditional probability $p_{l|k}$ between two neighboring data points $a_{{t_{l} }}$ and $a_{{t_{k} }}$ in timestamp $t_{k} ,t_{l}$ is given by:

$$ p_{l|k} = \frac{{\exp ( - \left\| {a_{{t_{k} }} - a_{{t_{l} }} } \right\|^{2} /2\sigma_{{t_{k} }}^{2} )}}{{\sum\nolimits_{m \ne k} {\exp ( - \left\| {a_{{t_{k} }} - a_{{t_{m} }} } \right\|^{2} /2\sigma_{{t_{k} }}^{2} )} }} $$

(2)

where $\sigma_{{t_{k} }}$ is the Gaussian variance concerning the central data point $a_{{t_{k} }}$, and $a_{{t_{m} }}$ is another neighboring data point in the timestamp $t_{m}$. When $p_{l|k} = 0$, the joint probability $P_{kl}$ of the multidimensional space is determined as:

$$ P_{kl} = \frac{{(p_{l|k} + p_{k|l} )}}{2d} $$

(3)

where d is denoted as a set of data points of multidimensional gas concentration with a different timestamp. The low-dimensional gas concentration is represented as $B(1,j) = (b_{{t_{1} }} ,b_{{t_{2} }} , \cdots ,b_{{t_{d} }} ) \in {{{\rm R}}}^{z}$ where $z < {{{\rm Z}}}$. Similar to the above, there is a set of Gaussian variance $\sigma_{{t_{k} }}$ in the conditional probability $q_{l|k}$ to $\frac{1}{\sqrt 2 }$. The joint probability $Q_{kl}$ of low-dimensional space is defined as:

$$ Q_{kl} = \frac{{(1 + \left\| {b_{{t_{k} }} - b_{{t_{l} }} } \right\|^{2} )^{ - 1} }}{{\sum\nolimits_{m \ne o} {(1 + \left\| {b_{{t_{m} }} - b_{{t_{o} }} } \right\|^{2} )^{ - 1} } }} $$

(4)

where $b_{{t_{m} }} ,b_{{t_{o} }}$ are another two neighboring data points at the timestamp $t_{m} ,t_{o}$. The t-SNE algorithm seeks a low-dimensional $B(i,j)$ that minimizes the mismatch between P and Q in order to make the low-dimensional gas concentration have the identical joint probability distribution as the multidimensional gas concentration. The Kullback–Leibler (KL) divergence between multidimensional and low-dimension gas concentration is used to measure the correlation between P and Q. The loss functions $L$ between P and Q is calculated as:

$$ L(b_{{t_{1} }} ,b_{{t_{2} }} , \cdots b_{{t_{d} }} ) = \sum\limits_{k} {KL(P_{k} ||Q_{k} )} = \sum\limits_{k} {\sum\limits_{l} {P_{kl} \log \frac{{P_{kl} }}{{Q_{kl} }}} } $$

(5)

The loss function L is minimized in the weight updating process using a gradient descent algorithm. The t-SNE algorithm's gradient is defined as:

$$ \frac{\partial L}{{\partial b_{{t_{k} }} }} = 4\sum {(P_{kl} - Q_{kl} )} (b_{{t_{k} }} - b_{{t_{l} }} )(1 + \left\| {b_{{t_{k} }} - b_{{t_{l} }} } \right\|^{2} )^{ - 1} $$

(6)

The weight updating of Eq. (6) is derived as:

$$ b_{t}^{n} = b_{t}^{n - 1} + \eta \frac{\partial L}{{\partial b_{t} }} + \alpha (n)(b_{t}^{n - 1} - b_{t}^{n - 2} ) $$

(7)

where $\eta$ is the learning rate, $\alpha (n)$ is the momentum at iteration n. The dimensionally is reduced in each iterated process described in Eq. (7). The dimension reduction process for a multi-dimension time series dataset using the t-SNE method is given in Algorithm 1.

5.2 Essential feature extraction using VAE layer

The VAE layer is taking input from preprocessing layer. In this layer, essential features are extracted from the low dimension gas concentration $B(i,j) \in {{\rm R}}^{z}$. The extracted dataset is passed to the bi-LSTM layer, which predicts gas concentration. The VAE is a generic deep learning model. The working principle of VAE is similar to variational Bayesian learning. The VAE extracts essential features from a low-dimensional gas concentration and produces new information. Figure 4 represents the VAE layer. According to Fig. 4, the VAE layer is split into two components: encoder and decoder. The encoder is created the latent vector from the input dataset, which extracted the main feature. The decoder rebuilds the input dataset using the latent vector to back to the original input dataset. The input dataset is denoted as $B(i,j)$; latent vector pointed as $C(i,j)$ which extracted the main features, encoder parameter is represented as $\phi$, and decoder parameter is marked as $\theta$. The encoder is labeled as $q_{\phi } (C(i,j)|B(i,j))$, and decoder is described as $p_{\phi } (B(i,j)|C(i,j))$.

The VAE training procedure is described as follows:

i
The encoder $q_{\phi } (C(i,j)|B(i,j))$ takes the input from the input dataset $B(i,j)$. Then encoder is generated a latent vector $C(i,j)$ using two vectors; the means vector $\mu_{\phi } (B(i,j))$ and the variance vector $\sigma_{\phi }^{2} (B(i,j))$.
ii
The latent vector $C(i,j)$ is sampled based on Gaussian distribution using the mean vector $\mu_{\phi } (B(i,j))$ and the variance vector $\sigma_{\phi }^{2} (B(i,j))$. The reparameterization trick (Kingma and Welling 2013; Kingma et al. 2015) is used in the sample $C(i,j)$.
iii
The decoder $p_{\phi } (B(i,j)|C(i,j))$ has been reconstructed $B(i,j)$ from the latent vector $C(i,j)$. The decoder's posterior distribution is assumed to be Gaussian in this case. The decoder may still immediately measure the mean vector $\mu_{\theta } (C(i,j))$ and the variance vector $\sigma_{\theta }^{2} (C(i,j))$ to regenerate the $B(i,j)$.
iv
We are using the lower bound of the periphery likelihood $p_{\theta } (B(i,j))$ for calculating the gradient. Then, the parameter is updated in the backpropagation process.

In this paper, the VAE layer utilizes the Gaussian distribution to generate an essential feature from the input dataset. The encoder ($\phi$) and decoder ($\theta$) constraints are trained by maximizing the periphery likelihood $\log p_{\theta } (B(i,j))$. The $\log p_{\theta } (B(i,j))$ is calculated in the following equation:

$$ \begin{aligned} \log p_{\theta } (B(i,j)) & = \log \int {p_{\theta } (B(i,j)|C(i,j))p(C(i,j))} dC \\ & = \log \int {q_{\phi } (C(i,j)|B(i,j))\frac{{p_{\theta } (B(i,j)|C(i,j))p(C(i,j))}}{{q_{\phi } (C(i,j)|B(i,j))}}} dC \\ \end{aligned} $$

(8)

$$ \ge \int {q_{\phi } (C(i,j)|B(i,j))\frac{{p_{\theta } (B(i,j)|C(i,j))p(C(i,j))}}{{q_{\phi } (C(i,j)|B(i,j))}}} dC $$

(9)

$$ \begin{gathered} = \int {q_{\phi } (C(i,j)|B(i,j))\{ \log \frac{p(C(i,j))}{{q_{\phi } (C(i,j)|B(i,j))}} + \log p_{\theta } (B(i,j)|C(i,j))dC\} } \hfill \\ = \int {q_{\phi } (C(i,j)|B(i,j))} \log p_{\theta } (B(i,j)|C(i,j))dC \hfill \\ - \int {q_{\phi } (C(i,j)|B(i,j))\log \frac{p(C(i,j))}{{q_{\phi } (C(i,j)|B(i,j))}}} \hfill \\ \end{gathered} $$

$$ = E_{{C(i,j) \sim q_{\phi } (C(i,j)|B(i,j))}} [p_{\theta } (B(i,j)|C(i,j))] - KL(q_{\phi } C(i,j)|B(i,j)||p(C(i,j))) $$

(10)

where $p(C(i,j)) = \aleph (C(i,j);0,I)$ and $p_{\theta } (B(i,j)|C(i,j)) = \aleph (B(i,j);\mu_{\theta } ,\sigma_{\theta }^{2} )$. Equation (10) is generated according to the number of sampling approximations. Assume that the number of gas concentration samples L, the approximation is measured as:

$$ \log p_{\theta } (B(i,j)) \cong \frac{1}{L}\sum\limits_{l = 1}^{L} {\log p_{\theta } (B(i,j)|C(i,j)^{l} )} - KL(q_{\phi } (C(i,j)|B(i,j)||p(C(i,j))) $$

(11)

The latent vector is measured from the mean vector $\mu_{\phi } (B(i,j))$ and the variance vector $\sigma_{\phi }^{2} (C(i,j))$ using the following reparameterization trick.

$$ C(i,j) = \mu_{\phi } (B(i,j)) + \sigma_{\phi } (B(i,j)) \odot \in ( \in \sim \aleph (0,I)) $$

(12)

Finally, the VAE error rate is described as follows:

$$ \zeta (\theta ,\phi ,B(i,j)) = - \log p_{\theta } (B(i,j)) $$

(13)

5.3 Prediction layer based on bi-LSTM

In this layer, the real-time prediction result is generated using bi-LSTM. The working principle of the LSTM layer is similar to the recurrent neural network (RNN) model. The LSTM model maintains one hidden layer, followed by a regular feed-forward output layer. The traditional RNN cannot resolve the vanishing gradient and long-standing dependents problem. But LSTM efficiently solves the vanishing gradient and long-standing dependents problem. The long-standing dependents problem is defined as when the time interval is increased for time series data. The learned information cannot connect to significantly past information, leading to the vanishing gradients problem. The historical and future characteristics of time series data are helpful in the prediction process. If the model is built using historical and future time series data characteristics, it efficiently predicts the future concentration of gases. But the hidden layer of LSTM only contains the feature from the historical data. As a result, a bidirectional LSTM model is used in this research. The model is trained using history and future features from the time series data, efficiently predicting the gas concentration. The bi-LSTM prediction model is represented in Fig. 5. The left side of Fig. 5 explains the bi-LSTM architecture, and the right side designates LSTM neural network.

The LSTM comprises an input gate, a forget gate, and an output gate. The logistic nonlinearity $\sigma$ is included in the three defined gates. The input gate controls the input data, defining how long data are read from input time series data. The VAE model extracted the feature $C_{(t,t - 1, \cdots ,t - l)}$ of l hours before the timestamp T and passed it to the bi-LSTM model as an input. This paper aims to predict the gas concentration of an enclosed region N hours after the timestamp T. Both l and N are preset time intervals. The intake is calculated using the subsequent Equation:

$$ i_{t} = \sigma (U_{i} h_{t - 1} + W_{i} x_{t} + b_{i} ) $$

(14)

$$ c_{t} = f_{t} *c_{t - 1} + i_{t} *\tanh (U_{c} h_{t - 1} + W_{c} x_{t} + b_{c} ) $$

(15)

where $\sigma$ is denoted as sigmoid function, i, c and f are represented as input gate, cell state vector, and forget gate. The h_t is marked as a hidden state vector of the bi-LSTM neural network. The U_i and U_c are defined as the weighted value of the hidden state of the bi-LSTM neural network. The W_i and W_c are the weighted value of the input gate and cell state for the input x_t of the bi-LSTM neural network. The b_c is defined as a bias vector and, c_t is defined as a cell state vector.

The middle gate (forget gate) determines how far to overlook the present state data. The forget gate manages some data features from the input data feature. The forget gate is defined as:

$$ f_{t} = \sigma (U_{f} h_{t - 1} + W_{f} x_{t} + b_{f} ) $$

(16)

where $\sigma$ is denoted as a sigmoid function, the h_t−1 is indicated the hidden state vector, and the subscript t is represented as the timestamp. The U_f is represented as weighed matrices of the hidden state $h_{t - 1}$. The W_f is the weighted value of the forget gate for the input x_t of the bi-LSTM neural network. The b_f is denoted as a bias vector.

The output gate finally predicts gas concentration. The hidden state h_t is represented in the next movement. The output gate is defined as:

$$ o_{t} = \sigma (U_{o} h_{t - 1} + W_{o} x_{t} + b_{o} ) $$

(17)

$$ h_{t} = o_{t} * \tanh (c_{t} ) $$

(18)

where $\sigma$ is denoted as sigmoid function. The U_o is represented as weighed matrices of the hidden state h_t. The W_o is the weighted value of the output gate for the input x_t of the bi-LSTM neural network. The b_o is denoted as a bias vector.

The proposed bi-LSTM neural network model efficiently analyses time interval values than LSTM neural network model. It is an analysis of data in both backward and forward movements. The historical and future time interval value can affect the forecasting of the present value. The feature using past and present time series data can more accurately predict gas concentration. The bidirectional LSTM deep learning model's parameters can be used in the forecasting process. The information is stored in the backward direction vector. From backward to forward, the bi-LSTM model is improved. Therefore, the combination of backward and forward information enhances the prediction result. Figure 6 shows the training process of the bi-LSTM model. Here time series of different gas sensor data are selected for the model training process. The forward LSTM takes the input from $t = 1$ to 2T, and the backward LSTM takes the information from $t = 2T$ to 1. It predicts gas concentration. The combination of backward and forward LSTM produces efficient and error-free prediction results.

6 Experimental results

6.1 Dataset

The experiment was conducted for an Indian underground coal mining site to determine the prediction accuracy of CH₄, CO₂, CO, O₂, and H₂ gases using the proposed model. The dataset contained the hourly time interval gas concentration of a sealed-off area from September 12, 2019, to February 12, 2020, depicted in Fig. 7. The $\mathrm{X}$-axis of Fig. 7 represents one hour's time interval from September 12, 2019 to February 12, 2020, and $\mathrm{Y}$-axis represents the corresponding concentration values of different gases. Table 3 displays the names and units of the input variables.

Table 3 Input variables of the dataset

Full size table

The t-SNE_VAE_bi-LSTM, ARIMA, and CHAOS models were trained using the collected data. After model training, different gas concentrations were predicted using the proposed trained model and compared the proposed model's effectiveness with the existing ARIMA and CHAOS models.

In the experiment process, 80$\%$ of the collected data were used for model training, and reaming 20$\%$ of data was used to test the system. Table 4 gives the information about the distribution process of the collected dataset. The training dataset of gas concentration was used to train the model. The model was trained using 300 iterations. In each iteration process, the performance of the model was optimized by updating the gradient error. In the model's validation process, the training parameters were adjusted to increase the trained model's generalization capability and remove the over-fitting problem by dropping some training parameters. Figure 8 represents the training loss and validation loss of t-SNE_VAE_bi-LSTM of the trained model. Finally, the testing process demonstrated the effectiveness of the trained model, which is described in the result section.

Table 4 Details of the dissemination process of the concentration of gases

Full size table

6.2 Prediction result

The experiment was conducted on the proposed prediction model for evaluating the prediction accuracy. The proposed model's prediction accuracy was compared with two traditional machine learning models, namely ARIMA and CHAOS. Here input gas concentration was passed to the proposed model, and it produced the forecasting gas concentration. The mean squared error (MSE) and mean absolute error (MAE) was used to associate the t-SNE_VAE_bi-LSIM model and two traditional machine learning models, namely ARIMA and CHAOS models. The RMSE and MAE are defined as:

$$ RMSE = \sqrt {\frac{{\sum\limits_{i = 1}^{l} {(A_{i} - P_{i} )^{2} } }}{l}} $$

(19)

$$ MAE = \sqrt {\frac{{\sum\limits_{i = 1}^{l} {\left| {A_{i} - P_{i} } \right|} }}{l}} $$

(20)

where A_i is the actual value of CH₄, CO₂, CO, O₂, and H₂ gas concentration, P_i is denoted as predicted results of five gases and i is indicated the number of gases. Therefore, the smaller RMSE and MAE values represent the forecasting accuracy and efficiency of the prediction model.

Before the model training process, each gas concentration was preprocessed, as described in Sect. 5.1. After preprocessing, each value of the CH₄, CO₂, CO, O₂, and H₂ was normalized to [0, 1]. The normalization process is described below:

$$ Normalize_{Variable} = \frac{V - \min (V)}{{Max(V) - Min(V)}} $$

(21)

where V is denoted as gas concentration. After the normalization process, the normalized gas concentrations were sent to VAE with LSTM models for training purposes. Three hundred iterations performed the training process. A backpropagation method was employed to minimize the training error, and the LSTM model was optimized using the Adam optimizer. The batch size was set to 64 samples each iteration throughout the training phase, and the training rate was set at 10⁻³. Table 5 gives the training parameters used in VAE and bi-LSTM models.

Table 5 Training parameters used in VAE and bi-LSTM models

Full size table

The correlations among CH₄, CO₂, CO, O₂, and H₂ concentrations of gases were verified in the validation process. Then, the prediction model was trained using alone CH₄, CO₂, CO, O₂, and H₂ gas concentration (before correlation) and after correlation of the respective gas concentration. After the training process, the correlated prediction accuracy of the five gases was compared with the proposed t-SNE_VAE_bi-LSTM method with the existing ARIMA and CHAOS machine learning models. Figure 9 shows the real value versus correlated prediction value of CH₄, CO₂, CO, O₂, and H₂ for t-SNE_VAE_bi-LSTM, ARIMA, and CHAOS models over 739 h (from January 13, 2020, to February 12, 2020) from the testing data in the forecasting phase. The X-axis of Fig. 9 denotes the time interval (in an hour), and Y-axis indicates the predicted level of CH₄, CO₂, CO, O₂, and H₂ gases. The upper part of each figure's color bar represents the observed value using different machine learning models. The experimental results indicated that the t-SNE_VAE_bi-LSTM model achieved better accuracy than ARIMA and CHAOS models in the forecasting phase.

Table 6 indicates the standard deviation of the percentage difference of the predicted CH₄ gas concentrations for the t-SNE VAE bi-LSTM, ARIMA, and CHAOS models from January 13, 2020 to February 12, 2020. It has 739 h of predicted data from three models, with an average of 24 h of data in each row. The standard deviation has been calculated based on the percentage difference between the actual and predicted CH₄ gas concentrations for the three models. Therefore, the standard deviation for the three models has been included in the last row of Table 6. Similarly, in the electronic supplementary material file, Tables A1–A4 have the standard deviation of the three models' CO₂, CO, O₂, and H₂ concentrations of gases.

Table 6 Standard deviation of percentage difference of the predicted result of CH₄ gas concentration for the t-SNE_VAE_bi-LSTM, ARIMA, and CHAOS models from January 13, 2020 to February 12, 2020

Full size table

Figure 10 depicts a comparison of the standard deviation of CH₄, CO₂, CO, O₂, and H₂ gas concentrations for the proposed model, ARIMA, and CHAOS models. For CH₄ prediction, the standard deviation of the proposed model, ARIMA, and CHAOS models was found to be 5.05%, 8.88%, and 8.89%, respectively. Similarly, CO₂ was found to be 4.48%, 5.65%, and 5.99%; CO was found to be 4.28%, 4.30%, and 4.62%; O₂ was found to be 10.85%, 43.0%, and 43.51%; and H₂ was found to be 6.94%, 8.49%, and 8.25%. Thus, when compared to the ARIMA and CHAOS models, the proposed model has a lower standard deviation. Consequently, to achieve accuracy, the proposed model beats the ARIMA and CHAOS models.

Mean square error (MSE) and mean absolute error (MAE) results are described for the correlated prediction values of the proposed t-SNE_VAE_bi-LSTM model with ARIMA and CHAOS models. Table 7 gives MSE and MAE results of the t-SNE VAE_bi-LSTM model with ARIMA and CHAOS models. The MSE results of CH₄, CO₂, CO, O₂, and H₂ were 0.077, 0.998, 0.077, 0.298 and 0.233, respectively, for the proposed t_SNE_VAE_bi-LSTM model. The MSE results of CH₄, CO₂, CO, O₂, and H₂ were 0.106, 1.035, 0.169, 2.179, and 1.468, respectively, for the ARIMA model. The MSE results of CH₄, CO₂, CO, O₂, and H₂ were 0.146, 1.017, 0.169, 2.190, and 1.433, respectively, for the CHAOS model. The MAE results of CH₄, CO₂, CO, O₂ and H₂ were 0.369, 1.018, 0.296, 0.58 and 0.549, respectively, for the proposed t_SNE_VAE_bi-LSTM model. The MAE results of CH₄, CO₂, CO, O₂, and H₂ were 0.489, 1.082, 0.412, 1.476, and 1.211, respectively, for the ARIMA model. The MAE results of CH₄, CO₂, CO, O₂, and H₂ were 0.510, 1.091, 0.411, 1.480, and 1.191, respectively, for the CHAOS model. Figures 11 and 12 represent the comparative analysis of MSE and MAE results of the proposed t-SNE_VAE_bi-LSTM, ARIMA, and CHAOS models. Figure 11 indicates that the MSE result of the proposed t-SNE VAE bi-LSTM model for the CH4 forecasted value is less than 0.029 and 0.069 for the ARIMA and CHAOS models, respectively. Similarly, the proposed t-SNE VAE bi-LSTM model outperformed the ARIMA and CHAOS models by 0.037 and 0.019 for CO₂; 0.092, 0.092 for CO; 1.881, 1.892 for O₂; 1.235, 1.200 for H₂. Figure 12 shows that the MAE result of the proposed t-SNE_VAE_bi-LSTM model is less than 0.120 and 0.141 for ARIMA and CHAOS models, respectively, CH₄ predicted value. Similarly, the proposed t-SNE_VAE_bi-LSTM model's MAE result was 0.064 and 0.073 lower than the ARIMA and CHAOS models for CO₂; 0.116 and 0.115 for CO; 0.896 and 0.900 for O₂; 0.662 and 0.642 for H₂. The prediction accuracy is increased if the value of MSE and MAE is decreased. Figures 11 and 12 clearly show that the t-SNE_VAE_bi-LSTM model has less MSE and MAE value than ARIMA and CHAOS models. Thus, the proposed t-SNE_VAE_bi-LSTM model has achieved better prediction accuracy than ARIMA and CHAOS models.

Table 7 MSE and MAE values of the proposed model, ARIMA and CHAOS models in the forecasting phase

Full size table

The proposed t-SNE_VAE_bi-LSTM model was trained using CH₄, CO₂, CO, O₂, and H₂ gas concentration from the dataset and the respective correlation value for each concentration of gases. Figure 13 depicts the actual measured value versus before and after the correlated prediction of CH₄, CO₂, CO, O₂, and H₂ gas concentrations using the t-SNE_VAE_bi-LSTM, ARIMA, and CHAOS models over 739 h in the forecasting phase, where the X-axis represents the time interval (h), and the Y-axis represents the individual gas level.

Mean square error (MSE) and mean absolute error (MAE) results are described before correlation and after correlation processes of the proposed t-SNE_VAE_bi-LSTM model. Table 8 gives MSE and MAE results before correlation and after correlation processes of forecasting value of 5 gases. The MSE results of CH₄, CO₂, CO, O₂, and H₂ were 0.094, 1.073, 0.077, 0.333, and 0.240, respectively, before correlation. After correlation, the MSE results of CH4, CO2, CO, O2, and H2 were 0.077, 0.998, 0.077, 0.298, and 0.233, respectively. The MAE results of CH₄, CO₂, CO, O₂, and H₂ were 0.378, 1.053, 0.296, 0.596, and 0.550, respectively, before correlation. After correlation, the MAE results of CH₄, CO₂, CO, O₂, and H₂ were 0.369, 1.018, 0.296, 0.581, and 0.549, respectively. Figures 14 and 15 represent the comparative analysis of MSE and MAE results before and after correlation processes for the t-SNE_VAE_bi-LSTM based forecasting model. Figure 14 shows that MSE results after the correlation process were 0.017, 0.075, 0.000, 0.035, and 0.007 less than before the correlation for CH₄, CO₂, CO, O₂, and H₂ forecasting results t-SNE_VAE_bi-LSTM model. Figure 15 depicts that MAE results after the correlation process were 0.009, 0.035, 0.000, 0.015, and 0.001, less than before correlation for CH₄, CO₂, CO, O₂, and H₂ forecasting results, respectively, by t-SNE_VAE_bi-LSTM model. The fewer MSE and MAE results increased the efficiency of the t-SNE_VAE_bi-LSTM model in the forecasting phase. Figures 14 and 15 clearly show that MSE and MAE results after correlation were less than before correlation. Thus, after correlating five gas concentrations, the t-SNE_VAE_bi-LSTM model has achieved batter accuracy in the prediction process.

Table 8 MSE and MAE values of the t-SNE_VAE_bi-LSTM model before and after correlations

Full size table

7 Conclusions

A novel prediction technique has been proposed to predict the CH₄, CO₂, CO, O₂, and H₂ concentration of gases in the enclosed region in the underground coal-mining site. The said five gas values had been correlated during training phases. The correlation among five gas concentrations increased the forecasting precision of the proposed model. Before the training process, the five gas concentrations were preprocessed using the t-SNE algorithm, reducing the dimension of the gas concentrations. The preprocess input data were given to the VAE layer, where the essential features were extracted from the input data. It has improved the efficiency of the prediction model. The output data of VAE were sent to the bi-LSTM model, where the actual forecasting model was trained to predict the sealed-off area's gas concentration. The forwarding and backward direction in bi-LSTM efficiently handled the time interval value and increased the forecasting precision. The forecasting value shows that the proposed model has fewer predicted MSE and MAE values than ARIMA and CHAOS models. Thus, the proposed model may be utilized for online monitoring and predicting concentrations of gases in the enclosed region of the underground coal mining site.

Future works include predicting other mine hazards, like roof fall in underground mines, slope failure in opencast mines, etc.

Data availability

The datasets created and evaluated during the present study have included electronic supplementary material and will be available from the corresponding author upon reasonable request.

References

Abd Elaziz M, Abualigah L, Attiya I (2021) Advanced optimization technique for scheduling IoT tasks in cloud-fog computing environments. Future Gener Comput Syst 124:142–154
Article Google Scholar
Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin, pp 1–165
Book Google Scholar
Abualigah L, Diabat A, Mirjalili S, AbdElaziz M, Gandomi AH (2021a) The arithmetic optimization algorithm. Comput Methods Appl Mech Eng 376:113609. https://doi.org/10.1016/j.cma.2020.113609
Article MathSciNet MATH Google Scholar
Abualigah L, Yousri D, AbdElaziz M, Ewees AA, Al-qaness MA, Gandomi AH (2021b) Aquila optimizer: A novel meta-heuristic optimization algorithm. Comput Ind Eng 157:107250. https://doi.org/10.1016/j.cie.2021.107250
Article Google Scholar
Altabeeb AM, Mohsen AM, Abualigah L, Ghallab A (2021) Solving capacitated vehicle routing problem using cooperative firefly algorithm. Appl Soft Comput 108:107403. https://doi.org/10.1016/j.asoc.2021.107403
Article Google Scholar
Brooks W, Corsi S, Fienen M, Carvin R (2016) Predicting recreational water quality advisories: A comparison of statistical methods. Environ Model Softw 76:81–94
Article Google Scholar
Cao J, Li WP (2017) Numerical simulation of gas migration into mining-induced fracture network in the goaf. Int J Mining Sci Techno 27(4):681–685
Article Google Scholar
Chaulya SK, Prasad GM (2016) Sensing and monitoring technologies for mines and hazardous areas. Elsevier, USA
Google Scholar
Chen L, Wang E, Feng J, Kong X, Li X, Zhang Z (2016) A dynamic gas emission prediction model at the heading face and its engineering application. J Nat Gas Sci Eng 30:228–236
Article Google Scholar
Chen T, Xu R, He Y, Wang X (2017) Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN. Expert Syst Appl 72:221–230
Article Google Scholar
Cheng J, Bai JY, Qian JS et al (2008) Short-Term Forecasting Method of Coalmine Gas Concentration Based on Chaotic Time Series. J China U Min Techno 37(2):231–235
Google Scholar
Collobert R, Weston J (2008) A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th international conference on machine learning, USA, pp 160–167
Deng J, Lei C, Xiao Y, Cao K, Ma L, Wang W, Laiwang B (2018) Determination and prediction on ¨three zones¨ of coal spontaneous combustion in a gob of fully mechanized caving face. Fuel 211:458–470
Article Google Scholar
Dey P, Chaulya SK, Kumar S (2021a) Hybrid CNN-LSTM and IoT-based coal mine hazards monitoring and prediction system. Process Saf Environ Prot 125:249–263
Article Google Scholar
Dey P, Chaulya SK, Kumar S (2021b) Secure decision tree twin support vector machine training and classification process for encrypted IoT data via blockchain platform. Concurr Comput 33:16. https://doi.org/10.1002/cpe.6264
Article Google Scholar
Dey P et al (2021c) Deep convolutional neural network based secure wireless voice communication for underground mines. J Ambient Intell Hum Comput. https://doi.org/10.1007/s12652-020-02700-w
Article Google Scholar
Eid A, Kamel S, Abualigah L (2021) Marine predators algorithm for optimal allocation of active and reactive power resources in distribution networks. Neural Comput Appl. https://doi.org/10.1007/s00521-021-06078-4
Article Google Scholar
Fan Y, Wen G, Li D, Qiu S, Levine MD, Xiao F (2020) Video anomaly detection and localization via gaussian mixture fully convolutional variational autoencoder. Comput Vis Image Underst 102920
Fooladgar E, Duwig C (2018) A new post-processing technique for analyzing high-dimensional combustion data. Combust Flame 191:226–238
Article Google Scholar
Guo JH, Cheng ZH, Kong WY (2018) Establishment and application of mathematical prediction model of gas emission rate in fully mechanized coal face. Coal Eng 50:109–113
Google Scholar
Hassan MH, Kamel S, Abualigah L, Eid A (2021) Development and application of slime mould algorithm for optimal economic emission dispatch. Expert Syst Appl 182:115205. https://doi.org/10.1016/j.eswa.2021.115205
Article Google Scholar
Hati S, Dey P, De D (2019) WLAN based energy efficient smart city design. Microsyst Technol 25(5):1599–1612
Article Google Scholar
He K, et al. (2016) Deep residual learning for image recognition. In: Proceedings of IEEE conference on computer vision and pattern recognition, USA, pp 770–778
Huang CJ, Kuo PH (2018) A deep CNN-LSTM model for particulate matter (PM2.5) forecasting in smart cities. Sensors 18(7):2220. https://doi.org/10.3390/s18072220
Article Google Scholar
Jiang P, Craig P, Crosky A, Maghrebi M, Canbulat I, Saydam S (2018) Risk assessment of failure of rock bolts in underground coal mines using support vector machines. Appl Stoch Models Bus Ind 34(3):293–304
Article MathSciNet Google Scholar
Jo B, Khan RMA (2018) An internet of things system for underground mine air quality pollutant prediction based on azure machine learning. Sensors 18(4):930. https://doi.org/10.3390/s18040930
Article Google Scholar
Karaca F, Nikov A, Alagha O (2006) NN-AirPol: a neural-networks-based method for air pollution evaluation and control. Int J Environ Pollut 28(3–4):310–325
Article Google Scholar
Kingma DP, Welling M (2013) Auto-encoding variational bayes. arXiv preprint arrXiv:1312.6114
Kingma DP, Salimans T, Welling M (2015) Variational dropout and the local reparameterization trick. In: Proceedings of advances in neural information processing systems, pp 2575–2583
Kumar A et al (2013) Application of gas monitoring sensors in underground coal mines and hazardous areas. Int J Comput Electr Eng 3(3):9–23
Google Scholar
Kumari K et al (2021) UMAP and LSTM based fire status and explosibility prediction for sealed-off area in underground coal mine. Process Saf Environ Prot 146:837–852
Article Google Scholar
Kun L, Ling-Kai Y, Mei-Ling Z, Jian C (2016) Coalmine gas concentration analysis based on support vector machine. In: Proceedings of the IEEE 3rd international conference on information science and control engineering (ICISCE), Beijing, China, pp 257–261
Liu Z (2010) Chaotic time series analysis. Math Probl Eng 1–2:1–31. https://doi.org/10.1155/2010/720190
Article MathSciNet Google Scholar
Liu Y, Park RJ, Jacob DJ, Li Q, Kilaru V, Sarnat JA (2004) Mapping annual mean ground-level PM2.5 concentrations using multiangle imaging spectroradiometer aerosol optical thickness over the contiguous United States. J Geophys Res Atmos 109(D22):5025. https://doi.org/10.1029/2004JD005025
Article Google Scholar
Lyu P, Chen N, Mao S, Li M (2020) LSTM based encoder-decoder for short-term predictions of gas concentration using multi-sensor fusion. Process Saf Environ Protect 137:93–105
Article Google Scholar
Ma F, Li Y, Zhang C, Gao J, Du N, Fan W (2019) Mcvae: Margin-based conditional variational autoencoder for relation classification and pattern generation. In: Proceedings of the World Wide Web conference, pp 3041–3048
Maaten LVD, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605
MATH Google Scholar
Mandal R et al (2013) Application of programmable logic controller for gases monitoring in underground coal mines. Engg Sci and Techno 3(3):516–522
Google Scholar
McKeen S, Chung SH, Wilczak J et al (2007) Evaluation of several PM2.5 forecast models using data collected during the ICARTT/NEAQS 2004 field study. J Geophys Res Atmos 112(D10):7608. https://doi.org/10.1029/2006JD007608
Article Google Scholar
Mine Safety and Health Administration (MSHA) (2018). Accident/Illness Investigations Procedures. https://arlweb.msha.gov/READROOM/HANDBOOK/PH11-I-1.pdf. Accessed 19 Feb 2018
Muduli L, Mishra DP, Jana PK (2019) Wireless sensor network based underground coal mine environmental monitoring using machine learning approach. In: Proceedings of the 11th international conference of mine ventilation congress. Springer, Singapore, pp 776–786
Ogawa A, Hori T (2017) Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks. Speech Commu 89:70–83
Article Google Scholar
Osarogiagbon AU, Khan F, Venkatesan R, Gillard P (2020) Review and analysis of supervised machine learning algorithms for hazardous events in drilling operations. Process Saf Environ Prot 147:367–384
Article Google Scholar
Qiang S, Pu Y (2018) Short-term power load forecasting based on support vector machine and particle swarm optimization. J Algorithm Comput Technol 13:1748301818797061. https://doi.org/10.1177/1748301818797061
Article MathSciNet Google Scholar
Qiao G, Zeng J (2011) An underground mobile wireless sensor network routing protocol for the coal mine environment. J Comp Info Sys 7(7):2487–2495
Google Scholar
Rashid B, Rehmani MH (2016) Applications of wireless sensor networks for urban areas: A survey. J Netw Comput Appl 60:192–219
Article Google Scholar
Rekhi JK, Nagrath P, Jain R (2020) Forecasting Air Quality of Delhi Using ARIMA Model. In: Proceedings of the advances in data sciences, security and applications. Springer, Singapore, pp 315–325
Rodriguez G, Dorado AD, Fortuny M, Gabriel D, Gamisans X (2014) Biotrickling filters for biogas sweetening: Oxygen transfer improvement for a reliable operation. Process Saf Environ Prot 92(3):261–268
Article Google Scholar
Saeed N, Alouini MS, Al-Naffouri TY (2019) Towards the internet of underground things: A systematic survey. IEEE Commun Surv Tutor 21(4):3443–3466
Article Google Scholar
Şahin CB, Abualigah L (2021) A novel deep learning-based feature selection model for improving the static analysis of vulnerability detection. Neural Comput Appl. https://doi.org/10.1007/s00521-021-06047-x
Article Google Scholar
Sharafati A, Asadollah SBHS, Hosseinzadeh M (2020) The potential of new ensemble machine learning models for effluent quality parameters prediction and related uncertainty. Process Saf Environ Prot 140:68–78
Article Google Scholar
Song Y, Yang S, Hu X, Song W, Sang N, Cai J, Xu Q (2019) Prediction of gas and coal spontaneous combustion coexisting disaster through the chaotic characteristic analysis of gas indexes in goaf gas extraction. Process Saf Environ Prot 129:8–16
Article Google Scholar
Sundermeyer M, Schlüter R, Ney H (2012) LSTM neural networks for language modeling. In: Proceedings of the 13th annual conference of the international speech communication association, Portland, Oregon September 9–13, USA
Tutubalina E, Miftahutdinov Z, Nikolenko S, Malykh V (2018) Medical concept normalization in social media posts with recurrent neural networks. J Biomed Inf 84:93–102
Article Google Scholar
Wang H, Cheng Y, Yuan L (2013) Gas outburst disasters and the mining technology of key protective seam in coal seam group in the Huainan coalfield. Nat Hazards 67(2):763–782
Article Google Scholar
Xi X, Wei Z, Xiaoguang R, Yijie W, Xinxin B, Wenjun Y, Jin D (2015) A comprehensive evaluation of air pollution prediction improvement by a machine learning method. In: Proceedings of the IEEE international conference on service operations and logistics, and informatics, Tunisia, pp 176–181
Xia T, Zhou F, Wang X, Zhang Y, Li Y, Kang J, Liu J (2016) Controlling factors of symbiotic disaster between coal gas and spontaneous combustion in longwall mining gobs. Fuel 182:886–896
Article Google Scholar
Xia TQ, Zhou FB, Wang XX et al (2017) Safety evaluation of combustion-prone longwall mining gobs induced by gas extraction: A simulation study. Process Saf Environ Prot 109:677–687
Article Google Scholar
Yang ML, Xue YX, Jiang YD et al (2009) Study on pattern of gas emission at fully-mechanized coal face in Liliu mining area. J China Coal Soc 34:1349–1353
Google Scholar
Ye Q, Lin BQ, Jiang WZ (2006) The study of methane law in coal mining face. China Min Mag 5:38–41
Google Scholar
Zhang JY, Cheng J, Hou YH et al (2007) Forecasting coalmine gas concentration based on adaptive neuro-fuzzy inference system. J China U Min Techno 4:494–498
Google Scholar
Zhang S, Wang B, Li X, Chen H (2016) Research and application of improved gas concentration prediction model based on grey theory and BP neural network in digital mine. Procedia CIRP 56:471–475
Article Google Scholar
Zhang Z, Jiang T, Zhan C, Yang Y (2019) Gaussian feature learning based on variational autoencoder for improving nonlinear process monitoring. J Proc Control 75:136–155
Article Google Scholar
Zhao Y, Yang R, Chevalier G, Shah RC, Romijnders R (2018) Applying deep bidirectional LSTM and mixture density network for basketball trajectory prediction. Optik 158:266–272
Article Google Scholar
Zhao L, Dai T, Qiao Z, Sun P, Hao J, Yang Y (2020) Application of artificial intelligence to wastewater treatment: a bibliometric analysis and systematic review of technology, economy, management, and wastewater reuse. Process Saf Environ Prot 133:169–182
Article Google Scholar
Zheng CS, Jiang BY, Xue S, Chen ZW, Li H (2019) Coalbed methane emissions and drainage methods in underground mining for mining safety and environmental benefits: a review. Process Saf Environ Prot 127:103–124
Article Google Scholar

Download references

Acknowledgements

The authors would like to express their appreciation to Dr. Pradeep K. Singh, Director of the CSIR-Central Institute of Mining and Fuel Research in Dhanbad, India, for publishing this work. The authors would also like to acknowledge the Ministry of Electronics and Information Technology of the Government of India for funding this research under Grant No. 13(8)/2015-CC&BT.

Author information

Authors and Affiliations

CSIR-Central Institute of Mining and Fuel Research, Dhanbad, 826001, India
Prasanjit Dey, K. Saurabh, C. Kumar, D. Pandit, S. K. Chaulya, S. K. Ray, G. M. Prasad & S. K. Mandal

Authors

Prasanjit Dey
View author publications
You can also search for this author in PubMed Google Scholar
K. Saurabh
View author publications
You can also search for this author in PubMed Google Scholar
C. Kumar
View author publications
You can also search for this author in PubMed Google Scholar
D. Pandit
View author publications
You can also search for this author in PubMed Google Scholar
S. K. Chaulya
View author publications
You can also search for this author in PubMed Google Scholar
S. K. Ray
View author publications
You can also search for this author in PubMed Google Scholar
G. M. Prasad
View author publications
You can also search for this author in PubMed Google Scholar
S. K. Mandal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. K. Chaulya.

Ethics declarations

Conflict of interest

The authors claim that they do not have any conflicts of interest.

Ethical approval

There are no investigations involving human subjects or animals done by any of the writers in this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 40 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dey, P., Saurabh, K., Kumar, C. et al. t-SNE and variational auto-encoder with a bi-LSTM neural network-based model for prediction of gas concentration in a sealed-off area of underground coal mines. Soft Comput 25, 14183–14207 (2021). https://doi.org/10.1007/s00500-021-06261-8

Download citation

Accepted: 10 September 2021
Published: 05 October 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s00500-021-06261-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

t-SNE and variational auto-encoder with a bi-LSTM neural network-based model for prediction of gas concentration in a sealed-off area of underground coal mines

Abstract

Similar content being viewed by others

Gas concentration prediction based on ED-SLSTM model under the framework of Trend Prediction-Time Point Prediction

Short-term natural gas load forecasting based on EL-VMD-Transformer-ResLSTM

Data-Driven Modeling for the Prediction of Stack Gas Concentration in a Coal-Fired Power Plant in Türkiye

1 Introduction

2 Related works

3 System description

4 Problem scenario