1 Introduction

Data is one of the most valuable treasures in the world, forming the basis of different branches of computer science. Data refers to any set of objects with organized features, or to specific characteristics of an object or collection thereof and their features. Data can be of different types and can be obtained by observation, search, or recording (Alkaim & Al-Janabi 2020). In general, researchers dealing with the concept called data science work in three domains, related to data, intelligence, and statistics (Buyya et al. 2016). Data science can be divided into three fields, viz. small, normal, and big/huge data. Small data is organized into uniform structures such as tables or lists containing no more than 30 samples; such data cannot be assumed to follow the normal distribution and is thus of limited use for decision-making. On the other hand, normal data is also structured but does follow the normal distribution and is thus useful for making different types of decisions, such as clustering, classification, prediction, optimization, etc. Finally, big data can be of different types, such as structured, semistructured, or unstructured, with size ranging from 1 TB to 1 ZB. Extraction of useful knowledge or patterns from big data can be achieved by combining the two main concepts of machine learning and cloud computing.

Deep learning is a branch of modern science that considers multilevel learning processes, where learning is applied at each level for a specific part of the problem and aggregation of the corresponding results enables the overall problem to be solved. It is thus classified as a branch of artificial intelligence (Liu et al. 2019).

Prediction is a type of decision-making technique where future events are forecast based on historical information. Among the three types of prediction technique, viz. traditional (offering accuracy), self (offering speed), and intelligent (offering both speed and accuracy), this work relates to the last of these (Al-Janabi et al. 2015).

Increasing air pollution caused by technological development represents one of the most important challenges facing the world today. It can be categorized into several classes depending on its origin, viz. pollution due to living organisms such as bacteria and fungi in environments such as water, air, or soil; chemical air pollution due to an imbalance in the ecosystem resulting from chemical effects, in the form of solid particles, liquid droplets, or gases; and technological pollution, due to a change in the balance between the components of an ecosystem that prevents its efficient operation and its ability to perform its natural role in the disposal of pollutants.

2 Related work

The issue of air quality prediction is one of the critical topics related to human lives and health. The aim of the work presented herein is to develop a new method for such prediction based on the huge amount of available data, operating on data series. This section first reviews previous studies by researchers in this area and compares them based on the database used in each case, the methods applied to assess the results, the advantages of each method, and its limitations.

Ong et al. (2015) used a deep recurrent neural network (DRNN) reinforced with a novel pretraining system using an autoencoder, principally designed for time-series prediction. Moreover, the sensors were chosen within the DRNN without degrading the accuracy of the predictions by considering the sparsity of the system. This method was applied to the prediction of air pollution, in particular for PM2.5 particulate matter concentration, offering more accurate results compared with the poor performance achieved using the noise reduction approach. The results were evaluated using four measures, viz. the root-mean-square error (RMSE), precision (P), recall (R), and F measure. The work presented herein is similar in that it uses the same technique (RNN), albeit based on the LSTM approach.

Al-Janabi et al. (2015) applied a hybrid system using genetic neural computing (GNC) to analyze and understand data corresponding to the concentration of dissolved gases, clustered into four subgroups based on the IEEE C57.104 specification using a genetic algorithm (GA). The clustered data were then input to the neural network to predict the different types of errors. The hybrid system generates decision rules which identify the error accurately. Two measures were used in that work, viz. the Davies–Bouldin (DB) index and the mean square error (MSE). The results indicated that the problem could be solved at lower cost and that the described method facilitated the prediction process and enabled a more accurate approach through the analysis of errors and ways to address them. This work is similar to that presented herein in that it uses neural networks, while the difference lies in the use of the PSO algorithm combined with LSTM.

Li et al. (2016) described an air quality prediction method based on a spatiotemporal deep learning (STDL) model. A stacked autoencoder (SAE) method was applied to extract inherent air quality characteristics, being trained using a greedy layerwise method. In comparison with traditional time-series prediction models, the described model could predict the air quality at all stations at the same time and exhibited temporal stability across all seasons. In addition, a comparison with the spatiotemporal artificial neural network (STANN), autoregression moving average (ARMA), and support vector regression (SVR) models was presented. The results of the model were evaluated using three measures, viz. RMSE, mean absolute error (MAE), and mean absolute percentage error (MAPE). The work presented herein is similar in that the same technique (RNN) is applied to predict the air quality indexes, but now dealing with huge data and also applying the LSTM approach to enhance the operation of the network.

Li et al. (2017) used a long short-term memory extended (LSTME) neural network model with combined spatial–temporal links to predict concentrations of air pollutants. In that approach, the LSTM layers automatically extract potential intrinsic properties from historical air pollutant and accompanying data, while meteorological data and timestamp data are also incorporated into the proposed model to improve its performance. The technique was evaluated using three measures (RMSE, MAE, and MAPE) and compared with the STANN, ARMA, and SVR models. The work presented herein is similar in its use of the LSTM approach as part of a recurrent neural network structure but differs in its use of another evaluation measure.

Ghoneim and Manjunatha (2017) described a new prediction model based on deep learning for ozone levels, considering pollution and weather correlations in an integrated fashion. This deep learning model was used to learn ozone level features and was trained using a grid search technique. A deep architecture model was utilized to represent the ozone level features for the predictions. Experiments demonstrated that the proposed method offered superior performance for ozone level predictions. The results of this study could be helpful for predicting ozone level pollution in Aarhus City as a model for smart cities, to improve the accuracy of ozone forecasting tools. The results of the model were evaluated based on the RMSE, MAE, MAPE, R2, and the correlation coefficient. The work presented herein also uses a memory (LSTM in this case) for processing of large data, but differs in that the optimal structure of the neural network is found by applying a PSO algorithm.

Lifeng et al. (2018) reported that the best predictions of air quality could be obtained using the GM(1,1) model with fractional order accumulation, i.e., FGM(1,1), to find the expected average annual concentrations of PM2.5, PM10, SO2, NO2, 8-h O3, and 24-h O3. The measure used in that work was the MAPE. Application of the FGM(1,1) method resulted in much better performance compared with the traditional GM(1,1) model, revealing that the average annual concentrations of PM2.5, PM10, SO2, NO2, 8-h O3, and 24-h O3 will decrease from 2017 to 2020. The work presented herein is similar in that it predicts the concentration of air pollutants and finds ways to address them, but differs in its use of the LSTM method for the predictions.

Popoola et al. (2018) considered sensor measurements including SNAQ boxes and network deployment, sensor measurement validation, and source apportionment to build a predictive model for the ADMS-Airport tool, using the concentration of pollutants to determine the air quality model. The results showed that such a method can be applied in many environments that suffer from air pollution, potentially reducing the health effects of reduced air quality and decreasing cost, as well as for monitoring of greenhouse-gas emissions. The work presented herein is similar in that the concentration of air pollutants is determined, but differs in its use of the LSTM RNN method.

For effective extraction of spatiotemporal features, Wen et al. (2019) combined a convolutional neural network (CNN) and LSTM neural network (NN), as well as meteorological and aerosol data, to refine the prediction performance of the model. Data collected from 1233 air quality monitoring stations in Beijing and the whole of China were used to verify the effectiveness of the proposed model (C-LSTME). The results showed that the model achieved better performance than state-of-the-art technologies for predictions over different durations at various regional and environmental scales. The technique was evaluated using three measures (RMSE, MAE, and MAPE). In comparison, the LSTM approach is also applied in a RNN in this work, but after having identified the best structure for the network. In addition, another evaluation measure is used herein.

Shang et al. (2019) described a prediction method based on a classification and regression tree (CART) approach in combination with the ensemble extreme learning machine (EELM) method. Subgroups were created by dividing the datasets using a shallow hierarchy tree through the CART approach. At each node of the tree, EELM models were constructed using the training samples of the node, to minimize the verification errors sequentially in all of the subtrees of each tree by identifying the number of hidden neurons, where each node is considered to be a root. Finally, the EELM models along each path from a root to a leaf are compared, selecting only the path with the smallest error for that leaf. The measures used in that work were the RMSE and MAPE. The experimental results revealed that such a method can address the issue of global–local duplication of the prediction method at each leaf and that the combined CART–EELM approach worked better than the random forest (RF), ν-SVR, and EELM models, while also showing superior performance compared with seasonal EELM or k-means-EELM. The work presented herein is similar in that it uses the same set of six air pollution indexes (PM2.5, O3, PM10, SO2, NO2, CO) but differs in terms of the mechanism applied to reduce air pollutants, applying the RNN method.

Li et al. (2019) applied a new air quality forecasting method and proposed a new positive analysis mechanism that includes complex analysis, improved prediction units, data pretreatment, and air quality control problems. The system analyzes the original series using an entropy model and a data processing process. The multiobjective multiverse optimization (MOMVO) algorithm is used to achieve the required performance, revealing that the least-squares support vector machine (LSSVM) achieved the best accuracy in addition to stable predictions. Three measures were used for the evaluation in that work, viz. RMSE, MAE, and MAPE. The results of the application of the proposed method to the dataset revealed good performance for the analysis and control of air quality, in addition to the approximation of values with high precision. The work presented herein uses the same evaluation measures but differs in its use of the LSTM approach in the RNN after identifying the best structure for the network.

Table 1 presents a comparison of the cited previous works based on the type of dataset considered, the methodology used, the evaluation measures applied, and the advantages offered.

Table 1 Comparison of previous works

3 Main concept

3.1 Big data

This term is commonly used today due to the abundance and diversity of data sources, which lead to difficulty in dealing with the resulting data because they may be unorganized and require large storage systems. Big data was first defined by Douglas Laney based on the 3Vs, viz. volume, velocity, and variety, a definition widely cited since 2001, although many have tried to increase the number of Vs to 4, 5, 6, and even 11. Big data has also been defined from an application perspective, emphasizing its different applications based on the different types of data; for instance, Barry Devlin defined it as the application of process-mediated data, human-sourced information, and machine-generated data. Shaun Connolly focused on analyzing transactions, interactions, and observations of data, seeking insights using big data technology; this type of definition is oriented toward new technological developments such as MapReduce, bulk synchronous parallel (BSP) computing such as Hama, resilient distributed datasets (RDDs) such as Spark, and the Lambda architecture such as Flink (Buyya et al. 2016) (Fig. 1).

Fig. 1 Big data

3.2 Big data analysis stages

Data analysis is the process of inspecting, transforming, and modeling data with the goal of discovering useful information (Al-Janabi & Alkaim 2019), informing conclusions, and supporting decision-making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names while being used in different business, science, and social science domains. In today’s business, data analysis plays a role in making decision-making more scientific and helping businesses operate effectively (Buyya et al. 2016) (Fig. 2).

Fig. 2 Big data analysis stages

3.3 Deep learning

Deep learning is a new area of machine learning which has gained popularity in the recent past. Deep learning refers to architectures that contain multiple hidden layers (deep networks) to learn different features at multiple levels of abstraction. Deep learning algorithms seek to exploit the unknown structure in the input distribution to discover useful representations, often at multiple levels, where higher-level learned features are defined in terms of lower-level features (Ali et al. 2019).

3.4 Prediction

Prediction can be defined as the task of data analysis to predict unknown values of target features. It includes a classification task for class label prediction and numerical prediction whose aim is to predict continuous or ordered values. The type of target attribute specifies whether the problem is classification with binary values or numerical prediction with continuous values. Many statistical methodologies have been used for such numerical prediction, among which regression analysis is most often applied (Basavaraju et al. 2019) (Fig. 3).

Fig. 3 Main types of machine learning techniques (Al-Janabi & Mahdi 2019)

3.5 Air pollution

Air pollution remains a serious concern and has attracted attention from industries, governments, as well as the scientific community. One type of air pollutant that has attracted immense attention is fine particulate matter. PM2.5 is a widespread air pollutant, consisting of a mixture of solid and liquid particles suspended in the air, in addition to PM10 and O3 as other types of air pollution. Air pollution is a global issue that transcends geographical boundaries and calls for an interdisciplinary approach to solve a global problem. Thus, forecasting concentrations of air pollutants is an effective method for protecting public health by providing early warnings of harmful air pollutants (Liu et al. 2019).

4 Building the DLSTM-PSO model

In this section, an effective prediction model is built in four stages, as outlined in the sketch below. The first stage involves dataset preprocessing, including data collection, splitting, handling of missing values, and normalization of the dataset. In the second stage, the PSO algorithm is applied to identify the best structure for the LSTM network, including determination of the best weights, biases, number of hidden layers, number of nodes in each hidden layer, and activation function. In the third stage, the prediction model (called DLSTM-PSO) is built to predict the concentrations of the six pollutants considered. The final stage is the evaluation of the results based on the symmetric mean absolute percentage error (SMAPE) and 10-fold cross-validation.
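A high-level outline of how these four stages could be chained is sketched below; the function names are hypothetical placeholders for the stages just described, not the authors' actual implementation.

```python
# Hypothetical outline of the four DLSTM-PSO stages described above.
# All function names are illustrative placeholders.

def preprocess(raw_records):
    """Stage 1: split by station, drop missing values, normalize to [0, 1]."""
    ...

def dsn_ps_search(train_data):
    """Stage 2: PSO search for weights, biases, layer count, nodes, activation."""
    ...

def build_dlstm(structure):
    """Stage 3: build the LSTM predictor from the structure found by PSO."""
    ...

def evaluate_smape(model, test_data):
    """Stage 4: score the 48-h predictions with SMAPE and cross-validation."""
    ...

def run_pipeline(raw_records):
    train, test = preprocess(raw_records)
    structure = dsn_ps_search(train)
    model = build_dlstm(structure)
    return evaluate_smape(model, test)
```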

First, the main details of the model are presented, being based on the following assumptions:

  • The air quality data file contains the concentrations of several major air pollutants: PM2.5 (μg/m3), PM10 (μg/m3), and NO2 (μg/m3); in addition, the concentrations of CO (mg/m3), O3 (μg/m3), and SO2 (μg/m3) from Beijing in 2018 are provided.

  • All points with NA or negative values in the PM2.5, PM10, or O3 data are considered to be invalid and are dropped from the truth file. For example, the record [2957631, CT3, 2018-04-13 00:00:00, 24.0, , 15.7, ] has no PM2.5 value and thus is dropped from the scoring matrix, even though it includes PM10 data (a minimal check implementing this rule is sketched after Fig. 4).

  • The PM2.5 limit is taken as 10 μg/m3 (average allowable value per year) or 25 μg/m3 (average allowable value in 24 h).

  • The PM10 limit is taken as 20 μg/m3 (average allowable value per year) or 50 μg/m3 (average allowable value in 24 h).

  • The O3 limit is taken as 100 μg/m3 (average allowable value in 8 h). The recommended maximum value, previously set at 120 μg/m3 in 8 h, has been reduced to 100 μg/m3 based on recent findings of the relationships between daily mortality and ozone levels in locations where the concentration of this substance is less than 120 μg/m3.

  • The NO2 limit is taken as 40 μg/m3 (average allowable value per year) or 200 μg/m3 (average allowable value per hour).

  • The SO2 limit is taken as 20 μg/m3 (average allowable value in 24 h) or 500 μg/m3 (average allowable value in 10 min) (Fig. 4).

    Fig. 4 Block diagram of the proposed DLSTM-RNN
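As a minimal illustration of the validity rule and the limits listed in the assumptions above, the sketch below drops records with missing or negative PM2.5, PM10, or O3 values and flags readings above the short-term limits; the column names and the pandas-based layout are assumptions for illustration, while the thresholds themselves come from the assumptions.

```python
import pandas as pd

# Short-term limits from the assumptions above (ug/m3; CO omitted, given in mg/m3).
LIMITS = {"PM2.5": 25, "PM10": 50, "O3": 100, "NO2": 200, "SO2": 500}

def drop_invalid(df: pd.DataFrame) -> pd.DataFrame:
    """Remove rows whose PM2.5, PM10, or O3 value is missing or negative."""
    key_cols = ["PM2.5", "PM10", "O3"]
    mask = df[key_cols].notna().all(axis=1) & (df[key_cols] >= 0).all(axis=1)
    return df[mask]

def exceedances(df: pd.DataFrame) -> pd.DataFrame:
    """Boolean frame marking hourly readings above the short-term limits."""
    return pd.concat({p: df[p] > lim for p, lim in LIMITS.items()}, axis=1)
```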


4.1 Data preprocessing stage

As explained above, datasets were collected from two types of resources (i.e., from the directory of websites such as the KDD Cup 2018 dataset, or by building stations to capture concentrations). These datasets must be handled before using them to build predictive models. The dataset for each station was split and saved in a separate file containing the name of the station. Then, missing values were treated by dropping each row in which one or more values were lacking. Finally, normalization was applied to each column of the dataset related to each station, using the MinMaxScaler process on all the dataset (PM2.5, PM10, NO2, CO, O3, and SO2) to make the concentration values lie in the range [0, 1]. The main steps in this stage are described in Algorithm 2.
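A minimal sketch of this preprocessing stage is given below; it assumes the raw file is a CSV with a station_id column and the six pollutant columns (an assumption about the KDD Cup 2018 layout), and the exact steps of Algorithm 2 may differ.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

POLLUTANTS = ["PM2.5", "PM10", "NO2", "CO", "O3", "SO2"]

def preprocess(path: str) -> dict:
    """Split by station, drop rows with missing values, scale each column to [0, 1]."""
    raw = pd.read_csv(path)
    stations = {}
    for name, group in raw.groupby("station_id"):
        group = group.dropna(subset=POLLUTANTS).copy()     # drop rows lacking any value
        scaler = MinMaxScaler(feature_range=(0, 1))
        group[POLLUTANTS] = scaler.fit_transform(group[POLLUTANTS])
        group.to_csv(f"{name}.csv", index=False)            # one file per station
        stations[name] = group
    return stations
```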


4.2 Determined structure network-particle swarm (DSN-PS)

In this step, the structure of the LSTM is specified by PSO. During the PSO, the following three steps are repeated until the maximum epoch limit is reached or one of the stop conditions is met:

  • Calculate the value of the fit for each element among the particles;

  • Update the pBest appropriate values for each particle, as well as the best gBest general value;

  • Update the speed and position of each particle.

The aim of the PSO algorithm is to optimize the LSTM-RNN by specifying the optimal values of the weight, bias, number of hidden layers, number of nodes in each hidden layer, and activation function, as shown in the diagram in Fig. 5 and presented in Tables 2 and 3.
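The sketch below shows one way such a PSO loop could search over the number of hidden layers, nodes per layer, and activation function; the particle encoding and constants are illustrative assumptions rather than the authors' DSN-PS algorithm, and the fitness function is a stand-in for training an LSTM with the candidate structure and returning its validation error.

```python
import random

ACTIVATIONS = ["tanh", "sigmoid", "relu"]   # candidate activation functions (assumed set)

def fitness(layers, nodes, activation):
    """Stand-in: train an LSTM with this structure and return its validation error."""
    return random.random()  # replace with the real training/validation error

def dsn_ps(n_particles=20, epochs=50, w=0.7, c1=1.5, c2=1.5):
    # Each particle encodes (number of hidden layers, nodes per layer, activation index).
    pos = [[random.uniform(1, 5), random.uniform(8, 128), random.uniform(0, 2)]
           for _ in range(n_particles)]
    vel = [[0.0, 0.0, 0.0] for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [float("inf")] * n_particles
    gbest, gbest_fit = None, float("inf")

    for _ in range(epochs):
        for i, p in enumerate(pos):
            layers = max(1, int(round(p[0])))
            nodes = max(1, int(round(p[1])))
            act = ACTIVATIONS[int(round(p[2])) % len(ACTIVATIONS)]
            f = fitness(layers, nodes, act)              # 1) evaluate fitness
            if f < pbest_fit[i]:                         # 2) update pBest and gBest
                pbest_fit[i], pbest[i] = f, p[:]
            if f < gbest_fit:
                gbest_fit, gbest = f, p[:]
        for i in range(n_particles):                     # 3) update velocity and position
            for d in range(3):
                r1, r2 = random.random(), random.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
    return gbest, gbest_fit
```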

Fig. 5 Determination of the optimal parameters of the LSTM using PSO

Table 2 Hyperbolic Functions with description (Al-Janabi and Alwan 2017)
Table 3 Polynomial Functions with description (Al-Janabi and Alwan 2017)

4.3 Development of the long short-term memory (DLSTM) approach

The common LSTM module consists of a cell, an input gate, an output gate, and a forget gate. The cell remembers values over arbitrary time intervals, and the three gates regulate the flow of information into and out of the cell. LSTM networks are well suited to classifying, processing, and making predictions based on time-series data, where there may be unknown delays between important events in a time series. LSTMs were developed to deal with the vanishing gradient problem that can be encountered when training traditional RNNs. Relative insensitivity to gap length is an advantage of LSTM over traditional RNNs, hidden Markov models, and other sequence learning methods in many applications (Inácio et al. 2019).

An LSTM module (or cell) has five essential components, which allow it to model both long- and short-term data:

  • Memory cell: This represents the internal memory of the cell, which stores both short- and long-term memories.

  • Hidden state: This is the output state information calculated from the current input, previous hidden state, and current cell input, eventually being used to predict the future concentrations. Additionally, the hidden state can decide to retrieve only the short- or long-term or both types of memory stored in the cell state to make the next prediction.

  • Input gate: Decides how much information from the current input flows to the cell state.

  • Forget gate: Decides how much information from the current input and the previous cell state flows into the current cell state.

  • Output gate: Decides how much information from the current cell state flows into the hidden state.

4.3.1 The variables in LSTM–RNN

This algorithm requires multiple variables to be set at the beginning, then these are updated by applying computational operations, as shown below:

  • Step 1: The forward components

    Step 1.1: Compute the gates:

    Input activation:

    $$ a_{t} = \tanh \left( W_{a} X_{t} + U_{a} \text{out}_{t-1} + b_{a} \right) $$
    (1)

    Input gate:

    $$ i_{t} = \sigma \left( W_{i} X_{t} + U_{i} \text{out}_{t-1} + b_{i} \right) $$
    (2)

    Forget gate:

    $$ f_{t} = \sigma \left( W_{f} X_{t} + U_{f} \text{out}_{t-1} + b_{f} \right) $$
    (3)

    Output gate:

    $$ o_{t} = \sigma \left( W_{o} X_{t} + U_{o} \text{out}_{t-1} + b_{o} \right) $$
    (4)

    Then find:

    Internal state:

    $$ \text{state}_{t} = a_{t} \odot i_{t} + f_{t} \odot \text{state}_{t-1} $$
    (5)

    Output:

    $$ \text{out}_{t} = \tanh \left( \text{state}_{t} \right) \odot o_{t} $$
    (6)

    where

    $$ \text{gates}_{t} = \begin{bmatrix} a_{t} \\ i_{t} \\ f_{t} \\ o_{t} \end{bmatrix}, \quad W = \begin{bmatrix} W_{a} \\ W_{i} \\ W_{f} \\ W_{o} \end{bmatrix}, \quad U = \begin{bmatrix} U_{a} \\ U_{i} \\ U_{f} \\ U_{o} \end{bmatrix}, \quad b = \begin{bmatrix} b_{a} \\ b_{i} \\ b_{f} \\ b_{o} \end{bmatrix} $$
  • Step 2: The backward components:

    Step 2.1: Find \( \Delta_{t} \), the output difference as computed by any subsequent layer, and \( \Delta \text{out}_{t} \), the output difference as computed by the next time step. Then:

    $$ \delta \text{out}_{t} = \Delta_{t} + \Delta \text{out}_{t} $$
    (7)
    $$ \delta \text{state}_{t} = \delta \text{out}_{t} \odot o_{t} \odot \left( 1 - \tanh^{2} \left( \text{state}_{t} \right) \right) + \delta \text{state}_{t+1} \odot f_{t+1} $$
    (8)

    Step 2.2: This gives

    $$ \delta a_{t} = \delta \text{state}_{t} \odot i_{t} \odot \left( 1 - a_{t}^{2} \right) $$
    (9)
    $$ \delta i_{t} = \delta \text{state}_{t} \odot a_{t} \odot i_{t} \odot \left( 1 - i_{t} \right) $$
    (10)
    $$ \delta f_{t} = \delta \text{state}_{t} \odot \text{state}_{t-1} \odot f_{t} \odot \left( 1 - f_{t} \right) $$
    (11)
    $$ \delta o_{t} = \delta \text{out}_{t} \odot \tanh \left( \text{state}_{t} \right) \odot o_{t} \odot \left( 1 - o_{t} \right) $$
    (12)
    $$ \delta x_{t} = W^{T} \cdot \delta \text{gates}_{t} $$
    (13)
    $$ \delta \text{out}_{t-1} = U^{T} \cdot \delta \text{gates}_{t} $$
    (14)
  • Step 3: Update the internal parameters:

    $$ \delta W = \sum\limits_{t = 0}^{T} \delta \text{gates}_{t} \otimes x_{t} $$
    (15)
    $$ \delta U = \sum\limits_{t = 0}^{T} \delta \text{gates}_{t+1} \otimes \text{out}_{t} $$
    (16)
    $$ \delta b = \sum\limits_{t = 0}^{T} \delta \text{gates}_{t+1} $$
    (17)
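To make Eqs. (1)–(6) concrete, the following NumPy sketch implements a single forward step of the cell; the dictionary-based weight layout (keys a, i, f, o, shapes (h, d) for W, (h, h) for U, and (h,) for b) is an assumption made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, out_prev, state_prev, W, U, b):
    """One forward step of the LSTM cell, following Eqs. (1)-(6).

    W: dict of input weights {a, i, f, o}, each of shape (h, d)
    U: dict of recurrent weights, each of shape (h, h)
    b: dict of biases, each of shape (h,)
    """
    a_t = np.tanh(W["a"] @ x_t + U["a"] @ out_prev + b["a"])   # Eq. (1) input activation
    i_t = sigmoid(W["i"] @ x_t + U["i"] @ out_prev + b["i"])   # Eq. (2) input gate
    f_t = sigmoid(W["f"] @ x_t + U["f"] @ out_prev + b["f"])   # Eq. (3) forget gate
    o_t = sigmoid(W["o"] @ x_t + U["o"] @ out_prev + b["o"])   # Eq. (4) output gate
    state_t = a_t * i_t + f_t * state_prev                     # Eq. (5) internal state
    out_t = np.tanh(state_t) * o_t                             # Eq. (6) output
    return out_t, state_t
```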

5 Experiment

This section presents the results of each stage in the prediction model. Also, a justification is presented for all the results.

5.1 Dataset used

Data from the KDD Cup 2018 dataset* are used, containing the names of 35 stations and the hourly concentration of each pollutant, viz. PM2.5, PM10, SO2, CO, NO2, and O3. Table 4 presents the raw dataset.

Table 4 Dataset before handling missing values

The dataset is split by station and saved in separate files containing the name of each station. Each station file thus contains the six types of concentration and the number of records presented in Table 5.

Table 5 Split stations

Missing values in each column are then handled as illustrated in Table 6.

Table 6 Dataset after handling missing values

Table 7 presents a description of the results after handling missing values.

Table 7 Description of data after preprocessing

5.2 Data visualization

Figure 6 illustrates the resulting data, which contains various patterns occurring over time.

Fig. 6 Data visualization

This graph is already informative. The specific reason for choosing these data is that the graph presents a wide range of different behaviors in the concentrations of the air pollutants over time, which makes the learning process more robust and provides the opportunity to test the quality of the predictions in a variety of situations.

Another feature to notice is that the values at the beginning of 2017 are much higher and fluctuate more than the values close to the end of the dataset. Therefore, one must ensure that the data exhibit similar value ranges throughout the time frame, which will be considered during the data normalization phase.

5.3 Normalizing the data

Before normalizing, the dataset is split into a training set and a test set, using 70% for training and 30% for testing.

A scaler must now be defined to normalize the data. MinMaxScaler scales all the data to the range [0, 1]. One can also reshape the training and test data to have the shape [data_size, num_features].

Due to the observation above that different time periods of the data have different value ranges, one should normalize the data after splitting the full series into windows. Otherwise, the earlier data will be close to 0 and will not add much value to the learning process. Here, a window size of 2500 is chosen.

The data can now be smoothed using an exponential moving average, which helps to remove the inherent raggedness of the concentration data and produce a smoother curve. Note that only the training data should be smoothed in this way.
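A sketch of this windowed normalization and smoothing is shown below, assuming the training series is a one-dimensional NumPy array; the window size of 2500 follows the text, while the smoothing factor gamma is an illustrative choice.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

def normalize_in_windows(train: np.ndarray, window: int = 2500) -> np.ndarray:
    """Fit MinMaxScaler separately on each window so early, high-valued periods
    are not squashed toward zero by later, lower-valued ones."""
    scaler = MinMaxScaler(feature_range=(0, 1))
    out = np.empty_like(train, dtype=float)
    for start in range(0, len(train), window):
        chunk = train[start:start + window].reshape(-1, 1)
        out[start:start + len(chunk)] = scaler.fit_transform(chunk).ravel()
    return out

def ema_smooth(train: np.ndarray, gamma: float = 0.1) -> np.ndarray:
    """Exponential moving average to remove raggedness (training data only)."""
    smoothed = np.empty_like(train, dtype=float)
    ema = 0.0
    for i, value in enumerate(train):
        ema = gamma * value + (1.0 - gamma) * ema
        smoothed[i] = ema
    return smoothed
```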

5.4 Data generator

A data generator is first implemented to train our model, including a method called unroll_batches(…) that will output a set of num_unrollings batches of input data obtained sequentially, where each batch of data is of size [batch_size, 1]. Then, each batch of input data will have a corresponding output batch of data (Tables 8, 9).

Table 8 Dataset after normalization and splitting for training and testing
Table 9 Data generator to train our model

For example, if num_unrollings = 3 and batch_size = 6, a set of unrolled batches might look like (a minimal generator sketch follows this example):

  • input data: [x0, x10, x20, x30, x40, x50], [x1, x11, x21, x31, x41, x51], [x2, x12, x22, x32, x42, x52]

  • output data: [x1, x11, x21, x31, x41, x51], [x2, x12, x22, x32, x42, x52], [x3, x13, x23, x33, x43, x53]
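A minimal version of such a generator is sketched below; the unroll_batches(…) name follows the text, but the cursor-based indexing (one cursor per batch element, spaced one segment apart and advanced by one step per unrolling) is an assumption about how the batches in the example could be produced.

```python
import numpy as np

class DataGenerator:
    """Yields num_unrollings consecutive (input, output) batches of shape [batch_size, 1]."""

    def __init__(self, data, batch_size, num_unrollings):
        self.data = np.asarray(data, dtype=float)
        self.batch_size = batch_size
        self.num_unrollings = num_unrollings
        self.segment = len(self.data) // batch_size
        # One cursor per batch element, spaced one segment apart (x0, x10, x20, ... style).
        self.cursors = [i * self.segment for i in range(batch_size)]

    def next_batch(self):
        inputs = np.array([self.data[c] for c in self.cursors]).reshape(-1, 1)
        outputs = np.array([self.data[c + 1] for c in self.cursors]).reshape(-1, 1)
        self.cursors = [(c + 1) % (len(self.data) - 1) for c in self.cursors]
        return inputs, outputs

    def unroll_batches(self):
        return [self.next_batch() for _ in range(self.num_unrollings)]
```

For instance, with batch_size = 6 and 60 data points, the cursors start at x0, x10, …, x50, reproducing the unrolled batches shown above.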

Then, one finds the best weights of the input between the hidden layers as illustrated in Tables 10, 11, and 12. Meanwhile, the weights of the recurrent connections are illustrated in Tables 13 and 14.

Table 10 Weights of the input between input and first hidden layers
Table 11 Optimal weights of input between first and second layer
Table 12 Optimal weights of input between second and third layer
Table 13 Weights of input between third and output layer
Table 14 Weight of recurrent connections

After constructing the LSTM model using the DSN-PS algorithm, the model consists of several layers capable of predicting the concentrations of the air pollutants. We have 32 stations × 6 pollutants (PM2.5, PM10, SO2, CO, NO2, and O3), resulting in 192 readings per hour, 4608 per day, and 138,240 within the 30 days of the training process of the network. After the training, the DLSTM-RNN can predict air pollution concentrations over the next 48 h based on the previous training. The SMAPE error measure is then used to evaluate the results of the DLSTM network, seeking the smallest error (Table 15).
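Since SMAPE is the evaluation measure used here, a small implementation is sketched below; the exact variant used in the paper is not spelled out, so the common form with (|y| + |ŷ|)/2 in the denominator is assumed.

```python
import numpy as np

def smape(actual, predicted) -> float:
    """Symmetric mean absolute percentage error, in percent.

    Assumes the common form: mean of |y - y_hat| / ((|y| + |y_hat|) / 2);
    pairs where both values are zero contribute zero error.
    """
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    denom = (np.abs(actual) + np.abs(predicted)) / 2.0
    safe_denom = np.where(denom == 0, 1.0, denom)
    ratio = np.where(denom == 0, 0.0, np.abs(actual - predicted) / safe_denom)
    return float(100.0 * np.mean(ratio))
```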

Table 15 DLSTM-PS and SMAPE results

The combination of LSTM and PSO reduces the training time for the network, because the PSO algorithm provides the best activation function and identifies the number of hidden layers and the number of nodes in each hidden layer, as well as better weights; at the same time, however, it complicates the network, for the reason described above.

6 Discussion and conclusions

Air quality index datasets represent huge data, requiring intelligent and deep computation for the extraction of useful patterns. Despite the advantage of their large size, the limitations of such datasets include the possibility of missing values, the fact that each concentration may fluctuate between high and low values, and the fact that the number of records may differ between stations.

DSN-PS is used to determine the parameters and activation function of the DLSTM, offering the advantage of reduced execution time for the LSTM; its limitation is that it increases the complexity of the LSTM.

The DLSTM is constructed using the LSTM structure determined by DSN-PS, where PSO is used to determine the optimal number of hidden layers, number of nodes in each hidden layer, weights, biases, and activation function. The advantage of the DLSTM is its ability to deal with huge data and its use of memory cells to save information over the long term, while its limitation is the huge number of parameters.

Evaluation is the process of calculating the error between actual and predicted values, which can be achieved using different types of error measure, including prediction measures (i.e., SMAPE, MSE, RMSE, MAE, MAPE, etc.) and confusion-matrix-based measures (i.e., accuracy, F-measure, FP rate, etc.).

  • How can particle swarm optimization be useful in building a recurrent neural network (RNN)?

PSO gradually modifies the behavior of each particle in a particular environment, depending on the behavior of its neighbors, until the optimal solution is obtained. Neural networks, on the other hand, use trial and error to select their basic parameters, modifying them gradually to reach acceptable values for those parameters.

Based on these two principles, the PSO algorithm is used herein to find the optimal parameters and the activation function of the neural network.

  • How can a multilayer model be built by combining two technologies (LSTM-RNN with particle swarm optimization)?

By building a new predictor called SAQPM that combines DSN-PS and DLSTM, where DSN-PS is used to find the best structure and parameters for the LSTM, while the DLSTM is used to predict the concentrations of air pollutants.

  • Is the SMAPE measure enough to evaluate the results of the suggested predictor?

Yes. SMAPE is sufficient to evaluate the results of the predictor for the next 48 h.

  • What is the beneficial result of building a predictor by combining DSN-PS and DLSTM?

Combining DSN-PS and DLSTM reduces the execution time by predefining the network parameters, but at the same time increases the computational complexity.