Keywords

1 Introduction

Air pollution is a serious environmental problem across major cities in India. The problem of air pollution has significant health impacts on human and the environment [1, 2]. Atmospheric pollution sourced by industrial activities and congestion of roads and traffic is of principal concern. Air pollution concentration is mainly due to various combination of pollutant and their physiochemical interactions or processes with other components in the atmosphere, properties of earth surface and geometry [3]. PM10, SO2 and NO2 are the three main pollutants responsible for the degradation of the ambient air quality. These pollutants exert a wide range of impacts on biological, physical, and economic systems, especially, effects on plant and human health are of particular concern. Worldwide there are more deaths due to poor air quality than from automobile accidents [4]. Air pollution is a serious problem in urban areas, expose to higher air pollutant concentrations for longer period may increase the risk of asthma, respiratory and cardiovascular systems, cancer and mortality. Therefore, there is an urgent need to development an efficient forecasting system to provide air quality information to the general public. In recent years statistical methods have been used for air pollution predictive models which are not capable of predicting short term pollution levels. Regression modelling is most popular statistical approach has been used to develop air quality predictive models in number of studies [5]. Linear regression models have good predictability with linear process. They will underperform when we try to model nonlinear processes. Therefore, artificial neural networks (ANN) approach is the best as compared with statistical linear methods, especially where the problem being analyzed includes nonlinear behavior [6]. ANN models have been used for the forecasting a wide range of pollutants and their concentrations at various time scales, with very good results [7, 8]. The modelling tools are widely used in many scientific fields, especially in environmental sciences and have been widely applied for modelling air pollutant concentrations with the aim to forecast them. ANN modelling is the best tool for nonlinear relationships. Their performance capability is superior when compared to other statistical methods [9, 10]. The literature showed that fairly good estimates can be achieved by different models developed using ANN. The most common structure of the neural network is the “feed forward” where the data flow from input to output units is strictly feed forward and are able to find and identify complex patterns in datasets which may not be well described by a simple mathematical formula or a set of known processes [11]. The neural network approach is applied for highly complex pattern recognition and to solve the problem in presence of noisy dataset [12, 13]. ANN models performed better when they combined with traditional deterministic modelling approach [14]. In this paper, the back propagation algorithms are used to model and predict the daily average concentrations of PM10, NO2 and SO2 using meteorological variables (Input).

1.1 Artificial Neural Network (ANN)

ANNs are computing systems motivated by biological models, and made up of a number of easy and highly interconnected processing components, which process information by its dynamic state response to external inputs. The processing components called neurons are organized into interconnected layers [14,15,16]. Multilayer perceptron (MLP) was introduced by Rumelhart in 1986. It is most widely used and composed of three layers of neurons. Several neurons are organised into input layer, hidden layer and the output layer. The input layers take the value from the model input and serves to pass the values to the hidden layer as shown in Fig. 1. In this study, feed forward ANN was used to predict air pollutant concentrations based on different meteorological variables. The number of hidden layers were selected by using trial and error procedure varying for 8–12 hidden layers in the network structure was examined in order to check the effectiveness of network predictability.

Fig. 1
figure 1

Architecture of multilayer neural network

2 Study Area

Figure 2 shows satellite view of Belagavi city highlighted with four monitoring stations, is located at 15.87 °N 74.5 °E, with an average elevation of 751 m. The city is in the north western parts of Karnataka state and is along the border of two states, Goa and Maharashtra. Belagavi is a district head quarter and officially known as second capital of Karnataka State, India.

Fig. 2
figure 2

Satellite picture of Belagavi city showing four monitoring sites

Belagavi is known for its foundry Hub. More than 200 foundries are producing automotive and industrial castings of ferrous base and supporting ancillaries and one of the largest alumina producing company is located at Belagavi city. Vehicle population and traffic, industrial emissions are the major sources of pollution in the Belagavi atmosphere, a problem that has been annoyed by the drastic increase in the number of mobile sources from last five years. Therefore, there is an urgent need for the assessment and evaluation of air quality in Belagavi city. It has been found that air pollutant concentrations of PM10, SO2, and NO2 distribution is strongly affected by meteorological factors.

2.1 Data Sets

The meteorological data temperature (in °C), wind speed (in mps), wind direction (in degrees), relative humidity (in %) and rain (in mm) were collected from meteorological station located at Sambra, Belagavi city. Table 1 shows the mean, standard deviation, minimum and maximum values of meteorological parameters and pollutant concentrations for the year 2011–2013. The mean annual temperature is around 24.65 °C, annual relative humidity is between 17.15 and 99.07% and prevailing wind direction NNE to NNW and annual wind speed is between 2 and 13.73 mps and heavy rain occurs in the period of June to September is between 0 and 9.23 mm.

Table 1 Daily average means, standard deviation, minimum and maximum values of Meteorological parameters and pollutant concentrations of PM10, SO2 and NO2

The site characteristics are different resulting industrial (Autonagar and Udyambag), traffic (Railway station) and residential (Vadgaon). Air quality monitoring sites were classified as industrial, traffic and residential sites and the air pollution monitoring was carried for the period of 2011–2013. Respirable dust sampler RDS (Envirotech APM 460 NL) was used to monitor and measure the concentration of PM10 (µg/m3) and Gaseous pollutant sampler (Environtech APM 433) was used to measure NO2 (µg/m3) and SO2 (µg/m3) concentrations using suitable reagents. The data sets of meteorological parameters and pollutant concentrations are used on their daily average for the period of 2011–2013.

We had problems in the data sets mainly outlier and missing data. The outlier was due to the malfunction of instrument or incorrect measurement of the pollutant. The outliers are maximum or minimum values of data. They are analysed with care, because they cause more deviation in the model development and prediction. Missing data was due to instrument calibration or malfunctions and this problem was very limited (2%), these gaps are filled by linear interpolation method [16]. In order to support the neural network to efficiently handle a data, all the input variables are normalised to the range (0, 1) by Eq. 1.

$$ X_{n} = \frac{{(X_{i} - X_{\hbox{min} } )}}{{(X_{\hbox{max} } - X_{\hbox{min} } )}} $$
(1)

X n is the normalised data, X i actual measured data, X min minimum value of the measured data, X max maximum value of the measured data.

2.2 ANN Modelling

The ANN models were developed by using the MATLAB R2015a software from Mathworks group Inc. The feed forward Back propagation (BP) multilayer perceptron (MLP) network model was used for the present study. The BP algorithm is used to train a given feed-forward multilayer neural network for a known set of input patterns with known classifications. The BP algorithm is based on Widrow-Hoff delta learning rule which is based on weight adjustment through mean square error of the output to the sample input [17]. Therefore, BP networks are the simplest and most widely used network models [18]. The meteorological variables (inputs) and pollutant concentrations (outputs) are divided into training, validation and testing subsets. One third data was used for validation, one third data was used for testing and two third of the data was used for the training set. The neuron number in hidden layer has the significant importance for the model development and its accuracy and performance. Another important step in model development is determination of activation function. The most widely used activation functions are liner, sigmoid and hyperbolic tangent. The optimization of one hidden layer was conducting by several tests with various network structures, optimised networks have selected based on lower prediction error and smaller convergence times. Present study applied back propagation network with three layers to predict air pollutant concentrations. Selected network structure was used and trained after definition of subset with Levenberg-Marquardt optimization (trainlm) in hidden layer (nodes of 10, 12, 18), log-sigmoid (logsig) transfer function is used in output layer [19, 20].

2.3 Evaluation of Model Performance

Evaluation of the performance of developed model is very important and is important to evaluate forecast accuracy. We have selected statistical indicator to describe goodness of the estimates. The accuracy of model was determined by considering how well a model performed with new data which are not used in model fitting. The model performance was checked using Mean Absolute Error (MAE), and Root Mean-Square Error (RMSE) and correlation Coefficient (R) are given by Eqs. (2)–(6).

$$ MSE = \frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{i} - X_{i} } \right)^{2} } }}{n} $$
(2)
$$ MAE = \frac{{\sum\nolimits_{i = 1}^{n} {\left| {Y_{i} - X_{i} } \right|} }}{n} $$
(3)
$$ RMSE = \sqrt {\frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{i} - X_{i} } \right)^{2} } }}{n}} $$
(4)
$$ MAPE = \sqrt {\frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{i} - X_{i} } \right)^{2} } }}{n}} $$
(5)
$$ R = \frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{i} - \overline{{Y_{i} }} } \right)\left( {X_{i} - \overline{{X_{i} }} } \right)} }}{{\left\{ {\left[ {\sum\nolimits_{i = 1}^{n} {\left( {X_{i} - \overline{{X_{i} }} } \right)^{2} } } \right]\left[ {\sum\nolimits_{i = 1}^{n} {\left( {Y_{i} - \overline{{Y_{i} }} } \right)^{2} } } \right]} \right\}^{1/2} }} $$
(6)

where, n is the number of data, Y i is the modeled pollutant concentration, X i is the observed concentration. Zero Error indicates that all the modeled concentrations of various pollutants computed by ANN models were perfectly match the observed concentrations.

3 Results and Discussions

Overall, the annual average concentrations of PM10 were above the national ambient air quality standard (NAAQS) at Autonagar and Railway station and is almost at alarming stage at Udyambag and Vadgaon. PM10 concentrations are mainly due to construction activity, industrial activity and bad road conditions. The annual average NO2 and SO2 concentrations were below NAAQS standards. The concentrations of NO2 are higher on traffic sites (Railway station) and industrial area (Autonagar and Udyambag) and it confirms industries and traffic as important sources of NO2 in Belagavi city. SO2 concentration is very low (negligible) and is mainly due to diesel vehicles and old vehicles, industrial burning and commercial burning of various fuel oils.

The optimisation of a neural network is most important objective to developed ANN based models [21]. The process of optimization plays an important role in the selection and performance of the network. Hence, an optimisation was carried out with number of neurons and MSE [19]. Then, the multilayer layer neural network was evaluated using BP algorithm with 10, 12 and 18 nodes in the hidden layer. With increasing in neuron number, the network gave several local minimum values and different MSE values were obtained for the training set. Increasing neuron number to more than 20 gave unrealistic results for all the pollutants.

The various performance indicators were used to determine to measure the goodness of the fit and the results of ANN model are summarized in Table 2. The best performing ANN network was trainlm in the hidden layer. The results shows excellent performance for the developed models for PM10 for all four monitoring sites are optimised with nodes of 10, 10, 18 and 12 according to values of MAPE was found to be 7.2, 6.2, 10.8, and 7.7%. Good results were obtained for NO2 with nodes of 18, 12, 10, 10 with values of MAPE are 12.1, 3.5, 8.6, 8.7%. Similarly, good performance by the developed network of SO2 for all monitoring sites using nodes of 12, 12, 18 and 10 with MAPE of 2.0, 7.1, 3.6 and 8.6%. The smallest MSE was obtained for trainlm function and the mean absolute percentage error was used to select the models. The following models have performed best based on the change in number of hidden nodes and keeping the logsig transfer function. The training stopped after 8, 9 and 9 iterations for PM10, The training stopped after 10, 11 and 9 iterations for SO2, for NO2, training stopped after 6, 7 and 8 iterations.

Table 2 Performance indices for the testing data for four monitoring sites

Figure 3 shows observed and predicted PM10, NO2 and SO2 concentrations using developed ANN models. It demonstrates that these models can estimate air pollutant concentration for the given set with an accuracy of approximately 90%. As discussed earlier the used data set was divided into two subsets, the first set was included data from the period of 2011–2012 and 2013 data was totally unknown data for the model was used to evaluate the forecasting ability of the developed model is shown in Fig. 4. According to Fig. 4, it seems that the prediction of all developed models is in a very satisfactory level (p < 0.05). The performance of developed models for training data set and performance in terms R are shown in Fig. 4. The results indicate that developed ANN models predicated the PM10 with good accuracy of 95.92, 85.50, 85.11 and 90.15%, NO2 with 89.82, 89.27, 92.49 and 93.20%, for SO2 with excellent accuracy of 98.77, 88.47, 95.68 and 89.16% for the monitoring sites Autonagar, Railway station, Udyambag and Vadgaon.

Fig. 3
figure 3

Measured and predicted concentrations of PM10, SO2 and NO2 for training data

Fig. 4
figure 4

ANN outputof training data set for PM10 (Autonagar), NO2 (Vadgaon) and SO2 (Railway station)

4 Conclusion

The optimization study was done for better ANN mode with different structures in terms of hidden layer and number of nodes. Three layer model used for the present study showed a precise and effective prediction of air pollutant concentrations. The trainlm has given good prediction with relatively good accuracy. For the air pollution predictive the ANN models have faster predictive power and are viable as compared to other statistical models. The performance of the model in all four locations was satisfactory and is able to efficiently simulate the atmospheric time-series pollutant concentrations. ANN models can be considered as appropriate for operational usage in urban air quality management. The models discussed in this study are easily implemented and are helpful to the local authorities in providing the information to the general public and to protect the health of people by implementing appropriate controlling measures.