Forecast of daily PM2.5 concentrations applying artificial neural networks and Holt–Winters models

Ventura, Luciana Maria Baptista; de Oliveira Pinto, Fellipe; Soares, Laiza Molezon; Luna, Aderval S.; Gioda, Adriana

doi:10.1007/s11869-018-00660-x

Forecast of daily PM_2.5 concentrations applying artificial neural networks and Holt–Winters models

Published: 07 January 2019

Volume 12, pages 317–325, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Air Quality, Atmosphere & Health Aims and scope Submit manuscript

Forecast of daily PM_2.5 concentrations applying artificial neural networks and Holt–Winters models

Download PDF

Luciana Maria Baptista Ventura^1,2,
Fellipe de Oliveira Pinto²,
Laiza Molezon Soares³,
Aderval S. Luna³ &
…
Adriana Gioda ORCID: orcid.org/0000-0002-5315-5650¹

971 Accesses
48 Citations
Explore all metrics

Abstract

Fine particulate matter (PM_2.5) has been considered one of the most harmful atmospheric pollutants to the health. PM_2.5 has as its main origin vehicular emissions, a characteristic source in megacities. In order to predict pollution episodes in different areas (rural, industrial, and urban), two models were applied, Holt–Winters (HW) and artificial neural network (ANN), using PM_2.5 concentration time series. PM_2.5 samples were collected using Hi-Vol samplers during a period of 24 h, every 6 days, from January 2011 to December 2013, in Rio de Janeiro, Brazil. Meteorological data was also obtained for use in the models. The PM_2.5 dataset was the longest obtained for this megacity and the Holt–Winters (HW) model was used, for the first time, to predict air quality. The results of the PM_2.5 data series showed daily concentrations ranging from 1 to 65 μg m⁻³. The root mean square error (RMSE) was calculated for each model for the three sites. The HW model best explained the simulation of PM_2.5 in the industrial area, since it presented the lowest RMSE (5.8 to 14.9 μg m⁻³). The ANN was the most appropriate model for urban and rural areas with RMSE between 4.2 to 9.3 μg m⁻³. Overall, both forecast models proved accurate enough to be considered useful tools for air quality management and can be applied in other world regions.

A neural network-based method for modeling PM 2.5 measurements obtained from the surface particulate matter network

Article 12 April 2021

Applying Artificial Neural Networks to Short-Term PM2.5 Forecasting Modeling

Machine Learning Based PM 2.5 and 10 Concentration Modeling for Delhi City

Article 24 August 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Fine particle (PM_2.5) concentrations in the air contribute to harmful health effects (Rodríguez-Cotto et al. 2014). In November 2019, the Brazilian Environmental Council (CONAMA 2018) updated air quality standards, including the standard for PM_2.5. This update, however, will be implemented in steps. Initially, the intermediate standard for PM_2.5 will be 20 μg m⁻³ per year and 60 μg m⁻³ in 24 h. The final standard will meet the World Health Organization (WHO) recommendations, i.e., 10 μg m⁻³ or 25 μg m⁻³ per day. In this study, we will compare PM_2.5 levels with the guidelines recommended by the WHO.

Rio de Janeiro state has the oldest air quality monitoring network in Brazil and one of the oldest in Latin America, having been in operation since the 1960s (Gioda et al. 2016). However, only in 2010 did the Rio de Janeiro network start PM_2.5 monitoring (Ventura et al. 2017a, b).

Due to the mortality and morbidity caused by PM_2.5, air pollution controls have become urgent (Liu and Peng 2018; Pope et al. 2018). Making predictions based on time series prediction techniques is fundamental to the analysis and support needed for environmental agencies to make decisions (Relvas and Miranda 2018; Mehdipour et al. 2018).

Statistical models of air quality forecasting have been sparsely used in South America to predict critical pollution episodes, hampering the ability to control emissions on critical days (Perez 2012). There are many statistical models available to predict air pollutant concentrations, such as principal component analysis with multiple linear regression (PCA-MLR) (e.g., Ul-Saufie et al. 2013), autoregressive integrated moving average (ARIMA) (e.g., Díaz-Robles et al. 2008), nearest neighbor model (NNM) (e.g., Perez 2012), and artificial neural network (ANN) (e.g., Perez and Reyes 2006; Chattopadhyay and Chattopadhyay 2012; Luna et al. 2014). Among these models, artificial neural networks have proven to be useful and effective to predict PM_2.5 concentrations in local scale (Perez et al. 2000; Mckendry 2002; Ordieres et al. 2005; Thomas and Jacko 2007; Voukantsis et al. 2011). On the other hand, the Holt–Winters (HW) model, which is a seasonal time series prediction technique widely known and used (Dantas et al. 2017), was never applied to predict air quality levels. According to Tratar and Strmčnik (2016), the Holt–Winters model is very simple and robust. This model has been used to predict sales (Ribeiro et al. 2017), energy demand (Tratar and Strmčnik 2016), food industry (Veiga et al. 2014), air transportation demand (Dantas et al. 2017), traffic in cities (Baghyasree et al. 2014), tourism (Lim et al. 2009), power generation (Muche 2014), and solid waste generation (Bezerra 2006). This method allows modeling data through an average, a trend and a seasonal factor, all of which are updated by an exponential smoothing (Winters 1960).

This study aims to evaluate two air quality forecasting models (Holt–Winters [HW] and artificial neural networks [ANN]) using time series of PM_2.5 concentrations from three different areas (rural, urban, and industrial) in the metropolitan region of Rio de Janeiro, Brazil. It is the longest time series data of PM_2.5 obtained in Rio de Janeiro and one of the longest in South America. According to our knowledge, this is the first study developed in a tropical region of South America applying ANN and HW models to predict PM_2.5 daily concentrations.

Material and methods

Sampling

The metropolitan region of Rio de Janeiro (MRRJ) has different air pollution sources associated with a complex topography that inhibits air mass movement and consequently affects pollutant dispersions (Ventura et al. 2017a, b).

For this study, three sampling sites in MRRJ were selected: (1) Seropédica (22° 45′ 28.14 S/43° 415.85″ W), a region with rural characteristics; (2) Taquara (22° 56′ 58.34″ S/43° 21′ 33.94″ W), an urban area with high population concentration, heavy traffic and some pharmaceutical industries; and (3) Duque de Caxias (22° 40′ 26.50″ S/43° 17′ 12.99″ W), the main industrial area in MRRJ, composed by many petrochemical industries, a power plant, and a refinery, in addition to heavy traffic.

PM_2.5 samples were collected by the Environment Institute of Rio de Janeiro State (INEA) from January 2011 to December 2013, including rain days, totaling 180 samples per site. The samples were collected in glass fiber filters using Hi-Vol samplers (AGVMP252/Energy, Brazil) with a flow rate of 1.14 m³ min⁻¹, simultaneously at all sites, every 6 days, for 24 h (NBR 13412 method, similar to ASTM-D 4096 method).

Meteorological data

Meteorological data were measured every 15 min by public agencies (INMET and INEA) from January 2011 to December 2013. Meteorological variables were temperature (T), relative humidity (RH), wind speed (WS), and atmosphere pressure (P). Although solar radiation is a good meteorological parameter to be used for modeling, it was not monitored at all sites; therefore, this parameter was not used in the models. Wind direction was also not used because several studies did not show significant correlation with PM_2.5 (Luna et al. 2014; Ventura et al. 2017a, b; Mehdipour et al. 2018).

Descriptive statistics for these variables are presented in Table S1 in Supplementary Material.

Prediction of PM_2.5 concentration applying Holt–Winters model

Holt–Winters is an exponential prediction model used since the 1960s to predict the linear trend and seasonality of time series, and it is an extension of exponential smoothing method. The HW model uses a modified form of exponential smoothing, and it applies three exponential smoothing formulae. The mean is smoothed to give a local average value for the series (Eq. 1). The trend is smoothed, and lastly each seasonal sub-series is smoothed separately to give a seasonal estimate for each season (Eq. 2). The exponential smoothing formula is applied to a series with a trend and constant seasonal component using the HW additive or multiplicative methods (Eq. 3).

$$ {L}_t=\alpha \left({Y}_t-{S}_{t-s}\right)+\left(1-\alpha \right)\left({L}_{t-1}+{b}_{t-1}\right) $$

(1)

$$ {b}_t=\beta \left({L}_t-{L}_{t-1}\right)+\left(1-\beta \right){b}_{t-1} $$

(2)

$$ {S}_t=\gamma \left({Y}_t-{L}_t\right)+\left(1-\gamma \right){S}_{t-s} $$

(3)

Where α, β, and γ are the smoothing parameters, at is the smoothed level at time L_t, b_t is the change in the trend at time t, S_t is the seasonal smoothing parameter at time t, and s is the number of seasons per year.

An additive model is used when the amplitude of seasonal variation remains constant, and a multiplicative model is used when the amplitude of the seasonal variation increases in time (Winters 1960). The Holt–Winters additive model was used in this study, and it was calculated as the sum of the level components, trend, and seasonality (Eq. 4).

$$ {F}_{t+m}=\left({L}_t+{b}_{t-m}\right){S}_{t-s+m} $$

(4)

All data values in a series contribute to the calculation of the prediction model (Winters 1960). To optimize estimative smoothing parameters (α, β, and γ), the lowest mean squared error (MSE) was adopted. The seasonality used was four seasons during the year. The training set used was the first 173 observations of PM_2.5 time series. On the other hand, the tests were performed with the last 5 and last 10 observations registered in the PM_2.5 time series to estimate the next 5 or 10 PM_2.5 concentrations, respectively.

Prediction of PM_2.5 concentrations by applying artificial neural network

Artificial neural networks were originally developed to mimic basic biological neural systems, which are composed of neurons. An artificial neuron is a mathematical structure that seeks to simulate the shape, behavior, and functions of a biological neuron. An artificial neural network corresponds to a set of artificial neurons separated in layers (input, hidden, and output) (Chattopadhyay and Chattopadhyay 2012; Relvas and Miranda 2018). According to Luna et al. (2014), this model was successfully used for other authors to predict air quality levels.

To apply the artificial neural networks, it was necessary to set the lag number (input variables) and the neuron number in each hidden layer. Lag numbers were defined by the autocorrelation function (ACF) selecting the last number of autocorrelation that exceeds the confidence level of 95% (Fig. S1, Suppl. Material). To apply the ANN, the validation set was not used and the training was conducted with 1000 interactions. Applying the descent gradient method by batch, using as the training, the 173 PM_2.5 concentrations initial dataset without the last 10 observations of each input variable for the performance of the tests with the next 5 and 10 PM_2.5 concentrations predicted. Due to the low number of input variables, topologies (neurons quantity in the hidden layer) equal to 1 were used for Taquara and Seropédica and equal to 2 for Duque de Caxias.

Prediction of PM_2.5 concentrations by applying ANN associated to meteorological parameters

To input in prediction model, the meteorological variables were transformed in daily means. The lowest MSE was used as the criterion for the choice of topology (Table S2, Suppl. Material). The MSE averages for Taquara, Seropédica, and Duque de Caxias were based on 100 initializations of the network calculated for the next 10 PM_2.5 concentrations predicted, considering the input variables and different topologies. The results indicated that the ideal number of the topology is 2 for Taquara and Seropédica and 4 for Duque de Caxias.

Comparison between the prediction models

After the model is specified, its performance characteristics should be verified or validated by comparing its forecasts with historical data using accuracy measures. Root mean square error (RMSE) results and the prediction of the next 5 and 10 PM_2.5 concentrations were adopted to compare the PM_2.5 concentrations predicted in MRRJ by Holt–Winters and artificial neural networks models. Furthermore, to improve the assessment of the forecasting models, determination coefficients (R²) were generated from estimated values against the real values for each model, separated by wet and dry season and for each sampling year (2011–2013).

All statistical analyses were performed using statistical computing free R Language (R Development Core Team 2014).

Results and discussion

PM_2.5 concentrations

Table 1 shows the maximum, minimum, and annual average of PM_2.5 concentrations from 2011 to 2013, and the number of times that concentrations exceeded the daily guideline recommended by the WHO (25 μg m⁻³). In Seropédica, a rural area, PM_2.5 average concentrations were the lowest registered (10–11 μg m⁻³). These averages were slightly above the annual limit recommended by the WHO (10 μg m⁻³) and only 2 exceedances occurred.

Table 1 Maximum, minimum, and average PM_2.5 concentrations (μg m⁻³) and the exceeding number of the air quality guidelines from the World Health Organization (WHO 2006)

Full size table

The monitoring data from Duque de Caxias revealed that from 2011 to 2013, the annual average PM_2.5 concentrations had similar behavior (18–20 μg m⁻³) in exceeding the WHO limit, with 9–13 exceeding days. The maximum concentration in this area was two times higher than in Seropédica in 2013. These high concentrations are related to the proximity of an industrial complex and roads with heavy traffic.

Compared to other sites, Taquara presented the highest concentrations in 2011 (30 μg m⁻³) and 2012 (23 μg m⁻³). In 2011, it registered the highest number of exceedances (35) of the daily standard—three times higher than others in the study. However, as a positive fact, the annual average PM_2.5 concentrations suffered a 23% reduction from 2011 to 2013. These results were similar to the ones observed by Godoy et al. (2009) and Ventura et al. (2017a, b), which also indicated Taquara as the place with the worst air quality among sites analyzed in MRRJ.

Comparing the three sites, it can be observed that the annual average of PM_2.5 concentrations was higher in Taquara and lower in Seropédica. These results showed that vehicle emissions are the main source of fine particles in MRRJ, with Taquara as most representative of urban areas.

Predictions of PM_2.5 concentration

Holt–Winters model

The Holt–Winters model was applied in the time series of PM_2.5 concentrations monitored from January 2011 to December 2013 in the rural, urban, and industrial areas (Fig. 1). The estimations using the HW model explained very well the central results (median) observed for all sites. However, when taking into account the seasonality and the linear trends, this model was not able to explain PM_2.5 concentration peaks in Seropédica (Fig. 1a). Nonetheless, for data from Duque de Caxias, the model estimated successfully (Fig. 1c). The effectiveness of this model for the industrial region was due to the fact that emissions are continuous, so seasonality is an extremely important factor in determining the success of this prediction model.

For a method to be adequate for predictions, estimating the concentration peaks is extremely important, because peak days indicate critical episodes of air pollution, demanding that authorities take action to control the emission source and minimize future concentrations. The HW model showed itself to be weak at rendering this kind of estimation in rural and urban areas.

The HW model did not predict satisfactorily PM_2.5 concentrations from urban areas. According to Winters (1960), this model is not suitable for estimating variables with cyclical frequencies, which is what occurs in cities, due to the cycle of vehicle circulation, which can vary during the weekends and holidays.

Artificial neural network model

The same time series for PM_2.5 concentrations applied to the HW model was used in artificial neural networks (ANN1) in the industrial, rural, and urban areas (Fig. 2). For urban and rural areas, ANN1 estimated the PM_2.5 concentrations better than the HW model, as can be seen in Fig. 2a, b, where the values observed corresponded well to the values estimated. However, for the industrial area (Fig. 2c), the model was inaccurate.

In industrial areas, PM_2.5 concentrations have a more constant profile than in urban areas, due to the fact that emissions in industrial areas follow a certain linearity while urban areas have a more cyclical profile due to irregular traffic emissions. Besides, any changes in the daily activity of the region, such as traffic accidents or civil work, can influence the monitored concentrations.

Artificial neural network with meteorological variables

It is well established that air quality relies strongly on meteorological conditions. Therefore, the effects of different meteorological variables are already implied in the structure of the time series of a determined pollutant, such as PM_2.5. Due to the complexity of the correlation and also because of the presence of noise, an explicit consideration of variable effects, such as temperature, wind speed, and relative humidity, can yield a better prediction of particle concentrations (Perez et al. 2000; Luna et al. 2014). Therefore, the time series of PM_2.5 concentrations measured from 2011 to 2013, together with meteorological data, were evaluated by artificial neural networks (ANN2) (Fig. 3).

The artificial neural network model, when associated to meteorological data, improved at estimating PM_2.5 concentrations in urban and rural areas (Fig. 3a, b). An increase in information can augment the interpretation of the artificial neural network model because it facilitates synapse learning. Many studies (e.g., Ordieres et al. 2005; Thomas and Jacko 2007; Perez 2012) have already indicated that the adequate choice of variable access could be the most important step in statistical modeling, considerably increasing the predictive power of models when local meteorological variables are added. On the other hand, this model remained insufficient to explain the concentrations observed in the industrial area (Fig. 3c).

Predicted models accuracy

In order to evaluate numerically the accuracy of the models (HW, ANN1, and ANN2), the root mean square error (RMSE) was used for the stages of simulation (train) and the estimation of the next 5 and 10 PM_2.5 concentrations (Table 2). For the three forecasting models, the train RMSE varied from 3.6 to 11.1 μg m⁻³ and the prediction RMSE ranged from 4.2 to 14.9 μg m⁻³. The lowest RMSE in the train simulation are justified, since the more information number to be input for the model about study variation, the better its response function will be.

Table 2 Root mean square error (RMSE) applied to PM_2.5 concentrations (μg m⁻³) using artificial neural networks and Holt–Winters models

Full size table

For future concentrations, the estimated RMSE were always higher for the next 10 PM_2.5 concentrations. However, the difference in the prediction ranged only from 4 to 11% in relation to the forecast of the next 5 PM_2.5 concentrations.

Many researchers found that the meteorological conditions input into the ANN model improved the results’ precision (e.g., Thomas and Jacko 2007; Perez et al. 2000). In this study, a 20–30% reduction was observed in the RMSE when meteorological data was added to ANN models. Furthermore, RMSE results for ANN2 were 37% to 62% smaller than for the HW model. This was possible because meteorological variables introduce seasonal information, improving the generalization. According to Ospina and Zamprogno (2003), the ANN model reports a better performance during long periods of time, because it adjusts more quickly to structural changes through time.

Studies using ANN to preview atmospheric particulate matter concentrations found RMSE between 4 and 37 μg m⁻³ (Perez et al. 2000; Mckendry 2002; Ordieres et al. 2005; Thomas and Jacko 2007; Voukantsis et al. 2011). Therefore, the verified results in these studies were very similar to the smallest RMSE observed in previous ones.

Predicted models assessment

In the rural area, the three models evaluated presented a determination coefficient (R²) above 0.9 (Fig. 4), which shows that both are good PM_2.5 estimators for this type of zone. When evaluated to determine whether the estimates were influenced by sampling years or season (Table 3), it was verified that all the models maintained steady performance.

Table 3 Coefficient of determination (R²) of PM_2.5 concentrations (μg m⁻³) observed against estimated for each area by year and period

Full size table

Artificial neural network models were more appropriate to estimate PM_2.5 concentration in urban areas, with an R² higher than 0.95 (Fig. 5), regardless of sampling years or whether it was wet or dry season (Table 3). The Holt–Winters model proved ineffective for the prediction of this pollutant in this zone, since its R² did not reach 0.7.

It is possible to verify in Fig. 6 that the model that best estimated PM_2.5 concentrations in the industrial area was the Holt–Winters (R² = 0.83). The artificial neural network models presented a low determination coefficient (R² < 0.7). When the prediction by sampling years was evaluated, the R² was verified at 0.84 ± 0.05. However, it can be seen in Table 3 that the wet season, which is from November to March, hinders the prediction of the HW model, since the concentrations fluctuate more in their values, due to the concentration drops on rainy days. This same behavior was also repeated in the urban area.

It is noteworthy in Table 3 that the addition of daily meteorological information did not significantly improve (< 3%) the artificial neural network model’s (ANN2) performance when compared to the model without that information (ANN1) in rural and urban areas, where ANN proved most useful.

Conclusion

The use of predictive models, such as Holt–Winters and artificial neural networks, constitutes powerful tools to help make decisions about air quality management. The models aid in the anticipation of air pollution critical episodes, e.g., with regard to fine particulate matter (PM_2.5). These models, when applied using different observed data in Brazil, reported a good accuracy, with RMSE ranging from 4.2 to 14.9 μg m⁻³. Overall, both models have enough precision to be considered useful tools in air pollution management by environmental agencies, allowing those agencies to warn the population about future adverse conditions. Moreover, they will help to implant palliative control actions to avoid critical episodes previously predicted.

The Holt–Winters model, though not previously used for air quality prediction, proved efficient at forecasting PM_2.5 concentrations in industrial and rural areas where emissions are relatively constant throughout the year. However, it has been shown inadequate in areas with seasonal influences, such as wet periods, due to the fluctuation of concentrations on rainy days.

The artificial neural networks models achieved consistent predictions of PM_2.5 concentrations in urban and rural areas, as their predictive power is not subject to cyclical influences. However, the input of meteorological variables into the artificial neural network model was expected to improve the modeling result in estimating PM_2.5 concentrations, but this was not verified since the R² did not increase by more than 5%.

References

Baghyasree T, Kumar P, Janakiraman K, Seethalakshmi R (2014) Real-time variable bit rate video traffic using a simple and efficient prediction approach. World Appl Sci J 29:48–52. https://doi.org/10.5829/idosi.wasj.2014.29.dmsct.9
Article Google Scholar
Bezerra C (2006) Evaluation of Holt-Winters models in the solid residua forecasting: a case study in the city of Toledo – PR. In: Third International Conference on Production Research – Americas’ Region, August 2006, Curitiba, Brazil
Chattopadhyay S, Chattopadhyay G (2012) Modeling and prediction of monthly total ozone concentrations by use of an artificial neural network based on principal component analysis. Pure Appl Geophys 169:1891–1908. https://doi.org/10.1007/s00024-011-0437-5
Article Google Scholar
CONAMA (2018) Padrões de qualidade do ar, RESOLUÇÃO Nº 491, DE 19 DE NOVEMBRO DE 2018, Brazilian Environmental Council, Brasília-DF
Dantas TM, Oliveira FLC, Repolho HMV (2017) Air transportation demand forecast through bagging Holt Winters methods. J Air Transp Manag 59:116–123. https://doi.org/10.1016/j.jairtraman.2016.12.006
Article Google Scholar
Díaz-Robles L, Ortega J, Fu J, Reed G, Chow J, Watson J, Moncada-Herrera J (2008) A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: the case of Temuco, Chile. Atmos Environ 42:8331–8340. https://doi.org/10.1016/j.atmosenv.2008.07.020
Article CAS Google Scholar
Gioda A, Ventura LMB, Ramos MB, Silva MPR (2016) Half century monitoring air pollution in a megacity: a case study of Rio de Janeiro. Water Air Soil Pollut 227:86–103. https://doi.org/10.1007/s11270-016-2780-8
Article CAS Google Scholar
Godoy ML, Soluri D, Godoy JM, Roldão LA, Donagemma R (2009) Coarse and fine aerosol source apportionment in Rio de Janeiro, Brazil. Atmos Environ 43:2366–2374. https://doi.org/10.1016/j.atmosenv.2008.12.046
Article CAS Google Scholar
Lim C, Chang C, Mcaleer M (2009) Forecasting h(m)otel guest nights in New Zealand. Int J Hosp Manag 28:228–235. https://doi.org/10.1016/j.ijhm.2008.08.001
Article Google Scholar
Liu JC, Peng RD (2018) Health effect of mixtures of ozone, nitrogen dioxide, and fine particulates in 85 US counties. Air Qual Atmos Health 11:311–324. https://doi.org/10.1007/s11869-017-0544-2
Article CAS Google Scholar
Luna A, Paredes M, Oliveira G, Correa S (2014) Prediction of ozone concentration in tropospheric levels using artificial neural networks and support vector machine at Rio de Janeiro, Brazil. Atmos Environ 98:98–104. https://doi.org/10.1016/j.atmosenv.2014.08.060
Article CAS Google Scholar
Mckendry I (2002) Evaluation of artificial neural networks for fine particulate pollution (PM10 and PM2.5) forecasting. J Air Waste Manage Assoc 52:1096–1101. https://doi.org/10.1080/10473289.2002.10470836
Article CAS Google Scholar
Mehdipour V, Stevenson DS, Memarianfard M, Sihag P (2018) Comparing different methods for statistical modeling of particulate matter in Tehran, Iran. Air Qual Atmos Health 11:1155–1165. https://doi.org/10.1007/s11869-018-0615-z
Article CAS Google Scholar
Muche T (2014) Optimal operation and forecasting policy for pump storage plants in day-ahead markets. Appl Energy 113:1089–1099. https://doi.org/10.1016/j.apenergy.2013.08.049
Article Google Scholar
Ordieres J, Vergara E, Capuz R, Salazar R (2005) Neural network prediction model for fine particulate matter (PM2.5) on the US–Mexico border in El Paso (Texas) and Ciudad Juarez (Chihuahua). Environ Model Softw 20:547–559. https://doi.org/10.1016/j.envsoft.2004.03.010
Article Google Scholar
Ospina R, Zamprogno B (2003) Comparação de Algumas Técnicas de Previsão em Análise de Séries Temporais. Revista Colombiana de Estadística 26:129–157
Google Scholar
Perez P (2012) Combined model for PM10 forecasting in a large city. Atmos Environ 60:271–276. https://doi.org/10.1016/j.atmosenv.2012.06.024
Article CAS Google Scholar
Perez P, Reyes J (2006) An integrated neural network model for PM10 forecasting. Atmos Environ 40:2845–2851. https://doi.org/10.1016/j.atmosenv.2006.01.010
Article CAS Google Scholar
Perez P, Trier A, Reyes J (2000) Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmos Environ 34:1189–1196. https://doi.org/10.1016/S1352-2310(99)00316-7
Article CAS Google Scholar
Pope CA, Ezzati M, Cannon JB, Allen RT, Jerrett M, Burnett RT (2018) Mortality risk and PM2.5 air pollution in the USA: an analysis of a national prospective cohort. Air Qual Atmos Health 11:245–252. https://doi.org/10.1007/s11869-017-0535-3
Article CAS Google Scholar
R Development Core Team (2014) R: a language and environment for statistical computing. ISBN 3–900051–07-0. R Foundation for Statistical Computing, Viena, Austria
Relvas H, Miranda AI (2018) An urban air quality modeling system to support decision-making: design and implementation. Air Qual Atmos Health 11:815–824. https://doi.org/10.1007/s11869-018-0587-z
Article CAS Google Scholar
Ribeiro A, Seruca I, Durão N (2017) Improving organizational decision support: detection of outliers and sales prediction for a pharmaceutical distribution company. Procedia Computer Science 121:282–290. https://doi.org/10.1016/j.procs.2017.11.039
Article Google Scholar
Rodríguez-Cotto R, Ortiz-Martínez M, Rivera-Ramírez E, Mateus V, Amaral B, Jiménez-Vélez B, Gioda A (2014) Particle pollution in Rio de Janeiro, Brazil: increase and decrease of pro-inflammatory cytokines IL-6 and IL-8 in human lung cells. Environ Pollut 194:112–120. https://doi.org/10.1016/j.envpol.2014.07.010
Article CAS Google Scholar
Thomas S, Jacko R (2007) Model for forecasting expressway fine particulate matter and carbon monoxide concentration: application of regression and neural network models. J Air Waste Manage Assoc 57:480–488. https://doi.org/10.3155/1047-3289.57.4.480
Article CAS Google Scholar
Tratar LF, Strmčnik E (2016) The comparison of Holt–Winters method and multiple regression method: a case study. Energy 109:266–276. https://doi.org/10.1016/j.energy.2016.04.115
Article Google Scholar
Ul-Saufie A, Yahaya A, Ramli N, Rosaida N, Hamid H (2013) Future daily PM10 concentrations prediction by combining regression models and feed forward back propagation models with principle component analysis (PCA). Atmos Environ 77:621–630. https://doi.org/10.1016/j.atmosenv.2013.05.017
Article CAS Google Scholar
Veiga C, Veiga CR, Catapan A, Tortato U, Silva W (2014) Demand forecasting in food retail: a comparison between the Holt-Winters and ARIMA models. WSEAS Trans Bus Econ 11:608–614
Google Scholar
Ventura LMB, Pinto FO, Soares LM, Luna AS, Gioda A (2017a) Evaluation of air quality in a megacity using statistics tools. Meteorog Atmos Phys 130:361–370. https://doi.org/10.1007/s00703-017-0517-x
Article Google Scholar
Ventura LMB, Mateus VL, De Almeida ACSL, Wanderley KB, Taira FT, Saint’Pierre TD, Gioda A (2017b) Chemical composition of fine particles (PM_2.5): water-soluble organic fraction and trace metals. Air Qual Atmos Health 10:845–852. https://doi.org/10.1007/s11869-017-0474-z
Article CAS Google Scholar
Voukantsis D, Karatzas K, Kukkonen J, Räsänen T, Karppinen A, Kolehmainen M (2011) Intercomparison of air quality data using principal component analysis, and forecasting of PM10 and PM2.5 concentrations using artificial neural networks, in Thessaloniki and Helsinki. Sci Total Environ 409:1266–1276. https://doi.org/10.1016/j.scitotenv.2010.12.039
Article CAS Google Scholar
WHO (2006) Air quality guidelines for particulate matter, ozone, nitrogen dioxide and sulphur dioxide. Global update 2005. Summary of Risk Assessment. World Health Organization, Geneva
Winters P (1960) Forecasting sales by exponentially weighted moving averages. Manag Sci 6:324–342. https://doi.org/10.1287/mnsc.6.3.324
Article Google Scholar

Download references

Acknowledgements

The authors are grateful to the Environment Institute of Rio de Janeiro State (INEA) for providing PM_2.5 concentrations and meteorological data and to the National Council for Scientific and Technological Development (CNPq) and Foundation for Research Support of the Rio de Janeiro State (FAPERJ) for financial support. A. S. L thanks to Programa Prociência, UERJ.

Author information

Authors and Affiliations

Department of Chemistry, Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Marquês de São Vicente Street, 225, Gávea, Rio de Janeiro, RJ, 22451-900, Brazil
Luciana Maria Baptista Ventura & Adriana Gioda
Environment Institute of Rio de Janeiro State (INEA), Rio de Janeiro, Brazil
Luciana Maria Baptista Ventura & Fellipe de Oliveira Pinto
University of Rio de Janeiro State (UERJ), Rio de Janeiro, RJ, Brazil
Laiza Molezon Soares & Aderval S. Luna

Authors

Luciana Maria Baptista Ventura
View author publications
You can also search for this author in PubMed Google Scholar
Fellipe de Oliveira Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Laiza Molezon Soares
View author publications
You can also search for this author in PubMed Google Scholar
Aderval S. Luna
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Gioda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Adriana Gioda.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(DOC 104 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ventura, L.M.B., de Oliveira Pinto, F., Soares, L.M. et al. Forecast of daily PM_2.5 concentrations applying artificial neural networks and Holt–Winters models. Air Qual Atmos Health 12, 317–325 (2019). https://doi.org/10.1007/s11869-018-00660-x

Download citation

Received: 05 November 2018
Accepted: 21 December 2018
Published: 07 January 2019
Issue Date: 11 March 2019
DOI: https://doi.org/10.1007/s11869-018-00660-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Forecast of daily PM_2.5 concentrations applying artificial neural networks and Holt–Winters models

Abstract

Similar content being viewed by others

A neural network-based method for modeling PM 2.5 measurements obtained from the surface particulate matter network

Applying Artificial Neural Networks to Short-Term PM2.5 Forecasting Modeling

Machine Learning Based PM 2.5 and 10 Concentration Modeling for Delhi City

Introduction