Abstract
Air pollution is a major concern issue on Macao since the concentration levels of several of the most common pollutants are frequently above the internationally recommended values. The low air quality episodes impacts on human health paired with highly populated urban areas are important motivations to develop forecast methodologies in order to anticipate pollution episodes, allowing establishing warnings to the local community to take precautionary measures and avoid outdoor activities during this period. Using statistical methods (multiple linear regression (MLR) and classification and regression tree (CART) analysis) we were able to develop forecasting models for the main pollutants (NO2, PM2.5, and O3) enabling us to know the next day concentrations with a good skill, translated by high coefficients of determination (0.82–0.90) on a 95% confidence level. The model development was based on six years of historical data, 2013 to 2018, consisting of surface and upper-air meteorological observations and surface air quality observations. The year of 2019 was used for model validation. From an initially large group of meteorological and air quality variables only a few were identified as significant dependent variables in the model. The selected meteorological variables included geopotential height, relative humidity and air temperature at different altitude levels and atmospheric stability characterization parameters. The air quality predictors used included recent past hourly levels of mean concentrations for NO2 and PM2.5 and maximum concentrations for O3. The application of the obtained models provides the expected daily mean concentrations for NO2 and PM2.5 and maximum hourly concentrations O3 for the next day in Taipa Ambient air quality monitoring stations. The described methodology is now operational, in Macao, since 2020.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
1 Introduction
The World Health Organization has last update Air Quality Guidelines (AQG) in 2005, which sets the recommended threshold levels for each pollutant. Also, Macao has an air quality problem, in particular of high levels of nitrogen dioxide (NO2), particulate matter (PM2.5), and ozone (O3), which often overstep the guidance values of AQG. In addition, there are a lot of studies that show that exposure to NO2, PM2.5, and O3 have increased hospital admissions and emergency room visits and even led to death from heart or lung diseases in extreme cases (WHO, 2003). Thus, it is extremely important to develop a reliable air quality forecast for the concentration of NO2, PM2.5, and O3 in Macao, which can alert the local population to take precautionary measures in case of a pollution episode (Neto et al., 2009). Figures 1, 2 and 3 showed the comparison of different air quality standards amongst WHO, EU, US, China, Macao, and Hong Kong for NO2, PM2.5, and O3 (MEE, 2012; SMG, 2019; WHO Europe, 2006).
2 Methodology
To now the air quality for the next day was used statistical methods that were based on past data series analysis. For this paper was utilized multiple linear regression (MLR) and classification and regression tree (CART) analysis. As showed in previous work (Cassmassi, 1997), statistical models based on MLR and CART analysis were developed to forecast the average daily concentration for NO2 and PM2.5, and the maximum hourly O3 levels for the next day, for the air quality monitoring stations of Taipa Ambient and Taipa Residential. Taipa Ambient is an ambient station, also a background representative station, which set the baseline for the levels of pollutant concentration. This station is located at Taipa Grande, the headquarter of Macao Meteorological and Geophysical Bureau (SMG). Taipa Residential is a high-density residential area station located in Taipa. This station is located at the Taipa Central Park, a leisure area for the local residents. A six-year period from 2013 to 2018 was selected as the period to develop the models, while the year of 2019 was selected for validation of the model.
Figure 4 shows the flowchart of the model development for the air quality forecast using statistical methods. The development of statistical model consists of collecting the air quality data and meteorological data, followed by computing the hourly air quality data into the daily concentrations of NO2, PM2.5, and O3. In addition, the meteorological data required to develop the statistical model would be extracted from different meteorological observations. These processed data would be analyzed by statistical methods such as multiple linear regression (MLR) and classification and regression trees (CART) analysis. The final procedure is to perform a model validation to ensure the accuracy of the next-day air quality forecast.
3 Results and Discussion
Table 1 provides the final list of meteorological and air quality parameters used as predictors, for each pollutant, in the obtained multiple regression models for the air quality air quality monitoring stations of Taipa Ambient and Taipa Residential. Table 1 shows PM2.5 has a highly correlated relationship with past concentration levels (PM25_16D1 as the average of the hourly values(µg/m3) between 16:00 of yesterday and 15:00 of today) for both Taipa Ambient and Taipa Residential air quality monitoring stations, geopotential height (m) at 850 hPa (H_850), and average relative air humidity (%) (HRMD) for Taipa Ambient, average dew point temperature (°C) (TD_MD), and air temperature (°C) at 925 hPa (TAR_925) for Taipa Residential. In addition, NO2 has a highly correlated relationship with past concentration levels NO2_16D1and geopotential height (m) at 850 hPa for both Taipa Ambient and Taipa Residential air quality monitoring stations, atmospheric stability (°C) at 925 hPa (STB_925) for Taipa Ambient, and average dew point temperature (°C) (TD_MD) for Taipa Residential. Furthermore, O3 has a highly correlated relationship with past concentration levels O3 MAX_16D1, O3 MAX_23D1 (as the maximum hourly values (µg/m3) between 00:00 and 23:00 of yesterday), geopotential height (m) at 850 hPa and minimum relative air humidity (%) (HRMN) for both Taipa Ambient and Taipa Residential air quality monitoring stations.
An example of one of the regression equations delivered is the following for next-day 24 h-average NO2 at Taipa Ambient:
Figure 5 shows the graph of the model validation for the MLR models of NO2 concentrations in Taipa Ambient air quality monitoring station in 2019.
Table 2 shows the model performance indicators of PM2.5, NO2, and O3 MAX for Taipa Ambient and Taipa Residential air quality monitoring stations. The model performance indicators include coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and BIAS. The results obtained from MLR and CART models perform a coefficient of determination between 0.86 and 0.87 for Taipa Ambient and between 0.78 and 0.88 for Taipa Residential. The air quality forecast is at best forecasting the levels of NO2 concentration in Taipa Ambient and is at best forecasting the levels of PM2.5 concentration in Taipa Residential. All of the statistical models were built using MLR, while the models for the maximum hourly ozone were built using both MLR and CART.
The ambient station was better at predicting NO2 while the residential station was better at predicting the maximum hourly concentration of O3. Also, the developed models provide a better understanding of different air quality and meteorological variables and also the relationship between these variables. Furthermore, the variable that explained most of the variability is the 16D1 concentration for NO2, PM2.5, and O3.
4 Conclusion
The work presented is an air quality forecast using statistical methods, based on a detailed analysis of both air quality and meteorological variables for NO2, PM2.5, and O3. The final objective of this study is to develop a daily air quality forecast using statistical methods to predict the daily average of NO2, PM2.5, and maximum hourly O3 levels for the next day, in the Taipa Ambient air quality monitoring station (background location) and the Taipa Residential air quality monitoring station (the high-density residential location). The models for NO2, PM2.5 and O3_MAX used independent variables including the average of the hourly values between 16:00 of yesterday and 15:00 of today for NO2, PM2.5 and O3 MAX respectively, the average of the hourly values between 00:00 and 23:00 of yesterday for O3 MAX, geopotential height at 850 hPa, atmospheric stability at 850 hPa and 925 hPa, air temperature at 925 hPa, average dew point temperature, minimum relative air humidity and average relative air humidity. The use of statistical models was successful in forecasting the average daily concentrations with MLR for NO2 and PM2.5 and MLR and CART analysis for the peak levels for maximum hourly O3 for next day and be able to forecast the high concentration of pollution episodes, for both Taipa Ambient and Taipa Residential.
References
Cassmassi, J. C. (1997). Objective ozone forecasting in south coast Air Basin: Updating the objective prediction models for the late 1990’s and Southern California ozone study (SCOS97-NARSTO) applications.
MEE. (2012). Ambient air quality standards. Retrieved from http://210.72.1.216:8080/gzaqi/Document/gjzlbz.pdf.
Neto, J., Torres, P., Ferreira, F., & Boavida, F. (2009). Lisbon air quality forecast using statistical methods. International Journal of Environment and Pollution, 39(3/4), 333. https://doi.org/10.1504/IJEP.2009.028695
SMG. (2019). Resumo anual sobre qualidade do ar em Macau—2019. Retrieved from https://cms.smg.gov.mo/uploads/sync/pdf/AIR_report/p_IQA_annual_report/IQA_2019.pdf.
WHO. (2003). Health aspects of air pollution with particulate matter, ozone and nitrogen dioxide. Health Aspects of Air Pollution with Particulate Matter, Ozone and Nitrogen Dioxide, (January), 98. https://doi.org/10.2105/AJPH.48.7.913.
WHO Europe. (2006). Air quality guidelines. Retrieved from http://202.171.253.71/www.euro.who.int/__data/assets/pdf_file/0005/78638/E90038.pdf.
Acknowledgements
The work developed was supported by The Macao Meteorological and Geophysical Bureau (SMG). The research work of CENSE is financed by Fundação para a Ciência e Tecnologia, I.P., Portugal (UID/AMB/04085/2019).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer-Verlag GmbH, DE, part of Springer Nature
About this paper
Cite this paper
Lei, M.T., Monjardino, J., Mendes, L., Gonçalves, D., Ferreira, F. (2021). Statistical Methods to Forecast Air Quality in Taipa Ambient and Taipa Residential of Macao. In: Mensink, C., Matthias, V. (eds) Air Pollution Modeling and its Application XXVII. ITM 2019. Springer Proceedings in Complexity. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-63760-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-662-63760-9_25
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-63759-3
Online ISBN: 978-3-662-63760-9
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)