Air Quality Modeling Using the PSO-SVM-Based Approach, MLP Neural Network, and M5 Model Tree in the Metropolitan Area of Oviedo (Northern Spain)

García Nieto, P. J.; García-Gonzalo, E.; Bernardo Sánchez, A.; Rodríguez Miranda, A. A.

doi:10.1007/s10666-017-9578-y

Air Quality Modeling Using the PSO-SVM-Based Approach, MLP Neural Network, and M5 Model Tree in the Metropolitan Area of Oviedo (Northern Spain)

Published: 26 August 2017

Volume 23, pages 229–247, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Environmental Modeling & Assessment Aims and scope Submit manuscript

Air Quality Modeling Using the PSO-SVM-Based Approach, MLP Neural Network, and M5 Model Tree in the Metropolitan Area of Oviedo (Northern Spain)

Download PDF

P. J. García Nieto ORCID: orcid.org/0000-0001-8880-6348¹,
E. García-Gonzalo¹,
A. Bernardo Sánchez² &
…
A. A. Rodríguez Miranda²

659 Accesses
21 Citations
Explore all metrics

Abstract

The main aim of this study was to construct several regression models of air quality using techniques based on the statistical learning, in the metropolitan area of Oviedo, in northern Spain. In this research, a hybrid particle swarm optimization-based evolutionary support vector regression is implemented to predict the air quality from the experimental dataset (specifically, nitrogen oxides, carbon monoxide, sulfur dioxide, ozone, and dust) collected from 2013 to 2015 in the metropolitan area of Oviedo. Furthermore, a multilayer perceptron network (MLP) and the M5 model tree were also fitted to the experimental dataset for comparison purposes. Finally, the predicted results show that the hybrid proposed model is more robust than the MLP and M5 model tree prediction methods in terms of statistical estimators and testing performances.

PM_2.5 concentration forecasting using ANFIS, EEMD-GRNN, MLP, and MLR models: a case study of Tehran, Iran

Article 18 December 2019

Forecasting PM₁₀ in Algiers: efficacy of multilayer perceptron networks

Article 18 September 2015

Air Pollutant Concentration Forecast Based on Support Vector Regression and Quantum-Behaved Particle Swarm Optimization

Article 29 September 2018

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Air pollution can be defined as the introduction into the atmosphere of chemicals, particulates, or biological elements that can cause discomfort, disease, and even death to humans, animals, or plants. It can also deteriorate the natural or built environment [1,2,3]. Air pollution has many different sources: (a) natural sources such as volcanic eruptions and windblown dust; (b) static man-made sources such as factories or power plants, or dry-cleaning and degreasing operations; and (c) mobile man-made sources such as motorized vehicles, planes, and trains, all of which contribute to air pollution. Air pollution can be of natural or human origin.

In air quality control, the first response to a known or potential threat to the established air quality standard or guideline is to reduce it. State Implementation Plans (SIPs) formalize such responses in Spain [1,2,3]. Air pollution is an important environmental problem in metropolitan areas [1,2,3,4,5] like Oviedo (Principality of Asturias, Spain). It may cause health problems that lead to difficulty in breathing, coughing, and worsening of existing cardiac and respiratory problems [3,4,5]. For instance, diesel exhaust (DE) is one of the main sources of emission of particulate matter originated during combustion. DE has been linked to an increase in thrombosis and acute vascular dysfunction in several human health studies. This would explain the link between increased cardiovascular morbidity and mortality and the previously described particulate matter air pollution [1,2,3, 6].

Oviedo is the administrative center of the Principality of Asturias in northern Spain. It has a population of 221,202 and covers a land area of 186.65 km². It stands at 232 m above sea level and has a population density of 1185.12 inhabitants per square kilometer. The climate of Oviedo, like in the rest of northwest Spain, is more diverse than in other parts of Spain. Summers are generally warm and humid, with sunshine but also some rain. Winters are cold and very wet. Snow is usually present from October to May in the mountains that surround the city. Both rain and occasional snow are regular features in the winters of Oviedo.

The coal-fired power plant in Soto de Ribera is located 7 km south of the city of Oviedo (Fig. 1). This plant power supplies most of the electrical energy consumed in Oviedo. The geographical locations of the three meteorological stations and the Soto de Ribera coal-fired power plant are shown in Fig. 1. The Soto de Ribera plant is situated in the district of Ribera de Arriba at an altitude of 126.5 m above sea level.

The monitoring of meteorological pollution, measuring components such as carbon monoxide (CO), sulfur dioxide (SO₂), nitric oxide (NO), nitrogen dioxide (NO₂), ozone (O₃), and particulate matter less than 10 μm (PM₁₀), is becoming increasingly important due to their adverse effects on human health [1,2,3, 7,8,9,10,11]. Therefore, the EU and many national environmental agencies have established standards and air quality guidelines for permissible levels of these contaminants in the air [5, 11, 12]. The main aim of this work is to build a model for the average daily pollution that would be useful to the authority responsible for air pollution regulation in the corresponding region. The data used for this study has been collected within 3 years, specifically from 2013 to 2015. The numerical experiments applying the PSO-SVM-based technique have obtained good daily modeling accuracy for all pollutants considered. They will be presented and discussed in this paper.

To fix ideas, the aim of this study is to evaluate the application of the support vector machines (SVMs) approach [13,14,15,16,17,18,19,20] in combination with the evolutionary optimization technique known as particle swarm optimization (PSO) [21,22,23,24], as well as the multilayer perceptron (MLP) [25,26,27,28,29,30,31] and M5 model tree [32,33,34] to identify the air quality in the metropolitan area of Oviedo (northern Spain) on a local scale, comparing the results obtained. The theoretical support for the learning algorithms of SVMs is given by the statistical learning theory and structural risk minimization. Specifically, five PSO-SVM-based models were created for NO₂, SO₂, and aerosol particles less than 10 μm (PM₁₀) as a function that used the other measured relevant pollutants in air quality as independent variables, namely, NO, CO, and O₃. The purpose was to obtain accurate concentration estimates of the pollutants NO₂, SO₂, and PM₁₀ [35,36,37]. SVM models can be used as an alternative to the classic regression approaches, and they are a new family of models that can be used for estimating values from very different areas [13,14,15,16,17,18,19,20]. The five PSO-SVM-based models were found to improve the accuracy in the case of nonlinear regression problems, such as those related to air quality, which are studied in this paper.

The PSO technique was successfully used here to optimize the tuning of the kernel optimal hyperparameters in the SVM training phase. PSO was introduced by Kennedy and Eberhart in 1995 [21] and is a swarm intelligence (SI) bio-inspired algorithm. The PSO is based on the simulation of the flocking of birds [21,22,23,24] and it is similar to other evolutionary computation SI-based algorithms. It also exploits the model of social sharing of information [38, 39]. PSO hybridized with SVM (PSO-SVM) models [38, 39] was used as a learning tool, and trained to estimate the air quality in the metropolitan area of Oviedo from other air pollutants on a local scale.

Model, together with the MLP model and M5 model tree [25,26,27,28,29,30,31,32,33,34], was used as automated learning tools, training them in order to predict the air quality in the metropolitan area of Oviedo from the operation physical-chemical input pollutants measured experimentally.

This innovative paper is organized as follows: firstly, the necessary materials and methods to carry out the study are described. Secondly, the results obtained are shown and discussed. Finally, the main conclusions drawn from the results are presented.

2 Materials and Methods

2.1 Sources and Types of Air Pollution

An air pollutant is a substance contained in atmospheric air that can be unhealthy for humans and the environment. Pollutants can be found in the form of solid particles, liquid droplets, or gases. They may be man-made or natural and can be classified as primary or secondary. Mostly, primary pollutants come from a process, such as carbon monoxide from a motor vehicle exhaust, sulfur dioxide from factories, or ash from a volcanic eruption. Secondary pollutants form in the air when primary pollutants interact or react, and therefore, they are not emitted directly. For instance, an important secondary pollutant is ground-level ozone, which is one of the many secondary pollutants which make up photochemical smog [4, 35,36,37, 40]. Some pollutants can be both primary and secondary, that is, they have been both emitted directly and formed from other primary pollutants.

Human activity produces major primary pollutants such as [1,2,3,4,5,6,7,8,9,10,11,12, 35,36,37, 40,41,42] the following:

Particulate matter (PM): also called atmospheric particulate matter, or fine particles. These are tiny particles of solids or liquids suspended in a gas. On the other hand, an aerosol would indicate particles and gas together.
Sulfur oxides (SO_x): in particular, sulfur dioxide, a chemical compound with the formula SO₂. The combustion of coal and petroleum generates sulfur dioxide because these often contain sulfur compounds.
Nitrogen oxides (NO_x): mainly NO₂ that is emitted during high-temperature combustion. The first product formed is NO, and when NO oxidizes further in the atmosphere, it becomes NO₂.
Carbon monoxide (CO): is produced by the incomplete combustion of fuels such as coal, wood, or natural gas.

Secondary pollutants include [1,2,3,4,5,6,7,8,9,10,11,12, 35,36,37, 40,41,42] the following:

Particulate matter: this is composed of gaseous primary pollutants and compounds in photochemical smog. Smog is a special type of air pollution. Typical smog results from large amounts of coal burning in a particular area and is caused by a mixture of smoke and sulfur dioxide.
Ground-level ozone (O₃): this develops from NO_x and volatile organic compounds (VOCs). Short-term exposure to elevated levels of ozone can be the origin of eye and lung irritations.

Regarding trends in air quality, the Clean Air Act of 1970 established the setting of standards for four of the primary pollutants (aerosols, sulfur dioxide, carbon monoxide, and nitrogen oxides) and the secondary pollutant ozone. Back then, in 1970, these five pollutants were identified as the most widespread and undesirable. Nowadays, lead has been added and they are known collectively as the criteria pollutants and are covered by the United States National Ambient Air Quality Standards (Table 1) [1,2,3,4,5,6,7,8,9,10,11,12]. The primary standard for each pollutant can be seen in Table 1, which is based on the highest level that can be tolerated by humans without noticeable negative effects, minus a 10–50% margin for safety reasons.

Table 1 National Ambient Air Quality Standards by the United States Environmental Protection Agency (USEPA) [1,2,3,4,5,6,7,8,9,10,11,12, 40,41,42]

Air Quality Modeling Using the PSO-SVM-Based Approach, MLP Neural Network, and M5 Model Tree in the Metropolitan Area of Oviedo (Northern Spain)

Abstract

Similar content being viewed by others

PM2.5 concentration forecasting using ANFIS, EEMD-GRNN, MLP, and MLR models: a case study of Tehran, Iran

Forecasting PM10 in Algiers: efficacy of multilayer perceptron networks

Air Pollutant Concentration Forecast Based on Support Vector Regression and Quantum-Behaved Particle Swarm Optimization

Explore related subjects

1 Introduction

2 Materials and Methods

2.1 Sources and Types of Air Pollution

2.2 Experimental Dataset

2.3 Support Vector Machine Method

2.4 The Particle Swarm Optimization Algorithm

2.5 Artificial Neural Network: Multilayer Perceptron

2.6 M5 Model Tree

3 Results and Discussion

4 Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

PM_2.5 concentration forecasting using ANFIS, EEMD-GRNN, MLP, and MLR models: a case study of Tehran, Iran

Forecasting PM₁₀ in Algiers: efficacy of multilayer perceptron networks