Introduction

The World Health Organization (WHO) has provided evidence of the effects of air pollution on human health, at concentrations even lower than those established by WHO in 2005. Every year, approximately seven million premature deaths and millions of healthy life year losses can be attributed to exposure to polluted air according to this specialized agency of the United Nations (WHO 2020).

The emission of tropospheric ozone (O3) and particulate matter (PM) is considered a serious threat for its contribution to global warming, for being a menace to marine and terrestrial ecosystems, as well as to people’s health and well-being through problems of water and food security, plant damage, and migrations caused by drought and floods. According to research results, in 2019, about 90% of the world’s population lived in areas where concentrations exceeded air quality thresholds established by WHO in 2005 (Guarin et al. 2019; WHO 2020).

Long-term exposure to high levels of O3 and PM increases the probability of suffering from chronic respiratory and cardiovascular diseases (McDonnell et al. 2010; WHO 2013; Chen et al. 2018; Rohr 2018; Orellano et al. 2020). Even short-term exposure to these pollutants can lead to decreased lung mucociliary function and trigger asthma exacerbations (Karthik et al., 2017; Cesari et al. 2018). Moreover, studies have suggested that long-term exposure to pollutants is associated with both reduced survival and life expectancy in the order of 1–2 years (Dedoussi et al. 2020). Other recent researchers have shown that the prevalence of bronchitis symptoms in children and adults is associated with exposure to PM (Amoatey et al. 2021; Yan et al. 2021). According to Ortiz-González et al. (2018), about 16,400 premature passings can be caused by tropospheric O3 pollution in the European Union, out of which 1,700 correspond to Spain. However, almost 80% of O3 and PM-related deaths could be avoided worldwide if current levels of air pollution were reduced to those proposed in the updated guideline, according to an analysis conducted by WHO (WHO 2021).

Traditionally, particulate air pollution has been considered to be mainly an urban phenomenon. It is now clear that, in many areas of Europe, the differences between urban and rural areas in terms of PM10 are small or even non-existent, indicating that exposure to particulate matter is widespread due to, among other issues, the actions of climatological variables that enable the transport of these particles over very long distances (Rodríguez et al. 2004).

The atmospheric PM is made up of solid and/or liquid particles, which enter the atmosphere by natural pathways or through anthropogenic activities. This pollutant may present different natura: natural resuspension (by means of marine and mineral sources), biomass burning, coal combustion, road dust, traffic, industrial emissions, or heating generated by households and enterprises (Rodríguez et al. 2004; Hopke et al. 2020).

Tropospheric O3 is a secondary atmospheric pollutant with strong oxidizing attributes. It is produced by the photooxidation of volatile organic compounds (VOCs) in the presence of nitrogen oxides (NOx) and sunlight, but in a non-linear manner, which turns O3 forecasting and control into a challenging task for policymakers (Atkinson 2000; Wang et al. 2016). Sillman and He (2002) outline ozone chemistry depending on these two main precursors. When the VOCs/NOx ratio is low, then the O3 production regime is said to be VOC-limited, and an increase (decrease) in NOx may lead to a drop (growth) in O3 concentrations. This kind of regime is more common in urban areas. By contrast, if the VOCs/NOx ratio is high, then the O3 production regime is considered to be NOx-limited, and an increase in NOx may lead to higher O3 levels. This regime is more common in rural areas located downwind from emission sources, as precursors are transported by local and regional air mass flows away from urban sites (Ortiz-González et al. 2018). Moreover, high biogenic hydrocarbon levels emitted by forests and other vegetations also contribute to O3 formation (Calfapietra et al. 2013). Consequently, adjacent rural areas tend to display more prominent O3 levels than urban agglomerations.

Other factors that favor tropospheric O3 generation are the following ones:

  • High temperatures and strong solar radiation (Crutzen 1973; Monks et al. 2015; Guo et al. 2016), which explain why daily O3 concentrations are the highest at midday-afternoon and during summer.

  • Large wildfires, which are also prone to happen in summer (Yang et al. 2015).

  • Low humidity and precipitation scarcity (Doherty et al. 2013).

  • Meso-meteorological recirculations of air masses through winds and sea breezes in mountainous ranges that favor stagnation episodes of polluted air masses (Querol et al. 2016; Finardi et al. 2018).

  • Economic growth (Hilboll et al. 2013).

  • Global warming through rising emissions of greenhouse gases and methane (Fiore et al. 2015; Lin et al. 2017).

The circulation and redistribution of O3 and PM across the planet is a phenomenon that poses a serious challenge to governments and health organizations when drafting air quality policies. Several articles have assessed the sources and migration of atmospheric pollutants at rural and urban sites and some of them have shown significant differences in trends and levels over time (Rodrı́guez et al. 2004; Paoletti et al. 2014;¸ Jaén et al. 2021). However, there is a lack of a comprehensive analysis at local level that explicitly analyzes the migration and behavior of O3 and PM10 pollution between localities. Zong et al. (2018) are the only ones who carried out research which is similar to the one regarding O3. An observation-based model was proposed in their paper to analyze the relationship between O3 levels in the Chinese Shandong province’s capital city and a downwind rural region. According to their results, regional air mass transport is responsible for more than 50% of the observed accumulation of O3 in Chinese rural areas.

The Western Mediterranean Basin (WMB) is among the most exposed ones to PM and O3 pollution in Europe according to the Intergovernmental Panel on Climate Change (2014) and the European Environmental Agency (2020) due to its abrupt orography and potential emission sources of VOCs in the urbanized and industrialized coastal and forested areas (Castell et al. 2008; Seco et al. 2011; Escudero et al. 2014). Moreover, multiple factors favor these pollutants’ latter stagnation-recirculation episodes within the region (Querol et al. 2001; Rodríguez et al. 2003; Cristofanelli and Bonasoni 2009; Valverde et al. 2016).

First of all, the Mediterranean-type climate, characterized by hot summers with high temperatures, low winds and precipitations, and strong solar radiation, favors O3 formation (Millán et al. 1997). Second, Saharan dust outbreaks (commonly known as calima) transport African dust to South Europe and substantially increase PM10 levels, especially in summer and in winter-spring (Rodríguez et al. 2002; Rodríguez et al. 2004; Michelot et al. 2016). In third place, O3 is subject to long-range transport towards the Mediterranean basin from Central and Eastern Europe, North America, and Asia (Lelieveld et al. 2002; Auvray and Bey 2005; Boudjella et al. 2017). Fourth, intense sea breeze dynamics are common along the shore during March–September, which contribute to PM increases in coastal regions and the transport of O3 to inner areas (Querol et al. 2004). Fifth, the presence of important urban concentrations and economic activities on the coastal areas, such as tourism, transport, traffic, and construction, contribute to PM emissions and represent a source of NOx and VOCs, which enhance O3 formation. Finally, a complex geographical setup comprised of several mountainous ranges (the Alps, Apennines, Atlas, Betic, Iberian, and Catalan Ranges) favors the stagnation and recirculation of polluted air masses that get trapped within the region (Millan et al. 2000; Gangoiti et al. 2001; Rodriguez et al. 2002; Van-Drooge and Grimalt 2015; Largeron and Staquet 2016; Querol et al. 2016, 2017, 2018).

Related to O3, despite a wide set of proven evidence, public authorities have undertaken a limited number of measures to tackle atmospheric ozone air pollution, and their main efforts have been focused on PM2.5 and NO2 pollutants. PMs are composed of those with aerodynamic size below 10 µm (PM10) and the fine particles with aerodynamic size below 2.5 µm (PM2.5). Both can be inhaled with some depositing through the airways, but due to its size, fine PM2.5 has a higher likelihood to get into deeper parts of the lung, while PM10 is more prone to deposit on the surfaces of the larger airways. Thus, PM2.5 poses a higher risk to human health compared to larger particulate matters. Moreover, the number of premature deaths associated with PM2.5 and NO2 pollutants surpasses that attributed to O3 (for instance, 27,900 and 8,900 against 1,800 in 2015, respectively; Ortiz-González et al. 2018) and the former have more intense effects on the Spanish healthcare expenses (WHO Regional Office for Europe 2015). In addition, O3 concentrations are higher in more remote rural areas with low population, despite the fact that its precursors are generally produced at urban and industrial sites. The lack of interest in O3 control at rural sites could be explained by data scarcity at a local level and a limited number of articles centered on O3 dynamics in these zones.

Metropolitan areas generally concentrate on road traffic emissions and polluting economic activities that emit most atmospheric pollutants; therefore, the bulk of research has focused on their effects on urban inhabitants (Striebig et al. 2019). Nevertheless, rural areas may also register high pollution levels and be related to emissions generated at urban sites.

In the case of PM10, there is a greater number of results within the CC compared to O3 in the last 5 years. However, most of the articles only consider a specific number of localities as their study areas within the CC. However, the lack of a detailed local analysis of O3 and PM10 pollution behavior between apparently different areas could help establish similar environmental policies between these zones, and thus optimize the use of the resources employed to reduce atmospheric pollution at all sites.

This article aims at shedding more light on the aforementioned issue. Towards this end, hourly data on O3 and PM10 pollution concentrations were extracted from 44 and 18 air monitoring stations, respectively, which cover the whole CC during the period 2012–2020.

Our article adds new insights to the existing literature along several dimensions. First, most analyses on air pollution have tracked the dynamics of O3 and PM10 by geographical zones, focusing on the effects of meteorology, seasons, calendar effects, atmospheric and air recirculation dynamics, and altitude (Paoletti et al. 2014; Porter et al. 2014; Klein et al. 2017; Querol et al. 2018). As rural sites are prone to exhibit higher mean O3 and PM10 concentrations compared to metropolitan agglomerations despite fewer pollution sources, literature started to place more emphasis on this issue. Nevertheless, 8 years have rarely been considered to study this relationship. A few papers report evidence of noticeable transfers and dynamic effects among rural and urban sites, though these links are case-specific (Rovira et al. 2018; Zong et al. 2018; Yan et al. 2019; Jaén et al. 2020) and are difficult to apply to other geographical areas. Therefore, our paper selected a specific Spanish region for this purpose, the CC.

Second, and closely related to the previous point, a more accurate detection of the major sources of air pollution at a local level can mitigate the number of deaths caused by harmful air quality. Nonetheless, because of data scarcity and the paucity of longitudinal and precise pollution measures over a significant range of geographical scenarios, it is challenging to find evidence regarding this issue. In addition, the control of O3 and PM10 turns more complicated as both pollutants depend on complex precursors and are affected by both local and distant sources (Wang et al. 2016; Xue et al. 2016; Zong et al. 2018). This paper traces significant correlations regarding O3 and PM10 pollutants between Catalan sites by counting on more than 180,000 observations (from each station) collected over 8 years from 44 and 18 air quality monitoring stations, respectively. Health authorities can use these findings to outline effective measures to develop more successful environmental policies and reduce citizens’ exposure to poor air quality. As economic activities in Catalan rural areas, such as Montsec, depend on a large extent on leisure and tourism coming from several parts of Spain, high O3 levels registered at these sites are an additional matter of concern.

Third, this article assesses the relationship between three types of sites based on orography (urban, suburban, rural) using functional data based on functional Kendall’s tau (FKT), developed by Valencia et al. (2019). This approach accepts the continuous nature of the variable that is the object of analysis and avoids several simplifying hypotheses which are typically imposed by other quantitative methods, such as multivariate regression, principal component analysis, and analysis of variance. In particular, the FKT approach provides three pros: it does not assume normality for the data of interest; it is largely insensitive to outliers; and it is able to handle nonlinear relationships. These features are valuable in a context where the variable of interest, a pollution measure, exhibits a large degree of dispersion over time and seasons. The availability of hourly pollution data during 8 years and covering the entire territory of the CC, under the umbrella of statistics used on functional data, should be able to reveal those areas with similarities in pollution behavior, all to establish more efficient environmental policies. Environmental assessment (Martínez et al. 2014; Martínez et al. 2020) and medical research (Dannenmaiera et al. 2020) are two fields where functional data analysis has been applied (Li and Chow 2005).

The selection of ozone and PM10 is justified firstly by the close relationship between both pollutants in their formation process and secondly by the environmental impact that this link produces for the CC.

Material and methods

Study area and data selection

CC is a Spanish autonomous community located in the northeast of the Iberian Peninsula, bordered by France and Andorra to the north, the autonomous communities of Aragon to the west, and Comunidad Valenciana to the south. To the east, it is bordered by the Mediterranean Sea. Its territory of 32,107 km2 is divided into four provinces and its capital city is Barcelona, Spain’s second most populated municipality. Most of its 7.7 million inhabitants are located in the coastal plains and approximately 95% of them are concentrated in urban areas (municipalities above 200 inhabitants) according to the Catalan Regional Government (Generalitat de Catalunya 2021). The community has experienced substantial population growth rates between 2016 and 2020 compared to those of Spain and the EU average, and its regional GDP represented 19.3% of the Spanish total GDP.

Catalonia is characterized by a very complex topography, made up of high mountains, inland depressions, and a 580-km-long coastline. Such diverse orography exerts a remarkable influence on climate and pollution.

The Barcelona Metropolitan Area (BMA), which includes the capital and 35 surrounding towns, is situated on the shore and is a highly industrialized and densely populated area along the Pre-Coastal and the Coastal Depressions. It is responsible for high pollutant emissions originating from transportation, industrial, and agricultural activities. One of the most prominent BMA towns along the Besòs and Llobregat rivers is L’Hospitalet de Llobregat, with an important service sector hub. In addition, a gas power station (at Besòs) and two urban waste incinerators (Sant Adrià and a concrete factory at Montcada I Reixac) are also located in this area. Nevertheless, the major sources of atmospheric pollution at the BMA are the road, airport, and shipping traffic emissions (Querol et al. 2001; Rodríguez et al. 2003).

Two mountain chains run parallel to the coast: the Coastal Range and the Pre-Coastal Range. Montserrat and Montseny Massif (1,706.7 m above sea level (m.a.s.l.)) belong to the Pre-Coastal Range, which is longer and higher than the Coastal one: 1000–1500 m.a.s.l. versus 250–500 m.a.s.l. The Central Catalan Depression lies between the Pre-Coastal Range and the Pre-Pyrenees and is suitable for agricultural activities due to its fertile lands. It widens towards the west, linking with the Ebro Depression.

The Vic Plain is located in an intra-mountain 30-km valley embedded in the Catalan Central Depression, 40–70 km northern Barcelona. The most prominent towns of the plain are Vic, Manlleu, and Tona. This region is characterized by a complex orography that, coupled with summer sea breezes and daily anabatic winds, favors the penetration of polluted air masses from the BMA into the Vic Plain and towards the Pyrenees through the north-eastern and western Barcelona valleys (Besòs-Congost valley and Llobregat pathway). Consequently, vertical recirculation and fumigation of O3-rich air masses prevail in the plain, exacerbated by heatwaves and strong solar radiation in summer (Querol et al. 2017 Massagué et al. 2019; Jaén et al. 2021).

Another industrial center in the Catalan Mediterranean coast is Tarragona, located about 90 km southwest of Barcelona. The town is surrounded by power plants, incinerators, and chemical and petrochemical complexes that spread out until the municipalities of Reus, Salou, and Constantí. The region’s orographic and meteorological features bear many resemblances to the summer scenario described for the Vic Plain. Most anthropogenic emissions are located in the Tarragona industrial area and anabatic winds, in conjunction with sea breezes, drive polluted air mases inland until the northern regions of Tarragona province, such as Alcover and Ponts (Jiménez et al. 2005).

Catalonia is bordered in the north by the Pre-Pyrenees and the Pyrenees. These mountainous formations extend approximately 200 km and reach high altitudes (2,000–3,000 m.a.s.l. in case of the Pyrenees). This area includes the air quality stations of Bellver de Cerdanya (1,040 m.a.s.l.), Sort (692 m.a.s.l.), Pardines (1,224 m.a.s.l.), and Montsec (1,676 m.a.s.l.), all located in rural villages and almost no industrial activity, and thus low pollutant emissions. However, in summer, both the eastern and the central parts of the Pyrenees receive anabatic winds in the daytime, which tend to carry polluted air masses from urban and traffic sites on the coast and central Catalonia towards the mountains (Díaz-de-Quijano et al. 2009; Gonçalves et al. 2009). Apart from these sources of O3, high atmospheric altitudes and solar radiation favor O3 mixing ratios in summer, and minimal local NO emissions hinder O3 depletion (Ribas and Peñuelas 2006). Therefore, the remote mountainous areas of the Pyrenees are also prone to high O3 episodes due to the high mixing ratios and meso-meteorological recirculations.

To monitor and control pollution within the CC, the regional government relies on a network of 75 air quality stations, which are included in the Network of Surveillance and Prevention of Air Pollution (Xarxa de Vigilància i Previsió de la Contaminació Atmosfèrica (XVPCA)). Out of all air quality monitoring stations, 18 measure hourly pollution levels for PM10 and 44 for O3 (Fig. 1). Figures 2 and 3 display the location of these air monitoring stations outside the BMA and within the BMA by type (background, industrial, and traffic).

Fig. 1
figure 1

Distribution of air-quality monitoring stations by type in the CC outside the BMA

Fig. 2
figure 2

The 18 air quality monitoring stations of the XVPCA by type—within BMA

Fig. 3
figure 3

Annual average levels for O3 (µg/m.3) in the period 2013–2020 per station in the CC

Although XVPCA validates the hourly pollution data, algorithms were programmed in MATLAB, making use of parallel processing and data structures to adapt them to the needs of the calculations. In addition, Tableau was used to map the different air quality measurement stations and to facilitate the interpretation of the FKT results between stations through spatial analysis.

The CC is one of the most polluted regions within the Western Mediterranean Basin. In 2018, PM10 exceedances of the daily values recommended by WHO were recorded in most of the CC, except Alt Llobregat and the Western Pyrenees. The worst records took place in the BMA, Vallès-Baix Llobregat, Central Catalonia, and Vic Plain. In the case of O3, all the stations of the air monitoring network, except for a few located in BMA, recorded high exceedances of the 8-h value recommended by WHO (Ecologistas en Acción 2018).

In 22 of the 34 reference stations that assess pollution levels in the CC aimed at the protection of vegetation, the legal objective established for ozone has also been exceeded during the 5 years 2014–2018, affecting mainly crops and forests of Penedès-Garraf, Vic Plain, Comarques de Girona, Alt Llobregat, Eastern Pyrenees, Pre-Pyrenees, Terres de Ponent, and Terres de l’Ebre (Ecologistas en Acción 2018).

Several scientific papers have been written on the behavior of air pollutants within the CC. Querol et al. (2001) described PM10 sources at rural, urban, and industrial sites during PM episodes in different local areas around the city of Barcelona. Their results suggest that at rural sites, during the highest daily average PM10 episodes, the main source of pollution is African dust, while vehicle exhaust product is the major contributor in urban locations. In a related paper, Plasencia-Sánchez et al. (2018) assessed the exposure and health effects of PM in areas of intensive industrial activity (Tarragona) between 2016 and 2017. They found that the maximum PM concentrations reached 54 µg/m3. Moreover, they pointed out that the areas located far away from the coast shared a similar profile.

Querol et al. (2016) estimated that the contribution of the Barcelona city harbor emissions to the urban background reached 9–12% for PM10 and was linked to primary emissions from fuel oil combustion, but also to the developments of secondary ammonium sulfate within the city.

To assess the behavior of O3, Massagué et al. (2019) conducted a research in which they analyzed data sets on ozone concentrations in the Vic Plain between 2007 and 2017, the atmospheric basin of Spain that recorded the highest number of exceedances of the hourly O3 information threshold, frequently affected by the northward transport of atmospheric plume from the BMA. They evaluated how the implementation of short-term measures at a local level could mitigate emission precursors at the BMA. A generalized decrease in regional background O3 levels was found, together with an increasing trend as far as urban ozone is concerned.

Another recent research developed by Jaén et al. (2020) is related to heatwave-driven ozone episodes in June and July 2019. They pointed out that the phenomenon was characterized by synoptic anticyclonic conditions, surface temperatures over the 98th percentile (over the last 40 years), and high ozone values recorded inland due to precursors’ advection (NOx and VOCs) mainly from BMA, limited by an eastern and a western valley. Moreover, the authors showed that NO2 measured in Barcelona influenced urban and rural ozone concentrations (100 km from BMA).

Method by statistical analysis

To study the relationships regarding the behavior of O3 and PM10 pollutants between different geographic regions of the CC, this article employs FKT developed by Valencia et al. (2019). This statistic always considers a pair of air quality monitoring stations and measures which periods show similar pollution patterns, and which do not (defined as concordant and discordant periods, respectively). FKT assumes that air pollution levels recorded every hour of the day are discrete observations within the infinite pollution points in time defined by a curve that exhibits continuous behavior.

Because the classification grade according to a total ordering is not a viable option when dealing with functions, the paper is based on pre-orders, the two of which are as follows:

Pre-orders

Let us consider the continuous functions g and h on the compact interval I = [a,b]

$$f\underset{m}{\preccurlyeq }g \equiv \underset{t\in I}{\mathrm{max}} f(t) \le \underset{t\in I}{\mathrm{max}} g(t)$$
(1)
$$f\underset{i}{\preccurlyeq }g \equiv \underset{a}{\overset{b}{\int }}\left[g\left(t\right)-f\left(t\right)\right]dt \ge 0,$$
(2)

in which Condition (1), known as the max pre-order, is used to rank functions based on their maximum value, while Condition (2), usually called integral pre-order, is used to order functions based on their underlying area.

This paper follows the second alternative, for it provides more quality information for the study of air contamination. The definitions of functional concordance and functional tau were developed by Valencia et al. (2019).

Functional concordance

Let \(\preccurlyeq\) be a preorder between functions and let ≺ address the case without considering ties. Two pairs of functions \(({f}_{1 , }{g}_{1})\) and \(({f}_{2 , }{g}_{2})\) are called functionally concordant if either \({f}_{1 }\prec {f}_{2}\) and \({g}_{1 }\prec {g}_{2}\) or \({f}_{2 }\prec {f}_{1}\) and\({g}_{2 }\prec {g}_{1}\). Otherwise, they are called discordant.

Functional Kendall’s tau

Let \(({X}_{1}, {Y}_{1})({X}_{2}, {Y}_{2})\) be two realizations of a bivariate stochastic process \(\{(X\left(t\right), Y\left(t\right):t\in I\}\), where \(I=\left[a, b\right].\) FKT for the population is given by:

$$\tau =\left[P\left\{{X}_{1}\prec {X}_{2}, {Y}_{1}\prec {Y}_{2}\right\}+P\left\{{X}_{2}\prec {X}_{1}, {Y}_{2}\prec {Y}_{1}\right\}\right]-1$$
(3)

Be \(({x}_{1},{y}_{1})\dots ({x}_{n}, {y}_{n})\) of a bivariate random process \(\left(X, Y\right)=\{\left(X\left(t\right), Y\left(t\right):t\in I\right\}\) within the compact interval \(I=\left[a, b\right]\). Then, the empirical version of FKT is given by

$$\widehat{\tau }={\left(\genfrac{}{}{0pt}{}{n}{2}\right)}^{-1}\sum_{i<j}^{n}[2I({x}_{i}\prec {x}_{j}, .{y}_{i}\prec {y}_{j})+2I({x}_{j}\prec {x}_{i}, .{y}_{j}\prec {y}_{i})]-1,$$
(4)

where the realizations of the stochastic process are taken as functions. We calculate this statistic by programming Formulas (2) and (4) in MATLAB-R2020.

With the available data, defined generically as \({L}_{psdah}\) describing the pollution level of pollutant p at air quality measurement station s in the year a, day d, and hour h, the area under the curve of each generated function is calculated by (2). With these results and applying the definition of functional concordance, we use (4) to find FKT. In order not to lose generality \(p\in P\), \(s\in {S}_{p}, a\in A\), \(d\in {D}_{a}\), \(h\in {H}_{p}\) where \(P\) is defined as the set of pollutants, \({S}_{p}\) is defined as the set of air quality stations that measure the pollution levels of particulate \(p\), \(A\) is defined as the set of years taken into account for the study, \({D}_{a}\) is defined as the total of days in the year a, and \({H}_{p}\) is the set defining the number of hours of day where the pollutant \(p\) was measured.

When FKT takes value 1, pollution behavior in both air quality stations under study is the same during the observation period. On the other hand, if FKT is equal to − 1, patterns are entirely opposed; in other words, when the levels of a pollutant at a specific air quality station decrease in a year, these levels increase at the other station during the same year. When FKT is 0, the pollution behavior outlined by the functions is independent. Finally, the statistic FKT is a robust estimator as it exhibits a low sensitivity to outlier data. This characteristic is a significant property in this framework because pollution levels display a large degree of dispersion over time and the time of year. Under the methodology followed by this work, extreme values do not significantly affect the FKT estimates. While other quantitative approaches regard observations as discrete realizations of a random variable, FKT is based on functions to describe daily pollution levels. Thus, it allows us to fully capture the behavioral pattern of a continuous variable during the whole observation period.

Results

Preliminary results

Figures 3 and 4 depict the behavior of O3 and PM10 pollution levels, respectively, within the period 2013–2020. The air quality monitoring stations are ordered from left to right according to their distance from Barcelona city center (the Autonomous Community’s capital city), considered the conventional center of the CC. Regarding O3, distant stations tend to be located in rural areas such as Ponts, Pardines, Begur, Bellver de Cerdanya, Els Guiamets, Montsec, and La Senia. These stations tend to display higher pollution levels compared to city centers. However, when it comes to PM10, higher levels stand out in industrial and urban areas, as in Sant Vicenç dels Horts or in Montcada i Reixac—this latter one is located next to Montcada i Reixac’s industrial district, called Can Sant Joan. As displayed in Fig. 4, annual mean levels are above those established by WHO (2021), except for Berga and Bellver de Cerdanya.

Fig. 4
figure 4

Annual average levels for PM10 (µg/m.3) in the period 2013–2020 per station in the CC

Even if the scientific community has not yet agreed on an objective threshold above which O3 and PM10 are harmful to people’s health, WHO published in September 2021 its updated global air quality guidelines (AQGs) based on more than 500 scientific results. Some of these results associate pollution levels with the exacerbation of certain diseases among adults and children (WHO 2021). In this sense, WHO’s new updated pollution limits have set a guideline of 60 µg/m3 as daily 8-h average for O3 during peak seasons (defined as an average of daily maximum 8-h mean O3 concentration in the six consecutive months with the highest 6-month running average O3 concentration), and a 15 µg/m3 annual mean for PM10.

Figure 5ac show, at some of the CC’s most illustrative stations, the number of days per month when average O3 levels exceeded the recommended short-term (daily maximum 8-h) AQG level for ozone (100 µg/m3) by WHO (2021) in the period 2013–2020. O3 levels are calculated as an hourly updated 8-h rolling average. June, July, and August are the months when excess pollution is usually observed. This is a very similar pattern to the trend recorded in former years and can be applied to other thresholds too. WHO guideline level exceedances in rural areas of the CC during summer months—particularly between July and August (Valverde et al. 2016)—are not isolated phenomena in rural regions, which means that environmental policies are not yet sufficient to address the problem.

Fig. 5
figure 5

Several days above the 100 µg/m3 and 45 µg/m.3 target values established by WHO for O3 (af)

It is worth mentioning that throughout the COVID-induced strict lockdown period (March 15–June 21, 2020) almost all pollutants experienced significant declines due to the standstill of economic activities. Nevertheless, nearly all air quality monitoring stations registered higher O3 levels compared to the pre-pandemic values, which reveals the challenges for policymaking aimed at achieving the goals proposed by WHO (Betancourt et al. 2021, Morales-Solís et al. 2021).

Pollutant PM10 displays a different seasonal pattern than O3. Between years 2018 and 2020, the daily average levels exceeded the recommended short-term (24-h) AQG level for PM10 (45 µg/m3) by WHO (2021), particularly in winter (Fig. 5df), which justifies that those environmental policies should not be applied in a linear way throughout the whole year but need to take into account each pollutant’s seasonal pattern.

Figures 6 and 7 showcase the mean daily concentration of O3 and PM10 registered in specific Catalan regions based on different types of air monitoring stations according to their geographic location. Figure 6ad allude to the regions in urban, suburban agglomerations, and rural areas of the CC for pollutant O3, while Fig. 7ad refer to PM10 levels at several air monitoring stations. In each diagram, the period 2018–2019 is reflected, as it stands out for high levels of O3 and PM10 pollutants. During this period, levels of O3 in urban and suburban areas begin to rise around 8:00 a.m., attain their peak around 17:00, descend until 21:00, and then stay generally stable at night. However, at rural stations, O3 concentrations start to increase approximately at 10:00 a.m., reach their peak at around 16:00, drop until 22:00, and then stay roughly constant throughout the night. This profile can be observed across charts as meteorological conditions enhance the swift migration of pollutants from urban to rural regions. In addition, the reason why daily levels of O3 at rural sites within the 2012–2020 period exceeded those of urban sites is the circulation of polluted gases. These exceedances can be witnessed even during the daily maxima.

Fig. 6
figure 6

ad Hourly average concentrations for O3 in different areas (urban, suburban, and rural) in the CC during the period 2018–2019

Fig. 7
figure 7

ad Hourly average concentrations for PM10 in different areas (urban, suburban, and rural) in the CC during the period 2018–2019

As for PM10 levels, Fig. 7ad showcase their cyclical daily behavior, consisting in three distinguishable periods: they begin their upward trend around 6:00 a.m., reach their top value approximately at 9:00 a.m., fall until 16:00, start to increase anew at 16:00, get another maximum value at 22:00, and then decline during the night. This pattern may be influenced by the rush hours concentrating more significant movements of people and, therefore, by the use of road transport in the CC.

Kendall functional correlations analysis for O3

The data collected at each station are represented using eight continuous functions, each showing the daily pollution concentration over a year between 2013 and 2020. Each continuous function consists of around 364 discrete points, corresponding to daily observations. Subsequently, FKT between pairs of stations is computed, generating a symmetric matrix of correlation coefficients. These values allow us to assess the interaction between different types of stations regarding O3 and PM10. A positive and a FKT above 0.6 refers to stations whose daily pollution levels feature permanently similar behaviors during the period under analysis. A FKT close to 0 suggests an independence in the daily patterns of pollution levels at the stations being compared.

Figure 8ad display rural stations located in the northeast of CC and shows their highest and lowest FKTs concerning the rest of the stations. Figures 910ad proceed likewise, choosing other rural stations located in the east and south of CC. The right panel offers a map to outline the location between each rural reference station and the station that is more closely related to its pollution behavior, as measured by FKT. Regarding rural stations located in the northeast, one finds that at most of air quality stations, the station that is most correlated with the rural site is remote (Montsec-Vic, 125.4 km; Pardines-Gavà, 113 km; Bellver de Cerdanya-Vic, 61.4 km; Santa Pau-Vic, 32 km), except for Santa Pau-Vic.

Fig. 8
figure 8

ad FKT between rural areas located in the northeast of CC and stations with highest and lowest correlations (2013–2020)

Fig. 9
figure 9

ad FKT between rural areas located in the east of CC and stations with highest and lowest correlations (2013–2020)

Fig. 10
figure 10

ad FKT between rural areas located in the south of CC and stations with highest and lowest correlations (2013–2020)

One would expect nearby air quality stations or stations roughly equidistant from the aforementioned rural stations to show similar pollution patterns and have higher or analogous FKTs than those located further away. Such is the case of Reus station, located 105.2 km away from Montsec, which is 20 km less than the distance between Montsec and Vic. Nevertheless, Reus displays a FKT of merely 0.14 with Montsec, while the Montsec-Vic FKT is higher than 0.6. Other examples illustrating this phenomenon take place between Santa Pau and Manlleu, which are only 24 km away from each other, but exhibit a relatively low FKT of 0.43. At the same time, FKT corresponding to Santa Pau and Vic increases to 0.8, even if Vic is located 6 km further from Santa Pau than Manlleu. The distance between Bellver de Cerdanya and Berga is 30 km and these two stations register a FKT of 0.64, whereas Bellver and Vic are 61 km away from each other and record a FKT of 1. In other words, throughout our 8-year study period, O3 pollution behavior among these air quality stations was the same (Fig. 8c).

The stations located in eastern and southern Catalonia exhibit similar patterns as well. The distance between Montseny La Castanya and Berga is merely 55 km and records a FKT of 0.71, whereas FKT corresponding to Montseny-Granollers is 0 even if the stations are within an easy reach of 20 km. Santa María de Palautordera and Vic are separated by 31 km and exhibit a FKT of 0.8, while Santa María de Palautordera and Mataró are only 16.02 km away from each other, and their FKT is 0. The distance between Tona-Berga is 41.3 km and their FKT is 0.71, between La Senia-Constantí 96.5 km and their FKT is 0.64, and between El Guiamets-Berga 143 km and their FKT is 0.71.

These findings indicate that geographical proximity does not explain much about the dynamics of air contamination displayed by different local areas. Quite the contrary, the results signify the existence of latent and complex factors influencing these relationships. To add more clarity on this question, Fig. 10ac outline the FKT figures regarding the remaining three rural areas of the CC (La Senia, Els Guiamets and Gandesa). Once again, relatively distant locations exhibit higher correlations (La Senia-Constantí, 96.5 km, Els Guiamets-Amposta 46.5 km, Gandesa-Amposta 40 km) according to our findings, whereas lower correlations correspond to closer air quality stations (La Senia-Amposta, 25.7 km, FKT 0.21; Els Guiamets-Tarragona Parc de la Ciutat, 14 km, FKT − 0.14; and Gandesa-Tarragona Parc de la Ciutat, FKT − 0.07).

A closer review of the results reveals the role played by the Vic and Berga Plains (the former is located close to the Pyrenees and the latter to Barcelona city) and the regions where Reus, Amposta, and Constantí stations lie (all belong to Tarragona province). In the former area, important farming and agribusiness activities coexist with a diversified industrial fabric and growing retail and service sectors. The Vic air quality station is located at the middle of a flatland called Vic Plain and is roughly equidistant from Barcelona and France. It is characterized by persistent fogs during winter, which often trigger out thermal inversion episodes consisting of minimum temperatures of − 10 °C and maximum temperatures close to 0 °C. Storms are regular occurrences in summer and may even invert the Mediterranean climate’s typical minimum precipitation levels.

Berga, where the namesake air quality station is located, displays widespread textile, chemical, and machinery industries. Its climate is warm and rainy throughout the whole year and features some continental patterns due to the presence of mountain ranges that block maritime influence.

The behavior of O3 pollution at Reus, Amposta, and Constantí stations is likely to reflect the diversified industrial activity carried out in their outskirts, such as petrochemical (one of the most important ones in Spain), metallurgy, and construction sectors.

Therefore, for instance, the data in Fig. 8c and d show an outstanding relationship between two northeastern rural stations (Bellver de Cerdanya and Santa Pau) with Vic, located in the Vic Plain (FKT of 1 and 0.8 respectively). Similarly, Santa María de Palautordera and Tona, located in the east of CC, show a high correlation (0.79 and 0.64, respectively) with Vic, as shown in Figs. 8bc and 9bc. Other air quality stations also register high correlation with Berga station, which indicates that pollution patterns are very alike within the districts where these are located. Such is the case of Montseny La Castanya (FKT 0.71, Fig. 9a), Tona (FKT 0.71, Fig. 9c), Els Guiamets (FKT 0.79, Fig. 10b), or Ponts (FKT 0.8, Fig. 9d). This correlation pattern is permanent over time and suggests that O3 levels in rural areas may be influenced by the Vic and Berga Plains.

A possible answer in line with this behavior is that O3 generated in Catalan suburban areas, characterized by intense economic activities, spread throughout the entire autonomous community, including remote districts such as Bellver de Cerdanya, Pardines, or Santa Pau, which are among many local tourists’ preferred destinations when it comes to leisure and relaxation activities as they are considered the healthiest gateways near Barcelona (Prat and Cànoves 2014; Hernández et al. 2016; Ceballos et al. 2018a, b; Viana et al. 2018; Cors-Iglesias et al. 2020).

Finally, Fig. 11 displays the correlations between non-rural (urban or suburban) sites. The results are very exemplifying. In line with the air mass recirculation pattern outlined in the preceding sections, we find relatively high FKTs between distant sites. For example, a FKT value of 0.71 was found between Mataró and Reus, two urban areas situated at a distance of more than 100 km from each other. In contrast, the correlation between areas within an easy reach can be significantly low. This is the case of Mataró and Montcada i Reixac, two sites with a FKT below − 0.07 and less than 23 km away from each other. These patterns might be caused by the circulatory effects that take place mostly within April–August especially when the wind blows.

Fig. 11
figure 11

FKT between urban and suburban areas of CC and stations with the highest and lowest correlations (2013–2020)

Kendall functional correlations analysis for PM10

Figure 12ad display similarities and dissimilarities, measured by FKT, relative to PM10 pollution patterns in various Catalan municipalities during the period 2013–2020. Air quality stations have been grouped into three categories: urban, suburban, and rural areas.

Fig. 12
figure 12

ad FKT between stations of CC with highest and lowest correlations for PM10 (2013–2020)

Just like in the previous figures, the left panel shows at the top the reference air quality station and its highest and lowest FKTs with the remaining stations, whereas the right panel displays a map to locate each air quality station. Regarding traffic stations located at urban sites within the CC, one finds that most of air quality stations with high FKT values are located far away from each other. Such is the case of Santa Perpetua-Manresa (FKT 0.8, see Fig. 12a), 40.91 km; Montcada i Reixac-Reus (FKT 0.8, see Fig. 12b), 96.5 km; Sant Vicenç dels Horts-Reus (FKT 0.8), 78.58 km; and Bellver de Cerdanya-Barcelona Eixample (FKT 0.86, see Fig. 12c), 113.67 km, with the exception of Bellver de Cerdanya-Berga 30.63 km (FKT 0.79).

Montcada i Reixac is a municipality comprised of 34,802 inhabitants, located within the BMA in a river basin flanked by hills. Two dense highways, a cement plant, and a waste treatment facility are found in the area, which considerably contribute to industrial emissions. Moreover, heavy traffic may exacerbate the effect of pollution sources in the nearby environment (Rovira et al. 2018).

On the other hand, Reus is a city belonging to Tarragona province, located the southwest of Barcelona. It features important chemical and petrochemical industries, an airport, and a growing urban area of 104,373 inhabitants. High air pollution levels in the area are explained by the aforementioned industrial activities in conjunction with the predominant northwest wind that influences the transport of PM10 pollutants (Mari et al. 2016; Mbengue et al. 2017).

The high FKT of 0.8 registered between the previous two sites suggests that PM10 behavior between the years 2013–2020 was very similar. This may be the consequence of several factors: first, both areas’ economic activities bear resemblances; second, urban development in both municipalities is alike; and finally, both are affected by complex air mass recirculation patterns.

Sant Vicenç dels Horts is a municipality based on industrial and retail sector development just like the former two sites, contributing to higher average daily PM10 pollution levels through the years (see Fig. 4). Despite a distance of more than 70 km, the fact that FKT correspondent to Reus and Sant Vicenç dels Horts is 0.8 indicates that pollution behaviors are very alike. However, Montcada i Reixac-Can Sant Joan, a station separated by less than 20 km from Sant Vicenç dels Horts, records a FKT of merely 0.14.

The Bellver de Cerdanya rural station is the only one in the CC that records hourly data during the whole 2013–2020 study period, and it recorded the lowest average daily PM10 levels together with Berga station among all 18 stations considered for this study within the same period (see Fig. 4). For the case of Bellver de Cardanya, low PM10 was probably related to local circulations of air masses in the nearby mountain valleys (Jaén et al. 2021). FKT between Bellver de Cerdanya and Berga is 0.8, relatively high. These results prove that using the FKT statistic on functional data allows the detection of sites with similar pollution patterns in the long run, enabling the accurate implementation of specific policies at a local level.

Discussion

Increasing O3 and PM10 pollution showcases an issue of concern in Catalonia as it jeopardizes citizens’ health and the environment (Schembari et al. 2014; Duran et al. 2017; Gamarra et al. 2020; Pierangeli et al. 2020; Rovira et al. 2020). This article focuses on the link between local pollution generated in metropolitan agglomerations and rural areas within the CC. Having observed massive concentrations of O3 and PM10 in the most peripheral regions of the CC, the article uses functional data in conjunction with an innovative technique by Valencia et al. (2019) to assess the relationship between these two pollutants’ levels in urban, suburban, and rural locations.

According to our results, a high degree of pollution between distant areas is revealed, while on occasions, they suggest very mild correlations between nearby sites. These results prove that the availability of mass data managed through the usage of functional statistics allows the detection of long-term similarities and differences in O3 and PM10 pollution levels between regions.

Former research featuring molecular organic analysis exhibited higher concentrations of precursors for the formation of ozone at the traffic stations of Barcelona (such as anthropogenic NO2, which is also an indicator for traffic emissions) compared to rural areas (Jaén et al. 2021). This phenomenon in conjunction with wind direction increases the likelihood of high O3 records in rural areas in comparison with those sites where ozone precursors are generated. Our findings are consistent with the peculiar redistribution pattern of O3 particles from urban towards rural areas, and vice versa. In an earlier paper, Gangoiti et al. (2001) found that the local breeze conditions, which are common in summer, contributes to background ozone and circulations of pollutants. Such is the case of Bellver de Cerdanya, where many wind trajectories in this locality passed through Barcelona in the previous hours via Llobregat River Valley as it is outlined in Fig. 12a and b (Jaén et al. 2021). These air mass recirculation dynamics support some of the results outlined in this article. For instance, Bellver de Cerdanya, one of the prominent rural stations considered in the paper, records O3 maximum values about 1 h after peak O3 levels are reached at Barcelona Ciudatella, a site located in an urban area where heavy traffic and economic activities are common during the daytime. The delay in the arrival of ozone-loaded air masses produced during the transport of pollution from Barcelona city could explain this phenomenon (Jaén et al. 2021). The succession of the peaks obeys to the typical wind movements in Catalonia, as can be observed in Fig. 13.

Fig. 13
figure 13

a, b Average hourly O3 levels in an urban and a rural district in CC (spring and summer, 2018–2019)

According to the results, in general terms, Vic and Berga Plains on the one hand, and Reus, Amposta, and Constantí on the other hand (these latter three embedded in a crucial industrial district within the CC), are regions of influence relative to O3 concentration. It is suitable to embed this statement within former research that analyzes the interplay between pollution and the CC’s climatological and orographic features. The geographic layout of Pardines, Bellver de Cerdanya, or that of the Montsec Range explains their high O3 records. The arrival of air masses rich in O3 and its precursors from BMA can be one of the causes of such records, as these air masses are then transported by mesoscale air flows across the valleys connecting the coast to the Pyrenees. When it comes to the rural stations located in the south of Catalonia (Gandesa, Els Guimets, La Senia, and Alcover), the level of O3 could be attributed to the inflow of high VOC concentrations from industries located in Tarragona (Felipe-Sotelo et al. 2006).

This study offers proof that distance plays a non-linear and complex role relative to O3 concentrations by generating a pattern of correlations between a wide range of air quality stations located within the CC. Relatively remote urban and rural areas showcase high correlations. In contrast, correlations observed between nearby regions were only moderate, or even non-existent. The profile is attributed to complex air recirculation dynamics influenced by wind flows.

Furthermore, our PM10 pollution analysis revealed that very distant sites record high FKT values and hence show similar features. This phenomenon can be attributed to similarities in population mobility, economic activities, and the influence of air mass circulation patterns (Jaén et al. 2021). For instance, air masses formed in North Africa may carry high PM10 concentrations to the CC, commonly known as Saharan dust episodes. Such is the case of Santa Perpètua de Mogoda or Manresa, both located along the winding paths that arrive at Manresa station between 6:00 and 18:00 h. These wind paths—constructed by recording hourly wind patterns from 2013 to 2020 at climate monitoring station Pont de Vilomara, which lies close to Manresa air quality station—allow us to observe that this area registers winds coming from North Africa in almost 45% of the cases. On the other hand, local air masses could form complex paths transporting PM10 concentrations up to Reus from distant sites like Montcada i Reixac (Jaén et al. 2021). These air mass movements produced at southern coastal regions follow a path that passes through the urban sites of Barcelona city, rise towards northern Manlleu, and then follow a downward route towards the Amposta area. Throughout this complex trajectory, intermediate sites, such as Reus, could be influenced by the winds circulating along the northern areas, which may bring about high PM10 levels. In this sense, it is worth mentioning that almost 30% of the predominant winds in the Reus area come from the north (see Fig. 14b).

Fig. 14
figure 14

Wind paths were arriving at Manresa (a) and Reus (b) air quality stations from 6:00 to 18:00 (2013–2020)

Prior research on PM10 at Bellver de Cerdanya station found that pollution levels are low compared to other air monitoring stations located at urban and suburban sites. These results were probably related to local circulations of air masses in the nearby mountain valleys (Jaén et al. 2021). Judging by the relatively high FKT value recorded between Bellver de Cerdanya and Berga (0.8), one can deduce that the behavior of PM10 pollution in the Berga region is very similar to that of Bellver de Cerdanya (as observed in Fig. 4). The opposite pattern can be outlined between Bellver de Cerdanya and Montcada i Reixac, Can Sant Joan (FKT of − 0.07), suggesting that PM10 levels between these two stations are not related. These results prove that the use of the FKT statistic on functional data allows the detection of sites with similar/dissimilar pollution patterns in the long run, enabling the accurate implementation of specific policies at a local level, not just for one single pollutant, but for any pollutant of interest in the long term.

Conclusions

The process of continuous atmospheric pollution can only be modeled using continuous functions with specific features. This is why functional data coupled with Kendall’s functional tau (FKT) was employed throughout this article to analyze the link between O3 and PM10 pollution between urban, suburban, and rural sites located in the Spanish Community of Catalonia. Despite its potential and popularity in other areas, the application of the FKT method in the analysis of pollution levels among areas is novel. To the best of our knowledge, the present paper is the first one to apply it using data from Catalonia.

The results obtained indicate the existence of complex and non-linear relation between these geographical sites influenced by wind patterns and orography in the CC. In the first place, relatively high FKT coefficients were registered between outlying urban and rural areas regarding O3, suggestive of similar tropospheric ozone pollution patterns between distant air quality stations. Second, there is a lack of proportionality concerning FKT coefficients and distance between sites. In many cases, distant urban and rural sites exhibited high functional correlation, while some adjacent areas featured low correlations. This phenomenon should be attributed to complex recirculation patterns within the CC. Even if our estimates cannot be interpreted as casual effects, the likelihood that the regions of Reus, Amposta, and Constantí (located in Tarragona province) and the Vic and Berga Plains (an area of ​intense economic activity in the north of the BMA) exert strong influences on O3 concentrations at rural sites is high. Moreover, regional wind flows may exacerbate these effects.

As for PM10, high pollution levels at a local level with a positive and high FKT values can be explained by several factors: first, both areas’ economic activities bear resemblances; second, urban development in both municipalities is alike; and finally, both are affected by complex air mass recirculation patterns.

The paper was embedded in a functional context, an approach that has rarely been employed. The FKT method provides three main advantages when compared to other quantitative approaches: it is seldom affected by outliers; it does not assume normality for the data of interest; and it can record non-linear relationships. As pollution concentrations, our variable of interest, tend to display a high degree of dispersion over seasons and time, FKT’s features are remarkably convenient.

As a handicap, the article does not capture causal effects. The assessment of the impact exerted by a specific air station upon the remaining would demand finding an exogenous source of change in O3 and PM10 levels for each station. Nonetheless, tracing external shocks that finally impact O3 and PM10 levels in a specific area without affecting the rest of sites is very difficult. Furthermore, it is non-viable to perform this approach given the many areas considered in this article. An additional handicap is the lack of control for orographic and demographic variables in this paper. Arguably, population density, geographical position, transportation, and economic activities may, among other factors, exert a considerable influence on O3 and PM10 levels. More precise correlations between several sites would be registered by considering such factors.

Our results provide two implications for environmental policymaking. In the first place, policymakers should focus on the entire CC when mitigating the adverse effects of pollution, not just Barcelona and Tarragona cities. This implication is based on rural sites’ high dependence on polluted air masses generated in urban regions. Secondly, geographical distance is only mildly related to correlations relative to O3 and PM10 levels between regions. Consequently, it may be misleading to use geographical distance to Barcelona or Tarragona cities as a threshold of expected pollution concentrations and peaks at rural sites. The major contribution of the article is the provision of a set of correlations between areas with diverse features within the CC and an outline of empirical air circulation patterns. Public policies whose goal is the mitigation of O3 and PM10 levels among the CC’s regions in an equitable way should consider these phenomena in the future.