Introduction

Emissions data is an important part of air quality modelling and plays a central role in the accuracy and credibility of modelling results. Incomplete datasets or inaccurate estimates can significantly impair our ability to evaluate the impact of emission sources on air quality, or perform a detailed analysis of emission scenarios (Davidson and Kanter 2014; Quilcaille et al. 2018). There will always be a need to output reliable, accurate and up-to-date emissions inventories for air quality studies. In addition, knowing the uncertainty in emissions and measured air quality data is key to understanding the level of confidence in the results, and has been widely discussed in the scientific community (Lindley et al. 2000; Winiwarter and Rypdal 2001; Zheng et al. 2009, 2017; Milne et al. 2014; La Notte et al. 2018; Pisoni et al. 2018). By minimising uncertainty, we can have reliable sources of data, which will in turn allow for a more consistent and accurate input for policy makers to make the best decisions. The EMEP Centre on Emission Inventories and Projections (CEIP, ceip.at) is tasked with collecting data related to the Convention on Long-Range Transboundary Air Pollution (CLRTAP), which requires qualified scientific information focusing on three main activities: collection of emissions data, atmospheric and precipitation measurements and air quality modelling. Since various assumptions have to be made, there are numerous sources of uncertainty when estimating data using emissions models, including activity data of the polluting activity or incorrect emission factors from polluting sources (Briggs 1995; Pacyna and Graedel 1995; Zachariadis and Samaras 1997). For spatially resolved inventories, there is an additional layer of uncertainty due to the need of including the spatial distribution of the emissions. The task of calculating uncertainties when faced with a large number of different methods, estimates and models can be very complex (Pacyna and Graedel 1995; Mobley and Saeger 1996), as such, having any type of data associated with uncertainties is a valuable asset.

Currently, the most widely used emissions inventory is EMEP, which has recently updated its horizontal resolution, from 0.5° × 0.5° to 0.1° × 0.1°. This study aims to assess the main differences in both the emissions inventory data that are used as input to an air quality model (spatial distribution and emission values) as well as air quality modelling results, to evaluate the accuracy of each inventory. Since 2015 is the common and updated year between the inventories, simulations were made for summer and winter months, June and December 2016, using these emissions.

There are other emissions inventories with data available for Europe, such as EDGAR, EPRTR and TNO. These are independent from the EMEP database and can have sources of data, namely, a global dataset (EDGAR, http://edgar.jrc.ec.europa.eu/overview.php), a database of European Pollutant Release and Transfer Register (EPRTR, http://prtr.ec.europa.eu) or officially reported data together with model and expert estimates (TNO, Kuenen et al. 2014).

In Europe, each country is responsible for reporting and building their own inventory according to the EMEP guideline. As different countries applied different methodologies to build the new inventory, a detailed analysis is only possible on a country-by-country basis. This study will review the spatial distribution and differences for Europe but focus on Portugal for a more detailed analysis for total values and air quality simulations.

The paper is organised as follows: in “The EMEP emission inventory,” the analysed emission inventories are described in detail and a comparison of emissions data is performed. In “Air quality modelling setup,” the modelling setup is presented. In “Analysis of air quality modelling results,” results from the air quality simulations are compared and quantified. “Model validation” is dedicated to the validation of the model and statistical analysis of the simulations. Finally, in “Conclusions,” the main conclusions are summarised.

The EMEP emission inventory

The old (0.5° × 0.5°) and new (0.1° × 0.1°) EMEP inventories

While EMEP (emep.int) collects data to fulfil the goals of the LRTAP convention, emissions inventories and projections are managed by the EMEP Task Force on Emission Inventories and Projections (TFEIP). Reported emissions and projections of acidifying air pollutants, heavy metals, particulate matter and photochemical oxidants are collected by CEIP. Recently, a new version of the EMEP inventory was made available, with a 0.1° × 0.1° grid (E01), which is a horizontal resolution upgrade compared to the older 0.5° × 0.5° version (E05). A significant difference between the two emission inventories should be highlighted, which concerns the inclusion and not inclusion of the (industrial) point. While E05 includes them in the cell value of their corresponding grid position, E01 provides these emissions in a separate file and does not include them in the gridded cell value. In Portugal, the Portuguese Environment Agency (APA) is the one responsible for compiling the data and building the EMEP inventory for the country. To build the new inventory, the same methodology as previous versions of E05 was applied to estimating the emissions, then the gridded emissions were downscaled to match the 0.1° × 0.1° resolution.

Analysis per sector and pollutant

To understand how the different resolutions affect emission values, Figs. 1 and 2 show the inventory maps regarding their spatial differences (E01–E05) and absolute values, respectively. The data were plotted for the most important pollutants and specific sectors (transport, industry, residential combustion and agriculture).

Fig. 1
figure 1

Emissions delta, in t y−1, between E01 (0.1° × 0.1°) and E05 (0.5° × 0.5°), for NO2, PM10 and NH3 and sectors S2, S3/S4, S7/S8 and S10 (green E05 > E01, red E01 > E05)

Fig. 2
figure 2

Total emissions (t y−1), for Portugal, of each inventory and each of the studied sector/pollutant pairs

First, the different deltas found over Europe can be justified by the different methodologies applied by each Member State for the updated EMEP inventory. There are countries where the deltas are mostly positive and others where it is negative. Additionally, there is the case of Turkey and the North African region, where the deltas could indicate missing data.

Nevertheless, differences in NO2 emissions from the transport sectors (S7/S8) are largely focused in urban areas and shipping routes, while industrial sources (S3/S4) are concentrated in and around cities. Due to the more coarse resolution of E05, the E01 emission values are consistently higher in the centre of these areas, and lower further away from the centre. These hotspots are where the difference between the resolutions of the emissions data is most evident. Transport emission deltas can be over 1000 t y−1 in shipping routes, especially in the Mediterranean, and in metropolitan areas throughout Europe. Industrial emissions deltas are mainly focused in and near urban areas, which is generally the location of large industrial installations, with deltas in the same order of magnitude of S7/S8, which can represent approximately 75% of a grid cell value in this sector. Regarding PM10, the same conclusions from the NO2 industrial sector emissions can be applied. The differences are also focused in and around urban areas, which is to be expected from the residential combustion sector (S2), with deltas over 1000 t y−1. The agriculture sector (S10) is where the spatial differences cover the biggest area, throughout almost every country in Europe, with large hotspots located over vast agricultural areas. The largest deltas for this sector are also over 1000 t y−1. Although these differences are similar in absolute values, in relative terms, the differences in these sectors can represent up to 44% (S7/S8), 75% (S3/S4), 82% (S2) and 97% (s10) of the total emissions values in a grid cell, due to initial resolution differences alone.

It is worth to note that for the studied pollutants the largest differences in the spatial distribution and in values do not occur in Portugal due to overall emissions being lower. Nonetheless, the deltas in Portugal are still significant, approximately 500 t y−1 in some sectors, such as agriculture (S10).

In Portugal, there are noticeable differences between the inventories when it comes to total and per sector values, especially in urban areas, with lower values registered for E01 when compared to E05.

Figure 2 shows the total emissions of each inventory for the aforementioned sectors and pollutants, for Portugal.

Emission values present a negligible difference for the transport sectors (S7/S8), with noticeable differences for residential combustion (S2: PM10, 0.2 × 104 t y−1) and agriculture (S10: NH3, 0.2 × 104 t y−1). The largest difference is for industrial sources (S3/S4), where E01 has 0.7 × 104 t y−1 lower NOx emissions compared to E05, which could be due to the updated inventory (E01) not having point sources included in the gridded emissions. Even if these differences are small compared to total values, they may affect air quality simulations and the accuracy of modelled data, which is discussed in the following sections.

In terms of absolute values for Portugal, there are small differences between the totals of both inventories. These results confirm that the main differences between the two inventories, E01 and E05, will be associated to the spatial disaggregation, and not an update on emission values.

Air quality modelling setup

To evaluate the accuracy of each emissions inventory, an air quality modelling system was applied with high-resolution simulations, using each of the inventories, and the resulting pollutant concentrations were compared between them and validated using observations.

To perform this evaluation, the WRF-CHIMERE modelling setup was used. WRF is the Weather Research & Forecasting model, developed by the National Center for Atmospheric Research (NCAR) and is a mesoscale numerical model (Skamarock and Klemp 2008). CHIMERE is a chemistry transport model, in a non-hydrostatic configuration, with nesting capabilities, combining both high grid resolutions and the representation of large-scale transport processes and long-term simulations for emissions control scenarios (Menut et al. 2013; Mailler et al. 2017). This system has been extensively applied for Europe, and Portugal in particular (Monteiro et al. 2007, 2018; Borrego et al. 2011), and is used for daily operational air quality forecast (http://previsao-qar.web.ua.pt/).

The emissions data is pre-processed with the emiSURF programme which generates anthropogenic surface emissions data for CHIMERE air quality simulations. This programme reads the annual inventory and performs a spatial allocation of surface emissions based on land use data. Then, the annual data is distributed into the 12 months of the year based on seasonal factors, followed by a second temporal allocation according to the day of the week, and finally a 24-h profile for each day of the week. This is done for each pollutant from each emission sector.

The numerical air quality simulations were performed using three domains, using nesting capabilities, to obtain high-resolution simulations for Portugal. The first and largest domain encompasses the majority of Europe at a low horizontal resolution of 27 × 27 km2 (CONT27), followed by an intermediate resolution of 9 × 9 km2 (IP09) covering the Iberian Peninsula. The smallest and highest resolution domain is focused on Portugal, with a 3 × 3 km2 (PT03) horizontal resolution.

Further details regarding the model setup and the simulations performed, such as vertical resolution, parametrizations and boundary conditions, can be found summarised in Table 1.

Table 1 WRF and CHIMERE specifications

Due to computational limitations and the extended time required to run the modelling setup, two months were chosen to perform the study, one in the summer (June) and another in the winter (December). Different seasons typically have characteristic main emission sources and distinct synoptic conditions, which result in different air quality issues. During winter, for example, residential wood combustion is an important source of atmospheric pollutants (Carvalho et al. 2009). The combination of greater PM emissions during winter, in urban areas, with thinner and more stable atmospheric boundary layers results in higher monthly mean PM concentrations in winter than in summer (Gama et al. 2018). During summer, the typical anticyclonic conditions with associated high air temperatures and high irradiation favour the occurrence of photochemical pollution episodes, with high ozone concentrations (Borrego et al. 2016).

Analysis of air quality modelling results

Spatial differences

Figure 3 shows the spatial distribution of the differences (mean and maximum deltas, E01–E05) found between the hourly simulations made with each of the inventories, for the European domain and for the studied pollutants (NO2, PM10 and O3). NH3 was not considered in the following sections due to observations for this pollutant not being available. The results for this section consider the entire simulation period (June + December).

Fig. 3
figure 3

Mean deltas (left) and maximum deltas (right) between the simulations, E01–E05, for NO2, PM10 and O3 concentrations, for the European domain (CONT27)

Over the European domain, the largest deltas for NO2, PM10 and O3 are over international shipping routes and metropolitan areas. Maximum deltas are above 80 μg m−3 for NO2, 35 μg m−3 for PM10 and 80 μg m−3 for O3. Regarding the mean differences, they reach 25 μg −3 for NO2, 7 μg.m−3 for PM10 and 25 μg.m−3 for O3. These mean deltas are of the same order of magnitude as the annual average concentrations observed in these areas (European Environment Agency 2017), which means that the deltas found between the two EMEP inventories can be highly significant.

Figure 4 shows a similar analysis for the Portugal domain.

Fig. 4
figure 4

Mean deltas (left) and maximum deltas (right) between the simulations, E01–E05, for NO2, PM10 and O3 concentrations, for the Portuguese domain (PT03)

Focusing on Portugal, the main conclusions addressed for Europe still apply. Regarding NO2, the areas with the highest differences are cities and shipping routes (including some ports). Maximum differences for NO2 are above 45 μg m−3, and mean differences are in the 10 μg m−3 range, over the Porto and Lisbon metropolitan areas (where the magnitude of NO2 mean levels is around 10–50 μg m−3, see Fig. 4). PM10 deltas have a similar distribution to NO2 although mostly concentrated in larger urban areas, where the maximum differences are over 36 μg m−3 and average 6 μg m−3 (observations are in the range of 10–30 μg m−3). O3 deltas located away from NO2 hotspots are due to the secondary photochemical origin of O3, with average values of 16 μg m−3 and maximums of 54 μg m−3 (observed maximum values are around 25–120 μg m−3).

Model validation

To better evaluate and quantify the performance of each inventory, the simulated results were compared to the observations data from the Portuguese air quality monitoring network. Since there are numerous stations throughout the country, an average value for each station type was considered to aid in the comparison of the results obtained with each inventory and is shown in Figs. 6, 7 and 8. Only background stations (urban, suburban and rural; Fig. 5) are considered due to their concentration values not being significantly influenced by any specific source, but rather a combination of all upwind sources in the surrounding areas. Daily averages (NO2 and PM10) and daily maximum values (O3) were considered. Later in this section, the calculated statistical parameters are shown.

Fig. 5
figure 5

Background air quality monitoring station locations considered for the study (urban, green squares; suburban, blue triangles; rural, red circles)

The time series analysis shows an overall good model performance, with NO2 and O3 having the best match between observed and modelled values, although there is an overestimation of O3, particularly in suburban stations (Fig. 8). In the case of PM10, the model underperforms when simulating PM10 concentrations (Fig. 8), which is to be expected. Many studies have recognised the difficulty in modelling this pollutant (Matthias 2008; Pay et al. 2010). As summarised by Basart et al. (2012), the underestimation of PM10 may be related with the lack of fugitive dust emissions and resuspended matter, as well as sources that are inaccurate or not considered in the emission inventory. Regarding the inventories, similarities are found for every pollutant. Nevertheless, there is evidence of higher accuracy when using the coarse E05 inventory in urban and rural observations, which was not expected since E01 should better represent urban land use due to its higher resolution. However, as previously mentioned, the new grid was only downscaled from the older, more coarse, E05 gridded emissions. Nonetheless, in suburban stations, E01 achieves better results.

Fig. 6
figure 6

Daily average NO2 time series per air quality monitoring station type (E05 in orange, E01 in indigo and observed data in light blue circles)

Fig. 7
figure 7

Daily average PM10 time series per air quality monitoring station type (E05 in orange, E01 in indigo and observed data in light blue circles)

Fig. 8
figure 8

O3 maximum daily time series per air quality monitoring station type (E05 in orange, E01 in indigo and observed data in light blue circles)

To better understand and compare the different results from the simulations, statistical parameters were calculated to evaluate the accuracy of the model when using each of the inventories. Statistical metrics were chosen and calculated according to Borrego et al. (2008) (Table 2).

Table 2 Statistical model of quality indicators used in this study

The correlation coefficient, R, measures the strength of the linear relationship between predicted and observed concentrations. However, as it is insensitive to either an additive or a multiplicative factor, an R value of 1.0 is not a sufficient condition for a model to be considered accurate. The index of agreement (IOA) proposed by Willmott (1981), can be seen as an alternative to R; although it is not a measure of correlation in its formal sense, it reflects the degree of accuracy of the estimated variable. Unlike R, IOA can detect additive and proportional differences in the observed and simulated means and variances. A model that would show a perfect agreement with the observations would have an IOA of 1.0. The root mean square error (RMSE) provides the mean relative scatter of the modelled values, indicating both systematic and random errors. Mean relative bias indicates only systematic errors; the calculated value of this metric depends only on the average of the predicted and observed concentrations. Chang and Hanna (2004) highlight that it is possible for a model to have predictions completely out of phase of observations and still have an ideal bias value (0.0), due to cancelling errors.

These metrics were calculated for the entire study period and the results are presented in Table 3, organised in terms of average values per station type and for each of the EMEP inventories and pollutants. Hourly data for NO2 and O3 was considered, and so were daily averages for PM10.

Table 3 Average statistical parameters for each type of air quality monitoring stations and each EMEP inventory

Overall, the analysis of the statistical parameters confirms that E05 has a higher accuracy in terms of air quality modelling results, which means a higher resolution in the input data does not necessarily result in a better representation of the emissions data. While there were no significant differences found in the correlation factor, because the same time profiles are applied for both inventories, for the RMSE and bias, large differences are found for urban and suburban stations.

Conclusions

The specification of the EMEP emissions inventory has changed, from grid cells of 0.5° × 0.5° to a higher resolution grid (0.1° × 0.1°). Member States had to prepare this new grid according to EU requirements under the CLRTAP protocol, each using their own methodology. The purpose of this study was to investigate the main differences found between these two emissions inventories for Europe, focusing on Portugal as a case study, using the available common year (2015). Emission deltas were analysed per sector and pollutant, and both inventories were used for air quality modelling applications (using an already extensively validated air quality modelling system). The results found and highlighted major differences in several MS. Portugal was not one of the hotspots in the emissions delta maps but there were still significant differences found, mainly in the spatial distribution of shipping and agriculture emissions, as well as PM10 and NOx values. The analysis of the air quality modelling outputs, and their comparison with observed values, indicates that EMEP0.1 does not improve model performance over Portugal, suggesting that the higher resolution inventory was not built using the most appropriate methodology. This work highlights the importance in estimating the uncertainty associated to emissions data, and confirms what other studies have already pointed out, that improving emissions data resolution does not necessarily imply higher accuracy in modelling results, in particular for air quality modelling purposes. The authors recommend that each MS perform a similar study, to analyse their specific methodology and emissions data, before using the new high-resolution EMEP inventory.