Introduction

After rapid industrialization and urbanization for more than four decades, China, in recent years, is facing a series of ecological and environmental challenges, such as land/soil degradation (Zhou et al. 2013), landscape fragmentation (Cui et al. 2019), biodiversity losses (Hou et al. 2014), water eutrophication (Malekmohammadi and Jahanishakib 2017), and urban air pollution (Huang et al. 2014). Since 2013, severe urban air pollution has started to occur extensively in many Chinese cities, especially in the rapid urbanization areas (Fu and Chen 2017), such as the Beijing-Tianjin-Hebei urban agglomeration (Zhang et al. 2018), the Yangtze River Delta urban agglomerations (Han and Ma 2020), the Pearl River Delta urban agglomerations (Li et al. 2018), and the Yellow River Basin (Chen et al. 2020).

Urban air pollution in Chinese cities is mainly manifested in the form of haze event, which is a weather phenomenon with a visibility of less than 10 km resulting from the dense accumulation of fine aerosol particles (Huang et al. 2014; Tao et al. 2012), not only affects people’s daily life and physical health, but also causes enormous economic losses and social instability. It contributes negatively to regional climate conditions (Fu and Chen 2017), visibility (Shen et al. 2015), agricultural production (Zhou et al. 2018b), and public health (Song et al. 2019). Currently, urban air pollution has become a great environmental hazard and caused widespread social concerns in China (Braithwaite et al. 2019; Liu et al. 2017a; Song et al. 2017; Wu et al. 2017; Wu et al. 2020; Wu et al. 2021; Zhang et al. 2015; Zhou et al. 2018a).

Previous studies on urban air pollution have mainly focused on sources and types of air pollutants (e.g., PM2.5, PM10, NO2, SO2.) (Fu and Chen 2017), temporal and spatial variation of pollutants (Song et al. 2017), factors and formation mechanism of air pollution (Zhan et al. 2018), the relationship between human activities and air pollution (Li et al. 2020), and health damage caused by urban air pollution (Song et al. 2019). Studies find that urban air pollution usually resulted from a combination of high levels of air pollutant emissions and adverse meteorological conditions at the same time (Fu and Chen 2017). In terms of the sources of air pollutants and the process of pollutants transmission, the factors contributing to the formation of urban air pollution can be divided into two categories: (1) emissions of air pollutants from human activities, such as industrial exhaust, vehicle exhaust, coal, and dust; and (2) adverse climatic conditions caused by special landforms and unreasonable spatial structure of land use. Furthermore, Fu and Chen (2017) suggested that over 70 particulate matter sources have been identified, ranging from natural to anthropogenic sources. Chen et al. (2015) explored the spatial variations of air pollutants and air quality of Beijing at multiple temporal scales such as daily, weekly, and monthly. Cheng et al. (2017) discovered that there was a spillover effect in the time and space dimensions of the urban air pollution in China. Other studies also suggested that urban air pollution in Chinese cities were mainly a result of the interactions between natural (e.g., climate, topography, etc.) and socio-economic (e.g., industry structure, population, urbanization, and ventilation corridors) factors (e.g., Dong et al. 2019; Hao and Liu 2016).

Although the existing studies have made considerable achievements on this topic, they are not without limitations. First, regarding the selection of influencing factors, the existing studies mainly relied on subjective judgment that usually does not consider the causal relationship between urban air pollution and its influencing factors (Lin and Zhu 2018). Niemeijer and de Groot (2007) eloquently stated that a conceptual framework should play an important role in the processes of selecting influencing factors and developing a consistent influencing factor set. This is especially true in the case of assessing the urban air pollution risks and identifying their key influencing factors ranging from natural to human dimensions. Second, the quantitative methods employed for analyzing the influencing factors of urban air pollution employed by the existing studies mainly included factor analysis (Zhang et al. 2018), dynamic factor analysis (DFA) (Yu et al. 2015), principal component analysis (Pandey et al. 2014), extreme boundary analysis (Wang and Chen 2016), spatial Durbin model (Liu et al. 2017a), spatial lag model and spatial error model (Hao and Liu 2016), and land use regression (Huang et al. 2017). These statistical tools were effective in identifying the influence of individual factors on urban air pollution; however, most of them were global models that ignored the local effects of the influencing factors and interactive effects of multiple factors (Zhan et al. 2018). The geographical detector model proposed by Wang et al. (2010) was intended to address these problems by dividing the study area into a few subareas to identify the spatially stratified heterogeneity and the factors that are responsible for such spatial heterogeneity. The geographical detector was used in Zhou et al. (2018a) and Ding et al. (2019) to examine the effects of socio-economic development on PM2.5 in China. Furthermore, the geographically weighted regression (GWR), a local form of spatial analysis tool, was also widely used to explore the local spatial relationship between the explanatory variables and the response variable (You et al. 2016) as the regression coefficients in the GWR vary with locations (Zeng et al. 2016).

Therefore, our study aims to develop an index for the urban air pollution occurrence and to employ spatial analysis tools to answer the following questions:

  1. (1)

    What is the frequency and spatial pattern of urban air pollution occurrence in a certain period (a year, a season, or a month) in China?

  2. (2)

    What are the influencing factors of urban air pollution and how do they interact with each other?

  3. (3)

    What are the positive and negative effects and the local effects of the influencing factors on urban air pollution?

  4. (4)

    What measures should we take to alleviate the occurrence of urban air pollution, given that it is impossible to completely eradicate the emissions of air pollutants in China in a short period?

The main innovations of this paper are: (1) developing a Driver-Pressure-State-Impact-Response (DPSIR) framework to qualitatively elucidate the complex causal relationship between urban air pollution and their influencing factors and to avoid the subjective judgment in the process of selecting the influencing factors, and (2) employing the geographical detector and the geographically weighted regression model to quantitatively detect the effect of the influencing factors’ interactions on urban air pollution and the local effects of the influencing factors on urban air pollution, respectively.

The paper is organized as follows: the “Materials and methods” section describes the materials and methods; the “Results” section presents the findings of our study; the “Discussion” section discusses the complexity of the influencing factors of urban air pollution and proposes five paths to alleviating the occurrence of urban air pollution in China. Conclusions are given in the “Conclusions” section. The technical flowchart of this study is shown in Fig. 1.

Fig. 1
figure 1

The technical flowchart of this study. Note: DPSIR, Driver-Pressure-State-Impact-Response

Materials and methods

Developing the DPSIR-based framework for urban air pollution

Proposed by the European Environmental Agency (EEA), the DPSIR framework is an improvement over the Pressure-State-Response (PSR) framework. In recent years, the DPSIR framework has been widely applied in sustainability assessment (Maurya et al. 2020), eco-environmental quality assessment (Chen et al. 2019), soil quality risk assessment (Zhou et al. 2013), land use assessment (Potschin 2009), wetland resources assessment (Malekmohammadi and Jahanishakib 2017), ecological security assessment (Gari et al. 2015), biodiversity risk assessment (Hou et al. 2014), and industrial economic sustainability assessment (Liu et al. 2018). The DPSIR framework is mainly used for identifying and evaluating environmental problems that are generally complex.

The DPSIR framework describes that the natural factors and/or socio-economic factors act as the driver (underlying drivers) and underpin the pressure (direct or proximate drivers—for example, secondary industrial development in this study) to change the environment (state, e.g., urban air quality), which further affects the socio-economic condition of the region of interest (impact). Finally, this complex situation is responded by the government or society through different initiatives (response) to reduce the negative impacts or to encourage the positive impacts (Potschin 2009; Qu et al. 2020).

On the one hand, the DPSIR framework provides a systematic mechanism to monitor the status of an environment or system, to describe the system’s dynamics, and to express the coupling relationship among various influencing factors (Rapport and Friend 1979). Within the DPSIR framework, feedback may be provided to policymakers based on environmental quality and/or the resulting impact of policies made or to be made in the future (Bidone and Lacerda 2004). On the other hand, establishing the DPSIR framework under certain circumstances is a complex task because various cause-effect relationships must be carefully examined, and the environmental changes are rarely attributed to a single cause (Feld et al. 2010).

In this study, the DPSIR framework was used to elucidate the complex causal relationship between urban air pollution and its influencing factors by avoiding subjective factor selection. As illustrated in Fig. 2, by considering the complex relationship between urban air pollution and its influencing factors, we employed the DPSIR framework to construct an index system for assessing the influencing factors of urban air pollution (Table 1): (1) the “driver” factors include climate, topography, landform, and socio-economic activities that have positive or negative influences on urban air quality; (2) the “pressure” factors include urban spread, population growth, and energy consumption; (3) the driver and pressure factors contribute collectively to the change of urban air quality (i.e., “state”); (4) urban air pollution imposes negative “impacts” on a living quality, public health, social security, and economic development; and (5) the “response” factors include the actions taken by the government, society, and industrial sectors to mitigate the occurrence or risk of urban air pollution.

Fig. 2
figure 2

The DPSIR framework for the cause-effect chain of urban air pollution and its influencing factors

Table 1 The selected influencing factors of urban air pollution based on the DPSIR framework

In Table 1, the state in the DPSIR framework for our study is the risk of urban air pollution represented by the urban air pollution index, which is the dependent variable in the following influencing factor assessment. The impact factors are the negative effects of urban air pollution on public health and social security, such as respiratory diseases, carcinogenesis, social panic, and psychological illness risk (Braithwaite et al. 2019; Song et al. 2019). Urban air pollution also reduces visibility and increases the number of traffic accidents (Shen et al. 2015). Unfortunately, due to limited data, these impact indicators were not included in the following case study. Therefore, a set of 27 individual indicators were selected to assess the influencing factors of urban air pollution in this study (Table 1).

Developing the urban air pollution index

Based on the air quality index (AQI) (Appendix A), we developed the urban air pollution index (UAPI) to represent the potential that may lead to haze events (urban air pollution) in Chinese cities in a certain period (e.g., annual, seasonal, or monthly) when AQI is at the 3 to 6 levels. The UAPI is defined as follows:

$$ \mathrm{UAPI}=\frac{\sum_{k=3}^6{D}_k}{S_k} $$
(1)

where k is the level of AQI (k = 3,4,5, and 6); Dk is the number of days at the k level; Sk is the total number of days in a certain period (annual, seasonal, or monthly). A higher UAPI value corresponds to more days with bad air quality.

Figure 3 compares the daily characteristics of AQI (Fig. 3a) and the monthly characteristics of UAPI (Fig. 3b) from 2015 to 2019 in China. It is evident that both AQI and UAPI display the U-shape periodic characteristics.

Fig. 3
figure 3

The temporal characteristics of a the air quality index (AQI) and b the urban air pollution index (UAPI) from 2015 to 2019

Spatial autocorrelation analysis

Spatial autocorrelation analysis, which reveals the degree of similarity of the attributes of neighboring geospatial units, includes calculations of the global Moran’ I and the local indicators of spatial association (LISA). While the global Moran’s I quantifies the spatial autocorrelation as a whole, the LISA measures the degree of spatial autocorrelation (spatial clustering) at each specific location by using local Moran’s I (Anselin 1995). The global Moran’ I can be expressed as (Moran 1948):

$$ \mathrm{Global}\ \mathrm{Moran}'\mathrm{s}\ I=\frac{n\sum \limits_{i=1}^{\mathrm{n}}\sum \limits_{j=1}^{\mathrm{n}}{\mathrm{W}}_{ij}\left({x}_i-\overline{x}\right)\left({x}_j-\overline{x}\right)}{\left(n\sum \limits_{i=1}^{\mathrm{n}}\sum \limits_{j=1}^{\mathrm{n}}{\mathrm{W}}_{ij}\right)\sum \limits_{i=1}^n{\left({x}_i-\overline{x}\right)}^2},\left(i\ne j\right) $$
(2)

where xi and xj are the values of variables for evaluation units i and j respectively; i = 1, 2, …, n; j = 1, 2,…, m; n is the number of evaluation units; m represents the number of evaluation units which geographically adjacent to evaluation unit i; ‾x is the average value of x; Wij is a weight parameter for the pair of evaluation units i and j in proximity; when i and j are adjacent, Wij = 1; otherwise, Wij = 0. The Moran’s I ranges from − 1 to 1. When I > 0, it indicates a positive spatial correlation with the spatial pattern being agglomeration distribution; when I < 0, it indicates a negative spatial correlation the spatial pattern being diffusion or uniform distribution; and when I = 0, it indicates no spatial autocorrelation with the spatial pattern being random distribution.

The local Moran’s I is used to identify local spatial clustering and is defined as (Anselin 1995):

$$ \mathrm{Local}\ \mathrm{Moran}'\mathrm{s}\ I=\frac{n\left({x}_i-\overline{x}\right)\sum \limits_{j=1}^m{\mathrm{W}}_{ij}\left({x}_j-\overline{x}\right)}{\sum \limits_{i=1}^n{\left({x}_i-\overline{x}\right)}^2},\left(i\ne j\right) $$
(3)

The LISA has four kinds of local spatial associations: High-high or low-low indicates a spatial positive association or clustering; high-low or low-high represents a spatial negative association or heterogeneity (Li et al. 2014).

Assessment of influencing factors

Geographical detector

The geographical detector, a spatial statistical tool newly proposed by Jinfeng Wang (Wang et al. 2010; Shi et al. 2018), is used to reveal the global effects of the influencing factors of urban air pollution at the global scale in this study. It examines whether the spatial distribution of the dependent variable Y (an event or a phenomenon) and that of the independent variable X (the influencing factors) are likely to be the same (Luo et al. 2019).

The principle of the geographical detector is to divide a spatial region into several subregions with spatially stratified heterogeneity based on influencing factors’ subclasses. The geographical detector is similar to variance analysis (ANOVA) in that it provides feasible means to measure spatially (globally) stratified heterogeneity of the dependent variable Y between subregions (Ding et al. 2019; Peng et al. 2019; Liu et al. 2020). The hypothesis is that if the sum of the variances of the subareas, which are classified by the factor, is less than the variance of the whole region, spatially stratified heterogeneity exists (Wang and Xu 2017; Ding et al. 2019). The geographical detector has the following advantages over the classical statistical methods: (1) as a spatial analysis tool grounded in spatial statistics, it does not require the assumption that the independent or dependent variables must be independently and identically distributed (IID) as generally required by traditional classical statistics (Ding et al. 2019; Wang et al. 2017; Zhou et al. 2018a; Zhao et al. 2020); (2) the geographical detector does not require consideration of the collinearity of multiple independent variables so that any influencing factors can be included in the analysis (Zhou et al. 2018a; Ding et al. 2019; Duan and Tan 2020); (3) the geographical detector has also a unique advantage that can be used to explore the interactions of two influencing factors affecting the dependent variable and to reveal whether the interactions of the two factors are linear or nonlinear (Bai et al. 2019); and (4) the stratified independent variables in the geographical detector can enhance the representation of a sample unit to afford a greater level of statistical accuracy (Duan and Tan 2020).

Therefore, we used the geographical detector to further answer the following three questions (see also Qiao et al. 2019): (1) which influencing factors contribute to urban air pollution? (2) what is the level of contribution from each influencing factor? and (3) whether these factors work independently or interdependently to influence urban air pollution?

The geographical detector consists of four processes (Wang and Xu 2017): factor detection, risk detection, interaction detection, and ecological detection (see Appendix A for more about the geographical detector). The geographical detector employs the q-statistic to measure the explanatory power of the independent variables to the dependent variable. The q-statistic is defined as in Eq. (4).

$$ q=1\frac{\sum \limits_{h=1}^1{N}_h{\sigma}_h^2}{N{\sigma}^2} $$
(4)

where h = 1, …, l is the stratification of the independent variable; Nh and N are the sample size of the h (h = 1, …, l) layer and across all subregions, respectively; σh2 and σ2 are the variances of the dependent variable of the h (h = 1,…, l) layer and across all subregions, respectively. The values of q range from 0 to 1, meaning the independent variables could explain 100×q% of the dependent variable. The greater the value of q is, the stronger the explanatory power of the independent variable to the dependent variable, or vice versa. Because discrete variables are superior to continuous variables in the geographic detector (Wang et al. 2010), the continuous data of independent variables were discretized using the quantile and the natural breakpoint method.

Geographically weighted regression

The geographically weighted regression (GWR) was employed to assess the influencing factors of urban air pollution at the local scale. GWR is a locally weighted least squares linear regression model for spatial analysis (You et al. 2016). The spatial weighting function employed in a GWR model assumes that spatial locations close to each other have similar characteristics than those distant from each other. The GWR model can be expressed as follows:

$$ {y}_i={\beta}_{0\left({\mu}_i{v}_i\right)}+\sum \limits_{j=1}^k{\beta}_{j\left({\mu}_i{v}_i\right)}{x}_{ij}+{\varepsilon}_i $$
(5)

where yi and xij are the observed values of the response variable and the explanatory variables at the location (μi, vi); (μi, vi) is the x–y coordinate of the ith location; βj (μi, vi) is the regression coefficient associated with the jth explanatory variable at location i whose geographical coordinates are (μi, vi); εi is the model error at location i.

For the GWR model defined in Eq. (5), the regression coefficient βj (μi, vi) varies from location to location, which can be estimated using

$$ {\hat{\beta}}_{\left({\mu}_i{v}_i\right)}{\left[{\boldsymbol{X}}^T\boldsymbol{W}\left({\mu}_i,{v}_i\right)\boldsymbol{X}\right]}^{-1}+{\boldsymbol{X}}^T\boldsymbol{W}\left({\mu}_{i,}{v}_i\right)\boldsymbol{Y} $$
(6)

where X is the matrix of the observed explanatory variables; Y is the vector of observed values of the response variable; and W (μi, vi), is the spatial weight matrix, which can be estimated using Gaussian distance function or chi-square distance function. In our study, the Gaussian distance function was used.

Data sources and pre-processing

Due to limitations in data availability, we only tested the assessment of the influencing factors of the urban air pollution in 337 Chinese cities in 2015. The data in this study include three categories: air quality, geographical and meteorological data, and socio-economic data. The air quality data was collected from the Ministry of Environmental Protection (http://www.mee.gov.cn/hjzl/), which mainly included the 24-h average values of six pollutants (PM10, NO2, SO2, PM2.5, O3, and CO). The geographical data, including altitude, temperature, humidity, and precipitation, were collected from the Resource and Environment Data Cloud Platform (http://www.resdc.cn). The meteorological data, including annual mean temperature, annual mean precipitation, and humidity index, were retrieved from 1915 meteorological stations in China, and their spatial interpolation maps were produced in ArcMap 10.5 (ESRI Inc., CA) by using the inverse distance weighted method. Because more than 60% of China is mountainous and the meteorological conditions in mountainous areas are significantly affected by topography, we used the national digital elevation model (DEM) to adjust the temperatures at different elevations at the rate that temperature declines 0.6 °C per 100 m increase. The wind speed data was downloaded from the National Meteorological Information Center (https://data.cma.cn/en). The socio-economic data were obtained from China Urban Construction Statistical Yearbook (National Bureau of Statistics 2016), China Urban Statistical Yearbook (National Bureau of Statistics 2016), and local statistical yearbooks and annual reports. The four seasons are defined as follows: winter (January 1–March 20, December 24–December 31), spring (March 21–June 22), summer (June 23–September 22), and autumn (September 23–December 23).

Results

Spatial-temporal characteristics of UAPI

Annual characteristics

Figure 4 shows that the average of the UAPI in the 337 Chinese cities in 2015 was 0.22. The 337 cities were classified into ten groups with their UAPI values falling into the following intervals [0, 0.1], (0.1, 0.2], …, (0.9, 1.0], respectively. In 2015, about 30 cities (8.9%) had UAPI greater than 0.5.

Fig. 4
figure 4

The urban air pollution index (UAPI) in 337 Chinese cities in 2015

Interestingly, as shown in Fig. 4, the spatial pattern of the UAPI was largely consistent with the population-based Hu Line that generally divides China into east and west regions (Hu et al. 2016). The cities in the eastern region generally had higher UAPI values than the cities in the west region. Similarly, the Yangtze River divides China into north and south regions, and the cities in the north had higher UAPI than the cities in the south. The cities with annual average UAPI greater than 0.5 were mostly concentrated in North China, Central China, Northeast China, and East China, whereas the cities with lower UAPI (< 0.5) were mostly distributed in the west of the Hu Line and the south of the Yangtze River, except for the four cities in west-central Xinjiang and northern Tibet Autonomous Regions.

Seasonal characteristics

Figures 5 and 6 present the seasonal characteristics and spatial patterns of the UAPI in 2015, respectively. Not surprisingly, the winter had the most frequent urban air pollution days in 2015, with an average UAPI of 0.42. In this winter, about 100 cities in North, Central, and Northwest China (Fig. 6d) had UAPI higher than 0.5. In contrast, the summer had the least urban air pollution days in 2015 (Fig. 5b), with an average UAPI of 0.11. Only 12 cities spread in South Xinjiang and North China had UAPI greater than 0.5 (Fig. 6b). In general, there were more urban air pollution days in the autumn (Fig. 6c) than in the spring (Fig. 6a). For example, there were 34 cities in the autumn and 30 cities in the spring that had UAPI higher than 0.5.

Fig. 5
figure 5

Seasonal characteristics of the urban air pollution index in 2015. a Spring. b Summer. c Autumn. d Winter

Fig. 6
figure 6

Spatial patterns of the urban air pollution index in four seasons in 2015. a Spring. b Summer. c Autumn. d Winter

Monthly characteristics

Figure 7 shows the monthly averages of UAPI at the spatial and temporal scales. The UAPI declined continuously from December/January to August, then increased continuously from August to November. In January when the UAPI’s peaked for almost all cities, there were 162 cities with UAPI higher than 0.5. These cities were mainly concentrated in the east of the Hu Line, the north of the Yangtze River, the eastern coastal region, and Northwest China.

Fig. 7
figure 7

Spatial variation of monthly average urban air pollution index (UAPI) in 2015. a January. b February. c March. d April. e May. f June. g July. h August. i September. j October. k November. l December

Spatial analysis

The global Moran’s I analysis of UAPI shows that annual Moran’s I was 0.7652 (p < 0.05) and the seasonal Moran’s Is were as follows (in decreasing order): autumn (0.7098) > spring (0.7036) > winter (0.6646) > summer (0.5337). The results indicate that the UAPI had significant positive spatial autocorrelation at the global scale, which means the occurrence of urban air pollution events such as haze days in one area was positively affected by its adjacent areas.

On the other hand, the LISA analysis shows that the annual and seasonal UAPIs can basically be divided into four or five clusters (Fig. 8, see also Fig. A1 in Appendix A for monthly information). The H-H (hot spot) agglomeration areas were mainly concentrated in North China, Central China, and the southern Xinjiang, whereas the L-L (cold spot) agglomeration areas were mainly located in Southwest China, Southeast China, and the Great Xing-an Mountains of Northeast China. There were a few regions with high values of UAPI surrounded by the regions with low values (H-L, heterogeneity spot), and they were mainly distributed in the southeast of Qinghai and Guizhou provinces. The areas with a low value of UAPI that were surrounded by the high-value areas (L-H) were mainly distributed in northern Tibet, North China, and the Shandong Peninsula, scattered across mainland China.

Fig. 8
figure 8

Annual and seasonal spatial clustering of the urban air pollution index in 2015. a Annual. b Spring. c Summer. d Autumn. e Winter

Global effects of the UAPI influencing factors—geographical detector

Factor detection

Table 2 presents the factor detection results for the UAPI at the annual and seasonal scales. The q values of the influencing factors were ranked for a better comparison of the contributions from individual influencing factors to the UAPI. First, the ranks of the influencing factors varied from season to season. For example, wind speed (D4) rose from 23rd place in the summer to 14th place in the winter at α = 0.05, which indicated that wind might have played a more important role in the formation of the air pollution events in the winter than in the summer. Second, annual average temperature (D2), humidity index (D3), and annual average precipitation (D5) were the biggest contributors among natural factors to urban air pollution in 2015. Third, the socio-economic factors in the pressure group, such as the proportion of the construction land to the total area of the city (P1), number of manufacturing enterprises (P3), nitrogen fertilizer application rate (P4), number of domestically made car ownership (P7), highway density (P8), and the ratio of the central heating (P9), had larger contributions than the factors in the driver’s group. Fourth, the responses to urban air pollution need to be strengthened because the contribution from the response factors was generally low, although some measures (such as industrial upgrading and energy efficiency) had already played an active role in alleviating urban air pollution. The complex, interactive relationships between the influencing factors were further explored by using risk detection and interaction detection as shown in the following paragraphs.

Table 2 The results of the factor detection for the influencing factors of urban air pollution in 2015

Risk detection

The risk detector was used to further identify the direction and magnitude of the influencing factors’ effect on urban air pollution. As shown in Fig. 9 and Table 3, the influence of natural factors on urban air pollution is more complicated than that of socio-economic factors when comparing the difference between the subregions of influencing factors.

Fig. 9
figure 9

The risk detection of the influencing factors to urban air pollution. Notes: the x-axis is the subregion of influencing factors; the y-axis is the urban air pollution index. (a) D1, relative elevation; (b) D2, annual average temperature; (c) D3, humidity index; (d) D4, wind speed; (e) D5, annual average precipitation; (f) D6, regional GDP growth rate; (g) D7, proportion of secondary industry in regional GDP; (h) D8, investment to real estate development; (i) D9, actually utilized foreign capital; (j) D10, natural population growth rate; (k) P1, proportion of construction land to the total area of the city; (l) P2, proportion of coal consumption; (m) P3, number of manufacturing enterprises; (n) P4, nitrogen fertilizer application rate; (o) P5, population density; (p) P6, per unit construction building area; (q) P7, number of domestically made car ownership; (r) P8, highway density; (s) P9, ratio of the central heating; (t) R1, rate of green land in built-up area; (u) R2, proportion of energy conservation and environmental protection expenditures to local fiscal expenditure; (v) R3, ratio between the third industry and the second industry; (w) R4, per unit area industrial output; (x) R5, ex-factory price index of industrial products; (y) R6, per unit GDP in energy consumption; (z) R7, gas penetration rate; (aa) R8, expenditure of research and experimental development funds

Table 3 The complex relationship between urban air pollution and its influencing factors

First, the natural factors such as annual average temperature (D2), humidity index (D3), and annual average precipitation (D5) had apparently nonlinear effects on the urban air pollution at the subregion level, whereas the effects of the relative elevation (D1) and the wind speed (D4) on the urban air pollution were monotonically decreasing with the subregion level. Second, in terms of the anthropogenic factors, the relationships between most influencing factors and urban air pollution were close to monotonically increasing or decreasing (Fig. 9 and Table 3). In general, the relationships between urban air pollution and its influencing factors at the subregion level may be categorized into six types of linear or nonlinear relationship as defined in Table 3.

Interaction detection and ecological detection

Table 4 lists the q values of interaction detection. It shows that the explanatory power (interactive effect) between any two influencing factors (i.e., the q values in the off-diagonal cells) was always greater than that of a single individual influencing factor to the urban air pollution (i.e., the q value in the diagonal cells). Furthermore, the interaction relationship was mainly expressed as nonlinear enhance (287 pairs, accounting for 81.77%) and bi-enhance (64 pairs, accounting for 18.23%) (refer to Table A2 for more details). First, in terms of nonlinear enhance, the explanatory power had a maximum value of 0.664 (D5∩P3) and a minimum value of 0.050 (R2∩R6). The explanatory powers for 16 pairs were greater than 0.5, of which 13 pairs were interactions between a natural factor and a socio-economic factor. Second, in terms of bi-enhance, the maximum value of the explanatory power was 0.4748 (D2∩P8), whereas the minimum value was 0.1172 (D1∩R8). These results show that, to some extent, the interactions between the natural factors and the socio-economic factors can enhance the explanatory ability of the influencing factors to the urban air pollution in China.

Table 4 The explanatory power between any two influencing factors

Table 5 further explores whether the influences from different influencing factors on urban air pollution were significantly different from each other. For example, the natural factors, including relative elevation (D1), annual average temperature (D2), humidity index (D3), and annual average precipitation (D5), had a significant effect on urban air pollution, compared with the effect of wind speed (D4). In terms of socio-economic factors, the effects of the factors D10, P5, P8, P9, R1, R2, R3, R4, and R6 on urban air pollution were significantly different from each other. These results show that these influencing factors dominated the temporal and spatial evolutions of urban air pollution and urban air pollution was a result of the interactions among multiple influencing factors.

Table 5 The results of the ecological detection

Local effects of the UAPI influencing factors—geographically weighted regression

Since the q statistic of the geographical detector can only describe the explanatory power of the influencing factors on urban air pollution, we also used the GWR model to further quantify the local positive and negative effects of the influencing factors, which were to be represented by the signs and magnitudes of the estimated regression coefficients of the GWR. As compared in Table 6, the GWR model is clearly a better model than the ordinary least squares (OLS) linear regression model to explain the relationships between urban air pollution and its influencing factors. This is because the OLS model ignored the potential spatial effects of the independent and dependent variables (Wang et al. 2017).

Table 6 Comparisons of regression models

Table 7 lists the descriptive statistics of the regression coefficients of the GWR model (see also Fig. A2 for the percentages of the positive and negative values of the grid-level regression coefficients). As shown in Table 7, all influencing factors had both positive and negative effects on urban air pollution at the local scale. The four factors that had the largest positive average local effects were the proportion of secondary industry in regional GDP (D7), the ratio between the third industry and the second industry (R3), number of domestically made car ownership (P7), and highway density (P8). The four factors that had the largest average negative local effects were relative elevation (D1), annual average precipitation (D5), actually utilized foreign capital (D9), and annual average temperature (D2). Figure 10 further depicts the spatial characteristics of the regression coefficients of the GWR model. Apparently, the effects of the influencing factors on UAPI were different in different regions. For example, most natural driving factors (D1–D5) contributed positively to UAPI in western China, but not in other regions. Some influencing factors such as the number of manufacturing enterprises (P3), number of domestically made car ownership (P7), and highway density (P8) contributed positively to urban air pollution in central China. The response factors such as the rate of green land in built-up area (R1) and per unit GDP in energy consumption (R6) had greater mitigation effects on urban air pollution in most areas from northwestern to southeastern China.

Table 7 Descriptive statistics of the regression coefficients of the geographically weighted regression model
Fig. 10
figure 10

Spatial patterns of the estimated regression coefficients of the geographically weighted regression (GWR) model. (a) D1, relative elevation; (b) D2, annual average temperature; (c) D4, wind speed; (d) D5, annual average precipitation; (e) D6, regional GDP growth rate; (f) D7, proportion of secondary industry in regional GDP; (g) D8, investment to real estate development; (h) D9, actually utilized foreign capital; (i) D10, natural population growth rate; (j) P1, proportion of construction land to the total area of the city; (k) P2, proportion of coal consumption; (l) P3, number of manufacturing enterprises; (m) P4, nitrogen fertilizer application rate; (n) P6, per unit construction building area; (o) P7, number of domestically made car ownership; (p) P8, highway density; (q) P9, ratio of the central heating; (r) R1, rate of green land in built-up area; (s) R2, proportion of energy conservation and environmental protection expenditures to local fiscal expenditure; (t) R3, ratio between the third industry and the second industry; (u) R4, per unit area industrial output; (v) R5, ex-factory price index of industrial products; (w) R6, per unit GDP in energy consumption; (x) R8, expenditure of research and experimental development funds

Discussion

Complexity of the influencing factors of urban air pollution

Driver

The drivers, natural or socio-economic, have an indirect (potential) positive or negative contribution to urban air pollution (Tables 1, 2, and 3). In this study, the explanatory power of the natural drivers (such as temperature, precipitation, and humidity) on the urban air pollution is slightly higher than that of the socio-economic drivers (Table 2), which is consistent with the findings of Zhou et al. (2017) and Zhan et al. (2018). Our analysis shows that urban air pollution is closely related to natural factors. For example, meteorology is the main factor that affects the self-purification ability of the air environment and indirectly affects the formation of air pollution events in Chinese cities. In terms of air temperature, high temperature promotes air photochemical reactions, which are conducive to the formation of air pollutants such as O3 (Song et al. 2017). On the other hand, cities with lower average annual temperatures tend to burn more coals or other fuels for heating in the winter, emitting a large number of air pollutants. Moreover, the altitude of a city has an important effect on the dispersion and concentration of air pollutants. Han et al. (2016) also found that urban air pollution events occurred more frequently in the low-altitude areas than in the high-altitude areas. Wind can transport air pollutants and dilute the concentration of air pollutants, thus reducing the possibility of the formation and persistence of urban air pollution events. Precipitation can remove the suspended particles and gaseous pollutants from the air and clean the atmosphere to reduce the possibility of urban air pollution events.

Since its opening up to foreign trade and investment and implementing free-market reforms in 1979, China has been among the world’s fastest-growing economies. Social and economic drivers also resulted in various environmental problems. First, an unsound industrial structure led to considerable pollution emission. For example, Hao and Liu (2016) reported that the secondary industry had contributed most to pollutant emissions and the formation of urban air pollution events. Second, the real estate industry has driven the rapid development of the relevant industries. Cheng et al. (2017) found that once the real estate industry had exceeded a reasonable scale, it would be an important driving factor to aggravate urban air pollution indirectly. Third, as shown in Fig. 9 and Table 3, our findings simultaneously support the two popular views about the effect of the foreign direct investment (D9) on the environment of China. One is the “pollution haven” hypothesis (Bakirtas and Cetin 2017), namely, foreign enterprises carry out industrial migration through investment abroad, transferring environmental pollution to China. The opposing view is the “pollution halo” hypothesis (Wang et al. 2019a), i.e., foreign companies can bring efficient production and clean technologies, promoting the efficiency of energy utilization in China, reducing pollutant emissions, and improving environmental quality. Fourth, studies found that population growth made the slope of the environmental Kuznets curve steeper; that is, the increment of GDP per capita led to higher natural resource consumption and pollution accumulation (Wang et al. 2015).

Pressure

In this study, the pressure factors include four aspects: land use, energy consumption, economy, and society. First, the land use pressure mainly comes from construction land sprawl. Unreasonable land use structure is not conducive to the overall ventilation capacity of a city, resulting accumulation of air pollutants in a certain area. Second, energy consumption is the primary source of air pollutants. The pollutant emission coefficients vary among different energy sources, especially the air pollutant emission coefficient of coal combustion is significantly higher than other energy production sources (Hao and Liu 2016). Third, although the emission from industries is the primary source of many air pollutants, the excessive use of nitrogen fertilizer in agriculture is the main cause of ammonia pollution, which plays an important role in increasing the concentration of PM2.5 (Zhao et al. 2019). During the process of industrialization in China, the additive effect of industrial pollution and soil and water pollutions in the vast rural areas also formed a mechanism of severe rural air pollution (Gu 2014). Unfortunately, few studies explored the relationship between agricultural production and air pollution in rural China (Zhou et al. 2018b). Fourth, additional pressures also come from population growth, building construction, vehicle holdings, and road density. Increasing the residential density in urban areas stimulates private consumption, which, in turn, may result in more industrial pollutions. Lin et al. (2016) found that the dusts from construction activities were important sources of urban air particles in China. Chen et al. (2015) and Lin and Zhu (2018) also found that the emissions from the vehicles would gradually become a key driving force of urban air pollution in China. Also, the winter heating in North China promoted harmful gas emission, which was one of the important causes of urban air pollution at the regional scale.

Response

The response includes various countermeasures implemented by policymakers and environmental managers. The responses to mitigate the urban air pollution included ecological construction, economic adjustment, energy utilization adjustment, and technology innovation (Table 1). First, ecological construction, such as forest coverage and urban green space, played an important role in absorbing air pollutants and purifying the air. Second, López et al. (2011) reported that increasing investment in environmental protection helped to reduce the emission of air pollutants. Economic adjustment such as industrial upgrading promoted the transformation of industrial structures that might have potentially contributed to alleviating the haze air pollution risks. For example, Zeng and Zhao (2009) deemed that industrial agglomeration led to cleaner production due to the efficient utilization of capital and labor resources. Zhao et al. (2019) suggested that economic agglomeration was convenient for cities to deal with industrial pollution and air quality control by optimizing the industrial structure and improving business innovation. Third, energy utilization adjustment included energy price adjustment and energy efficiency improvement. Through adjusting energy prices, the government can control the total consumption of coal and optimize the composition of energy sources. Also, air quality may be improved by increasing energy efficiency. The development of clean energy technology is an important way to reduce the occurrence of urban air pollution events (Zhan et al. 2018).

Path to mitigating the urban air pollution in China

Based on the above analysis of spatial and temporal characteristics of urban air pollution in China and the influencing factors, we attempt to propose a specific path to reducing the occurrence of urban air pollution events in China. The proposed path includes the following five themes.

The first theme is multi-sectoral, multi-regional, and collaborative legislation and governance. The spatial pattern and variation analysis in this study show that the urban air pollution days in a city is not only related to local influencing factors but also related to the urban air quality in the neighboring cities. These results indicate that urban air pollutants have strong spatial diffusion or spatial spillover effects. Consequently, we suggest that local governments should strengthen regional cooperation on scientific research, promote the unification of regional environmental standards on energy conservation and emission reduction, and improve the mechanism for industry entry and exit. Moreover, it is important to establish a cooperation and law enforcement system for regional information sharing on air quality and emergency response.

The second theme is industrial upgrading and reconstruction. On the one hand, the industrial upgrade should be oriented toward raising the standards for environmental protection and energy consumption, increasing the penalties for environmental protection violations, and shutting down backward and excessive production capacities. On the other hand, China needs additional industrial upgrades, which will be driven by the new-type urbanization development and ecological civilization construction (Zhou et al. 2015). Chinese cities should strengthen structural adjustment in heavily polluting industries, strengthen source control, and reduce pollutant emissions by increasing technological innovation and upgrading heavily polluting industries, so as to change the pattern of economic growth and alleviate the increasing pressure on the environment.

The third theme is to upgrade the energy consumption structure. Economic development is not only a process of goods production and city infrastructure construction, but also a process of material and energy consumption (Lin and Zhu 2018). In the future, China should limit the use of low-grade coals, promote central heating, and combine heat and power supplies, so as to minimize fuel consumption and pollutant emissions. China should also increase the supply and consumption of clean energy such as natural gas, liquefied petroleum gas, and solar energy in the cities. In addition, China should promote the use of new energy vehicles and speed up the construction of supporting facilities and transportation networks for new energy vehicles in the cities.

The fourth theme is to optimize the “spatial distribution” of the population. China’s rapid urbanization has resulted in a series of population issues, such as population overgrowth, rural population moving to urban areas, and spatial imbalance between population and economy (Wang et al. 2015). Currently, the population’s spatial distribution and changes are becoming more and more important for regional socio-economic development in China. China’s urbanization needs to transform from the “land-based urbanization” to the “population-based urbanization” (Chen et al. 2013). Therefore, China should put a limit on megacity’ population and form the “new-type” urbanization (Chen et al. 2016) development plans that take into account the sustainability of the environment and resources in a region for balanced economic, social, and resource allocations.

The fifth theme is the reconstruction of the multi-level air duct systems. In recent years, spatial layout and structure change driven by urban expansion and land use have been among the main research areas on air pollution control (Xu et al. 2015; Yuan et al. 2017; Zhao et al. 2019). For example, Xu et al. (2015) found that unreasonable urban expansion would lead to the obstruction of the urban ventilation corridors, further reducing the speed and intensity of wind flowing through the city and increasing the frequency of stagnant air. The urban heat island effect and the temperature inversion effect may weaken the convection and diffusion of air pollutants (Fu and Chen 2017), which may lead to the microclimate conditions that favor the formation of urban air pollution events. Furthermore, Wang et al. (2019b) reported that the Chinese Academy of Sciences proposed a new low-cost charge-coupled device (CCD) Lidar detection system to measure the near-ground (< 500 m) aerosol profiles in haze days. Unfortunately, urban ventilation corridors have often been disregarded in the process of rapid urbanization development and land utilization in China in the past. Our results indicate that (Table 2) topographic and climate conditions are important factors of urban air pollution on a global scale. As shown in Fig. 4, urban air pollution events frequently occurred in the Northwest and North China. From summer to winter, air pollution mainly centered in North China and gradually expanded from there. Under the influence of the northwest monsoon, northwestern China has also become a “hotspot” of urban air pollution events and has gradually spread from Kashi in South Xinjiang to the northwest and the southeast Xinjiang. With the arrival of the southeast monsoon and the increase of precipitation in summer and autumn, the air pollutants are dissipated in the southeastern region of China. Therefore, we propose two measures for the reconstruction of air duct systems. One is to understand the importance of regional air duct systems based on regional climate, terrain, and wind conditions, making good use of the natural air duct in a region (such as green open space, ecological space, and ecological corridor). The other is to reduce near-ground (< 500 m) urban air pollution by combining urban planning, land use planning, and spatial planning, by designing the height and density of urban buildings reasonably at the city level, and by developing computer models to improve the urban air pollution forecasting based on urban air duct optimization and reconstruction.

Conclusions

In this study, we integrated the DPSIR framework, the geographical detector, and the geographically weighted regression model to understand the spatial and temporal characteristics of urban air pollution in China and to study the individual and interactive effects of the 27 influencing factors on the urban air pollution. Our empirical study in the 337 Chinese cities demonstrates that the proposed urban air pollution assessment framework is an effective tool to examine the important links among nature, society, and economic systems.

On the one hand, the DPSIR framework assumes a chain of causal links starting with the drivers (natural conditions and human activities) through the pressures (production, living, and emissions) to the states (air environment and quality) and the impacts on the nature-society-economy complex systems (public health and functions), finally leading to the “responses” (legal, economic, and administrative). The DPSIR framework is able to capture the cause-effect relationships among social, economic, environmental, and other systems, to simplify the complex human-environmental integrated system, and to contribute to the formulation of policies. On the other hand, the geographical detector is used to explore the influencing factors of urban air pollution and how they interact with each other at the global scale, while the geographically weighted regression model is used to detect the positive/negative effects of the influencing factors on the urban air pollution at the local scale.

Our findings suggest that the influencing factors of urban air pollution are complex and diverse, varying from season to season. Urban air pollution is the result of the interactions of multiple influencing factors, while the explanatory power between any two influencing factors is greater than that of a single influencing factor to the urban air pollution. Furthermore, the interaction of natural factors and socio-economic factors can enhance the explanatory ability of the influencing factors to urban air pollution.

Nevertheless, our study is not without limitations. The main limitation is that the cross-section dataset of the UAPI in the Chinese 337 cities is from a single year (2015). It is necessary to use multiple years of data over a long period for trend analysis of the urban air pollution in China. Another limitation is that urban air pollution is a systemic global or local problem, reflecting the complex relationship between human activities and the environment. It is also necessary to adopt a multidisciplinary approach to reveal the deep-rooted causes of urban air pollution. We plan to integrate process-based simulation and risk assessment to study the long-term dynamics of the urban air pollution in China and to recommend science-based mitigation strategies. In addition, data precision mismatch and partially missing data caused by compiling multiple data sources are also important factors that may affect the reliability of the results.