Introduction

Since 2013, the problem of smog pollution, the major particular component of which is PM2.5, has become increasingly serious, which seriously affects public health and daily life. According to the data from a bulletin reporting on the state of China’s ecological environment published by the government in 2017, the air conditions of 338 cities above the prefecture level have been tested in China, of which 239 cities have exceeded the standard, and polluted weather accounts for 70.7% of the total weather. Polluted days with PM2.5 as the main particular component account for 74.2% of polluted days, and weather with PM10 as the main particular component accounts for 20.4% of weather. According to a report on the cost of pollution released by the World Bank in 2016, air pollution has become a risk factor for premature death.

Haze has threatened public health and normal life and caused huge losses to the economy, which have aroused widespread concern from the state, the society, and the public. In 2013, the Chinese government issued an “Air Pollution Control Action Plan,” which proposed to strengthen the comprehensive treatment of air pollution, strictly supervise the discharge of pollutants, accelerate the upgrading and optimization of industrial structures, and eliminate industries with high pollution and backwardness. In 2017, Premier Li Keqiang first puts forward the concept of “defense against the blue sky,” which mandated strengthening the management of the ecological environment, improving the current atmospheric environment, and realizing the requirements of sustainable development. According to the report of the Nineteenth National Congress of the Communist Party of China, at present, the problem of air pollution in China is still prominent. China should continue to carry out air pollution prevention and control actions to win the battle of blue sky protection.

Since the implementation of reform and opening-up in 1978, China’s economy has gradually developed and accomplished extraordinary achievements. China’s position in the world has become increasingly prominent, and its comprehensive economic strength ranks among the highest in the world. However, during the sustainable economic development process, the accelerated urbanization produced a large amount of dust from an increase in per capita income and the increase in the number of public cars, resulting in traffic congestion and more automobile exhaust emissions. Through economic development, China’s investments in innovation increased, encouraging enterprises to invent new green environmental protection technologies and improving the utilization rate of technologies, thus reducing haze pollution. In addition, with increasing population size, human demand for energy is improving, resulting in a shortage of land resources. The haze problem is becoming serious; however, population agglomeration can improve the utilization rate of public resources and reduce environmental pressure. Therefore, an exploration of the relationship between economic development and haze pollution and how economic development affects haze pollution shows that the impact degree and direction of population agglomeration on haze pollution are conducive to the coordinated development of economy and environment. Moreover, an exploration of the relationship among economic development, population agglomeration, and haze can provide the basis for the reasonable formulation of policies and haze pollution measures.

Many scholars have studied problems related to haze pollution. At present, research mainly focuses on the composition, formation mechanism, and meteorological causes of haze particles, with few studies focusing on economic and social factors. In addition, haze pollution is diffuse, which means that haze is not an independent variable in space, and the traditional measurement method is no longer used in this study. Therefore, this paper considers the spatial spillover effect of haze pollution and uses a spatial econometric model to explore the impact of the economy and population agglomeration on haze pollution. In terms of data selection, this paper takes PM2.5 as the measurement index of haze and uses urban panel data from 1998 to 2016 for the research.

Literature review

Haze, a combination of fog and haze, has become a major concern of the public in recent years. The main particular component of haze is PM2.5. At present, in the study of PM2.5, some articles study the source and components of PM2.5 from the perspectives of chemistry and meteorology, as well as perform a risk assessment of PM2.5 on human health. Li et al. (2019b) conducted a sampling survey using the gas chromatography-mass spectrometry method. The main components of PM2.5 are NPAHs and OPAHs, the main source of which is coal combustion. In winter, NPAHs and OPAHs are significantly higher than in other seasons. Huang et al. (2014) used a quartz fiber filter (QFF) membrane and polyurethane foam (PUF) to collect samples to study the toxicity of these substances and their derivatives. The toxicity of the derivatives of NPAHs and OPAHs is worse than that of the parent substances. In addition, Ma et al. (2016a) used the positive matrix factorization (PMF) model to study NPAHs and OPAHs. The results showed that the emission of automobile exhaust and coal combustion are the main causes of pollution. Lu et al. (2019) studied PM2.5 in the Pearl River Delta of China, showing that this region was the main contributor of PM2.5.

Scholars have studied the source and composition of PM2.5 in different countries and regions, including France (Albinet et al. 2007), southern European cities (Alves et al. 2017), Athens (Andreou and Rapsomanikis 2009), and Italy (Belis et al. 2011). In recent years, research on PM2.5 in China has been increasing (Li et al. 2015; Guo et al. 2003; Bandowe et al. 2014). Bandowe et al. (2014) tested the PAC concentration in PM2.5 in Xi’an and explained that its changes were due to the urban environmental temperature, air pressure, wind speed, etc. Meanwhile, research shows that chemical substances in PM2.5 in China are significantly higher than those in PM2.5 in most Western countries, the chemical content is highest in winter, and the risk of cancer in winter is six times higher than that in summer. However, these studies all involve natural factors of PM2.5 and ignore human factors. The impact of human social and economic activities on haze pollution cannot be ignored.

Smoggy weather occurs frequently, and although some natural factors cause smog, the most common cause of smog is that industrial production is extensive, the industrial structure is unreasonable, population growth is too fast, and the speed of economic development exceeds the carrying capacity of the environment. Zambrano-Monserrate et al. (2020) used the dynamic spatial Durbin model to explore the relationship and impact between GDP and ecology. Studying the relationship between the economy and environmental quality is a hot topic in environmental economics. Since the 1990s, Grossman and Krueger (1992) put forward the environmental Kuznets curve. They believe that an inverted U-shaped relationship exists between contaminants and economic income. With the continuous increase of income, the problem of pollution is increasingly serious. After reaching a turning point, pollution decreases with increasing income. The literature on EKC is increasingly rich. Some scholars have verified the relationship between economy and pollution. Hishan et al. (2019) selected data from sub-Saharan African countries and used generalized moment estimation analysis (GMM) to study the impact of urbanization on PM2.5. The relationship between urbanization and PM2.5 was an inverted U-shaped relationship. Some Chinese scholars have also verified this result (Dong et al. 2018; Zhao et al. 2018; Qi and Yan 2017). As research results become increasingly abundant, scholars have developed different views on EKC research due to the different objects and methods selected. U-shaped, inverted N-shaped, and N-shaped relationships have been found between the economy and the environment (He et al. 2016; Wang 2016; Wu et al. 2018). In addition, the most commonly used research methods are the IPAT model (Shang and Mao 2016) and the STIRPAT model (Wang et al. 2019; Guo and Ren 2018; Zhang and Han 2018).

Rapid population growth also places enormous pressure on the environment, which has aroused widespread concern about society. There is no consensus on the impact of population agglomeration on pollution. Some scholars believe that population agglomeration can improve haze pollution because population agglomeration can allow the sharing of resources, such as factories and roads, and avoid duplicate construction (Andreoni and Levinson 2001). The spatial structure of high-density cities reduces the use of private cars, increases the demand for public transportation, and reduces the carbon emissions of transportation (Brownstone and Golob 2008; Glaeser and Kahn 2009). In addition, population agglomeration has a scaling effect, which improves the utilization rate of public infrastructure and resources, thus improving the environment (Xiao and Shen 2019). Some scholars believe that population agglomeration aggravates pollution because a large number of people moving to cities and towns increases population density, leads to population urbanization, causes the consumption of extensive materials, causes more waste emissions, and leads to environmental pollution (Shao et al. 2016; Wang and Liu 2017).

With the deepening of research, the study of other social and economic factors that affect smog has gradually increased. Wang et al. (2018) found that the acceleration of the urbanization process increased haze pollution. Dong et al. (2018) selected an autoregressive distribution lag model to explore the impact of natural gas in Beijing on PM2.5. They found that whether in the long or short term, the combustion of natural gas reduced PM2.5, but the effect of the reduction gradually weakened over time. Xu and Lin (2016) used a time series to analyze the socioeconomic factors that affect smog, ignoring the characteristics of spatial diffusion and accumulation. Zhou et al. (2019) used data envelopment analysis (DEA) to study the human activity factors that affect PM2.5. Guan et al. (2014) used an input-output model, taking the regional emission list as the input, and studied the factors that influence haze pollution through the structural decomposition method. The results of this method had hysteresis, and only specific factors were decomposed. These studies ignore the influence of natural factors, such as airflow, wind, and human factors. The diffusion characteristics of haze show that it is not mutually independent in space, so these traditional models cannot make accurate estimates of haze pollution. In view of the shortcomings of traditional research methods, a few scholars have started to use spatial metrology methods to study the causes and impact mechanisms of haze pollution (Li et al. 2019a; Luo and Li 2018; Anselin 1995).

In recent years, haze has attracted the attention of the Chinese government. China started to establish stations to monitor PM2.5 in 2013. Therefore, PM2.5 as an indicator of haze pollution is rarely studied, and the data available for analysis are limited. Most of the studies use data from recent years (He and Ma 2015; Liu and Du 2018). Pan et al. (2015) selected PM2.5 data from October 2013 to September 2014 for their research. In terms of research scope, many projects study the overall haze situation in China, and the use of data is still at the provincial level (Shao et al. 2016; Kang 2016; Wu et al. 2018). Wu et al. (2018) used provincial data from China from 2000 to 2011 to explore PM2.5. Some studies are limited to a few key cities or urban agglomerations (He and Ma 2015; Du et al. 2018; Liu and Pei 2017). Some scholars use cross-section data to study haze. Zhou et al. (2018) collected data from 945 sites in China in 2014, and Hao and Liu (2016) used data from 73 cities in China in 2013 to study factors affecting PM2.5. The above literature greatly expands our research horizons but has some shortcomings, such as (1) the period of PM2.5 is too short to reflect long-term trends in smog pollution. In the present article, the data span is 1998–2016, which avoids the shortcoming of studying too short a time period. (2) This present study studies data at the urban level, which provides a more focused view than using data at the provincial level and leads to more practical and instructive research conclusions. (3) Research in the existing literature focuses on national or local areas, and research combining national and local areas is relatively rare. First, this paper analyzes China as a whole and then divides China into three parts: the east, the middle, and the west. The conclusions of the present study are more pertinent and practical than those of previous studies. (4) Considering the characteristics of haze diffusion, the traditional model cannot make an accurate estimate. Therefore, the present study uses the spatial metrology method to study the characteristics of haze diffusion. Based on the abovementioned research, this article selects PM2.5 as an indicator to measure smog and uses SDM to study the overall impact of economic development and population agglomeration on haze pollution in 287 urban panel datasets from 1998 to 2016 and then analyzes the three regions and puts forward corresponding policy recommendations.

Data

China’s air pollution index did not include PM2.5 in monitoring before 2012. The Chinese government issued the “Implementation Plan of the First Phase of the New Air Quality Standards” in 2012, which requires PM2.5 monitoring in 74 cities across the country, and PM2.5 officially became a monitoring index. Due to the short time of data acquisition in China, these data cannot reflect long-term trends in haze pollution, so this paper uses grid data obtained by the Battle Institute and the International Earth Science Information Network Center of Columbia University for analysis. On the basis of Van Donkelaar et al. (2015), grid data are converted from aerosol optical thickness (AOD) data. In this paper, ArcGis 10.0 is used to analyze grid data from 1998 to 2016 and convert them into PM2.5 concentration values for 287 Chinese cities. Although China currently uses ground monitoring data, which are not easily affected by meteorological factors, China is large and the distribution of monitoring points is not dense, so ground monitoring based on points and areas cannot truly reflect more atmospheric conditions (Yu and Xiu 2018). Data obtained using satellites are non-point source data, which can comprehensively capture the characteristics of PM2.5 concentration changes, making these data more realistic and accurate.

Spatial correlation analysis of haze

To study the spatial spillover effect of haze, this paper tests global and local autocorrelations.

Global spatial correlation test

Based on the first law of geography,Footnote 1 this article uses global Moran’s I to examine the characteristics of PM2.5 and to further explore the spatial correlations of haze. The formula is:

$$ I=\frac{\sum \limits_{i=1}^n\sum \limits_{j=1}^n{w}_{ij}\left({x}_i-\frac{1}{n}\sum \limits_{i=1}^n{x}_i\right)\left({x}_j-\frac{1}{n}\sum \limits_{i=1}^n{x}_i\right)}{\frac{1}{n}\sum \limits_{i=1}^n{\left({x}_i-\overline{x}\right)}^2\sum \limits_{i=1}^n\sum \limits_{j=1}^n{w}_{ij}} $$
(1)

where I is the global index of PM2.5, n represents the number of cities, and Wij represents the spatial weight matrix. xi and xj represent haze observation values of city i and city j, respectively. When the value range of I is [− 1.0) and it is infinitely close to − 1, PM2.5 shows a negative correlation in space; when the value range is (0,1] and I is infinitely close to 1, PM2.5 shows a positive correlation in space; and when I is infinitely close to 0, the PM2.5 has no correlation in space. In the selection of the spatial weight matrix, most scholars use the 0–1 adjacency matrix. If there is a common edge or point in the two areas, they are adjacent and the value is 1; otherwise, it is 0 (Liu and Jiang 2017; Ma and Zhang 2014). However, there will be a city that is not adjacent to any other city, causing deviations in the results.

When introducing a spatial econometric model, we must construct the spatial weight matrix. The 0–1 matrix is commonly used. However, this paper uses data from 287 cities, and the problem of independent cities affects the accuracy of the data. This paper uses the commonly used inverse distance matrix and economic geographical distance matrix, which not only consider the spatial impact of geographical distance but also reflect the fact that economic factors have regional spillover and radiation effects that can reflect the spatial correlation degree of cross-section units more comprehensively and objectively. On the other hand, these matrixes can test the stability of the model.

This paper draws on the perspective of Luo and Li (2018) and uses the inverse distance spatial weight matrix (W1). The basic idea of W1 is that as the distance increases, the spatial correlation of haze will gradually weaken. The formula is:

$$ {w}_{1, ij}=\left\{\begin{array}{c}1/{d}_{ij}\\ {}0\end{array}\right.{\displaystyle \begin{array}{cc}& i\ne j\\ {}& i=j\end{array}} $$
(2)

where W1 refers to the reciprocal of the nearest distance between area i and area j. This matrix can not only reflect the degree of adjacency of two regions but can also reflect the traffic convenience between two regions.

The second matrix is the economic-geographic distance weight matrix (W2), which is the product of the reciprocal of the absolute value of the difference between the per capita GDP of regions i and j and the reciprocal of the nearest distance between regions i and j. The formula is:

$$ {w}_{2, ij}=\left\{\begin{array}{c}\frac{1}{d_{ij}}\ast \frac{1}{\left|\overline{Y_i}-\overline{Y_j}\right|}\\ {}0\end{array}\right.{\displaystyle \begin{array}{cc}& i\ne j\\ {}& i=j\end{array}} $$
(3)

where \( \overline{Y_i}=\sum \limits_{t={T}_0}^T{Y}_{it}/\left(T-{T}_0\right) \) and Yij is the per capita GDP of city i in year j. This weight considers the spatial impact of geographical distance and also reflects the spillover and diffusion effects of economic factors, which can more comprehensively reflect the spatial correlation.

According to Table 1, under the two spatial weight matrices, W1 and W2, Moran’s I of PM2.5 passes the test at the level of 5%, and the value of the coefficient is greater than 0, indicating that PM2.5 has a spatial spillover and haze pollution has diffusion and agglomeration.

Table 1 Global Moran index table of smog pollution from 1998 to 2016

Local spatial correlation test

Although the global test reflects the overall correlation of variables in space, it also ignores the characteristics of local areas. To analyze the regional characteristics of haze more carefully, this paper introduces the local spatial correlation index (LISA) for further research. The formula is:

$$ {I}_i=\frac{\left({x}_i-\frac{1}{n}\sum \limits_{i=1}^n{x}_i\right)}{\frac{1}{n}\sum \limits_{i=1}^n{\left({x}_i-\overline{x}\right)}^2}\sum \limits_{i=1}^n{W}_{ij}\left({x}_j-\frac{1}{n}\sum \limits_{i=1}^n{x}_i\right) $$
(4)

where Ii is the correlation degree of PM2.5 between region i and neighboring regions. The other variable settings are the same as those for the global test. When the value range of I is (0,1], it means that a high PM2.5 concentration is surrounded by a high PM2.5 concentration or a low haze concentration is surrounded by a low haze concentration, and PM2.5 shows a H-H or L-L concentration; when the value range of I is [− 1,0), that area i has a negative correlation with the surrounding area; that is, a low PM2.5 concentration is surrounded by a high PM2.5 concentration or a high haze concentration is surrounded by a low haze concentration, and PM2.5 shows a L-H or H-L concentration.

To further study the spatial distribution characteristics of haze pollution, Stata 14.0 is used to calculate the local Moran index value of Chinese PM2.5 from 1998 to 2016, and a scatter map of the corresponding year is drawn. Taking the data from the inverse distance matrix (W1) as a representative, this paper selects the Moran scatter diagram from 1998, 2004, 2010, and 2016 (Fig. 1).

Fig. 1
figure 1

Moran scatter plot of haze pollution in Chinese cities

In Fig. 1, the scatter diagram is divided into four quadrants. The first quadrant is an H-H concentration area, that is, an area where a high haze area is adjacent to another high haze area; the third quadrant is an L-L concentration area; i.e., a low haze area is adjacent to another low haze area. The second quadrant is a low-high clustering region, and the fourth quadrant is a high-low clustering region. On the whole, the local Moran index of PM2.5 over 4 years is greater than 0, and most samples are in the first and third quadrants, which reflects a positive correlation, indicating that smog has a spatial correlation.

This paper uses ArcGis 10.0 to plot the distribution of PM2.5 over 4 years. PM2.5 is divided into five grades: excellent (0~15 μg/m3), good (15~30 μg/m3), light (30~45 μg/m3), medium (45~60 μg/m3), and heavy pollution (> 60 μg/m3), which are shown in Fig. 2. It can be seen from the figure that there is a clustering phenomenon of haze pollution. Low haze pollution and surrounding areas are clustered, especially in the western region. High haze pollution areas are also clustered. In addition, haze pollution shows an increasingly serious trend over time. In the chart for 2010 to 2016, the concentration of PM2.5 has a downward trend.

Fig. 2
figure 2

National distribution of the PM2.5 concentration (In order to show the change of the PM2.5 concentration in China from 1998 to 2016, we developed a website http://www.aqipm.top/, which can be accessed for details)

Theoretical mechanism, model construction, and empirical research

Theoretical mechanism

Effect of economic development on haze pollution

The relationship between the environment and the economy has always been a hot topic in academic circles. Given the increasingly serious haze pollution in China, scholars began to explore the relationship between haze pollution and economic development. In an exploration of the causes of haze pollution, the influencing factors of haze pollution were determined, and corresponding haze control policies were implemented to not only improve the atmospheric environment but also ensure human health. Given China’s reform and opening-up, the country’s comprehensive national strength has been continuously enhanced, and the degree of opening to the outside world has gradually strengthened. Simultaneously, inexpensive labor has attracted many foreign enterprises to invest. To protect their environment, enterprises in developed countries transfer some industries with high pollution and high energy consumption to China, which puts pressure on China’s environment (Shahbaz et al. 2015). In addition, the improvement in China’s degree of opening-up has brought new energy technologies, which has produced technological effects and has improved the country’s haze situation (Zhang et al. 2019). During the critical period of economic transformation, the development of the economy also promotes continuous optimization and upgrades to industrial structures. The continuous expansion of market scale and the increasing diversity of the public’s demands have increased their demand for low pollution products with environmental protection awareness under the condition of meeting basic material conditions. Therefore, the proportion of the tertiary industry dominated by the service industry is increasing, and the manufacturing industry with its high energy consumption and high pollution is being reformed to improve production capacity and reduce haze pollution.

Given the development of the economy and the progress of science, China has realized the importance of innovation. The government has been increasing its innovation efforts, encouraging enterprises to develop new energy and green environmental protection technologies, optimizing traditional high-pollution industries, and improving the use efficiency of resources and energy to improve the environment and develop technical effects. However, the progress of technology also increases the demand for energy, which leads to a stronger pollution and energy rebound effect (Shao et al. 2019). Because China has a large population base and increasing population density, the country consumes significant energy and resources, and the emission of pollutants is also increasing. Population increases cause housing demand to increase, which drives the construction of real estate and produces significant dust. At the same time, population increases also lead to demand for automobiles, which produces a large volume of vehicle exhaust emissions that aggravates haze pollution and produces the population scale effect (Zhao et al. 2019; Tong and Wang 2014). Since the reform and opening-up, urbanization has become a new driving force for China’s economic development. The inevitable result of economic and social development is an important sign for a country or region to engage in modernization. The construction of new urbanization structures should adhere to green development, change the traditional production and lifestyle, eliminate backward industries, accelerate the construction of new industries, and promote industrial innovation and upgrading to improve the quality of haze pollution (Liu and Xu 2017).

Variable selection and source

In the 1970s, Holdren and Ehrlich (1974) proposed the IPAT model to study the impact of wealth, population size, and technological level on the ecological environment. However, the model ignored the application limitations of multiple factors that affect the environment. York et al. (2002) redefined the IPAT model and expressed it in a random form. The STIRPAT model was established and improved in this paper, and its variables are selected based on the STIRPAT model.

Explained variable

Haze pollution (lnPM): PM2.5 is the primary particle in haze, which can easily cause respiratory diseases. High concentrations of PM2.5 can cause death and seriously endanger human health. Therefore, the PM2.5 concentration is used to express the degree of haze pollution in this paper, and the data come from the PM2.5 concentration value parsed from grid data.

Core explanatory variables

This article uses the actual GDP to represent the level of economic development (lnGDP). This paper takes 2000 as the base period and adjusts the nominal GDP of each city to its real GDP through a deflator. Population agglomeration (lnP) is represented by population density (Liu et al. 2019; Luo and Li 2018). At present, scholars have not reached a consistent conclusion about the influence of population agglomeration on smog pollution.

Control variables

Industrial structure (lnTI): with the development of the economy and science, the industrial structure continues to be optimized and upgraded, and the proportion of tertiary industries with low pollution levels is constantly increasing. Therefore, this paper chooses the proportion of tertiary industries in GDP to represent the industrial structure (Chen 2016). In the existing literature, urbanization (lnAE) is usually measured by the percentage of permanent residents in the total population. However, due to the long time period and large number of cities studied, our data access is limited. Therefore, we select the average number of on-the-job employees to represent lnTI (Song 2000). Technical level (lnS): the fundamental driving force of technological progress is science and technology. The expenditure on science in the public finance of each city is used to express the technical level (Dong 2016). At present, scholars have different views on the impact of opening to the outside world (lnFDI) on haze pollution. There is no consensus on whether opening to the outside world improves or intensifies haze pollution. This paper uses the actual amount of foreign capital used in a year to express this index (Liu et al. 2019; Shao et al. 2016).

Market scale (lnM): the continuous expansion of the market scale leads to economic agglomeration and can be used to further explore the impact of the economy on haze pollution. This paper uses total retail sales of social consumer goods to measure the market scale (Luo and Li 2018). Transportation intensity (T): convenient transportation is conducive to exchanges with the regional external economy, attracts investment, and improves the economic level. In addition, emissions of automobile exhaust aggravate haze pollution. To explore the impact of this factor, this paper uses the per capita urban road area to measure emissions of automobile exhaust (Liu et al. 2019; Shao et al. 2016). Material capital (lnK): Zivin and Neidell (2012) believe that the combination of material capital and labor can improve production efficiency, thus affecting economic development. The indicators are measured by fixed-asset investments. The details are shown in Table 2.

Table 2 Economic indicators and sources of haze pollution

The data come from the “China City Statistical Yearbook”. To ensure the authenticity, integrity, and continuity of the data, 287 cities from 1998 to 2016 were ultimately selected. The logarithm of the relevant variables was taken to reduce errors.

Model selection and construction

Based on EKC, this paper introduces the quadratic term of real GDP into the model using the method of Ma et al. (2016b). Three commonly used spatial econometric methods are used. The spatial lag model (SLM) is mainly used to study the spatial spillover effect of this region on other regions. In the spatial error model (SEM), the influence of spatial factors on the error term becomes complex and can no longer be simply subjected to a normal distribution. The error term may contain other factors affecting haze pollution, which causes a strong correlation; therefore, a spatial error model is constructed. The spatial Durbin model (SDM) contains spatial lag factors of the explanatory variables. This model can better solve the problem of variable omission than the SEM and the SLM and ensures an unbiased estimation of coefficients.

As for the selection of models, the LR test can be performed according to the judgment criteria proposed by LeSage and Pace (2009). The test has two hypotheses: H0 : θ = 0 and H0 : θ + ρβ = 0. The first hypothesis tests whether the SDM degenerates into the SLM, and the second hypothesis is used to test whether the SDM degenerates into the SEM. If both hypotheses are rejected, the SDM is optimal; otherwise, the SLM or the SEM should be selected. It can be seen that the spatial Durbin model is more general.

Spatial model selection

First, we can further judge whether the spatial panel model should be used through the LM test and diagnose the four statistics of the traditional OLS model. If all the OLS models are rejected by the four statistics, we cannot directly judge whether the spatial lag or spatial error model should be used. We should also judge the three spatial models through the WALD and LR tests.

The results presented in Table 3 show that the four statistics of traditional OLS reject the original hypothesis. It is further confirmed that smog pollution has a spatial spillover effect. From the results of the WALD and LR tests, the estimates reject the original hypothesis. Therefore, the spatial Durbin model (SDM) is more suitable for this research. Based on the results of the Hausman test and the LR test, this article finally chooses the time-fixed effect of the SDM for empirical study.

Table 3 Diagnostic results of the spatial model

Model construction

Through this spatial correlation test, haze is observed to be not independent in each region. In the traditional OLS model, the variables are independent of each other. Therefore, the traditional OLS model is no longer applicable to this paper. This paper attempts to use spatial econometric models to study whether the economy of each city affects smog in neighboring areas. To comprehensively and carefully study the long-term relationship between the economy and PM2.5, according to the results of the model’s test, this paper selects the SDM for the empirical research. The model is as follows:

$$ {\displaystyle \begin{array}{c}\begin{array}{c}\ln P{M}_{it}={\beta}_0+\rho \times W\ln PM+{\beta}_1\times \ln GDP+{\beta}_2\times {\left(\ln GDP\right)}^2+{\beta}_3\times \ln P+{\beta}_4\times \ln AE\\ {}\kern1.00em +{\beta}_5\times \ln S+\beta \times {}_6\ln FDI+{\beta}_7\times \ln M+{\beta}_8\times \ln T+{\beta}_9\times \ln TI+{\beta}_{10}\times \ln K+{\theta}_1\\ {}\kern1.00em \times W\ln GDP+{\theta}_2\times W{\left(\ln GDP\right)}^2+{\theta}_3\times W\ln P+{\theta}_4\times W\ln AE+{\theta}_5\times W\ln S\\ {}\begin{array}{ccc}& & \end{array}\begin{array}{ccc}& & \end{array}+\theta \times {}_6W\ln FDI+{\theta}_7\times W\ln M+{\theta}_8\times W\ln T+{\theta}_9\times W\ln TI+\\ {}\begin{array}{cccc}\begin{array}{cccc}\begin{array}{ccc}& & \end{array}& & & \end{array}& & & \end{array}{\theta}_{10}\times W\ln K+{\delta}_i+{\mu}_i+{\varepsilon}_{it}\end{array}\\ {}{\varepsilon}_{it}\sim N\left(0,{\sigma}_{it}^2{I}_n\right)\end{array}} $$
(5)

In the above model, δi is the spatial effect and μt is the temporal effect.

Results analysis

The empirical results are shown in Table 4. Under the two spatial weight matrices, the two indicators of the economic development level pass the significance test. The primary term of the coefficient is negative, and the secondary term is positive. It can be seen that China’s haze pollution conforms to the EKC, which is an inverted U-shaped relationship. It is consistent with the research results of Liu et al. (2019). During the initial economic development stage, industrialization develops rapidly, pollutant emissions exceed the standard, urban buildings lead to increases in dust, and haze pollution continues to aggravate. When reaching a turning point, with the improvement of the economic level, the haze pollution situation improves. The reason for this phenomenon is that the harm caused by haze attracts the attention of all sectors of society and attention is paid to environmental protection while developing the economy. The product of the economy and the matrix is tested at the level of 5% in W2, which shows that the economic development of neighboring areas has a spillover effect on haze pollution in the local area, and the relationship between them is also an inverted U shape. With the continuous increase of per capita income in neighboring areas, the concentration of PM2.5 in local area increases first and then decreases.

Table 4 Spatial metrology model results

Under two spatial weight matrices, the coefficient of population agglomeration (lnP) is significantly positive. The value of population agglomeration is the largest of all the influencing factors; Liu et al. (2020) also considered that population agglomeration is the decisive human factor affecting haze pollution, which indicates that it is the primary factor affecting haze pollution. Population agglomeration has a certain incremental effect on the emission of haze pollution; that is, population agglomeration increases infrastructure construction, consumption, and industrial output as well as energy and resource consumption and pollution emissions increase haze pollution. Under the W2 weight matrix, W*lnP is significantly negative, showing that the increase of the population density and the flow of people in neighboring areas alleviate haze in this area, reflecting the siphoning effect of one region on its surrounding areas.

The influencing factor of the technical level (lnS) passes the significance test in W1 and W2, the continuous progress of the technology level can improve haze pollution (Sun et al. 2019; Dong 2016), and the coefficient of W*lnS on adjacent areas is also negative in W1, which shows that the continuous progress of the technical level can improve smog in surrounding areas. Given the development of the economy, the government has increased its innovation efforts and increased investments in scientific and technological research, which provides strong support for enterprises, encourages enterprises to develop new energy technologies, develops green environmental protection technologies, eliminates backward high pollution and high energy consumption technologies, and optimizes traditional technologies, such as reducing the use efficiency of coal and other resources through technical optimization. The coefficient of urbanization (lnAE) is negative in W1. The acceleration of urbanization has improved haze pollution. When the urbanization level is increased by 1%, haze pollution is improved by 4%. This is because great changes have occurred in the development of urbanization, and the development of the non-agricultural industry has led to the vigorous development of a low pollution service industry. In the process of vigorously developing urbanization, the government has paid increasing attention to the construction of ecological civilization, emphasizing energy conservation in the development mode, reducing the harm to nature, and promoting green travel lifestyle and urban construction modes.

The coefficients of lnFDI under the two spatial matrices are different. Under the W1 weight matrix, the coefficient is small; therefore, this paper analyzes the significance of the results in W2. The coefficient of openness to the outside world (lnFDI) is significantly positive and has passed the significance test at the 1% level, which shows that FDI aggravates haze pollution (Shahbaz et al. 2015; List and Co 2000; Ren et al. 2014). To obtain low-cost labor, developed countries transfer highly polluting industries to the region, resulting in the deterioration of the local environment. The coefficient of the opening-up level and matrix product (W*lnFDI) is significantly negative. FDI in neighboring regions reduces haze pollution in the region (Zhang and Zhou 2016; Dong et al. 2012). The reason for this phenomenon is that the region learns to introduce green advanced energy technologies from foreign-funded enterprises in neighboring regions to improve the region’s environment.

The influencing factor of the industrial structure (lnTI) is negative, showing that the increase of the tertiary industry improves haze pollution (Dong et al. 2019), because tertiary industry is mostly composed of low-energy green industries, which have little impact on the environment. In addition, heavy industry, which causes serious pollution, has been gradually rectified or transferred to neighboring areas, resulting in the aggravation of haze pollution in neighboring areas. The coefficient of W*lnTI in W2 is positive, indicating that the optimization and upgrading of the industrial structure in neighboring areas aggravate local haze pollution.

The coefficient of material capital (lnK) is positive. Although China’s fixed-asset investment is increasing, the proportion of investment in green industry is still low, which increases the burden of the environment in the region. Under the W1 weight matrix, W*lnK is significantly positive, meaning that the increase of fixed-asset investments in neighboring areas also aggravates haze pollution in the local area. The coefficients of the market scale (lnM) and traffic factor (T) are negative, indicating that the economic agglomeration brought by the market scale and convenient transportation can improve smog pollution.

Analysis of the spatial spillover results

A general panel model studies independent variables and dependent variables without considering spatial factors. However, when considering spatial factors, the problem becomes more complex because the change of a certain factor can not only cause a change in haze pollution in this area but may also affect haze in adjacent areas. Lesage and Pace (2009) put forward three kinds of spatial effects: direct, indirect, and total effects. The direct effect indicates the average degree of influence of the independent variable on haze in the local area. The indirect effect refers to the influence of a given factor on smog in adjacent areas. The total effect is the impact of the explanatory variable on haze in all areas.

The basic form of the spatial Durbin model:y = ρWy + ατn +  + WXγ + ε

Transform the formula: (In − ρW)y = ατn +  + WXγ + ε

$$ y=\sum \limits_{r=1}^k{S}_r(W){x}_r+V(W){\alpha \tau}_n+V(W)\varepsilon $$
(6)

where:

$$ {\displaystyle \begin{array}{c}{S}_r(W)=V(W)\left({I}_n{\beta}_{\gamma }+W{\theta}_r\right)\\ {}V(W)={\left({I}_n-\rho W\right)}^{-1}={I}_n+\rho W+{\rho}^2{W}^2+{\rho}^3{W}^3+\cdots \end{array}} $$

Expand formula (6):

$$ \left[\begin{array}{c}{y}_1\\ {}{y}_2\\ {}\vdots \\ {}{y}_n\end{array}\right]=\sum \limits_{r=1}^k\left[\begin{array}{cccc}{S}_r{(W)}_{11}& {S}_r{(W)}_{12}& \cdots & {S}_r{(W)}_{1n}\\ {}{S}_r{(W)}_{21}& {S}_r{(W)}_{22}& \cdots & {S}_r{(W)}_{2n}\\ {}\cdots & \cdots & \cdots & \cdots \\ {}{S}_r{(W)}_{n1}& {S}_r{(W)}_{n2}& \cdots & {S}_r{(W)}_{nn}\end{array}\right]\left[\begin{array}{c}{x}_{1r}\\ {}{x}_{2r}\\ {}\vdots \\ {}{x}_{nr}\end{array}\right]+V(W)\alpha {\tau}_n+V(W)\varepsilon $$

The partial derivative of yi to xir is \( \frac{\partial {y}_i}{\partial {x}_{ir}}={S}_r(W){}_{ii} \), which represents the direct effect. This derivative is obtained by calculating the mean value of the diagonal of the matrix Sr(W). The partial derivative of yi to xjr is \( \frac{\partial {y}_i}{\partial {x}_{jr}}={S}_r(W){}_{ji} \), which represents the indirect effect in space. This derivative is obtained by calculating the average value of the elements on the non-diagonal of the matrix Sr(W). The total effect is the sum of the direct effect and the indirect effect, which is obtained from the average value of many elements in the matrix Sr(W).

Compared with the two matrices presented in Table 5, the results in W2 pass the significance test. Therefore, this paper mainly analyzes the results of W2. From these results, we can see that the direct effects are consistent with the signs of the independent variables in the spatial Durbin model. The coefficients of the primary and secondary terms of economic development in the indirect effects are 1.697 and − 0.058, respectively. The influence of the regional economy on neighboring areas also has an inverted U-shaped relationship, so in general, China’s economy conforms to the EKC hypothesis. For the indirect effect, the population size (lnP) coefficient is positive and has passed the significance test at the 1% level. This coefficient shows that population agglomeration also has adverse effects on the environment of neighboring areas. The reason may be that population agglomeration makes urban buildings expand continuously, putting land and other resources in short supply. Moreover, automobile exhaust emissions increase and dust and other particles increase, worsening and expanding haze in the region. From the overall effect, population factors still affect the primary factor of haze pollution.

Table 5 Space spillover results

The coefficient of lnTI on neighboring areas is positive, and through a significance test at the 1% level, every increase in the proportion of the tertiary industry aggravates haze in neighboring areas by 0.440. Because of upgrades to the industrial structure in this region, extensive industries are transferred to neighboring areas under strict control, which causes smog pollution in neighboring regions to continue to rise. The positive impact of industrial structure on adjacent regions is greater, making the overall effect positive, and the industrial structure is positive for haze pollution. The indirect effect coefficient of lnFDI is significantly negative and has passed the significance test, which shows that FDI in this region can reduce haze in neighboring areas. This is because local governments share the experience of foreign advanced technology, green energy technology, and environmental pollution reduction to surrounding areas to reduce haze pollution. From the overall effect, FDI can reduce haze pollution. The indirect effect coefficient of lnS is negative, but the result is not significant. From the total effect, the coefficient of the technology level is significantly negative, which indicates that environmental quality has improved from the continuous improvement in technology. Both the indirect effect coefficients of market size (lnM) and traffic intensity (T) are negative. For each additional unit of variables, market size and traffic intensity improve haze pollution by 0.168 and 0.005, respectively. From the total effect, the continuous expansion of market scale and the improvement in traffic convenience can reduce haze pollution.

Study on the subregion of haze pollution

To further investigate the influences of economic development and population agglomeration on haze pollution, we divided the 287 cities into east, central, and west regions. Due to the difficulty of data acquisition, 31 provinces except Hong Kong, Macao, Taiwan, and Nansha Islands are taken as the study area, and their distribution is shown in Fig. 3.

Fig. 3
figure 3

Geographical distribution of the eastern, central, and western regions

In the eastern and central parts of China, the Moran’s I values of PM2.5 are all significantly positive. The Moran index of most years is positive and passed the significance test in the west, except that a few years that it did not pass the significance test. Therefore, there is a significant spatial correlation among the three regions of China (Table 6).

Table 6 Spatial correlation test results based on region

In this paper, we use the time-fixed effect of the SDM to analyze the results of the three regions. The primary coefficients of the economy are all positive and the secondary coefficients are all negative in the three regions, but the coefficients only pass the significance test in the eastern and central regions. The relationship between real GDP and haze is an inverted U-shaped curve, which is consistent with the overall situation of the country. However, the impact of income on haze in neighboring areas is not the same. The eastern area has a U-shaped relationship, while the central and western areas have an inverted U-shaped relationship. The coefficients of population agglomeration are positive in all three regions. Among all the positive factors, the lnP value of the three regions is the largest, indicating that the population concentration is the primary factor that aggravates haze pollution. In the eastern region, the coefficient value of the product of the spatial weight matrix and population density (W*lnP) is significantly positive, and population aggregation in neighboring areas aggravates haze in the region. In the central region, the coefficient value of W*lnP is significantly negative. The population concentration in neighboring areas reduces haze in this region, whereas the coefficient value in the western region is not significant (Table 7).

Table 7 Test results of region-based spatial models

All of the industrial structure and technical level coefficients of the three regions are all negative and pass the significance test, which shows that the increasing output value of the tertiary industry and the continuous progress of technology can improve haze pollution, which is consistent with the overall situation of the entire country. The coefficient of the product of industrial structure and the spatial weight matrix (W*lnTI) in central and western regions is positive, and the optimization and upgrading of the industrial structure in neighboring regions bring pressure on the local environment. The urbanization (lnAE) coefficient of eastern and western regions is positive and passes the significance test at the 1% level. In the early stage of urbanization, the eastern region only pays attention to development and ignores damage to the environment; haze pollution increased by 0.070 per unit of urbanization level. In the western region, because of its underdeveloped economy, backward technology, and low level of urbanization, haze pollution is aggravated. The urbanization coefficient is negative, which indicates that urbanization improves haze pollution in the central region. The coefficient of W*lnAE in eastern and central regions is significantly positive, and the acceleration of the urbanization process in adjacent areas aggravates haze pollution in the region. Except for the central region, the coefficients of lnFDI are negative, which indicates that FDI improves haze pollution by introducing foreign advanced technology, while FDI in the central region intensifies pollution. In the eastern and western regions, haze pollution in the neighboring areas can be improved by 0.095 and 0.028 for each unit of the opening-up level. The material capital coefficient of all three regions is positive. Increasing population causes labor-intensive industries to continue to rise. Although material capital investment increases, it has not yet reached the ideal effect. The market scale (lnM) in the eastern region reduces haze pollution, and the expansion of the market scale in the central region damages the environment. The coefficient of W*lnM in the central and western regions is significantly negative and has passed the significance test at the 1% level. The market scale of neighboring regions is expanding continuously, which reduces the environmental pressure of the central and western regions. In the eastern and central regions, the coefficients of the traffic factor are positive, but the values are very small: for each 1% increase, haze pollution increases by 0.004 and 0.002, respectively. In the central and western regions, the convenient transportation in neighboring areas has also reduced haze in the region.

Conclusions and countermeasures

To study the effects of the mechanism and path of economic development and population agglomeration on haze, this paper selects data from 287 cities from 1998 to 2016 and conducts an empirical analysis using the spatial Durbin model through the WALD, LR, and LM tests. At the same time, cities in China are divided into three regions to study the heterogeneity of haze pollution. The main results are as follows:

  1. 1

    From the empirical results and correlation tests, it can be concluded that haze causes spillover in space. Haze pollution in most areas appears to follow the phenomenon of H-H and L-L concentrations.

  2. 2

    From the perspective of the whole country, the relationship between the economy and haze pollution is an inverted U-shaped curve. From the results of regional heterogeneity, the relationship is an inverted U shape in the eastern and central regions; there is no obvious relationship between the economy and haze pollution in the western region. By further studying the indirect effects of real GDP and the PM2.5 concentration, it is found that the relationship between real GDP in this region and haze in neighboring areas is also an inverted U shape. In all the studies, it is confirmed that population agglomeration has the greatest impact on haze and that it is the primary factor that aggravates haze pollution.

  3. 3

    On the whole, the coefficients of the industrial structure, technology level, urbanization level, and market scale are negative, which shows that these indexes can improve the atmospheric environment. The coefficient of material capital is positive. The main influencing factors of haze pollution in the eastern area are foreign direct investment and the market scale, which can reduce haze pollution. In the central region, with the acceleration of urbanization and the continuous expansion of the market scale, pollution has been improved, while the increase of foreign investment has placed pressure on the environment. Traffic factors in the western region can improve the environment.

According to the above conclusions, the following policy suggestions are made:

  1. 1.

    Whether from the perspective of the whole country or individual regions, haze pollution has a significant correlation in space. The respective management mode cannot fundamentally solve the problem of environmental pollution. Each region should not be limited locally but should instead cooperate with surrounding areas to realize the governance mechanism of “joint prevention and control.”

  2. 2.

    Attentions should be paid to the balance between the economy and the environment. Some cities have passed the inflection point of economic development and should continue to follow the green development route. Cities that have not reached the inflection point should strengthen government supervision, pay attention to environmental protection while developing their economy, and improve the green innovation ability of enterprises. The appraisal of China’s performance is based on GDP, which makes local governments only pay attention to economic development without considering environmental issues. Therefore, the local environmental quality should be included in the scope of the performance appraisal to quantify environmental losses, deduct the environmental cost in GDP, and change the traditional concept of attaching importance to the economy and ignoring the environment, thus realizing the decoupling of the economy and haze as soon as possible.

  3. 3.

    Cities in the eastern region have basically entered the late stage of industrial development, but haze pollution still has a spillover effect, which shows that it is difficult for the eastern region to improve air pollution alone. Therefore, the government should break the boundary restrictions of cities, divide pollution areas according to PM2.5 data, and determine urban agglomerations of haze pollution. Cities should cooperate with each other and put forward new policies in terms of air quality standard improvements, the pollution management framework and mode, air quality monitoring, compensation, etc. For the central region, we should learn from the rich experience of the east to accelerate the transformation of the industrial structure, eliminate high-pollution industries, improve the environmental entry standards of enterprises, strictly restrict the entry of foreign-funded enterprises with high pollution into the region, and introduce enterprises with low pollution and adhere to green development. In the western region, increasing road construction and convenient transportation can attract more investments, but at the same time, we should learn from the experience of the eastern and central regions and improve the environmental access standards of enterprises.

  4. 4.

    Optimize the industrial structure, improve the technical level, and accelerate the transformation from labor-intensive to technology-intensive industry. We should vigorously develop a circular economy, use advanced technology to transform traditional industries, enhance the innovation capacity of traditional enterprises, eliminate heavy industries with high pollution, and focus on developing green industries. The government should increase its investment in scientific and technological research and development, change the energy structure, vigorously develop new sources of energy, reduce the use of coal resources with high pollution, and strengthen research on the UHV power grid and energy storage technology. In addition, we should increase research on clean conversion technology for coal and realize the recycling of coal gasification through scientific and technological innovation.

  5. 5.

    Because population agglomeration is the primary factor affecting haze pollution, cities should reasonably control their population density, shifting the focus of haze management from population control to actively promoting pollution source management and optimizing population structure. We should reasonably arrange the population distribution of cities, establish compact cities, pay attention to scientific spatial planning, and constantly improve the quality and level of public services. We should properly expand urban agglomerations, give scope to the economic advantages brought by population and capital accumulation in large cities, and maximize the spillover effect of haze pollution control technologies. The governance of haze pollution requires the participation of the whole population. People should pay attention to saving resources in their daily life and production activities, advocate for a low-carbon and green life, establish strong environmental awareness, achieve green consumption and travel, and reduce emissions from waste articles in life to protect the atmospheric environment and reduce the occurrence of haze weather.

  6. 6.

    The governance of haze pollution is a long-term project, which is the public’s demand for improving the quality of the environment, and also provides a good opportunity to promote China’s green development, economic structure optimization, and technical level improvement. From a legal perspective, national environmental protection legislation should be strengthened, laws should be improved, regulations and policy standards on atmospheric governance should be implemented, environmental taxes, resource taxes and other means of environmental regulation should be collected, environmental access threshold of enterprises should be improved, and the introduction of high-pollution and high-energy consumption technologies should be limited.

Shortage and prospect

First, this paper uses data on prefecture-level cities. Because of the limited data sources and some missing data, we can only use data on 287 prefecture-level cities for this research, which cannot reflect the haze situation of each city. In addition, haze has many influencing factors. This paper only analyzes haze from the perspective of the economic society. In addition to the influence of economic factors, natural conditions such as wind and temperature also have a certain influence on haze, which makes the research of this paper not completely accurate. The relationship between haze pollution and economic development is actually a two-way relationship. This paper only studies haze from economic development and does not consider the reaction of haze to the economy, which provides direction for follow-up research. The research on haze pollution is a long-term process. The exploration of PM2.5 still needs long-term data accumulation and frontier research methods, and the influencing factors and paths of PM2.5 need to be further strengthened and improved.