Introduction

Damage to the environment, both in terms of quality and quantity, has recently been experienced to a greater extent than ever before (IPCC 2013). Acres of forest destroyed, amount of soil and organic matter eroded, number of wildlife lost and extent of biodiversity threatened are part of everyday news around the world (Hayward 2005; Weber and Kunzelman 2012). Reduction in air quality, emission of dangerous pollutants, apparent global warming and other environmental confrontations are often mentioned as a result of uncontrolled human interactions with environment. These interactions are diverse, but most importantly, they are the results of activities experienced at different stages of economic growth.

As described by Selden and Song (1994), agricultural modernization and industrialization, typical characteristics of the “take-off ” stage of economic growth, may initially lead to increased air and water pollution. On the other hand, a host of other favorable factors such as positive income elasticity for environmental quality, changes in the composition of production and consumption, higher levels of education and environmental awareness, and more open political systems would cause eventual reduction in certain pollutants. As the same time, it is recognized that it is possible to grow economically without experiencing environmental problems (Shafik and Bandyopadhyay 1992).

More specifically, accumulation of hazardous agricultural and industrial wastes and by-products, such as Particulate Matter 10 (PM10), is growing at an unprecedented rate in developing world (Panyacosit 2000; Guo et al. 2008). This is due to extensive use of agricultural chemicals, machines and equipments that are outdated which release more emissions than the standards; tax and resource incentives to transfer these technologies to developing countries; and less preference and hence lower investment to environment, to mention few factors (Wheeler 2001; Dinda 2004). In this respect, developing countries could hurt the environment on their way to economic growth. On the other hand, extensive use of vehicle for transportation and coal in generating electricity are often blamed for high volume of PM10 in developed countries. Negative environmental externalities associated with intensified agriculture, for instance, have been major policy issues in industrialized countries. There is no guarantee that environmental problems will subside once countries advance to industrialized stage (Lee et al. 2001).

The relationship between economic growth and environmental quality and quantity has been broadly explored in recent years. This relationship also has an important implication in crafting appropriate joint economic and environmental policy, depending on whether there is a negative or a positive impact of economic growth on environmental quality and quantity (Azomahou et al. 2006). One widely applied tool used to understand this relationship is the environmental Kuznets curve (EKC) hypothesis.Footnote 1

The main notion of EKC is that at initial stage of economic growth, pressure on environment increases until a country reaches a certain level of GDP per capita (usually called a threshold level of income). When the intrinsic values of environmental goods and amenities exceed the value of goods and services they used to produce, stress on environment diminishes at a certain pace. In other words, after attaining a certain threshold level of income, effort is given to restore damaged resources and conserve existing amenities.

Empirical studies on EKC hypothesis are generally based on parametric specifications, the popular forms are quadratic and cubic polynomials, for few pollutants (such as sulfur dioxide, carbon dioxide and nitrogen oxide) with little attention paid to model robustness (Grossman and Krueger 1993; Selden and Song 1994; Holtz-Eakin and Selden 1995; Carson et al. 1997; List and Gallet 1999; Shin and Hashimoto 2004; Hayward 2005; Auffhammer and Carson 2008). In such specifications, the perceived income-environment interactions have been aggregated and averaged to satisfy strict assumptions with minor or no effort to understand each and every interaction as it presides. In addition, different parametric specification could lead to different conclusion and ultimately inconsistent policy recommendation. In using parametric models, it is apparent that functional misspecification problems are likely to occur.

In recent years, however, nonparametric (NP) and semiparametric (SP) methods have been introduced for detecting the relationship between environment and economic growth (Taskin and Zaim 2000; Millimet et al. 2003; Bertinelli and Strobl 2005; Azomahou et al. 2006; Zapata et al. 2008; Nguyen-Van 2010). One essential advantage of these methods is that interaction can be found at the local level, with minimal assumptions and with no advance specified functional forms (Pagan and Ullah 1999). In addition, the interaction of the occurrence of events (for instance, the likelihood of low GDP per capita and low or high PM10 level) can be studied by finding a smoothing function with minimal pre-established assumptions. More specifically, NP model is better in capturing neglected nonlinearities in data that can facilitate flexibility in analyzing relationship. Another advantage of NP method is that its result can be used to identify appropriate counterpart parametric models.

This is one of a few studies that deal with the relationship between economic growth, measured using GDP per capita in purchasing power parity (PPP), and the level of pollution, measured using PM10, in NP method at a country level. In addition, we present SP and parametric specification analyses and results that help understand the resulting relationship based on different model specification.

The paper is organized as follows. The next section specifies NP estimation and model testing methods. “Data” section illustrates the main data and their descriptive statistics. In “Result and Discussion” section, NP and SP results are presented along with their analysis and interpretation. Finally, a Conclusion summarizes the main findings of the study.

Method

Estimation

Pagan and Ullah (1999) stated the potential advantages of NP econometric in applied research, owing to the method’s ability to adapt many unknown features of data. NP regression is performed without imposing a specific functional form a priori to capture unknown and nonlinear effects of the determinate on the dependent variable. The method also helps to find a smooth representation of the prevailing dynamics within data. In addition, structural changes in analyzing the relationship can easily be captured by employing appropriate NP methods.

The NP panel model for the relationship between PM10 pollution for country i in period t, \(({ PM}_{it} )\), and the country’s GDP per capita measured in purchasing power parity in the same period, \(({ GDP}_{it})\) can be represented by

$$\begin{aligned} { PM}_{it} = f_t\left( {{ GDP}_{it} } \right) +\mu _i + u_{it} , \quad i=1, 2, \ldots , N=158; \quad t=1,2,\ldots ,T=20, \end{aligned}$$
(1)

where \(f_t \) is an unspecified smoothing functional form that makes the regression estimate NP. The random effect \(u_{it}\) is assumed to be i.i.d with zero mean, infinite variance and independent of \({ GDP}_{it}\) for all i and t. In addition, a country-specific effect i is allowed to be correlated with \({ GDP}_{it}\) with unknown correlation structure which completes fixed effect specification.

A common empirical approach to eliminate a country-specific effect, \(\mu _i\), is to take the first difference of Eq. (1). The new specification becomes

$$\begin{aligned} { PM}_{it} -{ PM}_{i,t-1} =f_t\left( {{ GDP}_{it} } \right) - f_{t-1}\left( {{ GDP}_{i,t-1} } \right) + u_{it} -u_{i,t-1} . \end{aligned}$$
(2)

Taking the expectation of Eq. (2) and assuming that the expectation of the first difference of the error term is zero, \(E[ {u_{it} -u_{i,t-1} }]=0\), the remaining first difference can be further specified as

$$\begin{aligned} {\Psi }_t\left( {{\mathbf {x}}_{{\mathbf {it}}} } \right) := E\left[ {\mathbf{PM}\big |{\mathbf {x}}_{{\mathbf {it}}} = f_t\left( {{ GDP}_{it} } \right) - f_{t-1}\left( {{ GDP}_{i,t-1} } \right) } \right] , \end{aligned}$$
(3)

where \({\mathbf {x}}_{{\mathbf {it}}} = ({ GDP}_{it} , { GDP}_{i, t-1} )^{\prime }\); and \(\mathbf{PM} =\left( {{ PM}_{it} - { PM}_{i,t-1} } \right) \) with dimension \(N\left( {T-1} \right) \).

For estimating the NP model, we follow the procedure described in Azomahou et al. (2006). Let \(\mathbf{X}^{*}\) is a \(N\left( {T-1} \right) X2 \) matrix organized in the same way as \(\mathbf{PM}\) with \({\mathbf {x}}_{it}^{\prime } =\left( {x_{i1, \ldots , } x_{it} } \right) ^{\prime }\) as typical row. With \(\ell \) a \(N\left( {T-1} \right) \) vector of 1, let \(\mathbf{X}= \left( {\ell , \mathbf{X}^{*}} \right) \). Let \( K_h (.)\) be a bivariate kernel, smoothing function with bandwidth \(h = \left( {h_1 , h_2 } \right) ^{\prime }\) corresponding to \({ GDP}_{it} \) and \({ GDP}_{i, t-1} \), respectively. The bandwidth parameter could be determined by ad-hoc, plugin or cross-validation methods, depending on the smoothness of the fitted relationship.

The next step is designing the estimation procedure for the item described in the LHS of Eq. (3), \({\Psi }_t\left( {{\mathbf {x}}_{{\mathbf {it}}} } \right) \). This can be done by formulating a local linear kernel estimator, \({\widehat{{\Psi }}}_{x_o }\), given by

$$\begin{aligned} {\widehat{{\Psi }}}_{x_o } = e^{\prime }\left( {\mathbf{X}}^{\prime } Z_{{x_{o} }} \mathbf{X} \right) ^{{-1}} \mathbf{X}^{\prime }Z_{{x_{o} }} {\mathbf{PM}}, \end{aligned}$$
(4)

where

$$\begin{aligned} \begin{array}{c} e=\left( {1,0,0} \right) ^{\prime };\\ Z_{x_o } =diag\left[ {K_h \left( { \mathbf{X}_{\mathbf{1,1}}^*-x_o } \right) , \ldots ,\left( {\mathbf{X}_{\mathbf{N},\mathbf{T}}^*-x_o } \right) } \right] ; \end{array} \end{aligned}$$

and \(x_o \) is a point of interest. The kernel function needs to satisfy some conditions before it can be used for smoothing purpose. One of the important features is that kernel is a density estimator that integrates to one.Footnote 2

Let \( \left( {\frac{\mathbf{X}^{*}_{\mathbf{it}} -\hbox {x}_{\mathrm{o}} }{\hbox {h}}} \right) ={ \Psi }\), the standard normal kernel, \(\hbox {K}\left( {\Psi } \right) ,\) can be defined as

$$\begin{aligned} \hbox {K}\left( {\Psi } \right) =(2{\uppi })^{-\frac{1}{2}}\hbox {exp}\left[ {\frac{-1}{2}({\Psi }^{2})} \right] . \end{aligned}$$
(5)

The rationale for using kernel regression is that the function gives more weight to observations that are closer to the point of interest, \(x_o\), but the weight is decreasing for further tails within a particular window of bandwidth. The objective of this functional specification is to get a smooth depicting relationship between the two variables in all evaluation points without a particular local or global restriction. This is in contrast to, for instance, parametric specification on which quadratic function allows one turning point as a global restriction.

Testing

The Li and Wang (1998) test, or sometimes called NP consistent model specification test, will be used to check whether or not parametric models can be rejected against NP model. The test is based on the residuals of the parametric model. The null hypothesis is the first-difference version of parametric models and the alternative is NP specification shown in Eq. (1). The statistic for this one-sided test has an asymptotic standard normal distribution under the null of correct specification of parametric model (Azomahou et al. 2006). Kernel regression significance test will be performed for a consistence test for the significance of the explanatory variable in NP regression setting that is similar to a simple t test in a parametric regression setting. This test is based on Hayfield and Racine (2008).

Finally, statistical test of poolability specified by Racine (2009) will be conducted. The basic framework of this method is to introduce an unordered categorical variable \(\delta _i \) for \(i=1,2,\ldots ,N\) and nonparametrically estimate \(E\left( {{ PM}_{it} \big |{ GDP}_{it} ,\delta _i } \right) =\hat{{f}}\left( {{ GDP}_{it} ,\delta _i } \right) \) using mixed categorical method described in Hayfield and Racine (2008). Let \(\hat{{\tau }}\) denotes the cross-validation smoothing parameter associated with i. If \(\hat{{\tau }}=1\), the data is then poolable for estimation. Under the assumption of poolability, the functional form can be written as \({ PM}_{it} =f\left( {{ GDP}_{it} } \right) + \mu _i +u_{it} \) or \(f\left( . \right) \) which does not change over time. If \(\hat{{\tau }}=0\) or close to 0, then the data is nonpoolable and effectively estimated using Eq. (1). Finally, if \(0<{\hat{{\tau }}}>1\) one may interpret this as a case in which the data is partially poolable.

Data

Panel data for 158 countries from 1991 to 2010 is collected from different sources. PM10 country level (the urban-population weighted PM10 levels in residential areas of cities with more than 100,000 residents), the proportion of urban population, total population, and GDP per capita based on purchasing power parity (PPP) in constant 2005 international $ are obtained from the World Bank (The World Bank 2013). Trade openness at 2005 constant prices (%) is collected from Penn World Table Version 7.1 (Heston et al. 2012). In addition, coal consumption (in thousand short tons) is collected from the Energy Information Administration (EIA) and converted to coal consumption per capita for analysis (EIA 2013).

The level of PM10 pollution varied from 6 micrograms per cubic meter \((\upmu \hbox {g/m}^{3})\), the level of pollution in Equatorial Guinea and Belarus in the late 2000s, to \(367 \,\upmu \hbox {g/m}^{3}\), in Armenia in the early 1990s, with a global average of \(59\, \upmu \hbox {g/m}^{3}\), as shown in Table 1.Footnote 3 GDP per capita PPP was recorded from $102 (measured in constant 2005 international $ in Liberia in 1995) to $74,021 (in Luxembourg in 2007) with the global average of $10,865.

Table 1 Descriptive statistics for 1991–2010

In addition to income per capita-pollution variables, this study includes trade openness, coal consumption per capita and proportion of urban population as control variables in SP setting. Singapore with trade openness value of 440 % was the most open economy, while Sudan (9.5 %) was the least open economy. The world’s average international trade openness was about 83 %. The highest coal consumption per capita (7.9 short tons per capita) was recorded in Czech Republic in 1993, while the global average was around 1 short ton per capita. Finally, 53 % the world population lived in urban areas.

Result and Discussion

NP Result

The NP regression significance test described in Hayfield and Racine (2008) is performed for the explanatory variable. It is found that GDP per capita is significant at all conventional levels (where \(\hbox {p}\,<\,0.001\)) for the local linear NP model with plugin bandwidth selection method.Footnote 4 For the Li and Wang (1998) NP model consistence specification test, the test statistics is 24.30 for cubic pooled estimate. This result reveals that the null hypothesis of correct parametric specification is rejected for cubic models. Hence, the two tests show that NP model is better than the parametric specifications for studying the relationship between PM10 and GDP per capita.

The estimated results for nonparametric and comparable parametric regressions are shown in Fig. 1 and Table 2. For low-income countries,Footnote 5 with GDP per capita less than $1000 (PPP in constant 2005 international $), the level of pollution rises as these countries experience economic growth, which is in line with EKC hypothesis. For middle-income countries, with GDP per capita between $1000 and $12,000, the level of pollution declines as these countries grow, as supported by EKC hypothesis. For many high-income countries, with GDP per capita between $12,000 and $32,000, the level of pollution declines as these countries grow, as supported by EKC hypothesis. For some high-income countries with GDP per capita more than $32,000, however, the trend shows that the level of pollution increases as income rises. Unlike for low- and middle-income countries, the result for high-income countries is inconsistent with EKC hypothesis. Therefore, the estimated result with pooled-data setting indicates an “N-shaped” relationship between PM10 pollution level and the level of economic growth, instead of an inverted “U-shaped”.

Fig. 1
figure 1

Nonparametric and parametric estimate using pooled and panel data

The challenge for complete existence of EKC comes from a situation that a decline in the level of pollution for middle-income countries is followed by unprecedented pollution surge for countries with GDP per capita above $32,000. For some high-income countries and more importantly for oil-producing high-income countries (such as United Arab Emirates, Kuwait, Brunei and Qatar) the trend for the level of pollution is exceptionally higher than what is normally expected from EKC hypothesis.

In a nonpooled data setting, Fig. 2 shows the pattern of pollution for different time period along the income level. Low-income countries experience a consistent increase in the level of pollution as they grow, but the level of pollution in the late 2000s is lower than what was recorded in the early 1990s. On the income side, low-income countries have higher income in the late 2000s than in the early 1990s. Middle-income countries experience a decreased level of pollution as they grow. In addition, they emit lower level of pollution in the late 2000s than in the early 1990s.

Fig. 2
figure 2

Nonparametric estimate using panel data

For high-income countries, the pattern of pollution varies for different time period. These countries experience an increased and a decreased level of pollution in the 1990s and in the late 2000s, respectively. Generally, in the late 2000s, high-income countries release low level of pollution than the other two income groups. Hence, seeing from different time perspective, an inverted U-shaped relationship between pollution and economic growth, which is postulated by EKC hypothesis, has happened at the most recent time periods.

With the adjusted R-squared values shown in Table 2, NP regression with panel data (nonpooled) estimate provides better result than other regression estimates. This can also be confirmed through statistical test of poolability specified by Racine (2009). The cross-validated smoothing parameter associated with individual country dummy is calculated and the value of the bandwidth is very close to zero (0.00034); hence, the data is nonpoolable. That means the given countries’ variable of interest is not stable over time, as also seen from Fig. 2. Consequently, the unspecified functional form of NP regression is not generalized using a single setting or constant functional form. Figure 2, which presents the level of pollution at different time period, is a better representation for the relationship between economic growth and the level of pollution.

Table 2 Regression results using pooled and panel data

Another advantage of NP results is to provide a functional relationship for the variables under consideration. Cubic regression is found to better represent the relationship using parametric specification than other polynomials. Both parametric-panel between and ordinary least square (OLS) estimates represent alternative to nonparametric estimate as seen in Table 2. More importantly, the estimated coefficients indicate that the level of pollution increases as income rises, as seen from the coefficient of GDP per capita (11.054 for between and 8.641 for OLS) but at a decreasing rate (\(-\)3.087 for between and \(-\)2.431 for OLS).

On the other hand, parametric-within estimate (fixed effect) provides a decreasing trend in the level of pollution as income per capita grows, as shown in Fig. 1. Nevertheless, the coefficients for this estimate are statistically insignificant. As shown in both Fig. 1 and Table 2, this functional relationship depicts completely different story and may lead to a different policy formulation than the other estimates. Hence, NP estimate can correct the misspecification error and suggest a corrected alternative that explains the existing relationship in parametric settings.

SP Result

Even though NP method has potential benefits of providing set of flexible tools for analyzing unknown regression relationships, it is not free from shortcomings. One of the challenges of embracing NP method is that it is rather difficult to account for more explanatory variables at the same time (Cooper 2002). An alternative approach is to use SP method.

EKC reflects an unconditional account of how the level of pollution changes with economic growth of a country. To the same extent, economic and noneconomic factors could affect the level of pollution as a country progresses. Along with economic growth, there are, however, other factors that could influence the level of pollution. These factors could have a direct and/or an indirect impact on the interaction between the level of pollution and economic growth. Therefore, by following some previous research works and by relying on economic theory about the level of pollution and economic growth, we have included few additional controls to investigate how their presence affect EKC hypothesis. The specification presented in Eq. (1) is modified to SP specification with additional controls, \(Z_{it} \). SP specification becomes

$$\begin{aligned} { PM}_{it} = m_t\left( {{ GDP}_{it} } \right) + Z_{it}^{\prime }\beta + \varepsilon _{it} , \end{aligned}$$
(6)

where \(\varepsilon _{it} \) are random disturbances, \(\varepsilon _{it} = \mu _i + u_{it} \) and \(E[ {\varepsilon _{it} \big |{\mathbf {x}}_{{\mathbf {it}}} } ]=0\). Equation (6) can be estimated by using a method developed by Verardi and Debarsy (2011).

Since a seminal work by Kraft and Kraft (1978), energy becomes another nexus on the analysis of EKC. One of the controls in this case is coal consumption per capita where its main by-product is PM10 (Guo et al. 2008). In this study, coal consumption per capita, that is, a country’s total coal consumption divided by total population at a given time, is used as a control in SP setting. The result is shown in Fig. 3a and Table 3 (Model a).

Fig. 3
figure 3

Semiparametric estimate results

With the inclusion of coal consumption per capita, the relationship resembles NP nonpooled data result, partially supports the existence of EKC for GDP per capita less than $32,000. Another important finding is an inverse relationship between coal consumption per capita and the level of PM10 pollution, keeping GDP per capita constant, as shown in Table 3 (Model (a) negative and statistically significant coefficients). Hence, it can be inferred that more coal consumption per capita is not directly associated with high level of PM10, keeping GDP per capita constant.

Another control is country’s openness to international trade (Frazer 2006). The effect of trade openness on environmental quality has attracted the attention of policy makers and been dealt with different settings and institutional frameworks since the original paper of Grossman and Krueger (1991). For instance, Dinda (2004) stated that international trade is one of the most important factors that explain EKC. Managi et al. (2009) investigated whether trade could improve environmental quality and found that the beneficial effect varies depending on the pollution and the country under consideration. For instance, trade is beneficial in improving the environment of OECD countries. The result of SP specification is given in Fig. 3b and Table 3 (Model b), the level of pollution decreases as trade openness increases, keeping GDP per capita constant. This is relatively similar with the finding that trade openness exhibits a negative, statistically significant relationship with pollution (Cole 2004).

Table 3 Semiparametric regression results

PM10 pollution is mostly ubiquitous in urban areas. The economic activities of urban population (such as vehicle exhaust) are among the major sources of PM10 (Fischer et al. 2000). As seen in Fig. 3c and Table 3 (Model c), there is a direct relationship between the proportion of urban population and PM10 while maintaining comparable EKC results with NP nonpooled data results.

SP regression result with all the control variables is shown in Fig. 3d and Table 3 (Model d). It has all the features of individual SP result: coal consumption and trade openness have statistically significant and indirect effect on the level of PM10 pollution; while the proportion of urban population has statistically insignificant but direct effect on the level of PM10 pollution, keeping GDP per capita constant. Hence, it can be inferred that SP specification reveals similar GDP per capita-PM10 relationship found in NP model.

Conclusion

When it comes to the extent of economic growth and pollution, the first assumption that least developed countries start to pollute the environment when they begin to grow economically is found to be valid using both NP and SP results. After reaching a certain threshold level of income, however, low-income countries released lesser level of PM10 pollution. This phase is followed by a declined level of pollution for middle- and many high-income countries.

For some oil-producing high-income countries the trend for the level of pollution is exceptionally higher than what is normally expected from EKC hypothesis. These countries experience an increased level of pollution during the 1990s, but a diminished level around the late 2000s as their income grows. This result is not consistent with ECK hypothesis.

Model significance test reveal that NP method provides statistically better estimate for the relationship between GDP per capita and PM10. In addition, the relationship can be expressed in cubic parametric specifications, widely used in previous studies. Adding more control variables in SP setting provides similar results with NP ones. Among the control variables, it is found that the higher the proportion of urban population, the higher is the level of pollutions, keeping GDP per capita constant.

The level of pollution is higher in low-income countries than in both middle- and high-income countries. Most of the world population continues to live in urban areas; and in countries with higher proportion of urban population, the level of PM10 pollution is found to be high. Hence, economic and demographic intervention and policies would be designed to address the problem of PM10 pollution. In addition, some high-income countries may experience an increased level of pollution as their income grows. This finding indicates that high level of PM10 pollution may occur in countries where environmental amenities and institutions are well-developed. The general hypothesis that pollution decreases as countries reach a certain threshold level of income may not be acceptable for all countries.