1 Introduction

The environmental Kuznets curve (EKC) is a hypothesized relationship between various indicators of environmental degradation and income per capita. In the early stages of economic growth, pollution emissions increase and environmental quality declines, but beyond some level of income per capita (which will vary for different indicators) the trend reverses, so that at high income levels economic growth leads to environmental improvement. This implies that environmental impacts or emissions per capita are an inverted U-shaped function of income per capita. The EKC has been the dominant approach among economists to modeling ambient pollution concentrations and aggregate emissions since Grossman and Krueger (1991) introduced it a quarter of a century ago. The EKC has been applied to a wide range of issues from threatened species (McPherson and Nieswiadomy 2005) to nitrogen fertilizers (Zhang et al. 2015), and is even found in introductory textbooks (e.g. Frank et al. 2012), yet debate continues in the academic literature (e.g. Carson 2010; Kaika and Zervas 2013; Chow and Li 2014; Wagner 2015). Though the EKC is an essentially empirical phenomenon, most estimates of EKC models are not statistically robust. While concentrations of some local pollutants have clearly declined in developed countries and so have emissions of some pollutants, there is still no consensus on the drivers of changes in emissions.

This article critically reviews the EKC, discusses alternative approaches, and provides some empirical evidence that synthesizes the various approaches to modeling pollution emissions and concentrations avoiding various statistical pitfalls. This evidence shows that per capita emissions of pollutants rise with increasing income per capita when other factors are held constant. However, changes in these other factors may be sufficient to reduce pollution. In rapidly growing middle-income countries, the effect of growth overwhelms these other effects. In wealthy countries, growth is slower, and pollution reduction efforts can overcome the growth effect. On the other hand, growth might reduce the ambient concentrations of some pollutants after a turning point is reached. Understanding the nature of the factors that are not related to economic growth will be important for understanding what actually reduces pollution. Other recent reviews of the EKC literature (e.g. Carson 2010; Pasten and Figueroa 2012; Kaika and Zervas 2013) do not cover the recent empirical and theoretical developments reviewed in this paper that fundamentally change our understanding of the income-pollution relationship.

The following section sets the scene by reviewing the origin and history of the EKC and the debate that ensued on its policy implications. This is followed by reviews of theoretical models of the EKC and econometric techniques and evidence. I then turn to look at the main alternative approaches and a possible synthesis between them and the EKC. The final sections of the article present my own empirical evidence and conclusions.

2 Background

Prior to the introduction of the concept of sustainable development in the 1980s, the mainstream environmental view was that environmental impacts increase with the scale of economic activity, though either more or less environmentally friendly technology can be chosen. This approach is represented by the IPAT identity (Ehrlich and Holdren 1971), which is given by impact \(\equiv \) population*affluence*technology. If affluence is income per capita, then the technology term is impact or emissions per dollar of income. By contrast, a popular view of sustainable development was that development is not necessarily damaging to the environment and that poverty reduction is essential for environmental protection (WCED 1987). In line with this sustainable development idea, Grossman and Krueger (1991) introduced the EKC concept in their seminal study of the potential impacts of the North American Free Trade Agreement (NAFTA). Environmentalist critics of NAFTA claimed that the economic growth that would result from introducing free trade would damage the environment in Mexico (e.g. Daly 1993). Grossman and Krueger (1991) argued instead that increased income would improve environmental quality in Mexico. To support this argument, they carried out an empirical analysis of the relationship between ambient pollution levels in many cities around the world and income per capita. They found that the concentrations of various pollutants peaked when a country reached roughly the level of Mexico’s per capita income at the time.

The World Bank’s 1992 World Development Report (WDR) popularized the EKC, arguing that: “The view that greater economic activity inevitably hurts the environment is based on static assumptions about technology, tastes, and environmental investments” (p. 38) and that “As incomes rise, the demand for improvements in environmental quality will increase, as will the resources available for investment” (p. 39). Beckerman (1992) made this argument even more forcefully, claiming that “there is clear evidence that, although economic growth usually leads to environmental degradation in the early stages of the process, in the end the best—and probably the only—way to attain a decent environment in most countries is to become rich” (p. 482). Arrow et al. (1995) criticized this approach because it assumes that environmental damage does not reduce economic activity sufficiently to stop the growth process and that any irreversibility is not too severe to reduce the level of income in the future. In other words, there is an assumption that growth is sustainable.

Shafik’s (1994) research, which the WDR was based on, in fact showed that not all environmental impacts declined at high income levels. She found that both urban waste and carbon dioxide (\(\hbox {CO}_{2}\)) emissions rose monotonically with income per capita. Subsequent research confirmed the results for \(\hbox {CO}_{2}\) and has cast doubt on the validity of the EKC hypothesis for emissions of other pollutants too. The ambient concentrations of many pollutants have declined in developed countries over time with increasingly stringent environmental regulations and technological innovations. However, the mix of air pollution, for example, has shifted from particulate pollution to sulfur dioxide (SO\(_2\)) and nitrogen oxides (\(\hbox {NO}_{\mathrm{x}}\)) to \(\hbox {CO}_{2}\). Economic activity is inevitably environmentally disruptive in some way. Therefore, an effort to reduce some environmental impacts may just aggravate other problems.

Some early EKC studies showed that a number of indicators, including \(\hbox {SO}_{2}\) concentrations and deforestation, peaked at income levels around the then current world mean per capita income. The WDR implied that this meant that growth would reduce these impacts going forward. However, income is not normally distributed but very skewed, with much larger numbers of people below mean income per capita than above it. Therefore, it is median rather than mean income that is the relevant variable. Selden and Song (1994) and Stern et al. (1996) performed simulations that, assuming that the EKC relationship is valid, showed that global environmental degradation was set to rise for a long time to come. More recent estimates including those in this paper show that the pollution turning point is at much higher income levels and, therefore, there should no longer be confusion on this issue.

There has also been much debate about why some environmental impacts appear to follow an inverted U-shape curve. I address these questions in the next section.

3 Theory

If there were no change in the structure or technology of the economy, pure growth in the scale of the economy would result in proportional growth in pollution and other environmental impacts. This is called the scale effect. The traditional view that economic development and environmental quality are conflicting goals reflects the scale effect alone. Panayotou (1993) provided an early rationale for the existence of an EKC arguing that: “At higher levels of development, structural change towards information-intensive industries and services, coupled with increased environmental awareness, enforcement of environmental regulations, better technology and higher environmental expenditures, result in leveling off and gradual decline of environmental degradation.” (p. 1)

Therefore, the EKC can be explained by the following ‘proximate factors’:

  1. 1.

    An increase in the scale of production.

  2. 2.

    Different industries have different pollution intensities and typically, over the course of economic development the output mix changes. This is often referred to as the composition effect (e.g. Copeland and Taylor 2004).

  3. 3.

    Changes in input mix involve the substitution of less environmentally damaging inputs to production for more damaging inputs and vice versa.

  4. 4.

    Improvements in the state of technology involve changes in both:

    1. a.

      Production efficiency in terms of using less, ceteris paribus, of the polluting inputs per unit of output.

    2. b.

      Emissions specific changes in process result in less pollutant being emitted per unit of potentially polluting input.

The third and fourth factors are together often referred to as the technique effect (e.g. Copeland and Taylor 2004). These proximate factors may in turn be driven by changes in variables such as environmental regulation or innovation policy, which themselves may be driven by other more fundamental underlying variables. For example, the composition effect might be partly driven by comparative advantage. Developing countries are expected to specialize in the production of goods that are intensive in labor and natural resources, while developed countries would specialize in human capital and manufactured capital-intensive activities. Environmental regulation in developed countries might further encourage polluting activities to gravitate towards developing countries (Stern et al. 1996).Footnote 1

Various theoretical models attempt to explain how preferences and technology might interact to result in different time paths of environmental quality. There are two main approaches in this literature—static models that treat economic growth as simply shifts in the level of output and dynamic models that model the economic growth process as well as the evolution of emissions or environmental quality (Kijima et al. 2010).

In the typical static model, a representative consumer maximizes a utility function that depends on consumption and the level of pollution. Pollution is also treated as an input to the production of consumer goods. These models assume that there are no un-internalized externalities or equivalently that there is a socially efficient price for pollution. Pasten and Figueroa (2012) show that under the simplifying assumption of additive preferences:

$$\begin{aligned} \frac{dP}{dK}>0\hbox { if and only if}\quad \frac{1}{\sigma }>\eta \hbox { and } \textit{vice\; versa} \end{aligned}$$
(1)

where P is pollution, K is “capital”—all other inputs to production apart from pollution—\(\sigma \) is the elasticity of substitution between K and P in production, and \(\eta \) is the (absolute value of the) elasticity of the marginal utility of consumption with respect to consumption. The smaller \(\sigma \) is, the harder it is to reduce pollution by substituting other inputs for pollution. The larger \(\eta \) is, the harder it is to increase utility with more consumption. So, in other words, pollution is more likely to increase as the economy expands, the harder it is to substitute other inputs for pollution and the easier it is to increase utility with more consumption. This result also implies that, if either of these parameters is constant, then there cannot be an EKC where pollution first increases and then decreases. There are two main types of static theoretical models: those where the EKC is driven by changes in the elasticity of substitution as the economy grows and those where the EKC is primarily driven by changes in the elasticity of marginal utility (Pasten and Figueroa 2012).

Additive preferences imply that the marginal utility of consumption does not depend on the level of environmental quality. If preferences are non-additive but homothetic, the elasticity of substitution between consumption and environmental quality in the utility function, \(\phi \), becomes the critical parameter in place of \(\eta \) (Figueroa and Pasten 2015). The second inequality in (1) then becomes \(\phi \ge \sigma \).Footnote 2

Dynamic models of the EKC vary in their assumptions about how institutions govern environmental quality and there is no simple way to summarize their results. For example, in Jones and Manuelli’s (2001) model, the young can choose to tax the pollution that will exist when they are older, while Stokey (1998) assumes that countries do not adopt any environmental policies until they reach a threshold income level.Footnote 3

Brock and Taylor’s (2010) Green Solow Model is notable for taking into account more features of the data, such as abatement costs and the decline over time in emissions intensity, than previous research does. The model builds on Solow’s (1956) economic growth model by adding the assumptions that production generates pollution but that allocating some final production to pollution abatement can reduce pollution. It makes no explicit assumption about either consumer preferences or the pricing of pollution, assuming that a constant share of economic output is spent on abating pollution. The model implies that countries’ level of emissions will converge over time, though emissions may rise initially in poorer countries due to rapid economic growth. While the predictions of the Green Solow Model seem plausible given the recent empirical evidence, discussed below, it is not a very satisfying model of the evolution of the economy and emissions. First, it leaves assumptions such as the constant share of abatement in the costs of production unexplained. Second, there is actually little correlation between countries’ initial levels of income per capita and their subsequent growth rates, which is the mechanism that is supposed to drive convergence of income in the Solow model (Durlauf et al. 2005; Stefanski 2013).Footnote 4

Ordás Criado et al. (2011) also develop a neoclassical growth model, which finds that along the optimal path, pollution growth rates are positively related to the growth rate of output and negatively related to emission levels. The latter arises because utility is a function of both the consumption of goods and the level of pollution, and defensive expenditures can be used to reduce pollution. Econometrically, this model reduces to a beta convergence equation with the addition of an economic growth effect. This is a more elegant theoretical model than the Green Solow model, and empirically the model explains more of the variation in the data.

Both Brock and Taylor’s and Ordás Criado et al.’s dynamic models depart from the EKC hypothesis that growth itself eventually reduces pollution. Lopez and Yoon (2014) develop a dynamic model with endogenous growth with multiple outputs that does generate more traditional EKC predictions by allowing for a composition effect. To overcome this increase in complexity, the model makes many strong assumptions. The clean sector consists of an AK endogenous growth model, while the dirty sector consists of a constant elasticity of substitution (CES) production function that uses capital and pollution inputs and constant total factor productivity. The consumer also has CES preferences over the dirty and clean good but pollution damage enters welfare additively. The government internalizes the pollution externality with a pollution tax. As a result of the two different technologies in the two production sectors, the elasticity of substitution between pollution and capital will vary over time and, depending on the value of the elasticity of substitution in dirty production, the elasticity of substitution between the dirty and clean good in consumption, and the elasticity of marginal utility, an EKC may or may not be generated.

4 Econometric methods and evidence

The standard EKC regression model is:

$$\begin{aligned} E_{it} =\alpha _i +\gamma _t +\beta _1 Y_{it} +\beta _2 Y_{it}^2 +\varepsilon _{it} \end{aligned}$$
(2)

where E is the natural logarithm of either ambient environmental quality or emissions per person, Y is the natural logarithm of gross domestic product per capita, and \(\upvarepsilon \) is a random error term. i indexes countries and t time. Use of logarithms constrains projections of the dependent variable to be non-zero, which is appropriate except in the case of net rates of change of the stock of renewable resources, where, for example, afforestation can occur. The first two terms on the right-hand side of the equation are country and time effects. The country effects imply that, though the level of emissions per capita may differ over countries at any particular income level, the elasticity of emissions with respect to income is the same in all countries at a given income level. The time effects are intended to account for time varying omitted variables and stochastic shocks that are common to all countries. We can find the “turning point” level of income, \(\tau \), where emissions or concentrations are at a maximum, using:

$$\begin{aligned} \tau =\hbox {exp}\left( {-0.5\beta _1 /\beta _2 } \right) \end{aligned}$$
(3)

Usually the model is estimated with panel data, most commonly using the fixed effects estimator. But time-series and cross-section data have also been used, and a very large number of estimation methods have been tried including non-parametric methods (e.g. Carson et al. 1997; Azomahou et al. 2006; Tsurumi and Managi 2015), though these do not generally produce radically different results from parametric estimates.

Grossman and Krueger (1991) estimated the first EKC models as part of a study of the potential environmental impacts of NAFTA. They estimated EKCs for SO\(_2\), dark matter (fine smoke), and suspended particulate matter (SPM) using the GEMS dataset. This dataset is a panel of ambient measurements from a number of locations in cities around the world. They regressed each indicator on a cubic function in levels (not logarithms) of PPP (Purchasing Power Parity adjusted) per capita GDP, various site-related variables, a time trend, and a trade intensity variable. The turning points for SO\(_2\) and dark matter were at around $4000–5000 while SPM appeared to decline even at low income levels.

Shafik’s (1994) study was particularly influential, as the results were used in the 1992 WDR. Shafik estimated EKCs for ten different indicators using three different functional forms. She found that lack of clean water and lack of urban sanitation declined with increasing income and over time. Deforestation regressions showed no relation between income and deforestation. Local air pollutant concentrations, however, conformed to the EKC hypothesis with turning points between $3,000 and $4,000. Finally, river quality, municipal waste, and \(\hbox {CO}_{2}\) emissions per capita increased with rising income. Holtz-Eakin and Selden (1995) confirmed this result for \(\hbox {CO}_{2}\), which has stood the test of time despite a minority of contrary findings (Dobes et al. 2014).

Selden and Song (1994) estimated EKCs for four emissions series: SO\(_2\), NOx, SPM, and carbon monoxide (CO). The estimated turning points were all very high compared to the two earlier studies. For the fixed effects version of their model they are (in 1990 US dollars): SO\(_2\), $10,391; NOx, $13,383; SPM, $12,275; and CO, $7114. This showed that the turning points for emissions were likely to be higher than for ambient concentrations. In the initial stages of economic development, urban and industrial development tends to become more concentrated in a smaller number of cities, which also have rising central population densities, with the reverse happening in the later stages of development. So, it is possible for ambient pollution concentrations in urban areas to fall as income rises, even if total national emissions are rising (Stern et al. 1996).

There are several econometric problems that affect interpretation of EKC estimates. The most important of these are: omitted variables bias, integrated variables and the problem of spurious regression, and the identification of time effects. There is plenty of evidence that Eq. (2) is too simple a model and that other variables are also important in explaining the level of emissions. Early studies used data that was mostly from developed countries. Subsequent studies that used data sets with greater income variation found increasingly higher turning points (Stern 2004). Using an emissions database produced for the US Department of Energy (Lefohn et al. 1999) that covered a greater range of income levels than any previous SO\(_2\) EKC studies, Stern and Common (2001) estimated the turning point for SO\(_2\) emissions at over $100,000. Stern and Common (2001) showed that estimates of the EKC for SO\(_2\) emissions were very sensitive to the choice of sample. For OECD countries alone, the turning point was at $9000. Both Harbaugh et al. (2002) and Stern and Common found using Hausman test statistics that there is a significant difference in the regression parameter estimates when equation (2) is estimated using the random effects estimator and the fixed effects estimator. This indicates that the regressors are correlated with the country effects and time effects, which indicates that the regressors are likely correlated with omitted variables. Harbaugh et al. (2002) re-examined an updated version of Grossman and Krueger’s data. They found that the locations of the turning points for the various pollutants, as well as even their existence, were sensitive both to variations in the data sampled and to reasonable changes in the econometric specification.

Tests for integrated variables designed for use with panel data usually find that SO\(_2\) and \(\hbox {CO}_{2}\) emissions and GDP per capita are integrated variables. This means that we can only rely on regression estimates of (2) using panel (or time series) data if the regression exhibits cointegration.Footnote 5 Otherwise, the model must be estimated using another approach such as first differencing the data or the between estimator, which first averages the data over time (Stern 2010). Otherwise, the EKC estimate will be a spurious regression. As an illustration of this point, Verbeke and Clerq (2006) carried out a Monte Carlo analysis where they generated large numbers of artificial integrated time series and then tested for an inverted U-shape relationship between the series. They found an “EKC” in 40% of cases despite using entirely arbitrary and unrelated data series.

Using data on SO\(_2\) emissions in 74 countries from 1960 to 1990, Perman and Stern (2003) found that around half the individual country EKC regressions cointegrate using standard time series cointegration tests but that many of these had parameters with “incorrect signs”. Some panel cointegration tests indicated cointegration in all countries and some could not reject the non-cointegration hypothesis. But even when cointegration was found, the form of the EKC relationship varies radically across countries with many countries having U-shaped EKCs and a common cointegrating vector for all countries was strongly rejected. These results also suggest that the simple EKC model omits important factors.

Wagner (2008) noted that first generation panel cointegration tests are not appropriate when there are nonlinear functions of unit root variables or cross-sectional dependence in the data. Wagner (2008) uses de-factored regressions and so-called second-generation panel unit root tests to address these two issues. Wagner (2015) uses time series tests for nonlinear cointegration finding cointegration in only a subset of the 19 countries tested.

Vollebergh et al. (2009) pointed out that time, income, or other effects are not uniquely identified in reduced form models such as the EKC and that existing EKC regression results depend on the specific identifying assumptions that are implicitly imposed. Equation (2) assumes that the time effect is common to all countries. Vollebergh et al. instead assume that there is a common time effect in each pair of most similar countries. They argue that this imposes the minimum restrictions on the nature of the time effect. Instead, Stern (2010) uses the between estimator—a regression using the cross-section of time-averaged variables—to estimate the effect of income. This model is then used to predict the effect of income on emissions using the time series of income in each country. The difference between the prediction and reality is the individual time effect for that country. This approach is, though, particularly vulnerable to omitted variables bias. Chow and Li (2014) estimate a set of simple cross-section \(\hbox {CO}_{2}\) EKC regressions for 132 countries in each year from 1992 to 2004. Though they find a highly significant negative coefficient on the square of the log of GDP per capita (t = −22.9), the mean turning point in their sample is, in fact, $378,000, and, therefore, the emissions-income relationship is again effectively monotonic.

These recent studies find that the relationship between the levels of both SO\(_2\) and \(\hbox {CO}_{2}\) emissions and income per capita is monotonic when the effect of the passage of time is controlled for (Wagner 2008; Vollebergh et al. 2009; Stern 2010). Both Vollebergh et al. (2009) and Stern (2010) find very large negative time effects for SO\(_2\) and smaller negative time effects for \(\hbox {CO}_{2}\) since the mid-1970s.Footnote 6

Many studies extend the basic EKC model by introducing additional explanatory variables intended to model underlying or proximate factors such as political freedom (e.g. Torras and Boyce 1998), output structure (e.g. Panayotou 1997), or trade (e.g. Suri and Chapman 1998). On the whole, the included variables turn out to be significant at traditional significance levels (Stern 1998). However, testing different variables individually is subject to the problem of potential omitted variables bias and there do not appear to be robust conclusions that can be drawn from these studies (Carson 2010).

A popular view is that trade and the offshoring of pollution intensive activities from developed to developing countries might drive the EKC (e.g. Peters and Hertwich 2008). However, testing whether offshoring drives emissions reductions is not simple. The popular consumption based emissions approach does not answer this question. Developed countries might be net importers of emissions because developing countries use more emissions intensive technologies than do developed countries to produce the same products (Kander et al. 2015). Research has found a weak role if any for offshoring of production in reducing emissions in developed countries (Cole 2004; Stern 2007; Levinson 2010) though trade in electricity among U.S. states might have allowed a reduction in \(\hbox {CO}_{2}\) emissions in the richer states (Aldy 2005).

5 Alternative approaches

There are several alternative approaches to modeling the income-emissions relationship. The most prominent of these are decomposition analysis and convergence analysis.

Decomposition analysis breaks down emissions into the proximate sources of emissions changes listed in Sect. 3. The usual approach is to utilize index numbers and detailed sectoral information on fuel use, production, emissions etc. Stern (2002) and Antweiler et al. (2001) develop econometric decomposition models that require less detailed data, and cruder decompositions can employ the Kaya identity (e.g. Raupach et al. 2007). These studies find that the main means by which emissions of pollutants can be reduced are time-related technique effects and in particular those directed specifically at emissions reduction. General productivity growth or declining energy intensity has a role to play, particularly in the case of \(\hbox {CO}_{2}\) emissions where specific emissions reduction technologies are not yet implemented (Stern 2004). Though the contributions of structural change in the output mix of the economy and shifts in fuel composition may be important in some countries at some times, their average effect across countries seems less important quantitatively.

Those studies that include developing countries, find that changes in technology occur in both developing and developed countries. Innovations may be adopted first in higher income countries but seem to be adopted in developing countries with relatively short lags (Stern 2004). This is seen for example for lead in gasoline where most developed countries had substantially reduced the average lead content of gasoline by the early 1990s but many poorer countries also had low lead contents (Hilton and Levinson 1998). Lead content was much more variable at low income levels than at high income levels.

The convergence hypothesis proposes that pollution falls faster in countries with high levels of pollution than in countries with low levels of pollution or that it falls in the former and rises in the latter. If initially rich countries have high levels of pollution and poor countries low levels, then the outcome could resemble the EKC, but this is not necessarily the case. Also, under the convergence hypothesis, change in pollution is not necessarily dependent on economic growth. Pettersson et al. (2013) provide a review of the literature on convergence of \(\hbox {CO}_{2}\) emissions. There are three main approaches to testing for convergence: sigma convergence, which tests whether the dispersion of the variable in question declines over time using either just the variance or the full distribution (e.g. Ezcurra 2007); stochastic convergence, which tests whether the time series for different countries cointegrate; and beta convergence, which tests whether the growth rate of a variable is negatively correlated to the initial level. Using beta and stochastic convergence tests, Strazicich and List (2003) found convergence among the developed economies. Brock and Taylor (2010) find beta convergence across 165 countries between 1960 and 1998. However, using sigma convergence approaches, Aldy (2006) found convergence for the developed economies but not for the world as a whole. Using stochastic convergence, Westerlund and Basher (2008) reported convergence for a panel of 28 developed and developing countries over a very long period, but recent research using stochastic convergence finds evidence of club convergence rather than global convergence (Herrerias 2013; Pettersson et al. 2013).

The beta convergence approach has been heavily criticized (e.g. Quah 1993; Evans 1996; Evans and Karras 1996) because dependence of the growth rate on the initial level of the variable is insufficient, though necessary (Pettersson et al. 2013), for sigma convergence. Beta convergence could also be purely due to regression to the mean (Friedman 1992; Quah 1993). However, it is hard to believe that, for example, the high levels of emissions intensity in formerly centrally planned economies are simply random fluctuations. In any case, economic theory suggests that the initial level of emissions should be a factor in explaining emissions growth. Two models that do so are Brock and Taylor’s (2010) Green Solow model and Ordás Criado et al.’s (2011) model.

In Brock and Taylor’s empirical analysis, the growth rate of emissions is a function of initial emissions per capita and there is convergence in emissions per capita across countries over time. Depending on the specification chosen, this model explains 14–42% of the variance in average national 1960–1998 \(\hbox {CO}_{2}\) emissions growth rates. Ordás Criado et al.’s (2011) model reduces econometrically to a beta convergence equation with the addition of an economic growth effect. They estimate the model for a panel of 25 European countries from 1980 to 2005 using 5-year period averages. Parametric estimates for \(\hbox {SO}_{2}\) emissions find that the rate of convergence is −0.021, the emissions-income elasticity is 0.653, and that there are strong negative time effects, particularly in countries with initially high levels of income. For \(\hbox {NO}_{\mathrm{x}}\) the rate of convergence is −0.036 and there are again strong negative time effects, but the initial level of income has only a small and not very significant effect. Non-parametric estimates largely confirm their parametric estimates.

6 Testing alternative explanations

So, is the environmental Kuznets curve still a valuable approach to modeling the relationship between economic growth and environmental impacts or are the alternative approaches introduced in the previous section more powerful explanations? In this section, I describe a modeling approach that integrates the EKC and convergence approaches and present some recent results obtained by myself and coauthors using this model. This model is similar to the Ordás Criado et al. (2011) model with the addition of control variables and a term to test or measure the EKC effect. Figures 1 and 2 present the data used in these analyses.

Fig. 1
figure 1

Environmental Kuznets curves

Figure 1 plots mean values by country for a few decades for each of the variables against GDP per capita in 2005 PPP dollars. Per capita \(\hbox {CO}_{2}\) emissions from fossil fuel combustion and cement production (Boden et al. 2013) are almost linear in GDP per capita when plotted on log scales. There is little sign of an EKC effect in this raw data. On the other hand it does look like SO\(_2\) emissions (Smith et al. 2011) flatten out with increasing income but there is little sign of an inverted U shape curve. Both non-industrial GHG emissions (Sanchez and Stern 2016) and PM 2.5 (Particulate matter smaller than \(2.5 \upmu \)m diameter) concentrations (World Bank Development Indicators) show little relationship with income per capita. Clearly, additional variables or country effects would be needed to tease out any relationship.

An alternative way of visualizing the data, first used in Blanco et al. (2014), plots the growth rate of emissions per capita against the growth rate of income per capita. Figure 2 presents this alternative view. The three emissions series show some positive correlation between the growth rates of the two variables. Clearly the distribution of data is shifted downwards for SO\(_2\) and non-industrial emissions relative to \(\hbox {CO}_{2}\). This implies that the intercept of a simple regression is negative and so for a country with zero economic growth emissions will be declining. This indicates that there is a negative time effect. There does not seem to be much relationship between the growth rate of PM 2.5 concentrations and the rate of economic growth.

Fig. 2
figure 2

Growth rates

This growth rates representation of the data is used in the regression analysis. The general formFootnote 7 of the regression model is:

$$\begin{aligned} {\hat{E}}_i =\beta _0 +\beta _1 {\hat{Y}}_i +\beta _2 {\hat{Y}}_i Y_{i0} +\beta _3 E_{i0} +\beta _4 Y_{i0} +\mathop {\sum }\limits _{j=1}^J \beta _{j+4} X_{ji0} +\varepsilon _i \end{aligned}$$
(4)

where \({\hat{E}}_i =\left( {E_{iT} -E_{i0} } \right) /T\) and \({\hat{Y}}_i =\left( {Y_{iT} -Y_{i0} } \right) /T\). \(T+1\) is the time dimension of the data, the initial year is normalized to 0 so that T indicates the final year, and i indexes countries. E is the log of emissions per capita and Yis the log of GDP per capita. \(\mathbf{X}_{i0} =\left[ {X_{1i0} ,\ldots ,X_{Ji0} } \right] ^{\prime }\) is a vector of control variables, which are observed in the initial year. We deduct the sample mean from all the continuous levels variables prior to estimation. \(\beta _0 \) is, therefore, an estimate of the mean of \({\hat{E}}_i \) for countries with zero economic growth, the continuous levels variables at their sample means, and the dummy variables at their default value of zero. This is equivalent to the average change in the time effect in traditional panel data EKC models. If \(\beta _0 <0\), then in the absence of economic growth (and when the other variables are at their mean or default values) there is on average a reduction in emissions over time, and vice versa. Similarly, \(\beta _1 \) is an estimate of the emissions-income elasticity at the sample mean initial log income when all other continuous variables are at their sample mean and dummies are set to zero.

The third term on the RHS,\({\hat{Y}}_i Y_{i0} \), is the interaction between the rate of economic growth and the initial level of log income. This term is intended to test the EKC hypothesis that there is a level of income, the “turning point”, so that, ceteris paribus, economic growth is associated with a decline in emissions when income increases above this threshold.Footnote 8 For the EKC hypothesis to hold, \(\beta _2 \) must be significantly less than zero.

In order to estimate the EKC turning point we re-estimate (4) without demeaning \(Y_{i0} \). We then compute \(\mu =\hbox {exp}\big ( {-{\tilde{\beta }} _1 /{\tilde{\beta }} _2 } \big )\), where the tildes indicate the estimates from this non-demeaned model. If \({\tilde{\beta }} _1 >0\) and \({\tilde{\beta }} _2 <0\), then this will be the income turning point for maximum pollution. We use the delta method to compute the standard error of this turning point. If \(\beta _2 \) is significantly less than zero but the EKC turning point is statistically insignificant, we can conclude that while the emissions-income elasticity is lower for countries with higher GDP per capita, it does not become negative as would be required for an EKC downturn.

The fourth and fifth terms are the initial levels of emissions and income, which are intended to test convergence-type theories. If \(\beta _3 <0\) then there is beta convergence in the level of emissions per capita. If \(\beta _4 =-\beta _3 \) then there is beta convergence in emissions intensity without an additional effect of the initial level of income on the emissions growth rate.

We only use control variables that will be unaffected by the rate of economic growth so that we can measure the full effect of growth on emissions. The control variables used in each studyFootnote 9 reported here vary a little but fall into the following categories:

  • Legal and political organization: Dummy variables for non-English legal origin and centrally planned economies.

  • Climate and geography: Country averages of temperatures over the three summer months and the three winter months, annual precipitation, mean elevation, landlocked status.

  • Energy resource endowments: Fossil fuel endowments (Norman 2009), freshwater per capita, and forest area per capita.

  • Population density.

These control variables as well as the initial levels of emissions and income effectively allow the time effects to vary across countries as in Vollebergh et al. (2009). The combined term \(\beta _1 {\hat{Y}}_i +\beta _2 {\hat{Y}}_i Y_{i0} \) is then the growth effect and \(\beta _0 +\beta _3 E_{i0} +\beta _4 Y_{i0} +\sum _{j=1}^K \beta _{j+4} X_{ji0} \) is the time effect.

Table 1 reports results for the following variables from three papers that I have coauthored:

  • Anjum et al. (2014): 1971–2010 \(\hbox {CO}_{2}\) emissions from fossil fuel combustion and cement production (Boden et al. 2013) and 1971–2005 SO\(_2\) emissions (Smith et al. 2011).

  • Sanchez and Stern (2016): 1971–2010 industrial (energy use and industrial processes) greenhouse gas (GHG) emissions and non-industrial (agriculture, forestry, land-use change etc.) GHG emissions from the EDGAR database (version 4.2). Sanchez and Stern aggregated the various sources and gases using 100-year global warming potential coefficients.

  • Stern and Van Dijk (2016): 1990–2010 population weighted concentrations of PM 2.5 pollution (World Bank Development Indicators).

Anjum et al. and Sanchez and Stern use GDP per capita data from the Penn World Table version 8.0 (Feenstra et al. 2015), whereas Stern and van Dijk use World Bank GDP data. Both use 2005 prices.

Table 1 Regression results

The elasticity of pollution with respect to GDP at the sample mean is positive for all pollutants. It is smallest for PM 2.5 concentrations (0.2) and highest for the various industrial and energy related emissions (0.85–0.9). The elasticity for non-industrial GHG emissions is 0.45. The point estimate of the turning point is either out-of-sample (\(\hbox {CO}_{2}\) and SO\(_2\) emissions), very high (PM 2.5), or there is no turning point (industrial and non-industrial GHG emisions). In each case, the turning point is very imprecisely estimated. Pollution grows more slowly in countries with high initial levels of pollution or emissions intensity. This effect is strongest for SO\(_2\) and weakest for PM 2.5.

There are strong negative time effects for SO\(_2\) and industrial and non-industrial GHG emissions, ranging from 1.0 to 1.5% p.a. in a country with mean income and other variables and English legal origin. Time effects are insignificant for \(\hbox {CO}_{2}\) and PM 2.5. Among the control variables (not reported), non-English legal origin has significant negative effects for SO\(_2\) emissions. Effects on GHG emissions are insignificant or positive despite Fredriksson and Wollscheid’s (2015) finding that non-OECD French legal origin countries have stricter climate change policies than British legal origin countries.

Early findings (Selden and Song 1994; Stern et al. 1996) that concentrations of pollution were likely to have a lower income turning point than emissions are not strictly confirmed by these results, but clearly economic growth has much weaker effects on concentrations of pollution than on emissions. The results confirm later findings (Stern and Common 2001) that the effect of growth on emissions is monotonic. Like Ordás Criado et al. (2011), these results show that both economic growth and initial emissions or concentration levels are needed to explain pollution growth and that negative time effects are important for some pollutants. Stern (2004) argued that negative time effects might overcome the scale effect of growth in slower-growing higher income countries, while in faster-growing middle-income countries the scale effect dominated and emissions rose. This seems to be the case for SO\(_2\) and GHG emissions but not for \(\hbox {CO}_{2}\) or PM 2.5.

7 Conclusions and future research directions

The evidence presented in this article shows that over recent decades economic growth has increased both pollution emissions and concentrations, ceteris paribus. Negative time effects may be important for some pollutants such as SO\(_2\). Convergence is also important—higher initial levels of pollution are generally associated with slower growth in pollution.

How can these results best be used by policy-makers? The findings of this and other recent studies (e.g. Vollebergh et al. 2009; Ordás Criado et al. 2011) that the effect of economic growth on pollution is generally positive, reinforces the early concerns of Stern et al. (1996) and Arrow et al. (1995) that the EKC literature might encourage policy-makers to incorrectly de-emphasize environmental policy and pursue growth as a solution instead. On the other hand, the results show that pollution can be reduced over time and for some pollutants the effect of growth is less in richer countries. The best positive use of these results is to inform business as usual projections of pollution that are used as baselines to assess climate and other environmental policies (e.g. Clarke et al. 2014; Riahi et al. 2016).

On the theoretical front, the assumption of most static models that pollution externalities are optimally internalized over the course of economic development does not seem very plausible. There is still scope for developing more complete dynamic models of the evolution of the economy and pollution emissions. Empirical research so far has not provided very sharp tests of alternative theoretical models, so that there is still scope for work of this sort too. Therefore, I expect that in coming years this will continue to be an active area of research interest. New related topics also continue to emerge. One that has emerged in the wake of the great recession in North America and Europe is the question of what happens to emissions in the short run over the course of the business cycle. York (2012) found that \(\hbox {CO}_{2}\) emissions rise faster with economic growth than they fall in recessions but Burke et al. (2015) conclude that there is no strong evidence that the emissions-income elasticity is larger during individual years of economic expansion as compared to recession but that significant evidence of asymmetry emerges when effects over longer periods are considered. Emissions tend to grow more quickly after booms and more slowly after recessions.

Twenty-five years on, is the EKC still a useful model? The effects of growth on pollution diminish with increasing income for some pollutants and especially for the pollution concentration data that Grossman and Krueger (1991) first applied the EKC to. But it is generally other factors that actually reduce pollution. The naïve econometric approaches used in much of the literature are also problematic. Convergence effects are important for most pollutants and time effects are important for many. These effects and others should get more attention than the EKC effect as opposing forces to the scale effect when modeling aggregate pollution emissions. As these are the factors that actually reduce pollution more research into what actually drives these effects is needed.