Abstract
This paper tests the hypothesis that democratisation is conducive to less environmental depletion due to human activity. Using interrupted time series design for a panel of 47 transition countries and two indexes of pollution, CO\(_{2}\) emissions and PM10 concentrations, I find that democracies and dictatorships have two different targets of environmental quality, with those of democracies higher than those of dictatorships. Income inequality may as well alter this targets, but with opposite effects in the two different regimes.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The relation between various measures of pollution and democracy is highly debated. Two different views about the effect of democracy on the environment have been put forward: a dominant thinking of the 1970s was that democracy and its associated liberties to pollute, consume and procreate would generate ecological catastrophes, and the inability of governments to control the “tragedy of the commons” is an evident example of such a failure of democracies in the management of environmental issues (Desai 1998; Hardin 1968). Democracy is also known to be a means for redistributing income and power to the poor, so to those whose consumption has an higher marginal impact on the environment; Rodrik (1999) for example, presents evidence that democracy is associated with an higher share of wages in GDP and thus lower Milanovic (2000) finds that democracies with greater inequality of factor income redistribute more in favour of the poorer.
At present, however, there is a growing empirical research on the possible affinities between democracy and ecology. There are several reasons for such a positive relationshipFootnote 1, basically: (1) democracies respect individual rights and so environmentalists are free to market their ideas and transform them into environmental legislation; (2) the necessity of democratic government to be elected (or reelected) makes them more responsive to their citizens; (3) open political systems are more likely to learn from scientists and other concerned citizens than are autocracies; (4) democratic states tend to cooperate with each other within international environmental agencies, and finally, (5) because democracies all have free market economies, business in the market can be subject both to environmental incentives and sanctions (Dasgupta and Mäler 1995; Schultz and Crockett 1990; Payne 1995). Furthermore, democracies respect human life more than autocracies and therefore they are more responsive to life-threatening environmental degradation (Gleditsch and Sverdrup 2003).
In the empirical literature, Bhattarai and Hammig (2001), for example, use a measure of institutional quality (measured by an index of political rights and civil liberties) to account for the role of different policy regimes in the causes of deforestation in Latin America, Africa and Asia. Torras and Boyce (1998) use a similar technique for a panel data survey of a variety of air and water pollution indicators. Gallagher and Thacker (2008) introduce a concept of “stock of democracy” to study its implications through time. They all find positive evidence that civil liberties and political rights are associated with more pro-environmental behaviour.
All these empirical contributions, however, are not immune from criticisms. In general, cross country studies fail to establish causation between regimes and various measures of pollution because of the positive correlation between income and democratic institutions (Barro 1996a, b, 1999; Przeworski and Limongi 1993) and between income and clean technologies (Hausman 1979). Empirical works which use composite indexes of civil liberties and political rights do not distinguish clearly democracies from other types of regimes. Regressions in panel data are usually done by assuming fixed effect models, which can solve the issue of accounting for time invariant heterogeneity, but when distinction between democracies and autocracies is done by using dummy variables, they do not disentangle the effects of democracy from other effects in play during periods of democracy. It is well known that other factors may influence the performance of a country. In the ideal situation, one could observe the same country simultaneously under two different types of regime, and this would guarantee that all the other variables are held constant. But since this is not feasible (what one can observe is the same country at different points in time), a simple regression, even if it is able to control for time invariant unobservables, could be biased due to the time variant unobservable variables. Several effects may cause this bias: History, such as events which occurred during the period considered providing an alternative explanation of a given phenomenon, maturation, or processes through which the country produces changes as a function of time per se (like for example the maturation of technology towards less energy intensive production processes) or even instability, referred at the fact that all time series are unstable even when no treatments are applied are all typical unobservable effects that may bias these estimates. The paper by Gallagher and Thacker (2008) partially solves the issue of the time-variant unobservables by introducing the concept of “stock of democracy” but they fail to check what is the effect of autocracy on the level of CO\(_{2}\) emissions. Observing that democracy has a positive effect on the environment does not exclude a priori that dictatorship has not.
In this paper, I make use of interrupted time series (ITS) design to study the effect of democratisation on two indicators of air quality, CO\(_{2}\) emissions and levels of PM10 concentration, in a panel of 47 transition countries during the period 1950–2002 (for CO\(_{2}\)) and 1990–2002 for PM10 concentrations.
The method of ITS is a powerful quasi-experimental approach for evaluating the effects of interventions or naturally occurring events. In the literature, it is traditionally proposed for single time series studies (Shadish et al. 2002; Campbell and Stanley 1966; Cook and Campbell 1979), and only recently it has been applied in panel data contexts. Allison (1994) develops several methods for using multiwave panel data to estimate the effects of interventions by synthesizing the literature on interrupted time series with econometric treatment of pooled time series, cross section data. Bloom (2003) uses ITS approach for studying how a school reform affects three indexes of student performance for different cohorts of students, Çíftçí (2005) examines the collective responses of the Europeans to different environments of integration by using multiple ITS design with panels of data, and Gao et al. (2007) use ITS analysis to study the impact of the introduction of critical care outreach services in 172 different critical care units in England between 1996 and 2004.
Combining ITS design and panels of data allows to take the advantages of both those tools: panel data estimation methods allow to control for unobservable country-specific effects that result in a missing-variable bias in cross-sectional studies. I included in my sample only transition countries because I believe it is a natural and transparent way to study the effect of democratisation. The inclusion of countries which have not shifted regime may bias the estimate due to the fact that other events (and not democracy) may be responsible for such a change (like, for instance, technological progress). This effect may be more easily disentangled in transition countries due to the possibility of running segmented regressions, as it will become clearer later.
Segmented regression analysis of ITS data allows us to assess, in statistical terms, how much an intervention changed an outcome of interest, immediately and over time, instantly or with delay, and whether factors other than the intervention could explain the change.
So, contrary to the other methods commonly used in the literature, this approach is suitable to measure the effect of democratisation through time, taking into account of other time-variant effects, and moreover, it is capable of identifying whether other factors (possibly unknown) are driving the results.
This paper is organised as follows: Sect. 2 motivates the paper, Sect. 3 describes the dataset used, Sect. 4 illustrates the estimation techniques and results, and Sect. 5 concludes.
2 Motivations
Does democracy really benefit environmental quality? Using a sample of 47 transition countries, I observe that during the period 1990–2002 the average concentration of PM10 recorded during spells of dictatorship is about 1.36 times bigger than the concentration recorded during democratic periods, despite the average level of GDP is 2.25 times bigger during democracy than during dictatorship spellsFootnote 2.
The average intensity of CO\(_{2}\) emissions produced per unit of GDP is 1.17 times larger in periods of dictatorship than in periods of democracy.Footnote 3 Several countries show clearly a decrease in the intensity of emission per unit of GDP in proximity to the regime shift:
Figure 1 shows, for Colombia, South Africa, Spain and Bulgaria, the actual (dotted) and fitted (line) levels of intensity of emissions (expressed in tons of carbon per unit of GDP). The vertical line in each subfigure represents the date of the regime shift. All the four countries have experienced a transition to democracy after long periods of dictatorship. After the regime shift we observe a reverse pattern for emissions; while before democratisation the tendency is to increase the intensity of CO\(_{2}\) emissions in production, later we observe a decline, which is persistent through time.
Variations in percapita emissions are consistently reduced during democratic periods as the following simple fixed-effects regression that includes all the sample of 47 countries and 53 years shows (standard errors in parenthesis)Footnote 4:
with \(\Delta \mathrm{{CO}_2}_{it}\) denoting predicted variations in the level of percapita CO\(_{2}\) emissions expressed in Kg of carbon occurred between \(t-1\) and \(t\) for country \(i\), \(\Delta GDP_{it}\) variations in the level of percapita income occurred between \(t-1\) and \(t\) for country \(i\), and \(Dem_{it}\) is a dummy variable coded 1 during periods of democracy, and 0 otherwise. All the estimated coefficients are significant at 1 % level.
As previously anticipated, variations in CO\(_{2}\) emissions are consistently reduced during periods characterised by democratic institutions. The coefficient related to the variable \(Dem\) denotes exactly how the variation in the level of emissions decreases as a consequence of democratisation. In other words, it represents the average “kink” in the two fitted lines before and after the regime change, for the whole sample of countries.
3 Data and Descriptive Statistics
I consider percapita CO\(_{2}\) emissions and PM10 concentrations as dependent variables. Data on CO\(_{2}\) are from Marland et al. (2008) and are available for the period 1950–2002. They refer to emissions from burning fossil fuels and manufacturing cement, expressed in Kg of carbon. Their estimates are based on energy data from the United Nations and cement data from the US Geological Survey (for an analytical description of the estimation procedures see Marland and Rotty 1984). Data for PM10’s concentration are available for the period 1990–2002 and the source is World Bank-World Development indicators’ Database. Among the independent variables, I consider percapita GDP, from Maddison (2009) and two trends, one for democracy (\(D\)) and another for dictatorship (\(A\)), extrapolated from Przeworski’s dataset (for a list of variables and sources, refer to Table 4—Data and Sources in “Appendix”). \(D\) (\(A\)) represents the number of consecutive years since the last regime change to democracy (dictatorship). It is coded 0 the year of transition to democracy (dictatorship), and 1, 2, 3 after one, two, three consecutive years of democracy (dictatorship) and so on. During periods of dictatorship (democracy) it is coded zero. The last independent variable used is Inequality, from the EHII dataset of the University of Texas Inequality Project (UTIP). This measure is an estimate of the Theil’s index of household’s income inequality derived from econometric relationships between UTP-UNIDO (dataset that calculates the industrial pay inequality measures based on the UNIDO Industrial statistics), other conditioning variables, and the World Bank’s Deninger and Squire DatasetFootnote 5.
The reason for including this variable into the regression model relies mainly on the fact that it may “alter” the underlying decision process about the environmental policies undertaken in the two different regimes and—in this econometric model—its omission could bias the sign or the magnitude of the coefficients attached to democracy and/or autocracy. It is indeed well known that, from a pure theoretically point of view, in a direct democracy, a high inequality reduces the median voter’s income and therefore the demand for environmental quality. In “real” representative democracies however, high levels of income inequality are often associated to a scarce participation of poorer people to the electoral process, allowing small (and often rich and strongly interested) groups of people to take all the decisions. This would make a democracy resemble from several aspects to an autocracy. During autocratic periods, however, if increased inequality is in favour of the dictatorFootnote 6, it could increase environmental quality since the dictator values the environment more, compared to consumption, with respect to the population average, or simply because the opportunities to substitute environment with private consumption become scarce. So, the inclusion of this variable is done with the aim of reducing the bias that income inequality may generate in the estimation of the effects the two different regimes have on the level of CO\(_{2}\) emissions and PM10 concentrations.
Table 5 in “Appendix” lists all the countries included in the sample, and their respective regime changes to and from democracy. They are mainly developing countries, with the exception of the countries distinguished by an asterisk which are considered by IMF (World Economic Outlook updated to April 2009) developed or advanced. The following Table 1 shows the summary statistics of the primary explanatory variables:
4 Estimation Techniques and Results
4.1 Diagnostic Tests and Model Selection
The first step to decide the best way to estimate a relationship of interest, is to choose the appropriate econometric tool. In order to do so, it is necessary to verify the characteristics of the data generating processes, by testing for eventual unit root. I test the occurrence of unit root on the series percapita CO\(_{2}\) emissions, percapita GDP, PM10 concentrations and inequality using four tests for panel data: Levin, Lin and Chu (LLC), Im, Pesaran and Shin (IPS), ADF and PP-Fisher tests. All these tests suggest that the data generating processes of CO\(_{2}\) emissions and GDP have unit root, while PM10 concentration and inequality appear stationary.Footnote 7
For what concerns the dependent variable PM10, given its stationarity, it cannot be cointegrated with any other variable and therefore a standard least squares dummy variable (LSDV) model for testing the effects of democracy on its level is appropriate, as it should produce stationary (possibly normal) residuals.
For what concerns instead the model for CO\(_{2}\) emissions, I cannot say a priori if the same LSDV model is appropriate, and a test for cointegration is mandatory so to exclude possible problems related to spurious regression.Footnote 8 If indeed the two series were cointegrated, standard OLS techniques would produce superconsistent estimates of the parameters, but if they were not, the model would suffer of the spurious regression problem with invalid inference of the parameters of interest.
In the Engle–Granger approach, cointegration is tested by verifying that the residual series generated by the regression of one I(1) variable over another I(1) variable is stationaryFootnote 9. To verify whether percapita CO\(_{2}\) emissions and percapita GDP are cointegrated, I use the approach suggested by Kao (1999). The global ADF statistic for Kao residual cointegration test with the null of no cointegration shows a \(t\) stat of 3.499820 with a \(p\) value of 0.0002, so this test strongly suggests that those two series are cointegrated.
Maddala and Shaowen (1999) combined tests from individual cross-sections (trace and maximum eigenvalue tests) reject the hypothesis of absence of cointegration at 1 % level, and, as expected, accept the hypothesis that there exists one cointegrating relation.Footnote 10 So, substantial evidence points out that a cointegrating relation between emissions and income exists, so the use of a procedure that takes into account this fact is justified.
The choice of estimating the model using a single equation or a vector error correction model (VECM) is made by testing for weak exogeneity of the variables on the right-hand side of the equation describing the long run relationship of CO\(_{2}\) with GDP. Estimating with a single equation when the variables are not weakly exogenous is potentially inefficient and it does not lead to the smallest variance against alternatives, because information is lost. Using the approach suggested by Urbain (1992), I test weak exogeneity by testing whether the error correction term embedded in the short run ECMFootnote 11 is significant in the equation determining \(\Delta GDP_{it}\). In particular, weak exogeneity requires that \(\Delta GDP_{it}\) does not depend on disequilibrium changes represented by \(\hat{\epsilon }_{it-1}\). Results of this test, deeply detailed in the “Appendix”, show that GDP is weakly exogenous so the estimation of the model for CO\(_{2}\) emissions using a simple equation approach is suitable for my purpose.
4.2 Interrupted Time Series
Interrupted time series (ITS) are a powerful quasi-experimental approach for evaluating the effects of interventions. Segmented regression analysis of ITS data allows to assess, in statistical terms, how much an intervention changed an outcome of interest, immediately and over time, instantly or with delay, and whether factors other than the intervention could explain the change. In this paper, I make use of this ITS design to estimate the effects of democratisation on the level of emissions produced, in a panel of 47 countries for the period 1950–2002 for CO\(_{2}\) and 1990–2002 for PM10. For each country in my sample, the levels of emissions and concentration of particulate matters, real percapita incomes and the regime are measured at regularly spaced intervals over time. To determine the effect of a regime change on the level of the two indexes of pollution, I generate two series (namely, \(D\) and \(A\) representing the number of consecutive years since the last regime change to, respectively, democracy and dictatorship) which are divided into two or more segments at changing points, when the regime type varies. So, if a country, at a given date, has shifted from dictatorship to democracy, and there is only one shift in the period taken into consideration, then the time series is divided into two segments. If the number of shifts is \(n\), the number of segment will be, in general, \(n+1\). In detail, \(D\) and \(A\) are trend variables which are evaluated progressively according to the age of the democracy or autocracy. The advantage of using those two variables is that they allow assessment of the effect of the regime through time as well whether other effects may explain the change. If indeed the coefficients relative to those trends have the same sign, this methods allows to conclude that other factors (like general technological progress) may have played a major role in the determination of the environmental policy, and not the regime.
4.3 Econometric Specifications and Results
Due to the nature of data generating process of my variables of interest, I have to specify two different models for estimating the effect of the regime on CO\(_{2}\) and on the level of PM10 concentrations. Previous tests for cointegration between CO\(_{2}\) and GDP suggest that those two variables have a long run relationship; they move closely together over time and their difference is stable. Test for weak exogeneity shows that GDP is weakly exogenous with respect to CO\(_{2}\) so cointegration approaches which use a single equation are appropriate and no loss of information (and efficiency) occurs. For estimating the effect of the regime on the level of CO\(_{2}\) emission, then, I will use the procedure proposed by Engle and Granger (1987). The Engle–Granger Error Correction Model (ECM) is appropriate in this setting and has the advantage of incorporating both long run and short run effects; if the model is in disequilibrium the ECM will provide information on the speed of adjustment to equilibrium. Moreover, all the terms in the model are stationary, and so standard regression techniques are valid, and finally, the fact that CO\(_{2}\) and GDP are cointegrated of order CI(1,1) implies that an ECM exists (and, conversely, an ECM generates cointegrated series) and so ECM is immune from the spurious regression problemFootnote 12.
For what concerns the estimation of the effects of the two different regimes on the level of PM10 concentrations, I use a LSDV model to take into account of country specific fixed effects, and since tests for unit root for the PM10 series show that it is stationary, standard OLS procedures can be applied. In both econometric specifications I introduce two different trends, or, more precisely, two interrupted series of trends, \(D\) and \(A\), which represent, respectively, the number of consecutive years since the last democratisation or autocratisation. During periods of dictatorship, \(D\) is coded zero, and during periods of democracy, \(A\) is coded zero. In this setup, \(D\) and \(A\) represent the effect through time on emissions or on concentration of pollution of the type of regime. The other independent variables included in the model are inequality and the cross effects of inequality with the two trends for regime. The reason for their inclusion is that depending on who is the decisive political actor in the two different regimes, inequality may have different effects.
So, on one hand, when the population as a whole is involved in the decision process (through the electoral mechanism), different levels of income inequality change the median voter’s wealth and so the perceived value of income (or consumption) relative to the environmental quality, and therefore inequality is expected to worsen the environmental quality. Another reason that may induce this result is that, in reality, high income inequalities are often associated to a scarce participation of poorer people to the electoral process. This “sick participation” allows decisions to be made by a small group of people (usually rich and strongly interested) which—from a point of view—makes democracy resemble from different aspects to an autocracy.
On the other hand, during autocratic periods, if increased inequality is in favor of the dictator, it could increase environmental quality, since the rich dictator values the environment more, compared to consumption, or simply because the opportunities to substitute environment with private consumption are limited.
In order to estimate the relation between different regimes and CO\(_{2}\) emissions, consider the following ARDL(1,1)
with \(j=1,\ldots ,4\) and
with some manipulations (shown in “Appendix”) we end up with the usual ECM:
where the expression in square brackets represents the long run relationship between CO\(_{2}\) and GDP. If the model is in equilibrium, this expression is equal to zero, and if it is not, the coefficient \(-(1-\gamma _1)\) represents the speed of adjustment toward it. Therefore, I first estimate the long run relationship between CO\(_{2}\) and the other covariates as follows:
with
using a standar LSDV to account for specific, country-level time invariant effects (whose coefficient is represented by \(\beta _{0i}\)). The results of this estimation are shown in Table 2. The second step is to estimate the residuals from Eq. 4 and proceed to the estimation of the unrestricted error correction model
where \(W_{Dit}\) and \(W_{Ait}\) are defined as:
and \(\hat{\epsilon }_{it-1}\) is
Results of the estimation of Eq. 5 are shown in Table 3. Table 9 in “Appendix” shows the coefficients relative to the fixed effects estimated in Eq. 4.
As previously written, the series PM10 is stationary and therefore there cannot exist any cointegrating relation. It follows that a simple LSDV model is appropriate to estimate the effects of the regime, since the residuals are expected to be stationary (results of the tests of unit root on the residuals are shown in the “Appendix”).
and the results are shown in Table 2.
As it is possible to see, the trends for democracy and dictatorship have, respectively, a negative effect on the level of emissions/pollution, in both models and all the coefficients (except that relative to inequality for the model of CO\(_{2}\) emissions) are statistically significant at the standard 5 % level.
This indicates that democracy and autocracy have two different targets of “desired” pollution, with those of democracy lower than those of autocracy. Inequality per se has not a statistically significant effect on the level of CO\(_{2}\) emissions or PM10 concentration, but the interaction effects between income inequality and the two regimes are highly significant, and mitigate their effects. Income inequality indeed may affect negatively the functioning of a democracy because poorer people often do not participate to the electoral process, allowing then a small, and often rich and strongly interested group of people to take all the decisions. This makes a democracy with high income inequality resemble to an autocracy. That is the reason why democracy interacted with inequality is positive (while democracy alone is negative on the level of emissions/concentrations). The same reasoning can be applied to the interaction effect between autocracy and inequality. Given the fact that in autocracy the level of environmental quality is lower than in a democracy, it might be that if inequality goes in favour of the dictator (see footnote 6 for some examples) he could start to value the environment more, compared to private consumption, or it might be also that the possibility to substitute environment with private consumption starts to be scarce. So, given the fact that in autocracy the level of pollution is higher, inequality (in favour of the dictator) might mitigate this effect because of those two reasons.
For model 1 (PM10), the elasticity of concentration of PM10 with respect to one year increase in democracy (\(\partial PM10/ \partial D\)), computed at the mean of inequality is equal to \(-\)2.14, and the same elasticity computed for one year increase in dictatorship (\(\partial PM10/ \partial A\)) is 0.1132. For the second model, relative to CO\(_{2}\) emissions, \(\partial CO_{2}/ \partial D = -1.316\) and \(\partial CO_{2}/ \partial A=1.118\). It is possible to say therefore that in both models (even in the second all the conclusions apply only to the long run period) democracy is good for the environment, and moreover the argument is reinforced by the fact that non-democracies are not. Table 3 shows the results from the estimation of the unrestricted ECM for CO\(_{2}\) emissions:
The main results are reported in Table 3. Variations in the level of \(CO_{2}\) are, of course, positively related to variations in global production, and the coefficient attached to \(\hat{\epsilon }\) represents the speed of adjustment towards the equilibrium. It is worth noting that the coefficient related to \(\Delta D^*_{it}\) is not significant at the standard 5 % level (in fact, it has a \(p\) value of 0.574), while the coefficient related to the variation in the trend for dictatorship is positive and significatively related to increases in the variation of emissions. This effect may be the cause of the slower reactions of democracies with respect to autocracies. Democracies are indeed constrained by consultations, elections, every decisions has often to pass the evaluations of another independent organism, and this procedure takes time. On the contrary, in dictatorship the decisions are taken by one individual only and they have not to be scrutinised by other independent powers. The reasons for this non-significancy for the variations of the democratic level on the variation of emissions is probably to to this difference in the time a policy takes to be implemented into the two different regimes.
5 Conclusion
Despite the different views about the effect of democracy on the environmental management, in this paper I show that democracy and environmental quality are positively correlated. To show that, I use the powerful approach of ITS design in cointegration analysis to show that democratic countries and autocratic ones have two different targets of environmental quality, with those for democracy higher than those for autocracies. Previous works on democracy and environmental quality were indeed unable to assert that democracy is really good for the environment because they did not show that non-democracies are not. Segmented regression analysis of ITS allows not only to see the effect of democracy through time, but also if this effect differs from the effect of autocracy. The weakness of the previous works in this field was therefore that not comparing the results with those for dictatorships, the positive effect of democracy on the environment might not be due to democracy per se, but from other effects, like maturation, or technological progress, common to both regimes. In this panel of 47 transition countries, this approach shows that democratisation is consistently associated to a reduction of CO\(_{2}\) emissions and PM10 concentrations, but this process may be quite slow because—at least in the ECM relative to CO\(_{2}\) emissions—it is detectable only in the long run. Due to the fact that democratic institutions tend to be slower than autocratic ones in taking decisions and acting, in the short run we do not observe a negative effect of democracy on the level of emissions, while the positive effect of dictatorship is quite consistent. Inequality has two different effects depending on the incumbent regime: in any case it counterbalances the global effect of the regime. In democracy, increased inequality means that the decisive citizen is poorer and so less willing to pay for environmental protection. This however assuming perfect democracy only and full participation to the poll. From another point of view, it is reasonable to think that democratic institutions work worse when inequality is high, due to the scarce participation of the poor to the electoral processes, and therefore democracies with high income inequality resemble more to a dictatorship than a democracy itself, because decisions are taken by a small (and usually strongly interested) group of people. Inequality during periods of autocracy, under the assumption that this inequality favors the dictator at the expense of the rest of the citizens, it may retain the negative effect of the regime since it increases the dictator’s income and so it increases his demand for environmental quality, which may be driven by a limited (although high) opportunity to substitute environment with private consumption. The overall effect, however, is that dictatorships tend to be associated to a worse environment than democracies.
Notes
for a systematisation of the argument, see Payne (1995).
The average concentration of PM10 during democracy is 69.67906 and during dictatorship is 89.14407. GDP during democratic periods is, on average, 117,648.9 against 52,189.88 during dictatorship.
This average is computed over all the 53 years and over all the countries, conditioned to periods of democracy or dictatorship. The data for periods of democracy show an intensity of CO\(_{2}\) emissions (in Kg of carbon) per unit of income of 0.1269 against 0.1485 during periods of dictatorship.
All the results are also significant at 1 % level.
For a detailed presentation of the techniques used to construct such an index, visit the University of Texas’ website at http://utip.gov.utexas.edu/default.html.
It is indeed well known that many countries in Africa and the Caribbean suffer and have suffered from “kleptocratic” regimes (Acemoglu et al. 2004) run by individulas who use their power to transfer a large fraction of society’s resources to themselves. Two of the most “kleptocratic successes” are those of Mubutu Sese Seko in the Democratic Republic of Congo (Zaire) and Rafael Trujillo in the Dominican Republic. To have an idea of the numbers, in the 70’s, 15–20 % of Congo’s operating budget went directly to Mubutu, and in 1977 Mubutu’s family took $71 million from the National Bank for personal use and in the 80’s his personal fortune was estimated in $5 billion (Leslie 1987, p. 72). In the Dominican Republic, Trujillo became in power after he elected himself in a fraudulent election and at the end of his regime, the fortune of Trujillo’s family amounted to about 100 % of GDP at current prices and the family “controlled almost 80 % of the country’s industrial production” (Moya Pons 1995, p. 398). Other examples of “keptocratic regimes” include Haiti under the Duvaliers, Nicaragua under the Somozas, Uganda under Idi Amin, Liberia under Charles Taylor and the Philippines under Ferdinand Marcos, but the list can go on, if we include less extreme cases.
For a detailed description of these tests, the tests that will follow and results, refer to the “Appendix”.
Spurious regressions occur whenever one regresses a nonstationary variable over another nonstationary variable which are not cointegrated. In general, the estimated coefficients appear to be significant but they lack any real and plausible correlation, since what is correlated is just a common trend and nothing else. Under a spurious regression, the tendency of both series to be growing is picked up by the regression model, even though each series is growing for very different reasons and at a rates which are uncorrelated, and produce nonstationary residuals.
Since in my dataset there are only two nonstationary variables, it follows that at most one cointegrating relation may exist.
In performing these tests, it’s been assumed that there is no deterministic trend.
\(\hat{\epsilon }_{it-1}\) is computed as a residual of the long run relationship explaining percapita CO\(_{2}\) emissions.
This can be referred to Granger’s representation theorem for dynamic modelling, in Engle and Granger (1987)
Similar results are obtained for longer lags specifications, up to four.
Maddala and Wu use the results obtained by Fisher (1932).
In performing this test, it’s been assumed that there is no deterministic trend.
\(\Delta D^*\) and \(\Delta A^*\) are equal, respectively, to \(\Delta D \cdot Dem\) and \(\Delta A \cdot (1- Dem)\).
References
Acemoglu D, Robinson JA, Verdier T (2004) Kleptocracy and divide-and-rule: a model of personal rule. J Eur Econ Assoc 2(2/3). In: Papers and proceedings of the 18th annual congress of the European economic association, pp 162–192
Allison PD (1994) Using panel data to estimate the effects of events. Sociol Methods Res 23:174–199
Barro R (1996) Getting it right. MIT Press, Cambridge, MA
Barro R (1996b) Democracy and growth. J Econ Growth 1:1–27
Barro RJ (1999) Determinants of democracy. J Polit Econ 107:158–183
Bhattarai M, Hammig M (2001) Institutions and the environmental Kuznets curve for deforestation: a cross country analysis for Latin America, Africa and Asia. World Dev 29:995–1010
Bloom HS (2003) Using short interrupted time-series analysis to measure the impacts of whole school reform. Eval Rev 27:3–49
Campbell D, Stanley J (1966) Experimental and quasi-experimental designs for research. Rand McNally, Chicago, IL
Çíftçí S (2005) Treaties, collective responses and the determinants of aggregate support for European integration. Eur Union Polit 6:469–492
Cook TD, Campbell DT (1979) Quasi-experimentation: design and analysis for field settings. Rand McNally, Chicago, IL
Dasgupta P, Mäler KG (1995) Poverty, institutions, and the environmental resource-base, Chapt. 39. In: Behrman J, Srinivaan TN (eds) Handbook of development economics, vol 3A. Elsevier Science, Amsterdam
Desai U (1998) Environment, economic growth, and government, ecological policy and politics in developing countries. State University Press of New York Press, Albany, NY, pp 1–45
Engle RF, Granger CWJ (1987) Cointegration and error correction: representation, estimation and testing. Econometrica 55:251–276
Fisher RA (1932) Statistical methods for research workers, 4th edn. Oliver & Boyd, Edimburgh
Gallagher KP, Thacker SC (2008) Democracy, income and environmental quality, political economy research institute (PERI). Working Paper No. 164, University of Massachusetts, Amhrest
Gao H, Harrison D, Parry G, Daly K, Subbe C, Rowan K (2007) The impact of the introduction of critical care outreach services in England: a multicentre interrupted time series analysis. Crit care 11(5):R113
Gleditsch NP, Sverdrup B (2003) Democracy and the environment. In: Paper E, Redclift M (eds) Human security and the environment: international comparisons. Elgar, London
Hardin G (1968) The tragedy of the commons. Science 162:1243–1248
Hausman J (1979) Individual discount rates and the purchase and utilization of energy-using durables. Bell J Econ 10:33–54
Im KS, Pesaran MH, Shin Y (2003) Testing for unit roots in heterogeneous panels. J Econom 115:53–74
Kao C (1999) Spurious regression and residual-based tests for cointegration in panel data. J Econom 90:1–44
Leslie WJ (1987) The world bank and structural transformationin developing countries: the case of Zaire. Lynne Rienner Publishers, Boulder, CO
Levin A, Lin CF, Chu C (2002) Unit root test in panel data: asymptotic and finite-sample properties. J Econom 108:1–24
Maddala GS, Shaowen Wu (1999) A comparative study of unit root test with panel data and a new simple test. Oxford Bull Econ Stat 61:631–652
Maddison A (2009) Statistics on world population, GDP and Per Capita GDP, 1–2006 AD, updated March 2009. http://www.ggdc.net/maddison/
Marland G, Boden TA, Andres RJ (2008) Global, regional, and national CO2 emissions. In: Trends: a compendium of data on global change. Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, U.S. Department of Energy, Oak Ridge, Tenn
Marland G, Rotty RM (1984) Carbon dioxide emissions from fossil fuels: a procedure for estimation and results for 1950–1982. Tellus 36(B):232–261
Milanovic B (2000) The median voter hypothesis, income inequality and income redistribution: an empirical test with the required data. Eur J Polit Econ 16(3):367–410
Payne RA (1995) Freedom and the environment. J Democr 6:41–55
Pons FM (1995) The Dominican Republic: a national history. Markus Weiner, Princeton, NJ
Przeworski A, Limongi F (1993) Political regimes and economic growth. J Econ Perspect 7:51–69
Rodrik D (1999) Democracies pay higher wages. Q J Econ 114:707–738
Schultz CB, Crockett TR (1990) Economic development, democratization, and environmental protection in Eastern Europe. Boston Coll Environ Aff Law Rev 18:53–84
Shadish WR, Cook TD, Campbell DT (2002) Experimental and quasiexperimental designs for generalized causal inference. Houghton Mifflin, Boston
Torras M, Boyce JK (1998) Income, inequality and pollution: a reassessment of the environmental Kuznets curve. Ecol Econ 25:147–160
United Nations (1992) Handbook of the international comparison program
Urbain JP (1992) On weak exogeneity in error correction models. Oxford Bull Econ Stat 54:187–207
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
1.1 Diagnostic Tests
1.1.1 Panel Unit Root Test
The first step to decide the best way to estimate a relationship of interest, is to choose the appropriate econometric tool. In order to do so, it is necessary to verify the characteristics of the data generating processes: for each nondeterministic series (CO\(_{2}\) emissions, percapita GDP, PM10 concentrations and Inequality), I test whether they have stationary mean and variance. Available tests for unit root on panel data are based on the Dickey Fuller test (or its augmented version), so I test the following:
with \(i=1,\ldots ,N\) and \(t=1,\ldots ,T_i\). \(\rho _i\) represents the autoregressive coefficient for country \(i\), \(X_{it}\) is matrix representing the exogenous variables in the model, including any fixed effects and individual trends and \(\epsilon _{it}\) are the errors which are assumed i.i.d. If \(|\rho _i|=1\), then \(y_i\) contains a unit root. For practical purpose, the tests for unit root are performed using the following basic ADF specification:
with \(\alpha _i=\rho _i-1\).
Levin, Lin and Chu test assumes that the unit root process is common to all the cross sections so it assumes \(\alpha _i=\alpha \) for every \(i\). The test is performed by testing the null hypothesis \(H_0:\alpha =0\) against the alternative \(H_1:\alpha <0\) for all the cross section units. Their procedure derives estimates of \(\alpha \) from proxies for \(\Delta y_{it}\) and \(y_{it}\) that are standardised and free of autocorrelation and deterministic components. For a given set of lag orders \(p_i\), their procedure begins by estimating two additional sets of equations,
and denoting \((\hat{\beta },\hat{\delta })\) the estimated coefficients of Eq. 9 and \((\dot{\beta },\dot{\delta })\) those of Eq. 10. They define, then, \(\Delta \bar{y}_{it}\) and \(\bar{y}_{it-1}\) by taking, respectively, \(\Delta y_{it}\) and \(y_{it-1}\) and removing the autocorrelations and deterministic components using their respective auxiliary estimates
and standardise both \(\Delta \bar{y}_{it}\) and \(\bar{y}_{it-1}\) by dividing by the regression standard error
where \(s_i\) are the estimated standard errors from estimating each ADF in Eq. 8. The estimate of \(\alpha \) is then obtained from the pooled proxy equation
which, under the null, a modified \(t\) statistics for the resulting \(\hat{\alpha }\) is asymptotically normally distributed
where \(t_{\alpha }\) is the standard statistic for \(\hat{\alpha }=0\), \(\hat{\sigma }^2\) is the estimated variance of the error term \(\eta \), \(se(\hat{\alpha })\) is the standard error of \(\hat{\alpha }\), and
\(S_N\) is the mean of the ratios of the long run standard deviation for each individual, and it is estimated using kernel-based techniques, and \(\mu _{m\tilde{T}^*}\) and \(\sigma _{m\tilde{T}^*}\) are adjustment terms for the mean and standard deviation (for more details, refer to the original article of Levin et al. 2002). In order to perform this test, I include individual constant terms (fixed effects), and an individual trend, so my \(X_{it}\) matrix is a \(2NT \times 2N\) matrix, where the first \(NT \times N\) block is a matrix of dummy variables, each representing one single country, and the other block going from row \(NT+1\) to \(2NT\) and from column \(N+1\) to \(2N\) is a matrix of trends, one for each single countries. All the other terms in the matrix are equal to zero.
If the test for common unit root fails, it might be convenient to check whether individual unit root exists. Tests available for that are Im, Peasaran and Shin, Fisher AD and PP tests. Im, Pesaran and Shin test begin by specifying a separate ADF regression for each cross-section according to Eq. 8 with the null
and the alternative hypothesis is
That can be interpreted as a nonzero fraction of the individual processes is stationary. After having estimated the separate ADF regressions, the average of the \(t\) statistics for \(\alpha _i\) from the individual ADF regressions, \(t_{iT_i}(p_i)\),
is adjusted to arrive to the desired statistics. Critical values are provided in the Im, Pesaran and Shin’s paper (2003) for different number of cross sections and time periods when \(p_i=0\) for all \(i\), but in the general case when the lag order in Eq. 8 may be nonzero for some cross-sections, they show that a properly standardised \(\bar{t}_{NT}\) has an asymptotic standard normal distribution
and the expression for the expected mean and variance of the ADF regression \(t\) statistics, \(E(\bar{t}_{iT}(p_i))\) and \(Var(\bar{t}_{iT}(p_i))\) are provided by Im, Pesaran and Shin for various values of \(T\) and \(p\) and different test equation assumptions. I will use, in this paper, one lag and, as deterministic component, an individual constant, without introducing any trend term. Finally, there are other two tests for checking individual unit root: Fisher ADF and Fisher PP tests, which are based upon the idea by Maddala, Wu and Choi, and combine the \(p\) values from individual unit root tests. They work as follows: define \(\pi _i\) the \(p\) value from individual unit root test for cross-section \(i\), then under the null of unit root for all the \(N\) cross sections, we have the asymptotic result that
and also, Choi demonstrates that
where \(\Phi ^{-1}\) is the inverse of a standard normal cumulative distribution function. For both the Fisher tests, I specify as exogenous variables an individual constant (fixed effect) and an individual time trend.
Table 6 in “Appendix” shows the results of the tests for all the nondeterministic series. These tests accept the hypothesis of unit root only for two out of four series, percapita GDP and percapita CO\(_{2}\) emissions. Since the level of concentration of particulate matters (PM10) is stationary, it cannot be cointegrated with any other variable, while in principle CO\(_{2}\) and GDP could be. For what concerns the dependent variable PM10, then, a standard Least Squares Dummy Variables (LSDV) model for testing the effect of democracy on its level it is appropriate, as it should produce stationary (possibly normal) residuals. For what concerns the model for CO\(_{2}\) emissions, we cannot say a priori whether the same LSDV model is appropriate, and a test for cointegration is mandatory. If indeed the two series were cointegrated, that model would produce superconsistent estimates of the parameters, but if they were not, the model would suffer of the problem of the spurious regression with invalid inference of the parameters of interest. In this case, the tendency of both series to be growing leads to correlation which is picked up by the regression model, even though each series is growing for very different reasons and at a rates which are uncorrelated. Thus, in absence of cointegration (which will be tested in the next subsection) correlation between non-stationary series does not imply the kind of causal relationship that might be inferred from stationary series, so standard estimation techniques like OLS cannot be used.
1.1.2 Panel Cointegration Test
If two or more variables are nonstationary or I(1), if they are not cointegrated, the residual series obtained by regressing one I(1) variable over another I(1) variable is expected to be nonstationary, or I(1). This would lead to spurious regression since the estimated coefficients do not reflect a real relationship between those two variables, but simply correlated time trends. This however would be different if those two variables were cointegrated, and the coefficients would benefit from the “superconsistency” property. Since in my dataset there are only two nonstationary variables, namely, CO\(_{2}\) and GDP, it follows that at most one cointegrating relation may exist. In the Engle–Granger approach, cointegration is tested by verifying that the residual series generated by the regression of one I(1) variable over another I(1) variable is stationary. To verify whether percapita CO\(_{2}\) emissions and percapita GDP are cointegrated, I use the approach suggested by Kao (1999). Kao uses a two-step procedure to test for cointegration: in the first step, he regress the dependent variable [which is I(1)] over the independent [also I(1)] specifying cross-section specific intercepts and homogeneous coefficient: he basically regresses
where \(X_{it}\) is a matrix of dummy variables representing each single country and assuming \(\mathrm{{CO}_2}_{it}={\mathrm{CO}_2}_{it-1}+u_{it}\) and \(GDP_{it}=GDP_{it-1}+\epsilon _{it}\) for \(t=1,\ldots ,T\) and \(i=1,\ldots ,N\). Then Kao runs the pooled auxiliary regression
Assuming \(p=1\) Footnote 13, this augmented Dikey Fuller test for panel data reject at 1 % level the hypothesis that \(\rho =1\). The global ADF \(t\) statistic for Kao residual cointegration test with the null hypothesis of no cointegration shows a \(t\) stat of 3.499820 with a \(p\) value of 0.0002, so this test strongly suggests that those two series are cointegrated (detailed results of this test are in this Table 7 in “Appendix”).
Maddala and Shaowen (1999) combined test from individual cross-sectionsFootnote 14 is specified as follows: consider the following VAR representation for each cross-section unit:
where \(Y\) is a vector of I(1) variables (in my case, \(CO_{2}\) and GDP). Subtracting \(Y_{t-1}\) on the left and right hand side of equation 16 and adding and subtracting \(A_2Y_{t-1}\) from the right hand side, we get
with \(\Pi =(A_1+A_2-I)\) and \(\Gamma = - A_2\). Granger’s representation theorem asserts that if the coefficient matrix \(\Pi \) has a reduced rank \(r<k\), then there exists \(kxr\) matrices \(\alpha \) and \(\beta \) each with rank \(r\) such that \(\Pi =\alpha \cdot \beta '\) and \(\beta 'Y_t\) is I(0). \(r\) is the number of cointegrating relations and each column of \(\beta \) is the cointegrating vector, and the elements of \(\alpha \) are known as the adjustment parameters in the VEC model. Johansen’s method is to estimate the \(\Pi \) matrix from an unrestricted VAR and to test whether we can reject the restrictions implied by the reduced rank of \(\Pi \). In performing this test, I assume that the level data \(Y_t\) have no deterministic trends and the cointegrating equations have only interceipts, so
To determine the number of cointegrating relations \(r\) conditional on assumption 18, we proceed sequentially from \(r=0\) to \(r=k-1\) until we fail to reject. The trace statistic for the null hypothesis of \(r\) cointegrating relations is computed as
where \(\lambda _i\) is the largest eigenvalue of the \(\Pi \) matrix.
The maximum eigenvalue statistic tests the null hypothesis of \(r\) cointegrating relations against the alternative of \(r+1\) relations. The test statistics is computed
If the test statistics are continuous, the significance levels for each cross-section unit, denoted by \(\pi _i\) for \(i=1,2,\ldots ,N\), are independent uniform (0,1) variables, and \(-2 \log \pi _i\) has a \(\chi ^2\) distribution with two degrees of freedom. The approach proposed by Maddala and Wu to test cointegration in panel is to combine tests from individual cross sections to obtain a test statistic for the full panel, using the additive property of the \(\chi ^2\) variables: if \(\pi \) is the \(p\) value from an individual cointegration test for the cross-section \(i\), under the null hypothesis for the panel we have
Both trace and maximum eigenvalue tests reject the hypothesis of absence of cointegration at 1 % level, and accept the hypothesis that there exists one cointegrating relationFootnote 15. So, substantial evidence points out that a cointegrating relation between emissions and income exists, so the use of a procedure that takes into account this fact is justified.
1.1.3 Weak Exogeneity Test
In the model for the estimation of the of the effect of the regime on the level of CO\(_{2}\) emissions, I have only two non-stationary series, while the other variables are either deterministic or I(0). It follows that only one cointegrating relation can exist, so in principle it is possible to estimate this relationship using a single equation. However, estimating with a single equation is not free of drawbacks, as it is potentially inefficient, and so it does not lead to the smallest variance against alternative approaches. In general, information is lost unless the right-hand side variables in the cointegration vector are weakly exogenous. Weak exogeneity of these variables is indeed a prerequisite to assert that no useful information is lost when we condition on these variables without specifying their generating process. In practical terms, it must be the case that GDP is weakly exogenous with respect to the level of CO\(_{2}\) emissions. As pointed out by Urbain (1992), testing for weak exogeneity requires testing whether the error-correction term embedded in the short-run ECM (\(\hat{\epsilon }_{it-1}\) computed as a residual of the long run relationship equation) is significant in the equation determining \(\Delta GDP_{it}\). In particular, weak exogeneity requires that \(\Delta GDP_{it}\) does not depend on the disequilibrium changes represented by \(\hat{\epsilon }_{it-1}\).
Consider the following long run relationship for the dependent variable percapita \(CO_{2}\) emissions:
where the betas are combinations of parameters deriving from the ARDL model (for more details, please refer to the “Appendix”). Testing for weak exogeneity requires, in order:
-
1.
Estimating the coefficients of Eq. 22
-
2.
Computing the estimated residual series as
$$\begin{aligned} \hat{\epsilon }_{it}&= \mathrm{{CO}_2}_{it}-\hat{\beta }_{0i} - \hat{\beta }_1 GDP_{it} -\hat{\beta }_2 D_{it} - \hat{\beta }_3 A_{it} - \hat{\beta }_4 INEQ_{it} +\\&-\,\hat{\beta }_6 D_{it}\cdot INEQ_{it} -\hat{\beta }_7 A_{it}\cdot INEQ_{it} \end{aligned}$$ -
3.
Estimating the following equationFootnote 16:
$$\begin{aligned} \Delta GDP_{it}&= \gamma _1 \Delta \mathrm{{CO}_2}_{it} + \gamma _2 \Delta D_{it}^* + \gamma _3 \Delta A_{it}^* + \gamma _4 \Delta INEQ_{it}\nonumber \\&+ \,\gamma _5 W_{Dit} + \gamma _6 W_{Ait} - (1-\alpha )\hat{\eta }_{it-1} + \delta \hat{\epsilon }_{it-1} \end{aligned}$$(23)where the short-run interaction effects between the two trends and inequality are
$$\begin{aligned} W_{Dit}&= D_{it}\cdot \Delta INEQ_{it}+\Delta D_{it}^* \cdot INEQ_{it}-\Delta D_{it}^* \cdot \Delta INEQ_{it}\\ W_{Ait}&= A_{it}\cdot \Delta INEQ_{it}+\Delta A_{it}^* \cdot INEQ_{it}-\Delta A_{it}^* \cdot \Delta INEQ_{it} \end{aligned}$$and \(\hat{\eta }_{it}\) is the error correction term for the equation defining the relationship between \(GDP\) and \(CO_{2}\), and it is estimated from the long run relationship between GDP and \(\textit{CO}_{2}\), given by
$$\begin{aligned} \textit{GDP}_{it}&= \theta _{0i} + \theta _1 CO_{2} + \theta _2 D_{it} + \theta _3 A_{it} + \theta _4 INEQ_{it}\nonumber \\&+\,\theta _6 D_{it}\cdot INEQ_{it} +\theta _7 A_{it}\cdot INEQ_{it} +\eta _{it} \end{aligned}$$(24)and \(\hat{\epsilon }_{it-1}\) is already defined at point 2.
-
4.
Checking and testing the significancy of the coefficient attached to \(\hat{\epsilon }_{it-1}\), \(\delta \), in the equation defined at point 3. If \(\delta \) is significant in Eq. 23, then we cannot say that \(GDP\) is weakly exogenous and then a multivariate model for estimation is necessary. If instead \(\delta \) is found to be non-significant, a standard error correction model with one single equation is enough to estimate efficiently the relations of interest, so no loss of information occurs.
Results from the estimation of Eq. 23 for \(CO_{2}\) show that \(t\) test for \(\delta \) accepts the null of \(\delta =0\) at the standard level of 5 %. This coefficient amounts to \(-\)0.0382668, with a standard error of 0.1013909, a \(t\) statistic of \(-\)0.38 and a \(p\) value of 0.706, so GDP is shown to be weakly exogenous.
It is possible to conclude, then, that the estimation of the model for CO\(_{2}\) emissions using a single equation approach is appropriate, and there is no loss of information (and efficiency) (Tables 4, 5, 6, 7, 8, 9, 10).
1.2 The Econometric Model for the Estimation of the Unrestricted ECM for CO\(_{2}\) Emissions
Consider the following ARDL(1,1)
with \(j=1,\ldots ,4\) and
with some manipulations
and we end up with the usual ECM:
Rights and permissions
About this article
Cite this article
Policardo, L. Is Democracy Good for the Environment? Quasi-Experimental Evidence from Regime Transitions. Environ Resource Econ 64, 275–300 (2016). https://doi.org/10.1007/s10640-014-9870-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10640-014-9870-0