Introduction

While renewable energy sources and conservation measures have grown in importance as policymakers attempt to mitigate the global impact of greenhouse gas emissions on climate change and the environment, fossil fuels continue to serve as the primary energy source for a vast majority of countries. With carbon dioxide emissions, a prominent component of greenhouse gas emissions, the debate continues in regard to the appropriate mitigation and emission allocation strategies, as reflected in the Framework Convention on Climate Change in 1992, the Kyoto Protocol in 1997, and the Paris agreement in 2015.Footnote 1 Indeed, the generation of carbon dioxide emissions is directly tied to the country’s energy mix, level of economic development, economic structure, natural resource endowments, among other factors, and as such, vary greatly across developed and developing countries. This is a relevant point in the discussions related to the emission allocation approaches that focus on the distribution of per capita emissions. Specifically, countries with lower per capita emissions (i.e., developing countries) may very well expect countries with higher per capita emissions (i.e., developed countries) to shoulder more of the burden for the mitigation efforts and the reduction in emissions (Aldy 2006). This issue of fairness and equity associated with emission allocation strategies on a per capita basis becomes less of a concern if there is convergence in per capita emissions. On the other hand, if per capita emissions fail to converge, then a per capita emissions allocation scheme would trigger the potential for the relocation of emission-intensive industries and resource transfers through international trading of carbon allowances.Footnote 2

The distinction in the convergence behavior between developed and developing is relevant in relation to the environmental Kuznets curve (EKC). The EKC hypothesis postulates that in the early stages of economic development and growth, environmental quality diminishes as income increases. However, at some threshold level of income, the demand for environmental quality increases whereby emissions decrease. Another facet influencing a country’s emissions profile is the adoption of clean energy technologies across industries with differing pollution intensities and the substitution toward more environmentally friendly inputs in the production process (Apergis and Payne 2020). Moreover, the green Solow model set forth by Brock and Taylor (2010) demonstrates that technological progress which enhances production efficiencies and abatement is a fundamental consideration in the relationship between the EKC hypothesis and the convergence of emissions.

In this context, the literature on the issue of carbon dioxide emissions convergence has been extensively explored in the literature, as documented in the survey articles by Pettersson et al. (2014), Acar et al. (2018), and Payne (2020).Footnote 3 In general, the evidence from large multi-country studies on the convergence of per capita carbon dioxide emissions has been generally mixed (see Nguyen Van 2005; Aldy 2006; Ezcurra 2007a; Westerlund and Basher 2008; Nourry 2009; Panopoulou and Pantelidis 2009; Brock and Taylor 2010; Ordas Criado and Grether 2011; Herrerias 2013; Li and Lin 2013; Acaravci and Erdogan 2016; Ahmed et al. 2017; Brannlund et al. 2017; Churchill et al. 2018; Rios and Gianmoena 2018; Haider and Akram 2019; and Fernandez-Amador et al. 2019). However, studies focused on countries grouped by institutional structure, income classification, and geographic region lend greater support for convergence in per capita carbon dioxide emissions (see Strazicich and List 2003; Barassi et al. 2008, 2011, 2018; Lee et al. 2008; Lee and Chang 2008, 2009; Romero-Avila 2008; Jobert et al. 2010; Herrerias 2012; Yavuz and Yilanci 2013; Solarin 2014; Robalino-Lopez et al. 2016; Presno et al. 2018; Erdogan and Acaravci 2019; and Karakaya et al. 2019).Footnote 4,Footnote 5

Given that the majority of the studies to date have focused primarily on more developed, industrialized countries, we explore the convergence of per capita carbon dioxide emissions in the case of developing countries due to the differences in their level of economic development and growth prospects relative to industrialized countries as the EKC hypothesis would suggest. Furthermore, this line of inquiry will provide additional insights on the environmental sustainability of the economic development process for developing countries. As such, we test for the convergence of emissions using two approaches: stochastic convergence and club convergence. Following Carlino and Mills (1993) and Bernard and Durlauf (1995, 1996), the stochastic convergence approach evaluates the stationarity of relative per capita carbon dioxide emissions defined for each country i as the natural logarithm of the ratio of per capita carbon dioxide emissions relative to the average of all countries. If relative per capita carbon dioxide emissions follow a stationary process (i.e., stochastic convergence), shocks will be transitory in nature. Unlike the stochastic convergence approach, which relies on unit root/stationarity tests, the club convergence approach of Phillips and Sul (2007, 2009), which is based on a nonlinear time-varying factor model, does not depend on the stationarity properties of variables in question and considers the possibility of multiple convergence clubs. As noted by Panopoulou and Pantelidis (2009), the Phillips-Sul approach is similar to examining conditional σ-convergence and β-convergence within a panel framework.Footnote 6 More specifically, the Phillips-Sul approach tests for a decline in the cross-sectional variation of per capita carbon dioxide emissions among countries over time (conditional σ-convergence), as well as tests whether or not heterogeneous time-varying idiosyncratic components converge over time to a constant after controlling for a common growth component among countries (conditional β-convergence).

The “Data, methodology, and results” section discusses the data, methodology, and results, while the “Concluding remarks” section provides concluding remarks.

Data, methodology, and results

Data

Annual data from 1972 to 2014 for per capita carbon dioxide emissions (in metric tons) is obtained from the World Bank Development Indicators.Footnote 7 The data is constructed into three panels: (1) low-income countries (27), lower middle-income countries (38), and the combination of both low- and lower middle-income countries (65) as shown in Appendix A. Table 1 displays the summary statistics of per capita carbon dioxide emissions by income classification. For the case of low-income countries in Table 1, we find that mean per capita carbon dioxide emissions ranges from 0.034 in Burundi and Chad to 2.644 in the Syrian Arab Republic, while the variation (standard deviation) ranges from 0.009 in Burundi to 0.643 in the Syrian Arab Republic. The distribution of per capita carbon dioxide emissions shows positive skewness in 21 of the 27 countries with the kurtosis measure less than three for 17 of the 27 countries. The null hypothesis of normality in the distribution of per capita carbon dioxide emissions is rejected in over half the countries.

Table 1 Summary statistics. low-income countries (27)

In Table 2 for lower middle-income countries, we find much more dramatic ranges in both the mean and variation of per capita carbon dioxide emissions. The mean per capita carbon dioxide emissions ranges from 0.120 in Bangladesh to 4.362 in Mongolia, and the variation (standard deviation) ranges from 0.040 in Comoros to 1.974 in Mongolia. The distribution of per capita carbon dioxide emission also reveals positive skewness in 30 of the 38 countries with the kurtosis measure less than three for 29 of the 38 countries. The null hypothesis of normality in the distribution of per capita carbon dioxide emissions is rejected in nearly half the countries.

Table 2 Summary statistics. lower middle-income countries (38)

Stochastic convergence

We begin our analysis with examining stochastic convergence within a panel data framework recognizing that first-generation panel unit root tests may yield biased results if positive residual cross-section dependence is present. As a result, second-generation panel unit root tests have evolved to address the need to first determine the degree to which cross-sectional dependence is an issue. As such, we explore whether or not cross-sectional dependence is present in the data using the Pesaran (2004) cross-sectional dependence (CD) statistic. The CD statistic is an average of all pair-wise correlation coefficients of the ordinary least square residuals from the standard augmented Dickey and Fuller (1979) regression. With the null hypothesis of cross-sectional independence, the CD statistic follows asymptotically a two-tailed normal distribution as follows:

$$ CD=\sqrt{\frac{2T}{N\left(N-1\right)}}\left({\sum}_{i=1}^{N-1}{\sum}_{j=i+1}^N{\hat{\rho}}_{ij}\right) $$
(1)

where T is the time period and N is the number of countries. \( {\hat{\rho}}_{ij} \) is the pair-wise correlation coefficient estimates of the residuals. The results in Table 3 show that up to three lags, the null hypothesis of cross-sectional independence is rejected for each of the three country panels.

Table 3 Tests of cross-sectional dependence

Given the presence of cross-sectional dependence, we proceed with Pesaran’s (2007) augmented ADF-panel unit root test, which incorporates the lagged cross-sectional mean and its first difference in recognition of cross-sectional dependence as follows:

$$ \Delta {y}_{it}={\alpha}_i+{\beta}_i{y}_{it-1}+{\theta}_i{\overline{y}}_{t-1}+{\gamma}_i\Delta {\overline{y}}_i+{\varepsilon}_{it} $$
(2)

where \( {\overline{\mathrm{y}}}_{\mathrm{t}-1} \) denotes the mean of the lagged levels; \( \Delta {\overline{\mathrm{y}}}_{\mathrm{i}} \) is the mean of the first-differences; and εit is the error term. Pesaran (2007) uses a modified Im et al. (2003) statistic given by the average of the individual cross-sectional-ADF statistics (CADF) from Eq. (2) in defining the cross-sectional augmented IPS (CIPS) to test the null hypothesis of a unit root:

$$ \mathrm{CIPS}=\frac{1}{N}{\sum}_{t=1}^N{t}_i\left(N,T\right) $$
(3)

where ti(N, T) represents the t statistic from the ordinary least squares estimate of β in Eq. (2). In addition, we also correct for potential small sample bias via the CIPS* statistic as follows:

$$ {\mathrm{CIPS}}^{\ast }=\frac{1}{N}{\sum}_{t=1}^N{t}_i^{\ast}\left(N,T\right) $$
(4)

where:

$$ {t}_i^{\ast}\left(N,T\right)=\left\{\begin{array}{cc}{K}_1& {t}_i\left(N,T\right)\le {K}_1\\ {}{t}_i\left(N,T\right)& {K}_1<{t}_i\left(N,T\right)<{K}_2\\ {}{K}_2& {t}_i\ge {K}_2\end{array}\right. $$

The constants K1 and K2 are fixed, where the probability that ti(N,T) resides in [K1, K2] and close to one. Panel A of Table 4 displays the results of the Pesaran (2007) panel unit root tests with respect to relative per capita carbon dioxide emissions for the three country panels. The null hypothesis of a unit root is rejected at the 1% significance level across all three country panels based on the CIPS and CIPS* statistics, thus supporting stochastic convergence with respect to relative per capita carbon dioxide emissions.

Table 4 Panel unit root tests of stochastic convergence

To address the possibility of spurious results due to the absence of structural breaks, we also report tests for panel unit roots under multiple structural breaks using the Bai and Carrion-i-Silvestre (2009) approach. The Bai and Carrion-i-Silvestre (2009) panel unit root test takes into consideration both multiple structural breaks and cross-section dependence through the common factors model proposed by Bai and Ng (2004). Their method allows for structural breaks in the level, slope, and both the level and slope, thus providing a certain degree of heterogeneity in the number of breaks across countries. This approach relies on the following two models:

$$ {D}_{it}={\mu}_i+{\sum}_{j=1}^{l_i}{\theta}_{ij}{DU}_{ij t} $$
(5)

and

$$ {D}_{it}={\mu}_i+{\beta}_{it}+{\sum}_{j-1}^{l_i}{\theta}_{ij}{DU}_{ij t}+{\sum}_{k=1}^{m_i}{\gamma}_{ik}{DT}_{ik t} $$
(6)

where the component Dit represents the deterministic component. The structural breaks associated with the mean and the trend of a series, respectively, are denoted by li and mi, in which the number of breaks, li and mi, may differ. The dummy variables are defined as DUijt = 1 for \( t>{T}_{aj}^i \) and 0 otherwise, and \( {DT}_{ikt}=\left(t-{T}_{bk}^i\right) \) for \( t>{T}_{bk}^i \), and 0 otherwise. \( {T}_{aj}^i \) and \( {T}_{bk}^i \) represent the jth and kth dates of the structural breaks in the level and trend, respectively, for the ith individual with j = 1, …, li and k = 1, …, mi. The test is based on simplified test statistics, which are invariant to both mean and trend breaks:

$$ {Z}^{\ast }=\sqrt{N\left\{\frac{\left[{MSB}^{\ast}\left(\lambda \right)-{\xi}^{\ast}\right]}{\zeta^{\ast 2}}\right\}}\to N\left(0,1\right) $$
(7)

where \( {MSB}^{\ast}\left(\lambda \right)=\frac{1}{N}{\sum}_{i=1}^N{MSB}_i^{\ast}\left({\lambda}_i\right) \), \( {\xi}^{\ast }=\frac{1}{N}{\sum}_{i=1}^N{\xi}_i^{\ast } \), and \( {\zeta}^{\ast 2}={\sum}_{i=1}^N{\zeta}_i^{\ast 2} \). MSB(λ) is the pool modified Sargan and Bhargava (1983) test for individual time series. \( {\xi}_i^{\ast } \) and \( {\zeta}_i^{\ast 2} \) denote the mean and the variance of the individual modified \( {MSB}_i^{\ast}\left({\lambda}_i\right) \) statistics, respectively, where \( {\lambda}_i={T}_i^b/T \) is the break fraction parameter. The results of the Bai and Carrion-i-Silvestre (2009) test rejects the null hypothesis of a unit root in relative per capita carbon dioxide emissions, confirming stochastic convergence.

Club convergence

Finally, we follow Panopoulou and Pantelidis (2009), Apergis et al. (2017), and Apergis and Payne (2020), among others, in the use of the time-varying nonlinear factor model approach by Phillips and Sul (2007, 2009). The Phillips-Sul approach tests whether there is convergence with respect to the heterogeneous time-varying idiosyncratic components after controlling for a common growth component among the countries that share the same convergence pattern. This approach has the comparative advantage that it does not rely on any assumptions regarding the stationarity of the variables as in tests of stochastic convergence. Specifically, the Phillips-Sul approach utilizes a time-varying common factor defined as:

$$ PCCO{2}_{it}={\delta}_{it}{\mu}_t $$
(8)

where PCCO2it represents per capita carbon dioxide emissions in country i at time t, which is comprised of a common component, μt, and an idiosyncratic component, δit, both of which are time-varying. Note that the idiosyncratic component is a measure of the distance between PCCO2it and the common component, μt. Phillips and Sul (2007, 2009) use the relative transition parameter, hit, as follows:

$$ {h}_{it}=\frac{PCCO{2}_{it}}{\frac{1}{N}{\sum}_{i=1}^N PCCO{2}_{it}}=\frac{\delta_{it}}{\frac{1}{N}{\sum}_{i=1}^N{\delta}_{it}} $$
(9)

Equation (9) measures the loading coefficient, δit, relative to the panel average, thus the transition path for per capita carbon dioxide emissions in country i relative to the panel average. In the case that the factor loadings, δit, converge to a constant, δ, then the cross-sectional mean of the relative transition path for country i, hit, converges to unity, and the cross-section variation, Ht, of the relative transition path converges to zero as t → ∞:

$$ {H}_t=\frac{1}{N}{\sum}_{i=1}^N{\left({h}_{it}-1\right)}^2\to 0 $$
(10)

The semi-parametric form of δit is given as:

$$ {\delta}_{it}={\delta}_i+\frac{\sigma_i{\xi}_{it}}{L(t){t}^{\alpha }} $$
(11)

where δi is fixed; ξit~iid (0,1) varies across countries i = 1, 2, …, N; σi is an idiosyncratic scale parameter; L(t) is a slow varying function where L(t)→∞ and t→∞; and α represents the speed of convergence. Equation (11) ensures that δit converges to δi for α ≥ 0. Hence, the null hypothesis of convergence is the following: H0 : δi = δ and α ≥ 0, against the alternative hypothesis, HA : δiδ for some i and/or α < 0.

Following Phillips and Sul (2007, 2009), we set L(t) = logt in the decay model, so the empirical log t regression can be used to test for convergence and the deployment of the clustering algorithm to identify convergence clubs as follows:

$$ \log \left(\frac{H_1}{H_t}\right)-2\log L(t)=\hat{a}+\hat{b}\log t+{\varepsilon}_t $$
(12)

for t = rT, rT + 1, …, T where r > 0 set on the interval (0.2, 0.3).Footnote 8 For \( \hat{\mathrm{b}}=2\upalpha \), the null hypothesis is considered a one-sided test of \( \hat{\mathrm{b}}\ge 0 \) against \( \hat{\mathrm{b}}<0 \). To address estimates in Eq. (12) that may be weakly time-dependent, heteroskedasticity and autocorrelation-consistent standard errors are employed in the least squares estimates of \( \hat{b} \).

The Phillips and Sul (2007, 2009) procedure uses a club convergence approach to identify convergence clubs as follows: (1) order the N countries in the panel using the final values of per capita carbon dioxide emissions for the respective countries; (2) starting from the highest-order country in terms of per capita carbon dioxide emissions, sequentially estimate Eq. (12) on the k highest member countries to identify a core group of countries using the cut-off point criterion: \( {k}^{\ast }= Arg{\mathit{\operatorname{Max}}}_k\left\{{t}_{{\hat{b}}_k}\right\}, \) subject to \( {\mathit{\operatorname{Min}}}_k\left\{{t}_{{\hat{b}}_k}\right\}>1.65 \), for k = 2, 3, …N; (3) add one country at a time from the remaining countries to the core group, and re-estimate Eq. (12) using the sign criterion (\( \hat{\mathrm{b}}\ge 0 \)) to determine whether to add a country to the core group; and (4) repeat the above steps iteratively for the remaining countries until clubs can no longer be formed. Given this iterative approach, each club formed is associated with its own convergence path. Countries that do not exhibit a convergence pattern are considered non-convergent.

We begin with examining tests of club convergence in the case of the panel of 27 low-income countries as shown in panel A of Table 5. The null hypothesis of overall panel convergence is rejected at the 1% significance level given the t statistic of − 30.606. Given the absence of overall panel convergence, we proceed with the algorithm of Phillips and Sul (2007, 2009) to determine whether convergence clubs are formed. As documented in panel A of Table 5, three convergence clubs emerge with only Haiti exhibiting non-convergent behavior. Club 1 consists of four countries: Afghanistan, Nepal, Syrian Arab Republic, and Yemen; Club 2 encompasses 18 African countries: Benin, Burkina Faso, Burundi, Central African Republic, Chad, Democratic Republic of Congo, Ethiopia, Madagascar, Malawi, Mali, Mozambique, Niger, Rwanda, Sierra Leone, Somalia, Tanzania, Togo, and Uganda; and Club 3 contains four West African countries: Gambia, Guinea, Guinea-Bissau, and Liberia. An examination of the speed of convergence, α, shows that club 2 (0.4790) has the fastest speed of convergence, followed by club 1 (0.3310) and club 3 (0.2480).Footnote 9 However, as noted by Phillips and Sul (2009), the convergence algorithm may lead to over-estimation of the true number of clubs. To address this potential issue, we evaluate merging adjacent numbered clubs into larger clubs by performing club merging tests via regression (12). The club merging tests in panel B of Table 5 reject the null hypothesis of merging clubs. Interestingly enough, the convergence clubs reveal the geographic proximity of the respective club members, similar to previous convergence studies tied to geographic regions.

Table 5 Tests of club convergence: low-income countries

Next, we undertake the same tests of club convergence, but in this case for the panel of 38 lower middle-income countries. In panel A of Table 6, the null hypothesis of overall panel convergence is again rejected at the 1% significance level with a t statistic of − 30.837. Following the algorithm of Phillips and Sul (2007, 2009), we determine the number of convergence clubs. From panel A of Table 6, we identify five convergence clubs with seven countries (Cabo Verde, Comoros, Mongolia, Papua New Guinea, Sao Tome and Principe, Solomon Islands, and Vanuatu) considered non-convergent. Club 1 consists of 15 African countries (Angola, Cameroon, Republic of Congo, Cote d’Ivoire, Djibouti, Eswatini, Ghana, Kenya, Kiribati, Mauritania, Nigeria, Senegal, Zambia, and Zimbabwe). Club 2 includes only three North African countries (Egypt, Morocco, and Tunisia); Club 3 comprises five Asian countries (Bangladesh, Bhutan, Cambodia, Laos, Myanmar, and Vietnam); Club 4 consists of four Central and Latin American countries (Bolivia, El Salvador, Honduras, and Nicaragua); and Club 5 contains four Asian countries (India, Indonesia, Pakistan, and the Philippines). A review of the speed of convergence associated with each convergence club reveals that club 5 (0.6685) exhibits the fastest speed of convergence, followed by club 4 (0.5140), club 2 (0.4195), club 1 (0.3490), and club 3 (0.2555). As in the case of the convergence clubs for low-income countries reported in Table 5, the club merging tests, shown in panel B of Table 6, do not support the merger of the respective convergence clubs. Likewise, convergence clubs among lower middle-income countries again reflect a high degree of geographic proximity.

Table 6 Tests of club convergence: lower middle-income countries

Finally, we combine low-income and lower middle-income countries to form a developing country panel of 65 countries. Panel A of Table 7 shows that the null hypothesis of overall panel convergence is once again rejected at the 1% significance level with a t statistic of − 31.219. We find six convergence clubs with 11 countries (Cabo Verde, Comoros, Haiti, Malawi, Mongolia, Papua New Guinea, Sao Tome and Principe, Solomon Islands, Syrian Arab Republic, Vanuatu, and Yemen) non-convergent. Club 1 includes 32 African countries (Angola, Benin, Burkina Faso, Burundi, Cameroon, Central African Republic, Chad, Republic of Congo, Cote d’Ivoire, Democratic Republic of Congo, Djibouti, Eswatini, Gambia, Ghana, Kenya, Kiribati, Liberia, Madagascar, Mali, Mauritania, Mozambique, Niger, Nigeria, Rwanda, Senegal, Sierra Leone, Somalia, Tanzania, Togo, Uganda, Zambia, and Zimbabwe); Club 2 comprises three North African countries (Egypt, Morocco, and Tunisia); Club 3 consists of six Asian countries (Bangladesh, Bhutan, Cambodia, Laos, Myanmar, and Vietnam); Club 4 encompasses four Central and Latin America countries (Bolivia, El Salvador, Honduras, and Nicaragua); Club 5 includes six Asian countries (Afghanistan, India, Indonesia, Nepal, Pakistan, and the Philippines), and three African countries in club 6 (Ethiopia, Guinea, and Guinea-Bissau). As is the case with Tables 5 and 6, the speed of convergence varies greatly across the convergence clubs with the fastest convergence in club 5 (0.5145), followed by club 4 (0.3915), club 2 (0.3310), club 1 (0.2705), club 3 (0.2195), and club 6 (0.2040). Similar to panel B of Tables 5 and 6, the club merging tests reported in panel B of Table 7 reject the null hypothesis of merging clubs. While panel unit root tests find stochastic convergence in relative per capita carbon dioxide emissions for each of the three country panels, the club convergence tests reveal multiple convergence clubs in each country panel that show unique transition paths for countries within each convergence club to a steady state.

Table 7 Tests of club convergence: low- and lower middle-income countries

Concluding remarks

With the ongoing debate on the appropriate mitigation and emission allocation strategies pertaining to per capita carbon dioxide emissions, this exploratory study provided additional evidence with respect to the convergence of per capita carbon dioxide emissions in the case of developing countries. Specifically, Pesaran (2007) and Bai and Carrion-i-Silvestre (2009) panel unit root tests with allowance for cross-sectional dependence lend support for stochastic convergence in per capita carbon dioxide emissions for the respective country panels. The nonlinear time-varying factor model of Phillips and Sul (2007, 2009) revealed multiple convergence clubs within the country panels with the speed of convergence varying across convergence clubs. Within the low-income country panel, the analysis identified three convergence clubs, five convergence clubs for the lower middle-income country panel, and six convergence clubs for the country panel that combined both low- and lower middle-income countries. A common theme for many of the convergence clubs was the geographical proximity of countries within the club. With respect to the non-convergent countries, a common characteristic was that many were island countries and to some extent geographically isolated.

As noted by Rios and Gianmoena (2018), rather than the two-track emission allocation framework in which developing countries did not have mitigation requirements, as in the case of industrialized countries under the Framework Convention on Climate Change and the Kyoto Protocol, the Paris 2015 agreement provided for carbon dioxide emissions mitigation to be tied to country-specific circumstances. This is particularly relevant as our results from the Phillips-Sul club convergence procedure illustrate that countries in geographic proximity, as defined within the convergence clubs, exhibit unique transition paths toward their respective steady states. The geographic proximity between countries within their respective convergence clubs may reflect similar natural resource endowments, weather conditions, and economic structure, all of which influence their energy consumption mix. Moreover, the geographical proximity may also indicate the potential for strategic interactions between governments with respect to environmental policy actions whose economies are spatially linked relative to other countries (Fredriksson et al. 2004). In addition, the quality of a country’s institutions and governance structure plays a critical role in the effective implementation of the appropriate economic instruments (price-based and rights-based measures) to mitigate emissions as their level of economic development evolves over time. The ability of developing countries to adopt emerging mitigation and low carbon technologies, alongside movement toward renewable energy sources and improvement in energy efficiency, should be given serious consideration in order to control carbon dioxide emissions in these countries.