Keywords

1 Introduction

The Balassa hypothesis tests the impact of productivity growth on the real exchange rate. It states that for a growing economy, the real exchange rate is expected to appreciate in the long run. Our study is based on a finding by Baylie (2008). The real effective exchange rate is an important policy parameter and among the most determining factors of growth in Ethiopia (Baylie 2008). Though Baylie recommends depreciation of the domestic currency for promoting economic growth in the short run, the author discovered that it is healthier to allow appreciation in the long run to encourage sustainable economic growth. Hence, he provided (an exchange rate) policy recommendations which promote appreciation of the domestic currency for sustainable growth in the long run.

Both depreciation and appreciation are not welcomed effortlessly by the monetary authority. As suggested by Baylie (2008), depreciation in particular is not favored by the monetary authority as it increases the burden on the importing capacity for a developing country like Ethiopia. In contrast, by the time it is recommended that a country allows appreciation, all advantages of depreciation have been exhausted while prospects of appreciation are pending. Depreciation may initially help promote exports and generate sufficient foreign earnings. Once this objective is met, there arises a need to promote imports of capital goods by allowing appreciation to establish import-substituting industries to transform the economy. The only issue to consider in this case is the ‘timing’ of switching policy. The solution to this dilemma is provided by the Balassa hypothesis.

At the time when the Balassa hypothesis holds in a particular economy, depreciation is not gainful. In short, it states that if economic growth is accompanied in appreciation of the domestic currency (Balassa hypothesis), the monetary authority should not constrain the appreciation for the simple reason that it may discourage exports. If economic growth by itself brings appreciation, it can be sustained as the latter further puts inertia on the former. There is a possibility of one driving the other in the long run when the hypothesis holds.

In short, the hypothesis states that the impact of growth on the exchange rate is positive; that is, there is appreciation of the domestic currency. The main purpose of our study, therefore, is to show whether this analysis can be extended to a group of low- and middle-income countries on various continents. While there is evidence in favor of the hypothesis, there are also some anti-Balassa results in some studies. The negative results could be associated with different reasons specific to each study.

Tica and Druzic’s (2006) survey shows that since its discovery in 1964, the hypothesis has been tested 58 times in 98 countries in time series or panel analyses and in 142 countries in cross-country analyses. In these analyzed estimates, country-specific Balassa hypothesis coefficients have been estimated 164 times. The first empirical test of the theory was carried out by Balassa (1964) himself . Kravis and (1983) and Bhagwati (1984) were also among the forerunners. The conclusions from all these studies confirm the difficulty in ignoring the significance of the hypothesis in general. The strongest empirical support in favor of the relationship between productivity and exchange rate is found in cross-sectional and panel empirical studies.

Chuoudhri and Kahn (2004) found evidence of Balassa –Samuelson effects in a panel of 16 developing countries. They found the traded and non-traded productivity differential to be a significant determinant of the relative prices of non-traded goods, and the relative price in turn exerted a significant effect on the real exchange rate. Similarly , Guo and Hall (2010) and Jabeen et al. (2011) also show that productivity differences directly explained changes in the real exchange rate by using the Johansen co-integration approach for China and Pakistan, respectively.

A positive relationship between productivity and the real exchange rate is not, however, a common fact in all studies. There are a number of studies that show anti-Balassa results. Drine and Rault (2002, 2004), for example, tested the Balassa hypothesis for 20 Latin American (middle-income) and six Asian (low-income) developing countries separately. They applied Pedroni’s co-integration techniques in both the studies. Though they were able to find evidence for the hypothesis in the first study for middle-income countries, they failed to replicate the result in the second study for low-income countries. The reason given for the failure is a break in the relationship between productivity and relative price, one of the assumptions of the hypothesis. Asea and Mendoza (1994a, b), Harberger (2003 ), Hassan (2011), Isard and Symansky (1996), Miyajima (2005), and Wilson (2010) also found anti-Balassa results in their studies on developing countries.

The study that comes the closest to our study is Chuah’s (2012). This study found mixed results from a panel study of 142 developing (middle- and low-income) and developed (high-income) countries. The estimation of the fixed effect model showed that productivity growth in developed economies resulted in real appreciation of domestic currencies, while the relationship was nonlinear in developing economies. In the latter group, the real exchange rate initially depreciated and then appreciated after per capita income jumped to a higher level (above $2200), the main reason being a level of development.

Our study makes three improvements over Chuah’s (2012) study in terms of data quality, methodology, and variables. First, our study uses data from the latest version of the Penn World Table (PWT), version 8. Data from this version address shortcomings associated with previous versions. In particularly, Chuah (2012) used an expenditure-based measure of GDP from version 7, while our study uses the output-based measure of GDP from version 8. Feenstra et al. (2013) suggested using the second measure for studies interested in an economy’s productive capacity. Second, our study accounts for cross-sectional dependence and heterogeneity by applying the common correlated effect approach of Pesaran (2013) and pooled mean group estimation, respectively. Third, our study controls for important supply- and demand-side factors.

The remaining paper is organized as follows. Section 12.2 provides the theoretical background of the hypothesis. Section 12.3 discusses the methodology. The findings are presented in Sects. 12.4 and 12.5 gives the conclusion and policy implications derived from the findings.

2 Theoretical Framework of the Model: The Balassa–Samuelson Hypothesis

Though the idea has been mentioned by several authors (like Ricardo 1911; Harrod 1933; Viner 1937), the contribution of other authors is not as bold as Paul A. Samuelson and Bela Balassa and hence the name Balassa-Samuelson hypothesis (Tica and Druzic 2006). The term ‘Balassa hypothesis’ is used in this study.

The Balassa hypothesis demonstrates the relationship between exchange rate, purchasing power parity (PPP), and inter-country income comparisons in general. The hypothesis emanates from the PPP theory. It explains the reason why the PPP theory of exchange rate is imperfect. In the absence of all frictions, the prices of a common basket of goods in two countries measured in the same currency should be the same at all times for absolute PPP to hold, that is, \( P/{\upvarepsilon }P^{*} = 1. \) The Balassa –Samuelson effect, first formulated by Harrod in 1934 and later by Balassa and Samuelson in 1964 separately, says that distortions in purchasing power parity are the result of international differences in relative productivity growth between the tradable goods sector (mainly manufacturing and agriculture) and the non-tradable goods sector (mainly services) (Herberger 2003; Tica and Druzic 2006). In contrast to the PPP theory, price levels are higher in rich countries than poor ones when converted to a common currency. This may be associated with higher productivity growth in the tradable sector in rich countries (Rogoff 1996).

A nation’s prosperity is mainly associated with productivity growth in the tradable goods sector. This has an effect of reducing costs in the same sector and increasing real wages in the economy and puts an upward pressure on relative prices of non-tradable goods where productivity has not grown by the same magnitude. This distorts the PPP relationship and results in appreciation of the real exchange rate. The same effect holds true across nations. A more prosperous nation experiences higher productivity growth in the tradable goods sector than a poor nation. Thus, an increase in the prices of non-tradable goods will be higher in a rich country. As a result, a rich country’s real exchange will appreciate compared to a poor’s nation currency (Asea and Corden 1994a, b).

The Balassa hypothesis may be tested in two forms: external and internal versions. The external version analyzes the impact of productivity growth on the real exchange rate. The internal version analyzes the impact of productivity on relative prices. If one fails to prove a relationship between productivity and the real exchange rate, it is most likely that the hypothesis is functioning through the internal version; that is, the relationship between relative prices and the real exchange rate or relative prices and productivity growth should be tested. The main objective of our study is to examine the validity of the external version of the hypothesis.

The core idea of the Balassa hypothesis is related to the concept of convergence (beta-β-convergence) in growth theories. Both describe features of developing economies. Convergence between economies may be roughly defined as the tendency for levels of per capita income or productivity to equalize over time. Growth theoriesFootnote 1 state that countries with low capital-to-labor ratios (high marginal productivity of capital) in general and with advantages of elements such as innovation ability, human capital formation, technical progress, and economies to scale in particular grow faster than others (Kumo 2011; Orlik 2003; Soukiazis 1995).

According to these growth theories, there is a tendency for developing countries to grow faster than developed countries if some conditions in particular are satisfied. Given that the Balassa hypothesis is related to the impact of economic (productivity) growth on the real exchange rate, there should be a greater probability of finding evidence for the hypothesis in converging economies as compared to developed ones. The convergence process, thus, may be used as a criterion for identifying candidate countries for a sample study.

3 Methodology

3.1 Data Type and Collection Methods

Data for all countries and variables are from Penn World Table for the period 1950–2011. The variables include exchange rate, per capita GDP, and government expenditure. While the choice of the study period for each country depends on data availability, countries are selected on the basis of the convergence criterion which suggests that the fastest growing economies are mainly the developing economies.

According to IMF’s World Economic Outlook Report (2015), all 15 countries in our sample are developing countries. However, for comparison purposes, the sample is divided into two categories on the basis of the size of economies (relative GDP). The first group represents the top five largest economies in the sample—BRICS (Brazil, Russia, India, China, and South Africa). They are from the (upper) middle-income countries’ category (except India) which together nearly represent 90% of the US economy. The second group consists of 10 low-income countries (Angola, Ethiopia, Ghana, Indonesia, Kenya, Nigeria, the Philippines, Rwanda, Tanzania, and Uganda). Lower middle-income countries (with per capita income lower than $4125) are included in the second group in our sample.

3.2 Model Specification

The original Balassa model was designed for a fully employed small open economy; a 2 × 2 × 2 system (two countries, two commodities, two factors); an inter-sector mobile labor (scarce factor) and inter-nation mobile capital; law of one price for factors within a nation and for tradables across nations; a constant return to scale production frontier; perfect competition in both markets (goods and factors); neural technical progress; and constant terms of trade (Podkaminer 2003).

A derivation of the Balassa–Samuelson model may be considered as a three-stage process. The first is to derive the relationship between the productivity differential and relative price. The second is to derive the relationship between relative price and exchange rate. The third is to derive the relationship between productivity differential and exchange rate.

  • STEP 1: The original Balassa–Samuelson model is framed on the basis of the traditional Ricardian trade model (Asea and Corden 1994a, b). It is a supply-side model defined by constant return to scale Cobb-Douglas style production functions in two sectors as (Podkaminer 2003):

$$ Y_{T} = A_{T} L_{T}^{\alpha } K_{T}^{1 - \alpha } $$
(12.1)
$$ Y_{N} = A_{N} L_{N}^{\beta } K_{N}^{1 - \beta } $$
(12.2)

where T and N refer to traded and non-traded sectors, and \( {\alpha } \) and \( {\beta } \) represent the share of labor in each sector, respectively, with \( {\beta } \ge {\alpha }. \)

In a perfectly competitive market, factor prices must equal their respective value of marginal products at equilibrium for both sectors:

$$ P_{T} A_{T} \alpha \left( {\frac{{K_{T} }}{{L_{T} }}} \right)^{1 - \alpha } = w $$
(12.3)
$$ P_{T} A_{T} (1 - \alpha )\left( {\frac{{K_{T} }}{{L_{T} }}} \right)^{ - \alpha } = r $$
(12.4)
$$ P_{N} A_{N} \beta \left( {\frac{{K_{N} }}{{L_{N} }}} \right)^{1 - \beta } = w $$
(12.5)
$$ P_{N} A_{N} (1 - \beta )\left( {\frac{{K_{N} }}{{L_{N} }}} \right)^{ - \beta } = r $$
(12.6)

Combing the two factor markets for each sector independently and taking the logarithm of both sides for each equation yields:

$$ \log\left( {P_{T} } \right) = \left( {1 - \alpha } \right)\log \left( {\text{r}} \right) - \left( {1 - \alpha } \right)\log \left( {1 - \alpha } \right) + \alpha \log(\text{w}) - \log\left( {A_{T} } \right) $$
(12.7)
$$ \log\left( {P_{N} } \right) = \left( {1 - \beta } \right)\log \left( {\text{r}} \right) - \left( {1 - \beta } \right)\log \left( {1 - \beta } \right) + \beta \,\log(\text{w}) - \log\left( {A_{N} } \right) $$
(12.8)

Recalling the assumption that price of tradables (numeraire) and interest rate (not technology) are the same across boundaries, differentiation of the above with respect to time yields:

$$ \frac{{\left( {\frac{{{\text{d}}P_{T} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{P_{T} \left(\uptau \right)}} = 0 = \frac{{\alpha \left( {\frac{{{\text{dw}}\left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{{\text{w}}\left(\uptau \right)}} - \frac{{\left( {\frac{{{\text{d}}A_{T} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{A_{T} \left(\uptau \right)}} $$
(12.9)
$$ \frac{{\left( {\frac{{{\text{d}}P_{N} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{P_{N} \left(\uptau \right)}} = \frac{{\beta \left( {\frac{{{\text{dw}}\left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{{\text{w}}\left(\uptau \right)}} - \frac{{\left( {\frac{{{\text{d}}A_{N} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{A_{N} \left(\uptau \right)}} $$
(12.10)

Substituting Eq. (12.9) into Eq. (12.10) helps define the relative price of non-tradables in terms of productivity differentials for home and foreign country (\( \hat{A} \) represents growth rate):

$$ \frac{{\left( {\frac{{{\text{d}}P_{N} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{P_{N} \left(\uptau \right)}} = \frac{\beta }{\alpha }\frac{{\left( {\frac{{{\text{d}}A_{T} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{A_{T} \left(\uptau \right)}} - \frac{{\left( {\frac{{{\text{d}}A_{N} \left(\uptau \right)}}{{{\text{d}}\uptau}}} \right)}}{{A_{N} \left(\uptau \right)}} $$
(12.11)
$$ \hat{p}_{N} = \left( {\frac{\beta }{\alpha }} \right)\hat{A}_{T} - \hat{A}_{N} $$
(12.12)
$$ \hat{p}_{N}^{*} = \left( {\frac{\beta }{\alpha }} \right)^{*} \hat{A}_{T}^{*} - \hat{A}_{N}^{*} $$
(12.13)

The difference between Eqs. (12.12) and (12.13) defines price differentials across countries:

$$ \hat{p}_{N} - \hat{p}_{N}^{*} = \left[ {\left( {\frac{\beta }{\alpha }} \right)\hat{A}_{T} - \hat{A}_{N} } \right] -\left[ \left( {\frac{\beta }{\alpha }} \right)^{*} \hat{A}_{T}^{*} - \hat{A}_{N}^{*} \right]$$
(12.14)

This means that the price differential between sectors and across countries can be explained by productivity differentials between sectors and across nations.

  • STEP 2: We follow Ahn (2009 ) to link the exchange rate and productivity differential through the price index. The real exchange rate is defined in a log-linear form as (increase shows appreciation):

$$ \begin{aligned} Q & = P/{\upvarepsilon }P^{*} \\ q & = p - e - p^{*} \\ \end{aligned} $$
(12.15)

Price indices are defined as weighted averages of prices in tradable and non-tradable sectors in both domestic and foreign markets:

$$ P = P_{N}^{\delta } P_{T}^{1 - \delta } \quad {\text{and}} \quad P^{*} = P_{N}^{*\theta } P_{T}^{{*\left( {1 - \theta } \right)}} $$

In log-linear form:

$$ p = \delta p_{N} + \left( {1 - \delta } \right)p_{T} $$
(12.16)
$$ p^{*} = \theta p_{N}^{*} + \left( {1 - \theta } \right)p_{T}^{*} $$
(12.17)

\( {\delta } \) and \( {\theta } \) represent the share of non-tradables in the consumer basket at home and abroad, respectively.

Substituting Eqs. (12.16) and (12.17) into Eq. (12.15) helps define the real exchange rate as a function of price differential:

$$ q = \left[ {{\delta }\left( {p_{N} - p_{T} } \right) - {\theta }\left( {p_{N}^{*} - p_{T}^{*} } \right)} \right] + p_{T} - e - p_{T}^{*} $$
(12.18)

Since \( p_{T} = e + p_{T}^{*} \) (law of one price for tradables), Eq. (12.18) will be:

$$ q = \left[ {{\delta }\left( {p_{N} - p_{T} } \right) - {\theta }\left( {p_{N}^{*} - p_{T}^{*} } \right)} \right] $$
(12.19)
  • STEP 3: Eq. (12.19) defines the real exchange rate as a function of the relative price differential between countries. Substituting Eq. (12.14) into Eq. (12.19) helps define the exchange rate as a function of the productivity differential. We assume that the share of non-tradables in the foreign consumer basket \( ({\theta }) \) is the same as home \( ({\delta }). \) Hence:

$$ \hat{q} = {\delta }\left( {\left[ {\left( {\frac{\beta }{\alpha }} \right)\hat{A}_{T} - \hat{A}_{N} } \right] - \left( {\frac{\beta }{\alpha }} \right)^{*} \hat{A}_{T}^{*} - \hat{A}_{N}^{*} } \right) $$
(12.20)

If the home market grows faster than the foreign one, then the domestic currency appreciates and vice versa.

In order to avoid the assumption of neutral technical progress, we introduced an intercept in the econometric model (Kohler 1998). We also introduced demand-side factors as the Balassa model is not complete by itself (De Gregorio and Wolf 1994).Therefore, the econometric model used in our study is derived from Eq. (12.20) (see Annexure 1 for derivation). It includes two more factors (demand and supply sides):

$$ \left( {\text{lnQ}} \right)_{it} = \alpha_{i} + \beta_{1i} { \ln }\left( {{\text{Y}}/{\text{Y}}^{ *} } \right)_{it} + \beta_{2i} { \ln }\left( {{\text{G}}/{\text{G}}^{ *} } \right)_{it} + \beta_{3i} \text{vol}\left( {\text{E}} \right)_{it} + e_{it} $$
(12.21)

where Q and E are real and nominal exchange rates .

\( \left( {\text{lnQ}} \right)_{it} \) is log of the real exchange rate of each country measured against the US dollar. Increase implies appreciation. ‘it’ refers to ith country in period t. \( { \ln }\left( {{\text{Y}}/{\text{Y}}^{ *} } \right)_{it} \) is log of real GDP per capita relative to the US economy. It is a proxy for the productivity growth differential in each country. The Balassa hypothesis declares that productivity growth has a positive impact on the real exchange rate.

\( { \ln }\left( {{\text{G}}/{\text{G}}^{ *} } \right)_{it} \) is log of relative real government expenditure. It is a proxy for fiscal policy . Kohler (1998) argues that government expenditure accounts for demand shifts toward non-tradables which results in appreciation of the real exchange rate in the short run. In the long run, it does not have an impact unless financed by distortionary taxes. Distortionary taxes reduce real wages and relative prices of non-tradables , and this leads to the depreciation of the real exchange rate in the long run.

\( vol\left( {\text{E}} \right)_{it} \) is exchange rate volatility measured as the absolute value of percentage change in the nominal exchange rate. It is a supply-side factor. The impact of volatility on the real exchange rate may be positive or negative; it depends on the time horizon and type of regime. Kohler (1998) shows that the impact of volatility is smaller in the short run and in poor countries due to greater nominal rigidities. In relatively fixed exchange rate regimes (mainly poor economies), movements in nominal exchange rate are restricted. In this case, growing economies experience inflation in both sectors with relative prices of non-tradables falling. This leads to a depreciation of the real exchange rate. In contrast, there is smaller rigidity in freely floating exchange rate regimes (mainly rich economies). With productivity growth, inflation in the non-tradable sector is balanced by deflation in the tradable sector (as a result of a nominal appreciation). Relative prices of non-tradables increase, and this leads to the real exchange rate appreciation.

3.3 Cross-sectional Dependence Test

Cross-sectional dependence is a problem associated with panel data that mixes information from different cross sections and leads to a difficulty in interpreting the individual effects of each section. It may be caused by socioeconomic network effects, spatial effects, or the influence of a dominant unit or common unobserved factors. When the problem is ignored, estimates are badly biased and the tests may be misleading (Shin 2014). Factor models are used to filter out cross-sectional dependence due to unobserved common factors. We used the Pesaran cross-sectional independence test in our study as it is the most powerful test (Eberhardt 2011). It is given by CD (cross-sectional dependence) which is \( N(0, 1). \)

$$ CD = \sqrt {\left( {\frac{2}{{N\left( {N - 1} \right)}}} \right)} \left( {\mathop \sum \limits_{i = 1}^{N - 1} \mathop \sum \limits_{j = i + 1}^{N} \sqrt {T_{ij} } {\hat{\rho }}_{ij} } \right) $$
(12.22)

3.4 Panel Unit Root Tests

Six types of panel unit root tests are available: Levin-Lin-Chu (LLC), Hariss-Tzavalis (HT), Breitung, Im-Pesaran-Shin (IPS), Fisher type, and Hadri LM. The panel data for our study are unbalanced, and N is fixed and smaller relative to T. It also assumes that the auto-regressive parameter, \( \rho , \) is panel specific. Hence, the candidate panel unit root tests that fit these criteria are the IPS and Fisher-type tests. Another advantage of these tests is that they can be used to test a series which is not serially independent across cross sections.

  1. (a)

    The Im–Pesaran–Shin test

The following is a panel unit root test as proposed by Pesaran (2007) which accounts for cross-sectional dependence . The standard Augmented Dickey–Fuller (ADF) regressions are further augmented with cross-sectional averages of lagged levels and first differences of individual series. Let \( y_{i,t} \) be the observation on the ith cross-sectional unit at time t, and suppose that it is generated according to the simple dynamic linear heterogeneous panel data model:

$$ y_{i,t} = \left( {1 - \phi_{i} } \right)\mu_{i} + \phi_{i} y_{i,t - 1} + e_{it} $$
(12.23)
$$ \text{where}\quad e_{it} = \gamma_{i} f_{t} + \upvarepsilon_{it} ;i = 1 \ldots N;\quad t = 1 \ldots T. $$

The initial value, \( y_{i,0} \), has a given density function with a finite mean and variance, and the error term, \( e_{it} \), has a single-factor structure. \( f_{t} \) is the unobserved common effect, and \( \upvarepsilon_{it} \) is an individual-specific (idiosyncratic) error. The unit root hypothesis of interest is expressed as:

$$ H_{0}{:}\,\phi_{i} = 1\,\text{for}\, \text{all}\,i\,{\text{against}}\,{\text{the}}\,{\text{possibly}}\,{\text{heterogeneous}}\,{\text{alternatives}} $$
$$ H_{1}{:}\,\phi_{i} < 1, i = 1, 2, \ldots .N_{1} ,\phi_{i} = 1, i = N_{1} + 1,N_{2} + 2, \ldots ., N $$

\( N_{1} /N, \) a fraction of the individual processes that are stationary, is nonzero and tends to the fixed value δ such that 0 < δ < 1 as \( N \to \infty \). This condition is necessary for the consistency of unit root tests.

  1. (b)

    Fisher-type tests

Maddala and Wu (1999) provide a Fisher-type panel unit root test which accounts for cross-sectional dependence. Like the IPS test, the Fisher-type test is a way of combining evidence on the unit root hypothesis from the N unit root tests performed on N cross-sectional units. The fisher-type test makes this approach more explicit. It combines p values from panel-specific unit root tests using four methods. Three of the methods differ in whether they use inverse chi-square, inverse-normal, or inverse-logit transformation of p values, and the fourth is a modification of the inverse Chi-square transformation. The inverse-normal Z statistic offers the best trade-off between size and power.

Let \( G_{{i,T_{i} }} \) be a unit root test statistic for the ith group, and assume that as T i  → ∞, then \( G_{{i,T_{i} }} = > G_{i} \). Let p i be the p value of a unit root test for cross section i, that is, pi = 1 − F(\( G_{{i,T_{i} }} \)), where F(·) is the distribution function of random variable \( G_{i} \). In Chen (2013), the Fisher-type test is given as:

$$ P = - 2\mathop \sum \limits_{i = 1}^{N} \ln p_{i} $$
(12.24)

P is distributed as χ2 with 2N degrees of freedom as T → ∞ for all N. \( p_{i} \) value closer to zero (\( \ln p_{i} \) closer to −∞) implies large value of P, and then, the null hypothesis of the existing panel unit root is rejected. \( p_{i} \) value closer to 1 (\( \ln p_{i} \) closer to zero) implies that the panel unit root does exist.

3.5 Panel Co-integration Tests

There are two possibilities to deal with nonstationary variables in a given model after the stationarity test. First, to test whether the linear combination of nonstationary variables is stationary by using the co-integration test. If they are co-integrated, then we proceed to a long-run analysis with the nonstationary variables. Otherwise, we difference the stationary variables for a short-run analysis.

Engle and Granger (1987 ) noted that ‘a test for co-integration can be thought as a pretest to avoid “spurious regression” situations.’ If regression of one nonstationary variable over another nonstationary variable yields a stationary series, it is known as a co-integrating regression and the slope parameter in such a regression is known as a co-integrating parameter.

We employ a residual-based Pedroni co-integration test which is simply a unit root test applied to the residuals obtained from a co-integrating regression. If variables are co-integrated, then the residuals should be I(0). If the variables are not co-integrated, then the residuals are not I(0) (Pedroni 2004). The test allows for heterogeneous intercepts and trend coefficients across cross sections. It is based on a residual obtained from a regression:

$$ y_{it} = \alpha_{i} + \delta_{i} t + \beta_{1i} x_{1i,t} + \beta_{2i} x_{2i,t} + \cdots + \beta_{Mi} x_{Mi,t} + e_{i,t} $$
(12.25)

for \( t = 1, \ldots,T; \) \( i = 1, \ldots , M; \) \( m = 1, \ldots , M; \) and \( x \) and \( y \) are assumed to be integrated of order 1, I(1). The parameters \( \alpha_{i} \) and \( \delta_{i} \) are individual and trend effects. Pedroni proposes seven different statistics to test panel data co-integration: panel v-statistic, panel rho-statistic, panel PP-statistic, panel ADF-statistic, group rho-statistic, group PP-statistic, and group ADF-statistic. The first four are based on pooling or the ‘within’ dimension, and the last three are based on the ‘between’ dimension. The null hypothesis is no co-integration for both. However, the alternative hypothesis is ρ i  = ρ < 1 for all i in the former, and it is ρ i  < 1 for all i in the latter (Pedroni 2004).

3.6 Estimation Method

The choice of estimation method mainly depends on the results of preliminary tests of data. In our case, we looked for a method that helped an analysis of nonstationary variables which were co-integrated. We considered a method that provides estimated coefficients for individual countries. Therefore, we are not supposed to consider traditional estimators such as Pooled OLS, fixed effect, and first-difference OLS models which assume homogeneous technology parameters and factor loadings (common slope) . Eberhardt et al. (2011) and others have suggested using the pooled mean group estimation method for analyzing nonstationary variables which are co-integrated in a long panel setting. This method is helpful for heterogeneous technology parameters and factor loadings in particular.

The pooled mean group (PMG) estimator involves averaging and pooling. It restricts long-run coefficients to be homogenous over cross sections, but allows for heterogeneity in intercepts, short-run coefficients (including the speed of adjustment), and error variances. It is argued that country heterogeneity is particularly relevant in short-run relationships given that countries may be affected by over-lending, borrowing constraints and financial crises in short-time horizons. Homogenous long-run relationships may be assumed for reasons such as budget or solvency constraints, arbitrage conditions, or common technologies (Cavalcanti et al. 2011).

The relationship in pooled mean group estimation may be defined by an ARDL model as:

$$ {\Delta }q_{it} = \alpha_{i} + \beta_{i} \Delta x_{it} + \lambda_{i} \left( {q_{i,t - 1} - \theta x_{i,t - 1} } \right) + e_{it} $$
(12.26)

where \( q = lnQ \) and \( x = lnX. \) \( \beta_{i} \) are short-run parameters, which like \( \sigma_{i}^{2} \) differ across countries. Error correction term¸ \( \lambda_{i} \), also differs across i, long-run parameter; θ, however, is constant across the groups. This estimator is quite appealing when studying small sets of arguably ‘similar’ countries. In I(1) panels, this estimator allows for a mix of co-integration (\( \lambda_{i} > 0) \) and non-co-integration (\( \lambda_{i} = 0) \). \( x_{i,t} \) represents the set of explanatory variables defined in Eq. (12.21).

To account for cross-sectional dependence which may result from any common unobserved factor incorporated in the error term, we follow Pesaran’s (2013) common correlated effect approach. Unlike de-meaning, the approach handles multiple factors which can be correlated with regressors and serial correlation in errors and lagged dependent variables (Shin 2014). It does not require prior knowledge of the number of unobserved common factors and can be applied to dynamic panels with heterogeneous coefficients and weakly exogenous regressors (Pesaran 2013). The procedure consists of approximating the linear combinations of unobserved common factors by cross-sectional averages of the dependent and explanatory variables and then running standard panel regressions augmented with these cross-sectional averages.

The PMG estimator for a cross-sectionally dependent series may be explicitly defined as:

$$ \Delta q_{it} = \alpha_{i} + \beta_{i} \Delta x_{it} + \lambda_{i} \left( {q_{i,t - 1} - \theta x_{i,t - 1} } \right) + \gamma_{it} f_{t} + {\upvarepsilon }_{it} $$
(12.27)
$$ \text{where}\,\gamma_{i}^{{\prime }} f_{t} + {\upvarepsilon }_{it} = e_{it} $$

\( f_{t} \) is a vector of unobserved common shocks which captures the source of error term dependencies across countries. It may be stationary or nonstationary. The impacts of these factors on each country are governed by the idiosyncratic loadings in \( \gamma_{it} \). The individual-specific errors, \( \upvarepsilon_{it} \), are distributed independently across i and t; they are not correlated with the unobserved common factors or the regressors; and they have zero mean, variance greater than zero, and finite fourth moments (Cavalcanti et al. 2011).

The augmented pooled mean group estimator is, therefore, defined by substituting cross-sectional averages for the unobserved common factors, \( f_{t} \).

$$ \Delta q_{it} = \alpha_{i} + \beta_{i} \Delta x_{it} + \lambda_{i} \left( {q_{i,t - 1} - \theta x_{i,t - 1} } \right) + \frac{1}{N}\mathop \sum \limits_{l = 1}^{PT} \delta \bar{z}_{w,t - l} + {\upvarepsilon }_{it} $$
(12.28)

where \( \bar{z}_{w,t} \) represents a set of cross-sectional averages of the dependent and independent variables and their lagged values which approximate/proxy the unobserved common factors \( (f_{t}) \). The focus of this estimator is on obtaining consistent estimates of parameters related to observable variables, while the estimated coefficients on cross-sectionally averaged variables are not interpretable in a meaningful way: They are merely present to alter the biasing impact of unobservable common factors (Eberhardt 2012).

3.7 Error Correction Mechanism (ECM)

If two/more variables are co-integrated or prove to have a long-run relationship, then one needs to go for an error correction mechanism. The error correction mechanism (ECM) is a method used to correct any short-run deviations of variables from their long-run equilibrium; that is, it corrects for short-run disequilibrium. An important theorem, the Granger representation theorem, states that if two variables Y and X are co-integrated, then the long-term or equilibrium relationship that exists between the two can be expressed as ECM (Engle and Granger 1987 ) . This means that one shall go for the construction of an error correction model if the two variables are co-integrated. ECM is given as follows in Bhattarai (2011) for ARDL (1,1) with \( \beta_{i} = 0: \)

$$ q_{it} = \alpha_{i} + \gamma_{i} q_{it - 1} + \beta_{i} x_{it} + \theta_{i} x_{i,t - 1} + {\upvarepsilon }_{it} $$
$$ {\Delta }q_{it} = \alpha_{i} + \beta_{i} {\Delta }x_{it} + \lambda_{i} u_{i,t - 1} + {\upvarepsilon }_{it} $$
(12.29)

Δ denotes the first-difference operator, \( {\upvarepsilon }_{it} \) is a random error term, and \( u_{i,t - 1} = \left( {q_{i,t - 1} - \theta x_{i,t - 1} } \right) \) is one-period lagged value of error term from a co-integrating regression.

This ECM equation states that \( {\delta }q_{it} \) depends on \( {\delta }x_{it} \) and also on the equilibrium error term. If the error term is nonzero, the model is out of equilibrium. Suppose \( {\delta }x_{it} \) is zero (Bhattarai 2011) and \( u_{i,t - 1} \) is positive, it means \( q_{it - 1} \) is too high (above) to be in equilibrium. Since \( \lambda_{i} \) is expected to be negative, the term \( \lambda_{i} u_{i,t - 1} \) is negative, and therefore, \( {\delta }q_{it} \) will be negative to restore equilibrium. That is, if \( q_{it} \) is above its equilibrium value, it will start falling in the next period to correct the equilibrium error. Similarly, if \( u_{i,t - 1} \) is negative (i.e., \( q_{it} \) is below its equilibrium value), \( \lambda_{i} u_{i,t - 1} \) will be positive, which causes \( {\delta }q_{it} \) to be positive, leading \( q_{it} \) to rise in the next period. The absolute value of \( \lambda_{i} \) determines how quickly the equilibrium is restored (Engle and Granger 1987 ) .

4 Empirical Results

4.1 Test Results

This analysis begins by performing different econometric tests. Since not all unit roots provide the appropriate results, a cross-sectional independence test was performed to decide the type of panel unit root test to be considered. Using the Pesaran CD test, and possibly all other tests, the null hypothesis of cross-sectional independence was rejected for the original data. Hence, the series for our data was initially cross-sectionally dependent. However, after the data were augmented for cross-sectional averages to eliminate unobserved common factors, the Pesaran CD test, and possibly two other tests, failed to reject the null hypothesis of cross-sectional independence. The test results are given in Annexure 2.

IPS and Fisher-type tests are panel unit root tests which account for cross-sectional dependence . The results of the tests with different assumptions are given in Annexure 2. All the variables are nonstationary at the 1% level of significance.

The next step is to test for co-integration—whether there is a long-run relation between our nonstationary variables. The test for co-integration is residual based. We used two Pedroni type tests (ADF and PP tests) and the IPS test. In all the cases, we strongly reject the null hypothesis of no co-integration for both types of models (augmented and non-augmented) (see Annexure 2). Augmented models include cross-sectional averages of dependent and independent variables to account for cross-sectional dependence .

We propose three types of augmented models for the model selection criterion: models I, II, and III with one, two, and three explanatory variables, respectively. Even though the model selection criterion suggests that a model with three variables is our ‘best model’ in terms of log-likelihood ratio and Akaike information criteria (see Annexure 2), we present the results of the other models as well for comparison.

4.2 Estimation Results

Unlike most previous studies, the results of our study were not uniform across all developing countries. The impact of productivity growth on the real exchange rate differed by income group or per capita income. Productivity growth led to an appreciation in middle-income countries and depreciation in low-income countries in the long run. Our results substantiate the findings of Drine and Rault (2002, 2004) and Chuah (2012). Drine and Rault (2002, 2004) found evidence for the hypothesis in a study for middle-income countries (MICs) in 2002 and failed to arrive at the same conclusion for low-income countries (LICs) in another study in 2004. Our findings also seem to be in implicit confirmation of Chuah’s (2012) results. He calculated a turning point ($2200) below which change in income resulted in depreciation of the real exchange rate. Almost all LICs in our study had a per capita income less than $2200. The conclusions of Chuah’s (2012) study coincide with our conclusions for LICs such as Indonesia, Kenya, Nigeria, Tanzania, and Uganda.

Table 12.1 shows the long-run results of the panel co-integration estimation using the augmented PMG estimator for different groups of countries and models in the sample. We follow the tradition of presenting estimated coefficients of only observable variables as cross-sectionally averaged variables are not directly interpretable in a meaningful way. Estimated coefficients of full models (with observable and unobservable variables) are reported in Annexure 3.

Table 12.1 Panel co-integration estimation: the augmented PMG estimator

Basically, we consider three types of models in comparing three types of groups: the all countries group (15 countries), country groups by income (middle-income countries (MICs), 5 countries; low-income countries (LICs), 10 countries), and country groups by region (Africa, 9 countries; Asia, 4 countries). For each group in Table 12.1, the first row shows a model with one explanatory variable (productivity); the second row shows a model with two explanatory variables (productivity and government expenditure); and the third row shows a model with three explanatory variables (productivity—\( { \ln }(Y/Y^{*} ), \) government expenditure—\( {\text{ln}}(G/G^{*} ), \) and exchange rate volatility—\( {\text{vol}}(E)) \). The center of our discussion is Model III (shaded rows) for each group below.

In general, the results in Table 12.1, in general, show that the Balassa hypothesis holds for all countries as a group in the sample in the long run; that is, a 1% improvement in productivity leads to an appreciation of domestic currencies in the developing countries in the group by 0.388% on average. We find a different result, however, when the sample is categorized into different groups. When categorized by level of per capita income, the results show that the Balassa hypothesis holds only for middle-income countries (MICs). The same fact holds when countries are categorized by region; that is, the Balassa hypothesis holds only for Asian countries. This may be related to the fact that in our sample, most middle-income countries are from Asia and poor countries are from Africa. In both the cases, a 1% increase in productivity appreciates the domestic currencies of countries in MICs and Asia groups nearly by 0.34 and 0.77%, respectively (though only at the 10% level of significance for the latter). For LICs and Africa groups, a 1% increase in productivity depreciates domestic currencies of countries in the groups by nearly 0.287 and 0.247%, respectively.

The long-run relationship between government expenditure and the real exchange rate shows that expansionary fiscal policies result in appreciation of domestic currencies in all cases except for the LICs and Asia groups. This may not be surprising as the major countries with ‘big economies’ in both the groups are almost similar (Indonesia and the Philippines are members of both groups). The results of these groups are in line with Kohler’s (1998) argument who states that government expenditure does not have an impact in the long run unless financed by distortionary taxes.

Exchange rate volatility has the impact of depreciating the real exchange rate for all countries in all groups except the middle-income group in the long run. This confirms theoretical arguments which associate relatively fixed or highly managed exchange rate systems (mainly in poor countries) to depreciation and flexible regimes to appreciation in the real exchange rate.

Table 12.2 presents the results of short-run dynamics of the same groups of countries and models as given in Table 12.1. The discussion that follows focuses on Model III (the shaded rows). Short-run dynamics show that the impact of change in productivity on change in the real exchange rate is significant but negative; that is, it has the impact of depreciating the real exchange rate for all countries in all groups in the short run.

Table 12.2 Short-run dynamics of panel co-integration estimation: the augmented PMG estimator

Fiscal policy does not significantly explain the variations in the real exchange rate. Exchange rate volatility has an impact only in MICs and all countries groups. It negatively impacts the real exchange rate in the short run. This may be due to greater rigidity in the short run.

The (negative) signs and statistical significance of the error correcting terms show that the system is stable. A stable co-integrating relationship adjusts short-run deviations by the extent of the error correcting term. The rate of adjustment is, however, higher (12%) in MICs than LICs (8%). This means MICs have a faster rate of adjustment and achieve equilibrium earlier than LICs. This may be associated with better conditions to fulfill assumptions of the model in the former group.

Tables 12.3 and 12.4 present the short-run dynamics for individual countries in two income groups (MICs and LICs), respectively. The results are for Model III.

Table 12.3 Short-run dynamics by country: middle-income group (BRICS): Model III
Table 12.4 Short-run dynamics by country: low-income group

The short-run dynamics show that the impact of productivity on the real exchange rate was significant and negative for all countries except Brazil and South Africa. Productivity did not have an impact on the real exchange rate in these countries in the short run. Expansionary fiscal policies resulted in depreciation of the real exchange rate in Brazil, Russia, and China. The role of exchange rate volatility was significant in all countries. However, the effect was exceptionally positive in Russia.

The rate of adjustment was the highest in Russia (56.25%) followed by Brazil (22.76%). This may be associated with the size and features of these economies. These are the two biggest economies in the group which account for 40 and 20% of the US economy, respectively. A faster rate of adjustment means that they can achieve equilibrium earlier than others.

Table 12.4 presents the short-run dynamics for LICs. The short-run dynamics shows that the impact of productivity on the real exchange rate was significant and negative for all countries except Indonesia and Uganda. Productivity did not impact the real exchange rate in the short run in these countries. The role of fiscal policy was significant in all countries even though the effect was different. The increase in government expenditure resulted in a depreciation of the real exchange rate in all countries except in Ghana, Kenya, Indonesia, and the Philippines. The strongest impact of the fiscal policy was shown by Uganda (0.15%). The impact of exchange rate volatility was significant in all countries except Indonesia.

The rate of adjustment was the highest in Kenya (24.89%) followed by the Philippines (14.58%) and Ethiopia (13.63%). These three countries may achieve equilibrium earlier than others in the group.

5 Conclusions and Policy Implications

5.1 Conclusions

Unlike most previous studies, the results of our study are not uniform across all the developing countries in our sample. The impact of productivity growth on the real exchange rate varied by income group or per capita income. Productivity growth led to an appreciation of the real exchange rate in middle-income countries and depreciation of the real exchange rate in low-income countries in the long run. In general, the results of our study confirm that the relationship between the real exchange rate and productivity does exist and is stronger for higher income countries in the long run. Real per capita income matters more than the rate of economic growth in explaining the effects of the Balassa term in our study.

In the short run, however, we find almost uniform results across income groups. Productivity growth (possibly of non-tradables ), expansionary fiscal policies, and high exchange rate volatility result in the real exchange rate depreciation. More specifically:

  • Improvements in productivity and expansionary fiscal policies both have the impact of depreciating the real exchange rate in almost all the countries, both middle and low incomes.

  • The impact of exchange rate volatility is significant only in middle-income countries. This may be associated with the type of exchange rate policy/regime adopted. It is mainly fixed (unchanged) in low-income countries in which case it may not be useful to explain variations in the real exchange rate in the short run.

The reasons for the anti-Balassa hypothesis results in low-income countries in our study may be associated with a failure to satisfy the basic assumptions of the model. The relationship between the real exchange rate and productivity in the external version of the hypothesis assumes a positive relationship between productivity and relative prices as well as relative prices and the real exchange rate in the internal version. In addition, the law of one price must hold in the tradable sector.

5.2 Policy Implications

On the basis of our findings, we recommend the following policy options for MICs and LICs:

  • The Balassa hypothesis holds for middle-income countries in our sample. Economic growth leads to an appreciation in the real exchange rate in these countries. Hence, countries in this group may promote growth by increasing productivity in the tradable sector.

  • Since the Balassa hypothesis does not hold for low-income countries in our sample, economic growth does not lead to the real exchange rate appreciation in these countries. Hence, countries in this group may continue to grow by promoting productivity growth in the non-tradable sector.

  • Depreciation of the real exchange rate can be associated with improvements in the productivity of the non-tradable sector for low-income countries and should be used accordingly.

  • The role of fiscal policy may not last long in low-income countries and so should be used accordingly.