1 Introduction

One of the central issues in macroeconomics and finance is the stochastic properties of the real interest rates. The question of whether the real interest rate is constant and follows a mean reverting stochastic process has attracted a great deal of attention among economists since the seminal work of Fisher (1930). At stake, among other things, is the validity of the intertemporal Euler equation implied by the consumption-based asset pricing models (CAPM) (Lucas 1978; Breeden 1979; Hansen and Singleton 1982), the Black-Scholes model used to determine the price of options (Black and Scholes 1973), the Fisher hypothesis and efficiency of capital markets, the long-run super neutrality hypothesis, the real interest rate parity hypothesis (RIPH) and the implications of many models of monetary transmission mechanism. Thus, many important theoretical models in the macroeconomics literature imply or assume stationary real interest rates. However, the empirical studies after decades of investigation still could not provide a definite answer regarding the stationary properties of the real interest rates.

After a comprehensive overview, the following conclusions can be drawn from the existing econometric analysisFootnote 1. First, when the conventional univariate unit roots are applied to the data, the null hypothesis of a unit root in general could not be rejected (e.g., Rose 1988; Moosa and Bahatti 1996; Kapetanios et al. 2003; Choi and Moh 2007; Tsong and Lee 2012). Second, there is strong evidence that the real interest rate reverts to a long-run mean, but that the adjustment to this mean follows a nonlinear process (e.g., Garcia and Perron 1996; Kapetanios et al. 2003; Million 2004; Lanne 2006; Christopoulos and Leon-Ledesma 2007). In particular, the real interest rates are shown to exhibit threshold dynamics, which coincides quite well with the central bank’s “opportunistic disinflation” policy strategy. Third, many empirical studies report the existence of structural breaks in the mean of the real interest rate series, which if exists greatly reduces the power of the conventional unit root tests (e.g., Clemente et al. 1998; Bai and Perron 1998; Caporale and Grier 2000; Bai and Perron 2003; Lai 2008). Fourth, studies that utilize a much longer span of data and/or allow for the use cross-sectional information in the panel data tend to find strong evidence in favor of mean reverting behavior in the real interest rates (e.g., Costantini and Lupi 2007; Sekioua and Zakane 2007; Westerlund 2008; Tsong and Lee 2013; Pesaran et al. 2013). Thus, it seems that the reported empirical results that drive a wedge between the theoretical models and the data are due to the econometric challenges involved in identifying stationary real interest rate series. To the best of our knowledge, the empirical literature has not considered testing for the long run mean reversion of the real interest rates using panel data allowing for structural breaks and regime-wise nonlinearity together with cross-sectional dependence. However, as mentioned above, existing evidence clearly points to the presence of both structural breaks and nonlinear dynamics in the real interest rate at the international level, which implies that these have to be accounted for when analyzing the stochastic properties of the series in a panel setting. Hence, in this paper to analyze the stochastic properties of the real interest rate series, we accomplish this task and develop unit root tests that allow simultaneously for the existence of structural breaks and regime-wise nonlinearity in heterogeneous panels (see Lundbergh et al. 2000; Skalin and Teresvirta 2002; Lütkepohl et al. 1998; Sollis et al. 2002 and Sollis 2004 for the univariate case).

To this end, the present study extends the nonlinear unit root testing framework of Sollis (2004) to heterogeneous panels by drawing upon the work of Im et al. (2003) (hereafter, IPS). This allows us to combine smooth transition (ST) and threshold autoregressive (TAR) models within a panel frameworkFootnote 2. Furthermore, to correct for the size distortion that is caused by cross-sectional dependence (CSD), we implement Chang’s (2004) sieve bootstrap methodologyFootnote 3. Small sample properties of the newly developed tests are compared to that of the conventional IPS panel unit root test, which ignores the presence of both structural breaks and nonlinearity. We have also compared our test with the panel Leybourne et al. (1998) (PLNV) test proposed by Omay et al. (2013) that only considers the presence of smooth structural breaks in a heterogeneous panel setting (henceforth, OHS). We show that our newly proposed test has significant power gains over both the IPS and PLNV test. This makes our proposed test quite relevant for analyzing the stationary properties of the real interest rate series, which are argued to exhibit both threshold dynamics and structural breaks.

By implementing our newly proposed panel unit root tests to the real interest rates of 17 OECD countries over the period 1979:Q2–2011:Q2, we find conclusive evidence that the real interest rates are integrated of order zero (i.e., I(0)). Also, the scope of the paper covers different terms of the real interest rate—short-term and long-term—that provides a comprehensive analysis of the topic. Particularly, using the panel ST-TAR unit root tests, we show that the short-term real interests rates of all the 17 OECD countries are stationary around a broken trend. For the long-term interest rates, only the real interest rate series of France is found to contain a unit root.

The remainder of the article is structured as follows. In Sect. 2, the newly proposed unit root tests are developed. In Sect. 3, the critical values of this test under cross sectional independence are presented. Sections 4 and 5 illustrate the methodology used to handle the cross-sectional dependence problem and provide the empirical powers of our test in comparison to that of the IPS and the OHS tests. Section 6 applies the newly proposed unit root tests to analyze the mean reversion of the real interest rates. Section 7 is reserved for concluding remarks.

2 The model and testing framework

Let \(y_{it}\) be a panel ST-TAR process with changing trend function on the time domain \(t=1,2,\ldots ,T\) for the cross-sectional units \(i=1,2,\ldots ,N\). Consider \(y_{it}\), which is generated by one of the following smooth transition (ST) processes:

$$\begin{aligned}&\hbox {Model A}{:}\quad y_{it} =\alpha _{i1} +\alpha _{i2} S_{it} \left( {\gamma _i ,\tau _i } \right) +\varepsilon _{it} \end{aligned}$$
(2.1)
$$\begin{aligned}&\hbox {Model B}{:}\quad y_{it} =\alpha _{i1} +\beta _{i1} t+\alpha _{i2} S_{it} \left( {\gamma _i ,\tau _i } \right) +\varepsilon _{it} \end{aligned}$$
(2.2)
$$\begin{aligned}&\hbox {Model C}{:}\quad y_{it} =\alpha _{i1} +\beta _{i1} t+\alpha _{i2} S_{it} \left( {\gamma _i ,\tau _i } \right) +\beta _{i2} tS_{it} \left( {\gamma _i ,\tau _i } \right) +\varepsilon _{it} \end{aligned}$$
(2.3)

where \(S_{it} (\gamma _i ,\tau _i )\) is the logistic smooth transition (LST) function based on a sample of size T and N, \(S_{it} \left( {\gamma _i ,\tau _i } \right) =[1+\exp \{-\gamma _i (t-\tau _i T)\}]^{-1}\) where \(\gamma _i >0\) and \(\tau _i \) determine the mid-point of transformation for each \(i=1,2,\ldots ,N\), and \(\varepsilon _{it} \) is generated by the following TAR model;

$$\begin{aligned} \Delta \varepsilon _{it} =\rho _{i1} I_{it} \varepsilon _{i,t-1} +\rho _{i2} (1-I_{it} )\varepsilon _{i,t-1} +\eta _{it} \end{aligned}$$
(2.4)

where \(I_{it} \) is the Heaviside indicator function such that \(I_{it} =1\) if \(\varepsilon _{i,t-1} \ge 0\), \(I_{it} =0\) if \(\varepsilon _{i,t-1} <0\), and \(\eta _{it} \) is a zero-mean stationary process.

In Eq. (2.4) if \(H_0{:}\,\rho _{i1} =\rho _{i2} =0\) for all i, then \(\varepsilon _{it}\) and therefore \(y_{it}\) contains a unit root, while if \(\rho _{i1} =\rho _{i2} <0\) for some i, \(y_{it}\) is a stationary panel ST-TAR process with symmetric adjustment. On the other hand, if \(\rho _{i1} <0\), \( \rho _{i2} <0\) and \(\rho _{i1}\ne \rho _{i2}\) for some i, then \(y_{it}\) is a stationary panel ST-TAR process this time displaying asymmetric adjustment. These hypotheses are valid irrespective of whether Model A, B or C is used to describe the deterministic components of \(y_{it}\).

In this modeling strategy, likewise Leybourne et al. (1998) (hereafter, LNV), the structural change is modeled as a smooth transition between different regimes rather than as an instantaneous structural break. The transition function \(S_{it}\left( {\gamma _i,\tau _i}\right) \) is a continuous function bounded between 0 and 1. Thus, the ST model can be interpreted as a regime-switching model that allows for two regimes associated with the extreme values of the transition function, \(S_{it}\left( {\gamma _i,\tau _i}\right) =0\) and \(S_{it}\left( {\gamma _i,\tau _i}\right) =1\), where the transition from one regime to the other is gradual. The parameter \(\gamma _i\) determines the smoothness of the transition, and thus, the smoothness of transition from one regime to the other. The two regimes are associated with the small and large values of the transition variable \(s_{it}=t\) relative to the threshold \(\tau _i\). For the large values of \(\gamma _i\), \(S_{it}\left( {\gamma _i,\tau _i}\right) \) passes through the interval (0,1) very rapidly, and as \(\gamma _i\) approaches \(+\infty \) this function changes value from 0 to 1 instantaneously at time \(t=\tau _i T\). Therefore, if we assume that \(\varepsilon _{it}\) is a zero mean I(0) process and use model (1), then \(y_{it}\) becomes a stationary process around a nonlinear mean which changes from the initial value \(\alpha _{i1}\) to the final value \(\alpha _{i1}+\alpha _{i2}\). However, assuming that \(\varepsilon _{it}\) is generated by the TAR model given in Eq. (2.4) allows asymmetric adjustment around this nonlinear mean with differing degrees of autoregressive persistence parameters given by \(\rho _{i1}\) and \(\rho _{i2}\). Similar conditions follow for models given by Eqs. (2.2) and (2.3)Footnote 4.

The models given in (2.1), (2.2) and (2.3) can be used to test the following hypothesis:

$$\begin{aligned}&H_0{:}\;y_{it} =\mu _{it} , \mu _{it} =\mu _{i,t-1} +\varepsilon _{it} ,\;\mu _{i0} =\psi _i \end{aligned}$$
(2.5a)
$$\begin{aligned}&H_1{:}\;\hbox {Model A}, \hbox {Model B} \hbox { or } \hbox {Model C} \end{aligned}$$
(2.5b)

or

$$\begin{aligned}&H_0{:}\;y_{it} =\mu _{it} , \mu _{it} =\kappa _i +\mu _{i,t-1} +\varepsilon _{it} ,\;\mu _{i0} =\psi _i \end{aligned}$$
(2.6a)
$$\begin{aligned}&H_1{:}\;\hbox {Model B} \hbox { or } \hbox {Model C} \end{aligned}$$
(2.6b)

where \(\varepsilon _{it}\) is assumed to be a stationary processes with zero mean.

Following Sollis (2004), LNV and OHS the test statistics proposed here are calculated using a two-step procedure:

Step 1 Using a nonlinear least squares (NLS) algorithm, we estimate only the deterministic components of the preferred model for each of the cross-sectional units and collect the NLS residuals by

$$\begin{aligned}&\hbox {Model A}\qquad \hat{{\varepsilon }}_{it} =y_{it} -\hat{{\alpha }}_{i1} -\hat{{\alpha }}_{i2} S_{it} \left( {\hat{{\gamma }}_i ,\hat{{\tau }}_i } \right) \\&\hbox {Model B}\qquad \hat{{\varepsilon }}_{it} =y_{it} -\hat{{\alpha }}_{i1} -\hat{{\beta }}_{i1} t-\hat{{\alpha }}_{i2} S_{it} \left( {\hat{{\gamma }}_i ,\hat{{\tau }}_i } \right) \\&\hbox {Model C}\qquad \hat{{\varepsilon }}_{it} =y_{it} -\hat{{\alpha }}_{i1} -\hat{{\beta }}_{i1} t-\hat{{\alpha }}_{i2} S_{it} \left( {\hat{{\gamma }}_i ,\hat{{\tau }}_i } \right) -\hat{{\beta }}_{i2} tS_{it} \left( {\hat{{\gamma }}_i ,\hat{{\tau }}_i } \right) \end{aligned}$$

Step 2 We test for the presence of a unit root in the residuals obtained from the first step regression using the following TAR model,

$$\begin{aligned} \Delta \hat{{\varepsilon }}_{it} =\rho _{i1} I_{it} \hat{{\varepsilon }}_{i,t-1} +\rho _{i2} (1-I_{it} )\hat{{\varepsilon }}_{i,t-1} +\sum _{j=1}^{k_i } {\delta _{ij} } \Delta \hat{{\varepsilon }}_{i,t-j} +\eta _{it} \end{aligned}$$
(2.7)

Sollis t-test for the \(i\mathrm{{th}}\) individual is the most significant of the t-statistics used for testing \(\rho _{i1} =0\) and \(\rho _{i2}=0\) in Eq. (2.7) and is defined by:

$$\begin{aligned} t_i =\max \left( {t_{i1} ,t_{i2} } \right) \end{aligned}$$
(2.8)

where \(t_{i1}\) and \(t_{i2}\) are individual t-statistics for \(\rho _{i1}=0\) and \(\rho _{i2}=0\), respectively. The t-statistics are defined as follows:

$$\begin{aligned} t_{ij} =\frac{\sqrt{T-k_i -2}\left( {\hat{{\varepsilon }}_{ij,-1}^{\prime } M_{Q_{ij} } \Delta \hat{{\varepsilon }}_i } \right) }{\left( {\hat{{\varepsilon }}_{ij,-1}^{\prime } M_{Q_{ij} } \hat{{\varepsilon }}_{ij,-1} } \right) ^{1/2}\left( {\Delta \hat{{\varepsilon }}_i^{\prime } M_{X_{ij} } \Delta \hat{{\varepsilon }}_i } \right) ^{1/2}} j=1,2 \end{aligned}$$
(2.9)

where \(\Delta \hat{{\varepsilon }}_i =\left( {\Delta \hat{{\varepsilon }}_{i1} ,\Delta \hat{{\varepsilon }}_{i2} ,...\Delta \hat{{\varepsilon }}_{iT} } \right) ^{{\prime }}\), \(\hat{{\varepsilon }}_{i1,-1} =\left( {I_i \hat{{\varepsilon }}_{i0} ,,\ldots ,I_i \hat{{\varepsilon }}_{i,T-1} } \right) ^{{\prime }}\), \(\hat{{\varepsilon }}_{i2,-1} =\left( {(1-I_i )\hat{{\varepsilon }}_{i0} ,\ldots ,(1-I_i )\hat{{\varepsilon }}_{i,T-1} } \right) ^{{\prime }} \), \(Q_{i1} =\left( {(1-I_i )\hat{{\varepsilon }}_{i2,-1} ,\Delta \hat{{\varepsilon }}_{i,-1} ,\ldots ,\Delta \hat{{\varepsilon }}_{i,-k_i } } \right) \), \(Q_{i2} =\left( {I_i \hat{{\varepsilon }}_{i1,-1} ,\Delta \hat{{\varepsilon }}_{i,-1} ,\ldots ,\Delta \hat{{\varepsilon }}_{i,-k_i } } \right) \), \(M_{Q_{ij}}=I_T -Q_{ij}\left( {Q_{ij}^{\prime }Q_{ij}}\right) ^{-1}Q_{ij}^{\prime } \), \(M_{X_{ij} } =I_T -X_{ij} \left( {X_{ij}^{\prime } X_{ij} } \right) ^{-1}X_{ij}^{\prime } \), \(X_{ij} =\left( {\hat{{\varepsilon }}_{ij,-1} ,Q_{ij} } \right) \) and \(I_T \) is an identity matrix of order T.

For testing the unit root null \(H_0{:}\;\rho _{i1} =\rho _{i2} =0\) in Eq. (2.7), non-standard individual F-test statistics is given by:

$$\begin{aligned} F_i =\left( {R\hat{{P}}_i } \right) ^{{\prime }}\left[ {\hat{{\sigma }}_i^2 R\left( {C_i^{\prime } C_i } \right) ^{-1}R^{{\prime }}} \right] ^{-1}\left( {R\hat{{{ P}}}_i } \right) /2 \end{aligned}$$
(2.10)

where \(C_i =\left[ {I_i \hat{{\varepsilon }}_{i,-1} , \left( {1-I_i } \right) \hat{{\varepsilon }}_{i,-1} , \Delta \hat{{\varepsilon }}_{i,-1} , \ldots ,\Delta \hat{{\varepsilon }}_{i,-k_i } } \right] \), \(R=\left[ {I_2 , 0_{2\times k_i } } \right] \), \(\hat{{\mathrm{P}}}_i =\left[ {\hat{{\rho }}_{i1} ,\hat{{\rho }}_{i2} } \right] ^{{\prime }}\), which \(\hat{{\rho }}_{i1} \) and \(\hat{{\rho }}_{i2} \) are the Ordinary Least Square (OLS) estimators of \(\rho _{i1} \) and \(\rho _{i2} \), and \(\hat{{\sigma }}_i^2 \) is the OLS estimator of \(\sigma _i^2 \).

We propose testing for whether \(y_{it}\) contains a unit root using the F-statistic for testing \(\rho _{i1} =\rho _{i2} =0\) in (2.7), and/or the most significant of the t-statistics from those for testing \(\rho _{i1} =0\) and \(\rho _{i2} =0\). When the full model is given by Eqs. (2.1) and (2.4), the relevant F and t-statistics will be referred to as \(\bar{{F}}_{1p,\alpha } \) and \(\bar{{t}}_{1p,\alpha } \). When the full model is given by Eqs. (2.2) and (2.4), the relevant F and t-statistics will be referred to as \(\bar{{F}}_{2p,\alpha }\) and \(\bar{{t}}_{2p,\alpha }\), and when the full model is given by Eqs. (2.3) and (2.4), the relevant F and t-statistics will be referred to as \(\bar{{F}}_{3p,\alpha }\) and \(\bar{{t}}_{3p,\alpha }\). For each of the models, we obtain the mean group statistics as follows:

$$\begin{aligned}&\displaystyle \bar{{F}}_{jp,\alpha }= N^{-1}\sum _{i=1}^N {F_i}\qquad j=1,2,3 \end{aligned}$$
(2.11)
$$\begin{aligned}&\displaystyle \bar{{t}}_{jp,\alpha }=N^{-1}\sum _{i=1}^N {t_i}\qquad j=1,2,3 \end{aligned}$$
(2.12)

As the NLS estimation of the parameters \(\gamma _i\) and \(\tau _i\) does not allow for closed-form solutions, it would be extremely hard to find any analytical relationship between the \(\hat{{\varepsilon }}_{it}\) and \(y_{it}\). This, of course, makes the determination of the null asymptotic distribution of the test statistics by analytical means more or less intractable (Leybourne et al. 1998). Leybourne et al. (1998) stated that the linearity property in the intercept and trend terms ensures that the residual \(\hat{{\varepsilon }}_i\) from all the models are invariant to the choice of the starting values (i.e., \(\mu _{i0} =\psi _i\)) and that Model A, B and C are invariant to both starting values and the drift term \(\kappa _i\). Therefore, the obtained critical values are invariant to the parameters \(\gamma \) and \(\tau \), and as a result invariant to \(\alpha \) and \(\beta \). To further prove this statement, we employ a new Monte Carlo design to see the invariance of the critical values with respect to the parameters \(\alpha ,\beta ,\gamma \) and \(\tau \). For this purpose, we use \(\left\{ {\gamma ,\tau } \right\} \): \(\left\{ {1.0,0.5} \right\} ,\left\{ {1.5,0.6} \right\} \) and \(\left\{ {0.5,0.4} \right\} \) for generating the critical values. The simulated density functions are obtained for \(N=1\), \(T=100\) for 1000 and 50000 trials and show us that the critical values are invariant to the parameters \(\alpha ,\beta ,\gamma \) and \(\tau \). The simulated density functions for different parameter values of \(\left\{ {\gamma ,\tau } \right\} \) are given in “Appendix”.

3 Critical values of the proposed tests under cross-sectional independence

In this section the critical values of the unit root tests proposed above will be given under the assumption of cross-sectional independence using Monte Carlo simulation. Although the panel unit root tests developed in this study are especially designed for analyzing unit root behavior in the real interest rates, they are also general enough to be used in testing for the presence of unit root in any macroeconomic variable that exhibits both nonlinearity and structural breaks. In studying group of countries that are heavily integrated like the group of OECD countries studied here and for macroeconomic variables with close cross country links like the real interest rates analyzed here, accounting for cross-sectional dependence is crucial. However, for other cases which involve the study of macroeconomic variables in which cross-sectional dependence does not constitute a serious problem the critical values tabulated in this section should be used.

The exact critical values of \(\bar{{t}}_{1p,\alpha }\) and \(\bar{{F}}_{1p,\alpha }\) statistics under the assumption of cross sectional independence are generated by Monte Carlo simulations with 5000 replications and reported for Model A in Table 1 Footnote 5.

Table 1 The exact critical values of test statistics for Model A

4 Sieve bootstrap algorithm under cross-sectional dependence

In this paper, we have followed Chang’s (2004) Sieve bootstrap methodology to overcome the cross-sectional dependence problem arising from investigating the real interest rates of the OECD countries that are heavily integrated. The Sieve bootstrap scheme used in this paper is summarized via a five-step algorithm in the following manner.

Step 1 Following Basawa et al. (1991), under the null of a unit root \((\hat{{\varepsilon }}_{it}=\Delta y_{it})\) we estimate the errors as:

$$\begin{aligned} \hat{{\upsilon }}_{it} =\hat{{\varepsilon }}_{it} -\sum _{j=1}^{p_i } {\hat{{\delta }}_{ij} \hat{{\varepsilon }}_{i,t-j} } \end{aligned}$$
(4.1)

where lag orders \(p_i\) are determined via information criteria (e.g. AIC, SBC) by starting with \(p_{\max }\).

Step 2 Stine (1987) states that the residuals have to be centered with

$$\begin{aligned} \tilde{\upsilon }_t =\hat{{\upsilon }}_t -(T-p)^{-1}\sum _{t=p+1}^T {\hat{{\upsilon }}_t } \end{aligned}$$
(4.2)

where \(\hat{{\upsilon }}_t =(\hat{{\upsilon }}_{1t} ,\hat{{\upsilon }}_{2t} ,...,\hat{{\upsilon }}_{Nt} )^{{\prime }}\) and \(p=\max \left( {p_i } \right) \). Moreover, we construct \(N\times T [\tilde{\upsilon }_{it} ]\) matrix from these residuals. We select the residuals column randomly with replacement at a time to preserve the cross-sectional structure of the errors. The bootstrap residuals are denoted by \(\tilde{\upsilon }_{it}^*\) where \(T=1,2,\ldots ,T^{*}\) and \(T^{*}=T+M\).

Step 3 We construct bootstrap samples \(\varepsilon _{it}^*\) recursively from

$$\begin{aligned} \varepsilon _{it}^*=\sum _{j=1}^{p_i } {\hat{{\delta }}_{ij} } \varepsilon _{i,t-j}^*+\tilde{\upsilon }_{it}^*\end{aligned}$$
(4.3)

where \(\hat{{\delta }}_{ij}\) are the estimates obtained from Step 1. Notice that we generate \(T+M\) values of \(\Delta y_{it}^*\) and discard the first M values to ensure stationarity.

Step 4 We produce bootstrap samples \(y_{it}^*\) from

$$\begin{aligned} y_{it}^*=y_{i,t-1}^*+\varepsilon _{it}^*\end{aligned}$$
(4.4)

Step 5 Using bootstrap samples \(y_{it}^*\), the bootstrap test statistics (e.g. \(\bar{{F}}_{_{1p,\alpha }}^*\) and \(\bar{{t}}_{_{1p,\alpha }}^*\)) are computed for each relevant model by employing a two-step procedure, which is presented in the previous section.

We repeat steps (2) to (5) many times to find bootstrap empirical distribution of these panel test statistics and obtain the bootstrap critical values by selecting the appropriate percentiles of the sampling distributions.

5 Finite sample performance

In this section, we investigate the Monte Carlo performance of the proposed panel test statistics under the assumption of cross sectional dependence. Before giving the Monte Carlo results for the proposed test statistics, it is important to demonstrate that the presence of cross-sectional dependence will distort the size of the tests. In this case, the data-generating process (DGP) with one factor is given as follows:

$$\begin{aligned} \begin{array}{ll} y_{it} =y_{i,t-1} +\varepsilon _{it}&{} y_{io} =0 \\ \varepsilon _{it} =\lambda _i f_t +u_{it}&{} \end{array} \end{aligned}$$
(5.1)

where \(f_t \sim i.i.d. N(0,1)\) and \(u_{it} \sim i.i.d.N(0,\sigma _i^2)\) with \(\sigma _i^2 \sim i.i.d.U[0.5,1.5]\). \(f_t \) and \(u_{it}\) are independent of each other. We distinguish between two degrees of cross-sectional dependence, so that we generate \(\lambda _i \sim i.i.d.U[0,0.2]\) for “low cross-sectional dependence”, and \(\lambda _i \sim i.i.d.U[1,4]\) for “high cross-sectional dependence”Footnote 6. We use 2000 replications to compute the size distortions of the tests at the 5 % nominal levelFootnote 7.

Tables 2 and 3 present the Monte Carlo results in the presence of high and low cross-sectional dependence, respectively. As to be expected, the presence of high cross sectional dependence given Table 2 leads to size distortions for both test statistics when N and T are large. Even if T is small for all values of N, both test statistics show size distortions. In general, their size distortions rise monotonically with N and T for all models. Otherwise, in case of low cross sectional dependence (Table 3) for all values of N and T, and for all models both test statistics have the correct empirical size.

Table 2 The exact size of \(\bar{{F}}_{1p,\alpha }\) and \(\bar{{t}}_{1p,\alpha }\) in case of DGP with factor (High cross section dependence)
Table 3 The exact size of \(\bar{{F}}_{1p,\alpha }\) and \(\bar{{t}}_{1p,\alpha }\) in case of DGP with factor (Low cross section dependence)

After illustrating the size distortions problem, next we investigate the small sample properties of our tests under cross-sectional dependence using Monte Carlo simulations with Sieve bootstrap. We apply the Sieve bootstrap method to obtain critical values of the test statistics. To this end, we report experiments based on 1000 replications with 199 bootstrap replications. The experiments were carried out for all the pair of \(N=5,10\), and 25 and \(T=50\), 100, 150 and 200. First we report the empirical size of the tests for all models. To this end, we employ the following DGP for the empirical size under the null (2.5a)

$$\begin{aligned} \begin{array}{ll} y_{it} =y_{i,t-1} +\varepsilon _{it}&{} y_{io} =0 \\ \varepsilon _{it} =\lambda _i f_t +u_{it}&{} \\ u_{it} =\rho _i u_{i,t-1} +v_{it}&{} \end{array} \end{aligned}$$
(5.2)

where \(f_t \sim i.i.d. N(0,1)\) and \(v_{it} \sim i.i.d.N(0,\sigma _i^2)\) with \(\sigma _i^2 \sim i.i.d.U[0.5,1.5]\). Also, we set as \(\lambda _i \sim i.i.d.U[1,4]\) Footnote 8. To examine the impact of residual serial correlation, we generate \(\rho _i \sim i.i.d.U\left[ {-0.4,0.4} \right] \) which will then be used in the generation of the errors, \(v_{it}\).

Table 4 presents the empirical size of our tests for all models. We see that the empirical size of both test statistics is close to the nominal size for almost all T and N.

Table 4 The exact size of \(\bar{{F}}_{1p,\alpha }\) and \(\bar{{t}}_{1p,\alpha }\) in case of cross-sectional dependence

For brevity, we consider the empirical size-adjusted powers of the \(\bar{{t}}_{1p,\alpha } \) and \(\bar{{F}}_{1p,\alpha }\). By way of comparison, we also report size-adjusted powers of IPS test \((\bar{{t}}_{ IPS})\) which includes intercept plus trend termFootnote 9 and the PNLV test \((\bar{{t}}_{LNV})\). To evaluate the size-adjusted power of the all test statistics, we employ the following panel ST-TAR(1) DGP under heterogeneous alternative (2.5b).

$$\begin{aligned} \begin{aligned}&y_{it}=\alpha _{i1} +\alpha _{i2} S_{it} \left( {\gamma _i,\tau _i} \right) +\varepsilon _{it}\\&\Delta \varepsilon _{it}=\rho _{i1} I_{it} \varepsilon _{i,t-1} +\rho _{i2} (1-I_{it})\varepsilon _{i,t-1} +u_{it}\\&u_{it}=\lambda _i f_t +v_{it} \\&v_{it}=\rho _i v_{i,t-1} +\eta _{it} \end{aligned} \end{aligned}$$
(5.3)

where \(\eta _{it}\sim i.i.d.N(0,\sigma _i^2)\) with \(\sigma _i^2\sim i.i.d.U[0.5,1.5]\). \(I_{it}\) is defined as before. Factor loadings \((\lambda _i)\) and \(f_t\) are generated as Eq. (5.2). We consider two different range of the smooth transition parameters, namely \(\gamma _i \sim i.i.d.U\left[ {0.5,1.5} \right] \) and \(\gamma _i\sim i.i.d.U\left[ {3,5} \right] \) to represent the slow and fast transitions, respectively. All other parameters are assigned as follows: \(\rho _{i1} \sim i.i.d.U\left[ {-0.9,-0.7} \right] \), \(\rho _{i2} \sim i.i.d.U\left[ {-0.2,0} \right] \), \(\alpha _{i1} =1\), \(\alpha _{i2} \in \left\{ {2, 10} \right\} \), \(\tau _i =0.5\). In size-adjusted power experiments, under the alternative hypothesis, we use the DGP given in Eq. (5.2) for \(i=1,\ldots ,\left[ {N/2} \right] \) and the one given in Eq. (5.3) for \(i=\left[ {N/2} \right] +1,\ldots ,N\) with \(\left[ {N/2} \right] \) being the nearest integer value of N / 2.

The results from Table 5 clearly indicate that the size-adjusted power of the new test statistics \(\bar{{t}}_{1p,\alpha }\) and \(\bar{{F}}_{1p,\alpha }\) increase quite significantly as the number of observations (T) increase for a given panel size (N). The same result holds as the panel size increases (N) for a given number of observations (T). In addition, the increase in the value of \(\gamma _i\) from slow to fast, other things being equal causes the power of the newly proposed tests to decrease. This outcome is not unexpected because as the value of \(\gamma _i\) increases the transition resembles more an abrupt structural change rather than a smooth one, and this leads to power losses in the LNV based tests like ours. The same pattern is observed in the empirical powers of LNV test itself. However, an increase in the time series dimension (T) helps to offset this power loss. As shown in the first two panels of Table 7 for a small break \((\alpha _{i2}=2)\), irrespective of the value of the parameter \(\gamma _i\), the power of IPS test \(\bar{{t}}_{IPS}\) exceeds that of the newly proposed tests \(\bar{{t}}_{1p,\alpha }\), \(\bar{{F}}_{1p,\alpha }\) and the OHS test \(\bar{{t}}_{LNV}\). This may stem from the fact that the calculation of \(\bar{{t}}_{1p,\alpha }\), \(\bar{{F}}_{1p,\alpha }\) and \(\bar{{t}}_{ LNV}\) requires the estimation of more parameters than does the IPS test. Thus, the loss of power from having to do so seems to be offset by the gain in power from using the correctly specified data-generating process. Although the empirical power of the new tests exceed that of the OHS test for the small break case, the power of the OHS test still remains close to power of the new tests with the difference being approximately 11 % at most. On the other hand, for a large break \((\alpha _{i2} =10)\), not only the power of the IPS test collapses but also the power of the newly proposed tests \(\bar{{t}}_{1p,\alpha }\) and \(\bar{{F}}_{1p,\alpha }\) still exceeds that of the OHS test. Thus, ignoring large breaks and nonlinear adjustment when both are present in the panel data series produces large power losses when testing for a unit rootFootnote 10.

Table 5 The empirical size-adjusted power comparison of alternative tests under cross-sectional dependence

6 Empirical results

In this study, to test the null hypothesis of a unit root in the real interest rate series (RIRs) we have employed quarterly data that include observations on the nominal interest rate and the consumer price index over the period 1979:Q2—2011:Q2 (i.e. 129 data points) for seventeen OECD member countries.Footnote 11 The seventeen OECD countries include Australia, Austria, Belgium, Canada, France, Germany, Italy, Japan, Korea, the Netherlands, New Zealand, Norway, Spain, Sweden, Switzerland, the United Kingdom (UK), and United States (US).

Since the seminal work of Rose (1988), many studies in the literature have recoursed to the ex post rate for investigating the stationarity properties of the RIRs. While the ex-ante real interest rate is the nominal interest rate minus the expected inflation rate, the ex post real interest rate is found by subtracting the realized inflation from the nominal rate. Economic decisions by nature depend on the ex-ante real interest rate; however, the ex-ante real rates are in fact unobservable in contrast to the ex post rates. To prevent the shortcomings involved in specifying the correct way, the agents form their expectations or using survey data that are not available for many countries over a long time span, we will stick with this approach and use the ex post real rate. An alternative solution would be to construct inflation forecasts using econometric models. However, as argued in Neely and Rapach (2008), such models fail to include all the necessary variables that agents use to forecast inflation and do not change as the structure of the economy changesFootnote 12. Moreover, under the assumption that expectations are formed rationally across countries, the ex post (actual) real interest rate will differ from the ex-ante real interest rate by a white noise random error which is orthogonal to past information. Therefore, as argued in Moosa and Bahatti (1996), under rational expectations if the ex post real interest rate is found to be stationary then this may be interpreted as the ex-ante real interest rate being constant over time. Even without the assumption of rational expectations, the difference between the two rates is a stationary nonpersistent process (Peláez 1995) that is believed to not have an intense effect on the results (Kapetanios et al. 2003).

Following Pesaran et al. (2013) the ex-post real interest rate for each country j is calculated as \(r_{jt} =0.25\ln (1+R_{jt}/100)-(p_{jt} -p_{j,t-1})\), where \(R_{jt}\) is the nominal rate of interest per annum in per cent in country j at time t, \(p_{jt} =\ln ({ CPI}_{jt})\) and \({ CPI}_{jt}\) is the consumer price index of country j at time t.

A vast majority of the studies in the literature have utilized the nominal short-term interest rate (i.e., the three month treasury-bill rate, money market rate, deposit rate or the discount rate) to calculate and test for the stationarity of the RIRs (Kapetanios et al. 2003; Neely and Rapach 2008; Pesaran et al. 2013; Kapetanios and Shin 2011). However, the nominal long-term interest rates not only are more relevant to saving and investment decisions than the short-term rates (Tsong and Lee 2012) but also more directly reflect the financing costs for capital investments (Obstfeld and Taylor 2002). Therefore, to provide a more comprehensive investigation of the stochastic behavior of the RIRs, we utilized both the short-term and long-term nominal interest rates of the countries included in the study. The long-term government bond yield (typically the yield on ten year government bonds) is used as a proxy for the long-term nominal interest rate. The short-term (three month) nominal interest rates are the treasury-bill rates for Canada, Sweden, UK and US; money market rates for Australia, Austria, Belgium, France, Germany, Italy, Japan, Korea, the Netherlands, Norway, Spain and Switzerland; and the discount rate for New Zealand.

We have carried out a preliminary analysis to determine the stochastic and deterministic features of our real interest rate series before demonstrating the results of our panel unit root tests. The results of this analysis are given in panel A of Table 6 for both the short-term and long-term RIRs. In panel A \(y_{it}\), \(\gamma _i\) and \(\tau _i\) denote the original real interest rate series (i.e., the \(r_{jt}\)’s), the estimate of the transition speed parameter, and the estimate of threshold parameter, respectively. To be comparable we used the same notation given in Eq. (2.1). In the third and seventh columns of panel A, we present the estimates of the threshold parameter \(c_i\) obtained from the estimation of the TAR model with de-trended series \(\hat{{\varepsilon }}_{it}\) given in (2.1) and (2.4) using Chan’s (1993) methodologyFootnote 13. Besides, we have also tested the symmetry of the two regime TAR parameters \(\rho _1\) and \(\rho _2\) given in Eq. (2.4). The results of these asymmetry tests are given in the fourth and eight columns of panel A for the short-term and long-term RIRs, respectively.

Table 6 Pre-test of deterministic and stochastic features of samples

As it can be seen from the second column of Table 6, the estimates of the threshold parameters \(\tau _i\) are clustered around 0.5. This shows us that the structural breaks in the short-term RIRs take place at the middle of the samples. Thus, the sample countries short-term RIRs face a structural break at the middle of the 1990s. Moreover, we can also classify the break dates in the short-term RIRs of these countries with respect to the estimates of the \(\gamma _i\) (first column in panel A). Out of 17 countries included in the study, 8 of them have sharp transition (with estimates \(\hat{{\gamma }}_i >1.0\)), 5 of them have smooth transition (with estimates \(0.1<\hat{{\gamma }}_i <1.0\)) and 2 of them have highly smooth transitions \((\hat{{\gamma }}_i<0.1)\) Footnote 14. Therefore, the parameter values of \(\gamma _i\) and \(\tau _i\) obtained are consistent with the power simulations.

In addition, we have investigated the de-trended series \(\hat{{\varepsilon }}_{it}\) considering the threshold parameter and the tabulated results obtained in the third column. The results show us that the threshold parameters are located around \(\hat{{c}}_i=0.00\). This finding particularly justifies the usage of Enders and Granger’s (1998) TAR unit root testFootnote 15.

We also test the asymmetry of the parameters in the fourth column of Table 6 for the short-term RIRs. The test results exhibit a symmetric pattern in the TAR parameters. However, we can still order the TAR parameters of these countries from being the most symmetric to most asymmetric. Accordingly Spain, Japan, and Switzerland seem to have the most symmetric TAR estimation results, whereas Italy, Australia and the Netherlands are the most asymmetric ones. Therefore, we expect to find the short-term RIRs of Italy, Australia and the Netherlands’ to be stationary around a nonlinear trend with our proposed panel unit root test in SPSM procedure. On the other hand, these countries are not expected to be found stationary using the PLNV testFootnote 16. Fortunately, our proposed test has power against symmetric cases as well. Therefore, the countries which are found symmetric in the fourth column should be found as stationary with the newly proposed tests as well, since the newly proposed test nests the PLNV test. The same results are obtained in case of the long-term RIRs.

In panel B of Table 6, cross-sectional dependence of the short-term and long-term RIRs are tested using the CD tests proposed by Pesaran (2004). All the CD test results indicate high cross sectional dependence between the RIRs of the countries in the sample. Therefore, when testing for the integration properties of the RIRs, the cross-sectionally dependent versions of our proposed tests (bootstrap) should be used.

To test whether the short-term and long-term RIRs exhibit mean reversion, in addition to our newly proposed combined tests \(\bar{{t}}_{1p,\alpha }\) and \(\bar{{F}}_{1p,\alpha }\), we have also applied two alternative panel unit root tests to the RIRs of the OECD countries for comparison. The first of these tests is the conventional panel unit root test of IPS. However, an aforementioned weakness of the standard IPS test is that it ignores the possibility of breaks. Since the literature has documented that the RIRs may display structural breaks, the panel LNV test was applied as a second test to the RIRs that allows for both smooth breaks and cross sectional dependence. However, both the IPS and the panel LNV tests are based on the assumption that the RIRs follow a linear adjustment process, which may not be true according to the existing empirical evidence. Thus, to consider the presence of structural breaks and nonlinearity simultaneously, we further apply our proposed tests to the RIRs that also uses additional information from the cross sectional units to achieve a power gain.

The results of the three aforementioned tests are given in Tables 7 and 8 for the short-term and long-term RIRs, respectively. All of the three tests are applied on a reducing dataset using the SPSM procedureFootnote 17.

Table 7 Panel unit root test results for short-term real interest rates
Table 8 Panel unit root test results for long-term real interest rates

Evidence from panel A of Table 7 shows that the IPS test rejects the null hypothesis of a unit root in the short-term RIRs for the entire panel and in only 6 countries out of 17. The 11 short-term RIRs that are found nonstationary according to the linear panel unit root test belongs to USA, Austria, Japan, Germany, France, Spain, Korea, UK, Belgium, the Netherlands, and Italy. On the other hand, when potential breaks in the RIRs are considered and the panel unit root test with breaks (the panel LNV test) is applied to the panel, the results again point to the stationarity of the panel around a broken trend, but this time with 12 out of 17 countries behaving like an I(0) process (panel B). Thus, accounting for structural breaks within a panel context has considerably increased the number of countries for which the unit root null is rejected for the short-term RIRs, namely the RIRs of only 5 countries (Austria, Spain, Australia, the Netherlands, and Italy) are found to be nonstationary.

At this point, two main conclusions can be made with regard to our findings in Table 7. First, the existence of unit root in the short-term RIRs cannot be conclusively refuted for more than 50 %—about 65 %—of our sample when the linear panel unit root test without breaks is applied. Second, by considering the presence of structural breaks in the RIRs and further applying the panel LNV test we can uncover mean reverting dynamics in the RIRs for about 70 % of the sample.

Altogether our findings so far highlight the importance of taking structural breaks into account when studying the stochastic properties of the RIRs. However, even if a possible structural break in the short-term RIRs was taken into account, the test did not yield very strong evidence that the short-term RIRs behave like an I(0) process. The panel LNV test still cannot reject the unit root null in 30 % of the countries, though it provides additional 6 rejections of unit root compared to the results of the IPS test. On the whole, neither the linear panel model nor the panel model with smooth breaks can overwhelmingly support the long-run mean reversion of the short-term RIRs for our sample. Hence, the more powerful tests proposed in this study that exploits the possibility of nonlinearities and breaks in the RIRs simultaneously will be implemented.

Panel C of Table 7 contains the results of our proposed tests on the short-term RIRs. By applying the same SPSM outlined above, the null hypothesis of a unit root in the short-term RIRs was rejected for all the countries included in the study. Thus, we have obtained conclusive evidence in favor of the long-run mean reversion of the short-term RIRs for all the OECD countries included in the study. Ability to obtain overwhelming evidence of mean reversion seems to suggest that nonlinearity and breaks in the short-term RIRs co-exist in all the countries we studied.

Table 8 contains the results on the long-term RIRs. The results are similar to those obtained for the short-term RIRs. We again started by conducting a standard IPS panel unit root test with bootstrap and the SPSM methodology (Panel A in Table 7) and concluded that for a vast majority of the countries the unit root null hypothesis cannot be rejected (only in 6 out of 17 cases, which corresponds to about 35 % of rejections). Afterward, to consider the presence of structural breaks in the long-term RIRs, we applied the panel smooth break unit root test with again bootstrap and the SPSM methodology (Panel B in Table 7), but even in this case we were able to provide only some support for the long-run mean reversion of the long-term RIRs (about 65 % of rejections of the unit root null hypothesis). This finding probably suggests that allowing for structural breaks only offers a partial solution to the extensive non-rejection of the unit root null again in the long-term RIRs.

On the other hand, when both nonlinearity and smooth breaks in the long-term RIRs are simultaneously considered while tackling cross-sectional dependence, the null hypothesis of a unit root in the RIRs is considerably rejected in Panel C in sharp contrast with the results in Panels A and B. Using our proposed tests together with the SPSM methodology outlined above, we find that 16 out of 17 countries exhibit mean-reverting dynamics in their long-term RIRs. Particularly, the unit root null is not rejected only for France. Comparing the results in Panels B and C, we find that an additional five rejections of the unit root dynamics can be found in our sample, including the Netherlands, UK, Sweden, Korea, and Italy. More rejections for the long-term RIRs may again be attributed to the improved power of our proposed tests.

After determining the stochastic properties of the RIR series, we next employed a heterogeneous Panel ST-TAR estimation. As in Sollis (2004), this estimation is carried out in two steps. First the deterministic component is estimated (the smooth trend is removed) and then the panel TAR estimation is carried out using Eq. (2.4) and thus the residuals obtained from the first step. For the short-term RIRs, we have found out that the group mean estimator of the speed of adjustment coefficient is -0.549 \((\bar{{\hat{{\rho }}}}_1)\) if \(\varepsilon _{i,t-1} \ge 0\), which is also classified as the high regime, and -0.572 \((\bar{{\hat{{\rho }}}}_2)\) for the low regime. Both of the regime parameters are significant at 1 % significance level. In case of the long-term RIRs the estimates obtained are \(-\)0.549 and \(-\)0.647 for the high regime and low regime, respectively. Thus, the RIRs are found to display differential speeds of adjustment with the positive phase of the RIR sequence being more persistent than the negative phase, which implies faster adjustment for negative RIRs than for positive ones.

7 Conclusion

Real interest rate remains one of the crucial variables in the macroeconomics and finance literature. Although many important theoretical models require the real interest rates to be stationary, the empirical evidence accumulated so far has not been able to provide conclusive evidence on the mean reverting dynamics of this variable. Thus, serious questions have been raised about the validity of many important macroeconomic relationships, models and hypotheses involving stationary real interest rates. In this study we show that a potential solution to this problem is using a test that is specifically designed to fit the stochastic features of the real interest series. A researcher should consider all factors pertinent to a situation when conducting an analysis. Likewise, to investigate the stochastic properties of the real interest rates one should also utilize a test that allows for the possibility of nonlinearity and structural breaks simultaneously in a panel setting that induces additional power gains with the use of cross-sectional information. Accounting for cross-sectional dependence is especially important in analyzing macroeconomic variables with close cross country links like the real interest rates.

The empirical power of our newly proposed unit root tests are compared with the powers of two alternative panel unit root tests—the IPS test ignoring both nonlinearity and structural breaks and the PLNV test considering only structural breaks—using Monte Carlo simulation exercises. Under the hypothesis that the data are generated by a globally asymmetric stationary ST-TAR process our tests are shown to have better small sample properties than the aforementioned tests.

When the newly proposed panel unit root tests are applied to the short-term and long-term real interest rates of the 17 OECD countries we were able to uncover overwhelming empirical support in favor of the mean reversion in the real interest rates. Particularly, the unit root null is rejected for all OECD countries when the new panel unit root tests are applied to the short-term RIRs. Our findings imply that for all OECD countries included in our study, a one-time shock to the short-term real interest rate will have no permanent effect on the economy. In case of the long-term RIRs, the unit root null is not rejected only in France, which implies that a one-time shock to the long-term real interest rate in France would be permanent and could not be eliminated with the passage of time.

Overall, the empirical results verify mean reversion in the ex post OECD real interest rates. However, the main contribution of this paper is twofold. First we show in this paper that the mean (or equilibrium value) to which the ex post real interest rates revert in the long-run, is not a constant but changes smoothly over time. However; as argued in Kesriyeli et al. (2006) while most studies have allowed for nonlinear dynamics in the interest rates, they have assumed that the interest rate dynamics was constant over time. Researchers have also typically assumed that the interest rate policy was constant in the period of relatively low interest rates since 1984. The authors argue that such assumptions may lead to significant misspecification. We show that this misspecification carries over to our panel of OECD countries. The long-run equilibrium real interest rate may be altered by forces caused by important structural events. In the general equilibrium growth models, these forces include permanent changes in the exogenous rate of time preference, risk aversion, or long-run growth rate of technology. In other more complicated macroeconomic models, permanent changes in public purchases and their financing or changes in the steady-state money growth rate also induce changes in the long-run equilibrium or steady-state real interest rate.

Besides the aforementioned long-run behavior of the real rates, the second contribution of this paper is to show that in the short-run the real interest rate adjusts asymmetrically toward this endogenously changing equilibrium real interest rate. The estimation results demonstrate that asymmetric behavior is detected between the regimes of positive and negative real interest rates. More specifically, the mean reverting process that governs the real interest rate adjustment to its long-run equilibrium value depends on the sign of the disequilibrium or shock. This implies that the real interest rate adjusts differently or asymmetrically to positive and negative deviations of the real interest rate from its long-run equilibrium valueFootnote 18. Particularly, for the OECD real interest rates, we have demonstrated that the real interest rates tend to adjust faster to equilibrium for negative real interest rates than for positive ones. This observed sign asymmetry can be rationalized with the existence of asymmetric optimal policy rules that arise in case of inflation and/or output targetingFootnote 19. According to the nonlinear Phillips curve literature positive aggregate demand shocks manifested in the form of positive inflation and/or output gaps (i.e., occurring in the expansionary phase of the business cycle) are more inflationary than negative shocks (i.e. in case of recessions) which are disinflationary. As argued in Schaling (2004), a positive deviation of inflation from its target level causes the real interest rates to be negative in the short-run. The danger of inflation caused in this case becomes much more severe than the linear case. This causes an inflation targeting central bank to increase the nominal rates by more than if the output inflation trade-off was symmetricFootnote 20. Similarly, in case of a negative deviation, the real rates will be positive in the short-run and the accompanying disinflationary pressure will be less than in the linear case. Hence, the central bank will only call for a small cut in the nominal rate. In other words, with nonlinear Phillips curve and inflation targeting the optimal monetary policy rule is asymmetric: Positive deviations from the inflation target and or in the output gap imply higher (absolute values of) real interest rates than negative deviations. Thus, inflation targeting central banks react more aggressively to positive aggregate demand shocks that cause negative real interest rates in the short-run and thereby induce a relatively rapid adjustment in the real rates toward equilibrium when they are in the negative regime. The same result carries over to central banks that give equal weights to both output and inflation stabilization (the case of flexible inflation targeting)Footnote 21.