Introduction

In economics, when a historical perspective is overlooked in a descriptive research design, misleading conclusions may often follow.Footnote 1 By historical perspective, we refer to the understanding of a subject matter in light of its previous stages of intellectual development and successive advancement. We therefore put our arguments against unit roots and cointegration analysis in a historical perspective.Footnote 2

The recognition of a spurious regression problem in the late 1970s contributed decisively to the development of unit roots and cointegration (Granger and Newbold 1974; Hendry 1980, 1986; Granger 1981, 1986). A spurious regression problem arises when a regression analysis indicates a relationship between two or more unrelated time series variables because each variable has either a trend, or is nonstationary, or both. While working with economic time series data, researchers, attempting to account for a spurious regression problem, began pretesting for nonstationarity before estimating regressions. If data were found to be nonstationary on the basis of an appropriate unit root test, researchers would routinely purge the nonstationarity by differencing and then estimating regression equations using only differenced data as a solution for the spurious regression problem. The practice of purging the nonstationarity by differencing would also result in the loss of valuable information from economic theory about the long-run equilibrium properties of the data (Kennedy 2003).

It was in this context that Granger proposed that if two nonstationary variables were I(1) process, the bivariate dynamic relationship between the two nonstationary variables would be misspecified when both of the nonstationary variables were differenced. This class of models has since become a dominant paradigm in empirical economic research and is known in the literature as a cointegrated process (Hamilton 1994).

In our review of the development of the concept of cointegration, we identified that the most important proposition of two integrated series was that if x t  ~ I(d x ), y t  ~ I(d y ) then z t  = bx t  + cy t  ~ I(max(d x , d y )) (Granger 1981). Put simply, this proposition states that the sum of the two time series variables of a different order of integration will always yield another time series variable that will retain the “order of integration property” of the two series that has the higher order of integration. Granger’s proposition was subsequently clarified by Hendry (1986), Engle and Granger (1987), Cuthbertson et al. (1992), and the Royal Swedish Academy of Science (2003). The clarified proposition included both the sum and the difference of the two time series. In this research, we offer evidence against this proposition.

Test of Order of Integration and Data

A time series is said to be strictly stationary if both its marginal and all joint distributions are independent of time. For practical purposes, however, it is the weak stationarity or covariance-stationarity that is more useful. A time series is said to be weakly stationary or covariance-stationary if the first two moments, mean and autocovariances, of a series do not depend on time. A stationary time series that does not need differencing is said to be integrated of order zero and is denoted I(0). A nonstationary time series that becomes stationary after first differencing is said to be integrated of order one and is denoted I(1). In general, a time series that requires differencing d times to become I(0) is said to be integrated of order d and is denoted I(d) (Granger 1986). Since the number d equals the number of unit roots in the characteristic equation for the time series (Said and Dickey 1984, page 599) unit root tests are often used to determine the order of integration of a series. Thus, we describe below the unit root tests that we will use in our analysis.

Consider the following difference equation:

$$ {Y}_t=\rho {Y}_{t-1}+{e}_t $$
(1)

where e t is a white noise error term. When ρ = 1, equation (1) is known as a pure random walk model, a nonstationary stochastic process. This model can be alternatively expressed as:

$$ \varDelta {Y}_t=\delta {Y}_{t-1}+{e}_t $$
(2)

where δ = (ρ - 1) and ∆ is the first-difference operator (e.g. Y t  - Y t - 1).

Testing for the presence of unit roots involves simultaneously determining whether an intercept and/or a time trend belong to the regression model, not including enough of them biases the test in favor of the unit root null hypothesis, whereas including too many of these parameters results in the loss of power (Elder and Kennedy 2001). With an economic time series, the main competing alternative to the presence of a unit root is a deterministic linear time trend. We therefore modify Eq. 2 to include (i) a drift term (Eq. 3), and (ii) a drift term and a linear trend term (Eq. 4):

$$ \varDelta {Y}_t=\alpha +\delta {Y}_{t-1}+{e}_t $$
(3)
$$ \varDelta {Y}_t=\alpha +\beta t+\delta {Y}_{t-1}+{e}_t $$
(4)

Due to its simplicity, we will use the test procedure proposed by Fuller (1976) and Dickey and Fuller (1979, 1981), known as the Dickey-Fuller (DF) test. In the DF test, it is assumed that e t are uncorrelated and independently and identically distributed (iid). If e t are correlated, Said and Dickey (1984) showed that the DF test may still be used provided that the lag length in the autoregression increases with the sample size. The test modified by Said and Dickey (1984) is known as the augmented Dickey-Fuller (ADF) test, and is given by:

$$ \varDelta {Y}_t=\alpha +\beta t+\delta {Y}_{t-1}+{\displaystyle {\sum}_{i=1}^p}{\gamma}_i\varDelta {Y}_{t-i}+{e}_t $$
(5)

The problem with the ADF test is that the choice of the lag length is arbitrary. Although the Akaike or Schwarz information criteria are generally used to decide the lag length, they do not always yield identical results. According to Harris (1992), the size and power properties of the ADF test are improved if a fairly generous lag is used. He proposes a formula, l i  = int{i(n/100)1/4}, to determine the lag length that allows for the order of autogression to grow with sample size. Previously, Schwert (1989) used lag lengths based on the formulas l 4 = int{4(n/100)1/4} and l 12 = int{12(n/100)1/4}. In contrast, Taylor (2000) recommended selecting a lag length using a data-based algorithm or using a much higher significance level (e.g. 0.2 level rather than the traditional 0.05 level) in the general-to-specific rule framework. Note that the critical values for the ADF test differ very little from the DF critical values, so in practice researchers often used the DF critical values. Above all, if the e t are not iid and are correlated, the DF tests do not have the correct asymptotic size (Phillips and Perron 1988).

In our analysis, we do not use the ADF test for reasons discussed in Luitel and Mahar (2016). Furthermore, using nonparametric methods, Phillips (1987) and Phillips and Perron (1988) show that the Philips-Perron (PP) unit root test takes into account serial correlation in the error terms without adding lagged difference terms and thus has an advantage over DF and ADF test procedures. We will use PP test procedure because it will arguably allow for a wide class of time series models in which a unit root may be present.

Even today much controversy in the literature still surrounds the most powerful test for unit roots. To overcome such controversy, Elliott et al. (1996) proposed a modified version of the Dickey-Fuller t test, known as the DF-GLS test, in which the time series was transformed using a generalized least square (GLS) method, rather than OLS, before performing the unit root test. Elliott et al. (1996) argued that the DF-GLS test substantially improved power when an unknown mean or trend was present, and as such, this test dominates all other unit root tests currently in common use such as DF, ADF and PP unit root tests. We will use DF-GLS test in our data analysis.

In order to show the violation of Granger’s proposition, we use two sets of data: (i) Gross Domestic Product (GDP) and some of its components and (ii) unemployment rate and its components. Many macroeconomic variables are either a sum or a difference or some algebraic manipulation of other macroeconomic variables. For example, consider the openness index, an economic variable, defined as the sum of exports and imports divided by GDP \( \left( Openness\ Index=\frac{Exports+ Imports}{GDP}\right) \). We test whether the openness index and all its components are stationary or nonstationary. For a second example of an algebraic manipulation of other macroeconomic variables, we use the unemployment rate, defined as the number of people unemployed and looking for work divided by the labor force (the sum of unemployed plus employed) \( \left( Unemployment\ Rate=\frac{Unemployed}{Labor\ Force}\right) \). We test whether unemployment rate and all its components are stationary or nonstationary. We obtained annual data on GDP, exports and imports for the United States from 1929 to 2012. Similarly, we obtained annual data on number of people unemployed and number of people employed for the United States from 1947 to 2012. We created other desired variables using the appropriate algebraic manipulation as described above. The literature suggests that data in logarithmic form achieves stationarity better than unlogged data. Nonetheless, we report results both for original series and for their natural logarithmic transformation. Table 1 provides the summary statistics of data for original series and Table 2 provides the summary statistics of data for their natural logarithmic transformation.Footnote 3

Table 1 Summary statistics (Original Series)
Table 2 Summary statistics (Natural Logarithm)

Empirical Results

In this section, we report results of the unit root tests used in the determination of the order of integration of a series. We performed five variants of three different unit root tests on each time series: DF test, PP test and Elliot, Rothenberg and Stock DF-GLS test. In each case, the null hypothesis was that the variable under investigation had a unit root, against the alternative that it did not have a unit root. A significant test statistic rejects the null hypothesis that the series has a unit root; thus, significant values indicate the series to be stationary. The results are presented in Table 3 for original series and in Table 4 for their natural logarithmic transformation and the tests were performed sequentially. The top half of Table 3 and Table 4 report the unit root test results for stationarity of the variables in levels. The bottom half of Table 3 and Table 4 report the results for the various tests of unit root on first differences of the variables.

Table 3 Unit root test results of order of integration (Original Series)
Table 4 Unit root test results of order of integration (Natural Logarithm)

Consider the test results for an openness index and its components. The results in Table 3 indicate that each component in the numerator, exports and imports, is individually I(d = 1), the variable in the denominator, GDP, is I(d = 1), and the resultant openness index is also I(d = 1). Denote exports + imports as x 1t , GDP as y 1t and the openness index as z 1t . The expression of openness index can be written as \( {z}_{1t}=\frac{x_{1t}}{y_{1t}} \). Taking the natural log of both sides, the expression becomes ln z 1t  =  ln x 1t  -  ln y 1t . The results in Table 4 indicate that lnx 1t  ~ I(1), lny 1t  ~ I(1) and ln z 1t  ~ I(1). This translates into I(1) ± I(1) = I(1), which appears to not contradict the Granger proposition. Although not reported here, we found a structural break in GDP, exports and imports in 1997. For a further discussion of structural break in US GDP in 1997, see Luitel and Mahar (2015a). Note that the unit root property of the data is inconsistent with the structural break, which implies that the parameters governing the data generating process have changed. Because the current literature treats the unit root property of the data as if not affected by a structural break, the finding of unit root itself becomes questionable.

Next, let us consider the unemployment rate and its components. The results in Table 3 indicate the number of people unemployed, the number of people employed and the labor force. Each individual series is I(d = 1) but the unemployment rate is I(d = 0). Our finding regarding unemployment rate is confirmatory to the finding of Nelson and Plosser (1982), who concluded “that the series (unemployment rate) is well described as a stationary process” (p. 152). Furthermore, we denote unemployed as x 2t , labor force as y 2t and unemployment rate as z 2t . The expression of unemployment rate can be written as \( {z}_{2t}=\frac{x_{2t}}{y_{2t}} \). Taking the natural log of both sides, the expression becomes lnz 2t  =  ln x 2t  -  ln y 2t . The results in Table 4 indicate that lnx 2t  ~ I(1), lny 2t  ~ I(1) but lnz 2t  ~ I(0). This translates into I(1) ± I(1) = I(0), that contradicts the Granger proposition (i.e. the sum or difference between two time series variables of different order of integration will always yield another time series variable that will retain the “order of integration property” of the two series that has the higher order of integration).

Furthermore, in a peer review process, a reviewer made comments that there are many different unit root tests and cointegration tests, and that they do not always give the same answer is unfortunate. The original DF test, which allows for no lags in the autoregression, would be an awful test to use for almost all macroeconomic data that displays serial correlation in first differences. Also, the PP testing framework has been shown in multiple Monte Carlo studies to have poor size and power properties. Comments such as these give the impression that the DF-GLS test for unit root was to be preferred over DF, ADF or PP tests. However, our results indicate that the DF-GLS test is even less reliable.Footnote 4 Consider the DF-GLS test results for the first difference of number of people employed, the first difference of number of people unemployed and the first difference of labor force as reported in Table 3 (Model 5). The results in the table indicate that the first difference of the number of people employed ~I(0), the first difference of number of people unemployed ~I(0), but the first difference of labor force ~I(1).Footnote 5 This translates into I(0) ± I(0) = I(1), which is clearly a violation of the Granger proposition both in meaning and in intent.

In addition to what is reported above, Table 5 presents a holistic view of violation of the Granger proposition. As noted in the introduction, Table 5 in essence summarizes the clarifications of the Granger proposition provided by Hendry (1986), Engle and Granger (1987), Cuthbertson et al. (1992), and Royal Swedish Academy of Science (2003). For ease of exposition, we only consider the values d = 0 and d = 1.

Table 5 Algebra of two integrated series. Based on proposition if x t  ~ I(d x ), y t  ~ I(d y ), then z t  = bx t  + cy t  ~ I(max(d x , d y )) (Granger 1981, page 126).

Note that the above outcomes are collectively exhaustive. Not surprisingly, allowing researchers to selectively pick one unit root test over another means that any outcome becomes possible. Thus, the finding of cointegration between two variables cannot be a special case as implied or emphasized in the literature. To put it in a nutshell, there is no uniqueness in the original Granger proposition.

Summary and Conclusion

The method of cointegration analysis for modeling nonstationary economic time series variables has become a dominant paradigm in empirical economic research. In our review of the development of the concept of cointegration, we identified that the most important proposition of two integrated series was that if x t  ~ I(d x ), y t  ~ I(d y ) then z t  = bx t  + cy t  ~ I(max(d x , d y )) (Granger 1981, page 126). In this research, we show this proposition, not highlighted previously, to be false for two reasons: First, it is not necessarily true that the sum or difference between two time series variables of different order of integration will always yield another time series variable nor will it retain the “order of integration property” of the two series that has the higher order of integration. Second, as shown in Table 5, there was no uniqueness in the proposition to begin with. Thus, the finding of cointegration between two variables does not reveal any special relationship as implied or emphasized in the extant literature. The importance of the violation of Granger’s key proposition is further discussed in Luitel and Mahar (2015b).

In conclusion, the failure of cointegration analysis, particularly, in the context of financial econometrics, was previously discussed in Moosa (2011). Due to the existence of disconnection between what the existing literature predicts and what actually goes on in the economy, we reassessed the major proposition of the unit root and cointegration literature in an effort to explain apparently unbelievable observed real world phenomena, and contributed to the existing literature by bringing knowledge of its failure to the attention of a general audience. Arguably, one of the most powerful uses of cointegration is in forecasting. On the one hand, vector error correction (VEC) models have been shown to outperform vector auto regression (VAR) models in many forecasting applications. Proponents of the unit root and cointegration argue this fact to be prima facie evidence of the value of the concept of cointegration. On the other hand, in the last decade or so, the nowcasting method has been on the rise in the economic literature and has gone far beyond the point of just being a “special analysis” technique. To consolidate such diverse economic thinking, our research indicates a need for a comparative analysis of forecasting, backcasting and nowcasting econometrics methods. We believe this is a fruitful area that needs more investigation and we leave it for future research endeavors.