1 Introduction

A current global problem is the role of the human contribution on climate changes. In fact, greenhouse gases seem to have a relevant impact on global surface temperature’s rise. Global warming may be also caused by natural forcing as solar or volcanic activities. Thus, the relationships between global surface temperature and natural or anthropogenic forcings have been the most important subjects of research in the last decades. The studies are often performed by means of Granger causality (GC) that is about an incremental forecasting power. In particular, a variable x Granger cause a variable y if the forecasts of y can be improved by means of x. Sun and Wang (1996) provide evidence of GC from CO2 emission to global temperature. Kaufmann and Stern (1997) suggest that human activity has played a role in the historical record of temperature. A study by Triacca (2001), a follow-up on Kaufmann and Stern, shows other conclusions obtained from the results of Kaufmann and Stern. Triacca (2005), using Toda and Yamamoto’s method (Toda and Yamamoto 1995), evidences that there is no Granger causality between atmospheric concentration of carbon dioxide and global temperature. Elsner (2007) finds GC from global temperature to Atlantic sea surface temperature, confirming the theory that climate change influences hurricane intensity, considering that changes in global temperature are due to anthropogenic forcings. Recently in Kodra et al. (2011), a detectable Granger causality from CO2 to global temperature is explained. The authors also perform an out-of-sample study in order to give more evidence for their in-sample results. The out-of-sample results are based only on descriptive statistical indices, and we do not know if out-of-sample Granger causality is statistically significant. In Attanasio et al. (2012), out-of-sample Granger noncausality tests are performed from anthropogenic or natural forcings to global temperature. The study evidences the rejection of the null hypothesis of Granger noncausality when greenhouse gases are used, whereas Granger causality is not found from natural forcings to global temperature. Other results (Mokhov and Smirnov 2008; Reichel et al. 2001), using in-sample tests, evidence the influence of solar activity on the global surface temperature.

The different results may depend on the models adopted in the analysis, or whether the tests are in-sample or out-of-sample. In particular, in-sample analysis depends on time series characteristics, and there is often the possibility of overfitting. In fact, significant in-sample Granger causality does not guarantee significant out-of-sample predictability. Out-of-sample tests are often recommended because they may stress the true forecasting ability of one variable for another, and the results are more robust in terms of overfitting (Chao et al. 2001; Clark 2004; Gelper and Croux 2007). Differently, Inoue and Kilian (2004) find the results of in-sample tests more credible than those of out-of-sample tests.

This paper is a follow-up of Attanasio et al. (2012). Here, we consider in-sample Granger noncausality test using the same dataset to investigate the possible differences between in-sample or out-of-sample results. The paper is organized as follows: a review of Granger causality is given in the next section. We present the data in Section 3. A preliminary study of the time series is explained in Section 4. Results of Granger causality analysis are discussed in Section 5, and a brief conclusion is drawn in Section 6.

2 Granger causality methodology

Granger (1969) has defined a concept of causality very useful in econometric research. Let \(I_{t}^{x,y}=\{x_{0},\ldots,\) x t ,y 0,...,y t } be the information set. We will say that the variable x Granger cause the variable y if the mean square error of the best predictor of y t + 1 based on \(I_{t}^{x,y}\) is smaller than the mean square error of the best predictor of y t + 1 that uses \(I_{t}^{y}=I_{t}^{x,y}-\{x_{s},s\leq t\}\). Before testing for Granger causality, it is important to establish the properties of the time series involved in the analysis, because the use of nonstationary time series can involve spurious causality results (Stock and Watson 1989; Sims et al. 1990). In this paper, the lag-augmented Wald test is applied (Toda and Yamamoto 1995) that is robust to the integration and possible cointegration properties of the variables. In fact, it is applicable if the variables are stationary, integrated or cointegrated of an arbitrary order. The procedure requires only the knowledge of the maximum order of integration of the series.

We consider two nested models, the unrestricted regression model

$$\label{eq1} y_{t}=\mu_{1}+\sum\limits_{j=1}^{k}\alpha_{j}y_{t-j}+\sum\limits_{j=1}^{k}\psi_{j}x_{t-j}+\varepsilon_{t} $$
(1)

and the restricted model

$$\label{eq2} y_{t}=\mu_{2}+\sum\limits_{j=1}^{k}\beta_{j}y_{t-j}+u_{t} $$
(2)

where μ 1 and μ 2 are constants, α, ψ, and β are the coefficients of the models, ε t and u t are univariate white noise, and k is the order of the models. If \(\left(\psi_{1},\ldots,\psi_{k}\right)\), in Eq. 1, is equal to the zero vector, then x does not Granger-cause y. The null hypothesis of noncausality corresponds to

$$\label{ipoGC} H_{0}: \psi_{1}=\psi_{2}=\ldots=\psi_{k}=0\ \mbox{.} $$
(3)

Estimating the parameters of the models (1) and (2) by ordinary least squares (OLS), the significance of this restriction is evaluated by the test statistics F,

$$\label{eqtest} F=\frac{\left(\textrm{RSS}_{\textrm{r}}-\textrm{RSS}_{\textrm{u}}\right)/q}{\textrm{RSS}_{\textrm{u}}/(T-m)} $$
(4)

where \(\textrm{RSS}_{{\rm r}}\) is the residuals sum of square of restricted model (2), \(\textrm{RSS}_{{\rm u}}\) is the residuals sum of square of unrestricted model (1), q is the number of coefficients restricted to zero (q = k), m is the number of coefficients of the unrestricted model (m = 2k + 1), and T is the number of observations. Under the assumption that the time series are stationary, the test statistics in Eq. 4 asymptotically has a F(q, T − m) distribution under H 0. A significant statistics implies that the null hypothesis of noncausality is rejected.

When the time series are not stationary, the standard asymptotic theory cannot be employed because the test statistics F in Eq. 4 has a nonstandard distribution (Sims et al. 1990). Toda and Yamamoto (1995) propose to overfit the unrestricted model (1) by d extra lags, where d is the maximum order of integration of x t and y t , in order to test the null hypothesis (3). In fact, if the true data generation process is Eq. 1, then the model

$$\begin{array}{rll} y_{t}&=&\mu_{1}+\sum\limits_{j=1}^{k}\alpha_{j}y_{t-j}+\sum\limits_{j=1}^{k}\psi_{j}x_{t-j}\\ &&+\sum\limits_{j=k+1}^{k+d}\left(\alpha_{j}y_{t-j}+\psi_{j}x_{t-j}\right)+\varepsilon_{t} \end{array}$$

with α k + 1 = ... = α k + d  = ψ k + 1 = ... = ψ k + d  = 0 describes the data-generating process equally well. In particular, we should point out that the parameters of the extra lags, \(\left\{\psi_{j}\right\}_{j=k+1}^{k+d}\), are unrestricted in testing for Granger causality from x to y because they are zero by assumption. The null hypothesis of noncausality is always as in Eq. 3. Therefore, this hypothesis can be tested considering the unrestricted model

$$\label{eq3} y_{t}=\mu_{1}+\sum\limits_{j=1}^{k+d}\alpha_{j}y_{t-j}+\sum\limits_{j=1}^{k+d}\psi_{j}x_{t-j}+\varepsilon_{t} $$
(5)

and the restricted model

$$\label{eq4} y_{t}=\mu_{2}+\sum\limits_{j=1}^{k+d}\beta_{j}y_{t-j}+\sum\limits_{j=k+1}^{k+d}\gamma_{j}x_{t-j}+u_{t} \ \mbox{.} $$
(6)

The coefficients of the two models are estimated by means of OLS, and the test statistics became

$$\label{testnew} F=\frac{\left(\textrm{RSS}_{\textrm{r}}-\textrm{RSS}_{\textrm{u}}\right)/k}{\textrm{RSS}_{\textrm{u}}/(T-(2k+2d+1))} $$
(7)

where here, \(\textrm{RSS}_{{\rm r}}\) is the residuals sum of square of restricted model (6) and \(\textrm{RSS}_{{\rm u}}\) is the residuals sum of square of unrestricted model (5). Toda and Yamamoto (1995) show the test statistics (Eq. 7) asymptotically has a standard distribution F(k,T − 2k − 2d − 1). So the function of the extra coefficients is only to guarantee the use of asymptotical distribution theory.

We know the lag-augmented Wald test can suffer from size distortion and low power especially for small samples. Then the bootstrap method is often used to determine the robustness of Granger causality results (Giles 1997; Hacker and Hatemi-J 2006; Hatemi-J and Shukur 2002; Lukasz 2010; Mantalos 2000; Mavrotas and Kelly 2001; Shukur and Mantalos 2000). Thus the following bootstrap scheme is applied:

  1. 1.

    Considering the d extra lags, estimate the parameters of the unrestricted and restricted models in Eqs. 5 and 6 and calculate the statistics F as in Eq. 7.

  2. 2.

    Under the null hypothesis of noncausality and without the d extra lags, estimate the parameters μ 2 and \(\{\beta_{j}\}_{j=1}^{k}\) of the restricted model (2) and calculate the residuals \(\hat{u}_{t}\).

  3. 3.

    Apply bootstrap procedure (resampling with replacement) on \(\hat{u}_{t}\) and obtain the pseudo-residuals \(u^{*}_{t}\).

  4. 4.

    Create the pseudo-data \(y_{t}^{*}\) given by

    $$ y_{t}^{*}=\hat{\mu}_{2}+\sum\limits_{j=1}^{k}\hat{\beta}_{j}y_{t-j}^{*}+u_{t}^{*} $$
    (8)
  5. 5.

    Using the pseudo-data \(y_{t}^{*}\), repeat the steps 1 and calculate the F bootstrap statistics.

  6. 6.

    Execute steps from 3 to 5 for N times.

  7. 7.

    Calculate the bootstrap p value which is the proportion of the F estimated bootstrap statistics that exceed the same statistic evaluated on the observed data.

In our application, the bootstrap p value is calculated using N = 10,000.

3 Data

Here, we deal with the following annual time series:

  • Global temperature anomalies y t : data available at http://www.cru.uea.ac.uk/cru/data/;

  • CO2, CH4, and N2O: data available at http://data.giss.nasa.gov. We have used IPCC (2001) expressions to convert greenhouse gas changes to instantaneous radiative forcing. In particular, c t is CO2 radiative forcing (RF), m t is CH4 RF, and n t is N2O RF;

  • Total solar irradiance \(\textrm{TSI}_{t}\): data available at www.geo.fu-berlin.de;

  • Cosmic ray intensity \(\textrm{CRI}_{t}\): data available at ftp.ncdc.noaa.gov;

  • Stratospheric aerosol optical thickness (SAOT) at 550 nm \(\textrm{SAOT}_{t}\): data available at http://data.giss.nasa.gov;

The period of study ranges from 1850 to 2007. We are interested in testing Granger causality from the single forcing to global temperature anomalies. Using the same approach of Kodra et al. (2011), we apply Granger noncausality tests for the last 58, 68, 78, 88, 98, 108, 118, 128, 138, 148, and 158 observations of our series. In this way, we may observe the influence of greenhouse gases or natural forcings to global temperature’s rise. In particular, we also analyze Granger causality from global radiative forcing g t (g t  = c t  + m t  + n t ) to global temperature.

4 Unit root tests on global temperature anomalies and forcings

In order to apply lag-augmented Wald test, the order of integration of the series is required. In particular, a time series w t is integrated of order h (w t ~I(h)) if \(\Delta^{h} w_{t}\) is stationary, where \(\Delta^{r}w_{t}\) is nonstationary for r < h. In this section, we focus on the presence or absence of stochastic trends in the global temperature anomalies and forcings employing the augmented Dickey–Fuller test (Dickey and Fuller 1981).

The model of the augmented Dickey–Fuller (ADF) test is specified as follows:

$$\label{eqADF} \Delta w_{t}=q_{0}+q_{1} t+\varphi w_{t-1}+\sum_{j=1}^{p^{*}}\xi_{j} \Delta w_{t-j}+v_{t} $$
(9)

where v t is a white noise. The lagged first differences of the dependent variables provide a correction for possible serial correlation. If φ = 0 and q 1 = 0, then w t has a unit root and a stochastic linear trend. Alternatively, if φ < 0, then the series is linear trend stationary. The null hypothesis is that the series is nontrend stationary. The ADF statistics is

$$ ADF_{t}=\frac{\hat{\varphi}}{se(\hat{\varphi})} $$
(10)

where \(\hat{\varphi}\) is the OLS estimate of φ, and \(se(\hat{\varphi})\) is the \(\hat{\varphi}\)’s standard error. The critical values are not standard, and they depend on the deterministic component selected in Eq. 9. The value of p *, where p * is the model order, is selected between 0 and 10 using Akaike information criteria given by

$$ \mbox{AIC}(p^{*})=\ln\left(\hat{\sigma}_{v}^{2}(p^{*})\right)+\frac{2\left(p^{*}+l\right)}{T} $$
(11)

where \(\hat{\sigma}_{v}^{2}(p^{*})=T^{-1}\sum_{t=1}^{T}\hat{v}_{t}^{2}\) is the error variance estimator based on the OLS residuals \(\hat{v}_{t}\) of the model (9), and l is the number of deterministic components.

The deterministic component in Eq. 9 is chosen, observing the graphic of the time series. In our cases, we consider linear trend and constant for the original series w t , constant for the first difference Δw t , and without trend and constant for the second difference \(\Delta^{2} w_{t}\). Only for \(\textrm{SAOT}_{t}\) the deterministic component is taken constant for the original series.

Haldrup and Lildholdt (2002) find that when w t is I(2), the ADF statistics gives rise to excessive rejection of the unit root null in favor of the stationary alternative. It depends on the ADF statistics which has a different distribution caused by the extra unit root. The authors suggest to test I(2) against I(1) prior to testing I(1) against I(0). Using the same approach described in Lutkepohl and Kratzig (2004), if w t can be I(2), a unit root test is applied to \(\Delta^{2}w_{t}\) first. If the null hypothesis is rejected, a unit root test is applied to Δw t . If the null hypothesis of unit root cannot be rejected in Δw t , then the time series w t is I(2); otherwise, considering w t as an I(2) series is not a good choice, and we test I(1) against I(0). In this last case, a unit root test is applied to w t .

4.1 Unit root test outcomes

The graphic of global temperature (Fig. 1) shows this time series may be I(1). Therefore, the original series y t is tested. The ADF test suggests that global temperature is I(1). This result is also one of the principal conclusions of recent researches (Kaufmann and Stern 1997; Kodra et al. 2011; Liu and Rodriguez 2005; Stern and Kaufmann 1999). Then we conclude that y t has a stochastic trend.

Fig. 1
figure 1

Plots of global temperature anomalies (GT anomalies, unit in degrees Celsius), total solar irradiance (TSI, unit in watts per square meter), cosmic ray intensity (CRI, unit in count rate of a polar neutron monitor), and stratospheric aerosol optical thickness at 550 nm (SAOT, unit in optical thickness at 550 nm)

The time series \(\textrm{TSI}_{t}\) and \(\textrm{CRI}_{t}\) (Fig. 1) have a periodicity of 11 years approximately. This is confirmed by means of a spectral analysis based on Bartlett estimator. The total solar irradiance and the cosmic ray intensity are I(1), whereas the time series \(\textrm{SAOT}_{t}\) is stationary. The results are shown in Table 1. Thus, the maximum order of integration between global temperature and the single natural forcing is d = 1.

Table 1 ADF test for global temperature anomalies y t , TSI, CRI, SAOT, carbon dioxide RF c t , methane RF m t , nitrous oxide RF n t , and global RF g t

The plots of radiative forcings (Fig. 2) indicate these time series may be I(2). Therefore, the second difference is tested first. ADF test suggests radiative forcings are I(2) (Table 1). Greenhouse gases have been found to have one unit root (Kaufmann and Stern 1997; Stern and Kaufmann 1999), then there is not a clear drawing about the order of integration of these series. Considering that y t is integrated of order one, we have two possible combinations: y t ~I(1) and radiative forcing I(1), and y t ~I(1) and radiative forcing I(2). In the first case, the maximum order of integration d is equal to 1; otherwise, d = 2. Which value of d do we select to apply the methodology of Toda and Yamamoto? We can perform this method considering d = 1 and d = 2 as in Triacca (2005). We observe an interesting notice. Let us consider the residuals of the following regression models:

  • y t  = ϑ 1 + φ 1 c t  + u 1t ;

  • y t  = ϑ 2 + φ 2 m t  + u 2t ;

  • y t  = ϑ 3 + φ 3 n t  + u 3t ;

  • y t  = ϑ 4 + φ 4 g t  + u 4t .

Fig. 2
figure 2

Plots of carbon dioxide radiative forcing (CO 2 RF), methane radiative forcing (CH 4 RF), nitrous oxide radiative forcing (N 2 O RF), and global radiative forcing (Global RF). Units in watts per square meter

In Fig. 3, the plots of \(\hat{u}_{it}\) are shown, for i = 1,2,3,4. Graphic analysis suggests these residuals are stationary. Also, the ADF (Table 2), with p * = 0, recommends u it ~I(0), i = 1,2,3,4. As described in Hamilton (1994), ADF test statistics is constructed as in Eq. 9 without constant and trend but the critical values are different because the test is applied to the residuals \(\hat{u}_{it}\) from a spurious regression (Phillips and Ouliaris 1990). Therefore, \(\hat{u}_{it}\) is a stationary linear combination of y t and radiative forcing, then these two series are of the same order of integration. Observing the previous two combinations, the only choice is that global temperature and the respective radiative forcing are I(1). In this way, we have also found an important structure of our series. In fact, global temperature and radiative forcing are cointegrated (Engle and Granger 1987; Granger 1981), so there is a long-run equilibrium relationship tying the individual series together. Therefore, the existence of a cointegrating relationship suggests that there must be a Granger causality in at least one direction (Granger 1988), but it does not indicate the direction of temporal causality between the variables.

Fig. 3
figure 3

Graphs of the residuals \(\hat{u}_{1t}\), \(\hat{u}_{2t}\), \(\hat{u}_{3t}\), and \(\hat{u}_{4t}\). Units in degrees Celsius

Table 2 ADF test for the residuals \(\left\{\hat{u}_{it}\right\}_{i=1}^{4}\)

5 Granger causality results

In the previous section, we have analyzed the order of integration of global temperature and forcings. We have found that the maximum order of integration is always equal to 1. Granger noncausality tests are performed for k = 1, k = 2, and k = 3 in Eqs. 1 and 2. In this way, the models are parsimonious, and the residuals are always uncorrelated. Therefore, the mean of bootstrapped residuals is zero.

Here, Granger causality is studied between different sets of two series. Of course, a multivariate approach (Gelper and Croux 2007) using models of higher dimension, as VAR models, could be envisaged in order to investigate the robustness of the bivariate results (Triacca 1998, 2002).

5.1 Natural forcings effect

It is clear in our results that there is no Granger causality from every natural forcings to global temperature. In fact, the null hypothesis of noncausality is never rejected exclusive of two cases for \(\textrm{CRI}_{t}\). These outcomes are statistically robust because the p values of the Toda and Yamamoto method are very similar to those of bootstrap method (Tables 3, 4 and 5). Furthermore, we have also analyzed the global effect of the natural forcings on global temperature, considering the variable x t in Eq. 1 as \(x_{t}=\left(\textrm{TSI, CRI, SAOT} \right)_{t}\). Granger causality is never foundFootnote 1 (with only one exception).

Table 3 Results of causality test (p values) from total solar irradiance to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods
Table 4 Results of causality test (p values) from cosmic ray intensity to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods
Table 5 Results of causality test (p values) from stratospheric aerosol optical thickness at 550 nm to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods

This evidence of Granger noncausality may depend on either linear models selected or in-sample tests. But Pasini et al. (2006), using neural network models, have shown that natural forcings have a weak nonlinear explanation on global temperature which does not permit to overcome the linear performance. Furthermore, in Attanasio et al. (2012), out-of-sample Granger causality is not found if natural forcings are used as regressors. These results confirm the low linear (or nonlinear) connection from natural forcings to global temperature.

5.2 Greenhouse gases effect

The results show that there is an evident Granger causality from CO2 radiative forcing to global temperature (Table 6). In fact, using a 5 % significant level, we cannot reject the null hypothesis of noncausality from CO2 RF to global temperature just in the first subperiod with k = 1. These outcomes are also confirmed by the bootstrap method.

Table 6 Results of causality test (p values) from CO2 RF to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods

Granger causality from CH4 RF to global temperature is always detectable for k = 1 and k = 2 exclusive of one subperiod for the bootstrap scheme (Table 7). There are some differences for k = 3. In this case, the null hypothesis of Granger noncausality is not rejected in two subperiods employing standard Wald test and in eight subperiods using bootstrap method. Therefore, Granger causality of methane is less strong than Granger causality of carbon dioxide.

Table 7 Results of causality test (p values) from CH4 RF to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods

Instead, there is no detectable Granger causality from N2O radiative forcing to global temperature (Table 8). The statistics is always insignificant for k = 1 and k = 2, whereas we can often reject the null hypothesis of noncausality for k = 3 by means of the Toda and Yamamoto method. This does not mean that Granger causality is not an appropriate method for studying the causal relationship between these variables. In fact, this weak Granger causality can depend on the possible limitation of in-sample approach. In fact, a strong linear out-of-sample Granger causality exists from N2O to global temperature (Attanasio et al. 2012). In addition, linear model cannot often catch the relationship in the complex climate system because this relationship may be nonlinear. Recent studies have shown that anthropogenic variables have nonlinear connection (Attanasio and Triacca 2011; Pasini et al. 2006). Hence, we will consider other approaches or methods in future investigations in order to understand better the role of this greenhouse gas on temperature trend.

Table 8 Results of causality test (p values) from N2O RF to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods

Global radiative forcing considers the global radiative forcings of carbon dioxide, nitrous oxide, and methane. It is interesting to understand the global impact of these gases on temperature using Granger causality. The outcomes show that there is a detectable Granger causality. In fact, the null hypothesis of noncausality is always rejected (Table 9), so a strong evidence of linear causality of g t on y t appears over all subperiods. It is an important outcome because it shows that the global contribution of greenhouse gases has a significant impact on the recent global warming.

Table 9 Results of causality test (p values) from global RF to global temperature, with d = 1, using Toda and Yamamoto (TY) and bootstrap methods

6 Conclusion

In this paper, we have analyzed the interaction between external forcings and global temperature. The results stress that natural forcings do not Granger-cause global temperature. Among a number of man-made greenhouse gases, CO2 has the greatest influence on the global climate. Granger causality from methane to global temperature is also evident, but it is less strong. Granger causality from nitrous oxide to global temperature is low because it depends on the model order selected; otherwise, the combined radiative forcing linearly Granger-cause global temperature. This important result proves that anthropogenic influences have a relevant role about the rise in global temperature.