Abstract
This article proposes omnibus portmanteau tests for contrasting adequacy of time series models. The test statistics are based on combining the autocorrelation function of the conditional residuals, the autocorrelation function of the conditional squared residuals, and the cross-correlation function between these residuals and their squares. The maximum likelihood estimator is used to derive the asymptotic distribution of the proposed test statistics under a general class of time series models, including ARMA, GARCH, and other nonlinear structures. An extensive Monte Carlo simulation study shows that the proposed tests successfully control the type I error probability and tend to have more power than other competitor tests in many scenarios. Two applications to a set of weekly stock returns for 92 companies from the S &P 500 demonstrate the practical use of the proposed tests.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Time series models often consist of two components: (i) the conditional mean part; and (ii) the conditional variance part. Traditionally, the autoregressive and moving average (ARMA) models are classified as linear and specify the mean part, whereas the generalized autoregressive conditional heteroscedasticity (GARCH) models are nonlinear and describe the variance part. During the past four decades, time series analysis was dominated by the ARMA models, where a good model should be able to specify the dependence structure of the series adequately (Box and Jenkins 1970). Dependency, in ARMA models, is often measured by using the residual autocorrelation function (ACF). To test the adequacy of an ARMA model, a portmanteau statistic was proposed by Box and Pierce (1970) based on the distribution of the residual ACF. Since then, several authors have improved the portmanteau tests (see, for example, Fisher and Gallagher 2012; Ljung and Box 1978; Peña and Rodríguez 2002, 2006; Mahdi 2017).
In the last two decades, the analysis of nonlinear time series models has attracted a great deal of interest in business, economics, finance, and other fields. Box and Jenkins (1970), Granger and Andersen (1978) and Tong and Lim (1980) noticed that the squared residuals of time series models are significantly autocorrelated even though the residuals are not autocorrelated. This indicates that the error term of these models might be uncorrelated but not independent. The authors suggested using the ACF of the squared values of the series to detect nonlinearity. In this respect, Engle (1982) showed that the classical portmanteau tests proposed by Box and Pierce (1970) and Ljung and Box (1978) fail to detect the presence of the autoregressive conditional heteroscedasticity (ARCH) in many financial time series models. To test for the presence of an ARCH process, Engle (1982) introduced a Lagrange multiplier statistic based on the autocorrelations of the squared residuals.
Several authors have developed portmanteau test statistics employing the ACF of the squared residuals to detect nonlinear structures and ARCH effect in time series models (see, for example, Fisher and Gallagher 2012; McLeod and Li 1983; Peña and Rodríguez 2002, 2006; Rodríguez and Ruiz 2005). All the above test statistics were derived under the assumptions of ARMA models and were not proposed for nonlinear time series models.
A portmanteau test was developed by Li and Mak (1994) to check the adequacy of nonlinear time series models, including ARMA-ARCH, and other conditional heteroscedastic structures. Under a general class of time series models, two mixed portmanteau tests, to detect the linear and nonlinear dependency in time series models, were considered by Wong and Ling (2005) summing the statistics derived by Box and Pierce (1970) and Li and Mak (1994) for the first one, and summing the statistics proposed by Ljung and Box (1978) and McLeod and Li (1983) for the second one. Wong and Ling (2005) showed that their mixed tests are, in many situations, more powerful than the tests proposed by Ljung and Box (1978) and McLeod and Li (1983), when the fitted model has a disparity in its first and second moments. Zhu (2013) proposed another mixed portmanteau test for ARMA-GARCH models with parameters estimated by a quasi-maximum exponential likelihood estimator. Li et al. (2018) proposed a first-order zero-drift GARCH (ZD-GARCH(1, 1)) model to study conditional heteroscedasticity and heteroscedasticity together, for which the authors constructed a portmanteau test for model checking. Their test statistic was derived based on the lag-k autocorrelation function of the sth power of the absolute residuals, where k is a positive integer and \(s>0\).
For the test statistics presented by Wong and Ling (2005); Zhu (2013) and Li et al. (2018), the authors did not consider the cross-correlation between the residuals at different powers. The idea of using the cross-correlation between the residuals at different powers to test for linearity was considered by Welsh and Jernigan (1983); Lawrance and Lewis (1985, 1987); Psaradakis and Vávra (2019).
In this article, we propose four mixed portmanteau statistics for time series models. The proposed test statistics are composed by three components: The first of them utilizes the autocorrelations of the residuals, which is designed to capture the linear dependency in the mean part of time series models. Then, the second component of these statistics utilizes the autocorrelation of the squared residuals, which can be used to test for conditional heteroscedastic effects. The third component of these statistics is related to the cross-correlations between the residuals and their squared values, which may be helpful to test for other types of nonlinear models in which the residuals and their squared values are cross-correlated. The cross-correlations between the residuals and their squared values allow us to propose two different tests. One of these tests is based on the positive lags and the other one uses the negative lags. Therefore, the tests proposed in the present study combine the statistics presented in Wong and Ling (2005) and Psaradakis and Vávra (2019).
The remainder of this article is organized as follows. Section 2 defines some popular time series models with their assumptions. In Sect. 3, we propose new auto-and-cross-correlated test statistics for contrasting the adequacy of fitted time series models and derive their asymptotic distributions. In Sect. 4, a Monte Carlo simulation study is conducted to compare the performance of the proposed statistics with some tests commonly used in the literature. We show that the empirical size of the proposed tests successfully controls the type I error probability and tends to have higher power than other tests in many cases. Section 5 presents illustrative applications to demonstrate the usefulness of the proposed tests for real-world data. We finish this article in Sect. 6 by providing concluding remarks.
2 The general time series model and its assumptions
Assume that \(\{z_t\text{: } t=0,\pm 1,\cdots \}\) is a time series that is generated by the strictly stationary and ergodic model defined by
where \(\mathcal {F}_{t-1}\) represents the information set (\(\sigma \)-algebra) generated by \(\{z_t,z_{t-1},\cdots \}\), and \(\varvec{\theta }\) denotes the \(l\times 1\) vector of unknown parameters and its true value is \(\varvec{\theta }_0\). \(\mu _{t}(\varvec{\theta })=\mu _{t}(\varvec{\theta },\mathcal {F}_{t-1})=\textrm{E}(z_t|\mathcal {F}_{t-1})\) and \(h_{t}(\varvec{\theta })=\textrm{Var}(\varepsilon _t|\mathcal {F}_{t-1})>0\) are the conditional mean and conditional variance of \(z_t\), respectively. Both are assumed to have continuous second order derivative almost surely (a.s.). The process \(\{\xi _t\}\) is a sequence of independent and identically distributed (i.i.d.) random variables with mean zero, variance one, and finite fourth moment.
The usual ARMA-GARCH model can be seen as a special case of this model that can be written as
where \(\{\xi _t\}\) is a sequence of i.i.d. random variables with mean zero, variance one, and \(\textrm{E}(\xi _t^4)<\infty \), with \(\omega >0\), \(\alpha _i\ge 0\), \(\beta _j\ge 0\), for \(i\in \{1,\cdots ,a\}, j\in \{1,\cdots ,b\}\), and \(\sum _{i=1}^{a}\alpha _i+\sum _{j=1}^{b}\beta _j<1\).
Ignoring the constant term, the Gaussian log-likelihood function of \(\{z_1,\cdots ,z_n\}\) given the initial values \(\{z_t \text{: } t\in \mathbb {Z}^{-}\cup \{0\}\}\) can be written as
where
where \(z^\star \equiv \{z_t,z_{t-1},\cdots \}\). Assuming the parameter space is \(\Theta \), where \(\varvec{\theta }_0\) is an interior vector in \(\Theta \), and and for convenience, let’s denote \(\varepsilon _{t}=\varepsilon _{t}(\varvec{\theta }_0)\), \(\mu _{t}=\mu _{t}(\varvec{\theta }_0)\), and \(h_t=h_t(\varvec{\theta }_0)\). The first derivative of the log-likelihood function is given by
By taking the conditional expectations of the iterative second derivatives with respect to \(\mathcal {F}_{t-1}\), we have
Assume that \(\partial \ell (\varvec{\theta }_0)/\partial \varvec{\theta }\) is a martingale difference in terms of \(\mathcal {F}_{t-1}\) and let \(\widehat{\varvec{\theta }}_n\) be the quasi-maximum likelihood estimator of \(\varvec{\theta }_0\), that is, \(\widehat{\varvec{\theta }}_n\overset{\mathrm {a.s.}}{\rightarrow }{\varvec{\theta }_0}\), where \(\overset{\mathrm {a.s.}}{\rightarrow }\) denotes a.s. convergence. Then, it follows that
where \(\varvec{\Sigma }^{-1}=\textrm{E}(-\partial ^2 \ell (\varvec{\theta }_0)/\partial \varvec{\theta }\partial \varvec{\theta }^\top )^{-1}\) and \(o_p(1)\rightarrow 0\) in probability as \(n\rightarrow \infty \). Furthermore, it has been shown that the asymptotic distribution of \(\sqrt{n}(\widehat{\varvec{\theta }}_n-\varvec{\theta }_0)\) is normal with zero mean \(l\times 1\) vector and variance-covariance \(l\times l\) matrix \(\varvec{\Sigma }^{-1}\) (see Hall and Heyde 1980; Higgins and Bera 1992; Ling and McAleer 2010).
3 The proposed test statistics
Let k be the lag of the series with \(k\in \{0, \pm 1, \pm 2,\cdots , \pm m\}\), where m is the largest value considered for the auto-and-cross-correlations and define
as the lag- k theoretical autocorrelation of the error process \(\left\{ \varepsilon _{t}(\varvec{\theta }_0)\right\} \) where \(\varvec{\theta }_0\) is the true but unknown parameter vector. Let
and
with
We derive the asymptotic distribution of the proposed test statistics under the null hypothesis that the time series model in (2.1) takes the correct functional forms given by \(\mathbb {H}_0:\mu _t = \mu _t(\varvec{\theta }_0)\text { and }h_t = h_t(\varvec{\theta }_0)\). The alternative hypothesis is \(\mathbb {H}_a:\mu _t \ne \mu _t(\varvec{\theta }_0)\text { or }h_t \ne h_t(\varvec{\theta }_0)\). Equivalently, the null and alternative hypotheses can be used for testing the lag residual auto-and-cross-correlation so that \(\mathbb {H}_0: \textbf{R}_{(r, s)}({\varvec{\theta }_0})=\varvec{0}_m\) and \(\mathbb {H}_a: \textbf{R}_{(r, s)}({\varvec{\theta }_0}) \ne \varvec{0}_m\), for all \(r, s\in \{1,2\}\). For simplicity, we dropped the symbol \(\varvec{\theta }_0\) so that \(\rho _{(r, s)}({\varvec{\theta }_0},k)=\rho _{(r, s)}(k)\) and \(\textbf{R}_{(r, s)}({\varvec{\theta }_0}) = \textbf{R}_{(r, s)}\).
Given a sample time series of length n observations \(z_1,z_2,\cdots ,z_n\), under the assumptions of \(\mathbb {H}_0\) and (2.4), we fit the model defined in (2.1). Subsequently, we calculate the standardized residuals (conditional residuals) raised to powers \(i\in {1,2}\) using the following expressions:
where \(\{{\widehat{\varepsilon }}_{t}\}, \{{\widehat{\varepsilon }}_{t}^2\}, \big \{\sqrt{{\widehat{h}}_{t}}\big \}\), and \(\{{\widehat{h}}_{t}\}\) denote the sample residuals, squared-residuals, conditional volatility, and conditional variance of \(z_t\), respectively.
The corresponding sample correlation coefficient between the standardized residuals may be written as
where \({{\widehat{\gamma }}_{(r,s)}(k)=n^{-1}\sum _{t=k+1}^{n}({\widehat{e}}_t^r -{\widetilde{e}}^r)({\widehat{e}}_{t-k}^s-{\widetilde{e}}^s)}\), for \(k\ge 0\), \({\widehat{\gamma }}_{(r,s)}(-k)={\widehat{\gamma }}_{(s,r)}(k)\), for \(k<0\), is the autocovariance (cross-covariance), at lag-k, between the standardized residuals to rth power and the standardized residuals to sth power, and \({\widetilde{e}}^i=n^{-1}\sum _{t=1}^{n}{\widehat{e}}_t^i\), for \(i\in \{1,2\}\).
Under the regular assumptions, it can be shown that \({\widetilde{e}}{=}o_p(1)\), \({\widetilde{e}}^2{=}1+o_p(1)\), and \(n^{-1}\sum _{t=1}^{n}({\widehat{e}}_{t}^2{-}{\widetilde{e}}^2)^2{=}\sigma ^2+o_p(1)\), where \(\sigma ^2\) converges to the value two (see Li and Mak 1994; Wong and Ling 2005) and (Theorem Ling and McAleer 2003). Hence, at lag-k, if we define \(\varvec{\Gamma }=(\varvec{\Gamma }_{(r,r)},\varvec{\Gamma }_{(s,s)},\varvec{\Gamma }_{(r,s)})_{3\,m\times 1}^\top \) and \(\varvec{\Gamma }_{(r,s)}=(\gamma _{(r,s)}(1),\cdots ,\gamma _{(r,s)}(m))^\top \) as the counterparts of \(\widehat{\varvec{\Gamma }}=(\widehat{\varvec{\Gamma }}_{(r,r)},\widehat{\varvec{\Gamma }}_{(s,s)},\widehat{\varvec{\Gamma }}_{(r,s)})_{3\,m\times 1}^\top \) and \(\widehat{\varvec{\Gamma }}_{(r,s)}=({\widehat{\gamma }}_{(r,s)}(1),\cdots ,{\widehat{\gamma }}_{(r,s)} (m))^\top \), respectively, with the replacement of the fitted residual \({\widehat{\varepsilon }}_t\) and conditional variance \({\widehat{h}}_t\) by \(\varepsilon _t\) and \(h_t\), respectively, we obtain:
We employ these autocorrelation coefficients to propose new portmanteau goodness-of-fit tests, as later defined in (3.8), to check for linear and nonlinear dependencies within the residual series.
Theorem 1
Let the model defined in (2.1) be correctly specified and that (2.4) holds. Then, we have that
where
\(\overset{\textrm{D}}{\rightarrow }\) denotes convergence in distribution and \(\varvec{\Omega }_{rs} = \textrm{E}\big [\varvec{R}(\varvec{\theta }_0) \varvec{R}^\top (\varvec{\theta }_0)\big ]\) is the covariance matrix, which can be replaced by a consistent estimator \(\widehat{\varvec{\Omega }}_{rs}\):
with \(r\ne s\in \{1,2\}\), \(\varvec{I}_{m}\) denotes the identity \(m\times m\) matrix,
and
Proof
The proof is given in the Appendix (1). \(\square \)
By the results of Theorem 1, we propose the portmanteau statistic, \({\dot{C}}_{rs}\) namely, and its modified version, \(C_{rs}\) namely, to test \(\mathbb {H}_0\) that the model stated in (2.1) is correctly specified. Thus, we have that
where \(\widehat{\varvec{R}}_{(1,1)},\widehat{\varvec{R}}_{(2,2)},\widehat{\varvec{R}}_{(r,s)}\) are defined in (3.2), and \(\widetilde{\varvec{R}}_{(1,1)}, \widetilde{\varvec{R}}_{(2,2)},\widetilde{\varvec{R}}_{(r,s)}\) are obtained after replacing the autocorrelation coefficients in (3.1) by their standardized values formulated as
From the theorem on quadratic forms given in Box (1954), it is straightforward to show that \({\dot{C}}_{rs}\) and \(C_{rs}\) are asymptotically chi-square distributed with \(3m-(p+q+1)\) degrees of freedom.
Figure 1 illustrates the accuracy of the approximation of the empirical distribution of \({\dot{C}}_{rs}\) and \(C_{rs}\), for \(r\ne s\in \{1,2\}\) to the chi-square distribution employing \(10^3\) replicates when an ARMA(1,1) model fits to a sample of size \(n=200\) generated from an ARMA(1,1) process defined as
The parameters of the ARMA model stated in (3.10) are selected to be very close to non-stationarity and non-invertibility case, whereas the coefficient of the MA model is near to cancellation with the coefficient of the AR model to demonstrate the usefulness of the proposed tests, even with extreme cases.
We found similar results for small and large samples and our preliminary analysis indicates that the portmanteau tests based on the statistics \(C_{rs}\) control the type I error probability more successfully than the tests that consider the statistics \({\dot{C}}_{rs}\). Hence, we recommend the use of \(C_{rs}\).
Remark 1
As mentioned, the proposed test statistics in (3.8) can be seen as combinations of the statistics presented by Wong and Ling (2005) and Psaradakis and Vávra (2019). Thus, each test statistic \(C_{rs}\) may be seen as a linear combination of three existent test statistic proposed by Ljung and Box (1978), McLeod and Li (1983), and Psaradakis and Vávra (2019), modifying the corresponding statistic \({\dot{C}}_{rs}\), which are linear combinations of three statistics given by Box and Pierce (1970), Li and Mak (1994), and Psaradakis and Vávra (2019).
4 Simulation studies
We carry out Monte Carlo simulations to examine statistical properties of the proposed tests. For comparative purposes, we also consider three test statistics given by
where \(Q_{12}\) and \(Q_{21}\) represent the statistics presented in Psaradakis and Vávra (2019) and \(Q_{22}\) denotes the statistic proposed by McLeod and Li (1983). In addition, we consider two statistics introduced by Li and Mak (1994) and Wong and Ling (2005), which are denoted by \(Q_{\text {LM}}\) and \(Q_{\text {WL}}\), respectively, given by
First, we examined six statistics, \(C_{12}, C_{21},Q_{12},Q_{21}\), \(Q_{22}\) and \(Q_{\text {WL}}\) namely, assuming the following five linear models studied by Psaradakis and Vávra (2019):
- A1.:
-
AR(1) model: \(z_t=-0.9z_{t-1}+\varepsilon _t\);
- A2.:
-
AR(2) model: \(z_t=0.6z_{t-1}-0.5z_{t-2}+\varepsilon _t\);
- A3.:
-
MA(1) model: \(z_t=0.8\varepsilon _{t-1}+\varepsilon _t\);
- A4.:
-
ARMA(2,1) model: \(z_t=0.8z_{t-1} +0.15z_{t-2}+0.3\varepsilon _{t-1}+\varepsilon _t\);
- A5.:
-
ARMA(1,1) model: \(z_t=0.6z_{t-1} +0.4\varepsilon _{t-1}+\varepsilon _t\).
For model A1, the parameter used by Psaradakis and Vávra (2019) was \(\phi =0.8\). However, we considered here a value negative of the parameter close to non-stationarity case to assess the behavior of the test statistics associated with positive and negative values of the parameters. We also analyzed for this model the cases where \(\phi \in \{\pm 0.1, \pm 0.5, \pm 0.8\}\) whose results showed very minor changes in the behavior of the proposed tests. For model A3, we also explored the cases where \({\theta }\in \{\pm 0.3, \pm 0.6, \pm 0.9\}\) and obtained good results.
Second, we investigated four statistics, \(C_{12},C_{21},Q_{\text {WL}}\), and \(Q_{\text {LM}}\) namely, according to the following three nonlinear models studied by Velasco and Wang (2015) and Han and Ling (2017):
- A6.:
-
GARCH(1,1) model: \(\varepsilon _t=\xi _t\sqrt{h_t}, \, h_t=0.1+0.3\varepsilon _{t-1}^2+0.5 h_{t-1}\);
- A7.:
-
AR(1)-ARCH(1) model: \(z_t=0.5z_{t-1}+\varepsilon _t, \,\varepsilon _t=\xi _t\sqrt{h_t}, \, h_t=0.1+0.4\varepsilon _{t-1}^2\);
- A8.:
-
AR(1)-GARCH(1,1) model: \(z_t=0.5z_{t-1}+\varepsilon _t, \, \varepsilon _t=\xi _t\sqrt{h_t}, h_t=0.1+0.3\varepsilon _{t-1}^2+0.5 h_{t-1}\);
where \(\eta _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\) or Student-t distribution with 10 degrees of freedom.
In all experiments, we use the R software (www.R-project.org, R Core Team 2020) to simulate 1000 replicates of artificial series of size \(n+n/2\) with \(n\in \{100,300\}\). However, only the last n data points are used to carry out portmanteau tests with the residuals of some fitted models.
The empirical size and power of the tests are calculated based on a nominal level 5%. Simulation results for nominal levels 1% and 10% are not reported, due to space conservation, but they are available upon request.
First, we calculate the type I error probability, at lags \(m \in \{5,10\}\), based on six statistics, \(C_{12}, C_{21},Q_{12},Q_{21}\), \(Q_{22}\) and \(Q_{\text {WL}}\) namely, when a true model is fitted to a series generated according to models A1-A5. Second, we investigated the accuracy of estimating type I error probability using four statistics, \(C_{12},C_{21},Q_{\text {WL}}\), and \(Q_{\text {LM}}\) namely, when a true model is fitted to a series generated according to models A6-A8.
The empirical sizes of the test corresponding to the nominal size \(5\%\) over 1000 independent simulations belong to the 95% confidence interval \([3.65\%,6.35\%]\) and to the 99% confidence interval \([3.22\%, 6.78\%]\). From Tables 1 and 2, note that tests based on the statistics \(Q_{\text {WL}}\) and \(Q_{\text {LM}}\) can distort the test size, whereas the tests using the statistics \(Q_{22},Q_{12},Q_{21}\) and the proposed statistics exhibit no substantial size distortion and, generally, have empirical levels that improve as n increases. Also, we found similar results for the cases where the error terms have either skew-normal distribution with asymmetry parameter \(\kappa \in \{-1.0, -0.5, 0.5, 1.0\}\) or Student-t distribution with degrees of freedom \(\nu \in \{5,15,20\}\). These results are omitted here due to space conservation, but they are available upon request.
4.1 Testing linearity in linear time series models
Now, we investigate the efficiency to distinguish power for the mean term of the test statistics \(C_{12},C_{21},Q_{12},Q_{21},Q_{22}\), and \(Q_{\text {WL}}\)Footnote 1. For expositional simplicity, we define \(C^\star \) as the highest test power attained by the statistics \(C_{12}\) and \(C_{21}\), whereas \(Q^\star \) as the highest test power attained by \(Q_{12}\) and \(Q_{21}\), that is,
The power of the tests are calculated under the null hypothesis \(\mathbb {H}_0\) that \(z_t\) satisfies the ARMA model
which can be seen as an AR(p) model, where \(p\rightarrow \infty \). We followed the approach presented by Ng and Perron (2005) who used the Bayesian information criterion (BIC) to select the order \(p\in \{0,1,\cdots ,\lfloor 8(n/100)^{1/4}\rfloor \}\), where \(\lfloor a \rfloor \) denotes the floor function (integer part) of the number \(a\in \mathbb {R}\), when an AR(p) model erroneously fits to series generated from the following models studied by Li and Mak (1994), Wong and Ling (2005), Han and Ling (2017), and Psaradakis and Vávra (2019):
- B1.:
-
Bilinear (BL) model: \(z_t=0.2+0.4z_{t-1}+\varepsilon _{t}+\varphi z_{t-1}\varepsilon _{t-1}\), where \(\varepsilon _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\), with parameter values of \(\varphi \) being selected in the range \(0<\varphi <2.5\);
- B2.:
-
Random coefficient AR (RCAR) model: \(z_t= 0.2z_{t-1}+u_{t}, u_{t}=\varphi \eta _{t} z_{t-1}+\varepsilon _{t}\), where \(\{\varepsilon _t\}\) and \(\{\eta _t\}\) are two sequences of i.i.d. \(\mathcal {N}(0,1)\) random variables, which are independent from each other variable; note that the RCAR model is a special case of the AR(1) and ARCH(1) models as observed in \(\textrm{E}(u_t^2|\mathcal {F}_{t-1})=\varphi ^2 z_{t-1}^2+1\), which is the conditional variance over time. We select parameter values of \(\varphi \) from the range \(0<\varphi <2.5\);
- B3.:
-
TAR model: \(z_t=0.8 z_{t-1}\mathbb {I}_{\{z_{t-1}\le -1\}} - 0.8 z_{t-1}\mathbb {I}_{\{z_{t-1}> -1\}} +\varepsilon _t\), where \(\varepsilon _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\);
- B4.:
-
AR(1)-ARCH(2) model: \(z_t=0.2 z_{t-1}+\varepsilon _{t}, \quad \varepsilon _t=\xi _t\sqrt{h_t}, \quad h_t=0.2+0.2\varepsilon _{t-1}^2+0.2\varepsilon _{t-2}^2\), where \(\xi _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\).
Note that we found similar results based on the Akaike information criterion –AIC– (Akaike 1974).
Figure 2 displays the rejection frequencies considering a 5% nominal level of the statistics \(C^\star ,Q^\star ,Q_{22}\), and \(Q_{\text {WL}}\) when an AR(p) model erroneously fits data of size \(n\in \{100,300\}\) from BL (B1) and RCAR (B2) models at lag value \(m=\lfloor \sqrt{n}\rfloor \). The results for both models are based on parameter values \(\varphi \) varying from 0 to 2.5, as mentioned. In addition, Fig. 3 shows the rejection probability of the aforementioned tests (of nominal level 0.05) employing the nonlinear TAR (B3) and AR(1)-ARCH(2) (B4) models. For \(n\in \{100,300\}\), the tests are calculated at lags \(m\in \{2,4,6,8,10\}\) and \(m\in \{3,6,9,13,17\}\), respectively. From Figs. 2 and 3, note that the performance of the proposed statistic \(C^\star \) is, in general, the best, especially for small sample sizes. Thus, we conclude that the proposed statistics are helpful for testing for linearity of stationary time series.
4.2 Testing the AR-ARCH models
In order to examine the ability for discriminating power for the mean and conditional variance parts of a time series model, we consider the AR-GARCH model versus nonlinear models with GARCH errors. The process \(\{z_t\}\) satisfies the null hypothesis \(\mathbb {H}_0\) of heteroskedasticity given by
The alternative models are:
- D1.:
-
AR(1)-ARCH(2) model: \(z_t=0.5z_{t-1}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.01+0.4\varepsilon _{t-1}^2+0.3\varepsilon _{t-2}^2\);
- D2.:
-
AR(1)-GARCH(1,1) model: \(z_t=0.5z_{t-1}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.041+0.4\varepsilon _{t-1}^2+0.5 h_{t-1}\);
- D3.:
-
AR(2)-ARCH(2) model: \(z_t=0.5z_{t-1}+0.2z_{t-2}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.01+0.4\varepsilon _{t-1}^2+0.2 \varepsilon _{t-2}^2\);
- D4.:
-
TAR model with GJR-GARCH(1,1) error:
$$\begin{aligned}{} & {} z_t=0.4z_{t-1}+0.5z_{t-1}\mathbb {I}_{\{z_{t-1}>0\}}+\xi _{t}\sqrt{h_{t}},\\{} & {} h_t=0.1+(0.3+0.4 \mathbb {I}_{\{\varepsilon _{t-1}<0\}})\varepsilon _{t-1}^2+0.4 h_{t-1}. \end{aligned}$$
For each model, the power of the test for the statistics \(C^\star ,Q_{\text {WL}}\), and \(Q_{\text {LM}}\) are calculated at lags \(m\in \{2,4,6,8,10\}\) and \(m\in \{3,6,9,13,17\}\) associated with \(n\in \{100,300\}\). The results are shown in Fig. 4. Note that the performance of the proposed test is in general better when compared with the other two tests.
Worth noting that, in general, as the lag order increases, the power of the portmanteau tests decreases, especially under models like the TAR model, for several reasons including the following:
-
When the lag is large compared to the overall sample size, we usually get less reliable estimates of autocorrelation and, consequently, less power in the portmanteau test.
-
As we increase the lag order, we are including more lags in the test, which means we are estimating more parameters. This results in a loss of degrees of freedom in the test statistics. With fewer degrees of freedom, the test becomes less sensitive to detecting the absence of autocorrelation.
-
TAR models have different nonlinear thresholds which complicate the estimation of the autocorrelation function at higher lag orders, making it more challenging to detect the absence of autocorrelation.
5 Empirical applications
5.1 Test for nonlinearity in AR models using stock returns
We demonstrate the usefulness of the proposed tests for detecting nonlinearity in AR models for a set of weekly stock returns. We select 92 companies studied by Kapetanios (2009) and Psaradakis and Vávra (2019). These companies are a subset of the Standard & Poor 500 composite index (S &P 500), spanning over the period from 18 June 1993 to 31 December 2007 (\(n=781\) observations).
Following the procedure presented by Psaradakis and Vávra (2019), we fit an AR(p) model for each series, where the order of p was selected by minimizing the BIC according to the algorithm explained in Sect. 4.1 (Ng and Perron 2005). The asymptotic p-values (at 5% significance level) for tests based on the statistics \(C^\star , Q^\star , Q_{22}\), and \(Q_{\text {WL}}\), with \(m=\ln \lfloor n\rfloor \), are reported in Table 3. Since the conditional heteroskedasticity is often considered as a main characteristic of asset returns, we expect that the AR model will not capture the nonlinear features in most of the stock returns considered in our analysis and the null hypothesis of linearity should be rejected.
From the results in Table 3, we found that the linearity assumption is rejected by the proposed tests in \(81 (88.0\%)\) cases compared with \(64 (69.6\%), 74 (80.4\%)\), and \(71 (77.2\%)\) cases on the basis of the test statistics \(Q^\star ,Q_{22}\), and \(Q_{\text {WL}}\), respectively. This arguably suggests that the proposed tests are preferable to test the presence of nonlinearity in AR models for asset returns.
5.2 Goodness-of-fit-tests for nonlinear time series models
We examine the ability of the portmanteau tests to distinguish an unsuitable model for weekly stock returns of Aon plc company studied in Sect. 5.1. We find a strong evidence against linearity in the AR model for the returns of this company by using portmanteau tests. The Aon plc returns are displayed in Fig. 5 which shows that the log-return series have high persistence in volatility with negative skewness and excess kurtosis. We conclude, therefore, that these returns might exhibit conditional heteroskedasticity effects, and a model that belongs to the ARCH family with a Student-t distribution of the error process might better explain the leptokurtic distribution of the returns. Thus, we fit ARCH(1), AR(1)-ARCH(1), ARCH(2), and GARCH(1,1) models and apply the test statistics \(C^\star , Q_{\text {WL}}\), and \(Q_{\text {LM}}\) at lag value \(m=6\). The asymptotic p-values for testing the model adequacy based on these proposed statistics are reported in Table 4. From this table, the test statistic \(Q_{\text {LM}}\) fails to detect the inadequacy in all of the fitted models, whereas the test statistic \(Q_{\text {WL}}\) suggests that the ARCH(1), ARCH(2) and GARCH(1,1) models might be suitable to describe for the Aon plc returns. Only the proposed test statistics suggest a clear indication of inadequacy of the ARCH(1), AR(1)-ARCH(1), and ARCH(2) models, while the GARCH(1,1) model might be an adequate model for the Aon plc returns according to the proposed test statistics.
6 Conclusions
In this article, we have introduced four mixed portmanteau statistics for assessing the adequacy of time series models. The tests we propose are based on a linear combination of three auto-and-cross-correlation components. The first and second components are derived from the autocorrelations of residuals and their squared values, respectively. Meanwhile, the third component takes into account the cross-correlations between the residuals and their square values, considering both positive and negative lags. Two of these tests can be viewed as an extended version of the Ljung and Box test, while the others can be considered an extension of the Box and Pierce test. Based on our simulation study, it is recommended to use the proposed tests, which can be seen as an extended version of the Ljung and Box test. These tests demonstrate better control over the type I error probability compared to existing tests. Furthermore, they generally exhibit more statistical power than tests relying on the statistics introduced by Li and Mak (1994), McLeod and Li (1983), Psaradakis and Vávra (2019), and Wong and Ling (2005).
Simulation results indicate that combining \(\varvec{R}{(1,2)}\) and \(\varvec{R}{(2,1)}\) in a test statistic significantly reduces the test’s power. This can be justified by the lack of independence between these two distinct components, as they share a substantial amount of information about correlation. Consequently, they will add complex redundant correlation which leads to decrease the power in the proposed test.
Some of the test statistics that we have discussed have high computational burdens, so we have implemented them in an R package named portes (Mahdi and McLeod 2020). The idea discussed in this article may be extended to formulate an omnibus portmanteau test that combines the cross-correlations between the residuals and their square values at both positive and negative lags with the autocorrelations of the residuals and their squared values. The framework we propose could be expanded to identify seasonality in time series and to detect various types of nonlinearity dependence in multivariate time series, as discussed by (Mahdi 2016).
In this article, our focus has been on measures derived from second-order mixed moments. Nevertheless, there is potential for extending these measures to higher-order moments. Our simulation study revealed that severe skewness, such as in the case of a skewed t-distribution, can distort the size of all portmanteau tests. When distributional assumptions are relaxed, the robustness of bootstrapping and Monte Carlo significance test approaches becomes evident (Efron and Tibshirani 1994; Lin and McLeod 2006; Mahdi and Ian McLeod 2012). Therefore, a possible extension of this article could involve considering these approaches for calculating p-values.
Data availibility
No datasets were generated or analysed during the current study.
Notes
We also test the form of the GARCH-type models by examining the GARCH(1,1) model versus the MA(1)-GARCH(1,1) and AR(1)-GARCH(1,1) models based on the test statistics \(C^\star ,Q_{\text {WL}}\), and \(Q_{\text {LM}}\). The results are available upon request.
References
Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974)
Box, G.E.P.: Some theorems on quadratic forms applied in the study of analysis of variance problems, I effect of inequality of variance in the one-way classification. Ann. Math. Stat. 25(2), 290–302 (1954)
Box, G., Jenkins, G.: Time Series Analysis: Forecasting and Control. Holden-Day, San Francisco (1970)
Box, G.E.P., Pierce, D.A.: Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. J. Am. Stat. Assoc. 65(332), 1509–1526 (1970)
Efron, B., Tibshirani, R.: An Introduction to the Bootstrap. Monographs on Statistics & Applied Probability, Taylor & Francis (1994)
Engle, R.: Autoregressive conditional heteroscedasticity with estimates of the variance of united kingdom inflation. Econometrica 50(4), 987–1007 (1982)
Fisher, T.J., Gallagher, C.M.: New weighted portmanteau statistics for time series goodness of fit testing. J. Am. Stat. Assoc. 107(498), 777–787 (2012)
Granger, C.W.J., Andersen, A.P.: An Introduction to Bilinear Time Series Models. Gottingen, Vandenhoeck and Ruprecht (1978)
Hall, P., Heyde, C.: Estimation of parameters from stochastic processes. In: Martingale Limit Theory and its Application, Probability and Mathematical Statistics: A Series of Monographs and Textbooks, pp. 155–199. Academic Press, New York (1980)
Han, N.S., Ling, S.: Goodness-of-fit test for nonlinear time series models. Ann. Financ. Econ. 12(02), 1750006 (2017)
Higgins, M.L., Bera, A.K.: A class of nonlinear arch models. Int. Econ. Rev. 33(1), 137–158 (1992)
Kapetanios, G.: Testing for strict stationarity in financial variables. J. Bank. Finance 33, 2346–2362 (2009)
Lawrance, A.J., Lewis, P.A.W.: Modelling and residual analysis of nonlinear autoregressive time series in exponential variables. J. Roy. Stat. Soc. Ser. B (Methodological) 47(2), 165–202 (1985)
Lawrance, A.J., Lewis, P.A.W.: Higher-order residual analysis for nonlinear time series with autoregressive correlation structures. Int. Stat. Rev. 55(1), 21–35 (1987)
Li, W.K., Mak, T.K.: On the squared residual autocorrelations in non-linear time series with conditional heteroskedasticity. J. Time Ser. Anal. 15(6), 627–636 (1994)
Li, D., Zhang, X., Zhu, K., Ling, S.: The ZD-GARCH model: a new way to study heteroscedasticity. J. Econ. 202(1), 1–17 (2018)
Lin, J.-W., McLeod, A.: Improved peña-rodríguez portmanteau test. Comput. Stat. Data Anal. 51(3), 1731–1738 (2006)
Ling, S., Li, W.K.: Diagnostic checking of nonlinear multivariate time series with multivariate arch errors. J. Time Ser. Anal. 18(5), 447–464 (1997)
Ling, S., McAleer, M.: Asymptotic theory for a vector ARMA-GARCH model. Economet. Theor. 19(2), 280–310 (2003)
Ling, S., McAleer, M.: A general asymptotic theory for time-series models. Stat. Neerl. 64(1), 97–111 (2010)
Ljung, G.M., Box, G.E.P.: On a measure of lack of fit in time series models. Biometrika 65(2), 297–303 (1978)
Mahdi, E., Ian McLeod, A.: Improved multivariate portmanteau test. J. Time Ser. Anal. 33(2), 211–222 (2012)
Mahdi, E.: Portmanteau test statistics for seasonal serial correlation in time series models. SpringerPlus 5, 1485 (2016)
Mahdi, E.: Kernel-based portmanteau diagnostic test for ARMA time series models. Cogent Math. 4(1), 1296327 (2017)
Mahdi, E., McLeod, A.I.: Portes: portmanteau tests for univariate and multivariate time series models. R package version 5.0 (2020)
McLeod, A.I., Li, W.K.: Diagnostic checking ARMA time series models using squared-residual autocorrelations. J. Time Ser. Anal. 4(4), 269–273 (1983)
Ng, S., Perron, P.: A note on the selection of time series models. Oxford Bull. Econ. Stat. 67(1), 115–134 (2005)
Peña, D., Rodríguez, J.: A powerful portmanteau test of lack of fit for time series. J. Am. Stat. Assoc. 97(458), 601–610 (2002)
Peña, D., Rodríguez, J.: The log of the determinant of the autocorrelation matrix for testing goodness of fit in time series. J. Stat. Plan. Inference 136(8), 2706–2718 (2006)
Psaradakis, Z., Vávra, M.: Portmanteau tests for linearity of stationary time series. Economet. Rev. 38(2), 248–262 (2019)
R Core Team.: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2020)
Rodríguez, J., Ruiz, E.: A powerful test for conditional heteroscedasticity for financial time series with highly persistent volatilities. Stat. Sin. 15(2), 505–525 (2005)
Tong, H., Lim, K.S.: Threshold autoregression, limit cycles and cyclical data. J. Roy. Stat. Soc.: Ser. B (Methodol.) 42(3), 245–268 (1980)
Velasco, C., Wang, X.: A joint portmanteau test for conditional mean and variance time-series models. J. Time Ser. Anal. 36(1), 39–60 (2015)
Welsh, A.K., Jernigan, R.W.: A statistic to identify asymmetric time series. In: Proceedings of the Business and Economics Statistics Section, pp. 390–395. American Statistical Association, Alexandria (1983)
Wong, H., Ling, S.: Mixed portmanteau tests for time-series models. J. Time Ser. Anal. 26(4), 569–579 (2005)
Zhu, K.: A mixed portmanteau test for ARMA-GARCH models by the quasi-maximum exponential likelihood estimation approach. J. Time Ser. Anal. 34(2), 230–237 (2013)
Acknowledgements
Our sincere thanks go to Dr. Ajay Jasra, the editor, Dr. Mathieu Gerber, the associate editor, the two anonymous reviewers, Dr. Jan G. De Gooijer, and Kazem Ghanbari for their valuable and insightful suggestions on the manuscript.
Author information
Authors and Affiliations
Contributions
EM is sole contributor to this research paper.
Corresponding author
Ethics declarations
conflict of interest
The author reports no conflict of interest regarding this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix
Appendix
By Taylor’s theorem there exists a random vector \(\tilde{\varvec{\theta }}_n\) on the line segment between \(\varvec{\theta }_0\) and \(\varvec{{\hat{\theta }}}_n\) such that
where the second derivative \(\dfrac{\partial ^2 \varvec{\Gamma }_{(1,1)}(\tilde{\varvec{\theta }}_n)}{\partial \varvec{\theta }^\top \partial \varvec{\theta }}\) depends on the second-order derivatives \(\dfrac{\partial ^2\gamma _{(1,1)}(k)}{\partial \varvec{\theta }^\top \partial \varvec{\theta }}\). By assumption, the second-order partial derivatives are dominated by a fixed integrable function for every \(\varvec{\theta }\) in a ball B around \(\varvec{\theta }_0\), so that the probability of the event \(\left\{ \widehat{\varvec{\theta }}_n \in B\right\} \) tends to 1. Thus
As the sequence \(\left( \widehat{\varvec{\theta }}_n-\varvec{\theta }_0\right) O_P(1)=o_P(1) O_P(1)\) converges to 0 in probability when \(\widehat{\varvec{\theta }}_n\) is consistent for \(\varvec{\theta }_0\), we can rewrite (7.2) by employing the first-order Taylor series approximation:
where
By the ergodic theorem, for large n, and after taking the expectation with respect to \(\mathcal {F}_{t-1}\), it is straightforward to show that
where
is a \(1\times l\) vector, which can be consistently estimated by
It follows that \(\widehat{\varvec{\Gamma }}_{(1,1)}\) stated in (7.3) can be expressed as \(\widehat{\varvec{\Gamma }}_{(1,1)}\approx \varvec{\Gamma }_{(1,1)} - \varvec{X}_{11}(\widehat{\varvec{\theta }}_n-\varvec{\theta }_0)\), where \(\varvec{X}_{11}=(\varvec{X}_{11}^\top (1),\cdots , \varvec{X}_{11}^\top (m))^\top \) is the resultant \(m\times l\) matrix. Thus, by scaling each term by the variance of the standardized residual, we have
When \(\xi _t\) is normally distributed, the random vector \(\sqrt{n}\widehat{\varvec{R}}_{(1,1)}\) is asymptotically normal distributed with a mean of zero vector and a variance \(\varvec{I}_{m}-\varvec{X}_{11}\varvec{\Sigma }^{-1}\varvec{X}_{11}^\top \), where \(\varvec{I}_{m}\) is the identity \(m\times m\) matrix.
For the case of \(\widehat{\varvec{\Gamma }}_{(2,2)}\) and \(\widehat{\varvec{R}}_{(2,2)}\), Li and Mak (1994) and Ling and Li (1997) showed that
where \(\varvec{X}_{22}=(\varvec{X}_{22}^\top (1),\cdots , \varvec{X}_{22}^\top (m))^\top \) is an \(m\times l\) matrix, and \(\varvec{X}_{22}(k)\) is given by
The authors proved that \(\sqrt{n}\widehat{\varvec{R}}_{(2,2)}\) is asymptotically normal distributed with a mean of zero vector and a variance \(\varvec{I}_{m}-\frac{1}{4}\varvec{X}_{22}\varvec{\Sigma }^{-1}\varvec{X}_{22}^\top \).
Now, we consider the case that \(r=1\) and \(s=2\). Analogous to the reasoning in equations (7.1-7.3), we can express the first-order Taylor series approximation of \(\widehat{\varvec{\Gamma }}_{(1,2)}\) as follows:
where
By the ergodic theorem, for large n, note that
where
is a \(1\times l\) vector, which can be consistently estimated by
Thus, \(\widehat{\varvec{\Gamma }}_{(1,2)}\) stated in (7.8) may be expressed as \(\widehat{\varvec{\Gamma }}_{(1,2)}\approx \varvec{\Gamma }_{(1,2)} - \varvec{X}_{12}(\widehat{\varvec{\theta }}_n-\varvec{\theta }_0)\), and
where \(\varvec{X}_{12}=(\varvec{X}_{12}^\top (1),\cdots ,\varvec{X}_{12}^\top (m))^\top \).
Similarly, for the case \(r=2\) and \(s=1\), it is straightforward to show that
where \(\varvec{X}_{21}=(\varvec{X}_{21}^\top (1),\cdots ,\varvec{X}_{21}^\top (m))^\top \), and \(\varvec{X}_{21}(k)\) is a \(1\times l\) vector given by
The assumptions on \(\xi _t\) imply that the random vectors \(\sqrt{n}\widehat{\varvec{R}}_{(1,2)}\) and \(\sqrt{n}\widehat{\varvec{R}}_{(2,1)}\) are asymptotically normal distributed with mean zero and variance \(\varvec{I}_{m}-\frac{1}{2}\varvec{X}_{12}\varvec{\Sigma }^{-1}\varvec{X}_{12}^\top \) and \(\varvec{I}_{m}-\frac{1}{2}\varvec{X}_{21}\varvec{\Sigma }^{-1}\varvec{X}_{21}^\top \), respectively.
By utilizing the results from (7.5), (7.6), (7.10), and (7.11) we can deduce the joint distribution of \(\widehat{\varvec{R}}_{(1,1)},\widehat{\varvec{R}}_{(2,2)},\widehat{\varvec{R}}_{(r,s)}\), for the cases \(r=1, s = 2\) and \(r=2, s=1\). Without loss of generality, we establish the proof for the case \(r = 1\) and \(s = 2\). The proof for \(r = 2\) and \(s = 1\) readily ensues. Therefore, if the model defined in (2.1) is correctly specified, we obtain:
where
Note that \(\{\varepsilon _t\}\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\) so that the factors \(1/\sigma ^2\) and \(1/\sigma \) can be replaced by 1/2 and \(1/\sqrt{2}\), respectively.
Let \({\varvec{W}_n=\sqrt{n}(\varvec{R}_{(1,1)}^\top , \varvec{R}_{(2,2)}^\top , \varvec{R}_{(1,2)}^\top ,n^{-1}{\partial \ell }/{\partial \varvec{\theta }^\top })^\top }\). By using a martingale difference approach in terms of \(\mathcal {F}_t\) and following the same arguments provided by (Wong and Ling (2005), Theorem 1), one can easily show that \(\varvec{W}_n\overset{\textrm{D}}{\rightarrow }\mathcal {N}({\varvec{0}},{\varvec{V}})\); hence, \(\sqrt{n}({\widehat{\varvec{R}}_{(1,1)}}^\top ,{\widehat{\varvec{R}}_{(2,2)}}^\top ,{\widehat{\varvec{R}}_{(1,2)}}^\top )^\top \overset{\textrm{D}}{\rightarrow }\mathcal {N}_{3\,m}(\varvec{0},\varvec{\Omega }_{12})\), where \(\varvec{\Omega }_{12}=\varvec{D}\varvec{V}\varvec{D}^\top \). The matrices \(\varvec{D},\varvec{V}\), and \(\varvec{\Omega }_{12}\) can be consistently estimated by their sample values, denoted by \(\widehat{\varvec{D}},\widehat{\varvec{V}}\), and \(\widehat{\varvec{\Omega }}_{12}\), respectively. Under the assumptions of the model stated in (2.1), we get
Thus, we reach
For the ARMA models, we have \(\varvec{X}_{11}\approx \varvec{0}, \varvec{X}_{22}=\varvec{0}\), and \(\varvec{X}_{12}=\varvec{0}\), whereas for GARCH models, we have \(\varvec{X}_{11}\approx 0\) and \(\varvec{X}_{12}=0\). Furthermore, for large n, when the model stated in (2.1) is correctly specified, the off-diagonal block matrices in the matrix \(\widehat{\varvec{\Omega }}_{12}\) are approximately zero. Therefore, in general, the matrix \(\widehat{\varvec{\Omega }}_{rs}\) has the form stated as
with \(r\ne s\in \{1,2\}\). \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mahdi, E. New mixed portmanteau tests for time series models. Stat Comput 34, 76 (2024). https://doi.org/10.1007/s11222-024-10393-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11222-024-10393-w