1 Introduction

Time series models often consist of two components: (i) the conditional mean part; and (ii) the conditional variance part. Traditionally, the autoregressive moving average (ARMA) models are classified as linear and specify the mean part, whereas the generalized autoregressive conditional heteroscedasticity (GARCH) models are nonlinear and describe the variance part. During the past four decades, time series analysis has been dominated by the ARMA models, where a good model should specify the dependence structure of the series adequately (Box and Jenkins 1970). In ARMA models, dependence is often measured using the residual autocorrelation function (ACF). To test the adequacy of an ARMA model, Box and Pierce (1970) proposed a portmanteau statistic based on the distribution of the residual ACF. Since then, several authors have improved the portmanteau tests (see, for example, Fisher and Gallagher 2012; Ljung and Box 1978; Peña and Rodríguez 2002, 2006; Mahdi 2017).
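As a concrete illustration of such a portmanteau test, the Ljung–Box statistic can be computed from the residual ACF in a few lines. The following is a minimal sketch in Python (the function names are ours; the computations in this study are carried out in R):

```python
import numpy as np

def acf(x, max_lag):
    """Sample autocorrelations r_1, ..., r_max_lag of a series x."""
    x = np.asarray(x, dtype=float) - np.mean(x)
    denom = np.sum(x * x)
    return np.array([np.sum(x[k:] * x[:-k]) / denom for k in range(1, max_lag + 1)])

def ljung_box(residuals, m, n_fitted=0):
    """Ljung-Box (1978) statistic Q = n(n+2) sum_{k=1}^m r_k^2 / (n-k),
    referred to a chi-square law with m - n_fitted degrees of freedom,
    where n_fitted is the number of fitted ARMA parameters."""
    n = len(residuals)
    r = acf(residuals, m)
    q = n * (n + 2) * np.sum(r**2 / (n - np.arange(1, m + 1)))
    return q, m - n_fitted

# White-noise check: under an adequate model the residuals behave like i.i.d. noise.
rng = np.random.default_rng(0)
q_stat, df = ljung_box(rng.standard_normal(500), m=10)
```

Under a correctly specified model, the statistic is referred to a chi-square distribution whose degrees of freedom are reduced by the number of fitted ARMA parameters.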

In the last two decades, the analysis of nonlinear time series models has attracted a great deal of interest in business, economics, finance, and other fields. Box and Jenkins (1970), Granger and Andersen (1978) and Tong and Lim (1980) noticed that the squared residuals of time series models are significantly autocorrelated even though the residuals are not autocorrelated. This indicates that the error term of these models might be uncorrelated but not independent. The authors suggested using the ACF of the squared values of the series to detect nonlinearity. In this respect, Engle (1982) showed that the classical portmanteau tests proposed by Box and Pierce (1970) and Ljung and Box (1978) fail to detect the presence of the autoregressive conditional heteroscedasticity (ARCH) in many financial time series models. To test for the presence of an ARCH process, Engle (1982) introduced a Lagrange multiplier statistic based on the autocorrelations of the squared residuals.
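The idea behind Engle's test can be sketched as follows (a minimal Python version using the common auxiliary-regression form LM = nR²; the ARCH parameters in the simulated example are made up for illustration):

```python
import numpy as np

def engle_lm_test(residuals, q):
    """Engle's (1982) LM test for ARCH(q): regress e_t^2 on a constant and
    e_{t-1}^2, ..., e_{t-q}^2; LM = n * R^2 is asymptotically chi-square(q)."""
    e2 = np.asarray(residuals, dtype=float) ** 2
    y = e2[q:]
    X = np.column_stack([np.ones(len(y))]
                        + [e2[q - i:len(e2) - i] for i in range(1, q + 1)])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    u = y - X @ beta
    r2 = 1.0 - (u @ u) / np.sum((y - y.mean()) ** 2)
    return len(y) * r2

# Illustrative ARCH(1) data: h_t = 0.2 + 0.5 * e_{t-1}^2 (made-up parameters).
rng = np.random.default_rng(1)
e = np.zeros(1000)
for t in range(1, 1000):
    e[t] = rng.standard_normal() * np.sqrt(0.2 + 0.5 * e[t - 1] ** 2)
lm_arch = engle_lm_test(e, q=1)                         # large: ARCH effect present
lm_iid = engle_lm_test(rng.standard_normal(1000), q=1)  # small: no ARCH effect
```

The uncorrelated-but-dependent behavior noted above is exactly what this test picks up: the levels of the ARCH series are serially uncorrelated, yet its squares are not.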

Several authors have developed portmanteau test statistics employing the ACF of the squared residuals to detect nonlinear structures and ARCH effects in time series models (see, for example, Fisher and Gallagher 2012; McLeod and Li 1983; Peña and Rodríguez 2002, 2006; Rodríguez and Ruiz 2005). However, all of these test statistics were derived under the assumptions of ARMA models and were not designed for nonlinear time series models.

A portmanteau test was developed by Li and Mak (1994) to check the adequacy of nonlinear time series models, including ARMA-ARCH and other conditional heteroscedastic structures. Under a general class of time series models, Wong and Ling (2005) considered two mixed portmanteau tests to detect linear and nonlinear dependency: the first sums the statistics derived by Box and Pierce (1970) and Li and Mak (1994), and the second sums the statistics proposed by Ljung and Box (1978) and McLeod and Li (1983). Wong and Ling (2005) showed that their mixed tests are, in many situations, more powerful than the tests proposed by Ljung and Box (1978) and McLeod and Li (1983) when the fitted model has a disparity in its first and second moments. Zhu (2013) proposed another mixed portmanteau test for ARMA-GARCH models with parameters estimated by a quasi-maximum exponential likelihood estimator. Li et al. (2018) proposed a first-order zero-drift GARCH (ZD-GARCH(1, 1)) model to study conditional heteroscedasticity and heteroscedasticity together, for which the authors constructed a portmanteau test for model checking. Their test statistic was derived from the lag-k autocorrelation function of the sth power of the absolute residuals, where k is a positive integer and \(s>0\).

The test statistics presented by Wong and Ling (2005), Zhu (2013), and Li et al. (2018) do not consider the cross-correlation between the residuals at different powers. The idea of using the cross-correlation between the residuals at different powers to test for linearity was considered by Welsh and Jernigan (1983), Lawrance and Lewis (1985, 1987), and Psaradakis and Vávra (2019).

In this article, we propose four mixed portmanteau statistics for time series models. The proposed test statistics are composed of three components. The first utilizes the autocorrelations of the residuals and is designed to capture linear dependency in the mean part of time series models. The second utilizes the autocorrelations of the squared residuals and can be used to test for conditional heteroscedastic effects. The third is related to the cross-correlations between the residuals and their squared values, which may be helpful for detecting other types of nonlinear models in which the residuals and their squared values are cross-correlated. These cross-correlations allow us to propose two different tests: one based on the positive lags and the other on the negative lags. Therefore, the tests proposed in the present study combine the statistics presented in Wong and Ling (2005) and Psaradakis and Vávra (2019).

The remainder of this article is organized as follows. Section 2 defines some popular time series models with their assumptions. In Sect. 3, we propose new auto-and-cross-correlated test statistics for checking the adequacy of fitted time series models and derive their asymptotic distributions. In Sect. 4, a Monte Carlo simulation study is conducted to compare the performance of the proposed statistics with some tests commonly used in the literature. We show that the empirical size of the proposed tests successfully controls the type I error probability and that the tests tend to have higher power than other tests in many cases. Section 5 presents illustrative applications to demonstrate the usefulness of the proposed tests for real-world data. We finish this article in Sect. 6 by providing concluding remarks.

2 The general time series model and its assumptions

Assume that \(\{z_t\text{: } t=0,\pm 1,\cdots \}\) is a time series that is generated by the strictly stationary and ergodic model defined by

$$\begin{aligned} z_t=\mu _{t}(\varvec{\theta },\mathcal {F}_{t-1}) +\varepsilon _{t}, \qquad \varepsilon _{t}=\xi _t\sqrt{h_{t}(\varvec{\theta })} \end{aligned}$$
(2.1)

where \(\mathcal {F}_{t-1}\) represents the information set (\(\sigma \)-algebra) generated by \(\{z_t,z_{t-1},\cdots \}\), and \(\varvec{\theta }\) denotes the \(l\times 1\) vector of unknown parameters, whose true value is \(\varvec{\theta }_0\). \(\mu _{t}(\varvec{\theta })=\mu _{t}(\varvec{\theta },\mathcal {F}_{t-1})=\textrm{E}(z_t|\mathcal {F}_{t-1})\) and \(h_{t}(\varvec{\theta })=\textrm{Var}(\varepsilon _t|\mathcal {F}_{t-1})>0\) are the conditional mean and conditional variance of \(z_t\), respectively. Both are assumed to have continuous second-order derivatives almost surely (a.s.). The process \(\{\xi _t\}\) is a sequence of independent and identically distributed (i.i.d.) random variables with mean zero, variance one, and finite fourth moment.

The usual ARMA-GARCH model can be seen as a special case of this model that can be written as

$$\begin{aligned}&z_t = \sum _{i =1}^{p}\phi _i z_{t-i} + \sum _{i =1}^{q}\theta _i \varepsilon _{t-i} + \varepsilon _{t},\nonumber \\&\varepsilon _t =\xi _{t}\sqrt{h_{t}(\varvec{\theta })}, \nonumber \\&h_t(\varvec{\theta })=\omega +\sum _{i=1}^{a}\alpha _i\varepsilon _{t-i}^2(\varvec{\theta })+\sum _{j=1}^{b}\beta _j h_{t-j}(\varvec{\theta }), \end{aligned}$$
(2.2)

where \(\{\xi _t\}\) is a sequence of i.i.d. random variables with mean zero, variance one, and \(\textrm{E}(\xi _t^4)<\infty \), with \(\omega >0\), \(\alpha _i\ge 0\), \(\beta _j\ge 0\), for \(i\in \{1,\cdots ,a\}, j\in \{1,\cdots ,b\}\), and \(\sum _{i=1}^{a}\alpha _i+\sum _{j=1}^{b}\beta _j<1\).
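A short simulation of (2.2) makes the structure concrete. The following Python sketch uses illustrative ARMA(1,1)-GARCH(1,1) parameter values of our own choosing that satisfy the stated constraints:

```python
import numpy as np

def simulate_arma_garch(n, phi=0.5, theta=0.3, omega=0.1, alpha=0.3, beta=0.5,
                        burn=200, seed=0):
    """Simulate an ARMA(1,1)-GARCH(1,1) path following (2.2); the constraints
    omega > 0, alpha, beta >= 0, and alpha + beta < 1 hold for these values."""
    rng = np.random.default_rng(seed)
    total = n + burn
    z = np.zeros(total)
    eps = np.zeros(total)
    h = np.full(total, omega / (1 - alpha - beta))  # start at unconditional variance
    for t in range(1, total):
        h[t] = omega + alpha * eps[t - 1] ** 2 + beta * h[t - 1]
        eps[t] = rng.standard_normal() * np.sqrt(h[t])
        z[t] = phi * z[t - 1] + theta * eps[t - 1] + eps[t]
    return z[burn:], eps[burn:], h[burn:]

z, eps, h = simulate_arma_garch(1000)
```

Discarding a burn-in segment lets the simulated path forget its arbitrary initial values, in keeping with the strict stationarity assumed for the model.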

Ignoring the constant term, the Gaussian log-likelihood function of \(\{z_1,\cdots ,z_n\}\) given the initial values \(\{z_t \text{: } t\in \mathbb {Z}^{-}\cup \{0\}\}\) can be written as

$$\begin{aligned} \ell (\varvec{\theta })=\sum _{t=1}^{n}\ell _{t}(\varvec{\theta },z^\star ), \end{aligned}$$
(2.3)

where

$$\begin{aligned} \ell _{t}(\varvec{\theta },z^\star )=-\frac{1}{2}\log {(h_t(\varvec{\theta }))}-\frac{\varepsilon _{t}^2(\varvec{\theta })}{2h_t(\varvec{\theta })},\quad t \in \{1,\cdots ,n\}, \end{aligned}$$

where \(z^\star \equiv \{z_t,z_{t-1},\cdots \}\). Assuming that the parameter space is \(\Theta \) and that \(\varvec{\theta }_0\) is an interior point of \(\Theta \), for convenience we denote \(\varepsilon _{t}=\varepsilon _{t}(\varvec{\theta }_0)\), \(\mu _{t}=\mu _{t}(\varvec{\theta }_0)\), and \(h_t=h_t(\varvec{\theta }_0)\). The first derivative of the log-likelihood function is given by

$$\begin{aligned} \frac{\partial \ell (\varvec{\theta }_0)}{\partial \varvec{\theta }}=\frac{1}{2}\sum _{t=1}^{n}\frac{1}{h_{t}}\frac{\partial h_{t}}{\partial \varvec{\theta }}\left( \frac{\varepsilon _{t}^2}{h_{t}}-1\right) +\sum _{t=1}^{n} \frac{\varepsilon _{t}}{h_{t}}\frac{\partial \mu _{t}}{\partial \varvec{\theta }}. \end{aligned}$$

By taking conditional expectations of the second derivatives with respect to \(\mathcal {F}_{t-1}\), we have

$$\begin{aligned} \textrm{E}\left[ \frac{\partial ^2 \ell (\varvec{\theta }_0)}{\partial \varvec{\theta }\partial \varvec{\theta }^\top }\right]{} & {} =-\frac{1}{2}\sum _{t=1}^{n} \textrm{E}\left( \frac{1}{h_{t}^2}\left( \frac{\partial h_{t}}{\partial \varvec{\theta }}\right) \left( \frac{\partial h_{t}}{\partial \varvec{\theta }}\right) ^\top \right) \\{} & {} \quad -\sum _{t=1}^{n} \textrm{E}\left( \frac{1}{h_{t}}\left( \frac{\partial \mu _{t}}{\partial \varvec{\theta }}\right) \left( \frac{\partial \mu _{t}}{\partial \varvec{\theta }}\right) ^\top \right) . \end{aligned}$$

Assume that \(\partial \ell (\varvec{\theta }_0)/\partial \varvec{\theta }\) is a martingale difference with respect to \(\mathcal {F}_{t-1}\) and let \(\widehat{\varvec{\theta }}_n\) be the quasi-maximum likelihood estimator of \(\varvec{\theta }_0\), which is strongly consistent, that is, \(\widehat{\varvec{\theta }}_n\overset{\mathrm {a.s.}}{\rightarrow }{\varvec{\theta }_0}\), where \(\overset{\mathrm {a.s.}}{\rightarrow }\) denotes a.s. convergence. Then, it follows that

$$\begin{aligned} \sqrt{n}(\widehat{\varvec{\theta }}_n-\varvec{\theta }_0)=-\frac{1}{\sqrt{n}} \varvec{\Sigma }^{-1}\frac{\partial \ell (\varvec{\theta }_0)}{\partial \varvec{\theta }} + o_p(1), \end{aligned}$$
(2.4)

where \(\varvec{\Sigma }^{-1}=\textrm{E}(-\partial ^2 \ell (\varvec{\theta }_0)/\partial \varvec{\theta }\partial \varvec{\theta }^\top )^{-1}\) and \(o_p(1)\rightarrow 0\) in probability as \(n\rightarrow \infty \). Furthermore, it has been shown that \(\sqrt{n}(\widehat{\varvec{\theta }}_n-\varvec{\theta }_0)\) is asymptotically normal with \(l\times 1\) zero mean vector and \(l\times l\) variance-covariance matrix \(\varvec{\Sigma }^{-1}\) (see Hall and Heyde 1980; Higgins and Bera 1992; Ling and McAleer 2010).
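For concreteness, the likelihood in (2.3) can be evaluated recursively. The Python sketch below does this for a pure GARCH(1,1) error series; initializing \(h_0\) at the sample variance and all parameter values are illustrative choices of ours:

```python
import numpy as np

def garch11_loglik(theta, eps):
    """Gaussian log-likelihood (2.3), constants dropped, for a pure GARCH(1,1)
    error series: sum_t [ -0.5*log(h_t) - eps_t^2 / (2*h_t) ].
    theta = (omega, alpha, beta); initializing h_0 at the sample variance is
    one common convention, not the only one."""
    omega, alpha, beta = theta
    h = np.empty(len(eps))
    h[0] = np.var(eps)
    for t in range(1, len(eps)):
        h[t] = omega + alpha * eps[t - 1] ** 2 + beta * h[t - 1]
    return np.sum(-0.5 * np.log(h) - eps**2 / (2 * h))

# Made-up GARCH(1,1) data; the true parameter vector attains a higher
# likelihood than a clearly misspecified constant-variance one.
rng = np.random.default_rng(2)
eps = np.zeros(800)
h_t = 0.5
for t in range(1, 800):
    h_t = 0.1 + 0.3 * eps[t - 1] ** 2 + 0.5 * h_t
    eps[t] = rng.standard_normal() * np.sqrt(h_t)
ll_true = garch11_loglik((0.1, 0.3, 0.5), eps)
ll_const = garch11_loglik((0.1, 0.0, 0.0), eps)
```

Maximizing this function over \(\varvec{\theta }\) (e.g., by a numerical optimizer) yields the quasi-maximum likelihood estimator whose asymptotic behavior is described above.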

3 The proposed test statistics

Let k be the lag of the series with \(k\in \{0, \pm 1, \pm 2,\cdots , \pm m\}\), where m is the largest value considered for the auto-and-cross-correlations and define

$$\begin{aligned} \rho _{(r, s)}(\varvec{\theta }_0,k)= & {} \frac{{\text {Cov}}\left( \varepsilon _{t}^{r}(\varvec{\theta }_0), \varepsilon _{t-k}^{s}(\varvec{\theta }_0)\right) }{\left\{ {\text {Var}} \left( \varepsilon _{t}^{r}(\varvec{\theta }_0)\right) {\text {Var}}\left( \varepsilon _{t}^{s}(\varvec{\theta }_0)\right) \right\} ^{1/ 2}}\\= & {} \frac{\gamma _{(r,s)}(k)}{\sqrt{\gamma _{(r,r)}(0)}\sqrt{\gamma _{(s,s)}(0)}}\quad (r, s=1,2) \end{aligned}$$

as the lag-k theoretical auto-and-cross-correlation of the error process \(\left\{ \varepsilon _{t}(\varvec{\theta }_0)\right\} \), where \(\varvec{\theta }_0\) is the true but unknown parameter vector. Let

$$\begin{aligned} \varvec{\rho }(\varvec{\theta }_0,k)=\left[ \rho _{(r, r)}(\varvec{\theta }_0,k), \rho _{(s, s)}(\varvec{\theta }_0,k), \rho _{(r, s)}(\varvec{\theta }_0,k)\right] ^\top , \end{aligned}$$

and

$$\begin{aligned} \textbf{R}({\varvec{\theta }_0})=\left[ \textbf{R}_{(r, r)}^\top ({\varvec{\theta }_0}), \textbf{R}_{(s, s)}^\top ({\varvec{\theta }_0}), \textbf{R}_{(r, s)}^\top ({\varvec{\theta }_0})\right] _{3 m \times 1}^\top \end{aligned}$$

with

$$\begin{aligned} \textbf{R}_{(r, s)}({\varvec{\theta }_0})=\left[ \rho _{(r, s)}({\varvec{\theta }_0},1), \rho _{(r, s)}({\varvec{\theta }_0},2), \ldots , \rho _{(r, s)}({\varvec{\theta }_0},m)\right] ^\top . \end{aligned}$$

We derive the asymptotic distribution of the proposed test statistics under the null hypothesis that the time series model in (2.1) takes the correct functional forms, given by \(\mathbb {H}_0:\mu _t = \mu _t(\varvec{\theta }_0)\text { and }h_t = h_t(\varvec{\theta }_0)\). The alternative hypothesis is \(\mathbb {H}_a:\mu _t \ne \mu _t(\varvec{\theta }_0)\text { or }h_t \ne h_t(\varvec{\theta }_0)\). Equivalently, the null and alternative hypotheses can be stated in terms of the lag residual auto-and-cross-correlations, so that \(\mathbb {H}_0: \textbf{R}_{(r, s)}({\varvec{\theta }_0})=\varvec{0}_m\) and \(\mathbb {H}_a: \textbf{R}_{(r, s)}({\varvec{\theta }_0}) \ne \varvec{0}_m\), for all \(r, s\in \{1,2\}\). For simplicity, we drop the symbol \(\varvec{\theta }_0\), so that \(\rho _{(r, s)}({\varvec{\theta }_0},k)=\rho _{(r, s)}(k)\) and \(\textbf{R}_{(r, s)}({\varvec{\theta }_0}) = \textbf{R}_{(r, s)}\).

Given an observed time series \(z_1,z_2,\cdots ,z_n\) of length n, under the assumptions of \(\mathbb {H}_0\) and (2.4), we fit the model defined in (2.1). Subsequently, we calculate the standardized residuals (conditional residuals) raised to the powers \(i\in \{1,2\}\) using the following expressions:

$$\begin{aligned} {\widehat{e}}_{t}^{i}={\widehat{\varepsilon }}_{t}^{i} {\widehat{h}}_{t}^{-i/2}, \end{aligned}$$

where \(\{{\widehat{\varepsilon }}_{t}\}, \{{\widehat{\varepsilon }}_{t}^2\}, \big \{\sqrt{{\widehat{h}}_{t}}\big \}\), and \(\{{\widehat{h}}_{t}\}\) denote the sample residuals, squared-residuals, conditional volatility, and conditional variance of \(z_t\), respectively.

The corresponding sample correlation coefficient between the standardized residuals may be written as

$$\begin{aligned} {\widehat{r}}_{(r,s)}(k)=\frac{{\widehat{\gamma }}_{(r,s)}(k)}{\sqrt{{\widehat{\gamma }}_{(r,r)}(0)} \sqrt{{\widehat{\gamma }}_{(s,s)}(0)}}, \end{aligned}$$
(3.1)

where \({{\widehat{\gamma }}_{(r,s)}(k)=n^{-1}\sum _{t=k+1}^{n}({\widehat{e}}_t^r -{\widetilde{e}}^r)({\widehat{e}}_{t-k}^s-{\widetilde{e}}^s)}\), for \(k\ge 0\), and \({\widehat{\gamma }}_{(r,s)}(-k)={\widehat{\gamma }}_{(s,r)}(k)\), for \(k>0\), is the lag-k autocovariance (cross-covariance) between the standardized residuals to the rth power and the standardized residuals to the sth power, and \({\widetilde{e}}^i=n^{-1}\sum _{t=1}^{n}{\widehat{e}}_t^i\), for \(i\in \{1,2\}\).
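The estimators in (3.1) are straightforward to compute. A Python sketch, applied to hypothetical standardized residuals, with the symmetry \({\widehat{\gamma }}_{(r,s)}(-k)={\widehat{\gamma }}_{(s,r)}(k)\) used for negative lags:

```python
import numpy as np

def gamma_hat(e_hat, r, s, k):
    """Sample (cross-)covariance gamma_{(r,s)}(k) from (3.1), computed on the
    mean-corrected r-th and s-th powers of the standardized residuals."""
    if k < 0:
        return gamma_hat(e_hat, s, r, -k)  # symmetry for negative lags
    n = len(e_hat)
    er = e_hat**r - np.mean(e_hat**r)
    es = e_hat**s - np.mean(e_hat**s)
    return np.sum(er[k:] * es[:n - k]) / n

def r_hat(e_hat, r, s, k):
    """Sample correlation r_{(r,s)}(k) from (3.1)."""
    return gamma_hat(e_hat, r, s, k) / np.sqrt(
        gamma_hat(e_hat, r, r, 0) * gamma_hat(e_hat, s, s, 0))

rng = np.random.default_rng(3)
e = rng.standard_normal(500)      # stand-in for the standardized residuals
r11 = r_hat(e, 1, 1, 1)           # residual autocorrelation
r22 = r_hat(e, 2, 2, 1)           # squared-residual autocorrelation
r12 = r_hat(e, 1, 2, 1)           # cross-correlation at a positive lag
r21 = r_hat(e, 1, 2, -1)          # negative lag, equals r_{(2,1)}(1)
```

The positive-lag and negative-lag cross-correlations are the two ingredients that distinguish the proposed tests from the purely auto-correlation-based ones.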

Under the regular assumptions, it can be shown that \({\widetilde{e}}{=}o_p(1)\), \({\widetilde{e}}^2{=}1+o_p(1)\), and \(n^{-1}\sum _{t=1}^{n}({\widehat{e}}_{t}^2{-}{\widetilde{e}}^2)^2{=}\sigma ^2+o_p(1)\), where \(\sigma ^2\) converges to two (see Li and Mak 1994; Ling and McAleer 2003; Wong and Ling 2005). Hence, at lag-k, define \(\varvec{\Gamma }=(\varvec{\Gamma }_{(r,r)},\varvec{\Gamma }_{(s,s)},\varvec{\Gamma }_{(r,s)})_{3\,m\times 1}^\top \), with \(\varvec{\Gamma }_{(r,s)}=(\gamma _{(r,s)}(1),\cdots ,\gamma _{(r,s)}(m))^\top \), as the counterpart of \(\widehat{\varvec{\Gamma }}=(\widehat{\varvec{\Gamma }}_{(r,r)},\widehat{\varvec{\Gamma }}_{(s,s)},\widehat{\varvec{\Gamma }}_{(r,s)})_{3\,m\times 1}^\top \), with \(\widehat{\varvec{\Gamma }}_{(r,s)}=({\widehat{\gamma }}_{(r,s)}(1),\cdots ,{\widehat{\gamma }}_{(r,s)} (m))^\top \), obtained by replacing the fitted residual \({\widehat{\varepsilon }}_t\) and conditional variance \({\widehat{h}}_t\) with \(\varepsilon _t\) and \(h_t\), respectively. We then obtain:

$$\begin{aligned} {\widehat{\gamma }}_{(1,1)}(k)= & {} \frac{1}{n}\sum _{t=k+1}^{n}\frac{{\widehat{\varepsilon }}_{t}}{\sqrt{{\widehat{h}}_{t}}} \frac{{\widehat{\varepsilon }}_{t-k}}{\sqrt{{\widehat{h}}_{t-k}}},\quad \\ {\widehat{r}}_{(1,1)}(k)= & {} \frac{1}{n}\sum _{t=k+1}^{n}\frac{{\widehat{\varepsilon }}_{t}}{\sqrt{{\widehat{h}}_{t}}} \frac{{\widehat{\varepsilon }}_{t-k}}{\sqrt{{\widehat{h}}_{t-k}}},\\ {\widehat{\gamma }}_{(2,2)}(k)= & {} \frac{1}{n}\sum _{t=k+1}^{n}\left( \frac{{\widehat{\varepsilon }}_{t}^2}{{\widehat{h}}_{t}}-1\right) \left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) ,\quad \\ {\widehat{r}}_{(2,2)}(k)= & {} \frac{1}{n\sigma ^2}\sum _{t=k+1}^{n}\left( \frac{{\widehat{\varepsilon }}_{t}^2}{{\widehat{h}}_{t}}-1\right) \left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) ,\\ {\widehat{\gamma }}_{(1,2)}(k)= & {} \frac{1}{n}\sum _{t=k+1}^{n}\frac{{\widehat{\varepsilon }}_{t}}{\sqrt{{\widehat{h}}_{t}}} \left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) ,\quad \\ {\widehat{r}}_{(1,2)}(k)= & {} \frac{1}{n\sigma }\sum _{t=k+1}^{n}\frac{{\widehat{\varepsilon }}_{t}}{\sqrt{{\widehat{h}}_{t}}} \left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) ,\\ {\widehat{\gamma }}_{(2,1)}(k)= & {} \frac{1}{n}\sum _{t=k+1}^{n}\left( \frac{{\widehat{\varepsilon }}_{t}^2}{{\widehat{h}}_{t}}-1\right) \frac{{\widehat{\varepsilon }}_{t-k}}{\sqrt{{\widehat{h}}_{t-k}}},\quad \\ {\widehat{r}}_{(2,1)}(k)= & {} \frac{1}{n\sigma }\sum _{t=k+1}^{n}\left( \frac{{\widehat{\varepsilon }}_{t}^2}{{\widehat{h}}_{t}}-1\right) \frac{{\widehat{\varepsilon }}_{t-k}}{\sqrt{{\widehat{h}}_{t-k}}}. \end{aligned}$$

We employ these autocorrelation coefficients to propose new portmanteau goodness-of-fit tests, as later defined in (3.8), to check for linear and nonlinear dependencies within the residual series.

Theorem 1

Suppose that the model defined in (2.1) is correctly specified and that (2.4) holds. Then, we have that

$$\begin{aligned} \sqrt{n}({\widehat{\varvec{R}}_{(1,1)}}^\top ,{\widehat{\varvec{R}}_{(2,2)}}^\top ,{\widehat{\varvec{R}}_{(r,s)}}^\top )^\top \overset{\textrm{D}}{\rightarrow }\mathcal {N}_{3m}(\varvec{0},\varvec{\Omega }_{rs})\quad \text {as }n\rightarrow \infty , \end{aligned}$$

where

$$\begin{aligned}{} & {} \widehat{\varvec{R}}_{(i,j)}=({\widehat{r}}_{(i,j)}(1),{\widehat{r}}_{(i,j)}(2),\cdots ,{\widehat{r}}_{(i,j)}(m))^\top ,\nonumber \\{} & {} (i,j)\in \{(1,1),(2,2),(1,2),(2,1)\}, \end{aligned}$$
(3.2)

\(\overset{\textrm{D}}{\rightarrow }\) denotes convergence in distribution and \(\varvec{\Omega }_{rs} = \textrm{E}\big [\varvec{R}(\varvec{\theta }_0) \varvec{R}^\top (\varvec{\theta }_0)\big ]\) is the covariance matrix, which can be replaced by a consistent estimator \(\widehat{\varvec{\Omega }}_{rs}\):

$$\begin{aligned} \widehat{\varvec{\Omega }}_{rs}= \left( \begin{array}{ccc} \varvec{I}_{m}-\varvec{X}_{11}\varvec{\Sigma }^{-1}\varvec{X}_{11}^\top &{} \varvec{0}&{} \varvec{0}\\ \varvec{0}&{} \varvec{I}_{m}-\frac{1}{4}\varvec{X}_{22}\varvec{\Sigma }^{-1}\varvec{X}_{22}^\top &{} \varvec{0}\\ \varvec{0}&{} \varvec{0}&{}\varvec{I}_{m}-\frac{1}{2}\varvec{X}_{rs}\varvec{\Sigma }^{-1}\varvec{X}_{rs}^\top \\ \end{array} \right) , \nonumber \\ \end{aligned}$$
(3.3)

with \(r\ne s\in \{1,2\}\), \(\varvec{I}_{m}\) denotes the identity \(m\times m\) matrix,

$$\begin{aligned}{} & {} \varvec{X}_{11}(k)=\frac{1}{n}\sum _{t=k+1}^{n}\frac{1}{\sqrt{{\widehat{h}}_{t}}}\frac{\partial \mu _{t}}{\partial \varvec{\theta }^\top } \frac{{\widehat{\varepsilon }}_{t-k}}{\sqrt{{\widehat{h}}_{t-k}}}, \end{aligned}$$
(3.4)
$$\begin{aligned}{} & {} \varvec{X}_{22}(k)=\frac{1}{n}\sum _{t=k+1}^{n}{\widehat{h}}_{t}^{-1}\frac{\partial h_{t}}{\partial \varvec{\theta }^\top }\left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) , \end{aligned}$$
(3.5)
$$\begin{aligned}{} & {} \varvec{X}_{12}(k)=\frac{1}{n}\sum _{t=k+1}^{n}\frac{1}{\sqrt{{\widehat{h}}_{t}}} \frac{\partial \mu _{t}}{\partial \varvec{\theta }^\top } \left( \frac{{\widehat{\varepsilon }}_{t-k}^2}{{\widehat{h}}_{t-k}}-1\right) , \end{aligned}$$
(3.6)

and

$$\begin{aligned} \varvec{X}_{21}(k)=\frac{1}{n}\sum _{t=k+1}^{n}\frac{1}{\sqrt{{\widehat{h}}_{t-k}}} \frac{\partial \mu _{t-k}}{\partial \varvec{\theta }^\top } \left( \frac{{\widehat{\varepsilon }}_{t}^2}{{\widehat{h}}_{t}}-1\right) . \end{aligned}$$
(3.7)

Proof

The proof is given in Appendix 1. \(\square \)

Based on the results of Theorem 1, we propose the portmanteau statistic \({\dot{C}}_{rs}\) and its modified version \(C_{rs}\) to test \(\mathbb {H}_0\), that is, that the model stated in (2.1) is correctly specified. Thus, we have that

$$\begin{aligned} {\dot{C}}_{rs}= & {} n\left( \begin{array}{c} \widehat{\varvec{R}}_{(1,1)} \\ \widehat{\varvec{R}}_{(2,2)}\\ \widehat{\varvec{R}}_{(r,s)} \end{array} \right) ^{\top }\widehat{\varvec{\Omega }}_{rs}^{-1} \left( \begin{array}{c} \widehat{\varvec{R}}_{(1,1)} \\ \widehat{\varvec{R}}_{(2,2)}\\ \widehat{\varvec{R}}_{(r,s)} \end{array} \right) ,\quad \nonumber \\ C_{rs}= & {} n\left( \begin{array}{c} \widetilde{\varvec{R}}_{(1,1)} \\ \widetilde{\varvec{R}}_{(2,2)}\\ \widetilde{\varvec{R}}_{(r,s)} \end{array} \right) ^{\top }\widehat{\varvec{\Omega }}_{rs}^{-1} \left( \begin{array}{c} \widetilde{\varvec{R}}_{(1,1)} \\ \widetilde{\varvec{R}}_{(2,2)}\\ \widetilde{\varvec{R}}_{(r,s)} \end{array} \right) , \end{aligned}$$
(3.8)

where \(\widehat{\varvec{R}}_{(1,1)},\widehat{\varvec{R}}_{(2,2)},\widehat{\varvec{R}}_{(r,s)}\) are defined in (3.2), and \(\widetilde{\varvec{R}}_{(1,1)}, \widetilde{\varvec{R}}_{(2,2)},\widetilde{\varvec{R}}_{(r,s)}\) are obtained after replacing the autocorrelation coefficients in (3.1) by their standardized values formulated as

$$\begin{aligned} \displaystyle {{\widetilde{r}}_{(r,s)}(k)=\sqrt{\frac{n+2}{n-k}}{\widehat{r}}_{(r,s)}(k)}, \quad k\in \{1,\cdots , m\}. \end{aligned}$$
(3.9)

From the theorem on quadratic forms given in Box (1954), it is straightforward to show that \({\dot{C}}_{rs}\) and \(C_{rs}\) are asymptotically chi-square distributed with \(3m-(p+q+1)\) degrees of freedom.
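Given the stacked correlation vectors and a consistent estimate of \(\varvec{\Omega }_{rs}\), assembling the quadratic form in (3.8) is direct. A Python sketch with made-up toy inputs (a real application would plug in the quantities from (3.2) and (3.3)):

```python
import numpy as np

def mixed_portmanteau(R11, R22, Rrs, omega_hat, n, n_params):
    """Quadratic-form statistic of (3.8): C = n * R^T Omega_hat^{-1} R with the
    stacked 3m-vector R = (R11, R22, Rrs); the reference distribution is
    chi-square with 3m - n_params degrees of freedom (n_params = p + q + 1)."""
    R = np.concatenate([R11, R22, Rrs])
    stat = float(n * (R @ np.linalg.solve(omega_hat, R)))
    return stat, len(R) - n_params

# Toy inputs: m = 5 lags, identity Omega_hat (i.e., ignoring the estimation
# effect), and synthetic correlation vectors -- not the output of a real fit.
m, n = 5, 300
rng = np.random.default_rng(4)
R11, R22, R12 = (rng.standard_normal(m) / np.sqrt(n) for _ in range(3))
stat, df = mixed_portmanteau(R11, R22, R12, np.eye(3 * m), n, n_params=2)
```

With an ARMA(1,1) mean model, \(p+q+1=3\) parameters would be subtracted from the \(3m\) degrees of freedom; the toy call above uses \(n\_params=2\) purely for illustration.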

Figure 1 illustrates the accuracy of the chi-square approximation to the empirical distribution of \({\dot{C}}_{rs}\) and \(C_{rs}\), for \(r\ne s\in \{1,2\}\), using \(10^3\) replicates, when an ARMA(1,1) model is fitted to a sample of size \(n=200\) generated from the ARMA(1,1) process defined as

$$\begin{aligned} z_t=0.9 z_{t-1}+\varepsilon _{t}-0.88\varepsilon _{t-1}. \end{aligned}$$
(3.10)

The parameters of the ARMA model stated in (3.10) are selected to be very close to the non-stationary and non-invertible case, with the MA coefficient nearly cancelling the AR coefficient, to demonstrate the usefulness of the proposed tests even in such extreme cases.

We found similar results for small and large samples and our preliminary analysis indicates that the portmanteau tests based on the statistics \(C_{rs}\) control the type I error probability more successfully than the tests that consider the statistics \({\dot{C}}_{rs}\). Hence, we recommend the use of \(C_{rs}\).

Fig. 1

The solid (dark-gray) line is the chi-square limiting distribution with degrees of freedom \(3m-2\). The dot (green), dot (dark-green), longdash (brown), and longdash (red) lines are the Monte Carlo distributions of \({\dot{C}}_{12},{\dot{C}}_{21}, C_{12}\), and \(C_{21}\), respectively, generated by 1000 replicates of a series of length \(n=200\) according to a Gaussian ARMA(1,1) model with \(\phi _1=0.9\) and \(\theta _1=-0.88\). The tests are evaluated at each indicated lag \(m\in \{3,7,11,14\}\)

Remark 1

As mentioned, the proposed test statistics in (3.8) can be seen as combinations of the statistics presented by Wong and Ling (2005) and Psaradakis and Vávra (2019). Thus, each test statistic \(C_{rs}\) may be seen as a linear combination of three existing test statistics proposed by Ljung and Box (1978), McLeod and Li (1983), and Psaradakis and Vávra (2019), whereas each corresponding statistic \({\dot{C}}_{rs}\) is a linear combination of the three statistics given by Box and Pierce (1970), Li and Mak (1994), and Psaradakis and Vávra (2019).

4 Simulation studies

We carry out Monte Carlo simulations to examine statistical properties of the proposed tests. For comparative purposes, we also consider three test statistics given by

$$\begin{aligned}{} & {} Q_{rs}=n(n+2)\sum _{k=1}^{m}(n-k)^{-1}{\widehat{r}}_{(r,s)}^2(k),\nonumber \\{} & {} (r,s)\in \{(1,2),(2,1),(2,2)\}, \end{aligned}$$
(4.1)

where \(Q_{12}\) and \(Q_{21}\) represent the statistics presented in Psaradakis and Vávra (2019) and \(Q_{22}\) denotes the statistic proposed by McLeod and Li (1983). In addition, we consider two statistics introduced by Li and Mak (1994) and Wong and Ling (2005), which are denoted by \(Q_{\text {LM}}\) and \(Q_{\text {WL}}\), respectively, given by

$$\begin{aligned} Q_{\text {LM}}= & {} n\sum _{k=1}^{m}{\widehat{r}}_{(2,2)}^2(k), \end{aligned}$$
(4.2)
$$\begin{aligned} Q_{\text {WL}}= & {} n\left( \begin{array}{c} \widehat{\varvec{R}}_{(1,1)} \\ \widehat{\varvec{R}}_{(2,2)} \end{array} \right) ^{\top } \left( \begin{array}{cc} \varvec{I}_{m}&{} \varvec{0} \\ \varvec{0} &{} \varvec{I}_{m}-\frac{1}{4}\varvec{X}_{22}\varvec{\Sigma }^{-1}\varvec{X}_{22}^\top \end{array} \right) ^{-1}\nonumber \\{} & {} \times \left( \begin{array}{c} \widehat{\varvec{R}}_{(1,1)} \\ \widehat{\varvec{R}}_{(2,2)} \end{array} \right) . \end{aligned}$$
(4.3)
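Both \(Q_{rs}\) and \(Q_{\text {LM}}\) reduce to weighted sums of squared correlations and can be sketched in a few lines (Python; the correlation values below are made up for illustration):

```python
import numpy as np

def q_stat(r_vals, n, weighted=True):
    """Portmanteau statistic from lag-1..m correlations r_vals:
    weighted=True gives the Ljung-Box-type form n(n+2) sum_k r_k^2/(n-k),
    as in (4.1); weighted=False gives the Li-Mak-type form n sum_k r_k^2,
    as in (4.2)."""
    r_vals = np.asarray(r_vals, dtype=float)
    m = len(r_vals)
    if weighted:
        return n * (n + 2) * np.sum(r_vals**2 / (n - np.arange(1, m + 1)))
    return n * np.sum(r_vals**2)

r_example = np.array([0.10, -0.05, 0.02])        # made-up correlations, m = 3
Q_rs = q_stat(r_example, n=200)                  # form (4.1)
Q_LM = q_stat(r_example, n=200, weighted=False)  # form (4.2)
```

Since \((n+2)/(n-k)>1\) for every \(k\ge 1\), the weighted form always exceeds the unweighted one for the same correlations; the weighting improves the finite-sample chi-square approximation.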

First, we examine six statistics, namely \(C_{12}\), \(C_{21}\), \(Q_{12}\), \(Q_{21}\), \(Q_{22}\), and \(Q_{\text {WL}}\), assuming the following five linear models studied by Psaradakis and Vávra (2019):

A1.:

AR(1) model: \(z_t=-0.9z_{t-1}+\varepsilon _t\);

A2.:

AR(2) model: \(z_t=0.6z_{t-1}-0.5z_{t-2}+\varepsilon _t\);

A3.:

MA(1) model: \(z_t=0.8\varepsilon _{t-1}+\varepsilon _t\);

A4.:

ARMA(2,1) model: \(z_t=0.8z_{t-1} +0.15z_{t-2}+0.3\varepsilon _{t-1}+\varepsilon _t\);

A5.:

ARMA(1,1) model: \(z_t=0.6z_{t-1} +0.4\varepsilon _{t-1}+\varepsilon _t\).

For model A1, the parameter used by Psaradakis and Vávra (2019) was \(\phi =0.8\). However, we consider here a negative value of the parameter close to the non-stationary case to assess the behavior of the test statistics for both positive and negative parameter values. For this model, we also analyzed the cases where \(\phi \in \{\pm 0.1, \pm 0.5, \pm 0.8\}\), whose results showed very minor changes in the behavior of the proposed tests. For model A3, we also explored the cases where \({\theta }\in \{\pm 0.3, \pm 0.6, \pm 0.9\}\) and obtained good results.

Second, we investigate four statistics, namely \(C_{12}\), \(C_{21}\), \(Q_{\text {WL}}\), and \(Q_{\text {LM}}\), according to the following three nonlinear models studied by Velasco and Wang (2015) and Han and Ling (2017):

A6.:

GARCH(1,1) model: \(\varepsilon _t=\xi _t\sqrt{h_t}, \, h_t=0.1+0.3\varepsilon _{t-1}^2+0.5 h_{t-1}\);

A7.:

AR(1)-ARCH(1) model: \(z_t=0.5z_{t-1}+\varepsilon _t, \,\varepsilon _t=\xi _t\sqrt{h_t}, \, h_t=0.1+0.4\varepsilon _{t-1}^2\);

A8.:

AR(1)-GARCH(1,1) model: \(z_t=0.5z_{t-1}+\varepsilon _t, \, \varepsilon _t=\xi _t\sqrt{h_t}, h_t=0.1+0.3\varepsilon _{t-1}^2+0.5 h_{t-1}\);

where \(\xi _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\) or follows a Student-t distribution with 10 degrees of freedom.
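For instance, model A7 can be simulated directly (a Python sketch with Gaussian innovations; the burn-in length and seed are arbitrary choices of ours):

```python
import numpy as np

def simulate_ar1_arch1(n, phi=0.5, omega=0.1, alpha=0.4, burn=100, seed=5):
    """Simulate model A7: z_t = 0.5 z_{t-1} + eps_t with ARCH(1) errors,
    h_t = 0.1 + 0.4 eps_{t-1}^2, and i.i.d. N(0,1) innovations xi_t."""
    rng = np.random.default_rng(seed)
    total = n + burn
    z = np.zeros(total)
    eps = np.zeros(total)
    for t in range(1, total):
        h_t = omega + alpha * eps[t - 1] ** 2
        eps[t] = rng.standard_normal() * np.sqrt(h_t)
        z[t] = phi * z[t - 1] + eps[t]
    return z[burn:]

z = simulate_ar1_arch1(300)
lag1_corr = np.corrcoef(z[1:], z[:-1])[0, 1]  # close to phi = 0.5
```

Models A6 and A8 follow the same recursion with a GARCH variance equation (and, for A6, no AR mean part).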

In all experiments, we use the R software (www.R-project.org, R Core Team 2020) to simulate 1000 replicates of artificial series of size \(n+n/2\), with \(n\in \{100,300\}\); only the last n data points are used to carry out the portmanteau tests on the residuals of the fitted models.

The empirical size and power of the tests are calculated based on a nominal level of 5%. Simulation results for nominal levels 1% and 10% are not reported to save space, but they are available upon request.

First, we calculate the type I error probability, at lags \(m \in \{5,10\}\), based on the six statistics \(C_{12}, C_{21},Q_{12},Q_{21}\), \(Q_{22}\), and \(Q_{\text {WL}}\), when the true model is fitted to series generated according to models A1-A5. Second, we investigate the accuracy of the estimated type I error probability using the four statistics \(C_{12},C_{21},Q_{\text {WL}}\), and \(Q_{\text {LM}}\), when the true model is fitted to series generated according to models A6-A8.

Table 1 Empirical sizes, for 5% significance test, of the indicated statistic, distribution, model, n, and m

Under a test with nominal size \(5\%\), the empirical size over 1000 independent simulations should belong to the 95% confidence interval \([3.65\%,6.35\%]\) and to the 99% confidence interval \([3.22\%, 6.78\%]\). From Tables 1 and 2, note that tests based on the statistics \(Q_{\text {WL}}\) and \(Q_{\text {LM}}\) can distort the test size, whereas the tests using the statistics \(Q_{22},Q_{12},Q_{21}\) and the proposed statistics exhibit no substantial size distortion and, generally, have empirical levels that improve as n increases. We also found similar results for the cases where the error terms have either a skew-normal distribution with asymmetry parameter \(\kappa \in \{-1.0, -0.5, 0.5, 1.0\}\) or a Student-t distribution with degrees of freedom \(\nu \in \{5,15,20\}\). These results are omitted here to save space, but they are available upon request.
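The quoted intervals follow from the normal approximation to a binomial proportion over \(N=1000\) replicates; as a quick check:

```python
import math

# Normal-approximation interval for an empirical rejection rate over N
# independent replicates at nominal size p: p +/- z * sqrt(p(1-p)/N).
p, N = 0.05, 1000
se = math.sqrt(p * (1 - p) / N)
ci95 = (p - 1.960 * se, p + 1.960 * se)   # approximately (0.0365, 0.0635)
ci99 = (p - 2.576 * se, p + 2.576 * se)   # approximately (0.0322, 0.0678)
```

Empirical sizes outside these bands therefore indicate a size distortion beyond what Monte Carlo noise alone would explain.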

4.1 Testing linearity in linear time series models

Now, we investigate the power of the test statistics \(C_{12},C_{21},Q_{12},Q_{21},Q_{22}\), and \(Q_{\text {WL}}\) to detect misspecification in the mean term. For expositional simplicity, we define \(C^\star \) as the highest test power attained by the statistics \(C_{12}\) and \(C_{21}\), and \(Q^\star \) as the highest test power attained by \(Q_{12}\) and \(Q_{21}\), that is,

$$\begin{aligned} C^\star =\max (C_{12},C_{21}),\quad Q^\star =\max (Q_{12},Q_{21}). \end{aligned}$$
(4.4)

The power of the tests is calculated under the null hypothesis \(\mathbb {H}_0\) that \(z_t\) satisfies the ARMA model

$$\begin{aligned} z_t = \mu _t + \sum _{i =1}^{p}\phi _i z_{t-i} + \sum _{i =1}^{q}\theta _i \varepsilon _{t-i} + \varepsilon _t, \end{aligned}$$

which can be represented as an AR(p) model with \(p\rightarrow \infty \). We follow the approach presented by Ng and Perron (2005), who used the Bayesian information criterion (BIC) to select the order \(p\in \{0,1,\cdots ,\lfloor 8(n/100)^{1/4}\rfloor \}\), where \(\lfloor a \rfloor \) denotes the floor function (integer part) of the number \(a\in \mathbb {R}\), when an AR(p) model is erroneously fitted to series generated from the following models studied by Li and Mak (1994), Wong and Ling (2005), Han and Ling (2017), and Psaradakis and Vávra (2019):

B1.:

Bilinear (BL) model: \(z_t=0.2+0.4z_{t-1}+\varepsilon _{t}+\varphi z_{t-1}\varepsilon _{t-1}\), where \(\varepsilon _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\), with parameter values of \(\varphi \) being selected in the range \(0<\varphi <2.5\);

B2.:

Random coefficient AR (RCAR) model: \(z_t= 0.2z_{t-1}+u_{t}, u_{t}=\varphi \eta _{t} z_{t-1}+\varepsilon _{t}\), where \(\{\varepsilon _t\}\) and \(\{\eta _t\}\) are two mutually independent sequences of i.i.d. \(\mathcal {N}(0,1)\) random variables; note that the RCAR model is a special case of the AR(1)-ARCH(1) structure, since \(\textrm{E}(u_t^2|\mathcal {F}_{t-1})=\varphi ^2 z_{t-1}^2+1\) varies over time. We select parameter values of \(\varphi \) from the range \(0<\varphi <2.5\);

B3.:

TAR model: \(z_t=0.8 z_{t-1}\mathbb {I}_{\{z_{t-1}\le -1\}} - 0.8 z_{t-1}\mathbb {I}_{\{z_{t-1}> -1\}} +\varepsilon _t\), where \(\varepsilon _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\);

B4.:

AR(1)-ARCH(2) model: \(z_t=0.2 z_{t-1}+\varepsilon _{t}, \quad \varepsilon _t=\xi _t\sqrt{h_t}, \quad h_t=0.2+0.2\varepsilon _{t-1}^2+0.2\varepsilon _{t-2}^2\), where \(\xi _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1)\).
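The simulation design above can be sketched in code. The following Python fragment is a minimal illustration rather than the authors' implementation: it generates a series from the bilinear model B1 and selects the AR order by BIC over the Ng and Perron (2005) range \(\{0,\ldots,\lfloor 8(n/100)^{1/4}\rfloor\}\); the OLS fitting and the burn-in length are our own simplifying choices.

```python
import numpy as np

def simulate_bl(n, phi, burn=200, seed=0):
    """Simulate the bilinear model B1:
    z_t = 0.2 + 0.4*z_{t-1} + e_t + phi*z_{t-1}*e_{t-1}, e_t ~ i.i.d. N(0,1)."""
    rng = np.random.default_rng(seed)
    e = rng.standard_normal(n + burn)
    z = np.zeros(n + burn)
    for t in range(1, n + burn):
        z[t] = 0.2 + 0.4 * z[t - 1] + e[t] + phi * z[t - 1] * e[t - 1]
    return z[burn:]  # discard burn-in so the start-up values are forgotten

def select_ar_order_bic(z):
    """Choose p in {0, ..., floor(8*(n/100)^(1/4))} minimising the BIC of an
    OLS-fitted AR(p) with intercept, on a common estimation sample."""
    n = len(z)
    pmax = int(np.floor(8 * (n / 100) ** 0.25))
    y = z[pmax:]                       # common sample so BIC values are comparable
    best_p, best_bic = 0, np.inf
    for p in range(pmax + 1):
        if p == 0:
            resid = y - y.mean()       # intercept-only model
        else:
            X = np.column_stack([z[pmax - i:n - i] for i in range(1, p + 1)])
            X = np.column_stack([np.ones(len(y)), X])
            beta, *_ = np.linalg.lstsq(X, y, rcond=None)
            resid = y - X @ beta
        sigma2 = np.mean(resid ** 2)
        bic = len(y) * np.log(sigma2) + (p + 1) * np.log(len(y))
        if bic < best_bic:
            best_p, best_bic = p, bic
    return best_p

z = simulate_bl(300, phi=0.6, seed=42)
p_hat = select_ar_order_bic(z)
print(p_hat)  # selected order, between 0 and floor(8*(300/100)**0.25) = 10
```

The same order-selection routine applies unchanged to series generated from B2, B3, or B4.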

Table 2 Empirical sizes, at the 5% significance level, of the indicated statistic, distribution, model, n, and m

Note that we found similar results based on the Akaike information criterion (AIC; Akaike 1974).

Fig. 2

Rejection frequencies of the indicated statistic for the bilinear (B1) and RCAR (B2) models with \(0<\varphi <2.5\) and listed sample sizes n

Fig. 3

Rejection frequencies of the indicated statistic for the TAR (B3) and AR(1)-ARCH(2) (B4) models with the listed sample size n

Fig. 4

Empirical power, at the 5% nominal level, of the indicated statistic and sample size n when an AR(1)-ARCH(1) model erroneously fits data generated from the AR(1)-ARCH(2), AR(1)-GARCH(1,1), AR(2)-ARCH(2), and TAR with GJR-GARCH(1,1) models, labeled D1, D2, D3, and D4, respectively

Figure 2 displays the rejection frequencies, at the 5% nominal level, of the statistics \(C^\star ,Q^\star ,Q_{22}\), and \(Q_{\text {WL}}\) when an AR(p) model erroneously fits data of size \(n\in \{100,300\}\) from the BL (B1) and RCAR (B2) models at lag value \(m=\lfloor \sqrt{n}\rfloor \). The results for both models are based on parameter values \(\varphi \) varying from 0 to 2.5, as mentioned. In addition, Fig. 3 shows the rejection probabilities of the aforementioned tests (at nominal level 0.05) for the nonlinear TAR (B3) and AR(1)-ARCH(2) (B4) models. For \(n\in \{100,300\}\), the tests are calculated at lags \(m\in \{2,4,6,8,10\}\) and \(m\in \{3,6,9,13,17\}\), respectively. Figures 2 and 3 show that the proposed statistic \(C^\star \) performs, in general, the best, especially for small sample sizes. Thus, we conclude that the proposed statistics are helpful for testing the linearity of stationary time series.
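To illustrate how such rejection frequencies are obtained by Monte Carlo, the sketch below estimates the empirical size of a McLeod–Li-type test (a Ljung–Box statistic applied to the squared series) under an i.i.d. Gaussian null. The proposed statistics \(C^\star\) and \(Q^\star\) have no public Python implementation, so this familiar statistic serves only as a stand-in; the number of replications and the hard-coded critical value are our choices.

```python
import numpy as np

def ljung_box(x, m):
    """Ljung-Box statistic at lag m: n(n+2) * sum_k r_k^2 / (n-k)."""
    n = len(x)
    xc = x - x.mean()
    denom = np.sum(xc ** 2)
    stat = 0.0
    for k in range(1, m + 1):
        r_k = np.sum(xc[k:] * xc[:-k]) / denom   # lag-k sample autocorrelation
        stat += r_k ** 2 / (n - k)
    return n * (n + 2) * stat

def rejection_frequency(n=100, m=6, reps=500, seed=1):
    """Empirical rejection rate of a McLeod-Li-type test (Ljung-Box on the
    squared series) when the data are i.i.d. Gaussian, i.e. the null holds."""
    rng = np.random.default_rng(seed)
    crit = 12.592  # 5% critical value of chi-square with m = 6 degrees of freedom
    rejections = 0
    for _ in range(reps):
        z = rng.standard_normal(n)
        if ljung_box(z ** 2, m) > crit:
            rejections += 1
    return rejections / reps

size = rejection_frequency()
print(size)  # should lie near the 5% nominal level
```

Replacing the Gaussian generator with one of the models B1–B4 turns the same loop into an empirical-power estimate.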

4.2 Testing the AR-ARCH models

To examine the tests' ability to detect misspecification in the mean and conditional variance parts of a time series model, we consider the AR-ARCH model versus nonlinear models with GARCH errors. Under the null hypothesis \(\mathbb {H}_0\), the process \(\{z_t\}\) satisfies the AR(1)-ARCH(1) model

$$\begin{aligned}{} & {} z_t=\phi z_{t-1}+\varepsilon _t, \quad \varepsilon _t=\xi _{t}\sqrt{h_{t}}, \\{} & {} h_t=\omega +\alpha _1\varepsilon _{t-1}^2, \quad \xi _t\overset{\mathrm {i.i.d.}}{\sim }\mathcal {N}(0,1). \end{aligned}$$

The alternative models are:

D1.:

AR(1)-ARCH(2) model: \(z_t=0.5z_{t-1}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.01+0.4\varepsilon _{t-1}^2+0.3\varepsilon _{t-2}^2\);

D2.:

AR(1)-GARCH(1,1) model: \(z_t=0.5z_{t-1}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.041+0.4\varepsilon _{t-1}^2+0.5 h_{t-1}\);

D3.:

AR(2)-ARCH(2) model: \(z_t=0.5z_{t-1}+0.2z_{t-2}+ \xi _{t}\sqrt{h_{t}}\), \(h_t=0.01+0.4\varepsilon _{t-1}^2+0.2 \varepsilon _{t-2}^2\);

D4.:

TAR model with GJR-GARCH(1,1) error:

$$\begin{aligned}{} & {} z_t=0.4z_{t-1}+0.5z_{t-1}\mathbb {I}_{\{z_{t-1}>0\}}+\xi _{t}\sqrt{h_{t}},\\{} & {} h_t=0.1+(0.3+0.4 \mathbb {I}_{\{\varepsilon _{t-1}<0\}})\varepsilon _{t-1}^2+0.4 h_{t-1}. \end{aligned}$$
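For concreteness, model D4 can be simulated directly from its recursions. The sketch below is our own illustration (the burn-in length and the seed are arbitrary choices, and \(\varepsilon_t=\xi_t\sqrt{h_t}\) as in the null model):

```python
import numpy as np

def simulate_d4(n, burn=500, seed=7):
    """Simulate D4: TAR mean equation with GJR-GARCH(1,1) errors,
    z_t = 0.4 z_{t-1} + 0.5 z_{t-1} I{z_{t-1} > 0} + eps_t,
    h_t = 0.1 + (0.3 + 0.4 I{eps_{t-1} < 0}) eps_{t-1}^2 + 0.4 h_{t-1}."""
    rng = np.random.default_rng(seed)
    xi = rng.standard_normal(n + burn)
    z = np.zeros(n + burn)
    h = np.zeros(n + burn)
    eps = np.zeros(n + burn)
    h[0] = 0.1  # arbitrary start-up value; forgotten after the burn-in
    for t in range(1, n + burn):
        leverage = 0.4 if eps[t - 1] < 0 else 0.0   # GJR asymmetry term
        h[t] = 0.1 + (0.3 + leverage) * eps[t - 1] ** 2 + 0.4 * h[t - 1]
        eps[t] = xi[t] * np.sqrt(h[t])
        z[t] = 0.4 * z[t - 1] + (0.5 if z[t - 1] > 0 else 0.0) * z[t - 1] + eps[t]
    return z[burn:]

z = simulate_d4(300)
print(z[:5])
```

The recursions for D1–D3 are simulated analogously, with the threshold and leverage terms removed.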
Table 3 p-values for testing the neglected nonlinearity in AR models with the indicated statistic for the listed company from the S&P 500 index

For each model, the power of the statistics \(C^\star ,Q_{\text {WL}}\), and \(Q_{\text {LM}}\) is calculated at lags \(m\in \{2,4,6,8,10\}\) and \(m\in \{3,6,9,13,17\}\) for \(n=100\) and \(n=300\), respectively. The results are shown in Fig. 4. Note that the performance of the proposed test is, in general, better than that of the other two tests.

It is worth noting that, in general, as the lag order increases, the power of the portmanteau tests decreases, especially under models such as the TAR model, for several reasons, including the following:

  • When the lag is large compared to the overall sample size, we usually get less reliable estimates of autocorrelation and, consequently, less power in the portmanteau test.

  • As we increase the lag order, we include more lags in the test, which means estimating more autocorrelations. This results in a loss of degrees of freedom in the test statistics, and with fewer degrees of freedom the test becomes less sensitive to departures from the null hypothesis of no autocorrelation.

  • TAR models have regimes separated by nonlinear thresholds, which complicates the estimation of the autocorrelation function at higher lag orders, making it more challenging to detect the remaining serial dependence.

5 Empirical applications

5.1 Test for nonlinearity in AR models using stock returns

We demonstrate the usefulness of the proposed tests for detecting nonlinearity in AR models for a set of weekly stock returns. We select 92 companies studied by Kapetanios (2009) and Psaradakis and Vávra (2019). These companies are a subset of the Standard & Poor's 500 composite index (S&P 500), spanning the period from 18 June 1993 to 31 December 2007 (\(n=781\) observations).

Following the procedure of Psaradakis and Vávra (2019), we fit an AR(p) model to each series, where the order p is selected by minimizing the BIC according to the algorithm explained in Sect. 4.1 (Ng and Perron 2005). The asymptotic p-values (at the 5% significance level) of the tests based on the statistics \(C^\star , Q^\star , Q_{22}\), and \(Q_{\text {WL}}\), with \(m=\lfloor \ln n\rfloor \), are reported in Table 3. Since conditional heteroskedasticity is often considered a main characteristic of asset returns, we expect that the AR model will not capture the nonlinear features in most of the stock returns considered in our analysis, so the null hypothesis of linearity should be rejected.

From the results in Table 3, we found that the linearity assumption is rejected by the proposed tests in \(81 (88.0\%)\) cases, compared with \(64 (69.6\%), 74 (80.4\%)\), and \(71 (77.2\%)\) cases on the basis of the test statistics \(Q^\star ,Q_{22}\), and \(Q_{\text {WL}}\), respectively. This arguably suggests that the proposed tests are preferable for testing for the presence of nonlinearity in AR models for asset returns.

5.2 Goodness-of-fit-tests for nonlinear time series models

We examine the ability of the portmanteau tests to identify an unsuitable model for the weekly stock returns of the Aon plc company studied in Sect. 5.1. Using the portmanteau tests, we find strong evidence against linearity in the AR model for the returns of this company. The Aon plc returns are displayed in Fig. 5, which shows that the log-return series has high persistence in volatility with negative skewness and excess kurtosis. We conclude, therefore, that these returns might exhibit conditional heteroskedasticity effects, and a model that belongs to the ARCH family with a Student-t distribution for the error process might better explain the leptokurtic distribution of the returns. Thus, we fit ARCH(1), AR(1)-ARCH(1), ARCH(2), and GARCH(1,1) models and apply the test statistics \(C^\star , Q_{\text {WL}}\), and \(Q_{\text {LM}}\) at lag value \(m=6\). The asymptotic p-values for testing model adequacy based on these statistics are reported in Table 4. From this table, the test statistic \(Q_{\text {LM}}\) fails to detect the inadequacy of all of the fitted models, whereas the test statistic \(Q_{\text {WL}}\) suggests that the ARCH(1), ARCH(2), and GARCH(1,1) models might be suitable for describing the Aon plc returns. Only the proposed test statistics give a clear indication of the inadequacy of the ARCH(1), AR(1)-ARCH(1), and ARCH(2) models, while the GARCH(1,1) model might be adequate for the Aon plc returns according to the proposed test statistics.

Table 4 p-values for testing the adequacy of the indicated model for Aon plc returns based on the listed statistic
Fig. 5

Time series of weekly closing returns of the Aon plc company from 18 June 1993 to 31 December 2007 (left) and histogram of their log-returns (right)

6 Conclusions

In this article, we have introduced four mixed portmanteau statistics for assessing the adequacy of time series models. The proposed tests are based on a linear combination of three autocorrelation and cross-correlation components. The first and second components are derived from the autocorrelations of the residuals and their squared values, respectively, while the third component accounts for the cross-correlations between the residuals and their squared values at both positive and negative lags. Two of these tests can be viewed as extended versions of the Ljung and Box test, while the others can be considered extensions of the Box and Pierce test. Based on our simulation study, we recommend the proposed tests that extend the Ljung and Box test: they demonstrate better control of the type I error probability than existing tests, and they generally exhibit more statistical power than tests relying on the statistics introduced by Li and Mak (1994), McLeod and Li (1983), Psaradakis and Vávra (2019), and Wong and Ling (2005).

Simulation results indicate that combining \(\varvec{R}^{(1,2)}\) and \(\varvec{R}^{(2,1)}\) in a single test statistic significantly reduces the test's power. This can be explained by the lack of independence between these two components: they share a substantial amount of information about the correlation structure, so combining them adds redundant correlation, which decreases the power of the proposed test.

Some of the test statistics that we have discussed carry a high computational burden, so we have implemented them in an R package named portes (Mahdi and McLeod 2020). The idea discussed in this article may be extended to formulate an omnibus portmanteau test that combines the cross-correlations between the residuals and their squared values, at both positive and negative lags, with the autocorrelations of the residuals and their squared values. The framework we propose could also be expanded to identify seasonality in time series and to detect various types of nonlinear dependence in multivariate time series, as discussed by Mahdi (2016).

In this article, our focus has been on measures derived from second-order mixed moments. Nevertheless, there is potential for extending these measures to higher-order moments. Our simulation study revealed that severe skewness, such as in the case of a skewed t-distribution, can distort the size of all portmanteau tests. When distributional assumptions are relaxed, bootstrapping and Monte Carlo significance test approaches offer a robust alternative (Efron and Tibshirani 1994; Lin and McLeod 2006; Mahdi and McLeod 2012). Therefore, a possible extension of this article could involve using these approaches to calculate p-values.
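A Monte Carlo significance test along these lines can be sketched as follows. This is a simplified illustration, not the procedure of Lin and McLeod (2006) verbatim: i.i.d. Gaussian innovations stand in for re-simulating from the fitted model, and the Box–Pierce-style statistic on squares is a stand-in for the proposed statistics.

```python
import numpy as np

def ml_stat(x, m=6):
    """Box-Pierce-style statistic on the squared series:
    n times the sum of the first m squared sample ACFs of x^2."""
    n = len(x)
    s = x ** 2 - np.mean(x ** 2)
    denom = np.sum(s ** 2)
    return n * sum((np.sum(s[k:] * s[:-k]) / denom) ** 2 for k in range(1, m + 1))

def monte_carlo_pvalue(stat_fn, observed_resid, n_sim=199, seed=3):
    """Monte Carlo significance test: the p-value is the proportion of
    simulated null statistics at least as large as the observed one,
    with the +1 correction so that p is never exactly zero."""
    rng = np.random.default_rng(seed)
    n = len(observed_resid)
    t_obs = stat_fn(observed_resid)
    exceed = sum(stat_fn(rng.standard_normal(n)) >= t_obs for _ in range(n_sim))
    return (1 + exceed) / (1 + n_sim)

rng = np.random.default_rng(0)
p = monte_carlo_pvalue(ml_stat, rng.standard_normal(200))
print(p)  # under the null, p should not be systematically small
```

In a full implementation, each simulated series would be generated from the fitted model and the model would be re-estimated before computing the statistic, so that estimation uncertainty is reflected in the null distribution.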