Abstract
We derive the asymptotic distribution of residual autocorrelations for the Weibull autoregressive conditional duration (ACD) model, and this leads to a portmanteau test for the adequacy of the fitted Weibull ACD model. The finite-sample performance of this test is evaluated by simulation experiments and a real data example is also reported.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
- Autoregressive conditional duration model
- Weibull distribution
- Model diagnostic checking
- Residual autocorrelation
Mathematics Subject Classification (2010)
1 Introduction
First proposed by Engle and Russell [3], the autoregressive conditional duration (ACD) model has become very popular in the modeling of high-frequency financial data. ACD models are applied to describe the duration between trades for a frequently traded stock such as IBM and it provides useful information on the intraday market activity. Note that the ACD model for durations is analogous to the commonly used generalized autoregressive conditional heteroscedastic (GARCH) model [1, 2] for stock returns. Driven by the strong similarity between the ACD and GARCH models, various extensions to the original ACD model of Engle and Russell [3] have been suggested. However, despite the great variety of ACD specifications, the question of model diagnostic checking has received less attention.
The approach used by Engle and Russell [3] and widely adopted by subsequent authors to assess the adequacy of the estimated ACD model consists of applying the Ljung–Box Q-statistic [7] to the residuals from the fitted time series model and to its squared sequence. The latter case is commonly known as the McLeod–Li test [8]. As pointed out by Li and Mak [5] in the context of GARCH models, this approach is questionable, because this test statistic does not have the usual asymptotic chi-square distribution under the null hypothesis when it is applied to residuals of an estimated GARCH model. Following Li and Mak [5], Li and Yu [6] derived a portmanteau test for the goodness-of-fit of the fitted ACD model when the errors follow the exponential distribution.
In this paper, we consider a portmanteau test for checking the adequacy of the fitted ACD model when the errors have a Weibull distribution. This paper has similarities to [6] since the two papers both follow the approach by Li and Mak [5] to construct the portmanteau test statistic. Besides the difference in the distribution of the error term, the functional form of the ACD model in the present paper is more general than that of [6], because the latter only discusses the ACD model with an ARCH-like form of the conditional mean duration.
The remainder of this paper is organized as follows. Section 2 presents the portmanteau test for the Weibull ACD model estimated by the maximum likelihood method. In Sect. 3, two Monte Carlo simulations are performed to study the finite-sample performance of the diagnostic tool and an illustrative example is reported to demonstrate its usefulness.
2 A Portmanteau Test
2.1 Basic Definitions and the ML Estimation
Consider the autoregressive conditional duration (ACD) model,
where \(t_0<t_1<\cdots<t_n<\cdots \) are arrival times, \(x_i=t_i-t_{i-1}\) is an interval, \(\omega >0\), \(\alpha _j\ge 0\), \(\beta _j\ge 0\), and the innovations \(\{\varepsilon _t\}\) are identically and independently distributed (i.i.d.) nonnegative random variables with mean one [3].
For ACD model at (1), we assume that the innovation \(\varepsilon _i\) has the density of a standardized Weibull distribution,
where \(c_{\gamma }=[\varGamma (1+\gamma ^{-1})]^{\gamma }\), \(\varGamma (\cdot )\) is the Gamma function, and \(E(\varepsilon _i)=1\). The Weibull distribution has a decreasing (increasing) hazard function if \(\gamma <1\) (\(\gamma >1\)) and reduces to the standard exponential distribution if \(\gamma = 1\). We denote this model by WACD(p, q) in this paper.
Let \(\varvec{\alpha }=(\alpha _1,\ldots ,\alpha _p)^{\prime }\), \(\varvec{\beta }=(\beta _1,\ldots ,\beta _q)^{\prime }\) and \(\varvec{\theta }=(\omega ,\varvec{\alpha }^{\prime },\varvec{\beta }^{\prime })^{\prime }\). Denote by \(\varvec{\lambda }=(\gamma ,\varvec{\theta }^{\prime })^{\prime }\) the parameter vector of the Weibull ACD model, and its true value \(\varvec{\lambda }_0=(\gamma _0,\varvec{\theta }_0^{\prime })^{\prime }\) is an interior point of a compact set \(\varLambda \subset \mathbb {R}^{p+q+2}\). The following assumption gives some constraints on the parameter space \(\varLambda \).
Assumption 1
\(\omega >0\), \(\alpha _j>0\) for \(1\le j\le p\), \(\beta _j>0\) for \(1\le j\le q\), \(\sum _{j=1}^p\alpha _j+\sum _{j=1}^q\beta _j<1\), and Polynomials \(\sum _{j=1}^p\alpha _jx^j\) and \(1-\sum _{j=1}^q\beta _jx^j\) have no common root.
Given nonnegative observations \(x_1,\ldots ,x_n\), the log-likelihood function of the Weibull ACD model is
Note that the above functions all depend on unobservable values of \(x_i\) with \(i\le 0\), and some initial values are hence needed for \(x_0, x_{-1},\ldots ,x_{1-p}\) and \(\psi _0(\varvec{\theta }),\psi _{-1}(\varvec{\theta }),\ldots ,\psi _{1-q}(\varvec{\theta })\). We simply set them to be \(\bar{x}=n^{-1}\sum _{i=1}^nx_i\), and denote the corresponding functions respectively by \(\widetilde{\psi }_i(\varvec{\theta })\) and \(\widetilde{L}_n(\varvec{\lambda })\). Thus, the MLE can be defined as
Let
and
where \(c_{\gamma }^{\prime }=\partial c_{\gamma }/\partial \gamma \). It can be verified that \(E[c_1(\varepsilon _i,\gamma _0)]=0\) and \(E[c_2(\varepsilon _i,\gamma _0)]=0\). Denote \(\kappa _1=\text {var}[c_1(\varepsilon _i,\gamma _0)]\), \(\kappa _2=\text {var}[c_2(\varepsilon _i,\gamma _0)]\), \(\kappa _3=\text {cov}[c_1(\varepsilon _i,\gamma _0),c_2(\varepsilon _i,\gamma _0)]\) and
If Assumption 1 holds, then \(\widetilde{\varvec{\lambda }}_n\) converges to \({\varvec{\lambda }}_0\) in almost surely sense as \(n\rightarrow \infty \), and \(\sqrt{n}(\widetilde{\varvec{\lambda }}_n-\varvec{\lambda }_0) \rightarrow _d N(0,\varSigma ^{-1})\) as \(n\rightarrow \infty \); see Engle and Russell [3] and Francq and Zakoian [4].
Denote by \(\{\widetilde{\varepsilon }_i\}\) the residual sequence from the fitted Weibull ACD model, where \(\widetilde{\varepsilon }_i=x_i/\widetilde{\psi }_i(\widetilde{\varvec{\theta }}_n)\). For the quantities in the information matrix \(\varSigma \), \(\kappa _1\), \(\kappa _2\), \(\kappa _3\), \(E[\psi _i^{-1}(\varvec{\theta }_0){\partial \psi _i(\varvec{\theta }_0) }/{\partial \varvec{\theta }}]\), and \(E[\psi _i^{-2}(\varvec{\theta }_0)({\partial \psi _i(\varvec{\theta }_0) }/{\partial \varvec{\theta }})({\partial \psi _i(\varvec{\theta }_0) }/{\partial \varvec{\theta }^{\prime }})]\), we can estimate them respectively by
The above estimators are all consistent, and hence a consistent estimator of the information matrix \(\varSigma \). Moreover,
where
2.2 The Main Result
This subsection derives asymptotic distributions of the residual autocorrelations from the estimated Weibull ACD model, and hence a portmanteau test for checking the adequacy of this model. Note that the residuals are nonnegative, and the residual autocorrelations here are also the absolute residual autocorrelations.
Without confusion, we denote \(\widetilde{\psi }_i(\widetilde{\varvec{\theta }}_n)\) and \(\psi _i(\varvec{\theta }_0)\) respectively by \(\widetilde{\psi }_i\) and \(\psi _i\) for simplicity. Consider the residual sequence \(\{\widetilde{\varepsilon }_i\}\) with \(\widetilde{\varepsilon }_i=x_i/\widetilde{\psi }_i\). Note that \(n^{-1}\sum _{i=1}^n\widetilde{\varepsilon }_i=1+o_p(1)\) and then, for a positive integer k, the lag-k residual autocorrelation can be defined as
We next consider the asymptotic distributions of the first K residual autocorrelations, \(\widetilde{R}=(\widetilde{r}_1,\ldots ,\widetilde{r}_K)^{\prime }\), where K is a predetermined positive integer.
Denote \(\widetilde{\psi }_i(\widetilde{\varvec{\theta }}_n)\) and \(\psi _i(\varvec{\theta }_0)\) respectively by \(\widetilde{\psi }_i\) and \(\psi _i\), and let \(\widetilde{\varepsilon }_i=x_i/\widetilde{\psi }_i\). Let \(\widetilde{C}=(\widetilde{C}_1,\ldots ,\widetilde{C}_K)^{\prime }\) and \(C=(C_1,\ldots ,C_K)^{\prime }\), where
By the \(\sqrt{n}\)-consistency of \(\widetilde{\varvec{\theta }}_n\) at (2) and the ergodic theorem, it follows that \(n^{-1}\sum _{i=1}^n(\widetilde{\varepsilon }_i-1)^2=\sigma _{\gamma _0}^2+o_p(1),\) where \(\sigma _{\gamma _0}^2=\text {var}(\varepsilon _i)\), and thus it suffices to derive the asymptotic distribution of \(\widetilde{C}\).
By the Taylor expansion, it holds that
where \(H=(H_1,\ldots ,H_K)\) with \(H_k=-E [\psi _i^{-1}(\varepsilon _{i-k}-1)\partial \psi _i/\partial \varvec{\theta }]\). Moreover,
where the \(c_j(\varepsilon _i,\gamma _0)\) is as defined in Sect. 2.1, and the matrix \(A=(0,\mathbf {I})\) with \(\mathbf {I}\) being the \((p+q+1)\)-dimensional identity matrix. Note that \(E[\varepsilon _ic_2(\varepsilon _i,\gamma _0)]=0\) and \(E[\varepsilon _ic_1(\varepsilon _i,\gamma _0)]=1\). By (3), (4), the central limit theorem and the Cramér-Wold device, it follows that
where \(\varOmega =\mathbf {I}-\sigma _{\gamma _0}^{-4}H^{\prime }\varSigma _1^{-1}H\), \(\sigma _{\gamma _0}^2=\text {var}(\varepsilon _i)\), \(H=(H_1,\ldots ,H_K)\) with \(H_k=-E [\psi _i^{-1}(\varepsilon _{i-k}-1)\partial \psi _i/\partial \varvec{\theta }]\), and \(\varSigma _1\) is as defined in Sect. 2.1.
Let \(\widetilde{\sigma }_{\gamma _0}^2=n^{-1}\sum _{i=1}^n(\widetilde{\varepsilon }_i-1)^2\), \(\widetilde{H}_k=-n^{-1}\sum _{i=1}^n \widetilde{\psi }_i^{-1}(\widetilde{\varepsilon }_{i-k}-1)\partial \widetilde{\psi }_i/\partial \varvec{\theta }\) and \(\widetilde{H}=(\widetilde{H}_1,\ldots ,\widetilde{H}_K)\). Then we have \(\widetilde{H}=H+o_p(1)\) and hence a consistent estimator of \(\varOmega \) can be constructed, denoted by \(\widetilde{\varOmega }\). Let \(\widetilde{\varOmega }_{kk}\) be the diagonal elements of \(\widetilde{\varOmega }\), for \(1\le k\le K\). We therefore can check the significance of \(\widetilde{r}_k\) by comparing its absolute value with \(1.96\sqrt{\widetilde{\varOmega }_{kk}/n}\), where the significance level is 5 %.
To check the significance of \(\widetilde{R}=(\widetilde{r}_1,\ldots ,\widetilde{r}_K)^{\prime }\) jointly, we can construct a portmanteau test statistic,
and it will be asymptotically distributed as \(\chi _K^2\), the chi-square distribution with K degrees of freedom.
3 Numerical Studies
3.1 Simulation Experiments
This subsection conducts two Monte Carlo simulation experiments to check the finite-sample performance of the proposed portmanteau test in the previous section.
The first experiment evaluates the sample approximation for the asymptotic variance of residual autocorrelations \(\varOmega \), and the data generating process is
where \(\varepsilon _i\) follows the standardized Weibull distribution with the parameter of \(\gamma \). We consider \(\gamma =0.8\) and 1.2, corresponding to a heavy-tailed distribution and a light-tailed one, and \((\alpha ,\beta )^{\prime }=(0.2,0.6)^{\prime }\) and \((0.4,0.5)^{\prime }\). The sample size is set to \(n=200\), 500 or 1000, and there are 1000 replications for each sample size. As shown in Table 1, the asymptotic standard deviations (ASDs) of the residual autocorrelations at lags 2, 4 and 6 are close to their corresponding empirical standard deviations (ESDs) when the sample size is as small as \(n=500\).
In the second experiment, we check the size and power of the proposed portmanteau test Q(K) using the data generating process,
where \(\alpha _2 = 0\), 0.15 or 0.3, and \(\varepsilon _i\) follows the standardized Weibull distribution with \(\gamma =0.8\) or 1.2. All the other settings are preserved from the previous experiment. We fit the model of orders (1, 1) to the generated data; hence, the case with \(\alpha _2 = 0\) corresponds to the size and those with \(\alpha _2 > 0\) to the power. The rejection rates of test statistic Q(K) with \(K=6\) are given in Table 2. For comparison, the corresponding rejection rates of the Ljung–Box statistics for the residual series and its squared process are also reported, denoted by \(Q^*_1(K)\) and \(Q^*_2(K)\). The critical value is the upper 5th percentile of the \(\chi ^2_6\) distribution for all these tests. As shown in the table, the test Q(K) is oversized when \(n=1000\), while the other two tests are largely undersized for some \(\gamma \). Furthermore, we found that increasing the sample size to 9000 could result in Q(K) having sizes of 0.058 and 0.053 for \(\gamma =0.8\) and 1.2, while the sizes of the other two tests do not become closer to the nominal value even for very large n. For the power simulations, it can be seen clearly that Q(K) is the most powerful test among the three and \(Q^*_2(K)\) is the least powerful one. Moreover, the powers are interestingly observed to have smaller values when the generated data are heavy-tailed (\(\gamma =0.8\)).
3.2 An Empirical Example
As an illustrative example, this subsection considers the trade durations of the US IBM stock on fifteen consecutive trading days starting from November 1, 1990. The data are truncated from a larger data set which consists of the diurnally adjusted IBM trade durations data from November 1, 1990, to January 31, 1991, adjusted and analyzed by Tsay [9, Chap. 5]. Focusing on positive durations, we have 12,532 diurnally adjusted observations.
We consider the WACD(p, q) models with \(p=1\) and \(q=1, 2\) or 3. The major interest is on whether the models fit the data adequately. To this end, the p values for Q(K), \(Q^*_1(K)\) and \(Q^*_2(K)\) with \(K=6\), 12 and 18 at the 5 % significance level are reported in Table 3. It can be seen that the WACD(1, 3) model fits the data adequately according to all the test statistics. The fitted WACD(1, 1) model is clearly rejected by both Q(K) and \(Q^*_1(K)\) with \(K=6\), 12 and 18. For the fitted WACD(1, 2) model, both Q(K) and \(Q^*_1(K)\) suggest an adequate fit of the data with \(K=6\) or 12, but not with \(K=18\). While for the data, Q(K) and \(Q^*_1(K)\) always lead to the same conclusions, the fact that the p value for Q(K) is always smaller than that for \(Q^*_1(K)\) confirms that Q(K) is more powerful than \(Q^*_1(K)\). In contrast, \(Q^*_2(K)\) fails to detect any inadequacy of the fitted WACD models.
References
Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31, 307–327.
Engle, R. F. (1982). Autoregression conditional heteroscedasticity with estimates of the variance of U.K. inflation. Econometrica, 50, 987–1008.
Engle, R. F., & Russell, J. R. (1998). Autoregressive conditional duration: A new model for irregularly spaced transaction data. Econometrica, 66, 1127–1162.
Francq, C., & Zakoian, J. M. (2004). Maximum likelihood estimation of pure GARCH and ARMA-GARCH processes. Bernoulli, 10, 605–637.
Li, W. K., & Mak, T. K. (1994). On the squared residual autocorrelations in non-linear time series with conditoinal heteroskedasticity. Journal of Time Series Analysis, 15, 627–636.
Li, W. K., & Yu, P. L. H. (2003). On the residual autocorrelation of the autoregressive conditional duration model. Economic Letters, 79, 169–175.
Ljung, G. M., & Box, G. E. P. (1978). On a measure of lack of fit in time series models. Biometrika, 65, 297–303.
McLeod, A. I., & Li, W. K. (1983). Diagnostic checking ARMA time series models using squared residual autocorrelations. Journal of Time Series Analysis, 4, 269–273.
Tsay, R. S. (2010). Analysis of financial time series (3rd ed.). New York: Wiley.
Acknowledgements
We are grateful to the co-editor and two anonymous referees for their valuable comments and constructive suggestions that led to the substantial improvement of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this chapter
Cite this chapter
Zheng, Y., Li, Y., Li, W.K., Li, G. (2016). Diagnostic Checking for Weibull Autoregressive Conditional Duration Models. In: Li, W., Stanford, D., Yu, H. (eds) Advances in Time Series Methods and Applications . Fields Institute Communications, vol 78. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-6568-7_4
Download citation
DOI: https://doi.org/10.1007/978-1-4939-6568-7_4
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-6567-0
Online ISBN: 978-1-4939-6568-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)