Abstract
In this paper, we apply the least-squares method to estimate the unknown parameters in first-order random coefficient integer-valued autoregressive (\(\operatorname{RCINAR}(1)\)) processes. The least-squares estimator is derived and its limiting properties are discussed. Furthermore, we also derive a statistic to test the randomness of coefficients. Numerical results from simulation studies suggest that the proposed method is good for practical use.
Similar content being viewed by others
1 Introduction
Integer-valued time series have received increasing attention in the probabilistic and statistical literature over the past several years because of its applicability in many different areas such as the natural sciences, the social sciences, international tourism demand, and economy. See, for instance, Davis et al. [1] and MacDonald and Zucchini [2]. There are two main classes of time series models that have been developed recently for count data: state-space models and thinning models. For state-space models, we refer to Fukasawa and Basawa [3].
Steutal and Van Harn [4] defined a first-order integer-valued autoregressive (\(\operatorname{INAR}(1)\)) model. To this aim, they first proposed a ‘thinning’ operator ∘, which is defined as
where X is an integer-valued random variable and \(\phi\in[0,1]\), \(\{ B_{i}\}\) is an i.i.d. Bernoulli random sequence with \(P(B_{i}=1)=\phi\) that is independent of X. Based on the ‘thinning’ operator ∘, the \(\operatorname{INAR}(1)\) model is defined as
where \(\{Z_{t}\}\) is a sequence of i.i.d. non-negative integer-valued random variables.
The ‘thinning’ operator integer-valued models have been studied by many authors (see, e.g., [5–10]). Note that the parameter ϕ may vary with time and it may be random, Zheng et al. [11] extended the above model to the following first-order random coefficient integer-valued autoregressive (\(\operatorname{RCINAR}(1)\)) model:
where \(\{\phi_{t}\}\) are a sequence of i.i.d. sequence with cumulative distribution function \(p_{\phi}\) on \([0,1)\) with \(E(\phi_{t})=\phi\) and \(\operatorname{Var}(\phi_{t})=\sigma_{\phi}^{2}\); \(\{Z_{t}\}\) is a sequence of i.i.d. non-negative integer-valued random variables with \(E(Z_{t})=\lambda\) and \(\operatorname{Var}(Z_{t})=\sigma_{Z}^{2}\). Moreover, \(\{ \phi_{t}\}\) and \(\{Z_{t}\}\) are independent.
Obviously, when \(\sigma_{\phi}^{2}\) is equal to zero, the model (1.2) becomes an \(\operatorname{INAR}(1)\) model. Zheng et al. [12] also generalized the above model to a pth-order model. For model (1.2), Zheng et al. [11] established the ergodicity and derived the conditional least-squares and quasi-likelihood estimators of the model parameters. By employing the cumulative sum (CUSUM) test based on the conditional least-squares and modified quasi-likelihood estimators, Kang and Lee [13] considered the problem of testing for a parameter change in a \(\operatorname{RCINAR}(1)\) model. By using the empirical likelihood method, Zhang et al. (see, e.g., [14, 15]) described how to build confidence regions for the unknown parameters. Roitershtein and Zhong [16] studied the weak limits of extreme values and the growth rate of partial sums.
In this paper, we apply the least-squares method to estimate the variances of random coefficients and errors in model (1.2). The least-squares estimator is derived and its limiting properties are discussed. Furthermore, we also derive a statistic to test the randomness of coefficients.
The rest of this paper is organized as follows. In Section 2, we introduce the methodology and the main results. Simulation results are reported in Section 3. Section 4 provides the proofs of the main results.
The symbols ‘\(\stackrel{d}{\longrightarrow}\)’ and ‘\(\stackrel{p}{\longrightarrow}\)’ denote convergence in distribution and convergence in probability, respectively. Convergence ‘almost surely’ is written as ‘a.s.’. Furthermore, ‘\(M^{\tau}_{k \times p}\)’ denotes the transpose matrix of the \(k\times p\) matrix \(M_{k\times p}\), \(\|\cdot\|\) denotes the Euclidean norm of the matrix or vector.
2 Methodology and main results
In this section, we will first discuss how to apply least-squares method to estimate the unknown parameter \(\sigma_{\phi}^{2}\) and \(\sigma_{Z}^{2}\). Let \(\beta=(\sigma_{\phi}^{2}, \phi(1-\phi)-\sigma_{\phi}^{2}, \sigma_{Z}^{2})^{\tau}\) and \(R_{t}(\phi,\lambda)=X_{t}-E(X_{t}|X_{t-1})\). For simplicity, we write \(R_{t}(\phi,\lambda)\) as \(R_{t}\), omitting the parameter ϕ and λ. Note that \(E(X_{t}|X_{t-1})=\phi X_{t-1}+\lambda\) and \(E(R_{t}^{2}|X_{t-1})=Z_{t}^{\tau}\beta\), where \(Z_{t}=(X_{t-1}^{2}, X_{t-1}, 1)^{\tau}\) The conditional least-squares estimator β̂ of β, based on the sample \(X_{0}, X_{1}, \ldots, X_{n}\) is obtained by minimizing
with β. Substituting \(E(R_{t}^{2}|X_{t-1})=Z_{t}^{\tau}\beta\) in Q and solving
for β, we obtain
Let \(\tilde{\beta}=\hat{\beta}(\hat{\phi},\hat{\lambda})\), where ϕ̂ and λ̂ are given by Zheng et al. [11]. β̃ can be used to estimate the unknown parameter β.
In order to obtain the limiting properties of β̃, we assume the following conditions:
- (A1):
-
\(\{X_{t}\}\) is a strictly stationary and ergodic process.
- (A2):
-
\(E|X_{t}|^{8}<\infty\).
The following theorem gives the limit distribution of β̃.
Theorem 2.1
Assume that (A1) and (A2) hold. Then
where \(W=E(Z_{t} Z_{t}^{\tau}(R_{t}^{2}-Z_{t}^{\tau}\beta)^{2})\), \(\Gamma=E(Z_{t} Z_{t}^{\tau})\).
Let \(\theta=(\sigma_{\phi}^{2},\sigma_{Z}^{2})^{\tau}\), \(T=\Bigl ( {\scriptsize\begin{matrix}{} 1 & 0 \cr 0 & 0 \cr 0 & 1\end{matrix}} \Bigr ) \) and \(\tilde{T}=(1,0,0)^{\tau}\). Based on the estimate of β̃, the estimate θ̂ of θ can be given by \(T^{\tau}\beta\) and the estimate \(\hat{\sigma}_{\phi}^{2}\) of \(\sigma_{\phi}^{2}\) can be given by \(\tilde{T}^{\tau}\beta\). By Theorem 2.1, we have the following corollary.
Corollary 2.2
Assume that (A1) and (A2) hold. Then
where \(W=E(Z_{t} Z_{t}^{\tau}(R_{t}^{2}-Z_{t}^{\tau}\beta)^{2})\), \(\Gamma=E(Z_{t} Z_{t}^{\tau})\).
Corollary 2.3
Assume that (A1) and (A2) hold. Then
where \(W=E(Z_{t} Z_{t}^{\tau}(R_{t}^{2}-Z_{t}^{\tau}\beta)^{2})\), \(\Gamma=E(Z_{t} Z_{t}^{\tau})\).
If \(\sigma_{\phi}=0\), the model (1.2) becomes a \(\operatorname{INAR}(1)\) model. Therefore, in order to test the randomness of coefficients, we only need to test whether the \(\sigma_{\phi}\) is zero. To this aim, we consider the following hypothesis test:
In order to obtain the test statistic, we consider the estimation of W and Γ. Let \(\hat{W}=\frac{1}{n}\sum_{t=1}^{n}(Z_{t} Z_{t}^{\tau}(R_{t}^{2}(\hat{\phi },\hat{\lambda})-Z_{t}^{\tau}\hat{\beta})^{2})\) and \(\hat{\Gamma}=\sum_{t=1}^{n}Z_{t} Z_{t}^{\tau}\). Then Ŵ and Γ̂ are the consistent estimate of W and Γ, respectively.
Corollary 2.4
Assume that (A1) and (A2) hold. Then
and
By Corollary 2.4, it is easy to see that
Combining with Corollary 2.3, we have
By (2.5), we can obtain the confidence interval for the true parameter \(\sigma_{\phi}^{2}\). The asymptotic \(100(1-\nu)\%\) confidence interval of \(\sigma_{\phi}^{2}\) is
where \(u_{\frac{\nu}{2}}\) is the upper \({\nu}/{2}\)-quantile of the standard normal distribution.
3 Simulation study
In this section, we conduct some simulation studies which show that our proposed methods perform very well. We consider the \(\operatorname{RCINAR}(1)\) process
where \(\{\phi_{t}\}\) is a sequence of i.i.d. sequence with \(E(\phi_{t})=\phi \) and \(\operatorname{Var}(\phi_{t})=\sigma_{\phi}^{2}\); \(\{Z_{t}\}\) is a sequence of i.i.d. Poisson sequence with \(E(Z_{t})=\lambda\).
In the first simulation study, we calculate the probability of accepting the null hypothesis when it is true at the nominal level \(\alpha=0.1 \mbox{ and }0.05\). To this aim, we consider the following models.
Model I
\(\phi_{t}=\phi\), \(Z_{t}\sim\operatorname{Poisson}(\lambda)\).
We take \(\phi=0.1, 0.3, 0.5, 0.7, \mbox{and }0.9\), and take \(\lambda=1\mbox{ and }2\). Samples of size \(n=50, 100, \mbox{and }300\). All simulation studies are based on 1,000 repetitions. The results of the simulations are presented in Table 1 and the figures in parentheses are the simulation results at the nominal level \(\alpha=0.05\).
In the second simulation study, we calculate the probability of rejecting the null hypothesis when it is false at the nominal level \(\alpha=0.1\mbox{ and } 0.05\). To this aim, we consider the following model.
Model II
\(\phi_{t}\sim U(0, 2\phi)\), \(Z_{t}\sim\operatorname{Poisson}(\lambda)\).
We take \(\sigma_{\phi}^{2}=0.10, 0.15, 0.20, 0.25, 0.30, \mbox{and } 0.32\). Samples of size \(n=50, 100, \mbox{and }300\). All simulation studies are based on 1,000 repetitions. The results of the simulations are presented in Table 2 and the figures in parentheses are the simulation results at the nominal level \(\alpha=0.05\).
The results in Tables 1 and 2 lead to the following observations: When the null hypothesis is true, we have a larger probability to accept the null hypothesis. When the alternative hypothesis is true we also have a larger probability to reject the null hypothesis. Therefore, using the test method obtained by us, we have a larger probability to make a correct judgment.
4 Proofs of the main results
In order to prove Theorem 2.1, we first prove the following lemma.
Lemma 4.1
Assume that (A1) and (A2) hold. Then
where \(W=E(Z_{t} Z_{t}^{\tau})\), \(\Gamma=E(Z_{t} Z_{t}^{\tau})\).
Proof
After simple algebraic calculations, we have
By the ergodic theorem, we have
Therefore, in order to prove Lemma 4.1, we need only to prove that
By the Cramer-Wold device, it suffices to show that, for all \(c\in R^{3}\setminus(0,0,0)\),
For simplicity of notation, we write \(c^{\tau}Z_{t}(R_{t}^{2}-Z^{\tau}_{t}\beta)\) for \(G_{t,c}(\beta)\). Further, let \(\xi _{nt}=\frac{1}{\sqrt{n}}G_{t,c}(\beta)\) and \(\mathscr{F}_{nt}=\sigma(\xi_{nr}, 1\leq r \leq t) \). Then \(\{\sum_{t=1}^{n}\xi_{nt},\mathscr{F}_{nt}, 1\leq t\leq n, n\geq1\} \) is a zero-mean, square integrable martingale array. By making use of a martingale central limit theorem [17], it suffices to show that
and the σ-fields are nested:
Note that (4.7) is obvious. In the following, we first consider (4.4). By a simple calculation, we have, for all \(\varepsilon>0\),
Now by the Lebesgue control convergence theorem, we immediately see that (4.8) converges to 0 as \(n\rightarrow\infty\). This settles (4.4).
Next we consider (4.5). By the ergodic theorem, we have
Hence (4.5) is proved.
Finally we consider (4.6). Note that \(\{( \frac{1}{\sqrt{n}}G_{t,c}(\theta_{0}) )^{2},t\geq1\}\) is a stationary sequence. Then we have
This proves (4.6). Thus, we complete the proof of Lemma 4.1. □
Proof of Theorem 2.1
Note that
By Lemma 4.1, it suffices to prove that
Note that
By (4.1), we know that
In the following, we prove that
First note that, by the mean value theorem,
where \(\phi^{\ast}\) lies between ϕ̂ and ϕ, \(\lambda^{\ast}\) lies between λ̂ and λ. This implies that
In the following, we prove that \(J_{ni}=o_{p}(1)\), \(i=1,2,3,4,5,6\). First, we consider \(J_{n1}\). Note that
By Theorem 3.1 in Zheng et al. [11], we know that
Moreover, by the ergodic theorem, we have
Note that
This, together with (4.11) and (4.12), proves that
Similarly, we can prove that
Next, we prove that
Note that
By the ergodic theorem, we have
By (4.11), we have
Moreover, note that
which, combined with (4.16), (4.17), and (4.18), implies (4.15).
Similarly, we can prove that
and
Thus, by (4.13), (4.14), (4.15), (4.20), (4.21), and (4.22), (4.10) can be proved. The proof of Theorem 2.1 is thus completed. □
The proof of Corollary 2.2 and Corollary 2.3 is obvious, we omit it here.
Proof of Corollary 2.4
By the ergodic theorem, we can prove that
Next, we prove that
Note that
First, we consider \(H_{n2}\). Note that
By Theorem 2.1, we have
and
Further, by the ergodic theorem, we have
This, combined with (4.24), (4.25), and (4.26), implies that
Similar to the proof of (4.9), we can prove that
and
This, combined with (4.28), we can prove (4.23). The proof of Corollary 2.4 is thus completed. □
5 Conclusion
Integer-valued time series data are fairly common in economics and medicine, such as the number of patients in a hospital at a specific time. In this paper, we propose a method to estimate the unknown parameters in first-order random coefficient integer-valued autoregressive processes. The limiting properties are investigated and simulations indicate that the method is feasible. This method is particularly useful when establishing models for practical data.
References
Davis, RA, Dunsmuir, TM, Wang, Y: Modeling time series of count data. In: Asymptotics. Nonparametrics and Time Series, pp. 63-114 (1999)
MacDonald, IL, Zucchini, WZ: Hidden Markov and Other Models for Discrete-Valued Time Series. Chapman & Hall, London (1997)
Fukasawa, T, Basawa, IV: Estimation for a class of generalized state-space time series models. Stat. Probab. Lett. 60, 459-473 (2002)
Steutal, F, Van Harn, K: Discrete analogues of self-decomposability and stability. Ann. Probab. 7, 893-899 (1979)
McKenzie, E: Some simple models for discrete variate time series. J. Am. Water Resour. Assoc. 21, 645-650 (1985)
McKenzie, E: Some ARMA models for dependent sequences of Poisson counts. Adv. Appl. Probab. 20, 822-835 (1988)
Al-Osh, MA, Alzaid, AA: First order integer-valued autoregressive (\(\operatorname{INAR}(1)\)) processes. J. Time Ser. Anal. 8, 261-275 (1987)
Alzaid, AA, Al-Osh, MA: An integer-valued pth order autoregressive structure (\(\operatorname{INAR}(p)\)) process. J. Appl. Probab. 27, 314-324 (1990)
Du, JG, Li, Y: The integer-valued autoregressive (\(\operatorname{INAR}(p)\)) model. J. Time Ser. Anal. 12, 129-142 (1991)
Cardinal, M, Roy, R, Lambert, J: On the application of integer-valued time series models for the analysis of disease incidence. Stat. Med. 18, 2025-2039 (1999)
Zheng, H, Basawa, IV, Datta, S: First-order random coefficient integer-valued autoregressive processes. J. Stat. Plan. Inference 137, 212-229 (2007)
Zheng, H, Basawa, IV, Datta, S: Inference for pth-order random coefficient integer-valued autoregressive processes. J. Time Ser. Anal. 27, 411-440 (2006)
Kang, J, Lee, S: Parameter change test for random coefficient integer-valued autoregressive processes with application to polio data analysis. J. Time Ser. Anal. 30, 239-258 (2009)
Zhang, H, Wang, D, Zhu, F: The empirical likelihood for first-order random coefficient integer-valued autoregressive processes. Commun. Stat., Theory Methods 40, 492-509 (2011)
Zhang, H, Wang, D, Zhu, F: Empirical likelihood inference for random coefficient \(\operatorname{INAR }(p)\) process. J. Time Ser. Anal. 32, 195-203 (2011)
Roitershtein, A, Zhong, Z: On random coefficient \(\operatorname{INAR}(1)\) processes. Sci. China Math. 56, 177-200 (2013)
Hall, P, Heyde, CC: Martingale Limit Theory and Its Application. Academic Press, New York (1980)
Acknowledgements
We acknowledge the financial supports by National Natural Science Foundation of China (Nos. 11571138, 11271155, 11001105, 11071126, 10926156, 11071269), Specialized Research Fund for the Doctoral Program of Higher Education (No. 20110061110003), Program for New Century Excellent Talents in University (NCET-08-237), Scientific Research Fund of Jilin University (No. 201100011), and Jilin Province Natural Science Foundation (Nos. 20130101066JC, 20130522102JH, 20101596).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zhao, Z., Hu, Y. Statistical inference for first-order random coefficient integer-valued autoregressive processes. J Inequal Appl 2015, 359 (2015). https://doi.org/10.1186/s13660-015-0886-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13660-015-0886-y