
1 Introduction

Let \((y_{i1},\ldots,y_{it},\ldots,y_{iT})\) denote the T repeated count or binary responses for the ith subject, \(i = 1,\ldots,K\). Also, let \(x_{it}\) be the p × 1 vector of covariates corresponding to \(y_{it}\), and let β denote the p × 1 vector of regression effects of \(x_{it}\) on \(y_{it}\). Next suppose that, in addition to \(x_{it}\), the repeated responses of the ith individual are also influenced by a random effect \(\gamma _{i}^{{\ast}}\). Conditional on this random effect, some authors have modeled the longitudinal correlations of the repeated count and binary data by using lag 1 dynamic relationships. More specifically, Sutradhar and Bari (2007) have used an AR(1) (auto-regressive order 1) type dynamic relationship to model the longitudinal correlations for repeated count data. Similarly, Sutradhar et al. (2008) [see also Amemiya (1985, p. 353), Manski (1987), and Honore and Kyriazidou (2000, p. 844)] have used a lag 1 binary dynamic mixed logit (BDML) model to accommodate the correlations of the repeated binary data. The unconditional correlation structures in both of these papers have been computed under the normality assumption for the random effects; specifically, the correlations are obtained by assuming that \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }N(0,\sigma _{\gamma }^{2})\). For convenience, we briefly reproduce these correlation structures for count and binary data as follows.

1.1 Conditional and Unconditional (Normality Based) Correlation Structures for Repeated Count Data

Suppose that

$$\displaystyle\begin{array}{rcl} y_{i1}\vert \gamma _{i}^{{\ast}}& \sim & \mbox{ Poi}(\mu _{ i1}^{{\ast}})\;\mbox{ with }\;\mu _{ i1}^{{\ast}} =\exp (x'_{ i1}\beta +\gamma _{ i}^{{\ast}}) \\ y_{it}\vert \gamma _{i}^{{\ast}}& =& \rho \circ [y_{ i,t-1}\vert \gamma _{i}^{{\ast}}] + [d_{ it}\vert \gamma _{i}^{{\ast}}],\;\mbox{ for}\;t = 2,\ldots,T,{}\end{array}$$
(1)

where \(\mbox{ Poi}(\mu _{it}^{{\ast}})\) refers to the Poisson distribution with mean parameter \(\mu _{it}^{{\ast}}\), and \(\rho \circ y_{i,t-1} =\sum _{ s=1}^{y_{i,t-1}}b_{s}(\rho )\;\mbox{ with}\;Pr[b_{s}(\rho ) = 1] =\rho,\;Pr[b_{s}(\rho ) = 0] = 1-\rho\), and \([d_{it}\vert \gamma _{i}^{{\ast}}] \sim Poi(\mu _{it}^{{\ast}}-\rho \mu _{i,t-1}^{{\ast}}),\mbox{ with}\;\mu _{it}^{{\ast}} =\exp (x'_{it}\beta +\gamma _{i}^{{\ast}})\). The model in (1) is referred to as the Poisson AR(1) model, which produces the correlation between \(y_{iu}\) and \(y_{it}\) as

$$\displaystyle{ \mbox{ corr}(Y _{iu},Y _{it}\vert \gamma _{i}^{{\ast}}) = \rho ^{\vert t-u\vert }\left [\frac{\mu _{iu}^{{\ast}}} {\mu _{it}^{{\ast}}}\right ]^{\frac{1} {2} }, }$$
(2)

which is free of \(\gamma _{i}^{{\ast}}\), but depends on the time dependent covariates and on ρ, a correlation index parameter.
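
To make the thinning mechanism in (1) concrete, here is a minimal simulation sketch in Python; the covariate and parameter values are illustrative assumptions, not from the source. With time-constant covariates, (2) reduces to \(\mbox{ corr}(Y _{i,t},Y _{i,t+1}\vert \gamma _{i}^{{\ast}}) =\rho\), which the simulated series reproduces.

```python
# Sketch: simulate the Poisson AR(1) model (1) for one subject.
import numpy as np

rng = np.random.default_rng(1)
T, rho, sigma_gamma = 50_000, 0.5, 0.8   # illustrative values
beta = np.array([0.3])
x = np.ones((T, 1))                      # hypothetical time-constant covariate
gamma_star = sigma_gamma * rng.standard_normal()
mu_star = np.exp(x @ beta + gamma_star)  # conditional means mu*_{it}

y = np.empty(T, dtype=int)
y[0] = rng.poisson(mu_star[0])
for t in range(1, T):
    thinned = rng.binomial(y[t - 1], rho)                            # rho o y_{i,t-1}
    y[t] = thinned + rng.poisson(mu_star[t] - rho * mu_star[t - 1])  # d_{it}

print(np.corrcoef(y[:-1], y[1:])[0, 1])  # approximately rho = 0.5, by (2)
```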

Note that the likelihood inference for the AR(1) model (1) is extremely complicated. This is because under this model, one writes

$$\displaystyle{ f((y_{i1},\ldots,y_{it},\ldots,y_{iT})\vert \gamma _{i}^{{\ast}}) = f(y_{ i1}\vert \gamma _{i}^{{\ast}})\varPi _{ t=2}^{T}[f_{ it\vert t-1}(y_{it}\vert y_{i,t-1},\gamma _{i}^{{\ast}})] }$$
(3)

where the conditional distribution, namely \(f_{it\vert t-1}(y_{it}\vert y_{i,t-1},\gamma _{i}^{{\ast}})\) has a complicated form given by

$$\displaystyle\begin{array}{rcl} & & f_{it\vert t-1}(y_{it}\vert y_{i,t-1},\gamma _{i}^{{\ast}}) =\exp [-(\mu _{ it}^{{\ast}}-\rho \mu _{ i,t-1}^{{\ast}})] \\ & & \qquad \times \sum _{s_{it}=0}^{\mbox{ min}(y_{it},y_{i,t-1})}\frac{y_{i,t-1}!\rho ^{s_{it}}(1-\rho )^{y_{i,t-1}-s_{it}}(\mu _{ it}^{{\ast}}-\rho \mu _{ i,t-1}^{{\ast}})^{y_{it}-s_{it}}} {s_{it}!(y_{i,t-1} - s_{it})!(y_{it} - s_{it})!} {}\end{array}$$
(4)

(Freeland and McCabe 2004). Furthermore, the integration of the conditional likelihood function (3) over the Gaussian distribution of the random effects, i.e., \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }N(0,\sigma _{\gamma }^{2})\), is an additional complex problem. As opposed to the present generalized linear longitudinal mixed model (GLLMM) setup, over the last two decades many researchers, for example, Breslow and Clayton (1993), Lee and Nelder (1996), Jiang (1998), and Sutradhar (2004), among others, have used the normality assumption for the random effects in a generalized linear mixed model (GLMM) setup and discussed the estimation of β and \(\sigma _{\gamma }^{2}\). In the present GLLMM setup (1)–(2), there is an additional correlation index parameter ρ to estimate.
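
For completeness, the following small sketch (ours, not code from Freeland and McCabe 2004) evaluates the conditional pmf (4) directly; the inputs are illustrative scalars, and summing the probabilities over \(y_{it}\) returns one, as a check.

```python
# Sketch: evaluate f_{it|t-1} in (4), a convolution of a Binomial(y_{i,t-1}, rho)
# thinning term and a Poisson(mu*_it - rho mu*_{i,t-1}) innovation term.
import math

def cond_pmf(y_t, y_prev, mu_t, mu_prev, rho):
    lam = mu_t - rho * mu_prev            # innovation mean, must be nonnegative
    total = 0.0
    for s in range(min(y_t, y_prev) + 1):
        binom = math.comb(y_prev, s) * rho**s * (1 - rho)**(y_prev - s)
        pois = math.exp(-lam) * lam**(y_t - s) / math.factorial(y_t - s)
        total += binom * pois
    return total

print(sum(cond_pmf(k, 3, 2.0, 1.5, 0.4) for k in range(60)))  # ~1.0
```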

When the normality assumption for the random effect \(\gamma _{i}^{{\ast}}\) is used in the count panel data setup, the mean, variance and correlations of the repeated counts contain three unknown parameters, namely β, \(\sigma _{\gamma }^{2}\), and ρ. To be specific, by using the moment generating function (mgf) of \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }N(0,\sigma _{\gamma }^{2})\), that is, \(E_{\gamma _{i}^{{\ast}}}(\exp (a\gamma _{i}^{{\ast}})) =\exp [\frac{1} {2}a^{2}\sigma _{ \gamma }^{2}]\), a being an auxiliary parameter, one obtains the three basic properties of the count panel data as follows (see Sutradhar 2011, Sect. 8.1.1):

$$\displaystyle\begin{array}{rcl} \mu _{it}& =& E[Y _{it}] = E_{\gamma _{i}^{{\ast}}}E[Y _{it}\vert \gamma _{i}^{{\ast}}] =\exp (x'_{ it}\beta )E_{\gamma _{i}^{{\ast}}}(\exp (\gamma _{i}^{{\ast}})) =\exp [x'_{ it}\beta + \frac{1} {2}\sigma _{\gamma }^{2}]{}\end{array}$$
(5)
$$\displaystyle\begin{array}{rcl} \sigma _{itt}& =& \mbox{ var}[Y _{it}] = E_{\gamma _{i}^{{\ast}}}\mbox{ var}[Y _{it}\vert \gamma _{i}^{{\ast}}] + \mbox{ var}_{\gamma _{ i}^{{\ast}}}E[Y _{it}\vert \gamma _{i}^{{\ast}}] = E_{\gamma _{i}^{{\ast}}}\mu _{it}^{{\ast}} + \mbox{ var}_{\gamma _{i}^{{\ast}}}(\mu _{it}^{{\ast}}) \\ & =& \exp (x'_{it}\beta )E_{\gamma _{i}^{{\ast}}}\exp (\gamma _{i}^{{\ast}}) +\exp (2x_{ it}^{'}\beta )\mbox{ var}_{\gamma _{ i}^{{\ast}}}(\exp (\gamma _{i}^{{\ast}})) \\ & =& \mu _{it} +\exp (2x_{it}^{'}\beta )[\exp (2\sigma _{\gamma }^{2}) -\exp (\sigma _{\gamma }^{2})] \\ & =& \mu _{it} + [\exp (\sigma _{\gamma }^{2}) - 1]\mu _{ it}^{2} {}\end{array}$$
(6)

and, for u < t, the unconditional covariance between \(y_{iu}\) and \(y_{it}\) is given by

$$\displaystyle\begin{array}{rcl} \sigma _{iut}& =& \mbox{ cov}[Y _{iu},Y _{it}] = E_{\gamma _{i}^{{\ast}}}[\mbox{ cov}\{(Y _{iu},Y _{it})\vert \gamma _{i}^{{\ast}}\}] + \mbox{ cov}_{\gamma _{ i}^{{\ast}}}[\mu _{iu}^{{\ast}},\mu _{it}^{{\ast}}] \\ & =& \rho ^{t-u}\exp (x'_{ iu}\beta )E_{\gamma _{i}^{{\ast}}}[\exp (\gamma _{i}^{{\ast}})] +\exp ([x_{ iu} + x_{it}]'\beta )\mbox{ var}_{\gamma _{i}^{{\ast}}}\{\exp (\gamma _{i}^{{\ast}})\} \\ & =& \rho ^{t-u}\mu _{ iu} + [\exp (\sigma _{\gamma }^{2}) - 1]\mu _{ iu}\mu _{it}, {}\end{array}$$
(7)

yielding the lag \((t-u)\) correlation

$$\displaystyle{ \mbox{ corr}(Y _{iu},Y _{it}) = \frac{\rho ^{t-u}\mu _{iu} + [\exp (\sigma _{\gamma }^{2}) - 1]\mu _{iu}\mu _{it}} {[\{\mu _{iu} + [\exp (\sigma _{\gamma }^{2}) - 1]\mu _{iu}^{2}\}\{\mu _{it} + [\exp (\sigma _{\gamma }^{2}) - 1]\mu _{it}^{2}\}]^{\frac{1} {2} }}. }$$
(8)

Notice that the unconditional mean (5) and the unconditional variance (6) are functions of β and \(\sigma _{\gamma }^{2}\), whereas the unconditional covariances (7) and correlations (8) are functions of β, \(\sigma _{\gamma }^{2}\), as well as of the dynamic dependence or correlation index parameter ρ. Remark that Sutradhar and Bari (2007), among others, have exploited the aforementioned moments (5)–(8) to develop a four-moments based generalized quasi-likelihood (GQL) approach for the estimation of the parameters β, \(\sigma _{\gamma }^{2}\), and ρ.
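
As a quick numerical illustration, the moments (5)–(8) are available in closed form; the sketch below (with made-up covariates and parameter values) evaluates the lag correlation (8) for one pair of time points.

```python
# Sketch: normality-based unconditional moments (5)-(8).
import numpy as np

def lag_corr(xu, xt, beta, sigma2, rho, lag):
    mu_u = np.exp(xu @ beta + 0.5 * sigma2)                          # (5)
    mu_t = np.exp(xt @ beta + 0.5 * sigma2)
    var_u = mu_u + (np.exp(sigma2) - 1.0) * mu_u**2                  # (6)
    var_t = mu_t + (np.exp(sigma2) - 1.0) * mu_t**2
    cov_ut = rho**lag * mu_u + (np.exp(sigma2) - 1.0) * mu_u * mu_t  # (7)
    return cov_ut / np.sqrt(var_u * var_t)                           # (8)

beta = np.array([0.2, -0.1])
print(lag_corr(np.array([1.0, 0.5]), np.array([1.0, 1.5]),
               beta, sigma2=0.49, rho=0.6, lag=2))
```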

1.2 Conditional and Unconditional (Normality Based) Correlation Structures for Repeated Binary Data

As indicated earlier, over the last three decades many econometricians, such as Heckman (1981), Amemiya (1985, p. 353), Manski (1987), and Honore and Kyriazidou (2000, p. 844), have made attempts to accommodate the dynamic nature of the repeated binary responses by using a binary dynamic mixed logit (BDML) model given by

$$\displaystyle\begin{array}{rcl} Pr(y_{it} = 1\vert \gamma _{i},y_{i,t-1}) = \left \{\begin{array}{ll} \frac{\exp (x_{i1}^{'}\beta +\sigma _{\gamma }\gamma _{ i})} {1+\exp (x_{i1}^{'}\beta +\sigma _{\gamma }\gamma _{i})} & \mbox{ for}\;t = 1 \\ \frac{\exp (x'_{it}\beta +\theta y_{i,t-1}+\sigma _{\gamma }\gamma _{i})} {1+\exp (x'_{it}\beta +\theta y_{i,t-1}+\sigma _{\gamma }\gamma _{i})} & \mbox{ for}\;t = 2,\ldots,T,\\ \end{array} \right.& &{}\end{array}$$
(9)

where β is the effect of the covariates, similar to the Poisson model, θ is referred to as the dynamic dependence parameter, and \(\gamma _{i} = [\gamma _{i}^{{\ast}}/\sigma _{\gamma }]\stackrel{iid}{\sim }(0,1)\). Note that the distribution of \(\gamma _{i}\) is unknown. Also note that even if it is assumed that \(\gamma _{i}\) follows the Gaussian distribution, that is, \(\gamma _{i}\stackrel{iid}{\sim }N(0,1)\), obtaining the likelihood estimates for β, θ, and \(\sigma _{\gamma }^{2}\) is complicated. Honore and Kyriazidou (2000, p. 844) attempted to avoid this estimation difficulty by estimating the β and θ parameters based on transformed observations, such as the first differences of the responses \(y_{i1} - y_{i0},\;y_{i2} - y_{i1},\ldots\), which are approximately independent of \(\gamma _{i}\). They used an approximate weighted log likelihood estimation approach, which, however, puts some impractical restrictions on the covariates, such as assuming \(x_{i3} = x_{i4}\) for the T = 4 case.

Remark that recently Bartolucci and Nigro (2010, Eq. (5), Sect. 3) have constructed a random effects free conditional likelihood for a binary model which is different from (9). More specifically, they exploited the conditional approach for a quadratic exponential type model (Cox 1972; Zhao and Prentice 1990) given by

$$\displaystyle\begin{array}{rcl} Pr(y_{i1},\ldots,y_{iT}\vert \gamma _{i}^{{\ast}},x_{ i1},\ldots,x_{iT}) =\varDelta _{ i}^{-1}\exp [y_{ i}^{'}\xi _{ i} +\theta g'_{i}(y_{i})1_{T(T-1)/2} + c_{i}(y_{i}) +\gamma _{i}^{{\ast}}y_{ i}^{'}1_{ T}]& &{}\end{array}$$
(10)

where \(y_{i} = [y_{i1},\ldots,y_{iT}]^{'},\;g_{i}(y_{i}) = [y_{i1}y_{i2},\ldots,y_{i,T-1}y_{iT}]^{'},\;\mbox{ and}\;\xi _{i} = [\xi _{i1},\ldots,\xi _{iT}]^{'}\), with \(\xi _{it} = x'_{it}\beta\). In (10), \(1_{n}\), for example, is an n-dimensional unit vector, and \(\varDelta _{i}\) is a normalizing constant defined as

$$\displaystyle{\varDelta _{i} =\sum \exp [y_{i}^{'}\xi _{ i} +\theta g'_{i}(y_{i})1_{T(T-1)/2} + c_{i}(y_{i}) +\gamma _{i}^{{\ast}}y_{ i}^{'}1_{ T}],}$$

with the summation over all \(2^{T}\) possible values of \(y_{i}\). Also in (10), \(c_{i}(y_{i})\) is referred to as a shape function that can be expressed as a linear combination of products of three or more of the elements of \(y_{i}\). By ignoring \(c_{i}(y_{i})\), i.e., setting \(c_{i}(y_{i}) = 0\), it can be shown that for a given total score \(\sum \limits _{t=1}^{T}y_{it} = y_{i+}\), the conditional distribution of \(y_{i1},\ldots,y_{iT}\) may be written as

$$\displaystyle{ Pr(y_{i1},\ldots,y_{iT}\vert y_{i+},\gamma _{i}^{{\ast}},x_{ i1},\ldots,x_{iT}) = \varDelta _{i}^{{\ast}}{}^{-1}\exp \left [y_{ i}^{'}\xi _{ i} +\theta \sum \limits _{ t=2}^{T}y_{ i,t-1}y_{it}\right ] }$$
(11)

where \(\varDelta _{i}^{{\ast}}\) is \(\varDelta _{i}\) evaluated at \(\sum \limits _{t=1}^{T}y_{it} = y_{i+}\), i.e., at \(y_{iT} = y_{i+} -\sum \limits _{t=1}^{T-1}y_{it}\). Because the conditional distribution in (11) is free of \(\gamma _{i}^{{\ast}}\), Bartolucci and Nigro (2010) used this conditional distribution to estimate the main parameters β and θ.

We now turn back to the desired binary dynamic mixed model (9). It is clear that even if one is interested only in estimating β and θ, neither the aforementioned weighted likelihood approach of Honore and Kyriazidou (2000) nor the conditional likelihood approach of Bartolucci and Nigro (2010) can be used to remove the random effects from the dynamic mixed model (9) for easier estimation of β and θ. Moreover, for binary panel data analysis following (9), one is, in fact, interested in understanding the mean and variance of the data, which cannot be computed by removing the random effects \(\gamma _{i}\) from the model. Instead, the computation of the moments requires averaging certain functions of \(\gamma _{i}\) over its distribution. Thus, rather than making any attempt to remove \(\gamma _{i}\) from (9), many authors such as Breslow and Clayton (1993), Lee and Nelder (1996), Jiang (1998), and Sutradhar (2004) have studied the inferences for the model (9) under the assumption that \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }N(0,\sigma _{\gamma }^{2})\).

Under this normality assumption, one may obtain the conditional and unconditional means, variances and covariances as follows (see Sutradhar 2011, Sect. 9.2.1). First, conditional on \(\gamma _{i}\), the means of the repeated binary responses under model (9) are given by

$$\displaystyle\begin{array}{rcl} \pi _{it}^{{\ast}}(\gamma _{ i}) = E[Y _{it}\vert \gamma _{i}]& =& \left \{\begin{array}{l} \frac{\exp (x'_{i1}\beta +\sigma _{\gamma }\gamma _{i})} {1+\exp (x'_{i1}\beta +\sigma _{\gamma }\gamma _{i})},\;\;\mbox{ for }i = 1,\ldots,K;\;t = 1 \\ p_{it0} +\pi _{ i,t-1}^{{\ast}}(p_{it1} - p_{it0}),\;\;\mbox{ for }i = 1,\ldots,K;\;t = 2,\ldots,T \end{array} \right.{}\end{array}$$
(12)

where

$$\displaystyle{p_{it1} = \frac{\exp (x'_{it}\beta +\theta +\sigma _{\gamma }\gamma _{i})} {[1 +\exp (x'_{it}\beta +\theta +\sigma _{\gamma }\gamma _{i})]}\;\;\mbox{ and}\;\;p_{it0} = \frac{\exp (x'_{it}\beta +\sigma _{\gamma }\gamma _{i})} {[1 +\exp (x'_{it}\beta +\sigma _{\gamma }\gamma _{i})]}.}$$

Subsequently, one obtains the unconditional means as

$$\displaystyle\begin{array}{rcl} \mu _{it}& =& E(Y _{it}) = Pr(y_{it} = 1) \\ & =& M^{-1}\sum _{ w=1}^{M}\pi _{ it}^{{\ast}}(\gamma _{ iw}) \\ & =& M^{-1}\sum _{ w=1}^{M}[p_{ it0} +\pi _{ i,t-1}^{{\ast}}(p_{ it1} - p_{it0})]_{\vert \gamma _{i}=\gamma _{iw}}{}\end{array}$$
(13)

(Jiang 1998; Sutradhar 2004) where \(\gamma _{iw}\) is the wth (w = 1,…,M) realized value of \(\gamma _{i}\) generated from the standard normal distribution. Here M is a sufficiently large number, such as M = 5000. By (12), the \(p_{it1,w}\) involved in (13), for example, is written as

$$\displaystyle{p_{it1,w} = \frac{\exp (x'_{it}\beta +\theta +\sigma _{\gamma }\gamma _{iw})} {[1 +\exp (x'_{it}\beta +\theta +\sigma _{\gamma }\gamma _{iw})]}.}$$

Next, conditional on \(\gamma _{i}\), for u < t, the second-order expectation may be written as

$$\displaystyle{ E(Y _{iu}Y _{it}\vert \gamma _{i}) =\lambda _{ iut}^{{\ast}}(\gamma _{ i}) = \mbox{ cov}(Y _{iu},Y _{it}\vert \gamma _{i}) +\pi _{iu}^{{\ast}}\pi _{ it}^{{\ast}} =\sigma _{ iut}^{{\ast}} +\pi _{ iu}^{{\ast}}\pi _{ it}^{{\ast}}, }$$
(14)

where the covariance between \(y_{iu}\) and \(y_{it}\), conditional on \(\gamma _{i}\), has the formula

$$\displaystyle{ \sigma _{iut}^{{\ast}} = \mbox{ cov}(Y _{ iu},Y _{it}\vert \gamma _{i}) =\pi _{ iu}^{{\ast}}(\gamma _{ i})(1 -\pi _{iu}^{{\ast}}(\gamma _{ i}))\varPi _{j=u+1}^{t}(p_{ ij1} - p_{ij0}). }$$
(15)

It then follows that the unconditional second-order raw moments have the formula

$$\displaystyle\begin{array}{rcl} \phi _{iut}& =& E(Y _{iu}Y _{it}) = M^{-1}\sum _{ w=1}^{M}\left [\pi _{ iu}^{{\ast}}(\gamma _{ iw})(1 -\pi _{iu}^{{\ast}}(\gamma _{ iw}))\right. \\ & & \left.\times \;\varPi _{j=u+1}^{t}(p_{ ij1,w} - p_{ij0,w}) +\pi _{ iu}^{{\ast}}(\gamma _{ iw})\pi _{it}^{{\ast}}(\gamma _{ iw})\right ], {}\end{array}$$
(16)

yielding the unconditional covariance as

$$\displaystyle{ \sigma _{iut} =\phi _{iut} -\mu _{iu}\mu _{it}, }$$
(17)

where \(\mu _{it}\) is the unconditional mean given by (13).
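
The following sketch assembles (12)–(17) by Monte Carlo for one subject under \(\gamma _{i}\stackrel{iid}{\sim }N(0,1)\); T, M, the covariates and the parameter values are illustrative placeholders.

```python
# Sketch: simulation-based moments (12)-(17) for the BDML model (9).
import numpy as np

rng = np.random.default_rng(7)
T, M = 4, 5000
beta, theta, sigma_gamma = np.array([0.5]), 0.7, 1.0
x = np.linspace(-1.0, 1.0, T).reshape(T, 1)  # hypothetical covariates
eta = x @ beta                               # x'_it beta, t = 1,...,T

gam = rng.standard_normal(M)                 # gamma_iw, w = 1,...,M
p1 = 1 / (1 + np.exp(-(eta[:, None] + theta + sigma_gamma * gam)))  # p_it1
p0 = 1 / (1 + np.exp(-(eta[:, None] + sigma_gamma * gam)))          # p_it0

pi = np.empty((T, M))                        # pi*_it(gamma_iw), recursion (12)
pi[0] = p0[0]
for t in range(1, T):
    pi[t] = p0[t] + pi[t - 1] * (p1[t] - p0[t])

mu = pi.mean(axis=1)                         # unconditional means (13)
u, t = 0, 2                                  # a pair u < t (0-based times)
lam_ut = pi[u] * (1 - pi[u]) * np.prod(p1[u + 1:t + 1] - p0[u + 1:t + 1],
                                       axis=0) + pi[u] * pi[t]      # (14)-(15)
print(mu, lam_ut.mean() - mu[u] * mu[t])     # phi_iut (16) and sigma_iut (17)
```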

1.3 Plan of the Paper Under the Proposed t Random Effects with Unknown Degrees of Freedom ν

In this paper, as opposed to the Gaussian distribution, we consider a wider class of t distributions for the random effects \(\{\gamma _{i}^{{\ast}}\}\), with mean 0, scale parameter \(\lambda _{\gamma }^{2}\), and shape or degrees of freedom parameter ν, i.e., \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }t_{\nu }(0,\lambda _{\gamma }^{2},\nu )\), with probability density given by

$$\displaystyle{ f(\gamma _{i}^{{\ast}}) = \frac{\nu ^{\frac{\nu } {2} }\,\varGamma \left (\frac{\nu +1} {2} \right )} {\sqrt{\pi }\,\varGamma \left (\frac{\nu } {2} \right )} (\lambda _{\gamma }^{2})^{-\frac{1} {2} }\left [\nu +\frac{\gamma _{i}^{{\ast}}{}^{2}} {\lambda _{\gamma }^{2}} \right ]^{-\frac{\nu +1} {2} }. }$$
(18)

This t distribution exhibits heavy symmetric tails when ν is small, and it reduces to the normal distribution \(N(0,\sigma _{\gamma }^{2})\) as \(\nu \rightarrow \infty\). Note, however, that one cannot compute the mgf, that is, \(E_{\gamma _{i}^{{\ast}}}(\exp (a\gamma _{i}^{{\ast}}))\), under this t distribution (18). As a remedy, the moments of this t distribution (18) are computed either from the characteristic function (cf) (Sutradhar 1986) or by direct integration over the distribution. For ν > 4, the first four moments, for example, are given by

$$\displaystyle\begin{array}{rcl} E(\gamma _{i}^{{\ast}})& =& 0,\;\mbox{ var}(\gamma _{ i}^{{\ast}}) = \frac{\nu } {\nu -2}\lambda _{\gamma }^{2} =\sigma _{ \gamma }^{2} \\ E(\gamma _{i}^{{\ast}}{}^{3})& =& 0,\;E(\gamma _{ i}^{{\ast}}{}^{4}) = \frac{3\lambda ^{4}_{ \gamma }\nu ^{2}} {(\nu -2)(\nu -4)} = 3\sigma _{\gamma }^{4}[\frac{\nu -2} {\nu -4}].{}\end{array}$$
(19)
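
These formulas are easy to verify numerically; a quick sketch (with an illustrative ν and \(\lambda _{\gamma }\), using numpy's standard-t generator) follows.

```python
# Sketch: check the variance and fourth moment in (19) against t draws.
import numpy as np

rng = np.random.default_rng(0)
nu, lam = 6.0, 0.8                                 # illustrative values
g = lam * rng.standard_t(df=nu, size=2_000_000)    # gamma*_i ~ t_nu(0, lam^2, nu)
sigma2 = nu / (nu - 2) * lam**2
print(g.var(), sigma2)                                     # var(gamma*_i)
print((g**4).mean(), 3 * sigma2**2 * (nu - 2) / (nu - 4))  # E(gamma*_i^4)
```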

But it follows from (5)–(8) that in the present longitudinal mixed model setup for count data with \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }t_{\nu }(0,\lambda _{\gamma }^{2},\nu )\), one requires the mgf \(E_{\gamma _{i}^{{\ast}}}(\exp (a\gamma _{i}^{{\ast}}))\), which, however, cannot be computed analytically under the \(t_{\nu }\) distribution (18). A similar but different problem arises in the longitudinal mixed model setup for binary data, where for (13)–(15) one needs to generate random effect values \(\gamma _{iw}\) from the standard t distribution \(t(0,1,\nu )\) with ν degrees of freedom, which is, however, unknown in practice.

As a remedy, in this paper, we offer a simulation-based numerical approach to compute the mgf, and develop a GQL estimation approach for the estimation of all parameters of the models including the degrees of freedom parameter ν > 4. More specifically, in Sects. 2 and 3, we discuss the Poisson mixed model with t ν random effects and the desired inferences. The binary model and the inferences with t ν random effects are provided in Sects. 4 and 5. Some concluding remarks are given in Sect. 6.

2 Poisson Mixed Model with t ν Random Effects

2.1 Basic Properties of the Poisson Mixed Model: Unconditional Mean and Variance

In the present setup, \(\gamma _{i}^{{\ast}}\stackrel{iid}{\sim }t_{\nu }(0,\lambda _{\gamma }^{2},\nu )\). Now because, similar to (5)–(6), the unconditional mean and variance have the formulas

$$\displaystyle\begin{array}{rcl} \mu _{it}& =& E[Y _{it}] = E_{\gamma _{i}^{{\ast}}}E[Y _{it}\vert \gamma _{i}^{{\ast}}] = E_{\gamma _{ i}^{{\ast}}}\mu _{it}^{{\ast}} =\exp (x'_{it}\beta )E_{\gamma _{i}^{{\ast}}}\left \{\exp (\gamma _{i}^{{\ast}})\right \}{}\end{array}$$
(20)
$$\displaystyle\begin{array}{rcl} \sigma _{itt}& =& \mbox{ var}[Y _{it}] = E_{\gamma _{i}^{{\ast}}}\mbox{ var}[Y _{it}\vert \gamma _{i}^{{\ast}}] + \mbox{ var}_{\gamma _{ i}^{{\ast}}}E[Y _{it}\vert \gamma _{i}^{{\ast}}] \\ & =& \exp (x'_{it}\beta )E_{\gamma _{i}^{{\ast}}}\left \{\exp (\gamma _{i}^{{\ast}})\right \} +\exp (2x_{ it}^{'}\beta )\left [E_{\gamma _{ i}^{{\ast}}}\left \{\exp (2\gamma _{i}^{{\ast}})\right \} -\left [E_{\gamma _{i}^{{\ast}}}\left \{\exp (\gamma _{i}^{{\ast}})\right \}\right ]^{2}\right ],{}\end{array}$$
(21)

they could be evaluated numerically by simulating \(\gamma _{iw}\), w = 1,…,W, for a large W such as W = 5000, from \(\gamma _{iw}\stackrel{iid}{\sim }t_{\nu }(0,1,\nu )\), and using

$$\displaystyle{ E_{\gamma _{i}^{{\ast}}}\left \{\exp (a\gamma _{i}^{{\ast}})\right \} = E_{\gamma _{ i}}\left \{\exp (a\lambda _{\gamma }\gamma _{i})\right \} \approx \frac{1} {W}\sum _{w=1}^{W}\left [\exp (a\lambda _{\gamma }\gamma _{ iw})\right ], }$$
(22)

in (20)–(21) for a = 1, 2, provided ν were known. Note that for known ν, this simulated approximation in (22) is quite similar to the simulation approximation used by Sutradhar (2008, Sect. 3) [see also Sutradhar et al. (2008, Eq. (2.6))] for the binary case, with random effects generated from the N(0, 1) distribution. However, because in the present case ν is unknown and requires to be estimated, we resolve this simulation issue by generating \(\gamma _{iw}\), w = 1,…,W, first from a reference \(t_{4}(0,1,4)\) distribution (playing the role that the standard normal plays as a reference distribution) and then applying the transformation in Lemma 2.3 below, so that these \(\gamma _{iw}\), w = 1,…,W, subsequently follow the desired \(t_{\nu }(0,1,\nu )\) distribution. Lemmas 2.1 and 2.2 below are needed to write Lemma 2.3.

Lemma 2.1.

Suppose that \(\psi _{i}^{{\ast}}\sim N(0,\lambda _{\gamma }^{2})\). Next, suppose that \(\xi _{i}^{{\ast}}{}^{2}\) is a scalar random variable which follows the well known \(\chi _{\nu }^{2}\) distribution with ν degrees of freedom, that is, \(\xi _{i}^{{\ast}}{}^{2} \sim \chi _{\nu }^{2}\), and that \(\psi _{i}^{{\ast}}\) and \(\xi _{i}^{{\ast}}{}^{2}\) are independent. Then, for \(\psi _{i} = \frac{\psi _{i}^{{\ast}}} {\lambda _{\gamma }}\), the ratio variable \(\gamma _{i}^{{\ast}}\) defined as

$$\displaystyle{ \gamma _{i}^{{\ast}} =\lambda _{\gamma }\psi _{ i}/\sqrt{\xi _{i }^{{\ast} }{}^{2 } /\nu } =\lambda _{\gamma }\gamma _{ i} }$$
(23)

has the \(t_{\nu }(0,\lambda _{\gamma }^{2},\nu )\) distribution given by (18).

However, even though \(\psi _{i}\) in (23) is a parameter free normal variable, an observation \(\gamma _{iw}\) following the t-distribution (18), as needed in (20)–(22), cannot be drawn yet, because the distribution of \(\xi _{i}^{{\ast}}{}^{2}\) depends on the parameter ν. Because ν > 4 in (18), to resolve this issue we suggest using a t-distribution with 4 degrees of freedom as a reference distribution. Suppose that \(\xi _{i}^{2}\) is generated from this \(\chi _{4}^{2}\) distribution. One may then generate a \(\xi _{i}^{{\ast}}{}^{2}\) from \(\chi _{\nu }^{2}\) approximately for any ν > 4, by using the relation between \(\xi _{i}^{2}\) and \(\xi _{i}^{{\ast}}{}^{2}\) given in Lemma 2.2 below.

Lemma 2.2.

If \(\xi _{i}^{2}\) is generated from the \(\chi _{4}^{2}\) distribution, one may then generate \(\xi _{i}^{{\ast}}{}^{2}\) by using the relationship

$$\displaystyle{ \xi _{i}^{{\ast}}{}^{2} = \sqrt{2\nu }\left [\frac{\xi _{i}^{2} - 4} {\sqrt{8}} \right ]+\nu = \frac{1} {2}\sqrt{\nu }\left [\xi _{i}^{2} - 4\right ]+\nu, }$$
(24)

which has the same first two moments as those of \(\chi _{\nu }^{2}\).

One may then generate an observation from a t ν distribution as in Lemma 2.3.

Lemma 2.3.

For w = 1,…,W, with W = 5000 (say), the wth observation \(\gamma _{iw}^{{\ast}}\) from the \(t_{\nu }(0,\lambda _{\gamma }^{2},\nu )\) distribution may be generated by applying Lemma 2.2 to Lemma 2.1. That is,

$$\displaystyle{ \gamma _{iw}^{{\ast}} =\lambda _{\gamma }\{2\nu \}^{\frac{1} {2} } \frac{\psi _{iw}} {\left [\sqrt{\nu }\left (\xi _{iw}^{2} - 4\right ) + 2\nu \right ]^{\frac{1} {2} }} =\lambda _{\gamma }\gamma _{iw}, }$$
(25)

where \(\psi _{iw}\) and \(\xi _{iw}^{2}\) are observations from the standard normal N(0,1) and \(\chi _{4}^{2}\) distributions, respectively.
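
A small sketch of (25) follows (with an illustrative ν; in practice ν is unknown and estimated, the point of the construction being that \(\psi _{iw}\) and \(\xi _{iw}^{2}\) can be generated once, free of ν).

```python
# Sketch: generate gamma_iw approximately from t_nu(0,1,nu) via Eq. (25).
import numpy as np

rng = np.random.default_rng(3)

def draw_t_nu(nu, W, rng):
    psi = rng.standard_normal(W)       # psi_iw ~ N(0,1)
    xi2 = rng.chisquare(df=4, size=W)  # xi_iw^2 ~ chi^2_4
    return np.sqrt(2 * nu) * psi / np.sqrt(np.sqrt(nu) * (xi2 - 4) + 2 * nu)

g = draw_t_nu(nu=8.0, W=1_000_000, rng=rng)
print(g.var(), 8.0 / (8.0 - 2.0))      # roughly nu/(nu - 2), as expected
```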

Consequently, by applying (25) from Lemma 2.3 to (22) and (20), one computes the unconditional mean as

$$\displaystyle\begin{array}{rcl} \mu _{it}(\beta,\lambda _{\gamma },\nu )& =& E[Y _{it}] =\exp (x'_{it}\beta )E_{\gamma _{i}^{{\ast}}}\left \{\exp (\gamma _{i}^{{\ast}})\right \} \\ & =& \exp (x'_{it}\beta )E_{\gamma _{i}}\left \{\exp (\lambda _{\gamma }\gamma _{i})\right \} \\ & =& \exp (x'_{it}\beta ) \frac{1} {W}\sum _{w=1}^{W}\left \{\exp (\lambda _{\gamma }\gamma _{ iw})\right \} \\ & =& \frac{1} {W}\exp (x'_{it}\beta )\sum _{w=1}^{W}\exp \left [ \frac{\lambda _{\gamma }\psi _{iw}} {\left \{ \frac{1} {2\sqrt{\nu }}(\xi _{iw}^{2} - 4) + 1\right \}^{\frac{1} {2} }}\right ] \\ & =& \frac{1} {W}\exp (x'_{it}\beta )\sum _{w=1}^{W}\exp \left \{R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}, {}\end{array}$$
(26)

where, for w = 1,…,W, the \(\psi _{iw}\) are generated from the standard normal N(0, 1) distribution and the \(\xi _{iw}^{2}\) from the \(\chi _{4}^{2}\) distribution; furthermore, \(\psi _{iw}\) and \(\xi _{iw}^{2}\) are independent.

In order to compute the unconditional variance, use \(\mu _{it}^{{\ast}} =\exp (x'_{it}\beta +\gamma _{i}^{{\ast}})\) from (20), and first compute \(\phi _{i,tt} = E[Y _{it}^{2}]\) as follows:

$$\displaystyle\begin{array}{rcl} \phi _{i,tt}(\beta,\lambda _{\gamma },\nu ) =& & E[Y _{it}^{2}] = E_{\gamma _{ i}^{{\ast}}}\left [\mu _{it}^{{\ast}} + \mu _{it}^{{\ast}}{}^{2}\right ] \\ =& & \frac{1} {W}\sum _{w=1}^{W}\left [\exp [x'_{ it}\beta + \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & +\left.\exp [2x'_{it}\beta + \left \{2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ], {}\end{array}$$
(27)

where \(R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\) is defined in (26). Hence, the unconditional variance has the formula

$$\displaystyle{ \sigma _{i,tt}(\beta,\lambda _{\gamma },\nu ) =\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ) -\mu _{it}^{2}(\beta,\lambda _{\gamma },\nu ). }$$
(28)

Note that this variance formula can be obtained from (21) as well. We remark that, unlike under the Poisson-normal mixed model, the mean and variance under the Poisson-\(t_{\nu }\) mixed model are functions of the regression effects β, the scale parameter \(\lambda _{\gamma }\), and also the shape parameter ν of the random effect distribution.
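
A short sketch of (26)–(28) follows, with illustrative values for \(x'_{it}\beta\), \(\lambda _{\gamma }\) and ν; the reference draws \(\psi _{iw}\), \(\xi _{iw}^{2}\) are generated once and held fixed across evaluations.

```python
# Sketch: simulated mean (26), second raw moment (27) and variance (28).
import numpy as np

rng = np.random.default_rng(11)
W = 5000
psi = rng.standard_normal(W)           # psi_iw ~ N(0,1)
xi2 = rng.chisquare(df=4, size=W)      # xi_iw^2 ~ chi^2_4

def R(lam, nu):                        # exponent R(psi, xi2; lambda, nu) in (26)
    return lam * psi / np.sqrt((xi2 - 4) / (2 * np.sqrt(nu)) + 1)

def mu_it(xb, lam, nu):                # (26)
    return np.exp(xb) * np.exp(R(lam, nu)).mean()

def phi_itt(xb, lam, nu):              # (27)
    r = R(lam, nu)
    return (np.exp(xb + r) + np.exp(2 * xb + 2 * r)).mean()

xb, lam, nu = 0.4, 0.7, 6.0            # illustrative x'_it beta, lambda, nu
m = mu_it(xb, lam, nu)
print(m, phi_itt(xb, lam, nu) - m**2)  # mean and variance (28)
```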

2.2 Correlation Properties of the Poisson Mixed Model: Unconditional Covariances

To compute the unconditional covariance between \(y_{iu}\) and \(y_{it}\) (u < t), we first observe from (1)–(2) that their covariance conditional on the random effect \(\gamma _{i}^{{\ast}}\) is not zero. Specifically, by (2), the conditional covariance is given by

$$\displaystyle{ \mbox{ cov}[(Y _{iu},Y _{it})\vert \gamma _{i}^{{\ast}}] =\rho ^{t-u}\mu _{ iu}^{{\ast}}, }$$
(29)

implying that

$$\displaystyle{ E[Y _{iu}Y _{it}\vert \gamma _{i}^{{\ast}}] =\rho ^{t-u}\mu _{ iu}^{{\ast}} +\mu _{ iu}^{{\ast}}\mu _{ it}^{{\ast}}. }$$
(30)

Consequently,

$$\displaystyle\begin{array}{rcl} \delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho )& =& E[Y _{iu}Y _{it}] \\ & =& E_{\gamma _{i}^{{\ast}}}E[Y _{iu}Y _{it}\vert \gamma _{i}^{{\ast}}] \\ & =& E_{\gamma _{i}^{{\ast}}}\left [\rho ^{t-u}\mu _{ iu}^{{\ast}} +\mu _{ iu}^{{\ast}}\mu _{ it}^{{\ast}}\right ] \\ & =& \rho ^{t-u}\mu _{ iu}(\beta,\lambda _{\gamma },\nu ) + E_{\gamma _{i}^{{\ast}}}\left [\mu _{iu}^{{\ast}}\mu _{ it}^{{\ast}}\right ],{}\end{array}$$
(31)

where the unconditional mean \(\mu _{iu}(\beta,\lambda _{\gamma },\nu )\) has a formula similar to that in (26). Next, by a computation similar to (27), one obtains

$$\displaystyle\begin{array}{rcl} & \delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho ) =\rho ^{t-u}\mu _{iu}(\beta,\lambda _{\gamma },\nu ) + E_{\gamma _{i}^{{\ast}}}\left [\exp \left (\{x_{iu} + x_{it}\}'\beta + 2\gamma _{i}^{{\ast}}\right )\right ] & \\ & =\rho ^{t-u}\mu _{iu}(\beta,\lambda _{\gamma },\nu ) +\exp \left [\{x_{iu} + x_{it}\}'\beta \right ] \frac{1} {W}\sum _{w=1}^{W}\exp \left [2R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ].&{}\end{array}$$
(32)

Hence, for u < t, the unconditional covariance between y iu and y it is given by

$$\displaystyle{ \sigma _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho ) =\delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho ) -\mu _{iu}(\beta,\lambda _{\gamma },\nu )\mu _{it}(\beta,\lambda _{\gamma },\nu ), }$$
(33)

where \(\delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho )\) has the formula given by (32) and \(\mu _{it}(\beta,\lambda _{\gamma },\nu )\) is given by (26).
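
Continuing the sketch given after (28), the cross moment (32) and covariance (33) can be computed from the same reference draws (again with illustrative inputs).

```python
# Sketch: simulated cross moment (32) and covariance (33) for u < t.
import numpy as np

rng = np.random.default_rng(11)
W = 5000
psi, xi2 = rng.standard_normal(W), rng.chisquare(df=4, size=W)

def R(lam, nu):
    return lam * psi / np.sqrt((xi2 - 4) / (2 * np.sqrt(nu)) + 1)

def mu(xb, lam, nu):                   # (26)
    return np.exp(xb) * np.exp(R(lam, nu)).mean()

def sigma_ut(xbu, xbt, lam, nu, rho, lag):
    delta = rho**lag * mu(xbu, lam, nu) \
        + np.exp(xbu + xbt) * np.exp(2 * R(lam, nu)).mean()  # (32)
    return delta - mu(xbu, lam, nu) * mu(xbt, lam, nu)       # (33)

print(sigma_ut(0.2, 0.4, lam=0.7, nu=6.0, rho=0.5, lag=2))
```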

3 GQL Estimation for the Parameters of the Poisson Mixed Model

The estimation of the parameters of the model is carried out in cycles of iterations. In Sect. 3.1, we discuss a generalized quasi-likelihood (GQL) (Sutradhar 2003, Sect. 3) estimation approach for the estimation of the main regression parameter β, under the assumption that the remaining parameters (ρ, \(\lambda _{\gamma }\), ν) are known or that their consistent estimates are available. In the subsequent sections, we discuss their consistent estimation.

3.1 GQL Estimation for the Regression Effects β

For β estimation, we exploit the first order responses, namely \(y_{i} = [y_{i1},\ldots,y_{it},\ldots,y_{iT}]'\). Suppose that \(\mu _{i} = E[Y _{i}]\). This T × 1 mean vector is given by \(\mu _{i}(\beta,\lambda _{\gamma },\nu ) = [\mu _{i1},\ldots,\mu _{it},\ldots,\mu _{iT}]'\), where, by (26), \(\mu _{it}(\beta,\lambda _{\gamma },\nu )\) has the formula

$$\displaystyle{\mu _{it}(\beta,\lambda _{\gamma },\nu ) =\exp (x'_{it}\beta ) \frac{1} {W}\sum _{w=1}^{W}\exp \left \{R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}.}$$

Next by using the formulas for the variances σ i, tt (β, λ γ , ν) from (28), and the covariances σ i, ut (β, λ γ , ν, ρ) from (33), we construct the T × T covariance matrix as

$$\displaystyle{\varSigma _{i}(\beta,\lambda _{\gamma },\nu,\rho ) = (\sigma _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho )): T \times T,\;\mbox{ for}\;u = t;\mbox{ and}\;u\neq t.}$$

Note that under the present model \(\sigma _{i,tt}(\cdot )\) does not follow from \(\sigma _{i,ut}(\cdot )\) as a special case. More specifically, the \(\sigma _{i,ut}(\cdot )\)'s are constructed for u < t. The GQL estimating equation for β is then given by

$$\displaystyle{ \sum _{i=1}^{K}\frac{\partial \mu '_{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta } \varSigma _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\left (y_{ i} -\mu _{i}(\beta,\lambda _{\gamma },\nu )\right ) = 0, }$$
(34)

(Sutradhar 2003, 2004) where \(\frac{\partial \mu '_{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta }\) may be computed by using the formula for \(\frac{\partial \mu _{it}(\beta,\lambda _{\gamma },\nu )} {\partial \beta }\) for all t = 1,…,T. This derivative follows from (26), and is given by

$$\displaystyle{\frac{\partial \mu _{it}(\beta,\lambda _{\gamma },\nu )} {\partial \beta } =\mu _{it}(\beta,\lambda _{\gamma },\nu )x_{it}.}$$

Consequently, the GQL estimating equation in (34) reduces to

$$\displaystyle{ \sum _{i=1}^{K}X'_{ i}A_{i}\varSigma _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\left (y_{ i} -\mu _{i}(\beta,\lambda _{\gamma },\nu )\right ) = 0, }$$
(35)

where \(X_{i} = (x_{i1},\ldots,x_{it},\ldots,x_{iT})\) is the p × T covariate matrix for the ith individual, and

$$\displaystyle{A_{i} = \mbox{ diag}[\mu _{i1}(\beta,\lambda _{\gamma },\nu ),\ldots,\mu _{it}(\beta,\lambda _{\gamma },\nu ),\ldots,\mu _{iT}(\beta,\lambda _{\gamma },\nu )]:\; T \times T.}$$
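
In practice, (35) is solved iteratively. The following schematic Fisher-scoring sketch is ours, not the author's implementation: mu_fn and Sigma_fn stand for user-supplied evaluations of (26) and of \(\varSigma _{i}\) built from (28) and (33), and each \(X_{i}\) is stored as T × p rather than the p × T orientation used in the text.

```python
# Sketch: Newton/Fisher-scoring solution of the GQL equation (35).
import numpy as np

def gql_beta(y, X, mu_fn, Sigma_fn, beta0, n_iter=20):
    """y: list of T-vectors; X: list of (T, p) covariate matrices."""
    beta = beta0.copy()
    for _ in range(n_iter):
        score = np.zeros_like(beta)
        info = np.zeros((beta.size, beta.size))
        for yi, Xi in zip(y, X):
            mu_i = mu_fn(Xi, beta)             # T-vector of means, (26)
            D = np.diag(mu_i) @ Xi             # d mu_i / d beta', a (T, p) matrix
            Si_inv = np.linalg.inv(Sigma_fn(Xi, beta))
            score += D.T @ Si_inv @ (yi - mu_i)
            info += D.T @ Si_inv @ D
        beta = beta + np.linalg.solve(info, score)  # scoring update
    return beta
```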

3.1.1 Asymptotic Properties of the GQL Estimator of β

For true β, define

$$\displaystyle{ \bar{f}_{K}(\beta ) = \frac{1} {K}\sum _{i=1}^{K}f_{ i}(\beta ) = \frac{1} {K}\sum _{i=1}^{K}\frac{\partial \mu '_{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta } \varSigma _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\left (y_{ i} -\mu _{i}(\beta,\lambda _{\gamma },\nu )\right ), }$$
(36)

where \(y_{1},\ldots,y_{i},\ldots,y_{K}\) are independent of each other, as they are collected from K independent individuals, but they are not identically distributed, because

$$\displaystyle{ Y _{i} \sim (\mu _{i}(\beta,\lambda _{\gamma },\nu ),\varSigma _{i}(\beta,\lambda _{\gamma },\nu,\rho )), }$$
(37)

where the mean vectors and covariance matrices vary across the individuals i = 1,…,K. By (37), it follows from (36) that

$$\displaystyle\begin{array}{rcl} E[\bar{f}_{K}(\beta )]& =& 0 \\ \mbox{ cov}[\bar{f}_{K}(\beta )]& =& \frac{1} {K^{2}}\sum _{i=1}^{K}\mbox{ cov}[f_{ i}(\beta )] \\ & =& \frac{1} {K^{2}}\sum _{i=1}^{K}\frac{\partial \mu '_{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta } \varSigma _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\frac{\partial \mu _{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta '} \\ & =& \frac{1} {K^{2}}\sum _{i=1}^{K}V _{ i}(\beta,\lambda _{\gamma },\nu,\rho ) = \frac{1} {K^{2}}V _{K}^{{\ast}}(\beta,\lambda _{\gamma },\nu,\rho ).{}\end{array}$$
(38)

Next if the multivariate version of Lindeberg’s condition holds, that is,

$$\displaystyle{ \lim _{K\rightarrow \infty }V ^{{\ast}}_{ K}{}^{-1}\sum _{ i=1}^{K}\sum _{ (f'_{i}V ^{{\ast}}_{K}{}^{-1}f_{i})>\epsilon }f_{i}f'_{i}g(f_{i}) = 0 }$$
(39)

for all ε > 0, with g(⋅) being the probability distribution of \(f_{i}\), then the Lindeberg-Feller central limit theorem (Amemiya 1985, Theorem 3.3.6; McDonald 2005, Theorem 2.2) implies that

$$\displaystyle{ Z_{K} = K[V _{K}^{{\ast}}]^{-\frac{1} {2} }\bar{f}_{K}(\beta ) \rightarrow N_{p}(0,I_{p}). }$$
(40)

Next because \(\hat{\beta }_{GQL}\) is a solution of (34), one writes by (36) that

$$\displaystyle{ \sum _{i=1}^{K}f_{ i}(\hat{\beta }_{GQL}) = 0, }$$
(41)

which, by a first order Taylor series expansion, produces

$$\displaystyle{ \sum _{i=1}^{K}f_{ i}(\beta ) + (\hat{\beta }_{GQL}-\beta )\sum _{i=1}^{K}f'_{ i}(\beta ) = 0. }$$
(42)

That is,

$$\displaystyle\begin{array}{rcl} \hat{\beta }_{GQL}-\beta & =& -\left [\sum _{i=1}^{K}f'_{ i}(\beta )\right ]^{-1}\sum _{ i=1}^{K}f_{ i}(\beta ) \\ & =& -\left [-\sum _{i=1}^{K}\frac{\partial \mu '_{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta } \varSigma _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\frac{\partial \mu _{i}(\beta,\lambda _{\gamma },\nu )} {\partial \beta '} \right ]^{-1}\sum _{ i=1}^{K}f_{ i}(\beta ) \\ & =& \left [V _{K}^{{\ast}}(\beta,\lambda _{\gamma },\nu,\rho )\right ]^{-1}K\bar{f}(\beta ) \\ & =& \left [V _{K}^{{\ast}}(\beta,\lambda _{\gamma },\nu,\rho )\right ]^{-\frac{1} {2} }Z_{K} \rightarrow N(0,V ^{{\ast}}_{K}{}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )),{}\end{array}$$
(43)

by (40). It then follows that

$$\displaystyle{ \lim _{K\rightarrow \infty }\hat{\beta }_{GQL} \rightarrow N(\beta,V ^{{\ast}}_{ K}{}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )). }$$
(44)

Also it follows that

$$\displaystyle{ \vert \vert [V _{K}^{{\ast}}(\beta,\lambda _{\gamma },\nu,\rho )]^{\frac{1} {2} }[\hat{\beta }_{GQL}-\beta ]\vert \vert = O_{p}(\sqrt{p}). }$$
(45)

3.2 GQL Estimation for the Scale and Shape Parameters

Notice from Sect. 2.1 that all three basic moment properties, namely the mean functions (26), the variances (28), and the covariances (33), contain the scale parameter \(\lambda _{\gamma }\) and the shape parameter ν. Thus, it is sensible to exploit all first and second order responses to estimate these parameters. Note that the second order responses consist of both the squares (ss) and the pairwise products (pp) of all repeated observations. Consequently, we consider a vector \(g_{i}\) consisting of all first order and second order responses. In notation, \(g_{i}\) has the form

$$\displaystyle{ g_{i} = [y'_{i},\;y'_{iss},\;y'_{ipp}]': \frac{T(T + 3)} {2} \times 1, }$$
(46)

where \(y_{i} = [y_{i1},\ldots,y_{it},\ldots,y_{iT}]': T \times 1\), as in (34), and

$$\begin{array}{rlrlrl} y_{iss} & = [y_{i1}^{2},\ldots,y_{ it}^{2},\ldots,y_{ iT}^{2}]': T \times 1,\;\mbox{ and}\; & & \\ y_{ipp} & = [y_{i1}y_{i2},\ldots,y_{iu}y_{it},\ldots,y_{i,T-1}y_{iT}]': \frac{T(T - 1)} {2} \times 1. & & \end{array}$$

Let

$$\displaystyle\begin{array}{rcl} E[g_{i}]& =& [\mu '_{i}(\beta,\lambda _{\gamma },\nu ),\phi '_{i}(\beta,\lambda _{\gamma },\nu ),\delta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )]' \\ & =& \eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )\;\mbox{ (say)}, {}\end{array}$$
(47)

where

$$\displaystyle\begin{array}{rcl} \mu _{i}(\beta,\lambda _{\gamma },\nu )& =& [\mu _{i1}(\beta,\lambda _{\gamma },\nu ),\ldots,\mu _{it}(\beta,\lambda _{\gamma },\nu ),\ldots,\mu _{iT}(\beta,\lambda _{\gamma },\nu )]' {}\\ \phi _{i}(\beta,\lambda _{\gamma },\nu )& =& [\phi _{i,11}(\beta,\lambda _{\gamma },\nu ),\ldots,\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ),\ldots,\phi _{i,TT}(\beta,\lambda _{\gamma },\nu )]' {}\\ \delta _{i}(\beta,\lambda _{\gamma },\nu,\rho )& =& [\delta _{i,12}(\beta,\lambda _{\gamma },\nu,\rho ),\ldots,\delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho ),\ldots,\delta _{i,T-1,T}(\beta,\lambda _{\gamma },\nu,\rho )]', {}\\ \end{array}$$

where \(\mu _{it}(\beta,\lambda _{\gamma },\nu )\) and \(\phi _{i,tt}(\beta,\lambda _{\gamma },\nu )\), for all t = 1,…,T, are given by (26) and (27), respectively, and \(\delta _{i,ut}(\beta,\lambda _{\gamma },\nu,\rho )\), for u < t, is defined as in (32). Further, let

$$\displaystyle\begin{array}{rcl} \varOmega _{i}(\beta,\lambda _{\gamma },\nu,\rho )& =& \mbox{ cov}[g_{i}] \\ & =& \left [\begin{array}{ccc} \varSigma _{i} & \varOmega _{i,ss} &\varOmega _{i,pp} \\ \varOmega '_{i,ss} & \varSigma _{i,ss} & \varOmega _{i,sp} \\ \varOmega '_{i,pp}&\varOmega '_{i,sp}&\varSigma _{i,pp}\\ \end{array} \right ],{}\end{array}$$
(48)

where

$$\displaystyle\begin{array}{rcl} \varSigma _{i}& =& \mbox{ cov}[Y _{i}],\;\varSigma _{i,ss} = \mbox{ cov}[Y _{iss}],\;\varSigma _{i,pp} = \mbox{ cov}[Y _{ipp}] {}\\ \varOmega _{i,ss}& =& \mbox{ cov}[Y _{i},Y '_{iss}],\;\varOmega _{i,pp} = \mbox{ cov}[Y _{i},Y '_{ipp}],\;\varOmega _{i,sp} = \mbox{ cov}[Y _{iss},Y '_{ipp}]. {}\\ \end{array}$$

Further, let \(\pi = [\lambda _{\gamma },\nu ]': 2 \times 1\) be the vector of the scale and shape parameters of the random effects distribution. Similar to (34), for known β and ρ, one may then estimate the π vector by solving the GQL estimating equation given by

$$\displaystyle{ \sum _{i=1}^{K}\frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } \varOmega _{i}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )\left (g_{ i} -\eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )\right ) = 0, }$$
(49)

where

$$\displaystyle{ \frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } = \left (\begin{array}{*{10}c} \frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \lambda _{\gamma }} \\ \frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \nu } \end{array} \right ). }$$
(50)

Note that the derivatives in (50) may be computed by using the following general derivatives with respect to \(\lambda _{\gamma }\) and ν:

$$\displaystyle\begin{array}{rcl} \frac{\partial \mu _{it}} {\partial \lambda _{\gamma }} =& & \exp (x'_{it}\beta ) \frac{1} {W}\sum _{w=1}^{W}\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \lambda _{\gamma }} \exp \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}, {}\\ \frac{\partial \mu _{it}} {\partial \nu } =& & \exp (x'_{it}\beta ) \frac{1} {W}\sum _{w=1}^{W}\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \nu } \exp \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}, {}\\ \frac{\partial \phi _{i,tt}} {\partial \lambda _{\gamma }} =& & \frac{1} {W}\sum _{w=1}^{W}\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \lambda _{\gamma }} \left [\exp [x'_{it}\beta + \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. {}\\ & & +\left.2\exp [2x'_{it}\beta + \left \{2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ], {}\\ \frac{\partial \phi _{i,tt}} {\partial \nu } =& & \frac{1} {W}\sum _{w=1}^{W}\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \nu } \left [\exp [x'_{it}\beta + \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. {}\\ & & +\left.2\exp [2x'_{it}\beta + \left \{2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ], {}\\ \frac{\partial \delta _{i,ut}} {\partial \lambda _{\gamma }} =& & \rho ^{t-u}\frac{\partial \mu _{iu}(\beta,\lambda _{\gamma },\nu )} {\partial \lambda _{\gamma }} +\exp \left [\{x_{iu} + x_{it}\}'\beta \right ] \frac{1} {W}\sum _{w=1}^{W}2\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \lambda _{\gamma }} {}\\ & & \times \exp \left [2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ], {}\\ \frac{\partial \delta _{i,ut}} {\partial \nu } =& & \rho ^{t-u}\frac{\partial \mu _{iu}(\beta,\lambda _{\gamma },\nu )} {\partial \nu } +\exp \left [\{x_{iu} + x_{it}\}'\beta \right ] \frac{1} {W}\sum _{w=1}^{W}2\frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \nu } {}\\ & & \times \exp \left [2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ], {}\\ \end{array}$$

where

$$\displaystyle\begin{array}{rcl} \frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \lambda _{\gamma }} & =& \frac{\psi _{iw}} {\left \{ \frac{1} {2\sqrt{\nu }}(\xi _{iw}^{2} - 4) + 1\right \}^{\frac{1} {2} }} \\ \frac{\partial R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )} {\partial \nu } & =& \frac{\lambda _{\gamma }\psi _{iw}(\xi _{iw}^{2} - 4)} {8\nu ^{\frac{3} {2} }\left \{ \frac{1} {2\sqrt{\nu }}(\xi _{iw}^{2} - 4) + 1\right \}^{\frac{3} {2} }}.{}\end{array}$$
(51)
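
A direct transcription of (51) is given below as a sketch, checked against a forward finite-difference approximation with illustrative inputs.

```python
# Sketch: the derivatives of R in (51), verified by finite differences.
import numpy as np

def dR_dlam(psi, xi2, lam, nu):
    D = (xi2 - 4) / (2 * np.sqrt(nu)) + 1
    return psi / np.sqrt(D)

def dR_dnu(psi, xi2, lam, nu):
    D = (xi2 - 4) / (2 * np.sqrt(nu)) + 1
    return lam * psi * (xi2 - 4) / (8 * nu**1.5 * D**1.5)

rng = np.random.default_rng(2)
psi, xi2 = rng.standard_normal(3), rng.chisquare(df=4, size=3)
lam, nu, h = 0.7, 6.0, 1e-6
R = lambda l, n: l * psi / np.sqrt((xi2 - 4) / (2 * np.sqrt(n)) + 1)
print(np.allclose(dR_dnu(psi, xi2, lam, nu),
                  (R(lam, nu + h) - R(lam, nu)) / h, atol=1e-4))  # True
```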

The construction of the GQL estimating equation (49) still requires a computational formula for the weight matrix \(\varOmega _{i}(\beta,\lambda _{\gamma },\nu,\rho )\). Now, because this weight matrix requires the computation of the second, third and fourth order moments of the repeated count data, unlike for Gaussian data the computation of these moments is complicated. Some of the fourth order moments may not be computable without a further joint distributional assumption for these repeated counts. Thus, for simplicity, and because the consistent estimation of the parameters in π does not require the use of the exact weight matrix, in the next section we provide an approximation for the computation of the elements of the weight matrix \(\varOmega _{i}(\beta,\lambda _{\gamma },\nu,\rho )\) by pretending that the correlation index is zero, that is, ρ = 0 in (2). This assumption is equivalent to saying that the repeated counts are assumed to be conditionally (conditional on the random effects) independent (CI).

3.2.1 Computation of \(\varOmega _{i}(\mbox{ CI}) \equiv \varOmega _{i}^{{\ast}}(\beta,\lambda _{\gamma },\nu )\)

Note that, as outlined above, the \(\varOmega _{i}(\beta,\lambda _{\gamma },\nu,\rho ) = \mbox{ cov}(g_{i})\) matrix in (49) will be replaced by

$$\displaystyle{ \mbox{ cov}(g_{i}\vert \rho = 0) =\varOmega _{ i}^{{\ast}}(\beta,\lambda _{\gamma },\nu ), }$$
(52)

which contains moments up to order four under the conditional independence (CI) assumption. More specifically, we compute the \(\varOmega _{i}(\cdot )\) matrix in (48), but under the assumption that ρ = 0, that is,

$$\displaystyle\begin{array}{rcl} \varOmega _{i}^{{\ast}}(\beta,\lambda _{\gamma },\nu )& =& \mbox{ cov}[g_{ i}\vert \rho = 0] \\ & =& \left [\begin{array}{ccc} \varSigma _{i}^{{\ast}} &\varOmega _{i,ss}^{{\ast}}&\varOmega _{i,pp}^{{\ast}} \\ \varOmega ^{{\ast}}{}'_{i,ss} & \varSigma _{i,ss}^{{\ast}}&\varOmega _{i,sp}^{{\ast}} \\ \varOmega ^{{\ast}}{}'_{i,pp}&\varOmega ^{{\ast}}{}'_{i,sp}&\varSigma _{i,pp}^{{\ast}}\\ \end{array} \right ],{}\end{array}$$
(53)

where

$$\displaystyle\begin{array}{rcl} \varSigma _{i}^{{\ast}}& =& \mbox{ cov}[Y _{ i}\vert \rho = 0],\;\varSigma _{i,ss}^{{\ast}} = \mbox{ cov}[Y _{ iss}\vert \rho = 0],\;\varSigma _{i,pp}^{{\ast}} = \mbox{ cov}[Y _{ ipp}\vert \rho = 0] {}\\ \varOmega _{i,ss}^{{\ast}}& =& \mbox{ cov}[(Y _{ i},Y '_{iss})\vert \rho = 0],\;\varOmega _{i,pp}^{{\ast}} = \mbox{ cov}[(Y _{ i},Y '_{ipp})\vert \rho = 0], {}\\ \varOmega _{i,sp}^{{\ast}}& =& \mbox{ cov}[(Y _{ iss},Y '_{ipp})\vert \rho = 0]. {}\\ \end{array}$$
  1. (a)

    Computation of the Second Order Moments Matrix \(\varSigma _{i}^{{\ast}}\):

Because the variances are not affected by the correlation index parameter, their formulas remain the same as in (28). However, the covariances under ρ = 0 will be different from (33). More specifically, the formulas for the variances and covariances under the assumption ρ = 0 are given by

$$\displaystyle\begin{array}{rcl} \mbox{ var}[Y _{it}\vert \rho = 0] =& & \sigma _{i,tt}^{{\ast}}(\beta,\lambda _{\gamma },\nu ) \\ =& & \sigma _{i,tt}(\beta,\lambda _{\gamma },\nu ) =\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ) -\mu _{it}^{2}(\beta,\lambda _{\gamma },\nu ),\;\mbox{ by (28)};\quad {}\end{array}$$
(54)
$$\displaystyle\begin{array}{rcl} \mbox{ cov}[(Y _{iu},Y _{it})\vert \rho = 0] =& & \sigma _{i,ut}^{{\ast}}(\beta,\lambda _{\gamma },\nu ) \\ =& & \exp \left [\{x_{iu} + x_{it}\}'\beta \right ] \frac{1} {W}\sum _{w=1}^{W}\exp \left [2R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ] \\ & & -\mu _{iu}(\beta,\lambda _{\gamma },\nu )\mu _{it}(\beta,\lambda _{\gamma },\nu ),\;\mbox{ by (32)\textendash (33)}. {}\end{array}$$
(55)
  1. (b)

    Computation of the Third Order Moments Matrix \(\varOmega _{i,ss}^{{\ast}}\):

To compute this matrix \(\varOmega _{i,ss}^{{\ast}} = \mbox{ cov}[\{Y _{i},Y '_{iss}\}\vert \rho = 0]\), it is sufficient to compute the elements (i) \(\mbox{ cov}[\{Y _{it},Y _{it}^{2}\}\vert \rho = 0]\), and (ii) \(\mbox{ cov}[\{Y _{iu},Y _{it}^{2}\}\vert \rho = 0]\), for u < t.

  1. (i)

    Formula for \(\mbox{ cov}[\{Y _{it},Y _{it}^{2}\}\vert \rho = 0]:\)

    Notice that this formula does not depend on ρ, and the conditioning on ρ = 0 is not needed. Thus

    $$\displaystyle\begin{array}{rcl} \mbox{ cov}[\{Y _{it},Y _{it}^{2}\}\vert \rho = 0]& =& E[Y _{ it}^{3}] - E[Y _{ it}]E[Y _{it}^{2}] \\ & =& E[Y _{it}^{3}] -\mu _{ it}(\beta,\lambda _{\gamma },\nu )\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ),\;\mbox{ by}\;(54), {}\end{array}$$
    (56)

    where

    $$\displaystyle\begin{array}{rcl} E[Y _{it}^{3}]& =& E_{\gamma _{ i}^{{\ast}}}\left [\mu _{it}^{{\ast}} + 3\mu _{it}^{{\ast}}{}^{2} + \mu _{it}^{{\ast}}{}^{3}\right ] \\ & =& \frac{1} {W}\sum _{w=1}^{W}\left [\exp [x'_{ it}\beta + \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & +\left.3\exp [2x'_{it}\beta + \left \{2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & +\left.\exp [3x'_{it}\beta + \left \{3R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ], {}\end{array}$$
    (57)

    with \(R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\) as defined in (26).

  2. (ii)

    Formula for \(\mbox{ cov}[\{Y _{iu},Y _{it}^{2}\}\vert \rho = 0]\), for u < t:

    Because u and t denote two different time points, the covariance between \(y_{iu}\) and \(y_{it}^{2}\) is a function of the correlation index parameter ρ. However, we now simplify this covariance formula under the assumption that ρ = 0 as follows.

    $$\displaystyle\begin{array}{rcl} \mbox{ cov}[\{Y _{iu},Y _{it}^{2}\}\vert \rho = 0]& =& E[Y _{ iu}Y _{it}^{2}\vert \rho = 0] - E[Y _{ iu}]E[Y _{it}^{2}] \\ & =& E[Y _{iu}Y _{it}^{2}\vert \rho = 0] -\mu _{ iu}(\beta,\lambda _{\gamma },\nu )\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ),\;\mbox{ by (54)}, {}\end{array}$$
    (58)

    where, by (30) and (26)–(27), one writes

    $$\displaystyle\begin{array}{rcl} & E[Y _{iu}Y _{it}^{2}\vert \rho = 0] = E_{\gamma _{i}^{{\ast}}}[E\{Y _{iu}\vert \gamma _{i}^{{\ast}}\}E\{Y _{it}^{2}\vert \gamma _{i}^{{\ast}}\}]& \\ & = E_{\gamma _{i}^{{\ast}}}[\mu _{iu}^{{\ast}}\{\mu _{it}^{{\ast}} + \mu ^{{\ast}}_{it}{}^{2}\}] & {}\end{array}$$
    (59)
    $$\displaystyle\begin{array}{rcl} & = \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right.& \\ & +\left.\exp [\{x_{iu} + 2x_{it}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. & {}\end{array}$$
    (60)
  1. (c)

    Computation of the Third Order Moments Matrix \(\varOmega _{i,pp}^{{\ast}}\):

To compute this matrix \(\varOmega _{i,pp}^{{\ast}} = \mbox{ cov}[\{Y _{i},Y '_{ipp}\}\vert \rho = 0]\), it is sufficient to compute the elements (i) \(\mbox{ cov}[\{Y _{iu},Y _{iu}Y _{it}\}\vert \rho = 0]\) for u < t or u > t, and (ii) \(\mbox{ cov}[\{Y _{iu},Y _{it}Y _{im}\}\vert \rho = 0]\), for \(u\neq t\), \(u\neq m\), t < m.

  1. (i)

    Formula for \(\mbox{ cov}[\{Y _{iu},Y _{iu}Y _{it}\}\vert \rho = 0]:\)

    By similar calculations as in (59)–(60), we write

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu},Y _{iu}Y _{it}\}\vert \rho = 0] = E_{\gamma _{i}^{{\ast}}}[E\{Y _{iu}^{2}Y _{ it}\}\vert \gamma _{i}^{{\ast}}] - E[Y _{ iu}]E[\{Y _{iu}Y _{it}\}\vert \rho = 0] \\ & & \quad = E_{\gamma _{i}^{{\ast}}}[\{\mu _{iu}^{{\ast}} + \mu _{ iu}^{{\ast}}{}^{2}\}\mu _{ it}^{{\ast}}] -\mu _{ iu}(\beta,\lambda _{\gamma },\nu )E_{\gamma _{i}^{{\ast}}}[\mu _{iu}^{{\ast}}\mu _{ it}^{{\ast}}] \\ & & \quad = \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & \qquad +\exp [\{2x_{iu} + x_{it}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}] \\ & & \qquad -\left.\mu _{iu}(\beta,\lambda _{\gamma },\nu )\exp [\{x_{iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. {}\end{array}$$
    (61)
  2. (ii)

    Formula for \(\mbox{ cov}[\{Y _{iu},Y _{it}Y _{im}\}\vert \rho = 0]:\)

    By similar calculations as in (i), we write the formula for this covariance as

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu},Y _{it}Y _{im}\}\vert \rho = 0] \\ & & \quad = \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it} + x_{im}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & \qquad -\left.\mu _{iu}(\beta,\lambda _{\gamma },\nu )\exp [\{x_{it} + x_{im}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. {}\end{array}$$
    (62)
  1. (d)

    Computation of the Fourth Order Moments Matrix \(\varSigma _{i,ss}^{{\ast}}\):

To compute this fourth order matrix, one needs the formulas for two general elements, namely (i) \(\mbox{ var}[Y _{it}^{2}]\), and (ii) \(\mbox{ cov}[\{Y _{iu}^{2},Y _{it}^{2}\}\vert \rho = 0]\). These formulas are developed as follows:

  1. (i)

    Recall from (27) that \(E[Y _{it}^{2}] =\phi _{i,tt}(\beta,\lambda _{\gamma },\nu )\). Next because

    $$\displaystyle{E[Y _{it}^{4}\vert \gamma _{ i}^{{\ast}}] = \left [\mu _{ it}^{{\ast}} + 7\mu _{ it}^{{\ast}}{}^{2} + 6\mu _{ it}^{{\ast}}{}^{3} + \mu _{ it}^{{\ast}}{}^{4}\right ],}$$

    it then follows that

    $$\displaystyle\begin{array}{rcl} & & \mbox{ var}(Y _{it}^{2}) = E[Y _{ it}^{4}] - [E\{Y _{ it}^{2}\}]^{2} \\ & & \quad = E_{\gamma _{i}^{{\ast}}}E[Y _{it}^{4}\vert \gamma _{ i}^{{\ast}}] - [\phi _{ i,tt}(\beta,\lambda _{\gamma },\nu )]^{2} \\ & & \quad = \frac{1} {W}\sum _{w=1}^{W}\left [\exp \left [x'_{ it}\beta + \left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}\right ] + 7\exp \left [2x'_{ it}\beta + \left \{2R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}\right ]\right. \\ & & \quad + \left.6\exp [3x'_{it}\beta + \left \{3R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}] +\exp [4x'_{ it}\beta + \left \{4R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ], {}\end{array}$$
    (63)

    with R(ψ iw , ξ iw 2; λ γ , ν) as defined in (26).

  2. (ii)

    Formula for \(\mbox{ cov}[\{Y _{iu}^{2},Y _{it}^{2}\}\vert \rho = 0]\), for u < t:

    By similar calculations, this covariance has the computing formula given by

    $$\displaystyle{ \mbox{ cov}[\{Y _{iu}^{2},Y _{ it}^{2}\}\vert \rho = 0] = E[\{Y _{ iu}^{2}Y _{ it}^{2}\}\vert \rho = 0] -\phi _{ i,uu}(\beta,\lambda _{\gamma },\nu )\phi _{i,tt}(\beta,\lambda _{\gamma },\nu ), }$$
    (64)

    where

    $$\displaystyle\begin{array}{rcl} E[\{Y _{iu}^{2}Y _{ it}^{2}\}\vert \rho = 0] =& & E_{\gamma _{ i}^{{\ast}}}\left [\{\mu _{iu}^{{\ast}} + \mu _{iu}^{{\ast}}{}^{2}\}\{\mu _{it}^{{\ast}} + \mu _{it}^{{\ast}}{}^{2}\}\right ] \\ =& & \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & +\ \exp \left [\{x_{iu} + 2x_{it}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}\right ] \\ & & +\ \exp \left [\{2x_{iu} + x_{it}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}\right ] \\ & & +\ \left.\exp \left [2\{x_{iu} + x_{it}\}'\beta + 4\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}\right ]\right ]. {}\end{array}$$
    (65)
  1. (e)

    Computation of the Fourth Order Moments Matrix \(\varOmega _{i,sp}^{{\ast}}\):

To compute this fourth order matrix, one needs the formulas for two general covariance elements, namely (i) \(\mbox{ cov}[\{Y _{iu}^{2},Y _{iu}Y _{it}\}\vert \rho = 0]\), and (ii) \(\mbox{ cov}[\{Y _{iu}^{2},Y _{it}Y _{im}\}\vert \rho = 0]\). These formulas are developed as follows:

  1. (i)

    Formula for \(\mbox{ cov}[\{Y _{iu}^{2},Y _{iu}Y _{it}\}\vert \rho = 0]:\)

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu}^{2},Y _{ iu}Y _{it}\}\vert \rho = 0] = E[\{Y _{iu}^{3}Y _{ it}\}\vert \rho = 0] \\ & & \quad -\phi _{i,uu}(\beta,\lambda _{\gamma },\nu ) \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it}\}'\beta \right. \\ & & \quad \left.+2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ],\;\mbox{ by (27) and (32)}, {}\end{array}$$
    (66)

    where

    $$\displaystyle\begin{array}{rcl} E[\{Y _{iu}^{3}Y _{ it}\}\vert \rho = 0] =& & E_{\gamma _{i}^{{\ast}}}\left [E[Y _{iu}^{3}\vert \gamma _{ i}^{{\ast}}]E[Y _{ it}\vert \gamma _{i}^{{\ast}}]\right ] \\ =& & E_{\gamma _{i}^{{\ast}}}\left [\{\mu _{iu}^{{\ast}} + 3\mu _{ iu}^{{\ast}}{}^{2} + \mu _{ iu}^{{\ast}}{}^{3}\}\mu _{ it}^{{\ast}}\right ] \\ =& & \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & +\ 3\exp [\{2x_{iu} + x_{it}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}] \\ & & +\ \left.\exp [\{3x_{iu} + x_{it}\}'\beta + 4\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. {}\end{array}$$
    (67)
  2. (ii)

    \(\mbox{ cov}[\{Y _{iu}^{2},Y _{it}Y _{im}\}\vert \rho = 0]:\)

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu}^{2},Y _{ it}Y _{im}\}\vert \rho = 0] = E_{\gamma _{i}^{{\ast}}}\left [\{\mu _{iu}^{{\ast}} + \mu _{ iu}^{{\ast}}{}^{2}\}\{\mu _{ it}^{{\ast}}\mu _{ im}^{{\ast}}\}\right ] \\ & & \qquad -\phi _{i,uu}(\beta,\lambda _{\gamma },\nu ) \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ it} + x_{im}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ],\quad {}\end{array}$$
    (68)

    where the first term on the right hand side of (68) has the formula

    $$\displaystyle\begin{array}{rcl} & & E_{\gamma _{i}^{{\ast}}}\left [\{\mu _{iu}^{{\ast}} + \mu _{ iu}^{{\ast}}{}^{2}\}\{\mu _{ it}^{{\ast}}\mu _{ im}^{{\ast}}\}\right ] \\ & & \quad = \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it} + x_{im}\}'\beta + 3\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right. \\ & & \qquad + \left.\exp [\{2x_{iu} + x_{it} + x_{im}\}'\beta + 4\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. {}\end{array}$$
    (69)
  1. (f)

    Computation of the Fourth Order Moments Matrix \(\varSigma _{i,pp}^{{\ast}}\):

The computation of this matrix requires the formulas for (i) \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iu}Y _{it}\}\vert \rho = 0]\), (ii) \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iu}Y _{im}\}\vert \rho = 0]\), (iii) \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iv}Y _{it}\}\vert \rho = 0]\), and (iv) \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iv}Y _{im}\}\vert \rho = 0]\). The computations for all four of these covariances are similar. We, for example, give the formulas for the covariances in (i) and (iv).

  1. (i)

    Formula for \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iu}Y _{it}\}\vert \rho = 0]:\)

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu}Y _{it},Y _{iu}Y _{it}\}\vert \rho = 0] = E[\{Y _{iu}^{2}Y _{ it}^{2}\}\vert \rho = 0] \\ & & \qquad -\left [ \frac{1} {W}\sum _{w=1}^{W}\left (\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right )\right ]^{2}, {}\end{array}$$
    (70)

    where \(E[\{Y _{iu}^{2}Y _{it}^{2}\}\vert \rho = 0]\) is computed in (65).

  2. (iv)

    Formula for \(\mbox{ cov}[\{Y _{iu}Y _{it},Y _{iv}Y _{im}\}\vert \rho = 0]:\)

    $$\displaystyle\begin{array}{rcl} & & \mbox{ cov}[\{Y _{iu}Y _{it},Y _{iv}Y _{im}\}\vert \rho = 0] \\ & & \quad = \frac{1} {W}\sum _{w=1}^{W}\left [\exp [\{x_{ iu} + x_{it} + x_{iv} + x_{im}\}'\beta + 4\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ] \\ & & \qquad -\left [ \frac{1} {W}\sum _{w=1}^{W}\exp [\{x_{ iu} + x_{it}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ] \\ & & \qquad \times \left [ \frac{1} {W}\sum _{w=1}^{W}\exp [\{x_{ iv} + x_{im}\}'\beta + 2\left \{R(\psi _{iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right \}]\right ]. {}\end{array}$$
    (71)

3.2.2 Asymptotic Properties of the GQL Estimator \(\hat{\pi }_{GQL} = [\hat{\lambda }_{\gamma,GQL},\hat{\nu }_{GQL}]': 2 \times 1\)

Notice that when \(\varOmega _{i}^{{\ast}}(\beta,\lambda _{\gamma },\nu )\) from Sect. 3.2.1 is used in (49) in place of \(\varOmega _{i}(\beta,\lambda _{\gamma },\nu,\rho )\), one solves the approximate GQL estimating equation

$$\displaystyle{ \sum _{i=1}^{K}\frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } \varOmega _{i}^{{\ast}}{}^{-1}(\beta,\lambda _{\gamma },\nu )\left (g_{ i} -\eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )\right ) = 0, }$$
(72)

for π = (λ γ , ν)′.

Let \(\hat{\pi }_{GQL} = [\hat{\lambda }_{\gamma,GQL},\hat{\nu }_{GQL}]': 2 \times 1\) be the solution of (72). By similar calculations as in Sect. 3.1.1 (see (44)), it can be shown that

$$\displaystyle{ \lim _{K\rightarrow \infty }\hat{\pi }_{GQL} \rightarrow N(\pi,Q^{{\ast}}_{ K}{}^{-1}(\beta,\lambda _{\gamma },\nu,\rho )), }$$
(73)

where

$$\displaystyle\begin{array}{rcl} Q_{K}^{{\ast}} =& & \left [\sum _{ i=1}^{K}\frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } \varOmega _{i}^{{\ast}}{}^{-1}(\beta,\lambda _{\gamma },\nu )\frac{\partial \eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi '} \right ]^{-1} \\ & & \times \ \sum _{i=1}^{K}\frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } \varOmega _{i}^{{\ast}}{}^{-1}(\beta,\lambda _{\gamma },\nu )\varOmega _{ i}(\beta,\lambda _{\gamma },\nu,\rho )\varOmega _{i}^{{\ast}}{}^{-1}(\beta,\lambda _{\gamma },\nu )\frac{\partial \eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi '} \\ & & \times \ \left [\sum _{i=1}^{K}\frac{\partial \eta '_{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi } \varOmega _{i}^{{\ast}}{}^{-1}(\beta,\lambda _{\gamma },\nu )\frac{\partial \eta _{i}(\beta,\lambda _{\gamma },\nu,\rho )} {\partial \pi '} \right ]^{-1}. {}\end{array}$$
(74)
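
In practice, \(Q_{K}^{{\ast}}\) in (74) is assembled subject by subject. The following minimal Python sketch of this sandwich computation assumes the per-subject gradient matrices \(\partial \eta _{i}/\partial \pi '\) and the two covariance matrices have already been evaluated at the working parameter values; all names are illustrative placeholders.

```python
import numpy as np

def sandwich_cov(D, Omega_star, Omega):
    """Q*_K of (74). For each subject i:
       D[i]          : derivative  d eta_i / d pi'   (q_i x 2)
       Omega_star[i] : working covariance  Omega*_i  (q_i x q_i)
       Omega[i]      : true covariance     Omega_i   (q_i x q_i)
    All three lists are assumed precomputed at the parameter values."""
    H = np.zeros((2, 2))   # "bread": sum_i D_i' Omega*_i^{-1} D_i
    M = np.zeros((2, 2))   # "meat":  sum_i D_i' Omega*_i^{-1} Omega_i Omega*_i^{-1} D_i
    for Di, Wi, Oi in zip(D, Omega_star, Omega):
        Wi_inv = np.linalg.inv(Wi)
        H += Di.T @ Wi_inv @ Di
        M += Di.T @ Wi_inv @ Oi @ Wi_inv @ Di
    H_inv = np.linalg.inv(H)
    return H_inv @ M @ H_inv
```
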

3.3 Moment Estimation of Correlation Index Parameter ρ

Recall from (33) that

$$\displaystyle\begin{array}{rcl} & & E[(Y _{iu} -\mu _{iu}(\cdot ))(Y _{it} -\mu _{it}(\cdot ))] =\rho ^{t-u}\mu _{ iu}(\beta,\lambda _{\gamma },\nu ) {}\\ & & \qquad +\exp \left [\{x_{iu} + x_{it}\}'\beta \right ] \frac{1} {W}\sum _{w=1}^{W}\exp \left [2R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ] -\mu _{ iu}(\beta,\lambda _{\gamma },\nu )\mu _{it}(\beta,\lambda _{\gamma },\nu ). {}\\ \end{array}$$

Consequently, by using the lag 1 based pairwise products of the responses, one obtains

$$\displaystyle\begin{array}{rcl} & & E\sum _{i=1}^{K}\sum _{ t=1}^{T-1}\frac{\{(Y _{it} -\mu _{it}(\cdot ))(Y _{i,t+1} -\mu _{i,t+1}(\cdot ))\}} {K(T - 1)} =\rho \sum _{ i=1}^{K}\sum _{ t=1}^{T-1} \frac{\mu _{it}(\cdot )} {K(T - 1)} \\ & & \quad + \frac{1} {KW}\sum _{i=1}^{K}\sum _{ w=1}^{W}\{\exp \left [2R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ]\}\frac{\{\sum _{t=1}^{T-1}\exp \left [\{x_{ it} + x_{i,t+1}\}'\beta \right ]\}} {T - 1} \\ & & \quad -\sum _{i=1}^{K}\sum _{ t=1}^{T-1}\frac{\mu _{it}(\cdot )\mu _{i,t+1}(\cdot )} {K(T - 1)}. {}\end{array}$$
(75)

Further, one writes

$$\displaystyle{ E\sum _{i=1}^{K}\sum _{ t=1}^{T}\frac{\{(Y _{it} -\mu _{it}(\cdot ))^{2}\}} {KT} =\sum _{ i=1}^{K}\sum _{ t=1}^{T}\frac{\sigma _{i,tt}(\cdot )} {KT}, }$$
(76)

where the variance σ i, tt (⋅ ) has the formula given by (28).

Now by dividing (75) by (76), and using a first order approximation, one obtains an approximately unbiased moment estimator \(\hat{\rho }_{M}\) for ρ as

$$\displaystyle\begin{array}{rcl} & & \hat{\rho }_{M} \simeq \left [\frac{\sum _{i=1}^{K}\sum _{t=1}^{T-1}\{(Y _{it} -\mu _{it}(\cdot ))(Y _{i,t+1} -\mu _{i,t+1}(\cdot ))\}/\{K(T - 1)\}} {\sum _{i=1}^{K}\sum _{t=1}^{T}\{(Y _{it} -\mu _{it}(\cdot ))^{2}\}/\{KT\}} \right ] \\ & & \qquad \div \left [\frac{\sum _{i=1}^{K}\sum _{t=1}^{T-1}\mu _{it}(\cdot )/\{K(T - 1)\}} {\sum _{i=1}^{K}\sum _{t=1}^{T}\sigma _{i,tt}(\cdot )/\{KT\}} \right ] \\ & & \qquad -\left [ \frac{1} {KW}\sum _{i=1}^{K}\sum _{ w=1}^{W}\{\exp \left [2R(\psi _{ iw},\xi _{iw}^{2};\lambda _{\gamma },\nu )\right ]\}\frac{\{\sum _{t=1}^{T-1}\exp \left [\{x_{ it} + x_{i,t+1}\}'\beta \right ]\}} {T - 1} \right ] \\ & & \qquad \div \left [\sum _{i=1}^{K}\sum _{ t=1}^{T-1} \frac{\mu _{it}(\cdot )} {K(T - 1)}\right ] + \left [\sum _{i=1}^{K}\sum _{ t=1}^{T-1}\frac{\mu _{it}(\cdot )\mu _{i,t+1}(\cdot )} {K(T - 1)} \right ] \div \left [\sum _{i=1}^{K}\sum _{ t=1}^{T-1} \frac{\mu _{it}(\cdot )} {K(T - 1)}\right ].{}\end{array}$$
(77)

Under some regularity conditions on the covariates ensuring that \(\mbox{ var}[\hat{\rho }_{M}]\) is bounded, it follows that the moment estimator \(\hat{\rho }_{M}\) is consistent for ρ; this is mainly because \(\hat{\rho }_{M}\) given by (77) is approximately unbiased for ρ.
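
Given the fitted means, variances, linear predictors, and the Monte Carlo factors, (77) reduces to ratios of simple averages. A minimal Python sketch, with illustrative argument names, assuming all inputs have been precomputed at the current parameter estimates:

```python
import numpy as np

def rho_moment(y, mu, sig_tt, eta, E2g):
    """Moment estimator rho_hat_M of (77).
    y, mu, sig_tt, eta : K x T arrays of responses, fitted means mu_it(.),
                         variances sigma_i,tt(.), and linear predictors x_it'beta;
    E2g                : length-K array of W^{-1} sum_w exp(2 gamma*_iw)."""
    r = y - mu
    A = np.mean(r[:, :-1] * r[:, 1:])   # lag-1 products, averaged over K(T-1) terms
    B = np.mean(r ** 2)                 # squared deviations, averaged over KT terms
    C = np.mean(mu[:, :-1])             # lag-1 means, averaged over K(T-1) terms
    D = np.mean(sig_tt)                 # variances, averaged over KT terms
    E = np.mean(E2g * np.exp(eta[:, :-1] + eta[:, 1:]).mean(axis=1))
    F = np.mean(mu[:, :-1] * mu[:, 1:])
    return (A / B) / (C / D) - E / C + F / C
```
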

4 Binary Dynamic Mixed Logit Model with t ν Random Effects

Recall the binary dynamic mixed logit (BDML) model given in (9), that is,

$$\displaystyle{Pr(y_{it} = 1\vert \gamma _{i},y_{i,t-1}) = \left \{\begin{array}{ll} \frac{\exp (x_{i1}^{'}\beta +\sigma _{\gamma }\gamma _{ i})} {1+\exp (x_{i1}^{'}\beta +\sigma _{\gamma }\gamma _{i})} & \mbox{ for}\;t = 1 \\ \frac{\exp (x'_{it}\beta +\theta y_{i,t-1}+\sigma _{\gamma }\gamma _{i})} {1+\exp (x'_{it}\beta +\theta y_{i,t-1}+\sigma _{\gamma }\gamma _{i})} & \mbox{ for}\;t = 2,\ldots,T.\\ \end{array} \right.}$$

Under the normality assumption for the random effects, i.e., when \(\gamma _{i}\stackrel{iid}{\sim }N(0,1)\), the basic properties of this BDML model, such as the unconditional mean, variance and correlations, are given by (13)–(17). In the following subsection, we provide these properties for the BDML model under the assumption that the random effects follow a t-distribution with ν degrees of freedom.

4.1 Basic Properties of the Binary Mixed Model: Unconditional Mean and Variance

By calculations similar to those in the normal case (13), one obtains the approximate unconditional mean based on t ν random effects as

$$\displaystyle\begin{array}{rcl} E[Y _{it}]& =& \mu _{it}(\beta,\;\theta,\;\lambda _{\gamma },\;\nu ) = W^{-1}\sum _{ w=1}^{W}\pi _{ it}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2}) \\ & =& W^{-1}\sum _{ w=1}^{W}\left [p_{ it0}(\psi _{iw},\;\xi _{iw}^{2}) +\pi _{ i,t-1}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\left \{p_{ it1}(\psi _{iw},\xi _{iw}^{2}) - p_{ it0}(\psi _{iw},\xi _{iw}^{2})\right \}\right ],{}\end{array}$$
(78)

where

$$\displaystyle\begin{array}{rcl} \pi _{i1}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})& =& p_{ i10}(\psi _{iw},\xi _{iw}^{2}) = \frac{\exp (x_{i1}^{'}\beta +\gamma _{ iw}^{{\ast}})} {1 +\exp (x_{i1}^{'}\beta +\gamma _{ iw}^{{\ast}})},\;\mbox{ and} {}\\ p_{ity_{i,t-1}}(\psi _{iw},\xi _{iw}^{2})& =& \frac{\exp (x'_{it}\beta +\theta y_{i,t-1} +\gamma _{ iw}^{{\ast}})} {1 +\exp (x'_{it}\beta +\theta y_{i,t-1} +\gamma _{ iw}^{{\ast}})}, {}\\ \end{array}$$

with \(\gamma _{iw}^{{\ast}}\) as the t(0, λ γ 2, ν) random effect given by (25). Next, because y it is a binary observation, it follows that

$$\displaystyle{ \mbox{ var}[Y _{it}] =\sigma _{i,tt}(\beta,\;\theta,\;\lambda _{\gamma },\;\nu ) =\mu _{it}(\beta,\;\theta,\;\lambda _{\gamma },\;\nu )[1 -\mu _{it}(\beta,\;\theta,\;\lambda _{\gamma },\;\nu )], }$$
(79)

where the unconditional mean μ it (β,  θ,  λ γ ,  ν) has the recursive type formula as in (78).
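
The recursion in (78) is easy to implement numerically. The following Python sketch computes μ it and var[Y it ] for one subject by simulating W copies of the random effect; the closed form used here for \(\gamma _{iw}^{{\ast}}\) is an assumption inferred from the derivatives (91)–(92) below, since (25) is not reproduced in this section.

```python
import numpy as np

rng = np.random.default_rng(7)

def gamma_star(psi, xi2, lam, nu):
    # Assumed transformation (25), as implied by the derivatives (91)-(92).
    return lam * psi * np.sqrt(2.0 * nu) / np.sqrt(np.sqrt(nu) * (xi2 - 4.0) + 2.0 * nu)

def bdml_mean_var(x, beta, theta, lam, nu, W=5000):
    """Unconditional means (78) and variances (79) of the BDML model.
    x: T x p covariate matrix for one subject."""
    T = x.shape[0]
    g = gamma_star(rng.standard_normal(W), rng.chisquare(4, W), lam, nu)
    eta = x @ beta                                   # x_it' beta, length T
    expit = lambda z: 1.0 / (1.0 + np.exp(-z))
    pi_star = np.empty((W, T))
    pi_star[:, 0] = expit(eta[0] + g)                # p_i10 in (78)
    for t in range(1, T):
        p0 = expit(eta[t] + g)                       # p_it0 (y_{i,t-1} = 0)
        p1 = expit(eta[t] + theta + g)               # p_it1 (y_{i,t-1} = 1)
        pi_star[:, t] = p0 + pi_star[:, t - 1] * (p1 - p0)   # recursion in (78)
    mu = pi_star.mean(axis=0)                        # (78): W-average
    return mu, mu * (1.0 - mu)                       # (79)
```
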

4.2 Computation of Unconditional Covariances for BDML Model with t ν Random Effects

To compute the covariance between y iu and y it (u < t), we note that under the present dynamic model (9), conditional on the random effect \(\gamma _{i}^{{\ast}}\) defined by (25), y iu and y it are not independent. This is because, conditional on \(\gamma _{i}^{{\ast}}\), y it and y i, t−1, for example, satisfy the dynamic dependence relationship (9). Next because

$$\begin{array}{rlrlrl} E[\{Y _{iu}Y _{it}\}\vert \gamma _{i}^{{\ast}}] & = \mbox{ cov}[\{Y _{ iu},Y _{it}\}\vert \gamma _{i}^{{\ast}}] + E[Y _{ iu}\vert \gamma _{i}^{{\ast}}]E[Y _{ it}\vert \gamma _{i}^{{\ast}}] & & \\ & =\sigma _{ i,ut}^{{\ast}}(\psi _{ i},\xi _{i}^{2}) +\pi _{ iu}^{{\ast}}(\psi _{ i},\xi _{i}^{2})\pi _{ it}^{{\ast}}(\psi _{ i},\xi _{i}^{2}), & & \end{array}$$

with

$$\displaystyle{ \sigma _{i,ut}^{{\ast}}(\psi _{ i},\xi _{i}^{2}) =\pi _{ iu}^{{\ast}}(\psi _{ i},\xi _{i}^{2})[1 -\pi _{ iu}^{{\ast}}(\psi _{ i},\xi _{i}^{2})]\varPi _{ j=u+1}^{t}\left [p_{ ij1}(\psi _{i},\xi _{i}^{2}) - p_{ ij0}(\psi _{i},\xi _{i}^{2})\right ], }$$
(80)

(Sutradhar and Farrell 2007), one may compute the covariance between y iu and y it , first by computing E[Y iu Y it ] using

$$\displaystyle{ E[Y _{iu}Y _{it}] = W^{-1}\sum _{ w=1}^{W}\left [\sigma _{ i,ut}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2}) +\pi _{ iu}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\pi _{ it}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\right ] =\tau _{ i,ut},\;\mbox{ (say)}, }$$
(81)

where ψ iw and ξ iw 2 are generated from N(0, 1) and χ 4 2, respectively, in order to compute \(\gamma _{iw}^{{\ast}}\) by (25), and \(\pi _{it}^{{\ast}}(\psi _{iw},\xi _{iw}^{2})\) is computed by (78).
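
Combining (80) and (81), the joint moment τ i, ut can be simulated in the same way as the mean. A self-contained Python sketch follows (0-based time indices; the same assumed closed form for \(\gamma _{iw}^{{\ast}}\) as in the sketch of Sect. 4.1):

```python
import numpy as np

rng = np.random.default_rng(7)
expit = lambda z: 1.0 / (1.0 + np.exp(-z))

def bdml_joint_moment(x, u, t, beta, theta, lam, nu, W=5000):
    """tau_i,ut = E[Y_iu Y_it] of (81), for u < t (0-based indices here)."""
    psi, xi2 = rng.standard_normal(W), rng.chisquare(4, W)
    g = lam * psi * np.sqrt(2 * nu) / np.sqrt(np.sqrt(nu) * (xi2 - 4.0) + 2 * nu)  # assumed (25)
    eta = x @ beta
    T = x.shape[0]
    # recursion (78) for pi*_ij(psi_w, xi_w^2), all j
    pi_star = np.empty((W, T))
    pi_star[:, 0] = expit(eta[0] + g)
    for j in range(1, T):
        p0, p1 = expit(eta[j] + g), expit(eta[j] + theta + g)
        pi_star[:, j] = p0 + pi_star[:, j - 1] * (p1 - p0)
    # conditional covariance (80): pi*_iu (1 - pi*_iu) prod_{j=u+1}^t (p_ij1 - p_ij0)
    prod = np.ones(W)
    for j in range(u + 1, t + 1):
        prod *= expit(eta[j] + theta + g) - expit(eta[j] + g)
    sig_ut = pi_star[:, u] * (1.0 - pi_star[:, u]) * prod
    return np.mean(sig_ut + pi_star[:, u] * pi_star[:, t])   # (81): W-average
```
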

5 GQL Estimation for the Parameters of the BDML Model with t ν Random Effects

It is clear from Sects. 4.1 and 4.2 that the basic properties of the BDML (binary dynamic mixed logit) model (9), that is, the first and second order moments of the repeated binary responses, contain all four parameters of the model, namely β,  θ,  λ γ ,  and ν. Consequently, we exploit all first and second order observations, and minimize their generalized distance from their corresponding means to construct a GQL estimating equation [Sutradhar (2010, Sect. 5.4); see also Sutradhar (2011, Sect. 9.2)] for these desired parameters. Note that for binary data, \(y_{it}^{2} = y_{it}\). One may, thus, consider a vector of first and second order responses given by

$$\displaystyle{v_{i} = (y_{i1},\ldots,y_{it},\ldots,y_{iT},y_{i1}y_{i2},\ldots,y_{iu}y_{it},\ldots,y_{i(T-1)}y_{iT})',}$$

for the purpose of constructing the desired estimating equation. Now denote E[V i ] by

$$\displaystyle\begin{array}{rcl} \zeta _{i} = E[V _{i}]& =& \eta _{i}(\beta,\theta,\lambda _{\gamma },\nu ) = [\mu _{i1},\ldots,\mu _{it},\ldots,\mu _{iT},\tau _{i,12},\ldots,\tau _{i,ut},\ldots,\tau _{i,(T-1)T}]' \\ & =& [\mu '_{i},\tau '_{i}]', {}\end{array}$$
(82)

where the formula for the unconditional mean μ it for all t = 1, , T, is given by (78), and τ i, ut = E[Y iu Y it ] for all u < t, may be computed by (81). Further let α = (β′, θ, λ γ , ν)′, and let Ω i denote the \(T(T + 1)/2 \times T(T + 1)/2\) covariance matrix of v i . Following Sutradhar (2010) [see also Sutradhar (2004)], one may then write the GQL estimating equation for α as

$$\displaystyle{ \sum _{i=1}^{K}\frac{\partial \zeta '_{i}} {\partial \alpha } \varOmega _{i}^{-1}(v_{ i} -\zeta _{i}) = 0, }$$
(83)

which may be solved by using the iterative equation

$$\displaystyle{ \hat{\alpha }_{GQL}(r + 1) =\hat{\alpha } _{GQL}(r) + \left [\left \{\sum _{i=1}^{K}\frac{\partial \zeta '_{i}} {\partial \alpha } \varOmega _{i}^{-1}\frac{\partial \zeta _{i}} {\partial \alpha '} \right \}^{-1}\sum _{ i=1}^{K}\frac{\partial \zeta '_{i}} {\partial \alpha } \varOmega _{i}^{-1}(v_{ i} -\zeta _{i})\right ]_{\vert \alpha =\hat{\alpha }_{GQL}(r)}. }$$
(84)

Note that to compute the Ω i matrix for (83) and (84), one needs to compute the following elements: (a) var[Y it ]; (b) cov[Y iu , Y it ]; (c) var[Y iu Y it ]; (d) cov[Y iu Y it , Y i ℓ Y im ]; and (e) cov[Y iu , Y im Y it ]. However, all these elements in (a)–(e) may be computed by using the moments up to order four given in Sects. 4.1, 4.2, and 5.1 below. For example,

$$\displaystyle\begin{array}{rcl} \mbox{ cov}[Y _{iu}Y _{it},Y _{i\ell}Y _{im}]& =& E[Y _{iu}Y _{it}Y _{i\ell}Y _{im}] - E[Y _{iu}Y _{it}]E[Y _{i\ell}Y _{im}] \\ & =& \tilde{\phi }_{i,ut\ell m} -\tau _{i,ut}\tau _{i,\ell m}, {}\end{array}$$
(85)

where the formula for \(\tilde{\phi }_{i,ut\ell m}\) is given in (96), and the formula for τ i, ut , for example, is given in (81).
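
Before turning to the gradients, we note that the iteration (84) is a standard Gauss–Newton type update and is simple to code once ζ i , its Jacobian, and Ω i are available. A minimal Python sketch, in which the three callables are user-supplied placeholders rather than part of the model specification:

```python
import numpy as np

def gql_iterate(alpha0, v, zeta, dzeta, Omega, n_iter=25, tol=1e-8):
    """Newton-type iteration (84) for the GQL estimating equation (83).
    v     : list of K observed vectors v_i (first and second order responses);
    zeta  : alpha -> list of mean vectors zeta_i, as in (82);
    dzeta : alpha -> list of Jacobians d zeta_i / d alpha';
    Omega : alpha -> list of covariance matrices Omega_i.
    The three callables are assumed supplied by the user."""
    alpha = np.asarray(alpha0, dtype=float)
    for _ in range(n_iter):
        Z, DZ, OM = zeta(alpha), dzeta(alpha), Omega(alpha)
        H = sum(D.T @ np.linalg.solve(O, D) for D, O in zip(DZ, OM))
        s = sum(D.T @ np.linalg.solve(O, vi - z)
                for D, O, vi, z in zip(DZ, OM, v, Z))
        step = np.linalg.solve(H, s)
        alpha += step
        if np.linalg.norm(step) < tol:
            break
    return alpha
```
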

Computation of \(\frac{\partial \zeta '_{i}} {\partial \alpha }\) for (84):

Because ζ i = [μ′ i , τ′ i ]′, the gradients for searching for the estimate of α = (β′, θ, λ γ , ν)′ can be computed by using the formulas for \(\frac{\partial \mu _{it}} {\partial \alpha }\) and \(\frac{\partial \tau _{i,ut}} {\partial \alpha }\), where μ it and τ i, ut are given by (78) and (81), respectively. For this purpose, we derive these formulas by using

$$\displaystyle\begin{array}{rcl} \frac{\partial \mu _{it}} {\partial \alpha } =& & W^{-1}\sum _{ w=1}^{W}\left [\frac{\partial p_{it0}(\psi _{iw},\;\xi _{iw}^{2})} {\partial \alpha } +\{ \frac{\partial \pi _{i,t-1}^{{\ast}}(\psi _{iw},\xi _{iw}^{2})} {\partial \alpha } \}\left \{p_{it1}(\psi _{iw},\xi _{iw}^{2}) - p_{ it0}(\psi _{iw},\xi _{iw}^{2})\right \}\right. \\ & & +\ \left.\pi _{i,t-1}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\left \{\frac{\partial p_{it1}(\psi _{iw},\xi _{iw}^{2})} {\partial \alpha } -\frac{\partial p_{it0}(\psi _{iw},\xi _{iw}^{2})} {\partial \alpha } \right \}\right ], {}\end{array}$$
(86)
$$\displaystyle\begin{array}{rcl} \frac{\partial \tau _{i,ut}} {\partial \alpha } =& & W^{-1}\sum _{ w=1}^{W}\frac{\partial \left [\sigma _{i,ut}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2}) +\pi _{ iu}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\pi _{ it}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})\right ]} {\partial \alpha },{}\end{array}$$
(87)

where

$$\displaystyle{\sigma _{i,ut}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2}) =\pi _{ iu}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})[1-\pi _{ iu}^{{\ast}}(\psi _{ iw},\xi _{iw}^{2})]\varPi _{ j=u+1}^{t}\left [p_{ ij1}(\psi _{iw},\xi _{iw}^{2}) - p_{ ij0}(\psi _{iw},\xi _{iw}^{2})\right ].}$$

Note that to compute the derivative of the product factor involved in \(\sigma _{i,ut}^{{\ast}}(\cdot )\), one can use the formula

$$\displaystyle\begin{array}{rcl} & & \frac{\partial \varPi _{j=u+1}^{t}\left [p_{ij1}(\psi _{iw},\xi _{iw}^{2}) - p_{ij0}(\psi _{iw},\xi _{iw}^{2})\right ]} {\partial \alpha } \\ =& & \varPi _{j=u+1}^{t}\left [p_{ ij1}(\psi _{iw},\xi _{iw}^{2}) - p_{ ij0}(\psi _{iw},\xi _{iw}^{2})\right ] \\ & & \times \ \sum _{j=u+1}^{t}\frac{\partial \log \left [p_{ij1}(\psi _{iw},\xi _{iw}^{2}) - p_{ ij0}(\psi _{iw},\xi _{iw}^{2})\right ]} {\partial \alpha }.{}\end{array}$$
(88)

To complete the formulation of the above derivatives, we now give the derivatives for one term, namely \(p_{ij1}(\psi _{iw},\xi _{iw}^{2})\), with respect to each element of α = (β′, θ, λ γ , ν)′. To be specific,

$$\displaystyle\begin{array}{rcl} \frac{\partial p_{ij1}(\psi _{iw},\xi _{iw}^{2})} {\partial \beta } =& & p_{ij1}(\psi _{iw},\xi _{iw}^{2})(1 - p_{ ij1}(\psi _{iw},\xi _{iw}^{2}))x_{ ij};{}\end{array}$$
(89)
$$\displaystyle\begin{array}{rcl} \frac{\partial p_{ij1}(\psi _{iw},\xi _{iw}^{2})} {\partial \theta } =& & p_{ij1}(\psi _{iw},\xi _{iw}^{2})(1 - p_{ ij1}(\psi _{iw},\xi _{iw}^{2}));{}\end{array}$$
(90)
$$\displaystyle\begin{array}{rcl} \frac{\partial p_{ij1}(\psi _{iw},\xi _{iw}^{2})} {\partial \lambda _{\gamma }} =& & p_{ij1}(\psi _{iw},\xi _{iw}^{2})(1 - p_{ ij1}(\psi _{iw},\xi _{iw}^{2})) \\ & & \times \ \left [\psi _{iw}\{2\nu \}^{\frac{1} {2} }\left [\sqrt{\nu }\left (\xi _{iw}^{2} - 4\right ) + 2\nu \right ]^{-\frac{1} {2} }\right ];\;\mbox{ and}\;{}\end{array}$$
(91)
$$\displaystyle\begin{array}{rcl} \frac{\partial p_{ij1}(\psi _{iw},\xi _{iw}^{2})} {\partial \nu } =& & p_{ij1}(\psi _{iw},\xi _{iw}^{2})(1 - p_{ ij1}(\psi _{iw},\xi _{iw}^{2}))\sqrt{2}\lambda _{\gamma }\psi _{ iw} \\ & & \times \ \left [\frac{1} {2}\{\nu \}^{-\frac{1} {2} }\left [\sqrt{\nu }\left (\xi _{iw}^{2} - 4\right ) + 2\nu \right ]^{-\frac{1} {2} }\right. \\ & & -\ \left.\frac{1} {2}\{\nu \}^{\frac{1} {2} }\left [\sqrt{\nu }\left (\xi _{iw}^{2} - 4\right ) + 2\nu \right ]^{-\frac{3} {2} }[\frac{1} {2}\{\nu \}^{-\frac{1} {2} }(\xi _{iw}^{2} - 4) + 2]\right ].{}\end{array}$$
(92)
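
For a numerical check of (89)–(92), these derivatives can be coded directly. The Python sketch below evaluates p ij1 and its four partial derivatives at a single simulated (ψ iw , ξ iw 2) pair, using the closed form of \(\gamma _{iw}^{{\ast}}\) read off from (91)–(92); comparing against finite differences is an easy validation.

```python
import numpy as np

expit = lambda z: 1.0 / (1.0 + np.exp(-z))

def p_and_grads(x_ij, psi, xi2, beta, theta, lam, nu):
    """p_ij1 and its derivatives (89)-(92) at one simulated (psi, xi2) pair.
    Uses the assumed transformation gamma* = lam * psi * sqrt(2 nu) * h^{-1/2},
    with h = sqrt(nu)(xi2 - 4) + 2 nu, read off from (91)-(92)."""
    h = np.sqrt(nu) * (xi2 - 4.0) + 2.0 * nu
    g = lam * psi * np.sqrt(2.0 * nu) / np.sqrt(h)
    p = expit(x_ij @ beta + theta + g)       # p_ij1: previous response y = 1
    q = p * (1.0 - p)
    d_beta = q * x_ij                                             # (89)
    d_theta = q                                                   # (90)
    d_lam = q * psi * np.sqrt(2.0 * nu) / np.sqrt(h)              # (91)
    dh = 0.5 * (xi2 - 4.0) / np.sqrt(nu) + 2.0                    # dh / d nu
    d_nu = q * np.sqrt(2.0) * lam * psi * (
        0.5 / np.sqrt(nu * h) - 0.5 * np.sqrt(nu) * h ** (-1.5) * dh)  # (92)
    return p, d_beta, d_theta, d_lam, d_nu
```
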

5.1 Computation of Higher Order Moments to Construct Ω i in (84)

Note that when the first and second order responses are used to construct the distance functions for the estimation of the parameters β,  θ,  λ γ ,  and ν, one requires the third and fourth order moments, which enter the weight matrix of the estimating equations. The first (mean) and second order moments are computed in (78), (79) and (81) using suitable closed form expressions. Higher order moments, such as the third and fourth order moments, are more conveniently computed numerically (Sutradhar et al. 2008). To be specific, for the computation of the third order moments, let \(\sum _{(y_{iu},y_{i\ell},y_{it})\ni s}\) denote the summation over all binary variables in the sample space s containing the T − 3 elements out of T elements other than y iu , y iℓ , y it . One may then compute the third order moments as

$$\displaystyle\begin{array}{rcl} E[Y _{iu}Y _{i\ell}Y _{it}]& =& P[y_{iu} = 1,y_{i\ell} = 1,y_{it} = 1] \equiv \tilde{\delta }_{i,u\ell t} \\ & =& W^{-1}\sum _{ w=1}^{W}\sum _{ y_{iu},y_{il},y_{it}\ni s}\left [f(y_{i1}\vert \gamma _{iw}^{{\ast}})\varPi _{ j=2}^{T}f(y_{ ij}\vert y_{i,j-1},\gamma _{iw}^{{\ast}})\right ]_{ y_{iu}=1,y_{i\ell}=1,y_{it}=1}{}\end{array}$$
(93)

where, following (78),

$$\displaystyle\begin{array}{rcl} f(y_{i1}\vert \gamma _{iw}^{{\ast}})& =& [p_{ i10}(\gamma _{iw}^{{\ast}})]^{y_{i1} }[1 - p_{i10}(\gamma _{iw}^{{\ast}})]^{1-y_{i1} } \\ f(y_{ij}\vert y_{i,j-1},\gamma _{iw}^{{\ast}})& =& [p_{ ijy_{i,j-1}}(\gamma _{iw}^{{\ast}})]^{y_{ij} }[1 - p_{ijy_{i,j-1}}(\gamma _{iw}^{{\ast}})]^{1-y_{ij} }.{}\end{array}$$
(94)

After some algebra, one may simplify the third order moments in (93) as

$$\displaystyle{ \tilde{\delta }_{i,u\ell t} = W^{-1}\sum _{ w=1}^{W}\sum _{ y_{iu},y_{i\ell},y_{it}\ni s}\left [\tilde{p}_{i10}(y_{i1},\gamma _{iw}^{{\ast}})\varPi _{ j=2}^{T}\tilde{p}_{ ij1}(y_{ij},y_{i,j-1},\gamma _{iw}^{{\ast}})\right ]_{ y_{iu}=1,y_{i\ell}=1,y_{it}=1}, }$$
(95)

with \(\tilde{p}_{i10}(y_{i1},\gamma _{iw}^{{\ast}}) = \frac{\exp \{y_{i1}(x_{i1}^{'}\beta +\gamma _{ iw}^{{\ast}})\}} {1+\exp (x_{i1}^{'}\beta +\gamma _{iw}^{{\ast}})}\), and \(\tilde{p}_{ij1}(y_{ij},y_{i,j-1},\gamma _{iw}^{{\ast}}) = \frac{\exp \{y_{ij}(x_{ij}^{'}\beta +\theta y_{ i,j-1}+\gamma _{iw}^{{\ast}})\}} {1+\exp (x_{ij}^{'}\beta +\theta y_{i,j-1}+\gamma _{iw}^{{\ast}})},\) where \(\gamma _{iw}^{{\ast}}\equiv \gamma _{iw}^{{\ast}}(\lambda _{\gamma },\nu;\psi _{iw},\xi _{iw}^{2})\) as defined by (25).

The computation of the fourth order moments is similar to that of the third order moments. Let \(\sum _{(y_{iu},y_{i\ell},y_{im},y_{it})\ni s^{{\ast}}}\) denote the summation over all binary variables in the sample space s ∗ containing the T − 4 elements out of T elements other than \(y_{iu},y_{i\ell},y_{im},y_{it}\). Now by implementing this summation, following (95), one writes the formula for the fourth order moments as

$$\displaystyle\begin{array}{rcl} E[Y _{iu}Y _{i\ell}Y _{im}Y _{it}] =& & W^{-1}\sum _{ w=1}^{W}\sum _{ y_{iu},y_{i\ell},y_{im},y_{it}\ni s^{{\ast}}}\left [\tilde{p}_{i10}(y_{i1},\gamma _{iw}^{{\ast}})\right. \\ & & \times \ \left.\varPi _{j=2}^{T}\tilde{p}_{ ij1}(y_{ij},y_{i,j-1},\gamma _{iw}^{{\ast}})\right ]_{ y_{iu}=1,y_{i\ell}=1,y_{im}=1,y_{it}=1} \\ =& & \tilde{\phi }_{i,u\ell mt},\;\mbox{ (say).} {}\end{array}$$
(96)

This completes the computation of all moments up to order four. These moments are exploited to construct the GQL estimating Eq. (84) for all parameters involved in the model, namely \(\beta,\;\theta,\;\lambda _{\gamma },\;\mbox{ and}\;\nu\).
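
Because (95) and (96) are sums over all binary paths with the selected coordinates held at one, they can be computed by brute force enumeration of the 2 T−3 (respectively 2 T−4) free coordinates when T is small. A minimal Python sketch for the third order moment (95), under the same assumed closed form of \(\gamma _{iw}^{{\ast}}\) as before; the fourth order moment (96) only enlarges the fixed index set:

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(7)

def third_moment(x, idx, beta, theta, lam, nu, W=2000):
    """delta~_{i,ult} of (95): P[y_iu = y_il = y_it = 1], obtained by
    enumerating all binary paths with the positions in idx fixed at 1."""
    T = x.shape[0]
    psi, xi2 = rng.standard_normal(W), rng.chisquare(4, W)
    g = lam * psi * np.sqrt(2 * nu) / np.sqrt(np.sqrt(nu) * (xi2 - 4.0) + 2 * nu)  # assumed (25)
    eta = x @ beta
    total = np.zeros(W)
    free = [j for j in range(T) if j not in idx]
    for bits in product([0, 1], repeat=len(free)):     # the sum over s in (95)
        y = np.empty(T, dtype=int)
        y[list(idx)] = 1
        y[free] = bits
        # log-probability of the path: p~_{i10} * prod_j p~_{ij1}, see below (95)
        lp = y[0] * (eta[0] + g) - np.log1p(np.exp(eta[0] + g))
        for j in range(1, T):
            z = eta[j] + theta * y[j - 1] + g
            lp = lp + y[j] * z - np.log1p(np.exp(z))
        total += np.exp(lp)
    return total.mean()                                # W-average in (95)

# example: T = 5, E[Y_i1 Y_i3 Y_i5] (0-based indices 0, 2, 4)
x = rng.normal(size=(5, 2)); beta = np.array([0.2, -0.1])
print(third_moment(x, (0, 2, 4), beta, theta=0.4, lam=0.5, nu=5.0))
```
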

5.2 Asymptotic Normality and Consistency of \(\hat{\alpha }_{GQL}\)

Following (83), for true α, define

$$\displaystyle{ \bar{g}_{K}(\alpha ) = \frac{1} {K}\sum _{i=1}^{K}g_{ i}(\alpha ) = \frac{1} {K}\sum _{i=1}^{K}\frac{\partial \zeta '_{i}} {\partial \alpha } \varOmega _{i}^{-1}(v_{ i} -\zeta _{i}), }$$
(97)

where v 1, , v i , , v K are independent of each other, as they are collected from K independent individuals, but they are not identically distributed because

$$\displaystyle{ v_{i} \sim (\zeta _{i}(\beta,\theta,\lambda _{\gamma },\nu ),\varOmega _{i}(\beta,\theta,\lambda _{\gamma },\nu )), }$$
(98)

where the mean vectors in (82) and the covariance matrices in (83) differ from individual to individual.

Now one may derive the asymptotic distribution of \(\hat{\alpha }_{GQL}\) by using the same technique as for the derivation of the asymptotic distribution of \(\hat{\beta }_{GQL}\) given in Sect. 3.1.1 for the Poisson mixed model. Thus, it can be shown that

$$\displaystyle{ \lim _{K\rightarrow \infty }\hat{\alpha }_{GQL} \rightarrow N(\alpha,\tilde{V }_{K}^{-1}(\beta,\theta,\lambda _{\gamma },\nu )), }$$
(99)

or equivalently

$$\displaystyle{ \vert \vert [\tilde{V }_{K}(\beta,\theta,\lambda _{\gamma },\nu )]^{\frac{1} {2} }[\hat{\alpha }_{GQL}-\alpha ]\vert \vert = O_{p}(\sqrt{p + 3}), }$$
(100)

where

$$\displaystyle{\tilde{V }_{K}(\beta,\theta,\lambda _{\gamma },\nu ) =\sum _{ i=1}^{K}\frac{\partial (\zeta _{i})'} {\partial \alpha } \ [\varOmega _{i}(\beta,\theta,\lambda _{\gamma },\nu )]^{-1}\frac{\partial (\zeta _{i})} {\partial \alpha '}.}$$

This establishes the consistency of \(\hat{\alpha }_{GQL}\) for α.
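
The normalizing matrix \(\tilde{V }_{K}\) is an information-type sum and reuses the same per-subject building blocks as the iteration (84). A minimal sketch, assuming the Jacobians and covariance matrices are precomputed:

```python
import numpy as np

def v_tilde(DZ, OM):
    """V~_K below (100): sum_i (d zeta_i/d alpha')' Omega_i^{-1} (d zeta_i/d alpha').
    DZ, OM: lists of per-subject Jacobians and covariance matrices."""
    return sum(D.T @ np.linalg.solve(O, D) for D, O in zip(DZ, OM))

# Wald-type standard errors for alpha_hat then follow from diag(inv(V~_K)).
```
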

6 Discussion

It has been assumed in various econometric studies for count and binary panel data that the distribution of the random effects involved in the model is unknown. This makes the estimation of the regression effects β and the dynamic dependence parameter ρ under the Poisson dynamic mixed model, and of β and the dynamic dependence parameter θ under the binary dynamic mixed model, very difficult. As a remedy, some authors such as Wooldridge (1999) and Montalvo (1997) developed certain estimation techniques that automatically remove the random effects from the model and estimate the remaining parameters, β and ρ in the Poisson case. However, as demonstrated by Sutradhar et al. (2014), these estimation approaches have two drawbacks. First, the conditional maximum likelihood (CML) method used by Wooldridge (1999) and the instrumental variables based GMM (IVGMM) method used by Montalvo (1997) become useless for the estimation of the regression effects β when the covariates are stationary (time independent), even though they are able to remove the random effects. Second, when the random effects γ i or γ i ∗ are removed technically, the estimates of β and the dynamic dependence parameter (ρ) alone are not sufficient to compute the mean, variance and correlations of the data, which is a major drawback from the viewpoint of data understanding/analysis. One encounters similar problems with the weighted kernel likelihood approach of Honore and Kyriazidou (2000, p. 84) for inference in binary dynamic mixed logit models.

The aforementioned inference issues do not arise when one can assume a suitable distribution for the random effects. Because the random effects appear in the linear predictive function of the generalized linear model, many studies, mainly in the statistics literature, have considered normality a reasonable assumption for the distribution of the random effects. Thus, under the assumption that the random effects involved in the longitudinal mixed models follow N(0, σ γ 2), the GQL estimation of the regression effects β and σ γ 2, and the moment estimation of the longitudinal correlation index parameter ρ, were developed by Sutradhar and Bari (2007), for example, for longitudinal count data, and by Sutradhar (2008) for longitudinal binary data. See also Breslow and Clayton (1993), Breslow and Lin (1995), Lin and Breslow (1996), Jiang (1998), and Sutradhar and Qu (1998).

However, in this paper we have provided an extension of the normal latent effects based longitudinal mixed models for count and binary data to t ν latent effects based models. The inference for these extended models is complex not only because of the additional degrees of freedom parameter but also because, unlike the simulation of N(0, 1) random effects in the Gaussian case, t ν (0, 1) random effects cannot be simulated directly when ν is unknown. We have resolved this issue through a new transformation which allows one to generate data from a t 4(0, 1) distribution and then proceed with the estimation of the ν parameter. In summary, we have developed a GQL estimation technique for the estimation of all parameters involved in the models, including the degrees of freedom parameter.