Abstract
Spatial dynamic panel data (SDPD) models have received great attention in economics in recent 10 years. Existing approaches for the estimation and test of SDPD models are quasi-maximum likelihood (QML) approach and generalized method of moments (GMM). In this article, we introduce the empirical likelihood (EL) method to the statistical inference for SDPD models. The EL ratio statistics are constructed for the parameters of spatial dynamic panel data models. It is shown that the limiting distributions of the empirical likelihood ratio statistics are chi-squared distributions, which are used to construct confidence regions for the parameters of the models. Simulation results show that the EL based confidence regions outperform the normal approximation based confidence regions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Real data are often observed at different locations and times, which are called as spatial panel data (SPD). Examples are economic growth rates of major cities in China over last 40 years, monthly unemployment rates of states in USA in the last decade and daily infection rates of COVID-19 in major cites in Hubei province in China over 3 months since December 31, 2019. These data may be modelled by SPD models. The research to various SPD models can be found in Anselin (1988), Elhorst (2003), Baltagi et al. (2003), Baltagi and Li (2006), Chen and Conley (2001), Pesaran (2004), Kapoor et al. (2007), Baltagi et al. (2007), Lee and Yu (2010a), Mutl and Pfaffermayr (2011), Parent and LeSage (2011) and Baltagi et al. (2013), among others. By adding a dynamic element into a SPD model Anselin (2001) proposes a spatial dynamic panel data (SDPD) model, which increases the flexibility of a SPD model. Obviously, SPD models belong to SDPD models. There has been a growing interest in the statistical inferences for SDPD models since then. For an overview on the SPDP models, refer to Yu et al. (2008) and Lee and Yu (2010b), Su and Yang (2015), Lee and Yu (2015a, b), Yu and Lee (2010), Elhorst (2010), Elhorst (2005), Yang et al. (2006), Mutl (2006), Su and Yang (2007), and Lee and Yu (2010c), among others. There are two popular methods for the estimation and test of SPD and SDPD models: quasi-maximum likelihood (QML) approach and generalized method of moments (GMM), which can be seen in above references.
In this article, we use the empirical likelihood (EL) method, proposed by Owen (1988, 1990), to construct the confidence regions for the parameters in a SDPD model. It is observed, in the case of independent observations, that the EL method to construct confidence intervals/regions has many advantages over its counterparts like the normal-approximation-based method and the bootstrap method (e.g., Hall and La Scala 1990; Hall 1992). A excellent review on EL for regressions can be found in Chen and Keilegom (2009). There are a lot of references on EL methods for independent samples or in the context of sample surveys. To save space, we list a few of them such as Owen (2001), Qin and Lawless (1994), Chen and Qin (1993), Zhong and Rao (2000) and Wu (2004). The date are dependent and follow certain structures in SDPD models. The study of the EL method for some SPDP models enjoys a certain progress. For example, there are a few articles studying the EL method for the pure spatial data (PSD, the special case of SPD with a fixed time). For instance, Nordman (2008a, b) and Bandyopadhyay et al. (2015) use the blockwise EL (BEL) proposed by Kitamura (1997) to PSD. Recently, by exploring inherent martingale structures, Qin (2021) and Jin and Lee (2019) use the EL method to construct confidence intervals/regions in PSD models. Further, Li and Qin (2020) extends the EL method proposed by Qin (2021) and Jin and Lee (2019) to SPD models. We note that there is no research work on the EL method for SDPD models.
There are many kinds of SDPD models. Lee and Yu (2010c) gives a review on the classification and research development of some SDPD models. Su and Yang (2015) introduces the QML method to SDPD models with spatial errors, where different types of space-specific effects and different ways that initial observations being generated (exogenously or endogenously) are investigated. Since SDPD models are quite complicated, as a starting point, in this article, we study the EL method for the SDPD models in Su and Yang (2015) with the restriction that there is no space-specific effects (or named as zero drift) and initial observations are generated exogenously. The study of the EL method for general SDPD models without above restriction is left for our future study. Our research results show that the EL based confidence regions generally outperform the normal approximation (NA) based confidence regions when the space units are large enough.
The rest of the article is organized as follows. Section 2 presents the main results. Results from a simulation study are reported in Sect. 3. Section 4 gives the analysis of real data. All technical details are presented in Sect. 5.
2 Main results
In this article, we suppose that there are n individual units and T time periods and the sampling data satisfy the following SDPD model with spatial error:
where \(y_t=(y_{1t},\ldots , y_{nt})'\) is an n-dimensional column vector of observed dependent variables, \(\rho (|\rho |< 1)\) characterizes the dynamic effect, \(x_t=(x_{1t},\ldots , x_{nt})'\) is an \(n\times p\) matrix of time-varying exogenous variables, \(z=(z_1,\ldots , z_n)'\) is an \(n\times q\) matrix of time-invariant exogenous variables, and \(\beta \) and \(\gamma \) are \(p\times 1\) and \(q\times 1\) regression coefficients, respectively. The disturbance vector \(\epsilon _t=(\epsilon _{1t},\ldots , \epsilon _{nt})'\) is an \(n\times 1\) vector of errors. The parameter \(\lambda \) is a spatial autoregressive coefficient and \(W_n\) is an \(n\times n\) spatial weighting matrix of constants, \(\nu _t=(\nu _{1t},\ldots , \nu _{nt})'\) is an \(n\times 1\) column vector, and \(\{\nu _{it}\}\) are i.i.d. across t and i with zero mean and variance \(\sigma ^2_\nu \). The spatial weighting matrix is also called contiguity matrix, which is determined by the spatial dependence of n spatial units. There are many ways to define \(W_n\) (e.g. pages 17–19 in Anselin 1988). Let \(W_{ij}\) be the (i, j) element of \(W_n\). Commonly used \(W_n\) includes Rook contiguity, Bishop contiguity and Queen contiguity as follows. Rook contiguity: define \(W_{ij} = 1\) if the units i and j share a common side and \(W_{ij} = 0\), otherwise. Bishop contiguity: define \(W_{ij} = 1\) if the units i and j share a common vertex and \(W_{ij} = 0\), otherwise. Queen contiguity: define \(W_{ij} = 1\) if the units i and j share a common side or vertex and \(W_{ij} = 0\), otherwise. The choice of \(W_n\) is important. Our results hold true for all these commonly used \(W_n\).
The models (2.1)–(2.3) in Su and Yang (2015) are as follows:
where \(\mu =(\mu _1, \ldots , \mu _n)'\) represent the unobservable individual or space-specific effects and other notations are the same as in model (1) and (2). Compared to models (2.1)–(2.3) in Su and Yang (2015), the only difference is that there is no space-specific effects \(\mu \) in model (1) and (2). Our initial investigation shows that the EL method for SDPD models with space-specific effects may need an adjusted EL method. Further research is needed and left for our future work.
We develop the EL method for the SDPD model when \(y_0\) is exogenous. In this case, we can treat \(y_0\) as a fixed constant vector as it contains no information about the model parameters. For convenience, we use \(\mathbf{1} _k\) to denote a \(k\times 1\) vector of ones, \(\mathbf{0} _k\) to denote a \(k\times 1\) vector of zeros, and \(J_k=\mathbf{1} _k \mathbf{1} '_k\), where \(\otimes \) is the Kronecker product.
Let \(Y=(y'_1, y'_2, \ldots , y'_T)'\), \(Y_{-1}=(y'_0, y'_1, \ldots , y'_{T-1})'\), \(X=(x'_1, x'_2, \ldots , x'_T)'\), \(\nu =(\nu '_1, \nu '_2, \ldots , \nu '_T)'\), \(Z=\mathbf{1} _T\otimes z\), \(B=B(\lambda )=I_n-\lambda W_n\) and \(\epsilon =(\epsilon '_1, \epsilon '_2, \ldots , \epsilon '_T)'\). Then model (1) and (2) can be written in a matrix form as:
with
or
with
where \(\epsilon \sim (0, \sigma ^2_\nu \Omega )\), with
Let \(\theta =(\beta ', \gamma ', \rho )'\) and \(\psi =(\theta ', \sigma ^2_\nu , \lambda )'\). We adopt the QML method to derive the estimating equations for the EL method. Under the assumption of normality (which is only used at this moment), based on (3) and (4), the log-likelihood function (ignoring constants) is
where \(\epsilon =Y-\rho Y_{-1}-X\beta -Z\gamma \). It can be shown that
where \(\widetilde{X}=(X, Z, Y_{-1})\), \(A=(B' B)^{-1}(W'_nB+B' W_n)(B' B)^{-1}\). Letting above derivatives be 0, we obtain the following estimating equations of the QML method:
Substituting (4) into (7)–(9), we have
Noting that \(\widetilde{X}=(X, Z, Y_{-1})\) and \(Y_{-1}\) contains X, z and \(\nu \), we need to separate out \(\nu \) from \(\widetilde{X}\). To this end, denote \(l_{\rho }=(0, c_{\rho , 1}, \ldots , c_{\rho ,T-1})'\), \(c_{\rho , t}=(1-\rho ^t)/(1-\rho )\), \(Y_0=(Y'_{0, 0}, Y'_{0, 1}, \ldots , Y'_{0, T-1})'\), \(Y_{0, t}=\rho ^ty_0\),
\(A_x=F_{\rho }' \otimes I_n \) and \(A_\nu =F_{\rho }' \otimes B^{-1} \). We use (B.3) in Su and Yang (2015) to obtain that
Let \(\widetilde{X}_1=\left( X,\ Z \right) \) and \(\widetilde{X}_2= A_xX\beta +(l_{\rho }\otimes I_n )z\gamma +Y_0\). Then (10) can be decomposed into
For convenience, let \(e=\nu \), i.e.
Then (11)–(14) can be rewritten as
Observing that the above estimating equations include the quadratic forms of e, to use the EL method, we need to change the quadratic forms into the linear forms of a well behaved random variables. To this end, we let \(H_1={1\over 2}\left( A'_\nu (I_T\otimes B')+(I_T\otimes B)A_\nu \right) \) and \(H_2=I_T\otimes (BAB')\). Use \({h}_{ij,k}\), \(a_{i,1}\) and \(a_{i,2}\) to denote the (i, j) element of the matrix \(H_k\) (\(k=1, 2\)), the i-th column of the matrix \(\widetilde{X}_1'(I_T\otimes B')\) and the i-th element of the vector \(\widetilde{X}_2'(I_T\otimes B')\), respectively, and adapt the convention that any sum with an upper index of less than one is zero. To deal with the quadratic form in (17) and (19), we follow Kelejian and Prucha (2001) to introduce a martingale difference array. Define the \(\sigma \)-fields: \({\mathcal {F}}_{0}=\{ {\emptyset }, \Omega \}, {\mathcal {F}}_{i}=\sigma (e_1, e_2, \ldots , e_i), 1\le i\le nT\). Let
Then \( {\mathcal {F}}_{i-1} \subseteq {\mathcal {F}}_{i}, M_{ik}\) is \({\mathcal {F}}_{i}\)-measurable and \(E(M_{ik}|{\mathcal {F}}_{i-1})=0\). Thus \(\{M_{ik}, {\mathcal {F}}_{i}, 1\le i\le nT\}\) form a martingale difference array and
Based on (16)–(21), we propose the following EL ratio statistic for \(\psi \in R^{p+q+3}\):
where \(\{p_i\}\) satisfy
Let
where \(e_i\) is the ith component of \((I_T\otimes B)(Y-\rho Y_{-1}-X\beta -Z\gamma )\). Following Owen (1990), one can show that
where \(\tilde{\lambda }(\psi )\in R^{p+q+3}\) is the solution of the following equation:
Let \(\vartheta _j=E\nu _{11}^j, j=3, 4\). Use Vec(diagA) to denote the vector formed by the diagonal elements of a matrix A, ||a|| to denote the \(L_2\)-norm of a vector a, and \(\lambda _{min}(H)\) and \(\lambda _{max}(H)\) to denote the minimum and maximum eigenvalues of a matrix H, respectively. To obtain the asymptotic distribution of \(\ell (\psi )\), we need following assumptions.
-
A1. (1) \(\nu _{jt}\) are mutually independent, and they are independent of \(x_{ks}\) and \(z_k\) for all j, k, t, s;
-
(2) All elements in \((x_{it}, z_i)\) have \(4+\eta _1\) moments for some \(\eta _1>0\).
-
A2. (1) \(\{\nu _{it}, t=1,\ldots , T, i=1,\ldots , n \}\) are independent and identically distributed for all i and t with mean 0, variance \(\sigma ^2_\nu >0\) and \(E|\nu _{it}|^{4+\eta _1}<\infty \) for some \(\eta _1>0\).
-
(2) \(\{x_{it}, t=\ldots , -1, 0, 1, \ldots \}\) and \(\{z_i\}\) are strictly exogenous and independent across i.
-
(3) \(|\rho |<1\).
-
A3. Let \(W_n\) and \(\{B^{-1} \}\) be as described above. They satisfy the following conditions:
-
(1) The row and column sums of \(W_n\) are uniformly bounded in absolute value;
-
(2) \(\{B^{-1}\}\) are uniformly bounded in either row or column sums, uniformly in \(\lambda \) in a compact parameter space \(\Lambda \), and \(\underline{c}_\lambda \le \inf _{\lambda \in \Lambda }\lambda _{\max }(B' B)\le \sup _{\lambda \in \Lambda }\lambda _{\max }(B' B)\le \overline{c}_\lambda <\infty \).
-
A4. There are constants \(c_j>0, j=1, 2\), such that
$$\begin{aligned} 0<c_1\le \lambda _{min}\left( (nT)^{-1}\Sigma _{p+q+3} \right) \le \lambda _{max}\left( (nT)^{-1}\Sigma _{p+q+3} \right) \le c_2<\infty , \end{aligned}$$where
$$\begin{aligned}&\Sigma _{p+q+3}=\Sigma '_{p+q+3} =Cov\left\{ \sum ^{nT}_{i=1}\omega _{i}(\psi ) \right\} \nonumber \\&\quad =\left( \begin{array}{llll} \Sigma _{11}&{} \Sigma _{12} &{} \Sigma _{13} &{} \Sigma _{14}\\ *&{} \Sigma _{22} &{} \Sigma _{23}&{} \Sigma _{24}\\ *&{} *&{} \Sigma _{33} &{}\Sigma _{34}\\ *&{} *&{} *&{} \ \Sigma _{44}\\ \end{array} \right) _{(p+q+3)\times (p+q+3)}, \end{aligned}$$(24)where
$$\begin{aligned} \Sigma _{11}= & {} \sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1\right) , \Sigma _{12}=\vartheta _3 E(\widetilde{X}_1')(I_T\otimes B')\mathbf{1} _{nT}, \\ \Sigma _{13}= & {} \sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2\right) , \Sigma _{14}=\vartheta _3E(\widetilde{X}_1')(I_T\otimes B')vec_D(H_2) \\ \Sigma _{22}= & {} nT(\vartheta _4-\sigma _\nu ^4),\ \ \Sigma _{23}=(\vartheta _4-3\sigma _\nu ^4)\mathbf{1} '_{nT}vec_D(H_1)+2\sigma _\nu ^4tr(H_1)+\vartheta _3E(\widetilde{X}_2')(I_T\otimes B')\mathbf{1} _{nT}, \\ \Sigma _{24}= & {} (\vartheta _4-3\sigma _\nu ^4)\mathbf{1} '_{nT}vec_D(H_2)+2\sigma _\nu ^4tr(H_2),\\ \Sigma _{33}= & {} (\vartheta _4-3\sigma _\nu ^4)||vec_D(H_1)||^2+2\sigma _\nu ^4tr(H_1^2)+\sigma ^2_\nu E\left( \widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2\right) +2\vartheta _3 E(\widetilde{X}_2')(I_T\otimes B')vec_D(H_1), \\ \Sigma _{34}= & {} (\vartheta _4-3\sigma _\nu ^4)vec'_D(H_1)vec_D(H_2)+2\sigma _\nu ^4tr(H_1H_2)+\vartheta _3 E(\widetilde{X}_2')(I_T\otimes B')vec_D(H_2), \\ \Sigma _{44}= & {} (\vartheta _4-3\sigma _\nu ^4)||vec_D(H_2)||^2+2\sigma _\nu ^4tr(H_2^2). \end{aligned}$$ -
A5. \(n\rightarrow \infty \) but T is fixed.
Remark 1
Conditions A1–A3 are common assumptions for spatial models, which are used in Su and Yang (2015), and the analog of \(0<c_1\le \lambda _{min}\left( (nT)^{-1}\Sigma _{p+q+3} \right) \) is employed in the assumption of Theorem 1 in Kelejian and Prucha (2001).
We now state the main results.
Theorem 1
Suppose that Assumptions A1–A5 are satisfied. Then under model (1)–(2), as \(n\rightarrow \infty,\)
where \(\chi ^2_{p+q+3}\) is a chi-squared distributed random variable with \(p+q+3\) degrees of freedom.
Let \(z_{\alpha }(p+q+3)\) satisfy \(P(\chi ^2_{p+q+3}\le z_{\alpha }(p+q+3))=\alpha \) for \(0<\alpha <1\). It follows from Theorem 1 that an EL based confidence region for \(\psi \) with asymptotically correct coverage probability \(\alpha \) can be constructed as
3 Simulations
Recall that \(\theta =(\beta ', \gamma ', \rho )'\) and \(\psi =(\theta ', \sigma ^2_\nu , \lambda )'\). Denote \(P_\lambda =\Omega ^{-1}\Omega _\lambda \Omega ^{-1}\), \(\Omega _\lambda =I_T\otimes A\) and \(\Omega _{\lambda \lambda }=I_T\otimes \{2(B'B)^{-1}[(W'_nB+B' W_n)A-W_n'W_n(B'B)^{-1}]\}\). It can be shown that
According to Su and Yang (2015), the QMLE \(\widehat{\psi }\) of \(\psi \) satisfies:
where \(\Sigma =\lim _{n\rightarrow \infty }\frac{1}{nT}E[\Sigma _n(\psi )]\) and \(\Sigma _n(\psi )=\frac{\partial ^2}{\partial \psi \partial \psi '}\widetilde{L}(\psi )\).
Based on the above asymptotic result, we can obtain the NA based confidence region for \(\psi \). However, we note that the NA method depends on the availability of a consistent estimator of the asymptotic covariance matrix in practical applications, while the EL method does not. This can save the implementation time for the EL method and the EL method outperforms the NA method.
We conducted a small simulation study to compare the finite sample performances of the confidence regions based on EL and NA methods with confidence level \(\alpha =0.95\), and report the proportion of \(\ell (\psi ) \le z_{0.95}(p+q+3)\) and \((\widehat{\psi }-\psi )'(-\Sigma )(\widehat{\psi }-\psi )\le z_{0.95}(p+q+3)\) respectively in 1000 replications.
In the simulations, we used the following two models:
-
(1)
Model 1:
\(y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2,3\), where \(x_t\) were generated from N(0, 4), alternatively, \(x_t\) can be randomly generated in a similar fashion as in Hsiao et al. (2002), and the elements of z were randomly generated from Bernoulli(0.5). We selected \(\beta =1\), \(\gamma =1\), \(\sigma _\nu ^2=1\) and \((\rho , \lambda )\) were taken as \((-0.8, -0.7)\), \((-0.2, -0.1)\), (0.2, 0.1), (0.8, 0.7), \((-0.8, 0.7)\) and \((0.2, -0.1)\) respectively, and \(\nu _{it}'s\) were i.i.d. from N(0, 1), t(5) and \(\chi ^2_{4}-4\), respectively;
-
(2)
Model 2:
\(y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2,3\), where \(x_t=\left( x_t^{(1)}, x_t^{(2)}\right) \) is an \(n\times 2\) matrix, where \(x_t^{(1)}\) were randomly generated from N(0, 1) and \(x_t^{(2)}\) were randomly generated from N(0, 4). Moreover, \(z=\left( z^{(1)}, z^{(2)}\right) \) is an \(n\times 2\) matrix, the elements of \(z^{(1)}\) were randomly generated from Bernoulli(0.3) and the elements of \(z^{(2)}\) were randomly generated from Bernoulli(0.6). We selected \(\beta =(1.5, 1.0)'\), \(\gamma =(2, 1.2)'\), \((\rho , \lambda )\) were taken as \((-0.8, -0.7)\), \((-0.2, -0.1)\), (0.2, 0.1), (0.8, 0.7), \((-0.8, 0.7)\) and \((0.2, -0.1)\) respectively, and \(\nu _{it}'s\) were i.i.d. from N(0, 1), \(t(5), \chi ^2_{4}-4, 0.1N(0, 4)+0.9N(0, 1)\) and \(0.1t(3)+0.9t(5)\), respectively.
The results of simulations under model 1 are reported in Tables 1, 2 and 3, and the results of simulations under model 2 are reported in Tables 4, 5, 6, 7 and 8.
For the contiguity weight matrix \(W_n=(W_{ij})\), we took \(W_{ij}=1\) if spatial units i and j are neighbours by queen contiguity rule (namely, they share common border or vertex), \(W_{ij}=0\) otherwise (Anselin 1988, P.18). We considered five ideal cases of spatial units: \(n=m\times m\) regular grid with \(m=7, 10, 13,16, 20\), denoting \(W_n\) as \(grid_{49}, grid_{100}, grid_{169}, grid_{256} \) and \( grid_{400}\), respectively. A transformation is often used in applications to convert the matrix \(W_n\) to the unity of row-sums. We used the standardized version of \(W_n\) in our simulations, namely \(W_{ij}\) was replaced by \(W_{ij}/\sum _{j=1}^nW_{ij}\).
Simulation results under model 1 show that the confidence regions based on NA behave well with coverage probabilities being very close to the nominal level 0.95 when the error term \(\epsilon _i\) is normally distributed and n is large, but not well in other cases. The coverage probabilities of the confidence regions based on NA fall to the range [0.812, 0.862] for the t distribution and [0.809, 0.868] for the \(\chi ^2\) distribution, which are far from the nominal level 0.95. Simulation results under model 2 are similar to those under model 1.
We can see, from Tables 1, 2, 3, 4, 5, 6, 7 and 8, that the coverage probabilities of confidence regions based on EL method converge to the nominal level 0.95 as the number of spatial units n is large enough, whether the error term \(\epsilon _i\) is normally distributed or not. These results show that the EL based confidence regions generally outperform the NA based confidence regions when n is large enough.
4 A real data example
In order to illustrate the proposed method in Sect. 2, we conducted a real data analysis. The data come from 288 prefecture-level cities in China, collected from National Bureau of Statistics of China and Anjuke. There were three variables: the logarithm of housing price per square meter (\(y_t\)), the logarithm of income per household (\(x_t\)) and the urbanization rate (z) from the years of 2010 to 2017. In order to ensure the stability and eliminate the influence of dimension, we first did difference and standardization on the above data, and then considered fitting the data via the following model: \(y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2, \ldots , 8\), where \(n=288\) and the spatial weighting matrix \(W_n\) was selected by the method in Sect. 3.
We separately employed the EL method in Sect. 2 and the NA method in Sect. 3 to obtain the confidence intervals for parameters \(\beta , \gamma , \rho , \lambda \) and \(\sigma ^2_\nu \) with confidence level 0.95, which were shown in Table 9.
Table 9 shows that the estimator of the spatial parameter is \(\lambda = 0.3743\), and 0 is not in its confidence interval, which implies that there exists a spatial relationship among the disturbances. The results also show that the lengths of the EL based intervals are uniformly shorter than those of the NA based intervals, which implies that the EL based method performs better than the NA based method for the real data.
References
Anselin, L. (1988). Spatial Econometrics: Methods and Models. Kluwer Academic Press.
Anselin, L. (2001). Spatial econometrics. In B. H. Baltagi (Ed.), A Companion to Theoretical Econometrics (pp. 310–330). Blackwell Publishers Ltd.
Baltagi, B., Egger, P., & Pfaffermayr, M. (2013). A generalized spatial panel model with random effects. Econometric Reviews, 32, 650–685.
Baltagi, B., & Li, D. (2006). Prediction in the panel data model with spatial correlation: The case of liquor. Spatial Economic Analysis, 1, 175–185.
Baltagi, B., Song, S. K., Jung, B. C., & Koh, W. (2007). Testing for serial correlation, spatial autocorrelation and random effects using panel data. Journal of Econometrics, 140, 5–51.
Baltagi, B., Song, S. H., & Koh, W. (2003). Testing panel data regression models with spatial error correlation. Journal of Econometrics, 117, 123–150.
Bandyopadhyay, S., Lahiri, S. N., & Nordman, D. J. (2015). A frequency domain empirical likelihood method for irregularly spaced spatial data. The Annals of Statistics, 43(2), 519–545.
Chen, X., & Conley, T. G. (2001). A new semiparametric spatial model for panel time series. Journal of Econometrics, 105, 59–83.
Chen, S. X., & Keilegom, I. V. (2009). A review on empirical likelihood for regressions (with discussions). Test, 3, 415–447.
Chen, J., & Qin, J. (1993). Empirical likelihood estimation for finite populations and the effective usage of auxiliary information. Biometrika, 80, 107–116.
Elhorst, J. P. (2003). Specification and estimation of spatial panel data models. International Regional Science Review, 26, 244–268.
Elhorst, J. P. (2005). Unconditional maximum likelihood estimation of linear and loglinear dynamic models for spatial panels. Geographical Analysis, 37, 85–106.
Elhorst, J. P. (2010). Dynamic panels with endogenous interaction effects when T is small. Regional Science and Urban Economics, 40, 272–282.
Hall, P. (1992). The Bootstrap and Edgeworth Expansion. Springer-Verlag.
Hall, P., & La Scala, B. (1990). Methodology and algorithms of empirical likelihood. International Statistical Review, 58, 109–127.
Hsiao, C., Pesaran, M. H., & Tahmiscioglu, A. K. (2002). Maximum likelihood estimation of fixed effects dynamic panel data models covering short time periods. Journal of Econometrics, 109, 107–150.
Jin, F., & Lee, L. F. (2019). GEL estimation and tests of spatial autoregressive models. Journal of Econometrics, 208, 585–612.
Kapoor, M., Kelejian, H. H., & Prucha, I. R. (2007). Panel data models with spatially correlated error components. Journal of Econometrics, 140, 97–130.
Kelejian, H. H., & Prucha, I. R. (2001). On the asymptotic distribution of the Moran \(I\) test statistic with applications. Journal of Econometrics, 104, 219–257.
Kitamura, Y. (1997). Empirical likelihood methods with weakly dependent processes. The Annals of Statistics, 25, 2084–2102.
Lee, L. F., & Yu, J. (2010a). A spatial dynamic panel data model with both time and individual fixed effects. Econometric Theory, 26, 564–597.
Lee, L. F., & Yu, J. (2010b). Estimation of spatial autoregressive panel data models with fixed effects. Journal of Econometrics, 154(2), 165–185.
Lee, L. F., & Yu, J. (2010c). Some recent developments in spatial panel data models. Regional Science and Urban Economics, 40, 255–271.
Lee, L. F., Yu, J., (2015a). Spatial Panel Data Models. In: Baltagi B (ed) The Oxford Handbooks: Panel Data. Oxford University Press, Oxford, England.
Lee, L. F., & Yu, J. (2015b). Estimation of fixed effects panel regression models with separable and nonseparable space-time filters. Journal of Econometrics, 184, 174–192.
Li, Y., & Qin, Y. (2020). Empirical likelihood for panel data models with spatial errors. Communications in Statistics-Theory and Methods. https://doi.org/10.1080/03610926.2020.1780449.
Mutl, J. (2006). Dynamic Panel Data Models with Spatially Correlated Disturbances. College Park: University of Maryland. (Ph.D. thesis).
Mutl, J., & Pfaffermayr, M. (2011). The Hausman test in a Cliff and Ord panel model. The Economic Journal, 14, 48–76.
Nordman, D. J. (2008a). A blockwise empirical likelihood for spatial lattice data, Statist. Sinica, 18, 1111–1129.
Nordman, D. J. (2008b). An empirical likelihood method for spatial regression. Metrika, 68, 351–363.
Owen, A. B. (1988). Empirical likelihood ratio confidence intervals for a single functional. Biometrika, 75, 237–249.
Owen, A. B. (1990). Empirical likelihood ratio confidence regions. The Annals of Statistics, 18, 90–120.
Owen, A. B. (2001). Empirical Likelihood. Chapman & Hall.
Parent, O., & LeSage, J. P. (2011). A spaceCtime filter for panel data models containing random effects. Computational Statistics & Data Analysis, 55, 475–490.
Pesaran, M. H. (2004). General Diagnostic Tests for Cross Section Dependence in Panels, Working Paper No. 1229, University of Cambridge.
Qin, Y. (2021). Empirical likelihood for spatial autoregressive models with spatial autoregressive disturbances. Sankhyā A: The Indian Journal of Statistics, 83, 1–25.
Qin, J., & Lawless, J. (1994). Empirical likelihood and general estimating equations. The Annals of Statistics, 22, 300–325.
Su, L., & Yang, Z. (2007). QML Estimation of Dynamic Panel Data Models with Spatial Errors, Working Paper. Singapore Management University.
Su, L., & Yang, Z. (2015). QML estimation of dynamic panel data models with spatial errors. Journal of Econometrics, 185, 230–258.
Wu, C. B. (2004). Weighted empirical likelihood inference. Statistics & Probability Letters, 66, 67–79.
Yang, Z., Li, C., & Tse, Y. K. (2006). Functional form and spatial dependence in dynamic panels. Economics Letters, 91, 138–145.
Yu, J., de Jong, R., & Lee, L. F. (2008). Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large. Journal of Econometrics, 146, 118–134.
Yu, J., & Lee, L. F. (2010). Estimation of unit root spatial dynamic panel data models. Econometric Theory, 26, 1332–1362.
Zhong, B., & Rao, J. N. K. (2000). Empirical likelihood inference under stratified random sampling using auxiliary population information. Biometrika, 87, 929–938.
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (12061017, 12161009). The authors are thankful to the referees for constructive suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
In the proof of the main results, we need to use Theorem 1 in Kelejian and Prucha (2001). We now state this result. Let
where \(\epsilon _{ni}\) are real valued random variables, and the \(a_{nij}\) and \(b_{ni}\) denote the real valued coefficients of the linear-quadratic form. We need the following assumptions in Lemma 1.
-
(C1)
\(\{\epsilon _{ni}, 1\le i\le n\}\) are independent random variables with mean 0 and \(\sup _{1\le i\le n, n\ge 1}E|\epsilon _{ni}|^{4+\eta _1}<\infty \) for some \(\eta _1>0\);
-
(C2)
For all \(1\le i, j\le n, n\ge 1, a_{nij}=a_{nji}\), \(\sup _{1\le j\le n, n\ge 1} \sum ^n_{i=1}|a_{nij}|<\infty \), and \(\sup _{n\ge 1}n^{-1}\sum ^n_{i=1}|b_{ni}|^{2+\eta _2}<\infty \) for some \(\eta _2>0\).
Given above assumptions (C1) and (C2), the mean and variance of \(\tilde{Q}_n\) are given as (e.g. Kelejian & Prucha, 2001)
with \(\sigma ^2_{ni}=E(\epsilon _{ni}^2)\) and \(\mu ^{(s)}_{ni}=E(\epsilon _{ni}^s)\) for \(s=3, 4\).
Lemma 1
Suppose that Assumptions C1 and C2 hold true and \(n^{-1}\sigma ^2_{\widetilde{Q}}\ge c\) for some constant \(c>0\) . Then
Proof
See Theorem 1 in Kelejian and Prucha (2001). \(\square \)
Lemma 2
Let \(\xi _1, \xi _2,\ldots , \xi _n\) be a sequence of stationary random variables, with \(E|\xi _1|^s<\infty \) for some constants \(s>0\) . Then
Proof
Using Borel–Cantelli lemma and following the proof of (2.3) in Owen (1990), one can prove Lemma 2, where there is no need to assume that \(\xi _1, \xi _2,\ldots , \xi _n\) are in dependent in using Borel–Cantelli lemma. \(\square \)
Lemma 3
Suppose that Assumptions A1–A5 are satisfied. Then as \(n\rightarrow \infty,\)
Proof
Note that
By Conditions A1–A3 and Lemma 2, we have
In addition, by Lemma B.2. in Su and Yang (2015), \(A'_\nu (I_T\otimes B')\) and \((I_T\otimes (BAB'))\) are uniformly bounded in both row and column sums, it follows that
Thus \(Z_n=o_p((nT)^{2/(4+\eta _1)})\). (26) is proved. \(\square \)
We now prove (27). For any given \({l}=(l'_1,l_2, l_3, l_4)'\in R^{p+q+3}\) with \(||{l}||=1\), where \(l_1\in R^{p+q}\), \(l_2, l_3, l_4\in R\), it is clear that
Denote
where
Note that
The conditional expectation and variance given X, Z are denoted as \(E^*\) and \(Var^*\), respectively. Then from (15) and note that \(E(\nu )=0\), we know that the variance of \(Q_n\) is
and
Further,
where \(\tilde{l}=(l_2, l_3, l_4)'\), \(G_{1}=\left( \begin{array}{lll}nT &{} \mathbf{1} '_{nT}vec_D(H_1) &{} \mathbf{1} '_{nT}vec_D(H_2) \\ *&{} ||vec_D(H_1)||^2 &{} vec'_D(H_1)vec_D(H_2) \\ *&{}*&{} ||vec_D(H_2)||^2 \end{array} \right) \). And
where \(G_{2}=\left( \begin{array}{lll}nT &{} tr(H_1) &{} tr(H_2) \\ *&{} tr(H_1^2) &{} tr(H_1H_2) \\ *&{}*&{} tr(H_2^2) \end{array} \right) \). Moreover,
It is easy to show (e.g. Su & Yang, 2015) that, \(n^{-1}\widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1\), \(n^{-1}\widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2\), and \(n^{-1}\widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2\) converge in probability to their expectations. We have
and
where \(\Sigma _{p+q+3}\) is given in (24). From Condition A4, one can see that \((nT)^{-1}Var^*(Q_n)\ge c_1>0\). From Lemma 1, we have
where \(d^{*}\) stands for convergence in distribution given X, Z. Noting that \((nT)^{-1}Var^*(Q_n)\ge c_1>0\) and
one can show that
Combing \(E^*(Q_n)=0\), (38) and (39), we thus have
Then (27) holds true.
Next we will prove (28), i. e.
Let
where \(R_i=2\sum ^{i-1}_{j=1}u_{ij}e_j+b_{i}\). Let \({\mathcal {F}}_{0}=\{ {\emptyset }, \Omega \}, {\mathcal {F}}_{i}=\sigma (e_1, e_2, \ldots , e_i), 1\le i\le nT\). Then \(\{N_{in}, {\mathcal {F}}_{i}, 1\le i\le nT\}\) form a martingale difference array given X, Z. From (30) and (37), one can see that
It follows that
where \(S_{n1}=\sum _{i=1}^{nT}\{N_{in}^2-E^*(N_{in}^2|{\mathcal {F}}_{i-1})\}\), \(S_{n2}=\sum _{i=1}^{nT}\{E^*(N_{in}^2|{\mathcal {F}}_{i-1})-E^*(N_{in}^2)\}\). Next we will show that: (1)
and (2)
To show (1) and (2), it is sufficient to show that \((nT)^{-2}E(S^2_{n1})\rightarrow 0\) and \((nT)^{-2}E(S^2_{n2})\rightarrow 0\), respectively. Obviously,
Thus
It follows that
By Conditions A1–A3, we have
and
Similarly, we can prove that
From (45)–(48), we have \((nT)^{-2}E(S_{n1}^2)\rightarrow 0\). Furthermore,
Thus,
Note that
and
where we have used Conditions A2 and A3. From (49)–(52), we have \((nT)^{-2}ES_{n2}^2\rightarrow 0\). The proof of (28) is thus complete.
Finally, we will prove (29). Note that
By Conditions A2 and A3, we have
Similarly,
Further, using (58) and Markov inequality, we obtain \(\sum _{i=1}^{nT} ||\omega _i(\psi )||^3=O_p(nT^2)\). Thus (29) is proved.
Proof of Theorem 1
Using Lemma 3 and following the proof of Theorem 1 in Qin (2021), one can easily show that Theorem 1 holds true. \(\square \)
Rights and permissions
About this article
Cite this article
Li, Y., Qin, Y. Empirical likelihood for spatial dynamic panel data models. J. Korean Stat. Soc. 51, 500–525 (2022). https://doi.org/10.1007/s42952-021-00150-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42952-021-00150-4