1 Introduction

Diagnostic techniques for parametric regression models have received a great deal of attention in the statistical literature since the seminal work of Cook (1977) and others, including Cook and Weisberg (1982), Belsley et al. (1989) and Walker and Birch (1988). In semiparametric regression models (SPRMs), diagnostic results are quite rare; among them, Eubank (1985), Thomas and Cook (1989) and Kim (1996) studied basic diagnostic building blocks such as residuals and leverages, while Kim et al. (2001, 2002) and Fung et al. (2002) proposed versions of Cook's distance for SPRMs.

Collinearity in the linear regression model can make the least squares estimate very sensitive, so mixed estimation and ridge regression have been suggested to mitigate its effect. However, as many authors have noted, the influence of observations on ridge regression differs from their influence on the corresponding least squares estimate, and collinearity can even disguise anomalous data (Belsley et al. 1989). Using the case deletion method, Walker and Birch (1988) and Jahufer and Chen (2009) studied the influence of observations in ordinary ridge regression (ORR) and modified ridge regression (MRR), respectively, deriving several case deletion influence measures for the ORR and MRR parameter estimates. In SPRMs, Emami (2015) extended the results of Walker and Birch (1988) and derived case deletion influence measures for ridge estimates.

Instead of deleting cases one by one, the local influence approach of Cook (1986) assesses influence by simultaneously perturbing the assumed model, measuring influence through the normal curvature of an influence graph based on likelihood displacement. This approach has received considerable attention. Local influence analysis does not require recomputing the parameter estimates for every case deletion, so it is often computationally simpler. Furthermore, it permits perturbation of various aspects of the model and can therefore reveal more than the case deletion approach is designed for; for example, it can help measure the leverage of a design point and evaluate the impact of a small measurement error in x on the estimates. The approach has been extended to generalized linear models by Thomas and Cook (1989), to linear mixed models by Lesaffre and Verbeke (1998), to partial linear models by Zhu et al. (2003) and to linear measurement error models by Rasekh (2006). For multicollinearity problems, Shi and Wang (1999) and Jahufer and Chen (2012) studied the local influence of minor perturbations on the ridge and Liu estimators in ordinary regression, respectively, deriving diagnostics under perturbation of the variance and of the explanatory variables.

In this paper we generalize the results of Shi and Wang (1999) to SPRMs and assess the local influence of observations on the Liu estimates. We demonstrate that the local influence analysis of Cook (1986) can be extended to the Liu penalized least squares estimators (LPLSEs) in SPRMs and provide some insight into the interplay between the linear and nonparametric components in the context of influence diagnostics.

The paper is organized as follows. In the next section, SPRMs are introduced and the relevant notation and some inferential results are given. Section 3 derives the local influence diagnostics of the LPLSEs under perturbation of the constant variance and of the explanatory variables. Section 4 provides a diagnostic for detecting observations that are locally influential on the selection of the Liu parameter. In Sect. 5 the proposed methods are illustrated through a simulation study and a real data set. A discussion is given in the last section.

2 Model and inference

Consider the semiparametric regression model

$$\begin{aligned} y_{i}=x_{i}'\beta + g(t_{i})+\epsilon _{i} \quad 1\le i \le n, \end{aligned}$$
(1)

where \(y_{i}\) is the scalar response, \(\beta \) is a p-vector of regression coefficients, \(x_{i}\) is a p-vector of explanatory variables, \(t_{i}\) is a scalar with \(a\le t_{1},\ldots ,t_{n}\le b\) and the \(t_{i}\)'s not all identical, g is a twice differentiable unknown smooth function on some finite interval, and the errors \(\epsilon _{i}\) are uncorrelated with zero mean and unknown constant variance \(\sigma ^{2}\). This model is also called a partially linear model or a partial spline model. Model (1) has been studied by many methods, e.g., penalized least squares (see Fung et al. 2002; Chen and You 2005) and smoothing splines (see Speckman 1988; Green and Silverman 1994). In this study we focus our attention on local influence diagnostics for the penalized least squares estimators, as this is a well-studied method of estimation for such models.

2.1 Penalized least squares estimators (PLSEs)

Let the ordered distinct values among \(t_{1},\ldots ,t_{n}\) be denoted by \(s_{1},\ldots ,s_{q}\). The connection between \(t_{1},\ldots ,t_{n}\) and \(s_{1},\ldots ,s_{q}\) is captured by the \(n\times q\) incidence matrix \(\mathbf {N}\), with entries \(N_{ij}=1\) if \(t_{i}=s_{j}\) and 0 otherwise. Let g be the vector of values \(a_{j}=g(s_{j})\). For model (1) the penalized sum of squares is

$$\begin{aligned} ||y-\mathbf {X}\beta -\mathbf {N}g||^2+\lambda \int g''(t)^2 dt, \end{aligned}$$
(2)

where y is the vector of n response values and \(\mathbf {X}\) is \(n\times p\) design matrix. Minimizing (2) with respect to \(\beta \) and g, the PLSEs of \(\beta \) and g are

$$\begin{aligned} \hat{\beta }=\{\mathbf {X}'(\mathbf {I}-\mathbf {S})\mathbf {X}\}^{-1}\mathbf {X}'(\mathbf {I}-\mathbf {S})y, \end{aligned}$$
(3)

and

$$\begin{aligned} \hat{g}=( \mathbf {N}'\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'(y-\mathbf {X}\hat{\beta }), \end{aligned}$$
(4)

respectively, where \(\mathbf {I}\) is the identity matrix of size n, \(\mathbf {S}=\mathbf {N}(\mathbf {N}'\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'\) is a smoothing matrix, \(\lambda \) is a nonnegative tuning parameter and \(\mathbf {K}\) is a \(q\times q\) matrix whose entries only depend on the knots \(\{s_{j}\}\) (see Speckman 1988).
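
To make these estimators concrete, the following Python sketch (our own illustration, not code from the cited references) builds the incidence matrix \(\mathbf {N}\), the natural cubic spline penalty matrix \(\mathbf {K}\) of Green and Silverman (1994), the smoothing matrix \(\mathbf {S}\) and the PLSEs (3) and (4):

```python
import numpy as np

def incidence_matrix(t, s):
    """n x q incidence matrix: N[i, j] = 1 if t_i = s_j and 0 otherwise."""
    return (np.abs(t[:, None] - s[None, :]) < 1e-12).astype(float)

def spline_penalty(s):
    """q x q penalty matrix K = Q R^{-1} Q' of a natural cubic spline with
    ordered distinct knots s_1 < ... < s_q (Green and Silverman 1994)."""
    q = len(s)
    h = np.diff(s)                                    # h_j = s_{j+1} - s_j
    Q = np.zeros((q, q - 2))
    R = np.zeros((q - 2, q - 2))
    for j in range(q - 2):                            # interior knots s_2, ..., s_{q-1}
        Q[j, j] = 1.0 / h[j]
        Q[j + 1, j] = -1.0 / h[j] - 1.0 / h[j + 1]
        Q[j + 2, j] = 1.0 / h[j + 1]
        R[j, j] = (h[j] + h[j + 1]) / 3.0
        if j < q - 3:
            R[j, j + 1] = R[j + 1, j] = h[j + 1] / 6.0
    return Q @ np.linalg.solve(R, Q.T)

def plse(y, X, t, lam):
    """Penalized least squares estimators (3)-(4) together with S, N and K."""
    s_knots = np.unique(t)                            # ordered distinct t values
    N = incidence_matrix(t, s_knots)
    K = spline_penalty(s_knots)
    S = N @ np.linalg.solve(N.T @ N + lam * K, N.T)   # smoothing matrix
    R_IS = np.eye(len(y)) - S                         # I - S
    beta_hat = np.linalg.solve(X.T @ R_IS @ X, X.T @ R_IS @ y)
    g_hat = np.linalg.solve(N.T @ N + lam * K, N.T @ (y - X @ beta_hat))
    return beta_hat, g_hat, S, N, K
```

With at least three distinct knots, plse returns \(\hat{\beta }\), \(\hat{g}\) and the matrices that are reused in the later sketches.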

2.2 Liu penalized least squares estimators (LPLSEs)

Multicollinearity is a problem when the primary interest is in the estimation of the parameters of a regression model. When the correlation matrix has one or more small eigenvalues, the estimates of the regression coefficients can be large in absolute value, and the least squares estimator performs poorly. Biased estimators have been suggested as a means to improve the accuracy of the parameter estimates when multicollinearity exists. A few studies have addressed rank-deficient, ill-conditioned or multicollinear problems in SPRMs (see Hu 2005; Tabakan and Akdeniz 2010; Roozbeh 2015; Roozbeh and Arashi 2015; Amini and Roozbeh 2015; Arashi and Valizadeh 2015). To overcome near multicollinearity, Liu (1993) combined the Stein (1956) estimator with the ordinary ridge estimator to obtain the Liu estimator. This approach has been extended to semiparametric regression models (see Akdeniz and Akdeniz Duran 2010; Akdeniz et al. 2015). Here, we use Liu estimators, which can be obtained by minimizing

$$\begin{aligned} ||y-\mathbf {X}\beta -\mathbf {N}g||^2+\lambda \int g''(t)^2 dt +||d\hat{\beta }-\beta ||^{2} \end{aligned}$$
(5)

Following Green and Silverman (1994), minimization of (5) can be carried out in two steps: first minimize over g subject to the interpolation constraints \(g(s_{j})=a_{j}, j=1,\ldots ,q\), and then minimize the result over the choice of g and \(\beta \). The problem of minimizing \(\int g''(t)^2 dt\) subject to g interpolating the given points \(g(s_{j})=a_{j}\) is solved by Green and Silverman (1994), and the minimizing g is a natural cubic spline with knots \(\{s_{j}\}\). There exists a matrix \(\mathbf {K}\), depending only on the knots \(\{s_{j}\}\), such that the minimized value of \(\int g''(t)^2 dt\) is \(g'\mathbf {K}g\). The criterion in (5) therefore takes the form

$$\begin{aligned} ||y-\mathbf {X}\beta -\mathbf {N}g||^2+\lambda g'\mathbf {K}g +||d\hat{\beta }-\beta ||^{2}. \end{aligned}$$
(6)

Minimizing (6) with respect to \(\beta \) and g, the LPLSEs, say \(\hat{\beta }_{d}\) and \(\hat{g}_{d}\), solve

$$\begin{aligned} \left( {\begin{array}{ll} {\mathbf {X}'}{\mathbf {X}}+{\mathbf {I}}_{p}&{}\quad {\mathbf {X}}'{\mathbf {N}} \\ {\mathbf {N}}'{\mathbf {X}}&{}\quad {\mathbf {N}}'{\mathbf {N}}+\lambda {\mathbf {K}} \\ \end{array} } \right) \left( {\begin{array}{c} {\beta } \\ g\\ \end{array} } \right) = \left( {\begin{array}{c} \mathbf {X}'{y}+d\hat{\beta } \\ \mathbf {N}'y \\ \end{array} } \right) , \end{aligned}$$
(7)

where \(\mathbf {I}_{p}\) is the \(p\times p\) identity matrix. From (7), by a simple calculation, the LPLSEs are

$$\begin{aligned} \hat{\beta }_{d}=\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] {y}+d\hat{{\beta }}\right) , \end{aligned}$$
(8)

and

$$\begin{aligned} \hat{g}_{d}=(\mathbf {N}'\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'( y-\mathbf {X}\hat{\beta }_{d}). \end{aligned}$$
(9)

Using (8) and (9), the vector of fitted values is

$$\begin{aligned} \hat{y}_{d}=\mathbf {X}\hat{\beta }_{d}+\mathbf {N}\hat{g}_{d}= (\mathbf {H}_{\beta }+\mathbf {H}_{g})y= \mathbf {H}_{d}y, \end{aligned}$$

where \(\mathbf {H}_{\beta }=\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p})\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}\right)^{-1} {\varvec{\tilde{\mathrm{X}}}}'\), \({\varvec{\tilde{\mathrm{X}}}}=(\mathbf {I}-\mathbf {S})\mathbf {X}\), \(\mathbf {H}_{g}=\mathbf {S}(\mathbf {I}-\mathbf {H}_{\beta })\) and \(\mathbf {H}_{d}\) is the hat matrix, which in expanded form is

$$\begin{aligned} \mathbf {H}_{d}=\mathbf {S}+{\varvec{\tilde{\mathrm{X}}}}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p})\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}\right)^{-1} {\varvec{\tilde{\mathrm{X}}}}'. \end{aligned}$$

The first part, \(\mathbf {S}\), is due to the nonparametric component of the model, and the second part is due to the linear component after adjusting for the former. The LPLSE residual vector is

$$\begin{aligned} {e}_{d}=y-\hat{y}_{d}=(\mathbf {I}-\mathbf {H}_{d})y. \end{aligned}$$
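
Continuing the sketch from Sect. 2.1 (same assumptions; lplse is our own name), the LPLSEs (8) and (9), the hat matrix \(\mathbf {H}_{d}\) and the residual vector \(e_{d}\) can be computed as:

```python
import numpy as np

def lplse(y, X, S, N, K, lam, d):
    """Liu penalized least squares estimators (8)-(9), the hat matrix H_d in
    its expanded form, and the residual vector e_d."""
    n, p = X.shape
    R_IS = np.eye(n) - S                              # I - S
    Xt = R_IS @ X                                     # X-tilde = (I - S)X
    A = X.T @ R_IS @ X                                # X'(I - S)X
    beta_hat = np.linalg.solve(A, Xt.T @ y)           # PLSE of beta
    beta_d = np.linalg.solve(A + np.eye(p), Xt.T @ y + d * beta_hat)
    g_d = np.linalg.solve(N.T @ N + lam * K, N.T @ (y - X @ beta_d))
    # H_d = S + X-tilde (A + I)^{-1} (A + d I) A^{-1} X-tilde'
    core = np.linalg.solve(A + np.eye(p), (A + d * np.eye(p)) @ np.linalg.solve(A, Xt.T))
    H_d = S + Xt @ core
    e_d = y - H_d @ y
    return beta_d, g_d, H_d, e_d
```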

The value d is called the Liu biasing parameter, or shrinkage parameter. Liu (1993) suggested choosing the biasing parameter d by minimizing a Mallows (1973)-type prediction criterion, which can be generalized to the present setting as

$$\begin{aligned} C_{d}={\textit{SSR}}_{d}/s^2+2tr(\mathbf {H}_{d})-(n-2). \end{aligned}$$
(10)

where \({\textit{SSR}}_{d}\) is the residual sum of squares and \(s^2=e_{d}'e_{d}/(n-p)\) is the estimator of \(\sigma ^2\) from the LPLSE fit of the SPRM.
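
A simple grid search for d based on (10) might look as follows; since it is not spelled out how \(s^{2}\) is held while d varies, we fix it at the PLSE value (note that \(d=1\) reproduces the PLSE), which is an assumption on our part:

```python
import numpy as np

def liu_cp(y, X, S, N, K, lam, d, s2):
    """Mallows-type criterion C_d of (10) for a given biasing parameter d."""
    n = len(y)
    _, _, H_d, e_d = lplse(y, X, S, N, K, lam, d)     # helper from the Sect. 2.2 sketch
    return (e_d @ e_d) / s2 + 2.0 * np.trace(H_d) - (n - 2)

def choose_d(y, X, S, N, K, lam, grid=None):
    """Grid search for the d minimizing C_d, with s^2 held fixed at the
    PLSE value (d = 1 reproduces the PLSE) -- an assumption on our part."""
    n, p = X.shape
    if grid is None:
        grid = np.linspace(0.01, 1.0, 100)
    _, _, _, e_plse = lplse(y, X, S, N, K, lam, 1.0)
    s2 = (e_plse @ e_plse) / (n - p)
    values = [liu_cp(y, X, S, N, K, lam, d, s2) for d in grid]
    return float(grid[int(np.argmin(values))])
```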

3 Local influence of LPLSEs

We begin with a brief sketch of the local influence approach suggested by Shi (1997) and Shi and Wang (1999), in which a generalized influence function and a generalized Cook statistic are defined to assess the local effect of a small perturbation on quantities of interest. The generalized influence function of a quantity of interest \(\mathbf {T} \in {\mathbb {R}}^{p}\) is given by

$$\begin{aligned} \textit{GIF} (\mathbf {T},l)=\underset{a \rightarrow 0}{\lim }\dfrac{\mathbf {T}(\omega _{0}+al)-\mathbf {T}(\omega _{0})}{a}, \end{aligned}$$

where \(\omega =\omega _{0}+al \in {\mathbb {R}}^{n}\) represents a perturbation, \(\omega _{0}\) is the null perturbation satisfying \(\mathbf {T}({\omega }_{0})=\mathbf {T}\), and l is a unit-length direction vector. To assess the influence of the perturbations on \(\mathbf {T}\), the generalized Cook statistic is defined as

$$\begin{aligned} \textit{GC}(\mathbf {T},l)=\dfrac{[\textit{GIF}(\mathbf {T},l)]'\mathbf {M}[\textit{GIF}(\mathbf {T},l)]}{c}, \end{aligned}$$

where \(\mathbf {M}\) is a \(p \times p\) positive definite matrix and c is a scalar. Maximizing the absolute value of \(GC(\mathbf {T}; l)\) with respect to l yields a direction \(l_{max}(\mathbf {T})\). This direction shows how to perturb the data to obtain the greatest local change in \(\mathbf {T}\) and is therefore the main diagnostic. The maximum value \(GC_{max}(\mathbf {T}) = GC(\mathbf {T}; l_{max})\) measures the severity of the local influence. The method does not require a likelihood; for a discussion of the method and its relationship with the Cook (1986) approach, see Shi (1997).
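
When the generalized influence function is linear in l, say \(\textit{GIF}(\mathbf {T},l)=\mathbf {G}l\), then \(GC(\mathbf {T},l)=l'\mathbf {G}'\mathbf {M}\mathbf {G}l/c\) and \(l_{max}(\mathbf {T})\) is the eigenvector of \(\mathbf {G}'\mathbf {M}\mathbf {G}\) with the largest absolute eigenvalue. A minimal helper (our own, not from Shi 1997):

```python
import numpy as np

def lmax_direction(G, M, c=1.0):
    """For GIF(T, l) = G l, GC(T, l) = l' (G'MG) l / c; return the unit vector l
    maximizing |GC|, i.e. the eigenvector of G'MG with largest absolute eigenvalue."""
    B = G.T @ M @ G / c
    B = (B + B.T) / 2.0                     # symmetrize against round-off
    eigval, eigvec = np.linalg.eigh(B)
    idx = int(np.argmax(np.abs(eigval)))
    return eigvec[:, idx], float(eigval[idx])
```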

3.1 Perturbing the variance

We first perturb the data by modifying the weight given to each case in the Liu penalized least squares criterion, which is equivalent to perturbing the variance of \(\epsilon _{i}\) in the model. Under this perturbation of the constant error variance, the criterion in (6) becomes

$$\begin{aligned} ||y-\mathbf {X}\beta _{\omega }- \mathbf {N}g_{\omega }||^{2}_{\mathbf {W}}+\lambda g_{\omega }'\mathbf {K}g_{\omega }+||d\hat{\beta }-\beta _{\omega }||^{2}, \end{aligned}$$
(11)

where \(\mathbf {W}=diag(\omega )\) is a diagonal matrix with diagonal elements \(\omega =(\omega _{1},\ldots ,\omega _{n})\) and \(||.||^{2}_{\mathbf {W}}\) is the weighted \(l^2\)-norm. Let \(\omega =\omega _{0}+al\), where \(\omega _{0}={1}\) is the n-vector of ones and \(l=(l_{1},\ldots ,l_{n})'\).

Theorem 1

Under the variance perturbation, the generalized influence functions of \(\hat{\beta }_{d}\) and \(\hat{g}_{d}\) are:

$$\begin{aligned} {\textit{GIF}(\hat{\beta }_{d},l)}=\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}( {l})\tilde{e}_{d}=\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(\tilde{e}_{d})l, \end{aligned}$$
$$\begin{aligned} {\textit{GIF}(\hat{g}_{d},l)}=\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}'\left( \mathbf {I}-\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}'\right) \mathbf {D} (\tilde{e}_{d})l , \end{aligned}$$

where \(\tilde{e}_{d}=\tilde{y}-\tilde{\mathbf {X}}\hat{\beta }_{d}\) and \(\tilde{{y}}=(\mathbf {I}-\mathbf {S}){y}\).

Proof

Minimizing (11), the perturbed versions of the LPLSEs \({\hat{\beta }}_{d}\) and \(\hat{g}_{d}\) are

$$\begin{aligned} \hat{\beta }_{d,\omega }=\left( \mathbf {X}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}( \mathbf {X}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] {y}+d\hat{{\beta }}), \end{aligned}$$
(12)

and

$$\begin{aligned} {\hat{g}}_{d,\omega }=(\mathbf {N}'\mathbf {W}\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'\mathbf {W}( y-\mathbf {X}\hat{\beta }_{d,\omega }), \end{aligned}$$
(13)

respectively.

Now, differentiating with respect to \(\omega \) and evaluating at \(a=0\), we obtain

$$\begin{aligned} \dfrac{\partial \left( {\mathbf {X}}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}}{\partial \omega }\mid _{a=0}= -\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}( {l}){\varvec{\tilde{\mathrm{X}}}}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}, \end{aligned}$$
(14)

and

$$\begin{aligned} \dfrac{\partial \left( \mathbf {X}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] y\right) }{\partial \omega }\mid _{a=0}={\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}( {l}){\tilde{y}},\qquad \qquad \qquad \qquad \qquad \quad \end{aligned}$$
(15)

Then, using (12), (14) and (15), we have

$$\begin{aligned} \dfrac{\partial \hat{\beta }_{\omega ,d}}{\partial \omega }\mid _{a=0}= & {} -\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+ \mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(l){\varvec{\tilde{\mathrm{X}}}}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}\nonumber \\&\times ( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] y+d\hat{\beta }) +\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(l) {\tilde{y}}\nonumber \\= & {} \left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(l)(\tilde{y}-{\varvec{\tilde{\mathrm{X}}}}\hat{\beta }_{d})\nonumber \\= & {} \left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(l)\tilde{e}_{d}, \end{aligned}$$
(16)

Similarly, from (13) and (16) we get

$$\begin{aligned} \dfrac{\partial \hat{g}_{\omega ,d}}{\partial \omega }\mid _{a=0}=-\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}'\ \mathbf {D}(l)\mathbf {S} e_{d}+\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}' \mathbf {D}(l) e_{d}\qquad \quad \nonumber \\ \qquad -\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}'\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}' \mathbf {D}(l)\tilde{e}_{d}\quad \qquad \qquad \nonumber \\ =\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}'\left( \mathbf {I}-\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}'\right) \mathbf {D}(l)\tilde{e}_{d}.\quad \end{aligned}$$
(17)

Therefore, (16) and (17) complete the proof. Analogous to case deletion, two versions of the generalized Cook statistic for \(\beta \) can be constructed as

$$\begin{aligned} GC_{1}(\hat{\beta }_{d},l)=l'\mathbf {D}(\tilde{e}_{d}){\varvec{\tilde{\mathrm{X}}}}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}\left( {\varvec{\tilde{\mathrm{X}}}}'{\varvec{\tilde{\mathrm{X}}}}\right) \qquad \qquad \qquad \qquad \nonumber \\ \times \left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}'\mathbf {D}(\tilde{e}_{d})l/s^{2}tr(\mathbf {H}_{\beta }) \qquad \qquad \nonumber \\ =l'\mathbf {A}_{\beta }l/s^{2}tr(\mathbf {H}_{\beta }),\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \quad \end{aligned}$$
(18)

and

$$\begin{aligned} GC_{2}(\hat{\beta }_{d},l)=l'\mathbf {D}(\tilde{e}_{d}){\varvec{\tilde{\mathrm{X}}}}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p}\right) ^{-1}(\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})({\varvec{\tilde{\mathrm{X}}}}'{\varvec{\tilde{\mathrm{X}}}})^{-1}\qquad \qquad \qquad \nonumber \\ \times (\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}) \left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}'\mathbf {D}(\tilde{e}_{d})l/s^{2}tr(\mathbf {H}_{\beta })\qquad \qquad \nonumber \\ =l'\mathbf {B}_{\beta }l/s^{2}tr(\mathbf {H}_{\beta }),\quad \quad \quad \qquad \qquad \qquad \qquad \qquad \qquad \quad \qquad \qquad \qquad \qquad \qquad \end{aligned}$$
(19)

In (18) the generalized Cook statistic is scaled by the matrix \(\mathbf {M}\) from the least squares regression framework, whereas in (19) it is scaled by the \(\mathbf {M}\) appropriate to the Liu version of the SPRM framework, using the fact that

$$\begin{aligned} cov(\hat{\beta }_{d})=\sigma ^{2}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p}\right) (\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}{\varvec{\tilde{\mathrm{X}}}}'{\varvec{\tilde{\mathrm{X}}}}\\(\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+d\mathbf {I}_{p}\right) \left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}. \end{aligned}$$

The generalized Cook statistic of \(\hat{g}_{d}\) is

$$\begin{aligned} GC_{g}(\hat{g}_{d},l)=l'\mathbf {D}(\tilde{e}_{d})(\mathbf {I}-\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}')\mathbf {N}\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\qquad \qquad \nonumber \\ (\mathbf {N}'\mathbf {N})\left[ \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right] ^{-1}\mathbf {N}\left( \mathbf {I}-\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}'\right) \mathbf {D}(\tilde{e}_{d})l/s^{2}tr(\mathbf {H}_{g})\nonumber \\ =l'\mathbf {D}(\tilde{e}_{d})||(\mathbf {I}-\mathbf {X}\left( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}{\varvec{\tilde{\mathrm{X}}}}')\mathbf {S}||^{2}\mathbf {D}(\tilde{e}_{d})l/s^{2}tr(\mathbf {H}_{g})\quad \quad \nonumber \\ =l'\mathbf {A}_{g}l/s^{2}tr(\mathbf {H}_{g}).\qquad \quad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \quad \end{aligned}$$
(20)

Therefore, the associated diagnostics, denoted by \(l^{1}_{max}(\hat{\beta }_{d})\), \(l^{2}_{max}(\hat{\beta }_{d})\) and \(l_{max}(\hat{g}_{d})\), are the eigenvectors corresponding to the largest absolute eigenvalues of the matrices \(\mathbf {A}_{\beta }\), \(\mathbf {B}_{\beta }\) and \(\mathbf {A}_{g}\), respectively. \(\square \)
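
As an illustration, the following sketch (reusing the lplse and lmax_direction helpers from the earlier sketches; all names are ours) forms the GIF matrices of Theorem 1 and extracts the diagnostic directions of (18)-(20) as leading eigenvectors; the scalar factors \(s^{2}tr(\mathbf {H}_{\beta })\) and \(s^{2}tr(\mathbf {H}_{g})\) are omitted since they do not affect the directions:

```python
import numpy as np

def variance_perturbation_diagnostics(y, X, S, N, K, lam, d):
    """Diagnostic directions l1_max(beta_d), l2_max(beta_d) and l_max(g_d) of
    (18)-(20) under the variance perturbation (Theorem 1)."""
    n, p = X.shape
    R_IS = np.eye(n) - S
    Xt = R_IS @ X                                       # X-tilde
    A = X.T @ R_IS @ X                                  # X'(I-S)X
    beta_d, g_d, H_d, e_d = lplse(y, X, S, N, K, lam, d)
    et = R_IS @ y - Xt @ beta_d                         # e-tilde_d
    D = np.diag(et)

    # GIF matrix of beta_d: (A + I)^{-1} X-tilde' D(e-tilde_d)
    G_beta = np.linalg.solve(A + np.eye(p), Xt.T @ D)
    M1 = Xt.T @ Xt                                      # scaling used in (18)
    C = np.linalg.solve(A + np.eye(p), A + d * np.eye(p)) @ np.linalg.inv(A)
    M2 = np.linalg.inv(C @ M1 @ C.T)                    # [cov(beta_d)/sigma^2]^{-1}, used in (19)

    # GIF matrix of g_d: (N'N + lam K)^{-1} N'(I - X (A+I)^{-1} X-tilde') D(e-tilde_d)
    NtN = N.T @ N
    G_g = np.linalg.solve(NtN + lam * K,
                          N.T @ (np.eye(n) - X @ np.linalg.solve(A + np.eye(p), Xt.T)) @ D)

    l1, _ = lmax_direction(G_beta, M1)                  # eigenvector of A_beta in (18)
    l2, _ = lmax_direction(G_beta, M2)                  # eigenvector of B_beta in (19)
    lg, _ = lmax_direction(G_g, NtN)                    # eigenvector of A_g in (20)
    return l1, l2, lg
```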

3.2 Perturbing the explanatory variables

It is known that a minor perturbation of the explanatory variables can seriously influence the least squares results when collinearity is present (Cook 1986, p. 147). This section considers the influence that perturbation of the explanatory variables has on the LPLSEs. We write \(\mathbf {X}=[x_{1},\ldots ,x_{p}]\), in which \(x_{i}, i=1,\ldots ,p\), are the vectors of explanatory variables, and we refer to \(\mathbf {X}_{\omega }\) as the matrix \(\mathbf {X}\) after perturbation of the ith column. Therefore,

$$\begin{aligned} \mathbf {X}_{\omega }=\mathbf {X}+as_{i}l\xi '_{i} \end{aligned}$$

where \(\xi _{i}\) is a \(p\times 1\) vector with a 1 in the ith position and zeros elsewhere, and \(s_{i}\) denotes a scale factor that accounts for the different measurement units associated with the columns of \(\mathbf {X}\).

Theorem 2

Under the perturbation of the explanatory variables, the generalized influence functions of \(\hat{\beta }_{d}\) and \(\hat{g}_{d}\) are:

$$\begin{aligned} \textit{GIF}(\hat{\beta }_{d},l)=s_{i}\left( X'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p}\right) ^{-1}\left( \xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}'\right) l, \end{aligned}$$
$$\begin{aligned} \textit{GIF}(\hat{g}_{d},l)=-s_{i}(\mathbf {N}'\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'\left[ \hat{\beta }_{d,i}\mathbf {I}+\mathbf {X}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}(\xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}')\right] l. \end{aligned}$$

Proof

Under the perturbation of the ith column, the LPLSEs obtained from (6) are

$$\begin{aligned} \hat{\beta }_{\omega ,d}=\left( \mathbf {X}'_{\omega }\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}_{\omega }+\mathbf {I}_{p}\right) ^{-1}\left( \mathbf {X}'_{\omega }\left[ \mathbf {I}-\mathbf {S}\right] y+d\hat{\beta }\right) , \end{aligned}$$
(21)

and

$$\begin{aligned} \hat{g}_{d,\omega }=\left( \mathbf {N}'\mathbf {N}+\lambda \mathbf {K}\right) ^{-1}\mathbf {N}'\left( y-\mathbf {X}_{\omega }\hat{\beta }_{w,d}\right) . \end{aligned}$$
(22)

Since

$$\begin{aligned} \mathbf {X}'_{\omega }\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}_{\omega }=\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+as_{i}\xi _{i}l'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+as_{i}\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] l\xi _{i}'+a^{2}s_{i}^{2}\xi _{i}l'\left[ \mathbf {I}-\mathbf {S}\right] l\xi _{i}' \end{aligned}$$

and \(\mathbf {X}'_{\omega }\left[ \mathbf {I}-\mathbf {S}\right] y=\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] y+as_{i}\xi _{i}l'\left[ \mathbf {I}-\mathbf {S}\right] y,\) it is easy to obtain that

$$\begin{aligned}&\left( \mathbf {X}'_{\omega }\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}_{\omega }+\mathbf {I}_{p}\right) ^{-1}=( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\nonumber \\&\quad -\,as_{i}\bigg ( ( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}{\varvec{\tilde{\mathrm{X}}}}'l\xi _{i}'( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\nonumber \\&\quad +( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\xi _{i}l' {\varvec{\tilde{\mathrm{X}}}}(\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\bigg )+o(a^2). \end{aligned}$$
(23)

Hence, using (23), (21) becomes

$$\begin{aligned} \hat{\beta }_{\omega ,d}=\hat{\beta }_{d}+as_{i}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}(\xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}')l+o(a^{2}). \end{aligned}$$
(24)

Substituting (24) into (22) and carrying out similar calculations, we have

$$\begin{aligned}&\hat{g}_{\omega ,d}=\hat{g}_{d}-(\mathbf {N}'\mathbf {N}+\lambda \mathbf {K})^{-1}\mathbf {N}'as_{i}\nonumber \\&\qquad \qquad \times \left[ \hat{\beta }_{d,i}\mathbf {I}+\mathbf {X}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}(\xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}')\right] l-o(a^2)\qquad . \end{aligned}$$
(25)

Differentiating (24) and (25) with respect to a at \(a=0\) completes the proof. Analogous to Sect. 3.1, the generalized Cook statistics can be written as

$$\begin{aligned} GC_{\beta _{\omega ,d}}=s_{i}^2l'(\tilde{e}_{d}\xi _{i}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}})( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] X+\mathbf {I}_{p})^{-1}\mathbf {X}'\mathbf {X} ( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\nonumber \\ \times (\xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}')l/s^2tr(\mathbf {H}_{\beta }),\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \end{aligned}$$
(26)

and

$$\begin{aligned} \textit{GC}_{g_{\omega ,d}}=s_{i}^2l'(\tilde{e}_{d}\xi _{i}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}})(\hat{\beta }_{d,i}\mathbf {I}+( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\mathbf {X}')\qquad \qquad \quad \qquad \nonumber \\ \times \mathbf {S}\left[ \hat{\beta }_{d,i}\mathbf {I}+\mathbf {X}( \mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}(\xi _{i}\tilde{e}_{d}'-\hat{\beta }_{d,i}{\varvec{\tilde{\mathrm{X}}}}')\right] l/s^2tr(\mathbf {H}_{g}).\nonumber \\ \end{aligned}$$
(27)

The diagnostic direction \(l_{max}\) is the eigenvector corresponding to the largest absolute eigenvalue of the matrix in (26) or (27), respectively. \(\square \)
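
A corresponding sketch for the perturbation of a single column of \(\mathbf {X}\) follows (again with our own helper names). Because the scaling matrices in (26) and (27) are only implicit, we take \(\mathbf {M}=\mathbf {X}'\mathbf {X}\) for \(\hat{\beta }_{d}\) and \(\mathbf {M}=\mathbf {N}'\mathbf {N}\) for \(\hat{g}_{d}\), and the default scale factor \(s_{i}\) is set to the sample standard deviation of the ith column; both are assumptions on our part:

```python
import numpy as np

def column_perturbation_diagnostics(y, X, S, N, K, lam, d, i, s_i=None):
    """Diagnostic directions under perturbation of the i-th column of X
    (Theorem 2); the Cook-type matrices are built as GIF' M GIF with
    M = X'X for beta_d and M = N'N for g_d (our reading of (26)-(27))."""
    n, p = X.shape
    R_IS = np.eye(n) - S
    Xt = R_IS @ X
    A = X.T @ R_IS @ X
    beta_d, g_d, H_d, e_d = lplse(y, X, S, N, K, lam, d)
    et = R_IS @ y - Xt @ beta_d                         # e-tilde_d
    if s_i is None:
        s_i = float(X[:, i].std(ddof=1))                # scale factor: an assumption
    xi = np.zeros(p)
    xi[i] = 1.0
    core = np.outer(xi, et) - beta_d[i] * Xt.T          # xi e-tilde_d' - beta_{d,i} X-tilde'
    G_beta = s_i * np.linalg.solve(A + np.eye(p), core)
    G_g = -s_i * np.linalg.solve(N.T @ N + lam * K,
            N.T @ (beta_d[i] * np.eye(n) + X @ np.linalg.solve(A + np.eye(p), core)))
    l_beta, _ = lmax_direction(G_beta, X.T @ X)         # direction for beta_d, cf. (26)
    l_g, _ = lmax_direction(G_g, N.T @ N)               # direction for g_d, cf. (27)
    return l_beta, l_g
```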

4 Assessing influence on the selection of Liu parameter d

In this section, local influence analysis is used to detect observations that may have a serious influence on the estimation of d. The selection criterion is given in (10), and the perturbation scheme is (11). Let \(C_{d,\omega }\), \({\textit{SSR}}_{d,\omega }\) and \(\mathbf {H}_{d,\omega }\) denote the perturbed versions of \(C_{d}\), \({\textit{SSR}}_{d}\) and \(\mathbf {H}_{d}\), respectively, and let \(\hat{d}_{\omega }\) denote the estimator of d obtained by minimizing

$$\begin{aligned} C_{d,\omega }={\textit{SSR}}_{d,\omega }/s^2+2tr(\mathbf {H}_{d,\omega })-(n-2). \end{aligned}$$

Then \(l_{max}(\hat{d})\), the main diagnostic direction of local influence for \(\hat{d}\), has the form

$$\begin{aligned} l_{max}(\hat{d})\propto \dfrac{\partial \hat{d}_{\omega }}{\partial \omega }\mid _{\omega =\omega _{0}}. \end{aligned}$$

Since \(C_{d,\omega }\) achieves a local minimum at \(\hat{d}_{\omega }\), we have

$$\begin{aligned} \dfrac{\partial C_{d,\omega }}{\partial d}\mid _{d=\hat{d}_{\omega }}=0. \end{aligned}$$
(28)

Differentiating both sides of (28) with respect to \(\omega \) and evaluating at \(\omega _{0}\), we obtain

$$\begin{aligned} \dfrac{\partial ^{2} C_{d,\omega }}{\partial d^{2}}\dfrac{\partial \hat{d}_{\omega }}{\partial \omega }\mid _{\omega =\omega _{0},d=\hat{d}}+\dfrac{\partial ^{2} C_{d,\omega }}{\partial \omega \partial d}\mid _{\omega =\omega _{0},d=\hat{d}}=0. \end{aligned}$$

We can get the following relation

$$\begin{aligned} \dfrac{\partial \hat{d}_{\omega }}{\partial \omega }\mid _{\omega =\omega _{0}}=-\Delta /\ddot{C}_{d}, \end{aligned}$$

where \(\Delta =\partial ^{2}C_{d,\omega }/\partial d\partial \omega \mid _{\omega =\omega _{0},d=\hat{d}}\) and \(\ddot{C}_{d}=\partial ^{2}C_{d,\omega }/\partial d^{2}\mid _{\omega =\omega _{0},d=\hat{d}}\). Under the perturbation of variance, the residual sum of squares \({\textit{SSR}}_{d,\omega }\) and the hat matrix \(\mathbf {H}_{d,\omega }\) of the LPLSEs become \({\textit{SSR}}_{d,\omega }=(y-\mathbf {X}\hat{\beta }_{d,\omega }-\mathbf {N}\hat{g}_{d,\omega })'\mathbf {W}(y-\mathbf {X}\hat{\beta }_{d,\omega }-\mathbf {N}\hat{g}_{d,\omega })\) and

$$\begin{aligned} \mathbf {H}_{d,\omega }=\mathbf {S}_{\omega }\mathbf {W}+(\mathbf {I}-\mathbf {S}_{\omega }\mathbf {W})\mathbf {X}(\mathbf {X}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\qquad \quad \\\times (\mathbf {X}'\left[ \mathbf {W}-\mathbf {W}\mathbf {S}_{\omega }\mathbf {W}\right] +d(\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}\mathbf {X}'\left[ \mathbf {I}-\mathbf {S}\right] )\qquad . \end{aligned}$$

By standard matrix differentiation, we get

$$\begin{aligned} \dfrac{\partial {\textit{SSR}}_{d,\omega }}{\partial \omega }\mid _{\omega =\omega _{0}} \approx e^{2}_{i,d}, \end{aligned}$$

and

$$\begin{aligned} \dfrac{\partial e_{d,\omega }}{\partial d}=-{\varvec{\tilde{\mathrm{X}}}}(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] y\\ =-{\varvec{\tilde{\mathrm{X}}}}(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+I)^{-1}\hat{\beta },\qquad \qquad \qquad \quad \qquad \qquad \qquad \quad \end{aligned}$$

where \(\hat{\beta }\) is the PLSE of \(\beta \). It follows that

$$\begin{aligned} \dfrac{\partial ^{2} {\textit{SSR}}_{d,\omega }}{\partial \omega \partial d}\mid _{\omega =\omega _{0},d=\hat{d}}=-2e_{i,d}\tilde{x}_{i}'(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\hat{\beta }. \end{aligned}$$

A similar matrix differentiation for \(tr(\mathbf {H}_{d,\omega })\) gives

$$\begin{aligned} \dfrac{\partial ^{2} tr(\mathbf {H}_{d,\omega })}{\partial \omega \partial d}\mid _{\omega =\omega _{0},d=\hat{d}}= -\left( S_{i}'+\tilde{x}_{i}'(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}{\varvec{\tilde{\mathrm{X}}}}'\right) {\varvec{\tilde{\mathrm{X}}}}(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\\\times (\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}\tilde{x}_{i},\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \end{aligned}$$

where \(S_{i}'\) is the ith row of \(\mathbf {S}\). Therefore, the ith element of \(l_{max}( \hat{d})\) is given by

$$\begin{aligned} l^{(i)}_{max}( \hat{d})\propto \Delta= & {} -e_{i,d}\tilde{x}_{i}'(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}\hat{\beta }/s^{2}\nonumber \\&-( S_{i}'+\tilde{x}_{i}'(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I}_{p})^{-1}{\varvec{\tilde{\mathrm{X}}}}'){\varvec{\tilde{\mathrm{X}}}}(\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X}+\mathbf {I})^{-1}\nonumber \\&\times (\mathbf {X}' \left[ \mathbf {I}-\mathbf {S}\right] \mathbf {X})^{-1}\tilde{x}_{i}. \end{aligned}$$
(29)
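
The elements of \(l_{max}(\hat{d})\) in (29) can be computed case by case, as in the sketch below (our own helper names again); \(s^{2}\) is taken from the LPLSE residuals as in Sect. 2.2, and the resulting vector is normalized to unit length since (29) only determines the direction up to proportionality:

```python
import numpy as np

def lmax_liu_parameter(y, X, S, N, K, lam, d):
    """Elements of l_max(d_hat) from (29): local influence of each case on the
    selection of the Liu parameter d (normalized, since (29) is a proportionality)."""
    n, p = X.shape
    R_IS = np.eye(n) - S
    Xt = R_IS @ X
    A = X.T @ R_IS @ X
    A_inv = np.linalg.inv(A)
    AI_inv = np.linalg.inv(A + np.eye(p))
    beta_plse = A_inv @ (Xt.T @ y)                      # PLSE of beta
    beta_d, g_d, H_d, e_d = lplse(y, X, S, N, K, lam, d)
    s2 = (e_d @ e_d) / (n - p)                          # sigma^2 estimate, Sect. 2.2
    delta = np.empty(n)
    for i in range(n):
        xt_i = Xt[i]                                    # i-th row of X-tilde
        term1 = -e_d[i] * (xt_i @ AI_inv @ beta_plse) / s2
        term2 = -(S[i] + xt_i @ AI_inv @ Xt.T) @ Xt @ AI_inv @ A_inv @ xt_i
        delta[i] = term1 + term2
    return delta / np.linalg.norm(delta)
```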

5 Numerical illustration

5.1 Simulation study

A simulation study was carried out to evaluate the performance of the proposed method in different situations. To achieve different degrees of collinearity, following McDonald and Galarneau (1975), the explanatory variables were generated using the following device:

$$\begin{aligned} x_{ij}=(1-\gamma ^{2})^{1/2}z_{ij}+\gamma z_{i4}\quad i=1,\ldots ,n \quad j=1,\ldots ,3 \end{aligned}$$

where the \(z_{ij}\) are independent standard normal pseudo-random numbers and \(\gamma \) is specified so that the correlation between any two explanatory variables is \(\gamma ^2\). Three different degrees of correlation corresponding to \(\gamma =\) 0.80, 0.90 and 0.99 are considered. Then n observations for the dependent variable are generated by

$$\begin{aligned} y_{i}=\beta _{1}x_{i1}+\beta _{2}x_{i2}+\beta _{3}x_{i3}+g(t_{i})+\epsilon _{i} \quad i=1 ,\ldots , n. \end{aligned}$$

with \(g(t_{i})=cos(2\pi t_{i})\) and \(t_{i}\sim U(0,1)\), where U(0, 1) denotes the uniform distribution on the interval (0, 1). We vary the sample size with \(n=15, 30\) and \(n=50\). For ridge regression, Newhouse and Oman (1971) stated that if the mean squared error is a function of \(\beta \), \(\sigma \) and the ridge parameter, and if the explanatory variables are fixed, then the mean squared error is minimized when \(\beta \) is the normalized eigenvector corresponding to the largest eigenvalue of \(\mathbf {X}'\mathbf {X}\), subject to the constraint \(\beta '\beta =1\). Accordingly, we select the coefficients \(\beta _{1}, \beta _{2}\) and \(\beta _{3}\) as the normalized eigenvector corresponding to the largest eigenvalue of \(\mathbf {X}'(\mathbf {I}-\mathbf {S})\mathbf {X}\), so that \(\beta '\beta =1\). An outlier is created by adding \(\nu \) to the response \(y_{10}\), i.e., \(y_{10} = y_{10}+\nu \), where \(\nu \) equals the standard deviation of the response y. We calculate the diagnostic measures \(l^{1}_{max}(\hat{\beta }_{d})\), \(l^{2}_{max}(\hat{\beta }_{d})\) and \(l_{max}(\hat{g}_{d})\) for the different data sets; the results are shown in Table 1. It is easily seen from Table 1 that case 10 is the most influential observation. For example, for \(n=15\) with \(\gamma =0.8\), \(l^{1}_{max}(\hat{\beta }_{d})\), \(l^{2}_{max}(\hat{\beta }_{d})\) and \(l_{max}(\hat{g}_{d})\) attain their maxima at case 10, while all other observations have values less than 0.104, 0.153 and 0.096, respectively. These results imply that the proposed diagnostic measures can identify the potential outlier.

Table 1 Influence analysis of the simulated data
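
For completeness, the following sketch generates data according to the scheme described above; the random seed, the use of \(\mathbf {X}'\mathbf {X}\) as a stand-in for \(\mathbf {X}'(\mathbf {I}-\mathbf {S})\mathbf {X}\) when choosing \(\beta \) (the latter depends on \(\lambda \) and the knots), and the exact construction of the outlier are our own choices:

```python
import numpy as np

def simulate_dataset(n=15, gamma=0.8, seed=0):
    """Collinear SPRM data following the McDonald and Galarneau (1975) device,
    with an outlier added to case 10 as described in the text."""
    rng = np.random.default_rng(seed)
    Z = rng.standard_normal((n, 4))
    X = np.sqrt(1.0 - gamma**2) * Z[:, :3] + gamma * Z[:, [3]]   # pairwise corr ~ gamma^2
    t = rng.uniform(0.0, 1.0, size=n)
    g = np.cos(2.0 * np.pi * t)
    # beta: normalized eigenvector for the largest eigenvalue; X'X is used as a
    # simple stand-in for X'(I-S)X, which would require lambda and the knots.
    _, eigvec = np.linalg.eigh(X.T @ X)
    beta = eigvec[:, -1]                                         # beta'beta = 1
    y = X @ beta + g + rng.standard_normal(n)
    y[9] += y.std(ddof=1)                                        # outlier at case 10 (1-based)
    return y, X, t, beta
```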

5.2 Real data

The Longley (1967) data consist of seven economic variables: \(x_{1}=\) GNP implicit price deflator, \(x_{2}=\) Gross National Product, \(x_{3}=\) number of people in the armed forces, \(x_{4}=\) number of unemployed, \(x_{5}=\) population, \(x_{6}=\) year and \(y=\) number of people employed. These data have often been used to illustrate the effect of extreme multicollinearity on the ordinary least squares estimators. The scaled condition number (see Walker and Birch 1988) of this data set is 43,275; this large value indicates an unusually high level of collinearity. Cook (1977) applied Cook's distance to these data and found that cases 5, 16, 4, 10 and 15 (in this order) were the most influential observations for OLS. Walker and Birch (1988) analysed the same data to find anomalous observations in ORR using case deletion influence measures; they found that cases 16, 10, 4, 15 and 5 (in this order) were the most influential observations according to the Cook's distance and DFFITS measures. Using the local influence approach, Shi and Wang (1999) found that cases 10, 4, 5 and 15 were the most influential observations for ridge estimation, and Jahufer and Chen (2012) found that cases 4, 10, 1, 5 and 6, in this order, were the five most influential observations for the Liu estimator in ordinary regression. Recently, Emami (2015) used the same data to identify influential cases in the ridge semiparametric regression model; by the case deletion method he identified cases 12, 16, 2 and 5 as the most influential.

In this section, we use this data set to illustrate the methods suggested in this article. The influence of observations on the LPLSEs of the SPRM is studied through small perturbations, so that the influence of different aspects of the model can be assessed. We fit model (1) to the data with \(\mathbf {X}=[x_{1},x_{2},x_{4},x_{5},x_{6}]\) and \(g(t_{i})=g(x_{3})\). The parameter \(\lambda \) in this model is 0.007, obtained by minimizing the GCV criterion. The estimate of the nonparametric function for the Longley data with \(d=0.985\) is shown in Fig. 1. First, we consider the variance perturbation. The index plots of \(l^{1}_{max}(\hat{\beta }_{d})\) and \(l_{max}(\hat{g}_{d})\) are shown in Fig. 2 (the index plot of \(l^{2}_{max}(\hat{\beta }_{d})\) for \(d=0.985\) has a structure similar to that of \(l^{1}_{max}(\hat{\beta }_{d})\)). In Fig. 2a, cases 16, 10, 15 and 5 are the four most influential cases for \(\hat{\beta }_{d}\), whereas the largest absolute components of \(l_{max}(\hat{g}_{d})\) in Fig. 2b direct attention to cases 10, 16, 2 and 12, in that order, for \(\hat{g}_{d}\). The locally influential observations are therefore slightly different from those found by case deletion, partly because local influence considers the joint influence of the observations rather than the influence of individual cases. Second, we consider the perturbation of individual explanatory variables. The maximum values of \(l^{1}_{max}(\hat{\beta }_{d})\) for separately perturbing the explanatory variables \(x_{j}\) \((j=1,2,4,5\) and 6) are 10.28, 0.761, 5.43, 2.11 and 1.21, respectively, and the maximum values of \(l_{max}(\hat{g}_{d})\) are 8.91, 0.44, 3.28, 0.87 and 0.96, respectively. Hence the local change caused by perturbing \(x_{1}\) is by far the largest, while the local changes caused by perturbing the other explanatory variables are of roughly the same, smaller order. The index plots of \(l^{1}_{max}(\hat{\beta }_{d})\) and \(l_{max}(\hat{g}_{d})\) under perturbation of \(x_{j}\), \(j=1,4,5\) and 6, are shown in Figs. 3 and 4, respectively. Note that in these figures the vertical scales have been chosen identically (up to sign). From Fig. 3 it is clear that the LPLSE \(\hat{\beta }_{d}\) is sensitive to the values of \(x_{1}, x_{4}\) and \(x_{5}\) at cases 1, 11 and 16, and to the values of \(x_{5}\) at cases 5, 10 and 14. Also, from Fig. 4, the LPLSE \(\hat{g}_{d}\) is sensitive to the values of \(x_{1}, x_{4}, x_{5}\) and \(x_{6}\) at cases 2, 6 and 12. Finally, we computed the \(l_{max}(\hat{d})\) values using (29). From the index plot of \(l_{max}(\hat{d})\) shown in Fig. 5, cases 2, 10, 5, 15 and 16, in this order, are the five most influential observations on the Liu parameter.
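
The Longley analysis can be set up along the following lines, reusing the earlier sketches and assuming the statsmodels copy of the Longley data with its usual column names (TOTEMP, GNPDEFL, GNP, UNEMP, ARMED, POP, YEAR); this is an illustrative sketch, not the authors' code:

```python
import numpy as np
import statsmodels.api as sm

def longley_sprm_diagnostics(lam=0.007, d=0.985):
    """Set up the SPRM for the Longley data (linear part x1, x2, x4, x5, x6;
    nonparametric part g(x3), the armed forces variable) and run the diagnostics."""
    data = sm.datasets.longley.load_pandas().data
    y = data["TOTEMP"].to_numpy(dtype=float)
    X = data[["GNPDEFL", "GNP", "UNEMP", "POP", "YEAR"]].to_numpy(dtype=float)
    t = data["ARMED"].to_numpy(dtype=float)
    # The paper's lambda = 0.007 is passed through unchanged; in practice lambda
    # is scale dependent and would be re-chosen by GCV.
    _, _, S, N, K = plse(y, X, t, lam)                            # Sect. 2.1 sketch
    l1, l2, lg = variance_perturbation_diagnostics(y, X, S, N, K, lam, d)
    ld = lmax_liu_parameter(y, X, S, N, K, lam, d)
    return l1, lg, ld                                             # index plots as in Figs. 2 and 5
```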

Fig. 1 Estimate of the nonparametric function for the Longley data with \(d=0.985\)

Fig. 2 a (left panel) Index plot of \(l^{1}_{max}(\hat{\beta }_{d})\) with \(d = 0.985\). b (right panel) Index plot of \(l_{max}(\hat{g}_{d})\) with \(d = 0.985\)

Fig. 3 Index plot of \(l_{max}(\hat{\beta }_{d})\) for separately perturbing \(x_{1}, x_{4}, x_{5}\) and \(x_{6}\)

Fig. 4 Index plot of \(l_{max}(\hat{g}_{d})\) for separately perturbing \(x_{1}, x_{4}, x_{5}\) and \(x_{6}\)

Fig. 5 Index plot of \(l_{max}(\hat{d})\) with \(d=0.985\)

6 Conclusion

Local influence diagnostics consider the joint influence of the observations and are therefore useful for identifying influential patterns in a data set. In this paper, we have studied several local influence diagnostic measures that are practical and can play a considerable role in the analysis of LPLSEs in SPRMs. Instead of case deletion, we use the local influence method to detect influential observations. By perturbing different aspects of the model, the influence of the data on the LPLSEs of SPRMs can be studied. The proposed techniques provide the practitioner with numerical and graphical results that complement the analysis, and we believe the local influence diagnostics derived here can be useful as part of any serious data analysis. All the proposed measures are functions of the residuals, the leverages and the LPLSEs. Furthermore, we study the influence of observations on the selection of d, which is also important in Liu-type regression models. Although no conventional cutoff points are introduced or developed for the Liu estimator local influence diagnostics, the index plot is a simple and conventional device for disclosing influential cases.