1 Introduction

When outliers are present in a data set, a least-squares (LS) adjustment may not be possible or may produce poor or invalid results (Wolf and Ghilani 1997). Many approaches have been developed to mitigate or even eliminate the deteriorating effect of outlying observations on the parameter estimates (Cook 1977; Koch 1999; Monhor and Verö 2011), although there is no universally accepted definition of an outlier (Barnett and Lewis 1994; Monhor and Takemoto 2005; Monhor and Verö 2011).

There are two essential approaches to counter the corrupting effects of outliers: the conventional outlier detection test procedures developed in the geodetic literature (Baarda 1968; Pope 1976) and robust methods (Huber 1981; Hampel et al. 1986; Rousseeuw and Leroy 1987; Koch 1999; Yang 1999; Hekimoglu and Koch 2000; Xu 2005; Hekimoglu 2005). However, the conventional test procedures are only applicable under the assumption that no more than one outlier is present. In the case of multiple outliers, the most practical strategy is to employ the iterative data snooping presented by Kok (1984), whilst procedures for detecting all outliers at once have also been proposed (Hadi and Simonoff 1993; Snow and Schaffrin 2003; Baselga 2011).

To evaluate the influence of one or more observations on the adjustment outputs, deletion diagnostics have been extensively adopted (Cook 1977, 1979; Chatterjee and Hadi 1988). There are two ways to implement the diagnostics: the underlying observation(s) can be deleted either explicitly or implicitly. The explicit approach is the case-deletion model, whereas the implicit one is referred to as the mean-shift outlier model (Hekimoglu et al. 2012). The aim of this contribution is twofold: first, to prove the equivalence of these two methods; second, to address the influence of outlying observations on the quality measures.

The paper is organized as follows: the equivalence of the two multiple outlier detection models is investigated, followed by computational considerations in implementing the mean-shift outlier model. Furthermore, theoretical analyses show that the precision, the Minimal Detectable Bias (MDB) measure and the Dilution of Precision (DOP) metric are all over-optimistic when outlying observations should have been taken into account but were neglected.

2 Model description

Let us consider a linear Gauss-Markov model defined by Koch (1999)

$$ E(\boldsymbol{L}) = \boldsymbol{AX}\quad \mbox{with}\ \operatorname {Cov}(\boldsymbol{L}) = \sigma _{0}^{2}\boldsymbol{P}^{ -1}, $$
(1)

where L is the n×1 vector of observations, A the n×u design matrix with full column rank, and X the u×1 vector of unknowns. \(\sigma _{0}^{2}\) is the a priori variance factor of unit weight, and P the symmetric positive-definite weight matrix. Whenever necessary, the observations are assumed to be normally distributed.

Then, the (weighted) LS estimate of the unknowns in model Eq. (1) reads (Koch 1999)

$$ \hat{\boldsymbol{X}} = \bigl(\boldsymbol{A}^{T}\boldsymbol{PA} \bigr)^{ - 1}\boldsymbol{A}^{T}\boldsymbol{PL} $$
(2)

The corresponding residual vector is readily obtained as

$$ \boldsymbol{V} = \boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}} = \boldsymbol{RL} $$
(3)

where \(\boldsymbol{R} = \boldsymbol{I}_{n} - \boldsymbol{A}(\boldsymbol{A}^{T}\boldsymbol{PA})^{-1}\boldsymbol{A}^{T}\boldsymbol{P}\) maps the original observational vector onto the residual vector as a result of the LS adjustment (Schaffrin 1997; Guo et al. 2011). The matrix R plays an important role in linear adjustment techniques since it contains extremely useful information (Huber 1981; Guo et al. 2007, 2010). One can easily verify that R is idempotent and has the following useful properties

$$ \boldsymbol{R}^{T}\boldsymbol{P} = \boldsymbol{PR} = \boldsymbol{R}^{T}\boldsymbol{PR},\qquad \boldsymbol{RA} = \boldsymbol{O},\qquad \boldsymbol{A}^{T}\boldsymbol{PR} = \boldsymbol{O} $$
(4)

The weighted sum of squares of the LS residuals reads

$$ \varOmega = \boldsymbol{V}^{T}\boldsymbol{PV} = \boldsymbol{L}^{T} \boldsymbol{PRL} $$
(5)
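
As a quick numerical illustration of Eqs. (2)-(5), the following minimal NumPy sketch builds a small synthetic adjustment problem (the dimensions, the random design matrix A, the diagonal weight matrix P and the observation vector L are all illustrative assumptions, not data from this paper) and checks the properties of R stated in Eq. (4):

```python
import numpy as np

rng = np.random.default_rng(42)
n, u = 8, 3                            # number of observations and unknowns
A = rng.normal(size=(n, u))            # design matrix with full column rank
P = np.diag(rng.uniform(0.5, 2.0, n))  # positive-definite weight matrix
L = rng.normal(size=n)                 # observation vector

N_inv = np.linalg.inv(A.T @ P @ A)
X_hat = N_inv @ A.T @ P @ L            # Eq. (2): weighted LS estimate
R = np.eye(n) - A @ N_inv @ A.T @ P
V = R @ L                              # Eq. (3): residual vector

# Eq. (4): R is idempotent, RA = O, and P R is symmetric
assert np.allclose(R @ R, R)
assert np.allclose(R @ A, 0)
assert np.allclose(R.T @ P, P @ R)

Omega = V @ P @ V                      # Eq. (5): weighted SSR
assert np.allclose(Omega, L @ P @ R @ L)
```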

3 Multiple outlier detection models

As is well known, the LS method is very susceptible to outliers (Wolf and Ghilani 1997; Koch 1999; Guo et al. 2010). There are two procedures to implement the deletion diagnostics, namely, the case-deletion model and the mean-shift outlier model.

Let us assume the \(i_{1}\)th, \(i_{2}\)th, …, \(i_{m}\)th observations are to be deleted, while the \(i_{m+1}\)th, \(i_{m+2}\)th, …, \(i_{n}\)th observations are the remaining ones.

3.1 Mean-shift outlier model

For convenience we introduce the following notations,

$$ \boldsymbol{H}_{b} = (\boldsymbol{h}_{i_{1}}, \boldsymbol{h}_{i_{2}}, \ldots,\boldsymbol{h}_{i_{m}}),\qquad \boldsymbol{H}_{r} = (\boldsymbol{h}_{i_{m + 1}}, \boldsymbol{h}_{i_{m + 2}}, \ldots,\boldsymbol{h}_{i_{n}}) $$
(6)

where \(\boldsymbol{h}_{i}\) denotes the \(i\)th n-dimensional canonical unit vector, having a 1 as its ith entry and zeros otherwise. It can be seen that \((\boldsymbol{H}_{b},\boldsymbol{H}_{r})\) is a permutation matrix (Strang and Borre 1997). Since a permutation matrix is orthogonal, one can obtain

$$ (\boldsymbol{H}_{b},\boldsymbol{H}_{r}) ( \boldsymbol{H}_{b},\boldsymbol{H}_{r})^{T} = \boldsymbol{H}_{b}\boldsymbol{H}_{b}^{T} + \boldsymbol{H}_{r}\boldsymbol{H}_{r}^{T} = \boldsymbol{I}_{n} $$
(7)

and

$$(\boldsymbol{H}_{b},\boldsymbol{H}_{r})^{T}( \boldsymbol{H}_{b},\boldsymbol{H}_{r}) = \left ( \begin{array}{c@{\quad}c} \boldsymbol{H}_{b}^{T}\boldsymbol{H}_{b} & \boldsymbol{H}_{b}^{T}\boldsymbol{H}_{r} \\[3pt] \boldsymbol{H}_{r}^{T}\boldsymbol{H}_{b} & \boldsymbol{H}_{r}^{T}\boldsymbol{H}_{r} \end{array} \right ) = \boldsymbol{I}_{n} $$

it follows immediately that

$$ \boldsymbol{H}_{b}^{T}\boldsymbol{H}_{b} = \boldsymbol{I}_{m},\qquad \boldsymbol{H}_{b}^{T} \boldsymbol{H}_{r} = \boldsymbol{O},\qquad \boldsymbol{H}_{r}^{T} \boldsymbol{H}_{r} = \boldsymbol{I}_{n - m} $$
(8)

Accordingly, the corresponding mean-shift outlier model reads

$$ E(\boldsymbol{L}) = \boldsymbol{AX} + \boldsymbol{H}_{b}\boldsymbol{\nabla}\quad \mbox{with }\operatorname {Cov}(\boldsymbol{L}) = \sigma _{0}^{2} \boldsymbol{P}^{ - 1}, $$
(9)

in which \((\boldsymbol{A},\boldsymbol{H}_{b})\) is of full column rank.

Based on the LS principle, one can obtain the following normal equation:

$$ \left ( \begin{array}{c@{\quad}c} \boldsymbol{A}^{T}\boldsymbol{PA} & \boldsymbol{A}^{T}\boldsymbol{PH}_{b} \\ \boldsymbol{H}_{b}^{T}\boldsymbol{PA} & \boldsymbol{H}_{b}^{T}\boldsymbol{PH}_{b} \end{array} \right )\left ( \begin{array}{c} \hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} \\ \hat{\boldsymbol{\nabla}} \end{array} \right ) = \left ( \begin{array}{c} \boldsymbol{A}^{T}\boldsymbol{PL} \\ \boldsymbol{H}_{b}^{T}\boldsymbol{PL} \end{array} \right ) $$
(10)

Solving Eq. (10) and introducing

$$ \boldsymbol{R}_{\boldsymbol{H}_{b}} = \boldsymbol{I}_{n} - \boldsymbol{H}_{b}\bigl(\boldsymbol{H}_{b}^{T} \boldsymbol{PH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T} \boldsymbol{P} $$
(11)

we have

$$ \left \{ \begin{array}{l} \hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} = \bigl(\boldsymbol{A}^{T} \cdot \boldsymbol{PR}_{\boldsymbol{H}_{b}} \cdot \boldsymbol{A}\bigr)^{ - 1}\boldsymbol{A}^{T} \cdot \boldsymbol{PR}_{\boldsymbol{H}_{b}} \cdot \boldsymbol{L} \\[6pt] \hat{\boldsymbol{\nabla}} = \bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{P}(\boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} ) \end{array} \right . $$
(12)

It can be verified that \(\boldsymbol{R}_{\boldsymbol{H}_{b}}\) is idempotent and has the following useful properties

$$ \boldsymbol{R}_{\boldsymbol{H}_{b}}^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}} = \boldsymbol{PR}_{\boldsymbol{H}_{b}} = \boldsymbol{R}_{\boldsymbol{H}_{b}}^{T} \boldsymbol{P},\qquad \boldsymbol{R}_{\boldsymbol{H}_{b}}\boldsymbol{H}_{b} = \boldsymbol{O},\qquad \boldsymbol{H}_{b}^{T} \boldsymbol{PR}_{\boldsymbol{H}_{b}} = \boldsymbol{O} $$
(13)

The corresponding residual vector is

$$ \boldsymbol{V}_{\boldsymbol{\nabla}} = \boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} - \boldsymbol{H}_{b}\hat{\boldsymbol{\nabla}} = \boldsymbol{R}_{\boldsymbol{H}_{b}}(\boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} ) $$
(14)

and thus

$$ \hat{\sigma} _{\boldsymbol{\nabla}} ^{2} = \frac{\varOmega _{\boldsymbol{\nabla}}}{n - (m + u)} $$
(15)

with

$$ \varOmega _{\boldsymbol{\nabla}} = \boldsymbol{V}_{\boldsymbol{\nabla}} ^{T} \boldsymbol{PV}_{\boldsymbol{\nabla}} = (\boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} )^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}}( \boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} ) $$
(16)
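
The following sketch illustrates Eqs. (11)-(16) on the same kind of synthetic problem (the index set idx_b of suspected outliers and all matrices are hypothetical choices for the sketch); it forms \(\boldsymbol{R}_{\boldsymbol{H}_{b}}\), solves Eq. (12) and verifies the properties in Eq. (13):

```python
import numpy as np

rng = np.random.default_rng(42)
n, u, m = 8, 3, 2                 # observations, unknowns, suspected outliers
A = rng.normal(size=(n, u))
P = np.diag(rng.uniform(0.5, 2.0, n))
L = rng.normal(size=n)

idx_b = [1, 4]                    # indices i_1, ..., i_m of the suspected outliers
H_b = np.eye(n)[:, idx_b]         # Eq. (6): selected columns of the identity matrix

# Eq. (11)
R_Hb = np.eye(n) - H_b @ np.linalg.inv(H_b.T @ P @ H_b) @ H_b.T @ P

# Eq. (12): estimates of the unknowns and of the shift parameters
X_shift = np.linalg.solve(A.T @ P @ R_Hb @ A, A.T @ P @ R_Hb @ L)
grad = np.linalg.solve(H_b.T @ P @ H_b, H_b.T @ P @ (L - A @ X_shift))

# Eq. (13): R_Hb is idempotent and annihilates H_b
assert np.allclose(R_Hb @ R_Hb, R_Hb)
assert np.allclose(R_Hb @ H_b, 0)

V_shift = R_Hb @ (L - A @ X_shift)            # Eq. (14)
Omega_shift = V_shift @ P @ V_shift           # Eq. (16)
sigma2_shift = Omega_shift / (n - (m + u))    # Eq. (15)
```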

3.2 Multiple case-deletion model

Under the same condition, the multiple case-deletion model reads

$$ E\bigl(\boldsymbol{H}_{r}^{T}\boldsymbol{L}\bigr) = \boldsymbol{H}_{r}^{T}\boldsymbol{AX}\quad \mbox{with}\ \operatorname {Cov}\bigl(\boldsymbol{H}_{r}^{T}\boldsymbol{L}\bigr) = \sigma _{0}^{2}\boldsymbol{H}_{r}^{T} \boldsymbol{P}^{ -1}\boldsymbol{H}_{r}, $$
(17)

with which one can obtain the LS estimator as follows

$$ \hat{\boldsymbol{X}}_{r} = \bigl(\boldsymbol{A}^{T} \boldsymbol{H}_{r} \cdot \boldsymbol{P}_{r} \cdot \boldsymbol{H}_{r}^{T}\boldsymbol{A}\bigr)^{ - 1} \boldsymbol{A}^{T}\boldsymbol{H}_{r} \cdot \boldsymbol{P}_{r} \cdot \boldsymbol{H}_{r}^{T} \boldsymbol{L} $$
(18)

where

$$ \boldsymbol{P}_{r} = \bigl(\boldsymbol{H}_{r}^{T} \boldsymbol{P}^{ - 1}\boldsymbol{H}_{r}\bigr)^{ - 1} $$
(19)

The permutation matrix \((\boldsymbol{H}_{b},\boldsymbol{H}_{r})\) is invertible. Therefore, one can obtain

$$ \boldsymbol{P}^{ - 1} = (\boldsymbol{H}_{b}, \boldsymbol{H}_{r})\bigl[(\boldsymbol{H}_{b}, \boldsymbol{H}_{r})^{T}\boldsymbol{P}(\boldsymbol{H}_{b}, \boldsymbol{H}_{r})\bigr]^{ - 1}(\boldsymbol{H}_{b}, \boldsymbol{H}_{r})^{T} $$
(20)

which in combination with Eq. (8) yields

$$ \boldsymbol{P}_{r} = \boldsymbol{H}_{r}^{T}\boldsymbol{PH}_{r} - \boldsymbol{H}_{r}^{T}\boldsymbol{PH}_{b}\bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PH}_{r} = \boldsymbol{H}_{r}^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}}\boldsymbol{H}_{r} $$
(21)

By virtue of Eqs. (7), (13), (19) and (21), we have

$$ \boldsymbol{H}_{r}\boldsymbol{P}_{r}\boldsymbol{H}_{r}^{T} = \boldsymbol{PR}_{\boldsymbol{H}_{b}} $$
(22)

It follows that

$$ \hat{\boldsymbol{X}}_{r} = \hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} $$
(23)

The weighted sum of squares of the LS residuals in this multiple case-deletion model reads

$$ \varOmega _{r} = \bigl(\boldsymbol{H}_{r}^{T}\boldsymbol{L} - \boldsymbol{H}_{r}^{T}\boldsymbol{A}\hat{\boldsymbol{X}}_{r}\bigr)^{T}\boldsymbol{P}_{r}\bigl(\boldsymbol{H}_{r}^{T}\boldsymbol{L} - \boldsymbol{H}_{r}^{T}\boldsymbol{A}\hat{\boldsymbol{X}}_{r}\bigr) = (\boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} )^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}}(\boldsymbol{L} - \boldsymbol{A}\hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} ) = \varOmega _{\boldsymbol{\nabla}} $$

and thus

$$ \hat{\sigma} _{r}^{2} = \frac{\varOmega _{r}}{(n - m) - u} = \hat{\sigma} _{\boldsymbol{\nabla}} ^{2} $$
(24)

It can be seen from Eqs. (23) and (24) that the mean-shift outlier model is equivalent to the multiple case-deletion model. In other words, the adjustment outputs are identical no matter whether the (potential) outliers are deleted explicitly or implicitly, even when the removed observations are correlated with the remaining ones.
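
The equivalence is easy to verify numerically. The sketch below (again with hypothetical A, L and a deliberately non-diagonal weight matrix P, so that the deleted observations are correlated with the remaining ones) checks Eqs. (22)-(24) under these assumed inputs:

```python
import numpy as np

rng = np.random.default_rng(7)
n, u, m = 10, 3, 2
A = rng.normal(size=(n, u))
B = rng.normal(size=(n, n))
P = B @ B.T + n * np.eye(n)       # non-diagonal SPD weights: correlated case
L = rng.normal(size=n)

idx_b = [0, 5]                    # indices of the deleted observations
I_n = np.eye(n)
H_b, H_r = I_n[:, idx_b], np.delete(I_n, idx_b, axis=1)

# Mean-shift estimate, Eqs. (11)-(12)
R_Hb = I_n - H_b @ np.linalg.inv(H_b.T @ P @ H_b) @ H_b.T @ P
X_shift = np.linalg.solve(A.T @ P @ R_Hb @ A, A.T @ P @ R_Hb @ L)

# Case-deletion estimate, Eqs. (17)-(19)
P_r = np.linalg.inv(H_r.T @ np.linalg.inv(P) @ H_r)
A_r, L_r = H_r.T @ A, H_r.T @ L
X_del = np.linalg.solve(A_r.T @ P_r @ A_r, A_r.T @ P_r @ L_r)

assert np.allclose(H_r @ P_r @ H_r.T, P @ R_Hb)   # Eq. (22)
assert np.allclose(X_shift, X_del)                # Eq. (23)

# Eq. (24): identical weighted SSR, hence identical variance factor estimates
V_shift = R_Hb @ (L - A @ X_shift)
V_del = L_r - A_r @ X_del
assert np.allclose(V_shift @ P @ V_shift, V_del @ P_r @ V_del)
```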

3.3 Computational consideration

With Eq. (12), one has to deal with two matrix inversions of orders u and m, as opposed to the two matrix inversions of orders u and n − m in Eq. (18). Therefore, Eq. (12) outperforms Eq. (18) in terms of computational efficiency, since in most applications the number of outliers m is small relative to the number of original observations n.

However, the computational burden can be further reduced by taking the partitioned structure of the normal matrix in Eq. (10) into account. In fact, the normal equation (10) can also be solved as

$$ \left \{ \begin{array}{l} \hat{\boldsymbol{\nabla}} = \bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PRL} \\[6pt] \hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} = \bigl(\boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1}\boldsymbol{A}^{T}\boldsymbol{P}(\boldsymbol{L} - \boldsymbol{H}_{b}\hat{\boldsymbol{\nabla}} ) \end{array} \right . $$
(25)

or in more explicit form

$$ \left \{ \begin{array}{l} \hat{\boldsymbol{\nabla}} = \bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PV} \\[6pt] \hat{\boldsymbol{X}}_{\boldsymbol{\nabla}} = \hat{\boldsymbol{X}} - \bigl(\boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1}\boldsymbol{A}^{T}\boldsymbol{PH}_{b}\hat{\boldsymbol{\nabla}} \end{array} \right . $$
(26)

with which we obtain

$$ \boldsymbol{V}_{\boldsymbol{\nabla}} = \boldsymbol{R}(\boldsymbol{L} - \boldsymbol{H}_{b}\hat{\boldsymbol{\nabla}} ) $$
(27)

and

$$ \varOmega _{\boldsymbol{\nabla}} = \boldsymbol{L}^{T}\boldsymbol{PRL} - \boldsymbol{L}^{T}\boldsymbol{PRH}_{b}\bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PRL} = \varOmega - \boldsymbol{V}^{T}\boldsymbol{PH}_{b}\bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PV} $$
(28)

Apparently, in this situation only the extra inversion of the m×m normal matrix \(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\) is required. As a by-product, the estimate of the vector of the disturbance parameters is also obtained with Eq. (26). From the computational point of view, this is a sufficient reason for choosing the mean-shift outlier model over the case-deletion model.
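
A sketch of this computational shortcut follows, reusing the full-model quantities \((\boldsymbol{A}^{T}\boldsymbol{PA})^{-1}\), R and V that are already available from the original adjustment (all inputs again illustrative); it also cross-checks the result against the direct solution of Eq. (12):

```python
import numpy as np

rng = np.random.default_rng(3)
n, u, m = 12, 4, 2
A = rng.normal(size=(n, u))
P = np.diag(rng.uniform(0.5, 2.0, n))
L = rng.normal(size=n)

# Quantities already available from the original adjustment, Eqs. (2)-(3)
N_inv = np.linalg.inv(A.T @ P @ A)
X_hat = N_inv @ A.T @ P @ L
R = np.eye(n) - A @ N_inv @ A.T @ P
V = R @ L

idx_b = [2, 9]                    # suspected outliers (illustrative)
H_b = np.eye(n)[:, idx_b]

# Eq. (26): only an extra m x m inverse is required
M = H_b.T @ P @ R @ H_b
grad = np.linalg.solve(M, H_b.T @ P @ V)
X_shift = X_hat - N_inv @ A.T @ P @ H_b @ grad

V_shift = R @ (L - H_b @ grad)    # Eq. (27)

# Eq. (28): updated weighted SSR without re-forming the reduced model
l_b = H_b.T @ P @ R @ L
Omega_shift = L @ P @ R @ L - l_b @ np.linalg.solve(M, l_b)
assert np.allclose(Omega_shift, V_shift @ P @ V_shift)

# Cross-check against the direct solution of Eq. (12)
R_Hb = np.eye(n) - H_b @ np.linalg.inv(H_b.T @ P @ H_b) @ H_b.T @ P
assert np.allclose(X_shift, np.linalg.solve(A.T @ P @ R_Hb @ A, A.T @ P @ R_Hb @ L))
```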

4 Quality assessment of outlying observations

With the Sherman-Morrison-Woodbury-Schur formula (Strang and Borre 1997), we have

$$ \bigl(\boldsymbol{A}^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}}\boldsymbol{A}\bigr)^{ - 1} = \bigl(\boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1} + \bigl(\boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1}\boldsymbol{A}^{T}\boldsymbol{PH}_{b}\bigl(\boldsymbol{H}_{b}^{T}\boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T}\boldsymbol{PA}\bigl(\boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1} $$
(29)

This formula quantifies the apparent increase in precision when the outlying observations should have been taken into account but were neglected, under the assumption that the a priori variance factor is known (Schaffrin 1997).

Since the second term of Eq. (29) is a positive semi-definite quadratic form, it follows that

$$ \bigl[\bigl(\boldsymbol{A}^{T}\boldsymbol{PR}_{\boldsymbol{H}_{b}} \boldsymbol{A}\bigr)^{ - 1}\bigr]_{ii} \ge \bigl[\bigl( \boldsymbol{A}^{T}\boldsymbol{PA}\bigr)^{ - 1} \bigr]_{ii},\quad i = 1,2, \ldots,u $$
(30)

This inequality shows that all types of DOP metrics (Strang and Borre 1997) are over-optimistic if the outliers are ignored, even when the outlying observations are correlated with the remaining ones.
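
Both Eq. (29) and the inequality Eq. (30) can be confirmed numerically; the following sketch does so with hypothetical inputs, including a non-diagonal weight matrix for the correlated case:

```python
import numpy as np

rng = np.random.default_rng(11)
n, u, m = 10, 3, 2
A = rng.normal(size=(n, u))
B = rng.normal(size=(n, n))
P = B @ B.T + n * np.eye(n)       # non-diagonal SPD weights: correlated case

idx_b = [4, 7]
H_b = np.eye(n)[:, idx_b]

N_inv = np.linalg.inv(A.T @ P @ A)
R = np.eye(n) - A @ N_inv @ A.T @ P
R_Hb = np.eye(n) - H_b @ np.linalg.inv(H_b.T @ P @ H_b) @ H_b.T @ P

# Eq. (29): Sherman-Morrison-Woodbury-Schur expansion
lhs = np.linalg.inv(A.T @ P @ R_Hb @ A)
Q = N_inv @ A.T @ P @ H_b
rhs = N_inv + Q @ np.linalg.inv(H_b.T @ P @ R @ H_b) @ Q.T
assert np.allclose(lhs, rhs)

# Eq. (30): every diagonal entry of the cofactor matrix can only grow,
# so DOP values computed while ignoring the outliers are over-optimistic
assert np.all(np.diag(lhs) >= np.diag(N_inv) - 1e-12)
```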

After some matrix manipulation, it follows that

$$ \varOmega _{r} = \boldsymbol{L}^{T}\boldsymbol{H}_{r} \cdot \boldsymbol{P}_{r}\boldsymbol{R}_{r} \cdot \boldsymbol{H}_{r}^{T}\boldsymbol{L} $$
(31)

where \(\boldsymbol{R}_{r} = \boldsymbol{I}_{n - m} - \boldsymbol{H}_{r}^{T}\boldsymbol{A} \cdot (\boldsymbol{A}^{T}\boldsymbol{H}_{r}\boldsymbol{P}_{r}\boldsymbol{H}_{r}^{T}\boldsymbol{A})^{ - 1} \cdot \boldsymbol{A}^{T}\boldsymbol{H}_{r} \cdot \boldsymbol{P}_{r}\) denotes the redundancy matrix of the reduced model.

By virtue of Eqs. (28) and (31), and since the two quadratic forms \(\varOmega _{\boldsymbol{\nabla}}\) and \(\varOmega _{r}\) are equal for any realization of the random observational vector L, we have

$$ \boldsymbol{H}_{r} \cdot \boldsymbol{P}_{r} \boldsymbol{R}_{r} \cdot \boldsymbol{H}_{r}^{T} = \boldsymbol{PR} - \boldsymbol{PRH}_{b}\bigl(\boldsymbol{H}_{b}^{T} \boldsymbol{PRH}_{b}\bigr)^{ - 1}\boldsymbol{H}_{b}^{T} \boldsymbol{PR} $$
(32)

Obviously, the kth observation in the multiple case-deletion model is just the \(i_{m+k}\)th one in the original linear Gauss–Markov model. Consequently, we get

$$ \boldsymbol{H}_{r}^{T}\boldsymbol{h}_{i_{m + k}} = \tilde{\boldsymbol{h}}_{k} $$
(33)

where \(\tilde{\boldsymbol{h}}_{k}\) denotes the kth (n − m)-dimensional canonical unit vector, with a 1 as its kth entry and zeros otherwise.

Baarda's w-test statistic for the kth observation in the multiple case-deletion model reads (Baarda 1968)

$$ \tilde{w}_{k} = \frac{\tilde{\boldsymbol{h}}_{k}^{T}\boldsymbol{P}_{r} \boldsymbol{R}_{r}\boldsymbol{H}_{r}^{T}\boldsymbol{L}}{\sigma _{0}\sqrt{\tilde{\boldsymbol{h}}_{k}^{T}\boldsymbol{P}_{r}\boldsymbol{R}_{r}\tilde{\boldsymbol{h}}_{k}}} \sim \mathcal{N}(0, 1) $$
(34)

The corresponding MDB measure is given by

$$ \sigma _{0}\sqrt{\frac{\lambda _{0}}{\tilde{\boldsymbol{h}}_{k}^{T}\boldsymbol{P}_{r}\boldsymbol{R}_{r}\tilde{\boldsymbol{h}}_{k}}} = \sigma _{0} \sqrt{\frac{\lambda _{0}}{\boldsymbol{h}_{i_{m + k}}^{T}\boldsymbol{H}_{r}\boldsymbol{P}_{r}\boldsymbol{R}_{r}\boldsymbol{H}_{r}^{T}\boldsymbol{h}_{i_{m + k}}}} $$
(35)

which in combination with Eq. (32) yields

$$ \sigma _{0}\sqrt{\frac{\lambda _{0}}{\tilde{\boldsymbol{h}}_{k}^{T}\boldsymbol{P}_{r}\boldsymbol{R}_{r}\tilde{\boldsymbol{h}}_{k}}} \ge \sigma _{0} \sqrt{\frac{\lambda _{0}}{\boldsymbol{h}_{i_{m + k}}^{T}\boldsymbol{PRh}_{i_{m + k}}}} $$
(36)

This indicates that the MDB measures of all the remaining observations become larger once the outlying observations are deleted; equivalently, the MDBs computed while neglecting the outliers are over-optimistic.
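
The w-test statistics of Eq. (34) and the MDB comparison of Eqs. (35)-(36) can likewise be evaluated numerically. In the sketch below, the non-centrality parameter lam0 = 17.07 is only an illustrative choice (a value commonly associated with α = 0.001 and 80 % power), and all other inputs are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(5)
n, u, m = 10, 3, 2
A = rng.normal(size=(n, u))
P = np.diag(rng.uniform(0.5, 2.0, n))
L = rng.normal(size=n)

sigma0 = 1.0                      # a priori standard deviation of unit weight
lam0 = 17.07                      # illustrative non-centrality parameter lambda_0

idx_b = [0, 3]                    # indices of the deleted (outlying) observations
I_n = np.eye(n)
H_b, H_r = I_n[:, idx_b], np.delete(I_n, idx_b, axis=1)

# Full-model matrix P R; reduced-model P_r and R_r (Eqs. 19 and 31)
N_inv = np.linalg.inv(A.T @ P @ A)
PR = P @ (I_n - A @ N_inv @ A.T @ P)
P_r = np.linalg.inv(H_r.T @ np.linalg.inv(P) @ H_r)
A_r = H_r.T @ A
PrRr = P_r @ (np.eye(n - m) - A_r @ np.linalg.inv(A_r.T @ P_r @ A_r) @ A_r.T @ P_r)

# Eq. (34): w-test statistics of the remaining observations
w = (PrRr @ H_r.T @ L) / (sigma0 * np.sqrt(np.diag(PrRr)))

# Eqs. (35)-(36): each reduced-model MDB is at least the full-model one
remaining = [j for j in range(n) if j not in idx_b]
for k, i in enumerate(remaining):
    mdb_reduced = sigma0 * np.sqrt(lam0 / PrRr[k, k])
    mdb_full = sigma0 * np.sqrt(lam0 / PR[i, i])
    assert mdb_reduced >= mdb_full - 1e-12
```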

5 Conclusions

Both the case-deletion model and the mean-shift outlier model can be employed to perform multiple-deletion diagnostics for linear models. The advantage of the case-deletion model is its intuitive appeal, since the suspicious observations are removed explicitly. The mean-shift outlier model, in which the underlying observations are deleted implicitly, has found wider acceptance because of its computational simplicity. Nevertheless, the two models are mathematically equivalent. Under the assumption that the a priori variance factor is known, theoretical analyses indicate that the precision, the MDB measures and all kinds of DOP metrics are over-optimistic when outliers are neglected.