1 Introduction

Geocenter motion is usually defined, with varying sign conventions, as the relative motion between the center of mass of the total Earth system (CM) and the center of figure of the solid Earth surface (CF). Its geophysical cause is the redistribution of masses within the Earth system, from daily and sub-daily periods (e.g. ocean tides) to secular time scales (e.g. post-glacial rebound, present-day ice melting) via seasonal and inter-annual periods (e.g. water mass exchanges). As Earth satellites orbit around CM, geocenter motion affects the measurements of surface processes made by geodetic satellites. An accurate determination of geocenter motion is therefore required for the most demanding geodetic applications. While the tidal part of geocenter motion is part of the IERS Conventions (Petit and Luzum 2010), the accurate determination of its non-tidal part remains challenging. We refer the reader to Wu et al. (2012) for a detailed description of geocenter motion and of its implications.

There currently exists two conceptual approaches for monitoring geocenter motion. The first one, often called network shift approach, or translational approach, relies on satellite tracking data. Since Earth satellites orbit CM, global sets of station-satellite range measurements for given periods (e.g. days, weeks or months) can be used to infer the tracking station coordinates in theoretically CM-centered frames. The origin of the International Terrestrial Reference Frame (ITRF) is obtained through a long-term stacking of such CM-centered frames derived from Satellite Laser Ranging (SLR) measurements and therefore theoretically follows CM on the long term. On the other hand, the translations between quasi-instantaneous CM-centered frames and the ITRF should reflect the non-secular part of geocenter motion (Dong et al. 2003; Collilieux et al. 2009).

The second approach, often called deformation approach, or degree-1 approach, was first proposed by Blewitt et al. (2001). It is based on the fact that geocenter motions due to surface mass redistributions are accompanied by deformations of the Earth’s crust. Such deformations can be accurately monitored using global networks of GNSS stations, which allows an indirect observation of geocenter motion. The deformation approach can however only sense non-secular geocenter motions induced by surface load variations and can therefore not help in defining the ITRF origin.

This article entirely focuses on the network shift approach. As it is the only approach capable of sensing secular geocenter motion, its performance is critical for the observation of long-term geophysical processes like global sea level rise, post-glacial rebound or present-day ice melting. Any inaccuracy in the long-term following of CM by the ITRF origin does in particular directly translate into errors in the estimation of global sea-level rise by satellite altimetry (Morel and Willis 2005; Beckley et al. 2007) or by global tide-gauge records and GPS (Collilieux and Wöppelmann 2011). While a stability of 0.1 mm/year is aimed for the ITRF origin, external evaluations of the origin of ITRF2008 (Altamimi et al. 2011) assessed its stability to a level of 0.5 mm/year (Wu et al. 2011; Collilieux 2013), or even 0.9 mm/year along the \(Z\) axis (Argus 2012). The performance of the network shift approach therefore still needs to be improved.

There currently exists three satellite tracking techniques theoretically capable of sensing geocenter motion through the network shift approach: SLR, DORIS and GNSS. However, only one of them, SLR, gives relatively satisfying results and contributes to the definition of the ITRF origin. DORIS-derived geocenter time series recently benefited from tremendous improvements, but remain much noisier than SLR-derived series, especially in the \(Z\) component (Gobinddass et al. 2009a, b). Finally, the GNSS technique, although benefits from the largest satellite constellation and tracking network, has so far not proven able to reliably observe geocenter motion through the network shift approach.

Collilieux et al. (2011), indeed, analyzed the translation time series between reprocessed weekly solutions provided by four Analysis Centers (ACs) of the International GNSS Service (IGS; Dow et al. 2009) and a preliminary version of ITRF2008. Significant long-term biases were noted with respect to the SLR translations, as well as AC-dependent trends in the \(Z\) component. Moreover, the annual signals present in the \(X\) and \(Z\) components of the IGS AC translation time series were in poor agreement with SLR. A frequency analysis of the IGS AC translation time series finally revealed that all were contaminated by spurious periodic signals at harmonics of the GPS draconitic year (351.5 days; Ray et al. 2008) with amplitudes reaching several millimetres. The question underlying this article is therefore: Why are current GNSS, despite their numerous satellites and dense tracking networks, unable to reliably observe geocenter motion through the network shift approach?

This question was recently addressed through orbital perturbation considerations by Meindl et al. (2013). The same question is addressed in this article from the perspective of collinearity (also known as multicollinearity). Collinearity in a least-squares regression can be defined as the existence of quasi-linear dependencies among the parameters. Its consequence is that some linear combinations of parameters are extremely sensitive to observation and model errors and may therefore not be reliably inferred from the observations. This article demonstrates that the inability of GNSS to reliably observe geocenter motion is precisely due to such a collinearity problem.

Section 2 gives a brief introduction to the least-squares regression method, to the issue of collinearity among least-squares parameters and to the notion of variance inflation factor. The collinearity diagnosis at the basis of the article is then presented in its simplest form. In Sect. 3, this collinearity diagnosis is generalized so as to handle several peculiarities of the GNSS geocenter determination problem. A practical method for performing collinearity diagnoses in global GNSS analyses is then given. Section 4 gives a concrete example of our generalized collinearity diagnosis, with its application to the well-known quasi-singular problem of determining the terrestrial scale simultaneously with the GNSS satellite phase center offsets. Section 5 describes in detail the signatures of the geocenter coordinates on satellite tracking observations. Finally, our collinearity diagnosis is respectively applied in Sects. 6 and 7 to the geocenter determination with SLR and GNSS.

2 Mathematical background

This section introduces the basic mathematical notions used throughout the article. After a brief introduction to least-squares regression and to the issue of collinearity in least-squares regression, the collinearity diagnosis at the basis of the article is then presented in its simplest form.

2.1 Least-squares regression

Consider the linear regression model \({\varvec{l}} = {\varvec{Ax}} + {\varvec{v}}\), where:

  • \({\varvec{l}}\) is a vector of \(n\) observations with given covariance matrix \({\varvec{Q_l}} = {\varvec{P_l}}^{-1}\);

  • \({\varvec{x}}\) a vector of \(p\) unknown parameters;

  • \({\varvec{A}} = \partial {\varvec{l}}/\partial {\varvec{x}}\) is the so-called design matrix (the \({i}\)th column of \({\varvec{A}}\) will be denoted \({\varvec{A}}_i\) in the following);

  • \({\varvec{Ax}}\) is the vector of observations predicted by the parameters \({\varvec{x}}\) (model prediction);

  • \({\varvec{v}}\) is a vector of unknown observational and model errors (residuals).

Specifying a weight matrix \({\varvec{P_l}}\) for the observations corresponds to the choice of a particular metric in the observation space \({\mathbb {R}}^n\), i.e. of an inner product \(\langle {\varvec{l}},{\varvec{l}}' \rangle = {\varvec{l}}^\mathrm{T}{\varvec{P_l}}{\varvec{l}}'\) and of the associated norm \(\Vert {\varvec{l}}\Vert = \sqrt{{\varvec{l}}^\mathrm{T}{\varvec{P_l}}{\varvec{l}}}\). Let \(E\) denote the Hilbert observation space \({\mathbb {R}}^n\) equipped with this metric. The column space of the design matrix, \(\text {Im}({\varvec{A}}) = \left\{ {\varvec{Ax}}, {\varvec{x}} \in {\mathbb {R}}^p\right\} \), is a subspace of \(E\) which spans all possible model predictions. It will therefore be called the model space.

The objective of a least-squares regression is to minimize the norm \(\Vert {\varvec{v}}\Vert = \Vert {\varvec{l}}-{\varvec{Ax}}\Vert \) of the residual vector. It is achieved when the vector of predicted observations \({\varvec{Ax}}\) is the orthogonal projection of the observation vector \({\varvec{l}}\) onto the model space \(\text {Im}({\varvec{A}})\). This condition leads to the well-known normal equation:

$$\begin{aligned} {\varvec{Nx}} = {\varvec{b}} \end{aligned}$$
(1)

where \({\varvec{N}} = {\varvec{A}}^\mathrm{T}{\varvec{P_l}}{\varvec{A}}\) is called the normal matrix and \({\varvec{b}} = {\varvec{A}}^\mathrm{T}{\varvec{P_l}}{\varvec{l}}\). When \({\varvec{A}}\) is of full rank, \({\varvec{N}}\) is invertible and the normal equation has a unique solution:

$$\begin{aligned} \hat{{\varvec{{x}}}} = {\varvec{N}}^{-1}{\varvec{b}} = ({\varvec{A}}^\mathrm{T}{\varvec{P_l}}{\varvec{A}})^{-1}{\varvec{A}}^\mathrm{T}{\varvec{P_l}}{\varvec{l}} \end{aligned}$$
(2)

The covariance matrix of the estimated parameters is in this case the inverse of the normal matrix: \({\varvec{Q}} = {\varvec{N}}^{-1}\). The goodness of fit can be measured by the coefficient of determination:

$$\begin{aligned} R^2 = \frac{\Vert {\varvec{A}}\hat{{\varvec{x}}}\Vert ^2}{\Vert {\varvec{l}}\Vert ^2} = \cos ^2\theta \end{aligned}$$
(3)

where \(\theta \) is the angle between the observation vector \({\varvec{l}}\) and the model space \(\text {Im}({\varvec{A}})\).

2.2 Conditioning and parameter scaling

When \({\varvec{A}}\) is of full rank but \({\varvec{N}}\) is however close to singularity, small perturbations in the observations may cause relatively large variations in the estimated parameters: the least-squares regression is ill conditioned. The closeness of \({\varvec{N}}\) to singularity can be measured by its condition number \(\kappa ({\varvec{N}})\), i.e. the ratio of its largest to its smallest singular value.

Because \({\varvec{N}}\) is symmetric, the ideal case \(\kappa ({\varvec{N}}) = 1\) is achieved only when:

$$\begin{aligned} {\varvec{N}} = k{\varvec{I}} \; \Leftrightarrow \; \left\{ \begin{array}{ll} \forall i, &{} {\varvec{N}}_{i,i} = \Vert {\varvec{A}}_i\Vert ^2 = k \\ \forall i\ne j, &{} {\varvec{N}}_{i,j} = \langle {\varvec{A}}_i,{\varvec{A}}_j \rangle = 0 \end{array} \right. \end{aligned}$$
(4)

i.e. when the columns of \({\varvec{A}}\) form an orthogonal basis of the model space \(\text {Im}({\varvec{A}})\) and all have the same norm. \({\varvec{N}}\) can thus be ill conditioned for two non-exclusive reasons: either because the columns of \({\varvec{A}}\) have disparate norms (scaling problem), or because they are far from being orthogonal to each other (collinearity problem).

The first possible reason for ill conditioning can be circumvented by solving for an auxiliary set of scaled parameters \(\tilde{{\varvec{x}}}\) such that \(\tilde{{\varvec{x}}}_i = \Vert {\varvec{A}}_i\Vert {\varvec{x}}_i = \sqrt{{\varvec{N}}_{i,i}}{\varvec{x}}_i\). In matrix notations, \(\tilde{{\varvec{x}}} = \tilde{{\varvec{D}}}^{-1}{\varvec{x}}\) where \(\tilde{{\varvec{D}}}\) denotes the diagonal matrix with elements \(\tilde{{\varvec{D}}}_{i,i} = 1 / \sqrt{{\varvec{N}}_{i,i}}\). This leads to the design matrix \(\tilde{{\varvec{A}}} = {\varvec{A}}\tilde{{\varvec{D}}}\) whose columns all have unit norms, and to the scaled normal equation \(\tilde{{\varvec{N}}}\tilde{{\varvec{x}}} = \tilde{{\varvec{b}}}\), where \(\tilde{{\varvec{N}}} = \tilde{{\varvec{D}}}{\varvec{N}}\tilde{{\varvec{D}}}\) has the form of a correlation matrix and \(\tilde{{\varvec{b}}} = \tilde{{\varvec{D}}}{\varvec{b}}\).

In terms of condition number, this particular scaling of the parameters may not be optimal: there may exist other matrices \({\varvec{D}}\) in the set \({\mathcal {D}}_p\) of all non-singular \(p \times p\) diagonal matrices such that \(\kappa ({\varvec{DND}}) < \kappa (\tilde{{\varvec{N}}})\). Nevertheless, a theorem by van der Sluis (1969) states that:

$$\begin{aligned} \kappa (\tilde{{\varvec{N}}}) \le p\,\underset{{\varvec{D}} \in {\mathcal {D}}_p}{\mathrm{min }}\,\kappa ({\varvec{DND}}) \end{aligned}$$
(5)

i.e. even if the scaling by \(\tilde{{\varvec{D}}}\) is not optimal, it is not far from being optimal. But the main justification for this particular scaling is that it isolates the second possible source for ill conditioning: collinearity.

2.3 Collinearity

When the design matrix \({\varvec{A}}\) is rank deficient, there exists one or more linear dependencies among its columns. One or more linear combinations of the parameters are not observable, i.e. cannot be inferred from the observations. The normal matrix \({\varvec{N}}\) is consequently singular. We will refer to this situation, well known in geodetic problems, as perfect collinearity.

More subtly, the columns of \({\varvec{A}}\) may be, not exactly, but nearly linearly dependent, meaning that some linear combinations of the parameters are only hardly observable. This situation, namely collinearity, or multicollinearity, translates into a large condition number for the scaled normal matrix \(\tilde{{\varvec{N}}}\). Geometrically, collinearity can be defined as a severe departure of the basis formed by the columns of \(\tilde{{\varvec{A}}}\) from orthogonality (Farrar and Glauber 1967), or as the closeness of this basis to singularity (Haitovsky 1969).

The harm caused by collinearity to the estimability of specific linear combinations of parameters is described as follows by Belsley et al. (1980): “The essential harm due to collinearity arises from the fact that a collinear relation can readily result in a situation in which some of the observed systematic influence of the explanatory variables on the response variable is swamped by the models random error term—or in the familiar terminology of electrical engineering, the signal is swamped by the noise. It is intuitively clear that, under these circumstances, estimation can be hindered.”

Belsley et al. (1980) (see also Belsley (1991)) also suggest a procedure, based on the singular value decomposition of \(\tilde{{\varvec{N}}}\), to identify near dependencies among the columns of \(\tilde{{\varvec{A}}}\) and the parameters involved in them. The application of Belsley’s procedure to global GNSS solutions would certainly be rich of lessons. However, because a parameter is generally involved, at different levels, in all eigenvectors of \(\tilde{{\varvec{N}}}\), Belsley’s procedure does not appear to us as the most appropriate tool to understand why a specific parameter is affected by collinearity, i.e. the mechanism that makes this parameter hardly observable. As the questions underlying this article are whether and, if so, why the GNSS geocenter determination is harmed by collinearity, we will thus prefer another diagnosis involving the notion of variance inflation factor (VIF).

2.4 Diagnosing the collinearity of a specific parameter

It will be assumed throughout this section that \({\varvec{A}}\) is of full rank, so that its columns form a basis of the model space \(\text {Im}({\varvec{A}})\).

Signature of a parameter Each column \({\varvec{A}}_i = \partial {\varvec{l}} / \partial {\varvec{x}}_i\) of \({\varvec{A}}\) is a vector of the model space, which represents the effect on the observations predicted by the model of a unit variation of the \(\textit{i}\)th parameter \({\varvec{x}}_i\). In the following, \({\varvec{A}}_i\) will be called the signature of \({\varvec{x}}_i\) on the observations.

Decomposition of \({\varvec{A}}_i\) Each parameter \({\varvec{x}}_i\) can also be associated with a particular hyperplane of the model space. Let \({\varvec{e}}_i = [\delta _{i,1}, \delta _{i,2},\ldots , \delta _{i,p}]^\mathrm{T}\) (vector with all zero elements, except a unit \(\textit{i}\)th element) be the coordinates of \({\varvec{x}}_i\) in the canonical basis of the parameter space \({\mathbb {R}}^p\). Let \({\varvec{K}}_i\) be any \(p \times p-1\) matrix whose columns form a basis of \(\text {Ker}({\varvec{e}}_i^\mathrm{T}) = \{ {\varvec{x}} \in {\mathbb {R}}^p / {\varvec{e}}_i^\mathrm{T} {\varvec{x}} = 0 \}\). One could choose for instance:

$$\begin{aligned} K_i = [{\varvec{e}}_1, \ldots , {\varvec{e}}_{i-1}, {\varvec{e}}_{i+1}, \ldots , {\varvec{e}}_p] \end{aligned}$$
(6)

\(\text {Im}({\varvec{A}}{\varvec{K}}_i)\) is the \(p-1\) dimensional subspace (hyperplane) of the model space, which contains the signatures of all parameters except \({\varvec{x}}_i\).

Let us now decompose the signature \({\varvec{A}}_i\) of \({\varvec{x}}_i\) into the sum of its orthogonal projection \({\varvec{A}}_i^u\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_i)\) and of its orthogonal projection \({\varvec{A}}_i^p\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_i)^{\bot }\) (Fig. 1). \({\varvec{A}}_i^u\), as it lies in \(\text {Im}({\varvec{A}}{\varvec{K}}_i)\), is strictly collinear with the signatures of \({\varvec{x}}_1, \ldots , {\varvec{x}}_{i-1}, {\varvec{x}}_{i+1}, \ldots {\varvec{x}}_p\). This part of the signature of \({\varvec{x}}_i\) is consequently indistinguishable from the signatures of the other parameters. The estimation of \({\varvec{x}}_i\) can therefore only rely on \({\varvec{A}}_i^p\) (part of \({\varvec{A}}_i\) which is orthogonal to each of the signatures of \({\varvec{x}}_1, \ldots , {\varvec{x}}_{i-1}, {\varvec{x}}_{i+1}, \ldots {\varvec{x}}_p\)). In the following, \({\varvec{A}}_i^p\) will be called the proper signature of \({\varvec{x}}_i\). It can be expressed as:

$$\begin{aligned} {\varvec{A}}_i^p = \frac{{\varvec{A}}{\varvec{Q}}{\varvec{e}}_i}{{\varvec{Q}}_{i,i}} \end{aligned}$$
(7)

Equation (7) is a particular case of Eq. (19) introduced in Sect. 3.1 and proven in Appendix B (Online Resource).

Fig. 1
figure 1

Decomposition of the signature of \({\varvec{x}}_3\) in a three-parameter problem. \(\text {Im}({\varvec{A}}{\varvec{K}}_3)\) is the plane containing \({\varvec{A}}_1\) and \({\varvec{A}}_2\). \({\varvec{A}}_3^u\) is the orthogonal projection of \({\varvec{A}}_3\) onto this plane. \({\varvec{A}}_3^p\) is the orthogonal projection of \({\varvec{A}}_3\) onto the line orthogonal to this plane. \(\theta _3\) is the angle between \({\varvec{A}}_3\) and this plane

Variance inflation factor Let us associate to each parameter \({\varvec{x}}_i\) the angle \(\theta _i\) between its signature \({\varvec{A}}_i\) and the hyperplane \(\text {Im}({\varvec{A}}{\varvec{K}}_i)\) (Fig. 1). \(R_i^2 = \cos ^2\theta _i\) is the coefficient of determination obtained from the regression of \({\varvec{A}}_i\) on the other columns of \({\varvec{A}}\). The variance inflation factor (VIF) of \({\varvec{x}}_i\) is defined from the angle \(\theta _i\), or from \(R_i^2\), by:

$$\begin{aligned} V_i = \frac{1}{\sin ^2\theta _i} = \frac{1}{1-R_i^2} \end{aligned}$$
(8)

\(V_i = 1\) (\(\theta _i = \pi /2\); \(R_i^2 = 0\)) means that \({\varvec{A}}_i\) is orthogonal to all other columns of \({\varvec{A}}\), i.e. \({\varvec{x}}_i\) is uncorrelated with any other parameter. On the other hand, \(V_i\) tends to infinity (\(\theta _i \rightarrow 0\); \(R_i^2 \rightarrow 1\)) when \({\varvec{A}}_i\) tends to lie in \(\text {Im}({\varvec{A}}{\varvec{K}}_i)\) and \({\varvec{x}}_i\) tends to be unobservable. \(V_i\) thus measures the degree of collinearity of \({\varvec{x}}_i\) with all other parameters.

VIFs are a well-established tool for diagnosing whether the estimation of specific parameters may be harmed by collinearity (see, e.g., Farrar and Glauber 1967; Belsley et al. 1980; Draper and Smith 1998; Montgomery et al. 2012). The term “variance inflation factor”, introduced by Marquardt (1970), derives from the fact that the variance of a parameter \({\varvec{x}}_i\) obeys the relation:

$$\begin{aligned} \text {var}(\hat{{\varvec{x}}}_i) = {\varvec{Q}}_{i,i} = \frac{1}{{\varvec{N}}_{i,i}}.V_i \end{aligned}$$
(9)

The first factor, \(1 / {\varvec{N}}_{i,i}\), would be the variance of \({\varvec{x}}_i\) if all other parameters were held fixed. \(V_i\) is thus the factor by which the presence of the other parameters inflates the variance of \({\varvec{x}}_i\). Equation (9) can be demonstrated by noting that:

$$\begin{aligned} \cos \theta _i = \frac{\langle {\varvec{A}}_i, {\varvec{A}}_i^u \rangle }{\Vert {\varvec{A}}_i\Vert \Vert {\varvec{A}}_i^u\Vert } = \frac{\langle {\varvec{A}}_i, {\varvec{A}}_i-{\varvec{A}}_i^p \rangle }{\Vert {\varvec{A}}_i\Vert \Vert {\varvec{A}}_i-{\varvec{A}}_i^p\Vert } \end{aligned}$$
(10)

Replacing \({\varvec{A}}_i^p\) by its expression in Eq. (7) leads to:

$$\begin{aligned} \cos \theta _i = \sqrt{1-\frac{1}{{\varvec{N}}_{i,i}{\varvec{Q}}_{i,i}}} \end{aligned}$$
(11)

which is equivalent to Eq. (9).

Let us mention another interpretation of VIFs. Using Eq. (7), it can be shown that:

$$\begin{aligned} V_i = \frac{\Vert {\varvec{A}}_i\Vert ^2}{\Vert {\varvec{A}}_i^p\Vert ^2} \end{aligned}$$
(12)

The numerator in Eq. (12) is the squared norm of the signature of \({\varvec{x}}_i\) on the observations, i.e. of the signal on which the estimation of \({\varvec{x}}_i\) would rely if all other parameters were held fixed. On the other hand, the denominator in Eq. (12) is the squared norm of the proper signature of \({\varvec{x}}_i\), i.e. of the signal on which the estimation of \({\varvec{x}}_i\) actually relies. \(V_i\) thus measures how the magnitude of the signal on which the estimation of \({\varvec{x}}_i\) relies is reduced because of the estimation of other parameters.

When analysing VIFs, one inevitably runs into the questions of what are large and what are small VIFs. Numerical thresholds are given in the literature, commonly seen values being 5 and 10 (see, e.g., Montgomery et al. 2012). But they are essentially arbitrary and probably not suited to problems with thousands of parameters, like global GNSS analyses. Note, finally, that because of their quotient nature, VIFs should be considered as varying on a logarithmic scale rather than on a linear scale.

For the ease of interpretation, numerical VIF values will always be accompanied, in the rest of the article, by bracketed percentage values corresponding to:

$$\begin{aligned} \cos \theta _i = R_i = \sqrt{1-\frac{1}{V_i}} \end{aligned}$$
(13)

By its nature, this quantity is indeed interpretable in the same way as a correlation coefficient.

Coordinates of \({\varvec{A}}_i^u\). When analysing the collinearity of \({\varvec{x}}_i\) with the other parameters, not only the values of \(\theta _i\), \(R_i^2\) and \(V_i\) are of interest, but also the coordinates of \({\varvec{A}}_i^u\) in the basis formed by the columns of \({\varvec{A}}\). If \({\varvec{x}}_i\) has a large VIF (small \(\theta _i\)), \({\varvec{A}}_i^u\) is indeed a linear combination of \({\varvec{A}}_1, \ldots , {\varvec{A}}_{i-1}, {\varvec{A}}_{i+1}, \ldots {\varvec{A}}_p\) which closely matches \({\varvec{A}}_i\). In other words, the coordinates of \({\varvec{A}}_i^u\) in the basis formed by the columns of \({\varvec{A}}\) are the coefficients of a linear combination of \({\varvec{x}}_1, \ldots , {\varvec{x}}_{i-1}, {\varvec{x}}_{i+1}, \ldots {\varvec{x}}_p\) which is almost indistinguishable from \({\varvec{x}}_i\). The coordinates of \({\varvec{A}}_i^u\) are thus the answer to the question of why \({\varvec{x}}_i\) may be hardly observable.

According to Eq. (8),

$$\begin{aligned} {\varvec{A}}_i^u = {\varvec{A}}_i - {\varvec{A}}_i^p = {\varvec{A}}\left( {\varvec{e}}_i - \frac{{\varvec{Q}}{\varvec{e}}_i}{{\varvec{Q}}_{i,i}}\right) \end{aligned}$$
(14)

so that the coordinates \({{\varvec{\gamma }}}_i\) of \({\varvec{A}}_i^u\) in the basis formed by the columns of \({\varvec{A}}\) are:

$$\begin{aligned} {{\varvec{\gamma }}}_i&= {\varvec{e}}_i - \frac{{\varvec{Q}}{\varvec{e}}_i}{{\varvec{Q}}_{i,i}} \nonumber \\&= -\left[ \frac{{\varvec{Q}}_{1,i}}{{\varvec{Q}}_{i,i}}, \ldots , \frac{{\varvec{Q}}_{i-1,i}}{{\varvec{Q}}_{i,i}}, 0, \frac{{\varvec{Q}}_{i+1,i}}{{\varvec{Q}}_{i,i}}, \ldots , \frac{{\varvec{Q}}_{p,i}}{{\varvec{Q}}_{i,i}} \right] ^\mathrm{T} \end{aligned}$$
(15)

3 Adaptations to the GNSS geocenter determination problem

Before the collinearity diagnosis exposed in Sect. 2.4 can be applied to the GNSS geocenter determination problem, some adaptations and further developments are first needed due to several peculiarities of global GNSS analyses.

The first difficulty comes from the fact that geocenter coordinates are usually not estimated as explicit parameters in GNSS solutions, for they would be fully redundant with station coordinates. Geocenter coordinates are rather implicitly realized through station coordinates. Section 3.1 will therefore propose a generalization of the notions introduced in Sect. 2.4 to such “implicit parameters”.

Secondly, unlike that assumed in the end of Sect. 2, it is well-known that the design matrix of a global GNSS problem is not of full rank (perfect collinearity). Rotations of the terrestrial frame can indeed be compensated by variations of the Earth orientation parameters (EOPs), so that \({\varvec{A}}\) has three orientation singularities. A fourth singularity stems from the correlation of the UT1-UTC offset with the longitudes of the orbit ascending nodes. In the presence of such singularities, straightforwardly computed VIFs would be infinite and useless for many parameters (including geocenter coordinates). Section 3.2 will therefore propose a way of performing meaningful collinearity diagnoses when \({\varvec{A}}\) is not of full rank.

Lastly, a practical difficulty arises from the large number of parameters in global GNSS problems. A typical daily GPS solution with 100 stations, 30 satellites, 5-min sampled undifferenced observations and all phase cycle ambiguities fixed indeed includes nearly 41,000 parameters, 92 % of which are epoch-wise estimated satellite and station clock offsets. These clock parameters are usually either reduced from the normal equation, or annihilated by the use of double-differenced observations, both methods being equivalent under certain conditions (Appendix F, Online Resource). The practical question answered in Sect. 3.3 is then: How can collinearity diagnoses be made without disposing of the whole normal matrix?

Another question one could legitimately ask is whether the geocenter determination and the potentially related collinearity issues are equivalent in the undifferenced and double-differenced processing methods. The answer follows of course immediately from the equivalence of both methods themselves. But we will make this equivalence even clearer with a relationship between “undifferenced VIFs” and “double-differenced VIFs” (Sect. 3.4).

3.1 Collinearity diagnosis for implicit parameters

The purpose of this section is to generalize the collinearity diagnosis exposed in Sect. 2.4 to quantities which are, like geocenter coordinates, not estimated as explicit model parameters, but implicitly realized through the model parameters. It is again assumed in this section that \({\varvec{A}}\) and \({\varvec{N}}\) are of full rank.

Implicit parameters Let us define an implicit parameter as a quantity \(y\) whose impact of a unit variation on the observations predicted by the model (signature) is a known vector of the model space:

$$\begin{aligned} {\varvec{S}}_y = \frac{\partial {\varvec{l}}}{\partial y} = {\varvec{A}}{{\varvec{\lambda }}} \end{aligned}$$
(16)

For instance, in a global GNSS problem, the signature of the implicit parameter corresponding to the \(X\) coordinate of the geocenter is:

$$\begin{aligned} {\varvec{A}} \; [ \underbrace{1, 0, 0, 1, 0, 0, \ldots , 1, 0, 0}_{\text {station coordinates}}, \underbrace{0, 0, 0, 0, \ldots , 0}_{\text {other parameters}}]^\mathrm{T} \end{aligned}$$
(17)

i.e. the impact on the observations predicted by the model of a unit \(X\) translation of the station network. Explicit model parameters are encompassed by the notion of implicit parameter. The signature of \({\varvec{x}}_i\) on the observations predicted by the model is indeed a known vector of the model space: \(\partial {\varvec{l}} / \partial {\varvec{x}}_i = {\varvec{A}}_i = {\varvec{A}}{\varvec{e}}_i\).

When \({\varvec{A}}\) is of full rank, a unit variation of an implicit parameter \(y\), i.e. a variation \({{\varvec{\delta l}}} = {\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}\) of the observations, can be univocally associated with a variation:

$$\begin{aligned} {{\varvec{\delta }}}{\varvec{x}} = {\varvec{N}}^{-1}{\varvec{A}}^\mathrm{T}{\varvec{P_l}}{{\varvec{\delta l}}} = {{\varvec{\lambda }}} \end{aligned}$$
(18)

of the estimated model parameters. \({{\varvec{\lambda }}}\) can thus be interpreted as the vector of partial derivatives of the explicit model parameters with respect to \(y\): \({{\varvec{\lambda }}} = \partial {\varvec{x}} / \partial y\).

Independence of two implicit parameters Two implicit parameters \(y\) and \(z,\) characterized, respectively by \(\partial {\varvec{x}} / \partial y = {{\varvec{\lambda }}}\) and \(\partial {\varvec{x}} / \partial z = {{\varvec{\mu }}}\) will be said to be independent if \({{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\mu }}} = 0\), i.e. if \({{\varvec{\lambda }}}\) and \({{\varvec{\mu }}}\) are orthogonal in the parameter space \({\mathbb {R}}^p\). Orthogonality in the parameter space may be thought in terms of degrees of freedom allowed within the model. For example, two explicit parameters \({\varvec{x}}_i\) and \({\varvec{x}}_j\) are clearly allowed to vary independently from each other within the model: \({\varvec{e}}_i^\mathrm{T}{\varvec{e}}_j = 0\). On the other hand, a quantity like \({\varvec{x}}_i + {\varvec{x}}_j\) cannot vary independently from \({\varvec{x}}_i\): \({\varvec{e}}_i^\mathrm{T}({\varvec{e}}_i+{\varvec{e}}_j) \ne 0\). More generally, \({{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\mu }} }= 0\) means that the implicit parameters \(y\) and \(z\) can vary independently from each other within the model.

Decomposition of \({\varvec{S}}_y\) Let \({\varvec{K}}_y\) denote any \(p \times p-1\) matrix whose columns form a basis of \(\text {Ker}({{\varvec{\lambda }}}^\mathrm{T})\). Provided that \({\varvec{A}}\) is of full rank, \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\) is a \(p-1\) dimensional subspace (hyperplane) of the model space \(\text {Im}({\varvec{A}})\) which spans the signatures of all implicit parameters independent of \(y\). The signature \({\varvec{S}}_y\) of \(y\) can be decomposed into the sum of its orthogonal projection \({\varvec{S}}_y^u\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\) and of its orthogonal projection \({\varvec{S}}_y^p\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_y)^{\bot }\). \({\varvec{S}}_y^u\), as it lies in \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\), can be exactly reproduced by the signatures of implicit parameters independent of \(y\), i.e. that are allowed to vary independently of \(y\) within the model. The estimation of \(y\) can therefore only rely on \({\varvec{S}}_y^p\), which will be called the proper signature of \(y\). Appendix B (Online Resource) proves that \({\varvec{S}}_y^p\) can be expressed as:

$$\begin{aligned} {\varvec{S}}_y^p = \frac{{{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\lambda }}}}{{{\varvec{\lambda }}}^\mathrm{T}{\varvec{Q}}{{\varvec{\lambda }}}}{\varvec{A}}{\varvec{Q}}{{\varvec{\lambda }}} \end{aligned}$$
(19)

VIF of an implicit parameter Let \(\theta _{y}\) denote the angle between the signature \({\varvec{S}}_y\) of \(y\) and the hyperplane \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\). Using Eq. (19), it can be shown that:

$$\begin{aligned} \cos \theta _y&= \frac{\langle {\varvec{S}}_y,{\varvec{S}}_y^u \rangle }{\Vert {\varvec{S}}_y\Vert \Vert {\varvec{S}}_y^u\Vert } = \frac{\langle {\varvec{S}}_y,{\varvec{S}}_y-{\varvec{S}}_y^p \rangle }{\Vert {\varvec{S}}_y\Vert \Vert {\varvec{S}}_y-{\varvec{S}}_y^p\Vert } \nonumber \\&= \sqrt{1-\frac{({{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\lambda }}})^2}{({{\varvec{\lambda }}}^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}})({{\varvec{\lambda }}}^\mathrm{T}{\varvec{Q}}{{\varvec{\lambda }}})}} \end{aligned}$$
(20)

One can also show that, if a regression of \({\varvec{S}}_y\) on the columns of \({\varvec{A}}{\varvec{K}}_y\) was made, the obtained coefficient of determination would be \(R_y^2 = \cos ^2\theta _y\).

The variance inflation factor \(V_y\) of an implicit parameter \(y\) can be defined from \(\theta _y\) or \(R_y^2\) by:

$$\begin{aligned} V_y = \frac{1}{\sin ^2\theta _y} = \frac{1}{1-R_y^2} \end{aligned}$$
(21)

Inserting Eq. (20) into Eq. (21) leads to a formula from which \(V_y\) can be practically computed when disposing of the normal matrix \({\varvec{N}}\) and of its inverse \({\varvec{Q}}\):

$$\begin{aligned} V_y = \frac{({{\varvec{\lambda }}}^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}})({{\varvec{\lambda }}}^\mathrm{T}{\varvec{Q}}{{\varvec{\lambda }}})}{({{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\lambda }}})^2} \end{aligned}$$
(22)

Replacing \({{\varvec{\lambda }}}\) by \({\varvec{e}}_i\) in the preceding developments leads back to the equations of Sect. 2.4, which proves that this extension of VIFs to implicit parameters encompasses usual VIFs.

The interpretation of such generalized VIFs remains the same. \(V_y = 1\) when \({\varvec{S}}_y\) is orthogonal to \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\), i.e. when \(y\) is uncorrelated with any other independent implicit parameter. On the other hand, \(V_y\) tends to infinity when \({\varvec{S}}_y\) tends to lie in \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\) and \(y\) tends to be unobservable. Note that \(V_y\) is still interpretable as a “variance inflation factor”. No proof is however given here; further developments would be needed to properly define the variance of an implicit parameter. Let us finally mention that \(V_y\) can still be interpreted as the ratio of the squared norm of the signature of \(y\) on the observations to the squared norm of its proper signature. Using Eqs. (16) and (19), one can in fact show that:

$$\begin{aligned} V_y = \frac{\Vert {\varvec{S}}_y\Vert ^2}{\Vert {\varvec{S}}_y^p\Vert ^2} \end{aligned}$$
(23)

Coordinates of \({\varvec{S}}_y^u\) Like the coordinates of \({\varvec{A}}_i^u\) in the basis formed by the columns of \({\varvec{A}}\) can enlighten why a parameter \({\varvec{x}}_i\) may be hardly observable, the coordinates of \({\varvec{S}}_y^u\) in this basis can similarly explain why an implicit parameter \(y\) is hardly observable. These coordinates can now be interpreted as \({{\varvec{\mu }}} = \partial {\varvec{x}} / \partial z\) where \(z\) is the implicit parameter independent of \(y\) whose signature \({\varvec{S}}_z = {\varvec{S}}_y^u\) most closely matches \({\varvec{S}}_y\).

According to Eqs. (16) and (19),

$$\begin{aligned} {\varvec{S}}_y^u = {\varvec{S}}_y - {\varvec{S}}_y^p = {\varvec{A}}\left( {{\varvec{\lambda }}} - \frac{{{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\lambda }}}}{{{\varvec{\lambda }}}^\mathrm{T}{\varvec{Q}}{{\varvec{\lambda }}}}{\varvec{Q}}{{\varvec{\lambda }}}\right) \end{aligned}$$
(24)

so that the coordinates \({{\varvec{\gamma }}}_y\) of \({{\varvec{S}}}_y^u\) in the basis formed by the columns of \({\varvec{A}}\) are:

$$\begin{aligned} {{\varvec{\gamma }}}_y = {{\varvec{\lambda }}} - \frac{{{\varvec{\lambda }}}^\mathrm{T}{{\varvec{\lambda }}}}{{{\varvec{\lambda }}}^\mathrm{T}{\varvec{Q}}{{\varvec{\lambda }}}}{\varvec{Q}}{{\varvec{\lambda }}} \end{aligned}$$
(25)

3.2 Collinearity diagnosis in the presence of singularities and minimal constraints

In many geodetic problems, the design matrix \({\varvec{A}}\) is not of full rank, so that the normal matrix \({\varvec{N}}\) is not invertible. The formulas derived in Sect. 3.1, as they involve the inverse \({\varvec{Q}}\) of the normal matrix, are therefore clearly not usable. Another view of the problem is that, when \({\varvec{A}}\) is not of full rank, an implicit parameter \(y\) can generally not be univocally associated with a specific variation of the model parameters. A third view of the problem is that, because of the singularities of \({\varvec{A}}\), the subspace \(\text {Im}({\varvec{A}}{\varvec{K}}_y)\) defined in Sect. 3.1 is not necessarily a hyperplane of the model space, but can be the model space \(\text {Im}({\varvec{A}})\) itself. This section therefore proposes a way of performing meaningful collinearity diagnoses when \({\varvec{A}}\) is not of full rank, and more precisely in the case where minimal constraints are used to supplement the rank deficiencies of \({\varvec{A}}\).

Minimal constraints When \({\varvec{A}}\) and \({\varvec{N}}\) have \(c > 0\) singularities, a well-established practice in geodesy consists in augmenting the normal equation with minimal constraints. This leads to a system of the form:

$$\begin{aligned} \left[ \begin{array}{c@{\quad }c} {\varvec{N}} &{} {\varvec{C}} \\ {\varvec{C}}^\mathrm{T} &{} {\varvec{0}}\\ \end{array} \right] \left[ \begin{array}{c} {\varvec{x}}_c \\ {\varvec{k}}\\ \end{array} \right] = \left[ \begin{array}{c} {\varvec{b}} \\ {\varvec{0}}\\ \end{array} \right] \end{aligned}$$
(26)

where

  • \({\varvec{C}}^\mathrm{T}\) is a \(c \times p\) full-rank constraint matrix such that \(\text {rank}[{\varvec{N}}\;{\varvec{C}}] = p\);

  • \({\varvec{k}}\) is a vector of \(c\) Lagrange multipliers.

In global GNSS problems, the constraint matrix classically has the form:

$$\begin{aligned} {\varvec{C}}^\mathrm{T} = \underbrace{ \begin{array}{|l@{\quad }l@{\quad }l@{\quad }l@{\quad }l} \ldots &{} 0 &{} -Z_i &{} Y_i &{} \ldots \\ \ldots &{} Z_i &{} 0 &{} -X_i &{} \ldots \\ \ldots &{} -Y_i &{} X_i &{} 0 &{} \ldots \\ \ldots &{} 0 &{} 0 &{} 0 &{} \ldots \\ \end{array}} _{\text {station coordinates}} \underbrace{ \begin{array}{l} 0 \\ 0 \\ 0 \\ 1\\ \end{array}} _{\text {UT1-UTC}} \underbrace{ \begin{array}{l@{\quad }l@{\quad }l|} 0 &{} \ldots &{} 0 \\ 0 &{} \ldots &{} 0 \\ 0 &{} \ldots &{} 0 \\ 0 &{} \ldots &{} 0\\ \end{array}} _{\text {other parameters}}\nonumber \\ \end{aligned}$$
(27)

The first three lines impose that the orientation of the estimated terrestrial frame be the same as that of the a priori frame (no-net-rotation constraint). The fourth line directly fixes the UT1-UTC offset to its a priori value.

Blaha (1971) shows that the augmented normal equation (26) has a unique solution \(\hat{{\varvec{x}}}_c\) which minimizes \(\Vert {\varvec{v}}\Vert = \Vert {\varvec{l}}-{\varvec{A}}{\varvec{x}}_c\Vert \) under the condition (constraint) that \({\varvec{C}}^\mathrm{T}{\varvec{x}}_c = {\varvec{0}}\). He also proves that the covariance matrix \({\varvec{Q}}_c\) of \(\hat{{\varvec{x}}}_c\) is the upper-left block of the inverse of the augmented normal matrix. Appendix C (Online Resource) shows that \(\hat{{\varvec{x}}}_c\) and \({\varvec{Q}}_c\) can be expressed as:

$$\begin{aligned} \hat{{\varvec{x}}}_c&= {\varvec{B}}({\varvec{B}}^\mathrm{T}{\varvec{N}}{\varvec{B}})^{-1}{\varvec{B}}^\mathrm{T}{\varvec{b}}\end{aligned}$$
(28)
$$\begin{aligned} {\varvec{Q}}_c&= {\varvec{B}}({\varvec{B}}^\mathrm{T}{\varvec{N}}{\varvec{B}})^{-1}{\varvec{B}}^\mathrm{T} \end{aligned}$$
(29)

where \({\varvec{B}}\) denotes any \(p \times p-c\) matrix whose columns form a basis of \(\text {Ker}({\varvec{C}}^\mathrm{T})\).

Note that we use the term “constraints” in the same sense as Blaha (1971), i.e. as a synonym for “condition equations”. Another common acceptance of this term refers to the introduction of weighted pseudo-observations, which would lead to a normal equation of type \(({\varvec{N}}+{\varvec{C}}{\varvec{P}}_c{\varvec{C}}^\mathrm{T}){\varvec{x}} = {\varvec{b}}\). Although this latter method is generally preferred in practice, we adopt here the condition equation approach for its mathematical appeal. Condition equations are in fact equivalent to pseudo-observations with infinite weights.

Implicit parameters Let \(y\) be an implicit parameter defined by its signature \({\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}\) on the observations. When \({\varvec{A}}\) is not of full rank, but minimal constraints are used, \(y\) can again be univocally associated with a specific variation of the model parameters. According to Eq. (28), a unit variation of \(y\), i.e. a variation \({{\varvec{\delta l}}} = {\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}\) of the observations, indeed leads to a variation:

$$\begin{aligned} {{\varvec{\delta }}}{\varvec{x}}_c&= {\varvec{B}}({\varvec{B}}^\mathrm{T}{\varvec{N}}{\varvec{B}})^{-1}{\varvec{B}}^\mathrm{T}{\varvec{A}}^\mathrm{T}{\varvec{P}}_{{\varvec{l}}}{{\varvec{\delta l}}} \nonumber \\&= {\varvec{B}}({\varvec{B}}^\mathrm{T}{\varvec{N}}{\varvec{B}})^{-1}{\varvec{B}}^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}} \end{aligned}$$
(30)

of the estimated parameters. In other words, the partial derivatives \({{\varvec{\lambda }}}_c\) of the model parameters with respect to \(y\) are:

$$\begin{aligned} {{\varvec{\lambda }}}_c = \frac{\partial {\varvec{x}}_c}{\partial y} = {\varvec{B}}({\varvec{B}}^\mathrm{T}{\varvec{N}}{\varvec{B}})^{-1}{\varvec{B}}^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}} = {\varvec{Q}}_c{\varvec{N}}{{\varvec{\lambda }}} \end{aligned}$$
(31)

\({{\varvec{\lambda }}}_c\) is in fact the unique solution of the system:

$$\begin{aligned} \left\{ \begin{array}{ll} {\varvec{A}}{{\varvec{\lambda }}}_c &{} = {\varvec{A}}{{\varvec{\lambda }}} \\ {\varvec{C}}^\mathrm{T}{{\varvec{\lambda }}}_c &{} = {\varvec{0}}\\ \end{array} \right. \end{aligned}$$
(32)

It can therefore be interpreted as the unique variation of the model parameters which is allowed by the constraints and has the same signature on the observations as \(y\).

The notion of independence between implicit parameters remains unchanged: \(y\) and \(z\) are independent if the corresponding parameter variations \({{\varvec{\lambda }}}_c = \partial {\varvec{x}}_c / \partial y\) and \({{\varvec{\mu }}}_c = \partial {\varvec{x}}_c / \partial z\) are orthogonal in the parameter space. For a given \(y\) and \({{\varvec{\lambda }}}_c\), the set of such \({{\varvec{\mu }}}_c\)s is \(\text {Ker}([{{\varvec{\lambda }}}_c \; {\varvec{C}}]^\mathrm{T})\).

Decomposition of \({\varvec{S}}_y\) Let \({\varvec{K}}_{y,c}\) denote any matrix whose columns form a basis of \(\text {Ker}([{{\varvec{\lambda }}}_c \; {\varvec{C}}]^\mathrm{T})\). It is proven in Appendix D (Online Resource) that as long as \({{\varvec{\lambda }}} \notin \text {Ker}({\varvec{A}})\), \(\text {Im}({\varvec{A}}{\varvec{K}}_{y,c})\) is a \(p-c-1\) dimensional subspace (hyperplane) of the model space. As previously, the signature \({\varvec{S}}_y\) of \(y\) can then be decomposed into the sum of its orthogonal projection \({\varvec{S}}_{y,c}^u\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_{y,c})\) and of its orthogonal projection \({\varvec{S}}_{y,c}^p\) onto \(\text {Im}({\varvec{A}}{\varvec{K}}_{y,c})^{\bot }\). \({\varvec{S}}_{y,c}^u\) is indistinguishable from the signatures of implicit parameters independent of \(y\). The estimation of \(y\) can therefore only rely on its proper signature \({\varvec{S}}_{y,c}^p\). Appendix D (Online Resource) proves that it can be expressed as:

$$\begin{aligned} {\varvec{S}}_{y,c}^p = \frac{{{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c}{{{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{Q}}_c{{\varvec{\lambda }}}_c}{\varvec{A}}{\varvec{Q}}_c{{\varvec{\lambda }}}_c \end{aligned}$$
(33)

Variance inflation factors When \({\varvec{A}}\) is not of full rank and minimal constraints are used, the variance inflation factor of an implicit parameter \(y\) can be defined from the angle \(\theta _{y,c}\) between its signature and the hyperplane \(\text {Im}({\varvec{A}}{\varvec{K}}_{y,c})\) by:

$$\begin{aligned} V_{y,c} = \frac{1}{\sin ^2\theta _{y,c}} \end{aligned}$$
(34)

Using Eq. (33), it can be shown that:

$$\begin{aligned} \cos \theta _{y,c}&= \frac{\langle {\varvec{S}}_y,{\varvec{S}}_{y,c}^u \rangle }{\Vert {\varvec{S}}_y\Vert \Vert {\varvec{S}}_{y,c}^u\Vert } = \frac{\langle {\varvec{S}}_y,{\varvec{S}}_y-{\varvec{S}}_{y,c}^p \rangle }{\Vert {\varvec{S}}_y\Vert \Vert {\varvec{S}}_y-{\varvec{S}}_{y,c}^p\Vert } \nonumber \\&= \sqrt{1-\frac{({{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c)^2}{({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}}_c)({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{Q}}_c{{\varvec{\lambda }}}_c)}} \end{aligned}$$
(35)

This leads to a formula from which \(V_{y,c}\) can be practically computed when disposing of \({\varvec{N}}\) and \({\varvec{Q}}_c\):

$$\begin{aligned} V_{y,c} = \frac{({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{N}}{{\varvec{\lambda }}}_c)({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{Q}}_c{{\varvec{\lambda }}}_c)}{({{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c)^2} \end{aligned}$$
(36)

As previously, \(V_{y,c}\) can be interpreted as the ratio of the squared norm of the signature of \(y\) on the observations to the squared norm of its proper signature. Using Eq. (33) and the fact that \({\varvec{A}}{{\varvec{\lambda }}} = {\varvec{A}}{{\varvec{\lambda _c}}}\), one can in fact show that:

$$\begin{aligned} V_y = \frac{\Vert {\varvec{S}}_y\Vert ^2}{\Vert {\varvec{S}}_{y,c}^p\Vert ^2} \end{aligned}$$
(37)

Coordinates of \({\varvec{S}}_{y,c}^u\) According to Eq. (33), \({\varvec{S}}_{y,c}^u\) can be expressed as:

$$\begin{aligned} {\varvec{S}}_{y,c}^u = {\varvec{S}}_y - {\varvec{S}}_{y,c}^p = {\varvec{A}}\left( {{\varvec{\lambda }}}_c - \frac{{{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c}{{{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{Q}}_c{{\varvec{\lambda }}}_c}{\varvec{Q}}_c{{\varvec{\lambda }}}_c\right) \end{aligned}$$
(38)

The vector

$$\begin{aligned} {{\varvec{\gamma }}}_{y,c} = {{\varvec{\lambda }}}_c - \frac{{{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c}{{{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{Q}}_c{{\varvec{\lambda }}}_c}{\varvec{Q}}_c{{\varvec{\lambda }}}_c \end{aligned}$$
(39)

can be interpreted as \(\partial {\varvec{x}}_c / \partial z\) where \(z\) is the implicit parameter independent of \(y\) whose signature most closely matches \({\varvec{S}}_{y}\). It can therefore explain why an implicit parameter \(y\) may be hardly observable.

3.3 Practical method for collinearity diagnoses

The expressions of \(V_{y,c}\) and \({{\varvec{\gamma }}}_{y,c}\) given in Sect. 3.2 involve the full normal matrix \({\varvec{N}}\) and the full covariance matrix of the parameters \({\varvec{Q}}_c\). However, in a standard global undifferenced GNSS analysis, station and satellite clock parameters are generally reduced, so that neither \({\varvec{N}}\) nor \({\varvec{Q}}_c\) are fully computed. A practical way for obtaining \(V_{y,c}\) and \({{\varvec{\gamma }}}_{y,c}\) is therefore exposed in this section.

Parameter reduction Assume that the vector of parameters can be split into \({\varvec{x}} = [{\varvec{x}}_1^\mathrm{T} \; {\varvec{x}}_2^\mathrm{T}]^\mathrm{T}\) where \({\varvec{x}}_1\) contains parameters of interest, while \({\varvec{x}}_2\) contains “nuisance parameters” (e.g. clock offsets). A constrained normal equation like Eq. (26) can be block-decomposed into:

$$\begin{aligned} \left[ \begin{array}{l@{\quad }l@{\quad }l} {\varvec{N}}_{11} &{} {\varvec{N}}_{12} &{} {\varvec{C}}_1 \\ {\varvec{N}}_{21} &{} {\varvec{N}}_{22} &{} {\varvec{C}}_2 \\ {\varvec{C}}_1^\mathrm{T} &{} {\varvec{C}}_2^\mathrm{T} &{} {\varvec{0}}\\ \end{array} \right] \left[ \begin{array}{l} {\varvec{x}}_1 \\ {\varvec{x}}_2 \\ {\varvec{k}}\\ \end{array} \right] = \left[ \begin{array}{l} {\varvec{b}}_1 \\ {\varvec{b}}_2 \\ {\varvec{0}}\\ \end{array} \right] \end{aligned}$$
(40)

If \({\varvec{N}}_{22}\) is invertible, solving Eq. (40) is equivalent to solving the reduced normal equation:

$$\begin{aligned}&\left[ \begin{array}{l@{\quad }l} {\varvec{N}}_{11} - {\varvec{N}}_{12}{\varvec{N}}_{22}^{-1}{\varvec{N}}_{21} &{} {\varvec{C}}_1 - {\varvec{N}}_{12}{\varvec{N}}_{22}^{-1}{\varvec{C}}_2 \\ {\varvec{C}}_1^\mathrm{T} - {\varvec{C}}_2^\mathrm{T}{\varvec{N}}_{22}^{-1}{\varvec{N}}_{21} &{} - {\varvec{C}}_2^\mathrm{T}{\varvec{N}}_{22}^{-1}{\varvec{C}}_{2}\\ \end{array} \right] \left[ \begin{array}{l} {\varvec{x}}_1 \\ {\varvec{k}}\\ \end{array} \right] \nonumber \\&\quad = \left[ \begin{array}{l} {\varvec{b}}_1 - {\varvec{N}}_{12}{\varvec{N}}_{22}^{-1}{\varvec{b}}_2 \\ -{\varvec{C}}_2^\mathrm{T}{\varvec{N}}_{22}^{-1}{\varvec{b}}_{2}\\ \end{array} \right] \end{aligned}$$
(41)

and “back-solving” \({\varvec{x}}_2\) by:

$$\begin{aligned} {\varvec{x}}_2 = {\varvec{N}}_{22}^{-1}({\varvec{b}}_2 - {\varvec{N}}_{21}{\varvec{x}}_1 - {\varvec{C}}_2{\varvec{k}}) \end{aligned}$$
(42)

This reduction method is widely employed in undifferenced GNSS analyses. It avoids the storage and inversion of the full normal matrix \({\varvec{N}}\). Moreover, since \({\varvec{x}}_2\) consists of epoch clock parameters, \({\varvec{N}}_{22}\) has a block-diagonal structure which makes it efficiently invertible.

Computing \({{\varvec{\lambda }}}_c\) Consider an implicit parameter \(y\) defined by its signature \({\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}\) on the observations. The first step in our practical method for diagnosing the collinearity of \(y\) consists in computing the partial derivatives \({{\varvec{\lambda }}}_c = \partial {\varvec{x}}_c / \partial y\) of the model parameters with respect to \(y\). From Eqs. (28) and (31), it is easily seen that \({{\varvec{\lambda }}}_c\) is the solution of the following normal equation:

$$\begin{aligned} \left[ \begin{array}{l@{\quad }l} {\varvec{N}} &{} {\varvec{C}} \\ {\varvec{C}}^\mathrm{T} &{} {\varvec{0}}\\ \end{array} \right] \left[ \begin{array}{l} {\varvec{x}}_c \\ {\varvec{k}}\\ \end{array} \right] = \left[ \begin{array}{l} {\varvec{N}}{{\varvec{\lambda }}} \\ {\varvec{0}}\\ \end{array} \right] = \left[ \begin{array}{l} {\varvec{A}}^\mathrm{T}{\varvec{P}}_{{\varvec{l}}}{\varvec{S}}_y \\ {\varvec{0}}\\ \end{array} \right] \end{aligned}$$
(43)

which can be reduced and efficiently solved.

Computing \({{\varvec{\gamma }}}_{y,c}\) Consider the following system:

$$\begin{aligned} \left[ \begin{array}{c@{\quad }c@{\quad }c} {\varvec{N}} &{} {\varvec{C}} &{} {{\varvec{\lambda }}}_c\\ {\varvec{C}}^\mathrm{T} &{} {\varvec{0}} &{} {\varvec{0}} \\ {{\varvec{\lambda }}}_c^\mathrm{T} &{} {\varvec{0}} &{} {\varvec{0}}\\ \end{array} \right] \left[ \begin{array}{c} {\varvec{x}}_c \\ {\varvec{k}} \\ {\varvec{k'}}\\ \end{array} \right] = \left[ \begin{array}{c} {\varvec{0}} \\ {\varvec{0}} \\ {{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c \end{array} \right] \end{aligned}$$
(44)

Its solution minimizes \(\Vert {\varvec{A}}{\varvec{x}}_c\Vert \) under the constraint that \({\varvec{C}}^\mathrm{T}{\varvec{x}}_c = 0\) and the additional condition that \({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{x}}_c = {{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c\). This latter condition is equivalent to imposing a unit change of \(y\) with respect to the a priori parameters. On the other hand, the minimization of \(\Vert {\varvec{A}}{\varvec{x}}_c\Vert \) means that the observations predicted by the estimated parameters should stay as close as possible to the observations predicted by the a priori parameters. In other words, an artificial unit error in \(y\) is introduced, but the system tries to compensate its impact on the observations as much as possible.

Appendix E (Online Resource) proves that the solution of Eq. (44) is \({{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c}\). The first term, \({{\varvec{\lambda }}}_c\) represents the artificially introduced error, while the second term, \(-{{\varvec{\gamma }}}_{y,c}\), is the best solution found by the system to compensate the impact of this error on the observations. Equation (44) can also be reduced and efficiently solved. \({{\varvec{\gamma }}}_{y,c}\) can then be obtained from its solution.

Computing \(V_{y,c}\) The last remaining step is a practical method for computing the VIF \(V_{y,c}\) of \(y\). For that purpose, let us rearrange Eq. (37) into:

$$\begin{aligned} V_{y,c} = \frac{\Vert {\varvec{S}}_y\Vert ^2}{\Vert {\varvec{S}}_{y,c}^p\Vert ^2} = \frac{\Vert {\varvec{A}}{{\varvec{\lambda }}}_c\Vert ^2}{\Vert {\varvec{A}}({{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c})\Vert ^2} \end{aligned}$$
(45)

This last expression shows how to obtain \(V_{y,c}\) from \({\varvec{A}}{{\varvec{\lambda }}}_c\) and \({\varvec{A}}({{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c})\). In practice, the full design matrix \({\varvec{A}}\) may not be available, but \({\varvec{A}}{{\varvec{\lambda }}}_c\) and \({\varvec{A}}({{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c})\) can nevertheless be computed by evaluating the model function at the set of a priori parameters \({\varvec{x}}_0\), at \({\varvec{x}}_0 + {{\varvec{\lambda }}}_c\) and at \({\varvec{x}}_0 + {{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c}\).

3.4 Impact of double-differencing on VIFs

It is shown in Appendix F (Online Resource) that, as long as a maximal set of independent double-differenced observations is used, the double differencing approach leads to an identical normal equation as the reduction of clock parameters. The use of double-differenced observations rather than undifferenced observations has therefore no theoretical impact on the GNSS geocenter determination problem. One could nevertheless compute VIFs directly from a double-differenced system and would obtain different values as with the full undifferenced system. We therefore deem it important to explain the relationship between such double-differenced and undifferenced VIFs.

Suppose that a maximal set of independent double-differenced observations \({\varvec{l}}_d\) is obtained from the undifferenced observations \({\varvec{l}}\) through the application of an \(n_d \times n\) differencing operator \({\varvec{D}}\): \({\varvec{l}}_d = {\varvec{Dl}}\). The weight matrix of the double-differenced observations:

$$\begin{aligned} {\varvec{P}}_{{\varvec{l}}_d} = {\varvec{Q}}_{{\varvec{l}}_d}^{-1} = ({\varvec{D}}{\varvec{Q}}_{{\varvec{l}}}{\varvec{D}}^\mathrm{T})^{-1} \end{aligned}$$
(46)

induces the norm \(\Vert {{\varvec{l}}}_d\Vert _d = \sqrt{{{\varvec{l}}}_d^\mathrm{T}{{\varvec{P}}}_{{\varvec{l}}_d}{\varvec{l}}_d}\) in the space of double-differenced observations \({\mathbb {R}}^{n_d}\). Consider an implicit parameter \(y\) defined by its signature \({\varvec{S}}_y\) on the undifferenced observations. The signature of \(y\) on the double-differenced observations is \({\varvec{S}}_{y,d} = {\varvec{D}}{\varvec{S}}_y\). Appendix G (Online Resource) proves that the VIFs \(V_y\) and \(V_{y,d},\) obtained, respectively in the undifferenced and in the double-differenced cases are related by:

$$\begin{aligned} \frac{V_{y,d}}{V_y} = \frac{\Vert {\varvec{S}}_{y,d}\Vert _d^2}{\Vert {\varvec{S}}_y\Vert ^2} \end{aligned}$$
(47)

This fraction is also proven to be always smaller than 1.

The left term in Eq. (47) represents a reduction of the collinearity of \(y\) with the other parameters in the double-differenced case. This reduction naturally stems from the fact that a double-differenced system has less parameters than an undifferenced system. On the other hand, the right term in Eq. (47) represents a loss of sensitivity to \(y\) due to double-differencing the observations. In summary, the formation of double differences has the apparent advantage of reducing collinearity among parameters, but this advantage is in fact totally offset by a reduction of the sensitivity of the observations.

Equation (47) can in particular be used to quantify the sensitivity reduction to geocenter coordinates qualitatively observed by Meindl et al. (2013) when forming double differences. This sensitivity reduction can equivalently be thought in terms of collinearity of the geocenter coordinates with satellite and station clock offsets. In the following, we will prefer undifferenced observations for their simpler geometric interpretation and adopt the latter point of view.

4 Application to the GNSS terrestrial scale determination problem

Because of:

  • a high correlation between the terrestrial scale and the satellite radial phase center offsets (z-PCOs) (Zhu et al. 2003; Cardellach et al. 2007),

  • the fact that pre-launch phase center offset calibrations are not available for any of the GPS and GLONASS satellites launched so far,

current GNSS are considered unable to provide terrestrial scale information and do not contribute to the definition of the ITRF scale (Altamimi et al. 2011). On the contrary, the IGS currently relies on the ITRF scale to determine conventional GPS and GLONASS z-PCOs (Ray et al. 2013; Rebischung et al. 2012).

The collinearity diagnosis developed in Sect. 3 is applied in this section to this well-known problem, mainly with an illustrative purpose. The VIFs obtained for the terrestrial scale will also serve as references for interpreting the VIFs obtained in Sect. 7 for the geocenter coordinates.

4.1 Signature of a terrestrial scale change

Let us consider the implicit parameter \(y\) corresponding to a terrestrial scale change of \(\delta s = 1\) ppb. The signature of \(y\) on a set of GNSS observations is \({{\varvec{S}}}_y = {{\varvec{A}}}{{\varvec{\lambda }}}\) with:

$$\begin{aligned} {{\varvec{\lambda }}} = \delta s \, [ \underbrace{\ldots , X_i, Y_i, Z_i, \ldots }_{\text {station coordinates}}, \underbrace{0, 0, 0, 0, \ldots , 0}_{\text {other parameters}}]^\mathrm{T} \end{aligned}$$
(48)

\({{\varvec{\lambda }}}\) corresponds to a change of the geocentric radius (height) of any station by \(\delta r \simeq a_\mathrm{E} \delta s \simeq 6.4\) mm, where \(a_\mathrm{E}\) denotes the mean Earth radius. The impact of such a height change on an observation \(l\) made at an elevation angle \(e\) is \(\delta l = - \delta r \sin (e) \simeq - a_\mathrm{E} \delta s \sin (e)\). The elements of \({\varvec{S}}_y\) are nothing but such \(\delta l\)s.

4.2 Simulation

Using the options listed in Table 1, a series of observations (station-satellite ranges) was simulated and a design matrix including the parameters listed in Table 2 was set up. The considered satellite empirical accelerations are the five terms of the Extended CODE Orbit Model (ECOM) (Beutler et al. 1994; Springer 1999) estimated by most of the IGS Analysis Centers (ACs): \(D_0\), \(Y_0\), \(B_0\), \(B_\mathrm{c}\) and \(B_\mathrm{s}\) using the notations of Rodriguez-Solano et al. (2012). The considered EOPs are the pole coordinates \(x_p\) and \(y_p\) at noon, their rates \({\dot{x}}_p\) and \({\dot{y}}_p\), the UT1-UTC offset at noon and the length-of-day (LOD). Note that simple mapping functions have been used for ZWDs (\(1{/}\!\sin (e)\)) and tropospheric gradients (from Chen and Herring 1997). Also note that a zero-mean condition has been applied at each epoch to all satellite and station clock offsets.

Table 1 Simulation options
Table 2 List of simulated parameters

Except satellite z-PCOs, the list of parameters in Table 2 is the minimal common set of parameters freely estimated by a majority of IGS ACs. It could in fact be the list of parameters set up by a slightly sub-standard AC. The collinearity diagnoses exposed in the following will be based on this particular sub-standard list of parameters. But note that the estimation of additional parameters (e.g. satellite velocity pulses, additional empirical accelerations, but also unfixed phase cycle ambiguities) by the real ACs can only increase collinearity issues.

The satellite z-PCOs are usually set up, but tightly constrained by the IGS ACs. In the simulations of this section, they are however freely estimated for the particular purpose of studying their collinearity with the terrestrial scale.

Except the fictive station network and constellation, the options listed in Table 1 could also be those used by an IGS AC. The choice of a perfectly distributed network and constellation was made to avoid perturbing the obtained results by some “network effect”. But additional simulations with real network and constellation will also be performed for comparison in Sect. 4.4. The influence of the cutoff angle and of the observation weighting will similarly be investigated through additional simulations in Sect. 4.4.

4.3 Collinearity diagnosis

Equation (43) was then solved, leading to \({{\varvec{\lambda }}}_c = \partial {\varvec{x}}_c / \partial y\), i.e. the impact of a 1 ppb scale change on the estimated parameters under no-net-rotation constraints. Because a perfectly distributed station network was used, \({{\varvec{\lambda }}}_c\) was in fact equal to \({{\varvec{\lambda }}}\).

Equation (44) was finally solved. A 1 ppb scale change with respect to the a priori terrestrial frame was imposed through the additional constraint \({{\varvec{\lambda }}}_c^\mathrm{T}{\varvec{x}}_c = {{\varvec{\lambda }}}_c^\mathrm{T}{{\varvec{\lambda }}}_c\). But the system compensated it as much as possible through an independent variation \(-{{\varvec{\gamma }}}_{y,c}\) of the parameters. The obtained parameter variation \(-{{\varvec{\gamma }}}_{y,c}\) was composed of:

  • a mean variation of the satellite z-PCOs by \(-\)137.06 mm, plus small satellite-specific z-PCO variations below 0.23 mm,

  • a mean variation of the satellite clock offsets by \(-\!\)25.18 mm (\(-\)83.99 ps), plus small satellite- and epoch-specific variations below 0.43 mm (1.43 ps),

  • a mean variation of the station clock offsets by 104.91 mm, plus small station- and epoch-specific variations below 0.29 mm,

  • a mean variation of the ZWDs by \(-\)0.40 mm, plus small station- and hour-specific variations below 0.11 mm.

The other parameters (station coordinates, tropospheric gradients and satellite orbit parameters) were not significantly affected.

The obtained mean variation of the satellite and station clock offsets has the effect of shortening the predicted observations by \(-25.18-104.91 = -130.09\) mm. The impact of the obtained mean ZWD variation is to shorten any observation made at an elevation \(e\) by \(-0.40 / \sin e\) mm. Finally, according to Cardellach et al. (2007), the impact of the obtained mean z-PCO variation is to lengthen any observation made at an elevation \(e\) by \(137.06\sqrt{1-a_\mathrm{E}^2\cos ^2e/a^2}\) mm. The sum of these three effects is compared in Fig. 2 with the impact on the observations of the introduced terrestrial scale change.

Fig. 2
figure 2

Blue opposite of the impact on the observations of a 1 ppb scale change, plotted as a function of the elevation angle (\(a_\mathrm{E} \delta s \sin (e)\)). Red sum of the impacts on the observations of the obtained clock, ZWD and z-PCO mean variations, plotted as a function of the elevation angle \(\Bigg (130.09 + 0.40 / \sin e - 137.06\sqrt{1-a_\mathrm{E}^2\cos ^2e/a^2}\Bigg )\)

Figure 2 shows that the impact of the introduced terrestrial scale change could be almost perfectly compensated by variations of independent parameters (clock offsets, ZWDs and z-PCOs), which indicates a severe collinearity of the terrestrial scale with those parameters. Another view of the same result is given in Fig. 3, where the signature of a 1 ppb terrestrial scale change on the observations made by a particular station is compared to the corresponding proper signature. There is a reduction by almost two orders of magnitude from the signature to the proper signature of the terrestrial scale change, which also points to a severe collinearity issue.

Fig. 3
figure 3

Left signature of a 1 ppb scale change on the observations made by a particular station, represented as a skyplot. Each dot represents an element of \({\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}_c\). Right corresponding proper signature. Each dot represents an element of \({\varvec{S}}_{y,c}^p = {\varvec{A}}({{\varvec{\lambda }}}_c - {{\varvec{\gamma }}}_{y,c})\). Note the different color scales

The VIF obtained for the terrestrial scale in this simulation was in fact 3,076.9 (\(\sqrt{1 - 1/\hbox {VIF}} = 99.984\,\%\)), far beyond the commonly seen thresholds of 5 (89.443 %) and 10 (94.868 %).

4.4 Complementary simulations

A second simulation was run with the same options as previously, except that the satellite z-PCOs were held fixed. The obtained VIF was 47.2 (98.935 %). When the satellite z-PCOs are not estimated, clock offsets and ZWDs are much less able to compensate the introduced terrestrial scale change. The collinearity of the terrestrial scale with the other parameters is thus greatly reduced.

Another simulation based on a real network of 150 stations and a real constellation of 30 GPS satellites (both extracted from a solution provided by ESA for July 11, 2008) was run, with all other options as described in Sect. 4.2. The obtained VIF was 3,123.9 (99.984 %), just slightly above the VIF obtained with a perfectly distributed station network and constellation. The mechanism described in Sect. 4.3 holds in fact for any network and constellation. The distribution of the station network and of the constellation does not seem to have a significant impact on the collinearity of the terrestrial scale with the other parameters, as long as satellite z-PCOs are estimated.

The impacts of the cutoff angle and of the observation weighting were finally investigated through different simulations. The obtained VIFs are given in Table 3. The collinearity of the terrestrial scale with the other parameters clearly increases when higher cutoff angles are used. This translates the fact that the more the signature of the terrestrial scale change (\(- a_\mathrm{E} \delta s \sin (e)\); blue curve in Fig. 2) is truncated, the more accurately it can be matched by linear combinations of the partial derivatives of clock offsets, z-PCOs and ZWDs. Downweighting low elevation observations similarly increases the collinearity of the terrestrial scale with the other parameters.

Table 3 VIFs obtained for the terrestrial scale with different cutoff angles and observation weighting functions

5 Signature of geocenter shifts on station-satellite ranges

As a last preliminary step before investigating the collinearity of the geocenter coordinates with other GNSS parameters, this section describes the signature of geocenter shifts on station-satellite range observations. The impact of a geocenter shift along the \(Z\) axis is first studied, followed by the impacts of geocenter shifts along the \(X\) and \(Y\) axes of the terrestrial frame.

5.1 Signature of a \(Z\) geocenter shift

Consider the implicit parameter \(y\) corresponding to a geocenter shift of \(\delta t = 1\) cm along the \(Z\) axis. The signature of \(y\) on a set of observations is \({\varvec{S}}_y = {\varvec{A}}{{\varvec{\lambda }}}\) with:

$$\begin{aligned} {{\varvec{\lambda }}} = [ \underbrace{0, 0, \delta t, \ldots , 0, 0, \delta t}_{\text {station coordinates}}, \underbrace{0, 0, 0, 0, \ldots , 0}_{\text {other parameters}}]^\mathrm{T} \end{aligned}$$
(49)

This signature was computed for the particular set of observations described in Sect. 4.2. It is represented in Fig. 4 from the point of view of a particular satellite.

Fig. 4
figure 4

Signature of a 1 cm \(Z\) geocenter shift on simulated observations made on a GPS-like satellite by the whole station network. Each blue dot represents an element of \({\varvec{S}}_y\). The magenta line is an epoch mean

The impact of a positive \(Z\) geocenter shift is basically to shorten the observations made when the satellite is in the Northern hemisphere and to lengthen the observations made when the satellite is in the Southern hemisphere. That is why, at first order, the signature of a positive \(Z\) geocenter shift follows the opposite of the satellite \(Z\) coordinate. For a circular orbit, this results in a sinusoid with an amplitude \(\delta t \sin i\), where \(i\) denotes the inclination of the satellite orbit.

The “width” of the blue sinusoid in Fig. 4 stems from the fact that, at a given epoch, observations are made on the same satellite by several stations under different angles, and are thus differently affected by the geocenter shift. This width mainly depends on the radius of the satellite orbit, but also of the employed cutoff angle.

With slight anticipation over Sect. 7, let us mention that the epoch mean signature shown in magenta in Fig. 4 can be strictly compensated by variations of the satellite clock offsets and is therefore unobservable in a standard GNSS analysis. The determination of the \(Z\) geocenter coordinate with GNSS can consequently only rely on the difference between its signature and this epoch mean signature (i.e. on the signal around the magenta line in Fig. 4). This second-order signature is represented in Fig. 5 from the points of view of five particular stations.

Fig. 5
figure 5

Signature of a 1 cm \(Z\) geocenter shift on simulated observations made by five particular stations. An epoch mean signature was first removed for each satellite

5.2 Signatures of \(X\) and \(Y\) geocenter shifts

Similarly, the impact of a positive \(X\) geocenter shift is to shorten the observations made on satellites in the positive \(X\) hemisphere and to lengthen the observations made on satellites in the negative \(X\) hemisphere. At first order, the signature of a positive \(X\) geocenter shift on the observations of a particular satellite thus follows the opposite of the satellite \(X\) coordinate in the terrestrial frame. For a circular orbit, this results in the product of two sinusoids, one at the satellite revolution period and the other at the Earth rotation period. The signature of a 1 cm \(X\) geocenter shift on the set of observations described in Sect. 4.2 is represented in Fig. 6 from the point of view of a particular satellite.

Fig. 6
figure 6

Signature of a 1 cm \(X\) geocenter shift on simulated observations made on a GPS-like satellite. Each blue dot represents an element of \({\varvec{S}}_y\). The magenta line is an epoch mean

As for the \(Z\) component, the epoch mean signature shown in magenta in Fig. 6 can be strictly compensated by variations of the satellite clock offsets and is therefore unobservable in a standard GNSS analysis. The determination of the \(X\) component of the geocenter with GNSS can consequently only rely on the difference between its signature and this epoch mean signature (i.e. on the signal around the magenta line in Fig. 6). This second-order signature is represented in Fig. 7 from the points of view of seven particular stations.

Fig. 7
figure 7

Signature of a 1 cm \(X\) geocenter shift on simulated observations made by seven particular stations. An epoch mean signature was first removed for each satellite

The signature of a \(Y\) geocenter shift on station-satellite ranges is similar in all respects to the signature of an \(X\) geocenter shift.

6 SLR interlude

The collinearity diagnosis developed in Sect. 3 is applied in this section to the SLR geocenter determination problem. This interlude will help in understanding why the SLR technique is much more able to retrieve geocenter motions than GNSS.

6.1 Simulation

A series of station-satellite range observations was simulated with the options listed in Table 1, except that a constellation of two LAGEOS-like satellites (\(a = 12{,}200\) km, \(e = 0\), \(i_1 = 110^\circ \), \(i_2 = 53^\circ \)) was used. A design matrix was set up, including the parameters listed in Table 4. The considered empirical accelerations were constant along-track accelerations and once-per-revolution periodic accelerations in the along-track and cross-track directions: \(S_0\), \(S_c\), \(S_s\), \(W_c\) and \(W_s\) in the notations of Rodriguez-Solano et al. (2012). The main differences with the GNSS case is that neither clock offsets nor tropospheric parameters needed to be set up.

Table 4 List of simulated parameters

The choice of a perfectly distributed network of 100 stations providing observations at a 5 min rate is rather unrealistic for an SLR simulation. But like in the GNSS case, this choice avoids perturbing the obtained results by some network effect. This will also facilitate the comparison with the GNSS results obtained in Sect. 7.

The choice of a 24 h data span is also questionable, as 7 day data batches are usually processed in SLR. 24 h data batches were used for comparison with GNSS. But the results of additional simulations using 7 day data batches will also be presented in Sects. 6.2 and 6.3.

6.2 Collinearity diagnosis of the \(X\) and \(Y\) geocenter coordinates

The implicit parameter corresponding to a 1 cm \(X\) geocenter shift was then considered, and Eq. (44) was solved. Concretely, a 1 cm \(X\) geocenter shift was imposed with respect to the a priori parameters. But the impact of this shift on the observations was compensated as much as possible through independent parameter variations. The VIF thus obtained for the \(X\) geocenter coordinate was 1.8 (67.068 %), which clearly indicates the absence of collinearity issues. This result is illustrated in Fig. 8, where the signature of the introduced \(X\) geocenter shift on the observations made on a particular satellite is compared to the corresponding proper signature. Both have similar magnitudes, meaning that the introduced \(X\) geocenter shift could hardly be compensated by independent parameter variations.

Fig. 8
figure 8

Blue signature of a 1 cm \(X\) geocenter shift on simulated observations made on a LAGEOS-like satellite. Each blue dot represents an element of \({\varvec{S}}_y\). Green corresponding proper signature. Each green dot is an element of \({\varvec{S}}_{y,c}^p\)

The same experience was repeated for the \(Y\) geocenter coordinate. An even lower VIF of 1.2 (42.698 %) was obtained. Additional simulations were carried out using a 7 day data batch. The obtained VIFs were respectively 1.04 (20.359 %) and 1.04 (19.078 %) for the \(X\) and \(Y\) geocenter coordinates. In conclusion, the determination of the \(X\) and \(Y\) geocenter coordinates with SLR does manifestly not suffer from collinearity issues.

6.3 Collinearity diagnosis of the \(Z\) geocenter coordinate

For the \(Z\) coordinate of the geocenter and a 24 h data batch, a larger VIF of 9.0 (94.285 %) was obtained. This result is illustrated in Fig. 9: the proper signature of the introduced \(Z\) geocenter shift is clearly smaller than its signature. The first-order sinusoidal signal could in particular be absorbed by independent parameter variations.

Fig. 9
figure 9

Blue signature of a 1 cm \(Z\) geocenter shift on simulated observations made on a LAGEOS-like satellite. Each blue dot represents an element of \({\varvec{S}}_y\). Green corresponding proper signature. Each green dot is an element of \({\varvec{S}}_{y,c}^p\)

The examination of those independent parameter variations, i.e. \(-{{\varvec{\gamma }}}_{y,c}\), revealed that the most affected parameters were the satellite initial state vectors. We therefore investigated how variations of the satellite initial state vectors could have absorbed the first-order sinusoidal signature of the introduced \(Z\) geocenter shift. It is easily seen that this first-order signal can be compensated by a periodic variation of the orbit radius with:

  • an amplitude of \(\delta t \sin i\),

  • its maximum when the satellite reaches the southernmost point of its orbit,

  • its minimum when the satellite reaches the northernmost point of its orbit.

Starting from a circular orbit, such a periodic variation of the orbit radius can in fact simply be obtained through a slight “ellipticization”. Let us arbitrarily place the perigee of the starting circular orbit at \(\omega = \pi /2\) (i.e. at its northernmost point) and consider the perturbed orbit with an eccentricity \(\delta e = \delta t \sin i / a\). The radial difference between both orbits is, at first order in \(\delta e\):

$$\begin{aligned} \delta r = - a \delta e \cos \nu = - \delta t \sin i \cos \nu \end{aligned}$$
(50)

It clearly meets the requirements listed above.

In summary, the first-order signature of a \(Z\) geocenter shift can be compensated, in case of circular orbits, by variations of the satellite osculating elements, or equivalently by variations of their initial state vectors. The parameter variation \(-{{\varvec{\gamma }}}_{y,c}\) obtained from our simulation was indeed corresponding to such ellipticizations of both LAGEOS orbits. It is worth mentioning that this problem is not restricted to circular orbits. Appendix H (Online Resource) shows that the first-order signature of a \(Z\) geocenter shift can similarly be compensated by variations of the satellite osculating elements in case of elliptical orbits.

The same experience was repeated with a 7 day data batch. The VIF obtained for the \(Z\) geocenter coordinate was 8.6 (93.988 %), only slightly lower than for the 24 h case. The collinearity of the \(Z\) geocenter coordinate with the satellite osculating elements explained above holds in fact for any data batch and orbital arc lengths. In addition to the uneven distribution of the SLR network, this slight collinearity issue may partly explain why the \(Z\) component of the SLR-derived geocenter motion is of lower quality than its \(X\) and \(Y\) components (see, e.g., Fig. 4 in Altamimi et al. 2011).

7 Collinearity diagnosis of the GNSS geocenter determination

The collinearity diagnosis developed in Sect. 3 is eventually applied in this section to the GNSS geocenter determination problem.

7.1 Collinearity diagnosis of the \(X\) and \(Y\) geocenter coordinates

A series of observations was simulated using the options listed in Table 1, and a design matrix including the parameters listed in Table 2 except satellite z-PCOs was set up. The implicit parameter corresponding to a 1 cm \(X\) geocenter shift was considered and Eq. (44) was solved. The obtained VIF was 2,367.5 (99.979 %). As illustrated from the point of view of a satellite in Fig. 10 and from the point of view of a station in Fig. 11, the proper signature of the \(X\) geocenter coordinate is indeed smaller than its signature by almost two orders of magnitude.

Fig. 10
figure 10

Blue signature of a 1 cm \(X\) geocenter shift on simulated observations made on a GPS-like satellite. Each blue dot represents an element of \({\varvec{S}}_y\). Green corresponding proper signature. Each green dot is an element of \({\varvec{S}}_{y,c}^p\)

Fig. 11
figure 11

Left signature of a 1 cm \(X\) geocenter shift on the observations made by a particular station, represented as a skyplot. Each dot represents an element of \({\varvec{S}}_y\). Right corresponding proper signature. Each dot represents an element of \({\varvec{S}}_{y,c}^p\). Note the different color scales

The independent parameter variations \(-{{\varvec{\gamma }}}_{y,c}\) by which the introduced \(X\) geocenter shift was almost completely absorbed was then examined. As anticipated in Sect. 5, the epoch mean signature of the introduced geocenter shift on the observations of each satellite (magenta line in Fig. 6) was absorbed by the respective satellite clock offsets. As one could have expected, the second-order signatures represented in Fig. 7 were absorbed to a great extent by the station-related parameters (positions, clock offsets, ZWDs and tropospheric gradients). On the other hand, neither the satellite initial state vectors nor the empirical accelerations were substantially affected.

Figure 12 illustrates the part of \(-{{\varvec{\gamma }}}_{y,c}\) corresponding to station coordinate variations, i.e. the distortion of the station network obtained in response to the introduction of the \(X\) geocenter shift. The distortion pattern visible in Fig. 12 resembles the deformation pattern associated with a degree 1, order 1 surface load (fitting a degree 1, order 1 load deformation pattern to the network distortion shown in Fig. 12 leads to a coefficient of determination of 94.5 %). This resemblance explains the significant correlation observed by Blewitt (personal communication, 2012) in the solutions of various IGS ACs between the \(X\) component of their origins and the deformation pattern associated with a degree 1, order 1 surface load.

Fig. 12
figure 12

Station coordinate variations obtained in response to the introduction of a 1 cm \(X\) geocenter shift. The horizontal variations are represented by the black arrows; the vertical variations by the color dots

The same experience was repeated for the \(Y\) coordinate of the geocenter. The obtained VIF was 2,359.8 (99.979 %). The independent parameter variations obtained in response to the introduction of the \(Y\) geocenter shift are similar to those obtained for the \(X\) coordinate of the geocenter. In particular, the obtained network distortion pattern resembles the deformation pattern associated with a degree 1, order \(-\)1 surface load.

Two other simulations were run using a real network of 150 stations and a real constellation of 30 GPS satellites (both extracted from a solution provided by ESA for July 11, 2008). The obtained VIFs were, respectively, 2,016.4 (99.975 %) and 2,270.2 (99.978 %) for the \(X\) and \(Y\) coordinates of the geocenter. Although slightly lower, they are of the same order as the VIFs obtained with perfectly distributed network and constellation.

In conclusion, the \(X\) and \(Y\) coordinates of the geocenter are hardly observable in a standard GNSS analysis. Their epoch mean signatures are indeed indistinguishable from the satellite clock parameters, while their second-order signatures are highly collinear with all station-related parameters. The determination of the \(X\) and \(Y\) geocenter coordinates with GNSS can therefore only rely on tiny third-order signals (a few tenths of millimeters for 1 cm geocenter shifts). Note that the VIFs obtained for the \(X\) and \(Y\) geocenter coordinates are roughly of the same order as the VIF obtained in Sect. 4.3 for the terrestrial scale.

7.2 Collinearity diagnosis of the \(Z\) geocenter coordinate

Using a fictive network of 100 stations and a fictive constellation of 24 satellites, a VIF of 3,147.5 (99.984 %) was obtained for the \(Z\) coordinate of the geocenter. Using a real network of 150 stations and a real constellation of 30 GPS satellites, the obtained VIF was 3,538.4 (99.986 %). The examination of \(-{{\varvec{\gamma }}}_{y,c}\) revealed that both the clock offsets (as for the \(X\) and \(Y\) geocenter coordinates) and the initial state vector of each satellite (as in the SLR case) “combined their strengths” to absorb the main part of the signature of the introduced \(Z\) geocenter shift. As for the \(X\) and \(Y\) coordinates, the remaining second-order signature was largely absorbed by variations of the station-related parameters, leaving a proper signature of only a few tenths of millimeters. The network distortion pattern obtained in response to the introduction of the \(Z\) geocenter shift resembles the deformation pattern associated with a degree 1, order 0 surface load.

As will be shown in Sect. 7.4, the larger VIFs of the \(Z\) geocenter coordinate compared to those of the \(X\) and \(Y\) geocenter coordinates stem from a particular role played by the ECOM parameters. The VIFs obtained for the \(Z\) geocenter coordinate are of the same order as those obtained in Sect. 4.3 for the terrestrial scale. This concretely means that current GNSS should be considered as unable to provide information about the \(Z\) component of geocenter motion as they are to provide terrestrial scale information apart from conventional satellite z-PCOs.

7.3 Influence of the cutoff angle and of the observation weighting

For completeness, the VIFs of the three geocenter coordinates have been computed for different cutoff angles and observation weighting functions. Results are summarized in Table 5. As for the terrestrial scale, using higher cutoff angles or downweighting low-elevation observations has the effect of increasing collinearity issues.

Table 5 VIFs obtained for the three geocenter coordinates with different cutoff angles and observation weighting functions

7.4 Role of the empirical accelerations

Meindl et al. (2013), on the basis of orbital perturbation considerations, concluded that the estimation of the ECOM \(D_0\) parameters (constant accelerations in the Sun-satellite directions) was an important limiting factor in the determination of the \(Z\) geocenter coordinate with GNSS. To investigate the role effectively played by the ECOM parameters in the collinearity of the \(Z\) geocenter coordinate, a series of simulations was run in which different subsets of ECOM parameters were estimated. The results are summarized in Table 6.

Table 6 VIFs obtained for the \(Z\) coordinate of the geocenter when estimating different subsets of ECOM parameters

The purpose here was to assess the individual and combined contributions of the five usually estimated ECOM parameters to the collinearity of the \(Z\) geocenter coordinate. It is clear that, when individually estimated, each of these five parameters, in particular \(D_0\), has no significant impact. When the five parameters are simultaneously estimated, the VIF of the \(Z\) geocenter coordinate increases from about 2,550 (99.980 %) to 3,147.5 (99.984 %). This increase can in fact be quasi-fully explained by the simultaneous estimation of three of the five parameters: \(D_0\), \(B_\mathrm{c}\) and \(B_\mathrm{s}\). It is nevertheless relatively minor (remember that VIFs should be considered as varying on a logarithmic scale). That is why we assert that the ECOM parameters do not play a predominant role in the problem of determining the \(Z\) geocenter coordinate with GNSS.

The results in Table 6 were however obtained with a particular configuration of the orbital planes with respect to the Sun. The \(\beta _\mathrm{s}\) angles (elevations of the Sun above the orbital planes) were in this case ranging from \(-30^\circ \) to \(68^\circ \). To confirm the above conclusion, we ran another series of simulations using a configuration considered by Meindl et al. (2013) as a worst case scenario. A constellation of 24 GLONASS-like satellites (\(a = 25,520\) km, \(e = 0,\, i = 65^\circ \)) on three orbital planes was used. The \(\beta _\mathrm{s}\) angles were respectively \(87^\circ \), \(-15^\circ \) and \(-15^\circ \) for the three orbital planes. Results are summarized in Table 7. The VIF of the \(Z\) geocenter coordinate is this time approximately doubled when estimating the five ECOM parameters. As previously, this collinearity increase can be explained by the simultaneous estimation of only three ECOM parameters: \(D_0\), \(B_\mathrm{c}\) and \(B_\mathrm{s}\). But this increase by a factor 2 in a worst case scenario remains a secondary issue. Also, note that estimating only the \(D_0\) parameters causes a marginal increase of collinearity.

Table 7 VIFs obtained for the \(Z\) coordinate of the geocenter when estimating different subsets of ECOM parameters and using the GLONASS-like constellation described in the text
Table 8 VIFs obtained for the three geocenter coordinates when estimating different subsets of GNSS-specific parameters

A last experiment was carried out to consolidate our view. The parameters estimated in GNSS analyses, but not in SLR analyses were grouped into three categories:

  • the five usually estimated ECOM parameters (“ECOM” in Table 8),

  • tropospheric parameters (ZWDs and tropospheric gradients; “tropo” in Table 8),

  • station and satellite clock offsets (“clocks” in Table 8).

Different simulations were run to assess the individual and combined contributions of these parameter categories to the collinearity of the three geocenter coordinates. The obtained VIFs are summarized in Table 8.

The GNSS parameter category with the largest individual impact is the clock offset category: a hypothetic GNSS that could dispense with the estimation of clock offsets would most certainly allow an accurate determination of geocenter motion. The combination of clock offsets with tropospheric parameters has a devastating effect. It concretely makes the second-order signatures shown in Figs. 5 and 7 quasi-unobservable. In case of the \(Z\) geocenter coordinate, estimating the five ECOM parameters has an additional, but relatively small impact. That is why we consider that the inability of GNSS, as opposed to SLR, to properly sense the \(Z\) component of geocenter motion is mostly due to the simultaneous estimation of clock offsets and tropospheric parameters. For the \(X\) and \(Y\) components of geocenter motion, this inability is quasi-entirely due to the simultaneous estimation of clock offsets and tropospheric parameters. This conclusion contradicts the one drawn by Meindl et al. (2013) according to whom the inability of GNSS, as opposed to SLR, to properly sense the \(Z\) component of geocenter motion stems from the correlation between the \(Z\) geocenter coordinate and the ECOM \(D_0\) parameters.

8 Discussion

This section discusses a number of questions related to the results obtained in Sect. 7:

  • Why did Meindl et al. (2013) come to a different conclusion (Sect. 8.1)?

  • Why are GLONASS-derived geocenter time series of much lower quality than GPS-derived geocenter time series (Sect. 8.2)?

  • Which paths can be considered toward a reliable observation of geocenter motion with GNSS (Sect. 8.3)?

8.1 Comments to Meindl et al. (2013)

The main argument of Meindl et al. (2013) relies on experiments that are similar in essence to our simulations. They compared two series of GNSS solutions obtained with and without constraining the \(Z\) geocenter coordinate, i.e. with and without introducing artificial \(\delta z\) geocenter shifts. And they observed that the variations of the estimated \(D_0\) parameters in response to the introduction of \(\delta z\) geocenter shifts were approximately obeying:

$$\begin{aligned} \delta z = -\frac{\sum \nolimits _{k=1}^P \delta D_0^k \sin \beta _\mathrm{s}^k}{n^2\cos i} \end{aligned}$$
(51)

where

  • \(P\) denotes the number of orbital planes,

  • \(\delta D_0^k\) is the mean variation of the \(D_0\) parameters observed for the satellites of plane \(k\),

  • \(\beta _\mathrm{s}^k\) is the elevation of the Sun above the orbital plane \(k\),

  • \(n\) denotes the mean motion of the satellites.

Meindl et al. (2013) justify Eq. (51) by the fact that a variation \(\delta D_0\) has the effect, among others, of translating the orbit of a satellite by \(\delta D_0 \sin \beta _\mathrm{s} / (n^2\cos i)\) along the \(Z\) axis. From Eq. (51), they conclude that the geocenter differences \(\delta z\) between their two solution series are “explained by the differences of the [\(D_0\)] parameters in both solution series”.

We were however unable to reproduce Meindl et al. (2013)’s result. We introduced for example 1 cm \(Z\) geocenter shifts into two simulated solutions that were only differing by the employed cutoff angles (\(0^\circ \) in the first, \(10^\circ \) in the second). The obtained right-hand sides of Eq. (51) were respectively 2.3 and 5.3 mm, both far from 1 cm, and also very different from each other. Repeating the same experiment with 1 year of real daily solutions provided by ESA led to right-hand sides that were rather constant (mean 4.6 mm, standard deviation 1.0 mm), but again far from 1 cm. We therefore think that Eq. (51) does probably only hold for the particular analysis settings used by Meindl et al. (2013).

In fact, the point where the reasoning of Meindl et al. (2013) fails is that their \(\delta z\) geocenter differences are not explained by the \(\delta D_0\) variations. On the contrary, these \(\delta D_0\) variations are only one particular consequence of the introduction of the \(\delta z\) geocenter shifts. Introducing an artificial \(Z\) geocenter shift into a solution admittedly induces variations of the \(D_0\) parameters. But all other parameter categories are affected as well. With given analysis settings, formulas like Eq. (51) could in fact be derived for any parameter category like, e.g., tropospheric gradients. But each would only give a partial view of the problem.

8.2 Lower quality of GLONASS-derived geocenter time series

The \(Z\) component of the GLONASS-derived geocenter time series studied by Meindl et al. (2013) is clearly of much lower quality than the corresponding GPS-derived time series. It contains in particular a strong artificial periodic signal with an amplitude of about 20 cm and its extrema when the angle \(\beta _\mathrm{s}\) reaches extremal values for one of the three GLONASS orbital planes. Meindl et al. (2013) attribute this lower quality to the facts that GLONASS satellites are spread over only three orbital planes (against 6 for GPS) and that the angle \(\beta _\mathrm{s}\) can reach values up to almost \(90^\circ \) for GLONASS (against about \(80^\circ \) for GPS).

But the lower quality of GLONASS-derived geocenter time series might have a different explanation. According to Meindl (2011), less than 50 % of the phase cycle ambiguities could indeed be fixed to integer values in their GLONASS analysis (against 90 % for GPS). Unfixed ambiguities thus constitute a large set of additional parameters in the GLONASS case, which may have seriously increased the collinearity of the \(Z\) geocenter coordinate. No dedicated simulations were carried out in our study to assess the impact of estimating unfixed ambiguities on the VIF of the \(Z\) geocenter coordinate. But ambiguity parameters (i.e. constants per pass of each satellite over each station) would likely significantly help in masking the second-order geocenter signatures shown in Fig. 5.

Also, note that the impact of fixing ambiguities on the GPS geocenter determination had been studied by Springer (2000). His figure 5.7 (b) shows the impact of fixing ambiguities on the \(Z\) component of the geocenter motion derived from GPS solutions. In the early part of the graph, when GPS ambiguities were not fixed, a strong artificial periodic signal with an amplitude of about 20 cm is visible, like in Meindl et al. (2013)’s GLONASS series.

8.3 Perspectives

The collinearity diagnosis made in Sect. 7 explains why the time series of translations derived from the IGS AC weekly solutions are far from reflecting a realistic geocenter motion. To paraphrase Belsley et al. (1980), the extreme collinearity of the three geocenter coordinates indeed results in a situation in which the systematic influence of geocenter motion on GNSS observations is swamped by the observation and modeling errors.

A different problem is to know what the IGS AC translation time series do actually reflect, i.e. what are the modeling deficiencies which contaminate the GNSS geocenter motion estimates. From the fact that the IGS AC translation time series contain strong spurious signals at harmonics of the GPS draconitic year, and from various experiments showing that changes in orbit modeling have significant impacts on GPS-derived geocenter motion (Springer 2000; Hugentobler and Marel 2006; Rodriguez-Solano et al. 2011), it seems clear that orbit modeling deficiencies are a major source of geocenter contamination. But other modeling aspects of the GNSS observations are also known to have serious impacts on GPS-derived geocenter motion, like the modeling of higher-order ionospheric effects (Petrie et al. 2010; Garcia-Fernandez et al. 2012).

Two paths can consequently be considered in view of a reliable observation of geocenter motion with GNSS: the reduction of collinearity issues and the reduction of modeling deficiencies. With the current state of collinearity, the latter solution would probably require immense progress in the modeling of orbit dynamics as well as in other modeling aspects. Remembering that the determination of a 1 cm geocenter offset relies on a signal of a few tenths of millimetres, it seems unlikely that modeling progresses alone will allow a reliable observation of geocenter motion with GNSS in the near term.

The other alternative is the reduction of collinearity issues, i.e. a reduction of the number of parameters to estimate. Bearing in mind that the most problematic parameter categories are clock offsets and tropospheric parameters, two paths can again be considered. The first one would be an improvement of global meteorological models up to a point where GNSS analyses could dispense with the estimation of tropospheric parameters, which also seems unfeasible in the near term.

The last remaining prospect is the modeling of station and/or satellite clocks by other means than epoch-wise offsets. The modeling of satellite clocks is especially attractive. If the time evolution of satellite clock offsets could be tightly constrained, then the epoch mean signatures of geocenter shifts (magenta lines in Figs. 4 and 6) could indeed become observable. The whole question is then to know how many satellites should carry stable clocks and how stable these clocks should be. This question will be the topic of a future study.

9 Summary

A collinearity diagnosis was developed, based on a generalization of the notion of variance inflation factor. It allows to assess and explain the collinearity of any explicit or implicit parameter in the presence of singularities supplemented by minimal constraints. Such a diagnosis can be practically performed by solving the constrained least-squares system (44): a unit error in the considered parameter is artificially introduced, but the impact of this error on the observations is compensated as much as possible by independent parameter variations.

This collinearity diagnosis was applied in Sect. 6 to the SLR geocenter determination problem. It revealed that the determination of the \(X\) and \(Y\) geocenter coordinates with SLR does not suffer from any collinearity issue. On the other hand, the epoch mean signature of a \(Z\) geocenter shift was shown to be absorbable by variations of the satellite initial state vectors. The \(Z\) geocenter coordinate thus has a VIF of the order of 9 in our simulated SLR analyses. In addition to the uneven distribution of the SLR network, this slight collinearity issue may partly explain why the \(Z\) component of the SLR-derived geocenter motion is of lower quality than its \(X\) and \(Y\) components.

In Sect. 7, the developed collinearity diagnosis was applied to the GNSS geocenter determination problem. A minimalist set of GNSS parameters was considered. With reasonable cutoff angles and observation weighting functions, the VIFs obtained for the \(Z\) geocenter coordinate are at the same level, or larger than those obtained for the terrestrial scale in Sect. 4. Current GNSS should therefore be considered as unable to provide information about the \(Z\) component of geocenter motion as they are to provide terrestrial scale information apart from conventional satellite z-PCOs. The VIFs of the \(X\) and \(Y\) geocenter coordinates are smaller, but of the same order. It can be concluded without much exaggerating that current GNSS are insensitive to any component of geocenter motion.

Among the parameters considered in this study, two specific categories quasi-fully explain the insensivity of GNSS to the \(X\) and \(Y\) geocenter coordinates: epoch-wise clock offsets and tropospheric parameters. These two parameter categories also explain most of the insensivity of GNSS to the \(Z\) geocenter coordinate, which is in addition slightly amplified by the simultaneous estimation of three ECOM parameters: \(D_0\), \(B_\mathrm{c}\) and \(B_\mathrm{s}\). We therefore assert that the inability of GNSS, as opposed to SLR, to properly sense geocenter motion, including its \(Z\) component, is due to the simultaneous estimation of clock offsets and tropospheric parameters.

The possible paths toward a reliable observation of geocenter motion with GNSS were considered in Sect. 8.3. As sufficient progress in any of these paths can hardly be expected soon, the ultimate conclusion of this article is that GNSS will likely not become able to complement SLR for geocenter determination with the network shift approach in a reasonably near future. The IGS might consequently consider abandoning the idea of providing GNSS geocenter estimates in favour of adopting a conventional a priori geocenter motion model.