1 Introduction

In this paper, we present a method for improving regional geoid modeling by including full covariance information from a high-resolution global geopotential model (GGM) in least-squares collocation (LSC). Since its foundation by Krarup (1969), and the key publication by Moritz (1980) LSC is considered as one of the most important methods for local and regional geoid modeling. Although the main concept of LSC has never changed, a few adaptations have been introduced recently. Nowadays, frequently a satellite-only model is used in LSC as background for the long wavelength part of the Earth’s gravity field. LSC thereby benefits from the good model quality in the long wavelength part that is mainly provided by the Gravity Recovery And Climate Experiment (GRACE; Tapley et al. 2004) mission and the Gravity field and steady-state Ocean Circulation Explorer (GOCE; Drinkwater et al. 2003) mission.

Compared to satellite-only GGMs, high-resolution GGMs (we use this term for models with a maximum degree of 719 or higher) have included a wider spectral range of the Earth’s gravity signal and therefore exhibit a higher commission error. For many of these high-resolution GGMs, the associated variance–covariance information is not fully available. As an example, in the case of EGM2008 only grids of geographic error variances of gravity anomalies and geoid undulations are provided (Pavlis et al. 2012). Additionally, EGM2008 and other high-resolution GGMs like EIGEN-6c (Förste et al. 2014) assume constant accuracy for their input ground data. However, with GOCO05c (Fecher et al. 2017) and its successor XGM2016 (Pail et al. 2018) we have two high-resolution GGMs that apply regional varying weighting which results in an improved, more realistic and location-dependent accuracy estimation that enables us to use covariance matrices from these models for LSC. In this paper, we present the first inclusion of high-resolution GGMs with their full covariance matrices in LSC, develop the corresponding methodology and demonstrate its improvement to standard LSC in a numerical closed-loop simulation. The benefit of consistently including covariance matrices from a satellite-only GGM has already been demonstrated in various publications. This was first performed with a full covariance matrix by Haagmans and van Gelderen (1991). Pail et al. (2010) then demonstrated the rigorous inclusion of a GGM and its full accuracy information to LSC in a remove–compute–restore (RCR) approach. However, unlike other methods, our approach is specifically adjusted to work with high-resolution GGMs which leads us to an extended formalism of the LSC problem. Gerlach and Fecher (2012) showed that for GOCE covariance information, the very high computation effort of calculating full covariance matrices can be significantly reduced using approximations such as sparse or block-diagonal covariance matrices without losing much benefit. However, this simplification is not valid in our case since GGMs with regional weighting cause strong correlations among the gravity field parameters (Fecher et al. 2017).

The LSC approach is in practice still a frequently applied method for regional gravity geoid modeling. According to Moritz (1980) and Sansò (1986) the disturbing gravity field \({ T}\) of the Earth can be described as a random field. Also, it is possible to derive a statistical and homogeneous description of \({ T}\) that determines a global average part of the gravity field (Moritz 1980; Tscherning 1999). However, this does not coincide with reality because the correlation length and the covariance change with location. As a result, the use of homogeneous and isotropic covariance functions (e.g., in Moritz 1980) does not always give an optimal result in LSC (Tscherning 1999). Therefore, we adapt the LSC notation from Moritz (1980) in several, closely connected ways: (1) we include consistent treatment of covariance information in an RCR approach, (2) we change the definition of covariance matrices by replacing the total average operator with the mathematical expectation value, and (3) we use only residuals as LSC input which changes the nature of the covariance matrices as they describe only uncertainties instead of signal content. Here, our approach differs from previous studies by Moritz (1980), Pail et al. (2010), Forsberg and Tscherning (1981), Tscherning and Rapp (1974), or Sansò (1986). Only through these adaptations, we are able to use the location-dependent covariance matrices that are derived directly from the normal equation system of a high-resolution GGM. Furthermore, our notation offers the advantage that every input quantity in LSC is directly described by a covariance matrix which is in contrast with, for example, the definition by Haagmans and van Gelderen (1991). We see these adaptations as a necessary step towards using modern high-quality models effectively in LSC. At the present stage, we use the XGM2016 (Pail et al. 2018) and the topographic gravity model dV_ELL_Earth2014 (Rexer et al. 2016) for the RCR approach.

This paper is structured as follows. In Sect. 2 we derive in detail the adaptations to LSC by Moritz (1980). Next, in Sect. 3 we describe the specifications of a synthetic test case scenario in South America as well as the data sets that we use to show the benefit of our method. The results of three numerical test cases are visualized and outlined in Sect. 4, and finally Sect. 5 draws conclusions, describes ways to benefit from the demonstrated approach, and provides an outlook.

2 Theoretical background

2.1 LSC according to Moritz (1980)

This section adopts the content and notation of Moritz (1980). For our purposes we rewrite only those parts that are essential for the next sections. The formulas of the least-squares prediction for random observations \({\mathbf {l}}\) and a random output signal \({\mathbf {s}}\) are given in Moritz (1980, Chapter 9) with

$$\begin{aligned}&{\mathbf {s}}= {\mathbf {C}}_{\mathrm{sl}} \, {\mathbf {C}}_{\mathrm{ll}}^{-1} {\mathbf {l}}, \end{aligned}$$
(1)
$$\begin{aligned}&{\mathbf {E}}_{\mathrm{ss}}= {\mathbf {C}}_{\mathrm{ss}} - {\mathbf {C}}_{\mathrm{sl}} {\mathbf {C}}_{\mathrm{ll}}^{-1} {\mathbf {C}}_{\mathrm{ls}}, \end{aligned}$$
(2)

where \({\mathbf {E}}_{\mathrm{ss}}\) is the error covariance of the output \({\mathbf {s}}\), and \({\mathbf {C}}\) are covariance matrices with the subscripts giving the positions and functionals of the related points. This is valid for all \({\mathbf {l}}\) and \({\mathbf {s}}\) with

$$\begin{aligned} \hbox {E}\{{\mathbf {l}}\} = {\mathbf {0}}, \quad \hbox {E}\{{\mathbf {s}}\} = {\mathbf {0}}, \end{aligned}$$
(3)

where \(\hbox {E}\) describes the expectation operator in form of a mathematical probability distribution. Afterwards, in the section ‘Collocation with random errors’ (Moritz 1980, Chapter 14) the observations \({\mathbf {l}}\) as a functional of the gravitational potential \({ T}\) are redefined as a combination of the true input signal \(\varvec{\mathfrak {t}}\) and a random (stochastic) noise \({\mathbf {n}}\). So that, in symbolic notation we have

$$\begin{aligned} {\mathbf {l}}= \varvec{\mathfrak {t}} + {\mathbf {n}}. \end{aligned}$$
(4)

In analogy to the definition of \(\varvec{\mathfrak {t}}\), \(\varvec{\mathfrak {s}}\) is defined as the true output signal. The gravity field functionals \(\varvec{\mathfrak {t}}\) and \(\varvec{\mathfrak {s}}\) are not random in a mere statistical manner since every evaluation point has a value without uncertainty. However, as it is done by Moritz (1980, Chapter 14), we treat \(\varvec{\mathfrak {t}}\) and \(\varvec{\mathfrak {s}}\) as statistical values in a formal sense. This justifies the usage of the expectation operator E which, however, leaves the signals \(\varvec{\mathfrak {t}}\) and \(\varvec{\mathfrak {s}}\) unaffected

$$\begin{aligned}&\hbox {E}\{\varvec{{\mathfrak {s}}}\} = \varvec{{\mathfrak {s}}}, \nonumber \\&\hbox {E}\{\varvec{{\mathfrak {t}}}\} = \varvec{{\mathfrak {t}}}, \nonumber \\&\hbox {E}\{{\mathbf {n}}\} = {\mathbf {0}}, \nonumber \\&\hbox {E}\{{\mathbf {l}}\} = \hbox {E}\{\varvec{{\mathfrak {t}}}\} + \hbox {E}\{{\mathbf {n}}\} = \varvec{{\mathfrak {t}}}. \end{aligned}$$
(5)

In contrast to the expectation in a probabilistic sense, the operator M describes a homogeneous, isotropic average over the sphere which gives the mean global behavior of the gravitational field. \(\varvec{\mathfrak {t}}\), \(\varvec{{\mathfrak {s}}}\) and \({\mathbf {l}}\) are all functionals of the disturbing potential. Therefore, applying the operator \(\hbox {M}\) to \(\varvec{\mathfrak {t}}\), \(\varvec{{\mathfrak {s}}}\) and \({\mathbf {l}}\) gets zero. Thus, we write for the spatial average \(\hbox {M}\)

$$\begin{aligned} \hbox {M}\{\varvec{{\mathfrak {s}}}\}= & {} {0},\nonumber \\ \hbox {M}\{\varvec{{\mathfrak {t}}}\}= & {} {0},\nonumber \\ \hbox {M}\{{\mathbf {l}}\}= & {} {0},\nonumber \\ \hbox {M}\{{\mathbf {n}}\}= & {} { n}. \end{aligned}$$
(6)

In Eq. 5, we can see that in general the observations \({\mathbf {l}}\) are not centered, but in a global view with the definition of a total average operator \(\bar{\hbox {E}}\)

$$\begin{aligned} \bar{{\hbox {E}}} = \hbox {EM}, \end{aligned}$$
(7)

which is an average over the probability distribution and the global distribution (sphere), they can be considered as centered quantities

$$\begin{aligned} \bar{{\hbox {E}}}\{\varvec{{\mathfrak {s}}}\}= & {} \hbox {EM}\{\varvec{{\mathfrak {s}}}\} = {0},\ \nonumber \\ \bar{{\hbox {E}}}\{\varvec{{\mathfrak {t}}}\}= & {} \hbox {EM}\{\varvec{{\mathfrak {t}}}\} = {0},\ \nonumber \\ \bar{{\hbox {E}}}\{{\mathbf {l}}\}= & {} \hbox {EM}\{\varvec{{\mathfrak {t}}}\} + \hbox {EM}\{{\mathbf {n}}\} = {0}. \end{aligned}$$
(8)

To be able to distinguish between different forms of covariance matrices in the following sections, we write covariance matrices that are derived from the total average \(\bar{{\hbox {E}}}\) with \(\bar{{\mathbf {C}}}\) instead of \({\mathbf {C}}\). Consequently, the covariance matrices of the centered quantities defined by the total average \(\bar{{\hbox {E}}}\) are

$$\begin{aligned} \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {s}}= & {} \bar{{\hbox {E}}}\{ \varvec{{\mathfrak {s}}}\varvec{{\mathfrak {s}}}^{\mathrm{T}} \} = \hbox {EM}\{ \varvec{{\mathfrak {s}}}\varvec{{\mathfrak {s}}}^{\mathrm{T}} \} = \hbox {M}\{ \varvec{{\mathfrak {s}}}\varvec{{\mathfrak {s}}}^{\mathrm{T}} \}, \nonumber \\ \bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}}= & {} \bar{{\hbox {E}}}\{ \varvec{{\mathfrak {t}}}\varvec{{\mathfrak {t}}}^{\mathrm{T}} \} = \hbox {EM}\{ \varvec{{\mathfrak {t}}}\varvec{{\mathfrak {t}}}^{\mathrm{T}} \} = \hbox {M}\{ \varvec{{\mathfrak {t}}}\varvec{{\mathfrak {t}}}^{\mathrm{T}} \}, \nonumber \\ \bar{{\mathbf {C}}}_{{\mathrm{nn}}}= & {} \bar{{\hbox {E}}}\{ {\mathbf {n}}{\mathbf {n}}^{\mathrm{T}} \} = \hbox {EM}\{ {\mathbf {n}}{\mathbf {n}}^{\mathrm{T}} \} = \hbox {E}\{ {\mathbf {n}}{\mathbf {n}}^{\mathrm{T}} \} = {{\mathbf {C}}}_{{\mathrm{nn}}}. \end{aligned}$$
(9)

By considering the fact that \(\varvec{{\mathfrak {t}}}\) and \(\varvec{{\mathfrak {s}}}\) are uncorrelated to \({\mathbf {n}}\), we write by using Eq. 4

$$\begin{aligned} \bar{{\mathbf {C}}}_{\mathrm{ll}}&= \bar{{\hbox {E}}}\{ {\mathbf {l}}{\mathbf {l}}^{\mathrm{T}} \} \nonumber \\&= \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {t}}}+{\mathbf {n}}) (\varvec{{\mathfrak {t}}}+{\mathbf {n}})^{\mathrm{T}} \} \nonumber \\&= \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {t}}}\varvec{{\mathfrak {t}}}^{\mathrm{T}})\} + \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {t}}}{\mathbf {n}}^{\mathrm{T}})\} + \bar{{\hbox {E}}}\{ ({\mathbf {n}}\varvec{{\mathfrak {t}}}^{\mathrm{T}})\} + \bar{{\hbox {E}}}\{ ({\mathbf {n}}{\mathbf {n}}^{\mathrm{T}})\} \nonumber \\&= \bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}} + \underbrace{\bar{{\mathbf {C}}}_{\mathfrak {t}\mathrm{n}}}_{={\mathbf {0}}} + \underbrace{\bar{{\mathbf {C}}}_{\mathrm{n}\mathfrak {t}}}_{={\mathbf {0}}} + {{\mathbf {C}}}_{\mathrm{nn}} \nonumber \\&= \bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}} + {{\mathbf {C}}}_{\mathrm{nn}}, \end{aligned}$$
(10)
$$\begin{aligned} \bar{{\mathbf {C}}}_{\mathfrak {s}\mathrm{l}}&= \bar{{\hbox {E}}}\{ \varvec{{\mathfrak {s}}}{\mathbf {l}}^{\mathrm{T}} \} \nonumber \\&= \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {s}}}) (\varvec{{\mathfrak {t}}}+{\mathbf {n}})^{\mathrm{T}} \} \nonumber \\&= \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {s}}}\varvec{{\mathfrak {t}}}^{\mathrm{T}})\} + \bar{{\hbox {E}}}\{ (\varvec{{\mathfrak {s}}}{\mathbf {n}}^{\mathrm{T}})\} \nonumber \\&= \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {t}} + \underbrace{\bar{{\mathbf {C}}}_{\mathfrak {s}\mathrm{n}}}_{={\mathbf {0}}}. \end{aligned}$$
(11)

Applying these expressions, the fundamental formula for least-squares collocation with noise is obtained from Eq. 1

$$\begin{aligned} \bar{{\mathbf {s}}} = \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {t}} \, (\bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}}+ {{\mathbf {C}}}_{{\mathrm{nn}}}) ^{-1} {\mathbf {l}}. \end{aligned}$$
(12)

For consistency with the following sections and in contrast to Moritz (1980) we retain the bar over all quantities with inclusion of the global average operator \(\bar{{\hbox {E}}}\), and write for the corresponding error covariance matrix \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) according to Eq. 2

$$\begin{aligned} \bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}} = \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {s}} - \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {t}} (\bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}}+ {{\mathbf {C}}}_{{\mathrm{nn}}}) ^{-1} \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {t}}^{\,\mathrm{T}}. \end{aligned}$$
(13)

This description of least-squares collocation by Moritz (1980, Chapter 14) has been established as standard procedure in the literature (see Tscherning 2015; Arabelos and Tscherning 2009; Hofmann-Wellenhof and Moritz 2006; Rieser 2015) and we follow this notation because in our opinion it is the most consistent and detailed description. However, it should be kept in mind that these definitions are only valid for centered observations \({\mathbf {l}}\) and a centered output \({\mathbf {s}}\), because this was set as a requirement (Eq. 3) and is used in the definitions of the covariance matrices (Eqs. 911). Furthermore, the covariance matrices as defined in this section assume a normal distribution of gravitational functionals and describe an average part of the Earth’s surface. Accordingly, these covariance matrices are mainly independent of the location on the Earth (homogeneous) as well as of the direction (isotropic), and as a consequence may not be optimal for local gravity field collocation (Tscherning 1999). Note, that most of the covariance matrices in Moritz (1980) are calculated from signal degree variances, but the discussion in this paragraph holds also for analytical covariance matrices which are calculated by empirical covariance fitting, e.g., from a Tscherning–Rapp model (Tscherning and Rapp 1974). While the latter do not consider a global view, they use the same assumptions, namely centered and normally distributed observations, for a target regional area, therefore resulting in homogeneous and isotropic covariance functions as well.

2.2 The remove–compute–restore (RCR) approach

For many applications, LSC is combined with a remove–compute–restore (RCR) approach (Forsberg and Tscherning 1981; Forsberg 1984). According to the name, RCR means that a part \(\hat{{\mathbf {l}}}\) of the signal is removed from the observations \({\mathbf {l}}\) before the computation

$$\begin{aligned} \Delta {\mathbf {l}}= {\mathbf {l}}- \hat{{\mathbf {l}}}. \end{aligned}$$
(14)

Accordingly, the collocation \(\varGamma \) is performed only with the residuals of the input signal \(\Delta {\mathbf {l}}\)

$$\begin{aligned} \Delta {\mathbf {s}}= \varGamma \,\Delta {\mathbf {l}}, \end{aligned}$$
(15)

and afterwards, the removed part \(\hat{{\mathbf {s}}}\) is restored again

$$\begin{aligned} {\mathbf {s}}= \Delta {\mathbf {s}}+ \hat{{\mathbf {s}}}, \end{aligned}$$
(16)

to yield the output \({\mathbf {s}}\) of the collocation. The main reason for using RCR in collocation is that LSC with residuals is more accurate than it would be with the full signal content. For more detailed background, we refer to Hofmann-Wellenhof and Moritz (2006, Chapter 11) or Rieser (2015, Chapter 4). Usually, the remove and restore steps describe different functionals of the gravity field, and the RCR thereby implicitly includes a field transformation. Moreover, because a field transformation moves signal energy between different degrees in the frequency domain an error in the remove step may not be consistently restored after a field transformation. This might even be the case when remove and restore steps are calculated consistently. Therefore, we see the need to model the accuracy of the remove step consistently in LSC. Only some of the LSC approaches take the accuracy of the reduction model into account (e.g., Haagmans and van Gelderen 1991; Pail et al. 2010; Sansò 2013) while other approaches do not (e.g., Forsberg and Tscherning 1981; Hofmann-Wellenhof and Moritz 2006; Rieser 2015).

In general, the corresponding covariance matrices in all these approaches are calculated either from signal degree variances (Heiskanen and Moritz 1967, Chapter 7.3; Moritz 1980, Chapter 10) or from empirical covariance fitting (Tscherning and Rapp 1974). Those approaches that include the accuracy of a reduction model directly in the calculation, additionally introduce the covariance matrix of a satellite-only model to the method (e.g., Haagmans and van Gelderen (1991)). Since all of these LSC approaches are formulated differently and use various notations, we do not go into further detail but point out only the differences of our approach to the existing ones in the following sections.

2.3 Residual least-squares collocation

Although observations \({\mathbf {l}}\) and output \({\mathbf {s}}\) of a LSC as functionals of the disturbing gravity field \({ T}\) of the Earth are centered globally (Heiskanen and Moritz 1967, Chapter 2.19), it is not possible to automatically assume the same for regional gravity field modeling. However, the definition of LSC according to Sect. 2.1 is only valid if the observations and the output are centered over the target area (Eq. 3). We propose rather to use covariance matrices from a purely stochastic point of view and thereby follow the standard definition of a covariance with an expectation value in a mathematical sense E (Eqs. 1719). As a result, we are no longer limited to the requirements of Eq. 3. The gravity functionals \(\varvec{{\mathfrak {t}}}\) and \(\varvec{{\mathfrak {s}}}\) are furthermore regarded as statistical quantities in a formal sense. As they are defined as true signal content of the gravity field, repeated error-free measurements at one point always give the same result (see Eq. 5). Considering this, we write for the covariance matrices \({\mathbf {C}}_{\mathfrak {s}\mathfrak {s}}\), \({\mathbf {C}}_{\mathfrak {t}\mathfrak {t}}\) and \({\mathbf {C}}_{\mathrm{ll}}\)

$$\begin{aligned} {\mathbf {C}}_{\mathfrak {s}\mathfrak {s}}= & {} \hbox {E}\{ (\varvec{{\mathfrak {s}}}- \underbrace{\hbox {E}\{\varvec{{\mathfrak {s}}}\}}_{=\,\varvec{{\mathfrak {s}}}}) (\varvec{{\mathfrak {s}}}- \underbrace{\hbox {E}\{\varvec{{\mathfrak {s}}}\}}_{=\,\varvec{{\mathfrak {s}}}})^{\mathrm{T}} \} = \{{\mathbf {0}}\}, \end{aligned}$$
(17)
$$\begin{aligned} {\mathbf {C}}_{\mathfrak {t}\mathfrak {t}}= & {} \hbox {E}\{ (\varvec{{\mathfrak {t}}}- \underbrace{\hbox {E}\{\varvec{{\mathfrak {t}}}\}}_{=\,\varvec{{\mathfrak {t}}}}) (\varvec{{\mathfrak {t}}}- \underbrace{\hbox {E}\{\varvec{{\mathfrak {t}}}\}}_{=\,\varvec{{\mathfrak {t}}}})^{\mathrm{T}} \} = \{{\mathbf {0}}\}, \end{aligned}$$
(18)
$$\begin{aligned} {\mathbf {C}}_{\mathrm{ll}}= & {} \hbox {E}\{ ({\mathbf {l}}- \underbrace{\hbox {E}\{{\mathbf {l}}\}}_{=\,\varvec{{\mathfrak {t}}}}) ({\mathbf {l}}- \underbrace{\hbox {E}\{{\mathbf {l}}\}}_{=\,\varvec{{\mathfrak {t}}}})^{\mathrm{T}} \} = \hbox {E}\{ {\mathbf {n}}{\mathbf {n}}^{\mathrm{T}}\} = {\mathbf {C}}_{{\mathrm{nn}}}, \end{aligned}$$
(19)

which differs from the equivalent formulation in Moritz (1980) (Eqs. 910). The covariance of the uncorrelated observations that is named \({\mathbf {C}}_{{\mathrm{nn}}}\) in Moritz (1980) is hereinafter called \({\mathbf {C}}_{\mathrm{ll}}\). We consider this notation as more consistent, because it describes the accuracy of the observations. Note at this point, that \(\varvec{{\mathfrak {t}}}\) and \({\mathbf {l}}\) are not centered (Eq. 5). Now, we assume that we can calculate an unbiased GGM without systematic errors that is able to describe the full signal content of the Earth’s gravity field. From this model we derive different gravity field functionals \(\varvec{{\mathfrak {t}}}\) at the input points. Since this variable is not error-free, we denote the quantity \(\hat{{\mathbf {l}}}\) and write for its expectation value E

$$\begin{aligned} \hbox {E} \{\hat{{\mathbf {l}}} \} = \varvec{{\mathfrak {t}}}. \end{aligned}$$
(20)

We use the hat operator for quantities that are derived from an introduced model in order to distinguish them from measurements and error-free quantities. Since we assumed that \(\hat{{\mathbf {l}}}\) does not contain systematic errors, the difference \(\Delta {\mathbf {l}}\)

$$\begin{aligned} \Delta {\mathbf {l}}= {\mathbf {l}}- \hat{{\mathbf {l}}}, \end{aligned}$$
(21)

is centered and describes a remove step (see Sect. 2.2). By analogy, we write \(\hat{{\mathbf {s}}}\) as the result of the same model as an approximation of the true gravity signal \(\varvec{{\mathfrak {s}}}\) and obtain

$$\begin{aligned} \Delta {\mathbf {s}}= \varvec{{\mathfrak {s}}}- \hat{{\mathbf {s}}}. \end{aligned}$$
(22)

The resulting covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\), \({\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{s}}}}\) of our estimated values \(\hat{{\mathbf {l}}}\) and \(\hat{{\mathbf {s}}}\) are defined by

$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}&= \hbox {E}\{ (\hat{{\mathbf {l}}} - \hbox {E}\{\hat{{\mathbf {l}}}\}) (\hat{{\mathbf {l}}} - \hbox {E}\{\hat{{\mathbf {l}}}\})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\hat{{\mathbf {l}}} - \varvec{{\mathfrak {t}}}) (\hat{{\mathbf {l}}} - \varvec{{\mathfrak {t}}})^{\mathrm{T}} \}, \end{aligned}$$
(23)
$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}&= \hbox {E}\{ (\hat{{\mathbf {s}}} - \hbox {E}\{\hat{{\mathbf {s}}}\}) (\hat{{\mathbf {l}}} - \hbox {E}\{\hat{{\mathbf {l}}}\})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\hat{{\mathbf {s}}} - \varvec{{\mathfrak {s}}}) (\hat{{\mathbf {l}}} - \varvec{{\mathfrak {t}}})^{\mathrm{T}} \}, \end{aligned}$$
(24)
$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{s}}}}&= \hbox {E}\{ (\hat{{\mathbf {s}}} - \hbox {E}\{\hat{{\mathbf {s}}}\}) (\hat{{\mathbf {s}}} - \hbox {E}\{\hat{{\mathbf {s}}}\})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\hat{{\mathbf {s}}} - \varvec{{\mathfrak {s}}}) (\hat{{\mathbf {s}}} - \varvec{{\mathfrak {s}}})^{\mathrm{T}} \}. \end{aligned}$$
(25)

They are used in the following to form the covariance matrices \({\mathbf {C}}_{\Delta \mathrm{l} \Delta \mathrm{l}}\), \({\mathbf {C}}_{\Delta \mathrm{s} \Delta \mathrm{l}}\) and \({\mathbf {C}}_{\Delta \mathrm{s} \Delta \mathrm{s}}\). For the transcription of \({\mathbf {C}}_{\Delta \mathrm{l} \Delta \mathrm{l}}\) we use Eqs. 41921 and 23 and the fact that the random noise \({\mathbf {n}}\) is uncorrelated to a signal.

$$\begin{aligned} {\mathbf {C}}_{\Delta {\mathrm{l}} \Delta {\mathrm{l}}}&= \hbox {E}\{ (\Delta {\mathbf {l}}- \underbrace{\hbox {E}\{\Delta {\mathbf {l}}\}}_{={\mathbf {0}}}) (\Delta {\mathbf {l}}- \underbrace{\hbox {E}\{\Delta {\mathbf {l}}\}}_{={\mathbf {0}}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ \Delta {\mathbf {l}}\Delta {\mathbf {l}}^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ ((\varvec{{\mathfrak {t}}}+{\mathbf {n}})- \hat{{\mathbf {l}}}) ((\varvec{{\mathfrak {t}}}+{\mathbf {n}})- \hat{{\mathbf {l}}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ ((\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}}) + {\mathbf {n}}) ((\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}}) + {\mathbf {n}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}}) (\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}})^{\mathrm{T}} \} + \underbrace{ \hbox {E}\{ (\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}}) {\mathbf {n}}^{\mathrm{T}} \}}_{\,={\mathbf {0}}}\nonumber \\&\quad + \underbrace{ \hbox {E}\{ {\mathbf {n}}(\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}})^{\mathrm{T}} \}}_{\,={\mathbf {0}}} +\, \hbox {E}\{ {\mathbf {n}}{\mathbf {n}}^{\mathrm{T}} \}\nonumber \\&= {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}} + {\mathbf {C}}_{\mathrm{ll}} \end{aligned}$$
(26)

\(\Delta {\mathbf {l}}\) contains the uncertainty of the observations \({\mathbf {l}}\) as well as the uncertainty of observations \(\hat{{\mathbf {l}}}\) synthesized from a GGM. Correspondingly, we can divide the covariance matrix \({\mathbf {C}}_{\Delta {\mathrm{l}} \Delta {\mathrm{l}}}\) into the covariance of the model accuracy \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\), and the covariance \({\mathbf {C}}_{\mathrm{ll}}\) that describes the observation noise according to Eq. 19. Again applying the fact that the random noise \({\mathbf {n}}\) is uncorrelated to \(\varvec{{\mathfrak {s}}}\) and \(\hat{{\mathbf {s}}}\), we write analogously for \({\mathbf {C}}_{\Delta \mathrm{s} \Delta {\mathrm{l}}}\) and \({\mathbf {C}}_{\Delta {\mathrm{s}} \Delta {\mathrm{s}}}\)

$$\begin{aligned} {\mathbf {C}}_{\Delta {\mathrm{s}} \Delta {\mathrm{l}}}&= \hbox {E}\{ (\Delta {\mathbf {s}}- \underbrace{\hbox {E}\{\Delta {\mathbf {s}}\}}_{={\mathbf {0}}}) (\Delta {\mathbf {l}}- \underbrace{\hbox {E}\{\Delta {\mathbf {l}}\}}_{={\mathbf {0}}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ \Delta {\mathbf {s}}\Delta {\mathbf {l}}^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\varvec{{\mathfrak {s}}}- \hat{{\mathbf {s}}}) ((\varvec{{\mathfrak {t}}}+{\mathbf {n}})- \hat{{\mathbf {l}}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\varvec{{\mathfrak {s}}}- \hat{{\mathbf {s}}}) ((\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}}) +{\mathbf {n}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\varvec{{\mathfrak {s}}}-\hat{{\mathbf {s}}}) (\varvec{{\mathfrak {t}}}-\hat{{\mathbf {l}}})^{\mathrm{T}} \} + \underbrace{ \hbox {E}\{ (\varvec{{\mathfrak {s}}}-\hat{{\mathbf {s}}}) {\mathbf {n}}^{\mathrm{T}} \}}_{\,={\mathbf {0}}} \nonumber \\&= {\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}, \end{aligned}$$
(27)
$$\begin{aligned} {\mathbf {C}}_{\Delta {\mathrm{s}} \Delta {\mathrm{s}}}&= \hbox {E}\{ (\Delta {\mathbf {s}}- \underbrace{\hbox {E}\{\Delta {\mathbf {s}}\}}_{\,={\mathbf {0}}}) (\Delta {\mathbf {s}}- \underbrace{\hbox {E}\{\Delta {\mathbf {s}}\}}_{\,={\mathbf {0}}})^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ \Delta {\mathbf {s}}\Delta {\mathbf {s}}^{\mathrm{T}} \} \nonumber \\&= \hbox {E}\{ (\varvec{{\mathfrak {s}}}-\hat{{\mathbf {s}}}) (\varvec{{\mathfrak {s}}}-\hat{{\mathbf {s}}})^{\mathrm{T}} \}\nonumber \\&= {\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{s}}}}, \end{aligned}$$
(28)

with \({\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}\) being the covariance of the introduced model that describes the uncertainties and correlations between the positions and functionals of the input to those of the output. Analogously, the covariance \({\mathbf {C}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{s}}}} \) describes the uncertainties and the correlations of \(\hat{{\mathbf {s}}}\) among different output positions (and functionals).

Thus, we can rewrite the definition of LSC from Eq. 12 with \(\Delta {\mathbf {l}}\) and \(\Delta {\mathbf {s}}\) instead of \({\mathbf {l}}\) and \({\mathbf {s}}\) and use the findings from Eqs. 2126 and 27, resulting in a notation of LSC that uses only residuals as input

$$\begin{aligned} \Delta {\mathbf {s}}&= {\mathbf {C}}_{\Delta {\mathrm{s}} \Delta {\mathrm{l}} } \, ({\mathbf {C}}_{\Delta {\mathrm{l}} \Delta {\mathrm{l}} }) ^{-1} \Delta \mathbf {l} \nonumber \\&= {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \, ({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}) ^{-1} ({\mathbf {l}}- \hat{{\mathbf {l}}} ). \end{aligned}$$
(29)

After restoring the subtracted signal part \(\hat{{\mathbf {s}}}\) we define the residual least-squares collocation (RLSC)

$$\begin{aligned} {\mathbf {s}}&= {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \, ({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}) ^{-1} ({\mathbf {l}}- \hat{{\mathbf {l}}} ) + \hat{{\mathbf {s}}}. \end{aligned}$$
(30)

Instead of centered observations and a centered output which are used as requirements in Sect. 2.1, we introduce the assumption that it is possible to describe the observations \({\mathbf {l}}\) and the output \({\mathbf {s}}\) by means of an unbiased model (\(\hat{{\mathbf {l}}}\) and \(\hat{{\mathbf {s}}}\)). In this way, the input \(({\mathbf {l}}- \hat{{\mathbf {l}}})\) of RLSC stays centered. We have two key factors of RLSC in comparison with other LSC methods: (1) the input consists only of residuals, and (2) the covariance matrices \({\mathbf {C}}_{\mathrm{ll}}\) and \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\) describe the accuracy of the input \({\mathbf {l}}\) and \(\hat{{\mathbf {l}}}\) directly. With these considerations, we differ from previous LSC approaches (e.g., Moritz 1980; Haagmans and van Gelderen 1991; Pail et al. 2010; Sansò 2013; Rieser 2015), which always include signal covariance matrices in LSC, while in our approach the covariance matrices contain only the uncertainties of input and output. For example, Haagmans and van Gelderen (1991) also include a reduction step with full covariance information in LSC, but at the same time they use a third covariance matrix inside the brackets of Eq. 30 that describes the covariance of the gravity signal itself (equal to \(\bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}}\) from Sect. 2.1). The reason for the difference at this point is that we assume the existence of a high-quality GGM, whose uncertainties can be fully described by a variance–covariance matrix (no systematic errors), which other approaches do not.

The interpretation of Eq. 30 is quite different from the definitions in Sect. 2.1 where the covariance matrices in the collocation describe the signal content instead of the error characteristics. Nevertheless, RLSC is consistent with the theory of Moritz (1980), which can be demonstrated by the following thought experiment. If the model \(\hat{{\mathbf {l}}}\) in Eq. 30 becomes worse, then the elements of the covariance matrix \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\) that describe its uncertainties will become larger. This is also valid for an extreme case where we do not subtract a model at all, so that \(\hat{{\mathbf {l}}}\) becomes zero. In this case, the covariance \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\) describes the full signal content of the observations \({\mathbf {l}}\) which is basically the same idea as in Moritz (1980). However, the covariance \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\) is still regarded as the uncertainty of the bad (or missing) model instead of the covariance of the signal content (as it is in Sect. 2.1). Even with \(\hat{{\mathbf {l}}} = {\mathbf {0}}\) there is a difference in the approaches, since in general the expectation value \(\hbox {E}\) is not equal to the total average \(\bar{{\hbox {E}}}\) (Eq. 7). However, it is possible to use the total average \(\bar{{\hbox {E}}}\) as an approximation of the expectation value \(\hbox {E}\), which means that for \(\hat{{\mathbf {l}}} = {\mathbf {0}}\) we can approximate

$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}} \approx \bar{{\mathbf {C}}}_{\mathfrak {t}\mathfrak {t}},\nonumber \\ {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \approx \bar{{\mathbf {C}}}_{\mathfrak {s}\mathfrak {t}}. \end{aligned}$$
(31)

With this approximation our formulation of least-squares collocation with errors (Eq. 30) becomes identical to standard LSC from Moritz (1980) (Eq. 12) including the RCR approach (Eqs. 1416). Accordingly, we write for the error covariance matrix \({\mathbf {E}}_{\mathrm{ss}}\) of the output \({\mathbf {s}}\) in Eq. 30 and refer to Moritz (1980, Chapter 14) for an analogous and detailed derivation

$$\begin{aligned} {\mathbf {E}}_{\mathrm{ss}}&= {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}} - {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \, ({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}) ^{-1} {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,{\mathrm{T}}}. \end{aligned}$$
(32)

Note that, by definition there is no difference between \({\mathbf {E}}\) and \({\mathbf {C}}\) in our approach since both describe error covariance matrices. Nevertheless, we retain this notation because it clarifies that \({\mathbf {E}}_{\mathrm{ss}}\) refers to the covariance of the output \({\mathbf {s}}\) instead of the true gravity signal \(\varvec{{\mathfrak {s}}}\) or the model \(\hat{{\mathbf {s}}}\).

2.4 Inclusion of a GGM and a topographic gravity model into RLSC

To calculate RLSC from Eq. 30 we require estimates of the input \(\hat{{\mathbf {l}}}\) and the output \(\hat{{\mathbf {s}}}\) as well as the related covariance matrices for the input \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}\), the output \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}\) and the combination \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \). All of these can be derived from the normal equation system related to a GGM

$$\begin{aligned} {\mathbf {N}}_{{\mathrm{ff}}}^{{\mathrm{m}}} \hat{{\mathbf {x}}}_{{\mathrm{f}}}^{{\mathrm{m}}} = {\mathbf {q}}_{\,{\mathrm{f}}}^{{\mathrm{m}}}. \end{aligned}$$
(33)

Here, \({\mathbf {N}}_{{\mathrm{ff}}}^{{\mathrm{m}}}\) is the normal equation matrix, \({\mathbf {q}}_{\,{\mathrm{f}}}^{{\mathrm{m}}}\) is the right-hand side, and \(\hat{{\mathbf {x}}}_{{\mathrm{f}}}^{\mathrm{m}}\) are the estimated Spherical Harmonic (SH) coefficients. The superscript ‘m’ refers to quantities that are derived from a GGM. In the following we continue to use superscripts to clarify the origin of covariance matrices and vectors. In contrast, the subscripts are continuously applied to describe the corresponding functionals and their positions with the subscript ‘f’ standing for the frequency domain of the SH coefficients. For more details about the normal equation system describing the SH coefficients we refer to Fecher et al. (2015). Next, the normal equation matrix \(\mathrm{N}_\mathrm{ff}^\mathrm{m}\) is inverted

$$\begin{aligned} {\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}} = ({\mathbf {N}}_{{\mathrm{ff}}}^{\mathrm{m}})^{-1}, \end{aligned}$$
(34)

to obtain the covariance matrix \({\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}}\) of the SH coefficients, which is used to solve the normal equation system and estimate the SH coefficients \(\hat{{\mathbf {x}}}_{{\mathrm{f}}}^{\mathrm{m}}\)

$$\begin{aligned} \hat{{\mathbf {x}}}_{{\mathrm{f}}}^{\mathrm{m}} = {\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}} \, {\mathbf {q}}_{\,\mathrm{f}}^{\mathrm{m}}. \end{aligned}$$
(35)

Afterwards, we write the transformations of the SH coefficients to different functionals and point positions in the space domain with the design matrices \({\mathbf {A}}_{\mathrm{to}}^{\mathrm{from}}\).

$$\begin{aligned} \hat{{\mathbf {l}}}^{\mathrm{m}} = {\mathbf {A}}_{\mathrm{l}}^{\mathrm{f}} \, \hat{{\mathbf {x}}}_{{\mathrm{f}}}^{\mathrm{m}} \nonumber ,\\ \hat{{\mathbf {s}}}^{\mathrm{m}} = {\mathbf {A}}_s^{{\mathrm{f}}} \, \hat{{\mathbf {x}}}_{{\mathrm{f}}}^{\mathrm{m}}, \end{aligned}$$
(36)

and for the calculation of the covariance matrices

$$\begin{aligned}&{\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} = {\mathbf {A}}_\mathrm{l}^{{\mathrm{f}}} \, {\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}} {\mathbf {A}}_{\mathrm{l}}^{{\mathrm{f}} \, {\mathrm{T}}} \nonumber ,\\&{\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} = {\mathbf {A}}_\mathrm{s}^{{\mathrm{f}}} \, {\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}} {\mathbf {A}}_{\mathrm{l}}^{{\mathrm{f}} \, {\mathrm{T}}} \nonumber ,\\&{\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}} = {\mathbf {A}}_\mathrm{s}^{{\mathrm{f}}} \, {\mathbf {C}}_{{\mathrm{ff}}}^{\mathrm{m}} {\mathbf {A}}_{\mathrm{s}}^{{\mathrm{f}} \, {\mathrm{T}}}. \end{aligned}$$
(37)

Here, the covariance \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) describes the uncertainties and correlations of the GGM at the input points, \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\) the covariance at the output points, and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) their cross correlations. The square root of the main diagonal of \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) gives the standard deviations of the GGM parameters at the input points in terms of the respective functional. Note that as the result of a GGM, \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) is a representative of the gravity field up to a certain spherical harmonic degree \({ N}\), but does not consider the full frequency spectrum of \({\mathbf {l}}\). However, we can use \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) from a GGM, because it centers the difference \(\Delta {\mathbf {l}}\) (in the degrees up to \({ N}\))

$$\begin{aligned} \Delta {\mathbf {l}}= {\mathbf {l}}- \hat{{\mathbf {l}}}^{\mathrm{m}}, \end{aligned}$$
(38)

as introduced in Eq. 14. When a GGM is considered, there are two possibilities: either a GGM from a satellite-only solution that describes the gravity field up to degrees around 200–280 (e.g., GOCO05s; Mayer-Gürr et al. 2015) is used, or a GGM with much higher spatial resolution that already contains terrestrial information like EGM2008 (Pavlis et al. 2012), EIGEN-6c4 (Förste et al. 2014) or XGM2016 (Pail et al. 2018). In both cases, the approximation of the true gravity field \(\varvec{{\mathfrak {t}}}\) by \(\hat{{\mathbf {l}}}\) can be improved by additionally considering the information in the frequencies above the maximum degree of the GGM. In general, large parts of the gravity signal beyond a GGM’s resolution are related to the topography (Forsberg and Tscherning 1981; Hirt et al. 2013; Rexer et al. 2016), which is why we use an additional model for the topographic gravity effect above the maximum degree N of the GGM and calculate its effect \(\hat{{\mathbf {l}}}^{\mathrm{t}}\) (with ‘t’ standing for topography) at the input points

$$\begin{aligned} \hat{{\mathbf {l}}}= \hat{{\mathbf {l}}}^{\mathrm{m}} + \hat{{\mathbf {l}}}^{\mathrm{t}}, \end{aligned}$$
(39)

and the output points

$$\begin{aligned} \hat{{\mathbf {s}}}= \hat{{\mathbf {s}}}^{\mathrm{m}} + \hat{{\mathbf {s}}}^{\mathrm{t}}. \end{aligned}$$
(40)

The same requirements that apply for the definition of \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) also apply for \(\hat{{\mathbf {l}}}^{\mathrm{t}}\), so that the resulting \(\Delta {\mathbf {l}}\) is normally distributed and centered (up to the maximum degree of the topographic gravity model)

$$\begin{aligned} \Delta {\mathbf {l}}= {\mathbf {l}}- \hat{{\mathbf {l}}}^{\mathrm{m}} - \hat{{\mathbf {l}}}^{\mathrm{t}}. \end{aligned}$$
(41)

From a practical point of view it is also important that \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) and \(\hat{{\mathbf {l}}}^{\mathrm{t}}\) can be considered as independent of each other so that the variance propagation from Eq. 39 yields

$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}} = {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}, \end{aligned}$$
(42)

with \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) being the covariance of the topographic gravity model that describes the models uncertainties and correlations. This is analogous to the character of \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) for the GGM, and the same is valid for the covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}\)

$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} = {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}},\nonumber \\ {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}} = {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}} + {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{t}}. \end{aligned}$$
(43)

As we use the topographic gravity model only in the degrees above the maximum degree \({ N}\) of the GGM, we regard the two models as uncorrelated and write for their degree \({ n}\)

$$\begin{aligned}&{ n}^{\mathrm{m}} \in \{2, { N}\}, \nonumber \\&{ n}^{\mathrm{t}} \in \{ { N} + 1, { N}_{\mathrm{max}} \}, \end{aligned}$$
(44)

where \({ N}_{\mathrm{max}}\) is the maximum degree of \(\hat{{\mathbf {l}}}^{\mathrm{t}}\). Usually, there is neither a normal equation system available for the degrees above a GGM nor another source for a covariance matrix that describes the accuracy of the topographic gravity model. Therefore, without the possibility of deriving direct accuracy or covariance information for the topographic gravity model, we must use the total average \(\bar{{\hbox {E}}}\) for the derivation of a covariance matrix. The resulting covariance is an approximation for the accuracy and the correlations of the topographic gravity model and is derived under the assumptions mentioned in Sect. 2.1. This covariance can for example be calculated from the residuals \(\Delta {\mathbf {l}}\), because the topography is usually associated with the largest source of uncertainties in \(\Delta {\mathbf {l}}\). The approach of empirical covariance fitting is quite common in regional geoid modeling and is described in Tscherning and Rapp (1974). Note that the resulting covariance matrices are designed to describe a finite dimensional space and therefore disregard the gravity signal above degree \({ N}_{\mathrm{max}}\). Adopting this approach our notation changes as follows, again marking the covariance matrices derived from the total average \(\bar{{\hbox {E}}}\) with a bar, as in Sect. 2.1.

$$\begin{aligned} {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \approx \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \nonumber \\ {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \approx \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \nonumber \\ {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{t}} \approx \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{t}} \end{aligned}$$
(45)

By inserting \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}\), \({\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}\), \(\hat{{\mathbf {l}}}\) and \(\hat{{\mathbf {s}}}\) from the previous equations into Eq. 30, we obtain the final formulation of RLSC including the GGM and the topographic gravity model

$$\begin{aligned} {\mathbf {s}}&= \underbrace{({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} )}_{{\mathrm{Part}} \,1} \, \underbrace{({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1}}_{{\mathrm{Part}} \,2} \underbrace{({\mathbf {l}}- \hat{{\mathbf {l}}}^{\mathrm{m}} - \hat{{\mathbf {l}}}^{\mathrm{t}})}_{{\mathrm{Part}} \,3} + \underbrace{\hat{{\mathbf {s}}}^{\mathrm{m}} + \hat{{\mathbf {s}}}^{\mathrm{t}}}_{{\mathrm{Part}} \,4}. \end{aligned}$$
(46)

Since Eq. 46 is very important in the following sections, we look at its various parts in more detail. Part 3 contains the residual input to RLSC subject to the condition that it is centered. Here, the remove step is performed by reducing \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) and \(\hat{{\mathbf {l}}}^{\mathrm{t}}\) from the observations \({\mathbf {l}}\). Part 2 describes the uncertainties of part 3 accordingly. Every quantity from the input (\({\mathbf {l}}\), \(\hat{{\mathbf {l}}}^{\mathrm{m}}\), \(\hat{{\mathbf {l}}}^{\mathrm{t}}\)) has its own covariance matrix (\({\mathbf {C}}_{\mathrm{ll}}\), \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\), \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\)). Similarly, we have the covariance matrices between input and output uncertainties in part 1, except that the covariance between the accuracy of the observations and the output \({\mathbf {s}}\) is missing because the two are assumed to be uncorrelated (Eq. 27). Finally, part 4 describes the restore step in the output functional (\(\hat{{\mathbf {s}}}^{\,\mathrm{m}}\), \(\hat{{\mathbf {s}}}^{\mathrm{t}}\)).

The approach taken from Eq. 46 uses the full covariance information of a GGM up to a certain degree \({ N}\), and topographic information above, from degree \({ N}+1\) up to degree \({ N}_{\mathrm{max}}\). The approach therefore delivers several advantages compared to the approach of Moritz (1980):

  1. 1.

    The observations \({\mathbf {l}}\) themselves do not have to be centered. Instead, we use the condition that GGM and topographic gravity model have the same offset as the observations \({\mathbf {l}}\) so that \({\mathbf {l}}- \hat{{\mathbf {l}}}^{\mathrm{m}} - \hat{{\mathbf {l}}}^{\mathrm{t}}\) is centered.

  2. 2.

    The covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) are neither homogeneous nor isotropic but fit perfectly to a target area and can benefit from the continuously improving quality of (high-resolution) GGMs.

  3. 3.

    This approach uses only residuals as input for LSC, because it can be shown that this yields the best results (see Sect. 2.2).

  4. 4.

    RLSC can give a realistic formal error estimate, because with the inclusion of the GGM uncertainties all the stochastic information is included in the resulting error covariance matrix \({\mathbf {E}}_{\mathrm{ss}}\), which is calculated (analogously to Eq. 46) by inserting \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\), \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\) into Eq. 32

$$\begin{aligned} {\mathbf {E}}_{\mathrm{ss}}&= {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}} - {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}} \, ( {\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}}\hat{{\mathrm{l}}}}) ^{-1} {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,{\mathrm{T}}} \nonumber \\&= ({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{t}} ) - ({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}} ) \nonumber \\&\quad \times ({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1} ({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}} )^{\,{\mathrm{T}}}. \end{aligned}$$
(47)

In comparison with the existing literature that has already included accuracy information from a satellite-only model (e.g., Haagmans and van Gelderen 1991; Pail et al. 2010; Sansò 2013) our approach still maintains the advantages 1 and 2. Point 4 is only advantageous in comparison with approaches that do not include all stochastic parts in LSC (e.g., Forsberg and Tscherning 1981; Hofmann-Wellenhof and Moritz 2006; Rieser 2015).

3 Data and simulation concept

We show the benefit of including full covariance matrices from a high-resolution GGM in collocation by a comparison between RLSC (Eq. 46) and a comparable approach without GGM covariance matrices, which we refer to as standard LSC. We derive standard LSC

$$\begin{aligned} \bar{{\mathbf {s}}} = \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}} \, ({{\mathbf {C}}}_{\mathrm{ll}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1} ({\mathbf {l}}- \hat{{\mathbf {l}}}^{\mathrm{m}} - \hat{{\mathbf {l}}}^{\mathrm{t}}) + \hat{{\mathbf {s}}}^{\,\mathrm{m}} + \hat{{\mathbf {s}}}^{\mathrm{t}}, \end{aligned}$$
(48)

from the approach by Moritz (1980) (Eq. 12), but include an RCR approach for the GGM and the topographic gravity model that is analogous to Eq. 46. Standard LSC is thereby equivalent to RLSC (Eq. 46) except that the two covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\) are missing, because the GGM is either assumed to be error-free or its noise component is implicitly included in the model covariance function, which is used to fit the empirical covariance function (Tscherning and Rapp 1974). In case of an error-free GGM the corresponding error covariance matrix \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) results in accuracy estimates of \(\bar{{\mathbf {s}}}\) that are overly optimistic

$$\begin{aligned} \bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}&= \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{t}} - \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \, ({\mathbf {C}}_{\mathrm{ll}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1} \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,t \,{\mathrm{T}}}. \end{aligned}$$
(49)

The comparison of the two methods works best within a synthetic test case scenario that allows us to compute the residuals of the two methods by knowing the pre-defined truth. Moreover, it makes it possible to evaluate and assess the formal error estimate of the stochastic part of the collocation (compare RLSC: Eq. 47 and standard LSC: Eq. 49). To obtain realistic results from a synthetic test case, we add noise to the observations and to the GGM (as shown below).

Fig. 1
figure 1

LSC input points and pre-defined truth of the test scenario with borders as white lines. a Terrain heights with the distribution of the input points with altimetry observations (red dots) and terrestrial measurements (black dots). b Regular \(5^{\prime }\) resolution grid of geoid heights that are the pre-defined truth \(\varvec{{\mathfrak {s}}}\) for the synthetic test case

The synthetic test case is calculated for one of the most common challenges of regional geoid modeling, i.e., the calculation of the geoid from gravity anomalies given at selected input points. As the calculation should be an evaluation test for a real geoid computation, we use the input positions of actual gravity observations for our synthetic test case. Output is a regular grid of geoid heights in the target area, the South American Andes. This is a useful test area for the study since the Andes are one of the most demanding regions worldwide in terms of gravity signal variations, heterogeneous data distribution, and topographic effects. Also, we have access to the actual terrestrial database for this area (Hosse et al. 2014) and can therefore realistically simulate the data distribution on land.

The terrestrial gravity observation points are inhomogeneously distributed in an area between longitude \([{-}\,72^{\circ }\) to \({-}\,66^{\circ }\)] and latitude [\({-}\,29^{\circ }\) to \({-}\,18^{\circ }\)] that is the northern part of Chile (CHL), north-western Argentina (ARG) and south-western Bolivia (BOL, Fig. 1a). Furthermore, we add altimetry observations to cover the ocean region of the test area. The altimetry observations are given as a regular grid with \(5^{\prime }\) spacing in the Pacific Ocean (PAC) bordered by the \({-}\,74^{\circ }\) longitude meridian and the \({-}\,31^{\circ }\) parallel. Altimetric gravity data on a regular grid are available from various ocean products, e.g., DTU13 (Andersen et al. 2015) or Sandwell and Smith (2009) and can be found for download at the corresponding websites. The distribution of the input points in Fig. 1a is displayed together with the terrain height. The image shows the Pacific Ocean to the West and the main ridge of the Andes from north to south. The test case includes areas with quite dense terrestrial observations, while others show large data gaps. Overall, there are 14,613 gravity anomaly input points with 7814 from terrestrial observations and 6799 altimetry grid points. As output, geoid heights \({\mathbf {N}}\) are estimated for the whole study area as a regular \(5^{\prime }\) grid, which results in 21,901 output points (Fig. 1b).

As a GGM we use the XGM2016 (Pail et al. 2018), because we have its full normal equation system available that allows the calculation of the covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\), \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\mathrm{m}}\) (Eq. 37), as well as input and output functionals (\(\hat{{\mathbf {l}}}^{\mathrm{m}}\), \(\hat{{\mathbf {s}}}^{\mathrm{m}}\)) in Eq. 36. The XGM2016 is a combined gravity field model up to degree 719 which uses relative regional weighting for the combination of terrestrial and satellite information. Compared to the GOCO05c model (Fecher et al. 2017), the XGM2016 includes an improved terrestrial data set provided by the National Geospatial-Intelligence Agency (NGA). In our opinion the XGM2016 is one of the most consistently calculated high-resolution GGMs, and its good performance in comparison with other high-resolution gravity field models such as EGM2008 (Pavlis et al. 2012), EIGEN-6c4 (Förste et al. 2014) and GOCO05c (Fecher et al. 2017) is demonstrated in Pail et al. (2018). In a post-analysis of the variance–covariance matrix of XGM2016, however, we found out that the accuracy estimates in the higher degrees (beyond the resolution of the satellite data) underestimate the true noise behavior. The regionally varying weights of XGM2016 have been computed empirically from comparison of satellite and low-pass filtered ground data (Pail et al. 2018). This procedure ensures an adequate relative weighting of satellite and ground data, but implicitly disregards signal content in the high degrees. In a new version of XGM2016, which shall be released soon, we will take this omission error into account by means of scaling the error estimates by their high-frequency signal content. From this analysis, we consider a factor of 3 as a reasonable value to calibrate the error estimates in the area of our synthetic test case. Therefore, we scale the XGM2016 accuracy estimates in Sects. 4.1 and 4.2 by a factor of 3 and hence must increase the elements of the covariance matrices \({\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\), \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\mathrm{m}}\) and \({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\mathrm{m}}\) by a factor of 9. However, in Sect. 4.3 we present results with the original XGM2016 accuracy and thus investigate the impact of the accuracy of the GGM.

Table 1 Overview of all quantities, their definition and source in our synthetic test case scenario

For the definition of the true gravity signal, we choose a XGM2016 and EIGEN-6c4 combination model [‘GOCE-OGMOC’, Gruber and Willberg (2019)] up to degree 2190 and calculate \(\varvec{{\mathfrak {s}}}\) and \(\varvec{{\mathfrak {t}}}\) from it. The main reason for using this model is that we can be sure it does not contain systematic errors relative to the XGM2016 model because the long wavelength parts are identical. The geoid heights \(\varvec{{\mathfrak {s}}}\) that we use as pre-defined truth, and to which we subsequently compare our results, are presented in Fig. 1b. In the target area, the geoid heights vary from 10 m in the Atacama Trench up to almost 50 m in the plateau of Bolivia. The true gravity anomalies \(\varvec{{\mathfrak {t}}}\) are used to calculate the simulated gravity observations \({\mathbf {l}}\) by adding white noise \({\mathbf {n}}\) which is calculated from a random noise vector \({\mathbf {e}}^{\mathrm{l}}\) and the accuracy of the observations \({\sigma }^{\mathrm{l}}\)

$$\begin{aligned} {{\mathbf {l}}} = \varvec{{\mathfrak {t}}}+ {\mathbf {n}}= \varvec{{\mathfrak {t}}}+ {\sigma }^{\mathrm{l}} {\mathbf {e}}^{\mathrm{l}}. \end{aligned}$$
(50)

\({\sigma }^{\mathrm{l}}\) is also used in the covariance matrix \({\mathbf {C}}_{\mathrm{ll}}\) of the observations where we assume the accuracies of different observations \({\mathbf {l}}\) to be uncorrelated and therefore obtain the diagonal matrix \({\mathbf {C}}_{\mathrm{ll}}\)

$$\begin{aligned} {\mathbf {C}}_{\mathrm{ll}} = {\sigma }^{\mathrm{l}} {\mathbf {I}}\,. \end{aligned}$$
(51)

We must consider this as a simplification because we assume that we know the accuracy \({\sigma }^{\mathrm{l}}\) of the input observations which is not guaranteed in reality.

For \(\hat{{\mathbf {l}}}^{\mathrm{t}}\) and \(\hat{{\mathbf {s}}}^{\mathrm{t}}\) we use a spherical harmonic synthesis from the topographic gravity model dV_ELL_Earth2014 (Rexer et al. 2016) from degree \({ N}+1\) to \({ N}_{\mathrm{max}}\). Note that it is not necessary for \(\hat{{\mathbf {l}}}^{\mathrm{t}}\) (or \(\hat{{\mathbf {s}}}^{\mathrm{t}}\)) to contain all remaining gravity signals above the degree \({ N}\) of the GGM provided it is without systematic error and independent of \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) (conditions in the definition of Sect. 2.4). An overview of all quantities of the synthetic test case can be found in Table 1.

Finally, we add noise to the GGM. Since the XGM2016 model uses regional varying weighting, colored noise that actually describes the regionally varying accuracy of the model is added in terms of a (random) realization based on the variance–covariance information. For the calculation of this covariance, we combine the observations \({\mathbf {l}}\) and the output \({\mathbf {s}}\) in one vector \(\varvec{\kappa }\)

$$\begin{aligned} {\varvec{{\kappa }}} = \begin{pmatrix} {\mathbf {l}}\\ {\mathbf {s}}\end{pmatrix}, \end{aligned}$$
(52)

and propagate the variance–covariance matrix of the GGM \({\mathbf {C}}_{\mathrm{ff}}^{\mathrm{m}}\) to the combined covariance matrix \({\mathbf {C}}_{\hat{\kappa }\hat{\kappa }}^{\,\mathrm{m}}\) of input and output by attaching the three formulas of Eq. 37 to give

$$\begin{aligned} {\mathbf {C}}_{\hat{\kappa } \hat{\kappa }}^{\,\mathrm{m}} = {\mathbf {A}}_{\kappa }^{\mathrm{f}} \, {\mathbf {C}}_{\mathrm{ff}}^{\mathrm{m}} {\mathbf {A}}_{\kappa }^{\mathrm{f} \, {\mathrm{T}}}, \end{aligned}$$
(53)

where \({\mathbf {A}}_{\kappa }^{\mathrm{f}}\) is the design matrix for the transformation from SH coefficients to the input and output points \(\varvec{\kappa }\). We obtain the colored noise \(\varvec{\kappa }^{\mathrm{n}}\) of input and output by multiplying a random vector \({\mathbf {e}}^{\mathrm{m}}\) with white noise characteristics and the Cholesky decomposition of the combined covariance matrix \({\mathbf {C}}_{\hat{\kappa } \hat{\kappa }}^{\,\mathrm{m}}\)

$$\begin{aligned} \varvec{\kappa }^{\mathrm{n}} = \hbox {chol}({{\mathbf {C}}_{\hat{\kappa } \hat{\kappa }}^{\,\mathrm{m}} }) \, {\mathbf {e}}^{\mathrm{m}}= \hbox {chol}{ \begin{pmatrix} {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}&{}{\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm {m}\,{\mathrm{T}}}\\ {\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}&{}{\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\\ \end{pmatrix} }\, {\mathbf {e}}^{\mathrm{m}}, \end{aligned}$$
(54)

which is defined by

Fig. 2
figure 2

LSC remove step for the gravity anomalies: a terrestrial observations with \(\sigma _{\mathrm{n}} = 1\) mGal, b observations after reduction of XGM2016 model, c LSC input \(\varvec{\Delta } {\mathbf{l}}\) after reducing XGM2016 and topographic gravity

$$\begin{aligned} {\mathbf {X}}= \hbox {chol}({\mathbf {X}}) \, \hbox {chol}({\mathbf {X}}) ^{\mathrm{T}}, \end{aligned}$$
(55)

where \({\mathbf {X}}\) is a positive definite matrix (e.g., covariance matrix). Consequently, we obtain the noise of the GGM for the input \({\mathbf {l}}^{\mathrm{n}}\) and the output \({\mathbf {s}}^{\mathrm{n}}\) analogously to the definition in Eq. 52 by

$$\begin{aligned} \varvec{\kappa }^{\mathrm{n}} = \begin{pmatrix} {\mathbf {l}}^{\mathrm{n}}\\ {\mathbf {s}}^{\mathrm{n}} \end{pmatrix}, \end{aligned}$$
(56)

and add them correspondingly to \(\hat{{\mathbf {l}}}^{\mathrm{m}}\) and \(\hat{{\mathbf {s}}}^{\mathrm{m}}\). Thus, for a synthetic test case we adjust the formula of RLSC (Eq. 46) with the inclusion of the observation noise \({\mathbf {n}}\) (Eq. 50) and the noise of the XGM2016 model \({\mathbf {l}}^{\mathrm{n}}\) and \({\mathbf {s}}^{\mathrm{n}}\) (Eqs. 5456)

$$\begin{aligned} {\mathbf {s}}&= ({\mathbf {C}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} ) \, ({\mathbf {C}}_{\mathrm{ll}} + {\mathbf {C}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1} ((\varvec{{\mathfrak {t}}}+ {\mathbf {n}}) - (\hat{{\mathbf {l}}}^{\mathrm{m}} + {\mathbf {l}}^{\mathrm{n}}) - \hat{{\mathbf {l}}}^{\mathrm{t}}) \nonumber \\&\quad + (\hat{{\mathbf {s}}}^{\mathrm{m}} + {\mathbf {s}}^{\mathrm{n}}) + \hat{{\mathbf {s}}}^{\mathrm{t}}, \end{aligned}$$
(57)

and analogously, for standard LSC from Eq. 48

$$\begin{aligned} \bar{{\mathbf {s}}}&= \bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}} \, ({\mathbf {C}}_{\mathrm{ll}} + \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) ^{-1} ((\varvec{{\mathfrak {t}}}+ {\mathbf {n}}) - (\hat{{\mathbf {l}}}^{\mathrm{m}} + {\mathbf {l}}^{\mathrm{n}}) - \hat{{\mathbf {l}}}^{\mathrm{t}}) + (\hat{{\mathbf {s}}}^{\mathrm{m}} + {\mathbf {s}}^{\mathrm{n}}) + \hat{{\mathbf {s}}}^{\mathrm{t}}. \end{aligned}$$
(58)

In both cases, the stochastic part of the collocation does not change, so that the calculation of the error covariance matrices \({{\mathbf {E}}}_{\mathrm{ss}}\) (Eq. 47) and \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) (Eq. 49) remain unchanged in the synthetic test case.

The effect of the remove step in Eqs. 57 and 58 is presented in Fig. 2, where the observations always include a white noise of \({\sigma }^{\mathrm{l}}\) = 1 mGal (see Eq. 50). The XGM2016 model up to degree \({ N}=719\) contains the colored noise that is calculated from the Cholesky decomposition (Eq. 54), and the topographic gravity reduction is used from \({ N}+1\) to 2190. Figure 2a shows the original, noisy observations \({\mathbf {l}}\). We reduce the observations \({\mathbf {l}}\) first by the high-resolution XGM2016 model \((\hat{{\mathbf {l}}}^{\mathrm{m}} + {\mathbf {l}}^{\mathrm{n}})\) (Fig. 2b) and afterwards also by the gravity signal that is related to the topography, resulting in the residual input vector \({\mathbf {l}}- (\hat{{\mathbf {l}}}^{\mathrm{m}} + {\mathbf {l}}^{\mathrm{n}}) - \hat{{\mathbf {l}}}^{\mathrm{t}}\) (Fig. 2c). Note that the color scale decreases significantly between Fig. 2a and Fig. 2c. This is also apparent in Table 2, which shows the mean value and the standard deviation of the corresponding data sets in Fig. 2. Regarding its standard deviation (SD), the signal is reduced by about 80% by subtracting the XGM2016 model and more than 90% in combination with the topography. With a mean value of only 0.2 mGal the condition of a centered LSC input is almost fulfilled in this case. In our numerical simulation, the final LSC input contains the noise of the observations and the GGM as well as the inaccuracy of the topographic gravity model. These three effects are modeled in the related error covariance matrices of our approach (Eq. 57). The inaccuracy of the topographic gravity model thereby also includes density anomalies in the spectral range from \({ N}+1\) to \({ N}_{\mathrm{max}}\), which are contained in the residual observations. These anomalies are not adequately represented in the topographic gravity model, because it assumes a constant topographic density (Rexer 2017, Chapter 3.2).

Table 2 Consistent numerical analysis of the LSC remove step in Fig. 2
Fig. 3
figure 3

Empirical covariance function that is calculated from the LSC residuals in the target area (red) and the scaled covariance from signal degree variances (blue) as it is used for LSC. Both covariance functions are shown dependent on the spherical distance between two points

Currently we do not have accuracy information for the topographic gravity model (see Sect. 2.4) and this kind of information is not available for any of the topographic gravity models (Rexer et al. 2016; Grombein et al. 2016). However, because we assume the largest part of the LSC input results from differences between the topographic gravity model and the observations in the high frequencies (as they contain also the effect of density anomalies), we calculate a Model Covariance Function (MCF) that fits the LSC input. This is considered to be a standard approach in LSC which is used for the calculation with signal degree variances (Moritz 1980, Chapter 12) as well as for empirical covariance fitting with the most common approach by Tscherning and Rapp (1974). The result is an isotropic and homogeneous covariance matrix for the spectral range above degree 719 that fits to our input residuals. In general, we calculate the covariance matrices \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) and \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) from signal degree variances and refer to Moritz (1980) for more details. However, global signal degree variance models describe an average Earth, but the South American Andes are not an average area in terms of gravity signal (due to large topographic masses). Thus, we have to use a scale factor to fit model degree variances (blue, Fig. 3) to the Empirical Covariance Function (ECF) of the LSC input residuals (red). In Fig. 3, we calculate signal degree variances, according to our data simulation, from the difference between the GOCE-OGMOC and the dV_ELL_Earth2014 model from degree 720 to 2190 and scale them by a factor of 4.25. Figure 3 then shows the ECF and MCF in dependence of the spherical distance between two points. In this case the correlation length amounts to only \(0.12^{\circ }\).

4 Results of the synthetic test scenario

Section 3 describes the method, the parameters and the sources that are used to evaluate RLSC (Eq. 57) in comparison with standard LSC (Eq. 58). Both methods use the RCR concept where a GGM and the topographic gravity are removed before the collocation, and their effects are restored (to the output functional) afterwards. For the calculation we set the degree \({ N}\) of the GGM first and use the topographic gravity model always from degree \({ N}+1\) to \({ N}_{\mathrm{max}}=2190\). The results of \({\mathbf {s}}\) (or \(\bar{{\mathbf {s}}}\)) are compared with the assumed truth \(\varvec{{\mathfrak {s}}}\), and the difference indicates the accuracy of the collocation result under these pre-defined conditions. In terms of numerical classification we define the Root Mean Square (RMS) over a target area with \({ n}^{\mathrm{points}}\) points as a quality criterion for the LSC result \({\mathbf {s}}\) (or \(\bar{{\mathbf {s}}}\))

$$\begin{aligned} \hbox {RMS} = \root \of {\frac{\sum _{\mathrm{points}}{({\mathbf {s}}-\varvec{{\mathfrak {s}}})^2}}{{ n^{\mathrm{points}} } }}. \end{aligned}$$
(59)

We use an equivalent formula to characterize the Mean estimated standard deviation (MSD), which describes the stochastic accuracies and is derived from \({{\mathbf {E}}}_{\mathrm{ss}}\) (or \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\)) with diag(\({\mathbf {X}}\)) giving the main diagonal of a square matrix \({\mathbf {X}}\)

$$\begin{aligned} \hbox {MSD} = \root \of {\frac{\sum _{\mathrm{points}}{\mathrm{diag}({{\mathbf {E}}}_{\mathrm{ss}})}}{{ n^{\mathrm{points}} } }}. \end{aligned}$$
(60)
Fig. 4
figure 4

Case A: Geoid height residuals with the XGM2016 model up to degree 200. Presented is a the absolute difference between the pre-defined truth \(\varvec{{\mathfrak {s}}}\) and the RLSC result \({\mathbf {s}}\), b the difference between the result of standard LSC \(\bar{{\mathbf {s}}}\) and RLSC \({\mathbf {s}}\)

4.1 Simulation with satellite-only model resolution

At first we compare LSC for calculations with the XGM2016 model but use the model only up to degree 200 (case A). Consequently, RLSC resembles other approaches in which covariance functions from satellite-only global gravity fields are included to LSC (e.g., Pail et al. 2010; Sansò 2013). However, the actual computation of the covariance matrices in our case is still different (Eq. 37). Standard LSC in case A is similar to LSC approaches that do not account for the accuracy of the GGM (e.g., Rieser 2015; Hofmann-Wellenhof and Moritz 2006). The main reason why it is so common to use a GGM up to degree of 180–250 in LSC is the high accuracy of GGMs in this frequency range (Gruber et al. 2011), which is mainly due to the GOCE mission (Drinkwater et al. 2003). Thus, in the first case we use the XGM2016 model with \({ n}^{\mathrm{m}}\in ~\{2,200\}\) for all quantities with the superscript ‘m’, and \({ n}^{\mathrm{t}}\in \{201,2190\}\) accordingly. The accuracy of the input observations \({\sigma }_n\) is assumed to be 1 mGal. For the calculations in Sects. 4.1 and 4.2, the accuracy of the XGM2016 model is multiplied by a factor of 3 (see Sect. 3). It is added as described in Sect. 3 but limited to degree 200 in case A. An overview of all quantities is given in Table 1.

The collocation result \({{\mathbf {s}}}\) of RLSC from Eq. 57 in case A is presented as an absolute geoid height difference (residuals) to the pre-defined truth \(\varvec{{\mathfrak {s}}}\) in Fig. 4a. In Fig. 4 and the following images, we separate a region with terrestrial data coverage (R1, left side) from an area without ground data (R2, right side) by a white line, because we see large differences between these two regions. For the interpretation we focus mainly on the region R1. In Fig. 4a, we see a high correlation between the LSC residuals and the positions of the input observations (see Fig. 1a): in the Pacific Ocean and the bulge in the north we have dense observations and therefore only small residuals mainly below 1 cm. In region R2, the residuals amount to more than 10 cm which is also the case for large parts of R1 in the Argentinian area (southeast). We conclude that the number of terrestrial observations in this area is not dense enough to describe the geoid height signals above degree 200. The RMS (Eq. 59) of the region R1 accounts for 6.0 cm, and is obviously significantly larger in R2.

Table 3 Overview of the three synthetic test cases and the numerical results for region R1
Fig. 5
figure 5

Case A: Comparison of the extracted variances that are used in RLSC. a Variances extracted from the covariance matrix \( \bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) of the topographic gravity model which depends only on latitude and height. b Regional varying variance elements from the covariance matrix \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) of the XGM2016 model (to degree 200)

In case A, the RLSC results \({{\mathbf {s}}}\) (Fig. 4a) and standard LSC \(\bar{{\mathbf {s}}}\) (Eq. 58) show almost the same RMS (Table 3) and only small differences among each other. The difference \(\bar{{\mathbf {s}}}\) minus \({{\mathbf {s}}}\) in Fig. 4b shows maximum values of about \({\pm }\,5.0\) cm, but these values occur to a large extent in areas without observations. In areas with dense gravity observations (e.g., Pacific Ocean) the differences between the two methods are of the order of just a few millimeters. The reason for the differences being so small is the fact that the extracted variances from \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) of the GGM are insignificantly small compared to the corresponding variances of the topographic gravity in \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) and \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}}\). Therefore, \(\bar{{\mathbf {C}}}_{\mathrm{ll}}^{\,\mathrm{m}}\) and \(\bar{{\mathbf {C}}}_{{\mathrm{sl}}}^{\,\mathrm{m}}\) from RLSC do not have an essential effect on the summation in Eq. 57.

$$\begin{aligned} \hbox {diag}({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}})<< \hbox {diag}(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}) \nonumber \\ \hbox {diag}({{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{m}})<< \hbox {diag}(\bar{{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{t}}) \end{aligned}$$
(61)

This is demonstrated in Fig. 5 which shows the variances that are extracted from \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) (Fig. 5a) and \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) (Fig. 5b). The variances from the topographic gravity in \(\bar{{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{t}}\) are in the order of 1000 times larger than the variances of the GGM in \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\). We can conclude that in case A the additional benefit of adding the covariance matrices \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}}\hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) to LSC is small. This is also the explanation why LSC approaches often ignore the accuracy of a satellite-only resolution GGM completely (e.g., Hofmann-Wellenhof and Moritz 2006; Rieser 2015).

Fig. 6
figure 6

Case B: Geoid height residuals with the XGM2016 model that is the absolute difference between the pre-defined truth \(\varvec{{\mathfrak {s}}}\) and a the RLSC result \(\mathbf{s}\), b the standard LSC result \(\bar{{\mathbf {s}}}\)

4.2 Simulation with full GGM resolution

However, this changes in case B where we use the full degree \({ N}= 719\) of the XGM2016 model for reduction \(({ n}^{\mathrm{m}}\in ~\{2,719\})\) and thus apply the topographic gravity for \({ n}^{\mathrm{t}}\in ~\{720,2190\}\). In principle, the procedure remains the same, but we would like to emphasize that this set-up increases all matrix elements in \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\), \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\), and therefore also the noise of the XGM2016 model (see Eq. 54).

The results of a collocation with XGM2016 to full degree (case B) are visualized in Fig. 6, where we show again the absolute differences to the pre-defined truth \(\varvec{{\mathfrak {s}}}\). From this point, we retain the form in which we present RLSC with GGM covariance (Eq. 57) on the left-hand side (Fig. 6a) and standard LSC (Eq. 58) on the right-hand side (Fig. 6b). It is obvious that RLSC performs much better in case B. In Fig. 6a, we again see a high correlation with the distribution of the input points. In areas with dense terrestrial observation, e.g., the Pacific Ocean or the bulge in the north, the residuals are mainly below 1 cm. The residuals over the well-covered Chilean area are much smaller than on the Argentinian side, which also results from the corresponding point distributions. In the region R2 the residuals are largest and often above 10 cm. In Fig. 6b, we have in general much larger residuals, so that even areas with dense gravity observations show residuals of at least 2 cm. The higher residuals primarily result from the noise of the GGM since standard LSC regards the noisy XGM2016 model as error-free and therefore fully reflects its error in the result. The RMS of standard LSC in region R1 accounts for 6.9 cm, while the one of RLSC is only 3.9 cm (Table 3).

In summary, RLSC and standard LSC perform very similarly with a satellite-only resolution GGM in case A, but when using a high-resolution GGM as in case B, there are significant improvements when including the GGM covariance (Table 3, Fig. 6). Furthermore, the benefit of including a high-resolution GGM itself is demonstrated when we compare the RLSC result in case B (Fig. 6a) with the result in case A (Fig. 4a). Even with the much higher error from the GGM in case B, the RMS for RLSC in region R1 is reduced from 6.0 to 3.9 cm. This improvement results at least partly from the area with sparse terrestrial measurements in Argentina that has clearly higher residuals in case A (Fig. 4a). However, even areas with small data gaps which are common in mountainous areas and the edges of observed areas benefit highly from the inclusion of a high-resolution GGM in RLSC. The latter is for example visible at the southern end of the Pacific Ocean where case A clearly shows edge effects while case B does not. Standard LSC, on the other hand, performs worse in case B, and its RMS increases due to the noise \({\mathbf {l}}^{\mathrm{n}}\) of the GGM from 6.1 cm in case A to 6.9 cm in case B. We conclude that, in contrast to standard LSC, RLSC is able to handle the noise in the GGM and gives good results in areas with a sufficient number of observations.

Figure 7 shows the formal error of case B, which is derived from the error covariance matrix \({\mathbf {E}}_{\mathrm{ss}}\) or respectively \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) as square root of the main diagonal elements (Eq. 60). The difference between Fig. 7a and Fig. 7b results from the neglected covariance matrices \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) in Eq. 58, and since covariance matrices are positive definite, the estimated error for standard LSC (Fig. 7b) is always smaller than the one from RLSC (Fig. 7a). The corresponding MSD (Eq. 60) in region R1 is 3.6 cm for RLSC and 2.2 cm for standard LSC. Thus, we see that the residuals and the formal errors agree much better (see Table 3) for RLSC (Figs. 6\({+}\) 7a) than for standard LSC (Figs. 6\({+}\) 7b). We again point out the Pacific Ocean and the bulge in Bolivia where in both cases the dark green values of Fig. 7b do not fit to the corresponding error in Fig. 6b. In contrast, we see that for RLSC essentially all peaks in the residuals (Fig. 6a) are indicated by higher values in Fig. 7a as well, which demonstrates that the formal error of RLSC fits much better to its residuals. We conclude that standard LSC cannot realistically represent the formal error since the accuracy of the GGM is not included. Therefore, the result of \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) is always too optimistic. The consistent calculation of \({{\mathbf {E}}}_{\mathrm{ss}}\) is considered as a main advantage of RLSC.

Fig. 7
figure 7

Case B: Formal error of the geoid height calculation. This derives a from \({\mathbf {E}}_{\mathrm{ss}}\) for RLSC, b from \(\bar{{\mathbf {E}}}_{\bar{{\mathrm{s}}} \bar{{\mathrm{s}}}}\) for standard LSC, as the square root of the main diagonal

Fig. 8
figure 8

Case C: Geoid height residuals with the original XGM2016 accuracy. Presented is the absolute difference between the pre-defined truth \(\varvec{{\mathfrak {s}}}\) and a the RLSC result \({\mathbf {s}}\), b the standard LSC result \(\bar{{\mathbf {s}}}\)

4.3 Simulation with a different XGM2016 accuracy

To analyze the behavior of RLSC in dependence of the accuracy of the GGM, we repeat the computations with the original XGM2016 accuracy, i.e., without multiplying it by a factor of 3. Therefore, we recalculate case B with the original XGM2016 covariance matrices \({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\), \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\) and name it case C. Note, that this will generally improve the results since the downscaling of the covariance matrices also decreases the noise \({\mathbf {l}}^{\mathrm{n}}\) and \({\mathbf {s}}^{\mathrm{n}}\) of the GGM which is used in the RCR step.

Figure 8 shows again the resulting absolute geoid height differences between the results of RLSC and standard LSC to the pre-defined truth \(\varvec{{\mathfrak {s}}}\). At first glance the two images in Fig. 8 look similar. As in case B, in general we have small errors in the Pacific Ocean, medium errors in the land areas with observations, and the largest errors in region R2. But nevertheless, we see that the result of RLSC (Fig. 8a) is better than standard LSC (Fig. 8b). The RMS from region R1 amounts to 2.9 cm (Fig. 8a) and 3.3 cm (Fig. 8b), respectively. Especially, the dark green area in the Pacific which indicates residuals below 1 cm is more uniform in Fig. 8a. Also, for RLSC the bulge in the north shows mainly residuals below 1 cm, while for standard LSC it shows much higher residuals.

Table 3 summarizes the RMS and MSD values of the three synthetic test cases in region R1, from which the following conclusions can be drawn. Generally, in all three test cases RLSC performs better than standard LSC. However, the benefit in case A is negligibly small. With the given point distribution in the target area we see a benefit from including a high-resolution GGM in RLSC which decreases the RMS from 6.0 cm (case A) to 3.9 cm (case B). This benefit is particularly apparent in areas with sparse or medium-dense data distributions and at the edges of the terrestrial observations. The inclusion of a high-resolution GGM without using RLSC yields with 6.9 cm the worst result among the three cases. Figure 6b demonstrates this even more clearly than the RMS comparison. As expected, the inclusion of a higher quality GGM (case C) gives better results in both of the LSC methods, but still favors RLSC. In contrast to standard LSC, in all three cases the accuracy of RLSC is well approximated by the formal error estimates. In case B and C the MSD of the formal error differs by only 0.3 cm from the RMS of the residuals (Table 3).

5 Conclusion and outlook

In this paper, we derive and evaluate an approach named residual least-squares collocation that includes several adaptations to previous LSC methods. Principally, we use only residuals as input to RLSC and the stochastic properties of all inputs in the RCR step are separately described by a corresponding covariance matrix. As a result, this adapted formulation only uses error covariance matrices instead of covariance matrices that describe the gravity signal as used in Moritz (1980), Pail et al. (2010), Sansò (2013) or Haagmans and van Gelderen (1991).

We derive the formulation of RLSC from the basics of Moritz (1980) but use the mere stochastic expectation operator E instead of the total average operator \(\bar{{\hbox {E}}}\) for the definition of the covariance matrices. On this basis, for the first time we include a full covariance matrix of a high-resolution GGM in regional geoid modeling. The method also allows us to use covariance matrices that are tailored to a target area. It was already formulated by Tscherning (1999) that the use of isotropic covariance functions does not yield the optimal result in estimating the non-isotropic functionals of the Earth’s disturbing gravity field \({ T}\). Therefore, we derive covariance functions directly from the normal equation system of the high-resolution GGM (here: XGM2016) and use them for the collocation. In general, these location-dependent and non-isotropic covariance matrices fit better to a target area than covariance matrices derived purely from signal degree variances (Moritz 1980; Heiskanen and Moritz 1967) or empirical covariance fitting (Tscherning and Rapp 1974), which both describe only an average part of the Earth (or respectively an average part of a target area). Furthermore, similar to the LSC methods in Pail et al. (2010) or Sansò (2013) RLSC offers the advantage that all stochastic effects are included in a consistent way directly in LSC, which should lead to a realistic accuracy estimation of the output quantities. The downside of the presented approach is that the computation of the covariance matrices for the input and the output points (\({{\mathbf {C}}}_{\hat{{\mathrm{l}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\), \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{l}}}}^{\,\mathrm{m}}\) and \({{\mathbf {C}}}_{\hat{{\mathrm{s}}} \hat{{\mathrm{s}}}}^{\,\mathrm{m}}\)) is very CPU-intensive. For the calculation we exploited the LRZ supercomputing system SuperMUC phase 2 where we used 80 Haswell nodes (Xeon E5-2697 v3). These nodes have 28 cores each with a peak performance of 41.6 GFlops/s, which results in a computation time less than an hour. Additionally, it is necessary to have access to the full normal equation system of the GGM which is frequently not available to the public.

For an optimum evaluation of the benefit of the RLSC method we formulate a synthetic test case with a pre-defined truth. To obtain realistic results from it we add noise to the GGM and to the terrestrial gravity observations. However, a number of assumptions must be included in the scenario. In particular, we assume that the variance–covariance information of the XGM2016 model is correct, since we use it to derive the noise of the GGM. Furthermore, we band-limit the gravity signal to degree 2190. In a case where we use real gravity data, we would have to extend the topographic gravity reduction to higher frequencies. Nevertheless, the simulation of three different test cases yields meaningful results and allows us to evaluate RLSC compared to a standard LSC. We see that regional gravity modeling can benefit from the inclusion of a high-resolution GGM with RLSC. In our test case, the numerical advantage of this is an RMS reduction from 6.0 cm (case A) to 3.9 cm (case B) for RLSC where the RMS values are largely affected by the areas with sparse ground data distribution. In areas with a good data coverage RLSC gives residuals mainly below 1 cm in all of the three test cases. In general, by including a high-resolution GGM in RLSC, we obtain better results especially in areas with sparse observations or close to data gaps. In our opinion, this constitutes a big advantage, because terrestrial gravity measurements are often inhomogeneously distributed and data gaps are almost inevitable especially in mountainous regions. Furthermore, it is shown that the stochastic error estimates of the RLSC approach provide realistic uncertainty estimates, which becomes very important when real gravity data is used.

An additional advantage of the inclusion of a high-resolution GGM in LSC and thus working with rather high-frequency residuals is, that it can significantly reduce the amount of terrestrial data that is needed for the calculation. As shown in Sect. 3, the correlation length when using XGM2016 and a topographic gravity model in the remove step is only \(0.12^{\circ }\). Therefore, it is possible to reduce the amount of data that is included to RLSC locally and for example use only those observations for the calculation that are within a specified distance from the output. Considering this, we regard the calculation of height reference stations for the International Height Reference System (IHRS; more information in Ihde et al. 2017 or Sánchez and Sideris 2017) as a predestined application for RLSC. The high-resolution GGM could be used as the basis of a worldwide height unification, and terrestrial observations would be used for the local refinement around a height reference station. The unique advantage of RLSC in this context is that terrestrial data coverage would not be needed in an area as large as about 200 km around an IHRF station, which is the current default for a satellite-only GGM. Furthermore, IHRS stations will be distributed globally, so that numerous IHRF stations can either benefit from the in general good quality of high-resolution GGMs over oceans that results from satellite altimetry, or are set to areas with little terrestrial data coverage where RLSC appears to perform best. Finally, a correct stochastic accuracy estimation is very important for the calculation of potential values at the IHRF stations.

The next step will be to validate the performance of the presented approach within the IAG Joint Working Group (JWG, 2.2.2: the 1 cm geoid experiment) with real measurements. JWG 2.2.2 aims to assess the calculation of gravity potential values at IHRS stations from different calculation methods, and therefore enables another comparison of RLSC to other regional gravity field determination methods. In any case, with continuously improving high-resolution GGMs and especially with the announcement of the Earth Gravitational Model 2020 (EGM2020) by the NGA, we see the impact of high-resolution GGMs for regional geoid modeling even increasing in the future.