1 Introduction

GSVS14 is a traverse of 200 miles (325 km) in the east-west direction (Fig. 1), crossing the Midcontinent Rift where the gravity anomaly changes from −60 to 80 mGals. The topography variation is moderate with elevations decreasing from approximately 400 m in the west to about 200 m in the east, but the Rocky Mountains stars at the west part.

Fig. 1
figure 1

Topographic height (unit in meter) and location of the GSVS14 traverse (black line)

The traverse contains 204 official marks spaced at about 1 mile (1.6 km). At each mark, long-session GPS data (12–24 h) were collected, then processed and adjusted using the NGS’s Online Positioning User Service (OPUS). At the same time (summer of 2014), the first-order class II spirit leveling (double-run) was conducted between all marks. Absolute gravity was measured at every 10th mark, and relative gravity was surveyed at the marks in between. The Compact Digital Astronomical Camera (CODIAC) of the Swiss Federal Institute of Technology in Zurich was operated off a trailer (“Appendix 2”, Fig. 13). As a result, many of the official marks were not accessible to the camera, and the CODIAC surveyed at an “eccentric mark” on the shoulder of the road within 40 m of the official mark. The DoVs were computed based on star images taken by the CODIAC.

A brief description of the data collection and its accuracy estimation is given in Sect. 2. Section 3 is devoted to the computations of the two geoid control profiles from the GSVS14 surface data. Section 4 describes the computation of the four gravimetric geoid models. All geoid models are first compared mark by mark and then compared differentially with various mark spacing ranging from 1.6 to 320 km. The results and further comparisons in terms of gravity and DoV are presented in Sect. 5. Discussions and conclusions are given in Sect. 6.

2 Data collected by GSVS14

2.1 Ellipsoidal heights determined from GPS data

The precise geodetic coordinates (latitude, longitude and the ellipsoidal height) at the 204 official marks along the traverse were computed from long-session GPS data. In order to have better GPS control for this project, three temporary CORS (Continuously Operating Reference Stations) were set at the beginning, middle and end of the traverse. A long-session static GPS campaign consisted of two groups of ten receivers with identical antennas. One crew started from the west, while the other started from the east of the traverse. Each group occupied 10 neighboring marks simultaneously for two occupation sessions with each session lasting between 12 and 20 h. Each static, multi-station session was processed using OPUS Projects. The software used double differenced carrier phase observations of GPS data with the ionospheric-free linear combination. Only the L1 and L2 GPS carrier phase data were used in the processing. Tropospheric zenith delays were estimated in piecewise linear fashion with a 2-h interval. The root mean square errors in the horizontal and vertical coordinates were estimated to be \({\pm }2\,\hbox {cm}\). All session solutions were then combined in a least-squares network adjustment also within OPUS Projects (Wang et al. 2016). After the network adjustment, the formal accuracy estimates for absolute ellipsoid heights ranged from 1 to 2 mm. When these were combined to estimate accuracy of differential ellipsoid heights and clustered into bins of similar distances, it was found that they ranged from 1.4 to 2.8 mm, with no dependence upon the distance between points. The formal differential error was smaller than the 4.4 mm value obtained in GSVS11 (Smith et al. 2013).

2.2 Orthometric heights by spirit leveling

Field height differences between the 204 marks were measured by leveling following first-order, class II guidelines of the Federal Geodetic Control Committee (Bossler 1984). Absolute gravity was observed at every 10th mark using an A10 absolute gravity meter. Between the marks, a relative gravimeter occupied each mark with 7 repeat observations to reduce random errors. Then the gravity data were adjusted and used to compute the geopotential numbers in a minimally constrained adjustment. The adjustment was fixed to the mark PID4549 with a published height of 289.247 m in the North American Vertical Datum 1988 (NAVD 88) at Ames.

A formal error was assigned to the orthometric height at each mark using the empirical formula \(\pm 0.7\sqrt{d}\,\,\hbox {mm}\) (Zilkoski et al. 1988), where d is the distance between two marks in km. The maximum error would be 12.5 mm for two marks spaced by 320 km.

2.3 DoV observed by CODIAC

The CODIAC is equipped with a GPS receiver, which gives low accuracy coordinates. A more precise location of the camera (to about 0.3 m) is needed to determine precise DoVs. This requirement was fulfilled by utilizing the Iowa Real Time Network (IaRTN). Two 1-min RTN observations were collected at each eccentric mark at the beginning and end of the DoV observation sessions. To check the accuracy of the RTN coordinates, 47 of the 204 official marks were also observed with 1-min RTN observations and compared to the long-session static GPS results. The differences in geodetic coordinates of the two methods are in the range of 2–3 cm in the horizontal and vertical directions (Wang et al. 2016), which correspond to an error of \(0.001''\) in the geodetic latitude and longitude. Note that one mark (GSVS 167), which shows an ellipsoidal height difference of about 30 cm, is considered an outlier and was removed from the coordinate comparison.

The CODIAC camera is an updated version of the DIADEM camera used in GSVS11 (Bürki 2015, personal communication). Automated leveling of the camera was added to make it more user-friendly without changing the main structure of the camera. Therefore, the errors listed in Tables 1 and  2 in Smith et al. (2013) are assumed to still apply. However, this is a new camera system and it was used in a different area, thus it is worthwhile to have this error estimate checked.

The CODIAC camera collected star images at 204 eccentric marks. The camera is equipped with two pairs of tiltmeters of types Lipmann and Wyler. Each pair of tiltmeters is used in a solution to determine astronomical latitude and longitude. These two solutions provide a measure of precision and consistency of the instrument. The following are the statistics of the solution differences.

Table 1 Statistics of differences of north–south \((\xi )\) and east–west \((\eta )\) components of DoV obtained from the solutions using different couples of tiltmeters—the statistics are calculated as results of 480 solutions, and units are in arc second

Table 1 shows a very good agreement between the solutions using different couples of tiltmeters. The RMS values of differences are around one hundredth of an arc second. This is one order of magnitude smaller than the formal accuracy of the camera (0.1\(^{\prime \prime }\)). Extreme differences do not exceed 0.05\(^{\prime \prime }\) along the north–south direction, but there are two solutions having differences larger than 0.05\(^{\prime \prime }\) along the east–west direction.

To test the repeatability and environmental effects on the system, such as changes in the atmospheric conditions, every 20th mark was re-observed after all marks were observed. A total of 11 marks, together with three marks with some uncertainty concerns, were reoccupied. The statistics of the differences are shown in Table 2.

Table 2 Statistics of differences of north–south and east–west components of DoV determined using the repeated observations at 14 marks, with 4 solutions at each reoccupied mark, total 56 solutions

In comparison with Table 1, the RMS values are nearly tripled. Notice that the reoccupation happened after the completion of the DoV survey which took nearly 40 calendar days. The increased differences could be caused by changes in atmospheric conditions as well as all of the error sources listed in Table 5 of Smith et al. (2013). Nevertheless, the RMS values of the solution differences are smaller than 0.05\(^{\prime \prime }\) for both components at the re-observed marks. Based on these statistics, one may conclude that the DoV obtained by the CODIAC camera in Iowa should have an accuracy of \(\pm 0.05^{\prime \prime }\) for its north–south and east–west components, about the same as DIADEM used in GSVS11.

Fig. 2
figure 2

The north–south \((\xi )\) and east–west \((\eta )\) components of DoV along the GSVS14 traverse. Since the traverse is directed east–west, the component Eta contributes the most to the geoid heights. The elevation decreases gradually from 400 m in Denison to 200 m in Cedar Rapids

3 Control geoid profiles computed from GPS/leveling and DoV data

3.1 GPS/leveling geoid profile

The ellipsoidal and orthometric heights at the GSVS14 official marks were computed from the GPS and spirit leveling data as described in Sect. 2. Using these heights, the geoid undulation at mark i, \(N_i^\mathrm{G} \), can be computed as

$$\begin{aligned} N_i^\mathrm{G} =h_i -H_i \end{aligned}$$
(1)

where \(h_{i}\) is the ellipsoidal height determined by GPS and \(H_{i}\) is the Helmert orthometric height determined by leveling (Hoffmann-Wellenhof and Moritz 2006, p. 163). This geoid height, computed at the 204 marks, is hereafter called the “GPSL geoid profile”. Noticing that the leveling adjustment is tied to the bench mark PID4549 in NAVD 88, the geoid undulation computed by (1) is biased when it is compared with gravimetric geoid models. Because the bias is not known accurately, the geoid comparisons can only be made in a relative sense for profile comparisons and in a differential sense for the slope validation.

It is reasonable to assume that errors in GPS and leveling are uncorrelated, and the differential accuracy of the GPSL geoid profile \(\sigma _k^\mathrm{G}\) at a given spacing k can be computed as

$$\begin{aligned} \left( \sigma _k^\mathrm{G} \right) ^{2}=\left( \sigma _k^h \right) ^{2}+(0.7\sqrt{k})^{2} \end{aligned}$$
(2)

where \(\sigma _{k}^h\) is the error variance of differential ellipsoidal height.

3.2 Geoid profile by integrating the DoVs

Figure 2 shows the north–south \((\xi )\) and east–west \((\eta )\) components of the DoV, and the height variation along the test line.

By inspecting the topographic variation and the DoV profile, one can see that the two are not correlated in the long wavelength. The change of Eta from \(-12^{\prime \prime }\) to \(15^{\prime \prime }\) is caused mostly by the Midcontinent Rift, a high density mass structure deep in the crust, with little topographic signature.

Fig. 3
figure 3

GPSL versus DoV geoid profiles and their differences (red curve)

The DoV values shown in Fig. 2 are the angular differences between the plumb line and the normal to the ellipsoid at the eccentric marks on the Earth surface. These values could be reduced to the geoid by correcting the curvature of the plumb line, and then geoid differences could be computed by integrating the DoV values on the geoid (Heiskanen and Moritz 1967, p. 197). However, the height anomaly can be computed using the surface DoV values directly. Instead of the plumb-line correction, the direction of the normal gravity vector is computed at the observation point on the Earth surface. The angle between the normal plumb line and the gravity vector at the observation point is called the dynamic DoV (Hoffmann-Wellenhof and Moritz 2006, p. 336) and is defined by:

$$\begin{aligned} \xi ^\mathrm{d}=\xi -0.17^{{\prime }{\prime }}h\sin 2\phi ,\qquad \eta ^\mathrm{d}=\eta \end{aligned}$$
(3)

where \(\xi ^\mathrm{d}\) and \(\eta ^\mathrm{d}\) are the components of dynamic DoV along a north–south and the east–west directions, respectively; h is the ellipsoidal height in km.

Using related equations in ibid, p. 338, the height anomaly difference between two marks can be computed by integrating the dynamic DoV and the gravity anomaly along the traverse. Notice that the DoVs are sampled at a distance of 1.6 km in GSVS14, thus the line integrals have to be approximated by summations, which introduces an omission error. To reduce the accumulated omission error along the line, a reference gravity model EGM2008 (Pavlis et al. 2012) is used. At a given point i, the reference DoVs, still at the Earth’s surface, \(\xi _i^\mathrm{Ref} \)and \(\eta _i^\mathrm{Ref} \) are computed from the reference model, and then removed from the observed dynamic DoV by:

$$\begin{aligned} \delta \xi _i =\xi _i^\mathrm{d} -\xi _i^\mathrm{Ref} ,\qquad \delta \eta _i =\eta _i^\mathrm{d} -\eta _i^\mathrm{Ref} \end{aligned}$$
(4)

Resulting in \(\delta \xi _i \) and \(\delta \eta _i \), the residual dynamic DoV components at the mark i.

The geoid height N at a given point K from the starting point \((K = 0)\) can be computed from the residual dynamic DoV components, by:

$$\begin{aligned} N_K =\delta \zeta _K +\zeta _\mathrm{Ref} +\delta _K \end{aligned}$$
(5)

where \(\zeta _\mathrm{Ref} \) is the height anomaly computed from the reference model, \(\delta _K \) is the geoid–quasigeoid separation term (Hoffmann-Wellenhof and Moritz 2006, Eq. 8-116), \(\delta \zeta _K \) is the residual height anomaly computed by (cf. ibid, p. 337):

$$\begin{aligned} \delta \zeta _K =\sum _{i=0}^K {\delta \chi _i } +\sum _{i=0}^K {\delta \nu _i } \end{aligned}$$
(6)

where

$$\begin{aligned} \delta \chi _i =-\int _i^{i+1} {\delta \varepsilon \mathrm{d}s} \approx -\frac{\delta \varepsilon _i +\delta \varepsilon _{i+1} }{2}(s_{i+1} -s_i ), \end{aligned}$$
(7)

and where \(s_i \) is the distance between the point i and the starting point, \({\delta \varepsilon }_{i} \) is the projection of the DoV along the line segment between point i and \(i+1\), given by:

$$\begin{aligned} \delta \varepsilon _i =\delta \xi _i \cos \alpha +\delta \eta _i \sin \alpha , \end{aligned}$$
(8)

where \(\alpha \) is the azimuth that can be computed by Eq. (2-388) in ibid, p. 119.

The quantity \(\delta \nu _i \) of Eq. (6) is given by:

$$\begin{aligned} \delta \nu _i =-\int _i^{i+1} {\frac{\delta \Delta g}{\gamma }\mathrm{d}h} \approx -\frac{\delta \Delta g_i +\delta \Delta g_{i+1} }{2}(h_{i+1} -h_i ), \end{aligned}$$
(9)

where

$$\begin{aligned} \delta \Delta g=\Delta g-\Delta g_\mathrm{Ref} . \end{aligned}$$
(10)

and \({\Delta {g}}_\mathrm{Ref}\) is a reference anomaly synthesized from the reference model. For a moderate topography, the 2nd term in (6) is very small. For instance, it only reaches a maximum of 3 mm for GSVS14.

Using the observed DoV data, gravity observed along the line and EGM2008 as a reference field, geoid heights at the eccentric marks are computed using Eqs. (3)–(10). Since the residual height anomaly at the first mark is zero, the DoV geoid profile has the same geoid height as the reference field at this mark. Notice there is a bias between the EGM2008 geoid height and the one of GPSL. For relative geoid comparison, this bias subtracts out. Figure 3 illustrates the GPSL and DoV geoid height profiles and their differences after removing the bias.

Fig. 4
figure 4

Gravity anomalies in the test area. Each dot represents one surface gravity measurement and lines in the east–west and north–south directions represent airborne data. The four slanted lines crossing the Midcontinent Rift are surface gravity measurements. The east–west black line is the GSVS14 traverse

The DoV geoid profile agrees very well with the GPSL profile. The standard deviation (STD) of the differences at the 204 marks is merely \(\pm 1.2\,\, \hbox {cm}\), which is at the same level as the formal accuracy of GPSL. This verifies the high accuracy of both GPSL and DoV control data sets. However, by inspecting the geoid differences in Fig. 3, a slope of 3.6 cm/317 km = 0.11 ppm in the geoid height differences is seen, which is most likely caused by the systematic errors in the DoV data, for reasons explained below. This tilt is about one-third of that seen in the GSVS11 line, where the tilt exceeded 10 cm (Smith et al. 2013; Wang et al. 2013). The smaller systematic error in the GSVS14 DoV data may be due to improvement in the CODIAC, such as the use of the auto-level mechanism and two new tilt-meters, the precise calibration of the system (“Appendix 2”), and modified survey procedure. In addition, better environmental conditions in GSVS14, such as the almost east–west direction of the GSVS14 traverse and favorable atmospheric conditions (less humidity and temperature variation than in Texas where GSVS11 was conducted) may have played a role in reducing the systematic errors of the DoV data.

Assuming that the GPSL geoid profile is tilt-free, the tilt in the DoV geoid profile accounts for a bias in the DoVs of \(0.11\,\, \hbox {mm/km} = 0.023^{\prime \prime }\). This is on the order of the systematic error sources listed in Tables 5 of Smith et al. (2013), e.g., the celestial calibration error. Thus, we conclude that the random and systematic errors of the CODIAC are estimated as \(\pm 0.05^{\prime \prime }\) and \(\pm 0.023^{\prime \prime }\) for the north–south and east–west components of DoV, respectively.

4 Gravimetric geoid models used in the validation

Since the GPS and leveled heights are in the tide-free system, so the geoid height compute from (1) is in the same tide-free system. The gravimetric geoid models were computed also in the tide systems for the validation.

The terrestrial gravity in the test area has a good coverage with varying data distribution density. The GRAV-D flight has an average altitude of 6.3 km, and the data were collected over same time period as GSVS14. The distribution of gravity data and the location of the GSVS14 traverse are shown in Fig. 4.

Using the data in Fig. 4 and the latest satellite gravity models GOCO3S and GOCO5S (Mayer-Gürr et al. 2012, 2015), four different geoid models were computed (see Table 3). The first, xEGM15, is computed using the EGM2008 approach to the same resolution and format. The xEGM15 model incorporates the GOCO03S satellite gravity model, NGS terrestrial gravimetry, and GRAV-D airborne data. The construction of this model is very similar to that of the xEGM-GA model as described in Smith et al. (2013). Essentially, xEGM15 results from the spectral combination of three separate global geopotential models, two of which are ‘disposable/temporary’ and are only created to support the final combination. The first temporary model is identical to EGM2008, except in the survey area where the model has been updated to reproduce the NGS terrestrial data that surrounds the GSVS14 line. The second temporary model is very similar, in that it is also identical to EGM2008, except in the GRAV-D survey area where the model has been updated to reproduce the GRAV-D airborne data that was collected over the GSVS14 line. The third model is the GOCO03S satellite gravity model. All three models are combined at the coefficient level, using ‘error degree variance’ models that have been customized for each data set in the GSVS14 local area. The error degree variances of terrestrial and airborne gravity data at lower degrees were estimated using harmonic analysis with help of satellite gravity models. For higher degrees, the error degree variances of terrestrial and airborne gravity data were computed from the error covariance function computed using the method of leave out-one cross validation (Jiang and Wang 2016). Figure 5 shows the spectral weights used for the combination.

Fig. 5
figure 5

Spectral weights used for geoid model xEGM15

Compared to the spectral weights used by xEGM-GA (Smith et al. 2013, Fig. 9), the contribution of airborne gravity is not limited to a narrow frequency band between harmonic degree 180–420, but from degree 160–2160 because of a much lower flight altitude (6.3 km) of GRAV-D in comparison with the 11 km flight altitude over GSVS11.

Table 3 Geoid models used in the validation

The second geoid model, xG15, is computed by taking the xEGM15 geoid model, and augmenting this with high-resolution (\(1^{\prime } \times 1^{\prime }\)) gravimetry and forward-modeling of a Residual Terrain Model or ‘RTM’ (Forsberg 1984). In this way, the xG15 geoid is computed using xEGM15 as a ‘reference model’ in the classic remove–restore approach.

The next two geoid models, xGEOID15A and xGEOID15B, were not computed specifically for this study. Instead, since 2014, NGS is required to compute two experimental geoid models for the USA and its territories. The most important of these is designated as the ‘B’ model and incorporates any gravimetric data sets available at the time, including all of the GRAV-D airborne data that is processed and ready for use. The corresponding ‘A’ model is intended to be identical to the ‘B’ model, except that it excludes the GRAV-D airborne data. In this way, the differences between corresponding ‘A’ and ‘B’ models show the extent to which the GRAV-D airborne data is contributing to the current geoid solution.

Similar to xG15, the xGEOID15A and xGEOID15B geoid models are both obtained by first computing a spherical harmonic reference model, and then augmenting this with high-resolution gravimetry and RTM information. For xGEOID15A, the supporting xGEOID15A reference model is a combination of EGM2008 and the GOCO05S satellite gravity model. This combination is achieved in two steps. The first step involves a highly localized combination in the space domain, in which geographically specific and degree-wise error models are applied over a small spherical cap. The final combination is performed at the coefficient level. For xGEOID15B, any additional GRAV-D airborne data are propagated into the final geoid model through its supporting reference model. Thus, the reference model for xGEOID15B is similar to that for xGEOID15A, except that EGM2008 has been replaced with a ‘disposable/temporary’ spherical harmonic model that has been augmented with GRAV-D airborne data. Outside of GRAV-D data areas, this temporary model reproduces EGM2008 gravity anomalies/disturbances. Within the GRAV-D data areas, the temporary model reproduces a cleaned, adjusted and filtered version of the GRAV-D airborne gravimetry. This temporary model is combined with GOCO05S using the same two-step procedure as was used for xGEOID15A. Once the spherical harmonic reference models for xGEOID15A and xGEOID15B are complete, they are augmented with \(1^{\prime } \times 1^{\prime }\) NGS gravimetry and RTM information using the same remove–restore methodology as was used for xG15.

The geoid models used in the validation are summarized in Table 3.

5 Geoid validation

5.1 Geoid profile comparisons

We start with the comparisons of geoid models, mark by mark, along the GSVS14 traverse. The differences (Model—GPSL) are shown in Fig. 6. Since the leveling data were constrained to a mark of NAVD 88, there are about 80 cm biases between the geoid models and the GPSL geoid profiles. In addition, EGM2008 and xEGM15 used a \(W_{0}\) value of 62,636,855.69 \(\hbox {m}^{2}\,\hbox {s}^{-2}\), while xGEOID15A/B used a new value of 62,636,856.0 \(\hbox {m}^{2}\,\hbox {s}^{-2}\). The difference between the two \(W_{0}\) values results in a bias of 3 cm between those geoid models of GSVS14 in the test area.

Fig. 6
figure 6

Geoid differences between the control GPSL profile and various models. The 75 cm biases are caused by the use of different geopotential numbers for geoid modeling, and NAVD 88. 80 cm is added to GeoidDoV to make the graph more readable

Table 4 Standard deviation of geoid height differences, relative to the GPSL profile, in cm

By inspecting Fig. 6, one can see that EGM2008 has a large (\({\sim }12\,\, \hbox {cm}\) from west to east) tilt along the traverse, a trend similar to that of the topography. This trend is long wavelength in nature, which could be caused by errors in the satellite model and/or the gravity data reduction in the Rocky Mountains near GSVS14. Another observation is that all gravimetric models have about a \(-9\) cm slope along the western 50 km of the traverse, closest to the Midcontinent Rift, where the gravity anomaly changes from negative to positive (Fig. 4) and the topography reaches its peak. The large slopes have not been corrected by the latest satellite models GOCO03S/5S nor the GRAV-D data. It was shown recently (Li et al. 2016) that large errors in terrestrial gravity over Lake Michigan were corrected by combining a satellite gravity model and GRAV-D data. The immediate question is why the same did not happen this time. The difference between Lake Michigan and the western 50 km along GSVS14 traverse is that there is no topography over a flat lake surface, thus the gravity and topographic reductions are the most likely source of the error. In addition, the spectral weights worked well for the east part of the area, but did not correct the errors in terrestrial gravity in the west 50 km. This may require a more comprehensive weighting scheme in a complex area and a realistic error assessment of terrestrial and airborne gravity data. Research is under the way and the GSVS14 surface data provide a guidance for this type of research.

Carefully inspecting Fig. 6, one can see that there are 2–5 cm dips and bumps at distances of 110, 150, 220, and 260 km along the line, which happen in models with or without the GRAV-D data, as well as in EGM2008. All gravimetric geoid models use the same terrestrial gravity data. Thus, these dips and bumps most likely indicate errors in the terrestrial gravity data. The absence of such relatively large bumps in the difference of the DoV-derived and GPSL control geoid profiles excludes GPS or leveling as a source of these errors. Furthermore, the lack of the large tilt between the DoV-derived and GPSL profiles in the western 50 km further points to the source of the error being terrestrial gravity data, despite the inability of GOCO and GRAV-D to correct it.

The large slopes in the western 50 km of the line have a large impact on the accuracy assessment of geoid models. The statistics of geoid comparisons (to the GPSL profile) with and without the western 50 km are listed in Table 4.

Table 4 shows that all geoid models, except EGM2008, agree with GPSL from 1.4 to 1.6 cm at the 204 marks in terms of standard deviation. The dispersion gets much smaller in the comparisons without the western 50 km. The standard deviation of the geoid height differences is reduced from 1.6 cm and 1.3 to 0.9 cm for both xEGM15 and xG15. Similar improvement happens for EGM2008 as the standard deviation values are reduced from 3.4 to 2.0 cm. xGEOID15A outperforms xGEOID15B slightly (1.4 vs. 1.5 cm) for the whole line, but the later performs slightly better without the west 50 km (1.2 vs. 1.1 cm).

5.2 Differential geoid comparisons

For practical applications, it is significantly more important to check how well the geoid models compare with the GPSL geoid profile differentially, over different baseline lengths. An effective way to do that is by binning geoid height differences over different baseline lengths to get the predicted geoid errors (Smith et al. 2013).

Fig. 7
figure 7

Variances of geoid differences as a function of baseline length. Number of possible pairs for each bin is around 1800. The last bin (248–340) has significant less number of pairs, and it is not shown here

Figure 7 shows that the variances of geoid differences are all below 3 cm for all geoid models except EGM2008. The variances consist of the differential errors in geoid models and GPSL data. Once the differential ellipsoid height accuracy and differential leveling accuracy are accounted for, the only remaining error source in the disagreement between the GPSL geoid profile and a gravimetric geoid should be the geoid itself. The estimated geoid errors for selected geoid models are presented in Fig. 8.

Fig. 8
figure 8

Predicted differential geoid errors (in cm) as a function of baseline length (in km) for the whole traverse

Fig. 9
figure 9

Predicted differential geoid errors (in cm) as a function of baseline length (in km) without the western 50-km section of the test line

Figure 8 shows that all geoid models have a differential accuracy better than 2.5 cm for baseline lengths from 0.4 to 247 km. Geoid height differences between GPSL and EGM2008 have a 12 cm tilt; thus, a similar trend appears in this differential geoid error too. The further the two marks are apart, the larger the differences become and the larger the predicted errors become. The geoid error of EGM2008 increases from 0.8 to 7.8 cm for baseline lengths of 0.4 and 247 km, respectively. A small slope (0.11 ppm) in the DoV geoid height differences also causes the geoid error to increase nearly linearly from 0.3 cm to about 2.3 cm from the shortest to longest baseline lengths. If the tilt in the DoV geoid profile is removed, the error in the DoV data is purely random, and then the error in the DoV geoid profile would act like a random walk, similar to the behavior of the leveling error. In this case, the error would be 0.2–0.5 cm for baseline lengths from 1.6 to 247 km, which agrees well with the error estimate computed using Eq. (16).

The second observation is that all high-resolution gravimetric geoid models (xG15, xGEOID15A and xGEOID15B) perform at similar accuracies, with only small deviations from one another. All high-resolution models have a predicted error between 1 and 1.5 cm for baselines shorter than 122 km and slightly increased errors for longer baselines because of the \(-9\) cm slopes in the western 50 km of the test line. xGEOID15A and xGEOID15B perform nearly the same for all baselines length, but it is troubling to see that the model with GRAV-D (xGEOID15B) does not improve the model without GRAV-D (xGEOID15A). xG15 uses xEGM15 as the reference model and applied an RTM effect and performs the best. As previously mentioned, the \(-9\) cm slopes in the western 50 km of the test line have profound impacts on the geoid comparisons. Thus, the statistics are recalculated without this portion of the traverse and shown in Fig. 9.

The geoid accuracy improvement due to the airborne gravity of GRAV-D becomes evident in Fig. 9. The geoid models xEGM15 and xG15 both have an accuracy around 1 cm or better at any baseline length, a 50–100% improvement to the whole line comparison. xGEOID15B outperforms xGEOID15A in baselines longer than 101 km, and the former has an accuracy at or better than 1.5 cm for any baseline lengths. Notice that xGEOID15B is 50% worse than xEGM15 or xG15 for baseline longer than 101 km. xEGM15 used GOCO03S and the weights were computed from the error degree variances of each data type (Jiang and Wang 2016); and xGEOID15B used GOCO05S and empirically determined weights (Smith et al. 2013). Disregarding the differences between GOCO03S and GOCO05S in the area, the improvement in xEGM15 may be predominately due to the different spectral weights. Thus, the results demonstrate the importance of the proper weights in the geoid modeling.

In summary, all gravimetric geoid models have an accuracy of 2 cm or better for any baselines. All gravimetric geoid models perform almost the same, with or without the GRAV-D airborne gravity data. The improvement of GRAV-D is not clear because of the \(-9\) cm slopes in the western 50-km section of the traverse where the terrestrial gravity is biased by 5 mGal (see Sect. 5.4 for gravity comparison) and is not corrected by GRAV-D for some unknown reason. If the west section is excluded in the comparisons, the GRAV-D airborne data contribution become evident. The long wavelength error in xGEOID15A (Fig. 8 vs. 9) is removed and the predicted accuracy of xGEOID15B outperforms that of xGEOID15A for baselines longer than 101 km. xEGM15 and xG15 perform the best after removing the western 50-km section. Its predicted accuracy is at the 1 cm level or better at any baseline length. Notice that xGEOID15B and xG15 used the same terrestrial and airborne gravity data. The differences between the two geoid models are mainly caused by the different spectral weighting. Spectral weights determined by error degree variances of the satellite model, airborne and terrestrial gravity data seem to give the best result in this case.

5.3 DOV comparison

DoV used in the following comparisons are the angle between the plumb line and the normal to the reference ellipsoid at the eccentric marks on the Earth’s surface, called the Helmert DoV. The north–south \((\xi )\) and east–west \((\eta )\) components of DoV are computed by a spherical harmonic synthesis of EGM08 at the eccentric marks. These components are also computed using the usual spherical formula (Heiskanen and Moritz Eq. 2-204, 1967) at every mark by:

$$\begin{aligned} \xi= & {} -\frac{1}{\gamma R}\frac{\partial T}{\partial \phi } \nonumber \\ \eta= & {} -\frac{1}{\gamma R\cos \phi }\frac{\partial T}{\partial \lambda } \end{aligned}$$
(11)

where R is the mean radius of the Earth.

This type of DoV (Jekeli 1999) is of the gravity type and needs to be transformed to Helmert’s DoV. Because the magnitude of the corrections is a few thousandths of an arc second, we ignore them in the comparisons. The two components of DoV of xGEOID15A/B are computed from the slopes of the geoid models, and the plumb-line correction was applied. The following figures show the differences and the statistics between the observed DoV and the one computed from the geoid models.

Fig. 10
figure 10

a the N–S component of DoV differences between CODIAC and geoid models. b the E–W component of DoV differences between CODIAC and geoid models

Figure 10a, b shows that all models perform similarly, and the large differences happen to be at the same locations. Since the residual DoVs consist mostly of high frequencies and since GRAV-D airborne data contribution is limited mostly to the medium frequencies, no improvements due to airborne data can be seen in Fig. 10a, b. The statistics of the DoV differences are given in the following (Table 5).

Table 5 Statistics of the DoV differences in (a) the north–south direction and (b) the east–west direction, units in arc seconds

5.4 Gravity comparison

GSVS14 collected gravity data at 204 official marks with an accuracy of \(\pm 0.05\) mGal with geolocation accuracy of 1–2 cm horizontally and vertically. This data set is used as a ground truth for the gravity comparison. Gravity anomalies are synthesized using EGM2008 and xEGM15 at the mark locations. The gravity anomalies of xG15 and xGEOID15A/B are computed from its residual grids and their reference fields, and all gravity anomaly profiles along the test line are presented in Fig. 11.

Fig. 11
figure 11

Surface gravity anomalies along the GSVS14 traverse

The gravity anomaly changes about 140 mGal crossing the Midcontinent Rift—from negative 60 to positive 80 mGal. Gravity anomalies of all geoid models, including EGM2008, match the observed gravity anomalies closely, but are missing some fine details. In comparison with Fig. 6, one can see immediately that the fine details in gravity have one-to-one correspondence with the dips and bumps in the geoid differences. The largest differences between the observed and modeled gravity anomalies happen at the western part of the traverse where the maximum geoid difference occurs.

The residuals between geoid model-implied gravity anomalies and gravity anomalies based on observed gravity from GSVS14 are shown in Fig. 12. Although there are large fluctuations in that graph, the difference has a mean of 1.1 mGal, with about 2.3 mGals standard deviation. The residual gravity anomalies associated with every geoid model, with or without the GRAV-D data, have a mean of about 5 mGal at the western 30-km section of the line. Because the same terrestrial gravity went in to every geoid model, but the residuals in Fig. 12 are with respect to newly observed gravity just for GSVS14, this confirms that the large discrepancies at the western section of the line are due to large error in the terrestrial gravity data in this area.

Fig. 12
figure 12

Gravity differences along the GSVS14 traverse

Table 6 Statistics of gravity differences (Model—observed)

Figure 12 and Table 6 show three things. First, the surface gravity anomalies of all gravity models are accurate to about 2.5 mGal, relative to newly acquired gravity. Secondly, since the existing terrestrial gravity is of good quality and the GRAV-D data contributes mostly in medium wavelengths, little improvement is seen in high-frequency variances by adding GRAV-D data. Third, the errors in terrestrial gravity data fluctuate around the mean. They have spatial resolutions ranging from a few km to 20 km and cannot be corrected by satellite models or GRAV-D. The use of a high-resolution DEM could provide higher frequency signal of the gravity data and reduce these errors (Higgins et al. 1996).

6 Conclusions

The first Geoid Slope Validation Survey in Texas confirmed that 1-cm relative geoid accuracy can be achievable in a flat coastal region over baselines from 0.4 to 320 km (Smith et al. 2013). The second survey GSVS14 was conducted in Iowa where the topography is moderate with elevations decreasing gradually from 400 m in the west to 200 m in the east. The traverse crosses the Midcontinent Rift, which causes gravity anomaly changes of about 140 mGal. Two control geoid profiles were computed from the GSVS14 data: one from GPS and leveling data, and another from the observed DoV using the method of remove–restore, where EGM2008 to degree and order 2160 was used as the reference model. The two geoid profiles agree to within \(\pm 1.2 \,\,\hbox {cm}\) in the geoid undulation, attesting a high accuracy of the collected data sets. However, there is a slope of 0.11 ppm in the geoid differences along the 320 km traverse. If we assume that the GPSL geoid profile is tilt-free, then this slope implies a \(0.023^{\prime \prime }\) bias in the DoV data. A bias in the DoV data is not unexpected, as one was also seen in GSVS11. However, this is about one-third of the bias of DIADEM used for GSVS11. The improvement is probably due to the auto-leveling mechanism of the CODIAC and the use of two new tiltmeters. The environmental conditions (less humidity and temperature variation, and the direction of the traverse) may have played a role, too. The random and systematic errors of the DoV data are estimated as \(0.05^{\prime \prime }\) and \(0.023^{\prime \prime }\) for the CODIAC based on the repeated observations and the GPSL geoid profile comparisons. If the systematic error is corrected, the DoV geoid profile should have an accuracy of 0.24 mm per square root of the distance between two marks based on the error propagation law (Eq. 16) which implies a maximum error of 0.5 cm over the GSVS14 traverse.

GSVS14 was used to validate three GRAV-D airborne data-enhanced geoid models: xEGM15, xG15, and xGEOID15B. For quantifying the GRAV-D contribution, xGEOID15A which is computed in a nearly identical way as xGEOID15B, but without airborne data, is included in the comparisons. EGM2008, which has been used in many ways for geoid computations, is also included. The comparisons showed that EGM2008 has a slope of 12 cm along the traverse with a trend similar to the topography. All other geoid models perform nearly the same as one another in the comparisons. They agree with the GPSL geoid profile by about 1.5 cm in mark-by-mark comparisons. In differential comparisons, all geoid models have a predicted accuracy of 1–2 cm for baselines of length ranging from 1.6 to 247 km. In these comparisons, the contribution of GRAV-D is not apparent. The reasons for this may be that the comparisons are distorted by the \(-9\) cm slopes in geoid differences in the western 50-km section of the test line in all geoid models. The gravity comparison confirms that there is about 5 mGal error in the terrestrial gravity that may be the major contributor to the \(-9\) cm slopes. It has been shown that the GRAV-D data corrected a similar type of data bias over Lake Michigan (Li et al. 2016). It is not clear why GRAV-D data has not corrected the error in terrestrial gravity data in the western section of the GSVS14 test line. Since there is no topography over Lake Michigan, the topography and gravity reduction could be the prime suspects. In the same time, the spectral weights worked well for the east part of the area, but did not correct the errors in terrestrial gravity in the west 50 km. This may require a more comprehensive weighting scheme in a complex area and a realistic error assessment of terrestrial and airborne gravity data.

If the western 50-km section is excluded in the comparisons, the improvement due to GRAV-D airborne data becomes evident. The geoid models with and without GRAV-D data, namely xEGM15, xG15 and xGEOID15A, agree with the GPSL geoid profile to 0.9, 0.9 and 1.2 cm, respectively, in point-by-point comparisons. The predicted differential geoid accuracy improves even more clearly. xEGM15 and xG15 have a predicted differential accuracy of 1.0 cm or better, and these outperforms xGEOID15A over all baseline lengths.

The primary goal of the GSVS project is to validate the geoid improvement by using the GRAV-D airborne data. At the same time, the GSVS data can also be used to identify problem areas in the geoid models, e.g., the west 50 km of GSVS14 line. Note that all geoid models were computed before the GSVS14 data became available. The GSVS data can be used to guide further improvement in geoid modeling.

The GSVS surface data has very high accuracy. The 1-cm accuracy of height differences between any pair of GSVS marks requires a clock used in the chronometric leveling to have a frequency accuracy of \(10^{-18}\) (Flury 2016) to reach the same accuracy. Thus, the GSVS data can be useful for developing and validating the ultrahigh accurate clocks. The GSVS surface data are also useful for other high accuracy applications, such as validation of the accuracy of ellipsoidal height differences obtained from the GPS data (e.g., Wang et al. 2014).