1 Introduction

Previous research has already given evidence for correlations between atmospheric pressure loading (ATML)-induced surface displacements and global GNSS height time series (van Dam et al. 1994, 2010; Brondeel and Willems 2003; Tregoning and van Dam 2005; Dach et al. 2011; Wijaya et al. 2013). These environmentally driven surface displacements add nonlinear variations to the GNSS time series being used for investigating Earth processes associated with sea level rise, postglacial rebound, etc. To remove this environmental signal from the GNSS data, surface pressure (SP) models are used to predict the surface displacements. A commonly used dataset for modeling ATML effects includes the 6-hourly, 2.5° × 2.5° SP grids from the National Centers for Environmental Prediction–National Center for Atmospheric Research (NCEP–NCAR) global reanalysis datasets (NCEP-R-1) (Kalnay et al. 1996). The 6-hourly NCEP-R-1 SP-induced displacements were usually averaged into weekly ATML displacements and compared with weekly GNSS station heights (van Dam et al. 1994; Tregoning and Watson 2009; Jiang et al. 2013). The correlations between ATML displacements and GNSS station heights at sub-weekly periods were, however, not studied in sufficient detail yet (Dach et al. 2011). Several new SP datasets have recently become available, which may potentially predict ATML displacements better than NCEP-R-1.

During and after the main production phase of the NCEP-R-l, several processing errors were discovered. To correct the known errors and also update the parameterizations of physical processes in NCEP-R-1, the NCEP–Department of Energy (NCEP–DOE) global reanalysis products (NCEP-R-2) was developed. Besides fixes of the human processing errors, NCEP-R-2 also incorporated upgrades to the forecast model and a diagnostic package that had been developed since the time the NCEP-R-1 system was frozen (Kanamitsu et al. 2002). Therefore, NCEP-R-2 should not be considered as a next-generation reanalysis, but should be regarded as an updated and corrected version of NCEP-R-l.

In addition to the NCEP products, there are other publicly available atmospheric reanalysis products from the European Centre for Medium-Range Weather Forecasts (ECMWF), i.e., the ERA-Interim (Berrisford et al. 2011). The model runs from 1979 to the present and is normally updated once per month (with a delay of two months) to allow for quality assurance and to correct technical problems with the production if any. The spatial resolution of ERA-Interim is 0.75° × 0.75° in latitude and longitude.

In 2009, the Modern-Era Retrospective Analysis for Research and Applications (MERRA) dataset was released. The MERRA project focuses on historical climate analyses at a broad range of weather and climate timescales. It uses a state-of-the-art data assimilation system that includes a variety of modern observing systems in a climate framework to produce a temporally and spatially consistent synthesis of observations and analyses of variables not easily observed (Rienecker et al. 2011). Compared to other updated reanalysis, MERRA has achieved its goals by including improvements in precipitation and water vapor climatology. Like NCEP-R-1, NCEP-R-2 and ERA-Interim, MERRA also provides 6-hourly SP grids. However, the spatial resolution of MERRA SP grids is 2/3° × 1/2° in longitude and latitude.

Finally, without a substantial investment to update assimilation routines, the MERRA data assimilation scheme was frozen in 2008 and, thus, lacked the capability to assimilate the most recent observations. As a result, a new full reanalysis was undertaken, and the second Modern-Era Retrospective Analysis for Research and Applications (MERRA-2) was introduced in early 2016 to replace the original MERRA reanalysis. For this reason, the MERRA reanalysis was completed on February 29, 2016, and MERRA-2 is currently in production. The output of MERRA-2 is on a regular 0.625° × 0.5° longitude by latitude grid (Bosilovich et al. 2016). MERRA-2 is the first long-term global reanalysis to assimilate space-based observations of aerosols and to represent their interactions with other physical processes in the climate system.

In this paper, we model the 3D loading displacement induced by the latest MERRA-2 SP grids and another four products, NCEP-R-1, NCEP-R-2, ERA-Interim and MERRA, at the same 6-hourly temporal resolution. We assess the quality of the five SP models by inter-comparing them with one another and by comparing the displacements predicted by each model with the ITRF2014 daily residual time series at 596 globally distributed GNSS stations (Altamimi et al. 2016). In particular, we determine whether SP products with a higher spatial resolution are better at reducing the scatter of daily GNSS time series in both the Up and horizontal components.

2 Data description

2.1 SP datasets for ATML modeling

We considered five SP models in this study. They all have a temporal resolution of 6 h and were compared over the period from January 1, 2000, until February 29, 2016, the last day of the MERRA dataset. General information about the five SP datasets is given in Table 1. The 6-hourly SP grids from NCEP-R-1 can be accessed directly via HTTP,Footnote 1 with a latency of about 3–5 days. Like NCEP-R-1, the 6-hourly NCEP-R-2 SP grids can be downloaded via HTTP.Footnote 2 They have the same spatial resolution as NCEP-R-1, but the latency is about 4 months due to the error fixing and parameterization update process. The ERA-Interim SP grids can be retrieved from the ECMWF server.Footnote 3 They have a higher spatial resolution than the NCEP-R-1 and NCEP-R-2 grids.

Table 1 General information of the five SP models

The MERRA SP grids are available at the Modeling and Assimilation Data and Information Services Center (MDISC), managed by the NASA Goddard Earth Sciences Data and Information Services Center (GES DISC), as one grid file for each day,Footnote 4 while the MERRA-2 SP grids are also available online through GES DISC,Footnote 5 from January 1, 1980, to present, with a latency of about one and a half months. The spatial resolution of MERRA-2 SP grids is slightly higher as compared to MERRA.

To be consistent with our selected ITRF2014 residuals, we do not remove the diurnal (S1) and the semi-diurnal (S2) frequencies from all the five SP grids, so that the modeled ATML displacement includes the impacts of both tidal and non-tidal ATML effects (see the second paragraph of Sect. 2.3 for details).

2.2 Integer land–sea masks for ATML modeling

In addition to the global surface pressure grids, an integer land–sea mask with the same spatial resolution as the surface pressure grids is also needed to define the land–sea boundary to model the ATML effects for the land and ocean areas separately. Within the integer land–sea mask, if the grid locates in the ocean, then the grid value is assigned as 0. Otherwise, it is equal to 1. For both the NCEP-R-1 and NCEP-R-2 data, an integer land–sea mask is provided at the same spacing as the SP grids. An integer land–sea mask is also provided together with the ERA-Interim SP grids.Footnote 6 These three land–sea masks can be used directly during ATML modeling.

The fraction of land-covered surface in each cell of the MERRA grids can be found in the MERRA constants file collection.Footnote 7 Unlike NCEP and ERA-Interim, the MERRA surface parameterization includes fractional coverage of land, ocean, lake and land ice (Lucchesi 2012). For example, at coastlines, if the grid cell includes a mixture of land and sea coverage, thus the grid value may assign as 0.75 or 0.125, depending on the distance from the ocean. The smaller the distance, the closer the grid value to 0. Under this circumstance, if we simply use the provided file as fractional land–sea mask, then all the grid cells with nonzero value would be considered as land, which definitely results in an overestimation of the land area. Correspondingly, quite different ATML loading displacement would be obtained for stations at coastlines due to the IB correction. Therefore, in order to obtain more realistic result than using the original provided fractional land–sea mask, the MERRA constants file that describes surface coverage should be tailored to an integer land–sea mask before modeling ATML displacements. The MERRA-2 constants fileFootnote 8 has exactly the same data structure as the MERRA constants file and was tailored to a binary integer land–sea mask in the same way. For the generation of MERRA and MERRA-2 integer land–sea mask, please refer to “Appendix.”

2.3 GNSS position time series

ATML-driven surface displacements can be observed in GNSS vertical position time series (van Dam et al. 1994, 2010; Tregoning and van Dam 2005). Comparison between ATML and GNSS time series can be used to test the different ATML modeling methods. However, in addition to ATML signals and other known geophysical effects, e.g., continental water storage and non-tidal ocean loading-induced displacements (van Dam et al. 1997, 2001, 2012; Tregoning et al. 2009; Fritsche et al. 2012; Williams and Penna 2011; Yuan et al. 2018), GNSS-related systematic errors also contribute to the observed nonlinear variations in GNSS time series (Ray et al. 2008, 2013). Previous results have shown that imperfect GNSS data processing models and strategies could result in spurious nonlinear displacements of GNSS stations, which mask real crustal displacements, thus affecting the comparison between GNSS and predicted loading signals (Deng et al. 2017; Li et al. 2014, 2018; King and Watson 2010; Penna and Stewart 2003; Penna et al. 2007; Tregoning and Watson 2009).

To minimize the impact of the GNSS-related errors, we used here daily GNSS residual time series that obtained in the frame of the ITRF2014 computation (Altamimi et al. 2016). The GNSS contribution to ITRF2014 consists of daily solutions provided by the International GNSS Service (IGS; Johnston et al. 2017) in the frame of its second reprocessing campaign (repro2; Rebischung et al. 2016) and covering the period January 2, 1994–February 14, 2015. Compared to previous ITRF realizations, the IGS repro2 solutions have improved due to the implementation of better background models and methodology (https://acc.igs.org/reprocess2.html). In addition, the IGS repro2 solutions represent a sum of contributions from nine independent analysis centers (ACs) and are as such potentially more precise than any one individual AC contribution. During repro2 implementation, the IGS analysis centers (ACs) were asked not to apply model corrections for the load displacements caused by large-scale non-tidal atmosphere, ocean and hydrological surface fluid motions; thus, the daily ITRF2014 residuals contain the non-tidal ATML signals. Besides, only two ACs (GFZ and GRG) applied the S1/S2 atmospheric tidal corrections, while the other seven ACs did not (Griffiths 2019). Therefore, we finally assume that the combined ITRF2014 residuals also contain the tidal ATML effect. This is actually an inconsistency here. However, since the impact of S1/S2 is rather small, we think that the assumption will not affect our final results much.

In the first step of the ITRF2014 computation, the IGS repro2 daily solutions were stacked and functional models, including piecewise linear functions, annual and semiannual sine waves, as well as logarithmic and/or exponential functions in case of post-seismic displacements (Altamimi et al. 2016), were adjusted to the IGS repro2 station position time series. The GNSS displacement time series we use here are the residuals from this stacking, to which the estimated annual and semiannual signals have been restored. They are free of linear trends, offsets and post-seismic signals and can be directly compared with ATML displacements predicted from the different SP models. Note that our obtained ITRF2014 daily residuals (directly from IGN) contain both position information and the formal error at each epoch, so that in the later data processing stage, we could compute the weighted root mean square (WRMS) of the GNSS coordinate time series.

To illustrate the advantage of our used GNSS time series, Fig. 1 plots the ITRF2014 residual time series together with the other three products from the Making Earth Science Data Records for Use in Research Environment (MEaSUREs) program,Footnote 9 including the cleaned residual time series from SOPAC, JPL, as well as the combined solution from SOPAC and JPL, for station ARTU as an example. To make the figure more readable, time series of only 5 years between 2010 and 2015 are shown here. It can be clearly seen that the ITRF2014 residuals exhibit the smallest scatter for all the three components among the four products without any sudden jumps.

Fig. 1
figure 1

Cleaned residual time series for station ARTU from different institutions. The blue curve represents the product from SOPAC, red indicates the JPL solution, orange illustrates the combined solution from SOPAC and JPL, while purple demonstrates the ITRF2014 residuals

3 Data processing

3.1 Modeling SP-induced surface displacement

We computed the 3D surface displacements at the ITRF2014 GNSS stations induced by SP variations over a spherical grid using Farrell’s Green’s function approach (Farrell 1972; van Dam and Wahr 1987; van Dam et al. 2010). They can be expressed as:

$$ \begin{aligned} d_{{\text{n}}} \left( {\theta ,\varphi } \right) & = \mathop \sum \limits_{i = 1}^{{n{\text{lon}}}} \mathop \sum \limits_{j = 1}^{{n{\text{lat}}}}\Delta P_{i,j} G_{i,j}^{n} A_{i,j} \\ d_{{\text{e}}} \left( {\theta ,\varphi } \right) & = \mathop \sum \limits_{i = 1}^{{n{\text{lon}}}} \mathop \sum \limits_{j = 1}^{{n{\text{lat}}}}\Delta P_{i,j} G_{i,j}^{e} A_{i,j} \\ d_{{\text{u}}} \left( {\theta ,\varphi } \right) & = \mathop \sum \limits_{i = 1}^{{n{\text{lon}}}} \mathop \sum \limits_{j = 1}^{{n{\text{lat}}}}\Delta P_{i,j} G_{i,j}^{u} A_{i,j} \\ \end{aligned} $$

where i and j denote a unique loading grid cell, Ai,j is the area of the grid cell, nlon and nlat are the numbers of grid cells in longitude and latitude, respectively. θ and φ represent the colatitude and longitude of a given station where the effect of the load is being determined. dn(θ, φ), de(θ, φ) and du(θ, φ) are the loading displacements of the station in the North (N), East (E) and Up (U) directions. \(G_{i,j}^{{\text{n}}}\), \(G_{i,j}^{{\text{e}}}\) and \(G_{i,j}^{{\text{u}}}\) denote the Green’s functions for each component (Farrell 1972), which depend on the angular distance between the loading grid cell and the station. The ITRF2014 residual displacement time series were expressed in the center of figure (CF) reference frame (Dong et al. 1997, 2003; Blewitt 2003). Therefore, we used Green’s functions in the CF frame to model ATML displacements, so as to maintain consistency between ATML and GNSS displacements. To avoid overestimating the loading displacement, we applied different data processing strategies for stations far away and close to the load grid cell. If the angular distance between the loading grid cell and the station is smaller than 5°, we divide the grid unit into 100 smaller units and recalculate the Green’s functions for the new refined grid cells. If the angular distance between the new grid cell and the station is less than 0.5°, we then divide the new grid into another 100 smaller units and recalculate the Green’s function for them separately. In this way, if 2.5° SP grid is used, for the station very close to the load grid cell, the highest spatial resolution is actually 0.025° (2.5/10/10). Therefore, no matter the original load grid is sparse or dense, our code works well in calculating the modeled ATML displacement.

Finally, ΔPi,j is the pressure variation at the grid cell. Since we are only interested in time variable effects, a 10-year mean surface pressure from 2000 to 2009 (hereafter called reference pressure) is removed, so that ΔPi,j corresponds to the original surface pressure from each dataset minus its corresponding reference pressure. Note that the reference pressure for each grid cell is independent of the selected continuous period, which refers to the uninterrupted SP data span from the first epoch on January 1 of the starting year until the last epoch on December 31 of the ending year. For example, if the data span is from 2000 to 2009, then the reference pressure at each grid cell equals to the summation of the gridded pressure value from the first epoch of January 1, 2000, until the last epoch of December 31, 2009, divided by the total epochs, that is, four epochs per day, 365 or 366 days per year, depending on the leap year or not.

During the calculation, an integer land–sea mask with the same spatial resolution as the surface pressure grids was used to define the land–sea boundary, and the pressure loading over the oceans was determined using a modified inverted barometer (IB). For a detailed discussion of the modified IB, readers are referred to van Dam and Wahr (1987). Concerning this, we should keep in mind that any observed differences in the displacement for coastal sites would depend not only on the pressure grid itself, but also on the IB correction based on different land–sea masks that used for each SP dataset.

3.2 Comparison of ATML displacements and ITRF2014 residuals

After having obtained predicted ATML displacement time series from the above five SP datasets, we compared them to the ITRF2014 residual time series of a selected set of GNSS stations. Due to the relatively lower quality of IGS products before 2000 (Griffiths 2019), we restricted this comparison to the period after 2000 and first selected the ITRF2014 GNSS stations with at least 5.5 years of observations after 2000, so as to obtain a more reliable global statistical result and also to ensure that there are as many stations as possible that evenly distributed around the globe. We additionally removed stations with abnormally scattered residual time series, i.e., with standard deviations larger than 10 mm and 5 mm for the Up and horizontal components, respectively. This selection procedure left us with 596 globally distributed GNSS stations shown in Fig. 2.

Fig. 2
figure 2

Spatial distribution of the 596 selected GNSS stations

Our ATML displacement time series all have four epochs per day at 00:00, 06:00, 12:00 and 18:00. In order to make them comparable with our daily GNSS time series, we first averaged them into daily estimates (no weighting has been applied here for each epoch) and then applied them as a posteriori corrections to remove the ATML effect from the daily ITRF2014 residuals. This procedure is easy to implement and also reasonable, as Tregoning and van Dam (2005) have already proved that the majority of the ATML effect could be modeled by applying a daily averaged correction. Moreover, it has advantages for the rigid processing scheme applied within the IGS, since in this way, the ATML effect can be corrected after data analysis and before combination in a consistent way using identical values for solutions based on the same or different space-geodetic techniques (Dach et al. 2011). A greater improvement would be expected for the performance of each SP model in correcting the GNSS scatter if the ATML correction is applied at the observation level (Tregoning and van Dam 2005; Dach et al. 2011). Nevertheless, no matter whether the ATML correction is applied on the observation level or as a posteriori daily averages, we think that the SP model difference would not be affected much, since we use the same GNSS data during comparison.

Finally, according to our previous research on the continental water storage (CWS) models, we found that the MERRA-Land product with the highest spatial resolution at the time performed the best in reducing the weekly scatter of the GPS height (Li et al. 2015). Therefore, we speculate that the ATML model would exhibit the same characteristics. Since MERRA-2 grid is currently the most recent product with the highest spatial resolution among the five SP datasets, we first described the ATML displacements derived from MERRA-2 and compared them with the selected ITRF2014 GNSS residual time series. We then used the MERRA-2 results as the reference to compare the differences between ATML models.

4 Comparison of MERRA-2-derived ATML displacements with ITRF2014 GNSS residual time series

4.1 Global variability of MERRA-2-derived ATML displacements

To have a global view of the MERRA-2-derived ATML displacement, Fig. 3 depicts the time series of some example stations with maximum ATML displacement for the Up, North and East component, while Fig. 4 depicts the scatter (top panels) and the absolute maximum displacement (bottom panels) of the 6-hourly ATML time series for the Up component of the selected 596 stations during the period January 1, 2000–February 29, 2016. For clear comparison, the ITRF2014 residuals time series for the example stations are also presented in Fig. 3. Note that in the middle panel of Fig. 3, there is a sudden large deviation in the North component of CAS1 on December 17, 2008. Then from December 18, 2008, until December 31, 2008, there are no data in the ITRF2014 residuals. We checked the log file for station CAS1 from IGS Web siteFootnote 10 and only found that the receiver type had been changed on December 3, 2008, from ASHTECH to LEICA. Since we could not find the exact reason that caused the sudden deviation, this epoch was excluded in the later comparison. Also, because coastal regions cannot be represented accurately enough by the relatively sparse spatial resolution of our selected SP grids, here we defined a special group of inland stations, which was model dependent based on the corresponding land–sea mask, so as to separate the impact of the IB correction. Specifically, inland stations indicated all stations 50 km, 54 km, 60 km and 200 km away from the nearest coast for MERRA-2, MERRA, ERA-Interim and the NCEP grids, while the number of the inland stations was 174, 168, 166 and 113, respectively.

Fig. 3
figure 3

Time series of stations with maximum MERRA-2-derived ATML displacement (blue curve) together with their corresponding ITRF2014 residuals (red curve)

Fig. 4
figure 4

RMS (top) and maximum amplitudes (bottom) of the 6-hourly MERRA-2-derived ATML displacement time series for the Up component. Units are mm

We notice that the spatial patterns in Fig. 4 are globally consistent with previous research focused on ATML displacements (van Dam et al. 1994; Dach et al. 2011). Due to the small pressure variation and the assumed IB response, the smallest ATML displacements in the Up component predicted from MERRA-2 SP mostly locate in the low- and mid-latitude oceanic and coastal regions, with scatters of less than 2 mm and maximum displacements of 5 mm (Fig. 4). Most continental stations in the Northern Hemisphere, especially the inland stations, have scatters of greater than 3 mm. In particular, the absolute maximum ATML displacements of some stations located at high latitudes reach more than 18 mm due to the larger storms found there (bottom panel of Fig. 4), for example station ARTU (Russia, 24.2 mm, top panel of Fig. 3), etc. Because of the irregular distribution of land and ocean on the Earth’s surface, there are significantly more ocean and much less land in the Southern Hemisphere, so that the Southern Hemisphere climates tend to be slightly milder than those at similar latitudes in the Northern Hemisphere. Correspondingly, the pressure variation and its induced ATML displacement for the Up component of continental stations in the Southern Hemisphere are also about 1/3–1/2 of the magnitude for those in the Northern Hemisphere (Fig. 4).

The MERRA-2 SP variations could only induce very small displacements in the East and North components (horizontal hereafter), with scatters of less than 1 mm and maximum displacements of 4 mm, respectively. To save space, in the following analyses, most of the figures that illustrate the characteristics of the horizontal components are presented in the supplementary material. The same as the Up component, stations at low- and mid-latitudes exhibit the smallest horizontal displacements due to the small pressure variation, particularly for those located in the oceans and along the coasts (Figure S1, S2), where the IB response exhibits the biggest impact. The largest horizontal ATML displacements are also found at high latitudes where big storms occur, especially in Antarctica, Northern Europe and Russia. For example, station CAS1 (Antarctica, middle panel of Fig. 3) and SCOR (Greenland, bottom panel of Fig. 3) exhibit the biggest horizontal scatter of 0.99 mm and 0.94 mm for the North and East component, respectively.

4.2 Correlation between MERRA-2-derived ATML displacements and ITRF2014 residuals

To compare MERRA-2-derived ATML displacements with the selected ITRF2014 GNSS residual time series, we first calculated the correlation coefficients between both datasets (Figs. 5, 6). The correlations are generally low in the horizontal components, with only 9.6% and 5.7% of all stations showing correlations of higher than 0.4, while 11.9% and 7.4% even exhibiting negative correlation in the East and North components, respectively, among which most belong to stations near the coasts.

Fig. 5
figure 5

Correlation coefficients between MERRA-2-derived ATML displacements and the ITRF2014 residual time series for all 596 stations

Fig. 6
figure 6

Histogram of correlation coefficients between MERRA-2-derived ATML displacements and the ITRF2014 residual time series. The X label describes the correlation coefficient. The Y label illustrates the number of stations with correlation inside different ranges. Blue color represents the results for all 596 stations, while orange denotes inland stations

The correlations for the Up component are larger, with a mean value of 0.33, and 34.4% of all stations exhibit correlations of higher than 0.4, mostly in the middle- and high-latitude continental regions (top panel, Fig. 5). For some Eurasian continental stations, the correlations even reach more than 0.7, for example at station IKRM (Russia, 0.78) and BJFS (China, 0.74). Stations with poor correlations are those located in the low- and mid-latitude oceanic and coastal regions, some of which also having negative correlations, e.g., station XMIS (Christmas Island, Australia) with the largest negative correlation of -0.22. Among the 174 inland stations (Fig. 6), there exists no negative correlations in the Up direction, and the mean correlation is 0.43, which is much larger than that for all stations (0.33).

4.3 WRMS reduction of ITRF2014 residuals

We calculated the relative WRMS reductions (see detailed definition in Jiang et al. 2013) obtained when removing the MERRA-2-derived ATML displacements from the ITRF2014 residual time series. The relative WRMS reductions of individual stations are shown in Fig. 7, while Fig. 8 depicts the number of stations with relative WRMS reductions inside different ranges. We find that 85.1%, 73.3% and 71.8% of all stations have their WRMS reduced by ATML corrections in the Up, East and North components, respectively. In the Up component, this number is larger than those obtained in previous studies (e.g., van Dam et al. 1994, 2010; Brondeel and Willems 2003; Wijaya et al. 2013). However, here we cannot yet determine whether this improvement stems from using improved GNSS coordinate time series or the MERRA-2 SP grids. We will explore this question further in the next section.

Fig. 7
figure 7

Relative WRMS reductions obtained when correcting ITRF2014 residual time series from MERRA-2-derived ATML displacements for all 596 stations. Unit is %

Fig. 8
figure 8

Histogram of relative WRMS reduction. The X label describes the WRMS reduction (unit: %). The Y label illustrates the number of stations with WRMS reduction inside different ranges. Blue color represents the results for all 596 stations, while orange denotes the 174 inland stations

The relative horizontal WRMS reductions are very small, usually less than 5%, with mean reduction of 3.1% for the North component and 3.5% for the East component (Fig. 8; Table 2, Sect. 5.2). For inland stations, the mean horizontal WRMS reduction is only slightly larger, at around 3.6%. The largest improvements are obtained for station POL2 in Kyrgyzstan and station SCOR in Greenland, with relative WRMS reductions of 15.0% in the North and 22.2% in the East.

Table 2 Statistics of the relative WRMS reductions obtained with different SP datasets for all 596 stations and inland stations

The relative WRMS reductions for the Up component are globally larger, with a mean reduction of 8.3%, and their spatial distribution is similar to that of the correlation coefficients in the Up direction (Fig. 5). Due to the small pressure variation and the assumed IB response, stations in the low- and mid-latitude oceanic and coastal regions mainly show small relative WRMS reductions (< 5%), while those stations in the middle- and high-latitude continental regions, where larger storms occur, mostly exhibit large relative WRMS reductions (> 10%), particularly for the Northern Hemisphere because of the irregular distribution of the land and ocean (Sect. 4.1). The largest improvement reaches more than 30% at some stations, for example station ULAB in Mongolia (34.3%) and stations BJFS and CHAN in China (33.9%). If only considering inland stations, 97.1% (174 in total) show WRMS reduction in the Up component, with mean reduction of 11.4%, and the best improvement concentrates mainly in the Eurasia continent (Figs. 7, 8).

5 Discussion

From the results presented in Sect. 4, we conclude that the scatter of the daily ITRF2014 GNSS residual time series can be globally reduced using the latest, high-resolution MERRA-2 SP grids, particularly in the Up component. In this section, we compared the ATML displacements derived from different SP datasets (Sect. 5.1) and their abilities to reduce the scatter of GNSS time series (Sects. 5.2 and 5.3). Then, we compared the daily pressure variation from the five SP datasets to further interpret the ATML displacement differences (Sect. 5.4).

5.1 Correlation between ATML displacements derived from MERRA-2 and other SP datasets

To investigate the differences of the ATML displacements derived from different SP datasets, we calculated the correlation coefficients between MERRA-2-derived displacement time series and time series derived from the other SP dataset at the 596 selected GNSS stations. The correlation results for the Up component are shown in Fig. 9, while those for the horizontal components are plotted in Figure S3 and Figure S4. Figure 10 depicts the histogram of the correlation coefficients for both all stations (blue color) and inland stations (orange color). We find that in general all the five sets of ATML displacement time series are very consistent with each other.

Fig. 9
figure 9

Correlation coefficients for the Up component between ATML displacements derived from MERRA-2 and the other four SP datasets for all 596 stations. Black dots indicate the correlation coefficients are smaller than the minimum value of the scale

Fig. 10
figure 10

Histogram of correlation coefficient between MERRA-2- and other model-derived ATML displacements for the Up, East and North components. The X label describes the correlation coefficient. The Y label illustrates the number of stations with correlation coefficient inside different ranges. Blue color represents the results for all 596 stations, while orange denotes the results for inland stations

Among the four other datasets, MERRA exhibits the highest correlations with MERRA-2 (bottom panel of Fig. 9, S3, S4; the last three panels of Fig. 10), with mean correlations of more than 0.995 in all three components. The correlations between NCEP-R-2 and MERRA-2 are slightly lower (top panel of Fig. 9, S3, S4; the first three panels of Fig. 10), with mean correlations of 0.986, 0.985 and 0.989 in the North, East and Up components, respectively. Further, only a few island and coastal stations (less than 3% of all stations) have correlations less than 0.90. NCEP-R-1 correlates closely with MERRA-2 as NCEP-R-2 (the second panel of Fig. 9, S3, S4; the three panels in the second row of Fig. 10), while ERA-Interim shows the lowest correlations with MERRA-2 (the third panel of Fig. 9, S3, S4; the three panels in the third row of Fig. 10), but with mean correlations still larger than 0.96 in all three components. Again, the lowest correlations (< 0.8) are found in the low- and mid-latitude oceanic and coastal regions, particularly for the island stations. There are two possible reasons that may cause this low correlation: The land–sea masks used for MERRA-2 and ERA-Interim may be quite different, or there may be quite different SP distribution over the oceans for these two products.

With respect to inland stations, we observe that the correlation between ATML displacement derived from MERRA-2 and the other four SP datasets is larger than those near the coasts. In particular, the lowest correlations in the Up direction are larger than 0.91 for all the four datasets (Fig. 10), while the horizontal correlation is slightly smaller, with the lowest correlation located at station HARB (0.61, South Africa, 480 km from coast) and RIOP (0.83, South America, 146 km from coast) in the North and East component for the ERA-Interim product. The difference between all stations and inland stations further confirms our deficiency during ATML modeling for coastal regions. However, at this stage, we cannot decide whether the SP grid itself, or the land–sea mask, is the main cause for the bigger differences near the coast. We will answer this question later.

5.2 Relative WRMS reductions of ITRF2014 residuals based on different SP datasets

In this section, we removed the predicted ATML displacement based on various SP models from the GNSS coordinate time series. We first calculated the relative WRMS reductions of the ITRF2014 residual time series for each SP dataset, as we have done in Sect. 4.3. Table 2 gives the percentages of stations with WRMS reduced after correcting the ATML effects using different models together with their average and maximum WRMS reduction for both all stations and inland stations. In general, all five SP datasets are able to reduce the nonlinear variations in the Up component of GNSS time series. ERA-Interim performs the best, NCEP-R-1 and NCEP-R-2 perform slightly worse, MERRA ranks fourth, and MERRA-2 shows the worst performance, although MERRA-2 is currently the most advanced SP dataset with the highest spatial resolution. This is a quite different situation as with continental water storage, for which the higher spatial resolution MERRA dataset was found to perform best (Li et al. 2015). We think that the differences mainly come from the different assimilation methods that used for each model, since similar raw observations have been assimilated for all the SP products. The advantage of ERA-Interim over the other datasets may be due to the four-dimensional variational data assimilation method (4D-Var) applied. In addition, its data assimilation benefits from more extensive use of radiances with an improved fast radiative transfer model. Both NCEP-R-1 and NCEP-R-2 only adopt a three-dimensional variational data assimilation system (3D-Var); thus, their performances are slightly worse. The MERRA and MERRA-2 also use 3D-Var as the assimilation framework, and the incremental analysis updates (IAU) procedure is implemented to slowly adjust the model state toward the observed state. However, the focus of the assimilation process is to simulate the hydrological cycle correctly (Lindsay et al. 2014). The water cycle has been improved (Li et al. 2015), but the quality of the surface pressure seems to be slightly degraded.

Depending on the SP dataset used, between 85.1 and 90.3% of the selected GNSS stations have their WRMS reduced by ATML corrections in the Up component for all 596 stations, with average WRMS reduction of around 8.3% (Table 2). These values are even 11–16% larger than the weekly comparison result reported after considering the topographic effects (van Dam et al. 2010). Since our numbers are similar among the different tested SP datasets, we think that this improvement over previous studies is mainly due to our use of the latest-generation, reprocessed GNSS time series. In addition, most of the stations (> 67%) also have their horizontal WRMS reduced by ATML corrections using any of the five SP datasets, although the magnitude of the horizontal WRMS reductions is quite small, with average WRMS reduction at around 3.1% (Table 2), and most improvements (> 76%) are less than 5%. An important thing that should be noticed is that although ERA-Interim does the best performance in reducing the WRMS for the Up and East components among the five SP models, it performs the worst in reducing the WRMS for the North component (Table 2). One possible reason may be the interference between ERA-Interim-derived ATML displacement and the impact of the temperature gradient in the North–South direction. Further research is still required to investigate the exact reasons.

With respect to inland stations only, we observe that more than 96% of the remaining stations exhibit WRMS reduction in the Up component for all the five SP datasets, and the average WRMS reduction is also 3–4% larger compared to the result of all 596 stations (Table 2). Nevertheless, with respect to the horizontal component, there is only a slight improvement on the average WRMS reduction (~ 3.5%) and the percentage of stations with WRMS reduced (> 67%). In particular, the maximum horizontal WRMS reduction exhibits mostly near the coasts using any of the products, especially for the E component, which is quite different from the Up component.

5.3 Relative WRMS reduction difference of the SP models w.r.t MERRA-2

We then compared the relative WRMS reductions obtained with the four other SP datasets to the relative WRMS reductions obtained with MERRA-2. The differences in relative WRMS reductions for the Up component are shown in Fig. 11a, while the results for the E and N components are plotted in Figures S5 and S6, respectively. Figure 11b illustrates the histogram of relative WRMS reduction differences for the U, E and N components. Negative values mean that MERRA-2 reduces the scatter of GNSS time series more than the other SP datasets. We observe that there is very little difference between MERRA and MERRA-2 in terms of relative WRMS reductions (bottom panel of Figs. 11a; S5; S6). This was expected from the very high correlations between MERRA-derived and MERRA-2-derived ATML displacement time series (Sect. 5.1).

Fig. 11
figure 11figure 11

a Relative WRMS reduction differences (%) of different models w.r.t MERRA-2 for the Up component of all 596 stations. b Histogram of relative WRMS reduction differences (%) of different models w.r.t MERRA-2 for the Up (U), East (E) and North (N) components. The X label describes the WRMS reduction difference. The Y label illustrates the number of stations with WRMS reduction difference inside different ranges. Blue color represents the results for all 596 stations, while orange denotes the results for inland stations

The relative WRMS reduction differences between ERA-Interim and MERRA-2 are larger (the third panel of Figs. 11a; S5; S6). This is also expected given the lower correlations between ERA-Interim-derived and MERRA-2-derived ATML displacement time series (Sect. 5.1). ERA-Interim is better at reducing the scatter of GNSS height time series than MERRA-2 for most stations (65.1%), especially in North America, Eurasia and Antarctica. However, the majority of stations in South America, South Africa, Australia, together with some island stations in the Indian Ocean and the Pacific Ocean, have their WRMS in the Up component more reduced with MERRA-2. On the other hand, MERRA-2 performs better at reducing the scatter of horizontal GNSS time series than ERA-Interim for more than half stations, especially in coastal regions. Nevertheless, most stations in Europe have their horizontal WRMS more reduced with ERA-Interim.

Using NCEP-R-1, 41.9%, 49.0% and 47.1% of the stations show improved WRMS reductions for the North, East and Up components, respectively, as compared to using MERRA-2 (see the second panel of Fig. 11a). The biggest improvements are located mainly in Europe, together with several coastal stations in North America, Antarctica and New Zealand. The spatial distribution of the relative WRMS reduction differences between NCEP-R-2 and MERRA-2 is quite similar as between NCEP-R-1 and MERRA-2 (top panel of Fig. 11a). NCEP-R-2 is slightly better at reducing the scatter of GNSS time series for most stations in Europe than NCEP-R-1 and MERRA-2. However, for most island and coastal stations, MERRA-2 is superior to NCEP-R-2.

The WRMS reduction difference between model and MERRA-2 becomes remarkably smaller for the inland stations, particularly for MERRA, of which 98.8% of the stations show differences of less than 1% for the Up component, while all the horizontal differences are within 1% (bottom panel of Fig. 11b). NCEP-R-1 and NCEP-R-2 show slightly bigger difference (top and the second panel of Fig. 11b). Stations with WRMS reduction differences of smaller than 1% account for more than 82%. ERA-Interim exhibits the biggest difference with respect to MERRA-2, but still more than 93% of the stations showing WRMS reduction difference of smaller than 2% for all the three components (the third panel of Fig. 11b). This comparison result further confirms the finding from Sect. 5.1 that stations near the coasts exhibit bigger difference than those far away from coasts in the SP-derived ATML displacement.

5.4 Daily surface pressure variation derived from different SP datasets

To investigate the differences between modeled ATML displacements from the SP grid itself, Fig. 12 illustrates the daily pressure variation difference of the four SP products w.r.t MERRA-2 on July 15, 2014, as an example. Positive (reddish color) means the MERRA-2 pressure variation is smaller than the selected model, while negative (bluish) means MERRA-2 is larger. The Generic Mapping Tools (GMT) software was used here to interpolate all the grids into the same spatial resolution as MERRA-2 for differencing (Wessel and Smith 2018). Note that here the date July 15, 2014, was arbitrarily selected, as the daily pressure variation differences were almost the same for different days in different seasons.

Fig. 12
figure 12

Daily pressure variation differences of the four SP models w.r.t MERRA-2 on July 15, 2014. Unit of the pressure variation difference is mbar. Top-to-bottom panels represent the NCEP-R-2, NCEP-R-1, ERA-Interim and MERRA product

We observe that in general MERRA shows the smallest pressure variation difference w.r.t MERRA-2 (bottom panel of Fig. 12). Grid points with difference of smaller than 20 millibar (mbar) account for 93.8%, including the entire ocean. ERA-Interim exhibits slightly bigger difference in the mountain regions compared with MERRA (the third panel of Fig. 12), while NCEP-R-2 and NCEP-R-1 (top and the second panel of Fig. 12) present much bigger pressure variation difference w.r.t MERRA-2 in ice-covered areas, like Greenland and Antarctica, as well as all the other mountain regions, especially for the NCEP-R-2 product. In particular, MERRA-2 exhibits much bigger pressure variation (> 360 mbar) in Mount Roraima (South America) than MERRA and ERA-Interim, while both the NCEP products show the maximum positive difference of more than 700 mbar w.r.t MERRA-2 in the New Guinea Highlands (Oceania). Our observed pressure difference between NCEP and ERA-Interim is more than ten times bigger than that from Salstein et al. (2008). The big discrepancy should be caused by the different surface topographic heights assumed for the two models. If referenced all the five SP models into the same surface topography, the pressure differences should be at least ten times smaller. However, the observed global trend of the pressure difference between models here is correct. Due to the less well-constrained analyses, larger pressure differences between models tend to occur in the ice-covered area and the high mountainous regions (Salstein et al. 2008).

Compared with the spatial pattern of the model differences described in the previous sections, we can therefore answer the question in Sect. 5.1 that the bigger model differences for coastal/island stations are mainly due to IB correction based on the different sparse land–sea masks. This is one of the deficiencies in this research. Currently, the Special Bureau for the Atmosphere (SBA) under the Global Geophysical Fluid Center (GGFC) within the International Earth Rotation and Reference Systems Service (IERS) provides operational ATML displacement for GNSS sites worldwide from NCEP and ECMWF.Footnote 11 They use a much more sophisticated treatment of the coastal zone based on the IB assumption, e.g., a land–sea mask with 0.25° for NCEPFootnote 12 or 0.1° for ECMWF,Footnote 13 and thus could represent more realistic displacement for coastal/island sites. To avoid the shortcoming of our ATML calculation, readers could download the SBA provided ATML displacement for stations near the coasts from different SP models and compare their differences instead. Moreover, with respect to those stations far away from coasts, Fig. 12 confirms that slightly lower correlations exhibit at higher altitude with bigger pressure variations. Since most of our selected GNSS sites are located at very low altitude, further research is still required to focus on the ATML effects in the high mountainous regions, e.g., the Qinghai–Tibet Plateau in China, the Andes in South America, etc.

6 Summary

ATML displacements contribute to the nonlinear variations in the GNSS coordinate time series. To remove this environmental signal from the GNSS data, SP models are required to predict ATML displacements. Based on Farrell’s Green’s function approach, we modeled the 3D surface displacements induced by the latest MERRA-2 SP grids, together with the other four products (NCEP-R-1, NCEP-R-2, ERA-Interim and MERRA) over the period January 1, 2000–February 29, 2016, at 596 GNSS stations from the ITRF2014 network. Our results confirm that SP variations induce only small displacements in the low- and mid-latitude oceanic and coastal regions, while most continental stations far away from coasts in the Northern Hemisphere exhibit ATML displacement scatters of larger than 3 mm for the Up component. The five sets of ATML displacement are highly consistent with each other, among which MERRA exhibits the highest correlation with MERRA-2, NCEP-R-1 and NCEP-R-2 rank second and third, while ERA-Interim shows the lowest correlations, with only several coastal/island stations having correlation of lower than 0.9.

Comparing the modeled ATML displacements with daily ITRF2014 residual time series, we find that all five SP products perform well in correcting the scatter of GNSS height, and the percentage of stations with WRMS reduced improves by more than 10% compared with previous results, due to our used latest-generation, reprocessed GNSS time series. Depending on the different assimilation methods, ERA-Interim performs best, reducing the scatter for 90.3% of the GNSS height, with maximum improvement of 35.2%. NCEP-R-1 and NCEP-R-2 do slightly worse (89.3%, 89.1%), MERRA ranks fourth (86.4%), while MERRA-2 exhibits the worst performance (85.1%), although it is the most recent advanced product. Most stations (> 67%) also exhibit horizontal WRMS reductions, but the improvement is usually less than 5%. Further improvement would be expected if correcting the ATML effect at the observation level rather than as daily averaged a posteriori corrections here.

Compared with MERRA-2, more than half of the stations show larger WRMS reductions in the Up direction using MERRA and ERA-Interim. However, MERRA-2 performs better in reducing the horizontal scatter than ERA-Interim, as well as 3D scatter than both NCEP products for more than half stations. In particular, the majority of stations in South America, South Africa, Oceania and the southern oceans show larger WRMS reductions with MERRA-2 than with any other SP datasets, while all other four SP datasets enable larger WRMS reduction in the Up component than MERRA-2 in Europe.

The model difference w.r.t MERRA-2 becomes remarkably smaller for inland stations only, with the lowest correlations in the Up component with MERRA-2 of larger than 0.91 for all the four datasets, while more than 96% of the inland stations exhibit WRMS reduction for all the five SP datasets in the Up direction, and the average WRMS reduction is also improved by 3–4%. Through comparison of the daily pressure variation between models, we conclude that the bigger model differences for coastal/island stations are mainly due to the different sparse land–sea masks used during IB correction. In future work, a unique high spatial resolution land–sea mask should be applied during comparison, so that the model differences would come from only SP grids. Further investigation is also required to evaluate the ATML effect in Greenland, Antarctica and other mountain areas where larger pressure differences occur among different SP models, e.g., the Qinghai–Tibet Plateau in China.

Finally, only ATML effect is considered here. Taking into account the hydrology loading and the non-tidal ocean loading would help reduce the GNSS scatter more, but not the full remaining part. In-depth research is still expected to investigate the possible reasons that may cause the inconsistency between loading and the GNSS displacement, e.g., the temperature gradient, etc.