1 Introduction

Radio waves are delayed and bent as they pass through the neutral atmosphere due to interaction with gas molecules. The resulting measurement error is called tropospheric delay (Vey et al. 2006; Boisits et al. 2020). One of the most accurate ways of obtaining tropospheric delay is ray tracing through numerical weather models (NWMs) (Landskron and Böhm 2018; Niell 2000; Ding et al. 2023). As the position of the satellite changes rapidly, ray tracing is challenging to compute for every radio wave observation. To quickly obtain the tropospheric delay at any elevation angle, also known as the slant tropospheric delay (STD), it is customary to calculate the tropospheric delay in the zenith direction (ZTD) and then multiply it by the mapping function (MF).

In GNSS meteorology, the accuracy of estimates for precipitable water vapor (PWV) depends on the accuracy of the zenith wet delay (ZWD), i.e., the wet component of the ZTD (Ding et al. 2022). Researchers have developed a variety of zenith delay models, including European Geostationary Navigation Overlay Service (EGNOS) model (Penna et al. 2001), Global Pressure and Temperature (GPT) series models (Böhm et al. 2007; Landskron and Böhm 2018), TropGrid series models (Krueger et al. 2004; Schüler 2014), SHAO and SHAtrop series models (Song et al. 2011; Chen et al. 2020), etc. These models generally perform well in evaluation studies (Zhang et al. 2016; Ding and Chen 2020). In contrast, work on MF modeling has been scarce and rather dates back to the end of the last century, such as Chao (1974), Marini (1972), Black and Einser (1984), MTT (Herring 1992), and NMF (Niell 1996). These models are based on standard atmospheric data at sea level or regional sounding data. Others simply provide reference latitudes and seasons in tabular form, such as Black (1978), CfA2.2 (Davis et al. 1985), Foelsche and Kirchengast (2002), UNBabc and UNBab (Guo and Langley 2003), UNB (Urquhart et al. 2014) model, etc.

Niell (2000) proposed the Isobaric Mapping Function (IMF), the first mapping function established by the NWMs ray tracing method, which greatly improved the MF precision. Drawing on this method, Böhm and Schuh (2004) established the Vienna Mapping Function (VMF) and overcome some limitations of the IMF in the wet delay. Böhm et al. (2006) designed the widely used Vienna Mapping Function 1 (VMF1), which has been considered to be the most accurate MF available (Landskron and Böhm 2018). With ensuing adaptations for real-time purposes, the forecast VMF1 model was designed by Böhm et al. (2009). Landskron and Böhm (2018) designed VMF3, which eliminates the deficiencies in the empirical coefficients b and c and in the tuning for a specific elevation angle 3° in the VMF1, thus approaching the underlying ray-traced delays more accurately at low elevation angles. By using different NWMs, alternative versions of VMF1 and VMF3, such as UNB-VMF1 (Urquhart and Santos 2011), GFZ-VMF1 (Zus et al. 2015) and GFZ-VMF3, have been derived. In addition, Gegout et al. (2011) introduced the concept of adaptive mapping functions (AMF). The rationale of this approach is to extend the azimuthal dependency of the coefficients through a Fourier series, thereby incorporating a multi-scale azimuthal decomposition. This method is capable of providing GPS range corrections at the measurement level with millimeter precision, even at low elevations. Balidakis et al. (2018) derived a new mapping function GFZ-PT, which describes diurnal and sub-diurnal in addition to long-wavelength variations and provides harmonic functions of ray tracing-derived gradients. Zhang et al. (2021) designed the TMF model assuming an angle between the tropospheric zenith direction and the geometric zenith direction. Zhou et al. (2021) proposed an improved tropospheric mapping function modeling method by adaptively determining the number of coefficients to be estimated in the continuous fraction of the mapping function based on the convergence of a least squares fit.

Overall, numerous zenith delay models and MF models are at hand and provide very accurate tropospheric delays (Kouba 2008, 2009; Yuan et al. 2019; Qiu et al. 2020). However, few of these models consider vertical information of tropospheric delays, which means that most of their applications are limited to the Earth surface. Dousa and Elias (2014) proposed an enhanced model for calculating the tropospheric wet delay by introducing a new parameter for the exponential decay of ZWD. When combined with a reference value, this parameter accurately represents ZWD corrections up to an altitude of 10 km. Li et al. (2018) designed two improved zenith tropospheric delay models, IGGtrop_SH and IGGtrop_rH, based on reanalysis of atmospheric data, which were built based on empirical vertical scaling functions. Validation results showed an average accuracy of 3.86 cm and 3.97 cm, respectively. Balidakis (2019) got hydrostatic and wet mapping factor differences at the orography of the fifth generation of the European Centre for Medium-Range Weather Forecasts (ECMWF) ReAnalysis (ERA5) and virtual orographies 25 m, 100 m, and 500 m above, it is up to ± 0.008 at 500 m height. Zhu et al. (2022) developed the GZTD-P model based on the global ERA5 using a segmentation function to vertically adjust the total zenith delay. The authors verified the accuracy of their model using other atmospheric reanalysis data and sounding station data. Wang et al. (2022) developed a new method to represent NWM-derived tropospheric zenith hydrostatic and wet delays, parameterized by surface values and an additional two or three coefficients, to achieve 1–2 mm precision when reconstructing the delays as NWM-determined delays at any altitude. In contrast to with zenith delay vertical modeling, very little research has been done on the MF vertical modeling. Niell first proposed a linear MF height correction model in 1996, but only for the hydrostatic part (Niell 1996). Qu et al. (2022) identified the shortcoming of constant coefficients in Niell's model and designed a quadratic polynomial to improve the MF coefficients. However, Qu et al. (2022) similarly ignored the wet and restricted the modeling range to altitudes of 0–5 km. In addition, the model was evaluated mainly at the Earth's surface, but not rigorously for the ocean and overhead regions. Yet such evaluation is indispensable today when cm-precision air-borne aircraft activities such as aerial refueling and UAVs traversing jungle obstacle avoidance are on the rise. Another factor to consider is that both linear and quadratic polynomial model are simply approximate mathematical representations of experimental values.

In this contribution, we aim to provide MF height correction models with a global coverage, from the Earth surface to 14 km of altitude, thus comprising the range of most aircraft activities and meteorological events. The models are formulated in a simple and effective way, while minimizing the loss of precision. To simulate the vertical variation of the coefficient \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) of the tropospheric delay MF, a novel method is presented, enabling the user to quickly and easily obtain the precise MF. The derivation of the methods is presented in Sect. 2 and the assessment of model performance is presented in Sect. 3. Finally, in Sect. 4, we summarize the work and present the conclusions.

2 Vertical modeling of tropospheric zenith delay mapping functions

In this section, we derive an extensible exponential model of the mapping function coefficients from an exponential function model of the zenith tropospheric delay. We highlight the shortcomings of the Niell height correction model, along with the fact that Niell's linear model was actually an approximation of the exponential model, and then propose an extensible polynomial model as an experimental reference. Finally, a grid point is chosen to demonstrate the new model’s performance. Note that this work utilizes the open-source software package RADIATE, developed by the Technische Universität Wien (TU Wien, TUW), to perform ray tracing. The latest VMF3 products are currently generated by this software (Hofmeister 2016; Landskron and Böhm 2018). Program RADIATE uses the Euler radius of curvature, with the gravity acceleration from the more accurate Kraus (2001) formula. More than half of the baselines have a better BLR (Baseline Length Repeatability) at sub-mm difference compared to the NSSA GSFC (http://lacerta.gsfc.nasa.gov/tropodelays) ray tracing (Hofmeister and Böhm 2017).

2.1 Modeling the mapping functions vertical profile

The total delay time experienced by radio waves traveling through the neutral atmosphere at the observation elevation angle \(\varepsilon\) is usually modeled with the following formula (Davis et al. 1985):

$$ \Delta L(\varepsilon ) = {\text{ZHD}} \cdot mf_{{{h}}} (\varepsilon ) + {\text{ZWD}} \cdot mf_{w} (\varepsilon ) $$
(1)

Here \({\text{ZHD}}\) and \({\text{ZWD}}\) are delays of the hydrostatic part (ZHD) and wet part respectively, in the zenith direction, and \(mf(\varepsilon )\) is the mapping function. The mapping function represents the ratio of the delay on the slant path to the delay in the zenith direction. Its analytical expression draws on three basic coefficients \(a\), \(b\) and \(c\):

$$ mf(\varepsilon ) = \frac{{1 + \frac{a}{{1 + \frac{b}{1 + c}}}}}{{\sin (\varepsilon ) + \frac{a}{{\sin (\varepsilon ) + \frac{b}{\sin (\varepsilon ) + c}}}}} $$
(2)

In a discrete mapping function, the coefficient \(a\) is determined from an NWM; while, the coefficients \(b\) and \(c\) depend on empirical functions.

Wang et al. (2022) proposed a vertical approximation for zenith delay via the barometric formula, which maintains a precision of 1–2 mm on a global scale. The relationship between zenith delay and height is as follows:

$$ {\text{ZD}}_{h} = {\text{ZD}}_{0} \cdot e^{{\sum\limits_{i = 1}^{n} {\left( {m_{i} \cdot h^{i} } \right)} }} ,\quad n = 1,2,3 $$
(3)

where \({\text{ZD}}_{h}\) and \({\text{ZD}}_{0}\) are the zenith delays at geodetic height \(h\) and 0, respectively, and \(m_{i}\) are the coefficients of the exponential function. The STD is the product of the zenith delay and the mapping function; while, the STD is not exactly corrected by the zenith delay for height correction, which suggests that the mapping function needs to be corrected. Assume that a function \(f(h)\) exists for the following relationship between the mapping functions at height \(h\) and 0:

$$ mf_{h} = mf_{0} \cdot f(h) $$
(4)

To hold Eqs. (1) and (3), \(f(h)\) should be an exponential function. Thus, the mapping function at height \(h\) and 0 can be expressed by the following formula (note that the exponential coefficients are different from those of Eq. 3):

$$ mf_{h} (\varepsilon ) = mf_{0} (\varepsilon ) \cdot e^{{\sum\limits_{i = 1}^{n} {\left( {m_{i} \cdot h^{i} } \right)} }} ,\quad n = 1,2,3 $$
(5)

The mapping function coefficient \(a\) can be derived from Eq. (2):

$$ a = - \frac{mf(\varepsilon ) \cdot \sin (\varepsilon ) - 1}{{\frac{mf(\varepsilon )}{{\sin (\varepsilon ) + \frac{b}{\sin (\varepsilon ) + c}}} - \frac{1}{{1 + \frac{b}{1 + c}}}}} $$
(6)

Inserting Eqs. (5) into (6), the relationship between the coefficient \(a_{h}\) at height \(h\) and the coefficient \(a_{0}\) at height 0 can be obtained:

$$ \frac{1}{{a_{h} }} = \frac{1}{{a_{0} }} \cdot e^{{\sum\limits_{i = 1}^{n} {\left( {m_{i} \cdot h^{i} } \right)} }} + \frac{{e^{{\sum\limits_{i = 1}^{n} {\left( {m_{i} \cdot h^{i} } \right)} }} - 1}}{{1 + \frac{b}{1 + c}}},\quad n = 1,2,3 $$
(7)

The term after the plus sign is approximately two orders of magnitude smaller than the term before it. Therefore, we propose the following simplified vertical approximation formula for the coefficient \(a\) of the mapping function:

$$ a_{h} \approx a_{0} \cdot e^{{\sum\limits_{i = 1}^{n} {\left( {m_{i} \cdot h^{i} } \right)} }} ,\quad n = 1,2,3 $$
(8)

where \(a_{h}\) and \(a_{0}\) are mapping function coefficient at height \(h\) and 0, respectively (The exponential coefficients are different from those of Eqs. 3, 5, 7, and 8). Note that when the subscripts h and w appear together, h and w refer to hydrostatic and wet, respectively, otherwise h refers to height.

2.2 The height correction of Niell

Niell (1996) proposed an empirical tropospheric delay MF height correction model (called Niell model in this paper), and its coefficients are followed by VMF1 (Böhm et al. 2006) and its successor VMF3 (Landskron and Böhm 2018). In VMF1 and VMF3, the versions with height correction are vmf1_ht and vmf3_ht respectively. The mapping function after height correction is as follows:

$$ mf_{h} (\varepsilon ) = mf_{0} (\varepsilon ) + \left( {\frac{1}{\sin \varepsilon } - \frac{{1 + \frac{{a_{{{\text{ht}}}} }}{{1 + \frac{{b_{{{\text{ht}}}} }}{{1 + c_{{{\text{ht}}}} }}}}}}{{\sin \varepsilon + \frac{{a_{{{\text{ht}}}} }}{{\sin \varepsilon + \frac{{b_{{{\text{ht}}}} }}{{\sin \varepsilon + c_{{{\text{ht}}}} }}}}}}} \right) \times \frac{h}{1000} $$
(9)

where the constants \(a_{ht} \, = \, 2.53 \times 10^{ - 5}\), \(b_{ht} \, = \, 5.49 \times 10^{ - 3}\) and \(c_{ht} \, = \, 1.14 \times 10^{ - 3}\) define the correction. These coefficients are constants, calculated by least squares using values at 9 elevation angles and then spatially averaged over 5 latitudes and temporally over 9 standard profiles (Niell 1996; Qiu et al. 2020). In addition, due to the assumption of the wet MF being independent of the station height (Qu et al. 2022), the Niell model, and the corresponding VMF1 and VMF3 versions, only perform height correction on the hydrostatic delay part.

The Niell model is a linear correction model with only three parameters, \(a_{ht}\), \(b_{ht}\), and \(c_{ht}\), as mentioned above. The Niell model tries to characterize the variation of MF with height at any location on the globe using only these three parameters. Its correction is very crude and rapidly deteriorates in accuracy with height. In addition, the Niell model only considers the hydrostatic part and does not model the wet part. We take wet MF into consideration, set the modeling object as MF coefficient \(a\), and then expand the linear model to a polynomial model:

$$ a_{h} = \sum\limits_{i = 0}^{n} {\left( {m_{i} \cdot h^{i} } \right)} ,\quad n = 1,2,3 $$
(10)

Note that the Niell model is a liner model but it is not equivalent to Eq. (10) when n is equal to 1. The polynomial model is actually an approximation of the exponential model. We model the MF coefficients \(a_{h}\) and \(a_{w}\) using second- and third-order polynomials, as well as exponential models of order 1 to 3. These solutions using these different models are denoted in the following by Poly2, Poly3, Exp1, Exp2 and Exp3 respectively.

We selected a grid point (45.5° S, 82.5° W) and calculated the MF coefficients \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) for this site at a total of 54 levels of height at 200 m or 300 m intervals vertically from 0 to 14 km, and then fitted the MF coefficients using the Niell and the five solutions mentioned above. Given atmospheric anisotropy, the MF coefficients for each height level are obtained by least squares of 7 elevation angles and averaged over 8 azimuths (same in the VMF3 strategy). The calculation is performed by ray tracing through ERA5 NWM. Figure 1 shows the results, with the upper panels depicting the fit results for \(a_{{\text{h}}}\) (left) and \(a_{{\text{w}}}\) (right) and the middle panels showing the relative residuals and the lower panels showing the slant hydrostatic delay (SHD) and slant wet delay (SWD) residuals at elevation angle 3°. It should be noted that as the coefficients \(a\) themselves are parameters with no magnitude or physical meaning, the numerical size of the residuals itself is hard to reflect the fitting effect, so we use the relative values to represent them. The relative residuals here are the difference between the model values minus the NWM values as a percentage of the NWM values. Furthermore, we found that for at altitudes above a certain value (about 6–8 km in the equatorial region and about 1–3 km in the polar regions), the variations in the wet delay MF residuals became erratic, since at these altitudes the vast majority of ZWD values were very small (< 0.1 mm) and the water vapor was already negligible. We did not use data with ZWD < 0.1 mm in the experiment because we think the precision of the reanalysis data may have reached its maximum. This problem may be solved in the future as NWM accuracy and number of layers continue to increase.

Fig. 1
figure 1

Tropospheric delay mapping function coefficients vertical profiles (black dots) and the polynomial and exponential approximations (upper), the relative fitting residuals (middle), and the slant delay residuals at elevation angle 3° (lower). The left panels and right panels are the hydrostatic part and the wet part respectively. Note that they have different scale ranges

In Fig. 1, both the polynomial and exponential solutions fit well when the order is second order and above, with relative \(a_{{\text{h}}}\) residuals within ± 1% and relative \(a_{{\text{w}}}\) residuals within ± 4%. The Niell solution performs well at low altitudes as the MF coefficients are very close to linear in this region, but once the altitude is greater than 8 km, the precision of the Niell solution deteriorates rapidly, with the relative residuals of the \(a_{{\text{h}}}\) being close to 10% at 14 km altitude. The Exp1 solution performs generally better than the Niell solution, as the fit considers the entire altitude data. The Exp1 solution, like the Niell solution, features a growth in the relative \(a_{{\text{h}}}\) or \(a_{{\text{w}}}\) residuals with increasing heights, while the rest of the solutions do not show such behavior. This phenomenon of MF coefficients with heights is not reflected in the slant wet delay residuals because the water vapor gradually thins to disappear in the experimentally selected height range, causing the zenith wet delay values to become very small. From the SHD/SWD residuals, it can be found that the model precision increases with increasing order, and the residuals of the slant path delay residuals are already very close to 0 in the third-order Exp3 and Poly3 solutions. Note that since the Niell model only corrects the height for hydrostatic MF coefficient, the Niell estimate in the right panels of the figure is the result of not making any corrections.

The precision of the STD is the key indicator of the model's performance since the goal of MF height correction is to increase its precision. However, the relationship between STD precision and elevation angle is not linear, so it is incomplete to select only the STD precision at a certain elevation angle for representing the model precision. Therefore, we use the relative \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) residuals directly as the evaluation metric in the next section (Another reason is that the STD precision at elevation angle 3° does not show significant spatiotemporal dependence). It should be noted that the SHD/SWD residuals in Fig. 1 are the residuals when the ERA5 ray tracing ZHD/ZWD values are used, and if using the model ZHD/ZWD values, this part of the model error will be amplified and cannot reach the precision level in Fig. 1.

3 Validating the method

In this section, we extend the experiments in Sect. 2.2 to a global 5° × 5° grid, at the first epoch of each day, at 00:00 UT of 2021, and then count the global distribution of modeling errors for the six solutions. Next, the variation of errors with latitude, altitude and time is analyzed and the precision of the polynomial and exponential models is assessed for the Earth surface using TUW’s site-wise VMF1 and VMF3 GNSS products. Note that the NWM \(a_{{\text{h}}}\) or \(a_{{\text{w}}}\) in this paper is obtained by 1° × 1° ERA5 ray tracing at 8 azimuthal angles and 7 elevation angles by least squares method. The azimuth and elevation angles used and the empirical MF coefficients \(b\) and \(c\) are the same as in the VMF3. Also note that this height correction coefficients, like VMF1 and VMF3, is provided per grid point and epoch. In this study the Earth surface refers to the orography of VMF 5° × 5° grid point coordinates (based on the model ETOPO5). There are two reasons as to why we did not choose a 1° × 1° grid for this experiment. First, the number of ray calculations on a 5° × 5° grid for the current experiment is already on the order of billions, and our arithmetic power is limited. Second, and most importantly, our aim is to get high precision MFs overhead, while the size of the grid's density has little bearing on the experiment's outcomes or conclusions. The validation in this study was performed on ERA5 data, but the height correction method can also be performed on other NWMs, as there are changes in the mapping function with height that are determined by the structure of the atmosphere. Performing the calculations on other NWMs may involve values that differ from those in the present manuscript, but not to an extent that they would change the conclusions.

3.1 Modeling precision of mapping function models

To count the relative biases throughout the year and for the total altitudes, we use the relative absolute residual as an assessment indicator. It represents the modeling average precision loss and refers to the absolute value of the bias as a percentage of the ERA5 ray tracing value. The indicator is calculated as shown in Eq. (11) below, where the level refers to the height level. The results of the Niell model, Poly2 and Poly3 are shown in Fig. 2 and the results of Exp1, Exp2 and Exp3 are shown in Fig. 3. Note that we use different scales for the panels for better visualization.

$$ {\text{indicator}} = \frac{{\sum\nolimits_{i = 1}^{{N_{{{\text{level}}}} }} {\left| {\frac{{a_{{{\text{model}},i}} - a_{era5,i} }}{{a_{era5,i} }}} \right|} }}{{N_{{{\text{level}}}} }} \times 100\% $$
(11)
Fig. 2
figure 2

Relative absolute residuals of mapping function coefficients \(a_{{\text{h}}}\) (left) and \(a_{{\text{w}}}\) (right) under Niell (upper), Poly2 (middle) and Poly3 (lower) solution during 2021. Note the different scales for the panels for better visualization

Fig. 3
figure 3

Relative absolute residuals of mapping function coefficients \(a_{{\text{h}}}\) (left) and \(a_{{\text{w}}}\) (right) under Exp1 (upper), Exp2 (middle) and Exp3 (lower) solution during 2021. Note the different scales for the panels for better visualization

In the Niell and Exp1 solutions, the relative absolute residual of the \(a_{{\text{h}}}\) fitting (upper left panels of Figs. 2 and 3) shows a latitudinal dependence with notably greater values in the high latitude regions than in the low latitude regions. The relative absolute residual in the Exp1 solution is smaller than that in Niell solution, featuring the upper limits of the distribution range of 1.25% (\(a_{{\text{h}}}\)) and 4.33% (\(a_{{\text{w}}}\)) respectively. The results for the Poly2 and Exp2 solutions (middle left panels of Figs. 2 and 3) are very similar in that they both show a clear topographic effect, with the relative absolute residual of the \(a_{{\text{h}}}\) fitting being smaller at higher altitudes than at lower altitudes, and the range of values for both solutions being similar, with upper limits of 0.3% and 0.35% respectively. The Poly3 and Exp3 solutions (lower left panels of Figs. 2 and 3) are also very similar, with the upper limit of the numerical distribution dropping further to around 0.19% for both, but the topographic effect giving way to the latitudinal dependence as the Niell and Exp1 solutions. These phenomena can be attributed to the varying rates of MF decline with altitude across different latitudes. They are also associated with the fewer number of height layers present in regions of high altitude.

In comparison to the outcomes of \(a_{{\text{h}}}\), the relative absolute residual of \(a_{{\text{w}}}\) has no more symmetry between the northern and southern hemispheres. As can be seen from the Niell solution (upper right panels of Fig. 2), without height correction, the values of residual are very large, up to twice the ERA5 ray tracing value, particularly in the middle and high latitudes of the northern hemisphere, and much bigger than in other regions. The relative \(a_{{\text{w}}}\) absolute residual is reduced to less than 5% with the Exp1 solution, despite its simplicity. The results of Poly2 and Exp3 are also very similar, with the upper limits of their numerical distributions being about 2.5% and 3% respectively. The results of Poly3 are reduced to 1.6% from that in Poly2, while in Exp3 it is reduced very little from that in Exp2, to only about 2.7%. The \(a_{{\text{w}}}\) results both show very weak topographic characteristics, with slightly smaller areas at higher elevations than at lower elevations. These characteristics in the distribution of the \(a_{{\text{w}}}\) residual reflect the sensitivity of the coefficient's precision to the water vapor content, which varies randomly with height and lacks the hydrostatic atmosphere's imprint of topography and latitude.

3.2 Modeling precision at different latitudes and altitudes

Given the results in the previous section, we have mapped the characteristics of the distribution of precision at different latitudes and altitudes more clearly. In addition, we carried out a statistical analysis by time. The mean global relative absolute residual for the six solutions were tabulated by altitude, latitude and day of years and the results are plotted in Fig. 4. As the Niell solution has a much larger residual span than the other solutions, the y-axis is broken in some panels for better visualization.

Fig. 4
figure 4

Modeling precision of hydrostatic MF coefficient \(a_{{\text{h}}}\) (left) and wet MF coefficient \(a_{{\text{w}}}\) (right) using 6 solutions at different altitudes, different latitudes and different day of year of 2021 (Northern hemisphere and southern hemisphere). The vertical black dotted line and the vertical black solid line are the dates of the summer and winter solstices, respectively

The upper panels of Fig. 4 show the altitude dependence of the modeling precision. In the Niell solution, the \(a_{{\text{h}}}\) relative absolute residual rises exponentially directly from 0% at the surface to ~ 9% at 14 km, while the \(a_{{\text{w}}}\) residual rises linearly from 0% at the surface to ~ 60% at ~ 6 km before fluctuating up and then down. The coefficient \(a_{{\text{h}}}\) precision for the Exp1 solution fluctuates within 0–2%; while, \(a_{{\text{w}}}\) precision fluctuates significantly between 6 and 10 km, and is relatively flat at other altitudes, less than 3%. The coefficient \(a_{{\text{h}}}\) precision for the Poly2 and Exp2 solutions varies very little between altitudes, both fluctuating between 0 and 0.5%. In contrast, the Poly3 and Exp3 solutions \(a_{{\text{h}}}\) precision showed little difference, both fluctuating between 0 and 0.2%. In addition, there is almost no difference in the \(a_{{\text{w}}}\) precision of the Poly2, Poly3, Exp2, Exp3 solutions, and it is barely noticeable that the 3rd order model is more accurate than the 2nd order.

The middle panels of Fig. 4 show the latitude dependence of the modeling precision. For \(a_{{\text{h}}}\), the Niell solution and the Exp1 solution are similar in that they both exhibit a ‘U’ shape, i.e., precision decreases with increasing latitude; Niell solutions precision is approximately 0.5% at equatorial regions and 4% at high latitudes; while, the two values for the Exp1 solution are approximately 0.3% and 1.2% respectively. Poly2 and Exp2 precision are very similar across latitudes, and Poly3 and Exp3 precision are also very similar across latitudes, with no significant latitudinal correlation for any of the four solutions. The two second-order solutions have precision of approximately 0.3% and the two third-order solutions are less than 0.1% at most latitudes. For \(a_{w}\), the Niell solution precision rises from about 20% at 90° S to about 40% at 60° S, remains largely at about 40% between 60° S and 30° N, then rises to about 100% between 30° N and 45° N, and finally drops to 50% between 45° N and 90° N. The latitude dependence is weak for the remaining five solutions of \(a_{{\text{w}}}\), with the Exp1 solution being slightly more correlated than the others.

The lower panels of Fig. 4 are divided into northern hemisphere and southern hemisphere two parts to show the time dependence of the modeling precision. The \(a_{{\text{h}}}\) precision of the different solutions did not differ markedly between the northern and southern hemispheres. The \(a_{{\text{h}}}\) precision of the Niell solution had a seasonal pattern, which was higher in summer than in winter in the northern hemisphere and higher in winter than in summer in the southern hemisphere. However, the other five solutions had no significant seasonal pattern. From the performance of relative residuals in the northern and southern hemispheres, the \(a_{{\text{w}}}\) precision of different solutions has significant seasonal pattern in the northern Hemisphere, with Niell solution having higher summer precision than winter, and other solutions having lower summer precision than winter precision. In the southern hemisphere, only the \(a_{{\text{w}}}\) precision of the Niell solution showed significant seasonal characteristics, with the same pattern as in the northern Hemisphere, with higher summer precision than winter precision. In addition, both \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) precision of the 5 solutions proposed in this paper are very stable throughout the year, and the higher the order, the better the precision. The relative absolute residuals of \(a_{{\text{h}}}\) at the first, second and third orders are about 0.8%, 0.2% and 0.1% respectively, and the relative absolute residuals of \(a_{{\text{w}}}\) at the first, second and third orders are about 1%, 0.4% and 0.25% respectively. Overall, these evaluations suggest that the newly introduced method exhibits both spatial and temporal stability. This further implies that there is no immediate need for higher grid resolution and temporal resolution.

To investigate whether our modeling method has sufficient fidelity also in other years, we selected the data on the first day of every month for a total of 10 years from 2012 to 2021 to model the MF coefficients \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) vertically, and obtained the modeling precision of the hydrostatic MF coefficient \(a_{{\text{h}}}\) (Fig. 5) and the wet MF coefficient \(a_{{\text{w}}}\) (Fig. 6) under six solutions. Figures 5 and 6 are both box plots, with the red point being the median, and the top of the black dashed line, the top of the blue bar, the bottom of the blue bar, and the bottom of the black dashed line, being the upper limit, the upper quartile, the lower quartile, and the lower limit, respectively.

Fig. 5
figure 5

Model precision of hydrostatic MF coefficient \(a_{{\text{h}}}\) using 6 solutions at year of 2012–2021. Each box in the boxplot diagram represents the result of an epoch. Note that the first-order Niell and Exp1 scales are larger than others

Fig. 6
figure 6

Model precision of wet MF coefficient \(a_{{\text{w}}}\) using 6 solutions at year of 2012–2021. Each box in the boxplot diagram represents the result of an epoch. Note that the first-order Niell and Exp1 scales are larger than others

Figures 5 and 6 show that the model precision is fairly consistent across years with no discernible inter-annual variations. This suggests that the models put forth in this work are appropriate for modeling longer time series of data. Figure 5 demonstrates that the Niell solutions has a significant periodic signal in the model precision, whereas the other solutions do not. This further demonstrates that the method proposed in this paper can effectively simulate the seasonal variations in the vertical profiles of the coefficient \(a_{{\text{h}}}\). While the results for the coefficient \(a_{{\text{w}}}\) under the various solutions (Fig. 6) are not as substantial as those for the coefficient \(a_{{\text{h}}}\), it is nevertheless evident that as the model order rises, both the median of model precision and the amplitude of the periodic signal are diminished.

In the preceding analysis, comparison of \(a_{{\text{h}}}\) or \(a_{{\text{w}}}\) relative residuals shed light on modeling effects between different models and the spatial–temporal correlation characteristics of the modeling residuals. However, a more intuitive depiction of the modeling effect would be desirable. Therefore, we counted the root-mean-square errors (RMSE) of different models at an elevation angle of 3° and plotted them in Fig. 7. Note that the RMSE here uses the slant tropospheric delay of the ERA5 ray tracing at an elevation angle of 3° as a reference. In Fig. 7, the blue error bar represents the standard deviation of the time series of the global mean of the RMSE. Evidently, the difference in the RMSE of the slant tropospheric delay between the different solutions is not as significant as the difference in the relative residuals between the different solutions. There are two reasons for that. First, the statistics in Fig. 7 represent average results of global and all height levels, and second, the zenith delay becomes smaller in the high-altitude region, such that the large \(a_{{\text{h}}}\) or \(a_{{\text{w}}}\) residuals do not show up as large SHD/SWD residuals. It is important to note that this result is based on the accurate zenith delay. Such high precision cannot be achieved if the zenith delay is obtained using an empirical model.

Fig. 7
figure 7

RMSE of slant hydrostatic delay (left) and slant wet delay (right) using 6 solutions at elevation angle 3° of 2021. The blue error bar in the figure is the standard deviation of the corresponding model throughout the year

3.3 Modeling precision with site-wise VMF1 and VMF3

In Sects. 3.1 and 3.2, we have presented an internal precision validation. To obtain external precision validation for the new method, it is necessary to introduce external data to evaluate the model. As there are no overhead vertical MF tropospheric delay products available, it cannot be evaluated in the full vertical plane currently. To facilitate comparisons with existing studies and to further guarantee the precision of the experiment, the site-wise GNSS VMF1 and VMF3 products from TUW were used to assess the precision of the new model at ground level. The method of calculating the \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) of GNSS site locations using the new models is the same as that of the VMF3, i.e., the four nearest grid points to the site are first obtained, then the MF coefficients of the four grid points at the site height are calculated by the new models, and finally the coefficients are interpolated to the site locations. Since there are no GNSS stations with latitude greater than 87.5°N and 87.5°S in this paper, nearest neighbor interpolation is not used, but bilinear interpolation is favored. The Niell model is not discussed here as it relies on ground data and is a ground-based model. The evaluation results for each site are shown in Fig. 8 (VMF1) and Fig. 9 (VMF3), and the statistical results are shown in Fig. 8. To facilitate comparison with previous studies, we use the RMSE of the slant delay at an elevation angle of 3° as a precision indicator. Note that NWM differences can also introduce differences in results on the millimeter level (Nikolaidou et al. 2018). To concentrate on the performance of the MF height correction models and to minimize the influence of other factors, a consistent approach was adopted in our comparisons. When the VMF1 product was used as a reference, all zenith delays were derived from the VMF1 product. When the VMF3 product was used as a reference, all zenith delays were obtained from the VMF3 product.

Fig. 8
figure 8

RMSE of slant hydrostatic delay (left) and slant wet delay (right) at elevation angle 3° using site-wise VMF1 GNSS as reference

Fig. 9
figure 9

RMSE of slant hydrostatic delay (left) and slant wet delay (right) at elevation angle 3° using site-wise VMF3 GNSS as reference

In the evaluation of the coefficient \(a_{{\text{h}}}\) (left panel of Fig. 8), using the VMF1 products as reference, the Exp1 solution of \(a_{{\text{h}}}\) has the worst precision, especially in the mid to high latitudes of the northern hemisphere and the high latitude regions of the southern hemisphere. The precision of \(a_{{\text{h}}}\) in other regions is very similar to those from the other four solutions. Poly2 and Exp2 solutions can be seen to be slightly less accurate than Poly3 and Exp3 solutions, mainly in the west coast of South America and throughout North America. For the coefficient \(a_{{\text{w}}}\) (right panel of Fig. 8), there is no significant difference between the five solutions, and the stations with poorer precision are mainly located in the Greenland, east-central North America, central South America and the Pacific.

The results for \(a_{{\text{h}}}\) (left panel of Fig. 9) with the VMF3 products as reference are very similar to those for \(a_{{\text{h}}}\) with the VMF1 products as reference. However, fewer stations feature poorer precision, and the 3rd order solutions Poly3 and Exp3 have better precision. The results of \(a_{{\text{w}}}\) are consistent in distribution among different models, with overall larger values than those of VMF1 and no obvious spatial pattern. Note that despite the increased complexity in describing the wet MF compared to the hydrostatic MF, the corrected slant wet delay RMSE is actually smaller than the slant hydrostatic delay RMSE. It can be attributed to the fact that the zenith wet delay itself is only about one-tenth of the zenith hydrostatic delay.

Figure 10 shows the statistical results for the full range of stations, presented in a box plot. From the \(a_{{\text{h}}}\) results (left panel), the results with VMF1 as the reference and with VMF3 as the reference are generally consistent, with the global average RMSE for the Exp1 solution being around 150 mm; while, the remaining solutions are all less than 50 mm (the red crosses are outliers). The results for Exp1 and the two third-order solutions Poly3 and Exp3 with VMF3 as the reference are slightly more accurate than those with VMF1 as the reference. The results for \(a_{{\text{w}}}\) (right panel) show that there is no significant difference between the five solutions for either VMF1 or VMF3 as a reference. The global averages for the results with VMF1 and VMF3 as reference are around 28 mm and 48 mm for the 5 solutions. The results with VMF1 as reference are more concentrated around the mean value than the results with VMF3 as reference. Niell et al. (2001) transformed this RMSE into station height errors with a rule of thumb, i.e., "The error in the station height is approximately one third of the delay error at the lowest elevation angle included in the analysis". This factor was subsequently corrected to one-fifth by Bohm (2004). Note the above factors are given at an elevation angle of 5°, and we calculate that the slant tropospheric delay at 5° is about 0.691 times that at 3° (global scale). Therefore, translating the RMSE here into station height errors is approximately 5.5 mm for SHD and 3.5–6 mm for SWD, respectively.

Fig. 10
figure 10

Statistical results of RMSE of slant hydrostatic delay (left) and slant wet delay (right) at elevation angle 3° using site-wise VMF1 (upper) and VMF3 (lower) GNSS as reference

4 Conclusions and perspectives

We have presented a novel method to model the tropospheric delay mapping function vertical profiles from the Earth surface up to 14 km in a simple and effective way, with a precision loss of less than 0.3% for coefficient \(a_{{\text{h}}}\) and less than 1% for coefficient \(a_{{\text{w}}}\) on the global scale. Compared to the Niell model, which was only height-corrected for \(a_{{\text{h}}}\) and had a ground data dependency, the new model adds a height correction for \(a_{{\text{w}}}\), is not dependent on ground data and takes full account of the height variation and spatial dependency of the mapping function coefficients. The global average loss of precision for \(a_{{\text{h}}}\) and \(a_{{\text{w}}}\) decreased from 2.20 and 47.44% for the Niell model to 0.11% and 0.36% for the new model (Exp3).

In our method, the vertical profile of mapping functions is represented by three to four coefficients. Given the large latitudinal dependence of the first-order model, we recommend using the second-order and higher for height correction. Third-order models can further reduce the loss of precision on top of second-order, but the improvement is limited. Using TUW site-wise VMF1 and VMF3 GNSS products as a reference, the new models also show good performance at the Earth surface, with RMSEs of about 2–5 cm for both SHD and SWD at 3° elevation angle for models of order 2 and higher.

The proposed method provides effective mapping function height correction products for the space geodesy technique community, ensuring reliable corrections from the Earth's surface to 14 km altitude (which covers the range of activities of most aircraft activities and meteorological events). The new models have no significant time dependence at second order and beyond, allowing stable precision even with empirical data. It can accommodate more users' needs, especially those who use airborne and high-altitude equipment.