Introduction

To ensure the precision and integrity of precision approach and landing under category (CAT)-I/II/III requirements in civil aviation, ground-based augmentation system (GBAS) employs differential techniques and anomaly monitoring algorithms to ensure user safety (Saito et al. 2017). The tropospheric delay emerges as a prominent source of error, particularly during tropospheric anomalies (Khanafseh et al. 2016). Based on the single differential positioning method, it is commonly assumed that the zenith total delay (ZTD) is approximately equal between the ground reference stations and the user aircraft. However, due to the fact that the ground reference stations and aircraft are situated at different heights, difference exists in the tropospheric delay encountered, respectively, at the ground stations and the aircraft. This difference, defined as the tropospheric delay error in this studied, is corrected using the tropospheric correction (TC) models in differential corrections (RTCA 2004). It should be noted that if it is not specified that the tropospheric delay error refers to difference in the tropospheric delay encountered at the aircraft and ground stations.

The application of TC model in aeronautical navigation, especially in GBAS, was shown to improve positioning accuracy and reduce integrity risk (Skidmore and van Graas 2004; McGraw 2012). However, the calculation formulas for the scale height parameters used in TC models are not fully confirmed in GBAS. There are four different scale height calculation methods, i.e., McGraw et al. (2000), Skidmore and van Graas (2004), Warburton (2010) and Saito et al. (2021). It should be noted that the tropospheric delay error is relatively minor. However, clarification is important because it can also cause potential safety issue and should be carefully considered in future applications that require high accuracy (Saito et al. 2021). Furthermore, the performance of the four different scale height calculation methods used in TC models is not comprehensively evaluated in literature, which poses threats to the application of GBAS with different scale height methods and may cause performance degradation. Therefore, it is essential to assess the performance of the TC models with different scale height estimations and verify their compliance with the relevant GBAS standards. This contributes to developing a more accurate model, which further helps to mitigate tropospheric delay errors, ensure aircraft safety in extreme conditions and separate it with other parameters such as location and receiver clocks (Wang et al. 2022).

To evaluate the performance of the TC models, an approach is the ray-tracing techniques based on numerical weather models (NWMs) (Hobiger et al. 2008; Nafisi et al. 2012). The basic principle of ray tracing is geometrical optics approximation. The Eikonal equation is the key result of this approximation, which is used to describe electromagnetic waves passing through slowly varying media (Born and Wolf 1999). The equation is solved by partial derivative equations based on analytical method. However, the ray-tracing algorithm based on this equation takes more time due to the iteration process (Hobiger et al. 2008). Under the assumption that the refractive index profile is composed of almost constant gradients, Thayer (1967) modeled the refractivity index as radial distance power-law profile function. Results indicated its obvious advantages in accuracy, calculation speed and compatibility with previous results. Additionally, Hobiger et al. (2008) proposed a refined piece-wise linear propagation algorithm by using Snell’s law. The average refractivity index above and below half-height between two layers was calculated. Since the ray path can be calculated in a single calculation, the computational cost is significantly reduced. Ray-tracing algorithm based on piece-wise linear method was applied to very long baseline interferometry (VLBI) and interferometric synthetic aperture radar (InSAR) analysis (Hofmeister and Böhm 2017). Hobiger et al. (2010) studied the calculation biases caused by the rapid change of wet refractivity due to the use of simplified ray-tracing algorithm in InSAR data processing. Tropospheric tomography based on ray-tracing algorithm is also widely used to estimate three-dimensional distribution of water vapor (Haji Aghajany and Amerian 2017; Zhang et al. 2022). However, the above algorithms mostly pay attention to the calculation accuracy and speed, there is little research on the requirements of data input to model the whole meteorological information between the ground and the aircraft. For example, it is difficult to capture the meteorological information of the lower atmosphere using only NWM products for ray tracing. For GBAS approach service type C (GAST-C), the decision height is less than 200 ft. Therefore, the decision height may be lower than the height of the lowest ERA5 product, which is approximately 492 ft. In this case, the conventional ray tracing is not applicable.

To address the issues above, this study proposed a modified ray-tracing algorithm that utilizes both the station surface and the upper-air meteorological data to calculate the tropospheric delay error. The effect of virtual temperature and station height anomaly are also considered to improve the accuracy of tropospheric delay error estimation and unify the spatial datum of different data. The estimated tropospheric delay error is then used to assess the performance of the TC models based on four scale height estimation methods. The principle of GBAS TC model is reviewed next. The modified ray-tracing algorithm is then present, followed by the description of the datasets. The performance of the TC models using four different scale height estimation methods is evaluated. Finally, conclusions and remarks are presented.

Overview of the TC models for GBAS

This section introduces the description of TC models in GBAS. The four different scale height calculation methods for the TC models are also reviewed.

GBAS TC models

To correct the tropospheric delay error, which is the difference of tropospheric delay between that encountered, respectively, at the GBAS ground station and the aircraft, as shown by a simplified differential architecture in Fig. 1, the TC model is applied in differential corrections. The model and its uncertainty are defined as (RTCA 2004, 2017)

$$\begin{array}{*{20}c} {{\text{TC}} = N_{{\text{R}}} h_{0} \frac{{10^{ - 6} }}{{\sqrt {0.002 + \sin^{2} \left( \theta \right)} }}\left( {1 - {\text{exp}}\left( { - \frac{\Delta h}{{h_{0} }}} \right)} \right)} \\ \end{array}$$
(1)
$$\begin{array}{*{20}c} {\sigma_{{{\text{trop}}}} = \sigma_{N} h_{0} \frac{{10^{ - 6} }}{{\sqrt {0.002 + \sin^{2} \left( \theta \right)} }}\left( {1 - {\text{exp}}\left( { - \frac{\Delta h}{{h_{0} }}} \right)} \right)} \\ \end{array}$$
(2)

where \(1/\sqrt {0.002 + \sin^{2} \left( \theta \right)}\) is the mapping function (MF) related to the satellite elevation \(\theta\). \(N_{{\text{R}}}\) is the surface refractivity index. \(\sigma_{N}\) is the uncertainty in the refractivity index. \(h_{0}\) is the tropospheric scale height. \(\Delta h\) is height of the aircraft relative to the GBAS station.

Fig. 1
figure 1

A simplified GBAS differential architecture. The tropospheric delay error is the difference of tropospheric delay between that encountered, respectively, at the GBAS station (path ①) and the aircraft (path ③*). The tropospheric delays of path ③ and path ③* are approximately equal in short-baseline differential positioning. The tropospheric delay error is corrected by the tropospheric correction model in GBAS

The TC model in the zenith direction, denoted as\({{\text{TC}}}_{{\text{zd}}}\), is the integral of refractivity index profile (Saito et al. 2021), i.e., \({\text{TC}}_{{{\text{zd}}}} = 10^{ - 6} \smallint N_{{\text{R}}} \exp \left( { - \frac{\Delta h}{{h_{0} }}} \right){\text{d}}h = 10^{ - 6} N_{{\text{R}}} h_{0} \left( {1 - {\text{exp}}\left( { - \frac{\Delta h}{{h_{0} }}} \right)} \right)\). The formula employs the exponential function and scale height to characterize the vertical variation of refractivity index. The uncertainty of the TC model is determined through the variance–covariance propagation law. It should be noted that the variation of scale height is disregarded and assumed to be constant in Equation (2).

Scale height estimation methods in TC models

Estimating the TC model parameters is essential for accurately compensating the tropospheric delay error. Using the four different methods, the scale height can be estimated as constant annual averaged values or real-time variations, while under extreme weathers during system operation, it is not reasonable to treat the scale height as a constant value. Although real-time estimation can potentially yield better performance, there are countries currently utilizing an annual averaging scheme. In this analysis, the scale height is used as a constant to evaluate the performance of the TC model under the worst conditions. For the sake of illustration, the scale height estimation follows the methods provided in McGraw et al. (2000), Skidmore and van Graas (2004), Warburton (2010) and Saito et al. (2021), which are referred to as method 1, 2, 3 and 4, respectively, in this study.

Figure 2 shows the variations of the tropospheric delay error estimated by the TC models using four scale height estimation methods at a height of 6 km at various elevations. It is evident that the tropospheric delay error calculated with different methods exhibits more similarities at higher elevations, while as the elevation decreases, the deviations between different methods present more disparities, which are due to the error induced by the MF. When the elevation angle is \({6}^{^\circ }\), the disparities between different models reach their maximum.

Fig. 2
figure 2

Variations of tropospheric delay error calculated among different scale height methods at various elevations. The cutoff elevation is \({6}^{^\circ }\)

Figure 3 presents the calculated scale heights by four different methods at the BJSH in Beijing, China, using meteorological data from the crustal movement observation network of China (CMONOC) in 2021. The average bias between the scale heights calculated by different methods is 9.2 km, and the maximum bias is 12.6 km. Additionally, the calculated scale heights with method 2 show a completely opposite seasonal variation in contrast to the other methods. Due to the significant differences in scale heights calculated by the four different methods, it is necessary to further evaluate the performance of the TC models using four different scale height methods.

Fig. 3
figure 3

Scale heights calculated by four methods at the BJSH station in Beijing, China, in 2021

Tropospheric delay error estimation by modified ray-tracing algorithm

As the relative and specific humidity contain different meteorological elements related to water vapor, these two humidity measurements are first converted to partial water vapor pressure based on empirical models. Additionally, to reduce the influence of layer height on the integration errors and efficiency of ray-tracing algorithm, the optimal layer height is determined next. With the convert partial water vapor pressure and determined optimal layer height, the modified ray-tracing algorithm is established to estimate the tropospheric delay error.

Partial water vapor pressure conversion

The well-known expression for the refractivity index of the troposphere is given by (Thayer 1974)

$$\begin{array}{*{20}c} {N = k_{1} \frac{{p_{{\text{d}}} }}{T}Z_{d}^{ - 1} + k_{2} \frac{e}{T}Z_{w}^{ - 1} + k_{3} \frac{e}{{T^{2} }}Z_{w}^{ - 1} } \\ \end{array}$$
(3)

where \(T\) is the temperature in Kelvin. \(p_{{\text{d}}}\) and \(e\) are the dry and water vapor partial pressures in millibars, respectively. \({ }Z_{d}\) and \(Z_{w}\) are compression factors. \({ }k_{1}\), \({ }k_{2} { }\) and \(k_{3}\) are empiric constants, where \(k_{1} = 77.604\;{\text{K}}/{\text{hPa}}\), \(k_{2} = 64.79\;{\text{K}}/{\text{hPa}}\) and \(k_{3} = 377600{ }\;{\text{K}}^{2} /{\text{hPa}}\).

Since the ground and upper atmosphere products utilize different meteorological parameters, the partial water vapor pressure can be calculated using the relative humidity at ground stations (McCarthy and Petit 2004; Leandro et al. 2006), i.e.,

$$\begin{array}{*{20}c} {e = \frac{{{\text{RH}}}}{100}e_{{{\text{satu}}}} f_{w} } \\ \end{array}$$
(4)

where \({\text{RH}}\) is the relative humidity. \(e_{{{\text{satu}}}}\) is the saturation vapor pressure. \(f_{w}\) is the enhancement factor given by

$$\begin{array}{*{20}c} {f_{w} = 1.00062 + 3.14 \times 10^{ - 6} P_{0} + 5.6 \times 10^{ - 7} \left( {T_{0} - 273.15} \right)^{2} } \\ \end{array}$$
(5)

The specific humidity of the upper atmosphere data of the fifth-generation European center for medium-range weather forecasts (ECMWF) re-analysis (ERA5) is converted to partial vapor pressure, which is given by (Wallace and Hobbs 2006; Nafisi et al. 2012)

$$\begin{array}{*{20}c} {p_{w} = \frac{qp}{{\left( {\varepsilon + \left( {1 - \varepsilon } \right)q} \right)}} \approx \frac{qp}{{\left( {0.622 + 0.378q} \right)}}} \\ \end{array}$$
(6)

where \(p\) denotes the total pressure. \(q\) is the specific humidity. \(\varepsilon ={M}_{{\text{w}}}/{M}_{{\text{d}}}\) is the ratio of the molar masses of water vapor and dry gas.

Determining the optimal layer height

By ray tracing the neutral atmosphere NWMs along the path ② in Fig. 1, the tropospheric delay error, referred to as \(D_{{{\text{trop}}}}\), can be estimated by

$$\begin{array}{*{20}c} {D_{{{\text{trop}}}} = 10^{ - 6} \smallint N{\text{d}}s = 10^{ - 6} \mathop \sum \limits_{i = 1}^{k} N_{i} s_{i} } \\ \end{array}$$
(7)

where \(k\) is the number of height layers (Zhou et al. 2020).\(N_{i}\) is the refractivity of the ith layer.

To minimize numerical integration errors and ensure calculation efficiency, a trade-off is made between the layer height and the computational error. Figure 4 shows the relationship between the layer height and the integration errors at different elevations. As the layer height decreases, the integration error also decreases. However, the time consumption also increases, which is shown as the purple line in the figure. It should be noted that the calculation of the time consumption is based on a laptop with 16 G random access memory (RAM) and Intel i5-10300H@2.50 GHz processor, thus the calculation time may differ at different platforms. The figure indicates that the integration errors converged when the layer height is less than 1 m. Further reducing the layer height only slightly improves the integration accuracy, while significantly increasing computation time, which largely limits the algorithm performance. Therefore, to achieve a balance between computation time and integration errors, an integration layer height of 1 m is chosen for this study.

Fig. 4
figure 4

Variations of integration errors and time consumption at different elevations when using different integration layer heights

Modified ray-tracing algorithm establishment

In order to unify the spatial datum of different products and improve the estimation accuracy of tropospheric delay error, a modified ray-tracing algorithm is proposed that considers height anomaly and virtual temperature. The decision height of an aircraft is typically less than 60 m in CAT I, while the height anomaly based on EGM96 may exceed 80 m in China. Therefore, this induces a discrepancy between the height of calculated tropospheric delay error and the actual height. On the other hand, by introducing virtual temperature, we can express the presence of moisture in the air using the dry air equation of state. In the calculation of the overall ZTD, the virtual temperature correction is generally negligible compared with the model error. However, according to equation (10) for the virtual temperature hereafter, the correction for virtual temperature cannot be neglected, as it is directly proportional to the temperature and water vapor content (Sheng et al. 2013). Therefore, the effect of virtual temperature and height anomaly are considered in the modified ray-tracing algorithm. Moreover, the modified algorithm not only utilizes ERA5 products but also incorporates meteorological data from ground stations. Since the integration height is very low, the Fresnel effect can be disregarded. Additionally, the vertical resolution of the ERA5 data is improved using a meteorological parameter profile, which is described in details in the subsequent sections.

The proposed algorithm consists of three main steps, i.e., (1) Calculating the longitude and latitude of the tropospheric piercing point (TPP) and the horizontal meteorological parameters using the bilinear interpolation algorithm. (2) Determining the vertical meteorological parameters at any given altitudes, which allows for the calculation of meteorological elements at various layers between the ground station and the aircraft. (3) Computing the tropospheric delay error along the signal propagation path using a hierarchical integration algorithm based on the refractivity index profile. Figure 5 presents the flowchart of the modified ray-tracing strategy.

Fig. 5
figure 5

Flowchart of modified ray-tracing strategy

The establishment of atmospheric pressure, temperature and water vapor pressure profiles are the key aspect in estimating the tropospheric delay error. The temperature profile is influenced by the thermal characteristics of the atmosphere and the thermal balance of the Earth. In the tropospheric region below 10 km, the temperature profile exhibits a linear decrease given by (Chao 1974).

$$\begin{array}{*{20}c} {T = T_{{\text{s}}} - \alpha \left( {h - h_{s} } \right)} \\ \end{array}$$
(8)

where the subscript \(s\) refers to the GBAS station. \(h\) is the height of the aircraft in kilometers. \(\alpha\) is the temperature lapse rate. It is worth mentioning that since the distance between neighboring pressure layers at the bottom of the ERA5 product is only a few hundred meters, the linear temperature lapse rate, which is only used to describe the temperature change between the neighboring pressure layers of ERA5 products, can accurately capture real atmospheric changes (Hobiger et al. 2008; Zhou et al. 2020).

According to the multivariate atmospheric hypothesis, the pressure profile is defined as (Hopfield 1971; Davis et al. 1985)

$$\begin{array}{*{20}c} {P = P_{{\text{s}}} \left[ {1 - \frac{{\alpha \left( {h - h_{{\text{s}}} } \right)}}{{T_{{\text{v}}} }}} \right]^{{\frac{g}{{R_{d} \alpha }}}} } \\ \end{array}$$
(9)

where \(g\) is the gravitational acceleration. \(R_{d}\) is the gas constant. \(T_{{\text{v}}}\) is the virtual temperature instead of the actual temperature, which is defined as (Sheng et al. 2013)

$$\begin{array}{*{20}c} {T_{{\text{v}}} = \left( {1 + 0.608q} \right)T = \left( {1 + 0.378e/p} \right)T} \\ \end{array}$$
(10)

It is evident that higher temperature contributes to the increase in water vapor pressure and subsequently larger corrections for virtual temperature. Therefore, the effect of virtual temperature cannot be ignored. On the other hand, the vertical variation of water vapor pressure profile can be approximated to exponential attenuation, which is defined as (Askne and Nordius 1987)

$$\begin{array}{*{20}c} {\frac{e}{{e_{{\text{s}}} }} = \left( {\frac{P}{{P_{{\text{s}}} }}} \right)^{\lambda + 1} } \\ \end{array}$$
(11)

where \(\lambda\) is the water vapor pressure height factor. es and Ps are the water vapor pressure and pressure at the GBAS station, respectively.

It is worth mentioning that although the modified ray-tracing algorithm creates temperature and pressure profile between the two layers of products, it essentially uses the numerical summation method. Thus, the computational performance of the algorithm is approximately equal at different elevations, where the computational performance is indicated by the average time required to calculate the tropospheric delay error for a given elevation and azimuth at a particular epoch (Hobiger et al. 2008). For precision approach, CAT‐I and CAT-III phases, it, respectively, takes 385, 245 and 236 ms for using the modified ray-tracing algorithm. Except for the time-consuming for CMONOC, ECMWF products and other data input and output, the interpolation is another key factor that limits the algorithm performance.

Datasets

To estimate the tropospheric delay error, an accurate characterization of both the surface conditions at the station and the upper-air meteorological environment is crucial. The surface meteorological data is obtained by Paroscientific model 760 meteorological sensors deployed in CMONOC, which are widely applied in various fields, including geodesy and weather forecasting (Shi et al. 2018). Additionally, to cover typical climatic environment at different latitudes in China, 13 ground meteorological stations are selected which are located in various regions, as depicted in Fig. 6.

Fig. 6
figure 6

Geographic distribution of the selected CMONOC meteorological stations

The time span is from January to December 2021. The sampling interval is 30 s. It should be clarified that according to the international civil aviation organization-navigation systems panel (ICAO-NSP), the duration of data required for estimating TC model parameters is at least one year (Saito et al. 2022). Consequently, the length of the dataset used in this study meets the requirements. On the other hand, it is obvious that there are significant differences in tropospheric conditions in different regions, thus the dataset is not globally representative and can only be used to evaluate the performance of the TC models in China.

Reanalysis products, including the ERA5, play a crucial role in estimating and modeling tropospheric delays. ERA5 is particularly notable for its exceptional performance, thanks to its high temporal and spatial resolution (Zhou et al. 2020). In this study, the upper atmosphere data derived from ERA5 is also processed. The data has a horizontal resolution of 0.25° × 0.25° in latitude and longitude, as well as a vertical resolution of 37 pressure levels. The sampling interval is 1 h. It should be noted that the ERA5 data refers to the geoid over land in the vertical direction, i.e., Earth gravitational model 1996 (EGM96). However, the spatial datum of CMONOC data is world geodetic system 1984 (WGS84). Therefore, to ensure the unification in the spatial datum, the EGM96 model is utilized to calculate the height anomaly of the ground station. Table 1 lists the selected ground stations, height anomaly and the corresponding climatic types (Song et al. 2011).

Table 1 Locations, height anomaly and the climate types of the selected meteorological stations

TC model performance evaluation

By utilizing the modified ray-tracing algorithm, reference values for tropospheric delay error in the zenith and slant direction are calculated to evaluate the performance of TC models with the four scale height estimation methods. Moreover, an inflation algorithm based on elevations is introduced to overbound the bias associated with the TC model.

TC model evaluation in the zenith direction

The performance of TC models with different scale height estimation methods is evaluated by calculating the tropospheric delay error in the zenith direction using meteorological data from 2021. The TC model bias, defined as the deviations with the reference tropospheric delay errors estimated by the modified ray-tracing algorithm, is calculated and analyzed.

Figure 7 depicts the TC model bias for four representative CMONOC stations with four different scale height methods in January 2021. Simulations are conducted for CAT I mode. The selected four stations represent diverse climate environments, encompassing various longitudes and latitudes. Despite employing different scale height estimation methods, the TC model bias for the same site is almost identical, primarily due to the low decision height. The TC model bias is mainly affected by the airport surface meteorological elements. As a result, the similar results obtained by different scale height methods are due to the lack of rapid changes in meteorological elements. At the same time, this also answers which scale height estimation method should be used to estimate the tropospheric delay error in China. Furthermore, the figure illustrates distinct bias trends at the four stations, which are primarily influenced by local climate conditions.

Fig. 7
figure 7

Variations of TC model bias at XJQM, HBES, GDZH and NMAL stations in January 2021

To further analyze the influence of local climate on the TC model performance, Fig. 8 shows the relationship between ground water vapor pressure and refractivity index at a height of 60 m in 2021. The top two panels show the changes of ground water vapor pressure at YNZD and GDZH stations, while the middle two panels show refractivity index changes calculated by interpolating ERA5 and CMONOC product, empirical refractivity index model, i.e., \(N_{{60\;{\text{m}}}} = N_{{\text{R}}} {\text{exp}}\left( { - 60/h_{0} } \right)\) at a height of 60 m. The parameters of the empirical refractivity index model are calculated by annual average and real-time methods, respectively. Results show that the TC model using the annually average method is unable to capture the local climate variation, as shown in the middle panel. On the contrary, compared with EAR5, the real-time estimation shows excellent performance. Therefore, the real-time refractivity index and scale height estimation should be used in precision approach to mitigate the effect of climate variation. In addition, the bottom two panels show that there is a high correlation between ground water vapor pressure and refractivity index bias, which implies that the bias of the refractivity index is influenced by the ground water vapor pressure.

Fig. 8
figure 8

Surface water vapor pressure and refractivity index at a height of 60 m at YNZD and GDZH stations. The panels from the top to bottom represent the variation of surface water vapor pressure, the refractivity index estimated by interpolating ERA5 and CMONOC product, empirical refractivity index model and variations of the refractivity index bias as a function of ground water vapor pressure

Figure 9 illustrates the distribution of the TC model bias in the zenith direction across 13 ground stations operating in GAST-C. It can be seen that the average bias remains consistency among different regions, with the mean values fluctuating around 0. However, the bias distribution at each station is obviously deviated with Gaussian distribution, which is commonly used to describe TC model bias. This bias is probably due to the fact that the scale height variations related to seasons are not considered. The tropospheric anomaly is considered to be the extreme point at both ends of the violin curves. A higher probability of the tropospheric anomaly is occurrence observed in low latitude regions. Compensating for the tropospheric delay error using empirical TC model becomes challenging during extreme weather events such as inversion layers. Therefore, it is crucial to consider monitoring extreme weather-related tropospheric phenomena to ensure system safety for GBAS in the future. It is important to note that GDZH, GXWZ and GXNN stations exhibit more extreme TC model bias compared with others. This can be attributed that all three stations are situated at low latitudes, making them more susceptible to subtropical monsoon climates with more frequent extreme weather events.

Fig. 9
figure 9

The distribution of the TC model bias in the zenith direction estimated at the 13 ground stations in GAST-C

Figure 10 presents the TC model bias based on Method 1 at different altitudes at BJSH station. It can be seen that the model bias gradually increases with height. Similar results are observed for other scale height estimation methods, although not shown here. This behavior can be attributed to the employed modeling framework of the TC model, which utilizes a whole modeling approach with an exponential function and scale height to describe the variation of the refractivity index profile. However, with the height increases, the complexity of variations intensifies in meteorological elements, leading to more challenges for the scale height to accurately capture the actual variations in refractivity index.

Fig. 10
figure 10

Variations of the TC model bias at altitudes of 60, 100, 200 and 400 m based on the scale height estimation method described in McGraw et al. (2000) at BJSH station in 2021

Table 2 summarizes the accuracy of the TC model in the zenith direction at different heights corresponding to the results shown in Fig. 10. At a decision height of 60 m, the TC model achieves an average bias of 0.12 mm and an RMS of 1.76 mm, meeting the precision landing requirements for CAT I. However, the average bias increases to 1.34 mm, and the RMS reaches 10.38 mm at a decision height of 400 m. Although the average bias is very small, the presence of large TC model biases, which exceed the range assured by the broadcasted uncertainty of the TC model, can cause loss of integrity. Additionally, the increased RMS value leads to a larger protection level, subsequently reducing system availability. On the other hand, a comprehensive tropospheric threat model, encompassing all potential tropospheric anomaly conditions, is essential for ensuring aircraft safe (Khanafseh et al. 2016). With the increase in height, the TC model bias is no longer influenced by the surface meteorological elements, the tropospheric refractivity index profile based on the exponential function is difficult to capture the change of refractivity index with height. Consequently, the TC model bias gradually increases with height.

Table 2 Statistics of the TC model bias and RMS at heights of 60, 100, 200 and 400 m based on the scale height estimation method described in McGraw et al. (2000) at BJSH station in 2021

TC model evaluation in the slant direction

With the assistance of the proposed ray-tracing algorithm, it becomes possible to evaluate the TC model bias associated with the MF in the slant direction. This section evaluates the performance of MF at different stations and seasons. Figure 11 presents the TC model bias in the slant direction at different elevations and azimuths in GAST-C on January 1, 2021. It can be seen that the TC model bias at different stations is smaller at higher elevations. Meanwhile, the anisotropy of the TC model bias across different azimuths is not pronounced. However, with the elevation decreases, the model bias increases with the decrease in elevations. The TC model bias reaches its maximum at an elevation of 6°. It should be noted that GBAS provides aircraft with differential corrections and integrity information for satellites with an elevation greater than 5° using very high frequency (VHF) data broadcasting stations (Saito et al. 2017). Therefore, considering the fact that the GBAS standard sets the cutoff elevation at 5°, this discrepancy leads to a larger TC model bias. The primary reason of the bias lies in the MF, which exhibits noticeable systematic bias at low elevations. To mitigate the impact of the MF-induced TC model bias on integrity, the systematic bias needs to be overbounded. Moreover, a more precise MF can also be developed to mitigate the effect of MF errors on GBAS.

Fig. 11
figure 11

The TC model bias at XJQM, HBES, GDZH and NMAL stations at different elevations and azimuths in GAST-C. The cutoff elevation angle is \(6^\circ\)

To investigate the performance of the TC model in different seasons, the scale height estimation method described in McGraw et al. (2000) method is utilized and the TC model bias is calculated at the BJSH station, as presented in Fig. 12. It is observed that the bias is larger at low elevations. Additionally, the bias is significantly larger in summer compared with other seasons. This is caused by a couple of reasons. Firstly, the abundance of water vapor in summer induces challenges in accurately estimating the refractivity index due to pronounced variations in wet delay. Secondly, it is difficult for the scale height using annual mean to accurately describe the tropospheric delay error at stations experiencing significant seasonal changes.

Fig. 12
figure 12

The TC model bias at different elevations and azimuths in different seasons in GAST-C. The cutoff elevation angle is \(6^\circ\)

To further analyze the effect of seasonal variation on the TC model performance,

the TC model bias in different seasons in GAST-C at the BJSH station is shown in the top panel in Fig. 13. It can be seen that seasonal variations are obvious. The wet component in the tropospheric delay error derived by the modified ray tracing is also shown in the second panel. A similar trend can be found between the TC model bias and the wet component, which is further verified by the correlation analysis in the third panel, indicating that the variation of the TC model bias is mainly influenced by the wet delay or water vapor in different seasons. To quantitatively evaluate the impact of seasonal changes on the TC model, the bottom panel shows that the tropospheric delay error is overestimated by the TC model during summer and autumn, reaching a maximum bias of 4.7 mm, while it is underestimated during spring and winter. The average bias of the TC model is in the millimeter magnitude. Therefore, similar to the conclusions from Fig. 8, the timely refractivity index and scale height estimation should be applied in precision approach to accurately capture seasonal variations and consequently estimating the tropospheric delay errors.

Fig. 13
figure 13

Variations of the TC model bias and wet component in GAST-C at BJSH station. The panels from top to bottom, respectively, represent the TC model bias, wet component in the tropospheric delay error, variation of TC model bias as a function of wet component and TC model bias statistics, respectively

Figure 14 shows the seasonal bias of the TC model and the mean precipitation (Peng 2020) in summer and winter at 13 stations in 2021. It can be seen that the TC models at XJSS, XJQM and NMBT stations, characterized by a temperate continental climate, exhibit the highest accuracy. Furthermore, the seasonal changes in summer and winter are not significant at these stations with low precipitation. On the other hand, the bias of TC model shows obvious seasonal variation at BJSH, NMAL, LNDD and JLYJ stations in temperate monsoon climate. The performance of the models in winter, with an average bias of 0.83 mm, surpasses that in summer with a bias of 4.5 mm. The model bias also exhibits seasonal changes at YNZD, LUZH and HBES stations, which are characterized by a subtropical monsoon climate. However, in contrast to the previous four stations characterized by the temperate monsoon climate, the seasonal variations in the model’s average bias are reversed for summer and winter. The GXNN, GXWZ and GDZH stations with more precipitation demonstrate the poorest model performance, with an average bias of 5.6 mm in 2021. This is due to the fact that the scale height estimation method is more applicable to mid-latitudes. In other words, the surface temperature parameter in the scale height estimation is assumed to be 15 °C and is not applicable at low latitudes. Additionally, the figure highlights a relationship between the performance of the TC model and the average precipitation. The TC model, which is based on the scale height, models the refractivity index as a unified entity, without separating hydrostatic and wet component. Consequently, this modeling approach is less effective in estimating wet delay in theory. This limitation leads to larger bias in regions with higher precipitation.

Fig. 14
figure 14

Comparison of the TC model bias and mean precipitation in summer and winter at 13 ground stations in 2021

To further evaluate the performance of the TC model in extreme environments, the maximum TC model bias and occurrence probability of anomaly are calculated at various elevations at a height of 400 m, as shown in Fig. 15. The left two panels show the maximum TC model bias and occurrence probability of anomaly in the zenith direction. The threshold for anomaly determination in the zenith direction is set as 10 mm. It can be seen that the maximum bias at most of the stations exceeds 10 mm with an occurrence probability exceeding 20%. The maximum bias of 28 mm in the zenith direction is observed at BJSH station, as shown in the top left panel. Furthermore, compared with other stations, the panel in the bottom left shows that the probability of anomalies exceeds 41% at GDZH, GXNN and GXWZ stations, which are located at low latitudes. This can be attributed to the more complex variations in water vapor at low latitudes, leading to more frequent extreme weather events. On the other hand, the maximum TC model bias in the slant direction is shown in the top right panel. It can be seen that the maximum bias increases as the elevation decreases. The maximum TC model bias found at an elevation of 6° is 363 mm. To evaluate the occurrence probability of anomaly in the slant direction and mitigate the impact of the MF, the thresholds at different elevations are scaled up using the formula \(10/\sqrt{0.002+{{\text{sin}}}^{2}\left(\theta \right)}\), as shown by the blue lines in the bottom right panel. The occurrence probability of anomaly remains stable at around 23% at higher elevations can be observed. However, there is a noticeable increase in the occurrence probability of anomaly when the elevation is below 10°. This indicates that the MF performs poorly at low elevations, making it challenging for TC models to accurately estimate the actual tropospheric delay error.

Fig. 15
figure 15

Statistics of the maximum TC model bias and occurrence probability of anomaly in the zenith and slant direction. The top two panels are the maximum TC model bias in the zenith and slant direction, respectively. The bottom two panels, respectively, represent the occurrence probability of anomaly in the zenith and slant direction

Uncertainty overbounding of TC model bias

Figure 16 depicts the TC model uncertainty and mean bias calculated for various elevations. The blue line represents the actual TC model uncertainty calculated using the modified ray-tracing algorithm, while the red line shows the theoretical uncertainty estimated based on equation (2) using the scale height method described in McGraw et al. (2000). Three representative stations, selected to represent airports situated at different latitudes with diverse climatic conditions, are included in the analysis. It can be observed that the actual and theoretical uncertainties exhibit a high level of consistency across almost all elevations.

Fig. 16
figure 16

Variations of uncertainty and averaged bias of the TC model as a function of elevation angles at XJSS, NMAL and GXWZ stations in 2021. The cutoff elevation angle is \(6^\circ\)

It is commonly assumed that the TC model bias transmitted by the ground station should follow a zero-mean normal distribution. However, significant TC model bias between the actual and theoretical uncertainties is observed at low elevations in this analysis, as shown by the black lines in Fig. 16. These biases may lead to unacceptable integrity risks. When the TC model bias deviates from a zero-mean Gaussian distribution, an overbounding method is required for airborne users. This renders the use of uncertainty overbounding to accommodate the presence of such the TC model bias. In theory, since the ground station possesses the knowledge of the parameters related to the error probability density function (PDF), overbounding nonzero-mean Gaussian distributions can be straightforward by considering it as a Gaussian envelope problem with a nonzero mean (Sayim 2003). In the case of the TC model bias, defined as \(\overline{\mu }_{{{\text{trop}}}}\), the true vertical protection level \({\text{VPL}}_{{{\text{true}}}}\) is given by

$$\begin{array}{*{20}c} {{\text{VPL}}_{{{\text{true}}}} = k\sqrt {\mathop \sum \limits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } + \mathop \sum \limits_{n = 1}^{N} S_{z} \overline{\mu }_{{{\text{trop}}}} } \\ \end{array}$$
(12)

where \(N\) is the number of satellites. \(k\) is the multiplier which determines the probability of fault-free missed detection. \(S_{z}\) is the element of the third column of projection matrix. \(\sigma_{n}\) is the uncertainty of each ranging measurement.

The inflated vertical protection level is defined as \({\text{VPL}}_{{{\text{inflated}}}}\)

$$\begin{array}{*{20}c} {{\text{VPL}}_{{{\text{inflated}}}} \ge {\text{VPL}}_{{{\text{true}}}} } \\ \end{array}$$
(13)

If equation (13) is true, the aircraft position error bound is expected and its integrity can be guaranteed. Equation (13) can be rewritten as,

$$\begin{array}{*{20}c} {k\sqrt {\mathop \sum \limits_{n = 1}^{N} S_{z}^{2} \sigma_{{{\text{inflated}}}}^{2} } \ge \frac{{k\sqrt {\mathop \sum \nolimits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } + \mathop \sum \nolimits_{n = 1}^{N} S_{z} \overline{\mu }_{trop} }}{{k\sqrt {\mathop \sum \nolimits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } }}k\sqrt {\mathop \sum \limits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } } \\ \end{array}$$
(14)

where \(\sigma_{{{\text{inflated}}}}\) is the uncertainty that is inflated after taking into account the TC model bias. By employing the inflated model uncertainty, the requirement for overbounding is effectively satisfied with (Sayim 2003)

$$\begin{array}{*{20}c} {k\sqrt {\mathop \sum \limits_{n = 1}^{N} S_{z}^{2} \sigma_{{{\text{inflated}}}}^{2} } \ge \xi k\sqrt {\mathop \sum \limits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } } \\ \end{array}$$
(15)

where \(\xi\) is the inflation factor. According to equations (14) and (15), the inflation factor is given by

$$\begin{array}{*{20}c} {\xi = \left( {1 + \frac{{\mathop \sum \nolimits_{n = 1}^{N} S_{z} \overline{\mu }_{{{\text{trop}}}} }}{{k\sqrt {\mathop \sum \nolimits_{n = 1}^{N} S_{z}^{2} \sigma_{n}^{2} } }}} \right)} \\ \end{array}$$
(16)

It can be found that the inflation factor is a function of the TC model bias and uncertainty of aircraft measurements. Although it is difficult to know the real measurement error of an aircraft, it is easy to determine the uncertainty of observations under conservative assumptions.

According to Fig. 16, there is a significant bias when the elevation is lower than around 20°. To reduce the risk of GBAS integrity, a variance inflation method with a threshold of 20° is used. In this method, \(\overline{\mu }_{{{\text{trop}}}}\) is 0 when the elevation exceeds 20°, which results in a value of 1 for \(\xi\). In other words, the uncertainty of the TC model does not need to be inflated. When the elevation is lower than 20°, \(\overline{\mu }_{{{\text{trop}}}}\) is shown by the black line in Fig. 16. Thus, \(\overline{\mu }_{{{\text{trop}}}}\) is given by

$$\begin{array}{*{20}c} {\overline{\mu }_{{{\text{trop}}}} = \left\{ {\begin{array}{*{20}l} {B_{{{\text{Mean}}}} } \hfill & {\theta \le 20^\circ } \hfill \\ 0 \hfill & {\theta > 20^\circ } \hfill \\ \end{array} } \right.} \\ \end{array}$$
(17)

where \(B_{{{\text{Mean}}}}\) is the TC model bias mean between the values calculated by the modified ray-tracing algorithm and the TC model in GBAS.

Conclusions and remarks

The study investigated the performance of the existing TC models which use four different scale height estimation methods in GBAS. A modified ray-tracing algorithm is proposed to calculate the tropospheric delay error, thus providing a reference for evaluating the accuracy of TC models. The hourly ERA5 and CMONOC data in the whole year of 2021 in China were processed. Based on the analysis, the following conclusions can be drawn.

  1. 1)

    For the evaluation of the performance of the TC model in the zenith direction, the TC model bias calculated using the four scale height estimation methods shows approximate consistency in GAST-C. This answers which scale height estimation method should be used in TC model in China. Additionally, it can be concluded that as long as the input surface meteorological parameters are the same, a similar modeling accuracy can be attained by depicting the refractivity index profile as a straightforward linear profile. However, the TC model bias increases as the aircraft decision height rises. At a height of 400 m, the average bias of the TC model in the zenith direction reaches 1.34 mm, with an RMS of 10.38 mm. This is primarily attributed to the overall modeling approach of the TC model. Although the average bias is small, the maximum TC model bias reaches 28 mm. The presence of large TC model bias, which exceeds the range assured by the broadcasted tropospheric uncertainty, can cause loss of integrity. Therefore, carefully considering and bounding the TC model bias is essential to ensure flight safety.

  2. 2)

    In terms of the performance of the TC model in difference regions in China, the TC model bias from the 13 stations revealed notable variations, which is influenced by surface meteorological elements. The majority of stations exhibit a maximum bias in the zenith direction exceeding 10 mm with an anomaly occurrence probability exceeding 20% based on the worst-case scenario. Among all the stations, GDZH presents the highest probability of experiencing a tropospheric anomaly, with a likelihood of 47%. This discrepancy can be attributed to the influence of low latitudes with more precipitation and higher temperatures, which leads to a higher probability of extreme weather events.

  3. 3)

    Regarding the MF-induced TC model bias in the slant direction, the TC model is almost unbiased when the elevation exceeds 20° at nominal conditions. However, a significant TC model bias arises for elevations below 20° due to the limitations of MF. Considering the fact that the magnitude of MF-induced bias is related to local climatic conditions and elevations, it is advisable not to applying a fixed inflation factor, as it may increase the TC model uncertainty and reduce the GBAS continuity. In extreme weather conditions, the TC model bias tends to increase as the elevation decreases. The maximum TC model bias found at an elevation of 6° is 363 mm. In terms of the TC model uncertainty, it is found that the distribution of the TC model bias deviates the Gaussian distribution. To reduce the influence of the MF and the non-Gaussian distribution of TC model bias on the integrity and continuity in GBAS, it is recommended to estimate the inflation factor dynamically based on real-time meteorological data. The strategy to define the dynamic inflation factor to overbound the GBAS MF-induced TC model bias will be the focus of the follow-on work. The development of a more precise MF will be also studied.

This study contributes to a better understanding of the GBAS TC model performance in China. In addition, it can provide valuable insights and guidance for developing more accurate TC models.