Introduction

The ability to predict the availability of satellite navigation augmentation systems has become increasingly important. In the field of airborne-based augmentation systems, receiver autonomous integrity monitoring (RAIM) availability prediction has become an essential part of the system. This is because for a receiver to execute RAIM algorithm, two conditions must be met: a minimum number of satellites and adequate satellite geometry (Feng et al. 2006), which means RAIM does not work in some cases referred to as RAIM holes. Consequently, availability prediction must be conducted to exclude the risk of running into a RAIM hole. Currently, the USA, Australia, Germany, and China have developed mature RAIM availability prediction systems, and they demand that availability prediction be performed before using RAIM (Zhu et al. 2009). In the field of ground-based augmentation systems, the Federal Aviation Administration has clearly defined the requirements of availability prediction in the paper AC20-138B (FAA 2010). Research on ground-based augmentation system availability prediction has been undertaken (Wang et al. 2014), and the SLS-4000 Service Prediction Tool, developed by the Navigation Branch (ANG-C32), has been recommended for the Federal Aviation Administration as an acceptable method for meeting prediction requirements. In the field of satellite-based augmentation systems (SBASs), there is similar demand for availability prediction. Although research on this subject is limited, some relevant agencies, such as the European Satellite Services Provider (ESSP) and Iguassu Software Systems, have commenced research programs (Rovira-Garcia et al. 2015). We attempt to build a model to predict availability based on system architecture and the operational mechanism of SBASs.

SBASs are commonly composed of multiple ground-based monitoring stations, upload stations, and GEO satellites. With messages broadcast by GEO satellites, the SBAS supports wide-area or regional augmentation. The SBAS not only provides differential corrections, but it also ensures integrity, continuity, and availability. Currently, SBASs should meet the requirements of LPV-200. The European Geostationary Navigation Overlay Service (EGNOS) is the second most mature operational SBAS serving all the European countries. EGNOS consists of three segments: the space segment, ground segment, and user segment, as shown in Fig. 1. The space segment is composed of two geostationary satellites: AOR-E (PRN120) and SES-5 (PRN136) which took over the role of IOR-W (PRN126) on August 19, 2015. The main task of this segment is to maintain and implement communication between the RIMS and the master control center (MCC), as well as to transmit instructional information sent by the MCC to the users. The ground segment comprises 39 RIMS, 2 MCCs, and six navigation earth stations. The main task of this segment is to conduct comprehensive control and data processing for EGNOS. The user segment consists of EGNOS standard receivers, which need to be able to receive signals from EGNOS, GPS, and GLONASS simultaneously and interpret them compatibly. EGNOS has been used in applications such as aviation, marine, ground transportation (Gicquel et al. 2016).

Fig. 1
figure 1

Space segment, ground segment, and user segment of EGNOS

Inevitably, the capability of monitoring stations used for providing error corrections might degenerate over time, and poor geometry between the RIMS and satellites could affect the reliability of the navigation system in supplying safety of life (SOL) service. For example, a report has suggested that frequent road traffic accidents happened on Calle Hartzenbusch in Spain (Telegraph 2013). The lack of accurate corrections for GPS is believed to be an important factor. Coincidentally, accidents occurred in December 2014, when an Embraer aircraft prepared to land on a runway under the guidance of a satellite navigation system. Therefore, SBAS availability has become increasingly important. In summary, it is necessary to propose a method for predicting SBAS availability to avoid unnecessary losses.

Primary availability prediction algorithm

In this section, a primary availability prediction algorithm is proposed by the following three steps. First, through combing the calculation flow of SBAS availability, the difficulties of prediction algorithm are attributed to two real-time integrity parameters, user difference range error indicator (UDREI) and grid ionospheric vertical error indicator (GIVEI). Second, based on the calculation process of UDREI and GIVEI, the relationship between UDREI and the number of visible RIMS and the relationship between GIVEI and the number of visible ionospheric pierce points (IPPs) are analyzed statistically. Finally, experiments are performed to evaluate the effectiveness of the primary algorithm.

Calculation flow of SBAS availability

The Radio Technical Commission for Aeronautics specifies calculations for SBAS availability. In fact, SBAS availability depends on whether a user’s protection level (PL) is greater than the alarm limit. The vertical protection level (VPL) and horizontal protection level (HPL) can be calculated as follows (RTCA 2006):

$${\text{HPL}}_{\text{SBAS}} = \left\{ {\begin{array}{*{20}c} {K_{{H,\,{\text{NPA}}}} \cdot d_{\text{major}} \quad {\text{for}}\,{\text{NPA}}} \\ {K_{{H,\,{\text{PA}}}} \cdot d_{\text{major}} {\kern 1pt} \quad {\text{for}}\,{\text{PA}}} \\ \end{array} } \right.$$
(1)
$${\text{VPL}}_{\text{SBAS}} = K_{v} \cdot d_{U}$$
(2)

where K H,NPA, K H,PA, K v are the fractiles corresponding to the allocation of integrity risk under different approach modes. Furthermore,

$$d_{\text{major}} = \sqrt {\frac{{d_{\text{est}}^{2} + d_{\text{north}}^{2} }}{2} + \sqrt {\left( {\frac{{d_{\text{est}}^{2} - d_{\text{north}}^{2} }}{2}} \right)^{2} + d_{\text{EN}}^{2} } }$$
(3)
$$\left. \begin{aligned} d_{\text{est}}^{2} = \sum\limits_{i = 1}^{N} {S_{{{\text{east}},i}}^{2} \sigma_{i}^{2} } \quad\quad d_{U}^{2} = \sum\limits_{i = 1}^{N} {S_{U,i}^{2} \sigma_{i}^{2} } \hfill \\ d_{\text{north}}^{2} = \sum\limits_{i = 1}^{N} {S_{{{\text{north}},i}}^{2} \sigma_{i}^{2} } \quad\quad {\kern 1pt} d_{\text{EN}} = \sum\limits_{i = 1}^{N} {S_{{{\text{east}},i}} S_{{{\text{north}},i}} \sigma_{i}^{2} } \hfill \\ \end{aligned} \right\}$$
(4)

where d major is the error uncertainty along the semimajor axis of the error ellipse, d 2est is the variance of model distribution that overbounds the true error distribution in the east axis, d 2north is the variance of model distribution that overbounds the true error distribution in the north axis, d EN is the covariance of model distribution in the east and north axes, d 2 U is the variance of model distribution that overbounds the true error distribution in the vertical axis, S east,i is the partial derivative of positional error in the east direction with respect to the pseudorange error on the ith satellite, S north,i is the partial derivative of positional error in the north direction with respect to the pseudorange error on the ith satellite, and S U,i is the partial derivative of positional error in the vertical direction with respect to the pseudorange error on the ith satellite. The S east,i , S north,i , and S U,i are elements of the projection matrix S from the pseudorange domain to the position domain, which is defined as:

$$S = \left[ {\begin{array}{*{20}l} {s_{{{\text{east}},1}} } \hfill & {s_{{{\text{east}},2}} } \hfill & {{\kern 1pt} {\kern 1pt} {\kern 1pt} \cdots } \hfill & {{\kern 1pt} {\kern 1pt} s_{{{\text{east}},N}} } \hfill \\ {s_{{{\text{north}},1}} } \hfill & {s_{{{\text{north}},2}} } \hfill & {{\kern 1pt} {\kern 1pt} {\kern 1pt} \cdots } \hfill & {s_{{{\text{north}},N}} } \hfill \\ {s_{U,1} {\kern 1pt} } \hfill & {s_{U,2} } \hfill & {{\kern 1pt} {\kern 1pt} {\kern 1pt} \cdots } \hfill & {s_{U,N} } \hfill \\ {s_{t,1} {\kern 1pt} } \hfill & {{\kern 1pt} s_{t,2} } \hfill & {{\kern 1pt} {\kern 1pt} {\kern 1pt} \cdots } \hfill & {s_{t,N} } \hfill \\ \end{array} } \right] = (H^{T} WH)^{ - 1} H^{T} W$$
(5)

where H is an observation matrix corresponding to the geometry of the visible satellites, and W is a diagonal weight matrix corresponding to the variance of measurement noise.

The variance of measurement noise on the ith satellite is defined as:

$$\sigma_{i}^{2} = \sigma_{{i,{\text{flt}}}}^{2} + \sigma_{{i,\,{\text{UIRE}}}}^{2} + \sigma_{{i,\,{\text{air}}}}^{2} + \sigma_{{i,\,{\text{tropo}}}}^{2}$$
(6)

where σ 2 i,flt is the model variance for the residual error, σ 2 i,UIRE is the model variance for the slant range ionospheric error, σ 2 i,air is the model variance for the multipath receiver noise error, and σ 2 i,tropo is the model variance for the tropospheric delay estimation error.

Analysis of the calculation process presented above indicates that the PL is determined by two factors. One is the geometry of the visible satellites, which could be predicted using a satellite almanac. The other is the standard deviation of the satellite pseudorange measurement, which consists of four parts: model variance for the residual error, model variance for the slant range ionospheric error, model variance for the multipath receiver noise error, and model variance for the tropospheric delay estimation error. The latter two are related to satellite elevation angles, which means they also could be predicted using a satellite almanac. However, to predict the first two variances, we need two real-time integrity parameters broadcast by GEO satellites. One is UDREI, which is used to describe integrity related to the satellite clock/ephemeris error correction for each satellite. The other is GIVEI, which is used to describe integrity as related to the ionospheric error correction for each ionospheric grid point (IGP). Therefore, the focus here is on predicting UDREI and GIVEI through mathematical statistics.

Statistical treatment of UDREI

The user difference range error (UDRE) bounds the error of the computed ephemeris and clock corrections for each satellite with a probability of 0.999. The following important steps are used in the UDREI computation (Sardon et al. 1998).

  1. 1.

    Compute the residual error at the ith monitoring station, dR i due to estimated range R i and measured range R mi as:

    $${\text{dR}}_{i} = R_{mi} - R_{i}$$
    (7)
  2. 2.

    Estimate the mean \(\overline{\text{dR}}\) and standard deviation σ R of the satellite’s ephemeris and clock error, respectively, for m monitoring stations as:

    $$\overline{\text{dR}} = \frac{1}{m}\sum\limits_{k = 1}^{m} {{\text{dR}}_{k} }$$
    (8)
    $$\sigma_{\text{dR}} = \sqrt {\frac{1}{(m - 1)}\sum\limits_{k = 1}^{m} {({\text{dR}}_{k} - \overline{\text{dR}} )^{2} } }$$
    (9)
  3. 3.

    UDRE is a conservative tolerance error bound as:

    $${\text{UDRE}} = \overline{\text{dR}} + \kappa (\Pr )\sigma_{\text{dR}}$$
    (10)

    where κ(Pr) is the statistical confidence factor given the confidence level Pr.

  4. 4.

    UDREI is obtained from UDRE using the conversion table specified by DO-229D.

Analysis of the above calculation process indicates that if a satellite has more visible RIMS, then it can get more pseudoranges, making the estimation of UDREI more accurate (Fang et al. 2013). Therefore, it is believed that a high correlation exists between UDREI and the number of visible RIMS, and practical measured data are processed to analyze their relationship. The processing procedure includes the following three steps. First, based on the positions of the RIMS and the satellite’s positions calculated from a satellite almanac, the number of visible RIMS for each satellite can be derived. Second, based on the satellite’s clock/ephemeris error correction information, broadcast by the GEO satellites, the UDREI value of each satellite can be parsed. Finally, according to the number of visible RIMS and the corresponding UDREI value for each satellite, the relationship between UDREI and the number of visible RIMS can be analyzed. Given this relationship, it would be possible to predict the UDREI of a satellite at any time, because the number of visible RIMS could be obtained easily from a satellite almanac. The processing flow is shown in Fig. 2.

Fig. 2
figure 2

Flowchart of the statistics and analysis of UDREI

Statistical treatment of GIVEI

There could be variations in the estimated delay at the IGP because of many error factors, e.g., modeling errors, mapping function errors, and measurement noise. These errors become translated from IPP locations to selected IGPs in the delay estimation process. Therefore, an error bound (grid ionospheric vertical error, GIVE) on such an error is also generated at each IGP. GIVE is the maximum error bound that an IGP can have. The error bound can be on either side of the estimated IGP delay value. The condition to GIVE estimation is that for every IGP there should be at least three surrounding squares, each with at least one IPP. This refers to the sufficiency of IPP density around an IGP. The following important steps are used in the GIVEI computation (Prasad and Sarma 2004):

  1. 1.

    Compute the residual error e IPP due to the estimated user’s IPP delay \(\hat{I}_{\text{IPP}}\) and measured IPP delay I IPP as:

    $$e_{\text{IPP}} (t) = I_{\text{IPP}} (t) - \hat{I}_{\text{IPP}} (t)$$
    (11)
  2. 2.

    Estimate the mean \(\left| {\bar{e}_{\text{IPP}} } \right|\) and standard deviation σ e of the user’s IPP vertical delays, respectively, for m epoch (i.e., time t 1 to t m ) measurements as:

    $$\left| {\bar{e}_{\text{IPP}} } \right| = \frac{1}{m}\sum\limits_{k = 1}^{m} {e_{\text{IPP}} (t_{k} )}$$
    (12)
    $$\sigma_{e} = \sqrt {\frac{1}{(m - 1)}\sum\limits_{k = 1}^{m} {(e_{\text{IPP}} (t_{k} ) - |\bar{e}_{\text{IPP}} |)^{2} } }$$
    (13)
  3. 3.

    Estimate the grid vertical absolute error bias \(\hat{e}_{\text{IGP}}\) at the surrounding four IGPs from the absolute value of the residual error at the ith user’s IPP, i.e., \(\left| {e_{{{\text{IPP}}_{i} }} (t)} \right|\), for m epoch data using:

    $$\hat{e}_{\text{IGP}} = \frac{{\sum\limits_{i = 1}^{n} {\left( {\frac{1}{{d_{i} }}} \right) \cdot \left| {e_{{{\text{IPP}}_{i} }} (t)} \right|} }}{{\sum\limits_{i = 1}^{n} {\frac{1}{{d_{i} }}} }}$$
    (14)

    where d i is the distance from the ith user’s IPP to the selected IGP, and n is the number of IPPs adjacent to the selected IGP.

  4. 4.

    Generate a conservative tolerance error bound E IPP for every valid user’s IPP in a surrounding square. E IPP is derived from a two-sided statistical tolerance interval γ that contains a proportion p of a normally distributed population over a given sample size m as:

    $$E_{\text{IPP}} = \left| {\bar{e}_{\text{IPP}} } \right| + g(\gamma ;p;m)\sigma_{e}$$
    (15)

    where \(\left| {\bar{e}_{\text{IPP}} } \right|\) is the absolute value of the mean error and g(γpm) is the statistical confidence factor. The confidence factor can be computed for given values of γ, p, and m. For γ = 0.999 (i.e., 99.9%), p = 0.999, and m = 5, g(γpm) is 23.54, and for m = 30, it is 5.43, i.e., it decreases with the number of data samples.

  5. 5.

    GIVE at the elected IGP is the sum of \(\hat{e}_{\text{IGP}}\), maximum tolerance error bound max (E IPP), and an allowance for vertical ionospheric delay quantization q as:

    $${\text{GIVE}} = \hbox{max} (E_{\text{IPP}} ) + \hat{e}_{\text{IGP}} + q/2$$
    (16)

    where q = 0.0625 m.

  6. 6.

    GIVEI is obtained from GIVE using the conversion table specified by DO-229D.

Although ionospheric delay is very dynamic, the residual error between the estimate value and the real value is relatively stable. Since GIVE is used to bound this residual error, the dynamic characteristic of GIVE is not obvious. Analysis of the above calculation process suggests it is possible to draw a law similar to that obeyed by UDREI, i.e., if the IGP has more visible IPPs, then it can get more ionospheric delay, making the estimation of GIVEI more accurate (Fang et al. 2013). Therefore, its value can be attributed to the number of visible IPPs. In fact, this view is reflected in many papers. Blanch (2002) points out that “integrity failures are more likely to happen when there are few IPP measurements.” Juan et al. (2002) believe that in regions well surrounded by IPP measurements, even in the most adverse ionosphere conditions, delay estimate can be tightly bounded. Li et al. (2011) analyzed ionospheric delay estimation residuals over China and found in the central region of China where IPP measurements are sufficient, the confidence bound are small, at 0.1 m or less, while in the edge region where IPP measurements are sparse, confidence bound can reach more than 0.4 m. However, the above research only qualitatively revealed the relationship between the GIVE and IPP measurements; we will quantitatively analyze how the IPP measurements affect GIVE values.

Practical measured data are processed to analyze the relationship. The processing procedure includes the following four steps. First, based on the positions of the RIMS and the satellite’s positions calculated using a satellite almanac, the locations of the IPPs can be calculated. Second, having derived the locations of the IPPs and given the positions of the IGPs, the number of visible IPPs for each IGP can be obtained. Third, based on the ionospheric correction information, broadcast by the GEO satellites, the GIVEI value of each IGP can be parsed. Finally, according to the number of visible IPPs and the corresponding GIVEI value for each IGP, the relationship between GIVEI and the number of IPPs can be established. Given this relationship, it would be possible to predict the GIVEI of an IGP at any time, because the number of visible IPPs could be obtained easily from a satellite almanac. The processing flow is shown in Fig. 3.

Fig. 3
figure 3

Flowchart of the statistics and analysis of GIVEI

Experiment and discussion

Currently, EGNOS has 39 operational RIMS and 287 IGPs, which are distributed in four vertical bands (band 3, band 4, band 5, and band 6) and one crosswise band (band 9). The positions of all the RIMS and IGP are known precisely, as shown in Fig. 4.

Fig. 4
figure 4

Positions of RIMS and IGPs distributed in four vertical bands and one crosswise band of EGNOS

The broadcast data of EGNOS are available from EGNOS data collection stations at ENAC University in France. Here, 1 month of EGNOS data (December 1–30, 2014) were processed using the method presented above, and the result of the statistical treatment of UDREI is shown in Fig. 5.

Fig. 5
figure 5

UDREI sample distribution based on 1 month of EGNOS data

It is evident from Fig. 5 that when the number of visible RIMS is small, the value of UDREI is 14, which means UDRE is not monitored. As the number of visible RIMS increases, UDREI decreases. When the number of visible RIMS reaches 32, the value of UDREI is 6.

The mean of UDREI for the same number of visible RIMS is calculated. To ensure the integrity of the system, the mean value is rounded up to be conservative; the results are shown in Table 1.

Table 1 Relationship between the number of visible RIMS and UDREI in EGNOS

The result of the statistical treatment of GIVEI is shown in Fig. 6. It is obvious that when the number of visible IPPs is small, the value of GIVEI is 15, which means GIVE is not monitored. As the number of visible IPPs increases, GIVEI decreases. When the number of visible IPPs reaches 15, the value of GIVEI is 8.

Fig. 6
figure 6

GIVEI sample distribution based on 1 month of EGNOS data

The mean of GIVEI for the same number of visible IPPs is calculated. To ensure the integrity of the system, the mean value is rounded up to be conservative; the results are shown in Table 2.

Table 2 Relationship between the number of visible IPPs and GIVEI in EGNOS

Given the relationships shown in Tables 1 and 2, GIVEI and UDREI can be predicted, which means it should be possible to predict the standard deviation of the satellite pseudorange measurement. Then, by combining the standard deviation with the geometry of the satellites, the user’s VPL and HPL could be predicted. Furthermore, the availability of the SBAS could also be obtained. Based on the MATLAB Algorithm Availability Simulation Tool (MAAST), which is the SBAS simulation tool developed by GPSLAB of Stanford University (Jan et al. 2001), the availability prediction algorithm presented above is simulated. Prediction results for May 18, 2015, are shown in Figs. 7, 9 and 11.

Fig. 7
figure 7

Predicted HPL on May 18, 2015. Different colors represent different ranges of values as defined in color bar, e.g., the HPL of the red area is between 30 and 40 m

Fig. 8
figure 8

Actual HPL on May 18, 2015. Different colors represent different ranges of values in meters as defined in color bar

Fig. 9
figure 9

Predicted VPL on May 18, 2015

Fig. 10
figure 10

Actual VPL on May 18, 2015

Fig. 11
figure 11

Predicted availability on May 18, 2015. Different colors represent different ranges of values in percent as defined in color bar

To verify these prediction results, they were compared with maps (Figs. 8, 10 and 12) produced by ESSP, which is an authoritative institution monitoring the actual performance of EGNOS. It can be concluded that the prediction results are consistent with the actual performance in the central region, while a slight difference exists on the edge of the region.

Fig. 12
figure 12

Actual availability on May 18, 2015

ESSP provided quantitative values of actual availability so that further analysis could be undertaken. Figure 13 shows the difference between the actual and predicted availabilities. Statistical analysis demonstrates that the prediction is consistent with the actual performance over 72.28% of the region; it is smaller than the actual availability over 23.79% of the region and larger than the actual availability over 3.93% of the region.

Fig. 13
figure 13

Difference between the actual and predicted availabilities on May 18, 2015. Different colors represent different ranges of values as defined in color bar, e.g., in red area the prediction is 40% smaller than the reality

These results indicate that the predicted availability conservatively reflects the actual availability. This means that as long as the predicted availability meets the operational requirements, the SBAS could be used safely with a probability of 96.07%.

Improved availability prediction algorithm

In this section, an improved availability prediction algorithm is proposed which mainly contains three aspects. First, the necessity of data preprocessing is discussed and a new sampling strategy is defined to ensure sample independence. Second, considering the effect of geometry on UDREI, S-GDOP is defined to distinguish different geometries such that the relationship between the number of visible RIMS and UDREI for different geometries could be obtained. Third, with regard to the effect of geometry on GIVEI, a weighted number of visible IPPs is defined to describe the geometric distribution of the IPPs. Finally, experiments are performed to evaluate the effectiveness of the improved algorithm.

Independence of samples

As mentioned earlier, PL is the key indicator in predicting SBAS availability. PL overbounds the positioning error, which can be estimated using independent samples. Therefore, to improve the accuracy of the prediction, the independence of the samples must be guaranteed. However, in reality, the samples obtained are often not independent (Hauschild and Montenbruck 2016) because of the following two factors. First, the satellite ground trace repeats each day. Taking GPS as an example, every 23 h and 56 min, the relative position of the satellite and the user will repeat. This means that the multipath error related to the relative position has strong correlation in the adjacent time. It has been proven that variations in satellite orbits and ground conditions can cause the multipath error to become independent after 1 week (Shively et al. 2000). Thus, 1 week is skipped between samples in the improved algorithm. Second, when calculating the integrity parameters, the code measurements are smoothed with carrier phase measurements in a complementary filter with a 100-s time constant. Although this operation can effectively reduce the receiver noise and multipath error, it makes the adjacent pseudorange relevant. It is generally accepted that samples should be spaced two time constants apart to achieve acceptable independence (Luo et al. 2012). In summary, a new sampling strategy is obtained, which involves selecting 1 day each week and setting the sampling interval to 200 s for the selected day.

Under the new sampling strategy, the mean of UDREI for the same number of visible RIMS is calculated. To ensure the integrity of the system, the mean value is rounded up conservatively; the results are shown in Table 3, which upgrades Table 1 considering the effect of the independence of samples.

Table 3 Relationship between the number of visible RIMS and UDREI in EGNOS considering sample independence

Then the mean of GIVEI for the same number of visible IPPs is calculated. The mean value is also rounded up conservatively to ensure the integrity of the system; the results are shown in Table 4, which upgrades Table 2 considering the effect of the independence of samples.

Table 4 Relationship between the number of visible IPPs and GIVEI in EGNOS considering sample independence

Given the relationships shown in Tables 3 and 4, GIVEI and UDREI can be predicted. Furthermore, the availability of the SBAS can be obtained. The prediction performance under the new sampling strategy is shown in Fig. 14, where it is clear that the difference between the actual and predicted availabilities transits gently in the edge region, in contrast to Fig. 13, which has noticeably sharper boundaries with the original sampling strategy. It means that the deviation is reduced in the edge region and that the performance of the prediction is improved under new sampling strategy.

Fig. 14
figure 14

Difference between the actual and predicted availabilities with the new sampling strategy

Effect of geometry on UDREI

The UDREI prediction algorithm presented above only considers the number of RIMS, ignoring the effects of the geometry between the RIMS and satellites. In fact, the geometry has direct impact on the error distribution; thus, it also influences UDREI (Pandya et al. 2000). As analyzed earlier, UDREI has the following two characteristics: It describes the UDRE of a single satellite, and it is calculated using the pseudorange from a plurality of stations.

Before analyzing the impact of geometry, a variable must be defined that can describe such multi-station versus mono-satellite geometry, denoted S-GDOP, which can be defined as:

$$S - {\text{GDOP}} = \sqrt {G_{11} + G_{22} + G_{33} + G_{44} }$$
(17)
$$G = (H^{\text{T}} H)^{ - 1}$$
(18)
$$H = \left[ {\begin{array}{*{20}c} {l_{1} } & {m_{1} } & {n_{1} } & 1 \\ {l_{2} } & {m_{2} } & {n_{2} } & 1 \\ \vdots & \vdots & \vdots & \vdots \\ {l_{n} } & {m_{n} } & {n_{{_{n} }} } & 1 \\ \end{array} } \right]$$
(19)

This description refers to the definition of GDOP, except that the ith row of the observation matrix H here (i.e., [l i m i n i ]), is the line-of-site vector from the ith RIMS to the given satellite.

The correlation between UDREI and the number of visible RIMS as well as the correlation between UDREI and S-GDOP is analyzed (Fig. 15). It can be seen that when UDREI has a peak, the number of visible RIMS probably reaches a minimum and S-GDOP tends to be larger. Quantitative analysis indicates that UDREI displays strong correlation with the number of visible RIMS (Pearson Correlation = −0.72, P = 0) and weak correlation with S-GDOP (Pearson Correlation = 0.21, P = 4 × 10−31). When analyzing the effect of geometry on UDREI, two facts should be considered. The first is that the significance level of S-GDOP is 4 × 10−31, which means S-GDOP must be considered when predicting UDREI. The second is that the Pearson Correlation of S-GDOP is 0.21, which means S-GDOP is only a secondary factor; i.e., it is less important than the number of visible RIMS. Based on these facts, a new prediction algorithm is designed, where samples are grouped by S-GDOP, and then UDREI is predicted separately for each group using the traditional algorithm presented above. In this case, samples whose deviation of S-GDOP is within 100 were classified as a group because these samples are believed to have similar geometries.

Fig. 15
figure 15

Correlation analysis among UDREI, the number of visible RIMS and S-GDOP

Statistical analysis shows that when S-GDOP is small, the geometry between the RIMS and satellites is good. Consequently, the number of visible RIMS tends to be sufficient and the UDREI is small accordingly. As S-GDOP increases, the geometry deteriorates, making the number of visible RIMS decreases and UDREI increases simultaneously. To demonstrate this feature, the relationship between the number of visible RIMS and average of UDREI in different geometries characterized by S-GDOP is shown in Fig. 16. These curves generally display a downward trend showing the impact of the number of visible RIMS, and the different groups, characterized by S-GDOP, have slightly different curves, characterizing the impact of geometry. The tail of most curves is abnormal, because when S-GDOP is large, the geometry between the RIMS and satellites is poor. Hence, the maximum number of visible RIMS revealed in samples can no longer reach 40. In this situation, the predicted UDREI is set to 15 to indicate “unmonitored.”

Fig. 16
figure 16

Relationship between the number of visible RIMS and average of UDREI in different geometries characterized by S-GDOP

To ensure the integrity of the system, the average of UDREI is rounded up conservatively; the results are shown in Fig. 17, which allows predicting UDREI with the given position of satellites and RIMS. The prediction is carried out through the following three steps: first, S-GDOP is computed as defined in (17); then, a specific color curve is selected according to the S-GDOP; finally, UDREI is predicted according to the number of visible RIMS with the selected curve.

Fig. 17
figure 17

Relationship between the number of visible RIMS and overbound average of UDREI in different geometries characterized by S-GDOP

Figure 18 shows consistency, i.e., the percentage of the area where the prediction is consistent with the reality, as a function of number of groups. The consistency between the prediction and reality is chosen as an indicator for evaluating the performance of the algorithm for different numbers of groups. For a larger number of groups, geometry is considered more adequately because the deviation of S-GDOP in each group becomes less. However, it does increase the complexity of the algorithm because more prediction curves need to be analyzed. In the meanwhile, an increase in the number of groups means a reduction in the number of samples within a single group, which could diminish the credibility of the statistical results. The figure shows that when the number of groups is small, an increase in that number improves the consistency significantly. When the number reaches three, the consistency tends to vary slightly with a further increase in the number of groups. When the number of groups reaches five, optimum consistency is achieved, which means the geometry has been considered adequately. If the number of groups is increased further, the consistency declines because overfitting magnifies the influence of accidental errors (Hawkins 2004). In our research, the number of groups is set to five, which is based on the result of the analyses shown in the figure.

Fig. 18
figure 18

Effect of the number of groups on consistency, i.e., the percentage of the area where the prediction is consistent with the reality

Effect of geometry on GIVEI

As discussed above, GIVEI has been modeled as a function of the number of visible IPPs, denoted N. This count can capture approximately the law of change for GIVEI. However, it omits the details of the geometric distribution of the IPPs. It is obvious that an IPP closer to the IGP contributes more when calculating GIVEI than one further away. According to the Triangular Interpolation (TRIN) model (Mannucci et al. 1993) adopted by EGNOS (Trilles et al. 2015), when interpolating the ionospheric delay of the IGP, inverse-distance weighting is believed to roughly weigh each IPP’s contribution to the generation of GIVEI. Thus, a weighted number of visible IPPs, denoted N weighted, is defined as:

$$N_{\text{weighted}} = \sum\limits_{i} {\frac{C}{{d_{i} }}}$$
(20)

where the summation goes over the visible IPPs, d i is the distance between the ith IPP and the IGP, and C is the constant distance between the IGP and the selected IPP whose weight is 1.

The relationship between the weighted number of visible IPPs and GIVEI is shown in Fig. 19. The top panel reveals the relationship between this number and the average of GIVEI. To ensure the integrity of the system, the average of GIVEI is rounded up conservatively; the results are shown in the bottom panel. The curve is believed to better predict GIVEI than Table 2, because it considers the details of the geometric distribution of the IPPs.

Fig. 19
figure 19

Relationship between the weighted number of visible IPPs and GIVEI in EGNOS

Performance of improved algorithm

With the improved algorithm, the predicted PL and availability are obtained. Quantitative analysis of the improved algorithm is shown in Fig. 20. The statistical analysis of the difference demonstrates that the prediction is consistent with the actual performance over 75.17% of the region; it is smaller than the actual availability over 22.14% of the region, and it is larger than the actual availability over 2.69% of the region. In particular, when focusing on Region I, shown by the red box in the figure (longitude −10° to 16°, latitude 37°–58°), the consistency can be up to 100%. This can be explained by the geometric distribution of RIMS. Apparently, the RIMS distribution in Region I is uniform, whereas in the edge regions, where the performance of the prediction algorithm is poor, the RIMS distribution is deficient.

Fig. 20
figure 20

Difference between the actual and predicted availabilities with the improved algorithm. Region I is shown by the red box (longitude −10° to 16°, latitude 37°–58°); Region II is shown by the purple box (longitude −18° to 25°, latitude 33°–65°); Region III is shown by the black box (longitude −23° to 32°, latitude 30°–68°)

The core of the prediction model is to obtain the number of visible RIMS and the number of visible IPPs. The height of the ionospheric spherical shell is so low in comparison with the height of the satellite that the geometric distribution of the IPPs can be approximately characterized by the geometric distribution of RIMS. Therefore, it is reasonable that the performance of the prediction algorithm is highly correlated with the geometric distribution of RIMS. To analyze this phenomenon quantitatively, two indicators that describe the geometric distribution of RIMS are defined. The first is the number of RIMS per unit area, denoted N wrs, which is defined as:

$$N_{\text{wrs}} = \frac{{N_{\text{area}} }}{{S_{0} }} = \frac{{N_{\text{area}} }}{{S_{\text{area}} /S_{\text{total}} }}$$
(21)

where N area is the number of RIMS in the given area, S area is the proportion of the given area, S total is the proportion of Europe, and S 0 is the normalized proportion of the given area.

The second indicator is the average distance of the total station pairs, which is an assembly consisting of any two stations, in the given area, and denoted D pairs, which is defined as:

$$D_{\text{pairs}} = \frac{{\sum\nolimits_{i = 1}^{N} {d_{i} } }}{N}$$
(22)

where d i is the distance of the ith station pair, and N is the total number of possible station pairs.

The performance of the improved algorithm in different regions is shown in Table 5. It is evident that if N wrs is large and D pairs is small, the geometric distribution of RIMS tends to be good, making the prediction more precise (Yun 2015).

Table 5 Performance of the improved algorithm in different regions

Conclusions

The aim of this research was to investigate a simplified method for predicting SBAS availability, given the intense computation and poor timeliness of the traditional algorithms specified in DO-229D. Our contribution is as follows. First, the relationship between UDREI and number of visible RIMS and between GIVEI and number of visible IPPs was established through statistical analysis of EGNOS data. Second, considering the effect of geometry, the relationship was optimized for application with different geometries. Given the relationship, a simplified calculation of PL was demonstrated.

Finally, experiments were performed to evaluate the effectiveness of the proposed method. Using the proposed method, in Region II, where N wrs is 45.78 and D pairs is 1.62 × 106, the consistency between the predicted and actual availabilities reached 95.16%. With improvement of the geometric distribution of RIMS in the given area, the performance of the prediction becomes even better. In particular, when the application range of the prediction algorithm is limited to Region I, where N wrs is 75.00 and D pairs is 1.29 × 106, indicating a uniform distribution of RIMS, the consistency reached 100%. Moreover, the prediction method proposed in this work could be extended easily to other SBAS systems. This research may provide BeiDou satellite-based augmentation systems with theoretical and technical guidance, particularly in the layout of its ground stations and design of ionospheric grid.

It must be acknowledged that the performance of the prediction algorithm over the entire European region is not that satisfactory (75.17%). The lack of RIMS at the edges of the region is the main reason for this problem, i.e., the scarcity of a priori information results in inaccuracy of the prediction algorithm. In terms of the presented methodology, we only considered the mean of the integrity parameters to make the predictions, ignoring the impact of variance, which might cause a difference between the prediction and the actual performance. Quantitative analysis of the effect of variance should be addressed with respect to the integrity of the prediction, which was not the focus here. Furthermore, the inverse-distance weighting regarding the prediction of GIVEI may be optimized by a Kalman Filter to adapt to different SBASs. Then PL at each epoch, which is not public, needs to be provided. Finally, we set the elevation mask angle to 5° to determine whether a satellite was visible, which might be impractical. This will be considered in future work as long as access to real geographic information on the EGNOS RIMS is available. However, it might be considered worth sacrificing some degree of accuracy in exchange for improved predictability.