1 Introduction

Water vapor covers a very small portion of the total atmospheric volume, but plays a very important role in maintaining atmospheric vertical distribution, as well as the formation of clouds and rain (Wang et al. 2014; Yao et al. 2016). Knowing water vapor information with highly temporal and spatial resolution is a prerequisite for prevention of the medium-to-small-scaled weather disastrous. In the past, the main tools for acquiring water vapor are radiosonde, microwave radiometer, and so on. However those methods are confined by their disadvantages, such as low temporal and spatial resolution, as well as high construction and maintenance costs. Consequently, water vapor information obtained by traditional methods is limited in terms of temporal and spatial resolution (Wan 2014).

The arrival of the GNSS tomographic technique provides a new method to acquire water vapor information with highly temporal and spatial resolution (Bevis et al. 1992). Since the validity of water vapor information obtained from GPS-observed data was confirmed by Bevis et al. (1992), the development of water vapor tomography technique has advanced by leaps and bounds (Flores et al. 2000; Troller et al. 2002; Champollion et al. 2009; Perler et al. 2011; Chen and Liu 2014; Adavi and Mashhadi-Hossainali 2014; Rohm et al. 2014). Due to the specific distribution of satellite constellation and receivers, as well as the specific choice of tomography area, the number of signals that passing through the whole tomographic area is limited. Ultimately, many voxels are not crossed by signal rays, especially those locate at periphery and lower space of the tomographic area. Chen and Liu (2014) proposed a method to improve the utilization of signal rays by moving the tomographic area according to the specific tomography condition. However, this approach is limited by the need to relocate the tomographic area before every tomography experiment according to the number of signals crossed the test area, which is hard to operate in practice.

To ensure maximal use of signal rays crossing the entire tomographic area, a novel approach with the advantage of easy operation is proposed. In our study, the maximum tomographic height is determined by radiosonde data over 40 years. In horizontal direction, the horizontal resolution for the target area is selected on the basis of non-uniform symmetrical division of horizontal voxels. Finally, the proposed approach was validated using data from Hong Kong Satellite Positioning Reference Station Network.

2 The principle of tomography

2.1 Building of observation equation

There are two classes of observations derived from GNSS-based receivers can be used for tropospheric tomography; first is slant wet delay (SWD), which can be used to acquire wet refractivity values (Flores et al. 2000; Skone and Hoyle 2005; Rohm and Bosy 2009; Notarpietro et al. 2011), and second is slant water vapor (SWV), which can be exploited to obtain the water vapor distribution information (Champollion et al. 2005; Chen and Liu 2014). Tomographic results derived from two classes of observation can be crossed-converted, based on the atmospheric temperature field (Bender et al. 2011). For our study, SWV is used to obtain three-dimensional water vapor information.

SWD is determined by wet refractivity value along a given slant path, and often refers to the wet delay on a given satellite signal time lapse from satellite to GNSS-based receiver, and can be expressed as follows (Flores et al. 2000):

$${\text{SWV}} = 10^{ - 6} \times \int_{\text{S}} {\rho_{w} \text{d}s}$$
(1)

where \(\text{s}\) is the propagation path of satellite signal. \(N_{{\text{wet}}}\) is the atmospheric wet refractivity (unit: mm/km). SWD can be obtained by processing the observation derived from GNSS-based receivers.

In our study, GAMIT (v10.50) (Herring et al. 2010) software was used for processing the GPS observation based on double-differenced model, and Niell mapping function was adopted. The interval of Zenith Tropospheric Delay (ZTD) and gradient parameters estimated was 30 min, and the elevation cut-off angle of 10° was selected so as to ignore the influence of ray bending on tomographic result (Mendes 1999). Due to the short baseline between receivers in the local region, the similar propagated paths of signals received by different stations and a strong correlation of the signal delay above the atmosphere among stations are existed. To reduce the correlation of tropospheric parameters among the network, three stations are introduced (BJFS, LHAZ and SHAO) with the length of baseline larger than 500 km (Rocken et al. 1995). By combining the surface pressure, the accurate Zenith Hydrostatic Delay (ZHD) can be calculated based on the empirical model (Saastamoinen 1972):

$${\text{ZHD}}\;\mathcal{ = }\;\frac{{0.002277\; \times P_{\text{s}} }}{1 - 0.00266 \times \cos (2\varphi ) - 0.00028\; \times \;H}$$
(2)

where \(P_{\text{s}}\) is surface pressure (unit: hPa), \(\varphi\) is the latitude of station, \(H\) is the geodetic height. Therefore, the Zenith Wet Delay (ZWD) would be separate from the ZTD by minus the ZHD, and the Slant Wet Delay (SWD) can be obtained as follows (Flores et al. 2000):

$${\text{SWD}}_{{{\text{ele}},\varvec{\varphi }}} = m_{\text{wet}} ({\text{elv}}) \times {\text{ZWD}} + m_{\text{wet}} ({\text{elv}}) \times \cot ({\text{elv}}) \times (G_{\text{NS}}^{w} \times \cos \varvec{\varphi } + G_{\text{WE}}^{w} \times \sin \varvec{\varphi }) + R_{\text{e}}$$
(3)

where \({\text{elv}}\) is satellite elevation, \(\varvec{\varphi }\) is azimuth, \(m_{\text{wet}}\) is the wet mapping function, \(G_{\text{WE}}^{w}\) and \(G_{\text{NS}}^{w}\) are wet delay gradient parameters in the east–west and north–south directions, \(R_{\text{e}}\) is the unmodelled residuals, which is obtained according to the method proposed by Alber et al. (2000).

Therefore, SWD can be converted to SWV by following formula (Chen and Liu 2014):

$${\text{SWV}} = \frac{{ 1 0^{ 6} }}{{\rho_{{\text{water}}} R_{\omega } [(k_{ 3} /T_{\text{m}} ) + k_{ 2}^{{\prime }} ]}} \times {\text{SWD}}$$
(4)

where \(k_{\text{2}}^{{\prime }} = 16.48\text{K}\;\text{hPa}^{ - 1}\), \(k_{3} = 3.776 \times 10^{5} \text{K}^{2} \;\text{hPa}^{ - 1}\), \(R_{\omega } = \text{461}(\text{J}\;\text{kg}^{{ - \text{1}}} \;\text{K}^{{ - \text{1}}} )\) and are the specific gas constants for water vapor. \(\rho_{{\text{water}}}\) is water vapor density (unit: g/m3). \(T_{\text{m}}\) is the weighted mean tropospheric temperature, in our study, \(T_{\text{m}}\) is the empirical formula, which is calculated by Liu et al. (2001) using the meteorological measurements.

Once SWV is obtained, the tomographic area can be discretized into a number of voxels, in which water vapor density is a constant in voxels over a given period of time. Therefore, SWV of signal propagation path can be discretized as follows (Chen and Liu 2014):

$$\text{SWV}^{p} = \sum\limits_{ijk} {(d_{ijk}^{p} \cdot \rho_{ijk} )}$$
(5)

where \(\text{SWV}^{p}\) is the slant water vapor of pth signal path (unit: mm). \(i,j,k\) are the position of tomographic voxel in longitude, latitude and vertical directions. \(d_{ijk}^{p}\) is the distance of pth signal in voxel \((i,j,k)\) (unit: km). \(\rho_{ijk}\) is the water vapor density in a given voxel \((i,j,k)\) (unit: g/m3). The above equation can be written in matrix form as follows (Flores et al. 2000; Chen and Liu 2014):

$$\varvec{y}_{m \times 1} = \varvec{A}_{m \times n} \cdot \varvec{\rho }_{n \times 1}$$
(6)

where \(m\) is the number of total SWVs, while \(n\) is the number of voxels in the tomographic area. \(\varvec{y}\) is the observation, refers to SWV here which penetrate the whole research area, while \(\varvec{A}\) is the coefficient matrix of signal transit distances through the voxels and \(\varvec{\rho }\) is the column vector of unknown, which refers to water vapor density in our study.

2.2 Constraint equation

To acquire the water vapor density value in Eq. (6), an inversion algorithm needs to be performed. However, many voxels in the tomographic area are not crossed by any signals. Matrix \(\varvec{A}\) in Eq. (6) is very a large sparse matrix, and water vapor information for those voxels that rays do not pass through are lost. To overcome this rank deficiency problem, the mostly widely-used method is to impose constraint information (Bi et al. 2006). In our study, the horizontal constraint equation is imposed based the Gauss-weighted functional method (Song et al. 2006) while the functional relationship of exponential distribution is established among the voxels on the vertical direction (Elósegui et al. 1998).

The horizontal constraint equation is as follows (Song et al. 2006):

$$H_{1} \rho_{1} \;{ + }\;H_{2} \rho_{2} { + } \cdots { + }H_{j - 1} \rho_{j - 1} - \rho_{j} { + }H_{{j{ + }1}} \rho_{{j{ + }1}} { = 0}$$
(7)

where, \(\rho\) is the water vapor density, \(H\) is the coefficient of horizontal constraint equation which is computed by the following formula:

$$H_{i ,j ,k}^{{i_{1} ,j_{1} ,k_{1} }} = \left\{ \begin{aligned} &1 , (i = i_{1} \& j = j_{1} \varvec{)} \hfill \\ &- \frac{{\text{exp}}{\left( { - \frac{{d_{{i_{1} ,j_{1} ,k_{1} }}^{2} }}{{2\sigma^{2} }}} \right)}}{{\sum\limits_{i} {\sum\limits_{j} {{\hbox{exp}}\left( { - \frac{{d_{{i_{1 ,} j_{1} ,k_{1} }}^{2} }}{{2\sigma^{2} }}} \right)} } }} ,\; (i \ne i_{1} /j \ne j_{1} ) \hfill \\ &0\text{,}\varvec{(}k \ne k_{1} \varvec{)} \hfill \\ \end{aligned} \right.\varvec{.}$$
(8)

As mentioned above, \(i,j,k\) are the positions of operational voxel while \(i_{1} ,j_{1} ,k{}_{1}\) are the marks of other voxels in three directions. \(d_{{i_{1} ,j_{1} ,k_{1} }}^{{}}\) is the distance between the operated voxel and some other voxels. \(\sigma\) is smooth factor. Therefore, the matrices form of horizontal constraint can be written as follows:

$$\varvec{H}_{m \times n} \cdot \varvec{\rho }_{n \times 1} = {\bf{0}}_{m \times n}$$
(9)

where \(\varvec{H}\) is the coefficient matrix of horizontal constraint.

The vertical constraint equation is established based on the following relation (Elósegui et al. 1998):

$$\rho_{i} /\rho_{i - 1} = e^{{(h_{i - 1} - h_{i} )/H}}$$
(10)

where \(\rho_{i}\) and \(h_{i}\) is the water vapor density and the height of the ith layer, separately. \(H\) is the water vapor scale height, which is selected to be 1–2 km. Therefore, the coefficient between two adjacent voxels vertically can be obtained as follows:

$$v_{i} = e^{{(h_{i - 1} - h_{i} )/H}} - 1$$
(11)

where \(v_{i}\) is the coefficient of vertical constraint between the ith and (i − 1)th layer. The matrices form of vertical constraint can be written as follows:

$$\varvec{V}_{m \times n} \cdot \varvec{\rho }_{n \times 1} = {\bf{0}}_{m \times n}$$
(12)

where \(\varvec{V}\) is the coefficient matrix of horizontal constraint.

Consequently, the final tomographic model can be acquired by combining formulas (6), (9) and (12) as follows:

$$\left( \begin{aligned} \varvec{A}_{m \times n} \hfill \\ \varvec{H}_{m \times n} \hfill \\ \varvec{V}_{m \times n} \hfill \\ \end{aligned} \right) \cdot \varvec{\rho }_{n \times 1} = \left( \begin{aligned} \varvec{y}_{m \times n} \hfill \\ {\bf{0}}_{m \times n} \hfill \\ {\bf{0}}_{m \times n} \hfill \\ \end{aligned} \right)$$
(13)

Following the singular value decomposition method, the general inverse matrix can be obtained (Ran and Ge 1997).

3 Tomographic voxel optimization

Atmospheric water vapor at a given altitude varies over changing geographic tomography area. In addition, the quality of tomographic result is affected by the division of the tomographic voxels, and unreasonable division of tomographic voxels would lead to the tomographic result divergent from actual water vapor distribution (Bi et al. 2006; Notarpietro et al. 2011). Consequently, the vertical height for a tomographic area should be determined according to the actual conditions of the research area, and the voxels also should be reasonably discretized.

3.1 Determination of vertical boundary

The most part of atmospheric water vapor is concentrated in the troposphere (Mohanakumar 2008), and the distribution of which cannot be expressed by a uniform formula. Decreasing with height, however, water vapor tend to zero gradually. For most tomographic experiments in the past, the vertical boundary was selected based upon the researcher’s experience. Yet, over various research areas, the distribution of water vapor varies differently. For example, water vapor value is close to zero at 8 km altitude in some areas, while water vapor concentration is still very high at the height of 10 km in other areas. In accord with change in actual water vapor distribution on the vertical direction of the test area, maximum vertical height should therefore be determined which is essential to the effective use tomographic observation.

Assuming a case that the vertical resolution is 0.8 km, using 8 km as the vertical boundary (10 voxels vertically) of tomographic area can save 23.1 % of unknowns than that of selecting vertical boundary 10.4 km (13 voxels vertically). The lower the vertical boundary of tomographic area selected is, the greater the number of signals crossing the whole area that can be used. As shown in Fig. 1, signals which pass through the gray area can also be utilized when the vertical boundary drops from 10.4 to 8 km. However, if the vertical boundary of tomographic area is too low, the influence of water vapor above vertical boundary to tomographic result is ignored, which causes the tomographic result deviate from actual conditions.

Fig. 1
figure 1

Schematic plan of signals rays passing through tomographic area

To make a direct comparison of signal utilization and the number of voxels crossed by signals for various selected vertical boundaries, two schemes were designed for this experiment as follows

  • Scheme 1, vertical height is 10.4 km, vertical resolution is 0.8 km, total voxels is 8 × 7 × 13;

  • Scheme 2, vertical height is 8km, vertical resolution is 0.8 km, total voxels is 8 × 7 × 10.

First we analyzed the observed data from GPS network from Hong Kong at Day of Year (DOY) 139. Figure 2 gives the number of signals used once per hour. Figure 3 gives the percentage of voxels crossed by signals. The calculation shows that the average utilization of signals increased by 7.51 %, from 51.51 % (Scheme 1) to 59.02 % (Scheme 2), once the vertical boundary was lowered from 10.4 to 8 km. In addition, the percentage of voxels crossed by signals also increased by 2.73 %. Subsequently, we analyzed the data for 31 days of DOY 121 to 151, 2013. Figure 4 gives the number of signals used once per day, while Fig. 5 gives the percentage of voxels crossed by signals. Here, calculation shows that the average utilization of signals increased by 8.69 % from 47.44 % (Scheme 1) to 56.13 % (Scheme 2), once the vertical boundary was lowered from 10.4 to 8 km. In addition, the percentage of voxels crossed by signals also increased by 6.52 %. Consequently, in terms of utilization of signal rays, reasonable choice of vertical boundary plays an important role in using as much of existing observation as possible and improving the number of voxels crossed by signals.

Fig. 2
figure 2

Number of signals used for scheme A and B once per hour at DOY 139, 2013

Fig. 3
figure 3

Number of voxels crossed by signal rays for scheme A and B once per hour at DOY 139, 2013

Fig. 4
figure 4

Average number of signals used for scheme A and B for 31 days of DOY 121 to 151, 2013

Fig. 5
figure 5

Average number of voxels crossed by signal rays for scheme A and B for 31 days of DOY 121 to 151, 2013

Radiosonde data can provide accurate water vapor profiles at different altitudes (Niell et al. 2001; Adeyemi and Joerg 2012). In our study, therefore, the vertical boundary is determined on the basis of 3D water vapor distribution derived from radiosonde for many years. There is a radiosonde station 45004 in King’s Park of Hong Kong, from which average vertical water vapor profile and STD for 40 years (1974–2014) are given in Fig. 6. It can be seen in Fig. 6 that water vapor density is less than 0.2 g/m3 over the height of 8 km (10th layer), while the STD is less than 0.05 g/m3. Consequently, 8 km was selected as the vertical boundary of tomographic area for our study.

Fig. 6
figure 6

Average water vapor density and STD from radiosonde 45004 for 40 years from 1974 to 2014

3.2 Non-uniform symmetrical division of horizontal voxel

For the purpose of convenience, past tomography studies divided voxels on the horizontal direction by uniform horizontal spacing, which leads to a great differences in actual observed information among various voxels-relatively richer information in voxels centrally located in the tomographic area, and less to no observed information in lowest and peripheral voxels. Obtained as observed data from 12 Hong Kong receiver stations at UTC 00:00 DOY 121, 2013, Fig. 7 gives the actual condition of signal distribution of tomographic area. Relative color depth covering the tomographic area corresponds positively to the frequency of signals crossed the voxels. It follows from Fig. 7 that more signals pass through the tomographic center, while fewer signals cross the edge and bottom of the tomographic area. In addition, many voxels positioned at the periphery and lowest altitudes of the tomographic area are not crossed by signals. To alleviate this phenomenon, a novel, approach of non-uniform symmetrical division of horizontal voxels is therefore proposed. Assuming that the center of the tomographic area as starting point, as shown by point O in Fig. 8, and combined the actual distribution of signal rays crossing the tomographic area, horizontal voxels are then dispersed by increment and symmetrical horizontal spacing along both longitudinal and latitudinal directions. Along both directions of longitude and latitude, the horizontal voxel spacing in Fig. 8 should adhere to the following relationship: \(Ld1 \le Ld2 \le Ld3 \le Ld4\), \(Bd1 \le Bd2 \le Bd3 \le Bd4\). Here, \(Ld\) and \(Bd\) are the distance in latitude and longitude directions. Finally, the tomographic area is discretized into non-uniform symmetrical horizontal spaces. The central principle of this method is that for those more densely areas which signals passing through, a small horizontal spacing is exploited; while a relative wider spacing is used for those less densely areas that signals passing through, which makes the observed information evenly distributed among voxels as soon as possible.

Fig. 7
figure 7

Three dimensional distribution of signal rays crossing the tomographic area

Fig. 8
figure 8

Schematic representation of non-uniform symmetrical division of horizontal voxels

Based on Sect. 3.1 and the idea of non-uniform symmetrical division of horizontal voxels, we consequently designed five schemes for tomographic experiment to validate the proposed method. scheme A is designed based on the idea of traditional uniform division of horizontal voxels. Schemes B–E are designed on the basis of non-uniform division of horizontal voxels following the principle that small horizontal spacing is exploited close to the center of tomographic area, while relative big horizontal spacing is utilized at the margin of tomographic area.

The range of tomographic area in our study is latitude N22.19°–N22.54°, longitude E113.87°–E114.35°, while the vertical resolution is 0.8 km and the total number of voxels is 7 × 8 × 10. The selected GPS stations are symbolized in Fig. 9 and their details are given in Table 1. The specific details of various schemes are shown in Table 2.

Fig. 9
figure 9

The geographic distribution of Hong Kong GPS network, radiosonde station and rainfall station

Table 1 Specific information of Hong Kong satellite positioning reference station network
Table 2 Specific information of steps in latitude and longitude directions for various schemes

To analyze the influence of different approaches on the number of voxels crossed by signals, the experiments were performed. Figure 10 gives the statistical result for the number of voxels crossed by signals between schemes A and B, (a) shows the statistical result of once per day for the period of DOY 121 to 151, 2013, and (b) gives the result of once per hour at DOY 130, 2013. It can be seen from Fig. 10 that, under the condition that the number of signals crossed the tomographic area are same, the number of voxels was increased by exploiting the approach of non-uniform symmetrical division of horizontal voxels. Pertinent calculation shows that the average number of voxels increased by 9.06 % from 58.40 to 67.46 % under the method proposed here.

Fig. 10
figure 10

Number of voxels crossed by rays under different division of horizontal voxel

4 Validation of the proposed method

As mentioned above, an accurate water vapor profile can be acquired using radiosonde, which is often regarded as the reference of choice to evaluate water vapor information derived from other methods (Niell et al. 2001; Liu et al. 2013). As shown in Fig. 8, there is a radiosonde station 45004 in King’s Park of study area, where the sounding balloon is launched every day at UTC 00:00 and 12:00. In addition, water vapor density value can also be obtained using data provided by ECMWF, which provides global reanalysis data four times per day at UTC 00:00, 06:00, 12:00 and 18:00. The resolution level of data provided by the ECMWF for our study is 0.125° × 0.125°, and there are twelve grid points in tomographic area. Table 3 gives the geographic location of grid points. Therefore, the result derived from radiosonde and ECMWF data are used to validate the tomographic result from different schemes with respect to the IWV, water vapor profile and error distribution.

Table 3 The geographic location of ECMWF grid point in Hong Kong

4.1 IWV comparison

To make a direct comparison of integrated water vapor (IWV) derived from different schemes, the data covering 31 days for the period of DOY 121 to 151, 2013 is analyzed. To begin, IWV for the location of radiosonde station is calculated using the voxels information acquired from different tomographic schemes for the UTC 00:00 and 12:00 epoch, and then compared with IWV derived from radiosonde and ECMWF. Here, the Mėso-NH model is used to compute the IWV by vertical integration of the water vapor content (\(\rho_{w}\)) through the model vertical layers (Brenot et al. 2006):

$$\text{IWV} = \int_{L} {\rho_{w} \text{d}z}$$
(14)

Figure 11 shows the IWV comparison between schemes A, B and radiosonde for the period of DOY 121 to 151, 2013. Figure 12 shows the IWV comparison between schemes A, B and ECMWF for the period of DOY 121 to 151, 2013. Tables 4 and 5 give the statistical result for five schemes compared with radiosonde and ECMWF during the same period. It can be seen from Figs. 11 and 12 that IWV derived from scheme B are in good agreement with those from radiosonde and ECMWF, relative to those from scheme A. The statistical results from Tables 4 and 5 show that the RMS, STD, MAE and Bias of IWV calculated on the basis of non-uniform symmetrical division of horizontal voxels (scheme B–E) are superior to those of traditional method (scheme A).

Fig. 11
figure 11

Comparison of IWV time series derived from radiosonde and various tomographic result for the period of DOY 121 to 151, 2013

Fig. 12
figure 12

Comparison of IWV time series derived from ECMWF and various tomographic result for the period of DOY 121 to 151, 2013

Table 4 Statistical result of IWV between various schemes and radiosonde for 31 days for the period of DOY 121 to 151, 2013
Table 5 Statistical result of IWV between various schemes and ECMWF for 31 days for the period of DOY 121 to 151, 2013

4.2 Water vapor profile comparison

Although IWV results derived from schemes B–E agree well with those from the radiosonde, while also showing low RMS, STD, MAE and Bias, one may not conclude that the vertical water vapor distribution profile has been obtained correctly. For instance, if two vertical layers are exchanged arbitrarily, the calculated IWV value remains unchanged, but the vertical distribution of water vapor has changed. Therefore, to further test the accuracy of vertical water vapor density, water vapor profile comparisons between radiosonde, ECMWF and different schemes were performed at two dates, UTC 00:00 DOY 121, 2013 and 12:00 DOY 136, 2013 as shown in Fig. 13. The two epochs were selected because they correspond to the minimum and maximum IWV value for the period of 31 test days. In addition, we compared the water vapor density value derived from schemes A and B for 31 days with that from radiosonde for the location of radiosonde station at dates, UTC 00:00 and 12:00. Figures 14 and 15 give the RMS of comparison between radiosonde, ECMWF and schemes A and B for the period of DOY 121 to 151, 2013, respectively. Table 6 gives the statistical result of five schemes for 31 days.

Fig. 13
figure 13

Water vapor density profile derived from radiosonde, ECMWF and various tomographic result. Time period in a and b are 00:00 UTC DOY 121, 2013 and 12:00 UTC DOY 136, 2013, respectively

Fig. 14
figure 14

RMS of scheme A and B compared with radiosonde for 31 days for the period of DOY 121 to 151, 2013

Fig. 15
figure 15

RMS of scheme A and B compared with ECMWF for 31 days for the period of DOY 121 to 151, 2013

Table 6 Statistical result of water vapor density between radiosonde, ECMWF and different schemes for 31 days for the period of DOY 121 to 151, 2013 (unit: g/m3)

It can be seen from Fig. 13 that the 3D water vapor profile derived from scheme B has a good agreement with that from radiosonde. Compared with radiosonde, the tomographic result derived from scheme B for two selected epochs (RMS are 1.42 and 0.86 g/m3, respectively) are superior to that of scheme A (RMS are 2.01 and 1.55 g/m3, respectively), yet both schemes show poor results when compared with ECMWF. Figures 14 and 15 also show that the RMS of scheme B is less than that of scheme A. Table 6 tells us that five schemes may all obtain a reliable tomographic result, and when result derived from radiosonde is regarded as a reference, tomographic results derived from various schemes (RMS < 2 g/m3) are better than those from ECMWF (RMS = 2.89 g/m3). In current applications, radiosonde data is the main resource for numerical weather forecast models, therefore, which makes it possible to improve accuracy of numerical weather forecast models with highly-accurate water vapor information using GNSS tomographic technique. By comparison we may also conclude that the RMS, STD, MAE and Bias values for tomographic results based on non-uniform symmetrical division of horizontal voxels (scheme B–E) are better than those of traditional method (scheme A).

To further analyze the relationship between vertical water vapor distribution and altitude, the RMS of schemes A and B at different altitudes were calculated in comparison with radiosonde and ECMWF. In addition, we also calculated the relative error using the following formula (Chen and Liu 2014):

$${\text{re}} = \frac{{|\varvec{\rho}-\varvec{\rho}_{0} |}}{\varvec{\rho}}$$
(15)

where \({\text{re}}\) is the relative error. \(\varvec{\rho}_{0}\) is the water vapor density value derived from different tomographic schemes. \(\varvec{\rho}\) is the water vapor density value derived from radiosonde or ECMWF.

Figures 16 and 17 give the comparison of RMS and relative error between radiosonde, ECMWF and schemes A and B for the period of DOY 121 to 151, 2013. Relative error and RMS of scheme B are less than those of scheme A, which can be observed from Figs. 16 and 17, further validating the improved nature of the novel method over the traditional method. Generally, relative error increases with altitude because the water vapor density is very low in the upper layers and a small discrepancy between tomographic result and radiosonde/ECMWF would also lead to a large relative error value.

Fig. 16
figure 16

RMS and relative error change with height of various schemes compared with radiosonde for 31 test days

Fig. 17
figure 17

RMS and relative error change with height of various schemes compared with ECMWF for 31 test days

4.3 Error distribution

The water vapor density derived from radiosonde during the tested period of DOY 121 to 151, 2013 is regarded as the true value, by which we analyzed the error distribution for five schemes. Figure 18 shows the error distribution of various schemes and Table 7 gives the statistical result of error percentage. It can be found from the statistical result that the error distribution is concentrated on −3 to 3 g/m3 for five tomographic results. From Table 7 we can see that the percentage of schemes B–E are 92.42, 92.42, 92.26 and 91.94 %, separately, which is large than that of scheme A (87.42 %). According to the result mentioned above, we may reasonable to conclude that the error distribution of water vapor density derived from tomographic results based on non-uniform symmetrical division of horizontal voxel (scheme B–E) is more closely parallels radiosonde results than that of the traditional method (scheme A).

Fig. 18
figure 18

Error distribution of various schemes for 31 days

Table 7 Statistical result of error distribution

4.4 The influence of tomographic result under different weather conditions

As shown in Fig. 9, there is a precipitation station which locates at King’s park in the tomographic area. The influence of tomographic result was analyzed for some days under different weather conditions according to the rainfall information provided by Hong Kong Observatory. Radiosonde station is next to the rainfall station, the precipitation of rainfall station is thus regarded as a reference, for which Table 8 gives the statistical information for some days in May 2013.

Table 8 Statistics of precipitation of rainfall station in King’s Park for some days, May 2013 (unit: mm)

In our study, we analyzed tomographic result of five schemes compared with those derived from radiosonde under different weather conditions for 8 days in May 2013. Table 9 gives the RMS, MAE and Bias values from 8 days for five schemes, while Table 10 gives the statistical result for 8 days. Referring to Tables 9 and 10, one may deduce that all five schemes are able to yield reliable tomographic results under different weather conditions. In addition, the accuracy of tomographic result on the basis of non-uniform symmetrical division of horizontal voxels (scheme B–E) is closer to the true value than that of traditional method (scheme A).

Table 9 Comparison of water vapor density between various schemes and radiosonde for 8 days (unit: g/m3)
Table 10 Statistical result of various schemes for 8 days

4.5 SWV comparison

To further show the superiority of proposed method, we selected data from different weather conditions for 8 days based on Table 8. Only eleven stations in the research area are used to carry out the tomographic experiment, while the other station (HKSL) is considered as a tested station under the condition that other settings are unchanged. Calculated using observed GPS and meteorological data, and regarded as the true value, the SWV of signal rays from HKSL station pass through the entire test area. Thereby, the comparison between SWV calculated by GPS-observed and meteorological data and SWV derived from tomographic results of various schemes is performed 1 h per day, Shown in Table 11 are the average RMS and MAE calculated from different schemes for 8 days. Table 12 gives the statistical result for 8 days.

Table 11 RMS of SWV for different schemes for 8 days (Unit: mm)
Table 12 Statistical result of various schemes for 8 days

Tables 11 and 12 reveal that RMS and MAE value of schemes B–E are less than those of scheme A, while the differences between scheme B to scheme E are very slight. This shows that the accuracy of the proposed method (scheme B–E) is higher than that of traditional method (scheme A), and further exhibit the superiority of proposed method.

5 Conclusion

A novel, optimized approach of voxel division has been proposed for water vapor tomography. In this study, the vertical boundary is determined based on the 3D water vapor profile derived from radiosonde for 40 years, which avoids the waste of observation data in selecting vertical boundary based on experiences. In horizontal direction, the concept of non-uniform symmetrical division of horizontal voxels is proposed on the basis of actual distribution of signals passing through the tomographic area.

Five schemes are designed based on above idea to validate the proposed method, on which experiments were performed using data from the GPS network across all of May 2013. Radiosonde and ECMWF data are both used here to test the tomographic result. The result shows that between radiosonde, ECMWF and the five schemes, the RMS, STD, MAE and Bias values of tomographic result derived from the proposed approach are superior to that of traditional method. Therefore, if the principle of horizontal voxel division is followed according to the proposed method, a preferable result would be obtained than that of traditional method. The selecting method of top boundary and voxel division proposed in the paper provides a reference for water vapour tomography study, which enhances the utilization of signals used and the number of voxels crossed by rays, and has very important application value as well.