Introduction

Water vapor, an important part of the atmospheric system, has attracted great attention in the study of weather and long-term climate, such as in research on global water cycling and energy balances (Wang et al. 2007; Wang and Zhang 2009; Jin and Luo 2009). Traditional water vapor monitoring techniques mainly depend on radiosondes, satellite-based instruments, and water vapor radiometers. Such conventional techniques have limitations in capturing fine variations due to their low spatiotemporal resolutions. The drawbacks of these traditional techniques are increasingly clear with the growing demands of modern meteorological applications. To overcome the shortages of these techniques, a global positioning system (GPS) technique is extensively being used to compensate for these disadvantages. Bevis et al. (1992) first proposed the use of GPS measurements to sense precipitable water vapor (PWV). Great efforts have been conducted to sense PWV using GPS/GNSS measurements (Rocken et al. 1995; Li et al. 2015; Manandhar et al. 2017). There has been a rapid establishment of GNSS monitoring stations network at the regional, national and global scales. The PWV data retrieved from GNSS signal are widely used for the analysis of severe weather conditions, such as heavy rainfall (Adams et al. 2013; Zhang et al. 2015; Benevide et al. 2015), flood (Suparta and Rahman 2016; Huelsing et al. 2017), drought (Jiang et al. 2017) and typhoon events monitoring (Zhao et al. 2018).

Electromagnetic signals sent by a GNSS satellite through the neutral atmosphere are affected by troposphere refraction, resulting in a tropospheric delay. This delay along the zenith path is defined as zenith total delay (ZTD). The ZTD can be divided into two components, i.e., zenith hydrostatic delay (ZHD) and zenith wet delay (ZWD). Bevis et al. (1992) proposed an approach to derive the atmospheric weighted mean temperature \(({T_{\text{m}}})\) for deriving PWV from the ZWD of GNSS signals. The ZTD can be precisely estimated from GNSS observations using a precise point positioning (PPP) technique or differential approaches. The ZHD can also be accurately calculated from an empirical tropospheric delay model, and the ZWD can be calculated by subtracting ZHD from ZTD. Therefore, once the ZWD is calculated, the PWV value can be estimated. The role of \({T_{\text{m}}}\) in the process of PWV estimation is as follows (Davis et al. 1985; Askne and Nordius 1987; Bevis et al. 1994; Ross and Rosenfeld 1997):

$${\text{PWV}}=\Pi \cdot {\text{ZWD}}$$
(1)
$$\Pi =\frac{{{{10}^6}}}{{{\rho _{\text{w}}}{R_{\text{v}}}\left( {\frac{{{k_3}}}{{{T_{\text{m}}}}}+k_{2}^{\prime }} \right)}}$$
(2)
$${T_{\text{m}}}=\frac{{\int {(e/T){\text{d}}H} }}{{\int {(e/{T^2}){\text{d}}H} }}$$
(3)

where \(\Pi\) is the dimensionless atmospheric conversion factor, \({\rho _{\text{w}}}\) is the density of water, \({R_{\text{v}}}\) is the specific gas constant for water vapor, \(k_{2}^{\prime }\) and k3 are the atmospheric refractivity constants given in Bevis et al. (1994), e denotes the water vapor pressure (hPa) and T denotes the absolute temperature (Kelvin). The calculating precision of PWV depends on multiple error sources. According to the law of error propagation, the error of PWV can be deduced from (1) and (2), and the following formulas illustrate how the errors can affect PWV (Yao et al. 2014a),

$${\mu _{{\text{PWV}}}}=\Pi \cdot {\mu _{{\text{ZWD}}}}+{\text{ZWD}} \cdot {\mu _\Pi }$$
(4)
$${\mu _\Pi }=\frac{{{{10}^6}{k_3}}}{{{\rho _{\text{w}}}{R_{\text{v}}}{{({k_3}+k_{2}^{\prime }{T_{\text{m}}})}^2}}}{\mu _{{T_{\text{m}}}}}$$
(5)

where \({\mu _{{\text{PWV}}}}\), \(~{\mu _{{T_{\text{m}}}}}\), \({\mu _{{\text{ZWD}}}}\), and \({\mu _\Pi }\) denote errors from PWV, \({T_{\text{m}}}\), ZWD, and \(\Pi\), respectively. Currently, the International GNSS Service (IGS) provides the ZTD product with a precision of better than 5 mm (Byun and Bar-Sever 2009). The ZHD can be accurately estimated using numerical weather models or surface pressure observations (Hobiger et al. 2008a; Lu et al. 2016). In real-time, one can use a short-range forecast from a weather model to derive ZHD (Hobiger et al. 2008b; Lu et al. 2017). Thus, the ZWD can be accurately obtained. Yao et al. (2014a) noted that the estimation error of PWV caused by ZWD error is approximately 1.5 mm if the ZWD has an estimation error of 1 cm. The accuracy of \({T_{\text{m}}}\), under this circumstance, is considered to have a dominant role in precise PWV calculation. In other words, precisely estimating \({T_{\text{m}}}\) is the key to enhancing the precision of PWV estimation.

Multi-source atmospheric profiles, such as radiosonde profiles, the America National Centers for Prediction (NCEP) reanalysis data, and the European Center for Medium-Range Weather Forecasts (ECMWF) reanalysis data, can be used to calculate \({T_{\text{m}}}\). The \({T_{\text{m}}}\) can be accurately estimated at a specific position using these atmospheric profiles based on the integration method. An accurate empirical model for \({T_{\text{m}}}\) is needed to enhance the efficiency of \({T_{\text{m}}}\) estimation and provide easy accessibility for users. Numerous studies have been conducted on \({T_{\text{m}}}\) modeling, and a great number of empirical \({T_{\text{m}}}\) models have also been developed for retrieving GNSS-PWV. Bevis et al. (1992) first introduced the concept of GPS meteorology and a widely used \({T_{\text{m}}}\) model. The Bevis formula \(({T_{\text{m}}}=a+b{T_{\text{s}}}),\) was developed by investigating the correlation between surface temperature \(({T_{\text{s}}})\) and \({T_{\text{m}}}\). The Bevis model has gained increasing attention and it has been found that the coefficients of a and b are largely depend on location and season and should be re-estimated using local observations for use in specific areas (Bevis et al. 1992; Ross and Rosenfeld 1997). Yao et al. (2014a) analyzed the relationship between \({T_{\text{m}}}\) and surface meteorological parameters, such as \({T_{\text{s}}}\), \({P_{\text{s}}}\), and \({e_{\text{s}}}\), and then proposed a one-factor and a multi-factor \({T_{\text{m}}}\) models that take geographic and seasonal variations into account. The two new empirical models obtained better results around the globe. Ding (2018) developed a new global \({T_{\text{m}}}\) model, the NN model, based on the neural network algorithm, which only requires surface temperature as input. These empirical regression \({T_{\text{m}}}\) models can obtain excellent results if in situ surface meteorological observations are available. Most GPS/GNSS monitoring stations, however, are initially installed for the purposes of geodesy, i.e., no meteorological instruments are installed at these GPS/GNSS stations. Therefore, these \({T_{\text{m}}}\) models cannot be used for real-time retrieval of the GNSS-PWV due to the lack of in situ meteorological observations.

To obtain real-time retrieval of GNSS-PWV, Emardson and Derks (2000) proposed an empirical \({T_{\text{m}}}\) model in Europe, which takes the seasonal and latitude variation of \({T_{\text{m}}}\) into account and does not require in situ meteorological observations for the determination of real-time GNSS-PWV. In recent years, the non-meteorological parameter \({T_{\text{m}}}\) models have received great attention as they are widely used for real-time GNSS meteorology. Yao et al. (2015) improved the \({T_{\text{m}}}\) model developed by Emardson et al. and established a new empirical atmospheric conversion factor \((\Pi )\) model for low-latitude China. Yao et al. (2012) developed a global \({T_{\text{m}}}\) model, the Global Weighted Mean Temperature (GWMT) model, using the spherical harmonics method. This new model can obtain excellent mean results over the globe; however, a relatively worse result was observed in parts of the southern Pacific Ocean due to the uneven distribution of the radiosonde stations involved in modeling. Yao et al. (2013) improved the GWMT model by combining the Bevis formula and the global pressure and temperature (GPT) model (Böhm et al. 2007), and an improved global \({T_{\text{m}}}\) model, GTm-II, was proposed. Chen et al. (2014) established a new global empirical \({T_{\text{m}}}\) model, GTm_N, using the NCEP reanalysis outputs, which considers both annual and semi-annual variations of \({T_{\text{m}}}\). Yao et al. (2014b) further refined previous models and then constructed a new global \({T_{\text{m}}}\) model (GTm-III). He et al. (2017) developed a global \({T_{\text{m}}}\) model (GWMT-D), which considers the height correction of \({T_{\text{m}}}\) and yields remarkable performance over the globe. Furthermore, Huang et al. (2019) constructed a new global grid \({T_{\text{m}}}\) model (GGTm) by employing the sliding window algorithm, which takes both latitude and altitude variation of \({T_{\text{m}}}\) into account in modeling and performs excellent and reliable performance on a global scale.

Great efforts have been conducted towards developing global empirical \({T_{\text{m}}}\) models. However, high-precision empirical \({T_{\text{m}}}\) models for China are still lacking, which limits the real-time and high-precision application of GNSS meteorology for China. China is a vast territory that contains complex undulating terrain and diverse climate systems, especially in the Qinghai–Tibetan and Yunnan–Guizhou plateaus, where only a few radiosonde stations are available. Additionally, the GNSS-PWV sounding has not yet been fully conducted in China. In recent years, the Crustal Movement Observation Network of China (CMONOC), which contains over 260 GNSS monitoring stations, has been gradually applied in geodesy and geodynamics and can provide continuous long-term GNSS measurements for the potential application in PWV retrieval. Thus, there is an urgent need to develop a high-precision empirical \({T_{\text{m}}}\) model for China for real-time GNSS-PWV retrieval.

The GPT2w (Global Pressure and Temperature 2 Wet) model developed by Böhm et al. (2015) is one of the newly released global tropospheric delay models and can also provide \({T_{\text{m}}}\). It should be noted that the GPT2w has excellent performance in \({T_{\text{m}}}\) estimation compared to other existing empirical models. Several investigations have been conducted on the GPT2w model showing a systematic bias in \({T_{\text{m}}}\) calculation (Zhang et al. 2017; Huang et al. 2019), mainly because the vertical correction of \({T_{\text{m}}}\) is ignored. Generally, understanding the spatial–temporal characteristics of the \({T_{\text{m}}}\) lapse rate is helpful in establishing a more sophisticated \({T_{\text{m}}}\) model. However, the analysis of sophisticated seasonal characteristics of the \({T_{\text{m}}}\) lapse rate is still lacking. In this study, we investigate the spatial–temporal characteristics of the \({T_{\text{m}}}\) lapse rate in China using both gridded \({T_{\text{m}}}\) data and ellipsoidal height grid data from the Global Geodetic Observing System (GGOS) Atmosphere. An improved \({T_{\text{m}}}\) model, namely, IGPT2w, will be developed by refining the \({T_{\text{m}}}\) derived from GPT2w model for China. The performance of IGPT2w will be evaluated using gridded \({T_{\text{m}}}\) data and radiosonde profiles.

Data description and \({T_{\text{m}}}\) calculation

In this work, the gridded data derived of the GGOS Atmosphere are used to investigate the spatiotemporal characteristics of the \({T_{\text{m}}}\) lapse rate and also to develop the IGPT2w model. Both gridded \({T_{\text{m}}}\) data and radiosonde profiles are employed to validate the performance of new model.

Gridded data from GGOS Atmosphere

GGOS Atmosphere can provide continuous long-term global surface atmospheric grid products, such as \({T_{\text{m}}}\), ZHD, and ZWD, which are derived from ECMWF reanalysis data and all correspond to the ellipsoidal heights. These surface grid data have a temporal resolution of 6 h and a spatial resolution of \(2.5^\circ \times 2^\circ {\text{ }}({\text{lon}}. \times {\text{lat}})\) (http://ggosatm.hg.tuwien.ac.at). In addition, the global ellipsoidal height grid data, which have the same spatial resolution as the gridded \({T_{\text{m}}}\) data, are also provided by the GGOS Atmosphere. Yao et al. (2014b) evaluated global surface gridded \({T_{\text{m}}}\) data and showed that the gridded \({T_{\text{m}}}\) data are highly accurate and reliable, which can be used to investigate spatial–temporal characteristics of \({T_{\text{m}}}\) as well as for \({T_{\text{m}}}\) modeling. In this study, we utilize gridded \({T_{\text{m}}}\) data from 2007 to 2014 and ellipsoidal height grid data to investigate the \({T_{\text{m}}}\) vertical features and the seasonal variations of \({T_{\text{m}}}\) lapse rates and for \({T_{\text{m}}}\) modeling in China. The gridded \({T_{\text{m}}}\) data in 2015 are treated as reference values for validating the IGPT2w model.

Radiosonde profiles

The websites of the University of Wyoming freely provide global radiosonde observations (http://weather.uwyo.edu/upperair/sounding.html). Currently, there are in total more than 1500 radiosonde stations globally. The radiosonde profiles, with a resolution of 12 h, i.e., at UTC 00:00 and 12:00 every day, mainly include pressure level profiles and surface observations. In this study, a total of 89 radiosonde stations in China are selected (Fig. 1). The \({T_{\text{m}}}\) values in 2015 derived from radiosonde observations based on the integration method are regarded as reference values to evaluate the IGPT2w for China. Additionally, the \({T_{\text{s}}}\) data in 2015 obtained from radiosonde stations are used for the \({T_{\text{m}}}\) calculation in the Bevis formula \(({T_{\text{m}}}=70.2+0.72{T_{\text{s}}})\) and the PWVs in 2015 received from radiosonde profiles are used to investigate the impact of \({T_{\text{m}}}\) on GNSS-PWV. Here, for more convenient calculation, the integral formula (3) for calculating \({T_{\text{m}}}\) can be discretized as follows:

$${T_{\text{m}}}=\frac{{\int {(e/T){\text{d}}H} }}{{\int {(e/{T^2}){\text{d}}H} }}=\frac{{\mathop \sum \nolimits_{1}^{n} {F_1}({e_i},{T_i})\Delta {H_i}}}{{\mathop \sum \nolimits_{1}^{n} {F_2}({e_i},{T_i})\Delta {H_i}}}$$
(6)

where \({F_1}({e_i},{T_i})=\frac{{{e_i}}}{{{T_i}}}\); \({F_2}({e_i},{T_i})=\frac{{{e_i}}}{{T_{i}^{2}}}\); \({T_i}\) and \({e_i}\) are the average temperature and water vapor pressure at the ith layer of the atmosphere, respectively; \(\Delta {H_i}\) denotes the thickness of the atmosphere at the ith layer (m), and n is the number of layers. As e cannot directly be obtained from radiosonde profiles, it can be calculated by the following formulas (Bolton 1980; Wang et al. 2016):

$$e=\frac{{{\text{RH}} \cdot {P_{\text{v}}}}}{{100}}$$
(7)
$${P_{\text{v}}}=6.11 \times {10^{\left( {\frac{{7.5 \times {T_{\text{a}}}}}{{237.3+{T_{\text{a}}}}}} \right)}}$$
(8)

where RH is the relative humidity, \({P_{\text{v}}}\) and \({T_{\text{a}}}\) are the saturated vapor pressure and the atmospheric temperature in celsius \((T={T_{\text{a}}}+273.15)\) respectively.

Fig. 1
figure 1

Geographic and topographic distribution of the 89 radiosonde stations in China. The upward triangles denote the radiosonde stations

Improving the \({T_{\text{m}}}\) derived from GPT2w model in China

The GPT2w model can provide several meteorological parameters, such as \({T_{\text{m}}}\), Ts, Ps, es, and water vapor pressure lapse rate \((\lambda )\), with two horizontal resolutions of \(1^\circ \times 1^\circ\) and \(5^\circ \times 5^\circ\). The \({T_{\text{m}}}\) can be estimated using the following formula:

$$T_{{\text{m}}}^{{\text{r}}}={\beta _0}+{\beta _1}\cos \left( {2\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+{\beta _2}\sin \left( {2\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+~{\beta _3}\cos \left( {4\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+{\beta _4}\sin \left( {4\pi \frac{{{\text{doy}}}}{{365.25}}} \right)$$
(9)

where \(T_{{\text{m}}}^{{\text{r}}}\) is the \({T_{\text{m}}}\) at the grid height, doy is the day of year, and the other coefficients are stored in a regular grid of \(1^\circ \times 1^\circ\) and \(5^\circ \times 5^\circ\). The realization of the \({T_{\text{m}}}\) derived from the GPT2w model is accomplished by performing bilinear interpolation for the \({T_{\text{m}}}\) derived from four surrounding grids of the target position. The code and specific use of the GPT2w model are provided at the website of GGOS Atmosphere (http://ggosatm.hg.tuwien.ac.at/DELAY/SOURCE/). The goal of this work is to refine the \({T_{\text{m}}}\) derived from GPT2w in China, therefore, a sophisticated investigation of spatial–temporal characteristics for the \({T_{\text{m}}}\) lapse rate is needed.

Investigation of the \({T_{\text{m}}}\) lapse rate

Height differences between the grid height and the target height have always existed, especially in regions with highly undulating terrain such as western China. Several studies have shown that \({T_{\text{m}}}\) has a strong correlation with height (Zhang et al. 2017; He et al. 2017). The GPT2w model does not consider the height adjustment for \({T_{\text{m}}}\) calculation, which results in significant systematic bias for China (Huang et al. 2019). Therefore, it is necessary to perform a height adjustment for the \({T_{\text{m}}}\) derived from GPT2w model. In this section, we explore the vertical dependence of \({T_{\text{m}}}\) using gridded \({T_{\text{m}}}\) data in 2014 and ellipsoidal height grid data in China. The results are shown in Fig. 2.

Fig. 2
figure 2

\({T_{\text{m}}}\) changes with height in China. The scatter points indicate the annual mean gridded \({T_{\text{m}}}\) data in 2014 in China, and the red line indicates the linear fitted line for the vertical dependence of \({T_{\text{m}}}\)

Figure 2 shows an approximate linear relationship between \({T_{\text{m}}}\) and height. The vertical dependence of \({T_{\text{m}}}\), therefore, can be expressed as a linear formula:

$${T_{\text{m}}}=\gamma \times \delta h+k$$
(10)

where \(~\gamma\) indicates the \({T_{\text{m}}}\) lapse rate (K/km) and \(\delta h\) indicates the ellipsoidal height (km).

The \({T_{\text{m}}}\) lapse rate is undoubtedly a key parameter for the vertical correction of \({T_{\text{m}}}\). He et al. (2017) performed an analysis for the global distribution of the \({T_{\text{m}}}\) lapse rate and showed that the annual mean \({T_{\text{m}}}\) lapse rate varies with latitude and land–sea distribution across the globe. Nevertheless, relatively smaller variations for the annual mean \({T_{\text{m}}}\) lapse rate were observed in China, meaning that a uniformed model can be constructed for the vertical correction of \({T_{\text{m}}}\) in China. Similarly, Zhang et al. (2017) investigation on the four seasons of the \({T_{\text{m}}}\) lapse rate revealed a seasonal dependence of the \({T_{\text{m}}}\) lapse rate. However, a sophisticated investigation of seasonal characteristics for the \({T_{\text{m}}}\) lapse rate is still lacking. In this section, the gridded \({T_{\text{m}}}\) data from 2007 to 2014 and corresponding ellipsoidal height grid data in China are used to explore the seasonal variations of the \({T_{\text{m}}}\) lapse rate. In addition, the fast Fourier transform (FFT) algorithm is performed to detect the periodicity characteristics of the \({T_{\text{m}}}\) lapse rate. The results are shown in Fig. 3.

Fig. 3
figure 3

Seasonal variations of the \({T_{\text{m}}}\) lapse rate from 2007 to 2014 in China (top) and results of periodicity detection for the \({T_{\text{m}}}\) lapse rate using the FFT algorithm (bottom)

Figure 3 shows that the \({T_{\text{m}}}\) lapse rate clearly has seasonal variations. Both annual cycles and semi-annual cycles are detected. The \({T_{\text{m}}}\) lapse rate, therefore, can be modeled as the following formula:

$$\gamma ={\alpha _0}+{\alpha _1}\cos \left( {2\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+{\alpha _2}\sin \left( {2\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+~{\alpha _3}\cos \left( {4\pi \frac{{{\text{doy}}}}{{365.25}}} \right)+{\alpha _4}\sin \left( {4\pi \frac{{{\text{doy}}}}{{365.25}}} \right)$$
(11)

where doy is the day of year; \({\alpha _0}\) is the annual mean value of the \({T_{\text{m}}}\) lapse rate; and (\({\alpha _1}\), \({\alpha _2}\)) and (\({\alpha _3}\), \({\alpha _4}\)) are the coefficients of the annual and semi-annual periodicity of the \({T_{\text{m}}}\) lapse rate, respectively.

Development of the IGPT2w model in China

The goal of this study is to refine the \({T_{\text{m}}}\) derived from the GPT2w model and to develop an improved \({T_{\text{m}}}\) model for China, named IGPT2w. The model formula is as follows:

$$T_{{\text{m}}}^{{\text{t}}}=T_{{\text{m}}}^{{\text{r}}}+\gamma \times (\delta {h_{\text{t}}} - \delta {h_{\text{r}}})$$
(12)

where \(T_{{\text{m}}}^{{\text{t}}}\) is the \({T_{\text{m}}}\) derived from the IGPT2w model at the target height; \(T_{{\text{m}}}^{{\text{r}}}\) is the \({T_{\text{m}}}\) derived from the GPT2w model (9) at the grid height; and \(\delta {h_{\text{t}}}\) and \(\delta {h_{\text{r}}}\) are the target height and the grid height in km, respectively. For the \({T_{\text{m}}}\) lapse rate \((\gamma),\) which can be estimated by performing the least squares method with gridded \({T_{\text{m}}}\) data from 2007 to 2014 and ellipsoidal gridded height data for China based on (11), the coefficients of the \({T_{\text{m}}}\) lapse rate model are shown in Table 1.

Table 1 Coefficients of the \({T_{\text{m}}}\) lapse rate model calculated using gridded \({T_{\text{m}}}\) data from 2007 to 2014 and ellipsoidal gridded height data for China

Here, a description of the use of the IGPT2w model is provided. There are two steps to realize the \({T_{\text{m}}}\) derived from the IGPT2w model. First, the vertical correction values of \({T_{\text{m}}}\) are computed for the four surrounding grids of the target position using the \({T_{\text{m}}}\) lapse rate model. Next, the \({T_{\text{m}}}\) at the target height derived from the IGPT2w model is obtained by conducting a bilinear interpolation for the \({T_{\text{m}}}\) derived from four surrounding grids of the target position. The \({T_{\text{m}}}\) is derived from the IGPT2w model with the same horizontal resolution as the GPT2w model, i.e., with resolutions of \(1^\circ \times 1^\circ\) and \(5^\circ \times 5^\circ\).

Validation of IGPT2w

The \({T_{\text{m}}}\) values derived from the gridded data from the GGOS Atmosphere and the radiosonde data are treated as reference values to evaluate the performance of the IGPT2w model. The results are also compared with the GPT2w model and the Bevis formula \(({T_{\text{m}}}=70.2+0.72{T_{\text{s}}}).\) As mentioned above, both the IGPT2w and GPT2w models have two horizontal resolutions of \(1^\circ \times 1^\circ\) and \(5^\circ \times 5^\circ\). For convenience, we defined these two resolutions for each model as IGPT2w-1, IGPT2w-5, GPT2w-1, and GPT2w-5, respectively. In this study, the bias and RMS error are regarded as criteria to assess the accuracy of the models and are calculated using the following formulas:

$${\text{bias}}=\frac{1}{N}\mathop \sum \limits_{{i=1}}^{N} (X_{{\text{m}}}^{{{M_i}}} - X_{{\text{m}}}^{{{R_i}}})$$
(13)
$${\text{RMS}}=\sqrt {\frac{1}{N}\mathop \sum \limits_{{i=1}}^{N} {{(X_{{\text{m}}}^{{{M_i}}} - X_{{\text{m}}}^{{{R_i}}})}^2}}$$
(14)

where \(X_{{\text{m}}}^{{{M_i}}}\) indicates the value calculated by the model, \(X_{{\text{m}}}^{{{R_i}}}\) indicates the reference value, and N indicates the number of samples.

Comparison with gridded data from the GGOS Atmosphere

In this experimental area, i.e., the longitude and longitude vary from \(70^\circ {\text{E}}\) to \(135^\circ {\text{E}}\) and \(15^\circ {\text{N}}\) to \(55^\circ {\text{N}}\), there are a total of 540 \({T_{\text{m}}}\) grids. The gridded \({T_{\text{m}}}\) data in 2015 are treated as reference values to evaluate the IGPT2w-1, IGPT2w-5, GPT2w-1, GPT2w-5, and Bevis models. For the Bevis formula \(({T_{\text{m}}}=70.2+0.72{T_{\text{s}}}),\) since in situ \({T_{\text{s}}}\) measurements at each grid are unavailable, the \({T_{\text{s}}}\) value derived from GPT2w-1 can serve as alternative data for the Bevis formula. Therefore, the statistical results of the annual bias and RMS values of the different models tested by gridded \({T_{\text{m}}}\) data for 2015 can be obtained and are shown in Table 2 and Figs. 4 and 5.

Table 2 Statistical results of different models validated using 2015 gridded \({T_{\text{m}}}\) data from the GGOS Atmosphere
Fig. 4
figure 4

Distribution of bias in different models validated using gridded \({T_{\text{m}}}\) data for 2015 over China

Fig. 5
figure 5

Distribution of RMS error in different models validated using gridded \({T_{\text{m}}}\) data from 2015 in China

Table 2 shows that GPT2w-5 and GPT2w-1 have large bias values that range from − 13.52 to 17.56 K and − 2.14 to 10.30 K, respectively. The Bevis formula has the largest annual mean bias, and both IGPT2w-5 and IGPT2w-1 show stable and small bias values. Similarly, larger RMS values are also obtained for GPT2w-5 and GPT2w-1. The GPT2w-1 has slightly better results than GPT2w-5, while the Bevis formula has the largest annual mean RMS value. In terms of RMS, IGPT2w-5 and IGPT2w-1 have improved by approximately 1.1 K (25%) and 0.5 K (13%) against GPT2w-5 and GPT2w-1, respectively. Thus, the IGPT2w shows significant improvement compared to GPT2w in calculating \({T_{\text{m}}}\). Figure 4 shows large warm biases in parts of western China for both GPT2w-5 and GPT2w-1, especially in the Tibetan area for GPT2w-5. The Bevis formula also has large warm biases in parts of northwest China, especially in the Qinghai–Tibetan Plateau, while both IGPT2w-5 and IGPT2w-1 have stable and small biases in China. Figure 5 shows larger RMS in parts of western China for GPT2w-5 and the Bevis formula, especially in the Tibetan, Qinghai, and Xinjiang areas, which are mainly affected by the areas’ complex terrain as GPT2w and the Bevis formula do not consider the vertical correction of \({T_{\text{m}}}\). GPT2w-1 produces a relatively small RMS compared to GPT2w-5 by improving the spatial resolution of model parameters, but large RMS values still exist in parts of western China. Since the diurnal variations of \({T_{\text{m}}}\) are ignored in all models, relatively larger RMS values are observed in parts of northeast China, especially for the Bevis formula, where the diurnal variations of \({T_{\text{m}}}\) are more significant than in other areas (Fig. 6). Nevertheless, the significant errors of the GPT2w and the Bevis formula in high-altitude areas are largely eradicated by the IGPT2w, especially western China. Thus, IGPT2w represents a marked improvement relative to other models in areas with highly undulating terrain. Besides, for IGPT2w, it appears the introduction of the improved altitude correction for \({T_{\text{m}}}\) reduces the need for a high spatial resolution of the GPT2w model for \({T_{\text{m}}}\).

Fig. 6
figure 6

Distribution of amplitudes of \({T_{\text{m}}}\) diurnal periodicity across China. The amplitude values of \({T_{\text{m}}}\) diurnal periodicity are calculated using 2015 gridded \({T_{\text{m}}}\) data from GGOS Atmosphere

Comparison with radiosonde data

Radiosonde profiles are in situ observations, which are often regarded as the reference values when assessing the performance of other meteorological observing networks or models due to their excellent accuracy and reliability. In this study, a total of 89 radiosonde stations evenly distributed over China are selected (Fig. 1) to validate the performance of IGPT2w and competing \({T_{\text{m}}}\) models. The \({T_{\text{m}}}\) values are calculated from radiosonde profiles stably providing data with 12 h time resolution for half a year. For the Bevis formula, the \({T_{\text{s}}}\) can also be derived from the radiosonde record. The abnormal radiosonde profiles are removed to ensure the reliability of the assessment. The statistical results of the different models are shown in Table 3 and Figs. 7 and 8.

Table 3 Statistical results of different models validated using radiosonde data for 2015
Fig. 7
figure 7

Distribution of bias for different models tested using radiosonde data for 2015

Fig. 8
figure 8

Distribution of RMS for different models tested using radiosonde data for 2015

In Table 3, one can see that GPT2w-5 and GPT2w-1 show larger mean cold bias, which illustrates that GPT2w has a significant systematic bias. The Bevis formula shows a relatively larger mean warm bias, while both IGPT2w-5 and IGPT2w-1 present stable and small biases for China, verifying that the systematic errors of GPT2w are largely eradicated by IGPT2w. In terms of RMS, GPT2w-5 still shows the largest RMS among the models, the Bevis formula has similar performance as GPT2w-1, and IGPT2w-5 and IGPT2w-1 show an approximately 0.95 K (20%) and 0.65 K (15%) improvement compared to GPT2w-5 and GPT2w-1, respectively. Clearly the IGPT2w model performs better than other models for China. Figure 7 reflects the larger cold bias in parts of both western China and north of China for GPT2w-5 and GPT2w-1, where significant warm biases are observed for the Bevis formula. In addition, the Bevis formula shows significant cold bias in parts of southeast China, while both IGPT2w-5 and IGPT2w-1 have small and stable biases in all of China. In Fig. 8, one can see that for the Bevis formula the GPT2w-5 and GPT2w-1 still have larger RMS values in parts of western and northern China, especially in Xinjiang province, which is mainly due to the highly undulating terrain and the significant diurnal variations of \({T_{\text{m}}}\). For both IGPT2w-5 and IGPT2w-1, those errors are strongly reduced. The largest remaining errors are observed in parts of northeast China, which are mainly due to not accounting for the diurnal variation of \({T_{\text{m}}}\). Overall, both IGPT2w-5 and IGPT2w-1 show stable and excellent performance over China. This further illustrates that IGPT2w shows marked improvement compared to GPT2w in China, especially in areas with strong topographic fluctuations. Additionally, the results of bias and RMS at all radiosonde stations were further analyzed, and the error distribution of the different models are shown in Fig. 9.

Fig. 9
figure 9

Histogram of bias and RMS for different models tested using radiosonde data for 2015

From Fig. 9 one can see that GPT2w-5 and GPT2w-1 show larger and significant negative biases. Both have a great proportion of biases below − 3 K, and the GPT2w-5 has the largest absolute bias of 13.9 K at the Ruoqiang site, located in Xinjiang province. Although the biases of the Bevis formula are relatively concentrated around zero, the number of strong negative biases is approximately equal to the number of positive ones, resulting in a relatively small mean bias. The biases of IGPT2w-5 and IGPT2w-1 are small and highly concentrated around zero and are less than 2 K at almost all stations. In terms of RMS, a large proportion of values above 5 K are observed for the Bevis formula, GPT2w-5 and GPT2w-1, while for IGPT2w-5 and IGPT2w-1 the RMS values have a large proportion below 4 K and show high concentration. These results further illustrate that the large errors of GPT2w in estimating \({T_{\text{m}}}\) are largely removed by IGPT2w.

To analyze the seasonal performance of different models the daily results at all selected stations of each model were statistically analyzed, and the bias and RMS values of different models are varying with the day of year (DOY) are shown in Fig. 10.

Fig. 10
figure 10

Results of different models validated using radiosonde data during different days of the year

In Fig. 10, one can see that both GPT2w-5 and GPT2w-1 show significant cold bias during most DOYs, and larger values are observed during spring and winter days, which further indicates that GPT2w has a significant systematic bias in calculating \({T_{\text{m}}}\). The Bevis formula presents a relatively clear warm bias during the spring and a relatively significant cold bias during the summer. Both IGPT2w-5 and IGPT2w-1 show smaller biases without obvious seasonal variation during most DOYs. In terms of RMS, all of these models show relatively clear seasonal variation, with relatively larger RMS values during spring and winter days and smaller ones during summer days. This is because most of the selected radiosonde stations are located in the middle latitudes where \({T_{\text{m}}}\) changes are small during the summer and larger during the winter. In addition, both GPT2w-5 and GPT2w-1 have larger RMS values than other models at most DOYs. In all, both IGPT2w-5 and IGPT2w-1 have stable and smaller RMS values compared to other models, demonstrating superior seasonal performance.

A great number of studies showed that \({T_{\text{m}}}\) has strong correlations with altitude and latitude (Emardson and Derks 2000; He et al. 2017; Zhang et al. 2017). To investigate how the bias and RMS of different models vary with altitude the 89 selected radiosonde stations were sorted into five groups in terms of altitude, i.e., lower than 500, 500–1000, 1000–1500, 1500–2000, and above 2000 m. The results of the bias and RMS for each altitude range are shown in Fig. 11.

Fig. 11
figure 11

Results of bias and RMS of the different models at different altitude ranges

Figure 11 shows that both GPT2w-5 and GPT2w-1 show marked cold bias in all altitude ranges, further indicating that GPT2w has a clear systematic bias in China. The Bevis formula shows larger and significant warm bias in altitudes above 500 m. In general, both bias and RMS of the Bevis formula increase with altitude. In addition, the Bevis formula, GPT2w-5 and the GPT2w-1 present larger RMS values in altitudes above 500 m, which is mainly due to the strongly undulating terrain, while in terms of bias and RMS, IGPT2w-5 and IGPT2w-1 have both stable and excellent performance in all altitude ranges. In all, IGPT2w has superior performance in high-altitude areas compared to other models, although a relative smaller improvement is observed in the altitude range of 1500–2000 m, which is mainly due to the small number of stations located in this altitude range.

Additionally, the relationships between latitude and bias and RMS in different models were also investigated. The 89 selected radiosonde stations were sorted in terms of latitude in 5° intervals. The results are shown in Fig. 12.

Fig. 12
figure 12

Results of bias and RMS of the different models at different latitude ranges

From Fig. 12, one can see that both GPT2w-5 and GPT2w-1 still show larger and significant cold biases in all latitude ranges, especially in the areas with a latitude range of 30°–40°. The Bevis formula has a clear warm bias in the latitudes above 35°, while both IGPT2w-5 and IGPT2w-1 show stable and smaller biases in all latitude ranges. The RMS values of all models, in general, increase with increasing latitude in China because the amplitudes of \({T_{\text{m}}}\) at low latitudes are much smaller than those at high latitudes (Yao et al. 2014b). For all models, relatively larger RMS values are observed in the latitudes above 40°, where the majority stations are located in northeast China, Xinjiang, and Inner Mongolia province, while in these areas the diurnal variations of \({T_{\text{m}}}\) are more significant than in other areas. Nevertheless, both IGPT2w-5 and IGPT2w-1 still perform significantly superior against other models in the latitudes above 30°.

Impact of \({T_{\text{m}}}\) on GNSS-PWV

The purpose of estimating \({T_{\text{m}}}\) is to convert the ZWD into GNSS-PWV. The GNSS stations and radiosonde stations are generally not co-located. In addition, most of the GNSS stations are not equipped with meteorological instruments. Therefore, it is hard to conduct a reliable and comprehensive investigation of the impact of \({T_{\text{m}}}\) on GNSS-PWV. However, several studies have been conducted to investigate the impact of \({T_{\text{m}}}\) on its resultant GNSS-PWV in terms of theoretical models (Wang et al. 2005, 2016; Huang et al. 2019). In this work, a similar approach is conducted to investigate the impact of \({T_{\text{m}}}\) on GNSS-PWV, and the widely used formula of RMS values between \({T_{\text{m}}}\) and PWV can be written as

$$\frac{{{\sigma _{{\text{PWV}}}}}}{{{\text{PWV}}}}=\frac{{{\sigma _\Pi }}}{\Pi }=\frac{{{k_3}{\sigma _{{T_{\text{m}}}}}}}{{\left( {k_{2}^{\prime }+\frac{{{k_3}}}{{{T_{\text{m}}}}}} \right)T_{{\text{m}}}^{2}}}=\frac{{{k_3}}}{{\left( {k_{2}^{\prime }+\frac{{{k_3}}}{{{T_{\text{m}}}}}} \right){T_{\text{m}}}}} \cdot \frac{{{\sigma _{{T_{\text{m}}}}}}}{{{T_{\text{m}}}}}$$
(15)

where \({\sigma _{{\text{PWV}}}}\) is the RMS error of PWV, \({\sigma _{{T_{\text{m}}}}}\) is the RMS error of \({T_{\text{m}}}\), \({T_{\text{m}}}\) and PWV are set to annual mean values, and the \({\sigma _{{\text{PWV}}}}/{\text{PWV}}\) is defined as the relative error of PWV. Thus, \({\sigma _{{\text{PWV}}}}\) and \({\sigma _{{\text{PWV}}}}/{\text{PWV}}\) are employed to assess the impact of the errors in \({T_{\text{m}}}\) on its resultant GNSS-PWV. In this section, 89 radiosonde stations are also selected throughout China, and the distribution of the theoretical results of \({\sigma _{{\text{PWV}}}}\) and \({\sigma _{{\text{PWV}}}}/{\text{PWV}}\) is shown in Fig. 13.

Fig. 13
figure 13

Theoretical RMS error and relative error of PWV resulting from the IGPT2w model using radiosonde data for 2015

Figure 13 shows relatively larger \({\sigma _{{\text{PWV}}}}\) values in southeast and south China than other areas for the IGPT2w-5 and IGPT2w-1 models, and the PWV values are larger than in other areas because PWV values play a dominant role in \({\sigma _{{\text{PWV}}}}\) calculation. The \({\sigma _{{\text{PWV}}}}\) values of the IGPT2w model are less than 0.59 mm and with a mean \({\sigma _{{\text{PWV}}}}\) value of 0.29 mm. In terms of \({\sigma _{{\text{PWV}}}}/{\text{PWV}}\), the IGPT2w has a mean value of 1.38% and ranges from 0.95 to 2.0%, which is more stable and smaller than when using \({T_{\text{m}}}\) from the other models under study. The IGPT2w is an empirical model, which can provide an accurate \({T_{\text{m}}}\) for retrieving accurate real-time PWV values. Therefore, IGPT2w has possible potential applications in nowcasting or real-time analysis of severe weather conditions, such as heavy rainfall, typhoons, and flood forecasting in China, and also can be used to retrieve high-precision GNSS-PWV values for the study of PWV transportation in the Tibetan plateau.

Conclusions

\({T_{\text{m}}}\) plays an important role in the process of retrieving PWV values from GNSS signals. In real-time GNSS-PWV sounding in China, an accurate and reliable empirical \({T_{\text{m}}}\) model is needed. The GPT2w is one of the state-of-the-art tropospheric models, which has excellent performance compared to existing empirical models for \({T_{\text{m}}}\) calculation. However, the GPT2w has a systematic error in \({T_{\text{m}}}\) estimation for China because the vertical correction of \({T_{\text{m}}}\) is ignored. We considered the vertical adjustment of \({T_{\text{m}}}\) in modeling, and the \({T_{\text{m}}}\) lapse rate model was developed for China using ellipsoidal height grid data and gridded \({T_{\text{m}}}\) data from the GGOS Atmosphere. Finally, an improved atmospheric weighted mean temperature model, IGPT2w, was developed by refining the \({T_{\text{m}}}\) derived from GPT2w in China.

The performance of IGPT2w was tested using both gridded \({T_{\text{m}}}\) data and radiosonde site records in 2015 over China, and comprehensive comparisons to GPT2w and the Bevis formula were also performed. The results show a strong performance enhancement of the IGPT2w model against other models for China, especially in western China, where larger errors of the GPT2w model were largely removed. The impact of \({T_{\text{m}}}\) derived from IGPT2w on GNSS-PWV was also investigated, showing that the mean values of \({\sigma _{{\text{PWV}}}}\) and \({\sigma _{{\text{PWV}}}}/{\text{PWV}}\) are 0.29 mm and 1.38% for IGPT2w in terms of GNSS-PWV retrieval, respectively.

In this work, the GPT2w had significant systematic errors in western China, and poorer results were achieved for the Bevis formula in both northwest and north China. IGPT2w had significant improvements compared to GPT2w in \({T_{\text{m}}}\) estimation over China, especially in western China. IGPT2w is an empirical model and has the ability to provide reliable and accurate \({T_{\text{m}}}\) values for China, making it more powerful for estimating real-time \({T_{\text{m}}}\) values and giving it wide potential in real-time GNSS-PWV sounding, especially for application in western China. In future work, we will focus on accounting for diurnal variation of \({T_{\text{m}}}\) in the model and will improve the estimation of \({T_{\text{m}}}\) from GPT2w on a global scale.