1 Introduction

In the pure rotational Raman (PRR) lidar technique, air temperature T is determined from a ratio Q(T) of backscattered signal intensities from two PRR spectrum bands of atmospheric molecules N2 and O2 [1,2,3,4]. For this reason, PRR lidars need to be calibrated and, therefore, a calibration function (CF) is required to retrieve temperature profiles from raw lidar signals. Several calibration methods aimed to improve temperature retrieval algorithms have been proposed in recent years [5, 6]. Behrendt et al. [7] made a correction for the elastic backscatter leakage into the nearest (to the laser line) PRR channel in the presence of cirrus clouds, whereas Su et al. [8] did the same for both PRR channels. Chen et al. [9] presented a novel method that provides accurate PRR lidar calibration under the low signal-to-noise ratio conditions. The corrections for the incomplete laser beam receiver-field-of-view overlap in the atmospheric boundary layer were proposed in [10,11,12]. The use of nonlinear CFs was shown in [13, 14] to increase the accuracy of temperature retrievals, especially in the troposphere where N2 and O2 PRR lines are significantly broadened by molecular collisions [15,16,17].

Two new techniques allowing retrieving temperature without lidar calibration have also been recently proposed and deserve attention. Weng et al. [18] designed a single-line-extracted PRR lidar system that effectively detects two isolated N2 PRR line signals together with the elastic backscatter signal. The system provides continuous temperature measurements without lidar calibration with external temperature measurement instruments (e.g., radiosondes). Mahagammulla Gamage et al. [19] reported on application of the Optimal Estimation Method (OEM) to retrieve temperature profiles along with instrumental (lidar) parameters from PRR lidar measurements. The PRR lidar technique that utilizes a reference temperature profile from an atmosphere model or radiosonde data for lidar calibration (determination of CF coefficients) is now called “traditional” one [19]. Both techniques [18, 19] have several advantages over the traditional PRR lidar one. For example, the OEM is computationally fast and practical for routine temperature retrievals. The same is valid for the single-line-extracted PRR lidar. However, the selection of the lidar system or retrieval method depends on the problem to solve. In some cases the traditional retrieval algorithm showed some advantages over the considered techniques. Analyzing the results presented in [18], one can see that the traditional algorithm more precisely retrieves temperatures from nighttime measurements and in the presence of cirrus clouds, whereas the OEM more precisely retrieves temperatures from daytime measurements and in the presence of low level clouds. Both 1 − σ absolute statistical uncertainty of temperature retrieval and difference between retrieved and radiosonde profiles presented in [19] are much larger compared to those that can be provided by the traditional retrieval algorithm. Moreover, the accuracy of temperature retrievals with the traditional algorithm can always be increased by selecting a suitable CF [17].

The influence of passband widths of interference filters and their central wavelength positions in N2 and O2 PRR spectrum on the accuracy of temperature measurements using PRR lidars was analyzed in detail by Hammann and Behrendt [20]. However, the authors did not consider the PRR line broadening in their simulation. Conversely, the collisional broadening of N2 and O2 PRR lines was taken into account in [17], but only for one set of spectral filter (SF) passbands in a PRR lidar receiving system. In this study, we estimate the effect of collisional PRR line broadening on the accuracy of tropospheric (0–11 km) temperature measurements and intercompare nine nonlinear CFs in the traditional PRR lidar technique via simulation. The simulation is performed for five frequently used sets of SF passband widths and two laser wavelengths (354.67 and 532 nm). Note that the design features of PRR lidars are not taken into account in this study.

2 Calibration functions

2.1 General calibration function

According to the PRR lidar theory [1, 2, 4], a simple temperature dependence can be obtained only for the intensity ratio of any two individual PRR lines i and k from N2 and/or O2 PRR spectrum:

$$Q^{{{\text{indiv}} .}} (T) = \frac{{I(J_{i} ,T)}}{{I(J_{k} ,T)}} = \exp \left( {\alpha + \frac{\beta }{T}} \right).$$
(1)

Here, Ji and Jk are the rotational quantum numbers of the initial states of the PRR transitions corresponding to lines i and k, and α and β are the constants defined from the theory. Taking the natural logarithm of Eq. (1), we have:

$${ \ln }Q^{{{\text{indiv}} .}} (T) = \alpha + \frac{\beta }{T}\quad \Leftrightarrow \quad y = \alpha + \beta x,$$
(2)

where x = 1/T is the reciprocal temperature and y = lnQindiv(T). The desired temperature is easily derived from the ratio of intensities:

$$T = \frac{\beta }{{\ln Q^{{{\text{indiv}} .}} - \alpha }}.$$
(3)

In practical measurements, SFs in PRR channels of a lidar receiving system extract two bands from the backscattered light spectrum [4]. Both bands, containing several adjacent lines from the Stokes and/or anti-Stokes branches of N2 and O2 PRR spectrum, are selected so that the intensities of the lines entering them have the opposite temperature dependence (Fig. 1). The lines, corresponding to low (Jlow) and high (Jhigh) rotational quantum numbers of the initial states of the PRR transitions, fall into the near and far (to the laser line) PRR channels (channels Jlow and Jhigh), respectively. The intensity of each N2 PRR line with Jlow ≤ 8 (Jlow ≤ 11 for O2 PRR lines) decreases with increasing temperature and, conversely, the intensity of N2 PRR lines with Jhigh ≥ 9 (Jhigh ≥ 13 for O2 PRR lines) increases with increasing temperature, in both branches of the spectrum. The choice of such lines makes it possible to minimize the overlap of the SF transmission functions of PRR channels (Fig. 1). Note that only odd lines beginning with odd J exist in O2 PRR spectrum [21]. Therefore, we have to consider the following ratio [2]:

$$Q^{\varSigma } (T) = \frac{{I_{\text{low}}^{\varSigma } (T)}}{{I_{\text{high}}^{\varSigma } (T)}} = \frac{{\left[ {\sum\nolimits_{{J_{i} }} {\sum\nolimits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {I_{i} (J_{i} ,T)} } } \right]_{\text{low}} }}{{\left[ {\sum\nolimits_{{J_{i} }} {\sum\nolimits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {I_{i} (J_{i} ,T)} } } \right]_{\text{high}} }},$$
(4)

where Ii(Ji, T) are the intensities of N2 and O2 individual PRR lines, corresponding to rotational quantum numbers Ji, \(I_{{\text{low}}}^{\varSigma } (T)\) and \(I_{\text{high}}^{\varSigma } (T)\) are the overall intensities of the PRR lines that enter the corresponding channels Jlow and Jhigh, indices “low” and “high” show that summations in the numerator and denominator refer to the corresponding PRR spectrum bands.

Fig. 1
figure 1

Equidistant PRR spectra of N2 and O2 linear molecules, schematic drawing of filter transmission functions (FTF), and envelopes of N2 PRR spectrum at different temperatures. The laser beam wavelengths are 354.67 and 532 nm. The index over a spectral line denotes the rotational quantum number J of the initial state of the transition. All PRR line intensities are normalized to the intensity of N2 PRR line with J = 6 of the anti-Stokes branch at T = 220 K

The right side of Eq. (4) represents the ratio of the sums of exponential expressions that cannot be reduced to a simple function of temperature and, therefore, the intensity ratio QΣ(T) needs to be calibrated. Arshinov et al. [2] proposed the simplest (linear) CF in the form of Eq. (2) and, hence, the air temperature is determined from the ratio QΣ(T) similarly to Eq. (3):

$$T = \frac{{B_{0} }}{{\ln Q^{\varSigma } - A_{0} }},$$
(5)

where A0 и B0 are the calibration (fit) coefficients. To reduce the statistical uncertainties of temperature retrievals, the second-order polynomial was proposed in [13] to be used as a CF:

$$\ln Q^{\varSigma } = A_{1} + \frac{{B_{1} }}{T} + \frac{{C_{1} }}{{T^{2} }}\,\quad \Leftrightarrow \quad y = A_{1} + B_{1} x + C_{1} x^{2} ,$$
(6)

where A1, B1 and C1 are the calibration coefficients. The other types of CFs have been used for temperature retrievals in [3, 4, 14,15,16,17].

Equations (1)–(6) were obtained under the assumption that PRR lines are not broadened and, therefore, their profiles are described by the Dirac function. In fact, PRR lines are broadened due to both Doppler and molecular collision effects. PRR lines broadened only by the Doppler effect have a Gaussian profile, whereas the lines broadened only due to molecular collisions have a Lorentzian profile. The real PRR line profiles are described by a Voigt function that takes into account both types of broadening [22]. The molecular collision effect dominates over the Doppler one in the troposphere (Fig. 2) and the Lorentzian profile wings are higher than those of the Gaussian profile of the same full width at half maximum (FWHM) [22].

Fig. 2
figure 2

FWHMs of the Gaussian, Lorentzian, and Voigt profiles of N2 PRR line (with J = 6 of the anti-Stokes branch) for the laser wavelengths of 354.67 and 532 nm. The dependence of temperature on altitude is set by the tropospheric (0–11 km) temperature profile of the U.S. Standard Atmosphere (1976)

Owing to the long Lorentzian tails, each broadened PRR line j lying outside the SF passbands leads to the parasitic signal leakage into both PRR channels (Fig. 3). At the same time, the collisional broadening of any PRR lines i and k lying inside lidar channels Jlow and Jhigh, respectively, results in both parasitic signal leakage into neighboring PRR channel and desired signal loss in both channels. The PRR lidar CF that takes into account the collisional broadening of all PRR lines was derived in the general analytical form in [15]:

$$\ln Q(T) = \sum\limits_{n = - \infty }^{\infty } {\alpha_{n} T^{{\frac{n}{2}}} } ,$$
(7)

where αn are the calibration coefficients.

Fig. 3
figure 3

A schematic drawing (not to scale) of Voigt profiles of broadened PRR lines i, j, k and filter transmission functions (see Sect. 3.1) \(F_{\text{low}} (\widetilde{{{\nu }}})\) and \(F_{\text{high}} (\widetilde{{{\nu }}})\) of lidar channels Jlow and Jhigh approximated by a rectangle function F = 1. The wavenumber \(\widetilde{{{\nu }}}_{ 0}\) and intervals \((\widetilde{{{\nu }}}_{ 1} ;\widetilde{{{\nu }}}_{ 2} )\) and \((\widetilde{{{\nu }}}_{ 3} ;\widetilde{{{\nu }}}_{ 4} )\) correspond to the laser line and passbands of channels Jlow and Jhigh, respectively

2.2 Temperature retrieval functions

To obtain temperature profiles from raw lidar signals, a temperature retrieval function (TRF) T = T(Q) derived from the selected CF Q = Q(T) is required in the traditional retrieval algorithm. In the case of the linear (2) and quadratic (6) CFs, the corresponding TRFs are easily derived. When choosing the third-order polynomial in 1/T as a CF, it is necessary to solve a cubic equation, the presence and number of real roots of which depend on the signs of the polynomial coefficients. Hence, any CF requiring solving a n-order equation (n ≥ 3) to derive the corresponding TRF can hardly be used for temperature retrieval. As seen from Eq. (7), the general CF is a series and cannot be directly used in the temperature retrieval algorithm. For this reason, nine nonlinear CFs representing simple special cases of Eq. (7) were considered in [17]. All the special cases take the collisional broadening of N2 and O2 PRR lines into account in varying degrees. Below we give only the TRFs required for this study without specifying their initial nonlinear CFs. Since the linear CF completely ignores PRR line broadenings and its corresponding TRF retrieves temperature with larger errors (uncertainties) compared to other TRFs derived from nonlinear CFs [4, 15,16,17], we will not consider Eqs. (2) and (5). Each TRF i and its calibration constants Ai, Bi, Ci, etc. are numbered (i = 1, 2, …, 9) as in [17].

Let us consider three-coefficient TRFs, the first of which (TFR 1) is derived from the CF (6) proposed in [13]:

$$T = \frac{{2C_{1} }}{{ - B_{1} + \sqrt {B_{1}^{2} + 4C_{1} (\ln Q - A_{1} )} }}.$$
(8)

TRF 2 corresponds to a CF proposed in [15]:

$$T = \frac{{2B_{2} }}{{(\ln Q - A_{2} ) + \sqrt {(\ln Q - A_{2} )^{2} - 4B_{2} C_{2} } }}.$$
(9)

TRF 3 was first applied in [14]:

$$T = \frac{{C_{3} }}{{(\ln Q)^{2} + B_{3} \ln Q + A_{3} }}.$$
(10)

TRF 4 is derived from a CF proposed in [15]:

$$T = \frac{\ln Q}{{B_{4} (\ln Q)^{2} + A_{4} \ln Q + C_{4} }}.$$
(11)

The following two functions (TRFs 5 and 6) with three calibration constants were first used in [17]:

$$T = \left[ {\frac{{2C_{5} }}{{ - B_{5} + \sqrt {B_{5}^{2} + 4C_{5} (\ln Q - A_{5} )} }}} \right]^{\;2} ,$$
(12)
$$T = \left[ {\frac{{2B_{6} }}{{(\ln Q - A_{6} ) + \sqrt {(\ln Q - A_{6} )^{2} - 4B_{6} C_{6} } }}} \right]^{\;2} .$$
(13)

TRFs 7, 8, and 9 with four calibration constants were also first proposed and used in [17]:

$$T = \frac{{D_{7} }}{{(\ln Q)^{3} + C_{7} (\ln Q)^{2} + B_{7} \ln Q + A_{7} }},$$
(14)
$$T = \frac{{(\ln Q)^{2} }}{{B_{8} (\ln Q)^{3} + A_{8} (\ln Q)^{2} + C_{8} \ln Q + D_{8} }},$$
(15)
$$T = \frac{\ln Q}{{C_{9} (\ln Q)^{3} + B_{9} (\ln Q)^{2} + A_{9} \ln Q + D_{9} }}.$$
(16)

The CFs which are initial for TRFs 1–9 along with the equations for the absolute and relative statistical uncertainties of temperature retrievals can be found in [17] and in the electronic supplementary material (Table S1).

3 Numerical simulation technique

In the next three Sects. 3.13.3, we make some assumptions to simplify the numerical calculation of the intensity ratio Q = Q(T).

3.1 Spectral filter transmission functions

Let us consider only the anti-Stokes branch of N2 and O2 PRR spectrum for definiteness. Due to the broadening of PRR lines and their contribution to the signals detected in both lidar channels, we can write [15] for the intensity ratio instead of Eq. (4):

$$Q^{\text{all}} (T) = \frac{{I_{\text{low}}^{\text{all}} (T)}}{{I_{\text{high}}^{\text{all}} (T)}} = \frac{{\sum\limits_{{J_{i} }} {\sum\limits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {\left[ {I_{i} (J_{i} ,T) \cdot X_{\text{low}}^{i} (\widetilde{\nu }_{i} ,T)} \right]} } }}{{\sum\limits_{{J_{i} }} {\sum\limits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {\left[ {I_{i} (J_{i} ,T) \cdot X_{\text{high}}^{i} (\widetilde{\nu }_{i} ,T)} \right]} } }}.$$
(17)

Here, the summations are carried out over all rotational quantum numbers Ji of the initial states of N2 and O2 PRR transitions, \(\widetilde{\nu }_{i} = \widetilde{\nu }(J_{i} )\) and Ii(Ji, T) are the central wavenumber and total intensity of i broadened PRR line with Ji, respectively. The functions \(X_{\text{low}}^{i}\) and \(X_{\text{high}}^{i}\) describe the fractions of the intensity Ii(Ji, T) that fall within the SF passbands of channels Jlow and Jhigh, respectively, and \(I_{\text{low}}^{\text{all}} (T)\) and \(I_{\text{high}}^{\text{all}} (T)\) are the overall intensities detected in the corresponding lidar channels (with contributions from all broadened lines). The functions \(X_{\text{low}}^{i}\) and \(X_{\text{high}}^{i}\) are defined by the following expressions:

$$X_{\text{low}}^{i} (\widetilde{\nu }_{i} ,T) = \int_{0}^{ + \infty } {F_{\text{low}} (\widetilde{\nu }) \cdot V(\widetilde{\nu }_{i} ,\widetilde{\nu },T){\text{d}}\widetilde{\nu }} ,$$
(18)
$$X_{\text{high}}^{i} (\widetilde{\nu }_{i} ,T) = \int_{0}^{ + \infty } {F_{\text{high}} (\widetilde{\nu }) \cdot V(\widetilde{\nu }_{i} ,\widetilde{\nu },T){\text{d}}\widetilde{\nu }} ,$$
(19)

where \(F_{\text{low}} (\widetilde{\nu })\) and \(F_{\text{high}} (\widetilde{\nu })\) are the SF transmission functions of channels Jlow and Jhigh, respectively, and \(V(\widetilde{\nu }_{i} ,\widetilde{\nu },T)\) is the Voigt profile of i broadened PRR line (Fig. 3).

There are several ways to approximate SF transmission functions by an analytical expression. For example, \(F_{\text{low}} (\widetilde{\nu })\) and \(F_{\text{high}} (\widetilde{\nu })\) are approximated by a Gaussian function [3] or by the function proposed in [23]. However, this complicates the calculation of integrals (18) and (19) and, therefore, ratio (17). As shown in [15], the SF transmission functions can be approximated by a piecewise-constant (staircase) function with any desired accuracy. To further simplify the calculations, the staircase function can be replaced by a constant one on the wavenumber interval, within which a SF passes the bulk of the backscattered signal intensity in the corresponding PRR channel (Fig. 3). For example, in the case of channel Jlow we can write:

$$F_{\text{low}} (\widetilde{\nu }) = \left\{ \begin{aligned} & 0,\quad \widetilde{\nu } \le \widetilde{\nu }_{1} \hfill \\ & F = {\text{const,}}\quad \widetilde{\nu } \in \hfill \\ & 0,\quad \widetilde{\nu } \ge \widetilde{\nu }_{2} \hfill \\ \end{aligned} \right.(\widetilde{\nu }_{1} ;\widetilde{\nu }_{2} ),$$
(20)

where \((\widetilde{\nu }_{1} ;\widetilde{\nu }_{2} )\) is the SF passband of the lidar channel. Without loss of generality, we can also put F = 1 for both SF transmission functions, and then instead of Eqs. (18) and (19) we get:

$$X_{\text{low}}^{i} (\widetilde{\nu }_{i} ,T) = \int_{{\widetilde{\nu }_{1} }}^{{\widetilde{\nu }_{2} }} {V(\widetilde{\nu }_{i} ,\widetilde{\nu },T){\text{d}}\widetilde{\nu }} ,$$
(21)
$$X_{\text{high}}^{i} (\widetilde{\nu }_{i} ,T) = \int_{{\widetilde{\nu }_{3} }}^{{\widetilde{\nu }_{4} }} {V(\widetilde{\nu }_{i} ,\widetilde{\nu },T){\text{d}}\widetilde{\nu }} ,$$
(22)

where \((\widetilde{\nu }_{3} ;\widetilde{\nu }_{4} )\) is the SF passband of channel Jhigh. Note that one can choose to use the FWHMs of the SF transmission functions as the SF passbands \((\widetilde{\nu }_{1} ;\widetilde{\nu }_{2} )\) and \((\widetilde{\nu }_{3} ;\widetilde{\nu }_{4} )\).

3.2 PRR line profiles

The backscatter profile of i PRR line inhomogeneously broadened only due to the Doppler effect is known to be described by a Gaussian function (normal distribution) [22]:

$$G(\widetilde{\nu }_{i} ,\widetilde{\nu },T) = \frac{1}{{\gamma_{G}^{i} \sqrt {2\pi } }}\exp \left[ { - \;\frac{{(\widetilde{\nu } - \mu_{i} )^{2} }}{{2(\gamma_{G}^{i} )^{2} }}} \right].$$
(23)

Here, \(\mu_{i} = \tilde{v}_{i}\) (cm–1) is the mathematical expectation (the central wavenumber of the line profile), and \(\gamma_{G}^{i}\) (cm–1) is the standard deviation defined by the equation:

$$\gamma_{G}^{i} = \widetilde{\nu }_{i} \sqrt {\frac{{k_{B} T}}{{m_{\text{air}} c^{2} }}} ,$$
(24)

where kB is the Boltzmann constant, mair is the average mass of air molecules, and c is the speed of light. The FWHM of the Gaussian profile is defined as:

$$\Delta \widetilde{\nu }_{{i,\,{\text{G}}}}^{\text{FWHM}} = 2\sqrt {2\ln 2} \cdot \gamma_{\text{G}}^{i} .$$
(25)

The backscatter profile of i PRR line homogeneously broadened only by molecular collisions is described by a Lorentzian function (Cauchy distribution) that is defined for the angular frequency ω (rad/s) [22]:

$$L(\omega_{i} ,\omega ,T) = \frac{1}{\pi } \cdot \frac{{\gamma_{L} }}{{(\omega - \omega_{i} )^{2} + \gamma_{\text{L}}^{2} }},$$
(26)

where ωi is the central angular frequency of the line profile, and γL (rad/s) specifies the half width at half maximum (HWHM) of the Lorentzian profile. Subsequently, the FWHM (rad/s) of the Lorentzian profile is defined as:

$$\Delta \omega_{\text{L}}^{\text{FWHM}} = 2\gamma_{\text{L}} .$$
(27)

After the substitution \(\omega = 2\pi c\widetilde{\nu }\), the Lorentzian function (26) can be expressed as a function of wavenumber:

$$L(\widetilde{\nu }_{i} ,\widetilde{\nu },T) = \frac{1}{\pi } \cdot \frac{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}{{(\widetilde{\nu } - \widetilde{\nu }_{i} )^{2} + \left( {{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}} \right)^{2} }}$$
(28)

with the corresponding FWHM defined in cm–1:

$$\Delta \widetilde{\nu }_{\text{L}}^{\text{FWHM}} = \frac{{\Delta \omega_{\text{L}}^{\text{FWHM}} }}{2\pi c} = \frac{{\gamma_{\text{L}} }}{\pi c},$$
(29)

where c is the speed of light in cm/s. For a two-component gas (e.g., a mixture of N2 and O2 molecules in air), the formula for estimation of the FWHM (29) in cm–1 at each altitude z can be written as follows:

$$\Delta \widetilde{\nu }_{\text{L}}^{\text{FWHM}} = p_{1}^{2} n_{\text{air}} d_{1}^{2} \sqrt {\frac{{8k_{\text{B}} T}}{{\pi \mu_{1} c^{2} }}} + 2p_{1} p_{2} n_{\text{air}} d_{12}^{2} \sqrt {\frac{{8k_{\text{B}} T}}{{\pi \mu_{12} c^{2} }}} + p_{2}^{2} n_{\text{air}} d_{2}^{2} \sqrt {\frac{{8k_{\text{B}} T}}{{\pi \mu_{2} c^{2} }}} ,$$
(30)

where p1 = 0.7809 and p2 = 0.2095 are the probabilities to find N2 and O2 molecules in the homosphere (0–100 km), respectively; nair is the air molecular number density; d1, d2, and d12 = (d1 + d2)/2 are the effective optical collision diameters in N2–N2, O2–O2, and N2–O2 collisions, respectively; μ1 = m1/2, μ2 = m2/2, and μ12 = m1m2/(m1 + m2) are the reduced masses of colliding molecules in N2–N2, O2–O2, and N2–O2 collisions, respectively. The temperature dependence of the effective optical collision diameters is described by Sutherland’s formula. Considering only binary collisions of molecules, for i atmospheric gas we have [24]:

$$d_{i}^{2} (T) = d_{i,\infty }^{2} \left( {1 + {{\varPhi_{i} } \mathord{\left/ {\vphantom {{\varPhi_{i} } T}} \right. \kern-0pt} T}} \right),$$
(31)

where di,∞ is the effective optical collision diameter at T → ∞, and Φi is the constant having the dimension of temperature (K).

The backscatter profile of i PRR line broadened by both Doppler effect and molecular collisions is described by a Voigt function. The function represents the convolution of Gaussian and Lorentzian functions and can only be calculated numerically [22]. For this reason, pseudo-Voigt functions, the linear combinations of the Gaussian and Lorentzian functions, are often used to approximate the Voigt ones. To estimate the FWHM (in cm–1) of a pseudo-Voigt function, one can apply the approximation formula proposed in [25]:

$$\Delta \widetilde{\nu }_{{i,{\text{V}}}}^{\text{FWHM}} \approx 0.5346\Delta \widetilde{\nu }_{\text{L}}^{\text{FWHM}} + \sqrt {(\Delta \widetilde{\nu }_{{i,{\text{G}}}}^{\text{FWHM}} )^{2} + 0.2166(\Delta \widetilde{\nu }_{\text{L}}^{\text{FWHM}} )^{2} } .$$
(32)

This approximation has zero error for the pure Gaussian function (when \(\Delta \widetilde{\nu }_{\text{L}}^{\text{FWHM}} = 0\)) and an error of about 3.05 × 10–6 cm–1 for the pure Lorentzian function (when \(\Delta \widetilde{\nu }_{{i,{\text{G}}}}^{\text{FWHM}} = 0\)).

Since the Voigt profile shape in its wings is very close to the Lorentzian one and the boundaries of SF passbands are mostly located in the wings of PRR lines (Fig. 3), it is reasonable to use \(L(\widetilde{\nu }_{i} ,\widetilde{\nu },T)\) (28) instead of \(V(\widetilde{\nu }_{i} ,\widetilde{\nu },T)\) for a PRR line shape description [15]. Hence, instead of integrals (21) and (22) we can write:

$$X_{\text{low}}^{i} (\widetilde{\nu }_{i} ,T) = \frac{1}{\pi }\int_{{\widetilde{\nu }_{1} }}^{{\widetilde{\nu }_{2} }} {\frac{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}{{(\widetilde{\nu } - \widetilde{\nu }_{i} )^{2} + \left( {{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}} \right)^{2} }}{\text{d}}\widetilde{\nu }} = \frac{1}{\pi }\left[ {\arctan \left( {\frac{{\widetilde{\nu }_{2} - \widetilde{\nu }_{i} }}{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}} \right) - \arctan \left( {\frac{{\widetilde{\nu }_{1} - \widetilde{\nu }_{i} }}{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}} \right)} \right],$$
(33)
$$X_{\text{high}}^{i} (\widetilde{\nu }_{i} ,T) = \frac{1}{\pi }\left[ {\arctan \left( {\frac{{\widetilde{\nu }_{4} - \widetilde{\nu }_{i} }}{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}} \right) - \arctan \left( {\frac{{\widetilde{\nu }_{3} - \widetilde{\nu }_{i} }}{{{{\gamma_{\text{L}} } \mathord{\left/ {\vphantom {{\gamma_{\text{L}} } {(2\pi c)}}} \right. \kern-0pt} {(2\pi c)}}}}} \right)} \right],$$
(34)

which can be easily calculated. Nevertheless, when calculating \(X_{\text{low}}^{i}\) and \(X_{\text{high}}^{i}\), we use the HWHM \({{\Delta \widetilde{\nu }_{{i,{\text{V}}}}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{{i,{\text{V}}}}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}\) of the pseudo-Voigt function instead of the Lorentzian HWHM γL/(2πc). This allows more accurate taking into account the contribution of broadened PRR lines to both lidar channels.

3.3 PRR line intensity

The intensity Ii(Ji, T) of an individual PRR line can be expressed as [26]:

$$I_{i} (J_{i} ,T) = PL\beta_{\pi ,i} (J_{i} ,T) = PLn_{i} \sigma_{\pi ,i} (J_{i} ,T) = PLn_{\text{air}} p_{i} \sigma_{\pi ,i} (J_{i} ,T),$$
(35)

where P is the average power of the incident laser beam, L is the length of the scattering volume, βπ,i(Ji, T) is the backscatter cross-section (atmospheric backscatter coefficient), σπ,i(Ji, T) is the differential backscatter cross-section, and ni = pi·nair is the partial number density of air component i. Substituting Eqs. (33), (34), and (35) into Eq. (17), we obtain for the intensity ratio:

$$Q(T) = \frac{{\sum\limits_{{J_{i} }} {\sum\limits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {\left\{ {p_{i} \sigma_{\pi ,i} (J_{i} ,T)\left[ {\arctan \left( {\frac{{\widetilde{\nu }_{2} - \widetilde{\nu }_{i} }}{{{{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}}}} \right) - \arctan \left( {\frac{{\widetilde{\nu }_{1} - \widetilde{\nu }_{i} }}{{{{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}}}} \right)} \right]} \right\}} } }}{{\sum\limits_{{J_{i} }} {\sum\limits_{{i = {\text{N}}_{ 2} , {\text{O}}_{ 2} }} {\left\{ {p_{i} \sigma_{\pi ,i} (J_{i} ,T)\left[ {\arctan \left( {\frac{{\widetilde{\nu }_{4} - \widetilde{\nu }_{i} }}{{{{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}}}} \right) - \arctan \left( {\frac{{\widetilde{\nu }_{3} - \widetilde{\nu }_{i} }}{{{{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}}}} \right)} \right]} \right\}} } }}.$$
(36)

4 Initial conditions for the simulation

Let us consider five sets of SF passbands that are mostly used in PRR lidar receiving systems (Fig. 4). We assume that a SF has a narrow passband if its width \(\Delta \widetilde{{{\nu }}} \le 25\) cm–1 and a wide one at \(\Delta \widetilde{{{\nu }}} > 40\) cm–1. The first set consists of SFs with wide passbands in both channels Jlow and Jhigh [27,28,29,30]. The second set has a SF with a narrow passband in channel Jlow and a SF with a wide passband in channel Jhigh [13, 23, 31, 32]. The third and fourth sets consist of SFs with equal narrow passbands in both PRR channels [5, 8, 16, 33], but with different central wavenumber positions in channel Jhigh. The fifth set has two SFs with very narrow passbands provided by a Fabry–Perot interferometer [18]. The SF passbands \((\widetilde{\nu }_{1} ;\widetilde{\nu }_{2} )\) and \((\widetilde{\nu }_{3} ;\widetilde{\nu }_{4} )\) of channels Jlow and Jhigh, their widths \(\Delta \widetilde{\nu }\), and N2 and O2 PRR lines (with J) falling into the channels are shown in Table 1.

Fig. 4
figure 4

ae Sets 1–5 of SF passbands \((\widetilde{\nu }_{1} ;\widetilde{\nu }_{2} )\) and \((\widetilde{\nu }_{3} ;\widetilde{\nu }_{4} )\) with equal transmission function F = 1 in lidar channels Jlow and Jhigh, respectively. The index over a spectral line denotes the rotational quantum number J of the initial state of the transition. N2 and O2 PRR spectrum corresponds to T = 280 K

Table 1 Parameters of spectral filter passbands (SFPs)

A narrow-linewidth (~ 0.001 cm–1) laser operating at wavelengths of 354.67 and 532 nm is considered as a PRR lidar transmitter in the simulation. Such a linewidth can be ignored compared to the widths of PRR lines broadened by both Doppler effect and molecular collisions (Fig. 2). We take into account the contribution from 56 N2 and O2 PRR lines to the signals detected in channels Jlow and Jhigh. These lines include the first 34 N2 lines with J = 0, 1, …, 16 (Stokes branch) and J = 2, 3, …, 18 (anti-Stokes branch) and 22 O2 lines with J = 1, 3, …, 21 and J = 3, 5, …, 23 in the Stokes and anti-Stokes branches, respectively. The values of the constants required for calculating the FWHMs and/or HWHMs of the Gaussian (25), Lorentzian (30), and pseudo-Voigt (32) profiles are the following: mair = 4.81 × 10–26 kg, m1 = 4.65 × 10–26 kg, m2 = 5.31 × 10–26 kg, d1,∞ = 3.51 × 10–10 m, d2,∞ = 3.52 × 10–10 m, d12,∞ ≈ 3.515 × 10–10 m [34], Φ1 = 105 K (for N2–N2 collisions), Φ2 = 125 K (for O2–O2 collisions), and Φ12 = 115 K (for N2–O2 collisions) [24]. The tropospheric (0–11 km) temperature profile of the U.S. Standard Atmosphere (1976) [35] is used as a reference profile (Fig. 5) to set the dependence of temperature on altitude in Eqs. (24), (30)–(36) and to determine the calibration coefficients of TRFs 1–9 using the least squares method. The formulas and all the necessary constants for calculating the cross-sections σπ,i(Ji, T) and intensity ratio (36) can be found in [4].

Fig. 5
figure 5

Tropospheric temperature profile of the U.S. Standard Atmosphere (1976)

5 Simulation results for λ = 354.67 nm

The simulation results for a laser wavelength of 354.67 nm are shown in Figs. 6, 7, 8, 9, 10 and 11. Figure 6 presents the dependences of tropospheric temperature on the intensity ratios Q354.67 calculated for five sets of SF passbands (Fig. 4, Table 1) using Eq. (36) under the conditions described above. To determine the best TRF for each of SFP sets 1–5, we intercompare temperature errors produced using TRFs 1–9. A temperature error (or calibration error) ΔT is defined as the difference between an ideal model profile and a temperature profile T = T(Q) retrieved from a simulated intensity ratio Q by one of the TRFs. The function that retrieves temperature with the smallest maximum calibration error in the troposphere will be considered as the best TRF. The calibration errors ΔT354.67 for each of SFP sets 1–5 are intercompared in Figs. 7, 8, 9, 10, 11, respectively.

Fig. 6
figure 6

Tropospheric temperature vs intensity ratios Q354.67 calculated for five sets of SF passbands (Fig. 4)

Fig. 7
figure 7

Calibration errors ΔT354.67 produced using TRFs 1–9 for SFP set 1 at λ = 354.67 nm

Fig. 8
figure 8

Calibration errors ΔT354.67 produced using TRFs 1–9 for SFP set 2 at λ = 354.67 nm

Fig. 9
figure 9

Calibration errors ΔT354.67 produced using TRFs 1–9 for SFP set 3 at λ = 354.67 nm

Fig. 10
figure 10

Calibration errors ΔT354.67 produced using TRFs 1–9 for SFP set 4 at λ = 354.67 nm

Fig. 11
figure 11

Calibration errors ΔT354.67 produced using TRFs 1–9 for SFP set 5 at λ = 354.67 nm

A comparative analysis of ΔT354.67 shows that in all five cases of SFP sets there are TRFs, the use of which is preferable for temperature retrievals and leads to smaller errors than that of the other functions. TRF 3 gives the smallest errors of all three-coefficient TRFs 1–6 for SFP sets 1, 2, and 4 (Figs. 7b, 8b, and 10b), whereas TRF 1 and TRF 4 are the best three-coefficient functions for SFP sets 3 and 5 (Figs. 9b and 11b), respectively. Four-coefficient TRF 7 gives the smallest errors of all TRFs 1–9 for SFP sets 1, 2, and 4 (Figs. 7c, 8c, and 10c), whereas TRF 9 is the best function for SFP sets 3 and 5 (Figs. 9c and 11c). In total, as seen in Figs. 7, 8, 9, 10, 11, the use of TRFs 3, 7, and 9 for temperature retrievals consistently leads to small calibration errors regardless of the SFP set. Namely, |ΔT354.67| does not exceed the value of 3 × 10–3 K when using TRF 3, while |ΔT354.67| < 4 × 10–4 K for both TRFs 7 and 9 (Fig. 12). Note that the use of TRF 6 yields comparatively high calibration errors for all sets of SF passbands, especially for SFP set 5. This can be explained by the fact that the CF corresponding to TRF 6 does not satisfy the selection rules introduced in [17] for special cases of the GCF (7) and it was considered as an exception. Recall that this nonlinear CF cannot be reduced to the linear CF (2) when excluding from consideration all nonlinear effects caused by PRR lines broadening.

Fig. 12
figure 12

Calibration errors ΔT354.67 produced using TRFs 3, 7, and 9 for SFP sets 1–5 at λ = 354.67 nm

As follows from Fig. 12b, TRF 7 more accurately takes into account the contribution from broadened PRR lines to the signals detected in both lidar channels for four out of five SFP sets (excluding SFP set 3). Hence, TRF 7 can be considered as the function most suited for tropospheric temperature retrievals for any set of SF passband widths (for λ = 354.67 nm). This is in agreement with the results obtained in [17] for temperature measurements using a PRR lidar at λ = 354.67 nm.

6 Simulation results for λ = 532 nm

The simulation for λ = 532 nm shows similar results as for λ = 354.67 nm. Figure 13 presents the differences ΔQ = Q354.67Q532 between the intensity ratios Q354.67 (Fig. 6) and Q532 (calculated at λ = 532 nm) for each of SFP sets 1–5. For example, the value of Q354.67 varies between 3.5 and 7.5 for SFP set 5, while the corresponding |ΔQ| < 2.5 × 10–4. Such small values of ΔQ are explained by the following: as mentioned in Sects. 2.1 and 3.2, the molecular collision effect dominates over the Doppler one in the troposphere (Fig. 2) and the homogeneous collisional broadening of PRR lines does not depend on wavelength (see Eq. (30)). Therefore, the PRR line broadening provides almost the same contribution to channels Jlow and Jhigh for the lidar wavelengths of 354.67 and 532 nm. This leads to close values of Q354.67 and Q532.

Fig. 13
figure 13

Tropospheric temperature vs the differences between the intensity ratios Q354.67 and Q532 calculated for each of SFP sets 1–5

For space considerations, the intercomparisons of calibration errors ΔT532 produced using TRFs 1–9 for each of SFP sets 1–5 at λ = 532 nm along with the differences ΔT354.67 − ΔT532 can be found in the electronic supplementary material. The errors ΔT532 produced using TRFs 3, 7, and 9 and the differences ΔT354.67 − ΔT532 (Fig. 14) confirm the conclusion drawn for ΔT354.67 (Fig. 12), i.e., TRF 7 is most suited for tropospheric temperature retrievals for any set of SF passband widths (for λ = 532 nm). An intercomparison of ΔT354.67 and ΔT532 for TRFs 3, 7, and 9 (Figs. 12 and 14a–c) does not allow us to unambiguously determine the best option from the considered sets of SF passband widths. However, the following can be stated: the use of SFP sets 1 and 3 results in larger calibration errors than that of SFP sets 2, 4, and 5.

Fig. 14
figure 14

ac Calibration errors ΔT532 produced using TRFs 3, 7, and 9 for SFP sets 1–5 at λ = 532 nm. df differences ΔT354.67 – ΔT532

7 Summary

In this study, we have calculated the ratios Q(T) of backscattered signal intensities from two PRR spectrum bands taking into account the collisional broadening of N2 and O2 PRR lines. The line broadening due to the Doppler effect was also considered using the HWHM \({{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } \mathord{\left/ {\vphantom {{\Delta \widetilde{\nu }_{i,V}^{\text{FWHM}} } 2}} \right. \kern-0pt} 2}\) of the pseudo-Voigt function instead of the Lorentzian HWHM γL/(2πc) in Eq. (36). The numerical simulation was performed for five frequently used sets of SF passband widths (Fig. 4, Table 1) and two laser wavelengths (354.67 and 532 nm). To obtain temperature profiles from the calculated ratios Q(T), we applied nine nonlinear CFs (TRFs) in the temperature retrieval algorithm of the traditional PRR lidar technique.

The comparative analysis of the calibration errors ΔT354.67 and ΔT532, produced using TRFs 1–9, and differences ΔT354.67 – ΔT532 for each of SFP sets 1–5 allow us to reveal the following:

  • TRF 3 is the best function for tropospheric temperature retrievals of all three-coefficient TRFs 1–6 for SFP sets 1, 2, and 4, while TRF 1 and TRF 4 are the best ones for SFP sets 3 and 5, respectively;

  • four-coefficient TRF 7 gives the smallest errors of all TRFs 1–9 for SFP sets 1, 2, and 4, whereas TRF 9 is the best function for SFP sets 3 and 5;

  • when calculating the intensity ratios Q(T), TRF 7 more accurately takes into account the contribution from broadened N2 and O2 PRR lines for four out of five SFP sets (excluding SFP set 3) at λ of 354.67 and 532 nm;

  • the use of SFP sets 1 and 3 leads to larger calibration errors than that of SFP sets 2, 4, and 5;

  • For two wavelengths (354.67 and 532 nm) and SFP sets 1–5, the absolute error |ΔT| does not exceed the value of 3 × 10–3 K when using TRF 3, while |ΔT| < 4 × 10–4 K for both TRFs 7 and 9 (Figs. 12 and 14a–c).

In total, TRFs 3, 7, and 9 can be recommended for use in the temperature retrieval algorithm regardless of the lidar wavelength and set of SF passbands.

Despite two new techniques [18, 19] do not require lidar calibration with external temperature measurement instruments and have several advantages over the traditional technique, the accuracy of temperature retrievals can be increased by selecting TRFs 3, 9, or better TRF 7. Therefore, if one wants to measure tropospheric temperature with an accuracy as high as possible, the traditional PRR lidar technique is the most preferable.