1 Introduction

The relaxed eddy accumulation (REA) method, proposed by Businger and Oncley (1990), is a simplification of the Eddy Accumulation Method conceived by Desjardins (1972, 1977). The most important feature of the REA method from the experimental point of view is that it does not require a fast-response instrument to measure the scalar concentration s whose turbulent flux is wanted. Instead, the sign of the vertical velocity w is used in real time to switch a valve drawing air at a constant flow rate into two different reservoirs. At the end of a block of measurement, the mean concentration of the scalar can be measured by a slow-response sensor in each of the reservoirs. In this work, note that the REA method is actually simulated using fast-response sensors.

The REA method has gained wide use to measure surface fluxes of substances for which fast-response gas analyzers are either non-existent or impractical. In that capacity, it has been reported to measure isoprene (Bowling et al. 1998), ammonia (Zhu et al. 2000), terpenoid (Mochizuki et al. 2014) and ethene, propene, buthene and isoprene (Rhew et al. 2017) fluxes, to cite but a few.

The REA predicts the scalar turbulent flux from:

$$\begin{aligned} \overline{w's'} = \beta _s\sigma _w(\overline{s^+} - \overline{s^-}), \end{aligned}$$
(1)

where \(\overline{s^+} = \overline{[s|w > 0]}\) and \(\overline{s^-} = \overline{[s|w < 0]}\) are the conditional means of s on the sign of the vertical velocity w (under the assumption that \(\overline{w}=0\), so that \(w = w'\)), and the overbars and primes are standard notation for Reynolds’ decomposition into a mean and the fluctuation around it. In the present work all means are taken over 30-min blocks. For conciseness, we denote \(\overline{s^+} - \overline{s^-}\) by \(\Delta \overline{s}\).

From its inception, it has been recognized that under validity of Monin–Obukhov similarity theory (MOST), \(\beta _s\) should be a function of Obukhov’s stability variable \(\zeta \) (Businger and Oncley 1990); several studies have found \(\beta _s \approx 0.6\) with a modest variation of \(\approx 10\)% over a wide range of stabilities when the measurements are made in the inertial sublayer of the atmospheric surface layer (see, for example, Businger and Oncley 1990; Baker et al. 1992; Katul et al. 2018).

Over forests, an important issue is to estimate the fluxes from the canopy to the atmosphere of volatile organic compounds (VOCs) such as isoprene; this is particularly critical in the Amazon, where secondary organic aerosols have a significant role in the formation of cloud condensation nuclei (CCNs) (Fuentes et al. 2016). Here, due to the aforementioned difficulty of measuring VOC concentrations with fast-response instruments, the REA method can have a significant impact on closing the knowledge gap on VOC emission rates from the forest. It is also noteworthy that new, better and cheaper technologies for in-situ analysis of \(\Delta {\overline{s}}\) are emerging that leverage the applicability of the REA method (Sarkar et al. 2020).

Invariably, REA measurements require a sonic anemometer measuring w at high frequency to control in real time the valve switching the flow of air into two reservoirs for the measurement of \(\overline{s^+}\) and \(\overline{s^-}\) at the end of an averaging period. This means that simultaneous measurements of sonic temperature \(\theta \) are available, allowing standard eddy covariance (EC) measurements of \(\overline{w'\theta '}\). This in turn means that, for each block, \(\beta _\theta \) can be calculated from (1) with \(s = \theta \). Therefore, if the scalar s of interest is perfectly correlated to temperature, \(\beta _s\) may be allowed to vary from block to block by setting \(\beta _s = \beta _\theta \) for each block. This strategy, which we call “REA-T ” (where “T ” stands for auxiliary sonic temperature measurements) appears to have originated with Bowling et al. (1998), and is in wide use (Ren et al. 2011; Mochizuki et al. 2014; Rhew et al. 2017; Sarkar et al. 2020, etc.). Alternatively, of course, one can still adopt a single value of \(\beta _s\) (which we call “REA-S”, where “S” stands for “single value”) based on measurements of s with eddy covariance (using instrumentation with adequate response time) or again an assumption of similar behavior with more easily measured quantities.

Either way, the REA invokes (i) an assumption of similarity between scalars, or (ii) the validity of MOST for the scalar of interest or (iii) at least the constancy of \(\beta _s\) even if MOST does not apply. Strictly speaking, it is known that if MOST is valid for any pair of scalars, their similarity functions must be the same, and their correlation must be perfect (Hill 1989; Dias and Brutsaert 1996; Dias 2013). Most of the time, therefore, either (i) or (ii) seems to be warranted for measurements made in the inertial sublayer of the atmospheric boundary layer, where MOST is assumed to hold over homogeneous surfaces with sufficient fetch.

However, even under these conditions, recently evidence has emerged that MOST may not be universally valid for all scalars, but rather that it appears to depend on the equilibrium between gradient production and dissipation of scalar semivariance: the presence of relatively large values of other terms in the scalar semivariance budget, such as the storage or the transport terms, can seriously disrupt MOST conformity for the scalar in question (Zahn et al. 2023). Interestingly, in the same study Zahn et al. (2023) have found that the REA method is much superior to the variance method (a classical MOST indirect method to estimate scalar fluxes) even when equilibrium between gradient production and molecular dissipation does not hold for the scalar, which suggests that in this case (iii) may be valid at least on average.

In the roughness sublayer (RSL) over a forest there is strong evidence of large departures of scalar behavior from MOST (Dias et al. 2009; Zahn et al. 2016b; Chor et al. 2017). Zahn et al. (2016b) found large departures for all 3 scalars (temperature, \({\textrm{H}_2\textrm{O}}\) and \({\textrm{CO}_2}\)) measured at the ATTO (Amazon Tall Tower Observatory) site in Central Amazonia (see description below), but noted that scalar similarity improved significantly for small zenith angles. They also found considerable scatter in the \(\beta _s\) values, which again was reduced for small zenith angles, which happen in the middle of the day, when the scalar fluxes tend to be largest in absolute value. In order to be concise, in this work we call fluxes which are large in absolute value “large-magnitude” fluxes.

Chor et al. (2017) equally found strong dissimilar behavior for the same scalars, as well as wide scatter in their Monin–Obukhov integral similarity functions. This casts doubt on the applicability of the REA method in any of the two forms (\(\beta _s\) constant or \(\beta _s = \beta _\theta \) for each block) mentioned above, while at the same time is at odds with the recent findings of Zahn et al. (2023) of good REA performance, although the latter were obtained for a lake, not a forest.

Using a large dataset recently measured at the ATTO site, this work therefore has the objective to clarify some of the issues mentioned above: the main point of investigation is to try to elucidate how \(\beta _s\) can have significant variability as disclosed by Zahn et al. (2016b) while at the same time the REA method can have a very good performance as found by Zahn et al. (2023). In particular, we seek to answer the following questions:

  1. 1.

    How close to constant is \(\beta _s\) (i.e. what its typical scatter is) in comparison with the results by Zahn et al. (2016b)?

  2. 2.

    To what extent do the dimensionless vertical velocity and scalar standard deviations follow MOST in the roughness sublayer? These functions have been shown by Zahn et al. (2023) to jointly explain the near-constancy of \(\beta _s\) under MOST.

  3. 3.

    How good are block-by-block REA flux estimates in the Amazonian roughness sublayer in comparison to eddy-covariance measurements?

  4. 4.

    Since block-by-block errors may be reduced by averaging if they are not strongly biased, how good is the REA method (in comparison with eddy-covariance) for “long-term” (of the order of many days to a whole season), “mean” flux estimates?

The relevant relationships among the quantities of interest in this work are reviewed in Sect. 2; the ATTO site and details of data processing are given in Sect. 3; results for similarity functions are analyzed in Sect. 4, and for the actual prediction of fluxes in Sect. 5. Discussion and conclusions are given in Sect. 6.

2 Methods

Since the “variance method” has shown to be a good indicator of the breakdown of MOST in the RSL (Dias et al. 2009; Zahn et al. 2016b; Chor et al. 2017; Dias-Júnior et al. 2019), and even in a classical inertial sublayer (Zahn et al. 2023), we test its standard form:

$$\begin{aligned} \frac{\sigma _s}{s_*} = \phi _{\sigma _s}(\zeta ), \end{aligned}$$
(2)

with:

$$\begin{aligned} u_*^2&= -\overline{u'w'}, \end{aligned}$$
(3)
$$\begin{aligned} \zeta&= - \frac{\kappa g (z - d_0) \theta _*}{\overline{\theta }u_*^2}, \end{aligned}$$
(4)
$$\begin{aligned} u_*s_*&= \overline{w's'}, \end{aligned}$$
(5)
$$\begin{aligned} \sigma _s&= \sqrt{\overline{s's'}}, \end{aligned}$$
(6)

where u is the longitudinal velocity, z is the measurement height, and \(d_0\) is the zero-plane displacement height.

In the case of the REA method, it can be readily seen by rearranging (1) that (see Zahn et al. 2023, Eq. (17)):

$$\begin{aligned} \frac{1}{\beta _s} = \frac{\sigma _w \Delta \overline{s}}{\overline{w's'}} = \frac{\sigma _w}{u_*}\,\frac{\Delta \overline{s}}{s_*} = \phi _{\sigma _w}(\zeta )\phi _{\Delta \overline{s}}(\zeta ), \end{aligned}$$
(7)

where \({\sigma _w}/{u_*} = \phi _{\sigma _w}(\zeta )\) and \({\Delta \overline{s}}/{s_*} = \phi _{\Delta \overline{s}}(\zeta )\), and that \(\beta _s\) is a MOST function under ideal conditions.

Regarding (1), (2) and (7), two points are concerned:

  1. 1.

    Experimentally, “to be or not to be” a funcion of \(\zeta \) is largely a matter of assessing the goodness-of-fit of data to a proposed model. Although this is seldom—if at all—done in practice for MOST functions (to the best of our knowledge), it is possible to quantify such goodness-of-fit by standard statistical indices. Therefore, in this study we compare quantitatively \(\phi _{\sigma _w}\), \(\phi _{\sigma _s}\) and \(\phi _{\Delta \overline{s}}\) as described in the sequence.

  2. 2.

    As we will see in Sect. 4, the quality of the \(\phi (\zeta )\)s or even the variability of \(\beta _s\) do not translate directly to the quality of the estimated fluxes \(\overline{w's'}\) over different time scales (block-by-block or longer term). Therefore, a second quantitative assessment must be made of the quality of the estimated fluxes themselves.

Besides graphical comparisons, we compare predicted (y) to observed (x) values using standard statistics: coefficient of correlation r, coefficient of determination \(C_d\), BIAS, mean absolute error MAE, root mean square error RMSE, their normalized versions NBIAS, NMAE, NRMSE, and Willmott’s refined index of model performance \(d_r\) (Willmott et al. 2012). An important issue here is that small magnitudes of predicted and observed fluxes tend to be masked both graphically in traditional “\(x \times y\)” plots and statistically when directly quantified, say, by RMSE or MAE. For this reason, the normalized versions are better to discern “relative” errors. For the sake of completeness, these quantities are:

$$\begin{aligned} r&= \frac{{{\,\textrm{Cov}\,}}(x,y)}{\sigma _x\sigma _y},&C_d&= 1 - \frac{{{\,\textrm{MSE}\,}}}{\sigma _x^2}, \end{aligned}$$
(8)
$$\begin{aligned} {{\,\textrm{BIAS}\,}}&= \frac{1}{n}\sum _{i=1}^n \left( y_i - x_i\right) ,&{{\,\textrm{NBIAS}\,}}&= \frac{{{\,\textrm{BIAS}\,}}}{|\overline{x}|},\end{aligned}$$
(9)
$$\begin{aligned} {{\,\textrm{MAE}\,}}&= \frac{1}{n}\sum _{i=1}^n \left| y_i - x_i\right| ,&{{\,\textrm{NMAE}\,}}&= \frac{{{\,\textrm{MAE}\,}}}{|\overline{x}|}, \end{aligned}$$
(10)
$$\begin{aligned} {{\,\textrm{RMSE}\,}}&= \left[ \frac{1}{n}\sum _{i=1}^n \left( y_i - x_i\right) ^2 \right] ^{1/2},&{{\,\textrm{NRMSE}\,}}&=\frac{{{\,\textrm{RMSE}\,}}}{|\overline{x}|}, \end{aligned}$$
(11)

where \({{\,\textrm{Cov}\,}}(x,y)\) is the covariance between predicted and observed values, \(\sigma _x\) and \(\sigma _y\) are their standard deviations, \(\overline{x}\) is the mean of the observed values, and \({{\,\textrm{MSE}\,}}= {{\,\textrm{RMSE}\,}}^2\) is the mean square error. Note that if all predicted values are equal, r makes no sense because \(\sigma _y = 0\). Willmott’s refined index of performance is:

$$\begin{aligned} d_r = {\left\{ \begin{array}{ll} 1 - \frac{\sum _{i=1}^n \left| y_i - x_i\right| }{2\sum _{i=1}^n \left| x_i - \overline{x}\right| }, &{} \sum _{i=1}^n \left| y_i - x_i\right| \le 2 \sum _{i=1}^n \left| x_i - \overline{x}\right| , \\ \frac{2 \sum _{i=1}^n \left| x_i - \overline{x}\right| }{\sum _{i=1}^n \left| y_i - x_i\right| } - 1, &{} \sum _{i=1}^n \left| y_i - x_i\right| > 2 \sum _{i=1}^n \left| x_i - \overline{x}\right| . \end{array}\right. } \end{aligned}$$
(12)

3 Experimental Site and Data

In this work we used data collected at a micrometeorological tower built up at the experimental ATTO (Amazon Tall Tower Observatory) site, in Central Amazon, in a terra firme forest, approximately 150 km northeast from the city of Manaus—AM, Brazil. At the site, there are two towers that are situated on an extensive plateau area (130 m above sea level) immersed in a large primary forest (see Fig. 1). The vegetation is typical of undisturbed terra firme forest. The average height of the vegetation is approximately 30–40 m and the leaf area index is around 5–6 \({\textrm{m}^{2}\,\textrm{m}^{-2}}\). The predominant wind is from the northeast (Andreae et al. 2015; Santana et al. 2018).

Fig. 1
figure 1

Overall location of the ATTO site

There are two towers at the ATTO site, known as the (main) ATTO tower (02\(^\circ \) 08\(^\prime \) 45\(^{\prime \prime }\) S, 59\(^\circ \) 00\(^\prime \) 20\(^{\prime \prime }\) W) with a height of 325 m and the “Instant” tower (02\(^\circ \) 08\(^\prime \) 39\(^{\prime \prime }\) S, 59\(^\circ \) 00\(^\prime \) 00\(^{\prime \prime }\) W) with a height of 81 m (Dias-Júnior et al. 2019). The experimental setup is shown in Fig. 2. The redundant data used in this work were collected during the months of August, September, and October 2021. These months correspond to the dry season in the Amazon region. The data were measured at 10 Hz by a sonic anemometer (model CSAT3B, Campbell Scientific, Inc) and an open-path gas analyzer (model LICOR 7500A LI-COR Inc), both installed 50 m above the ground at the Instant tower.

Fig. 2
figure 2

Instrument setup of the ATTO site. In this work, only the 50-m data from the Instant Tower were used

The dataset was divided in 3959 30-min data blocks. A quality control analysis similar to the one described in Zahn et al. (2016a) was applied to uvw (velocity components of the sonic anemometer), \(\theta \) (sonic temperature), \(\omega _q\) (\({\textrm{H}_2\textrm{O}}\) molar density) and \(\omega _c\) (\({\textrm{CO}_2}\) molar density) for each block. The procedure divides a block with n data points (\(n = 18{,}000\) in our case) into \(n_s = n/m\) sub-blocks with m points each. One-minute sub-blocks, \(m = 600\), were used. Before quality control, missing or erroneous data were flagged as NANs; then, for each sub-block, the median \({\widetilde{x}}_k\) and the mean absolute deviation (\({{\,\textrm{MAD}\,}}_k\)) around it,

$$\begin{aligned} {{\,\textrm{MAD}\,}}_k =\frac{1}{m} \sum _{i=0}^{m-1}\left| x_{km+i}-{\widetilde{x}}_k\right| \end{aligned}$$
(13)

were evaluated. The sub-block index, k, runs from 0 to \(n_s-1\). For each sub-block:

  • A spike is identified each time \(\left| x_{km+i} - {\widetilde{x}}_k \right| > 5\,{{\,\textrm{MAD}\,}}_k\);

  • a locking condition (i.e. \(x_{i}\) varies too little over a sub-block) is identified if \(\max _k({{\,\textrm{MAD}\,}}_k) < 0.01\) (\(\textrm{K}\) for \(\theta \), \({\textrm{m}\,\textrm{s}^{-1}}\) for u, v and w and \({\textrm{mmol}\,\textrm{m}^{-3}}\) for \(\omega _q\) and \(\omega _c\));

  • a non-stationary condition is identified if the difference between the maximum and minimum sub-block medians is larger than \(Q_x\), where \(Q_{u,v}= 5\) \({\mathrm{m\,s}^{-1}}\), \(Q_w\)= 3\( {\mathrm{m\,s}^{-1}}\), \(Q_{\theta }= 5\) \(\textrm{K}\), \(Q_{\omega _q}= 300\, {\mathrm{mmol\,m}^{-3}}\) and \(Q_{\omega _c}= 10\) \({\mathrm{mmol\,m}^{-3}}\).

When all the above conditions were met, the following criteria were applied sequentially:

  1. 1.

    If the number of values equal to NAN in the block was more than 1% of the block size, all \(x_i\) were set to NAN and the block was effectively discarded;

  2. 2.

    All spikes were flagged as NAN. After that, if the number of values equal to NAN in the block was more than 1% of the block size, all \(x_i\) were set to NAN and the block was effectively discarded;

  3. 3.

    If a locking condition and/or a non-stationary condition were identified, all the half-hour block was set to NAN and again discarded from the analysis;

  4. 4.

    If the block was not discarded, all runs of NANs in x were linearly interpolated from the valid extremities.

After conducting the quality control analysis, we obtained 1146 blocks of 30 min each for \(\omega _c\), 1163 blocks for \(\omega _q\), 1307 blocks for \(\theta \), uv, and w. In addition, we computed the half-hourly mean values of turbulence statistics by applying two coordinate rotations (McMillen 1988) to ensure that the average lateral and vertical velocities were zero. For all subsequent analyses the \({\textrm{H}_2\textrm{O}}\) and \({\textrm{CO}_2}\) molar densities \(\omega _q\) and \(\omega _c\) were converted to instantaneous mass concentrations using the pressure sensor from the LI7500 (see for instance Edson et al. 2011). In this work they are reported as q in \({\mathrm{g\,kg}^{-1}}\) and c in \({\mathrm{mg\,kg}^{-1}}\) respectively. Note that covariances calculated with mass concentrations dispense with the WPL density corrections (Webb et al. 1980).

4 Results for Similarity Functions

4.1 Similarity Functions for the Classical Standard Deviation Statistics

Standard inertial sublayer (ISL) predictions for \(\phi _{\sigma _w}\) and \(\phi _{\sigma _s}\) are (Chor et al. 2017):

$$\begin{aligned}{} & {} \phi _{\sigma _w}(\zeta ) = {\left\{ \begin{array}{ll} 1.25(1 - 3\zeta )^{1/3}, &{} \zeta < 0, \\ 1.25, &{} \zeta \ge 0, \end{array}\right. } \end{aligned}$$
(14)
$$\begin{aligned}{} & {} \phi _{\sigma _s}(\zeta ) = {\left\{ \begin{array}{ll} 2(1 - 9.5\zeta )^{-1/3}, &{} \zeta < 0, \\ 2, &{} \zeta \ge 0. \end{array}\right. } \end{aligned}$$
(15)

When calculating the statistics \(\sigma _s/s_*\), we further filtered the data with sign restrictions for each scalar and stability condition. Thus, in the case of \(\theta \), we only use \(\Delta \overline{\theta } < 0\) for stable conditions (\(\theta _* < 0\)) and \(\Delta \overline{\theta } > 0\) for unstable conditions (\(\theta _* > 0\)); for q, we only use positive values of \(\overline{w'q'}\) and only positive values of both \(\Delta \overline{q}\) and \(q_*\) for both stable and unstable conditions; and in the case of c we only use \(\Delta \overline{c} > 0\) for stable conditions, with \(c_* >0\), and \(\Delta \overline{c} < 0\) for unstable conditions, with \(c_* <0\). The further restrictions on the signs of \(\Delta \overline{s}\) were imposed so that exactly the same data sets were used for both similarity functions \(\phi _{\sigma _s}\) and \(\phi _{\Delta {\overline{s}}}\), and therefore corresponding statistics and figures can be compared.

Figure 3 shows the statistics \(\sigma _w/u_*\), and \(\sigma _s/s_*\) for \(s = \theta \), \(s = q\) and \(s = c\) for stable and unstable conditions, respectively. The findings of Zahn et al. (2016b) and Chor et al. (2017) for the ATTO site (but generally valid for the roughness sublayer over forests) are confirmed: overall, there is an excess of scalar variance in comparison to flux (actually \(s_*\)), giving strong indication that gradient production of \(\overline{s's'}/2\) often is far less than scalar molecular dissipation (see Zahn et al. 2023); however, a detailed analysis of the semivariance scalar budgets falls outside of our scope. The \(\sigma _w/u_*\) statistics are somewhat less “well-behaved” than those obtained by Zahn et al. (2016b) and Chor et al. (2017) for the same site: under unstable conditions, there is a clear tendency for many blocks to display less variance in comparison to flux (actually \(u_*\)), suggesting that now a part of the gradient and buoyant production of TKE (turbulence kinetic energy) is being “exported” rather than dissipated locally. This is indicative of a negative transport of TKE in the RSL above the canopy (see Fig. 1b of Chamecki et al. 2020, and Mortarini et al. 2023).

Fig. 3
figure 3

Dimensionless values of \(\sigma _w/u_*\) (a, b) and \(\sigma _s/s_*\) for \(s = \theta \) (c, d), \(s = q\) (e, f) and \(s = c\) (g, h), for stable (a, c, e, g) and unstable (b, d, f, h) conditions. The blue line is the standard inertial sublayer (ISL) prediction and the vermillion line is a least-squares (LSQ) fit

Besides the standard ISL versions of \(\phi _{\sigma }(\zeta )\) shown in blue, we also obtained least-squares (using the Levenberg–Marquardt algorithm) estimates depicted in vermillion in Fig. 3 with the general forms:

$$\begin{aligned} \phi _{\sigma _w}(\zeta )&= a_w(1 - b_w\zeta )^{1/3}, \qquad \zeta < 0, \end{aligned}$$
(16)
$$\begin{aligned} \phi _{\sigma _w}(\zeta )&= c_w, \qquad \qquad \qquad \qquad \quad \zeta \ge 0, \end{aligned}$$
(17)
$$\begin{aligned} \phi _{\sigma _s}(\zeta )&= a_s(1 - b_s\zeta )^{-1/3}, \qquad \zeta < 0, \end{aligned}$$
(18)
$$\begin{aligned} \phi _{\sigma _s}(\zeta )&= c_s, \quad \qquad \qquad \qquad \qquad \,\, \zeta \ge 0. \end{aligned}$$
(19)

The Levenberg–Marquard estimates were obtained with the built-in functionality available in the Gnuplot plotting program (www.gnuplot.info). The values of the fitted coefficients are given in Table 1. As it can be seen both visually and from the coefficients in the table, the fitted (LSQ) versions differ significantly from their ISL counterparts, but by definition they still provide the “best fits”. Note that the large excursions of a relatively small percentage of points can have a significant impact on the estimates of a, b and c; however, there is no clear cutoff point nor are there visible outliers: for this reason, they are all retained. These coefficients should be considered specific for this dataset. They are used here only because they provide the best statistics for the dimensionless standard deviations, and should not be considered in any way representative of RSL statistics.

Table 1 Least-squares estimates of coefficients for \(\phi _{\sigma _w}(\zeta )\) and \(\phi _{\sigma _s}(\zeta )\) for \(s = \theta \), q, c

Table 2 shows the performance statistics obtained for the LSQ versions (not the ILS versions) of the similarity functions \(\phi _{\sigma _w}\) and \(\phi _{\sigma _s}\). Note that these funcions are all dimensionless, so there is no need to calculate normalized performance statistics. The statistics confirm the visual impression that, particularly for the scalars, the dimensionless standard deviations perform rather poorly, with generally small correlations between data points and predicted \(\phi _{\sigma _s}\)s, and large values of MAE and RSME (for dimensionless functions, a good outcome would be MAE, RMSE \(\sim 0.1\)).

Table 2 Performance statistics for the LSQ functions \(\phi _{\sigma _w}(\zeta )\) and \(\phi _{\sigma _s}(\zeta )\) for \(s = \theta \), q, c

4.2 Similarity Functions for the REA Statistics

Figure 4 shows the statistics of \(\Delta \overline{s}/s_*\) and the similarity functions \(\phi _{\Delta \overline{s}}(\zeta )\) fitted by least squares. The same general form of (18)–(19) was used:

$$\begin{aligned}&\phi _{\Delta \overline{s}}(\zeta ) = a_{\Delta \overline{s}}(1 - b_{\Delta \overline{s}}\zeta )^{-1/3}, \quad \zeta < 0, \end{aligned}$$
(20)
$$\begin{aligned}&\phi _{\Delta \overline{s}}(\zeta ) = c_{\Delta \overline{s}}, \quad \zeta \ge 0. \end{aligned}$$
(21)

The corresponding values are shown in Table 3.

Table 3 Least-squares estimates of coefficients for \(\phi _{\Delta \overline{s}}(\zeta )\) for \(s = \theta \), q, c

Because these functions have only been explicitly proposed very recently by Zahn et al. (2023), these may be the first results obtained in a roughness sublayer over a forest. Visually, the \(\phi _{\Delta \overline{s}}\) values are signficantly less scattered than their \(\phi _{\sigma _s}\) counterparts, indicating that REA dimensionless statistics may be potentially more useful than standard deviation ones. This is confirmed quantitatively in Table 4, where MAE and RMSE for the \(\phi _{\Delta {\overline{s}}}\)s are at least an order of magnitude less than the corresponding values in Table 2. Clearly, the REA dimensionless statistics behave better (as a “function” of \(\zeta \)) in the roughness sublayer than their dimensionless standard deviation counterparts: a physical explanation for this is not at hand and will require further research. However, the scatter is still relatively large and this is probably a consequence of measurements made within the RSL.

Fig. 4
figure 4

Dimensionless values of \(\Delta \overline{s}/s_*\) for \(s = \theta \) (a, b), \(s = q\) (c, d) and \(s = c\) (e, f) for stable (a, c, e) and unstable (b, d, f) conditions. The vermillion line is a least-squares (LSQ) fit

Table 4 Performance statistics for the LSQ functions \(\phi _{\Delta \overline{s}}(\zeta )\) for \(s = \theta \), q, c

5 Results for Predicted Fluxes

5.1 Flux Estimates from Dimensionless Standard Deviations (Variance Method)

When MOST does not apply for a particular scalar s, it makes little sense to estimate its turbulent fluxes from the standard deviation similarity function \(\phi _{\sigma _s}(\zeta )\), given the unacceptable scatter seen in Fig. 3 and the corresponding performance statistics in Table 2. Here, we re-do this exercise not because it is applicable in practice for our site, but because (a) it highlights the role of large magnitude fluxes on visual and numerical evaluation of the results; and (b) it provides a baseline to assess the gains in applying the REA method, as done by Zahn et al. (2023). We estimate the fluxes \(\overline{w's'}\) using the observed values of \(u_*\); an iterative procedure is performed for \(s = \theta \) by starting at \(\zeta =0\) and calculating \(\theta _*\) (from Eq. (2)) and \(\zeta \) (from Eq. (4)) until convergence in \(\theta _*\); and then \(q_*\) and \(c_*\) are obtained again from Eqs. (2) and (4). All kinematic fluxes are then obtained from Eq. (5). Note that we are using the whole dataset both for estimating the coefficients a, b and c in (18)–(19) and for evaluating the performance of the estimated fluxes, since our intention here is only to assess the relative merits of the variance and REA methods.

Figure 5 shows the kinematic fluxes predicted by the variance method. For \(\theta \) under unstable conditions, the predicted fluxes are much smaller in magnitude than the observed ones. This means that \(\zeta \approx 0\) always for the prediction of the other two fluxes. In spite of that, we see that there is always a linear trend (and often a large linear correlation) between observed and predicted values. The corresponding performance statistics are given in Table 5. The very large magnitude of the errors is clearly discernible by the (large) values of NBIAS, NMAE and NRMSE.

Fig. 5
figure 5

Observed \(\times \) predicted kinematic fluxes \(\overline{w's'}\) by the variance method, \(s = \theta \) (a, b), \(s = q\) (c, d) and \(s = c\) (e, f) for stable (a, c, e) and unstable (b, d, f) conditions. The vermillion line is a least-squares fit through the origin, and the black line is \(y=x\)

Table 5 Performance statistics for the kinematic fluxes \(\overline{w'\theta '}\) (\({\mathrm{m\,s}^{-1}\textrm{K}}\)), \(\overline{w'q'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{g\,kg}^{-1}}\)), \(\overline{w'c'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{mg\,kg}^{-1}}\)), estimated by the variance method

The spread of the small fluxes is difficult to discern in Fig. 5. To emphasize the relative error made in flux estimation, we re-plot the same results in Fig. 6. Now we plot the ratios of predicted to observed fluxes in the vertical axes, against the observed fluxes in the horizontal axes. The blue lines are the medians of the ratios. The figure shows how the large magnitude fluxes tend to have a small spread around the median, but that the spread of the small magnitude fluxes is exceptionally large: we are truncating the vertical axes at a maximum value of 10, but larger ratios do occur in the dataset. Clearly, if we restrict the conditions so that only larger flux values are probed, the variance method will tend to perform better in the RSL: this is very likely what happened for small zenith angles in Zahn et al. (2016b). The physical mechanism explaining this is probably that, for large-magnitude fluxes, gradient production of scalar semivariance will be strong enough to balance, approximately, molecular dissipation of scalar variance.

Fig. 6
figure 6

Observed ratios of predicted to observed kinematic fluxes \(\overline{w's'}\) by the variance method, \(s = \theta \) (a, b), \(s = q\) (c, d) and \(s = c\) (e, f) versus the observed fluxes, for stable (a, c, e) and unstable (b, d, f) conditions. The blue line is the median

5.2 Flux Estimates from REA-S

We applied REA-S for all 3 scalars using the median \(\beta _s\) for each scalar and for each stability range (stable, unstable) to estimate the fluxes. \(\beta _s\) estimation and flux performance used the same dataset, since as mentioned earlier the intent is only to assess the relative potential of the method. The median values of the obtained \(\beta _s\)s are listed in Table 6. The values of \(\beta _s\) are remarkably close under each stability regime. They are also clearly different in stable (mean of 0.6049) and unstable (mean of 0.5564) conditions. While analyzing the REA method at the same site with a different dataset, Zahn et al. (2016b) obtained similar values for \(\beta _s\) under unstable conditions, namely a mean of 0.5287 between the medians of \(\beta _\theta \), \(\beta _q\) and \(\beta _c\) at 39.4 m and 0.5847 at 81.6 m. Thus, it appears that \(\beta _s\) increases slightly with height in the RSL over the canopy in unstable conditions.

Table 6 Median \(\beta _s\), for \(s = \theta \), q, c, under stable and unstable conditions

Figure 7 shows the predicted versus observed fluxes thus obtained: for each scalar, we predicted the flux with the corresponding median \(\beta _s\) in Table 6. The performance is very good, showing an excellent agreement. Note that the small variability of \(\beta _s\) among all scalars for the same stability conditions gives confidence on the applicability of the method, without calibration, at least for the ATTO site—but bear in mind the \(\beta _s\) dependency on height. The relative errors of REA-S can be better discerned in Fig. 8. The scatter of the small magnitude fluxes is now much smaller than in Fig. 6 for the variance method, but it is still present, and also likely due, at least in part, to the inherently more difficult RSL conditions. The scatter is larger for stable than for unstable conditions. Together, Figs. 7 and 8 reconcile the previous results of Zahn et al. (2016b) and Zahn et al. (2023): plotted on an \(x \times y\) graph, the REA method shows excellent performance, but this kind of plot hides the still large variability of the computed \(\beta _s\)s when plotted (for example) against \(\zeta \). Therefore, while the REA method is probably good enough for mean flux estimates over (say) many days, one must be cautious when the small-magnitude fluxes are of importance (say, for specific biophysical processes, etc.).

The errors of REA-S are quantified in Table 7. They are much smaller than those from the variance method (see Table 5), confirming in general the findings of Zahn et al. (2023), but now for an Amazonian RSL. Note that by estimating \(\beta _s\) as the median value, BIAS is virtually eliminated; NMAE remains below 10% for unstable conditions, and below \(\sim 15\)% for stable conditions.

Fig. 7
figure 7

Observed \(\times \) predicted kinematic fluxes \(\overline{w's'}\) by means of the REA-S method, \(s = \theta \) (a, b), \(s = q\) (c, d) and \(s = c\) (e, f) for stable (a, c, e) and unstable (b, d, f) conditions. The vermillion line is a least-squares fit through the origin, and the black line is \(y=x\)

Fig. 8
figure 8

Observed ratios of predicted to observed kinematic fluxes \(\overline{w's'}\) by means of the REA-S method, \(s = \theta \) (a, b), \(s = q\) (c, d) and \(s = c\) (e, f) versus the observed fluxes, for stable (a, c, e) and unstable (b, d, f) conditions. The blue line is the median

Table 7 Performance statistics for the kinematic fluxes \(\overline{w'\theta '}\) (\({\mathrm{m\,s}^{-1}\textrm{K}}\)), \(\overline{w'q'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{g\,kg}^{-1}}\)), \(\overline{w'c'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{mg\,kg}^{-1}}\)), estimated by REA-S

5.3 Flux Estimates from REA-T

We applied REA-T for q and c with simultaneous measurement of \(\beta _\theta \), assuming \(\beta _{q,c} = \beta _\theta \), and then estimating \(\overline{w'q'}\) and \(\overline{w'c'}\), for each stability regime (stable, unstable). This mimics the simultaneous measurement of sonic temperature, and dispenses with any a priori estimate of \(\beta _s\), but has a built-in assumption of \(\theta \)s scalar similarity. Figure 9 shows the predicted versus observed fluxes thus obtained. The performance again is very good. The relative errors of REA-T can be discerned in Fig. 10. Figures 9 and 10 look very similar to Figs. 7 and 8, showing that both REA-S and REA-T produce reasonably good results. The same observations about the large scatter of the predicted fluxes when their magnitude is small apply. The performance statistics in Table 8 are somewhat worse than their counterparts in Table 7, but by a small margin only. Therefore, we deem REA-T as capable as REA-S with the additional advantage that no a priori estimate of \(\beta _s\) is necessary.

Fig. 9
figure 9

Observed \(\times \) predicted kinematic fluxes \(\overline{w's'}\) by means of the REA-T method, \(s = q\) (a, b) and \(s = c\) (c, d) for stable (a, c) and unstable (b, d) conditions. The vermillion line is a least-squares fit through the origin, and the black line is \(y=x\)

Fig. 10
figure 10

Observed ratios of predicted to observed kinematic fluxes \(\overline{w's'}\) by means of the REA-T method, \(s = q\) (a, b) and \(s = c\) (c, d) versus the observed fluxes, for stable (a, c) and unstable (b, d) conditions. The blue line is the median

Table 8 Performance statistics for the kinematic fluxes \(\overline{w'q'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{g\,kg}^{-1}}\)), \(\overline{w'c'}\) (\({\mathrm{m\,s}^{-1}\,\mathrm{mg\,kg}^{-1}}\)), estimated by REA-T

5.4 Long-Term Hourly Predictive Ability of REA

Figure 11 shows the hourly means for the whole dataset, for both \(s=q\) and \(s=c\), for REA-S (a, c) and REA-T (b, d) against eddy covariance measurements. The performance of REA-S for \(s=\theta \) is very similar to that exhibited for q and c and is not shown while REA-T for temperature, obviously, makes no sense. As it can be seen, the REA method’s (both versions) ability to capture the daily cycle and its dispersion around the hourly means is very similar to the eddy covariance measurements themselves. For the purpose of quantifying mass exchanges between the canopy and the atmosphere at the ATTO site (and very likely many other similar forested regions), therefore, our results validate the use of REA as a valuable alternative when fast-response scalar sensors are not available. The one caveat is whether these results also apply for trace gases such as \({\textrm{CH}_4}\) or isoprene since, if their fluxes are all very small, they might fall in the high scatter region of Figs. 8 and 10. In all fairness, the good performance of the REA method over forests is not reported here for the first time; it can be found in Bowling et al. (1999) for water vapor and \({\textrm{CO}_2}\) fluxes (c.f. their Fig. 5a–d). At the same they also show that the scatter of the REA method in comparison to eddy covariance is considerably larger for isoprene flux (c.f. their Fig. 5e, f)—although this may also be a consequence of the precision of their isoprene measurements. Clearly, the subject of the ability of the REA method to produce consistently good results for trace gases will require further study.

Fig. 11
figure 11

Hourly means of EC, REA-S and REA-T fluxes at the ATTO site. The bars indicate 1 standard deviation around the means: a EC \(\times \) REA-S, \(s=q\); b EC \(\times \) REA-T, \(s=q\); c EC \(\times \) REA-S, \(s=c\); d EC \(\times \) REA-T, \(s=c\)

6 Discussion and Conclusions

The relatively large scatter found for \(\beta _s\) by Zahn et al. (2016b) at the same site (ATTO) as the present study’s might suggest that the REA method is not applicable in the RSL of ATTO. However, the recent finding of Zahn et al. (2023) that the REA can yield rather good results in comparison to EC measurements even when the scalar in question (in their case, mostly \({\textrm{CO}_2}\)) does not conform to MOST, has prompted us to re-visit the issue with ATTO data. The present work actually reconciles both findings. The \(\beta _s\) scatter is large in the RSL, as can be seen (indirectly) in Figs. 8 and 10. On the other hand, this affects mostly the small-magnitude fluxes. Zahn et al. (2023) showed very clearly that when any scalar “fails” MOST this is basically caused by relatively large (in their case) transport and storage terms: at the ATTO site, other causes such as horizontal and vertical advection might also be playing a role, on account of the underlying topography (see Chamecki et al. 2020; Allouche et al. 2022). In all likelihood, the failure is associated with the small-magnitude fluxes because then the corresponding gradient production term in the scalar semivariance budget is also relatively small and the other terms are causing the breakdown of MOST. The term “relatively” is key here, since (again, very likely) the gradient production ultimately is small with respect to the scalar dissipation term. This, however, remains a conjecture and will need to be addressed in future research studies.

Large-magnitude fluxes that occur in the middle of the day, on the other hand, are (again in all likelihood, pending a detailed analysis of the scalar semivariance budgets) associated with a large gradient production term. In this case the \(\beta _s\)s approach the approximately constant values around 0.6 found elsewhere in the literature when measurements were made in the ISL. This is exactly what Zahn et al. (2016b) found for small zenith angles, which naturally occur in the middle of the day, although an explanation based on the scalar semivariance budget was not offered then. These large-magnitude fluxes weigh more heavily in most performance statistics or visual analyses, which tend to hide the large scatter of the small-magnitude fluxes. Moreover, when hourly averages of all data were taken, in Fig. 11, the REA method showed considerable ability in reproducing the daily cycle of the EC measurements for the dry season. This lends confidence in the ability of the REA method, despite the shortcomings of RSL-measurements related to the small-magnitude fluxes, of producing reliable estimates of the canopy-atmosphere mass exchanges over timescales larger than (say) several days. In particular, REA-T seems to be a better choice than REA-S since, in spite of slightly larger overall errors, it dispenses with a priori assumptions on the value of \(\beta _s\).

However, to really settle the matter, further research needs to be done:

  1. 1.

    A better understanding of the interplay between the scalar semivariance budget and REA needs to be obtained. For this purpose, a budget as detailed as possible is needed in conjunction with the behavior of \(\beta _s\) under various situations, viz. when the gradient production term is large, and when the transport term is large or more generally when a large imbalance between gradient production and dissipation is present.

  2. 2.

    Unfortunately, a criterion for identifying such imbalance that dispenses with EC measurements is lacking; such a criterion would be highly useful for practical quality control of REA measurements. The scalar flux number proposed by Cancelli et al. (2012) might be a useful starting point for this purpose.

  3. 3.

    Scalar variance budgets involving trace gases measured by EC and simultaneous assessment of the REA for these gases are also needed. The present study considered scalars associated with intuitively large fluxes of heat, \({\textrm{H}_2\textrm{O}}\) and \({\textrm{CO}_2}\). A possibility remains that for trace gases the gradient production term is never able to balance dissipation alone. If this happens, then, the REA method is bound to become much more uncertain.