Introduction

Latent heat flux (LE) refers to the transferred energy during the process of evaporation of liquid water from various land surfaces (including small water bodies) and transpiration from the leaves of plants and sublimation of ice and snow (Liang et al. 2010; Vinukollu et al. 2011a; Wang and Dickinson 2012; Yao et al. 2013, 2014a, b). Especially, the amount of LE for terrestrial forest transpiration, approximately 5–30 times greater than the amount of LE for soil evaporation, has played an important role in the climate system by coupling the surface with the atmosphere, and the variable that links the water, energy, and carbon cycles (Mu et al. 2007; Fisher et al. 2008; Wang and Liang 2008; Yuan et al. 2010; Mueller et al. 2011; Yao et al. 2014c). Accurately estimating LE over forest ecosystems is therefore required to understand and simulate dynamics of surface vegetation in energy and hydrological cycles.

Numerous models with different levels of complexity and process parameterizations have been developed and widely used to for quantifying surface LE. These LE methods mainly include (1) statistical and empirical methods (Jackson et al. 1977; Wang et al. 2007; Jung et al. 2010; Mueller et al. 2011), (2) surface energy balance methods (Norman et al. 1995; Kustas and Norman 1996; Anderson et al. 1997), (3) Penman-Monteith (PM) method (Monteith 1965; Wang et al. 2010a, b; Mu et al. 2011), (4) Priestley-Taylor (PT) method (Priestley and Taylor 1972; Fisher et al. 2008; Jin et al. 2011; Yao et al. 2013), and (5) data assimilation (DA) methods (Pipunic et al. 2008; Xu et al. 2011a, b). However, most of the models described above vary in structural complexity, parameterization, and the level of data required to run them. Moreover, when applied at different surface vegetation types, these methods have resulted in large differences (Jiménez et al. 2011; Mueller et al. 2011).

In recent years, several studies have focused on estimating regional LE based on the PM and PT method driven by meteorological and remote sensing data (Cleugh et al. 2007; Fisher et al. 2008; Zhang et al. 2009, 2010; Wang et al. 2010a, b; Vinukollu et al. 2011b). Cleugh et al. (2007) proposed a simple surface resistance algorithm for the PM method using leaf area index (LAI) and other meteorological parameters. Fisher et al. (2008) designed a novel PT-based LE algorithm (PT-JPL) by introducing both atmospheric (relative humidity, RH, and vapor pressure deficit (VPD)) and eco-physiological constraints (fraction of photosynthetically active radiation, FPAR, and LAI) without using any ground-based observed data. Subsequently, Mu et al. (2011) developed a LE algorithm to generate moderate-resolution imaging spectroradiometer (MODIS) product by introducing a stomatal conductance parameterization based on environmental controlling factors and separating surface conductance over bare soil. To overcome the overestimation of canopy conductance when LAI is higher than 3, Wang et al. (2010a, b) used the normalized difference vegetation index (NDVI) and relative humidity deficit (RHD) to parameterize canopy conductance and proposed a semi-empirical Penman LE algorithm to estimate global surface LE. Although these models have proven to be effective for estimating LE at regional-scale applications, model parameters were calibrated locally, limiting the utility of the studied model to those specific locations or areas with similar meteorological and land surface conditions (Wang and Dickinson 2012; Ershadi et al. 2013).

Considering that it is difficult to select an appropriate LE algorithm for different land cover types, many scientist attempted to focus on a number of model intercomparison studies. Sumner and Jacobs (2005) compared PM and PT methods using eddy covariance (EC) observations of pasture sites in Florida and found that the PT method with a calibrated alpha coefficient outperforms the PM method. Vinukollu et al. (2011a) evaluated three LE algorithms (Surface Energy Balance System (SEBS), PM and PT algorithms) using eddy covariance observations from 16 FLUXNET tower sites and documented that PT algorithm provided the best estimates. Similarly, Chen et al. (2014) compared eight LE models, including five empirical and three process-based models, using the observed data from 23 eddy covariance towers in China and reported that the parameters of the empirical methods may have different combinations because the environmental factors of LE are not independent. Many of these intercomparison studies mainly focus on evaluation of different actual LE algorithms using few eddy covariance sites of small regions (Jiménez et al. 2011; Mueller et al. 2011; Ershadi et al. 2013). However, selection of the best LE algorithm for different forest ecosystems applications is still not documented in current method intercomparison contributions.

In this study, we evaluate the performance of three satellite-based LE algorithms, including a revised remote sensing-based Penman-Monteith LE (RRS-PM) algorithm, a modified satellite-based Priestley-Taylor LE (MS-PT) algorithm, and a semi-empirical LE (UMD-SEMI) algorithm of the University of Maryland, across a variety of forest types. Our study has two major objectives. First, we analyze the LE algorithms’ structure and the sensitivity of input variables for the three satellite-based LE algorithms. Second, we evaluate the LE algorithms’ performance over forest ecosystems based on 40 flux tower sites provided by FLUXNET network.

Datasets

Data at FLUXNET eddy covariance sites

Three satellite-based LE algorithms were evaluated with the flux data from six Ameriflux towers (http://public.ornl.gov/ameriflux/data-get.cfm), one Chinaflux tower (http://159.226.111.42/pingtai/LoginRe/opendata.jsp), nine Asiaflux towers (http://asiaflux.yonsei.kr/index.html) (Yu et al. 2006a, b, 2008, 2013), and 24 FLUXNET towers (http://www.fluxdata.org). These data sets include the longest continuous worldwide multisite measurements of LE, sensible heat flux (H), surface net radiation (Rn), and corresponding meteorological observations. The flux tower sites cover five major global land-surface forest biomes: deciduous broadleaf forest (DBF; 10 sites), deciduous needleleaf forest (DNF; 4 sites), evergreen broadleaf forest (EBF; 10 sites), evergreen needleleaf forest (ENF; 10 sites), and mixed forest (MF; 6 sites) (Table 1 and Fig. 1). Detailed information on site information and data gap-filling technique were available at the FLUXNET web site. The data cover the period from 2000 to 2007, and each flux tower has at least 1 year of reliable data. Although the eddy covariance method is considered best for directly measuring heat fluxes in global measurement experiments (Baldocchi et al. 2001; Gough et al. 2013), we selected the method proposed by Twine et al. (2000) to correct the LE from the FLUXNET flux towers and Chinese flux towers, due to the problem of energy imbalance.

Table 1 A description of site conditions
Fig. 1
figure 1

Location of the 40 flux tower sites used in this study. DBF deciduous broadleaf forest, DNF deciduous needleleaf forest, EBF evergreen broadleaf forest, ENF evergreen needleleaf forest, MF mixed forest

Satellite inputs to LE algorithms

To evaluate the performance of all LE algorithms over forest ecosystems in this study for all 40 flux tower sites, the 8-day MODIS FPAR/LAI (MOD15A2) product (Myneni et al. 2002) and the 16-day MODIS NDVI/EVI (MOD13A2) product (Huete et al. 2002) with 1-km spatial resolution were used to drive three algorithms. These data were downloaded directly from the Oak Ridge National Laboratory Distributed Active Center (ORNL DAAC) web site (http://daac.ornl.gov/MODIS/). Quality control (QC) flags were examined to screen and reject poor quality LAI/FPAR and NDVI/EVI data. The daily FPAR and LAI values were temporally interpolated from the 8-day averages using linear interpolation. Similarly, the daily NDVI/EVI values were temporally interpolated from the 16-day averages using linear interpolation.

Algorithms descriptions

Revised remote sensing-based Penman-Monteith LE algorithm

Based on the beta version of the MOD16 algorithm (Mu et al. 2007), Yuan et al. (2010) developed a revised remote sensing-based PM LE (RRS-PM) algorithm by revising the equations dealing with temperature constraint for stomatal conductance and energy allocation between vegetation canopy and soil surface (Yuan et al. 2012, 2014). The RRS-PM algorithm meets Penman-Monteith equation, namely,

$$ \mathrm{L}\mathrm{E}=\frac{\varDelta \left({R}_{\mathrm{n}}-G\right)+\rho {C}_{\mathrm{p}}\left({e}_{\mathrm{s}}-e\right)/{r}_{\mathrm{a}}}{\varDelta +\gamma \left(1+{r}_{\mathrm{s}}/{r}_{\mathrm{a}}\right)} $$
(1)

where ρ is the density of air, C p is the specific heat of air at constant pressure, e s is the saturated vapor pressure, e is the actual vapor pressure, Δ is the slope of the saturate vapor pressure curve, γ is the psychrometric constant, r s and r a are the aerodynamic and surface resistance, respectively.

In the RRS-PM algorithm, the temperature constraint (m T) for stomatal conductance follows the equation detailed by Fisher et al.(2008) with an optimum T opt set as 25 °C.

$$ {m}_{\mathrm{T}}= \exp \left(-{\left(\frac{T-{T}_{\mathrm{opt}}}{T_{\mathrm{opt}}}\right)}^2\right) $$
(2)

In the RRS-PM algorithm, the Beer-Lambert law was used to exponentially partition net radiation between the canopy (A c) and the soil surface (A soil).

$$ {A}_{\mathrm{soil}}={R}_{\mathrm{n}} \exp \left(-k\times \mathrm{L}\mathrm{A}\mathrm{I}\right) $$
(3)
$$ {A}_{\mathrm{c}}={R}_{\mathrm{n}}-{A}_{\mathrm{soil}} $$
(4)

where k is extinction coefficient (0.5). The validation for 23 EC flux tower sites in China revealed higher accuracy for the RRS-PM algorithm than for the beta version of the MOD16 algorithm (Chen et al. 2014).

Modified satellite-based Priestley-Taylor LE algorithm

The modified satellite-based Priestley-Taylor algorithm (MS-PT) is specifically designed to minimize the need for ancillary meteorological data (Vinukollu et al. 2011a; Yao et al. 2013). It only needs four variables—surface net radiation (Rn), air temperature (Ta), diurnal air temperature range (DT), and NDVI. In the MS-PT algorithm, LE is calculated by the sum of the unsaturated soil evaporation (LE s), the canopy transpiration (LE c), the saturated wet soil surface evaporation (LE ws), and the canopy interception evaporation (LE ic). The RRS-PM algorithm can be expressed as:

$$ \mathrm{L}\mathrm{E}={\mathrm{LE}}_{\mathrm{s}}+{\mathrm{LE}}_{\mathrm{c}}+{\mathrm{LE}}_{\mathrm{ws}}+{\mathrm{LE}}_{\mathrm{ic}} $$
(5)
$$ {\mathrm{LE}}_{\mathrm{s}}=\alpha \left(1-{f}_{\mathrm{wet}}\right){f}_{\mathrm{s}\mathrm{m}}\frac{\varDelta }{\varDelta +\gamma}\left({R}_{\mathrm{ns}}-G\right) $$
(6)
$$ {\mathrm{LE}}_{\mathrm{c}}=\alpha \left(1-{f}_{\mathrm{wet}}\right){f}_{\mathrm{T}}{f}_{\mathrm{c}}\frac{\varDelta }{\varDelta +\gamma }{R}_{\mathrm{nv}} $$
(7)
$$ L{E}_{ws}=\alpha {f}_{\mathrm{wet}}\frac{\varDelta }{\varDelta +\gamma}\left({R}_{\mathrm{ns}}-G\right) $$
(8)
$$ {\mathrm{LE}}_{\mathrm{ic}}=\alpha {f}_{\mathrm{wet}}\frac{\varDelta }{\varDelta +\gamma }{R}_{\mathrm{nv}} $$
(9)

where α is a coefficient of the Priestley-Taylor equation (1.26), f wet is the wet surface fraction, f sm is the soil moisture constraint, f c is the vegetation cover fraction, f T is the plant temperature constraint, G is soil heat flux, R ns is the surface net radiation to the soil, and R nv is the surface net radiation to the vegetation.

Semi-empirical LE algorithm of the University of Maryland

To detect LE variations on a scale of several decades, Wang et al. (2010a, b) developed a semi-empirical LE (UMD-SEMI) algorithm based on the Penman equation. The UMD-SEMI algorithm considers the impacts of wind speed (WS) on LE and it can be expressed as:

$$ \mathrm{E}\mathrm{T}={a}_1\left({\mathrm{E}\mathrm{T}}_{\mathrm{E}}+{\mathrm{E}\mathrm{T}}_{\mathrm{A}}\right)+{a}_2{\left({\mathrm{E}\mathrm{T}}_{\mathrm{E}}+{\mathrm{E}\mathrm{T}}_{\mathrm{A}}\right)}^2 $$
(10)
$$ {\mathrm{E}\mathrm{T}}_{\mathrm{E}}=\frac{\varDelta }{\varDelta +\gamma }{R}_{\mathrm{s}}\left[{a}_3+{a}_4\mathrm{NDVI}+\mathrm{R}\mathrm{H}\mathrm{D}\left({a}_5+{a}_6\mathrm{NDVI}\right)\right] $$
(11)
$$ {\mathrm{ET}}_{\mathrm{A}}=\frac{\gamma }{\varDelta +\gamma}\mathrm{W}\mathrm{S}\left[{a}_7+\mathrm{R}\mathrm{H}\mathrm{D}\left({a}_8+{a}_9\mathrm{NDVI}\right)\right]\mathrm{V}\mathrm{P}\mathrm{D} $$
(12)

where a 1 = 0.819, a 2 = 0.0017, a 3 = 0.476, a 4 = 0.284, a 5 = −0.654, a 6 = 0.264, a 7 = 3.06, a 8 = −3.86, a 9 = 3.64, RHD equals to 1 minus relative humidity (RH), and VPD refers to vapor pressure deficit.

Statistical analysis

We selected a simple method to analyze the sensitivity of input variables for three satellite-based LE algorithms (Wang et al. 2007; Yao et al. 2012), and this formula can be written as:

$$ \mathrm{S}\mathrm{R}\mathrm{C}=\frac{\partial \mathrm{L}\mathrm{E}}{\partial V(x)} $$
(13)

where SRC is the sensitivity coefficient, V(x) is the input variable (e.g., Rn, Ta, NDVI). We have also summarized the coefficient of determination (R 2), the root-mean-squared error (RMSE), the Nash-Sutcliffe efficiency coefficient (NSE), and the average bias and p values for the estimated LE and those derived from tower data to evaluate the relative predictive errors for three LE algorithms. The Nash-Sutcliffe efficiency represents a normalized statistic that determines the relative magnitude of the residual variance (noise) compared to the measured data variance (Nash and Sutcliffe 1970) and is expressed as:

$$ \mathrm{N}\mathrm{S}\mathrm{E}=1-\frac{{\displaystyle \sum_{i=1}^n{\left({\mathrm{LE}}_i^{\mathrm{obs}}-{\mathrm{LE}}_i^{\mathrm{sim}}\right)}^2}}{{\displaystyle \sum_{i=1}^n{\left({\mathrm{LE}}_i^{\mathrm{obs}}-{\mathrm{LE}}_{\mathrm{mean}}\right)}^2}} $$
(14)

where LE obs i is the ith observed LE, LE sim i is the ith simulated LE, LEmean is the mean of the observed LE, and n is the total number of observations. NSE varies from − ∞ to 1.0, with a NSE = 1 being the optimal value (Moriasi et al. 2007; Ershadi et al. 2013).

Results

Sensitivity analysis of three satellite-based LE algorithms

Given the great difference in three satellite-based LE algorithm structures and the input variables, it is of interest to examine the sensitivity of input variables for three satellite-based LE algorithms. Rn, Ta, LAI, and RH are the input variables for the RRS-PM algorithm. As illustrated for the MS-PT algorithm, except for Rn and Ta, it also acquires NDVI and DT. The UMD-SEMI algorithm is the only method that explicitly includes wind speed, which may play an important role in annual or decadal LE (Wang et al. 2010a, b; McVicar et al. 2012). The energy term of the UMD-SEMI algorithm is Rs rather than Rn. The averaged LE varies at all flux towers up to ±20 % for the MS-PT and RRS-PM algorithms by changing Rn with ±20 and ±16 %, respectively (Fig. 2). For the UMD-SEMI algorithm, in response to the ±20 % change in LE, Rs varies by ±18 %. LE varies by ±20 % for the RRS-PM and UMD-SEMI algorithms by changing RH with ±4 and ±6 %, respectively. However, for the MS-PT algorithm, LE relatively increases up to 3 % for DT change of −20 %. In response to the change in NDVI (LAI) with ±20 % for both MS-PT and UMD-SEMI (RRS-PM) algorithms, LE varies by ±10 and ±8 % (±6 %) at all flux towers. LE estimated by the UMD-SEMI algorithm varies by ±4 % for WS change of ±20 %. Overall, LE estimation by three algorithms shows different sensitivity orders: Rn > LAI > RH > Ta for the RRS-PM algorithm, Rn > NDVI > Ta > DT for the MS-PT algorithm, and Rs > NDVI > RH > Ta > WS for the UMD-SEMI algorithm.

Fig. 2
figure 2

Sensitivity analysis of the LE estimated by the a RRS-PM, b MS-PT, and c UMD-SEMI algorithms with the corresponding input variables, respectively

For the above three LE algorithms, energy (Rn or Rs) and vegetation (NDVI or LAI) terms account for more than 78 % variation of LE. The contribution of the energy term (Rn or Rs) is the highest in the three LE algorithms because these algorithms are calculated as the sum of the surface energy balance term. Hwang and Choi (2013) found the sensitivity orders: Rn > LAI > Pa for the revised remote sensing-based PM model. Yao et al. (2014b) considered the sensitivity orders of estimated LE using the MS-PT algorithm with Rn (±20 % change of LE for ±20 % change of the variable) > NDVI (±10 %) > Ta and DT (less than ±7 %) at different land cover types. Besides Rn, the dependency of NDVI (or LAI) in these algorithms is higher in all flux towers as vegetation amount quantified by vegetation index (NDVI) and LAI affect the vegetation photosynthesis and transpiration (Tucker 1979; Wang and Liang 2008; Yao et al. 2013, 2014a). Canopy conductance, which is directly related with plant transpiration, and vegetation index (or LAI) represented a nonlinear relationship. Wang and Dickinson (2012) pointed out that many equations will overestimate canopy conductance when LAI is higher than 3 or 4, and using vegetation index can effectively correct this issue. Compared with energy and vegetation terms, other variables play a minor role.

Comparisons of three satellite-based LE algorithms over forest ecosystems

Three satellite-based LE algorithms demonstrate substantial differences in algorithm performance for estimating daily LE variations among five forest ecosystem types (Fig. 3). For both ENF and EBF, all algorithms demonstrate low performance with high RMSE and low R 2. In contrast, all algorithms exhibit good performance over DBF and DNF sites. This may be attributed to the fact that strong seasonality for vegetation indices or LAI effectively characterizes the variations in the LE of broadleaf forest while reliable vegetation information is difficult to acquire for evergreen forest because seasonal evergreen forest variation is less evident (Yebra et al. 2013). For all five forest types, the MS-PT algorithm has a good overall performance and exhibits the highest R 2 and RRS-PM shows the lowest R 2 because the MS-PT algorithm has partitioned total evaporation (canopy transpiration, soil evaporation, wet canopy evaporation) and its lower uncertainties in the required forcing data. The second good model is the UMD algorithm and the performance of the RRS-PM algorithm is ranked third.

Fig. 3
figure 3

Observed LE at eddy covariance sites versus estimated LE from three algorithms

For a given forest type, algorithm performance differs greatly (Figs. 3 and 4). For all DBF sites, the MS-PT algorithm shows the highest performance with higher R 2 (averaged R 2 = 0.79) and NSE (averaged NSE = 0.81), and lower RMSE (averaged RMSE = 25.4 W/m2) when compared with the other two algorithms while the RRS-PM algorithm shows the lowest performance. For all DNF sites, the RRS-PM algorithm has the highest average RMSE (20.3 W/m2), and the UMD-SEMI algorithm has the lowest RMSE (17.1 W/m2) and the greatest average NSE (0.78). It is clear that the MS-PT and UMD-SEMI algorithms both overestimate terrestrial LE, but the RRS-PM algorithm underestimates terrestrial LE for all DNF sites. For all EBF sites, the RRS-PM algorithm has the lowest average RMSE (28.1 W/m2), with average R 2 of 0.55 and average NSE of 0.4 for all EBF sites. However, the MS-PT algorithm has the highest R 2 despite of its higher RMSE. For all ENF sites, the UMD-SEMI algorithm has the lowest average RMSE (28.1 W/m2), with average bias of 5.7 W/m2 and with average NSE of 0.51. Considering the UMD-SEMI algorithm was calibrated using the data from 64 flux tower sites provided by FLUXNET projects, the performance of the UMD-SEMI algorithm is strongly related to the regression coefficients (Yao et al. 2013, 2014a). For all MF sites, the performance of the MS-PT algorithm with the highest average NSE (0.62) and the lowest average RMSE (25.3 W/m2) is higher than both the RRS-PM and UMD-SEMI algorithms. In general, MF has the higher canopy conductance than that of deciduous forests (Eugster et al. 2000) and perhaps the MS-PT algorithm may capture this information by parameterization of the vegetation index.

Fig. 4
figure 4

Model performance of three LE algorithms for estimating daily LE for five forest biomes

Figure 5 shows an example of the seasonal cycles of the 8-day LE average simulated by three satellite-based latent heat flux algorithms at different type sites. Distinct seasonal cycles of LE can be found at different sites. In general, the measured and predicted seasonal curves are in good agreement. For DBF, DNF, ENF, and MF sites, the estimated LE is higher in summer when the Rs, Ta, and vegetation index are maximal. During spring and fall, the estimated LE is smaller due to the decreasing Rs and Ta. In contrast, compared with other forests, the LE for the EBF site is high in winter. Although the LE estimated by the three satellite-based algorithms has similar seasonal variation, in comparison to the RRS-PM and UMD-SEMI algorithms, the MS-PT algorithm yields seasonal LE variations that are closest to the ground-measured values at most forest types and this is generally consistent with the previous literature (Yao et al. 2014a, b; Ershadi et al. 2013; Chen et al. 2014). Ershadi et al. (2013) used FLUXNET data to evaluate four LE algorithms (SEBS, PM, PT, and complementary method) and found that the PT method has improved performance compared to the PM method. Chen et al. (2014) compared eight LE models, including five empirical and three process-based models and reported that although the three process-based models showed high model performance across the validation sites, there were substantial differences among them in the temporal and spatial patterns of LE.

Fig. 5
figure 5

Example of a time series for the 8-day LE average as measured and estimated using three algorithms for five forest biomes

Discussion

Regulators of the three LE algorithms

Available energy, moisture demand, and atmospheric evaporative demand have been considered as the three most factors controlling LE. In this study, the three LE algorithms have strongly correlated with Rn (Rs) because it has been widely recognized as the most important parameter that varies with spatiotemporal variation in LE in terrestrial ecosystems (Wang and Dickinson 2012). However, Rn (Rs) exerts greater influence on energy-limited water-yielding catchments than water-limited (Nemani et al. 2003; McVicar et al. 2012; Wang and Dickinson 2012). McVicar et al. (2012) mapped the global distribution of areas where LE was energy-limited or water-limited to conclude that energy drives LE in both tropical ecosystems (mainly EBF) and high-latitude ecosystems (mainly ENF), and water drives LE in arid regions. Therefore, most EBF trees of tropical regions sustain elevated LE during the dry season through deep roots and most ENF trees in high-latitude regions are less water stressed because of their slow transpiration (Karam and Bras 2008; Wang and Dickinson 2012).

Moisture demand is another key factor in determining LE in most ecosystems, especially in semiarid and arid regions. Satellite-derived NDVI and LAI are good at characterizing the spatial variability of soil and vegetation moisture. Therefore, many scientists attempted to use NDVI or LAI as a surrogate for vegetation moisture and use RH to replace soil moisture to develop satellite-based LE algorithms (Fisher et al. 2008; Mu et al. 2011; Yuan et al. 2010; Wang et al. 2010a, b). The three algorithms in this study have improved the accuracy of LE simulation by integrating satellite-based vegetation parameters (NDVI or LAI).

WS, as an aerodynamic controlling variable of atmospheric evaporative demand during the evapotranspiration (ET) process, also contributes to the long-term variation in LE (Wang et al. 2010a, b; McVicar et al. 2012). Several studies have focused on the causes of evaporative dynamics without considering WS, whereas the UMD-SEMI algorithm clearly illustrates the contributions of WS to LE (Mu et al. 2007; Fisher et al. 2008; Yao et al. 2013). Although many LE algorithms without WS have higher accuracy compared with the UMD-SEMI algorithm, detection of long-term variation in terrestrial LE may cause large uncertainty when ignoring the impacts of WS on LE.

Performance of the three LE algorithms

Although the three satellite-based LE algorithms performed well in most forest sites and displayed seasonality in its performance over the majority of the flux towers, the results of this study illustrated some limitations in the application of the three algorithms. We found that the RRS-PM algorithm underestimated LE for most deciduous forests flux tower sites and in winter, the performance of the RRS-PM algorithm is low because of large errors resulting in LE underestimation. In theory, the RRS-PM algorithm should outperform the MS-PT model because its theoretical advances of the PM approach over the PT method (Ershadi et al. 2013). However, surface resistance parameterization technique is complicated and the uncertainties in forcing data reduced the accuracy of RRS-PM-based LE estimation (Mu et al. 2007; Yuan et al. 2010; Chen et al. 2014; Ershadi et al. 2013). Yuan et al. (2010) found that choice of vegetation and temperature parameterization will lead to great impact on RRS-PM-based LE estimation. Ershadi et al. (2013) also reported that the surface resistance parameterization is more important than the actual structure of the algorithm, which affected the RRS-PM algorithm.

Compared with the RRS-PM algorithm, the MS-PT algorithm demonstrated the most consistent performance for most forests sites because this algorithm does not require aerodynamic and surface resistances to reduce the uncertainties in forcing data. However, the MS-PT algorithm reduced performance for the ENF tower sites and this may be resulted from the limitation of NDVI in capturing the vegetation seasonal variation of this type (Xiao et al. 2004; Ershadi et al. 2013). In addition, Rn and Ta are the main driving factors for the MS-PT algorithm and they generally have lower uncertainty in observations (Yao et al. 2013; Ershadi et al. 2013). Similar issues in the performance of the PT algorithm have been documented in substantial previous studies (Fisher et al. 2008; Vinukollu et al. 2011a; Yao et al. 2014b). Vinukollu et al. (2011a) reported the good performance of the PT algorithm in 12 eddy covariance tower sites. Yao et al. (2014b) found that compared with the PT-JPL algorithm, the MS-PT algorithm improves the LE estimates at 40 eddy covariance flux tower sites.

The UMD-SEMI algorithm performed well in ENF sites, which may be attributed to the coefficients calibration using ground-measured data from most ENF sites. Although the UMD-SEMI algorithm ignores explicit biophysical mechanisms, it provides a simple yet seemingly robust method by building functional relationships between the LE and predictor variables (Wang et al. 2010a, b). A similar conclusion has been reported by Wang et al. (2010a, b); the 16-day average daily LE can be reasonably predicted with an average correlation coefficient of 0.94 and average RMSE of 17 W/m2 using 64 FLUXNET sites. However, the representativeness of the limited training dataset will lead to large uncertainties by extrapolating other regions (Jung et al. 2010; Wang and Dickinson 2012; Chen et al. 2014). Chen et al. (2014) found that the parameters of the UMD-SEMI algorithm may have different combination because the environmental factors of ET are not independent.

The possible sources of uncertainty in the three satellite-based LE algorithms may stem from: (1) the errors of the ground-measured variables (Hollinger and Richardson 2005; Richardson et al. 2006; Jenkins et al. 2007), (2) the biases of the satellite-derived LAI retrieval, (3) the energy imbalance issue of the eddy covariance method, (4) the spatial scale mismatch between flux tower site and satellite-derived NDVI and LAI, and (5) the uncertainties inherent in the structure of the three algorithms for coefficients parameterization. The three algorithm used in this study are all developed with high-quality tower scale dataset. Therefore, application of these algorithms at large spatial scale will lead to large uncertainties.

Conclusion

The goal of the study was to evaluate three satellite-based LE algorithms over forest ecosystems using ground-measured data collected from 40 eddy covariance flux tower sites provided by the FLUXNET project. The LE algorithms used in this study include the RRS-PM algorithm, the MS-PT algorithm, and the UMD-SEMI algorithm. Given the great difference in the three satellite-based LE algorithm structures and the input variables, we first analyzed the sensitivity of input variables for the three satellite-based LE algorithms. Furthermore, we compared the three LE algorithms’ performance over forest ecosystems.

Results of the sensitivity analysis of input variables for the three satellite-based LE algorithms illustrate that overall, LE estimation by the three algorithms shows different sensitivity orders: Rn > LAI > RH > Ta for the RRS-PM algorithm, Rn > NDVI > Ta > DT for the MS-PT algorithm, and Rs > NDVI > RH > Ta > WS for the UMD-SEMI algorithm. For the three LE algorithms, energy (Rn or Rs) and vegetation (NDVI or LAI) terms can account for more than 78 % variation of LE. Results of the evaluation and intercomparison among these algorithms show that the three satellite-based LE algorithms demonstrate substantial differences in algorithm performance for estimating daily LE variations among five forest biomes. The MS-PT algorithm has the high performance over both DBF and MF sites, the RRS-PM algorithm has the lowest average RMSE (28.1 W/m2) over EBF sites, and the UMD-SEMI algorithm has the high average NSE of 0.51 over ENF sites and average NSE of 0.78 over DNF sites. Regulators and uncertainties of the three LE algorithms are also discussed.