Utilizing Machine Learning Models with Limited Meteorological Data as Alternatives for the FAO-56PM Model in Estimating Reference Evapotranspiration

Amani, Shima; Shafizadeh-Moghadam, Hossein; Morid, Saeid

doi:10.1007/s11269-023-03670-2

Utilizing Machine Learning Models with Limited Meteorological Data as Alternatives for the FAO-56PM Model in Estimating Reference Evapotranspiration

Published: 07 February 2024

Volume 38, pages 1921–1942, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Water Resources Management Aims and scope Submit manuscript

Utilizing Machine Learning Models with Limited Meteorological Data as Alternatives for the FAO-56PM Model in Estimating Reference Evapotranspiration

Download PDF

288 Accesses
4 Citations
Explore all metrics

Abstract

The current study evaluated the accuracy of four machine learning (ML) models and thirteen experimental methods calibrated to estimate reference evapotranspiration (ET₀) in arid and semi-arid regions. Various scenarios were examined utilizing meteorological data and FAO56-PM as a benchmark. According to the results, the ML models outperformed the experimental methods on both daily and monthly scales. Among the ML models, artificial neural networks (ANNs), generalized additive model (GAM), random forest (RF), and support vector machine (SVM), respectively, demonstrated higher accuracy on a monthly scale, while ANNs, SVM, RF, and GAM exhibited greater accuracy on a daily scale. Notably, ANNs and SVM achieved high accuracy even with a limited number of variables. Conversely, RF showed improved accuracy with an increased number of variables. Comparing the ML and experimental models with equivalent inputs revealed that ANN with inputs similar to Valiantzas-1 performed better on a monthly scale, while SVM with inputs akin to Valiantzas-3 showed superior performance on a daily scale. Our findings suggest that average temperature, wind speed, and sunshine hours contribute significantly to the accuracy of ML models. Consequently, these ML models can serve as an alternative to the FAO56-PM method for estimating ET₀.

Estimation of crop evapotranspiration using statistical and machine learning techniques with limited meteorological data: a case study in Udham Singh Nagar, India

Article 03 April 2024

Machine learning models for daily net radiation prediction across different climatic zones of China

Article Open access 03 September 2024

Revealing accuracy in climate dynamics: enhancing evapotranspiration estimation using advanced quantile regression and machine learning models

Article Open access 24 June 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Efficient management of water resources in the agricultural sector is crucial for mitigating water crises (Lu et al. 2023; Roy et al. 2023), particularly in arid and semi-arid regions. Iran allocates over 90% of its water resources to agriculture (Alizadeh and Keshavarz 2005; Fathi-Taperasht et al. 2022). Evapotranspiration (ET) plays a vital role in optimizing water demand in agriculture, as more than 90% of the water utilized in agricultural ecosystems is lost through ET (Shan et al. 2020; Wang et al. 2019). ET also constitutes a basis for various calculations in water resources management as well as in the design and operation of irrigation and drainage systems (Feng et al. 2017; Yan et al. 2023). Accurate estimation of ET at the field level can greatly enhance management planning for irrigation water, determining the irrigation cycle, estimating the hydromodule of the network (water demand of crops), and predicting crop yield (Allen et al. 1998; Bachour et al. 2016; Anderson et al. 2007; Teuling et al. 2009).

Various factors influence ET variations, a complex physical phenomenon comprising multiple nonlinear processes (Jovic et al. 2018; Li et al. 2022; Amani and Shafizadeh-Moghadam 2023). Over the years, researchers have proposed two general groups of ET measuring techniques: point methods and regional methods. Lysimeter, a point method, is used to measure ET directly with no assumptions (Holmes 1984) and is a benchmark for calibrating other methods (Liu et al. 2017). Nevertheless, its limited availability, high costs, operational challenges, and environmental impact restrict its usage (Fan et al. 2018; WMO 1963; Scanlon et al. 1997). As a result, mathematical models that utilize meteorological data to estimate ET have gained popularity (Ferreira et al. 2019), and numerous indirect methods for estimating ET based on influential factors have been developed (Almorox et al. 2015). The Penman–Monteith (PM) model modified by the Food and Agriculture Organization (FAO) is widely used as a reference for evaluating the performance and calibration of other ET estimation models (Allen et al. 1998).

The FAO56-PM model requires a complete set of meteorological data, comprising air temperature (maximum temperature (Tmax), minimum temperature (Tmin) and average temperature (Tm)), relative air humidity (RH), net solar radiation (Rn), wind speed (Ws), atmospheric pressure, and soil heat flux (G). However, the cost of collecting this data is considerable not only in developed countries (Chu et al. 2017), but also and most particularly in developing countries. Consequently, reliable data may not be consistently available over consecutive years (Bellido-Jiménez et al. 2021; De Paola and Giugni 2013; Eccel 2012). Therefore, preferred over the FAO56-PM model are alternative experimental methods, the most common of which are categorized as temperature-based methods that utilize Tmax and Tmin (Hargreaves and Samani 1985; Hargreaves et al. 1985; Blaney and Criddle 1962), solar radiation-based methods that use the difference between Rn, G, and latent heat (λ) (Abtew 1996; Irmak et al. 2003; Makkink 1957; Priestley and Taylor 1972), mass transfer-based methods that employ Dalton's law and the concept of water vapor flux transfer (Penman 1948; WMO 1963), and hybrid methods that combine various parameters such as solar radiation(Rs), T (Tm, Tmax, Tmin), and RH (Doorenbos and Pruitt 1977; Valiantzas 2013a, b). These methods are often complex, nonlinear, influenced by random factors, and rely on multiple assumptions. Each method is optimized based on the specific characteristics and unique weather conditions of the area under study (Küçüktopcu et al. 2023). Experimental methods for measuring ET, however, are limited to field or catchment-level applications. Furthermore, their results are dependent on time and location, hindering generalization of the findings to other areas. The need to calibrate equation coefficients and the inherent uncertainty associated with these methods have further contributed to their limitations (Islam and Alam 2021; Kisi et al. 2015).

The inherent nonlinearity and instability of meteorological variables makes challenging the complex phenomenon of ET estimation. Consequently, developing a precise physics-based formula for making accurate estimations is difficult. Thus, researchers have recently turned their attention to machine learning (ML) as an alternative approach for ET estimation (Krishnashetty et al. 2021). Numerous studies have demonstrated that ML techniques such as artificial neural networks (ANNs), support vector machines (SVMs), and random forest (RF) outperform empirical and semi-empirical methods in estimating reference evapotranspiration (ET₀). ML methods offer advantages such as fast computation, high accuracy, and strong generalization capability (Elbeltagi et al. 2021; Feng et al. 2016; Mousavi et al. 2015; Abd-Elaty et al. 2023). Kumar et al. (2002) introduced an ANN for calculating ET₀ that exhibited accuracy comparable to the FAO-56 PM method. Shi et al. (2020) investigated daily ET₀ in southeastern Australia and demonstrated the superior performance of RF over empirical equations. Rahimi Khoob (2008) developed an ANN model based on the Hargreaves method that used monthly data from the Khuzestan Plain of Iran, and it outperformed the Hargreaves model. Tabari et al. (2012) simulated ET₀ in Iran utilizing several ML methods, all of which outperformed the Blaney-Criddle, Hargreaves, and Jensen Haise models. Landeras et al. (2018) found that ANN models outperformed the Hargreaves method when using the same inputs. Rashid Niaghi et al. (2021) simulated ET₀ in a semi-humid climate using gene expression programming (GEP), SVM, ML, and RF methods with empirical equations as inputs. They found that the combination of radiation-based models and the RF model yielded the best performance results across all stations.

Evaluating ML models to reduce input data is crucial because of the significance of data availability in estimating ET. Wen et al. (2015) employed SVM and ANN to model ET₀ using limited meteorological data in arid regions of China and compared their results with experimental models like those of Priestly-Taylor and Hargreaves. They found that SVM performed best when using Tm, Rs, and Ws data. Mohammadrezapour et al. (2018) investigated the performance of SVM, adaptive neuro-fuzzy inference system (ANFIS), and GEP utilizing five combinations of inputs to simulate ET₀ in southeast Iran from 1970 to 2010 and found that SVM performed superiorly with inputs consisting of Tm, RH, Ws, and sunshine hours (Sshn). Ferreira et al. (2019) evaluated the performance of ANN and SVM models in estimating ET₀ across Brazil using either Tm and RH data or T (Tmin, Tm, Tmax) alone; both models demonstrated acceptable accuracy. Bellido-Jiménez et al. (2021) developed various neural intelligence methods, including MLP, generalized regression neural network (GRNN), extreme learning machine (ELM), SVM, and RF, to estimate ET₀ using temperature-based data as the only input in southern Spain. They concluded that ELM performed superiorly in all scenarios and locations. In general, ML models using fewer inputs exhibit comparable performances to the FAO-56PM model and outperform experimental methods.

Although ML models excel at unraveling intricate relationships, their effectiveness as data-driven models depends on the careful selection of variables, data quality, and the optimization of model parameters. Determining these parameters, however, typically depends on user expertise and the nature of the input data. In ET estimation, one approach for selecting ML variables is to align them with the inputs used in experimental methods. Despite numerous studies having explored ET estimation using different variables, few have compared ML models to experimental methods for estimation accuracy, identification of important variables, and the generalizability and stability of results. In the current study, 13 experimental methods and four ML models were examined to estimate ET₀ in a watershed located in southwestern Iran. The study objectives were: 1) to compare the accuracy of ML and experimental models with similar inputs, 2) to assess the accuracy of ML models compared to the FAO56-PM model using minimal input data, and 3) to identify the variables that influence ET.

2 Material and Methods

2.1 General Methodology

Figure 1 depicts a flowchart illustrating the primary steps of this study. Initially, annual precipitation was processed to identify wet, drought, and normal years. Next, meteorological data for these periods were gathered and utilized as input for estimating ET₀ using FAO-56PM, experimental models, and ML models. The results were then assessed using three indices: R² (coefficient of determination), RMSE (root mean square error), and MAE (mean absolute error).

2.2 Study Area

The current study considered the Karkheh Basin located in the southwest of Iran. Covering an area of 51,000 km², the basin originates from the Zagros mountain range, flows into Horul Azim (Fig. 2), and boasts elevations varying from 3626 (m.a.s.l) in upstream regions to -8 (m.a.s.l) in downstream areas. The upper parts of the basin are characterized as semi-arid, while the southern part is classified as dry. Average precipitation in the region measures 474 mm and daily Tm fluctuate between -13.7 and 45.9 °C. Dam construction and the expansion of agricultural lands, particularly irrigated areas, have been an enduring characteristics of this basin.

2.3 Wet, Drought, and Normal Year Selection

Meteorological data from 15 stations within the Karkheh Basin were procured from the National Meteorological Organization of Iran. Table S1, provided in the supplementary file, provides the main characteristics of the data. Precipitation data for the years 2000 to 2021 were analyzed. Average annual precipitation (± SD) was calculated and the mean ± 1SD was derived. Wet and drought years were defined as average annual precipitation exceeding the mean ± 1SD average precipitation lower than the mean-1SD, respectively; those years falling within these two intervals were considered normal (Chow et al. 1971; McCuen 2016). Thirteen stations reported drought conditions in 2019, eight experienced normal conditions in 2020, and 12 encountered drought in 2021.

2.4 ET Estimation Models

2.4.1 FAO56-PM

The effectiveness of 13 experimental and four ML models was evaluated using the PM equation, specifically the FAO56-PM model (Eq. 1), as the benchmark and calculated as (Allen et al. 1998):

$${ET}_{0}=\frac{0.408\Delta }{\Delta +\gamma }\left({R}_{n}-G\right)+\frac{\gamma }{\Delta +\gamma }\frac{\frac{900}{(Ta+273)}{u}_{2}({e}_{s}-{e}_{a})}{(1+0.34{u}_{2})}$$

(1)

where ET₀ denotes reference evapotranspiration (mm/day), Rn is the net solar radiation at the crop surface (MJ m⁻² d⁻¹), G represents soil heat flux (MJ m⁻² d⁻¹) (which is typically ignored for daily estimates), Ta indicates the daily mean air temperature (°C), u₂ represents the wind speed at a height of 2 m (m s⁻¹), es signifies the saturation vapor pressure (kPa), ea represents the actual vapor pressure (kPa) (obtained using maximum and minimum relative humidity), ∆ indicates the slope of the vapor pressure curve (kPa ºC⁻¹), and γ denotes the psychrometric constant (kPa ºC⁻¹).

2.4.2 Experimental Methods

The 13 experimental models utilized to estimate ET₀ were the Hargreaves-Samani and Blaney-Criddle (temperature-based); Penman and WMO (mass transfer-based); Makkink, Priestley-Taylor, Jensen-Haise, Abtew, and Irmak (radiation-based); and the Doorenbos-Pruitt, Valiantzas-1, Valiantzas-2, and Valiantzas-3 (combined approaches) models. These models were specifically developed to cater to diverse climatic conditions and geographical regions. Table 1 presents the equations and references for these experimental models.

Table 1 Experimental models for estimating reference evapotranspiration

Full size table

2.4.3 Machine Learning Models

Random Forest

RF, a tree-based model introduced in 2001 (Breiman 2001), was developed using a base learner called CART which has the capability to model nonlinear and complex patterns (Hastie et al. 2009). Unlike CART which can yield significantly different trees with minor variations in input data, RF employs the bootstrapping sampling method and generates multiple data samples using replacements from the original dataset. Each sample is then used to train a CART model, and the final output is determined by averaging the results. This ensemble approach produces more stable outcomes than CART (Carter and Liang 2019).

Artificial Neural Networks

Multiple ANNs with different architectures have been developed for various applications; among them, the multilayer perceptron is widely utilized. Regardless of the architecture, a learning algorithm is employed to discover the relationships between independent and dependent variables. The learning process entails adjusting the weights to minimize prediction error. During the ANN training phase, the learning algorithm optimizes the weights by reducing the prediction error through a repetitive procedure called backpropagation, which computes the difference between the predicted and the actual output of the network (Rumelhart et al. 1986). The direction and magnitude of the weight adjustments are determined by the partial derivative of the error with respect to each weight (Hecht-Nielsen 1992).

Support Vector Machine

SVM performs well when the available training data is limited (Mantero et al. 2005). This algorithm maps each data instance onto an n-dimensional space, where the dimensions represent the features or independent variables, and then separates them using a line or plane (Cortes and Vapnik 1995). In certain cases, separation is improved by transforming the samples to a higher-dimensional space using kernels. Commonly employed kernels include sigmoid, linear, radial basis function (rbf), and polynomial ones. The support vector acts as an optimal boundary that effectively separates the data groups, aiming to maximize the margin with the data.

Generalized Additive Model

GAM is suitable for situations in which the relationship between independent variables and the response variable is complex and non-linear, such as in environmental processes. GAM is a non-parametric extension of a generalized linear model (Hastie and Tibshirani 1990) that offers explicit insight into the relationships between variables. It allows the response curve to be determined by the observed data utilizing splines, i.e., mathematical functions that offer flexibility in fitting intricate curves to the data. Splines divide the curve into smaller, simpler segments, enabling the representation of the non-linear relationship between independent variables and the response variable (Hastie and Tibshirani 1990).

2.4.4 Variable Selection for ET Estimation

The efficacy of ML models can be affected by the existence of collinearity among independent variables. In this research, collinearity among variables was examined daily and monthly and input variables were selected using variable clustering and variance inflation factor (VIF). Variable clustering is advantageous in feature selection, as it allows for the identification of representative variables within each cluster, which can then be chosen for subsequent analysis or modeling purposes. VIF quantifies the degree of multicollinearity; a VIF value of 1 indicates no collinearity, while a value exceeding 5 is considered indicative of high multicollinearity (O'brien 2007).

2.5 Model Evaluation

RMSE, MAE, and R² (Eqs. 2–4) were used to assess the performance of both experimental and ML models. RMSE indicates an overall measure of the error, MAE indicates the average absolute error, and R² indicates the relationship between the observed and predicted values. R² should be as close to one as possible, and RMSE and MAE should be close to zero.

$$RMSE=\sqrt{\sum_{1}^{N}\frac{{(Pi-Qi)}^{2}}{N}}$$

(2)

$${R}^{2}=\frac{\sum_{1}^{N}(Qi-Qavg)(Pi-Pavg)}{\sqrt{\sum_{1}^{N}(Qi-Qavg)}\sqrt{\sum_{1}^{N}(Pi-Pavg)}}$$

(3)

$$MAE=\frac{\sum_{1}^{N}(Pi-Qi)}{n}$$

(4)

where Pi is the predicted value of ET₀, Pavg represents the predicted mean ET₀, Qi denotes the observed value, Qavg shows the mean observed ET₀, and n is the number of data.

3 Results and Discussion

3.1 Evaluation of Experimental Models

Different experimental models based on FAO56-PM were assessed using daily and monthly data for normal, drought, and wet years. The combined models exhibited higher accuracy on both daily and monthly scales (Figs. 3 and 4). Conversely, the mass transfer-based models displayed low accuracy. The temperature-based Blaney-Criddle method showed superior accuracy on both daily and monthly scales, potentially because of its incorporation of Ws in the calculation of constants a and b. Among the radiation-based models, the Abtew method (RMSE: 0.78, R²: 0.93 and MAE: 0.57) and Priestley-Taylor (RMSE: 1.57, R²: 0.90 and MAE: 1.17) achieved the highest and lowest accuracy, respectively on a monthly scale. On a daily scale, Priestley-Taylor was the most accurate (RMSE: 1.41, R²: 0.79 and MAE: 1.02), whereas the Jensen-Haise model was the least accurate (RMSE: 8.81, R²: 0.88 and MAE: 6.85). In the combined models, the Valiantzas-3 method demonstrated the highest accuracy, while Doorenbos and Pruitt's method exhibited the lowest accuracy on both daily and monthly scales. According to Fig. 4, the Valiantzas-3 and -1, Abtew, Makkink, and Jensen-Haise models showed higher monthly accuracy compared with other methods, while the Valiantzas-3, Blaney-Criddle, Valiantzas-2, and Priestley-Taylor models showed the most accuracy on a daily scale. The experimental models performed differently across various time scales. Because Valiantzas-3 and Valiantzas-1 required Ws and RH, the Abtew and Makkink models which require the least input were selected for the monthly scale. The Valiantzas-2 and Priestley-Taylor models were found to be most suitable for the daily scale, because Valiantzas-3 and Blaney-Criddle models incorporate Ws in their inputs.

Figure 4 presents a performance comparison between the FAO56-PM and various experimental models on the daily scale. As illustrated, the Valiantzas-3, Blaney-Criddle, Valiantzas-2, and Priestley-Taylor models exhibited superior performances for daily ET₀ estimation compared to the other models for daily, and thus, the Valiantzas-2 and Priestley-Taylor models were considered the optimal choices. Figures 3 and 4 show the results from evaluations of 13 experimental models on daily and monthly scales, respectively. Among these models, the Valiantzas-3, Valiantzas-1, and Abtew models demonstrated superior performances on the monthly scale. The Abtew model utilizes RS and Tmax as inputs, while the Valiantzas-1 model incorporates Rs, Tm, and RH as inputs. As the model requiring the minimum monthly ET₀ data, Abtew was deemed more suitable. It also benefits from a simpler equation.

3.2 Variable Selection for Machine Learning Models

Based on VIF analysis, Ws, vapor-pressure deficit (VPD), and Sshn demonstrated the least collinearity (Table 2); however, why temperature was excluded from the daily and monthly scales is inexplicable, considering its crucial role in ET estimation. The results were further investigated using the variable clustering method, and Fig. 5 presents the outcomes, where one variable from each group falling below the 0.8 dashed line should be chosen. Tm was selected from the group of variables (Tsoil, Tm, Tmax, Tmin), because Tmin and Tmax only represent specific times of the day and cannot adequately capture the Tm for water consumption throughout the entire day. Furthermore, measuring Tmin and Tmax may require specific instruments that are not universally available. Average relative humidity (RHm) was chosen from the group of relative humidity variables (RHm, minimum relative humidity (RHmin), maximum relative humidity (RHmax)), because it reflects the capacity of air to hold water vapor, and higher relative humidity indicates a closer proximity to saturation, resulting in lower ET. Rhmax represents the maximum relative humidity recorded during the day or month, leading to a lower estimation of ET. Conversely, RHmin causes overestimation. VPD was selected from the group (VPD and mean pressure (Pm)), as it directly indicates the atmosphere’s ability to accept water vapor. Based on theoretical considerations and the results of both methods, Tm, RHm, VPD, Sshn, and Ws were chosen for modeling ET estimation.

Table 2 Variable selection using VIF

Full size table

Apart from selecting variables statistically, the availability, low cost, and measurement accuracy of each variable must also be considered. Therefore, various combinations were explored for ET estimation (Table 3). Furthermore, to compare the outputs of the ML models and experimental methods, the input variables of experimental methods were also examined to be used as input for ML models.

Table 3 Combinations of input variables for ET estimation using ML models

Full size table

3.3 ML Models for ET Estimation

The current study assessed the use of RF, SVM, ANN, and GAM models for estimating ET using different combinations of input variables. The findings are presented in three sections: models utilizing input data similar to the FAO56-PM, models employing diverse input combinations, and models incorporating inputs similar to the experimental methods.

3.3.1 ML Models for ET Estimation Using the Same Input as FAO56-PM

Figure 6 shows the performance comparison of RF, SVM, ANN, and GAM models on daily and monthly scales using the same inputs as the FAO56-PM model. As seen, all models achieved high accuracy with performances similar to that of the FAO56-PM model. Nevertheless, ML models required a significantly longer computational time than the FAO56-PM model using software such as CropWat or Macro Excel. All ML models were executed in less than a minute, and the most accurate models for monthly and daily scales were ANN and SVM, respectively.

3.3.2 ML Models for ET Estimation Using Different Input Combinations

Figure 7 illustrates the results of ML models utilizing various combinations of inputs as presented in Table 4. Overall, both R² and RMSE values improved as more inputs were included, with the ANN model consistently outperforming other models across most combinations. ANN, GAM, RF, and SVM, respectively, exhibited higher accuracy when predicting on a monthly scale. When estimating on the daily scale, however, ANN, SVM, RF, and GAM were respectively more precise.

Table 4 Accuracy of the ML models with variables similar to the experimental models

Full size table

In terms of two-variable combinations, the ANN model incorporated Tm and Ws and demonstrated the highest accuracy for daily predictions. For monthly predictions, the SVM utilizing Tm and Ws as well as the GAM employing Ws and VPD exhibited superior accuracy. These findings suggest the importance of Ws in estimating ET. By adding Sshn to the Tm and Ws, an accuracy very close to that of the models with all inputs was achieved by SVM on the monthly scale and ANN on the daily scale. Models using four variables achieved similar accuracy to models using all variables. Among different combinations, adding RH or VPD had an equivalent effect on the combination set of Tm, Ws, and Sshn, which is in line with the findings of Mohammadrezapour et al. (2018). Furthermore, by introducing Sshn to the Tm and Ws variables in the subsequent combination, a level of accuracy comparable to that of models employing all inputs was achieved. The SVM and ANN models displayed greater accuracy for monthly and daily predictions, respectively.

To summarize, ANN and SVM demonstrated superior performances when utilizing a smaller number of variables, whereas RF exhibited better results when incorporating a larger number of variables. Tm, Ws, and Sshn were identified as influential factors in enhancing the accuracy of the ML models. Consequently, models incorporating these three inputs can serve as a viable alternative to the FAO-56PM method. Fan et al. (2019) discovered that including solar radiation further improved the accuracy of the models, and Pandey et al. (2016) demonstrated that models utilizing Ws data achieved higher levels of accuracy.

3.3.3 ML Models for ET Estimation Using the Same Input as Experimental Models

The findings of a comparison between ML models and experimental methods using the same input variables showed that ML models surpassed all experimental models in accuracy for both daily and monthly scales, as shown in Table 5. Specifically, the ANN employing identical inputs as Valiantzas-1 demonstrated a superior performance on the monthly scale, while the SVM utilizing the same inputs as Valiantzas-3 exhibited better results on the daily scale. As mentioned in the preceding section, the Valiantzas-2 and Priestley-Taylor models were determined to be appropriate for daily scale estimations, while the Abtew model was found to be suitable for monthly scale estimations. Among the ML models, the SVM aligned with the Priestley-Taylor model and the RF aligned with the Abtew model demonstrated superior accuracy compared to the other models. Notably, both of these models relied on radiation as a key input. These findings correspond with those of similar studies conducted by Heramb et al. (2023), Ünes et al. (2020), and Pendey et al. (2016), in which radiation-based models consistently demonstrated better performances.

Extensive research has consistently demonstrated the superior performance of ML models over empirical methods, a trend that was also observed in the present study. For example, Salam et al. (2020) reported the superiority of various ML models over empirical models (e.g., Ritchie, Thornthwaite, and Valiantzas) in predicting ET₀. Mehdizadeh et al. (2017) showed that ML models (SVM, GEP, and MARS) consistently outperformed empirical methods in estimating ET₀ across 44 meteorological stations in Iran. Additionally, Alazba et al. (2016) employed the temperature-based Hargreaves model and the radiation-based Priestley-Taylor model to estimate ET₀ using local meteorological data; they found that the ML-based model yielded the most accurate results among all the approaches considered.

3.4 Variable Importance in ML Models for ET Estimation

Figure 8 illustrates the ten most important variables in ML models for estimating daily and monthly ET₀. In the ANN model, Tm was the most important for both daily and monthly scales, followed by Ws for the monthly scale and RHm for the daily scale. In the GAM model, variables such as Pm, VPD, and Tm had the most substantial impact on both temporal scales, with Ws being the third influential factor for the daily scale. As for the RF model, Ws emerged as the most important for both scales, followed by Sshn for the daily scale and Pm for the monthly scale. Similarly in the SVM model, temperature exhibited the greatest effect on both temporal scales. In the daily scale, the three primary variables were Tm, Tmin, and Tmax, which aligns with the findings of Wu et al. (2019), who studied eight ML models with daily temperature and precipitation data from 14 different weather stations in China. The researchers recommended SVM models be used with temperature data only to predict daily ET₀ throughout China. Additionally, Yunfei et al. (2023) identified temperature and humidity as the most important factors in estimating ET in arid regions.

In monthly estimations, the variable Tm appeared most frequently with four repetitions, followed by Ws and Pm, each with two repetitions. Additionally, Tmax, Sshn, and precipitation 24 hour (P24) were deemed important, each with one repetition. For daily scale estimations, Tm was the most frequently repeated variable, followed by Tmax and Ws. Tmin, Sshn, and VPD variables were also considered important. Overall, it can be concluded that Tm, Ws, VPD, and Sshn had the most important impact on forecasting, highlighting their importance in ET estimation.

4 Conclusion

The present study assessed the accuracy of thirteen experimental methods and four ML models for daily and monthly ET₀ estimation during drought, wet, and normal years and compared their performances against the FAO56-PM method, which served as a benchmark model. The study aimed to identify those variables that impact ET₀ estimation notably. The experimental models were categorized into four groups, among which the combined methods exhibited the best performances, while methods based on mass transfer demonstrated weaker performances. Notably, the performance of the experimental models varied across different time intervals. Consequently, the Valiantzas-3, Valiantzas-1, Abtew, Makkink, and Jensen-Haise models were identified as more suitable for monthly scale ET₀ estimation. For daily scale ET₀ estimation, however, the Valiantzas-3, Blaney-Criddle, Valiantzas-2, and Priestley-Taylor models were considered more appropriate. For cases with minimal input, the Abtew and Makkink models are recommended for monthly scale, while the Valiantzas-2 and Priestley-Taylor models are suggested for daily scale estimations. Nevertheless, the ML and FAO-56PM models performed similarly and exhibited comparable accuracy on both daily and monthly scales. Overall, SVM showed higher accuracy at the monthly scale, while ANN performed better at the daily scale. Furthermore, both ANN and SVM achieved better accuracy when using fewer variables, whereas RF had greater accuracy with a larger number of variables.

In sum, our findings indicate that Tm, Ws, and Sshn contribute positively to enhancing the accuracy of ML models, and ML models can serve as an alternative to the FAO56-PM method. Additionally, with similar inputs, ML models outperformed experimental methods in both daily and monthly scales. In general, Tm, Ws, VPD, and Sshn were found to have the most significant influence on predicting ET₀. The present study was conducted in arid and semi-arid regions of Iran. Therefore, it is recommended this research be replicated under different climatic conditions to assess the applicability and performance of this research in diverse regions.

Data Availability

Data will be made available on request.

References

Abd-Elaty I, Kushwaha NL, Patel A (2023) Novel Hybrid Machine Learning Algorithms for Lakes Evaporation and Power Production using Floating Semitransparent Polymer Solar Cells. Water Resour Manage 37:4639–4661. https://doi.org/10.1007/s11269-023-03565-2
Article Google Scholar
Abtew W (1996) Evapotranspiration measurements and modeling for three wetland systems in South Florida 1. JAWRA 32(3). https://doi.org/10.1111/j.1752-1688.1996.tb04044.x
Alazba A, Yassin M, Mattar M (2016) Modeling daily evapotranspiration in hyper-arid environment using gene expression programming. Arab J Geosci 9:202. https://doi.org/10.1007/s12517-015-2273-x
Article Google Scholar
Alizadeh A, Keshavarz A (2005) Status of agricultural water use in Iran. In Water conservation, reuse, and recycling: Proceedings of an Iranian-American workshop 4:94–105. Washington DC, USA: National Academies Press
Allen RG, Pereira LS, Raes D, Smith M (1998) Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao Rome 300(9):D05109
Google Scholar
Almorox J, Quej VH, Marti P (2015) Global performance ranking of temperature-based approaches for evapotranspiration estimation considering Köppen climate classes. J Hydrol 528:514–522. https://doi.org/10.1016/j.jhydrol.2015.06.057
Article Google Scholar
Amani S, Shafizadeh-Moghadam H (2023) A review of machine learning models and influential factors for estimating evapotranspiration using remote sensing and ground-based data. Agric Water Manag 284:108324
Article Google Scholar
Anderson MC, Norman JM, Mecikalski JR, Otkin JA, Kustas WP (2007) A climatological study of evapotranspiration and moisture stress across the continental United States based on thermal remote sensing: 1. Model formulation. J Geophys Res Atmos 112(D10). https://doi.org/10.1029/2006JD007506
Bachour R, Maslova I, Ticlavilca AM, Walker WR, McKee M (2016) Wavelet-multivariate relevance vector machine hybrid model for forecasting daily evapotranspiration. Stoch Environ Res Risk Assess 30:103–117. https://doi.org/10.1007/s00477-015-1039-z
Article Google Scholar
Bellido-Jiménez JA, Estévez J, García-Marín AP (2021) New machine learning approaches to improve reference evapotranspiration estimates using intra-daily temperature-based variables in a semi-arid region of Spain. Agric Water Manag 245:106558. https://doi.org/10.1016/j.agwat.2020.106558
Article Google Scholar
Blaney HF, Criddle WD (1962) Determining consumptive use and irrigation water requirements. US Department of Agriculture
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Article Google Scholar
Carter C, Liang S (2019) Evaluation of ten machine learning methods for estimating terrestrial evapotranspiration from remote sensing. Int J Appl Earth Obs Geoinf 78:86–92. https://doi.org/10.1016/j.jag.2019.01.020
Article Google Scholar
Chow V, Maidment DR, Mays LW (1971) Applied hydrology. McGraw-Hill Series in Water Resources and Environmental Engineering. McGraw-Hill: New York. ISBN 0–07–010810–2
Chu R, Li M, Shen S, Islam AR, Cao W, Tao S, Gao P (2017) Changes in reference evapotranspiration and its contributing factors in Jiangsu, a major economic and agricultural province of eastern China. Water 9(7):486. https://doi.org/10.3390/w9070486
Article Google Scholar
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Article Google Scholar
De Paola F, Giugni M (2013) Coupled spatial distribution of rainfall and temperature in USA. Procedia Environ Sci 19:178–187. https://doi.org/10.1016/j.proenv.2013.06.020
Article Google Scholar
Doorenbos J, Pruitt WO (1977) Crop water requirements. FAO irrigation and drainage, vol 24. Land and Water Development Division, FAO, Rome, pp 1–144
Google Scholar
Eccel E (2012) Estimating air humidity from temperature and precipitation measures for modelling applications. Meteorol Appl 19(1):118–128. https://doi.org/10.1002/met.258
Article Google Scholar
Elbeltagi A, Kumari N, Dharpure JK, Mokhtar A, Alsafadi K, Kumar M, Mehdinejadiani B, Ramezani Etedali H, Brouziyne Y, Towfiqul Islam AR, Kuriqi A (2021) Prediction of combined terrestrial evapotranspiration index (CTEI) over large river basin based on machine learning approaches. Water 13(4):547. https://doi.org/10.3390/w13040547
Article Google Scholar
Fan J, Guyot A, Ostergaard KT, Lockington DA (2018) Effects of earlywood and latewood on sap flux density-based transpiration estimates in conifers. Agric For Meteorol 249:264–274
Article Google Scholar
Fan J, Ma X, Wu L, Zhang F, Yu X, Zeng W (2019) Light Gradient Boosting Machine: An efficient soft computing model for estimating daily reference evapotranspiration with local and external meteorological data. Agric Water Manag 225:105758. https://doi.org/10.1016/j.agwat.2019.105758
Article Google Scholar
Fathi-Taperasht A, Shafizadeh-Moghadam H, Minaei M, Xu T (2022) Influence of drought duration and severity on drought recovery period for different land cover types: evaluation using MODIS-based indices. Ecol Ind 141:109146. https://doi.org/10.1016/j.ecolind.2022.109146
Article Google Scholar
Feng Y, Cui N, Zhao L, Hu X, Gong D (2016) Comparison of ELM, GANN, WNN and empirical models for estimating reference evapotranspiration in humid region of Southwest China. J Hydrol 536:376–383. https://doi.org/10.1016/j.jhydrol.2016.02.053
Article Google Scholar
Feng Y, Peng Y, Cui N, Gong D, Zhang K (2017) Modeling reference evapotranspiration using extreme learning machine and generalized regression neural network only with temperature data. Comput Electron Agric 136:71–78. https://doi.org/10.1016/j.compag.2017.01.027
Article Google Scholar
Ferreira LB, da Cunha FF, de Oliveira RA (2019) Fernandes Filho EI. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM–A new approach. J Hydrol 572:556–70. https://doi.org/10.1016/j.jhydrol.2019.03.028
Article Google Scholar
Hargreaves GH, Samani ZA (1985) Reference crop evapotranspiration from temperature. Appl Eng Agric 1(2):96–99. https://doi.org/10.13031/2013.26773
Article Google Scholar
Hargreaves GL, Hargreaves GH, Riley JP (1985) Irrigation water requirements for Senegal River basin. J Irrig Drain Eng 111(3):265–275. https://doi.org/10.1061/(ASCE)0733-9437(1985)111:3(265)
Article Google Scholar
Hastie TJ, Tibshirani R (1990) Generalized additive models. Chapman & Hall, London, p 352
Google Scholar
Hastie T, Tibshirani R, Friedman J (2009) "Random forests." The elements of statistical learning. Springer, New York, NY, 2009. 587–604. https://springerlink.bibliotecabuap.elogim.com/book/10.1007/978-0-387-84858-7
Hecht-Nielsen R (1992) Theory of the backpropagation neural network. Neural networks for perception. Academic Press, pp 65–93
Chapter Google Scholar
Heramb P, Ramana Rao KV, Subeesh A, Srivastava A (2023) Predictive modelling of reference evapotranspiration using machine learning models coupled with grey wolf optimizer. Water 15(5):856. https://doi.org/10.3390/w15050856
Article Google Scholar
Holmes JW (1984) Measuring evapotranspiration by hydrological methods. Agric Water Manag 8(1–3):29–40. https://doi.org/10.1016/0378-3774(84)90044-1
Article Google Scholar
Irmak S, Irmak A, Allen RG, Jones JW (2003) Solar and net radiation-based equations to estimate reference evapotranspiration in humid climates. J Irrig Drain Eng 129(5):336–347. https://doi.org/10.1061/(ASCE)0733-9437(2003)129:5(336)
Article Google Scholar
Islam S, Alam AR (2021) Performance evaluation of FAO Penman-Monteith and best alternative models for estimating reference evapotranspiration in Bangladesh. Heliyon 7(7):e07487. https://doi.org/10.1016/j.heliyon.2021.e07487
Article Google Scholar
Jensen ME, Haise HR (1963) Estimating evapotranspiration from solar radiation. J Irrig Drain Div 89(4):15–41. https://doi.org/10.1061/JRCEA4.0000287
Article Google Scholar
Jovic S, Nedeljkovic B, Golubovic Z, Kostic N (2018) Evolutionary algorithm for reference evapotranspiration analysis. Comput Electron Agric 150:1–4. https://doi.org/10.1016/j.compag.2018.04.003
Article Google Scholar
Kisi O, Sanikhani H, Zounemat-Kermani M, Niazi F (2015) Long-term monthly evapotranspiration modeling by several data-driven methods without climatic data. Comput Electron Agric 115:66–77. https://doi.org/10.1016/j.compag.2015.04.015
Article Google Scholar
Krishnashetty PH, Balasangameshwara J, Sreeman S, Desai S, Kantharaju AB (2021) Cognitive computing models for estimation of reference evapotranspiration: A review. Cogn Syst Res 70:109–116. https://doi.org/10.1016/j.cogsys.2021.07.012
Article Google Scholar
Küçüktopcu E, Cemek E, Cemek B, Simsek H (2023) Hybrid Statistical and Machine Learning Methods for Daily Evapotranspiration Modeling. Sustainability 15(7):5689. https://doi.org/10.3390/su15075689
Article Google Scholar
Kumar M, Raghuwanshi NS, Singh R, Wallender WW, Pruitt WO (2002) Estimating evapotranspiration using artificial neural network. J Irrig Drain Eng 128(4):224–233. https://doi.org/10.1061/(ASCE)0733-9437(2002)128:4(224)
Article Google Scholar
Landeras G, Bekoe E, Ampofo J, Logah F, Diop M, Cisse M, Shiri J (2018) New alternatives for reference evapotranspiration estimation in West Africa using limited weather data and ancillary data supply strategies. Theor Appl Climatol 132:701–716. https://doi.org/10.1007/s00704-017-2120-y
Article Google Scholar
Li Y, Qin Y, Rong P (2022) Evolution of potential evapotranspiration and its sensitivity to climate change based on the Thornthwaite, Hargreaves, and Penman-Monteith equation in environmental sensitive areas of China. Atmos Res 273:106178. https://doi.org/10.1016/j.atmosres.2022.106178
Article Google Scholar
Liu X, Xu C, Zhong X, Li Y, Yuan X, Cao J (2017) Comparison of 16 models for reference crop evapotranspiration against weighing lysimeter measurement. Agric Water Manag 184:145–155. https://doi.org/10.1016/j.agwat.2017.01.017
Article Google Scholar
Lu Y, Li T, Hu H, Zeng X (2023) Short-term prediction of reference crop evapotranspiration based on machine learning with different decomposition methods in arid areas of China. Agric Water Manag 279:108175. https://doi.org/10.1016/j.agwat.2023.108175
Article Google Scholar
Makkink GF (1957) Testing the Penman formula by means of lysimeters. Proc Inst Civ Eng 11:277–288
Google Scholar
Mantero P, Moser G, Serpico SB (2005) Partially supervised classification of remote sensing images through SVM-based probability density estimation. IEEE Trans Geosci Remote Sens 43(3):559–570
Article Google Scholar
McCuen RH (2016) Modeling hydrologic change: statistical methods. CRC Press
Book Google Scholar
Mehdizadeh S, Behmanesh J, Khalili K (2017) Using MARS, SVM, GEP and empirical equations for estimation of monthly mean reference evapotranspiration. Comput Electron Agric 139:103–114. https://doi.org/10.1016/j.compag.2017.05.002
Article Google Scholar
Mohammadrezapour O, Piri J, Kisi O (2018) Comparison of SVM, ANFIS and GEP in modeling monthly potential evapotranspiration in an arid region (Case study: Sistan and Baluchestan Province, Iran). Water Supply 19(2):392–403. https://doi.org/10.2166/ws.2018.084
Article Google Scholar
Mousavi R, Sabziparvar AA, Marofi S, Ebrahimi Pak NA, Heydari M (2015) Calibration of the Angström-Prescott solar radiation model for accurate estimation of reference evapotranspiration in the absence of observed solar radiation. Theor Appl Climatol 119:43–54. https://doi.org/10.1007/s00704-013-1086-7
Article Google Scholar
O’brien RM, (2007) A caution regarding rules of thumb for variance inflation factors. Qual Quant 41:673–90. https://doi.org/10.1007/s11135-006-9018-6
Article Google Scholar
Pandey PK, Dabral PP, Pandey V (2016) Evaluation of reference evapotranspiration methods for the northeastern region of India. ISWCR 1:52–63. https://doi.org/10.1016/j.iswcr.2016.02.003
Article Google Scholar
Penman HL (1948) Natural evaporation from open water, bare soil and grass. Proc R Soc Lon Ser A Math Phys 193(1032):120–45. https://doi.org/10.1098/rspa.1948.0037
Article CAS Google Scholar
Priestley CH, Taylor RJ (1972) On the assessment of surface heat flux and evaporation using large-scale parameters. MWR 100(2):81–92. https://doi.org/10.1175/1520-0493(1972)100%3C0081:OTAOSH%3E2.3.CO;2
Article Google Scholar
Rahimi khoob A (2008) Artificial neural network estimation of reference evapotranspiration from pan evaporation in a semi-arid environment. Irrig Sci 27:35–39. https://doi.org/10.1007/s00271-008-0119-y
Article Google Scholar
Rashid Niaghi A, Hassanijalilian O, Shiri J (2021) Estimation of reference evapotranspiration using spatial and temporal machine learning approaches. Hydrology 8(1):25. https://doi.org/10.3390/hydrology8010025
Article Google Scholar
Roy DK, Sarkar TK, Biswas SK (2023) Generalized Daily Reference Evapotranspiration Models Based on a Hybrid Optimization Algorithm Tuned Fuzzy Tree Approach. Water Resour Manage 37:193–218. https://doi.org/10.1007/s11269-022-03362-3
Article Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536
Article Google Scholar
Salam R, Islam AR (2020) Potential of RT, Bagging and RS ensemble learning algorithms for reference evapotranspiration prediction using climatic data-limited humid region in Bangladesh. J Hydrol 590:125241. https://doi.org/10.1016/j.jhydrol.2020.125241
Article Google Scholar
Scanlon BR, Tyler SW, Wierenga PJ (1997) Hydrologic issues in arid, unsaturated systems and implications for contaminant transport. Rev Geophys 35(4):461–490. https://doi.org/10.1029/97RG01172
Article CAS Google Scholar
Shan X, Cui N, Cai H, Hu X, Zhao L (2020) Estimation of summer maize evapotranspiration using MARS model in the semi-arid region of northwest China. Comput Electron Agric 174:105495. https://doi.org/10.1016/j.compag.2020.105495
Article Google Scholar
Shi L, Feng P, Wang B, Li Liu D, Cleverly J, Fang Q, Yu Q (2020) Projecting potential evapotranspiration change and quantifying its uncertainty under future climate scenarios: A case study in southeastern Australia. J Hydrol 584:124756. https://doi.org/10.1016/j.jhydrol.2020.124756
Article Google Scholar
Tabari H, Kisi O, Ezani A, Talaee PH (2012) SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. J Hydrol 444:78–89. https://doi.org/10.1016/j.jhydrol.2012.04.007
Article Google Scholar
Teuling AJ, Hirschi M, Ohmura A, Wild M, Reichstein M, Ciais P, Buchmann N, Ammann C, Montagnani L, Richardson AD, Wohlfahrt G (2009) A regional perspective on trends in continental evaporation. Geophys Res Lett 36(2). https://doi.org/10.1029/2008GL036584
Üneş F, Kaya YZ, Mamak M (2020) Daily reference evapotranspiration prediction based on climatic conditions applying different data mining techniques and empirical equations. Theor Appl Climatol 141:763–773. https://doi.org/10.1007/s00704-020-03225-0
Article Google Scholar
Valiantzas JD (2013a) Simple ET 0 forms of Penman’s equation without wind and/or humidity data. I: Theoretical development. J Irrig Drain Eng 139(1):1–8. https://doi.org/10.1061/(ASCE)IR.1943-4774.0000520
Article Google Scholar
Valiantzas JD (2013b) Simple ET 0 forms of Penman’s equation without wind and/or humidity data. II: Comparisons with reduced set-FAO and other methodologies. J Irrig Drain Eng 139(1):9–19. https://doi.org/10.1061/(ASCE)IR.1943-4774.0000502
Wang S, Lian J, Peng Y, Hu B, Chen H (2019) Generalized reference evapotranspiration models with limited climatic data based on random forest and gene expression programming in Guangxi, China. Agric Water Manag 221:220–230. https://doi.org/10.1016/j.agwat.2019.03.027
Article Google Scholar
Wen X, Si J, He Z, Wu J, Shao H, Yu H (2015) Support-vector-machine-based models for modeling daily reference evapotranspiration with limited climatic data in extreme arid regions. Water Resour Manag 29:3195–3209. https://doi.org/10.1007/s11269-015-0990-2
Article Google Scholar
WMO (1963) Sites for wind-power installations; WMO No. 156, Technical Note No. 63. WMO, Geneva, Switzerland. https://library.wmo.int/index.php?lvl=notice_display&id=5475#.YjfmEHpBzDc. Accessed 8 Feb 2022
Wu L, Zhou H, Ma X, Fan J, Zhang F (2019) Daily reference evapotranspiration prediction based on hybridized extreme learning machine model with bio-inspired optimization algorithms: Application in contrasting climates of China. J Hydrol 577:123960. https://doi.org/10.1016/j.jhydrol.2019.123960
Article Google Scholar
Yan X, Yang N, Ao R, Mohammadian A, Liu J, Cao H, Yin P (2023) Deep learning for daily potential evapotranspiration using a HS-LSTM approach. Atmos Res 10:106856. https://doi.org/10.1016/j.atmosres.2023.106856
Article Google Scholar
Yunfei L, Dongwei G, Changjun Y (2023) Estimating the temporal and spatial variations in evapotranspiration with a nonlinear evaporation complementary relationship model in hyper-arid areas. Water Resour Manage 37:521–535. https://doi.org/10.1007/s11269-022-03384-x
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Water Engineering and Management, Tarbiat Modares University, Tehran, Iran
Shima Amani, Hossein Shafizadeh-Moghadam & Saeid Morid

Authors

Shima Amani
View author publications
You can also search for this author in PubMed Google Scholar
Hossein Shafizadeh-Moghadam
View author publications
You can also search for this author in PubMed Google Scholar
Saeid Morid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hossein Shafizadeh-Moghadam.

Ethics declarations

Ethical Approval

Not required as no animal/human was involved in the study.

Consent to Participate

Authors agree to participate any survey or feedback tasks.

Consent to Publish

Authors consent publication of the manuscript to the journal publisher.

Competing Interests

There is no competing interest in regard to the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 16 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Amani, S., Shafizadeh-Moghadam, H. & Morid, S. Utilizing Machine Learning Models with Limited Meteorological Data as Alternatives for the FAO-56PM Model in Estimating Reference Evapotranspiration. Water Resour Manage 38, 1921–1942 (2024). https://doi.org/10.1007/s11269-023-03670-2

Download citation

Received: 04 September 2023
Accepted: 13 November 2023
Published: 07 February 2024
Issue Date: April 2024
DOI: https://doi.org/10.1007/s11269-023-03670-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Utilizing Machine Learning Models with Limited Meteorological Data as Alternatives for the FAO-56PM Model in Estimating Reference Evapotranspiration

Abstract

Similar content being viewed by others

Estimation of crop evapotranspiration using statistical and machine learning techniques with limited meteorological data: a case study in Udham Singh Nagar, India

Machine learning models for daily net radiation prediction across different climatic zones of China

Revealing accuracy in climate dynamics: enhancing evapotranspiration estimation using advanced quantile regression and machine learning models

Explore related subjects

1 Introduction

2 Material and Methods

2.1 General Methodology

2.2 Study Area

2.3 Wet, Drought, and Normal Year Selection

2.4 ET Estimation Models

2.4.1 FAO56-PM

2.4.2 Experimental Methods

2.4.3 Machine Learning Models

Random Forest

Artificial Neural Networks

Support Vector Machine

Generalized Additive Model

2.4.4 Variable Selection for ET Estimation

2.5 Model Evaluation

3 Results and Discussion

3.1 Evaluation of Experimental Models

3.2 Variable Selection for Machine Learning Models

3.3 ML Models for ET Estimation

3.3.1 ML Models for ET Estimation Using the Same Input as FAO56-PM

3.3.2 ML Models for ET Estimation Using Different Input Combinations

3.3.3 ML Models for ET Estimation Using the Same Input as Experimental Models

3.4 Variable Importance in ML Models for ET Estimation

4 Conclusion

Data Availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical Approval

Consent to Participate

Consent to Publish

Competing Interests

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 16 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation