Comparison of different empirical methods and data-driven models for estimating reference evapotranspiration in semi-arid Central Anatolian Region of Turkey

Yurtseven, Ibrahim; Serengil, Yusuf

doi:10.1007/s12517-021-08150-8


Original Paper
Published: 19 September 2021

Volume 14, article number 2033, (2021)

Arabian Journal of Geosciences


Abstract

Evapotranspiration (ET) is a major hydrologic process for assessing water budgets in terrestrial ecosystems. Since measuring ET over large areas may involve labor-intensive field techniques, estimation is preferred in most cases. The FAO Penman-Monteith (PM FAO-56) is a widely recognized reference evapotranspiration (ET_o) method for potential evapotranspiration calculations. The method requires a detailed and comprehensive meteorological data set; however, some empirical methods and models have attempted to calculate ET with less data. In this study, Makkink (ET_Mak), Hargreaves–Samani (ET_Har), Thornthwaite (ET_Thor), Blaney–Criddle (ET_BC), and Penman (ET_PM) were tested against the PM FAO-56. The Penman method achieved the highest accuracy among the empirical methods. In addition, the potential of artificial neural networks (ANN), support vector machines (SVM), random forest (RF), and multiple linear regression (MLR) for estimating ET_o was investigated in the semi-arid Central Anatolian Region of Turkey. The ANN (based on multi-layer perceptron, MLP) and SVM models performed better than the other tested data-driven models and empirical methods. These models could be used most effectively at an elevation range of 850–1000 m. According to our results, the MLP, SVM, and Penman methods performed well in semi-arid regions for agricultural planning and water resources management studies. Furthermore, we concluded that integrating maximum temperature may improve accuracy in ET model simulations in semi-arid regions.


Introduction

Evapotranspiration (ET) plays a key role in water resources management, agriculture, drought, climate change adaptation, and ecosystem productivity (Currie 1991). There are various methods/models to estimate potential evapotranspiration but most of them give precise outputs for specific climate zones (Lu et al. 2005). ET_p calculated under certain properties can be regarded as the reference crop ET (ET_C). ET_c is usually estimated from reference evapotranspiration (ET_o), crop, and soil coefficients. FAO and working group of the International Commission on Irrigation and Drainage recommended standardized Penman-Monteith reference evapotranspiration (ET_o) as the potential evapotranspiration for short grass or a tall reference crop (alfalfa) (Allen et al. 1998). This hypothetical evapotranspiration considers a reference surface with an assumed crop height of 0.12 m, a fixed surface resistance of 70 s/m, and an albedo of 0.23; and the reference surface closely resembling an extensive surface of green grass of uniform height, actively growing, well-watered, and completely shading the ground. The ET_o calculation is an important issue for computing crop irrigation water requirements in agriculture. The practical value of pan evaporation with empirical coefficients (relating ET_o) has been widely used for 10 days or longer periods (Allen et al. 1998). Furthermore, many empirical or physically based equations have been developed and used to estimate ET_o under the climate regime of the country they were developed. How to choose the appropriate model to estimate ET_o among many evapotranspiration calculations is generally a major problem, and method selection under the climatic conditions of the research area is highly subjective unless certain techniques are used. 
Generally, empirical ET_o methods can be categorized under six groups: (1) combination (e.g., Shuttleworth); (2) radiation (e.g., Turc, Priestley and Taylor, Makkink, Abtew); (3) temperature (e.g., Blaney and Criddle, Hargreaves–Samani, Thornthwaite, Hamon); (4) mass-transfer based (e.g., Penman, Dalton); (5) water budget methods (e.g., Guitjens); and (6) pan evaporation methods (e.g., Allen et al. 1998). Many studies have evaluated the reliability of these alternative empirical ET_o methods for cases where calculated ET_o data are lacking, taking the United Nations Food and Agriculture Organization (FAO) Penman–Monteith (PM FAO-56) as the standard method (Table 1). These studies addressed the effectiveness, improvement, and performance of PM FAO-56 at regional and global scales. The performance of these ET_o models has also been evaluated under different climate conditions and land cover. Differences in assumptions and inputs are the main reasons the methods give different results (Maes et al. 2019).

Table 1 Summary of various methods or model adopted by different authors for ET_o estimation


In recent years, interest has grown in testing models for non-linear relationships. Statistical tests have been proposed in many studies to help analysts check for the presence of non-linearities in an observed time series. Another alternative to ET_o estimation is the application of data-driven models. Recently, machine learning models generated simpler equations and require fewer inputs than the PM FAO-56 method. Thus, they are potentially good alternatives in ET_o calculation. As shown by numerous studies, machine-learning approaches such as Artificial Neural Networks (ANN) have been successfully applied in ET_o research (Zanetti et al. 2007; Traore et al. 2010; Käfer et al. 2020). ANN, which has a nonlinear mathematical structure, trains from the strength of correlation between input and simulated variables by checking previous trends (Yurtseven and Zengin 2013).

Sudheer et al. (2003) used a radial basis function (RBF) network to simulate crop evapotranspiration (ET_c) for rice crops. The simulated data were compared with lysimetric data. The results clearly showed that RBF performed well (modeling efficiency of 98.2–99.0%) in ET_o estimation. Trajkovic et al. (2003) used an RBF type of ANN and found that the ANN gives accurate ET₀ estimates. Hashemi and Sepaskhah (2020) also reported the superiority of the multi-layer perceptron with sunshine hours and wind speed and of the radial basis function with sunshine hours. Zanetti et al. (2007) used the multilayer perceptron to estimate ET_o from only the maximum and minimum air temperatures in Brazil.

Machine learning approaches using support vector machine (SVM) have also been described and evaluated by many studies (Wen et al. 2015; Chia et al. 2020; Seifi and Riahi 2020). SVM, which is a useful estimator for practical applications, has the ability to provide a powerful algorithm between dependent and independent variables. This algorithm uses robust mathematical equations between dependent and independent variables to solve complex problems (Vapnik 1995). SVM has been a preferred approach as it adopts a global optimum rather than a local optimum compared to ANN method, and is less prone to overfitting than the ANN method. However, SVM models for estimating ET_o had limited applications compared to ANN models. Wen et al. (2015) developed SVM models for ET_o estimation and compared it with ANN model and three empirical models including Priestley-Taylor, Hargreaves, and Ritchie. The study showed that SVM showed relatively superior performance to ANN and empirical equations in modeling ET_o.

In recent years, the random forest (RF) model, which is an ensemble learning method for classification and regression, has become popular due to some of its advantages such as satisfactory performance, ability of preventing overfitting, and user-defined parameter selection in both classification and regression problems (Feng et al. 2017). The relative importance of variables can also be determined by this method. Wang et al. (2019) conducted a competitive analysis using different model-based approaches (random forest, gene-expression programming) on daily climatic data from the 24 meteorological stations recorded from 2010 to 2014 and concluded that random forest-based ET_o models performed slightly better than the gene expression-based models.

Multiple linear regression is a conventional model for estimating the value of one dependent variable based on two or more independent variables with linear relationship (Tabari et al. 2012). Many researchers have attempted to estimate the evaporation values from climatic variables with MLR. Yirga (2019) reported the performance of MLR in ET_o estimation. This research stated that the model is successfully employed for the estimation of the monthly reference evapotranspiration. da Silva et al. (2016) emphasized that models can be regarded as an alternative method to estimate the ET_o when the climatic variables are insufficient for other methods.

In this study, we tested some of the recent approaches in ET_o estimation. Our study area was the Central Anatolian Region, which is the driest region in Turkey. Drought has become an important and prominent phenomenon in Turkey, and in particular semi-humid (semi-dry) drought classes have shifted to semi-dry (dry) conditions in the Central Anatolian Region. Besides, the Central Anatolian Region has strong potential for growing and marketing cereals (wheat, barley, oat, etc.). In recent years, less rain and more ET have led to crop failure and economic losses in the region. Spatial variability of precipitation regimes influenced by the topography has been studied before (Türkeş and Tatlı 2011; Schemmel et al. 2013). The complex biotic and abiotic environment in different elevation zones makes it difficult to measure and estimate ET directly or indirectly. Because the climatic composition differs with elevation, the variation of ET_o across elevations is quite complex. Elevation has a manifold effect on ET_o at different locations since the dynamics of climatic parameters also differ with altitude. For example, relative humidity is one of the most relevant meteorological factors in ET_o measurement, and it is inversely related to elevation. Moisture availability, which is affected by relative humidity and absolute vapor pressure, decreases with elevation (Duane et al. 2008). Therefore, elevation dominates the climatic parameters that affect ET_o along elevation gradients. Furthermore, the spatial variation in ET_o is also affected by the R_S received by the surface (Vicente-Serrano et al. 2007). Ma et al. (2019) reported that available energy (shortwave radiation and air temperature), which increased with elevation, is a more influential factor than water vapor. Sun et al. (2020) found that net R_S, leaf area index, and air temperature have a strong relationship with ET_o in mountainous regions. Wang et al. (2020) emphasized that the FAO-Penman Monteith (PM) and Hargreaves-Samani (HS) methods perform well as ET_o estimation methods in high elevation zones. Understanding the topographic characteristics that control ET_o and its variability in Central Anatolia, especially elevation, is one of the scientific gaps in the climate research of Turkey. Furthermore, the interaction between elevation and ET_o in dry regions of Turkey is poorly characterized despite its obvious practical importance.

One of the main objectives of this study is to determine possible variations in ET_o at different elevations and the performance of selected methods/models that can be used in estimation of ET_o.

Other objectives were the following:

To calculate ET_o using six different empirical methods (FAO-56 Penman-Monteith method-ET_o, Hargreaves-ET_Har, Penman-ET_PM, Makkink-ET_Mak, Thornthwaite-ET_Thor, and FAO-Blaney-Criddle-ET_BC) and to compare the latter five methods with the FAO-56 Penman-Monteith method (ET_o) for the regional average of 45 meteorological stations (representing the Central Anatolian Region as a whole) and for four different elevation groups (650–850 m-G1, 850–1100 m-G2, 1100–1350 m-G3, and 1350–1600 m-G4).
To investigate the accuracy of data-driven models, namely two artificial neural network (ANN) techniques (multi-layer perceptrons, MLPs, and radial basis neural networks, RBNNs), support vector machine (SVM), random forest (RF), and multiple linear regression (MLR), in estimating long-term monthly ET_o using data from the same 45 stations in the Central Anatolian Region of Turkey.
To evaluate statistically the outputs of all ET_o approaches and the climatic parameters used in the assessment.

Material and method

Study work-flow

The study work-flow (Fig. 1) presents the research steps. The methodology essentially examines whether different PET methods and the ANN (MLP and RBF), SVM, RF, and MLR models can serve as alternatives to the FAO-PM (ET_o). The flowchart illustrates the primary structure of the approach, involving three main parts: calculating ET_o with five simple empirical ET_o methods, calculating it with the PM FAO-56 method, and generating alternative ET_o estimates using data-driven models (ANN, SVM, RF, and MLR). The study was carried out in two steps. In the first step, termed the “regional average,” a data set was prepared by averaging the climate data of the 45 meteorological stations used in all analyses. At this stage, the 45 meteorological stations were evaluated as one station representing the entire Central Anatolian Region of Turkey. In the second step, termed the “elevation group,” the data were grouped according to the elevation of the meteorological stations. Thus, the dataset was divided along four elevation gradients (650–850 m, 850–1100 m, 1100–1350 m, 1350–1600 m) using the elevations of the 45 meteorological stations. All analyses were carried out separately for both data formations. The conceptual background of the study consists of two main parts. First, ET_o was calculated with the equations of different researchers using the unnormalized climate data for comparison with ET_o (PM FAO–56). Second, the climate data were used in alternative data-driven models for ET_o calculation. Performance evaluation was used to determine the appropriate method or model for estimating ET_o for the regional average and the grouped data. To this end, the coefficient of determination (R²), mean absolute deviation (MAD), Nash–Sutcliffe efficiency (NSE), the index of agreement (d), and percent bias (PBIAS) were used to identify the best method among the empirical methods and data-driven ET_o models.
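The five performance measures named above can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' code: the function names are hypothetical, R² is taken as the squared Pearson correlation, d follows Willmott's index of agreement, and PBIAS is assumed to use the convention 100·Σ(obs − sim)/Σobs, so positive values indicate underestimation. The paper does not state which conventions it applied.

```python
def r_squared(obs, sim):
    """Coefficient of determination, taken here as squared Pearson correlation."""
    n = len(obs)
    mo, ms = sum(obs) / n, sum(sim) / n
    cov = sum((o - mo) * (s - ms) for o, s in zip(obs, sim))
    var_o = sum((o - mo) ** 2 for o in obs)
    var_s = sum((s - ms) ** 2 for s in sim)
    return cov ** 2 / (var_o * var_s)

def mad(obs, sim):
    """Mean absolute deviation between reference and estimated values."""
    return sum(abs(o - s) for o, s in zip(obs, sim)) / len(obs)

def nse(obs, sim):
    """Nash-Sutcliffe efficiency (1 is a perfect fit)."""
    mo = sum(obs) / len(obs)
    num = sum((o - s) ** 2 for o, s in zip(obs, sim))
    den = sum((o - mo) ** 2 for o in obs)
    return 1 - num / den

def index_of_agreement(obs, sim):
    """Willmott's index of agreement d (assumed form)."""
    mo = sum(obs) / len(obs)
    num = sum((o - s) ** 2 for o, s in zip(obs, sim))
    den = sum((abs(s - mo) + abs(o - mo)) ** 2 for o, s in zip(obs, sim))
    return 1 - num / den

def pbias(obs, sim):
    """Percent bias; sign convention is an assumption (positive = underestimate)."""
    return 100 * sum(o - s for o, s in zip(obs, sim)) / sum(obs)
```

Here `obs` would hold the PM FAO-56 ET_o series and `sim` the series from the empirical method or data-driven model under test.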

Study area and data acquisition

According to multiple-year local assessments, Turkey is classified under seven geographical and eight ecological regions (ecozones) (Serengil 2018). This research was conducted in the Central Anatolia geographical region and the Central Anatolia Steppe ecozone. Climatological data from 47 synoptic stations located in the Central Anatolian Region of Turkey were obtained from the Climate Forecast System Reanalysis (CFSR) global meteorological dataset. The CFSR dataset consists of hourly weather forecasts generated by the National Weather Service’s NCEP Global Forecast System. Studies have shown that CFSR data used in hydrological models provide satisfactory results (Fuka et al. 2014; Dile and Srinivasan 2014). All stations, with 35 years of monthly meteorological data, were selected for analysis. The data covered the time period between January 1979 and December 2013. The locations of the 47 stations are given in Fig. 2, and Table 2 shows some characteristics of these stations.

Table 2 Properties of 45 meteorological stations with long-term average climatic conditions. Elevation of stations, elevation group, annual average temperature (T_avg), annual total precipitation (P), annual total evapotranspiration (ET_O), UNEP aridity index (Middleton and Thomas 1997), and UNEP aridity index zone (Middleton and Thomas 1997)


The Central Anatolian Region of Turkey is a generally semiarid area according to the United Nations Environment Program (UNEP) aridity index (Middleton and Thomas 1997), with a size of about 151,000 km², representing 21% of the country. The region is located between 31° 21′ and 38° 07′ E longitude and 36° 59′ and 40° 55′ N latitude. It consists of plateaus with an average altitude of 1000 m and low precipitation, and it is bounded by the Bolu-Köroğlu Mountains to the north, the Sündiken and Uludağ Mountains to the west, the Toros Mountains to the south, and the Tecer Mountains to the east. As the region is surrounded by high mountains, the humid, mild sea air cannot easily penetrate into the region. Therefore, the region has a continental climate with hot and dry summers and cold and snowy winters. In the region, the continental effect increases with altitude, and winter temperatures reach extremely low values towards the east. The annual average temperature of the region is 10–11 °C (Table 2). Annual precipitation averages about 418 mm, and the actual amount is determined by elevation. The low precipitation in some areas of the region is not sufficient to satisfy the water needs of crops, especially during the summer months. In a dry period, it is thus necessary to irrigate the crops, while in average wet seasons, irrigation is not needed in agricultural areas. Low precipitation generally causes low productivity in agriculture. Drought necessitates fallow practice in grain agriculture. The natural vegetation is mostly composed of steppes since drought prevents forest growth.

In this study, the raw data were downloaded and then converted into a usable, readable form. The monthly values of maximum temperature (T_max), minimum temperature (T_min), average temperature (T_avg), precipitation (P), average wind speed (U), average (RH_avg), maximum (RH_max), and minimum relative humidity (RH_min), and average solar radiation (R_S) were obtained for 45 stations located in the Central Anatolian Region. There are two steps in the study. (1) The regional average of the Central Anatolian Region for each parameter was calculated by averaging the values obtained from the 45 stations. Therefore, the stations were not evaluated separately in this first step. (2) In the second step, the 45 climate stations were divided into four elevation groups as follows: 650–850 m considered as “low elevation group-G1,” 850–1100 m considered as “moderate elevation group-G2,” 1100–1350 m considered as “high elevation group-G3,” and 1350–1600 m considered as “very high elevation group-G4.” The results of the five literature-based equations (ET_Har, ET_PM, ET_Mak, ET_Thor, and ET_BC) and of the ANN (MLP and RBF), SVM, RF, and MLR models for the four elevation groups were subjected to performance evaluation against the target output ET_o. The objective was to compare the models for elevation groups located in different local climatic conditions.
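The two data formations (regional average and elevation groups) can be illustrated with a small sketch. The function names and the sample values are hypothetical, and the treatment of boundary elevations (lower bound inclusive, 1600 m assigned to G4) is an assumption, since the text does not say how stations exactly on a boundary were assigned.

```python
# Elevation bands from the text; boundary handling below is an assumption.
BANDS = [(650, 850, "G1"), (850, 1100, "G2"), (1100, 1350, "G3"), (1350, 1600, "G4")]

def elevation_group(elev_m):
    """Return the group label for a station elevation, or None if out of range."""
    for lo, hi, label in BANDS:
        if lo <= elev_m < hi:
            return label
    return "G4" if elev_m == 1600 else None

def regional_average(values):
    """Step 1: treat all stations as one by averaging a parameter across them."""
    return sum(values) / len(values)

def group_averages(stations):
    """Step 2: average a parameter per elevation group.

    `stations` is a list of (elevation_m, value) pairs.
    """
    sums, counts = {}, {}
    for elev, value in stations:
        g = elevation_group(elev)
        if g is None:
            continue  # station outside the 650-1600 m range
        sums[g] = sums.get(g, 0.0) + value
        counts[g] = counts.get(g, 0) + 1
    return {g: sums[g] / counts[g] for g in sums}
```

For instance, `group_averages([(700, 10.0), (800, 12.0), (900, 9.0)])` averages the first two stations under G1 and keeps the third under G2.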

Empirical ETo methods

The following ET methods have been chosen for the assessment:

(a)
FAO-56 Penman–Monteith method (ET_o): This method is considered the most precise method to estimate ET_o. The FAO Penman-Monteith method for calculating reference (potential) evapotranspiration ET_o can be expressed as (Allen et al. 1998) follows:

$$ \mathrm{ETo}=\frac{0.408\Delta \left({R}_n-G\right)+\gamma \frac{900}{T_a+273}{u}_2\left({e}_s-{e}_a\right)}{\Delta +\gamma \left(1+0.34{u}_2\right)} $$

(1)

where ET_o is the reference evapotranspiration (mm day⁻¹); Δ is the slope of the saturated vapor pressure curve (kPa °C⁻¹); R_n is the net radiation (MJ m⁻² day⁻¹); G is the soil heat flux density (MJ m⁻² day⁻¹), considered as null for daily estimates; T_a (written T in Eq. 6) is the daily mean air temperature (°C) at 2 m, based on the average of maximum and minimum temperatures; u₂ (written U₂ in Eq. 9) is the average wind speed at 2 m height (m s⁻¹); e_s is the saturation vapor pressure (kPa); e_a is the actual vapor pressure (kPa); (e_s − e_a) is the saturation vapor pressure deficit (Δe, kPa) at temperature T_a; and γ is the psychrometric constant (0.0677 kPa °C⁻¹).

The following equations were recommended by Allen et al. (1998) to estimate Rn:

$$ {R}_n={R}_{\mathrm{ns}}-{R}_{\mathrm{nl}} $$

(2)

$$ {R}_{\mathrm{ns}}=0.77{R}_s $$

(3)

$$ {R}_{\mathrm{nl}}=\sigma \left(\frac{T_{\max ,K}^4+T_{\min ,K}^4}{2}\right)\left(0.34-0.14\sqrt{e_a}\right)\left(1.35\frac{R_s}{R_{\mathrm{so}}}-0.35\right) $$

(4)

$$ {R}_{\mathrm{so}}=0.75{R}_a $$

(5)

where R_ns is the net shortwave radiation (MJ m⁻² day⁻¹); R_nl is the net longwave radiation (MJ m⁻² day⁻¹); R_s is the incoming solar radiation (MJ m⁻² day⁻¹); σ is the Stefan–Boltzmann constant (4.903 × 10⁻⁹ MJ K⁻⁴ m⁻² day⁻¹); T_max,K is the maximum temperature (K); T_min,K is the minimum temperature (K); R_s/R_so is the ratio between the incoming solar radiation and the clear-sky solar radiation R_so (MJ m⁻² day⁻¹), which is less than or equal to 1; and R_a is the extraterrestrial solar radiation (MJ m⁻² day⁻¹). The other parameters of the ET_o equation were determined as follows:

$$ \varDelta =\frac{4098\left[0.6108\exp \left(17.27T/\left(T+237.3\right)\right)\right]}{{\left(T+237.3\right)}^2} $$

(6)

$$ {e}_s=\frac{\left[0.6108\exp \left(\frac{17.27{T}_{\max ,c}}{T_{\max ,c}+237.3}\right)\right]+\left[0.6108\exp \left(\frac{17.27{T}_{\min ,c}}{T_{\min ,c}+237.3}\right)\right]}{2} $$

(7)

$$ {e}_a=\frac{\mathrm{RH}}{100}{e}_s $$

(8)

where Tmax_c is the maximum temperature (°C); Tmin_c is the minimum temperature (°C); and RH is the mean daily relative humidity, calculated from maximum and minimum values.

The following equation, based on a logarithmic wind speed profile, was used to convert wind speed data obtained at a height of 10 m to the standard height of 2 m.

$$ {U}_2={U}_Z\left[\frac{4.87}{\ln \left(67.8z-5.42\right)}\right] $$

(9)

where z is the height of the wind speed measurement (=10 m).
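Equations 1 to 9 chain together naturally, and a compact sketch may make the order of the calculation easier to follow. The function and argument names are hypothetical. The sketch assumes a Kelvin offset of 273.16, clamps R_s/R_so at 1 as the text requires, sets G to zero, and uses the fixed psychrometric constant quoted above; it is an illustration of the equations as printed, not the authors' implementation.

```python
import math

SIGMA = 4.903e-9  # Stefan-Boltzmann constant, MJ K^-4 m^-2 day^-1
GAMMA = 0.0677    # psychrometric constant quoted in the text, kPa/degC

def sat_vp(t_c):
    """Saturation vapor pressure (kPa) at temperature t_c (degC)."""
    return 0.6108 * math.exp(17.27 * t_c / (t_c + 237.3))

def wind_2m(u_z, z=10.0):
    """Eq. 9: convert wind speed measured at height z (m) to 2 m."""
    return u_z * 4.87 / math.log(67.8 * z - 5.42)

def eto_pm(tmax, tmin, rh_mean, rs, ra, u10, g=0.0):
    """FAO-56 Penman-Monteith ETo (mm/day) following Eqs. 1-9.

    tmax, tmin in degC; rh_mean in %; rs, ra in MJ m^-2 day^-1; u10 in m/s.
    """
    t = (tmax + tmin) / 2.0
    delta = 4098.0 * sat_vp(t) / (t + 237.3) ** 2           # Eq. 6
    es = (sat_vp(tmax) + sat_vp(tmin)) / 2.0                # Eq. 7
    ea = rh_mean / 100.0 * es                               # Eq. 8
    rso = 0.75 * ra                                         # Eq. 5
    rns = 0.77 * rs                                         # Eq. 3
    tk4 = ((tmax + 273.16) ** 4 + (tmin + 273.16) ** 4) / 2.0
    ratio = min(rs / rso, 1.0)                              # ratio capped at 1
    rnl = SIGMA * tk4 * (0.34 - 0.14 * math.sqrt(ea)) * (1.35 * ratio - 0.35)  # Eq. 4
    rn = rns - rnl                                          # Eq. 2
    u2 = wind_2m(u10)
    num = 0.408 * delta * (rn - g) + GAMMA * 900.0 / (t + 273.0) * u2 * (es - ea)
    den = delta + GAMMA * (1.0 + 0.34 * u2)
    return num / den                                        # Eq. 1
```

A call such as `eto_pm(tmax=30, tmin=15, rh_mean=50, rs=25, ra=40, u10=3)` uses made-up inputs purely to exercise the function.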

(b)
Hargreaves method (ET_Har): The Hargreaves method (Hargreaves and Samani 1985), a temperature-based equation, estimates ET_o (mm d⁻¹) using only the maximum and minimum temperatures together with extraterrestrial radiation, and is expressed by Eq. 10:

$$ \mathrm{ETo}={C}_0{R}_a{\left({T}_{\max ,c}-{T}_{\min ,c}\right)}^{0.5}\left(T+17.8\right) $$

(10)

where R_a is the extraterrestrial solar radiation, in mm day⁻¹; and C_0 is the conversion parameter (=0.0023).
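As a worked illustration of Eq. 10, the sketch below evaluates the Hargreaves and Samani (1985) form, which uses the diurnal temperature range (maximum minus minimum temperature). The function and argument names are hypothetical, and the mean temperature is assumed to be the average of the maximum and minimum.

```python
def eto_hargreaves(tmax, tmin, ra_mm, c0=0.0023):
    """Hargreaves ETo (mm/day).

    tmax, tmin in degC; ra_mm is extraterrestrial radiation expressed in
    mm/day of equivalent evaporation; c0 is the conversion parameter.
    """
    t_mean = (tmax + tmin) / 2.0          # assumed definition of T in Eq. 10
    t_range = max(tmax - tmin, 0.0)       # guard against inconsistent inputs
    return c0 * ra_mm * t_range ** 0.5 * (t_mean + 17.8)
```

With the illustrative inputs `tmax=30`, `tmin=15`, and `ra_mm=16`, the function returns about 5.74 mm day⁻¹.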

(c)
Penman method (ET_PM): The Penman method remains in use as a mass-transfer-based method for estimating free water surface evaporation E because of its simplicity and reasonable accuracy. Penman (1948) proposed the following equation.

$$ \mathrm{ETo}=0.35\left(1+\frac{0.98}{100}{U}_2\right)\left({e}_s-{e}_a\right) $$

(11)

where U₂ is the wind speed at 2 m height in miles day⁻¹; e_s is the saturation vapor pressure at the temperature of the water surface; and e_a is the actual vapor pressure in the air.

(d)
Makkink method (ET_Mak): For estimating potential evapotranspiration (mm d⁻¹), Makkink (1957) proposed the following equation.

$$ \mathrm{ETo}=0.61\frac{\Delta}{\Delta +\gamma}\frac{R_s}{\lambda }-0.12 $$

(12)

where R_s is the total solar radiation (cal cm⁻² day⁻¹); Δ is the slope of the saturation vapor pressure curve (mb/°C); γ is the psychrometric constant (mb/°C); λ is the latent heat (calories per gram); and P is the atmospheric pressure (millibar).

(e)
Thornthwaite method (ET_Thor): The Thornthwaite method, a temperature-based method for calculating PET, can be expressed as (Thornthwaite 1948):

$$ \mathrm{ETo}=\left\{\begin{array}{c}\ 0\kern0.5em ,\kern7.00em {\mathrm{T}}_{\mathrm{avg}}<0{}^{\circ}\mathrm{C}\\ {}\kern1.25em 16{\left(\frac{10\ {T}_{\mathrm{avg}}}{I}\right)}^a,\kern0.75em 0{}^{\circ}\mathrm{C}\le {\mathrm{T}}_{\mathrm{avg}}\le 26.5{}^{\circ}\mathrm{C}\\ {}-0.43{T_{\mathrm{avg}}}^2+32.24{T}_{\mathrm{avg}}-415.85\kern0.75em ,\kern0.75em {\mathrm{T}}_{\mathrm{avg}}>26.5{}^{\circ}\mathrm{C}\ \end{array}\right. $$

(13)

$$ I=\sum \limits_{k=1}^{12}{\left(0.2{T}_k\right)}^{1.514} $$

(14)

$$ a=0.000000675{I}^3-0.0000771{I}^2+0.01792I+0.49239 $$

(15)

where ETo = reference evapotranspiration estimated by Thornthwaite equation (mm month⁻¹), T_avg = mean monthly air temperature (°C), I = thermal index imposed by the local normal climatic temperature regime, and a = exponent being a function of I. The value of a varies from 0 to 4.25, while the thermal index I varies from 0 to 160.
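The piecewise form of Eqs. 13 to 15 can be sketched as follows. The function name is hypothetical. The sketch assumes that months with a mean temperature at or below 0 °C contribute nothing to the thermal index I (a common convention that the text does not state), and it returns the unadjusted monthly value without any day-length correction.

```python
def thornthwaite(monthly_t):
    """Monthly Thornthwaite estimates (mm/month) from 12 mean temperatures (degC)."""
    # Eq. 14: thermal index; non-positive months are skipped (assumption).
    heat_index = sum((0.2 * t) ** 1.514 for t in monthly_t if t > 0)
    # Eq. 15: exponent a as a cubic function of I.
    a = (6.75e-7 * heat_index ** 3 - 7.71e-5 * heat_index ** 2
         + 0.01792 * heat_index + 0.49239)
    result = []
    for t in monthly_t:
        if t <= 0:
            result.append(0.0)            # first branch of Eq. 13 (0 at T = 0 as well)
        elif t <= 26.5:
            result.append(16.0 * (10.0 * t / heat_index) ** a)
        else:
            result.append(-0.43 * t ** 2 + 32.24 * t - 415.85)
    return result
```

Calling `thornthwaite([10.0] * 12)` gives the same value for every month, since both I and the monthly temperature are constant in that artificial case.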

(f)
FAO Blaney–Criddle method (ET_BC): The Blaney–Criddle equation (BC) is simpler than the other empirical methods and uses only air temperature as input data. The equation calculates evapotranspiration for a “reference crop,” an actively growing green grass 8–15 cm high. Blaney and Criddle (1950) proposed a very simplified temperature-based equation.

$$ \mathrm{ETo}=\mathrm{kp}\ \left(0.46{T}_a+8.13\right) $$

(16)

where ETo = potential evapotranspiration from a reference crop, in mm, for the period in which p is expressed; T_a = mean temperature in °C; p = percentage of total daytime hours for the used period (daily or monthly) out of total daytime hours of the year (365 × 12); k = monthly consumptive use coefficient, depending on vegetation type, location and season, and for the growing season (May to October); k varies from 0.5 for orange tree to 1.2 for dense natural vegetation.
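Equation 16 is simple enough to check by hand, and a one-function sketch shows how the terms combine. The function name and the sample values of p and k are hypothetical; in practice p comes from daylight tables for the station latitude and k from the crop and season.

```python
def eto_blaney_criddle(t_mean, p, k=1.0):
    """Blaney-Criddle estimate, in mm for the period in which p is expressed.

    t_mean: mean temperature (degC); p: percentage of annual daytime hours
    for the period; k: consumptive use coefficient.
    """
    return k * p * (0.46 * t_mean + 8.13)
```

For example, with `t_mean=20`, `p=0.27`, and `k=1` the bracket evaluates to 17.33 and the result is about 4.68 mm.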

ANN method

Soft computing methods such as artificial neural networks (ANN) have been successfully employed to develop estimation models. An ANN is an information processing system that consists of three main layers: input, hidden, and output. Information passes through these layers via a series of parallel processing elements called neurons. Each neuron receives weighted inputs, either from the input variables or from the outputs of other neurons, adds a bias term, and passes the result through an activation function. Training consists of adjusting the synaptic weights. The basis of this process can be formulated in the following equations.

$$ {I}_j=\sum \limits_{i=1}^n{w}_{ij}{x}_i+{b}_j $$

(17)

$$ {y}_j=f\left({I}_j\right) $$

(18)

$$ f(x)=\frac{e^x-{e}^{-x}}{e^x+{e}^{-x}} $$

(19)

where I_j is the activation value of neuron j; w_ij is the weight between the ith input and neuron j; x_i is the ith input value; b_j is the bias term of neuron j; y_j is the output of neuron j; and f(x) is the activation function (here the hyperbolic tangent).
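The neuron computation in Eqs. 17 to 19 can be written out directly. In the sketch below the function names are hypothetical, one bias is used per neuron, and the output node is assumed to be linear, which the text does not specify.

```python
import math

def neuron_output(x, w, b):
    """Eqs. 17-19: weighted sum of inputs plus bias, passed through tanh."""
    i_j = sum(wi * xi for wi, xi in zip(w, x)) + b
    return math.tanh(i_j)

def mlp_forward(x, hidden_w, hidden_b, out_w, out_b):
    """Forward pass of a single-hidden-layer network such as a 3-5-1 MLP.

    hidden_w holds one weight list per hidden neuron; the output node is
    taken to be linear (an assumption).
    """
    hidden = [neuron_output(x, w, b) for w, b in zip(hidden_w, hidden_b)]
    return sum(k * h for k, h in zip(out_w, hidden)) + out_b
```

For a 3-5-1 structure, `hidden_w` would contain five lists of three weights each and `out_w` five output weights.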

In ANN and MLR, the variables in the dataset were normalized to increase model performance. Min-max feature scaling (unity-based normalization) was used to bring all values into the range between 0 and 1. The general form of the normalization used in this study is presented in Eq. 20:

$$ {X}_{\mathrm{norm}}=\frac{x-{x}_{\mathrm{min}}}{x_{\mathrm{max}}-{x}_{\mathrm{min}}} $$

(20)
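A minimal sketch of Eq. 20 follows. The function name is hypothetical, and returning zeros for a constant series is an assumption made only to avoid division by zero.

```python
def min_max_normalize(values):
    """Eq. 20: rescale a series so its minimum maps to 0 and its maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]      # constant series: assumed convention
    return [(v - lo) / (hi - lo) for v in values]
```

For example, `min_max_normalize([10, 20, 30])` returns `[0.0, 0.5, 1.0]`.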

In this study, ANNs of the multi-layer perceptron (MLP) and radial basis function (RBF) types were employed. The back propagation learning algorithm was used in the MLP training process. An MLP consists of at least three layers of nodes: an input layer, a hidden layer, and an output layer. Figure 3 represents a three-layer MLP structure. Each neuron, except those in the input layer, uses a nonlinear activation function. The nodes are fully connected in MLP: weights w_ij connect the input layer to the hidden layer, and weights K_j connect the hidden layer to the output layer.

Radial basis functions (RBF) compute a distance criterion with respect to a center, and the algorithm is constructed accordingly. Figure 4 represents an RBF structure consisting of three layers: (1) input layer, (2) hidden layer, and (3) output layer. The general construction is similar to an MLP, but there are some differences between MLP and RBF. The most characteristic feature of the RBF network is the activation function (H_p(x) as the network's neuron) in the hidden layer, which uses the Gaussian bell function, the most widely used RBF function (Fig. 4). This function calculates the distance between the center of each hidden-layer neuron and the input vector. The final output is obtained by summing the products of the hidden-layer activations and their weights. The activation function therefore describes how each unit responds to the total input.
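To make the distance-based activation concrete, the sketch below evaluates a Gaussian hidden unit and a weighted-sum output. The function names are hypothetical, and the Gaussian is parameterized as exp(−‖x − c‖² / (2σ²)), which is one common form; the text does not give the exact expression used.

```python
import math

def gaussian_rbf(x, center, width):
    """Gaussian bell response of one hidden unit to input vector x."""
    d2 = sum((xi - ci) ** 2 for xi, ci in zip(x, center))
    return math.exp(-d2 / (2.0 * width ** 2))

def rbf_network(x, centers, widths, weights, bias=0.0):
    """Output of an RBF network: weighted sum of hidden-unit activations."""
    return bias + sum(
        w * gaussian_rbf(x, c, s) for w, c, s in zip(weights, centers, widths)
    )
```

The activation equals 1 when the input coincides with the center and decays towards 0 as the distance grows.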

We selected the input-layer parameters according to their correlation with the reference evapotranspiration (Table 2). The monthly climate variables T_max, RH_avg, and R_S were used in the input layer. The optimum number of hidden-layer nodes was obtained by trying different hidden-layer structures until the errors were minimized. The optimum iteration number of the ANN networks was also determined by trial. Training of the ANN models was stopped at 250 iterations because the mean square error between the observed and estimated values decreased with increasing iteration numbers only up to this number of iterations. The learning process of the MLP and RBF was carried out with daily data series extracted from the 45 selected locations between January 1979 and January 2004 (70% of the whole data set). The data series from January 2004 to July 2014 (30% of the whole data set) were used for testing. The hyperbolic tangent and SoftMax activation functions were used for the hidden nodes of the MLP and RBF models, respectively. The network structures 3-5-1 for MLP and 3-9-1 for RBF gave the best results. Here, 3-5-1 denotes an MLP model comprising 3 input nodes, 5 hidden nodes, and 1 output node.

MLR method

Regression analysis is a statistical tool that can be regarded as the process of fitting a model to data. In a linear regression model, linear functions are used to describe the relationship in the data, and the model parameters are estimated from the data. MLR uses several (two or more) explanatory (independent) variables to estimate the outcome of a response (dependent) variable by fitting a linear equation. Each set of values of the independent variables x is associated with a value of the dependent variable y in MLR analysis. A typical MLR model is expressed as in Eq. 21 below:

$$ \hat{Y}={a}_0+\sum \limits_{j=1}^m{a}_j{X}_j $$

(21)

where Ŷ is the model’s output, X_j (from X₁ to X_m) is the independent input variables to the model, and a_j (from a₀ to a_m) is partial regression coefficients. The magnitude of each regression coefficient (a_j) in MLR model shows explanatory power of relationship between dependent and independent variables.
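A small, dependency-free sketch of fitting Eq. 21 by ordinary least squares is given below. The function names are hypothetical, and solving the normal equations by Gaussian elimination is chosen here only for self-containment; the paper does not describe its fitting routine, and a statistics package would normally be used instead.

```python
def fit_mlr(X, y):
    """Least-squares coefficients [a0, a1, ..., am] for Eq. 21."""
    rows = [[1.0] + list(r) for r in X]           # prepend intercept column
    p = len(rows[0])
    # Normal equations: (X'X) a = X'y
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    c = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    # Gaussian elimination with partial pivoting
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        c[col], c[pivot] = c[pivot], c[col]
        for r in range(col + 1, p):
            f = A[r][col] / A[col][col]
            for k in range(col, p):
                A[r][k] -= f * A[col][k]
            c[r] -= f * c[col]
    # Back substitution
    coef = [0.0] * p
    for i in range(p - 1, -1, -1):
        s = sum(A[i][k] * coef[k] for k in range(i + 1, p))
        coef[i] = (c[i] - s) / A[i][i]
    return coef

def predict_mlr(coef, x):
    """Evaluate Eq. 21 for one input vector x."""
    return coef[0] + sum(a * xj for a, xj in zip(coef[1:], x))
```

In the setting of this study, each row of `X` would hold the predictor values for one record and `y` the corresponding PM FAO-56 ET_o.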

SVM method

Support vector machine (SVM) is a well-known machine-learning method based on classification and regression analysis theory introduced by Vapnik (1995). The SVM network architecture is generated automatically from the optimal support vectors, whereas ANN architecture generally involves manual trial-and-error procedures. Kernel functions, namely linear, sigmoid, polynomial, radial basis function, and multi-layer perceptron, are successful in explaining complex data sets. In this study, a linear kernel function was used in the SVM models. The kernel function is similar to a two-layer perceptron model of the neural network. Unlike in a standard neural network, the weights of the network are found by solving a quadratic programming problem with linear constraints. In the general architecture of the SVM (Fig. 5), the hidden nodes correspond to the support vectors (SVs), and the final output is obtained by combining them with the weights of the SVM network.

The relationship between a dependent variable (y) and a set of independent variables (x) is determined by f(x) in SVM for regression, according to the following equation:

$$ f(x)=\sum \limits_{n=1}^N{\overline{a}}_n\,K\left(x,{x}_n\right)+b $$

(22)

where $ {\overline{a}}_n $ are the Lagrange multipliers, N is the number of input vectors, b is a bias term, and K(x, x_n) is the kernel function, which is based upon reproducing kernel Hilbert spaces. In this study, the input vectors (x_n) refer to the daily records of T_max, RH_avg, and R_S, while the target value (y) refers to the ET₀ values calculated using the FAO-56 PM. The SVM (100, 10) model has a regularization constant of 100 and an RBF kernel width of 10.
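The prediction step of Eq. 22 can be sketched as a weighted sum of kernel evaluations plus a bias. The example below assumes a linear kernel, as stated in the text; the support vectors, multipliers, and bias are made-up numbers chosen so the arithmetic is easy to follow, and the function names are hypothetical. Finding the multipliers themselves requires solving the quadratic programming problem, which is not shown.

```python
def linear_kernel(u, v):
    """Linear kernel K(u, v) = u . v."""
    return sum(ui * vi for ui, vi in zip(u, v))

def svr_predict(x, support_vectors, multipliers, bias, kernel=linear_kernel):
    """Evaluate f(x) = sum_n a_n * K(x, x_n) + b for one input vector x."""
    return sum(a * kernel(x, sv) for a, sv in zip(multipliers, support_vectors)) + bias

# Illustrative values only (not fitted to any data).
support_vectors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
multipliers = [2.0, -1.0]
bias = 0.5

# 2.0 * 3.0 + (-1.0) * 4.0 + 0.5
value = svr_predict([3.0, 4.0, 5.0], support_vectors, multipliers, bias)
print(value)
```

Passing a different `kernel` callable (for example a radial basis function) changes only the similarity measure; the summation in Eq. 22 stays the same.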

RF method

Random forest is a machine learning model that can be applied to both regression and classification problems. The algorithm builds decision trees with a CART-like procedure, each tree using a subset of the observations drawn through the bootstrap approach (Tsangaratos and Ilia 2017). The decision tree is the basic building block of the random forest, so its structure needs to be understood. In the model, many individual trees are created by sampling the data set, and the random forest aims to improve accuracy by combining these trees into a forest. At each node of a decision tree, a subset of the variables is generated and the node is split on the best variable of this subset. Because each variable is evaluated across many trees, its contribution to explaining the variance in the dependent variable can be determined well. Breiman (2001) defined the margin function of a random forest in terms of the random vector (Θ), the indicator function (I), and the individual tree predictors h_k(X). The margin function is given as follows (Breiman 2001):

$$ \mathrm{mg}\left(X,Y\right)={\mathrm{av}}_kI\left({h}_k(X)=Y\right)-\underset{j\ne Y}{\max }{\mathrm{av}}_kI\left({h}_k(X)=j\right) $$

(23)

The overall result is given as the average of the sub-results from each tree. The average generalization error of RF can be given as follows:

$$ {\mathrm{PE}}^{\ast}\left(\mathrm{forest}\right)={P}_{X,Y}\left(\mathrm{mg}\left(X,Y\right)<0\right) $$

(24)

Two theorems can be stated for the RF algorithm (Breiman 2001).

Theorem 1

As the number of trees increases, the generalization error PE* of the forest converges to the following:

$$ {P}_{X,Y}\left({P}_{\uptheta}\left(h\left(X,\uptheta \right)=Y\right)-\underset{j\ne Y}{\max }{P}_{\uptheta}\left(h\left(X,\uptheta \right)=j\right)<0\right) $$

(25)

The average generalization error of a tree will be as follows:

$$ {\mathrm{PE}}^{\ast}\left(\mathrm{tree}\right)={E}_{\uptheta}{E}_{X,Y}{\left(Y-h\left(X,\uptheta \right)\right)}^2 $$

(26)

Theorem 2

Suppose that EY = E_X h(X, θ) for all θ, where E denotes expectation. Then:

$$ {\mathrm{PE}}^{\ast}\left(\mathrm{forest}\right)\leqslant \overline{\rho}.{PE}^{\ast}\left(\mathrm{tree}\right) $$

(27)

where $ \overline{\rho} $ is the weighted correlation between the residuals Y − h(X, θ) and Y − h(X, θ′), with θ and θ′ independent (Breiman 2001).

Overfitting, one of the biggest problems of decision trees, is reduced in the random forest model because the trees are trained on different data sets. In addition, the chance of an outlier appearing in a subset created by the bootstrap method is reduced. The random forest training algorithm (for both classification and regression) applies bootstrap aggregating, or bagging, to tree learners. More details about random forest can be found in Breiman (2001). In this study, RF is used as a regression model to estimate ET_o. The important tunable parameters are the number of trees (n_tree) and the number of predictor variables in the random subset at each node (m_try). The default value of m_try (one-third of all predictor variables) was used in this study. The number of trees, which affects the forecast performance, was selected during parameter optimization to yield the minimum error; an iterative evaluation with the out-of-bag error (mean squared error for regression problems) was used as the selection criterion. In general, RMSE decreased with increasing n_tree, and r increased correspondingly. Two forests were considered in this study, the first with 100 trees and the second with 30 trees. The random forest with 100 trees is not included in the results section because its results are very similar to those of the RBF (ANN). Thus, the random forest with 30 trees was considered in the evaluation of the study.
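The role of bootstrap sampling and the out-of-bag (OOB) error in choosing n_tree can be sketched in a few lines. The example below is deliberately simplified and is not the configuration used in this study: it bags one-split regression trees (stumps) on a single predictor instead of full CART trees, it omits the random variable subset (m_try), and the data are synthetic. All function names are hypothetical.

```python
import random

def fit_stump(xs, ys):
    """Fit a one-split regression tree on a single predictor."""
    best = None
    for t in sorted(set(xs))[:-1]:            # candidate thresholds
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        ml, mr = sum(left) / len(left), sum(right) / len(right)
        sse = sum((y - ml) ** 2 for y in left) + sum((y - mr) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    if best is None:                          # all x identical: predict the mean
        m = sum(ys) / len(ys)
        return (float("inf"), m, m)
    return best[1:]

def predict_stump(stump, x):
    t, ml, mr = stump
    return ml if x <= t else mr

def bagged_oob_mse(xs, ys, n_tree, seed=0):
    """Out-of-bag mean squared error of n_tree bagged stumps."""
    rng = random.Random(seed)
    n = len(xs)
    oob_sum, oob_cnt = [0.0] * n, [0] * n
    for _ in range(n_tree):
        idx = [rng.randrange(n) for _ in range(n)]     # bootstrap sample
        stump = fit_stump([xs[i] for i in idx], [ys[i] for i in idx])
        for i in set(range(n)) - set(idx):             # rows left out of the sample
            oob_sum[i] += predict_stump(stump, xs[i])
            oob_cnt[i] += 1
    errs = [(ys[i] - oob_sum[i] / oob_cnt[i]) ** 2 for i in range(n) if oob_cnt[i]]
    return sum(errs) / len(errs)

# Synthetic step-shaped data; compare the OOB error for two forest sizes.
xs = list(range(24))
ys = [0.0 if x < 12 else 10.0 for x in xs]
for n_tree in (30, 100):
    print(n_tree, bagged_oob_mse(xs, ys, n_tree))
```

Each observation is predicted only by the trees whose bootstrap sample did not contain it, so the OOB error acts as a built-in validation estimate when comparing forest sizes.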

Performance criteria

Five performance criteria are used in this study to assess the goodness of fit of the models: the coefficient of determination (R²), root mean square error (RMSE), Nash–Sutcliffe efficiency (NSE), the index of agreement (d), and percent bias (PBIAS). They are calculated with the following equations (Moriasi et al. 2015).

$$ {R}^2={\left[\frac{\sum \limits_{i=1}^n\left({O}_i-\overline{O}\right)\left({P}_i-\overline{P}\right)}{\sqrt{\sum \limits_{i=1}^n{\left({O}_i-\overline{O}\right)}^2}\sqrt{\sum \limits_{i=1}^n{\left({P}_i-\overline{P}\right)}^2}}\right]}^2 $$

(28)

$$ \mathrm{RMSE}=\sqrt{\frac{1}{n}\sum \limits_{i=1}^n{\left({O}_i-{P}_i\right)}^2} $$

(29)

$$ \mathrm{NSE}=1-\frac{\sum \limits_{i=1}^n{\left({O}_i-{P}_i\right)}^2}{\sum \limits_{i=1}^n{\left({O}_i-\overline{O}\right)}^2} $$

(30)

$$ d=1-\frac{\sum \limits_{i=1}^n{\left({O}_i-{P}_i\right)}^2}{\sum \limits_{i=1}^n{\left(\left|{P}_i-\overline{O}\right|+\left|{O}_i-\overline{O}\right|\right)}^2} $$

(31)

$$ \mathrm{PBIAS}\left(\%\right)=\frac{\sum \limits_{i=1}^n\left({O}_i-{P}_i\right)}{\sum \limits_{i=1}^n{O}_i}\times 100 $$

(32)

where O_i is the reference ET_o calculated with the FAO-56 PM in mm d⁻¹; P_i is the ET_o estimated by the method or model in mm d⁻¹; $ \overline{O} $ and $ \overline{P} $ are the means of O_i and P_i, respectively; and n is the total number of data.
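Equations 28 to 32 translate directly into code. The sketch below assumes that O denotes the reference (FAO-56 PM) values and P the values estimated by a method or model, which is the convention under which PBIAS < 0 indicates overestimation; the function names and the sample numbers are illustrative only.

```python
import math

def mean(v):
    return sum(v) / len(v)

def r_squared(obs, pred):
    """Squared Pearson correlation between obs and pred (Eq. 28)."""
    mo, mp = mean(obs), mean(pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    so = math.sqrt(sum((o - mo) ** 2 for o in obs))
    sp = math.sqrt(sum((p - mp) ** 2 for p in pred))
    return (cov / (so * sp)) ** 2

def rmse(obs, pred):
    """Root mean square error (Eq. 29)."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def nse(obs, pred):
    """Nash-Sutcliffe efficiency (Eq. 30)."""
    mo = mean(obs)
    return 1 - sum((o - p) ** 2 for o, p in zip(obs, pred)) / sum((o - mo) ** 2 for o in obs)

def index_of_agreement(obs, pred):
    """Index of agreement d (Eq. 31)."""
    mo = mean(obs)
    num = sum((o - p) ** 2 for o, p in zip(obs, pred))
    den = sum((abs(p - mo) + abs(o - mo)) ** 2 for o, p in zip(obs, pred))
    return 1 - num / den

def pbias(obs, pred):
    """Percent bias (Eq. 32); negative values mean overestimation."""
    return 100 * sum(o - p for o, p in zip(obs, pred)) / sum(obs)

# Toy series: every estimate is one unit above the reference value.
obs = [1.0, 2.0, 3.0, 4.0]
pred = [2.0, 3.0, 4.0, 5.0]
print(r_squared(obs, pred), rmse(obs, pred), nse(obs, pred),
      index_of_agreement(obs, pred), pbias(obs, pred))
```

The toy series shows why R² alone can mislead: a constant offset leaves R² at 1 while RMSE, NSE, d, and PBIAS all register the bias.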

Results and discussion

In the first step of the study, which represents the regional average, all analyses were made on the average of the data from the 45 meteorological stations that represent the Central Anatolian Region. At this step, the stations were not compared with each other, and the Central Anatolian Region was evaluated as a single station by taking the average of all stations. In the second step, the 45 stations were divided into 4 groups according to their elevations, and the meteorological data of the stations were averaged within each elevation group. Thus, the performance of the models and methods was analyzed for four different groups in the second step.

The results of first step (regional average)

The mean, minimum, maximum, standard deviation, coefficient of variation (CV), and skewness (SK) of the monthly regional average dataset for the entire time series are given in Table 3. The statistical parameters of the training, testing, and whole data are shown separately in the table. The performance of ANN models is affected by the skewness of the time series data (Zheng et al. 2018). ET_o and all variables have quite low skewness values in the complete dataset. Precipitation shows a more skewed distribution than the other parameters in each period (see SK values in Table 3). The skewness values were roughly similar across the data sets, although the SK values of T_min differed considerably from the others in each period. A greater CV value, defined as the standard deviation divided by the mean, indicates a greater level of dispersion around the mean. The mean ET_o in the testing period (131.70 mm/month) is higher than the mean ET_o in the training period and the whole data period (119.27 and 127.96 mm/month, respectively). As can be seen from the R² values for the whole series, T_max (R²= 0.84, p<0.05), RH_avg (R²= 0.68, p<0.05), and R_S (R²= 0.79, p<0.05) are closely correlated with ET_o.
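The two dispersion statistics discussed above can be computed as follows. This is a minimal sketch: the sample standard deviation (n − 1 denominator) and the moment-based skewness coefficient are assumptions, since the exact estimators behind Table 3 are not stated, and the input series are invented.

```python
import math

def coefficient_of_variation(values):
    """CV = standard deviation / mean (sample standard deviation assumed)."""
    m = sum(values) / len(values)
    sd = math.sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))
    return sd / m

def skewness(values):
    """Moment-based skewness: third central moment / (second central moment)^1.5."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]     # skewness close to zero
right_skewed = [1.0, 1.0, 1.0, 10.0]      # long right tail, positive skewness
print(coefficient_of_variation(symmetric), skewness(symmetric), skewness(right_skewed))
```

A series with many small values and a few large ones, as is typical of monthly precipitation in a semi-arid climate, yields a positive skewness in this sense.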

Table 3 Statistical parameter of climatic data and ET_o in training period, testing period, and entire period

In this study, monthly ET was estimated using five different data-driven models: ANN (MLP and RBF), SVM, RF, and MLR. The ET_MLP, ET_RBF, ET_SVM, ET_RF, and ET_MLR models use the same input variables. The Penman-Monteith FAO-56 equation (ET_o) was accepted as the reference equation, and the other empirical equations (Hargreaves-Samani, Penman, Makkink, Thornthwaite, Blaney-Criddle), data-driven models (ET_MLP, ET_RBF, ET_SVM, ET_RF), and the statistical model (ET_MLR) were compared with ET_o. Table 4 shows the results of all models and equations based on the MAD, RMSE, d, and NSE calculations in the training and testing periods. Generally, considering their high R² and low MAD and RMSE, the ET_RBF model and the ET_PM formula produced good results in the study area, while the worst performance belonged to ET_Mak. This result is similar to that in the Kingdom of Saudi Arabia, where the Makkink equation performed worse than the other selected methods (Islam et al. 2020). It is clear from Table 4 that the ET_MLP and ET_SVM models outperformed all other models in terms of all performance criteria in the training period. The results of ET_RF and ET_MLR are close to each other, based on their high R² and low RMSE in the training period.

Table 4 The performance statistics of the models and equations in training and testing period

All of the methods and models performed well in the training and testing periods; the values of RMSE, d, and NSE differed very little between the two periods, and all R² values were greater than 0.85. In the testing period, the MLP (R²=0.999, p<0.05) and SVM (R²=0.998, p<0.05) models were better than the others for ET_o estimation (Table 4). Therefore, ET_MLP and ET_SVM were selected as the best-fit models for estimating ET_o in the training and testing periods. The performance of the MLP and SVM models on the testing dataset showed that they can provide accurate and reliable ET_o estimations. Based on the results in Table 4, the Penman method (ET_PM), whose inputs were U and the actual and saturation vapor pressures, had higher R² (0.989; p<0.05), NSE (0.99), and d (0.99) values than the other empirical equations in the training period. The performance evaluation showed that ET_PM also performed clearly better than the other empirical methods in the testing period based on R² (0.988), RMSE (20.74 mm/month), d (0.98), and NSE (0.98). In both periods, the ET_MLP model provided the best accuracy (R²=0.998), the highest d value (1.00), and the lowest RMSE value (2.02 mm/month) of all methods. Malik et al. (2017) reported better performance (RMSE = 0.214 mm/month) with a multi-layer perceptron neural network for estimating monthly pan evaporation (EPm) in the Indian central Himalayas. This indicates that the accuracy of the models may vary according to the climate of the research site, the type of climatic data, and the sample size. In this study, the ET_RBF and ET_RF models had almost the same R², and both performed worse than the ET_SVM and ET_MLP models in the testing period. As can be seen from Table 4, the performance statistics indicated a reasonably better performance for the data-driven models than for the empirical methods. 
These results are in line with previous studies (Karimaldini et al. 2011; Tabari and Talaee 2013), which indicate that data-driven models perform better than locally calibrated physical models or conventional methods. All data-driven models and the statistical method (ET_MLR) are rather simple in terms of input parameters, and they differ from the empirical methods in that RH_avg is included among the input parameters. The results show that the models that require T_max, RH_avg, and R_S performed well in ET_o modeling and could be used with limited weather data. The performance results also show that the presence or absence of a critical input significantly affected the performance of the equation methods. However, with the same input set, the performance values of data-driven models can vary with the model configuration (number of hidden nodes, number of epochs, type of activation function, etc.).

The ET_o values calculated by FAO PM-56 and the values estimated by the different empirical methods and data-driven models in the testing period are compared in Figs. 6 and 7 in the form of line graphs, scatter plots, and residual graphs. The slope of the regression lines ranged from 0.23 to 1.08 for the empirical methods and from 0.89 to 0.99 for the data-driven models. The ET_o values estimated by ET_MLP, ET_SVM, and ET_PM were close to the reference ET_o values and followed the same trend. The figures clearly show that the ET_MLP, ET_SVM, and ET_PM estimates closely follow the corresponding ET_o values and are less scattered than those of the other methods. Therefore, these methods are considered the best alternatives for estimating monthly ET_o based on the values of R². The slope of the regression line for each method was <1.0 except for the ET_PM and ET_RF methods, indicating that ET_PM and ET_RF had strong relationships with ET_o among the empirical methods and data-driven models, respectively. However, in general, the ET_o estimated by the empirical methods did not capture the observed values and was less accurate than that of the data-driven methods (ANN, SVM, RF, and MLR) in the testing period based on R². For example, the R² of the ET_MLP, ET_RBF, ET_SVM, ET_RF, and ET_MLR models varies from 0.956 to 0.999 (Fig. 7), whereas the R² of the ET_Har, ET_PM, ET_Mak, ET_Thor, and ET_BC methods is slightly lower and varies from 0.854 to 0.988 (Fig. 6) in the testing period. These results indicate that the type and number of input variables affect the efficiency of ET_o estimation. Among the equation methods, the radiation-based method (ET_Mak) was not satisfactory compared with the other empirical methods, with an R² value of 0.854. Figure 6 shows that the Hargreaves method has lower predictive accuracy for the peak values of ET_o than the other equation methods. 
An evaluation based only on R² may not be sufficient, since R² is oversensitive to extreme values and insensitive to both additive and proportional differences between observed and model-estimated values (Legates and McCabe 1999). Goodness-of-fit indicators based on error terms (d, RMSE, and MAD) are more suitable for model evaluation than R² because they quantify the deviation between each pair of observed and estimated values. Thus, d, RMSE, and MAD were also used to evaluate the performance of all techniques, and these values are shown in Table 4. Figures 6 and 7 also show the temporal variation of observed and estimated monthly ET_o values for the empirical methods and data-driven models during the testing period. In all series, ET_o values were low in the cooler months and increased gradually toward the high-temperature months. The record shows marked fluctuations between winter and summer, which implies that changes in climatic conditions that alter evapotranspiration could easily affect the water balance and the interaction between surface and subsurface water. However, Fig. 6 shows that the Hargreaves method (ET_Har) did not accurately estimate the evapotranspiration of the high-temperature months and was not good enough in estimating peak ET_o values. This could be because the study area is characterized by a semi-arid continental climate with mildly cold winters and hot dry summers, where atmospheric conditions other than temperature and R_S are more favorable to evaporation and transpiration. Therefore, the poor estimation of peak ET_o values could be caused by the formulation used in the ET_Har method. 
Likewise, the scatter of ET_PM, which is a combination method using U and vapor pressure as input parameters, is less dispersed, and the method generally overestimates ET_o with very low errors. Generally, ET_PM overestimates, while the values of ET_Mak, ET_Thor, and ET_BC remain below the peak ET_o values for the Central Anatolian Region. The results of the empirical methods indicate the superiority of the combination-based ET_PM, which could be considered a reliable alternative for ET_o estimation among the empirical methods. The Penman method uses the vapor pressure deficit, actual vapor pressure, and an empirical U function. The ET_PM method slightly underestimates ET_o. Lee et al. (2004) reported that this difference derives from the empirical wind function used in the equation, which takes many different forms in the literature. The ET_o estimates obtained with data-driven methods for the regional average values of the Central Anatolian Region revealed that the SVM and MLP models can achieve reliable estimates. In the study, data-driven models estimated peak ET_o values more accurately. Data preprocessing, such as normalization of the data, enabled the data-driven models to capture peak magnitudes more accurately (Demirel et al. 2009). In particular, the SVM and MLP models used in this study performed better than the RBF, RF, and MLR (statistical method) models; they increased the estimation accuracy by up to 98% in the regional average dataset. The results agree well with previous studies (Sayyadi et al. 2009; Rahimikhoob 2010; Traore et al. 2010), which reported the superior accuracy of the MLP model compared with other methods for ET_o estimation in different climates. 
The performance of the ET_SVM model was quite satisfactory, and similar results were obtained by Tabari et al. (2012) and Mohammadrezapour et al. (2019) for semi-arid environments. The type of kernel function affects the performance of the SVM model in estimating ET_o (Seifi and Riahi 2020). As mentioned earlier, a very satisfactory performance was obtained using the linear kernel function of the SVM model. However, Tabari et al. (2012) found RBF to be the best kernel function among the kernel functions of SVM models. With regard to the overall performance of all empirical methods and data-driven models in the testing period, the performance ranking for the regional average in the Central Anatolian Region is as follows: ET_MLP > ET_SVM > ET_PM > ET_MLR > ET_RF > ET_RBF > ET_BC > ET_Thor > ET_Har > ET_Mak.

The residuals of all regression models as a function of the observed ET_o of the testing period and of the month of the year were also examined in Figs. 6 and 7. These graphs show the vertical distance between each actual data point and the corresponding estimated point on the line. The figures show model residuals versus observed ET_o for the whole (regional average) dataset. Seasonality can be seen in the residuals of all methods and models, and it is more pronounced for some models such as ET_Mak (Fig. 6) and ET_MLR (Fig. 7). The magnitude of the seasonality, which increases with increasing estimated ET_o, is particularly pronounced in the residuals of the ET_RBF and ET_RF models. These results can be explained by the fact that RH_avg is a seasonally dynamic property; in other words, this parameter makes the seasonality more pronounced in the residuals. Besides, the seasonal amplitude (the difference between the maximum and minimum values) of RH_avg, expressed as a percentage, is considerably higher than that of the other parameters. Therefore, the residuals of the models that use RH_avg as an input parameter were higher than those of the models that do not use it. For example, the residuals of the ET_Har model are not strongly related to ET_o magnitude or month of the year. The residuals also indicate relatively unbiased behavior for the models. According to the results of the equation methods depicted in Fig. 6, the ET_Har, ET_PM, and ET_Mak methods tended to overestimate observed ET_o, while the ET_Thor method tended to underestimate it. For the ET_BC method, the residuals generally showed a balanced distribution over the years. As can be seen from Fig. 8, the residuals of all methods were mostly positive after 2010 and 2011 (Fig. 8a, b). The graphs of cumulative average residuals clearly depict cumulative underestimation and overestimation (Fig. 8c, d). 
Before 2009, the ET_MLP model, the best data-driven model, showed cumulative overestimation in 2006, while ET_PM, the best empirical method, tended to overestimate after 2011. As shown in Fig. 8c, d, the cumulative residual plots may display a tendency to overestimate or underestimate in relation to wet and cold biases in the considered years of the study area.

The results of the second step (elevation groups)

The statistics given in Table 5 illustrate the differences between selected data characteristics in the 4 elevation groups. In the whole data, T_min has a markedly higher CV, that is, a greater level of dispersion around the mean, than the other parameters in all groups except G4. The parameter with the highest CV value in the whole data of G4 is T_max, and Table 5 shows that the CV values of the other parameters in G4 are close to that of T_max. Precipitation has a more skewed distribution in all groups, just as in the complete dataset (Table 3). Another important characteristic of the selected climate data is that the highest R² was found between ET_o and T_max in the training period of all four groups, ranging between 0.84 and 0.86 (p<0.05), while the lowest R² was found between ET_o and U₂, ranging between 0.00 and 0.04 (p>0.05) in the training period of all groups.

Table 5 Statistical parameter of climatic data and ET_o in four elevation groups

Results of the five optimal data-driven models and the empirical methods for the elevation-based groups, obtained with long-term monthly data, are provided for the training period (Table 6) and the testing period (Table 7). Tables 6 and 7 show that the RMSE values of the empirical methods in the training period are considerably higher than those in the testing period. For the ET_RBF, ET_RF, and ET_MLR models, the maximum RMSE values (15.12, 29.17, and 12.93 mm/month, respectively) were found for G1. The maximum R² values of all models in G1 were found for ET_MLR (R²= 0.998, p<0.05) and ET_SVM (R²= 0.998, p<0.05). These models presented the highest d values (1.00 and 1.00) and NSE values of 1.00 and 0.99, respectively. For ET_Thor, however, the maximum RMSE value was 3.50 mm/month, in G2. Table 7 clearly shows that the G2 group performs better than the other groups for all performance criteria in the testing period. Therefore, the models used for ET_o estimation in the Central Anatolian Region can be used most effectively at elevations between 850 and 1000 m. The performance values are similar for all elevation groups in the training period (Table 7). Once again, the ET_Har method performed the worst in G3, due to significant underestimations, with an RMSE value of 78.95 mm/month and an NSE value of −0.11. The d, NSE, and R² values shown in Table 7 indicate that ET_MLP was the best simple method for estimating ET_o in G4 (R² = 0.75). Table 6 shows that the accuracy of ET_MLP is generally better than that of the other models in ET_o estimation, and the ET_MLP model has the best accuracy in all four groups. The ET_SVM and ET_PM models also performed well in all groups, while ET_Mak yielded the worst estimation in all groups in the testing period (Table 7). The ET_o values estimated by the models have been lower than the observed ET_o values since 2009.

Table 6 Comparison of performances of all techniques for ET_o estimation of four elevation groups in training period

Table 7 Comparison of performances of all techniques for ET_o estimation of four elevation groups in testing period

PBIAS (%) indicates whether a model overestimates (PBIAS < 0) or underestimates (PBIAS > 0) ET_o, and PBIAS values nearer to 0 suggest a model or method with more predictive skill. Safeeq and Fares (2012) considered a PBIAS value of more than 15% and less than 25% an indicator of average performance, a value between 10% and 15% an indicator of good performance, and a value less than 10% an indicator of very good performance. Model efficiency in terms of PBIAS is higher for the data-driven models than for the empirical methods in both the training and testing periods of each group (Fig. 9). The performance of ET_MLP and ET_RBF over the entire training and testing periods is considered “very good” because their PBIAS values vary between 0 and −2% (Fig. 9). Celestin et al. (2020) found that the World Meteorological Organization (WMO) and Mahringer (MAHR) models performed well with monthly data compared to the PM FAO-56 model, with PBIAS values of −2.5% and −2.6% after the calibration period, respectively. The ET_Thor method provided the highest PBIAS values in all groups in both periods. From the MAD, d, and NSE perspective, ET_Thor shows acceptable performance in both the training and testing periods (Tables 4, 5, and 6). However, the maximum RMSE and MAD values are exhibited by the ET_Har method for the regional average (non-group) in both periods (Table 4). ET_BC, ET_Mak, ET_PM, ET_SVM, and ET_RBF underestimate the reference evapotranspiration for all elevation groups and the regional average (non-group) in the training and testing periods. ET_MLP for the regional average and ET_MLR in the G2 group are equally suitable, with 0% PBIAS, and they could also be used satisfactorily to estimate reference evapotranspiration for the study area.
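The PBIAS thresholds quoted above can be written as a small lookup. This is a sketch of the classification as described in the text; applying the thresholds to the absolute value of PBIAS and the label used for values of 25% or more are assumptions, and the function names are hypothetical.

```python
def pbias_rating(pbias_percent):
    """Rate a PBIAS value using the thresholds attributed to Safeeq and Fares (2012)."""
    magnitude = abs(pbias_percent)       # assumption: thresholds apply to |PBIAS|
    if magnitude < 10:
        return "very good"
    if magnitude <= 15:
        return "good"
    if magnitude < 25:
        return "average"
    return "unsatisfactory"              # assumption: label not given in the text

def bias_direction(pbias_percent):
    """Negative PBIAS means overestimation, positive means underestimation."""
    if pbias_percent < 0:
        return "overestimate"
    if pbias_percent > 0:
        return "underestimate"
    return "unbiased"

for value in (-2.0, 12.0, 20.0, 30.0):
    print(value, pbias_rating(value), bias_direction(value))
```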

For further analysis, the developed predictive models of reference evapotranspiration are examined with the Taylor diagram (Taylor 2001). The Taylor diagram summarizes the results of methods or models using the standard deviation and the R² of the observed and simulated data. The radial coordinate shows the value of the standard deviation, the concentric semi-circles represent the magnitude of the standard deviation, and the angular coordinate indicates the values of R². The ET_o estimates that agree best with the observed ET_o lie nearest to the point marked “reference” on the x-axis. Figures 10 and 11 display the standard deviation and R² (with observed ET_o) for the results of the different equation methods and models, respectively. The data-driven models (Fig. 11) are, in general, more accurate than the empirical methods (Fig. 10), with the latter having relatively low d and NSE values (Table 7). In the Taylor diagrams, the standard deviations of the four elevation groups fall in the range of 78–86 with respect to the ideal model points of both the empirical methods and the data-driven models in the testing period. In the Taylor diagram, ET_PM is among the methods that perform well in estimating ET_o (Fig. 10). The Taylor diagram analysis reveals that ET_MLP has the highest R² (between 0.997 and 0.999), the lowest standard deviation (between 74 and 80 mm/month), and the smallest RMSE (between 0.04 and 5.71), and captures the observations better than all other data-driven models in all groups.
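The quantities plotted on a Taylor diagram can be computed directly from two series. In Taylor (2001) the angular coordinate is defined through the correlation coefficient r (R² is its square), and the standard deviations, r, and the centered root-mean-square difference E′ are linked by the law of cosines, E′² = σ_o² + σ_p² − 2σ_oσ_p r. The sketch below uses invented monthly values and hypothetical function names.

```python
import math

def taylor_statistics(obs, pred):
    """Standard deviations, correlation, centered RMS difference, and plot angle."""
    n = len(obs)
    mo, mp = sum(obs) / n, sum(pred) / n
    sd_o = math.sqrt(sum((o - mo) ** 2 for o in obs) / n)
    sd_p = math.sqrt(sum((p - mp) ** 2 for p in pred) / n)
    r = sum((o - mo) * (p - mp) for o, p in zip(obs, pred)) / (n * sd_o * sd_p)
    crmsd = math.sqrt(sum(((p - mp) - (o - mo)) ** 2 for o, p in zip(obs, pred)) / n)
    angle = math.acos(max(-1.0, min(1.0, r)))   # clamp against rounding before acos
    return {"sd_obs": sd_o, "sd_pred": sd_p, "r": r, "crmsd": crmsd, "angle_rad": angle}

# Invented monthly ET_o values (mm/month): reference and one model estimate.
obs = [10.0, 50.0, 120.0, 180.0, 90.0, 30.0]
pred = [12.0, 45.0, 130.0, 170.0, 95.0, 28.0]
stats = taylor_statistics(obs, pred)

# Polar position of the model point: radius = sd_pred, angle = arccos(r).
x = stats["sd_pred"] * math.cos(stats["angle_rad"])
y = stats["sd_pred"] * math.sin(stats["angle_rad"])
print(stats, (x, y))
```

The distance from the plotted point (x, y) to the reference point (sd_obs, 0) equals the centered RMS difference, which is why models closest to the reference point are judged best.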

Conclusion

The performances of five empirical methods (ET_Har, ET_PM, ET_Mak, ET_Thor, and ET_BC) and five data-driven models (ET_MLP, ET_RBF, ET_SVM, ET_RF, and ET_MLR) in estimating ET_o were assessed against the PM FAO-56 reference for the Central Anatolian Region of Turkey using two main approaches (regional average and elevation groups). The performances of the empirical methods and data-driven models are reported to provide evidence for suitable techniques for estimating ET_o values.

Monthly climatic data from 45 meteorological stations over a period of 35 years (1979–2013) were used in this study. The study was conducted in two stages of data preparation. In the first stage, the average of all parameter values obtained from the 45 meteorological stations was evaluated. In the second stage, the data set was divided into 4 elevation groups. The correlation of the parameters with ET_o was taken into account in the selection of input parameters. The climatic variables considered at all stations showed that ET_o is strongly and positively correlated with T_max, RH_avg, and R_S, with R² values of 0.84, 0.68, and 0.79, respectively. These three variables can therefore be effective in modeling evapotranspiration in a semi-arid region. They should be included in long-term monitoring programs, especially for agricultural planning and water resources management in semi-arid regions, because evapotranspiration is an essential factor that causes great changes in the water budget, especially in fragile semi-arid ecosystems.

Based on the evaluation of the grouping results, the MLP and SVM models in G2 (850–1000 m) can be employed successfully in modeling the monthly mean ET_o, because both approaches yield better estimates, with high R² values, than the empirical methods, with MLP being slightly more successful than SVM. Therefore, this research suggests that reference evapotranspiration in a semi-arid region can be modeled using only a few input parameters with the help of simple but effective data-driven models. The Penman method achieved the highest accuracy in terms of all performance criteria among the empirical methods. It is suitable for estimating the reference evapotranspiration and can be used reliably in semi-arid areas.

From this study, it can be concluded that when a single climatic variable such as U or sunshine duration is missing, the alternative models can be used to approximate the PM FAO-56 model accurately in semi-arid environments. The results are encouraging and suggest that data-driven models offer an easy-to-use and accurate alternative to empirical approaches for estimating reference evapotranspiration, because the advantage of soft computing methods lies in the possibility of improving the performance criteria by modifying the important tunable parameters.

References

Alexandris S, Stricevic R, Petkovic S (2008) Comparative analysis of reference evapotranspiration from the surface of rainfed grass in central Serbia, calculated by six empirical methods against the Penman-Monteith formula. Euro Water 21:17–28
Allen RG, Pereira LS, Raes D, Smith M (1998) In: FAO (ed) Crop evapotranspiration: guidelines for computing crop water requirements. FAO Irrigation and drainage paper 56. Food and Agriculture Organization, Rome, pp 1–326
Blaney HF, Criddle WD (1950) Determining water requirement in irrigated areas from climatological data. Soil Conservation Service Technical Publication No. 96, US Department of Agriculture, Washington DC
Breiman L (2001) Random forests. Mach Learn 45:5–32
Celestin S, Qi F, Li R, Yu T, Cheng W (2020) Evaluation of 32 simple equations against the Penman–Monteith method to estimate the reference evapotranspiration in the Hexi Corridor, Northwest China. Water 12:2772
Chia MY, Huang YF, Koo CH (2020) Support vector machine enhanced empirical reference evapotranspiration estimation with limited meteorological parameters. Comput Electron Agric 175:105577
Article Google Scholar
Currie DJ (1991) Energy and large-scale patterns of animal- and plant-species richness. Am Nat 137:27–49
Article Google Scholar
da Silva HJ, dos Santos MS, Junior JBC, Spyrides MH (2016) Modeling of reference evapotranspiration by multiple linear regression. Journal of Hyperspectral Remote Sensing 6:44–58
Demirel MC, Venancio A, Kahya E (2009) Flow forecast by SWAT model and ANN in Pracana basin, Portugal. Adv Eng Softw 40:467–473
Article Google Scholar
Dile YT, Srinivasan R (2014) Evaluation of CFSR climate data for hydrologic prediction in data-scarce watersheds: an application in the Blue Nile River Basin. J Am Water Resour Assoc 50:1226–1241
Article Google Scholar
Douglas EM, Jacobs JM, Sumner DM, Ray RL (2009) A comparison of models for estimating potential evapotranspiration for Florida land cover types. J Hydrol 373:366–376
Article Google Scholar
Duane WJ, Pepin NC, Losleben ML, Hardy DR (2008) General characteristics of temperature and humidity variability on Kilimanjaro, Tanzania. Arct Antarct Alp Res 40:323–334
Article Google Scholar
Efthimiou N, Alexandris S, Karavitis C, Mamassis N (2013) Comparative analysis of reference evapotranspiration estimation between various methods and the FAO56 Penman-Monteith procedure. Euro Water 42:19–34
Google Scholar
Feng Y, Cui N, Gong D, Zhang Q, Zhao L (2017) Evaluation of random forests and generalized regression neural networks for daily reference evapotranspiration modelling. Agr Water Manag 193:163–173
Article Google Scholar
Fisher DK, Pringle HC III (2013) Evaluation of alternative methods for estimating reference evapotranspiration. Agric Sci 4:51–60
Google Scholar
Fuka DR, Walter MT, MacAlister C, Degaetano AT, Steenhuis TS, Easton ZM (2014) Using the climate forecast system reanalysis as weather input data for watershed models. Hydrol Process 28:5613–5623
Article Google Scholar
Hadria R, Benabdelouhab T, Lionboui H, Salhi A (2021) Comparative assessment of different reference evapotranspiration models towards a fit calibration for arid and semi-arid areas. J Arid Environ 184:104318
Article Google Scholar
Hargreaves GH, Samani ZA (1985) Reference crop evapotranspiration from temperature. Appl Eng Agric 1:96–99
Article Google Scholar
Hashemi M, Sepaskhah AR (2020) Evaluation of artificial neural network and Penman–Monteith equation for the prediction of barley standard evapotranspiration in a semi-arid region. Theor Appl Climatol 139:275–285
Article Google Scholar
Islam S, Abdullah RAB, Badruddin IA, Algahtani A, Shahid S (2020) Calibration and validation of reference evapotranspiration models in semi-arid conditions. Appl Ecol Environ Res 18:1361–1386
Article Google Scholar
Issaka AI, Paek J, Abdella K, Pollanen M, Huda AKS, Kaitibie S, Goktepe I, Haq MM, Moustafa AT (2017) Analysis and calibration of empirical relationships for estimating evapotranspiration in Qatar: case study. J Irrig Drain Eng 143:05016013
Article Google Scholar
Käfer PS, da Rocha NS, Diaz LR, Kaiser EA, Santos DC, Veeck GP, Robérti DR, Rolim SBA, de Oliveira GG (2020) Artificial neural networks model based on remote sensing to retrieve evapotranspiration over the Brazilian Pampa. J Appl Remote Sens 14:038504
Article Google Scholar
Karimaldini F, Teang Shui L, Ahmed Mohamed T, Abdollahi M, Khalili N (2011) Daily evapotranspiration modeling from limited weather data by using neuro-fuzzy computing technique. J Irrig Drain Eng 138:21–34
Article Google Scholar
Kisi O (2014) Comparison of different empirical methods for estimating daily reference evapotranspiration in Mediterranean climate. J Irrig Drain Eng 140:04013002
Article Google Scholar
Lang D, Zheng J, Shi J, Liao F, Ma X, Wang W, Chen X, Zhang M (2017) A comparative study of potential evapotranspiration estimation by eight methods with FAO Penman–Monteith method in southwestern China. Water 9:1–18
Article Google Scholar
Lee TS, Najim MMM, Aminul MH (2004) Estimating evapotranspiration of irrigated rice at the West Coast of the Peninsular of Malaysia. J Appl Irrig Sci 39:103–117
Google Scholar
Legates DR, McCabe GJ (1999) Evaluating the use of “goodness-of-fit” measures inhydrologic and hydroclimatic model validation. Water Resour Res 35:233–241
Article Google Scholar
Li S, Kang S, Zhang L, Zhang J, Du T, Tong L, Ding R (2016) Evaluation of six potential evapotranspiration models for estimating crop potential and actual evapotranspiration in arid regions. J Hydrol 543:450–461
Article Google Scholar
Lu J, Sun G, McNulty SG, Amatya DM (2005) A comparison of six potential evapotranspiration methods for regional use in the Southeastern United States 1. J Am Water Resour As 41:621–633
Article Google Scholar
Ma YJ, Li XY, Liu L, Yang XF, Wu XC, Wang P, Lin H, Zhang GH, Miao CY (2019) Evapotranspiration and its dominant controls along an elevation gradient in the Qinghai Lake watershed, northeast Qinghai-Tibet Plateau. J Hydrol 575:257–268
Article Google Scholar
Maes WH, Gentine P, Verhoest NEC, Miralles DG (2019) Potential evaporation at eddy-covariance sites across the globe. Hydrol Earth Syst Sci 23:925–948
Article Google Scholar
Makking GF (1957) Testing the Penman formula by means of lysimeters. J Inst Water Eng 11:277–288
Google Scholar
Malik A, Kumar A, Kisi O (2017) Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models. Comput Electron Agric 143:302–313
Article Google Scholar
Middleton N, Thomas D (1997) World atlas of desertification, 2nd edn. UNEP, London
Mohammadrezapour O, Piri J, Kisi O (2019) Comparison of SVM, ANFIS and GEP in modeling monthly potential evapotranspiration in an arid region (Case study: Sistan and Baluchestan Province, Iran). Water Supply 19:392–403
Article Google Scholar
Moriasi DN, Gitau MW, Pai N, Daggupati P (2015) Hydrologic and water quality models: performance measures and evaluation criteria. T Asabe 58:1763–1785
Article Google Scholar
Penman HL (1948) Natural evaporation from open water, bare soil, and grass. Proc Royal Soc Lond A 193:120–146
Article Google Scholar
Rácz C, Nagy J, Dobos AC (2013) Comparison of several methods for calculation of reference evapotranspiration. Acta Silv et Lignaria Hungarica 9:9–24
Article Google Scholar
Rahimikhoob A (2010) Estimation of evapotranspiration based on only air temperature data using artificial neural networks for a subtropical climate in Iran. Theor Appl Climatol 101:83–91
Article Google Scholar
Rahimikhoob A, Behbahani MR, Fakheri J (2012) An evaluation of four reference evapotranspiration models in a subtropical climate. Water Resour Manag 26:2867–2881
Article Google Scholar
Safeeq M, Fares A (2012) Hydrologic response of a Hawaiian watershed to future climate change scenarios. Hydrol Process 26:2745–2764
Sayyadi H, Oladghaffari A, Faalian A, Sadraddini AA (2009) Comparison of RBF and MLP neural networks performance for estimation of reference crop evapotranspiration. Water Soil Sci 19:1–12
Google Scholar
Schemmel F, Mikes T, Rojay B, Mulch A (2013) The impact of topography on isotopes in precipitation across the Central Anatolian Plateau (Turkey). Am J Sci 313:61–80
Article Google Scholar
Seifi A, Riahi H (2020) Estimating daily reference evapotranspiration using hybrid gamma test-least square support vector machine, gamma test-ANN, and gamma test-ANFIS models in an arid area of Iran. J Water Clim Change 11:217–240
Article Google Scholar
Sentelhas PC, Gillespie TJ, Santos EA (2010) Evaluation of FAO Penman–Monteith and alternative methods for estimating reference evapotranspiration with missing data in Southern Ontario, Canada. Agr Water Manag 97:635–644
Article Google Scholar
Serengil Y (2018) Climate change and carbon management. UNDP, Ankara
Stephens JC, Stewart EH (1963) A comparison of procedures for computing evaporation and evapotranspiration. Publication 62:123–133
Sudheer KP, Gosain AK, Ramasastri KS (2003) Estimating actual evapotranspiration from limited climatic data using neural computing technique. J Irrig Drain Eng 129:214–218
Article Google Scholar
Sun JY, Sun XY, Hu ZY, Wang GX (2020) Exploring the influence of environmental factors in partitioning evapotranspiration along an elevation gradient on Mount Gongga, eastern edge of the Qinghai-Tibet Platea, China. J Mt Sci 17:384–396
Article Google Scholar
Tabari H, Talaee PH (2013) Multilayer perceptron for reference evapotranspiration estimation in a semiarid region. Neural Comput & Applic 23:341–348
Article Google Scholar
Tabari H, Kisi O, Ezani A, Talaee PH (2012) SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. J Hydrol 444:78–89
Article Google Scholar
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106:7183–7192
Tellen VA (2017) A comparative analysis of reference evapotranspiration from the surface of rainfed grass in Yaounde, calculated by six empirical methods against the penman-monteith formula. Earth Perspect 4:1–8
Article Google Scholar
Thornthwaite CW (1948) An approach toward a rational classification of climate. Geogr Rev 38:55–94
Article Google Scholar
Trajkovic S, Todorovic B, Stankovic M (2003) Forecasting reference evapotranspiration by artificial neural networks. J Irrig Drain E 129:454–457
Article Google Scholar
Traore S, Wang YM, Kerh T (2010) Artificial neural network for modeling reference evapotranspiration complex process in Sudano-Sahelian zone. Agr Water Manag 97:707–714
Article Google Scholar
Tsangaratos P, Ilia I (2017) Applying machine learning algorithms in landslide susceptibility assessments. In: Samui P, Sekhar S, Balas VE (eds) Handbook of neural computation. Academic Press, London, pp 433–457
Tukimat NNA, Harun S, Shahid S (2012) Comparison of different methods in estimating potential evapotranspiration at Muda Irrigation Scheme of Malaysia. J Agr Rural Dev Trop 113:77–85
Google Scholar
Türkeş M, Tatlı H (2011) Use of the spectral clustering to determine coherent precipitation regions in Turkey for the period 1929–2007. Int J Climatol 31:2055–2067
Article Google Scholar
Vapnik VN (1995) The nature of statistical learning theory. Springer Verlag, NewYork
Vicente-Serrano SM, Lanjeri S, López-Moreno JI (2007) Comparison of different procedures to map reference evapotranspiration using geographical information systems and regression-based techniques. Int J Climatol 27:1103–1118
Article Google Scholar
Wang S, Lian J, Peng Y, Hu B, Chen H (2019) Generalized reference evapotranspiration models with limited climatic data based on random forest and gene expression programming in Guangxi, China. Agr Water Manag 221:220–230
Article Google Scholar
Wang LH, He XB, Steiner JF, Zhang DW, Wu JK, Wang SY, Ding YJ (2020) Models and measurements of seven years of evapotranspiration on a high elevation site on the Central Tibetan Plateau. J Mt Sci 17:3039–3053
Article Google Scholar
Wen X, Si J, He Z, Wu J, Shao H, Yu H (2015) Support-vector-machine-based models for modeling daily reference evapotranspiration with limited climatic data in extreme arid regions. Water Resour Manag 29:3195–3209
Article Google Scholar
Xu CY, Singh VP (2002) Cross comparison of empirical equations for calculating potential evapotranspiration with data from Switzerland. Water Resour Manag 16:197–219
Article Google Scholar
Yirga SA (2019) Modelling reference evapotranspiration for Megecha catchment by multiple linear regression. Model Earth Syst Environ 5:471–477
Article Google Scholar
Yurtseven I, Zengin M (2013) Neural network modelling of rainfall interception in four different forest stands. Ann For Res 56:351–362
Google Scholar
Zanetti SS, Sousa EF, Oliveira VP, Almeida FT, Bernardo S (2007) Estimating evapotranspiration using artificial neural network and minimum climatological data. J Irrig Drain Eng 133:83–89
Article Google Scholar
Zheng F, Maier HR, Wu W, Dandy GC, Gupta HV, Zhang T (2018) On lack of robustness in hydrological model development due to absence of guidelines for selecting calibration and evaluation data: Demonstration for data-driven models. Water Resour Res 54:1013–1030
Article Google Scholar

Author information

Authors and Affiliations

Department of Watershed Management, Faculty of Forestry, Istanbul University-Cerrahpasa, 34473, Istanbul, Turkey
Ibrahim Yurtseven & Yusuf Serengil

Corresponding author

Correspondence to Ibrahim Yurtseven.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Additional information

Responsible Editor: Zhihua Zhang

About this article

Cite this article

Yurtseven, I., Serengil, Y. Comparison of different empirical methods and data-driven models for estimating reference evapotranspiration in semi-arid Central Anatolian Region of Turkey. Arab J Geosci 14, 2033 (2021). https://doi.org/10.1007/s12517-021-08150-8

Received: 06 May 2021
Accepted: 29 July 2021
Published: 19 September 2021
DOI: https://doi.org/10.1007/s12517-021-08150-8
