Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration

Tikhamarine, Yazid; Malik, Anurag; Souag-Gamane, Doudja; Kisi, Ozgur

doi:10.1007/s11356-020-08792-3

Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration

Research Article
Published: 23 May 2020

Volume 27, pages 30001–30019, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Environmental Science and Pollution Research Aims and scope Submit manuscript

Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration

Download PDF

Yazid Tikhamarine¹,
Anurag Malik²,
Doudja Souag-Gamane¹ &
…
Ozgur Kisi³

1345 Accesses
83 Citations
Explore all metrics

Abstract

Accurate estimation of reference evapotranspiration (ET_o) is profoundly crucial in crop modeling, sustainable management, hydrological water simulation, and irrigation scheduling, since it accounts for more than two-thirds of global precipitation losses. Therefore, ET_o-based estimation is a major concern in the hydrological cycle. The estimation of ET_o can be determined using various methods, including field measurement (the scale of the lysimeter), experimental methods, and mathematical equations. The Food and Agriculture Organization recommended the Penman-Monteith (FAO-56 PM) method which was identified as the standard method of ET_o estimation. However, this equation requires a large number of measured climatic data (maximum and minimum air temperature, relative humidity, solar radiation, and wind speed) that are not always available on meteorological stations. Over the decade, the artificial intelligence (AI) models have received more attention for estimating ET_o on multi-time scales. This research explores the potential of new hybrid AI model, i.e., support vector regression (SVR) integrated with grey wolf optimizer (SVR-GWO) for estimating monthly ET_o at Algiers, Tlemcen, and Annaba stations located in the north of Algeria. Five climatic variables namely relative humidity (RH), maximum and minimum air temperatures (T_max and T_min), solar radiation (R_s), and wind speed (U_s) were used for model construction and evaluation. The proposed hybrid SVR-GWO model was compared against hybrid SVR-genetic algorithm (SVR-GA), SVR-particle swarm optimizer (SVR-PSO), conventional artificial neural network (ANN), and empirical (Turc, Ritchie, Thornthwaite, and three versions of Valiantzas methods) models by using root mean squared error (RMSE), Nash-Sutcliffe efficiency (NSE), Pearson correlation coefficient (PCC), and Willmott index (WI), and through graphical interpretation. Through the results obtained, the performance of the SVR-GWO provides very promising and occasionally competitive results compared to other data-driven and empirical methods at study stations. Thus, the proposed SVR-GWO model with five climatic input variables outperformed the other models (RMSE = 0.0776/0.0613/0.0374 mm, NSE = 0.9953/ 0.9990/0.9995, PCC = 0.9978/0.9995/0.9998 and WI = 0.9988/0.9997/0.9999) for estimating ET_o at Algiers, Tlemcen, and Annaba stations, respectively. In conclusion, the results of this research indicate the suitability of the proposed hybrid artificial intelligence model (SVR-GWO) at the study stations. Besides, promising results encourage researchers to transfer and test these models in other locations in the world in future works.

Estimation of reference evapotranspiration using data driven techniques under limited data conditions

Article 19 August 2017

Enhanced Daily Reference Evapotranspiration Estimation Using Optimized Hybrid Support Vector Regression Models

Article 11 May 2024

Modeling daily reference evapotranspiration using SVR machine learning algorithm with limited meteorological data in Dar-el-Beidha, Algeria

Article 29 May 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Reference evapotranspiration (ET_o) is a vital component of the hydrological cycle and plays an important role in the sustainable management of water resources because it accounts for more than two-thirds of rainfall losses (Shiri et al. 2013; Citakoglu et al. 2014; Shiri et al. 2014; Kisi et al. 2015). Therefore, it is necessary that irrigation managers and water researchers are provided with an accurate tool for estimating reference evapotranspiration. Typically, ET_o can be measured directly by using the lysimeters or eddy-covariance systems. However, making accurate estimation of ET_o is difficult because it is affected by several factors, and the use of lysimeters is very expensive with high construction and maintenance costs, especially in developing countries such as Algeria. Thus, the use of mathematical models (empirical and semi-empirical models based on meteorological variables) is a more suitable approach for practical applications and does not cost any price (Ferreira et al. 2019; Shiri et al. 2019a). The Food and Agriculture Organization recommended Penman-Monteith (FAO-56 PM) model for estimating ET_o, by refer to the default grass with a presumed height of 0.12 m, albedo of 0.23, and surface resistance of 70 s m⁻¹ (Allen et al. 1998). The main limitation of using the FAO-56 PM model is the lack of concrete climate data, whereas this model requires a large number of climatic variables (relative humidity, maximum and minimum air temperatures, solar radiation and wind speed) as input data. Unfortunately, in most of the developing countries, these climatic variables are often unavailable or incomplete and this impedes the ET_o account using the FAO-56 PM model (Djaman et al. 2017; Banda et al. 2018; Ferreira et al. 2019)

In the last two decades, various models of artificial intelligence (simple or mixed) have been applied in various fields of science and engineering to deal with various scientific issues such as modeling, optimization, and prediction taking into account the ability of artificial intelligence to address the nonlinear relationships between variables (Nourani et al. 2014; Yaseen et al. 2015). ET_o prediction received numerous applications of artificial intelligence (AI) models in the last decade such as (Shiri et al. 2014, 2013; Citakoglu et al. 2014; Kisi et al. 2015; Kisi 2016; Feng et al. 2017; Shiri 2017; Landeras et al. 2018; Tao et al. 2018; Wu et al. 2019; Mohammadrezapour et al. 2019; Malik et al. 2019a; Chia et al. 2020). The use of artificial intelligence models allows the identification and selection of inputs (adding or deleting inputs) and finding the appropriate relationship between variables (inputs and the target variable). This option attracts researchers to adapt and formulate an appropriate model using the lowest possible number of datasets considering the high performance of the chosen model into account. Kisi (2007) investigated the artificial neural networks (ANNs) with the Levenberg–Marquardt for estimation of ET_o and concluded that they could be successfully utilized for modeling ET_o from the available climatic data. Shiri et al. (2014) compared the efficacy of heuristic data-driven models (adaptive neuro-fuzzy inference system, ANNs, gene expression programming and support vector machine) against empirical models (Hargreaves–Samani, Makkink, Priestley–Taylor, and Turc) to estimate ET_o on the daily basis in Iran. The comparison results revealed the supremacy of heuristic data-driven models over the empirical models. Shiri (2018) applied wavelet-random forest (WRF) models against mass transfer-based models for daily ET_o estimating in southern Iran. The results showed that the WRF model performed superior to the mass transfer-based models. Saggi and Jain (2019) examined the performance of the generalized linear model (GLM), gradient boosting machine (GBM) random forest (RF), and deep learning (DL) for ET_o estimation in India. They stated that the best results of daily ET_o estimation are those obtained with the DL models.

Recently, ET_o estimation received the massive applications of AI models (Shiri et al. 2019b; Tikhamarine et al. 2019a). Ferreira et al. (2019) examined the ANNs and support vector machine (SVM) for estimating daily ET_o in Brazil. The comparison results proved the superiority of the ANNs than the SVM model. Granata (2019) applied support vector regression, RF, and regression tree to model daily ET_o in Florida and reported that the regression tree performs superior to the other models in estimating ET_o using six input variables, namely mean temperature, relative humidity, net solar radiation, wind speed, soil moisture content, and sensible heat flux. Keshtegar et al. (2019) examined multi-layer perceptron neural network, M5 tree, and polynomial chaos expansion models to estimate daily ET_o in Turkey. Through the results obtained, the polynomial chaos expansion model provides the best performance compared to other models in the study area. Nourani et al. (2019) applied AI models (support vector regression, adaptive neuro-fuzzy inference system and feed-forward neural network) and empirical models (Hargreaves-Samani, modified Hargreaves-Samani, Makkink, Ritchie, multi-linear regression) to estimate daily ET_o in Turkey, Cyprus, Iraq, Iran, and Libya. They found that the best estimates are those given by AI models. Shiri (2019) compared the performance of gene expressions programming (GEP) versus Hargreaves-Samani, Makkink, Trabert, Priestley-Taylor, Dalton, and Turc for daily ET_o modeling in Iran. The comparison results indicated the better accuracy of GEP to the other alternatives. Shiri et al. (2019a) used the data splitting strategy with gene expressions programming to model daily ET_o in northwestern Iran and reported that, under different scenarios, GEP have led to good results.

For the Algerian sites, ET_o estimation suffers from the problem of lack of qualitative and quantitative climatic data, at both the spatial and time scales. Therefore, it would be interesting to know the least demanding methods in terms of the quantity of data and leading to good estimation results. In this context, Heddam et al. (2018) proposed three methods (the on-line and off-line dynamic evolving neural fuzzy inference systems (DENFIS_ON/ DENFIS_OF), and evolving fuzzy neural network) for estimating daily ET_o in the north of Algeria. The proposed models were compared with the Penman FAO-56 PM method. The results obtained revealed that the proposed models showed high accuracy compared to the Penman-Monteith model and the DENFIS_OF outperformed the other models. Zakhrouf et al. (2019) examined the ability of subtractive clustering adaptive neuro-fuzzy inference system (SC_ANFIS) and ANFIS based on the fuzzy C-means clustering (FC_ANFIS) for estimation ET_o from Dar El Beida Station situated in Algiers, Algeria. The results revealed that the S_ANFIS model significantly outperformed multiple linear regression and F_ANFIS models.

The main objectives of this research are to find robust alternative methods that could achieve accurate estimation of ET_o even when available data are very limited such as the minimum and maximum temperatures that are easily determined. The generalized Penman-Monteith model (FAO-56 PM) is the most reliable method for estimating ET_o in agricultural and water resource research, as it fits well with evapotranspiration observations (Penman 1948). The FAO-56 PM model was recognized as a standard estimation method of ET_o although its application requires a large amount of climate data, which may not be available in certain locations, as is the case in developing countries.

In this context, in the current study, the focus will be on how to eliminate the inputs one by one in four proposed scenarios to overcome the problem of providing all climate data, along with taking into account the highest performance of the developed models in mind. The proposed models (support vector regression with the new heuristic algorithm) were examined for four scenarios using different meteorological data and compared with artificial neural network (ANN) and empirical methods for estimating ET_o. The accuracy of the support vector regression depends mostly on the correct identification of parameters that can be considered as an optimization problem. Therefore, in order to select the most suitable initialization for the internal parameters, an advanced nature-inspired optimization algorithm has been functionalized and integrated with the SVR model namely, grey wolf optimizer (GWO). In fact, there are several internal parameters within all the AI models; the optimization algorithm selected should be able to consider the scalability and the dimension of the optimization problem with avoiding the large numbers of optimal local solutions. The main objective behind using new algorithms is to obtain the best solution (best parameters) that achieve robust accuracy in a brief time possible. Therefore, the genetic algorithm (GA), and particle swarm optimizer (PSO) algorithms were integrated with the SVR model and compared with the proposed SVR-GWO model to achieve robust performance in all scenarios through visual inspection and statistical performance criteria.

Materials and methods

Study area and dataset

In the present study, three climatic stations located in the North of Algeria were chosen to test the developed models; Algiers station (latitude 36° 40’ 59” N, longitude 3° 13’ 1.2” E) located in center of North Algeria with an altitude of 25 m above sea level, Tlemcen (35° 1’ 1.2” N, 1° 15’ 9.7” E) located in west of North Algeria at 247 m above sea level, and Annaba (36° 49’ 58.8” N, 7° 49’ 1.2” E) located in east of North Algeria at 4 m above sea level. Figure 1 illustrates the location map of study stations. The observed monthly data of relative humidity (RH), solar radiation (R_s), wind speed (U_s), and maximum and minimum air temperatures (T_max and T_min) were obtained from meteorological national office (ONM) situated in Algiers, Algeria. The climatic data cover 14 years (168 months) from January 2000 to December 2013 for all study stations. The data were separated into two by utilizing data from January 2000 to October 2009 for training (70%) and data from November 2009 to December 2013 for testing (30%) for all study stations. The descriptive statistical characteristics of the meteorological variables are presented in Table 1 for the Algiers, Tlemcen, and Annaba stations.

Table 1 Statistical parameters of the meteorological variables at study stations

Full size table

Four scenarios were compared according to the following combinations of the climatic variables: (M1) T_max, T_min, RH, U_s, and R_s, (M2) T_max, T_min, RH and R_s, (M3) T_max, T_min, and R_s and (M4) T_max and T_min. Consequently, SVR-GWO-1, SVR-PSO-1, SVR-GA-1, ANN-1, and Valiantzas-1 correspond to the first combination (M1); SVR-GWO-2, SVR-PSO-2, SVR-GA-2, ANN-2, Turc, and Valiantzas-2 correspond to the second scenario (M2), SVR-GWO-3, SVR-PSO-3, SVR-GA-3, ANN-3, Ritchie and Valiantzas-2 correspond to the third scenario (M3); and the last scenario (M4) involves SVR-GWO-4, SVR-PSO-4, SVR-GA-4, ANN-4, and Thornthwaite. Table 2 summarized the details of input variables for hybrid AI and traditional climate-based models.

Table 2 Input combinations for hybrid AI and empirical models

Full size table

Empirical methods

Penman-Monteith equation (FAO-56 PM)

Due to the absence of experimental reference evapotranspiration around the study stations, the FAO-56 PM model was used as a target for empirical and artificial intelligence models. The Penman method (Allen et al. 1998) can be considered as the most popular equation in evaporation studies and it is accepted as the sole empirical method by the Food and Agriculture Organization of the United Nations (FAO) and very common practice for reference evapotranspiration (Allen et al. 1998). Equation (1) expresses the FAO-56 PM:

$$ {\mathrm{ET}}_o=\frac{0.408\ \Delta \ \left({R}_n-G\right)+\gamma \frac{900}{T+273}\mathrm{U}\left({e}_s-{e}_a\right)}{\Delta +\gamma \left(1+0.34\ \mathrm{U}\right)} $$

(1)

where, ET_o is the monthly reference evapotranspiration (mm/month), R_n is the net radiation (MJ/m²/month), ∆: slope of saturation vapor pressure curve (kPa ^oC⁻¹), G: soil heat flux (MJ/m²/month), e_s and e_a are saturated and actual vapor pressures (kPa), γ_: psychrometric constant (kPa ^oC⁻¹), U is the monthly wind speed at 2 m height (m/s), and T is the mean air temperature (°C).

Valiantzas method

In 2013, Valiantzas proposed empirical equations for modeling ET_o (Valiantzas 2013a, b). Valiantzas equations are empirical methods based on the simplification of the Penman FAO-56 equation. The three different versions of Valiantzas methods developed with and without a full set of climatic data can be expressed as:

Valiantzas method using all climatic data (Valiantzas-1)

$$ \mathrm{E}{\mathrm{T}}_0=0.0393{R}_s\sqrt{T+9.5}-0.19{R_s}^{0.6}{\phi}^{0.15}+0.048\left(T+20\right)\left(1-\frac{RH}{100}\right){U}^{0.7} $$

(2)

Valiantzas method without wind speed (Valiantzas-2)

$$ \mathrm{E}{\mathrm{T}}_0=0.0393{R}_s\sqrt{T+9.5}-0.19{R_s}^{0.6}{\phi}^{0.15}+0.078\left(T+20\right)\left(1-\frac{RH}{100}\right) $$

(3)

Valiantzas without relative humidity and wind speed (Valiantzas-3)

$$ \mathrm{E}{\mathrm{T}}_0=0.0393{R}_s\sqrt{T+9.5}-0.19{R_s}^{0.6}{\phi}^{0.15}+0.0061\left(T+20\right){\left(1.12\ T-{T}_{\mathrm{min}}-2\right)}^{0.7} $$

(4)

where, T_min: the minimum air temperature (°C), Rs: the solar radiation (MJ/m²/month), ϕ: latitude of station (rad), and RH: the mean relative humidity (%).

Turc method

The Turc method (Turc 1961) is a simplified version of the Makkink method (Makkink 1957), and demands mean air temperature, solar radiation, and relative humidity to calculate the reference evapotranspiration. The implementation of the Turc method is expressed as follows:

$$ {\mathrm{ET}}_o=\frac{23.89{R}_s+50}{\uplambda}\ast \frac{T}{T+15}0.0133\ast {a}_T $$

(5)

$$ \mathrm{If}\ \mathrm{RH}\ge 50\%\kern0.5em \mathrm{then}\ {a}_T=1+\frac{50-\mathrm{RH}}{70} $$

(6)

$$ \mathrm{Else},\mathrm{RH}<50\%\mathrm{then}\ {a}_T=1 $$

(7)

where λ is the latent heat of the evaporation (MJ/kg).

Ritchie method

The Ritchie method, suggested by Jones and Ritchie (1990), to calculate the potential atmospheric evaporative demand termed as reference evapotranspiration (ET_o) can be described as follows:

$$ \mathrm{E}{\mathrm{T}}_0={\upalpha}_1\left[\ 3.87\ast {10}^{-3}.{R}_s.{\left(0.6{T}_{\mathrm{max}}+0.4{T}_{\mathrm{min}}+29\right)}^1\right] $$

(8)

$$ {\upalpha}_1=1.1\ \mathrm{If}\ 5{}^{\circ}\mathrm{C}<{\mathrm{T}}_{\mathrm{max}}\le 35{}^{\circ}\mathrm{C} $$

(9)

$$ {\upalpha}_1=1.1+0.05\ast \left({\mathrm{T}}_{\mathrm{max}}\hbox{--} 35\right)\ \mathrm{If}\ {\mathrm{T}}_{\mathrm{max}}>35{}^{\circ}\mathrm{C} $$

(10)

$$ \mathrm{Else},{\mathrm{T}}_{\mathrm{max}}<5{}^{\circ}\mathrm{C}\kern1em {\upalpha}_1=0.010\mathrm{exp}\ \left[0.18\ast \left({\mathrm{T}}_{\mathrm{max}}+20\right)\right] $$

(11)

Thornthwaite method

Thornthwaite (Thornthwaite 1948) developed a formula for estimating monthly ET_o based only on the mean air temperature and given by:

$$ {\mathrm{ET}}_{\mathrm{tw}}=\left[1.6{\left(\frac{10{T}_a}{I}\right)}^a\right] $$

(12)

$$ I=\sum \limits_{n=1}^{12}{\left(0.20{T}_a\right)}^{1.514} $$

(13)

$$ a=0.49239+6.75\ast {10}^{-7}{I}^3-7.71\ast {10}^{-5}{I}^2+1.7912\ast {10}^{-2}I $$

(14)

where T_a: monthly mean temperature (°C); and, I: annual heat index. The temperature-based methods, in most cases, overestimate or underestimate ET_o obtained using the Penman method. Allen et al. (1994) prescribed that empirical models can be calibrated utilizing the FAO-56 PM. ET_o is computed as follows:

$$ {{\mathrm{ET}}_{\mathrm{o}}}^{\mathrm{Target}}=\upalpha \ast {\mathrm{ET}}_{\mathrm{tw}} $$

(15)

where α is a calibration factor, ET_o^Target denotes the FAO-56 PM reference evapotranspiration and ET_tw is the ET_o estimated by the Thornthwaite formula (Eq. 12).

Machine learning approaches

Artificial neural network (ANN)

Artificial neural network was presented by McCulloch and Pitts (1943). The ANN is a numerical model that imitates the aptitude of the human brain to learn and can accurately learn the complex relationships (Haykin 1998). The multi-layer perceptron (MLP) with training algorithms of Levenberg-Marquardt (LM) is selected to evaluate the ET_o for the present study. The MLP is a kind of ANN with three-layer: input layer, hidden layer, and output layer connected to each other with weights and biases. MLP was selected to evaluate ET_o due to its common use in the literature (Kisi 2007; Khatibi et al. 2017; Ghorbani et al. 2018). The LM algorithm, which is more effective than the normal gradient descent (Kisi 2007), is utilized to optimize network weights. Figure 2 a shows the proposed ANN structure. The explicit expression for calculating the output value, ET_o, using MLP can be given as follows:

$$ {\mathrm{ET}}_o={F}_2\left[{\sum}_{i=1}^m{W}_{kj}\times {F}_1\left({\sum}_{i=1}^n{X}_i{W}_{ji}+{b}_j\right)+{b}_o\right] $$

(16)

The sum of the inputs and their weights lead to a summation operation, which can be expressed as:

$$ {A}_j={\sum}_{i=1}^n{X}_i{W}_{ji}+{b}_j $$

(17)

$$ \mathrm{So};\kern0.5em {ET}_o={F}_2\left[{\sum}_{i=1}^m{W}_{kj}\times {F}_1\left({A}_j\right)+{b}_o\right] $$

(18)

$$ {F}_1\left({A}_j\right)=\frac{1}{1+\exp \left(-{A}_j\right)} $$

(19)

where ET_o: the reference evapotranspiration calculated using ANNs, x_i is the variables of input layer, w_ij is the weight connection between input and hidden neuron, w_jk is the weight of hidden neuron j to the output neuron k. b_j and b_o are the bias of hidden and output layers, respectively. F₁ is the sigmoid activation function, represented by Eq. (19) and F₂ is the activation function for the output neuron. In order to avoid the large numbers of trial and error process, Eq. (20) is used to calculate the number of hidden neurons (Faris et al. 2016; Aljarah et al. 2018; Tikhamarine et al. 2020).

$$ m=2\ast n+1 $$

(20)

where, n is the number of inputs, and m is the neurons number in the hidden layer.

Support vector regression

Support vector machine (SVM) was firstly presented by Vapnik in 1995; the idea of SVM is dependent on statistical learning theory and principle of structural risk minimization (Vapnik 1995). Support vector regression (SVR) is a type of regression model that is developed by Smola (1996). The SVR models were developed to resolve forecasting, prediction, and regression problems by combining regression functions with SVM (Smola and Scholkopf 1998). The main aim of the SVR model is to find a function that has the most ε deviation and has to be a linear as possible for all training data points and from the target vectors (Smola 1996). Figure 2 b gives the structural configuration of the SVR model. The SVR regression function is declared as:

$$ f(x)=w\times \phi (x)+b $$

(21)

where w is the weights vector, b is the bias, and ϕ is the transfer function. For minimization of the regularized risk function and get an appropriate SVR function f (x), the regression problem can be expressed as follows:

$$ \operatorname{Minimize}\ \frac{1}{2}{\left\Vert w\right\Vert}^2+C{\sum}_{i=1}^N\left({\zeta}_i+{\zeta}_i^{\ast}\right) $$

(22)

subject to the condition:

$$ \mathrm{for}\ \mathrm{i}=1\ \mathrm{to}\ \mathrm{l}\left\{\begin{array}{c}{y}_i-w\times \varphi \left({x}_i\right)-b\le \varepsilon +{\xi}_i\\ {}w\times \varphi \left({x}_i\right)+b-{y}_i\le \varepsilon +{\xi}_i^{\ast}\\ {}{\xi}_i,{\xi}_i^{\ast}\ge 0,i=1,2,\dots, N\end{array}\right. $$

(23)

where C is a penalty parameter, ξ_i and ξ_0i are the slack variables corresponding boundary values of ε. After obtaining the optimal conditions with the Lagrangian, a nonlinear regression function can be expressed by the following expression:

$$ f(x)={\sum}_{i=1}^n\left({\alpha}_i-{\alpha}_i^{\ast}\right).K\left(x,{x}_i\right)+b $$

(24)

where α_i, α_i* ≥ 0 are the Lagrangian multipliers, K(x, xi) denotes the Kernel function. There are several Kernels such as linear, polynomial, radial basis function (RBF), and sigmoid. According to the literature (Khan and Coulibaly 2006; Yin et al. 2017; Ferreira et al. 2019), it was reported that the radial basis kernel gives the most accurate results. Therefore, the RBF kernel was selected as a kernel function in the current study. The RBF is defined as follow:

$$ K\left({x}_i,{x}_j\right)=\mathit{\exp}\left(-\gamma {\left\Vert {x}_i-x\right\Vert}^2\right) $$

(25)

More details regarding SVM and SVR approaches are available in the technical report: Support vector machines for classification and regression (Gunn 1998).

Grey wolf optimizer

The grey wolf optimizer (GWO) is a new swarm intelligence algorithm that has been suggested by Mirjalili et al. (2014). The GWO algorithm has been effectively used and applied to numerous investigations (Faris et al. 2016; Aljarah et al. 2019; Maroufpoor et al. 2019; Tikhamarine et al. 2019b). The fundamental motivation for the GWO algorithm originated from the social chasing of gray wolves in nature. To map the social hierarchy of the gray wolf, the pack of wolves is grouped into four classes, alpha (α) as the best solution, followed by the second beta (ß) and the third solution is the delta (δ), and the rest of population are called omega (ω). The hunting process of gray wolves (victim tracking, encircling and attacking) is designed on a mathematical basis to design the grey wolf optimizer. The social hierarchy and illustration of the position updating mechanism are represented in Fig. 3 a and b. The gray wolf behavior of encircling the prey can be calculated as:

$$ \overrightarrow{D}=\left|\overrightarrow{C}\ast {\overrightarrow{X}}_P(t)-\overrightarrow{X}(t)\right| $$

(26)

$$ \overrightarrow{X}\left(t+1\right)={\overrightarrow{X}}_P(t)-\overrightarrow{A}\ast \overrightarrow{D} $$

(27)

$$ \overrightarrow{A}=2\overrightarrow{a}\ast {\overrightarrow{r}}_1-\overrightarrow{a} $$

(28)

$$ \overrightarrow{C}=2\ast {\overrightarrow{r}}_2 $$

(29)

where X: vector position of the gray wolf, Xp: vector position of the prey, D is the distance between X and Xp, t is the current iteration number, A and C corresponding component-wise multiplication.

To imitate the hunting conduct of gray wolves, Eqs. (30–34) demonstrate how gray wolves update their locations of α, β, and δ wolves. It is acknowledged that the wolves α, β, and δ are nearest to the prey and draw the rest ω wolves to the prey location. To decide the prey position, the gray wolf population can use the following equations:

$$ {\overrightarrow{D}}_{\alpha }=\left|{\overrightarrow{C}}_1\cdotp {\overrightarrow{X}}_{\alpha }(t)-\overrightarrow{X}\right| $$

(30)

$$ {\overrightarrow{D}}_{\beta }=\left|{\overrightarrow{C}}_2\cdotp {\overrightarrow{X}}_{\beta }(t)-\overrightarrow{X}\right| $$

(31)

$$ {\overrightarrow{D}}_{\delta }=\left|{\overrightarrow{C}}_3\cdotp {\overrightarrow{X}}_{\delta }(t)-\overrightarrow{X}\right| $$

(32)

$$ {\overrightarrow{X}}_1={\overrightarrow{X}}_{\alpha }(t)-{\overrightarrow{A}}_1\ast {\overrightarrow{D}}_{\alpha } $$

(33)

$$ {\overrightarrow{X}}_2={\overrightarrow{X}}_{\beta }(t)-{\overrightarrow{A}}_2\ast {\overrightarrow{D}}_{\beta } $$

(34)

$$ {\overrightarrow{X}}_3={\overrightarrow{X}}_{\delta }(t)-{\overrightarrow{A}}_3\ast {\overrightarrow{D}}_{\delta } $$

(35)

The obtained positions from Eqs. (33–35) are used to change the next position of wolves by Eq. (36):

$$ \overrightarrow{X}\left(t+1\right)=\frac{{\overrightarrow{X}}_1+{\overrightarrow{X}}_2+{\overrightarrow{X}}_3}{3} $$

(36)

Where $ \overrightarrow{X}\left(t+1\right) $ is the next iteration position. Finding a new location for the leading wolves using Eq. (36) forces the Omega wolves to update their positions to converge with prey.

Particle swarm optimization

The particle swarm optimization (PSO) algorithm is a swarm intelligence algorithm proposed by Kennedy and Eberhart (1995). The PSO algorithms mimic the behavior of birds in nature and can be considered as one of the most popular algorithms in the metaheuristic’s literature. Owing to its performance to find an optimal solution, PSO has been effectively utilized for solving optimization problems and find optimal solutions based on two main parameters: position (x) and velocity (V). The velocity could be updated according to the following equation:

$$ {V}_i\left(k+1\right)=w{V}_i(k)+{r}_1{c}_1\Big({x}_{{\mathrm{pbest}}_i}-{x}_i(k)+{r}_2{c}_2\left({x}_{{\mathrm{gbest}}_i}-{x}_i(k)\right) $$

(37)

where pbest is the best position of the ith particle, and gbest is the global best value attained by the different particles.

The x_i(k) is the position of the particle at time step (k), V_i(k) is the velocity of the particle i at the time (k), w is the coefficient of inertia, r₁ and r₂ are random coefficients, C₁ and C₂ are the acceleration coefficients and V_i(k + 1) is the newly updated velocity. The value of w can be determined as follows (Kennedy and Eberhart 1995):

$$ w={w}_{\mathrm{max}}-\frac{w_{\mathrm{max}}-{w}_{\mathrm{min}}}{{\mathrm{iter}}_{\mathrm{max}}}.\mathrm{iter} $$

(38)

where w_min is the minimum weight and w_max is the maximum weight, iter is the iteration number and the iter_max is the maximum iteration number. The particles are transformed into their new locations using the following equation:

$$ {x}_i\left(k+1\right)={x}_i(k)+{V}_i\left(k+1\right) $$

(39)

Genetic algorithm

The genetic algorithm (GA) is an evolutionary algorithm dependent on direct similarity of Darwinian natural genetics and selection in biological systems (Goldberg 1989; Holland 1992). This algorithm mimics a technique of natural selection and genetics for finding solutions to a problem based on employing three major operators to optimize chromosomes in each generation, such as selection, crossover, and mutation that recombined and modified chromosomes. In this operation, each of the new individuals makes genetic changes between the two individuals. A mutation factor is utilized to change chromosomes and convert genes to make decent diversity. Crossover is considered based on the following relationships:

$$ {\mathrm{Pop}}_i^{\mathrm{new}}={\alpha \mathrm{Pop}}_i^{\mathrm{old}}+\left(1-\alpha \right){\mathrm{Pop}}_j^{\mathrm{old}} $$

(40)

$$ {\mathrm{Pop}}_j^{\mathrm{new}}={\alpha \mathrm{Pop}}_j^{\mathrm{old}}+\left(1-\alpha \right){\mathrm{Pop}}_i^{\mathrm{old}} $$

(41)

where α was a random value, $ {\mathrm{Pop}}_i^{\mathrm{new}} $was the i^th child, $ {\mathrm{Pop}}_i^{\mathrm{old}} $ was the i^th parent, $ {\mathrm{Pop}}_j^{\mathrm{new}} $was the j^th parent, $ {\mathrm{Pop}}_j^{\mathrm{old}} $was the j^th child. The mutation is based on Eq. (42):

$$ {\mathrm{Pop}}_{j,i}^{\mathrm{new}}={\mathrm{Var}}_{j,i}^{\mathrm{low}}+\beta \left({\mathrm{Var}}_{j,i}^{\mathrm{hi}}-{\mathrm{Var}}_{j,i}^{\mathrm{low}}\right) $$

(42)

where β is a random value from 0 to 1, $ {\mathrm{Pop}}_{j,i}^{\mathrm{new}} $is the new gene i^th in the j^th chromosome, $ {\mathrm{Var}}_{j,i}^{\mathrm{hi}} $is the upper limit of the i^th gene in the j^th chromosome, $ {\mathrm{Var}}_{j,i}^{\mathrm{low}} $is the lower limit of the i^th gene in the chromosome j^th.

Hybrid SVR models

Usually, the SVR models achieve high performance in modeling linear and nonlinear relationships. Nevertheless, the accuracy of the support vector regression is dependent on the correct selection of parameters (C, ε, and γ). These three parameters are known to vary in a very wide range and to significantly affect the accuracy of the SVR. According to the literature, there is no fixed rule to select these parameters; thus, finding optimal parameters is computationally hard and can be considered as an optimization problem. Therefore, we have applied hybrid optimization algorithms (GWO, PSO, and GA) for this issue. Further, the initial parameters of the proposed algorithm are presented in Table 3. In the current study, the SVR model was implemented utilizing the LIBSVM software (version 3.23) promoted by Chang and Lin (2011). Furthermore, the RBF kernel was selected and ε-SVR was executed. The flowchart of the proposed hybrid SVR-GWO is shown in Fig. 4.

Table 3 Initial parameters of the GWO, GA, and PSO

Full size table

Performance evaluation indicators

The performance of empirical methods and developed artificial intelligence models was evaluated in this study with respect to the root mean squared error (RMSE), Nash–Sutcliff efficiency (NSE), Pearson correlation coefficient (PCC), and Willmott Index (WI), which can be expressed as:

I.
Root mean squared error (Malik and Kumar 2015; Malik et al. 2019b; Adnan et al. 2019):

$$ \mathrm{RMSE}=\sqrt{\frac{1}{n}{\sum}_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)\right)}^2} $$

(43)

II.
Nash-Sutcliffe efficiency (Nash and Sutcliffe 1970):

$$ \mathrm{NSE}=1-\frac{\sum_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)\right)}^2}{\sum_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\overline{\Big(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}}\right)}^2} $$

(44)

III.
Pearson correlation coefficient (Malik et al. 2017a, b; Pham et al. 2019):

$$ \mathrm{PCC}=\frac{\frac{1}{n}{\sum}_{i=1}^n\left(\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\overline{\Big(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}}\right)\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)-\overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}}\right)\right)}{\sqrt{\frac{1}{n}{\sum}_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}}\right)}^2}+\sqrt{\frac{1}{n}{\sum}_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)-\overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}}\right)}^2}} $$

(45)

IV.
Willmott Index (Willmott 1981; Malik et al. 2019c):

$$ \mathrm{WI}=1-\left[\frac{\sum_{i=1}^n{\left(\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)\right)}^2}{\sum_{i=1}^n{\left(\left|\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}(i)-\overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}}\right|+\left|\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}(i)-\overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}}\right|\right)}^2}\right] $$

(46)

where ET_{o_obt} is the monthly reference evapotranspiration obtained through the FAO-56 PM, ET_{o_est} is the estimated monthly reference evapotranspiration using other applicable models, n is the number of data points, and $ \overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{obt}}} $ and $ \overline{\mathrm{E}{\mathrm{T}}_o{\_}_{\mathrm{est}}} $ are the mean values of obtained and estimated ET_o, respectively. A model with a lower RMSE value and a higher R, Nash, and WI can be considered as the best model for the estimation of ET_o.

Results and discussion

The developed hybrid SVR-GWO model was implemented with different combinations of input variables and compared with SVR-PSO, SVR-GA, ANN, and traditional empirical climate-based models (Valiantzas-1, Valiantzas-2, Valiantzas-3, Turc, Ritchie, and Thornthwaite) using training and testing (November 2009 to December 2013) datasets at Algiers, Tlemcen, and Annaba stations. Tables 4, 5, and 6 summarize the overall performance of all applied models (hybrid AI and climate-based models) during the testing period for estimation of monthly ET_o of the three climatic stations. In the tables, “5-11-1” indicates a neural network structure having 5, 11, and 1 neurons in the input, hidden, and output layers, respectively. The comparison results revealed that the SVR combined with the GWO algorithm outperformed SVR-GA, SVR-PSO, ANN, and empirical models whatever the station and the input variables used. So, in terms of the optimization algorithms, GWO seems to be the most efficient algorithm leading to the most accurate hybrid models, whereas, GA and PSO algorithms give almost the same performance as the empirical models. SVR-GWO-1 model improved the RMSE accuracy of SVR-GA-1, SVR-PSO-1, ANN-1, and Valiantzas-1 by 49%, 54%, 43%, and 65% in Algiers Station, by 81%, 55%, 73%, and 49% in Tlemcen Station, and by 70%, 85%, 83%, and 86% in Annaba Station, respectively. Similarly, SVR-GWO with other input combinations (models 2, 3, and 4) also surpassed the corresponding SVR-GA, SVR-PSO, ANN, and empirical models by producing the least RMSE and the highest NSE, PCC, and WI values during the testing period in all stations.

Table 4 RMSE, NSE, PCC, and WI values during testing period of SVR-GWO, SVR-GA, SVR-PSO, ANN, Valiantzas, Turc, Ritchie, and Thornthwaite models at Algiers Station

Full size table

Table 5 RMSE, NSE, PCC, and WI values during testing period of SVR-GWO, SVR-GA, SVR-PSO, ANN, Valiantzas, Turc, Ritchie, and Thornthwaite models at Tlemcen Station

Full size table

Table 6 RMSE, NSE, PCC, and WI values during testing period of SVR-GWO, SVR-GA, SVR-PSO, ANN, Valiantzas, Turc, Ritchie, and Thornthwaite models at Annaba Station

Full size table

Furthermore, a reduction in prediction accuracy (NSE) on average about 0.06% has been achieved by the SVR-GWO hybrid models with five input variables regarding those with four input variables and about 4% compared to those with three input variables, which indicates that reliable ET_o estimation can be obtained with only temperature and solar radiation as input variables to the SVR-GWO. It should be also noted that more important differences have been obtained with the other hybrid AI models and empirical models. This research complements the previous work carried out by the authors (Tikhamarine et al. 2019a) which emphasized the overall significance and the practical importance of the GWO algorithm in enhancing the prediction capability of AI models.

The obtained results have well agreement with those of previous studies (Huang et al. 2019; Keshtegar et al. 2019; Nourani et al. 2019; Valipour et al. 2019; Wu et al. 2019; Malik et al. 2019a) which they all reported the application of hybrid AI models and their superior accuracy compared to empirical methods for ET_o estimation in different climates over the globe. The other important information which can be derived from Tables 4, 5, and 6 that the accuracy of the models considerably increases by adding more variable; for example, in the Algiers, Tlemcen, and Annaba stations, the accuracy of SVR-GWO model increases by 63%, 66%, and 58% including R_s variable in input (SVR-GWO-2), by 57%, 48%, and 90% including RH variable in input (SVR-GWO-3), 43%, 61%, and 32% including U_s variable in input (SVR-GWO-4), respectively. The estimation results of ET_o using the empirical models for the three stations have generally revealed that reliable estimates can be achieved by the radiation-based models, i.e., Valiantzas, Ritchie, and Turc models. In fact, the radiation-based models used in this study were found to have better performances than the temperature-based model (Thornthwaite); they increased the prediction accuracy by up to 30% in all the stations. This could be due to the fact that the stations are located in a semi-arid Mediterranean climate of mild cold winter and hot dry summer, where atmospheric conditions other than temperature are more favorable to evaporation and transpiration. In another hand, the temperature-based Thornthwaite model gives more accurate estimates in Annaba and Tlemcen stations than those obtained in Algiers station. This result is probably due to the differences noticed in Algiers station between the training and testing conditions especially the temperatures which influence the testing results. Lower correlation between T_min/T_max and ET_o (0.769/0.867) for Algiers compared to those of Tlemcen (0.777/0.871) and Annaba (0.828/0.915) might be another reason for this difference.

Figures 5, 6, and 7 a to d show the temporal distribution and scatter plots of the FAO-56 PM and estimated monthly ET_o of SVR-GWO, SVR-PSO, SVR-GA, ANN, Valiantzas, Turc, Ritchie, and Thornthwaite models during the testing period of three stations. It was noticed from the figures that the SVR-GWO-1 closely follows the corresponding FAO-56 PM ET_o values and less scattered estimates compared to other methods. It is also clear that the empirical models cannot catch the target values and produce less accurate results than the hybrid AI models. For example, for the Algiers Station, the coefficient of determination (R²) of the SVR-GWO-1, SVR-PSO-1, SVR-GA-1, ANN-1, and Valiantzas-1 models varies from 0.9955 to 0.9884 (Fig. 5a); the R² of the SVR-GWO-2, SVR-PSO-2, SVR-GA-2, ANN-2, Turc, and Valiantzas-2 models slightly decreases and varies from 0.9866 to 0.9744 (Fig. 5b); the R² of the SVR-GWO-3, SVR-PSO-3, SVR-GA-3, ANN-3, Ritchie and Valiantzas-3 models varies from 0.9290 to 0.8987 (Fig. 5c); and the R² of the SVR-GWO-4, SVR-PSO-4, SVR-GA-4, ANN-4, and Thornthwaite models varies from 0.5287 to 0.5771 (Fig. 5d), respectively. These results indicate that more input variables provide better efficiency in the ET_o estimation. The worst results are those with only temperatures (max and min) as inputs; however, with three input variables adding only the solar radiation R_s to the minimum and maximum temperatures as input variables, the accuracy of the models increases considerably. This result is in great agreement with those obtained with the empirical methods which indicate the superiority of the radial-based models on the temperature-based one, so all AI-hybrid models including solar radiation have led to the best results.

With regard to the performance of the applied models, the hierarchical performance for Algiers Station follows the order: SVR-GWO-1 > ANN-1 > SVR-GA-1 > SVR-PSO-1 > Valiantzas-1; SVR-GWO-2 > SVR-GA-2 > ANN-2 > Valiantzas-2 > SVR-PSO-2 > Turc; SVR-GWO-3 > SVR-PSO-3 > SVR-GA-3 > ANN-3 > Ritchie > Valiantzas-3; and SVR-GWO-4 > SVR-PSO-4 > SVR-GA-4 > ANN-4 > Thornthwaite, respectively. This classification slightly differs for the other two stations, but with remarkable superiority of the hybrid model SVR-GWO compared to the other hybrid AI models or the empirical models.

Taylor diagram was utilized to evaluate the performance of implemented models which can be summarized by the multiple aspects like standard deviation (SD), RMSE, and coefficient of correlation (COC) in a single frame through the polar plot. Figures 8 a–d, 9 a–d, and 10 a–d show the Taylor diagrams of the observed and estimated monthly ET_o values for the three climatic stations, by using the SVR-GWO, SVR-PSO, SVR-GA, ANN, Valiantzas, Turc, Ritchie, and Thornthwaite models during the testing period, respectively. It was observed from Fig. 8 a–d that the estimates of SVR-GWO models are close to the observed ET_o values with the least RMSE and SD, and the highest COC at Algiers station. It can be noticed that the same performances have been observed for the other two stations (Tlemcen and Annaba).

Feng et al. (2017) employed two machine learning methods, random forests (RF), and generalized regression neural networks (GRNN), for modeling ET_o of two stations in China using inputs of maximum/minimum air temperature, relative humidity, solar radiation, and wind speed. They obtained NSE for the RF and GRNN as 0.978 and 0.971 for the Chengdu Station and 0.987 and 0.982 for the Nanchong station, respectively. Khosravi et al. (2019) estimated ET_o of Baghdad and Mosul stations in Iraq using Nine models, including five data mining algorithms, i.e., M5P, random forest (RF), random tree (RT), reduced error pruning tree (REPT), and Kstar, and four neuro-fuzzy systems (ANFIS, ANFIS-differential evolution, ANFIS-genetic algorithm, and ANFIS-imperialistic competitive algorithm) with sunshine hours, maximum and minimum temperatures, and relative humidity inputs and they obtained NSE for the optimal M5P, RF, RT, REPT, Kstar, ANFIS, ANFIS-DE, ANFIS-GA, and ANFIS-ICA as 0.93, 0.94, 0.85, 0.91, 0.95, 0.90, 0.94, 0.95, and 0.94, respectively. In the current work, the NSE values of the best SVR-GWO model are 0.995, 0.999, and 0.999 for the Algiers, Tlemcen, and Annaba stations, respectively. This comparison recommends the usefulness of the hybrid SVR-GWO method in estimating ET_o.

Overall results indicated that the SVR-GWO model can better analyze the relationship between ET_o and other climate variables than the SVR-PSO, SVR-GA, ANN, and empirical methods. Since there are a large number of local optimal solutions in search spaces such as these, the search is more difficult and solutions tend to fall into local solutions, the movement of search agents to the global optimal level is very important. This requires to need a powerful algorithm to avoid this problem and ultimately find global optimization. The appropriate balance between exploration and exploitation by adopting an effective mechanism, such as a decrease during iterations (Fig. 3), ensures avoidance of large numbers of local solutions. This is one of the most important characteristics of the GWO algorithm that allows it to achieve the best global solution and allows the SVR-GWO model to perform better. Moreover, the GWO algorithm contains only one parameter (A) that has to be adapted (Table 3) compared to PSO and GA, which have several parameters that need to be adapted (C₁, C₂, w_max, w_min for PSO and mutation, crossover for GA algorithm). With a large number of algorithm parameters, search space and optimization process become more difficult either. The SVR-GWO can be considered as an alternative tool for estimating reference evapotranspiration regarding the availability of the data.

Conclusion

The main purpose of this study was to investigate the potential ability of hybrid SVR-GWO model for estimating monthly reference evapotranspiration (ET_o) using climatic variables at three stations: Algiers, Tlemcen, and Annaba located in the north of Algeria. The SVR-GWO performance was compared with those of SVR-PSO, SVR-GA, ANN, and traditional climate-based models (Turc, Ritchie, Thornthwaite, and three versions of Valiantzas methods). According to the obtained results, the following conclusions have been extracted from this investigation:

i.
The artificial intelligence models (i.e., SVR-GWO, SVR-PSO, SVR-GA, ANN) exhibited better performance compared with traditional empirical methods (i.e., Valiantzas, Turc, Ritchie, and Thornthwaite) at all studied stations.
ii.
The SVR-GWO model had better accuracy than the other models in all scenarios at Algiers, Tlemcen, and Annaba stations.
iii.
The SVR-GWO with five inputs variable (T_max, T_min, R_s, U_s, and RH) exposed the feasible model in estimation ET_o.
iv.
The efficiency of the GWO algorithm also found to be better than the PSO and GA algorithms.
v.
The traditional empirical methods used in this study, except the Thornthwaite model, can provide reliable ET_o estimates. Moreover, it should be highlighted that the Valiantzas methods showed better performance over the other traditional methods (Turc, Ritchie, and Thornthwaite) at the study stations and can be considered as good alternatives for ET_o estimation in these regions.

Three stations were used in this study, and in future work, more stations will be considered from different locations in the world to draw generalized conclusions about the performance of hybrid artificial intelligence models.

References

Adnan RM, Malik A, Kumar A, Parmar KS, Kisi O (2019) Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arab J Geosci 12:606–614. https://doi.org/10.1007/s12517-019-4781-6
Article Google Scholar
Aljarah I, Faris H, Mirjalili S (2018) Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput. https://doi.org/10.1007/s00500-016-2442-1
Aljarah I, Faris H, Mirjalili S, al-Madi N, Sheta A, Mafarja M (2019) Evolving neural networks using bird swarm algorithm for data classification and regression applications. Cluster Comput 22:1317–1345. https://doi.org/10.1007/s10586-019-02913-5
Article Google Scholar
Allen RG, Pereira LS, Raes D, Smith M (1998) Crop evapotranspiration: guidelines for computing crop requirements. Irrig Drain Pap No 56, FAO. https://doi.org/10.1016/j.eja.2010.12.001
Allen RG, Smith M, Pereira LS, Perrier A (1994) An update for the calculation of reference evapotranspiration. ICID Bull
Google Scholar
Banda P, Cemek B, Küçüktopcu E (2018) Estimation of daily reference evapotranspiration by neuro computing techniques using limited data in a semi-arid environment. Arch Agron Soil Sci. https://doi.org/10.1080/03650340.2017.1414196
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol. https://doi.org/10.1145/1961189.1961199
Chia MY, Huang YF, Koo CH, Fung KF (2020) Recent advances in evapotranspiration estimation using artificial intelligence approaches with a focus on hybridization techniques—a review. Agronomy
Book Google Scholar
Citakoglu H, Cobaner M, Haktanir T, Kisi O (2014) Estimation of monthly mean reference evapotranspiration in Turkey. Water Resour Manag. https://doi.org/10.1007/s11269-013-0474-1
Djaman K, Rudnick D, Mel VC et al (2017) Evaluation of Valiantzas’ simplified forms of the FAO-56 Penman-Monteith reference evapotranspiration model in a humid climate. J Irrig Drain Eng. https://doi.org/10.1061/(asce)ir.1943-4774.0001191
Faris H, Aljarah I, Mirjalili S (2016) Training feedforward neural networks using multi-verse optimizer for binary classification problems. Appl Intell. https://doi.org/10.1007/s10489-016-0767-1
Faris H, Aljarah I, Al-Betar MA, Mirjalili S (2018) Grey wolf optimizer: a review of recent variants and applications. Neural Comput Appl 30:413–435. https://doi.org/10.1007/s00521-017-3272-5
Feng Y, Cui N, Gong D et al (2017) Evaluation of random forests and generalized regression neural networks for daily reference evapotranspiration modelling. Agric Water Manag. https://doi.org/10.1016/j.agwat.2017.08.003
Ferreira LB, da Cunha FF, de Oliveira RA, Fernandes Filho EI (2019) Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—a new approach. J Hydrol. https://doi.org/10.1016/j.jhydrol.2019.03.028
Ghorbani MA, Deo RC, Yaseen ZM et al (2018) Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: case study in North Iran. Theor Appl Climatol 133:1119–1131. https://doi.org/10.1007/s00704-017-2244-0
Article Google Scholar
Goldberg DE (1989) Genetic algorithms in search, optimization, and machine learning. Addison-Wesley Longman Publ Co Inc Bost MA USA
Granata F (2019) Evapotranspiration evaluation models based on machine learning algorithms—a comparative study. Agric Water Manag. https://doi.org/10.1016/j.agwat.2019.03.015
Gunn S (1998) Support vector machines for classification and regression. Univ Southapt, Image Speech Intell Syst Res Group
Google Scholar
Haykin S (1998) Neural networks: a comprehensive foundation
Heddam S, Watts MJ, Houichi L, Djemili L, Sebbar A (2018) Evolving connectionist systems (ECoSs): a new approach for modeling daily reference evapotranspiration (ET0). Environ Monit Assess 190:516. https://doi.org/10.1007/s10661-018-6903-0
Article Google Scholar
Holland JH (1992) Genetic algorithms. Sci Am 267:66–73
Article Google Scholar
Huang G, Wu L, Ma X et al (2019) Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. J Hydrol. https://doi.org/10.1016/j.jhydrol.2019.04.085
Jones JW, Ritchie JT (1990) Crop growth models. In: Hoffman GJ, Howel TA, Solomon KH (eds) Management of farm irrigation systems. ASAE, USA, pp 63–69
Google Scholar
Kennedy J, Eberhart R (1995) Proceedings of ICNN’95—International Conference on Neural Networks. Particle Swarm Optimization, In
Google Scholar
Keshtegar B, Kisi O, Zounemat-Kermani M (2019) Polynomial chaos expansion and response surface method for nonlinear modelling of reference evapotranspiration. Hydrol Sci J. https://doi.org/10.1080/02626667.2019.1601727
Khan MS, Coulibaly P (2006) Application of support vector machine in lake water level prediction. J Hydrol Eng. https://doi.org/10.1061/(ASCE)1084-0699(2006)11:3(199
Khatibi R, Ghorbani MA, Pourhosseini FA (2017) Stream flow predictions using nature-inspired firefly algorithms and a multiple model strategy—directions of innovation towards next generation practices. Adv Eng Informatics. https://doi.org/10.1016/j.aei.2017.10.002
Khosravi K, Daggupati P, Alami MT et al (2019) Meteorological data mining and hybrid data-intelligence models for reference evaporation simulation: a case study in Iraq. Comput Electron Agric. https://doi.org/10.1016/j.compag.2019.105041
Kisi O (2016) Modeling reference evapotranspiration using three different heuristic regression approaches. Agric Water Manag. https://doi.org/10.1016/j.agwat.2016.02.026
Kisi O (2007) Evapotranspiration modelling from climatic data using a neural computing technique. Hydrol Process. https://doi.org/10.1002/hyp.6403
Kisi O, Sanikhani H, Zounemat-Kermani M, Niazi F (2015) Long-term monthly evapotranspiration modeling by several data-driven methods without climatic data. Comput Electron Agric. https://doi.org/10.1016/j.compag.2015.04.015
Landeras G, Bekoe E, Ampofo J, Logah F, Diop M, Cisse M, Shiri J (2018) New alternatives for reference evapotranspiration estimation in West Africa using limited weather data and ancillary data supply strategies. Theor Appl Climatol 132:701–716. https://doi.org/10.1007/s00704-017-2120-y
Article Google Scholar
Makkink GF (1957) Testing the Penman formula by means of lysismeters. Int Water Eng
Google Scholar
Malik A, Kumar A (2015) Pan evaporation simulation based on daily meteorological data using soft computing techniques and multiple linear regression. Water Resour Manag 29:1859–1872. https://doi.org/10.1007/s11269-015-0915-0
Article Google Scholar
Malik A, Kumar A, Ghorbani MA et al (2019a) The viability of co-active fuzzy inference system model for monthly reference evapotranspiration estimation: case study of Uttarakhand State. Hydrol Res 50:1623–1644. https://doi.org/10.2166/nh.2019.059
Article Google Scholar
Malik A, Kumar A, Kisi O (2017a) Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate based models. Comput Electron Agric 143:302–313. https://doi.org/10.1016/j.compag.2017.11.008
Article Google Scholar
Malik A, Kumar A, Kisi O, Shiri J (2019b) Evaluating the performance of four different heuristic approaches with Gamma test for daily suspended sediment concentration modeling. Environ Sci Pollut Res 26:22670–22687. https://doi.org/10.1007/s11356-019-05553-9
Article Google Scholar
Malik A, Kumar A, Piri J (2017b) Daily suspended sediment concentration simulation using hydrological data of Pranhita River Basin, India. Comput Electron Agric 138:20–28. https://doi.org/10.1016/j.compag.2017.04.005
Article Google Scholar
Malik A, Kumar A, Singh RP (2019c) Application of heuristic approaches for prediction of hydrological drought using multi-scalar streamflow drought index. Water Resour Manag 33:3985–4006. https://doi.org/10.1007/s11269-019-02350-4
Article Google Scholar
Maroufpoor S, Maroufpoor E, Bozorg-Haddad O et al (2019) Soil moisture simulation using hybrid artificial intelligent model: hybridization of adaptive neuro fuzzy inference system with grey wolf optimizer algorithm. J Hydrol. https://doi.org/10.1016/j.jhydrol.2019.05.045
McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5:115–133. https://doi.org/10.1007/BF02478259
Article Google Scholar
Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw. https://doi.org/10.1016/j.advengsoft.2013.12.007
Mohammadrezapour O, Piri J, Kisi O (2019) Comparison of SVM, ANFIS and GEP in modeling monthly potential evapotranspiration in an arid region (case study: Sistan and Baluchestan Province, Iran). Water Sci Technol Water Supply. https://doi.org/10.2166/ws.2018.084
Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models part I—a discussion of principles. J Hydrol 10:282–290. https://doi.org/10.1016/0022-1694(70)90255-6
Article Google Scholar
Nourani V, Elkiran G, Abdullahi J (2019) Multi-station artificial intelligence based ensemble modeling of reference evapotranspiration using pan evaporation measurements. J Hydrol. https://doi.org/10.1016/j.jhydrol.2019.123958
Nourani V, Hosseini Baghanam A, Adamowski J, Kisi O (2014) Applications of hybrid wavelet–artificial intelligence models in hydrology: a review. J Hydrol 514:358–377. https://doi.org/10.1016/j.jhydrol.2014.03.057
Article Google Scholar
Penman HL (1948) Natural evaporation from open water, bare soil and grass. Proc R Soc Lond A 193:120–145
Article CAS Google Scholar
Pham QB, Abba SI, Usman AG, Linh NTT, Gupta V, Malik A, Costache R, Vo ND, Tri DQ (2019) Potential of hybrid data-intelligence algorithms for multi-station modelling of rainfall. Water Resour Manag 33:5067–5087. https://doi.org/10.1007/s11269-019-02408-3
Article Google Scholar
Saggi MK, Jain S (2019) Reference evapotranspiration estimation and modeling of the Punjab Northern India using deep learning. Comput Electron Agric. https://doi.org/10.1016/j.compag.2018.11.031
Shiri J (2017) Evaluation of FAO56-PM, empirical, semi-empirical and gene expression programming approaches for estimating daily reference evapotranspiration in hyper-arid regions of Iran. Agric Water Manag. https://doi.org/10.1016/j.agwat.2017.04.009
Shiri J (2018) Improving the performance of the mass transfer-based reference evapotranspiration estimation approaches through a coupled wavelet-random forest methodology. J Hydrol 561:737–750. https://doi.org/10.1016/j.jhydrol.2018.04.042
Article Google Scholar
Shiri J (2019) Modeling reference evapotranspiration in island environments: assessing the practical implications. J Hydrol. https://doi.org/10.1016/j.jhydrol.2018.12.068
Shiri J, Marti P, Karimi S, Landeras G (2019a) Data splitting strategies for improving data driven models for reference evapotranspiration estimation among similar stations. Comput Electron Agric. https://doi.org/10.1016/j.compag.2019.03.030
Shiri J, Nazemi AH, Sadraddini AA et al (2014) Comparison of heuristic and empirical approaches for estimating reference evapotranspiration from limited inputs in Iran. Comput Electron Agric. https://doi.org/10.1016/j.compag.2014.08.007
Shiri J, Nazemi AH, Sadraddini AA et al (2013) Global cross-station assessment of neuro-fuzzy models for estimating daily reference evapotranspiration. J Hydrol. https://doi.org/10.1016/j.jhydrol.2012.12.006
Shiri J, Nazemi AH, Sadraddini AA, Marti P, Fakheri Fard A, Kisi O, Landeras G (2019b) Alternative heuristics equations to the Priestley–Taylor approach: assessing reference evapotranspiration estimation. Theor Appl Climatol 138:831–848. https://doi.org/10.1007/s00704-019-02852-6
Article Google Scholar
Smola A (1996) Regression estimation with support vector learning machines. Master’s thesis, Tech Univ M unchen
Smola J, Scholkopf B (1998) A tutorial on support vector regression. R Hollow Coll London, UK, NeuroCOLT Tech,Technical Rep Ser
Tao H, Diop L, Bodian A et al (2018) Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: regional case study in Burkina Faso. Agric Water Manag 208:140–151. https://doi.org/10.1016/j.agwat.2018.06.018
Article Google Scholar
Thornthwaite CW (1948) An approach toward a rational classification of climate. Geogr Rev. https://doi.org/10.2307/210739
Tikhamarine Y, Malik A, Kumar A et al (2019a) Estimation of monthly reference evapotranspiration using novel hybrid machine learning approaches. Hydrol Sci J:1–19. https://doi.org/10.1080/02626667.2019.1678750
Tikhamarine Y, Souag-Gamane D, Kisi O (2019b) A new intelligent method for monthly streamflow prediction: hybrid wavelet support vector regression based on grey wolf optimizer (WSVR–GWO). Arab J Geosci 12:540–520. https://doi.org/10.1007/s12517-019-4697-1
Article Google Scholar
Tikhamarine Y, Souag-Gamane D, Najah Ahmed A et al (2020) Improving artificial intelligence models accuracy for monthly streamflow forecasting using grey wolf optimization (GWO) algorithm. J Hydrol. https://doi.org/10.1016/j.jhydrol.2019.124435
Turc L (1961) Water requirements assessment of irrigation, potential evapotranspiration: simplified and updated climatic formula. Ann Agron 12:13–49
Google Scholar
Valiantzas JD (2013a) Simplified reference evapotranspiration formula using an empirical impact factor for penman’s aerodynamic term. J Hydrol Eng. https://doi.org/10.1061/(ASCE)HE.1943-5584.0000590
Valiantzas JD (2013b) Simple ET0 forms of Penman’s equation without wind and/or humidity data. I: theoretical development. J Irrig Drain Eng 139:1–8. https://doi.org/10.1061/(ASCE)IR.1943-4774.0000520
Article Google Scholar
Valipour M, Sefidkouhi MAG, Raeini-Sarjaz M, Guzman SM (2019) A hybrid data-driven machine learning technique for evapotranspiration modeling in various climates. Atmosphere (Basel). https://doi.org/10.3390/atmos10060311
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Book Google Scholar
Willmott CJ (1981) On the validation of models. Phys Geogr 2:184–194. https://doi.org/10.1080/02723646.1981.10642213
Article Google Scholar
Wu L, Peng Y, Fan J, Wang Y (2019) Machine learning models for the estimation of monthly mean daily reference evapotranspiration based on cross-station and synthetic data. Hydrol Res. https://doi.org/10.2166/nh.2019.060
Yaseen ZM, El-shafie A, Jaafar O et al (2015) Artificial intelligence based models for stream-flow forecasting: 2000-2015. J Hydrol
Yin Z, Wen X, Feng Q et al (2017) Integrating genetic algorithm and support vector machine for modeling daily reference evapotranspiration in a semi-arid mountain area. Hydrol Res. https://doi.org/10.2166/nh.2016.205
Zakhrouf M, Bouchelkia H, Stamboul M (2019) Neuro-fuzzy systems to estimate reference evapotranspiration. Water SA 45:232–238. https://doi.org/10.4314/wsa.v45i2.10
Article Google Scholar

Download references

Author information

Authors and Affiliations

Leghyd Laboratory, Department of Civil Engineering, University of Sciences and Technology Houari Boumediene, BP 32 Al Alia, Babezzouar, Algiers, Algeria
Yazid Tikhamarine & Doudja Souag-Gamane
Department of Soil and Water Conservation Engineering, College of Technology, G.B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, 263145, India
Anurag Malik
School of Technology, Ilia State University, Tbilisi, Georgia
Ozgur Kisi

Authors

Yazid Tikhamarine
View author publications
You can also search for this author in PubMed Google Scholar
Anurag Malik
View author publications
You can also search for this author in PubMed Google Scholar
Doudja Souag-Gamane
View author publications
You can also search for this author in PubMed Google Scholar
Ozgur Kisi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anurag Malik.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Responsible Editor: Marcus Schulz

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tikhamarine, Y., Malik, A., Souag-Gamane, D. et al. Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration. Environ Sci Pollut Res 27, 30001–30019 (2020). https://doi.org/10.1007/s11356-020-08792-3

Download citation

Received: 25 November 2019
Accepted: 06 April 2020
Published: 23 May 2020
Issue Date: August 2020
DOI: https://doi.org/10.1007/s11356-020-08792-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Artificial intelligence models versus empirical equations for modeling monthly reference evapotranspiration

Abstract

Similar content being viewed by others

Estimation of reference evapotranspiration using data driven techniques under limited data conditions

Enhanced Daily Reference Evapotranspiration Estimation Using Optimized Hybrid Support Vector Regression Models

Modeling daily reference evapotranspiration using SVR machine learning algorithm with limited meteorological data in Dar-el-Beidha, Algeria

Explore related subjects

Introduction

Materials and methods

Study area and dataset

Empirical methods

Penman-Monteith equation (FAO-56 PM)

Valiantzas method

Valiantzas method using all climatic data (Valiantzas-1)

Valiantzas method without wind speed (Valiantzas-2)

Valiantzas without relative humidity and wind speed (Valiantzas-3)

Turc method

Ritchie method

Thornthwaite method

Machine learning approaches

Artificial neural network (ANN)

Support vector regression

Grey wolf optimizer

Particle swarm optimization

Genetic algorithm

Hybrid SVR models

Performance evaluation indicators

Results and discussion

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation