Comparative Assessment of Reference Evapotranspiration Estimation Using Conventional Method and Machine Learning Algorithms in Four Climatic Regions

Raza, Ali; Shoaib, Muhammad; Faiz, Muhammad Abrar; Baig, Faisal; Khan, Mudasser Muneer; Ullah, Muhammad Kaleem; Zubair, Muhammad

doi:10.1007/s00024-020-02473-5

Comparative Assessment of Reference Evapotranspiration Estimation Using Conventional Method and Machine Learning Algorithms in Four Climatic Regions

Published: 02 April 2020

Volume 177, pages 4479–4508, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Pure and Applied Geophysics Aims and scope Submit manuscript

Comparative Assessment of Reference Evapotranspiration Estimation Using Conventional Method and Machine Learning Algorithms in Four Climatic Regions

Download PDF

Ali Raza¹,
Muhammad Shoaib¹,
Muhammad Abrar Faiz²,
Faisal Baig¹,
Mudasser Muneer Khan³,
Muhammad Kaleem Ullah⁴ &
…
Muhammad Zubair⁵

889 Accesses
40 Citations
Explore all metrics

Abstract

Reference evapotranspiration (ET_o) is considered as an essential component in hydrological and agro meteorological processes. Its accurate estimation becomes an imperative in the planning and management of irrigation practices. ET_o estimation also plays a vital role in improving the irrigation efficiency, water reuse and irrigation scheduling. The conventional physical model of Penmen Montieth (PM56) developed by Food and agriculture organization (FAO) has been recommended worldwide for ET_o estimation. This model was firstly used in this study to determine ET_o by using required meteorological data and obtained results used as referenced values. Afterward, five data machine learning algorithms/data driven models, support vector machine (SVM), multilayer perceptron (MLP), group method of data handling (GMDH), general regression neural network (GRNN) and cascade correlation neural network (CCNN), were applied to estimate ET_o values. The climatic data of maximum and minimum temperatures, wind speed, average relative humidity and sunshine hours of six stations from Pakistan was used to train and test data driven model. Data driven models were also applied on other climatic stations without training data which lie in China, New Zealand and USA to further validate and investigate their performance. Comparison results indicated that model efficiency (ME) and correlation coefficient (r) of SVM were obtained (ranges: ME = 95–99%; r = 0.96–1) maximum for all the selected stations. Alternatively, model errors (RMSE = 0.016, MSE = 0.0001 & MAE = 0.08) for SVM were found minimum in comparison to GMDH, MLP, CCNN and GRNN. In addition, all data driven models show enough divergence from hyper arid to high humid climate except SVM which shows almost identical results for all the climatic zones in comparison to standard FAO-PM56 method. Finally, it can be concluded that SVM could be considered as a reliable alternative method for ET_o estimation among data driven models.

Artificial Intelligence Based and Linear Conventional Techniques for Reference Evapotranspiration Modeling

Performance evaluation of numerical and machine learning methods in estimating reference evapotranspiration in a Brazilian agricultural frontier

Article 24 September 2020

Monthly Prediction of Reference Evapotranspiration in Northcentral Nigeria Using Artificial Intelligence Tools: A Comparative Study

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The agriculture sector is found to be the huge water consumer in most of the countries. The quantity of water withdrawal from total amount in developing counties is nearly estimated 81% while accounted 71% globally. In addition to use, more than 55% of World's fresh water withdrawals are utilizing for irrigation purposes (Amarasinghe and Smakhtin 2014). It is impossible to yield large amount of food without the irrigation crop field in order to fulfill world's population needs (Lumia et al. 2005). Researches relevant with enhancement of water productivity and improvement of irrigation water management have attained higher importance in irrigation field (Barker et al. 1999). Irrigators and farmers can make real time decision for irrigation on basis of crop water requirements. The amount of water required to irrigate crop field for entire period is called crop water requirements (ET_c) which can be estimated by multiplying crop coefficient (K_c) with reference evapotranspiration (ET_o). Moreover, values of K_c for a particular crop can be determined by dividing crop-ET with the ET_o value (Allen et al. 1998). Therefore, accurate quantification of ET_o is direly needed in determining crop water requirements to overcome less and excessive irrigation problems. As we know, the amount of water given at right time is called irrigation scheduling which seems impossible without accurate estimation of ET_o. Different techniques and methods have been applied for ET_o estimation which include direct, indirect and data driven methods (Karimaldini et al. 2011). The estimation of ET_o through lysimeter is considered as the best choice but difficult mainly because of cost intensive, labor intensive and requires huge amount for properly setup. Consequently, indirect and data driven methods were developed to estimate ET_o by using meteorological data. The indirect methods developed for estimation of ET_o includes: (1) empirical and semi empirical Eq. (2) Pan-evaporation method (3) energy budget method (4) mass transfer Eq. (5) combination Eq. (6) radiation base methods. These methods are found in Xu and Singh (2002) and McMahon et al. (2013).

Food and Agriculture Organization (FAO) of United Nations has introduced one of the illustrious and well-known indirect standard method for ET_o estimation. This method involves incorporation of Penman–Monteith equation which was modified and reformed by Allen et al. (1998) as a reference equation (FAO-PM56). This equation (FAO-PM56) is influenced and controlled by various climatic, aerodynamic and surface resistance parameters. These comprise of maximum and minimum air temperature, maximum and minimum relative air humidity, wind speed and solar radiation, saturation vapor pressure deficit, slope vapor pressure curve and psychometric constant. There is an anomaly to air temperature available at the most weather stations, the remaining variables are often incomplete and not always found reliable for many locations (Rahimikhoob 2010). This point may be not valid for developed countries, but seems a big challenge in case of developing countries like Pakistan, where quantity and quality of data remain always questionable. Trajkovic and Kolakovic (2009) stated that reliable weather data sets of radiation, relative humidity, and wind speed are usually limited in developing countries.

Concerning and regarding the above contexts, there are myriad factors including physical, social and the environmental conditions in case of physical models influencing ET estimation which is the foundation of indirect methods. In addition, the geographic data (latitude, longitude, altitude) is also required for the local adjustment of the different weather parameters e.g. atmospheric pressure, extraterrestrial radiations (R_a) and daylight hours (N) in FAO-PM56 equation. Additionally, the experimental approaches or field measurement efforts are usually time consuming, labor-intensive and post processing output process. Subsequently, it is quite difficult to formulate the ET_o equation to overcome these effects that can produce reliable and verified results (Ventura et al. 1999; A. R. Pereira and Pruitt 2004; Lopez-Urrea et al. 2006; Gavilan et al. 2007).

A few software solutions, namely, DailyET (Hess 1996), Cropwat 8.0 (Swennenhuis 2009) and ET_o calculator (Raes 2009) are efficaciously used to calculate ET_o. These software packages are based upon empirical and semi-empirical equations which require numerous climatic and geographic parameters. Moreover, these software packages will not yield results if even one of the parameter is not available. Thus, the data driven models based on different machine learning algorithms may be utilized as substitute models because of several advantages, for example, no required information of inner framework factors, easier answers for multi-variable issues and realistic calculation. These are innovative approaches which based on computational intelligent system to overcome imprecision and vulnerability in producing results (Chaturvedi 2008; Huang et al. 2010).

The main issue with modeling ET process is its non-linear dynamic and high complexity nature. In this regard, data driven models can be considered and contemplated as proper methods for ET_o estimation. Their ability to deal with complex problems by using only available set of data made them superior to conventional methods. Furthermore, there is also no need to give geographical information or local adjustment in these data driven models. The following literature provides a comprehensive comparison among data driven models and conventional methods in hydrological studies especially for ET_o estimation.

Among various data driven models, artificial neural networks (ANNs); adaptive neuro-fuzzy interference system (ANFIS); gene expression programming (GEP) have been widely applied for ET_o estimation and proved best choice over indirect methods. Various studies reported in literature which used ANN for ET_o estimation and found best performance for ANN data driven model in comparison to indirect methods (Kumar et al. 2002; Sudheer et al. 2003; Trajkovic 2005; Kim and Kim 2008; Landeras et al. 2008; Traore et al. 2010). The potential for the use of ANNs was examined by Rahimikhoob (2010) to estimate ET_o based on air temperature data under humid subtropical conditions on southern coast of Caspian Sea. It was claimed that ANN model found superior in comparison to indirect empirical equations. In addition to ET_o estimation, ANFIS model was also investigated in parallel to other methods (Kişi and Öztürk 2007; Doğan 2009; Cobaner 2011; Aytek 2009; Kisi and Zounemat-Kermani 2014). Kişi and Öztürk 2007 have applied ANFIS model for ET_o estimation in Los Ageles, city of southern California. They concluded that ANFIS can be considered as a good approach in estimating ET_o. Aytek 2009 has investigated the potential of coactive adaptive neuro fuzzy interference system (CANFIS) in three stations for estimating daily ET_o. The results obtained with CANFIS were further compared with some empirical formulas. The study finally affirmed the CANFIS performance is superior as compared to the conventional methods. In similar manner, Karimaldini et al. (2011) have also examined ANFIS approach for estimating ET_o in arid condition by using limited climatic data. The findings confirmed the utmost performance of ANFIS over conventional empirical equations (calibrated FAO-PM56, Hargreaves, Priestley-Tailor, Makkink, and Blaney-Criddle). Cobaner (2011) examined two types of ANFIS based on grid partition (G-ANFIS) and subtractive clustering (S-ANFIS) methods in modeling ET_o by using daily climatic data. The results were compared with different types of ANNs and three empirical models. He claimed that S-ANFIS was performed better than ANNs and empirical formulas in estimation of ET_o. In addition, the study Kisi and Zounemat-Kermani (2014) have also applied G-ANFIS and S-ANFIS models in estimating ET_o with daily climatic data. They found ANFIS models superior to the corresponding conventional methods. Due to good generalization ability and attractive interference, gene expression programing (GEP) attained the higher significance in many hydrological studies e.g. stage–discharge curve, prediction in monthly precipitation (Azamathulla et al. 2011; Kisi and Sanikhani 2015). In addition to literature, the performance of GEP was also investigated in modeling ET_o by using climatic variables (Shiri et al. 2014b, 2014a, 2014c; Guven et al. 2008; Traore and Guven 2012). The studies claimed that the GEP model could be considered as an alternative tool to the selected conventional methods for ET_o estimation.

Nowadays, support vector machine (SVM) and self-organizing networks (SONs) are also considered as the emerging tools in solving complex problems among the data driven models with an efficient outcomes (Asefa et al. 2006; Carpenter and Grossberg 1991, 1992; Lu and Wang 2005; Jorguseski et al. 2014; Østerbø and Grøndalen 2012; Peng et al. 2013). Group method of data handling (GMDH) and cascade correlation neural networks (CCNN) are found as the robust data driven models because of their tendency to develop direct relationship between input and output layers without the hidden layer as required in case of ANNs (Sherrod 2008, Sherrod 2010). Application of SVM, GMDH and CCNN models found in literature is presented thereafter.

Collobert and Bengio (2000) investigated the SVM model for large scale regression poblems. It is claimed that time consumed in performing the analysis (training and testing phases) using SVM model was less than the other selected algorithms. The study Huang et al. (2002) also investigated the performance of SVM model. They found that the classifiers used in SVM model have more tendencies to solve complex non-liner problems as compared to the other classifiers. Kisi (2013) investigated the capability of SVM model for estimating the ET_o by using daily climatic data in two stations of Califormia. The comparison results showed that SVM model performed better than the selected empirical equations. Kisi and Cimen (2009) investigated the potential of SVM model in ET_o estimation using daily meteorological data and compared the results with empirical models (CIMIS Penmene, Hargreaves, Ritchie and Turc methods). They claimed that SVM performed better than empirical models and coud be empolyed as an alternative tool in estimating ET_o Shiri Nazemi et al. (2014a, b, c) applied heuristic approaches (including SVM) in comparison to the empirical and semi-empirical equations for estimation of ET_o. The comparison results confirmed the superiority of heuristic approaches to the existing conventional methods. Tabari et al. (2013) investigated the capability of ANFIS and SVM in potato ET_c estimation. The results of ANFIS and SVM were also compared with the empirical equations. They found that SVM model performed better than ANFIS and corresponding applied empirical equations. Diamantopoulou et al. (2011) examined the potential of back propagation algorithm (BPANN) and CCNN in estimating the daily ET_o using weather data obtained from automatic weather stations (AWS) of Northern Greece. The results indicated that CCNN performed better than BPANN and selected empirical equations. Therefore,it could be considered as an appropriate tool in modeling ET_o process. In another study, Awari (2018) also investigated the capability of CCNN in the estimation of ET_o. The comparison results indicated that CCNN performed best than the other selected methods. Najafzadeh et al. (2014) investgated the potential of GMDH and SVM in estimating the scour depth below the pipelines and compared the results with empirical equations. The comparison results showed that the GMDH performed better than SVM and could be successfully empolyed as an alternative tool to the existing conventional methods. To the best knowledge of the authors, there is not any publised article in the literature which investigate the performance of GMDH model in ET_o estimation.

Thus, it can be perceived from the above citations that data driven models are oftern found best for ET_o estimation. But most of the stuides do not apply devloped data driven modelson other locations, which arise a question reagrding models's perfroamnce. Nowadays, this practice is not justified for data driven modelsin deterinninng their perforamnce by incorporating similar data in trainning and testing phases. Therfore, a new practice must be followed to investigate and authenticate model's performance. According to the knowledge of authors, there is no study conducted in Pakistan that applies data driven modelsfor ET_o estimation. The mian aims of this study were: (1) to determine the ET_o using globally accepted method (FAO-PM56) across a range of climatic regions (2) to investigate the potential of different data driven models(i.e. SVM, GMDH, CCNN, MLP and GRNN) for ET_o estimation (3) to compare ET_o estimation of PM 56 method with that of data driven models.

2 Materials and Methods

2.1 Location and Data Collection of Studied Regions

Pakistan expands north-east to south-west from 23.5 to 37 N and 60 to 75 E in south-west Asia over world map. It is bordered with Afghanistan in north-west, Iran in west, China in far north-east and India in east. The total area of Pakistan is nearly about 80 mha out of which only 37 mha is available for raising crops. A variety of climatic conditions exists in Pakistan. The weather of northern and north-western parts is quite cool because of the presence of snow-packed mountains. Plains of Punjab province are located in the center and are quite hot and semi-arid in nature. Highlands of Balochistan province have cold climate but arid in nature and the coastal areas located in south are warm and humid. Temperature of colder regions located in north falls below freezing point during winter months while a high temperature of about 40 C prevails during summer months in the semi-arid and arid desert regions of Pakistan with a peak approaching 50 C frequently. The northern and western highlands of Pakistan receive an annual rainfall of 1500 mm and 500 mm respectively while desert area receives very less rainfall. Thus, climatic distribution of Pakistan is arid to semi-arid, except a short stretch of temperate in north with wide variations.

The climatic data of six selected cities located in various climatic regions of Pakistan, was used in current study for a duration of 30 years (1987–2016). The data includes long term monthly average values of minimum temperature (T_min), maximum temperature (T_max), average relative humidity (R.H_mean), sunlight hours (N), and average 24 h. wind speed (U). This dataset was divide into sections. First section contains 70% of the data (1987–2007) and used to train each selected model while remaining 30% of the data (2008–2016) employed to test the developed models. The performance of one machine learning based model could be hardly justified if applied to only one location/country. To further test the validity of the developed models, the climatic data of five more cities located in New Zealand, China and USA was also used. It is pertinent to mention here that the data of six cities located in Pakistan was used to train and test the developed models while the data of five cities located in three countries of New Zealand, China and USA was only used to test the developed models. The details of the data of different cities located in Pakistan, New Zealand, China and USA is given in Table 1.

Table 1 The geographic description of selected climatic regions

Full size table

The location of six selected climatic stations of Pakistan is presented in Fig. 1a while Fig. 1b shows the climatic stations of other selected countries. All selected climatic stations of Fig. 1a and b are classified into six climatic zones which can be seen in Table 2. In addition, classifications of these selected climatic stations are based on global aridity index (Todorovic et al. 2013). This index can be calculated by dividing the mean yearly precipitation (P) to the mean yearly ET_o. Additionally, the humid region is further categorized into warm, mild and high humid climatic zones. The division of these zones is based on average annual rainfall and mean temperature data. This is due to the fact that climatic conditions that lie in humid regions can easily be fathomable.

Table 2 Division of climate zones, respective precipitation to ET_o ratios

Full size table

2.2 Adopted Methodology

2.2.1 FAO-PM56 Method

This method is universally accepted as a standard method for the estimation of ET_o. It is a physical method which requires numerous climatic parameters. The method is based on combine equations which estimate ET_o values. The mathematical relation to estimate ET_o by using climatic parameters is given in Eq. 1.

$$ {\text{ET}}_{o} = \frac{{0.408\Delta \left( {R_{n} - G} \right) + \gamma \times \frac{900}{{T_{{{\text{mean}} + 273}} }} \times U_{2} \times \left( {e_{s} - e_{a} } \right)}}{{\Delta + \gamma \left( {1 + 0.34u_{2} } \right)}} $$

(1)

wherein R_n, and G are net radiation (MJ /m²/day) on crop surface and soil warmness flux density (MJ/m²/day), respectively. Additionally, T_mean, U_2, e_s, e_a, e_min, e_max, Δ and γ are means of maximum and minimum air temperatures (°C), air speed (m/s), saturation vapor pressure (kPa), actual vapor pressure (kPa), saturation vapor pressure at minimum air temperature, saturation vapor pressure at maximum air temperature, slope vapor pressure curve (kPa/°C) and psychrometric constant (kPa/°C), respectively. As the value of G is smaller than R_n, it may be taken as zero. The value of Δ can be determined by utilizing the procedure which is enlightened by Gocić et al. (2015). The complete derivation of PM equation either in case of complete or missing data can be found in Allen et al. (1998).

2.2.2 Multilayer perceptron (MLP)

MLP is frequently used neural network in solving complex classification problems. In this network, input is given to network along with target output and weights are adjusted in such a way that network can generate target output with several attempts. The MLP structure consists of an input, a hidden and an output layers. Each layer contains a number of neurons that are linked in same pattern with neurons inside the subsequent layer. Each neuron carries a number of predictor variables from previous layer and outputs to the following layer. Each layer plays a significant role in the overall performance of structure. The neurons of one layer are linked to the neurons existed in other layer through a specific connection called weights. These weights are made accountable to carry results from one to another layer. Output of each neuron within input layer will become an input for the hidden layer. Furthermore, output of each neuron from hidden layer converts into an input for every neuron inside output layer. A structure of MLP neural network with one hidden layer can be seen in Fig. 2a. A function is utilized to process input data among the neurons of each layer known as activation function. Moreover, the sigmoid and linear functions are utilized to process input data from input to hidden layer and hidden to output layer, respectively. The network can be trained by adjusting weights through an error back propagation process explained by Hassanpour Kashani et al. (2008). The present study utilizes scaled conjugate gradient (SCG) method which can be swiftly performed twice as compared to Conjugate gradient and converging up to 20 times than scaled conjugate gradient (Sherrod 2008). Møller (1993) has explained SCG algorithm in detail. development of MLP model is described in the next lines. five meteorological variables (T_max, T_min, R.H_mean, N, U) are fed in an input layer as predictor variables. calculated ET_o is used as a target variable in an output layer. The mapping of input–output relationship with MLP model is presented in Fig. 2b. Six neurons are selected in hidden layer after tuning the neurons. The V-fold cross validation is presently applied to check and validate input data. essential parameters for the development of MLP model are given in Table 3.

Table 3 Parametric values of MLP model after tuning

Full size table

2.2.3 General Regression Neural Networks (GRNN)

Specht (1990) has proposed a method that allows to contrive and formulate the weighted- neighbor method. A general regression neural network (GRNN) can be utilized for regression problems while probabilistic neural network (PNN) is implemented to solve classification problems. This study employs GRNN network in order to estimate ET_o by using meteorological variables. Architecture of GRNN model is shown in Fig. 3.

Development of GRNN model to predict ET_o is as followed: Five meteorological variables, (T_max, T_min, R.H_mean, N, U) are used as predictor variable while ET_o is obtained as a target variable. Gaussian function is chosen as a kernel function which is considered one of the best yet among invented (Sherrod 2008). Eliminating neurons frequently minimize the error of developed model. The procedure to eliminate redundant neurons is considered slow. This is an iterative process due to the fact that model needs to be assessed with each persisting neuron to determine the best one as a way to eliminate. Thus, leave on out validation (LOO) with minimize error method is employed to find optimal number of neurons. The essential parametric values for GRNN model are given in Table 4.

Table 4 Tuning parameters of GRNN model

Full size table

2.2.4 Support Vector Machine (SVM)

Eventually, the concept and idea of learning machine is suggested by Turing (1950). Support vector machine is scrutinized and regarded as a supervised learning model which is built on statistical learning theory. The statistical learning theory is familiarized and made conversant by Vapnik (1998). Support vector regression (SVR) is frequently used to narrate SVM approach. SVR has capability to apply structural risk minimization principle in a reliable way. It is an inductive principle for model selection which is used for finite training data sets. It describes a general model of capacity control and provides a relationship between space complexity and quality of fitting the training data. Moreover, it emphasizes on an upper bound instead of training error in the data set. Minimum value of upper bound produces a best model. Based on this principle, SVR achieves an optimum network structure. More details regarding this principle can be found in Wu et al. (2008). The main objective of SVM modeling is to find optimal hyper plane which isolates the clusters of vector. This can be accomplished by separating variables using linear and non-linear dividing. The division of attributes in non-linear case is more complex than linear case. Furthermore, the separation in high dimensional case is easier than low dimensional. Figure a–d illustrate the pictorial view of linear, complex, low and high dimensional cases, respectively (Han and Kamber 2001; Markowetz 2003). The vectors adjoining to hyper plane are termed as support vectors.

This study employs two kinds of SVR, namely, epsilon (€) and neuron (Nu). There is no general method to determine model of the largest capacity in order to estimate ET_o except iterative method. After a large number of iterations, it was found that €-SVR performed good than Nu-SVR. Therefore, €-SVR is used in SVM model for present study. The accuracy of SVR model depends upon following parameters: penalty factor(C), gamma (¥), P, Nu etc. In addition, there is a dire need to determine proper optimal values for these parameters to achieve satisfactory, better and acceptable results. For example, large value of C creates over fitting problem while a small value of C causes under-fitting the model. Therefore, it is mandatory to select optimal values for all parameters to obtain reliable and better results. Grid and pattern (GAP) search are concurrently applied in this study in order to find optimal parametric values. A grid search tries to attempt values of each parameter across the specified search range. This can be done by using geometric steps. A pattern search starts at the center of search range and makes trial steps in each direction for each parameter. More comprehensive details about these searching methods can be found in Sherrod (2008). The attributes (predictor variable) in SVM model are meteorological variables while ET_o is selected as predicands (target variable). Four kinds of kernel functions (linear, RBF, polynomial and sigmoid) are concerned and applied with V-fold cross validation method. It is used to construct an efficient model. In addition, number of parameters in SVM model largely depends upon the applied kernel function. Thus, Table 5 provides epigrammatic details about the development of SVM model. It also shows optimal values for all required parameters which are needed corresponding to applied function. Moreover, number of parameters which do not show any contribution in the development of SVM model regarding kernel function is represented by symbol "X" and can be seen in Table 5.

Table 5 Optimal ranges of parameters for SVR model

Full size table

2.2.5 Group Method of Data Handling (GMDH)

GMDH model was developed by Ivakhnenko (1968). This algorithm presents a unique approach to the problem of modeling and even a new philosophy to scientific research. It is used in various fields such as data mining, prediction and complex system modeling etc. Li et al. (2017) found GMDH better than other classical forecasting algorithms. A schematic diagram of GMDH is shown in Fig. 5. The network starts with just input neurons. The main layer (at the top) presents one contribution for every indicator variable. Each neuron within second layer draws its inputs from two of the input variables. In similar way, the neurons within third layer draw their inputs from two of the neurons inside preceding layer. This procedure proceeds until second final layers fetch their inputs to the ultimate layer. The final layer (at the lowest) draws its two inputs from past layer and produces a single value that's the output of network. Additionally, the links between neurons inside this network are not fixed. It can be observed in Fig. 5 that neuron at the left end of third layer is linked with an input variable rather than output of preceding layer. Thus, current study applies GMDH model in order to draw a relationship between climatic variables and ET_o. Five meteorological variables, symbolically, T_max, T_min, RH_avg, U, N are used as predictor variables. The output layer generates ET_o as a target value. Various types of functions such as linear, quadratic, cubic, asymptotic etc. (Table 6) are used in GMDH model to improve model's accuracy. combination of two quadratic functions (1 variable and 2 variable) is applied in the training and testing process for estimation of ET_o. results generated with this combination of functions were found the best among other functions. Table 6 indicates essential parametric values for development of GMDH model.

Table 6 Essential parameters and list of functions for GMDH model

Full size table

2.2.6 Cascade Correlation Neural Network (CCNN)

Primarily, a CCNN model includes the most effective input and output layers without a hidden layer. Every input is attached to each output neuron by way of a connection with an adjustable weight. Each output neuron receives its input as weighted sum from all of input neurons alongside the bias value. At that point, yield neuron sends this weighted input sum through its transfer function to withdraw final output as shown in Fig. 6a. A simple CCNN also shows good performance in case of small data. A hidden layer is introduced into CCNN model for solving large data set and complex problems. Neurons are delivered to hidden layer one after another. Every new hidden neuron gets a connection from each of the inputs and every pre-present hidden neuron. The hidden neuron's input weights train and then fixed. Simplest output connection weights train over and over. Each new neuron, consequently, provides a new one hidden unit to the network. This prompts the formation of an effective powerful high-order function which includes identifier and deep network with a big variety of inputs to output neurons. Now, the neurons inside hidden layer do not change, once they have been brought. It is known as a cascade because output from all neurons already inside network feeds into the new neurons. A complete structure of CCNN model which is used in the current study has illustrated in the Fig. 6(b). It can be observed that the ET_o values are obtained as target outputs in CCNN model after giving meteorological parameters as predictor variables. Mixture of Sigmoid and Gaussian functions is employed as a kernel function in the hidden layer. In the CCNN model, advance parameters are also used with training parameters which enhance the predictive accuracy and minimize the error. Table 7 provides essential parametric values for CCNN model to generate reliable results.

Table 7 Training and advance parameters of CCNN model

Full size table

2.2.7 Weaknesses of FAO-PM56 Method vs Strengths of Numerical Methods

The concept of ET_o has been widely discussed and used for decades (Doorenbos and Pruitt 1977; Pereira et al. 1999). Allen et al. (1998) introduced a clear definition of FAO-PM56 method for estimation of ET_o. One of the most important reasons for advocating a simpler method than FAO-PM56 is the substantial likelihood for inaccuracy in weather data measurement and collection, especially for the developing countries and meteorological stations managed by non-experts. In these situations, the accuracy of data and especially for more advanced parameters such as radiation and humidity, can be very low (Droogers and Allen 2002). Table 8 shows data requirements for FAO-PM56 and data driven models. It can be seen in Table 8 that FAO-PM56 depends upon several parameters which is not easily accessible especially in the developing countries. Alternatively, data driven models only depend upon climatic parameters and require less parameter in comparison to FAO-PM56 method.

Table 8 Data requirements for the applied ET_o methods (Standard (FAO-PM56) method vs. Data driven models)

Full size table

3 Model Performance Evaluation

The goodness of fit is assessed through a set of indicators that are used to compare all the pairs of observed and model-predicted values of selected variables. Nine performing indicators are selected in this study to evaluate the performance of MLP, GRNN, GMDH and CCNN models. These include square of Pearson correlation coefficient (r²), linear regression coefficient (R²) model efficiency (M.E), mean absolute error (MAE) and root mean square error (RMSE), scatter index (SI) which are defined as follows (Legates and McCabe 1999; Alazba et al. 2011; Mattar and Alamoud 2015; Mattar et al. 2015)

$$ r^{2} = \frac{{n \times \left[ {\left\{ {\left\{ {\sum\nolimits_{{i = 1}}^{n} {\left( {{\text{ET}}_{{{\text{obs}}}} \times \overline{{{\text{ET}}}} _{{{\text{est}}}} } \right)} } \right.} \right\}} \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {{\text{ET}}_{{{\text{obs}}}} } } \right) \times \left( {\sum\nolimits_{{i = 1}}^{n} {{\text{ET}}_{{{\text{est}}}} } } \right)} \right\}} \right]}}{{\sqrt {\left[ {n \times \left\{ {\sum\nolimits_{{i = 1}}^{n} {({\text{ET}}_{{{\text{obs}}}} )} } \right\}^{2} } \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {\mkern 1mu} {\text{ET}}_{{{\text{obs}}}} } \right)} \right\}^{2} } \right] \times \left[ {n \times \left\{ {\sum\nolimits_{{i = 1}}^{n} {({\text{ET}}_{{{\text{est}}}} )} } \right\}^{2} } \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {\mkern 1mu} {\text{ET}}_{{{\text{est}}}} } \right)} \right\}^{2} } \right]} }}$$

(2)

$$ R^{2} = \frac{{\left[ {\left( {\mathop \sum \nolimits_{i = 1}^{n} (ET_{{{\text{obs}}}} - \overline{{ET_{{{\text{obs}}}} }} } \right) \times \left( {ET_{{{\text{est}}}} - \overline{{ET_{{{\text{est}}}} }} } \right)} \right]^{2} }}{{\left[ {\left( {\mathop \sum \nolimits_{i = 1}^{n} (ET_{{{\text{obs}}}} - \overline{{ET_{{{\text{obs}}}} }} } \right) \times \left( {ET_{{{\text{est}}}} - \overline{{ET_{{{\text{est}}}} }} } \right)} \right]}} $$

(3)

$$ {\text{M}}.{\text{E}} = 1 - \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{N}}} ({\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{sim}}}} )}}{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{N}}} ({\text{ET}}_{{{\text{obs}}}} - \overline{{{\text{ET}}_{{{\text{obs}}}} }})}} $$

(4)

$$ {\text{MAE}} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \frac{{|{\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{sim}}}} |}}{{\text{N}}} $$

(5)

$$ {\text{MAPE}} = \frac{100\% }{N}\mathop \sum \limits_{i = 1}^{N} \left| {\frac{{{\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{est}}}} }}{{{\text{ET}}_{{{\text{obs}}}} }}} \right| $$

(6)

$$ {\text{RMSE}} = \sqrt {\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \frac{{({\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{est}}}} )^{2} }}{{\text{N}}}} $$

(7)

$$ {\text{SI}} = \frac{{{\text{RMSE}}}}{{{\text{ET}}_{{{\text{mean}}}} }} $$

(8)

wherein, ET_obs, ET_est, $\overline{{{\text{ET}}_{{{\text{obs}}}} }}$., $\overline{{{\text{ET}}_{{{\text{est}}}} }}$_., ET_mean are observed value, estimated value, average of observed value, average of estimated value, mean of observed value of ET_o, respectively and N represents total number of observations. Further two coefficients, namely, skewness (C_s) and variation (CV) are also used in present sdy to distinguish the data.

$$ C_{s} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {O_{i} - \mu } \right)^{3} }}{{\left( {n - 1} \right) \times \sigma^{3} }} $$

(9)

$$ C.V = \frac{\sigma }{\mu } $$

(10)

Here, O_i, n, σ, and µ represent observed data, number of data points, standard deviation and mean value of data set, respectively. C_s determine the symmetry of distribution and σ measures amount of variation in the given data set (Bland and Altman 1996).

4 Results and Discussions

4.1 Experimental Data

The values of meteorological data on monthly basis are used to develop data driven models. Data sets used for data driven models based on climatic data are divided into two subsets. These include training and testing data sets. Training data set of each climatic station includes 70% of observed data while testing data set contains 30% of observed data collected in the period of years. The climatic data duration corresponding to each climatic station is given in Table 1. Moreover, Table 9 shows the value of statistical parameters of climatic data such as minimum, maximum, mean, standard deviation and skewness coefficient values of the selected countries. Additionally, averages of aforementioned parameters for six selected cities of Pakistan are estimated and presented in Table 9. Likewise, averages of statistical parameters of meteorological parameters for other locations used in the current study are also given in Table 9.

Table 9 Statistical parameters of selected countries used in current study

Full size table

4.2 Performance Analysis

At the beginning, data driven models are trained with extracted data from the FAO-PM56 equation. As a baseline for comparison of the performance of SVM with other data driven models, the same training and testing data set was considered for stations of Pakistan. To estimate FAO-PM56 ET_o values, r², RMSE, MAE and ME are used in comparison between the FAO-PM56 ET_o and predicted values of SVM, GMDH, GRNN, CCNN and MLP models. The performing indices, R² and RMSE of testing data set for each selected city of Pakistan are presented in Fig. 7a and b, respectively. To present results in elegant way, abbreviations of the selected cities of Pakistan are used for graphical representations. These are followed as: Faisalabad (Fsd), Multan (Mul), Islamabad (ISD), Jacobabad (Jac), Peshawar (Pes) and skardu (Skr). It can be observed in Fig. 7a that SVM model looks superior to other numerical computing models in all the selected cities of Pakistan. However, GMDH and GRNN models show almost equal outcomes in comparison to MLP model. In addition, CCNN model generates low r² values except in Faisalabad city. It is evident from Fig. 7a that all models have acceptable square of Pearson correlation coefficient (r²), which is higher than 90%. It means that there is no over fitting of training and testing data of the numerical computing models. In other words, the data driven models capture all nonlinearity of ET_o, which is the main advantage of data driven models. Likewise, it can be seen in Fig. 7b that SVM model generates low RMSE value for the selected cities of Pakistan which varies from 0.13 to 0.50 mm/month. On contrary, GMDH, GRNN, MLP and CCNN models show high value of RMSE. Although GRNN and MLP models generate high value of RMSE as compared to SVM model but have performed well in comparison to GMDH and CCNN models for all the selected cities of Pakistan. The RMSE values for all the chosen cities of Pakistan by employing GRNN and MLP models varies from 0.7 to 0.70 and 0.18–0.81 mm/month, respectively. Thus, it can be concluded that SVM model has more capability among all other numerical computing models to predict accurate ET_o in various climatic regions. However, GRNN and MLP models also show good performance in comparison to GMDH and CCNN models for all selected cities of Pakistan. The summary performance indices, M.E MAE, MAPE, SI, of various approaches towards estimating ET_o for various climatic zones of Pakistan can be seen in Table 10. It can be observed in Table 10 that model efficiency of all the data driven models in various climatic zones of Pakistan was found greater than 90%. As it can be seen model efficiency is higher for SVM model as compared to the use of other models in each climatic zone of Pakistan. However, MLP model looks superior in comparison to GRNN, CCNN and GMDH models in Multan, Peshawar and Skardu climatic stations. In contrast, GMDH model shows good performance as compared to MLP, CCNN and GRNN models in Islamabad and Jacobabad cities. Furthermore, CCNN model has greater value for M.E in Multan city as compared to MLP and GMDH models while it shows slightly better performance than GRNN model. Table 10 further supports our above discussion and expresses the performing indices of all numerical computing methods for each climatic station of Pakistan. Additionally, SVM model has attained 99 percent value for M.E in order to estimate ET_o. Moreover, Table 10 also demonstrates M.E value for SVM approach in selected climatic stations of Pakistan which lies between 95 and 100 percent. In general, the results indicate that SVM approach has the best estimating capability for ET_o estimation. similar results were obtained by Shrestha and Shukla (2015) for tropical environment. Two indicators, namely, MAE and SI are further calculated to endorse the selected numerical computing methods for each climatic station. These two indicators have values range from 0 to 1. If value approaches to 0, it shows the perfect fit of model. Alternatively, model shows less predictive accuracy when its outcome reaches to 1. It can also be observed in Table 10 that MAE and SI indices have least values for SVM model in each climatic station of Pakistan. In addition, SVM model has 0 values for SI while it shows minimum value of MAE as compared to the other models in each selected cities of Pakistan. According to the results depicted in Table 10 that MLP has smaller values for SI and MAE in comparison of GRNN, CCNN and GMDH models in Multan, Peshawar and Skardu climatic stations. On the other hand, MLP, CCNN and GRNN models generate higher values for MAE and SI as compared to GMDH model in Islamabad and Jacobabad cities. Additionally, CCNN has lower values of SI and MAE than MLP, GMDH and GRNN models for Multan city. above whole discussion could be summarized as all selected data driven models except SVM do not maintain consistency of their results in each climatic zone which can be clearly observed in Table 10. This may be due to the fact that the climatic parameters in training process for all selected models excluding SVM generate higher residual values which become responsible to reduce their efficiencies in order to estimate ET_o. Furthermore, GMDH, MLP, CCNN and GRNN show acceptable values for all performing indices (M.E, MAE, MAPE and SI) in each selected climatic stations of Pakistan. An inclusive detail about each data driven model regarding performing indices for selected climatic stations of Pakistan is given in Table 10. In addition to summarized discussion, the model efficiencies (M.Es.) of SVM model for FSd, Jac, Mul, Pes and Skr are 95.4%, 99.06%, 99.36%, 98.7% and 99.64%, respectively that stipulate the supremacy of SVM among other applied data driven models. Surprisingly, the efficiency of SVM for ISD station is lower as compared to other models but the least value of error (i.e. MAPE = 7.63%) further assist the aforementioned remarks regarding consistency and strength. The performance of each data driven model depends upon its training algorithm. It is difficult to decide which data driven model has the best performance except applied in different conditions especially in case of climatic studies. Although data driven models always performed better than local calibrated physical model or conventional methods (Karimaldini et al. 2011; Aytek 2009; Tabari and Talaee 2013). But mostly, it was found that the model considered as the best for one climatic region does not produce efficient result in another climatic condition. This is due to fact that the model optimal structure determined by using data driven model based on machine learning algorithms are of critical importance. Determination of the optimal structure of data driven model seems impossible without iteration process. Hence, various types of training algorithm with a number of iterations have been used to determine the best fitted optimal structure. In addition to use of training algorithms, the number of iterations, convergence value and execution times are considered as the prime parameters. Thus, data driven models based on some advanced machine learning algorithms e.g. pattern and search training algorithm in SVM model and/or equation combinations (linear, polynomial, quadratic) in GMDH model were applied in this study in order to investigate the performance.

Table 10 Performance parameters for ET_o assessment in Pakistan using selected approaches

Full size table

4.3 Results of Developed Machine Learning Models in Other Climatic Regions

In section, developed data driven models were tested on five other stations located in China, New Zealand and USA (Illinois State) by using 4-year climatic data. The climatic data of these stations is also comprised of same meteorological input parameters. The statistical average values of all meteorological variables for selected countries calculated and presented in Table 8. Geographical description and data duration of each climatic station is mentioned in Table 2. climatic characteristics of these stations can be seen in Table 3. ET_o values of these stations were estimated by FAO-PM 56 standard equation. Selected data driven models, SVM, GMDH, MLP, CCNN and GRNN, directly applied on these climatic stations to estimate ET_o values without using their data in training phase. The performance indices, namely, CV, RMSE, r, and MSE are calculated for each station to investigate their performance. results of each data driven model are summarized in Table 11. As the results indicated (Table 11), the SVM model had the best performance in Auckland station (r = 0.99, RMSE = 0.09 mm/day, CV = 0.17 and MSE = 0.12 mm/day) among other models. However, the models GMDH and MLP had performed equally well with r = 0.98 and RMSE = 0.13 mm/day in the test phase. results generated with GRNN and CCNN models are comparatively lower than SVM, GMDH and MLP models. The GRNN model had yielded results as r = 0.97, RMSE = 0.12 mm/day, CV = 0.19, MSE = 0.15 mm/day while CCNN model had generated r = 0.96, RMSE = 0.15 mm/day, CV = 0.23 and MSE = 0.22 mm/day, respectively. It can be evidently observed that RMSE values of GRNN and MLP models are remained identical i.e. 0.12 mm/day. In addition, the correlation between observed and predicted values for all data driven models had persisted above 0.95, although SVM model attains value equivalent to 1. Likewise, SVM model shows good performance for the Harbin station which is located in china country and have extremely cold climatic conditions. The values of r = 1 and RMSE = 0.10 mm/day makes SVM an ideal model for this climatic station while the values for CV and MSE equal to 0 further legitimate it. It should be noted that r term provides information for the linear dependence between observations and corresponding estimates. Furthermore, MLP and GMDH models provide closer ET_o estimates to the benchmark ET_o than GRNN and CCNN models. The values of r and RMSE for both MLP and GMDH models are greater than GRNN and CNN models. However, MLP model had performed better as compared to GMDH model. As shown in Table 9, GMDH model generates exactly twice values for CV and RMSE which are followed as 0.12 and 0.30 mm/day, respectively. Although, the performance of GRNN and CCNN models are shown lower in comparison to SVM, MLP and GMDH models but yield good results in comparison to Auckland station. In addition, MSE value for each model is categorically zero while the value of correlation coefficient had attained 0.95 or above. Generally, it can be concluded from the performance evaluation indicators that SVM had the best performance and superior to other data driven models. The MLP model with a RMSE of 0.15 mm/day and MSE of 0.06 mm/day has provided the second best ET_o estimates. In contrast, the CCNN model has the highest error rates (RMSE = 0.46 mm/day and MSE = 0.21 mm/day). The outcomes of all data driven models are found in Table 11 for Belleville, Big Bend and Dixon Spring stations. According to the performance statistics, SVM model with extreme less values of CV, r and RMSE (CV = 0.13–0.16, r = 0.96–0.99 RMSE = 0.016–0.025 mm/day) is considered as the best among other data driven models for the Belleville, Big Bend and Dixon Spring stations. In addition, for all aforementioned climatic stations, SVM model has zero value for MSE which further supports its supremacy. It can also be inferred from results in Table 11 that the second best model for estimation of ET_o can be considered as a GMDH model. The outcomes of GMDH model are more precise as compared to MLP, GRNN and CCNN models. In addition, this model yields more accurate ET_o with increasing correlation coefficient from 0.96 to 0.99 and decreasing RMSE and MSE values from 0.025 and 0.0003 to 0.016 and 0.0001, respectively. From above analysis, it can be generally concluded that the performance of SVM model is superior to MLP, GMDH, GRNN and CCNN models for all five selected climatic regions situated in USA, China and New Zealand. As mentioned earlier and can be seen from Table 4, radial basis function (RBF) was the best kernel for SVM model. Tabari et al. (2012) had found RBF as a best kernel function among others for all developed SVM models. A precise selection of kernel function is responsible for the given data set to provide feature space for its separation This unique feature creates a boundary between SVM and other previously developed algorithms (Bray and Han 2004). Furthermore, Table 11 also revealed that GMDH is comparatively superior to MLP in Auckland while produces lower results as compared with MLP model in Harbin stations. In addition, MLP model had the best performance in Harbin station but generates poor results for Belleville and Big bend stations with increasing values of CV and RMSE from 0.17 and 0.022 to 0.34 and 0.039, respectively. However, GMDH model could be endured the second best model in order to estimate ET_o. Consequently, it may be evident that GMDH model has utmost performance as compared to MLP, GRNN and CCNN models but inferior to SVM model. Moreover, GRNN model generates more reliable results than CCNN model by decreasing values of RMSE from 0.021 to 0.43 mm/day. Additionally, the values of correlation coefficient for CCNN model are decreasing from 0.96 to 0.89. Finally, it can be concluded that MLP is the third best model while GRNN and CCNN models are ranked as fourth and fifth best model for estimation of ET_o.

Table 11 Results of each applied data driven model in testing phase for other five selected climatic stations

Full size table

The mean monthly ET_o of five selected stations estimated by data driven model and FAO-PM56 method to authenticate model's performance. The hydrographical form of ET_o values for five selected stations can perceptibly be seen in Fig. 8. Initially, ET_o values were observed as low and then increased gradually when number of months increased. But after certain interval, ET_o goes in downward direction from peak value. Similarly, this could be happened for all intervals and observed lower and higher peaks of ET_o. Surely, this trend could be indicated winter and summer seasons. In winter season, temperature recorded minimum and relative humidity was maximum, the value of ET_o estimated minimum. Alternatively, for summer season, temperature recorded maximum and relative humidity was minimum, ET_o value estimated maximum. To further check the underestimation and overestimation of ET_o, data driven models against FAO-PM56 method was compared. It can be perceived from Fig. 8 that MLP and CCNN models indicate overestimation while values of GRNN and GMDH models remain under peak of FAO-PM56 ET_o values for the Harbin station. For SVM data driven model, estimated ET_o is trailing well with FAO-PM56 ET_o values. In contrast, overestimation for MLP, GRNN and GMDH models in comparison to FAO-PM56 ET_o values was observed in Auckland station. However, the values of SVM model closely overlap with the standard FAO-PM56 ET_o values. Likewise, ET_o estimated by SVM model agrees with the FAO-PM56 ET_o and follows same trend in both Big bend and Belleville stations as observed in Fig. 8. Conversely, results of MLP model look extremely below from the FAO-PM56 ET_o values in Big bend station while remain equal with other models for Belleville station. Additionally, GMDH model has performed better than GRNN and CCNN models in both cities. This also authenticates our results for GMDH model indicated in Table 11. For Dixon Spring station, all data driven models generate smooth peaks for ET_o values as compared to FAO-PM56 method. This also indicates that these data driven models only deal with available data set and do not affect with other myriad factors. The deviation of ET_o values for SVM among data driven models was very minute and more close that estimated by FAO-PM56 method. Thus, it can be concluded from earlier discussion that SVM model performed the best in all climatic conditions for ET_o estimation. However, GMDH model generates reliable results as compared to MLP, GRNN and CNN models, therefore ranked as the 2nd best model. While MLP, CCNN and GRNN models ranked 3rd place to 5th, respectively in the estimation of ET_o. The linear regression test facilitates the extension of statistical analysis. Figures 9, 10, 11, 12, 13 indicates ET_o estimates of all data driven models in form of scatter plots for each station. In these plots, ET_o estimated by the FAO-PM56 method are plotted against with numerical estimated ET_o values, in the form of scatter plots. It can be observed that there is a linear pattern of points for all applied data driven models. There were some misplaced estimated points which correspond to incorrectly estimated ET_o data, with some high values for all methods except SVM model. similar results were observed in Tabari et al. (2012). This may be due to intermittent character of ET_o distribution and deviate from trend line. It can be observed in Fig. 9 that data driven models generate acceptable R² values for the Harbin station (China) ranges from 94 to 98%. For SVM, MLP and GRNN data driven models, R² value found in equal (i.e. 98%) while CCNN and SVM models have 94% nad 95% repectively. Likewise, ET_o values estimated by data driven modelsin comparison to FAO-PM56 methods for Auckland (New Zealand) is shown in Fig. 10. As indicated, SVM models performed the best with the highest value of R² (i.e. 95%) as compared to other data driven models. MLP and GRNN models performed equal while CCNN model generates loewest (i.e. 92%) value for Auckland station. In simialr manner, comparison of applied numrical models against the standard FAO-PM56 method for Big Bend, Belliville, Dixon Spring (Illinios state, USA) are presented in Fig. 11, Fig. 12 and Fig. 13, respectively. SVM and GMDH performed the best as compared to CCNN, MLP and GRNN model as obserevd in Fig. 11. The highest value of R² (93%) in Big Bend station was found for SVM model. Similarly, for Belliville (Fig. 12) and Dixon Spring stations (Fig. 13), SVM model outperformed in comaprison to GRNN, CCNN, MLP. The R² value for SVM in Belliville and Dixon Springs climaic stations was nealy found 94% and 92% respectively. It is obvious from given fitted lines in all scatter plots (Figs. 9, 10, 11, 12, 13) that SVM model have closer to PM-ET_o values in comparison to other data driven models. Plotted points of SVM model correlate well towards linear trend line. However, plotted points of other data driven model are of major underestimation or overestimation. It can be deduced from Figs. 9, 10, 11, 12, 13 that GMDH model has the higher value of R² after SVM model for Auckland station (i.e. R² = 94%). The superiority of GMDH model over MLP, GMDH, CCRN can also be noticeably comprehended in, Big Bend, Dixon Spring climatic stations. While MLP and CCRN performed better than GMDH model in the Harbin and Belleville climatic stations, respectively. similar results of GMDH model can be found in Table 11 which demonstrates the supremacy of GMDH model over data driven models except a SVM model. It can be concluded that SVM model has the highest linear regression coefficient (R²) in all types of climatic conditions and could be considered as the best data driven model for the ET_o estimation. From above discussions, GMDH and MLP data driven models are ranked as the 2nd and 3rd best models for estimation of ET_o. In contrast, GRNN and CCNN model have lower value of R² but can be acceptable for all climatic regions. Therefore, CCNN is ranked as 4th while GRNN model is considered as the 5th best model_. It is evident from scatter plots that all methods have acceptable linear regression coefficient values.

For all data driven models, regression equation of each selected climatic station is found in Figs. 9, 10, 11, 12, 13. It can be perceived that SVM outperformed than other data driven models and shows higher correlation coefficient value in each selected station. The value of slope(a) and y-intercept (b) in regression equation (Y = aX + b) for SVM model was also found good, for "a" achieved higher and "b" look smaller, in comparison to other data driven models (Figs. 9, 10, 11, 12, 13). In addition, a comparison of ET_o estimated by the standard conventional FAO-PM56 method and data driven models for selected countries is presented in Table 12 to conclude above mentioned results. Skardu station from Pakistan was included to make reasonable comparison among data driven models because it belongs to humid region. It can be concluded that all the data driven modelsshown good results when applied in differernt cliamtic conditions. Overall results indicate that SVM model outperformed for all selected climatic stations and yielded relaible results for ET_o estimation. This may be due to the fact that SVM model always seeks a global optimized solution, avoids over-fitting and no physical bias which eventually leads to the best performance over other data driven models.

Table 12 Comparison of FAO-PM56 and data driven models between monthly averages value of ET_o for selected stations

Full size table

5 Conclusions

In this study, the potential of five machine learning algorithms/data driven models (SVM, GMDH, CCNN, MLP and GRNN) were investigated to estimate long term monthly reference evapotranspiration (ET_o). For this purpose, meteorological variables of maximum and minimum temperatures, average relative humidity, wind speed and sunshine hours were employed as input in data driven models for six stations of Pakistan. The results obtained from all data driven models were found acceptable for selected climatic stations. Most of studies exited in literature used 70% of data as training/input data. And rest of 30% data for testing/validation. In this situation, R² mostly comes above 95%. This practice is not justified and always arose question regarding model's performance. Therefore, the data set from other countries have been used for validation and justification of each data driven model. The following conclusions can be drawn from the study:

The long term monthly reference evapotranspiration (ET_o) of four climatic regions were successfully estimated by conventional standard FAO-PM56 method
Machine learning algorithms/data driven models of SVM, GMDH, CCNN, MLP and GRNN were found capable for ET_o estimation under different climatic zones of Pakistan as well as in other parts of the world
By investigating the performing indices, SVM and GMDH model ranked as 1st and 2nd while 3rd, 4th and 5th followed by MLP, CCCN and GRNN data driven models
The comparison results indicate that by using the similar meteorological inputs, SVM can be employed as an alternative ET_o model to the existing conventional method

References

Alazba, A., Mattar, M., ElNesr, M., & Amin, M. (2011). Field assessment of friction head loss and friction correction factor equations. Journal of Irrigation and Drainage Engineering, 138(2), 166–176.
Google Scholar
Allen, R. G., Pereira, L. S., Raes, D., & Smith, M. (1998). Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao, 300(9), D05109.
Google Scholar
Amarasinghe, U. A., & Smakhtin, V. (2014). Global water demand projections: past, present and future. Colombo: IWMI.
Google Scholar
Asefa, T., Kemblowski, M., McKee, M., & Khalil, A. (2006). Multi-time scale stream flow predictions: The support vector machines approach. Journal of Hydrology, 318(1–4), 7–16.
Google Scholar
Awari, H. (2018). Estimation of reference evapotranspiration and crop water requirment using artificial neural network for semiarid. Ph.D, Vasantrao Naik Marathwada Krishi Vidyapeeth, Parbhani, Vasantrao Naik Marathwada Krishi Vidyapeeth, Parbhani.
Aytek, A. (2009). Co-active neurofuzzy inference system for evapotranspiration modeling. Soft Computing, 13(7), 691.
Google Scholar
Azamathulla, H. M., Ghani, A. A., Leow, C. S., Chang, C. K., & Zakaria, N. A. (2011). Gene-expression programming for the development of a stage-discharge curve of the Pahang River. Water Resources Management, 25(11), 2901–2916.
Google Scholar
Barker, R., Dawe, D., Tuong, T., Bhuiyan, S., & Guerra, L. (1999). The outlook for water resources in the year 2020: challenges for research on water management in rice production. Southeast Asia, 1, 1–5.
Google Scholar
Bland, J. M., & Altman, D. G. (1996). Measurement error. BMJ. British Medical Journal, 312(7047), 1654.
Google Scholar
Bray, M., & Han, D. (2004). Identification of support vector machines for runoff modelling. Journal of Hydroinformatics, 6(4), 265–280.
Google Scholar
Chaturvedi, D. K. (2008). Soft computing: Techniques and its applications in electrical engineering. Berlin, Heidelberg: Springer. https://doi.org/10.1007/978-3-540-77481-5.
Book Google Scholar
Cobaner, M. (2011). Evapotranspiration estimation by two different neuro-fuzzy inference systems. Journal of Hydrology, 398(3–4), 292–302.
Google Scholar
Collobert, R., & Bengio, S. (2000). Support vector machines for large-scale regression problems. Technical Report IDIAP-RR00-17 IDIAP. Switzerland.
Diamantopoulou, M., Georgiou, P., & Papamichail, D. (2011). Performance evaluation of artificial neural networks in estimating reference evapotranspiration with minimal meteorological data. Global Nest Journal, 13(1), 18–27.
Google Scholar
Doğan, E. (2009). Reference evapotranspiration estimation using adaptive neuro-fuzzy inference systems. Irrigation and Drainage, 58(5), 617–628.
Google Scholar
Doorenbos, J., & Pruitt, W. (1977). Crop water requirements. FAO irrigation and drainage paper 24. Land and Water Development Division. Rome: FAO.
Google Scholar
Droogers, P., & Allen, R. G. (2002). Estimating reference evapotranspiration under inaccurate data conditions. Irrigation and Drainage Systems, 16(1), 33–45.
Google Scholar
Gavilan, P., Berengena, J., & Allen, R. G. (2007). Measuring versus estimating net radiation and soil heat flux: impact on Penman-Monteith reference ET estimates in semiarid regions. Agricultural water management, 89(3), 275–286.
Google Scholar
Gocić, M., Motamedi, S., Shamshirband, S., Petković, D., Ch, S., Hashim, R., et al. (2015). Soft computing approaches for forecasting reference evapotranspiration. Computers and Electronics in Agriculture, 113, 164–173.
Google Scholar
Guven, A., Aytek, A., Yuce, M. I., & Aksoy, H. (2008). Genetic Programming-based empirical model for daily reference evapotranspiration estimation. Clean-Soil Air Water, 36(10–11), 905–912.
Google Scholar
Han, J., Kamber, M., & Pei, J. (2011). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann Publishers.
Hassanpour Kashani, M., Montaseri, M., & Lotfollahi Yaghin, M. A. (2008). Flood estimation at ungauged sites using a new hybrid model. Journal of Applied Sciences, 8, 1744–1749.
Google Scholar
Hess, T. (1996). Potential Evapotranspiration [DAILYET]. UK: Silsoe College.
Google Scholar
Huang, C., Davis, L. S., & Townshend, J. R. G. (2002). An assessment of support vector machines for land cover classification. International Journal of Remote Sensing, 23(4), 725–749. https://doi.org/10.1080/01431160110040323.
Article Google Scholar
Huang, Y., Lan, Y., Thomson, S. J., Fang, A., Hoffmann, W. C., & Lacey, R. E. (2010). Development of soft computing and applications in agricultural and biological engineering. Computers and Electronics in Agriculture, 71(2), 107–127.
Google Scholar
Ivakhnenko, A. G. (1968). The group method of data of handling; a rival of the method of stochastic approximation. Soviet Automatic Control, 13, 43–55.
Google Scholar
Jorguseski, L., Pais, A., Gunnarsson, F., Centonza, A., & Willcock, C. (2014). Self-organizing networks in 3GPP: Standardization and future trends. IEEE Communications Magazine, 52(12), 28–34.
Google Scholar
Karimaldini, F., Teang Shui, L., Ahmed Mohamed, T., Abdollahi, M., & Khalili, N. (2011). Daily evapotranspiration modeling from limited weather data by using neuro-fuzzy computing technique. Journal of Irrigation and Drainage Engineering, 138(1), 21–34.
Google Scholar
Kim, S., & Kim, H. S. (2008). Neural networks and genetic algorithm approach for nonlinear evaporation and evapotranspiration modeling. Journal of Hydrology, 351(3), 299–317.
Google Scholar
Kisi, O. (2013). Least squares support vector machine for modeling daily reference evapotranspiration. Irrigation Science, 31(4), 611–619.
Google Scholar
Kisi, O., & Cimen, M. (2009). Evapotranspiration modelling using support vector machines/Modélisation de l'évapotranspiration à l'aide de ‘support vector machines’. Hydrological Sciences Journal, 54(5), 918–928.
Google Scholar
Kişi, Ö., & Öztürk, Ö. (2007). Adaptive neurofuzzy computing technique for evapotranspiration estimation. Journal of Irrigation and Drainage Engineering, 133(4), 368–379.
Google Scholar
Kisi, O., & Sanikhani, H. (2015). Prediction of long-term monthly precipitation using several soft computing methods without climatic data. International Journal of Climatology, 35(14), 4139–4150.
Google Scholar
Kisi, O., & Zounemat-Kermani, M. (2014). Comparison of two different adaptive neuro-fuzzy inference systems in modelling daily reference evapotranspiration. Water Resources Management, 28(9), 2655–2675.
Google Scholar
Kumar, M., Raghuwanshi, N., Singh, R., Wallender, W., & Pruitt, W. (2002). Estimating evapotranspiration using artificial neural network. Journal of Irrigation and Drainage Engineering, 128(4), 224–233.
Google Scholar
Landeras, G., Ortiz-Barredo, A., & López, J. J. (2008). Comparison of artificial neural network models and empirical and semi-empirical equations for daily reference evapotranspiration estimation in the Basque Country (Northern Spain). Agricultural Water Management, 95(5), 553–565.
Google Scholar
Legates, D. R., & McCabe, G. J., Jr. (1999). Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resources Research, 35(1), 233–241.
Google Scholar
Li, R. Y. M., Fong, S., & Chong, K. W. S. (2017). Forecasting the REITs and stock indices: Group method of data handling neural network approach. Pacific Rim Property Research Journal, 23(2), 123–160.
Google Scholar
Lopez-Urrea, R., de Santa Olalla, F. M., Fabeiro, C., & Moratalla, A. (2006). An evaluation of two hourly reference evapotranspiration equations for semiarid conditions. Agricultural Water Management, 86(3), 277–282.
Google Scholar
Lu, W.-Z., & Wang, W.-J. (2005). Potential assessment of the “support vector machine” method in forecasting ambient air pollutant trends. Chemosphere, 59(5), 693–701.
Google Scholar
Lumia, D. S., Linsey, K. S., & Barber, N. L. (2005). Estimated use of water in the United States in 2000: US Department of the Interior US Geological Survey. Reston: US Geological Survey.
Google Scholar
Markowetz, F. (2003). Classification by support vector machines. https://www.academia.edu/download/3241129/Kapitel_16.pdf.
Mattar, M. A., & Alamoud, A. I. (2015). Artificial neural networks for estimating the hydraulic performance of labyrinth-channel emitters. Computers and Electronics in Agriculture, 114, 189–201.
Google Scholar
Mattar, M., Alazba, A., & El-Abedin, T. Z. (2015). Forecasting furrow irrigation infiltration using artificial neural networks. Agricultural Water Management, 148, 63–71.
Google Scholar
McMahon, T., Peel, M., Lowe, L., Srikanthan, R., & McVicar, T. (2013). Estimating actual, potential, reference crop and pan evaporation using standard meteorological data: A pragmatic synthesis. Hydrology and Earth System Sciences, 17(4), 1331.
Google Scholar
Møller, M. F. (1993). A scaled conjugate gradient algorithm for fast supervised learning. Neural Networks, 6(4), 525–533.
Google Scholar
Najafzadeh, M., Barani, G.-A., & Azamathulla, H. M. (2014). Prediction of pipeline scour depth in clear-water and live-bed conditions using group method of data handling. Neural Computing and Applications, 24(3–4), 629–635.
Google Scholar
Østerbø, O., & Grøndalen, O. (2012). Benefits of self-organizing networks (SON) for mobile operators. Journal of Computer Networks and Communications. https://doi.org/10.1155/2012/862527.
Article Google Scholar
Peng, M., Liang, D., Wei, Y., Li, J., & Chen, H.-H. (2013). Self-configuration and self-optimization in LTE-advanced heterogeneous networks. IEEE Communications Magazine, 51(5), 36–45.
Google Scholar
Pereira, A. R., & Pruitt, W. O. (2004). Adaptation of the Thornthwaite scheme for estimating daily reference evapotranspiration. Agricultural Water Management, 66(3), 251–257.
Google Scholar
Pereira, L. S., Perrier, A., Allen, R. G., & Alves, I. (1999). Evapotranspiration: concepts and future trends. Journal of Irrigation and Drainage Engineering, 125(2), 45–51.
Google Scholar
Raes, D. (2009). ETo Calculator: a software program to calculate evapotranspiration from a reference surface. FAO Land Water Division: Digital Media Service
Rahimikhoob, A. (2010). Estimation of evapotranspiration based on only air temperature data using artificial neural networks for a subtropical climate in Iran. Theoretical and Applied Climatology, 101(1–2), 83–91.
Google Scholar
Sherrod, P. H. (2008). DTREG predictive modeling software. Users Manual. Disponível online no url: (https://www.dtreg.com/DTREG.pdf) (Acedido 28 Março 2014).
Shiri, J., Marti, P., & Singh, V. P. (2014a). Evaluation of gene expression programming approaches for estimating daily evaporation through spatial and temporal data scanning. Hydrological Processes, 28(3), 1215–1225.
Google Scholar
Shiri, J., Nazemi, A. H., Sadraddini, A. A., Landeras, G., Kisi, O., Fard, A. F., et al. (2014b). Comparison of heuristic and empirical approaches for estimating reference evapotranspiration from limited inputs in Iran. Computers and Electronics in Agriculture, 108, 230–241.
Google Scholar
Shiri, J., Sadraddini, A. A., Nazemi, A. H., Kisi, O., Landeras, G., Fard, A. F., et al. (2014c). Generalizability of Gene Expression Programming-based approaches for estimating daily reference evapotranspiration in coastal stations of Iran. Journal of Hydrology, 508, 1–11.
Google Scholar
Shrestha, N., & Shukla, S. (2015). Support vector machine based modeling of evapotranspiration using hydro-climatic variables in a sub-tropical environment. Agricultural and Forest Meteorology, 200, 172–184.
Google Scholar
Specht, D. F. (1990). Probabilistic neural networks. Neural Networks, 3(1), 109–118.
Google Scholar
Sudheer, K., Gosain, A., & Ramasastri, K. (2003). Estimating actual evapotranspiration from limited climatic data using neural computing technique. Journal of Irrigation and Drainage Engineering, 129(3), 214–218.
Google Scholar
Swennenhuis, J. (2009). CROPWAT 8.0. Rome: Water Resources Development and Management Service of FAO.
Google Scholar
Tabari, H., Kisi, O., Ezani, A., & Talaee, P. H. (2012). SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. Journal of Hydrology, 444, 78–89.
Google Scholar
Tabari, H., Martinez, C., Ezani, A., & Talaee, P. H. (2013). Applicability of support vector machines and adaptive neurofuzzy inference system for modeling potato crop evapotranspiration. Irrigation Science, 31(4), 575–588.
Google Scholar
Tabari, H., & Talaee, P. H. (2013). Multilayer perceptron for reference evapotranspiration estimation in a semiarid region. Neural Computing and Applications, 23(2), 341–348.
Google Scholar
Todorovic, M., Karic, B., & Pereira, L. S. (2013). Reference evapotranspiration estimate with limited weather data across a range of Mediterranean climates. Journal of Hydrology, 481, 166–176.
Google Scholar
Trajkovic, S. (2005). Temperature-based approaches for estimating reference evapotranspiration. Journal of Irrigation and Drainage Engineering, 131(4), 316–323.
Google Scholar
Trajkovic, S., & Kolakovic, S. (2009). Estimating reference evapotranspiration using limited weather data. Journal of Irrigation and Drainage Engineering, 135(4), 443–449.
Google Scholar
Traore, S., & Guven, A. (2012). Regional-specific data driven modelsof evapotranspiration using gene-expression programming interface in Sahel. Water Resources Management, 26(15), 4367–4380.
Google Scholar
Traore, S., Wang, Y.-M., & Kerh, T. (2010). Artificial neural network for modeling reference evapotranspiration complex process in Sudano-Sahelian zone. Agricultural Water Management, 97(5), 707–714.
Google Scholar
Turing, A. (1950). Mind. Mind, 59(236), 433–460.
Google Scholar
Vapnik, V. (1998). Statistical learning theory. New York: Wiley.
Google Scholar
Ventura, F., Spano, D., Duce, P., & Snyder, R. (1999). An evaluation of common evapotranspiration equations. Irrigation Science, 18(4), 163–170.
Google Scholar
Wu, C., Chau, K., & Li, Y. (2008). River stage prediction based on a distributed support vector regression. Journal of Hydrology, 358(1–2), 96–111.
Google Scholar
Xu, C.-Y., & Singh, V. (2002). Cross comparison of empirical equations for calculating potential evapotranspiration with data from Switzerland. Water Resources Management, 16(3), 197–219.
Google Scholar

Download references

Acknowledgements

The authors are thankful to the Directors/Assistant Directors of Pakistan meteorological department (PMD) for the provision of climatic data. Cordial thanks extend to the Editor and anonymous reviewers to improve this manuscript.

Author information

Authors and Affiliations

Department of Agricultural Engineering, Bahauddin Zakariya University, Multan, Pakistan
Ali Raza, Muhammad Shoaib & Faisal Baig
College of Water Conservancy and Civil Engineering, Northeast Agricultural University, Harbin, China
Muhammad Abrar Faiz
Department of Civil Engineering, Bahauddin Zakariya University, Multan, Pakistan
Mudasser Muneer Khan
Department of Civil Engineering, The University of Lahore, Lahore, Pakistan
Muhammad Kaleem Ullah
Department of Civil Engineering, NFC Institute of Engineering and Technology, Multan, Pakistan
Muhammad Zubair

Authors

Ali Raza
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Shoaib
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Abrar Faiz
View author publications
You can also search for this author in PubMed Google Scholar
Faisal Baig
View author publications
You can also search for this author in PubMed Google Scholar
Mudasser Muneer Khan
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Kaleem Ullah
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Zubair
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Muhammad Shoaib.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Raza, A., Shoaib, M., Faiz, M.A. et al. Comparative Assessment of Reference Evapotranspiration Estimation Using Conventional Method and Machine Learning Algorithms in Four Climatic Regions. Pure Appl. Geophys. 177, 4479–4508 (2020). https://doi.org/10.1007/s00024-020-02473-5

Download citation

Received: 20 June 2019
Revised: 02 February 2020
Accepted: 20 March 2020
Published: 02 April 2020
Issue Date: September 2020
DOI: https://doi.org/10.1007/s00024-020-02473-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Comparative Assessment of Reference Evapotranspiration Estimation Using Conventional Method and Machine Learning Algorithms in Four Climatic Regions

Abstract

Similar content being viewed by others

Artificial Intelligence Based and Linear Conventional Techniques for Reference Evapotranspiration Modeling

Performance evaluation of numerical and machine learning methods in estimating reference evapotranspiration in a Brazilian agricultural frontier

Monthly Prediction of Reference Evapotranspiration in Northcentral Nigeria Using Artificial Intelligence Tools: A Comparative Study

1 Introduction

2 Materials and Methods