1 Introduction

The agriculture sector is found to be the huge water consumer in most of the countries. The quantity of water withdrawal from total amount in developing counties is nearly estimated 81% while accounted 71% globally. In addition to use, more than 55% of World's fresh water withdrawals are utilizing for irrigation purposes (Amarasinghe and Smakhtin 2014). It is impossible to yield large amount of food without the irrigation crop field in order to fulfill world's population needs (Lumia et al. 2005). Researches relevant with enhancement of water productivity and improvement of irrigation water management have attained higher importance in irrigation field (Barker et al. 1999). Irrigators and farmers can make real time decision for irrigation on basis of crop water requirements. The amount of water required to irrigate crop field for entire period is called crop water requirements (ETc) which can be estimated by multiplying crop coefficient (Kc) with reference evapotranspiration (ETo). Moreover, values of Kc for a particular crop can be determined by dividing crop-ET with the ETo value (Allen et al. 1998). Therefore, accurate quantification of ETo is direly needed in determining crop water requirements to overcome less and excessive irrigation problems. As we know, the amount of water given at right time is called irrigation scheduling which seems impossible without accurate estimation of ETo. Different techniques and methods have been applied for ETo estimation which include direct, indirect and data driven methods (Karimaldini et al. 2011). The estimation of ETo through lysimeter is considered as the best choice but difficult mainly because of cost intensive, labor intensive and requires huge amount for properly setup. Consequently, indirect and data driven methods were developed to estimate ETo by using meteorological data. The indirect methods developed for estimation of ETo includes: (1) empirical and semi empirical Eq. (2) Pan-evaporation method (3) energy budget method (4) mass transfer Eq. (5) combination Eq. (6) radiation base methods. These methods are found in Xu and Singh (2002) and McMahon et al. (2013).

Food and Agriculture Organization (FAO) of United Nations has introduced one of the illustrious and well-known indirect standard method for ETo estimation. This method involves incorporation of Penman–Monteith equation which was modified and reformed by Allen et al. (1998) as a reference equation (FAO-PM56). This equation (FAO-PM56) is influenced and controlled by various climatic, aerodynamic and surface resistance parameters. These comprise of maximum and minimum air temperature, maximum and minimum relative air humidity, wind speed and solar radiation, saturation vapor pressure deficit, slope vapor pressure curve and psychometric constant. There is an anomaly to air temperature available at the most weather stations, the remaining variables are often incomplete and not always found reliable for many locations (Rahimikhoob 2010). This point may be not valid for developed countries, but seems a big challenge in case of developing countries like Pakistan, where quantity and quality of data remain always questionable. Trajkovic and Kolakovic (2009) stated that reliable weather data sets of radiation, relative humidity, and wind speed are usually limited in developing countries.

Concerning and regarding the above contexts, there are myriad factors including physical, social and the environmental conditions in case of physical models influencing ET estimation which is the foundation of indirect methods. In addition, the geographic data (latitude, longitude, altitude) is also required for the local adjustment of the different weather parameters e.g. atmospheric pressure, extraterrestrial radiations (Ra) and daylight hours (N) in FAO-PM56 equation. Additionally, the experimental approaches or field measurement efforts are usually time consuming, labor-intensive and post processing output process. Subsequently, it is quite difficult to formulate the ETo equation to overcome these effects that can produce reliable and verified results (Ventura et al. 1999; A. R. Pereira and Pruitt 2004; Lopez-Urrea et al. 2006; Gavilan et al. 2007).

A few software solutions, namely, DailyET (Hess 1996), Cropwat 8.0 (Swennenhuis 2009) and ETo calculator (Raes 2009) are efficaciously used to calculate ETo. These software packages are based upon empirical and semi-empirical equations which require numerous climatic and geographic parameters. Moreover, these software packages will not yield results if even one of the parameter is not available. Thus, the data driven models based on different machine learning algorithms may be utilized as substitute models because of several advantages, for example, no required information of inner framework factors, easier answers for multi-variable issues and realistic calculation. These are innovative approaches which based on computational intelligent system to overcome imprecision and vulnerability in producing results (Chaturvedi 2008; Huang et al. 2010).

The main issue with modeling ET process is its non-linear dynamic and high complexity nature. In this regard, data driven models can be considered and contemplated as proper methods for ETo estimation. Their ability to deal with complex problems by using only available set of data made them superior to conventional methods. Furthermore, there is also no need to give geographical information or local adjustment in these data driven models. The following literature provides a comprehensive comparison among data driven models and conventional methods in hydrological studies especially for ETo estimation.

Among various data driven models, artificial neural networks (ANNs); adaptive neuro-fuzzy interference system (ANFIS); gene expression programming (GEP) have been widely applied for ETo estimation and proved best choice over indirect methods. Various studies reported in literature which used ANN for ETo estimation and found best performance for ANN data driven model in comparison to indirect methods (Kumar et al. 2002; Sudheer et al. 2003; Trajkovic 2005; Kim and Kim 2008; Landeras et al. 2008; Traore et al. 2010). The potential for the use of ANNs was examined by Rahimikhoob (2010) to estimate ETo based on air temperature data under humid subtropical conditions on southern coast of Caspian Sea. It was claimed that ANN model found superior in comparison to indirect empirical equations. In addition to ETo estimation, ANFIS model was also investigated in parallel to other methods (Kişi and Öztürk 2007; Doğan 2009; Cobaner 2011; Aytek 2009; Kisi and Zounemat-Kermani 2014). Kişi and Öztürk 2007 have applied ANFIS model for ETo estimation in Los Ageles, city of southern California. They concluded that ANFIS can be considered as a good approach in estimating ETo. Aytek 2009 has investigated the potential of coactive adaptive neuro fuzzy interference system (CANFIS) in three stations for estimating daily ETo. The results obtained with CANFIS were further compared with some empirical formulas. The study finally affirmed the CANFIS performance is superior as compared to the conventional methods. In similar manner, Karimaldini et al. (2011) have also examined ANFIS approach for estimating ETo in arid condition by using limited climatic data. The findings confirmed the utmost performance of ANFIS over conventional empirical equations (calibrated FAO-PM56, Hargreaves, Priestley-Tailor, Makkink, and Blaney-Criddle). Cobaner (2011) examined two types of ANFIS based on grid partition (G-ANFIS) and subtractive clustering (S-ANFIS) methods in modeling ETo by using daily climatic data. The results were compared with different types of ANNs and three empirical models. He claimed that S-ANFIS was performed better than ANNs and empirical formulas in estimation of ETo. In addition, the study Kisi and Zounemat-Kermani (2014) have also applied G-ANFIS and S-ANFIS models in estimating ETo with daily climatic data. They found ANFIS models superior to the corresponding conventional methods. Due to good generalization ability and attractive interference, gene expression programing (GEP) attained the higher significance in many hydrological studies e.g. stage–discharge curve, prediction in monthly precipitation (Azamathulla et al. 2011; Kisi and Sanikhani 2015). In addition to literature, the performance of GEP was also investigated in modeling ETo by using climatic variables (Shiri et al. 2014b, 2014a, 2014c; Guven et al. 2008; Traore and Guven 2012). The studies claimed that the GEP model could be considered as an alternative tool to the selected conventional methods for ETo estimation.

Nowadays, support vector machine (SVM) and self-organizing networks (SONs) are also considered as the emerging tools in solving complex problems among the data driven models with an efficient outcomes (Asefa et al. 2006; Carpenter and Grossberg 1991, 1992; Lu and Wang 2005; Jorguseski et al. 2014; Østerbø and Grøndalen 2012; Peng et al. 2013). Group method of data handling (GMDH) and cascade correlation neural networks (CCNN) are found as the robust data driven models because of their tendency to develop direct relationship between input and output layers without the hidden layer as required in case of ANNs (Sherrod 2008, Sherrod 2010). Application of SVM, GMDH and CCNN models found in literature is presented thereafter.

Collobert and Bengio (2000) investigated the SVM model for large scale regression poblems. It is claimed that time consumed in performing the analysis (training and testing phases) using SVM model was less than the other selected algorithms. The study Huang et al. (2002) also investigated the performance of SVM model. They found that the classifiers used in SVM model have more tendencies to solve complex non-liner problems as compared to the other classifiers. Kisi (2013) investigated the capability of SVM model for estimating the ETo by using daily climatic data in two stations of Califormia. The comparison results showed that SVM model performed better than the selected empirical equations. Kisi and Cimen (2009) investigated the potential of SVM model in ETo estimation using daily meteorological data and compared the results with empirical models (CIMIS Penmene, Hargreaves, Ritchie and Turc methods). They claimed that SVM performed better than empirical models and coud be empolyed as an alternative tool in estimating ETo Shiri Nazemi et al. (2014a, b, c) applied heuristic approaches (including SVM) in comparison to the empirical and semi-empirical equations for estimation of ETo. The comparison results confirmed the superiority of heuristic approaches to the existing conventional methods. Tabari et al. (2013) investigated the capability of ANFIS and SVM in potato ETc estimation. The results of ANFIS and SVM were also compared with the empirical equations. They found that SVM model performed better than ANFIS and corresponding applied empirical equations. Diamantopoulou et al. (2011) examined the potential of back propagation algorithm (BPANN) and CCNN in estimating the daily ETo using weather data obtained from automatic weather stations (AWS) of Northern Greece. The results indicated that CCNN performed better than BPANN and selected empirical equations. Therefore,it could be considered as an appropriate tool in modeling ETo process. In another study, Awari (2018) also investigated the capability of CCNN in the estimation of ETo. The comparison results indicated that CCNN performed best than the other selected methods. Najafzadeh et al. (2014) investgated the potential of GMDH and SVM in estimating the scour depth below the pipelines and compared the results with empirical equations. The comparison results showed that the GMDH performed better than SVM and could be successfully empolyed as an alternative tool to the existing conventional methods. To the best knowledge of the authors, there is not any publised article in the literature which investigate the performance of GMDH model in ETo estimation.

Thus, it can be perceived from the above citations that data driven models are oftern found best for ETo estimation. But most of the stuides do not apply devloped data driven modelson other locations, which arise a question reagrding models's perfroamnce. Nowadays, this practice is not justified for data driven modelsin deterinninng their perforamnce by incorporating similar data in trainning and testing phases. Therfore, a new practice must be followed to investigate and authenticate model's performance. According to the knowledge of authors, there is no study conducted in Pakistan that applies data driven modelsfor ETo estimation. The mian aims of this study were: (1) to determine the ETo using globally accepted method (FAO-PM56) across a range of climatic regions (2) to investigate the potential of different data driven models(i.e. SVM, GMDH, CCNN, MLP and GRNN) for ETo estimation (3) to compare ETo estimation of PM 56 method with that of data driven models.

2 Materials and Methods

2.1 Location and Data Collection of Studied Regions

Pakistan expands north-east to south-west from 23.5  to 37  N and 60  to 75  E in south-west Asia over world map. It is bordered with Afghanistan in north-west, Iran in west, China in far north-east and India in east. The total area of Pakistan is nearly about 80 mha out of which only 37 mha is available for raising crops. A variety of climatic conditions exists in Pakistan. The weather of northern and north-western parts is quite cool because of the presence of snow-packed mountains. Plains of Punjab province are located in the center and are quite hot and semi-arid in nature. Highlands of Balochistan province have cold climate but arid in nature and the coastal areas located in south are warm and humid. Temperature of colder regions located in north falls below freezing point during winter months while a high temperature of about 40  C prevails during summer months in the semi-arid and arid desert regions of Pakistan with a peak approaching 50  C frequently. The northern and western highlands of Pakistan receive an annual rainfall of 1500 mm and 500 mm respectively while desert area receives very less rainfall. Thus, climatic distribution of Pakistan is arid to semi-arid, except a short stretch of temperate in north with wide variations.

The climatic data of six selected cities located in various climatic regions of Pakistan, was used in current study for a duration of 30 years (1987–2016). The data includes long term monthly average values of minimum temperature (Tmin), maximum temperature (Tmax), average relative humidity (R.Hmean), sunlight hours (N), and average 24 h. wind speed (U). This dataset was divide into sections. First section contains 70% of the data (1987–2007) and used to train each selected model while remaining 30% of the data (2008–2016) employed to test the developed models. The performance of one machine learning based model could be hardly justified if applied to only one location/country. To further test the validity of the developed models, the climatic data of five more cities located in New Zealand, China and USA was also used. It is pertinent to mention here that the data of six cities located in Pakistan was used to train and test the developed models while the data of five cities located in three countries of New Zealand, China and USA was only used to test the developed models. The details of the data of different cities located in Pakistan, New Zealand, China and USA is given in Table 1.

Table 1 The geographic description of selected climatic regions

The location of six selected climatic stations of Pakistan is presented in Fig. 1a while Fig. 1b shows the climatic stations of other selected countries. All selected climatic stations of Fig. 1a and b are classified into six climatic zones which can be seen in Table 2. In addition, classifications of these selected climatic stations are based on global aridity index (Todorovic et al. 2013). This index can be calculated by dividing the mean yearly precipitation (P) to the mean yearly ETo. Additionally, the humid region is further categorized into warm, mild and high humid climatic zones. The division of these zones is based on average annual rainfall and mean temperature data. This is due to the fact that climatic conditions that lie in humid regions can easily be fathomable.

Fig. 1
figure 1

a Location of the six selected climatic stations in Pakistan. b Location of the selected climatic stations in other countries

Table 2 Division of climate zones, respective precipitation to ETo ratios

2.2 Adopted Methodology

2.2.1 FAO-PM56 Method

This method is universally accepted as a standard method for the estimation of ETo. It is a physical method which requires numerous climatic parameters. The method is based on combine equations which estimate ETo values. The mathematical relation to estimate ETo by using climatic parameters is given in Eq. 1.

$$ {\text{ET}}_{o} = \frac{{0.408\Delta \left( {R_{n} - G} \right) + \gamma \times \frac{900}{{T_{{{\text{mean}} + 273}} }} \times U_{2} \times \left( {e_{s} - e_{a} } \right)}}{{\Delta + \gamma \left( {1 + 0.34u_{2} } \right)}} $$
(1)

wherein Rn, and G are net radiation (MJ /m2/day) on crop surface and soil warmness flux density (MJ/m2/day), respectively. Additionally, Tmean, U2, es, ea, emin, emax, Δ and γ are means of maximum and minimum air temperatures (°C), air speed (m/s), saturation vapor pressure (kPa), actual vapor pressure (kPa), saturation vapor pressure at minimum air temperature, saturation vapor pressure at maximum air temperature, slope vapor pressure curve (kPa/°C) and psychrometric constant (kPa/°C), respectively. As the value of G is smaller than Rn, it may be taken as zero. The value of Δ can be determined by utilizing the procedure which is enlightened by Gocić et al. (2015). The complete derivation of PM equation either in case of complete or missing data can be found in Allen et al. (1998).

2.2.2 Multilayer perceptron (MLP)

MLP is frequently used neural network in solving complex classification problems. In this network, input is given to network along with target output and weights are adjusted in such a way that network can generate target output with several attempts. The MLP structure consists of an input, a hidden and an output layers. Each layer contains a number of neurons that are linked in same pattern with neurons inside the subsequent layer. Each neuron carries a number of predictor variables from previous layer and outputs to the following layer. Each layer plays a significant role in the overall performance of structure. The neurons of one layer are linked to the neurons existed in other layer through a specific connection called weights. These weights are made accountable to carry results from one to another layer. Output of each neuron within input layer will become an input for the hidden layer. Furthermore, output of each neuron from hidden layer converts into an input for every neuron inside output layer. A structure of MLP neural network with one hidden layer can be seen in Fig. 2a. A function is utilized to process input data among the neurons of each layer known as activation function. Moreover, the sigmoid and linear functions are utilized to process input data from input to hidden layer and hidden to output layer, respectively. The network can be trained by adjusting weights through an error back propagation process explained by Hassanpour Kashani et al. (2008). The present study utilizes scaled conjugate gradient (SCG) method which can be swiftly performed twice as compared to Conjugate gradient and converging up to 20 times than scaled conjugate gradient (Sherrod 2008). Møller (1993) has explained SCG algorithm in detail. development of MLP model is described in the next lines. five meteorological variables (Tmax, Tmin, R.Hmean, N, U) are fed in an input layer as predictor variables. calculated ETo is used as a target variable in an output layer. The mapping of input–output relationship with MLP model is presented in Fig. 2b. Six neurons are selected in hidden layer after tuning the neurons. The V-fold cross validation is presently applied to check and validate input data. essential parameters for the development of MLP model are given in Table 3.

Fig. 2
figure 2

Development of MLP model for ETo estimation

Table 3 Parametric values of MLP model after tuning

2.2.3 General Regression Neural Networks (GRNN)

Specht (1990) has proposed a method that allows to contrive and formulate the weighted- neighbor method. A general regression neural network (GRNN) can be utilized for regression problems while probabilistic neural network (PNN) is implemented to solve classification problems. This study employs GRNN network in order to estimate ETo by using meteorological variables. Architecture of GRNN model is shown in Fig. 3.

Fig. 3
figure 3

Configuration of GRNN model

Development of GRNN model to predict ETo is as followed: Five meteorological variables, (Tmax, Tmin, R.Hmean, N, U) are used as predictor variable while ETo is obtained as a target variable. Gaussian function is chosen as a kernel function which is considered one of the best yet among invented (Sherrod 2008). Eliminating neurons frequently minimize the error of developed model. The procedure to eliminate redundant neurons is considered slow. This is an iterative process due to the fact that model needs to be assessed with each persisting neuron to determine the best one as a way to eliminate. Thus, leave on out validation (LOO) with minimize error method is employed to find optimal number of neurons. The essential parametric values for GRNN model are given in Table 4.

Table 4 Tuning parameters of GRNN model

2.2.4 Support Vector Machine (SVM)

Eventually, the concept and idea of learning machine is suggested by Turing (1950). Support vector machine is scrutinized and regarded as a supervised learning model which is built on statistical learning theory. The statistical learning theory is familiarized and made conversant by Vapnik (1998). Support vector regression (SVR) is frequently used to narrate SVM approach. SVR has capability to apply structural risk minimization principle in a reliable way. It is an inductive principle for model selection which is used for finite training data sets. It describes a general model of capacity control and provides a relationship between space complexity and quality of fitting the training data. Moreover, it emphasizes on an upper bound instead of training error in the data set. Minimum value of upper bound produces a best model. Based on this principle, SVR achieves an optimum network structure. More details regarding this principle can be found in Wu et al. (2008). The main objective of SVM modeling is to find optimal hyper plane which isolates the clusters of vector. This can be accomplished by separating variables using linear and non-linear dividing. The division of attributes in non-linear case is more complex than linear case. Furthermore, the separation in high dimensional case is easier than low dimensional. Figure a–d illustrate the pictorial view of linear, complex, low and high dimensional cases, respectively (Han and Kamber 2001; Markowetz 2003). The vectors adjoining to hyper plane are termed as support vectors.

Fig. 4
figure 4

Schematic diagram of SVR model

This study employs two kinds of SVR, namely, epsilon (€) and neuron (Nu). There is no general method to determine model of the largest capacity in order to estimate ETo except iterative method. After a large number of iterations, it was found that €-SVR performed good than Nu-SVR. Therefore, €-SVR is used in SVM model for present study. The accuracy of SVR model depends upon following parameters: penalty factor(C), gamma (¥), P, Nu etc. In addition, there is a dire need to determine proper optimal values for these parameters to achieve satisfactory, better and acceptable results. For example, large value of C creates over fitting problem while a small value of C causes under-fitting the model. Therefore, it is mandatory to select optimal values for all parameters to obtain reliable and better results. Grid and pattern (GAP) search are concurrently applied in this study in order to find optimal parametric values. A grid search tries to attempt values of each parameter across the specified search range. This can be done by using geometric steps. A pattern search starts at the center of search range and makes trial steps in each direction for each parameter. More comprehensive details about these searching methods can be found in Sherrod (2008). The attributes (predictor variable) in SVM model are meteorological variables while ETo is selected as predicands (target variable). Four kinds of kernel functions (linear, RBF, polynomial and sigmoid) are concerned and applied with V-fold cross validation method. It is used to construct an efficient model. In addition, number of parameters in SVM model largely depends upon the applied kernel function. Thus, Table 5 provides epigrammatic details about the development of SVM model. It also shows optimal values for all required parameters which are needed corresponding to applied function. Moreover, number of parameters which do not show any contribution in the development of SVM model regarding kernel function is represented by symbol "X" and can be seen in Table 5.

Table 5 Optimal ranges of parameters for SVR model

2.2.5 Group Method of Data Handling (GMDH)

GMDH model was developed by Ivakhnenko (1968). This algorithm presents a unique approach to the problem of modeling and even a new philosophy to scientific research. It is used in various fields such as data mining, prediction and complex system modeling etc. Li et al. (2017) found GMDH better than other classical forecasting algorithms. A schematic diagram of GMDH is shown in Fig. 5. The network starts with just input neurons. The main layer (at the top) presents one contribution for every indicator variable. Each neuron within second layer draws its inputs from two of the input variables. In similar way, the neurons within third layer draw their inputs from two of the neurons inside preceding layer. This procedure proceeds until second final layers fetch their inputs to the ultimate layer. The final layer (at the lowest) draws its two inputs from past layer and produces a single value that's the output of network. Additionally, the links between neurons inside this network are not fixed. It can be observed in Fig. 5 that neuron at the left end of third layer is linked with an input variable rather than output of preceding layer. Thus, current study applies GMDH model in order to draw a relationship between climatic variables and ETo. Five meteorological variables, symbolically, Tmax, Tmin, RHavg, U, N are used as predictor variables. The output layer generates ETo as a target value. Various types of functions such as linear, quadratic, cubic, asymptotic etc. (Table 6) are used in GMDH model to improve model's accuracy. combination of two quadratic functions (1 variable and 2 variable) is applied in the training and testing process for estimation of ETo. results generated with this combination of functions were found the best among other functions. Table 6 indicates essential parametric values for development of GMDH model.

Fig. 5
figure 5

Structure of GMDH model

Table 6 Essential parameters and list of functions for GMDH model

2.2.6 Cascade Correlation Neural Network (CCNN)

Primarily, a CCNN model includes the most effective input and output layers without a hidden layer. Every input is attached to each output neuron by way of a connection with an adjustable weight. Each output neuron receives its input as weighted sum from all of input neurons alongside the bias value. At that point, yield neuron sends this weighted input sum through its transfer function to withdraw final output as shown in Fig. 6a. A simple CCNN also shows good performance in case of small data. A hidden layer is introduced into CCNN model for solving large data set and complex problems. Neurons are delivered to hidden layer one after another. Every new hidden neuron gets a connection from each of the inputs and every pre-present hidden neuron. The hidden neuron's input weights train and then fixed. Simplest output connection weights train over and over. Each new neuron, consequently, provides a new one hidden unit to the network. This prompts the formation of an effective powerful high-order function which includes identifier and deep network with a big variety of inputs to output neurons. Now, the neurons inside hidden layer do not change, once they have been brought. It is known as a cascade because output from all neurons already inside network feeds into the new neurons. A complete structure of CCNN model which is used in the current study has illustrated in the Fig. 6(b). It can be observed that the ETo values are obtained as target outputs in CCNN model after giving meteorological parameters as predictor variables. Mixture of Sigmoid and Gaussian functions is employed as a kernel function in the hidden layer. In the CCNN model, advance parameters are also used with training parameters which enhance the predictive accuracy and minimize the error. Table 7 provides essential parametric values for CCNN model to generate reliable results.

Fig. 6
figure 6

a CCNN simple Structure with no hidden layer. b Illustrative CCNN model for ETo prediction

Table 7 Training and advance parameters of CCNN model

2.2.7 Weaknesses of FAO-PM56 Method vs Strengths of Numerical Methods

The concept of ETo has been widely discussed and used for decades (Doorenbos and Pruitt 1977; Pereira et al. 1999). Allen et al. (1998) introduced a clear definition of FAO-PM56 method for estimation of ETo. One of the most important reasons for advocating a simpler method than FAO-PM56 is the substantial likelihood for inaccuracy in weather data measurement and collection, especially for the developing countries and meteorological stations managed by non-experts. In these situations, the accuracy of data and especially for more advanced parameters such as radiation and humidity, can be very low (Droogers and Allen 2002). Table 8 shows data requirements for FAO-PM56 and data driven models. It can be seen in Table 8 that FAO-PM56 depends upon several parameters which is not easily accessible especially in the developing countries. Alternatively, data driven models only depend upon climatic parameters and require less parameter in comparison to FAO-PM56 method.

Table 8 Data requirements for the applied ETo methods (Standard (FAO-PM56) method vs. Data driven models)

3 Model Performance Evaluation

The goodness of fit is assessed through a set of indicators that are used to compare all the pairs of observed and model-predicted values of selected variables. Nine performing indicators are selected in this study to evaluate the performance of MLP, GRNN, GMDH and CCNN models. These include square of Pearson correlation coefficient (r2), linear regression coefficient (R2) model efficiency (M.E), mean absolute error (MAE) and root mean square error (RMSE), scatter index (SI) which are defined as follows (Legates and McCabe 1999; Alazba et al. 2011; Mattar and Alamoud 2015; Mattar et al. 2015)

$$ r^{2} = \frac{{n \times \left[ {\left\{ {\left\{ {\sum\nolimits_{{i = 1}}^{n} {\left( {{\text{ET}}_{{{\text{obs}}}} \times \overline{{{\text{ET}}}} _{{{\text{est}}}} } \right)} } \right.} \right\}} \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {{\text{ET}}_{{{\text{obs}}}} } } \right) \times \left( {\sum\nolimits_{{i = 1}}^{n} {{\text{ET}}_{{{\text{est}}}} } } \right)} \right\}} \right]}}{{\sqrt {\left[ {n \times \left\{ {\sum\nolimits_{{i = 1}}^{n} {({\text{ET}}_{{{\text{obs}}}} )} } \right\}^{2} } \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {\mkern 1mu} {\text{ET}}_{{{\text{obs}}}} } \right)} \right\}^{2} } \right] \times \left[ {n \times \left\{ {\sum\nolimits_{{i = 1}}^{n} {({\text{ET}}_{{{\text{est}}}} )} } \right\}^{2} } \right] - \left[ {\left\{ {\left( {\sum\nolimits_{{i = 1}}^{n} {\mkern 1mu} {\text{ET}}_{{{\text{est}}}} } \right)} \right\}^{2} } \right]} }}$$
(2)
$$ R^{2} = \frac{{\left[ {\left( {\mathop \sum \nolimits_{i = 1}^{n} (ET_{{{\text{obs}}}} - \overline{{ET_{{{\text{obs}}}} }} } \right) \times \left( {ET_{{{\text{est}}}} - \overline{{ET_{{{\text{est}}}} }} } \right)} \right]^{2} }}{{\left[ {\left( {\mathop \sum \nolimits_{i = 1}^{n} (ET_{{{\text{obs}}}} - \overline{{ET_{{{\text{obs}}}} }} } \right) \times \left( {ET_{{{\text{est}}}} - \overline{{ET_{{{\text{est}}}} }} } \right)} \right]}} $$
(3)
$$ {\text{M}}.{\text{E}} = 1 - \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{N}}} ({\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{sim}}}} )}}{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{N}}} ({\text{ET}}_{{{\text{obs}}}} - \overline{{{\text{ET}}_{{{\text{obs}}}} }})}} $$
(4)
$$ {\text{MAE}} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \frac{{|{\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{sim}}}} |}}{{\text{N}}} $$
(5)
$$ {\text{MAPE}} = \frac{100\% }{N}\mathop \sum \limits_{i = 1}^{N} \left| {\frac{{{\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{est}}}} }}{{{\text{ET}}_{{{\text{obs}}}} }}} \right| $$
(6)
$$ {\text{RMSE}} = \sqrt {\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{N}}} \frac{{({\text{ET}}_{{{\text{obs}}}} - {\text{ET}}_{{{\text{est}}}} )^{2} }}{{\text{N}}}} $$
(7)
$$ {\text{SI}} = \frac{{{\text{RMSE}}}}{{{\text{ET}}_{{{\text{mean}}}} }} $$
(8)

wherein, ETobs, ETest, \(\overline{{{\text{ET}}_{{{\text{obs}}}} }}\)., \(\overline{{{\text{ET}}_{{{\text{est}}}} }}\)., ETmean are observed value, estimated value, average of observed value, average of estimated value, mean of observed value of ETo, respectively and N represents total number of observations. Further two coefficients, namely, skewness (Cs) and variation (CV) are also used in present sdy to distinguish the data.

$$ C_{s} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {O_{i} - \mu } \right)^{3} }}{{\left( {n - 1} \right) \times \sigma^{3} }} $$
(9)
$$ C.V = \frac{\sigma }{\mu } $$
(10)

Here, Oi, n, σ, and µ represent observed data, number of data points, standard deviation and mean value of data set, respectively. Cs determine the symmetry of distribution and σ measures amount of variation in the given data set (Bland and Altman 1996).

4 Results and Discussions

4.1 Experimental Data

The values of meteorological data on monthly basis are used to develop data driven models. Data sets used for data driven models based on climatic data are divided into two subsets. These include training and testing data sets. Training data set of each climatic station includes 70% of observed data while testing data set contains 30% of observed data collected in the period of years. The climatic data duration corresponding to each climatic station is given in Table 1. Moreover, Table 9 shows the value of statistical parameters of climatic data such as minimum, maximum, mean, standard deviation and skewness coefficient values of the selected countries. Additionally, averages of aforementioned parameters for six selected cities of Pakistan are estimated and presented in Table 9. Likewise, averages of statistical parameters of meteorological parameters for other locations used in the current study are also given in Table 9.

Table 9 Statistical parameters of selected countries used in current study

4.2 Performance Analysis

At the beginning, data driven models are trained with extracted data from the FAO-PM56 equation. As a baseline for comparison of the performance of SVM with other data driven models, the same training and testing data set was considered for stations of Pakistan. To estimate FAO-PM56 ETo values, r2, RMSE, MAE and ME are used in comparison between the FAO-PM56 ETo and predicted values of SVM, GMDH, GRNN, CCNN and MLP models. The performing indices, R2 and RMSE of testing data set for each selected city of Pakistan are presented in Fig. 7a and b, respectively. To present results in elegant way, abbreviations of the selected cities of Pakistan are used for graphical representations. These are followed as: Faisalabad (Fsd), Multan (Mul), Islamabad (ISD), Jacobabad (Jac), Peshawar (Pes) and skardu (Skr). It can be observed in Fig. 7a that SVM model looks superior to other numerical computing models in all the selected cities of Pakistan. However, GMDH and GRNN models show almost equal outcomes in comparison to MLP model. In addition, CCNN model generates low r2 values except in Faisalabad city. It is evident from Fig. 7a that all models have acceptable square of Pearson correlation coefficient (r2), which is higher than 90%. It means that there is no over fitting of training and testing data of the numerical computing models. In other words, the data driven models capture all nonlinearity of ETo, which is the main advantage of data driven models. Likewise, it can be seen in Fig. 7b that SVM model generates low RMSE value for the selected cities of Pakistan which varies from 0.13 to 0.50 mm/month. On contrary, GMDH, GRNN, MLP and CCNN models show high value of RMSE. Although GRNN and MLP models generate high value of RMSE as compared to SVM model but have performed well in comparison to GMDH and CCNN models for all the selected cities of Pakistan. The RMSE values for all the chosen cities of Pakistan by employing GRNN and MLP models varies from 0.7 to 0.70 and 0.18–0.81 mm/month, respectively. Thus, it can be concluded that SVM model has more capability among all other numerical computing models to predict accurate ETo in various climatic regions. However, GRNN and MLP models also show good performance in comparison to GMDH and CCNN models for all selected cities of Pakistan. The summary performance indices, M.E MAE, MAPE, SI, of various approaches towards estimating ETo for various climatic zones of Pakistan can be seen in Table 10. It can be observed in Table 10 that model efficiency of all the data driven models in various climatic zones of Pakistan was found greater than 90%. As it can be seen model efficiency is higher for SVM model as compared to the use of other models in each climatic zone of Pakistan. However, MLP model looks superior in comparison to GRNN, CCNN and GMDH models in Multan, Peshawar and Skardu climatic stations. In contrast, GMDH model shows good performance as compared to MLP, CCNN and GRNN models in Islamabad and Jacobabad cities. Furthermore, CCNN model has greater value for M.E in Multan city as compared to MLP and GMDH models while it shows slightly better performance than GRNN model. Table 10 further supports our above discussion and expresses the performing indices of all numerical computing methods for each climatic station of Pakistan. Additionally, SVM model has attained 99 percent value for M.E in order to estimate ETo. Moreover, Table 10 also demonstrates M.E value for SVM approach in selected climatic stations of Pakistan which lies between 95 and 100 percent. In general, the results indicate that SVM approach has the best estimating capability for ETo estimation. similar results were obtained by Shrestha and Shukla (2015) for tropical environment. Two indicators, namely, MAE and SI are further calculated to endorse the selected numerical computing methods for each climatic station. These two indicators have values range from 0 to 1. If value approaches to 0, it shows the perfect fit of model. Alternatively, model shows less predictive accuracy when its outcome reaches to 1. It can also be observed in Table 10 that MAE and SI indices have least values for SVM model in each climatic station of Pakistan. In addition, SVM model has 0 values for SI while it shows minimum value of MAE as compared to the other models in each selected cities of Pakistan. According to the results depicted in Table 10 that MLP has smaller values for SI and MAE in comparison of GRNN, CCNN and GMDH models in Multan, Peshawar and Skardu climatic stations. On the other hand, MLP, CCNN and GRNN models generate higher values for MAE and SI as compared to GMDH model in Islamabad and Jacobabad cities. Additionally, CCNN has lower values of SI and MAE than MLP, GMDH and GRNN models for Multan city. above whole discussion could be summarized as all selected data driven models except SVM do not maintain consistency of their results in each climatic zone which can be clearly observed in Table 10. This may be due to the fact that the climatic parameters in training process for all selected models excluding SVM generate higher residual values which become responsible to reduce their efficiencies in order to estimate ETo. Furthermore, GMDH, MLP, CCNN and GRNN show acceptable values for all performing indices (M.E, MAE, MAPE and SI) in each selected climatic stations of Pakistan. An inclusive detail about each data driven model regarding performing indices for selected climatic stations of Pakistan is given in Table 10. In addition to summarized discussion, the model efficiencies (M.Es.) of SVM model for FSd, Jac, Mul, Pes and Skr are 95.4%, 99.06%, 99.36%, 98.7% and 99.64%, respectively that stipulate the supremacy of SVM among other applied data driven models. Surprisingly, the efficiency of SVM for ISD station is lower as compared to other models but the least value of error (i.e. MAPE = 7.63%) further assist the aforementioned remarks regarding consistency and strength. The performance of each data driven model depends upon its training algorithm. It is difficult to decide which data driven model has the best performance except applied in different conditions especially in case of climatic studies. Although data driven models always performed better than local calibrated physical model or conventional methods (Karimaldini et al. 2011; Aytek 2009; Tabari and Talaee 2013). But mostly, it was found that the model considered as the best for one climatic region does not produce efficient result in another climatic condition. This is due to fact that the model optimal structure determined by using data driven model based on machine learning algorithms are of critical importance. Determination of the optimal structure of data driven model seems impossible without iteration process. Hence, various types of training algorithm with a number of iterations have been used to determine the best fitted optimal structure. In addition to use of training algorithms, the number of iterations, convergence value and execution times are considered as the prime parameters. Thus, data driven models based on some advanced machine learning algorithms e.g. pattern and search training algorithm in SVM model and/or equation combinations (linear, polynomial, quadratic) in GMDH model were applied in this study in order to investigate the performance.

Fig. 7
figure 7

r2 and RMSE values for the selected cities of Pakistan

Table 10 Performance parameters for ETo assessment in Pakistan using selected approaches

4.3 Results of Developed Machine Learning Models in Other Climatic Regions

In section, developed data driven models were tested on five other stations located in China, New Zealand and USA (Illinois State) by using 4-year climatic data. The climatic data of these stations is also comprised of same meteorological input parameters. The statistical average values of all meteorological variables for selected countries calculated and presented in Table 8. Geographical description and data duration of each climatic station is mentioned in Table 2. climatic characteristics of these stations can be seen in Table 3. ETo values of these stations were estimated by FAO-PM 56 standard equation. Selected data driven models, SVM, GMDH, MLP, CCNN and GRNN, directly applied on these climatic stations to estimate ETo values without using their data in training phase. The performance indices, namely, CV, RMSE, r, and MSE are calculated for each station to investigate their performance. results of each data driven model are summarized in Table 11. As the results indicated (Table 11), the SVM model had the best performance in Auckland station (r = 0.99, RMSE = 0.09 mm/day, CV = 0.17 and MSE = 0.12 mm/day) among other models. However, the models GMDH and MLP had performed equally well with r = 0.98 and RMSE = 0.13 mm/day in the test phase. results generated with GRNN and CCNN models are comparatively lower than SVM, GMDH and MLP models. The GRNN model had yielded results as r = 0.97, RMSE = 0.12 mm/day, CV = 0.19, MSE = 0.15 mm/day while CCNN model had generated r = 0.96, RMSE = 0.15 mm/day, CV = 0.23 and MSE = 0.22 mm/day, respectively. It can be evidently observed that RMSE values of GRNN and MLP models are remained identical i.e. 0.12 mm/day. In addition, the correlation between observed and predicted values for all data driven models had persisted above 0.95, although SVM model attains value equivalent to 1. Likewise, SVM model shows good performance for the Harbin station which is located in china country and have extremely cold climatic conditions. The values of r = 1 and RMSE = 0.10 mm/day makes SVM an ideal model for this climatic station while the values for CV and MSE equal to 0 further legitimate it. It should be noted that r term provides information for the linear dependence between observations and corresponding estimates. Furthermore, MLP and GMDH models provide closer ETo estimates to the benchmark ETo than GRNN and CCNN models. The values of r and RMSE for both MLP and GMDH models are greater than GRNN and CNN models. However, MLP model had performed better as compared to GMDH model. As shown in Table 9, GMDH model generates exactly twice values for CV and RMSE which are followed as 0.12 and 0.30 mm/day, respectively. Although, the performance of GRNN and CCNN models are shown lower in comparison to SVM, MLP and GMDH models but yield good results in comparison to Auckland station. In addition, MSE value for each model is categorically zero while the value of correlation coefficient had attained 0.95 or above. Generally, it can be concluded from the performance evaluation indicators that SVM had the best performance and superior to other data driven models. The MLP model with a RMSE of 0.15 mm/day and MSE of 0.06 mm/day has provided the second best ETo estimates. In contrast, the CCNN model has the highest error rates (RMSE = 0.46 mm/day and MSE = 0.21 mm/day). The outcomes of all data driven models are found in Table 11 for Belleville, Big Bend and Dixon Spring stations. According to the performance statistics, SVM model with extreme less values of CV, r and RMSE (CV = 0.13–0.16, r = 0.96–0.99 RMSE = 0.016–0.025 mm/day) is considered as the best among other data driven models for the Belleville, Big Bend and Dixon Spring stations. In addition, for all aforementioned climatic stations, SVM model has zero value for MSE which further supports its supremacy. It can also be inferred from results in Table 11 that the second best model for estimation of ETo can be considered as a GMDH model. The outcomes of GMDH model are more precise as compared to MLP, GRNN and CCNN models. In addition, this model yields more accurate ETo with increasing correlation coefficient from 0.96 to 0.99 and decreasing RMSE and MSE values from 0.025 and 0.0003 to 0.016 and 0.0001, respectively. From above analysis, it can be generally concluded that the performance of SVM model is superior to MLP, GMDH, GRNN and CCNN models for all five selected climatic regions situated in USA, China and New Zealand. As mentioned earlier and can be seen from Table 4, radial basis function (RBF) was the best kernel for SVM model. Tabari et al. (2012) had found RBF as a best kernel function among others for all developed SVM models. A precise selection of kernel function is responsible for the given data set to provide feature space for its separation This unique feature creates a boundary between SVM and other previously developed algorithms (Bray and Han 2004). Furthermore, Table 11 also revealed that GMDH is comparatively superior to MLP in Auckland while produces lower results as compared with MLP model in Harbin stations. In addition, MLP model had the best performance in Harbin station but generates poor results for Belleville and Big bend stations with increasing values of CV and RMSE from 0.17 and 0.022 to 0.34 and 0.039, respectively. However, GMDH model could be endured the second best model in order to estimate ETo. Consequently, it may be evident that GMDH model has utmost performance as compared to MLP, GRNN and CCNN models but inferior to SVM model. Moreover, GRNN model generates more reliable results than CCNN model by decreasing values of RMSE from 0.021 to 0.43 mm/day. Additionally, the values of correlation coefficient for CCNN model are decreasing from 0.96 to 0.89. Finally, it can be concluded that MLP is the third best model while GRNN and CCNN models are ranked as fourth and fifth best model for estimation of ETo.

Table 11 Results of each applied data driven model in testing phase for other five selected climatic stations

The mean monthly ETo of five selected stations estimated by data driven model and FAO-PM56 method to authenticate model's performance. The hydrographical form of ETo values for five selected stations can perceptibly be seen in Fig. 8. Initially, ETo values were observed as low and then increased gradually when number of months increased. But after certain interval, ETo goes in downward direction from peak value. Similarly, this could be happened for all intervals and observed lower and higher peaks of ETo. Surely, this trend could be indicated winter and summer seasons. In winter season, temperature recorded minimum and relative humidity was maximum, the value of ETo estimated minimum. Alternatively, for summer season, temperature recorded maximum and relative humidity was minimum, ETo value estimated maximum. To further check the underestimation and overestimation of ETo, data driven models against FAO-PM56 method was compared. It can be perceived from Fig. 8 that MLP and CCNN models indicate overestimation while values of GRNN and GMDH models remain under peak of FAO-PM56 ETo values for the Harbin station. For SVM data driven model, estimated ETo is trailing well with FAO-PM56 ETo values. In contrast, overestimation for MLP, GRNN and GMDH models in comparison to FAO-PM56 ETo values was observed in Auckland station. However, the values of SVM model closely overlap with the standard FAO-PM56 ETo values. Likewise, ETo estimated by SVM model agrees with the FAO-PM56 ETo and follows same trend in both Big bend and Belleville stations as observed in Fig. 8. Conversely, results of MLP model look extremely below from the FAO-PM56 ETo values in Big bend station while remain equal with other models for Belleville station. Additionally, GMDH model has performed better than GRNN and CCNN models in both cities. This also authenticates our results for GMDH model indicated in Table 11. For Dixon Spring station, all data driven models generate smooth peaks for ETo values as compared to FAO-PM56 method. This also indicates that these data driven models only deal with available data set and do not affect with other myriad factors. The deviation of ETo values for SVM among data driven models was very minute and more close that estimated by FAO-PM56 method. Thus, it can be concluded from earlier discussion that SVM model performed the best in all climatic conditions for ETo estimation. However, GMDH model generates reliable results as compared to MLP, GRNN and CNN models, therefore ranked as the 2nd best model. While MLP, CCNN and GRNN models ranked 3rd place to 5th, respectively in the estimation of ETo. The linear regression test facilitates the extension of statistical analysis. Figures 9, 10, 11, 12, 13 indicates ETo estimates of all data driven models in form of scatter plots for each station. In these plots, ETo estimated by the FAO-PM56 method are plotted against with numerical estimated ETo values, in the form of scatter plots. It can be observed that there is a linear pattern of points for all applied data driven models. There were some misplaced estimated points which correspond to incorrectly estimated ETo data, with some high values for all methods except SVM model. similar results were observed in Tabari et al. (2012). This may be due to intermittent character of ETo distribution and deviate from trend line. It can be observed in Fig. 9 that data driven models generate acceptable R2 values for the Harbin station (China) ranges from 94 to 98%. For SVM, MLP and GRNN data driven models, R2 value found in equal (i.e. 98%) while CCNN and SVM models have 94% nad 95% repectively. Likewise, ETo values estimated by data driven modelsin comparison to FAO-PM56 methods for Auckland (New Zealand) is shown in Fig. 10. As indicated, SVM models performed the best with the highest value of R2 (i.e. 95%) as compared to other data driven models. MLP and GRNN models performed equal while CCNN model generates loewest (i.e. 92%) value for Auckland station. In simialr manner, comparison of applied numrical models against the standard FAO-PM56 method for Big Bend, Belliville, Dixon Spring (Illinios state, USA) are presented in Fig. 11, Fig. 12 and Fig. 13, respectively. SVM and GMDH performed the best as compared to CCNN, MLP and GRNN model as obserevd in Fig. 11. The highest value of R2 (93%) in Big Bend station was found for SVM model. Similarly, for Belliville (Fig. 12) and Dixon Spring stations (Fig. 13), SVM model outperformed in comaprison to GRNN, CCNN, MLP. The R2 value for SVM in Belliville and Dixon Springs climaic stations was nealy found 94% and 92% respectively. It is obvious from given fitted lines in all scatter plots (Figs. 9, 10, 11, 12, 13) that SVM model have closer to PM-ETo values in comparison to other data driven models. Plotted points of SVM model correlate well towards linear trend line. However, plotted points of other data driven model are of major underestimation or overestimation. It can be deduced from Figs. 9, 10, 11, 12, 13 that GMDH model has the higher value of R2 after SVM model for Auckland station (i.e. R2 = 94%). The superiority of GMDH model over MLP, GMDH, CCRN can also be noticeably comprehended in, Big Bend, Dixon Spring climatic stations. While MLP and CCRN performed better than GMDH model in the Harbin and Belleville climatic stations, respectively. similar results of GMDH model can be found in Table 11 which demonstrates the supremacy of GMDH model over data driven models except a SVM model. It can be concluded that SVM model has the highest linear regression coefficient (R2) in all types of climatic conditions and could be considered as the best data driven model for the ETo estimation. From above discussions, GMDH and MLP data driven models are ranked as the 2nd and 3rd best models for estimation of ETo. In contrast, GRNN and CCNN model have lower value of R2 but can be acceptable for all climatic regions. Therefore, CCNN is ranked as 4th while GRNN model is considered as the 5th best model. It is evident from scatter plots that all methods have acceptable linear regression coefficient values.

Fig. 8
figure 8

The ETo values estimated by the FAO-PM56 and other models in the test phase

Fig. 9
figure 9

Scatter Plot for Harbin city (ETo of FAO-PM56 vs data driven models)

Fig. 10
figure 10

Scatter Plot for Auckland city (ETo of FAO-PM56 vs data driven models)

Fig. 11
figure 11

Scatter Plot for Big Bend city (ETo of FAO-PM56 vs data driven models)

Fig. 12
figure 12

Scatter Plot for Belleville city (ETo of FAO-PM56 vs data driven models)

Fig. 13
figure 13

Scatter Plot for Dixon Spring city (ETo of FAO-PM56 vs data driven models)

For all data driven models, regression equation of each selected climatic station is found in Figs. 9, 10, 11, 12, 13. It can be perceived that SVM outperformed than other data driven models and shows higher correlation coefficient value in each selected station. The value of slope(a) and y-intercept (b) in regression equation (Y = aX + b) for SVM model was also found good, for "a" achieved higher and "b" look smaller, in comparison to other data driven models (Figs. 9, 10, 11, 12, 13). In addition, a comparison of ETo estimated by the standard conventional FAO-PM56 method and data driven models for selected countries is presented in Table 12 to conclude above mentioned results. Skardu station from Pakistan was included to make reasonable comparison among data driven models because it belongs to humid region. It can be concluded that all the data driven modelsshown good results when applied in differernt cliamtic conditions. Overall results indicate that SVM model outperformed for all selected climatic stations and yielded relaible results for ETo estimation. This may be due to the fact that SVM model always seeks a global optimized solution, avoids over-fitting and no physical bias which eventually leads to the best performance over other data driven models.

Table 12 Comparison of FAO-PM56 and data driven models between monthly averages value of ETo for selected stations

5 Conclusions

In this study, the potential of five machine learning algorithms/data driven models (SVM, GMDH, CCNN, MLP and GRNN) were investigated to estimate long term monthly reference evapotranspiration (ETo). For this purpose, meteorological variables of maximum and minimum temperatures, average relative humidity, wind speed and sunshine hours were employed as input in data driven models for six stations of Pakistan. The results obtained from all data driven models were found acceptable for selected climatic stations. Most of studies exited in literature used 70% of data as training/input data. And rest of 30% data for testing/validation. In this situation, R2 mostly comes above 95%. This practice is not justified and always arose question regarding model's performance. Therefore, the data set from other countries have been used for validation and justification of each data driven model. The following conclusions can be drawn from the study:

  • The long term monthly reference evapotranspiration (ETo) of four climatic regions were successfully estimated by conventional standard FAO-PM56 method

  • Machine learning algorithms/data driven models of SVM, GMDH, CCNN, MLP and GRNN were found capable for ETo estimation under different climatic zones of Pakistan as well as in other parts of the world

  • By investigating the performing indices, SVM and GMDH model ranked as 1st and 2nd while 3rd, 4th and 5th followed by MLP, CCCN and GRNN data driven models

  • The comparison results indicate that by using the similar meteorological inputs, SVM can be employed as an alternative ETo model to the existing conventional method