1 Introduction

Evapotranspiration is one of the most important parameters in hydrologic cycle. It is very important for irrigation projects and determination of irrigation regime. Irrigation can be applied with considerable savings by correct estimation of evapotranspiration.

In general, the combination of energy balance/aerodynamic equations provides accurate results because they are based on physics and rational relationships (Jensen et al. 1990). For this reason, the Food and Agricultural Organization of the United Nations (FAO) accepted the FAO Penman-Monteith as the standard equation for estimation of evapotranspiration (Allen et al. 1998; Naoum and Tsanis 2003).

The application of artificial neural networks (ANNs) for reference evapotranspiration (ET 0 ) modeling has received much attention in recent years (Trajkovic et al. 2003; Trajkovic 2005; Kisi 2006a, b, 2007a; Jain et al. 2008; Kim and Kim 2008; Kumar et al. 2009; Hamid et al. 2011; Kilic 2011; Tabari et al. 2012; Shirin Manesh et al. 2013; Kim et al. 2014; Adamala et al. 2014; Deo and Sahin 2015). Trajkovic et al. (2003) forecasted ET 0 using radial basis neural network (RBNN). Trajkovic (2005) employed temperature-based RBNN models for estimating FAO-56PM ET 0 . Kisi (2006a) investigated the accuracy of ANN method in estimating ET 0 , and he compared ANN results with the Penman and Hargreaves models. He showed that the ANN model performed better than the empirical models. Kisi (2006b) modeled ET 0 by using generalized regression neural network (GRNN) models. Kisi (2007a) modeled ET 0 by using multi-layer perceptron and compared test results with the Penman, Hargreaves, and Turc models. The superiority of the multi-layer perceptron to the empirical models was shown in this study. Jain et al. (2008) employed ANN for estimating ET 0 and proposed a procedure to evaluate the effects of input variables on the output using the weight connections of ANN. Kim and Kim (2008) examined the ability of GRNN model for estimating the alfalfa ET 0 . Kumar et al. (2009) employed different ANN models for modeling ET 0 under the arid conditions and compared the results with the FAO-24 Radiation, Turc, and FAO-24 Blaney–Criddle methods. They indicated that the ANN model performed better than the empirical models. Hamid et al. (2011) investigated the ability of ANN in estimating ET 0 . Tabari et al. (2012) compared different data-driven approaches with climate-based models for ET 0 modeling using limited climatic data in a semi-arid highland environment. Shirin Manesh et al. (2013) evaluated the different ANN models to estimate and interpolate the ET 0 in Fars province of Iran. Kim et al. (2014) used two different ANN methods in estimation of ET 0 . Adamala et al. (2014) utilized the second-order ANN method to model the ET 0 in different climatic zones of India. Gocic et al. (2015) forecasted ET 0 data collected during the period 1980–2010 in Serbia using ANN and support vector machine. Deo and Sahin (2015) used ANN for prediction of monthly standardized precipitation and evapotranspiration index using hydrometeorological parameters and climate indices in eastern Australia. The use of M5 model tree (M5Tree) is limited in the hydrology literature (Bhattacharya and Solomatine 2005; Deswal 2008; Pal and Deswal 2009; Singh et al. 2010; Sattari et al. 2013; Rahimikhoob et al. 2013; Rahimikhoob 2014). Bhattacharya and Solomatine (2005) used ANN and M5Tree for modeling water level–discharge relationship. Deswal (2008) investigated the prediction of pan evaporation using M5Tree. Pal and Deswal (2009) investigated the potential of M5Tree to model daily ET 0 using climatic data of Davis station maintained by California irrigation Management Information System (CIMIS). Singh et al. (2010) explored the potential of ANN and M5Tree to estimate the mean annual flood. Sattari et al. (2013) investigated the potential of M5Tree tree in predicting daily stream flows in the Sohu River located within the municipal borders of Ankara, Turkey. Rahimikhoob et al. (2013) assessed the performance of M5Tree for converting pan evaporation data to ET 0 in semi-arid regions. Rahimikhoob (2014) evaluated the performances of ANN and M5Tree for estimating ET 0 at four meteorological sites in an arid climate. To the best knowledge of the authors, there is not any published work related to investigate the generalization ability of M5Tree in modeling ET 0 .

The main aim of the present study is to examine the ability of ANNs in (i) locally modeling ET 0 of six stations, (ii) estimating ET 0 of one station using the data from the other two stations, and (iii) estimating ET 0 of three stations located in the Southern region using data of three stations located in San Joaquin region. The performances of the ANN and M5Tree models are compared with the CIMIS Penman, Hargreaves, Ritchie, and Turc empirical models.

2 Artificial neural network

Artificial neural networks (ANNs), inspired from biological nervous system, are massively parallel systems composed of many processing elements. The network consists of layers and each one comprises processing elements, called neurons. Each layer is fully connected to the proceeding layer by interconnection weights. Initial weight values are randomly assigned, and then they are progressively corrected during a training process that compares calculated outputs to known outputs. By this was, the errors are backpropagated to determine the appropriate weight adjustments necessary to minimize the errors (Kisi, 2005).

In the current study, the Levenberg–Marquardt was used for the training of ANN models because this technique is more powerful and faster than the conventional gradient descent technique (Hagan and Menhaj, 1994; Kisi, 2007b). The detailed theoretical information about ANN can be obtained from Haykin (1998).

A difficult task with ANN involves choosing the hidden nodes’ number. In the current study, the network with one hidden layer was used and the hidden nodes’ number was determined by trial-and-error method. The sigmoid and linear activation functions were used for the hidden and output nodes of the ANN models, respectively. The network training was stopped after 100 epochs since the variation of error was too small after this epoch.

3 M5 model tree

M5 model tree (M5Tree) is first introduced by (Quinlan 1992). In M5Tree, the parameter space is split into subspaces and a local linear regression model is built for each of them. Regression trees have constant values at their leaves. M5Tree, however, has linear regression functions at the leaves and generalizes the concepts of regression trees (Breiman et al. 1984). So, they are similar to piece-wise linear functions. Model trees can adequately learn and succeed in tasks with very high dimensionality. The main advantage of M5Tree compared to regression trees is that it is much smaller than regression trees, the decision strength is clear, and the regression functions do not normally contain numerous variables (Bhattacharya and Solomatine 2005). The splitting criterion for the M5Tree depends on treating the standard deviation of the class values and calculating the probable reduction in this error in consequence of testing each attribute at that node. Equation (1) is used for calculating the standard deviation reduction (SDR) (Pal and Deswal, 2009):

$$ SDR=sd(T)-{\displaystyle \sum}\frac{\left|{T}_i\right|}{\left|T\right|}sd\left({T}_i\right) $$
(1)

where T indicates a set of examples that reaches the node, T i is the subset of examples that have the ith outcome of the potential set and sd refers the standard deviation (Rahimikhoob et al., 2013; Wang and Witten, 1997). Because of the splitting process, the data in child nodes have less sd in comparison to parent nodes and therefore are purer. Further details of an M5Tree can be attained from Quinlan (1992).

4 Case study

The daily climatic data of King-City Oasis Rd. Station (latitude 36° 07′ 17″ N, longitude 121° 05′ 02″ W), Arroyo Seco Station (latitude 36° 21′ 32″ N, longitude 121° 17′ 25″ W), Salinas North Station (latitude 36° 43′ 00″ N, longitude 121° 41′ 27″ W), San Luis Obispo Station (latitude 35° 18′ 22″ N, longitude 120° 39′ 37″ W), Santa Monica Station (latitude 34° 02′ 28″ N, longitude 118o 28′ 34″ W), and Santa Barbara Station (latitude 34° 26′ 16″ N, longitude 119° 44′ 10″ W) operated by the California Irrigation Management Information System (CIMIS) located in the USA were used in the study. King-City Oasis Rd., Arroyo Seco, and Salinas North stations are located in San Joaquin region while the San Luis Obispo, Santa Monica, and Santa Barbara stations are located in the Southern region (Figs. 1, 2, and 3). Detailed information about data measurements can be obtained from the CIMIS web site (http://www.cimis.water.ca.gov). The King-City Oasis Rd., Arroyo Seco, Salinas North, San Luis Obispo, Santa Monica, and Santa Barbara stations are 165, 72, 19, 101, 104, and 76 m below the sea level, respectively.

Fig. 1
figure 1

The locations of San Joaquin and Southern regions in California

Fig. 2
figure 2

The locations of King-City Oasis Rd. (113), Arroyo Seco (114), and Salinas North (116) stations in San Joaquin region, California

Fig. 3
figure 3

The locations of San Luis Obispo (52), Santa Monica (99), and Santa Barbara (107) in Southern region, California

The data sample covers 16 years (1994–2009) of daily records of air temperature (T), solar radiation (SR), wind speed (U 2), and relative humidity (RH). Before applying ANN models, missing data were removed from the whole data set. First, 8-year (1994–2001) data were used to train the ANN models; second, 4-year (2002–2005) data were used for validation; and the remaining 4-year (2006–2009) data were used for testing. The daily statistical parameters of the climatic data are given in Table 1. In this table, the x mean, Sx, Cv, Csx, x max, and x min denote the mean, standard deviation, variation coefficient, skewness, maximum, and minimum, respectively. The air temperature and solar radiation shows significantly low skewed distribution for the King-City Oasis Rd., Arroyo Seco, and Santa Barbara stations (see Csx values in Table 1). Among the climatic data used in the study, the wind speed data show high skewed distribution especially for the Santa Monica Station. As can be seen from the correlation coefficients in Table 1, the solar radiation has the highest correlation with the ET 0 . Wind speed has the least correlation with ET 0 except for the Santa Barbara Station.

Table 1 The daily statistical parameters of the climatic data used in the study

5 Application and results

5.1 Application of ANNs in modeling ET0

In the present study, the generalization ability of ANNs and M5Tree in modeling ET 0 is investigated. The ET 0 values were calculated using the FAO-56PM method as described in Allen et al. (1998):

$$ E{T}_0=\frac{0.408\varDelta \left({R}_n-G\right)+\gamma \frac{900}{T+273}{U}_2\left({e}_a-{e}_d\right)}{\varDelta +\gamma \left(1+0.34{U}_2\right)} $$
(1)

where ET 0  = reference evapotranspiration (mm day−1), Δ = slope of the saturation vapor pressure function (kPa °C−1), R n  = net radiation (MJ m−2 day−1), G = soil heat flux density (MJ m−2 day−1), γ = psychometric constant (kPa °C−1), T = mean air temperature (°C), U 2 = average 24-h wind speed at 2 m height (m s−1), e a  = saturation vapor pressure (kPa), and e d  = actual vapor pressure (kPa).

The ANN and M5Tree models were calibrated using the daily climatic inputs, T, SR, U 2, RH, and output ET 0 values calculated using the FAO-56PM method. Root mean square errors (RMSE), mean absolute errors (MAE), and determination coefficient (R2) were used for evaluating the accuracy of the models. The RMSE and MAE can be respectively expressed as:

$$ \mathrm{RMSE}=\sqrt{\frac{1}{N}{\displaystyle \sum_{i=1}^N{\left({Ei}_{\mathrm{observed}}-{Ei}_{\mathrm{predicted}}\right)}^2}} $$
(2)
$$ MAE=\frac{1}{N}{\displaystyle \sum_{i=1}^N\left|E{i}_{observed}-E{i}_{predicted}\right|} $$
(3)

where N and Ei denote the number of data and ET 0 , respectively.

In the first part of the study, the ANN and M5Tree models were used for estimating ET 0 of six stations and results were compared with the empirical methods, CIMIS Penman, Hargreaves-Samani, Ritchie, and Turc. The CIMIS Penman equation uses the modified Penman equation (Pruitt and Doorenbos 1977) with a wind function developed at the University of California, Davis. The method uses hourly average climatic data as an input to calculate hourly ET 0 . The 24-hourly ET 0 values for the day (midnight-to-midnight) are then summed to obtain daily ET 0 . The hourly PM equation that CIMIS uses to estimate hourly PM ET 0 is the Food and Agricultural Organization’s version that is described in Irrigation and Drainage Paper No. 56 (Allen et al. 1998). The CIMIS Penman equation is also described in detail in Hidalgo et al. (2005), (see CIMIS website: http://wwwcimis.water.ca.gov/cimis/infoEtoCimisEquation.jsp);

$$ E{T}_0=\left(\frac{\varDelta }{\varDelta +\gamma}\right){R}_n+\left(1-\frac{\varDelta }{\varDelta +\gamma}\right)\left({e}_a-{e}_d\right){f}_U $$
(4)

where ET 0  = hourly reference evapotranspiration (mm day−1), Δ = slope of the saturation vapor pressure function (kPa °C−1), R n  = mean hourly net radiation (Wm−2), γ = psychometric constant (kPa °C−1), e a is the saturation vapor pressure (kPa), e d is the actual vapor pressure (kPa), and the f U  = wind function (m s−1). Daily ET 0 equals to the sum of 24 h ET 0 (mm).

Hargreaves and Samani (1985) equation can be defined as

$$ E{T}_0=0.408\times 0.0023{R}_a\left(\frac{T_{\max }+{T}_{\min }}{2}+17.8\right){\left({T}_{\max }-{T}_{\min}\right)}^{0.5} $$
(5)

where ET 0 is reference evapotranspiration (mm day−1), T max and T min are maximum and minimum temperatures (°C), respectively, and R a is extraterrestrial radiation (MJ m−2 day−1).

The Ritchie method can be given as (Jones and Ritchie, 1990)

$$ E{T}_0={\alpha}_1.\left[3.87\times {10}^{-3}.{R}_s.\left(0.6{T}_{\max }+0.4{T}_{\min }+29\right)\right] $$
(6)

where ET 0  = reference evapotranspiration (mm day−1), T max and T min = maximum and minimum temperature (°C), and R s  = solar radiation (MJ m−2 day−1).

where

$$ \begin{array}{cc}\hfill 5<{T}_{\max}\le 35\;{}^0C\hfill & \hfill {\alpha}_1=1.1\hfill \\ {}\hfill {T}_{\max }>35\;{}^0C\hfill & \hfill {\alpha}_1=1.1+0.05\left({T}_{\max }-35\right)\hfill \\ {}\hfill {T}_{\max }<5\;{}^0C\kern0.24em \hfill & \hfill {\alpha}_1=0.01. \exp \left[0.18\left({T}_{\max }+20\right)\right]\hfill \end{array} $$
(7)

Turc (1961) formula with reduced wind data is also a common method for calculating ET 0

$$ E{T}_0=\left(23.89{R}_s+50\right)\frac{0.013T}{T+15}\left[1-{W}_{RH}\left(0.71-1.43RH/100\right)\right] $$
(8)

where W RH = 1 when RH < 50 % and W RH = 0 when RH > 50 %.

Two different ANN and M5Tree models were developed in the study. ANN1 and M5Tree1 models have four inputs comprising whole climatic variables. The ANN2 and M5Tree2 models with two inputs, T and SR, were also developed for the valid comparison with two-parameter Hargreaves-Samani, Ritchie, and Turc methods. For the ANN models, the optimal hidden node numbers were determined by using trial-and-error method. The optimal ANN and M5Tree models are compared with empirical methods in Table 2. It is clear from the table that the four-input ANN1 and M5Tree1 models perform better than the CIMIS Penman models for all stations. In five stations out of six, two-input ANN2 and M5Tree2 models showed better accuracy than the Hargreaves, Ritchie, and Turc models. In the Salinas North Station, Ritchie model has slightly better accuracy than the ANN2 model. M5Tree2 model performs worse than the Hargreaves and Ritchie models in this station. The ANN models generally performs better than the M5Tree models in local ET 0 estimation. In the Santa Monica Station, the M5Tree1 model has a slightly better accuracy than the ANN1 model. As an example, the estimates of the ANN, M5Tree, and empirical models are compared in Fig. 4 for the King-City Oasis Rd. Station. As seen from the scatterplots, the ANN and M5Tree models have less scattered estimates than those of the corresponding empirical models.

Table 2 The test statistics of the ANN, M5Tree, and empirical models in the first application
Fig. 4
figure 4

The FAO-56PM and estimated ET 0 values by the ANN, M5Tree, and empirical models in test period—King-City Oasis Rd. Station

In the second part of the study, the four- and two-input ANN and M5Tree models obtained from one station in the previous application were tested using the data from other two stations for each region. In the San Joaquin region, the optimal ANN and M5Tree models obtained for the King-City Oasis Rd. Station in the previous application were tested using the data of Arroyo Seco and Salinas North stations. In the Southern region, the optimal ANN and M5Tree models obtained for the San Luis Obispo Station were tested by the data of Santa Monica and Santa Barbara stations. Test results of the ANN, M5Tree, and empirical models are given in Table 3 for each station. In Arroyo Seco and Santa Monica stations, the four-input ANN1 model performs better than the CIMIS Penman model. In Salinas North and Santa Barbara stations, however, CIMIS Penman shows better accuracy than the ANN1 model. The main reason of this maybe the fact that the statistical properties of the data of the stations (King-City Oasis Rd. and San Luis Obispo) used in the calibration of ANN models are different with those of the stations used for testing (see Table 1). M5Tree1 models, however, perform better than the CIMIS Penman in all stations. In Arroyo Seco Station, the ANN1 model has better ET 0 estimates than the M5Tree1. Comparison of the two-parameter models indicates that the ANN2 model is better than the Hargreaves, Ritchie, and Turc methods in Salinas North and Santa Monica stations while the Ritchie model performs better than the ANN2 model in Arroyo Seco and Santa Barbara stations. Two-parameter M5Tree2 models generally provide inferior results compared to Hargreaves and Ritchie models. Among the two-input models, the Hargreaves is ranked as the second best in Santa Barbara Station. As an example for each region, the estimates obtained by ANN, M5Tree, and empirical models for the Arroyo Seco and Santa Barbara stations are shown in Figs. 5 and 6. It is clear from the figures that the four-input models give less scattered estimates than the two-input models. The superior accuracy of M5Tree model to the ANN in Santa Barbara Station can be clearly seen from the scatterplots. Turc model gives inaccurate estimates for both stations.

Table 3 The test statistics of the ANN, M5Tree, and empirical models in the second application
Fig. 5
figure 5

The FAO-56PM and estimated ET 0 values by the ANN, M5Tree and empirical models in test period—Arroyo Seco Station

Fig. 6
figure 6

The FAO-56PM and estimated ET 0 values by the ANN, M5Tree, and empirical models in test period—Santa Barbara Station

In the third part of the study, the ANN and M5Tree models were calibrated using the data of three stations located in San Joaquin region and obtained models were tested using the data from other three stations located in the Southern region. Test results of the ANN, M5Tree, and empirical models for each station are provided in Table 4. It is apparent from the table that four-input ANN1 and M5Tree models show better accuracy than the CIMIS Penman in San Luis Obispo and Santa Monica stations, respectively. The M5Tree1 models perform better than the ANN1 models in Santa Monica and Santa Barbara stations. On the other hand, two-input ANN2 model performs better than the Hargreaves, Ritchie, and Turc models in San Luis Obispo and Santa Monica stations while the M5Tree2 model provides worse accuracy than the Hargreaves and Ritchie models in Santa Monica and Santa Barbara stations. The main reason of the poor generalization ability of the ANN and M5Tree models is maybe the fact that the stations located in San Joaquin region have different statistical properties than those of the tested stations located in the Southern region (see Table 1). As an example, the estimates of the ANN, M5Tree, and empirical models are compared in Fig. 7 for the Santa Monica Station. It is clear from the figure that the M5Tree1 has less scattered estimates than the ANN1 and empirical models. All these results indicate that the ANN and M5Tree models should be carefully used without local calibration. M5Tree1 model may be a better alternative to ANN1 model in case of a lack of local climatic data.

Table 4 The test statistics of the ANN, M5Tree, and empirical models in the third application
Fig. 7
figure 7

The FAO-56PM and estimated ET 0 values by the ANN, M5Tree, and empirical models in test period—Santa Monica Station

6 Concluding remarks

The generalization ability of ANN and M5Tree in modeling ET 0 has been investigated in this study. The daily climatic data from King-City Oasis Rd., Arroyo Seco, and Salinas North stations located in San Joaquin region and San Luis Obispo, Santa Monica, and Santa Barbara stations located in the Southern region were used as inputs to the ANN and M5Tree models for estimating ET 0 obtained using the standard FAO-56 Penman–Monteith equation. Three different applications were employed in the study. In the first application, the ANN and M5Tree models were tested using each station’s data and results were compared with the empirical methods. The comparison results revealed that the ANN and M5Tree models performed better than the empirical models. In the second application, the ANN and M5Tree models were calibrated using climatic data of one station and tested by using the data of the other two stations for each region. Four-input ANN1 model performed better than the CIMIS Penman model in Arroyo Seco and Santa Monica stations while the M5Tree1 model provided better accuracy than the CIMIS Penman in all stations. The Hargreaves, Ritchie, and Turc models showed better accuracy than the two-input ANN2 model in Arroyo Seco and Santa Barbara stations. M5Tree2 model generally provided inferior results in ET 0 estimation compared to Hargreaves and Ritchie models. In the third application, three stations located in San Joaquin region were used for calibration of ANN and M5Tree models and obtained models were tested by the data of other three stations located in Southern region. Four-input ANN1 and M5Tree1 models were found to be better than the CIMIS Penman in San Luis Obispo and Santa Monica stations, respectively. Two-input ANN2 model performed better than the Hargreaves, Ritchie, and Turc models in San Luis Obispo and Santa Monica stations while the M5Tree2 model provided worse accuracy than the Hargreaves and Ritchie models in Santa Monica and Santa Barbara stations. The study indicated that the ANN and M5Tree models calibrated without local climatic data can give poor estimates in some cases and should be carefully employed in modeling ET 0 . M5Tree1 can be used as a better alternative to ANN1 in the case without local input and output data.