Introduction

Water loss through evaporation constitutes the main part in the water balance of a catchment, reservoir, and lake (McMahon et al. 2013), and must be estimated as accurately as possible. Exact quantification of the evaporation shall constitute a principal requirement for many applications, among them water balance studies, hydrological modeling, and irrigation scheduling (Cahoon et al. 1991). Over the years, several studies conducted worldwide have shown that direct measurement of evaporation using pan evaporimeter (Epan) is a commonly used approach for estimating the evaporation rate. Undoubtedly, direct measurement is a best and useful method. However, despite its importance, several alternative indirect empirical models were proposed and have been widely studied and they have a proven effectiveness (e.g., the Penman, Stephens-Stewart, and Hargreaves and Samani models). The use of indirect empirical models for evaporation estimation is often associated by the need of a large number of climatic variables. Unfortunately, except the temperature-based methods, that needs generally fewer climatic variables as input, the large amount of input variables were required for applying the empirical and semi empirical models reflects the difficulties which were needed to be overcome by using alternative methods, offered by artificial intelligence (AI), which relate the Epan to several climatic variables.

A number of researchers have been demonstrated the successful application of several AI models for modeling and predicting Epan using several climatic variables as input. Sebbar et al. (2019) proposed a new model for predicting daily Epan using four daily climatic variables from two stations in Algeria, which is based on the extreme learning machine (ELM) approach. They have applied and compared (i) the optimally pruned extreme learning machine (OPELM); and (ii) the online sequential extreme learning machine (OSELM) models. Minimum and maximum air temperatures (Tmin and Tmax), wind speed (U2), and relative humidity (H%) have been employed to predict Epan measured using class A evaporation pan, and the obtained result agrees well with the measured values with a coefficient of correlation (R) between 0.800 and 0.872 using the OSELM, and between 0.808 and 0.853 using the OPELM. Feng et al. (2018) applied three AI and two empirical models in predicting monthly Epan in China. The proposed AI models were (i) ELM, (ii) multilayer perceptron neural network (MLPNN) optimized by particle swarm optimization (MLPNN-PS), (iii) and MLPNN optimized by genetic algorithm (MLPNN-GAANN), and the obtained results were compare to those provided by the Stephens and Stewart (SS) and the Penman empirical models. From the obtained results, they reported that the best accuracy was achieved using the ELM model with average relative root mean square error (RRMSE) and mean absolute error (MAE) of 12.5–15.2% and 11.7–19.9 mm, respectively. In the same year, Lu et al. (2018) presented a new kind of models based on the tree-based machine learning (TBM) models: (i) M5 model tree (M5Tree), (ii) random forests (RFs), and (iii) gradient boosting decision tree (GBDT). In addition, the authors compared the obtained results using the TBM models with those provided by four empirical equations. All the models were applied and compared using data at daily time step in China. The more accurate prediction by the TBM models was obtained using the GBDT model compared to the M5Tree and RFs with root mean square error (RMSE), mean bias error (MBE), and Nash-Sutcliffe efficiency (NSE) of 0.86 mm, 0.07 mm, and 0.68, respectively. Regarding the empirical model, the authors reported that the Priestley-Taylor model was the most accurate model and the Trabert model performed worst. Another new kind of AI model was introduced by Eray et al. (2018). They applied at the first time an evolving connectionist systems (ECoS) called dynamic evolving neural-fuzzy inference systems named (DENFIS) for modeling monthly Epan using data from two stations in Turkey. Compared to another AI model, the multi-gene genetic programming (MGGP), DENFIS performed best in one station while the MGGP performed best into the second station. More recently, hybrid models evolutionary algorithms have been successfully applied for modeling Epan. For example, Ghorbani et al. (2018) combined the firefly algorithm (FFA) with the standard MLPNN and a hybrid model called MLP-FFA was employed for modeling daily Epan in the arid regions of Iran. From the obtained results, the authors reported that MLP-FFA was more accurate than the MLPNN and support vector machine models (SVM) for both tested stations, with Willmott’s index of agreement (WI), NSE, and RMSE ranged from 0.926 to 0.976, 0.791 to 0.922, and 1.007 mm to 1.406 mm, respectively. Shiri (2019) compared adaptive neuro-fuzzy inference systems (ANFIS) and multilayer perceptron neural network (MLPNN) for modeling daily Epan using data from four stations in the USA. Qasem et al. (2019) compared four AI models for predicting monthly Epan using data from two stations in Iran and Turkey. The proposed models were the MLPNN, SVM, and combination of them with wavelet transforms (WSVR and WANN). According to the obtained results, the authors demonstrated that Epan was highly related to the temperature and solar radiation and on the other hand, the wavelet decomposition does not contribute to the overall improvement of the models performances.

Different AI models for Epan can be found in the literature, including radial basis function neural network (Allawi and El-Shafie 2016), support vector machine (Rezaie-Balf et al. 2018), modified response surface method (Keshtegar and Kisi 2016, 2017), least square support vector regression (Wang et al. 2017), co-active neuro-fuzzy inference system (Malik et al. 2017), minimax probability machine regression (Deo et al. 2016), generalized regression neural networks (Kim and Kim 2008), linear genetic programming (Guven and Kisi 2011), gene-expression programming (Shiri and Kisi 2011), ANFIS (Shiri et al. 2011), evolutionary neural networks (Kisi 2013), fuzzy genetic approach (Kisi and Tombul 2013), and multivariate adaptive regression splines (Kisi 2015). Hence, it is clear that significant efforts to predict daily and monthly pan evaporation using AI models have been carried out in the last two decades. The review of the literature reveals that models based on response surface method were extremely rare. To the best of our knowledge, only the investigations conducted by Keshtegar and Kisi (2016, 2017), no other studies, have reported an application of the response surface method (RSM) for modeling evaporation. The purpose of this paper is to develop a hybrid model that combines the standard support vector regression (SVR) and the RSM in order to develop a hybrid model called SVR_RSM. The present study is based on measured data at monthly time step in Algeria. The suitability of the proposed hybrid method is analyzed in terms of prediction accuracy for estimating monthly Epan, and the obtained results were compared to those obtained using MLPNN, SVR, and RSM models.

Materials and methods

Case study

In this study, six sites were selected to develop the proposed models: Ain Dalia, Beni Haroun, Bouhamden, Chaffia, El Agram, and Zit Emba. The sites are located in northern East of Algeria. The spatial location of these stations is shown in Fig. 1. Table 1 shows the coordinates of these sites with respect to latitude, longitude, time period of record, and the total pattern used for developing the models. Evaporation pan (Epan), which is the predicted variable, was measured using the class A evaporation pan. In addition, data on measured, maximum and minimum temperatures (Tmax, Tmin), wind speed (U2), and relative humidity (H %) were selected as input variables. Table 2 reports the various input combinations for prediction of monthly Epan. The data for all stations were divided into two subset; training and validation subsets, which correspond to 70% and 30%, respectively. Six combinations of input variable were used in the MLPNN, SVR, RSM, and SVR_RSM models to predict the monthly Epan (Table 2). It is clear from Table 2 that the periodicity (α) that corresponds to the month number from 1 to 12 is included in the all combination. Combinations 4 to 6 include only two climatic variables in addition to α, while combination 2 is the unique combination for which the periodicity is not included. A model without wind speed was also considered as one of the combinations (combination 3).The descriptive statistics of the climatic variables and the Epan are presented in Table 3, where Xmean, Xmax, Xmin, Sx, Cv, and R denote the mean, maximum, minimum, standard deviation, coefficient of variation, and coefficient of correlation with Epan, respectively.

Fig. 1
figure 1

Geographical locations of the six weather stations across Algeria

Table 1 Data set presentation for all stations
Table 2 The six input combinations of different models
Table 3 Statistical parameters of the used data sets for all stations

Modeling methods

Artificial neural network

Artificial neural network (ANN) is a powerful modeling tool which is extended based on biological nervous system (Pathirage et al. 2018). Predicting data, classifying database, or performing pattern can be applied based on ANNs, and also in complex modeling events such as hydrology, reliability analysis, structural design, chemical process, environmental problems, and medicinal patterns. The multilayer perceptron neural network (MLPNN) is a popular well-known ANNs algorithm using train-based optimization methods. The MLPNN is usually structured by an input, one or more hidden and one output layers, that accuracy of prediction using this model strongly depended on the number of neurones in each hidden layer. In the first stage, the input layer is connected using relative weights to the neurones in first hidden layer (φ1) by the following relation:

$$ {\varphi}_j^1={f}_j^1\left({b}_j^1+\sum \limits_{i=1}^n{w}_{ij}^1{x}_i\right) $$
(1)

where \( {\varphi}_j^1 \) represents jth neuron of hidden layer 1, \( {b}_j^1 \) represents the bias of the jth node in hidden layer 1, \( {w}_{ij}^1 \)denotes weights to connect the jth node of hidden layer 1, and ith input node of input layer with n number of input nodes xi, i = 1, 2,…, n. f is the activation function, for which the sigmoid function f is commonly utilized as follows:

$$ {f}_j^1={\left[1+\exp \left[-\left({b}_j^1+\sum \limits_{i=1}^n{w}_{ij}^1{x}_i\right)\right]\right]}^{-1} $$
(2)

The second hidden layer is built using the first hidden layer as input data as well as the first hidden layer by the following relation:

$$ {\varphi}_j^2={f}_j^2\left({b}_j^2+\sum \limits_{i=1}^M{w}_{ij}^2{\varphi}_i^1\right) $$
(3)

where \( {\varphi}_j^2 \) represents jth neuron of hidden layer 2, \( {b}_j^2 \)and \( {w}_{ij}^2 \) respectively denote the bias of the jth node in hidden layer 2 and weights which connect jth node of hidden layer 2 to ith node of hidden layer 1 with M elements φii = 1,2,…, M and \( {f}_j^2 \) is the activation function in hidden layer 2 as follows:

$$ {f}_j^2={\left[1+\exp \left[-\left({b}_j^2+\sum \limits_{i=1}^n{w}_{ij}^2{\varphi}_i^1\right)\right]\right]}^{-1} $$
(4)

The predicted function for pan evaporation is applied to connect the input database on hidden layer 2 and output neuron as (Epan) which is expressed as below:

$$ \hat{E}=b+\sum \limits_{i=1}^h{w}_i{\varphi}_i^2 $$
(5)

where h represents number of hidden nodes in layer 2, b is the bias of the output layer, and wj represent the connection weights between the output node and to jth neuron in hidden layer 2. \( {\varphi}_i^2 \) is ith hidden node in layer 2 which is computed based on Eq. (2) with sigmoid active function.

Generally, back-propagation learning tools-based optimization approaches can provide the suitable connections between input and output data (Kurt and Kayfeci 2009). By using a random weight as initial weights, the optimum weights and biases are generally searched using a mathematical optimization method as gradient, conjugate gradient, or Newton approaches (Dao and Vemuri 2002). In the present study, the learning approach to obtain the nonlinear relation between the input and output variables was applied using Levenberg-Marquardt algorithm (Fun and Hagan 1996). The randomly weights are adjusted after each iteration process. In this study, the number of neuron in the two hidden layers is explored using mean squared errors (MSE) by trial and error to give the best connection between input layer and output layer with one neuron of pan evaporation. In Fig. 2, a MLPNN with two hidden layers and output layer with one neuron is presented which is applied in the current study.

Fig. 2
figure 2

Schematic view of the MLPNN model for calibrating pan evaporation with structure n-M-h-1

Response surface method

The response surface method (RSM) is a modeling-based mathematical simple tool with low computational burden to predict the engineering problem. The mathematical relation of this model using second-order polynomial functions is presented as follows (Keshtegar and Heddam 2018; Heddam et al. 2019; Keshtegar et al. 2019b):

$$ \hat{E}={w}_0+\sum \limits_{i=1}^n{w}_i{x}_i+\sum \limits_{i=1}^n\sum \limits_{j=1}^n{w}_{ij}{x}_i{x}_j $$
(6)

where \( \hat{E} \)is the predicted pan evaporation using n - input data, and wi and wij are connected weights between the polynomial functions and the observed data with bias w0. In the RSM, the polynomial nodes are directly computed using input data (x) with linear, second order and cross terms. In the modeling process of the RSM, N nodes, i.e., N = n (n + 1)/2 is applied using one hidden layer which is computed using the input layer elements. The schematic view of this model is presented in Fig. 3. Commonly, the weights and bias of RSM is computed using last square estimator (Keshtegar et al. 2018; Keshtegar and Seghier 2018).

Fig. 3
figure 3

Schematic view of the RSM model

Support vector regression

The nonparametric modeling approaches can be applied to predict the performances of complex real engineering problems. The learning theory basis the support vector machines is a powerful intelligence tool for regression (SVR) and classification problems (Brereton and Lloyd 2010). Consequently, the SVR modeling approach as a nonlinear model to provide the suitable relation between the pan evaporation and climatological data can be used to predict these complex environmental problems by the following model:

$$ \hat{E}=b+\sum \limits_{i=1}^N{w}_iK\left(x,{x}_i\right) $$
(7)

where b is bias and K(x, xi) represents the Kernel function which is transferred the n-input database from X- space into N-dimensional feature space. Generally, Gaussian kernel function is used for transferring the input data as follows (Brereton and Lloyd 2010):

$$ K\left(x,{x}_i\right)=\exp \left(-\frac{{\left\Vert x-{x}_i\right\Vert}^2}{2{\sigma}^2}\right) $$
(8)

where σ is the kernel parameter which provides the smoothness of the Kernel function; it is given as σ = 8 in this study. In Eq. (7), wi is the weight to connect the predicted pan evaporation and input random data basis in feature space which is computed using two slack variables \( {\xi}_i,{\xi}_i^{\ast } \) by the following optimization problem (Lu 2014):

$$ {\displaystyle \begin{array}{c}\operatorname{Minimize}\frac{{\left\Vert w\right\Vert}^2}{2}+C\sum \limits_{i=1}^N\left({\xi}_i+{\xi}_i^{\ast}\right)\\ {}\mathrm{Subjected}\kern0.17em \mathrm{to}\left\{\begin{array}{c}{y}_i-<w.K\left(x,{x}_i\right)>-b\le \varepsilon +{\xi}_i\\ {}<w.K\left(x,{x}_i\right)>+b-{y}_i\le \varepsilon +{\xi}_i^{\ast}\\ {}{\xi}_i,{\xi}_i^{\ast}\ge 0\end{array}\right.\end{array}} $$
(9)

In which, factor C ≥ 0 is the regularization coefficient which is given as C = 300 in this study, and ε is insensitive loss function which is given asε = 0.15. The ε- insensitive loss function is used to neglect the calibrating process-based SVR when differences between the predicted and observed pan evaporation are less than ε. The SVR model is schematically shown in Fig. 4a that the structure-based prediction using nonlinear Kernel function is presented in Fig. 4b with input data set (x). By applying the Karush-Kuhn-Tucker (KKT) condition, the optimum parameters of Eq. (9) can be computed using the Lagrange optimization model in regression process as below (Thissen et al. 2004):

$$ {\displaystyle \begin{array}{c}\operatorname{Maximize}-\frac{1}{2}\sum \limits_{i,j=1}^N\left({\alpha}_i-{\alpha}_i^{\ast}\right)\left({\alpha}_j-{\alpha}_j^{\ast}\right)K\left({x}_i,{x}_j\right)-\varepsilon \sum \limits_{i=1}^N\left({\alpha}_i-{\alpha}_i^{\ast}\right)+\sum \limits_{i=1}^N{E}_i\left({\alpha}_i-{\alpha}_i^{\ast}\right)\\ {}\mathrm{Subjected}\kern0.17em \mathrm{to}\left\{\begin{array}{c}\sum \limits_{i=1}^N\left({\alpha}_i-{\alpha}_i^{\ast}\right)=0\\ {}0\le {\alpha}_i\le C\\ {}0\le {\alpha}_i^{\ast}\le C\end{array}\right.\end{array}} $$
(10)
Fig. 4
figure 4

Schematic view of SVR model: a structure of SVR and b calibrating data with ɛ-insensitive loss function

where αi and \( {\alpha}_i^{\ast } \) are Lagrange multipliers. Thus, the predicted function-based SVR is given as follows:

$$ \hat{E}=b+\sum \limits_{i=1}^N\left({\alpha}_i-{\alpha}_i^{\ast}\right)K\left(x,{x}_i\right) $$
(11)

As seen, the weight is computed using the Lagrange multipliers as \( {w}_i={\alpha}_i-{\alpha}_i^{\ast } \) which depended on the parameters of the SVR asσ, ε, and C.

Hybrid model using SVR and RSM

In this study, the parameters of the SVR are selected as the constant values, while optimization processes can be applied to search the acceptable parameters for improving the accuracy of the predicted pan evaporation. Applying the optimization process for modeling using the SVR is a good strategy but it is a time-consuming algorithm with more computational burden. Consequently, the main effort in this study is to improve the predictions of SVR for calibrating pan evaporation with constant SVR parameters in order to reduce the computational burden of modeling process. The accuracy of the predicted model can be improved based on the applied input database as wavelet functions. Nevertheless, the effect of the input data can be controlled by filtering them using a mathematical relation based on the RSM. Generally, the AI-based data-driven using two-step calibration is a strategy to improve accuracy of prediction. This strategy is recently applied to enhance the RSM as modified RSM (Keshtegar and Kisi 2017) and multi-layer RSM (Kowsar et al. 2019). Using two calibrating process for improving the performance of the RSM approach, Keshtegar and Seghier (2018) applied the hybrid model for the burst pressure of corroded pipes and for solving hydrological problems (Keshtegar and Kisi 2017). This strategy is used to modify the prediction of SVR with hybrid modeling approach using RSM in first calibrating process and SVR in second calibrating process. Consequently, the nonlinear relations-based two modeling approaches may improve the accuracy predictions of pan evaporation.

These modeling methods SVR and RSM were combined using three basic layers in SVR_RSM model as the input data nodes in first layer, modeling nodes using RSM in second layer, and predicted pan evaporation using SVR in third layer. In the second and third layers, two modeling approaches of RSM and SVR are used to calibrate the pan evaporation. Generally, the nodes in the second calibrating process are computed using the predicted results of RSM with two individual input variables which are given from input database. Therefore, the RSM with cross terms is used in the first calibrating database on second layer as below relation:

$$ {\varphi}_m={w}_0+{w}_1{x}_{n-1}+{w}_2{x}_n+{w}_3{x}_{n-1}^2+{w}_4{x}_n^2+{w}_5{x}_{n-1}{x}_n $$
(12)

where w0-w5 are weights which represent the connection of the input databases xn − 1 and xn with the data of node m. The total number of nodes in the second layer which are obtained using Eq. (12) with n input data is \( m=\frac{n!}{2!\times \left(n-2\right)!} \), where (!) represents the factorial operator. Thus, it can be provided m predicted nodes-based RSM that this dataset is used to calibrate SVR in third layer by the following relation:

$$ \hat{E}=b+\sum \limits_{i=1}^N{w}_iK\left(\varphi, {\varphi}_i\right) $$
(13)

where b is bias and K(φ, φi) represents the Kernel function which transferred the data-based predicted RSM from the real-space into feature-space. Commonly, the Gaussian Kernel function as well as the original SVR with parameter of σ = 2 while the other SVR in this hybrid modeling approach are given as ε = 0.1 and C = 300. The structure of the SVR_RSM is presented in Fig. 5. As seen from Fig. 5, the SVR_RSM was structured using two nonlinear models of RSM combined by SVR which is calibrated using input data handing by RSM predictions. The SVR predictions depended on the input dataset which are provided using RSM. Consequently, the accuracy prediction can be affected on agreement and accuracy of SVR. The cross-linear correlation of input variables is considered in first calibrating process using RSM, while the nonlinear correlation between the input data and the observed pan evaporation are given using SVR in the second calibrating process by using m predicted nodes of RSM database. This calibrating model-based two phases may improve the prediction of monthly pan evaporation based on the following steps:

  1. Step 1:

    Input dataset as train and test data points

  2. Step 2:

    Separate different two data sets form training database

  3. Step 3:

    Calibrate the nodes of first stage using the RSM with 2-set original input variables which are given from step 2

  4. Step 4:

    Give the input database from data provided by RSM in step 3 for SVR as the input database in the training phase

  5. Step 5:

    Set the parameters of SVR

  6. Step 6:

    Calibrate SVR using the parameters in step 5 and input data in step 4

  7. Step 7:

    Predict the data using hybrid intelligent model by using steps 3–6 for test data point

Fig. 5
figure 5

Schematic view of SVR_RSM model

The SVR_RSM model using two calibrating processes may provide highly correlated between the observed and predicted pan evaporations. Consequently, it can provide the accurate predictions and is defined as a robust modeling approach.

Comparative statistics

Four models such as MLPNN, RSM, SVR, and SVR_RSM are used for nonlinear modeling of pan evaporation that their predictions were compared using five comparative statistics which are used to compare the accuracy and the agreement between the predicated and observed datasets. The statistics are root mean square error (RMSE), mean absolute error (MAE), modified index of agreement (d), coefficient of correlation (R), and modified Nash and Sutcliffe efficiency (NSE) with below relations (Keshtegar et al. 2019a, b):

$$ R=\left[\frac{\frac{1}{\mathrm{N}}\sum \left({E}_i-{E}_m\right)\left({\mathrm{P}}_i-{\mathrm{P}}_m\right)}{\sqrt{\frac{1}{\mathrm{N}}\sum \limits_{i=1}^n{\left({E}_i-{E}_m\right)}^2}\sqrt{\frac{1}{\mathrm{N}}\sum \limits_{i=1}^n{\left({\mathrm{P}}_i-{\mathrm{P}}_m\right)}^2}}\right] $$
(14)
$$ d=1-\frac{\sum \limits_{i=1}^N\left({E}_i-{P}_i\right)}{\sum \limits_{i=1}^N\left(\left|{\mathrm{O}}_i-{E}_m\right|+\left|{\mathrm{P}}_i-{E}_m\right|\right)} $$
(15)
$$ \mathrm{NSE}=1\hbox{-} \frac{\sum \limits_{i=1}^N\left[{E}_i-{P}_i\right]}{\sum \limits_{i=1}^N\left[{E}_i-{E}_m\right]} $$
(16)
$$ \mathrm{RMSE}=\sqrt{\frac{1}{\mathrm{N}}\sum \limits_{i=1}^{\mathrm{N}}{\left({E}_i-{\mathrm{P}}_i\right)}^2} $$
(17)
$$ \mathrm{MAE}=\frac{1}{N}\sum \limits_{i=1}^N\left|{E}_i-{\mathrm{P}}_i\right| $$
(18)

Where N is number of observed datasets Ei and Pi are observed and predicted pan evaporation for ith data point, respectively. Oi and Pi are the mean of observed and calculated data. The nonlinear model is better than the other modeling techniques when its comparative statistics are computed for MAE, RMSE values tended to zero and other agreement indexes as R, d, and NSE tended to 1.

Results and discussion

In this section, the MLPNN, SVR, RSM, and SVR_RSM algorithms were validated on the six dataset described in the previous section. The correspondence between the measured and model estimates of Epan are reported in Tables 4, 5, 6, 7, 8, and 9, in terms of RMSE, MAE, R, NSE, and d, during the training and validations phases. Hereafter, we focused our discussion on the results obtained during the validation phase. At Ain Dalia station (Table 4), the estimation of monthly Epan was robust, having RMSE ranging from 1.523 to 2.218 mm, MAE ranging from 1.094 to 1.651 mm, and correlation coefficient (R) ranged from 0.774 to 0.891, based on the six MLPNN models. Estimates based on NSE and d indexes were almost equally accurate results with NSE ranged from 0.508 to 0.768, and d ranged from 0.780 to 0.925. Table 4 shows that high variability of models performances has been observed between the first five MLPNN models (MLPNN1 to MLPNN5) and the sixth model (MLPNN6). These differences in accuracy of models could possibly be due to the exclusion of the relative humidity (H %) from the input variables of the MLPNN6 model. Among the six MLPNN models (Table 4), the MLPNN1 has the highest accuracy with lower RMSE and MAE, and high NSE and d values. The MLPNN1 has R value of 0.889, NSE of 0.768, and d of 0.922, when evaluated using the validation data. Predictor variables selected by this model included the four original climatic variables (Tmax, Tmin, H%, U2) coupled with the periodicity (the month number: α). Finally, the R, NSE, and d values were further improved to 0.891, 0.757, and 0.919 when H% was used (MLPNN5) instead of U2 (MLPNN6), and model errors as measured using RMSE and MAE were reduced by 29.75% and 32.89%, respectively. Table 4 indicates better error statistics in terms of MAE (1.455 mm) and RMSE (1.846 mm), and higher R (0.813), NSE (0.659), and d (0.890) values, using the RSM6 model against (R = 0.774), (NSE = 0.508), and (d = 0.780) obtained suing the MLPNN6 model. In addition, it is clear from Table 4 that using fewer input variables, the RSM model was able to provide high accuracy compared to the MLPNN model. Results clearly indicated that the MAE and RMSE of the RSM4 was decreased from 1.574 to 1.514 (2.6%) and from 1.153 to 1.123 (3.8%), respectively, compared to the MLPNN4. Overall, when comparing the RSM and the MLPNN models, it is clear that there was a strong relationship between the number of input variables and the accuracy of the models. The relationship varied, however, significantly for the MLPNN model, which means that RSM is more suitable for building robust models using only fewer inputs. The RSM6 with only Tmin, U2, and α as input variables improved the R, NSE, and d of the MLPNN6 by 3.9%, 11%, and 15.1%, respectively, and decreasing the values of the RMSE and MAE by 11.87% and 16.77%, respectively. According to Table 4, high accuracy was obtained using SVR models compared to the MLPNN and RSM models, for all the six input combination. The Epan showed good correlation between measured and calculated values using the SVR models. This class of models was characterized by R, NSE, and d values varying from 0.859 to 0.899, 0.734 to 0.804, and from 0.917 to 0.939, higher than the values obtained using the MLPNN and the RSM models. SVR1 exhibited a decrease in RMSE and MAE value by 8.78% and 0% compared to the MLPNN1, and by 9.67% and 1.46% compared the RSM1, respectively. The good accuracy of the SVR models is obvious, especially when comparing the models having a fewer input variables. SVR6 decreased the RMSE and the MAE of the MLPNN6 by 26.42% and 26.95%, respectively, and by 11.59% and 16.70% compared to the RSM6, respectively. The proposed hybrid method of SVR_RSM produces high nonlinear mapping of Epan which are substantially higher than the three other models, as stated in Table 4. This can be nearly always advantageous, especially in the situation where the input variables were selected smaller. As can be seen, using only the Tmax, H%, and the periodicity (α), an R, NSE, and d values of 0.903, 0.805, and 0.923 were achieved, which were not exhibited by any of the other models. An RMSE of 1.395 mm was achieved using the SVR_RSM5 model less than all the values provided by the MLPNN5, RSM5, and SVR5 models. Similarly, the hybrid SVR_RSM1 model generated with all input variables has produced an R of 0.920, NSE of 0.839, and d of 0.951 (Table 4) which are higher than the values provided by the MLPNN1, RSM1, and SVR1, models. Both SVR_RSM4 and SVR_RSM5 yielded similar accuracy in terms of all the five statistical indexes, and slightly higher than the SVR_RSM3. Performance characteristics also differ between SVR_RSM1 and SVR_RSM2, with and without periodicity (α). Generally, the SVR_RSM2 without periodicity (α) yielded higher RMSE (1.374 mm) compared to (1.270 mm) achieved using the SVR_RSM1. Of the six input combination, the SVR_RSM6 had the largest RMSE value (1.616 mm). Moreover, the SVR_RSM6 generally yields the highest MAE (1.190 mm) and the lowest R, NSE, and d values.

Table 4 Performances of different models in modeling monthly Epan at Ain Dalia Station
Table 5 Performances of different models in modeling monthly Epan at Beni Haroun Station
Table 6 Performances of different models in modeling monthly Epan at Bouhamden Station
Table 7 Performances of different models in modeling monthly Epan at Chaffia Station
Table 8 Performances of different models in modeling monthly Epan at El Agram Station
Table 9 Performances of different models in modeling monthly Epan at Zit Emba Station

Results at Beni Haroun station are reported in Table 5. The statistics indexes given in Table 5 show how well Epan can be estimated from the climatic variables using the proposed models. The results obtained using the four models showed generally strong relationships between measured and calculated Epan values (Table 5). The low RMSE and MAE values indicate small variations between measured and estimated Epan for the training and validation data set. Using all the four climatic variables in addition to the periodicity, the hybrid model SVR_RSM1 provided slightly better prediction results (R = 0.954 and NSE = 0.907) and the lowest errors indexes (RMSE = 1.095 mm and MAE = 0.745 mm). Overall accuracy results were high for all the proposed models and the SVR_RSM1 produced the highest overall accuracy, followed by SVR1, and the RSM1 performed best compared to the MLPNN1. In summary, the statistics performance calculated between measured and predicted Epan values showed that both SVR1 and SVR_RSM1 methods similarly perform with accuracy slightly higher than the RSM1 model, and largely higher than the MLPNN1 model. The performance differences between SVR1 and SVR_RSM1 methods, however, are not large and therefore, Epan based on climatic variables can be predicted very well by the models. Using only fewer input variables, the SVR_RSM6 showed the best performances among the other models, with overall R and NSE of 0.944 and 0.882, respectively, RMSE of 1.232 mm, and MAE of 0.938 mm. For comparison, the RSM6 and SVR6 performed with equal accuracy, while MLPNN6 modeled Epan reasonably and slightly less than the SVR_RSM6 (Table 5). When the periodicity (α) was excluded from the input variables, the RMSE and MAE of the SVR_RSM2 decreased by 13.11% and 25.97% compared to the SVR2, by 13.56% and 17.22% compared to the RSM2, and by 13.11% and 22.337% compared to the MLPNN2, respectively. In summary, MLPNN2 and SVR2 performed similarly in comparison with RSM2 and SVR_RSM2. RSM2 produces relatively large RMSE error (1.364 mm) slightly higher than the values calculated using MLPNN2 and SVR2 (RMSE = 1.357 mm). SVR_RSM2 shows improved performance estimates compared to RSM2, showing the best performance among the three models, indicated by the model performance statistics in Table 5. Nonetheless, the use of the data-driven models without hybridization did not result in substantial differences with respect to the statistical indexes, and only the hybrid model was characterized by the strongest accuracy.

Results at Bouhamden Station are reported in Table6. According to Table 6, the models performance varied significantly with R values ranging from 0.927 to 0.943, NSE values ranging from 0.851 to 0.882, and the d values ranging from 0.958 to 0.970, for the MLPNN models. The SVR_RSM1 model was able to predict Epan much better than the SVR1 (R = 0.961 versus R2 = 0.936). The RSM1 model performs better than the MLPNN1 (R = 0.941 versus R = 0.936) and better than SVR1 (R = 0.941 versus R = 0.936). Overall, except the high performances exhibited using the hybrid SVR_RSM1 model (R =0.941, NSE = 0.915, and d = 0.976), the difference in R among the various approaches was not large and did not considerably affect the correlation between measured and estimated monthly evaporation (Epan) and no greatest variability was observed between the models performances. Although slightly more accurate than the SVR_RSM1, estimated statistical indexes values using the SVR_RSM5 (R = 0.963, NSE = 0.926, and d = 0.980) were only weakly higher than those provided by the SVR_RSM1(R = 0.961, NSE = 0.915, and d = 0.976). Consequently, the hybrid SVR_RSM models are usually characterized by fairly high capabilities for providing high models accuracy using only fewer input variables and are more highly suitable for modeling nonlinear process such as pan evaporation. The different input combinations had a greater impact on the statistical errors index (RMSE and MAE). Errors were consistently lower when fewer variables were used. Excluding the U2 from the input variables of the MLPNN2 model resulted in smaller gains in accuracies compared to the MLPNN3: (RMSE = 0.576 mm versus R = 0.653 mm) and (MAE = 0.576 mm versus MAE = 0.653 mm). Similarly, using only Tmax and Tmin as input of the RSM4 instead of the all four climatic variables (RSM2) slightly improves its performances, despite the smaller gains in accuracies: (RMSE = 0.746 mm versus R = 0.798 mm) and (MAE = 0.549 mm versus MAE = 0.574 mm). Using only the Tmin and U2% coupled with the periodicity (α), the difference in RMSE and MAE terms is maximized between RSM6 and RSM2, permitting an improved estimation of Epan: the RMSE and MAE were decreased by 19.98% and 23.24%, respectively. Finally, when comparing the performances of the two hybrid models: SVR_RSM5 and SVR_RSM2 approaches, results were not significantly different. RMSE and MAE do not change more than 4.16% and 1.55% between these two approaches.

The comparisons between calculated Epan using the proposed models and measured values at Chaffia station are presented in Table 7. For this station, it is clear that all the models performed very well, and the statistical indexes indicate an important finding: inclusion of the periodicity coupled with the four climatic variables do not help to improving the performances of the models; on the contrary, the accuracy of the models was decreased. From Table 7, when the periodicity (α) is excluded from the inputs variable, we obtained the best performances among the compared four models. Globally, exclusion of the α improves the performance of the MLPNN2 model by slightly increasing the values of the R, NSE, and d by 0.5%, 0.7%, and 1.7%, and decreasing the values of the RMSE and MAE by 8.44% and 1.9%, respectively, (Table 7, models MLPNN1 and MLPNN2). In addition, the RSM2 model (without α) improves the accuracy of the RSM1 model (with α) by increasing the R and d values by 0.3%, and decreasing the RMSE and MAE values by 1.51% and 1.82%, respectively. Nevertheless, by comparing the performances of the SVR2 model with the performances of the SVR1 model, we concluded that the periodicity (α) had only a marginal effect and the two models have similar overall performances; SVR2 model, improves the accuracy of the SVR1 model by increasing the R, NSE, and d values by 0.3%, 0.1%, and 0.1%, respectively, and decreasing the RMSE and MAE values by 2.39% and 5.33%, respectively. Finally, the hybrid SVR_RSM2 model is a particular case in which the obtained results reveal significant improvement of the models performances. While the R, NSE, and d statistical indexes appear to be relatively equal, the SVR_RSM2 model outperforms the SVR_RSM1 when looking to the MAE value: the hybrid SVR_RSM2 decreased the MAE of the SVR_RSM1 by 12.15%. Overall, among all the proposed models, SVR_RSM5 was found to produce the highest correlation between calculated and measured Epan (R = 0.968, NSE = 0.931, d = 0.981).

Results reported in Table 8 deals with the accuracy of the proposed models at El Agram station, and it is clear that the four proposed models were found to have a high performance in estimating Epan. By comparison, the hybrid SVR_RSM model has better performances, both during training or validation phase. Also, it is found that models with Tmax, Tmin, and the periodicity (combination 4, Table 8) were more powerful than the others in terms of prediction accuracy. Results of the accuracy assessment for the MLPNN models showed that MLPNN4 produced the best accurate estimation of Epan as shown by all statistical indexes. For comparison, MLPNN4 decreased the RMSE and MAE of the MLPNN1 by 10.03% and 3.53%, respectively. Also, it is clear from Table 8 that when Tmin was used as input variable for the MLPNN4 instead of U2 for model MLPNN6, the performances was increased, by increasing the R, NSE, and d values, and decreasing the RMSE and MAE values. MLPNN4 improves the accuracy of the MLPNN6 model by increasing the R, NSE, and d values by 3.6%, 2.9%, and 8.7%, respectively, and significantly decreasing the RMSE and MAE by 42.28% and 42.97%, respectively. Regarding the RSM models, RSM4 produced the best accurate estimation of Epan and contributed to the overall decrease in RMSE and MAE of the RSM1 by 18.76% and 19.31%, respectively. In addition, it is clear from Table 8 that RSM4 is less affected by removing H% (RSM5) than U2 (RSM6): the RMSE and MAE of the RSM4 was decreased by 19.90% and 16.81% compared to the RSM5, and by 33.63% and 32.13% compared to RSM6. A comparison of results obtained using the SVR models shows that Epan estimates based on the Tmax, Tmin, and α variable (SVR4, Table 8) are better than those from the other models. The performances of the SVR4 model are improved by an increase of R from 0.944 to 0.983, NSE from 0.880 to 0.958, and d from 0.965 to 0.988 respectively, compared to the SVR1 with all the four climatic variables. The SVR1 method shows slightly less error than the SVR2 (without α) in terms of RMSE (10.81%) and MAE (12.23) statistics, while it performs worse than the SVR3: the SVR3 decreased the RMSE and MAE of the SVR1 by 21.24% and 22.98%, respectively. Finally, it can be seen that the hybrid SVR_RSM4 models shows the best performance for the all six input combinations, owing to its smallest RMSE and MAE values and largest values of R, NSE, and d. The SVR_RSM4 model reduce the RMSE and MAE to nearly 0.436 mm and 0.359 mm, less than any values calculated using the other models, and the peak values of the R (0.989), NSE (0.970), and d (0.992) reached during the validation phase clearly underlines the superiority and the robustness of the hybrid approach. The SVR_RSM1 model had a comparable performance with SVR_RSM3, with slight superiority in favor to the SVR_RSM1. While results obtained from the SVR_RSM1 show an increasing performances compared to the SVR_RSM2, for which the periodicity is removed with an increasing in the R, NSE, and d by 3.5%, 2.5%, and 7.5%, respectively. Similarly, SVR_RSM5 performs well compared to the SVR_RSM2, but relatively poor compared to the SVR_RSM1 and SVR_RSM4.

Table 9 summarizes the results obtained using the six different input combinations for the four different models. In all input combination without exception, all the statistical indexes values show that the hybrid SVR_RSM is the best model. The comparison between measured Epan and the calculated values are reported as well. During the validation phase, the statistical indexes for different models shows that the RMSE and MAE are lowest for the SVR_RSM model (Table 9), and the best accuracy with lowest values was obtained using SVR_RSM1 (RMSE = 0.991 mm and MAE = 0.781 mm). In general, the agreement between measured and calculated values of Epan is good: the R, NSE, and d values are generally larger than 0.90, 0.88, and 0.94, respectively for most of the models, and the RMSE are reasonably low. When using only the Tmax and Tmin as the unique input variables coupled with the periodicity (SVR_RSM4), the R values is often close to 0.947 and the RMSE is very low (1.007 mm). Compared to the other models, SVR_RSM1 is more accurate, followed by SVR_RSM4 (RMSE = 1.007 mm and MAE = 0.832 mm) and SVR_RSM6 (RMSE = 1.047 mm and MAE = 0.814 mm) with relatively similar accuracy and a favor to the SVR_RSM4 when looking to the NSE value, and finally the SVR_RSM2 is ranked in the last place with the highest RMSE (1.371 mm) and MAE (1.161 mm) values, respectively. From Table 9, it is clear that using all the four climatic variables without periodicity reveal important finding: MLPNN2, RSM2, SVR2, and SVR_RSM2 were the worst models, for which the R, NSE, and d values have been decreased consistently with an increasing in the RMSE and MAE values. Removing the periodicity increased the MAE and RMSE of the MLPNN1, RSM1, SVR1, and the hybrid SVR_RSM1 model by (19.77% and 17.39%), (31.6% and 24.69%), (23.63% and 20.18%), and (32.73% and 27.72%), respectively; consequently, an important point should be distinguished: the periodicity is more important for the hybrid model compared to the other.

Finally, Scatterplot comparison between measured and calculated values of the Epan was plotted in Fig. 6. Figures 7 and 8 show the boxplots and the violin plots for measured and calculated Epan (mm) values using the SVR, RSM, MLPNN, and SVR_RSM models for all stations. For the violin plot, the two lines with black and red color display the mean and the median values of Epan. Figure 9 presents a Taylor diagram plot calculated against the measured data at the six climatic stations. In the diagram, the accuracy of the four best models was compared during the validation phase using the correlation coefficient and the standard deviation. In the diagram, the measured values are plotted using a red circle along the X-axis. At all stations, SVR_RSM performs better than the three other models, clear evidence that proposed hybrid model improves the accuracy of the standard SVR model. The statistics for SVR, MLPNN, and RSM models varied generally from one climatic station to another, though one of the models has a lower standard deviation or has a higher coefficient of correlation. In each station, the differences between the three models (RSM, SVR, and MLPNN) are much smaller than the differences between the three and the SVR_RSM. Consequently, the improvement in SVR_RSM model throughout the use of the hybrid approach is further demonstrated in Fig. 9.

Fig. 6
figure 6

Scatterplots showing the relation between the measured and calculated values of monthly evaporation (Epan) in the validation phase for all stations

Fig. 7.
figure 7

Boxplots of measured and calculated values of monthly pan evaporation (Epan) in the validation phase of all stations. The central mark is the median, the edges of the box are the 25th and 75th percentiles, and the whiskers correspond to the most extreme data points

Fig. 8
figure 8

Violin plots of measured and calculated monthly pan evaporation (Epan) in the validation phase of all stations. The two lines with black and red color display the mean and the median values of Epan

Fig. 9
figure 9

Taylor diagram displaying a statistical comparison of the four developed models with measured values of monthly pan evaporation Epan (mm). The green circles correspond to circumferences of equal centered normalized root-mean-square (NRMS) difference between measured and calculated Epan, the blue lines correspond to lines of equal correlation coefficients, and doted red circles correspond to circumferences of equal standard deviations

Conclusion

This study investigated the utility of using the response surface model (RSM) for calibrating the original support vector regression (SVR) in order to obtain a new hybrid model (SVR_RSM), applied for modeling monthly pan evaporation. Different scenarios based on several input combinations have demonstrated that the hybrid SVR_RSM model performs overwhelmingly better than the RSM, SVR, and the MLPNN models. One of the most important conclusions of the present study is that, the proposed hybrid model provides an effective and practical approach to estimate Epan using only fewer input variables. For example, the SVR_RSM model with only the Tmax and Tmin, coupled with the periodicity exhibited high accuracy with R, NSE, and d values of 0.989, 0.970, and 0.992 during the validation phase. Superiority of the SVR_RSM for predicting Epan was evident at the all six climatic stations. In addition, the MLPNN, RSM, and SVR models provided relatively different results, and the accuracy varied from one station to another depending on the input variables. Furthermore, inclusions of the periodicity to the input variables contribute to an improvement of the models performances, except at Chaffia station for which the performances was decreased. The results of the study were encouraging, but need to be extended to other stations.