1 Introduction

One of the most important components of the water resources is a lake, which is highly vulnerable to external influences, and basins surrounding it. Lake water levels are affected by climate change and human activity. Water used for precipitation, flow, evaporation, supply of drinking water, irrigation and energy leads to an increase or decrease in lake water level. Increased or decreased lake water level causes huge economic losses in the lake environments and even sometimes irreversible changes in nature (Cengiz and Kahya 2006).

Recognition of changes in lake level is of vital importance for planning and construction works, design works in lakes and the areas surrounding them, ensuring the management and productivity of agricultural lands, sustaining existing or acquired aquatic life, and the operation of wetlands which are of special significance and protected by various international treaties and drinking water facilities.

Lake level changes can be observed regularly by monitoring stations to be built at certain places or estimated using mathematical methods, such as a water balance model. Nevertheless, a variety of artificial intelligence methods, including artificial neural networks (ANN), support vector machines (SVM), adaptive network-based fuzzy inference system (ANFIS), fuzzy logic (FL), and wavelet transform (WT), which have been recently employed to estimate many hydrological parameters, such as precipitation (Ramana et al. 2013), temperature (Chithra et al. 2014), sediment (Goyal 2014), flow (Kisi et al. 2012a), are also commonly used to estimate lake water level (Altunkaynak 2007, 2014; Ondimu and Murase 2007; Chang et al. 2014; Karimi et al. 2013; Seo et al. 2015).

De Domenico et al. (2013) compared the success of chaos theory with that of the auto-regressive integrated moving average (ARIMA) model in estimating sea water level for daily, weekly, 10-day, and monthly time scales at the Cocos (Keeling) Islands. In conclusion, an acceptable performance was achieved in both models; however, the chaos theory model was more successful than the ARIMA model in forecasting. Altunkaynak (2007) used ANN and auto-regressive moving average (ARMA) models to estimate water level and determined that ANN models performed better than ARMA models. Khan and Coulibaly (2006) investigated the ability of SVM method in estimating the water level of Lake Erie and compared the results with those of multilayer perceptron (MLP) and seasonal auto-regressive (SAR) models. Kavehkar et al. (2011) used genetic programming (GP) model to estimate the change in the water level of Lake Urmia in Iran and compared with those of the ANN. Cimen and Kisi (2009) studied the potential of SVM and ANN techniques in modeling lake level fluctuations and found that the SVM model performed better than the ANN. Wei (2012) investigated the performance of wavelet SVMs and classical Gaussian SVMs for forecasting the hourly water levels. He found that wavelet SVMs outperform the Gaussian SVMs for water level predictions. Khatibi et al. (2014) investigated the performance of the models, including seasonal auto-regressive integrated moving average (SARIMA), ANN, gene expression programming (GEP), multiple linear regression (MLR), and nonlinear local prediction (NLP), to estimate water levels of six lakes with different physical characteristics. Kisi et al. (2012b) studied the usability of GEP, ANN, ARMA, and ANFIS techniques to estimate the daily water level of Lake Iznik, located in west of Turkey.

Other than the methods such as ANN, ANFIS, SVM, wavelet, fuzzy logic, scientists have recently developed the evolutionary optimization methods inspired by nature which include genetic algorithm (GA), particle warm optimization (PSO), ant colony optimization (ACO), artificial bee colony (ABC), memetic algorithm (MA), differential evolution (DE), artificial immune systems (AIS). These optimization algorithms developed on the basis of events in nature are called heuristic methods (Karaboga 2004). Disadvantages of conventional techniques such as early convergence, local minima, and high computational complexity have led to the use of heuristic methods in training artificial neural networks (Kennedy and Eberhart 1995). Hybrid models have been developed by using heuristic methods also in the determination of the parameters of methods such as SVM, ANFIS, etc. PSO, one of these heuristic methods, is widely used to estimate hydrological parameters, particularly to train ANN; however, the number of studies on determination of SVM parameters is more limited. In the literature, there are a few examples of the use of PSO in determining ANFIS parameters for the prediction of hydrological data (Pousinho et al. 2011).

Chau (2006) developed PSO-based artificial neural network (PSO-ANN) approach to estimate the water level of Shing Mun River in Hong Kong using PSO to train multilayer perceptrons. Mohandes (2012) employed PSO-ANN model to predict global solar radiation using data from 41 stations in Saudi Arabia and compared with those of the BP algorithm. Sedki and Ouazar (2010) used PSO-ANN algorithm to estimate daily flows of a basin in a semiarid region of Morocco, while Tapoglou et al. (2014) used it to simulate the hydraulic head changes in an observation well in the region of Agia, Chania, Greece.

García Nieto et al. (2014) developed hybrid models using PSO and SVM methods for long-term estimation of turbidity in the Nalon river basin in the north of Spain. Sudheer et al. (2014) studied estimation performance of PSO-SVM model using monthly streamflow data from St. Regis River in the nearby Clark Fork and Swan River in the nearby Bigfork and compared with those of the ANN and ARMA models. Zhao and Wang (2010) used the PSO algorithm to determine the parameters of the SVR model to rainfall forecasting model using monthly rainfall data in Guangxi, China, during 1954–2008. They compared the results of PSO-SVR model with conventional SVR method.

Pousinho et al. (2011) employed the hybrid PSO-ANFIS approach to predict short-term wind power in Portugal and compared with those of the ARIMA, neural network (NN), NN combination with wavelet transform (NNWT), and wavelet neuro fuzzy (WNF).

Considering that lake water is usually fed by precipitation and streamflow and water loss is dependent on the amount of water drawn from lakes and evaporation, it would be an important step to establish the correlation between the change in lake water level and such hydrometeorological variables (Yurtcu 2006).

The aim of this study was to determine the usefulness of fast convergent PSO in the solution of nonlinear problems in prediction of optimal values of the parameters of SVM and ANFIS methods. The change in water level of Lake Beysehir, the largest freshwater lake in Turkey, was estimated using hybrid PSO models. The performance of the hybrid PSO models was compared with the GRNN. All models with various input structures were constructed, and the best model was determined using different performance criteria.

2 Study area and data.

Lake Beysehir (37° 40′ 54″ N, 31° 43′ 22″ E) is the largest freshwater lake and the third largest lake by surface area in Turkey. It is situated on the west end of Konya-Çumra closed sub-basin, which comprises the southwest of Konya closed basin, the largest closed basin in Anatolia, and it is 90 km away from the province of Konya. Lake Beysehir is within the borders of Konya and Isparta provinces. It is currently 1121.93 m above sea level; its surface area is approximately 650 km2, and rainfall area is 4086.4 km2. Although a variety of numbers was reported, the lake is approximately 8.5 m deep, 45 km long, and 10–25 km wide and has a circumference of 120 km. Lake Beysehir was opened for operation in 1914, and it is currently utilized for the purposes of supplying drinking water, irrigation, fishing, various commercial business, and tourism. Figure 1 shows Lake Beysehir and its location in Turkey.

Fig. 1
figure 1

Lake Beysehir and its location in Turkey

In this study, monthly inflow-lost flow (I), precipitation (P), evaporation (E), and outflow (O) values determined by the 4th Regional Directorate of State Hydraulic Works (DSI) were used as inputs in order to estimate the monthly change in Lake Beysehir water level (L). Three hundred forty-eight-month data in total covering the period from 1962 to 1990 were used, and the first 276-month data (1962–1984) were used in training process, and the remaining 72-month data (1985–1990), in testing process. Here, level values were given as elevation so the values of change in level per month were calculated by taking the differences in elevation compared to the previous month.

3 Methodology.

3.1 Generalized regression neural network

GRNN, developed by Specht (1991) as an alternative to feed forward backpropagation algorithms, does not require an iterative training procedure.

GRNN consists of four layers, including the input layer, the hidden layer, the summation layer, and the output layer. If f(x,y), the common probability intensity function, is known, regression of dependent y variable with respect to independent x variable is expressed by Eq. (1).

$$ E\left[\left.y\right|X\right]=\frac{{\displaystyle \underset{-\infty }{\overset{+\infty }{\int }}yf\left(X,y\right) dy}}{{\displaystyle \underset{-\infty }{\overset{+\infty }{\int }}f\left(X,y\right) dy}} $$
(1)

Otherwise, f(x,y) function is estimated using Eq. (2) by utilizing observed X i and Y i values.

$$ \overset{\frown }{f}\left(X,Y\right)=\frac{1}{{\left(2\pi \right)}^{\left(p+1\right)/2}{s}^{\left(p+1\right)}}\times \frac{1}{n}\sum_{i=1}^n \exp \left[-\frac{{\left(X-{X}^i\right)}^T\left(X-{X}^i\right)}{2{s}^2}\right]\times \exp \left[-\frac{{\left(Y-{Y}^i\right)}^2}{2{s}^2}\right] $$
(2)

where n indicates the number of observed data; s, the spread value; and p, the dimension of vector x. \( {D}_i^2 \) is a scalar function and defined as

$$ {D}_i^2={\left(X-{X}^i\right)}^T\left(X-{X}^i\right) $$
(3)

Equation (4) is obtained by solving the integrals in Eq. 1 (Alp and Cigizoglu 2004).

$$ \overset{\frown }{Y}(X)=\frac{{\displaystyle \sum_{i=1}^n{Y}^i \exp \left(-\frac{D_i^2}{2{s}^2}\right)}}{{\displaystyle \sum_{i=1}^n \exp \left(-\frac{D_i^2}{2{s}^2}\right)}} $$
(4)

3.2 Support vector regression

SVM was proposed by Vapnik for the first time to solve classification and regression type problems (Vapnik 1995). When SVM was used for regression, it is called as support vector regression (SVR). Traditional machine learning methods involve several problems, including the demand for numerous training data, low convergence rate, local minima, and overfitting/underfitting (Lu et al. 2002). SVM overcomes these problems by operating on the basis of structural risk minimization (Shen et al. 2004). SVM’s output function is obtained as Salat and Osowski (2004).

$$ f(x)={\displaystyle \sum_{i=1}^N\left({\alpha}_i-{\alpha}_i^{*}\right)K\left(x,z\right)+{b}_i} $$
(5)

where \( \left({\alpha}^i-{\alpha}_i^{*}\right) \) indicates Lagrange factors, K(x,z), the inner product core function defined in accordance with the Mercer theorem and b i , the bias value. A detailed description of SVR is available in Buyukyildiz et al. (2013).

In this study, ε-SVR model was used as SVR model. Vapnik proposed ε-SVR by introducing an alternative ε-insensitive loss function. This loss function allows the concept of margin to be used for regression problems. The purpose of SVR is to find a function having maximum ε deviation from actual target vectors for all training data given, which should be at the same level as much as possible. In other words, error on any training data should be smaller than ε (Pal and Arun 2006).

The most important thing to consider in SVR is the selection of ε, C, and kernel function parameters. ε-Insensitive error term constant indicates the range in which error is neglected. As this constant gets smaller, the number of support vectors to be found by SVM increases. C—the regularization constant maintains order between system complexity characterized by weight vector and prediction errors measured by insensitive variables. Using these variables in the most appropriate values is highly effective in the success of SVM (Ekici 2007). However, there is not any decisive criterion for selecting both kernel function and model parameters (Lin 2006). In this study, radial basis kernel function, one of kernel functions widely used in SVR applications, was used and it is defined as follows.

$$ K\left({x}_i,x\right)= \exp \left(-\gamma {\left\Vert {x}_i-{x}_j\right\Vert}^2\right) $$
(6)

γ is the kernel parameter that ensures radius control. SVM architecture is shown in Fig. 2.

Fig. 2
figure 2

General structure of support vector machines

3.3 Adaptive network-based fuzzy inference system

ANFIS is a hybrid intelligence method that utilizes parallel calculation and learning ability of artificial neural networks and inference feature of fuzzy logic. ANFIS model, developed by Jang (1993), uses Sugeno-type fuzzy inference system and hybrid learning algorithm. Learning algorithm of ANFIS is a hybrid learning algorithm consisting of combined use of the least squares method and backpropagation learning algorithm. Network-based inference systems have two important adjustments: structural adjustment and variable adjustment. Structural adjustment involves the number of variables to be calculated, the number of rules, expression of definition spaces of each input-output variable using fuzzy sets, and creation of the structure of rules, whereas variable adjustment, the centers of membership functions, gradients, widths, and calculation of weights of fuzzy logic rules. It is more advantageous than other systems due to the simplicity of optimization of its parameters (Jang 1993). A general ANFIS architecture consisting of two inputs such as x 1 and x 2, a single output, y and four rules is as that shown in Fig. 3.

Fig. 3
figure 3

Adaptive network-based fuzzy inference system (Jang 1993)

The layers in Fig. 3 comprise the input, the fuzzification, the rule, the normalization, the defuzzification, and overall layers. Detailed information on the functioning of these layers is available in Jang (1993), Jang et al. (1997).

3.4 Particle swarm optimization

PSO is a population-based stochastic search algorithm developed by Kennedy and Eberhart (1995), inspired by the behavior of bird flocks. It was designed to solve nonlinear problems. It is used to provide a solution to multi-parameter and multi-variable optimization problems. The system is initiated by a population containing random solutions and searches for the most optimum solution by updating generations. Potential possible solutions, called a particle in PSO, follow the optimum particle on that particular moment and travel around the problem space. The most important difference of PSO from conventional optimization techniques is that it does not require derivative knowledge. This property takes away the burden of complex operations required for solution of many problems. The algorithm of PSO has a small number of parameters, making it significantly easy to apply PSO. PSO can be successfully applied in many areas, including function optimization, fuzzy system control, and artificial neural network training (Zhao et al. 2005).

3.4.1 PSO-based SVR and ANFIS application

In this study, PSO was used to optimize the values of the parameters of SVR and ANFIS methods in PSO-SVR and PSO-ANFIS hybrid models. The algorithm of these hybrid models is as follows.

  1. (1)

    Selecting parameters for SVR/ANFIS

  1. a.

    SVR: type of Kernel function and the range of SVR parameters (C, ε, and γ) are set.

  2. b.

    ANFIS: the number and type of membership function for input are set.

  1. (2)

    PSO: the number of particles, acceleration constants c 1, c 2 of the particles, w (inertia weight) value, and the number of iterations are set.

  2. (3)

    Training

  1. a1.

    Training with SVR: The particles’ SVR parameters considered as position (C, epsilon, and gamma) are randomly assigned within the limits set for each particle. These parameters are used in SVR.

  2. a2.

    Training with ANFIS: The particles’ output membership function parameters considered as position are randomly assigned within the limits set for each particle. These parameters are used in the constants (A) of “constant” function specified as ANFIS output membership function

  3. b.

    SVR/ANFIS is trained for each particle using training data and the corresponding output values.

  1. (4)

    Calculation of fitness values using fitness function for each particle

  2. (5)

    The stop criterion is compared. Step 3–step 8 is repeated until stop criterion is met.

  3. (6)

    For all particles, the fitness value of each particle is compared with the best position of the particle (p best). If the current value is better than p best, p best value is replaced with the current value.

  4. (7)

    Best fitness value is compared with all previous best values (g best) of the population and if the current value is the best, array index of the particle is reset to the current value.

  5. (8)

    Speed and position values for each particle are updated using following equations.

$$ {v}_{ij}^t=w{v}_{ij}^{t-1}+{c}_1{r}_1\left(p{\mathrm{best}}_{ij}^{t-1}-{x}_{ij}^{t-1}\right)+{c}_2{r}_2\left(g{\mathrm{best}}_j^{t-1}-{x}_{ij}^{t-1}\right) $$
(7)
$$ {x}_{ij}^t={x}_{ij}^{t-1}+{v}_{ij}^t $$
(8)
  1. (9)

    Testing data is applied to the model developed according to SVR/ANFIS parameters by which the best fitness value has been obtained.

4 Application and results

In order to estimate the monthly change in water level of Lake Beysehir using PSO-εSVR, PSO-ANFIS, and GRNN methods; before applying the methods, the following equation was used to eliminate unit differences between parameters used and the data was made non-dimensional between 0 and 1.

$$ {X}_{\mathrm{norm}}=\frac{X_i-{X}_{\min }}{X_{\max }-{X}_{\min }} $$
(9)

where X norm, X i , X min, and X max represent normalized, observed, minimum, and maximum values for all parameters, respectively.

Models were constructed using five different input combinations of the variables I, P, E, and O, namely (i) I-P-E-O, (ii) I-P-E, (iii) P-E, (iv) P, and (v) E, and the effects of individual variables on water level change were investigated. Mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R 2) criteria, given in the following equations, were used to evaluate the success of the generated models.

$$ MAE=\frac{1}{N}{\displaystyle \sum_{i=1}^N\left|Y{i}_{\mathrm{observed}}-Y{i}_{\mathrm{estimate}}\right|} $$
(10)
$$ \mathrm{RMSE}=\sqrt{\frac{1}{N}{\displaystyle \sum_{i=1}^N{\left(Y{i}_{observed}-Y{i}_{\mathrm{estimate}}\right)}^2}} $$
(11)
$$ {R}^2=\frac{{\left[{\displaystyle \sum_{i=1}^N\left(Y{i}_{\mathrm{observed}}-{\overline{Y}}_{\mathrm{observed}}\right)\left(Y{i}_{\mathrm{estimate}}-{\overline{Y}}_{\mathrm{estimate}}\right)}\right]}^2}{{\displaystyle \sum_{i=1}^N{\left(Y{i}_{\mathrm{observed}}-{\overline{Y}}_{\mathrm{observed}}\right)}^2{\displaystyle \sum_{i=1}^N{\left(Y{i}_{\mathrm{estimate}}-{\overline{Y}}_{\mathrm{estimate}}\right)}^2}}} $$
(12)

For PSO-εSVR method, a software package known as LIBSVM Software (Chang and Lin 2001) was employed and the applications were performed on MATLAB 2010b.

While estimating monthly water level change, PSO-εSVR and PSO-ANFIS hybrid models were developed using PSO algorithm, a heuristic algorithm, for the determination of both εSVR and ANFIS-method parameters. While constructing both models, the following were selected: the number of particles in PSO = 30, initial inertia weight w max = 0.9, and perminate inertia weight w min = 0.4. The range of maximum speed values that a particle can have was [−1, 1], the number of iterations for each input combination was 10, and c 1 and c 2 learning factors were 2. c 1 and c 2 are constants representing acceleration terms, which draw each particle to its best position (p best) and to the best of the population (g best). c 1 helps the particle move according to its experiences, while c 2 helps it move according to the experiences of the other particles in the population.

4.1 PSO-εSVR application

For the estimation of monthly water level, error term (ε), regulatory factor (C) of εSVR method, and γ parameters of radial basis kernel function were determined using PSO algorithm. The number of runs while estimating εSVR parameters using PSO was 30. Search spaces for C, ε, γ parameters were [1 100], [0.01 0.5], [0.1 8], respectively. Box plots for training- and testing-MAE, RMSE, and R 2 values obtained for five different inputs at the end of 30 runs are shown in Fig. 4. According to the box plots in Fig. 4, the most successful PSO-εSVR model was the model comprised of I-P-E-O inputs, by which the lowest training- and testing-MAE and RMSE values and the highest training and testing R 2 values were obtained. The least successful model was the PSO-εSVR model with E input which gave the highest training MAE and RMSE values and the lowest training R 2 value and the model with P input for testing values. Testing data of 30 runs for I-P-E-O, which was the most successful input combination in estimating water level change according to box plots were as follows: minMAE = 0.0052 m, maxMAE = 0.04 m, and medianMAE = 0.0198 m; minRMSE = 0.0070 m, maxRMSE = 0.0518 m and medianRMSE = 0.0241 m; minR 2 = 0.9169, maxR 2 = 0.9995, medianR 2 = 0.9909. Standard deviation values for MAE, RMSE, and R 2 of 30 runs for I-P-E-O input were 0.0111, 0.0278, and 0.0171, respectively.

Fig. 4
figure 4

Box plots for training- and testing-MAE, RMSE, and R2 values obtained using PSO-εSVR method for five different inputs at the end of 30 runs

4.2 PSO-ANFIS application

In this study, Grid Partition (GP) technique was used to set the rules in the case of PSO-ANFIS models. In PSO-ANFIS-GP models, types of input and output membership functions and the numbers of membership functions were determined for each input combination. Trimf, gaussmf, trapmf, gauss2mf, and gbellmf functions were used for input membership functions. The numbers 2-3-4 were considered as the number of membership functions for each input combination. Constant function was used as the output membership function. Optimum values of a constant of output membership functions were determined using PSO. According to the type and number of different membership functions, each input combination was run 30 times, and the most successful models were identified by considering the run result with the highest R 2 value. Maximum testing-R 2 values obtained according to the number and type of membership functions of five different input combinations for the most successful models are given in Fig. 5.

Fig. 5
figure 5

Maximum testing-R 2 values obtained according to the number and type of membership functions of five different input combinations for the most successful models.

According to R 2 results in Fig. 5, the most successful PSO-ANFIS-GP models were obtained with trimf for I-P-E-O, I-P-E, P and E combinations and with gbellmf for P-E combination. The obtained number of membership functions for the most successful models was two in the cases of I-P-E-O, I-P-E, and E inputs and four in the cases of P-E and P inputs. Box plots for MAE, RMSE and R2 obtained using the results of 30 runs of these models are shown in Fig. 6.

Fig. 6
figure 6

Box plots for training- and testing-MAE, RMSE, and R 2 values obtained using PSO-ANFIS-GP method for five different inputs at the end of 30 runs

According to the box plots in Fig. 6, the most successful PSO-ANFIS-GP model was the model comprised of I-P-E-O input combination for which minimum training- and testing-MAE and RMSE values and maximum training- and testing-R2 values were obtained. The least successful model by R 2 value was the PSO-ANFIS model for which E input was used for training values and P input was used for testing values. Testing data of 30 runs for I-P-E-O, which was the most successful input combination in estimating water level change according to box plots were as follows: minMAE = 0.1755 m, maxMAE = 0.6333 m and medianMAE = 0.2998 m; minRMSE = 0.2062 m, maxRMSE = 0.7677 m and medianRMSE = 0.3507 m; minR2 = 0.9359, maxR2 = 0.9932, medianR2 = 0.9744. Standard deviation values for MAE, RMSE and R 2 of 30 runs for I-P-E-O input were 0.1436, 0.1747, and 0.0155, respectively.

4.3 GRNN application

Spread value (n) was tested with an increment of 0.01 in the range of [0.01 5] for 5 different input combinations in order to estimate water level change using GRNN technique. The results obtained for testing data at the end of the analysis are given in Fig. 7. According to these results, I-P-E-O input combination was the most successful GRNN model with a spread value of n = 0.12 and R 2 = 0.9527, whereas the least successful model was the GRNN model obtained in the case of P input combination with a spread value of n = 0.04 and R 2 = 0.6370. The highest and lowest MAE and RMSE criteria were obtained for P and I-P-E-O combinations, respectively.

Fig. 7
figure 7

Testing-MAE, RMSE, R 2, and spread values for the most successful models obtained by GRNN method

The results obtained from the models constructed using PSO-εSVR, PSO-ANFIS-GP, and GRNN methods employed in this study in order to estimate water level change in Lake Beysehir are listed in Table 1. Table 1 also shows the results of the study by Buyukyildiz et al. (2014).

Table 1 Results of PSO-εSVR, PSO-ANFIS-GP hybrid models, and GRNN models

In view of Table 1, when the results obtained by parameter optimization using PSO are compared with the results obtained using only SVR and ANFIS-GP (Buyukyildiz et al. 2014), it is clear that parameter optimization was effective on the success. In PSO-ANN hybrid model, PSO was used to update weights, rather than parameter optimization, and it was found to have no positive effect on the success of ANN.

The input combination for which the most successful results was obtained in all methods given in Table 1 was I-P-E-O combination. Among the employed methods, PSO-εSVR method was the method with the highest estimation success with an R 2 value of 0.9995 in I-P-E-O combination. PSO-ANN models were the least successful model in all combinations, while the other methods generally showed performances significantly similar to each other. The least successful method among the models employed in this study to estimate water level change was GRNN method. The obtained C, ε, and γ parameters for I-P-E-O combination of PSO-εSVR method, the most successful method, were 1.146, 0.273, and 1.738, respectively.

Minimum, maximum, mean, and standard deviation values for testing data performance criteria, scatter diagram, and time series graph of PSO-εSVR (1.146, 0.273, 1.738) model obtained in the combination of I-P-E-O for 30 runs are shown in Fig. 8a, b, c, respectively. The small standard deviation values in Fig. 8a demonstrate that PSO-εSVR model achieved a significant success in terms of estimation. Because all R 2 values were above 90 %. Scatter diagram and time series graph of monthly water level in Fig. 8b, c, respectively, clearly indicate that the constructed PSO-εSVR (1.146, 0.273, 1.738) model was successful in estimating monthly water level change of Lake Beysehir.

Fig. 8
figure 8

a) performance criteria b)time series c) scatter diagram for testing data of the most successful PSO- eSVR (1.146, 0.273, 1.738) model

5 Conclusions

The objective of this study was to optimize the most appropriate parameter values of SVR and ANFIS methods using PSO algorithm in order to estimate water level change of Lake Beysehir. ε-SVR method was employed in SVR applications, while GP method was used to set the rules in the case of ANFIS models. Additionally, GRNN method was used to predict water level change and the results of the model were compared with the results obtained from hybrid PSO-εSVR and PSO-ANFIS models. While constructing the models, five different input combinations of inflow-lost flow (I), precipitation (P), evaporation (E), and outflow (O) data were used, with monthly water level change as the output. According to the results of the models, the models developed using I-P-E-O combination were the most successful models in all three methods in estimating monthly water level change. The models with the least success were the models in which only precipitation was used as the input in all three methods. The most successful model among hybrid PSO-εSVR, PSO-ANFIS-GP, and GRNN models was PSO-εSVR with an R 2 value of 0.9995 obtained using I-P-E-O inputs. When the results of PSO-εSVR and PSO-ANFIS-GP models obtained in this study were compared with εSVR and ANFIS-GP results, hybrid PSO models were found to be slightly more successful.

These results suggest that the PSO algorithm, which does not require any derivative information, has a small number of parameters requiring computation and provides a fast solution by taking away the burden of complex operations in solving nonlinear problems and can be successfully used to optimize parameters of the estimator used to estimate lake water level change.

Estimation of water level change with satisfactory accuracy would shed light to further studies on water resources management such as flood control, management of local water supply, recreation activities, ecosystem sustainability, etc.