Estimation of the Change in Lake Water Level by Artificial Intelligence Methods

Buyukyildiz, Meral; Tezel, Gulay; Yilmaz, Volkan

doi:10.1007/s11269-014-0773-1

Estimation of the Change in Lake Water Level by Artificial Intelligence Methods

Published: 28 August 2014

Volume 28, pages 4747–4763, (2014)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Water Resources Management Aims and scope Submit manuscript

Estimation of the Change in Lake Water Level by Artificial Intelligence Methods

Download PDF

Meral Buyukyildiz¹,
Gulay Tezel² &
Volkan Yilmaz¹

1517 Accesses
77 Citations
Explore all metrics

Abstract

In this study, five different artificial intelligence methods, including Artificial Neural Networks based on Particle Swarm Optimization (PSO-ANN), Support Vector Regression (SVR), Multi- Layer Artificial Neural Networks (MLP), Radial Basis Neural Networks (RBNN) and Adaptive Network Based Fuzzy Inference System (ANFIS), were used to estimate monthly water level change in Lake Beysehir. By using different input combinations consisting of monthly Inflow - Lost flow (I), Precipitation (P), Evaporation (E) and Outflow (O), efforts were made to estimate the change in water level (L). Performance of models established was evaluated using root mean square error (RMSE), mean square error (MSE), mean absolute error (MAE) and coefficient of determination (R²). According to the results of models, ε-SVR model was obtained as the most successful model to estimate monthly water level of Lake Beysehir.

Utilization of PSO algorithm in estimation of water level change of Lake Beysehir

Article 17 December 2015

A Comparative Study of Artificial Neural Networks, Support Vector Machines and Adaptive Neuro Fuzzy Inference System for Forecasting Groundwater Levels near Lake Okeechobee, Florida

Article 19 November 2015

Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index

Article 20 July 2020

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

From the beginning of 20th century, significant changes have been observed in many climatological variables such as temperature, precipitation, evaporation, flow, and water level due to global warming (Qin et al. 2010; Shahid et al. 2012; Burns et al. 2007; Hamed 2008; Shao and Li 2011). These changes lead to significant changes in spatial and temporal distribution of water resources. Reservoirs are important in terms of water resources, and have many benefits, including irrigation of agricultural land, meeting drinking water and utility water demands, making it available for tourism by creating a promenade around it and inland waterways transport. On the other hand, in the case of a very high water level, they negatively affect settlements, rail and road transportation, shore-based recreational and educational facilities as well as plantations due to floods (Erkek and Agiraliogu 2002).

In addition to their benefits, lake waters may cause damage in the case of an increased water level so, in order to obtain maximum benefits from them and bring damage caused by them down to a minimum, change of lake water level should be known in advance and controlled, where necessary. Considering that water level changes in lakes are considerably complex outcomes of many hydrological factors such as precipitations falling on the lake surface or lake watershed, evaporation from free water surface, direct and indirect runoffs from neighbor basins, humidity, air and water temperature, wind speed and groundwater change, etc., it would be an important step to establish the correlation between the change in lake water level and hydro meteorological variables (Kisi et al. 2012).

Lake level prediction can be performed using some complex models taking into consideration the relevant parameters, but susceptibility of lake water levels to these factors might be from one area to the other. Moreover accurate measurement of these factors are often severity and with a great amount of ambiguity. Therefore, instead of using traditional methods with many inputs such as water balance, they are preferable that soft computing techniques (SCTs) which simulate lake water level changes based on previously recorded lake water levels be available for research as well as practical purposes. SCTs are convenient in clarifying the relation between a process output and its input regardless of the water level of understanding the underlying physical processes (Abdel-Hafez 2007).

Lake water level prediction provides useful data on the flood control and water supply and is a challenging task, therefore, forecasting lake water level fluctuations at various time intervals using the records of past time series is an important tool in terms of water resources planning, designing, construction and management (engineering, etc.) (Kisi et al. 2012).

Recently, the use of STCs such as artificial neural networks (Kisi 2009; Dogan 2005; Sudheer et al. 2002; Shirmohammadi et al. 2013; Wu et al. 2010; Altunkaynak 2007; Solomatine and Dulal 2003; Giustolisi and Laucelli 2005; Cigizoglu 2003; Ondimu and Murase 2007), stochastic models (Kumar et al. 2004; Sveinsson et al. 2003), fuzzy logic, adaptive network -based fuzzy inference system (Afiq et al. 2013; El-Shafie et al. 2007; Keskin et al. 2006; Kisi 2006; Zounemat-Kermani and Teshnehlab 2008; Valizadeh and El-Shafie 2013; Khatibi et al. 2011), support vector machines (Afiq et al. 2013; Wu et al. 2008; Khan and Coulibaly 2006; Cimen and Kisi 2009; Zhao and Wang 2010; Cimen 2008) and Artificial Neural Networks based on Particle Swarm Optimization (Sedki and Ouazar 2010; Mohandes 2012; Chau 2006, 2007; Xi et al. 2012; Tapogloua et al. 2014) are widely used as an appropriate tool for modeling complex nonlinear phenomena covering hydrology and water resources systems.

Soft Computing Techniques (SCTs) are comparatively new statistically-based methods that are becoming commonly used for prediction, evaluation and modeling applications in hydrology and water resources applications. SCTs can be trained to estimate the behavior of real-time phenomena without the need of a physical model to relate the causes and effects of the phenomena. This makes SCT an economic technique for water resources planning, designing, construction and management (engineering, etc.).

The change in lake water level can be generally determined by water balance created by the difference between inflow to and outflow from the lake as well as by models formed using various artificial intelligence methods utilizing hydrological and hydro-meteorological data. For this purpose, various studies were undertaken and the results obtained from the models established using these methods were compared with actual values from observations (Valizadeh et al. 2011; Chang and Chang 2006; Sulaiman et al. 2011; Leahy et al. 2008; Yarar et al. 2009)

Altunkaynak (2007) used ANN for water level changes prediction of Van Lake in Turkey and compared with the traditional ARMAX models in forecasting 1 month-head lake water levels. Khan and Coulibaly (2006) studied the usability of SVM for Lake Erie water level prediction. Guldal and Tongal (2010) used Recurrent Neural Network (RNN) and ANFIS for predicting future lake levels of the Lake Egirdir in Turkey. Talebizadeh and Moridnejad (2011) carried out different ANNs and ANFIS models to forecast the lake water level variations in Lake Urmia in northwest of Iran. Kakahaji et al. (2013) used six models which are two mathematical models (water balance and multiple correlation equation), two dynamic linear models (ARX and BJ), as well as a couple of intelligent models (MLP and LLNF) to predict the lake water level fluctuations of Lake Urmia in Iran. SVM and ANN techniques were used to predict surface water level changes of Van and Egirdir Lakes in Turkey by Cimen and Kisi (2009). Kisi et al. (2012) applied GP, ANFIS, ANN and ARMA models to forecast daily lake water level, three time steps ahead. Chau (2006) used PSO-based neural network technique for river stage estimation in the Shing Mum River of Hong Kong. Ghorbani et al. (2010) used Genetic Programming (GP) to predict the hourly sea water level changes of Hillarys Boat Harbour in Australia and compared with ANNs models. Afiq et al. (2013) investigated the usability of SVM model to forecast the daily dam water level of the Klang gate using daily rainfall and water level data, and compared the data with the results determined using ANFIS model. Sivapragasam and Liong (2004) investigated the sequential elimination approach to determine the optimal training data set and then performed SVR to predict the water level changes. Yarar et al. (2009) assessed the performance of ANN, ANFIS and SARIMA models for the prediction of variations in the water level of Lake Beysehir in Turkey. Shiri et al. (2011) investigated the ability of a neuro-fuzzy (NF) model to estimate the sea level changes in the Hillarys Boat Harbour, Western Australia. Ondimu and Murase (2007) studied the accuracy of an ANN model in reservoir water level prediction.

Lake Beysehir, the 3rd largest lake by size and the largest freshwater lake in Turkey, is currently utilized for the purposes of supplying drinking water, irrigation, fishing and tourism, etc. However, in recent years, the water level of Beysehir Lake considerably fell due to inadequate rainfall and excessive water withdrawal. Nowadays, global climate change and the adverse effects thereof are intensively discussed, and Konya Closed Basin is one of the regions in Turkey where these adverse effects will be seen most. Therefore, the estimation of water level change in Lake Beysehir, which is of importance for the province of Konya, which has an important place in the agriculture of Turkey, and for water resources in the region, is of great importance in terms of efficiency to be provided from the lake.

Accurate prediction of water level variation in lakes and reservoirs in response to hydroclimatological changes is a significant step for the development of sustainable water usage policies, particularly for sophisticated hydrological systems such as Lake Beysehir, Turkey.

For this aim, in this study, models were established using artificial neural networks (ANNs), support vector machines (SVM), the adaptive network-based fuzzy inference system (ANFIS) and artificial neural networks based on particle swarm optimization (PSO-ANN) method to estimate the water level of the Lake Beysehir, Turkey’s largest freshwater lake, and the results obtained from the established models were compared with each other.

2 Methodology

2.1 Support Vector Machines (SVM)

In this study, ε-SVR model was used as SVR model. Vapnik (1995) proposed ε-SVR by introducing an alternative ε-insensitive loss function. This loss function allows the concept of margin to be used for regression problems. The purpose of SVR is to find a function having maximum ε deviation from actual target vectors for all training data given, which should be at the same level as much as possible. In other words, error on any training data should be smaller than ε (Pal and Arun 2006).

The three factors that affect prediction performance of SVM regression are the insensitive error term (ε), the regularization factor (additional capacity control) (C), type of kernel function and kernel parameter. ε constant indicates the range in which error is neglected. As this constant gets smaller, the number of support vectors to be found by SVM increases. C, the regulatory constant, maintains order between system complexity characterized by weight vector and prediction errors measured by insensitive variables. Using these variables in the most appropriate values is highly effective in the success of SVM (Ekici 2007).

Selection of the kernel function as well as model parameters to be used plays an important role in determining SVR performance. However, there isn’t any decisive criterion for selecting both kernel function and model parameters (Lin 2006). The Radial Basis Kernel Function is selected for SVR model in this study and expressed as

$$ K\left({x}_i, x\right)= \exp \left(-\gamma {\left\Vert {x}_i-{x}_j\right\Vert}^2\right) $$

(1)

where γ is the parameter of radial basis kernel function which gives the width of the kernel.

2.2 Multi-Layer Perceptron (MLP)

ANN models are often indicated by ANN (i,j,k) network architecture, where i is the number of neurons in the input layer; j, the number of neurons in hidden layer and k, the number of neurons in the output layer. MLP model utilizes a supervised learning method. Learning process is based on the principle of minimizing the error between the output data generated by the network by optimizing weights and the actual output data. Learning rule is a generalized state of Delta Learning Rule based on the method of least squares. Therefore, the learning algorithm is also called Generalized Delta Rule or the Error Back-Propagation Algorithm (Dogan 2005). According to this rule, training is conducted in three stages: a) forward propagation of input training data, b) calculation of error corresponding to target values given, c) adjusting weights by propagating this error backward (Zhao and Wang 2010; Kisi 2009; Khan and Coulibaly 2006).

2.3 Radial Based Neural Network (RBNN)

The basic idea in radial basis neural networks (RBNN) is simply to collect a group of radial basis functions by weighting them so that they approximate to the desired f function (Verleysen and Hlavackova 1994). RBNN is a three layered structure. Input layer is associated with input vector space, and output layer with pattern classes. Thus, the entire structure is reduced to the determination of the structure of the hidden layer and weights between the hidden layer and the output layer. The activation functions of the neurons in the hidden layer are determined by a C_j center and σ_j bandwidth. Activation function is a Gaussian curve defined by the equation:

$$ {\phi}_j(X)= \exp \left(-\frac{\left\Vert X-{C}_j\right\Vert }{2{\sigma}_j^2}\right) $$

(2)

The general equation for the output of neuron j in the output layer is as follows:

$$ {s}_j(X)={\displaystyle \sum_{i=1}^k{w}_{i j}{\phi}_i(X)+{b}_j} $$

(3)

where w_ij is the weight between hidden neuron i and the output neuron j, b_j is the bias constant (Paredes and Vidal 2000).

2.4 Particle Swarm Optimization (PSO)

Particle swarm optimization (PSO) was developed by Eberhart and Kennedy (1995) and, it has been preferred as one of the most popular population based heuristic methods because of its feasibility and simplicity. This algorithm based on the social behavior of biological organisms, specifically birds flocking to a food (Yalcin et al. 2012; Bratton and Kennedy 2007). This behavior associated with that of an optimization search for solutions to non-linear equations in a real-valued search space. The member of social behavior called as swarm’s particle. Each particle has a position in the search space. This algorithm searches the space of an objective function by adjusting the trajectories of the particles. When a particle finds a location that is better than any previously found, then it updates it as the current best for particle (p_i). There is a current best (local best) for all particles (Yang 2008).

The global best (p_g) is referred as a position among all the current best solutions. Until objective no longer improves or the number of iteration, the PSO algorithm updates particles at each step by recalculate the velocity and position of each particle in every dimension using Eqs. 4 and 5.

$$ {v}_{ij}={v}_{ij}+{c}_1{r}_1\left( pbes{t}_{ij}-{p}_{ij}\right)+{c}_2{r}_2\left( gbes{t}_{ij}-{p}_{ij}\right) $$

(4)

$$ {p}_{ij}={p}_{ij}+{v}_{ij} $$

(5)

where v_ij and p_ij are velocity and position respectively, c₁ and c₂ are constants, r₁ and r₂ are independent random numbers uniquely generated at every update for each individual dimension, pbest is the local best and gbest is the global best.

The pseudo code of PSO algorithm consists of update steps are illustrated in following (Bratton and Kennedy 2007; Yang 2008).

Algorithm 1 The PSO update process

for each time step t do

for each particle i in the swarm do

update position p using Eqs 4 and 5

calculate particle fitness f (p)

update p _best, g_best

end for

2.4.1 Partical Swarm Optimization Based Artificial Neural Network (PSO-ANN)

PSO is a common used optimization algorithm in literature. Furthermore, the learning procedure of ANN is a process of getting the weights to the appropriate values and this is called as optimization. Therefore, PSO was preferred to train ANN in this study and called PSO-ANN. In this algorithm connection weights of ANN were used as (assigned) particles in PSO. The process of ANN begins with assigning random values to the connection weights. So the number of weights expresses the dimension of particles. The PSO-ANN network is established according to the each particle. Training samples are presented to the network to be learnt respectively. While this process, the difference (error) between target value and produced value is calculated for each sample. After all training data set is presented to the network, sum of the error (MSE) is calculated to obtain fitness value of particle. In the first step, this fitness value is assigned as local best value (pbest). If this pbest is the best fitness value for all particles, it is accepted global best (gbest). Updating of weights is dependent of gbest and pbest values. If obtained fitness value is bigger than the target error the position value of particles is updating using pbest and gbest in the training process. Then the network is set up according to the new particles and these steps continue until the fitness value reaches the acceptable error value or maximum iterations.

2.5 Adaptive Network Based Fuzzy Inference System (ANFIS)

Adaptive network-based fuzzy inference system is based on the idea of combining the learning ability of artificial neural networks and advantages of fuzzy logic such as human-like decision-making and ease of providing specialized knowledge (Elmas 2003). Actual purpose of adaptive network-based fuzzy inference system is to use artificial neural networks in order to adjust and find the structure and variables of ANFIS system (Elmas 2003; Jang et al. 1997).

ANFIS consists of 6 layers. The functioning of layers in ANFIS structure are the input layer, the fuzzification layer, the rule layer, the normalization layer, the defuzzification layer and overall layer in the order given.

3 Case Study

Lake Beysehir (37°40′54″N, 31°43′22″E), which is the third largest lake as well as the largest freshwater lake in Turkey, is located within the provincial borders of Konya and Isparta, located in southwest of Turkey (Fig. 1). The water level in the lake often varies from year to year and from season to season. It has an average surface area of 650 km², and it is 1121.93 m above sea level. Its elevation is in the range of 1120 to 1125.5 m. Although a variety of numbers were reported, it is approximately 8.5 m deep, 45 km long, 10–25 km wide, and has a circumference of 120 km. The lake is currently utilized for the purposes of supplying drinking water, irrigation, fishing, commercial business and tourism (Babaloglu 1997).

In this study, efforts were made to estimate the change in lake water level (L) using monthly inflow - Lost flow (I), Precipitation (P), Evaporation (E) and Outflow (O) values between 1962 and 1990 determined by 4th Regional Directorate of State Hydraulic Works (DSI). For this purpose, the Multi-Layer Perceptron (MLP), which is a conventional ANN model, Radial-Based Neural Network (RBNN), Epsilon-Support Vector Regression (ε-SVR), Adaptive Neural Fuzzy Inference System (ANFIS) and Particle Swarm Optimization Based- Artificial Neural Network (PSO-ANN) methods were used, and the performance of the models was evaluated by comparing the estimated values from the models with values from observations.

348-month data between 1962 and 1990 were used to establish the models. 276-month data during the period from 1962 to 1984 of this data was used in training process, and the remaining 72-month data during the period from 1962 to 1984 was used in testing process. I, P, E and O data was used as input variables, while values of change in water level (L) were used as output. Here, level values were given as elevation so the values of change in level per month were calculated by taking the differences in elevation compared to the previous month.

4 Application and Results

In order to estimate the change in monthly water level of Lake Beysehir, the following equation was used to eliminate unit differences in parameters used in input and output layers and the data was made non-dimensional between 0 and 1, before applying MLP, RBNN, ε-SVR, ANFIS and PSO-ANN methods

$$ {X}_{norm}=\frac{X_i-{X}_{\min }}{X_{\max }-{X}_{\min }} $$

(6)

where X_norm, X_i; X_min and X_max represent normalized, observed, minimum and maximum values for all parameters, respectively.

In order to estimate the change in lake water level, models were created using different input combinations comprised of the variables I, P, E, and O, and the effects of individual variable on the change in water level was investigated.

Mean absolute error (MAE), root mean square error (RMSE), mean square error (MSE) and coefficient of determination (R²) criteria were used to evaluate the success of the models generated by using MLP, RBNN, ε-SVR, ANFIS and PSO-ANN methods.

For SVR method, a software package known as LIBSVM Software was employed, while SVR, MLP, RBNN, ANFIS and PSO-ANN were applied using MATLAB 2010b.

$$ MAE=\frac{1}{N}{\displaystyle \sum_{i=1}^N\left| Y{i}_{observed}- Y{i}_{estimate}\right|} $$

(7)

$$ RMSE=\sqrt{\frac{1}{N}{\displaystyle \sum_{i=1}^N{\left( Y{i}_{observed}- Y{i}_{estimate}\right)}^2}} $$

(8)

$$ MSE=\frac{1}{N}{\displaystyle \sum_{i=1}^N{\left( Y{i}_{observed}- Y{i}_{estimate}\right)}^2} $$

(9)

$$ {R}^2=\frac{{\left[{\displaystyle \sum_{i=1}^N\left( Y{i}_{observed}-{\overline{Y}}_{observed}\right)\left( Y{i}_{estimate}-{\overline{Y}}_{estimate}\right)}\right]}^2}{{\displaystyle \sum_{i=1}^N{\left( Y{i}_{{}_{observed}}-{\overline{Y}}_{{}_{observed}}\right)}^2{\displaystyle \sum_{i=1}^N{\left( Y{i}_{estimate}-{\overline{Y}}_{estimate}\right)}^2}}} $$

(10)

4.1 Application of MLP Model

The most difficult task in application of MLP is the selection of the number of neurons in hidden layers. There isn’t any decisive criterion related to the determination of the number of neurons yet. ANN structure varies depending on the problem. In this study, a 3-layer ANN architecture consisting of a single hidden layer was used to create MLP models. The number of neurons in the hidden layer, learning rate and momentum coefficient were determined by trial and error.

As the activation function, tangent sigmoid and logarithmic sigmoid functions were used in the hidden and output layers, respectively. The number of neurons in the hidden layer was investigated in the range of 1 to 20, whereas learning rate (lr) and momentum coefficient (mc), the parameters affecting convergence rate, were investigated in the range of 0.1 to 0.9. 1000 was taken as the number of iterations in the established MLP models. As a result of the trials made in line with these assumptions, model structures of the most successful MLP models obtained for 5 different input combinations are given in Table 1.

Table 1 The lr, mc, and R² values for MLP models in testing period

Full size table

In view of Table 1, according to test data from trials, the most successful MLP structure by the largest R² value was MLP1 (4,2,1) model in M1 combination, in which all parameters were used. In this model, 4 represents the inputs comprised of I-P-E-O; 2, the number of neurons in the hidden layer and 1, the output (L). According to MLP1 model, in which a learning rate (lr) of 0.3 and a momentum coefficient (mc) of 0.9 were obtained, R² = 0.991 was obtained for test data. According to input combinations used to determine the monthly change in lake water level, the least successful model was found as MLP4 (1,5,1) model for M4 combination with a minimum R² = 0.673, in which total precipitation (P) was used as input and the number of neurons in the hidden layer was obtained as 5 (Buyukyildiz et al. 2013).

4.2 Application of RBNN Model

In order to estimate the change in water level, efforts were made to determine the most appropriate model structure for different input combinations using Radial basis neural network algorithm, and the most successful model was determined by maximum R² value obtained for test data. For this purpose, the number of neurons in the hidden layer, which significantly change the performance of the model, was determined by testing with increments of 1 between 2 and 20, and in the case of spread parameter (σ), with increments of 0.01 between 0.01 and 5.0. The most successful RBNN models obtained for different inputs are given in Table 2. According to the results, the most successful model was established as RBNN1 (4, 6, 4.26, 1) model for M1 combination, for which R² = 0.9986 was obtained and I-P-E-O inputs were employed, in this algorithm. In this model, 4 represents the number of inputs (I-P-E-O); 6, the number of neurons in the hidden layer; 4.26, the spread parameter and 1, the output number (L). The input combination in which model success was the least was obtained as RBNN4 (1, 11, 0.02, 1) model (R² = 0.6891) for an input number of 1 (P), 11 neurons in hidden layer, a spread parameter of 0.02 and an output of 1 in M4 combination, in which precipitation was used as input.

Table 2 The number of neurons in hidden layer, spread parameter (σ) and R² values for RBNN models in testing period

Full size table

4.3 Application of ε-SVR Model

In the case of estimation of water level by ε-SVR, efforts were made to determine the most successful ε-SVR model using the same input combinations. In this study, Radial Basis Kernel Function, which has also been widely used in the literature, was used as kernel function. During implementation of ε-SVR models, trials were made for different input combinations with an increment of 0.01 in the range of [0.01 0.5] for error term (ε), an increment of 1 in the range of [1 100] for regulatory factor (C), an increment of 0.1 in the range of [0.1 8] for γ parameter of Radial Basis Kernel function. C,ε,γ and R² values of the most successful ε-SVR models obtained from these trials are given in Table 3 (Buyukyildiz et al. 2013).

Table 3 The C, ε, γ and R² values for ε-SVR models in testing period

Full size table

According to ε-SVR results obtained for different input combinations, the most successful model was again obtained in ε-SVR 1 model in M1 combination, in which all inputs were used. In this model, obtained as ε-SVR1 (1, 0.01, 0.1), 1 represents C; 0.01, the ε, and 0.1, the γ value. In ε-SVR1 (1, 0.01, 0.1) model architecture, a considerably high performance was achieved with an R² value of 0.9988 for test data. In SVR application, the least successful performance by R² values was in ε-SVR4 (R² = 0.666) model in M4 combination, in which precipitation was used.

4.4 Application of PSO-ANN Model

The same input combinations were also used to establish the optimum model for estimation of the change in water level by training artificial neural networks with PSO, and the resultant models were compared to determine the most appropriate input combination model.

Learning factors c₁ and c₂ given in Eqs. 4 and 5 were taken as 2. c₁ and c₂ are constants representing acceleration terms, which draws each particle to its best position (p_best) and to the best of the population (g_best). c₁ helps the particle move according to its experiences, while c₂ helps it move according to the experiences of the other particles in the population.

In this study, the number of particles was selected as 30, initial inertia weight as w_max = 0.9, perminate inertia weight as w_min = 0.4, and the number of neurons in the hidden layer were investigated in the range of [1 20]. PSO-ANN models were established by taking maximum velocity range which a particle can take as [−1 1], and the number of iterations as 10, 20, 30, 40 and 50 in the respective order for each input combination. The most successful PSO-ANN model structures determined according to the highest R² values are given in Table 4 for each iteration (Buyukyildiz et al. 2013).

Table 4 The PSO-ANN (i,j,k) model structures and R² values for PSO-ANN models in testing period

Full size table

According to the results of PSO-ANN models obtained for different input combinations and iteration numbers; considering the highest R² values, it is clear that the most successful models for each number of iterations were obtained in M1 combinations, in which 4 input parameters were used. The most successful PSO-ANN model by the number of iterations of PSO-ANN models established using M1 combination was obtained with R² value of 0.931 for 10 iterations. According to this model obtained as PSO-ANN (4, 5, 1), the number of neurons in the hidden layer against 4 input parameters was found as 5.

If the results in Table 4 are evaluated individually for each input combination, it is apparent that the most successful models in each combination were obtained at different epochs. The most successful network structure in M1 and M2 model combinations was found for 10 epoch, whereas that in M3, M4 and M5 model combinations was obtained at 20, 30 and 50 epochs, respectively.

4.5 Application of ANFIS Model

While both Back propagation and Hybrid learning algorithms were employed to perform ANFIS models to estimate the change in water level of the Lake Beysehir, Grid Partition (ANFIS-GP) and Subtractive Clustering (ANFIS-SC) techniques were utilized to set up the rules. In ANFIS model, types of input and output membership functions and the number of membership functions, which give an acceptable error value with trial-and- error method for each input combination, were determined. Gaussmf, trimf, trapmf functions were used for input membership functions. Values in the range of 2 to 4 were tried as the number of membership functions for each input function. Both constant and linear functions were used as output membership functions. The number of epochs was taken as 100 in ANFIS models established. The results obtained for ANFIS models are given in Table 5.

Table 5 The characteristics of ANFIS models in testing period

Full size table

According to these results, the most successful models created using the Grid Partition and Subtractive Clustering ANFIS techniques were achieved in ANFIS-GP (4, trimf, constant) structure with 16 rules for I-P-E-O input combination using back propagation learning algorithm and in ANFIS-SC (4, 0.7) structure with 2 rules using hybrid learning algorithm. Figure 2 shows ANFIS model structures of these two models. According to these model structures, the least successful input combination was the combination in which precipitation (P) was taken as input in both ANFIS techniques. Grid Partition technique was more successful than Subtractive Clustering technique in all combinations in ANFIS models created.

The most successful ANFIS model by R² values obtained in ANFIS-GP and ANFIS-SC models in Table 5 was achieved again in ANFIS-GP (4,trimf,constant) structure with a R² of 0.9952 in back propagation learning algorithm.

RMSE, MAE, MSE and R² values of the most successful models for all methods used are given in Fig. 3a-b and scatter diagrams of the models are given in Fig. 4.

As a result, according to the results obtained from models established in this study, in which MLP, RBNN, ε-SVR, PSO-ANN and ANFIS (ANFIS-GP and ANFIS-SC) models were used to estimate the change in water level of Lake Beysehir, the input combination by which the most successful results were obtained in all methods was I-P-E-O combination (Fig. 3a). The least successful input combination in estimating the change in water level was the combination in which evaporation was taken as input for PSO-ANN model, whereas it was the input combination in which precipitation was used as input in other methods. PSO-ANN models were the least successful model in all combinations. Other methods generally showed performances significantly similar to each other.

According to determination coefficients in Fig. 3a, the most successful model among the models established to estimate the change in water level was ε-SVR (1, 0.01, 0.1) model (R² = 0.9988), for which the largest determination coefficient was obtained, if an evaluation is made by testing period. However, ANFIS-GD (4, trimf, constant), ANFIS-SC (4, 0.7), MLP (4,2,1) and RBNN (4, 6, 4.26, 6) models also showed a fairly close success to ε-SVR model in terms of R², RMSE, MSE and MAE, which is also evident in scatter diagrams of the respective models. The success of models is also similar to each other by error criteria obtained for training data, while PSO-ANN model was the lowest -performing model in all input combinations, compared to other methods. However, a R² value of 0.931 obtained for the most successful PSO-ANN structure also demonstrates that this model achieved a significant success in terms of estimation.

According to Fig. 3b presenting the RMSE, MSE and MAE successes of each used model, the most successful models in predicting water levels were obtained as RBNN (RMSE = 0.006 m, MSE = 0.000036 m², MAE = 0.004 m) and ε-SVR (RMSE = 0.01 m, MSE = 0.0001 m², MAE = 0.009 m) models, and the successes of the MLP, ANFIS-GD and ANFIS-SC models were so much closer to each other. Moreover, the model having the lowest success is determined as the PSO-ANN (RMSE = 0.055 m, MSE = 0.003025 m², MAE = 0.752 m) model. In terms of the detection of the success of the used models, the values obtained for the performance criteria presented in Fig. 3b are in parallel with the R² values in Fig. 3a.

It can be clearly seen from the scatter diagrams in Fig. 4 that the success obtained by the models set by the most successful input I-P-E-O (M1) for all the methods used for the water level prediction was considerably high.

5 Conclusion

Planning, designing, construction and management of lakes and reservoirs requires accurate estimations of their water levels. In this study, monthly Inflow - Lost flow (I), Precipitation (P), Evaporation (E) and Outflow (O) observations from the Lake Beysehir in Turkey were used to predict monthly lake level variations using ε-SVR, MLP, RBNN, PSO-ANN and ANFIS methods. While establishing models, different combinations of inflow to the lake - lost flow, precipitation, outflow from the lake and evaporation data were used as input data, and elevation difference between consecutive monthly lake water levels as output data. When performances of models of all input combinations established were investigated, the most successful input combination in each method was M1 combination, in which all I-P-E-O variables were used. When the most successful results in the models established were compared, the most successful method by R² = 0.9988 value was ε-SVR, however, ANFIS, MLP and RBNN methods also proved to be successful to an extent close to the success of ε-SVR method. PSO-ANN method was the lowest-performing method in this study. This high success obtained in all five methods used to estimate the change in water level shows that this type of SCTs can be used to estimate many hydrological parameters required to water resources planning and management.

The use of the estimation power of SCTs can assist Beysehir Lake administrators to take the necessary precautions in the event of estimated sudden falls or rises in the lake water levels.

Future study will use PSO-SVR and PSO ANFIS models to estimate lake water fluctuations to compare the results of this study

References

Abdel-Hafez MH (2007) Application of Self-Learning Machines in Predicting Great SaltLake Water Level. M.Sc. Dissertation, Utah State University, Logan, Utah, USA.
Afiq H, Ahmed E, Ali N, Othman A, Aini K, Mukhlisi HM (2013) Daily forecasting of dam water levels: comparing a support vector machine (SVM) model with adaptive neuro fuzzy inference system (ANFIS). Water Resour Manag 27:3803–3823
Article Google Scholar
Altunkaynak A (2007) Forecasting surface water level fluctuations of Lake Van by artificial neural networks. Water Resour Manag 21:399–408
Article Google Scholar
Babaloglu M (1997) Beyşehir Gölünün Sorunları ve Alınması Gereken Önlemler. Konya İl Genel Meclisi, Beyşehir Gölü Araştırma Komisyonu, Konya
Google Scholar
Bratton D, Kennedy J (2007) Defining a Standard for Particle Swarm Optimization. Proceedings of the 2007 I.E. Swarm Intelligence Symposium
Burns DA, Klaus J, McHale MR (2007) Recent climate trends and implications for water resources in the Catskill Mountain region, New York, USA. J Hydrol 336:155–170
Article Google Scholar
Buyukyildiz M, Tezel G, Yilmaz V (2013) Water level prediction models based on soft computing techniques (SCTs) for Lake Beysehir, Turkey. Proceedings of the Second International Conference on Water, Energy and the Environment Kusadası, Turkey September 21–24
Chang FJ, Chang YT (2006) Adaptive neuro-fuzzy inference system for prediction of water level in reservoir. Adv Water Resour 29(1):1–10
Article Google Scholar
Chau KW (2006) Particle swarm optimization training algorithm for ANNs in stage prediction of Shing Mun River. J Hydrol 329(3–4):363–367
Article Google Scholar
Chau KW (2007) A split-step particle swarm optimization algorithm in river stage forecasting. J Hydrol 346:131–135
Article Google Scholar
Cigizoglu HK (2003) Estimation, forecasting and extrapolation of flow data by artificial neural networks. Hydr Sci J 48(3):349–361
Article Google Scholar
Cimen M (2008) Estimation of daily suspended sediments using support vector machines. Hydrological Science Journal 53(3):656–666
Article Google Scholar
Cimen M, Kisi O (2009) Comparison of two different data driven techniques in modeling surface water level fluctuations of Lake Van, Turkey. J Hydrol 378(3–4):253–262
Article Google Scholar
Dogan E (2005) Suspended sediment load estimation in lower Sakarya river by uing artificial neural networks, fuzzy logic and neuro-fuzzy models. Elec Lett Sci Eng 1:22–32
Google Scholar
Eberhart R, Kennedy J (1995) A New Optimizer using Particle Swarm Theory”, Proceedings of the Sixth International Symposium on Micro Machine and Human Science (Nagoya, Japan). IEEE Service Center, Piscataway, NJ: 39–43
Ekici S (2007) Elektrik Güç Sistemlerinde Akıllı Sistemler Yardımıyla Arıza Tipi ve Yerinin Belirlenmesi. PhD Thesis, Fırat Üniversitesi, Fen Bilimleri Enstitüsü (in Turkish)
Elmas C (2003) Bulanık mantık denetleyiciler. Seçkin Yayıncılık, Ankara
Google Scholar
El-Shafie A, Taha MR, Noureldin A (2007) A neuro-fuzzy model for inflow forecasting of the Nile river at Aswan high dam. Water Resour Manag 21(3):533–556
Article Google Scholar
Erkek C, Agiraliogu N (2002) Su Kaynakları Mühendisliği. Dördüncü baskı, Yayın no: 1247, Beta Basım, İstanbul
Google Scholar
Ghorbani MA, Khatibi R, Aytek A, Makarynskyy O, Shiri J (2010) Sea water level forecasting using genetic programming and comparing the performance with artificial neural networks. Comput Geosci 36(5):620–627
Article Google Scholar
Giustolisi O, Laucelli D (2005) Improving generalization of artificial neural networks in rainfall–runoff modeling. Hydrological Science Journal 50(3):439–457
Article Google Scholar
Guldal V, Tongal H (2010) Comparison of recurrent neural network, adaptive neuro-fuzzy inference system and stochastic models in Eğirdir lake level forecasting. Water Resour Manag 24(1):105–128
Article Google Scholar
Hamed KH (2008) Trend detection in hydrologic data: the Mann-Kendall trend test under the scaling hypothesis. J Hydrol 349:350–363
Article Google Scholar
Jang JSR, Sun CT, Mizutani E (1997) Neuro Fuzzy and Soft Computing a Computational Approach to Learning and Machine Intelligence. Prentice Hall, USA
Kakahaji H, Banadaki HD, Kakahaji A, Kakahaji A (2013) Prediction of Urmia Lake water-level fluctuations by using analytical, linear statistic and intelligent methods. Water Resour Manag 27:4469–4492
Article Google Scholar
Keskin ME, Taylan D, Terzi O (2006) Adaptive neural-based fuzzy inference system (ANFIS) approach for modelling hydrological time series. Hydrol Sci J 51(4):588–598
Article Google Scholar
Khan MS, Coulibaly P (2006) Application of support vector machine in lake water level prediction. Hydr Eng 11:199–205
Article Google Scholar
Khatibi R, Ghorbani MA, Kashani MH, Kisi O (2011) Comparison of three artificial intelligence techniques for discharge routing. J Hydrol 403:201–212
Article Google Scholar
Kisi O (2006) Daily pan evaporation modelling using a neuro-fuzzy computing technique. J Hydrol 329(3–4):636–646
Article Google Scholar
Kisi O (2009) Daily pan evaporation modelling using multi-layer perceptrons and radial basis neural Networks. Hydrol Process 23:213–223
Article Google Scholar
Kisi O, Shiri J, Nikoofar B (2012) Forecasting daily lake levels using artificial intelligence approaches. Comput Geosci 41:169–180
Article Google Scholar
Kumar K, Yadav AK, Singh MP, Hassan H, Jain VK (2004) Forecasting daily maximum surface ozone concentrations in Brunei Darussalam an ARIMA modeling approach. J Air Waste Manag Assoc 84:809–814
Article Google Scholar
Leahy P, Kiely G, Gearóid C (2008) Structural optimisation and input selection of an artificial neural network for river level prediction. J Hydrol 355:192–201
Article Google Scholar
Lin H (2006) Support vector machines for regression and its application for prediction of machine degradation based on vibration signals. Thesis Master of Science, University of Alberta, Edmonton, Alberta
Mohandes MA (2012) Modeling global solar radiation using particle swarm optimization (PSO). Sol Energy 86:3137–3145
Article Google Scholar
Ondimu S, Murase H (2007) Reservoir level forecasting using neural networks: Lake Naivasha. Biosyst Eng 96(1):135–138
Article Google Scholar
Pal M, Arun G (2006) Prediction of the end-depth ratio and discharge in semi-circular and circular shaped channels using support vector machines. Flow Meas Instrum 17(1):49–57
Article Google Scholar
Paredes V, Vidal E (2000) A class-dependent weighted dissimilarity measure for nearest neighbor classification problems. Patt Rec Lett 21:1027–1036
Article Google Scholar
Qin N, Chen X, Fu G, Zhai J, Xue X (2010) Precipitation and temperature trends fort he Southwest China: 1960–2007. Hydrol Process 24:3733–3744
Article Google Scholar
Sedki A, Ouazar D (2010) Hybrid particle swarm and neural network approach for stream flow forecasting. Math Model Nat Phenom 5(7):132–138
Article Google Scholar
Shahid S, Harun SB, Katimon A (2012) Chamges in diurnal temperature range in Bangladesh during in the time period 1961–2008. Atmos Res 118:260–270
Article Google Scholar
Shao Q, Li M (2011) A new trend analysis for seasonal time series with consideration of data dependence. J Hydrol 396:104–112
Article Google Scholar
Shiri J, Makarynskyy O, Kisi O, Dierickx W, Fard AF (2011) Prediction of short-term operational water levels using an adaptive neuro-fuzzy inference system. J Waterway, Port, Coastal, Ocean Eng 137:344–354
Article Google Scholar
Shirmohammadi B, Vafakhah M, Moosavi V, Moghaddamnia A (2013) Application of several data-driven techniques for predicting groundwater level. Water Resour Manag 27(2):419–432
Article Google Scholar
Sivapragasam C, Liong SY (2004). Identifying optimal training data set – a new approach. In: Liong, S.Y., Phoon, K.K., Babovic, V. (Eds.), Proceedings of the Sixth International Conference on Hydroinformatics, Singapore, 21–24 June 2004. World Scientific Publishing Co., Singapore.
Solomatine DP, Dulal KN (2003) Model trees as an alternative to neural networks in rainfall–runoff modeling. Hydrological Science Journal 48(3):399–411
Article Google Scholar
Sudheer KP, Gosain AK, Rangan M, Saheb SM (2002) Modelling evaporation using an artificial neural network algorithm. Hydrol Process 16:3189–3202
Article Google Scholar
Sulaiman M, El-Shafie A, Karim O, Basri H (2011) Improved water level forecasting performance by using optimal steepness coefficients in an artificial neural network. Water Resour Manag 25:2525–2541
Article Google Scholar
Sveinsson OGB, Salas JD, Lane WL, Frevert DK (2003) Progress in stochastic analysis, modeling and simulation. SAMS-2003, Hydrology Days: 165–175
Talebizadeh M, Moridnejad A (2011) Uncertainty analysis for the forecast of lake level fluctuations using ensembles of ANN and ANFIS models. Exp Syst with App 38:4126–4135
Article Google Scholar
Tapogloua E, Trichakisa IC, Dokoua Z, Nikolosb IK, Karatzasa GP (2014) Groundwater-level forecasting under climate change scenarios using an artificial neural network trained with particle swarm optimization. Hydrol Sci J 59(6):1225–1239
Article Google Scholar
Valizadeh N, El-Shafie A (2013) Forecasting the level of reservoirs using multiple input fuzzification in ANFIS. Water Resour Manage 27:3319–3331
Article Google Scholar
Valizadeh N, El-Shafie A, Mukhlisin M, El-Shafie AH (2011) Daily water level forecasting using adaptive neuro-fuzzy interface system with different scenarios: Klang Gate, Malaysia. Int J Phys Sci 6(32):7379–7389
Article Google Scholar
Vapnik V (1995) The nature of statistical learning theory. Springer, New York
Book Google Scholar
Verleysen M, Hlavackova K (1994) An Optimized RBF Network For Approximation of Functions, Proceedings European Symposium on Artificial Neural Networks. Brussels, Belgium, pp. 175–180
Wu CL, Chau KW, Li YS (2008) River stage prediction based on a distributed support vector regression. J Hydrol 358:96–111
Article Google Scholar
Wu CL, Chau KW, Fan C (2010) Prediction of rainfall time series using modular artificial neural networks coupled with data-preprocessing techniques. J Hydrol 389:146–167
Article Google Scholar
Xi Z, Zhang Y, Zhu C (2012) Application of PSO-neural network model in prediction of groundwater level in Handan City. Adv in infor Sci and Serv Sci (AISS) 4(6):21
Google Scholar
Yalcin N, Tezel G, Karakuzu C (2012) Epilepsy Diagnosis Using Artificial Neural Network Learned by PSO. 9th International Conference on Electronics, Computer and Computation, Ankara Turkey
Yang XS (2008) Nature-Inspired Metaheuristic Algorithms, Luniver Press, United Kingdom
Yarar A, Onucyildiz M, Copty NK (2009) Modelling level change in lakes using neuro-fuzzy and artificial neural networks. J Hydrol 365:329–334
Article Google Scholar
Zhao S, Wang L (2010) Support vector regression based on particle swarm optimization for rainfall forecasting. 2010 Third International Joint Conference on Computational Science and Optimization
Zounemat-Kermani M, Teshnehlab M (2008) Using adaptive neuro-fuzzy inference system for hydrological time series prediction. Appl Soft Comput 8(2):928–936
Article Google Scholar

Download references

Author information

Authors and Affiliations

Civil Engineering Department, Selçuk University, 42075, Konya, Turkey
Meral Buyukyildiz & Volkan Yilmaz
Computer Engineering Department, Selçuk University, 42075, Konya, Turkey
Gulay Tezel

Authors

Meral Buyukyildiz
View author publications
You can also search for this author in PubMed Google Scholar
Gulay Tezel
View author publications
You can also search for this author in PubMed Google Scholar
Volkan Yilmaz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Meral Buyukyildiz.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Buyukyildiz, M., Tezel, G. & Yilmaz, V. Estimation of the Change in Lake Water Level by Artificial Intelligence Methods. Water Resour Manage 28, 4747–4763 (2014). https://doi.org/10.1007/s11269-014-0773-1

Download citation

Received: 08 April 2014
Accepted: 18 August 2014
Published: 28 August 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s11269-014-0773-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Estimation of the Change in Lake Water Level by Artificial Intelligence Methods

Abstract

Similar content being viewed by others

Utilization of PSO algorithm in estimation of water level change of Lake Beysehir

A Comparative Study of Artificial Neural Networks, Support Vector Machines and Adaptive Neuro Fuzzy Inference System for Forecasting Groundwater Levels near Lake Okeechobee, Florida

Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index

1 Introduction