1 Introduction

Dissolved oxygen (DO) concentration is an important water quality parameter and a key indicator of water pollution in rivers (Mohan and Pavan Kumar 2016). DO is used to assess the trophic state of rivers, canals and lakes (Gikas 2014; Chamoglou et al. 2014; Mellios et al. 2015). During the last decade, much effort has been devoted to modelling DO in river, lake and stream ecosystems using artificial intelligence (AI) techniques. Given the capabilities and usefulness of AI techniques, the attention devoted to their application on this subject is not surprising. An in-depth literature review suggests that the following four types of AI techniques have been broadly applied:

  1. Artificial neural network (ANN)-based approaches: three types of ANNs are widely used in modelling DO: (i) the generalized regression neural network (GRNN) (Heddam 2014a; Antanasijević et al. 2013, 2014); (ii) the radial basis function neural network (RBFNN) (Akkoyunlu et al. 2011; Ay and Kisi 2012; Emamgholizadeh et al. 2014); and (iii) the multilayer perceptron neural network (MLPNN) (Ranković et al. 2010; Areerachakul et al. 2013).

  2. Fuzzy logic and neuro-fuzzy systems: the well-known adaptive neuro-fuzzy inference system (ANFIS) is the most widely reported model for DO (Heddam 2014b; Nemati et al. 2015).

  3. Evolutionary models: gene expression programming (GEP) (Kisi et al. 2013), the dynamic evolving neural-fuzzy inference system (DENFIS) (Heddam 2014c), particle swarm optimization (PSO) (Liu et al. 2014), and the genetic algorithm (GA) (Liu et al. 2013) are the most widely used models.

  4. Wavelet decomposition models (Evrendilek and Karakaya 2014, 2015).

Although the models already developed for estimating DO have shown good precision and accuracy, and offer a good alternative in the absence of direct in situ measurements, they suffer from a number of shortcomings and limitations. First, the models are tied to the water quality variables used in their calibration, which typically requires that these input variables remain regularly available. Second, the models cannot be applied without timely and reliable water quality data. It is therefore interesting to investigate the capability of AI models to forecast DO at different lead times using only the DO time series itself, without water quality variables as model inputs. Forecasting DO in river ecosystems is a relatively difficult task, since the concentration of DO varies significantly over different time periods of the day, especially between morning and evening, across the seasons, and from year to year. Accordingly, we can define three categories of forecasting models: short-term, medium-term and long-term. In this study, short-term forecasting involves a horizon of about one to two days (24 h to 48 h) in advance; medium-term forecasting, a horizon of three to four days (72 h to 96 h); and long-term forecasting, one of two horizons, five days (120 h) or seven days (168 h). Obviously, for the long-term forecasting horizons, using a long historical data set may improve the accuracy of the developed models.

Understanding the concentration and variation of DO in freshwater ecosystems is necessary and of great importance. DO varies depending on many factors, including salinity, temperature and pressure (USGS 2008). The main sources of DO in rivers are (Cox 2003a, 2003b): (i) reaeration from the atmosphere; (ii) enhanced aeration at weirs and other structures; (iii) photosynthetic oxygen production; and (iv) the introduction of DO from other sources such as tributaries. The primary causes of DO depletion are (Cox 2003a, 2003b): (i) the oxidation of organic material in the water column; (ii) degassing of oxygen in supersaturated water; (iii) respiration by aquatic plants; and (iv) oxygen demand exerted by river bed sediments. Low levels of DO in the water are associated with hypoxia (HY) and anoxia (AX) (Nürnberg 2004). Anoxia is defined as a condition of zero (a total loss of DO) or very low DO concentration, while hypoxia is defined as a condition of low DO. Seasonal hypoxia, and eventual anoxia, can be caused by high anthropogenic nutrient loadings (Breitburg et al. 2003). Hypoxia profoundly affects ecosystem functioning, with strong implications for all ecosystem components, i.e., biodiversity and biological processes, and causes physiological stress, and even death, to associated aquatic organisms, indicating a stressed environment (Friedrich et al. 2014). Given the importance of the HY and AX processes, it would be very interesting to develop a model that can serve as an early warning system, with the ability to deliver more accurate and timely information during stress events.

Few studies have attempted to develop forecasting models for DO time series that do not require water quality variables as model inputs. Alizadeh and Kavianpour (2015) compared two ANN models, the standard MLPNN and the wavelet-neural network (WNN), for forecasting DO at hourly and daily time steps at Hilo Bay, on the east side of the Big Island of Hawaii, Pacific Ocean. DO values at different time lags were considered as input variables for the MLPNN and WNN models. They tested four forecasting models with different input combinations: DO(t-1), DO(t-2), DO(t-3) and DO(t-4). The developed models had only one output, the DO at (t + 1), where t corresponds to the DO at the present time (day or hour) and (t-1) to (t-4) correspond to the four previous values of DO. For the daily time step, they obtained a correlation coefficient (R) of 0.66 for the model with two inputs (i.e., (t-1) and (t-2)) and 0.72 for the model with four inputs (i.e., (t-1) to (t-4)). At the hourly time step, the authors reported excellent results, with an R of 0.94 in the validation phase. An et al. (2015) proposed a new forecasting model based on the nonlinear grey Bernoulli model (NGBM (1, 1)) for forecasting DO in the Guanting reservoir (inlet and outlet), located at the upper reaches of the Yongding River in northwest Beijing, China; the authors used DO at a weekly time step. Altunkaynak et al. (2005) examined the potential of a fuzzy logic-based model (FL) to forecast monthly DO in the Golden Horn in Istanbul, Turkey, using two inputs (DO(t-1) and DO(t-2)) and one output (DO(t)). The study showed that the next month's DO concentration can be forecast from antecedent measurements within acceptable relative error limits. Wang et al. (2013) proposed the bootstrapped wavelet neural network (BWNN) as a new tool for forecasting monthly DO in Harbin, China.
The proposed BWNN model was compared with the standard MLPNN and the wavelet neural network (WNN). They compared three types of models: using only DO(t-1) as input; using only DO(t-3) as input; and using both DO(t-1) and DO(t-3) as inputs. In all three models the output was DO(t). The authors reported a high Nash-Sutcliffe efficiency (NSE) of 0.85 in the validation phase. Faruk (2010) applied the MLPNN to forecasting monthly DO time series using data for the period 1996 to 2004 from the Büyük Menderes basin, Turkey, and obtained an NSE of 0.91.

As indicated above, the literature review demonstrates that DO modelling is broadly discussed and several kinds of models have been developed worldwide. However, few studies have been devoted to forecasting DO, and many more are needed. The previous forecasting studies have the following drawbacks: (i) the time step chosen is often constrained by the availability of DO data, and the majority of the models use data at monthly, weekly or, less often, daily time steps, which is not really adequate since DO varies rapidly; and (ii) in all reported models, the forecast horizon is generally the value at the next time step (t + 1), and none of the studies has investigated a long-term forecasting horizon. Therefore, in this paper we present the OP-ELM as a new and powerful tool for forecasting hourly dissolved oxygen concentration (DO) several hours in advance. The proposed model is compared to the standard MLPNN in order to demonstrate its superiority and usefulness. We test this new model with data from the Klamath River at Miller Island Boat Ramp, Oregon, USA, to illustrate how well the proposed model can forecast DO. To the author's knowledge, this is the first study in the literature to apply the OP-ELM to DO forecasting. The present study therefore investigates the use of the OP-ELM in the development of a robust model for DO forecasting.

2 Materials and Methods

2.1 Study Area and Data Set

Historical hourly DO data from 1 January 2004 to 31 December 2013 were used in this study; they are available at the United States Geological Survey (USGS) website: http://or.water.usgs.gov/cgi-bin/grapher/table_setup.pl?site_id. Two stations were chosen: Klamath River at Miller Island Boat Ramp, Oregon, USA [Top] (USGS ID: 420853121505500; Latitude 42°08′53″N; Longitude 121°50′55″W; NAD83); and Klamath River at Miller Island Boat Ramp, Oregon, USA [Bottom] (USGS ID: 420853121505501). Figure 1 shows the locations of the stations. For the two stations, the data set was divided into two sub-data sets: (i) a training set (70 %); and (ii) a validation set (30 %). Hence, the first seven years (2004 to 2010) were chosen for training and the last three years (2011-2013) for validation. Data were stored in a table, in which each row corresponded to one hour (one pattern) and contained a measured DO value. For some hours, the DO value was not available, in which case the incomplete patterns were removed. A detailed description of the original and final patterns is reported in Table 1: for the top station a total of 1047 patterns were missing, and for the bottom station a total of 3763 patterns were missing. Figures 2 and 3 show the DO time series for training and validation for the two stations. Table 2 summarizes the descriptive statistics of the data set, where Xmean, Xmax, Xmin, Sx and Cv denote the mean, the maximum, the minimum, the standard deviation, and the coefficient of variation, respectively. All the data were normalized to zero mean and unit variance using the Z-score method, calculated by the following formula:

$$ {x}_{ni,k}=\frac{{x}_{i,k}-{m}_k}{{Sd}_k} $$
(1)
Fig. 1

Map showing the study area, Klamath River at Miller Island Boat Ramp [Top and Bottom] stations, Oregon, USA. (Adapted from Sullivan et al. 2012, 2013a, 2013b)

Table 1 Dataset presentation: years of available data, starting and ending dates of the study period, total available patterns, incomplete patterns removed, final patterns used, and the division of the data into training and validation subsets
Fig. 2

Time series plot of dissolved oxygen concentration (DO) for the USGS 420853121505500 [Top] station, covering the period of the study. The training phase runs from 01/01/2004 at 01:00 until 12/31/2010 at 24:00, and the validation phase from 01/01/2011 at 01:00 until 12/31/2013 at 24:00

Fig. 3

Time series plot of dissolved oxygen concentration (DO) for the USGS 420853121505501 [Bottom] station, covering the period of the study. The training phase runs from 01/01/2004 at 01:00 until 12/31/2010 at 24:00, and the validation phase from 01/01/2011 at 01:00 until 12/31/2013 at 24:00

Table 2 Statistical parameters of data set for forecasting models

where x_ni,k is the normalized value of the variable k (input or output) for each sample i, x_i,k is the original value of the variable k, and m_k and Sd_k are the mean value and standard deviation of the variable k. Normalization is an important step, which significantly increases the performance of the models (Kingston et al. 2005; Heddam et al. 2011, 2012, 2016; Heddam 2016a, 2016b).
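As a minimal illustration, the Z-score normalization of Eq. (1) can be sketched as follows (hypothetical helper functions, not the authors' code; in practice m_k and Sd_k should be computed on the training set and reused unchanged for the validation set):

```python
import numpy as np

def zscore_fit(x_train):
    """Compute m_k and Sd_k of one variable on the training set (Eq. 1)."""
    return x_train.mean(), x_train.std()

def zscore_apply(x, m, sd):
    """Normalize a variable to zero mean and unit variance."""
    return (x - m) / sd

def zscore_invert(x_norm, m, sd):
    """Map normalized model outputs back to the original DO units (mg/L)."""
    return x_norm * sd + m
```

Applying the training-set statistics to the validation data, rather than re-estimating them, avoids leaking validation information into the model.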

2.2 Multilayer Perceptron Neural Network (MLPNN)

Artificial neural networks (ANNs) are information processing systems belonging to the category of nonlinear models (Haykin 1999). ANNs are inspired by the functioning of the human brain. In ANNs, the neurons are tightly interconnected and organized into different layers. There are three types of layers, i.e., an input layer, one or more hidden layers, and an output layer, connected by weights (Haykin 1999). The multilayer perceptron neural network (MLPNN) (Rumelhart et al. 1986) is the most widely used type of ANN reported in the literature, and is considered a powerful tool for modelling nonlinear processes. The numbers of neurons in the input and output layers are determined by the problem to be solved: each neuron in the input layer corresponds to a single variable, and the output neurons represent the solution of the problem. Figure 4 presents the structure of the MLPNN models used in the present study. It consists of three layers: the input layer with six variables, denoted x1 to x6 and corresponding to the DO at (t-5), (t-4), (t-3), (t-2), (t-1) and (t); the hidden layer, with a number of neurons defined by trial and error; and finally, the output layer with only one neuron, the DO at (t + 24) to (t + 168), according to the model developed. The MLPNN is capable of approximating any function with a finite number of discontinuities and is considered a universal approximator (Hornik et al. 1989; Hornik 1991). Let us denote by k the number of input variables and by m the number of neurons in the hidden layer; the mathematical structure of the MLPNN from the input to the output can be formulated as follows:

$$ {A}_j={\beta}_j+\sum_{i=1}^{k}{w}_{ij}\,{x}_i $$
(2)
Fig. 4

Multilayer Perceptron Neural Network (MLPNN) structure used in forecasting dissolved oxygen concentration. The structure in this figure is valid for the two stations

where A_j is the weighted sum of the j th hidden neuron, k is the total number of inputs, w_ij denotes the weight of the connection between the i th input and the j th hidden neuron, and β_j is the bias term of the j th hidden neuron. The output of the j th hidden neuron is given by

$$ {Y}_j=f\left({A}_j\right) $$
(3)

The activation function f adopted in the present study was the sigmoid, given by Eq. (4):

$$ f(A)=\frac{1}{1+{e}^{-A}} $$
(4)

The neural network output is then given by:

$$ {O}_k={\beta}_0+\sum_{j=1}^{m}{w}_{jk}\,{Y}_j $$
(5)

where w_jk denotes the weight of the connection between the j th hidden neuron and the k th output neuron, m is the total number of hidden neurons, and β_0 is the bias term. A linear activation function is most commonly applied to the output layer.
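Eqs. (2)-(5) amount to a single forward pass through the network; a minimal NumPy sketch (illustrative only, with hypothetical weight arrays rather than the trained weights of the study) is:

```python
import numpy as np

def sigmoid(a):
    """Eq. (4): sigmoid activation of the hidden layer."""
    return 1.0 / (1.0 + np.exp(-a))

def mlpnn_forward(x, W_in, b_in, w_out, b_out):
    """Forward pass of a k-m-1 MLPNN (Eqs. 2-5).
    x: (k,) inputs DO(t-5)..DO(t); W_in: (k, m) input-to-hidden weights w_ij;
    b_in: (m,) hidden biases beta_j; w_out: (m,) hidden-to-output weights w_jk;
    b_out: scalar output bias beta_0."""
    A = b_in + x @ W_in       # Eq. (2): weighted sums A_j
    Y = sigmoid(A)            # Eq. (3): hidden-neuron outputs Y_j
    return b_out + Y @ w_out  # Eq. (5): linear output neuron
```

In the study's configuration k = 6 and the number of hidden neurons m is set by trial and error; training (finding W_in, b_in, w_out, b_out) is done with backpropagation, which is not shown here.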

During the last few years, AI techniques have been widely used by scientists to develop different kinds of models, and ANNs are the most applied form of AI. Typical applications include, among many others: predicting the dispersion coefficient (D) in a river ecosystem (Antonopoulos et al. 2015); modelling the permeability losses in permeable reactive barriers (Santisukkasaem et al. 2015); estimating the reference evapotranspiration (ET0) in India (Adamala et al. 2015); calculating the dynamic coefficient in porous media (Das et al. 2015); predicting Indian monsoon rainfall (Azad et al. 2015); modelling arsenic (III) removal (Mandal et al. 2015); predicting effluent biochemical oxygen demand (BOD) in a wastewater treatment plant (Heddam et al. 2016); modelling Secchi disk depth (SD) in rivers (Heddam 2016a); and predicting phycocyanin (PC) pigment concentration in rivers (Heddam 2016b).

Unsurprisingly, given their high capability for developing environmental models, ANNs have rapidly gained much popularity. Even so, ANN models are not without criticism. Their performance depends mainly on (Maier and Dandy 2000; Maier et al. 2010): (i) the selection of appropriate inputs; (ii) data division (the best choice of the splitting ratio between training and validation); (iii) the model structure (the number of hidden neurons, the number of hidden layers and the type of transfer function); (iv) model calibration; and (v) model evaluation using different kinds of statistical metrics. Important details about the appropriate application of ANN models can be found in Bowden et al. (2005a, 2005b), Maier and Dandy (2000), and Maier et al. (2010).

2.3 Optimally Pruned Extreme Learning Machine (OP-ELM)

The extreme learning machine (ELM) was proposed by Huang et al. (2006a, 2006b) for training single hidden layer feedforward neural networks (SLFN) (Huang et al. 2015). The ELM is a universal function approximator (Huang and Chen 2007, 2008). In an SLFN there are two weight matrices, from the input layer to the hidden layer (input layer weights) and from the hidden layer to the output layer (output layer weights). These two matrices must be optimized during the training process, generally using the backpropagation algorithm. According to Huang et al. (2006a, 2006b), when using the ELM algorithm the input layer weights do not need to be tuned iteratively and can be generated randomly, while the output layer weights are determined analytically using the least-squares method (Huang et al. 2015). The ELM algorithm is presented briefly below.

Let there be N arbitrary training samples (x_i, y_i), where x_i = [x_i1, x_i2, …, x_iD]^T ∈ R^D and y_i = [y_i1, y_i2, …, y_iD]^T ∈ R^D. A single hidden layer feedforward neural network with L hidden nodes and activation function f(x) is mathematically expressed as (Huang et al. 2006a, 2006b):

$$ \sum_{i=1}^{L}{\beta}_i\,{f}_i\left({x}_j\right)=\sum_{i=1}^{L}{\beta}_i\,{f}_i\left({w}_i\cdot {x}_j+{b}_i\right)={o}_j $$
(6)

where j = 1, 2, …, N. Here w_i = [w_i1, w_i2, …, w_iD]^T is the weight vector connecting the i th hidden node and the input nodes, β_i = [β_i1, β_i2, …, β_ik]^T is the weight vector connecting the i th hidden node and the output nodes, and b_i is the threshold of the i th hidden node (Zeng et al. 2013). The standard SLFN with L hidden nodes and activation function f(x) can be written compactly as:

$$ H\beta =T $$
(7)

where:

$$ H=\left[\begin{array}{ccc} {f}_1\left({w}_1\cdot {x}_1+{b}_1\right) & \cdots & {f}_L\left({w}_L\cdot {x}_1+{b}_L\right)\\ \vdots & \ddots & \vdots \\ {f}_1\left({w}_1\cdot {x}_N+{b}_1\right) & \cdots & {f}_L\left({w}_L\cdot {x}_N+{b}_L\right)\end{array}\right] $$
(8)
$$ \beta =\left[\begin{array}{c}{\beta}_1^T\\ \vdots \\ {\beta}_L^T\end{array}\right]\quad \mathrm{and}\quad T=\left[\begin{array}{c}{t}_1^T\\ \vdots \\ {t}_N^T\end{array}\right] $$
(9)

Different from the conventional gradient-based training of the SLFN, the ELM simply solves for the output weights as:

$$ \beta ={H}^{+}T $$
(10)

where H+ is the Moore-Penrose generalized inverse of matrix H (Zeng et al. 2013). More details about ELM can be found in: Huang et al. (2006a, 2006b); Huang et al. (2011, 2015); and Huang (2015).
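The whole ELM training procedure (Eqs. 6-10) thus reduces to one random initialization and one least-squares solve; a minimal sketch (an illustrative implementation, not the toolbox used in the study) is:

```python
import numpy as np

def elm_train(X, T, L=100, seed=0):
    """Train a basic ELM: random input weights w_i and biases b_i,
    analytic output weights beta = H+ T (Eq. 10).
    X: (N, D) inputs; T: (N,) targets; L: number of hidden nodes."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], L))  # random input-layer weights
    b = rng.standard_normal(L)                # random hidden thresholds
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))    # hidden-layer output matrix H (Eq. 8)
    beta = np.linalg.pinv(H) @ T              # Moore-Penrose solution (Eq. 10)
    return W, b, beta

def elm_predict(X, W, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
    return H @ beta
```

Because no iterative tuning of the input weights is needed, training is typically orders of magnitude faster than backpropagation for the same number of hidden nodes.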

Since it was proposed, the ELM has gained much popularity and several modifications of the original algorithm have been proposed. Among them are: the optimally pruned extreme learning machine (OP-ELM) (Miche et al. 2008, 2010); the online sequential extreme learning machine (OS-ELM) (Liang et al. 2006); the bidirectional extreme learning machine (B-ELM) (Yang et al. 2012); the evolutionary extreme learning machine (SaDE-ELM) (Cao et al. 2012); and the fully complex extreme learning machine (C-ELM) (Li et al. 2005). In the present study we focus on the well-known OP-ELM.

The optimally pruned extreme learning machine (OP-ELM) was developed by Miche et al. (2008, 2010; see also Sorjamaa et al. 2008) as a modified version of the original ELM. According to Pouzols and Lendasse (2010a, 2010b), the OP-ELM supports three types of kernel functions, i.e., Gaussian, sigmoid and linear kernels, and an OP-ELM model is created in three stages. Considering that the original ELM suffers from an important drawback, namely sensitivity to irrelevant or correlated variables in the training data set, Miche et al. (2008, 2010) introduced the OP-ELM as a solution for pruning away irrelevant variables, thereby further improving the reliability and efficiency of the ELM approach.

Given a training set (x_i, y_i), x_i ∈ R^d1 and y_i ∈ R^d2, the OP-ELM involves three important steps (Grigorievskiy et al. 2014): (i) build a regular ELM model with an initially large number of neurons; (ii) rank the neurons using multiresponse sparse regression (MRSR) (Similä and Tikka 2005), or least angle regression (LARS) (Efron et al. 2004) if the output is one-dimensional; and (iii) use leave-one-out (LOO) validation to decide how many neurons to prune (Fig. 5). In the first step, an SLFN is constructed using the original ELM algorithm with a large number of neurons (by default one hundred). It is important to note that while the ELM uses only sigmoid kernels, the OP-ELM offers three kernel types: sigmoid, Gaussian and linear (Pouzols and Lendasse 2010a, 2010b). In the second step, the hidden neurons are ranked according to their accuracy using the MRSR algorithm, which we present briefly following Similä and Tikka (2005). Suppose that the targets are denoted by an n × p matrix T = [t_1 … t_p] and the regressors by an n × m matrix X = [x_1 … x_m]. The MRSR algorithm sequentially adds active regressors to the model:

$$ {Y}^k=X{W}^k $$
(11)
Fig. 5

The important steps in developing an OP-ELM (From Sorjamaa et al. 2008)

such that the n × p matrix Y^k = [y_1^k … y_p^k] models the targets T appropriately. The m × p weight matrix W^k includes k nonzero rows at the beginning of the kth step. Each step introduces a new nonzero row and, thus, a new regressor into the model. Note that MRSR is a variable ranking rather than a selection algorithm (Pouzols and Lendasse 2010a). Finally, the third and last step of the OP-ELM is the application of the leave-one-out (LOO) validation method, by which the actual best number of neurons for the model is decided. According to Grigorievskiy et al. (2014), using the PRESS (PREdiction Sum of Squares) statistic provides an exact LOO error.
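Because the output layer is linear in its weights, the LOO error in step (iii) has a closed form; a small sketch of the PRESS computation (illustrative, under the assumption of an ordinary least-squares output layer) is:

```python
import numpy as np

def press_loo_mse(H, t):
    """Exact leave-one-out MSE via the PRESS statistic for the linear
    model t ~ H @ beta, with beta = pinv(H) @ t.
    H: (N, L) hidden-layer output matrix; t: (N,) targets."""
    H_pinv = np.linalg.pinv(H)
    residuals = t - H @ (H_pinv @ t)             # ordinary least-squares residuals
    hat_diag = np.einsum('ij,ji->i', H, H_pinv)  # diagonal of the hat matrix H @ H+
    loo_residuals = residuals / (1.0 - hat_diag) # PRESS residuals
    return np.mean(loo_residuals ** 2)
```

The OP-ELM evaluates this quantity for increasing numbers of ranked neurons and keeps the count that minimizes it, instead of refitting the model N times.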

The OP-ELM has gained much popularity in engineering applications during the past few years. Moreno et al. (2014) compared the ELM and the OP-ELM for the classification of remote sensing hyperspectral data; Grigorievskiy et al. (2014) compared the OP-ELM with a linear model and least-squares support vector machines (LS-SVM) for long-term time series prediction; Akusok et al. (2015) applied the OP-ELM to identify the samples in a dataset most likely to be mislabelled; and Sovilj et al. (2010) compared the OP-ELM with the optimally pruned k-nearest neighbors (OP-KNN) for long-term time series prediction.

2.4 Model Performance Indices

In the present study, we used seven performance indices to evaluate and compare the developed models: the Nash-Sutcliffe efficiency (NSE) (Eq. 12) (Nash and Sutcliffe 1970; Dawson and Wilby 2001), the Willmott index of agreement (d) (Eq. 13) (Willmott 1982; Willmott et al. 1985), the coefficient of correlation (R) (Eq. 14) (Legates and McCabe 1999), the root mean squared error (RMSE) (Eq. 15) (Chai and Draxler 2014; Dawson et al. 2002), the mean absolute error (MAE) (Eq. 16) (Dawson et al. 2002; Gebremariam et al. 2014), the bias error (Bias) (Eq. 17) (Dawson et al. 2007), and the mean absolute percentage error (MAPE) (Eq. 18) (Dawson et al. 2007):

$$ NSE=1-\frac{\sum_{i=1}^{N}{\left({O}_i-{P}_i\right)}^2}{\sum_{i=1}^{N}{\left({O}_i-{O}_m\right)}^2} $$
(12)
$$ d=1-\frac{\sum_{i=1}^{N}{\left({P}_i-{O}_i\right)}^2}{\sum_{i=1}^{N}{\left(\left|{P}_i-{O}_m\right|+\left|{O}_i-{O}_m\right|\right)}^2} $$
(13)
$$ R=\frac{\frac{1}{N}\sum_{i=1}^{N}\left({O}_i-{O}_m\right)\left({P}_i-{P}_m\right)}{\sqrt{\frac{1}{N}\sum_{i=1}^{N}{\left({O}_i-{O}_m\right)}^2}\sqrt{\frac{1}{N}\sum_{i=1}^{N}{\left({P}_i-{P}_m\right)}^2}} $$
(14)
$$ RMSE=\sqrt{\frac{1}{N}\sum_{i=1}^{N}{\left({O}_i-{P}_i\right)}^2} $$
(15)
$$ MAE=\frac{1}{N}\sum_{i=1}^{N}\left|{O}_i-{P}_i\right| $$
(16)
$$ Bias=\frac{1}{N}\sum_{i=1}^{N}\left({O}_i-{P}_i\right) $$
(17)
$$ MAPE=\frac{1}{N}\sum_{i=1}^{N}\left|\frac{{O}_i-{P}_i}{{O}_i}\right| $$
(18)

where N is the number of data points, O_i is the measured value, P_i is the corresponding model prediction, and O_m and P_m are the average values of O_i and P_i. The Willmott index of agreement (d) is an indicator of model performance; it takes values from 0 to 1, where 1 indicates perfect agreement (Willmott 1982). The bias error (Bias), also reported as the mean error (ME), is zero for a perfect match; a positive value expresses a tendency of the model to underestimate, and a negative value a tendency to overestimate. According to Dawson et al. (2007), a low Bias does not necessarily indicate a good model in terms of accurate forecasts, since positive and negative errors tend to cancel each other out; for this reason, the MAE is often preferred to the Bias. The Nash-Sutcliffe efficiency (NSE) is a measure of statistical association indicating the percentage of the observed variance that is explained by the predicted data (Nash and Sutcliffe 1970). NSE ranges between -∞ and 1, with NSE = 1 being the optimal value (Moriasi et al. 2007). The R statistic describes the degree of collinearity between observed and predicted values; a perfect model has R equal to 1, and values greater than 0.70 are typically considered acceptable (Moriasi et al. 2007). Both RMSE and MAE describe the difference between model simulations and observations in the units of the variable, with values close to zero indicating a perfect fit. Some advantages of MAE over RMSE in the evaluation and comparison of model performance can be found in Willmott and Matsuura (2005). The MAPE, also called MARE (Dawson et al. 2007) or average absolute percentage error (AAPE) (Maier and Dandy 1996; Bowden et al. 2002), is one of the most widely used measures of forecast accuracy (Kim and Kim 2016; Deo et al. 2016). The MAPE has no upper bound, and for a perfect match the result is zero (Dawson et al. 2007).
Although some researchers recommend the MAPE as a measure of forecast accuracy, it has important disadvantages. The most important is that it produces infinite or undefined values for zero or close-to-zero measured (observed) values (Kim and Kim 2016), which occur in many environmental case studies. According to Dawson et al. (2007), the MAPE is subject to "potential fouling by small numbers in the observed record". The MAPE is also susceptible to outliers and tends to overstate forecast error (Tayman and Swanson 1999; Rayer 2007). If the observed values are very small, the MAPE yields extremely large percentage errors (outliers), while zero actual values result in an infinite MAPE (Kim and Kim 2016). Hence, when the observed values are close to zero, the MAPE can explode to a huge number and, if averaged with the other values, give a distorted image of the magnitude of the error (Chase 2013).

In addition to the seven performance indices above, we also calculated the RPD (Eq. 19), the ratio of the standard deviation to the RMSE; the scatter index (SI) (Eq. 20), also reported as the normalized objective function (NOF) (Gikas 2014); and the ratio of the mean absolute error to the standard deviation (RMAE) (Eq. 21). The RPD is an important index of model accuracy: in general, RPD values less than 1 are unacceptable, while values greater than 3 are considered excellent (Chang et al. 2001; Ackerson et al. 2015). An RMAE less than 0.5 may be considered low, acceptable and appropriate for model evaluation (Singh et al. 2004; Moriasi et al. 2007). For the scatter index (SI), a perfect model has SI equal to 0.0 (Gikas 2014), and for values between 0.0 and 1.0 the model predictions are acceptable (Boskidis et al. 2010, 2011; Gikas et al. 2006; Pisinaras et al. 2010).

$$ RPD=\frac{S_X}{RMSE} $$
(19)
$$ SI=\frac{RMSE}{{\mathrm{O}}_m} $$
(20)
$$ RMAE=\frac{MAE}{S_X} $$
(21)

Finally, we report a comparison between measured and calculated mean and standard deviation of DO, in the training and validation phases.
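For reference, Eqs. (12)-(21) can be computed in a few lines; the sketch below (a hypothetical `performance_indices` helper, not the authors' code) follows the definitions above, with O the observed and P the predicted DO values:

```python
import numpy as np

def performance_indices(O, P):
    """Performance indices of Eqs. (12)-(21).
    O: observed values O_i; P: corresponding model predictions P_i."""
    O, P = np.asarray(O, float), np.asarray(P, float)
    Om, Pm = O.mean(), P.mean()
    rmse = np.sqrt(np.mean((O - P) ** 2))
    mae = np.mean(np.abs(O - P))
    return {
        'NSE':  1 - np.sum((O - P) ** 2) / np.sum((O - Om) ** 2),   # Eq. (12)
        'd':    1 - np.sum((P - O) ** 2)
                  / np.sum((np.abs(P - Om) + np.abs(O - Om)) ** 2), # Eq. (13)
        'R':    np.sum((O - Om) * (P - Pm))
                  / np.sqrt(np.sum((O - Om) ** 2)
                            * np.sum((P - Pm) ** 2)),               # Eq. (14)
        'RMSE': rmse,                                               # Eq. (15)
        'MAE':  mae,                                                # Eq. (16)
        'Bias': np.mean(O - P),                                     # Eq. (17)
        'MAPE': np.mean(np.abs((O - P) / O)),                       # Eq. (18)
        'RPD':  O.std() / rmse,   # Eq. (19), with Sx of the observations
        'SI':   rmse / Om,        # Eq. (20)
        'RMAE': mae / O.std(),    # Eq. (21)
    }
```

Note that MAPE, as written, is undefined when any observed value is zero, which is exactly the limitation discussed above.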

3 Results and Discussion

In this study, an attempt has been made to develop a robust model for forecasting DO several hours in advance in the Klamath River, USA, using the OP-ELM model. To draw robust conclusions, the OP-ELM results are compared with those obtained using the standard MLPNN. We developed six forecasting models, called FM1 to FM6, using the same input data but with different outputs; the structures are shown in Table 3. The six models correspond to three different horizons: short-term (FM1 and FM2), medium-term (FM3 and FM4) and long-term (FM5 and FM6). Forecasting the DO time series means estimating future values at different hourly intervals from a set of previously observed values; notably, we use only the DO time series itself, without any other input variables. For the OP-ELM, we used the implementation available from www.cis.hut.fi/projects/tsp/index.php, applied with all possible kernels, i.e., linear, sigmoid, and Gaussian, using a maximum of 100 neurons (Fig. 6). For all six models, the relationship between the inputs and the output is defined by Eq. (22):

$$ DO\left(t+n\right)=f\left[ DO\left(t-5\right), DO\left(t-4\right), DO\left(t-3\right), DO\left(t-2\right), DO\left(t-1\right), DO(t)\right] $$
(22)
Table 3 Combinations of input variables considered in developing forecasting models
Fig. 6
figure 6

Block diagram of the OP-ELM Toolbox (From Miche et al. 2008)

For example, for model FM1 the output is DO at (t + 24), and the forecasting process is as follows: given the N observations of the DO time series at t = 1, 2, 3, …, N, and a model with n input nodes (n = 6 in our study), the training data comprise N − n − 24 + 1 training patterns. The first pattern uses DO at t = 1, 2, 3, …, n as inputs and DO(n + 24) as the output; the second pattern uses DO at t = 2, 3, 4, …, n + 1 as inputs and DO(n + 25) as the output; and the last pattern uses DO at t = N − n − 23, …, N − 24 as inputs and DO(N) as the output. Similarly, for the FM6 model, whose output is DO at (t + 168), the training data comprise N − n − 168 + 1 training patterns, the first of which uses DO at t = 1, 2, 3, 4, 5, 6 as inputs and DO(t = 174) as the output. The same structure applies to all six developed models; hence, all six models are essentially approximators of the general relationship in Eq. (22). In the following sections, the forecasting capabilities of the two proposed models (OP-ELM and MLPNN) are presented and compared.
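The sliding-window pattern construction described above can be sketched as follows (a minimal illustration assuming NumPy; the function name is ours, and the lag of 6 and hourly leads follow the paper):

```python
import numpy as np

def make_patterns(do_series, n_lags=6, lead=24):
    """Build (inputs, targets) pairs for a lead-hours-ahead forecasting model.

    Each input row is [DO(t-5), ..., DO(t)] and the target is DO(t + lead),
    mirroring Eq. (22), with lead = 24 h for FM1 up to 168 h for FM6.
    """
    do_series = np.asarray(do_series, dtype=float)
    n = len(do_series)
    n_patterns = n - n_lags - lead + 1  # number of complete training patterns
    X = np.empty((n_patterns, n_lags))
    y = np.empty(n_patterns)
    for i in range(n_patterns):
        X[i] = do_series[i : i + n_lags]          # six consecutive hourly DO values
        y[i] = do_series[i + n_lags + lead - 1]   # value 'lead' hours after the last input
    return X, y
```

For instance, with an hourly series of N observations and lead = 24 (model FM1), this yields N − 6 − 24 + 1 patterns, the last of which targets DO(N).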

3.1 Forecasting DO Concentration at the USGS 420853121505500 [Top] Station

According to the seven indices for assessing the performance of the models, the optimal model should have the lowest RMSE, MAE, MAPE and Bias, and values of NSE, R and d close to 1. Table 4 displays the RMSE, MAE, MAPE, Bias, NSE, R and d performance statistics for both the training and validation sets. Both models (OP-ELM and MLPNN) displayed a high level of accuracy in training and validation. For the OP-ELM model, in the training phase, the RMSE and MAE ranged from 1.081 to 1.785 and from 0.648 to 1.268, respectively, with a Bias of 0. The NSE, R and d ranged from 0.779 to 0.919, 0.883 to 0.959, and 0.935 to 0.979, respectively. Likewise, for MLPNN, the RMSE, MAE and Bias ranged from 1.054 to 1.741, 0.635 to 1.241, and −0.0001 to 0.0033, respectively. The NSE, R and d ranged from 0.790 to 0.923, 0.889 to 0.961, and 0.938 to 0.980, respectively. From Table 4, it can be observed that model FM1, whose output is DO at (t + 24), performed better than the other models in training, while the worst results were obtained for the FM6 model, whose output is DO at (t + 168). In the validation phase, both MLPNN and OP-ELM models performed quite well in forecasting hourly DO. According to Table 4, for the OP-ELM model, the RMSE, MAE and Bias ranged from 1.225 to 1.930, 0.665 to 1.258, and 0.0430 to 0.0920, respectively. The NSE, R and d ranged from 0.621 to 0.848, 0.794 to 0.922, and 0.887 to 0.959, respectively. Likewise, for MLPNN, the RMSE, MAE and Bias ranged from 1.270 to 1.933, 0.697 to 1.264, and 0.0479 to 0.1055, respectively. The NSE, R and d ranged from 0.619 to 0.836, 0.791 to 0.919, and 0.883 to 0.958, respectively. From Table 4, we conclude that the OP-ELM predictions were usually slightly more accurate than those of the MLPNN model, which indicates that the OP-ELM model is more suitable for forecasting DO.

Table 4 Performance indices for the OP-ELM and MLPNN forecasting models in the training and validation phases several hours in advance for USGS 420853121505500 [Top] station

For reasons explained earlier in the section on model performance indices, an analysis of the MAPE is required. In the training phase, as shown in Table 4, the MAPE values ranged from 0.337 % to 0.784 % for OP-ELM and from 0.315 % to 0.750 % for MLPNN; for all models, the MAPE of OP-ELM was slightly larger than that of MLPNN. In the validation phase, as shown in Table 4, the MAPE values ranged from 0.176 % to 0.333 % for OP-ELM and from 0.201 % to 0.342 % for MLPNN; for all models, the MAPE of MLPNN was slightly larger than that of OP-ELM. Overall, OP-ELM performed better than MLPNN in the validation phase, and among the six forecasting models (FM1 to FM6), the OP-ELM models showed the best performances in validation. Examination of the MAPE values is another way to evaluate the developed models, and an important question must be answered: given that the model performances are good according to the other six performance criteria (i.e., RMSE, MAE, Bias, R, NSE and d), why can the MAPE values be very high, especially in the training phase? First, the MAPE makes model performance easy to judge: the MAPE is only 5 % if the measured value is 0.10 mg/L and the predicted value is 0.105 mg/L, but 84 % if the measured value is 0.10 mg/L and the predicted value is 0.184 mg/L. Second, as mentioned, a disadvantage of the MAPE is that it can assume a very high value (as in the present study) when the actual measured value is very small (Kim and Kim 2016; Dawson et al. 2007; Tayman and Swanson 1999; Rayer 2007). For example, for a measured DO of 0.10 mg/L, which is frequently observed in the present study, and a calculated DO value of 8.567 mg/L, the MAPE is 8467 %, which is very high and unacceptable.
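The sensitivity of the MAPE to small measured values can be worked numerically (a minimal sketch; the function name is ours):

```python
import numpy as np

def mape(obs, pred):
    """Mean absolute percentage error, in percent."""
    obs = np.asarray(obs, dtype=float)
    pred = np.asarray(pred, dtype=float)
    return 100.0 * np.mean(np.abs((obs - pred) / obs))

# A small measured value inflates the MAPE even for a modest absolute error:
print(mape([0.10], [0.105]))  # 5 %: an absolute error of only 0.005 mg/L
print(mape([0.10], [0.184]))  # 84 %: absolute error still below 0.1 mg/L
print(mape([0.10], [8.567]))  # 8467 %: a large miss near a near-zero observation
```

Because the denominator is the measured value, the same absolute error yields a far larger MAPE at low DO concentrations, which explains the inflated MAPE figures despite good RMSE, MAE and NSE scores.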

In order to demonstrate the capabilities and usefulness of the proposed approaches, we present in Table 5 a comparison between measured and calculated mean and standard deviation, the RPD index, the SI, and the RMAE. Values of the RMSE and MAE less than half the standard deviation (SD) of the observations may be considered low (Singh et al. 2004; Moriasi et al. 2007). As stated above, RPD values less than one are unacceptable, while values greater than 3 are considered excellent. According to Table 5, for both training and validation, there were small differences between the model estimates and the measured DO values in terms of the mean and standard deviation. All RMAE values were less than 0.5 and all RPD values were higher than 1.60. It can be seen from Table 5 that, in the validation phase, for both the MLPNN and OP-ELM models, all differences between the mean of the measurements and the corresponding calculated DO values were less than 1.0 %. However, it is worth noting that there were only 5 cases in which the difference between the measured and the corresponding calculated standard deviations was higher than 10 %, with a maximum of 13.72 % and a minimum of 0.6 %. This finding implies that the proposed models achieve a high level of accuracy and robustness.

Table 5 Comparison between Mean and Standard deviation of the measured (Mes) and forecasted (Cal) DO values, ratio of standard deviation (Sx) to RMSE (RPD), scatter index (SI) and ratio of the mean absolute error to standard deviation (RMAE), for USGS 420853121505500 [Top] station

For a final verification of the accuracy of the developed forecasting models, we also present in Table 5 the values of the SI index in the training and validation phases, for both the MLPNN and OP-ELM models. In the training phase, the SI of the six developed models (FM1 to FM6) ranged between 0.144 and 0.238 for the OP-ELM models, and between 0.140 and 0.232 for the MLPNN models; for all developed models, the SI was less than 0.240, which indicates excellent modeling results. Similarly, in the validation phase, the SI of the six models ranged between 0.144 and 0.206 for the OP-ELM models, and between 0.187 and 0.227 for the MLPNN models; for all developed models, the SI was less than 0.230, again indicating excellent modeling results. According to the SI as well, the OP-ELM performed better. In conclusion, based on the results obtained in the validation phase, the OP-ELM models must be ranked as best. Figures 7 and 8 show scatter plots of the calculated versus observed DO for the OP-ELM and MLPNN models, respectively, in the validation phase. Figures 9 and 10 show the comparison between the calculated and measured DO for the OP-ELM and MLPNN models, respectively, in the validation phase.

Fig. 7
figure 7

Scatterplots of calculated versus measured values of dissolved oxygen (DO) in the validation phase, using OP-ELM model for the USGS 420853121505500 [Top] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 8
figure 8

Scatterplots of calculated versus measured values of dissolved oxygen (DO) in the validation phase, using MLPNN model for the USGS 420853121505500 [Top] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 9
figure 9

Comparison between the calculated and measured data of dissolved oxygen (DO) in the validation phase, using OP-ELM model for the USGS 420853121505500 [Top] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 10
figure 10

Comparison between the calculated and measured data of dissolved oxygen (DO) in the validation phase, using MLPNN model for the USGS 420853121505500 [Top] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

3.2 Forecasting DO Concentration at the USGS 420853121505501 [Bottom] Station

The MLPNN and OP-ELM models, with the same inputs and different outputs, were compared based on their performance on the training and validation sets. The results are summarized in Tables 6 and 7. All models generally gave low values of the RMSE, MAE and Bias, and high values of R, d and NSE; the performance of both the MLPNN and OP-ELM models in DO forecasting was very satisfactory. The first important conclusion is that both models (MLPNN and OP-ELM) showed higher prediction accuracy here than at the Top station, delivering a higher degree of reliability overall. The values of all seven performance indices change with the forecasting step ahead. The two models appear to perform at a similar level of accuracy, but the OP-ELM is more accurate in the validation phase. In the training phase, for the OP-ELM model, the RMSE and MAE ranged from 0.662 to 1.534 and from 0.425 to 1.084, respectively, with a Bias of 0. The NSE, R, and d ranged from 0.860 to 0.974, 0.927 to 0.987, and 0.961 to 0.993, respectively. Likewise, for MLPNN, the RMSE, MAE, and Bias ranged from 0.656 to 1.502, 0.421 to 1.065, and −0.0042 to 0.0198, respectively, and the NSE, R, and d ranged from 0.865 to 0.974, 0.930 to 0.987, and 0.963 to 0.993, respectively. In the validation phase, both MLPNN and OP-ELM models performed quite well in forecasting hourly DO. According to Table 6, for the OP-ELM model, the RMSE, MAE, and Bias ranged from 0.708 to 1.454, 0.425 to 0.995, and 0.00 to −0.0736, respectively, and the NSE, R and d ranged from 0.829 to 0.960, 0.911 to 0.980, and 0.952 to 0.990, respectively. Likewise, for MLPNN, the RMSE, MAE, and Bias ranged from 0.730 to 1.489, 0.455 to 1.057, and −0.0582 to −0.1224, respectively, and the NSE, R and d ranged from 0.821 to 0.957, 0.906 to 0.979, and 0.948 to 0.989, respectively.
According to Table 6, for all six models, the performance in the training phase was better than in the validation phase. The FM1 model, with DO at (+24 h) as output, must be considered the best model developed, while the FM6 model (+168 h) performed worse than all the other models in terms of the seven performance indices. For the OP-ELM models, in the training phase, the lowest values of the RMSE, MAE, and Bias were 0.662 (mg/L), 0.425 (mg/L), and 0.00 (mg/L) (for OP-ELM FM1), and the highest values of the NSE, R and d were 0.974, 0.987 and 0.993 (for OP-ELM FM1). Table 6 indicates that OP-ELM (FM1) had the lowest values of the RMSE and MAE (0.708 mg/L, 0.425 mg/L), and the highest NSE, R and d (0.960, 0.980, and 0.990) in the validation phase. According to Table 6, for the MLPNN models, the lowest values of the RMSE, MAE, and Bias were 0.656 (mg/L), 0.421 (mg/L), and −0.0042 (mg/L) (for MLPNN FM1), and the highest values of the NSE, R, and d were 0.974, 0.987 and 0.993 (for MLPNN FM1). Table 6 indicates that MLPNN (FM1) had the lowest values of the RMSE and MAE (0.730 mg/L, 0.455 mg/L), and the highest NSE, R and d (0.957, 0.979 and 0.989) in the validation phase. When all six models developed in this study are evaluated together on the training and validation results, the outcome is clear: in the validation phase, the OP-ELM models outperformed the MLPNN models in terms of the various performance criteria. Finally, we report again the MAPE values in the training and validation phases. In the training phase, as shown in Table 6, MAPE values ranged from 0.243 % to 0.825 % for OP-ELM and from 0.244 % to 0.783 % for MLPNN; MAPE values for all OP-ELM models were slightly larger than those for the MLPNN models. In the validation phase, as shown in Table 6, MAPE values ranged from 0.118 % to 0.278 % for OP-ELM and from 0.134 % to 0.295 % for MLPNN.
MAPE values for all MLPNN models were slightly larger than those for the OP-ELM models. Overall, OP-ELM performed better than MLPNN in the validation phase. To sum up, among the six forecasting models (FM1 to FM6), the OP-ELM models showed the better performance in the validation phase; in conclusion, the OP-ELM models are better suited for DO forecasting.

Table 6 Performance of the OP-ELM and MLPNN forecasting models in the training and validation phases several hours in advance for USGS 420853121505501 [Bottom] station
Table 7 Comparison between Mean and Standard deviation of the measured (Mes) and forecasted (Cal) values of DO, ratio of standard deviation (Sx) to RMSE (RPD), scatter index (SI), and ratio of the mean absolute error to standard deviation (RMAE), for USGS 420853121505501 [Bottom] station

We present in Table 7 a comparison between measured and calculated mean and standard deviation, the RPD index, the SI and the RMAE. According to Table 7, for both training and validation there were small differences between the model estimates and measured DO values in terms of the mean and standard deviation. In the training phase, all RMAE values were less than 0.5 and all RPD values were higher than 3, except for the FM6 model for both OP-ELM and MLPNN. It can be seen from Table 7 that, in the validation phase, for both the MLPNN and OP-ELM models, all differences between the mean of the measurements and the corresponding calculated DO values were less than 1.0 %. However, it is worth noting that there were only 2 cases where the difference between the measured and the corresponding calculated standard deviation was higher than 10 % (FM5 and FM6 for MLPNN), with a maximum of 11.81 % and a minimum of 2 %. This finding implies that the proposed models achieve a high level of accuracy and robustness. We also present in Table 7 the SI index values in the training and validation phases, for both the MLPNN and OP-ELM models. In the training phase, the SI of the six developed models (FM1 to FM6) ranged between 0.101 and 0.183 for the OP-ELM models, and between 0.159 and 0.229 for the MLPNN models; for all developed models, the SI was less than 0.240, which corresponds to excellent modeling. Similarly, in the validation phase, the SI of the six developed models ranged between 0.097 and 0.160 for the OP-ELM models, and between 0.148 and 0.204 for the MLPNN models; for all developed models, the SI was less than 0.210, which corresponds to excellent modeling. According to the SI values as well, the OP-ELM is better. In conclusion, based on the results obtained in the validation phase, the OP-ELM model must be ranked as the better model.
Figures 11 and 12 show scatter plots of the calculated versus observed DO for OP-ELM and MLPNN models, respectively, in the validation phase. Figures 13 and 14 show the comparison between the calculated and measured DO for OP-ELM and MLPNN models, respectively, in the validation phase.

Fig. 11
figure 11

Scatterplots of calculated versus measured values of dissolved oxygen (DO) in the validation phase, using OP-ELM model for the USGS 420853121505501 [Bottom] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 12
figure 12

Scatterplots of calculated versus measured values of dissolved oxygen (DO) in the validation phase, using MLPNN model for the USGS 420853121505501 [Bottom] Station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 13
figure 13

Comparison between the calculated and measured data of dissolved oxygen (DO) in the validation phase, using OP-ELM model for the USGS 420853121505501 [Bottom] station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

Fig. 14
figure 14

Comparison between the calculated and measured data of dissolved oxygen (DO) in the validation phase, using MLPNN model for the USGS 420853121505501 [Bottom] station: a +24 h, b +48 h, c +72 h, d +96 h, e +120 h, and f +168 h

4 Conclusions

In this paper, we presented and compared two models based on the artificial intelligence paradigm, MLPNN and OP-ELM, for forecasting DO several hours in advance. To show the applicability of the proposed methods, measured DO values from the Klamath River at Miller Island Boat Ramp, Oregon, USA were used. The data were collected from 2004 to 2013 at an hourly time step. The study led to a powerful tool that forecasts DO values with good accuracy (R between 0.91 and 0.98) in the validation phase. The results are very encouraging, since these tools were developed to complement existing methods and models for estimating DO in rivers without using other water quality parameters. This is, to our knowledge, the first publication demonstrating the use of OP-ELM and MLPNN for forecasting DO several hours to up to 7 days (168 h) in advance. In the validation phase, the results show that for both stations, Top and Bottom, the OP-ELM model provided more accurate forecasts than the MLPNN model.