MODWT—random vector functional link for river-suspended sediment load prediction

Hazarika, Barenya Bikash; Gupta, Deepak

doi:10.1007/s12517-022-10150-1


Original Paper
Published: 10 May 2022

Volume 15, article number 966, (2022)
Arabian Journal of Geosciences

Abstract

Estimating the daily suspended sediment load (SSL) requires an understanding of its nonlinear and complex nature and the use of nonlinear models for prediction. Heavy rainfall, precipitation, river discharge patterns, and tropical climate are a few of the major factors responsible for the complex nature of river SSL. Nonlinear machine learning models are capable of handling this complexity and nonlinearity in river SSL datasets. Therefore, this study presents a novel machine learning–based nonlinear random vector functional link (RVFL) model embedded with the boundary-corrected maximal overlap discrete wavelet transform (MODWT) for river SSL prediction. The proposed model, known as boundary-corrected wavelet RVFL (BCWRVFL), is trained on river SSL datasets gathered from the Tawang Chu river basin and the Pare river basin in Arunachal Pradesh, India. The performance of the BCWRVFL models is validated using several performance indicators. BCWRVFL's prediction performance is compared with that of support vector regression (SVR), least squares SVR (LSSVR), asymmetric Huber loss SVR (AHSVR), wavelet twin SVR (WTSVR), extreme learning machine (ELM), and RVFL. The proposed BCWRVFL achieves a root mean square error of 0.034 and a mean absolute error of 0.015. The experimental results demonstrate the efficiency of the proposed BCWRVFL model for daily SSL prediction in rivers.


Introduction

Suspended sediment load (SSL) prediction is a complicated process in river engineering practice. The transport of soil materials by erosive agents is referred to as sediment transport (Aksoy et al. 2019). Sediment load data is extremely useful for dam construction, measuring pollutants in rivers, forecasting territorial risks, planning stable channels and estuaries, and so on (Melesse et al. 2011; Khan et al. 2019). Monitoring and evaluating SSL is also important in determining water quality and associated hydrologic functions (Peterson et al. 2018a, b). A rise in SSL also decreases water visibility and light penetration, limiting plant and algae growth at the primary trophic level (Henley et al. 2000). Moreover, the deposition of suspended sediments decreases the flow area, restricting the movement of marine life and eventually contributing to a shift in the river course. Hence, it is important to estimate the SSL precisely. An effective prediction model may play a critical role in improving sediment load modeling in rivers. To address the issue, several traditional and artificial intelligence-based models have been developed to predict the river SSL. In general, time series methods assume linear relationships between variables; however, these relationships cannot be readily applied to real hydrological data, so the analysis can be enhanced by novel artificial intelligence (AI) methods (Babanezhad et al. 2020). Compared to conventional methods and other AI methods, single-layer feed-forward networks (SLFNs) produce appropriate results (Wang et al. 2009). However, a single SLFN model is not enough to handle the non-stationarity of river SSL datasets. Wavelets are powerful tools that can handle the nonlinearity as well as the non-stationarity in datasets. Therefore, hybrid wavelet-based models need to be developed that not only handle the nonlinearity and non-stationarity in datasets but also show promising prediction performance.

Literature review

There are several AI-based models for SSL prediction. Table 1 shows a few recent significant contributions on the estimation of SSL using AI techniques. Banadkooki et al. (2020) hybridized the ant-lion optimizer with an artificial neural network (ANN) for estimating the river SSL. Hazarika et al. (2020a) compared the performance of support vector machine (SVM) and ANN for SSL estimation. Gupta et al. (2020) proposed novel Huber loss function-based extreme learning machines for SSL prediction. Ghanbarynamin et al. (2020) applied several soft computing models for SSL prediction. Salih et al. (2020) explored several data-mining models for SSL estimation. Ehteram et al. (2021) developed a hybrid multi-objective whale algorithm for estimating the river SSL. Meshram et al. (2021) developed an iterative classifier optimizer-based random forest model for river SSL prediction. For the same purpose, Panahi et al. (2021) developed black widow optimization-based adaptive neuro-fuzzy inference system (ANFIS) and SVM models. Sharghi et al. (2021) proposed a prediction interval-based emotional ANN (EANN) with the bootstrap technique for SSL estimation. Mohammadi et al. (2021) estimated the SSL using a multilayer perceptron (MLP) hybridized with particle swarm optimization (PSO) and differential evolution (DE). Mohanta et al. (2021) used the ANFIS model for river SSL estimation. Nourani et al. (2021) applied SVM, ANFIS, feed-forward neural network (FFNN), and multilinear regression (MLR) models for SSL prediction. Sahoo et al. (2021) applied the recurrent neural network as well as the conventional SVM for river SSL estimation. Anand et al. (2021) prepared a review on the mobilization of cohesionless sediments through alluvial channels. Talebkeikhan et al. (2021) carried out a comparative analysis of ML-based models for the prediction of permeability. Gumgum and Guney (2021) studied the effect of sediment feeding on live-bed scour around circular bridge piers. Comprehensive reviews of AI-based models for SSL estimation were presented in Rajaee and Jafari (2020) and Gupta et al. (2021).

Table 1 Few recent prominent contributions for SSL estimation using AI models


Drucker and his team proposed an SVM variant called support vector regression (SVR) (Drucker et al. 1997) to solve regression problems. SVR and its variants have been successfully applied to various regression problems, including SSL prediction (Lafdani et al. 2013; Hazarika et al. 2021). Despite its high prediction performance, SVR has a high computational cost because it solves a quadratic programming problem (QPP) for error minimization. In addition, its prediction performance degrades in the presence of noisy data. To improve the computational speed of SVR, the LSSVR model was suggested by Suykens and Vandewalle (1999). LSSVR solves a set of linear equations rather than a QPP. Although its computational cost is reduced, it still lacks efficiency when dealing with noisy datasets. LSSVR has been explored by researchers in various application areas, including SSL estimation. To enhance the prediction performance of SVR for noisy datasets, AHSVR was suggested by Balasundaram and Meena (2019). More recently, ELM (Huang et al. 2004, 2011; Liu et al. 2008) has grown in popularity because of its high generalization performance at low computational cost (Huang et al. 2006; Balasundaram and Gupta 2016; Hazarika et al. 2021). ELM has been successfully applied by several researchers to SSL estimation; Hazarika et al. (2020b), Gupta et al. (2020), and Peterson et al. (2018a) are a few recent examples. One of the widespread types of ANN is the feed-forward network with random weights, which was popularized by Pao and Takefuji (1992), who proposed the RVFL network (Pao and Takefuji 1992; Cao et al. 2015; Dai et al. 2017). In RVFL, the inputs and outputs can be directly connected, leading toward an exceptional generalization ability. The weights between the input and the hidden layers can also be generated randomly (Zhang and Suganthan 2016a). The RVFL model has been extensively investigated in a wide range of applications, including load demand forecasting (Ren et al. 2016), time-series analysis (Gao et al. 2021; Hazarika and Gupta 2020), visual tracking (Zhang and Suganthan 2016b), and others.

The AI-based models that have been developed for SSL prediction illustrate how different models and datasets can be selected while developing a forecasting technique. It is well known that river SSL datasets contain non-stationary components; hence, it is difficult to reach a decision using a single method. This inspired us to suggest a hybrid prediction model. By combining the advantages of two distinct models, i.e., the maximal overlap discrete wavelet transform (MODWT) and the powerful RVFL, the newly suggested algorithm addresses the limitations of traditional prediction models. The high generalization capability of RVFL with fast training speed is well known. Moreover, to the best of our knowledge, RVFL's prediction performance has never been tested for river SSL prediction despite its numerous applications. On the other hand, wavelets are very powerful tools that can handle the nonlinearity as well as the non-stationary trends in datasets (Hazarika and Gupta 2020). Researchers have suggested various wavelet-based (WB) hybrid models for sediment load prediction. However, a recent study by Quilty and Adamowski (2018) showed that prior wavelet-embedded forecasting studies generally do not follow best practices for real-world WB prediction methodologies. They comprehensively examined correct and incorrect uses of wavelets in such studies. They also explored several boundary conditions (BC) that need to be adequately addressed to properly use a WB prediction technique in real-world problems and proposed a general WB data-driven forecasting framework (WDDFF) using MODWT and à trous (AT). That was also the first study that directly used the MODWT wavelet and scaling coefficients for prediction (Quilty and Adamowski 2018). Motivated by the idea of Quilty and Adamowski (2018), we have developed a new framework by hybridizing the boundary-corrected MODWT algorithm with the fast and efficient RVFL model.
The major contributions of this work are the following:

1. The prediction capacity of the RVFL model for SSL estimation has been tested.
2. Inspired by the work of Quilty and Adamowski (2018), a boundary-corrected MODWT-based approach has been adopted and a new boundary-corrected MODWT-based RVFL model is proposed.
3. Two different activation functions have been used for the proposed boundary-corrected RVFL model.
4. Comparative analysis is shown with SVR, AHSVR, LSSVR, WTSVR, ELM, and RVFL models.
5. Additionally, the autocorrelation plots are presented for the SSL datasets.

The next section elaborates the related studies. In the third section, the new BCWRVFL model is described. The experimental analyses are elaborated in the fourth section. In the last section the conclusion of this work is explained in brief.

Materials and methods

The RVFL

RVFL (Pao et al. 1994; Zhang and Suganthan 2016a) is a widely accepted single-layer feed-forward network (SLFN) in which only the output weights are adaptable parameters (Tang et al. 2018). In RVFL networks, the input and the output layers can be directly linked. In addition to the input nodes, the network contains enhancement nodes (ENs), which constitute the hidden layer of the RVFL network (Shi et al. 2021).

Consider an SLFN trained on $N$ samples $X={\left\{\left({x}_{i},{y}_{i}\right)\right\}}_{i=1}^{N}$, where ${x}_{i}\in {\mathfrak{R}}^{d}$ and ${y}_{i}\in {\mathfrak{R}}^{e}$ are the input vector and output vector with $d$ and $e$ dimensions, respectively. Let $\beta \in {\mathfrak{R}}^{\left({N}_{g}+d\right)\times e}$ denote the output weight matrix, $Y\in {\mathfrak{R}}^{N\times e}$ the output target matrix, ${N}_{g}$ the number of hidden nodes, and $l\left(\cdot \right)$ the output of the hidden layer. The regularized form of the RVFL model can be expressed as

$$\underset{\beta }{\mathrm{min}}\;{\left\|Z\beta -Y\right\|}^{2}+C{\left\|\beta \right\|}^{2}$$

(1)

where $Z=\left[L\;X\right]$ is the augmented matrix formed by the hidden layer outputs and the inputs, and $C$ is the tradeoff parameter. The hidden layer output matrix $L\in {\mathfrak{R}}^{N\times {N}_{g}}$ can be expressed as

$$L=\left[\begin{array}{ccc}{l}_{1}\left({x}_{1}\right)& \dots & {l}_{{N}_{g}}\left({x}_{1}\right)\\ {l}_{1}\left({x}_{2}\right)& \dots & {l}_{{N}_{g}}\left({x}_{2}\right)\\ \vdots & \ddots & \vdots \\ {l}_{1}\left({x}_{N}\right)& \dots & {l}_{{N}_{g}}\left({x}_{N}\right)\end{array}\right]$$

(2)

The weights of the hidden layer are generated at random. Only the output layer weight matrix $\beta$ must be learned. Differentiating (1) with respect to $\beta$ and equating the result to zero, we obtain

$$\beta ={\left({Z}^{t}Z+CI\right)}^{-1}{Z}^{t}Y$$

(3)

Here, $I$ is an identity matrix of appropriate dimension.

For any new input sample $x\in {\mathfrak{R}}^{d}$, the regression function of RVFL can be obtained as

$$f\left(x\right)=\left[l\left(x\right)\;x\right]\beta$$
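The closed-form training step in (3) and the regression function above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' MATLAB implementation: the function names `rvfl_fit` and `rvfl_predict`, the uniform range used for the random hidden weights, and the way the radbas function is applied to an affine projection of the input are all assumptions made for the example.

```python
import numpy as np

def radbas(s):
    # Radial basis activation exp(-s^2), applied elementwise (assumed form).
    return np.exp(-s ** 2)

def rvfl_fit(X, Y, n_hidden=20, C=1e-3, seed=0):
    """Fit an RVFL with the ridge solution beta = (Z^t Z + C I)^(-1) Z^t Y."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    # Hidden weights and biases are drawn once at random and never trained.
    W = rng.uniform(-1.0, 1.0, size=(d, n_hidden))
    b = rng.uniform(-1.0, 1.0, size=n_hidden)
    L = radbas(X @ W + b)                 # hidden layer output, N x n_hidden
    Z = np.hstack([L, X])                 # augmented matrix [L X] (direct links)
    A = Z.T @ Z + C * np.eye(Z.shape[1])
    beta = np.linalg.solve(A, Z.T @ Y)    # solve instead of forming an inverse
    return W, b, beta

def rvfl_predict(X, W, b, beta):
    # f(x) = [l(x) x] beta, evaluated for every row of X.
    L = radbas(X @ W + b)
    return np.hstack([L, X]) @ beta
```

Because the inputs are concatenated with the hidden outputs, a target that is linear in the inputs is recovered almost exactly when `C` is small, which is a quick sanity check for an implementation of this kind.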

The MODWT

The MODWT serves as a preprocessing model. Its general advantage is that it can handle the non-stationarity in time-series (TS) data. The non-stationarity is handled by passing the input samples through high pass filters (HPF) and low pass filters (LPF), which yield the wavelet coefficients (${V}_{j,i}$) and scaling coefficients (${U}_{j,i}$), respectively. The HPF and LPF are shown in Fig. 1 and can be denoted as (Al-Musaylh et al. 2020; Percival and Walden 2000)

$${V}_{j,i}=\sum_{l=0}^{{L}_{j}-1}{t}_{j,l}{Z}_{\left(i-l\right) \bmod N}$$

(4)

$${U}_{j,i}=\sum_{l=0}^{{L}_{j}-1}{s}_{j,l}{Z}_{\left(i-l\right) \bmod N}$$

(5)

where $Z$ is the input data of length $N$; $j=1,2,\dots ,J$, where $J$ indicates the level of decomposition; and $i$ is the time index. The ${j}^{th}$ level filters for ${V}_{j,i}$ and ${U}_{j,i}$ are denoted by ${t}_{j,l}$ and ${s}_{j,l}$, respectively, and ${L}_{j}$ is the width of the ${j}^{th}$ level filter.

Finally, the additive reconstruction property can be used for reconstruction as (Maheswaran and Khosa 2012)

$${X}_{t}={U}_{j,i}+{V}_{j,i}$$

(6)
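The circular filtering in (4) and (5) and the additive reconstruction can be illustrated with a small NumPy sketch. The source does not state which wavelet filter was used, so the Haar filter is assumed here purely for illustration, and the function name `haar_modwt` is hypothetical. The sketch uses the pyramid form of the MODWT, in which each level filters the previous level's scaling coefficients with a lag of $2^{j-1}$.

```python
import numpy as np

def haar_modwt(x, J):
    """Return ([V_1, ..., V_J], U_J) for the Haar MODWT with circular boundaries.

    V_j are the wavelet (high-pass) coefficients and U_J the level-J scaling
    (low-pass) coefficients, following the notation of the text.
    """
    u_prev = np.asarray(x, dtype=float)
    wavelet_coeffs = []
    for j in range(1, J + 1):
        shift = 2 ** (j - 1)
        # lagged[i] = u_prev[(i - shift) mod N], i.e. circular filtering.
        lagged = np.roll(u_prev, shift)
        v_j = 0.5 * u_prev - 0.5 * lagged   # Haar MODWT wavelet filter [1/2, -1/2]
        u_j = 0.5 * u_prev + 0.5 * lagged   # Haar MODWT scaling filter [1/2, 1/2]
        wavelet_coeffs.append(v_j)
        u_prev = u_j
    return wavelet_coeffs, u_prev
```

For the Haar filter the coefficients sum back to the input, so `sum(V) + U` reproduces `x` exactly. Note also that the first coefficients at each level depend on values wrapped around from the end of the series, which is the boundary effect discussed in the next section.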

Proposed boundary-corrected wavelet random vector functional link (BCWRVFL)

The ${V}_{j,i}$ and ${U}_{j,i}$ must be used correctly, which means that they should be boundary corrected (BC): the ${V}_{j,i}$ and ${U}_{j,i}$ should not be affected by any boundary conditions during prediction. To achieve this, the future data problem must first be handled properly (Quilty and Adamowski 2018).

The data prediction problem and its solution

The data prediction problem arises when a wavelet transform (WT) (e.g., AT-multiresolution analysis (MRA) and MODWT-MRA) needs TS observations from after time $t$ to compute the transform of the TS data at time $t$. In real-world TS forecasting, a WT must not use future data. As per Quilty and Adamowski (2018), the solution to the problem is simple: one should use the causal MODWT algorithm rather than the non-causal MODWT-MRA. In addition, in real-world forecasting problems, the decomposition level (DL), wavelet filters (WF), and training and testing sets should be properly chosen. The width of the filters ${L}_{j}$ is given by (Bašta 2014; Maslova et al. 2016)

$${L}_{j}=\left({2}^{j}-1\right)\left(L-1\right)+1$$

(7)

Additionally, the DL and WF selection is a 3-step procedure.

1) select MODWT or AT for wavelet decomposition,
2) choose the DL and WF,
3) eliminate the boundary-affected coefficients at the start of each ${V}_{j,i}$ and ${U}_{j,i}$ series, whose number is determined using (7), which results in BC ${V}_{j,i}$ and BC ${U}_{j,i}$.
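The boundary-correction step can be sketched as follows. The helper names `filter_width` and `drop_boundary` are hypothetical, and the number of discarded samples is an assumption: with circular filtering, the first $L_J - 1$ coefficients at the deepest level $J$ depend on values wrapped around from the end of the series, so the sketch drops that many from every coefficient series. The source does not spell out the exact count, so treat this as one reasonable reading rather than the authors' procedure.

```python
def filter_width(level, base_width):
    # Width of the level-`level` filter from (7): (2^level - 1)(L - 1) + 1,
    # where base_width is the width L of the chosen wavelet filter.
    return (2 ** level - 1) * (base_width - 1) + 1

def drop_boundary(coeff_series, J, base_width):
    """Remove boundary-affected samples from each coefficient series.

    Assumption: the first L_J - 1 samples are affected, where L_J is the
    filter width at the deepest decomposition level J.
    """
    n_affected = filter_width(J, base_width) - 1
    return [series[n_affected:] for series in coeff_series]
```

For example, a Haar filter (`base_width = 2`) at `J = 2` gives a filter width of 4, so the first three samples of every series are discarded, and all series keep the same length and time alignment afterwards.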

The model development stages of the proposed BCWRVFL are portrayed in Fig. 2. The normalized SSL data is given as an input to the MODWT model. The MODWT decomposes the data into ${V}_{j,i}$ and ${U}_{j,i}$ using the HPF and LPF, respectively. The ${V}_{j,i}$ and ${U}_{j,i}$ are boundary corrected in the next stage. The BC-MODWT data is given as an input to RVFL. Finally, the output is evaluated using five different performance indicators.

Experimental setup and dataset description

The simulations were run on a Windows 7 system with an Intel i5 processor, 8 GB of RAM, and 1 TB of storage, using MATLAB 2019. A 70–30 training–testing split is used, and tenfold cross-validation is applied to select the optimal parameters. The datasets are normalized as ${\overline{x} }_{lm}=\frac{{x}_{lm}-{x}_{m}^{min}}{{x}_{m}^{max}-{x}_{m}^{min}}$, where ${x}_{lm}$ is the input value, ${\overline{x} }_{lm}$ is the normalized value of ${x}_{lm}$, and ${x}_{m}^{max}$ and ${x}_{m}^{min}$ are the maximum and minimum values, respectively. Zhang and Suganthan (2016a) found that the hardlim and sign activation functions degrade the overall performance of the RVFL model, while the radbas activation function always leads to good generalization performance. Therefore, the radbas activation function and the popular multiquadric activation function have been selected for the ELM, RVFL, and BCWRVFL models. The radbas and multiquadric activation functions can be expressed as:

a) Radbas: $f\left(a,x\right)=exp\left(-{\left(x-a\right)}^{2}\right),$
b) Multiquadric: $f\left(a,x\right)=\sqrt{\Vert {x}^{2}-{a}^{2}\Vert },$

where $f\left(a,x\right)$ indicates the output for $x$ and $a$, and $\Vert .\Vert$ refers to the Euclidean norm. For the SVR, AHSVR, WTSVR, and LSSVR models, the popular Gaussian kernel has been used. To assess the efficiency of the proposed BCWRVFL model, five different performance evaluators are used, i.e., root mean square error (RMSE), correlation coefficient (R), mean absolute error (MAE), normalized absolute error (NAE), and the ratio of the sum of squared error to the total sum of squares (SSE/SST). Their definitions are as follows:

- $R=\frac{\sum\limits_{i=1}^{N}\left({z}_{i}-\overline{z}\right)\left({e}_{i}-\overline{e}\right)}{\sqrt{\sum\limits_{i=1}^{N}{\left({z}_{i}-\overline{z}\right)}^{2}} \sqrt{\sum\limits_{i=1}^{N}{\left({e}_{i}-\overline{e}\right)}^{2}}}$
- $RMSE=\sqrt{\frac{1}{N}\sum\limits_{i=1}^{N}{\left({z}_{i}-{e}_{i}\right)}^{2}}$
- $MAE=\frac{1}{N}\sum\limits_{i=1}^{N}\left|{z}_{i}-{e}_{i}\right|$
- $NAE=\frac{\frac{1}{N}\sum\limits_{i=1}^{N}\left|{z}_{i}-{e}_{i}\right|}{\frac{1}{N}\sum\limits_{i=1}^{N}{z}_{i}}$
- $SSE/SST=\frac{\frac{1}{N}\sum\limits_{i=1}^{N}{\left({z}_{i}-{\widehat{z}}_{i}\right)}^{2}}{\frac{1}{N}\sum\limits_{i=1}^{N}{\left({z}_{i}-\overline{z}\right)}^{2}}$

where

$e$: estimated values
$\widehat{e}$: predicted value of $e$
$\overline{e}$: mean of $e$
$z$: original values
$\overline{z}$: mean of $z$
$\widehat{z}$: predicted value of $z$
max: peak value
$N$: total samples
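The five indicators can be computed with a short NumPy helper. This is a sketch under stated assumptions: the function name `ssl_metrics` is hypothetical, the predictions are passed as `e`, NAE is taken with absolute errors in the numerator, and SSE/SST uses squared terms, as the indicator names suggest.

```python
import numpy as np

def ssl_metrics(z, e):
    """Return RMSE, MAE, R, NAE and SSE/SST for observations z and predictions e."""
    z = np.asarray(z, dtype=float)
    e = np.asarray(e, dtype=float)
    err = z - e
    return {
        "RMSE": float(np.sqrt(np.mean(err ** 2))),
        "MAE": float(np.mean(np.abs(err))),
        # Pearson correlation between observed and predicted values.
        "R": float(np.corrcoef(z, e)[0, 1]),
        # Mean absolute error normalized by the mean observation (assumed form).
        "NAE": float(np.mean(np.abs(err)) / np.mean(z)),
        # Sum of squared errors over the total sum of squares.
        "SSE/SST": float(np.sum(err ** 2) / np.sum((z - z.mean()) ** 2)),
    }
```

Lower values are better for RMSE, MAE, NAE, and SSE/SST, while values of R closer to 1 indicate stronger linear agreement between the observed and predicted series.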

The SSL dataset was collected from the Tawang Chu river, which has a catchment area of 2737 sq km and lies between latitudes 27°30ʹ00ʺ and 28°24ʹ00ʺ and longitudes 91°47ʹ00ʺ and 92°28ʹ00ʺ (Panda et al. 2014). The monsoon season takes place between May and September or early October. A detailed description of the datasets is presented in Hazarika et al. (2020b) and Gupta et al. (2020). The study area is portrayed in Fig. 3. We have named the datasets from 2013 and 2015 SSLD1 and SSLD2, respectively. In addition, to verify the efficiency of the proposed models, we have also used a dataset collected from the Pare river, India. The Pare river has a catchment area of 824 sq km. We name this dataset Pare SSLD. The statistics of the datasets are presented in Table 2.

Table 2 Details of the SSL datasets (g/L)


Results and analysis

Experiment on SSL datasets

The daily observed SSL data for SSLD1 is exhibited in Figs. 4 and 5. Very low SSL rates can be observed from January 2013 to mid-March 2013. An increasing trend can be observed from May 2013 to September 2013 because of the monsoon season. The trend then gradually decreases from mid-September 2013 until December 2013 because of the decrease in rainfall during the winter season.

In the case of the SSLD2 dataset, high SSL rates can be observed from May to September 2015. Several spikes can be observed during this period, which are due to irregularity in rain and wind speed. However, negligible SSL can be noticed from January to April and from October to December (winter season).

The experimental outcomes of SVR, LSSVR, AHSVR, WTSVR, ELM multiquadric (ELM M), ELM radbas (ELM R), RVFL multiquadric (RVFL M), RVFL radbas (RVFL R), and the proposed BCWRVFL multiquadric (BCWRVFL M) and BCWRVFL radbas (BCWRVFL R) are presented in Table 3. Various performance indicators, viz., RMSE, MAE, SSE/SST, NAE, and R, have been used to evaluate the models. The R index directly compares the observed values and the predicted values. It is observed that (a) for SSLD1, there is a 5.3443%, 6.4344%, 5.7246%, 6.3415%, 11.1111%, 5.4934%, 9.6843%, and 9.4501% increase in R value for the proposed BCWRVFL M model compared to SVR, LSSVR, AHSVR, WTSVR, ELM R, ELM M, RVFL R, and RVFL M, respectively.

Table 3 Performance of the models on the SSL time-series datasets


(b) For SSLD2, there is a 10.4244%, 12.8325%, 13.7213%, 11.1511%, 18.9818%, 8.9612%, 5.7537%, and 26.1948% increase in R value for the proposed BCWRVFL R model compared to SVR, LSSVR, AHSVR, WTSVR, ELM R, ELM M, RVFL R, and RVFL M.

(c) For Pare SSLD, there is a 20.1167%, 29.0193%, 29.6871%, 26.5539%, 17.7851%, 16.4982%, 84.7979%, and 28.4154% increase in R value for the proposed BCWRVFL R model compared to SVR, LSSVR, AHSVR, WTSVR, ELM R, ELM M, RVFL R, and RVFL M.
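The percentage increases quoted above are presumably relative improvements of the proposed model's R over each baseline's R. A minimal sketch of that arithmetic follows; the function name `percent_increase` and the example values are hypothetical and are not taken from Table 3.

```python
def percent_increase(r_baseline, r_proposed):
    # Relative increase of the proposed model's R over a baseline's R, in percent.
    return (r_proposed - r_baseline) / r_baseline * 100.0

# Hypothetical values for illustration only (not results from the paper):
# a baseline R of 0.80 and a proposed-model R of 0.84.
example = percent_increase(0.80, 0.84)
```

With these illustrative values the relative increase is about 5%, which is the same kind of figure as those reported for each baseline model.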

Further, the average rank based on the performance indicators is tabulated in Table 4. One can notice from Tables 3 and 4 that the proposed BCWRVFL model shows better or comparable prediction performance. To portray the relationship between the observed and the predicted values, the observed versus predicted values of the models along with their R² values are shown in Figs. 6 and 7 for the SSLD1 and SSLD2 datasets, respectively. From Fig. 6 it is observed that the proposed BCWRVFL M achieves a higher R² value (0.6724) than the SVR, LSSVR, AHSVR, WTSVR, ELM R, ELM M, RVFL R, and RVFL M models. Moreover, from Fig. 7, one can notice that the proposed BCWRVFL R achieves a higher R² value (0.8013) than the SVR, LSSVR, AHSVR, WTSVR, ELM R, ELM M, RVFL R, and RVFL M models. The following implications can be derived from Table 2, Table 3, Fig. 6, and Fig. 7:

a) It is noticeable from Table 3 that the BCWRVFL multiquadric model shows the best MAE, SSE/SST, NAE, and R values for the SSLD1 dataset.
b) For the SSLD2 dataset and the Pare SSLD dataset (SSLD3), the BCWRVFL multiquadric model shows the best NAE and R values, respectively.
c) Moreover, the proposed BCWRVFL radbas model shows the best RMSE values for all datasets.
d) In addition, the BCWRVFL with the radbas activation function shows the best SSE/SST and R values for SSLD2 and the best NAE value for the Pare SSLD dataset (SSLD3).
e) Fig. 6 shows the observed versus predicted plots of the reported models for SSLD1. One can notice that the predictions of the proposed BCWRVFL multiquadric model are highly correlated with the observed values.

Table 4 Ranks based on performance indicators for the reported models


As in Fig. 6, the observed versus predicted values shown in Fig. 7 for SSLD2 indicate that the proposed models show high correlation.

Moreover, the autocorrelation function (ACF) and the partial ACF are presented in Figs. 8 and 9 for SSLD1 and SSLD2, respectively. The partial ACF removes the dependence on intermediate elements and identifies how strongly the SSL data is correlated with its own lagged values.
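For readers who want to reproduce plots of this kind, the sample ACF and the partial ACF can be computed as sketched below. The source does not say how its ACF and partial ACF were obtained; this example assumes the standard biased sample autocorrelation and the Durbin-Levinson recursion, and the function names are hypothetical.

```python
import numpy as np

def acf(x, max_lag):
    """Sample autocorrelation for lags 0..max_lag."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    denom = np.dot(x, x)
    return np.array([np.dot(x[:len(x) - k], x[k:]) / denom
                     for k in range(max_lag + 1)])

def pacf_from_acf(rho):
    """Partial autocorrelations from autocorrelations rho[0..K] (Durbin-Levinson)."""
    rho = np.asarray(rho, dtype=float)
    K = len(rho) - 1
    pacf = np.zeros(K + 1)
    pacf[0] = 1.0
    phi_prev = np.zeros(0)   # AR coefficients of order k-1, empty for k = 1
    for k in range(1, K + 1):
        num = rho[k] - np.dot(phi_prev, rho[k - 1:0:-1])
        den = 1.0 - np.dot(phi_prev, rho[1:k])
        phi_kk = num / den
        # Update the order-k coefficients: phi_k,j = phi_k-1,j - phi_kk * phi_k-1,k-j.
        phi_prev = np.append(phi_prev - phi_kk * phi_prev[::-1], phi_kk)
        pacf[k] = phi_kk
    return pacf
```

A typical call would be `pacf_from_acf(acf(series, 20))`. For an autocorrelation sequence that decays geometrically, as in a first-order autoregressive process, the partial ACF is nonzero only at lag 1, which illustrates how it removes the dependence on intermediate lags.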

The model performances based on different indicators are plotted in Figs. 10 and 11 for SSLD1 and SSLD2 datasets, respectively. It can be noticed from Fig. 10 that the proposed BCWRVFL M shows the best NAE, SSE/SST, MAE, and R values while the BCWRVFL R shows the best RMSE value. Moreover, from Fig. 11 one can conclude that the proposed BCWRVFL R shows the best NAE score, while the BCWRVFL M shows the best RMSE, SSE/SST, and R values.

Experimental analyses on a real-world time-series dataset

Moreover, to further check the applicability of the proposed BCWRVFL model on real-world TS datasets, we have conducted an experiment on a TS dataset named “POPULATION,” which contains the total population of India from 1961 to 2019. The dataset was downloaded from https://data.worldbank.org/ and has recently been used by Hazarika and Gupta (2020). The experimental results of BCWRVFL are compared with the traditional SVR, LSSVR, AHSVR, WTSVR, ELM, and RVFL models. The results are portrayed in Table 5. It can be noted that the BCWRVFL shows excellent prediction performance for the “POPULATION” dataset.

Table 5 Prediction performance of the reported models on “POPULATION” dataset (best result is bolded)


Conclusion

A novel hybrid model was developed and used to predict the SSL in this study. It is well known that river SSL datasets contain non-stationary components, making it difficult to reach a decision using a single method. This prompted us to propose a hybrid prediction model. The newly proposed algorithm addresses the limitations of traditional prediction models by incorporating the benefits of two distinct models, namely, the maximal overlap discrete wavelet transform (MODWT) and the powerful RVFL. The boundary-corrected MODWT is combined with RVFL to create the hybrid model known as BCWRVFL. Experiments are performed on two SSL datasets collected from the Tawang Chu river, India, and an SSL dataset collected from the Pare river, India. The proposed BCWRVFL models are compared with SVR, LSSVR, AHSVR, WTSVR, ELM, and RVFL models and evaluated using five performance indicators. The experimental outcomes reveal the importance and potential of the BCWRVFL model for SSL estimation, as its predictions show close agreement with the observed records. The proposed BCWRVFL model can be applied to several real-world time-series applications such as wind speed prediction, price forecasting, energy consumption prediction, and others. However, the main limitation of the study is that we consider only the SSL data. In the future, other climatological parameters such as rainfall intensity, wind speed, and evaporation need to be considered along with the SSL values. It is suggested that the model be tested in areas or countries with more seasons and greater variability in weather conditions to determine its predictive strength. Moreover, it would be interesting to develop a wavelet-based deep RVFL network for SSL prediction in the future.

Data Availability

The datasets that have been used in this study are available from co-author on reasonable request.

References

Aksoy H, Mahe G, Meddi M (2019) Modeling and practice of erosion and sediment transport under change. Water 11(8):1665
Al-Musaylh MS, Deo RC, Li Y (2020) Electrical energy demand forecasting model development and evaluation with maximum overlap discrete wavelet transform-online sequential extreme learning machines algorithms. Energies 13(9):2307
Anand A, Beg M, Kumar N (2021) Experimental studies and analysis on mobilization of the cohesionless sediments through alluvial channel: a review. Civil Eng J 7(5):915–936
Babanezhad M, Behroyan I, Marjani A, Shirazian S (2020) Artificial intelligence simulation of suspended sediment load with different membership functions of ANFIS. Neural Comput Appl 1–15
Balasundaram S, Gupta D (2016) On optimization based extreme learning machine in primal for regression and classification by functional iterative method. Int J Mach Learn Cybern 7(5):707–728
Balasundaram S, Meena Y (2019) Robust support vector regression in primal with asymmetric Huber loss. Neural Process Lett 49(3):1399–1431
Banadkooki FB, Ehteram M, Ahmed AN, Teo FY, Ebrahimi M, Fai CM, … El-Shafie A (2020) Correction to: suspended sediment load prediction using artificial neural network and ant lion optimization algorithm. Environ Sci Pollut Res 27(30):38117–38119
Bašta M (2014) Additive decomposition and boundary conditions in wavelet-based forecasting approaches. Acta Oeconomica Pragensia 22(2):48–70
Cao F, Ye H, Wang D (2015) A probabilistic learning algorithm for robust modeling using neural networks with random weights. Inf Sci 313:62–78
Dai W, Chen Q, Chu F, Ma X, Chai T (2017) Robust regularized random vector functional link network and its industrial application. IEEE Access 5:16162–16172
Drucker H, Burges CJ, Kaufman L, Smola A, Vapnik V (1997) Support vector regression machines. Adv Neural Inf Process Syst 9:155–161
Ehteram M, Ahmed AN, Latif SD, Huang YF, Alizamir M, Kisi O, … El-Shafie A (2021) Design of a hybrid ANN multi-objective whale algorithm for suspended sediment load prediction. Environ Sci Pollut Res 28(2):1596–1611
Gao R, Du L, Yuen KF, Suganthan PN (2021) Walk-forward empirical wavelet random vector functional link for time series forecasting. Appl Soft Comput 107450
Ghanbarynamin S, Zaremehrjardy M, Ahmadi M (2020) Application of soft-computing techniques in forecasting sediment load and concentration. Hydrol Sci J 65(13):2309–2321
Article Google Scholar
Gumgum F, Guney MS (2021) Effect of sediment feeding on live-bed scour around circular bridge piers. Civil Eng J 7(5):906–914
Article Google Scholar
Gupta D, Hazarika BB, Berlin M (2020) Robust regularized extreme learning machine with asymmetric Huber loss function. Neural Comput Appl 32(16):12971–12998
Article Google Scholar
Gupta D, Hazarika BB, Berlin M, Sharma UM, Mishra K (2021) Artificial intelligence for suspended sediment load prediction: a review. Environ Earth Sci 80(9):1–39
Article Google Scholar
Hazarika, B. B., & Gupta, D. (2020). Modelling and forecasting of COVID-19 spread using wavelet-coupled random vector functional link networks. Appl Soft Compu 106626.
Hazarika BB, Gupta D, Berlin M (2021) A coiflet LDMR and coiflet OB-ELM for river suspended sediment load prediction. Int J Environ Sci Technol 18(9):2675–2692
Article Google Scholar
Hazarika, B. B., Gupta, D., & Berlin, M. (2020a). A comparative analysis of artificial neural network and support vector regression for river suspended sediment load prediction. In First Int Confer Sustain Technol Comput Intell (pp. 339–349). Springer, Singapore.
Hazarika BB, Gupta D, Berlin M (2020b) Modeling suspended sediment load in a river using extreme learning machine and twin support vector regression with wavelet conjunction. Environ Earth Sci 79:1–15
Henley WF, Patterson MA, Neves RJ, Lemly AD (2000) Effects of sedimentation and turbidity on lotic food webs: a concise review for natural resource managers. Rev Fish Sci 8(2):125–139
Huang GB, Chen L, Siew CK (2006) Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans Neural Netw 17(4):879–892
Huang GB, Zhou H, Ding X, Zhang R (2011) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybernet Part B (Cybernetics) 42(2):513–529
Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE Int Joint Confer Neural Netw (IEEE Cat. No. 04CH37541), vol 2. IEEE, pp 985–990
Khan MYA et al (2019) Artificial neural network simulation for prediction of suspended sediment concentration in the River Ramganga, Ganges Basin, India. Int J Sediment Res 34(2):95–107
Lafdani EK, Nia AM, Ahmadi A (2013) Daily suspended sediment load prediction using artificial neural networks and support vector machines. J Hydrol 478:50–62
Liu Q, He Q, Shi Z (2008) Extreme support vector machine classifier. In: Pacific-Asia Confer Knowl Discov Data Min. Springer, Berlin, Heidelberg, pp 222–233
Maheswaran R, Khosa R (2012) Comparative study of different wavelets for hydrologic forecasting. Comput Geosci 46:284–295
Maslova I, Ticlavilca AM, McKee M (2016) Adjusting wavelet-based multiresolution analysis boundary conditions for long-term streamflow forecasting. Hydrol Process 30(1):57–74
Melesse AM et al (2011) Suspended sediment load prediction of river systems: an artificial neural network approach. Agric Water Manag 98(5):855–866
Meshram SG, Safari MJS, Khosravi K, Meshram C (2021) Iterative classifier optimizer-based pace regression and random forest hybrid models for suspended sediment load prediction. Environ Sci Pollut Res 28(9):11637–11649
Mohammadi B, Guan Y, Moazenzadeh R, Safari MJS (2021) Implementation of hybrid particle swarm optimization-differential evolution algorithms coupled with multi-layer perceptron for suspended sediment load estimation. CATENA 198:105024
Mohanta NR, Biswal P, Kumari SS, Samantaray S, Sahoo A (2021) Estimation of sediment load using adaptive neuro-fuzzy inference system at Indus River Basin, India. In: Intell Data Eng Anal. Springer, Singapore, pp 427–434
Nourani V, Gokcekus H, Gelete G (2021) Estimation of suspended sediment load using artificial intelligence-based ensemble model. Complexity 2021
Panahi F, Ehteram M, Emami M (2021) Suspended sediment load prediction based on soft computing models and Black Widow Optimization Algorithm using an enhanced gamma test. Environ Sci Pollut Res 1–21
Panda R, Padhee SK, Dutta S (2014) GLOF study in Tawang River Basin, Arunachal Pradesh, India. Int Arch Photogramm Remote Sens Spat Inf Sci 40(8):101
Pao YH, Takefuji Y (1992) Functional-link net computing: theory, system architecture, and functionalities. Computer 25(5):76–79
Pao YH, Park GH, Sobajic DJ (1994) Learning and generalization characteristics of the random vector functional-link net. Neurocomputing 6(2):163–180
Percival DB, Walden AT (2000) Wavelet methods for time series analysis (Vol. 4). Cambridge University Press
Peterson K et al (2018a) Suspended sediment concentration estimation from landsat imagery along the Lower Missouri and Middle Mississippi Rivers using an extreme learning machine. Remote Sens 10(10):1503
Peterson KT, Sagan V, Sidike P, Cox AL, Martinez M (2018b) Suspended sediment concentration estimation from landsat imagery along the lower Missouri and middle Mississippi Rivers using an extreme learning machine. Remote Sens 10(10):1503
Quilty J, Adamowski J (2018) Addressing the incorrect usage of wavelet-based hydrological and water resources forecasting models for real-world applications with best practices and a new forecasting framework. J Hydrol 563:336–353
Rajaee T, Jafari H (2020) Two decades on the artificial intelligence models advancement for modeling river sediment concentration: State-of-the-art. J Hydrol 588:125011
Ren Y, Suganthan PN, Srikanth N, Amaratunga G (2016) Random vector functional link network for short-term electricity load demand forecasting. Inf Sci 367:1078–1093
Sahoo A, Barik A, Samantaray S, Ghose DK (2021) Prediction of sedimentation in a watershed using RNN and SVM. In: Commun Softw Netw. Springer, Singapore, pp 701–708
Salih SQ, Sharafati A, Khosravi K, Faris H, Kisi O, Tao H, … Yaseen ZM (2020) River suspended sediment load prediction based on river discharge information: application of newly developed data mining models. Hydrol Sci J 65(4):624–637
Sharghi E, Paknezhad NJ, Najafi H (2021) Assessing the effect of emotional unit of emotional ANN (EANN) in estimation of the prediction intervals of suspended sediment load modeling. Earth Sci Inf 14(1):201–213
Shi Q, Katuwal R, Suganthan PN, Tanveer M (2021) Random vector functional link neural network based ensemble deep learning. Pattern Recogn 117:107978
Suykens JA, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9(3):293–300
Talebkeikhah M, Sadeghtabaghi Z, Shabani M (2021) A comparison of machine learning approaches for prediction of permeability using well log data in the hydrocarbon reservoirs. J Human Earth Future 2(2):82–99
Tang L, Wu Y, Yu L (2018) A non-iterative decomposition-ensemble learning paradigm using RVFL network for crude oil price forecasting. Appl Soft Comput 70:1097–1108
Wang WC, Chau KW, Cheng CT, Qiu L (2009) A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol 374(3–4):294–306
Zhang L, Suganthan PN (2016a) A comprehensive evaluation of random vector functional link networks. Inf Sci 367:1094–1105
Zhang L, Suganthan PN (2016b) Visual tracking with convolutional random vector functional link network. IEEE Trans Cybern 47(10):3243–3253

Acknowledgements

We acknowledge the help of NHPC LTD, Tawang Basin Project, in providing us with the datasets.

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Andhra Pradesh, 522502, India
Barenya Bikash Hazarika
Department of Computer Science & Engineering, National Institute of Technology Arunachal Pradesh, Jote, 791113, India
Deepak Gupta

Corresponding author

Correspondence to Deepak Gupta.

Ethics declarations

Conflict of interests

The authors declare that they have no competing interests.

Additional information

Responsible Editor: Broder J. Merkel

About this article

Cite this article

Hazarika, B.B., Gupta, D. MODWT—random vector functional link for river-suspended sediment load prediction. Arab J Geosci 15, 966 (2022). https://doi.org/10.1007/s12517-022-10150-1

Received: 07 February 2022
Accepted: 19 April 2022
Published: 10 May 2022
DOI: https://doi.org/10.1007/s12517-022-10150-1
