Keywords

1 Introduction

Estimation and prediction of suspended sediment load is important in water management and monitoring of environmental issues (Melesse et al. 2011). In a prior work, multiple machine learning (ML) techniques were used to estimate sediment load from available input predictor variables (Hayder et al. 2021). The two predictor variables are suspended solid and discharge. The sufficient review on the important of sediment load estimation has also been discussed. However, the study is more focused on revealing the correlation between sediment load in the catchment area with the input variables, namely suspended solid and discharge.

This paper extent this earlier study by introducing multi-step ahead forecasting where artificial neural networks (ANN) will be used to forecast the sediment load based on its past values, the input variables and their historical data. The proposed multi-step ahead forecasting takes the popular time-series forecasting architecture namely non-linear autoregressive with exogenous inputs (NARX) with the internal function based on neural networks. This hybrid approach makes up an architecture known as non-linear autoregressive with exogenous inputs (NARX) neural networks. This architecture has been applied in various task for multi-step ahead forecasting ranging across disciplines not only in hydrology (Guo and Xue 2014; Sarkar et al. 2019).

In the applications of hydrology especially suspended sediment load forecasting, some studies have used that particular NARX architecture with different input variables and different machine learning methods and yielding various interesting results (Mohammadi et al. 2019; Kumar et al. 2017). Some studies use particular neural networks as the prediction engine in the NARX architecture, i.e., NARX neural networks (Bouzeria et al. 2017; Alarcon 2021; Afan et al. 2014; Nivesh and Kumar 2017). These vibrant research interest of using machine learning in the area of hydrology indicates the potential impact of application and contribution in solving problems in water management and environment systems.

This study aims to further explore and justify the important of this hybrid multidisciplinary approach of computer science and hydrology by presenting the case study of sediment load forecasting using NARX neural networks. The study employs the data was recorded in upstream catchment of Ringlet reservoir, Cameroon Highland, Malaysia.

2 Study Area and Data

There are 8 upstream area in Ringlet reservoir catchment located in Cameroon Highland, Pahang, Malaysia. This location is taken as the case study where there data was gathered (Hayder et al. 2021). The geographical map is shown in Fig. 1. Within the Cameron Highlands, there are six water catchments: Bertam, Lemoi, Mensun, Terla, Telom and Wi. In addition, the district contains various towns such as Kuala Terla, Kampung Raja, Kea Farm, Teringkap, Brinchang, Tanah Rata and Ringlet as the district's southernmost town (Gasim et al. 2009). The largest reservoirs in the Cameron Highlands district are Ulu Jelai and Sultan Abu Bakar (SAB) Dam. Ringlet Dam is another name for SAB Dam, which was erected in the 1960s with 19,000 m3 of rockfill and 52,000 m3 of concrete standing 39.6 m height and 135 m long.

Fig. 1
figure 1

The map of the study area at Cameron Highlands, Malaysia

Ringlet Reservoir contributes 20% of total power generation in West Malaysia as part of the National Electric Board’s Cameron Highlands Hydroelectric Scheme. Ringlet Reservoir, in addition to generating hydropower for Jor Power Plant, it serves as a flood control reservoir for residents in the Bertam Valley. The system was under the care of Malaysian Metrological Department (MMD), Tenaga Nasional Berhad (TNB) and the Department of Irrigation and Drainage (DID). Generally, Ringlet reservoir catchment is equipped with a good hydrological monitoring system including stream flow, rainfall and weather stations (Abdul Razad et al. 2018). Nonetheless, the reservoir storage volume has been depleted because of sedimentation, affecting the power station’s energy output. The worst-case situation occurs when the Ringlet Reservoir gradually loses its ability to hold large flood inflows, forcing flood discharge through the spillway to be controlled (Sidek 2013).

Three important variables, i.e., discharge (DC) in mg/L, suspended solid (SS) in m3/s and sediment load (SL) in ton/day units were collected as the raw data. The available information on DC and SS in the 8 (eights) spots neighboring the Ringlet reservoir catchment area are compiled after substantial pre-processing and cleansing; the locations are listed in this reference (Hayder et al. 2021). Many daily data are missing so that only left some hundred instances can be used for this study. From December 12, 1997, to May 12, 2010, 405 data instances were compiled after extreme outliers were removed. Figure 2 shows the boxplot of the data where several outliers are still detected on each variable although the extreme outliers are already removed.

Fig. 2
figure 2

Boxplot of the data

Despite the actual data is not in a uniform temporal time-series sequence, this research assumes that the compiled data is in a uniform temporal time-series sequence so that the NARX neural networks model development is more focused rather than the validity of the data and the model itself. Therefore, on order to have higher applicability impact, the data quality must be improved in the future since the main ingredient of machine learning-based model is the data.

3 Methodology

The architecture of NARX neural networks can be seen in Fig. 3 (Thapa et al. 2020). It is applied as multivariate estimator to perform prediction on a number of step ahead values of the sediment load using the lagged values of both SS and DC as well as the previously estimated values of SL.

Fig. 3
figure 3

The architecture of the proposed NARX neural networks

The predictive model used in here is NARX neural networks. It is essentially an extension of non-linear autoregressive (NAR) model by combining additional relevant time-series features to the forecasting model. The non-linearity of the model is handled by the neural networks. Here, the standard multilayer neural networks where the architecture and the training algorithms is listed in Table 1. The implementation is performed in MATLAB 2019b software under the university license.

Table 1 Architecture of the multilayer neural networks used in the NARX neural networks

Firstly, the training/learning of NARX neural networks model is performed by feeding the data consisting of input and target variables. Then, the trained model will be evaluated using test dataset once the training performance is satisfactory. Here, we use the first 85% of the daily data for training. The rest 15% proportion is used as test dataset. This proportion is quite common in machine learning model development.

This study performs regressive model development. For regression, the model accuracy can be evaluated using some common metrics such as presented in Pena et al. (2020). Similarly in this study, the metric used to evaluate the model accuracy are Nash–Sutcliffe efficiency coefficient (NSE) and root mean squared error (RMSE). These two metrics can be expressed as follows:

$${\text{RMSE}} = \sqrt {\mathop \sum \limits_{i = 1}^n \frac{{\left( {\hat{y}_i - y_i } \right)^2 }}{n}}$$
(1)
$${\text{NSE}} = 1 - \frac{{\sum_{i = 1}^n \left( {y_i - \hat{y}_i } \right)^2 }}{{\sum_{i = 1}^n \left( {y_i - \overline{y}_i } \right)^2 }}$$
(2)

The regression performance of hydrological model is often assessed using NSE. According to Pena et al. (2020), NSE outperforms other metric such as coefficient of determination, (regression coefficient). It is indicated that NSE > 0.75 is substantially very nice fit model while NSE < 0.5 implies poor performance of the model.

4 Results

The first experimentation is to get optimum delay (lag) number for the input (\(d_i\)) and the feedback \(\left( {d_o } \right)\) as closed-loop mode NARX neural networks architecture is implemented. Here, the auto-correlation and cross-correlation analysis are conducted. As shown in Fig. 4, the sample auto-correlation function for SL has quite significant correlation of its delayed value of 1–5, i.e., previous 1st to 5th months.

Fig. 4
figure 4

Graph showing sample auto correlation function (SL)

Likewise, the cross-correlation analysis between SL and SS, and also between SL and DC are also conducted. As shown in Fig. 5, some observation can be drawn as initial assessment for the features/input variable selection for the NARX neural networks model. SL has correspondingly very significant correlation to the present (delay of 0) value of both SS and DC. It extents the correlation between SL and both SS and DC with weaker value up to about lag 2 (past two months). With this observation, the developed NARX neural networks forecasting model can be expressed as:

$$\begin{aligned} \widehat{SL}\left( t \right) & = f\left( {SS\left( t \right), SS\left( {t - 1} \right), SS\left( {t - 2} \right), DC\left( t \right), DC\left( {t - 1} \right), DC\left( {t - 2} \right), } \right. \\ & \quad \left. { \widehat{SL}\left( {t - 1} \right), \widehat{SL}\left( {t - 2} \right),\widehat{SL}\left( {t - 3} \right),\widehat{SL}\left( {t - 4} \right), \widehat{SL}\left( {t - 5} \right)} \right) \\ \end{aligned}$$
(3)
Fig. 5
figure 5

Graph showing cross-correlation function (SL-SS and SL-DC)

The forecasting result evaluated on training data is presented in Figs. 6 and 7. With the outcome of NSE = 0.99 and RMSE = 0.14, the model fits well the training dataset. The NARX neural networks model produces very high NSE and low RMSE. The blue line (diagonal line) in Fig. 7 indicates the fitting line for regression between the observed and forecasted values.

Fig. 6
figure 6

Graph showing forecasting performance evaluated on training data set

Fig. 7
figure 7

Graph showing observed versus forecasted value (training dataset)

Furthermore, the forecasting result evaluated on test dataset is presented in Figs. 8 and 9. The successfully trained model is used recursively to predict 61 step ahead, i.e., prediction in a closed-loop mode. The result indicates excellent capability of the model with NSE = 0.99 and RMSE = 0.25, i.e., only slightly lower RMSE as comparted to that on training data. This means the model can generalize the data very well. This also indicates the good capability of the Bayesian regularization algorithm used in the model training.

Fig. 8
figure 8

Graph showing forecasting performance evaluated on test dataset

Fig. 9
figure 9

Graph showing observed versus forecasted value (test dataset)

5 Conclusion

In this study, sediment load forecasting using NARX neural networks have been proposed. The forecasting is performed in multi-step ahead manner. The study employed the data collected from 8 upstream stations of Ringlet reservoir catchment located in Cameroon Highland, Pahang, Malaysia. The results show the excellent capability of NARX neural networks to perform multi-step ahead forecasting of sediment load in recursive way based on its past values, the past values of suspended solid and discharge. The model produces low RMSE values for both training and testing data with Nash–Sutcliffe efficiency coefficient (NSE = 0.99). In the future, the data quality will be improved as in this study the data is assumed to be in uniform temporal of daily time-series. This to avoid some missing data and non-uniformity of the temporal sequence.