Ensuring a generalizable machine learning model for forecasting reservoir inflow in Kurdistan region of Iraq and Australia

Latif, Sarmad Dashti; Ahmed, Ali Najah

doi:10.1007/s10668-023-03885-8

Published: 17 September 2023

Volume 26, pages 12513–12544, (2024)

Environment, Development and Sustainability

Abstract

Accurate inflow prediction is a critical non-engineering measure for ensuring flood control and increasing water supply efficiency. In addition, because inflow is the major input into reservoirs, accurate inflow prediction can guide reservoir planning and management. This study aims at generalizing a machine learning model for forecasting reservoir inflow. Daily, weekly, and monthly inflow and rainfall time-series data have been collected as two hydrological parameters to forecast reservoir inflow using a machine learning method, namely, support vector regression (SVR). Four different SVR kernels have been applied in this study: radial basis function (RBF), linear, normalized polynomial, and sigmoid. Two scenarios for input selection have been implemented. Dokan dam in the Kurdistan region of Iraq and Warragamba dam in Australia were selected as the case studies for this research. For the purpose of generalization, the proposed models have been applied to two countries with different climate conditions. The findings showed that the daily timescale outperformed the weekly and monthly timescales, while RBF outperformed the other SVR kernels with root-mean-square error (RMSE) = 145.7 and coefficient of determination (R²) = 0.85 for forecasting daily inflow at Dokan dam. However, the RBF kernel did not perform well for forecasting daily inflow at Warragamba dam. The results showed that the proposed machine learning model performed well only for the Kurdistan region of Iraq, while the result for Australia was not accurate. Therefore, the proposed models could not be generalized.
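
The two scores quoted above can be illustrated with a minimal sketch. It assumes that R² is computed as the squared Pearson correlation between observed and predicted inflow, which is one common convention and may differ from the exact formula used in the study. The function names and the sample values are hypothetical and are not taken from the Dokan or Warragamba datasets.

```python
import math

def rmse(observed, predicted):
    """Root-mean-square error between two equal-length sequences."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def r_squared(observed, predicted):
    """Squared Pearson correlation (assumed definition of R^2 for this sketch)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_o * var_p)

# Hypothetical inflow values, for illustration only.
observed = [120.0, 150.0, 90.0, 200.0]
predicted = [110.0, 160.0, 100.0, 190.0]

print("RMSE:", rmse(observed, predicted))
print("R^2 :", r_squared(observed, predicted))
```

RMSE is expressed in the units of the inflow series, so it can only be compared between models evaluated on the same dataset, whereas R² is dimensionless.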

1 Introduction

Climate change and weather patterns, rising water demand, and poor water resource management practices are all factors that have led to the present global water crisis (Herslund & Mguni, 2019; Kooy et al., 2020; Marlow et al., 2013; Sosa-Rodriguez et al., 2019). Water management is a critical component of the long-term viability of urban development. The scientific community has expressed concern about urban water-related issues all around the world (Jia et al., 2015). Water concerns currently include increasing urban floods, over-exploitation of groundwater, urban water shortages, the waste of rainfall resources, and water contamination as a result of fast urbanization and extreme weather events (Bábek et al., 2020; Nguyen et al., 2019; Wang et al., 2018).

Reservoir inflow is one of the most essential elements in the development, maintenance, and sustainability of riparian ecosystems. Inflow may be thought of as a "master variable" that regulates the abundance and distribution of riverine species (Latif, Ahmed, et al. 2021). Weather (rainfall and temperature) interacts with geology, topography, soil, and vegetation to affect infiltration, evaporation, and run-off generation, all of which influence reservoir inflow. The magnitude and timing of reservoir inflows are key components of the environmental fluxes and ecological integrity of river systems. This "master variable" also shapes river ecosystems and affects fish feeding, migration, nesting, and spawning conditions (Dhungel et al., 2016; O’Keeffe et al., 2019; U.S. Environmental Protection Agency (U.S. EPA), 2008; Xu et al., 2020).

Accurate inflow forecasting is an essential non-engineering measure for ensuring flood-control protection and raising the efficiency of water supply use. In addition, since inflow is the main input into reservoirs, good inflow forecasts may provide direction for reservoir development and management (Apaydin et al., 2020; More et al., 2019; Qi et al., 2019). Because of its importance, numerous reservoir inflow forecasting models and techniques have been created and tested in real-world scenarios (Apaydin et al., 2020).

A variety of hydrologic models have been proposed for inflow prediction over the past decade, but there is no silver bullet: different techniques perform better for particular watersheds, lead times, and types of events (Tikhamarine et al., 2020). Since inflow is the primary input into reservoirs, accurate inflow prediction is not only an important non-engineering method to assure flood-control safety and enhance water resource use efficiency, but it may also give direction for reservoir development and management. Therefore, a capable model for predicting reservoir inflow is crucial (Amnatsan et al., 2018; Yan et al., 2018).

According to recent research, Iraq will face greater issues in the future, with the water deficit worsening over time and the Tigris and Euphrates Rivers anticipated to be dry by 2040. The estimated discharge of the two rivers in 2025 will be drastically reduced (Zakaria et al., 2013). In Australia, overall urban water consumption is expected to rise by at least 39% between 2009 and 2026, following a population increase of more than 24% between 2007 and 2026. Climate change will almost certainly exacerbate the situation on a global and regional basis (Yan et al., 2018). This study focuses on implementing a generalizable model for forecasting reservoir inflow in both countries.

Nowadays, hydrologists focus on machine learning algorithms for forecasting hydrological parameters (Lai et al., 2020; Latif & Ahmed, 2021; Latif et al., 2020, 2021a, 2021b; Najah et al., 2021). For example, Babaei et al. (2019) conducted a study at the Zayandehroud dam reservoir in Iran to predict the dam reservoir inflow, using monthly inflow and rainfall as input parameters. They applied ANN and SVR as their proposed methods. Their findings showed that the SVR model achieved the lowest error for inflow prediction, outperforming the ANN model. Another study was conducted by Zhang et al. (2020) at the Huanren reservoir in China to produce an ensemble of 10-day inflow forecasts. The time scale was 10 days, with different input combinations such as inflow, precipitation, relative humidity, minimum temperature, maximum temperature, and precipitation forecast. They implemented ANN, SVR, and ANFIS as their methods. The decomposition outcome of their study showed that the input set is the dominant source of uncertainty. They found that the contribution of the data-driven model is limited and varies substantially by season, being more significant in winter and summer and smaller in spring and autumn. Furthermore, Y. Yu et al. (2017) carried out a study at the Three Gorges Reservoir (TGR), China. They developed a novel model combining monthly inflow forecasting and multi-objective ecological reservoir operations, with the objective of improving the efficiency of water resource allocation. For the monthly time scale, meteorological and hydrological data were used as inputs. A hybrid model based on SVR and singular spectrum analysis (SSA), namely, SSA-SVR, was applied as the method. The results of the simulations revealed that the proposed coupled model for the TGR would outperform actual TGR operations; moreover, multi-objective ecological operations based on inflow forecasts may help relieve water shortages.

Meanwhile, Al-Suhili & Karim (2015) developed five ANN models for predicting daily inflow at Dokan dam. According to their findings, their proposed model was capable of forecasting daily inflow with a highest correlation coefficient of 0.94. Moreover, Y. Wang et al. (2014) conducted a study to forecast monthly inflow at the Three Gorges Reservoir. Three models, namely, SVR, genetic programming (GP), and seasonal autoregressive (SAR), were implemented in their study, with RBF adopted as the kernel in their SVR prediction model. Their findings showed that the performance of the SVR and GP models improves significantly when they are coupled with SSA for predicting the inflow series. In addition, Halik et al. (2015) utilized a wavelet support vector machine (WSVM) with an RBF kernel for forecasting inflow at Sutami Reservoir, Indonesia. Their findings showed that WSVM with the RBF kernel performed better in forecasting inflow.

This research is based on primary data from Dokan dam, Iraq, and secondary data from Warragamba dam, Sydney, Australia. Reservoir inflow and rainfall have been used as the input parameters for the proposed machine learning models under two different input scenarios. At Dokan dam, the four SVR kernels have not previously been applied to forecasting reservoir inflow in order to identify the most accurate kernel. Therefore, this study aims to fill this gap in the literature by applying four different SVR kernels in order to determine the most accurate kernel for forecasting reservoir inflow.
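
To make the four kernels concrete, the following is a minimal sketch of their standard textbook forms. It is an illustration only, not the implementation used in this study: the function names, the hyperparameter values (gamma, degree, coef0), and the sample vectors are assumptions, and the normalized polynomial kernel follows one common definition in which the polynomial kernel is divided by the geometric mean of its self-similarities.

```python
import math

def dot(x, y):
    """Inner product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(x, y))

def linear_kernel(x, y):
    """K(x, y) = <x, y>"""
    return dot(x, y)

def rbf_kernel(x, y, gamma=0.5):
    """K(x, y) = exp(-gamma * ||x - y||^2); gamma is an illustrative value."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)

def normalized_poly_kernel(x, y, degree=2):
    """K(x, y) = <x, y>^d / sqrt(<x, x>^d * <y, y>^d) (assumed form).

    Undefined for all-zero vectors, since the denominator becomes zero.
    """
    return dot(x, y) ** degree / math.sqrt(dot(x, x) ** degree * dot(y, y) ** degree)

def sigmoid_kernel(x, y, gamma=0.01, coef0=0.0):
    """K(x, y) = tanh(gamma * <x, y> + coef0); parameters are illustrative."""
    return math.tanh(gamma * dot(x, y) + coef0)

# Hypothetical two-feature inputs (for example, lagged inflow and rainfall).
x = [1.0, 2.0]
y = [3.0, 4.0]

for name, kernel in [
    ("linear", linear_kernel),
    ("RBF", rbf_kernel),
    ("normalized polynomial", normalized_poly_kernel),
    ("sigmoid", sigmoid_kernel),
]:
    print(name, kernel(x, y))
```

The RBF and normalized polynomial kernels return 1 when a vector is compared with itself, while the linear kernel is unbounded and therefore more sensitive to the scale of the inputs. This is one reason input scaling is usually applied before fitting an SVR model.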

2 Materials and methods

2.1 Dokan dam

Dokan dam is located on the Lesser Zab tributary, approximately 295 km north of Baghdad and 65 km southeast of Sulaymaniyah (Fig. 1) (Sulaiman et al., 2021). At a normal operating level of 511 m above sea level, the dam height is approximately 116 m, with a total storage capacity of 6.87 × 10⁹ m³ (6.14 × 10⁹ m³ live storage and 0.73 × 10⁹ m³ dead storage) (Ezz-Aldeen et al., 2018). The historical daily time-series inflow and rainfall data were collected from the Ministry of Agriculture and Water Resources, Kurdistan regional government, Iraq, for the period from January 1, 1988, to December 31, 2015 (Fig. 2). The basic statistical characteristics of the inflow data used for Dokan dam are shown in Table 1.

Table 1 Statistical characteristics of Dokan dam inflow data
