Introduction

In hydrology, a watershed is an area of land that drains to a common point, such as a river, lake, or ocean. Water flow in a watershed is a critical component of the hydrological cycle, including precipitation, evapotranspiration, and runoff (Jencso et al. 2009).

The availability of a good network for measuring hydrological processes such as rainfall, river flow and groundwater levels, etc., is essential for proper management and use (Huang and Yang 1998; Zamoum and Souag-Gamane 2019), especially in areas that suffer from water scarcity on one side and the low density of the measurement network on the other side (Swain et al. 2015). Having current flow data on a spatial and temporal scale has many advantages, including a better understanding of climate change, the effects of land use change, and other environmental factors on water resources (Gudmundsson 2021; Pokhrel et al. 2017). Moreover, researchers can well assess the impact of human activities on water resources by examining changes in water availability by analyzing water stream flow data over time, (Sivapalan et al. 2012) and studying several natural phenomena such as droughts and floods (Le et al. 2022; Maity and Kashid 2011).

A good understanding of the change in streamflow records in a particular watershed is difficult because it is linked to many direct and indirect factors, such as the low density of the measuring network (Xuan Do et al. 2020) and sometimes no stream gauges in the small catchment areas, In some cases, different agencies or organizations may collect streamflow data but not shared with others (Gerlak et al. 2011; Kibler et al. 2014). Finally, changes in land use such as human activities, agriculture, urbanization, and deforestation, can significantly impact streamflow by altering the water balance of a watershed (Allan 2004; Bosch and Hewlett 1982; Zhang et al. 2001).

For many decades stream flow prediction has been an important topic of hydrology and water resource engineering (Beven 2006; Gupta et al. 1999). Initially, reliance on statistical techniques to establish a relationship between historical data and future flows (Nash and Sutcliffe 1970; Razavi et al. 2012). Despite their simplicity and ease of use, these models' accuracy in prediction is related to the quality and quantity of available data. Rating curves remained the most widely used tools for stream flow estimation (Kiang et al. 2018), which relied on relating the river water level to the corresponding flow rate. The development of rating curves relies on collecting flow and stage water data across a range of flow conditions and then constructing a curve on the data using regression analysis (Sivapalan et al. 2012). The development of rating curves allows estimating the flow rate at any point in time based on the current phase measurement.

Despite their potential advantages, streamflow models require facing some challenges. One of the most important challenges that can make it difficult to develop accurate and effective models is the issue of data availability, both in terms of quality and quantity, (Ghimire et al. 2021). This is often found in remote or developing areas where data may be little or completely absent and of poor quality. In addition, we may find streamflow models are more complex, requiring great expertise in their creation and interpretation (Kavetski et al. 2006; Wagener et al. 2003).

This complexity may limit its scope of use, especially in regions with limited technical expertise. Finally, streamflow models differ concerning the level of uncertainty, the latter of which can be influenced by a range of factors, which would particularly affect flow estimation. Uncertainties can be resolved through sensitivity analyses and model calibration if additional data and resources are available.

However, In recent years, a growing body of scientific research has used machine learning techniques for streamflow estimation because of its ability to capture complex non-linear relationships between different hydrological variables and streamflow and use them to estimate streamflow at ungauged or poorly measured sites. Some popular machine-learning methods for streamflow estimation include artificial neural networks, decision trees, support vector machines, and random forests. These models have been successfully applied in a range of studies. Wang et al. (2006) used three types of hybrid artificial neural network (ANN) models, namely the threshold-based ANN (TANN), cluster-based ANN (CANN), and periodic ANN (PANN). The latest hybrid model gave better prediction results than other models for daily discharge forecasting of the headwater region of the Yellow River northeast of the Tibet Plateau in China. Rahsepar and Mahmoodi (2014) proposed an algorithm combining ANN and ABC to predict the future discharge of the Tang-e Karzin hydrometric station of Salman Farsi Dam, South Iran, with good results. The ABC significantly improved the performance of the ANN, thus improving the prediction of future discharge in the study area. Adnan et al. (2017) used three measures, the coefficient of determination (R2), the root mean square error (RMSE) and the mean absolute error (MAE), to evaluate the accuracy of the ANN and the support vector machines (SVM) in predicting monthly flow in the upper Indus basin, north of India. These measures showed that SVM has the best accuracy in predicting monthly flow. Katipoğlu and Can (2018) used the Auto-Regressive (AR) model to model monthly streamflow in Karasu River in the Euphrates Basin. Cheng et al. (2020) used ANN and long and short-term memory (LSTM) to forecast streamflow using precipitation and runoff datasets in the Nan River Basin and Ping River Basin, Thailand. The results showed that the LSTM model is superior to the ANN model in daily prediction. For multi-month prediction, the LSTM model showed less satisfactory results, and this is due to the limited availability of monthly training data. Katipoğlu (2020) employed Extreme gradient boosting (XGBoost) and K-Nearest Neighbors (KNN) to predict monthly streamflows in the lower Euphrates basin. According to the analysis, Xgboost was found to be superior to KNN. Siddiqi et al. (2021) used regression extreme learning machines (ELM) and ANN to estimate mean monthly upstream flow for the Tarbela dam in the Indus River basin. Ghimire et al. (2021) have developed a new deep-learning model called CNN‑LSTM based on integrating CNN and LSTM to predict the hourly Qflow at Brisbane River and Teewah Creek, Australia, using deep neural networks. Pini et al. (2020) have used different machine learning algorithms such as ANN, support vector machine (SVM), and random forest (RF). Ha et al. (2021) used the monthly streamflow data of the Yangtze River from 1952 to 2018 to predict the monthly streamflow of the Yangtze River to estimate the streamflow in Lake Como (Italy). Le et al. (2022) have tried to estimate the monthly streamflow over several areas in the world, such as North America, South America, and Western Europe using three machine learning such as SVM, RF, and gradient-boosted trees, Akbarian et al. (2023) investigated the effect of these variables on the accuracy of streamflow forecasting using learning models, multiple linear regression (MLR), ANN, SVM, RF, and XGBoost, concerning the results, it showed a significant effect of surface runoff on the accuracy of flow forecasting, followed by precipitation and temperature, with regard to the performance of the models, the results showed that machine learning models, especially ANN, XGBoost and RF, can provide accurate predictions of surface runoff compared to other used models, to improve surface water management through accurate prediction of discharge in drought-prone areas. If we compile the results of these studies, we find that the use of artificial intelligence models for the estimation of flow offers many advantages, especially in terms of accuracy, and it helps to make informed decisions about the hydrological domain. However, the availability of high-quality data remains the main responsible for the accuracy of these models.

This study combines the ANN algorithm and artificial bee colony (ABC) to make a new integrated hybrid model called ABC-ANN for estimation streamflow time series. Furthermore, this model was combined with various signal decomposition techniques to assess its efficiency in estimating streamflow time series in the East Black Sea Region (Türkiye).

Material and method

The East Black Sea Region

The study area includes three cities located in the north-east of the country, on the shores of the East Black Sea region, Rize, Ordu, Trabzon, which are located 41°01′29″N 40°31′20″E, 40°77′45″N 37° 44′08″ and 41°00′18″N 39°43′21″E respectively. The climate of the study area is the same as the climate of the east Black Sea region, a humid subtropical climate with warm and sometimes cold summers due to the direct influence of the Black Sea, where the average temperature is about 26.5 C. It is characterized by mild to cold winters, sometimes due to snowfall, especially in the mountains, with the average temperature reaching up to 5.7 C for the period (1991–2020). Precipitation in the study area is relatively equal and moderate to high, especially in the late autumn season from (October to December), especially in the mountainous areas, where they receive large amounts of rain, with an average of 178 mm. Most of the streams in the area flow vertically to the sea in narrow and deep valleys. (Turkish State Meteorological Service 2021). The location information of the stream gauging station (SGS) for which the current is estimated is shown in Fig. 1. In Table 1, the locations of SGS are addressed.

Fig. 1
figure 1

Geographical coordinates of SGSs used in the study

Table 1 Information on SGS used in the current study

Statistical coefficients of monthly average flow data from 3 SGS in the Eastern Black Sea Region are given in Table 2. The data structure can be considered and model assumptions can be tested by evaluating the mean values, standard deviations and distributions of the stream flows according to these statistical parameters.

Table 2 Statistical characteristics of streamflow data

Methods

Artificial neural networks

ANNs are among the most preferred AI models for predicting incomplete hydrological data such as precipitation and river flow (Dawson et al. 2005; Kueh and Kuok 2018). These models can quickly model the relationship between variables. ANNs consist of input, hidden and output layers. The layers learn by changing the information between them. The training process is based on reducing the error between the expected and actual output values by adjusting the model parameters. The model estimates streamflow data based on training of historical data. The computational steps of the ANN model are presented in Fig. 2. The mathematical formula of the ANN is given in Eq. 1.

$$y=f\left(\sum_{i=1}^{n}{w}_{i}{x}_{i}+b\right)$$
(1)

where y indicates the output, f shows the transfer function, wi is the weight vector, xi is the input vector, and b is the bias (Katipoğlu 2022).

Fig. 2
figure 2

Algorithmic program showing the setup steps of the ANN model

Artificial bee colony (ABC)

ABC algorithm is one of the artificial intelligence techniques inspired by the intelligent behavior of bees in their search for food. Which focuses on studying the collective behavior of decentralized systems, represented by groupings of simple elements that interact locally with each other and with the surrounding environment. This algorithm is used to solve many optimization problems, i.e. issues that require reaching the optimal solution from a set of proposed solutions. (Karaboga and Basturk 2007). The ABC system has three types of bees: worker, observer, and explorer. After that, the bees disperse to search for food sources and if there is an ample nectar source compared to other sources in their search area. This determines the most abundant source among all the abundant sources (Karaboga et al. 2014) and this is exactly what optimization issues require. It is accepted that streamflow estimation is a challenging scenario that requires the utilization of a non-linear estimator to accurately reflect the correlating associations and dynamics. The current research used the ABC-ANN model as a non-linear estimator, taking advantage of the capabilities of the ANN to accurately depict the intricate and non-linear relations found in streamflow data. The ABC optimization algorithm was employed to optimize the ANN's parameters for enhancing the ability of ordinary ANN to capture the complex patterns in streamflow time series datasets. Figure 3 shows the application steps of ABC optimization technique.

Fig. 3
figure 3

Algorithmic program showing the setup steps of the ABC optimization (Durgut and Aydin 2021)

The ABC algorithm conceptualizes natural processes and activities by representing them as algorithmic components and functionalities. In this representation, the concept of a "food source" is transformed into a "feasible solution" denoted as xi, while the "nectar amount" corresponds to the fitness of a solution indicated by F(xi) as described in Eq. (2).

$$F\left({x}_{i}\right)\left\{\begin{array}{c}\frac{1}{1+\mathrm{f}({\mathrm{x}}_{\mathrm{i}})} {\mathrm{f}({\mathrm{x}}_{\mathrm{i}})}_{ }\ge 0\\ \left|{1+\mathrm{f}({\mathrm{x}}_{\mathrm{i}})}_{ }\right| otherwise\end{array}\right.$$
(2)
$${\mathrm{v}}_{i}={\mathrm{x}}_{i}+\varphi i\left({\mathrm{x}}_{i}-{\mathrm{x}}_{n}\right)$$
(3)

In the given equations, xi represents the current solution, xn represents the neighboring solution, and vi represents the candidate solution. The variable φi is a randomly generated number within the [-1, 1] range. The index i takes on values from 1 to N, indicating the index of the food source, where N represents the total number of food sources. Additionally, when the onlooker bees fail to find an improved solution, the scout bees can be generated using Eq. (4).

$${x}_{ij}={\mathrm{LB}}_{j}+rand\left(\mathrm{0,1}\right)\times \left({UB}_{j}-{LB}_{j}\right)$$
(4)

In the given context, xi,j represents the jth decision variable within the solution vector xi. The index j ranges from 1 to D, meaning the total decision variables. Additionally, LB and UB denote the lower and upper boundary values specified for the decision variable (Durgut and Aydin 2021).

Local mean decomposition (LMD)

Applicable in various fields and of numerous applications because of its power (Lei et al. 2013), this signal processing technique makes complex signals into simpler components according to their local average frequencies. The LMD algorithm can iteratively extract a series of component functions from the input signal, each representing a different frequency range (Huang et al. 1998). Then, these component functions are combined to form the original signal, which can be reconstructed by summing all the extracted components (Huang et al. 1998). Then, these component functions are combined to form the original signal, which can be reconstructed by summing all the extracted components.

Application steps of the LMD technique consist of tree step: (i) Determining the input time series to the model, (ii) Determining the parsing level, (iii) Obtaining subcomponents. In LMD, the process of smoothing a signal involves applying moving averaging, while the weighting is determined by examining the gap between consecutive extrema. To begin the decomposition, the first step entails computing each half-wave oscillation's maximum and minimum points. In this scenario, the mean value "mi" for the "ith" oscillation, positioned between two successive extrema "ni" and "ni + 1," is calculated according to the following method:

$${m}_{i}=\frac{{n}_{i}+{n}_{i+1}}{2}$$
(5)

A uniformly varying continuous local mean function m(t) is obtained. Half-wave oscillations are expressed as follows:

$${a}_{i}=\frac{|{n}_{i}-{n}_{i+1}|}{2}$$
(6)

For most natural data, LMD follows an iterative process that effectively captures a positive instantaneous frequency from a purely frequency-modulated signal with a constant envelope (Smith 2005).

Complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)

Flandrin et al. (2011) proposed CEEMDAN, which separates complex signals into their basic components using IMFs. The CEEMDAN algorithm uses an adaptive sifting process that dynamically adjusts the bandwidth of the sifting window for each IMF to avoid mode mixing and improve the decomposition accuracy (Li et al. 2015). In addition, CEEMDAN reduces the effects of noise on the separation results. It then calculates the final decomposition result of each community member's IMFs based on the average. EEMD results are affected by residual noise that causes problems in the opposite direction. The number of trials can be improved with this technique by increasing the number of eliminations. To reduce mode mixing and the number of trials, the CEEMDAN technique is utilized (Torres et al. 2011). Signal processing techniques such as LMD and CEEMDAN provide advantages in reducing the noise in the data by separating the various subbands of the input time series, modeling the subcomponents at different frequencies and better understanding the structure of the data.

The steps for conducting a CEEMDAN analysis are as follows:

Apply the identical EEMD technique to compute the first modal function.

$$\overline{{IMF }_{1}(t)}=\frac{1}{N}\sum\nolimits_{j=1}^{N}{IMF}_{1}^{j}(t)$$
(7)

Additionally, a distinctive initial residue is determined by performing the following calculation:

$${r}_{1}\left(t\right)=x\left(t\right)- \overline{{IMF }_{1}(t)}$$
(8)

The kth Intrinsic Mode Function (IMF) component, denoted as emd(t), is defined by applying Empirical Mode Decomposition (EMD). Next, the sequence sequencer1(t) + p1 * emd1(nj(t)) is decomposed to obtain the second IMF component.

$$\overline{{{IMF }_{2}\left(t\right)}} = \frac{1}{N}\sum\nolimits_{j=1}^{N}{emd}_{1 }[ {r}_{1}t+{p}_{1} {emd}_{1}({n}_{j}(t)]$$
(9)

The residual signal is indicated following:

$${r}_{2}\left(t\right)={r}_{1}\left(t\right)- \overline{{IMF }_{2}\left(t\right)}$$
(10)

Similar to the procedure outlined in steps 1 and 2, the kth residual signal can be represented using the given equation.

$${r}_{k}\left(t\right)={r}_{k-1}\left(t\right)-{IMF}_{k}\left(t\right)$$
(11)

Similarly, the k + 1th Intrinsic Mode Function (IMF) component can be expressed using the provided equation.

$$\overline{{{IMF }_{k+1}\left(t\right)}} = \frac{1}{N}\sum\nolimits_{j=1}^{N}{emd}_{1 }[ {r}_{k}t+{p}_{k} {emd}_{k}({n}_{j}(t)]$$
(12)

Continue iterating steps 1 to 3 until the residual signal satisfies the specified termination criterion. Assuming there are L IMF components, the original sequence can be expressed using the mentioned equation (Torres et al. 2011; Rezaie-Balf et al. 2019).

$$y\left(t\right)=\sum_{i=1}^{L}\overline{{{IMF }_{i}\left(t\right)}} {+r}\left(t\right)$$
(13)

Comparison of methods

For our case and to study the performance of the methods used to estimate the monthly streamflow, four indicators are used:

Mean squared error (MSE)

One of the common metrics used to evaluate the performance of regression models is the MSE scale. Where measures the average squared differences between the predicted (ŷ) values and the actual values (y). In other words, it measures the average amount by which forecasts deviate from actual values. The smaller the MSE value, the higher the accuracy of the model

$$MSE=\left(\frac{1}{n}\right)\sum {({y}_{i}-{\widehat{y}}_{i})}^{2}$$
(14)

Mean absolute percentage error (MAPE)

MAPE is a commonly used metric, as it measures the average percentage difference between the expected and actual values to evaluate the performance and calculate the prediction accuracy in terms of the error rate for the prediction model.

MAPE is calculated by Eq. 15 where (ŷ) are the predicted values and (y) are the actual values

$$MAPE=\left(\frac{1}{n}\right)\sum \left|\frac{({y}_{i}-{\widehat{y}}_{i})}{{y}_{i}}\right|$$
(15)

Correlation and determination coefficient

Sometimes we try to find if there is a relationship or a connection between two or more variables. The correlation coefficient can answer this question graphically or numerically by calculating the correlation coefficient (R). The square of the correlation coefficient is the determination coefficient (R2) and is calculated as follows:

$${{R}^{2}=\left(\frac{{\sum }_{i=1}^{N}\left({x}_{i}-\overline{x }\right)*\left({y}_{i}-\overline{y }\right)}{\sqrt{{\sum }_{i=1}^{N}{\left({x}_{i}-\overline{x }\right)}^{2}*}{\sum }_{i=1}^{N}{\left({y}_{i}-\overline{y }\right)}^{2}}\right)}^{2}$$
(16)

Nash–sutcliffe efficiency (NSE)

NSE is an efficiency parameter that shows the fit and relationship between two-time series. NSE is a preferred indicator mostly to show the accuracy of the model. This indicator shows the variation of the observed data with the predicted data. NSE values range from -∞ to 1. A value of 1 for NSE indicates that the estimated dataset perfectly matches the actual data. It can be said that the closer the NSE value is to 1, the higher the model performance.

$$NSE=1-\frac{{\left(\sum_{i=1}^{N} \left({y}_{i}-{\widehat{y}}_{i}\right)\right)}^{2}}{\sum_{i=1}^{N} {\left({y}_{i}-{\overline{{\widehat{y} }_{i}}}\right)}^{2}}$$
(17)

Kling-gupta efficiency (KGE)

Developed by Gupta et al. (2009), the KGE is an indicator of goodness of fit for hydrological modeling to enable its different components to decompose correlation, variability bias, and mean bias properties. Kling et al. (2012) developed the NSE indicator to ensure that the rates of bias and variability were not cross-correlated and recommended this indicator.

$$KGE=\sqrt{\left(R-1\right)+(a-1 {)}^{2} \;(b-1 {)}^{2}}$$
(18)
$$a=\frac{{S}_{{\widehat{y}}}}{{{S}_{y}}}, b=\frac{\overline{{\widehat{y} }_{i}}}{\overline{y} }$$
(19)

\({S}_{{\widehat{y}}}\) shows standard deviation of predictions, \({\mathrm{S}}_{\mathrm{y}}\mathrm{ indicates}\) standard deviation of observations, \(\overline{{\widehat{\mathrm{y}} }_{\mathrm{i}}}\mathrm{ is average}\) of predictions, \(\overline{\mathrm{y}}\mathrm{ is }\) \(\mathrm{average of}\) observations. If the KGE value is close to 1, it means that the prediction results of the model have perfect agreement with the real values. The fact that the KGE's values are equal to zero indicates no relationship between the model estimates and the actual data.

Taylor diagrams

Taylor diagrams are used in various fields, such as meteorology and oceanography. It is a graphical method for comparing similarity and statistical parameters between two or more data sets. For this, the closeness of the prediction models to the reference point and the values of the statistical parameters are evaluated. In addition, the most suitable model can be decided according to the correlation coefficient between the prediction models and the real data set. The Taylor diagram provides a clear and concise way to visualize the similarities and differences between multiple datasets in a single graph (Taylor 2001).

Results and discussion

Streamflow estimation is vital for the risk management of floods, the supply of water resources, the construction of water structures and irrigation planning. Streamflow prediction accuracy has improved with the development of artificial intelligence technologies. In addition, ANN techniques are strengthened with various signal separation and bio-inspired optimization techniques. This study evaluated the one-month lead-time streamflow prediction success of the ANN model combined with LMD, CEEEMDAN and ABC algorithms. Furthermore, the performance of the created hybrid ANN model was evaluated according to various statistical and graphical approaches.

Choosing the input combination is critical to determine the best streamflow prediction model. Within the scope of the study, a suitable model structure was established using partial autocorrelation function (PACF) graphics. PACF graphs are shown in Fig. 4. Accordingly, lagged values exceeding 95% confidence limits were used in the modeling. Lagged values that exceed the confidence limits and have a high correlation in the PAC graphs are presented as input to the proposed model for streamflow estimation. For the analysis of the streamflows in Ordu, Trabzon and Rize according to PAC charts, 1, 2, 10, 11, 12-month delayed values, 1, 2, 11, 12-month delayed values and 1, 2, 11, 12-month lagged values were selected as an input, respectively (Table 3). Also, Fig. 5 shows Taylor diagrams of selected inputs based on PACF in each station, which were used for the input of estimator models. The t-11 datasets (green point) are most closet point to the target dataset (t + 1 which is black point) and t dataset (purple point) and t10 dataset (yellow point) are located at the second and third closet points (respectively) to the target dataset in all stations.

Fig. 4
figure 4

PACF of streamflows of used SGS

Table 3 Selected model combinations for Q (t + 1) streamflow estimation
Fig. 5
figure 5

Taylor diagrams of selected input data for estimator models based on PACF in each station; t-11: green point, t-10: yellow point, t-9: red point, t-1: blue point, t: purple point, t + 1: black point

Figure 6 displays the subcomponents of delayed streamflow values obtained through the CEEEMDAN and LMD techniques in Ordu-2238 no SGS. The objective is to enhance the model's performance by presenting the obtained subcomponents to the ABC-ANN model. The CEEEMDAN algorithm separates lagged current values into various numbers of IMFs and residuals, enabling the intelligent model better to evaluate the input values' fluctuations and trends. With the LMD algorithm, the streamflow values are decomposed into sub-components, or product functions (PF), considering the impact of noise and outliers on the modeling. The goal is to strengthen the ABC-ANN model by incorporating these sub-components.

Fig. 6
figure 6

Streamflow values decomposed to subcomponents in Ordu- 2238 no SGS a) CEEEMDAN algorithm b) LMD algorithm

In Table 4, the performance evaluation of the models used in streamflow estimation has been made. Accordingly, the accuracy of the models was compared according to the KGE, MSE, NSE and R2 statistical indicators. Therefore, the training and test results of the CEEEMDAN-ABC-ANN model in SGS no. 2238 in Ordu and SGS no. 2202 in Trabzon showed more successful results than ABC-ANN and LMD-ABC-ANN models in monthly streamflow estimation. The hybrid model CEEEMDAN-ABC-ANN has the highest prediction accuracy at station 2238 with the following values for training (MSE: 43.58, R2: 0.79, NSE: 0.79, KGE: 0.84) and testing (MSE: 54.44, R2: 0.67, NSE: 0.67, KGE: 0.75). At station 2202, the CEEEMDAN-ABC-ANN hybrid model has the highest prediction accuracy with the following values for training (MSE: 19.15, R2: 0.83, NSE: 0.83, KGE: 0.84) and testing (MSE: 41.86, R2: 0.66, NSE: 0.66, KGE: 0.76). In addition, the ABC-ANN model showed higher success than LMD-ABC-ANN and CEEEMDAN-ABC-ANN hybrid models during training and testing stages in Rize-2215 with no SGS. At station 2215, the ABC-ANN model has the highest prediction performance with the following statistical values for training (MSE: 17.88, R2: 0.89, NSE: 0.88, KGE: 0.91) and testing (MSE: 36.92, R2: 0.80, NSE: 0.80, KGE: 0.86).

Table 4 Performance analysis of streamflow prediction via ABC-ANN, LMD-ABC-ANN, and CEEEMDAN-ABC-ANN models

In Fig. 7, the accuracy of the models used to estimate streamflow in SGS 2238 was evaluated according to the scatter diagrams. A scatter plot is a graphical indicator plotted along the X and Y axis to visualize the relationship between two variables and reveal correlations and outliers. In order to determine the appropriate model according to the scatterplots, an evaluation is made according to the distribution of the points around the 45-degree line. When the scatter diagrams are compared, it can be said that the performances of the established models in the streamflow estimation are close. Still, it can be said that the CEEEMDAN-ABC-ANN hybrid model gives more successful results than the other models. Figure 7 shows strong correlations between observed streamflow and predicted streamflow values for the training and testing phases at Ordu station. Generally, correlations between predicted and observed streamflow in the training phase are higher than corresponding values in the testing phases.

Fig. 7
figure 7

Evaluation of model performances in streamflow prediction at 2238 no SGS with scatter diagrams

Figure 8 analyzes the scattering diagrams of the models used to estimate streamflow in SGS 2202. When the scatter diagrams were evaluated, it was determined that the performances of the established models in the streamflow prediction were close to each other. However, it is seen that the CEEEMDAN-ABC-ANN hybrid model gives slightly more accurate results than other models. This issue indicates the higher performance of CEEEMDAN-ABC-ANN in both training and testing phases than other applied models. Also, it shows the ability of the ABC algorithm as a boosting tool to optimize the performance of the ANN model for streamflow precision.

Fig. 8
figure 8

Evaluation of model performances in streamflow prediction at 2202 no SGS with scatter diagrams

In Fig. 9, the streamflow prediction performances of the model established in SGS 2215 are compared according to the scatter diagrams. According to these diagrams, the ABC-ANN model has higher accuracy than CEEEMDAN-ABC-ANN and LMD-ABC-ANN models in the training and testing stages. This higher performance of the ABC-ANN model is demonstrated in the scatter plot, where the points for the ABC-ANN model are more closely grouped around the actual target values.

Fig. 9
figure 9

Evaluation of model performances in streamflow prediction at 2215 no SGS with scatter diagrams

Figure 10 shows time series plots of predicted and actual values in SGS 2238. These plots evaluated the relationship and spread between the actual and estimated streamflow values. According to Fig. 10, it can be said that the estimation results of the CEEEMDAN-ABC-ANN hybrid model during the training and testing phase are superior to other models since they spread following the real values. In addition, the LMD-ABC-ANN model is the weakest since it predicts the maximum current values with less accuracy than other models. The distribution of the predicted streamflow values around the observed streamflow values indicates that the CEEEMDAN-ABC-ANN model successfully captures the complexity and variability of streamflow in SGS 2238 with a high accuracy in the training and testing phases.

Fig. 10
figure 10

Comparison of model performances with time series plot in 2238 no SGS

Figure 11 presents time series plots of estimated and actual streamflow values in SGS 2202. These plots analyzed the relationship and spread between the actual and estimated streamflow values. The estimation results of the CEEEMDAN-ABC-ANN hybrid model during the training and testing phase are superior to other models since they best match the actual values. In addition, it is emphasized that the LMD-ABC-ANN model is the weakest because it predicts the maximum streamflow values with less accuracy than other models and has less overlap with the actual values.

Fig. 11
figure 11

Comparison of model performances with time series plot in 2202 no SGS

Figure 12 shows time series plots of streamflow values in SGS 2215. According to these plots, it is noteworthy that the actual and the estimated streamflow values overlap to a large extent. In addition, when the time series plots are examined in detail, it is seen that the ABC-ANN model represents the actual streamflow values better than the other models.

Fig. 12
figure 12

Comparison of model performances with time series plot in 2215 no SGS

In Fig. 13, the potentials of the streamflow prediction models are compared with the Taylor diagrams. These graphs compared the estimated currents during the training and testing phases with the actual values. According to the statistical properties of the prediction model, which is close to the reference point, the most superior model was decided. According to these diagrams, it has been determined that the CEEEMDAN-ABC-ANN model in SGS 2238 and 2202 has the highest accuracy since it is closest to the reference point and has low RMSE and high R2 values. Accordingly, it can be deduced that the CEEEMDAN technique is superior to the ABC-ANN model with its noise reduction in the input stream data, solving the mode mixing problem and time-varying structure. In addition, since the ABC-ANN model is closest to the reference point in SGS 2215, it is deduced that the estimations are the most realistic. In addition, all models showed satisfactory results in the flow estimation with values in the range of 0.80 to 0.95.

Fig. 13
figure 13

Comparison of model performances with Taylor diagrams: a) 2238 no SGS training, b) 2238 no SGS test, c) 2202 no SGS training, d) 2202 no SGS test, e) 2215 no SGS training, f) 2215 no SGS test

Discussion

The current study's main goal is to propose a new AI-based model coupled with data preprocessing approaches to enhance the accuracy of ABC-ANN for streamflow simulation. The overall results showed that both preprocessing approaches (LMD and CEEEMDAN) increased the capability of ABC-ANN in the Ordu station, and CEEEMDAN-ABC-ANN only performed better than the ABC-ANN in Trabzon. Also, both LMD and CEEEMDAN acted worse than the ABC-ANN model in Rize station for streamflow simulation. Various results are reported based on different inputs to each station's different time series behavior. The key advantage of the ABC-ANN approach is that the ANN's parameters can be tuned via an optimized framework (ABC) to reach the highest accuracy of time series simulation via ABC-ANN mode. Furthermore, adding preprocessing techniques (e.g., CEEEMDAN) can help the model to detect the non-linear behavior streamflow, and therefore CEEEMDAN-ABC-ANN utilized both advantages of bio-inspired and preprocessing methods.

It can be observed that all applied models are more capable in streamflow simulation of the training phase. This issue can be justified due to the existence of a large amount of peak flow in the testing phase, which was not learned by models in the training phase and can affect the final result of the testing period. Although CEEEMDAN-ABC-ANN reported better streamflow simulation results for streamflow forecasting, some limitations to implanting CEEEMDAN-ABC-ANN include (1) combining CEEEMDAN with the ABC-ANN adds more complexity to the final model (CEEEMDAN-ABC-ANN). Furthermore, (2) although CEEEMDAN can decompose the original time series of streamflow data into several signals, ABC-ANN needs more effort and process and make more relationships between all these separated signals during the learning phase of the ANN model, which can increase model running time. (3) using the original time series of streamflow data is always understandable in the hydrological view, but using decomposed signals could not be understandable in the hydrological view. The modeling process could be a (deeper) black box with less explanation. (4) The ABC-ANN uses the artificial bee colony optimization algorithm in the ANN learning rate for tuning the ANN model's hyperparameters. Sometimes the algorithm could be trapped in the local minimize and also it is sensitive to choosing algorithm initial parameters and; therefore, reaching optimal results by ABC-ANN has more challenges compared with the ordinary ANN model.

In recent years, metaheuristic optimization algorithms have been successfully coupled with AI models as optimizer tools in solving complex non-linear issues for hydrological modeling tasks (Mahmoudi et al. 2022; Maroufpoor et al. 2019; 2020). Due to the high computational performance of AI-based models in solving non-linear problems, these models have been used for streamflow simulation worldwide. However, due to AI models' lack of hydrological terms, they fail to interpret hydrological processes (Mohammadi et al. 2022) physically. Previous studies such as Cheng et al. (2020), Difi et al. (2022), Wang et al. (2022), and Ayana et al. (2023) recommended AI techniques as powerful tools for capturing streamflow time series, while they also mentioned these AI techniques are sensitive to use data. Therefore, they should be trained well with enough time series data. However, both hydrological physically based models and AI-based models have some advantages and disadvantages in their application, while the type of case study can decide which type of model could be suitable. The ABC algorithm showed it can find the optimal solution with a high probability (Wang et al. 2020). Although it uses fewer control parameters, the ABC model can effectively solve multidimensional multimodal optimization, giving better results than other models (Karaboga and Akay 2009). Thanks to their ability to analyze non-linear and non-stationary data, the two signal processing techniques, Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Local Mean Decomposition (LMD) have become popular and the most used in many fields. However, both CEEMDAN and LMD have some disadvantages as well. CEEMDAN can be computationally expensive, especially for large datasets, and may require significant computational resources. On the other hand, LMD may not perform well for signals with sharp transitions or discontinuities, as it relies on a smooth signal assumption.

Conclusion

Accurate streamflow simulation is vital for water resources management and environmental planning. The current study proposed a novel artificial intelligence technique based on an artificial bee colony combined with ANN (ABC-ANN) and the local mean decomposition (LMD-ABC-ANN) for the monthly streamflow time series prediction in Ordu, Trabzon and Rize hydrometric stations (in the East Black Sea Region Türkiye). The model was enhanced by applying different signal decomposition techniques, including Local Mean Decomposition (LMD) and Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN). A partial autocorrelation function (PACF) was used to detect effective lag times of streamflow as input of estimator models. The lag times including {t, t-1, t-9, t-10, t-11} and {t, t-1, t-10, t-11} and {t, t-1, t-10, t-11} were selected as effective monthly lag times for predicting streamflow in t + 1 in Ordu, Trabzon, and Rize stations, respectively. It can be concluded that all applied models simulated monthly streamflow with reliable performances in this study. The results showed that empirical envelope and complete ensemble empirical mode decomposition with adaptive noise combined with ANN-ABC (CEEEMDAN-ABC-ANN) outperformed in Ordu and Trabzon stations and ABC-ANN simulated streamflow by a higher accuracy compared with other applied models in Rize station. The CEEEMDAN method improved the capability of ABC-ANN in the Ordu and Trabzon stations with R2 = 0.67 and 0.66 for the test section in the Ordu and Trabzon stations, respectively. The results showed that the coupled ABC-ANN model, optimized via the ABC algorithm, achieved accurate streamflow estimation in the studied regions. The fluctuations and noise of streamflow time series data were captured by coupling the LMD and CEEMDAN techniques into estimator models, which helped to enhance streamflow time series prediction. The selection of lagged streamflow values by PACF analysis led to identifying the most effective input variables for each station, which also contributed to improving the accuracy of estimator models. The hybrid CEEEMDAN-ABC-ANN model resulted in lower MSE values and higher R2 values compared to the ABC-ANN model in Ordu and Trabzon stations. This shows that integration of the CEEEMDAN technique improved the accuracy of streamflow estimation during the training and testing phases in Ordu and Trabzon stations. This issue shows this coupled model can follow the dynamic pattern of streamflow time series during the training and testing phase. In Rize station, the ABC-ANN model resulted in a lower MSE and a higher R2 values for the training and testing phases compared to the LMD-ABC-ANN and CEEEMDAN-ABC-ANN models. These findings suggest that the efficiency of the hybrid models might be contingent upon the particular characteristics and patterns of streamflow data at each station. It can be concluded that the CEEEMDAN technique can increase the accuracy of streamflow simulation compared with the LMD method and the CEEEMDAN-based estimators could be tested and generalized for streamflow simulation in various climates. Streamflow time series behavior and its prediction have been better understood using several AI-based methods, such as presented in this study. Streamflow simulation via these methods can be useful to increase our knowledge of the mechanisms that drive streamflow behavior in which these processes may be altering in response to climate change impacts.

Despite the good results that the artificial intelligence models give us, there are some limitations that may decrease the accuracy of the models. One of the basic limitations is the availability of data in terms of quality and quantity and the availability of other parameter data such as evaporation, evapotranspiration, and information on soil properties, and land cover changes. This last one, using it, gives us an evaluation and a better analysis of the effect of other relevant variables on the process flow simulation. Despite all these limitations, the use of artificial intelligence models in forecasting the streamflow has many advantages, which contribute to the good management of water resources, planning for drought, and solving the problem of floods. Finally, the CEEEMDAN-ABC-ANN model can be applied in other hydro-climatic contexts similar to those studied, also integrating the proposed model and comparing the proposed method with other types of metaheuristics optimization algorithms and some newly developed machine learning techniques such as deep learning models, thus providing a useful solution possible management of water resources in different regions of the world.