Introduction

Hydrology is the science that studies the water cycle and its movement in terms of the topographic features of an area (Zhihua et al. 2020). A river basin or watershed is an area over which various hydrologic processes such as precipitation, snowmelt, interception, evapotranspiration, infiltration, surface runoff, and sub-surface flow are integrated (Islam 2011). Understanding the amount of water produced in the river basin in many ways is essential for managing water resources (Zhihua et al. 2020). However, the estimation of the hydrological parameters by the conventional method is not reliable and time-consuming (Gull and Shah 2020). Besides, the spatial and temporal occurrences of both surface water resource and groundwater resources within most watersheds are not properly known (Tufa and Sime 2020).

Consequently, numerous hydrological models have been developed across the world to know and understand the hydrology and water resources of basins (Devia and Ganasri 2015). The development of hydrologic models that consider physical characteristics of the watershed helps in accurate prediction of hydrological processes of a watershed (Devia and Ganasri 2015; Salih and Hamid 2017; Sime et al. 2020; Tufa and Sime 2020). Therefore, the hydrologic model has become an increasingly important tool to understand the hydrological cycle and can assist in the management of water resources in a watershed (Saifullah et al. 2016; Shiferaw et al. 2018; Serrão et al. 2019). They are used for hydrological simulation to water resource assessment, water allocation, climate change impact assessment, satellite data performance evaluation, and many other purposes (Setegn et al. 2008; Gebre and Ludwig 2015; Asitatikie and Gebeyehu. 2020; Lakew et al. 2020; Aawar and Khare 2020).

The common approach in hydrologic modeling is that the model is used to calculate streamflow based on meteorological data and catchment characteristics (Vidyarthi et al. 2020). Accordingly, two important inputs required for all models are rainfall data and drainage area (Ashenafi and Hailu 2014). Along with these, hydrological models consider watershed characteristics like soil properties, vegetation cover, watershed topography, and soil moisture content; characteristics of groundwater aquifer are also considered (Devia and Ganasri 2015; Roth et al. 2016). Moreover, hydrological modeling uses the conceptual representation of the hydrological cycle to describe the physical process in a watershed and regulate the processes of rainfall to runoff. Subsequently, the spatial and temporal variation and the interaction of hydrological processes are understood (Khayyun et al. 2019). The physical models represent different hydrological processes through mass, momentum, and energy conservation equations and define and analyze the various hydrological parameters like runoff and sediment yield (Jaiswal et al. 2020; Gull and Shah 2020). Thus, they are capable to deal with hydrological processes and predict different components of the hydrological cycle of the watershed by considering the spatial variability of land use, slope, and climate (Jaiswal et al. 2020).

Several studies have been conducted in the Upper Blue Nile basin to simulate and evaluate the water resources of the basin using hydrological models (Tekleab et al. 2011; Mengistu and Sorteberg 2012; Ashenafi and Hailu 2014; Dile and Srinivasan 2014; Abera et al. 2017; Polanco et al. 2017; Liersch et al. 2018; Roth et al. 2018; Woldesenbet et al. 2017; Walelgn 2018; Chimdessa et al. 2019; Dibaba et al. 2020; Tufa and Sime 2020). Some researchers reported successful application of the hydrological model for streamflow and water balance assessment at the basin and sub-basin scale (Tekleab et al. 2011; Abera et al. 2017). Some also model the entire basin to assess the impact of climate change on annual streamflow and hydrological regime of the basin through hydrological simulations (Mengistu and Sorteberg 2012; Liersch et al. 2018; Roth et al. 2018).

However, most of the studies do not provide detailed assessment for each of the hydrological processes and water balance components of the basin in the temporal and spatial extent. Hence, this research applies the physical-based, semi-distributed soil and water assessment tool (SWAT) hydrological model to simulate and examine the hydrological processes of the Upper Blue Nile basin. Furthermore, this study explores the spatial distribution of the hydrological components of the basin. Hence, the results obtained in this research are expected to improve water resource management of the basin and partly overcome problems due to data scarcity of the basin.

Materials and methods

Description of the study area

The Upper Blue Nile river basin is made from multiple tributaries that drain from the northwestern and western parts of Ethiopia. Geographically, the basin covers 7°44′32″–12°45′19″ N latitude in the south–north direction and 34°29′20″–39°48′17″ E longitude in the west–east direction (Fig. 1). The basin surface area is about 174,166 km2. The topography of the Upper Blue Nile is composed of highlands, hills, valleys, and occasional rock peaks (Conway 2000). Most of the streams feeding the Upper Blue Nile are perennial. The basin has an uneven spatial distribution of rainfall and the average annual rainfall varies between 1200 and 1800 mm/year (Fig. 2). The Upper Blue Nile consists of three seasons; from October to the end of February is the dry season, from March to May is a short rainy season, and June to September is the long rainy season (Tesemma et al. 2010).

Fig.1
figure 1

Map overview of Upper Blue Nile basin

Fig. 2
figure 2

Spatial variation of average annual rainfall of the Upper Blue Nile basin

Data collection and sources

A range of multiple and spatially distributed datasets such as topographic features, soil types, land use/land cove, climate, and hydrological data are needed for the SWAT model (Neitsch et al. 2011). In the present study, the topographic, land use, soil, climate, and hydrological data of the Upper Blue Nile basin were collected from different sources (Table 1).

Table 1 Data type, source and resolution

Data preparation

Spatial data preparation

The DEM data acquired from Alaska Satellite Facility (ASF) were preprocessed in ArcGIS software using a geospatial toolbox. Subsequently, the watershed and drainage patterns of the study area were delineated from the digital elevation model. Besides, the sub-basin parameters such as slope gradient, slope length of the terrain, and the stream network characteristics such as channel slope, length, and width were derived from the digital elevation model. The elevation band of the preprocessed image is range from 478 to 4258 m (Fig. 3).

Fig. 3
figure 3

Elevation map of the Upper Blue Nile basin

The second physiological data highly required for the SWAT model setup is the land use map, which is one of the main factors affecting water resources in a watershed (Koycegiz and Buyukyildiz 2019). The land use map extracted from Copernicus Global Land Service (CGLS) was classified based on the CGLS guide into10 main classes: agricultural land, bare land, closed forest, grassland, mixed forest, open forest, shrubs, urban, water, and wetland. Accordingly, about 55.94% of the Upper Blue Nile basin is covered by agricultural land, 18.04% by closed forest, 14.5% by mixed forest, 6.12% by shrubs, and the rest 5.4% by other land use classes (Fig. 4). The land-use types of the study basin were coded into SWAT by using the master database as a lookup table.

Fig. 4
figure 4

Land use map of the Upper Blue Nile basin

Different types of soil texture and physical–chemical properties of soils are required for SWAT simulations (Neitsch et al. 2011). The soil map of the study area (Fig. 5) was obtained from the World Food and Agriculture Organization of the United States (FAO) database. The physical and chemical properties of each soil type of the study area were extracted from the SWAT database.

Fig. 5
figure 5

FAO soil map of the Upper Blue Nile basin

Meteorological data preparation and quality assurance

Hydrological modeling and water resource studies have required the evaluation of the consistency and quality of the observed weather data (Talaee et al. 2014). The meteorological data of the study area were collected from the National Meteorological Service Agency (NMSA) of Ethiopia for 18 years (1999–2016). About 150 meteorological stations were found in the Upper Blue Nile basin, from which 55 stations were selected based on the length of the recording period and their completeness. Among these, 32 stations (Table 2; Fig. 6) were selected as a target station, and the remaining 23 stations were used to fill the missing data to the neighboring stations. The consistency of the meteorological data of the selected stations was checked using a double mass curve method. Finally, the missed data were filled and homogeneity analysis was executed using CLIMATOL v3.3 package of the R programming.

Table 2 Selected meteorological stations and their description
Fig. 6
figure 6

Spatial locations of the selected meteorological and streamflow gauge stations

Filling missing data and homogeneity analysis

The main objective of doing the quality control and homogeneity analysis of the meteorological data for this study was detecting data values that are obviously in error. Apart from affecting the value of a dataset in its own right, this is necessary because large outliers can influence subsequent climate analysis. Therefore, before applying raw meteorological data to subsequent analysis, quality control and homogeneity analysis are required. For the present study, due to the spatial heterogeneity of the Upper Blue Nile basin CLIMATOL package was selected to fill the missing data and homogeneity analysis.

CLIMATOL applies a spatial interpolation and rates of the normals to fill daily rainfall data; an algorithm developed by Paulhus and Kohler (1952) and standardized values method to fill temperature data. This implies knowing the averages and standard deviations of all series for a common and long period of observation, and incompatible constraints within a fragmented dataset. Therefore, averages and standard deviations are computed first with the available data, and missing data are filled with the unstandardized reference series computed for each station (Guijarro 2018).

The program uses the Standard Normal Homogeneity Test (SNHT) method to test the homogeneity of neighboring stations. The SNHT for the rainfall and/or temperature is based upon the assumption that the ratio q for rainfall and temperature at the station being tested (target station) and a neighboring station (reference station) is fairly constant in time. An inhomogeneity is one of the series that will then be revealed by a systematic change in this ratio. Consequently, the time series of this ratio qi (index i denoted to time step-i.e., year) is a suitable variable for testing the significance of a suspected homogeneity.

A standardized series of the ratio, zi, is defined as

$${z}_{\mathrm{i}}={(q}_{\mathrm{i}}-\widehat{q})/{s}_{q}$$
(1)

where q is the sample mean and sq is the sample standard deviation of the qi’s the sample mean value of zi then 0, while the sample standard deviation is 1 (Tuomenyirta 2002).

After the meteorological data quality was controlled, the observed data were used to generate additional weather data. PcpSTAT (precipitation index) and Dew point programs were employed to generate the required weather data. Accordingly, the generated weather data were exported into the SWAT database.

Hydrological data

Streamflow data of 14 years (2001–2014) recorded at Kessie and Border gauging stations (Table 3; Fig. 6), which are situated at the mainstream of the basin were used for calibration and validation purposes. These two-gauge stations were selected because they have adequate streamflow information needed for this study. The data were obtained from the Ethiopian Ministry of Water, Irrigation and Energy.

Table 3 Selected streamflow gauge stations and their spatial location

SWAT model description

Soil and water assessment tool (SWAT) model is a continuous-time, semi-distributed, process-based river basin scale model (Arnold et al. 2012). The model is designed to test and forecast the impact of water management on water, sediment, and agricultural chemical yields in ungagged watersheds on a daily time step (Gassman et al. 2007; Devia and Ganasri 2015). The main components of the model are weather, hydrology, soil temperature and properties, plant growth, nutrients, pesticides, bacteria and pathogens, and land management (Neitsch et al. 2005).

SWAT model operates on a daily time step for each hydrologic unit based on the water balance equation (Eq. 2) (Gassman et al. 2007). The simulation of the model is separated into two phases: Land phases control the loading of the amount of water, sediment, and nutrients to the main channel in each subbasin (watershed), and the second phase defines the movement of water, sediments, and nutrients through the channel of the HRUs or watershed outlets. The water balance in SWAT is calculated at the watershed level using the following equation (Neitsch et al. 2011; Arnold et al. 2012).

$${\text{SW}}_{{\text{t}}} = {\text{ SW}}_{0} + \sum\nolimits_{{{\text{i}} = 1}}^{t} {\left( {R_{{{\text{day}},{\text{i}}}} - Q_{{{\text{surf}},{\text{i}}}} - E_{{{\text{a}},{\text{i}}}} - Q_{{{\text{lat}},{\text{i}}}} - Q_{{{\text{gw}},{\text{i}}}} } \right)}$$
(2)

where SWt stands final soil water content, SWo stands initial water content on day i, t is the time in day, Rday stands the amount of rainfall on day i, Qsurf stands the amount of surface runoff on day i, Ea is amount of evapotranspiration on day i, Qlat stands amount of water entering vadose zone from the soil profile on day i, and Qgw stands amount of return flow on day i (Neitsch et al. 2011).

The USDA Soil Conservation Service runoff curve number (CN) method (USDA 1972) is used to compute runoff as follows:

$${Q}_{\mathrm{surf}}=\frac{{({R}_{\mathrm{day}}-{I}_{\mathrm{a}})}^{2}}{({R}_{\mathrm{day}}-{I}_{\mathrm{a}}+S)}$$
(3)

where Qsurf is the accumulated runoff or rainfall excess, Rday is the rainfall depth for the day, Ia is the initial abstraction, which includes infiltration, interception and surface storage before runoff. S (Eq. 4) is the retention parameter calculated from the curve number.

$$S=25.4(\frac{1000}{\mathrm{CN}}-10)$$
(4)

Curve number is important during the model calibration process and determines the surface runoff of the watershed (Arnold et al. 2012). Another important parameter, especially for densely vegetated watersheds, is evapotranspiration. In the SWAT model, three evapotranspiration estimation methods are employed; Penman–Monteith, Priestley-Taylor, and Hargreaves (Neitsch et al. 2011). Due to data availability and the modest nature of the method, the Hargreaves method was employed for estimating potential evapotranspiration.

Model setup

The Upper Blue Nile basin modeling framework was developed to understand the spatial and temporal variation of water resources in the basin include precipitation, streamflow, runoff (surface and groundwater), and evapotranspiration.

Watershed delineation

Watershed delineation is the initial stage of the SWAT model building (Luo et al. 2011). Since SWAT is a physical-based model, the subbasin and reaches of the watershed are derived from a digital elevation model. The watersheds of the Upper Blue Nile basin were delineated from the digital elevation model using ArcSWAT 2012 automatic delineation toolbox. The delineation was done based on the spatial distribution of the selected meteorological stations and the numbers of tributaries of the basin. Accordingly, 45 sub-basins were delineated with the minimum threshold value of 100,000 ha.

Hydrological response unit (HRU) definition

After watershed and their parameters were created, the hydrological response units (HRUs) were defined. Subsequently, the SWAT model requires the physiographical data (land use, soil type, and slope) to define the HRUs (Her et al. 2015; Devia and Ganasri 2015; Nasiri et al. 2020). Here, the slope map of the study basin was derived from the digital elevation model. Before the numbers of HRUs were defined, the land use, soil, and slope maps were reclassified and overlaid. As a result, 1138 HRUs were created using a 5% threshold value for land use, soil, and slope maps.

Writing climate dataset

The station-level climate data (temperature and precipitation) were encoded into the SWAT model database. Essentially the model requires humidity, solar radiation, and wind dataset. Due to lack of data, the generated weather data by the SWAT weather generator were encoded into the model database.

Model simulation

The encoded hydro-climatic parameters into the model were used to simulate hydrological processes of the Upper Blue Nile basin. The simulated years of the model are ranged from 1999 to 2016. The first 2 years (1999 and 2000) were used for model warm-up, to alleviate the effects of unknown initial conditions, and subsequently, this time period was excluded from the analysis.

Sensitivity analysis, model calibration and validation

Sensitivity analysis is the process of determining the rate of change in model output concerning changes in model inputs (parameters) (Arnold et al. 2012). The sensitivity analysis provided an understanding of the parameters and helps to identify the most dominant parameters that should be used for calibration (Worqlula et al. 2018). The responsive parameters are calculating using a multiple regression system with Latin hypercube samples by means of the objective function (t stat and P value) (Narsimlu et al. 2015). The t test provides the measures of the degree of sensitivity and the p value provides the significance of the sensitivity (Narsimlu et al. 2015). A larger absolute t stat means greater sensitivity and a p value close to zero represents higher significance (Abbaspour et al. 2007). Based on their t stat and p value the most sensitive parameters were selected and used for model calibration and validation.

Model calibration is performed by selecting the fitting values for the model input parameters by comparing the model simulated value for given local conditions with observed data for the same conditions (Arnold et al. 2012). Validation is performed by running a model using parameters with their fitting value that were determined during the calibration period and comparing the simulations to observed data not used in the calibration period (Abbaspour et al. 2015).

In this study, the sensitivity analysis, model calibration, and validation activities were performed by using SUFI-2 (Sequential Uncertainty Fitting ver. 2) under SWAT-CUP ver-2012 (the SWAT Calibration and Uncertainty Programs) specialized computer program. SUFI-2 is one of the optimization algorithms in SWAT-CUP developed by Abbaspour et al. (2004) that was used in this study. 14-year streamflow data of Border and Kessie stations at the mainstream of the Upper Blue Nile basin (Fig. 6) were used for model calibration and validation. The data from 2001 to 2009 were used for model calibration and 2010–2014 used for model validation.

Model performance evaluation

Statistical performance measures for hydrological model are computed to determine how the simulated values by the model match with those of observed. In this study, three statistical measures have been used for model performance evaluation, namely Nash–Sutcliffe efficiency (NSE), coefficient of determination (R2), and percent of bias (PBIAS), which were described in detail (Nash and Sutcliffe 1970; Krause et al. 2005; Gupta et al. 1999) as well as an appropriate range of values for these measures are listed in Table 4 (Moriasi et al. 2007; Ayele et al. 2017).

Table 4 General performance ratings for simulated streamflow (Moriasi et al. 2007; Ayele et al. 2017)

Nash–Sutcliffe efficiency (NSE) is a standardized statistic that describes differences between the simulated and observed values normalized by the variance of the observed values during the study period. The value ranges from -∞ to 1. An NSE = 1 indicates simulated data are equal to that of the observed data and NSE = 0 indicates that the simulated data are as accurate as the mean of the observed data (Nash and Sutcliffe 1970).

$$\mathrm{NSE}=1-\frac{\sum_{\mathrm{i}}^{n}{({Q}_{\mathrm{m}}-{Q}_{\mathrm{s}})}_{i}^{2}}{\sum_{\mathrm{i}}^{n}{({Q}_{\mathrm{m},i}-{\overline{Q}}_{\mathrm{s}})}^{2}}$$
(5)

where Q represents the discharge values, “m” and “s” stand for the measured and simulated data, respectively, and the bar stands for the average values of measured and simulated data.

The coefficient of determination (R2) is the square of the correlation (r) and measure the goodness of fit between the simulated and observed values; it ranges from 0 to 1. An R2 of 1 means the simulated data are equal to that of the observed data and R2 of 0 means no correlation between the simulated and observed data (Krause et al. 2005).

$${R}^{2}=\frac{{\left[\sum_{i}^{n}({Q}_{\mathrm{m},i}-{\overline{Q}}_{\mathrm{m}})({Q}_{\mathrm{s},i}-{\overline{Q}}_{\mathrm{s}})\right]}^{2}}{\sum_{i}^{n}{({Q}_{\mathrm{m},i}-{\overline{Q}}_{\mathrm{m}})}^{2}{\sum_{i}^{n}({Q}_{\mathrm{s},i}-{\overline{Q}}_{\mathrm{s}})}^{2}}$$
(6)

where Q is representing discharge, and m and s stand for measured and simulated data, i is the ith measured or simulated data.

The percent of bias (PBIAS) measures the average propensity between the simulated and observed data, indicating how much the simulated data to be larger or smaller than the observed data. The optimal value of PBIAS is zero, where low magnitude values indicate better model simulations. The positive PBIAS value indicates model underestimation, and the negative PBIAS value indicates model overestimation (Gupta et al. 1999).

$$\mathrm{PBIAS}=\frac{\sum_{i}^{n}{({Q}_{\mathrm{m}}-{Q}_{\mathrm{s}})}_{i}}{\sum_{i}^{n}{\mathrm{Q}}_{\mathrm{m},i}}\times 100$$
(7)

where Q is denoting for discharge, and m and s stand for measured and simulated value, respectively.

Results and discussion

Sensitivity analysis

Sensitivity analysis has to be performed to prior model calibration and to determine parameters most affect the variable of interest in the watershed, which helps to decrease the number of parameters in the calibration procedure by eliminating the parameters identified as not sensitive (Nazari-Sharabian et al. 2020). Accordingly, the analysis has been executed using the SUFI-2 module of the SWAT-CUP program over the simulated period (2001–2014).

In the sensitivity analysis, the whole discharge parameters were considered, from which thirteen parameters were found to be relatively most sensitive ranging from very high to medium. Among the sensitive parameters, the groundwater flow parameters were found to be more sensitive to streamflow. The sensitivity rank of the selected parameters was selected based on their t stat, and p value is shown in Table 5. SCS runoff curve number, deep aquifer percolation fraction, threshold depth of water in the shallow aquifer for “revap” to occur, available water capacity of the soil layer, groundwater delay, effective hydraulic conductivity in main channel alluvium, baseflow alpha-factor, groundwater “revap” coefficient, saturated hydraulic conductivity, threshold depth of water in the shallow aquifer required for return flow to occur, soil evaporation compensation factor, maximum canopy storage and manning’s “n” value for the main channel were identified. These parameters were considered for model calibration.

Table 5 Description of the most sensitive parameters in the SWAT model for the Upper Blue Nile basin

Model calibration and validation

By considering the most sensitive parameters (Table 5), the SWAT model has been calibrated and validated on the observed streamflow at Kessie and Border stations. To improve the efficiency of the model during calibration, the selected parameters were used to account for over and under prediction responses of the model as suggested by Neitsch et al. (2011). Until a satisfactory agreement between observed and simulated streamflow is attained, several calibration runs have been executed. Eventually, the calibration process using the SUFI-2 algorithm gave the final fitted parameters (Table 6). The available streamflow data of 14 years (2001–2014) were used for model calibration and validation. The streamflow data from 2001 to 2009 were used for calibration and 2010–2014 used for the validation period. The graphical comparison of monthly observed streamflow with simulated data streamflow for the calibration and validation periods is shown in Figs. 7 and 8.

Table 6 Final calibration parameters and their fitted value
Fig. 7
figure 7

Observed and simulated flow hydrograph for the calibration and validation period at Kessie

Fig. 8
figure 8

Observed and simulated flow hydrograph for the calibration and validation period at Border

Performance evaluation

The performance of the SWAT model is analyzed based on a graphical representation of observed and simulated streamflow and the basis of various statistical parameters. The statistical results that the SWAT model performed (Table 7) were acceptable at the Kessie station with Nash–Sutcliffe efficiency (NSE) 0.68 and 0.88 for calibration and validation periods, respectively. The values of coefficient of determination (R2) for the calibration and validation period are 0.81 and 0.89, respectively, which indicated that the agreements between observed and simulated streamflow for both periods were strong (Fig. 9). The PBIAS values were − 10.8% and 8.3% for the calibration and validation periods, respectively, which showed that the model slightly overestimated the flow during calibration and underestimated the flow during validation with reference to the observed data.

Table 7 R2, NSE and PBIAS value for the calibration and validation period at Kessie and Border stations
Fig. 9
figure 9

Scatter plot of observed and simulated streamflow for calibration period (a) and validation period (b) at Kessie

Based on the PBIAS value, the streamflow rate at the Kessie station was overestimated by 9.75% during the calibration period and underestimated by 8.31% during the validation period. The hydrograph of simulated and observed monthly streamflow at the Kessie station for calibration and validation is also presented in Fig. 7. It can be seen that the SWAT model overestimated for most years in the calibration period and underestimated for all years in the validation period.

At the Border gauging station, SWAT model performed very well (Table 7) with Nash–Sutcliffe efficiency (NSE) 0.83 and 0.89 for calibration and validation periods, respectively. The values of coefficient of determination (R2) for the calibration and validation periods were found to be 0.85 and 0.93, respectively, which reflected that the agreements between observed and simulated streamflow for both periods were strong (Fig. 10). The PBIAS values were found − 4.7% and 9.7% for the calibration and validation periods, respectively, which indicated that the model slightly overestimated the flow during calibration and underestimated the flow during validation.

Fig. 10
figure 10

Scatter plot of observed and simulated streamflow for calibration period (a) and validation period (b) at Border

Regarding to the PBIAS results, the streamflow rate at Border station overestimated by 4% during the calibration period and underestimated by 15% during validation period. The hydrograph of simulated and observed monthly streamflow at the Border station for model calibration and validation is presented in Fig. 8. It can be seen that the SWAT model overestimated for some years in the calibration period and underestimated for most years in the validation period.

Overall, the SWAT model results showed that there was a good agreement between observed and simulated monthly streamflow at the Upper Blue Nile basin. Previously several authors calibrated and validated the SWAT model in the Blue Nile river basin, and their report showed that the statistical parameters such as R2 and NSE varied between 0.59–0.96 and 0.43–0.92 respectively (Gebremicael et al. 2013; Roth et al. 2016; Polanco et al. 2017; Tegegne et al. 2017; Lemann et al. 2019; Nigussie et al. 2019). Hence, the SWAT hydrological model is applicable in the Upper Blue Nile basin. Accordingly, the simulated values of this study can be acceptable.

Hydrological water balance

Water balance in watersheds is one of the most important factors used to determine if a model is good enough for any particular application (Polanco et al., 2017). Based on the simulated precipitation, evapotranspiration, total discharge, and water yield of the SWAT model, the water balance of the Upper Blue Nile basin was analyzed. Table 8 lists the simulated water balance components on an annual average basis for the Upper Blue Nile basin over the whole simulation, calibration, and validation periods.

Table 8 Upper Blue Nile basin simulated annual water balance components for total simulated, calibration and validation periods

The results indicated that in the simulation period, 22.43% of the precipitation of the basin contributed to streamflow, whereas 49.5% of the precipitation is lost by evapotranspiration. Moreover, in the simulation period, the contribution of surface runoff (SUR_Q), lateral flow (Lat_Q), and groundwater flow (GW_Q) for the annual water yield (WYLD) was 43.7% and 13.48%, 40.44%, respectively. This shows as the contribution of surface runoff is higher than other water balance components of the basin.

For the calibration period, 22.05% of the precipitation of the basin contributed to streamflow; 50.17% of the precipitation is returned to the atmosphere through evapotranspiration. Besides, the contribution of the surface runoff (SUR_Q), later flow (Lat_Q), and groundwater flow (GW_Q) for water yield (WYLD) was 44.95%, 12.41%, and 40.15%, respectively. For the calibration period, 22.88% of the precipitation of the basin contributed to streamflow; 48.58% of the precipitation is returned to the atmosphere through evapotranspiration. Also, the contribution of the surface runoff (SUR_Q, later flow, (Lat_Q) and groundwater flow (GW_Q) for water yield (WYLD) was 45.26%, 12.22%, and 40.28%, respectively. The water yield (Table 8) during the validation period was higher than the calibration period. This could be because the rainfall amount during the validation period was relatively higher.

The spatial distribution of the water balance components shows a better understanding of the basin (Fig. 11). At the basin scale, the annual precipitation in the Upper Blue Nile basin varied from 734 to 2118 mm. The amount of precipitation falling on the south tip and southwest of the basin was more than that of the eastern and western tip of the basin. The annual evapotranspiration of the basin varied from 475 to 1159 mm, and likewise precipitation, the value of evapotranspiration was high on the south tip and southwest part of the basin. The water yield at the basin scale of the Upper Blue Nile basin varied from 115 to 1201 mm. Alike to precipitation, higher water yield was found on the south tip and southwest part of the basin. Hence, the SWAT output of the study area indicates a good spatial correlation among precipitation, evapotranspiration, potential evapotranspiration, and water yield.

Fig. 11
figure 11

Simulated rainfall (a), water yield (b), PET (c) and ET (d) across the Upper Blue Nile basin

Conclusion

Spatially semi-distributed hydrological model, SWAT was applied to model the hydrological processes and water balance components of the Upper Blue Nile basin. The model was calibrated and validated using monthly streamflow data by applying the SUFI-2 algorithm in SWAT-CUP2012. Sensitivity analysis was done to identify flow-sensitive parameters, and groundwater flow parameters were found to be more sensitive to streamflow.

Nash–Sutcliffe efficiency (NSE), coefficient of determination (R2), and percent of bias (PBIAS) statistical measures are considered to evaluate the performance of the model. In the calibration period, the value of NSE, R2, and PBIAS was 0.68, 0.81, and − 10.8%, respectively, for Kessie station and 0.83, 0.85, and − 4.7%, respectively, for the Border station. In the calibration period, NSE, R2, and PBIAS were 0.89, 0.88, and 8.3% for Kessie station and 0.93, 0.89, and 9.7%, respectively, for the Border station. Therefore, the result indicates that the SWAT model gave good simulation results and performed well in the Upper Blue Nile basin, and the results were acceptable.

The hydrological water balance analysiss showed that 22.43% of the precipitation of the basin contributed to streamflow as surface flow, whereas 49.5% of the precipitation is lost by evapotranspiration. Hence, the runoff of the basin is affected by evapotranspiration. Surface runoff, lateral flow, and baseflow contributed 44.95%, 12.41%, and 40.15%, respectively, for the annual water yield of the basin. The water yield during the validation period was higher than the calibration period. This could be because the rainfall amount during the validation period was relatively higher. For the spatial distribution of the water balance components, the result indicated a good correction among the hydrological components.

Overall, the finding of this research demonstrated the high potential applicability of the SWAT model for the assessment of hydrological processes at a large river basin. Perhaps, the findings of the study are useful for basin-wide water resources planning and management in the Upper Blue Nile basin, as they not only provide a proportion of water resource of the basin but also give a better understanding of the capacity of the available water resource for future water-based development projects.