1 Introduction

Global and regional climate models are being developed and widely used to project future climate conditions, and to help better understand the present and past climate (Buontempo et al. 2015; Clarke et al. 2007; Kalognomou et al. 2013; Lelieveld et al. 2012). Dynamic downscaling techniques can be applied to bring global-scale projections down to a regional level, where the output of general circulation models (GCMs) provides the initial and lateral boundary conditions, unlike statistical downscaling which develops statistical relationships between climate parameters, usually by a method of generating a modified local time series by translating a pattern into a local information (Giorgi 1990; Schmidli et al. 2007). Considerable computing resources are involved in dynamic downscaling, where high-resolution climate scenarios provided by regional climate models (RCMs) allow for a more precise description of topographic forcing due to orography, land-sea contrasts and land-use characteristics (Cohen 1990; Evans 2012; Giorgi 1990). These complex feedbacks and processes are of particular relevance for the Eastern Nile (EN) Basin, where despite the projected warming, future precipitation changes are characterized by low levels of robustness (Lelieveld et al. 2016; Zittis et al. 2019). Moreover, parts of the Basin (mainly downstream) are expected to become drier, while regions upstream will likely become wetter (Almazroui et al. 2020; Driouech et al. 2020). Regional hydro-climate assessments for the future are still limited, and the level of understanding of future climate behavior is still developing for different sectors in the EN due to the cost and complexity of carrying out representative climate studies. The complexity and chaotic nature of the climate system and the synergetic effect of temperature and precipitation changes suggest that future hydroclimate conditions cannot be represented by extrapolating past climate conditions (Collins et al. 2013; Baumberger et al. 2017). Instead, the best prospects for predicting future behavior are using mathematical representations of the Earth’s climate system through high-resolution regional climate and hydrological models that consider the peculiarities of the region. Although such applications are common in other parts of the world (Camera et al. 2020 and references therein), there is a lack of relevant research studies that assess the performance of the hydro-climate models over the Nile and specifically over the EN basin. Few studies (Argent, 2014; Beyene et al. 2010; Di Baldassarre et al. 2011; Siam 2010; Weldemariam 2015) focused on the impacts of climate change on the hydrology at different regions in the Nile basin hydrology and Lake Victoria basin using dynamical and statistical downscaling techniques.

In this context, this study pursues a high-resolution Regional Climate Model (RCM) configuration for the EN basin that is optimized and subsequently bias-corrected, thus serving for hydro-climatological applications. Our Weather Research and Forecasting (WRF) model optimization involves the testing of several parameterization options of sub-grid scale processes (e.g., microphysics and planetary boundary layer), as well as the off-line coupling with a semi-distributed hydrological rainfall-runoff model, using the Soil and Water Assessment Tool (SWAT) model. The results of the study are expected to be used in future dynamic downscaling of climate models for hydro-climatological applications using the pre-configured WRF model.

2 Data and Methods

2.1 Study Region

The EN hydrologic boundaries extend from the Ethiopian Highlands (~ 3° N) in the south to the High Aswan Dam (HAD) in the North (~ 24° N) and from the west of Sudan (~ 26° E) to the Gulf of Aden (~ 42° E) as shown in Fig. 1. Elevations in the EN basin range from 0 to 4300 m above mean sea level. About 5% of the basin lies in very low elevated areas while most of the EN (around 70%) is situated within the range of 300–600 m. Another 20% is between 600–2000 m and the remaining 5% is associated with very steep slopes (around 2000–4300 m) (Tesemma 2009). Ethiopia has a general elevation ranging from 1500 to 3000 m and the plateau height exceeds 4000 m. A key land feature in Ethiopia is Lake Tana, created by volcanic activity, at a height of 1785 m. In Sudan, elevations vary between 170 m and 1475 m, whereas in South Sudan, elevation ranges from 380 to 2885 m (Asore 2012; Raafat 2015; Siam 2010; Tarabia 2015).

Fig. 1
figure 1

Region of a study showing the configured WRF EN basin domain nested in the MENA domain and the selected sub-regions for evaluation

The EN is divided into five sub-basins that include the Main Nile, the Baro-Akobo-Sobat,-White Nile, the Abay-Blue Nile, and the Tekeze-Atbara-Setite as shown in Fig. 1. The Abay-Blue Nile and the Main Nile region host nearly 82% of the total population (ENTRO 2006; Siam 2010). The EN basin region experience a tropical to subtropical climate with seasonally summer (June–August) precipitation that mostly happens in the south in Ethiopia and South Sudan. Precipitation over the Nile basin increases from the north to the south (with elevation) with values up to 1600 mm/year over the Ethiopian highlands, which is mainly governed by the interaction with the basin topography and the Inter-Tropical Convergence Zone movement (Mohamed et al. 2005; Tarabia 2015). Temperatures varies widely from a Mediterranean climate in the north (Lower Egypt) to cold over the Ethiopian plateau and hot in the central and northern Sudan. The average for the minimum temperature varies from 12 to 20 °C, while the average for maximum temperature varies from 29 to 33 °C.

Due to the varying climate conditions across the EN basin, a wide swath of landcover is observed. Most of the northern basin area are arid to semi-arid lands due to the dry climatic conditions, while the south near Ethiopia and South Sudan are greener lands, either cultivated or natural forests. The observed landcover over a period of nearly 50 years from 1961 to 2009 has changed a little; where the natural vegetation has decreased by nearly 1.8% and the agriculture area has increased by 5.3% resulting in a loss of 0.3 Mha of green areas (ENTRO 2006; Siam 2010; Tarabia 2015).

2.2 Datasets

Based on availability throughout the study period (1980–2009), spatial resolution and quality of data, the following sets are selected:

  • European Centre for Medium-Range Weather Forecast (ECMWF) ERA-Interim reanalysis dataset. The ERA-Interim project started in 2006 to improve the ECMWF ERA-40 data (1957–2002) to introduce a better representation of the hydrological cycle, the quality of the stratospheric circulation, and the handling of biases and changes in the observing. The ERA-Interim atmospheric model and reanalysis system is coupled to an ocean-wave model and it is configured for 60 vertical atmospheric levels, with top level at 0.1 hPa system at approximately 80 km spatial resolution (Berrisford et al. 2011).

  • The Global Precipitation Climatology Center (GPCC) dataset version 6.0, covering the period 1901–2010, contains monthly precipitation sums and has a spatial resolution of 0.5° × 0.5° latitude by longitude. GPCC is derived from archived and collected global precipitation data based on quality-controlled data from 67,200 stations for record durations of more than 10 years (Schamm et al. 2014). The GPCC dataset is selected for its appropriate resolution and quality for use of multiple regions over Africa (Nicholson et al. 2003; Zittis 2018).

  • The University of Delaware (UDEL) monthly global dataset for air temperature and precipitation from 1950 to 2010 with a spatial resolution of 0.5° × 0.5° latitude by longitude (UDel 2015).

2.3 Methods

The methodology applied for hydro-climatic simulations and dynamical climate downscaling over the EN basin is summarized in the flow chart of Fig. 2. The regional climate modeling is carried out using version 3.5 of the WRF model, developed at the National Center for Atmospheric Research (NCAR) (Skamarock et al. 2008), over a domain covering the Middle East and North Africa (MENA) including the EN region. This is defined by the MENA initiative of the Coordinated Regional Climate Downscaling Experiment—CORDEX (Zittis et al. 2014). Climate downscaling is performed by forcing the WRF model with initial and boundary conditions from the European Centre for Medium-Range Weather Forecast (ECMWF) ERA-Interim reanalysis dataset (Berrisford et al. 2011) for a 30-years period from 1980 to 2009, to (a) select the suitable combination of physics parameterizations and (b) investigate the model skill in reproducing the past and present climate conditions. The model is configured as a sub-domain (nest) in the MENA-CORDEX domain, encompassing 185 × 250 horizontal grid points (~ 10 km grid resolution) and 30 vertical grid levels. The integration timestep is set to 240 s for the MENA parent domain computations and 1:5 as parent to nest timestep ratio which is the same ratio used for nesting the EN domain. The model is configured using the 21-categories MODIS satellite landcover dataset, with the default static datasets for soil, albedo, vegetation and topography (Skamarock et al. 2008). The spin-up time is set to seven months after sensitivity analysis to model for different spin-up periods.

Fig. 2
figure 2

The followed research methodology flow chart

Because the MENA-CORDEX domain is large and parts of it are out of context for the EN region, we focus our evaluation on the high-resolution nest and the 12 sub-domains listed in Table 1, that represent the individual watersheds of the EN and cover different climatic zones up to the High Aswan Dam. The selected regions are equally sized as boxes of dimensions 2° by 2° represented in Fig. 1. Each of them covers approximately 484 grid points at the EN domain selected resolution (~ 10 km grid resolution).

Table 1 Names and locations of the 12 sub-regions for analysis

2.3.1 Sensitivity to Physics Parameterization

Based on previous WRF studies for the MENA-CORDEX domain (Abdelwares et al. 2017; Zittis et al. 2014, 2017) and following other physics sensitivity studies (Caldwell et al. 2009; Done et al. 2005; Fujino et al. 2006; Gbobaniyi et al. 2014; Jin et al. 2010; Katragkou et al. 2015; Laprise et al. 2013; Otkin et al. 2006; Shin and Hong, 2011; Soares et al. 2012), four physics combinations, commonly used for regional climate simulations, are used to test the model sensitivity towards different parameterization schemes (Table 2). In this task, we mainly focus on the microphysics and planetary boundary layer (PBL) parameterizations. The physics representations were tested for precipitation and temperature over 2 years of extreme different precipitation regimes over the Nile basin; 1999 as a wet year and 1984 as a dry year (Siam 2010).

Table 2 WRF physics parameterization options tested in model sensitivity analyses

As reference data for the model evaluation, we use the Global Precipitation Climatology Center (GPCC) dataset (Schamm et al. 2014) for precipitation, while the UDEL (UDel 2015) provided the reference for temperature. The Pearson’s correlation coefficient (COR) is used to determine the skill of the model to simulate the variables’ pattern across the simulation period (Eq. 1). The Root Mean Square Error (RMSE), (Eq. 2) and the Mean Absolute Error (MAE), (Eq. 3) are also calculated to assess the quality of the model and estimate model simulation biases:

$${\varvec{C}}{\varvec{O}}{\varvec{R}}=\frac{{\varvec{c}}{\varvec{o}}{\varvec{v}}({\varvec{O}}{\varvec{B}}{\varvec{S}},{\varvec{S}}{\varvec{I}}{\varvec{M}})}{{{\varvec{\sigma}}}_{{\varvec{O}}{\varvec{B}}{\varvec{S}}}.{{\varvec{\sigma}}}_{{\varvec{S}}{\varvec{I}}{\varvec{M}}}}= \frac{{\sum }_{{\varvec{i}}=1}^{{\varvec{n}}}{({\varvec{O}}{\varvec{B}}{\varvec{S}}}_{{\varvec{i}}}-\underset{\_}{{\varvec{O}}{\varvec{B}}{\varvec{S}}})({{\varvec{S}}{\varvec{I}}{\varvec{M}}}_{{\varvec{i}}}-\underset{\_}{{\varvec{S}}{\varvec{I}}{\varvec{M}}})}{{{\varvec{\sigma}}}_{{\varvec{O}}{\varvec{B}}{\varvec{S}}}.{{\varvec{\sigma}}}_{{\varvec{S}}{\varvec{I}}{\varvec{M}}}}$$
(1)
$${\varvec{M}}{\varvec{A}}{\varvec{E}}= \frac{1}{{\varvec{n}}}{\sum }_{{\varvec{i}}=1}^{{\varvec{n}}}|{{\varvec{S}}{\varvec{I}}{\varvec{M}}}_{{\varvec{i}}}-{{\varvec{O}}{\varvec{B}}{\varvec{S}}}_{{\varvec{i}}}|$$
(2)
$${\varvec{R}}{\varvec{M}}{\varvec{S}}{\varvec{E}}= \sqrt{\frac{{\sum }_{{\varvec{i}}=1}^{{\varvec{n}}}{({{\varvec{S}}{\varvec{I}}{\varvec{M}}}_{{\varvec{i}}}-{{\varvec{O}}{\varvec{B}}{\varvec{S}}}_{{\varvec{i}}})}^{2}}{{\varvec{n}}}}$$
(3)

Where n is the number of samples, and OBS and SIM are the observed and simulated variables.

The used observational datasets are bilinearly re-gridded to match the configured WRF model resolution, to carry out grid-to-grid statistical analysis.

2.3.2 Precipitation Bias Correction

Several bias correction methods have been proposed, focusing mainly on precipitation and temperature. In view of the intricacy of physical processes to simulate precipitation, there are different methods at different levels of complexity available, from simple linear methods to empirical or theoretical functions aiming to correct moments of precipitation distribution (Argüeso et al. 2013).

Following a wide variety of bias correction methods (Argüeso et al. 2013), the algorithm followed to correct the precipitation output of our climate simulations is based on verifying the adjusted rainfall rates by applying them to a semi-distributed Rainfall-Runoff hydrological model (SWAT) and assessing the quality of the output flow hydrograph compared to that observed. The aim is to ensure the adequacy of the corrected precipitation results for hydrologic applications. The SWAT model have been previously used in different studies for simulating streamflow and watershed modeling (Arnold and Fohrer 2005; Mengistu and Sorteberg 2012), due to its capabilities of simulating hydrological processes as well as land-management and agricultural scenarios to support water resources management under different scenarios (Arnold and Fohrer 2005).

The selected study region for the SWAT model is the Gambella watershed, which is a part of the Baro-Akobo-Sobat Sub-Basin of the EN as shown in Fig. 3. The Gambella watershed covers an area of approximately 23,450 km2 over a wide elevation range, i.e. from a minimum of 450 m to a maximum of 2650 m. Raafat (2015) calibrated The SWAT model for Gambella watershed using ground-based observations of the runoff available near the streams following recommendations and sensitivity analysis of streamflow under different climate change scenarios within the EN basin (Mengistu and Sorteberg 2012). In that study, the weather information used is pulled from the Global Historical Climatology Network (GHCN) dataset version 2. After calibration, the model has shown a very high coefficient of determination (R2) value of nearly 0.95, and a Nash–Sutcliffe Efficiency value (NSE) of 0.92.

Fig. 3
figure 3

Left: Gambella watershed location within the Baro-Akobo-Sobat sub-basin. Right: a map of Gambella watershed within the WRF model grid at a resolution of 0.09° × 0.09°. “Light green circles with red dots” represent the used meteorological stations for the SWAT model configuration

The first bias-correction method is based on the probability distribution of the average monthly precipitation simulated from the WRF model in the whole domain (Legates and McCabe 1999). The modelled and observed precipitation (from GPCC dataset) fields are fitted and corrected according to a probability distribution function. The second correction method relies on inspection of the average precipitation bias over the experiment period against the elevation in each grid-point, then multiple bias correction factors are applied to the modelled precipitation based on the GPCC dataset to each topographic elevation group. The third bias correction method is based on evaluating the bias between the validation dataset and the monthly interannual variability in precipitation. The algorithm of correction depends on the daily average bias relative to the GPCC data in the same month, across the year for each grid-point in the domain. The three bias correction methods are evaluated based on the Nash–Sutcliffe Efficiency (NSE) of the SWAT model in using the corrected precipitation fields as the input for simulating the Gambella watershed runoff.

3 Results and Discussion

We performed a Taylor diagram-based statistical analysis for the four model configurations by comparing them with the UDEL dataset for the monthly average temperature in the historically wet year (1999) and dry year (1984). The analysis shows no significant difference between the chosen physics packages, which indicates that the four parameterization groups behave similarly in simulating the temperature over the Eastern Nile Basin (Fig. 4a). Physics configurations “1” and “4” show nearly similar performance for both precipitation and temperature for the selected 2 years (Fig. 4b). However, as an overall equally-weighted average for the RMSE and COR parameters for the selected 12 regions in the years 1984 and 1999, the performance of physics configuration “4” was found to give slightly better results (Table 3) and is selected for the historical 30 years downscaling experiment and includes “Lin” (Chen and Sun 2002) Microphysics scheme, “MYJ” (Janjić 1990) Planetary boundary layer parameterization scheme, “BMJ” Cumulus scheme (Janjić 1994), “CAM” (Collins et al. 2006) radiation scheme and “NOAH” (Chen et al. 1996) land surface model.

Fig. 4
figure 4

a Taylor diagram for temperature over the EN basin for the four physics groups over the two test years, 1984 and 1999. b Similar to a for the total precipitation over the EN basin

Table 3 Comparison between Physics groups “1” and “4”

Precipitation results from WRF are still highly uncertain due to the complexity of cloud formation and the precipitation physics representation. The model appears to overestimate precipitation over the highlands and underestimates it in low elevation regions, which needs further validation for application in hydrologic applications. Figure 5 shows the bias of the daily average over the 30-year period from 1980 to 2009.

Fig. 5
figure 5

WRF modeled precipitation bias compared to GPCC showing the maximum bias over the Ethiopian highlands for the period 1980 to 2009. Solid black lines represent elevation contours

The bias in precipitation is evident in the comparison for sub-regions, showing an underestimation in most of the dry parts and overestimation in the highlands, while both overestimation and underestimation were noticed in some regions like in Obeid. Figure 6 illustrates the modeled precipitation over three regions (Akobo, BN and GERD) out of the 12-selected analysis zones as time series of monthly precipitation (figures for other regions are not shown), averaged over all grid points of each sub-region. Interestingly, substantial differences between the monthly precipitations of the two gridded observational datasets are also found. These underscore concerns regarding the uncertainty in observational datasets (Tanarhte et al. 2012; Zittis 2018).

Fig. 6
figure 6

WRF modeled precipitation time series at: a Akobo, b BN and c GERD

The modeled precipitation is still subject to bias correction due to the obvious bias in modeling the past/present compared to the GPCC and UDEL datasets. The following sections will discuss in detail the correction methods followed, and the hydrologic verification technique to assure the adequacy of the chosen correction method (Elshamy et al. 2009).

According to the proposed first bias-correction method, the monthly GPCC precipitation is found to be best fitted as Weibull distribution with a maximum likelihood method, and shape and scale factors equal to 1.279 and 1.545, respectively, as shown in Fig. 7a, while the modeled precipitation has shape and scale factors equal to 1.202 and 1.183, respectively, as shown in Fig. 7b.

Fig. 7
figure 7

a GPCC precipitation Weibull distribution fitting. b WRF precipitation Weibull distribution fitting

We find that the WRF simulated precipitation, i.e. according to the distribution parameters, is underestimated by 30% compared to the observations, which is demonstrated in the Q–Q plot of Fig. 8a. The simulated precipitation is corrected by regressing the simulated precipitation against the observation, resulting in a multiplicative constant so that the fitted Weibull distribution of the WRF precipitation becomes 1.242 and 1.539 for the shape and scale factors, respectively. The Q–Q plot of the bias-corrected precipitation, shown in Fig. 8b, gives the best representation of the observations in the 30 years study period based on the probability distribution method.

Fig. 8
figure 8

a Q–Q plot for the modeled precipitation vs. the GPCC data. b Q–Q plot for the bias-corrected simulated precipitation vs. the GPCC data

The resulting bias-corrected precipitation is used as input for the SWAT hydrological model for the Gambella watershed for verification. The runoff results of the hydrological model shown in Fig. 9a yield a very high coefficient of determination (R2 = 0.88), however, the runoff values were highly overestimated leading to a poor NSE (− 0.15), which is not acceptable for hydrological models. The bias-correction method based on the probability distribution is, therefore, rejected for hydrological applications, even though the statistical parameters indicate good agreement with observations.

Fig. 9
figure 9

Comparison of monthly flow at Gambella watershed between the calibrated SWAT hydrological model forced by ground observations and WRF bias-corrected precipitation: a probability distribution corrected, b elevation-location corrected, c spatiotemporal corrected

The second correction method shows that bias is highly correlated to the terrain elevation, notably over the ETH, such that the bias-correction algorithm is a function of location and terrain elevation. The bias-correction matrices are divided into five bands based on elevation ranges, with band 1 representing the bias for grid points with elevation less than 500 m, band 2 from 500 to 1000 m, band 3 from 1000 to 1500 m, band 4 from 1500 to 2000 m and band 5 for grids of altitude more than 2000 m as shown in Fig. 10.

Fig. 10
figure 10

(Left) WRF 30-year mean precipitation bias and terrain elevation contour lines. (Right) The decomposed precipitation bias in the five elevation bands

The bias is corrected using the GPCC dataset and applied to the WRF model output at a condition that zero is the minimum value of precipitation. The resulting bias-corrected precipitation is again coupled with the SWAT hydrological model for the Gambella watershed. The runoff results of the hydrological model shown in Fig. 9b also yield a high coefficient of determination (R2 = 0.80) and a 0.61 NSE, which is considered sufficient for hydrological models.

In the third bias correction method, we generated 12 bias-correction matrices as daily average bias in each month (Fig. 11). It is clear that the bias in August is largest, with an underestimation of more than 10 mm/day, and smallest in May, July, and October, with almost no precipitation prediction bias.

Fig. 11
figure 11

Monthly spatiotemporal precipitation bias corrections

The-bias corrected precipitation is similarly coupled to the SWAT model for the Gambella watershed to assess the correction method against the previous two methods. The runoff results of the hydrological model (Fig. 9c) yield a very high coefficient of determination (R2 = 0.86), and a very good NSE of 0.79, that is more reliable for hydrological models.

Hence, the three bias correction methods applied here resulted in statistically acceptable results (overall mean and standard deviation of the modeled precipitation), however, verification based on a hydrological model the results for runoff were different. The spatiotemporal method was found to be the best bias-correction method, which should be applied to the modeled precipitation output for use in hydrologic applications.

4 Conclusions

Hydrological simulations, and quality of assessments based on their output, strongly depend on the input parameters, including precipitation. Due to the uncertainties and biases in modeling the temporal and spatial characteristics of precipitation, it is essential to reduce any biases before using it in such applications, for improving the overall process of decision making. The WRF model careful sensitivity analysis and optimization prior to model configuration domain (Abdelwares et al. 2017; Zittis et al. 2014, 2017), however, there are still biases in the produced output that require correction. The configured WRF model in this study shows a relatively high precision in simulating temperature with minor biases and errors based on the comparison with the observational datasets. However, the simulated precipitation is less accurate, and the results need to be bias-corrected to insure their validity for hydrologic applications.

The three bias-correction methods applied to WRF simulated precipitation output perform differently when compared to the rainfall-runoff hydrological model results. The spatiotemporal method, assuming a repeating monthly bias cycle throughout the years over the period considered appears to yield optimal performance, producing a valid precipitation product for the EN region at a high resolution, appropriate for hydrologic applications. The spatiotemporal method improved the NSE value of the runoff results using the WRF corrected precipitation from − 0.15 for the probability distribution correction method to 0.79, as well as preserving the high R-squared value at around 0.86.

Hydrological verification of the bias-correction method is found to be an essential step in approving the correction method since the results need to be used in hydrologic applications. Further, our results with the WRF model give confidence that scenario calculations will be useful for the future projection of temperature and precipitation, the latter at the condition of considering bias-correction with the proposed algorithm (Spatiotemporal Method). The success of this method also underscores the weakness of climate models in reproducing rainfall in regions with pronounced topography. It is worth noting that the suggested bias-correction methods are subject to the study region and may show different performance if applied to other climatic and topographic regions.