1 Introduction

According to IPCC (International Panel on Climate Change) reports, climate change and global warming is occurring with different consequences all over the world (IPCC 2001). Due to the relationship between the hydrological cycle and climate system, every change on climate will affect hydrological variables such as precipitation and run-off, which directly affect water resources in a region and may result in some disasters such as drought or flash flood. Many studies have shown that the frequency and magnitude of extreme events are increasing around the world because of the climate changes (Mays 2003; Wigley and Jones 1985). Therefore monitoring these changes and planning water resources in accordance with these changes is a vital issue for development of a sustainable water supply scheme.

A combination of hydrometeorological variables such as precipitation and temperature can be used as an indicator for quantifying climate change impacts on water resources. Precipitation and temperature variations with respect to climate change effects are assessed using GCM (General Circulation Model) outputs. But the application of GCMs for the local impact studies is restricted because of their low spatial resolution (typically in the order of 50,000 Km2) (Wigley et al. 1990; Carter et al. 1994). Therefore, downscaling techniques have emerged as a means to obtain local weather variables from regional atmospheric predictor variables. Harpham and Wilby (2005) simulated the precipitation for different zones of England using Statistical Down-Scaling Model, SDSM (Wilby et al. 2002), radial neural networks (neural networks with kernel based functions) and multi-layer perceptron neural networks. The results of their investigation show that all of these models are capable of simulating precipitation; however in different regions, their capabilities are different depending on the special characteristics of the region. Dibike and Coulibaly (2005) applied two statistical (a stochastic and a regression based) downscaling techniques called Long Ashton Research Station Weather Generator (LARS-WG) and Statistical Down-Scaling Model (SDSM) to generate the possible future values of local hydrometeorological variables such as precipitation and temperature in a watershed in the northern part of Canada. The results of this study show that GCMs are not very reliable in accuracy of simulated precipitation. Vidal and Wade (2008) investigated the uncertainty in catchment-scale precipitation scenarios due to the emissions scenario, the configuration of the GCM, and the downscaling method. The results show that the selection of a downscaling method and the multi-model building scheme has a significant impact on the simulated precipitation regime. Sajjad Khan et al. (2006) compared the results of three downscaling models namely SDSM, LARS-WG, and Artificial Neural Network (ANN), in terms of various uncertainty assessments exhibited in their downscaled results of precipitation and temperature. The uncertainty assessment results indicate that the SDSM is better capable of reproducing various statistical characteristics of the observed data. Massah Bavani (2006) used Kriging and the inverse weighting method for downscaling precipitation and temperature of the Zayandeh-rud river basin in Iran. His results indicate that both of these methods are capable of downscaling precipitation and temperature. Hingray et al. (2007) produced probability distributions for regional climate change in surface temperature and precipitation for 5 case studies. The results of regional climate changes especially at the end of study period (2070–2099) are significantly different regarding to characteristics of case studies, seasons and meteorological variables.

To evaluate drought periods, it is important to convert rainfall to runoff as a basis for impact studies of water resources management practices and simulation of water availability in the future. Karamouz and Araghinejad (2008) integrated two models based on a fuzzy inference system (FIS) and an ANN model for long-term prediction of Zayandeh-rud river streamflow in Iran. They calibrated the fuzzy inference system to be used as a climatic model to predict seasonal rainfall. This was done through the effect of large scale climate signals on rainfall variation as well as the persistence of seasonal rainfall time series. ANNs were used to map the observed and forecasted hydroclimatologic data into the seasonal streamflow. In this study, after long lead stremflow prediction, the probable low flow periods (droughts) are characterized.

The appropriate on-time strategies should be used to decrease the social, economical, and environmental consequences of drought periods. The drought management has shifted to risk-based management as a critical approach to emphasize on mitigation of impacts associated with drought based on societal vulnerabilities. The main issue in water resources management during the drought period is the optimal operation of reservoirs and water storage facilities. One of the conventional methods used to mitigate drought impacts is limitation of water allocation from the reservoir potential even when there is enough water for full target supply but there are indications of probable drought in the near future. This method is called hedging rule. Hedging rule provides assurance for higher-valued water supply where the reservoirs have low refill potentials or uncertain inflows. The intent of hedging rule is to reduce the risk and the cost of large shortages, but accept the cost of more frequent small shortages. Hashimoto et al. (1982) showed that in the cases where the loss function (on releases) is linear, the SOP (Standard Operation Policy) could result in the best reservoir operation policy. Shih and ReVelle (1994) formulated a nonlinear mixed integer programming model that minimizes the maximum deficit while considering a constant demand. They converted the continuous hedging rule into multiple discrete hedging rules which are more appropriate for practical applications. Tu et al. (2003) developed an optimization model that incorporates both the rule curves and hedging rules for management and operation of a multiple, multi-purpose reservoir system. Draper and Lund (2004) demonstrated that the optimal hedging policy for reservoir operation depends on a balance between beneficial release and carry over storage values. Their results indicate that where hedging is desirable, a linear “two-point” hedging policy is a better choice in a wide range of circumstances.

In this paper, an integrated algorithm for applying contingency plan in reservoir operation in the Aharchay case study is developed considering the climate change impacts on streamflow to a reservoir. The main objective is to develop more economical monthly water supply scheme in the drought periods and make an assessment of climate change impacts on the reservoir ability to supply the demand.

In the following sections, the case study is introduced and it is followed by a description of the methodology of the study. Then, the results are presented and discussed. Finally, a summary and conclusion is given.

2 Case Study

Iran is located in a semi-arid region with considerable water shortage but less attention is given to reservoir operation in emergency situations caused by droughts. The static operation rules which are commonly used, are not flexible and cannot be modified based on the observed situation; therefore, they may cause severe water shortages in droughts with high damages due to water shortages.

As a case study of this paper Sattarkhan reservoir, the only available surface water storage facility in the Aharchay river basin, in the northwestern part of Iran, has been selected. Aharchay river basin is located between 47°20′ and 47°30′ east longitude and 38°20′ and 38°45′ north latitude. The location of the study area with an area of 950 (Km2), is shown in Fig. 1. The mean annual rainfall and temperature at this basin are about 320 mm and 10°C, respectively. Sattarkhan reservoir has been in operation since 2000 and it is operated based on the standard operating policy (SOP) similar to most reservoirs in Iran (Fig. 7). The mean inflow of this reservoir is about 2.8 cm. The Reservoir is operated for multiple purposes, including water supply for irrigation, industrial, domestic uses, and instream (environmental) demands. The reservoir storage changes between 9 MCM (dead storage) and 132 MCM (maximum normal level). The average inflow to the reservoir is about 51 MCM/Year. The downstream demand of the Aharchay basin which is currently supplied by the Sattarkhan reservoir is about 50.52 MCM/year including water for irrigation (26.35 MCM/Year), industrial (5 MCM/Year), domestic (10.8 MCM/Year), and instream (environmental) demands (0.7 MCM/Year). It should be mentioned that the only considerable industrial activities in the study area is Songoon copper mine complex. The current irrigation area is about 3000 ha which will be increased to 8000 ha in the future. The projected water demand of the basin is shown in Table 1.

Fig. 1
figure 1

The location of the study area on the map of Iran

Table 1 The projected water demands of the study area for different periods

There are two meteorological stations, called “Kasanagh”, a rain gage upstream of the reservoir and “Ahar”, a synoptic station downstream of the reservoir, and a hydrometeorological station which is called “Orang”, just upstream of the Sattarkhan reservoirs which in this study are called KH, AR, and OG stations, respectively. The characteristics of these stations are given in Table 2. Karamouz (2010) demonstrated that these stations can be considered as the representative of the meteorological and hydrological condition of the study area.

Table 2 The characteristics of the stations used in Aharchay basin

The GCM outputs that are necessary for evaluation of climate change effects are downloaded from http://www.cics.uvic.ca/scenarios. This site includes the HADCM3 GCM model outputs developed in the UK, for the location of the study region on the GCM model grid. The considered CO2 emission scenarios are A2 and B2 which are both regional scenarios of climate change.

3 Methodology

In this paper, an algorithm has been developed to plan reservoir operation during the drought periods. The flowchart of the proposed methodology is shown in Fig. 2. Application of this methodology will increase the water supply system readiness to face the contingency periods of high water shortages. Since the water cycle characteristics change in time especially due to climate change effects, two examples of climate change scenarios are considered in order to project probable droughts’ characteristics in future and examine the proposed method performance in future.

Fig. 2
figure 2

The flowchart of reservoir operation using the climate change impacts on drought periods

For simulation of future drought condition, it is necessary to simulate the streamflow and for this purpose simulation of rainfall and temperature is necessary. Based on Fig. 2, the future precipitation and temperature are simulated using the effective climatic predictors selected among HadGCM3 output data and through SDSM. The predicted rainfall is converted to the runoff by IHACRES (Identification of unit Hydrographs and Components from Rainfall, Evaporation and Streamflow) developed by the Integrated Catchment Assessment and Management (ICAM) centre of Australian National University to estimate inflow to the reservoir. This model performs well in simulation of streamflow and its variation scheme, but it is not capable of simulating the peak values. To overcome this shortcoming of IHACRES, a Multi Layer Perceptron (MLP) model of ANNs has been trained to modify the peak values.

Based on the estimated streamflow, water resources availability in the following years with respect to the growing water demand is evaluated. Then, drought indices of the SPI (Standardized Precipitation Index), SWSI (Surface Water Supply Index) and RDI (Reconnaissance Drought Index) are calculated to project the probable drought periods and their intensity in the future. Please see Morid et al. (2006), Karamouz et al. (2007, 2009), Karamouz and Araghinejad (2008), Clausen and Pearson (1995), Abbaspour and Sabetraftar (2005), Khalili et al. (2011), Tabrizi et al. (2010), Tsakiris and Vangelis (2005), Vangelis et al. (2010) and Tsakiris et al. (2006) for more information on SPI, SWSI and RDI and their applications for drought analysis in Iran and elsewhere. In the last step, the hedging rule is utilized to develop the reservoir operating policies during the projected drought periods. In the hedging rule, a stationary reservoir operation scheme is provided to avoid the severe water shortage in case of long drought periods. In the following sections the methods and models used in this study are described.

3.1 Statistical Downscaling Method

In spite of the value of GCMs at the continental and hemispherical scales for assessing precipitation and temperature projections, they are inherently unable to demonstrate the characteristic and dynamics at the local sub-grid scale (Wigley et al. 1990; Carter et al. 1994). Therefore, GCM outputs should be converted into the local meteorological variables for reliable hydrological modeling. Methods used for this purpose are referred to as “downscaling techniques”. A well recognized statistical downscaling tool that implements a regression based method is named SDSM (Wilby et al. 2002). The SDSM package is available at the Canadian climate impacts and scenarios project site (http://www.cics.uvis.ca/scenarios/). The most important advantages of the statistical downscaling method are the capability of working with limited data and the high speed of analysis. Full technical details of the SDSM are provided by Wilby et al. (1999). Within the classification of downscaling techniques, SDSM is best described as a hybrid of the stochastic weather generator and regression based methods. This is because large-scale circulation patterns and atmospheric moisture variables are used to linearly condition local-scale weather generator parameters. The downscaling algorithm of SDSM has been applied to a host of meteorological, hydrological and environmental assessments, as well as a range of geographical locations such as Europe, North America and Southeast Asia (Hassan et al. 1998; Wilby et al 2000; Hay et al. 2000). The stochastic component enables generation of multiple simulations with slightly different time series attributes, but the same overall statistical properties. This downscaling method has been widely used within the climate impact community (Dibike and Coulibaly 2005; Wilby and Harris 2006; Prudhomme 2006). Using the daily GCM outputs, daily rainfall series are generated for future years using SDSM in which a multiple linear regression is developed between selected large-scale predictors and a local scale predictands such as temperature or precipitation in each month. The parameters of the regression equations are estimated using the efficient dual simplex algorithm. Large-scale relevant predictors are selected using correlation analysis, partial correlation analysis and scatter plots considering physical sensitivity between selected predictors and predictands in the region.

For precipitation downscaling, the predictors describing atmospheric circulation such as thickness (different atmospheric layers), different indices of wind velocity such as vorticity, zonal velocity and moisture content such as specific and relative humidity, in different altitudes, are preferred (Wilby et al. 2002). The utilization of SDSM includes five distinct tasks: (1) preliminary screening of potential downscaling predictors; (2) assembly and calibration of SDSM; (3) synthesis of ensembles of current weather data using observed predictor variables; (4) generation of ensembles of future weather data using GCM-derived predictor variables; (5) diagnostic testing/analysis of the observed data and climate change scenarios. In this study SDSM version 3.1 has been used.

3.2 IHACRES Model

There are different hydrological models used for rainfall-runoff modeling with special characteristics and limitations. Jakeman and Hornberger (1993) showed that the IHACRES model could be applied to a river basin with limited data of acceptable accuracy. This model requires river basin size (m2), time series of daily rainfall, daily streamflow data for model calibration and a surrogate variable representing evaporation where daily air temperature data is usually used. The model simulates the daily streamflow based on the input data.

The procedure followed in this model to convert rainfall to runoff. First, the recorded rainfall, r k , is converted to effective rainfall u k using a non-linear loss module.

$$ {u_k} = {S_k} * {r_k} $$
(1)

where S k (river basin wetness index) is computed at each time step k on the basis of recent rainfall and temperature as follows:

$$ {S_k} = C \times {r_k} + \left( {1 + \frac{1}{{{\tau_w}\left( {{t_k}} \right)}}} \right){S_{k - 1}} $$
(2)
$$ {\tau_w}\left( {{t_k}} \right) = {\tau_w}{e^{0.062f\left( {R - {t_k}} \right)}} $$
(3)

where R is the reference temperature and C is determined based on the mass balance between effective rainfall and runoff in the calibration period. τ w (t k ) is always more than one and S0 = 0. The underlying conceptualization of this module is that a river basin’s wetness varies with recent rainfall and evapotranspiration. Two major parameters in this model are τ w and f. Parameter τ w (the river basin drying time constant) is a value of τ w (t k ) at a reference temperature, in which τ w (t k ) controls the rate in which the river basin wetness index (S k ) decays in the absence of rainfall. Parameter f (the temperature modulation factor) controls the sensitivity of τ w (t k ) to changes in temperature, t k . In the second step, a linear unit hydrograph (UH) module converts effective rainfall to streamflow, X k .

The linear module allows the application of the well-known unit hydrograph theory which conceptualizes the river basin as a configuration of linear storages acting in series and/or parallel. The configuration of linear storage in the UH module which is allowed in IHACRES includes single storage and two storage units, in series or parallel. The optimal pair of w , f) is identified by trial and error for a given configuration of simple UH’s and a given value of the pure time delay between rainfall and runoff occurrence. Then the model automatically estimates the relevant parameters for a subsequent simulation.

3.3 Hedging Rule

Hedging is known as a means of withholding water before and during the drought periods (Bower et al. 1962). Therefore a lower deficit is accepted in the current supply in order to avoid more severe water shortages in the following periods. The hedging rule is the reservoir operation rule, which reduces deliveries of some portion of water supply to retain storage for use in the dry periods and provide assurances for higher priority water usage when the reservoir has low refill potential or uncertain inflows. As it is shown in Fig. 3, during normal periods of operation, when inflows are high and the reservoir is almost full, the operating rule efficiently allocates water to meet the demands imposed by competing users. During a drought, or when a drought is anticipated, as the reservoir inflow decreases and reservoir storage drops, the planned demands cannot be fully met. By considering the hedging rules along with the rule curves, guidelines are provided for reservoir release. To minimize the impact of drought, the hedging rules effectively reduce the ongoing water supply rate to offset the target storage to its target requirement.

Fig. 3
figure 3

Different types of Hedging rules in comparison with the Standard Operation Policy (SOP) and the conservation scheme of operation rules

Based on Lund and Guzman (1996), there are three types of hedging rules as follows (Fig. 3):

  • One-point hedging: In this model, the water release starts from nothing and increases linearly (at a slope <1) until intersecting with the actual water demand (Shih and Revelle 1994)

  • Two-point hedging, where a linear hedging rule begins from a first point (parameter) occurring somewhere above the origin on the shortage portion of the SOP rule to a second point occurring where the hedging slope (<1) intersects with the actual water demand (Bayazit and Unal 1990; Srinivasan and Philipose 1996)

  • Three-point hedging, where an intermediate point is specified in the two-point hedging rule, introducing two linear portions to the hedging portion of the overall release rule.

Draper and Lund (2004) tested the three types of hedging rules and concluded that the two point hedging rule is better in comparison with other hedging rules. Therefore, in this study, the two point hedging rule is employed. To develop hedging rules, the model proposed by Tu et al. (2003) is extended considering two distinct water demand categories. The objective function of this model is maximizing the water allocation to water demand categories as well as the reservoir storage.

$$ MAX\quad \sum\limits_{t = 1}^T {\left( {R{P_t}.{W_p} + R{A_t}.{W_a} + S{t_t}.{W_s}} \right)} $$
(4)

where T is the decision time period duration (months), RP t is the allocated water to industrial, municipal, and instream demands, namely overall public demand, in month t, RA t is the allocated water to agricultural demand in month t, St t is the reservoir storage at the beginning of month t and W p , W a and W s are weighting factors for public demand, agricultural demand and reservoir water storage, respectively. In this study, the weighting factors for public demand, agricultural demand and reservoir water storage are considered as 0.55, 0.25, and 0.2 respectively based on the priorities and stakeholders’ dependencies for different sectors.

The following assumptions are made in developing the optimization model: (1) All of water demand is supplied from the surface water (2) No loss occurs in domestic water supply. In order to apply hedging rules in reservoir operation, a set of constraints are defined as follows:

$$ S{t_t} + {I_t} - {R_t} - {E_t} = S{t_{t + 1}} $$
(5)
$$ S{t_{\min }} \leqslant S{t_t} \leqslant S{t_{\max }} $$
(6)
$$ S{t_t} = {\lambda_{1t}}S{t_{1,t}} + {\lambda_{2t}}S{t_{2,t}} + {\lambda_{3t}}S{t_{3,t}} $$
(7)
$$ {\lambda_{1,t}}.S{t_{\min }} \leqslant S{t_{1,t}} < {\lambda_{1,t}}.S{t_{Buffer}} $$
(8)
$$ {\lambda_{2,t}}.S{t_{Buffer}} \leqslant S{t_{2,t}} < {\lambda_{2,t}}.S{t_{target}} $$
(9)
$$ {\lambda_{3,t}}.S{t_{target}} \leqslant S{t_{3,t}} \leqslant {\lambda_{3,t}}.S{t_{\max }} $$
(10)
$$ {R_t} \leqslant \left( {{\lambda_{1,t}}.{\alpha_1} + {\lambda_{2,t}}.{\alpha_2} + {\lambda_{3,t}}} \right).{D_t} $$
(11)
$$ {\lambda_{1,t}} + {\lambda_{2,t}} + {\lambda_{3,t}} = 1 $$
(12)
$$ {\lambda_{1,t}} = \left\{ {\begin{array}{*{20}{c}} 1 \hfill & {\quad \quad \quad \quad if\;s{t_t}\;is\;in\;zone\;1} \hfill \\ 0 \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(13)
$$ {\lambda_{2,t}} = \left\{ {\begin{array}{*{20}{c}} 1 \hfill & {\quad \quad \quad \quad if\;s{t_t}\;is\;in\;zone\;2} \hfill \\ 0 \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(14)
$$ {\lambda_{3,t}} = \left\{ {\begin{array}{*{20}{c}} 1 \hfill & {\quad \quad \quad \quad if\;s{t_t}\;is\;in\;zone\;3} \hfill \\ 0 \hfill & {otherwise} \hfill \\ \end{array} } \right. $$
(15)

where I t is inflow; R t is reservoir release; E t is evaporation loss from the reservoir; all of them in month t. St min is the minimum storage of the reservoir; St max is maximum storage of the reservoir; S target is target storage; S Buffer is buffer storage; D t is water demand at month t; λ 1,t 2,t λ 3,t are binary variables which determine the reservoir storage class (which are described in the following paragraph) according to Fig. 3; α 1 , α 2 are rationing factors used in order to limit the water supply in dry periods as shown in Fig. 3. In this study, reservoir storage is divided into four pools. These include, from top to bottom, the flood-control zone (Smax to Starget), conservation zone (Starget to Sbuffer), buffer zone (Sbuffer to Smin) and inactive zone (less than Smin(.The storage between conservation and buffer pools, together, constitute the reservoir’s active storage. The buffer zone is determined based on the reservoir volume at the level of the lowest (first) reservoir outlet for water supply and the conservative zone corresponds to the volume below the top of the second reservoir outlet. Based on the design characteristics of Sattarkhan dam, minimum storage, maximum storage, target storage, and buffer storage are considered as 9, 132, 57.5, and 24 respectively.

The continuity equation for the reservoir is written as Eq. 5. During each time period t, the relationship between the rule curve and the hedging rule is shown as the step function (Fig. 3). According to Fig. 3, the storage volume is divided into three classes where the first class varies between the minimum storage and buffer levels, the second class is between buffer and target storage levels, and the volume between target and maximum storage levels is considered as the third class.

In each time period the amount of the reservoir storage, St t , will be placed in one of these three classes (Eqs. 8, 9 and 10). To define the storage class in the modeling process, three binary variables of λ 1,t , λ 2,t , λ 3,t , are determined using Eqs. 13 to 15. Equation 12 ensures that in each time step only one of the λ 1,t 2,t , λ 3,t could be active regarding to the reservoir storage class. With respect to the activated binary variable, the appropriate hedging rule for water release is developed. In the developed hedging rule, two rationing variables α 1 , α 2 are used to determine the percentage of the supplied demand depending on the reservoir storage class (Eq. 11). The rationing variables are determined at the planning stage and could be modified to meet operational purposes in different situations.

As it is shown in Fig. 3, when the reservoir storage falls below the target storage, while employing the hedging rule, the reservoir release is reduced regarding the severity of drought in order to provide a sustainable water supply scheme during the drought period. But in application of the SOP strategy, if the water supply is less than the target demand D t , all available water is released; no storage remains.

4 Results

4.1 Rainfall Projection Under Climate Change Impacts

The results of statistical downscaling are highly dependent on the assumption that the data is outlier-free, nearly normal, and the data is not serially correlated. If the applied data does not satisfy these assumptions, the results may be misleading. Therefore, the adequacy of data is checked at the first step. Based on the statistical evaluation of data, there was no outlier in the collected data. The normality test of rainfall and temperature data showed the need for using transformation for normalizing the data. Among different transformation methods, the fourth root transformation has been selected to normalize both the rainfall and temperature data. The autocorrelation analysis of rainfall data using ACF (Autocorrelation Function) diagrams at different lag times did not show any significant correlation in the 95% confidence level. The same analysis for temperature data showed that the first serial correlation is significant. These revealed characteristics are considered in further evaluation of the downscaling results.

To select the best set of rainfall predictors in the region, the relation between different combinations of climatic predictors and regional rainfall has been examined through evaluation of physical relationship and statistical tests. The results showed that different combination of predictors should be used for rainfall simulation in different months of a year. This is due to different climatic systems that affect rainfall variations during a year. In fact, the first group includes two predictors: Relative 500 hPa (hectopascal) Geopotential Height and Humidity at 500 hPa. These predictors are used for rainfall projection of all months except April, May and June (spring season). The predictors of the second group that are used to project spring rainfall include surface divergence, mean sea level pressure and 500 hPa vorticity.

The downscaling model has been calibrated during 1971–1980 and the results have been validated for 10 years during 1981–1990. Global reanalysis of NCEP (National Centers for Environmental Prediction) are used to find relationships between large-scale predictors and local predictands in model calibration. The NCEP reanalysis data are developed by NCEP and National Center for Atmospheric Research (NCAR) through a project (denoted “reanalysis”) and produce a retroactive record of more than 50 years of global analyses of atmospheric fields in support of the needs of the research and climate monitoring communities.

4.1.1 BSS Quantification

In this study, 20 ensembles of future rainfall and temperature series are developed arbitrarily. Due to large number of ensembles, each of which have long time periods, the most appropriate ensemble among generated data is selected for further analysis. For this purpose the Briers Skill Score (BSS) is employed. BSS compares different set of model predictions, Y, against a baseline prediction, B, and a set of model observations, X, and is estimated as follows (Spearman et al. 2004),

$$ BSS = 1 - \frac{{MSE\left( {X,Y} \right)}}{{MSE\left( {B,X} \right)}} $$
(16)

where MSE(X,Y) is the mean square error given by \( MSE\left( {X,Y} \right) = \frac{1}{n}\sum\limits_i {{{\left( {{Y_i} - {X_i}} \right)}^2}} \). The BSS gives a value of 0 if the prediction set of Y is as well as the baseline prediction, B. The BSS gets a maximum value of 1 if the prediction set of Y is 100% accurate (i.e. Y=X) and a negative score if the prediction set of Y is worse than the baseline. In this study, the first ensemble has been considered as baseline prediction and other ensembles are compared with it. The results are tabulated in Table 3. Based on this table, the 9th, 14th and 20th ensembles of NCEP, Had A2 and Had B2 rainfall downscaling data are selected respectively, for further analysis. They have the maximum value of BSS in each case.

Table 3 The results of BSS calculation to select the most appropriate ensembles of downscaled data of rainfall and temperature

In Fig. 4, the monthly mean of downscaled (using NCEP data) and observed rainfall are compared. The maximum difference between the observed and simulated monthly rainfall is about 15% in February. The downscaling results are underestimated in August-November and overestimated in the other months, but the differences are not significant.

Fig. 4
figure 4

Comparison between the monthly mean of downscaled and observed a rainfall (1971–2000) b temperature (1986–2000)

To ensure the appropriate performance of the downscaling model, regarding to the characteristics of the observed data, nonparametric methods are employed to test the equality of mean and variance of the downscaled and observed data. The applications of these tests are discussed in the following sections.

4.1.2 Wilcoxon Rank Sum Test

For constructing a hypothesis test p-value for equality of means of observed and downscaled data (difference of two population means), the Wilcoxon rank sum method is used (Sajjad Khan et al. 2006; Conover 1980). A detailed description of the theory of Wilcoxon rank sum test can be found in Conover (1980). In terms of hypothesis testing, p-value is the level of significance for which observed test statistic lays on the boundary between acceptance and rejection of the null hypothesis (equality of means). At any significance level greater than the p-value, the null hypothesis is rejected, and at any significance level less than the p-value the null hypothesis is accepted. MATLAB 7.0 software is used for performing this test. The results of this test application are given in Table 4. As can be seen, the results are all meaningful in 5% significance level except in May and April.

Table 4 The results of performing Levene and Wilcoxon rank sum tests to evaluate the performance of downscaling model in simulating temperature and rainfall data in the study area

4.1.3 Modified Levene’s Test

Modified Levene’s test suggested by Brown and Forsythe (1974) is used in this study to test the equality of variance of downscaled and observed data. Modified Levene’s test is used when the data come from continuous, but not necessarily normal, distributions. In this method the distances of the observations from their sample median are calculated (Sajjad Khan et al. 2006). MINITAB 13.0 is used for performing Modified Levene test. The results of this test are the same as the Wilcoxon text (Table 4). The null hypothesis of accepting the variance is rejected at 5% significance level in months of May and April when the downscaled data are accepted within 1% significance level.

In Table 5, the monthly maximum, minimum, mean, and variance of observed and downscaled rainfall are compared during 1971–2000. Based on these results, the developed model could be used to simulate the monthly rainfall pattern of the study area.

Table 5 Comparison between observed and simulated rainfall data (in mm, 1971–2000)

Massah Bavani (2006) demonstrated that HadCM3 GCM model has the best performance in application to Iran climate change assessment studies. Then the outputs of HadCM3 GCM model for Had A2 and Had B2 climate change scenarios are imported to SDSM software to exemplify the probable changes of region climate under climate change impacts.

In Table 6, the scenarios Had A2 and Had B2 are compared with the observed rainfall data during 1971–2000. Since there is no significant difference between results of two climate change scenarios, the results of scenario Had B2 are used for further analyses. In Fig. 5, the monthly mean of observed and Had B2 scenario results are compared during the period of 1971–2000. The investigation of climate change scenarios on the rainfall shows that the rate of rainfall fluctuates with a decreasing trend.

Table 6 Comparison of observed rainfall data with Had A2 and Had B2 (in mm, 1971–2000)
Fig. 5
figure 5

Comparison between the monthly mean of Had B2 scenario and observed a rainfall (1971–2000) b temperature during (1986–2000)

4.2 Future Temperature Projection

Since temperature data is needed in the rainfall-runoff modeling by IHACRES model, it has been downscaled in this study using the same procedure as rainfall downscaling. Two predictors of 500 hPa geopotential height and mean sea level pressure have been identified for temperature simulation. The model has been calibrated for years 1986–1995 (because of the availability of the observed data in this period) where the correlation coefficient between effective predictors and temperature in the calibration period is 76% which is significant in climate studies. The model is validated for period of 1996–2000. The better ensembles of downscaled temperature data are selected based on calculated BSS for different scenarios (Table 3). The monthly mean of downscaled and observed temperature are compared in Fig. 4.

In order to test the equality of mean and variance of the downscaled and observed temperature data, the Levene and Wilcoxon rank sum tests are employed (Table 4). The results of some months do not satisfy the criteria of mean and variance equality of observed and downscaled data but since temperature data is only used as an indicator for considering snow melt in rainfall-runoff model, the downscaled data could be applied with the current level of accuracy.

Since there is no significant difference between results of two considered climate change scenarios and for compatibility with rainfall data, the result of scenario Had B2 has been used for further analyses. The comparison between monthly mean of Had B2 scenario and observed temperature is shown in Fig. 5. The investigation of climate change effects on the temperature during 2004–2042 shows that there is an increasing trend in the temperature as there is an almost 0.8 centigrade degree increase each 20 years after 2004 (the final year of available data for this study).

4.3 Runoff Prediction Using Rainfall-Runoff Model

Parameters f, τ w , δ (time delay) and the hydrograph condition are determined by trial and error to minimize the runoff simulation error in the calibration period. Two indices of coefficient of determination, D, and percentage average relative parameter error, ARPE, are used to determine the simulation error when calibrating IHACRES. High D and low ARPE are desirable. These parameters are formulated as follows.

$$ D = 1 - \frac{{{{\left[ {{\sigma_\xi }} \right]}^2}}}{{{{\left[ {{\sigma_y}} \right]}^2}}} $$
(17)
$$ ARPE = \left[ {{{\left[ {\frac{{{\sigma_{a1}}}}{{{a_1}}}} \right]}^2} + {{\left[ {\frac{{{\sigma_{a2}}}}{{{a_2}}}} \right]}^2} + {{\left[ {\frac{{{\sigma_{bo}}}}{{{b_0}}}} \right]}^2} + {{\left[ {\frac{{{\sigma_{b1}}}}{{{b_1}}}} \right]}^2}} \right]/4 $$
(18)

where σ denotes standard deviation and, ξ and y subscripts correspond to model residuals and observed streamflow, respectively. The parameters of Eq. 18 are calculated using Eqs. 19 to 22, in which b and a are the unit hydrograph (UH) parameters and, s and q stand for slow and quick unit hydrographs, respectively.

$$ {b_0} = b_0^{(q)} + b_0^{(s)} $$
(19)
$$ {b_1} = b_0^{(q)}a_1^{(s)} + b_0^{(s)}a_1^{(q)} $$
(20)
$$ {a_1} = a_1^{(q)} + a_1^{(s)} $$
(21)
$$ {a_2} = a_1^{(q)}.\;a_1^{(s)} $$
(22)

Consider the simple discrete-time hydrograph in which unit effective rainfall over one time step produces streamflow (b < 1) over the same time step. In each subsequent time steps, streamflow is a fixed proportion (a < 1) of what it was in the previous time step and thus the flow decays exponentially (at a rate determined by a). In this study, the rates of exponential decay for the two simple UHs, one is ‘quick’ (q) and another is ‘slow’ (s), that are combined to give a UH for total streamflow. In Eqs. 19 to 22, \( b_0^q > 0,\quad - 1 < a_1^q < 1 \) and \( b_0^s > 0, - 1 < a_1^s < 1 \) are the quick and slow streamflow components of UH, respectively.

In the non-linear loss module, the ranges are selected for the parameters f and τ w . For each pair of these parameters, D and ARPE are calculated to select the best pair. In this study, τ w has been changed in the range of 1–100 by using a model step of 1 and the range of 1–10 has been considered for variations of f by using model step of 0.1. Finally, values of 3.2, 5 and 0 are estimated for parameters f, τ w and δ, respectively based on D and ARPE parameters. In addition a single hydrograph scheme has been determined as the best system that presents the rainfall-runoff process in the basin.

The accurate determination of the calibration period is important in achieving accurate simulation of basin runoff using the IHACRES model; therefore, different calibration periods are checked and finally the period of 1988–1997 is selected in developing the final runoff simulation model. Because of the missing data in the OG station between years 1999 to 2000, the model has been validated for the period of 2002–2004.

The daily simulated runoff data using IHACRES are converted to monthly timescales in order to be used in reservoir optimization model. The results of runoff simulation in the calibration and validation periods in the monthly time scale are shown in Fig. 6. According to this figure, although this model can simulate the value and trend of runoff, it is not capable of simulating the peak values. This could be due to simplifications made in this model which could not consider some phenomena affecting runoff variations such as snow budget or flash floods. To improve the results of the IHACRES model in simulating the extreme values, a Multi Layer Perceptron (MLP) model of ANNs has been trained.

Fig. 6
figure 6

Comparison between the monthly observed run-off and (a) IHACRES simulated, (b) MLP simulated in calibration and validation period

The inputs of MLP model are the observed rainfall and runoff in the present month and the simulated runoff by IHACRES for the next month, and the output is the modified runoff for the next month. The MLP with one hidden layer has been trained for the period of 1988–1997. The years between 1997 and 2000 are not considered in the model calibration due to lack of data during these years that may result in high errors in the model performance. The optimal number of neurons in the hidden layer and training epochs has been determined by trial and error. The range of 2–15 has been checked for the number of neurons in the hidden layer and training epochs between, 500 and 4,000 by 500 steps has been considered. Finally, 12 neurons in the hidden layer and training epoch of 500 have been chosen.

The results of the MLP model have been compared with the observed values as shown in Fig. 6 for calibration and validation periods. The comparison between parts (a) and (b) of the Fig. 6 demonstrates that the percentage of error in MLP model is 13% less compared to the IHACRES model.

The performances of developed models are evaluated based on E (Nash–Sutcliffe model efficiency coefficient), MAE (Mean Absolute Error) and RMSE (Root Mean Square Error) indices as demonstrated in Table 7. These indices are quantified as follows:

$$ MAE = \frac{{\sum\limits_{t = 1}^n {\left| {X_p^t - \left. {X_o^t} \right|} \right.} }}{n} $$
(23)
$$ RMSE = \sqrt {{\frac{{\sum\limits_{t = 1}^n {{{\left( {X_p^t - X_o^t} \right)}^2}} }}{n}}} $$
(24)
$$ E = 1 - \frac{{\sum\limits_{t = 1}^n {{{\left( {X_o^t - X_P^t} \right)}^2}} }}{{\sum\limits_{t = 1}^n {{{\left( {X_o^t - {{\overline X }_o}} \right)}^2}} }} $$
(25)

where n denotes the number of data, and \( X_o^t \) and \( X_p^t \) correspond to the observed and simulated runoff at time t, respectively. \( {\overline X_o} \) denotes the mean of the observed values. A value equal to one corresponds to a perfect match of simulated runoff to the observed data. The range of E lies between 1.0 (perfect fit) and −1.0. An efficiency of 0 indicates that the model predictions are as accurate as the mean of the observed data, whereas an efficiency of lower than zero indicates that the simulated value is not as good as the observed value (Krause et al. 2005).

Table 7 Comparison between IHACRES and modified IHACRES-ANN output

As it is shown in Table 7, the improvement of the MLP model in comparison with the IHACRES model is 30% and 60% based on MAE and RMSE respectively. As for the Nash–Sutcliffe efficiency coefficient, in calibration period, both of the models have the same performance, but in the validation period, the MLP predictions are more reliable than IHACRES model, since the efficiency of the IHACRES simulation is less than zero.

4.4 Determination of Probable Drought Periods

The results of the streamflow simulation showed that the average streamflow during 2005–2023 in comparison with historical data (1986–2004) will decrease about 13% but during 2024–2042 it will increase about 17% in the study area. In addition, the results show that in the future, the time of maximum streamflow occurrence at the OG station will be shifted from April to June, which should be considered in deriving reservoir operation policies.

The pre-assessment of reservoir operation during the drought periods plays an important role in mitigation of the expected damages. Therefore, projecting probable drought periods in future and their characteristics is necessary. In this study, drought indices of SPI, SWSI and RDI are calculated to determine the probable drought periods in future.

The SPI is formulated by Mckee et al. (1993). This index assigns a single numeric value to the precipitation which can be compared across regions with markedly different climates. Technically, the SPI is the number of standard deviations that the observed value would deviate from the long-term mean, for a normally distributed random variable. Since precipitation is not normally distributed, a transformation is first applied so that the transformed precipitation values follow a normal distribution.

The Surface Water Supply Index (SWSI) is a predictive indicator of total surface water availability within a watershed. SWSI values are scaled from +4.2 (abundant supply) to −4.2 (extremely dry) with a value of zero (0) indicating media water supply as compared to historical analysis. The formulation of SWSI is given as

$$ SWSI = \frac{{P - 50}}{{12}} $$
(25)

where P is the single probability of summed expected streamflow and current reservoir storage (Garen 1993).

The RDI index is developed by Tsakiris and Vangelis (2005). In this index, the ratio of precipitation over potential evapotranspiration (ETp) is used as a representative of the wetness condition for different time scales in the study region. For yearly time scale this index is estimated as follows:

$$ \alpha_0^{(i)} = \frac{{\sum\limits_{j = 1}^{12} {{P_{ij}}} }}{{\sum\limits_{j = 1}^{12} {PE{T_{ij}}} }},i = 1, \ldots N\quad and\quad j = 1, \ldots 12 $$
(26)
$$ RDI_{st(k)}^{(i)} = \frac{{y_k^{(i)} - {{\overline y }_k}}}{{{{\widehat{\sigma }}_{yk}}}} $$
(27)

where P ij and PET ij are precipitation and potential evapotranspiration of the jth month of the ith year and N is the total number of years of the available data. \( \alpha_0^{(i)} \) corresponds to the initial value of the index. \( RDI_{st(k)}^{(i)} \)is the Standardized form of RDI which is interpreted similarly to SPI for determining the wetness condition. y k is the ln\( \alpha_0^{(i)} \), \( {\overline y_k} \) is the arithmetic mean and \( {\widehat{\sigma }_{yk}} \) is the standardized deviation (Tsakiris et al. 2006).

In addition to these drought indices, the variations of streamflow are considered to find the probable drought periods in the future. The identified drought periods with duration more than 2 years are considered as possible severe drought periods in future for examining the hedging rules performance on Sattarkhan reservoir operation. The characteristics of the identified drought periods are presented in Table 8. The severity of droughts is determined by the summation of water shortages or negative values of RDI in the drought period. RDI is a better representation of drought condition in the study region because it considers both rainfall and evapotranspiration. Among the four identified drought periods in Table 8, the 3rd and 4th drought events which are the most severe and longest drought periods, are considered for developing hedging rules. If the hedging rule could mitigate these severe water shortages then it can better handle low flow conditions.

Table 8 Characteristics of the possible severe drought periods in the future

4.5 Application of Hedging Rules

The Sattarkhan reservoir has been operating since 2000, therefore the operating data between 2000 and 2004 has been used for calibrating the reservoir hedging rules considering a constant seasonal pattern of water demand.

In developing the optimization model of reservoir operation, two alternatives are considered as follows:

  • Alternative 1: All of the water demands have been assumed to be in one group regarding the objective functions by the assumption that the environmental demand is fully supplied (the minimum reservoir release is set equal to instream (environmental demand)).

  • Alternative 2: Two groups of water users have been considered in the objective function: a) the domestic, industrial and instream (environmental demand) (Public demand); b) the agricultural water demand. It is assumed that at least the instream (environmental demand) in the first group and the water demand of orchards in the second group must be supplied.

The results show that the performance of hedging in water allocation in both of the alternatives is almost the same. However, the reservoir storage volume in the second alternative is 30% more than the reservoir storage of the first alternative during the operation horizon. Indeed, the developed hedging rule in the second alternative is more effective in maintaining the water in the reservoir to be used in the forthcoming dry periods. Therefore for the rest of this study the second alternative is considered to achieve the optimal operating policies in the future years.

Different combinations of rationing factors are considered for water allocation to two groups of water users in developing hedging rules. The ranges of rationing factors lie between 0 and 1. In this study, the optimization model has been run for different combinations of public and agricultural demands. Within the ranges of demand rationing factors, more than 30 combinations are defined for model development. The six most appropriate combinations of these factors which have better performance in developing reservoir operation schemes are presented by number 1–6 in Table 9. In these combinations α1, α2 vary for public demand and agricultural demand in the range of 0.8–1 and 0.3–0.7, respectively. The monthly average of water supply and water shortage during the operation period for each of these combinations are also presented in this table. SR, ShP and ShA variables in Table 9 refer to the reservoir storage volume and the percentages of water shortage in public and the agricultural demands, respectively. As public demand has a priority over agricultural demand; therefore, models 3 and 5 which have the least shortage in the public demand supply have been selected as the better models.

Table 9 The results of sensitivity analysis on the rationing factors of different water users

The values of rationing factors play an important role in determination of the percentage of water shortage in different groups of consumers. Therefore, in models 3 and 5 which have the same rationing factors as in public demand, the increase of rationing factors of agricultural demand has decreased the amount of water shortage from 46.2% to 38.55%. In this case the amount of water shortage in public demand has been increased from 0.26% to 0.36%.

Although model 3 provides more water storage in comparison with model 5, due to less water shortage of model 5 in agricultural group, model 5 has been selected as the best model in this study.

The selected hedging model can identify the drought periods and trigger the reservoir operation accordingly. The operating model limits the water supply in normal years and stores water for the coming drought years. This way, the expected damages in the drought periods decrease considerably. It should be noted that in the developed model the release is always more than the environmental demand due to the constraint considered for minimum release from the reservoir.

4.6 Development of Policies for Dry Periods

For estimation of future water demand, the data provided in Karamouz (2010) are used. Regarding this study, the irrigation efficiency and water loss in the domestic section have been considered to be equal to 70% and 10%, respectively and the variations of demand in future for different sectors are given in Table 1.

Based on the estimated water demands for the future (Table 1), the water demand is almost constant during the months of November to March. After these months, the demand increases to reach its maximum value in July and then declines. The water demand gets its maximum value in July because of the high temperature resulting in the high amount of evaporation and transpiration; moreover, the most sensitive growth period of plants to water deficit is in this month.

The operation curves are developed for the 3rd and 4th drought events, which are the most severe and longest drought periods, based on Eqs. 5 to 15 and rationing factors of model 5 in Table 9. The results of the reliability of chosen dry periods in each year are given in Table 10. Also, the results of the application of hedging rules with parameters of model 5 to the Sattarkhan reservoir during the 3rd and 4th drought events as well as SOP releases are shown in Fig. 7. This figure shows that the allocated water of the hedging rule in the 3rd drought period is close to the water demand in most of the months. According to this figure, in the 4th drought period the simulated releases do not satisfy the water demand primarily during the summer and spring. In this period, the average, maximum, and minimum monthly water shortage are equal to 1.9 MCM 4.5 MCM (in May), and 0.1 MCM (in March), respectively. There is an almost constant water shortage in each month which shows the stability of system operation during the drought periods which is desirable in developing drought damage mitigation programs in the study area. Also the comparison between the hedging rule results with the standard operation rule during (2000–2004) shows that using the hedging rules for water supply is more compatible with the demand variation and as a result the intensity of water shortages has been decreased. This figure shows that hedging rule has a much better performance in water allocation during drought periods in comparison with SOP. Using SOP serious water shortages especially in periods of high water demand could be resulted.

Table 10 The reliability of water demand supply during 3rd and 4th dry periods
Fig. 7
figure 7

The monthly reservoir release in comparison with water demands in 3th and 4th dry periods

The analysis of monthly results of application of hedging rules and SOP in 3rd and 4th dry periods (Fig. 8) shows that by application of hedging rules more reasonable and sustainable water supply scheme during drought periods can be achieved. Using hedging rules in two considered drought periods, has decreased the maximum percentage of water shortage in 3rd and 4th dry periods from 0.93 and 0.88 to 0.82 and 0.78, respectively.

Fig. 8
figure 8

The monthly time series of reservoir release under application of SOP and Hedging rule (a) at the 3rd drought period (b) at the 4th dry period

5 Conclusion

Water resources planners and decision makers in recent years are paying more attention to preparedness of systems to deal with water shortages before drought occurrence. In this approach, the probable drought risks from magnitude, severity and duration aspects are projected before occurrence of disaster and the system is prepared to deal with them through development of appropriate policies. The water resources system readiness in dealing with water shortage could highly decrease the water deficit and socio-economical problems. This has become more important in recent years because of the climate change impacts resulting in increased frequency and severity of droughts.

In this paper, an integrated drought management approach in dealing with drought events with various characteristics under climate change impacts is developed. The proposed algorithm has been applied to the Aharchay basin in the northwestern part of Iran. The results of climate change studies show a decreasing trend of rainfall and runoff in the study area that will affect agricultural based economical activities as well as people’s lifestyle in the region. This also results in increasing water shortage periods and therefore necessity of drought mitigation strategies. In the two 20 year periods considered, in the second period, there is a severe water shortage in spring and summer that water demand cannot be supplied. To manage real time operation of Sattarkhan reservoir, hedging rules have been applied to reduce water shortages. This results in a more stable water supply scheme for drought periods and provides a lead time to apply water conservation strategies for the better use of the available water resources.

The results of this study show the successful application of the proposed algorithm in mitigating the severe drought periods in the region considering climate change impacts. The proposed algorithm can be applied to other basins with different characteristics but special care should be taken to incorporate the regional characteristics and the specific priorities in water supply and allocation in adopting the proposed algorithm. Applying this algorithm could decrease the scarcity of water supply in a region and serve as a useful tool in sustainable management of water resources.