1 Introduction

An increase in greenhouse gas emissions is raising the earth’s average temperatures and resulting in climate change around the world. This has changed the frequency, magnitude and persistence of extreme events of various climatic variables, including rainfall, which has been identified in observational records (Alexander et al. 2006; Alexander and Arblaster 2009; Evans et al. 2009; Frich et al. 2002). According to the Intergovernmental Panel on Climate Change (IPCC) Fourth Assessment Report (AR4, IPCC 2007), climate change has started to affect the frequency, intensity and duration of extreme temperature, rainfall and drought almost everywhere around the world over the late twentieth century and this will continue in future. This is also confirmed by the recent assessment by IPCC in the special report on Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation (SREX, IPCC 2012). Projections of future changes in climate extremes are of particular interest due to their adverse effects on society and ecosystems.

Significant changes in extreme temperature and rainfall during the twentieth century across Australia have already been identified in previous studies (Alexander et al. 2006; Alexander and Arblaster 2009; Alexander et al. 2007; Collins et al. 2000; Evans et al. 2009; Hennessy et al. 1999; Plummer et al. 1999; Suppiah and Hennessy 1998). Over the last century, a significant decrease in the frequency and intensity of extreme rainfall events was observed in the southwest region of Western Australia. It has also been reported that there has been a significant increase in the proportion of total rainfall from extreme events in eastern Australia (Haylock and Nicholls 2000; Li et al. 2005). Alexander and Arblaster (2009) observed that Australia is likely to shift toward increased temperature extremes and much longer dry spells with increased extreme rainfall according to the Coupled Model Intercomparison Project Phase 5 (CMIP5) general circulation models (GCMs). Sillmann et al. (2013) concluded that according to CMIP5 GCM results, Australia will experience dry conditions due to an increase in consecutive dry days coinciding with a decrease in both heavy precipitation days and maximum consecutive 5-day precipitation over the twenty-first century relative to their base period of 1981–2000.

Extreme climate events in South Australia (SA), particularly droughts, cause severe economic losses for the state’s agricultural industries and rural populations. In this regard, the future projection of extreme events is important for climate change adaptation purposes. However, to date, there has been very limited research that has attempted to assess the future changes in extreme climate at the regional level in SA using CMIP5 GCMs which include more sophisticated climate models and a new suite of forcing scenarios compared to the former CMIP3 models. In CMIP5, approximately half of the GCMs have an average longitudinal resolution finer than 1.3° whereas in CMIP3 only one model fell into this category (Teng et al. 2012). The CMIP5 models have been found to be more skilful than CMIP3 models in reproducing the Asian-Australian monsoon (AAM) (Wang et al. 2014). Sillmann et al. (2013) concluded that CMIP5 ensembles provide some improvement to CMIP3 ones in the representation of the magnitude of extreme precipitation indices, and this improvement is partly due to the higher spatial resolution of CMIP5 models compared to CMIP3 models. In a recent study, Rashid et al. (2015) observed that the 95th percentile of daily rainfall is likely to decrease over the period 2041–2060 compared to the historical period of 1961–2000 using statistically downscaled daily rainfall from CMIP5 GCM outputs.

GCMs are a widely used tool to project future climate change under different greenhouse gas emission scenarios (Chu et al. 2010; Hu et al. 2012; Huang et al. 2011; King et al. 2012; Sachindra et al. 2014). Due to the coarse resolution of GCM outputs, they are limited in their ability to capture meteorological processes at sufficiently fine resolutions, and consequently, they generally cannot be directly used for local-scale projections. In order to resolve this issue, GCM output datasets are often downscaled to a finer resolution for climate change impact studies. Downscaling methods are broadly divided into two classes, namely dynamic and statistical. Statistical downscaling is more widely used due to its ease of application and reduced cost. The Generalized LInear Modelling of daily CLImate sequence (GLIMCLIM) is a multi-site stochastic downscaling model based on a generalized linear model (GLM) (Chandler 2002), which has been used around the world (Beecham et al. 2014; Frost et al. 2011; Kigobe et al. 2011; Liu et al. 2012; Mehrotra et al. 2009; Mirshahi et al. 2008). Although application of GLMs for multi-site stochastic rainfall simulation is relatively new, successful application of this model in Australia is available in recent studies (Beecham et al. 2014; Frost 2007; Frost et al. 2011). Rashid et al. (2015) and Beecham et al. (2014) found that GLIMCLIM was able to downscale historical and projected daily rainfall from the National Centers for Environmental Prediction/National Center for Atmospheric Research (NCEP/NCAR) reanalysis (termed as NCEP reanalysis hereafter) data and CMIP5 GCM outputs in the Onkaparinga catchment in SA. While the GLIMCLIM model has been successfully applied for downscaling of daily rainfall, few studies have assessed its capability to reproduce extreme rainfall. Previous studies have identified that the GLIMCLIM model performed better than other downscaling models including the non-homogeneous hidden Markov model (NHMM) and the statistical downscaling model (SDSM) in reproducing the temporal dependence related extreme rainfall indices such as annual wet/dry days and wet/dry spell lengths (Hu et al. 2013; Liu et al. 2012).

Two types of descriptions are generally used to analyse rainfall extremes (Klein Tank et al. 2009). One is based on the various climate extreme indices that represent moderate meteorological extremes with re-occurrence periods of year, seasonal or monthly (Alexander et al. 2006; Huang et al. 2012; Hundecha and Bárdossy 2008; Jeong et al. 2012; Tebaldi et al. 2006; Yang et al. 2011). The other approach involves fitting a distribution to the annual extreme rainfall and then to estimate the extreme events with multi-year to multi-decadal recurrence commonly known as frequency analysis of annual maximum daily rainfall (Frei et al. 2006; Hashmi et al. 2011; Kharin et al. 2007; Tryhorn and DeGaetano 2011). This approach is important for engineering design and planning.

The classical frequency analysis technique considering the stationary condition is no longer valid in the context of climate change (López and Francés 2013; Tramblay et al. 2013; Villarini et al. 2009). One way of incorporating non-stationarity in the frequency model might be by making the distribution parameters dependent on large-scale climatic variables such as NCEP reanalysis and/or GCM output datasets. To date, only a few research studies have considered climatic variables rather than time as covariates in a non-stationary frequency model. Over the last decade, some researchers have successfully incorporated climatic variables as external covariates for non-stationary frequency analysis (Aissaoui-Fqayeh et al. 2009; El Adlouni et al. 2007; Kwon et al. 2008; López and Francés 2013; Ouarda and El-Adlouni 2011; Tramblay et al. 2013). The generalized additive model for location, scale and shape (GAMLSS), proposed by Rigby and Stasinopoulos (2005), provides a flexible modelling framework. The dependence of the distribution parameters on the covariates can be represented in terms of linear or non-linear, parametric and/or additive non-parametric functions. GAMLSS has been successfully used by López and Francés (2013) and Villarini et al. (2009) for non-stationary frequency analysis.

The objectives of the research described in this paper are to assess the performance of the GLIMCLIM downscaling model in terms of simulating extreme rainfall using downscaled daily rainfall from NCEP reanalysis and CMIP5 GCM outputs and to estimate changes for the future period 2041–2060 compared to the base period of 1961–2000. In doing so, Frequency Adapted Quantile Mapping (FAQM) was applied to correct the bias in the simulated daily rainfall and a non-stationary frequency analysis was employed to estimate the annual maximum daily rainfall (AMDR) for an exceedance probability of 0.01 (equivalent to a 100-year annual recurrence interval for stationary frequency analysis). This will be termed as the 100-year AMDR hereafter.

2 Study area and data

The study focuses on the Onkaparinga catchment in SA. The catchment is situated approximately 25 km to the southeast of the city of Adelaide. Being a significant source of water supply for metropolitan Adelaide as well as providing water to farm dams and for the natural environment to maintain environmental flows, the water resources of the Onkaparinga catchment are crucial. Moreover, strong spatial variability of rainfall over this small catchment of approximately 560 km2 in area makes it a challenging case study catchment for modelling and downscaling of extreme rainfall (Rashid et al. 2014). Heneker and Cresswell (2010) studied climate change impacts in the Mount Lofty Ranges (MLR) and estimated that there would be a 30 % potential reduction in the annual runoff in the Onkaparinga catchment over the period 2035 to 2065. In addition, Teoh (2003) identified that farm dams divert between 8 and 10 % (4.5 GL) of the catchment’s surface water and that this was forecast to increase to around 7 to 10 GL under the current management policies, as at 2002. In general, approximately 60 % of water for Adelaide is collected from the catchment, with the remaining demand being met by pumping water from the Murray River. Considering this, a key management issue for the Onkaparinga catchment is the potential risk of climate change to Adelaide’s water supply. In order to understand the future changes of extreme rainfall in the catchment, nine rainfall stations were considered as shown in Fig. 1. Details of these rainfall stations are shown in Table 1. These daily rainfall data for the period 1961–2000 were collected from the SILO database of the Queensland Climate Change Centre of Excellence (www.longpaddock.qld.gov.au/silo/).

Fig. 1
figure 1

NCEP reanalysis grid point (2.5° × 2.5°) around the catchment (top). Location of the rainfall stations (bottom)

Table 1 Details of rainfall stations

NCEP reanalysis datasets were collected from the National Oceanic and Atmospheric Administration/Earth System Research Laboratory (NOAA/ESRL) for the period 1961–2000. The model was calibrated and validated using the daily NCEP reanalysis datasets over the periods 1961–1986 and 1987–2000 respectively. Historical daily outputs of CMIP5 GCMs for the period 1961–2000 and future daily outputs for the period 2041–2060 under Representative Concentration Pathway (RCP) 4.5 and RCP 8.5 scenarios were collected from the Earth System Grid data distribution portal (www.earthsystemgrid.org). NCEP reanalysis and GCM output data were extracted from 12 grid points around the Onkaparinga catchment, as shown in Fig. 1. The GCM outputs were linearly interpolated to match with the NCEP reanalysis resolution (2.5° × 2.5°). Four CMIP5 GCMs were considered in this study, namely CSIRO-MK3.6.0 by the Commonwealth Scientific and Industrial Research Organization in collaboration with the Queensland Climate Change Centre of Excellence (termed CSIRO hereafter), GFDL-ESM2M by NOAA Geophysical Fluid Dynamics Laboratory (termed GFDL hereafter), MIROC-ESM by the Japan Agency for Marine-Earth Science and Technology (termed MIROC hereafter), BCC-CSM 1.1 by the Beijing Climate Center and China Meteorological Administration (termed BCC hereafter) and CanESM2 by the Canadian Centre for Climate Modelling and Analysis (termed CAN hereafter).

3 Methodology

3.1 Statistical downscaling model

In this study, daily rainfall was statistically downscaled at nine rainfall stations in the Onkaparinga catchment using the GLIMCLIM downscaling model. Details of this model are available in Chandler (2002), Chandler and Wheater (2002) and Yang et al. (2005). The version of the model used in this study is available at http://www.ucl.ac.uk/~ucakarc/work/software/rain_glm.zip . The application of this model for statistical downscaling of multi-site daily rainfall in SA is reported in Beecham et al. (2014). In GLIMCLIM, the GLM is used to downscale daily rainfall in a two-stage approach. Rainfall occurrence is modelled using logistic regression and then a gamma distribution is used to model the rainfall amount for any wet days. The logistic regression model can be defined as follows (Chandler and Wheater 2002; Yang et al. 2005):

$$ \ln \left(\frac{p_i}{1-{p}_i}\right)={X}_i\beta $$
(1)

where p i is the probability of rain for any day i associated with predictor vector X i with coefficient vector β. The mean rainfall μ i for the ith wet day is conditional on a covariate vector ξ i and coefficient vector α that can be estimated as follows:

$$ \ln \left({\mu}_i\right)={\xi}_i\alpha $$
(2)

Calibration and validation of the model were performed using predictor variables from NCEP reanalysis datasets. The model was calibrated for the period 1961–1986 and validated over the period 1987–2000. Nineteen large-scale atmospheric and circulation variables from NCEP reanalysis datasets were selected as probable predictor variables based on the previous literature which were subsequently reduced to ten potential variables based on their correlation with observed rainfall. Each predictor variable was extracted from the 12 reanalysis grid points (2.5° × 2.5°) around the study area, and the average of these 12 values gave a single series for each predictor variable used in the downscaling model. Details of the predictor selection process are reported in Beecham et al. (2014). The final selected predictor variables were air temperature at 700 hPa; geopotential height at 700 and 800 hPa; relative humidity at 500, 700 and 850 hPa; zonal wind component at 700 and 850 hPa and meridional wind component at 500 and 700 hPa. The historical (1961–2000) and future (2041–2060) model outputs of four GCMs (CSIRO, GFDL, BCC and CAN) corresponding to the potential predictors were standardized with the means and standard deviations of the NCEP reanalysis outputs of the calibration period. These historical and future model outputs of each GCM were then used in the downscaling model to simulate the past historical rainfall and to project the future rainfall respectively at each of the nine stations in the catchment.

3.2 Bias correction

Rainfall downscaled from NCEP reanalysis and various GCM output datasets were bias corrected using the Frequency Adapted Quantile (FAQM) technique for the calibration (1961–1986), validation (1987–2000) and projection (2041–2060) periods. A detailed description of the FAQM bias-correction technique is found in the studies by Themeßl et al. (2012) and Rashid et al. (2015). In frequency adaptation, when the number of dry days of the downscaled rainfall is less than that of the observed rainfall, all the modelled rainfall that has a probability equal to or less than the probability of the observed zero rainfall is transferred to zero. On the other hand when the number of dry days of the downscaled rainfall is greater than that of the observed rainfall, a fraction of the simulated dry days is converted to wet days using randomly generated values following the gamma distribution of the downscaled rainfall. Once the frequency of the downscaled rainfall was corrected, quantile mapping (QM) was applied to correct the downscaled rainfall using a parameter free empirical distribution. In QM, the correction factor was estimated for each empirical cumulative distribution function (ecdf) of the downscaled rainfall by differencing the inverse cumulative distribution function (ecdf −1) of downscaled and observed rainfall for the calibration period. Then the downscaled rainfall for the validation and future periods was corrected by adding the correction factor to the uncorrected rainfall. If the downscaled rainfall in the validation or projection periods was outside the range of the rainfall in the calibration period, the new extreme rainfall values were corrected by constant extrapolation of the correction factor neglecting the four highest and lowest quantiles of the calibration period considering the assumption that the tail of the correction factor was likely to be noisy. The FAQM bias correction was applied to each downscaled rainfall series (downscaled from NCEP reanalysis and GCM datasets) considering the observed station rainfall as the base. In the case of downscaled rainfall from NCEP reanalysis datasets, correction factors for each quantile were estimated based on the observed rainfall for the calibration period. These correction factors were applied to correct rainfall for the calibration and validation periods. In the same way, correction factors were also estimated for the rainfall downscaled from GCM output datasets and used to correct rainfall for the historical (calibration and validation) and future projection periods.

3.3 Extreme rainfall indices

Extreme rainfall indices (listed in Table 2) were estimated from the observed and downscaled daily rainfall forced by NCEP reanalysis and GCM output datasets. The downscaling model performance was assessed for its ability to reproduce the extreme rainfall indices for the calibration and validation periods in terms of the coefficient of determination (R 2), coefficient of efficiency (CE), mean bias (MB) and Nash-Sutcliffe efficiency (NS). Finally, the percentage change of the extreme rainfall indices was estimated for the future period (2041–2060) compared to the base period (1961–2000). The extreme rainfall indices considered in this study are listed in Table 2. These indices have also been used in different previous studies (Evans et al. 2009; Hashmi et al. 2011; Huang et al. 2012; Tryhorn and DeGaetano 2011; Yang et al. 2011).

Table 2 Extreme rainfall indices

3.4 Frequency analysis

Annual maximum daily rainfall (AMDR) series obtained from observed and downscaled daily rainfall (historical and projected) was used to estimate the 100-year AMDR. For this purpose, both stationary and non-stationary models were used. In the stationary model, the fitted distribution parameters are constant over the time. On the other hand, in the non-stationary model, the distribution parameters are considered as a function of certain explanatory variables so that parameters vary over time. GAMLSS (Rigby and Stasinopoulos 2005) has a flexible modelling framework which is suitable for non-stationary modelling of AMDR. In GAMLSS, the random response variable (AMDR for this study) has a parametric distribution function whose parameters can be modelled as a function of selected covariates (NCEP reanalysis and GCM output datasets in this study). Air temperature at 700 hPa; geopotential height at 700 and 800 hPa; relative humidity at 500, 700 and 850 hPa; zonal wind component at 700 and 850 hPa; and meridional wind component at 500 and 700 hPa were considered as the external covariates. The winter (June–August) means of these variables were extracted from the 12 grid points around the study area and spatially averaged which gives a single series for each variable used as a covariate for fitting a GAMLSS. The rationale of considering the winter average is that the most of the rainfall occurs over this period in this study area and in general the maximum correlation between observed AMDR and selected covariates occurred in winter.

A GAMLSS model assumes that for any observations for i = 1, 2, 3,... n independent observations, Y i has a cumulative distribution function F y (Y i i ), where θ i  = θ i1, θ i2, θ i3,...... θ ip is a vector of p distribution parameters accounting for location, scale and shape of the distribution. The number of parameters p generally varies from one to four. The model parameters are related to the covariates by a monotonic link function g k (.) where k = 1, 2, 3.... p and the parameters are modelled through link functions. In this study, identity and logarithm link functions were considered. Details of GAMLSS are available in Stasinopoulos and Rigby (2007) and Rigby and Stasinopoulos (2005). While GAMLSS have several different possible models, a semi-parametric additive model formulation was used in this study as follows:

$$ {g}_k\left({\theta}_k\right)\kern0.5em =\kern0.5em {\eta}_k\kern0.5em =\kern0.5em {X}_k{\beta}_k\kern0.5em +\kern0.5em {\displaystyle \sum_{j=1}^{j_k}{h}_{jk}\;\Big({x}_{jk}}\Big) $$
(3)

where θ k and η k are vectors of length n, X k is a matrix of covariates of order n × j k , \( {\beta}_k\;\left({\beta}_{1k},......,{\beta}_{j_kk}\right) \) is a parameter vector of length j k and h jk (.) represents the dependence function of the distribution parameters on covariates. The dependence could be linear, or a smoothing term can be included to allow more flexibility for modelling the dependence of the distribution parameters on the covariates. Four distribution functions were considered in this study, namely the Gumbel, Lognormal, Weibull and Gamma distributions, as shown in Table 3. A stepwise model fitting approach proposed by Rigby and Stasinopoulos (2005) was followed to identify the significant covariates and distribution functions. In addition to the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), model efficiency statistics such as the coefficient of determination (R 2), the root mean square error (RMSE) and the Nash-Sutcliffe (NS) efficiency were also assessed at each step of the model fitting. In addition, the normality and independence of the residuals were assessed by examining the first four moments of the residuals, by the Filliben correlation coefficient (Filliben 1975) and by visual inspection of a diagnostic plot of the residuals. Once the model was fitted, the distribution parameters were estimated for each data point and the 100-year AMDR were estimated based on the time-varying estimates of the model parameters θ 1 and θ 2 for the selected stations.

Table 3 Probability density functions and their corresponding link functions

In order to assess the magnitude of the 100-year AMDR during the historical period (1961–2000) and the changes over the future period (2041–2060), four rainfall stations (G1, G3, G7 and G9) were considered for this study. These stations were selected to represent the upper, middle and downstream (near the coast) regions of the Onkaparinga catchment (Fig. 1).

4 Results

4.1 Calibration and validation of the model

The model was able to reasonably reproduce the monthly statistics and variability of the extreme rainfall indices for both the calibration (1961–1986) and validation (1987–2000) periods. Figure 2 shows the observed and median simulated extreme rainfall indices for different months of the year for the validation period. Simulations were driven by the NCEP reanalysis and GCM historical datasets. In general, the simulation driven by the NCEP reanalysis datasets showed better agreement with the observed extreme rainfall indices than the simulation driven by GCM historical datasets. This is due to the bias between the NCEP reanalysis and GCM datasets. The FAQM correction technique that was applied significantly reduced the bias in extreme rainfall indices. However, the maximum 5-day rainfall total was still underestimated by all simulations during the months from June to October. This might be due to an underestimation of rainfall amounts and consecutive wet days during this period of the year. The maximum consecutive dry days (CDD) was also underestimated for the months January to May for all simulations.

Fig. 2
figure 2

Comparison of extreme rainfall indices from observed and median simulated rainfall driven by NCEP reanalysis and GCM historical datasets for the validation period (1987 to 2000)

Table 4 shows the performance after bias correction of the statistical downscaling model driven by NCEP reanalysis data and different GCMs in terms of the coefficient of determination (R 2), coefficient of efficiency (CE), mean bias (MB) and Nash-Sutcliffe efficiency (NS) for the validation period. The efficiency statistics were found to be reasonable for all models, and in general, the model driven by NCEP reanalysis data showed better performance than the model driven by GCM historical outputs. The values shown in italic text in Table 4 are the NS statistics before bias correction. The results show that the FAQM technique was able to reasonably correct the bias for extreme rainfall indices such as RDM, RDL90, RQ90 and R5D. However, in contrast, the model efficiency for CDD deteriorated after bias correction. For the FAQM bias correction technique, when the number of dry days of the downscaled rainfall was greater than that of the observed rainfall, a certain fraction of the simulated dry days was converted to wet days by randomly replacing the values obtained from randomly generated values following the gamma distribution of the modelled rainfall. This is likely to break the long dry spells which eventually reduces the maximum consecutive dry days (CDD). Moreover, quantile mapping (QM) can correct the bias in rainfall quantity but it cannot correct the sequence of dry and wet days. While uncorrected downscaled daily rainfall showed a good degree of efficiency in terms of reproducing CDD statistics, as listed in Table 4, downscaled daily rainfall series without correction were used instead to estimate the future changes of CDD. However, it is expected that developing a bias correction technique based on the transitional probability of dry and/or wet days and applying this technique to simulated CDD series would reduce the bias in CDD.

Table 4 Downscaling model efficiency for the validation period after bias correction

4.2 Future changes in extreme rainfall indices

Changes in the extreme rainfall indices for different GCMs under RCP4.5 and RCP8.5 scenarios over the period 2041–2060 compared to the base period 1961–2000 are shown in Fig. 3 (RDM, RDL90 and RQ90) and Fig. 4 (R5D and CDD). Future projected changes are shown for each month of the year. Although a few model-month combinations showed that RDM, RDL90 and R5D would increase and CDD would decrease, most of the model-month combinations showed that RDM, RDL90, RQ90 and R5D would decrease whereas CDD would increase for both the RCP4.5 and RCP8.5 scenarios. Future changes of RDM, RDL90, RQ90, R5D and CDD range from 12.65 to −40.19 %, 23.27 to −50 %, 11.25 to −47.53 %, 21.76 to −37.26 % and 34.68 to −12.96 %, respectively, for the RCP4.5 scenario and 16.9 to −45.66 %, 22.12 to 67.3 %, 19.16 to 51.69 %, 22.81 to −40.32 % and 30.23 to −7.31 %, respectively, for the RCP8.5 scenario. The highest reduction in extreme indices is seen to occur during September to January whereas the lowest reduction occurs during the period May to August. This tendency is more prominent under the RCP8.5 than RCP4.5 scenario. This reveals that there is a possibility of higher reductions in rainfall amount-related indices (RDM, RDL90, RQ90 and R5D) in the dry season compared to the wet season, which indicates that a regional drought will be intensified in the future (2041–2060) compared to the historical period (1961–2000). Rashid et al. (2015) also observed significant reductions in annual and seasonal rainfall for the future period 2041–2060 in the Onkaparinga catchment.

Fig. 3
figure 3

Changes in extreme rainfall indices (RDM, RDL90 and RQ90) for different GCMs under scenarios RCP4.5 (left column) and RCP8.5 (right column) for the future period (2041 to 2060) compared to the base period (1961 to 2000)

Fig. 4
figure 4

Same as Fig. 2 but for R5D and CDD

An increase in the maximum number of consecutive dry days (CDD) may also intensify the droughts that are periodically experienced in the Onkaparinga catchment. These changes of daily rainfall occurrence sequences and reduction in the rainfall amount will reduce the runoff potential, which will eventually affect regional water availability. Obviously, the magnitude of changes in the extreme rainfall indices varies with the models and scenarios due to differences in the model settings and scenarios. However, in general, the direction of changes was found to be consistent for all models and scenarios.

4.3 AMDR

The downscaling model was able to reasonably reproduce the AMDR when forced by NCEP reanalysis and GCM historical datasets. Figure 5 shows a comparison between the observed and simulated AMDR when forced by NCEP reanalysis datasets at nine rainfall stations for the calibration (1961–1986) and validation (1987–2000) periods. The 2.5th and 97.5th percentiles of the AMDR series were generated from 1000 sets of simulations and are presented as dotted lines. The thick black line represents the median simulated AMDR. For most of the year, the AMDR were within the simulation envelop for all stations. Similar model efficiency was also observed for the simulations driven by the GCM historical datasets including CSIRO, GFDL, BCC and CAN after bias correction. However, the model was limited in its ability to reproduce the inter-annual variability of the AMDR for all simulations forced by the NCEP reanalysis and GCM historical datasets at all stations. AMDR is the maximum daily rainfall value over a year. For a daily rainfall model, it is difficult to accurately reproduce such extreme values for each year over the modelling period. Moreover, the gamma distribution used in the downscaling model has limited ability to capture the extreme rainfall (Rashid et al. 2014). In addition, downscaling of AMDR is challenging due to its high spatial and temporal variability and non-linear nature. Direct downscaling of AMDR from reanalysis and GCM output datasets by considering AMDR as a predictand might resolve this problem.

Fig. 5
figure 5

Observed and NCEP simulated annual maximum daily rainfall at different rainfall stations for the calibration (1961 to 1986) and validation (1987 to 2000) periods

4.4 Frequency analysis of AMDR

Figure 6 shows the results of the non-stationary modelling of the observed rainfall and 100-year AMDR magnitude for the representative sites. The non-stationary model was developed by fitting a GAMLSS to the observed AMDR considering the NCEP reanalysis variables as external covariates. In the same way for the median simulated AMDR driven by different GCMs, historical and projected (RCP4.5 and RCP8.5) GCM output datasets were considered as external covariates. Several of the selected external covariates were found to vary between the rainfall stations. It was found that the gamma (for station G1) and lognormal distributions (for stations G3, G7 and G9) gave the best overall results for the observed AMDR. The non-linear dependence of external covariates using cubic splines was found to be significant for the distribution parameter θ 1 for all stations. To reduce overfitting of the model, the distribution parameter θ 2 was considered constant. The model efficiency statistics such as the coefficient of determination (R 2), root mean square error (RMSE) and Nash-Sutcliffe (NS) were found to be reasonable for each station. Figure 6 shows the magnitude of the 100-year AMDR under both stationary and non-stationary conditions. In the non-stationary frequency analysis, the distribution parameters were considered to be varied over time whereas in the stationary modelling they were considered constant. It was observed that the non-stationary model showed significant variability in the AMDR frequency. The results show that the stationary model could lead to significant underestimation or overestimation of the rainfall compared to the non-stationary model. For example, at site G9, the non-stationary frequency analysis indicated that the 100-year AMDR during the 40 years (1961–2000) of the observation period varied from a maximum of 142 mm/day in 1969 to a minimum of 28.5 mm/day as shown in Fig. 6, whereas the 100-year AMDR was 83.52 mm/day for the stationary frequency analysis. Similar results were also observed for other stations.

Fig. 6
figure 6

Non-stationary modelled AMDR (blue line) and quantile estimates of AMDR with 0.01 annual exceedance probability for the period 1961 to 2000 based on stationary (dotted black line) and non-stationary (red line) models. The small black circles represent the observed AMDR

The non-stationary frequency analysis of AMDR obtained from the downscaled simulations driven by different GCM datasets shows that the 100-year AMDR (annual exceedance probability of 0.01) varies significantly (decreases and increases) over the periods 1960–2000 and 2041–2060. So in the non-stationary frequency analysis, the 100-year AMDR includes maximum and minimum values over a historical or future period. Table 5 lists the change in the maximum and minimum values of the 100-year AMDR for the future period (2041–2060) compared to the base period (1961–2000). It was observed that the degree of change varies with the selected models and scenarios. In general, maximum and minimum values of the 100-year AMDR are likely to decrease over the period 2041–2060 for all models and scenarios. An exception was observed for the downscaled simulation forced by the CSIRO GCM, which showed that the maximum of the 100-year AMDR would increase during the period 2041–2060 for the RCP8.5 scenario.

Table 5 Change in maximum and minimum values of 100-year AMDR for different GCMs and scenarios for the future period (2041–2060) compared to the base period (1960–2000)

5 Discussion

Even though the magnitude of changes in extreme rainfall varies for the future period, the direction of changes is consistent for all CMIP GCM models and scenarios. This gives some confidence in the projected changes presented in this study, although uncertainties still need to be considered. The future changes of extreme rainfall indices and annual maximum daily rainfall projected in this study may not account for the full range of climate change impacts because these are based on a small number of GCMs and scenarios due to data availability. Nevertheless, the methodology developed in this study can be applied to assess the future changes of extreme rainfall for other GCM scenarios through downscaling rainfall from those GCMs and scenarios.

Extreme climate conditions depend on large-scale processes that modify the stability of the atmosphere by exchanging mass and energy from the oceans to the atmosphere and from the equator to the poles. The specific influence of large-scale processes on South Australian extreme rainfall is not well understood. Evans et al. (2009) concluded that extreme rainfall over SA is linked to the large-scale rearrangement of circulation patterns. Their study reveals that meridional sea surface temperature (SST) in the eastern Indian Ocean (EIO) influences extreme rainfall events over SA. During dry winters, pole-ward shifts of the westerlies to the south of Australia and increased offshore flow over the northwest of Australia reduce advection of moist warm air into SA from the northwest and moist cold air along the southern coastline. Thus, the interaction of these air masses is reduced which eventually decreases rainfall in SA (Evans et al. 2009). In addition to large-scale processes, mesoscale weather system is also responsible to modulate the local-scale extreme rainfall. So, extreme rainfall is location specific. While extreme rainfall is likely to increase over the southern and southwestern flatlands (east) of Australia (CSIRO and BOM 2015; IPCC 2012), our study shows that amount-related extreme rainfall indices and AMDR are likely to decrease in future over the Onkaparinga catchment. It should be noted that the GCMs are unable to capture the mesoscale atmospheric features such as geography that modifies the atmospheric flow, small horizontal shifts in storm movement, land use changes and clouds which essentially influence the local-scale extreme rainfall (Tisseuil et al. 2010; Willems et al. 2012). Moreover, rainfall events averaged over a large spatial scale are not able to represent the extreme rainfall events that occur at a far smaller spatial scale. This reveals the necessity of catchment scale assessment of extreme rainfall in a changed climate.

In addition to natural variations in the climatic system, anthropogenic-induced climate change due to greenhouse gas emissions changes the hydrological cycle creating a situation whereby stationarity in hydro-climatic variables such as rainfall is no longer valid. This study has shown that using large-scale climatic variables as covariates in a non-stationary model of AMDR can produce reliable scenarios of future extreme rainfall.

6 Conclusion

In this study, the performance of the GLIMCLIM statistical downscaling model was first assessed in terms of its ability to simulate extreme rainfall indices and annual maximum daily rainfall (AMDR) when daily rainfall was downscaled from NCEP reanalysis and CMIP5 GCM output datasets. Then the changes in extreme rainfall indices and 100-year AMDR were estimated using four different CMIP5 GCMs (CSIRO, GFDL, BCC and CAN) under RCP4.5 and RCP8.5 scenarios for the future period of 2041–2060 compared to the base period of 1961–2000. The downscaling model was able to reasonably reproduce the monthly statistics and variability of the extreme rainfall indices for both the calibration (1961–1986) and validation (1987–2000) periods when driven by the NCEP reanalysis datasets. However, the model showed less skill in reproducing the extreme rainfall indices when forced by the CMIP5 GCM historical output datasets.

Frequency Adapted Quantile Mapping was applied to the simulated daily rainfall series, and this significantly reduced the bias in the extreme rainfall indices. However, the model efficiency for consecutive dry days (CDD) deteriorated after bias correction. This might be due to the fact that during the frequency adaptation, imputation of random values to turn the dry days to wet days might break the dry spell and reduce the maximum consecutive dry days. The downscaling model was found to be able to reasonably reproduce the AMDR for both the calibration and validation periods. For almost all years, the observed AMDR were within the 2.5th and 97.5th percentiles of the simulated rainfall. However, inter-annual variability of AMDR was underestimated in all models.

This study has shown that extreme indices related to rainfall amount are likely to decrease for all models and scenarios. This reduction will be higher during the dry season compared to the wet season, which points toward a regional water scarcity situation in the future (2041–2060), when the dry season will be drier compared to the present time (1961–2000). In addition, the maximum number of consecutive dry days (CDD) will increase in future and this will intensify future droughts in the Onkaparinga catchment.

It was also found in this study that the GAMLSS provides a flexible modelling framework for representing the non-stationary behaviour of AMDR. In particular, the gamma and lognormal distributions gave the best fit for AMDR. Non-linear dependence of external covariates (NCEP reanalysis and CMIP5 GCM output datasets) was found to be significant for selected distribution parameters. In general, future projections show that both the minimum and maximum values of the 100-year AMDR are likely to decrease over the future period (2041–2060) compared to the base period (1961–2000) for all models and scenarios at all selected rainfall stations. However, an exception was observed for the CSIRO GCM, which projected that the maximum 100-year AMDR will increase for all selected sites under the RCP8.5 scenario.

Overall, the downscaling model was able to reasonably simulate the extreme rainfall indices and AMDR in the Onkaparinga catchment. Generally, the direction of changes was consistent for all models and scenarios. This indicates that the Onkaparinga catchment will experience drier conditions due to an increase in consecutive dry days coinciding with decreases in heavy rainfall days, empirical 90th quantile of rainfall and maximum 5-day consecutive total rainfall for the future period (2041–2060) compared to the base period (1961–2000).