1 Introduction

Regional climate models (RCMs) are increasingly relied upon to assess the potential impacts of climate change for sectors and regions across the globe (Hewitson et al. 2014). Such models have finer resolutions than the global climate models (GCMs) in which they are nested, and so provide better representations of, for example, topography and atmospheric movements and the effects these have on rainfall patterns and intensities. Such improvements over GCMs are particularly important if the data are being used to drive hydrological models, to assess the potential impacts of climate change on river flows, and are even more important if the focus is hydrological extremes.

Previous studies driving hydrological models directly with data from RCMs over Europe have used data on grids with resolutions of ~12–50 km (e.g., Graham et al. 2007; Dankers and Feyen 2008; Bell et al. 2012; Kay and Jones 2012a, b). This generally required some form of further downscaling, particularly for precipitation data, before use for hydrological modelling, with the downscaling sometimes being combined with bias-correction. For example, Prudhomme et al. (2012) used quantile-mapping to downscale and bias-correct 25 km RCM precipitation to 1 km. However, such adjustments have to be applied with care. Several recent papers have questioned the supposed improvements for hydrological modelling brought by bias-correction (Addor and Seibert 2014; Huang et al. 2014; Ehret et al. 2012). Also, biases in different variables can compensate for each other when combined (e.g., Fischer and Knutti 2013), so separate bias correction of different variables could introduce inconsistencies; this could be important for the potential evaporation estimates required by hydrological models (Kay et al. 2013).

Recently, developments in technology and computing power have enabled further improvements in spatial resolution of climate models, including a further level of nesting of models. Specifically, a recent project (CONVEX) has involved nesting a 1.5 km RCM in a 12 km RCM, which is in turn either driven by European-reanalysis (ERA-Interim) boundary conditions (Kendon et al. 2012) or nested in the HadGEM3 GCM (Kendon et al. 2014). The ERA-driven runs cover the period 1989–2008, while two pairs of GCM-driven runs are available, one approximating the current climate (1996–2009) and the second a potential future climate of the end of the 21st century assuming RCP8.5 anthropogenic forcings (Riahi et al. 2011). The 12 km RCM covers Europe, while the 1.5 km RCM only covers southern Britain. The 1.5 km resolution is sufficiently fine so as to be ‘convection-permitting’; such models do not require schemes parameterising convective storms, whereas lower resolution models rely on such schemes, which can be a significant source of uncertainty in modelled precipitation, especially in summer (Kendon et al. 2012).

Analyses of rainfall from the CONVEX ERA-driven runs show that the 1.5 km RCM generates sub-daily rainfall that is more realistic in terms of duration and spatial extent than that generated by the 12 km RCM, although heavy rain can be too intense (Kendon et al. 2012). However, analyses of daily rainfall did not show clear evidence of improvements with increasing resolution (Chan et al. 2013). Similar results were found for a 2.2 km convection-permitting RCM covering the Alps, nested in a 12 km ERA-driven RCM (Ban et al. 2014). Analyses of hourly rainfall projections from the CONVEX GCM-driven runs show increases in the intensity of heavy summer precipitation in the 1.5 km RCM that are not seen in the 12 km RCM, with increases in heavy winter precipitation similar in the 1.5 and 12 km RCMs (Kendon et al. 2014). The 1.5 km RCM shows uniform increases in intensity across a range of return periods, while the 12 km RCM shows decreases at shorter return periods and increases at longer return periods (Chan et al. 2014). There is little confidence in the 12 km RCM projections at long return periods though, as they are linked to unphysical grid-point storms.

Catchments have different characteristic time-scales of response to precipitation inputs, with key influencing factors being catchment size, slope, urbanisation and soil/bedrock permeability (Kjeldsen 2007). This means that, although sub-daily rainfall distribution is important for flood-generation in very small or responsive catchments, rainfall accumulations at daily and longer durations, and antecedent soil moisture, are generally more important in moderately sized or large catchments (e.g., Smith et al. 2014; Addor and Seibert 2014), particularly those with a significant component of their flow derived from groundwater sources (Chiverton et al. 2015). Thus the benefits of very high resolution RCM data for hydrological modelling may depend on the characteristics of the catchment being modelled. Similarly, the implications of differing future rainfall projections between RCM resolutions could vary by catchment. However, winter is the main fluvial flood season in Britain, especially southern Britain (Bayliss and Jones 1993), so summer rainfall differences are likely to be less important.

Using the CONVEX data, the aims of this paper are to

  • Use data from the Baseline ERA-driven runs to test the performance of very high resolution climate model data for hydrological modelling of river flows.

  • Use data from the Current and Future GCM-driven runs to investigate the effect of RCM resolution on projections of change in peak river flows.

Using a national gridded hydrological model, the first analysis looks at performance in 32 gauged catchments across southern Britain, compared to both gauged flows and use of observed input data to drive the hydrological model. The second analysis uses the same national gridded hydrological model and presents spatial maps of flood changes, both annually and seasonally.

2 Methods and data

2.1 Hydrological model

A national gridded hydrological model is applied; CLASSIC-GB (Crooks et al. 2014). This model was recently developed from a semi-distributed catchment-based model CLASSIC (Climate and LAnd-use Scenario Simulation In Catchments; Crooks and Naden 2007), which has been used extensively for modelling the potential impacts of climate change on river flows and flooding in relatively large catchments across Great Britain (e.g., Prudhomme et al. 2013a, b; Kay and Crooks 2014). The development of CLASSIC from a catchment-based model to a national model took advantage of the fact that the runoff-production parameters in CLASSIC are derived from gridded datasets of soils, landcover and topography. CLASSIC-GB can currently be run at a range of spatial resolutions, from 1 km up to 10 km, aligned with the GB National Grid. The maximum time-step of the routing component has to be sufficiently small, relative to the spatial resolution, for stability of the kinematic wave routing scheme, but the main model time-step can be a multiple of the routing time-step. Here, to make best use of the RCM data, CLASSIC-GB is run with a 1 km spatial resolution and a 1-h main time-step (20-min routing time-step).

CLASSIC-GB requires gridded time-series of precipitation and potential evaporation (PE). If the snow module is implemented then gridded temperature time-series are also required, but this module is not used here as the focus on southern Britain means that snow is not a major influence on flows. A run of CLASSIC-GB is performed using observed precipitation and PE, to compare against the runs driven by RCM data. This observation-driven run uses 1 km daily total precipitation from CEH Gridded Estimates of Areal Rainfall (CEH-GEAR; Keller et al. 2015) and 40 km monthly total PE from the Met Office Rainfall and Evaporation Calculation System (MORECS; Hough et al. 1996). The data are temporally downscaled by dividing by the number of hydrological model time-steps in a data time-step, and the PE data are spatially downscaled by copying data from the 40 km grid down to the 1 km hydrological model grid (as Bell et al. (2012)).

2.2 Regional climate model data

Table 1 summarises the seven hydrological model runs performed, one using observed driving data and six using RCM data, and provides the precipitation and PE data sources and time-periods. The 1.5 km RCM is a modified version of the UK Met Office weather forecast model (UKV), and the 12 km RCM is a limited-area version of HadGEM3 (Moufouma-Okia and Jones 2015). Each 1.5 km RCM run is nested in the equivalent 12 km RCM run; the 1.5 km RCM covers southern Britain (Fig. 1) while the 12 km RCM covers Europe. The boundary conditions for the 12 km RCM runs are provided by either ERA-Interim reanalyses (ERA-driven) or an equivalent (Current or Future) GCM run (GCM-driven). Further RCM details are provided by Kendon et al. (2012, 2014).

Table 1 Summary of the seven hydrological model runs, with precipitation and PE data sources including the Met Office’s RCM run names (5 characters starting with ‘a’)
Fig. 1
figure 1

Map showing the locations and outlet points of the 32 example catchments in southern Britain (Table 2) and the region covered by the 1.5 km RCM (dotted rectangle)

Each RCM run provides hourly precipitation data. To drive CLASSIC-GB, the precipitation is converted from the RCM grid to the 1 km hydrological model grid. Conversion for the 1.5 km RCM uses area-weighting, while for the 12 km RCM the data are disaggregated to the 1 km hydrological model grid by assigning fractions of the 12 km grid-box rainfall amount to 1 km grid-boxes according to the ratios in observed average annual rainfall patterns (Bell et al. 2007). The latter scaling allows a non-uniform spatial distribution of rainfall in each 12 km RCM grid-box, based on the observed rainfall distribution (e.g., allowing orographic enhancement); it does not act as a bias-correction. Note that the RCMs have a rotated lat-long grid so do not align with the GB national grid used by CLASSIC-GB.

The RCM runs do not provide the PE data required to drive CLASSIC-GB, so monthly PE is estimated from other meteorological variables output by the RCMs, using the Penman-Monteith formula (Monteith 1965). A comparison of the ERA-driven RCM PE with MORECS PE (Rudd and Kay 2015) shows good correspondence, and also shows that the PE from the 1.5 and 12 km RCMs is very similar. Here, the PE from the 12 km RCM runs is also used for the equivalent 1.5 km RCM runs (Table 1), to simplify the application and ensure that any differences in flow results are due only to differences in precipitation inputs. The 12 km RCM PE is temporally and spatially downscaled as for MORECS PE (Section 2.1).

2.3 Analysis methods

2.3.1 Use of ERA-driven RCM data

Data from the 12 and 1.5 km ERA-driven RCM runs are used to drive CLASSIC-GB (Apr 1989–Nov 2008; Table 1). The resulting daily mean flows are output for points on the 1 km grid corresponding to 32 gauged catchments (Table 2 and Fig. 1). These catchments are the southern-GB subset of the 54 catchments used to assess CLASSIC-GB (Crooks et al. 2014), which were selected to cover a range of climatic and soil properties and catchment areas. The RCM-simulated flows are compared to both flows simulated using observed input data to drive CLASSIC-GB, and gauged (observed) flows for these catchments (from the National River Flow Archive, www.ceh.ac.uk/data/nrfa/). The comparison only looks at flows for Jan 1990–Nov 2008, thus allowing a spin-up period from Apr 1989 (which overlaps with the climate model spin-up, Apr–Nov 1989). As the development of weather features in the ERA-driven RCMs is not expected to directly follow the observed weather over the period, the comparison of resulting flows is done mainly in terms of flow duration and flood frequency curves, thus comparing the statistical characteristics of the flows rather than day-to-day equivalence.

Table 2 Details of the 32 example catchments

The flow duration curve is calculated from the flow time-series for a catchment by ranking the flows in descending order and selecting the flow Q p corresponding to each percentile point p between 1 and 100; Q p is thus the flow equalled or exceeded p% of the time. Several performance measures (adapted from Yilmaz et al. (2008)) are used to quantify different aspects of the fit of simulated and observed flow duration curves (Q p and O p respectively); percentage bias in low flow volume (lfv), median flow (mdf), and high flow volume (hfv). These are calculated as

$$ lfv=100\frac{{\displaystyle {\sum}_{p=70}^{95}}\left[f\left({Q}_p\right)-f\left({O}_p\right)\right]}{{\displaystyle {\sum}_{p=70}^{95}}f\left({O}_p\right)} $$
(1)
$$ mdf=100\frac{f\left({Q}_{50}\right)-f\left({O}_{50}\right)}{f\left({O}_{50}\right)} $$
(2)
$$ hfv=100\frac{{\displaystyle {\sum}_{p=1}^2}\left[f\left({Q}_p\right)-f\left({O}_p\right)\right]}{{\displaystyle {\sum}_{p=1}^2}f\left({O}_p\right)} $$
(3)

where the function f() is taken as sqrt(), so as not to put too much emphasis on bias in very high flow values. The low flow volume bias lfv only goes up to the 95th percentile flow (from the 70th) so as not to include very low flow values, which can be more severely affected by abstractions, effluent returns etc.

The flood frequency curve is calculated by first extracting annual maxima (AM) from the flow time-series for a catchment (for water years, 1st Oct–30th Sep). A generalised logistic (GLO) distribution is then fitted to the AM using the method of L-moments (Robson and Reed 1999). The three parameters of the fitted GLO (location ξ, scale α, shape k) can then be used to calculate flows Q T corresponding to specific return periods T using Q T = ξ + (α/k)(1-(T-1)-k), to produce a flood frequency curve. The fit of simulated and observed flood frequency curves (Q T and O T respectively) is averaged over 2, 5 and 10-year return periods, using

$$ ffr=\frac{100}{3}{\displaystyle {\sum}_{T\in \left\{2,5,10\right\}}}\frac{f\left({Q}_T\right)-f\left({O}_T\right)}{f\left({O}_T\right)} $$
(4)

where f() is as above. Higher return periods are not included in ffr due to the limited period of ERA-driven RCM data available (about 19 years; Table 1). For all four measures of fit, values closer to zero indicate better performance.

2.3.2 Use of GCM-driven RCM data

Data from the 12 and 1.5 km GCM-driven Current and Future RCM runs are used to drive CLASSIC-GB (Table 1). For these runs, instead of outputting flow time-series corresponding to specific gauged catchments, as for the ERA-driven runs, the outputs are 1 km grids of AM (derived from gridded hourly mean flows). These are used to calculate grids of the flows Q T corresponding to specific return periods T (via the fitting of GLOs as in Section 2.3.1) for T = 2, 5 and 10 years (as for ffr, higher return periods are not used due to the limited data period of about 13 years; Table 1). Percentage changes are then calculated between the return period flows for corresponding Current and Future runs.

As the 1.5 and 12 km RCMs show greater precipitation differences in summer than winter, the same analyses are done for seasonal maxima as for AM, for the four standard seasons; winter (Dec–Feb), spring (Mar–May), summer (Jun–Aug) and autumn (Sep–Nov).

3 Results

3.1 Performance using ERA-driven RCM runs

Figure 2 shows a comparison of gauged flows with flows simulated using observed data and 12 and 1.5 km ERA-driven RCM data, for two catchments; 39081 (Ock at Abingdon) and 55023 (Wye at Redbrook) (Table 2). The comparison is presented as flow time-series (for 2006–2007), and as flow duration and flood frequency curves (calculated from flow time-series for 1990–2008). Flows for the Ock are generally too high when simulated using data from either ERA-driven RCM run, leading to flow duration curves and flood frequency curves that are over-estimated compared to those from gauged flows or flows simulated using observed data. By contrast, flows for the Wye simulated using RCM data are a much better match to gauged flows, especially for the flow duration curve. In both catchments, the curves simulated using data from the 1.5 km RCM are higher than those using data from the 12 km RCM.

Fig. 2
figure 2

Gauged flow time-series, flow duration curves and flood frequency curves (grey) compared to those simulated using observed data (red), 12 km RCM data (green) and 1.5 km RCM data (blue) for two of the example catchments; 39081 (top) and 55023 (bottom) (Table 2)

Flow time-series (Fig. 2) also show that the high flows gauged in July 2007 (when much of southern Britain experienced severe flooding) are not captured by the ERA-driven RCM data for either catchment. However, for the Ock the 1.5 km RCM data generate a peak flow of similar size to that gauged in July 2007, but occurring about a month earlier. The 12 km RCM produces no such peak in the year 2007. Chan et al. (2013) comment on the failure of the RCM runs to capture the high precipitation totals of summer 2007, despite being driven by re-analysis boundary conditions. They state that, since re-analysis data are only used at the edge of the European domain, an exact agreement in the positioning of events over Britain should not be expected, either between RCMs and observations or between nested RCMs (due to internal variability of the models). Hence performance assessments are hereafter based on statistical comparisons using the four measures given in Section 2.3.1 (low flow volume lfv, median flow mdf, high flow volume hfv and flood frequency ffr).

The performance measures for all 32 example catchments (Fig. 3) show that, compared to the use of observed inputs, the 12 km RCM performs relatively well across the full flow range in many catchments, but the 1.5 km RCM generally performs less well than the 12 km RCM. More catchments have positive than negative biases using RCM data, and where the 12 km RCM shows positive bias the 1.5 km RCM often shows a more accentuated positive bias, especially for flood frequency fit. This is consistent with the mean precipitation bias being higher in the 1.5 km RCM than the 12 km RCM (Chan et al. 2013); a known weakness of convection-permitting models (Kendon et al. 2012).

Fig. 3
figure 3

Performance of the 12 km RCM (green) and 1.5 km RCM (blue) for simulating flows in the 32 example catchments, compared to using observed inputs (red), in terms of the fit of low flow volumes (lfv), median flows (mdf), high flow volumes (hfv) and flood frequency curves (ffr). The catchments are ordered by area (smallest at the top). The shading shows performance bands (dark grey – best, to white – worst), and the number of catchments in each band for each simulation is presented at the top

Mapping the performance measures (Fig. 4) shows clear spatial patterns for the 1.5 km RCM, with generally positive biases in the east and negative biases in the west. These patterns are either less clear or not present for the 12 km RCM and for observed inputs. The patterns are especially pronounced for high flows (volumes and flood frequency), and are consistent with the spatial patterns found in precipitation biases for the 1.5 km RCM, which are more pronounced than for the 12 km RCM (Fig. 3 of Chan et al. 2013).

Fig. 4
figure 4

Maps of the four performance measures (lfv, mdf, hfv, ffr), using observed inputs (left) and inputs from the 12 km RCM (middle) and 1.5 km RCM (right). Circles are located at the outlets of the 32 example catchments, and coloured by the value of the performance measure (see key)

Plots of the performance measures against catchment properties (Fig. 5) show that, while performance is relatively independent of catchment area and baseflow index, it is poorer for catchments with low annual rainfall or low altitude. In Britain, such catchments are generally located in the south and east of the country, so this is consistent with the spatial patterns of performance (Fig. 4), thus the lower performance in such catchments is likely to be due to the coincident spatial patterns of rainfall bias rather than catchment properties.

Fig. 5
figure 5

Plots of the four performance measures (lfv, mdf, hfv, ffr), using observed inputs (red circles) and inputs from the 12 km RCM (green crosses) and 1.5 km RCM (blue plus signs), against catchment area, standard annual average rainfall (SAAR), maximum catchment altitude (Max Alt) and baseflow index (BFI) (Table 2)

3.2 Changes in flood peaks using GCM-driven RCM runs

Percentage changes in 10-year return period flood peaks, simulated using data from corresponding Current and Future RCM runs, show some clear differences between the 12 and 1.5 km RCMs (Fig. 6), both in terms of spatial patterns and histograms summarising changes. The biggest differences are seen in spring, where the changes are often higher when simulated using data from the 1.5 km RCM than from the 12 km RCM. Similar but less pronounced differences are seen for changes in winter and annual peaks. This is consistent with the greater intensification of winter daily precipitation extremes seen in the 1.5 km RCM than the 12 km RCM (Fig. 3d of Chan et al. 2014), and the locations of the largest increase, over parts of Wales and the north–west, are also consistent (Fig. 1d of Chan et al. 2014).

Fig. 6
figure 6

Maps of the percentage changes in annual and seasonal flood peaks with a 10-year return period, between the Current and Future time-slices, for the 12 km RCM (left) and the 1.5 km RCM (middle), together with histograms of the percentage changes shown in the maps (right)

The least difference in the percentage changes in 10-year return period flood peaks is seen for summer peaks, despite summer being the season showing the biggest differences in sub-daily precipitation projections for the 12 and 1.5 km RCMs (Fig. 3 of Kendon et al. 2014). This illustrates the low influence that summer sub-daily precipitation has on flood peaks in most of the catchments being modelled here (that is, only those river points with drainage areas above 50 km2, so not very small, flashy catchments). Histograms stratified by catchment area (Area < 100 km2; 100 km2 < Area < 500 km2; Area > 500 km2) show little difference to the full histograms in Fig. 6, even for summer (not shown). Results for 2- and 5-year return period flood peaks are similar to those for the 10-year return period, although the percentage changes are generally smaller for lower return periods (not shown).

4 Discussion and conclusions

Although Regional Climate Models (RCMs) are a significant improvement upon Global Climate Models (GCMs) in terms of representation of rainfall patterns (e.g., Durman et al. 2001), their usual spatial scale (tens of kilometres) means that there are still deficiencies, particularly with smaller scale processes like convective storms that require parameterisation schemes (e.g., Molinari and Dudek 1992). Recently, the first decadal-length runs of RCMs with convection-permitting scales were performed. The aims here were to test the performance of data from these RCMs when used to drive hydrological models to simulate river flows, and to assess any differences in projections of flood change using data from the finer scale RCM runs compared to more usual scale runs.

The results using ERA-driven RCM runs (1.5 km RCM nested in a 12 km RCM driven by ERA-Interim boundary conditions for 1989–2008) show that the 1.5 km RCM generally performs worse than the 12 km RCM for simulating river flows in 32 example catchments in southern Britain, with a clear east/west pattern of bias. This is consistent with patterns of mean bias shown in the RCM precipitation data. Despite the 1.5 km RCM having much more realistic hourly rainfall characteristics, it has a tendency for too intense heavy rainfall. This is a current inherent weakness of convection-permitting models (Kendon et al. 2012) and it is this mean bias which seems to be dominating the fluvial results here. This suggests that, currently, very high resolution RCMs are not automatically more reliable for hydrological modelling than lower resolution RCMs, especially for large-scale river flooding. Improvements are needed if finer resolution RCMs are to become more routinely used, although it is possible that greater benefits of the 1.5 km RCM may be seen in smaller catchments.

The results using GCM-driven RCM runs (1.5 km RCM nested in a 12 km RCM nested in a GCM, for the current climate and a potential future climate) show clear differences in projected changes in flood peaks. In all seasons except summer the 1.5 km RCM tends towards larger increases in flood peaks than the 12 km RCM, with differences most pronounced for spring and winter. This is consistent with greater increases in winter daily precipitation extremes in the 1.5 km RCM. However, the robustness of this result is unclear, both because of the higher biases shown in the 1.5 km RCM and because only one set of such runs has so far been performed, for a relatively short period (~13 years), so the effects of natural variability may be a significant factor.

The apparent insensitivity of changes in summer peak flows, despite the 1.5 km RCM showing significantly greater increases in sub-daily rainfall intensity than the 12 km RCM, may be due to the lower limit on the drainage area of rivers mapped (50 km2). It is possible that very small flashy catchments would show different results, but such catchments would be better modelled with a finer scale gridded model than that applied here, or even with a catchment-based hydrological model which can be setup and calibrated specifically for the details of small catchments. In particular, small urban catchments are more likely to be adversely affected by increases in intensity of sub-daily rainfall in summer, in terms of both river (fluvial) flooding and surface water (pluvial) flooding, but modelling the flow of water in such catchments is complex due to the presence of artificial drainage channels etc.

Future plans involve further investigation of flood projections, including using the RCM data applied here with a new modelling approach developed for surface water flood forecasting (Cole et al. 2013) which was trialled operationally during the Glasgow 2014 Commonwealth Games (Moore et al. 2015). Rather than relying on exceedance of rainfall thresholds, as many existing pluvial flood warning systems do, the new approach uses gridded surface runoff estimates from the Grid-to-Grid model (Bell et al. 2009), which is also used for operational fluvial flood forecasting in Britain (Price et al. 2012). The approach will be used to investigate potential changes in the frequency of pluvial flooding, using the Current and Future 1.5 and 12 km RCM data, compared to changes projected just from rainfall thresholds. In addition, finer resolution RCM runs are currently being performed for the rest of Britain, so it will be interesting to see if the results shown here for southern Britain apply elsewhere.