1 Introduction

The extratropical storm tracks are an important component of the climate system. They impact the general circulation of the atmosphere and oceans through the transport of heat, momentum and moisture and they have considerable societal impacts through their associated weather phenomena (Pinto et al. 2007; Dailey et al. 2009; Schwierz et al. 2010). Being able to predict changes that may occur to the storm tracks is therefore crucially important, both in terms of assessing the potential societal impacts of a modified climate and for understanding other large-scale changes in the atmosphere–ocean system.

The climate change response of the storm tracks has been the focus of a large number of studies in recent years and there is now an emerging consensus as to the qualitative nature of some of the larger-scale changes (e.g. Yin 2005; Bengtsson and Hodges 2006; Ulbrich et al. 2008, 2009; Catto et al. 2011; Chang et al. 2013; Zappa et al. 2013). However, there are still substantial differences in the predictions of current state-of-the-art climate models as to how the storm tracks will respond to climate change (Harvey et al. 2012). In order to increase confidence in climate change projections there is a need to better understand the physical mechanisms causing changes in the storm tracks, and the reasons that projections differ between models.

Numerous mechanisms have been discussed in the literature by which the storm tracks could respond to an altered climate (e.g. Lunkeit et al. 1998; Geng and Sugi 2003; Lim and Simmonds 2009; Butler et al. 2010). Individual storms extract energy from horizontal temperature gradients, therefore any changes to temperature gradients may be expected to affect the storm tracks. In addition to this, the local baroclinicity (as measured for instance by the Eady growth rate) is a function of the static stability, with increased stability inhibiting storm growth, so the static stability may also be expected to influence the storm track response to climate change. Finally, the release of latent energy through the condensation of water vapour affects the structure of individual storms and therefore a change in local moisture content may be expected to influence the storm tracks. Changes in the atmospheric moisture content will also have an indirect effect via changes in the large-scale thermal structure of the atmosphere associated with altered latent heat fluxes; however, this effect would be captured by the temperature gradient and stability mechanisms (Schneider et al 2010).

The zonal-mean warming pattern projected by climate models for the coming century varies with latitude and with height. There are regions of relatively strong warming over the tropics in the upper troposphere and over the polar regions in the lower- to mid-troposphere (see Fig. 1 in this paper or Figure 10.7 from Solomon et al. 2007). As discussed by numerous authors (e.g. Lim and Simmonds 2009; Butler et al. 2010; Hernández-Deckers and von Storch 2010), each region of warming may impact the storm tracks via both the horizontal temperature gradients or the stability mechanisms. The equator-to-pole temperature difference is increased at upper levels by the tropical warming and decreased at low levels by the polar warming. The (dry) static stability, however, is increased in the tropics and subtropics by the tropical warming and decreased in the polar regions by the polar warming. The spatial pattern of the storm track responses can therefore be expected to vary spatially and be dependent on the relative magnitudes of the warming in the tropical upper-troposphere and the polar lower-troposphere.

Fig. 1
figure 1

Multi-model, zonal and annual mean HIST temperature (gray contours; units: deg C) and its RCP8.5 response (shading). The horizontal lines indicate the tropical and polar regions used to define the equator-to-pole temperature differences in Sect. 2.2

Several idealised modelling studies have aimed at understanding the impact that each of the regions of enhanced warming have on the extratropical storm tracks in turn, and the mechanisms involved (Lim and Simmonds 2009; Hernández-Deckers and von Storch 2010; Butler et al. 2010). Rind (2008) shows that the relative magnitude of the warming at low and high latitudes varies widely between models in the CMIP3 ensemble, therefore it is possible that the differences in the responses of the extratropical storm tracks may be related to differences in the responses of the temperature structure of the atmosphere. The idealised modelling studies give some insight into this and in addition suggest how constraining the temperature responses in the low and high latitudes may reduce uncertainty in the storm track projections.

This study compliments the idealised modelling experiments by analyzing how the storm track responses in the CMIP5 multi-model ensemble of climate models vary with the responses of the tropospheric temperatures. The inter-model spread in the storm track responses is analysed in terms of the responses of the equator-to-pole temperature differences in the upper-troposphere and in the lower-troposphere. This approach, based on temperature differences rather than the absolute temperature value in each region of warming, captures the inter-model spread associated with the tropical and polar regions of warming whilst recognizing that the equator-to-pole temperature difference is the primary energy source of the extratropical storm tracks. Whilst allowing insight into the sources of spread between the models, this method also highlights the possible drivers of the mean storm track responses.

The paper is organised as follows. The methods and data used are documented in Sect. 2. Regressions of the storm track responses against the responses of zonal-mean temperature differences are presented in Sect. 3 and regressions of the storm track responses against the responses of basin-wide equator-to-pole temperature differences are presented in Sect. 4. A summary and discussion are in Sect. 5.

2 Methodology

2.1 Description of models and scenarios

The data used here is taken from CMIP5 (the fifth phase of the World Climate Research Programme’s Coupled Model Intercomparison Project, see Taylor et al. 2012). Data from the following 24 models is considered in this study: BCC-CSM1.1, BCC-CSM1.1M, CanESM2, CCSM4, CMCC-CM, CNRM-CM5, CSIRO-Mk3.6.0, EC-EARTH, FGOALS-s2, GFDL-ESM2G, GFDL-ESM2M, HadGEM2-CC, HadGEM2-ES, INM-CM4, IPSL-CM5A-LR, IPSL-CM5A-MR, IPSL-CM5B-LR, MIROC-ESM, MIROC-ESM-CHEM, MIROC5, MPI-ESM-LR, MPI-ESM-LR, MRI-CGCM3, NorESM1-M. One ensemble member is used from each scenario for each model, usually the run denoted r1i1p1. The only exceptions, due to data availability, are CCSM4 where r6i1p1 is used and EC-EARTH where r9i1p1 is used. The choice of one ensemble member per model was adopted to avoid the issue of how to weight models which have different numbers of ensemble members. However, an adaption of the 2-way ANOVA framework suggested by Sansom et al. (2013) could be one possible solution.

In this study the responses of the models to future emissions scenarios are considered. In particular, the changes in the model climates between the end of the twentieth century (June 1976–August 2005), using the historical all-forcings experiment (HIST), and the end of the twentyfirst century (June 2070–August 2099), using both the RCP4.5 and RCP8.5 CMIP5 emissions scenarios (Moss et al. 2010). The figures in this paper show results from the RCP8.5 runs; results are also shown for reference in the Supplementary Material from the same period of the RCP4.5 runs and the main differences between the two scenarios are noted in the text.

2.2 Model diagnostics and methods

The measure of storm activity used in this study is the standard deviation of the 2-6 day bandpass-filtered daily-mean mean sea level pressure (MSLP) field, and this will be referred to as the storm track in the following. This commonly-used diagnostic provides a simple assessment of synoptic-scale activity using only daily-mean MSLP data (e.g. Hoskins and Hodges 2002; Chang 2009). The Lanczos filter with a 61-day convolution vector is used to filter the data, that length having a fairly clean frequency cut-off (Duchon 1979). The filter is applied to the full time-series and then the storm track for each season is calculated by first interpolating the filtered data onto a common n32 Gaussian grid and then taking the standard deviation of each season throughout the relevant time period. The result is a single storm track map for each season of each model run. The responses of the storm tracks are defined for each model as the difference between the seasonal storm track values of the RCP and the HIST runs. In this paper only the winter and summer seasons of each hemisphere are considered.

The equator-to-pole temperature differences are measured using the following two diagnostics. Lower- and upper-level equator-to-pole temperature differences are defined as the difference between the area-average time-mean temperature of the tropical (30S to 30N) and the polar (north of 60N for the NH and south of 60S for the SH) regions calculated at the 850 and 250 hPa levels respectively (see Fig. 1). These are denoted as \(\Updelta T850\) and \(\Updelta T250\) and are calculated separately for each hemisphere. Where appropriate, a subscript denotes the hemisphere so that, for instance,

$$\Updelta T850_{\rm NH}=T850_{\rm (30S-30N)}-T850_{\rm (60N-90N)}$$
(1)

is the difference between the tropical and northern polar temperatures at the 850 hPa level. In Sect. 4, longitudinally-confined temperature differences are also defined for the Atlantic and Pacific sectors, and these will be denoted as \(\Updelta T850_{\rm ATL},\,\Updelta T250_{\rm ATL},\,\Updelta T850_{\rm PAC}\) and \(\Updelta T250_{\rm PAC}\) respectively. Note that all of the temperature differences are defined as the tropical temperature value minus the polar temperature value and therefore they are all positive.

In the following, the inter-model spread of the storm track responses is regressed against the inter-model spread of the temperature difference responses using simple linear regressions. That is, the numbers α and β are calculated at each grid point so as to minimise the RMS of the residuals \(\epsilon_i\) in the equation

$$ST_{{\rm resp},i}=\alpha+\beta\Updelta T_{{\rm resp},i}+\epsilon_i$$
(2)

where ST respi is the storm track response from model i and \(\Updelta T_{{\rm resp},i}\) is the response of one of the temperature differences from model i. The association between the responses of the temperature variables and the storm track variables is assessed by considering maps of the following: the regression slopes β, the significance of the inter-model correlations between ST resp and \(\Updelta T_{\rm resp}\), and the fraction of variance explained (FVE) by the regression model (2), which is defined as

$$FVE=1-\frac{\sum{\epsilon_i^2}}{\sum{ST_{{\rm resp},i}^2}}.$$
(3)

The significance of the inter-model correlations between ST resp and \(\Updelta T_{\rm resp}\) is assessed using the t-statistic t = r/s r where \(s_r=\sqrt{(1-r^2)/(n-2)}\) is the standard error of the inter-model correlation coefficient, r, and n is the number of models.

The regression model (2) is designed to capture the inter-model spread of the storm track responses at each grid point. Regions where β is positive indicate a positive association between the temperature difference responses and the storm track responses. In some cases it may also be appropriate to consider the influence on the multi-model mean storm track response of the multi-model mean temperature difference response, as predicted by the linear regression model. That is, β multiplied by the multi-model mean temperature difference response, and this is discussed in the text throughout this paper.

3 Hemispheric results

3.1 Storm track responses

Figure 2 shows the multi-model mean storm track from the HIST experiment (contours) and the RCP8.5 climate change responses for the winter and summer seasons of each hemisphere. The patterns of the responses are qualitatively similar to those produced in other CMIP5 studies using similar diagnostics, for instance Chang et al. (2013) and Harvey et al. (2012). The NH exhibits a general weakening of the storm tracks during winter, with localised increases over the ocean basins and western Europe. During summer, the NH decrease is apparent over most of the mid-latitude region, with increases largely confined to the Arctic basin. During summer in the SH, the response is a poleward shift of the storm track maximum and a weakening across the subtropics. In contrast, the storm track magnitude increases almost everywhere in the SH winter, with little sign of a poleward shift, and decreases are confined to the subtropical Pacific region. In this study we restrict attention to the CMIP5 simulations, however we note briefly that Harvey et al. (2012) compare the wintertime responses of the storm tracks in CMIP3 and CMIP5 and show that the responses in the two ensembles are very similar, with the main differences confined to the NH sub-polar regions.

Fig. 2
figure 2

Multi-model mean HIST storm tracks (contours; units: hPa) with a, c the multi-model mean storm track response under RCP8.5 (shading), and b, d the inter-model standard deviation of the responses under RCP8.5 (shading). In panels (a, c) stippling indicates that the multi-model mean response is non-zero at the 95 % confidence level according to a Student’s t test. In this and future plots, the seasons used are indicated in each sub-panel; panels a, b show the winter for each hemisphere and panels c and d show the summer for each hemisphere

Also shown in Fig. 2 is the inter-model standard deviation of the storm track responses for each season. The inter-model standard deviation is large in the SH where the mean responses are largest. It is also large in the NH storm tracks during winter, where the mean responses are weaker. The aim of the current work is to understand which physical processes are causing the large spread between the models, and in particular what role the changes in the equator-to-pole temperature differences play in determining the storm track responses.

3.2 Temperature difference responses

The equator-to-pole temperature differences used here are defined in Sect. 2.2 and consist of seasonal-mean values of the equator-to-pole temperature difference at a lower-tropospheric level (850 hPa) and at an upper-tropospheric level (250 hPa). Figure 3 shows the range of model responses of each temperature difference for both summer and winter. Also shown for reference is the range of responses of the global mean surface temperature (Ts). As noted by Rind (2008) for the CMIP3 models, the spread in the temperature difference responses is larger than the spread of the global mean surface temperature responses.

Fig. 3
figure 3

Box-and-whisker symbols illustrating the median, inter-quartile range and the full range of the individual model responses of the (left column) annual-mean global surface temperature and (other columns) the zonal-mean DJF and JJA equator-to-pole temperature differences (as defined in Sect. 2.2); the lower section displays the multi-model mean (AVG) and inter-model standard deviation (SDV) of each difference

The upper-level temperature difference increases in all models in both hemispheres and in both of the seasons considered. That is, the upper-tropospheric tropical region consistently warms more than the upper-tropospheric polar regions. The lower-level temperature difference increases in the multi-model mean in the SH and decreases in the NH. Therefore there is an asymmetry between the responses of the low-level temperature differences in the NH and SH, as can be seen in the cross section of Fig. 1. However, the only lower-level temperature difference with a complete agreement on the sign of responses between the models is \(\Updelta T850_{\rm NH}\) in DJF.

It is informative to consider the relative role of the temperature responses in the tropical and polar regions in determining the model spread in the responses of the temperature differences. Table 1 shows the values of the inter-model correlations between each temperature difference response and the surface temperature response averaged over the tropical region and over the relevant polar region. The spread in the responses of the upper-level temperature differences is dominated by the contribution from the tropical region, with particularly high correlation values for the SH temperature difference response. In contrast, the spread in the responses of the low-level temperature differences is dominated by the polar temperature response, with the exception of the SH summer difference response for which both the tropical and polar regions appear to be important. Therefore the inter-model spread of the upper-level temperature difference responses largely reflects the spread of the surface warming in the tropical region, and, with the exception of SH summer, the inter-model spread of the lower-level temperature difference responses reflects the spread of the polar surface warming. Consistent with the NH/SH difference in the low-level temperature difference correlations, there is no significant inter-model correlation between the upper- and lower-level temperature difference responses in the NH, whereas the upper- and lower-level temperature difference responses in the SH are correlated with values of 0.45 and 0.50 for DJF and JJA respectively.

Table 1 Inter-model correlation values between the responses of the equator-to-pole temperature differences and the responses of the tropical (30S–30N) and polar (north of 60N for the NH differences and south of 60S for the SH differences) surface temperatures

3.3 Storm track regression maps

To assess the relationship between the responses of the equator-to-pole temperature differences and the responses of the storm tracks, Fig. 4 shows the linear regression of the storm track responses on the equator-to-pole temperature difference responses (β from Eq. 2) for each of the temperature differences.

Fig. 4
figure 4

The inter-model regression between the storm track responses and the responses of a, c the lower-level temperature differences and b, d the upper-level temperature differences. Panels a, b show winter for each hemisphere and panels c, d show summer for each hemisphere. In each sub-plot the regression is performed using the relevant temperature variable for that hemisphere. Stippling indicates a significant correlation at the 95 % confidence level

In both hemispheres there are large regions of positive regression slope. In these regions the storm track increases with the equator-to-pole temperature difference response. This sign of relationship is consistent with the storm track responses being driven by the responses of the baroclinicity, consistent with the study of Hwang et al. (2011). The only regions with significant negative regression slopes are the Arctic Ocean during summer in the \(\Updelta T850\) regression (Fig. 4c) and the SH subtropics during summer in the \(\Updelta T250\) regression (Fig. 4d). Each hemisphere is now discussed in detail.

3.3.1 Southern Hemisphere

In both seasons in the SH there is a large degree of similarity between the spatial patterns of the regressions for the upper- and lower-level temperature differences. In addition, both regression patterns resemble the spatial structure of both the multi-model mean storm track response and the inter-model standard deviation of Fig. 2. In general, the \(\Updelta T850_{\rm SH}\) regression values are than the \(\Updelta T250_{\rm SH}\) regression values; however, the multi-model mean response of \(\Updelta T250_{\rm SH}\) is larger than the multi-model mean response of \(\Updelta T850_{\rm SH}\), by factors of about 3 for DJF and 10 for JJA (see Fig. 3). This suggests a dominant role for the upper-level temperature difference in setting the multi-model mean storm track response.

The wintertime regressions (Fig. 4a, b) show a general strengthening of the storm track across the SH storm track region associated with increasing temperature differences whereas the summertime regressions (Fig. 4c, d) also have a strengthening but it is confined to a narrow latitudinal band on the southern edge of the storm track, resembling the poleward shift of the mean response. The fraction of inter-model variance explained (FVE; see Fig. 5) is over 30 % in much of the SH storm track region and locally exceeds 50 % in several locations.

Fig. 5
figure 5

The fraction of inter-model storm track variance explained by the temperature difference regressions of Fig. 4

The spatial similarity between the upper- and lower-level regression maps is perhaps to be expected given the correlation between the SH temperature differences noted above. However, there are some notable differences between the two maps. For instance, in winter (Fig. 4a, b) there is no association between the storm track responses and the \(\Updelta T250_{\rm SH}\) responses in and around the Drake Passage region of the Southern Ocean whereas there is a strong association with the \(\Updelta T850_{\rm SH}\) responses. In contrast, in summer the subtropical weakening associated with an increase in \(\Updelta T250_{\rm SH}\) (Fig. 4d) is not present in the \(\Updelta T850_{\rm SH}\) regression (Fig. 4c). This region of negative regression is consistent with static stability dominating the storm track response in the subtropics. There is a high correlation between \(\Updelta T250_{\rm SH}\) and the surface temperature in the tropics (Table 1). Since the dry lapse rate is expected to increase with the surface temperature in the tropics the dry static stability will also be correlated with \(\Updelta T250_{\rm SH}\) (Frierson 2006).

3.3.2 Northern Hemisphere

In the NH there is less similarity than in the SH between the upper- and lower-level regression maps, in either season. In NH summer there is a significant correlation between the storm track responses and the \(\Updelta T850_{\rm NH}\) responses over much of the hemisphere (Fig. 4c). The spatial pattern of the regression map is similar to the multi-model mean storm track response of Fig. 2c, with the opposite sign. Recall from Fig. 3a that the multi-model mean JJA response of \(\Updelta T850_{\rm NH}\) is negative and is therefore of the correct sign to explain the multi-model mean storm track response which is negative in the mid latitudes and positive over the Arctic. The FVE by \(\Updelta T850_{\rm NH}\) in the summer is over 40 % in the North Atlantic but less than 30 % in much of the North Pacific. In contrast, there is very little association between the summer NH storm track responses and the \(\Updelta T250_{\rm NH}\) responses in either basin, with small regression values, insignificant correlations and low FVE values.

In NH winter (Fig. 4a, b), there is a positive association between the storm track responses and the \(\Updelta T850_{\rm NH}\) responses across most of the northern hemisphere. In contrast, the region of significant association between the storm track responses and the \(\Updelta T250_{\rm NH}\) responses is confined to the ocean basins. The multi-model mean DJF \(\Updelta T850_{\rm NH}\) response is negative (Fig. 3) so the impact of \(\Updelta T850_{\rm NH}\) on the multi-model mean storm track response is negative, consistent with the weakening of the low-level baroclinicity. The influence on the multi-model mean storm track response from \(\Updelta T250_{\rm NH},\) however, is positive over the ocean basins. The FVE by each temperature difference is low in the NH winter (Fig. 5a) in Atlantic and Pacific storm track regions.

Unlike the results for the SH and the NH summer, neither of the two NH winter regression maps (Fig. 4a, b) appear to individually capture both the general reduction of the storm tracks in the NH extratropics and the increase over the ocean basins of the multi-model mean storm track response (Fig. 2a). However, the two regression maps combined do capture the spatial structure of the multi-model mean response. Figure 6 shows the linear sum of the mean storm track responses predicted by the \(\Updelta T850_{\rm NH}\) and \(\Updelta T250_{\rm NH}\) regression models (that is, the sum of each β multiplied by the multi-model mean temperature difference responses) as a simple test of this. A full multiple linear regression has also been performed using the two temperature differences together, however the result is very similar to Fig. 6 due to the small correlation between the responses of the two temperature differences noted above. In Fig. 6, the decrease in \(\Updelta T850_{\rm NH}\) is associated with the general decrease in storm activity across the hemisphere in the mean response and the increase in \(\Updelta T250_{\rm NH}\) is associated with the localised increases over the ocean basins. This more complex behaviour, in which the responses are different for the upper- and lower- level temperature differences, is unique to the NH winter and may go some way towards explaining the particularly large inter-model spread in the North Atlantic region.

Fig. 6
figure 6

The sum of the mean storm track response predictions for NH winter by the \(\Updelta T850\) and \(\Updelta T250\) regressions

To test the robustness of both the NH and SH regressions to the choice of scenario, the RCP4.5 versions of Figs. 2, 3, 4 and 5 are shown in the Supplementary Material. The text there describes them in detail; here it is just noted that there is a considerable level of agreement between the spatial patterns and magnitudes of the regression maps in RCP4.5 and RCP8.5 in both seasons in the SH and in the NH in summer. However, in NH winter there is less agreement: in RCP4.5 the \(\Updelta T850_{\rm NH}\) responses explain a larger fraction of the variance in the northern North Atlantic than in Fig. 5a. The reason for this difference is discussed further in Sect. 4.

Also shown in the Supplementary Material are regressions of the storm track responses against the global mean surface temperature responses for RCP8.5. Those show that whilst there are some regions with significant correlations present between the storm track responses and the global temperature responses, these are in general weaker and the regressions explain a smaller fraction of the variance than the temperature difference regressions presented here, thus supporting the choice of considering equator-to-pole temperature differences as the regression variable. As discussed in the Supplementary Material, the global mean surface temperature responses are correlated with the \(\Updelta T250\) responses, with the exception of NH JJA, and some aspects of the regression maps reflect this relationship.

4 A closer look at Northern Hemisphere winter

In this section the wintertime Northern Hemisphere storm track responses are studied in more detail. Since the zonal-mean temperature difference responses considered above do not account for much of the inter-model spread in the NH winter, we instead consider longitudinally-confined measures of the equator-to-pole temperature differences covering each of the two main storm track regions of the NH. The aim is to find out whether using more localised equator-to-pole temperature differences can explain a larger fraction of the inter-model variance of the storm track responses.

4.1 Atlantic and Pacific temperature differences

The longitudinally-confined equator-to-pole temperature differences used here are defined similarly to the zonal-mean temperature differences used above, except confined to the sectors 10W–60W (Atlantic) and 150E–220E (Pacific), and will be denoted as \(\Updelta T850_{\rm ATL},\,\Updelta T250_{\rm ATL},\,\Updelta T850_{\rm PAC}\) and \(\Updelta T250_{\rm PAC}\). These sectors are indicated in Fig. 7a. Here, as before, the differences are taken between the latitude bands of 30S–30N and north of 60N, and the number indicates the pressure level in hPa.

Fig. 7
figure 7

Panel a Multi-model and DJF mean surface temperature (gray contours; units: deg C) and its RCP8.5 response (shading); the solid-line boxes illustrate Atlantic and Pacific regions used to define the temperature differences defined in Sect. 4.1, thick contours show the 5 and 6 hPa regions of the multi-model mean HIST storm track (see Fig. 2) for reference and the small dashed-line boxes in the North Atlantic show the two regions discussed in Sect. 4.2. Panel b Box-and-whisker symbols illustrating the median, inter-quartile range and the full range of the individual model responses of the DJF Atlantic and Pacific temperature differences; the lower section displays the multi-model mean (AVG) and inter-model standard deviation (SDV) of each index

Figure 7b shows the range of model responses of these four temperature differences for DJF. The multi-model means and inter-model standard deviations of the responses are similar to the DJF zonal-mean versions of Fig. 3 in both basins, with the \(\Updelta T250\) responses positive and the \(\Updelta T850\) responses negative. The \(\Updelta T850\) response is smaller in the Atlantic than in the Pacific, and is positive in three of the models.

4.2 Atlantic and Pacific storm track regression maps

Figure 8 shows the storm track regressions and FVE for the responses of the two Atlantic temperature differences (\(\Updelta T850_{\rm ATL}\) and \(\Updelta T250_{\rm ATL}\)). Both of these regression maps are qualitatively similar to the regression maps from the zonal-mean analysis of Fig. 4a, b, with the \(\Updelta T850_{\rm ATL}\) regression positive throughout much of the hemisphere and the \(\Updelta T250_{\rm ATL}\) regression also positive but localised to the ocean basins. However, the regression values in the North Atlantic storm track region are larger for the Atlantic temperature differences than for the zonal-mean temperature differences. The Atlantic sector temperature differences also have larger regions of significant correlation and larger FVE values, exceeding 50 % in a large part of the North Atlantic for \(\Updelta T850_{\rm ATL}\), than the zonal-mean temperature differences. Therefore the Atlantic temperature difference responses have a stronger association with the North Atlantic wintertime storm track responses than the zonal-mean temperature differences.

Fig. 8
figure 8

Regression analysis based on the North Atlantic temperature differences. Panels a, b the inter-model regression between the storm track responses and the responses of the lower- and upper-level temperature differences respectively; stippling indicates a significant correlation at the 95 % confidence level. Panels c, d the FVE by the regression of the lower- and upper-level temperature differences respectively

Despite the high FVE in the North Atlantic using this method, care must be taken when inferring a physical mechanism for the change. For example, Woollings et al. (2012) argue that the North Atlantic wintertime storm track responses are influenced by changes in the ocean circulation, via changes in the sea surface temperature (SST) and ice edge position, both of which may potentially impact the North Atlantic storm track. As a simple test of the role of the local surface temperature responses compared to the role of the large-scale equator-to-pole measures considered here, a further regression analysis has been performed between the storm track responses and the responses of the local East Atlantic meridional surface temperature gradient. This is defined as the difference in temperature between the small solid-lined boxes in the East Atlantic in Fig. 7: 10W–35W, 30N–47N and 10W–35W, 47N–60N. This definition is chosen to capture the changes in SST gradient suggested by Woollings et al. (2012) to be important for the North Atlantic storm track responses, and in particular the region of mediated warming in the central North Atlantic associated with a reduction of the Atlantic Meridional Overturning Circulation (Drijfhout et al. 2012). The responses of this local surface temperature difference are strongly correlated with the \(\Updelta T850_{\rm NH}\) responses in winter (r = 0.64) and as a result the regression slope map (not shown) is very similar to the \(\Updelta T850\) regression slope map of Fig. 8a (see also Fig. 1f of Woollings et al. 2012). This emphasises how it is not clear from this analysis whether the storm track responses are driven directly by the large-scale equator-to-pole temperature differences or instead by the local surface temperature gradient responses.

Figure 9 shows the linear regressions and FVE by the responses of the two Pacific temperature differences (\(\Updelta T850_{\rm PAC}\) and \(\Updelta T250_{\rm PAC}\)) and the responses of the storm tracks. There is a striking difference between these plots and those of the zonal mean analysis in Figs. 4 and 5 in that over much of the hemisphere the association between the Pacific sector temperature differences and the storm track is no larger than with the zonal mean temperature differences of Fig. 4. Therefore the responses of the local equator-to-pole temperature differences do not appear to have a strong association with the storm track in the wintertime North Pacific. This suggests that other processes, for instance changes to the zonal structure of Tropical Pacific SSTs (Vecchi et al. 2006), which will not project onto the equator-to-pole temperature differences used here, may be influencing the responses of the Pacific storm track.

Fig. 9
figure 9

Regression analysis based on the North Pacific temperature differences. The panels are ordered as in Fig. 8

Again, the RCP4.5 version of Figs. 8 and 9 are shown in the Supplementary Material. In contrast to the zonal-mean temperature difference regressions discussed in Sect. 3.3, the \(\Updelta T850_{\rm ATL}\) regressions are very similar between RCP4.5 and RCP8.5. Therefore the difference between the RCP4.5 and RCP8.5 zonal-mean temperature difference regressions noted above is due to how well correlated the \(\Updelta T850_{\rm ATL}\) and \(\Updelta T850_{\rm NH}\) responses are. Consistent with this, the inter-model correlations between these two variables are larger for RCP4.5 (0.81) than for RCP8.5 (0.67).

In summary, the Atlantic basin equator-to-pole temperature difference responses have a stronger association with North Atlantic storm track responses than the zonal-mean equator-to-pole temperature difference responses do, and the regression maps are similar between RCP4.5 and RCP8.5. In contrast, the association between the Pacific basin equator-to-pole temperature difference responses and the North Pacific storm track responses does not appear to be any stronger than that of the zonal mean equator-to-pole temperature differences used in Sect. 3.

5 Summary and discussion

The aim of this study has been to determine the relationship between the responses of the storm tracks and the responses of the equator-to-pole temperature differences in the CMIP5 climate projections. A simple linear regression analysis has been performed on the storm track responses and the responses of both zonal-mean and basin-wide measures of the upper- and lower-tropospheric equator-to-pole temperature difference. Whilst allowing insight into the sources of the spread between the models, this method also highlights the possible drivers of the mean storm track responses.

In each of the main storm track regions there are regions where the storm track responses are significantly correlated (p = 0.05) with both the upper- and lower-tropospheric temperature difference responses in both the summer and winter seasons. The only exception to this are the upper-tropospheric temperature difference responses in NH summer which show no significant correlation with the NH storm track responses. In the SH the upper- and lower-tropospheric temperature difference responses are correlated and there are strong similarities between the two regression maps. In the NH the two temperature difference responses are not significantly correlated and the regression maps are qualitatively different. Regarding the mean responses predicted by the regression analysis:

  • In the SH in summer and winter both the upper- and lower-tropospheric temperature difference regression maps are qualitatively similar to the multi-model mean storm track responses.

  • In the NH in summer the lower-tropospheric temperature difference regression map is qualitatively similar to the multi-model mean storm track response, whereas the regression values for the upper-tropospheric temperature difference are small.

  • In the NH in winter the upper- and lower-tropospheric temperature difference regressions do not individually capture the full spatial structure of the multi-model mean storm track response; however, the linear combination of the two regressions does resemble the multi-model mean storm track response.

  • Over large parts of the main storm track regions the sign of the regression is such that an increase in equator-to-pole temperature difference is associated with an increase in storm activity.

The first three points suggest that in addition to explaining some of the differences between the models, the changes in the equator-to-pole temperature differences play a role in setting the multi-model mean climate change response of the storm tracks, and the final point is consistent with the storm tracks responding to the baroclinicity changes. Regarding the inter-model spread:

  • In the SH there are large regions in both the summer and winter seasons where the responses of both the upper- and lower-level temperature difference responses explain over 30 % of the inter-model variance of the storm track responses, and in some localised regions this is over 50 %.

  • In the NH the response of the upper-level temperature difference explains only a small fraction of the inter-model variance in both seasons; the response of the lower-level temperature difference explains over 40 % of the inter-model variance in the North Atlantic in summer, but only small amounts elsewhere.

This suggests that the spread due to model uncertainty in the climate change projections of the extratropical storm tracks may be reduced by constraining the relative temperature responses of the tropical and polar regions.

The small fraction of variance explained in NH winter by the zonal-mean temperature differences has been investigated further using equator-to-pole temperature differences restricted to the Atlantic and Pacific ocean basins in turn. The resulting regression maps are qualitatively similar to the zonal-mean temperature difference regression maps, however:

  • Over much of the North Atlantic the responses of the lower- and upper-level Atlantic temperature differences explain more of the inter-model variance of the storm track responses than the zonal-mean equator-to-pole temperature differences do, with values exceeding 50 and 30 % respectively.

  • Over the North Pacific there is little increase in the fraction of inter-model storm track variance explained by the responses of the Pacific temperature differences.

Therefore the North Atlantic wintertime storm track appears to be more sensitive to the local equator-to-pole temperature differences, whereas in the North Pacific other factors may dominate the storm track responses.

One limitation of this study is the use of a single measure of storm activity. In particular, as noted by Burkhardt and James (2006) and Chang (2009), Eulerian statistics such as the bandpass filtered MSLP variance used here are potentially susceptible to Doppler effects whereby changes in the mean propagation speed of storm systems cause power to ‘Doppler shift’ in to or out of the time filter window. Testing this by assessing the storm track responses using multiple measures of storm activity will be the subject of a future study.

Another limitation of the method used here is that the causality of the correlations cannot be determined. For instance, whilst the responses of the low-level Atlantic equator-to-pole temperature difference explains over 50 % of the inter-model variance in the North Atlantic, it is not clear whether the storm tracks respond directly to the equator-to-pole temperature difference or instead respond to more local baroclinicity changes, such as those caused by changes in sea ice, SSTs or the land-sea temperature contrast, which may themselves be correlated with the equator-to-pole temperature differences. In order to test this relationship, atmosphere-only GCM experiments will be run to isolate the impact on the storm track of changes in SSTs and sea ice verses changes in the large-scale baroclinicity. The results will be published in a future paper.