1 Introduction

The pattern-scaling technique is an efficient way to generate future projections of regional climate change under various emission scenarios (Santer et al. 1990; Mitchell 2003). In pattern scaling based climate projections, atmosphere-ocean global climate models (GCMs) are first used to estimate climate change under a specific emission scenario. The time-dependent projection is then separated into time-invariant spatial patterns of certain climate variables normalized by unit temperature change (“scaling pattern”) and the time series of global mean surface air temperature (GMST) change. Simplified climate models, such as energy balance models or intermediate-complexity climate models, are then used to estimate GMST changes under any emission scenarios, a task too expensive for fully coupled 3-dimensional GCMs. Finally, regional climate projections are emulated for a wide range of emission scenarios by multiplying the GMST changes from simple models by the scaling patterns of specific climate variables estimated from the global climate models (e.g., Ishizaki et al. 2014). The pattern-scaling approach has been demonstrated to approximate regional projections of temperature and precipitation satisfactorily for scenarios driven mostly by increases in well-mixed, long-lived greenhouse gas (GHG) concentrations (Mitchell 2003; Tebaldi and Arblaster 2014).

Besides temperature and precipitation, an important climate metric relevant to drought studies is potential evapotranspiration (PET) (Middleton and Thomas 1992). PET is an important component of the Palmer Drought Severity Index (PDSI) (Alley 1984; Dai et al. 2004) and is defined as “the amount of water transpired in unit time by a short green crop, completely covering the ground, of uniform height and never short of water” (Hartmann 1994; Allen et al. 1998). Sherwood and Fu (2014) proposed that the Aridity Index (AI), defined as the ratio of precipitation (P) to PET, should be used for aridity impact studies, rather than precipitation alone, because increases in PET can lead to more severe aridity, even in regions with increasing precipitation (Feng and Fu 2013). Some other studies, alternatively, suggested that PET increase (or AI decrease as in the model projections) may not be very relevant for observed pan evaporation and aridity trends (e.g., Roderick et al. 2015; Milly and Dunne 2016). Nevertheless, PET is a useful metric in that it is an essential input to most local hydrologic (i.e., runoff) impact models and it helps in quantifying irrigation water demand for agriculture activities (Allen et al. 1998).

Using simulations from a GCM, Lin et al. (2015a and b) showed that PET would increase (i.e., drying conditions will become more prevalent) under global warming conditions and the magnitude of drying is larger under higher GHG emissions scenarios. These results are in agreement with earlier work of Feng and Fu (2013) and Scheff and Frierson, (2014). It is unclear, however, whether PET can be reliably obtained using pattern-scaling methods, which is one of the two main objects of this study.

The key assumption of the pattern-scaling approach is that the scaling pattern is not significantly different across emission scenarios (e.g., Representative Concentration Pathway (RCP) emission scenarios, RCP2.6, RCP4.5, RCP6.0, and RCP8.5). Previous research has raised the concern that the scaling pattern for temperature and precipitation could be dependent on how different the sulfate aerosol concentrations are in the emission scenarios (Mitchell et al. 1999; Schlesinger et al. 2000; Mitchell 2003; Shiogama et al. 2009; Ishizaki et al. 2013), but the usefulness of accounting for aerosol differences in the scenarios has not been fully assessed.

Therefore, in addition to providing a pattern scaling based projection of PET, another objective of this study is to examine how the pattern-scaling approach may be sensitive to the composition of radiative forcing or GMST change (Shiogama et al. 2009; Ishizaki et al. 2013; Tebaldi and Arblaster 2014). In this study, we used two sets of ensemble simulations from a GCM (one under the CMIP5 scenario RCP8.5 and one under the same RCP8.5 GHG emissions but keeping aerosol emissions at 2005 levels), to obtain the normalized spatial pattern of three climate variables (temperature, precipitation, and PET) due to GHGs and aerosols separately. We then multiplied the scaling patterns by the changes in GMST projected by an energy balance climate model to emulate climate change under standard RCP scenarios. The “hybrid-pattern” based projections were evaluated against the direct output of GCMs and were compared to the traditional pattern-scaling approach that used a single pattern.

2 Methods

2.1 Model description

2.1.1 The global climate model

Community Earth System Model version 1 (CESM1) is a fully coupled ocean–atmosphere-land model that provides state-of-the-art simulations of the Earth’s climate (Hurrell et al. 2013). The horizontal resolution is 0.9° latitude × 1.25° longitude for the atmospheric component (Community Atmospheric Model 5, CAM5) and 1° × 1° for the ocean. A comprehensive three-mode modal aerosol module (Liu et al. 2012) and a two-moment bulk cloud microphysical scheme have been implemented (Morrison and Gettelman 2008; Gettelman et al. 2010). CESM1 contains the physics to represent aerosol direct effects, semi-direct effects, and indirect effects for both liquid and ice phase clouds (Ghan et al. 2012), and it can capture the observed geographical and temporal variations of aerosol mass and number concentrations, size distributions, and optical properties (Liu et al. 2012).

2.2 The energy balance model (EBM)

The time evolution of GMST across all four RCP scenarios was also calculated using a simple energy balance model as in Ramanathan and Xu (2010). The zero-dimensional EBM has been validated against three-dimensional GCMs (Hu et al. 2013), and Figure S1.a shows the agreement of global average temperature simulated by the EBM and CESM1 (anomaly relative to 2006–2015).

The simple EBM accounts for historical and future variations in the energy input to the climate system attributable to natural factors (solar), GHGs, ozone, and aerosols (i.e., sulfate, nitrates, black carbon, and organic carbon). The total, GHG, and aerosol forcing time series used by the simple model were obtained from the RCP database (Figure S1.b, c, d). The temperature evolution due to GHGs and aerosols can then be calculated separately using the EBM and used in conjunction with the scaling patterns (obtained as described in Section 2.3) to emulate climate change under different emission scenarios.

2.3 GCM experiments under four RCP scenarios and a fixed-aerosols scenario

  1. (1)

    The CESM1 Large Ensemble (LE) consists of 30-member simulations from 1920 to 2080. The simulation beyond 2005 is forced by RCP8.5. Each member of the simulations is forced by the same GHG and aerosol forcing trajectory but starts from randomly perturbed initial conditions in the atmosphere (Kay et al. 2015).

  2. (2)

    We also performed another set of simulation (15 ensemble members), forced by RCP8.5 except that aerosol precursor emissions and atmospheric oxidants are fixed at the year 2005 level (RCP8.5_FixA) (Xu et al. 2015).

  3. (3)

    The RCP4.5 Medium Ensemble (ME) uses the same strategies to perturb initial conditions as Large Ensemble but with a smaller ensemble size of 15 and is forced by RCP4.5 (Sanderson et al. 2015).

  4. (4)

    Three ensemble members of CESM1 under RCP6.0 were performed as part of phase 5 of the Coupled Model Intercomparison Project (CMIP5) (Meehl et al. 2013).

  5. (5)

    Three ensemble members of CESM1 under RCP2.6 were performed as part of phase 5 of the Coupled Model Intercomparison Project (CMIP5) (Meehl et al. 2013).

The simulations in (3), (4), and (5) are used as the benchmark for evaluation of the pattern-scaling approach.

2.4 Obtaining the scaling patterns

The climate change pattern for a climate variable C(x, i) as simulated by a GCM at location x and time i can be approximated by:

$$ \mathrm{C}*\left(x,\ i\right) = T(i) \times p(x) $$
(1)

T(i) is the time-dependent scaling factor, p(x) is the time-invariant scaling pattern, and C*(x, i) is the time-dependent pattern emulated by the product of the two terms (Mitchell et al. 1999). The value of T(i) we used here is, as commonly done in pattern scaling projections, GMST change during the first through the seventh decade of the twenty-first century (the 2010s to 2070s), with the 1985–2005 average used as the baseline. That is T(i) is a time series of temperature (units: °C), such as 2010–2019 mean minus 1985–2005 mean, 2020–2029 minus 1985–2005, …, 2070–2079 minus 1985–2005.

p(x) describes how a climate variable changes at location x when the GMST rises by 1oC and is obtained from the set of regression coefficients that minimize the root-mean-square of the differences between the directly simulated pattern C(x, i) and the emulated pattern C*(x, i), as follows:

$$ p(x) = \Sigma \mathrm{i}\ \left[T(i) \times C\left(x,\ i\right)\right]/\Sigma \mathrm{i}\ T{(i)}^2 $$
(2)

Scaling patterns, p(x), for most climate variables, are usually expressed as the percentage change per °C of global warming with respect to the baseline. Previous research has often obtained p(x) based on a single, multi-forcing emission scenario (e.g., RCP8.5 or RCP4.5, see Fig. 1a, b), or as the average of individual patterns derived from different emission scenarios (Tebaldi and Arblaster 2014). Our study, in contrast, aimed at distinguishing the spatial patterns of climate response to GHGs (P_GHG, Fig. 1c) and aerosols forcings (P_aerosol, Fig. 1d) separately.

Fig. 1
figure 1

The scaling patterns of annual mean precipitation changes (%/°C) in response to changes in RCP8.5 (a), RCP4.5 (b), and GHGs only (c), aerosols only (d). Patterns a and b are used for the “single-pattern” approach for projection, while c and d are used for the “hybrid-pattern” approach. Note that “GHGs only” (c) and “aerosol only” (d) scaling patterns are obtained from RCP8.5 and RCP8.5_FixA simulations. The four black numbers next to adjacent maps show the pattern correlation between the two maps. The red number (0.70) shows the pattern correlation between the panels a and d. The green number (0.95) shows the pattern correlation between the panels b and c. Note the remarkable distinction between c and d and the similarity between a and b. The dotted regions have significant changes in precipitation across ensemble members (i.e., more than 80% of simulations have the same sign of changes). A similar figure with the absolute precipitation changes (mm/day/°C) is shown in Figure S2

Firstly, we used the ensemble simulation with fixed aerosols (RCP8.5_FixA) to obtain the scaling pattern due to GHGs forcing only (P_GHG, Fig. 1c). For example, to get the scaling pattern for precipitation, we applied the regression method as described above using precipitation and temperature changes from RCP8.5_FixA. Secondly, we contrasted RCP8.5 and RCP8.5_FixA for every decade of twenty-first century (2010s to 2070s) to obtain the temperature and precipitation due to aerosol only (RCP8.5–RCP8.5_FixA), assuming linear additivity of the two responses, and then calculated the scaling pattern due to anthropogenic aerosols forcing only (P_aerosols, Fig. 1d). Some non-linearity when adding different responses are expected (Ming and Ramaswamy 2009), but our previous work using the same model and different aerosol forcing (Xu et al. 2016) suggested that the responses to individual forcing are linearly additive (e.g., for the warming vertical profile of temperature change).

2.5 Two approaches of pattern scaling projections: hybrid-pattern and single-pattern

Using a simple energy balance model (Ramanathan and Xu 2010) and the separate radiative forcing of GHGs and aerosols (van Vuuren et al. 2011, shown in Figure S1.c and d), we can calculate the contribution of GHGs and aerosols to GMST change separately (ΔT_GHGs and ΔT_aerosols).

For the “hybrid-pattern” approach, the spatial distributions of climate change can be obtained by combining both ΔT_GHGs × P_GHGs (Fig. 1c) and ΔT_aerosols × P_aerosols (Fig. 1d). Alternatively, we also emulated the climate change projection by calculating ΔT_All × P_RCP8.5 (Fig. 1a), which is the conventional “single-pattern” approach. The performance of both approaches can be evaluated against the direct model output from the same climate model, both regarding pattern correlation and absolute errors (root mean square of errors, RMSE). The evaluation results are shown in Section 3.2.

3 Results

3.1 Scaling patterns due to GHGs vs. aerosols

The ensemble mean scaling patterns for the percentile change of precipitation derived from RCP4.5 and RCP8.5, using the traditional, single-pattern approach, are shown in Fig. 1a, b. Over most of the globe, especially in the high latitude regions where the internal variability of rainfall is smaller, precipitation changes are significant (dotted regions in Fig. 1), satisfying the test that at least 80% of ensemble members show the same sign of change (Feng and Fu 2013).

The precipitation change pattern is characterized by a tropical oceanic rainfall enhancement and subtropical rainfall suppression, although considerable differences exist at the regional scale (e.g., North Africa, Arabian Sea). It is unclear why the relative precipitation changes are larger over Sahara regions in RCP8.5 (>40%/°C) than those in RCP4.5 (other studies have found larger changes under RCP2.6, Tebaldi and Arblaster 2014). This prominent difference could be simply due to the extremely dry base climate (as a close-to-zero denominator for the percent change). Indeed, we note that the absolute change (mm/day/°C) over this dry region is not significantly larger in RCP4.5 (Figure S2).

The high similarity between RCP8.5 (Fig. 1a) and RCP4.5 (Fig. 1b), with the pattern correlation coefficient at 0.93, supports the notion of adopting a single scaling pattern across scenarios. Even when computing the pattern correlation coefficient using all data points rather than only those with significant changes, its value remains as high as 0.90. In contrast, the scaling patterns due to GHGs only (Fig. 1c) and aerosols only (Fig. 1d) are remarkably different, with a low pattern correlation coefficient at 0.34 (or 0.32 if calculated using all data points). Changes with opposite signs occur in many places (northern South America, northern and southern Africa, and Australia). The most notable difference is the northward shift of tropical rainfall in the aerosol case. The physical mechanisms of precipitation change due to GHG changes have been discussed by many previous studies (Allen and Ingram 2002; Held and Soden 2006; Richter and Xie 2008). The spatial heterogeneity of precipitation response to aerosols arises from the asymmetric nature of the forcing that is stronger over Northern Hemisphere (Chung and Seinfeld 2005; Shindell et al. 2012; Hwang et al. 2013; Ocko et al. 2014). Note that the normalized process in Fig. 1 effectively reversed the sign of aerosol cooling so that the Northern Hemisphere features a stronger warming. The pattern correlations between patterns due to GHGs only (Fig. 1c) and those obtained from RCP8.5/RCP4.5 (Fig. 1a, b) are also high (0.94–0.95), because both RCP8.5 and RCP4.5 scenarios are primarily driven by GHG changes, as we will examine closely later.

Figure 2 shows the scaling patterns of PET over land from the different scenarios. The pattern correlation between the patterns derived from GHGs and RCP8.5/RCP4.5 is high (0.99/0.98). In these three scaling patterns, relative PET increases in high latitudes more than low latitudes (as found in Scheff and Frierson, 2014; Fu and Feng 2014; Lin et al. 2015a). But the scaling pattern in response to aerosols forcing (Fig. 2d) shows larger changes in South America, Europe, North Africa, and East Asia, with a weak decrease in India and most of sub-Sahara Africa. The pattern correlation between GHGs and aerosols is 0.90–0.92. The warmer temperature in response to higher GHG levels tends to increase PET due to the larger vapor pressure deficit and also the change in Clausius–Clapeyron slope (Scheff and Frierson, 2014). In contrast, aerosols can impact PET strongly through reducing net shortwave radiation reaching the surface in addition to the temperature effect (Lin et al. 2015b). Indeed, the scaling patterns for surface air temperature are similar to those for PET in the GHG case (Figure S3.c), but less so in the aerosol case (Figure S3.d). All pattern correlations among GHGs, RCP4.5, and RCP8.5 are above 0.96, while these patterns are relatively distinct from the aerosol pattern (pattern correlations: 0.85–0.92), which features much stronger changes over Northern Hemispheric land.

Fig. 2
figure 2

The same as Fig. 1 except for PET over land. The scaling pattern for temperature is similar (Figure S3) because the temperature is a dominating factor for PET along with surface radiation budget. Note “GHGs only” (c) and “aerosol only” (d) scaling patterns are obtained from RCP8.5 and RCP8.5_FixA simulations

From Figs. 1 and 2, the pattern obtained from only one emission scenario (“single-pattern” approach) has mixed influences of both GHGs and aerosols, but with predominant contributions from GHGs. This implies that a “single-pattern” approach can be applicable when the global mean aerosol loading changes are small. On the other side, a “single-pattern” approach would be problematic if aerosols lead to considerable forcing contribution in future climate change. Next, we examine the usefulness of combining GHGs and aerosols scaling patterns (“hybrid-pattern” approach) in emulating future climate projections, which has been suggested by the previous studies (Mitchell et al. 1999; Ishizaki et al. 2014).

3.2 Evaluating the pattern scaling projection: hybrid- vs. single-pattern approach

Figure 3a shows the CESM1 projected change pattern in precipitation for 2040–2049 relative to the baseline (1985–2005) under RCP4.5, as the reference for evaluating two pattern scaling projection approaches. Note that the changes are scaled by global mean temperature change during this period (1.3°C) to focus on the spatial pattern. The pattern correlation between CESM1 direct output and “hybrid-pattern” approach (Fig. 3b) is 0.96, higher than that between CESM1 and “single-pattern” approach (Fig. 3c, 0.94). Notably, there was a drying band (reduction in the zonal average of precipitation) around 10°N–30°N in the RCP4.5 simulations, which was well captured in the “hybrid-pattern” approach, but this feature was largely missing in the “single-pattern” approach (see the zonal average plot in Fig. 3c). Similar results are also found if the absolute changes of precipitation (mm/day) are considered (Figure S4.c).

Fig. 3
figure 3

(Left) Projected changes in precipitation (%/°C) at 2040s relative to 1985–2005 under RCP4.5, with the zonal average shown on the right-hand side of the map. a CESM1 direct output as the “Reference” for the evaluation. b Projection using the “hybrid-pattern” approach. c Projection using the “single-pattern” approach. All values were normalized by the change of GMST during this period (about 1.3°C). The two numbers at the top right showed the pattern correlation between a and b and between a and c. The significance of changes was assessed using the same method as in Figs. 1 and 2. (Right) Same as the left, except for PET over land. The pattern correlation of PET maps seems lower (0.82/0.78) because they are calculated only using values over major land regions (−60°S–85°N), while the pattern correlation of P on the left (0.96/0.94) are based on global values. If P pattern correlation were calculated only using land regions, they would also be lower (0.86/0.82)

For PET, the “hybrid-pattern” approach also demonstrated a better performance in emulating mid-century changes than the “single-pattern” approach (0.82 and 0.78 in pattern correlations, respectively). However, the “hybrid-pattern” approach underestimated the projected PET increase in Europe compared to the “single-pattern” approach (Fig. 3d–f).

Note that the procedure of significance test is affected by the internal variability (i.e., model ensemble spread) at the regional scale. Consequently, Fig. 3c (“single-pattern” approach) appears to have more dotted regions on it, because “single-pattern” approach used all 30 ensemble members of RCP8.5 simulations (Kay et al. 2015), and the internal variability is suppressed to a greater extent than those in Fig. 3b (using 15 available ensemble members of RCP8.5-FixAerosol as in Xu et al. (2015)) and Fig. 3a (using 15 ensemble members as in Sanderson et al. (2015)).

As for the changes throughout the twenty-first century, with the GHG forcing growing with time, the pattern of external responses increasingly overwhelms the effects of internal variability, which lead to the pattern correlations of both two methods approaching to unity at the end of twenty-first century (Figure S5, left). The main reason that the correlations increase with time and the RMSE decrease with time is that the signal-to-noise ratio becomes higher (Mitchell et al. 1999).

The benefit of using the “hybrid-pattern” approach is quantitatively examined next. Firstly, the pattern correlation (Figure S5, left) and the globally averaged RMSE (Figure S5, right) of temperature, precipitation, and PET between the two pattern scaling approaches and the direct model output for RCP4.5, RCP6.0, and RCP2.6 were calculated. Note that PET is only calculated over the land. Thus, it has smaller pattern correlation and larger RMSE compared to precipitation, due to a larger variability over the land. Secondly, the ratios of the “hybrid-pattern” approach and “single-pattern” approach are shown in Fig. 4, the projected patterns from the “hybrid-pattern” approach are overall better than the “single pattern” approach, indicated by a larger-than-one ratio for pattern correlation and smaller-than-one ratio for RMSE. The difference between the two approaches in the near-term (the 2020s, 2030s, and 2040s) is larger when the aerosol forcing makes a considerable fraction of the total forcing (Figure S1.e). The ratio approaches unity in the late twenty-first century, which implies that the “single-pattern” approach is more suitable for the long-term climate projection when changes are dominated by GHG effects.

Fig. 4
figure 4

(Left) The ratio of “hybrid-pattern” approach and “single-pattern” approach, in the centered pattern correlation between each pattern-scaling approach and the direct GCM output (Figure S5). The larger-than-one ratio indicates a better performance of “hybrid-pattern” approach. Top row: temperature; middle row: precipitation; and bottom row: PET. (Right) Same as the Left but for globally averaged RMSE. The smaller-than-one ratio indicates a better performance of “hybrid-pattern” approach

The emulated results using the “hybrid-pattern” approach for RCP2.6 are not always better than those under RCP4.5 or RCP6.0, as one may expect from the fact that the aerosol forcing is relatively larger in RCP2.6 (Figure S1.e). This is probably because only three ensemble members were available for RCP2.6 and RCP6.0 as the benchmark, which is more subject to the influence of internal variability. This larger influence for small ensemble in RCP2.6 and RCP6.0 can be readily seen from the larger decadal fluctuation of blue and green lines in Fig. 4 (and Figure S5), as opposed to the rather monotonic trends of RCP4.5 (black lines).

In summary, although the “single-pattern” approach demonstrates slightly lower RMSE in precipitation (Figure S5.d), the “hybrid-pattern” approach performs better in obtaining the zonally averaged change (Fig. 3). The “single-pattern” approach can be appropriately used to project the long-term changes under standard RCPs since all these scenarios assume a sharp decrease in aerosol emission toward the end of the century. But the “single-pattern” approach may be biased in scenarios with a significant difference in the composition make-up (ratio of GHG vs. aerosol forcing). These scenarios include the near-term climate projection for the twenty-first century as we discussed here, but would also be the relevant ones in geoengineering studies that consider different methods (e.g., carbon capture vs. solar radiation management; Tilmes et al. 2016).

4 Concluding remarks

The first objective of this study was to utilize a pattern-scaling approach for future projection of PET, which is not discussed in the previous literature. This approach was shown to produce reasonably good emulations of GCM projection, as expected, considering that PET changes are governed by temperature changes to a large extent.

This study also aimed at characterizing the robustness of pattern-scaling methods in the presence of aerosol forcing, thus addressing one of the challenges that pattern-scaling methods are commonly thought of facing, i.e., the characterization of a signal from scenarios containing significant sources of spatially non-homogenous forcings. Using a pair of CESM1 simulations under RCP8.5 scenarios with and without aerosol changes, we separately obtained the scaling patterns of temperature, precipitation, and PET due to GHGs and aerosols. We used those two patterns simultaneously with GMST changes from a simple energy balance model to emulate future changes of temperature, precipitation, and PET in three other RCP scenarios. Our approach used a large ensemble (15 to 30 members) as opposed to some earlier studies (e.g., three ensemble members in Ishizaki et al. 2013). The scaling pattern is obtained based on regression of each ensemble members, despite that all results are presented as the ensemble average. The variability between ensemble members are often at the regional (sub-continental) scale, and thus, the pattern correlation is high between ensemble members This ensemble spread due to natural variability is utilized to provide statistical significance test, which allows us to better identify the signal of forced change.

We found that considering the influence of aerosols separately increases the pattern correlation in temperature, precipitation, and PET and decreases the global RMSE in temperature and PET, hence, justifying the usefulness of a “hybrid-pattern” approach. The assumption that the scaling pattern remains stable across all emission scenarios (“single-pattern” approach) cannot be reliably applied if the change of aerosol forcing is large, for example, in the next few decades as expected. For regional climate change at the end of twenty-first century, however, the “single-pattern” approach provides reasonably good emulation, because aerosol influence gradually diminishes through the century and becomes negligible by its end. Of course, the benefit of adopting a “hybrid-pattern” approach comes at the cost of conducting additional GCM simulations to obtain GHG and aerosol patterns separately. On this note, presumably, an even more accurate (but also more computationally expensive) approach is to separate the climate change response pattern due to aerosol emissions from individual continents (e.g., Europe vs. Asia), as proposed by Hulme et al. (2000).

Note that due to internal variability, the obtained response pattern to GHG and aerosol forcing can vary greatly from model to model. Therefore, one caveat of the current analysis is that we only used a single set of parameters in the EBM (fitted to a single climate model, Figure S1), and this approach failed to account for the model uncertainty in the climate sensitivity and potentially regional pattern. We are now working to improve this aspect by including CMIP5 model output into a future pattern-scaling analysis on extreme weather events, with the problem being that not many of them provide aerosol forcing-only simulations. Given the increasing availability of computational resources, the model dependence of aerosol-induced climate responses should be further tested within a multi-model framework, to examine whether the findings of this study are robust.