1 Introduction

Interannual variations in climate are of great practical importance. In particular, extended periods of anomalously hot or cold weather have large impacts on nature and society. Recent prominent examples include the heat waves in central Europe in 2003 (García-Herrera et al. 2010) and in Russia in 2010 (Barriopedro et al. 2011), and the cold winter in eastern North America in 2013–1014 (Yu and Zhang 2015). Nonetheless, such extremes are just the tip of the iceberg within an omnipresent continuum of temperature variability, the magnitude of which depends on both the season and the location. The largest interannual temperature variability is observed over ice-covered oceans and high-latitude continents in winter, whereas the variability over the low-to-mid-latitude oceans is relatively muted outside of the eastern Tropical Pacific (Holmes et al. 2016; see also Figs. 3a, and 4a, b).

A fraction of interannual temperature variability is driven by external forcing such as major volcanic eruptions (Robock 2000; Paik and Min 2017). However, most of it results from the chaotic internal dynamics of the climate system: the variations in atmospheric and oceanic circulation, and the resulting perturbations in sea and land surface conditions. The influence of the oceans is largest at low latitudes, where the atmospheric circulation and temperatures are strongly controlled by the distribution of sea surface temperature (SST) (Wells 2012; Holton and Hakim 2012). In particular, the El Niño—La Niña variability in the eastern-to-central equatorial Pacific SSTs generates atmospheric teleconnections that profoundly affect the climate all around the tropics but to some extent also in extratropical latitudes (Diaz et al. 2001; Yang and DelSole 2012). However, the relative impact of SST variability decreases and that of internal atmospheric dynamics increases towards higher latitudes (Zwiers and Kharin 1998). The interannual SST variability over the extratropical oceans is strongly regulated by variations in the atmospheric circulation, whereas the ocean’s effect on the extratropical atmosphere is more subtle (Bjerknes 1964; Deser and Blackmon 1993). Nevertheless, there is evidence that the ocean plays a more active role in generating atmospheric variability on decadal than interannual time scales (Kushnir 1994).

Although ultimately driven by atmospheric and oceanic circulation, variations in near-surface temperature are modulated by feedbacks that affect the atmospheric and surface energy budget. For example, both reduced cloudiness (which increases the absorption of solar radiation) and reduced soil moisture (which decreases the evaporative cooling of the surface) have been identified as important ingredients in European heat waves (Black et al. 2004; Fischer et al. 2007). Consistent with both mechanisms, the correlation between monthly temperature and precipitation is widely negative over midlatitude continents in summer and in tropical land areas (Trenberth and Shea 2005). As another example, Park et al. (2015) used the climate feedback—response analysis method (Lu and Cai 2009) to explain the temperature differences between winters with a strong and a weak Siberian high. They found that lower temperatures in central Siberia in winters with a strong Siberian high result from a combination of factors, including cold advection, increased surface cooling due to larger sensible heat flux, and weaker greenhouse effect due to reduced water vapour and cloud water content. Hu et al. (2016) used the same method to energetically explain the different distribution of surface temperature anomalies in Eastern and Central Pacific El Niños. Although the heat flux from the ocean was identified as the main cause of surface temperature anomalies in both cases, the larger warming in the Eastern Pacific during the Eastern Pacific El Niños was attributed to a stronger water vapour feedback in this area.

Despite the previous work, a systematic view on the energetics of interannual temperature variability still appears to be lacking. Variations in several factors, among others atmospheric energy transport, surface-atmosphere energy exchange, surface albedo, clouds, and the atmospheric clear-sky greenhouse effect might all be important under at least some circumstances. But how important are they in general, in different parts of the world and in different seasons? This study aims to give at least an initial answer to this question, focusing on the interannual variability of monthly mean temperatures. The study is based on data sets from two modern atmospheric reanalyses (Sect. 2) and an energy balance framework that was earlier used for analysis of model-simulated CO2-induced temperature changes by Räisänen (2017; hereafter R17) (Sect. 3). The results are reported in Sect. 4, and some aspects of their physical interpretation are discussed further in Sect. 5. The main conclusions are presented in Sect. 6.

2 Data sets

Data from the ERA-Interim (Dee et al. 2011) and MERRA2 (Gelaro et al. 2017) reanalyses for the years 1980–2016 are used. The variables required by the energy balance decomposition include surface air temperature, total cloudiness, surface latent and sensible heat fluxes, and surface and top-of-the-atmosphere (TOA) downward and upward short-wave (SW) and long-wave (LW) radiative fluxes for both all-sky and clear-sky conditions (Table 1 in R17). These variables were downloaded as monthly means in a 2.5° × 2.5° latitude–longitude grid.

Table 1 Correlation of ∆T and its main energy balance components between the ERA-Interim and MERRA2 reanalyses

For ERA-Interim, surface pressure and six atmospheric variables (u and v wind, vertical velocity ω, temperature, geopotential and specific humidity) at 37 pressure levels were additionally downloaded at 0.75° × 0.75° horizontal resolution and 6-h time interval. This large (2.8 TB) data set was used for explicit calculation of the atmospheric energy flux convergence term (Sects. 3.2, 4.5 and Appendix A) that was inferred as a residual in the other parts of the analysis.

The suitability of reanalysis data sets for energy budget analysis might be questioned because reanalyses violate energy conservation (e.g., Trenberth and Fasullo 2013) and show spurious large-scale trends associated with changes in the observing system (Allan et al. 2014). However, because the focus in this study is on interannual climate variability, the energy budget biases only matter to the extent that they vary from year to year. We assessed this issue in two ways, by analyzing the analysis increments in MERRA2 and by studying the mutual agreement and differences between ERA-Interim and MERRA2. The analysis increments were found to be large, but their impact on our main diagnostic results is moderated by their relatively weak correlation with the actual temperature anomalies (Section S1 in the Supplementary material). Furthermore, ERA-Interim and MERRA2 give a largely consistent view on the energetics of interannual temperature variability, although there are in many cases substantial quantitative differences between these two reanalyses (Sect. 4.1).

3 Methods

This section first describes the main features of the R17 energy balance method and its application to the interannual variability of monthly mean temperatures. After this, the methods used in the explicit calculation of atmospheric energy flux convergence in the ERA-Interim reanalysis are summarized. They are described in more detail in Appendix A.

3.1 Energy balance framework

The R17 method is built around the concept of effective planetary emissivity εeff, which connects the surface air temperature T to the outgoing longwave (LW) radiation L at the TOA

$$L={\varepsilon _{eff}}\sigma {T^4}$$
(1)

and is (in broad terms, see Sect. 4.4) an inverse measure of the atmospheric greenhouse effect. Thus, warm anomalies in T require either a negative anomaly in εeff, a positive anomaly in L, or both. Combining (1) with the atmospheric energy budget equation gives

$${\varepsilon _{eff}}\sigma {T^4}=S - G+\left( {C - \frac{{\partial E}}{{\partial t}}} \right)$$
(2)

where S is net SW radiation at the TOA, G net downward heat flux to the surface, C horizontal energy flux convergence in the atmosphere, and E the total energy in the atmospheric column.

Referring to the climatological monthly mean of variable X as XCLIM, the anomaly is ∆X = X − XCLIM. After also defining [X] = (X + XCLIM)/2, (2) leads to

$$\sigma \left[ {{\varepsilon _{eff}}} \right]\Delta ({T^4})=\underbrace {{ - \sigma \Delta {\varepsilon _{eff}}[{T^4}]}}_{I}~+\underbrace {{\Delta S}}_{{II}}\underbrace {{ - \Delta G}}_{{III}}+\underbrace {{\Delta \left( {C - \frac{{\partial E}}{{\partial t}}} \right)}}_{{IV}}$$
(3)

Finally, linearizing the left side of (3) as

$$\sigma \left[ {{\varepsilon _{eff}}} \right]\Delta ({T^4}) \approx 4\sigma \left[ {{\varepsilon _{eff}}} \right]{\left[ T \right]^3}\Delta T=D\Delta T$$
(4)

allows one to decompose the temperature anomaly ∆T as

$$\Delta T=\underbrace {{LW}}_{I}~+\underbrace {{SW}}_{{II}}+\underbrace {{SURF}}_{{III}}+\underbrace {{CONV}}_{{IV}}+ERR$$
(5)

where the terms IIV in (3) have been divided by \(D=4\sigma [{\varepsilon _{eff}}]{[T]^3}\). These four terms represent the temperature anomalies due to LW and SW radiation, net surface energy flux, and atmospheric energy flux convergence minus storage. On the average, D ≈ 3.3 Wm−2 K−1, so that a 1 Wm−2 energy perturbation is typically equivalent to 0.3 K in temperature.

The linearization in (3) is performed around (T + TCLIM)/2 rather than TCLIM. This makes the linearization residual ERR very small, with a mean absolute value of less than 10−3 K. On the other hand, variations in D allow the means of LW, SW, SURF and CONV to differ from zero when averaged over the whole period. Nevertheless, their mean values are small relative to their interannual variability that is the focus of this paper.

The terms LW and SW are further divided to two and five parts, respectively

$$LW=L{W_{CLEAR}}+L{W_{CRE}}$$
(6)
$$SW=S{W_{IN}}+S{W_{CLEAR - ATM}}+S{W_{ALBEDO}}+S{W_{CLOUD}}+S{W_{NL}}$$
(7)

In (6), LWCLEAR is the temperature anomaly attributed to the clear-sky greenhouse effect (anomaly of εeff under clear-sky conditions) and LWCRE that due to the long-wave cloud radiative effect. The division (7) is based on the approximate partial radiative perturbation (APRP) method (Taylor et al. 2007). The five terms represent the SW radiation anomalies associated with incoming SW radiation (SWIN), SW radiative properties of the clear-sky atmosphere (SWCLEAR–ATM), surface albedo (SWALBEDO), clouds (SWCLOUD), and nonlinear effects (SWNL). Different notations are used for the two cloud terms (LWCRE and SWCLOUD) because of the difference in their way of calculation. LWCRE is based directly on the anomaly in the cloud radiative effect, which may be affected by variations in the clear-sky radiative properties of the atmosphere in addition to those in clouds. By contrast, SWCLOUD attempts to isolate the effect of cloud anomalies on the SW radiation budget by explicit although highly simplified modelling of the radiative transfer. For further details, see R17.

The focus in this paper is on interannual variability. To separate this from long-term climate change, all the anomalies were linearly detrended before the energy budget decomposition. Conversely, XCLIM as given above Eq. (3) was defined by the least-square trend line fitted for each calendar month separately.

3.2 Direct calculation of the convergence term

For most parts of the analysis, CONV in (5) was calculated from the difference of the net surface and TOA energy fluxes. This is straightforward but offers no information on the mechanisms that contribute to CONV. Therefore, we also estimated CONV directly from ERA-Interim data. In practice, the calculation of energy flux convergence was replaced by calculation of three-dimensional energy advection in the interest of numerical accuracy (Appendix A). However, because the convergence and advection forms are physically equivalent, the word “convergence” will be used when discussing the results.

The resulting direct estimate for CONV is

$$CON{V_{DIR}}=CON{V_{MON}}+CON{V_{SM}}+STOR$$
(8)

Here CONVMON denotes the temperature anomaly attributed to the energy flux convergence by the monthly mean flow, whereas CONVSM results from sub-monthly covariation between winds and atmospheric energy content. STOR represents the change in the total atmospheric energy content, being positive when the energy content anomaly decreases from the beginning to the end of the month (term IV in (3)).

4 Results

To introduce the method, Fig. 1 depicts time series of January and July mean temperature anomalies in central Finland (62.5°N, 25°E) and their decomposition to the main energy budget contributions, separately for the two reanalyses. Temperature variability at this location is much larger in January than July (standard deviation ~ 4 vs. ~ 1.5 °C), and the energy contributions to the variability are also partly different. In January, LWCLEAR, CONV and to a slightly smaller extent CLOUD are the main drivers of variability, with positive values in most of the mild Januarys and negative values in most of the cold Januarys. LWCLEAR and CLOUD also act to amplify temperature variability in July, but CLOUD is more important than LWCLEAR particularly in ERA-Interim. By contrast, CONV mostly opposes the actual temperature anomalies in July. The same applies to SURF in both January and July, since the anomalous net surface energy flux is directed from the atmosphere to the ground in most anomalously warm months and vice versa in anomalously cold months.

Fig. 1
figure 1

Linearly detrended temperature anomalies in central Finland (62.5°N, 25°E) in January and July 1980–2016 (solid lines) and the contributions of individual energy balance terms to them (bars, legend at the bottom). For reference, the mean and the 36-year linear trend of temperature are given in the top-right corner of the figure panels

SWALBEDO is excluded from Fig. 1 because it is negligible in both January (due to lack of solar radiation) and July (when the surface is always snow-free). SWCLEARATM is also generally small, but is substantially negative in MERRA2 after the Mt. Pinatubo eruption in July 1992 and 1993 (Fig. 1d). This feature is lacking from ERA-Interim, which uses prescribed climatological aerosol distributions that vary seasonally but not from year to year, and thus excludes the Pinatubo eruption (Dee et al. 2011; Allan et al. 2014). In MERRA2, by contrast, aerosols are simulated explicitly based on emissions that vary from year to year, and observations of aerosol optical depth are assimilated into the analysis (Randles et al. 2017).

The time series from the two reanalyses agree well on the interannual temperature variations. Apart from SWCLEARATM, the same qualitatively applies to the energy balance contributors to this variability. However, quantitative differences are apparent. For example, in some Julys CLOUD and CONV differ by several °C between ERA-Interim and MERRA2, but to opposite directions. Recall that CONV is derived from the difference of the surface and TOA net energy fluxes and any reanalysis-specific errors in these fluxes are therefore directly mirrored in it.

4.1 Magnitude of the terms and the agreement between the two reanalyses

The global importance of the energy balance components is characterized in Fig. 2 with two statistical measures: (1) their interannual standard deviation, and (2) their contribution to the interannual standard deviation of temperature. The latter is calculated as.

Fig. 2
figure 2

Typical magnitudes of the terms in Eqs. (5)–(7) in the ERA-Interim and MERRA2 reanalyses. The first column shows the interannual standard deviation (SD) of monthly temperature anomalies (∆T) averaged over the 12 months and the global area. The remaining columns show the corresponding SDs of the energy balance terms (blue) and their contribution to the standard deviation of ∆T (SDC, red). The numeric values at the bottom give the SDs (upper) and SDCs (lower) in units of 0.01 K

$$SDC(i)=r(i)SD(i)$$
(9)

where SD(i) is the standard deviation of term i and r(i) is the correlation between term i and temperature. Using the definition of correlation, one can show that the SDCs sum up to the interannual standard deviation of temperature:

$$\mathop \sum \limits_{i} SDC(i)=SD(\Delta T)$$
(10)

For Fig. 2, both the SDs and SDCs were first calculated for each month and grid box and then averaged over the 12 months and the global area, so to characterize the general behaviour of the terms. SWIN and SWNL are both very small, with SD < 0.1 K, and will therefore not be discussed further. Conversely, SURF and CONV are very large, with SD ≈ 5 K. However, as discussed in Sect. 4.2, they turn out to have a strong mutual cancellation particularly over the oceans. LWCRE and SWCLOUD are also large, SWCLOUD being the larger. Unsurprisingly, however, there is also some compensation between LWCRE and SWCLOUD. We will therefore mainly study their sum, denoted as CLOUD, in the rest of this paper. Although smaller than SD(SWCLOUD), SD(CLOUD) is also substantial (9th column of Fig. 2a, b). Of the remaining terms, LWCLEAR is of similar magnitude with the actual monthly mean temperature anomalies, whereas SWCLEAR–ATM and SWALBEDO are relatively small on the average.

The largest average contributors to the standard deviation of ∆T are, in this order, CONV, LWCLEAR and CLOUD (red bars in Fig. 2). The average SDCs of \(L{W_{CRE}}\) and \(S{W_{CLOUD}}\) are both positive in MERRA2 but the former is slightly negative in ERA-Interim. On the other hand, the net surface heat flux (SURF) has a strong tendency to reduce interannual temperature variability. This is particularly the case over the extratropical oceans (Sect. 4.2).

The globally averaged SDs and SDCs are generally similar between the two reanalyses. The largest differences occur in the SD and SDC of CLOUD and its two components, SDC(SURF) and SDC(CONV) (recall that CONV is a residual). In addition, SD(SWCLEAR-ATM) is twice as large in MERRA2 than in ERA-Interim. This is consistent with the already mentioned difference in the treatment of aerosols.

To further quantify the agreement between ERA-Interim and MERRA2, the correlation coefficients between the two reanalyses were calculated for (1) the full space–time interannual variability of ∆T and its energy balance components during the 37-year period, and the space–time variability in the (2) SDs and (3) SDCs over the global area and the 12 calendar months (Table 1). All three correlations are strongly positive for ∆T (> 0.9) and most of the major energy balance components, particularly LWCLEAR, SURF and CONV (≥ 0.85). However, the correlations for CLOUD are somewhat lower, and the difference in the treatment of aerosols strongly deteriorates the agreement on SWCLEAR–ATM. Maps of the inter-reanalysis differences in the SDCs are shown in Fig. S2. Typically, the differences on the grid box scale are about 10% of the two-reanalysis mean for the temperature anomalies, 25% for LWCLEAR, of the order of 40% for SURF and CONV, and between 60 and 100% for SWCLEARATM, SWALBEDO and CLOUD.

It seems obvious that MERRA2 provides more realistic estimates of SWCLEARATM than ERA-Interim. For the other terms, the relative performance of the two reanalyses is more difficult to assess, although some insight might be gained from comparison with satellite data (e.g., Loeb et al. 2018) and other observational data sets. In the figures shown in the rest of this paper, we will simply average the statistics derived from the two reanalyses to emphasize their common features. Selected maps for ERA-Interim and MERRA2 separately are included in the Supplementary material (Figs. S3, S4, S6, S7 and S12, S13).

4.2 Geographic variability

The first column in Fig. 3 shows the SDs of ΔT and its main energy balance components, averaged over the 12 months and the two reanalyses. The corresponding SDCs are displayed in the third column, with the grey shading indicating areas where their sign is not robust. The SDC is considered robust if it has the same sign in the two reanalyses, and differs in at least one of them from zero at the 5% significance level based on a two-sided sign test (Appendix B). The SDs and SDCs are connected by the correlation between the individual energy balance terms and temperature, shown in the middle column. Following (9), the “average” correlation is defined here by dividing the average SDC by the average SD.

Fig. 3
figure 3

Left: interannual standard deviation of monthly mean temperature anomalies (∆T) and their main energy balance components. Middle: Correlation between the individual energy balance components and ΔT. Right: Contributions of the individual energy balance components to the standard deviation of ΔT. All statistics are averaged between ERA-Interim and MERRA2. In the third column, grey colour indicates areas where the sign of the standard deviation contribution is not robust (see Sect. 4.2 for definition)

Interannual temperature variability is generally larger at high than low latitudes and over the continents than over the oceans (Fig. 3a). However, the Arctic Ocean and, relative to its latitude, the tropical East Pacific also stand out with large variability. The SD patterns for the individual energy balance components are variable (left column of Fig. 3). For example, SD(SWALBEDO) is small in most areas, but locally large where interannual variations in sea ice and snow cover are substantial: the margins of the Arctic Ocean, off the coast of Antarctica, and in the Northern Hemisphere extratropical continents, notably the Tibetan Plateau (Fig. 3h). By contrast, SD(CLOUD) is large (1–4 K) nearly everywhere, but smaller over the Arctic Ocean, Greenland, Antarctica, and the deserts extending from Sahara to central Asia (Fig. 3k). An inspection of LWCRE and SWCLOUD separately (Fig. S5) suggests two main explanations for the relatively small magnitude of SD(CLOUD) in these areas: lack of optically thick clouds (over deserts and ice sheets), and/or limited sensitivity of the TOA radiation balance to clouds where modest insolation (in polar regions in most of the year) and/or high surface albedo (over ice sheets and sea ice) make it easier for LWCRE to offset SWCLOUD.

SD(SURF) and SD(CONV) are both very large over the oceans (Fig. 3n, q), exceeding 8 K in many areas mainly in the extratropics. Their patterns are very similar, which results from a strong mutual compensation. This compensation reflects, on one hand, the ability of the ocean to absorb large amounts of heat with only modest changes in the surface temperature, and on the other hand, the tendency of the atmospheric circulation to horizontally spread the effects of local energy input over a larger area (Sect. 5). SD(CONV) is also large over the continents, generally in the range 1–4 K, with the largest values at mid-to-high latitudes. By contrast, SD(SURF) is < 1 K in most land areas, due to the modest heat capacity of the land surface. The main exception are the northern parts of Eurasia and North America, where variations in the energy consumed by snowmelt amplify the variability in the net surface heat flux in winter and spring.

How much a given energy term amplifies or attenuates temperature variability is affected by both its standard deviation and its correlation with temperature anomalies (Eq. (9)). A case in point is LWCLEAR, which has a strong positive correlation (> 0.7) with ΔT in most extratropical areas, but a weaker or locally negative correlation with ΔT in much of the tropics (Fig. 3c). This makes SDC(LWCLEAR) less positive in most of the tropics than at higher latitudes (Fig. 3d), although SD(LWCLEAR) is also large in the tropics (Fig. 3b). An exception with large SDC(LWCLEAR) is the equatorial East Pacific, where LWCLEAR is both highly variable and highly correlated with temperature. The interpretation of LWCLEAR is discussed in more detail in Sect. 4.4.

SWCLEARATM is positively correlated with ΔT in most regions, particularly at mid-to-high latitudes (Fig. 3f). This indicates a positive SW water vapor feedback due to a positive correlation between temperature and atmospheric water vapor, which leads to larger water vapor absorption of SW radiation in months with positive temperature anomalies. However, since SD(SWCLEARATM) is relatively small, this term makes a fairly modest contribution to interannual temperature variability (Fig. 3e, g).

Where SD(SWALBEDO) is substantial, this term is positively correlated with ΔT, because warm anomalies typically coincide with negative anomalies in snow and ice cover (Fig. 3h, i). However, there are also areas where this correlation is negative. In particular, the negative correlation over Antarctica reflects a positive correlation between temperature and snowfall: higher snowfall during anomalously warm summer months counteracts the ageing of snow, thereby slightly increasing the surface albedo (Picard et al. 2012). This feature is more pronounced in ERA-Interim than in MERRA2 (Figs. S3, S4).

CLOUD is also positively correlated with ΔT in most areas, and therefore generally acts to amplify temperature variability (Fig. 3l, m). Exceptions with a slightly negative correlation include, among others, eastern tropical Pacific and parts of the Southern Ocean. The physical interpretation of CLOUD is discussed in some more detail in Sect. 4.3.

SURF and CONV strongly oppose each other over the oceans. In the tropics, particularly over the equatorial East Pacific, SURF is large in magnitude and positively correlated with ΔT, and is thus strongly driving anomalies in surface air temperature (Fig. 3o, p). However, in the same areas, CONV strongly damps the temperature variability, effectively diffusing out the impact of the local surface heat flux anomalies (Fig. 3r, s). Over most of the mid-to-high-latitude oceans, the roles of SURF and CONV are reversed, with the atmospheric heat convergence strongly driving but the net surface heat flux strongly attenuating the temperature variability. This picture of mainly ocean-driven temperature variability over the tropical and atmosphere-driven variability over the extratropical oceans is consistent with a large number of earlier studies (e.g., Bjerknes 1964; Deser and Blackmon 1993; Wu and Kirtman 2007).

Over nearly all land areas, the variation in the net surface heat flux acts to reduce the interannual temperature variability (Fig. 3o, p). This effect is modest but not negligible: as averaged over the 12 months and all land, SDC(SURF) = − 0.44 K, or 30% of the corresponding mean of SD(ΔT) = 1.47 K. Conversely, SDC(CONV) is positive over most land areas (Fig. 3s). The correlation between CONV and ΔT is mostly not very strong (Fig. 3r), but exceeds 0.7 over large parts of the Greenland and Antarctic ice sheets and 0.9 over East Antarctica. The high correlations in Greenland and Antarctica seem to be linked to the relatively modest interannual variability in the other energy balance terms over these ice sheets (left column of Fig. 3). On the other hand, CONV attenuates interannual temperature variability in northern South America and some other low-to-midlatitude land areas. The physical interpretation of CONV is explored in more depth in Sect. 4.5.

4.3 Seasonality

We next discuss the seasonality of the six main energy terms included in Fig. 3, focussing on their SDCs. Comparison between extended Northern Hemisphere winter (November-to-March, NDJFM) and summer (May-to-September, MJJAS) seasons reveals several differences (Fig. 4).

Fig. 4
figure 4

Interannual standard deviation of monthly mean temperature anomalies in a November–March and b May–September, and cn the contributions of the main energy balance components to it. All statistics are averaged between ERA-Interim and MERRA2. Grey colour indicates areas where the sign of the standard definition contribution is not robust (see Sect. 4.2 for definition)

  1. 1.

    Temperature variability in extratropical latitudes is larger in the local winter than the summer season (Fig. 4a, b). The same applies to SDC(LWCLEAR) (Fig. 4c, d).

  2. 2.

    SDC(SWCLEARATM) is largest over the summer hemisphere ice sheets, in Greenland in NDJFM and in Antarctica in MJJAS (Fig. 4e, f).

  3. 3.

    The seasonality of SWALBEDO reflects the seasonalities of snow and ice cover and incoming solar radiation. Accordingly, in the Northern Hemisphere continents, SWALBEDO is mainly important in midlatitudes in winter but in the Arctic in summer (Fig. 4g, h). Near the sea ice edge, SWALBEDO is mainly important during the local summer.

  4. 4.

    SDC(CLOUD) is typically more positive during the local summer than winter (Fig. 4i, j), particularly in the midlatitudes. This is due to SWCLOUD, which strongly amplifies the temperature variability in the midlatitudes in summer, when solar radiation is abundant and reduced cloudiness therefore tends to increase temperature (Fig. S8). In winter, the paucity of solar radiation makes SWCLOUD much less important. However, LWCRE also plays a role, attenuating temperature variability when and where temperature is negatively correlated with (particularly high-top) cloudiness, but amplifying the variability when the correlation is positive. The latter is typical at mid-to-high latitudes in winter, as well as in the tropical East Pacific (Fig. S8). In the tropical East Pacific, SWCLOUD and LWCRE nearly cancel out (see also Fig. S10d), but in high latitudes in winter, LWCRE dominates. Over the Arctic Ocean, the high-latitude Southern Ocean, and the Greenland and Antarctic ice sheets, the net effect represented by SDC(CLOUD) in Fig. 4i, j is therefore more positive in winter than in summer.

  5. 5.

    Reflecting the more vigorous extratropical atmospheric circulation and the stronger climatological temperature gradients in the winter hemisphere, SDC(CONV) is more positive and SDC(SURF) more negative over the mid-latitude oceans in winter than in summer (Fig. 4k–n). The seasonality of SDC(CONV) is even more striking over the northern halves of Eurasia and North America, where the atmospheric heat flux convergence strongly amplifies temperature variability in winter but slightly attenuates it in summer. The interpretation of CONV is explored further in Sect. 4.5.

As an example that illustrates the seasonal variation in more detail, the monthly contributions of the main energy balance terms to temperature variability in central Finland (cf. Fig. 1) are shown in Fig. 5. At this location, there is a gradual shift from large CONV- and LWCLEAR-dominated variability in October–March to smaller CLOUD-dominated and CONV-suppressed variability in May–August. SWALBEDO only plays a significant role during the snowmelt season in March–April.

Fig. 5
figure 5

Interannual standard deviation of temperature in central Finland (the same grid box as in Fig. 1) (solid line) and the contributions of the six main energy balance terms to it (bars, legend at bottom; contributions that are not robust in the sense defined in Sect. 4.2 are indicated with a grey core)

To explore the seasonal variation in another way, Fig. 6 identifies for every grid box and every second month of the year the term that provides the largest positive contribution to the standard deviation of temperature in this month. The broad picture over the oceans is seasonally uniform to the extent that SURF tends to make the largest contribution to interannual variability in the tropics and CONV at higher latitudes. However, the border between the CONV- and SURF-dominated zones is further poleward in summer than in winter, particularly in the Northern Hemisphere. LWCLEAR and CLOUD are also important, overriding all the other terms in some months in some ocean regions. CLOUD is more frequently the foremost term in summer than in winter; in particular it dominates the variability over large parts of the extratropical North Pacific and North Atlantic in summer. In addition to the larger insolation, this reflects the weaker midlatitude baroclinicity in summer, which reduces the importance of CONV relative to the winter season. To provide some more detail, diagrams similar to Fig. 5 are shown for six ocean grid boxes (in the Arctic Ocean, extratropical North Atlantic and North Pacific, eastern and western tropical Pacific, and high-latitude Southern Ocean) in Fig. S10.

Fig. 6
figure 6

Largest contributors to interannual temperature variability in six calendar months, based on the mean of the ERA-Interim and MERRA2 results

Over most of the winter hemisphere continents, either CONV or LWCLEAR is the largest contributor to temperature variability. In summer, however, CLOUD is widely dominant in the extratropical continents. CLOUD is also commonly the largest term in tropical land areas, although this varies with month and region. Seasonal cycles of the individual energy terms in six land grid boxes (in Greenland, Siberia, Central Europe, the Tibetan Plateau, Amazonia and East Antarctica) are shown in Fig. S11.

The variations of snow and sea ice conditions make either SWALBEDO or SURF the largest contributor to temperature variability in some months and locations. SWALBEDO has this position in midwinter in parts of the United States and south-central Asia. During the spring, such areas shift northward. In May, in particular, SWALBEDO is the largest term over much of northern Siberia and northernmost North America, as well as the Tibetan Plateau (see also Figs. S11b, d). Due to variations in the ice edge position, SWALBEDO is also locally dominant over the Arctic and Antarctic Oceans in the local spring and summer (see also Figs. S10a, f). Variations in ice conditions also dramatically affect the atmosphere–ocean heat exchange during the cold season (Deser et al. 2010; Petrie et al. 2015). This locally makes SURF the largest contributor to temperature variability near the sea ice edge in late fall and winter, both over the Arctic Ocean and the high-latitude Southern Ocean (see also Figs. S10a, f).

Averaging over all 12 months and the global area, CONV is the largest contributor to variability in 47% of cases, followed by CLOUD (21%), LWCLEAR (16%), SURF (14%), and SWALBEDO (2%). SWCLEARATM only has this position in limited parts of the Antarctic continent in the local summer (0.1%).

To complement the overview provided this far, we next focus on the physical interpretation of two of the major energy terms: LWCLEAR (Sect. 4.4) and CONV (Sect. 4.5). In both cases, there are several factors involved and a more detailed analysis is therefore useful.

4.4 Factors affecting LW CLEAR

Using the method detailed in Appendix C, the term LWCLEAR was further decomposed as

$$L{W_{CLEAR}}=L{W_{CLEAR - S}}+L{W_{CLEAR - WW}}+L{W_{CLEAR - LR}}+\varepsilon$$
(11)

Here LWCLEARS represents variations in an effective surface emissivity calculated from the monthly means of surface air temperature and surface upward LW radiation (Eq. (21)). In practice, this term mainly reflects variations in the surface minus surface air temperature difference. The next two terms represent the main factors expected to affect the atmospheric clear-sky greenhouse effect (Webb et al. 1993), i.e. the atmospheric water vapor content (WW) and the lapse rate between the surface and the midtroposphere (LR). These terms were estimated using linear regression. ε is the residual from this regression.

The factors that contribute to SDC(LWCLEAR) based on (11) are analysed in Fig. 7. Variations in the effective surface emissivity (term \(L{W_{CLEAR - S}}\)) are unimportant over most land areas (Fig. 7a). However, they are more important over the mid-to-high latitude oceans, particularly the northern North Atlantic, where relatively large differences between the surface and surface air temperatures occur. Elsewhere, SDC(LWCLEAR) is dominated by variations in the atmospheric clear-sky greenhouse effect.

Fig. 7
figure 7

Division of SDC(LWCLEAR) (e) to contributions from the four terms in Eq. (11) (ad). All values are averaged over the 12 months and between ERA-Interim and MERRA2. The global area means are given in the headings. Note that the colour scale differs from Figs. 3 and 4

Both the water vapour and the lapse rate variations are found to amplify temperature variability in most areas (Fig. 7b, c). The lapse rate contribution (Fig. 7c) is largest in areas where temperature anomalies typically have a bottom-heavy structure, so that anomalies of surface temperature are not accompanied by equally large anomalies aloft. This is generally the case in high latitudes (especially in winter, Figs. S16, S17), but also over dry land areas such as Australia. The lapse rate contribution is also substantial in the easternmost tropical Pacific, where local SST variations mainly affect air temperature in the boundary layer below a climatological subsidence inversion (Andrews and Webb 2018). The water vapour contribution is widely dominant at lower latitudes, being particularly large over the central and eastern Pacific Ocean (Fig. 7b), but is still not positive everywhere. One of the exceptions is the western tropical Pacific, where the highest surface air temperatures coincide with remotely forced anomalous subsidence that warms the surface by reducing cloud cover but also simultaneously reduces the atmospheric water vapour (Trenberth and Shea 2005).

4.5 Interpretation of CONV

For the maps and diagrams shown this far, CONV was calculated as a residual. Here we report the results obtained from the direct calculation of the term using ERA-Interim data (Sect. 3.2 and Appendix A).

It is first necessary to note that CONVDIR (8) and the residual CONV are far from identical. CONVDIR exhibits larger interannual variability than CONV (Figs. S18a, d), and the interannual standard deviation of their mutual difference exceeds 4 K in many parts of the world (Fig. S18g). Given the earlier experience of numerical difficulties in the calculation of atmospheric energy flux convergence (e.g., Chiodo and Haimberger 2010; Mayer and Haimberger 2012; Liu et al. 2017), these differences are not unexpected. Nevertheless, the time series of CONVDIR and CONV are positively correlated nearly everywhere, and over most of the oceans the correlation exceeds 0.7 (Figure S19). CONVDIR and CONV also share broadly the same statistical relationship with temperature anomalies, particularly over the oceans (Figs. S18b, c, e, f). Still, the SDC of the residual estimate tends to be somewhat more positive than that of the direct estimate over the midlatitude continents, and less positive over the midlatitude oceans (Fig. S18i). This systematic feature might reflect a mismatch between the interannual variations of the atmospheric energy flux convergence and the TOA and surface energy fluxes in ERA-Interim, rather than just numerical errors in CONVDIR.

Following (8), CONVDIR was divided into three terms that represent the energy flux convergence by the monthly mean atmospheric flow and sub-monthly variations in the flow, and changes in the atmospheric energy content during a month (“storage”) (Fig. 8). This division reveals a strong tendency of cancellation between the monthly mean and sub-monthly energy flux convergence components at extratropical latitudes (Fig. 8d–i). In midlatitudes, the monthly mean energy flux convergence component amplifies temperature variability (Fig. 8e, f), whereas the sub-monthly component acts to reduce the variability (Fig. 8h, i) but is typically slightly smaller in magnitude. Outside of midlatitudes, the situation is less clear-cut. For example, over parts of Antarctica, sub-monthly energy flux convergence appears to amplify, but monthly mean convergence to attenuate temperature anomalies. In the tropics, the monthly mean component generally dominates over the sub-monthly component. Finally, Fig. 8j shows that within-month changes in atmospheric energy storage are a non-negligible part of CONV in individual months. However, these changes neither systematically amplify nor reduce the temperature variability (Fig. 8l). The atmospheric energy content tends to be broadly in phase with surface air temperature, and hence its change from the beginning to the end of the month is nearly uncorrelated with the monthly mean temperature anomaly (Fig. 8k).

Fig. 8
figure 8

Term CONV as calculated directly from energy flux convergence and storage using ERA-Interim data (ac), and its decomposition to the contributions of df the monthly mean flow, gi sub-monthly flow variations and jl atmospheric energy storage. The three columns are the same as in Fig. 3. Grey colour indicates areas where the sign of the standard definition contribution is not significant at 5% level based on a sign test

The tendency of sub-monthly energy flux convergence to reduce interannual temperature variability in midlatitudes is consistent with earlier research. In particular, Lau and Nath (1991) found a negative correlation between anomalies of monthly mean temperature at the 850 hPa level and the temperature tendencies induced by synoptic-scale eddy heat fluxes (their Fig. 13). This diffusion-like behavior of eddies also applies to the time mean flow, with the eddy heat fluxes acting to reduce both the meridional and zonal gradients of temperature (Lau and Holopainen 1984). One may therefore assume that, at least in the midlatitudes, the anomalies in the sub-monthly energy flux convergence are more a consequence than a cause of the monthly mean energy content (or temperature) anomalies.

Monthly mean energy flux convergence tends to amplify and its sub-monthly counterpart to attenuate the midlatitude temperature variability in both the NDJFM and MJJAS seasons (Fig. S20). Interestingly, however, the sub-monthly energy flux convergence makes a more negative SDC contribution in the northern parts of Eurasia and North America in summer than in winter. The tendency of CONV to amplify temperature variability in winter but to rather reduce it in summer in these areas (Fig. 4m, n) thus reflects a delicate balance between the contributions of the monthly mean and sub-monthly energy flux convergence.

5 Discussion

The results of diagnostic techniques tend to become more difficult to interpret when the quantity of interest (here the temperature anomaly) is a small residual of large but compensating right-hand-side terms. The tendency of compensation between the monthly mean and sub-monthly energy flux convergences was already discussed in Sect. 4.5. Another equally important case is the compensation between CONV and SURF over the ice-free oceans.

As shown in R17,

$$\Delta (S - L)=\Delta G - \Delta \left( {C - \frac{{\partial E}}{{\partial t}}} \right)= - D(SURF+CONV)$$
(12)

The compensation between SURF and CONV therefore indicates that, over the ice-free oceans, the anomalies in the net TOA radiation flux S–L are smaller than those in the net surface energy flux G. In fact, the average interannual standard deviation of G as calculated over the 12 months and all ocean grid boxes exceeds the standard deviation of S–L by more than a factor of three (not shown). This difference is qualitatively explicable by the fact that the TOA radiation balance is much less sensitive to variations in air temperature than the net surface energy flux is to the air–sea temperature difference. Everything else being the same, a 1 K anomaly in T only increases L, and hence reduces SL, by D∆T ≈ 3.3 Wm−2 (Eqs. (1) and (4)). On the other hand, bulk parameterizations of turbulent energy fluxes (e.g., Kara et al. 2000) indicate a change of up to several tens of Wm−2 in the net surface energy flux per each 1 K change in the air–sea temperature difference. Over the ice-free oceans, where a substantial net surface flux can be sustained by the heat capacity of the ocean mixed layer, ∆G can thus easily exceed ∆(SL) even when the anomaly in the air–sea temperature difference is relatively small.

One may argue that the multiplicator D1 ≈ 0.3 K W−1 m2 used in (5) exaggerates the actual sensitivity of surface air temperature to variations in local energy input. This is particularly the case over the ice-free ocean, due to the ability of the net surface flux to consume a large fraction of any anomalous energy input into the air column. However, the diffusive behavior of the sub-monthly atmospheric energy flux convergence (Sect. 4.5) implies that the same also applies in other areas. Anomalies in the energy input into an air column, regardless of whether they originate from the net surface energy flux, cloudiness or, for example, surface albedo, are only partly balanced by local temperature-mediated changes in the TOA radiation balance. A large fraction of the energy input anomaly rather tends to be exported away by the atmospheric circulation.

To alleviate the systematic compensations, an energy budget framework should ideally take into account the effects of surface air temperature anomalies on SURF and CONV, rather than treating all of SURF and CONV as independent right-hand-side terms. However, this would require a substantial extension of the method. First, the energy budgets of the upper ocean and ground should be explicitly included, in addition to that of the atmosphere (Hedemann et al. 2017; Liu et al. 2017). Second, the effect of temperature anomalies on atmospheric horizontal energy flux convergence should be parameterized as a diffusion process. The second requirement is particularly difficult to achieve in a single-column framework, because the energy flux convergence is regulated by the gradients rather than the absolute local values of temperature and atmospheric energy content.

A local and instantaneous energy budget framework cannot identify processes that are non-local in space or time. For example, during an El Niño, atmospheric energy flux divergence over central and eastern tropical Pacific acts to cool the air locally, thereby balancing a large fraction of the anomalous net surface energy flux. However, energy flux divergence in one area requires convergence elsewhere. Due to the stationary Rossby waves excited by diabatic heating anomalies (Simmons 1982; Ji et al. 2016), this energy redistribution process is more complicated than just horizontal diffusion. As another example, an anomaly in atmospheric circulation in the preceding months might help to build a warm or cold anomaly in the upper ocean temperature in some area, which would then feed back to the atmosphere by inducing an anomalous net surface energy flux. Thus, although energy budget analysis is useful for diagnosing the origin of temperature anomalies, it alone will not reveal the full cause-effect chain of events.

6 Conclusions

This study has investigated the energetics of interannual temperature variability in the ERA-Interim and MERRA2 reanalyses. Using the method introduced in R17, the anomalies in monthly mean surface air temperature were decomposed to six main components, representing the variations in (1) the atmospheric clear-sky greenhouse effect, (2) clear-sky SW radiative properties of the atmosphere, (3) surface albedo, (4) clouds, (5) the net surface energy flux, and (6) atmospheric energy flux convergence minus storage. Based on their covariation with the actual temperature anomalies, the effects of these individual components on temperature variability were then statistically diagnosed. A rich variety in the energetics of temperature variability in different areas and times of the year was found, depending on the surface conditions, availability of solar radiation and the local meteorological characteristics. Nevertheless, the main findings are the following:

  1. 1.

    Over the ice-free oceans, anomalies in surface air temperature are typically a small residual of opposite contributions from the net surface heat flux and atmospheric energy flux convergence. In the tropics, particularly in the eastern Pacific, the net ocean-to-atmosphere heat flux provides the main energy source for temperature variability, but most of this energy input is transported away by the atmospheric circulation. This pattern is reversed at higher latitudes, where variations in atmospheric energy flux convergence are large but are mainly consumed by heating or cooling the water mass, rather than changing the surface air temperature.

  2. 2.

    The net surface heat flux also tends to attenuate temperature variability on land but is mostly a secondary term in the energy budget. Major energetic drivers of temperature variability over land include, depending on season and location, variations in the atmospheric energy flux convergence, clouds, the clear-sky greenhouse effect, and surface albedo. Nonetheless, atmospheric energy flux convergence reduces rather than amplifies temperature variability over large parts of Eurasia and North America in summer, partly compensating a strongly positive cloud contribution to temperature variability. The same happens in some tropical land areas, especially northern South America.

  3. 3.

    Care is needed in the interpretation of atmospheric energy flux convergence, which is affected by variations in both the atmospheric circulation and the atmospheric energy content and hence temperature. Thinking of anomalies of energy flux convergence simply as a cause of temperature anomalies is therefore not justified. In midlatitudes, in particular, our results reveal a duality between time scales, with anomalies in the monthly mean flow amplifying, but the sub-monthly variations attenuating temperature variability via their effect on the energy flux convergence. The net of these two very large components leaves a much smaller residual, particularly over land. The counter-intuitive situation in which the net effect of the energy flux convergence is to reduce temperature variability may arise when other components in the energy balance strongly act to amplify the variability. This is the case, for example, with cloud anomalies in much of Eurasia and North America in summer. Thus, although this has not been directly addressed herein, many summer heatwaves with reduced cloudiness may actually coincide with anomalous energy transport out of the air column.

The two reanalyses agree well on these general features, but some quantitative differences are evident. The ERA-Interim minus MERRA2 differences in the individual terms typically range from about 25% to 100% of the two-reanalysis mean on the grid box scale. Perhaps unsurprisingly, the effect of clouds is one of the most uncertain terms in the decomposition.

By analyzing the energetic contributions to the standard deviation of monthly mean temperature, this study has emphasized the typical energy budget features associated with temperature anomalies. Nevertheless, the correlation between the individual energy budget components and temperature anomalies is far from perfect (middle column of Fig. 3). Thus, a similar temperature anomaly may result from different combinations of energetic contributions. Examples of this variation are also readily visible in the time series of Fig. 1. For instance, although the net surface heat flux typically attenuates temperature variability, it amplified the cold anomalies in Januarys 2003 and 2010 (Fig. 2a, b). In both cases, the cold January was preceded in Finland by a very cold second half of December, which served to reduce the ground-to-air heat flux by cooling the ground. Apart from this case-to-case variability, it would be worthwhile to study to which extent the relationship between energetics and temperature anomalies is (or is not) nonlinear. For example, do summer months with extreme warm anomalies differ from those with moderate anomalies in the relative importance of the energy balance components that contribute to these anomalies?

To give a globally consistent overview, the analysis in this paper has covered the whole world. More remains to be learned from more in-depth studies of temperature variability on regional scales. Moreover, keeping in mind the issues discussed in the previous section, a diagnostic energy budget approach should ideally be complemented by carefully designed model experiments. Such experiments could help to elucidate, for example, the remote effects of SST variability on the atmospheric energy transport and hence temperature.

Energetics of interannual temperature variability is also important in the context of climate modelling. The magnitude of interannual variability differs considerably between different global and regional climate models (Räisänen 2002; de Elía et al. 2013). Linking this variation to its energetic contributors could potentially help the improvement of climate models. The energetics point of view might also facilitate a better understanding of model-simulated future changes in temperature variability. Together with the evaluation of the present-day energetics of variability in the models, this could help distinguishing between more and less likely projections for the future.