1 Introduction

Dynamical down-scaling has been widely used to improve regional climate description at fine scale. It consists in driving a regional climate model (RCM) by large-scale fields provided by a global circulation model (GCM) or (re) analysis as initial and boundary conditions. Previous studies (e.g. Miguez-Macho et al. 2004; Lo et al. 2008; Radu et al. 2008; Salameh et al. 2010) have shown the necessity of relaxing the three-dimensional RCM fields towards the GCM fields to avoid deviation from the large-scale atmospheric circulation. This relaxation technique, also referred to as nudging, consists in partially imposing the large scale of the driving fields provided by global climate models (GCM) or (re)-analyses, on the regional climate model (RCM) simulation with the aim of disallowing large and unrealistic departures between driving and driven fields. Two different types of nudging exist, indiscriminate nudging (IN), also referred to as data assimilation, dynamical relaxation, grid-point nudging or analysis nudging, consists in driving the RCM indiscriminately at all scales (Davies and Turner 1977; Stauffer and Seaman 1990; Lo et al. 2008; Salameh et al. 2010; Omrani et al. 2012a). Spectral nudging (SN) consists in driving the RCM on selected spatial scales only (e.g. Kida et al. 1991; Waldron et al. 1996; von Storch et al. 2000; Radu et al. 2008; Omrani et al. 2012b).

Nudging was initially developed for data assimilation purposes. Several studies have investigated the “optimal” use of nudging by varying nudging parameters (Zou et al. 1992; Stauffer and Bao 1993; Vidard et al. 2003; Lei et al. 2012). The first motivation of using nudging in data assimilation is to minimize the model errors with respect to observations or gridded analysis, whereas, nudging is used in regional climate modeling to ensure a dynamical consistency between the driving field and the simulated large-scale fields, however it does not ensure the best observed fine scale field because of the possible impact of incorrect driving large-scale field. In this context, the optimal nudging strategies with respect to dynamic downscaling could add skill whenever the parent global model has some level of skill.

Nudging can be applied to a different set of meteorological variables depending on models, nudging technique (IN or SN) and the down-scaling purpose (see Table 1). However, how to choose the nudged variable(s) remains an open question. Only few studies have addressed this issue. Radu et al. (2008) analyzed the sensitivity of RCM simulations to the nudged variables in order to quantify the associated uncertainties. Recently Pohl and Crétat (2013) explored the impact of the nudged variables on the WRF simulated deep atmospheric convection in the tropics. They showed that nudging temperature reduces the model bias, while nudging horizontal wind improves the model time variability with respect to the observations. In the present work, we adopt an idealized approach known as the Big Brother Experiment (BBE) (Denis et al. 2002a) in order to evaluate the “best” set of variables to be nudged and the physical processes supporting such optimal configuration, if they exist. However, the variables that can be nudged differ between models and between indiscriminate and spectral nudging (Table 1). The WRF model is thus chosen here because of the possibility to apply both indiscriminate and spectral nudging techniques all other things being equal, and as a natural follow-up of Omrani et al. (2013). To isolate the effect of nudging without any interference with other sources of error and uncertainty propagation, a BBE approach is used as in Omrani et al. (2013), meaning that the same model and hence the same physics are used between the driving and driven models. Indeed, in real regional climate modeling, the GCM used to drive the RCM have generally different numerical schemes and physical parameterizations (e.g. Kanamaru and Kanamitsu 2007; Thatcher and McGregor 2009). In addition, the validation of regional climate simulations must eventually be done by comparing to observations, often using gridded datasets like CRU and ECA&D which have their own uncertainties and biases (Flaounas et al. 2012).

For all these reasons, it must be clear that the ideal nudging configuration of WRF discussed in this study may not at the end produce the best results due to other uncertainty sources. However, it offers a set of controlled experiments designed to isolate the impact of nudging one variable rather than another and to understand the different physical processes involved.

Table 1 Review of variables used for nudging in various models

This paper is organized as follows. A description of the WRF model and the experimental set-up is given in Sect. 2. In Sect. 3 we quantify statistically the impact of the different sets of nudged variables on the simulation of fields of great importance for regional climate modeling: surface temperature, wind and rainfall. Potential dynamical processes causes for the different responses observed are analyzed and discussed in Sect. 4. Finally, Sect. 5 summarizes the results and points out some open research questions needing further investigation.

2 The model and the experiment design

The model used in this study is the 3.1.1 version of the weather research and forecasting model (WRF) released on July 31, 2009. WRF is a limited area model, non-hydrostatic, with terrain following sigma-coordinate mesoscale modeling system designed to serve both operational forecasting and atmospheric research needs (Skamarock and Klemp 2007). The WRF development is a collaborative partnership, principally among the National Center for Atmospheric Research (NCAR), the National Oceanic and Atmospheric Administration (the National Centers for Environmental Prediction (NCEP) and the Earth System Research Laboratory (ESRL), the Air Force Weather Agency (AFWA), the Naval Research Laboratory, the University of Oklahoma, and the Federal Aviation Administration (FAA). WRF offers a flexible and robust platform not only used but also developed by a worldwide research community including government laboratories, academia and the private sector.

The physical options chosen to perform the simulations include the WRF Single-Moment 5-class microphysical parameterization (Hong and Juang 1998; Hong et al. 2004), the new Kain–Fritsch convective parameterization (Kain 2004), the Dudhia shortwave radiation (Dudhia 1989) and Rapid Radiative Transfer Model longwave radiation (Mlawer et al. 1997) and the Yonsei University planetary boundary layer scheme (Hong et al. 2006). For the land surface model, a 5-layer diffusive scheme is used here.

As in Omrani et al. (2013), we conduct a “Big-Brother Experiment”. It consists in first establishing a reference climate by performing a large-domain high-resolution RCM simulation: this simulation is called the Big-Brother (BB). This reference simulation is then degraded by filtering short scales that are generally unresolved by the driving fields (DF) (e.g. global reanalysis and global climate models). This filtered reference is then used to drive the same nested RCM (called the Little-Brother LB), integrated at the same high-resolution as the Big Brother, but over a smaller domain that is embedded in the Big-Brother domain. The climate statistics of the Little Brother are then compared with those of the Big-Brother over the Little-Brother domain (Fig. 1). Differences can thus be attributed easily to errors associated with the downscaling technique, and not to model errors nor to observation limitations.

Fig. 1
figure 1

“Big-Brother” experiment diagram using a limited area model (LAM) as a regional climate model (RCM)

2.1 Nudging

We investigate the two existing nudging techniques in WRF, i.e. the indiscriminate nudging (IN) (Stauffer and Seaman 1990) and the spectral nudging (SN) (Cha et al. 2006). The nudging technique consists in relaxing the model state towards the driving large-scale fields by adding a non-physical term to the model equation. This nudging term is defined as the difference between the observation and the model solution weighted by a nudging coefficient which is the inverse of the nudging time \(\tau \). In WRF, indiscriminate nudging is represented by the general following equation described by Stauffer and Seaman (1990) [Eq. (1)].

$$\begin{aligned} \frac{\partial p^{*}\alpha _{LB}}{\partial t}=F\left( \alpha _{LB},x,t\right) +\frac{1}{\tau }\, W\left( x,t\right) \,\epsilon \left( x\right) \, p^{*}\left( \alpha _{DF}-\alpha _{LB}\right) \end{aligned}$$
(1)

where \(\alpha \) is a prognostic variable, \(p^{*}=p_{s}-p_{t}\) is a mass-weighting factor due to the use of a terrain-following sigma-coordinate, \(p_{t}=50\,\hbox{hPa}\) is the pressure at the model top, \(p_{s}\) is the surface pressure, \(x\) represents the spatial variables, \(t\) is time. The model’s physical forcing terms (advection, Coriolis effects, etc.) are represented by \(F\). The subscripts \(DF\) and \(LB\) stand for driving field and Little-Brother, respectively. The quantity \(1/\tau \) is the nudging coefficient, with \(\tau \) a relaxation time scale for the nudging term. The nudging coefficient can vary in time and space with a four-dimensional weighting function \(W\). Finally, \(\epsilon \) is the analysis quality factor which ranges between \(0\) and \(1\).

In WRF, indiscriminate nudging can be applied to the wind components \(u\) and \(v\) (\(u\) and \(v\) are grid-relative wind components but in this study with the Mercator projection \(u\) and \(v\) are the zonal and meridional wind components, respectively), to the potential temperature \(\theta \), and to the water vapor mixing ratio \(q\). Nudging can be restricted to not act within the atmospheric boundary layer or below a user-specified model level (Rogers et al. 2013). For indiscriminate nudging, Omrani et al. (2012a) show that the smaller the nudging time \(\tau \), the closer the RCM predictions \(\alpha _{LB}\) to the driving fields \(\alpha _{DF}\) interpolated on the RCM grid and the larger the inhibition of the small-scale RCM dynamics.

The spectral nudging technique technique consists in driving the RCM on selected spatial scales only and does not affect the small-scale fields since only the large scales are relaxed. In this study, the cutoff wavelength corresponds to 300 km. Spectral nudging in WRF can be applied to the wind components, potential temperature and geopotential \({\varPhi }\). The spectral nudging equation is that of Miguez-Macho et al. (2004):

$$\begin{aligned} \frac{\partial \alpha _{LB}}{\partial t}=L\left( \alpha _{LB}\right) +\sum _{\left| n\right| \le N}\sum _{\left| m\right| \le M}\frac{1}{\tau _{mn}}\,\left( \alpha _{mn}^{DF}-\alpha _{mn}^{LB}\right) e^{ik_{m}x}e^{ik_{n}y} \end{aligned}$$
(2)

where \(\alpha \) represents the prognostic variable being nudged, L is the model operator, \(\alpha _{mn}^{DF}\) and \(\alpha _{mn}^{LB}\) represent the spectral coefficients of \(\alpha _{DF}\) and \(\alpha _{LB}\). The nudging coefficient \(\frac{1}{\tau _{mn}}\) can vary with \(m\) and \(n\) (wavenumbers in the \(x\) and \(y\) direction, respectively) as well as height; \(k_{m}\) and \(k_{n}\) then represent the wave vector and are dependent on the domain size, \(D_{x}\) and \(D_{y}\), given by:

$$\begin{aligned} k_{m}=\frac{2\pi \, m}{D_{x}}\quad \hbox {and} \quad k_{n}=\frac{2\pi \, n}{D_{y}} \end{aligned}$$

Following Lo et al. (2008), the nudging term is applied only above the planetary boundary layer (PBL) with a time-dependent height diagnosed by the planetary boundary layer scheme. In fact, nudging in the PBL may prevent reasonable mesoscale structures from being developed by constraining these structures toward large-scale driving fields. Above that height the nudging coefficient is constant. The nudging coefficient can be defined separately for each nudged variable, however in this work we use the same nudging coefficient for all the nudged variables. It is set to six hours (6 h). This time defined as “the optimal nudging time” was determined in Omrani et al. (2013) using the same model configuration, where a set of experiment was conducted by varying the nudging times (01, 03, 06 and 12 h). However, it should be noticed that the “optimal” nudging time used here is not universal but depends on the model configuration.

2.2 Simulations

A set of \(2\times 11\) simulations has been performed for winter and summer by varying the set of nudged variables, for both indiscriminate and spectral nudging techniques (Table 2). The BB simulation was performed over a large domain covering Europe and North Africa (Fig. 1) with 130 × 140 horizontal grid points with a 50 km horizontal resolution, as required within CORDEX (Giorgi et al. 2009), and 28 vertical levels. The model top is at 50 hPa. The winter simulations start on 1 December 1989 to 28 February 1990 with one month spin up (November 1989). The summer simulations start on 1 June 1990 to 31 August 1990 with one month spin up (May 1990). The initial and boundary conditions of the BB simulation are provided by the ERA-Interim reanalysis of the European Center for Medium-range Weather Forecast (ECMWF). The resolution of the BB fields are then degraded using a simple low pass averaging filter to obtain a resolution of 300 km × 300 km from the 50 km × 50 km fields and to serve as large-scale driving fields for the LB simulations.

The LB simulations have been performed with a 50 km resolution over the EURO-CORDEX domain (98 × 100 horizontal grid points) covering Europe and the Mediterranean regions (Fig. 1). When indiscriminate or spectral nudging is used, it is applied above the planetary boundary layer as suggested by Lo et al. (2008). The variables that can be nudged are the potential temperature, wind and moisture for IN and the potential temperature, wind and geopotential for SN. The update frequency of the large-scale driving fields is set to 6 h since the ERA-Interim reanalyses and CMIP5 climate simulations used in the frame of HyMeX (Drobinski et al. 2009) and MED-CORDEX programs are also sampled every 6 h. WRF automatically performs a linear time interpolation between two samples of the driving fields.

Table 2 List of performed simulations

To quantify the ability of the LB to reproduce the reference field, we use the mean bias \(\delta \), the root mean square error \(\varepsilon \), the standard deviation \(\sigma \) and the correlation coefficient \(\gamma \).

3 Sensitivity analysis to the nudged variables

In this section the impact of nudging various sets of variables on nudged and non-nudged variables is analyzed.

3.1 Nudging wind and temperature

We first focus on simulations where only wind and temperature are nudged (IN-UV, IN-T, IN-UV&T, SN-UV, SN-T, and SN-UV&T), in order to have the same set of experiments for indiscriminate and spectral nudging and compare the two techniques.

Figure 2 shows the Taylor diagram for the total space–time 3-h cumulated precipitation in summer (Fig. 2a) and winter (Fig. 2b) for the simulations performed without nudging (NN with square marker) and with nudging (IN with circle marker and SN with star marker). The Taylor diagram provides a way of graphically summarizing how closely a pattern (or a set of patterns) matches the reference field (Taylor 2001). The similarity between two patterns is quantified in terms of their correlation \((\gamma )\), their centered root-mean-square difference \((\epsilon )\) and the amplitude of their variations (represented by their standard deviations \(\sigma \)). Notice that we first subtract at each grid point the seasonal mean before computing the space–time standard deviation and the space–time root-mean-square difference. The LB patterns that agree well with BB are the nearest from the point marked “BB” on the x-axis. These patterns have relatively high correlation and low errors. Models lying on the arc corresponding to the standard deviation \(\sigma \) value of the reference have the correct standard deviation (which indicates that the pattern variations are of the right amplitude).

Fig. 2
figure 2

Taylor diagram for daily cumulative precipitation in summer (a) and in winter (b) obtained from the LB simulations over the EURO-CORDEX domain. The red dot indicate the skill target for the LB simulations (“BB” stands for Big-Brother). The stars, circles and square display the skill scores of the SN, IN and NN simulations in the Taylor diagram, respectively

Overall, we note a much lower skill at reproducing the reference precipitation in summer (Fig. 2a) compared to winter (Fig. 2b). For NN simulations, the precipitation field is poorly correlated to the BB fields compared to IN and SN simulations. IN-UV&T and SN-UV&T simulations give the highest scores among all performed simulations. We also note that in summer, all simulations have systematically a higher standard deviation compared to BB, however in winter, all the simulations have a smaller standard deviation compared to BB.

Fig. 3
figure 3

The mean bias (mm \(\hbox {day}^{-1}\)) between the LB simulations with respect to the BB simulation of the daily precipitation for summer (ag) and for winter (a′–g′) for NN (a, a′), IN-UV&T(b, b′), IN-UV (c, c′), IN-T (d, d′), SN-UV&T (e, e′), SN-UV (f, f′), SN-T (g, g′) simulations (see Table 2)

Figure 3 displays maps of summer and winter precipitation bias for the different experiments. In summer, a dry bias (about 2 mm \(\hbox {day}^{-1}\)) over a large area of Eastern Europe is produced in the NN simulation (Fig. 3a). We also note a strong wet bias over the Northeastern boundary of the domain, probably due to the resolution discontinuity between the driving large-scale fields at the domain boundaries and the fine resolution simulated field within the domain. This discontinuity causes Gibbs oscillations that are partially smoothed by applying a damping Davies zone. However, some discontinuity is unavoidable which produces strong horizontal gradient of the horizontal wind. Conservation of mass then imposes strong vertical velocities which can trigger unrealistic precipitation near the domain boundaries. Therefore, in all our diagnostics we removed the data within the Davies zone over the 5 grid points nearest to the domain boundaries. When temperature is nudged (Fig. 3d, g) this effect is significantly reduced. However, the dry bias persists. On the other hand, when only wind is nudged (Fig. 3c, f), the dry bias disappears almost completely but not the wet bias over the boundaries. It seems that the two effects are decoupled and are due to different processes. Finally, when both wind and temperature are nudged (Fig. 3b, e) the bias is strongly reduced over all the domain. In winter, precipitation is underestimated in all LB simulations and the impact of nudging is not clear.

For surface wind we found the same behavior (not shown). Nudging wind and temperature gives the highest skills.

Figure 4 is similar to Fig. 2 and shows the Taylor diagram for the total space–time surface temperature in summer (Fig. 4a) and winter (Fig. 4b) for the simulations performed without and with nudging. Again we first subtract at each grid point the seasonal mean before computing the space–time standard deviation and the space–time root-mean-square difference.

Fig. 4
figure 4

Same as Fig. 2 for surface temperature

For both seasons, the NN simulations (square marker) have the lowest correlation coefficients and the highest root-mean-square error compared to the nudged simulations (IN and SN). It also has a higher standard deviation compared to the BB in summer and almost the same in winter. The nudged simulations, independently of the type of nudging (IN and SN), have the highest skills. Indeed, nudging both wind and temperature (IN-UV&T, SN-UV&T) gives the highest skills in term of correlation and root mean square. Then comes IN-UV and SN-UV simulations. Nudging only temperature shows the lowest skills among all the nudged simulations. For the same set of nudged variables it is hard to distinguish between the two nudging techniques, the circle and star markers are almost superposed.

Fig. 5
figure 5

Same as Fig. 3 (°C) for surface temperature (i.e. at 2 m height)

Figure 5 shows the mean biases between the BB and LB simulations in the surface temperature in summer and winter for the not nudged (NN) simulation (Fig. 5a, a′) and the nudged simulations (IN-UV&T, IN-UV, IN-T, SN-UV&T, SN-UV, SN-T respectively Fig. 5b–g, b′–g′. In summer, the non nudged simulation displays a strong warm bias (>5 °C) over Europe and North Africa mainly over land. This bias is very weak over the sea because the sea surface temperature (SST) is prescribed from ERA-Interim reanalysis for all simulations, whereas over land, the land surface model (LSM) computes its own surface temperature. When nudging is used, this bias decreases to about \(\pm 2\,^{\circ }\hbox {C}\). Here again, nudging wind and temperature show the highest skills by reducing the bias to less than \(\pm 1\,^{\circ }\hbox {C}\). Unlike summer, we note a strong cold bias (about \(-6\,^{\circ }\hbox {C}\)) in winter in Eastern Europe and a smaller bias between \(1\) and \(4\,{^\circ }\hbox {C}\) over almost the whole continental domain in the NN simulation. This bias is strongly reduced when nudging is applied to wind and temperature. However in winter, the impact of nudging is not as spectacular as in summer and a significant residual cold bias persists over almost the entire domain. Here again the difference between the two nudging techniques is very small. At least two facts could yield this result. The first one is that being applied above the PBL, nudging does not affect the small-scales in the PBL. Above the PBL, the small-scale structures are much less energetic so whatever its strength, nudging mainly impacts the large-scale field in LB. The second one is that the nudging time \(\tau =06\,\hbox {h}\) is equal to its optimal value as determined in (Omrani et al. 2013). Therefore we performed additional simulations with a much smaller nudging time (\(\tau =10\,\hbox {mn}\)) and with nudging in the PBL.

Fig. 6
figure 6

Power spectra for temperature at 1,000 hPa for a relaxation time \(\tau =10\,\hbox {mn}\) (a, c) and \(\tau =06\,\hbox {h}\) (b, d) when nudging is applied in the PBL (a, b) and above the PBL (c, d)

Figure 6 displays the power spectra of temperature at 1,000 hPa for a relaxation time \(\tau =10\,\hbox {mn}\) and \(\tau =06\,\hbox {h}\) when nudging is applied in the PBL and above the PBL. It shows that when nudging (IN or SN) is applied above the PBL, the small-scales of the surface temperature pattern are weakly affected (panels c and d) even if the nudging coefficient is large (i.e. nudging time \(\tau =10\,\hbox {mn}\)). When nudging is applied in the PBL, the small-scales are correctly simulated when the nudging time \(\tau =06\,\hbox {h}\) and equal to its optimal value as deteremined in (Omrani et al. 2013). Conversely, the small-scales are significantly damped in IN simulation when \(\tau =10\,\hbox {mn}\). In SN, the small-scale features of the temperature field are well reproduced in SN for length-scales smaller than the cutoff length. Overall, small scales are preserved if nudging is applied either above the PBL only or with a sufficiently long nudging time, both conditions being satisfied in our experiments, leading to small differences between IN and SN.

This large surface temperature bias produced in the NN simulations (Fig. 5a, a′) has already been observed in several studies (e.g. Radu et al. 2008; Caldwell et al. 2009; Bowden et al. 2012). Most of these studies explained this bias by an overprediction of daily-maximum temperature correlated with a low soil moisture content. As in Omrani et al. (2013), such an explanation did not hold since the very simple 5 layer land-surface model used did not permit to simulate soil moisture deficit. The other mechanism invoked in Omrani et al. (2013) is that the increase in temperature in summer corresponds to an atmospheric blocking situation artificially created by the model when nudging is not used. In that case small-scale discrepancies between the LB and BB can grow with time and contaminate the large scales. The NN simulation then produces in the center of the domain a large-scale atmospheric circulation which is inconsistent with the large-scale atmospheric circulation imposed at the domain boundaries and within the LB domain. This explanation is also valid in this study since the same experimental methodology is used here. We can easily verify this hypothesis. Figure 7, shows the 500 hPa geopotential height mean bias between the BB and LB simulations. The various sub-panels show the 500 hPa geopotential height mean bias for different sets of nudged variables. We note a strong positive anomaly of the summer geopotential height (\(\ge \)140 m) in the center of the domain for the NN simulation (Fig. 7a). When the temperature is nudged, this anomaly decreases significantly (\(\le \)40 m). Whereas, when the wind is nudged, this centered positive anomaly disappears and it is replaced by a negative anomaly over the whole domain. In winter, we see on the contrary a negative bias over the Eastern Mediterranean which decreases somewhat, but not spectacularly, when the wind is nudged.

Fig. 7
figure 7

Same as Fig. 5 for geopotential height at 500 hPa (m)

To check whether this temperature anomaly is confined to the surface or spreads over all vertical levels of the model, we plot the summer mean bias of temperature and the geopotential height at different vertical levels (900, 700, 500 and 400 hPa) for the NN, IN-UV&T, IN-UV and IN-T simulations (Fig. 8). Results for spectral nudging are not shown because we obtain very similar figures. The warm temperature bias exists at almost all vertical levels of our model when nudging is not used. Moreover, this bias is associated with a strong positive bias of the geopotential height that increases for the upper layers. Nudging reduces the temperature and the geopotential height bias at all levels (≤2 °C).

These results are consistent with the results of Bowden et al. (2012) who showed evidence of a temperature bias over North America in the absence of nudging, significantly correlated to a 500 hPa geopotential height anomaly. The impact of the absence of nudging on the 500-hPa geopotential height, and thus on the synoptic atmospheric circulation has significant consequences on the rainfall pattern as shown in Fig. 3 (panels a and a′). In summer, the dry bias is induced in part by the positive anomaly of the 500-hPa geopotential height. The artificial high pressure over the center of the domain reduces cloud cover and precipitation producing a negative soil-moisture anomaly with smaller evaporation and higher sensible heat which in turn warms the planetary boundary layer and increases the surface temperature (Zampieri et al. 2009; Hohenegger et al. 2009).

Fig. 8
figure 8

The mean bias (°C) between the LB simulations with respect to the BB simulation of temperature (in shaded color) at 900 (ad), 700 (eh), 500 (il) and 400 (mp) hPa for summer for NN (a, e, i, m), IN-UV&T (b, f, j, n), IN-UV (c, g, k, o), IN-T (d, h, l, p) simulations and geopotential height anomaly in contours

In summer and winter, a significant residual bias on the 500 hPa geopotential height persists even when all possible variables are nudged. To analyze why, let us assume that hydrostatic balance approximately holds, which is a good approximation at scales larger than a few tens of kilometers. The temperature profile then entirely determines the pressure and density profiles, up to a constant provided by surface pressure, an independent prognostic variable in sigma-coordinate models like WRF. Now any difference in surface pressure between LB and BB contaminates the whole pressure field. This in turn affects the large-scale circulation which is approximately in geostrophic balance with the pressure field. Figure 9 displays the time-mean bias of the surface pressure. The observed pattern is remarkably similar to the mean bias in 500 hPa geopotential height. This strongly suggests that surface pressure is the missing quantity that, in addition to temperature, would allow to prevent the pressure field to diverge from its LB reference value.

Fig. 9
figure 9

Same as Fig. 5 for hydrostatic surface pressure (hPa)

Figure 10 is similar to Fig. 3 but for 10m wind speed (m \(\hbox {s}^{-1}\)). In summer, the bias is very low over land with no significant difference between the NN simulation and the nudged ones. In winter, we note a negative bias (≥3 m s−1) over the Atlantic part of the domain, nevertheless weak over land. Overall, nudging has no significant effect on surface wind. Figure 11 shows the wind speed bias for different pressure levels. Near the surface the impact of nudging is not significant as shown in Fig. 10, however, for the upper level, the strong positive anomaly of the geopotential height creates an anticyclonic circulation in the center of the domain. Nudging wind remove this bias over the whole vertical level while nudging only temperature leaves a small residual bias.

Fig. 10
figure 10

Same as Fig. 3 for surface wind speed (m \(\hbox {s}^{-1}\))

Fig. 11
figure 11

Same as Fig. 8 but for wind speed (m \(\hbox {s}^{-1}\))

3.2 Nudging moisture

We now examine the impact of nudging moisture. Note that with WRF, nudging moisture can be used only with IN. Indeed, we have compared simulations where moisture is or is not nudged, i.e. NN versus IN-Q, IN-UV&T versus IN-UV&T&Q, IN-UV versus IN-UV&Q, IN-T versus IN-T&Q. Tables 3 and 4 display the correlation coefficients and the root mean square errors between the reference (BB simulation) and the different set of experiment with and without nudging moisture for rain, temperature and wind. Overall, nudging moisture has a positive effect on rain by reducing the bias and improving the temporal variability. This effect seems to be more significant in summer than in winter. Indeed, it increases the correlation coefficients by 0.1 at least and reduces significantly the RMS errors. For the other variables, nudging moisture shows no significant feedback on wind or temperature. This result is consistent with the finding of Radu et al. (2008), where they compared the sensitivity of the model to the set of nudged variables and showed that nudging the specific humidity improves the rainfall but it has a weak impact on temperature. No evident reason has been found within this framework which would explain these results. The boundary layer parametrization (eg. the convection scheme) may be at stake here. However, additional investigations out of the scope of this Big Brother Experiment are left for future work.

Table 3 Correlation coefficients for the different experiment set-up
Table 4 Root mean square error for the different experiment set-up

3.3 Nudging geopotential height

The bias of the geopotential height controls a large part of the surface wind and temperature biases. Nudging the geopotential height may thus be a way to overcome the problem of the residual bias due to the surface pressure bias. With WRF, we can nudge geopotential height with SN. To identify the added value of nudging geopotential (as for moisture) we compared pairs of simulations (NN versus SN-GHT), (SN-UV versus SN-UV&GHT), (SN-T versus SN-T&GHT) and (SN-UV&T versus SN-UV&T&GHT). Results show no difference (not shown). Nudging the geopotential has no discernable effect.

4 Discussion: effectiveness of nudging and inter-variable relationships

We finally discuss potential dynamical causes of two observations made previously: the absence of any impact of nudging geopotential on the simulations and the persistence of a bias in surface pressure and geopotential height even when nudging wind and temperature.

4.1 Implications of a nearly-hydrostatic regime

At scales much larger than the scale height (about 10 km), non-hydrostatic effects are small. Our simulations are well within this nearly-hydrostatic regime, and we first explore implications for the biases we observed. Table 5 shows the correlation coefficient between the geopotential height bias and either the surface pressure bias (\(C({\varDelta }{\varPhi }{}_{500},{\varDelta } p_{s})\)) or the surface temperature bias (\(C({\varDelta }{\varPhi }{}_{500},{\varDelta } T_{2})\) ) for NN, IN-UV&T, IN-UV and IN-T simulations, with \({\varDelta }{\varPhi }_{500}={\varPhi }_{500}^{LB}-{\varPhi }_{500}^{BB}, {\varDelta } p_{s}=p_{s}^{LB}-p_{s}^{BB}, {\varDelta } T_{2}=T_{2}^{LB}-T_{2}^{BB}\). In winter, we note that the geopotential height bias is mainly controlled by the surface pressure bias. Indeed, for the NN simulation the correlation coefficient \(C({\varDelta }{\varPhi }{}_{500},{\varDelta } p_{s})\) is high (83.27 %) compared to \(C({\varDelta }{\varPhi }{}_{500},{\varDelta } T_{2})=19.23\,\%\). When we nudge the temperature (IN-UV&T and IN-T), the residual bias of the geopotential height is even more correlated to the surface pressure bias (≥98 %). In summer, the geopotential bias seems to be more correlated to the temperature bias with \(C({\varDelta }{\varPhi }_{500},{\varDelta } T_{2})=84.6\,\%\) for NN simulation versus \(C({\varDelta }{\varPhi }{}_{500},{\varDelta } p_{s})=67.6\,\%\). When the temperature is nudged, the residual bias is strongly correlated to the surface pressure bias (\(\sim \)99 %).

Table 5 Correlation coefficients between the biases of geopotential height at 500 hPa (\({\varDelta }{\varPhi }_{500}={\varPhi }_{500}^{LB}-{\varPhi }_{500}^{BB}\)), of surface pressure (\({\varDelta } p_{s}=p_{s}^{LB}-p_{s}^{BB}\)) and of surface temperature (\({\varDelta } T_{2}=T_{2}^{LB}-T_{2}^{BB}\))

To analyze more theoretically the results of Table 5, let us consider the hypsometric equation (Holton 1992)

$$\begin{aligned} {\varPhi }_{500}-{\varPhi }_{s}=-R\left\langle T\right\rangle ln\left( \frac{500}{p_{s}}\right) \end{aligned}$$

where \(\left\langle T\right\rangle \) denotes the mean temperature in the layer between the surface pressure \(p_{s}\) and 500 hPa and R is the gas constant for dry air (R = 287 J K−1 kg−1). The corresponding geopotential height bias at 500 hPa is then:

$$\begin{aligned} {\varDelta }{\varPhi }_{500}^{theo}=\frac{1}{g_{0}} \left( {\varPhi }_{500}^{LB}-{\varPhi }_{500}^{BB}\right) = -\frac{R}{g_{0}}\left[ \left\langle T\right\rangle ^{LB}ln\left( \frac{500}{p_{^{LB}s}}\right) -\left\langle T\right\rangle ^{BB}ln\left( \frac{500}{p_{^{BB}s}}\right) \right] \end{aligned}$$
(3)
Table 6 Correlation coefficient and mean bias between the geopotential height bias retrieved from respectively NN, IN-T simulations and the geopotential height computed from Eqs. 3, 4

Table 6 shows the correlation coefficient and the mean bias between \({\varDelta }{\varPhi }_{500}\) and \({\varDelta }{\varPhi }_{500}^{theo}\) for the NN simulation. The two time series are highly correlated (99 %) with a very small bias. This confirms that the spatial scales considered here are dominated by the hydrostatic balance and the geopotential height is entirely determined by the surface pressure and the vertical profile of potential temperature. Now, if we prescribe the temperature by nudging toward the Big Brother temperature (\(T^{LB}\sim T^{BB}\)) Eq. (3) becomes:

$$\begin{aligned} {\varDelta }{\varPhi }_{500}^{IN-T,\, theo}=\frac{R}{g_{0}}\left\langle T\right\rangle ^{BB}ln(500)\left[ ln\left( p_{s}^{LB}\right) -ln\left( p_{s}^{BB}\right) \right] \end{aligned}$$
(4)

and the geopotential bias is then, theoretically, entirely determined by the bias of the surface pressure logarithm. This is a very plausible explanation for the strong correlation coefficient between \({\varDelta }{\varPhi }{}_{500}\) and \({\varDelta } p_{s}\) (\(\ge \)98 %) for IN-UV&T and IN-T simulations and both seasons (Table 5).

4.2 Actual degrees of freedom

To understand the absence of any impact of nudging geopotential on the simulations, it is useful to identify the actual degrees of freedom of the flow and how they relate to the prognostic variables of the numerical model. For atmospheric flows the existence of a dominant, hydrostatic balance reduces the degrees of freedom that the model can actually explore. This is especially true at the hydrostatic scales resolved in the present numerical experiments. In a hydrostatic numerical model using the same mass-based coordinate as WRF, the geopotential is a diagnostic variable which is entirely determined for each atmospheric column by the surface pressure and the vertical profile of potential temperature. When modelling hydrostatic flow with a non-hydrostatic model like WRF, it seems that geopotential is a prognostic variable but in fact it is strongly constrained due to hydrostatic balance being nearly satisfied. As a consequence nudging geopotential has a negligible effect unless the time scale for nudging is decreased until it becomes comparable to that of hydrostatic adjustment. This time is comparable to the time needed by sound waves to travel vertically over the atmospheric column, i.e. less than 1 min (Bannon 1995).

In hindsight, it is therefore entirely expected that nudging the geopotential has little usefulness. In fact nudging the geopotential should probably be avoided altogether, because it is more a diagnostic quantity than a prognostic variable. Notice that while this argument is clearly valid at horizontal scales where hydrostatic equations describe the flow well, it is probably also valid at smaller scales where the flow is non-hydrostatic. Indeed Arakawa and Konor (2009) and Dubos and Voitus (2014) have recently proposed non-hydrostatic equations where hydrostatic balance is nevertheless enforced for each atmospheric column, therefore making the geopotential explicitly diagnostic, exactly as with the primitive equations.

On the other hand hydrostatic balance puts no constraint on the value of the surface pressure, which is equivalent to the total mass of an atmospheric column. This is therefore a genuine degree of freedom. Surface pressure is associated with barotropic motion such as Lamb waves and barotropic Rossby waves. The latter play a significant role in large scale atmospheric variability at a scale of a few days (Madden 2007). It is therefore entirely possible that this mode of atmospheric motion becomes progressively inconsistent between the driving model and the driven model as time passes, if no nudging is applied. This issue might not be encountered when dealing with small regional domains or short experiments. However for regional climate studies over large domains, nudging surface pressure may be necessary as Fig. 9 (panels c and d) suggest.

4.3 Inter-variable relationships and nudging

Nudging has been initially introduced as a data assimilation technique (Anthes 1974; Hoke and Anthes 1976; Davies and Turner 1977). When used for this purpose, nudging is typically applied during a short assimilation window (12–24 h) before a non-nudged forecast is performed. When used for the purpose of regional climate modelling, nudging is applied during the whole simulation, typically months or years. Despite this difference, it is interesting to discuss our results in the light of the understanding of nudging gained in a data assimilation context. Indeed the question of which variables can best improve a forecast when assimilated into an initial condition by nudging has been examined in some detail (Hoke and Anthes 1976; Kuo et al. 1993; Bao and Errico 1997). The general picture that emerges from these studies is that nudging a certain field can affect other fields due to inter-variable relationships imposed by the dynamics. Those relationships, which depend on the spatial scale, are captured by a normal-mode analysis of idealized models, especially the shallow-water equations (Bao and Errico1997). The latter are relevant for hydrostatic atmospheric motion because three-dimensional motion can be decomposed into vertical modes (baroclinic and external) obeying uncoupled linearized shallow-water equations.

Whenever these modes couple two fields, nudging one affects the other. For instance small-scale gravity waves have as much kinetic energy as potential energy, coupling the (divergent) velocity field and the mass field. Nudging mass only (or velocity only) will affect both mass and (divergent) velocity in that case. Indeed, although the loss of potential (resp. kinetic) energy due to nudging is partially offset by drawing energy from the kinetic (resp. potential) reservoir, the wave is damped with a damping time just twice the nudging time (detailed calculation not shown). Conversely large-scale Rossby waves have much more potential energy than kinetic energy. Nudging wind affects the mass field very weakly in that case, as the loss of kinetic energy due to nudging is compensated by drawing energy from the much larger potential energy reservoir, resulting in an overall damping time much longer than the nudging time.

Following the above reasoning, nudging the small (\(<\)Rossby radius) scales of the mass field affects mostly gravity waves, hence small-scale divergent wind. Nudging the small scales of the wind field affects both Rossby and gravity waves, the latter affecting the small scales of the mass field. Nudging the large scales of the mass field affects large-scale Rossby waves, hence large-scale wind, assuming the latter is mostly geostrophic, a reasonable assumption at midlatitudes. Nudging the large scales of the wind field affects mostly inertial waves, which have very little potential energy, so the effect on the mass field should be weak. Overall while the small-scale mass field is affected by nudging applied to wind, it is not the case of the large-scale mass field.

Translating these shallow-water results into the three-dimensional realm involves the vertical structure of normal-modes, which are either external (barotropic) or baroclinic. Baroclinic modes have a very small projection onto surface pressure while external (barotropic) modes have a vanishing temperature perturbation (Davies et al. 2003). Hence in the above intervariable relationships, mass is a synonym of temperature for baroclinic modes and of surface pressure for external modes. Especially nudging wind affects the small-scale temperature field and nudging temperature affects the large-scale baroclinic wind field. However the shallow-water analysis applied to the barotropic modes suggests that nudging temperature has no effect on wind while nudging wind effectively controls only the scales smaller than the barotropic Rossby radius (about 3,000 km). Controlling the long barotropic Rossby waves would require nudging mass, i.e. surface pressure, as suggested above. The absence of such a control opens the possibility that surface pressure becomes inconsistent with the driving field, as we have observed in our simulations.

5 Conclusion

In this work, we investigate the impact of different sets of variables nudged in regional climate simulations performed with the Weather Forcasting and Research (WRF) model in a Big Brother Experiment (BBE) framework, using indiscriminate and spectral nudging. A set of of numerical simulations has been performed over the Euro–Mediterranean region in summer and winter. We quantify statistically the ability of nudging to reduce the inconsistencies between LB and BB (reference fields) that develop over time (Omrani et al. 2012a, b, 2013).

Comparison of nudged and non nudged configurations shows that nudging clearly improves the model capacity to reproduce the reference fields from the Big-Brother simulations, regardless of the diagnosed variable. However the improvement depends on the set of variables used to nudge the regional climate simulations. The results also depend on the season and the diagnosed variable.

The tropospheric horizontal wind is by far the key variable to nudge to simulate correctly surface temperature and wind, and rainfall. To a lesser extent, nudging tropospheric temperature also contributes to significantly improve the simulations. This is unsurprising as wind determines the transport of all conserved quantities, especially heat and moisture. These results are consistent with the study of Pohl and Crétat (2013), who showed that nudging the horizontal wind and temperature provides the best simulation of deep tropical convection with respect to observations. Furthermore wind and temperature are coupled through the thermal wind balance. Thermal wind balance follows from hydrostatic equilibrium and geostrophic equilibrium, the latter being valid at scales larger than about 1,000 km. Therefore nudging tropospheric wind or temperature directly impacts the simulation of the tropospheric geopotential height and thus the synoptic scale atmospheric circulation. Maintaining a synoptic circulation within the simulation domain consistent with the synoptic circulation at the domain boundaries is essential to avoid numerical artifacts which are known to produce large surface temperature and rainfall biases.

Nevertheless nudging wind and temperature leaves a residual bias in surface pressure and, as a consequence, in geopotential height. We suggest that this is due to the inability of the wind and temperature fields to control large-scale barotropic Rossby waves. Removing this residual bias seems to require the nudging of the surface pressure, an independent degree of freedom. This would not be without problems since nudging surface pressure actually adds a source term to the mass budget. Some care would be needed to do this in a manner consistent with the transport of potential temperature and other species.

Nudging moisture has generally a marginal impact on the quality of the regional climate simulations. However, it has a significant positive impact on the simulation of rainfall. As an immediate consequence, nudging all possible variables in WRF gives by far the best results with respect to the Big-Brother simulation. The same conclusions were found with spectral nudging and were thus not illustrated in the article. The fact that nudging can not be applied on moisture with spectral nudging in WRF (Bowden et al. 2012) is detrimental to a more thorough comparative analysis between indiscriminate and spectral nudging.

In this study, nudging has been applied to all nudged variables with the same relaxation time. Due to the different nature of the processes and scales that control temperature, wind and precipitation, the question of the use of different nudging times for wind, temperature and moisture seems relevant and should be addressed in the future. A potential limitation of this work is to be based on the single WRF model. However, for those results (ie that nudging wind and temperature has the most impact, that nudging geopotential is ineffective and that the surface pressure needs to be nudged) for which a plausible physical explanation could be found, it seems reasonable to expect that they are robust and model independent.