Evaluation of reanalysis datasets against observational soil temperature data over China

Yang, Kai; Zhang, Jingyong

doi:10.1007/s00382-017-3610-4

Evaluation of reanalysis datasets against observational soil temperature data over China

Published: 17 March 2017

Volume 50, pages 317–337, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Climate Dynamics Aims and scope Submit manuscript

Evaluation of reanalysis datasets against observational soil temperature data over China

Download PDF

Kai Yang^1,2 &
Jingyong Zhang^1,2

1225 Accesses
46 Citations
1 Altmetric
Explore all metrics

Abstract

Soil temperature is a key land surface variable, and is a potential predictor for seasonal climate anomalies and extremes. Using observational soil temperature data in China for 1981–2005, we evaluate four reanalysis datasets, the land surface reanalysis of the European Centre for Medium-Range Weather Forecasts (ERA-Interim/Land), the second modern-era retrospective analysis for research and applications (MERRA-2), the National Center for Environmental Prediction Climate Forecast System Reanalysis (NCEP-CFSR), and version 2 of the Global Land Data Assimilation System (GLDAS-2.0), with a focus on 40 cm soil layer. The results show that reanalysis data can mainly reproduce the spatial distributions of soil temperature in summer and winter, especially over the east of China, but generally underestimate their magnitudes. Owing to the influence of precipitation on soil temperature, the four datasets perform better in winter than in summer. The ERA-Interim/Land and GLDAS-2.0 produce spatial characteristics of the climatological mean that are similar to observations. The interannual variability of soil temperature is well reproduced by the ERA-Interim/Land dataset in summer and by the CFSR dataset in winter. The linear trend of soil temperature in summer is well rebuilt by reanalysis datasets. We demonstrate that soil heat fluxes in April–June and in winter are highly correlated with the soil temperature in summer and winter, respectively. Different estimations of surface energy balance components can contribute to different behaviors in reanalysis products in terms of estimating soil temperature. In addition, reanalysis datasets can mainly rebuild the northwest–southeast gradient of soil temperature memory over China.

Investigating spatiotemporal changes of the land-surface processes in Xinjiang using high-resolution CLM3.5 and CLDAS: Soil temperature

Article Open access 16 October 2017

A 10-Yr Global Land Surface Reanalysis Interim Dataset (CRA-Interim/Land): Implementation and Preliminary Evaluation

Article 01 February 2020

Evaluation of ERA5 and NCEP reanalysis climate models for precipitation and soil moisture over a semi-arid area in Kuwait

Article 15 March 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Soil temperature is a key land surface parameter that contributes considerably to climate variations and predictions (Tang et al. 1988; Wang 1991; Liu and Avissar 1999; Zhou and Huang 2006; Fan 2009; Wang et al. 2013). As a reflection of the land surface thermal conditions, soil temperature plays an important role in the energy and water balance of the land surface. Through interaction with the atmosphere, soil temperature has been demonstrated to have substantial effects on monthly to interannual climate variations (Hu and Feng 2004a, b; Mahanama et al. 2008; Wu and Zhang 2014). By inducing an eastward-propagating cyclone, a warm May subsurface soil temperature in the western United States can lead to more June precipitation in the southern United States and less precipitation in the north (Xue et al. 2012). The modeling work of Wu and Zhang (2014) emphasized the importance of subsurface soil temperature on summer surface air temperature variability over arid and semi-arid regions of Eastern Asia. As a slow variable of the land surface, soil temperature can “remember” climate anomalies and release their effects in subsequent seasons (Hu and Feng 2004a). The memory of soil temperature can persist for 1 month to years, depending on soil depth, season, and climate regime (Liu and Avissar 1999; Yang and Zhang 2015). Soil temperature memory is considered to be a potential predictor for seasonal climate anomalies and extremes, and could improve forecasts of monthly to interannual climate (Xue et al. 2012; Yang and Zhang 2015).

Reanalysis datasets have been widely adopted in climatology research to complement incomplete observational records (Robock et al. 2000; Koster et al. 2004; Zhang et al. 2008; Kim and Alexander 2013). The evaluation of reanalysis products, which is essential and critical because of the uncertainty that may be caused by data assimilation and forecast models, provides a reference for applying reanalysis datasets in different regions and fields (Dirmeyer et al. 2004; Hodges et al. 2011; Bao and Zhang 2012; Shah and Mishra 2014). Many variables, such as surface air temperature, radiative fluxes, precipitation, and wind speed, from different reanalysis datasets have been widely evaluated (Simmons et al. 2004; Chaudhuri et al. 2012; Lindsay et al. 2014). According to previous evaluation works, generally speaking, no one product performs better than the others in all fields and regions (Makshtas et al. 2007; Mao et al. 2010; Chaudhuri et al. 2012). The reanalysis data of land surface parameters, such as soil moisture, have also been evaluated. The 40-year European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis (ERA-40) dataset has been found to perform better than the National Centers for Environmental Prediction-National Center for Atmospheric Research (NCEP-NCAR) reanalysis 1 and the NCEP-Department of Energy (NCEP-DOE) datasets for reproducing the mean value and interannual variability of soil moisture in China (Li et al. 2005). Albergel et al. (2012) compared soil moisture in the ECMWF’s Interim reanalysis (ERA-Interim) dataset with in situ soil moisture observations from 117 stations across the world (Australia, Africa, America, and Europe). They found that the ERA-Interim dataset generally overestimates soil moisture, especially over dry land. However, it still performs well for surface soil moisture variability.

Considering the importance of soil temperature in the land system, recently, several works have assessed the forecast quality of soil temperature in numerical weather predictions (Holmes et al. 2012; Albergel et al. 2015). Holmes et al. (2012) assessed surface soil temperature from the Integrated Forecasting System from ECMWF, the modern-era retrospective analysis for research and applications (MERRA) from the NASA Global Modeling and Assimilation Office, and the global data assimilation system used by NCEP over Oklahoma. Albergel et al. (2015) used soil temperature measurements over the United States and Europe to assess ECMWF forecasts of soil temperature during 2012. They found that the ECMWF forecasts can generally represent the annual and diurnal cycle of soil temperature. Furthermore, they highlighted the importance of orographic data for estimating soil temperature. However, to the best of our knowledge, evaluations of soil temperature data from reanalysis products over China are still scarce.

Our present study evaluates four well-known reanalysis datasets, namely the land surface reanalysis of ECMWF (ERA-Interim/Land), the second MERRA (MERRA-2), the NCEP Climate Forecast System Reanalysis (CFSR), and version 2 of the Global Land Data Assimilation System (GLDAS-2.0), by comparing them with the observational soil temperature data over China, to provide a reference for the ability of these products to describe the seasonal mean, interannual variability and linear trend of soil temperature, and also, for their ability to estimate soil temperature memory over China. To explore the possible reasons for the different behaviors of the four reanalysis datasets, we further examine the relationship between soil temperature and surface energy balance components. The remainder of this paper is arranged as follows. The observational and reanalysis datasets are described in Sect. 2. Section 3 shows the results of the evaluation. Discussion and conclusions are presented in Sects. 4 and 5, respectively.

2 Data and methods

2.1 Observations

The observational data used in this study are the monthly mean soil temperature of 626 stations over China for the period of 1981–2005, provided by the China Meteorological Administration. The dataset has nine soil layers of 0, 5, 10, 15, 20, 40, 80, 160, and 320 cm. Owing to the limitation of data availability, we only retain the stations with complete records for specific periods and soil layers. Figure 1 shows the spatial distribution of the stations with available soil temperature data for all nine soil layers for summer (June, July, and August), winter (December, January, and February), and for all 12 months of all 25 years. These stations are mostly located over the east of China. In this study, bilinear interpolation is used to interpolate reanalysis data to stations.

2.2 ERA-Interim/Land

The ERA-Interim/Land dataset (Dee et al. 2011; Balsamo et al. 2015) is the newest land surface model simulation produced by ECMWF, covering 1979–2010. Based on a spatial resolution of 80 km (T255 spectral), the soil temperature data in ERA-Interim/Land have four layers with depths of 0–7, 7–28, 28–100, and 100–289 cm. Forced by the near-surface meteorological fields from ERA-Interim (Dee et al. 2011) and precipitation adjustments based on the Global Precipitation Climatology Project Version 2.1 (Huffman et al. 2009), ERA-Interim/Land is executed using the latest version of the Hydrology-Tiled ECMWF Scheme for Surface Exchanges over land (HTESSEL, Balsamo et al. 2009). Compared with the Tiled ECMWF Scheme for Surface Exchanges over Land (TESSEL) used in ERA-Interim, HTESSEL has significant improvements in terms of soil hydrology, snow scheme, vegetation climatology, and bare-soil evaporation. The ERA-Interim/Land dataset has been found to show more agreement with the observations for latent and sensible heat fluxes, soil moisture, and snow than the ERA-Interim dataset, and is considered to be more suitable for climate applications in terms of land surface parameters. The ERA-Interim/Land data are used on a 0.5°×0.5° grid in our study.

2.3 MERRA-2

The MERRA-2 dataset (Bosilovich et al. 2016), as a replacement for the MERRA reanalysis (Rienecker et al. 2011) produced by NASA, uses an upgraded version of the Goddard Earth Observing System Model, Version 5 (GEOS-5) data assimilation system, including the GEOS-5 atmospheric model (Rienecker et al. 2008; Molod et al. 2015) and the Gridpoint Statistical Interpolation (GSI) analysis scheme (Wu et al. 2002). Compared with MERRA, MERRA-2 uses observation-based precipitation data instead of model-generated precipitation to force the land surface parameterization, and it includes numerous additional satellite observations. The soil temperature in MERRA-2 has six layers with thicknesses of 9.88, 19.52, 38.59, 76.27, 150.7, and 1000 cm, and is provided on a grid with 576 points in the longitudinal direction and 361 points in the latitudinal direction (0.625°×0.5°).

2.4 CFSR

The CFSR dataset (Saha et al. 2010; Meng et al. 2012) is the newest global, high-resolution reanalysis covering 1979–2009 developed by NCEP. Using the Noah four-layer land surface model, CFSR adopts the NASA land information system (LIS) to execute the global land data assimilation system (GLDAS/LIS; Mitchell et al. 2004; Rodell et al. 2004; Peters-Lidard et al. 2007) to perform the land surface analysis. GLDAS/LIS is forced by the atmospheric data assimilation output of CFSR and observational precipitation, including the pentad data of the Climate Prediction Center (CPC) Merged Analysis of Precipitation (Xie and Arkin 1997) and the CPC unified global daily gauge analysis. The soil depths of the four soil layers are 0–10, 10–40, 40–100, and 100–200 cm.

2.5 GLDAS-2.0

The GLDAS-2.0 dataset (Rodell et al. 2004) is the newest reanalysis as part of the mission of NASA’s Earth Science Division covering 1948–2010, and is archived and distributed by the Goddard Earth Sciences (GES) Data and Information Services Center (DISC). Based on the Noah model, GLDAS-2.0 is forced by the global meteorological forcing dataset from Princeton University (Sheffield et al. 2006). In spite of the model version upgrade, GLDAS-2.0 uses the MODIS-based land surface parameter datasets and includes initialization of soil moisture over desert. The bottom-layer temperature in the Noah model is also updated, compared with GLDAS-1. GLDAS-2.0 has two resolutions of 1°×1° and 0.25°×0.25°, and the resolution of GLDAS-2.0 used in our study is 0.25°×0.25°. Similar to CFSR, GLDAS-2.0 has four soil layers with thicknesses of 0–10, 10–40, 40–100, and 100–200 cm.

3 Results

In land surface models, as an unavoidable limitation, soil temperature is given as an average of a soil layer. Linear interpolation is usually adopted to approximately calculate the soil temperature at certain soil depths, which may cause bias in the evaluation. To present the best behavior of each reanalysis dataset, we evaluate both the layer-averaged soil temperature (LA-ST) and the soil temperature interpolated to the nine observational soil depths using linear interpolation (INTER-ST) by comparing them with the observations (OBS-ST). Figure 2 shows the vertical distribution of the mean LA-ST, INTER-ST, and OBS-ST of the stations with complete observational records in summer and winter. As an observational fact to support the applicability of linear interpolation, the vertical variation of OBS-ST is approximately linear, especially for the soil temperature at 0–40 cm depth. All four reanalysis datasets generally underestimate the soil temperature. And for each reanalysis product, the comparison of LA-ST and INTER-ST depends on soil depth. Comparing LA-ST for the four datasets, in general, the ERA-Interim/Land and GLDAS-2.0 datasets are closer to the observations than the MERRA-2 and CFSR datasets. In summer, the LA-ST of the GLDAS-2.0 dataset at 0–10 cm is relatively higher than the other datasets, and the ERA-Interim/Land dataset has similar values for LA-ST at 0–7 cm. For summer LA-ST at 10–28 and 40–100 cm, the ERA-Interim/Land dataset has a higher estimation than the other datasets. The LA-ST of the GLDAS-2.0 dataset at 28–40 and 100–200 cm is relatively closer to the observations. The MERRA-2 dataset shows good behavior in representing the summer LA-ST in its second soil layer, which may be due to the fact that it has more soil layers than the other reanalysis datasets. The LA-ST of the CFSR dataset is relatively lower than the others datasets. For LA-ST in winter, the ERA-Interim/Land dataset has the highest estimation at 0–10 and 28–69.77 cm, and even has a higher value than the observations (estimated based on the assumption of linear variation) around 28 and 100 cm. The GLDAS-2.0 dataset has a relatively higher estimation for soil temperature at 10–28 cm. The MERRA-2 dataset has the highest values for LA-ST at 69.77–100 cm and 144.26–294.94 cm (the soil depth of the fifth soil layer of MERRA-2). Six soil layers are used in the MERRA-2 dataset, which is the most layers among the four datasets, makes MERRA-2 dataset has the closest winter soil temperature to the observations at deep soil layers, but also leads to a large bias in its estimation in summer. In addition, we find that the bias between the reanalysis data and the observations is larger for soil temperature in summer than in winter.

To investigate the ability of the reanalysis data to rebuild the spatial distribution of soil temperature in summer and winter, we choose 40 cm as an example. For each reanalysis dataset, INTER-ST or LA-ST at 40 cm is selected, depending on which one is closer to observations (shown in Fig. 2), to present the spatial distribution. Therefore, INTER-ST at 40 cm in the ERA-Interim/Land and MERRA-2 datasets and LA-ST of the second layer of the CFSR and GLDAS-2.0 datasets are adopted as the soil temperature at 40 cm in summer. The LA-ST of the third layer of the four datasets is chosen as the soil temperature at 40 cm in winter. The observed soil temperature in summer shows a clear north–south difference over the east of China, with relatively high values in the south and relatively low values in the north (Fig. 3a). Over the west of China, summer soil temperature is relatively larger in the north than in the south. All four reanalysis datasets, which mainly show negative anomalies, generally capture those spatial characteristics, and have fewer discrepancies with the observations in the east than in the west of China. Relatively, the ERA-Interim/Land and GLDAS-2.0 datasets show more in common with the observations compared with the other products. The MERRA-2 dataset also performs well at reproducing the summer soil temperature in central and south China, but has a relatively large bias for soil temperature in north China (Fig. 3). For soil temperature in winter, as shown in Fig. 4, the observations also show a north–south disparity in the east of China, which is generally reproduced by the four reanalysis datasets. The GLDAS-2.0 dataset stands out as having the smallest bias, with a positive anomaly in the north and a negative anomaly in the south. The ERA-Interim/Land dataset has a relatively large overestimation of winter soil temperature over north and northwest China, which could be the main reason for the large national mean LA-ST of the ERA-Interim/Land dataset at 40 cm shown in Fig. 2. The MERRA-2 and CFSR datasets show good behavior in terms of reproducing the winter soil temperature over the east of China, and the CFSR dataset has a relatively smaller bias. Compared with the estimation of soil temperature in summer, the reanalysis datasets have a smaller bias for soil temperature in winter.

We also calculate the multiyear mean, correlation coefficient with observation, root mean square difference of mean soil temperature of stations with complete records at 40 cm in summer and winter. As is shown in Table 1, except for ERA-Interim/Land dataset in winter, reanalysis data have an underestimation of soil temperature comparing with observations. GLDAS-2.0 datasets shows the smallest bias of multiyear mean both in summer and winter. And it also performs better than other datasets in terms of correlation coefficient and root mean square difference, except for having a relatively low correlation coefficient with observations in winter.

Table 1 Multiyear mean (MM, °C), correlation coefficient with observations(CC), standard deviation (SD), root mean square difference (RMSD, °C), linear trend (LT, °C/year) and memory lengths (STM, months) of mean soil temperature of stations with complete records at 40 cm in summer (JJA) and winter (DJF)

Full size table

Standard deviation is adopted to represent the interannual variability of soil temperature. The standard deviation of soil temperature in summer for the observations has a spatial distribution characterized by obvious regional disparity, with higher values in the north than in the south (Fig. 5a). The reanalysis datasets also show a north–south gradient, and generally underestimate the interannual variability of summer soil temperature. Relatively, the ERA-Interim/Land dataset has a more similar spatial distribution to the observations than the other datasets. The other three datasets can mainly capture the interannual variability of summer soil temperature over south China (Fig. 5). Unlike the simple north–south disparity of summer soil temperature, the interannual variability of the observed soil temperature for winter is characterized by a high–low–high pattern from north to south (Fig. 6a). The CFSR dataset has a similar distribution to the observations. The GLDAS-2.0 dataset shows an underestimation in most areas, and can generally reproduce the spatial patterns of the observations over the east of China. The ERA-Interim/Land and MERRA-2 datasets do not rebuild the large standard deviation over north China, but they still capture the spatial characteristics of the observations over central China and south China (Fig. 6). In addition, ERA-Interim/Land and MERRA-2 datasets show the smallest bias of the standard deviation of mean soil temperature of stations with complete records in summer and winter respectively (Table 1).

Figure 7 shows the spatial distributions of the linear trend of soil temperature from observations and reanalysis datasets in summer. The linear trend of observed summer soil temperature has a relatively high value in north and a relatively low value in south, which has been rebuilt by CFSR and GLDAS-2.0 datasets. Four reanalysis datasets generally have an underestimation of the linear trend over north and northwest China. Relatively, GLDAS-2.0 dataset has the smallest bias comparing with others. For the linear trend of mean soil temperature of stations with complete records in summer, GLDAS-2.0 also has a closer value with observations than other datasets (Table 1). Comparing with the linear trend of observed soil temperature in summer, the linear trend of observed soil temperature in winter has a relatively higher value over south China (Fig. 8). Reanalysis datasets have all underestimated the linear trend over north China, and fail to rebuild the north–south disparity shown in observations. MERRA-2 dataset has a relatively closer value with observations for the linear trend of mean soil temperature of stations with complete records in winter (Table 1).

Soil temperature, as a reflection of the land surface thermal conditions, is highly related to the surface energy balance:

$${\text{S}}{{\text{R}}_{\text{n}}} + {\text{L}}{{\text{R}}_{\text{n}}} + {\text{SH}} + {\text{LH}} = {\text{G}}$$

(1)

where SR_n is the net downward shortwave radiation, LR_n is the net downward longwave radiation, SH is the sensible heat flux, LH is the latent heat flux, and G is the soil heat flux (Meng et al. 2012). Therefore, the quality of the estimations of surface energy balance components may have an influence on the estimations of soil temperature. Figure 9 shows the seasonal cycle of the observed soil temperature for 0–320 cm and the soil heat flux. During April–September, soil temperature in the upper layers is higher than in the deeper layers, and during November–March, the deeper layers have a higher soil temperature than the upper layers. Surface soil temperature peaks in July, while soil temperature at 80 and 320 cm peaks in August and October, respectively. In general, the four reanalysis products show a similar annual variability of the soil heat flux, which turns from negative to positive around February, then peaks in April and turns to negative around September–October. Positive and negative soil heat fluxes correspond to the soil gaining and losing energy. Therefore, the land surface is gaining energy from the atmosphere in spring and summer and losing energy to the atmosphere in autumn and winter. In the land surface models, LA-ST is usually calculated based on the heat diffusion equation. During April–September, the land surface transfers energy, which is got from atmosphere, to the deep soil layers, and during autumn and winter, the deep soil upwardly releases energy to the land surface.

The largest amount of energy that the land surface receives from the atmosphere is during April–June, while the highest land surface soil temperature appears around June–August. This phenomenon could be due to the fact that soil temperature has a memory ability for climate. We compare the increment of soil temperature from spring to summer in the first layer of reanalysis products with the averaged soil heat flux during April–June (Fig. 10a, c). The reason for using summer soil temperature minus spring soil temperature is considering that the energy from the atmosphere influences the summer soil temperature on the basis of the spring soil temperature. The increment of soil temperature from spring to summer corresponds well to the soil heat flux for the four products, with the CFSR dataset showing the highest values, followed by the ERA-Interim/Land and MERRA-2 datasets. The GLDAS dataset has both the smallest increment of soil temperature and soil heat flux. Therefore, in land surface models, the estimations of land surface energy balance components during April–June can significantly influence the estimations of summer soil temperature. In autumn and winter, energy is transmitted upward from the deep layers to the surface. Therefore, the soil heat flux should have an anti-correlation with the reduction in soil temperature, as a smaller reduction in surface soil temperature means more energy is transmitted from the deep soil layer to surface, leading to more energy being released to the atmosphere. As shown in Fig. 10b, d, the reduction of soil temperature from autumn to winter and the soil heat flux in winter correspond well. With the largest soil temperature reduction from autumn to winter, the CFSR dataset has the smallest winter soil heat flux. The ERA-Interim/Land dataset has the smallest soil temperature reduction and the largest winter soil heat flux.

Except for the ERA-Interim/Land dataset, the magnitudes of standard deviation for soil temperature and soil heat flux also correspond well (Fig. 11). The CFSR dataset, with the largest standard deviation for the increase of soil temperature from spring to summer, has the largest standard deviation of soil heat flux during April–June. The GLDAS-2.0 dataset has smaller standard deviations than the MERRA-2 dataset for the soil temperature increase and soil heat flux. The comparisons of standard deviations for the decrease of soil temperature from autumn to winter and for soil heat flux in winter are similar to the comparisons in summer. Therefore, the accuracy of estimating the interannual variability of soil temperature can be influenced by the estimation of the interannual variability of the soil heat flux.

Our previous work investigated the spatiotemporal characteristics of soil temperature memory over China based on the same observations, and emphasized its potential for improving our ability to predict seasonal climate (Yang and Zhang 2015). Owing to missing observational data, investigations of soil memory in some areas, especially northeast China and the Tibetan Plateau, are still scarce. Reanalysis products, as evaluated in this study, can be very helpful in compensating for this lack of data.

Based on the analysis in our previous work, we adopted the red noise method to calculate the soil temperature memory $(r(\tau ) = exp( - \tau /d))$, where d is the decay time scale, which characterizes the red noise process, and r(τ) is the autocorrelation coefficient at lag time τ (1 month in this study) (Jones 1975; Delworth and Manabe 1988). The 1-month autocorrelation coefficients of June and July, and July and August are averaged as the 1-month autocorrelation coefficients of summer, and the 1-month autocorrelation coefficients of November and December, and December and the January of the next year are averaged as the 1-month autocorrelation coefficients of winter.

We know that the main spatial characteristic of soil temperature memory is a northwest to southeast gradient, with relatively high values in northwest China, and relatively low values in southeast China, which can also been found in Fig. 12a. In summer, the ERA-Interim/Land dataset mainly shows an underestimation of soil temperature memory. It has a relatively larger memory length over northwest China than south China, which is generally consistent with the observations. The MERRA-2 dataset has a small bias of 0–2 months compared with the observations over the east of China, and it has a relatively larger bias over northwest China. The CFSR and GLDAS-2.0 datasets show similar spatial distributions of summer soil memory with an overestimation over north China, and they do not perform well at rebuilding the northwest–south disparity. For winter soil memory (Fig. 13), the ERA-Interim/Land dataset shows the northwest–south gradient, and overestimations over north China and northwest China. The memory lengths for the MERRA-2 dataset in northwest China are shorter than in the other datasets. Similar to the summer, the CFSR and GLDAS-2.0 datasets have a similar spatial pattern to soil memory in winter. They have a relatively smaller bias in the east than in the west of China. In spite of the poor skills presenting the spatial distribution of soil temperature memory, CFSR and GLDAS-2.0 datasets show the smallest bias of the memory lengths of mean soil temperature of stations with complete records in summer and winter respectively (Table 1).

4 Discussion

Figure 2 shows that the four reanalysis datasets perform differently at different soil depths, and there is no one product that performs better than the others at all soil depths. The evaluation results of soil temperature at 40 cm may not be applicable for the soil temperature at other depths. We also evaluated soil temperature at 80 cm using the same methods (not shown). In general, the four reanalysis datasets show similar spatial distributions of the seasonal mean and standard deviation for soil temperature at 40 and 80 cm, but not exactly the same. The MERRA-2 dataset has a better ability for reproducing the spatial characteristics of soil temperature at 80 cm than the other products. So the evaluation of soil temperature at 40 cm can’t be completely applied as the evaluation the soil temperature at other depths.

In this study, we only investigated the relationship between soil temperature and surface energy balance components to determine the reason for the different behaviors of the reanalysis datasets. In fact, except for energy balance, many land and atmospheric parameters and processes can influence or interact with soil temperature. Albergel et al. (2015) emphasized the importance of orography, soil moisture, and snow cover on the forecast of soil temperature in ECMWF. The orography over the west of China is much more complex than in the east of China, which could be the main reason that reanalysis products perform better in east than in west. Soil moisture, as a crucial parameter of the land–atmosphere interaction, is highly correlated with soil temperature (Subin et al. 2012). Heat transport in the soil column is usually based on the thermal gradient, and the heat conductivity is closely related to soil moisture (Koster et al. 2000). Soil moisture can also influence the surface energy balance by impacting the latent heat flux. The dynamics of snow, which is sensitive to air temperature, can influence the surface energy balance and then alter the soil temperature (Zhang et al. 2005a, b; Khoshkhoo et al. 2015). Soil and vegetation characteristics, and soil frost also play an important role in land energy and water balance (D’Odorico et al. 2007; Wu et al. 2011; Collow et al. 2014). The estimations of these parameters and processes can influence the estimation of soil temperature in reanalysis products to varying degrees.

A major conclusion of our study is that reanalysis datasets generally show an underestimation of the soil temperature over China, which is consistent with the evaluation of soil temperature from ECMWF forecasts during 2012 over Europe (Albergel et al. 2015). Ma et al. (2008) also found that ERA-40, NCEP/NCAR, and NCEP/DOE datasets have underestimated the air temperature (which is high correlated with soil temperature) over China. As mentioned above, Albergel et al. (2015) investigated the influence of orography data and snow cover on the estimation of soil temperature. They chose Darrington station (Washington DC, USA) as an example and found that the orography correction can make the surface soil temperature larger than the original data and closer to the observations. Zhao et al. (2008) demonstrated that “topographic correction” can notably improve the quality of surface air temperature in NCEP-NCAR and ERA-40, which have generally underestimate the surface air temperature over China. Albergel et al. (2015) investigated the impact of snow on soil temperature running offline ECMWF land surface model for a single grid point over the Wild Basin station (Colorado, USA). And they found that soil starts warming until snow depth reach 10 cm in model, while in observations, soil starts warming when soil depth is still 40 cm. They also mentioned that in model, soil starts warming until the snow depth of the entire grid point is less than 10 cm. These two limitations of land surface model can result in an underestimation of soil temperature. Another factor, the land use changes can also lead to an underestimation of soil temperature in reanalysis data. Urbanization (urban heat island effect) and other land use changes can contribute to a high surface temperature (Zhang et al. 2005a, b). While these anthropogenic changes in land surface condition are poorly described in models.

We found that reanalysis data perform better for estimating soil temperature in winter than in summer, especially over the east of China. This may be due to the effect of precipitation on soil temperature. Precipitation can influence the soil moisture and latent heat flux of land surface, which then directly or indirectly impacts the soil temperature. Influenced by the Asian monsoon, China has more precipitation in summer than in winter. Figure 14 shows the correlation coefficients of the soil temperature and precipitation from observations and reanalysis datasets. The observational precipitation data are provided by the China National Climate Center (http://ncc.cma.gov.cn/Website/index.php?ChannelID=43&WCHID=5). The stations of observational soil temperature data (ST-STATION) are different from the stations of observational precipitation data (PRE-STATION). For each PRE-STATION, the soil temperature of the adjacent ST-STATION (with a latitude and longitude difference from the latitude and longitude of the PRE-STATION less than 1° respectively) is selected or averaged (if there are more than one adjacent ST-STATION) as the soil temperature of this PRE-STATION. PRE-STATIONs with no adjacent ST-STATION are abandoned. We can find that the reanalysis datasets generally show similar spatial distribution characteristics with observations. In the four reanalysis datasets, soil temperature in summer shows high correlation with precipitation over most areas of the east of China, while in winter, there is no significant correlation between soil temperature and precipitation over those regions (Fig. 14). Therefore, the quality of the estimations of summer precipitation can play an important role in the estimations of summer soil temperature. The observed precipitation data adopted by the reanalysis products (mentioned in Sect. 2) can be very helpful for improving the reliability of the estimated soil temperature in summer.

5 Conclusions

In this study, we evaluated soil temperature from four reanalysis datasets, namely ERA-Interim/Land, MERRA-2, CFSR, and GLDAS-2.0, in terms of climatological mean, interannual variability, linear trend and memory lengths by comparison with observational data over China for 1981–2005. The magnitude of soil temperature averaged over the study period is generally underestimated by all four reanalysis datasets, which can be due to the limitations of models at reproducing the topographic characteristics, snow cover and land use changes. The ERA-Interim/Land and GLDAS-2.0 datasets have a relatively closer national mean to the observations than the MERRA-2 and CFSR datasets. Benefitting from the utilization of six soil layers, the MERRA-2 dataset has a good ability for rebuilding the winter soil temperature. For soil temperature at 40 cm, the four datasets all rebuild similar spatial distributions to the observations, and the GLDAS-2.0 dataset stands out as having a smaller bias in both summer and winter. The ERA-Interim/Land dataset shows a similar spatial distribution to the GLDAS-2.0 dataset for soil temperature in summer. The spatial distribution for the interannual variability of soil temperature, as characterized by standard deviation, is well reproduced by the ERA-Interim/Land dataset in summer and by the CFSR dataset in winter. Reanalysis datasets can generally rebuild the linear trend of soil temperature in summer.

The reanalysis products generally perform better in the east of China than in the west of China, which could be due to the fact that the orography over the west of China is much more difficult to describe than over the east of China. Furthermore, the reanalysis products perform better in winter than in summer. Soil temperature in the reanalysis data is significantly correlated with precipitation in summer over the east of China, while in winter, the correlation is small and insignificant. Hence, the estimation of summer precipitation can have an important influence on the estimation of summer soil temperature.

We have demonstrated that summer soil temperature is highly correlated with the soil heat flux during April–June, and winter soil temperature is related to the soil heat flux in winter, which highlights the importance of the estimation of land surface energy balance components on the estimation of soil temperature.

The four datasets can mainly rebuild the northwest–southeast gradient of soil temperature memory, and the ERA-Interim/Land dataset is more consistent with the observations in both summer and winter. In addition, the four reanalysis datasets have different abilities for estimating soil temperature at different soil depths. The results of the evaluation of soil temperature at 40 cm may not be applicable for the soil temperature of the other layers.

References

Albergel C, de Rosnay P, Balsamo G, Isaksen L, Muñoz-Sabater J (2012) Soil moisture analyses at ECMWF: evaluation using global ground-based in situ observations. J Hydrometeorol 13(5):1442–1460
Article Google Scholar
Albergel C, Dutra E, Muñoz-Sabater J, Haiden T, Balsamo G, Beljaars A, Isaksen L, de Rosnay P, Sandu I, Wedi N (2015) Soil temperature at ECMWF: An assessment using ground-based observations. J Geophys Res Atmos 120(4):2014JD022505
Article Google Scholar
Balsamo G, Beljaars A, Scipal K, Viterbo P, Hurk Bvd, Hirschi M, Betts AK (2009) A revised hydrology for the ECMWF model: verification from field site to terrestrial water storage and impact in the integrated forecast system. J Hydrometeorol 10(3):623–643
Article Google Scholar
Balsamo G, Albergel C, Beljaars A, Boussetta S, Brun E, Cloke H, Dee D, Dutra E, Muñoz-Sabater J, Pappenberger F, de Rosnay P, Stockdale T, Vitart F (2015) ERA-Interim/Land: a global land surface reanalysis data set. Hydrol Earth Syst Sci 19(1):389–407
Article Google Scholar
Bao X, Zhang F (2012) Evaluation of NCEP–CFSR, NCEP–NCAR, ERA-Interim, and ERA-40 reanalysis datasets against independent sounding observations over the Tibetan Plateau. J Clim 26(1):206–214
Article Google Scholar
Bosilovich MG, Lucchesi R, Suarez M (2016) MERRA-2: file specification. GMAO Office Note No. 9 (Version 1.1). http://gmao.gsfc.nasa.gov/pubs/office_notes
Chaudhuri AH, Ponte RM, Forget G, Heimbach P (2012) A comparison of atmospheric reanalysis surface products over the ocean and implications for uncertainties in air–sea boundary forcing. J Clim 26(1):153–170
Article Google Scholar
Collow TW, Robock A, Wu W (2014) Influences of soil moisture and vegetation on convective precipitation forecasts over the United States Great Plains. J Geophys Res Atmos 119(15):2014JD021454
Article Google Scholar
D’Odorico P, Caylor K, Okin GS, Scanlon TM (2007) On soil moisture–vegetation feedbacks and their possible effects on the dynamics of dryland ecosystems. J Geophys Res Biogeosci 112(G4):G04010
Google Scholar
Dee DP, Uppala SM, Simmons AJ, Berrisford P, Poli P, Kobayashi S, Andrae U, Balmaseda MA, Balsamo G, Bauer P, Bechtold P, Beljaars ACM, van de Berg L, Bidlot J, Bormann N, Delsol C, Dragani R, Fuentes M, Geer AJ, Haimberger L, Healy SB, Hersbach H, Hólm EV, Isaksen L, Kållberg P, Köhler M, Matricardi M, McNally AP, Monge-Sanz BM, Morcrette JJ, Park BK, Peubey C, de Rosnay P, Tavolato C, Thépaut JN, Vitart F (2011) The ERA-Interim reanalysis: configuration and performance of the data assimilation system. Q J R Meteorol Soc 137(656):553–597
Article Google Scholar
Delworth TL, Manabe S (1988) The influence of potential evaporation on the variabilities of simulated soil wetness and climate. J Clim 1(5):523–547
Article Google Scholar
Dirmeyer PA, Guo Z, Gao X (2004) Comparison, validation, and transferability of eight multiyear global soil wetness products. J Hydrometeorol 5(6):1011–1033
Article Google Scholar
Fan X (2009) Impacts of soil heating condition on precipitation simulations in the weather research and forecasting model. Mon Weather Rev 137(7):2263–2285
Article Google Scholar
Hodges KI, Lee RW, Bengtsson L (2011) A comparison of extratropical cyclones in recent reanalyses ERA-Interim, NASA MERRA, NCEP CFSR, and JRA-25. J Clim 24(18):4888–4906
Article Google Scholar
Holmes TRH, Jackson TJ, Reichle RH, Basara JB (2012) An assessment of surface soil temperature products from numerical weather prediction models using ground-based measurements. Water Resour Res 48(2):W02531
Article Google Scholar
Hu Q, Feng S (2004a) A role of the soil enthalpy in land memory. J Clim 17(18):3633–3643
Article Google Scholar
Hu Q, Feng S (2004b) Why has the land memory changed? J Clim 17(16):3236–3243
Article Google Scholar
Huffman GJ, Adler RF, Bolvin DT, Gu G (2009) Improving the global precipitation record: GPCP Version 2.1. Geophys Res Lett 36(17):L17808
Article Google Scholar
Jones RH (1975) Estimating the variance of time averages. J Appl Meteorol 14(2):159–163
Article Google Scholar
Khoshkhoo Y, Jansson PE, Irannejad P, Khalili A, Rahimi H (2015) Calibration of an energy balance model to simulate wintertime soil temperature, soil frost depth, and snow depth for a 14 year period in a highland area of Iran. Cold Reg Sci Technol 119:47–60
Article Google Scholar
Kim J-E, Alexander MJ (2013) Tropical precipitation variability and convectively coupled equatorial waves on submonthly time scales in reanalyses and TRMM. J Clim 26(10):3013–3030
Article Google Scholar
Koster RD, Suarez MJ, Ducharne A, Stieglitz M, Kumar P (2000) A catchment-based approach to modeling land surface processes in a general circulation model: 1. Model structure. J Geophys Res Atmos 105(D20):24809–24822
Article Google Scholar
Koster RD, Suarez MJ, Liu P, Jambor U, Berg A, Kistler M, Reichle R, Rodell M, Famiglietti J (2004) Realistic initialization of land surface states: impacts on subseasonal forecast skill. J Hydrometeorol 5(6):1049–1063
Article Google Scholar
Li H, Robock A, Liu S, Mo X, Viterbo P (2005) Evaluation of reanalysis soil moisture simulations using updated Chinese soil moisture observations. J Hydrometeorol 6(2):180–193
Article Google Scholar
Lindsay R, Wensnahan M, Schweiger A, Zhang J (2014) Evaluation of seven different atmospheric reanalysis products in the Arctic. J Clim 27(7):2588–2606
Article Google Scholar
Liu Y, Avissar R (1999) A study of persistence in the land–atmosphere system using a general circulation model and observations. J Clim 12(8):2139–2153
Article Google Scholar
Ma L, Zhang T, Li Q, Frauenfeld OW, Qin D (2008) Evaluation of ERA-40, NCEP-1, and NCEP-2 reanalysis air temperatures with ground-based measurements in China. J Geophys Res 113:D15115
Article Google Scholar
Mahanama SPP, Koster RD, Reichle RH, Suarez MJ (2008) Impact of subsurface temperature variability on surface air temperature variability: an AGCM study. J Hydrometeorol 9(4):804–815
Article Google Scholar
Makshtas A, Atkinson D, Kulakov M, Shutilin S, Krishfield R, Proshutinsky A (2007) Atmospheric forcing validation for modeling the central Arctic. Geophys Res Lett 34(20):L20706
Article Google Scholar
Mao J, Shi X, Ma L, Kaiser DP, Li Q, Thornton PE (2010) Assessment of reanalysis daily extreme temperatures with China’s homogenized historical dataset during 1979–2001 using probability density functions. J Clim 23(24):6605–6623
Article Google Scholar
Meng J, Yang R, Wei H, Ek M, Gayno G, Xie P, Mitchell K (2012) The land surface analysis in the NCEP climate forecast system reanalysis. J Hydrometeorol 13(5):1621–1630
Article Google Scholar
Mitchell KE, Lohmann D, Houser PR, Wood EF, Schaake JC, Robock A, Cosgrove BA, Sheffield J, Duan Q, Luo L, Higgins RW, Pinker RT, Tarpley JD, Lettenmaier DP, Marshall CH, Entin JK, Pan M, Shi W, Koren V, Meng J, Ramsay BH, Bailey AA (2004) The multi-institution North American Land Data Assimilation System (NLDAS): utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. J Geophys Res Atmos 109(D7):D07S90
Article Google Scholar
Molod A, Takacs L, Suarez M, Bacmeister J (2015) Development of the GEOS-5 atmospheric general circulation model: evolution from MERRA to MERRA2. Geosci Model Dev 8(5):1339–1356
Article Google Scholar
Peters-Lidard CD, Houser PR, Tian Y, Kumar SV, Geiger J, Olden S, Lighty L, Doty B, Dirmeyer P, Adams J, Mitchell K, Wood EF, Sheffield J (2007) High-performance Earth system modeling with NASA/GSFC’s land information system. Innov Syst Softw Eng 3(3):157–165
Article Google Scholar
Rienecker MM, Suarez MJ, Todling R, Bacmeister J, Takacs L, Liu H-C, Gu W, Sienkiewicz M, Koster RD, Gelaro R, Stajner I, Nielsen JE (2008) The GEOS-5 data assimilation system: documentation of versions 5.0. 1, 5.1.0, and 5.2.0. NASA technical report series on global modeling and data assimilation, vol 27. https://ntrs.nasa.gov/archive/nasa/casi.ntrs.nasa.gov/20120011955.pdf
Rienecker MM, Suarez MJ, Gelaro R, Todling R, Julio Bacmeister, Liu E, Bosilovich MG, Schubert SD, Takacs L, Kim G-K, Bloom S, Chen J, Collins D, Conaty A, Silva Ad, Gu W, Joiner J, Koster RD, Lucchesi R, Molod A, Owens T, Pawson S, Pegion P, Redder CR, Reichle R, Robertson FR, Ruddick AG, Sienkiewicz M, Woollen J (2011) MERRA: NASA’s modern-era retrospective analysis for research and applications. J Clim 24(14):3624–3648
Article Google Scholar
Robock A, Vinnikov KY, Srinivasan G, Entin JK, Hollinger SE, Speranskaya NA, Liu S, Namkhai A (2000) The global soil moisture data bank. Bull Am Meteorol Soc 81(6):1281–1299
Article Google Scholar
Rodell M, Houser PR, Jambor U, Gottschalck J, Mitchell K, Meng C-J, Arsenault K, Cosgrove B, Radakovich J, Bosilovich M, Entin JK, Walker JP, Lohmann D, Toll D (2004) The global land data assimilation system. Bull Am Meteorol Soc 85(3):381–394
Article Google Scholar
Saha S, Moorthi S, Pan H-L, Wu X, Wang J, Nadiga S, Tripp P, Kistler R, Woollen J, Behringer D, Liu H, Stokes D, Grumbine R, Gayno G, Wang J, Hou Y-T, Chuang H-Y, Juang H-MH, Sela J, Iredell M, Treadon R, Kleist D, Delst PV, Keyser D, Derber J, Ek M, Meng J, Wei H, Yang R, Lord S, Dool HVD, Kumar A, Wang W, Long C, Chelliah M, Xue Y, Huang B, Schemm J-K, Ebisuzaki W, Lin R, Xie P, Chen M, Zhou S, Higgins W, Zou C-Z, Liu Q, Chen Y, Han Y, Cucurull L, Reynolds RW, Rutledge G, Goldberg M (2010) The NCEP climate forecast system reanalysis. Bull Am Meteorol Soc 91(8):1015–1057
Article Google Scholar
Shah R, Mishra V (2014) Evaluation of the reanalysis products for the monsoon season droughts in India. J Hydrometeorol 15(4):1575–1591
Article Google Scholar
Sheffield J, Goteti G, Wood EF (2006) Development of a 50-year high-resolution global dataset of meteorological forcings for land surface modeling. J Clim 19(13):3088–3111
Article Google Scholar
Simmons AJ, Jones PD, da Costa Bechtold V, Beljaars ACM, Kållberg PW, Saarinen S, Uppala SM, Viterbo P, Wedi N (2004) Comparison of trends and low-frequency variability in CRU, ERA-40, and NCEP/NCAR analyses of surface air temperature. J Geophys Res Atmos 109(D24):D24115
Article Google Scholar
Subin ZM, Koven CD, Riley WJ, Torn MS, Lawrence DM, Swenson SC (2012) Effects of soil moisture on the responses of soil temperatures to climate change in cold regions. J Clim 26(10):3139–3158
Article Google Scholar
Tang M, Zhang J, Wang J, Zhang L (1988) The similarity between the seasonal anomalous maps of soil temperature and the precipitation of the subsequent season (in Chinese). Acta Meteorol Sin 46(4):481–485
Google Scholar
Wang W (1991) Numerical experiments of the soil temperature and moisture anomalies’ effects on the short term climate (in Chinese). Sci Atmos Sin 15(5):116–123
Google Scholar
Wang Y, Chen W, Zhang J, Nath D (2013) Relationship between soil temperature in may over Northwest China and the East Asian summer monsoon precipitation. Acta Meteorol Sin 27(5):716–724
Article Google Scholar
Wu L, Zhang J (2014) Strong subsurface soil temperature feedbacks on summer climate variability over the arid/semi-arid regions of East Asia. Atmos Sci Lett 15(4):307–313
Google Scholar
Wu W-S, Purser RJ, Parrish DF (2002) Three-dimensional variational analysis with spatially inhomogeneous covariances. Mon Weather Rev 130(12):2905–2916
Article Google Scholar
Wu SH, Jansson PE, Zhang XY (2011) Modelling temperature, moisture and surface heat balance in bare soil under seasonal frost conditions in China. Eur J Soil Sci 62(6):780–796
Article Google Scholar
Xie P, Arkin PA (1997) Global precipitation: a 17-year monthly analysis based on gauge observations, satellite estimates, and numerical model outputs. Bull Am Meteorol Soc 78(11):2539–2558
Article Google Scholar
Xue Y, Vasic R, Janjic Z, Liu YM, Chu PC (2012) The impact of spring subsurface soil temperature anomaly in the western U.S. on North American summer precipitation: A case study using regional climate model downscaling. J Geophys Res Atmos 117(D11):D11103
Article Google Scholar
Yang K, Zhang J (2015) Spatiotemporal characteristics of soil temperature memory in China from observation. Theor Appl Climatol. doi:10.1007/s00704-015-1613-9. 1–11
Zhang J, Dong W, Wu L, Wei J, Chen P, Lee DK (2005a) Impact of land use changes on surface warming in China. Adv Atmos Sci 22(3):343–348
Article Google Scholar
Zhang Y, Chen W, Smith SL, Riseborough DW, Cihlar J (2005b) Soil temperature in Canada during the twentieth century: complex responses to atmospheric climate change. J Geophys Res Atmos 110(D3):D03112
Google Scholar
Zhang J, Wang W-C, Wei J (2008) Assessing land-atmosphere coupling using soil moisture from the Global Land Data Assimilation System and observational precipitation. J Geophys Res Atmos 113(D17):D17119
Article Google Scholar
Zhao T, Guo W, Fu C (2008) Calibrating and evaluating reanalysis surface temperature error by topographic correction. J Clim 21:1440–1446
Article Google Scholar
Zhou L, Huang R (2006) Characteristics of interdecadal variability of the difference between surface temperature and surface air temperature in spring in arid and semi-arid region of Northwest China and its impact on summer precipitation in North China (in Chinese). Clim Environ Res 11(1):1–13
Google Scholar

Download references

Acknowledgements

The ERA-Interim/Land data used in this study were obtained from the ECMWF data server: http://apps.ecmwf.int/datasets/. MERRA-2 is an official product of the Global Modeling and Assimilation Office (GMAO) at NASA GSFC, supported by NASA’s Modeling, Analysis, and Prediction (MAP) program. The CFSR data were developed by NOAA’s NCEP. The data for this study are from the Research Data Archive (RDA), which is maintained by the Computational and Information Systems Laboratory (CISL) at the National Center for Atmospheric Research (NCAR). NCAR is sponsored by the National Science Foundation (NSF). The original data are available from the RDA (http://dss.ucar.edu) in dataset number ds093.2. The GLDAS-2.0 data used in this study were acquired as part of the mission of NASA’s Earth Science Division and archived and distributed by GES DISC. This work was supported by the National Natural Science Foundation of China (Grant No. 41275089), the National Basic Research Program of China (2012CB955604), and the Jiangsu Collaborative Innovation Center for Climate Change.

Author information

Authors and Affiliations

Center for Monsoon System Research, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing, 100029, China
Kai Yang & Jingyong Zhang
University of Chinese Academy of Sciences, Beijing, 100049, China
Kai Yang & Jingyong Zhang

Authors

Kai Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jingyong Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kai Yang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, K., Zhang, J. Evaluation of reanalysis datasets against observational soil temperature data over China. Clim Dyn 50, 317–337 (2018). https://doi.org/10.1007/s00382-017-3610-4

Download citation

Received: 24 September 2016
Accepted: 27 February 2017
Published: 17 March 2017
Issue Date: January 2018
DOI: https://doi.org/10.1007/s00382-017-3610-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of reanalysis datasets against observational soil temperature data over China

Abstract

Similar content being viewed by others

Investigating spatiotemporal changes of the land-surface processes in Xinjiang using high-resolution CLM3.5 and CLDAS: Soil temperature

A 10-Yr Global Land Surface Reanalysis Interim Dataset (CRA-Interim/Land): Implementation and Preliminary Evaluation

Evaluation of ERA5 and NCEP reanalysis climate models for precipitation and soil moisture over a semi-arid area in Kuwait

1 Introduction