1 Introduction

The Tibetan Plateau (TP), located in central Asia, is the largest and highest plateau on earth, with an average elevation in excess of 4000 m. It has often been referred to as the ‘Roof of the World’ or the ‘Third Pole’ of the earth (Qiu 2008; Yao et al. 2019). Moreover, The TP is known for its significant impact on regional and even global climates through the actions of both mechanical and thermal forcing (Nan et al. 2009; Yao et al. 2012). The TP is also referred to as the ‘water tower of Asia’ because it is the headwater catchment of several major Asian rivers, including the Yellow, Yangtze, Lancang-Mekong, Salween, Ganges–Brahmaputra, and Indus rivers; thus, it supports the environment and livelihood of over 1.4 billion people (Immerzeel et al. 2010). The signature of climate warming in the TP has been apparent over the past half-century, particularly in winter (Guo and Wang 2012; Wang et al. 2008), and the warming trend has been more significant than that at similar latitudes elsewhere (Liu and Chen 2000). Furthermore, this warming has resulted in glacial recession, permafrost degradation, and increases in river discharge and lake levels according to numerous studies (Li et al. 2014, 2017; Wang et al. 2017, 2019a, b; Yang et al. 2010a, b; Yao et al. 2012).

Regional and global climates have changed over the past century, and such changes are expected to continue in the future (IPCC 2013). A series of climate modeling tools have been adopted to investigate and understand the climate over the TP (Gao et al. 2015, 2017; Jiang et al. 2019; Maussion et al. 2011; Shi et al. 2011; You et al. 2018). Owing to their relatively coarse spatial resolution, general circulation models (GCMs) can not usually describe the effects of regional features on the climate adequately (Gu et al. 2015; Jiang et al. 2016; Su et al. 2013; Wang et al. 2018; You et al. 2017; Yu et al. 2018). Meanwhile, regional climate models (RCMs) are able to depict regional heterogeneity and fine-scale forcing (i.e., complex terrain and vegetational variety); thus, they are regarded as the most frequently used tool for basin-scale climate studies (D’Orgeville et al. 2014; Erler and Peltier 2017; Gu et al. 2012; Gula and Peltier 2012; Guo et al. 2018; Peltier et al. 2018; Xu et al. 2005). Previous investigations have suggested that RCMs can reasonably model Asian monsoon climatology and its variability (Gao et al. 2012; Gu et al. 2018; Huo and Peltier 2019; Kim et al. 2008; Oh et al. 2014). Furthermore, RCMs have proven to be effective in enhancing our understanding of the interactions between the surface factors that impact precipitation and the atmospheric system and the effect of external forcing (i.e., sea surface temperature, changes in land surface (Sun et al. 2017)) on the dynamics of the monsoon system (Song et al. 2010; Wang et al. 2016a, b).

Among a dozen of the RCMs currently available (Tapiador et al. 2020), the various versions of RegCM have been some of the most frequently applied to investigate the climate of the TP, as well as many other areas in the world (Ali et al. 2015; Giorgi et al. 2012; Koné et al. 2018; Seth et al. 2007; Sinha et al. 2014; Wang et al. 2013). Moreover, the complexity of RegCM has continued to evolve, with the model version being consistently updated. For example, many aspects were improved in the most recent version released in 2017, including the ability to modify physical parameterizations and the introduction of a new land surface scheme (CLM4.5) to enable the depiction of land surface processes related to land biogeophysics, the hydrologic cycle, biogeochemistry, human dimensions, and ecosystem dynamics. Because many alternative parameterizations can be employed to represent the same physical process in RegCM4, uncertainties are generated in the simulation when the model is applied to any particular region. These uncertainties must be investigated to tune the model to perform optimally in the target region (Huang et al. 2013; Li et al. 2016).

Many sensitivity analyses have been conducted in the context of the work described in this paper to identify a suitable domain size (Almazroui 2015; Dash et al. 2014; Rauscher et al. 2006), adequate horizontal resolution (Gao et al. 2006; Liu et al. 2010), potential driving GCMs (Park et al. 2013), and a physics parameterization scheme (Im et al. 2008; Kang et al. 2014; Wang et al. 2016a, b). When monsoon rainfall is simulated by climate models, the convective parameterization is commonly regarded as the most critical physical process to be accurately represented. In addition, the surface latent heat flux parameterizations over land and the ocean are vital owing to their influence in controlling the quantity of the earth’s surface moisture entering the atmosphere (Chen and Avissar 1994; Li et al. 2012). Although a number of scientific reports have confirmed the absence of a best “universal” model setup, improved results have been obtained for certain model configurations under particular climate conditions. However, the current climate models still have limited capability to reproduce the historical climate over the TP. First, the TP is the highest and most extensive highland in the world, with highly heterogeneous terrain and land surface characteristics. Moreover, the TP has a unique planetary boundary layer (PBL), which is approximately 9 km above the sea level, much higher than the surrounding lowlands (Chen et al. 2016). Thus, it is difficult for climate models to properly treat the complex surface heterogeneity and critical processes such as glacier/snow cover–related albedo feedbacks (Su et al. 2013). Meanwhile, the mechanical and thermodynamic forces over the TP are influenced by the Asian monsoon systems, which presents a challenge for current state-of-the-art climate models to simulate (Song and Zhou 2013; Sperber et al. 2013). The work described in the present paper is devoted to investigating a wide range of model configurations as a first step toward the creation of an appropriate ensemble of integrations of the global warming process targeted on the TP.

Therefore, this study is similar in design to many of the above-referenced RCM investigations with the primary novelty being the work focused on the TP region. RCMs have been widely used in regional climate studies over the TP, and remarkably encouraging results have been achieved (Gao et al. 2011; Maussion et al. 2014; Wang et al. 2013, 2016a, b). Xu et al. (2018) and Lin et al. (2018) investigated the impact of the model resolution on precipitation simulation and water vapor transport over the TP by the REMO and WRF models, individually. The results indicate the importance of a higher resolution in precipitation simulation over a complex terrain region. Jiang et al. (2019) evaluated the simulations of summer rainfall over the TP by the WRF model with two horizontal resolutions and two cumulus schemes. The sensitivity of the RegCM simulations to the land surface models on the TP was studied by Wang et al. (2014). In contrast to previous studies that concentrated on the impact of a single configuration of physical parameterizations applied to modeling the climate at 20–50 km horizontal resolutions for a period of 5–10 years (Ali et al. 2015; Ji and Kang 2013; Shi et al. 2018; Wang et al. 2014; Xu et al. 2018; Zhang et al. 2015), this study was conducted at a finer (10 km) horizontal resolution for a 20-year-period as a basis to evaluate the performance of the climate model with different parameterization schemes. For this purpose, we also selected the source regions of five large rivers as subregions, and validated the model specifically upon these different river basins. The reasons for the selection the TP region as the study area include the following: (a) the TP region is a known “hotspot” that has significantly attracted the interest of national decision-makers; (b) a series of problems continue to exist for the TP when attempts at climate model validation have been conducted; (c) there is a lack of knowledge concerning the temporal and spatial variations of the climate over the TP, which play an important role in studies of terrestrial ecosystems and hydrological processes, as well as a need to further explore the impact of the TP on the Asian monsoon climate.

This article is organized as follows. Descriptions of the climate model to be employed and its most relevant parameterization schemes, as well as the data employed and the experimental design are provided in Sect. 2. In Sect. 3, the results of the performed simulations are presented together with a discussion of their implications. A summary of our conclusions is offered in Sect. 4.

2 Study area, data, model, and experiments

2.1 Study area

The TP covers five provinces of China (Xizang, Qinghai, Gansu, southern Xinjiang, and western Sichuan) and occupies the region bounded by 25–40° N latitude and 75–105° E longitude (Fig. 1). The TP is famous for its large number of contributing mountain ranges, such as the Kunlun Mountains in the north, the Tangula Mountains in the central region, and the Himalayas on the southernmost margin. The elevation in the TP decreases gradually from the northwest to the southeast, with a mean altitude of 4292 m. Vegetation types include forest, temperate shrubland/meadow, temperate/alpine desert, and ice/polar desert (Cui and Graf 2009).

Fig. 1
figure 1

Domain configurations of the nested RegCM simulations with a D1: 30-km resolution and b D2: 10-km resolution and the Tibetan Plateau investigation area. The black dots indicate the monitoring stations used to construct the observational reference. The boundaries of the five source river basins are shown in b (YE Yellow River, YA Yangtze River, ME Mekong River, SA Salween River, BR Brahmaputra River)

As previously mentioned, five major river basins situated on the TP were selected as foci for the purpose of the study, namely the upper reaches of the Yellow River (YE), Yangtze River (YA), Mekong River (ME), Salween River (SA), and the Brahmaputra River (BR) (Fig. 1). These five basins display a wide variety of climate, topography, land cover, and hydrological regimes. The Indian monsoon, East Asian monsoon, and the mid-latitude westerlies are critical factors impacting the climate in these basins (Yang et al. 2014; Yao et al. 2012). The annual mean precipitation increases as the elevation diminishes, with more than 70% of the precipitation occurring in the months from May to September.

2.2 Regional climate model (RegCM)

RegCM version 4.6.1 (RegCM4) from ICTP (Giorgi et al. 2012) was applied in this study. It includes the latest non-hydrostatic version and a dynamical core resembling that of the “parent” hydrostatic version of MM5. The physical parameterizations in RegCM4 comprise the moisture scheme at the sub-grid scale (Pal et al. 2007), the radiation package (RAD) of the National Center for Atmospheric Research (NCAR) community climate model version 3 (CCM3) (Kiehl et al. 1996) and the Rapid Radiation Transfer Model (RRTM) (Mlawer et al. 1997), the non-local PBL scheme developed by Holtslag and Boville (1993) and the new University of Washington PBL (UW PBL) scheme (Bretherton et al. 2004; Grenier and Bretherton 2001), as well as several optional cumulus parameterizations such as the Kuo (Anthes 1977), Grell (Grell 1993), Emanuel (Emanuel 1991), Kain-Fritsch (Kain and Fritsch 1993), and Tiedtke (Kain and Fritsch 1993) schemes, ocean surface schemes (Zeng et al. 1998), and land surface schemes (BATS (Dickinson et al. 1993) and CLM4.5 (Oleson et al. 2008)).

In this study, 6 hourly ERA-Interim reanalysis datasets, which show better performance than other reanalysis data over the TP, with the spatial resolution of 1.5° × 1.5° were applied as the initial and boundary conditions to the climate model (Wang and Zeng 2012). The Optimum Interpolation Sea Surface Temperature is another type of forcing data for the RCM, which constitutes weekly timescale data obtained from the National Oceanic and Atmospheric Administration.

2.3 Observational datasets

The daily precipitation and 2-m air temperature dataset with the spatial resolution of 0.1° (1979–2015) was obtained from the Data Assimilation and Modeling Center of the Institute of Tibetan Plateau Research, Chinese Academy of Sciences (Wang and Zeng 2012). The precipitation data were generated by merging the station observations (Fig. 1; 95 stations located in and 34 stations located on the edge of the TP region) from the China Meteorological Administration with other sources of precipitation data such as the Princeton forcing data, Global Land Data Assimilation System (GLDAS) data, and Tropical Rainfall Measuring Mission (TRMM) satellite precipitation data. The applicability and reliability of the dataset have been verified in other studies (Liu et al. 2019; Wang et al. 2019a, b; Xie et al. 2017). Furthermore, these datasets have been used for land surface water and energy modeling in the TP (Xue et al. 2013) and compared with in situ observation data (Xie et al. 2017).

The monthly satellite-based total cloud cover data from 1983 to 2009 (Rossow and Schiffer 1999), with a native resolution of 2.5°, from the International Satellite Cloud Climatology Project (ISCCP) was employed to evaluate the RCM simulations. In addition, Lange et al. (2015) indicated that the uncertainty from the satellite observations mentioned above reached approximately 5%.

The monthly surface shortwave and longwave net radiation observations on a 1° grid extending from July 1983 to December 2007, developed by Stackhouse et al. (2011) was employed to evaluate the quality of our model simulations. Moreover, Lange et al. (2015) found that the uncertainties for the above shortwave and longwave radiations are 20 W/m2 and 5 W/m2, respectively. In addition, the ERA-Interim reanalysis data were selected as the reference data to serve as a basis for our investigation of the performance of the model in terms of wind circulation at 500 hPa.

2.4 Experimental setup

This study focused on the evaluation of climate model simulations over the TP based on different combinations of cumulus parameterization, land surface, PBL, and RAD schemes. First, five options for the cumulus convection scheme were considered: Kuo, Grell, Emanuel, Kain, and Tiedtke. The Kuo scheme, which is the simplest cumulus convection scheme (Giorgi et al. 2012), has been available since the earliest version of RegCM. The Emanuel is the most complex scheme used for depicting the cumulus process, and includes processes such as cloud mixing and ice processes. For the land surface model, CLM4.5 is a more sophisticated model than BATS; the former includes more elaborate surface characteristics, with more soil and snow layers, and uses explicit treatments for both liquid water and ice (Steiner et al. 2009). Meanwhile, the UW PBL and the RRTM are both newly implemented optional schemes in RegCM.

In contrast to other studies, we focused on the sensitivity of RegCM4 with a large number of different configurations, including the combinations of five convection schemes, two land surface models, two PBL schemes, and two radiation schemes. The modeling experiments compromised the current computing resources. Therefore, the convection scheme, which is the most sensitive scheme for precipitation, was treated as the primary calibration scheme. Then, the best convection scheme was selected and used to calibrate the other physical schemes individually. This study included analyses of a total of nine combinations of physical parameterizations to further evaluate the performance of RegCM4. As summarized in Table 1, C1–5 share the same land surface physics, RAD, and PBL schemes to quantify the impact of the cumulus parameterizations. C5 and C6 share the same cumulus, RAD, and PBL schemes, aimed at identifying the effects of two different land surface models, etc. Additionally, C7, C8, and C9 were compared with C5 to test the model sensitivity to the RAD, PBL, and resolution, respectively.

Table 1 Nine conducted experiments with five different cumulus parameterization schemes, two radiation schemes, two PBL schemes, two land surface models, and two spatial resolutions

For the experiments listed in Table 1 (except the last experiment), the center of the model grid was fixed at 31° N, 94° E, with 250 grid points along each longitudinal circle and 220 grid points along each latitude direction. The buffer zone at the outer boundaries of the model wass set to be 12 grid points deep with exponential relaxation toward the model forcing and nudging decreasing in strength toward the interior of RegCM. The model was first employed with a horizontal resolution of 30 km and 18 vertical levels (from the surface up to 10 hPa). The study period covered 20 years, ranging from 1989 to 2008, as previously stated. In addition, 1988 was selected as a spin-up period to allow for a full year of equilibration.

The final experiment in the set to be described employed a double-nested dynamic downscaling method to simulate the climate cover the whole TP at a 10-km resolution with 380 grid points in the longitudinal direction and 232 grids in the latitudinal direction. The 10-km resolution simulation was run within the 30-km resolution domain and used similar physical configurations as those in the C6 experiment. Figure 1b shows the double-nested model domain of C9. A comparison of the two domains demonstrates that the topography is much more highly resolved in the innermost domain at 10-km resolution than in the outermost domain at 30-km resolution (Fig. 1).

Several commonly used statistical measures were employed to quantify the performance of the RegCM4 simulations (Samouly et al. 2018): absolute bias, relative bias, spatial correlation coefficient, and root mean square error.

3 Results

We first analyzed the ability of RegCM4 to simulate the seasonal distributions of precipitation, total cloud cover (TCC), 2-m air temperature, surface shortwave and longwave net radiations, and the wind field at 500 hPa. Next, our analyses focused on the climate of the five river basins considering both the seasonal cycles and interannual variability. Finally, we considered the distribution of the daily precipitation intensities computed using the different RegCM configurations.

3.1 Seasonal climatology

3.1.1 Precipitation

We commence with a discussion of the most important atmospheric variable as far as this study is concerned. The mean precipitations during winter [December–January–February (DJF)] and summer [June–July–August (JJA)] are displayed in Fig. 2. In JJA, the precipitation exhibits a strong southeast-northwest gradient in the rainfall pattern, with the rainfall decreasing from the southeast (rainfall > 5 mm/day) to the northwest (rainfall < 1 mm/day) due to the impact of the Asian summer monsoon in the southern regions. All the model simulations can reproduce these primary precipitation features, but considerable differences exist between the different parameterization schemes. Summarizing a few statistical measures for all five convection schemes (Table S.1), the biases of the mean summer precipitation simulations range from − 1.3 to 91% over the TP for the period 1989–2008. There is a common overestimate of the summer precipitation for all convection schemes, except for a 1.3% underestimate from the Kain scheme. The Kuo scheme overestimates the summer precipitation in the western TP and underestimates the precipitation in the eastern TP. Similarly, the Grell and Kain schemes overestimate the summer precipitation in the northern TP and underestimate the precipitation in the middle and southern TP. As a result, the Kuo, Grell, and Kain schemes exhibit the lowest spatial correlation coefficient (i.e., < 0.3), even though they are characterized by lower average relative biases in summer precipitation. In contrast, the Emanuel and Tiedtke schemes show a higher spatial correlation coefficient, although they significantly overestimate the rainfall compared with observations. As we know, different convection schemes have different physical hypotheses. The Emanuel scheme focuses on the local atmospheric stability, while the Tiedtke scheme focuses on the large-scale water vapor movement (Zhang et al. 2015). The Emanuel scheme is more active in transporting humid air from low levels to high levels, generating more precipitation (Ali et al. 2015). We believe for a thicker PBL, the unique characteristics of deep convection and entrainment in the Tiedtke scheme are responsible for the wet bias reduction in summer precipitation. Considering the above, it is not yet possible to infer a “best” convection scheme based on the ability of the parameterization to reproduce the multi-year mean precipitation.

Fig. 2
figure 2

Observed mean precipitation and simulations for winter (DJF, three leftmost columns) and summer (JJA, three rightmost columns) during the years 1989–2008. For each season, the left column shows the seasonal mean value, while the middle and right columns show the absolute and the relative differences between the observation (topmost row) and the model simulations (from top to bottom), respectively. Here, the model setups (from top to bottom) have been defined in Table 1

Based on the evidence from previous studies, the precipitation amount is expected to be considerably reduced when using CLM to replace BATS as the land surface scheme (Gao et al. 2016). More specifically, the combination of the Emanuel and Tiedtke schemes with the CLM reduces the positive results, which is not the case for other convective schemes when applied over the TP. Similar findings were reinforced by additional simulations using the CLM as the land surface model. The Emanuel and Tiedtke schemes combined with BATS overestimate the summer precipitation by 2.30 and 1.68 mm/day, respectively, while the summer precipitation biases were reduced to 1.36 and 0.97 mm/day, respectively, when combined with CLM. In addition, the overestimations are primarily in the western and southern parts of the TP, where the observations are limited. Considering that the CLM is one of the state-of-the-art land surface models, with possibly the most-advanced parametrizations, it is recommended to be used for areas with complex land surface condition like the TP (Wang et al. 2014). Therefore, the Tiedtke scheme combined with the CLM is suggested to be the preferred convection scheme for annual average precipitation simulations over the TP.

Compared with the convection scheme and land surface model, the PBL parameterizations and radiation schemes have little effect on precipitation. To further investigate the impact of a horizontal resolution on a historical climate simulation, a nested experiment at 10-km resolution was performed. The summer precipitation overestimation at this higher resolution was reduced to 0.43 mm/day. This shows that the higher resolution model performs significantly better at reproducing the magnitude and distribution of historical precipitation.

The winter is characterized by very low precipitation rates (< 0.5 mm/day) over most regions of the TP based on observation. The winter precipitation is higher over the Himalayas, in particular in the western and eastern parts, where it can reach more than 2 mm/day. Most simulations could reproduce the above spatial pattern. However, all models consistently overestimate the winter precipitation to be approximately 3 times higher than that observed. This is consistent with the previous studies, which indicated that higher winter precipitation is commonly estimated in this region with RCM (Dimri and Niyogi 2013; Gao et al. 2015; Ji and Kang 2013; Maussion et al. 2014; Ménégoz et al. 2013). Winter precipitation is not sensitive to the cumulus parameterization scheme and model resolution, implying that it is mainly non-convective. It is also important to note that most winter precipitation occurs in the form of snow. Thus, this overestimation in the winter precipitation will simulate larger snow cover extent and snow depth, which will impact the radiation budget because of the albedo changes (Guo et al. 2019). In turn, this will encourage 2-m air temperature cooling by subsequently reducing the net shortwave radiation (Rangwala and Miller 2012).

3.1.2 Total cloud cover

Clouds play a significant role in the water and energy balance of the atmosphere. The winter and summer mean values of the TCC fractions are shown in Fig. 3. Generally, most models overestimate the winter TCC and underestimate the summer TCC over the TP. In JJA, all the convection schemes show similar results and reproduce the TCC well, with an areal average bias of less than 8% compared with the ISCCP observation data, excluding the Kuo scheme, which underestimates the TCC by approximately 11%. In DJF, all models overestimate the TCC in the north and west parts of the TP and underestimate the TCC in the southeast part of the TP, although an areal average bias of less than 4% was found. In general, the Emanuel and Tiedtke schemes exhibit better performances in reproducing the TCC in magnitude and spatial distribution.

Fig. 3
figure 3

Mean cloud cover fraction versus observations during winter (DJF, two leftmost columns) and summer (JJA, two rightmost columns) from 1989 to 2008. For each season, the left column shows the seasonal mean value, and the right column presents the absolute difference between the observation (topmost row) and the model simulations (from top to bottom) with the different configurations defined in Table 1

Compared with the convection schemes, a change of land surface model results in larger TCC changes. Generally, the CLM land surface model yields less mean cloud cover of the TP compared with the BATS scheme, although it will partially offset the overestimation in winter. In contrast, the UW PBL parameterization and RRTM radiation scheme tend to yield more cloud cover compared with their default options. Compared with the 30-km simulations, the higher 10-km resolution model has little effect on the model performance in reproducing the TCC over the TP.

3.1.3 Surface shortwave net radiation

Figure 4 shows the spatial distribution of the simulated and observed surface net SW radiations (SW net) for the whole sky. The SW net in winter is much lower than that in summer, and shows a clear decreasing gradient from south to north over the TP. However, the highest SW net was found in the western region of the TP in summer due to the low rainfall and cloud cover in this area. Overall, the model underestimates the SW net, resulting in less solar absorption at the land surface compared to the satellite observations with a bias of approximately − 37 to − 70 W/m2 in the summer and winter. A negative bias is widely distributed throughout the entire domain, particularly over the western TP. Regarding the spatial variability, the Tiedtke scheme shows the most realistic spatial distribution of the SW net (R = 0.72 and 0.73 in summer and winter, respectively). Unlike the TCC, the CLM yields a higher SW net value in summer compared with the BATS scheme, which has a positive impact by reducing the bias. As reported from several previous studies, this anti-correlation between the SW net and TCC biases is physically rational considering the reflection effect of clouds (García-Díez et al. 2015; Pessacg et al. 2014).

Fig. 4
figure 4

Same as Fig. 3, but for surface net SW radiation

3.1.4 Surface longwave net radiation

Figure 5 shows the spatial distribution of the surface net LW radiation for the whole sky. The LW net has a similar magnitude and spatial distribution in winter and summer. The highest LW net was found over the Qaidam basin in the north TP. Regarding the absolute LW, the model clearly underestimates the observations and is not sensitive to the employed parameterization scheme to represent cumulus convection. However, using the CLM instead of the BATS land surface model significantly reduces the summer biases from more than 20 W/m2 in BATS to less than 4 W/m2 in CLM. This implies that the CLM land surface model generally increases the outgoing longwave radiation. Overall, the Tiedtke scheme combined with the CLM has the best performance in simulating longwave radiation.

Fig. 5
figure 5

Same as Fig. 3, but for surface net LW radiation

3.1.5 2-m air temperature

The DJF and JJA mean values of the 2-m air temperature, which are related to the surface radiation budget as well as to precipitation, are shown in Fig. 6. Most experiments can reproduce the observed seasonal mean 2-m air temperature over the TP very well, in terms of its spatial distribution. Moreover, the effects of topography on the surface air temperature can clearly be identified in both the observation and model simulations; the surface air temperature is warm in low-altitude areas such as the Qaidam basin and cold in high-altitude areas such as the south and west TP. Furthermore, there is a pronounced cold bias in both winter and summer, particularly in the western and southern TP. In addition, there is a significant warm bias over the northern TP in winter. This finding is in accordance with the results of Wang (2014), whose results were obtained using RegCM4 and the Grell convective scheme coupled with either the BATS or CLM over the TP. Moreover, the cold biases in the model simulations are partially due to the systematic bias in the ERA forcing data (Bao and Zhang 2012; Wang and Zeng 2012). The observations may also have warm bias because the meteorological stations mostly lie in valleys (Gu et al. 2012; Qin et al. 2009). Similarly, the cold biases in the simulations are most probably a direct result of the lack of observations in the western TP and high-latitude areas. In general, the Tiedtke scheme combined with the CLM shows the best performance in simulating the 2-m air temperatures. Contrary to some expectations, the double-nested 10-km resolution simulation tends to overestimate the surface temperature over the eastern TP, particularly in summer.

Fig. 6
figure 6

Same as Fig. 3, but for 2-m air temperature

3.1.6 Regional circulation patterns

It is also important to study the large-scale atmospheric circulation variations when investigating regional climate variations (Zhang et al. 2018). Hence, the model simulations are further analyzed herein with respect to the circulation characteristics by comparing our model results with the ERA Interim reanalysis. Unlike much of China, the mean sea level pressure is not a suitable field to use for studying the prevailing circulation characteristics over the land surface of the TP due to the high elevation of this region. Considering that the mean elevation of the TP area in this study is higher than 4200 m, and therefore that the 500-hPa level is just above the surface of the TP (Zhang et al. 2018), the 500-hPa wind field was employed for the purpose of analysis in this study.

Figure 7 shows a comparison of the spatial distribution of the 500-hPa wind field circulation obtained in our simulations with the results of the ERA-interim reanalysis for the JJA period. All models except the Kuo convection scheme depict the circulation pattern reasonably well; i.e., the midlatitude westerlies split into two branches near the western TP in summer, and the moisture is transported from the Arabian Sea and the Bay of Bengal onto the southeastern TP by the Indian summer monsoon. As can be seen in Fig. 7, the Grell and Kain schemes show a stronger westerly flow of air in the northern TP; whereas, the Emanuel and Tiedtke schemes show a stronger southerly flow in the south of the TP, which tends to transport more moisture to the TP and explains the precipitation overestimation. The wind bias in the southeast TP is reduced when using CLM as the land surface model, although the bias remains large in the westernmost TP. Despite this, all convective schemes, except the Kuo scheme, depict roughly similar biases over the TP. Due to the limited simulation domain, the wind field from the double-nested 10-km resolution simulation is not shown here because the large-scale circulation pattern could not be captured in this interior domain.

Fig. 7
figure 7

Spatial distributions of JJA wind fields at 500 hPa (two leftmost panels) and associated wind field biases (two rightmost panels) derived from observations (first row) and simulations with different configurations. The colors indicate the wind speed, and the arrows indicate the direction and intensity

3.2 Basin scale evaluation

In this section, we concentrate on the simulation results for precipitation and the 2-m air temperature over the five major river basins in the TP that were previously discussed, creating the foundation for the investigation regarding coupling RegCM4 with a hydrological model in the near future.

3.2.1 Seasonal cycles

Figure 8 shows the seasonal cycles of precipitation and the 2-m air temperature in the five river basins, upon which we decided to focus. For the entire TP, the ratio of the mean precipitation in the period of May–September to the annual precipitation exceeds 80% based on the observations. The Kuo scheme simulates an early peak in June; the Grell, Emanuel, and Kain model configurations simulate a late peak in August; while the Tiedtke configurations capture the observed peak in July very well. However, all the model configurations overestimate the observed precipitation, particularly the Emanuel scheme. The observations indicate a cold winter and warmer summer period with the highest temperatures occurring in July. The Kuo scheme is the only configuration that fails to capture the seasonal temperature pattern, simulating an early peak in June and overestimating the temperature in most months. The double-nested modelling at 10-km resolution tends to overestimate the temperature in summer, despite reproducing the seasonal cycle of the temperature well. All other model configurations reproduce the seasonal distribution of the temperature at 2 m well but tend to underestimate the temperature compared to the observations, particularly in the first half of the year. The simulation from Tiedtke combined with the CLM is closer to the observations compared to the other configurations.

Fig. 8
figure 8

Seasonal cycles of the yearly precipitation (top) and temperature (bottom) averaged over the whole TP and five subregions (left to right) for the given time period of 20 years (1989–2008)

Among the five basins, the source area of the Yellow River is located in the northeastern TP and contributes approximately 35% of the river’s annual discharge (Lan et al. 2010; Yuan et al. 2015). Excluding that using the Kuo scheme, all model configurations capture the seasonal cycles of the precipitation and temperature well; however, the Emanuel, Tiedtke, and Grell schemes overestimate the observed precipitation, particularly in summer. The overestimation is reduced when the Tiedtke scheme is the coupled with the CLM land surface model.

The source region of the Yangtze River is located to the south of that of the Yellow River, and covers the largest area among the five river source regions. The seasonal cycles of precipitation and temperature are well captured by most model configurations. Tiedtke coupled with CLM continues to show the best performance among the configurations.

Meanwhile, the source regions of the Mekong River and Salween River are two south-north-oriented narrow basins located in the eastern TP, and cover the smallest area. Most of the model configurations are able to capture the seasonal cycles of temperature in these basins well, but have difficulty in reproducing the seasonal patterns of precipitation. For example, the Grell and Kain schemes simulate an unrealistically low precipitation in June and underestimate the observed summer precipitation. Furthermore, Emanuel consistently overestimates the precipitation in most months. Tiedtke also overestimates the precipitation and shows little improvement when coupled with CLM, although it captures the seasonal patterns of precipitation well.

The Brahmaputra River in the northern Himalayas has the highest elevation catchment of any of the world’s great rivers, and the upper region is approximately 2000 km in length (You et al. 2007). All model configurations reproduced the seasonal temperature cycle with cold biases throughout the year, and in particular, they underestimate the temperature by more than 3 °C in the cold season (e.g., November–March). However, as shown in Fig. 8, all model configurations greatly overestimated the precipitation throughout the year, particularly the Kuo, Emanuel, and Tiedtke schemes, which overestimate the observed precipitation by more than 100%. A reduced overestimation was found to be characteristic of the Grell and Kain schemes, but they simulated unrealistically low precipitation in June. The CLM land surface model did not reduce the overestimation in the Brahmaputra River, but the model with the 10-km resolution did reduce the overestimation significantly compared with the 30-km resolution simulation (reduced from approximately 180% to 70%). This is primarily due to the relatively smooth topography in RegCM at the course resolution, which extends the rain-belt northward into the Plateau rather than limiting the rain-belt to the southern region of the Himalayas. A model with higher resolution could improve the precipitation simulation over this basin, although an overestimation might still occur.

In summary, the simulation from Tiedtke combined with the CLM model better reproduces the observed seasonal cycles of temperature and precipitation over the five subregions and the entire TP domain, compared with the other configurations investigated.

3.2.2 Interannual variability

The interannual variations of the yearly temperature and precipitation averaged over the whole TP and five subregions for the study period of 20 years (1989–2008) are shown in Fig. 9, including the results obtained for all the model configurations. Most of the experiments show a consistent wet and cold bias compared with the observation. The Emanuel scheme yields the largest wet bias in the whole TP domain. Meanwhile, the Grell scheme produces a relatively low bias for the precipitation but yields the largest cold bias for the temperature. Most models could simulate the interannual variations of the temperature well, with spatial correlation coefficients ranging from 0.54 to 0.85. However, the interannual variations of the precipitation were not captured by most models, with an average correlation coefficient of 0.29. The Kuo scheme has the worst performance in reproducing the interannual variabilities of precipitation and temperature.

Fig. 9
figure 9

Interannual variations of the yearly precipitation (top) and temperature (bottom) averaged over the whole TP and five subregions (left to right) for the given time period of 20 years (1989–2008)

It is useful to evaluate the ability of the model to simulate the regional climatic trends. A good performance in this respect could reinforce the credibility of the model in future climate projections. The linear trend values of the observed and simulated precipitation and 2-m air temperature during the period 1989–2008 averaged over the entire TP and five subregions are shown in Fig. 10. The observed precipitation trend shows a predominantly positive value over the entire TP, particularly in the Yellow and Brahmaputra River basins, where an increase of more than 10%/decade was found. This trend signal has been captured by most models, although they underestimate the magnitude. For the Yangtze, Mekong, and Salween River basins, where the observed precipitation trends are not significant, little agreement was found between the different simulations. The observed 2-m air temperature shows a pronounced warming during this period. Most models can reproduce this positive trend over the whole domain, though they also underestimate the magnitude. This underestimation of the warming trend is partly due to the fact that the ERA-interim reanalysis forcing data also underestimates the warming trend over the TP (Gao et al. 2015). The results indicate that there are numerous qualitative agreements between the observations and model simulations, and the simulated trend was influenced by the model configurations. It should be noted that the warming and wetting trends in the 20-year study period are more significant than those in the last 50 years (Xu et al. 2008). In addition, due to the large magnitude of interannual variability and the existence of multi-decadal variability (Ziegler et al. 2005), the study period is too short for a meaningful climate trend analysis, particularly for precipitation.

Fig. 10
figure 10

Observed and simulated annual precipitation and temperature trends during the period 1989–2008 averaged over the whole TP and five subregions

3.2.3 Precipitation intensities

Figure 11 displays the percentage distribution of the daily precipitation frequency and amount during the period from 1989 to 2008, averaged over the entire domain of the TP and five subregions for all experiments. Generally, the frequency and amount of daily precipitation events within the given intervals of intensity were well simulated by most experiments. However, the dry days (e.g., < 0.1 mm/day) were underestimated compared with the observations in most configurations. The results for the dry days show less underestimation in the five subregions compared with the entire TP, which indicates a more significant underestimation in the dry western TP. This is partly because RegCM usually “drizzles” excessively at the lowest intensities, particularly over the dry region (Halenka et al. 2006; Seth et al. 2004).

Fig. 11
figure 11

Ratio distribution of averaged observation and simulated daily precipitation frequency (left panel) and amount (right panel) over the whole TP and five subregions (from top to bottom) during 1989–2008

It is evident that the maximum frequency of the observed precipitation (0.1–1 mm/day) was well captured in most configurations. However, the model tends to overestimate the frequency of light (1.0–5.0 mm/day), and particularly moderate (5.0–10.0 mm/day) and heavy (> 10 mm/day) precipitation. It also overestimates the amount of heavy precipitation compared to the observed data, particularly in the Brahmaputra River basin. The model underestimated the amount of light precipitation and overestimated the amount of precipitation with an intensity higher than 7 mm/day in the Brahmaputra River basin, indicating that, in general, the overestimation of the precipitation in this basin derives from high-intensity events, which mainly occur in summer. This overestimation is mostly related to the enhancement of the summer convective precipitation because of the relatively coarse horizontal resolution of the exceptionally complex terrain along the Himalayas in this area. In fact, we found that the configurations (for example, the Emanuel cumulous convection scheme) with a wet bias in the annual precipitation are associated with an overestimation of the precipitation amounts for events with precipitation at rates higher than 10 mm/day. This is consistent with the results from Zhang et al. who argued that convective activity is overestimated in the Emanuel scheme, generating excessively heavy precipitation (Zhang et al. 2015). These results indicate that it remains difficult for the current version of RegCM4 to simulate extreme precipitation events in the TP.

4 Summary and conclusions

This paper presented a comprehensive evaluation of RegCM4 over the TP. Our analyses focus on climate factors including the precipitation, cloud cover, surface net radiation, 2-m air temperature, and surface wind circulation. We compared the performance of the model with nine distinct physics configurations, which differ in the parameterizations of cumulus parameterization, land surface processes, PBL, and radiation schemes. Furthermore, five major river basins were selected to thoroughly investigate the performance of RegCM4 on the basin scale.

The results indicate that the modeled climate is highly sensitive to the parametrizations selected for the representation of the sub-grid scale processes. In particular, precipitation is sensitive to the cumulus parameterization, with the Emanuel and Tiedtke schemes exhibiting a better performance in reproducing the spatial distribution of 20 years of mean precipitation, though they significantly overestimate the rainfall compared with the other schemes. However, the overestimation of precipitation is reduced when the model employs the CLM land surface model instead of the default BATS model. In addition, the model characterized by a higher resolution (10 km) shows better performance in reproducing the magnitude and distribution of the mean precipitation.

Compared with the convection schemes, a change of land surface model results in larger changes in cloud cover and surface radiation. Generally, the CLM results in the prediction of less mean cloud cover and more surface shortwave radiation and surface longwave radiation over the TP compared with the BATS scheme. In contrast, the UW PBL parameterization and RRTM radiation scheme tend to result in increased cloud cover and less surface shortwave and longwave radiations compared with the default options for these processes.

Most model configurations were able to reproduce the spatial distribution of 2-m air temperature well over the TP. However, significant cold biases were found to exist in both winter and summer, particularly in the western and southern TP. These cold biases may be partly caused by the systematic biases in the ERA forcing data and the overestimation in the observations associated with the fact that most meteorological stations lie in valleys (Wang et al. 2016a, b). Analogously, uncertainties in the observation dataset over the TP may also contribute to the model wet bias. Ma et al. (2015) concluded that an overall mean of 27% negative bias was found in the collected precipitation data from 241 meteorological stations across the TP and surrounding regions from ten countries; in particular, the largest bias was found in the unshielded Chinese standard precipitation gauge, which was used in the central TP region.

At the basin scale, most models were able to simulate the seasonal cycle and interannual variations of temperature well over the five different basins. However, the seasonal distribution and interannual variations of precipitation were not captured by most models. The simulated precipitation and temperature by the Tiedtke scheme combined with the CLM is closest to the observations among the simulations produced using nine different model configurations. The increasing trends for the historical precipitation and temperature was captured by most models, though they underestimate the magnitude. The most significant overestimation of precipitation was found in the Brahmaputra River basin, and this overestimation derives from high-intensity events.

Based on the above results, it can be concluded that the model is highly sensitive to the choice of configuration over the TP. Overall, the Tiedtke scheme combined with the CLM land surface model yields better results for seasonal mean climate and basin scale climate simulations compared to other model configurations. Previous studies have recommended the Grell and Emanuel schemes for South Asia and East Asia (Li et al. 2016; Maity et al. 2017; Raju et al. 2015; Zhang et al. 2015). However, over most parts of the TP and the five major source basins, our results suggest the Tiedtke scheme is superior, especially when it is combined with the CLM. In the near future, we will continue to perform long-period climate simulations to generate an ensemble of future climate projections and use them to investigate the response of terrestrial hydrological processes over the TP to climate change.