1 Introduction

The substantial evidences justify existence of global warming, causing changes in the hydrological cycle, both regionally and globally, therefore, it is essential to identify any potential implications of climatic variations on hydrological process including extreme events e.g. floods, droughts and even expedition of snow- and glacier-melt (Azmat et al. 2016; Chen et al. 2012). To identify possible fluctuations in hydrological regime subjected to climate changes, several steps are needed to be taken for the climate projections and interconnectivity of steps called as “model chain”. Previous studies explored that each step of model chain influences simulation, thereby contributes to uncertainty (Addor et al. 2014). Therefore, for each model chain step, research community developed different models and approaches with limitations. Steps of model chain comprise of emission scenarios e.g. GCMs, downscaling approaches/models, bias corrections and eventual projections.

GCMs are the principal source of projections (Dibike and Coulibaly 2005), however, direct use of GCMs is inhibited by the coarse spatial resolution which represents significant obstacle in projection of climate variables at basin level. Conversion from coarse to fine is, therefore, essential for climate change studies to reduce uncertainties (Meenu et al. 2013; Wang et al. 2013). The downscaling of climate variables, particularly precipitation, was reported as highly uncertain by several researchers which makes the selection of appropriate downscaling techniques and GCMs, tremendously essential (Hagemann et al. 2013; Wang et al. 2013). Taking this into consideration, several statistical downscaling approaches/models have been developed to overcome uncertainty problems (Chen et al. 2012). Lin et al. (2017) employed k-nearest neighbor method (KNN), a spatiotemporal statistical downscaling technique with integration of Bergen Climate Model (BCM2.0) and third generation Coupled Global Climate Model (CGCM3.1) to assess change in hourly precipitation and stated that this technique was efficient tool to downscale hourly precipitation data over Dahan River in northern Taiwan. Chen et al. (2012) evaluated different statistical downscaling techniques and GCMs for the downscaling of climate variables, subsequently, utilized to examine the impact of climate change on runoff of the upper Hanjiang basin, China. However, different downscaling techniques with integration of GCMs are always challenging in high-altitude and scarcely-gauged catchments. Typically, high-altitude catchments confront two kinds of data issues, first, unavailability of daily climate data at appropriate length (Liu et al. 2009); second, the quality of available precipitation data (Azmat et al. 2016). Such issues make downscaling from global into local scale climate variables very challenging and misrepresentation of ground data. Therefore, in this study we examine different GCMs, downscaling and bias correction approaches to rely on suitable combination of model chain for future projections. To best our knowledge, most of the studies were conducted in low-elevation regions where data issues were not extensive. Even very few studies were carried out by considering uncertainty issues following comparative analysis. The specific objective of this study was to examine the efficiency of five different GCMs, two well-known statistical downscaling models, a multiple regression hybrid statistical downscaling model (SDSM) and a supervised learning based smooth support vector machine (SSVM) and finally adaptation of two bias correction techniques, distribution mapping and linear scaling, for the projected data correction. Subsequently, we examined potential changes in precipitation and temperature under future scenarios by adopting suitable method/approach/model in model chain to overcome uncertainty propagation, in Jhelum River basin, Pakistan.

2 Study Area

This study was conducted in the Jhelum River basin, a trans-boundary river catchment shared by Pakistan and India, located in Western Himalayas and draining the southern slopes of the Himalayan range. The catchment area upstream of the Mangla dam (only storage structure) is 33,867 km2 (Fig. 1) and the estimated mean elevation of the study catchment is approximately 2094 m a.s.l. This catchment is influenced by two different precipitation patterns; the westerlies circulations and monsoon. Therefore, twofold precipitation peak occurs as recorded at most of the stations (i.e. Narran, Murree and Kotli) in the Jhelum River basin. The mean annual precipitation ranges from 485 mm at Astore to 1178 mm at Narran, respectively, while the mean annual temperature varies from 6.04 °C at Narran and 25.39 °C at Gujjar Khan station. Some key characteristics of the climate stations and data points for the un-gauged part of the basin and altitude zones of the Jhelum River basin are given in Table 1 and shown Fig. 1a.

Fig. 1
figure 1

(a) Geographical location of Pakistan including study area along with climate stations and data points of the APHRODITE over un-gauged part of the catchment, (b) Flowchart of methodology. The shaded boxes depict the major elements of model chain and white boxes show particulates of the element. Several methods were adopted for each element as listed below the box of corresponding element and process

Table 1 Some key characteristics of different climate stations located in Jhelum River basin

3 Datasets

Measured daily and monthly precipitation, Tmin (minimum temperature) and Tmax (maximum temperature) of 11 stations provided by Water and Power Development Authority (WAPDA) and Pakistan Meteorological Department (PMD) over different available control periods as mentioned in Table 1. Since, the Jhelum River is a trans-boundary river catchment, the catchment located in India, we used daily precipitation and mean temperature (Tmean) provided by Asian Precipitation-Highly-Resolved Observational Data Integration Towards Evaluation of Water Resources (APHRODITE) project with 0.25° × 0.25° resolution (Yatagai et al. 2012). This data have already been successfully used by several researchers (Kumar et al. 2016; Lutz et al. 2014) in high Asia mountains (e.g. Himalayan, Hindu-Kush and Karakoram Range). The daily precipitation (APHRO_MA_V1003R1) and mean temperature (APHRO_MA_TAVE_V1204R1) data products were downloaded and corrected (http://chikyu.ac.jp/precip) for 1961–2004 and 1961–2007, respectively.

The National Center for Environmental Protection (NCEP/NCAR) provided daily re-analysis atmospheric predictors for period of 1961–2000, used for the calibration and validation of downscaling models. Furthermore, the output of five different GCMs, Hadley Center’s (HadCM3), Coupled Global Climate Model (CGCM3), Community Climate System Model version 3 (NCAR-CCSM3), MPI-ECHAM5 and CSIRO-MK3.5 were used to scrutinize the suitable GCMs for the Jhelum River basin.

3.1 Conversion of Monthly into Daily Data

The availability of long term daily climate data are key for downscaling of climate variables, however, the selected study area is subjected to twofold scarcely-gauged issues, first, trans-boundary catchment, second, limited daily data over a feasible temporal scale and range. While daily data at most of the stations were collected after 1970 and the only monthly records were available from 1960 or earlier (Table 1). To deal with this issue, monthly observed climate data was converted into daily data by using MODAWEC model. The MODAWEC model is a Fortran language based parametric weather generator. This model has been applied successfully by Liu et al. (2009) to convert monthly Tmin, Tmax (°C) and precipitation (mm) into daily time scale. It is a parametric weather generator, which uses rainfall as a trigging variable to generate rainfall amount and event occurrence independently adopting stochastic relationship, while, temperature variables generated based on the generated rainfall in first step. The MODAWEC model utilizes first-order Markov chain to differentiate dry- or wet-days based on transitional probability matrices. For the derivation of daily precipitation, the first-order Markov chain follows two states (wet or dry) with transitional probabilities of “wet day follow a dry-day” and “dry-day follows previous wet-day”. The transition from wet to dry or dry to wet is governed by two transitional probabilities; first, in case of wet-day, the modified exponential distribution provide approximation of daily precipitation and second, the monthly given precipitation corrects final daily precipitation. The probability of wetness on corresponding day is based on dryness or wetness of previous day. The detailed discussion of MODAWEC model are provided by Liu et al. (2009). Further, the methodology flow chart of aforementioned equations is presented in Fig. S1 (supplement material). For temperature (minimum and maximum) generation, the MODAWEC employs algorithm designed by Richardson (1981) to approximate initial temperature values based on correlation with precipitation. The multivariate normal distribution generates the residual of daily Tmin and Tmax. Final corrected daily Tmin and Tmax were obtained by the correcting initial temperature values with help of monthly mean temperature. MODAWEC utilizes standard deviations (SD) of Tmin and Tmax as input, however, in case of absence of SD the long term monthly extreme may substitute SD.

In this study, the monthly precipitation and temperature variables (Tmin and Tmax) are available at large temporal range (i.e. 1960–2013) for most climate stations located in the study basin, the monthly extremes of Tmin and Tmax were extracted from the observed data. The number of wet days for each year was obtained from Climatic Research Unit (CRU) available at 50 × 50 km spatial resolution for 1901–2000. The MODAWEC model is freely available on http://www.eawag.ch/en/department/siam/software/ with complete program version and user help manual. Further description of MODAWEC is also available in supplement material.

4 Methods

We used two different downscaling methods; Statistical Downscaling Model (SDSM) and Smooth Support Vector Machine (SSVM), to scrutinize two most suitable GCMs out of those five (CGCM3, HadCM3, CCSM3, ECHAM5 and CSIRO-MK3.5), for the projection of reliable climate data. Downscaling was performed using 40 years’ baseline (1961–2000) observed data (Tmin and Tmax and precipitation). We calibrated and validated both downscaling techniques (SDSM and SSVM) for 30 years (1961 to 1990) and 10 years (1991 to 2000), respectively, using NCEP\NCAR reanalysis data. Subsequently, suitable GCMs for Jhelum River basin were utilized with downscaling of climate variables up to the end of the current century (2099). The generated future climate data were further bias corrected using distribution mapping and linear scaling methods with help of observed/control period of 1961–2000 (baseline) data. The change in future climate variables was analyzed for three time slices of 2011–2040 (2030s), 2041–2070 (2060s) and 2071–2099 (2090s). Further, detailed flow chart for the methodology adopted in this study is presented in Fig. 1b.

Since, precipitation data are very sensitive to hydrological predictions and are more important for the efficiency of downscaling techniques for the projection of precipitation data. Therefore, only precipitation data were utilized for the evaluation of downscaling techniques and analysis of GCMs using sophisticated statistical descriptors such as mean (mm/d), SD (mm/d), percentile (95%), percent wet days (%), maximum wet spell length (days), dry spell length (days) and 5-day maximum rainfall. In this study, 1 mm was selected as wet-threshold.

4.1 Statistical Downscaling Methods

4.1.1 Statistical Downscaling Model (SDSM)

Statistical Downscaling Model; a multiple regression hybrid downscaling approach, was developed by Wilby et al. (2002) and mathematical relationships are explained by Wilby and Dawson (2013). Model develops an empirical relationship between GCM predictors and local scale predictands to downscale climate variables. In this study, two major steps were taken to achieve the downscaling of climate variables. First, empirical relationship was established between predictors and predictands through processes such as quality control, screening, transformation and predictors’ selection, calibration, and the weather generator by using observed predictands. Second, the future climate data (temperature and precipitation) series were simulated using GCMs and selected NCEP\NCAR predictors in the first step. The most critical step in downscaling is to screen the best suitable predicators. Out of a total of twenty six, the most suitable large scale predictors were selected for each station during the “Screen Variables” process in the SDSM model (see Table S1). This model has been tested by several researchers for the downscaling of climate variables in different catchments (Chen et al. 2012; Najafi and Kermani 2017) to examine climate change implications. The SDSM model is freely available at http://co-public.lboro.ac.uk/cocwd/SDSM/sdsmmain.html. Further, detailed description along with mathematical relationships used in SDSM, is provided in supplement material.

4.1.2 Smooth Support Vector Machine (SSVM)

Support Vector Machine (SVM) is a supervised learning technique based on Vapnik-Chervo-nenkis (VC) and structural risk minimization (SRM) proposed by Vapnik (1998). It is an efficient approach to establish a workable relationship between model learning ability and complexity, even with inadequate data. Moreover, it has the capacity to predict and address several practical issues such as small sample size, high dimensionality and nonlinearity of data. It has been applied by several researchers (Meenu et al. 2013). Additionally, the learning algorithm of SVMs automatically chooses the model structure (number of hidden units). An advanced version of SVM, called the Smooth Support Vector Machine (SSVM) proposed by Lee and Mangasarian (2001), attempts to simplify the SVM training and minimize computational complexity which is very efficient for large data with linear kernel function. In SSVM with linear kernel function, it is modified by solving fast Newton algorithm, the smoothing approach utilized to convert a quadratic constrained optimization problem into a convex quadratic unconstrained problem. These properties converges a unique solution of unique problem of large datasets by considering smoothing parameter as infinity. The SSVM approach predicts climate data by establishing the statistical relationship between large scale GCM variables and local climate variables as generated by Chen et al. (2012). The selection of large scale predictors was performed SSVM models on the basis of correlation between observed predictands and large scale predictors by considering a threshold of 0.3. The complete description and derivation of the relationship can be found from supplement material and Lee and Mangasarian (2001). The detailed description of features, properties and algorithm of the SSVM is provided by Lee and Mangasarian (2001). The MATLAB written program of SSVM is provided in supplement material and also available on http://research.cs.wisc.edu/dmi/svm/ssvm/.

4.2 Bias Correction Approaches

We adopted distribution mapping and linear scaling approaches for the bias correction of APHRODITE gridded dataset and daily temperature and precipitation generated by the integration of SDSM and SSVM with selected GCMs. The idea behind bias correction was to cater the possible biases (uncertainties) between observed and other datasets (APHRODITE and downscaled climate variables). It is assumed that the bias correction algorithm and parameterization are applicable for both current (1961–2000) and future climate change scenarios (2030s, 2060s and 2090s). The distribution mapping used transfer function of precipitation to settle the occurrence distribution of temperature and precipitation. For the precipitation, the shape (α) and scale (β) parameter of Gamma distribution are the most suitable for the distribution of precipitation event occurrence. The shape parameter controls the distribution profile, while, scale parameter is responsible to estimate dispersion of Gamma distribution (Teutschbein and Seibert 2012). In current study, a cumulative distribution function (CDF) was developed for observed and GCMs simulated data for a certain month on daily basis. Thus, GCMs simulated value of temperature or precipitation of corresponding day within the month sought on empirical CDF of GCMs simulation corresponding to the cumulative probability. Subsequently, the value of temperature or precipitation of that cumulative probability was located on CDF of observed data.

Linear scaling method needs monthly correction factor by taking difference between observed and GCMs simulated values of the present day. Theoretically, the corrected monthly values should be perfectly similar to the observed. The correction of precipitation data is based on the factor estimated using long term monthly mean observed and GCMs simulation. On the other hand, temperature data correction is based on additive factor estimated by taking difference between long term mean monthly observed and GCMs simulated temperature. The scaling factors (SFs) for the temperature variables (Tmin and Tmax) is estimated by an additive, established on the basis of difference between long-term observed mean and control run of GCM-simulations. The complete description and derivation of the relationship can be found from supplement material and Teutschbein and Seibert (2012).

5 Results and Discussion

5.1 Comparison of GCMs and Downscaling Models

The results for calibration period (1961–1990) using NCEP\NCAR reanalysis data with integration of SDSM and SSVM are given in Table 2a. Results show better performance of both models for Tmean and precipitation in comparison with observed data and are comparable with the results of several previous studies (Huang et al. 2011; Wilby et al. 2002). However, the simulated precipitation using SSVM was found slightly better than SDSM as confirmed by Meenu et al. (2013). The spatial heterogeneous nature of precipitation could be the responsible for low efficiency in comparison to temperature data (Tmean), therefore, only precipitation data was considered for the screening of two suitable GCMs for Jhelum River basin.

Table 2 (a) Explained variance (E) and standard errors (SE) for calibration period (1961–1990), and statistical descriptors, (b) comparison between SDSM and SSVM simulated precipitation under NCEP\NCAR reanalysis with mean observed during calibration (1961–1990) and validation period (1991–2000) and, (c) comparison between five (5) different GCMs using mean observed and simulated precipitation of all climate stations by using SSVM and SDSM during validation period (1991–2000)

A comprehensive performance evaluation and comparison during calibration and validation of SDSM and SSVM were carried out by taking an average of daily observed and NCEP\NCAR simulated precipitation and temperature values of 15 stations (Table 2b). The bold and italic values indicated less biases between observed and simulated values and better efficiency in comparison with normal font values. SSVM performed better in reproducing precipitation for most of the statistics than SDSM. During the calibration period, most statistics indicated better performance of SSVM than SDSM, a few statistics such as maximum dry and wet spell length indicated better skill of SDSM than SSVM for all seasons (Table 2b). During the validation period, most statistics for SSVM was found to be higher than SDSM, therefore, SSVM was considered to be better than SDSM.

Comparisons between five GCMs were carried out to analyze suitability of SDSM and SSVM (Table 2c) using precipitation data generated for five different GCMs to determine two suitable GCMs for the Jhelum River basin. Table 2c indicates most of the bold values belong to CGCM3 while most italic values belong to HadCM3. However, other than CGCM3, the total number (4 values for each) of bold shaded values belonging to CCSM3 and number of ECHAM5 and CSIRO-MK3.5 are greater than that of HadCM3 (3 values) suggesting that all GCMs performed efficiently. However, the maximum number of values (bold and italic) generated with CGCM3 and HadCM3 were found consistent with observed values. In addition, the downscaled values with CGCM3 were found slightly accurate than those of HadCM3. As a result, CGCM3 and HadCM3 were determined as the first and second appropriate models for Jhelum River basin. These models were selected for future climate projection at climate stations. Further, Figs. 2 and 3 provides comparisons between observed and simulated Tmin, Tmax and precipitation by employing NCEP\NCAR reanalysis, HadCM3 and CGCM3 predictors, during calibration period (1961–1990). A close agreement was found between observed (baseline) and simulated temperature variables (Tmin and Tmax) at four selected stations (i.e. Narran, Gharidopata, Kotli and Astore) for NCEP\NCAR, HadCM3 and CGCM3 scenarios using SSVM approach. However, simulated precipitation was overestimated particularly during extreme rainfall season (Fig. 3). The simulated precipitation under NCEP\NCAR reanalysis and CGCM3 were slightly better than those of HadCM3. Moreover, a slight inefficiency was found for the generation of precipitation at Baramula, Anantang, Pulwama and Srinagar with bias corrected APHRODITE precipitation. This may be associated with the existence of uncertainties in APHRODITE precipitation dataset which further leads to the low performance of downscaled precipitation.

Fig. 2
figure 2

Comparison between observed and downscaled maximum temperature using SSVM with NCEP\NCAR, CGCM3 and HadCM3 scenarios at four selected stations in Jhelum River basin, for calibration (1961–1990)

Fig. 3
figure 3

Comparison between observed and downscaled precipitation with NCEP\NCAR, CGCM3 and HadCM3 scenario using SSVM at selected gauged (Astore, Gharidopata, Kotli and Narran) and un-gauged points (Baramula, Anantnag, Pulwama and Srinagar) in Jhelum River basin, for 1961 to 1990

Since large uncertainties were observed in projected datasets using SSVM and SDSM under the analyzed GCMs (CGCM3 and HadCM3), therefore, two widely used bias correction approaches were adopted to cater these uncertainties. Results before and after bias correction for the validation period (1991–2000) are presented in Table 3. The average values of performance indicators such as NS coefficient (%), root mean square error (RMSE) and R2 for the downscaled climate variables (precipitation and Tmean) at stations during validation (1991–2000), are given in Table 3. The datasets for the climate variables with NCEP\NCAR generated good results in comparison to both selected GCMs. In the case of GCMs comparison, CGCM3 provides a better agreement with observed datasets as compared to HadCM3, particularly for precipitation. Moreover, the statistical descriptors for the precipitation showed that the SSVM with the integration of NCEP\NCAR (NCEP-SSVM) and GCMs (CGCM3-SSVM and HadCM3-SSVM) provided better results than SDSM (NCEP-SDSM, CGCM3-SDSM and HadCM3-SDSM) (Table 3). Results for the generation of mean temperature (Tmean) using aforementioned sub-models had similar conclusions. These results (SSVM performing better than SDSM for precipitation simulation) are consistent with Chen et al. (2012) and Meenu et al. (2013).

Table 3 Statistical comparison of downscaling techniques (SSVM and SDSM), NCEP\NCAR, scrutinized GCMs (CGCM3 and HadCM3) and bias correction techniques (mapping distribution and linear scaling approach) for the downscaling (with and without bias correction) and observed precipitation and Tmean, during validation period (1990–2000)

A significant improvement was realized in the climate datasets (precipitation and temperature) after adopting bias correction methods. However, it was observed that the distribution mapping method performed slightly better than the linear scaling approach. The values of R2, NS coefficient and RMSE (mm) of the sub-models without bias correction of precipitation data varied between 0.49–0.69, 0.47–0.65 and 35.12–43.29 (mm), respectively. After bias correction of precipitation data, the values of R2, NS and RMSE (mm) were varied between 0.72–0.81, 0.71–0.79 and 19.28–24.87, respectively (for linear scaling method) and 0.72–0.84, 0.69–0.83 and 15.34–19.24 (for distribution mapping approach), respectively. A significant improvement was realized in temperature because both GCMs and downscaling methods generated raw temperature data efficiently (as discussed in previous section). In general results show that distribution mapping performed efficiently than the linear scaling method for bias correction of climate variables. Further, Fig. S2 depicted a significant improvement in basin-wide simulated precipitation after bias correction, particularly, for peak rainfall season. Therefore, the bias corrected datasets using distribution mapping method were adopted to analyze the change in climate variables under future scenarios.

5.2 Projection of Climate Variables

The projected climate data (precipitation and temperature) using CGCM3 and HadCM3 by the integration of SSVM at different climate stations and the un-gauged part of the study area over mean annual and seasonal basis, is given in Table S2 and S3 and Figs. 4 and 5. The maximum average annual increment in precipitation (mm) was found at stations mainly influenced by the monsoon precipitation pattern such as Gharidopata, Kotli, Murree, Planderi, Rawalakot, Baramula, Anantnag and Pulwama, for 2030s in comparison with baseline period (1971–2000) (Fig. 4 and Table S2). An increase in precipitation ranges between 8.08 (Astore) and 89.50 mm (Gharidopata) using CGCM3. Interestingly, at Narran station, a decrease in precipitation by −23.61 and − 26.94 mm was observed for 2030s using CGCM3 and HadCM3, respectively. Similarly, for 2060s at Gharidopata station, the highest precipitation is expected to increase by 108.6 mm (CGCM3) and 103.9 mm (HadCM3). Meanwhile, minimum precipitation at Astore station was found to be increased by 12.48 mm (CGCM3) and 10.83 mm (HadCM3). Slight decrease of −29.54 mm and − 34.96 mm were found at the Narran station, for 2060s and 2090s, respectively, when using CGCM3. Table S2 and Fig. 4 shows that the maximum increase in precipitation is expected in 2090s at Gharidopata (125.5 mm) stations using CGCM3. Overall, a continuous increasing trend of precipitation was observed for future selected time windows (2030s, 2060s and 2090s) in comparison to the baseline. In all future scenarios, the precipitation values downscaled using HadCM3 were found slightly less than CGCM3. It was noticed that the precipitation increases at Astore (8.08–13.43 mm), Gujjar Khan (9.58–21.44 mm) and Mangla (13.12–23.72 mm) stations are very low than other stations. Moreover, decreasing trend in precipitation was only found at the Narran station. This may be associated with fact that the selected study area is mainly influenced by monsoon season (summer precipitation); however, Astore and Narran stations are mainly influenced by westerlies circulation (winter precipitation pattern). The decrease in precipitation at Narran station may be associated with the continuous decrease in temperature, particularly during winter months (Fig. 5). This decrease in temperature could be the cause of precipitation occurrence in the form of snow (solid) rather than liquid. In case of APHRODITE data, the precipitation is expected to increase in the future at all data points located in the un-gauged part of the basin, with maximum precipitation increment of 59.32 and 66.49 mm, with CGCM3 and HadCM3, respectively, at Pulwama for 2090s. Overall, the increase in precipitation at APHRODITE data points (Srinagar, Baramula, Anantnag and Pulwama) is not as high as that at Gharidopata and Kotli. This may be due to the fact that these stations, like Narran and in contrast to Gharidopata and Kotli, are mainly influenced by the westerlies circulation precipitation pattern, as confirmed by Azmat et al. (2017).

Fig. 4
figure 4

Spatial distribution of change in precipitation (mm) including contour lines, for 2030s, 2060s and 2090s, in the Jhelum River basin with CGCM3 (top) and HadCM3 (bottom) using SSVM

Fig. 5
figure 5

Spatial distribution of change in mean temperature (°C) including contour lines for 2030s, 2060s and 2090s, in the Jhelum River basin with CGCM3 (top) and HadCM3 (bottom) using SSVM

The projection of temperature variables for CGCM3 and HadCM3 with SSVM, is shown in Table S3 and Fig. 5. The minimum and maximum temperature at all stations are expected to increase in the future except for Narran, where the minimum and maximum temperature [−0.16, −0.19 and − 0.30; −1.52, −1.95 and − 2.21 for 2030s, 2060s and 2090s, respectively] are expected to decrease, for CGCM3. The maximum rise in mean temperature is expected to occur at Srinagar for 2090s, by 1.67 or 1.79 °C, when using CGCM3 and HadCM3, respectively. The maximum increase in mean temperature is observed at Srinagar and Baramula. The maximum precipitation is expected to increase at stations located at relatively low elevation and mainly influenced by monsoon precipitation pattern (i.e. Gharidopata, Murree and Kotli) and this increase continuously happens under future scenarios (2030s, 2060s and 2090s). Conversely, the precipitation at high-altitude stations (e.g. Narran and Astore) mainly influenced by the westerlies circulation pattern is expected to decrease or a very slight increment will occur at the end of twenty-first century for CGCM3 and HadCM3 with SSVM (Fig. 4). Further, it was observed that the Tmean is expected to significantly increase at low elevation stations (i.e. Gharidopata, Kotli, Gujjar Khan, Planderi, Muzaffarabad and Mangla), while a slight increase (i.e. Astore, Pulwama and Anantnag) and decrease at Narran station is expected under future scenarios (Fig. 5). It seems that the slight increase in precipitation is may be due to the less increase in temperature at high-altitude stations and vice versa for the low elevation stations. Most of precipitation at high-altitude stations occurs in form of snow and the precipitation at Narran station will possibly occur in form of solid snow due to decrease in temperature. At low elevation stations, on the other hand, the rise in temperature is expected to contribute significantly in precipitation of liquid form even during the winter season also observed by Azmat et al. (2017).

Figure 6a indicates that the maximum increase in basin-wide precipitation was expected during monsoon or extreme rainfall season (July–Sep), while least increment was found during pre-monsoon season (April–June), generated for CGCM3. Since, the major proportion of precipitation in study basin occurs during late winter (Feb and March) and monsoon season, this could be a reason for maximum increase in precipitation during the aforementioned seasons. In contrast to the precipitation prediction results, Fig. 6b shows that highest mean temperature rise is expected during pre-monsoon season (April–June). Overall, the mean basin-wide temperature is expected to increase by 0.759, 1.37 and 2.017 during 2030s, 2060s and 2090s, respectively, when using CGCM3. The increasing and decreasing trend of climate variables at different stations in Jhelum River basin are confirmed and stated by Azmat et al. (2017).

Fig. 6
figure 6

Basin-wide seasonal change in (a) percent (%) change precipitation and (b) Tmean (°C), in Jhelum River basin for 2030s, 2060s and 2090s, with the scenarios of CGCM3 and HadCM3 using SSVM

6 Conclusion

The current study aimed to provide a methodology to improve the predictions of climate variables by comparing GCMs (CGCM3, HadCM3, CCSM3, ECHAM5 and CSIRO-MK3.5), downscaling techniques (SSVM and SDSM) and bias correction approaches in high-altitude, scarcely-gauged, Jhelum River basin.

  • Comparative analysis of downscaling techniques showed that the SSVM performed slightly better than SDSM, particularly for downscaling of precipitation data, in high-altitude Jhelum River basin. Similar results were found for the downscaling of temperature data.

  • All selected GCMs performed well, however, CGCM3 and HadCM3 performed better than CCSM3, ECHAM5 and CSIRO-MK3.5. Furthermore, CGCM3 provided better results in comparison to HadCM3. CGCMs and HadCM3 were chosen for the investigation of change in climate variables and their potential impact on hydrological behavior of high-altitude regions under future scenarios. In addition, the climate change analysis could be conducted using several GCMs and emission scenarios with the integration of different downscaling techniques to optimize robust and reliable conclusions.

  • The availability of APHRODITE (gridded temperature and precipitation) datasets at long temporal range (1954–2007) could be an alternative option (after careful bias correction) for the climate change impact studies in high-altitude, scarcely gauged catchments. The long range precipitation record of APHRODITE data is more suitable for climate changes studies with integration of GCMs. However, the bias correction is mandatory to make APHRODITE datasets suitable for the downscaling of climate variables in scarcely-gauged, snow-fed mountainous catchments such as Jhelum River basin.

  • The distribution mapping approach corrected temperature and precipitation fairly well in comparison with the linear scaling approach particularly for the precipitation. Distribution mapping convincingly showed better performance to overcome uncertainty in GCMs simulated data series for all statistical descriptors in comparison with linear scaling. Therefore, distribution mapping approach is a good choice for the bias correction of gridded datasets and GCM simulations.

  • The MODAWEC model could be a good option for the catchments where long term monthly data is available, however the daily data are lacking or not up to the mark for the application.

  • The results for the future climate change scenarios showed that a significant increase in precipitation (basin-wide) by 25.51, 36.76 and 45.52 mm and temperature (basin-wide) by 0.64, 1.47 and 2.79 °C during 2030s, 2060s and 2090s, respectively, are expected. Both climate variables (precipitation and temperature) are expected to increase at all stations except for Narran where both climate variables are expected to decrease.

Overall, current study relies on combination of modeling experimental framework, enabling the examining of downscaling uncertainties in a systematic way. This modeling framework provides novel insights into the future climate change in a high-altitude Jhelum River basin, because it permits in identifying robust changes.