Keywords

1 Introduction

Urbanization and industrialization, the effect of natural climatic variability, and increased greenhouse gasses in the atmosphere have been suggested as the leading causes of the increase in duration, magnitude, and frequency of temperature extremes and heatwaves in the context of the current climate evolution [1]. Study of temperature extremes is important for understanding climate variability which can vary spatially and temporally at different local, regional and global scales. Rising global temperatures are causing increases in the frequency and severity of extreme climatic events, such as floods, droughts, and heat waves [2]. Mean temperatures across India have risen by more than 0.5 ℃ over the period of 1960–2009, with statistically significant increases in heat waves.

The development of Temperature-Duration-Frequency (TDF) curves, which relate the intensity of heat events of different durations to their frequencies, can be useful tools for the analysis of heat extremes [3]. The TDF curves are developed based on historical maximum temperature time series data by fitting a theoretical probability distribution of annual maximum extreme temperature series. TDF curves are developed similar to Rainfall Intensity-Duration-Frequency (IDF) curves. Conventionally, frequency analysis is based on the concept of stationary extreme value theory i.e., the probability distribution parameters are invariant with time. This assumption of stationarity does not hold valid due to the recent evidences in global warming. Various studies analysing the patterns of change in temperature extremes in terms of heatwaves and cold waves [4,5,6,7,8], summer and winter temperature extremes [9] and trend analysis [10, 11] were carried out in the past years. Khaliq et al. [12] developed the concept of TDF curves assuming that the characteristics of the probability distribution of extreme heat events are invariant through time. This study was conducted for four stations in southern Quebec region, Canada. This work was further extended to the non-stationary concept of TDF curves by Ouarda [3] for six stations in the Province of Quebec, Canada. As far as India is concerned, the climate is interesting and complex due to diverse topographical features and large geographical area, hence in case of a local variability in climate change, it is important to know which area could be more affected by these changes. These results can aid in verification of climate model studies, especially for the future. High resolution daily gridded temperature data (1969–2005) was developed by Srivastava et al. [13] which has supported many researchers in their temperature analysis. Even though stationary and non-stationary frequency analysis in terms of rainfall intensity, flood, drought severity has been explored by various researchers in the past, the notion of establishing TDF curves for India still remains to be unexplored. The semi-arid north-western part of India will likely warm more rapidly than the remaining parts making it more prone to extreme temperatures.

This study aims to conduct the frequency analysis of temperature extremes with time-varying distribution functions to characterize temperature extremes in a Non-Stationary (NS) context for the North-West region of India. The TDF analysis is carried out to explore the following objectives: (i) to develop Stationary (S) TDF model assuming location, scale and shape parameter to be a constant and five NS-TDF models by varying location and scale parameter linearly and exponentially with respect to time and the shape parameter as a constant; (ii) to identify the best fit model among the S and NS models for each duration extreme temperature series; (iii) to build and compare S-TDF curve and NS-TDF curves based on the best fit NS model for quantifying the percentage variation; and (iv) to compute the temperature return levels for various return periods with reference to the NS-TDF curves.

2 Study Area and Data

India is a vast country stretching between latitudes 8°4′N and 37°6′N and longitudes 68°7′E and 97°25′E. It is fractionated into seven homogeneous temperature regions namely, North-West (NW), Western Himalayas (WH), West Coast (WC), East Coast (EC), North-East (NE), North Central (NC) and Interior Peninsula (IP) by the Indian Institute of Tropical Meteorology, Pune (online at www.tropmet.res.in). In this paper, the NW homogenous region which comprises of the Indian desert is considered as the study area due to its prominent temperature. The daily maximum temperature gridded data (1°latitude × 1°longitude) observed by the India Meteorological Department (IMD) is procured for NW region for a period of 1951–2019 (69 years). A total of 50 grid points spread out over Gujarat, Rajasthan, Haryana, Punjab, Uttar Pradesh and Himachal Pradesh belongs to the NW region (see Fig. 1).

Fig. 1.
figure 1

Study area – North West temperature homogeneous region of India

3 Methodology

3.1 Significance of Trend

Firstly, Mann Kendell (MK) Trend analysis test [14, 15] was carried out to ascertain the presence of statistically significant trend. For a time series, X(t), the Mann–Kendall statistic is defined as:

$$ S = \mathop \sum \limits_{t = 1}^{N - 1} \mathop \sum \limits_{\dot{t} = t + 1}^N sign\left( {X\left( {\dot{t}} \right) - X\left( t \right)} \right){ } $$
(1)

where N is number of data and sign (•) represents a signum function given by:

$$ sign\left( a \right) = { }\left\{ {\begin{array}{*{20}c} {\frac{a}{\left| a \right|}, if \; x \ne 0} \\ {0 , if \; x = 0} \\ \end{array} } \right. = { }\left\{ {\begin{array}{*{20}c} {1 , if \; a > 0} \\ {0 , if \; a = 0} \\ { - 1 , if \; a < 0} \\ \end{array} } \right. $$
(2)

Sign and value of the S statistic show the direction and intensity of the trend. The increasing or decreasing trends are indicated by positive or negative values of S respectively, while zero represents no trend. A normalized test statistic z can be used to statistically quantify the significance of the trend.

$$ z = \left\{ {\begin{array}{*{20}l} {\frac{{S - 1}}{{\sqrt {var\left( S \right)} }}} \hfill & {if\;S > 0} \hfill \\ 0 \hfill & {if\;S = 0} \hfill \\ {\frac{{S + 1}}{{\sqrt {var\left( S \right)} }}} \hfill & {if\;S{\text{ < }}0} \hfill \\ \end{array} } \right. $$
(3)

The positive values of z represent increasing trend while negative values indicate decreasing trend. At 95% confidence interval, if the value of z is more than 1.96 it represents significant non-stationary increasing trend. Similarly, if z is less than −1.96, it represents significant non-stationary decreasing trend.

3.2 Stationary and Non-stationary Temperature Models

Extreme Value Theory (EVT) is a framework used for analysing the risks associated with climate extremes and their return levels [1]. The Generalized Extreme Value (GEV) distribution is a continuous probability distribution that combines Gumbel, Fréchet and Weibull extreme value distributions. The frequency model using this family of distributions is often referred to as the block maxima approach [1]. The GEV distribution depends on three parameters: location (µ), scale (σ), and shape (ξ) with a cumulative distribution function expressed as:

$$ F\left( {x,\mu ,\sigma ,\xi } \right) = \left\{ {\begin{array}{*{20}c} {exp\left\{ { - \left[ {1 + \xi \left( {\frac{x - \mu }{\sigma }} \right) } \right]^{ - 1/\xi } } \right\} , \sigma > 0, 1 + \xi \left( {\frac{x - \mu }{\sigma }} \right) > 0, \xi \ne 0} \\ {exp\left\{ { - exp\left[ { - \frac{{\left( {x - \mu } \right)}}{\sigma }} \right]} \right\} , \sigma > 0, \xi = 0 } \\ \end{array} } \right. $$
(4)

In this study Stationary (S) model and five Non-Stationary (NS) models were developed as shown in Table 1. The NS setting for the location and scale parameters of GEV distribution was introduced as a function of time. The shape parameter was assumed to be a constant for all models.

Table 1. Development of stationary and non-stationary temperature models

The distribution parameters are estimated using Maximum Likelihood Estimation (MLE) method. MLE involves defining a likelihood function for calculating the conditional probability of observing the data sample given a probability distribution and distribution parameters. The log likelihood is derived from Eqs. (5) and (6). The location and scale parameters in the above equations are replaced with the corresponding equations in Table 1 based on the model application. MLE is done by maximizing the likelihood estimates obtained from Eqs. (5) and (6).

$$ \begin{aligned} & For\;\xi \ne 0, \\ & {\text{log}}L(\left. {\mu ,\sigma ,\xi } \right|X) = - n{\text{log}}\sigma - \left( {1 + \frac{1}{\xi }} \right)\sum\limits_{{i = 1}}^{n} l og\left[ {1 + \xi \left( {\frac{{x_{i} - \mu }}{\sigma }} \right)} \right] \\ & \quad \quad \quad \quad \quad \quad \quad \quad - \sum\limits_{{i = 1}}^{n} {\left[ {1 + \xi \left( {\frac{{x_{i} - \mu }}{\sigma }} \right)} \right]^{{ - 1/\xi }} } ,1 + \xi \left( {\frac{{x_{i} - \mu }}{\sigma }} \right) > 0 \\ \end{aligned} $$
(5)
$$ \begin{aligned} & For\;\xi = 0, \\ & {\text{log}}L(\left. {\mu ,\sigma } \right|X) = - n{\text{log}}\sigma - \sum\limits_{{i = 1}}^{n} {\left( {\frac{{x_{i} - \mu }}{\sigma }} \right)} - \sum\limits_{{i = 1}}^{n} {{\text{exp}}} \left[ { - \left( {\frac{{x_{i} - \mu }}{\sigma }} \right)} \right] \\ \end{aligned} $$
(6)

The Akaike Information Criterion (AIC), which penalizes the minimized negative log likelihood for the number of parameters estimated, is used to select the best model [16]. AIC is used to estimate the best fit GEV model among the SGEV model and 5 NSGEV models. It is computed using Eq. (7)

$$AIC=2K-2\mathrm{ln}\left(L\right)$$
(7)

where K is the number of independent variables used and L is the log-likelihood estimate.

3.3 Quantification of Temperature Intensities for Various Return Periods

Once the best NS model is identified, the NS temperature intensity is estimated using the best NS model parameters. Unlike S model, the NS models’ location and/or scale parameter value are bound to be time dependant. In this study for NS framework models, 95 percentiles of the location and scale parameter values from the historical observation obtained from Eqs. (8) and (9) is substituted into Eq. (10) for estimating the non-stationary temperature intensities for various return periods (T in years).

$$\widehat{{\mu }_{95}}={Q}_{95}(\widehat{{\mu }_{t1}}, \widehat{{\mu }_{t2}}, \dots \dots , \widehat{{\mu }_{tn})} $$
(8)
$$\widehat{{\sigma }_{95}}={Q}_{95}(\widehat{{\sigma }_{t1}}, \widehat{{\sigma }_{t2}}, \dots \dots , \widehat{{\sigma }_{tn})}$$
(9)
$$I= \left\{\begin{array}{c}\widehat{\mu }+ \frac{\widehat{\sigma }}{\widehat{\xi }}\left[{\left(-\mathrm{log}\left(1-\frac{1}{T}\right)\right)}^{-\widehat{\xi }}-1\right], \widehat{\xi } \ne 0 \\ \widehat{\mu }+ \widehat{\sigma }\left[-log\left(-\mathrm{log}\left(1-\frac{1}{T}\right)\right)\right], \widehat{\xi }=0\end{array}\right.$$
(10)

4 Results and Discussion

The daily maximum temperature data obtained from IMD for 50 grid points belonging to NW region was used to extract the annual maximum temperature over 6 durations (1-day, 2-day, 4-day, 6-day, 8-day and 10-day). The trend analysis was carried over the NW region using MK test and it was found that number of grid points following non-stationarity trend increases as the duration increases. The same is represented in a bar graph in Fig. 2.

Fig. 2.
figure 2

Significance of trend in grid points of NW region

The S TDF and NS TDF analysis was carried out by developing GEV models as mentioned in Sect. 3.2 and the best fit model was identified with the help of AIC values. From Fig. 3 it can be seen that the best fit models vary for the same grid point as the duration changes in some cases while in others the best fit model remains constant throughout for all durations.

Fig. 3.
figure 3

Best fit models for NW temperature homogenous region for all durations

It can be concluded that more than 84% of the grid points have NSGEV-1 as the best fit model where location parameter is linearly time varying with a constant scale and shape parameter for all durations (see Fig. 4). S TDF curves and best fit NS TDF curves were developed for all the grid points for various return periods (RP) of 2 years, 5 years, 10 years, 25 years, 50 years and 100 years. Due to space constraints, a sample grid point (23.5oN, 68.5oE) of Gujarat is chosen for a detailed representation of S and best fit NS TDF curves for all return periods in Fig. 5. The dashed curves represent S-TDF curves and the thick line represent NS-TDF curves for various return periods. It was found that the temperature intensities decrease with increase in the daily maximum durations and increase when return period increases.

Fig. 4.
figure 4

Percentage of grid points for best fit models in NW region

Fig. 5.
figure 5

S and best fit NS (NSGEV-1) TDF curves for grid point (23.5oN, 68.5oE) Gujarat

The percentage variations (PV) of return levels between S & NS cases were calculated for all the grid points. A spatial representation of the temperature intensities for both S and NS case along with the PV for 1 day duration for all return periods for the entire NW region is shown in Fig. 6.

Fig. 6.
figure 6

S and NS temperature intensities and percentage variation for 1-day duration

When the stationary case is compared with non-stationary case it was found that the temperature intensities increased beyond 45 °C for many grid points when non-stationarity was considered. This points out that the stationary temperature intensities are lower than all of the corresponding non-stationary values for all grid points. The same was the case for all the other durations as well. Majority of the percentage variations range between 0% and 2% i.e., about 0 °C to 0.9 °C. In the case of return period of 2 years around 36% of the grid points showed a percentage variation between 2% to 3% which reduced to the category of 0% to 2% as the return period increased. One grid point (28.5°N,76.5°E) showed a consistent increase in the percentage variation reaching a value of 3% to 4% (up to 1.6 °C) from 25 years to 100 years of return period. The pattern of percentage variations of return levels was observed closely by plotting a stem plot as shown in Fig. 7 for 1-day duration. It was seen that the graph follows two patterns i.e., PV decreases as RP increases and vice-versa. While comparing Fig. 6 with Fig. 3 it was observed that the grid points having NSGEV-1 as the best fit model followed the first pattern of PV decreases as RP increases while the grid points having NSGEV-4 and NSGEV-5 as the best fit model followed the second pattern of PV increases as RP increases.

Fig. 7.
figure 7

Percentage variation of all grid points for 1-day duration

5 Conclusions

In this study, TDF analysis was carried out for NW temperature homogeneous region of India by developing S model and five NS models with time as covariate. It was found that more than 84% of the grid points approve NSGEV-1 model with location parameter linearly varying with respect to time as the best fit model with the help of AIC values. The best fit model tends to change for each duration for certain grid points. Temperature intensity versus duration plots were established for both S and NS case with each curve representing return period of 2 years, 5 years, 10 years, 25 years, 50 years and 100 years. A comparison of S-TDF curves and NS-TDF curves were carried for all the 50 grid points and it was found that the NS temperature intensities were more than the S estimates. The temperature intensity increased beyond 45 °C when non-stationarity was considered. As the return period increases, the temperature intensities tend to increase for S and NS case. A PV of 0% to 2% was observed for majority of the grid points. In general, it can be concluded that for more than 84% of the grid points PV decreases as RP increases while for the remaining 16% PV increases as RP increases based on the best fit model for each grid point.