1 Introduction

In hydrological studies, drought is a climatic anomaly, caused by either sub-normal rainfall, erratic rainfall distribution, higher water need or a combination of all these factors. In a large country like India where precipitation varies both in space and time, drought is one of the most frequently occurring natural calamities in various parts of the country. About two-thirds of the geographic area of India receives low rainfall (less than 1000 mm), which is also characterized by uneven and erratic distributions. Out of net sown area of 140 million hectares, about 68% is reported to be vulnerable to drought conditions and about 50% of such vulnerable area is classified as ‘severe’, where frequency of drought is almost regular (Murthy and Sesha Sai 2010).

As drought is a multivariate phenomenon characterizing severity, duration and peak; drought properties need to be modelled using effective probability models. Recently, copulas have been successfully applied in the field of hydrology for multivariate analysis of hydrological systems, viz., flood frequency analysis and drought frequency analysis (Kao and Govindaraju 2010; Mirakbari et al. 2010; Wong et al. 2010). By using copulas, some of the limitations of traditional multivariate distributions can be relaxed as suggested in recent studies (Favre et al. 2004; Genest and Favre 2007). Shiau (2006) investigated bivariate joint distribution of drought properties severity and duration in Southern Taiwan using standardized precipitation index (SPI) and theory of copulas. Shiau et al. (2007) performed bivariate frequency analysis of hydrological droughts using monthly stream flow data of Yellow River basin employing theory of runs and Archimedean class of Clayton copula. Laux et al. (2009) analysed regional nature of droughts in Volta basin using effective drought index calculated from daily precipitation data in order to assess drought properties of five different rainfall regions within Volta basin. In their study, bivariate Archimedean class of Clayton copula family is chosen to construct joint distribution and associated joint return periods. Serinaldi et al. (2009) modelled four-dimensional joint distribution of drought properties – mean SPI, duration, minimum SPI, drought areal extent and computed joint return periods using elliptical class of Student’s t copula for different degrees of freedom. Shiau and Modarres (2009) employed Archimedean class of Clayton copula to investigate the nature of S–D–F curves of two different climatic regions in Iran due to its simplified structure. Song and Singh (2010) modelled bivariate joint probability distribution of drrought properties in Texas using meta-elliptical class of copulas and found that meta-Gaussian copula performed satisfactorily in modelling the dependence. Kao and Govindaraju (2010) proposed copula-based joint deficit index using precipitation and stream flow marginals with window sizes varying from 1 to 12 months in Indiana watershed. The standardized index is adopted for statistical analysis of hydrologic variables. The temporal dependence structure among hydrologic variables is modelled using elliptical class of Student’s t copula due to large number of variables involved in the study and its ability to model tail dependence. Wong et al. (2010) investigated effect of El Niño-Southern Oscillation (ENSO) phenomenon on nature of multivariate drought frequencies using precipitation data from two districts in New South Wales, Australia. The performance of trivariate Gumbel–Hougaard and Student’s t copulas were tested for each climatic phase and limitations of each copula model was discussed. The trivariate Gumbel–Hougaard copula has a limitation that weak dependences are averaged in asymmetric trivariate structure and there is a restriction in application as the correlation between two pairs should be identical and lower than the third pair. In contrast, trivariate t copula has no such restriction except that variance-covariance matrix has to be positive definite. A small difference was observed between Gumbel–Hougaard and Student’s t copulas in distance-based goodness-of-fit measures and the latter emerged as a better model. The fitted models were then used to estimate annual recurrence intervals taking into account at least one of the three drought variables and all of the three variables exceeding critical values. Madadgar and Moradkhani (2011) investigated the effect of climate change on drought characteristics using the copula method in Oregon’s Upper Klamath River basin. The performance of two trivariate copulas – Gumbel–Hougaard and Student’s t were tested in modelling joint distribution of drought properties – severity, duration and intensity for analysing streamflow-based drought indices. Results showed that both Gumbel–Hougaard and Student’s t copulas performed similarly for historical (1920–2009) time period. To evaluate potential impact of climate change, five General Circulation Model (GCM), outputs under A1B emission scenario were used for multivariate drought analysis. The return period analysis based on bivariate and trivariate copulas showed that climate change causes an overall decline in drought severity and duration in the Upper Klamath River basin during projected time period 2020–2090. Lee et al. (2012) studied tail dependence of four different copula functions – Gumbel–Hougaard, Frank, Clayton and Gaussian copulas for bivariate drought frequency analysis in Canada and Iran. Their study showed that Clayton copula may not be suitable for modelling droughts as dependence between two variables in the upper tail of Calyton copula was found to be very weak and similar to the independent case, whereas Frank and Gumbel–Hougaard copula showed better performance for modelling bivariate drought characteristics.

The present study investigates the role of ENSO-based climate variability in influencing multivariate drought frequencies in the drought-prone region of Western Rajasthan in India. The state has the maximum probability of drought occurrence in India, with recurring droughts of about 3–4 years in a cycle of 5 years (Mall et al. 2006). Studies conducted by Rathore (2004), identified 48 drought years of varied intensity in the state during the period 1901–2002. During the year 2002, when about 29% of the total area of the country was affected by drought, seasonal departures of precipitation for west and east Rajasthan were −71% and −60%, respectively (RAPCC 2012). During the year 2001, the total monsoon rainfall in Rajasthan was 381 mm and 30,583 villages were affected by drought, whereas in 2002, the monsoon rainfall was only 173 mm and 41,000 villages faced drought situation (Khera 2004).

2 Large-scale climatic phenomena – ENSO

Drought is a normal part of natural climate variations. Past research studies based on tree-rings and other instrumental records have shown that many major droughts which occurred over different parts of the world are triggered by atmospheric teleconnection such as ENSO phenomenon (Dai 2011). ENSO refers to oscillation between a warm phase (El Niño) characterized by abnormal warming of surface ocean waters of the central and eastern Pacific and enhanced convection in the atmosphere above; and a cold phase (La Niña) characterized by abnormal cooling of these waters and suppressed convection in the atmosphere above (Gadgil et al. 2007). Several researchers have explored the links between Indian monsoon rainfall and ENSO events (Webster and Yang 1992; Rajeevan et al. 2004). There have been many indices defined to represent ENSO events. Two well-known ENSO indices are southern oscillation index based upon sea surface pressure differences at Tahiti (18°S 150°W) and Darwin (12°S 131°E) from 1880 onwards; and the Niño index from 1950 onwards based upon sea-surface temperature (SST) anomalies (°C) of the central and eastern equatorial Pacific.

A recent index for describing ENSO phenomena is the multivariate ENSO index (MEI). Computation of MEI involves moving average of 2-months for the time series under consideration. The MEI represents the first unrotated principal component (PC) of all six observed climatic parameters over the tropical Pacific: sea-level pressure, north-south component of surface wind, east-west component of surface wind, SST, surface air temperature and total cloudiness fraction of the sky (Wolter and Timlin 2011). Inclusion of these climatic variables helps MEI to explain the ocean–atmospheric interactions better than the indices that rely on single climatic variable. Large positive values of the MEI represent El Niño, while negative values correspond to La Niña episodes.

3 Copula

Copulas are parametrically specified joint distribution functions obtained by linking marginal distribution of any form. If X = (X 1,...,X d ) is a random vector with continuous marginal cumulative distribution functions (CDF) F 1,...,F d , then joint distribution H(X) can be expressed by Sklar’s theorem (Sklar 1959) as:

$$ H(\boldsymbol{\it X})=C\{F_1(x_1),\ldots,F_d(x_d); \theta\},\quad\boldsymbol{\it X}\,\,\in\,\,R^d , $$
(1)

where the function C:[0, 1]d →[0, 1] is called a d-dimensional copula, with association parameter θ. There exists different class of copulas in the literature, viz., Archimedean, Plackett, elliptical copulas, etc. Though Archimedean class of copulas are simple and can be easily generated, extension to higher orders in symmetric form is limited (Grimaldi and Serinaldi 2006). In this study, elliptical class of Student’s t copula is chosen due to its capability to model multivariate distribution with asymmetric dependence structure and can capture upper tail dependence quite well (Nelsen 2006).

3.1 Student’s t copula

If ∑ ∈ R d for x ∈ R d denotes a symmetric positive definite shape matrix, then multivariate Student’s t copula for marginals u = (u 1,..., \(u_{d}) \in [0, 1]^{d}\) with ϑ degrees of freedom is defined as (Mashal and Zeevi 2002)

$$\begin{array}{lll} && C(u_1,u_2,u_3;\vartheta,\Sigma)= t^{d}_{\vartheta, \Sigma}(t^{-1}_{\vartheta}(u_1),t^{-1}_{\vartheta}(u_2),t^{-1}_{\vartheta}(u_3))\\ && \quad = \int^{t^{-1}_{\vartheta}(u_1)}_{-\infty}\int^{t^{-1}_{\vartheta}(u_2)}_{-\infty}\int^{t^{-1}_{\vartheta}(u_3)}_{-\infty}\frac{\Gamma((\vartheta+d)/2)}{\Gamma(\vartheta/2)(\vartheta\pi)^{d/2}\left|\Sigma\right|^{1/2}}\\&&\quad \times (1+y^{T}\Sigma^{-1}y/\vartheta)^{-(\vartheta+d)/2}dy_1dy_2dy_3, \end{array}$$
(2)

where d = 3; y = {y 1, y 2, y 3}; ϑ and ∑ are parameters of Student’s t copula.

The parameter of t −copula is estimated using a two-step transformation procedure. The shape parameter matrix ∑ consists of elements \(\hat{{\sigma }}_{i,j} \), which are functions of rank correlation coefficient. For trivariate case, the elements of shape parameter matrix \(\sum {\in \left\{ {\sigma_{11} \;\sigma_{12} \;\sigma_{13} ;\sigma_{21} \;\sigma_{22} \;\sigma_{23} ;\sigma_{31} \;\sigma_{32} \;\sigma_{33} } \right\}} \), and the elements \(\hat{{\sigma }}_{i,j} \) are estimated using the relationship \(\hat{{\sigma }}_{i,j} =\sin \left( {\frac{\pi }{2}\hat{{\tau }}_{ij} } \right)\), where \(\hat{{\tau }}_{ij} \) is the pair-wise Kendall’s dependence measure or rank correlation coefficient. Obtaining \(\hat{{\sum }}\) in a three-dimensional case involves computing \(\hat{{\sigma }}_{ij} \) for i, j\(\left\{ {1,2,3} \right\}\). Then, a numerical search technique is employed for estimating ϑ

$$ l(\hat{\vartheta})= \mathop{\mbox{arg\;max}}\limits_{{\vartheta\in(2,\infty]}}\sum\limits_{i=1}^{n}\ln\left[c_{\theta}\left\{(U_{i,1},U_{i,2},U_{i,3})|\hat{\vartheta},\hat{\Sigma}\right\}\right], $$
(3)

where U i,d denotes empirical CDF of d th random variable, and c θ is the copula density.

3.2 Testing suitability of the copula family

After fitting copula, theoretical (fitted copula model) and observed probabilities (empirical observations) are compared. The empirical copula can be defined as (Genest and Favre 2007)

$$\begin{array}{lll} &&C_n \left( {u_1 ,u_2 ,u_3 } \right)\\ &&\quad =\frac{1}{n}\sum\limits_{i=1}^n {\bf 1} \left( {\hat{{U}}_{i,1} \le u_1 ,\hat{{U}}_{i,2} \le u_2 ,\hat{{U}}_{i,3} \le u_3 } \right), \\&&\quad\;\; u_1 ,u_2 ,u_3 \in [ {0,1} ]. \end{array}$$
(4)

Apart from distance-based test, the parametric and non-parametric tail dependence tests are performed for the fitted copula model. The tail dependence coefficient (TDC) of copulas provides a measure of strength of dependence in the tails of a bivariate distribution (McNeil et al. 2005). The parametric upper TDC of Student’s t copula with ϑ degrees of freedom and \(\hat{\sum} \in \{\sigma_{11}\, \sigma_{12}\, \sigma_{13};\; \sigma_{21}\, \sigma_{22}\, \sigma_{23};\; \sigma_{31}\, \sigma_{32} \,\sigma_{33}\}\) is given by

$$ \label{eq5} \lambda_{U_{ij} }^{\rm param} =2-2t_{\vartheta +1} \left( {\sqrt {\left( {\vartheta +1} \right)} \sqrt {\frac{\left( {1-\sigma_{ij} } \right)}{\left( {1+\sigma_{ij} } \right)}} } \right), $$
(5)

where t ϑ + 1 is CDF of Student’s t distribution with ϑ+1 degrees of freedom. For non-parametric tail dependence test, Capéraá-Fougéres-Genest (CFG) estimator (Capéraá et al. 1997; Frahm et al. 2005) is employed. If \(\{\left(u_1, v_1\right),\ldots,\) \(\left({u_n ,v_n}\right)\}\) are random samples obtained from Copula \(C\left( \bullet \right)\), the bivariate upper TDC using CFG estimator (\(\lambda_U^{CFG} )\) is given by

$$\begin{array}{lll} \label{eq6} &&{\kern-18pt} \hat{{\lambda }}_U^{\rm CFG} \,=\,2\,-\,2\exp \left[\frac{1}{n}\,\sum\limits_{i=1}^n \log\, \left\{\, \sqrt {\log \left( {\frac{1}{u_i }} \right)\log \left( {\frac{1}{v_i }} \right)} \right.\right.\\&&\qquad \left.\left.\mathord{\left/ {\vphantom {{\sqrt {\log \left( {\frac{1}{u_i }} \right)\log \left( {\frac{1}{v_i }} \right)} } {\log \left( {\frac{1}{\max \left( {u_i ,v_i } \right)^2}} \right)}}} \right. \kern-\nulldelimiterspace} {\log \left( {\frac{1}{\max \left( {u_i ,v_i } \right)^2}} \right)} \right\} \right]. \end{array}$$
(6)

3.3 Trivariate frequency analysis of droughts using copulas

Multivariate frequency analysis is helpful in understanding critical behaviour of drought characteristics. As drought events may last for more than a year, drought characteristics can be analysed as a partial duration series (PDS) of independent events. The univariate return period of drought for a specific drought variable (say severity) can be computed by (Kim et al. 2003)

$$\label{eq7} T_{s}=\frac{N}{n(1-F_{s}(s))}=\frac{\delta}{1-F_{s}(s)}, $$
(7)

where δ=N/n and N = total length of SPI time series (years), n = number of drought events, F s (s) is the univariate CDF of drought severity.

By following the above definition, the copula- based trivariate return period of drought for exceeding thresholds of any one of the drought variables (in OR-case) without accounting climatic state is given by

$$\begin{array}{lll}\label{eq8} T^{\cup}_{SDI}\,\,\,\,&=&\,\,\,\frac{\delta}{1-F_{SDI}(s,d,i)}=\frac{\delta}{1-C_{SDI}(s,d,i)}\\[2pt] &=&\,\,\,\frac{\delta}{1-P^{\cup}_{SDI}}, \end{array}$$
(8)

where \(P_{SDI}^{\cup} =P\left( {S\leq s\cup D\leq d\cup I\leq i} \right)\) denotes joint probability of occurrence of any one of the drought variables, i.e., either severity (S) or duration (D) or peak (I).

Similarly, multivariate return period of drought for exceeding thresholds of all drought variables simultaneously (in AND-case) without accounting climate state is given by

$$\begin{array}{lll}\label{eq9} T^{\cap}_{\rm SDI} \,\,\,\,&=&\,\,\, \frac{\delta}{\begin{array}{c}1-F_\mathrm{s}(s)-F_{\rm D}(d)-F_{\rm I}(i)+F_{\rm SD}(s,d)\\ +\;F_{\rm DI}(d,i)+F_{\rm SI}(s,i)-F_{\rm SDI}(s,d,i)\end{array}}\\ &=&\,\,\,\frac{\delta}{\begin{array}{c}1-F_\mathrm{s}(s)-F_{\mathrm{D}}(d)-F_\mathrm{I}(i)+C_\mathrm{SD}(s,d)\\+\;C_\mathrm{DI}(d,i)+C_\mathrm{SI}(s,i)-C_\mathrm{SDI}(s,d,i)\end{array}}\\ &=&\,\,\,\frac{\delta}{1-P^{\cap}_\mathrm{SDI}}, \end{array}$$
(9)

where C SD (s, d), C DI (d, i) and C SI (s, i) are the joint distributions obtained from bivariate copula for severity-duration, duration-peak and severity- peak combinations, respectively; \(P_{SDI}^{\cap} =P(S\leq s\cap\) D ≤ d ∩ I ≤ i) denotes joint probability of occurrence of all the drought variables simultaneously.

By using Bayes rule for obtaining total probability (from three ENSO state conditional probabilities) and concepts from Willems (2000), the mean annual multivariate return period of drought accounting climate state information (i.e., compound return period) in OR-case can be written as

$$\begin{array}{lll} \label{eq10} &&T_{\rm SDI}^{\cup ,{\rm Climate}}=\frac{\delta }{\begin{array}{c} 1-\left( p_{\rm El~\text{Ni\~{n}o}} P_{{\rm SDI}\{\{{\rm El~ \text{Ni\~{n}o}}\}}^{\cup}\right.\\[6pt] +\;p_{{\rm La~ \text{Ni\~{n}a}}} P_{{\rm SDI}\left\{{\rm La~ \text{Ni\~{n}a}}\right\}}^{\cup}\\[2pt] \left.+\;p_{\rm Neutral} P_{{\rm SDI}\left\{{\rm Neutral}\right\}}^{\cup} \right) \end{array}}, \end{array}$$
(10)

where \(P_{{\rm SDI}\left\{ {{\rm El~ \text{Ni\~{n}o}}} \right\}}^{\cup} \), \(P_{{\rm SDI}\left\{ {{\rm La~ \text{Ni\~{n}a}}} \right\}}^{\cup} \) and \(P_{{\rm SDI}\left\{ {{\rm Neutral}} \right\}}^{\cup} \) denote joint probability of occurrence of any one of the drought variables at El Niño, La Niña and neutral climatic phases, respectively; p ENSO  State denotes probability of occurrence of drought in a particular climate episode; ENSO State = {El Niño, La Niña, Neutral}. The p ENSO  State can be obtained by dividing the number of drought events occurring at a particular climate state by the total number of drought events during the period of study. Similarly, mean annual recurrence interval of drought in AND- case incorporating climate episode can be written as

$$\begin{array}{lll}\label{eq11} &&T^{\cap, \rm Climate}_{\rm SDI}=\frac{\delta}{\begin{array}{c}1-\left(p_{\rm El~\text{Ni\~{n}o}}P^{\cap}_{\rm SDI\{El~\text{Ni\~{n}o}\}}\right.\\[6pt] \;p_{\rm La~\text{Ni\~{n}a}} P^{\cap}_{\rm SDI \{La~\text{Ni\~{n}a}\}}\\[2pt] \left. +\;p_{\rm Neutral}P^{\cap}_{\rm SDI\{Neutral\}}\right)\end{array}}, \end{array}$$
(11)

where \(P_{{\rm SDI}\left\{ {{\rm El~ \text{Ni\~{n}o}}} \right\}}^{\cap} \), \(P_{{\rm SDI}\left\{ {{\rm La~ \text{Ni\~{n}a}}} \right\}}^{\cap} \) and \(P_{{\rm SDI}\left\{ {{\rm Neutral}} \right\}}^{\cap} \) denote joint probability of occurrence of all variables simultaneously at El Niño, La Niña and neutral climatic phases, respectively.

3.4 Conditional return periods of drought characteristics

The copula-based joint distribution can be used to obtain conditional return periods of drought characteristics. For example, joint return period of drought severity and duration conditional on drought peak (T SD ∣ I ) can be obtained from

$$\label{eq12} T_{S,D|I}=\frac{\delta}{(1-F_{S,D|I}(s,d|I\leq i^{\prime}))}, $$
(12)

where

$$\begin{array}{lll} F_{S,D|I}&=&P\left[S\leq s,D\leq d|I\leq i^{\prime}\right]\\[2pt] &=&\frac{F_{S,D,I}(s,d,i^{\prime})}{F_{I}(i^{\prime})}\\[2pt] &=&\frac{C_{SDI}(s,d,i^{\prime})}{F_{I}(i^{\prime})}. \end{array}$$

Similarly, equivalent formula for conditional return period of drought severity and peak, given drought duration (T SI ∣ D ), and conditional return period of drought duration and peak, given severity (T DI ∣ S ) can also be obtained.

3.5 Conditional probability of drought characteristics under each ENSO state

Conditional distribution of drought characteristics under a particular ENSO state can be derived from copula-based joint distribution function fitted for drought characteristics in that climate state, which can be helpful in studying the influence of one dependent variable on another. The conditional probability of drought severity, given drought duration and peaks exceeding certain thresholds d and i , respectively (under a specific climate state), is expressed as

$$\begin{array}{lll} &&P\left( {S\le s\vert D\ge {d}',I\ge {i}'} \right)\\[2pt] &&\quad = \frac{F_S \left( s \right)\,-\,F_{SD} \left( {s,d^\prime } \right)\,-\,F_{SI} \left( {s,i^\prime } \right)\,+\,F_{SDI} \left( {s,d^\prime ,i^\prime } \right)}{1\,-\,F_D \left( {d^\prime } \right)-F_I \left( {i^\prime } \right)+F_{DI} \left( {d^\prime ,i^\prime } \right)} \\[2pt] &&\quad =\frac{F_S \left( s \right)\,-\,C_{SD} \left( {s,d^\prime } \right)\,-\,C_{SI} \left( {s,i^\prime } \right)\,+\,C_{SDI} \left( {s,d^\prime ,i^\prime } \right)}{1\,-\,F_D \left( {d^\prime } \right)\,-\,F_I \left( {i^\prime } \right)\,+\,C_{DI} \left( {d^\prime ,i^\prime } \right)}\\ \end{array}$$
(13)
$$\begin{array}{lll} &&P(S\leq s,I \leq i| D \geq d^{\prime})\\[2pt] &&\quad = \frac{F_{SI}(s,i)-F_{SDI}(S,d^{\prime},i)}{1-F_{D}(d^{\prime})}\\[2pt] &&\quad =\frac{C_{SI}(s,i)-C_{SDI}(s,d^{\prime},i)}{1-F_{D}(d^{\prime})}. \end{array}$$
(14)

4 Application

4.1 Study area and data

Western Rajasthan with an area of 196,150 km2 occupies 57.31% of India’s total arid zone area. The climate is characterized by low, highly variable and ill-distributed rainfall, high wind speed, high evaporation losses, and extremes of seasonal temperatures. Rajasthan has only 1% of the country’s total surface water resources. The monsoon period is short (about 2 to 3 months, July–September), resulting in annual rainfall ranging from 150–900 mm in different parts of the state (average annual precipitation 576 mm) and temperature varies from 5°–45°C in different seasons (RACP 2012).

The monthly area-weighted precipitation data of nine rainfall stations in western Rajasthan meteorological subdivision for about 110 years (from 1896 to 2005) is obtained from Indian Institute of Tropical Management, Pune (http://www.tropmet.res.in). Map of the study area showing locations of the rain gauge stations is presented in figure 1. Relevant data for MEI of about 135 years (from 1871 to 2005) are obtained from NOAA website (http://www.esrl.noaa.gov/psd/enso/mei.ext/index.html#data). MEI values before 1950 (extended back to 1871) are based on Hadley Centre sea-level pressure and SST (Wolter and Timlin 2011).

Figure 1
figure 1

Location map of the study area showing rain gauge stations.

4.2 Drought modelling using standardized precipitation index

Drought events are identified using standardized precipitation index, which is a commonly used indicator for drought assessment. Calculation of SPI for any location and time scale (such as 3, 6, 9 and 12 months) involves fitting a probability distribution function (generally Gamma or Pearson Type III distribution) for long time series of aggregated precipitation which is then transformed into a standardized normal distribution so that the mean SPI for the location and desired period is zero. In this study, 6-month aggregated precipitation data is fitted with Gamma distribution function and used for SPI computation. A drought period is identified when SPI value reaches 20 percentile or below threshold level (Svoboda et al. 2002). Drought duration (D) is taken as the number of consecutive intervals (months), where SPI remains below the specified threshold value. Drought severity (S) is the cumulative value of SPI within the drought duration. For convenience, the severity of drought event i, is taken as positive and given as \(S_i =-\sum\nolimits_{t=1}^D {S\,P\,I_{i,t} } \), ∀ i = 1, ..., n. Drought peak (I) is the absolute value of the minimum value taken by the SPI over the duration of the drought. Figure 2 illustrates the definitions of drought characteristics. More details on SPI and definitions of the drought characteristics and their computation are well-documented by Janga Reddy and Ganguli (2012).

Figure 2
figure 2

Illustration of drought characteristics using SPI.

For data of the present study region, it is noticed that SPI-6 time series reaches 20 percentile limit at − 0.8; hence it is taken as a threshold level for drought identification. Monthly SPI-6 series were calculated and 87 drought events were identified. The time series plots of SPI-6 and MEI time series are shown in figure 3. From figure 3a, it can be observed that the region had experienced major drought events during 1899–1900, 1901–1902, 1905–1906, 1911–1912, 1915–1916, 1918–1919, 1920–1921, 1939, 1946–1947, 1968–1970, 1973 and 2002–2003.

Figure 3
figure 3

Time series plot of (a) SPI-6 and (b) MEI during study period (1896–2005).

The three climate states (El Niño, La Niña and neutral phases) are assumed to be mutually exclusive and exhaustive during the study period so that each year belongs to one particular ENSO episode. Every year from 1896–2005 is given an ENSO classification based on May–November MEI index (Wolter and Timlin 2011). The non-parametric Spearman’s rank correlation is computed between MEI and SPI-6 time series at each ENSO phase year during 1896–2005. The rank correlations between SPI-6 and MEI time series at El Niño, La Niña and neutral phase years are found to be −0.14, −0.30 and −0.14 with corresponding p-values 5.11e − 4, 2.3e − 8 and 0.008, respectively. All correlations are statistically significant at 5% significance levels as tested by standard two-tailed t test, which indicates that drought in the study region is responsive to ENSO phenomenon. Figure 4 presents scatter plot of SPI-6 against MEI time series at each climate phase, which depicts the strength of association between the two time series. Using SPI-6 time series, a total of 87 drought events were identified during the study period. Among 87 drought events, 41 are classified under El Niño and 23 are classified under La Niña and the remaining under neutral episodes. Table 1 presents summary statistics of drought without and with accounting ENSO state. From table 1, it can be noticed that the droughts in El Niño episodes are longer and severe in nature, whereas the opposite is observed for La Niña year droughts. Neutral phase droughts lie in between El Niño and La Niña episode droughts in terms of severity and duration. Table 2 presents association between drought properties using Pearson’s linear correlation and the two non-parametric dependence measures – Spearman’s ρ and Kendall’s τ with and without accounting ENSO phases. In all cases, corresponding p-value is less than 0.0001, indicating significant positive association between drought variables. Figure 5 presents scatter plots of pair-wise dependent drought variables at each ENSO state.

Figure 4
figure 4

Scatter plot between SPI-6 and MEI time series at each ENSO phase during study period (1896–2005): (a) El Niño, (b) La Niña, and (c) neutral phases.

Table 1 Drought characteristics without and with accounting ENSO climate phases.
Table 2 Strength of dependence between pair-wise drought characteristics without and with accounting ENSO climate phases.
Figure 5
figure 5

Scatter plots of drought variables: (a) severity-duration (b) severity-peak, and (c) duration-peak pairs for three ENSO climate states – El Niño, La Niña and neutral phases.

4.3 Marginal distribution of drought variables

To fit marginal probability distributions for drought variables from each climatic phase, several parametric distributions are applied and their performance is evaluated using Akaike Information Criterion (AIC). It is found that the drought characteristics duration and peak are best represented by exponential and log-normal distributions, respectively for all the cases (i.e., for droughts with and without ENSO phase considerations). Drought severity is best modelled by Weibull distribution for the case of without ENSO and El Niño phase data, log-normal distribution for La Niña and neutral phase data. Validity of each model is tested using Kolmogorov–Smirnov (KS) goodness-of-fit test. Table 3 lists the distributions and their associated parameters used for fitting marginal distributions of drought characteristics. Figure 6 presents marginal distribution fit of drought variables – severity, duration and peak without accounting ENSO phase information. Figures 7 and 8 present Probability Density Function (PDF) and CDF plots of marginal fit of drought variables under three ENSO states. The probability distribution plots of drought variables show good match between theoretical and empirical distributions.

Table 3 Estimated parameters for marginal distribution of drought variables.
Figure 6
figure 6

Marginal distribution fit of drought variables: (a) severity, (b) duration, and (c) peak without accounting ENSO phase information.

Figure 7
figure 7

Marginal distribution fit of drought variables: (a) severity, (b) duration, and (c) peak, show PDF plots of fitted distributions for each phase of ENSO state.

Figure 8
figure 8

Marginal distribution fit of drought variables: (a) severity, (b) duration, and (c) peak, show CDF plots of fitted distributions for each phase of ENSO state.

4.4 Dependence modelling using copulas

Student’s t copulas with 2 degrees of freedom are found to be adequate for fitting drought variables with and without accounting ENSO climate state information. The associated shape parameter matrix of Student’s t copulas is listed in table 4. The root mean square error (RMSE) between parametric Student’s t and empirical copulas for entire drought data (i.e., without considering ENSO stratification) is found to be 0.031; and the RMSE values for El Niño, La Niña and neutral phase models are 0.035, 0.053 and 0.056, respectively. Figure 9 shows PP plots of trivariate copulas fitted for joint distribution of drought characteristics considering with and without ENSO state information. The PP plot gives the relation between empirical copula C n (∙) and parametric Student’s t copulas C  ∑ ,ϑ . From figure 9, it can be seen that the scatter plots between parametric and empirical copulas are close to the 45° line, which shows that a good correspondence exists between theoretical model and empirical distributions. To test the performance of the Student’s t copula in modelling upper tail dependence, pair-wise TDC (for both parametric and non-parametric estimates) are computed and used for evaluation of copula model efficacy. The parametric upper TDC \(\left( {\hat{{\lambda }}_U^{\rm param} } \right)\) of Student’s t copula is computed using equation (5) and the non- parametric upper TDC \(\left( {\hat{{\lambda }}_U^{\rm CFG} } \right)\) is computed using equation (6). The non-parametric TDC is computed both for observed samples (denoted by \(\hat{{\lambda }}_U^{\rm obs} )\) and for copula-simulated data (denoted by \(\hat{{\lambda }}_U^{\rm CFG} )\). The \(\hat{{\lambda }}_U^{\rm obs} \) is computed for data of observed samples, and the \(\hat{{\lambda }}_U^{\rm CFG}\) is computed for 1000 samples generated from copulas. The non-parametric TDC for copula models is repeated for hundred runs and the corresponding mean \(\hat{{\mu }}\left( {\hat{{\lambda}}_U^{\rm CFG}} \right)\) and standard deviation \(\hat{{\sigma }}\left({\hat{{\lambda }}_U^{\rm CFG}} \right)\) values for various combinations of drought variables are presented in table 5. It can be seen from the table that Student’s t copula is able to capture the observed upper tail dependence satisfactorily.

Table 4 The elements of shape parameter matrix (\(\hat{{\sigma }}_{i,j} )\) of Student’s t copula for different climate states. In \(\hat{{\sigma }}_{i,j} \), i and j takes values of {1,2,3}, which represents three drought characteristics severity, duration and peak, respectively.
Figure 9
figure 9

The PP plots of trivariate copulas, show the scatter plot between empirical copula (C n ) and Student’s t copulas (C  ∑ ,ϑ ): (a) without ENSO-state, (b) at El Niño, (c) at La Niña, and (d) at neutral ENSO phase droughts.

Table 5 Upper tail dependence coefficient of Student’s t copulas for pair-wise dependent drought variables at each ENSO-state drought condition.

4.5 Analysis of trivariate return periods conditional on ENSO-state

The use of tele-connections between ENSO and regional droughts is of great importance for planning and management of water resources projects in the region, as ENSO is one of the major sources of atmospheric variability in global scale. Hence, this study uses the climate tele-connection between ENSO and SPI-6 for ENSO-conditioned meteorological drought-risk analysis for the Western Rajasthan region. The drought probabilities during El Niño, La Niña and neutral years are found to be 0.47, 0.26 and 0.26, respectively. Table 6 presents univariate return periods and its associated drought quantiles, and joint return periods computed using OR as well as AND-case utilizing with and without ENSO state information. Due to limited data on La Niña and neutral years (each have 23 drought events), analysis is restricted to only shorter return periods. From table 6, it can be noticed that the return period in AND-case is always greater than that of the return period in OR-case. A small difference in return periods is observed on accounting without and with climatic state information, which shows that the developed models are consistent, and the uncertainty associated with models (and its associated parameters) are handled satisfactorily.

Table 6 Comparison of trivariate return periods (in years) of drought characteristics without and with accounting ENSO state conditions. The first column gives univariate return period (T) of drought characteristics.

The joint return periods of drought variables conditional on third drought property – T S,D ∣ I , T SI ∣ D and T DI ∣ S are computed using equation (12) (without considering ENSO states) and the corresponding contour plots are presented in figure 10. This figure also shows the superimposed historical (or observed) drought variables, which indicates that most of the historical drought events have shorter return periods. At the same time, there are few historical drought events, which have return periods of more than 100 years.

Figure 10
figure 10

Contour plots of conditional return periods (years) of droughts without accounting ENSO state information: (a) conditional return periods of drought severity and duration at given peak, T SD ∣ I , (b) conditional return periods of drought severity and peak at given duration, T SI ∣ D , and (c) conditional return periods of drought duration and peak at given severity, T D,I ∣ S . Historical droughts events are shown as black dots.

The conditional probability \(P(S\le s\left| D\ge d^\prime ,\right.\) I ≥ i ) of drought severity, given drought duration and peaks exceeding certain thresholds, d and i are computed using equation (13). These conditional probabilities are computed for each ENSO state, where d takes values of duration at 50th, 75th and 90th percentile levels, and i takes values of peak at 50th and 75th percentile levels and are shown in figure 11. From the plots in figure 11, it is possible to know the conditional non-exceedance probability of drought severity, given drought duration and peak exceeding certain values. For all three cases, the skewness of conditional probability curves shows increasing trends. These results can be helpful in assessing drought risks during different phases of ENSO. For example, for a given data of drought properties: severity s = 7, duration d = 4 months and peak i = 1.45, the corresponding conditional probability \(P\left( {S\le s\left| {D\ge d^\prime ,I\ge i^\prime } \right.} \right)\) values estimated for El Niño, La Niña and neutral state droughts are 0.15, 0.66 and 0.44, respectively.

Figure 11
figure 11

The conditional probabilities \(P\left( {S\le s\left| {D\ge d^\prime ,I\ge i^\prime } \right.} \right)\) of drought severity, given drought duration and peak exceeding certain thresholds d and i , respectively for each ENSO state (where d takes values of duration at 75th, 90th and 95th percentile levels, and i takes values of peak at 50th and 75th percentile levels). First row (a) shows the conditional probability curves for i taking values of peak at 50th percentile level, and second row (b) shows the conditional probability curves for i taking values of peak at 75th percentile level for El Niño, La Niña and neutral state droughts.

Similarly, the conditional non-exceedance probabilities \(P\left( {S\le s,I\le i\left| {D\ge d^\prime } \right.} \right)\) of drought severity and peak for a given duration exceeding certain threshold at 50th, 75th and 90th percentiles levels are computed using equation (14) and the corresponding surface plots are presented in figure 12 for each ENSO state condition. Table 7 presents conditional probability \(P\left( {S\le s,I\le i\vert D\ge {d}'} \right)\) of drought variables at different percentile values of drought properties for three ENSO state droughts. From table 7, it can be observed that the magnitude of drought properties are always higher for El Niño phase as compared to La Niña and neutral phases. Also, the conditional probability is increasing in nature for different percentile values of drought variables. Thus, the conditional probabilities of drought characteristics under different ENSO state conditions can be useful for analysing the associated drought risks. The copula-based trivariate modelling of drought characteristics is very useful in estimation of multivariate return periods and probabilistic assessment of drought risks, since the conventional univariate risk analysis of drought characteristics may lead to overestimation or underestimation of associated risks. Thus, by using the information on ENSO-based climate variability in modulating drought risks, it is possible to effectively plan agriculture and manage water resources in the region.

Figure 12
figure 12

Conditional probabilities \(P\left( {S\le s,I\le i\left| {D\ge d^\prime } \right.} \right)\) of drought severity and peak, given duration exceeding certain threshold d for El Niño, La Niña and neutral state droughts. First row (a), second row (b), and third row (c) give the conditional probabilities at drought duration exceeding 50th, 75th and 90th percentile values. Here \(d^{\prime}_{1}\), \(d^{\prime}_{2}\) and \(d^{\prime}_{3}\) correspond to values of drought duration at 50th, 75th and 90th percentiles.

Table 7 Conditional probabilities of drought characteristics for different ENSO state droughts. \(P_{{SI}\left| {{ D}\geq{d}} \right.} \) denotes conditional probability of drought severity and peak for given duration exceeding certain threshold.

5 Conclusion

This study investigated the influence of ENSO-based climate variability on multivariate drought risks in drought-prone region of Western Rajasthan, India. Using multivariate ENSO index, the drought events are partitioned into three climatic states – El Niño, La Niña and neutral episodes. It is observed that during the study period (1896–2005), El Niño year droughts were more severe and longer than La Niña and neutral phase droughts. Different probability distributions are employed for fitting marginal distributions for drought variables and the best-fitted models are selected based on goodness-of-fit test and AIC criteria. To construct multivariate joint distribution of drought characteristics, the elliptical class of trivariate Student’s t-copula family is adopted and used for deriving multivariate return periods of drought properties in two cases, i.e., without and with accounting ENSO state information. Then, the conditional probabilities of drought characteristics are analysed under three ENSO states – El Niño, La Niña and neutral phases. It is found that the droughts during El Niño state are more severe than the other two states. The results of the study suggest that: (i) the conventional univariate risk analysis of drought characteristics may lead to overestimation or underestimation of associated risks, hence, the copula-based estimation of multivariate return periods can be used for effective risk assessment of droughts; (ii) the ENSO-based climate variability can be used for assessing drought risks and appropriate planning of agriculture and water management in the region.