Abstract
With the increasing impact of climate change and anthropogenic activities, drought happens in more areas with higher frequency. In this paper, we calculate the return period and the drought risk in China based on the monthly PDSI, the Palmer Drought Severity Index, data over 188 stations from 1901 to 2010. We use the theory of runs to identify the drought duration and severity. We adopt the kernel density estimation to obtain the marginal distribution function, and the Gumbel Copula function to obtain the joint distribution function. The results show that the return period of the joint distribution for the drought duration and severity can be regarded as the extreme condition of the return period of the marginal distribution for the single factor such as the drought duration or drought severity. Under the same drought severity, the return period of the joint distribution is increasing with the prolonging of the drought duration, and it approaches to the return period of the marginal distribution of the drought severity. Under the extreme drought situation, Haihe River Basin, Huaihe River Basin, Songliao River basin, and rivers in the northwest China have a higher drought risk in future 50 years. The drought risk value in China is increasing with the prolonging of predicting time.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Drought is an extreme event in hydrologic cycle. The occurrences of drought events usually feature determinacy and randomness. With the increasing impact of climate change and anthropogenic activities, drought happens in more areas with higher frequency. Drought issue has become one of the major factors to affect sustainable economic and social development. Drought occurs frequently not only in the northern China with shortage of water resources but also in the southern China with relatively abundant water resources. In recent years, several extreme drought events happened frequently in southwest China and the middle and lower Yangtze River. The annual average affected areas (the areas that crop yields decreased by over 10 % than normal annual yields) and damaged areas (the areas that crop yields decreased by over 30 % than normal annual yields) of drought disasters were nearly 0.21 × 108 km2 and 0.10 × 108 km2 from 1950 to 2010, which were 2.19 times and 1.77 times of the impacts of flood disasters, respectively (State Flood Control and Drought Relief Headquarters 2010). Therefore, it is important to study on the possibilities and impact of the drought duration and severity to cope with drought.
The drought with typical probabilistic characterized (Sen 1980a; Loaiciga and Leipnik 1996; Chung and Salas 2000; Mishra et al. 2009) is one of the hydrological extremes including multiple correlative variables such as the duration or severity of drought. The theory of runs provides a support for probability estimation of drought variable from time domain (Downer et al. 1967; Llamas and Siddiqui 1969; Sen 1976, 1980b; Dracup et al. 1980a, b; Frick et al. 1990; Fernández and Salas 1999a, b). The probabilistic feature analysis of drought mainly includes three aspects at present: (1) univariate analysis (Sen 1980a, b; Güven 1983; Zelenhasic and Salvai 1987; Mathier et al. 1992; Sharma 1995), (2) bivariate analysis (Shiau and Shen 2001; Bonaccorso et al. 2003; Kim et al. 2003; González and Valdés 2003; Salas et al. 2005; Mishra et al. 2009; Song and Singh 2010a, b), and (3) multivariate analysis based on Copula function (Favre et al. 2004; Salvadori and De Michele 2004; Genest and Favre 2007; Poulin et al. 2007; Zhang and Singh 2006, 2007; Bárdossy and Li 2008; Chowdhary and Singh 2010; Shiau 2006; Shiau et al. 2007). The univariate analysis and bivariate analysis with the same marginal posterior could not rightly appreciate the relationship of joint distribution among the droughty multivariate, because of the significant correlation and different distribution among different drought variables. However, the Copula function allows the marginal distributions of each single-factor variables with any form and the various kinds of relationship among variables, which has favorable flexibility and adaptability (Xie and Huang 2008).
In this paper, we use the theory of runs to identify the drought duration and severity. We adopt the kernel density estimation to obtain the marginal distribution function for the drought duration and severity. We adopt the Copula function to obtain the joint distribution function for them. Then, we calculate the return period and the drought risk in China based on the monthly Palmer Drought Severity Index (PDSI) data over 188 stations from 1901 to 2010.
Methodology
Theory of runs
Self-calibrated PDSI with Penman-Monteith PE (sc_PDSI_pm) based on IPCC AR4 (IPCC 2007) 22-model ensembles mean climate under the twentieth century forcing and A1B scenario (Dai et al. 2004; Dai 2011a, b). The detail of the data is as in the studies by Dai (2011a). This study collected monthly sc_PDSI_pm data over 188 stations in China from 1901 to 2010 (Fig. 1).
The drought duration \( D \) is expressed as a drought parameter which is continuously below the critical level. In other words, it is the time period between the initiation and termination of a drought event. That is the positive run length. The drought severity \( S \) indicates a cumulative deficiency of a drought parameter below the critical level. \( {X}_0 \), \( {X}_1 \), and \( {X}_2 \) are thresholds of the PDSI (Table 1). Figure 2 shows that “g” is a drought event because X is more than \( {X}_1 \). “h” is not a drought event because the D is only one unit and X is less than \( {X}_2 \), though it is more than \( {X}_1 \). “p” is a drought event because X is more than \( {X}_1 \), though there is one unit of D below \( {X}_1 \) between \( {D}_1 \) and \( {D}_2 \), say \( D={D}_1+{D}_2+1 \), \( S={S}_1+{S}_2 \). More details can be found in the studies by Lu et al. (2010).
Kernel density estimation
The kernel density estimation is an ordinary non-parameter estimation algorithm (Guo et al. 1996). The kernel probability density function for single variable is
Here, n is the number of the observed value x i ; K(·) is the kernel density estimation function (KDEF); and h is the bandwidth, which decides the variance of the KDEF. Uniform, Triangle, Epanechnikov, and Gaussian are the common kernel density estimation functions (Silverman 1998; Tae-Woong et al. 2003; Lall et al. 1996).
Copula function
Copula function is the function which connects the joint distribution function and their marginal distribution functions together (Wei and Zhang 2008). The dualistic Copula function C(·, ·) has the following properties: (1) C(·, ·) has the domain I 2, that is [0, 1]2; (2) C(·, ·) has the zero basal plane, and is increasing for the two-dimensional scale; and (3) C(u, 1) = u and C(1, v) = v are satisfied for an arbitrary variable. It is assumed that F(x) and G(y) is the continuous one-dimensional distribution function. Let be u = F(x) and v = G(y). Then u and v all obey uniform distribution [0, 1]. Therefore, C(u, v) is a two-dimensional distribution function which the marginal distribution obeys the uniform one [0, 1] and has the property 0 ≤ C(u, v) ≤ 1 for any (u, v) within the definition domain.
Let H(·, ·) be the joint distribution function which has the marginal ones for F(·) and G(·) based on the Sklar theorem (Sklar 1959), then there is a Copula function C(·, ·) which is satisfied that:
If the F(·) and G(·) were continuous, the C(·, ·) will be determined uniquely. If the F(·) and G(·) were the one-dimensional distribution functions and the C(·, ·) was the corresponding Copula function, H(·, ·) defined by Eq. (2) is the joint distribution function which has the marginal distribution functions for F(·) and G(·).
Normal Copula function (Nelsen 2006), t-Copula function (Bouyé et al. 2000; Cherubini et al. 2004), and Archimedes Copula function (Genest and Mackay 1986) are the common dualistic Copula functions, while Gumbel Copula (Frees and Valdez 1998; Patton 2002), Clayton Copula, and Frank Copula are the common dualistic Archimedes Copula functions.
Return period
The return periods of the marginal distribution functions for the drought duration and severity can be calculated from Eq. (3), which is deduced by Shiau and Shen in 2001.
Here, T D and T S are the return periods of the marginal distribution functions for the drought duration and severity, respectively; F D (d) and F S (s) are the marginal distribution functions for the drought duration and severity; and E(L) is the expected value of the drought intervals.
Among all these, the return periods of the joint distribution for the drought duration and severity include two situations: T o (D > d or S > s) and T a (D > d and S > s). T o and T a can be calculated from Eq. (4) proposed by Shiau in 2003, which are the return periods of the joint distribution with double variables based on the Copula function.
Here, F(d, s) is the joint distribution function of the drought duration and severity.
The associated drought risks R can be calculated to impose the probability of extreme drought with T-year return period for N years (Chow et al. 1988).
Results and discussion
Marginal distribution functions of the drought duration and severity
The two main aspects affecting kernel density estimation are the bandwidth and the KDEF. Firstly, the optimal bandwidth and the optimal KDEF for drought duration and severity are determined. Secondly, the Gaussian function as the KDEF is confirmed and the influence of kernel density estimation of different bandwidths is observed. It can be seen that the values and curve shapes of kernel density estimation are quite different under different bandwidths. Take Mainland China, excluding Taiwan, Hong Kong, and Macao, the same below (Fig. 3). With less bandwidth level, as h is 0.3 for the drought duration and h is 0.5 for the drought severity, the curve shapes of kernel density estimation are more flexural and less smooth which can reflect many details. But with bigger bandwidth level, as h is 2.0 for the drought duration and h is 3.0 for the drought severity, the curve shapes of kernel density estimation are much smoother which cover many details. According to Silverman empirical law (Silverman 1998), the bandwidth level of kernel estimation of drought duration is 0.5, and the bandwidth level of kernel estimation of drought severity is 1.0. The level of bandwidth is fixed, as h is 0.5 for the drought duration and h is 1.0 for the drought severity; the kernel functions are chosen, such as Gaussian, Uniform, Triangle, and Epanechnikov; and then the influence of different kernel functions on kernel density estimation is observed. It can be seen that different kernel functions have little influence on kernel density estimation (Fig. 4). But the smooth Gaussian kernel function is better than Uniform. According to Silverman empirical law, Gaussian kernel function is chosen as the final result.
In order to compare the fitting effect of parametric and non-parametric methods, this paper selects three kinds of frequently used distributions, Normality, Index, and Gamma, to estimate the cumulative distribution of drought duration and drought severity, while chooses kernel estimation, as h is 0.5 for the drought duration, h is 1 for the drought severity, and kernel functions are all Gaussian functions. The empirical estimation is Kaplan-Meier. The result of non-parametric kernel estimation can be a better representation for probability density features of non-unimodal type than parametric method (Fig. 5). Table 2 shows the optimal bandwidth level of kernel density estimation and the basic statistics of drought duration and severity in eight river basins. And the kernel functions are all Gaussian.
Joint distribution function of drought duration and severity
The frequency histogram of marginal distribution of drought duration and severity has asymmetric tail taking Mainland China as an example (Fig. 6). The top part of tail is high, and the base is lower. These illustrate that there is more relevant among variables at the top of tail and asymptotic independent at the base of tail. Therefore, we can choose dualistic Gumbel Copula function as the method for describing drought duration and severity.
This paper uses maximum likelihood method for parameter estimation to compare the imitative effect of dualistic Gumbel Copula function, dualistic normal Copula function, dualistic t-Copula function, dualistic Clayton Copula function, and dualistic Frank Copula function. The squared Euclidean distance between the above five kinds of dualistic Copula function and experience Copula function are calculated, respectively. The squared Euclidean distance between dualistic normal Copula function with the linear related parameter of 0.95 and experience Copula function is 0.40. The squared Euclidean distance between dualistic t-Copula function with the linear related parameter of 0.95 and degree of freedom of 18 and experience Copula function is 0.22. The squared Euclidean distance between dualistic Clayton Copula function with the parameter of 4.12, dualistic Frank Copula function with the parameter of 18.50, dualistic Gumbel Copula function with the parameter of 4.98, and the experience Copula function is 8.91, 0.40, and 0.15, respectively. At the distance standard of squared Euclidean, dualistic Gumbel Copula function could better fit the dependency structure of drought duration and severity. This result is consistent with the result obtained by the frequency histogram of marginal distribution of drought duration and severity.
The Gumbel Copula function of drought duration and severity has the smallest squared Euclidean distance in all basins except the Yangtze and Songliao River Basin (Fig. 7). And the t-Copula function has the smallest squared Euclidean distance in these two basins. But the difference of squared Euclidean distance between t-Copula function and Gumbel Copula function is small in Yangtze and Songliao River Basin. Therefore, in order to calculate conveniently, the joint distribution function of drought duration and severity in the eight basins is Gumbel Copula function. The relevant parameters are shown in Table 3.
Return period of drought duration and severity
The univariate return periods of marginal distribution of drought duration and severity are between T o and T a . The two kinds of return periods of joint distribution can be seen as two extreme cases of marginal distribution (Fig. 8). Take Mainland China as an example. When drought duration is 5 months, and drought severity is 10 mm, the marginal distribution return periods of drought duration and severity are 51 and 52 months, and the joint distribution ones are 46 and 58 months. Under these conditions, the joint distribution of drought duration and severity in Yellow River and Songliao River basins has the biggest return period, in which T a is 69 months, and has the smallest return period in southwestern region in which T a is 42 months (Table 4).
At the same conditions of drought severity, the return period of joint distribution, T a , is increasing continuously with the extension of drought duration. And the end of joint distribution, return period is gradually becoming the marginal distribution of drought severity (Fig. 9).
Drought risk analysis
The drought risk map taking basin as the spatial scale can better reflect the spatial variation of drought risk compared with taking Mainland China as the spatial scale (Fig. 10). In order to compare the space characteristic of drought risk, this paper chooses the sub-basin as the spatial scale to build respective models of drought risk identification. The results are more precise than those for the whole basin. And this illustrates that it is more suitable for drought risk assessment. Figure 11 is the drought risk map with the spatial scale of basin. At the extreme drought situation, the basins with high risk of drought in the next 50 years are Haihe, Huaihe, Songliao, and rivers in the northwest China. In each river basin, the drought risk value is increasing with the increase of forecasting time.
Conclusions
In this paper, we calculate the return period and the drought risk in China based on the monthly PDSI data over 188 stations from 1901 to 2010. We use the theory of runs to identify the drought duration and severity. We adopt the kernel density estimation to obtain the marginal distribution function, and the Gumbel Copula function to obtain the joint distribution function. After confirming the marginal distribution function of drought duration and severity, we compare the influence of bandwidth and kernel function on kernel estimation, respectively, and the fitting effect of kernel and parameter estimation. After confirming the joint distribution function of drought duration and severity, we compare the fitting effect of dualistic Gumbel Copula function, dualistic normal Copula function, dualistic t-Copula function, dualistic Clayton Copula function, and dualistic Frank Copula function which are used frequently in hydrologic and meteorological fields. We compare the return period of univariate marginal distribution and bivariate joint distribution when the joint distribution return period of drought duration and severity are calculated. We compare the spatial difference of drought risk calculated based on Mainland China and river basin, respectively.
The bandwidths of drought duration about eight major river basins of China are between 0.66 and 1.09, and the bandwidths of drought severity are between 1.45 and 2.43. Kernel functions are Gaussian. The joint distribution functions of eight major river basins are all Gumbel Copula, and their parameters are between 4.79 and 6.09. The return period of bivariate joint distribution can be treated as two extreme return period cases of univariate marginal distribution. Under the same conditions of drought severity, the return period of bivariate joint distribution, T a , is increasing continuously with the continuous extension of drought duration. Its end part is gradually becoming the return period of drought severity marginal distribution. The drought risk map taking river basin as the spatial scale can better reflect the spatial variation of drought risk compared with taking Mainland China as the spatial scale. At the extreme drought situation, the basins with high risk of drought in the next 50 years are Haihe, Huaihe, Songliao, and rivers in the northwest China. In each river basin, the drought risk value is increasing with the increase of forecasting time. The research results of this paper can provide some basic supports to cope with drought.
References
Bárdossy A, Li J (2008) Geostatistical interpolation using copulas. Water Resour Res 44, W07412. doi:10.1029/2007WR006115
Bonaccorso B, Cancelliere A, Rossi G (2003) An analytical formulation of return period of drought severity. Stoch Env Res Risk A 17(3):157–174
Bouyé E, Durrleman V, Nikeghbali A et al (2000) Copulas for finance: a reading guide and some applications. Financial Econometrics Research Centre, City University Business School, London
Cherubini U, Luciano E, Vecchiato W (2004) Copula methods in finance. John Wiley & Sons Ltd., England
Chow VT, Maidment DR, Mays LW (1988) Applied hydrology. McGraw-Hill Book Company, New York
Chowdhary H, Singh VP (2010) Reducing uncertainty in estimates of frequency distribution parameters using composite likelihood approach and copula-based bivariate distributions. Water Resour Res 46, W11516. doi:10.1029/2009WR008490
Chung CH, Salas JD (2000) Return period and risk of droughts for dependent hydrologic processes. J Hydrol Eng 5(3):259–268
Intergovernmental Panel on Climate Change, Fourth Assessment Report (IPCC AR4) (2007) Climate Change Synthesis Report. www.ipcc.ch/ipccreports/av4-syr.htm
Dai A (2011a) Characteristics and trends in various forms of the Palmer Drought Severity Index (PDSI) during 1900-2008. J Geophys Res 116, D12115. doi:10.1029/2010JD015541
Dai A (2011b) Drought under global warming: a review. Wiley Interdiscip Rev Clim Chang 2:45–65. doi:10.1002/wcc.81
Dai A, Trenberth KE, Qian T (2004) A global data set of Palmer Drought Severity Index for 1870-2002: relationship with soil moisture and effects of surface warming. J Hydrometeorol 5:1117–1130
Downer R, Siddiqui M, Yevjevich V (1967) Application of runs to hydrologic droughts. In: Proc., Int. Hydrology Symp., Colorado State Univ., Fort Collins, Colo., 63(1): 496–505
Dracup JA, Lee KS, Paulson EG (1980a) On the statistical characteristics of drought events. Water Resour Res 16(2):289–296
Dracup JA, Lee KS, Paulson EG (1980b) On the definition of droughts. Water Resour Res 16(2):297–302
Favre AC, El Adlouni S, Perreault L, Thiémonge N, Bobée B (2004) Multivariate hydrological frequency analysis using copulas. Water Resour Res 40, W01101. doi:10.1029/2003WR002456
Fernández B, Salas JD (1999a) Return period and risk of hydrologic events. I: mathematical formulation. J Hydrol Eng 4(4):297–307
Fernández B, Salas JD (1999b) Return period and risk of hydrologic events. II: applications. J Hydrol Eng 4(4):308–316
Frees EW, Valdez EA (1998) Understanding relationships using Copulas. N Am Actuar J 2(1):1–25
Frick DM, Bode D, Salas JD (1990) Effect of drought on urban water supplies. I: drought analysis. J Hydrol Eng 116:733–753
Genest C, Favre AC (2007) Everything you always wanted to know about copula modeling but were afraid to ask. J Hydrol Eng 12(4):347–368
Genest C, Mackay J (1986) The joy of copulas: bivariate distributions with uniform marginals. Am Stat 40:280–283
González J, Valdés JB (2003) Bivariate drought recurrence analysis using tree ring reconstructions. J Hydrol Eng 8(5):247–258
Guo SL, Kachroo RK, Mngodo RJ (1996) Non-parametric kernel estimation of low flow quantiles. J Hydrol 185:335–348
Güven O (1983) A simplified semiempirical approach to probabilities of extreme hydrologic droughts. Water Resour Res 19(1):441–453
Kim T, Valdés JB, Yoo C (2003) Nonparametric approach for estimating return periods of droughts in arid regions. J Hydrol Eng 8:237–246
Lall U, Rajagopalan B, Tarboton DG (1996) A non-parametric wet/dry spell model for resampling daily precipitation. Water Resour Res 32(9):2803–2823
Llamas J, Siddiqui M (1969) Runs of precipitation series. Hydrology Paper 33. Colorado State University, Fort Collins, Colorado
Loaiciga HA, Leipnik RB (1996) Stochastic renewal model of low-flow streamflow sequences. Stoch Hydrol Hydraul 10(1):65–85
Lu GH, Yan GX, Wu ZY et al (2010) Regional drought analysis approach based on copula function. Adv Water Sci 21(2):188–193 (in Chinese)
Mathier L, Perreault L, Bobée B, Ashkar F (1992) The use of geometric and gamma-related distributions for frequency analysis of water deficit. Stoch Hydrol Hydraul 6:239–254
Mishra AK, Singh VP, Desai VR (2009) Drought characterization: a probabilistic approach. Stoch Env Res Risk A 23(1):41–55
Nelsen RB (2006) An introduction to copulas. Springer, New York
Patton AJ (2002) Skewness, asymmetric dependence, and portfolios. Working paper of London School of Economics & Political Science
Poulin A, Huard D, Favre AC, Stéphane P (2007) Importance of tail dependence in bivariate frequency analysis. J Hydrol Eng 12(4):394–403
Salas JD, Fu C, Cancelliere A, Dustin D, Bode D, Pineda A, Vincent E (2005) Characterizing the severity and risk of drought in the Poudre River, Colorado. J Water Resour Plan Manag 131(5):383–393
Salvadori G, De Michele C (2004) Frequency analysis via copulas: theoretical aspects and applications to hydrological events. Water Resour Res 40, W12511. doi:10.1029/2004WR003133
Sen Z (1976) Wet and dry periods for annual flow series. J Hydraul Eng - ASCE 102:1503–1514
Sen Z (1980a) Regional drought and flood frequency analysis, theoretical consideration. J Hydrol 46:265–279
Sen Z (1980b) Statistical analysis of hydrologic critical droughts. J Hydraul Eng - ASCE 106(1):99–115
Sharma T (1995) Estimation of drought severity on independent and dependent hydrologic series. Water Resour Manag 11:35–49
Shiau JT (2003) Return period of bivariate distributed hydrological events. Stoch Env Res Risk A 17(1/2):422–571
Shiau JT (2006) Fitting drought duration and severity with two-dimensional copulas. Water Resour Manag 20:795–815
Shiau JT, Shen HW (2001) Recurrence analysis of hydrologic droughts of differing severity. J Water Resour Plan Manag 127:30–40
Shiau JT, Feng S, Nadarajah S (2007) Assessment of hydrological droughts for the Yellow River, China, using copulas. Hydrol Process 21:2157–2163
Silverman BW (1998) Density estimation for statistics and data analysis. Chapman & Hall/CRC, London, ISBN 0-412-24620-1
Sklar A (1959) Fonctions de repartition àn dimensions et leurs marges. Publ Inst Stat Univ Paris 8:229–231
Song SB, Singh VP (2010a) Frequency analysis of droughts using the Plackett copula and parameter estimation by genetic algorithm. Stoch Env Res Risk A 24:783–805
Song SB, Singh VP (2010b) Meta-elliptical copulas for drought frequency analysis of periodic hydrologic data. Stoch Env Res Risk A 24:425–444
State Flood Control and Drought Relief Headquarters (2010) Bulletin of flood and drought disasters in China. China Ministry of Water Resources, Beijing (in Chinese)
Tae-Woong K, Juan BV, Chulsang Y (2003) Non-parametric approach for estimation return periods of droughts in arid regions. J Hydrol Eng 8(5):237–246
Wei YH, Zhang SY (2008) Copula theory and its application in financial analysis. Tsinghua University Press, Beijing
Xie H, Huang JS (2008) A review of bivariate hydrological frequency distribution. Adv Water Sci 19(3):443–452 (in Chinese)
Zelenhasic E, Salvai A (1987) A method of streamflow drought analysis. Water Resour Res 23:156–168
Zhang L, Singh VP (2006) Bivariate flood frequency analysis using the copula method. J Hydrol Eng 11(2):150–164
Zhang L, Singh VP (2007) Gumbel–Hougaard copula for trivariate rainfall frequency analysis. J Hydrol Eng 12(4):409–419
Acknowledgments
This work was supported by the National Science and Technology Support Program Project (2012BAC19B03, 2013BAC10B01) and the General Program of the National Natural Science Foundation of China (51279207, 51409266).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Weng, B., Zhang, P. & Li, S. Drought risk assessment in China with different spatial scales. Arab J Geosci 8, 10193–10202 (2015). https://doi.org/10.1007/s12517-015-1938-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12517-015-1938-9