1 Introduction

Changes in precipitation extremes in both time and space have a strong influence on human society. Higher frequencies of weather extremes (e.g., Easterling et al. 2000; Alan et al. 2003) pose a potential threat to the at-risk populations. Tropical storms, floods, and droughts affect human welfare directly through catastrophic damage or indirectly through adverse effects on crop productivity (e.g., Huntington 2006). In recent years, climate changes and related impacts on human society, particularly on agriculture and food security, have received an unprecedented attention (e.g., Schmidhuber and Tubiello 2007; MacDonald 2010; Piao et al. 2010) because of the significant role of the availability, accessibility, and security of food and water in the stability and sustainability of human society. Moreover, investigation of precipitation changes could be the first step into thorough understanding of variations of hydrological cycles in the backdrop of global climate changes (e.g., Allen and Ingram 2002; Alan et al. 2003; Dore 2005). All these could be the reasons that the precipitation changes have drawn overwhelmingly increasing concerns from researchers around the world in recent decades (Fatichi and Caporali 2009), and it is particularly true for the precipitation extremes (Easterling et al. 2000; Emori and Brown 2005; Zhang et al. 2011, 2012).

The Yangtze River (Fig. 1), being the longest river in China and the third longest river in the world, plays an important role in the sustainable development in China. However, frequent floods in the Yangtze River basin inflicted considerable loss on economy and human life. In 1998, disastrous floods occurred to large parts of the basin, which was the largest flood since 1954. The economic loss was 166 billion RMB (nearly 28 billion US dollars). This flood hazard was the direct result of unusually high precipitation that occurred between June and August (670 mm) due to a strong El Niño event (Yin and Li 2001). Therefore, more and more investigations have been conducted on floods and the possible mechanisms behind (e.g., Zhang et al. 2006, 2007). Extreme precipitation events are the main causes for the flood hazards in the Yangtze River basin (Zhang et al. 2005).

Fig. 1
figure 1

Locations of the Yangtze River basin and selected meteorological stations

The southern part of the Yangtze River basin is climatically close to tropical climate and the northern part to the temperate zone. The annual mean temperature in the southern and northern parts of the middle and lower Yangtze River basin is roughly 19 and 15 °C, respectively; and the mean annual precipitation in the basin varies from 300 to 500 mm in the western region to 1,600–1,900 mm in the southeastern region and the precipitation is mostly concentrated in the summer season (from June to August), accounting for nearly half of the annual total precipitation amount (Guo et al. 2012). In this regard, uneven distribution of precipitation changes is one of the major causes behind the frequent floods and droughts across the Yangtze River basin. Therefore, regionalization of precipitation extremes and the related statistical behaviors do definitely merit further study. There are some studies pertaining to the precipitation extremes in the Yangtze River basin (e.g., Su et al. 2005). Zhang et al. (2008) analyzed the changing characteristics of precipitation maxima using Mann–Kendall trend test and explored the possible causes for the changes by using NCEP/NCAR reanalysis dataset. The research results indicate changes of precipitation maxima from relatively stable patterns to the significant increasing/decreasing trend in the mid-1970s. With respect to the intra-annual variability, the rainy days are decreasing and precipitation intensity is increasing, and significant increasing trend of precipitation intensity was detected in the middle and lower Yangtze River basin.

However, the regional frequency analysis of precipitation extremes with the state-of-art L-moments techniques has not been conducted in the whole Yangtze River Basin, except similar study in the Pearl River basin (Yang et al. 2010), as a highly complicated river system that encompasses a large area with various random and systematic variations so far. This is definitely instrumental in terms of sound understanding of spatial and temporal variations of precipitation extremes, and is also greatly helpful for understanding of occurrence of floods and droughts in both space and time. Therefore, efforts should be made to carry out the regional frequency analysis of extreme precipitation in the basin and this is the major motivation of this study.

The major objectives of this study are to: (1) identify and delineate the hydrological homogeneous regions for precipitation extremes defined in this study; (2) determine the best probability distribution for rainfall extremes, conduct regional frequency analysis with uncertainty assessment including the corresponding error bounds and root mean squared error using the L-moments; and (3) characterize the spatial–temporal patterns of extreme precipitation events in order to reveal the underlying impacts of climate variations dominated in the Yangtze River basin. The study results will definitely benefit further development of understanding of the unique and complex features of extreme rainfall in the Yangtze River basin, the largest river basin in China, and hence the basin-scale water resources management in the backdrop of changing climate.

2 Data

Daily precipitation data covering 1960–2009 were collected at 141 precipitation stations and the precipitation dataset has been provided by the National Climate Center of the China Meteorological Administration. The locations of the stations are shown in Fig. 1. There are 42 precipitation stations containing missing daily precipitation data, as illustrated in circles with dots (Fig. 1). We excluded the stations with missing values for a month or longer and the largest missing data rate is only 1.09 %. The missing data were filled by the long-term average of the same days of other years. Given the objectives of this study, the gap-filling method, which was similarly employed by Zhang et al. (2011), does not significantly affect the final results. As for precipitation extremes, Alexander et al. (2006) has set an exact definition and formula for 27 extreme daily precipitation and temperature indices and there are 11 extreme daily precipitation indices, involving five different categories that are percentile-based indices, absolute indices, threshold indices, duration indices, and so on. In this study, four indices are selected: (1) the annual number of heavy precipitation days with precipitation >10 mm (R10); (2) the annual amount of rainfall exceeding the 95th (R95P) percentiles, and the 95th percentile of precipitation was defined on the wet days (defined as the rainy day with daily precipitation larger than 1 mm) in a 30-year period, and is selected during the period of 1971–2000; (3) maximum 1-day precipitation amount (RX1DAY); and (4) maximum 5-day precipitation amount (RX5DAY).

3 Methodology

Regional frequency analysis based on L-moment method has been used to characterize the spatial pattern of precipitation extremes within the Yangtze River basin. The methods are presented in this section, including the stationarity test, serial correlation check, L-moments approach, regional frequency analysis based on the L-moment, and spatial analysis.

3.1 Stationarity test and serial correlation check

Both stationarity and independence are important underlying assumptions inherent in frequency analysis, thus the analysis without stationarity and serial correlation tests may lead to incorrect results and conclusions (Yang et al. 2010). Therefore, it is beneficial to conduct stationarity and serial correlation tests prior to the regional frequency analysis. A stationary series is usually defined as constant mean, variance and autocorrelation, etc., and in this paper we mean a flat-looking series. The trend test is one of the most popular methods for examining the stationarity in hydrological series. With the advantage of not requiring any distribution assumptions in the data while having the same power as its parametric counterparts, the rank-based Mann–Kendall method (Mann 1945; Kendall 1975) is recommended by the World Meteorological Organization for assessing the significance of monotonic trends in hydrological series (Mitchell et al. 1966). The effect of the serial correlation on the Mann–Kendall (MK) test was eliminated using the pre-whitening technique developed by Yue et al. (2002) in this study.

The serial correlation check was carried out mainly by examining the autocorrelation coefficients of the time series. When the autocorrelation coefficients of different lag times, calculated for a time series, are within the 95 % confidence level, the observations in this time series can be accepted as being independent from each other. In this study according to the calculated autocorrelation coefficients of lag-1, for each annual series, the observations in that series can be accepted as being independent at the 95 % significance level. In the paper, for the stations that are not stationary or independent are excluded in the dataset for the regional frequency analysis of extreme daily precipitation index.

3.2 L-moments theory

L-moments have the theoretical advantages over conventional moments for being able to characterize a wider range of distributions and being more robust to the presence of outliers in the data. Details about the L-moments approach can be found in Hosking and Wallis (1997). In brief, it is a modification of the probability weighted moments method with the advantage of offering a description of the shape of a probability distribution by L-skewness and L-kurtosis (Yang et al. 2010). The L-moment is a linear combination of the probability weighted moments. The sample L-moment ratios are defined as

$$ t={l}_2/{l}_1\kern0.5em \mathrm{and}\kern0.5em {t}_r={l}_r/{l}_2,\kern0.5em r=3,4,\dots $$
(1)

where l r being the unbiased rth L-moments, being analogue to the traditional ratios, that is, t is the coefficient of variation (L-CV), t 3 the L-skewness, and t 4 the L-kurtosis. The L-moment ratios will be used for homogeneity analysis in the regional frequency analysis.

Compared with conventional moments, L-moments have less bias in estimation and their asymptotic are closer to the normal distribution in finite samples. And the L-moments approach covers the characterization of probability distributions, the summary of observed data samples, the fitting of probability distributions to data, and the testing of the distributional form (e.g., Yang et al. 2010).

3.3 Regional frequency analysis based on L-moments method

Suppose that there are N sites in a region with sample size n 1, n 2, . . . , n N , respectively. And the sample L-moment ratios (L-CV, L-skewness, and L-kurtosis) at site i are denoted by t (i), t 3 (i), and t 4 (i), respectively. Then the regional weighted average L-moment ratios are given by:

$$ \overline{t}={\displaystyle \sum_{i=1}^N{n}_i{t}^{(i)}}/{\displaystyle \sum_{i=1}^N{n}_i}\kern0.20em \mathrm{and}\kern0.24em {\overline{t}}_r={\displaystyle \sum_{i=1}^N{n}_i{t}_r^{(i)}}/{\displaystyle \sum_{i=1}^N{n}_i}\mathrm{r}=3,4,\dots $$
(2)

The regional frequency analysis using L-moments consists of five steps (Hosking and Wallis 1993, 1997): (1) identification of homogenous regions by cluster analysis; (2) screening of the data using the discordancy measure D i ; (3) homogeneity testing using the heterogeneity measure H; (4) distribution selection using the goodness-of-fit measure Z; and (5) regional estimation of precipitation quantiles using the L-moment approach. These five steps were followed to conduct a regional frequency analysis for the Yangtze River basin and the statistical methods employed are introduced below.

3.3.1 Identification of homogenous regions by cluster analysis

In this study, five variables are adopted to characterize the precipitation changes, i.e., latitude, longitude, elevation, the mean annual precipitation, and the standard deviation of annual precipitation. And then the Fuzzy c-mean based on the five variables was used to cluster the stations in the Yangtze River basin with the number of clusters identified based on the cluster validity index for the fuzzy c-mean as proposed by Wu and Yang (2005).

The output from the cluster analysis is not the final results. Subjective adjustments can often be found to improve the physical coherence of regions and to reduce the heterogeneity of regions as measured by the heterogeneity measure. Several adjustments of regions may be recommended (Hosking and Wallis 1997): (1) move a site or a few sites from one region to another; (2) delete a site or a few sites from the dataset; (3) subdivide the region; (4) break up the region by reassigning its sites to other regions; (5) merge the region with another or others; (6) merge two or more regions and redefine groups; and (7) obtain more data and redefine groups.

3.3.2 Screening the data using the discordancy measure

Let u i  = [t(i),t (i)3 ,t (i)4 ]T be the vector containing the t, t 3, and t 4 values for site i where the superscript T denotes transposition of a vector or matrix. And the (unweighted) regional average is defined as (Hosking and Wallis 1993):

$$ \overline{u}={\displaystyle \sum_{i=1}^N{u}_i}/N $$
(3)

Then the discordancy measure for site i is defined as (Hosking and Wallis 1993):

$$ {D}_i=\frac{1}{3}N{\left({u}_i-\overline{u}\right)}^T{A}^{-1}\left({u}_i-\overline{u}\right) $$
(4)

where \( A={\displaystyle \sum_{i=1}^N\left({u}_i-\overline{u}\right){\left({u}_i-\overline{u}\right)}^T} \).

Obviously, a large value of D i indicates the discordancy of site i with other sites. Hosking and Wallis (1997) found that there was no single fixed number which can be considered to be a “large” D i value and suggested some critical values for discordancy test which are dependent on the number of sites in the study region.

3.3.3 Homogeneity testing using the heterogeneity measure

The regional average L-CV, L-skewness, and L-kurtosis, represented by t R, t R3 , and t R4 , respectively, are computed as (Hosking and Wallis 1993; Yang et al. 2010):

$$ \begin{array}{l}{t}^R={\displaystyle \sum_{i=1}^N{n}_i{t}^{(i)}/{\displaystyle \sum_{i=1}^N{n}_i}}\hfill \\ {}{t}_3^R={\displaystyle \sum_{i=1}^N{n}_i{t}_3^{(i)}/{\displaystyle \sum_{i=1}^N{n}_i}}\hfill \\ {}{t}_4^R={\displaystyle \sum_{i=1}^N{n}_i{t}_4^{(i)}/{\displaystyle \sum_{i=1}^N{n}_i}}\hfill \end{array} $$
(5)

where \( {n}_i/{\displaystyle \sum_{i=1}^N{n}_i} \) denotes the weight applied to sample L-moment ratios at site i. And the regional average mean l R1 is, without loss of generality, set to be 1. As Hosking and Wallis (1997) found that H 2 is a weaker test of heterogeneity than H 1, the measures of dispersion in this study is defined as (Hosking and Wallis 1997; Yang et al. 2010):

$$ {V}_1={\left\{{\displaystyle \sum_{i=1}^N{n}_i{\left[{t}^{(i)}-{t}^{(R)}\right]}^2/{\displaystyle \sum_{i=1}^N{n}_i}}\right\}}^{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$2$}\right.} $$
(6)

Let μ v and σ v denote the mean and standard deviation values of V 1 computed on the basis of a large number of simulated homogeneous regions. Hence the statistic that measures the heterogeneity of a set of sites is given by:

$$ {H}_1=\frac{\left({V}_1-{\mu}_v\right)}{\sigma_v} $$
(7)

In order to obtain reliable values of μ v and σ v , the number N sim of simulations needs to be large and N sim = 1,000 was used in this study, and these were calculated using the R package lmomRFA maintained by Hosking (2012a). The region is regarded to be “acceptably homogeneous” if H 1 < 1, “possibly heterogeneous” if 1 ≤ H 1 < 2, and “definitely heterogeneous” if H 1 > 2. In this study a region has been accepted as homogeneous if H 1 < 2.

3.3.4 Distribution selection using the goodness-of-fit measure

For each candidate distribution, the goodness-of-fit measure:

$$ {Z}^{\mathrm{DIST}}=\left({\tau}_4^{\mathrm{DIST}}-{\overline{t}}_4+{\beta}_4\right)/{\sigma}_4 $$
(8)

was used, as suggested by Hosking and Wallis (1993) using the L-kurtosis, where τ DIST4 is the L-kurtosis of the fitted distribution to the data using the candidate distribution, and:

$$ {\beta}_4={\displaystyle \sum_{m=1}^{N_{\mathrm{sim}}}\left({\overline{t}}_4^{\mathrm{m}}-{\overline{t}}_4\right)/{N}_{\mathrm{sim}}} $$
(9)

is the bias of \( {\overline{t}}_4 \) estimated using the simulation technique as before with \( {\overline{t}}_4^m \) being the sample L-kurtosis of the mth simulation, and:

$$ {\sigma}_4={\left\{{\left({N}_{\mathrm{sim}}-1\right)}^{-1}\left[{\displaystyle \sum_{m=1}^{N_{\mathrm{sim}}}\left({\overline{t}}_4^m-{\overline{t}}_4\right)-{N}_{\mathrm{sim}}{\beta}_4^2}\right]\right\}}^{\frac{1}{2}} $$
(10)

is the standard deviation of \( {\overline{t}}_4 \). The fit is considered to be adequate if |Z DIST| is sufficiently close to zero, and a reasonable criterion being |Z DIST| ≤1.64.

As the goodness-of-fit test allows only to decide to keep a distribution or not but does not allow to select among the distributions kept, then if more than one candidate distribution is acceptable, the L-moment ratio diagram has been used to identify the distribution by comparing its closeness to the L-skewness and L-kurtosis combination in the L-moment ratio diagram furthermore.

3.4 Spatial interpolation

To understand the spatial patterns of statistical characteristics of extreme precipitation regimes across the Yangtze River basin, the geostatistical or stochastic methods were used as they capitalize on the spatial correlation between neighboring observations to predict attributed values at unsampled locations (Goovaerts 1999). And Goovaerts (1999) indicated that the major advantage of the Kriging method over other simpler interpolation methods is that sparsely sampled observations of the primary attribute can be complemented by secondary attributes that are more densely sampled. Therefore, the Kriging interpolation method has been used in this study to characterize the spatial patterns of the extreme precipitation for the study region.

4 Results and discussion

4.1 Regionalization of precipitation extremes based on L-moment technique

The topographical features and the climate types of the Yangtze River basin are complex. The elevation of the Yangtze River basin is gradually decreasing from the northwest to the southeast and the river basin is generally affected by the southeast monsoon with the southern part of the basin is climatically close to tropical climate and the northern part to the temperate zone. The precipitation changes are far from being spatially homogeneous, so different precipitation climate patterns were identified. In this case, the homogeneous regions were first differentiated using the fuzzy c-means clusters in terms of the latitude, longitude, elevation, mean annual precipitation, and the standard deviation of annual precipitation. Generally, six homogeneous regions were identified based on the cluster validity index for the fuzzy c-mean proposed by Wu and Yang (2005) and the results of the cluster validity index are not shown in the paper.

As suggested by Yang et al. (2010), it is beneficial to make stationarity and serial correlation tests prior to the regional frequency analysis. And the Mann–Kendall test was conducted on the observations of precipitation extremes (R10, R95P, RX1DAY, and RX5DAY) for all of the stations in the study basin. For the stations with trends being significant at >95 % confidence level are considered as not stationary and excluded from the dataset analyzed in this study and the results of stations with significant trend at >95 % confidence level are shown in Fig. 2. Besides, the stations are considered as not independent if the autocorrelation coefficients of lag-1 are beyond the 95 % confidence level and the results of the stations, considered as not independent are also shown in Fig. 2. Similarly, for the stations considered as not independent will also be removed from the dataset for analysis.

Fig. 2
figure 2

Homogeneous regions clustered by the fuzzy c-means and the stations being not stationary or independent, where a for R10; b for R95P; c for RX1DAY, and d for RX5DAY

Then the discordance test was done for each homogeneous region of R10, R95P, RX1DAY, and RX5DAY, respectively. Results of the initial cluster show that there are some sites being discordant with other sites within a homogeneous region. For these discordant stations, subjective adjustments are often necessary to improve the physical coherence of regions and to reduce the heterogeneity of regions as measured by the heterogeneity measure, and these can be done as illustrated in Section 3.3.1. After several adjustments, the final results of the identified homogeneous regions for each extreme precipitation index are obtained and illustrated in Fig. 3, which shows that the entire Yangtze River basin can be categorized into six homogeneous regions for each extreme daily precipitation index. The homogeneity measure H 1 defined in Eq. 7 was calculated for the detected homogeneous regions and the results (Fig. 3) indicate that the homogeneous regions identified for each extreme daily precipitation index are corroborated statistically to be homogeneous with H 1 < 2. And for the sites discordant with the other sites in a homogenous region are also illustrated in Fig. 3.

Fig. 3
figure 3

Homogeneous regions identified for each extreme daily precipitation index and the sites discordant with the other sites in a homogenous region are also illustrated by circles with dots, where a for R10, b for R95P, c for RX1DAY, and d for RX5DAY

4.2 Selection of probability functions

Within each homogeneous region for R10, R95P, RX1DAY, and RX5DAY, the regional distribution of each extreme daily precipitation index was fitted by the general normal (GNO, three parameters); Pearson type III (PE3, three parameters), general extreme-value (GEV, three parameters), generalized logistic (GLO, three parameters), and generalized Pareto (GPA, three parameters) distributions and these distributions are introduced in the Appendix. The goodness-of-fit test was carried out to generate the results as shown in Table 1, where the criterion value was 1.64 for the absolute value of goodness-of-fit measure statistic Z (Hosking and Wallis 1997), corresponding to acceptance of the hypothesized distribution at a confidence level of 90 %. As the criterion that the absolute value of Z less than 1.64 is used only to decide whether or not a distribution should be accepted but does not allow to decide the probability distributions with the highest goodness-of-fit value. In this case, the L-moment ratio diagram is also used to scrutinize the distribution by comparing its closeness to the L-skewness and L-kurtosis combination in the L-moment ratio diagram. The L-moment ratio plot for R10, R95P, RX1DAY, and RX5DAY at six homogeneous regions for extreme daily precipitation index R10, R95P, RX1DAY, and RX5DAY are illustrated in Figs. 4, 5, 6, and 7, respectively.

Table 1 Goodness-of-fit test of regional distribution for each homogeneous region
Fig. 4
figure 4

L-moment ratio plot for R10 at six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Fig. 5
figure 5

L-moment ratio plot for R95P at six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Fig. 6
figure 6

L-moment ratio plot for RX1DAY at six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Fig. 7
figure 7

L-moment ratio plot for RX5DAY at six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Table 1 and Figs. 4, 5, 6, and 7 indicate that the regional distributions with the highest goodness-of-fit performance for R10, R95P, RX1DAY, and RX5DAY for each homogeneous region are displayed in Table 2. It can be observed that PE3, GEV, and GNO perform well in fitting regional precipitation extremes. And then the regional growth curves of R10, R95P, RX1DAY, and RX5DAY for each homogeneous region are demonstrated in Figs. 8, 9, 10, and 11, respectively, and the 99 % error bounds are also shown in these figures. It can be identified from these figures that the quantile estimates are valid and justified to be used when return periods are less than 100 years. Meanwhile, Fig. 8 also shows that the regional growth curve of R10 for each homogeneous region is convex, i.e., the precipitation increments in all of the homogeneous region are generally decreasing when occurrence frequencies increase for precipitation extremes with return periods of 5, 10, 50, and 200 years. This means that the extreme values of R10 deviate little from the mean or the median in the Yangtze River basin. As the skewness is a measure of the asymmetry of the probability distribution of a real world random variable; and kurtosis is a measure of the “peakedness” of the probability distribution of a real world random variable, it can be seen from the L-moment ratio diagram of R10 (Fig. 4) that the regional average L-skewness and L-kurtosis are also small, which is in good agreement with the results of regional growth curves.

Table 2 The best-fitted regional distributions of R10, R95P, RX1DAY, and RX5DAY for each homogeneous region
Fig. 8
figure 8

Estimated regional growth curves of R10, with 99 % error bands for six homogeneous regions and A, B, C, D, E and F are for regions I to VI, respectively

Fig. 9
figure 9

Estimated regional growth curves of R95P, with 99 % error bands for six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Fig. 10
figure 10

Estimated regional growth curves of RX1DAY, with 99 % error bounds for six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

Fig. 11
figure 11

Estimated regional growth curves of RX5DAY, with 99 % error bands for six homogeneous regions and a, b, c, d, e, and f are for regions I to VI, respectively

It can be seen from Fig. 9 that the regional growth curve of RX95P for homogeneous regions I to IV are evidently convex; but for regions V and VI, the regional growth curves are nearly the straight lines, especially for region VI. Similarly, this can also be discerned from the L-moment ratio diagram for R95P (Fig. 5), which shows that the regional average L-skewness and L-kurtosis are small in regions I to IV, while larger in region V and region VI. This means that the values of R95P in region V are in more asymmetrical distribution and are prone to extreme value events, especially for region VI. It can be observed from Fig. 10 that the regional growth curves of RX1DAY for each homogeneous region are concave, opposite of convex, except for region I which is nearly a straight line, and this can also be expected from the L-moment ratio diagram of RX1DAY (Fig. 6). It can be seen from Fig. 6 that the regional average L-skewness and L-kurtosis are large in each homogeneous region and this means that it is prone to extreme value events in the Yangtze River basin for 1-day maximum precipitation. Figure 11 indicates that the regional growth curves of RX5DAY for homogeneous regions V and VI are evidently concave which can be identified from the L-moment ratio diagram of RX5DAY (Fig. 7) that the regional average L-skewness and L-kurtosis in regions V and VI are larger than other regions. And this means that, taking account of consecutive 5 days precipitation amount, it is prone to extreme value events only in regions V and VI, implying that regions V and VI are more likely to suffer from extreme precipitation events.

4.3 Spatial patterns of precipitation extremes of different return periods

Spatial patterns of the precipitation extremes can serve as one of the most important environmental indicator for integrated basin-scale water resources management, thus it is instrumental to quantify the spatial associations of precipitation extremes between sites and to map precipitation extremes with different return periods across the Yangtze River basin using the Kriging interpolation technique. To improve the accuracy of spatial interpolation of precipitation extremes, the stations which could not be grouped into any differentiated homogeneous region are not excluded in the dataset. It can be seen from the above that the performance of PE3, GEV, and GNO are good in fitting the regional distribution for R10, RX95P, RX1DAY, and RX5DAY. The discordant stations of R10, RX95P, RX1DAY, and RX5DAY are fitted by the three distributions, the goodness-of-fit tests by Kolmogorov–Smirnov method show that for all of the stations these three distributions are all significant at 95 % level (the results not shown here). Then the best-fitted distributions are selected by plotting position figure, here taking the station 114 which is discordant for the regional frequency analysis of RX1DAY as an example, the plotting position figure of station 114 is illustrated in Fig. 12. It can be seen from Fig. 12 that the GEV fits well the RX1DAY. And then the best-fitted distributions for all of the discordant stations are selected and the results are shown in Table 3.

Fig. 12
figure 12

Fitted probability functions for RX1DAY at station 114

Table 3 The best fitted distributions for all of the discordant stations of each extreme daily precipitation index

Then the spatial patterns of annual extreme daily precipitation index of R10, RX95P, RX1DAY, and RX5DAY with return period of 20 years are illustrated in Fig. 13. It can be seen from Fig. 13 that precipitation amount increases gradually from the upper to the lower Yangtze River basin. Meanwhile it can also be seen that there are two regions with the highest precipitation extremes in the eastern part of the Yangtze River basin and the regions covered by the stations 21, 24, and 25 (Fig. 1), specifically the regions east of the 107° E. However, there is also another region with higher precipitation extremes in the central parts of the basin, specifically the region between 100° E–107° E, and it is particularly the case for R95P, RX1DAY, and RX5DAY (Fig. 13b–d). Figure 1 shows that the locations of stations 21, 24, and 25 are in the boundary of a basin, which is also called as the Szechwan Basin. The elevation at the southeastern part of the Szechwan Basin is relatively lower, and the water vapor of southeast monsoon is conducive to input to the Szechwan Basin while the water vapor is blocked in the other boundary of the basin, causing the convection of water vapor. Then the precipitation in the boundary of the Szechwan Basin will be more than other regions in the vicinity of the precipitation center. It can be seen from Figs. 13 and 1 that the precipitation in the boundary of the Szechwan Basin, except the southeastern part, is more than the ambient regions. And there is an obvious precipitation center at the west part of the Szechwan Basin. Meanwhile it can be seen from Fig. 13a that precipitation in the center part of the Yangtze River basin is large in amount and this is due to the fact that stations 21, 24, and 25 with significant trends of R10 are not included in the spatial interpolation analysis.

Fig. 13
figure 13

Spatial patterns of annual extreme daily precipitation index with return period of 20 years: a for R10; b for R95P; c for RX1DAY, and d for RX5DAY

The result that higher precipitation amount of precipitation events with return periods of 20 years is identified in the middle and southeastern parts of the Yangtze River basin, or mainly in the lower Yangtze River basin is consistent with our previous studies (e.g., Zhang et al. 2008). Large-scale atmospheric circulation analysis indicates decreasing strength of East Asian summer monsoon during 1975–2005 as compared to that during 1961–1974 and increasing geopotential height in the north China, South China Sea, and west Pacific regions, all of which combine to negatively impact the northward propagation of the vapor flux. This circulation pattern will be beneficial for the longer stay of the Meiyu front in the Yangtze River basin, leading to more precipitation in the middle and lower basin in summer months (Zhang et al. 2008). Larger precipitation amount of precipitation events with return periods of 20 years indicate higher risk of floods and droughts. Moreover, Zhang et al. (2008) also demonstrated that the significant increasing summer precipitation intensity and changing frequency in the rain/no rain days in the middle and lower Yangtze River basin have potential to result in higher occurrence probability of flood and drought hazards in the region. Therefore, under the influences of changing climate, the risk of droughts and floods will be significantly amplified.

5 Conclusions

Regional frequency analysis based on extreme daily precipitation index has scientific and practical value in the context of basin-scale water resource and flood risk management. This paper presents a regional frequency analysis of precipitation extremes and characterization of the spatial pattern of rainfall extremes variations in the Yangtze River basin using the well-known index-flood L-moments approach together with some advanced statistical tests and spatial analysis methods.

The Yangtze River basin can be categorized into six homogeneous regions in terms of extreme daily precipitation index. For each homogeneous region of R10, R95P, RX1DAY, and RX5DAY, the regional distribution of each extreme daily precipitation index was fitted by the general normal, Pearson type III, general extreme-value, generalized logistic, and generalized Pareto distribution and the goodness-of-fit test results indicated that PE3, GEV, and GNO performed well in fitting regional distributions.

Additionally, this study also presents the regional growth curves of R10, R95P, RX1DAY, and RX5DAY for each homogeneous region with the 99 % error bands. The results indicate that the quantile estimates are reliable enough and can be used when return periods are less than 100 years. Furthermore, analysis results of the shape of the regional growth curves and the regional average L-skewness and L-kurtosis of R10, R95P, RX1DAY, and RX5DAY for each homogeneous region indicate that, for R95P, RX1DAY, and RX5DAY, the regional average L-skewness and L-kurtosis in regions V and VI are larger than other regions, implying that extreme precipitation events are highly probable to occur in regions V and VI, and hence higher risk of floods and droughts in regions V and VI.

Spatial patterns of annual extreme daily precipitation index of R10, RX95P, RX1DAY, and RX5DAY with return period of 20 years indicate that precipitation amount increases gradually from the upper to the lower Yangtze River basin. Moreover, there are two precipitation centers with larger precipitation amount in the eastern part of the Yangtze River basin and the western part of the Szechwan Basin boundary. As the Yangtze River basin is dominated by the southeast monsoon, it is obvious that the precipitation gradually decreases from southeast to northwest, forming an obvious precipitation center with higher precipitation amount. The precipitation center with higher precipitation amount in the western part of the Szechwan Basin boundary should be attributed to the effects of topography because the southeast monsoon is blocked in the boundary of the basin, causing the convection of water vapor. Furthermore, decreasing strength of East Asian summer monsoon during 1975–2005 as compared to that during 1961–1974 and increasing geopotential height in the north China, South China Sea, and west Pacific regions combine to negatively impact the northward propagation of the vapor flux, and which should contribute much to the higher precipitation amount in the middle and lower Yangtze River basin (Zhang et al. 2008).