Non-Stationary Annual Maximum Flood Frequency Analysis Using the Norming Constants Method to Consider Non-Stationarity in the Annual Daily Flow Series

Xiong, Lihua; Du, Tao; Xu, Chong-Yu; Guo, Shenglian; Jiang, Cong; Gippel, Christopher J.

doi:10.1007/s11269-015-1019-6

Non-Stationary Annual Maximum Flood Frequency Analysis Using the Norming Constants Method to Consider Non-Stationarity in the Annual Daily Flow Series

Published: 29 April 2015

Volume 29, pages 3615–3633, (2015)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Water Resources Management Aims and scope Submit manuscript

Non-Stationary Annual Maximum Flood Frequency Analysis Using the Norming Constants Method to Consider Non-Stationarity in the Annual Daily Flow Series

Download PDF

Lihua Xiong¹,
Tao Du¹,
Chong-Yu Xu^1,2,
Shenglian Guo^1,3,
Cong Jiang¹ &
…
Christopher J. Gippel⁴

1193 Accesses
42 Citations
Explore all metrics

Abstract

Flood frequency analysis is concerned with fitting a probability distribution to observed data to make predictions about the occurrence of floods in the future. Under conditions of climate change, or other changes to the water cycle that impact flood runoff, the flood series is likely to exhibit non-stationarity, in which case the return period of a flood event of a certain magnitude would change over time. In non-stationary flood frequency analysis, it is customary to examine only the non-stationarity of annual maximum flood data. We developed a way of considering the effect of non-stationarity in the annual daily flow series on the non-stationarity in the annual maximum flood series, which we termed the norming constants method (NCM) of non-stationary flood frequency analysis (FFA). After developing and explaining a framework for application of the method, we tested it using data from the Wei River, China. After detecting significant non-stationarity in both the annual maximum daily flood series and the annual daily flow series, application of the method revealed superior model performance compared to modelling the annual maximum daily flood series under the assumption of stationarity, and the result was further improved if explanatory climatic variables were considered. We conclude that the NCM of non-stationary FFA has potential for widespread application due to the now generally accepted weakness of the assumption of stationarity of flood time series.

A comparison of three approaches to non-stationary flood frequency analysis

Article Open access 24 August 2017

Identification of the Non-stationarity of Floods: Changing Patterns, Causes, and Implications

Article 21 November 2018

Comparison of Two Nonstationary Flood Frequency Analysis Methods within the Context of the Variable Regime in the Representative Polish Rivers

Article Open access 01 February 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Flood frequency analysis (FFA) is concerned with determining the probability of occurrence of extreme runoff events in rivers, and is open to a number of approaches (e.g. Vogel et al. 1993; Katz et al. 2002; Kwon et al. 2008; Gilroy and McCuen 2012; Katz 2012; Li et al. 2015a). FFA is conventionally conducted under the assumptions of: (i) independence of the flood series, which means that the probability distribution parameters to be estimated come from independent and identically distributed observations, and (ii) stationarity of the flood series, which means that the environmental factors that generate or modify floods, such as climate and land cover, are the same in the past, present and future (Khaliq et al. 2006; Gilroy and McCuen 2012; Li et al. 2015b). Under the conditions of climate change, land use change and river regulation, acting individually or together, the assumption of stationarity is suspect (Katz et al. 2002; Xiong and Guo 2004; Milly et al. 2005, 2008; Gilroy and McCuen 2012; Salas and Obeysekera 2014; Vasiliades et al. 2015), and design flood predictions made on this basis would be questionable. Climate change and human activities impacting flood runoff is a global phenomenon. There is a need then for development of effective non-stationary FFA methods.

Non-stationary FFA is most commonly approached using the method of time-varying moments (Strupczewski et al. 2001a, b; Strupczewski and Kaczmarek 2001; Coles 2001; Katz et al. 2002; Khaliq et al. 2006; Leclerc and Ouarda 2007; Villarini et al. 2009b, 2010; Gilroy and McCuen 2012; Salas and Obeysekera 2014). Strupczewski et al. (2001a) introduced a linear or quadratic time trend into the first two moments (mean and standard deviation) of the observed Vistula River flood series over the period of 1921–1990 with six candidate probability distributions. In this case, non-stationarity was expressed based on the relationships between the distribution parameters and the time-dependent moments, the maximum likelihood method was adopted for parameter estimation, and the Akaike Information Criterion (AIC) optimizing criterion (Akaike 1974) used for model selection. In other studies by these authors, trends in 39 annual maximum flow series in Polish Rivers were investigated using the non-stationary at-site FFA method of weighted least squares (Strupczewski and Kaczmarek 2001), and using the maximum likelihood method (Strupczewski et al. 2001b). Katz et al. (2002) fitted the Generalized Pareto distribution by maximum likelihood estimation to the US annual damage from flood series with a linear trend in the log-transformed scale parameter. A detailed review of the time-varying moments approach applied to non-stationary FFA was provided by Khaliq et al. (2006). Leclerc and Ouarda (2007) presented a method to perform regional FFA at ungauged sites when the assumption of stationarity was not valid. A non-stationary approach using the Generalized Additive Models for Location, Scale and Shape parameters (GAMLSS) (Rigby and Stasinopoulos 2005) was applied to seasonal precipitation and temperature data from Rome (Villarini et al. 2010) and peak discharge data from the Little Sugar Creek in Charlotte (Villarini et al. 2009b). Gilroy and McCuen (2012) developed a non-stationary FFA method that accounts for climate change and urbanization influences, and applied it to the Little Patuxent River in Guilford. Salas and Obeysekera (2014) suggested placing the concepts of return period and risk as they apply in the stationary world into a non-stationary framework by extending the geometric distribution to allow for changing exceeding probabilities over time. Although the literature on non-stationary FFA contains variety in certain aspects of the methodology, one common factor is exclusive reliance on the annual maximum flood series. The innovation presented in this paper is to also make use of the annual daily flow series for non-stationary FFA.

In seasonal rivers, generation of the maximum flood in a year is very closely related to hydrological processes during the flood season in that year, so non-stationarity in the annual daily flow series is likely to have some impact on non-stationarity in the annual maximum daily flood (AMDF) series. The central theme of this paper is development of an approach for incorporating non-stationarity information in the annual daily flow series in the non-stationary FFA of the AMDF series. We did this using the norming constants method (NCM), which is based on the Fisher-Tippett theory (Fisher and Tippett 1928). The core theory of NCM is that the statistical parameters of the annual maximum flood distribution for a year can be mathematically derived from the daily flow series within the year in terms of the maximum domain of attraction of the annual maximum flood distribution. We elaborate our NCM-based approach in this paper and illustrate its practical application using a case study of the Wei River, China. FFA often makes use of maximum instantaneous flow data, which is particularly important in small and medium size rivers, where the instantaneous peak daily flow can be much higher than the mean daily flow. Due to limitations on data availability, we used mean daily flow data, but the NCM applies equally well to instantaneous peak flow series.

This paper is organized as follows. First the NCM methodology is described, along with a description of methods for calculation of the statistical characteristics of the annual daily flow series, and covariate analysis to establish relationships between statistical parameters of the annual maximum flood distribution and meteorological factors. Then, relevant information about the Wei River basin is provided, and the data sets used in this case study are described. The results and discussion of the case study follow, and finally we make conclusions about application of NCM to the problem of non-stationary FFA.

2 Proposed Norming Constants Method (NCM) of Non-Stationary FFA

We suggest a four-step framework, termed the norming constants method (NCM) of non-stationary FFA, for incorporating non-stationarity information in the annual daily flow series in the non-stationary FFA of the AMDF series:

1.
The AMDF series is tested for statistically significant non-stationarity, which is the basis for then proceeding with non-stationary FFA.
2.
Statistical characterization and non-stationarity detection is undertaken on the series of the mean and variance of the annual daily flow series.
3.
If non-stationarity is detected in both the AMDF and the annual daily flow series in Steps 1 and 2, then non-stationary FFA based on the NCM is carried out.
4.
If non-stationarity is present, meteorological covariate analysis is carried out to establish relationships between the statistical parameters of the annual maximum flood distribution and the meteorological factors for the NCM. All the models are compared by the AIC criterion and assessed using residual plots and statistics.

2.1 Detection of Non-Stationarity in Hydrological Time Series

Incorporation of non-stationarity information in the annual daily flow series does not depend on the form of non-stationarity, whether it be monotonic, non-monotonic, linear, non-linear or step-change trend. A number of tests are available for the purpose of testing for non-stationarity in hydrological time series (Chiew and McMahon 1993; Xu et al. 2004) and one or more of these might be appropriate, depending on the nature of the data.

2.2 Probability Distributions of the Annual Daily Flow Series

The annual daily flow series refers to the hydrograph of all mean daily flows for a year, and is denoted by X _t = (x _d _,t), d = 1,2,...,v, where x _d,t represents the flow rate of day d in year t and v is the number of days in the year, referred to as the block length. When the annual daily flow series is serially independent, the block length is 365 for non-leap years or 366 for leap years. In essence, frequency analysis of the annual daily flow series fits the flow-duration curve based on X _t with a probability distribution F ^X_t (x). With respect to the distribution of the annual maximum daily flood series, the distribution F ^X_t (x) of X _t is normally referred to as the parent distribution (PD). The daily flows are often assumed to follow a Lognormal or Gamma distribution (Fennessey and Vogel 1990; LeBoutillier and Waylen 1993; Krasovskaia et al. 2006; Gottschalk et al. 2013a, b). For the annual daily flow series of different years, although the distribution function type is assumed to be the same, the statistical parameters might be different due to the non-stationarity of the annual daily flow series.

When F ^X_t (x) is a Lognormal distribution, its probability density function and the mean and variance of X _t are (Johnson et al. 1994; Stasinopoulos et al. 2008):

$$ {f}_t^X(x)=\frac{1}{{\left(2\pi \right)}^{1/2}{\sigma}_tx} \exp \left\{-\frac{{\left[ ln(x)-{\mu}_t\right]}^2}{2{\sigma}_t^2}\right\},x>0,{\mu}_t\kern.3em \in \Re, {\sigma}_t>0 $$

(1)

$$ E\left({X}_t\right)={\omega}^{1/2} \exp \left({\mu}_t\right) $$

(2)

$$ Var\left({X}_t\right)=\omega \left(\omega -1\right) \exp \left(2{\mu}_t\right) $$

(3)

where μ _t and σ _t are the mean and standard deviation of the log-transformed annual daily flow series X _t for the year t, and ω = exp(σ ²_t ).

When F ^X_t (x) is a Gamma distribution, its probability density function and the mean and variance of X _t are:

$$ {f}_t^X(x)=\frac{1}{\beta_t^{\gamma_t}\varGamma \left({\gamma}_t\right)}{x}^{\gamma_t-1} \exp \left(-\frac{x}{\beta_t}\right),x>0,{\gamma}_t>0,{\beta}_t>0 $$

(4)

$$ E\left({X}_t\right)={\gamma}_t{\beta}_t $$

(5)

$$ Var\left({X}_t\right)={\gamma}_t{\beta}_t^2 $$

(6)

where γ _t and β _t are the shape and scale parameters.

As the mean E(X _t) and variance Var(X _t) of X _t are derived from the parameters of the PD by Eqs. (2)–(3) or (5)–(6), the character of the PD should be determined before investigating the non-stationarity in the series of E(X _t) and Var(X _t) of the annual daily flow series.

We suggest adoption of both the maximum likelihood and L-moments (Hosking 1990) techniques to estimate the statistical parameters (μ _t, σ _t) and (γ _t, β _t) of the PD for each year. This results in four optional scenarios for the annual daily flow series: Lognormal PD with maximum likelihood method, Lognormal PD with L-moments method, Gamma PD with maximum likelihood method, and Gamma PD with L-moments method. In order to determine the optimal PD and parameter estimation method, these four scenarios are considered for fitting the annual daily flow series of each year, and the Chi-squared test (Greenwood 1996), the Nash-Sutcliffe efficiency between modelled and empirical probabilities (NSE _pp), and the Nash-Sutcliffe efficiency between modelled and empirical daily flows (NSE _qq) are calculated to assess the goodness-of-fit of each scenario.

2.3 Norming Constants Method (NCM)

The maximum statistic of a sample of random variables is also a random variable and has its own probability distribution. For the annual daily flow series X _t = (x _d,t), d = 1, 2, ⋯, v of the year t, the corresponding annual maximum daily flood statistic Z ^v_t of the year is derived as:

$$ {Z}_t^v= \max \left({X}_t\right)=\underset{1\le d\le v}{ \max}\left({x}_{d,t}\right) $$

(7)

The extreme value distribution of Z ^v_t approaches one of three limiting forms, i.e. the Fréchet, Weibull or Gumbel distribution, as the sample size increases (Fisher and Tippett 1928). These three extreme value distributions can be further combined into a single distribution function, the well-known Generalized Extreme Value (GEV) distribution. The Gumbel distribution is more widely employed as the probability distribution of the annual maximum flood event than the other two extreme value distributions (Kwon et al. 2008; Villarini et al. 2009a, b; Mujere 2011; Gottschalk et al. 2013a). The cumulative distribution function of the Gumbel is given by:

$$ {F}_t^{Z^v}\left(\left.{z}_t^v\right|{d}_t^v,{c}_t^v\right)= \exp \left\{- \exp \left[-\frac{\left({z}_t^v-{d}_t^v\right)}{c_t^v}\right]\right\} $$

(8)

where d ^v_t and c ^v_t are the norming constants of the year t satisfying the maximum domain of attraction of the Gumbel distribution. From a theoretical perspective, the corresponding norming constants d ^v_t and c ^v_t can be derived from the statistical parameters of the parent distribution F ^X_t (x) of the annual daily flow series of year t (Cramér 1999; Embrechts et al. 1997).

When F ^X_t (x) is a Lognormal distribution, the expressions of the norming constants d ^v_t and c ^v_t by the Lognormal parameters (μ _t and σ _t) can be derived as:

$$ {d}_t^v= \exp \left\{{\mu}_t+{\sigma}_t\left[{\left(2 \ln v\right)}^{1/2}-\frac{ \ln \left(4\pi \right)+ \ln \left( \ln v\right)}{2{\left(2 \ln v\right)}^{1/2}}\right]\right\} $$

(9)

$$ {c}_t^v={\sigma}_t{\left(2 \ln v\right)}^{-1/2}{d}_t^v $$

(10)

When F ^X_t (x) is a Gamma distribution, the expressions of the norming constants d ^v_t and c ^v_t by the Gamma parameters (γ _t and β _t) can be derived as:

$$ {d}_t^v={\beta}_t\left[ \ln v+\left({\gamma}_t-1\right) \ln \left( \ln v\right)- \ln \varGamma \left({\gamma}_t\right)\right] $$

(11)

$$ {c}_t^v={\beta}_t $$

(12)

After either the Lognormal or the Gamma distribution is selected as the PD that best fits the daily flows of year t, and the corresponding PD parameters are estimated, the Gumbel norming constants d ^v_t and c ^v_t for year t are calculated using Eqs. (9)–(10) or (11)–(12). The non-stationary FFA can then be conducted using the norming constants for each year.

An observed annual daily flow series would not normally be expected to satisfy the assumption of independence. We followed other authors who have relaxed the requirement for independence when fitting the daily flow series to probability distributions (e.g. Fennessey and Vogel 1990; LeBoutillier and Waylen 1993; Krasovskaia et al. 2006; Gottschalk et al. 2013a). In this situation, the block length v should be a value less than 365 (or 366) days. When the NCM is applied in practice, v should be calibrated to take an effective value reflecting the real independence among the annual daily flow series, denoted here by v _eff. The optimal value of v _eff can be derived in terms of the AIC criterion. When v _eff is determined, it replaces v in Eqs. (9)–(12).

2.4 Meteorological Covariate Analysis

The main mechanism for flood runoff generation in most places is infiltration-excess runoff generation. In this case, flood events are closely associated with heavy rainfall events, which in turn can be related to other climatic variables. Thus, climate variables often at least partially explain non-stationarity in a flood series. Including climatic factors as explanatory variables in non-stationary FFA might have two benefits. First, it could help to provide physical meaning to the non-stationary design flood prediction, and second, the results could be interpreted in the context of runoff changes predicted by application of global circulation models and downscaling techniques.

In the proposed NCM, the estimated PD parameters are firstly expressed as linear functions of covariates, and these functions are then substituted into Eqs. (9)–(10) or (11)–(12) for calculating the time-varying norming constants d ^v_t and c ^v_t and conducting non-stationary FFA.

2.5 Model Selection and Assessment Criteria

The established models are compared using AIC criterion, where the maximum likelihood value is penalized by the effective degrees of freedom (Akaike 1974; Rigby and Stasinopoulos 2005). While a smaller AIC value indicates better model performance, which identifies the optimal model, there is no indication of how well the selected model captures the variability of the non-stationary flood series. Examination of the normalized quantile residuals (subsequently referred to here as residuals) (Dunn and Smyth 1996) is an effective way for testing the efficiency of the selected model. Thus, we suggest use of a combination of qualitative and quantitative assessment criteria to assess the overall performance of the selected model. The qualitative assessment involves visual inspection of the diagnostic residual plots, specifically, residual versus observed year plot, ACF (Autocorrelation Function) plot (Guang-Te and Singh 1994), normal Q-Q plot, or the more effective worm plot (Buuren and Fredriks 2001; Stasinopoulos et al. 2008). The worm plot is a de-trended Q-Q plot with the difference between the empirical and theoretical quantile residuals on the vertical axis and the theoretical quantile residuals on the horizontal axis. Quantitative assessment relies on statistics of the Filliben correlation coefficient (Filliben 1975) and KS test (Massey 1951) of the residuals. Centile curve plots can indicate how well the selected model captures the variability of the observed flood series, and to assess how the observed flood events fall in relation to each centile curve and the corresponding theoretical probability.

3 Study Area and Data

3.1 General Characteristics of the Study Area

The Wei River, the largest tributary of the Yellow River, originates from the Niaoshu Mountain at an elevation of 3 485 m above sea level in the Weiyuan county of Gansu province. Located between the coordinates 33°40′-37°26′N and 103°57′-110°27′E in the southeastern part of the loess plateau, the river has a length of 818 km and a drainage area of 134 800 km² (Fig. 1). After flowing west to east through the three provinces (autonomous region) of Gansu, Ningxia and Shaanxi the Wei River joins the Yellow River at Tongguan in Shaanxi province. Two large tributaries, the Jing River and the Beiluo River, join the Wei River in the middle and lower reaches of the basin respectively. The Wei River is known regionally as the ‘Mother River’ of the Guanzhong Plain of the southern part of the loess plateau because of its key role in the economic development of western China (Song et al. 2007; Huang et al. 2014).

The Wei River basin is characterized by semi-arid and sub-humid continental monsoon climate. Mean annual precipitation of the basin is about 570 mm but there is a strong decreasing gradient from south to north. The southern region has a sub-humid climate with mean annual precipitation of about 800–1000 mm, whereas the northern region has a semi-arid climate with mean annual precipitation of about 400–700 mm. The annual mean temperature over the entire basin is about 6–14 °C. The range in the annual potential evapotranspiration is about 660–1600 mm, and the mean annual actual evapotranspiration from the land surface is about 500 mm (Zuo et al. 2014).

The mean annual natural discharge of the Wei River is about 10 × 10⁹ m³, contributing approximately 17 % of the discharge of Yellow River. The Wei River basin is one of the most important industrial and agricultural production zones in China, which has led to large scale exploitation of the surface and groundwater resources. The annual discharge of the Wei River at Linjiacun station during the decade 1991–2000 was 53.9 % less than that during the preceding decade, while at Huaxian station it was 50.3 % less (McVicar et al. 2007; Song et al. 2007).

3.2 Data

The observed mean daily flow series from Huaxian gauging station over the period 1954–2009, provided by the Hydrology Bureau of the Yellow River Conservancy Commission, was the source of the AMDF series. Huaxian hydrological station, located at 109°46′E, 34°35′N, is about 70 km upstream of the junction of the Wei and Yellow rivers (Fig. 1). The catchment area upstream of this station is 106 500 km², or about 80 % of the total basin area (Fig. 1).

Meteorological data from 22 stations (Fig. 1) for the period 1954–2009 was provided by the National Climate Center of the China Meteorological Administration (source: http://cdc.cma.gov.cn). The data included five observed daily variables: daily precipitation (rain, mm), daily maximum temperature (TMAX, °C), daily minimum temperature (TMIN, °C), daily mean temperature (TMEAN, °C) and daily mean relative humidity (H, %). Time series of the areal mean values of these daily climate variables for the basin above the Huaxian station were generated using the Thiessen polygon method. From these data series, eight statistics were calculated: annual maximum daily precipitation (rain _1 max), annual maximum 3-day precipitation (rain _3 max), annual maximum 7-day precipitation (rain _7 max), annual total precipitation (rain _year), annual mean of daily maximum temperature (TMAX _mean), annual mean of daily minimum temperature (TMIN _mean), annual mean of daily mean temperature (TMEAN _mean) and annual mean of daily mean relative humidity (H _mean).

4 Results of Case Study

4.1 Step 1. Temporal Trends in the AMDF Series

The lag-1 ACF value of the AMDF series was not significantly different from 0 at α = 0.05, meaning that the AMDF series was serially independent. We used the Mann-Kendall (MK) test (Mann 1945; Kendall 1975; Yue and Wang 2004; Li et al. 2014) to test for the presence of non-stationarity in the flood series. In non-stationary analysis of extreme value series, the focus is commonly on non-stationarity in the mean. However, non-stationarity in the variance of the distribution of the extreme value series may also have significant impacts on non-stationary modelling (Katz and Brown 1992; Villarini et al. 2009a, b). Vinnikov and Robock (2002) provided a technique for investigating temporal trends in the variance and higher moments of the distribution of extreme value series. Their method constructs time series of the squares, cubes and any higher powers of the residuals (anomalies) after the trend in the expected value is subtracted from the observed time series. Trend analysis of these created series can then be used to examine the variability of the series over time. The MK test detected significant decreasing trends in both the mean and the variance of the distribution of the AMDF series (Table 1).

Table 1 Mann-Kendall (MK) test results of the moments considered in this study. The critical value of the MK statistic is Z _{1 − α/2} = 1.960, at α = 0.05; all trends were decreasing

Full size table

4.2 Step 2. Temporal Trends in the E(X _t) and Var(X _t) Series

The PD and corresponding parameter estimation method were applied to the annual daily flow series of different years from Huaxian station under the four scenarios described earlier: Lognormal PD with maximum likelihood method, Lognormal PD with L-moments method, Gamma PD with maximum likelihood method, and Gamma PD with L-moments method. Examination of the assumption of independence for each of the observed annual daily flow series unsurprisingly indicated that most of the annual daily flow series did not satisfy the independence assumption. In this situation we relaxed the requirement for independence, as explained previously.

When taking the Lognormal distribution as the PD, no rejection at the significance level of α = 0.05 using the Chi-squared test was detected for both the maximum likelihood and the L-moments methods, which means that the scenarios of Lognormal PD with maximum likelihood and L-moments techniques both satisfactorily fitted the 56 observed annual daily flow series. For the Gamma distribution, there were rejections in 20 and 9 out of 56 years for the maximum likelihood and the L-moments methods respectively. Thus, the Lognormal performed better than the Gamma distribution as the PD. Boxplots of the NSE _pp and NSE _qq for the period of 1954–2009 (Fig. 2) indicated that the L-moments was superior to the maximum likelihood method, with almost all the NSE _pp and NSE _qq higher than 0.9. This data exploration led us to select the Lognormal distribution as the PD of the annual daily flow series, and employ the L-moments technique to estimate the parameters of the PD. Examples of the frequency analysis of annual daily flow series with corresponding NSE _pp and NSE _qq under the four scenarios (Fig. 3) illustrate the differences in the distributions and parameter estimation methods. Finally, the E(X _t) and Var(X _t) series for the period of 1954–2009, calculated by Eqs. (2)–(3), both had significant decreasing trends indicated by the MK test (Table 1).

4.3 Step 3. Non-Stationary Flood Series Modelling with NCM

As significant non-stationarity was detected in both the annual daily flow series and the AMDF series, we could progress to investigating the impact of the non-stationarity in the annual daily flow series on the non-stationarity in the AMDF series using NCM. The estimated shape parameter κ of the GEV distribution fitted to the observed AMDF series from Huaxian station was close to zero (0.039), suggesting that the Gumbel distribution was more appropriate (Stedinger et al. 1993; Gilroy and McCuen 2012). For any year t, the Gumbel norming constants d ^v_t and c ^v_t can be theoretically derived by Eqs. (9)–(10) given the corresponding fitted PD parameters (μ _t and σ _t) and the block length v. The block length for an annual maximum flood event is ostensibly 1 year, whereby v = 365 (or 366). As the annual daily flow series in most years was not serially independent, v should take a value lower than this, which might explain the sporadically poor model fits by the Lognormal and Gamma distributions, expressed by outliers in the boxplots of both NSE _pp and NSE _qq, even for the optimal scenario of Lognormal PD with L-moments parameter estimation method (Fig. 2). For the observed AMDF series, we derived an optimal value of v _eff = 343 days in terms of the AIC criterion (Fig. 4a). Time-varying norming constants $ {d}_t^{v_{eff}} $ and $ {c}_t^{v_{eff}} $ for the period of 1954–2009 were then calculated by Eqs. (9)–(10) with the estimated μ _t and σ _t series and the optimal v _eff. The result indicated decreasing trends for both $ {d}_t^{v_{eff}} $ and $ {c}_t^{v_{eff}} $ (Fig. 4b).

In application of non-stationary FFA by NCM, the residuals should theoretically be approximately independent and normally distributed if the fit of the model is correct. The independence and normality of the residuals indicate that the selected model describes all the systematic information and the remaining part (residuals) are independent and normally distributed random noise. For the Wei River data, the model residuals were found to be randomly distributed and serially independent (Fig. 5a, b). Most of the points in the normal Q-Q plot (not illustrated in this paper) were evenly distributed on or around the 1:1 line—a pattern also revealed by the worm plot (Fig. 5c). Almost all of the worm points were within the 95 % confidence intervals, indicating good consistency between the selected model and the observed flood data. The statistics of the model residuals (Table 2) indicated no significant departure from normality of the residuals. The centile curves plot (Fig. 5d) showed most of the points within the region between the 5 and 95 % centile curves and the decreasing trend was also captured. The empirical frequency was within 5 % of the theoretical probability for all centile curves, except for the 50th percentile, which had a difference of 7.1 % (Table 3). The above qualitative and quantitative assessment criteria together suggested that: (i) the model residuals were approximately independent and identically distributed random noise, and (ii) the NCM provided a reasonably good fit to the non-stationary AMDF series by considering the relationship of the non-stationarities in the annual maximum floods to the non-stationarities in the annual daily flow series.

Table 2 Summary of the AIC values and statistics of the residuals for the NCM, the stationary model and the optimal NCM with meteorological covariates (NCM-MC)

Full size table

Table 3 Comparison between the empirical frequency of the observed flood events falling below each centile curve and the corresponding theoretical probability for the NCM, the stationary model and the optimal NCM with meteorological covariates (NCM-MC)

Full size table

As a comparison with the NCM, the conventional stationary FFA model was also applied to the AMDF series by fitting the Gumbel distribution and estimating the invariable norming constants d ^v and c ^v by the L-moments technique over the entire AMDF series. A decreasing trend of the residuals was indicated (Fig. 6a), and a serial correlation could not be rejected (Fig. 6b). The worm plot indicated an acceptable result, with almost all the points falling within the 95 % confidence intervals (Fig. 6c), but the stationary model had a larger AIC value than the NCM, and the normality of the residuals was just unacceptable in terms of the Filliben correlation coefficient (Table 2). Furthermore, all the centile curves were horizontal (Fig. 6d), which does not describe the variability of the AMDF series, and there were discrepancies of 5 % or more for three of the centile curves (Table 3).

4.4 Step 4. Meteorological Covariate Analysis for the NCM

The estimated μ _t and σ _t sequences for the period of 1954–2009 were expressed as multivariate linear functions of the meteorological covariates. The fitted expressions were then substituted into Eqs. (9)–(10) for calculating the time-varying norming constants (with v _eff = 343) and the covariate combinations for both PD parameter sequences were optimized in terms of the AIC criterion. This produced significant covariates for μ _t and σ _t, the optimal AIC value and the statistics of the residuals (Table 2, Fig. 7). The optimal fitted expressions of μ _t and σ _t by meteorological covariates were:

$$ {\mu}_t=3.5751+0.0819{H}_{mean}-0.4496 TMEA{N}_{mean} $$

(13)

$$ {\sigma}_t=2.6093-0.0376{H}_{mean}+0.0056rai{n}_{3 \max }+0.0012rai{n}_{year} $$

(14)

For the NCM with meteorological covariates (NCM-MC) a slightly smaller AIC value was observed compared to that of the NCM without considering the relation of the PD parameters (μ _t and σ _t) to meteorological factors (Table 2). The residuals were randomly distributed and serially independent, and without outliers (Fig. 7a, b). The normal Q-Q plot (not illustrated in this paper) had almost all the points distributed on the 1:1 line—a pattern also revealed by the worm plot (Fig. 7c), where all the points were located within the 95 % confidence intervals and aligned on or around the fitted cubic polynomial line. The statistics of both the Filliben correlation coefficient and the KS test of the residuals (Table 2) indicated that the residuals were normally distributed random noise. The centile curves plot (Fig. 7d) showed the majority of the points falling within the 5–95 % region, and the centiles captured the decreasing trend of the flood series. The empirical frequency was within 5 % of the theoretical probability for all centile curves (Table 3).

4.5 Implication of NCM for Design Flood Estimation

The NCM with meteorological covariates (NCM-MC) takes full advantage of the non-stationarity in the annual daily flow series and closely related climate factors. For the Wei River, this method produced the most reliable estimate of the magnitude of a design flood under non-stationary conditions. Flood magnitudes with a 0.01 annual exceedance probability (corresponding to 100-year flood events under stationary conditions) were estimated for each observed year for the NCM-MC. The 0.01-probability flood quantile of the stationary FFA model was also estimated and compared with that from the NCM-MC. The 95 % confidence intervals of the 0.01-probability flood quantiles were evaluated with the parametric bootstrap technique (Kyselý 2010; Tramblay et al. 2013) for both the stationary and the NCM-MC models.

For the stationary FFA model, the estimated 100-year flood event was a constant magnitude of 7 300 m³/s (Fig. 8). For the NCM-MC, a great range of variability was apparent in the estimated design flood (Fig. 8). The flood magnitudes under the 0.01-probability ranged from a minimum of 2 000 m³/s (in 1997) to a maximum of 14 100 m³/s (in 1983). In this case, the assumption of stationarity would seem to over-estimate design flood magnitudes for the future. In the case of increasing trend in the flood series (e.g. Robson 2002), design floods could be under-estimated, potentially increasing risk of flood damage or dam failure into the future. On the other hand, where the trend is decreasing (e.g. Ishak et al. 2013, and our Wei River case study) design floods could be over-estimated, possibly imposing unnecessarily high costs on society for future flood protection. The NCM proposed here for non-stationary FFA has application to both of these cases.

5 Conclusions

The fundamental assumption of stationarity for flood frequency analysis is suspect under the conditions of climate change and large-scale anthropogenic activities that affect flood runoff. Recognizing this problem, we developed a solution termed the norming constants method (NCM) of non-stationary flood frequency analysis (FFA). Our approach was to incorporate non-stationarity information in the annual daily flow series into the non-stationary FFA of the AMDF series.

A four-step framework is appropriate for applying the NCM of non-stationary FFA. The basis of applying the NCM is testing the presence of non-stationarity in the hydrological time series. As the time-varying statistical parameters of the AMDF distribution are mathematically derived from the annual daily flow series, the form of non-stationarity of the AMDF series is unimportant, and any suitable test for non-stationarity can be employed.

In applying the NCM, we suggest that both Lognormal and Gamma distributions be fitted to annual daily flow series, and both the maximum likelihood and L-moments techniques be used to estimate the statistical parameters of the distributions. Statistical tests will determine the most appropriate scenario for the annual daily flow series. Our suggested solution to the serial correlation apparent in most annual daily flow series is to calibrate the value of block length v in terms of the AIC criterion, to reflect the real independence within the annual daily flow series. Application of the suggested NCM to data from Huaxian hydrological station on the Wei River, China, found that by considering information concerning the non-stationarity in the annual daily flow series, the NCM performed better than a model based on the assumption of stationarity. Incorporating climate factors as explanatory variables further improved the model performance, and made the non-stationary FFA more physically meaningful. On this basis, estimation of design flood magnitude should be more reliable using the NCM approach than a method that assumes stationarity. While our test application of the NCM approach was limited to only one study site, the results were sufficiently promising to suggest that the method could have widespread global application in providing more realistic information for flood risk assessment.

References

Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716–723
Article Google Scholar
Buuren SV, Fredriks M (2001) Worm plot: a simple diagnostic device for modelling growth reference curves. Stat Med 20(8):1259–1277
Article Google Scholar
Chiew FHS, McMahon TA (1993) Detection of trend or change in annual flow of Australian rivers. Int J Climatol 13:643–653
Article Google Scholar
Coles S (2001) An introduction to statistical modeling of extreme values. Springer, London
Book Google Scholar
Cramér H (1999) Mathematical methods of statistics. Princeton University Press, Nineteenth printing, Princeton
Dunn PK, Smyth GK (1996) Randomized quantile residuals. J Comput Graph Stat 5(3):236–244
Google Scholar
Embrechts P, Klüppelberg C, Mikosch T (1997) Modelling extremal events for insurance and finance. Springer
Fennessey N, Vogel RM (1990) Regional flow-duration curves for ungauged sites in Massachusetts. J Water Res Pl-ASCE 116(4):530–549
Article Google Scholar
Filliben JJ (1975) The probability plot correlation coefficient test for normality. Technometrics 17(1):111–117
Article Google Scholar
Fisher RA, Tippett LHC (1928) Limiting forms of the frequency distribution of the largest or smallest members of a sample. Math Proc Camb 24(2):180–190
Article Google Scholar
Gilroy KL, McCuen RH (2012) A non-stationary flood frequency analysis method to adjust for future climate change and urbanization. J Hydrol 414:40–48
Article Google Scholar
Gottschalk L, Krasovskaia I, Yu KX, Leblois E, Xiong LH (2013a) Joint mapping of statistical streamflow descriptors. J Hydrol 478:15–28
Article Google Scholar
Gottschalk L, Yu KX, Leblois E, Xiong LH (2013b) Statistics of low flow: theoretical derivation of the distribution of minimum streamflow series. J Hydrol 481:204–219
Article Google Scholar
Greenwood PE (1996) A guide to chi-squared testing. John Wiley & Sons, New York
Google Scholar
Guang-Te W, Singh VP (1994) An autocorrelation function method for estimation of parameters of autoregressive models. Water Resour Manag 8(1):33–55
Article Google Scholar
Hosking JRM (1990) L-moments: analysis and estimation of distributions using linear combinations of order statistics. J R Stat Soc Ser B 52(1):105–124
Google Scholar
Huang SZ, Change JX, Huang Q, Chen YT (2014) Spatio-temporal changes and frequency analysis of drought in the Wei River, China. Water Resour Manag 28(10):3095–3110
Article Google Scholar
Ishak EH, Rahman A, Westra S, Sharma A, Kuczera G (2013) Evaluating the non-stationarity of Australian annual maximum flood. J Hydrol 494:134–145
Article Google Scholar
Johnson NL, Kotz S, Balakrishnan N (1994) Continuous univariate distributions. Vol. 1, Wiley series in probability and mathematical statistics: applied probability and statistics (2nd ed). New York
Katz RW (2012) Statistical methods for non-stationary extremes. In: AghaKouchak A, Easterling D, Hsu K, Schubert S, Sorooshian S (eds) Extremes in a changing climate. Springer, Dordrecht
Google Scholar
Katz RW, Brown BG (1992) Extreme events in a changing climate: variability is more important than averages. Clim Chang 21(3):289–302
Article Google Scholar
Katz RW, Parlang MB, Naveau P (2002) Statistics of extremes in hydrology. Adv Water Resour 25(8):1287–1304
Article Google Scholar
Kendall MG (1975) Rank correlation methods. Griffin, London
Google Scholar
Khaliq MN, Ouarda TBMJ, Ondo JC, Gachon P, Bobée B (2006) Frequency analysis of a sequence of dependent and/or non-stationary hydro-meteorological observations: a review. J Hydrol 329(3):534–552
Article Google Scholar
Krasovskaia I, Gottschalk L, Leblois E, Pacheco A (2006) Regionalization of flow duration curves. Clim Var Chang Hydrol Impacts 105–110
Kwon HH, Brown C, Lall U (2008) Climate informed flood frequency analysis and prediction in Montana using hierarchical Bayesian modeling. Geophys Res Lett 35(5):L05404. doi:10.1029/2007GL032220
Article Google Scholar
Kyselý J (2010) Coverage probability of bootstrap confidence intervals in heavy-tailed frequency models, with application to precipitation data. Theor Appl Climatol 101(3–4):345–361
Article Google Scholar
LeBoutillier DW, Waylen PR (1993) A stochastic model of flow duration curves. Water Resour Res 29(10):3535–3541. doi:10.1029/93WR01409
Article Google Scholar
Leclerc M, Ouarda TBMJ (2007) Non-stationary regional flood frequency analysis at ungauged sites. J Hydrol 343(3):254–265
Article Google Scholar
Li DF, Xie HT, Xiong LH (2014) Temporal change analysis based on data characteristics and nonparametric test. Water Resour Manag 28(1):227–240
Article Google Scholar
Li JZ, Liu XY, Chen FL (2015a) Evaluation of nonstationarity in annual maximum flood series and the associations with large-scale climate patterns and human activities. Water Resour Manag 29(5):1653–1668
Article Google Scholar
Li LC, Zhang LP, Xia J, Gippel CJ, Wang RC, Zeng SD (2015b) Implications of modelled climate and land cover changes on runoff in the middle route of the south to north water transfer project in China. Water Resour Manag. doi:10.1007/s11269-015-0957-3
Google Scholar
Mann HB (1945) Nonparametric tests against trend. Econometrica 13(3):245–259
Article Google Scholar
Massey FJ Jr (1951) The Kolmogorov-Smirnov test for goodness of fit. J Am Stat Assoc 46(253):68–78
Article Google Scholar
McVicar TR, Li L, Van Niel TG, Zhang L, Li R, Yang Q, Gao P (2007) Developing a decision support tool for China’s re-vegetation program: simulating regional impacts of afforestation on average annual streamflow in the Loess Plateau. For Ecol Manag 251(1):65–81
Article Google Scholar
Milly PCD, Dunne KA, Vecchia AV (2005) Global pattern of trends in streamflow and water availability in a changing climate. Nature 438(7066):347–350
Article Google Scholar
Milly PCD, Betancourt J, Falkenmark M, Hirsch RM, Kundzewicz ZW, Lettenmaier DP, Stouffer RJ (2008) Stationarity is dead: whiter water management? Science 319(5863):573–574
Article Google Scholar
Mujere N (2011) Flood frequency analysis using the Gumbel distribution. Int J Comput Sci Eng 3(7):2774–2778
Google Scholar
Rigby RA, Stasinopoulos DM (2005) Generalized additive models for location, scale and shape. J R Stat Soc Ser C Appl Stat 54(3):507–554
Article Google Scholar
Robson AJ (2002) Evidence for trends in UK flooding. Philos Trans R Soc A Math Phys Eng Sci 360(1796):1327–1343
Article Google Scholar
Salas JD, Obeysekera J (2014) Revisiting the concepts of return period and risk for non-stationary hydrologic extreme events. J Hydrol Eng 19:554–568
Article Google Scholar
Song J, Xu Z, Liu C, Li H (2007) Ecological and environmental instream flow requirements for the Wei River-the largest tributary of the Yellow River. Hydrol Process 21:1066–1073
Article Google Scholar
Stasinopoulos DM, Rigby RA, Akantziliotou C (2008) Instructions on how to use the GAMLSS package in R. Accompanying documentation in the current GAMLSS help files
Stedinger JR, Vogel RM, Foufoula-Georgiou E (1993) Frequency analysis of extreme events. In: Maidment DR (ed) Handbook of hydrology (Chapter 18). McGrawHill, Inc., New York
Google Scholar
Strupczewski WG, Kaczmarek Z (2001) Non-stationary approach to at-site flood frequency modeling II. Weighted least squares estimation. J Hydrol 248(1-4):143–151
Article Google Scholar
Strupczewski WG, Singh VP, Feluch W (2001a) Non-stationary approach to at-site flood frequency modeling I. Maximum likelihood estimation. J Hydrol 248(1–4):123–142
Article Google Scholar
Strupczewski WG, Singh VP, Mitosek HT (2001b) Non-stationary approach to at-site flood frequency modeling III. Flood analysis of Polish rivers. J Hydrol 248(1–4):152–167
Article Google Scholar
Tramblay Y, Neppel L, Carreau J, Najib K (2013) Non-stationary frequency analysis of heavy rainfall events in southern France. Hydrol Sci J 58(2):280–294
Article Google Scholar
Vasiliades L, Galiatsatou P, Loukas A (2015) Nonstationary frequency analysis of annual maximum rainfall using climate covariates. Water Resour Manag 29(2):339–358
Article Google Scholar
Villarini G, Serinaldi F, Smith JA, Krajewski WF (2009a) On the stationarity of annual flood peaks in the continental United States during the 20th century. Water Resour Res 45(8):W08417. doi:10.1029/2008WR007645
Article Google Scholar
Villarini G, Smith JA, Serinaldi F, Bales J, Bates PD (2009b) Flood frequency analysis for non-stationary annual peak records in an urban drainage basin. Adv Water Resour 32(8):1255–1266
Article Google Scholar
Villarini G, Smith JA, Napolitano F (2010) Non-stationary modeling of a long record of rainfall and temperature over Rome. Adv Water Resour 33(10):1256–1267
Article Google Scholar
Vinnikov KY, Robock A (2002) Trends in moments of climatic indices. Geophys Res Lett 29(2):1027. doi:10.1029/2001GL014025
Article Google Scholar
Vogel RM, McMahon TA, Chiew FHS (1993) Floodflow frequency model selection in Australia. J Hydrol 146:421–449
Article Google Scholar
Xiong LH, Guo SL (2004) Trend test and change-point detection for the annual discharge series of the Yangtze River at the Yichang hydrological station. Hydrol Sci J 49(1):99–112
Article Google Scholar
Xu ZX, Chen YN, Li JY (2004) Impact of climate change on water resources in the Tarim River basin. Water Resour Manag 18(5):439–458
Article Google Scholar
Yue S, Wang CY (2004) The Mann-Kendall test modified by effective sample size to detect trend in serially correlated hydrological series. Water Resour Manag 18(3):201–218
Article Google Scholar
Zuo DP, Xu ZX, Wu W, Zhao J, Zhao FF (2014) Identification of streamflow response to climate change and human activities in the Wei River basin, China. Water Resour Manag 28(3):833–851
Article Google Scholar

Download references

Acknowledgments

This research is financially supported by the National Natural Science Foundation of China (Grants NSFC 51190094 and NSFC 51479139), which are gratefully acknowledged. The contribution of C. J. Gippel was made whilst visiting the College of Water Resources and Hydropower Engineering, Wuhan University, supported by the High-End Foreign Expert Recruitment Programme, administered by the State Administration of Foreign Experts Affairs, Central People’s Government of the People’s Republic of China. We are very grateful to the editor and one anonymous reviewer for their valuable comments and constructive suggestions that helped us to greatly improve the manuscript.

Author information

Authors and Affiliations

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, 430072, China
Lihua Xiong, Tao Du, Chong-Yu Xu, Shenglian Guo & Cong Jiang
Department of Geosciences, University of Oslo, P.O. Box 1022, Blindern, 0315, Oslo, Norway
Chong-Yu Xu
Hubei Provincial Collaborative Innovation Center for Water Resources Security, Wuhan University, Wuhan, 430072, China
Shenglian Guo
Australian Rivers Institute, Griffith University, Nathan, Queensland, 4111, Australia
Christopher J. Gippel

Authors

Lihua Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Tao Du
View author publications
You can also search for this author in PubMed Google Scholar
Chong-Yu Xu
View author publications
You can also search for this author in PubMed Google Scholar
Shenglian Guo
View author publications
You can also search for this author in PubMed Google Scholar
Cong Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Gippel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tao Du.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xiong, L., Du, T., Xu, CY. et al. Non-Stationary Annual Maximum Flood Frequency Analysis Using the Norming Constants Method to Consider Non-Stationarity in the Annual Daily Flow Series. Water Resour Manage 29, 3615–3633 (2015). https://doi.org/10.1007/s11269-015-1019-6

Download citation

Received: 12 June 2014
Accepted: 21 April 2015
Published: 29 April 2015
Issue Date: August 2015
DOI: https://doi.org/10.1007/s11269-015-1019-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Non-Stationary Annual Maximum Flood Frequency Analysis Using the Norming Constants Method to Consider Non-Stationarity in the Annual Daily Flow Series

Abstract

Similar content being viewed by others

A comparison of three approaches to non-stationary flood frequency analysis

Identification of the Non-stationarity of Floods: Changing Patterns, Causes, and Implications

Comparison of Two Nonstationary Flood Frequency Analysis Methods within the Context of the Variable Regime in the Representative Polish Rivers

1 Introduction

2 Proposed Norming Constants Method (NCM) of Non-Stationary FFA

2.1 Detection of Non-Stationarity in Hydrological Time Series