Keywords

1 Introduction

Information on the spatio-temporal variability of extreme rainfall characteristics is of critical importance for planning, design, and management of various water systems [1]. In particular, for urban and small rural watersheds that are generally characterized by fast response, the design of hydraulic structures such as small dams, culverts, storm sewers, detention basins, and so on, requires extreme rainfall input for very short time durations (e.g. few minutes or hours) for runoff simulation models. More specifically, this extreme rainfall information is extracted from the “intensity–duration–frequency” (IDF) relations for various durations and return periods at a given site of interest [1,2,3].

In current engineering practice, the IDF relations are commonly derived based on statistical frequency analyses of annual maximum (AM) rainfall series for different durations. In Canada, Environment and Climate Change Canada (ECC) provides short-duration extreme rainfall data for nine different rainfall durations (D = 5, 10, 15, 30, 60, 120, 360, 720, and 1440 min) and the IDF relations for approximately 650 stations across Canada with at least 10-year rainfall record [4]. Traditionally, the extreme rainfalls for six different return periods (T = 2, 5, 10, 25, 50, and 100 years) were computed by fitting the two-parameter Gumbel distribution to the AM series for each rainfall duration independently using the method of moments (MOM). However, it has been widely known that this Gumbel/MOM-based traditional approach may not produce accurate and robust extreme rainfall estimates as compared to those given by, for instance, the generalized extreme value (GEV)/L-Moment method [5]. Consequently, there are several recently developed IDF products in Canada (as summarized in Table 1) to provide some improvements in the estimation of extreme rainfalls for both gaged or ungaged locations. Hence, there is an urgent need to carry out a critical review of existing rainfall estimation methods and to perform a detailed comparative study to assess their performance in order to identify the best estimation method for deriving IDF relations for Canada.

Table 1 Existing IDF products and tools in Canada

2 Extreme Rainfall Estimation Methods

As mentioned previously, Table 1 provides a summary of existing methods for extreme rainfall estimation and for developing the IDF relations for Canada. These include the traditional EC-IDF product by ECC [4], the Metro Vancouver IDF tool [6]; the IDF design values for Atlantic Provinces by Shephard [7], the Ontario Ministry of Transportation (MTO-IDF) tool by Soulis et al. [8], the IDF-CC tool by Simonovic et al. [3], and the SMExRain tool by Nguyen and Nguyen [2]. The key differences between these tools are: (i) the different probability model that was selected for describing the distribution of extreme rainfalls; (ii) the different estimation method that was used for estimating the probability model parameters; (iii) the different regression model that was chosen to represent the IDF curves; (iv) the different estimation method that was used for estimating the regression model parameters; (v) the different spatial interpolation technique that was used for transferring IDF information from gaged sites to the target ungaged location; and (vi) the different consideration of the scale-invariance property of the extreme rainfall processes for different rainfall durations.

More specifically, for the whole Canada, only the EC-IDF and IDF-CC are readily available, and the data are maintained up to date. The MTO IDF tool is mainly developed for the province of Ontario using data primarily from EC-IDF in combination with USGS digital elevation data to derive physiographic characteristics [8]. The Metro Vancouver IDF curves are primarily developed for the City of Vancouver and its neighbouring regions based on Hosking and Wallis [9]’s regional approach [6]. The IDF curves for Atlantic Provinces use the same approach as in the EC-IDF and have been recently integrated into the EC-IDF product [4]. The SMExRain tool is a standalone application that can be used to generate IDF curves for any location in Canada with available IDF data [2]. Therefore, only the EC-IDF product and the IDF-CC tool were selected for this comparative study. A summary of the basic features of these two products is provided in the following sections.

The EC-IDF tool provides the historical IDF relations for approximately 650 stations across Canada with at least 10-year record [4]. The extreme rainfalls for these IDF relations were computed for nine different rainfall durations (D = 5–1440 min) and for six different return periods (T = 2–100 years) using the two-parameter Gumbel distribution. In particular, the Gumbel distribution is fitted to the historical AM series for each rainfall duration independently using the method of moments (MOM). In addition, the IDF relations are described by a simple power-form relation as follows:

$$ I = aD^{b} $$
(1)

in which \(I\) is the rainfall intensity; \(D\) is the rainfall duration; and \(a, b\) are coefficients that are computed for each return period by the least-square technique in the log-data space. These two coefficients are also computed for locations without data using a simple linear spatial interpolation technique [4].

The IDF-CC tool uses the generalized extreme value (GEV) distribution as the parent distribution for representing the distribution of AM rainfall series [3]. The L-moment (LMOM) estimation method is used to estimate the three parameters of the GEV model. This tool also provides the IDF curves derived from the Gumbel distribution for the purpose of comparison. The IDF curves are described by the following mathematical relation:

$$ I = a\left( {D + c} \right)^{b} $$
(2)

in which \(a, b, c\) are coefficients that are computed for each return period using the differential evolution optimization algorithm [3] in the real data space.

In summary, these two traditional approaches are based on the regression of the computed rainfall intensities (or depths) over durations using Eq. (1) with two coefficients (QR2C method) or Eq. (2) with three coefficients (QR3C method). In the present study, a new approach is introduced based on the regression of the empirical statistical moments of observed rainfall amounts over durations. As compared to the traditional approaches, this new method can account for the observed scale-invariance property of the empirical statistical moments. In general, the proposed new method consists of the following four steps:

  1. (i)

    Firstly, compute the statistical moments of the AM rainfalls for the first few orders for each rainfall duration (e.g. from 5-min to 24-h intervals) for a given location of interest. More specifically, only the first two non-central moments (NCMs) are required for the GUM/MOM model, and the first three probability weighted moments (PWMs) are necessary for the GEV/LMOM model;

  2. (ii)

    Secondly, construct regression models to describe the relationships between the computed NCMs (or PWMs) of rainfall amounts and the rainfall durations. These relationships indicate the scale-invariance property of the AM processes. For instance, some previous studies have found that extreme rainfall processes for durations ranging from a few minutes to several days could display one or two different scaling regimes [10, 11]. These rainfall-statistical-moment-based regression models are quite useful for the estimation of the NCMs (or PWMs) of sub-hourly or sub-daily rainfall amounts from those of the daily amounts;

  3. (iii)

    Thirdly, estimate the parameters of a selected theoretical distribution using the method of moments (or the method of L-moments) in consideration of the regression relationships established in step (ii). More specifically, there are two different approaches are proposed in this study depending on the application of these regression relationships: (a) the rainfall-statistical-moment-based method (referred hereafter as MR method) if the parameters of the GUM/MOM or GEV/LMOM model are computed based on the direct regression of the statistical moments of AM rainfalls for different durations; and (b) the scaling-statistical-moment-based method (referred herein as MRS method) if the GUM/MOM and GEV/LMOM parameters are computed based on the scale-invariance relations between daily and sub-daily statistical moments; and

  4. (iv)

    Finally, estimate the design rainfall intensity (or depth) for a given duration and return period of interest using the quantile function of the Gumbel distribution or GEV distribution [10].

In this study, two common dimensionless goodness-of-fit (GOF) criteria were selected to compare the performance of the four methods (QR2C, QR3C, MR, and MRS). These criteria include the mean absolute relative difference (\({\text{MADr}}\)) and the root mean square relative error (\({\text{RMSEr}}\)) as defined below:

$$ {\text{MADr}} = { }\frac{1}{n}\sum {\left\{ {\frac{{|x_{ij} - y_{ij} |}}{{x_{ij} }}} \right\}} $$
(3)
$$ {\text{RMSEr}} = { }\left[ {\frac{1}{n}\sum {\left\{ {\frac{{\left( {x_{i} - y_{i} } \right)}}{{x_{i} }}} \right\}^{2} } } \right]^{1/2} $$
(4)

in which \(x_{ij} \) and \(y_{ij}\) are the rainfall quantiles corresponding to duration \(D_{i}\) (\(i\) = 1, 2, …, 9) and return period \(T_{j}\) (\(j\) = 1, 2, …, 6) estimated based on the observed data and based on one of the four estimation methods (i.e. QR2C, QR3C, MR, and MRS), respectively; \(n = i \cdot j\) is the number of design rainfall values at each station.

3 Study Sites and Data

As mentioned previously, the IDF network consists of approximately 650 raingages located across Canada [4]. For each site, ECC provides the AM series for nine different rainfall durations ranging from 5 min to 24 h. The record length for these AM series varies from 10 to 82 years, and most of the data are recorded after 1960 and updated to 2017. For this comparative study, only 39 stations with very long record (containing at least 50 years) were selected since the estimation of rainfall quantiles for high return periods (e.g. for T = 50 and 100 years) can be considered as reliable for these long datasets [2]. The details of the 39 chosen stations are presented in Table 2. The comparison of the performance of the two traditional (QR2C and QR3C) approaches and the two new MR and MRS methods will be carried out using the available IDF data for these 39 stations.

Table 2 Details of the selected 39 stations, including the province (PR), station identification (ID), station name, latitude (lat, degrees), longitude (long, degrees), and record period and length (No. of years)

4 Results and Discussions

In the present study, the rainfall quantiles for each station were estimated for all six return periods (T = 2–100 years) and for all nine durations (D = 5–1440 min) using the four selected approaches (QR2C, QR3C, MR, and MRS). The IDF relations are then constructed for each station. For purposes of illustration, Fig. 1 shows the IDF relations for St Thomas WPCP Station for all methods. It can be observed that the QR2C method produced the least accurate results. As expected, the QR3C, MR, and MRS methods, considered as the general form of the QR2C method, provided more accurate results. Similar results were found for other stations.

Fig. 1
figure 1

IDF relations at St Thomas WPCP Station based on different methods: a and b quantile-based regression methods (QR2C and QR3C); c and d moment-based regression without and with scaling (MR and MRS). Markers represent the observed GUM/MOM rainfall quantiles

More generally, Fig. 2 shows the comparison of these four methods based on the \({\text{MADr}}\) (%) and \({\text{RMSEr}}\) (%) for both Gumbel and GEV models and for all stations. The slope ratio between the first and second scaling regimes of AM series is also plotted. Notice that the first scaling regime is identified as from the 5-min duration to the breakpoint on the graph between the empirical statistical moments and the rainfall durations; the second scaling regime is defined from the breakpoint to the 24-h duration for each station. It can be seen that the QR2C method yields high error for most of stations, especially stations with large difference in the slope ratio. The QR3C method produces good results for most stations with \({\text{MADr}}\) less than 5%, except the first two stations with the slope ratio less than 1 indicating the convex pattern for the IDF curves. On the other hand, both MR and MRS approaches can capture very well this convex pattern for these two stations and can also provide the \({\text{MADr}}\) less than 5% for all other stations. The \({\text{MADr}}\) and \({\text{RMSEr}}\) results for all stations are also summarized in the form of boxplots as shown in Fig. 2. It can be seen that the QR2C is the least accurate method.

Fig. 2
figure 2

Plots of the scaling slope ratio, MADr (%), and RMSEr (%) for different methods (QR2C, QR3C, MR, and MRS) for Gumbel and GEV models

A more detailed investigation of the slope ratios of the first and second scaling regimes is shown in Fig. 3. These slopes are computed based on the first NCMs (or first PWMs) of rainfall depths over durations for all available IDF stations (approximately 650 stations). In general, it can be seen that the pattern of the IDF curves varies over different regions of Canada. However, for a large number of stations in the Pacific region the convex pattern (i.e. the slope ratio value is less than one) occurs more frequent. This could be due to the orographic effect in this region which causes a very different behaviour of short-duration extreme rainfall processes. Hence, the QR3C method should not be applied in this region.

Fig. 3
figure 3

Slope ratios of the first and second scaling regimes based on the first NCMs (or PWMs) of rainfall depths for different regions of Canada

5 Summary and Conclusions

In Canada, there exist several different methods for developing IDF relations. Hence, in this research, a comparative study was carried out to assess the performance of these methods in order to identify the best approach for use in practice. More specifically, two traditional approaches (QR2C and QR3C) were compared with two new proposed procedures (MR and MRS). In general, the traditional methods were based on the relationships between the extreme rainfall quantiles with the rainfall durations, while the new approaches were relied on the relationships between the statistical moments of extreme rainfalls and the rainfall durations. The comparation was performed using the available historical AM data for nine different durations (from 5 min to 24 h) for 39 stations with long record length (at least 50 years) across Canada to represent the diverse climatic conditions of Canada.

Results of this comparative study have indicated that the QR2C based on the GUM/MOM model is the least accurate method as compared with the other three methods. In addition, it was found that different patterns of the IDF curves for different locations in Canada can be linked to the scaling behaviour of the extreme AM processes. This scaling behaviour can be identified based on the relationships between the empirical statistical moments (NCMs or PWMs) of rainfall amounts and the rainfall durations. Consequently, the QR3C method can be used for IDF relations with linear and concave scaling patterns but cannot provide an accurate rainfall estimation for IDF curves with the convex pattern that is a common pattern observed in the Pacific region. On the other hand, it was found that the new MR and MRS methods proposed in this study can be used for all regions of Canada.

Furthermore, the MR method produces slightly better results than the MRS since it was relied on the best fit of the regression model to the empirical statistical moments over all different rainfall durations, while the MRS method was based on the approximate fit of the derived sub-daily rainfall statistical moments from the daily statistical moments. However, this difference is not significant (on average, less than 2% between the two methods). The MRS method, however, offers a key advantage for cases of ungaged for partially-gaged sites where the derivation of the distributions of sub-hourly and sub-daily extreme rainfalls from that of daily amounts is required.