1 Introduction

Design rainfall in the form of intensity, frequency and duration (IFD) information is one of the most commonly used data in planning, designing, development and risk assessment of water resources engineering projects (BOM 2016). The IFD represent a functional relationship among three important parameters, average rainfall intensity, duration and return period. Due to the paramount importance of IFD curves, it has been widely researched previously (e.g. Liong et al. 2002; Jakob et al. 2007; Xu and Tung 2009; Lee et al. 2010; Haddad et al. 2011; Dourte et al. 2013; Du et al. 2014; Khan et al. 2015 and Garcia-Urquia 2016).

Different approaches have been adopted for derivation of IFD curves. For example, Yu et al. (2004) developed IFD curves using data from 46 rain gauges over three regions in northern Taiwan. They pointed out that the IFD formula could be expressed in three ways depending on the types of scaling and the homogeneity of the regions. In another study, Nhat et al. (2006) derived IFD relationship for short durations based on daily rainfall data at two rainfall stations located at Yodo catchment in Japan. The new IFD curves showed better results as compared to IFD curves estimated by other methods such as Bernard equation (Bernard 1932). Overeem et al. (2008) adopted generalised least squares regression (GLSR) method and generalised extreme value (GEV) distribution to derive design rainfalls for 1–24-h durations using rainfall data from 12 stations in the Netherlands. The L-Moments method was adopted to calculate parameters of the GEV distribution. It was found that by using the bootstrap method, the uncertainties in design rainfalls could be estimated. Furthermore, it was noted that the shape parameter of the GEV distribution did not change with duration, while for the location parameter there was a notable increase as duration decreased.

Elsebaie (2011) developed IFD relationship for duration of 10–1440 min and return periods of 2–100 years for two stations at Najran and Hafr Albatin regions in the Kingdom of Saudi Arabia by using the Gumbel and the log Pearson type 3 distributions. A nonlinear regression analysis was adopted to estimate the parameters of IFD relationship at each region. The Chi-square test was used to select the best-fit statistical distribution. It was found that both the distributions produced similar IFD curves.

Mamoon et al. (2014) updated the IFD data in Qatar using L-Moments method and the index frequency method using daily rainfall data from 32 stations. It was found that the Pearson type 3 distribution provided the best fit to the daily duration annual maximum (AM) rainfall data. Liu et al. (2015) developed IFD curves using at-site and regional approaches as well as the corresponding confidence intervals at 9 rainfall stations in Jakarta area. The regional frequency analysis with L-Moments approach was used to analyse the AM rainfall series. With goodness-of-fit measure based on Monte Carlo simulation, GEV distribution was selected as the best-fit distribution. When comparing the quantiles of IFD curves and corresponding confidence intervals between at-site and regional approaches, it was found that the confidence intervals estimated by regional approach were generally smaller than those for the at-site approach.

In Australia, there have been a good number of studies on design rainfall estimation (e.g. Jakob et al. 2007; Haddad et al. 2010, 2011; Haddad and Rahman 2014; Yilmaz et al. 2014; Verdon-Kidd and Kiem 2015 and Bennett et al. 2016). For example, Jakob et al. (2007) applied L-Moments-based regionalisation method to the rainfall data in southeast Queensland and northeast New South Wales in Australia to IFD curves for durations of 24, 48 and 72 h and return periods of 1, 2, 5, 10, 20, 50 and 100 years. A partial least squares regression approach was used to derive reliable estimates of L-skewness and L-coefficient of variation to extend the study to the sub-daily durations of 1, 2, 3, 6 and 12 h. For selecting the best-fit distribution in the region, generalised logistic, GEV and generalised normal distributions were considered. The results showed that among the three-parameter distributions considered, the GEV gave the best fit. The validation against data in the pilot study area suggested good performance for 6- and 12-h durations. The comparison of the results of this study with those of Australian rainfall and runoff 1987 (ARR87) showed broad similarities in design rainfalls between the two methods.

Haddad et al. (2010) presented a statistical modelling framework to derive design rainfalls based on GLSR and L-Moments methods using data from 203 rainfall stations across Australia. By using the GLSR method, the regional prediction equations for the index rainfall, L-CV and L-skewness were developed for durations of 1–72 h. It was found that the proposed method, the GLSR coupled with L-Moments, was able to produce quite accurate design rainfall estimates for all the selected durations and return periods considered in the study. Also, it was found that a polynomial of third degree was sufficient for smoothing the IFD curves.

Haddad et al. (2011) developed IFD curves for short storm durations based on GLSR and L-Moments-based index methods using data from 203 rainfall stations across Australia. The Akaike and Bayesian information criteria were applied to identify a regional parent distribution. Monte Carlo simulation was adopted to estimate the sampling error variances of the AM series parameters (L-CV and L-skewness). The results showed that the GLSR performed relatively well for both the 6-min and 1-h durations. The generalised Pareto distribution appeared as the best-fit distribution. The new design rainfall estimates were generally greater than the at-site estimates, with some notable differences from the ARR87 estimates (I. E. Aust. 1987).

Haddad and Rahman (2014) developed a technique to derive short duration (sub-hourly and sub-daily durations) design rainfalls using long-duration rainfall statistics using L-Moments and GLSR to rainfall data set consisting of 203 rainfall stations. For estimating the design rainfall quantiles, the GEV distribution and generalised Pareto distribution were adopted. In another study, Haddad et al. (2015) compared three methods to form regions in rainfall frequency analysis using Australian data.

In Australia, the recommended IFD data are provided in Australian rainfall and runoff (ARR). The ARR1987 adopted LP3 distribution to derive IFD curves (I. E. Aust. 1987). In 2013, GEV distribution has been adopted to derive the IFD curves (Green et al. 2012; Ball et al. 2016) where a region of influence approach was adopted to form regions and a spline software was used to achieve spatial consistency in IFD data.

There have been many researches on design rainfall estimation as mentioned above. However, there is a lack of study on systematic comparison among IFD curves derived by different distributions and methods. It is important to assess how much variation in IFD values can occur when different probability distribution is adopted or between the IFD curves derived from at-site and regional frequency analyses. Hence, this research is devoted to compare IFD curves between a number of distributions and methods. In this paper, the following research questions are investigated. Firstly, how much variation in IFD curves is expected between two commonly adopted probability distributions, LP3 and GEV. Secondly, in smoothing the rainfall quantiles at a given site, whether empirical method or polynomial regression provides a better fitting. Finally, how the at-site IFD curves vary with the regional IFD curves. We use data from ten rainfall stations from NSW state in Australia to answer these research questions. We also use the regional IFD curves in Australia (from ARR 1987 and ARR 2016) to compare between the at-site and regional IFD curves (I. E. Aust. 1987; Ball et al. 2016). It is expected that the outcomes of this research will provide important guidance in validating the regional IFD curves, which are widely used in practice in preference to at-site IFD curves.

2 Study area and data

We use data from ten pluviograph stations from eastern New South Wales (NSW), Australia (Fig. 1). The rainfall data of each of these stations have been abstracted using HYDSTRA software from the database of Australian Bureau of Meteorology (BOM 2013). The rainfall data have record lengths greater than 30 years, ranging between 32 and 64 years with an average of 45 years (Table 1). The annual maximum (AM) rainfall events were extracted from rainfall records for each of the study stations. The AM rainfall events of these stations have less than 5% gaps. The primary rainfall data used in this study are recorded at 6-min interval; a FORTRAN program is developed to extract AM rainfall events of six sub-hourly durations (6, 12, 18, 24, 30 and 48 min), six sub-daily durations (1, 2, 3, 6, 8 and 12 h) and three daily durations (1, 2 and 3 days) for each of the ten rainfall stations.

Fig. 1
figure 1

Locations of the selected ten pluviograph stations in NSW, Australia

Table 1 Details of selected pluviography stations

3 Methods

In this study, stationary frequency analysis is applied to derive intensity frequency duration (IFD) curves at each of the selected stations for 7 return periods of 1, 2, 5, 10, 20, 50 and 100 years. The GEV and LP3 distributions are used to fit the rainfall intensity data for a given duration at a given station. The reason for using these two distributions is that they have been used widely in rainfall frequency analysis in previous studies (e.g. Nhat et al. 2006; Elsebaie 2011; Haddad and Rahman 2014; Liu et al. 2015; Ball et al. 2016). Three goodness-of-fit tests (i.e. Kolmogorov–Smirnov, Anderson–Darling and Chi-square tests) are adopted to assess the goodness-of-fit of the GEV and LP3 distributions. To establish the relationship between the estimated rainfall quantiles, rainfall duration and return period, two methods have been adopted (empirical and polynomial). A brief description of each of these statistical techniques is presented below.

3.1 Generalised extreme value (GEV) distribution

GEV distribution is a continuous probability distribution developed within extreme value theory to combine the Fréchet (1927), Weibull (1951) and Gumbel (1958) families of distributions. The GEV distribution has three parameters (location: ξ; scale: α; and shape: κ), and it is used as an approximation to model the maxima of long sequences of random variables. The GEV distribution is equivalent to a Gumbel, Fréchet or Weibull distribution depending on whether κ = 0, κ > 0 or κ < 0, respectively. The GEV distribution has the cumulative density function (CDF) and probability density function (PDF) (Hosking and Wallis 1997) as shown below:

$$F(x;\zeta ,\alpha ,\kappa ) = \exp \left\{ { - \left( {1 + \frac{{\kappa \left( {x - \xi } \right)}}{\alpha }} \right)^{{{{ - 1} \mathord{\left/ {\vphantom {{ - 1} \kappa }} \right. \kern-0pt} \kappa }}} } \right\}$$
(1)
$$f(x;\zeta ,\alpha ,\kappa ) = \alpha^{ - 1} \exp \left\{ {\left( {\frac{1}{\kappa } + 1} \right) \times \log \left( {1 - \frac{{\kappa \left( {x - \xi } \right)}}{\alpha }} \right) - \exp \left( { - \frac{{\left( {x - \xi } \right)}}{\alpha }} \right)} \right\}$$
(2)

where x refers to the primary data series which is to be fitted by the GEV distribution.

3.2 Log Pearson type 3 distribution (LP3)

The LP3 distribution is a three-parameter distribution (similar to GEV), and it uses location (μ), scale (σ) and shape (γ) parameters (Millington et al. 2011).

To calculate the T-year event quantile, the LP3 distribution based on the mean (μ), standard deviation (σ) and skewness (γ) becomes:

$$Q_{\text{T}} = \mu + K_{\text{T}} \sigma {\kern 1pt}$$
(3)

where KT is frequency factor for return period T, which depends on skewness.

3.3 L-Moments method

In this study, L-Moments method is used to fit the GEV distribution. L-Moments are defined as liner combinations of probability weighted moments (PWMs), which were introduced by Greenwood et al. (1979) and others (i.e. Landwehr et al. 1979; Wallis 1980; Greis and Wood 1983; Hosking et al. 1985; Hosking and Wallis 1987) to develop statistical inference procedures for use as a tool for estimating the parameters of probability distributions.

3.4 Goodness-of-fit tests

In this study, three goodness-of-fit tests are used to assess how well a given distribution fits the rainfall data series of a given duration. The detail of each test is provided below:

3.5 Kolmogorov–Smirnov test

The Kolmogorov–Smirnov (KS) test (Kirkman 1996) utilises the greatest vertical difference between the theoretical and the empirical cumulative distribution functions:

$$KS = \mathop {\hbox{max} }\limits_{1 \le i \le n} \left\{ {F(X_{i} ) - \frac{i - 1}{n},\frac{i}{n} - F(X_{i} )} \right\}$$
(4)

3.6 Anderson–Darling test

Anderson–Darling (AD) test (Scholz and Stephens 1987) gives more weight to the tails of the distribution and is defined by the following equation:

$$AD^{2} = - n - \frac{1}{n}\sum\limits_{i = 1}^{n} {(2i - 1)} \left[ {\ln (F(X_{i} ) + \ln (1 - F(X_{n - i + 1} )))} \right]$$
(5)

3.7 Chi-square test

The Chi-square (\(\chi^{2}\)) test (Preacher 2001) is applied to binned data. The \(\chi^{2}\) test statistic is defined as:

$$\chi^{2} = \sum\limits_{i = 1}^{N} {\left( {O_{i} - E_{i} } \right)^{2} /E_{i} }$$
(6)

where \(O_{i}\) is the observed frequency of the data sample and \(E_{i}\) is the expected frequency of data sample calculated by \(E_{i} = F(X_{2} ) - F(X_{1} )\), where \(F\) is the cumulative distribution function of the probability distribution being tested.

The test is conducted at three significance levels (10, 5 and 1%).

3.8 Developing IFD curves

Once the seven rainfall quantiles (T = 1, 2, 5, 10, 20, 50 and 100 years) are estimated from the fitted GEV (or LP3) distribution, the IFD curve is developed in two ways: (1) making an empirical relationship among rainfall intensity (I), duration (D) and return period (T); and (2) fitting a second-degree polynomial among the seven I values. The empirical relationship is expressed mathematically as follows:

$$I = F(T,D)$$
(7)

The empirical IFD relationship is given by the below equation:

$$I = {{f \times T} \mathord{\left/ {\vphantom {{f \times T} {\left( {D^{v} + l} \right)^{e} }}} \right. \kern-0pt} {\left( {D^{v} + l} \right)^{e} }}$$
(8)

where I refers to a rainfall intensity (mm/hr), T is return period (year), D is duration (minutes), and f, l, e and v are coefficients.

The adopted polynomial equation has the following form:

$$log \, \left( {I_{\text{T}} } \right) \, = \, a \times \left( {log \, \left( D \right)} \right)^{2} + \, b \times log \, \left( D \right) \, + \, c$$
(9)

where a, b, c are constants estimated by polynomial regression analysis using XLSTAT software.

3.9 Relative error and root mean square error

In this study, the relative accuracy of the new IFD curves is assessed by calculating the relative error (RE) and the root mean square error (RMSE) (expressed by Eqs. 10 and 11) between the new IFD values (Inew) and the IFD values recommended in Australia for general use, i.e. ARR87 and ARR13 IFD values (IARR) (I. E. Aust. 1987 and Ball et al. 2016).

$${\text{RE}}\left( \% \right) = \left( {\frac{{I_{\text{new}} - I_{\text{ARR}} }}{{I_{\text{new}} }}} \right) \times 100$$
(10)
$${\text{RMSE}}\left( \% \right) = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {\frac{{I_{\text{new}} - I_{\text{ARR}} }}{{I_{\text{new}} }}} \right)}^{2} } \times 100$$
(11)

4 Results

4.1 Goodness-of-fit tests

Three goodness-of-fit tests (i.e. KS, AD and χ2) (described in Sect. 3) were used in this study to choose the better statistical distribution among the two adopted distributions (GEV and LP3). Depending on the AM rainfall data series of the ten selected stations and 15 rainfall durations, a total of 150 data sets (i.e. 10 × 15) were available for the goodness-of-fit testing. Depending on the results of goodness-of-fit tests, GEV and LP3 distributions showed a minor difference in fitting the AM rainfall data series. At the three significance levels (10, 5 and 1%) and out of the 150 data sets (Table 2), the KS and AD tests did not reject GEV distribution in any of the cases, while Chi-square test rejected GEV just one occasion (for rainfall data series of 120 min duration at station Barraba- 054102). The KS test also did not reject the LP3 distribution for any of the cases. The AD test rejected the LP3 distribution just one occasion (for rainfall data series of 6 min duration at station Murwillumbah-058158). Finally, the Chi-square test did not reject LP3 in any cases.

Table 2 Number of rejections (out of 150 cases) at the three significance levels (10, 5 and 1%) for the KS, AD and Chi-square goodness-of-fit tests

4.2 Shape parameter for the GEV distribution

In estimating extreme rainfall, tail of the distribution is of special significance, which is governed by the shape parameter. As noted in Sect. 3, the GEV distribution belongs to one of the three families of extreme value distributions (i.e. Gumbel, Fréchet and Weibull families) depending on the value of shape parameter (i.e. if it is 0, larger than 0 or less than 0). Therefore, it is of importance to test whether the shape parameter equals to 0 or not.

In this section, the results of the shape parameter of the GEV distribution for the AM rainfall data at the ten rainfall stations are examined. Figure 2 shows the boxplot of the shape parameter values. For 6-min duration, the shape parameter is generally negative, with a median value of − 0.083, while for 4320-min (3 days) duration, the shape parameter is dominated by positive values, with a median value of 0.036. For the sub-hourly durations, most of the stations are dominated by negative shape parameters, and for the sub-daily and daily durations, the shape parameters are generally positive. For the 1 h, the shape parameter has a median very close to 0. Among the ten stations, the minimum value of the shape parameter is − 0.319 (for 6 min rainfall duration), and the maximum value is 0.288 (for 12 h rainfall duration). The regional average (over the ten stations) value of the shape parameters is 0.011, with 79 positive values and 71 negative values. All the five important quartile values of the shape parameters are presented in Table 3.

Table 3 Quartile information of the shape parameter of the GEV distribution for ten stations in NSW
Fig. 2
figure 2

Boxplots of shape parameters of the GEV distribution at ten stations in NSW

4.3 Development of the IFD curves

The empirical relationship of the IFD curves based on Eq. 8 for each of the ten stations is presented in Table 4 (where 7 basic quantiles were estimated using fitted GEV distribution) and Table 5 (quantiles estimated by LP3 distribution). The coefficient of determination (R2) values of the fitted empirical IFD curves range 0.981–0.995 (average = 0.990) for the GEV case. For the LP3 distribution, R2 values range 0.977–0.996 (average = 0.988). These show that the R2 values are quite high for the empirical IFD curves. The RMSE values (estimated by Eq. 11) range 3.153 and 6.205% (average = 4.423) for the GEV distribution, while for LP3 distribution range 2.468 and 6.430% (average = 4.622). These results imply that confidence can be placed on the empirical equations used to derive IFD curves.

Table 4 Empirical relationship between I (mm/hr) (estimated by GEV distribution), T (years) and D (min) at the ten selected stations
Table 5 Empirical relationship between I (mm/hr) (estimated by the LP3 distribution), T (years) and D (min) (estimated by LP3) at the ten selected stations

The complete set of IFD curves obtained by the fitted empirical equations (shown in Tables 4 and 5) for the three stations (Gowrie-055194, Comboyne-060080 and Ingebyra-071042) are presented in Figs. 3 and 4 as examples. These figures show that there is a high degree of consistency in the developed IFD curves across different return periods. Also, these figures show that the IFD curves have a negative gradient, consistent with experience that higher rainfall intensity occurs over shorter durations.

Fig. 3
figure 3

Examples of developed IFD curves based on empirical relationships shown in Table 5 (GEV distribution case)

Fig. 4
figure 4

Examples of developed IFD curves based on empirical relationships shown in Table 6 (LP3 distribution case)

The second-degree polynomial was used to derive the relationship of an IFD curve based on Eq. 9 for each of the ten stations. Table 6 shows the R2 values of the fitted second-degree polynomial IFD curves for all the 7 return periods. The R2 values range 0.997–0.999 (average = 0.998). Also, it can be seen that the RMSE values range 1.015 to 3.851% for the GEV distribution case, and 1.113–3.607% for LP3 distribution case. The R2 values for the second-degree polynomial are found to be higher than those obtained by using the empirical method (Tables 4 and 5); therefore, it can be concluded that the second-degree polynomial has resulted in more accurate IFD curves than the empirical method. As examples, Figs. 5, 6, 7 and 8 show the IFD curves derived by the second-degree polynomial for the three rainfall stations (i.e. Gowrie-055194, Comboyne-060080 and Ingebyra-071042) and two return periods (5 and 100 years).

Table 6 Average coefficient of determination (R2) and RMSE values for all the ten stations (case of the fitted second-degree polynomial IFD curves)
Fig. 5
figure 5

Fitting a second-degree polynomial between I and D estimated by GEV distribution for 5-year return period

Fig. 6
figure 6

Fitting a second-degree polynomial between I and D estimated by GEV distribution for 100-year return period

Fig. 7
figure 7

Fitting a second-degree polynomial between I and D estimated by LP3 distribution for 5-year return period

Fig. 8
figure 8

Fitting a second-degree polynomial between I and D estimated by LP3 distribution for 100-year return period

4.4 Visualisation of derived IFD curves with respect to observed AM rainfall data

Figures 9 and 10 show the new IFD curves (by the empirical and polynomial fitting methods) along with the observed AM rainfall data for the same three stations as above (Gowrie-055194, Comboyne-060080 and Ingebyra-071042). Weibull plotting position formula was used to plot the AM rainfall data. These two figures show that the IFD curves obtained from second-degree polynomial (IFD*: red colour) generally match the AM rainfall data more closely than IFD curves obtained from the empirical relationship (IFD**: blue colour). Moreover, it is noted that in general for shorter rainfall durations such as 6 min, the IFD curves better fit the AM rainfall data than those of longer durations such 180 and 2880 min.

Fig. 9
figure 9

IFD curves from polynomial equations (IFD*) and IFD curves from empirical formula (IFD**): GEV distribution case

Fig. 10
figure 10

IFD curves from polynomial equations (IFD*) and IFD curves from empirical formula (IFD**): LP3 distribution case

4.5 Comparison of estimated IFD curves with the ARR87 and ARR13

The new IFD curves based on both the adopted distributions (GEV and LP3) were compared with IFD curves recommended by ARR87 (I. E. Aust. 1987) and ARR13 (Ball et al. 2016). The comparison is illustrated for two return periods (5 and 100 years) for three stations (i.e. Gowrie-055194, Comboyne-060080 and Ingebyra-071042) in Fig. 11. This figure shows that for the three selected stations, there are quite good match between the new IFD curves (based on GEV distribution) and ARR13 IFD curves. Also, our new IFD curves (based on LP3 distribution) exhibit a quite good match with those of the ARR87. Also, this figure shows that for 100-year return period, the IFD curves have slightly more kinks especially for longer durations (i.e. greater than 60 min), which is not unexpected.

Fig. 11
figure 11

Comparison of at-site IFD curves with ARR13 and ARR87 IFD curves for 5- and 100-year return periods

For evaluating the accuracy of the new IFD curves, the relative error (RE) (defined by Eq. 10) and root mean square error values (RMSE) (defined by Eq. 11) were used. Table 7 shows the absolute median RE values (%) between the new IFD curves and the ARR IFD curves (i.e. ARR87 and ARR13), which shows that the absolute median RE values range between 1 and 48% for the ten selected stations, with a mean RE value of 14.73%, and with a standard deviation of RE values of 9.93% (based on GEV distribution) and with a mean RE value of 17.20%, and with a standard deviation of RE values of 11.97% (based on LP3 distribution). As the return period increases, the absolute RE values generally increase. For stations Norfolk-200288, the RE values are the largest, and for station Glen Alice-061334, the RE values are quite high for the ARR87 case. Overall, considering all the seven return periods and ten stations, the average median RE value is the smallest (5.3%) between the new IFD curves (based on the GEV distribution) and the ARR13 IFD curves. Furthermore, the average highest median RE value (37.3%) is found between the new IFD curves (based on LP3 distribution) and ARR87 IFD curves. It should be noted here that the ARR87 and ARR13 IFD curves were not at-site IFD curves (they were regional IFD curves that were also subjected to regional smoothing), and hence, some differences between ARR and our new IFD curves are expected.

Table 7 Absolute median relative error (RE) values (%) between the new IFD curves and ARR87 and ARR13 IFD curves

The boxplots of the RE values (considering the sign of RE) were examined. (An example is shown in Fig. 12 for station Gowrie-055194.) These boxplots show that the new IFD values are generally smaller than the ARR87 and ARR13 IFD curves.

Fig. 12
figure 12

Boxplots for relative error (RE) (considering sign of RE) for the 7 return periods and for the station Gowrie-055194. a RE estimated between the new IFD curves based on GEV distribution and ARR87. b RE estimated between the new IFD curves based on GEV distribution and ARR13. c RE estimated between the new IFD curves based on LP3 distribution and ARR87. d RE estimated between the new IFD curves based on LP3 distribution and ARR13

Figure 13 shows the spatial distribution of median RE values (%) (considering sign) between the new IFD curves and the ARR IFD curves (ARR87 and ARR13) for 5-year return period. From Fig. 13, it can be seen that for 5-years return period, the RE values are generally negative, i.e. the new IFD curves are smaller than the ARR IFD curves. However, the new IFD curves based on GEV distribution (Fig. 13b) show the lowest degree of differences (with 5 positive RE and 5 negative RE values out of ten stations) with the ARR13 IFD curves. Figure 14 shows the spatial distribution of median RE values (%) (considering sign) between the new IFD curves and the ARR IFD curves (ARR87 and ARR13) for 100 years return period. It can be seen from Fig. 14 that the new IFD curves are smaller than the ARR IFD curves for majority of the selected ten stations.

Fig. 13
figure 13

Distribution of median RE values (%) between the new IFD curves based on GEV distribution and the a ARR87, b ARR13 and the new IFD curves based on the LP3 distribution and c ARR87 and d ARR13 IFD curves (5-year return periods)

Fig. 14
figure 14

Distribution of median RE values (%) between the new IFD curves based on GEV distribution and the e ARR87, f ARR13 and the new IFD curves based on the LP3 distribution and g ARR87 and h ARR13 IFD curves (100-year return periods)

Table 8 summarises root mean square error (RMSE) values (estimated by Eq. 11) for all the seven return periods and ten selected stations. Overall, considering all the return periods and stations, the median RMSE value is the smallest (14.0%) between the new IFD curves (based on the GEV distribution) and the ARR13 IFD curves. Furthermore, the highest median RMSE value (20.6%) is found between the new IFD curves (based on LP3 distribution) and ARR87 IFD curves. Moreover, it can be seen that the lowest RMSE value is 6.3% for station Ingebyra-071042 (based on GEV distribution), while the highest RMSE value is 45% for station Murwillumbah-058158 (based on LP3 distribution).

Table 8 Root mean square error (RMSE) values (%) between the quantiles of the GEV and LP3 distributions and the quantiles of ARR87 and ARR13 for each of the ten selected stations

5 Conclusions

The intensity–frequency–duration (IFD) data, also known as design rainfall, are widely used in planning, designing and operation of water resources management projects. The reviewing/updating of IFD curves is an important research task in hydrology. This study compares the latest regional IFD curves in Australia, published in 2013, as a part of the new Australian rainfall and runoff (ARR) 2016 (Ball et al. 2016) with the at-site IFD curves to examine the expected degree of variation between the at-site and regional IFD data. It uses data from ten pluviography stations from eastern New South Wales (NSW), Australia. Fifteen rainfall durations (6 min to 3 days) and seven return periods (i.e. 1, 2, 5, 10, 25, 50 and 100 years) are considered. The IFD curves generated by two commonly used probability distributions, generalised extreme value (GEV) and log Pearson type 3 (LP3) distributions are compared. Empirical and polynomial regression methods are used for smoothing the IFD curves at a given site. Based on the three goodness-of-fit tests (i.e. Kolmogorov–Smirnov test, Anderson–Darling test and Chi-square test), it has been found that both GEV and LP3 distributions fit the annual maximum rainfall data (at 1% significance level) for all the ten selected stations. The developed IFD curves based on the second-degree polynomial represent better fitting than the empirical method. It has been found that the ARR87 and ARR13 IFD curves are generally higher than the at-site IFD curves derived here. The median difference between the at-site and regional ARR-recommended IFD curves is in the range of 13–19%. It is expected that the outcomes of this research will provide better guidance in selecting the correct IFD data for a given application in eastern NSW. The methodology developed here should be extended to more rainfall stations across Australia to have a better understanding of the possible differences in IFD data between different distributions and at-site and regional analyses.