1 Introduction

The initial rainfall-runoff pollution (or the first-flush pollution) is one of the major causes of urban non-point source pollution and urban water pollution (Wang et al. 2021). The initial rainwater storage tank is a commonly used measure to deal with this sort of problem(Costa et al. 2021; Mamun et al. 2020; Maniquiz-Redillas et al. 2022).

For design of the initial rainwater storage tank, a fundamental question is how to determine the effective and rational design volume to capture more runoff pollutants with the smaller volume. Several design methods were employed in the past decades. Empirical (or direct) method was the commonly used method across the globe. It is calculated based on the average initial rainfall depth and area of catchment. Merely the design value of the initial rainfall depth varies in different countries or regions. For example, 12–15 mm was used as the empirical values in the United Kingdom (Andoh and Iwugo 2002), 0.5 ( for scenarios with the first-flush effect ) or directly 1 inch in the United States (Andrle 2005; Wang 2008), and 4–8 mm in China (Xu et al. 2005). Particularly in Shanghai, 11 mm is recommended as minimum of the initial rainfall interception standard for the combined sewer system and 5 mm for the separate sewer system according to “Shanghai Urban Rainwater Drainage Plan (2020–2035)” (Zhang et al. 2022). In some cases, the runoff coefficient was also taken into consideration (often referred to as the method of empirical formula) (Chen and Adams 2005). Another method is the cumulative probability method. It is calculated by the runoff volume corresponding to the rainfall event with 90% probability (Andoh and Iwugo 2002). This method was commonly used in the United States and the United Kingdom. In addition, the hydrological and hydraulic simulation models (e.g. Stormwater management model (SWMM) are normally used when the actual rainfall, discharge and water quality data are available (Si et al. 2022; Shi et al. 2021).

In summary, there were several methods for calculating the design volume of the initial rainwater storage tank across the globe. However, the design level (or design return period) of design volume was normally ignored or vague for the current literatures. This makes it difficult for designers to optimize the design volume parameters, resulting in design schemes that are either too large or too small. In this context, taking the randomness of rainfall processes into consideration, a new method was proposed for calculating the design volume of the initial rainwater storage tank.

2 Methodology

The proposed method of this study should be based on the long-term recorded rainfall series of the target region. It includes several steps such as rainfall event division (or separation), selection of the initial rainfall events, sampling of the initial rainfall intensity, the calculation of the empirical and the theoretical frequency of the initial rainfall intensity, the initial rainfall intensity formula, and determination of design scenarios (including design return period, the initial loss, overland time of concentration, area of the catchment studied) and calculation of design volume. The flowchart of this study is shown in Fig. 1. Figure 2 illustrates the design background or scenario of the initial rainwater storage tank.

Fig. 1
figure 1

Flowchart of this study

Fig. 2
figure 2

Design scenario of the initial rainwater storage tank

2.1 Rainfall Event Division Method

The operation process of the initial rainwater storage tank is related to rainfall event. Therefore, rainfall event division is necessary for its design. For rainfall event division, the determination of the minimum inter-event time (MIET) is the question in point. In practice, in order to improve the efficiency of the initial rainwater storage tank, we often expect that it can play the greater role to deal with more pollutants or rainfall events. In this context, the emptying time of the tank was an important factor for selecting MIET. This enables that it can play the maximum role and generate maximum benefits in the two successive rainfall events. On the other hand, the determination of MIET should also take production and accumulation process of the surface pollutants into consideration. Therefore, the selection of rainfall events for design purpose is illustrated in the next step.

2.2 Selection Method of the Initial Rainfall Events

As mentioned above, not all the divided rainfall events are suitable for design purposes. It is expected that the more first-flush pollutants can be collected with the smaller volume. Therefore, the capture quantity and efficiency of the targeted pollutants should be emphasized here. According to these principles, the initial (or design) rainfall events (rainfall events used for this design purpose) should be selected firstly. Generally, the washing and concentration processes of the first-flush pollutants are related to rainfall intensity, rainfall amount and antecedent dry periods (ADP), and so on(Gnecco et al. 2005). For instance, ADP affects pollutants accumulated before the rainfall event, while the rainfall intensity and the rainfall amount affect hydrograph of pollutants flushed (Zhang et al. 2018). Therefore, the three indicators, rainfall intensity, rainfall amount and ADP, were used for selection of the design rainfall events (or the initial rainfall events).

2.3 Determination Method of the Initial Rainfall Period

Next, the initial rainfall period should be sampled from the initial rainfall events. First, the span of the initial period is crucial for the design. If it is too longer, we cannot catch the concentrated part of the pollutant hydrograph and overestimate the design volume. Otherwise, we cannot collect most of the pollutants from the whole catchment and underestimate the design volume. In addition, determination of the initial rainfall period should consider the processes of the initial loss of the catchment, such as filtration, retention and detention, and so on. Apparently, it is also relevant to the detailed conditions of the catchments. For simplicity, the overland time of concentration of the catchment can be used to determine the initial rainfall period. Here, the overland time of concentration is the time that surface runoff flows from the most remote point of the catchment to its outlet.

In addition, the initial rainfall period can also be determined based on the percentage of pollutants flushed, which is related to the rainfall intensity and the amount of pollution(Zhang and Wang 2016). It can be expressed as the following Eq. 

$$\frac{{L - {L_0}}}{{t - {t_0}}}=\frac{{\vartriangle L}}{{\vartriangle t}}=\frac{{{\text{d}}L}}{{{\text{d}}t}}= - ki(t)L$$
(1)
$$i=\frac{{{A_1}(1+C\lg P)}}{{{{(t+b)}^n}}}$$
(2)
$$\int_{0}^{t} {\frac{1}{L}} {\text{d}}L=\int_{0}^{t} { - ki(t)} {\text{d}}L$$
(3)
$$\ln \frac{{{L_t}}}{{{L_0}}}=\frac{{ - {A_1}(1+C\lg P)k}}{{1 - n}}[{(t+b)^{1 - n}} - {b^{1 - n}}]$$
(4)
$${t_I}=\sqrt[{1 - n}]{{\frac{{(1 - n)\ln {L_p}}}{{ - {A_1}(1+C\lg P)k}}+{b^{1 - n}}}} - b$$
(5)

where i is the initial rainfall intensity (mm/min), P (= 1/F ) is the return period (a), tI is the initial rainfall period (minute), A, C, b, n are the local parameters of the catchment, L is the amount of residual pollutant, Lt is the amount of residual pollutant at time t, L0 is the initial amount of residual pollutant on the surface, k is the average reduction coefficient, and \({L_p}=\frac{{{L_t}}}{{{L_0}}}\) is the percentage reduction of the pollutant.

2.4 Sampling Method of the Initial Rainfall Intensity

The annual multi-event-maxima method (AMEM) was used for the sampling of the initial rainfall intensity method in this work (Liu et al. 2023). The sampling process is as follows:

  1. (1)

    Select the first X minutes of each rainfall event as the initial rainfall period. Normally, it can be set as 5, 10, 15, 20, 25–30 min according to catchment conditions.

  2. (2)

    M initial (or design) rainfall events are selected every year according to three indicators (initial rainfall amount, maximum rainfall intensity during the initial rainfall period, and ADP). If y years of rainfall records are available, (y × M) rainfall events will be collected.

  3. (3)

    Sample the maxima of the average rainfall intensity for each rainfall duration (5, 10, 15, 20, 30 min are often used) during the initial rainfall period (X minutes) according to AMEM sampling principles.

  4. (4)

    Arrange the above samples of all the rainfall durations in the descending order, respectively.

  5. (5)

    The first (3 ~ 6)×y samples of the initial rainfall intensity for every rainfall duration will be used for the subsequent analysis.

Fig. 3
figure 3

Sampling procedure of the initial rainfall intensity

Figure 3 depicts the sampling method of the initial rainfall intensity.

2.5 Calculation Method of the Empirical / Theoretical Frequency Curve

After sampling, the initial rainfall intensities can be obtained for different rainfall durations and sorted in descending order. Next, their empirical frequency values can be calculated according to the Weibull formula, as shown below:

$$F=\left( {\frac{m}{{n+1}}} \right) \times 100\%$$
(6)

where m is the serial number of the corresponding initial rainfall intensity, n is the total number of samples, and F is the empirical frequency of the initial rainfall intensity.

The pairs of the initial rainfall intensity and the corresponding empirical frequency can then be used to plot the empirical frequency curve. According to the empirical frequency curve, the theoretical frequency function or curve can be acquired. Normally, the lognormal, exponential, Gumble, and Pearson type III (P-III) distribution functions and several parameter optimization methods such as method of moments, the least square method, and Markov Chain Monte Carlo (MCMC), can be used for this purpose (Deng 1996; Zalina et al. 2002).

2.6 Extrapolation Method of the Initial Rainfall Intensity Formula

Similar to storm sewer design, the framework of rainfall intensity formula can be also applied to calculate the initial rainfall intensity. Hence, the three-parameter Horner formula (Eq. (7) and Eq. (2)) was used and its parameters can be estimated through the optimization algorithms.

$$i=\frac{A}{{{{\left( {t+b} \right)}^n}}}$$
(7)

where i is the initial rainfall intensity (mm/min), t is the initial rainfall period (minute), and A, b, n are the local parameters.

2.7 Calculation Method of Design Volume of the Initial Rainwater Storage Tank

For the catchment studied, four parameters (design return period, the initial rainfall period, the initial loss and area of the catchment) should be determined before the calculation of design volume. First, area of catchment can be calculated through the geographic information tools. Generally, it should not exceed a few hectares. It is noted that the area of waters should be excluded from the total area of the catchment. Second, the design return period is relevant to the investment of the project and should be optimized through the cost effectiveness analysis. It is clear that this return period is different from that of the storm sewers. Third, the initial loss of the catchment can be estimated according to the empirical data or hydrological tools (SCS-CN method, Horton infiltration method, Green-Ampt method, etc.). For the current short duration rainfall-runoff issues, evaporation is often ignored. For the pervious area, depression and infiltration should be taken into consideration while for the impervious area, depression sometimes is significant. Finally, the initial rainfall period (t) is an important parameter in the initial rainfall intensity formula. Here, the overland time of concentration of the catchment studied is recommended to calculate the initial rainfall period.

When values of the above four parameters are obtained, the initial rainfall intensity can be acquired according to the initial rainfall intensity formula. Thus, the rainfall depth during the design duration can be calculated as: rainfall depth = the initial rainfall intensity × the design duration. Finally, design volume can be obtained, that is, design volume = (rainfall depth - the initial loss) × area of the catchment.

2.8 Study Data

The recorded rainfall data used in this study were obtained from an urban meteorological station in southwestern China. They span ten years from 1 January 2008 to 31 December 2017, with the temporal resolution of 5 min.

3 Results of this Study

3.1 Results of Rainfall Event Division

In this study, the rainfall statistical analysis software (RSAS)was used to separate rainfall events (Liu et al. 2023). The emptying time of the initial rainwater storage tank is 3 h; therefore, MIET was set as 3 h. According to the results of the rainfall event division, the 10-year original rainfall series were divided into 694 rainfall events, as shown in Table 1.

Table 1 Statistics of the total amount of rainfall events when MIET = 3 h

3.2 Results of the Selection of the Initial Rainfall Events

After rainfall event division, several rainfall events were selected from the above 694 rainfall events for design purposes. Normally, overland time of concentration of urban catchment (several hectares) is less than 30 min (According to Standard for design of outdoor wastewater engineering of China (GB50014-2021), it is about 5 ~ 15 min). Therefore, the initial 30 min of each rainfall event was used as the initial rainfall period in this work. Next, the three indicators (total rainfall amount, maximum rainfall intensity and ADP) were used to identify the selected rainfall events. Finally, a total of 80 rainfall events (8 rainfall events annually) were selected for the further study. Statistics of the selected rainfall events were shown in Table 2.

Table 2 Statistics of the selected rainfall events (for the initial 30 min)

3.3 Results of the Initial Rainfall Intensity Sampling

On the basis of the 80 design rainfall events mentioned above, the initial rainfall intensity for different durations (5, 10, 15, 20, 25, 30 min) were calculated and arranged in the descending order, respectively. Furthermore, the annual multi-sampling method were used for sampling of the initial rainfall intensity and calculation of the empirical frequency. Here, (3 ~ 6) × y (y = 10 in this study) samples were collected and three schemes (30, 40, 60 sets) were investigated for the different durations. Results were shown in Table 3.

3.4 Results of the Theoretical Frequency Curve Selection

Based on the results of the empirical frequency of the initial rainfall intensity, the theoretical probability distribution was investigated. In this study, the exponential, Gumbel and lognormal distributions were compared and three schemes were discussed respectively, as shown in Figs. 4, 5, and 6. According to coefficient of determinability (R2), the lognormal distribution was the best-fit one, as shown in Table 4.

Table 3 The empirical frequency of the initial rainfall intensity
Fig. 4
figure 4

Comparison of the theoretical distribution curves (30 sets of samples)

Fig. 5
figure 5

Comparison of theoretical distribution curves (40 sets of samples)

Fig. 6
figure 6

Comparison of theoretical distribution curves (60 sets of samples)

Table 4 Comparison of the theoretical frequency distributions

3.5 Results of the Initial Rainfall Intensity Formula

According to results of the theoretical frequency distribution, the initial rainfall intensity formula was obtained. The local parameters (A1, C, b, n) of the initial rainfall intensity formula were estimated using the Markov Chain Monte Carlo (MCMC) algorithm. Results were shown in Table 5. The corresponding intensity-duration-frequency (IDF) curves were shown in Fig. 7. It was found that when t = 15 min and P = 0.5 a, the initial rainfall intensities of the three schemes were 0.706, 0.711, and 0.707 mm/min, respectively, which means that the difference between them was less than 1%. Therefore, the number of samples was insignificant on the initial rainfall intensity. In the next steps, the sampling scheme of 60 sets was used for illustration.

Table 5 Comparison of the initial rainfall intensity formula for different schemes
Fig. 7
figure 7

IDF curve for the initial rainfall intensity

3.6 Results of Design Volume of the Initial Rainwater Storage Tank

For the calculation of design volume of the initial rainwater storage tank, four parameters (design return period, overland time of concentration, the initial loss and the area of the catchment) are necessary. In this study, the design scenarios for an urban catchment were shown in Table 6.

Table 6 Four parameters for the case study

Based on the initial rainfall intensity formula below, the initial rainfall intensity was obtained from Eq. (8). Then the rainfall amount (in rainfall depth, H) can be calculated as rainfall depth = the initial rainfall intensity ×the design duration (eq. (9)), then design volume can be obtained, that is, design volume (V) = (rainfall depth - the initial loss) × area of the catchments (eq. (10))

$$i = \frac{{19.278(1 + 0.9534\lg 0.25)}}{{{{(15 + 15.2947)}^{0.8711}}}} \approx 0.42({\text{mm/min)}}$$
(8)
$$H = i \times t = 0.42 \times 15 = 6.3({\text{mm}})$$
(9)
$$V = (H - {i_L}) \times A = (6.3 - 3) \times 4 = 13.2({{\text{m}}^3})$$
(10)

4 Discussions

In this work, a new method was proposed for calculating the design volume of the initial rainwater storage tank. Currently, the deterministic method is often used for rainwater storage tank design. For this design methodology, only one invariable design volume result is offered for the specific initial rainwater storage tank. However, the actual rainfall process is random process. Therefore, the probability and statistical theory should be introduced to cope with this issue. In this manner, the deterministic design method is only for the average design scenarios. Moreover, from the perspective of individual infrastructure design, it is necessary to provide the information of design level. Thus, the appropriate design level (or design return period) can be determined based on the investment and functional requirements of the project.

Actually, there are many purposes for the rainwater storage tank design, such as urban flooding mitigation, rainwater collection for water resource utilization, and so on. Here the function of the initial rainwater storage tank is to collect more pollutants from surface runoff and prevent pollution of the receiving waters. Therefore, the first-flush effect is the fundamental design principle in order to collect more pollutants with the most economical volume. In this sense, the size of the targeted urban catchment cannot be too large to ensure the existence of the first-flush effect for the initial surface runoff process. Normally, several hectares are suitable for urban catchment delineation. In the real world, there are often a number of similar catchments within the urban drainage districts (or watersheds). To abate this category of the non-point source pollution from the catchments, a number of the initial rainwater storage tanks should be designed and constructed. In this context, the initial rainfall intensity formula is necessary for guiding their designs. Under the design framework of the initial rainfall intensity formula, the suitable design parameters, such as overland time of concentration, the initial loss, the catchment area, can be determined or calculated by designers according to the detailed conditions of the targeted catchment.

In addition, it is important to improve the usage efficiency of the initial rainwater storage tank to optimize its design volume. From this perspective, the annual multi-event-maxima (AMEM) method was selected for sampling purpose. Therefore, the maxima of the design return period of the initial rainfall intensity formula is half an year. In the meantime, the emptying time is another factor for the optimal design. Normally, it is relevant to the rainwater discharge or treatment conditions. Given the fact that the production and accumulation of surface pollutants often require a longer period of time, the case study in this article is only for the reference of methodology and the larger MIET are recommended for the actual application purpose.

In summary, we always expect to achieve better (economic, efficient, effective, rational) design schemes of the initial rainwater storage tank. For this sake, further theoretical research, more experiments and applications are equally important for the the mitigation of the first-flush pollution from the separate or combined sewer system.

5 Conclusions

In this study, a new method is proposed based on the initial rainfall intensity formula to calculate the design volume of the initial rainwater storage tank for collecting the first-flush pollutants at the source of the catchments or sub-catchments. It is a stochastic design framework different from the current deterministic method, and the design level (return period) can be selected according to the requirements and demands of the decision-makers, planners or designers, and so on. It includes several steps such as rainfall event division, selection of the initial rainfall events, sampling of the initial rainfall intensity, the calculation of the empirical and the theoretical frequency of the initial rainfall intensity, the initial rainfall intensity formula and determination of design scenarios. First, rainfall events were obtained by dividing the original rainfall series, setting MIET as the emptying time of storage tank. The rainfall events for design purpose were selected from the above rainfall events according to three indicators (the initial rainfall amount, maximum rainfall intensity of the specified period, and ADP). Then AMEM method was used for the sampling of initial rainfall period (the initial X minutes) on the basis of the selected rainfall events. Second, samples were arranged in descending order and the empirical frequency of the samples was calculated according to the mathematical expectation formula (Weibull formula). The theoretical probability distribution function was estimated based on the MCMC algorithm and the initial rainfall intensity formula was obtained according to the three-parameter Horner formula. Finally, for the targeted catchment and the selected return period, the initial rainfall intensity can be calculated setting the initial period equals to overland time of concentration. Afterward, the design rainfall amount (design volume) can be obtained by the initial rainfall intensity, the initial period, initial loss and the catchment area. It was concluded that: (1) A method for the initial rainfall intensity sampling and the initial rainfall intensity formula were proposed for design volume of the initial rainwater storage tank. (2) The return period of the design volume can be taken into consideration in this method. (3) The proposed method is suitable for scenarios of collecting urban first-flush pollutants from small urban catchments.