1 Introduction

Combined sewer overflow (CSO) pollution was a challenge facing urban water environment managers across the globe (Andrés-Doménech et al. 2010; Passerat et al. 2011; Yu et al. 2013; Goore Bi et al. 2015; Rosin et al. 2021). In recent years, many cities in China have been affected by CSO pollution because point pollution, from the industrial wastewater and the domestic sewage, were dramatically improved by the Chinese ecological civilization practices.

For CSO polluted events, the drainage capacity of the combined interceptor sewer was a crucial parameter. Physically, CSO will occur when the actual outflow of the combined sewer exceeds the design inflow of the intercepted sewer. That is to say, mathematically, CSO will definitely occur when the design return period of the combined sewer is greater than that of the actual rainfall intensity. Design method of the combined sewer was similar to that of storm sewer. Its design drainage capacity and the equivalent return period is easy to obtain. In contrast, the design rainfall return period of the interceptor sewer is often ignored, because its design methodology is different from that of the combined sewer. In this context, it is of significance to calculate the equivalent return period of interceptor sewer.

In China, design of interceptor sewer is based on a design parameter-interceptor ratio (or interceptor multiple, n0), which is the ratio of the intercepted storm flow and the average sanitary flow. If we know the average sanitary flow and select the interceptor multiple, design inflow of interceptor sewer can be calculated. However, from the perspective of the rainfall intensity, the equivalent design return period of this design inflow of interceptor sewer was normally ignored. Especially for CSO related issues, it is of significance for estimating the rainfall intensity threshold of CSO.

Currently, the commonly used rainfall intensity formula was for the purposes of storm sewer design. It is called as Intensity—Duration—Frequency (IDF) or Depth—Duration—Frequency (DDF) curve in many countries across the globe. This intensity formula is obtained based on the annual maxima sampling (AMS). It is suitable for occasions with a return period of more than 2 years. In contrast, design return period of interceptor sewer is much smaller, because its design flow is only (n0 + 1) times of the average sanitary flow ( n0 = 2 ~ 5). That is the reason why CSO occurs many times annually and their return periods are normally lower than half a year. In this context, a low return period rainfall intensity formula was needed for analysis of CSO and it was investigated in this study.

The rest of this study is organized as follows: Sect. 2 gives a detailed introduction to the basic principles and flowchart of the proposed method. Section 2.1 introduces the scheme of rainfall event separation; Sect. 2.2 introduces the annual multi- event- maxima (AMEM) sampling method; Sect. 2.3 introduces the frequency curve (or distribution) selection and Sect. 2.4 introduces the parameter optimization of rainfall intensity formula based on MCMC (Markov Chain Monte Carlo) algorithm. In Sect. 3, a case study is elaborated to demonstrate the feasibility of the new method. Finally, findings of our work are summarized in Sect. 4.

2 Methodology

A flowchart for obtaining the low return period rainfall intensity formula was proposed based on the recorded rainfall data, as shown in Fig. 1. It consists of four parts: rainfall event separation, rainfall intensity sampling, frequency curve selection and parameter estimation of rainfall intensity formula.

Fig. 1
figure 1

Flowchart

2.1 Rainfall Events Separation (or Division) Method

Actually, the return period of the interceptor sewer is relevant to the CSO frequency. When the return period of the interceptor sewer is higher, the CSO frequency is relatively lower and vice versa. A CSO event is triggered by a rainfall event that return period of its maxima rainfall intensity is higher than that of the interceptor sewer. Therefore, to calculate the return period of the interceptor sewer, it is necessary to conduct rainfall event division firstly. An ideal rainfall event should be completely continuous rainfall process and independent of other rainfall events in the meantime (Jean et al. 2018; Li et al. 2019). Hence, it is very important to determine the appropriate interevent time between the two rainfall events. If this parameter is set too small, the two rainfall events do not meet the requirement of statistical independence. Conversely, it breaks the integrity of a continuous rainfall process (or a rainfall event). Currently, a very popular method to determine the interevent time was based on the assumption of coefficient of variation (Cv) of rainfall event intervals equals to 1 (Restrepo-Posada and Eagelson 1982). It was assumed that the prior distributions of rainfall characteristics (rainfall interval, rainfall depth and rainfall duration) are all the exponential distribution. In this study, not only the independence of rainfall events, but also the independence of the two adjacent CSO events should be ensured from the statistical perspective. In view of this requirement, runoff or flow from the two adjacent rainfall events, should not overlap or meet in the combined system (On this occasion, CSO frequency can be clearly defined and quantified). According to this principle, a rainfall event separation method was proposed accordingly. It can be explained as follows: the minimum interevent time (MIET) between the two rainfall events should be greater than or equal to the time of concentration (tc) of the studied tributary area (or the time of concentration from the most downstream interceptor well), as shown in Fig. 2a, b.

Fig. 2
figure 2

MIET calculation principle in this study

Time of concentration is a commonly used parameter for storm sewer design. For the interceptor sewers concerned in the actual project, their times of concentration can be obtained from the existing storm sewer design data or completion archives. Otherwise, these values should be re-calculated according to the local storm sewer design manuals. For instance, time of concentration (tc) equals to t1 plus t2. Here, t1 is the inlet time to the point where the runoff enters the combined sewers (or overland runoff time), t2 is the time of flow in the closed combined sewers to the studied interceptor well or sewer flow time.

2.2 Rainfall Intensity Sampling Method

Currently, there are many methods used for rainfall intensity sampling, such as the annual maxima (AM) sampling, the annual multi-sampling, and so on. They are suitable for the different applications with the varied return period spans respectively (Ahmad et al. 2019). Table 1 presents the relationship between the sampling methods and their applicable ranges of return period.

Table 1 Relationship between the sampling methods and their return period ranges

Compared with flooding events or storm sewer design scenarios, return period of the combined interceptor sewer is lower than half a year. Therefore, a new rainfall intensity sampling algorithm, AMEM sampling algorithm, was developed based on the rainfall event separation results. The sampling process is as follows:

  1. (a)

    Select a rainfall duration to be analyzed;

  2. (b)

    Corresponding to the current rainfall duration, calculate the maxima of (average) rainfall intensity for each rainfall event;

  3. (c)

    M maxima were selected annually in the descending order. Note, only one maximum can be selected for each rainfall event to ensure the statistical independence. Suppose there are N years of rainfall data. At this time, M * N maxima will be collected.

  4. (d)

    Sort the M * N data in descending order, and then select the first half of the data. M /2* N data (maxima) can be sampled.

  5. (e)

    Continued to select another rainfall duration and replicate step (a) ~ (d).

In general, the selected rainfall duration can be 5, 10,15, 20, 30, 45, 60, 90, 120 min. If sizes of the urban catchments studied are very big, 150 or 180 min can also be considered.

2.3 Frequency Curve Selection Method

After AMEM sampling, a number of rainfall intensities for different rainfall durations can be selected. Then these rainfall intensity values are sorted in the descending order, and their corresponding empirical frequency values can be calculated according to Weibull formula (Weibull 1939), shown as follows:

$$F=\frac{m}{n+1}\times 100\%$$

where m is the corresponding number of rainfall intensity, n is the total number of samples, and F is called as the empirical frequency of the corresponding rainfall intensity.

Later, the empirical frequency curve can be plotted by rainfall intensity—frequency pairs. Currently, hydrologists prefer to use the specific mathematical probability density distribution functions to summarize the empirical frequency features of rainfall intensity, such as lognormal distribution, Gumble distribution and Pearson type III (P-III) distribution (Yilmaza et al. 2014). This mathematical probability density distribution is normally referred to as the ‘theoretical’ frequency distribution (It is a relative concept. Actually, we will never know the true distribution). Based on the plotted empirical frequency graph, the best fitting frequency curve can be determined in terms of physical causality or statistics. Thus, the theoretical frequencies of rainfall intensities can also be obtained.

2.4 Parameter Estimation Method for Rainfall Intensity Formula

In China, three—parameter Horner rainfall intensity formula was generally used to represent the relations between rainfall intensity, rainfall duration and rainfall frequency for storm sewer design. In this study, the above relationships were similar. Therefore, the mathematical form of the low return-period rainfall intensity formula is the same as that of storm sewer design and Horner formula is still used here, shown as below.

$$i=\frac{A}{{\left(t+b\right)}^{n}} \;{\text{or}} \;i=\frac{A{}_{1}\left(1+C\mathrm{lg}P\right)}{{\left(t+b\right)}^{n}}$$

where i is the rainfall intensity (mm/min), P (=1/F) is the return period (a), t is rainfall duration (minute), and A, A1, C, b, n are the local parameters. In this study, t is equal to the time of concentration.

As mentioned earlier, the relationship between rainfall intensity ( i) and the theoretical frequency ( F) or return period ( P) can be determined by the theoretical frequency curve (probability density distribution function). Meanwhile, the maxima of rainfall intensity corresponding a rainfall duration (e.g. 5, 10,15, 20, 30, 45, 60, 90, 120 min) can be analyzed according to the recorded rainfall series in real applications (Zhang et al. 2016). On this occasion, data of i, t, and P should be collected for all the interceptor sewers for parameters (A, A1, C, b, n) estimation. Several methods (including moments method, maximum likelihood estimation, MCMC method) can be used to calculate the local parameters of low return-period rainfall intensity formula (Liu et al. 2021).

3 Results of the Case Study

3.1 Study Data

The recorded rainfall data from January 1, 2008 to December 31, 2017 was used in this case study, which are obtained from a urban meteorological station located in Sichuan province of southwestern China. The temporal resolution of the recorded rainfall data is 5 min, as shown in Fig. 3. It was used for calculation of the low-return period rainfall intensity formula.

Fig. 3
figure 3

The recorded rainfall data of the case study (2008/01/01–2017/12/31)

A combined sewer system was used for validation and verification of the resulted rainfall intensity formula, as shown in Fig. 4. Design data of the combined interceptor sewers, including design flows and times of concentration, were were from the completion archives and listed in Table 2.

Fig. 4
figure 4

The combined sewer system for the case study

Table 2 Design data of the combined sewer system

3.2 Rainfall Statistical Analysis Software

A rainfall statistical analysis software was especially developed to separate rainfall events and obtain their statistics, such as the total number of rainfall events, average rainfall intensity of each rainfall event, the maximum of (average) rainfall intensity for the specific duration, and so on.

3.3 Results

3.3.1 Results of MIET Uncertainty Evaluation

In order to evaluate the impact of MIET on the rainfall event separation, three schemes of MIET (1, 2 and 3 h) were studied, respectively. Results were compared with the total amounts and rainfall depths of rainfall events, as shown in Table 3.

Table 3 Statistics of total amounts, rainfall depth of rainfall events for different MIET values (unit: mm)

First, it reveals that the total amount of rainfall events decreases from 754 to 694 when MIET increases from 1 to 3 h. Meanwhile, variability of the total amount of rainfall events was -5.84% when MIET increases from 1 to 2 h, and -2.25% when MIET increases from 2 to 3 h.

In addition, it terms of mean rainfall depth, variability was 0.9% when MIET increases from 1 to 2 h, and 0.32% when MIET increases from 2 to 3 h. Meanwhile, for the median value, variability was 11.67% when MIET increases from 1 to 2 h, and 1.33% when MIET increases from 2 to 3 h. It is the same results for the the minimum.

In summary, according to the above results for different MIET scenarios, it can be found that most of the statistical indicators rarely changes when MIET equals to 2 h and 3 h. In addition, time of concentration of urban downtown catchment is generally not more than 2 h (except for mega-cities). Therefore, 2 h were selected to carry out the later work.

3.3.2 Results of Rainfall Intensity AMEM Sampling

First, rainfall statistical analysis software (RSAS) was used to obtain the maximum rainfall intensity of rainfall event for different rainfall durations (5, 10, 20, 30, 45, 60, 90, 120 and 180 min). For each rainfall duration, a total of 710 rainfall intensity samples can be obtained.

Second, the first 8 maxima of each year were selected as candidate samples. Thus, there are 80 candidate samples derived from the 10-year recorded rainfall data for the specific duration. Later, 80 candidate samples were sorted in the descending order and the first 40 data were selected as the final samples. Finally, according to Weibull formula, the empirical frequency of each rainfall intensity can be calculated for different durations, as shown in Table 4.

Table 4 The empirical frequency results of rainfall intensity (unit: mm/min)

3.3.3 Results of Frequency Curve Selection

In this study, the exponential distribution, lognormal distribution, Gumbel distribution were used for frequency curve analysis. For each distribution, eleven empirical frequency curves can be plotted for every rainfall duration, as shown in Fig. 5a–c. Moreover, the least square method was used to estimate parameters of the probability density distribution function. Correlation coefficient (CC) was selected as the fitness indicator. It was found that the lognormal distribution was the best-fit one and selected for the calculation of the theoretical frequency of rainfall intensity, as shown in Table 5. Figure 6 presents relationship curves between rainfall intensity, rainfall duration and return period. They can be called as IDF curves for low return periods.

Fig. 5
figure 5

Comparison of theoretical distribution curves and empirical frequency scatter data

Table 5 Comparison of the theoretical frequency distributions
Fig. 6
figure 6

IDF curves for low return periods

3.3.4 Results of Rainfall Intensity Formula

Based on the above data of the theoretical frequency, rainfall duration and rainfall intensity, MCMC algorithm was used to estimate the local parameters (A1, C, b, n) of rainfall intensity formula. Resulted rainfall intensity formula for low return period scenarios was shown as follows:

$$i=\frac{28.2852\left(1+0.882lgP\right)}{{\left(t+28.662\right)}^{0.921}}$$

where i, P, t are the areal unit sanitary flow (or equivalent rainfall intensity), return period, and time of concentration. of the interceptor sewer, respectively.

To demonstrate the applicability of the formula, it was used for a combined sewer system to calculate the return periods of the interceptor sewers. Results were shown in Table 2. It revealed that this formula is effective for interceptor sewers.

4 Discussions and Conclusions

In this work, a low- return- period rainfall intensity formula was proposed for the combined interceptor sewers. According to this formula, the equivalent return period (P) of the existing interceptor sewers can be estimated. Here, as the two known parameters, t and i can be obtained according to the completed sewer design archives (Table 2). In addition, this formula can also be used for design scenario of the new interceptor sewer. Design areal unit flow (i) can be acquired based on the formula according to the specified design return period (it is normally related to the CSO control standard) and the corresponding time of concentration. Under this circumstance, as the unknown parameter, the design interceptor ration(n0) can be calculated according to the design areal unit flow, the local population density and sewage discharge quota of the contributing catchment.

More importantly, this formula can also be utilized for calculating the threshold of CSO from the perspective of the interceptor sewer (or well). As mentioned above, design rainfall intensity of the combined interceptor sewers can be viewed as one of thresholds for occurrence of CSO events. If occurrence of CSO can be judged, CSO frequency can be estimated accordingly. From this standpoint, this formula is helpful to estimate CSO frequency. As we know, CSO spill frequency is one of the important indicators for controlling CSO. Hence, this formula can be employed for design of CSO abatement facilities to meet the requirement of the specific CSO frequency standards.

In addition, thresholds of CSO occurrence at the different interceptor wells are varied and this formula provide us an appropriate approach to calculate their return period thresholds. Currently, observation method is used in many studies for estimation of these thresholds and a large number of monitoring tasks should be carried out in wet weather. This formula offer us a theoretical pathway to estimate one of the critical thresholds of CSO and only few data are needed.

Nowadays, CSO frequency standard has been applied in more and more cities (Lau et al. 2002; Mailhot et al. 2015). The low return period rainfall intensity formula offers an approach to establish the relationship between interceptor ratio, the areal unit sanitary flow, time of concentration, design drainage capacity, return period of the interceptor sewers and CSO frequency.