1 Introduction

In the past decades, with the explosive growth of the urban population, more and more cities and towns along the river have built sluices, weirs, and other water conservancy projects to meet the needs of urban water and economic construction. For these sluice and weir projects, the determination of the design water level has an important engineering significance for the safe operation (Alila and Mtiraoui 2002; Ren et al. 2016) and thus must be calculated scientifically and reasonably. In general, the design level determination under a single flood can be treated by flood frequency analysis by fitting the most optimal univariate probability distribution of flood peak sequence (Modarres 2008). However, in the case of multiple floods, such as the main and tributary confluence, the design level calculation analysis cannot simply combine the different probability distributions of main and branch floods, especially when the river characteristics are dramatically altered by climate change and human activity. Hence, further investigation of the design level determination under multiple floods has a great engineering significance.

To our knowledge, the design water level driven by the combined effect of multiple floods has received little attention in the literature; besides, there is no effective calculation method for the engineering practice. Some engineering examples, such as the case study of GPSH in this paper, the design level is performed by flood regulating calculation with the Empirical combination method (ECM). The ECM usually uses the flood discharge of mainstream that satisfies the design RP combines the average flood discharge of tributary or vice versa. Though the ECM can be treated as a practical approach, particularly when it is insufficient to measure flood discharge samples, the applicability is still worth considering due to the following debations. On the one hand, the ECM comes from the summary of engineering practice and lacks a scientific theoretical basis. On the other hand, the ECM cannot accurately distinguish the different characteristics of main and tributary floods due to connecting the multiple floods simply; this would lead to uncertainty in determining the design water level and cause potential risks.

Copula can characterize the different effects of multivariates by connecting them in an appropriate form with a strong scientific basis. Because of its simple construction and flexible structure, the copulas have become the most popular mathematical modeling tool for multivariate hydrological frequency analysis (Favre et al. 2004; Salvadori and Michele 2004; Shiau and Modarres 2009; Zhang and Singh 2007a). Copulas have been widely applied in the multivariate hydrological risk frequency analysis such as flood peak discharge and volume (Duan et al. 2016; Li et al. 2013a, b; Sraj et al. 2015; Tsakiris et al. 2015; Zhang and Singh 2007b), duration and intensity of drought (Abdi et al. 2016; Mirakbari et al. 2010; Rad et al. 2017; Xu et al. 2015), the combination of rain and tide in the coastal area (Lian et al. 2013; Xu et al. 2014; Tu et al. 2018), and suspended sediment analysis (Shojaeezadeh et al. 2018; Peng et al. 2020). In another study, Dodangeh et al. (2020) investigated the flood frequency of interconnected rivers by applying Copulas. The results show that the binary distribution is more appropriate than the univariate distribution for flood frequency analysis at the confluence of two rivers. Therefore, for the FCDL investigation at the intersection of main and tributary, the Copula would be a proper approach to characterize the different influences of multiple floods.

Based on copula theory, the design level investigation driven by the combined action of main and tributary in the study can be concluded as follow, (1) modeling the main and tributary floods with an appropriate copula; (2) determining the design flood combination according to AND and OR RPs; (3) flood regulating calculation. However, both OR and AND RPs correspond to numerous flood combinations for a given T. Despite the Same-Frequency (Li et al. 2013a, b) and Most Likely (Salvadori et al. 2011) combinations being widely applied, the most appropriate flood combination to determine the design level is still controversial. Thus, further investigation should be conducted to avoid the controversy of selecting flood combinations for design level determination under multiple floods.

The main purpose of this study is to investigate the design level driven by the combined action of main and tributary floods and thus evaluate the uncertainty of the FCDL calculation caused by samplings. To reach this purpose, the FCRP performed by the bivariate Copula was proposed to identify the different roles of main and tributary fluxes on design level determination. The design levels conducted by the proposed FCRP were compared with the conventional AND and OR RPs. Furthermore, the increasing concern of uncertainty analysis due to the sample sizes n was also discussed. This paper might be the first to focus on the design level analysis under main and tributary floods in literature, and the results would be of an innovative reference significance for calculating design levels driven by multiple floods.

2 Theory and Method

2.1 Copula Theory

For bivariate cases, F(x) and F(y) are cumulative marginal distribution functions of the two continuous random variable sequences X and Y, respectively. According to Sklar (1959), the joint distribution function FXY(x, y) of the random variables x and y can be expressed as,

$${F}_{XY}\left(x,y\right)=C\left[F\left(x\right),F\left(y\right)\right]=C\left[u,v;\theta \right]$$
(1)

where C[u,v; θ] is a bivariate Copula function with a parameter θ.

There are Ellipse copulas (Fang et al. 2002), Plackett copulas (Plackett 1965), and Archimedean copulas three main types of bivariate Copulas applied in hydrological analysis, in which the Archimedean Copulas are the most widely performed due to their simple symmetrical structure and ease of calculation (Brahimi et al. 2011; Mou et al. 2018; Nelsen 2000). Thus, four Archimedean copulas, Clayton, Frank, Gumbel-Hougaard (GH), and Ali-Mikhail-Haq (AMH) copulas presented in Table1, have been applied to model the main and tributary sequences in the study. The parameter θ is performed by solving the relationship between θ and Kendall correlation coefficient τ.

Table 1 Four commonly used Archimedean Copula functions

For bivariate hydrological probability analysis, the OR and AND RPs are defined as follow,

$${T}_{O}=\frac{1}{P\left(X>x\bigcup Y>y\right)}=\frac{1}{1-C\left(u,v\right)}$$
(2)
$${T}_{A}=\frac{1}{P\left(X>x\bigcap Y>y\right)}=\frac{1}{1-u-v+C\left(u,v\right)}$$
(3)

According to the OR and AND RPs, there would be numerous flood combinations with a given T; this indicates no one-to-one correspondence between design T and design flood combination. The risks associated with varying combinations of the flood are extremely diverse, and the design water levels carried out by different flood combinations are various, which will result in inconsistent flood protection for water projects. An increasing body of research suggests that the threat caused by the flood is related to the characteristics of the flood itself and the features of the flood discharge boundary.

The coupling effect of floods and different flooding boundary conditions can produce a variety of flood control characteristic parameters, such as flood control levels of floodgates and dams, and flood control storage capacity, the difference between the water depth after the jump of the stilling pool and the water depth of the channel, and the flood surface of the track. In the paper, the average interval time when the flood control parameter f determined by different flood combinations exceeds its specific flood control parameter F is defined as the FCRP. The design water level z under the combined action of main and tributary floods is considered as the flood control parameter; thus, the dangerous events can be described as,

$${E}_{X,Y}^{F}=\left\{H\left(x,y\right)>Z\right\}$$
(4)

where x and y are main and tributary flood peak discharges, respectively, H is a method for flood regulation calculation, Z is a design water level under a specific return period, m.

The recurrence period corresponding to \({E}_{X,Y}^{F}\) is the FCRP, which is,

$${T}_{F}=\frac{1}{P\left\{H\left(x,y\right)>Z\right\}}=\frac{1}{1-{F}_{Z}\left(z\right)}$$
(5)

where FZ(z) is the cumulative distribution function of the design water level.

In multivariate hydrological analysis, the most likely combination is the most concerned, that is,

$$\left({u}_{m},{v}_{m}\right)=\mathrm{arg\,max}f\left(u,v\right)$$
(6)
$$f\left(u,v\right)=c\left(u,v\right)f\left(x\right)f\left(y\right)$$
(7)

where c(u,v) is the joint distribution probability density function, f(x) and f(y) are the marginal distribution probability density functions of main and tributary floods, respectively.

2.2 Method for Flood Regulation Calculation

Figure 1 shows the relationship between the runoff discharges of Guigang(GG) and Dahuangjiangkou(DHJK) stations corresponding to the bottom water level of GPSH. The design water level of GPSH could be performed by flood regulation calculation as follow,

$$Q=\sigma \varepsilon MB\sqrt{2g}{{H}_{0}}^\frac{3}{2}$$
(8)

where σ is submerged coefficient, ε is lateral shrinkage coefficient, \(M=\sqrt{2g}m\), in which m is discharge coefficient, B is the net width of a single gate, m, H0 is the total hydraulic head of the wire crest, m.

Fig. 1
figure 1

Runoff from GG and DHJK stations corresponding to the natural water level at the bottom of the GPSH

2.3 Uncertainty Measurement

The horizontal average offset (DX, m3/s), vertical average offset (DY, m3/s), area of confidence interval (S, m3/s·m3/s), and average Euclidean distance (d, m3/s) (Pham-Gia and Huang 2001; Liu et al. 2010; Yin et al. 2018) are applied to investigate the sampling uncertainty quantitatively. In this paper, DX and DY are the estimated deviations between main and tributary flood discharges and design values derived from the measured sample sequences, respectively. S and d are used to measure the spatial distance between the design value point data and the measured sample series design value. ContourSizes Functions of the R program perform the confidence interval area S, and the other three metrics are expressed as follow,

$$D\left(X\right)=\frac{1}{N}\sum_{i=1}^{N}\left|{x}_{Ti}-{\hat{x}}_{T}\right|, D\left(Y\right)=\frac{1}{N}\sum_{i=1}^{N}\left|{y}_{Ti}-{\hat{y}}_{T}\right|$$
(9)
$$d=\frac{1}{N}\sqrt{\sum_{i=1}^{N}{\left({x}_{Ti}-{\hat{x}}_{T}\right)}^{2}+{\left({y}_{Ti}-{\hat{y}}_{T}\right)}^{2}}$$
(10)

where N is the number of repeated samplings, which is taken as 1000 in the paper, \(\left({\hat{x}}_{T},{\hat{y}}_{T}\right)\) is the most likely design combination of the measured sequence. The smaller the index value, the smaller the uncertainty.

2.4 Steps for FCDL Calculation and Sampling Uncertainty Analysis

Combined with the Monte Carlo random simulation approach, the specific steps to calculate the design water level are,

  1. 1.

    Determine the optimal marginal distribution F(x), F(y) of the measured flood sequences, and the optimal joint distribution model C(u,v).

  2. 2.

    Generate two random numbers n1 and n2 at (0,1), letting n1 = u, according to \({n}_{2}=C\left(v|u\right)=\frac{\partial }{\partial v}C\left(u,v\right)\) obtain a set of related probabilities u and v.

  3. 3.

    Convert u and v into flood combinations x and y, a water level Z is obtained after flood regulation calculation.

  4. 4.

    Repeat steps 1) ~ 3) N times to get N water levels Z, then sort the water levels Z, calculate the design water level follow P(Z > z) = 1/T with a given T.

The specific steps to investigate the sampling uncertainty under the combined action of main and tributary floods are as follows:

  1. (a)

    The same as step 1) – 2) aforementioned to obtain flood sequences X and Y with a sample size n.

  2. (b)

    Repeat step a for N times to get N flood combinations X and Y with sample size n, and N Copula function parameter θ.

  3. (c)

    Calculate N most likely combinations (xm, ym) according to the Eqs. (6) and (7) with a given T.

  4. (d)

    Obtain the (1-α) confidence interval area for the N (xm, ym) with a given certain significance level α by applying the Kernel density estimation.

  5. (e)

    Obtain N design water levels zOR performed by flood regulation calculations on the N flood combinations (xm, ym).

3 Case Study

The GPSH project is located at the Yujiang River section at the Yujiang and Qianjiang rivers intersection in Guiping City, Guangxi, China (Fig. 2). The engineering is mainly composed of shipping lock, overflow dam, and sluice, in which the length of the overflow dam is 296 m, the width of the total overflow surface is 238 m with 17 holes set to discharge. Thus the net width of each hole is 14 m, and the height of the weir crest is 21 m. The design recurrence period of the sluice of GPSH is once every 100 years with a design flood level is 43.48 m, obtained after flood regulation calculation by performing the combinations of a 100-year flood peak at DHJK station and 0.2 times the flood peak value.

Fig. 2
figure 2

Overview of the study area and location map of GPSH

Fifty-eight years (1953–2010) measured flood peak discharge data of the GG and DHJK hydrographic stations were used in the paper. The data collected from the GG station is defined as the Y series. The mainstream flood sequence X is conducted by subtracting the measured sample data of the GG station from the DHJK station.

4 Result and Discussion

4.1 Selection of Marginal Distribution and Copula Distribution

PE3, Generalized Extreme Value (GEV), and Weibull are adopted in the paper to model the marginal distributions. Parameters of marginal distributions are conducted by Moment estimation; the K-S Test is applied to examine the fitting degree of the sample theoretical and empirical distributions. The critical value of the K-S test in the paper is 0.179, all the marginal distributions shown in Table 2 pass the K-S test. Therefore, according to the AIC and RMSE criteria, both main and tributary flood samples are fitted by PE3 distribution.

Table 2 selection of marginal distribution

As shown in Table 3, the joint distribution of main and tributary floods is fitted by Clayton Copula for its minimal RMSE and AIC values of 0.05 and –346.258, respectively.

Table 3 Copula function fitness evaluation result

Thus, the joint Copula model for main and tributary floods can be described as follow,

$${C}_{Clayton}\left(u,v\right)={\left({u}^{-0.377}+{v}^{-0.377}-1\right)}^{-1/0.377}$$
(11)

According to the analysis above, the joint distribution probability of main and tributary floods can be plotted as Fig. 3,

Fig. 3
figure 3

Joint distribution probability of main and tributary floods

4.2 Analysis of Joint Characteristics of Main and Tributary Floods

The contours of OR and AND RPs with design T = 100, 50, 20, 10, 5, and 2 are plotted in Fig. 4. It can be noted that the RP contours are symmetrically distributed with the 45° line; this may indicate that the main and tributary floods have the same effect on the design level calculation of the sluice. However, in actual projects, the impact of main and tributary floods on the FCDL determination of hydraulic engineering is usually diverted with the river characteristics. Therefore, though Fig. 4 shows the correlation between the main and tributary floods to a certain extent, it fails to deliver the various impact of the main and tributary floods on calculating the design water level of the project.

Fig. 4
figure 4

T-level curves of AND and OR recurrence period

Figure 5a shows the contours carried out by FCRP; as the design T increases, the shapes tend to be more perpendicular to the x-axis; this indicates that the main flood has a much greater impact on the sluice’s FCDL calculation than the tributary flood. And the effect of the mainstream increases with increasing design T. The results show that FCRP can better characterize different floods' impacts in design level calculations than OR and AND, which consider the same effects of main tributary floods.

Fig. 5
figure 5

a. T-level curves of FCRP; b. Comparison of three different recurrence periods

For the FCRP, flood combinations of the curve correspond to the same water level Z, while the OR and AND recurrence periods are conducted to the same T. Figure 5b describes the distribution of the main and tributary flood combinations contour conducted by OR, AND, and FCRP three different standards with the design T = 100. It is worthy to note that the FCRP contour is between OR and AND ones; this indicates that the design water level performed by the FCRP can avoid the over-or-under designed water levels by OR or AND RPs, respectively. Therefore, the FCRP proposed in the study provides an appropriate approach to conducting multiple floods' design water levels.

4.3 Uncertainty Analysis of Main and Tributary Flood Combinations

According to Serinaldif (2013), the OR RP is recommended to investigate the multivariate hydrological uncertainty analysis. Thus, OR RPs with design T of 20 and 50 years are chosen to evaluate the sampling uncertainty in this paper. The measured sample length of main and tributary flood combinations is 58; compared with the measured samples and simplicity, the simulated sample lengths were 58, 100, and 200, respectively.

The parameter uncertainty estimation is an essential part of the sampling uncertainty assessment. Uncertainty analysis of parameters is a statistically based mathematical analysis that requires random sampling for a large number of parameters. Table 4 shows the interval distribution of the joint distribution parameters based on Monte Carlo simulation with different sample sizes at 95% confidence conditions. It can be seen that the amplitude of the parameters decreases as the sample size increases.

Table 4 95% confidence interval for joint distribution parameters of different sample sizes

As shown in Fig. 6, for the same T, the binary confidence interval of the joint design value decreases with the increasing sample size. In contrast, the binary confidence interval increases as the T increases with the same sample size n. Moreover, it can also be seen that at T = 50 and n = 58, the most likely combinations scatter from the contour of T = 10 to 100. In contrast, at T = 20 and n = 200, the most likely varieties are mainly concentrated between the T = 10 and the T = 50 curves. These findings indicate that when the sample size n is smaller and the design recurrence period is larger, the uncertainty of the design value is also greater.

Fig. 6
figure 6

Binary confidence interval graph of joint design values

The four uncertainty evaluation indexes were performed to characterize the sampling uncertainty under multiple floods, and the results are listed in Table 5. It can be concluded that DX, DY, d, and the area of 95% confidence interval are reduced by 42.5%, 42.9%, 44.7, and 73.9% respectively at T = 20 with increasing sample size n, and at T = 50 it is reduced by 39.4%, 46.3%, 41.1%, and 68.3% respectively.

Table 5 Calculation Results of Uncertainty Index of Main and Tributary Flood Combination

4.4 Uncertainty Analysis of Design Level

For the safety of hydraulic engineering, the most important control factor is the design water level rather than the flood discharge; thus, the uncertainty of the flood combinations due to sampling can be considered to uncertainty analysis of the design water level. Here, the design level uncertainty conducted by the OR RP and FCRP was further investigated. Following the flood regulation calculation of the GPSH, the N design levels corresponding to the most likely combinations of OR RP and the flood protection levels under the FCRP criteria can be obtained separately for each set of parameters. Table 6 shows the design level estimation results of the GPSH under different criteria of recurrence period. It is noted that the 95% confidence interval and standard deviation of the design value calculated by the FCRP are less than the OR RP with the same sample size. Take the sample size n = 58 and T = 20 as an example; the interval length and standard deviation of the FCRP are 14.1% and 14.4%, respectively, smaller than the OR return period; this indicates the design water levels conducted by the FCRP are more reliable. Therefore, the FCRP can effectively avoid the problem of uncertainty in the design of flood protection under the action of multiple floods due to the sample length.

Table 6 Estimation of flood control level of sluice under different criteria of recurrence period

Furthermore, when n is less than 100, the interval length of both design standards exceeds 60 mm. As a result, the standard deviations are greater than 0.5; a short size of the flood sample series creates a large uncertainty in the FCDL determination of hydraulic engineering. Therefore, if necessary, the flood sample lengths need to be extended to enhance the reliability of the results.

The box diagrams of the design water level for the FCRP and OR RP are further described in Fig. 7. It can be seen that the length of the design level box becomes shorter as the sample size n increases when T is the same, while the length of the design level box becomes longer as T increases when n is the same. Those findings indicate that the larger the sampling n, or the smaller the design T, the more stable the calculation results, consistent with the analysis of the conclusion in Table 6. Furthermore, the median levels of the box diagram for the FCRP are smaller than that of the OR RP under the same condition, suggesting the fact that design water levels conducted by the FCRP are smaller than those for the OR RP, which is consistent with the analysis in Fig. 5b.

Fig. 7
figure 7

Comparison of design levels box diagrams for FCRP and OR RP

5 Conclusion

Analysis of FCDL determination under the combined action of main and tributary floods is extremely important for the safe operation of hydraulic engineering. Still, it has attracted little attention in the literature. The paper proposes the FCRP based on the Copula to investigate the design level driven by multiple floods with a case study of GPSH. Four symmetric Archimedean with three marginal distributions were applied to model the main and tributary floods. The sampling uncertainty to the flood combinations and design water level calculation under multiple floods is further analyzed. Some main conclusions can be drawn from the study,

  1. 1.

    The Clayton Copula with PE3 and PE3 marginal distributions is the best-fit joint distribution for main and tributary floods.

  2. 2.

    The proposed FCRP can identify the different roles of main and tributary floods on design level calculation compared with the ECM and conventional AND and OR RPs. The design level conducted by the FCRP can avoid the situation over-or-under performed by the OR or AND RP.

  3. 3.

    The uncertainty of the joint design combinations under the effect of multiple floods decreases with the increase of sample size n but increases with the rise of the design T.

  4. 4.

    The 95% confidence interval and standard deviation of the FCDL calculated by FCRP are smaller than that of OR RP, which means the FCRP can reduce the uncertainty of design level calculation under the condition of multiple floods.

This paper provides innovative ideas for the design of flood protection under the combined effect of multiple floods, which can be used as a reference for practical projects.