1 Introduction

In most hydrological studies several random variables are generally considered independent. For instance, different combinations of rainfall intensity and duration (5), flood peak and volume (Di Michele et al. 2005), drought duration, intensity, magnitude, and so on. Thus, it is often important to link the marginal probability distributions of different variables to describe the main aspect of hydrological events. Conventionally, multivariate probability distributions are used in hydrology (Singh 1987). The most used joint cumulative distribution function (CDF) was the Gaussian, but it has the obvious limitations that the marginal distributions should be normal. The Box-Cox’s formula helped to reach condition by data transformation (Box and Cox 1964), however, do not always ensure that the recorded data follow a Gaussian PDF. Thereafter, further bivariate PDFs with non-normal margin distribution, such as the Gumbel bivariate exponential model (Gumbel 1960), with exponential margins, and bivariate Gamma distribution function used in hydrological studies (Grimaldi and Serinaldi 2006). Yue (2001) suggests bivariate Gamma distribution in pairs peak-volume and volume-duration flood frequency analysis. Knowing that extreme hydrological events can be represented by one of the three univariate extreme value distributions (Sklar 1959), bivariate Logistic and mixed models with Gumbel margins proposed (Gumbel and Mustafi 1967).

However, all mentioned models show some limits such as a similar family of marginal distributions, complicated mathematical formulation of an increased number of variables, and distinguish of marginal and joint behavior of variables. The copula PDF functions could overthrow these restraints. Many studies present some recent advances of copula using in hydrological modelings such as calculation of conditional probabilities, level curves of joint distributions, and return periods of multivariate events. Palynchuk and Guo (2011), applied copulas to joint distributions of rainstorm variables (depth, duration, and peak intensity). Serinaldi and Grimaldi (2007) used Fully Nested 3-copula to model the wave height, peak period, and maximum height as three characteristics of sea wave behavior. Renard and Lang (2007) used the Gaussian copula for multivariate extreme value analysis to prepare discharge-duration-frequency (QDF) models. Wang et al. (2009) used copula for flood frequency analysis of the confluence of river systems. Ghizzoni et al. (2010) used copula functions to provide an estimate of the joint probability of flood events in a multisite analysis. Grimaldi and Serinaldi (2006), Dupuis (2007), Genest and Favre (2007), Chen et al. (2010), Reddy and Ganguli (2012), Ganguli and Reddy (2013) used copula in multivariate flood frequency analysis (flood volume, peak, and duration). Razmkhah et al. (2016a, b) used copulas to analyze the spatial dependence of rainfall events and its effect on Rainfall-Runoff (RR) uncertainty modeling. Razmkhah et al. (2017) evaluated the effect of parameter correlation on the RR uncertainty, using 2-copula. Lorenz et al. (2018) used copula for downscale daily precipitation considering spatial correlation. Ayantobo et al. (2019) used a four-variate Archimedean copula in drought frequency analysis. Zhong et al. (2021) analyzed flash flood risk under the compound effect of soil moisture and rainfall using a copula function. Zening et al. (2020) analyzed reservoir inflow for four reservoirs on mainstream and its tributaries using a multivariate copula.

This paper aims to model the bivariate joint distribution of flood peak, volume, base flow duration, and time to peak using a wide range of copula families in the Karoon III basin. The innovation of this work is to analyze flood characteristics such as time to peak (tp), volume (V), peak discharge (Qp), and hydrograph base time (tb), using two variable copula and considering the dependence of the parameters. The accuracy of predicted flood properties such as volume and peak discharge and time to peak is important for flood warning purposes, maintenance, and the operating rules of the dam.

2 Methodology

2.1 Multivariate Flood Frequency Analysis

A probability distribution function (PDF) is a function used to define a particular probability distribution. It is the mathematical function that gives the probabilities of occurrence of different possible outcomes (Everitt 2006; Ash 2008, in Wikipedia) of a random phenomenon in terms of its sample space and the probabilities of events (subsets of the sample space) (Evans and Rosenthal 2010, in Wikipedia).

Flood frequency analysis (FFA) is the means by which flood discharge magnitude (Q) is related to the probability of its being equaled or exceeded in any year or to its frequency of recurrence or return period (T). The return period is used to indicate the average interval between floods of a given magnitude (Archer 1998). Most of the studies, use peak flood frequency analysis to find the occurrence probability of a flood event.

But to design most hydraulic structures, it is more accurate to have information not only about peak discharge (QP, m3/s), but also the duration of the event (tb, days), in terms of the time interval that discharge exceeds the fixed threshold, volume (V, m3) (Yue et al. 2002), and their joint probabilistic behavior together with flood peak because these flood variables are correlated (Singh and Singh 1991). Hence, many researchers have performed multivariate FFA. For example, Yue (2001) used a bivariate gamma PDF to describe the joint probability distributions, conditional distributions, and joint return periods of two correlated variables. A brief review of univariate and multivariate approaches was explained in Vittal et al. (2015).

Multivariate frequency analysis makes it possible to jointly evaluate Qp, tb, and V. Knowing the CDF of (P, V, D, …) and given design P, the pair (V, D) with a specific joint return period can be estimated (Grimaldi and Serinaldi 2006).

2.2 Derivation of Multivariate Distributions

The joint cumulative distribution function (H) of two random variables X and Y can be defined:

$$H\left(X, Y\right)=P\left(X\le x, Y\le y\right)=\textstyle{\int }_{-\infty }^{x}\textstyle{\int }_{-\infty }^{y}h\left(U, V\right)dUdV$$
(1)

where x and y are the values of X and Y, and P is the non-exceedance probability. Or the right-hand side represents the probability that the random variable x takes on a value less than or equal to x, and that y takes on a value less than or equal to y (Park 2018, in Wikipedia).

The joint return period of an event \(\left(X, Y\right)\), \({T}_{X,Y}\left(X, Y\right)\), can be expressed as:

$${T}_{X,Y}\left(X, Y\right)=\frac{1}{1-H\left(X, Y\right)}$$
(2)

2.3 Copula

Let observation \(\left({X}_{11},{X}_{21}, \dots \dots . {X}_{n1} \right), \dots \dots \dots \dots \dots \dots , \left({X}_{1n},{X}_{2n}, \dots \dots . {X}_{nN}\right)\) be drawn from a multivariate population of \(\left({X}_{1}, {X}_{2}, \dots .. {X}_{n}\right)\), where N is the number of observations and n is the number of variables. Let\({F}_{{X}_{i}}\left({X}_{i}\right), \mathrm{i}=1, 2, \dots \mathrm{n}\), be the marginal CDFs of \({X}_{i}, \mathrm{i}=1, 2, \dots \mathrm{n}.\)

The objective is to determine the multivariate distribution denoted as \({H}_{{X}_{1},{X}_{2, \dots \dots ..{X}_{n}} }\left({X}_{1}, {X}_{2}, \dots .. {X}_{n}\right)\) or simply H. Copulas are functions that join one-dimensional marginal PDFs to a multivariate PDF (Nelsen 1999). Therefore the multivariate PDF (H) expressed in terms of its marginal distributions and the associated dependence function, C, as:

$$C\left({F}_{{X}_{1}}\left({X}_{1}\right), \dots \dots ..{F}_{{X}_{n}}\left({X}_{n}\right)\right)={H}_{{X}_{1},{X}_{2, \dots \dots ..{X}_{n}} }\left({X}_{1}, {X}_{2}, \dots .. {X}_{n}\right)$$
(3)

where C, named copula, takes the quality of random variable dependence. The copula functions were developed by Sklar (1959), and different copula families were proposed by Nelsen, (1999). Some copula functions like Archimedean 2-copula e.g. Frank copula (Joe 1997) can model all ranges of variables dependence such as positive, null (independence), or negative (Grimaldi and Serinaldi 2006). Normal and t copulas are a subdivision of the Elliptical, which means that a copula is constructed based on the normal and t PDFs. Both normal and t are symmetrical, and the normal copula is a limiting case of the t when m becomes infinity. The advantage of the t copula is that it can capture lower and upper tail dependence of data (Goda 2010). Another widely used family of 2-copula is the Archimedean. A detailed list of the Archimedean copulas is presented in Nelsen (2006). Popular Archimedean includes the Gumbel, Frank, and Clayton copulas. A difference between them is the tail dependence: the Gumbel and Clayton capture upper and lower tail dependence, respectively, but Frank does not. One disadvantage of the mentioned 2-copulas is the symmetrical feature concerning diagonal lines of a unit square. To deal with, a class of asymmetrical Archimedean, which are a transformed shape of mentioned family, can be used. Goda (2010) compared simulated samples of the normal and t copula, Gumbel, Frank, and Clayton copulas for different correlation coefficients and parameters and tail dependence characteristics.

The dependence of the variables is usually measured via the canonical Pearson's coefficient of a linear correlation when this parameter only models a linear dependence, and in some cases, it may not exist (De Michele et al. 2005). In copula modeling, the dependence of random data is measured by Kendall’s and Spearman’s coefficients. The Kendall’s stated as the difference between the probability of concordance and discordance, while the Spearman’s expressed as the linear correlation coefficient of the probability-transformed variables. The measures are rank-dependent and invariant under strictly monotonic transformations and can be expressed in terms of a copula function (Goda 2010). A copula function can be expressed as:

$$\mathrm{C}\left({\mathrm{u}}_{1}, \dots ,{\mathrm{u}}_{\mathrm{d}}\right)=\mathrm{Pr}\left({\mathrm{U}}_{1}\le {\mathrm{u}}_{1},\dots ,{\mathrm{U}}_{\mathrm{d}}\le {\mathrm{u}}_{\mathrm{d}}\right)$$
(4)

where \({\mathrm{U}}_{\mathrm{i}}\sim \mathrm{U}\left(\mathrm{O},1\right)\) for \(\mathrm{i}=1,\dots ,\mathrm{d}\). Copulas allow characterizing the dependence structure of random variables from the marginal PDFs. For any multivariate (joint) distribution F of \({\mathrm{X}}_{1},\dots \dots .{\mathrm{X}}_{\mathrm{d}}\) as random variables, there exists a unique copula C as:

$$\mathrm{C}\left({\mathrm{u}}_{1},\dots \dots ,{\mathrm{u}}_{\mathrm{d}}\right)=\mathrm{F}\left[{\mathrm{F}}_{1}^{-1}\left({\mathrm{u}}_{1},\dots \dots ,{\mathrm{F}}_{\mathrm{d}}^{-1}\left({\mathrm{u}}_{\mathrm{d}}\right)\right)\right]$$
(5)

where \({\mathrm{F}}_{1}^{-1}\) as the quantile function defined by \({\mathrm{F}}_{1}^{-1}\left(\mathrm{u}\right)=\mathrm{inf}\left[\mathrm{x}: {\mathrm{F}}_{\mathrm{i}}\left(\mathrm{x}\right)\ge \mathrm{u}\right]\) (Sklar 1959).

Dupuis (2007) defined F as a multivariate PDF with marginal PDFs \({F}_{1},\dots .,{F}_{d}\) as:

$$\mathrm{F}\left({\mathrm{x}}_{1},\dots ,{\mathrm{x}}_{\mathrm{d}}\right)=C\left[{F}_{1}\left({x}_{1}\right),\dots ,{F}_{d}\left({x}_{d}\right)\right]$$
(6)

where C is copula function. Some of the bivariate copula families were presented in Razmkhah et al. (2016b). In this study, several bivariate copula families like Elliptical (Gaussian and Student t), Archimedean (Clayton, Gumble, Frank, Joe), BB1 (Clayton-Gumbel), BB6 (Joe-Gumbel), BB7 (Joe-Clayton), and BB8 (Frank-Joe)) have been fitted to data to select the most suitable one. For more details about copula families, readers could refer to Genest and MacKay (1986), Joe (1997), Favre et al. (2004), Nelsen (2006), and references therein. For the Archimedean families rotated versions cover negative dependence. The parameters of the 2-copula are estimated using maximum likelihood estimation.

2.4 Copula Model Selection

Copulas are selected according to the Akaike and Bayesian information criteria (AIC and BIC, respectively). First, all available copulas are fitted, then the criteria are computed and the copula with the minimum value chosen.

  • AIC

    The AIC is a criterion to estimator the relative quality of statistical models (McElreath 2016; Taddy 2019, in Wikipedia). For a statistical model with k as the number of estimated parameters, and \(\widehat{L}\) as the maximum value of the likelihood function of the model, the AIC value is calculated as (Burnham and Anderson 2002; Akaike 1974, in Wikipedia):

    $$\mathrm{AIC}=2\mathrm{K}-2\mathrm{Ln}(\widehat{\mathrm{L}})$$
    (7)

    Given a set of candidate models, the preferred model is the one with the minimum AIC value.

  • BIC

    The BIC or Schwarz information criterion is another criterion for model selection; the model with the lowest BIC is preferred. It is based, in part, on the likelihood function. It was developed by Schwarz (1978), considering a Bayesian argument.

    The BIC is defined as (Wit et al. 2012):

    $$\mathrm{BIC}=\mathrm{KLn}(\mathrm{n})-2\mathrm{Ln}(\widehat{\mathrm{L}})$$
    (8)

    where

    • \((\widehat{L})\) = the maximized value of the likelihood function of the model M, i.e. \(\left(\widehat{L}\right)=\mathrm{p}\left(x/(\widehat{\theta },\mathrm{M})\right)\), where \(\widehat{\theta }\) are the parameter values that maximize the likelihood function;

    • x = the observed data;

    • n = the number of data points in x, the number of observations, or the sample size;

    • k = the number of parameters estimated by the model

2.5 Conditional Probabilities

The conditional probabilities can be calculated, differentiating the corresponding copula (C) of random variables U and V (Nelsen 1999):

$$p\left\{V\le v|U=u\right\}=\frac{\partial C\left(U,V\right)}{\partial U}$$
(9)
$$p\left\{U\le u|V=v\right\}=\frac{\partial C\left(U,V\right)}{\partial V}$$
(10)

2.6 Copulas and Return Periods

Return period, as the average time between two successive realizations of the event, is a criterion for sizing the hydraulic structures and risk analysis (Salvadori and De Michele 2007). The univariate frequency analysis may lead to an under/overestimation of the risk because many hydrological events are characterized by the joint behavior of several random variables. Using copula can cover the mentioned disadvantage.

2.7 Bivariate Return Periods

Bivariate return period is discussed for x and y variables with increasing marginals Fx and Fy and C copula. Having U and V as Eq. (5) we have:

$$\left\{\begin{array}{c}U={F}_{x}\left(x\right)\\ V={F}_{y}\left(y\right)\end{array}\right. <=> \left\{\begin{array}{c}x={F}_{x}^{-1 }\left(U\right)\\ U={F}_{y}^{-1 }\left(V\right)\end{array}\right.$$
(11)

In practice, an event could be dangerous if either U or V exceed given thresholds, Eq. (12), or both U and V are longer than a prescribed value, Eq. (13):

$${E}_{U,V}^{\cup }=\left\{U>u\right\} \cup \left\{V>v\right\} \left( or\;case\right)$$
(12)
$${E}_{U,V}^{\cap }=\left\{U>u\right\} \cap \left\{V>v\right\} \left( and\;case\right)$$
(13)

where \(\cup\) and \(\cap\) are or and operators and \({E}_{U,V}^{\cup }\) and \({E}_{U,V}^{\cap }\) are the events with the greatest interest. Let the \({P}_{U,V}^{\cup }\) and \({P}_{U,V}^{\cap }\) be the probability of the events \({E}_{U,V}^{\cup }\) and \({E}_{U,V}^{\cap }\) respectively. Using copula we have (Nelsen 1999):

$${P}_{U,V}^{\cup }=P \left\{U>u \cup V>v\right\}=1-C\left(U, V\right)$$
(14)
$${P}_{U,V}^{\cap }=P \left\{U>u \cap V>v\right\}=\bar{C }\left(1-U, 1-V\right)$$
(15)

where

$$\bar{C }\left(U, V\right)=U+V-1+C\left(1-U, 1-V\right)$$
(16)

Then the return periods \({T}_{U,V}^{\cup }\) and \({T}_{U,V}^{\cap }\) of the events \({E}_{U,V}^{\cup }\) and \({E}_{U,V}^{\cap }\) can be defined as:

$${T}_{U,V}^{\cup }=\frac{\mu }{{P}_{U,V}^{\cup }}$$
(17)
$${T}_{U,V}^{\cap }=\frac{\mu }{{P}_{U,V}^{\cap }}$$
(18)

where \(\mu\) = average interval time between two successive events \({E}_{i}\) …. \({T}_{U,V}^{\cup }\) and \({T}_{U,V}^{\cap }\) are decreasing functions so we have (Salvadori and De Michele 2007):

$${P}_{U,V}^{\cup }\ge {P}_{U,V}^{\cap },$$
(19)
$${T}_{U,V}^{\cup }\le {T}_{U,V}^{\cap }.$$
(20)

Using the above equations, we can calculate the return periods of the events \({E}_{U,V}^{\cup }\) and \({E}_{U,V}^{\cap }\), in terms of the pair (x, y).

2.8 Univariate Versus Multivariate Analysis

In a multivariate hydrological event, univariate frequency analysis cannot evaluate the occurrence of an event completely. Salvadori and De Michele (2007) showed the desire return period \(\left(\pi \right)\) followed this equation:

$${T}_{U,V}^{\cup }\le {\pi \le T}_{U,V}^{\cap }$$
(21)

So if the desire return period \(\left(\pi \right)\) was not determined using joint PDFs design could be under dimensioned leading to risk increase, or overestimated leading to waste of money. De Michele et al. (2005) using the value of real design showed the return period of the event \({E}_{U,V}^{\cup }\) was about 20% smaller than \(\pi\), while the return period of \({E}_{U,V}^{\cap }\) was about 30% larger.

Another problem is the identification of the events having an assigned return period (T). In general, the solution is not unique, because different choices of U and V may yield the same T. Thus iso-lines of \({T}_{U,V}^{\cup }\) and \({T}_{U,V}^{\cap }\) should be identified (Salvadori and De Michele 2007).

2.9 Level Curves

The level curves of a joint distribution can provide practical information in the design process. Given \(0\le t\le 1\), the curves defined by the Eq. (22) are called the level curves of C:

$${L}_{t}=\left\{C\left(U,V\right)=t\right\}$$
(22)

The use of copulas may yield analytical solutions to the problem of calculating the iso-lines (Salvadori and De Michele 2007).

2.9.1 Isolines of \({T}_{U,V}^{\cup }\)

In the equation \({T}_{U,V}^{\cup }=\frac{\mu }{{P}_{U,V}^{\cup }}=\frac{\mu }{{1-C}_{U,V}}\), \({P}_{U,V}^{\cup }\) reduces by increasing C, while the \({T}_{U,V}^{\cup }\) increases. Increasing C is equivalent to choosing more extreme events with a longer return period. Salvadori and De Michele (2007) showed that iso-line plots would be the same for all of the distributions \({F}_{xy}\) generated by C.

2.9.2 Isolines of \({T}_{U,V}^{\cap }\)

We told that \({P}_{U,V}^{\cap }=\bar{C }\left(1-U, 1-V\right)=1-U-V+C\left(U, V\right)\) where \(0\le C\left(U, V\right)\le 1\). The authors referred to Salvadori and De Michele (2007) for preparing \({T}_{U,V}^{\cap }\) isolines.

2.10 Tail Dependence

The most common definition of tail dependence between two random variables X and Y is:

$${\lambda }_{U}=\underset{U\to 1}{\mathrm{lim}}P\left\{{G}_{X}\left(X\right)>U|{H}_{Y}\left(Y\right)>V\right\}$$
(23)
$${\lambda }_{L}=\underset{U\to {0}^{+}}{\mathrm{lim}}P\left\{{G}_{X}\left(X\right)\le U|{H}_{Y}\left(Y\right)\le V\right\}$$
(24)

where \({\lambda }_{U}\) and \({\lambda }_{L}\) are called the upper and lower tail dependence. \({G}_{X}\left(X\right)\) and \({H}_{Y}\left(Y\right)\) are the marginal distributions and U and V are threshold levels. The tail dependence corresponds to the probability that one margin to be large/small conditioned on the other margin being large/small. The tail dependence can also be defined by a copula function. The authors referred to Yue et al. (2014) for a detailed description.

2.11 Case Study

The Karoon III is a sub-basin of large Karoon, located in the southwest of Iran. The watershed boundaries limit to 49°, 30' to 52° E and 30° to 32°, 30' N. The area is approximately 24,200 (km2) with 30 reliable climatology and synoptic gauges. The elevation varies from 700 (m) at the outlet of the basin to 4500 (m) on the Kouhrang and Dena mountains. The digital elevation model (DEM) and major drainage network of Karoon III are shown in Fig. 1. About 50% of the watershed area has a higher elevation than 2500 (m). The watershed receives an average annual precipitation of 767 (mm) that about 55% of precipitation is as snowfall. The average daily discharge flow of the basin is about 384 (m3/s). The sub-basins physiographic characteristics were presented in Razmkhah et al. (2016ab) . Pole-Shaloo was the only existed streamflow gauge, at the outlet of the basin with recorded daily data used in this study.

Fig. 1
figure 1

DEM and major drainage network of Karoon III (Razmkhah 2018)

3 Results

Since Qp, tb, tp, and V are variables defined by the same physical phenomenon, they should be mutually correlated. Table 1 summarized the statistics of the Pearson's correlation coefficient (r), Kendall’s tau (\(\tau ),\) and Spearman’s rho (\({\rho }_{s})\) of tb, tp, Qp, and V random variables. A two-sided test with 5% and 1% of significance levels confirmed that variables are correlated. Pearson's correlation coefficient represents the linear dependence between two variables. Both \(\tau\) and \({\rho }_{s}\) are based on the ranks which represent the degree of association between two random variables (Yue et al. 2014). The copula (P, V) is more correlated than the others. Equation (25) shows the rank of correlation between flood characteristics:

Table 1 Dependence between variables as expressed by Pearson’s (r), Kendall’s tau (\(\tau ),\) and Spearman’s rho (\({\rho }_{s})\) correlation coefficients
$$\left(V, Qp\right)>\left({t}_{p}, {t}_{b}\right)>\left(V, {t}_{b}\right)>\left(V, {t}_{p}\right)>\left({t}_{b}{, Q}_{p}\right)>\left({t}_{p}{, Q}_{p}\right)$$
(25)

The order of correlation between variables could be explained by the physical governing low of rainfall-runoff processes. Extreme rainfalls resulted in a large amount of runoff volume and consequently peak discharge; long-based time floods have a longer time to peak; larger base time floods belong to the larger volume of floods and so on.

Table 2 shows copula families fitted to the flood characteristics, and related parameters, using R software (Brenchmann and Schepsmeier 2013). The P-value of the independent test showed that all of the variables are dependent. Quantities \(\left(V, Qp\right)\) and \(\left({t}_{p}, {t}_{b}\right)\), seem to follow Clayton’s family of copula while the others follow BB8 copula. The \(\left(V, Qp\right)\) and \(\left({t}_{p}, {t}_{b}\right)\) were the most correlated ones and tail dependence analysis showed that they are lower tail dependent, (Table 2). The BB8 copula family fitted to the other variables, did not show any lower or upper tail dependence.

Table 2 Copula families fitted to the flood characteristics

Figure 2a–f represent 3D plots of copula functions fitted to the flood characteristics. The variation of correlated parameters could be seen in the figures. It could be seen that \(\left(V, Qp\right)\) and \(\left({t}_{p}, {t}_{b}\right)\) are the most correlated variables with lower tail dependence (Fig. 2c, d).

Fig. 2
figure 2

3D plots of flood characteristics copula functions

Figure 3a–f show iso-lines (contour plots) of the fitted copula functions. As we told the identification of the events having an assigned return period (T) in the multivariate analysis does not have a unique solution, because different choices of U and V may yield the same T. Thus iso-lines of \(T\) prepared. The figures could be used to determine different \(\left({t}_{b}, {Q}_{p}\right)\) with a specified T or return period (figure a); \(\left({t}_{p}, {Q}_{p}\right)\) from figure b; \(\left({t}_{p}, {t}_{b}\right)\) from figure c; \(\left(V, {Q}_{p}\right)\) from figure d; \(\left(V, {t}_{b}\right)\) from figure e, and \(\left(V, {t}_{p}\right)\) from figure f. Then another flood characteristics could be extracted from the other graphs, and we have many possible variables with different probability to make a decision more precisely.

Fig. 3
figure 3

Contour plots (iso-lines) of copula functions

Figure 4a–f shows conditional probability iso-lines of the bi-variates copula functions of (V/U) for (U, V). The conditional probability iso-lines of (U/V) figure was not presented because of the large volume of the article. The figures could be used to determine a \({Q}_{p}\) for a specified \({t}_{b}\) and \({t}_{p}\) (figure a, b); \({t}_{p}\) for a specified \({t}_{b}\) (figure c); \({Q}_{p}\) for a specified V (figure d); V for a specified \({t}_{b}\), and \({t}_{p}\) for a specified V (figure e, f) and so on.

Fig. 4
figure 4

Conditional probability iso-lines of copula functions (V/U) for (U, V)

4 Conclusion

In this study, a multivariate flood frequency analysis was performed using 2-copula functions. By this multivariate tool, bivariate density and distribution of peak, volume, duration, and time to peak of flood events are determined without assuming the same type of marginal distributions. A brief analysis of observed data shows that peak, volume, duration, and time to peak variables are dependent in several manners. Properties of the copulas are described numerically and graphically. Advantages in using copulas are highlighted and confirmed in the case study.

The use of copulas yields analytical or graphical solutions to the problem of calculating the iso-lines of a joint distribution. Once the design return period has been decided, it is easy to calculate all the joint events such as \({E}_{U,V}^{\cup }\) and \({E}_{U,V}^{\cap }\) in terms of the pair \(\left(x, y\right).\)

In particular, we consider events, depending upon the joint behavior of two dependent random variables. This approach can be easily generalized to the multivariate case, using copula functions without traditional multivariate joint distributions. These results yield a copula-based procedure for the estimation of the probability of correlated random variables. The results help hydrologists for stochastic joint dependence of the variables and offer the possibilities of correctly sizing the structures.