Evaluation of additional physiographical variables characterising drainage network systems in regional frequency analysis, a Quebec watersheds case-study

Msilini, A.; Ouarda, T. B. M. J.; Masselot, P.

doi:10.1007/s00477-021-02109-7

Evaluation of additional physiographical variables characterising drainage network systems in regional frequency analysis, a Quebec watersheds case-study

Original Paper
Published: 12 October 2021

Volume 36, pages 331–351, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Evaluation of additional physiographical variables characterising drainage network systems in regional frequency analysis, a Quebec watersheds case-study

Download PDF

588 Accesses
4 Citations
Explore all metrics

Abstract

Regional Frequency Analysis (RFA) relies on a wide range of physiographical and meteorological variables to estimate hydrological quantiles at ungauged sites. However, additional catchment characteristics related to its drainage network are not yet fully understood and integrated in RFA procedures. The aim of the present paper is to propose the integration of several physiographical variables characterizing the drainage network systems in RFA, and to evaluate their added value in predicting quantiles at ungauged sites. The proposed extended dataset (EXTD) includes several variables characterising drainage network characteristics. To evaluate the new variables, a number of commonly used RFA approaches are applied to the extended data representing 151 stations in Quebec (Canada) and compared to a standard dataset (STA) that excludes the new variables. The considered RFA approaches include the combination of two neighborhood methods namely the canonical correlation analysis (CCA) and the region of influence (ROI) with two regional estimation (RE) models which are the log-linear regression model (LLRM) and the generalized additive model (GAM). The RE models are also applied without the hydrological neighborhood. Results show that regional models using the extended dataset lead to significantly better flood quantile predictions, especially for large basins. Indeed, the variable selection performed with EXTD consistently includes some of the new variables, in particular the drainage density, the stream length ratio, and the ruggedness number. Two other new variables are also identified and included in the DHR step: the circularity ratio and the texture ratio. This leads to better predictions with relative errors about 29% for EXTD, versus around 42% for STA in the case of the best combination of RFA approaches. Thus, the proposed new variables allow for a better representation of the physical dynamics within the watersheds.

Development of a kriging-based regional flood frequency analysis technique for South-East Australia

Article Open access 08 August 2022

Regional L-Moment-Based Flood Frequency Analysis in the Upper Vistula River Basin, Poland

Article 05 May 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Regional frequency analysis (RFA) procedures are commonly used in hydrology to estimate flood and low-flow quantiles at sites where little or no hydrological data is available. Generally, RFA includes two main steps: delineation of homogenous regions (DHR) and regional estimation (RE) (e.g. Chebana et al. 2014; Chebana and Ouarda 2007; Ouarda 2016). In this context, climatic, morphometric and physiographic characteristics of the watershed are widely used to describe geomorphic processes (e.g. Baumgardner 1987; Hadley and Schumm 1961; Marchi and Dalla Fontana 2005; Tramblay et al. 2010) in order to predict hydrological variables using RFA approaches (e.g. Dawson et al. 2006; Dodangeh et al. 2014; Goswami et al. 2007; Seidou et al. 2006; Tsakiris et al. 2011).

A number of physio-meteorological variables, such as basin area, basin slope, precipitation characteristics and land occupation are commonly used in the field of hydrology and more precisely in the RFA procedures. They are considered as the most relevant variables for these studies based on their high correlation with the hydrological variables (Chokmani and Ouarda 2004). In addition to the commonly considered variables (a more exhaustive list is in Table 1), drainage network characteristics (Jung et al. 2017) and tectonic setting (e.g. Ahmadi et al. 2006; Hamed et al. 2014) may have a strong impacts on hydrological dynamics, and are consequently related to flood quantiles. However, they are not yet well investigated and integrated in RFA studies. Indeed, the assessment of morphometric and physiographic variables requires the analysis of a number of stream characteristics (e.g. ordering of the streams, bifurcation ratio, texture ratio, stream length ratio, etc.). These variables characterize the basin shape as well as the drainage system, and can be useful to model the hydrological dynamics. Youssef et al. (2011) also indicated that the circularity ratio, number of orders and drainage density have a direct impact on the hydrological risk. Hence, the integration of these variables in the procedures for the regionalization of extreme hydrological events may contribute to the enhancement of RFA results. Variables related to drainage network systems are already used in several morphometric and hydrologic studies (e.g. Ameri et al. 2018; Biswas et al. 1999; Kaliraj et al. 2015; Pareta and Pareta 2011; Rai et al. 2017; Ratnam et al. 2005; Reddy et al. 2004; Sivasena Reddy and Janga Reddy 2013; Vijith and Satheesh 2006; Youssef et al. 2011) and they can eventually be useful in regionalization studies. These variables can be extracted based on classical approaches such as topographic maps and field examination or with advanced techniques using remote sensing and Digital Elevation Models (DEM). Remote sensing techniques coupled with the potential of GIS tools are increasingly popular. Indeed, they make it possible to calculate the various characteristics of the basin very quickly and more efficiently based on a DEM which is not possible in the past.

Table 1 Predictor variables used in a number of previous regionalization studies

Full size table

During the last decades, the focus in RFA has been mainly on the development of new delineation and estimation methods (e.g. Durocher et al. 2015; Ouali et al. 2016; Wazneh et al. 2016). Meanwhile, the list of physiographical and meteorological variables used as predictors has seen little evolution. In the present study, a number of commonly used RFA approaches are applied to test and evaluate the potential improvements that may result from the adoption of new physiographic variables.

The objective of this work is to propose the use of new physiographical variables related to the basin shape and drainage network and argue about their usefulness. To evaluate their added value for quantile prediction in RFA, they are computed and used for a set of 151 basins in Quebec (Canada). More specifically, the objective is to use both the standard and extended databases to predict quantiles associated to several return periods, and compare their prediction performances. In this work, standard RFA methods are considered for quantile prediction, namely Canonical correlation analysis (CCA) (Ouarda et al. 2000) and the region of influence (ROI) (Burn 1990) for DHR, including a case with no DHR, as well as the log-linear regression model (LLRM) and the generalized additive model (GAM) (Hastie and Tibshirani 1987) for RE.

The present paper is structured as follows: Sect. 2 offers a review of the new physiographic and morphometric variables proposed in this work by detailing their characteristics. Section 3 briefly presents the theoretical background of the CCA and the ROI approaches for the delineation of neighborhoods and the LLRM and the GAM for the regional estimation. The adopted methodology and the developed regional models are detailed in Sect. 4. Section 5 describes the study area and the used datasets. The results are presented and discussed in Sect. 6, and the conclusions of the work are summarized in the last section.

2 Variables characterizing drainage networks

Drainage network characteristics and evolution depend closely on the prevailing climatic, physiographic, and topographic conditions of the basin (Jung et al. 2015). These conditions determine the drainage network configuration which, in turn, can affect the hydrological response of the watershed (Howard 1990), and consequently hydrological quantile estimation. The new physiographical variables considered in this work are presented herein. Table 2 summarizes the definitions and standard mathematical equations used to determine these variables.

Table 2 Morphometric variables definitions

Full size table

2.1 Stream order (U)

The stream order of a basin is the highest stream order within the basin, where an order one is a stream starting at the source. A number of stream ordering systems are available in the hydrological literature. The simplest and most used one is the Strahler system originally introduced by Horton (1945) and then modified by Strahler (1952). This method is based on a hierarchical ranking of streams. When two first order streams join, an order two is formed and so on. Several researchers have directly correlated the stream order with stream flow (e.g. Blyth and Rodda 1973; Stall and Fok 1967). Blyth and Rodda (1973) also observed that during dry periods, first-order streams present less than 20% of the total length of the drainage network. At the maximum development of the drainage network, the total length of first-order streams constitutes over 50% of the total basin stream length. Thus, stream order frequency, especially the frequency of the first-order streams, may be well correlated with the hydrological response of the watershed.

2.2 Texture ratio (RT)

The texture ratio (RT) allows characterizing the basin drainage texture and is one of the most important factors in the drainage morphometric analysis due to its high relationship with the underlying lithology, the infiltration ability and the topographic characteristics of the terrain (Schumm 1956). High RT levels indicate the presence of soft rocks with high sensitivity to erosion (Ameri et al. 2018), and consequently a high and speedy surface runoff.

2.3 Circularity ratio (RC)

The circularity ratio (RC) is defined as the ratio between the areas of a catchment to the area of the circle having the same perimeter of the catchment. It is an important variable that helps characterize the basin shape. It is affected by the length and frequency of streams, geological structures, land use and cover, and the slope of the catchment (Dar et al. 2014; Vijith and Satheesh 2006). RC values range between 0 and 1. Basins with RC values close to 1 are characterized by circular form and a low concentration time and then a high peak flow. Low RC values are associated with strongly elongated basins and with lower runoff.

2.4 Stream length ratio (RL)

The stream length ratio (RL) was defined by Horton (1945) as the ratio between the mean length of the streams of a given order and the next lower order. It is based on Horton's law (1945) of stream length that indicates the existence of a direct geometric relationship between the mean length of the streams of a given order and the next lower order. The RL between successive stream orders changes under the effect of the topographic and slope variability, and has a significant relationship with surface runoff and the erosional stage of the watershed (Sreedevi et al. 2005).

2.5 Mean bifurcation ratio (MRB) and weighted mean bifurcation ratio (WMRB)

The bifurcation ratio (RB) is defined as the ratio between the stream’s number of a given order and those of the next-higher order in a drainage network. It permits the characterization of the impacts of the geological structures on the drainage network. Strahler (1957) indicated that the RB shows a slight range of variation for different regions except where the impact of the geological control is important. Chow (1964), Strahler (1964) and Verstappen (1983) indicated that, in general, the geological structures have a negligible impact on drainage networks, if the mean bifurcation ratio (MRB) of the watershed is comprised between 3 and 5. A higher value of this variable indicates a sort of geological control (Agarwal 1998). This variable can also characterize the watershed’s shape. A high RB value is, generally, associated with an elongated basin, while a low RB value is likely to be associated with a circular basin (Gajbhiye 2015; Taofik et al. 2017). Strahler (1953) proposed a more representative bifurcation number measure, called weighted mean bifurcation ratio (WMRB). It consists in multiplying the ordinary RB identified for each successive order by the total number of streams involved in the ratio and subsequently taking the mean of these values. Schumm (1956) used this approach to determine the WMRB of the drainage system of the Perth Amboy (N.J). Pareta and Pareta (2011) and Bajabaa et al. (2014) also used this variable in hydrologic and morphometric analysis studies.

2.6 RHO coefficient (ρ)

The RHO coefficient (ρ) is defined as the ratio between the RL and the RB of the watershed. It characterizes the relationship between the physiographic development of the watershed and the drainage density, and permits the assessment of the storage capacity of the drainage network (Horton 1945). This variable is affected by several climatic, geologic, biologic, geomorphologic and anthropogenic factors (Mesa 2006).

2.7 Drainage density (DD)

The drainage density (DD) was introduced by Horton (1932) in the hydrological literature as the total length of stream networks per unit area. DD express the closeness of the spacing of streams, and provides a quantitative measurement of landscape dissection and runoff potential (Magesh et al. 2011). It is a result of interacting factors controlling the surface runoff such as, the infiltration capacity, the climatic conditions and the vegetation cover of the watershed (Máčka 2001; Patton 1988; Reddy et al. 2004; Verstappen 1983).

2.8 Stream frequency (FS)

The stream frequency (FS) is the number of stream segments of all orders per unit area (Horton 1932, 1945). It depends on the rock characteristics, infiltration capacity, vegetation cover, relief, amount of rainfall and subsurface permeability (Hajam et al. 2013), and reflects the texture of the drainage network (Magesh et al. 2011). In general, a high FS is associated with impermeable subsurface, sparse vegetation, high relief conditions and low infiltration capacity (Reddy et al. 2004; Shaban et al. 2005).

2.9 Infiltration number (IF)

The infiltration number (IF) is defined by Faniran (1968) as the product of the DD and the FS. It allows the characterization of the watershed infiltration capacity (Hajam et al. 2013). This variable is inversely proportional to the infiltration capacity of the basin. The higher the IF values, the lower will be the infiltration and the higher will be the runoff (Pareta and Pareta 2011).

2.10 Ruggedness number (RN)

The ruggedness number (RN) is often used to evaluate the flood potential of streams (Patton and Baker 1976) and it usually combines the impact of slope steepness with its length (Strahler 1964). This variable allows describing the structural complexity of the terrain. Watersheds characterized by high RN values are highly subject to erosion and therefore susceptible to an increased peak flow (Sreedevi et al. 2013).

3 Theoretical background

In this section, we briefly present the statistical approaches adopted in the present work. We define a RFA model as a two-step procedure beginning with a neighborhood identification method and then performing regional estimation. We hereby consider two different methods for each step, which are described below.

3.1 Delineation of homogeneous regions

3.1.1 Canonical correlation analysis (CCA)

CCA method is detailed in Ouarda et al. (2001) in the context of RFA, and commonly used in this context to identify group of basins having the same hydrological response. This method consists of space reduction by establishing pairs of canonical variables based on a linear transformation of two groups of random variables. Let two sets of random variables $X = \left( {X_{1} ,X_{2} , \ldots ,X_{m} } \right)$ and $Y = (Y_{1} ,Y_{2} , \ldots ,Y_{n} )$ containing, respectively, the m physio-meteorological variables and the n hydrological variables of N gauged sites. Based on these variables, the linear combinations V_i and Z_i of the variables X and Y and the canonical correlation coefficients λ₁, …, λ_p (with λ_i = corr (V_i, Z_i)) can be computed.

Using the CCA method, the considered basins can be represented as points in a spaces of the uncorrelated canonical variables (V_i, Z_j); where i ≠ j. Then, it will be possible to examine the similarity of the point patterns in these spaces, i.e., the ability of the physio-meteorological canonical variables to predict the hydrological variables. The point patterns that are sufficiently similar are associated with sub-group of basins that belongs to the same statistical population and vice versa. The similarity between the basins are measured based on a Mahalanobis distance.

3.1.2 Region of influence (ROI)

As the CCA, the ROI method (Burn 1990) allows the identification of a hydrological neighborhood for a given target-site based on a Euclidean distance, generally a weighted Euclidean distance. This distance determines the similarity of watersheds in a multidimensional space of physio-meteorological variables. A more detailed description of the approach can be found for example in Burn (1990) and GREHYS (1996).

3.2 Regional estimation approaches

3.2.1 Linear regression model

The linear regression model or the log-linear regression model (LLRM) is commonly used to find a linear relationship between the hydrological variable (such as the flood quantile Q_T corresponding to a return period T) and the physio-meteorological characteristics of a watershed (X₁, X₂, …, X_m), and it is defined as (e.g. Girard et al. 2004; Pandey and Nguyen 1999):

$$\log \left( {E\left( {Y/X} \right)} \right) = \beta _{0} + \mathop \sum \limits_{j = 1}^{m} \beta_{j} \log (X_{j} ) + \varepsilon$$

(1)

where X is a matrix whose columns correspond to a set of m explanatory variables, β₀ and β_j are unknown parameters to be estimated using the least-square method (Pandey and Nguyen 1999) and ε is the model error.

3.2.2 Generalized additive model

GAM was developed by Hastie and Tibshirani (1987). It is an extension of the generalized linear model (GLM). This model allows for a response distribution other than Gaussian and for a non-linear relationship between response and explanatory variables through smooth functions (Hastie and Tibshirani 1987; Wood 2006), which may lead to a more close description of the hydrological processes involved. The GAM formula is given by Wood (2006):

$$g \left( {E\left( {Y/X} \right)} \right) = \beta_{0} + \mathop \sum \limits_{j = 1}^{m} S_{j} (X_{j} ) + \varepsilon$$

(2)

where g is a monotonic link function and ${\text{S}}_{{\text{j}}}$ are smooth functions of explanatory variables ${\text{X}}_{{\text{j}}}$.

The estimation of the smooth functions ${\text{S}}_{{\text{j}}}$ is carried out using splines, which are piecewise polynomial functions linked at points named knots. Generally, the smooth functions ${\text{S}}_{{\text{j}}}$ are defined as follows:

$$S_{j} \left( x \right) = \mathop \sum \limits_{i = 1}^{q} \beta_{ji} b_{ji} (x)$$

(3)

where ${\upbeta }_{{{\text{ji}}}}$ are unknown parameters and ${\text{b}}_{{{\text{ji}}}}$ are the spline basis functions.

4 Methodology

4.1 Regional models

In this study, we apply all combinations of the two DHR methods (CCA, ROI) in conjunction with the RE models (LLRM and GAM) presented in Sect. 3. The RE models are also considered with all stations (i.e. without defining any neighborhood). This result in six possible combinations for each dataset (STA and EXTD). Thus, the following regionalization approaches are evaluated (Fig. 1):

ALL/LLRM (STA and EXTD): LLRM used without neighborhoods (all stations) and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure.
ALL/GAM (STA and EXTD): GAM used without neighborhoods (all stations) and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure.
CCA/LLRM (STA and EXTD): LLRM used with neighborhoods identified by the CCA method and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure.
CCA/GAM (STA and EXTD): GAM used with neighborhoods identified by the CCA method and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure.
ROI/LLRM (STA and EXTD): LLRM used with neighborhoods identified by the ROI method and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure.
ROI/GAM (STA and EXTD): GAM used with neighborhoods identified by the ROI method and with variables selected from the STA and the EXTD datasets using the backward stepwise procedure

The CCA and ROI methods are used in the DHR considering two different sets of physio-meteorological variables. The first group includes variables from the STA dataset, namely the area (AREA), mean basin slope (MBS), percentage of the area occupied by lakes (PLAKE), mean annual total precipitation (MATP), mean annual degree days below 0 °C (DDBZ) and the longitude of the centroid of the catchment (LONGC). The second one comprises variables from the EXTD dataset, which are PLAKE, MATP, DDBZ, LONGC, RT and RC. The selection of these variables is carried out based on their correlation level with the hydrological variables (Table 3) as the principle of the CCA is based on correlations. For the aim of simplicity and to be consistent with the CCA, variables selected for the ROI are also based on correlation levels.

Table 3 Correlation between hydrological and physiographical variables

Full size table

The classical procedures of ROI and CCA lead to neighbourhoods with highly variable sample sizes from a target site to another. Indeed, considering a given threshold value, sites located near the centre of the cloud of points determined by the Euclidean space for ROI and the canonical space for CCA are expected to include more sites within their neighbourhoods than sites located on the edge of the cloud of points (Leclerc and Ouarda 2007). Since the accuracy of the estimates obtained by regression models is sensible to the sample size, it was decided to fix the neighbourhood size for all target stations. This size is chosen with a standard jackknife procedure and optimized using the optimization procedure of Ouarda et al. (2001) developed in the Matlab environment.

LLRM and GAM are used in this study as RE models. GAM was developed based on the R package mgcv (Wood 2006). In this work, the thin plate regression spline is considered as basis b_ji (.) in the smoothing function ${\mathrm{S}}_{\mathrm{j}}(.)$ in Eq. (3). This basis function is considered due to its advantages. The thin plate regression spline is characterized by its reduced calculation time, its flexibility and it comprises a lower number of parameters compared to other smoothing functions (Wood 2006). The considered link function g in (2) is the identity function since the log-transformed quantiles are approximately normal (as in Ouali et al. (2017)).

4.2 Selection of explanatory variables

Variable selection procedure is different for the two RFA steps; a correlation-based selection is considered for DHR and a stepwise method is used for RE as a standard approach in the RFA studies. Based on correlation level between physio-meteorological variables and hydrological variables (Table 3), six variables are identified for DHR (see above).

For the RE step, four variable selection methods are firstly tested namely forward, backward, stepwise and shrinkage approaches (Heinze et al. 2018) in this study. Table 4 presents the results obtained from each variable selection approach applied for QS₁₀ that can be considered as the most reliable quantile. It can be seen that, regardless of the considered selection method, several new variables are selected in the final model. This suggests that new variables in the EXTD are potentially useful for RFA.

Table 4 Variables selection results for QS₁₀ case (with different methods)

Full size table

To evaluate whether the new variables are predictive of target quantiles, the backward stepwise selection procedure is adopted for both LLRM and GAM. It has already been successfully applied previously with the same dataset (STA) and in the same context by Chebana et al. (2014), Ouarda et al. (2018) and more recently by Msilini et al. (2020). Backward stepwise selection procedure consists in a progressive elimination of variables having the highest p value (based on the hypothesis that the coefficients in Eq. (1) for LLRM or the smooth terms in Eq. (3) for GAM are null) from an initial model comprising all available variables. The procedure stops when the number of variables remaining in the model drops below a specific number (Fig. 2). This number is chosen as the one minimizing the RRMSE estimated by jackknife.

4.3 Models validation

For each RFA model, a jackknife procedure (also called leave one-out cross validation procedure) is used to evaluate its performance. It consists in considering, in turn, each gauged site as an ungauged one and comparing thereafter the regional estimate to the observed value. This comparison is performed through several criteria: first, the Nash criterion (NASH) gives an evaluation of the degree of adequacy and a global assessment of the prediction quality. Second, the root mean squared error (RMSE) provides information about the accuracy of the prediction in an absolute scale, and the relative RMSE (RRMSE) removes the impact of each site’s order of magnitude from the RMSE values and gives information about the accuracy of the prediction in a relative scale. Finally, the bias (BIAS) and the relative bias (RBIAS) give a measure of the magnitude of the systematic overestimation or underestimation of a model. The formulations of these criteria are given as follows:

Nash:

$${\text{NASH}} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} (y_{i} - \hat{y}_{i} )^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} (y_{i} - \overline{y})^{2} }}$$

(4)

$${\text{RMSE = }}\sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{N} (y_{i} - \hat{y}_{i} )^{2} }$$

(5)

Relative root-mean-square error:

$${\text{RRMSE}} = 100\sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left[ {\frac{{(y_{i} - \hat{y}_{i} )}}{{y_{i} }}} \right]^{ 2} }$$

(6)

Mean bias:

$${\text{BIAS}} = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} (y_{i} - \hat{y}_{i} )$$

(7)

Relative mean bias:

$${\text{RBIAS}} = 100 \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \frac{{(y_{i} - \hat{y}_{i} )}}{{y_{i} }}$$

(8)

where $y_{i}$ and $\hat{y}_{i}$ are, respectively, the local and regional quantile estimates at site i, $\overline{y}$ is the mean of the local quantile estimates, and N is the number of stations.

5 Case study and datasets

The data used in this study includes two datasets, the STA and the EXTD, covering 151 stations located in the southern part of Quebec, Canada (Fig. 3). The STA was considered in previous studies with geographical coordinates of the stations and commonly used physio-meteorological variables (e.g. Durocher et al. 2015; Shu and Ouarda 2007; Wazneh et al. 2016). The EXTD dataset combining STA dataset with less common variables representing drainage network properties. The stations are operated by the Ministry of Sustainable Development, Environment, and Fight Against Climate Change.

The considered hydrological variables ($Y$ in the theoretical background) are at-site quantiles standardized by the basin area (specific quantiles), denoted by QS₁₀, QS₅₀ and QS₁₀₀ with 10, 50 and 100 are the return periods. Descriptive statistics of hydrological and physio-meteorological variables of the STA (not presented here to avoid repetition) can be found for example in Durocher et al. (2015). The hydrological variables were identified in Kouider et al. (2002a) using a local Frequency Analysis in each gauged site. Data series with at least 15 years of measurement were considered for the analysis. The basic assumptions of stationarity, homogeneity and independence were verified and the appropriate statistical distributions were fitted to data. The appropriate probability distributions identified, are mainly the inverse gamma and Log-Normal with two parameters. For more details about this study, reader may refer to the report of Kouider et al. (2002b). The new physiographical variables, considered in the EXTD, are summarized in Table 5. These variables are identified from drainage networks extracted using the D8 method based on the DEMs (Jenson and Domingue 1988; O'Callaghan and Mark 1984). This technique is implemented in Arc Gis (Arc Hydro).

Table 5 Descriptive statistics of new physiographical variables

Full size table

The D8 method is based on a digital elevation model (DEM) which is basically a grid of elevation values. For each cell, it is considered that water flows in direction of the steepest slope among the eight neighbors of a given DEM cell. The direction grid can then be used to estimate flow accumulation which is obtained by summing the weight of all grid cells following into each downslope cell in the output grid, i.e. simulating the flow path. Based on the obtained flow accumulation grid, the drainage networks can be extracted with the stream head locations corresponding to accumulation values below a constant threshold value (see for instance (Tarboton et al. 1991)).

In this work, the DEMs were hydrologically corrected based on information from the National Hydro Network (NHN). This correction was carried out using the DEM Reconditioning process, which is an implementation of the “AGREE” method. It consists in adjusting the DEM by imposing linear features as a reference. The reference in this case is the (NHN).

The used DEMs have a spatial resolution of ~ 20 m grid cells and are obtained from the Natural Resources Canada database (https://www.nrcan.gc.ca/earth-sciences/geography/topographic-information/download-directory-documentation/17215). Note that, drainage networks of six cross-border watersheds are extracted using the United States Geological Survey (USGS) data distributed with ~ 30 m grid cells (https://earthexplorer.usgs.gov/).

CCA requires the normality of all variables. Hence, some variables need to be transformed. The normality of each variable is visually assessed with a normal probability plot. This technique plots empirical quantiles versus theoretical Gaussian quantiles and should be approximately linear in the case of actual normality. The logarithmic transformation is considered for the hydrological variables, AREA, MBS, MATP, DDBZ and RT, and a square root transformation for PLAKE and RC. The LONGC is used without transformation since it is approximately normal.

6 Results and discussion

A correlation analysis is carried out in order to investigate the relationships between variables. Table 3 shows the list of the variables selected for the DHR step based on their high correlation level with the hydrological variables. One can see the existence of relatively high negative correlations between the hydrological variables and the AREA, PLAKE, DDBZ and RT. We also note the presence of important positive correlations between the response variables and the MATP and RC variables. The linear correlation coefficients between the variable RT, which is one of the most important new variables, and the specific quantiles QS₁₀ and QS₁₀₀ are -0.53 and -0.51 respectively. However, those between the RT variable and the at-site flood quantiles Q₁₀ and Q₁₀₀ are 0.87 and 0.86 respectively. Positive and high correlation values indicate that the increase in RT is associated with a relatively fast and high hydrologic response and consequently an increased risk of erosion. This is consistent with what is stated in Ameri et al. (2018). The second important new variable in terms of correlation level is the RC characterizing the basin shape. Higher RC values (close to 1) are associated with circular basins with low concentration time and high hydrological response hence the positive correlation.

The identification of the neighborhood requires the determination of the optimal number of stations to be used in the RE step. To this end, the optimization procedure of Ouarda et al. (2001) is used. Based on a selected criterion such as RMSE, RRMSE, BIAS or RBIAS the optimal size of neighborhoods can be identified. The optimal size of the neighborhoods should be large enough to ensure that RE can be carried out effectively, but not too large in order to maintain an acceptable degree of homogeneity within the neighborhoods. In this study, we obtain n^opt (STA) = 85 sites and n^opt (EXTD) = 78 sites with respect to the RRMSE, which is the most important criterion (Hosking and Wallis 2005), for the CCA approach. For the ROI method, the obtained optimum sizes are n^opt (STA) = 54 sites and n^opt (EXTD) = 44 sites with respect to the same criterion.

The backward stepwise selection method is considered for each quantile (QS₁₀, QS₅₀ and QS₁₀₀) and for each model (LLRM and GAM). In the present study, the optimal number of variables in GAM, which is the most complex model, is found to be seven. Table 6 shows the seven selected variables for each quantile and model combination. We note the selection of three new variables (RN, MRL and DD).

Table 6 Explanatory variables selected for the various regression models

Full size table

The jackknife procedure results for all considered combinations are presented in Table 7. The best overall performances are obtained with the EXTD, especially with ROI/GAM/EXTD followed by the CCA/GAM/EXTD approaches. Based on the high NASH values (0.79) and the lowest RRMSE values (29.24% for QS₁₀₀), the ROI/GAM/EXTD combination gives the most precise estimates compared to all other approaches. According to RBIAS, all models underestimate flood quantiles but the least biased model is ROI/LLRM/EXTD (-1.38% for QS₁₀₀). However, compared to the ROI/GAM/EXTD approach, the difference is low (around -1.8% for QS₁₀₀).

Table 7 Jackknife validation results

Full size table

Note that, GAM applied to EXTD (with and without the neighborhoods) outperforms LLRM applied to EXTD and STA. This may be explained by the ability of GAM to take into account the possible nonlinear connections between predictor and response variables, and also by the important impact of the new variables.

We also notice that the use of the EXTD leads to even more important improvements when adopting the ROI method compared to the CCA approach. Wazneh et al. (2016) have also obtained better results with the ROI than with the CCA approach.

To further explain the previous results, the relative errors as a function of the stations ordered according to their area corresponding to the best combinations (ROI/GAM and CCA/GAM) are given in Figs. 4 and 5 respectively. It can be seen that the EXTD performs well especially for large basins. Indeed, for the large watersheds the relative errors decrease considerably with the EXTD. This result may also be confirmed by Fig. 6, where one can note that the lowest specific quantiles, which are usually associated to sites with large basin areas, are well estimated with the EXTD. A significant improvement can also be seen for some specific sites that have exceptionally large relative errors with STA. Four such sites (030401, 030402, 041903 and 042607) were identified previously by Chokmani and Ouarda (2004), Durocher et al. (2015) and Ouali et al. (2017) as particular stations with underestimated areas. The integration of more accurate variables dealing with the drainage network, improves considerably the quantile estimates corresponding to these sites.

Jackknife estimates using the ROI/GAM and CCA/GAM approaches (for QS₁₀₀) are illustrated, respectively, in Figs. 7 and 8. One can see that these models combined with the EXTD show better performances compared to the STA. The points associated to the scatter diagram of the at-site and regional estimates are less dispersed when using the EXTD than the STA. In addition, the coefficient of determination R² values show that the linearity between the local and the regional specific quantile estimates is better explained when using the EXTD than the STA.

Results also indicate that sites with high specific quantile values (more than 0.7 m³/s.km²), which are generally associated to small basins with an area less than 800 km², are underestimated using the two datasets. This may suggest the usefulness of developing specific regional models for small basins. This result can be explained by the fact that traditional neighborhood approaches (CCA and ROI) lead to an underestimation for sites with small basin areas as shown in Wazneh et al.(2016). This may be the cause of the obtained negative RBIAS values in this work.

Figures 9 and 10 present the smooth functions of the response variable log(QS₁₀₀) as a function of the STA and the EXTD explanatory variables respectively. We notice that the variables PLAKE, DDBZ, AREA and DD show a complex nonlinear relationship (nonlinear smooth function curves and high edf values), while the variables LONGC; MALP, MCL, MBS and MRL present linear relations.

A particular case of interest from the EXTD that can be observed concerns the relationship between the hydrological variable and the DD values. One can see that the higher the DD values are the lower the hydrological response will be. This result is in contradiction with what is commonly observed in practice (Melton 1957). In fact, the correlation between the DD variable and specific quantile is negative (-0.11) while the correlation between flood quantile and the variable DD is positive (0.13). Thus, this variable depends on the size of the watershed, for this reason its effect is reversed in this study case because the specific quantile is used.

We also notice that the MRL and MCL variables are found to be inversely proportional to the hydrological response. An increase of these variables is associated with a decrease of the MBS and hence a decrease of the hydrological response.

It can also be seen that the relationship between log(QS₁₀₀) and PLAKE is decreasing for the majority of PLAKE values, but increases for the highest values of PLAKE. However, the number of points is very limited in the high PLAKE range and more effort will be required to understand the effect of this variable on the flow regime for this range. In general, lakes act as a sponge absorbing the excess water during extreme events, which explains the decreasing relationship between log(QS₁₀₀) and PLAKE.

The LONGC in this study is an indicator of the station proximity to the Atlantic Ocean and thereafter reflects the influence of the ocean on the local climate. Finally, the variability in the relationship between the DDBZ values and the hydrological response may indirectly reflect the seasonality impact of the temperature on the flow regime. The same patterns were observed previously by Chebana et al. (2014) for the DDBZ and PLAKE variables.

7 Conclusions

Through a case study in the province of Quebec, the present study shows the relevancy of considering drainage network characteristics for quantile prediction in RFA. This result is outlined by the variable importance in RFA models which shows that five new variables, namely RT, RC, DD, MRL and RN are found particularly useful for the specific case of Quebec. Prediction accuracy is also improved using the new variables, especially when considering small neighbourhoods and nonlinear models as shown by the superior accuracy of the ROI/GAM/EXTD combination. This result seems also more important for large basins.

By focusing on the drainage network and basin shape, the new physiographical variables allow integrating more information about the underlying hydrogeological flows and thus, indirectly, to make the link between the groundwater and the surface water flows. This added information allows for a better description of the hydrological dynamics involved and consequently to better flood quantile estimates.

The present study paves the way for several perspectives. In particular, drainage network characteristics should be evaluated further in a wider variety of settings including different climate and catchment geology. The increasing complexity of databases used in RFA to which this research participate, also outlines the need for methodological development that allow a more efficient use of this extensive information, as classical approaches may be limited in this regard. Future research should thus focus on studying how to take advantage of the interaction between the newly proposed variables on quantile estimation, as well as the potential nonlinear impact of the considered variables.

Abbreviations

BH:: Basin relief
BIAS:: Mean bias
CCA:: Canonical correlation analysis
DD:: Drainage density
DDBZ:: Mean annual degree days below 0 °C
DEM:: Digital elevation model
DHR:: Delineation of homogenous regions
Edf:: Estimated smooth degree of freedom
EXTD:: Extended dataset
FS:: Stream frequency
GAM:: Generalized additive model
IF:: Infiltration number
LATC:: Latitude of the centroid of the basin
LLRM:: Log-linear regression model
LONGC:: Longitude of the centroid of the basin
LU:: Stream length
MALP:: Mean annual liquid precipitation
MASP:: Mean annual solid precipitation
MATP:: Mean annual total precipitation
MBS:: Mean basin slope
MCL:: Main channel length
MRB:: Mean bifurcation ratio
MRL:: Mean stream length ratio
NASH:: Nash efficiency criterion
NHN:: National Hydro Network
PFOR:: Percentage of the area occupied by forest
PLAKE:: Percentage of the area occupied by lakes
PL1:: Percentage of first-order stream lengths
PN1:: Percentage of first-order streams
QS_T :: Specific quantile associated to the return period T
Q_T :: At-site flood quantile corresponding to return period T
R² :: Coefficient of determination
RB:: Bifurcation ratio
RBIAS:: Relative mean bias
RC:: Circularity ratio
RE:: Regional estimation
RFA:: Regional frequency analysis
RL:: Stream length ratio
RMSE:: Root-mean-square error
RN:: Ruggedness number
ROI:: Region of influence
RRMSE:: Relative root-mean-square error
RT:: Texture ratio
STA:: Standard dataset
U:: Stream order
Var:: Explanatory variable
WMRB:: Weighted mean bifurcation ratio
ρ:: RHO coefficient
$\rho_{{{\text{WMRB}}}}$ :: RHO WMRB coefficient

References

Agarwal C (1998) Study of drainage pattern through aerial data in Naugarh area of Varanasi district. Up J Indian Soc Remote Sens 26:169–175. https://doi.org/10.1007/BF02990795
Article Google Scholar
Ahmadi R et al (2006) The geomorphologic responses to hinge migration in the fault-related folds in the Southern Tunisian Atlas. J Struct Geol 28:721–728
Article Google Scholar
Alobaidi MH, Marpu PR, Ouarda TB, Chebana F (2015) Regional frequency analysis at ungauged sites using a two-stage resampling generalized ensemble framework. Adv Water Resour 84:103–111. https://doi.org/10.1016/j.advwatres.2015.07.019
Article Google Scholar
Ameri AA, Pourghasemi HR, Cerda A (2018) Erodibility prioritization of sub-watersheds using morphometric parameters analysis and its mapping: a comparison among TOPSIS, VIKOR, SAW, and CF multi-criteria decision making models. Sci Total Environ 613:1385–1400. https://doi.org/10.1016/j.scitotenv.2017.09.210
Article CAS Google Scholar
Aziz K, Rahman A, Fang G, Shrestha S (2014) Application of artificial neural networks in regional flood frequency analysis: a case study for Australia. Stoch Environ Res Risk Assess 28:541–554
Article Google Scholar
Bajabaa S, Masoud M, Al-Amri N (2014) Flash flood hazard mapping based on quantitative hydrology, geomorphology and GIS techniques (case study of Wadi Al Lith, Saudi Arabia). Arab J Geosci 7:2469–2481. https://doi.org/10.1007/s12517-013-0941-2
Article Google Scholar
Baumgardner RW (1987) Morphometric studies of subhumid and semiarid drainage basins, Texas panhandle and Northeastern New Mexico, vol 163. University of Texas at Austin, Bureau of Economic Geology
Google Scholar
Beck HE, Dijk AI, Miralles DG, Jeu RA, McVicar TR, Schellekens J (2013) Global patterns in base flow index and recession based on streamflow observations from 3394 catchments. Water Resour Res 49:7843–7863. https://doi.org/10.1002/2013WR013918
Article Google Scholar
Biswas S, Sudhakar S, Desai V (1999) Prioritisation of subwatersheds based on morphometric analysis of drainage basin: a remote sensing and GIS approach. J Indian Soc Remote Sens 27:155–166. https://doi.org/10.1007/BF02991569
Article Google Scholar
Blyth K, Rodda J (1973) A Stream Length Study. Water Resour Res 9:1454–1461. https://doi.org/10.1029/WR009i005p01454
Article Google Scholar
Burn DH (1990) Evaluation of regional flood frequency analysis with a region of influence approach. Water Resour Res 26:2257–2265. https://doi.org/10.1029/WR026i010p02257
Article Google Scholar
Castellarin A (2014) Regional prediction of flow-duration curves using a three-dimensional kriging. J Hydrol 513:179–191. https://doi.org/10.1016/j.jhydrol.2014.03.050
Article Google Scholar
Castiglioni S, Castellarin A, Montanari A (2009) Prediction of low-flow indices in ungauged basins through physiographical space-based interpolation. J Hydrol 378:272–280. https://doi.org/10.1016/j.jhydrol.2009.09.032
Article Google Scholar
Chebana F, Ouarda TB (2007) Multivariate L-moment homogeneity test. Water Resour Res. https://doi.org/10.1029/2006WR005639
Article Google Scholar
Chebana F, Charron C, Ouarda TBMJ, Martel B (2014) Regional frequency analysis at ungauged sites with the generalized additive model. J Hydrometeorol 15:2418–2428. https://doi.org/10.1175/JHM-D-14-0060.1
Article Google Scholar
Chokmani K, Ouarda TB (2004) Physiographical space‐based kriging for regional flood frequency estimation at ungauged sites. Water Resour Res 40
Chow V (1964) Handbook of applied hydrology: a compendium of water-resources technology, vol 1. McGraw-Hill
Google Scholar
Dar RA, Romshoo SA, Chandra R, Ahmad I (2014) Tectono-geomorphic study of the Karewa Basin of Kashmir Valley. J Asian Earth Sci 92:143–156. https://doi.org/10.1016/j.jseaes.2014.06.018
Article Google Scholar
Dawson CW, Abrahart RJ, Shamseldin AY, Wilby RL (2006) Flood estimation at ungauged sites using artificial neural networks. J Hydrol 319:391–409. https://doi.org/10.1016/j.jhydrol.2005.07.032
Article Google Scholar
Dodangeh E, Soltani S, Sarhadi A, Shiau JT (2014) Application of L-moments and Bayesian inference for low-flow regionalization in Sefidroud basin. Iran Hydrol Process 28:1663–1676. https://doi.org/10.1002/hyp.9711
Article Google Scholar
Durocher M, Chebana F, Ouarda TB (2015) A nonlinear approach to regional flood frequency analysis using projection pursuit regression. J Hydrometeorol 16:1561–1574. https://doi.org/10.1175/JHM-D-14-0227.1
Article Google Scholar
Faniran A (1968) The index of drainage intensity: a provisional new drainage factor. Aust J Sci 31:328–330
Google Scholar
Flavell D (2012) Design flood estimation in Western Australia. Aust J Water Resour 16:1–20
Google Scholar
Gajbhiye S (2015) Morphometric analysis of a Shakkar river catchment using RS and GIS International Journal of u-and e-Service. Sci Technol 8:11–24. https://doi.org/10.14257/ijunesst.2015.8.2.02
Article Google Scholar
Girard C, Ouarda TB, Bobée B (2004) Étude du biais dans le modèle log-linéaire d’estimation régionale. Can J Civ Eng 31:361–368. https://doi.org/10.1139/l03-099
Article Google Scholar
Goswami M, O’connor K, Bhattarai K (2007) Development of regionalisation procedures using a multi-model approach for flow simulation in an ungauged catchment. J Hydrol 333:517–531. https://doi.org/10.1016/j.jhydrol.2006.09.018
Article Google Scholar
GREHYS (1996) Presentation and review of some methods for regional flood frequency analysis. J Hydrol 186:63–84
Article Google Scholar
Griffis V, Stedinger J (2007) The use of GLS regression in regional hydrologic analyses. J Hydrol 344:82–95. https://doi.org/10.1016/j.jhydrol.2007.06.023
Article Google Scholar
Haddad K, Rahman A (2012) Regional flood frequency analysis in eastern Australia: Bayesian GLS regression-based methods within fixed region and ROI framework–quantile regression vs. parameter regression technique. J Hydrol 430:142–161. https://doi.org/10.1016/j.jhydrol.2012.02.012
Article Google Scholar
Hadley R, Schumm S (1961) Sediment sources and drainage basin characteristics in upper Cheyenne River basin. US Geol Surv Water-Supply Paper 1531:198
Google Scholar
Hailegeorgis TT, Alfredsen K (2017) Regional flood frequency analysis and prediction in ungauged basins including estimation of major uncertainties for mid-Norway Journal of Hydrology. Reg Stud 9:104–126. https://doi.org/10.1016/j.ejrh.2016.11.004
Article Google Scholar
Hajam RA, Hamid A, Bhat S (2013) Application of morphometric analysis for geo-hydrological studies using geo-spatial technology: a case study of Vishav drainage basin. Hydrol Curr Res 4:2. https://doi.org/10.4172/2157-7587.1000157
Article Google Scholar
Hamed Y et al (2014) Use of geochemical, isotopic, and age tracer data to develop models of groundwater flow: a case study of Gafsa mining basin-Southern Tunisia. J African Earth Sci 100:418–436
Article CAS Google Scholar
Hastie T, Tibshirani R (1987) Generalized additive models: Some applications. J Amer Stat Assoc 82:371–386. https://doi.org/10.1080/01621459.1987.10478440
Article Google Scholar
Heinze G, Wallisch C, Dunkler D (2018) Variable selection–a review and recommendations for the practicing statistician. Biomet J 60:431–449
Article Google Scholar
Horton RE (1932) Drainage-basin characteristics Eos. Trans Am Geophys Union 13:350–361. https://doi.org/10.1029/TR013i001p00350
Article Google Scholar
Horton RE (1945) Erosional development of streams and their drainage basins; hydrophysical approach to quantitative morphology. Geol Soc Am Bull 56:275–370. https://doi.org/10.1130/0016-7606(1945)56[275:EDOSAT]2.0.CO;2
Article Google Scholar
Hosking JRM, Wallis JR (2005) Regional frequency analysis: an approach based on L-moments. Cambridge University Press
Google Scholar
Howard AD (1990) Theoretical model of optimal drainage networks. Water Resour Res 26:2107–2117. https://doi.org/10.1029/WR026i009p02107
Article Google Scholar
Jenson SK, Domingue JO (1988) Extracting topographic structure from digital elevation data for geographic information system analysis. Photogram Eng Remote Sens 54:1593–1600
Google Scholar
Jung K, Marpu PR, Ouarda TB (2015) Improved classification of drainage networks using junction angles and secondary tributary lengths. Geomorphology 239:41–47. https://doi.org/10.1016/j.geomorph.2015.03.004
Article Google Scholar
Jung K, Marpu PR, Ouarda TB (2017) Impact of river network type on the time of concentration. Arab J Geosci 10:546. https://doi.org/10.1007/s12517-017-3323-3
Article Google Scholar
Kaliraj S, Chandrasekar N, Magesh N (2015) Morphometric analysis of the River Thamirabarani sub-basin in Kanyakumari District, South west coast of Tamil Nadu, India, using remote sensing and GIS. Environ Earth Sci 73:7375–7401. https://doi.org/10.1007/s12665-014-3914-1
Article Google Scholar
Kouider A, Gingras H, Ouarda T, Ristic-Rudolf Z, Bobée B (2002a) Analyse fréquentielle locale et régionale et cartographie des crues au Québec Rep R-627-el
Kouider A, Ouarda T, Bobée B (2002b) Analyse fréquentielle locale des crues au Québec. http://espace.inrs.ca/365/1/T000342.pdf
Latt ZZ, Wittenberg H, Urban B (2015) Clustering hydrological homogeneous regions and neural network based index flood estimation for ungauged catchments: an example of the Chindwin river in Myanmar. Water Resour Manage 29:913–928. https://doi.org/10.1007/s11269-014-0851-4
Article Google Scholar
Leclerc M, Ouarda TB (2007) Non-stationary regional flood frequency analysis at ungauged sites. J Hydrol 343:254–265. https://doi.org/10.1016/j.jhydrol.2007.06.021
Article Google Scholar
Máčka Z (2001) Determination of texture of topography from large scale contour maps
Magesh N, Chandrasekar N, Soundranayagam JP (2011) Morphometric evaluation of Papanasam and Manimuthar watersheds, parts of Western Ghats, Tirunelveli District, Tamil Nadu, India: a GIS approach. Environ Earth Sci 64:373–381. https://doi.org/10.1007/s12665-010-0860-4
Article Google Scholar
Marchi L, Dalla Fontana G (2005) GIS morphometric indicators for the analysis of sediment dynamics in mountain basins. Environ Geol 48:218–228. https://doi.org/10.1007/s00254-005-1292-4
Article Google Scholar
Melton MA (1957) An analysis of the relations among elements of climate, surface properties, and geomorphology. Columbia Univ New York
Mesa L (2006) Morphometric analysis of a subtropical Andean basin (Tucuman, Argentina). Environl GeoL 50:1235–1242. https://doi.org/10.1007/s00254-006-0297-y
Article Google Scholar
Miller VC (1953) quantitative geomorphic study of drainage basin characteristics in the Clinch Mountain area, Virginia and Tennessee Technical report (Columbia University Department of Geology); no 3
Msilini A, Masselot P, Ouarda TBMJ (2020) Regional frequency analysis at ungauged sites with multivariate adaptive regression splines. J Hydrometeorol. https://doi.org/10.1175/jhm-d-19-0213.1
Article Google Scholar
Muttiah RS, Srinivasan R, Allen PM (1997) Prediction of two-year peak stream-discharges using neural networks Jawra. J Am Water Resour Assoc 33:625–630. https://doi.org/10.1111/j.1752-1688.1997.tb03537.x
Article Google Scholar
O’Callaghan JF, Mark DM (1984) The extraction of drainage networks from digital elevation data. Comput vis Graph Image Process 28:323–344. https://doi.org/10.1016/S0734-189X(84)80011-0
Article Google Scholar
Odry J, Arnaud P (2017) Comparison of flood frequency analysis methods for ungauged catchments in france. Geosciences 7:88. https://doi.org/10.3390/geosciences7030088
Article Google Scholar
Ouali D, Chebana F, Ouarda TBMJ (2016) Non-linear canonical correlation analysis in regional frequency analysis. Stoch Environ Res Risk Assess 30:449–462. https://doi.org/10.1007/s00477-015-1092-7
Article Google Scholar
Ouali D, Chebana F, Ouarda TBMJ (2017) Fully nonlinear statistical and machine-learning approaches for hydrological frequency estimation at ungauged sites. J Adv Model Earth Syst 9:1292–1306. https://doi.org/10.1002/2016MS000830
Article Google Scholar
Ouarda TBMJ (2016) Regional flood frequency modeling chapter 77. In: Singh VP (ed) Chow’s handbook of applied hydrology, 3rd edn. Mc-Graw Hill, New York, pp 77.71-77.78
Google Scholar
Ouarda TBMJ, Haché M, Bruneau P, Bobée B (2000) Regional flood peak and volume estimation in northern Canadian basin. J Cold Region Eng 14:176–191. https://doi.org/10.1061/(ASCE)0887-381X(2000)14:4(176)
Article Google Scholar
Ouarda TBMJ, Girard C, Cavadias GS, Bobée B (2001) Regional flood frequency estimation with canonical correlation analysis. J Hydrol 254:157–173. https://doi.org/10.1016/S0022-1694(01)00488-7
Article Google Scholar
Ouarda TBMJ, Charron C, Hundecha Y, St-Hilaire A, Chebana F (2018) Introduction of the GAM model for regional low-flow frequency analysis at ungauged basins and comparison with commonly used approaches. Environ Model Softw 109:256–271. https://doi.org/10.1016/j.envsoft.2018.08.031
Article Google Scholar
Pandey G, Nguyen V-T-V (1999) A comparative study of regression based methods in regional flood frequency analysis. J Hydrol 225:92–101. https://doi.org/10.1016/S0022-1694(99)00135-3
Article Google Scholar
Pareta K, Pareta U (2011) Quantitative morphometric analysis of a watershed of Yamuna basin, India using ASTER (DEM) data and GIS. Int J Geomat Geosci 2:248
Article Google Scholar
Patton PC (1988) Drainage basin morphometry and floods Flood Geomorphology. Wiley, New York, pp 51–64
Google Scholar
Patton PC, Baker VR (1976) Morphometry and floods in small drainage basins subject to diverse hydrogeomorphic controls. Water Resour Res 12:941–952. https://doi.org/10.1029/WR012i005p00941
Article Google Scholar
Rahman A (2005) A quantile regression technique to estimate design floods for ungauged catchments in south-east Australia. Aust J Water Resour 9:81–89. https://doi.org/10.1080/13241583.2005.11465266
Article Google Scholar
Rahman A, Charron C, Ouarda TBMJ, Chebana F (2018) Development of regional flood frequency analysis techniques using generalized additive models for Australia. Stoch Environ Res Risk Assess 32:123–139. https://doi.org/10.1007/s00477-017-1384-1
Article Google Scholar
Rai PK, Mishra VN, Mohan K (2017) A study of morphometric evaluation of the Son basin, India using geospatial approach. Remote Sens Appl Soc Environ 7:9–20. https://doi.org/10.1016/j.rsase.2017.05.001
Article Google Scholar
Ratnam KN, Srivastava Y, Rao VV, Amminedu E, Murthy K (2005) Check dam positioning by prioritization of micro-watersheds using SYI model and morphometric analysis—remote sensing and GIS perspective. J Indian Soc Remote Sens 33:25. https://doi.org/10.1007/BF02989988
Article Google Scholar
Reddy GPO, Maji AK, Gajbhiye KS (2004) Drainage morphometry and its influence on landform characteristics in a basaltic terrain, Central India–a remote sensing and GIS approach. Int J Appl Earth Obs Geoinf 6:1–16. https://doi.org/10.1016/j.jag.2004.06.003
Article Google Scholar
Requena AI, Ouarda TBMJ, Chebana F (2018) Low-flow frequency analysis at ungauged sites based on regionally estimated streamflows. J Hydrol. https://doi.org/10.1016/j.jhydrol.2018.06.016
Article Google Scholar
Ridolfi E, Rianna M, Trani G, Alfonso L, Di Baldassarre G, Napolitano F, Russo F (2016) A new methodology to define homogeneous regions through an entropy based clustering method. Adv Water Resour 96:237–250. https://doi.org/10.1016/j.advwatres.2016.07.007
Article Google Scholar
Schumm SA (1956) Evolution of drainage systems and slopes in badlands at Perth Amboy, New Jersey. Geol Soc Am Bull 67:597–646. https://doi.org/10.1130/0016-7606(1956)67[597:EODSAS]2.0.CO;2
Article Google Scholar
Seckin N (2011) Modeling flood discharge at ungauged sites across turkey using neuro-fuzzy and neural networks. J Hydroinf 13:842–849. https://doi.org/10.2166/hydro.2010.046
Article Google Scholar
Seidou O, Ouarda T, Barbet M, Bruneau P, Bobee B (2006) A parametric Bayesian combination of local and regional information in flood frequency analysis. Water Resour Res. https://doi.org/10.1029/2005WR004397
Article Google Scholar
Shaban A, Khawlie M, Abdallah C, Awad M (2005) Hydrological and watershed characteristics of the El-Kabir River, North Lebanon. Lakes Reserv Res Manage 10:93–101. https://doi.org/10.1111/j.1440-1770.2005.00262.x
Article Google Scholar
Shu C, Ouarda TBMJ (2007) Flood frequency analysis at ungauged sites using artificial neural networks in canonical correlation analysis physiographic space. Water Resour Res. https://doi.org/10.1029/2006WR005142
Article Google Scholar
Shu C, Ouarda T (2008) Regional flood frequency analysis at ungauged sites using the adaptive neuro-fuzzy inference system. J Hydrol 349:31–43. https://doi.org/10.1016/j.jhydrol.2007.10.050
Article Google Scholar
Sivasena Reddy A, Janga Reddy M (2013) Identification of homogenous regions in rain-fed watershed using Kohonen neural networks ISH. J Hydraul Eng 19:55–66. https://doi.org/10.1080/09715010.2013.763408
Article Google Scholar
Smith A, Sampson C, Bates P (2015) Regional flood frequency analysis at the global scale. Water Resour Res 51:539–553. https://doi.org/10.1002/2014WR015814
Article Google Scholar
Sreedevi P, Subrahmanyam K, Ahmed S (2005) The significance of morphometric analysis for obtaining groundwater potential zones in a structurally controlled terrain. Environ Geol 47:412–420. https://doi.org/10.1007/s00254-004-1166-1
Article Google Scholar
Sreedevi P, Sreekanth P, Khan H, Ahmed S (2013) Drainage morphometry and its influence on hydrology in an semi arid region: using SRTM data and GIS. Environ Earth Sci 70:839–848. https://doi.org/10.1007/s12665-012-2172-3
Article Google Scholar
Stall JB, Fok Y-S (1967) Discharge as related to stream system morphology. In: International association of scientific hydrology symposium on river morphology, publication, pp 224–235
Strahler A (1952) Hypsometric (area-altitude) analysis of erosional topography. Geol Soc Am Bull 63:1117–1142. https://doi.org/10.1130/0016-7606(1952)63[1117:HAAOET]2.0.CO;2
Article Google Scholar
Strahler (1953) Revisions of Horton’s quantitative factors in erosional terrain. Trans Am Geophys Union 34:345
Google Scholar
Strahler, (1957) Quantitative analysis of watershed geomorphology Eos. Trans Am Geophys Union 38:913–920. https://doi.org/10.1029/TR038i006p00913
Article Google Scholar
Strahler (1964) Part II. Quantitative geomorphology of drainage basins and channel networks handbook of applied hydrology. McGraw-Hill, New York, pp 4–39
Taofik OK, Innocent B, Christopher N, Jidauna GG, James AS (2017) A comparative analysis of drainage morphometry on hydrologic characteristics of Kereke and Ukoghor basins on flood vulnerability in Makurdi Town, Nigeria. Hydrology 5:32. https://doi.org/10.11648/j.hyd.20170503.11
Article Google Scholar
Tarboton DG, Bras RL, Rodriguez-Iturbe I (1991) On the extraction of channel networks from digital elevation data. Hydrol Process 5:81–100
Article Google Scholar
TaRT H (1986) Generalized additive models. Stat Sci 1:297–310. https://doi.org/10.1214/ss/1177013604
Article Google Scholar
Tramblay Y, Ouarda TBMJ, St-Hilaire A, Poulin J (2010) Regional estimation of extreme suspended sediment concentrations using watershed characteristics. J Hydrol 380:305–317. https://doi.org/10.1016/j.jhydrol.2009.11.006
Article Google Scholar
Tsakiris G, Nalbantis I, Cavadias G (2011) Regionalization of low flows based on canonical correlation analysis. Adv Water Resour 34:865–872. https://doi.org/10.1016/j.advwatres.2011.04.007
Article Google Scholar
Verstappen H (1983) Applied Geomorphology. Elsevier, Amsterdam
Google Scholar
Vijith H, Satheesh R (2006) GIS based morphometric analysis of two major upland sub-watersheds of Meenachil river in Kerala. J Indian Soc Remote Sens 34:181–185. https://doi.org/10.1007/BF02991823
Article Google Scholar
Wan Jaafar W, Liu J, Han D (2011) Input variable selection for median flood regionalization. Water Resour Res. https://doi.org/10.1029/2011WR010436
Article Google Scholar
Wazneh H, Chebana F, Ouarda TB (2016) Identification of hydrological neighborhoods for regional flood frequency analysis using statistical depth function. Adv Water Resour 94:251–263. https://doi.org/10.1016/j.advwatres.2016.05.013
Article Google Scholar
Wood (2006) Generalized additive models: an introduction with R. CRC press,
Youssef AM, Pradhan B, Hassan AM (2011) Flash flood risk estimation along the St, Katherine road, Southern Sinai, Egypt using GIS based morphometry and satellite imagery. Environ Earth Sci 62:611–623. https://doi.org/10.1007/s12665-010-0551-1
Article Google Scholar

Download references

Acknowledgements

Financial support for the present study was graciously provided by the Natural Sciences and Engineering Research Council of Canada (NSERC), the Canada Research chairs program (CRC) and the University Mission of Tunisia in Montreal (MUTAN). The authors would like to thank Christian Charron for his valuable help and input. The authors are grateful to Natural Resources Canada (https://www.nrcan.gc.ca/earth-sciences/geography/topographic-information/download-directory-documentation/17215) and the USGS (https://earthexplorer.usgs.gov/) services for the employed DEM and NHN data. The authors would like also to thank the Ministry of Sustainable Development, Environment, and Fight Against Climate Change (MDDELCC) services for the used dataset (STA). The authors are grateful to the Editor-in-Chief, Dr. George Christakos, and to three anonymous reviewers for their comments which helped improve the quality of the manuscript.

Author information

Authors and Affiliations

Canada Research Chair in Statistical Hydro-Climatology, INRS-ETE, 490 de la Couronne, Québec, QC, G1K 9A9, Canada
A. Msilini & T. B. M. J. Ouarda
London School of Hygiene & Tropical Medicine (LSHTM), 15-17 Tavistock Pl, London, WC1H 9SH, UK
P. Masselot

Authors

A. Msilini
View author publications
You can also search for this author in PubMed Google Scholar
T. B. M. J. Ouarda
View author publications
You can also search for this author in PubMed Google Scholar
P. Masselot
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. Msilini.

Ethics declarations

Conflict of interest

The author declares no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Msilini, A., Ouarda, T.B.M.J. & Masselot, P. Evaluation of additional physiographical variables characterising drainage network systems in regional frequency analysis, a Quebec watersheds case-study. Stoch Environ Res Risk Assess 36, 331–351 (2022). https://doi.org/10.1007/s00477-021-02109-7

Download citation

Accepted: 27 September 2021
Published: 12 October 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s00477-021-02109-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of additional physiographical variables characterising drainage network systems in regional frequency analysis, a Quebec watersheds case-study

Abstract

Similar content being viewed by others

Development of a kriging-based regional flood frequency analysis technique for South-East Australia

Regional L-Moment-Based Flood Frequency Analysis in the Upper Vistula River Basin, Poland

Regional L-Moment-Based Flood Frequency Analysis in the Upper Vistula River Basin, Poland

1 Introduction

2 Variables characterizing drainage networks

2.1 Stream order (U)

2.2 Texture ratio (RT)

2.3 Circularity ratio (RC)

2.4 Stream length ratio (RL)

2.5 Mean bifurcation ratio (MRB) and weighted mean bifurcation ratio (WMRB)

2.6 RHO coefficient (ρ)

2.7 Drainage density (DD)

2.8 Stream frequency (FS)

2.9 Infiltration number (IF)

2.10 Ruggedness number (RN)

3 Theoretical background

3.1 Delineation of homogeneous regions

3.1.1 Canonical correlation analysis (CCA)

3.1.2 Region of influence (ROI)

3.2 Regional estimation approaches

3.2.1 Linear regression model

3.2.2 Generalized additive model

4 Methodology

4.1 Regional models

4.2 Selection of explanatory variables

4.3 Models validation

5 Case study and datasets

6 Results and discussion

7 Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation