1 Introduction

Debris flows are fast mass movements, formed by a mix of water and solid materials, which mostly occur in steep torrents. They represent a major risk to human settlements and infrastructures in mountainous areas. The basal entrainment of channel-bed material is a common feature of debris flows. Debris-flow torrents often consist of colluvium and other types of coarse sediment, which can be partially or totally incorporated into the flowing mass (Pierson and Scott 1985; Hungr et al. 2005). The entrainment of loose and unconsolidated material along the flow path affects fundamental parameters used for debris-flow hazard assessment, since it directly influences the volume and flow dynamics (Mangeney et al. 2010; Iverson et al. 2011). In debris-flow hazard assessment, the erosion rate is a common term to describe the amount of material incorporated into the flowing mass per unit of length (Hungr et al. 1984).

The total volume of the debris flow can be considerably enlarged due to the entrainment (Hungr et al. 2005; Guthrie et al. 2009; Berger et al. 2011). Studies on individual events with remarkable erosion are available in literature (Scheidl et al. 2008; Theule et al. 2012), as well as in some regional datasets on entrainment (Fannin and Rollerson 1993; Hungr et al. 2005; Gertsch 2009). These data revealed that the material entrained along the flow path can reach a high percentage (up to 90 %) of the total volume of the events. For this reason, it is a key question to predict the potential volume that can be entrained along the flow path.

The theoretical basis to evaluate the entrainment and to predict erosion rates has strongly improved (e.g. Iverson 2012). However, the practical application of this concept has not yet been solved. Most of the approaches that determine the volume of debris flows are based on estimations of the entire volume of the event, while only few specifically incorporate the effect of entrainment. The approaches to estimate the event volume could be divided into two principal groups: (1) empirical methods and (2) physically based methods.

The empirical methods can be separated into two groups: simple equations and field or geomorphological methods. The simple equations are usually based on few terrain parameters. Many authors have developed this type of equations, such as VanDine (1985), Takei (1984), Zeller (1985), Franzi and Bianco (2001), Marchi and D’gostino (2004), Dong et al. (2009) among others. These approaches incorporated the catchment area and correlated it to the volume of the debris flow by means of expressions such as V = kA b. Other authors included one or two additional parameters such as the torrent slope and an adimensional factor of torrentiality (Kronfellner-Kraus 1985), the torrent slope and a geological index (D’Agostino 1996) or the run-out distance instead of the catchment area (Rickenmann 1995). These approaches are defined to be applied at the entire torrent scale, without considering local changes in the torrent characteristics and erosion rates. The field or geomorphological methods consist of estimations of erosion rates based on the morphology of the torrent or its specific reaches. These methods consist of geomorphological descriptions of the torrents to identify and to quantify in some occasions, the potential sediment sources (Fannin and Rollerson 1993; Hungr et al. 1984; Spreafico et al. 1999; Fannin and Wise 2001).

The physically based methods include both static and hydrodynamic approaches and describe the mechanics of the channel-bed scouring and lateral bank failures. Many of these approaches have been derived from laboratory observations (Egashira et al. 2001; Rickenmann et al. 2003; Papa et al. 2004) or a large outdoor flume (Iverson et al. 2011). Some numerical models have included physical approaches into the codes (e.g. McDougall and Hungr 2004; Crosta et al. 2009; Medina et al. 2008), while others use algorithms derived from empirical approaches (Degetto et al. 2011) or simply exclude the effect of this mechanism (Denlinger and Iverson 2001).

Besides the uncertainties of defining the mechanism of the entrainment itself, the difficulties start with the ambiguities in the definition of the governing factors of the process. The empirical methods have typically incorporated terrain factors, which can be assessed by simple topographical calculations or by field campaigns. However, the influence of each factor in the erosion rate has only been studied by few authors (Chen et al. 2005; Hungr et al. 2005; Guthrie et al. 2009; Mangeney 2011). The results show a large scattering, which suggests that the mechanisms governing the entrainment are complex and that different factors influenced the entrained volume.

The main goals of this publication are threefold: first, we present and analyse the database collected at recent debris flows in order to determine the characteristics of the entrainment and the governing factors. Second, we propose and compare two approaches related to learning techniques to predict the amount of material that can be entrained from the channel bed along the flow path. Third, we present a simple and general approach to estimate the entrainment, which should be applicable without extensive field work.

2 Study regions and database

Entrainment data on 17 debris flows were collected in this study. All the debris flows are granular-type (Coussot and Meunier 1996). Data collection was carried out during field surveys and was complemented using a geographic information system (GIS). The surveyed debris-flow torrents are located in the Pyrenees and the European Alps (Fig. 1).

Fig. 1
figure 1

Location of the surveyed debris-flow torrents in the Pyrenees (a) and the European Alps (b). All the torrents used in this study (training set, validation set and test set) are shown

The central-eastern sector of the Pyrenean mountain range limits the Spanish autonomy of Catalonia from France and also includes the Principality of Andorra. The highest peaks show altitudes slightly over 3,000 m a.s.l. and are located in the central Pyrenees. From a geological point of view, the bedrock of the Axial Pyrenees mainly consists of a basement of igneous and metamorphic Paleozoic rocks folded, intruded and metamorphosed during the Hercynian orogeny (Muñoz 1992). The southern outer part of the range, coinciding with the Pre-Pyrenees, is composed of sedimentary sequences mostly of Mesozoic ages. The superficial deposits predominantly consist of colluvium. The sites (13) are located in periglacial areas, where weathering, erosion and hill slope processes are the principal agents for sediment supply in the channels. Most catchments are weathering-limited or supply-limited systems after the classification of Bovis and Jakob (1999). The debris-flow activity in the Pyrenees is lower than the activity of other mountain ranges; however, recent studies prove that in some catchments, it is much higher than expected (Hürlimann et al. 2013). The events took place during a short period of time and high-intensity rainfall episodes related to convective summer storms.

Four debris flows occurring in the European Alps (A) were analysed (Table 1). The European Alps cross Western Europe along ~1,200 km and include many peaks over 4,000 m a.s.l. (the maximum elevation is 4,810 m). In short, they comprise different nappes (Pennine, Helvetic and Austroalpine) of sedimentary to metamorphic rocks and a crystalline basement. The three Swiss events are located in areas where the Helvetic nappe, and partially the crystalline basement crop out, while the Austrian event is situated in the National Park ‘Gesäuse’ within the northern limestone Alps. All the sites are located in periglacial areas, affected by high-mountain morphologic processes and by a minor importance of the erosional activity of the torrents. All events occurred during the summer and were triggered by convective storms with high rainfall intensities.

Table 1 Main characteristics of the debris flows included in the database

The events selected for this study are constituted by reaches with a wide range of erosion rates. Therefore, the database includes a large variety of governing factors and erosion. The database contains events initiated both by a landslide and by run-off, and the final volumes range from hundreds to tens of thousands of cubic metres (Table 1). These final volumes of the events were estimated from our field data or taken from technical reports (Marquis 2006, 2008; Pinyol 2008; Oller and Pinyol 2009). The field surveys were mostly carried out days or weeks after the events. Only in a few cases, the data collection took place after a longer period of time.

The estimates of erosion rates were based on the field observations of erosion evidence and reconstruction of the sediment disposition before the event (Fig. 2). For some reaches, an airborne LIDAR digital elevation model (of the pre-event situation) was available. For the other reaches, where a digital elevation model was not available, the reconstruction of both channel bed and banks was done based on expert criteria. The morphologic comparison between parts of the cross section that were not affected by the event (above the maximum flow depth) and the parts where the entrainment occurred was a key point for the reconstruction. The estimated erosion rates include changes on the cross section (including channel bed and banks) but not lateral failures that can occur in the channel-adjacent slopes. Finally, for each event, two types of input data were incorporated into the database: (1) field observations and (2) morphometric data derived from digital elevation models (DEM) with pixel size ranging from 1 to 5 m.

Fig. 2
figure 2

Sketch of the cross section of a channel reach indicating how erosion rate was estimated

3 Methods

3.1 Governing factors

The 17 torrents were divided into reaches according to similar geomorphological features and erosion rates. Thus, the final database contains 110 reaches of debris-flow torrents. The length of the reaches ranges from tens of metres to more than one thousand metres.

In every reach, different terrain features were determined, herein called governing factors. Four governing factors were selected for each reach i: (1) reach-averaged slope (S i ), (2) sediment availability (SA i ), (3) cross section shape (CS i ), and (4) upstream-contributing area (UCA i ). These factors can be divided into two groups depending on the collecting method: (1) three field factors (S i , SA i , CS i ) and (2) one topographical factor (UCA i ). The factors can also be distinguished between numerical (S i , UCA i ) and categorical ones (SA i , CS i ). The choice of these factors was based on: (1) the simplicity of their collection, (2) field observations that prove their influence on the entrainment process (regarding the field governing factors) and (3) experiences obtained from previous studies (e.g. Hungr et al. 2005).

3.1.1 Reach-averaged slope

The reach-averaged slope of a specific reach (S i ) is a numerical factor measured in degrees. In our work, the slope of the reaches was measured in the field using a standard clinometer: one single measurement for short reaches (few tens of metres) and several ones for longer reaches, which are then averaged to obtain the reach-averaged slope. The slope of the longest reaches was compared to the one provided by the DEM. This comparison showed that the values obtained from both methods were similar.

Some authors studied the relation between the erosion rate and the slope of the torrent reaches or of individual points in the channel. In some cases, the relation shows that the slope increases, as the entrainment rate enlarges (Rickenmann and Zimmermann 1993; Guthrie et al. 2009). This result would stress the importance of the slope on the entrainment process. In other circumstances, however, it has been observed that there is a feedback effect, which means that the entrainment is higher at low slopes. The justification of this effect was the increasing flow concentration or peak discharge with longer distances, which is normally associated with a slope decrease (Breien et al. 2008). Other studies contradict these two cases indicating that there is no clear trend, as the erosion rate is completely related to the availability of sediment in the reach (Hungr et al. 1984; Marchi et al. 2009).

3.1.2 Sediment availability

The sediment availability in reach i (SA i ) is a categorical factor. According to our observations in the field, it is one of the most influencing factors in the entrainment during a debris flow. In some torrents, reaches may show unlimited sediment (here called “unlimited” reaches). Thus, the erosion produced is totally conditioned by other factors. While in other reaches, the lack of sediment limits the entrainment. The importance of sediment availability on the entrainment has been demonstrated in several occasions by other authors (Bovis and Jakob 1999; Jakob et al. 2005).

In this work, five classes were described depending on how much material is available in the torrent reach. The distinction of classes is based on the percentage of the cross section that is covered by a certain thickness of sediment (Table 2).

Table 2 Sediment availability classes

3.1.3 Cross section shape

The reach cross section shape (CS i ) is a categorical factor. Although the cross section shape is not considered in previous works as a governing factor for the entrainment process, we incorporate this aspect, because it illustrates the degree of incision of the channel before the event and thus affects the flow dynamics.

In order to determine the shape of the cross section, the concept proposed by Gabet and Bookter (2008), who studied the shape of some gullies in south-west Montana (USA) based on a shape index, was adapted. The width was established at a certain height above the thalweg before the event, being the relevant dimension to describe the reach incision. The height was chosen with expert criteria, as an estimate of the flow depth that achieved the debris flow in the specific reach. In order to simplify field work and subsequent data analysis, the cross section shape factor was divided into the three classes “wide”, “moderately incised” and “incised” instead of using the exact width (Table 3). The class limits were established based on the observation during the field surveys in the torrents.

Table 3 Cross section shape classes

3.1.4 Upstream-contributing area

The reach’s upstream-contributing area (UCA i ) is a numerical factor that can easily be determined in GIS. It is calculated as the sum of map units (pixels) that drain water and other substances to the lowest point of a reach. The sum value of the pixels is obtained by means of the flow accumulation tool found in ArcGIS® (ESRI 2005), using a DEM.

The upstream-contributing area is applied in this work to illustrate the differences of discharge and sediment transport between reaches. For example, a debris flow can initiate into a similar concentration as a hyper concentrated flow in the upper reaches of the torrent (smaller upstream-contributing area) and be totally developed to a mature debris flow lower at the fan apex, where the upstream-contributing areas are larger.

The general patterns were analysed for each governing factor (S i , CS i , SA i , UCA i ) and for the target variable (e i ). The analysis was performed by means of histograms and correlations between variables for the entire database of 110 reaches.

3.2 Multiple linear regression

The multiple linear regression (MLR) is a learning technique, whose intent is to find out the best linear relationship between the erosion rate in a particular reach (e i ) and the governing factors using the following expression:

$$e_{i} = a_{1} S_{i} + a_{2} {\text{SA}}_{i} + a_{3} {\text{CS}}_{i} + a_{4} {\text{UCA}}_{i} + b.$$
(1)

The coefficients a 1 , a 2 , a 3 , a 4 and b are determined by minimizing the sum of the squared errors originated by the approximation of the erosion rate in every reach of the dataset. A numerical value was attributed to the categorical factors (CS and SA). For example: a “wide cross section shape” is represented by 1, a “medium cross section shape” by 2 and an “incised cross section shape” by 3. The same was done with the SA, although numbers range from 1 (complete limitation of sediment) to 5 (no limitation of sediment). Therefore, the resulting erosion rate is a numerical value.

Previously to the application of the MLR to the training set (93 reaches), the distribution of the variables was taken into account. If some of the variables in the dataset present a distribution far from normal, it is common to perform a transformation of the dataset. By transforming the data, a normal distribution of the values of a certain variable was achieved (Gartner et al. 2008).

3.3 Decision tree

Decision trees are a family of learning techniques that have been used by several authors in the field of natural hazards (Wan and Chiang Lei 2009; Chevalier et al. 2013). DTs allow the use of categorical values directly. They are based on two main elements: nodes and leafs. At each node, an attribute (governing factors) is tested. The procedure consists in comparing the attribute with a certain value (in numerical attributes) or choosing a specific category (in categorical attributes). At a certain level of the tree, a leaf is reached, which means that a final category of the target variable is achieved.

To create the DT, we used the data mining-software WEKA (Hall et al. 2009). Data mining is the process of discovering patterns, sometimes hidden, in a database (Fayyad et al. 1996). WEKA incorporates a total of 77 algorithms of learning techniques, from which 16 are DTs. Many of them were tested, achieving the best results in the J48 DT. The algorithms of the DTs are not discussed herein but are explained in Breiman et al. (1984). That is why herein only results from DT J48 are presented. We selected a 10-fold cross validation method to optimize the tree.

The DT learning process can present problems, if the target variable is not equally distributed. For this reason, the cost matrixes are a common strategy in the definition of DTs in imbalanced datasets (Witten et al. 2011). They are implemented as a metaclassifier during the learning process to force that all of the target classes appear in the tree. The matrix components provide a cost value for each class that would be misclassified into another class. Another problem in the definition of DTs is the overfitting, which creates extremely complex trees. One of the strategies to avoid overfitting is pruning (Witten et al. 2011). The pruning parameters are set in the classifier and affect the size of the tree and its complexity.

3.4 Validation and test of models

The evaluation of the success of the models was structured in two parts. Firstly, the two models were assessed by comparing which of them fits better with the validation set (ten reaches, representative for the whole dataset and randomly selected). Secondly, both models were tested in a torrent where the final volume of the event and the erosion rates of the reaches are known (test set, seven reaches). In this part, a comparison between the models (MLR and DT) and two existing empirical formulae was carried out.

The first part (validation) consists in evaluating the success of the models obtained by the learning techniques. The success of different predicting models is usually based on the receiver-operating characteristics, also called ROC analysis (Fawcett 2006). Common parameters used for ROC analysis are the true positive rate (TPR) and the false positive rate (FPR). Typically, the values of the TPR and FPR are plotted in a ROC graph. Then, the values are compared to the diagonal line (y = x), in case the predicted variable has two classes.

Since our predicted variable (the erosion rate) has more than two classes, the ROC analysis and Precision–Recall curves become more complicated. Thus, the ROC space is n-dimensional being n the number of classes of the variable and a visualization in the bi-dimensional ROC space is impossible. Therefore, we calculated the F-measure parameter to test the performance of the models for each class of entrainment. This F-measure parameter can be expressed by:

$$F = \frac{{2 \cdot {\text{precision}} \cdot {\text{recall}}}}{{({\text{precision}} + {\text{recall}})}}.$$
(2)

This expression combines the precision (true positives over total positives) and the recall (true positives over the sum of true positives and false negatives) of each class. This is the weighted harmonic mean of these two parameters (Hripcsak and Rothschild 2005). The comparison of the F-measure for each entrainment class illustrates whether a model is especially advantageous for a certain entrainment class. This could be meaningful in case the class is especially representative in the database.

The second part (test) consists in applying the models to a torrent from the database that was used neither in the training nor in the validation processes. The Schipfenbach torrent (Fig 1b; Table 1), which includes seven reaches, was selected as test set because of its variability in the governing factors and the erosion rates. The two learning techniques as well as two simple empirical methods were applied to the Schipfenbach torrent and the final debris-flow volume was calculated for each methodology. The observed total volume of entrainment along the torrent is known from field observation after the event (Hürlimann et al. 2003).

The two selected empirical formulae are one proposed by Rickenmann (1995)

$$\theta_{f} < 15\;\% :\,V = (6.40_{f} - 23)\;L;\,15\,\% < \theta_{f} < 40:\,V = (110 - 2.5\,\theta_{f} )\;L,$$
(3)

where θ f is the fan slope in (%), V is the volume of the event and L is the run-out length; and another suggested by D’Agostino (1996)

$$V = 45\;A^{0.9} \theta^{1.5} ({\text{I}} . {\text{G}} .),$$
(4)

where θ is the mean slope of the channel; I.G. is a geological index. For the DT, two different volumes were computed taking into account the minimum and maximum values of the erosion rate classes for all the reaches.

4 Results

4.1 Statistical distribution of the governing factors

The database contains a total number of 110 torrent reaches located in the Pyrenees (84) and the European Alps (19). Cumulated frequency graphs of the numerical variables and histograms of the four governing factors and the target variable are shown in Fig. 3. In the same Fig. 3, the histograms of the reaches in the test set are also shown.

Fig. 3
figure 3

Histograms of the four governing factors (a, b, c, d) and the erosion rate (e) for the 110 reaches included in the database. The cumulated frequency is shown for the numerical governing factors (a, d) and erosion rate (e). See Table 2 for abbreviations of sediment availability classes

The histogram of the reach-averaged slope distribution (Fig. 3a) presents a peak in the slope bin of 10°–20°. The most frequent values of the reach-averaged slope are between 10° and 40°, with a mean value of 23° and a standard deviation of 9°. Steeper (>40°) and smoother (<10°) reaches are of low frequency in our dataset, ~6 and ~8 %, respectively. These results fit other published works, in which channel slopes showing erosion range from 18° to 27° (Marchi and D’Agostino 2004), or are higher than 15° (Hungr et al. 2005).

Almost 40 % of the reaches in the database are unlimited (NL) in terms of sediment availability (Fig. 3b) and more than half of the reaches have NL or LL. The cross section shape distribution revealed that incised reaches are the most common ones in our database (Fig. 3c).

The upstream-contributing area of the reaches in the database takes values up to 8.2 km2 (Fig. 3d). This value and the total drainage areas (see Table 1) support the hypothesis that a small catchment area is a typical feature for debris-flow occurrence (e.g. Rickenmann and Zimmermann 1993). The histogram shape is strongly skewed to the left, and almost 60 % of the reaches have an upstream-contributing area lower than 1 km2.

The erosion rate-cumulated frequency graph shows a gradual increase: sharper for the lower values of the erosion rate and rising slowly for the higher ones (Fig. 3f). The prevalence of low erosion rates (of less than 2.5 m3/m) is also illustrated in the histogram. Although high erosion rates were reported in some reaches, the values reported in our database are similar to the lower classes of erosion rates from similar studies, such as Hungr et al. (1984). This can be justified by the exclusion of lateral bank failures and the glacial geological context of Hungr’s work.

In order to identify possible relations between governing factor and the target variable or among themselves, many governing factor plots were visualized. In spite of the high dispersion, few general trends could be identified. Regarding the graph that correlates the reach-averaged slope versus erosion rate, a general linear increasing trend between the slope and the erosion rate for the less limited sediment classes (NL and LL) can be observed (Fig. 4a). In the case of more limited sediment classes, the trend cannot be observed. The influence of the cross section shape can be observed in Fig. 4b, where it can hardly be observed that higher erosion rates occur preferably in incised reaches than in moderately incised or wide reaches.

Fig. 4
figure 4

Relations between slope and erosion rate depending on sediment availability (a) and channel cross section shape (b) for the 92 reaches of the database

4.2 Multiple linear regression

The histograms of the previous section show that some variables present a distribution far from normal, particularly skewed to one side: upstream area, erosion rate, sediment availability or cross section shape. As mentioned above, a transformation of the values of these variables could help to get better results in the learning process. Only two of these variables are numerical; and therefore, a transformation of the values could be applied to their values: erosion rate and upstream area. We performed two different types of modifications on these variables in order to transform them in normal-distributed variables. We carried out a log-transformation and ln-transformation. Moreover, to ensure the importance of each of the variables in the analysis, we carried out a step-by-step MLR analysis. In each step, we added a new variable. Therefore, we can ensure that adding variables increases the success of the methodology.

The results of the MLR applied to the original training set (without transformation of area upstream and erosion) and the transformed training sets are shown in Table 4a. Two parameters were chosen to compare the three attempts: the R 2 and the standard error. The models with highest R 2 value and lowest standard error are the ones with best performance. Thus, when the erosion rate and the area upstream are log-transformed, best results are obtained. The purpose of the second part of the analysis was to verify, whether accuracy depends on the quantity of governing factors included in the MLR. The results confirm that the best performance is achieved when using all the parameters (Table 4b).

Table 4 Results from MLR analysis. (a) Different attempts with different data transformations. (b) Step-by-step analysis of the different parameters (log-transformed values of upstream-contributing area and erosion rate)

Therefore, the final equation obtained by the MLR is a first-order polynomial expression of five variables and can be expressed by:

$$\log (e_{i} + 1) = 0.007\;S_{i} + 0.102\;{\text{SA}}_{i} - 0.066\;{\text{CS}}_{i} + 0.053\;\log ({\text{UCA}}_{i} + 1) - 0.312.$$
(5)

4.3 Decision tree

The resulting tree applying the J48 algorithm to the training set is shown in Fig. 5. The predicted erosion rate was divided into four classes with an interval of 2.5 m3/m (the same classes as the ones shown in the histogram in Fig. 3e). The interval of 2.5 m3/m is considered to be a reasonable resolution of an erosion rate class for our database. In addition, such a classification of this parameter supports our goal to propose a simple methodology.

Fig. 5
figure 5

Decision tree for entrainment estimate using the training set of our database applying the algorithm J48

First, a cost matrix was used with the intention to avoid the problems derived from the imbalance of the erosion rate distribution. The cost matrix was defined in order to force that the four classes of the erosion rate appear in the final DT. It was done by means of attributing higher costs to the misclassification of reaches with larger erosion rates (less present in the database). Second, the pruning was set by a confidence factor of 0.25 after several attempts. This confidence factor provided a DT with a maximum of three levels, which was acceptable according to the aim of defining a simple model.

The first split of the tree refers to the sediment availability, and the branches with fewer availability of sediment (CL and HL) already reach the final leaf. These leafs correspond to the class of the lowest erosion rates. The branches with more sediment available (ML, LL and NL) need one or two additional levels to reach the leaf of the value of the erosion rate.

The cross section shape is the second factor appearing in two of the classes of sediment availability, while the average slope of the reach exists in one. In the following level of the tree, the slope or the upstream-contributing area is located. In general, the dominant class in the DT is the lowest erosion rate class: “0 ≤ e i  ≤ 2.5 m3/m”, appearing in half of the leafs of the DT. The maximum erosion classes only appear in steep reaches (<32°) with unlimited sediment availability.

4.4 Validation and test of models

The validation of the two models was based on the F-measure parameter, as mentioned in previous sections. The F-measure was calculated for each of the classes of the erosion rate, both in the training set and in the validation set. The values in Table 5 suggest that both techniques show good performance in the lowest class of erosion rate (up to 2.5 m3/m) in both training set and test set. This is a reasonable result, as this is the most frequent erosion rate class. However, for the larger erosion rate classes, MLR does not offer good results. Thus, the weighted average of the F-measure was calculated to get an overall evaluation. Regarding the weighted average, the DT learning technique showed better results, especially in the training set, but also in the validation set.

Table 5 Values of the F-measure parameter for each class of the two models to predict the entrainment in the training and the validation set

The test of the models revealed two main ideas (Fig. 6): first, MLR-estimation and the maximum value estimated by DT coincide very well with the observation, while the minimum value estimated by DT underestimates the volume. Second, the volumes determined by the two empirical relationships strongly overestimate the observed volume by more than one order of magnitude. This result can be attributed to various facts: empirical equations always have a large scatter, and the results can consequently have important errors; the empirical techniques were developed in a specific area, and their application to other regions may generate inaccuracy.

Fig. 6
figure 6

Comparison of the observed total volume of the debris flow occurred in the Schipfenbach torrent with the volumes obtained by the DT (minimum and maximum values), MLR and empirical formulae

5 New general approach to estimate entrainment

We selected the DT model to propose a new general approach that estimates the entrainment. Empirical formulae may be adequate for a preliminary estimate without time-consuming field work, but they present several uncertainties, as mentioned in the review section and as observed in Fig. 6. In addition, a comparison between the DT and MLR showed several advantages of the DT: (1) it is simple, intuitive and logical, (2) it can be easily adaptable to other regions, and (3) it showed a better success than the MLR in terms of statistical parameters.

The new approach is based on both the tree obtained from our database and expert criteria. The resulting DT is presented in a qualitative form (Fig. 7) and may be adapted by introducing specific values of the different parameters according to the characteristics of the region. Several outputs of the study are included in our general approach. Firstly, sediment availability is the most influencing factor. Second, the reaches with low sediment availability (HL, CL) directly reach the leaf of the corresponding erosion rate, independently of other factors. Thirdly, for the reaches with more sediment available (ML, LL, NL), the key factors are the slope and shape. The upstream-contributing area was excluded in this final approach to simplify the DT, and because our results pointed out that it is a factor of minimum influence. The erosion classes were simplified into three classes in contrast to the four classes used in the datasets of our study.

Fig. 7
figure 7

General DT proposed as a learning model to estimate the erosion rate in a torrent reach

It is important to keep in mind that the DT was developed on a dataset that includes event volumes from thousands up to a few tens of thousands of cubic metres (see Table 1). Therefore, extreme events (hundreds of thousands or even millions of cubic metres) are not considered. Thus, results of this study are representative for medium and high-frequency debris flows that occur in the Pyrenees and Alps, but not for extreme or low-frequency events with considerably larger volumes. Last but not least, if the method will be applied to other mountainous regions, the values of both the governing factors and the erosion rates should be adapted.

6 Conclusions

The entrainment during a debris flow is a complex process. There are different methods to predict the erosion along a debris-flow trajectory and total volume of an event. While empirical equations are straightforward and useful to obtain preliminary values of debris-flow volume, detailed information on topographical data and field observations is necessary to determine the erosion rate more precisely.

The preliminary analysis of the field and topographical information collected in 110 channel reaches of 17 debris flows supported the outcomes of previous studies that there is no clear correlation between the erosion rate and pairs of governing factors. This has also been observed by other authors (Chen et al. 2005; Hungr et al. 2005), which reinforces the complexity of the entrainment process.

Learning techniques were applied in order to obtain knowledge that describes a model that is able to predict the erosion rate in a torrent reach based on several governing factors. The entrainment showed better performance in the DT J48 model than in MLR model, even if the MLR model revealed good results in the application to our test case (Schipfenbach torrent). The resulting DT revealed that the sediment availability is the most relevant factor in the erosion process, followed by reach-averaged slope and cross section shape. In this study, the sediment availability in a torrent reach was defined by five different classes according to the overall relation between bedrock and colluvium in the cross section. It must be stated that colluvium was mostly poorly consolidated and highly erosive material accumulated from older torrential and hill slope processes. These specific conditions of the material properties might have affected the importance of this factor and should be checked, if the model will be applied in different environments.

Finally, we proposed a simple prediction model for debris-flow erosion rate in channel reaches. The approach is based on both the DT obtained from our database and expert criteria. We presented the DT in a qualitative way in order to make it adaptable to other regions. It must be stated that this approach only estimates the entrainment directly related to both channel-bed erosion and local failures of channel banks neglecting large failures in torrent slopes.

In spite of the limited number of reaches in our database, the outcomes of this study improve the understanding of debris-flow entrainment. In addition, the general DT represents a useful tool to estimate the total event volume, since the determination of the volume without considering the entrainment could considerably underestimate the real volume involved in the event. Regarding a hazard assessment at catchment scale, the general DT could be applied to any torrent, after calibrating the limit values of the classes of the geomorphological factors. Moreover, the values of the erosion rates should be adapted to the values observed in each specific region, since they may be higher than the ones in this study.