Introduction

Seeking to shed light on the characteristics of landslides, numerous papers in the literature have examined the features of landslide movement and have proposed such forms as falls, topples, slides, translation, lateral spread, and flows (Hutchinson 1988; Cruden and Varnes 1996), where flow is the characteristic movement of debris flows. Borgomeo et al. (2014) and Bordoni et al. (2018) found that landslides generally occurred where the slope was > 10°, with falls usually occurring on 45° - 90° slopes, rotational slides largely on \( 15{}^{\circ}-40{}^{\circ} \), and planar on \( 30{}^{\circ}-70{}^{\circ} \) slopes. Debris flows (DF) typically occur on slopes < 30°. Taiwan is very susceptible to slope-failure disasters due to her complex geomorphology and seismic activity. Mountainous terrain with steep slopes and large elevation differences occupies roughly 75% of the island, and the geology is fragile. Due to even greater geological fragility following earthquakes, and torrential rainfall induced by climate abnormalities, there has been a wide spread of landslides. In recent years, due to increased extreme rainfall events, there have been a growing number of large-scale landslides. Jacobs et al. (2017) among several other researchers have reported landslides associated with extreme rainfall events in the Rwenzori Mountains in Uganda; Bordoni et al. (2018) in Oltrpo, Italy; Dille et al. (2019) in DR Congo etc. These landslides have been broadly classified into either shallow or large landslides. Shallow landslides (SL) have a shallow slip plane, and typically small surface dimensions and a relatively short-distance movement (Santacana et al. 2003; Giannecchini 2006; Chigira and Yagi 2006). Most are induced by river or road cutting. On the contrary, large landslides (LL) are induced by gravitational slope deformation, and are thought to be a common manifestation of a creep, and have a relatively deep slip plane (Hutchinson 1988; Dille et al. 2019). Tamura et al. (2008) define deep-seated landslides as having an area greater than 10 ha or depth greater than 10 m or volume greater than 1 × 105 m3. In this study, we defined large landslides in an almost similar manner except that LL were defined as having a depth greater than 2 m and volume greater than 2 × 105 m3. For practical applications in Taiwan as highlighted by Liu (2009), the classification of landslides is simplified to five main categories modified from Varnes (1978). These include rock falls, shallow-seated landslides, deep-seated landslides, dip-slope and wedge slides, and debris flows. Henceforth, the classification adopted in this study falls within the acceptable scheme in Taiwan.

Topographical features play an important role in determining the location and abundance of landslides (Guzzetti et al. 2008). Meunier et al. (2008), Regmi and Walter (2019), and Bhardwaj et al. (2019) have investigated the characteristics of landslides triggered by earthquakes and rainfall, and have employed different characteristics and topographical factors to analyse landslide morphology. Meunier et al. (2006), Zhuang et al. (2018), and Othman et al. (2018) suggested that landslides induced by earthquakes are concentrated near ridge crests, with over 65% originating in upper quadrant of slopes, while those induced by rainfall are often on lower slopes. Dai and Lee (2002) analysed the distribution of landslide length, width, and depth on the basis of such topographical factors as geology, lithology, elevation, slope direction and characteristics, gradient, and land use. They found the frequency of landslides to gradually increases with slope, until at about \( 35{}^{\circ}-40{}^{\circ} \), beyond which it gradually decreased. In addition, the study found landslide length to increase with width or volume and concluded that landslide length had the best correlation with the change in slope. Numerous researchers have also used topographical factors in conjunction with various statistical methods, such as logistic regression (Guzzetti et al. 2008), weighting, and correlation testing to create landslide susceptibility maps (Regmi et al. 2010; Regmi and Walter 2019). Guzzetti et al. (2008) found landslides to generally occur in soft, weak rock formations with irregular layering, and the abundance of landslides increases with the landslide area, which is linked with the conclusion that geological conditions have a very close relationship with landslide location and frequency (Cardinali et al. 2002). However, other natural conditions also have an effect on landslide location and frequency (Guzzetti et al. 2012). Besides using topographic factors, some studies have applied quantitative assessments to show important parameters in landslide mapping such as elevation, standard deviation of elevation, and slope (Lagomarsino et al. 2017). Although many papers have described landslide locations and morphology, a little has been done on the relationship between landslide volume and morphology. Hence, this study aims to fill that gap. Additionally, the defined aimed to define some geometrical rules to characterize landslides. Landslides were first classified as SL, LL, and DF. Topographical characteristics of each were obtained for in-depth analysis. Landslide length, width, depth, volume, and area were used to draw ternary diagrams proposed to classify landslides in conjunction with logistic regression models.

Materials and methods

Study area

Taiwan is located on a convergent and compressive boundary along the Eurasian and Philippines sea plate, and is mainly characterized by sub-tropical and monsoon climate with plentiful rainfall. The average annual rainfall is about 2500 mm and is concentrated between May and October from typhoons and torrential rains. The country is a young orogenic belt underlain by mostly tertiary geo-synclinal sediments having a thickness of more than 10,000 m (Ho 1986). The eastern part of the island, starting from the central mountain range, is composed of metamorphic rock of the earlier Mesozoic and Palaeozoic eras. The stratum of western Taiwan is principally made of marine rock sediment. Taiwan’s strata are distributed in long and narrow strips, almost parallel to the island’s axis. Sedimentary rock forms part of the island-wide piedmonts and coastal plains as well as the coastal mountain range (Ho 1986). The island is further characterized by steep terrain and river valleys, numerous mountains, large elevation differences, etc. (Fig. 1c). Mountains and hills occupy 75% of the island’s area and the average slope is about 30–40%. The complex geology combined with the enormous rainfall triggers several instances of landslides (Wu et al. 2011).

Fig. 1
figure 1

Normalized frequency of landslides according to (a) elevation, (b) volume, and (c) distribution of compiled landslides

Data collection and analysis

The study compiled data of landslides generated by typhoons and floods (rainfall-induced) in Taiwan between 2006 and 2014 (Fig. 1). This was mainly because rainfall influences the viscosity of the landslide mass and can almost result to maximum runout distances. The displaced materials of a rainfall-induced landslide are usually washed away from steep slopes. It remains only the fresh scars of the rupture surface (Liu 2009) making it much simpler to afford landslide measurements. Landslide characteristics such as length (L), width (W), and mean depth (H), which are described in sections that follow and in Fig. 2, were obtained from the Taiwan official landslide website (https://246.swcb.gov.tw/Achievement/MajorDisasters). These were investigated by the Soil and Water Conservation Bureau of Taiwan through a series of investigative tools such as field measurement and aerial photos interpretation. The landslide area was computed as a product of L and W, and volume obtained through the product of this area and H (shown in Fig. 2). According to the definition as described in the “Introduction” section, the landslides consist 76 DF, 495 SL, and 34 LL. Their distribution is shown in Fig. 1. From the map, it can be seen that most of the landslides occurred in southern Taiwan. A study by Tsou et al. (2011) showed that this part of the island is mainly underlain by sedimentary rocks of Quaternary, Pliocene, and Miocene. This is also where the Neiying Fault is located and the combination of these attributes exposes this region to several landslide disasters.

Fig. 2
figure 2

Schematic diagram of the parameters used in the analysis

The digital elevation model (DEM) used for the analysis was derived from a 30-m mesh Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) DEM. From Fig. 1, it is observed that most LL and SL occurred at an elevation > 100 m and were mostly concentrated between 500–1000 m. There were no LL at an elevation < 100 m. DF frequents were at an elevation between 100 and 500 m. The range in landslide volume for all landslides spans six orders of magnitude from 101 to 107 m3, and all LL were larger than 2 × 105 m3. Most (82%) SL ranged from 103–105 m3 as did DF. Only a few DF had volumes greater than 106 m3. Landslide volume from SL and DF shows near normal distributions, with an exception of LL, which shows a skewed distribution towards larger volume.

Meunier et al. (2008) characterized landslide sites in terms of their area and distribution, and found topographic site effects to have strong control over their location. Earthquake-triggered tend to be more clustered near ridge crest, while those triggered by rainfall were found mainly on colluvial slope toes. Additionally, the location of these two types gradually converged as the landslide area increased. Understanding the topographic site effects in this study requires several parameters: distance from the ridge crest to the nearest stream (LT), ridge to the top of the landslide (T), toe of the landslide to the nearest stream (B), top of the landslide to its toe (L), mean landslide depth (H), and width (tW) as illustrated by Fig. 2. These parameters were extracted from Google Earth images. The measured T and B were normalized by LT.

Logistic regression

Logistic regression model has often been applied in the study of landslides (Lombardo and Mai 2018; Tanyas et al. 2019; Luo et al. 2019) and has provided accurate and reliable results. It is characterized by a discriminative model which estimates a probability for a given feature (Eq. 1). We applied the model to validate the ternary classification proposed, with the landslide type as our nominal response. The Akaike information criterion (AIC) (Akaike 1974) and the Bayesian information criterion (BIC) (Stone 1979) were used to evaluate the models. AIC uses the in-sample fit to estimate the likelihood of a model to predict/classify future values, while BIC measures the trade-off between model fit and complexity of the model. A better model is one that has lower AIC or BIC values. The equations (Kleinbaum 1998) below are applied;

$$ p=1\div \left(1+{e}^{-z}\right),\mathrm{where}\ z=a+{b}_1{X}_1+{b}_2{X}_2+\dots +{b}_n{X}_n $$
(1)

where a is the intercept of the model, b refers to the beta values (power law scaling components of W, L, and H) associated with each of the independent variable, n is the number of variables, p is the probability varying from 0 to 1, and z varies from −∞ to +∞ on an S-shaped curve.

$$ \mathrm{AIC}=-2\times \ln (L)+2\times k $$
(2)
$$ \mathrm{BIC}=-2\times \ln (L)+2\times \ln (N)\times k $$
(3)

where L is the value of the likelihood, N is the number of recorded measurements, and k is the number of estimated parameters.

Results and discussion

Magnitude of landslides

The magnitude cumulative frequency (MCF) analysis was derived by sorting the inventory in the order of increasing magnitude (volume) and accumulating the incremental frequencies from largest magnitude to the lowest (Hungr et al. 1999). Figure 3 shows the MCF of the 3 types of landslides, DF, SL, and LL. The empirical power law relationship between the cumulative frequency (N) and the volume (V) of DF, SL, and LL was obtained as r2= 0.96, 0.94, and 0.87, respectively. This is in line with observation made by Tsou et al. (2017), who applied similar landslide definition as the study herein. Differences were noted however in power scaling exponents, which could be attributed to the different methodologies applied when conducting landslide inventories. Tsou et al. (2017) derived inventories from Lidar images, whereas in this study, they were derived from satellite images and ASTER DEMs. The rollover effect, where the data are no longer represented by the power law, is observed in the three landslides and is more obvious under SL and LL. Guthrie and Evans (2004) noted that larger landslides were well described by the power law, which could explain the not obvious rollover for the DF in this study.

Fig. 3
figure 3

Magnitude cumulative frequency curves for the landslides inventory

Landslide morphology

Figures 4, 5, and 6 shows the pairs of width to volume, length to volume, and depth to volume, respectively. A majority of DF (82%) and SL (87%) had a width less than 100 m. All LL had their widths greater than 200 m with 75% clustered between 200 and 500 m. The width of DF correlated well with volume (r2 = 0.726), while LL had the lowest correlation coefficient (r2 = 0.263). The width of DF was significantly lower when compared with the other 2 classes. On the contrary, DF and LL had significantly higher length (> 200 m) when compared with SL, a majority of which were below 200 m. Unlike width, the regression patterns are similar with length, indicating a similar relationship with volume. A steeper regression line is observed with LL, and stronger correlation exists between the depth and volume (r2 = 0.662). The depth of SL tended to be deeper than that of DF for any given volume, and while the volume of SL increased approaching that of LL, the depth also proportionally increased.

Fig. 4
figure 4

Relationship between landslide width and volume

Fig. 5
figure 5

Relationship between landslide length and volume

Fig. 6
figure 6

Relationship between landslide depth and volume

Landslides distribution

Following the work of Meunier et al. (2008), Fig. 7 shows the location of each landslide. The normalized distance of the landslide from its top to the ridge (T/LT: abscissa) and the normalized distance of the landslide from its toe to the nearest stream (B/LT: ordinate) are used for the illustration. SL were concentrated on the lower-right position (0.4 < T/LT < 0.9 and 0.4 <B/LT). This suggests that most SL occurred on lower slopes and were induced by converging runoff and stream bed erosion, similar observations made by Meunier et al. (2008), Wu et al. (2014), and Yano et al. (2019), and some are induced by river/road cutting. The presence of mechanically weak colluvial deposits and the focusing of groundwater outflow could be other attributes (Meunier et al. 2008). In contrast, DF and LL were clustered on the lower-left (Fig. 7). About 80% of DF originated in less than the upper 8/10 of the total slope length (T/LT < 0.8) and extended to the streams (B/LT = 0), suggesting that most DF were transported directly to rivers. Meunier et al. (2008) observed landslides triggered by typhoon Herb in 2006 that a significant portion of debris flows was delivered direct into rivers. In the population of LL, 94% extended to less than the lower 2/10 of the total slope length (0 < B/LT < 0.2) whereas those extending to rivers (B/LT = 0) presents 69%. Hence, most LL slided close to the stream, or even deposited rock and soil directly in the streambed suggesting high fluidity of the transported mass.

Fig. 7
figure 7

Landslide location with respect to ridge crest (T/LT) and closest stream (B/LT)

When comparing the length and width of landslides; widths of SL, LL, and DF rarely exceeded the length (i.e. L/W ratio is mostly greater than or equal to one, see Fig. 8). SL and LL had L/W ratio in the range 0.079 − 32 and \( 0.3-4.41 \), respectively. Comparatively, DF had a wider L/W distribution, ranging from 2.25 to 213, with more spread around the mean value of 45.7; hence, DF have more pronounced elongated form.

Fig. 8
figure 8

Length and width ratio (L/W) of SL, LL, and DF

Landslide size and slope analysis

From Fig. 9, it can be seen that DF occurred mostly (53%) at gentler slopes (<20). On the contrary, SL and LL were mostly concentrated on slopes >30. About 48% of SL and 54% of LL were between 30 and 40°, an almost similar observation by Borgomeo et al. (2014). To determine the magnitude of slope failure, we used the dimensionless parameter L/LT, which is directly proportional to the landslide area. A value of 1, therefore, would mean the failure occupied the entire slope. Figure 9 further illustrates that 88% of SL were clustered below L/LT = 0.4. In selected cases, SL had L/LT > 0.4 where the slope was generally short. Eighty nine percent of LL had an L/LT > 0.5, some approaching 1, suggesting the entire slope to have been destroyed. Likewise, DF (85%) had L/LT > 0.4 since they tended to have greater length.

Fig. 9
figure 9

The relationship between the landslide area and slope

Ternary distribution of landslide geometric form index

Earlier investigations have shown that in a natural river, a geometric form index relationship exists between water depth, width, and flow velocity, and formulas involving an index and coefficients have been proposed to estimate relevant factors (Leopold and Maddock 1953). Rhodes (1977) used this concept to draw the geometric form index of rivers with straight, meandering, and braided morphologies on ternary diagrams. Based on these concepts, a landslide geometric form index applicable to DF, SL, and LL is proposed. The relationship between the width (W), length (L), and depth (H) through power equations with the landslide volume (V) as the independent variable is illustrated by the equations below;

$$ W={\alpha}_1{V}^{\beta_1} $$
(4)
$$ L={\alpha}_2{V}^{\beta_2} $$
(5)
$$ H={\alpha}_3{V}^{\beta_3} $$
(6)

where α1, α2, and α3 are coefficients, and β1, β2, and β3 are the power law scaling exponents of W, L, and H, respectively. The βs are hereby referred to as landslide geometry exponents. The reader should note that all these coefficients and power law scaling components are empirical; hence, their values should be assessed likewise. The volume (V) can also be expressed as;

$$ V=W\times L\times H $$
(7)

Equation 7 can be written as the following;

$$ V={\alpha}_1{\alpha}_2{\alpha}_3{V}^{\beta_1+{\beta}_2+{\beta}_3} $$
(8)

Therefore, α1α2α3 = 1 and β1 + β2 + β3 = 1. For graphical data presentation, β1, β2, β3 are the tools for interpreting the landslide geometry.

The ternary diagram shown in Fig. 10 clearly demonstrates the differences between the three landslides observed. SL have smaller L, W, and H, and similar average β1 and β2. A typical SL lies almost at the centre of the ternary plot. Inversely, LL have larger L and W, and further similar averages of β1 and β2 to satisfy Eq. 8. This makes β3 to be the key controlling factor for LL. A majority of the LL lie in the region with greater depth. Like LL, DF had larger L with a typical DF similar to a SL in depth (β3). We expect the ternary plot to be universally applicable in that the different landslides within the ternary should follow the same patterns as observed in this study. Furthermore, a large database of similar scale or more would be necessary, having landslides classified as outlined in the “Introduction” section. The reader should note that the methodology was developed for rainfall-induced landslides using the database as described in the “Data collection and analysis” section. Henceforth, geological and geomorphological conditions, and role of the triggering mechanisms of landslide need to be taken into account when developing the analysis. Although, more recent techniques such as hybrid models (Luo et al. 2019) and machine learning (Dou et al. 2020) could be used, simplified tools such as the ternary plot are still necessary in practice and are still highly sought by conservation managers.

Fig. 10
figure 10

Ternary diagram illustrating the distribution of landslide geometry exponents

Logistic regression analysis

The ternary diagram in Fig. 10 has demonstrated the location of each landslide event and how each one may be estimated using β1, β2, and β3. In an attempt to better characterize these events from β, logistic regression was applied. The characterization profiles for the different β values are shown in Figs. 11, 12, and 13. Using β1 to classify the landslide event is shown to be applicable in all events: DF, SL, and LL. We used a probability of 0.8 (80%) as a threshold to indicate the likelihood of a landslide event. From Fig. 11, it is shown that the β1 values for DF, SL, and LL were 0.166, 0.263, and 0.511, respectively, to get a classification with a 0.8 chance of success. Beyond 0.166, the probability (p) of correctly classifying LL in the ternary diagram is drastically reduced, while that of SL increased. Likewise, at about β1 of 0.330, the probability of correctly classifying SL decreases, while that of DF increases. After β1 reaches 0.511, p for DF dominates, while SL and LL are almost 0. A confusion matrix using β1 is shown in Table 1 and the overall correct rate was 0.955.

Fig. 11
figure 11

Characterization profile using β1

Fig. 12
figure 12

Characterization profile using β2

Fig. 13
figure 13

Characterization profile using β3

Table 1 Classification table for β1

Using β2, shown in Fig. 12 and Table 2, suggests a threshold of 0.341 for β2. Furthermore, this would accurately estimate SL, since beyond 0.341, the probability of DF and LL is very low (below 0.2) and is zero from β2 ∼ 0.45 . This is further indicated by the poor correct rate in Table 2, where for example, the correct rate for LL was 0. There were 76 DF from our compilation; moreover, using β2 gave only 42, having an accurate rate of 0.553.

Table 2 Classification table for β2

As already indicated in the “Ternary distribution of landslide geometric form index” section, LL have larger L and W, and their β1 and β2 values are close to the mean, hence influencing β3 to be the controlling factor for volume. From Fig. 13, 0.466 is the cutting point for characterizing LL. Beyond this value, p for SL nears 0, while for DF is 0. At β3 of about 0.225, the p to correctly classify SL increases and gently decreases after β3 of 0.3. The confusion matrix in Table 3 shows a 100% correct rate when using β3.

Table 3 Classification table for β3

A general summary of the model evaluation profiles is given by Table 4. Logistic regression using β1 (AIC = 136.115, BIC = 153.736) outperformed both β2 and β3 using either AIC and BIC. The logistic model from β3 was the poorest, having AIC and BIC values of 421.759 and 439.380, respectively. Even though some of the models performed poor, they were all significant as indicated by the p values in Table 4.

Table 4 A summary statistic for logistic regression using β1, β2 and β3

Conclusion

An improved topographic site effect method has been applied to analyse landslide’s volume and characteristics. The study has also presented a ternary diagram using three indices (W (β1), L (β2), and H (β3)) to develop a three-dimensional volume method of classifying landslides. Landslides were classified as debris flows, shallow, and large landslides. There was large variability in width between DF and SL, and width had a significant positive correlation with volume. The W of DF was significantly lower than that of SL and LL. DF and LL had a significantly higher L (> 200 m) when compared with SL, and a stronger correlation was found between H and volume (r2 = 0.662). SL and LL had L/W ratio in the range 0.079 − 32 and 0.3 − 4.41, respectively. The improved topographic site effect method indicated that SL are clustered on lower slopes (0.4 <T/LT< 0.9 and 0.4 <B/LT). LL originating from higher slopes and extending to near streams accounted for 69% of the total LL. A majority (53%) of DF occurred at slopes <20. To enhance the ternary diagram developed, logistic regression was applied and β1 was found to be a better classifier of the three types of landslides (DF, SL, and LL), with a correct rate of 0.955, followed by β3 and lastly β2. Through characterization profiles, when using β1, the probability (p) was 0.8 to correctly classify a landslide as LL when β1 < 0.166, SL when β1 > 0.263, and SL when β1 > 0.511. AIC and BIC were the lowest for β1 when compared with logistic models of β2 and β3. Moreover, β3 logistic model had a correct rate of 100% in classifying LL. While the ternary diagram proposed in this study has been successfully applied, it is important to note that β1, β2, and β3 are empirical parameters. Therefore, some deviation is expected in practice and with different regions. Nonetheless, location of the different landslides within the ternary should follow the same patterns as observed in this study, making the ternary more versatile in classifying landslides. Additionally, a large database of similar scale or more would be necessary for improved performance.