1 Introduction

Rainfall-induced landslides often cause enormous damage and property losses worldwide (e.g., Zhang et al. 2011; Wei et al. 2019; Lu et al. 2023; Mondini et al. 2023). As these landslides are often distributed over large regions, developing landslide susceptibility maps (LSMs) has been considered a useful tool for the risk management of these landslides (e.g., Long et al. 2021; Gong et al. 2022; Su et al. 2023). At present, two types of models have been widely used for developing the LSMs, i.e., the data-driven models rely on the statistical analysis of previous landslide events, and physically-based models through regional stability analysis of the slopes (e.g., Merghadi et al. 2020; Ji et al. 2022). The data-driven methods consider the evolution of landslides to predict slope failure (Cascini et al. 2022), while also assuming that conditions resembling those of historical landslide areas may evolve into future slope instability (e.g., Yalcin., 2008; Gong et al. 2022), and various techniques have been used for learning it, such as logistic regression (e.g., Zhao et al. 2019), random forest (e.g., Sun et al. 2021; Fu et al. 2023), convolutional neural network (e.g., Fang et al. 2020; Wang et al. 2021a; Wei et al. 2024). However, the applicability of data-driven models may be limited by the quality and quantity of available historical landslide data. Unlike the data-driven models, the physically-based model typically consists of a hydrological and a slope stability model (e.g., Vieira et al. 2018), which allows realistic modeling of the physical mechanism of slope failure. Several physically-based models have been developed for rainfall-induced regional LSMs, such as shallow slope stability model (e.g., Montgomery and Dietrich 1994; Abraham et al. 2023), transient rainfall infiltration and grid-based regional slope-stability model (e.g., Baum et al., 2002; Ma et al. 2021; Wei et al. 2021) and the regional assessment of rainfall-induced landslides model (RARIL) (Yang et al. 2022). Nevertheless, physically-based models mostly are created by simplifying the real physical process and giving assumptions (Medina et al. 2021). Additionally, the soil parameters necessary for physically-based models on a regional scale could be difficult to obtain (e.g., Sun et al. 2024) and are often treated as fixed values during the application of such models (e.g., Pradhan and Kim 2016).

To overcome the limitations of the two types of models, the hybrid model, which intends to take advantage of the two types of models, has also been explored recently as a promising tool to achieve more interpretable and accurate LSMs (e.g., Nguyen et al. 2022; Wei et al. 2023). For instance, Nguyen et al. (2022) adopted a matrix-based approach to combine the outcomes of data-driven and physically-based models. Wei et al. (2023) integrated the factor of safety (FOS) value derived from the physically-based model into the data-driven model, serving as a new conditioning factor to replace the initial geological conditioning factor. These hybrid models improved the interpretability of landslide susceptibility analysis and produced more accurate LSMs. Nevertheless, in these hybrid models, the soil parameters uncertainty is not considered. The soil properties in a region, are indeed hard to determine accurately and could be associated with a significant amount of uncertainty (e.g., Jelínek and Wagner 2007; Cao et al. 2016).

The object of this study is to propose a novel hybrid model for landslide susceptibility analysis with explicit consideration of the uncertainty in the soil properties. This paper first introduces the framework for constructing hybrid models. Then, the data-driven model and physically-based model is introduced, followed by the three hybrid models, i.e., one model considering the soil parameter uncertainty, another model using the same data-driven framework but does not consider soil parameter uncertainty, and a matrix-based model also does not consider soil parameter uncertainty. Finally, the conventional data-driven, physically-based model and the three hybrid models are illustrated and compared with a case. We believe that the proposed method offers a new way to combine the advantages of data-driven and physically-based models for more accurate rainfall-induced landslide susceptibility assessment.

2 Methodology

Model hybridizing provides a novel method for improving landslide susceptibility assessment. As the data-driven model is trained to find the relationship between conditioning factors and landslide occurrence, one possible way to develop the hybrid model is to train the data-driven model by considering the prediction from the physically-based model as an additional conditioning factor. As such, the information provided by the physically-based model can naturally be integrated into the hybrid model. To consider the soil parameters uncertainty in the physically-based model, the uncertain soil parameters can be treated as random variables, and the stability of the slopes can be thus measured through the failure probability. In such a case, the failure probability of the landslide calculated through a physically-based model can be considered as the additional conditioning factor to be incorporated in the conventional data-driven model.

To implement the above idea, a data-driven model and a physically-based model should be first selected. As an illustration, the logistic regression (e.g., Yilmaz 2009; Zhan et al. 2023) will be used in this study to develop the data-driven model as it is simple and be widely used, and RARIL (Yang et al. 2022) will be used as the physically-based model for its ability to explicitly consider uncertainty in the soil properties through reliability analysis. Figure 1 shows the process of the proposed hybrid models, which consists of three steps, i.e., construction of a data-driven module, establishment of a physically-based model, and development of the hybrid model through the combination of the two types of models. In the following, each of the above steps will be introduced.

Fig. 1
figure 1

Illustration of the development process of hybrid models

2.1 Data-driven model

To develop the hybrid model, a data-driven technique should be first selected. As an illustration, the logistic regression will be used in this study as the data-driven model. In the logistic regression, the relationship between the probability of landslide occurrence and the conditioning factors (e.g., Zhao et al. 2019) can be expressed as follows:

$$p=\frac{1}{{1+{e^{ - Z}}}}=\frac{{{e^Z}}}{{1+{e^Z}}}$$
(1)

where p is the estimated landslide occurrence probability, and Z is the linear combination of conditioning factors as follows:

$$Z={w_0}+\sum\limits_{{i=1}}^{n} {{w_i}{x_i}}$$
(2)

where xi is the ith conditioning factor, wi is the regression coefficient of the ith conditioning factor, and n is the number of conditioning factors.

To calibrate Eq. (1), a landslide inventory should be first compiled. Then, the parameters in Eq. (2) can be calibrated through the method like maximum likelihood estimation (e.g., Chen et al. 2016). To standardize the raw data in regression model, the Min-Max normalization method can be adopted (e.g., Zhang et al. 2021) as shown below:

$${x^*}=\frac{{x - {x_{{\text{min}}}}}}{{{x_{{\text{max}}}} - {x_{{\text{min}}}}}}$$
(3)

where x is the raw data, x* is the normalized data, and xmax and xmin are the maximum and minimum values of the conditioning factor x.

2.2 Physically-based model

As an illustration, RARIL, as suggested by Yang et al. (2022), is used as the physically-based model in this study. In RARIL model, the digital terrain is first partitioned into square grids with 30 m resolution, and the stability of each grid is assessed through slope stability analysis during the rainfall infiltration process. In particular, the modified Green-Ampt model (Zhang et al. 2014) is used for rainfall infiltration analysis, and the infinite slope stability model (Fig. 2a) (e.g., Zhang et al. 2014; Lu et al. 2023) is used to calculate the FOS of each cell. Figure 2b shows the water content profile of Green-Ampt model. In the modified Green-Ampt model, the calculation of the infiltration rate after ponding occurs, i, is performed (Chen and Young 2006) as follows:

$$i={k_s}[\frac{{{s_f}({Q_s} - {Q_i})}}{I}+\cos \alpha ]$$
(4)

where ks is the soil saturated permeability, sf is the suction head at the wetting front, Qs is the water content of the soil in the wetted zone, Qi is the initial water content of the soil, and α is the slope angle. Based on Eq. (4), the cumulative infiltration, I, can be solved through a two-step procedure (Zhang et al. 2014). As presented in Fig. 2b, the depth of the wetting front, Zw, can then be calculated as follows:

$${Z_w}=\frac{I}{{({Q_s} - {Q_i})\cos \alpha }}$$
(5)

Assuming that rainfall-induced landslides occur along the wetting front (e.g., Cho and Lee 2002; Vanacker et al. 2003; Zhang et al. 2014; Zhang et al. 2020; Chen et al. 2021), the FOS of the slope, Fs, can then be generated using the following equation (Fredlund et al. 1978):

Fig. 2
figure 2

(a) The infinite slope stability model under rainfall infiltration. Modified from Zhang et al. 2014, (b) The water content profile of the modified Green-Ampt model. Modified from Kim et al. 2014

$${F_s}=\frac{{c{\text{+}}{\gamma _s}{Z_w}{{\cos }^2}\alpha \tan \varphi +{s_f}{\gamma _w}\tan \varphi }}{{{\gamma _s}{Z_w}\sin \alpha \cos \alpha }}$$
(6)

where c and φ are the effective cohesion and effective friction angle of the soil, γs is the unit weight of soil, and γw is the unit weight of water.

One challenge for applying the physically-based model in a region is that the soil parameters are difficult to determine accurately (e.g., Wang et al. 2015). Let θ denote uncertain soil parameters in the RARIL model, and f (θ) denote the probability density function (PDF) of θ. Let Fs (θ, t) denote the FOS of the slope at time t evaluated based on Eq. (6). The failure probability at time t, i.e., pf (t), can then be expressed below (e.g., Tobutt 1982; Zhang et al. 2014):

$${p_f}(t)=\iiint {\left[ {{F_s}({\mathbf{\uptheta}},t)} \right]f({\mathbf{\uptheta }}){\text{d}}{\mathbf{\uptheta }}}$$
(7)

where J [Fs (θ, t)] is an indicator function that measures the stability of slope defined as follows:

$$J\left[ {{F_s}({\mathbf{\uptheta }},t)} \right]=\left\{ {\begin{array}{*{20}{c}} {0,{F_s}({\mathbf{\uptheta }},t)>1} \\ {1,{F_s}({\mathbf{\uptheta }},t) \leqslant 1} \end{array}} \right.$$
(8)

Let θk denote the kth sample of θ. The failure probability in Eq. (7) can then be estimated based on Monte Carlo simulation as follows:

$${p_f}(t) \approx \frac{1}{N}\sum\limits_{{k=1}}^{N} {J[{F_s}({{\mathbf{\uptheta }}^k},t)]}$$
(9)

where N is the number of samples.

2.3 Hybrid model

The purpose of the hybrid model is to take advantage of both the data-driven and physically-based models (e.g., Oliveira et al. 2017; Strauch et al. 2019). In this study, the prediction from the physically-based model is first considered as an additional conditioning factor for constructing the data-driven model, through which the information obtained through the physically-based model can be taken into account in the hybrid model. When using logistic regression to develop the hybrid model, Eq. (2) can be revised as follows:

$$Z={w_0}+\sum\limits_{i}^{n} {{w_i}{x_i}{\text{+}}{w_{n+1}}{x_{n+1}}}$$
(10)

where xn+1 represents the additional conditioning factor in the hybrid model. Two hybrid methods are first considered to develop the hybrid model, i.e., hybrid model I, where the failure probability calculated based on Eq. (9) is considered as the additional conditioning factor xn+1, and hybrid model II, where the FOS calculated based on Eq. (6) is considered as an additional conditioning factor xn+1. To develop the two hybrid models, one can first evaluate the failure probability or FOS of the slopes using the physically-based model. Then, the hybrid model can be developed based on logistic regression as given by Eq. (10) through taking failure probability or FOS as the additional conditioning factor.

Furthermore, to provide another comparison, this paper also adopts a matrix-based hybrid method (Hybrid method III) by integrating the data-driven and physically-based classification results. This method has been frequently used in previous studies by constructing a matrix where rows represent landslide susceptibility classes derived from the physically-based model, and columns represent landslide susceptibility classes obtained from the data-driven models (e.g., Chowdhury and Flentje 2003; Nguyen et al. 2022; Xue et al. 2024). As shown in Fig. 3, each landslide susceptibility results are further divided into five classes: Very High, High, Middle, Low, and Very Low. In this study, the matrix-based hybrid method results are considered as the lower-class value between the landslide susceptibility class from data-driven and physically-based models.

Fig. 3
figure 3

Landslide prediction matrix integrating data-driven and physically-based models

2.4 Model evaluation

In this study, the overall accuracy, the receiver operating characteristic curve (ROC), and the relative landslide density index (R-index) will be used as quantitative measures to compare the prediction capacity of the different landslide susceptibility analysis models (e.g., Shahabi et al. 2014; Wei et al. 2021; Miao et al. 2023). When comparing the model predictions with the observations, it is considered a true positive (TP) case if the landslide does occur at the predicted landslide location. If the landslide does not occur at the predicted landslide location, it is considered a false positive (FP) case. If the landslide does not occur at the predicted non-landslide location, it is considered a true negative (TN) case. If the landslide occurs at the predicted non-landslide location, it is considered a false negative (FN) case. Using these definitions, the overall accuracy, true positive rate (TPR), and false positive rate (FPR) can be determined below:

$${\text{Overall Accuracy}}=\frac{{{\text{TP+TN}}}}{{{\text{TP+TN+FP+FN}}}}$$
(11)
$${\text{TPR}}=\frac{{{\text{TP}}}}{{{\text{TP+FN}}}}$$
(12)
$${\text{FPR}}=\frac{{{\text{FP}}}}{{{\text{FP+TN}}}}$$
(13)

As can be seen from the above definition, the overall accuracy is a measure of model correctness. It represents the proportion of correct predicted values made by the model (e.g., Liu et al. 2023). The TPR describes the proportion of landslide locations that are classified correctly as landslide locations by the model, while the FPR represents the proportion of non-landslide locations that the model incorrectly classifies as landslide locations (e.g., Cantarino et al., 2019). In this study, the predictions from the landslide susceptibility analysis models are in terms of probabilities of landslides. To assess the prediction capacity of different models, a probability threshold should be first specified, i.e., a landslide is predicted to occur if the probability is greater than the probability threshold and vice versa. As the probability threshold changes, the values of TP, FP, TN, and FN also change, and hence the values of overall accuracy, TPR, and FPR value change. The relationship between TPR and FPR is often known as ROC. From the ROC curve, the area under the curve (AUC) can be calculated (e.g., Cantarino et al., 2019), which is between 0 and 1. The value of AUC is often used to access the prediction capacity of a model (e.g., Bradley 1997) and is classified as average (0.6–0.7), good (0.7–0.8), very good (0.8–0.9), and excellent (0.9-1) (e.g., Woodard et al. 2023).

The R-index, which measures the relative landslide density, is often used to validate the quality of LSMs (Baeza and Corominas 2001), as defined below:

$${\rm{R}}=({n_i}/{N_i})/\sum {({n_i}/{N_i})} \times 100$$
(14)

where ni and Ni are the number of landslide samples and total cells in the susceptibility class i. Based on Eq. (14), the R-index can be calculated for each susceptibility class. It is expected that, when the susceptibility class is low, the chance of occurrence of landslides in the low susceptibility region is also low, and hence the R-index should be small. On the other hand, when the susceptibility class is high, the R-index should also be large. Therefore, the value of R-index is expected to increase as the susceptibility class changes from low to high, so it can be used to measure the classification capacity of a LSM (Shahabi et al. 2014). In the following, the conventional data-driven, physically-based model and the three hybrid models for developing the landslide susceptibility assessment work will be illustrated with a case and validated with the above evaluation methods.

3 Case study

3.1 Engineering background

Covering a total area of 1789.6 km2, Shengzhou County is located in the east of Zhejiang province, China. Figure 4a shows the elevation of the terrain in Shengzhou County. As shown in Fig. 4a, its terrain tilts from northwest to southeast with altitude varying from 1074 m to 2 m. Rainfall serves as the major triggering factor for landslides in this region. On August 2nd, 2018, the typhoon “Jongdari”, which brought extremely heavy precipitation (e.g., Zhan and Xie 2022), attacked Shengzhou County, leading to many landslides. Landslide inventory is an essential component of LSMs (e.g., Harp et al. 2011; Wang et al. 2021b). Based on Google Earth and GF-1 remote sensing data, 240 landslide polygons were detected by comparing the satellite figures before and after the typhoon-related rainfall events. For further analysis, all landslide polygons were converted into landslide grids (30 m resolution). These 318 resulting grids are then serves as representative landslide samples, as illustrated in Fig. 4a. To build a dataset for model analysis, an equal number of non-landslide samples are also needed (e.g., Liu et al. 2022; Chang et al. 2023). Considering that there are large flat terrains in Shengzhou County with almost no observed landslides, we first employed the r.geomorphon module to exclude these areas (Jasiewicz and Stepinski 2013). Furthermore, we create a 100 m buffer zone around landslide points (Süzen and Doyuran 2004), thus ensuring most of the non-landslide sample area exclusion from regions of landslide occurrences (Fig. 4b). The combined landslide/non-landslide dataset is further randomly split into two parts, with 70% for training and the remaining 30% for validation. (e.g., Razavi-Termeh et al. 2023; Saha et al. 2023).

Fig. 4
figure 4

(a) Elevation and landslide samples distribution in the Shengzhou County, (b) Non-landslide samples distribution

3.2 Rainfall data

Shengzhou Meteorological Bureau set up a monitoring network consisting of 43 rainfall stations in the study area. Figure 5 shows the distribution of 43 rainfall stations in this region. Figure 6 shows the regional maximum hourly precipitation in all rainfall stations and maximum cumulative precipitation collected by rainfall station in this region. As presented in Fig. 6, the rainfall in this region lasts nearly 29 h and brings a maximum cumulative precipitation exceeding 150 mm.

Fig. 5
figure 5

Distribution of the rainfall stations and 29-hours cumulative rainfall data

Fig. 6
figure 6

Records of the maximum hourly precipitation in this region and maximum cumulative precipitation in the rainfall station

Based on rainfall data collected from 43 rainfall monitoring stations, as shown in Fig. 5, the distribution of rainfall over the region can be derived through the Kriging interpolation method (e.g., Cressie 1990; Olea,1999; Chen et al., 2014; Zhao et al. 2019; Wang et al. 2023). Let R(ui) denote the cumulative rainfall at the location of the ith rainfall station ui, and let R(u0) represent the rainfall data at a location u0 where the cumulative rainfall needs to be interpolated. Based on the kriging method, R(u0) can then be expressed by a linear combination of R(ui) as follows (e.g., Chen et al., 2014):

$${\rm R}({{\mathbf{u}}_0})=\sum\limits_{{{n_r}}}^{{i=1}} {{\lambda _i}} {\rm R}({{\mathbf{u}}_i})$$
(15)

where nr is the number of rainfall stations in the region, and λi is the optimal weight with the sum weights for all nr stations equal to 1. With the rainfall data collected from the 43 rainfall stations, the 29 h cumulative rainfall data at the regional scale can then be interpolated, as shown in Fig. 5. In RARIL, the simulation period spans the entire duration of the rainfall event.

3.3 Conditioning factors

While many studies have been carried out to identify conditioning factors for developing LSMs (e.g., Reichenbach et al. 2018), consensus on the selection of which conditioning factors remains elusive (e.g., Kavzoglu et al. 2015). In general, the conditioning factors used should both consider the characteristics of landslides and the availability of the data (e.g., Kavzoglu et al. 2015; Zhao et al. 2019). In this study, nine conditioning factors are considered to build the basic data-driven model through logistic regression, including elevation, slope angle, slope aspect, distance to rivers, distance to faults, distance to roads, topographic wetness index (TWI), normalized difference vegetation index (NDVI), and land use, which are shown in Fig. 7a-i, respectively. In addition to the above nine conditioning factors, the soil type layer (available at https://www.fao.org/) is also obtained, as shown in Fig. 7j, which will be used to derive soil parameters as input for RARIL. Details of the selected conditioning factors are presented in Table 1. Considering that some environmental conditioning factors have time-variant characteristics, especially when applying during a typhoon event (Lee et al. 2008), such as NDVI and TWI values, the NDVI is derived from a Landsat 8 satellite image captured shortly before the typhoon event. Similarly, the digital terrain model with a 30 m resolution, which is used to extract the TWI, is obtained before the date of the typhoon event. The elevation, slope angle, slope aspect, and distance to rivers are also extracted from the same digital elevation model. Furthermore, a land use map provided by Globeland30 is used to get the land use layer (available at http://www.globallandcover.com/) and a road map provided by Bigemap (available at http://www.bigemap.com/) is used to generate the value of distance to roads and Zhejiang province geological map at a scale of 1:500,000 (available at http://www.ngac.org.cn) is used to extract the value of distance to faults. Consistent with the resolution of the conditioning factors, all the models in this study have a 30 m resolution.

Table 1 Information on landslide conditioning factors
Fig. 7
figure 7

Landslide conditioning factors: (a) elevation, (b) slope angle, (c) slope aspect, (d) distance to rivers, (e) distance to faults, (f) distance to roads, (g) TWI, (h) NDVI, (i) land use, (j) soil type

In the following, different models will be used to assess the landslide susceptibility for Shengzhou County.

4 Results

4.1 Logistic regression model for landslide susceptibility mapping

To build a logistic regression model, the created dataset is adopted. Using nine basic conditioning factors as the features, a logistic regression model is developed based on Eqs. (1) and (2) through the training dataset. Then, the prediction capacity of the logistic regression model is checked with the validation dataset. Table 2 shows the regression coefficients of the logistic regression model. Generally, after Min-Max normalization of the raw data, a greater absolute value of the regression coefficient indicates that the conditioning factor has a greater impact on the landslide probability (e.g., Shahabi et al. 2014). Table 2 shows that the slope angle has the greatest regression coefficient (i.e., 7.908), indicating that it has the greatest impact on the landslide probability.

Table 2 Coefficients of the logistic regression, hybrid models I and II

When the data-driven method is used for developing LSM, the susceptibility can be divided into five classes through the landslide probability using the equal interval method, i.e., very low (0-0.2), low (0.2–0.4), middle (0.4–0.6), high (0.6–0.8), and very high (0.8-1), respectively (e.g., Baeza et al. 2016). Figure 8a shows the LSM derived based on the logistic regression model using the above susceptibility classification method. Table 3 presents the number of landslides (ni) and number of cells (Ni) in each susceptibility class i, which can then be used to calculate the R-index using Eq. (14). The R-index values corresponding to susceptibility classes of very low, low, middle, high, and very high in the logistic regression model are 0.59, 4.01, 9.29, 25.58, and 60.53, respectively. It seems that as the susceptibility class increases from very low to very high, the R-index value which measures the landslide density also increases rapidly, indicating that the landslide susceptibility class is consistent with the observed landslide phenomenon.

Table 3 R-index comparison for the logistic regression, RARIL, and three hybrid models
Fig. 8
figure 8

Landslide susceptibility maps derived using five models: (a) Logistic regression, (b) RARIL, (c) Hybrid model I, (d) Hybrid model II, (e) Hybrid model III

4.2 RARIL model for landslide susceptibility mapping

To implement the RARIL model, equal weight is initially assigned to all analysis grids and soil property samples (Horton et al. 2013), ensuring that each component contributes equally to the overall representation of terrain and slope stability analysis. Then the soil parameters in the region is determined. In this study, the soil parameters are determined based on its soil type, and further be divided into four types, i.e., clay, loam, silt loam, and sandy clay loam, as shown in Fig. 7j. The statistics and distributions of c, φ, Qi, Qs, ks, and sf for different types of soil used in this study are summarized in Table 4. As shown in Table 4, the initial parameter space for soil properties is provided, specifying that c and φ follow a normal distribution, Qi and Qs follow a uniform distribution, and ks and sf follow a lognormal distribution. Then, the Monte Carlo simulation generates a set of random samples from these probability distributions, and each sample represents a possible combination of soil property values. These samples are further used to calculate the failure probability of each grid under the rainfall infiltration process. In particular, the cohesion and the friction angle of the soils are determined based on the Geotechdata Database (available at https://www.geotechdata.info/). The values of Qi and Qs variables are derived from Zhang et al. (2018), and the values of ks and sf are derived from Mays (2011).

Table 4 Statistics of soil parameters

Based on the Geotechdata Database, the unit weights used for different soils are as follows: 18 kN/m3 for clay, 20 kN/m3 for loam, 19.5 kN/m3 for silt loam and 21 kN/m3 for sandy clay loam. This study also adopts a unit weight of water (γw) of 9.81 kN/m3. With the above soil parameters, the failure probability of each cell in the study area during the rainfall event can be calculated through Monte Carlo simulation based on Eq. (9). For a physically-based model, Yang et al. (2022) suggested that the susceptibility of a cell can be considered as low, middle, and high when the failure probability is less than 0.05, between 0.05 and 0.1, and above 0.1, respectively. Based on the above landslide susceptibility classification method, a LSM can also be generated for the study area, as shown in Fig. 8b. Table 3 also presents the R-index for each susceptibility class corresponding to the RARIL model. As shown in Table 3, a notable high R-index (71.71) is observed in the high susceptibility class, indicating a reasonable classification capacity of the RARIL model.

4.3 Hybrid model I for landslide susceptibility mapping

By employing the nine basic conditioning factors and incorporating the failure probability value obtained in Sect. 4.2 as an additional conditioning factor, a logistic regression model can be trained and the coefficients are shown in Table 2. It is interesting to see that while in the logistic regression model and hybrid model II, the slope angle has the greatest coefficient, in the hybrid model I, the failure probability estimated based on RARIL has the greatest coefficient. The LSM generated by hybrid model I is subsequently reclassified using the equal interval method, as shown in Fig. 8c. Table 3 presents the R-index for each susceptibility class corresponding to hybrid model I. It is interesting that for the hybrid model I, its R-index value in the very high landslide susceptibility class is 80.46, which is greater than all other models, indicating that it can classify landslides more accurately.

4.4 Hybrid model II for landslide susceptibility mapping

The hybrid model II is created by treating the FOS value calculated through RARIL as the additional conditioning factor for logistic regression. To implement hybrid model II, the FOS value generated by RARIL without considering soil parameters uncertainty is needed. Hence, the FOS is calculated by RARIL using the mean values of the random variables as listed in Table 4. Subsequently, using Eq. (6), RARIL is adopted to calculate the FOS distribution of the study area under the typhoon-related rainfall event, as illustrated in Fig. 9.

Fig. 9
figure 9

Factor of safety (FOS) calculated by the RARIL

Using nine basic conditioning factors and incorporating the FOS value as an additional conditioning factor, hybrid model II is built through logistic regression. Table 2 also presents the coefficient of the ten conditioning factors of hybrid model II. As can be seen from Table 2, the slope angle exhibits the highest significance in hybrid model II which shows that the FOS still not surpass the influence of slope angle with this hybrid method. Based on the equal interval method, a new LSM is created, as shown in Fig. 8d, and the R-indexes for different landslide susceptibility classes are calculated and shown in Table 3. As can be seen from Table 3, the R-index values corresponding to susceptibility classes of very low, low, middle, high, and very high in the hybrid model II are 0.59, 3.71, 6.36, 18.48, and 70.85, respectively, exhibiting an accurate trend of increasing landslide intensity with the landslide susceptibility class.

4.5 Hybrid model III for landslide susceptibility mapping

As mentioned in Sect. 2.3, hybrid model III which employed a matrix-based approach, is also included for comparison. This method integrates the data-driven landslide susceptibility classes derived by logistic regression (Sect. 4.1) with FOS values obtained from the RARIL model without considering the soil parameters uncertainty (Sect. 4.4). As can be seen from Fig. 9, FOS values are divided into five classes: FOS < 1 (Very High), 1 < FOS < 1.2 (High), 1.2 < FOS < 1.5 (Middle), 1.5 < FOS < 2 (Low), and FOS > 2 (Very Low) (e.g., Wei et al. 2023). Following the matrix-based approach illustrated in Fig. 3, these inputs are combined to generate a new LSM, as shown in Fig. 8e. The corresponding R-index values for each susceptibility class are also presented in Table 3. This hybrid model also demonstrates an accurate trend of increasing relative landslide intensity index with increasing susceptibility class: very low (0.25), low (1.65), middle (3.36), high (21.13), and very high (73.61).

5 Discussion

5.1 Comparison of the conventional and hybrid models

In this section, the five landslide susceptibility models are compared in terms of overall accuracy and the ROC curve with the validation dataset. To calculate the overall accuracy of the five models based on Eq. (11), the landslide probability threshold beyond which a landslide is considered to occur needs to be determined. For the conventional logistic regression model, hybrid models I and II, which are constructed based on logistic regression, and hybrid model III, which is constructed based on a matrix approach, the probability threshold is set to 0.5 (e.g., Nefeslioglu et al. 2008). For RARIL, the probability threshold is selected as 0.05 (Yang et al. 2022). Table 5 shows the overall accuracy of the five models. As shown in Table 5, hybrid model I gets the highest overall accuracy with a value of 0.80, followed by hybrid model II (0.77), hybrid model III (0.75), logistic regression (0.74), and RARIL (0.70). It is obvious that all the three hybrid models outperform the conventional logistic regression and RARIL model. Moreover, after considering the soil parameters uncertainty, hybrid model I achieved the highest overall accuracy.

Table 5 The overall accuracy of the five models

Figure 10 presents the ROC curves of the five models. As shown in Fig. 10, the AUC values of the conventional logistic regression model, RARIL model, hybrid model I, hybrid model II and hybrid model III are 0.82, 0.79, 0.89, 0.85, and 0.85, respectively. Also can be seen that all the hybrid models outperform the conventional data-driven and physically-based models. Among the three hybrid models, the hybrid model I has the greatest AUC compared to the other models, indicating that considering the soil parameters uncertainty can improve the prediction capacity of landslide susceptibility analysis in this study.

Fig. 10
figure 10

ROC curves for the logistic regression, RARIL, and three hybrid models

5.2 Improvement of prediction capacity through model hybridization

The comparison of five models reveals the potential of integrating data-driven and physically-based approaches for the improvement of model prediction capacity. In this study, we proposed a hybrid method that can considers soil parameter uncertainty and compared it with two other models that does not consider it. Hybrid models II and III, which integrate the RARIL model with fixed soil parameters to generate FOS values/classes, exhibited similar AUC values. This suggests that combining the two types of model results using either a logistic regression framework or a matrix-based approach does not significantly impact predictions in this study. Furthermore, hybrid model I, which considers soil parameter uncertainty with the RARIL model to generate failure probability values, shows a notable improvement in prediction capacity compared to the other hybrid models that do not consider it. This can be explained that after consider the inherent uncertainty in soil properties at a regional scale, hybrid model I provides a more realistic reflection of actual soil conditions in its physically-based model inputs. Moreover, through considering the soil parameters uncertainty and simulating more soil samples to calculate failure probability values, the proposed hybrid model can further reduce the randomness of calculation than using FOS as inputs, thus reaching a better prediction capacity in the hybrid results.

5.3 Event-induced landslide susceptibility mapping

In many regions affected by typhoons, typhoon-related heavy rainfall event can lead to landslides on a large regional scale. Sometimes, as one typhoon-related heavy rainfall event just passes, another follows, and rainfall-induced regional landslides in this region may occur again. This phenomenon is quite common. However, fewer typhoon-related rainfall-induced regional landslide susceptibility assessments have been carried out. Previously, in most rainfall-induced landslide susceptibility assessment studies, data-driven models based on historical landslide inventories and conditioning factors have been commonly used, but their triggering factors have seldom been emphasized. Due to the lack of temporal information in multi-source historical landslide inventories, rainfall factors are typically treated similarly to other environmental or geological conditioning factors by using annual average rainfall data. Therefore, we aimed to address this issue by considering rainfall-related results calculated by the physically-based model as an additional conditioning factor to assess the influence of typhoon-related rainfall events on a regional scale. This can also be seen as the event-induced landslide susceptibility mapping. Further, there is published research about that. Dai and Lee (2003) proposed a method using real rainfall data as an independent variable in typhoon-induced shallow landslide susceptibility assessment which provides new insight into rainfall-induced regional landslide susceptibility mapping. Subsequently, Lee et al. (2008) conducted study with four typhoon rainfall-induced landslide inventories, confirming the effectiveness of the method in predicting subsequent typhoon-induced regional landslide events in nearby areas. In this study, we select Shengzhou County as an illustration, as it was significantly affected by typhoon “Jongdari”. The proposed hybrid method incorporates a data-driven model and a physically-based model. Initially, the data-driven model adopted a typhoon event-related landslide inventory to establish a basic framework. Subsequently, the physically-based model, which does not consider the actual typhoon-related landslide inventory data, relies solely on the regional terrain data, slope angle, soil type, and the triggering factor - rainfall, to predict the landslide-prone area through reliability analysis.

While the data-driven model uses the landslide inventory caused by a typhoon event, which can also be seen as historical landslide data in the subsequent typhoon events in this region, the physically-based model’s results are primarily influenced by the rainfall event, showing no need for actual typhoon-related landslide inventory data. This also gives this study the potential for application. It may further be applied to the study area experiencing subsequent rainfall events by updating the rainfall event inputs in the physically-based model to generate new failure probability value. With the data-driven model establishing a basic framework, coupled with the inputs from the updated failure probability value, thus a new landslide susceptibility maps can be created with the new rainfall event.

6 Summary and conclusions

Traditionally, data-driven model and physically-based models are widely used for landslide susceptibility mapping, but each method has limitations. This paper introduces a novel hybrid method that combines the data-driven and physically-based models, aiming to improve the prediction capacity of rainfall-induced regional landslide susceptibility mapping by explicitly considering uncertainty in soil properties. The proposed method is implemented with a case study in Shengzhou County, Zhejiang Province, China. In the case study, logistic regression serves as the data-driven model, and RARIL serves as the physically-based model. Three hybrid models are developed: hybrid model I, which explicit consider the soil parameters uncertainty, hybrid model II and hybrid model III, which do not consider it and with different hybrid types. The comparison results indicate that all the hybrid models outperform conventional logistic regression and RARIL model. Notably, hybrid model I, which consider soil parameter uncertainty, exhibits greater prediction capacity compared to hybrid model II and hybrid model III that do not consider it, demonstrating the importance of considering uncertainty in soil properties for more accurate landslide susceptibility mapping.