Introduction

Slope movements are considered to be one of the main hazards and risks in mountainous areas (Van Westen et al. 2006). Among these phenomena, landslides and rock-fall are the most widespread. They are characterized by the displacement of a more or less coherent materials (soil, marls, clays ora mixture of several type of materials) or rocks (blocks, rocks) along a shear surface induced by gravity effect (Dikau 1999; Palucis et al. 2018; Chattoraj et al. 2019). These phenomena can be called as slope failures because the underlying debris that holds the slope in place fails to maintain stability due to the large thrust exerted by the mobilized land (Solaimani et al. 2013). Their velocity and intensity may depend on the nature of the displaced materials as well as the slope gradient (Mahdadi et al. 2018; Cui et al. 2019). Slow moving landslides (i.e., with a velocity of few centimeters per year) can seriously damage infrastructures, especially when volumes are significant (Frattini et al. 2018). Rapid landslides provide little warning and cause a large impact, especially near developed areas (Marc et al. 2015; Zygouri and Koukouvelas 2019). Landslides are considered a major risk in the world and they can be disastrous for human life and property (Van Westen et al. 2011). According to the rate mortality, these phenomena represent approximately 9 % of all natural risks in the world (Mousavi et al. 2011) and cause approximately 1000 deaths per year worldwide and approximately four billion (dollars) property damage only in the USA (Lee and Pradhan 2007).

Algeria is one of the countries that are widely affected by these phenomena. Landslides have seriously affected housing and various infrastructures (buildings, roads, bridges, dams, and railways) during the last twenty years (Mahdadi et al. 2018; Karim et al. 2019). Coastal and mountainous zones are the most landslide prone areas of the country (Lee and Pradhan 2007). Each year, especially in winter, extreme rainfalls cause significant landslides affecting the roads in mountainous regions and cause the isolation of several hundred homes. In recent years, remedial works have been carried out to repair damages on the roads with hundreds of thousands dollars spent by the government (Manchar et al. 2018). Because the potential problems induced by landslides are related to poor slope planning and little attention by end-users, the Algerian authorities are warned against the damages induced by landslides and the seriousness of their management and prevention. Therefore, recently, the authorities have highlighted the need to consider this hazard in local development planning. Some efforts have been made to identify phenomena location, improve the inventory with the characteristic of phenomenon, and prevent their occurrences the main goal is to produce landslide susceptibility, hazard, and risk maps for specific areas.

Located in the northern fringe of Algeria, the Fergoug watershed is very prone to landslides affecting infrastructures. In order to follow the government instructions and to improve the landslide mapping practices for regular objectives, we are interested in how to map landslide susceptibility with simple data-driven methods based on available data. The defined methodology should be easily reproducible and easily understandable by local authorities to be applied on other territories. To carry out this work, the Fergoug watershed was chosen as a case study to assess the susceptibility using three methods answering the above-mentioned points namely: the frequency ratio method (RF), the value of information method (IVM), and multiple linear regression method (MLR). After a sensitivity analysis of the different spatial variables on the models, the best maps obtained for each approach are compared through two statistical tests. The results are discussed in order to choose the most adapted approach to the local context. The best map obtained could be used as a basic support for planning small or large projects while avoiding areas highly exposed to landslides.

Study area

The Fergoug watershed, covering an area of 169 km2, is located in the Beni Chougrane Mountains, which is part of the western Tellian Chain of Algeria (Fig. 1a). This young mountain area was structured during the Alpine phase and slowly continues to structure itself (Upper Miocene) (Thomas 1985). These mountains rise from 50 m (a.s.l.) near Mohamadia from 935 m (a.s.l) near Chareb Errih in the east of Mascara. Structurally, the two plains (Ghriss and Habra) are separated by the very tectonized ridge of the Beni-Chougrane mountainous in which the Fergoug watershed is part (Fig. 1a).

Fig. 1
figure 1

Geographic and geomorphologic frameworks of the study area, a: map showing the western Tellian chain related to the Northern part of Algeria (small map in the left corner), b: Digital elevation model showing the topographic features of the study area, the red circles represent the landslides locations, the white dashed line represents the Sidi Daho Synclinorium (SNCL SD); c and d: topographic cross section related to the dashed line in b. The abbreviations FR and FD are respectively Fergoug River and Fergoug Dam

The Fergoug watershed is located northeast of the Tessala Mountains, south of Mohamaddia on the axis of the Macta River (Fig. 1a). The morphology of the study area is irregular with mountainous areas on either side of the Macta River (Fig. 1b). These mountains are cut by a network of fairly dense rivers sometimes feeding directly into the Fergoug dam, making this dam the collection point for rainwater discharged by tributaries of the main wadi (Fergoug, (Thaghzout and Hammam; Fig. 1c, d). Among this dense network, it is possible to observe small plains corresponding to ancient fluvial terraces (Fig. 1b). These mountains have a typical Tellian direction with an ENE / WSW orientation coinciding with the anticlinal fold systems affecting northern Algeria. This area is also affected by faults of various nature and direction ranging from N140° (along which the main wadi of Fergoug is located), N70° to N20°. They are represented respectively by dextral or senestral strike slip and reverse faults sometimes limiting the folded structures. The anticlines reveal an ancient core represented by friable Cretaceous formations, notably marl and marl-limestone (Fig. 2b). In the center, the lithology consists of friable Miocene rocks such as marls, sandy clays, clays, and sandstone clays. These formations are overlain by hard Pliocene rocks (sand, sandstone, and limestone) forming the main ridges (Fig. 2a). The soils derived from the weathering of these soft Miocene materials are rich in swelling clays favorable to runoff when surface conditions permit (Stark and Hovius 2001; Zaagane et al. 2015), particularly when the Mediterranean-influenced climate manifests itself in the form of violent thunderstorms with intensities greater than 100 mm h-1. These particular episodes can be frequent especially in autumn and winter when vegetation cover is very limited (Mekerta et al. 2008).

Fig. 2
figure 2

Geological map of the Beni-Chougrane Mountains (geographic coordinates) a: 1, eruptive rocks; 2, Jurassic limestones; 3, ante-nappes formations; 4, marl (early Miocene); 5, conglomerate and sandstone (early Miocene); 6, continental formation of Bou Hanifia; 7, basis sandstone; 8, Bleu marl; 9, Lithothamniees limestone; 10, El Bordj sandstone formation; 11, diatomitic formation; 12, gypsyfour and marly gypsifourous; 13, bleu marl; 14, sea sandstone; 15, continental sandstone and silt at Helix; and 16, lake deposit; 17, limestone crust; 18, alluvium; 19, watercourse. The black line disposed perpendicular to the alignment of the chain represents the geological cross section, b: geological cross-section along the Beni-Chougrane Mountains showing the geologic features (before and after-nappes). For the abbreviations, ANT is and SNL are respectively anticline and synclinal, BG is Bel Ghrib, MRDJ is Merdja, CRBH is Cherb Errih. FRG is Fergoug; PRG is Mohamadia O. is Oued e.g (Wadi or rivers)

A characteristic feature of the manifestation of these intense rainfalls on an arid Mediterranean landscape is the development of badlands. These erosion processes have been widely described in the literature and generally attributed to environmental changes (Cánovas et al., 2017; Mosavi et al. 2018). These changes are induced either by climatic oscillations or by prioritized human activity, which leads to increased runoff due to vegetation cover degradation (Roose et al. 1993). These environmental conditions of a degraded Mediterranean environment are also conducive to the initiation of landslides affecting many infrastructures in two ways:

  1. (i)

    Directly: for example on the road connecting Mascara to Mohamadia, many cases of destruction were observed (Fig. 3). Landslides affect the roadways, generally located on slopes more or less steep, mobilizing few part or large part of the road (Fig. 3a, b, c, d).

  1. (ii)

    Indirectly: the landslides join the erosion areas along the banks of the Fergoug River mobilizing a large part of the land. Materials are transported during floods towards the Fergoug dam located downstream, thus causing its silting (Fig. 4). These landslides are triggered in different environments along the main course of the Fergoug River on dense forest cover (Fig. 4a, b) or in upstream tributaries, affecting both bare ground (Fig. 4c) and ground with temporary vegetation cover (Fig. 4d, e).

Fig. 3
figure 3

Roads affected by landslides in different location, a: road connecting Mascara to Mohamadia affected as a whole by a large landslide (dark line represents the road which leads from Mascara to Mohamadia), b: the landslide seen from one top, the dashed lines show the limits of the landslide area, c: the cliff corresponds to the rupture zone, even the device designed to protect the road was affected by the great thrust generated by landslide, c: landslide affecting part of the road connecting Ain Fares Mohamadia in which we can see the beginning of a slip affecting thisroad recently built, dashed lines are related to cracking relatives to the rupture zone. The black and white arrows show the direction of movement of the materials

Fig. 4
figure 4

Several landslides happened along the Fergoug river, the moved material is transported by fluvial waters to the Fergoug Dam during the floods season, a: important landslide characterized by the presence of a dense forest cover and b: landslide happened along river not far from the Fergoug dam, c:Miocene marl affected by a small landslide not directly in the Fergoug River it happened on one of its tributaries, d: soils moved because a landslide close to the road and e: many landslide located in the oriental slope in the central part of the study area

Landslide susceptibility mapping: background and description

The production of a landslide susceptibility maps is the first step in the landslides risk assessment (Corominas et al. 2014). Over the past decade, advances in modeling and geographic information systems (GIS) have provided several of quantitative methods for mapping susceptibility to landslides (Van Westen et al. 2006). Various quantitative models have been successfully implemented with several datasets representing the predisposing factors (Reichenbach et al., 2018). These models are based on the assumption that landslides follow common environmental, physio-geographical, and geotechnical behaviors under similar conditions in the past, present, and future (Ghimire 2011; Corominas et al. 2014). To achieve correctly a landslide susceptibility map by a quantitative method, a preliminary stage requires the preparation of a database about the landslides, with an inventory compiling several information (i.e., location, typology, activity, and morphometric indexes) (Zaagane et al. 2015; Thiery et al. 2020) and different maps representing the predisposing factors. Once the data prepared, the inventory is confronted with the different factors listed (Fig. 5) in order to establish the relationships between events and the most influential factors

Fig. 5
figure 5

Overall methodology flowchart showing different stages of process such as data inventorying, data preparation, computations of differents LSM and validation

Depending on the approach chosen, the results may differ (Thiery et al., 2007). It is sometimes necessary to compare certain approaches with other in order to choose the best one suiting the study site and the different field observations (Thiery et al. 2014). In this study, the choice of the reliable susceptibility map suitable for the study area was guided by the degree of performance of the model used in the development of the final map. To compare them, the best way is to use the same dataset and the same strategy of calibration, validation with simple tests of comparison. In Figure 5, we illustrate the strategy used for this study. Results can be used as a basic support showing the probability of occurrence of landslides according to different susceptibility classes (Guzzetti et al. 2012; Caiyan and Jianping 2009; Pradhan and Lee 2010; Erzin and Cetin 2013; Pourghasemi et al. 2012; Karim et al. 2019) (Fig. 5). Among these methods, bivariate and multivariate methods (Pourghasemi et al. 2012; Lari et al. 2014; Hungr 2018; Robbins 2016; Nsengiyumva et al. 2019) become standards in the scientific community for mapping landslide susceptibility at regional and local scales (Thiery et al. 2020). Bivariate methods have the advantages to explore simply the relationships between landslides and factors or combination of factors. They are simple to implement and yielding results easily understood by practitioners. Multivariate methods can be a little bit more complex to implement because they need, sometimes, the used of specific statistical packages, and need to explore positive relationship of landslides and non-landslides. Moreover, they are more difficult to explain to end-users. Nevertheless, for both methods, their degree of prediction based on the analysis of phenomena taken and not taken into account in the calibration phase is generally higher than 80%, especially if the data used are accurate and validated by the expert.

Therefore, three data-driven methods have been selected with two bivariate (the frequency ratio — FR, and the information value — IVM) and one multivariate method (the multiple linear regression method — MLR). The goal is to choose the best suitable method for the test site.

Frequency ratio model

The frequency ratio model (FR) has been applied for several case studies in order to predict areas likely to generate landslides (Yalcin et al. 2011; Park et al. 2013; Pradhan and Lee 2010; Manchar et al. 2018; Gholami et al. 2019). The frequency ratio, as a leading probability model, is based on the observed spatial relationships between spatial distribution of landslide and causal factors. It is used especially to reveal the correlation between landslide locations and the factors in the study area. Consequently, the FR can be used to quantitatively assess the landslide susceptibility.

Technically speaking, FR is based on the association between landslides and predisposing factors with the computation of a ratio between the number of landslides in a predisposing factor class, the total number of landslides, the area of a predisposing factor class, and the total area of the predisposing factor (Lee and Pradhan 2007). Then, FR can be compute using Eq. (1):

$$ \mathrm{FR}=\frac{\raisebox{1ex}{$N\genfrac{}{}{0pt}{}{p}{i}$}\!\left/ \!\raisebox{-1ex}{$N$}\right.}{\raisebox{1ex}{$N\genfrac{}{}{0pt}{}{lp}{i}$}\!\left/ \!\raisebox{-1ex}{${N}^l$}\right.} $$
(1)

Where \( \mathrm{N}\genfrac{}{}{0pt}{}{\mathrm{p}}{\mathrm{i}} \) is the number of cells with landslides in factor class i, N is the total number of cells with landslides. \( \mathrm{N}\genfrac{}{}{0pt}{}{\mathrm{lp}}{\mathrm{i}} \) is the total number of cells (i.e., with and without landslides) of factor class i, Nl is the total number of cells (i.e., with and without landslides) in the entire study area (Javier and Kumar 2019).

Once the ratio of each independent factor class was calculated, the landslide susceptibility index (LSI_FR) is obtained by summing all (FR) of all factors, as indicated in Eq. (2) (Yilmaz 2010):

$$ \mathrm{LSI}\_\mathrm{FR}=\sum \mathrm{FRj} $$
(2)

Where j = 1 to n, and n is the total number of factors.

The model obtained shows the degree of correlation between landslides and independent factor class. The higher value of the ratio (FR > 1) shows the large relationship between the landslides occurrence and the factor class; however, a lower ratio value (FR < 1) indicates a low probability of landslide occurrence (Yalcin et al. 2011).

The information value model

The information value model has been widely used to map sensitivity to landslides. For instance, in 2012, Pereira and his colleagues used the Information Value Model (IVM) to assess the role of different combinations of causative factors in the triggering of shallow landslides in different parts of northern Portugal. Many authors have found that the value of information (IVM) model is very useful to define the degree of influence of the individual causative factor responsible of landslides (Champatiray 2000; Arora et al. 2004; Champatiray et al. 2007; Kanungo et al. 2009; Caiyan and Jianping 2009; Pereira et al. 2012; Balsubramani and Kumaraswamy 2013; Passang and Kubíček 2018).

The principle is very close the FR method. It is based on the determination of the influence of each class of predisposing factors on the onset of landslide in a given area. In this model, the information has the value IVM for a class in a thematic layer as indicated in the following equation (Chen et al. 2019):

$$ \mathrm{IVM}=\log \left(\frac{\mathrm{Conditional}\ \mathrm{probability}}{\mathrm{Prior}\ \mathrm{probability}}\right)\left(\frac{\frac{Si}{Ai}}{\frac{S}{A}}\right) $$
(3)

Where Si is a number of landslide pixels in factor class i, Ai is the number of pixels in a given class i, S is a total number of landslide pixels in the study area, and A is a total number of pixels in the entire study area. The weights of all factor classes were calculated through the ratio of landslide density of each factor class to the landslide density of total area, or the information value can provide the landslide probability in each class and in the total area (Table 1). If IV > 0.1, the factor classes will have the highest probability of landslide occurrence, but factor classes with negative values indicate the presence of a factor with no significant contribution to landslide occurrence. The landslide susceptibility index (LSI) was calculated as follows:

$$ \mathrm{LSI}\_\mathrm{IVM}=\sum \limits_{i=1}^M{X}_{ij}\ast \mathrm{IVM} $$
(4)
Table 1 Summary of different types, levels, and complexity of landslide susceptibility and hazard zoning according to the objectives (modified from Fell et al., 2008), where TU = available for any use, MSP = available for public utility mission; P = paying service; SD = on request; CNTS = National Space Technical Center, SPC = Bridge and roadway service, SF= Forestry service; SCGA = Geologicl servey of Algeria; +++ =essential; ++ = important; (*) = precaution (data verification); (**) = verification of the scale of application; I = inventory; S = susceptibility; A = hazard

Where Xij = 1 is the class i exist in factor j and 0 if class i does not exist in factor j, M is the number of class considered (Passang and Kubicek 2018).

Multiple linear regressions

MLR method is one of the first multivariate method applied to assess and to map landslide susceptibility. The method was used in different regions of the world (Felicísimo et al. 2013; Onagh et al. 2012; Felicísimo et al. 2013; Erzin and Cetin 2013). The multiple regression model for some authors appears as an alternative to the bivariate method with better results. This method shows that the sensitivity of landslides changes with the standard deviation of independent and predictor variables (Yalcin et al. 2011). However, some studies show very similar results.

In statistics, MLR is a mathematical regression method that extends simple linear regression to describe changes in a dependent variable associated with changes in several independent variables following this equation (Lee and Sambath 2006):

$$ \mathrm{LSI}\_\mathrm{MLR}={a}_0+{a}_1{\mathrm{X}}_1+{a}_2{\mathrm{X}}_2+\dots +{a}_n{\mathrm{X}}_n+{\upvarepsilon}_i\kern0.5em $$
(5)

Where εi is the error of the model which expresses or summarizes the missing information in the linear explanation of the values of LSII_MLR from X1, ..., Xn (specification problem, variables not taken into account, etc. ). The coefficients a0, a1, ..., an are the parameters to be estimated.

Data and modeling strategy

The strategy used to assess landslide susceptibility and to obtain the best landslide susceptibility map is split in 4 steps with: (i) the development of a new inventory of phenomena; (ii) the preparation of spatial variables represented by predisposing factors and aggravating factors; (iii) computations of different landslide susceptibility maps with the three selected methods; (iv) validation of results and comparison of maps.

Landslide inventory

Landslide inventory is a key step to assess landslide susceptibility and hazard (Lee and Pradhan 2013). Several authors as Gzzetti et al. (2006), Corominas et al. (2014) and Thiery et al. (2020) consider this operation as a basic element for all work aimed at the susceptibility mapping. In the Fergoug watershed, the landslide inventory is based on diachronic interpretations (Nichol and Wong 2005; Guzzetti et al. 2012) of aerial and satellite imageries (Landsat imagery and under Google Earth) taking into account of the resolution and the characteristics of each imageries (i.e., tonal, contrast, size, shape, and shadow, as well as contextual indicators such as position and direction of phenomena; Liu et al. 2004). The remote sensing inventory has been completed by field investigations and deep consultation of old reports.

One hundred forty-two landslides have been identified, recorded, and mapped. Several information relating to landslides have been recorded in a spatial database, with (i) the geographic situation, (ii) the typology (translational, rotational, solifluxion and muddy flow), (iii) the surface (in km2), and (iv) all the factors involved in its initiation. The majority of landslides identified not exceeded an area of 0.01 km2 (89% of landslides < 1 ha) while 10.56% > 0.01 km2 with an area reaching up to 0.0749 km2). The total area of the landslides identified is approximately 11.14 km2, which represents approximately 6.61% of the entire surface of study area. Phenomena affecting this area can be superficial < 20 m or deep > 20 m. According to their form, (i) 119 cases landslides are translational located upstream of the Fergoug River or (ii) 23 cases are rotational located mainly along the same river (Fig. 1b).

Conditioning factors

The causes of the triggering and occurrence of landslides are truly complex and diverse. There is not a clear agreement to the precise reasons for their manifestation (Corominas et al. 2014). The complex nature of landslide development has led many researchers to study how the occurrence of landslides could be affected by various conditioning factors (Mugagga et al. 2012). Among them, we quote predisposing factors with (i) topographic factors (e.g., slope, aspect, curvature) and geological factors (e.g., earthquakes, cave-in collapse); (ii) aggravating factors (e.g., floods, snowmelt and rainfalls; Zêzere et al. 1999).

Predisposing factors

Three topographic factors were used for this analysis (slope, aspect, curvature, stream network). The topographic factors were extracted based on high-precision SRTM DEM data. It provides a quasi-global representation of the Earth’s surface at a medium spatial resolution (30–90 m). This data is obtained from Shuttle Radar Topography Mission (SRTM) and is available in website https://earthexplorer.usgs.gov/ with medium resolution 30 m/ cells of resolution.

Slope gradient is considered to as one of the main factors conditioning the triggering of landslides (Van Westen et al. 2008; Petley et al. 2005; Kuriakose et al. 2008). The value derived from the DTm is divided into five classes consistent with a 10° interval for each (> 10°) (10°–20°), (20°–30°) (30°–40°), and (< 40°) (Van Westen 2006; Petley et al. 2005; Kuriakose et al. 2008) (Fig. 6a). Slope aspect is considered as a influencing factor which controlled the landslides triggering (Pradhan and Lee 2010; Schlögel et al. 2018; Pourghasemi et al. 2012; Yan et al., 2020). Slope aspect is defined as the direction in which a slope is oriented and it relates to the degree of exposure to the sun. This factor also influences the plant cover, the daily temperature, and relative humidity ranges of a slope (Chen et al. 2019). For this purpose, the slope aspect map has been classified in 8 distinct classes (N, NE, E, SE, S, SW, W, and NW); the distribution of landslides related to slope aspect can be shown in Figure 6 b. Another factor that can be considered in landslide hazard analysis is topographic curvature. Indeed, slopes curvature profile characterizes the morphology of the topography and has a positive influence on the onset of landslides. The slope curvature influences the training and resistance stresses in a landslide in the direction of motion and controls the change in speed of mass motion flowing down the slope The value of the curvature can be either above, below, or equal to zero, representing the convex, concave, or flat shaped curvatures, respectively (Fig. 6c).

Fig. 6
figure 6

Causal factors responsible for triggering landslide a: slope, b: aspect, c: curvature, d: tectonic fault, e: lithology, f: land cover, g: stream network, h:rainfall, i: road traffic and j: earthquakes. The white circles represent the spatial distribution of landslide

Geo-environmental and anthropogenic factors were involved in this analysis by a landcover spatial data. The land cover map was obtained from supervised classification of Landsat (Oli_8) and compared to the map developed by forestry services. This map of has been prepared and classified into four distinct categories (dense forest, sparse vegetation cover, pasture areas, and totally bare soil) (Fig. 6d). (Table 1).

Geological factors were split in three different spatial variables with (i) a lithological map; (ii) a characteristics of materials; and the (iii) structure with integration of faults. The lithological map was established by digitizing two geological maps (Mascara and Mohamadia maps with 1/50000 scale) (Dalloni 1936). The geological section contains geologic formations dating from Cretaceous to Plio-Quaternary. According to the friability of the rocks, these formations were classified into 4 categories and (crumbly rock, soft rock, competent rock, and very resistant rocks) (Fig. 6f). The characteristics of the material itself result from the bedrock, weathering may be weak or fractured, or different layers may have different strengths and rigidity. For this purpose, three geological factors were taken into account for this analysis. The structural map is based on fault lineaments extracted from the old geological maps, and drawn from the morpho-structural analysis using photo-interpretation of satellite images coupled with a shaded DEM. This data represents the distance to fault map was constructed by generating buffers along the fault lines. This distance was classed into 4 classes (0–250 m), (250–500 m), (500–1000 m), and (> 1000 m), more the area is located near the fault line, the risk of slipping is very likely (Fig. 6e).

Aggravating factors

Aggravating factors can depicted as factors s accentuating the possibility to have landslide occurrences without actually triggering the phenomena. For example, a series of earthquakes can accentuate the fragility of materials without triggering landslides, or the fact that rainfall is higher than the annual average for some areas will induce a greater sensitivity for intense rainfalls for some land. Four aggravating factors have been selected with (i) mean precipitation data; (ii) seismic activity; (iii) stream network; and (iv) anthropogenic influence.

In our study area, landslides are triggered mainly following rainfalls, which can be intense and sudden. Due to lack of daily rainfall data, we used an annual mean precipitation map, produced using rainfall data from three weather stations located around the study area. The selected period is 10 years (from 2004 to 2014). The map is obtained by applying the ordinary Kriging method. The final map was prepared and reclassified into four classes based on the annual average rainfall (380–410 mm), (410–440 mm), (440–470 mm), and (470–500 mm) (Fig. 6g).

An earthquake or series of earthquake can accentuate the slope instability by the inertial loading it imposes or by causing a loss of resistance of the materials of the slope (Hadji et al. 2016; Yang et al. 2017; Huang et al. 2017). The Beni-Chougrane region, as all areas of northern Algeria, is characterized by moderate seismic activity (Refas et al. 2019). Indeed, large earthquakes shook this area in 1967 and 1994 (Benouar et al. 1994; Bezzeghoud and Buforn 1999). Therefore, the seismic sources were extracted from the international database (ANSS), of which 6 large sources located in the vicinity of the Fergoug watershed. A buffer was applied around these sources. Based in this strategy, the prepared raster map was classified into 4 classes representing by distances (> 500 m), (500–1 km), (1–3 km), and (> 3 km) (Fig. 6h).Remote areas of seismic sources have a low probability of generating a landslide.

The distance to stream networks can a fundamental parameter controlling the landslide occurrences. We can then consider that the basal cutting of the banks can induce landslides by foot failure. In the Fergoug watershed, three rivers cross it diagonally with lengths 24.9, 12.7, and 9.70 km respectively for Fergoug, Taghzout, and Hammam rivers. Moreover many tributaries spread out along the study area. Then, we extracted the water system from the SRTM-DTM, and buffers of a distance of 50 m were applied. The raster map was coded using a distance-based strategy. According to this distance, five classes have been highlighted: (< 50 m), (50–100 m), (100–150 m), (150–200 m), and (> 200 m) (Fig. 6i).

Human activities and road construction works (digging, excavation, and trenching) can be an important aggravating factor for landslide occurrences. The slope gradient can be artificially increased inducing a higher failure probability.

Beyond this increase in local slopes, many authors have shown along roadways the effect of traffic on landslide initiation (Shipway et al. 2015; Voumard et al. 2013; Jotic et al. 2018). Indeed, this road traffic can cause incessant vibrations that then affect the stability of the slope. To determine this effect, we mapped out all roads crossing the Fergoug area and then proceeded to apply buffers at random distances (500 m), (1 km), (2 km), and (> 2 km) (Fig. 6j). This strategy is based on the fact that the closer the area is to the road, the higher the probability of triggering a landslide.

Landslide susceptibility modeling

For each selected method, the procedure to compute landslide susceptibility map is the same with a confrontation of the events (i.e., landslides) with the factors. The reliability of each model used to produce susceptibility map is carried out through a classical procedure based on the computation of receiver operating characteristics (ROC) and the area under curve (AUC). The tests were performed by comparing known landslide location data with the landslide susceptibility map. To determine the reliability of three models of landslide susceptibility (FR, IVM, and RLM) applied in this study, we performed a check using the receiver operating characteristic curve, or ROC curve, which is graphical representation created by plotting the true positive rate (TPR) known as sensitivity against the false positive rate (FPR) known as specificity and this at various threshold settings (Pourghasemi et al. 2012). By tradition, the ROC curve is plotted using False Positive Rate on X-axis (shown in Eq. (6)) and True Positive Rate on Y-axis (shown in Eq. (7)):

$$ X=\mathrm{Specificity}=1-\left[\frac{TN}{TN+ FP}\right] $$
(6)
$$ Y=\mathrm{Sensitivity}=\left[\frac{TN}{TP+ FN}\right] $$
(7)

Where TP (True positive) is the number of landslide pixels correctly classified in the landslide class, and TN (True negative) is the total number of non-landslide pixels correctly classified in the non-landslide class of ground. FP (false positive) is the number of landslide pixels classified in the non-landslide class; FN (False negative) is the non-slip pixels classified in the class of landslides (Xu et al., 2012).

Usually, AUC curves range from 0.5 to 1.0. The model with a higher AUC is considered the best. If the area under the ROC curve (AUC) is close to 1, the test result is excellent (Hong et al., 2016). On the other hand, if the model does not predict well, then this value will be close to 0.5. Related to test value, it can be divided in five class, the value ranges from 0.50 to 0.60 (poor), 0.60–0.70 (average), 0.70–0.80 (good), 0.80–0.90 (very good), and 0.90–1.00 (excellent) (Pourghasemi et al. 2012).

Results and discussion

Results are analyzed in three steps with (i) analysis of predisposing factors, (ii) analysis of aggravating factor, and (iii) analysis of final maps. Each part is discussed. By this way, we explore and analyze simultaneously the influence of each predictive factor and how each model how each model behaves in relation to the data.

The results in Table 2 show that the values obtained by the information value model (IVM) are almost consistent with the values of the frequency ratio (FR). The statistical correlation between landslides and their independent factors was determined by the multiple linear regression (MLR) model. Based on constant values of regression coefficients, the landslide occurrence predicting equation was obtained as follows:

$$ LSI\ (MLR)=0.692\hbox{--} 0.194\ (Roads)-0.086\ (Vegetation)+0.175\ (faults)+0.085\ (exposition)+0.138\ (Slope)-0.013\ (Lithology)-0.048\ (Rainfall)+0.208\ \left( Stream\ network\right)+0.189\ (Earthquakes)+0.027\ \left( Curvature\ topography\right). $$
(8)
Table 2 A spatial relationship between listed landslides and conditioning factors in the study area. For abbreviations, FR = frequency ratio, IVM = information value model and MLR = multiple linear regression

Analysis of predisposing factors

Slope aspect bounded for (NE, E, SE, and S) has rational correlation with landslide occurrence; however, the higher value (FR = 5.87) marked the Northern slope exposition (Fig. 6a). This can be explained by the morphological disposition of the mountainous chain. Indeed, Beni-Chougrane Mountains are oriented ENE-WSW and the ridges divide slopes into two categories, Northern and Southern slopes. These mountains are anticlines limb with steep slopes. These anticlines are overturned to the south; therefore, the north-facing slopes are larger than the southern facing, and are characterized by a significant number of landslides, the north-facing slopes are wetter than those facing south. Among the slope categories, high landslide frequency ratio is observed in slope smaller than 10° (FR = 2.09). This is due to the total lack of sufficient plant cover to protect these lands. Knowing that Fergoug forest is located in the Western part and occupied the higher altitudes. In addition, these lands are the result of anarchic exploitation by farmers. The frequency ratio is less than 1, indicating a lower probability of triggering a landslide in the slope class ranged between 10° and 20°. However, the areas which correspond to slopes ranging from 20° and higher than 50° are exposed to very significant chemical and physical alteration processes and there are characterized by higher value (> 1). This is due to the rising of shear stress in steep slopes under the gravity effect (Fig. 6b).

The relationship between curvature and landslide indicates that they generally occur in or near concave slopes where FR values are significant and are greater than 3. Our results show also that the FR values are more significant in concave slope and less significant (< 1) in the flat slope (Fig. 6c).

The results obtained using the FR model show that regions most sensitive to landslides are distributed in areas situated near faults (Fig. 6d). FR value of 3.39 is obtained for areas closer to the fault line which are characterized by a distance not exceeding to 0.25 km. These areas represent 71.68 % of landslide occurrence. Our results show also that the less important FR is obtained for area located far from the fault line. Indeed, the FR ratio is −1.34, − 1.4, and − 3.76 respectively for zones situated between 0.5 and 1 km, between 1 and 2 km, and beyond 2 km.

Values of FR = 1.83 and are attributed to zones which is characterized by a crumbly rock in particular marls and clays. Due to the presence of clay and marl particles exposed to water, these reach very quickly the liquidity limit and the lands slide easily. Resistant rocks are characterized by very less value (FR = 0.19) (Fig. 6e).

The denuded areas are represented by significant value FR = 9.49. However, FR is less than 1 in areas characterized by permanent forest cover (Fig. 6f).

The results shown in Table 2 show that values obtained by the information value model (IVM) are almost consistent with the values of the frequency ratio (FR). In fact, the most significant values are those which correspond to approximate areas with faults. IVM values are also important in areas characterized by IVM values of 1.06, 0.88, and 3.25 respectively for steep slopes, crumbly rocks, and bare soil.

According to the equation (Eq. (8)) obtained, 6 parameters statistically significant; however, 3 parameters generally considered as less important. For faults, the regression coefficient Cr have positive value respectively (Cr = 0.175). Indeed, this topographic factor is considered to be one of the determining factors in the triggering of landslides, and represents the most effective parameter in terms of landslides occurrence. This shows their decisive role in promoting landslide activities.

Slopes, curvature, exposure, and lithology have proven to be moderately important in the current context with low values respectively 0.138, 0.085, 0.027, and 0.013.

Coefficients related to the vegetation and roads were recorded with negative Cr values (− 0.086 and − 0.019), which proves that these factors are less important to promote landslide activities.

Analysis of aggravating factors

Our results show that 66.73% of landslide occurrence is reported to areas which correspond to distances of river ranges < 100 m. In these zones, the FR is marked by a higher value (FR = 3.44). The values of the FR become lower as we move away from rivers. FR is 0.84, 0.36, and 0.15 respectively for areas between (100–250 m), (250–500 m), and beyond 500 m (Fig. 6g).

This indicates a good correlation between these hydrological parameters and landslide occurrence. The erosion exerted in the meanders of a river continuously digs deep in the bed of the river, undermining and weakening the banks, and causing their collapse. This is caused by a breakage of the stopper of the foot and thus leads to undermining of the banks. To this end, areas closer to rivers (< 100 m) have a stronger relationship with landslides. Presumably, areas characterized by a dense hydrographic network have also, a high probability of landslides occurrence (Karim et al. 2019).

Our results show that it exists a very important relationship between annual and daily precipitation and the occurrence of landslides. This indicates that landslides correlate very well with high rainfall values (FR = 1.24) (Fig. 6h). This confirms that precipitation, especially rains, is the trigger for landslide. It has been found that cumulative precipitation of 10 to 30 days is mainly associated not only with the triggering of landslides but also with their reactivation.

In the case of our study area, this is can easily observed in the landslides affecting the Mohamadia road (Fig. 3a, b). Indeed each winter, the municipality implements remediation work after the damage caused by the landslide and this to ensure the continuity of road traffic. In the case of our study area, there is a positive dependent relationship between altitude and precipitation, landslides generally occur in high altitude areas surrounding in particular the Sidi Daho syncline (Fig. 1b). It should be noted that torrential rains are considered to be the main trigger for landslides in the Fergoug watershed. Several landslides are triggered during the wet seasons, especially in winter. The relationship between roads proximity and landslides indicates that it occurred generally near roads at distances smaller than 500 m (Table 2) where FR = 1.76. Several roads are typically affected by this phenomenon (Fig. 3 and Fig. 4d). Indeed, in the case our study, the degradation of the roads begins with a simple cracking due to the removal of a stabilizing abutment in downstream, and then these cracks develop to evolve to more or less spaced openings leading to the sliding of the entire facade (Fig. 6i). In most time, this degradation is due to the anarchic construction of roads by increasing the gradient of slope without thinking on the stability measures (Solaimani et al. 2013) (Fig. 6i) (Fig. 3d). In many parts of the world, earthquakes are frequently accompanied by gravitational instabilities, particularly landslides and rock fall (Marc et al. 2015; Li et al. 2016; Zaagane et al. 2016; Roback et al. 2018; Veh et al., 2019). During the twentieth century, nearly 80 earthquakes caused between 100.000 to 1.000.000 landslides that claimed tens of thousands of lives (Keefer 1984). The zones characterized by an important seismic activity are also the seat of several landslides. Indeed in our study, FR = 5.12 value characterizes all areas situated less than 1 km from the seismic source (Fig. 6j).

FR values decrease in areas far from seismic sources where landslides are rare.

Our results show that IVM values are very important for hydrographic networks, seismic sources, and roads. Indeed IVM values are 1.56 for areas located near river lines with distance not exceed at 0.1 km to river. IVM value (IVM = 5.12) characterized area situated in circle that the radius is less than 1 km. For the anthropogenic factors, the IVM value is more and characterized all areas situated near road with distance not exceed at 0.5 km.

For the MRL model, the results in Table 2 shown that the coefficient linked to the hydrographic network has very strong dependence (Cr = 0.208). This indicates that the distance from the river corresponds to a high positive Cr which confirms a higher dependence between the banks and the landslides. Earthquakes have a clearly positive coefficient (Cr = 0.189). Indeed, landslides are largely triggered by earthquakes all over the world which can greatly increase the destructive impact of earthquakes especially in mountainous regions. These triggered landslides represent a significant proportion of the total earthquake damage due to the destruction of buildings and the impacts of landslides on lifelines and roads traffics that can been used in case of the rescue efforts.

Rainfall effects are more represented with MLR models. Indeed, Cr values are negative where Cr = − 0.048. The spatial distribution of rainfall has a secondary effect on the triggering of landslides; however, it should be noted that the rainwater joins the main rivers along the slopes, and it is the flood waters which are responsible in triggering landslides.

Analysis of final maps

The LSI landslide calculated using the FR and IVM model was summarized in all factor classes as shown in the equation (Eq. (2) and Eq. (5)), then we ranked the LSI values at an equal interval. The high LSI value is closely related to the total combination of factors controlling landslides. We then produced a raster map by assigning the value FR to each class of factors. We proceeded in the same way to establish the LSI map using the IVM model.

To predict the possibility of landslide, the probability was calculated from Eq. (8) then converted to raster format (Table 2).

Figure 7 shows the raster maps of LSI (FR), LSI (IVM), and LSI (MLR). These maps are represented by values ranging from 0.0245 to 1.931, and between − 0.004 and 89.25 respectively for the FR and IVM.

Fig. 7
figure 7

Susceptibility maps produced using three models a: Frequency Ratio (FR) b: Information Value Model (IVM) and c: Multiple Linear Regression (MLR). The histogram in (d) represent the porcentage of each susceptibility class relating to (FR), (IVM) and (MLR) models

The values of the susceptibility map calculated by the MLR model show a range from 3.44 to 122.16. These values ranges were divided into five classes of susceptibility to landslides of equal intervals for the three models Very Low (VL), Low (L), Middle (M), High (H), and Very High) (VH) (Fig. 7).

The map produced using FR model shows a very important zone characterized by Very high landslide susceptibility located especially in the central zone of the Fergoug watershed especially around the Fergoug River (Fig. 7a). MLR model shows almost the same characteristic of susceptibility zone; however, the very high is less significantly compared to the LSI_FR map (Fig. 7b).

The very high and high susceptibility classes are well represented in the two maps produced by the FR and IVM models. Indeed, the very high susceptibility class presents a percentage of 58% for the Fr model and 47% for the IVM model (Fig. 7d). Although this percentage is less significant for the MLR model and is characterized by only 27% of the total area. However, the four classes are almost equal with percentages not exceeding 26% (Fig. 7d).

For the MLR model, the map produced shows the biggest zone represented by the Very low susceptibility class related to FR and IVM models (Fig. 7c). Indeed, 20% of the total area is represented by very low susceptibility; however, it is almost 10% for two models cited above.

For three models used in this study, the Fergoug River is considered as a very active in which almost the majority of the listed landslides are located along this zone.

The susceptibility map produced by the three models FR, IVM, and MLR shows very good spatial similarity. Indeed, the highest susceptibility classes are very important, and are listed on slopes greater than 40°, and along the main line of the river. The greatest probability is reported in this corridor located between the Mohamadia road and the Fergoug River. Towards the south of this watershed, the high density of landslides is located on both sides of the perched syncline of Sidi Daho where two factors have been combined. This combination represents the steep slope associated with soft geological rocks represented mainly by Miocene marls. This observation proves that the good correlation between conditional factors such as slope and hydrographic network as well as a landslide was illustrated by the three models (FR, IVM, and MLR).

Validation

Choosing an adequate model to map landslide susceptibility (LSM) and determining its accuracy is an important aspect of research on landslide sensitivity. Checking landslide susceptibility maps is very important. Consequently, three commonly used statistical parameters, the receiver operating characteristic curve (ROC) and the area under the curve (AUC), the standard error and the 95% confidence interval are introduced to check the susceptibility maps of landslides. AUC and ROC curves have also been developed using the Excel-based (Xstalt) extension (Lee and Pradhan 2007; Pourghasemi et al. 2012).

The results show that the FR, IVM, and MLR models used in this study give satisfactory results and show a very good precision in the prediction of the landslides susceptibility in the Fergoug area. Indeed, the AUC value is greater than 0.80 which means that the rate curve contains more than 80% of landslides in the three models (Fig. 8).

Fig. 8
figure 8

Area under curve (AUC) of the ROC curve for the landslide susceptibility map produced by FR, IVM and MLR models

Results show that the FR model acquired the highest AUC value (0.87), followed by the IVM model (0.83) and MLR model (0.81). The highly susceptible area with the prediction rate curve recognized by the FR model also includes more than 87 % of landslides, which illustrates that the precision of the FR model is the highest and that the FR model is the best model that can been used in this study.

It appears from this study, that during torrential rains, an intense runoff develops in the region of Fergoug. The tributaries discharge a large quantity of water and solid matter (soils, mixture of blocks and gravel uprooted upstream) to the main stream of Fergoug (Fig. 9a). Given the morphological aspect of this area, the materials transported by the floodwaters permanently erode the river bed. This erosion phenomenon occurs in two ways:

  1. (i)

    A vertical erosion causing a deepening of the river bed and thus leads to an increase in the gradient of slope release of forces thus leading to mass movements towards the bed of the river (Fig. 9b, c).

  2. (ii)

    A horizontal erosion resulting in the meanders erosion of this river leading to destabilize the slope (Fig. 9a). When the soils are slipped, this movement causes the widening of this river on the one hand and the modification of the flow of the fluvial network on the other hand (Fig. 9d). Along the Fergoug River, all these erosion products are transported to the Fergoug dam thus leading to its silting. This dam works with 5% of its total capacity. As a result, over 70% of the total surface area of citrus fields irrigated from dam is currently dry. Consequently, it is a real desperation for the local population that all of their economic activity is based heavily on the exploitation of water from the dam.

    Fig. 9
    figure 9

    Image showing the relationship between the Fergoug river and thelandslide triggering, a: the disposal of landslide along the river, especially in the meanders causing locally the change of the streamcourse b: scheme showing the initial stage in a failure slope, c: the deepening of the Wadi bed leading to the increase of the slope gradient,d: the triggering of landslides under the soil weight effect leading to the filling of the river bed by the mobilized land. For the symbols 4a, 4b please see the figure 4

Conclusion

Like all areas of northern Algeria, the Fergoug watershed has strongly affected by landslides phenomenon. Indeed, during our trip field, we noticed that several infrastructures, such as roads, rivers and dams, even bridges, and houses have been affected by this phenomenon. It seems that many factors acted together on the slopes and leading to the triggering of landslides. In the case of our study area, landslides are mainly due to factors classing into two types: (i) predisposing factors including aspect, slope, the curvature, faults, soft geological rocks, land cover, and (ii) aggravating factors represented by aggressive precipitation, earthquakes, stream network and roads. Two research steps were followed in this process. Firstly, the realization of the hazard maps was carried out by applying the FR, IVM, and MLR models chosen for their performance and their simplicity, then, the results obtained were tested using AUC curve These tests show that our results show very good accuracy for all three models. Indeed, our results reveal that the frequency ratio model (87%), the IVM model (83), and the MLR models (81%) have been highly accurate and can be classified in the category of very good precision (> 80). Despite its simplicity, the FR model shows more precision than IVM and MLR models.

In the FR model, the biggest class of landslide susceptibility is represented by high to very high class, accounting for more than 80% of the area, compared to 60% in the IVM model and 41% for MLR.

The multiple linear regression equation allows us to conclude that landslide causal factors (distance from faults and stream network, curvature, slope and earthquakes) have a significant influence on landslides.

The landslide susceptibility maps obtained as part of this study also showed that an area with significant landslide risk was located in the central part of the Fergoug watershed. The results also show that the lands most likely to cause this type of instability are spread around the Fergoug River bed.

The road that connects the Mascara town and the Mohamadia is used by thousands of car drivers every month. This road has a high susceptibility to landslides, while more than 5% of the inhabitants are distributed along the edges of this road and on extremely sensitive lands. To this end, the models used in this study have shown that susceptibility mapping is an essential tool for delimiting areas prone to landslides. These maps can be considered an important piece of information for decision makers. They have to be the basis for future plan decision and they are in line with the government’s desire to improve landslide hazard documents by being easy to implement and easily understood by practitioners.