Introduction

Landslides are a natural destructive phenomenon that causes material damage, loss of life, and significant damage to natural resources (Intarawichian and Dasananda 2010). This phenomenon represent a major risk for a property, often resulting in loss of life, economic losses, and high maintenance costs (Das et al. 2010). Northern Tunisia and especially the north west of the country suffers from this phenomenon. Marthelot (1957, 1959), Bonvallot (1984a et b) and El Aroui (2016, 2017) have developed a typology of forms caused by landslides and have determined the main causes of their trigger. By determining cause-and-effect relationships, this damage can be mitigated (Intarawichian and Dasananda 2010). The best solution to reduce its damages is to predict the zones of high susceptibility in order to mitigate their consequences. The assessement of the related dangers and susceptibility to the landslide has drawn the attention of both geoscientists and the local authorities (Carrara et al. 1991; Parise 2001; Krejci et al. 2002; Demoulin and Chung 2007; Nefeslioglu et al. 2008). Many technics were proposed in literature for landslide susceptibility mapping (Van Westen 1993; Soeters and Van Westen 1996). According to Feizizadeh and Blaschke 2012a, b, the GIS-MCDA method is increasingly used in landslide susceptibility mapping for the prediction of future risk areas. However, LSM methods based on GIS-MCDA are often used without any indication of the error rate or confidence in the results. The uncertainty of the results is explained by the large amount of parameters and the heterogeneity of the data sources. Changes in factor weights and decision methods have a significant influence on the ranking order of the criteria and may alter the results (Feizizadeh et al. 2012). In this paper, the fuzzy logic gamma operator has been used to integrate the factors controlling landslides for the first time in Tunisia. We have chosen this method because it is similar to human reasoning, based on linguistic model and for its rapidity operation and its high precision. To minimize the subjectivity of this method, the fuzzy membership values were derived from the frequency ratio of landslides. Then, The fuzzy gamma operator was selected for the integration of causal factors. Finally, to validate the resulting map, ROC curves have been prepared.

Study area

Landslides are well known in Northern Tunisia, which is characterized by steep slopes, heavy rainfall, and fragile lithology. These parameters have a great importance, especially at the level of the watersheds crossed by the national road n°11 that connects the cities of Mateur and Beja. It covers an area of 15,349 ha. Its central value of longitude is 36°55′N, and its latitude is 9°27′E. The altitude varies from 76 to 612 m, and the highest point is located in Jebel Antara. The average annual rainfall and temperature are respectively 650 mm and 17C°. The geology of this region is very complex, and the lithological units include several formations causing landslides. The instability of this area has not been the subject of any mapping study.Often these landslides are very close to the national road n°11, which constitutes a threat for the users of this road. The study location map is illustrated in Fig. 1.

Landslide inventory map

The Landslide inventory map shows mainly the spatial distribution of landslides (Yalcin et al. 2011) and is one of the main steps required to map susceptibility. In this study, the establishment of the inventory map of landslides is based on a Google Earth snippets and field work. A set of 147 landslides was detected in the study area and subsequently digitized for further analysis. Their locations are mapped in Fig. 1. For this analysis, all of the landslides selected were taken for building landslide susceptibility models and after they were used to validate it.

Fig. 1
figure 1

Location of study area; (I) and (II): examples of landslides characterizing the study area

Data preparation

Based on field work and review of previous works, nine factors was determined as primary for the occurence of landslides, namely, slope, aspect, curvatures, lithology, distance from faults, distance from watercourses, land cover, and rainfall. The Sentinel 2 image with a spatial resolution of 10 m was used to generate the land use map. ALOS PALSAR DEM with a spatial resolution of 12.5 m was used to generate drainage network maps, slope, aspect, and curvatures. Ancillary data such as 1:50000 geological maps and precipitation data were used. All data sets were rasterized in a 10 m × 10 m grid cell. Table 1 shows the different types of data used in this search. When weighting the index cards (qualitative mapping) the choice of the number and the width of the classes consist of a main step because it controls directly the quality of the resulting maps. But in the case of the semi-quantitative mapping that is provided in the present study by the fuzzy logic method, the weight of each class depends on the value of the fuzziness which is estimated from the frequency ratio.

Table 1 Data used in the present study

Slope

The slope factor is often used in the development of the landslide susceptibility map because it has a direct effect on the landslide process (Ayalew and Yamagishi 2005; Jordan et al. 2000). Its angle significantly affects the incidence of landslides (Kanungo et al. 2006; Gupta et al. 2008; Dahal et al. 2009). The classification of this parameter is based on previous studies of (Pradhan and Lee 2009; Anbalagan et al. 2015 and Lee et al. 2016).The slope map has six levels: very low/flat (0°–5°), low (5°–10°), moderate (10°–15°), moderately high (15°–20°), high (20°–25°), and very high (> 25°) (Fig. 2).

Fig. 2
figure 2

Input data layers; (A) slope, (B) aspect, (C) distance from the faults, (D) distance from the rivers, (E) lithology, (F) plan curvature, (G) rainfall, (H) land cover, (I) profile curvature

Aspect

According to DeGraff and Romesburg 1980; Marston et al. 1998; Nagarajan et al. 1998; Saha et al. 2002; Pradhan and Lee 2009; Kanungo et al. 2009; Anbalagan et al.2015 and Lee et al. 2016 there is a direct relationship between aspect and landslides because humidity and sunshine depend on the exposure of the land. The aspect map was extracted from DEM ALOS PALSAR, it shows nine classes that are; flat (− 1), north (0°–22.5° and 337.5°–360°), northeast (22.5°–67.5°), east (67.5°–112.5°), southeast (112.5°–157.5°), south (157.5°–202.5°), southwest (202.5°–247.5°), west (247.5°– 292.5°), and northwest (292.5°–337.5°) (Fig. 2).

Rainfall

After heavy rains, the water infiltrates in the soil and increases the degree of saturation which facilitates the triggering of landslides (Pourghasemi et al. 2009). This factor was prepared on the basis of rainfall data for the period 1982–2012 of rainfall stations of Tebourba, Mateur, Joumine Dam, Ghezala Dam, Bazina, Beja, Sejnane delgation and Nefza (rainfall data from the National Institute of Meteorology). In this study, the rainfall map was classified into four classes, < 600, 600–800, 800–1000, and > 1000 mm/year (Fig. 2).

Curvatures

There are two forms of curvature: the profile curvature which is within the direction of the maximum slope and the plan curvature which is perpendicular to the maximum slope direction. In the plan output, a positive curvature indicates that the ground is upwardly convex at this cell. The negative curvature shows that the soil is concave upwardly in this cell. However, in the profile output, it is the opposite, and a value of 0 indicates the ground is flat in both cases (Fig. 2).

Distance from the rivers

The risk of landslides increases with the decreasing of distance from watercourses because they threaten the stability of land by river banks erosion or saturation of materials (Ercanoglu and Gokceoglu 2004). After superimposing the landslides on the hydrographic network, we have noticed that by moving away from watercourses the number of landslides decrease. The distance from river map was prepared by applying the “Euclidean distance” tools under Arcgis 10.2.2. Based on the previous works of (Ilanloo M, 2011 and Wu Y et al. 2016), five levels were determined: very high (< 100 m), high (100–300 m), moderate (300–500 m), low (500–800 m), and very low (> 800 m) (Fig. 2).

Lithology

The mechanic resistance and the permeability of rocks and soils are controled by the structural and lithologic variations (Champati Ray et al. 2007). Lithology is an essential factor in the landslide zonation (Kayastha et al. 2013). This map is achieved in order to georeference and digitalize the geologic maps 1/50000 covering the study area. Our study area is mainly formed mainly by recent alluvial deposits 7351 ha, marl alternations and limestone marly 4256 ha, sandstone and clay 817 ha, limestone with nummulite 1272 ha, gray marl 1319 ha, triasic facies (gypsum, dolomite) 128 ha, and brown limon 204 ha (Fig. 2).

Distance from the faults

By approaching to the fault, the probability of having landslide increases and becomes more and more important (Lovine 2008). The map of distance from the faults was prepared by applying the Euclidean distance tools on the faults that were digitalized from the geological map 1/50000, and nine classes were determined (Fig. 2).

Land cover

However, land cover may accelerate the occurrence or play an important role in the occurrence of landslides (Chen and Wang 2007). After a classification of the Sentinel 2 image with ENVI 4.5 by using the maximum likelihood method, four land cover classes were determined; forest, vegetation and agricultural land, built up area, and bare ground. These last levels cover, respectively, 864 ha, 190 ha, 9399 ha, and 4895 ha (Fig. 2).

Methods

In this study, fuzzy logic technique was used to carry out landslide susceptibility mapping. Unlike classical set theory, where an object does or does not belong to a set, in the theory of fuzzy sets, the fuzzy membership values oscillate between 0 and 1, depending on the degree of membership. The problem is that there is no rule on which fuzzy membership values are assigned. Several researchers are simply based on their subjective judgment, as Bonham-Carter (1994) has shown for mineral exploration, but others are based on statistical analysis (Ercanoglu and Gokceoglu 2002). So the landslide frequency ratio approach was adopted to reduce the subjectivity of this aproach. In the one hand, we identify the parameters controlling landslide. It is vitally to anticipate causal factors to predict landslide occurrences because until now there is no universally defined set of factors to predict the spatial occurrences of landslides. Lee and Talib (2005) trust that future landslides will occur under the same conditions as past and present landslides. In the other hand, we calculate the fuzzy membership for each cateogory of each factor. Then, gamma operator was used to integrate the landslide causative factors in order to delineate the landslide susceptibility zones (Fig. 3). Finally, the ROC curve was used to choose the better gamma value to predict areas of high susceptibility to landslides.

Fig. 3
figure 3

Methodology organogram for landslides susceptibility mapping

Fuzzy membership calculation

Following the hypothesis of Lee and Talib 2005, a relationship can be determined between landslide-related causal factors with landslide occurrences. To quantify this relationship, the frequency ratio can be used. It is the landslide percentage of each land category divided by the area in percent of the same category (Bonham-Carter 1994).

Where

$$ \mathrm{FR}=\frac{\raisebox{1ex}{$ Di$}\!\left/ \!\raisebox{-1ex}{$ Ai$}\right.}{\sum_{i=1}^N Di/{\sum}_{i=1}^N Ai} $$
(1)
FR:

Frequency ratio

Di:

the number of landslide in each category

Ai:

the area of each category in each parameter

N:

the category number of each parameter

The fuzzy membership follows the principle of conditional probability, if the ratio is equal to 1, so there is a good relationship between landslide and factor classes if the ratio is < 1, so it represents a weak relationship. Normalized value of landslide frequency ratio was used to calculate the fuzzy membership function (Pradhan et al. 2010). Table 2 shows the values of frequency ratio and fuzzy memberships in each factor.

Table 2 Frequency ratio and fuzzy membership values of each parameter

Fuzzy modeling

The theory of fuzzy logic was introduced by Zadeh (1965) as a theory for mathematical modeling where can happen in uncertain conditions. This technique is performed by functions such as OR, AND, SUM, product, and gamma (Bonham and Cox 1995). In this study, we will apply the gamma function due to its rapidity operation and high precision.

The fuzzy gamma operator is the algebraic product of fuzzy product and fuzzy sum, to the power of gamma. The function is as follows:

$$ \mu c={\left[\prod \limits_{i=1}^n\mu 1\right]}^{\gamma }\ X\ {\left[\begin{array}{c}\ \\ {}\ 1-\prod \limits_{i=1}^n\left(1-\mu i\right)\end{array}\right]}^{\left(1-\gamma \right)} $$
(2)

where γ is a parameter chosen in the range (0, 1).

Results and discussion

We found the highest landslide frequency ratio in the medium slope (10°–15°), therfore we can deduce that slopes with these values are sufficient to trigger a landslide. The rainfall map shows an average annual rainfall of > 1000 mm with the highest value of frequency ratio (1.16) indicating the highest probability of landslide occurrence in this class. The topographic aspect is equally considered as an important factor in this research. The level of very high frequency ratio 2.45 and 2.68 is found respectively in the north and northwest because they receive the highest rainfall. The map of distance from river indicates that frequency ratio of the range of 0–100 m and 100–300 m is revealed to be high in the buffer drainage case, and it could be attributed to stream bank erosion which leads to landslides. Each geological formation is characterized by a lithology which could run landslide incidence. By analyzing the frequency ratio results of geology layer, we found that the recent alluvial deposits are the most susceptible to landslide in view of their fragility (clays and ancient soils) but the highest frequency ratio value is found in gray marl because it has a value of landslide frequency that is almost twice its area. Within the land use classes, we noticed a high value of landslide frequency in vegetation and agricultural land. The highest landslide frequency ratio was observed in the negative values of plan curvature. But in the case of profile curvature, the highest value of frequency ratio was observed in the positive values. We found 8.51% of the landslides in class 0 (curvate plan) and 12.76% in class 0 of the curvate profile because these areas have a certain inclination (the slope) which can trigger some landslides. Therfore it can be concluded that concave slopes always occupy the maximum of landslides. About distance from the faults, the highest frequency ratio was found in the last range with a value of 2.36. This is explained by the stability of this zone. Four landslides susceptibility maps were developed applying four different gamma values (0.75, 0.85, 0.90, and 0.92). Until now, landslide susceptibility values are represented by a continuous data. To better visualize the results, we applied a discretization. Data discretization remains uncertain in landslide susceptibility mapping (Kritikos and Davies 2011). Therefore, most researchers use their expert opinion to delineate classes (Ayalew et al. 2004; Kritikos and Davies 2011). In our study, a discretization based on Jenks natural breaks classification method was performed, because it reduces the variance within classes and maximizes the variance between classes. So, landslide susceptibility maps were divided into five classes (no susceptibility, low susceptibility, medium susceptibility, high susceptibility, and very high susceptibility). Landslide susceptibility maps show that the spatial distribution of susceptibility classes varies from one map to another. Table 3 shows the percentage occupied by the susceptibility areas for different gamma values.

Table 3 Percentage of susceptibility classes

For example, in the case of γ (0.75), 33.02% of the area was occupied by unsusceptible area, and 27.66% was located in low susceptible area, 22.41% in medium susceptible area, 13.22% in high susceptible area, and 3.66% in very high susceptible area. But in γ (0.90), 31.54% of the area was occupied by unsusceptible area, and 14.32% was located in low susceptible area, 24.9% in medium susceptible area, 19.81% in high susceptible area, and 9.41% in very high susceptible area.

To choose the map closest to reality, the receiver operating curves characteristic (ROC) has been applied.

Choice and validation of landslide susceptibility map

The receiver operating characteristic is often represented as a curve that gives the sensitivity (landslides that are actually detected) as a function of the specificity (stable areas that are incorrectly detected). Sensitivity represents the probability to have a slipped cell in the correct class, and it is shown on the y-axis in a ROC curve. But, 1-specificity is the false positive rate, and it is taken along the x-axis of the curve. The reliability of this method is tested by the area under curve (AUC) (Williams et al. 1999). A perfect test has an AUC equal to 1, while an AUC of 0.5 represents a weak test.

By examining the results of the ROC curve, we find that gamma 0.90 has the highest value of AUC (0.89). The case of gamma 0.75 has the lowest value of AUC (0.865) (see Table 4, Fig. 5).

Table 4 AUC of different gamma values
Table 5 Landslides distribution

Finally, the gamma value of 0.90 was chosen. Figure 4c shows the distribution of different susceptibility zones after discretization. The northwest-facing slopes are more susceptible to landslides than those facing to south, and they contain 98 landslides because they are the most humid. The high susceptibility area is found in the proximity of drainages and bare ground. The highest susceptible zones are found near the high slopes exposed to the north, cliffs, and cut slopes adjoining the roads. Low susceptibility zones are observed in the agricultural area, the forests, and the low slopes exposed essentially to the south and the southeast. Without validation, the prediction model will have no scientific significance (Chung and Fabbri 2003; Bui et al. 2011). The validation of landslide susceptibility model is done after verification of the predictive capacities of the landslide susceptibility map. There are two types of error found in landslide susceptibility map. Landslide scan occurs in stable areas and the can be absent in unstable areas (Soeters and van Westen 1996). In our case study, the prediction accuracy of landslide susceptibility map was evaluated on the basis of an expert evaluation method using known landslide locations (Pradhan 2010). A good quality map must predict the most of the landslides in the high or very high susceptibility class, in contrast to the low or negligible susceptibility class. In addition to that, the different classes must be composed by agglomerated pixels and avoid the “isolated pixel” effect (Table5).

Fig. 4
figure 4

Landslide susceptibility map for different γ values, (A) 0.75, (B) 0.85, (C) 0.90, and (D) 0.92

Fig. 5
figure 5

The ROC graph of different gamma cases

By examining the spatial distribution of landslides in the susceptibility map of gamma 0.9, we distinguish that the non-susceptible zone and the low susceptible zone contain together eight landslides. However, high and very high susceptible zone contains 111 landslides.

Conclusion

To minimize the subjectivity of fuzzy logic method, the frequency ratio was used. The using of fuzzy gamma operator to map landslide susceptibility has achieved acceptable result. Gamma 0.90 has the highest AUC with 0.89. The model proves that gamma 0.90 shows the better prediction of landslides susceptibility mapping. By examining the spatial distribution of landslides, we can say that landslides susceptibility mapping using the frequency ratio and the fuzzy logic with gamma value 0.90 is efficient to predict the landslides susceptibility in our study area with a reliability of 89%.