1 Introduction

A landslide susceptibility map depicts the areas that are potentially prone to landslides by analyzing the principal factors that contribute to them. Proper identification of landslide susceptibility zones, however, depends on knowledge of slope movements and their controlling factors, for example lithological and structural variations such as differences in the strength and permeability of rocks and soil, the presence and pattern of fractures and joints, and slope modification through roads, tea terraces, etc. A landslide susceptibility map that considers these factors is essential for reducing the risk of this geomorphic hazard. The methodologies for assessing landslide susceptibility fall broadly into three categories: qualitative, semi-quantitative and quantitative. Quantitative estimation of weightage values involves several sophisticated methods such as the analytic hierarchy process [1], bivariate [2], multivariate [3], logistic regression [4], fuzzy logic [5] and artificial neural network [6] methods. The Analytic Hierarchy Process (AHP) of Saaty [7] allows decision makers to participate directly in determining the final outcome. AHP is based on three principles: decomposition, comparative judgment and synthesis of priorities. It involves building a hierarchy of decision elements and then comparing the possible pairs to assign a weight to each parameter. It also provides a consistency ratio to check the consistency of the judgments. The procedure has been widely applied in fields such as site selection, suitability analysis and landslide susceptibility [8]. Hence, this study emphasizes estimating proper weightage values of the different landslide-causing factors using AHP.

Darjeeling Himalaya is highly susceptible to landslides. Every year, a variety of causes such as heavy rainfall, reduction in natural vegetation cover, urban development and soil saturation trigger landslides of varied magnitudes on the steep slopes [9]. For this reason, an accurate landslide susceptibility map is essential to identify such hazard-prone areas.

The aim of this paper is to identify and delineate landslide susceptible zones in the Gish River basin by considering 13 parameters. The AHP method is used to determine the weightage value of the parameters. The final output map is prepared by integrating the layers on the basis of the weightage value of each indicator. An attempt is also made to validate the final output against a landslide inventory map.

2 Study area

The Gish River, a tributary of the Tista River, drains a basin of 264.94 km2 in the Darjeeling Himalaya with a large range of elevation (2355 m at the source and 115 m at the confluence) (Fig. 1). Ramthi Khola, Lethi Nadi, Nimbong Khola, Pokhribong Khola and Reyon Khola are among its important tributaries. The average annual rainfall in the study area is 3094.4 mm, and 80% of it falls during the monsoon season (July to October), when most landslides take place. Darjeeling Himalaya is one of the most vulnerable belts of the Himalayan range and recorded more than 20,000 landslides in 1 day [10]. The upper part of the basin is composed of Darjeeling gneiss (Fig. 2a). Loamy soil mainly dominates here (Fig. 2b). The middle part is mainly dominated by Daling phyllite and schist, Lower Gondwana, biotite Daling phyllite and Older Alluvium.

Fig. 1

Location of the study area a Location of West Bengal in India, b Location of Gish River Basin in northern part of West Bengal, c Gish River Basin

Fig. 2

Geology and soil map a Geology, b Soil

3 Materials and methodology

3.1 Parameter selection, scaling and weighting of the indicators

In this study, 13 parameters are considered and classified into five categories: triggering factors (rainfall and seismicity); lithological causal factors (geology, soil, lineament and gravity anomaly); surface causal factors (slope, drainage density and relative relief); anthropogenic causal factors (road, agriculture and settlement); and a protective factor (natural vegetation). The methods used to prepare the data layers are summarized in Table 1.

Table 1 Procedure to prepare raster layer of different parameters

To bring all the data layers onto a unidirectional 10-point scale, a semi-quantitative method is adopted [11]. For this purpose the spatial data layers are classified into 10 equal classes in ArcGIS (9.3). A higher rating is assumed to indicate a greater potential to influence landslide susceptibility; the reverse order could equally be used. The scaling process and the logic behind it are shown in Table 2.

Table 2 Scaling process and logic behind scaling

The Analytic Hierarchy Process [1] is used for weighting the parameters. The selected parameters are ranked for the comparison matrix on the basis of previous literature [12] and field experience. For example, slope is given maximum priority because most of the existing landslide sites are located on the steeper slopes. The consistency ratio in this case is 0.02. For some qualitative data layers, e.g. geology and soil, the frequency ratio (the number of existing landslides relative to the concerned zone) is calculated before the 10-point rating is assigned to the individual classes.
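The weighting step can be illustrated with a minimal sketch. It assumes a hypothetical 3 × 3 pairwise comparison matrix (the judgments below are illustrative and are not the matrix used in this study), approximates the principal eigenvector by power iteration, and uses Saaty's commonly tabulated random index values for matrices of order 1 to 10. The function name `ahp_weights` is likewise an assumption made for the example.

```python
# Illustrative AHP weighting. The pairwise judgments are hypothetical,
# not the comparison matrix used in the study.

# Saaty's random consistency index (RI) for matrix orders 1..10.
RANDOM_INDEX = {1: 0.0, 2: 0.0, 3: 0.58, 4: 0.90, 5: 1.12,
                6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45, 10: 1.49}


def ahp_weights(matrix, iterations=100):
    """Return (weights, consistency_ratio) for a reciprocal pairwise matrix."""
    n = len(matrix)
    weights = [1.0 / n] * n
    # Power iteration: repeatedly multiply by the matrix and renormalize.
    for _ in range(iterations):
        product = [sum(matrix[i][j] * weights[j] for j in range(n))
                   for i in range(n)]
        total = sum(product)
        weights = [value / total for value in product]
    if n < 3:
        # Matrices of order 1 or 2 are always consistent.
        return weights, 0.0
    # Estimate the principal eigenvalue as the mean of (A w)_i / w_i.
    aw = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(aw[i] / weights[i] for i in range(n)) / n
    consistency_index = (lambda_max - n) / (n - 1)
    return weights, consistency_index / RANDOM_INDEX[n]


# Hypothetical judgments: slope vs. geology vs. drainage density.
pairwise = [
    [1.0, 2.0, 3.0],
    [1 / 2, 1.0, 2.0],
    [1 / 3, 1 / 2, 1.0],
]
weights, cr = ahp_weights(pairwise)
print([round(w, 3) for w in weights], round(cr, 3))
```

A consistency ratio below 0.10 is conventionally treated as acceptable in AHP; the value of 0.02 reported above falls within that limit.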

After the weightage of the layers is derived, the layers are integrated using the weighted linear combination (WLC) of Eastman [13] (Eq. 1). This procedure is implemented in ArcGIS (9.3).

The WLC equation is as follows:

$$WLC = \sum\limits_{j = 1}^{n} a_{ij} w_{j}$$
(1)

where \(a_{ij}\) = ith rank of the jth attribute and \(w_{j}\) = weightage of the jth attribute.
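As a rough illustration of Eq. (1), the sketch below combines two small rating grids cell by cell. The layer names, the 2 × 2 ratings and the weights are hypothetical values chosen for the example; the study itself performs this step on 13 raster layers in ArcGIS.

```python
# Weighted linear combination (Eq. 1) on small grids of 10-point ratings.
# Layer names, ratings and weights are hypothetical illustration values.

def weighted_linear_combination(layers, weights):
    """Sum rating * weight over all layers for every cell of the grid."""
    names = list(weights)
    rows = len(layers[names[0]])
    cols = len(layers[names[0]][0])
    return [
        [sum(layers[name][r][c] * weights[name] for name in names)
         for c in range(cols)]
        for r in range(rows)
    ]


layers = {
    "slope": [[10, 5], [1, 8]],
    "geology": [[7, 5], [2, 10]],
}
weights = {"slope": 0.6, "geology": 0.4}

wlc = weighted_linear_combination(layers, weights)
print(wlc)
```

Because the weights sum to 1 and every rating lies on the 10-point scale, each output cell also stays within that scale.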

3.2 Preparation of different factors cluster

Besides the main landslide susceptibility model, five separate WLC models of homogeneous landslide-affecting factor clusters [triggering factor model (TFM), anthropogenic factor model (AFM), lithological factor model (LFM), surface causal factor model (SCFM) and protective factor model (PFM)] are prepared to determine the role of each factor cluster in landslide susceptibility. The parameters are weighted on the basis of a correlation matrix following Pal [13]. Eq. 2 is used to obtain the standardized weighted value of each parameter.

$$W_{ij} = \frac{{W_{i} }}{{W_{i\_total} }} \times 1$$
(2)

where \(W_{ij}\) = weighted value; \(W_{i}\) = summation of the standardized correlation values of the ith row; \(W_{i\_total}\) = sum of all \(W_{i}\). For the 1st row, for instance, \(W_{1} = W_{11} + W_{12} + \cdots + W_{1n}\), where \(W_{11}\), \(W_{12}\) and \(W_{1n}\) are the standardized correlation values of the 1st, 2nd and last indicators of the 1st row.
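Eq. 2 can be sketched as a row-sum normalization. The example assumes that the standardized correlation values are already available as a square matrix; how they are standardized is not specified here, so the input values and the function name `correlation_weights` are hypothetical.

```python
# Row-sum normalization of a standardized correlation matrix (Eq. 2).
# The matrix values below are hypothetical illustration values.

def correlation_weights(std_corr):
    """Return one weight per row: row sum divided by the sum of all row sums."""
    row_sums = [sum(row) for row in std_corr]   # W_i for each row
    total = sum(row_sums)                       # W_i_total
    return [row_sum / total for row_sum in row_sums]


std_corr = [
    [1.0, 0.5, 0.2],
    [0.5, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]
cluster_weights = correlation_weights(std_corr)
print([round(w, 2) for w in cluster_weights])
```

The resulting weights sum to 1, so they can be used directly as the \(w_{j}\) terms of a WLC model for a factor cluster.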

3.3 Validation of the landslide susceptible model

To determine the accuracy of the final output, the model is validated against a landslide inventory. Elmahdy et al. [14] noted that this validation approach is tied directly to observed incidents. The imprints of past landslides are obtained from secondary sources such as previous literature [15], toposheets (78 B/9 & 78 A/12) of the Survey of India and Google Earth, and from field observations. A total of 45 locations are identified and, after their latitude and longitude are assigned, plotted on the landslide susceptibility map (Fig. 3b). The validation is performed in three steps. The first step establishes the relationship between the different landslide susceptibility zones (LSZ) and the frequency density of landslide occurrence. The second step presents the areal density (areal extent of landslides/area of the concerned zone) under the different LSZ. In the last step, a Receiver Operating Characteristic (ROC) curve is prepared to determine the prediction rate of the AHP-based model (Fig. 5b).

Fig. 3

Landslide susceptible zones a Continuous susceptibility grades b Classified susceptibility zones

4 Results and analysis

4.1 Landslide susceptibility mapping

The landslide susceptibility model is prepared by integrating the parameters. Equation 3 gives the statistical expression of the model, and Fig. 3a, b show the corresponding spatial model in continuous and classified form.

$$\begin{aligned} {\text{Landslide Susceptibility Model}} = {} & {\text{slope}} \times 0.195 + {\text{geology}} \times 0.188 + {\text{drainage density}} \times 0.165 + {\text{lineament}} \times 0.126 \\ & + {\text{relative relief}} \times 0.085 + {\text{soil}} \times 0.054 + {\text{rainfall}} \times 0.039 + {\text{road}} \times 0.028 + {\text{agricultural land}} \times 0.024 \\ & + {\text{settlement}} \times 0.021 + {\text{vegetation}} \times 0.019 + {\text{seismic frequency}} \times 0.031 + {\text{gravity anomaly}} \times 0.024 \end{aligned}$$
(3)

To delineate the landslide susceptibility zones (LSZ) with greater precision, the composite raster layer of landslide susceptibility (Fig. 3a) is subdivided into five equal-interval zones (Fig. 3b), viz. very high (WLC = 6.18–7.22), high (5.14–6.18), moderate (4.10–5.14), low (3.06–4.10) and very low (3.04–3.08). The very high landslide susceptible zone is scattered across the upper part of the basin. Nearly 19.92 km2 (7.52%) of the total basin area falls under the very high LSZ. In this zone, the slope ranges from 42.13° to 10.44° (average 26.285°), and steeper slopes trigger landslide incidents [16]. By eroding the slopes or saturating the regolith, streams may adversely affect slope stability [17]. Drainage density ranges from 5.48 to 3.14 km/km2 (average 4.285 km/km2); it promotes soil saturation in the contiguous area and thereby helps to trigger landslides. Higher relative relief indicates a greater intensity of erosion and hence a greater chance of landslides [10]. In this zone, relative relief is high and ranges from 873 to 301 m (average 587 m). The high landslide susceptible zone covers nearly 90 km2 (33.97%) of the basin (Table 4). This zone is also located in the upper part of the basin, with some patches in the middle part. Here too, drainage density (average 3.9 km/km2), slope (average 27.075°) and relative relief (average 422 m) are high. An area of 52.25 km2 (19.72%) lies under the moderate landslide susceptible zone. In this zone the range of slope is wider (from 50.13° to 4.28°), while drainage density (average 1.965 km/km2) and relative relief (average 279.5 m) are considerably lower than in the high and very high susceptible zones. The lower catchment is characterized mainly by low landslide susceptibility, as Mandal and Maiti [18] also reported in their work on Darjeeling Himalaya. Loamy skeletal soil covers nearly 99% of the very high LSZ. Lineament density (5.66 km/km2) is also high in this region, and the average value of gravity anomaly is highest here. Hence, the higher values of the lithological causal factors make this region the most vulnerable to landslides.
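The equal-interval subdivision can be sketched as follows. The value range used in the example (2.0 to 7.0) is hypothetical and chosen only to give round class limits; it is not the WLC range of the study, and the function name is an assumption.

```python
# Equal-interval classification of a WLC score into susceptibility zones.
# The range 2.0-7.0 is a hypothetical example, not the study's WLC range.

LABELS = ["very low", "low", "moderate", "high", "very high"]


def equal_interval_class(value, vmin, vmax, labels=LABELS):
    """Assign a score to one of len(labels) classes of equal width."""
    width = (vmax - vmin) / len(labels)
    index = int((value - vmin) / width)
    # Clamp so that the maximum value falls in the top class.
    index = max(0, min(index, len(labels) - 1))
    return labels[index]


for score in (2.0, 4.5, 6.5, 7.0):
    print(score, equal_interval_class(score, 2.0, 7.0))
```

Each class has the same width, (maximum − minimum)/5, so the class limits follow directly from the extremes of the composite raster.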

Nearly 88% of the high LSZ is situated on the granite formation. Lineament density (4.14 km/km2) is also quite high in this zone. The low and very low zones, on the other hand, are situated mainly on the alluvium formation (99.06%), which has greater cohesiveness and water-absorbing capacity. Lineament density (3.63 km/km2) is also low in this region. All these conditions account for the less frequent and lower-magnitude landslides there. Among the anthropogenic factors, settlement and agricultural land do not play a major role in determining landslide susceptibility, because settlements are sparse (nearly 12%) in the very high and high landslide susceptible zones, whereas settlement and agricultural land cover a much greater share of the area (67.79%) in the very low landslide susceptible zone. Road density is high (0.988 km/km2) in the very high and high LSZ but low (0.479 km/km2) in the low and very low LSZ.

4.2 Responsible factors cluster

Figure 4a–e presents the factor cluster models, which depict landslide susceptibility in terms of the individual factor groups. These models help to show which factor cluster is responsible for the historical landslides, based on their physical location. Frequency analysis of the historical landslides (LS) in the factor cluster models (only within the high and very high susceptible zones) shows that, out of the total 45 sites, 25, 01, 24, 18 and 11 LS sites are located in the lithological, anthropogenic, surface, triggering and protective models, respectively. Only one site falls within the very high and high zones of the AFM; most LS sites are the cumulative consequence of lithological and surface factors, followed by triggering factors.

Fig. 4

Responsible factors cluster model a Anthropogenic factors cluster b Lithological factors cluster c Surface factors cluster d Triggering factors cluster e Protective factor cluster

The strongest relationship with landslide susceptibility is observed for the lithological cluster (Table 3), so the lithological factors (geology, soil, lineament and gravity anomaly) have the greater influence on vulnerability. In addition, the correlation value of 0.89 for the surface causal factors (drainage density, relative relief and slope) also indicates a significant impact on landslide occurrence. In this basin, anthropogenic factors play the smallest role, as their correlation value is only 0.00452. Among the anthropogenic factors, only roads exert some noticeable influence. Most landslides in the study area occur during the monsoon season. Although rainfall is high throughout the region, the highest landslide susceptibility zone is not located in the area of highest rainfall, which is why the relationship with the triggering factors is slightly low. Lastly, the protective factor model shows a positive relation, which suggests that the high and very high landslide susceptible zones have less vegetation cover.

Table 3 Relationships between landslide susceptibility layer and other factor clusters

5 Discussion

In the last 100 years, 82 earthquakes with magnitudes from 5.0 to 8.0 Mw occurred within a 500 km buffer of the basin boundary (USGS Earthquake Hazards Program). The landslide of 25 April 2015 is considered a cumulative result of the Nepal earthquake (magnitude 7.8 Mw) and excessive torrential rain (about 200 mm). Earthquakes exert significant stress on crystalline rocks, and the formation and deformation of joints, fractures and lineaments in rocks are regular phenomena. Excessive infiltration of water through these joints during the monsoon season causes rock fragments to slide. More than 90% of landslide incidents happen during the monsoon season because of excess rainfall. In this area the average monsoon rainfall is about 600–800 mm [19]. Intense rainfall within a few hours often triggers landslides. Of the total rainfall, 10–50% falls within 1–10 days [20]. It is quite difficult to calculate the threshold rainfall that triggers landslides because the other conditioning factors act in an integrated manner. Most previous works in the Himalayan mountains identified earthquakes and rainfall as the prime factors of landslide incidents [19,20,21]. Unconsolidated loose material is prone to movement, and the speed of movement depends on the degree of slope and the presence of water. This does not mean, however, that earthquake frequency is always high in the high-rainfall zone. The region where high rainfall occurs is composed of fine to coarse loamy soil, which has greater elasticity and is not highly sensitive to earthquakes; the slope of this region is also very low because the river has already entered the plain there. The high landslide susceptible zone is characterized by steeper slopes and crystalline rocks and is therefore sensitive to slope instability. According to local people, the degeneration of forest species may be one of the causes of accelerated landslides in this zone. Heavy pressure from human activities such as heavy traffic and slope modification for constructing roads and houses is also crucial in increasing landslide incidents [22]. Roads located right at the edge of steeper slopes are susceptible to landslides. Every year, these vital roads are blocked by landslide debris about 10–15 times. In surrounding regions (such as the Lish River basin), built-up area has also been identified as the dominant factor [23]. The direct suffering of people from such incidents every monsoon highlights this as a major factor.

The landslide susceptibility model is validated with the help of historical landslide locations. Table 4 shows that 14, 21, 08, 02 and 00 landslide sites are associated with the very high, high, moderate, low and very low zones, respectively (Fig. 3b). A simple frequency count per zone may not support any particular interpretation because the areas of the susceptibility zones are not identical. For this reason, the frequency density of landslide occurrence is calculated. The highest landslide frequency density (0.70/km2) is associated with the very high landslide susceptible zone. The next highest frequency density is observed in the high landslide susceptible zone. The moderate landslide susceptible zone is associated with a moderate frequency density. The low landslide susceptible zone shows a low frequency density (0.04/km2). From this it can be stated that the prepared spatial model can be accepted.
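The frequency density used in this first validation step is simply the number of landslide sites divided by the zone area. The sketch below uses the counts and zone areas quoted in the text for the three zones whose areas are stated; the areas of the low and very low zones are not given in the text, so those zones are omitted. The variable names are assumptions made for the example.

```python
# Frequency density = landslide count / zone area (per km2).
# Counts and areas are those quoted in the text; the low and very low
# zones are omitted because their areas are not stated there.

zones = {
    # zone name: (number of landslide sites, zone area in km2)
    "very high": (14, 19.92),
    "high": (21, 90.0),
    "moderate": (8, 52.25),
}

frequency_density = {
    name: count / area for name, (count, area) in zones.items()
}

for name, density in frequency_density.items():
    print(f"{name}: {density:.2f} landslides per km2")
```

For the very high zone this reproduces the reported density of 0.70/km2, and the densities decrease from the very high to the moderate zone, which is the pattern the validation relies on.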

Table 4 Density of Occurrence of Landslide in Different LSZs

After the relationship between landslide frequency and LSZ is determined, a second step (Fig. 5a) examines the areal coverage of landslides under the different LSZs in order to validate the model more effectively.

The landslides in the study area are grouped into three size categories, viz. >0.15, 0.15–0.05 and <0.05 km2. Table 5 presents the areal coverage of the different categories of landslides under the different LSZ. It shows that larger landslides are associated with the high and very high LSZ, which have greater cumulative areal coverage. The moderate and low LSZ, on the other hand, are associated mainly with minor landslides and have relatively less area under landslides. In terms of the total area under landslides, the highest areal density (0.068) is associated with the very high landslide susceptible zone and the lowest areal density (0.00) with the very low landslide susceptible zone. The second validation technique also shows that the model is valid.

Fig. 5

Location of historical landslides and receiver operating characteristics (ROC) curve a Landslide inventory map, b ROC curve

Table 5 Areal density of landslide under different LSZs

A Receiver Operating Characteristic (ROC) curve (Fig. 5b) is prepared to validate the landslide susceptibility model quantitatively. To prepare the ROC curve, a total of 2156 landslide and 2156 non-landslide points are considered. The ROC curve is generated with SPSS statistical software. Rasyid et al. [24] suggest that an area under the curve >0.80 (80%) indicates a good accuracy rate. In this study, the area under the ROC curve is 0.84 (84.00%), which indicates a good prediction rate for the model. Thus, all the validation procedures suggest that the landslide susceptibility model prepared using AHP determines the landslide susceptible zones accurately.
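The area under the ROC curve can be read as the probability that a randomly chosen landslide point has a higher susceptibility score than a randomly chosen non-landslide point. The sketch below computes it by direct pairwise comparison. The study used SPSS, so this is an equivalent hand calculation rather than the authors' procedure, and the eight scores are hypothetical.

```python
# Area under the ROC curve by pairwise comparison (Mann-Whitney form).
# The susceptibility scores below are hypothetical illustration values.

def roc_auc(landslide_scores, non_landslide_scores):
    """Fraction of (landslide, non-landslide) pairs ranked correctly.

    Ties between a landslide and a non-landslide score count as half.
    """
    wins = 0.0
    for pos in landslide_scores:
        for neg in non_landslide_scores:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(landslide_scores) * len(non_landslide_scores))


landslide_scores = [6.5, 5.9, 5.2, 4.4]
non_landslide_scores = [3.1, 4.4, 3.8, 5.5]

auc = roc_auc(landslide_scores, non_landslide_scores)
print(round(auc, 3))
```

With 2156 points in each group the same function performs about 4.6 million comparisons, which remains practical; a rank-based formulation would be preferable for much larger samples.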

6 Conclusion

This study considers thirteen parameters that are decisive for landslides. The parameters are further subdivided for common scaling. All of them are closely related to landslides and can depict the vulnerability of a place to landslide occurrence. On this basis, all the parameters are integrated to prepare the landslide susceptibility zones. After validation, the methodology proves useful for predicting landslide susceptible areas.

The final outcome of the study demonstrates the following facts: (1) The very high landslide susceptibility zone is situated mainly in the upper part of the basin, whereas the very low landslide susceptible zone is situated mainly in the lower part. (2) The very high LSZ is associated with high runoff and steeper rock surfaces, as drainage density and relative relief are high in this zone. (3) Among the causal factors, lithological conditions exert the greater influence on landslide susceptibility. Slope modification for the construction of roads and the expansion of settlements appears to be a less important factor. The impact of slope modification is far-reaching, however: landslides alongside roads are not its sole expression, and its effects may be felt away from the roadside. Scientific routing and lightweight buildings are essential for partially avoiding landslide incidents. The present trend of heavy vehicles, high traffic density, multi-storied buildings and expanding clustered towns is not conducive to reducing landslide frequency.