Introduction

Extending from Afghanistan to Myanmar, the Himalaya region is inherently fragile and susceptible to landslides as a result of its rugged mountain topography, soft soil cover, high-intensity monsoon precipitation and weak nature of the geological structures (Upreti and Dhital 1996; Chalise 2001). Landslide problems in this region are further aggravated by anthropogenic factors such as deforestation, unsound agricultural practices, haphazard settlement and unplanned developmental works (Upreti and Dhital 1996; Kayastha et al. 2010). Every year, especially during the monsoon season, a lot of damage to life and property is caused by the landslides in this region. In Nepal alone, the average annual number of deaths caused by landslides and floods from 1983 to 2009 was about 280 and in 1993 was as high as 1,336 (DWIDP 2010). In order to control or mitigate problems caused by mass movements, systematic studies of landslides including inventory mapping, susceptibility mapping, hazard mapping and risk assessment have to be carried out (Kayastha et al. 2012). Within the last few decades, numerous attempts at landslide susceptibility and hazard and risk mapping have been made throughout the world mainly due to increasing awareness of the socio-economic impacts as well as increasing pressure of urbanisation on environment (Aleotti and Chowdhury 1999). Overviews of the different landslide susceptibility and hazard mapping are given by Varnes (1984), Carrara et al. (1995), Soeters and van Westen (1996), Aleotti and Chowdhury (1999), Guzzetti et al. (1999) and Wang et al. (2005). The most common approaches for assessing landslide susceptibility and hazard can be grouped into five categories (Carrara et al. 1995; Soeters and van Westen 1996; Aleotti and Chowdhury 1999; Guzzetti et al. 1999; Wang et al. 2005), namely: (1) direct geomorphological mapping; (2) analysis of landslide inventories; (3) heuristic methods; (4) statistical methods including fuzzy logic and artificial neural networks; and (5) process based conceptual modelling.

Different researchers have used statistical models to obtain the landslide susceptibility/hazard map in different parts of the world. Statistical methodologies can be broadly divided into bivariate and multivariate. In the bivariate methodologies, the weight value for each class of different factors responsible for landslide is obtained on the basis of relationship between past landslides and each class of different causative factors (van Westen 1994). These weight values can be obtained from different statistical methodologies such as general instability index (Carrara et al. 1978), frequency index (Parise and Jibson 2000), surface percentage index (Uromeihy and Mahdavifar 2000), information value method (Yin and Yan 1988), statistical index method (van Westen 1997), weighting factor (Çevik and Topal 2003), certainty factor (Chung and Fabbri 1993), conditional analysis (Carrara et al. 1995), weights of evidence (van Westen 1993), frequency ratio (Lee and Min 2001), probability analysis (Tien Bui et al. 2013) and landslide susceptibility analysis (Süzen and Doyuran 2004).

In this study, the bivariate frequency ratio methodology is applied for obtaining the weight values. The main advantage of using bivariate frequency ratio methodology is that the weight values measure, directly or in a weighted form, the relative or absolute abundance of landslide area or number in different classes. Hence, this methodology is used by different researchers in different parts of world such as Lee and Min (2001) in Yongin, Korea; Lee (2004) in Janghung area, Korea; Lee et al. (2004) and Choi et al. (2012) in Boun, Korea; Lee and Dan (2005) in Lai Chau Province, Vietnam; Lee and Talib (2005) and Lee and Lee (2006) in Gangneung, Korea; Lee and Pradhan (2006) in Penang, Malaysia; Lee and Sambath (2006) in Damrei Romel area, Cambodia; Lee and Pradhan (2007) in Selangor, Malaysia; Vijith and Madhu (2007, 2008) in Kerala, India; Akgun et al. (2008) in Findikli, Turkey; Jadda et al. (2009) in Marzan Abad, Iran; Oh et al. (2009) in Pechabun area, Thailand; Yilmaz (2009) in Tokat, Turkey; Yilmaz and Keskin (2009) in Sebinkarahisar, Turkey; Ehret et al. (2010) in the Xiangxi catchment, Three Gorges Reservoir area, China; Oh et al. (2010) in Pemalang, Indonesia; Poudyal et al. (2010) in Panchthar, Nepal; Pradhan (2010) in the Cameron catchment, Malaysia; Pradhan and Lee (2010a) in Klang valley, Malaysia; Pradhan and Lee (2010b) in Penang Island, Malaysia; Pradhan and Youssef (2010) in Cameron, Malaysia; Yilmaz (2010) in Koyulhisar, Turkey; Akinci et al. (2011) at Samsun, Turkey; Intarawichi and Dasananda (2011) in the Mae Chaem watershed, Thailand; Jadda et al. (2011) in the Central Alborz, Iran; Mezughi et al. (2011) in Gerik-Jeli, Malaysia; Yalcin et al. (2011) in Trabzon, Korea; Akgun (2012) in İzmir, Turkey; Lepore et al. (2012) in Puerto Rico; Reis et al. (2012) in Rize Province, Turkey; and Sujatha et al. (2013) in Tevankarai Ar sub-watershed, Kodaikkanal, India.

This paper summarises the outcomes of a landslide susceptibility mapping study in the Garuwa sub-basin, East Nepal (Fig. 1a) using the bivariate statistical frequency ratio method and also deals with the effect analysis of different causative factors responsible for landslide occurrences.

Fig. 1
figure 1

a Location of the study area and b digital elevation model (DEM) of the study area with rivers, hydro-meteorological stations and landslide distributions

The objectives of the present study are to (i) prepare a landslide inventory maps and maps of the causative factors of landslides; (ii) compute the weight values for each factor based on the frequency ratio; (iii) analyse the effect of each of these factors responsible for landslide occurrences; (iv) determine the most successful combination of different causative factors that generates the best landslide susceptibility map; and (v) check the statistical significance of the obtained landslide susceptibility map using a chi-square test.

Study area

The Garuwa sub-basin (Fig. 1a) lies in the Ilam and Jhapa Districts of the Mechi Zone in eastern Nepal. This sub-basin lies between latitudes 26° 39′ 30″ to 26° 50′ 30″ N and longitudes 87° 44′ 00″ to 87° 58′ 00″ E. It covers an area about 228 km2 and has a more or less triangular shape. The altitude varies from 120 m near Domukha in the south to 1,520 m near Soktim in the northeast of the sub-basin (Fig. 1b). The Kankai River is the main watercourse, and the Mai Khola, the Garuwa Khola and the Lodiya Khola are its major tributaries (Khola means a stream in the Nepali language) (Fig. 1b).

The Garuwa sub-basin lies mainly in Siwaliks Range. However, a small part of the sub-basin in the southern part lies in the Terai region, whereas a small part in the northeast part lies in the Mahabharat Range. The Main Boundary Thrust (MBT) separates the Mahabharat Range from the Siwaliks Range.

There are two climatic zones, i.e. a subtropical zone (150–1,200 m) and a warm temperate humid zone (above 1,200 m), in this sub-basin. The average temperature is 20 to 35 °C in the subtropical zone whereas 10 to 30 °C in the warm temperate humid zone. The annual rainfall varies from 2,250 to 2,650 mm (Table 1). A detail description of this sub-basin can be found in Kayastha (2012).

Table 1 Average annual rainfall (mm) recorded at the hydro-meteorological stations from 1985 to 2007 (source: Department of Hydrology and Meteorology, Government of Nepal)

Materials

Different thematic data on causative factors are needed to prepare a landslide susceptibility map. These data are collected from different sources. For instance, digital elevation contour lines, land cover maps and aerial photos are collected from the Department of Survey, Government of Nepal. A geological map (1:250,000 scale) was prepared by Pradhan et al. (2006) obtained from the Department of Mines and Geology, Government of Nepal. Daily rainfall data from 1985 to 2007, recorded at eight hydro-meteorological stations situated inside or just outside of the study area (Fig. 1b), are collected from the Department of Hydrology and Meteorology, Government of Nepal. Data on landslides are collected by field reconnaissance in April 2010.

These data sources are used to prepare thematic digital maps using GIS software. All maps are raster based with a cell size of 20 m × 20 m. Detail descriptions for each data layer are described below.

Landslide inventory map

For assessing landslide susceptibility, the spatial distribution of landslides in the field is essential (Deoja et al. 1991). Landslide inventories are the simplest form of landslide mapping (Guzzetti et al. 1999). In order to prepare the landslide inventory map, the researcher should collect historical information of individual landslide events from different sources such as previously published reports, satellite imageries, aerial photographs and interview with local inhabitants. Then, the distribution of landslides in the study area should be verified with field surveys. The study revealed 136 landslides in the Garuwa sub-basin (Fig. 1b), covering an area of about 1.75 km2 or about 0.77 % of the study area. During field visit on 2010 April, it was found that most of these slides were already stabilised and consisted mainly of shallow soil or rock slides, plane or wedge failures and rotational slides.

Topographic factors

From the digital elevation contours with intervals of 20 m, a digital elevation model (DEM) of the study area (Fig. 1b) was prepared. From this DEM, topographical thematic data layers such as slope aspect, slope angle, slope shape (curvature) and relative relief were prepared.

Slope aspect

The direction of maximum slope of the terrain surface gives the slope aspect. In this study, slope aspect was divided into nine classes: (i) north (N), (ii) northeast (NE), (iii) east (E), (iv) southeast (SE), (v) south (S), (vi) southwest (SW), (vii) west (W), (viii) northwest (NW) and (ix) flat. Almost one third of the study area lies in the flat aspect (Table 2).

Table 2 Spatial relationships between each class of causative factors and observed landslides and resulting frequency ratio

Slope angle

Slope angle is one of the most important parameter which influences the stability of slope (Terzaghi and Peck 1967). In this study, a map of slope angle was generated from the DEM and classified into five different classes: (i) flat to gentle slope (<15°), (ii) moderate slope (15–25°), (iii) fairly moderate slope (25–35°), (iv) steep slope (35–45°) and (v) very steep slope (>45°). Previous studies show that slope gradients between 25° and 45° are prone to failure in the Nepal Himalayas (Deoja et al. 1991; Kayastha et al. 2010). However, landslides also occur on gentler as well as on moderate slopes in this sub-basin (Table 2). In this study area, almost half of the study area lies in the flat to gentle slope (<15°).

Slope shape (curvature)

The study area was classified according to slope curvature values: (i) convex, (ii) concave and (iii) straight (planar). Generally, concave slopes are considered as potentially unstable as they concentrate water at the lowest point and contribute to develop adverse hydrostatic pressure, whereas convex slopes are more stable as they disperse the runoff more equally down the slope (Stocking 1972). Contrary to this, straight slopes are found more stable than the concave or convex slopes in this study area (Table 2).

Relative relief

The maximum height dispersion of a terrain normalised by its length or area is known as relative relief (Oguchi 1997). In this study, relative relief was computed as the difference between maximum and minimum altitudes (m) per hectare of land. Relative relief was divided into the four classes: (i) <25 m/ha, (ii) 25–50 m/ha, (iii) 50–100 m/ha and (iv) >100 m/ha.

Geological factors

Geology is one of the most influential factors, which plays an important role in slope stability. As stated previously, Pradhan et al. (2006) prepared the geological map of the study area. The digital geological map (Fig. 2) was prepared on the basis of the geological map of Pradhan et al. (2006) and field observations.

Fig. 2
figure 2

Geological map of the study area, modified after Pradhan et al. (2006)

The lithological units of the Garuwa sub-basin belong to the Neogene-Quaternary Group and Lesser Himalayan Group (Pradhan et al. 2006). The Neogene-Quaternary Group consists of river beds, recent alluvium, Lower Siwaliks, Middle Siwaliks and Upper Siwaliks. Quaternary alluvial deposits are found in the intermontain valleys of the Mai River, Garuwa River and Kankai River. These deposits consist of alluvial fans, terraces and bars made up of gravels, sands and silts. The Lower Siwaliks are fine-grained, hard, grey sandstones interbedded with purple and green shales. The Middle Siwaliks consist of fine- to medium-grained arkosic pebbly sandstones with rare grey to dark grey clays and occasionally with silty sandstones and conglomerates. The Upper Siwaliks consist of coarse boulder conglomerates with irregular beds of sandstones and thin intercalations of yellow, brown, and grey sandy clays. The Lesser Himalayan group consists of quartzites, grey-green phyllites, grey metasandstones, grey garnetiferous schists with dark grey-green amphibolites and banded gneisses.

Active thrusts increase landslide susceptibility because rocks near a fault are weaker, due to intense shearing (Leir et al. 2004). Active thrusts such as the MBT and the Mai Khola Thrust (MKT) are also found in the study area (Fig. 2). The MBT separates the Lesser Himalayan rocks from the Siwaliks like in other parts of the Nepal Himalaya. The MKT is covered at many places by alluvial deposits of the Mai Khola. A digital map of distance from active faults was prepared using the Euclidian distance method and classified into two classes: (i) <1 km and (ii) >1 km.

Land use

Land use is one of most important factor for slope instability. Based on a land cover map prepared by the Department of Survey, Government of Nepal and field study, eight land use classes were considered, as shown in Fig. 3, i.e. (i) cultivation and built-up areas, (ii) tea plantation, (iii) forest, (iv) grass land, (v) bush, (vi) sandy area, (vii) barren land and (viii) water body. Almost 32 % of the study area is used for cultivation or tea plantation with few built-up areas, and 7 % of the study area is covered by the sandy area whereas 59 % of the study area is covered by the forest.

Fig. 3
figure 3

Land cover map of the study area

Hydrological and climatic factors

Runoff also plays an important role in slope instability. In this sub-basin, landslides occur frequently on stream banks (Fig. 1b). Hence, in order to see the effect of runoff on landsliding, a digital thematic distance from drainage map was prepared using the Euclidian distance method. Then, the study area was classified into four classes: (i) <25 m, (ii) 25–50 m, (iii) 50–100 m and (iv) >100 m. Almost 37 % of the study area lies in <25 m.

Landslide processes are closely related with rainfall in Nepal (Dhital et al. 1993; Gabet et al. 2004; Dahal and Hasegawa 2008). Annual total rainfall from 1985 to 2007 observed in eight hydro-meteorological stations (Table 1 and Fig. 1b) situated just outside of the study area was used to prepare a rainfall map, classified in three classes: (i) 2,250–2,400 mm/year, (ii) 2,400–2,550 mm/year and (iii) > 2,550 mm/year. Almost three fifth of the study area annually receives 2,400–2,550-mm rainfall.

Methodology

Frequency ratio

A landslide susceptibility map is prepared by combining causative factors with quantitatively defined weight values. In the present study, the frequency ratio method (Lee and Min 2001) is used in which a weight value for a parameter class is obtained as follows:

$$ {W}_{ij}=\frac{f_{ij}^{\ast }}{{\overline{f}}_{ij}^{\ast }}=\frac{A_{ij}^{\ast }}{A^{\ast }}\times \frac{A-{A}^{\ast }}{A_{ij}-{A}_{ij}^{\ast }} $$
(1)

where W ij is the weight value or frequency ratio of class i of parameter j, f ij  = A ij /A is the frequency of observed landslides in class i of parameter j, \( {\overline{f}}_{ij}^{\ast }=\left({A}_{ij}-{A}_{ij}^{\ast}\right)/\left(A-{A}^{\ast}\right) \) is the frequency of non-observed landslides in class i of parameter j, A ij is the area of landslides in a class i of parameter j, A ij is the area of class i of parameter j, A * is the total area of landslides in the study area and A is the total area of the study area.

Relationship between landslide and causative factors

If the frequency ratio is greater than 1, the relationship between landslides and the factors is high and, if the ratio is less than 1, the relationship between landslide and the factors is low.

In case of the relationship between landslide occurrence and slope aspect, landslides are more abundant on southwest-facing and southeast-facing slopes, whilst the frequency of landslides is lowest on north, northeast-facing, south and northwest-facing slopes. The main reason behind this may be that the monsoon storms in Nepal enter from the east and slowly move towards the west producing a lot of precipitation in the southern slopes than northern slopes.

Previous studies show that in Nepal, flat to gentle slopes are expected to be safe from slope instability, whereas steep to very steep natural slopes are susceptible to landsliding (Kayastha et al. 2010, 2012). The results in Table 2 show that for slope angles less than 25°, the frequency ratio is less than 1, which indicates a low probability of landslide occurrence, whilst for slope angles more than 25°, the frequency ratio is greater than 1, which indicates a high probability of landslide occurrence.

For slope curvature (shape), the frequency ratio is larger than 1 for both convex and concave slopes and less than 1 for straight slopes. This may be due to retention of more water by convex or concave slopes after heavy rainfall, which increases soil water pressures and reduces shear resistance (Lee and Min 2001).

In case of relative relief, the frequency ratio is larger than 1 for 50 - 100 m/ha and >100 m/ha classes and less than 1 for <25 m/ha and 25–50 m/ha classes. This result reveals indication of relative relief as the potential energy for mass wasting and soil erosion (Ghimire 2001).

In the case of landslide occurrence and geology, the frequency ratio is higher for the banded gneiss of the Lesser Himalaya and Lower Siwaliks, whereas the frequency ratio is lower in the river beds, recent alluvium, quartzites, phylites, schists, Upper Siwaliks and Middle Siwaliks. In this sub-basin, rocks of the Lower Siwaliks and banded gneiss are highly weathered so that these rocks are susceptible to landsliding. On the other hand, quartzites, phylites, schists, rocks of the Middle Siwaliks and Upper Siwaliks are moderately strong to hard in nature so that these rocks are more resistant to slope failure. The presence of a fault increases landslide susceptibility because rocks near a fault are weaker due to intense shearing (Leir et al. 2004). This is proved by the results shown in Table 2 as the frequency ratio for the distance from faults less than 1 km is more than 1.

The effect of land use is also seen in the study area. For certain land uses such as cultivation and built-up area, grassland, tea plantation, sandy area and bush, the frequency ratio is less than 1 indicating the lower probability of landslide occurrences, whereas the frequency ratio is more than 1 for barren land and forest indicating that these land uses are highly susceptible for landslide occurrences.

In the present study area, the drainage has a clear influence on landslides because the closer the drainage is the greater is the frequency ratio. At a distance of less than 25 m and 25–50 m classes, the frequency ratio is more than 1, whereas at a distance greater than 50 m, the frequency ratio is less than 1. This result clearly shows that there is a lower probability of landslides further away from rivers.

Rainfall also has higher influences on the initiation of landslides. For the area with annual rainfall more than 2,550 mm, the frequency ratio is higher than other areas. However, the area with annual rainfall 2,400–2,550 mm has almost 33 % of the observed landslides, but due to high area covered by this class, the frequency ratio becomes less than 1 indicating less probability of landslide occurrences in this class.

Landslide susceptibility mapping

Finally, the integration of the various causative factors and classes in a single landslide susceptibility index (LSI) is given by a procedure based on the weighted linear sum

$$ \mathrm{L}\mathrm{S}\mathrm{I}={\displaystyle {\sum}_{j=1}^n{W}_{ij}}. $$
(2)

where n is the number of parameters.

The effect of the causative factors can be seen in the landslide susceptibility mapping by exclusion of causative factors during the summation stage using Eq. 2. The studies of effect analysis show how a susceptibility index map changes when the input causative factors are changed (Lee and Talib 2005; Jadda et al. 2009). The causative factors that have the most influences on the calculated landslide susceptibility index map can be identified using effect analysis. In the present study, four cases were chosen, as shown in Table 3 and Fig. 4, to see the effect of causative factors in the landslide susceptibility. Then, the success rate curve (Chung and Fabbri 1999, Kayastha et al. 2012) is obtained for each case by plotting the cumulative percentage of observed landslide occurrence against the areal cumulative percentage in decreasing LSI values as shown in Fig. 5. The area under a curve is used to assess the success accuracy qualitatively. The area under curve for four different cases is shown in Table 3. For instance, in the case of taking topographic factors and geological factors, the area under curve is 0.7721 which means that the success rate accuracy is 77.21 %, and in the case of taking all causative factors, the area under curve is 0.8119 indicating that the success rate accuracy is 81.19 %. Table 3 shows that amongst four different cases, the first three cases, such as taking (i) topographic factors and geological factors, (ii) topographic factors, geological factors and land use and (iii) topographic factors, geological factors, land use and distance from drainage, produce almost identical results for the success rates. The best result for the success rate accuracy is produced by the fourth case which considers all the causative factors. Hence, the LSI map (Fig. 4d), which is derived from considering all the causative factors, is chosen to produce the landslide susceptibility zonation map (Fig. 6).

Table 3 Success rate accuracy and area under curve (AUC) for different cases
Fig. 4
figure 4

Landslide susceptibility index (LSI) maps of the study area derived from a topographical and geological factors; b topographical factors, geological factors and land use; c topographical factors, geological factors, land use and drainage; and d topographical factors, geological factors, land use, drainage and annual rainfall

Fig. 5
figure 5

Graph showing success rate curves, i.e. cumulative percentage of observed landslide occurrences versus cumulative areal percentage of decreasing LSI values

Fig. 6
figure 6

Landslide susceptibility zonation map of the study area derived by using all causative factors

This map is categorised into low, moderate, high and very high landslide susceptible zones such that 40 % of the study area has low LSI values, 30 % of the study area has moderate LSI values, 20 % has high LSI values and the remaining 10 % of the study area has the highest LSI values (Bijukchhen et al. 2013; Kayastha et al. 2012). It shows that the very high susceptible zone occupies 44.06 % of the total landslide occurrences, whereas high, moderate and low susceptible zones cover 34.29, 14.74 and 6.91 % of the total landslides, respectively (Fig. 6 and Table 4).

Table 4 Distribution of susceptibility zones and observed landslides and resulting landslide density

For validation of a landslide susceptibility map, the computation of landslide density of each susceptibility zone is important as the landslide density assesses the overall quality of the landslide susceptibility map (Sarkar and Kanungo 2004). The results presented in Table 4 showed that the landslide density for the very high susceptible zone is 0.0339. Furthermore, landslide density values gradually declined from very high to low susceptible zones as shown in Table 4. Hence, the landslide susceptibility map reveals the existing field instability conditions.

A chi-square test also can be performed in order to test the statistical significance and effectiveness of the landslide susceptibility map (Sarkar and Kanungo 2004; Kayastha et al. 2012). For the null hypothesis, it was assumed that the presence of landslide cells in different susceptibility classes were purely due to chance. The observed number of cells with and without landslides for each of the four susceptibility classes was determined from the map (upper part of Table 5), and the expected number of cells for the same was estimated from the observed values using expected probabilities (middle part of Table 5). The standard chi-square value with 3 degrees of freedom at the 0.001 significance level is 16.3. The total discrepancy, i.e. chi-square value computed for the above data, is 7,125.79 (lower part of Table 5). Since the calculated chi-square value is greater than the standard chi-square value, it can be concluded that the landslide susceptibility map is considered as statistically significant.

Table 5 Results for the chi-square test, indicating observed number of cells with or without landslide in each susceptibility zone, the corresponding expected number of cells in case landslide occurrence would be random and the resulting chi-square values

Conclusions

The selection of contribution factors for modelling landslide susceptibility is an inhibit task. In the present study, nine different contributing factors (slope aspect, slope angle, slope curvature, relative relief, geology, distance from faults, land cover, distance from drainage and annual rainfall) are chosen to model the landslide susceptibility using frequency ratio method. Four landslide susceptibility index maps are generated by selection of different causative factors. The results show that the landslide susceptibility index map produced by using all nine causative factors reveals the best success rate accuracy, i.e. 81.19 %, which is later used for producing the final landslide susceptibility zonation map.

The analysis shows that landslides are more abundant on southwest-facing and southeast-facing slopes, whilst the frequency of landslides is lowest on north, northeast-facing, south and northwest-facing slopes. For slope angles more than 25°, there is a high probability of landslide occurrence in this study area. The probability of landslide occurrence is higher for the banded gneiss of the Lesser Himalaya and Lower Siwaliks than the river beds, recent alluvium, quartzites, phylites, schists, Upper Siwaliks and Middle Siwaliks. In this sub-basin, rocks of the Lower Siwaliks and banded gneiss are highly weathered so that these rocks are susceptible to landsliding, whereas quartzites, phylites, schists, rocks of the Middle Siwaliks and Upper Siwaliks are moderately strong to hard in nature so that these rocks are more resistant to slope failure. The land uses such as barren land and forest are highly susceptible for landslide occurrences. At a distance of less than 25 and 25–50 m from drainage, there is high susceptibility for landslide occurrences. Rainfall also has higher influences on the initiation of landslides.

The landslide susceptibility zonation map reveals that 10 % of the study area lies on the very high susceptible zone which predicts 44.06 % of the past landslides. Likewise, 20 % of the study area is situated on high susceptible zone and predicts 34.29 % of the past landslides. In addition, results from the landslide density analysis and chi-square test prove that landslide susceptibility map is statistically significant. Hence, this landslide susceptibility map is trustworthy for future land use planning and disaster management planning.