Introduction

Earthquake-triggered landslides are very common throughout the world. In particular, the last large earthquake events, e.g. in Pakistan in 2005 and in China in 2008, have demonstrated the importance of seismic shaking for the initiation of slope failures. For instance, the 2008 Wenchuan earthquake (magnitude 8.0) shook the area located in the mountainous region of northwest of Sichuan Province. It brought overwhelming destruction to eight provinces and cities, including Sichuan, Gansu, Shanxi, Chongqing, Yunnan, Shaanxi, Guizhou, and Hubei. The area is characterized by rugged topography, steep high mountains, deep valleys, and complicated geologic structures. The Wenchuan earthquake induced more than 15,000 landslides, debris flows, and rock avalanches in an area of about 50,000 km2, and it is estimated that approximately 20,000 deaths were directly related to these landslides (Huang and Li 2008; Yin et al. 2009). From the landslides and debris flows within the Wenchuan earthquake area, related risks can be observed at up to 12,700 locations, threatening the lives of almost one million people (Huang and Li 2008; Cui et al. 2009; Yin et al. 2009). The earthquake also destabilized many slopes and produced large amounts of material for future landslide events. Consequently, it is estimated that debris flows and landslides will occur at higher intensity and with higher frequency than normal; the occurrence of debris flows triggered by the heavy rainfall of 24 September 2008 indicates that the Wenchuan earthquake areas are now particularly susceptible to debris flows which are likely to occur frequently within the next 5–10 years (Tang et al. 2009). It has been suggested that the impact of an earthquake can increase the activity of landslides and debris flows for a period of 10–30 years (Cui et al. 2008; Xie et al. 2009). This can be expected to impact significantly local land use, planning and construction, and will restrict the future development of the area.

It will usually be appropriate to carry out landslide susceptibility zoning as a first stage in the development of landslide hazard or risk zoning for planning purposes (Fell et al. 2008). The frequent occurrence of landslides, collapses, and breakdowns in the Longnan region brought about a major effect on land planning and post earthquake construction, and became an obstacle to the sustainable development of the local economy. At the moment, the local and regional authorities at Longnan rely primarily on landslide inventory maps to conduct spatial planning and to organize emergency actions. It is, therefore, essential to provide more reliable landslide susceptibility maps to facilitate landslide hazard and risk management in the region. Although it is expected that landsliding will occur more frequently in the most susceptible areas, in the susceptibility analysis the time frame is not taken into account explicitly (Fell et al. 2008). Furthermore, the susceptibility maps are on a regional scale and cannot be used for site-specific work or as a substitute for professional advice from qualified geologists or geotechnical engineers.

In order to determine the most fragile landslide areas in the future under the influence of a similar earthquake, it is important to determine the areas prone to future landslide, through landslide-susceptibility maps. Both qualitative and quantitative methods have been used in the processes of creating landslide-susceptibility maps (Soeters and Van Westen 1996; Aleotti and Chowdhury 1999; Guzzetti et al. 1999; Xu et al. 2012a, b). Qualitative methods were widely used during the late 1970s by engineering geologists and geomorphologists. Quantitative methods, however, have become popular in the past few decades largely because of their numerical expressions of the relationship between controlling factors and landslides, assisted by the advances in computer and geographic information system (GIS) technology (Dai et al. 2001; Dai and Lee 2002; Chung et al. 2002; Van Westen and Getahun 2003; Van Westen et al. 2008; Bell and Glade 2004; Guzzetti et al. 2005; Bai et al. 2011; Dai et al. 2011; Xu et al. 2012b). The logistic regression (LR) method has been applied extensively to landslide susceptibility mapping (e.g. Atkinson and Massari 1998; Guzzetti et al. 1999; Bai et al. 2010; Yalcin et al. 2011). The algorithm of logistic regression applies maximum likelihood estimation after transforming the dependent variable into a logical variable, representing whether the natural log of odds of the dependent variable occurs or not.

The purpose of this study was to analyze the relation between Wenchuan earthquake-triggered landslides and pre-disposing factors and to carry out landslide susceptibility assessments by logistic regression in Longnan.

The study area

The study area, Longnan, is located in the southeast of Gansu Province in China (Fig. 1). In total, it covers an area of approximately 27,000 km². The region is mountainous and has a large relief varying between 550 m and more than 4,000 m above sea level. The administrative unit includes one district and eight counties organized in 193 towns and townships, inhabited by 2.7 million people. Longnan is located in the transition zone between the Tibetan Plateau and the Loess Plateau, of which a large area is covered by extensive loess deposits. The Tibetan Plateau uplift has resulted in a high geomorphic activity. Two metamorphic rock units (the Bailongjian and the Bikou) have been identified. The Bailongjiang metamorphics are exposed in the northwestern quadrant of this region and consist of conglomerates, slates, limestones, and phyllites. The metamorphic rocks belonging to Bikou unit are located in the south, and consist of igneous rocks, phyllites, and slates (Derbyshire et al. 2000). The geological structures are dominated by the southern Qinling fold system. During the geologically recent Yenshan and Himalayan crustal movements, uplift and complex faulting of the region took place, resulting in widespread valley incision and undercutting of the slopes. About 90 % of the area north of the Bailongjiang River is mountainous. Here, the Qinling Mountains are segmented by two branches from the Bailongjiang River, the Dieshan in the north and the Minshan in the south. Such severe incision has resulted in high relative relief and a landscape marked by many large landslides. The combination of frequent earthquakes, high relative relief, steep slopes, strongly seasonal monsoonal climates, and widely distributed thick loess results in both large single failures and widespread landslide occurrence.

Fig. 1
figure 1

Location of the study area, Longnan

The Longnan region is one of the four most active landslide and debris flow regions in China, characterized by a very high frequency of occurrence (Scheidegger and Ai 1987). Figure 1 presents the peak ground acceleration (PGA) in the study area. According to the China Geological Survey Bureau of Statistics (2008), geological disasters killed 567 people in the 40 years before the Wenchuan earthquake. The geological hazards induced by the Wenchuan earthquake killed further 147 people, injured 468, and caused a direct economic loss of 2.855 billion Yuan (0.41 billion $).

Data

Several data sources were utilized for the investigations of landslide pre-conditioning environmental factors and the subsequent analysis of landslide susceptibility. All data sources are listed in Table 1.

Table 1 Data properties

Following the main shock of the Wenchuan earthquake, the Ministry of Land and Resources of China immediately organized scientists to conduct earthquake engineering and geological reconnaissance to provide scientific decision-making information and emergency countermeasures for the immediate post-seismic emergencies and also for the mid-term transitional resettlement sites and reconstruction. In order to gain reliable information on the seismogenic geohazards, the investigation was based on two approaches: field investigation and remote-sensing image interpretation (Huang and Li 2008; Tang et al. 2009).

The typical landslide morphology recognizable on the available maps includes visible scarps, hummocky topography, and landslide dams. Different landslide types were summarized in terms of the following classes: rockfalls (including rock falls, debris falls and rock avalanches); landslides (debris slides and rotational slides, slump); and debris flows, according to Varnes’s (1984) and Cruden and Varnes’s (1996) classification. From the post-seismic emergencies report of the General Monitoring Station of Geological Environment of Gansu province (Longnan Bureau of Land and Resources 2008) and the remote sensing emergency survey in the Wenchuan earthquake area (Qin et al. 2009), a total of 2,238 earthquake landslides were identified in the study area (Fig. 2).

Fig. 2
figure 2

Elevation and spatial distribution of landslides in the study area

Lithology plays a key role in the distribution of landslides and is correlated with the properties of the slope-forming materials, such as rock-mass strength and structure. Based on the geological maps at 1:200,000 scale and geotechnical criteria, a simplified rock group map was produced from an original geological map considering the composition and structure of bedrock. Eleven rock groups were defined based on their composition and physical–mechanical characteristics (Chen et al. 2006; Table 2). Each rock group has similar geological properties and conditions. An overview on the 11 rock groups is presented in Fig. 3.

Table 2 Distribution of rock groups in the study area
Fig. 3
figure 3

Distribution of rock groups and faults in the study area

The pixel size of the digital elevation model (DEM) is 30 × 30 m2 (Fig. 2). To store the information of these parameter maps in a uniform thematic database, the size of each pixel for all the products was set to 30 × 30 m, regardless of the scale of the original source. Indeed, this has to be carefully considered by following interpretation. The elevation, slope angle, aspect, plan curvature, and profile curvature are derived from the DEM using standard ArcGIS®, while the distance from slope failures to rivers was digitized from the 1:50,000 topographic map and the faults was digitized from the 1:200,000 geological maps.

Land- use types (Fig. 4) were mapped using preprocessed and enhanced Landsat TM 5 images. The analyzed images were then digitally processed to modify further the boundaries by supervised classification with ERDAS (Earth Resource Data Analysis System) software. The accuracy of the land-use interpretation was validated in the field.

Fig. 4
figure 4

Land-use map of the study area

Within this study, the PGA was selected as the main earthquake landslide-triggering factor (Fig. 5). PGA data were obtained from the earthquake strong-motion record of the National Strong Motion Observation Network System (NSMONS) (Lu et al. 2010). The NSMONS consists of permanent free-field stations, special observation arrays, mobile observatories, and a network management system. During the Wenchuan earthquake, over 1,400 components of acceleration records were obtained from 460 permanent free-field stations. In addition, three arrays for topographical effect and structural response of the main shock were observed in the network system.

Fig. 5
figure 5

Peak ground acceleration contour map of the study area

Methodology

The methodology applied within this study includes a stepwise procedure. In a first step, the influence of landslide pre-conditioning environmental factors on landslide occurrence is analyzed. In a second step, landslide susceptibility is mapped by a logistic regression methodology, using the Wenchuan earthquake-triggered landslide.

Analyses of landslide pre-conditioning factors

The event-based earthquake landslide inventory was utilized within the study to investigate the influence of landslide pre-conditioning environmental factors. Two statistic variables were used: landslide density (LD) and landslide proportion (LP). The LD refers to the number of landslide points per km², while LP refers to the number of landslide points per each class of a pre-conditioning factor. The distribution of LD and LP is analyzed for rock groups, slope angle, aspect, land use classes, and distance to faults and rivers.

Mapping of landslide susceptibility

The basic concept of landslide susceptibility (Brab 1984; Chung and Fabbri 1999) includes the spatial distribution of factors related to the instability processes to determine zones of landslide-prone areas without any temporal implication. A landslide susceptibility map ranks the slope stability of an area in categories that range from stable to unstable and thereby show where landslides may occur (Chacon et al. 2006). Susceptibility may also include a description of the velocity and intensity of existing or potential landsliding (Fell et al. 2008). The identification and mapping of a suitable set of parameters related to landslide occurrence requires prior knowledge of their main causes (Guzzetti et al. 1999). The methodology applied in this study is based on the principle “the present and past are keys to the future”. Although it can be questioned whether this principle is still valid in its generality, any study on this is beyond the presented research. The fundamental principle of this approach is to use the characteristics of existing landslides to evaluate possible areas for future landslides. In order to achieve this goal, databases including landslide characteristics and landslide causative factor were created and subsequently analyzed. Two major assumptions are made in a probabilistic approach of mapping susceptibility to landslides: (1) landslides will occur under similar environmental conditions to those in the past; and (2) predictions of which areas are likely to experience landslide in the future can be true only if the factors used for the forecasts will not change significantly in the future.

Logistic regression approach

The logistic regression estimates the probability of a certain event occurrence (Atkinson and Massari 1998). In the present situation, the dependent variable is a binary variable representing the presence or absence of landslides. The logistic model can be expressed in its simplest form as follows:

$$P = \frac{1}{{1 + {\text{e}}^{ - z} }} $$
(1)

where P is the probability of an event (landslide) occurrence, which varies from 0 to 1 on an s-shaped curve; z is defined as the following equation (linear logistic model), with its value varies from −∞ to +∞:

$$z = \hat{\beta }_{o} + \hat{\beta }_{1} x_{1} + \hat{\beta }_{2} x_{2} + \ldots + \hat{\beta }_{n} x_{n} $$
(2)

where \(\hat{\beta }_{0}\) is the intercept of the model, n is the number of independent variables, \(\hat{\beta }_{i}\) (i = 1, 2, 3,…, n) is the slope coefficient of the model, and x i (i = 1, 2, 3,…, n) are the independent variables. This linear model is a logistic regression representing the presence or absence of landslides (at the present conditions) on the independent variables (pre-failure conditions).

Landslide susceptibility mapping using logistic regression

The landslide susceptibility calculation applied in this study is based on logistic regression includes four main steps: (1) splitting the dataset into a sampling and validation set; (2) multi-colinearity diagnosis; (3) application of logistic regression; and (4) validation and evaluation of the model results.

In logistic regression, it is generally recommended to use an equal number of landslide and non-landslide points (Süzen and Doyuran 2004; Yesilnacar and Topal 2005; Nefeslioglu et al. 2008). Moreover, approximately 20 % of the dataset should be used to validate the results (Van Den Eeckhaut et al. 2006).

In a first step, for the Wenchuan earthquake landslide inventory 2,238 random sample locations were chosen from the landslide-free area to represent the absence of landslide. The 2,238 landslide pixels and 2,238 landslide-free pixels were merged and a new column of a binary variable indicating the presence and absence of landslides was added. The sample of 4,476 pixels was subdivided into calibration and validation data subsets. The calibration dataset contained 80 % of the pixels and the validation set the remaining 20 % of the pixels. Continuous data like topographical parameter data, PGA, and the distance to vector layer were handled in logistic regression analysis in their original format, in order not to alter the state and information present in the parameter maps. The rock groups, land use and aspect were categorized into a binary format (a dummy variable).

In a second step, the tolerance factor (TOL) and variance inflation factor (VIF) indexes of the causative factors for the Wenchuan earthquake-triggered landslide were calculated. Indexe values of VIF were found to be less than 2 and higher than 0.4 for the TOL 4 and were subsequently as input to the logistic analysis.

In the third step, all the causative factors multi-temporal landslide and the Wenchuan earthquake-triggered landslide were separately imported into the logistic regression algorithm with statistical packages for SPSS 15.0 for Windows to calculate the correlation of landslides to each factor. A forward stepwise logistic regression was carried out to incorporate only those predictor variables with a significant contribution to the presence of landslides. In this study, the significance level of score χ 2 for entering the model was 0.15. The significance level of Wald χ 2 for a variable to stay was 0.05.

In a fourth step, the landslide occurrence probability was computed using the logistic regression coefficients. Many methods described in literature divide the histogram of the susceptibility map into different categories (Guzzetti et al. 1999; Dai and Lee 2002; Ohlmacher and Davis 2003; Süzen and Doyuran, 2004; Ayalew et al. 2005). In this study, standard ArcGIS 9.3 tools were used to generate equal interval class breaks. The susceptibility map was divided into five classes: very low, low, medium, high, and very high.

Accuracy assessment

The correct classification percentage calculation was performed for the validation dataset to evaluate the performance of this generalization of the models constructed during the training stages. For this, the area under receiver-operating characteristic (ROC) curve (AUC) was applied, which constitutes one of the most commonly used accuracy statistics for the prediction models in natural hazard assessments (Begueria 2006).

According to Can et al. 2005, in the susceptibility value spatial distribution on a useful landslide susceptibility map, the observed landslide areas should coincide with those areas having high susceptibility values (Rule I). In addition, high susceptibility values should cover only small areas (Rule II). Since, in this study, only one pixel of each landslide was used to produce a landslide susceptibility map, the percentage of landslide number was used on each susceptibility level.

Results

Influence of pre-conditioning factors and landslides

The changes of landslide susceptibility due to the 2008 Wenchuan earthquake were investigated to understand the different impact of the multi-temporal landslide and the Wenchuan earthquake landslide. The authors were especially concerned with (1) the variations between landslide spatial distribution and pre-conditioning factors using the landslide proportion (LP) and (2) the variations of the landslide density (LD).

Lithology

The statistical analysis of landslide distribution with lithological data showed that the unit of phyllite, slate, and thin limestone was the most affected, with the unit of thick layer of limestone and slate being the second (Fig. 6a). Almost half of all earthquake-triggered landslides occurred in areas with phyllites, slates, and thin limestone, which however experienced with a relatively low landslide density of 0.095 landslides/km². The highest density (0.17 landslides/km²) was found in areas with rock sand and gravel. The absence of landslides in units of magmatic rocks is mainly due to the small number of such cells, which together cover less than 0.0087 % of the total surface area.

Fig. 6
figure 6

Histogram of landslides and pre-disposing factors (LD and the LP value is enlarged 100 times)

Distance to faults

The comparison of landslide distribution with the distance to geological faults shows a general trend of a higher number of landslides with a shorter distance to faults. 55.45 % of landslide occurred within the distance of 2,100 m from faults (Fig. 6b). The greatest density (0.112 landslides/km²) was found within the distance of 300 m from faults.

Topography

The majority (over 84.09 %) of all earthquake-triggered landslides occurred at elevations below 1,900 m, and of these more than 60 % were triggered at elevations from 1,300 to 1,900 m. Only 3.04 % of all earthquake-triggered landslides took place at elevations of over 2,500 m. Both the LP and the LD decrease with the elevations increase (Fig. 6c).

The relationship between landslides and the corresponding slope categories has been examined as shown (Fig. 6d). Landslides increase as the slope increases, until the maximum is reached in the 20°–30° category, and then generally decrease as the slope increases. Approximately 30 % of all earthquake-triggered landslides are located on slopes with angles between 20° and 30°. The highest landslide density was found for areas with slope angles of over 50°. This could be related to the amplification effect of the ridges and the transitional and turning position of gentle topographic slope to steep slope positions was the most seismic ground striking (Huang and Li 2008).

The influence of aspect on landslides was also analyzed. Both the LD and the LP have a slightly higher value in S direction than in any others (Fig. 6e), which may indicate that slope aspect has a minor influence on the distribution of earthquake landslides.

Distance to rivers

The statistical analysis showed that a strong relation could be observed between landslide distribution and the distance to rivers. Over 67.92 % of earthquake-triggered landslides occurred within a distance of 1,000 m from rivers. Both the LP and the LD decrease with a longer distance from rivers (Fig. 6f).

Land-use

The statistical analysis showed that the vast majority of earthquake landslides occurred in areas classified as forest land (42.05 %) and other land used for agricultural production or irrigation or Saline land (39.32 %), while all other classes experienced less than 20 % of all landslides (Fig. 6g).

Mapping of landslide susceptibility

A forward stepwise logistic regression was carried out. The variables aspect, elevation, slope, land-use, rock groups, PGA, and distance to faults were selected due to their statistical significance. After the landslide occurrence probability was computed using the logistic regression coefficients, the re-classified susceptibility map created by the described methodology is presented in Fig. 7. Table 3 shows the distribution of the five susceptibility zones.

Fig. 7
figure 7

Re-classification susceptibility map of logistic regression analysis

Table 3 The distribution of four susceptibility zones in the study area

Accuracy assessment

The predicted accuracies of the validation dataset were 72.3 % for landslides and 68.9 % for non-landslides, with an average value of 70.6 %. Again, a high AUC value of 0.777 was reached.

As in this study only one pixel of each landslide was used to produce a landslide-susceptibility map, the percentage of landslide numbers was used without using observed landslide areas on each susceptibility level. The percentage of the area covered by the very low and low susceptibility zones was found to be relatively large, with a value of 43.11 %. Still, the very low and low susceptibility area included about 15 % of all the landslides. The percentage of the area computed with the highest susceptibility and of the landslide numbers was 14.25 and 40.8 %, respectively.

Conclusions

The high landslide susceptibility is related to the soft and erodible lithologic formations and the high tectonic activity. In combination with other factors, this causes geological disasters with a very high magnitude and frequency of landslides and debris flows. The Wenchuan earthquake mobilized large amounts of material and weakened the geological structure in Longnan, making it more susceptible for future slope failures. It is a challenge for the many tens of millions of people who live in this region. In the past 10 years, several large landslide disasters have occurred. For example, the Gansu mudslide took place on August 8th 2010 within the Longnan region. It was caused by heavy rainfall of 77.3 mm in 1 h and killed more than 1,287 people (Yu et al. 2010). In addition, the increase of urbanization, the building of new infrastructure such as new highways and high-speed railway lines (Chen et al. 2006), and the reconstruction following the Wenchuan earthquake increased the negative consequences of landslide occurrences, and also accelerated the impact of landslides on society. Therefore, landslides in the Longnan of China have received considerable attention after Wenchuan earthquake; 55.45 % of all earthquake-triggered landslides occurred in areas with phyllites or slates and occurred within the distance of 2,100 m from faults; 67.92 % of them occurred within a distance of 1,000 m from rivers. More than 60 % were triggered at elevations from 1,300–1,900 m and approximately 30 % of all earthquake-triggered landslides were located on slopes with angles between 20° and 30°. The slope aspect, however, has a minor influence on the distribution of earthquake landslides. The majority of earthquake landslides occurred in areas classified as forest land (42.05 %).

In order to predict areas most likely to experience slope failures in the future, landslide susceptibility maps have to be created. Nevertheless, the exact timing of future landslides cannot be estimated accurately with the current methodologies, given the complex nature of the respective processes. A susceptibility map shows where landslides may occur. The landslide susceptibility map ranks the slope stability of the study area in categories that range from stable to unstable. There are many GIS-based qualitative and quantitative techniques available to analyze the relationship between landslides and environmental factors. In this study, a logistic regression model was presented within the GIS framework for landslide susceptibility calculation. The aspect, elevation, slope, land use, rock groups, PGA, and distance to fault were the most significant factors to estimate Wenchuan earthquake-triggered landslide susceptibility. The final susceptibility map shows areas that are highly prone to earthquake-triggered landslides in the future, assuming that the epicenter is roughly in the same region. A landslide susceptibility map is fundamental in quantifying risk due to earthquake-triggered landslides in the Longnan region. The calculated landslide susceptibility map could be used for spatial and emergency planning by Longnan authorities.