Introduction

As declared in the 4th World Landslide Forum, the 2017 Ljubljana Declaration on landslide risk reduction and the Kyoto 2020 Commitment will shape the future works related to landslide studies throughout the world in the next years. These efforts particularly aim to strengthen networking with governments for landslide risk reduction (Sassa 2017), similar to the previous scientific attempts such as Aleotti and Chowdhury (1999), Fell et al. (2008a, 2008b) and Corominas et al. (2014). All these studies are excellent guides for any landslide-related study considering almost every aspect of the idealized procedures related to these works. However, landslides and their undesirable effects have still been a problematic issue throughout the world despite such important efforts. This situation could be explained by the fact that insufficient attention is still paid by the decision makers, politicians and local administrations as well as the people. It could be concluded that one of the most important aspects of the 4th Landslide Forum is emphasizing the importance of responsibilities of the governments and decision makers on the landslide mitigation efforts, including people living in the hazardous regions.

Landslide maps of any type are required for a safer urbanization and environment, land use and regional development plans. Of these, landslide-susceptibility and hazard studies were more performed by the researchers in the landslide literature. However, landslide risk was not a frequent topic in the landslide literature (Bednarik et al. 2012). Nonetheless, landslide inventory and database preparation stage, a key to whatsoever landslide study, is by far the most significant step. As Ye et al. (2019) stated, detecting and monitoring landslides are hot topics in the remote sensing community, particularly with the development of remote sensing (RS) technologies and the significant progress of computer vision. In this context, RS techniques such as aerial photography, high- or medium-resolution optical satellite images, interferometric synthetic aperture radar (InSAR) and light detection and ranging (LiDAR) were widely used in landslide studies (Alkevli and Ercanoglu 2011; Pirasteh and Li 2017a; Pirasteh et al. 2020). Based on the perpetual developments of computer, geographical information system (GIS) and RS technologies at the end of the 1980s and 1990s, the scientific landslide literature has witnessed splendid progressions. On the one hand, more attention has been paid by scientists on landslide-related studies; on the other, data availability and quality, funding provided by local/governmental agencies and technological aspects have always increased during this period worldwide. Particularly, landslide-susceptibility mapping studies have been more preferred by the scientists using different quantitative and qualitative methods when compared with the landslide hazard and risk studies. The main reason could be related to the difficulties in producing a landslide hazard or risk map that could be considered as a more challenging task and needs more detailed data. In addition, incompleteness of the landslide inventory maps and absence of reliable damage data related to landslides could be considered as the two most important reasons in order to reach hazard and risk. Despite all, landslide hazard and risk could be evaluated by different approaches even in national and global scales in some studies such as Guzzetti et al. (2000), Evans and Roberts (2006), Nadim et al. (2006), Yoshimatsu and Abe (2006), Abella and Van Westen (2007), Van Den Eeckhaut and Herva´s (2012), Jaedicke et al. (2014) and Di Martire et al. (2017).

Although the term risk corresponds to various meanings in different disciplines, one of the most commonly used definitions of risk is the expected number of lives lost, persons injured, damage to property and disruption of economic activity due to a particular damaging phenomenon for a given area and reference period (Varnes and IAEG 1984). This definition is also known as the total risk (RT), which is often difficult to calculate and is almost impossible particularly for medium- and small-scale studies. For the medium- and small-scale studies, the researchers preferred to use the term specific risk (RS) (e.g. Abella and Van Westen 2007; Akgun et al. 2012; Bianchini et al. 2017). RS is simply expressed by the product of hazard (H) and vulnerability (V) (Varnes and IAEG 1984). The term V means degree of loss to a given element, or set of elements, at risk resulting from the occurrence of a natural phenomenon of a given magnitude (Varnes and IAEG 1984). In this regard, we approached the specific risk term in this study by considering the available data and the scale. Details regarding the specific risk components (i.e. hazard, which also includes the susceptibility stage, and vulnerability) were given in the related sections of the study.

In this study, given the concepts above, a regional semi-quantitative landslide risk assessment was performed in Kahramanmaraş City centre and its vicinity, located in the Eastern Mediterranean region of Turkey. Since the landslides are one of the most problematic natural hazards in the city, it is aimed to highlight the landslide risk hotspots in a regional scale for the study area. To assess the landslide risk of the selected area, a semi-quantitative landslide risk index (LRI) was approached because of the fact that there were limited numerical data such as complete landslide inventory, occurrence dates of the landslides, quantitative damage information, etc. A semi-quantitative risk assessment could be considered as an intermediate stage between qualitative (heuristic) and quantitative (probabilistic) risk evaluations by ranking the risk of concern. In order to calculate the LRI in the study area, data provided by AFAD (Disaster and Emergency Management Presidency of Turkey) were used and updated during the field studies as much as possible. Then, landslide susceptibility, hazard and vulnerability assessments were performed. Finally, a semi-quantitative landslide risk assessment was conducted by producing an LRI map of the study area. All stages of the study were presented in detail in the next sections.

Study area

Kahramanmaraş City is located in the eastern part of the Mediterranean region of Turkey (Fig. 1), and has an extent of 14,525 km2, comprising of 11 different provinces. The population of Kahramanmaraş is 1,075,706, and the northern part of the city mostly contains plain areas. In general, the city presents mountainous and semi-mountainous morphological features, and the Mediterranean climate is dominant in the region. The eastern and the southern parts have a rugged topography. The summers are hot and arid; the winters are warm and rainy. The months December and January are the rainiest ones with an average rainfall of around 120 mm. The highest temperature is observed in July and August (MGM 2019) (Table 1). The city is in a very important position in terms of industrial purposes, and there are two official and two privately organized industrial zones. For the textile industry, Turkey’s largest organized industrial zone will be established in the city. The city has a feature that can be regarded as a centre for transportation and railway network in the Eastern Mediterranean region, connecting the Eastern and Southeastern Anatolia regions of Turkey.

Fig. 1
figure 1

Location map of the study area

Table 1 Monthly long-term meteorological characteristics of the region

Kahramanmaraş City has suitable geological, geomorphological and climatic characteristics in terms of natural disaster occurrences such as earthquakes, landslides, floods, avalanches, etc. For this reason, it is necessary to take into consideration the properties and the mechanisms of the natural disasters, particularly for the landslides that are the most commonly encountered disaster type in the city (see Fig. 1). However, if the spatial distribution of the landslides is considered, it is evident that the central part of the city contains more landslide locations as well as a bigger population and higher urbanization. For this reason, the central part of the city (see Fig. 1) was selected as the study area covering 3516 km2 in order to assess the landslide risk.

With respect to the geological features, it can be concluded that Kahramanmaraş is located in a region where the geological and tectonic characteristics are complicated. In order to simplify the geological features, at this stage, they were grouped by their system/period and are shown in Fig. 2. Detailed lithological names and ages were also presented below this figure. There are 11 different lithological groups and 48 units ranging between the Precambrian to the Quaternary period (see Fig. 2). At the basement, Precambrian amphibolite, schists and gneisses are located. These units are overlaid by marble, gneiss, schists and dolomites of the Ordovician period. Silurian shale, sandstone alternations are overlaid by massive limestones of Devonian, quartzite and siltstone. Permian, Triassic and Jurassic units are mostly represented by limestone, dolomite, marble and calc-schist. In general, Cretaceous units are in flysch facies comprising conglomerate, sandstone, mudstone and marl alternations. The Paleogene period is mostly represented by limestone, sandstone and marl in addition to volcanic rocks, which were overlaid by Neogene units of conglomerate, sandstone, gypsum, tuff and marl. Neogene units were the most abundant lithological units in the study area. Quaternary units include alluvium and slope debris, mostly located in close vicinity to the city centre and along the rivers and steep slopes. The study area is very close to the Eastern Anatolian Fault Zone (EAFZ) at which many destructive earthquakes occurred before, and a few segments of the EAFZ pass through the SE part of the study area (see Fig. 2). In addition, thrusts developed mostly in the Eocene and Miocene and strike slip faults are commonly traced in the study area (Sümengen 2013).

Fig. 2
figure 2

Geological features of the study area

The city centre is mostly covered by alluvial deposits and represents plain features with gentle slopes, while the other parts show generally moderate to high steep topographical features. Topographical elevation ranges between 203 and 3024 m (asl), and the eastern and northeastern parts of the study area have elevation values greater than 2000 m. Slope degrees range between 0 and 69°, and particularly, the SE and NE parts of the study area contain steep slopes greater than 45°, covered by massive and strong lithological units.

Landslides in the region were mapped and updated during the field studies. A total of 215 deep-seated rotational earth slides (Fig. 3a) were mapped during the field studies, and their areal extents ranged between 0.002 and 5.2 km2 (Fig. 3b). It should be noted that the x axis of the landslide sizes in Fig. 3b represents the logarithm of the landslide sizes. It is very well-known that a detailed and reliable landslide inventory and database are essential for any landslide work. However, as in the case of this study, complete landslide inventory maps are lacking in many countries. Based on the archives and database of AFAD, it was not possible to obtain the exact dates of the all landslides, which was very important for the landslide hazard assessment stage for triggering factor analysis. Unfortunately, only the dates of 29 of the total landslides were exactly known (day/month/year) in the AFAD landslide archives and database (Fig. 3c). It was revealed that March and January were the two months that most of the landslides occurred. In addition, the occurrence years of the 71 landslides were only known in this database when compared with the Kahramanmaraş Meteorological Station (KMS) data (Fig. 3d). Although there are 11 rainfall stations in the city, they have many data deficiencies. There is only one rainfall station located in the city centre (i.e. KMS), including full daily rainfall data starting from 1959 to 2015 with a mean annual precipitation (MAP) of 718.5 mm (see Fig. 3d).

Fig. 3
figure 3

Landslide features of the study area: a a view of the landslides, b frequency distribution of landslide sizes, c frequency distribution of landslide occurrence months and d annual precipitation of KMS and number of landslides in the study area

Even if the exact dates of all landslides were not known, it could be initially concluded that there was a good harmony between the number of landslides and rainfall amount, which could be considered as a key indicator to the hazard assessment. When Fig. 3d is examined, it could be concluded that there was a general agreement with the number of landslides and precipitation, particularly in 1976, 1987 and 2009, respectively. Unfortunately, the long-term data of KMS itself was not sufficient enough to model the landslide hazard throughout the study area. For example, some landslides, particularly located in the NW part of the study area, are highly far from the location of KMS (approximately 50 to 60 km). Thus, under these circumstances, it was not possible to assess the landslide hazard in the region based on the definition given by Guzzetti et al. (1999). The details related to the triggering factors can be found in the hazard section.

Methodology

Landslide susceptibility

In order to assess the landslide risk in a region, the second important stage following the landslide inventory studies is to evaluate the landslide susceptibility. Landslide susceptibility evaluations need to be able to respond to questions (e.g., where, under which conditions? etc.) that landslides are likely to occur in the future. The quality and the accuracy of these maps are also related to the purpose, scale, data and financial conditions of the study. In this context, the susceptibility phase after the inventory studies is very important for both hazard and risk stages. Landslide susceptibility assessment has been frequently performed in the scientific landslide literature. Basically, two approaches, qualitative and quantitative methods, have been adopted for the preparation of landslide susceptibility maps. Qualitative methods are generally carried out depending on the field observations by using the knowledge and experience of the researcher, while quantitative methods are performed by using statistical and/or mathematical modelling techniques. In quantitative methods, the parameters that are related to landslides are statistically evaluated and are produced as digital maps, mostly on a GIS platform. For example, in addition to some multivariate statistical analyses, frequency ratio (FR), logistic regression (LR), support vector machines (SVM) and artificial neural networks (ANN) have been widely used to assess and/or to compare landslide susceptibility in many studies (Li et al. 2017; Chen et al. 2019a; Gholami et al. 2019). Although all these methodologies had pros and cons, we preferred to assess landslide susceptibility in the study area by FR and LR methods which were successfully applied in the landslide literature for different regions.

The FR method has been one of the most widely used and simple methodologies to evaluate and/or to compare the landslide-related factors and their relationships between the landslide and non-landslide locations (e.g. Solaimani et al. 2013; Mondal and Maiti 2013; Regmi et al. 2014; Meten et al. 2015; Youssef et al. 2015; Wang et al. 2016; Wu et al. 2016; Zhang et al. 2016a, 2016b; Chen et al. 2017; Li et al. 2017; Nicu 2017; Pirasteh and Li 2017b; Sahana and Sajjad 2017; Aditian et al. 2018; Mandal et al. 2018; Zhu et al. 2018; Arca et al. 2019; Jana et al. 2019; Nsengiyumva et al. 2019; Yan et al. 2019). Actually, the theory behind the FR method is a simple ratio of A (area of landslides in a parameter class to area of all landslides) to B (area of a parameter class to the entire area) (i.e. FR = A/B). The FR value greater than 1 reflects a strong relationship, while the FR value less than 1 presents a poor relationship between the landslides and the considered parameters (Lee and Sambath 2006). Thus, landslide conditioning parameters were produced by the available data in a GIS platform to produce the landslide susceptibility map by the FR method. The considered parameters related to landslides in the region were slope (SLP), lithology (LIT), aspect (ASP), topographical elevation (TEL), distance to rivers (DTR), land use (LUS), curvature (CRV), distance to faults (DTF) and normalized difference vegetation index (NDVI). The SLP, ASP, TEL and CRV parameters were obtained from the digital elevation model (DEM) (containing 26,300,000 pixels) with the 10 m*10 m resolution, gathered from the General Directorate of Mapping of Turkey. The DTR map was produced from DEM using ArcHydro tools and GIS procedures. The LUS map was gathered from AFAD data, produced from Coordination of Information on the Environment (CORINE) codes. The NDVI map was produced from a Landsat ETM+ satellite image using red (R; band 3) and near-infrared (NIR; band 4) bands by rationing the reflectance values of the related bands (NDVI = (NIR − R) / (NIR + R)). In addition, the landslide inventory map (containing 805,662 pixels) was also integrated to the database in digital format. All files were stored in the GIS platform, and they are presented in Fig. 4. The next step of the FR analysis was to calculate the number of pixels either in the domain of a subgroup of a considered parameter or in the landslide inventory map. This calculation was made in the GIS platform, and the number of pixels related to the FR equation was extracted. Then, the FR values of the considered parameters and their subgroups were evaluated (Table 2). Finally, the Landslide Susceptibility Index (LSI) was calculated by the summation of FR values of each parameter as given below (Lee and Sambath 2006):

Fig. 4
figure 4figure 4

Parameter maps used in the susceptibility analyses: a aspect, b curvature, c distance to faults, d distance to rivers, e normalized difference vegetation index, f slope, g topographical elevation, h lithology and i land use

Table 2 FR values of the considered parameters and their subgroups
$$ \mathrm{LSI}=\sum \mathrm{FR}\kern0.5em \left(\mathrm{FR}\ \mathrm{rating}\ \mathrm{of}\ \mathrm{each}\ \mathrm{facto}{\mathrm{r}}^{'}\mathrm{s}\ \mathrm{type}\ \mathrm{or}\ \mathrm{range}\right) $$
(1)

The landslide susceptibility map produced by FR was obtained (Fig. 5a) using the equation above and standardized in [0, 1] interval which ranges from very low to very high landslide susceptibility classes.

Fig. 5
figure 5

Landslide susceptibility analysis results: a FR map, b LR map and c ROC curves

As for the LR method, it has also been one of the most commonly used methods for landslide susceptibility mapping purposes among the researchers in recent years (e.g. Talaei 2014; Bai et al. 2015; Lombardo et al. 2015; Chen et al. 2016; Sangchini et al. 2016; Wang et al. 2016; Du et al. 2017; Pham et al. 2017a, 2017b; Raja et al. 2017; Sahana and Sajjad 2017; Wu et al. 2017; Achour et al. 2018; Mandal and Mandal 2018; Polykretis and Chalkias 2018; Chen et al. 2019b; Kadavi et al. 2019; Soma et al. 2019). Conventionally, application of the LR method to produce a landslide susceptibility map is performed by dependent (landslide inventory) and independent variables (conditioning parameters of landslides) (Nandi and Shakoor 2009). LR analyses were performed by the LogisticReg module of the Idrisi Selva computer code. This module performs binomial logistic regression using raster or attribute values files. One of the most important advantages of the logistic regression method is that continuous and/or discrete variables could be used for the calculation stage (Lee and Sambath 2006). For the LR analyses, the dependent (i.e. landslide inventory) and 9 independent variables (slope, lithology, aspect, topographical elevation, distance to rivers, land use, curvature, distance to faults and normalized difference vegetation index) were loaded to the GIS platform. In the LR analysis, the module estimates a model of probability which represents the relationship between the dependent and independent variables. The probability of the dependent variable is calculated by the equation given below:

$$ P\left(y=1|X\right)=\left(\exp \left(\sum BX\right)/\left(1+\exp \left(\sum BX\right)\right)\right. $$
(2)

where P is the probability of the dependent variable being 1 (landslide); X is the independent variables (conditioning parameters) (X = (x0, x1, x2,…, xk) and B is the estimated coefficients of the considered input parameters (B = (b0, b1, b2,…, bk) (Eastman 2012). Then, a linearization process is needed to remove the zero and one boundaries for the probability of the dependent variable. By doing so, the predicted probability value (P′) will range from 0 to 1 and will be continuous by applying Eq. 3 (Eastman 2012):

$$ P'=\mathrm{Ln}\left(P/1-P\right)=\mathrm{b}0+\mathrm{b}1\ast \mathrm{x}1+\mathrm{b}2\ast \mathrm{x}2+\dots \dots +\mathrm{b}\mathrm{k}\ast \mathrm{x}\mathrm{k}+\mathrm{error} $$
(3)

In addition, the maximum likelihood estimation procedure is used in the related module of the computer code for finding the best-fitting set of independent parameters (Eastman 2012). These procedures are carried out by the following equations:

$$ L={\prod}_{i=1}^N{\mu}_i^{yi}\ x\ {\left(1-{\mu}_i\right)}^{\left(1- yi\right)} $$
(4)
$$ {\mu}_i=\exp \left({\sum}_{k=0}^K{b}_k{x}_{ik}\right)/\Big(1+\exp \left({\sum}_{k=0}^K{b}_k{x}_{ik}\right) $$
(5)

where L is the likelihood, μi corresponds to the predicted value of the dependent variable for sample i, N refers to the number of samples and yi is the observed value of the dependent variable for sample i (Eastman 2012). A stratified random sampling strategy was applied for the analyses considering 10% sampling proportion by default. LogisticReg module also summarizes some statistical indicators such as − 2log(L0), − 2log(Likelihood) and pseudo R2. These values indicate how well the model fit or not considering the parameters fed into the analyses. For example, pseudo R2 values greater than 0.2 indicate a relatively good fit, and the model is assumed as acceptable (Eastman 2012). The statistical results of LR are tabulated in Table 3, and the landslide susceptibility map produced by the LR model is shown in Fig. 5b. Since the pseudo R2 value was greater than 0.2, the model was accepted as a relatively good fit.

Table 3 Statistical results of the LR method of landslide susceptibility analyses

The final stage of the landslide susceptibility mapping procedures was to assess the performance of the so-produced maps by FR and LR methods. It was conducted by the ROC module of Idrisi Selva. The ROC module calculates the area under the curve (AUC) value using true- and false-positive percentages for each class, and the user can draw the ROC curve using these percentages. The AUC value of 1 corresponds to a perfect fit, while the AUC value of 0.5 is considered as an agreement by chance. During this stage, 8,878,255 pixels (covering approximately 33% of the study area), which were allocated for this validation stage and were never used for the former FR and LR analyses, were taken into consideration. Thus, by using this module, AUC values were calculated for each susceptibility map. The values were 0.862 and 0.828 for FR and LR methods, respectively. The schematic representation of these results is shown in Fig. 5c. Although the AUC values were close to each other, the landslide susceptibility map produced by the FR method was selected as the input for the landslide hazard assessment because of its higher AUC value.

Landslide hazard

Similar to the landslide susceptibility assessments, landslide hazard mapping techniques are divided into two major groups such as qualitative (direct) and quantitative (indirect) methods. Qualitative methods are generally based on expert opinion and considered as subjective. On the contrary, quantitative methods are considered as objective and produce numerical estimates (probabilities) of the occurrence of landslide phenomena in any hazard zone (Guzzetti et al. 1999). For both cases, data quality, reliability and the completeness of the landslide database are the most significant items for hazard assessments. In addition, the hazard (H) component is by far the most complex to establish since it is almost impossible to link the temporal probability with the landslide occurrences due to the lack of reliable historical and complete landslide data in many parts of the world (Van Westen et al. 2006).

Incompleteness of the landslide inventory is one of the most important problems when one tries to produce a landslide hazard and/or risk map. Thus, it becomes almost impossible to reach the exact definition of hazard including the three components (spatial, temporal and magnitude/size probabilities) in many studies proposed by Guzzetti et al. (1999). Of these, the magnitude (i.e. size, volume, intensity, etc.) component was not considered for the landslide hazard stage in this study due to the incompleteness of the landslide database. This situation can also be confirmed in Fig. 3b, particularly for small landslide sizes covering a few thousand square metres of areal extent. In addition, information related to the triggering event(s) is also a problematic issue. One of the most important reasons for this difficulty is generally sourced from the absence of the documentation of landslide occurrences and triggering agents such as earthquake, rainfall, human activity, etc. Furthermore, triggering factors are peculiar to the area concerned, and their effects on the landslide occurrences may be different for each site.

Given the problems mentioned above, landslide hazard assessment was conducted by the available data at hand. The first step of this assessment was to identify the triggering factor(s). Based on our archive studies and the interviews with local people during the field studies, the staple point of these works was that the precipitation was the major triggering factor of landslides. In addition, even if it was not valid for all landslides in the database, there was a good compatibility between the higher precipitation values and number of landslides represented in Fig. 3d. Therefore, precipitation was determined as the major triggering factor in the study area according to these studies. In these studies, there were no earthquake-triggered landslide data or damage information caused by landslides following an earthquake. However, the study area was located in a seismically active zone. In addition, as mentioned before, the Eastern Anatolian Fault Zone (EAFZ) and its segments are very close to the study area, particularly in the SE part (see Fig. 2). Thus, instead of not eliminating earthquake as a triggering factor of landslide occurrences, it was used as a hazard component. For this stage, the approaches provided by Abella and Van Westen (2007) and Akgun et al. (2012) were considered although there were no earthquake-triggered landslide data in their database. In these studies, peak ground acceleration (PGA) was taken into consideration as a triggering indicator with respect to landslide hazard. For this purpose, an index map (LHI) was produced by using Eq. 6 and Eq. 7:

$$ \mathrm{LHI}=\mathrm{LSI}\ast \mathrm{TI} $$
(6)
$$ \mathrm{TI}=\left(\mathrm{PGA}\ast 0.2\right)+\left(\mathrm{PRE}\ast 0.8\right) $$
(7)

where LHI is the landslide hazard index, LSI refers to the landslide susceptibility index and TI is the triggering indicator of landslides including precipitation (PRE) and PGA data. Of these, LSI was obtained in the previous section. For the triggering indicators, PGA and PRE maps were used as inputs of TI. For the PGA map, earthquake hazard map of the study area was gathered from AFAD. It was based on PGA values ranging from very low to very high hazard, which corresponds to 10% probability of exceedance in 50 years with a return period of 475 years. This map was standardized in [0, 1] interval for the study area and is shown in Fig. 6a as an input of TI. Then, the other input of TI, namely the PRE map, was produced. To produce the PRE map of the study area, the approach provided by Akgun et al. (2012) was considered. Data scarcity and deficiency problems related to the precipitation and landslide data were almost the same in their study area. For this reason, we considered the additional data of three meteorological stations (Göksun, Afşin and Elbistan; for the locations, see Fig. 1) at which the most long-term data exist regardless of their deficiencies in some years, along with the data of KMS. Thus, considering the annual precipitation data of these four meteorological stations from 1959 to 2014, the PRE map was obtained by using the inverse distance weighted technique. This map was standardized for the study area similar to the PGA indicator and is shown in Fig. 6b as a second input of TI. Then, to produce the LHI map, PGA and PRE maps were multiplied by 0.2 and 0.8, respectively, as suggested by Abella and Van Westen (2007). In other words, the TI part of LHI was obtained by using Eq. 7. Finally, the LHI map of the study area was produced (Fig. 7) using these equations. The LHI map was classified into five groups ranging from very low (1) to very high (5) classes to represent the hazard component of the LRI.

Fig. 6
figure 6

Triggering factor maps of the study area: a PGA map and b PRE map

Fig. 7
figure 7

LHI map of the study area

Vulnerability

Losses and/or damages that may be raised from any natural disaster should be evaluated by considering economic, social, socio-economic or environmental factors. Based on the literature studies, scientists generally considered vulnerability part of landslide risk in four different groups such as physical, economic, environmental and social ones. Naturally, the most important element is human life, but it is suggested that other factors should also be taken into consideration (Fell et al. 2008a, 2008b). It is evident that an interdisciplinary study is needed to evaluate this issue in a way that involves many different disciplines for a sustainable risk assessment and management. Vulnerability assessment is an essential part of the landslide risk evaluations (Bednarik et al. 2012). In addition, vulnerability assessment involves the understanding of the interaction between a given landslide and the affected elements (Dai et al. 2002). However, studies related to vulnerability with respect to landslides are far less abundant (Galli and Guzzetti 2007). Furthermore, it should be noted that there is a considerable amount of uncertainty in the vulnerability assessment stage. Thus, in many cases, researchers have to make some assumptions depending on factors such as scale, spatial features, data groups at hand, data quality and reliability.

In the literature, the term vulnerability was defined in many ways for various purposes. In addition to the definition of vulnerability given in the first section of this study by Varnes and IAEG (1984), it is possible to find out different definitions of vulnerability. Recent reviews and applications of vulnerability in different disciplines can be found in detail in Gonçalves and Zezere (2018), Behera et al. (2019), Giannakidou et al. (2019), Miranda and Ferreira (2019), Rehman et al. (2019) and Singh et al. (2019a, 2019b). From a different point of view, Köhle et al. (2007) considered vulnerability as a dynamic parameter in risk assessments since it has temporal and spatial content. Since it was almost impossible to access the data on temporal content for such a big area and there were many data deficiencies, the spatial content of vulnerability was solely considered by the available data of concern.

Similar to the absence of documentation problem between landslide occurrences and triggering factors mentioned in the hazard section, there are also problems arising from the lack of information between reliable and/or available damage data and landslides. To some extent, this situation also makes quantitative landslide risk assessments impossible (Van Westen et al. 2006). In addition, lack of damage data also hampered to produce vulnerability curves or functions as well as fragility curves. Therefore, it is one of the most challenging tasks to assess vulnerability as a part of risk concept when historical catalogues related to damage data for landslides are lacking, as in the case of this study.

To cope with these issues mentioned above, some indices were used in the literature, particularly for small- and medium-scale studies (e.g. Abella and Van Westen 2007; Akgun et al. 2012; Giannakidou et al. 2019). In this study, for the reasons mentioned above, it was preferred to produce a relative vulnerability index (RVI), as a second component of LRI, in addition to LHI. It contains mainly three different components such as environmental, infrastructural and population density features, based on the available data for the study area. In addition to RVI, its sub-components were also explained in the following lines.

In the study area, categorical land use (Fig. 8), water channels (WCH), potable water pipeline (PWP), power lines (PWL), road network (RON), railway (RAI) and natural gas pipeline (NGP) (Fig. 9) and population density data (Fig. 10) were taken into consideration as the elements at risk. All these data were gathered from the AFAD Provincial Directorate of Kahramanmaraş, based on the year of 2015 data. For land use components (see Fig. 4i and Table 2), they were classified into six groups (see Fig. 8) such as urban areas, plantation, agricultural, forest, bare land and water based on their categorical features for the AHP analyses explained below to shorten the number of parameters. In addition, linear density calculations (per km2) were performed in the ArcGIS platform for lineament features of WCH, PWP, PWL, RON, RAI and NGP elements in the study area, and they were standardized in [0, 1] interval (see Fig. 9). For the population density (see Fig. 10), 2015 population density data of Kahramanmaraş were interpolated and standardized in [0, 1] interval throughout the study area.

Fig. 8
figure 8

Categorical land use map of the study area

Fig. 9
figure 9

Infrastructural elements-at-risk maps and their densities: a water channel, b potable water pipeline, c power line, d road network, e railway and f natural gas pipeline

Fig. 10
figure 10

Standardized population density map of the study area

As being a part of the specific risk, the term vulnerability depends basically upon the degree of loss (economically and/or physically) for the element(s) at risk and the magnitude/intensity (velocity, volume, energy, impact pressure, etc.) of a natural phenomenon (i.e. landslide in this case). In the scientific literature, some excellent reviews and applications can be found with respect to landslide vulnerability considering the loss and magnitude components (e.g. Leone et al. 1996; Catani et al. 2005; Galli and Guzzetti 2007; Totsching and Fuchs 2013; Bianchini et al. 2017; Singh et al. 2019a, 2019b). Due to the lack of such vulnerability-related data, vulnerability assessment was performed by considering the relative loss (RL) and spatial impact (SI) of the elements at risk in the study area to produce a RVI map (Fig. 11). In other words, degree of loss and magnitude and/or intensity components of vulnerability were simulated by RL and SI, respectively (see Fig. 11). Economic valuation (i.e. relative cost of the elements) of RL for environmental and infrastructural elements at risk was achieved by AHP matrices fulfilled by the local experts of AFAD. The question of “if a landslide occurs in the study area, what would be the relative economic loss on an element at risk because of this landslide?” was asked to these experts. For population data, a standardized population density map was used for RL. The spatial impact (SI) of the elements at risk was linked to the spatial distributions of these elements on the landslide locations based on categorical (for land use) and frequency distributions (for infrastructures and population) throughout the study area. SI values were simply calculated by dividing the number of pixels of each subgroup of element at risk to the total number of landslided pixels, expressed as landslide frequency (LF) in Fig. 11. The idea behind this concept was that the LF value of an element at risk would be high in the landslide areas if it was really linked to a landslide occurrence. If not, it would be low or zero as in the case of RAI and NGP components of infrastructural elements (see Fig. 11).

Fig. 11
figure 11

Flow chart representing the calculation of spatial impact and relative loss of elements at risk: a environmental, b infrastructural and c population

For the RL component, the AHP model was used for environmental and infrastructural vulnerability. The AHP method was introduced by Saaty (1977), and it was one of the most commonly used methods to cope with various problems in decision making. AHP is a theory of measurement through pairwise comparisons and relies on the judgements of experts to derive priority scales in relative terms (Saaty 2008). The reasons why we selected AHP theory to assess the RL component of vulnerability are mainly twofold: firstly, damage/loss data scarcity throughout the study area and, secondly, scale and size of the study area. Information such as economic loss and damage that may occur as a result of the landslides on the vulnerability of each element at risk could not be reached and/or was not available for the study area. Under these conditions, the most important advantage of the utilization of the AHP method was that it allowed the spatial expression of the RL to the elements at risk. To make comparisons, a scale of numbers was needed, which indicated the importance among the elements (Saaty 2008). The scale provided by Saaty (1977) ranges between 1 (equal importance) and 9 (extreme importance), and the reciprocals (e.g. 1/9, 1/7, 1/3, etc.) could also be used when comparing the importance of the pairs to fulfil the AHP matrices. From this point of view, three local experts of AFAD were asked the question mentioned above to fulfil the AHP matrices for environmental and infrastructural elements at risk to achieve the RL component of the vulnerability. AHP matrices used to produce RL maps of the environmental and infrastructural elements at risk are tabulated in Table 4 and Table 5, respectively. The grey-shadowed parts in these tables correspond to the reciprocals of the values. To produce the RL maps, the considered elements-at-risk parameters of concern were fed into the GIS platform. For this stage, the weight module of the Idrisi Selva computer code was used. The module gives the value of consistency ratio (CR) as a measure of the consistency among the considered parameters. The CR value lower than 0.1 is acceptable with respect to the consistency of the parameters of concern (Saaty 1977). Details related to the AHP method can be found in Saaty (1977, 2008). As can be seen from Table 4 and Table 5, CR values were lower than 0.1, and they were considered as consistent. To obtain an RVI map of the study area, SI and RL components were multiplied and standardized in [0, 1] interval for each element-at-risk group in the GIS platform. Then, environmental (EVM), infrastructural (IVM) and population (PVM) vulnerability maps were obtained by these calculations. All these stages are represented in Fig. 11. EVM, IVM and PVM maps are also shown in Fig. 12a–c as the components of the RVI map. Finally, the RVI map of the study area (Fig. 12d) was produced by arithmetically adding these three vulnerability maps using the equations given below. It was classified in 5 groups ranging from very low (1) to very high (5) vulnerability:

Table 4 AHP matrix used in economic valuation of vulnerability assessments for the components of land use categories
Table 5 AHP matrix used in economic valuation of vulnerability assessments for the components of infrastructural elements
Fig. 12
figure 12

Vulnerability maps for the study area: a environmental vulnerability map (EVM), b infrastructural vulnerability map (IVM), c population vulnerability map (PVM) and d relative vulnerability (RVI) map

$$ \mathrm{EV}={\mathrm{SI}}_{\mathrm{env}}\ast {\mathrm{RL}}_{\mathrm{env}} $$
(8)
$$ \mathrm{IV}={\mathrm{SI}}_{\mathrm{inf}}\ast {\mathrm{RL}}_{\mathrm{inf}} $$
(9)
$$ \mathrm{PV}={\mathrm{SI}}_{\mathrm{pop}}\ast {\mathrm{RL}}_{\mathrm{pop}} $$
(10)
$$ \mathrm{RVI}=\mathrm{EV}+\mathrm{IV}+\mathrm{PV} $$
(11)

where EV is the environmental vulnerability, IV is the infrastructural vulnerability, PV is the population vulnerability, SI is the spatial impact for the elements at risk group (i. e. environmental, infrastructural and population), RL is relative loss for the elements-at-risk group (i.e. environmental, infrastructural and population) and RVI is the relative vulnerability index.

Results and discussion

At the final stage of the study, a semi-quantitative landslide risk index (LRI) map was produced by considering the LHI and RVI maps. LRI was achieved by combining the LHI and RVI maps, expressing the specific risk throughout the study area. For the combination of LHI and RVI, we took into consideration a semi-quantitative risk matrix ranging from very low (1) to very high (5) classes, which is tabulated in Table 6. Considering the LHI and RVI classes tabulated in Table 6, the final LRI map was produced (Fig. 13). This map provides a general overview on the landslide risk phenomenon in the study area. In other words, it could be considered as an initial step with respect to landslide risk assessments in the study area. According to this map, the northern part of the region was abundantly classified in the VL group, while the close surrounding regions to the city centre (except for the southern part of the city centre) were in the high and very high LRI classes. This means that LHI and RVI values were relatively high. Areal coverages of the LRI classes were calculated as 21.4% very low (VL), 10.8% as low (L), 37.4% as medium (M), 24.8% as high (H) and 5.6% as very high (VH) LRI, respectively. These values were 19.1% and 10.6%, 23.3% and 31.3%, 27.6% and 31.6%, 20.2% and 22.2%, and 9.8% and 4.3%, corresponding to the classes of VL, L, M, H and VH components, for hazard (LHI) and vulnerability (RVI), respectively. These differences could be explained by the hazard and vulnerability components of LRI. For example, a region with a very high class of vulnerability may be in a very low hazard class, or vice versa. If these classes were combined together based on the class values given in Table 6, the resulting class would be low LRI.

Table 6 Semi-quantitative risk matrix used in the study
Fig. 13
figure 13

LRI map of the study area

As a general point of view of landslide characteristics, a total of 215 deep-seated rotational earth slides were mapped during the field studies. It was revealed that the landslides in the study area generally occurred in low to medium slope angles. The landslides in the region were mostly observed in a weathering zone of weak flysch units, formed by siltstone, mudstone and sandstone alternations with occasionally volcanic intercalations. The landslides in the study area generally occurred on the northeastern-facing slopes with topographical elevations between 1000 and 1250 m (asl). The most common land cover types of landslides are orchards (25.72%), mixed agricultural area (20.42%) and olive groves (16.60%). In landslide susceptibility analyses, FR and LR models were taken into consideration. In the performance evaluations, the AUC values were calculated as 0.828 for the LR method and 0.862 for the FR method, respectively. Due to its higher AUC value, the FR model susceptibility map was selected as the main input component for hazard analysis. For the hazard stage, precipitation and peak ground acceleration parameters were considered. Although there were no data related to the earthquake-triggered landslides in the region, closeness to the active fault zones of the study area was the main reason for this selection (20% effect in triggering indicators in hazard stage). Based on the archive studies and the interviews with local people, most of the landslides was triggered after a long-term heavy rainfall. There was also a good harmony between the higher precipitation values and the number of landslides. Thus, 80% weight was given to the triggering indicator for LHI calculation.

It was almost impossible to assess the temporal vulnerability in such a region and data scarcity. Thus, it was preferred to assess the spatial vulnerability comparing the considered elements at risk, relatively. In other words, it was not intended to assess an individual element at risk in the region such as a building, road cut, etc. The main idea herein was to represent the spatial vulnerability with respect to each group of elements at risk. The economic valuation of vulnerability stage was performed by the AHP model. However, it should not be considered as an insufficient approach since it would provide valuable information based on the experience and judgement of the experts, particularly in data-scarce environments. For the vulnerability assessments, two indicators, namely spatial impact (SI) and relative loss (RL), were introduced. These indicators were associated with the degree of loss and magnitude and/or intensity components of the term vulnerability given by Varnes and IAEG (1984). Degree of loss and magnitude (to some extent, intensity) components of vulnerability were imitated by RL and SI, respectively. For the losses, economic valuation of RL for environmental and infrastructural elements at risk was performed by using the AHP model. Based on the results of the AHP model, urban area class was the most valued one for environmental elements at risk. For infrastructural elements, the most valued element was natural gas pipeline. For population data, population density map was used for this purpose (i.e. relative loss) since it was not logical to compare human life with another type of element at risk by using AHP. As for the spatial impacts of the elements at risk, it was assessed based on the spatial distributions of these elements located on the landslides. Spatial impact values of the elements at risk were calculated by landslide frequencies based on their spatial distributions. For this indicator, it was linked to the intensity or magnitude of a landslide. It was thought that an element at risk was further damaged if the intensity or magnitude of the landslide of concern was higher than that of the other landslides. Thus, spatial distribution on landslide locations (i.e. LF) of any element at risk could be more expected to expose a landslide. For example, in the case of the RAI and NGP components of infrastructural elements, there were no landslide locations with respect to areal extents since the LF values were 0 for their all subgroups. However, NGP and RAI were two of the most valued components (NGP, the first one; RAI, the third one) of infrastructural elements based on the economic valuation (see Table 5). It means that there will be a possible economic loss if a landslide occurs, but since they have no spatial impact, infrastructural vulnerability will be zero for these components. However, environmental and/or population vulnerability may exist in the areas covered by NGP and RAI components. For this reason, environmental, infrastructural and population vulnerabilities were added arithmetically to produce the RVI map.

Conclusions

The practical utilization of the so-produced LRI map in regional planning of the study area is very important in the future. It is thought that the utilization of such maps, which will be very beneficial especially for decision makers and local administrations, will also play an important role in minimizing the damages caused by landslides. In the light of all these assessments, it is clear that landslides and their consequences will pose serious problems for the region in the future, regardless of considering the population growth. Apart from the city centre, there were many scattered settlements located in medium- to very-high-LRI zones in the study area (corresponding approximately to 70% areal coverage for these classes). Since this approach is the initial step of highlighting the landslide risk assessment of the region, detailed analyses should be performed in these zones in addition to the database improvement.

It was important to conduct such a work to highlight the landslide risk conditions based on the available data for the study area. However, it should be kept in mind that it can be re-evaluated with some additional studies when the data deficiencies are eliminated in the future. It is inevitable in data-scarce regions that some certain assumptions should be made. In this context, it is considered that the public institutions and organizations, especially the decision makers, local administrations and universities, have significant responsibilities to produce and to transfer reliable and high-quality data not only for landslides but also for all natural hazards for future works.