1 Introduction

Pipelines are main components of the infrastructure that are essential for providing primaries social services essential for inhabitants living in urban settlements and for supporting companies and industrial activities. The effect of natural hazards on this type of infrastructures (Hossam and Hossam 2010) is a crucial issue to be considered in order to prevent or mitigate damages to property and people. In the specific case of gas and oil pipelines, the risk arising from the possibility of fire and/or bursts intensifies the potential impact on human casualties, as well as on the loss of infrastructure functionality (Shebeko et al. 2007). Generally, gas/oil pipelines are positioned underground in order to minimize the exposure to weathering and surface accidents. However, relevant ground deformations, such as in earthquake-induced landslides, can affect buried pipelines with potentially dangerous effects.

Important infrastructure for gas/oil extraction and related pipeline networks are located in the valley of the Agri River (Basilicata, southern Italy) and extend to the Taranto oil terminal, in an area characterized by steep slopes in poorly cemented or highly fractured rocks, high slope instability (Vignola et al. 2001) and high seismogenic potential. The combination of all these factors gives rise to a significant earthquake-induced landslide hazard (EILH), a widely used term and concept since a decade (Romeo 2000; Refice and Capolongo 2002; Mahdavifar et al. 2012) at least, which requires spatial assessment and modelling in order to properly address risk facing for mitigation policies and decision-making support in emergency response.

Several methods based on the geographic information system (GIS) spatial modelling approach have been developed for the evaluation of the hazard connected to earthquakes (Borfecchia et al. 2010; Boccardo 2013) and, particularly, to earthquake-triggered landslides (Keefer 1984; Jibson et al. 2000; Rapolla et al. 2010, 2012). Many authors have dealt with post-event inventory assessment and characterization of the earthquake-induced landslides using high-resolution multispectral remotely sensed imagery (Kamp et al. 2010) and in situ georeferenced inspections (Guemache et al. 2010). Some of them have exploited spatial modelling GIS capabilities to assess the seismic landslide hazard by processing topographic and geologic layers within statistical models calibrated by means of an inventory of post-event landslides derived by means of remote sensing in order to obtain semi-quantitative zonation (Kouli et al. 2010; Yang et al. 2014). Fell et al. (2008) suggested three levels of zoning (level 1 regional zoning, scale 1:250,000–1:25,000; level 2 local zoning, scale 1:25,000–1:5,000; and level 3 site-specific zoning, scale > 1:5000) to allow assessment of landslide susceptibility. In this study, the methodologies proposed by Newmark (1965) and Rapolla et al. (2010, 2012) were implemented and GIS tools were used in order to compare their results and to provide a reliable level 1 regional zoning of the susceptibility to earthquake-induced landslides in the area of the gas/oil plant of the Agri Valley.

The selected methods are representative of two different approaches to the estimate of EILH: the first one consists in a permanent-displacement analysis, based on geotechnical-engineering criteria, originally developed to predict the effects of earthquake motion on earth or rock-filled dams and embankments (Newmark 1965); at a later stage, it was adapted and successfully applied to more general problems of slope stability under seismic action (Romeo 2000; Maugeri et al. 2009). Actually, the Newmark’s method is halfway between the too simplistic pseudo-static approximation and the overly complex stress–deformation analyses (Jibson 2011). The second method, instead, constitutes a simplified, typically GIS-based heuristic approach aimed at a quantitative evaluation of the landslide susceptibility in seismic areas. The diverse approaches of these two methods to the EILH evaluation which have been widely applied, mainly singularly, in similar contexts and at various scales, suggested us to test and compare their feasibility and performance in a large-sized seismic area, as that considered here, where, due to the presence of a huge crude oil extraction plant, there is also a stronger commitment at national level.

2 Study area

The Upper Agri Valley hosts the largest onshore oilfield and the largest oil/gas pre-treatment plant in Europe, known as Centro Olio Val d’Agri (COVA), managed by ENI Energy Company. At present, there are 25 active productive wells (Ministero dello Sviluppo Economico 2013), which are directly connected to the COVA through a buried pipeline network and reinjection sub-network (Fig. 1). Oil stabilization and gas conditioning are performed at COVA, before piping the oil to the refinery located in Taranto, on the Ionian seacoast, through a 136-km-long, 20-inch-diameter pipeline, referenced as main pipeline in Fig. 1. The main pipeline is buried at an average depth of 2 m.

Fig. 1
figure 1

a Map of the infrastructures for gas/oil extraction in the Agri Valley. The map shows the Centro Olio Val d’Agri (COVA), the active productive wells, the main pipeline, the rejection pipeline, the wells pipeline and the buffer area of interest. The map also reports the distribution of landslides (IFFI project—Inventory of Landslide Phenomena in Italy, ISPRA—Servizio Geologico d’Italia 2006); b detail of the most critical sector of the study area

The study area was defined by a 3.0-km-wide buffer area around the above-mentioned gas/oil infrastructures (covering a 61,000 ha area). It crosses the Upper Agri Valley (elevation ranging from about 540 m to about 1835 m above sea level), the valleys of some major Basilicata Rivers (Cavone, Basento and Bradano), and finally reaches the nearly flat landscape of the Ionian seacoast (Fig. 1).

The study area is affected by several natural hazards. From the seismic point of view, the Upper Agri Valley was hit in 1857 by one of the most destructive historical earthquakes in Italy (M = 7.0). Conflicting seismogenic models for the Upper Agri Valley are discussed in the recent literature. The seismogenic fault system capable of producing large events is alternatively associated with (1) the Eastern Agri Fault System (EAFS) (Benedetti et al. 1998; Cello et al. 2003; Giano et al. 2000; Giocoli et al. 2015) and (2) the Monti della Maddalena Fault System (MMFS) (Pantosti and Valensise 1990; Maschio et al. 2005; Improta et al. 2010). In addition, Borraccini et al. (2002) supposed that the most probable structure responsible for the development of the active fault system in Agri Valley is a buried fault just below the central part of the valley floor. Many hydrogeological instabilities and landslides have been reported and mapped (e.g. IFFI project—Inventory of Landslide Phenomena in Italy, ISPRA—Servizio Geologico d’Italia 2006) within the area of interest (Fig. 1). The landslides layer reported in Fig. 1 includes the mapped areas in the region of interest affected by these phenomena both with the localization of related triggering points; in addition, the different types of instabilities refer to their level of attention linked to the crossed land cover/use classes derived from CLC2006 (i.e. going respectively from 3 to 0 type number: urban fabric/infrastructures, transportation network, green areas/semi-natural environment, municipality areas without hydrogeological instabilities mapped).

Because of the roughness of relief, the geological features are also quite varied ranging from compact rocks (mainly limestones and sandstones) of Cretaceous–Miocene age in the extraction area, to Plio–Pleistocene poorly cemented sands and clays in the intermediate valleys, to recent alluvial and marine terraced sediments in the final section of the main pipeline. According to the aim of the present study, the spatial analysis was limited to the buffer area potentially involved in earthquake-induced landslide phenomena possibly affecting the described infrastructure in order to allow the optimization of models implementation and processing time.

3 Data

The available digital terrain model (DTM), geological, geotechnical, seismological and engineering data, mainly under the form of digital maps, were collected from different sources (Table 1) in order to provide the basic input data required by the models. As shown in Fig. 2, the two implemented models share the same databases, consisting in three distinct primary layers: the map of the seismic action, taking into account the site effects (PGAs); the digital terrain model; and the lithology. The PGAs (representing the local seismic action) (Fig. 3c) was determined from the PGA (pick ground acceleration, a g) values of the Italian national scale seismic hazard estimates for the area of interest and in accordance with the latest Italian technical code for construction (NTC 2008) and the Eurocode 8 (2003). The slope steepness map (Fig. 3a) was obtained from a DTM with a 20 × 20 m mesh. The lithology (Fig. 3b) was derived from the 1:100,000 geological map available at the National Geoportal of the Italian Ministry of the Environment (NG). In particular, the different geological units were assembled in 11 lithological units on the basis of their shear waves velocities (V s), cohesion (c′), friction angle (φ′) and unit weight (γ), which were derived from specific literature (e.g. Vignola et al. 2001; Caputo et al. 2004; Maugeri et al. 2005; Caputo et al. 2006; Cherubini et al. 2008; Di Giulio et al. 2008) and in situ investigations, mostly carried out by the administrations of municipalities located in the study area (Table 2). The different parameters of the models were extracted from such three primary layers, as described here forward.

Table 1 Input layers for GIS modelling
Fig. 2
figure 2

Synthetic workflow diagram. The central part shows the three distinct primary layers common to the two implemented models. On the left, the data derived from the primary layers for the Newmark’s method. On the right, the data derived from the primary layers for the Rapolla’s method

Fig. 3
figure 3

Intermediate thematic products: a slope steepness map; b lithological map; and c PGAs map

Table 2 Lithologies and related geotechnical parameters

4 Methods

The above-described data were preprocessed and transformed into homogeneous and compatible (in terms of cartographic projection) GIS layers, using specific tools and customized procedures. These layers were combined or merged utilizing specific algorithms to extract or generate the needed information or features mostly in the form of GIS layers (UTM WGS-84 zone 33N cartographic projection). The workflow diagram in Fig. 2 shows the steps followed in generating the EILH maps, while a more detailed schema is reported in appendix (Fig. 7).

According to the NTC (2008), the appropriate levels of protection for the oil pipeline facilities are considered to be achieved by selecting exceedance probabilities (PVR) and return period (RP) as indicated in Table 3 for nominal design life (V n) ≥ 100 years (large or strategic constructions). For the oil pipeline facilities, a V n = 100 years and a coefficient of use (C u) equal to 2 were chosen. The probability of exceedance of the seismic action during the RP = V n C u varies with the limit state, as shown in Table 3. Therefore, for the performance objective SLV (limit state for the safeguard of human life or ultimate state), we produced a seismic hazard map of the area in terms of reference PGA (pick ground acceleration, a g) on type A ground for RP equal to 1900 years. As the a g values refer to type A ground (stiff horizontal outcropping bedrock with V s > 800 m s−1), the Italian code introduced a stratigraphic factor (S s) and a topographic factor (S t) to take account of the site effects. Consequently, the local effective PGAs to be used as seismic action in our approach is given by the following equation:

$${\text{PGA}}_{\text{s}} = a_{\text{g}} S_{\text{s}} S_{\text{t}}$$
(1)

S s is a function of the ground type, a g and F 0 (maximum amplification of the horizontal elastic response spectrum). While a g and F 0 were mapped according to Annex A of NTC (2008) for RP equal to 1900 years, the ground type is supposed to be assigned through seismic microzonation procedure based on V s30 measurements. A recent microzonation project in the region of interest suggested that all landslides areas should be classified as type D ground, since V s30 measurements revealed poor indicators on the seismic amplification for this kind of site conditions (Mucciarelli et al. 2006). The above-mentioned project also reported that the presence of zones with many scattered landslides is very common conditions in the study area, as confirmed by the IFFI landslides maps of Basilicata (Fig. 1). Consequently, in the present study S s was calculated by considering the type D ground (NTC 2008). On the other hand, in the equations provided by the Italian code, taking into account the correlation between F 0 and a g in the study area (Fig. 4a), S s decreases with a g for all ground types so that, in particular, where a g grows over 0.4 g, S s values are comprised between 0.9 and 1.1 (Fig. 4b), which means that the soil amplification is weakly dependent on ground types in the most hazardous zones of the study area.

Table 3 Seismic action parameters according to the Italian code NTC (2008) (enhanced in light blue are the utilized values)
Fig. 4
figure 4

a Correlation between the maximum amplification of the horizontal elastic response spectrum (F 0) and the reference PGA on type A ground (a g) in the study area. b Stratigraphic factor (S s) versus a g according to the Italian code NTC (2008)

In addition, the DTM was used to assess the topographic factor S t by identifying the slope steepness classes and the ridges (Table 3). These latter were obtained by means of the flow direction and basins detection analysis. For each polygonal basin, the minimum and maximum heights (h min and h max, respectively) were extracted from the DTM using GIS zonal functions (Fig. 8a in appendix). Then, the following pixel-based interpolation was used:

$$S_{\text{t}} = 1 + b\frac{{(h - h_{\hbox{min} } )}}{{(h_{\hbox{max} } - h_{\hbox{min} } )}}$$
(2)

where h is the pixel height and b depends on the slope class (Table 3).

4.1 The Newmark’s method

The Newmark’s method (Newmark 1965; Maugeri et al. 2009) has undergone several modifications and improvements (Wilson and Keefer 1983) and several relations between seismic ground-motion parameters, and computed landslide displacements have been proposed (Ambraseys and Srbulov 1995; Jibson et al. 2000). In this paper, the simplest model of an infinite slope with a steady sliding surface in dry conditions parallel to the slope was adopted. The critical acceleration coefficient k y was calculated (Fig. 8b) by the following equation derived from Romeo (2000):

$$k_{y} = \frac{{c^{{\prime }} }}{{H\gamma \cos^{2} \beta }} + \tan \varphi^{{\prime }} - \tan \beta$$
(3)

where the critical acceleration (k y  × g) is a function of the geotechnical parameters (c′, φ′ and γ), the slope angle β and the depth H of the sliding surface from the ground level. The thickness H of the sliding surface was assumed here equal to 40 m in order to take into account the worst situation from a safety perspective.

Coherently with the Newmark’s model, the susceptibility to earthquake-induced landslide was assessed in terms of safety factor SF computed as the ratio of the critical acceleration to the seismic action expressed in terms of PGAs:

$${\text{SF}} = \frac{{k_{y} \cdot g}}{{{\text{PGA}}_{s} }}$$
(4)

4.2 The Rapolla’s method

Rapolla et al. (2010, 2012) proposed a heuristic approach based on three factors considered significant for predicting the susceptibility to earthquake-induced landslides: the soil/rock geotechnical behaviour (expressed by means of V s), the slope steepness and the seismic action (PGAs). The first two factors represent the background predisposing conditions, while the seismic action is the triggering factor for landslide motion.

These factors are synthesized by three indices, respectively: the Lithology Index (S a); the Slope Index (S b); and the Seismic Index (S c). Such indices are comprised between 0 and 1. According to the authors, S a was computed as a linear function of the inverse of V s:

$$S_{\text{a}} = \frac{0.2045}{{V_{\text{s}} }} - 0.1363$$
(5)

S a is null for V s > 1.5 km s−1 (compact, non-fractured rocks) and equal to 1 for V s < 0.18 km s−1 (cohesionless or pseudo-coherent clayey materials with high natural moisture content). The Slope Index, S b, according to previous studies by Keefer (1984), Mora and Vahrson (1994), Rodriguez et al. (1999) and Wasowski et al. (2002), is modulated on the basis of two distinct proportionality laws valid for soil and rock slopes, respectively:

$$S_{\text{b}} = 0.04\,\beta \, \left( {\text{soils}} \right)$$
(6)
$$S_{\text{b}} = 0.04\,\beta - 0.6\,\left( {\text{rocks}} \right)$$
(7)

where β = slope (in degrees). On soil slopes, S b grows linearly from 0 up to the limit value of 25°, over which the value of S b is kept constant at the maximum value, whereas, on rock slopes, S b is null for slope angles lower than 15°, and then it grows linearly up to 40° and remains constant at rating 1 for steeper slopes.Finally, the Seismic Index S c is expressed in terms of the local maximum MCS intensity. Also in this case, a linear relation between S c and MCS was used. Considering that in the present work the local maximum MCS is equal to XI, the linear equation for the S c index assumes the following values:

$$S_{\text{c}} = 0.6667({\text{MCS}}) - 0.8333$$
(8)

As suggested by the authors, the MCS intensity was derived from the PGAs (Table 3; Fig. 5b) by means of the formula proposed by Ambraseys (1975):

$${\text{MCS}} = \frac{{(\log_{10} {\text{PGA}}_{\text{s}} + 0.408)}}{0.36}$$
(9)

Finally, the seismically induced landslide Susceptibility Index (SI) is obtained by the following equation:

$${\text{SI}} = \frac{{S_{\text{a}} + S_{\text{b}} }}{2}S_{\text{c}}$$
(10)
Fig. 5
figure 5

Final thematic products: a safety factor (SF) map; b Susceptibility Index (SI) map; and c residuals map from linear SF–SI regression model

5 Results and discussion

The spatial distributions of SF index and SI in the study area were mapped to emphasize the most critical areas, in terms of EILH, according to the methods of Newmark and Rapolla. In order to obtain an effective zonation, the distributions of SF and SI were represented as five equal-area classes (Table 4), displayed with colour shades from red to green indicating maximum and minimum EILH, respectively (Fig. 5a, b). In this way, SF and SI were classified using the same rules and constraints for the identification of the most critical areas, allowing an inter-comparison, even though the two indices are characterized by different units and mathematical meaning. As shown in Fig. 5, a good global visual agreement between the two zonation maps is evident. In particular, in both maps the most susceptible areas (red zones) are concentrated in the northern and central sectors of the main pipeline and in the oil/gas extraction area (Upper Agri Valley).

Table 4 Equal-area zonation limits for SI and SF classifications

In order to have a quantitative comparison, the linear correlation between SF and SI was analysed (Fig. 6). As expected, a significant negative correlation in terms of Pearson’s coefficient was found (r SF,SI = −0.654; p < 0.001, i.e. there is a probability that the sample under analysis comes from a population with r value that is due merely to chance). To clarify further, a linear regression model was tested. SF was assumed as dependent variable and SI as explanatory variable. As a whole, the goodness of fit was not very high (R 2 = 0.428). However, inspecting Fig. 6 it is clear that in correspondence of low SF values in the diagram, the correlation appears weaker and the points look more dispersed. In other words, in spite of a highly significant r SF,SI coefficient, the relation between SF and SI is not linear all over the scatter plot. At the lowest levels of hazard, the range of variation of SF was broader, as evidenced by a ‘flaring’ in the bottom of the graph in Fig. 6. As a consequence, it seemed appropriate to adopt the logarithmic transformation of SF variable. As a matter of fact, a stronger correlation (r SF,SI = −0.718; p < 0.001) between SI and natural logarithm of SF, along with a lesser dispersion, was obtained. In fact, by reapplying a linear regression model after the aforesaid transformation, the goodness of fit was found better than the previous one (R 2 = 0.521). The regression coefficient b SI relative to SI was statistically significant (b SI = −2.048; p < 0.001, meaning that the probability that this value is due to chance and is, therefore, equal to zero in the population, is <1/1000). It represents the rate of change of SF natural logarithm index as a function of changes in the SI.

Fig. 6
figure 6

Linear regression model graph between SF and SI

Another approach to better understand the degree of agreement between the two indices is the analysis of their five-class distributions. Table 5 shows the contingency matrix based on subdivision into quintiles. The Cohen’s kappa coefficient K was estimated on the basis of this matrix (Cohen 1960; Altman 1999). Theoretical K ranges between 0 and 1, with 1 in case of identical classifications, when all the cases are allocated along the diagonal of the contingency table. The estimated global K is rather poor (0.38, see Table 5), but the partial accuracy for the most critical class, marked as very high hazard, increases significantly (~0.70, see last row in Table 5, in which a decomposition of global K is showed). The same evidence was achieved by adopting the weighted K indices. The weighting procedure is usually adopted in order to better reflect the ordinal scale measure of the two indices. It minimizes the classification error near the diagonal of the contingency matrix, by attributing higher penalties to misclassifications moving away from the diagonal. In effect, using linear and quadratic weights, the obtained K values were 0.483 and 0.674, respectively, which stand for moderate and good agreement according to Altman (1999).

Table 5 Contingency matrix for the SI and SF five-class distributions with the estimated Cohen’s K coefficients

Another comparison was carried out by considering the central values of the nominal distributions of the indices (1 and 0.5 for SF and SI, respectively) as a limit between safe and unsafe zones. The above central values might well represent quite different hazard levels and a better comparison should be based on a deeper knowledge of the indices sensitivity to their input parameters. However, as a first approximation such an assumption can still provide preliminary indications of interest. The related areal extents for both indices were compared. In particular, SI is >0.5 in 8.1 % of the study area, while SF is lower than 1 in 25.9 % of the same area. Therefore, both indices mapped the study area as safe in the great majority of its extent, but Newmark’s method seemed to give more conservative results. Possibly, this was due to overly conservative values adopted for particularly uncertain SF input parameters, e.g. the depth H of the sliding surface assumed conservatively equal to 40 m. In any case, the mean values of both indices agreed in indicating an overall safety value (0.241 for SI and 1.347 for SF). A more local evaluation of the results was performed by analysing the distributed residuals arising from the linear regression model (Fig. 5c). The largest disagreements between the SI and SF distributions concentrate mainly in the northern parts of the extraction zone, which are characterized by very heterogeneous lithology or compact rocks, where Newmark’s method, provide extremely conservative outputs due to the previously cited high uncertainty on the input geotechnical parameters values.

6 Conclusions

The GIS spatial modelling approach allowed us to estimate the EILH distribution through SF and SI over the area hosting the most important European onshore gas/oil extraction plant. Two EILH maps of the area of interest were produced on the basis of the models developed by Newmark (1965) and Rapolla et al. (2010, 2012). Although the applied methodologies could not be validated through the comparison of their results with any available in situ data referring to historical earthquake-induced landslides in the study area, both the EILH SF and SI resulting distribution show hazard maxima in correspondence of the areas where landslides are concentrated (e.g. IFFI Project landslide inventory, ISPRA—Servizio Geologico d’Italia 2006) (Fig. 1). The two different approaches were compared for mutual accordance; in particular, even though the basic models are conceptually different, the results (maps and statistical correlation analysis between SF and SI) showed a substantial agreement in highlighting the most critical areas characterized by high levels of hazard. In addition, the first evaluation of the residual distribution obtained from regression models provided some indications about the input parameters reliability linked to the effective performance of the two exploited modelling methods. The slicing of distributions of both indices into five classes of growing EILH through equal-area zonation produced congruent results, according to Cohen’s kappa coefficient K values. These preliminary results demonstrate the capability of both of the utilized GIS-based methods to study the EILH distribution at regional scale (i.e. level 1 zonation).

On the whole, the outcomes from the two implemented models result in satisfying agreement, even if achieved through different conceptual and computational approaches. Only some localized slight disagreements between the two methods were observed, mostly concentrated in the areas characterized by steep rock slopes. Such areas are more difficult to be studied through the Newmark’s method, which is indeed particularly appropriate for analysing soil slopes and requires a more accurate geotechnical characterization, more adequate to local and site-specific scale study (level 2 and level 3 of zonation). On the other hand, Rapolla’s approach appears easier to apply in providing reliable results at the regional level of our study (level 1 of zonation). As a matter of fact, this method is based on just one geotechnical parameter (V s) which may be more easily measured by means of in situ surveys or estimated via linked parameters and often is more representative of the overall ground response to seismic shaking at regional scale. The reliability of results coupled with very limited data demand implies a better cost-effective usage of this heuristic method compared with the previous one, based on the displacement analysis, which in any case provided here comparable outcomes.

Considering that the earthquake-induced (or triggered) landslides constitute one of the most dangerous seismic effects, in this earthquake-prone and geologically vulnerable area and generally it is thus of interest to implement an effective modelling approach of the related hazard (EILH) in the prevention and mitigation perspective (i.e. early warning systems). While in general the statistic (multivariate, logistic, etc.) approach has been followed to spatially assess the EILH, here two more physically based models with different capabilities to better account for regional and local responses were implemented and used. While in general these two models have been widely exploited singularly, here they have been applied both to the same area using the same inputs and this processing schema allowed us to better evaluate their relative intrinsic capabilities and limits by analysing the obtained results. The perspective to include in the present analysis the potential interactions deriving from the impacts of the anthropic extraction activities is one of our future goals.

As a concluding remark, it is worth emphasizing that a first-order assessment of EILH at regional scale can only be a first indicative assessment of the criticalities distribution and that further local investigations should be carried out to provide more representative geological, geomorphological and geotechnical data for proper slope stability modelling and assessment under seismic conditions especially in the field extraction area, more critical also from the geotechnical characterization point of view and seismicity.