1 Introduction

Reliability of the ground motion predicted by empirical models mostly depends on the characteristics of the underlying calibration dataset. In the framework of seismic hazard assessment, the motivation behind compilation of a large strong motion dataset which includes recordings from different regions is twofold: first, to improve the magnitude-distance data distribution, and sampling different source characteristics and site conditions; second, to allow the calibration of models complex enough to describe the main physical processes contributing to the variability of the ground motion. The current practice for computing the seismic hazard is based on an ergodic assumption, where the aleatory variability, i.e. the standard deviation sigma (σ) of ground motion prediction equation (GMPE), includes the regional differences in ground motion. If on one hand the ergodic assumption allows to replace the time sampling of ground shaking generated in a given region by successive earthquakes with a spatial sampling of ground shaking observed across different regions, on the other hand it increases the aleatory variability associated with source, propagation, and site seismic processes. Allowing regional differences in the GMPE ‘partially’ removes this ergodicity by translating the aleatory variability into epistemic uncertainty which, in statistical sense, is the modelling uncertainty in region-specific adjustments.

The collection of data from different regions with similar tectonic features (e.g. shallow crustal active regions, stable continental regions, etc.) was performed in the past under the assumption that the trans-regional and between-country variability of the ground motion was either negligible or otherwise difficult to model due to the limitation in the sampling properties of the compiled datasets (e.g. Douglas 2004a, b). As an example, the NGA-West models (Abrahamson and Silva 2008) were derived from a dataset including recordings from multiple regions (mainly California, Taiwan, Japan) without modelling the regional effects. Later studies on the applicability of the NGA models to Europe (e.g. Stafford et al. 2008) highlighted the general agreement between predicted median values and the observations. The main difference was a faster distance attenuation observed in European data with respect to California; in agreement with previous findings (Douglas 2004a). Moreover a detailed comparison between the NGA models and strong motion data recorded in Italy (Scasserra et al. 2009) confirmed that it was possible to improve the predictive performance of NGA models for Europe by applying regional corrections to the attenuation with distance terms and to the overall scaling parameters (offset and pseudo-depth).

Extension of the NGA database into NGA-West2 (Ancheta et al. 2014) with introduction of several small magnitude events mainly from California, and moderate to large size earthquakes from other regions of the world, promoted the interest in evaluating regional effects in the ground motions. As a consequence, the most recent GMPEs developed from NGA-West2 include correction terms accounting for regional effects. Many authors (e.g. Boore et al. 2014; Chiou and Youngs 2014) introduced regional differences in the anelastic attenuation coefficient and the site term related to depth of basin. Regional differences in the Vs30 scaling were also considered (e.g. Abrahamson et al. 2014), while information available in the dataset is not enough to constrain correction factors for other parameters.

RESORCE strong motion database (Akkar et al. 2014a) was compiled with recordings from different European and Middle-East countries, and was used to derive several GMPEs (Douglas et al. 2014). While these models do not account for regional differences in ground-motion scaling, recent studies highlighted the presence of regional effects either between selected countries (e.g. between Turkey and Iran by Kale et al. 2015), or among different tectonic regions in Europe (Gianniotis et al. 2014). Ignoring the regional differences in ground motion scaling may result in an inflated residual standard deviation, and correction for regional bias in the median ground motion can be a first step towards ‘partially non-ergodic’ region-specific PSHA. With such a goal in mind, this study focuses on identification of systematic regional differences in ground motion scaling in Europe. Following the previous efforts of developing GMPE using RESORCE dataset (Douglas et al. 2014 and reference therein), we derive a new GMPE based on a relatively simple functional form which will still be able to capture the main features of ground motion-scaling (Bindi et al. 2014). However, unlike in previous studies, a non-linear mixed effect regression (NLMER by Bates et al. 2014) approach is applied where the regional differences are estimated as random effects applied to different model parameters. The advantages of using NLMER in place of the traditional random effect algorithm by Abrahamson and Youngs (1992) are discussed by Stafford (2014). For example, group specific adjustments can be estimated for any of the regression coefficients in a statistically correct way making NLMER much more extendable than traditional approaches. We identify the statistically significant random effects and the regional adjustments for relevant parameters are provided as final result.

2 Dataset and selection criteria

The most recent Pan-European GMPEs (Douglas et al. 2014) are based on the RESORCE strong motion dataset (http://www.resorce-portal.eu/). RESORCE extends the previous pan-European strong motion dataset (Ambraseys et al. 2004) with recently compiled Greek, Italian, Swiss and Turkish accelerometric archives (Akkar et al. 2014a). In this study, starting from the 2013 release of RESORCE, we performed a preliminary data selection to exclude the poor quality or unprocessed records, or those records lacking the three components of ground-motion; then, we applied the following criteria to select the input data for regression:

  • Given the recent interest in considering small magnitude earthquakes for assessing the hazard in several regions of Europe (http://projet-sigma.com/ScientificObjectives.html), records from events with moment magnitudes larger than or equal to 4 are considered.

  • Only focal depths shallower than 35 km, and distances (Joyner–Boore, RJB, or epicentral Repi) shorter than 300 km are selected. The epicentral distance, Repi is used to approximate RJB when the latter is unspecified, but only when M ≤ 5 and Repi ≥ 10 km. For larger magnitudes and smaller epicentral distances, records without RJB are disregarded.

  • For each oscillator period T, only those recording filtered with high pass corner frequency (fhp) smaller than or equal to 1/(1.25 T), i.e. fhp ≤0.8 foscillator (Abrahamson and Silva 1997). For example, for T = 1 s (foscillator = 1 Hz), we considered only recordings with fhp smaller than or equal to 0.8 Hz; for T = 4 s (foscillator = 0.25 Hz), we chose fhp ≤ 0.2 Hz. Single recorded earthquakes are not selected.

  • We consider only recordings from sites with known or inferred Vs30.

In RESORCE, the moment magnitude is provided either as directly computed (e.g. from the moment tensor solutions), or converted from other magnitude scales (e.g. local magnitude or surface wave magnitude) using country-based empirical regressions (see Akkar et al. 2014a for details). Earthquakes with Mw derived through empirical regressions are not considered in this study.

Considering the unbalanced composition of the dataset, we categorize the contributing regions into three groups: Italy, Turkey, and Others, where the latter collects data from all the countries contributing to RESORCE with less than 200 selected records. Although a regionalization based on the tectonic settings (e.g. Delavaud et al. 2012) could be more appropriate to explore regional differences in ground motion, we opt for a country-based categorization that reflects the structure followed for data compilation. The filtered dataset is composed of 1251 recordings, with 659 recordings from Turkey (TR), 378 from Italy (IT), 214 in Others group; primarily contributed to by Greece, Montenegro, Iran, and France.

In terms of magnitude range, distance range, and site characterization, the dataset is unbalanced among the regions. Douglas (2007) showed that the predicted median ground motions are not well-constrained away from the centroid of data, especially for sparse datasets. Figure 1 shows the magnitude—distance distribution of recordings in our dataset, categorized according to different regions and soil classes. For example, there are very few recordings from Turkey in site class A (rock with Vs30 > 800 m/s), which means that when a GMPE is derived from the compendium dataset without regional distinction, the estimated site response for class A could be controlled by contributions from Italy and Other regions, even though the class A rock response in Turkey could be significantly different. Similarly, for distances larger than 100 km and empirical site response of class B (stiff soil with 800 m/s > Vs30 ≥ 360 m/s) and Class C (soft soil with 360 m/s > Vs30 ≥ 180 m/s) the predictions could be controlled by strong motion recordings from Turkey. Moreover, preliminary non-parametric analysis (here not shown) suggest that the average slope of distance scaling is different among the regions, hinting for possible regional differences in the distance scaling of high-frequency ground motions, which we could quantify as a regional variation during the GMPE regression. Based on these evidences, in the following we seek for ground motion regional variations related to the scaling with distance and to the site response.

Fig. 1
figure 1

Scatter plot showing the distribution of observed data in Magnitude - Distance ranges for different EC8 site classes for each region Italy (IT), Others, and Turkey (TR). The red markers correspond to events without a computed moment magnitude but only an empirically estimated/converted moment magnitude, and consequently excluded from the regression. Green markers show the final distribution of records that are used for PGA regression

3 Regression approach

Different models were derived from RESORCE dataset performing either a parametric regression (e.g. Akkar et al. 2014b; Bindi et al. 2014) or following non-parametric approaches (e.g. Derras et al. 2012; Hermkes et al. 2014). The parametric regression approaches were applied using the random effects methodology of Abrahamson and Youngs (1992), where the residuals are split into between-event (δBe), and within-event (δWes) residuals. The GMPE functional forms used were relatively simple with respect to those implemented within the NGA-West2 project (e.g. Abrahamson et al. 2014), reflecting the detail of information available in the RESORCE metadata. With the aim of investigating the presence of regional effects in ground motion variability, we also follow a parametric regression approach but using a non-linear mixed effect approach (NLMER, e.g. Bates et al. 2014). Following Bindi et al. (2014), we consider the following functional form:

$$ \text{ln} \left( {GM} \right) = e_{1} + F_{D} \left( {R,M} \right) + F_{M} \left( M \right) + \delta B_{e} + \delta B_{s} + \varepsilon $$
(1)
$$ F_{D} \left( {R,M} \right) = \left[ {c_{1} + c_{2} \left( {M - M_{ref} } \right)} \right]\text{ln} \left( {\frac{{\sqrt {R^{2} + h^{2} } }}{{R_{ref} }}} \right) + \left( {c_{3} + \varDelta c_{3,r} } \right) \left( {\sqrt {R^{2} + h^{2} } - R_{ref} } \right) $$
(2)
$$ F_{M} \left( M \right) = \left\{ {\begin{array}{*{20}l} {b_{1} \left( {M - M_{h} } \right) + b_{2} \left( {M - M_{h} } \right)^{2} } \hfill & {{\text{for M}} < M_{h} ,\;\;{\text{where M}}_{\text{h}} = \, 6.75} \hfill \\ {b_{3} \left( {M - M_{h} } \right)} \hfill & {{\text{for}}\;{\text{M}} \ge {\text{M}}_{\text{h}} } \hfill \\ \end{array} } \right. $$
(3)

In Eq. (1), e1 is the global off-set parameter; FD, and FM are the distance and magnitude scaling components as defined in Eqs. (2) and (3), respectively; δBe and δBs are random effects on e1 describing the between-event and between-station variability, respectively (Stafford 2014; Al Atik et al. 2010); ε is the residual distribution accounting for the aleatory variability. In the following, the standard deviation of the between-event and residual distributions are indicated with the symbols τ and ∅0, respectively. The hinge magnitude Mh is fixed at 6.75 and the parameter b3, which controls the saturation with magnitude, is not constrained to be positive (i.e. the over-saturation at magnitudes >6.75 is allowed). As in Bindi et al. (2014), the reference moment magnitude Mref and reference Joyner-Boore distance Rref are set at M5.5 and 1 km, respectively.

The major contributor to ‘Others’ group in terms of recordings is Greece (137), followed by Montenegro (35), and Iran (20). We performed several preliminary regressions considering different number of geographical categories, including attempts of isolating the Greek recordings from Others. In order to get reliable regional adjustments for the anelastic attenuation, a minimum number of recording per category (i.e. per country) was needed. Since the adjustment factor for Greece, once isolated from Others, was not significantly different from zero at 95 % CI, we kept the Greece recordings inside the Others category.

It is worth noting that we only introduced a regional adjustment factor for the apparent anelastic attenuation coefficient (i.e. c3 in Eq. 1), but the magnitude scaling component (FM in Eq. 1) is constrained by the data from all regions. When asked for a random-effect on a regression parameter (e.g. regional adjustment to c3 in Eq. 1) for each level in the group (levels being Italy, Turkey, and Others), the NLME algorithm estimates scalar additive adjustments which follow a standard-normal distribution. Therefore, the GMPE regression-coefficient c3 without any regional-adjustments (i.e. without adding ∆c3,r to c3), is a generic anelastic attenuation coefficient without a regional bias.

3.1 Regional variability in apparent anelastic attenuation term

In Eq. (2), we introduce a country-based random effect ∆c3,r on parameter c3, where r represents the three selected regions, i.e. r = IT, Others, TR. Coefficients c1, c2 and c3 in the scaling with distance FD, correspond to the geometrical spreading, magnitude-dependent geometrical spreading, and apparent anelastic attenuation, respectively, although these names should be strictly used only for a model based on Fourier spectral amplitudes. Coefficient c3 is constrained to being less than or equal to 0 for all spectral periods to disallow oversaturation at longer distances as in Bindi et al. (2014). Preliminary trials showed that for long periods (>1 s), c3 is taking a positive value and has a large negative correlation with c1, and a positive correlation with e1. Since a Student’s t test confirms that it is anyway losing its significance, c3 and the associated regional variations are fixed at zero for periods longer than 1 s.

3.2 Style of faulting terms

Dependence of the median ground motion on style of faulting (SoF) is generally accounted through a period-dependent SoF specific adjustment to the median. Trial regressions including SoF adjustment factors on the offset showed that the estimates were not well constrained and had large standard errors. In RESORCE the distribution of recorded focal mechanisms among different regions is strongly unbalanced since in Italy most of the events are normal and very few strike-slip events, unlike in Turkey. Moreover, reverse faulting events are very few in the dataset. Considering that the odd distribution of SoF among the regions could result in a trade-off with the regional random effects on the offset, and also based on a preliminary non-parametric analysis of the dataset that showed no clearly distinguishable differences among the distance scaling of ground motion between different SoF, we chose to drop the SoF term from the functional form.

3.3 Regional variability in site-response as a function of Vs30

In the model described by Eq. (1), site effects are captured by the between-station terms, which account for the systematic station-specific deviation in offset with respect to the generic prediction for the population. Figure 2 shows that δBs scales with Vs30 indicating that Vs30 is a first order proxy for describing site response. Large scatter around the best fit model suggests that a combination with other proxies is needed to better capture the complexity of site response (e.g. Cadet et al. 2008; Luzi et al. 2011). Besides the clear region-dependent scaling with Vs30, Fig. 2 suggests that the distributions of velocities for three regions are compatible with the assumption of a region-dependent reference velocity (i.e. the value of Vs30 corresponding to zero-crossing of δBs). Hence, we perform a further mixed-effect regression considering the following model:

Fig. 2
figure 2

Between station residuals (PGA in left panel, SA (2 s) in right panel) plotted against Vs30 (m/s) with stations separated into regions. The blue line is a regression fit of residuals as a function ln(Vs30). The grey ribbon shows the standard error on regression fit. Difference in slope of the regression fit shows regional difference in linear site-amplification (g2), difference in x-intercept shows the regional difference in reference Vs30 (Vref)

$$ \delta B_{s} = \left( {g_{1} + \varDelta g_{1,r} } \right) + \left( {g_{2} + \varDelta g_{2,r} } \right) \ln \left( {V_{s30} } \right) + \delta_{S2S} $$
(4)

Regional effects on site term are captured by the random effects ∆g1,r on offset g1, and ∆g2,r on the slope with Vs30. In Eq. (4), δS2S represents the systematic deviation of recordings for individual station with respect to the model accounting also for the scaling with Vs30. The standard deviation of δS2S is the between-station variability (∅S2S) in the GMPE. It is worth noting that non-linear site amplification effects are not considered in the present study. Moreover, since the attenuation of high frequency ground motion can be a result of both anelastic attenuation and site effects, it is worth checking for a possible correlation (or a trade-off) between the parameters c3 and g2, as well as between the estimated regional variations ∆c3,r and ∆g2,r. The results (here not shown) do not highlight any significant correlation among these parameters.

4 Results

The presence of ground motion regional variations in RESORCE dataset are modelled by allowing the site response component and the decay of ground motion with distance to be region specific. The fixed and random effects parameters relevant to regression (1) and (4) are listed in Tables 1 and 2. At each period, the mixed effect regression provides both the global c3 value and the estimated deviation ∆c3,r for each region (r), computed as random effect on c3 in a region group. Figure 3 shows the random effects at different periods along with the associated 95 % CI, the standard error (grey ribbon). Regional variations in c3 are shown only until spectral period of 1 s beyond which, along with c3, they are constrained to zero. The apparent anelastic attenuation is higher for Italy than for Turkey or Others regions; a trend similar to that observed by Boore et al. (2014) in their within-event residuals which showed a faster distance-decay in Italy (and Japan) compared to Turkey (and China). The physical interpretation of the differences between the attenuation in Italy and Turkey is beyond the aim of our paper. A comparison of results available in literature for those physical properties that can influence the anelastic attenuation (e.g. velocity and attenuation topographic maps; heat flow distribution; etc.) is not straightforward because of the different implemented methodologies, the different investigated spatial scales, and the different data analyzed. In any case the standard errors on ∆c3,r are small enough to indicate that the regional corrections at short periods are statistically significant. These standard errors represent the modelling uncertainty of regional adjustments to anelastic attenuation component and can be handled through ground motion logic trees. The ∆c3,r random effects for different periods are listed in Table 1 along with their standard errors.

Table 1 Coefficient table for GMPE
Table 2 Coefficient table for Vs30 based site response
Fig. 3
figure 3

∆c3 for the three regions across different spectral time periods. Beyond spectral period of 1 s, c3 in the regression is constrained to 0 with no regional variations. Grey-ribbon shows the 95 % CI about the median

Regarding the site response term, by allowing the offset g1 to vary among regions, we can account for regional differences in the reference Vs30 while with g2 we quantify the regional differences in scaling with Vs30. Figure 4 shows the random effects ∆g1,r and ∆g2,r for different periods, along with the estimation errors. A larger value of g1 (and a smaller g2) indicates a smaller reference Vs30 for that region according to Eq. (4). Figure 1 showed that the largest fraction of recordings from Turkey come from EC8 soil class B and C stations compared to Italy and Others groups where the stations are more evenly distributed across soil classes. This means the ‘centroid’ Vs30 (modal Vs30 value) of the data is lower for Turkey as indicated by the higher positive ∆g1 value for Turkey in Fig. 4. Also seen in Fig. 2 is the stronger scaling with Vs30 for Turkey indicated by a larger negative value for g2 in Fig. 4. It is worth noting that by allowing regional variations in these two components of GMPE we move a fraction of the aleatory variability into epistemic uncertainty, quantified through the standard error on ∆c3,r, ∆g1,r and ∆g2,r. These standard errors can be reduced by collecting more ground motion data from the regions.

Fig. 4
figure 4

Random effects on g1 and g2 along with their standard errors. ∆g1 and ∆g2 are estimated as a correlated-random effects. Grey-ribbon shows the 95 % CI about the median

5 Discussion

In the previous sections, we derived a GMPE from the European-Middle-East dataset (RESORCE), including regional (i.e. country-based) adjustments. Following recent studies, we introduced corrections for the ground motion decay and for the scaling with Vs30 in terms of random effects.

5.1 Region dependent distance scaling and Vs30 based site response

The regionalization of distance attenuation has been described by a region-dependent apparent anelastic attenuation model (Fig. 3). As also observed by Chiou (2012), the geometric spreading (term dependent on logarithm of distance) and of the anelastic (term dependent on distance) contribution to the attenuation show a high degree of correlation. Studies dealing with the parametrization of Fourier amplitude decay with distance in terms of geometrical spreading and anelastic attenuation shows that the trade-off between these two terms cannot be resolved using only the spectral amplitude information (e.g. Oth et al. 2011; McNamara et al. 2014). Although the model for Fourier spectral amplitude is not strictly applicable to response spectra (Bora et al. 2014), a similar situation arises with the GMPE, where the period-dependent terms controlling the linear decay with distance (i.e. c1 and the magnitude correction c2) are in trade-off with c3, controlling the decay with the logarithm of distance. Since different wave types (body waves and surface waves) and phases (direct waves and reflect waves as SmS) contribute to the attenuation with distance over different distance and period ranges, the geometrical terms could be affected by regional bias related, for example, to differences in focal depths and crustal thickness (Cotton et al. 2006; Douglas 2007). Therefore, we tested a model including a correlated regional variation on the parameters controlling the distance scaling (c1, c2 and c3), or considering combination of them (e.g. c1 and c3). Statistical tests using ANOVA (R Core Team 2013; Chambers and Hastie 1991) do not show appreciable improvements in prediction power of GMPE (e.g. comparing the Akaike Information Criterion values, performing significance tests, or analyzing the residual distributions). The estimated regional variations in anelastic attenuation (∆c3,r) are similar to the ones in the simpler model discussed in previous section, and the random effects on c2 (∆c2,r) either have 0 values at high frequencies or large standard errors (encompassing 0) at low frequencies, which makes it not a well constrained regression parameter. We finally preferred not to include regional variations in c1 and c2 in our model.

By considering region specific reference Vs30, we observed remarkable differences in the site term scaling with Vs30. In particular, Fig. 2 shows that the slope of the between station random effects with Vs30 is larger in Turkey than in the other two regions, both at short and long periods. Regional effects in the site term were already recognized in the NGA-West2 models. For example, Abrahamson et al. (2014) included regional corrections in the Vs30 scaling for Taiwan, Japan and China with respect to California. As discussed in Boore et al. (2014), the observed regional variability of the site effects can be a consequence of using a simplified proxy (i.e. Vs30) to capture the site amplification which in fact depends on many other factors, such as the soil depth. Previous studies showed that regional differences in the depths of typical soil profiles lead amplification functions with peaks occurring at different period also for site sharing similar Vs30 (e.g. Atkinson and Casey 2003; Ghofrani et al. 2013), as observed when comparing sites in Japan with those in California. Previous work (e.g. Boore et al. 2011) showed that the correlation of Vs30 with the shear-wave velocity at different depths (either shallower or deeper than 30 m) is regional dependent. In particular, Boore et al. (2011) suggested that the differences in the correlation observed for Japan with respect to California, or Europe, could be ascribed to differences in the selection of the strong motion sites, since Japanese stations are mostly installed on stiff or rock material. Similar considerations could be applied also to discuss the differences observed for Italy and Turkey. Anyway, without any detailed analysis of the velocity profiles for the analysed stations, any conclusion would be speculative and we left this investigation for future studies.

Finally, in the NGA-West2 models the soil depth effect is considered through ∆Z1.0 (depth of basin to rock with Vs30 of 1000 m/s), and a regionalization for this term is also considered. The site information included in RESORCE does not allow including soil depth in the model for site effects.

5.2 Impact of the regionalization on the median predictions

The impact of regional adjustments on distance scaling (Fig. 5), and magnitude scaling (Fig. 6) obtained with and without allowing the regional corrections in the regressions are compared. Included in these figures is an ‘Initial’ model which is a GMPE without any regional variations with functional form as in Eq. (5). Note that in Eq. (1) the regional variability in site-response is left to be examined using Eq. (4), while for the ‘Initial’ GMPE without regional variability a generic site response term \( g*\text{ln} \left( {V_{s30} } \right) \) is included in the median (Eq. 5).

Fig. 5
figure 5

Distance scaling for PGA (left panel) and SA (2 s) (right panel) at site with Vs30 = 450 m/s (above panels), and Vs30 = 800 m/s, for M5 and M7. Comparison of distance scaling with GMPE accounting regional variations in anelastic attenuation (slope of the curves) and Vs30 scaling (offset of the curves), against the ‘Initial’ GMPE obtained from regression without accounting regional variations

Fig. 6
figure 6

Magnitude scaling for PGA (left panel) and SA (2) (right panel) at site with Vs30 = 450 m/s (above panels), and Vs30 = 800 m/s, for Joyner – Boore distances 20 km, and 100 km. Comparison of magnitude scaling with GMPE accounting regional variations in anelastic attenuation and Vs30 scaling (offset of the curves), against the ‘Initial’ GMPE obtained from regression without accounting regional variations

In the left panel of Fig. 5, regional differences in high frequency ground motion are observed as difference among the offset of the curves, and in slope of the curve at distances greater than 50 km, which are a combination of Vs30 scaling and anelastic attenuation effects. In the right panel however, which is for lower frequency ground motion, the differences are solely due to variations in Vs30 scaling, more pronounced for rock sites (800 m/s). Similarly, in magnitude scaling (Fig. 6) the differences in offset of the curves are a combination of regional variations in anelastic attenuation and Vs30 scaling.

$$ \text{ln} \left( {GM} \right) = e_{1} + F_{D} \left( {R,M} \right) + F_{M} \left( M \right) + g*\text{ln} \left( {V_{s30} } \right) + \delta B_{e} + \delta B_{S2S} + \varepsilon $$
(5)

Cumulative effect of all regional adjustments across spectral periods is shown in the response spectra (Fig. 7). For a site with Vs30 of 450 m/s at distance 10 km, regional variations in anelastic attenuation and site response are negligible at all spectral periods. On the other end is a site with Vs30 of 800 m/s located 100 km from the seismic source; in this case both anelastic attenuation and site response terms are significantly different across the regions. At the same site, for spectral periods larger than 1 s the regional differences are solely contributed to by differences in site response. The two intermediate scenarios, Vs30 450 m/s at distance 100 km, and Vs30 800 m/s at distance 10 km show effect of regional differences in anelastic attenuation, and site response scaling with Vs30 respectively. For example, at a rock site (800 m/s) located 25 km from a rupture of magnitude M6.5 the predicted ground motion at spectral frequency of 3 Hz is 1.51 g in Italy, 1.47 g in Turkey, and 1.96 g in Others region. The differences in predicted ground motion are significant across regions after correcting the GMPE median for regional bias.

Fig. 7
figure 7

Response spectra showing the cumulative effect of regional adjustments to the GMPE. Most significant differences are observed for rock sites (Vs30 = 800 m/s) at distances larger than 50 km

5.3 Impact of regionalization on the model uncertainty

Introducing regional differences reduced the aleatory variability at the cost of an increased epistemic uncertainty in GMPE. The increase in modelling uncertainty is captured by standard errors on regional adjustments, while the reduction of variability is captured by decrease in standard deviation of GMPE given by Eq. (6)

$$ \sigma = \sqrt {\tau^{2} + \varnothing_{s2s}^{2} + \varnothing_{0}^{2} } $$
(6)

Figure 8 shows the comparison of standard deviations between the model with and without regional variations ‘Initial’. There is a 5–10 % reduction in the total standard deviation (σ) by introducing regional variations, primarily from the reduction of between-station variability (∅s2s) by 13–20 %. Reduction in residual (∅0) standard deviation is small (<2 %). There is no noticeable change in between-event standard deviation (τ).

Fig. 8
figure 8

Comparison of individual components of aleatory variability in GMPE (standard deviations) between the model with regional variations (solid line) and without regional variations ‘Initial’ (dashed lines)

Improvement in median prediction of the GMPE by correcting regional bias with regional adjustments is quantified in terms of Akaike Information Criterion (AIC), which is a measure of the relative quality of a statistical model for a given set of data penalized by the number of model parameters. Introducing the regional variations in this case increases the number of regression parameters by 3, yet a smaller AIC value of the model with regional variations justifies its increased complexity.

5.4 Potential regional differences in magnitude scale

The between-event residual δBe can be used to evaluate the impact of considering earthquakes with converted moment magnitude from other magnitude scales (local magnitude, surface-wave magnitude, and body-wave magnitude). In Fig. 1, the recordings relevant to these earthquakes are shown in red and mainly correspond to magnitude smaller than 5 in Turkey. Figure 9 is a box-plot of δBe at Sa(1 s) for each country in the regressed dataset. The scatter in δBe from considering events with both computed (from moment tensor solutions) and empirically estimated Mw is larger than that when considering only those with computed Mw (refer to Akkar et al. 2014a for details on empirical estimation of Mw). In Fig. 9 this reduction in scatter can be seen as a shift of the country-wise median towards 0, from left to right panel. Within-country scatter shown as the height of the box-plot has also reduced, especially in case of Turkey. Filtering out events with empirically estimated Mw reduced the between-event standard deviation of the GMPE (τ) by an average of 10 % (and a maximum of 30 %) across the periods, without losing constrain on other regression parameters (i.e. increase in standard error of estimate of coefficients). We note that this filter primarily removes small magnitude events from Turkey (less than M5), which could also be the reason for decrease in τ. A further study could be focused on examining the regional differences in moment tensor solutions based computed Mw which, once homogenized, may allow analyzing other regional differences in source physical parameters.

Fig. 9
figure 9

Regional variation of between event residuals at SA (1 s). Box-plot the median (50th) and the quartiles (5th, 25th, 75th and 95th). The left panel shows residuals from all the events whose Mw is either computed (as calculated from moment tensor solutions) or empirically estimated (for details refer to Akkar et al. 2014a). Right panel shows the residuals from only the events whose Mw is computed, and not empirically estimated/converted. The decrease in height of the box plots reflects a decrease in between event variability within and across regions

6 Conclusions

RESORCE database and the Non-linear mixed effects regression tools allowed analyzing and quantifying regional variations in ground motion data for Europe–Middle-East regions. The GMPE is developed specifically for active crustal earthquakes in Europe–Middle-East regions, and we do not recommend using it elsewhere without a prior compatibility check. The dataset is strongly unbalanced across the contributing regions in terms of magnitude, distance and recording station site classification. If separate GMPEs were to be developed for each of the regions, then the applicability of each GMPE would be strongly limited in magnitude, distance, and site Vs30 range. By allowing regional variability only on specific terms (anelastic attenuation and site response), and estimating all the regression coefficients (magnitude scaling, geometric spreading) using the entire dataset we overcome this limitation. In its current form, the GMPE is recommended to be used for following scenarios:

  • Active crustal earthquakes magnitude range from 4 to 7.6: since the magnitude distribution is symmetric around the median magnitude of M5.5, neither the small nor the large events are likely to bias the prediction.

  • Sites with Vs30 from 180 to 1000 m/s : Even though the range of Vs30 used in regression is 90 m/s–2000 m/s, the bulk of data is within 200–600 m/s. We suggest using the GMPE in a range narrower than its underlying dataset, and especially not to extrapolate beyond the suggested Vs30 limits.

  • Joyner–Boore (RJB) distances up to 200 km: the GMPE is calibrated with data up to 300 km with the bulk of data from within 150 km.

  • Partially non-ergodic region specific seismic hazard assessment by adjusting the GMPE median (Table 1) and linear site-amplification model (Table 2) with the provided regional adjustments. The reported standard errors are estimated as square-root of conditional variances estimated by the Markov Chain Monte Carlo bootstrap method available in LME4.0 package in R (Bates et al. 2014). These values can also be used as epistemic uncertainty on the regional adjustments. Since the underlying distribution is not known, the epistemic uncertainty can be assumed to be normally distributed and modeled using a three-point distribution that maintains the mean and the standard deviation of the original distribution. Under such an assumption the upper and lower limits on regional adjustments can be set as ±1.6 times the standard error, with logic tree weights 0.2, 0.6 and 0.2 for upper, middle and lower branches respectively.

At the moment statistically significant regional variations in apparent anelastic attenuation, and Vs30 scaled linear site response could be captured and accounted in the new GMPE; thereby correcting median for regional bias and deflating the total variability by 5–10 % depending on the spectral period. Regional differences in distance scaling found in this study are in agreement with recently published studies. The largest reduction in GMPE standard deviation comes from allowing regional variations in the site response component. This variability could be further reduced by using a combination of site-response proxies, instead of Vs30 alone. Another large reduction in standard deviation comes from using only the events with moment tensor solutions based moment magnitude in regression, at the cost of losing many small magnitude events. It is desirable to plug such data losses by homogenizing the magnitude scale across regions. In summary, a decrease in aleatory variability of ground motion prediction as demonstrated in this study is accompanied by a new epistemic uncertainty on estimated regional adjustments, which in turn may only be reduced by improving the underlying datasets.