1 Introduction

A reliable estimation of ground motions that can be produced by future earthquakes is an essential element of any seismic hazard analysis study. This is achieved by employing empirically derived ground motion models (GMMs) that provide a conditional distribution of expected ground motion as a function of magnitude, distance, and site condition. Often, probabilistic seismic hazard analysis (PSHA) and probabilistic seismic risk analysis (PSRA) consider elastic response spectral ordinates as the primary ground motion intensity measures (GMIMs) and the GMIMs are conventionally used as a single ground-motion measure. However, it is widely acknowledged that a single measure of GMIM may not fully capture all the aspects of the ground shaking that are of engineering interest (Shome et al. 1998; Luco and Cornell 2007). A vector of GMIMs is widely applied in several applications such as probabilistic seismic risk analysis and multi-GMIM based ground-motion selections (Bradley 2011a; Du et al. 2020). Also, for vector-valued probabilistic seismic hazard and risk analysis (Baker 2007; Huang and Zhang 2021) a vector of GMIMs is recommended.

In this context, Arias intensity (IA), cumulative absolute velocity (CAV) and significant duration (SD) are often considered as additional GMIMs that characterize various aspects of strong ground motions that are of engineering importance. Thus, recently, various studies have proposed empirical models for significant durations (Kempton and Stewart 2006; Bommer et al. 2009; Yaghmaei-Sabegh et al. 2014; Afshari and Stewart 2016; Sandikkaya and Akkar 2017; Du and Wang 2017; Baharampouri et al. 2020; Meimandi-Parizi et al. 2020). The ground-motion models for durations are also used to obtain response spectral ordinates along with empirical Fourier models (Vanmarcke and Lai 1980; Reinoso et al. 1990; Jaimes et al. 2006; Bora et al. 2014). Moreover, ground-motion duration is an essential element of stochastic simulations (Boore 2003; Boore and Thompson 2014; Kolli and Bora 2021). When compared to the GMMs of elastic response spectrum, the published ground-motion models for cumulative measures of ground-motion IA and CAV are rather limited (Douglas 2012). Most of the existing IA and CAV GMMs have been developed for active crustal tectonic regions (Campbell and Bozorgnia 2010, 2012, 2019; Du and Wang 2012; Foulser-Piggott and Goda 2015; Macedo et al. 2020). Since CAV models have a lower standard deviation as compared to the other GMIMs, they can be implemented to wider range of engineering applications (Campbell and Bozorgnia 2012). Recent studies (Stafford et al. 2009; Campbell and Bozorgnia 2010, 2012; Lee et al. 2012; Bustos and Stafford 2012; Foulser-Piggott and Stafford 2012; Du and Wang 2012; Du and Wang 2013; Foulser-Piggott and Goda 2015; Abrahamson et al. 2016; Sandikkaya and Akkar 2017; Campbell and Bozorgnia 2019; Macedo et al. 2019; Macedo et al. 2020; Bahamrampouri et al. 2020; Farhadi and Pezeshk 2020) have proposed empirical models for cumulative measures of ground motion, IA and CAV.

The generalized conditional intensity measures approach of Bradley (2010) provides a probabilistic framework in which a combination of GMIMs can be considered in ground-motion selection. A key component of this probabilistic framework is the availability of the correlation between various GMIMs. Many correlation models have been developed, including the correlations for spectral accelerations at different vibration periods (Goda 2011; Cimellaro 2013), spectral accelerations and spectral intensities (Bradley 2011a), significant duration and cumulative GMIMs (Bradley 2011b), peak ground velocity (PGV) and spectrum-based GMIMs (Bradley 2012), Arias intensity (IA) and amplitude, duration, and cumulative GMIMs (Bradley 2015; Du 2019), spectral input energy at different vibration periods (Cheng et al. 2020) and empirical correlations of spectral input energy with peak amplitude, cumulative, and duration intensity measures (Cheng et al. 2022).

In the context of Iran, researchers have mainly focused on developing GMMs for predicting amplitude-based and spectrum-based GMIMs, such as PGA and spectral pseudo-acceleration (PSA) (Sedaghati and Pezeshk 2017; Zafarani et al. 2018; Darzi et al. 2019; Farajpour et al. 2019). These GMIMs (PGA and PSA based) describe ground motion amplitude and frequency content but fail to capture the cumulative effect of ground motion duration and intensity (Farhadi and Pezeshk 2020). Yaghmaei-Sabegh et al. (2014) and Meimandi-Parizi et al. (2020) duration models (D5–75 and D5–95) are calibrated on earlier versions of Iranian strong motion database. As mentioned earlier, cumulative-based GMIMs, in particular, IA and CAV, could be used in line with amplitude-based and spectrum-based GMIMs to characterize strong ground motion fully for engineering applications. To the best of our knowledge, there are no studies available in literature that are focussed on developing models for cumulative intensity measures such as IA and CAV in addition to their correlations with other GMIMs. This warrants the need for developing empirical models for duration-related and cumulative-based GMIMs such as IA, CAV, and significant durations, importantly, to quantify and understand the associated correlation between various GMIMs.

In the present study, we develop empirical models for Arias Intensity (IA), Cumulative Absolute Velocity (CAV), and significant strong-motion duration (D5–75, D5–95) calibrated on Iranian strong motion database. The major objective of this study is to investigate correlation between different pairs of GMIMs in addition to investigating the variability in these GMIMs for the selected dataset. For that purpose, a common functional form is deemed to be appropriate for all the four GMIMs which is further verified by residuals variation and comparison (of median predictions) with other regional and global models. The correlation between different pairs of GMIMs is presented by computing residual correlations (Bradley 2011b, 2015) separately for between-event and within-event components which were subsequently combined to present the correlations of total residuals. This article is organized as follows: (1) definitions of the GMIMs used in this study, (2) summary of the strong ground-motion dataset, (3) calibration of empirical models for significant durations (D5–75, D5–95), IA and CAV, and (4) investigation of residual correlations.

2 IA, CAV and significant durations

Arias Intensity (IA) is a duration related GMIM that represents the total earthquake input energy per unit weight for a set of un-damped elastic oscillators with frequencies uniformly distributed from zero to infinity (Arias 1970).

$${I}_{A}=\frac{\pi }{2 g}{\int }_{0}^{T}{\left(a\left(t\right)\right)}^{2}dt$$
(1)

where a(t) is the ground acceleration in m/s2, g is the gravitational acceleration, T is the total duration of the ground motion recording and IA is the Arias Intensity in m/s. IA combines amplitude and duration and has been considered as a useful indicator of damage potential for earth dams in seismic analysis (Travasarou et al. 2003). Several studies have found that IA correlates well with distributions of earthquake-induced landslides (Harp and Wilson 1995; Keefer 2002) and building damage (Cabanas et al. 1997).

Cumulative Absolute Velocity (CAV) is a scalar GMIM, and it is originally defined as the total area under the absolute ground acceleration (a(t)) with total duration:

$$CAV={\int }_{0}^{T}\left|a\left(t\right)\right|dt$$
(2)

where a(t) is the acceleration at time t (in unit of m/s2), T is the duration of time series. One can draw a more intuitive interpretation of CAV as the product of total absolute acceleration and duration of the acceleration trace. The term velocity in CAV essentially reflects integration of acceleration with respect to time. Several studies used CAV for: distinguishing between potentially damaging ground motions in the high frequency range and non-damaging ground motion (EPRI 1998), formulate the generation of excessive pore water pressure for potentially liquefiable soils, peak displacements of pile foundations under seismic loading (Kramer and Mitchel 2006; Bradley et al. 2009), soil structure interaction problem (Macedo 2017; Bray and Macedo 2017; Bullock et al. 2019a; b; Azizi et al. 2022; Kashani et al. 2022), structural analyses and time-history selection (Muin and Mosalam 2017; Danciu and Tselentis 2007; Tarbali et al. 2019), seismic risk analysis of critical facilities and, in the context of earthquake early warning system (Fahjah et al. 2011). After EPRI (1998) study, CAV was used as a scalar intensity measure that combines both the amplitude and duration. Usually, IA and CAV are considered as crucial GMIMs, for engineering applications, because they reflect more than one key aspect of the strong ground motion at the same time, and the key advantage of them over peak response parameters is evident from their mathematical expressions in Eqs. (1) and (2).

There is no consensus on the definition of strong ground-motion duration in literature. Often, a particular definition is driven by application. From engineering application perspective, a detailed review of strong ground-motion duration definitions and on their measurement methods can be found in Boomer and Martinez-Pereir (1999). In this study, we use significant duration definitions as the measure of strong ground-motion duration. It uses husid plots to compute the time elapsed between different levels of cumulative IA (Arias 1970). D5–75 and D5–95 considers the time interval between 5 and 75%, and 5–95% of the normalized cumulative IA. D5–75 is intended to capture the energy from the body waves, whereas the D5–95 includes a large portion of the full waveform (Meimandi-Parizi et al. 2020) that often includes low frequency surface waves and coda (Bommer et al. 2009). Figure 1 demonstrates the computation of significant durations (D5–75 and D5–95) from an acceleration time history of the 2019/11/07 earthquake used in this study.

Fig. 1
figure 1

Plots showing the significant duration from the acceleration time series (H1 component) of the 2019/11/07 earthquake with Mw 5.9 recorded at Basmanj station. Upper panels (a and b) depict the selected significant duration based on the time interval between (a) the 5% and 75% and (b) the 5% and 95% levels of the normalized Arias intensity (Arias 1970) (bottom panels c and d)

It is worth mentioning that in core seismological studies such as in numerical simulations of realistic waveforms, the duration is related with the slip rise-time at a particular point and for larger events it also includes effect of finiteness of the fault through rupture velocity (Boore 2003).

In engineering applications, various studies have highlighted the importance of strong ground-motion duration (e.g., Rauch and Martin 2000; Chai et al. 1998; Tiwari and Gupta 2000; Krawinkler et al. 1983; Seed and Idriss 1982; Hancock and Bommer 2006). It has been shown that significant durations (D5–75 and D5–95) correlate well with structural damage and can be used in both geotechnical earthquake engineering as well as in record selection (Boomer et al. 2009; Bradley 2011b; Lee and Green 2014). Also, significant durations have been used for structural damage assessment (Chai et al. 1998; Bommer et al. 2004; Iervolino et al. 2006; Ruiz-Garcia 2010; Chandra Mohan et al. 2016). Tremblay (1998) as well as Hou and Qu (2015) consider strong motion duration for generating realistic stochastic simulations. Recently, Kolli and Bora (2021) have investigated the use of significant durations (D5–75 and D5–95) for generating response spectra using empirical models for Fourier amplitude spectrum (FAS).

3 Dataset and processing of strong motion records

We used 1749 three-component acceleration time histories from 566 events that were recorded by 338 stations from 1976 to 2020. The data is taken from the Building and Housing Research Center (BHRC) of Iran. In the Iranian plateau, collection of strong motion data often faces challenges particularly with regard to metadata information of the records such as site-characterization of strong motion station sites, focal mechanism and event-depths for large events. The raw acceleration traces obtained from BHRC website were fully processed by us using standard signal processing protocols suggested by Boore and Akkar (2003), Boore (2005), Boore and Bommer (2005). The major processing steps we followed for processing of strong motion records are summarized in what follows.

We rotated longitudinal and transverse components to obtain the two orthogonal horizontal components North-South (H1) and East-West (H2) components. We also performed the base line corrections that includes removing the mean and de-trending any linear trend in the raw acceleration records.

The orientation-independent component is calculated using the scheme suggested by Boore (2010), and the two as-recorded orthogonal components (H1 and H2) are combined into a single time series and is rotated between orientations of 0° and 180° (Boore 2010) as follows:

$${\text{H}}_{{{\text{ROT}}}} \left( {{\text{t}};\uptheta } \right) = {\text{H}}_{1} \left( {{\text{t}}){\text{cos}}(\theta } \right) + {\text{H}}_{2} \left( {t){\text{sin}}(\theta } \right)$$
(3)

This procedure is for finding RotD50 (the 50th percentile of the rotated orientation-independent) PSA. Similarly, an orientation-independent RotD50 values for D5–75, D5–95, IA and CAV (from two orthogonal components H1 and H2) are computed in this study.

The final processed dataset consists of recordings made from earthquakes in the magnitude range Mw 3–7.5 at distances from 10 up to 200 km. We only used crustal events with hypocentral depth up to 50 km. The magnitude-distance distribution of events is shown in Fig. 2a. In the compiled dataset we have considered stations which have recorded at least three events, and events were recorded by at least three stations. Figures 2b and c show histograms of the data in different magnitude and distance bins, respectively. Figure 2d shows distribution of the measured VS30 values in the selected dataset.

In our dataset, the event magnitude scale is not homogenous and some of the events are reported with Nuttli magnitude scales (MN), local magnitude scale (ML), and body-wave magnitude scale (mb). In order to have a uniform magnitude scale, we converted different magnitude types to a uniform Mw scale. The magnitude scale conversion is performed using the empirical relationships developed by Mousavi and Babaie Mahani (2020) for the Iranian Plateau (for more details, the reader is referred to Davatgari-Tafreshi et al. 2021 and Davatgari-Tafreshi et al. 2022).

Fig. 2
figure 2

a Magnitude-distance distribution, b Magnitude (Mw) histograms, c distance (Rhypo) histograms, and d VS30 histogram for the selected database. Vertical dashed lines depict the limits of National Earthquake Hazard Reduction Program (NEHRP) site classes (E: Soft soil; D: Stiff soil; C: Very dense soil and soft rock; B: Rock; A: Hard rock)

4 Functional form and regression analysis

In this section, we derive empirical GMMs for IA, CAV and significant durations (D5–75, D5–95) calibrated on the Iranian strong motion database. We adopt a common functional form for all the four GMIMs to facilitate easy comparison between the model predictions and associated coefficients. However, the final judgement on the adopted functional-form was made by investigating residual variation and associated aleatory uncertainty. The functional dependence over predictor variables such as magnitude, source-to-site distance (hypocentral distance) and site VS30 involves similar functional dependence to that of empirical models for peak-amplitudes and spectral ordinates (e.g., PSA). The primary motive of choosing such a functional dependence was to facilitate easy interpretation of regressed coefficients with respect to their counterparts in response spectral and Fourier domain. However, as noted earlier, IA and CAV are expected to capture combined effect of both amplitude and duration information from a recorded trace. Similarly, the significant duration measures used in the current study capture source duration (i.e., size of the event) and propagation effects related to dispersion and scattering of seismic waves. In order to select a functional form for all models, many preliminary regression and visual checks were performed. The final functional form considered as:

$$Y={c}_{1}+{F}_{e}\left({M}_{w}\right)+{F}_{p}\left({R}_{hypo},{M}_{w}\right)+{F}_{s}\left({V}_{S30}\right)+{\delta B}_{e}+{\delta WS}_{es}$$
(4)

where Y represent the natural log of either IA, CAV, D5–75 and D5–95 with respective units; \({\delta B}_{e}\) and \({\delta WS}_{es}\) represent between-event and within-event residual respectively (Al Atik et al. 2010). The predictor variables are the moment magnitude (MW), hypocentral distance (Rhypo) and \({V}_{S30}\) (time-averaged shear-wave velocity in upper 30 m of the soil column). We adopt a linear site-response term as a simple function of VS30. We also tested other forms of site response function (more complex) and nonlinear site behaviour in our models. One reason for not being able to incorporate nonlinear site behaviour could be insufficient soft-site recordings at short distances. This is further validated by the variation of residuals as a function of VS30 discussed later. The source (Fe), path (Fp) and site (Fs) functions are described as follows:

The source (event) function \({F}_{e}\) is given by the following equation:

$${F}_{e}={c}_{2}\left({M}_{w}-6.5\right)+{c}_{3}{(8.5-{M}_{w}) }^{2}$$
(5)

The path function \({F}_{p}\) is given by:

$${F}_{p}=\left[{c}_{4}+{c}_{5}\left({M}_{w}-6.5\right)\right]\text{ln}\left(\sqrt{{R}_{hyp}^{2}+{c}_{6}^{2}}\right)$$
(6)

The site function \({F}_{s}\) is given by:

$${F}_{s}={c}_{7}\text{l}\text{n}\left({V}_{S30}\right)$$
(7)

The model parameters (i.e. coefficients, c1-c7) of our four empirical models in Eqs. (47), were determined using mixed effects regression algorithm (Bates et al. 2015) separately for D5–75, D5–95, IA and CAV. This regression algorithm allows decomposition of total residuals into between-event component (\({\delta B}_{e}\)) with zero mean and standard deviation \(\tau\) and within-event residual (\({\delta WS}_{es}\)) with zero mean and standard deviation \(\phi\). The model coefficient c6 (pseudo-depth term) was derived in the first step by performing a simple nonlinear regression (fully fixed-effects regression). The remaining model coefficients and the between-event and within-event standard residuals were derived using linear mixed-effects regression (Bates et al. 2015), keeping c6 as the value determined from the previous stage. Note that the decomposition of residuals and estimation of model coefficients is performed together in a single step.

The final coefficients (for significant duration (D5–75 and D5–95), IA, and CAV models) along with corresponding standard errors in parameter estimations and statistical p-values are presented in Tables 1 and 2. We observe from Table 1 (for D5–75 and D5–95) that a large standard-error is associated with estimation of coefficients c3 and c5 which results in a large p-value (statistical insignificance). It clearly suggests that both, a quadratic dependence over magnitude and a magnitude-dependent distance scaling of D5–75 and D5–95 cannot be resolved by using the current dataset as observed by other studies (Sandikkaya and Akkar 2017; Bora et al. 2019). For IA and CAV models, all coefficients are found statistically significant (Table 2). The full variance-covariance matrices of estimated coefficients for all four models are presented in supplementary material (Table S1- Table S4). Note that the magnitude range covered by the present dataset is between MW 3.0–7.5 however, based on residual variations and comparison of median predictions with other global models we consider these models to be valid for events between MW 4.0–7.0 in the distance range 20–200 km. An extrapolation of the models beyond the magnitude and distance range prescribed here must be validated vis-`a-vis observed ground motions and with proper caution, if required. In the following sections, we first discuss comparison of the median predictions from the four empirical models with other regional and global studies. Following that, we discuss the residual variations. Finally, residual-correlations between different pairs of GMIMs are presented.

Table 1 Coefficients associated with D5–75 and D5–95 models from mixed effects regression
Table 2 Coefficients associated with IA and CAV models from mixed effects regression

5 Median predictions

In this section we present comparisons of the median predictions from the empirical models derived in this study with those from recently published models. The comparisons for significant durations are performed with Meimandi-Parizi et al. (2020) (hereafter MPA20), Yaghmaei-Sabegh et al. (2014) (hereafter YSS14), Afshari and Stewart (2016) (hereafter AS16) and Sandikkaya and Akkar (2017) (hereafter SA17) studies. The MPA20 and YSS14 models are calibrated on earlier versions of Iranian strong motion database whereas AS16 represents a global model calibrated on NGA-West2 (Ancheta et al. 2014) database with most of the records obtained from California region. Similarly, SA17 models for SD are calibrated on a pan-European (RESORCE) strong motion database (Akkar et al. 2014) with records dominated from Turkey and Italy. SA17 and AS16 used Rhypo and Rrup in their models, respectively. Figures 3, and 4 depict comparison of median predictions as a function of distance and magnitude, respectively. The differences in median predictions between different models are usually due to differences in underlying datasets and differences in functional dependence over magnitude, distance and VS30. The differences may also stem from incomplete functional forms that overlook, for example, the differences in stress drop, anelastic attenuation, and geometrical spreading. Thus, as expected there are differences in absolute values of D5–75 and D5–95 between the median predictions but, in general the distance scaling is consistent with the global model of AS16 and the pan-European model of SA17. The distance scaling of MPA20 is slightly different at larger distance which is mainly due to a different functional form adopted in their study. Median predictions from YSS14 model are in general good agreement with those from current study except with minor differences at shorter distances for smaller magnitude events. Note that the YSS14 model adopts a different functional dependence over magnitude and distance and that it uses base 10 of logarithm. For magnitude scaling of D5–75 and D5–95, the predictions from this study are in general good agreement with AS16 and SA17. The differences in the shape of the scaling are due to functional form, distance metric used, and actual differences in the underlying scaling of the ground motion with magnitude and/or distance. The last point is important where the functional form and distance metrics are identical for example the differences with SA17. In this case, the differences are due to differences in the underlying datasets. The D5–75 values from MPA20 are slightly larger than those from current study. Particularly, the magnitude scaling (i.e., slope) of MPA20 at smaller magnitudes is relatively stronger than that from other models. We observe an identical magnitude scaling of D5–75 from our model with that of AS16 and SA17 at Mw > 5. Note that the functional dependence of significant durations of AS16 is derived from seismological theory where the source component of ground-motion duration is considered as the reciprocal of Brune’s (1970, 1971) source corner frequency. The larger difference in scaling for D5–95 is mainly due to that a quadratic magnitude scaling of significant durations cannot be resolved by the current dataset as mentioned earlier (see Table 1). It is worth mentioning here that with NGA-West2 database that covers a much broader magnitude range AS16 and Bora et al. (2019) have found a rather weak dependence of duration over magnitude below MW < ~ 5.

Fig. 3
figure 3

Comparisons of distance scaling for the D5–75 (a and b) and D5–95 (c and d) significant duration models (median duration values) derived in this study, and models from previous studies (MPA20, YSS14, AS16 and SA17) for VS30=800 m/s. Note that, the AS16, SA17 and MPA20 predictions are shown for strike-slip events. Cyan empty circles are the observed strong-motion data

Fig. 4
figure 4

Comparisons of magnitude scaling for the D5–75 (a and b) and D5–95 (c and d) significant duration models (median duration values) derived in this study, and models from previous studies (MPA20, YSS14, AS16 and SA17) for VS30=800 m/s. Note that, the AS16, SA17 and MPA20 predictions are shown for strike-slip events. Cyan empty circles are the observed strong-motion data

Median predictions of IA and CAV are compared with recently derived IA and CAV models from SA17. As noted earlier, the IA and CAV models of SA17 are calibrated on a pan-European strong motion database. Figures 5 and 6 show comparison of median predictions for IA and CAV as a function of distance and magnitude, respectively. Figure 5 clearly demonstrates that IA scales rather strongly with distance in comparison to CAV. This is expected given that IA is computed using the squared amplitudes of accelerations which can be further appreciated by observing almost doubling of the coefficient c4 in Table 2. The median predictions are in general consistent with SA17 with differences in distance scaling of both IA and CAV. Differences in distance scaling indicate variability in regional attenuation that is captured by the two datasets.

Fig. 5
figure 5

Comparisons of distance scaling for the IA and CAV models (median IA and CAV values) derived in this study, and models from previous study (SA17) for VS30=800 m/s. Note that the SA17 predictions are shown for strike-slip events and extrapolated for MW < 4. Cyan empty circles are the observed strong-motion data in the selected magnitude (4.8 < MW < 5.2 and 6.6 < MW < 7.4) and VS30 (600 < VS30 < 1000 m/s) bins

Figure 6 depicts comparison of median predictions from the proposed IA and CAV models with that from SA17 models. The median predictions from this study and those from SA17 models are in general good agreement both at R = 30 and 80 km. However, one can observe large difference in scaling at Mw<5.0. We mainly attribute this to the differences in underlying datasets because the functional dependence over magnitude is identical in both the models. Moreover, it is worth mentioning that the SA17 model utilizes events with Mw≥ 4 whereas in the present study we consider events Mw≥3. Note that, the difference in magnitude scaling at smaller magnitudes was also observed for D5–75 in Fig. 4.

Fig. 6
figure 6

Comparisons of magnitude scaling for the IA and CAV models (median IA and CAV values) derived in this study, and models from previous study (SA17) for VS30=800 m/s. Note that the SA17 predictions are shown for strike-slip events and extrapolated for MW < 4. Cyan empty circles are the observed strong-motion data in selected distance (20 < R < 40 and 70 < R < 90 km) and VS30 (600 < VS30 < 1000 m/s) bins

Furthermore, Figs. S1 and S2 depict distance (Mw=5) and magnitude (R = 30 km) scaling for D5–75, D5–95, IA, and CAV models.

6 Residuals

The robustness of the selected functional form (and derived coefficients) of the four models is evaluated by analysing residual trends. As an outcome of our mixed effects regression analysis (Bates et al. 2015) using Eqs. (47), we obtain residuals that are decomposed into event-specific between-event and within-event components. The \({\delta B}_{e}\) are expected to capture source related variation (such as the stress parameter and radiation pattern) with respect to the average (median) model, while the \({\delta WS}_{es}\) represent path specific variations in recorded ground motion along with differences in site-characteristics of the station sites. The path-specific variation for IA and CAV represent the variations in attenuation regime while for ground-motion duration they reflect dispersion and scattering effects.

Figures 7, 8, 9 and 10 show the variations of residuals from four models for D5–75, D5–95, IA and CAV, respectively. Panels (a) in Figs. 7, 8, 9 and 10 show variation of \({\delta B}_{e}\) against magnitude. A rather unbiased variation of \({\delta B}_{e}\) with magnitude (for all four models) suggests the sufficiency of quadratic magnitude scaling term in Eq. (5). Similarly, panels (b) in Figs. 7, 8, 9 and 10 show variation of \({\delta WS}_{es}\) against hypocentral distance for all four models. A relatively large variability can be observed in \({\delta WS}_{es}\) however, there is no significant bias observed with distance except very minor overprediction (for D5–75 and D5–95) and minor underprediction for IA and CAV at larger distances. Panels (c) in Figs. 7, 8, 9 and 10 show variation of the station-averaged \({\delta WS}_{es}\) (i.e., \({\delta S}_{S2S}\)) against VS30. Clearly, we do not observe any systematic trends of \({\delta S}_{S2S}\) with VS30, indicating that a linear site-response term is sufficient to capture the site-effects in the data. In general, the variability (or spread) in residuals is smallest for D5–95 and largest for IA. A larger spread in IA is expected as an effect of squared amplitudes used for its computation. However, a larger spread in D5–75 residuals in comparison to that for D5–95 is rather intriguing.

Fig. 7
figure 7

a Between-event significant duration (5–75%) residuals (\({\delta B}_{e}\)) against magnitude. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 0.5 magnitude bins. b Within-event significant duration (5–75%) residuals (\({\delta WS}_{es}\)) against hypocentral distance. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in equally spaced (in log) distance bins. c Averaged within-event residuals at each station (i.e., \({\delta S}_{S2S}\)) against station VS30 values. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 100 m/s VS30 bins

Fig. 8
figure 8

a Between-event significant duration (5–95%) residuals (\({\delta B}_{e}\)) against magnitude. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 0.5 magnitude bins. b Within-event significant duration (5–95%) residuals (\({\delta WS}_{es}\)) against hypocentral distance. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in equally spaced (in log) distance bins. c Averaged within-event residuals at each station (i.e., \({\delta S}_{S2S}\)) against station VS30 values. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 100 m/s VS30 bins

Fig. 9
figure 9

a Between-event IA residuals (\({\delta B}_{e}\)) against magnitude. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 0.5 magnitude bins. b Within-event IA residuals (\({\delta WS}_{es}\)) against hypocentral distance. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in equally spaced (in log) distance bins. c Averaged within-event residuals (i.e., \({\delta S}_{S2S}\)) at each station against station VS30 values. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 100 m/s VS30 bins

Fig. 10
figure 10

a Between-event CAV residuals (\({\delta B}_{e})\) against magnitude. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 0.5 magnitude bins. b Within-event CAV residuals (\({\delta WS}_{es}\)) against hypocentral distance. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in equally spaced (in log) distance bins. c Averaged within-event residuals (i.e., \({\delta S}_{S2S}\)) at each station against station VS30 values. The dots with vertical bars (error bars) indicate mean residuals and the standard deviation of the mean in 100 m/s VS30 bins

We also provide an estimate of aleatory variability involved in all four models for D5–75, D5–95, IA and CAV. The total aleatory variability (\(\sigma\)) can be computed, using the following equation as a combination of \(\tau\) and \(\phi\):

$$\sigma =\sqrt{{\tau }^{2}+{\phi }^{2}}$$
(8)

The total standard deviation (σ) for significant duration (D5–75 and D5–95) are presented in Table 1 and for IA and CAV in Table 2. The relatively large standard deviation associated with IA is consistent with findings of Campbell and Bozorgnia (2012, 2019), SA17 and Farhadi and Pezeshk (2020). Figure S3 shows the comparison of aleatory uncertainty for all models from this study with that from SA17 models. As noted earlier, both the \(\tau\) and \(\phi\) are largest for IA (due to its definition) whereas \(\tau\) is comparable for the two durations D5–75 and D5–95 but, \(\phi\) is significantly larger for D5–75. This indicates that D5–75 is rather sensitive to path and site related variations in comparison to D5–95.

7 Empirical correlation analyses

In this section, we present empirical correlations between the different pairs of GMIMs used in this study. The correlation between two GMIMs can be evaluated by the correlation between the corresponding residuals of the two GMIMs (Baker and Jayaram 2008; Bradley 2015; Baker and Bradley 2017). As derived in this study and prescribed by most of the modern ground motion models (GMMs), in a mixed-effects regression algorithm, residuals are decomposed into δBe and \({\delta WS}_{es}\) components. Thus, in a first step, we estimate the linear correlations separately for \({\delta B}_{e}\) and \({\delta WS}_{es}\) assuming that they are independent normal variate using the following Eq. (9) (Ang and Tang 2007; Baker and Jayaram 2008; Bradley 2012):

$${\rho }_{x,y}=\frac{\sum _{n}\left[\left(x-\stackrel{-}{x}\right)\left(y-\stackrel{-}{y}\right)\right]}{\sqrt{\sum _{n}\left[{\left(x-\stackrel{-}{x}\right)}^{2}\right]\sum _{n}\left[{\left(y-\stackrel{-}{y}\right)}^{2}\right]}}$$
(9)

where x and y are generic variables (i.e., in this case \({\delta B}_{{e}_{i}}\) and \({\delta B}_{{e}_{j}}\) for the between-event correlation and \({\delta WS}_{{es}_{i}}\) and \({\delta WS}_{{es}_{j}}\) for the within-event correlation), \(\stackrel{-}{x}\) and \(\stackrel{-}{y}\) are the sample means of x and y; and \(\sum _{n}\left[\right]\) represents summation over the number of earthquakes (for computation of the between -event correlation), or over the number of ground motion records (for computation of the within -event correlation). Thus, this equation can be used to estimate \({\rho }_{{\delta B}_{{e}_{i}},{\delta B}_{{e}_{j}}}\) (between-event correlation), \({\rho }_{{\delta WS}_{{es}_{i}},{\delta WS}_{{es}_{j}}}\)(within-event correlation) and \({\rho }_{{\epsilon }_{i},{\epsilon }_{j}}\) (total residuals correlation). Here \(i\) and \(j\) represent the two GMIMs considered to compute the correlation for example: \(i={I}_{A}\) and \(j={D}_{5-95}\). We have presented a quantile-quantile plot (Figures S4 and S5) of both the between-event (\({\delta B}_{{e}_{i}}\)) and the within-event (\({\delta WS}_{{es}_{j}}\)) residuals of D5−75, D5−95, IA and CAV. It can be observed in both cases that the observed residual distribution is close to a normal distribution, with slight discrepancy towards the tail of the distribution. Figure S6 illustrates the distributions of the (normalized) between- and within-event residuals for the D5–75, D5–95, IA, and CAV. For comparison, the (theoretical) standard normal distribution and Kolmorogov–Smirnov goodness-of-fit bounds (at the 5% significance level) are also shown. The fact that both between- and within-event distributions lie within the Kolmorogov–Smirnov goodness-of-fit bounds illustrates the applicability of the models. Also, a Kolmogorov-Smirnov test indicated that the hypothesis that the \({\delta B}_{{e}_{i}}\) and \({\delta WS}_{{es}_{j}}\) are normally distributed could not be rejected at the 5% significance level. Note that we compute the correlations using the standardized residuals (with zero mean and unity standard-deviation).

7.1 Uncertainty in the correlation coefficient

Usually, correlation coefficients between various GMIMs are considered stable (Baker and Bradley 2017) thus the major source of uncertainty in the point estimate of the correlation coefficient is attributed to sampling uncertainty. We quantify the sampling uncertainty in our point estimates of linear correlation using bootstrap approach (Efron 1979; Eferon and Tibshirani 1993; Bradley 2012, 2015) to account for the finite number of sample sizes uncertainty. This approach uses random bootstrap samples with replacement from the observed data to calculate the correlation coefficient ρ for each bootstrap replicate. We performed 200 bootstraps and evaluated the mean and standard deviation of these 200 iterations for correlation findings to assess the stability of the results. The bootstrap means and standard deviations are summarised in Tables S5 and S7. Some earlier studies have found that the correlations between response spectral ordinates have been stable between different datasets such as NGAWest2 (Baker and Bradley 2017) and NGAWest1 (Baker and Jayaram 2008). Moreover, often the correlations (between GMIMs) are considered rather insensitive with respect to rupture scenarios as well (Baker and Jayaram 2008). However, as shown later, we have observed significant scenario dependence of the linear correlations between different GMIMs with the current dataset.

To facilitate easy comparison with other studies and for further applications, we combined the \({\delta B}_{{e}_{i}}\) and \({\delta WS}_{{es}_{j}}\) correlations to get the correlation between the total residuals. Using the definition of the correlation coefficient, the correlation between the total residuals (\(\epsilon\)) can then be found from the individual correlations of \({\delta B}_{{e}_{i}}\) and \({\delta WS}_{{es}_{j}}\) using the following equation (Bradley 2011a, 2012):

$${\rho }_{{\epsilon }_{i},{\epsilon }_{j}=\frac{1}{{\sigma }_{i}{\sigma }_{j}}\left[{\rho }_{{\delta B}_{{e}_{i}},{\delta B}_{{e}_{j}}}{\tau }_{i}{\tau }_{j}+{\rho }_{{\delta WS}_{{es}_{i}},{\delta WS}_{{es}_{j}}}{\phi }_{{ss}_{i}}{\phi }_{{ss}_{j}}\right]}$$
(10)

7.2 Correlation results

Figure 11 and Table S5 provide a summary of the Pearson correlation coefficients of the \({\delta B}_{e}\) and \({\delta WS}_{es}\) for different pairs of GMIMs. We observe in general rather low to moderate degree of correlations between IA and the two duration measures (D5–75, and D5–95) for both \({\delta B}_{e}\) and \({\delta WS}_{es}\). The negative values of correlations for both \({\delta B}_{e}\) and \({\delta WS}_{es}\) between IA and the significant duration measures is consistent with previous studies (Bradley 2015) and the notion that a lower than average amplitude usually corresponds to a higher than average duration of ground-motion (or vice-versa) as the wave energy (power in this case) gets distributed over a longer interval of time in the seismogram. A relatively higher degree of negative correlation between IA and D5–95 in comparison to IA and D5–75 essentially reflects that a significant portion of energy is distributed over a longer duration in the time-histories from the current dataset. It is noteworthy that the correlations are relatively stronger (in their magnitude) for \({\delta WS}_{es}\) in comparison to that for \({\delta B}_{e}\). The correlations between CAV and the two duration measures (D5–75, and D5–95) are quite small for both \({\delta B}_{e}\) and \({\delta WS}_{es}\). However, we believe that a relatively higher (negative) correlation between IA and the two duration measures is consistent as D5–75, and D5–95 are defined based on the cumulative sum of squared acceleration amplitudes. The correlation between \({\delta B}_{e}\) of CAV with IA is 0.9459 (very high degree correlation). The correlation between \({\delta WS}_{es}\) of CAV with IA is 0.9476 (very high degree correlation). This shows that overall CAV and IA are highly correlated which is unsurprising, given that they are both cumulative measures of a ground motions intensity.

We further evaluate the residual correlations in various magnitudes and distance bins as an indicative of scenario dependence of the correlations. Figures 12, 13 and 14 illustrate the observed empirical correlations of \({\delta B}_{e}\) and \({\delta WS}_{es}\) in different magnitudes, distance and VS30 bins respectively for various pairs of IA, CAV, D5–75 and D5–95. For Fig. 12, we consider simple magnitude bins Mw 3–4, Mw 4–5, Mw 5–6 and Mw 6–7 and the average correlations are plotted at the mid-points of these bins along with the bin scatter. The average correlation values may get affected by the bin-width chosen here nevertheless, they provide an average trend captured by the data. Figure 12a illustrates that the negative correlation (of \({\delta B}_{e}\)) between IA and D5–75 increases in amplitude as the event magnitude increases. At smaller magnitudes Mw 3–4, the correlation is close to zero whereas for the moderate magnitude range Mw 4–5, Mw 5–6 it is a moderate degree correlation with -0.2536, -0.2631 values in the respective bins. The largest correlation is ~-0.5 observed in the Mw 6–7 bin. A similar trend is observed in Fig. 12b with slightly larger values of correlations in each bin. This is an interesting observation from the point of view that larger magnitude earthquakes produce more low-frequency energy as the corner frequency of the event gets smaller with the increase in event magnitude. We are aware that these correlations are affected by the standard-deviation of the residuals as well (see Eq. 9). Hence, we plotted the \(\tau\) in same magnitude bins (Mw 3–4, Mw 4–5, Mw 5–6 and Mw 6–7) for all the GMIMs as given in supplementary Figure S7. We can appreciate that the low degree of correlation in Mw 3–4 and Mw 4–5 bins is explained by relatively larger standard-deviation (\(\tau\) in this case) but for Mw 5–6 and Mw 6–7 it does not fully explain the observed trend in Figs. 12a and b. Figures 12c and d show the correlations (of \({\delta B}_{e}\)) between CAV-D5–75 and CAV-D5–95, respectively. A similar trend is observed in this case, for Mw 3–4 the correlations are positive with low degree of correlations (~ 0.2 and ~ 0.15). In the moderate magnitude range Mw 4–5, Mw 5–6, the correlations are close to zero (-0.0016, -0.0294) whereas in the larger magnitude range Mw 6–7 there is comparatively stronger negative correlations (-0.3167, -0.2803). A slightly stronger negative correlation for Mw 6–7 seems consistent with the notion that CAV captures low-to-moderate frequency range of ground motion. Figure 12e shows the correlation (of \({\delta B}_{e}\)) between IA and CAV. In this case, the (positive and high degree) correlation values are relatively stable across different magnitude bins with lowest for Mw 6–7 range. This again suggests that IA and CAV may capture slightly different frequency ranges that are present in the ground-motion.

Figure 13 shows the correlation of \({\delta WS}_{es}\) in four different distance bins for different pairs of GMIMs as considered in Fig. 12. For Fig. 13, we consider four broad distance bins with Rhypo 10–50 km, 50–100 km, 100–150 km and 150–200 km. Average correlations along with the bin spreads (25th and 75th percentiles) are plotted at the mid-point of these bins. We have considered rather broad bins to capture more observations hence to get a stable estimate of average values. Figures 13a and b show the correlations for IA-D5–75 and IA-D5–95, respectively. Except the differences in absolute values of the correlations the trend of decreasing correlation with increasing distance is similar in both the figures. The strongest negative correlation between IA and the two duration measures is found to be in the near distance ranges (Rhypo < 100 km). Beyond Rhypo > 100 km the correlation between the pair of GMIMs is negligible. This observation is again consistent with the seismological theory that with increasing distance there are more scattered wave arrivals which often form the coda part of the seismogram. In the near distance range (Rhypo < 50–70 km) most of the energy in the seismogram is due to the direct S-wave arrivals. Supplementary Figure S8 shows values of \(\phi\) in the same distance bins for all four GMIMs. The \(\phi\) values are lowest (relative to other bins for all the four GMIMs) in the distance range Rhypo > 100 km hence they do not account for the low values of correlation in these distance ranges. Figure 13e shows the correlations (of \({\delta WS}_{es}\)) between CAV and IA in the distance bins considered here. The correlations are rather stable with respect to distance with slightly lower values and larger scatter in the distance bin Rhypo 150–200 km.

Fig. 11
figure 11

Box plots of Pearson Correlation Coefficients of (a and c) the between-event (\({\delta B}_{e}\)) and (b and d) within-event (\({\delta WS}_{es}\)) residuals obtained between D5–75, D5–95, IA, and CAV. On each box, the central mark (horizontal red lines) indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (first and third quintile.) The black horizontal lines are minimum and maximum values

Fig. 12
figure 12

Box plots of Pearson Correlation Coefficients of the between-event residuals (\({\delta B}_{e}\)) obtained between:(a) IA and D5–75; (b) IA and D5–95; (c) CAV and D5–75; (d) CAV and D5–95 and (e) CAV and IA in magnitude bins for Mw 4–5, Mw 5–6 and Mw 6–7. On each box, the central mark (horizontal red lines) indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (first and third quintile) in each magnitude bin, respectively. The black horizontal lines are minimum and maximum values

Fig. 13
figure 13

Box plots of Pearson Correlation Coefficients of the within-event residuals (\({\delta WS}_{es}\)) obtained between: (a) IA and D5–75; (b) IA and D5–95; (c) CAV and D5–75; (d) CAV and D5–95 and (e) CAV and IA in distance bins for Rhypo 10–50, Rhypo 50–100, Rhypo 100–150 and Rhypo 150–200 km. On each box, the central mark (horizontal red lines) indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (first and third quintile) in each Rhypo bin, respectively. The black horizontal lines are minimum and maximum values

We further investigated the correlation of \({\delta WS}_{es}\) (for D5–75, D5–95, IA, and CAV) with κ0 (near-surface attenuation parameter) (Table S6) at each station. Out of 338 stations, κ0 was available for 215 stations from the study of Davatgari-Tafreshi et al. (2022). Figure 14a illustrates the correlations for D5–75, D5–95, IA, and CAV together in a single plot. In general, the values of the correlations are very low between κ0 and the GMIMs considered here. However, the sign of correlation is positive between κ0 and the two SD measures. Interestingly, the correlation between \({\delta WS}_{es}\) (for IA) and κ0 is quite close to zero. This observation is contrary to the popular notion that IA as the representative of high-frequency ground motions. In fact, the correlation between \({\delta WS}_{es}\) (for CAV) and κ0 has negative sign which is consistent with how CAV is defined. We further explore these correlations by plotting them in different VS30 bins. For that purpose, we consider four VS30 bins: VS30 200–400, VS30 400–600, VS30 600–800 (m/s) and VS30 > 800 m/s. Average correlations (along with associated scatter) are plotted at the mid-point of these bins. Clearly, there is a VS30 dependence of the correlations between \({\delta WS}_{es}\) and the two duration measures (D5–75 and D5–95) with a slight (positive) correlations in VS30: 200–400 m/s. The correlation between \({\delta WS}_{es}\) (for IA) and κ0 does not show any VS30 dependence. On the other hand, a moderate degree of negative correlation (~-0.4) between \({\delta WS}_{es}\) (for CAV) and κ0 is observed in VS30 bin: 200–400 m/s. At other VS30 values this correlation is negligibly small close to zero.

Fig. 14
figure 14

a Box plots of Pearson Correlation Coefficients of the within-event residuals (\({\delta WS}_{es}\)) obtained between D5–75, D5–95, IA and CAV with κ0. On each box, the central mark (horizontal red lines) indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (first and third quintile) respectively. b, c, d and e Box plots of Pearson Correlation Coefficients of the within-event residuals (\({\delta WS}_{es}\)) obtained between D5–75, D5–95, IA and CAV with κ0 in VS30 bins for VS30 200–400, VS30 400–600, VS30 600–800 (m/s) and 800 < VS30 (m/s). On each box, the central mark (horizontal red lines) indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (first and third quintile) in each VS30 bin, respectively. The black horizontal lines are minimum and maximum values

Table S7 and Figure S9 summarise the Pearson Correlation Coefficients of the total residuals, computed using Eq. (10), in this study. Our results show that D5–75 and D5–95 have a high degree correlation (0.8613). This is consistent with Bradley’s (Bradley 2011b) results (0.843). An imperfect correlation between D5–75 and D5–95 essentially suggests how these two measures of duration capture different information from the acceleration time-histories. In our study, total residual of IA negatively correlates (moderate degree correlation) with D5–75 (-0.3240) and D5–95 (-0.3932). Bradley (2015) observed that IA negatively correlates with D5−75 (-0.19) and D5–95 (-0.20). The results of the two studies are not identical, but they are comparable. The total residual-correlation between CAV and the two duration measures is very low: D5–75 (-0.0699), and D5–95 (-0.1028). Bradley (2011b) also observed that CAV has a very small positive correlation with D5–75 (0.08) and D5–95 (0.12). At first the relatively small correlation of CAV with significant duration measures may appear contrary to the notion that CAV carries more information regarding duration than the IA (Campbell and Bozorgnia 2010; Macedo et al. 2020). The lack of correlation between CAV and significant duration can be understood through Fig. 14 in which we observed a negative correlation between CAV and κ0 whereas, D5–75 and D5–95 are weakly (but positive) correlated with κ0. Thus, one can appreciate the weak negative correlation between CAV and the two duration measures as an outcome of the two competing effects at play here. It is widely accepted that parameter κ0 reflects high-frequency behaviour (Anderson and Hough 1984) of ground-motion in Fourier domain. A relatively strong (negative) correlation between CAV and κ0 in comparison to that between IA and κ0 indicates that CAV is more representative of high-frequency ground motion. The often observed high correlation of IA with PSA (spectral pseudo-acceleration) at shorter periods (Bradley 2012, 2015) can be understood by observing similarity between the definition of IA (Eq. 1) and the zeroth spectral moment of the single degree of freedom (SDOF) oscillator response (Bora et al. 2016; Eqs. (19) and (20) in Kolli and Bora 2021) through Parseval’s theorem. Thus, similar to PGA (and other short period PSAs), IA is affected by a broad range of frequencies in the ground-motion not just the high frequencies. This can be partially appreciated by observing a stronger negative correlation (of \(\delta {B}_{e}\)) between IA and D5–75, D5–95 for larger magnitudes in Fig. 12. Events with larger magnitude are rich in low-frequency content.

We have observed strong positive correlation (very high degree correlation) between IA and CAV (0.9435) as also observed by Bradley (2015) (0.89) and Campbell and Bozorgnia (2012) (0.923). Farhadi and Pezeshk (2020) mentioned that the Pearson correlation coefficients equal 0.959, suggesting a very high correlation between IA and CAV. An imperfect correlation between IA and CAV clearly suggests that they are sensitive to different frequency ranges in the ground-motion as has been discussed here.

8 Conclusions

In this study, we derive empirical models for four GMIMs namely, IA, CAV, D5-75, and D5-95. A common functional form was used to model the observed scaling of the selected GMIMs over magnitude, distance and VS30 to facilitate easy comparison and interpretation of the derived coefficients (model parameters). The models are calibrated on 1749 acceleration (horizontal) time histories compiled by the Building and Housing Research Center (BHRC) of Iran originated from 566 events (Mw 3–7.5) and recorded at 338 stations at Rhypo≤200 km in the time period from 1976 to 2020. An orientation independent measure of the ground-motion (RotD50) was used for all the GMIMs. Mixed-effects regression algorithm was employed to estimate the model coefficients and the components of uncertainty (residuals). Median predictions of IA, CAV, D5-75, and D5-95 were found consistent with other previously published regional and global studies with obvious differences arising from the choice of datasets and adopted functional forms. Robustness of the derived models (and coefficients) was further assessed by examining residual distributions and their variation against predictor variables such as magnitude, distance and VS30. Based on residual variations and comparison with other global models, we consider these models to be valid for shallow crustal earthquakes in active tectonic regions with magnitudes (Mw) ranging from 4.0 to 7.0, and distances ranging from 20 up to 200 km.

We examined the residual correlation between different pairs of GMIMs. Overall, low-to-moderate degree of (total) correlation was found between IA and the two SD measures. Almost no correlation was observed between CAV and the two SD measures with values−0.072 and−0.11 between CAV-D5-75 and CAV-D5-95, respectively. As expected, a very high degree of correlation was observed between IA and CAV however an imperfect correlation (~ 0.95) indicates that both the cumulative GMIMs capture different aspects of ground-motion. Similarly, a high degree of correlation (~ 0.87) between D5-75 and D5-95 is an indicative of their definition and how they are computed. Again, an imperfect correlation between the two SD measures clearly suggests that they may not essentially capture the same information from a ground-motion trace. The correlations were analyzed for \({\delta B}_{e}\) and \({\delta WS}_{es}\) separately which were subsequently combined to obtain correlation of total residuals. In general, the correlation values (in terms of their absolute values) are lower for \({\delta B}_{e}\) in comparison to that for \({\delta WS}_{es}\). The uncertainty in correlation estimates was evaluated by using bootstrapping procedure.

We further investigated rupture scenario dependence of the residual correlations (for both \({\delta B}_{e}\) and \({\delta WS}_{es}\)) in different magnitudes and distance bins. The negative correlations of \({\delta B}_{e}\) between IA and the two SD measures were found strongest for larger magnitude events Mw > 6. Almost no correlation was found for Mw < 4.0 between the two GMIMs whereas in the moderate magnitude range Mw 4–6 the correlation was rather low. A similar trend was observed for the correlations of δBe between CAV and the two SD measures. Similarly, the correlations for \({\delta WS}_{es}\) between IA and the two SD measures were found to be depending upon the distances. For both IA-D5-75 and IA-D5-95 the values of negative (\({\delta WS}_{es}\)) correlations were strongest in the near distance range Rhypo < 50 km with values decreasing with increase in distance. Almost no correlation (< ~-0.2) was observed beyond 100 km distance. A similar trend (with slightly lower values) was observed for the correlations of \({\delta WS}_{es}\) between CAV and the two SD measures.

We further investigated the correlations of \({\delta WS}_{es}\) (for all four GMIMs) with station-specific κ0 values. In general, the values of the correlations (correlation of \({\delta WS}_{es}\) ) are very low between κ0 and the GMIMs considered here. We observed positive correlation between κ0 and the two SD measures, and correlation value quite close to zero between IA (correlation of \({\delta WS}_{es}\)) and κ0. A relatively stronger negative correlation was observed between CAV and κ0 in comparison to that between IA − κ0. A close-to-zero correlation between IA and κ0 is opposite to the popular notion of IA being more representative of high-frequency ground-motions in comparison to CAV. At sites with lower VS30 values, the negative correlation (− 0.4) between CAV and κ0 was evident. The relatively strong (negative) correlation between \({\delta WS}_{es}\) (for CAV) and κ0 is consistent with how CAV is defined.

Correlation models (e.g., Bradley 2012a, b) between different GMIMs are key components in the application of vector probabilistic seismic hazard analysis (VPSHA) and the generalized conditional intensity measure (GCIM) methods (Bazzurro 1998; Baker and Cornell 2005; Bradley 2012). In this context, clearly, the results particularly with regard to residual correlations represent the seismological characteristics and properties captured by the current dataset. Nevertheless, the analysis in this article presents significant new insights with regard to residual correlations (between GMIMs considered here) and their scenario dependence which has broader implications for engineering applications such as VPSHA and ground-motion selection. Clearly, a thorough investigation of residual correlations (between various GMIMs) is warranted for other geographical regions as well with much larger datasets which cover broader event magnitude range than that is covered by the current dataset.

9 Data and resources

The data underlying this article were accessed from the Building and Housing Research Center (BHRC) of Iran (http://www.bhrc.ac.ir/, last accessed December 2020). The derived data generated in this research will be shared on reasonable request to the corresponding author. The supplemental material to this article includes nine figures, which provides comparisons of distance and magnitude scaling for the significant duration (D5–75 and D5–95), IA and CAV models, variations in between-event, within event and total standard deviation for models, the quantile-quantile plot of the between-event and within-event residuals, distributions of normalized between-event; and normalized within-event residuals, box plots of between-event standard deviation (τ) and within-event standard deviation (ϕ), and box plots of Pearson Correlation Coefficients of the total residuals, and seven tables, which contain, coefficients covariance, the summary of the Pearson Correlation Coefficients of the between-event and within-event residuals, the Pearson Correlation Coefficients of within-event residuals with κ0, and the Pearson Correlation Coefficients of the total residuals.