1 Introduction

Amplitude, frequency content and duration jointly describe the basic characteristics of ground motion, but research on ground-motion duration started relatively late compared to that on the amplitude and frequency content. In most engineering seismic analyses, the influence of the amplitude and frequency content is considered more than the duration because when the engineering structure is in the elastic response stage, the duration has little effect on its seismic response. However, when the engineering structure is in the inelastic response stage, even if the maximum deformation of the structure does not exceed the limit deformation, the structure may collapse due to the loss of energy storage capacity reaching a certain limit. The effect of duration on the seismic response of the structure cannot be ignored. Therefore, to fully describe the effect of ground motion on structural seismic response, the duration should be considered.

The influence of the duration effect on structural response is still controversial (Du and Wang 2017); most studies have reported that the duration has a negligible effect on the peak structural responses (e.g., Rahnama and Manuel 1996; Cosenza et al. 2004; Iervolino et al. 2006), but it is generally accepted that there is a positive correlation between the duration and cumulative damage measures (e.g., Meskouris 1983; Uang and Bertero 1990; Reinoso et al. 2000; Bommer et al. 2004), as described by Hancock and Bommer (2006). In recent years, with the development of performance-based seismic design theory, researchers have paid increasing attention to the influence of ground-motion duration on the seismic capacity of structures. Chandramohan et al. (2016) examined the influence of duration on the collapse capacities of a five-story steel moment frame and a reinforced concrete bridge pier; the results showed that the collapse capacity of the steel moment frame is 29% lower, and for bridge pier, it is 17% lower, when using long duration records. Bravo-Haro and Elghazouli (2018) examined the influence of the ground-motion duration on the seismic response of steel moment frames with due consideration for cyclic degradation effects; the results indicated collapse capacity reductions up to 40% due to the duration influence. Bravo-Haro et al. (2020) presented a detailed investigation into the seismic response of nondeteriorating and deteriorating single degree-of-freedom systems controlled by P-Δ effects, with due account for the duration influence; they found that the duration effect is significant, increasing with an increasing structural period and decreasing with an increasing P-Δ effect. Liapopoulou et al. (2020) explored the effects of the ground-motion duration and acceleration pulse on the collapse capacity of ductile single-degree-of-freedom systems and showed that the collapse ability of flexible bilinear systems can be reduced by 60% due to the duration effect at low level P-Δ levels. Du et al. (2020) quantitatively investigated the duration effect on structural collapse assessment by using hazard-consistent ground-motion suites; the results showed that a longer duration record is more likely to reduce the structural collapse capacity. These prior researches indicate that the influence of the ground-motion duration on the collapse capacities of structures cannot be ignored. Therefore, it is recommended to consider this duration in addition to its intensity and frequency content in structural design and seismic risk assessment.

There are more than 30 definitions of ground-motion duration, and the significant duration (DS) is the most commonly used in the field of earthquake engineering and the one most studied by researchers (Chandramohan et al. 2016; Du and Wang 2017). Ground-motion prediction equations (GMPEs) are one of the critical components in probabilistic seismic hazard assessment (PSHA), and the GMPEs of the duration are mainly developed according to the attenuation relationship (Akkar et al. 2018; Ameri et al. 2017; Ebrahimian et al. 2019; Zolfaghari and Darzi 2019). The GMPE of the duration was first proposed by Esteva and Rosenblueth (1964). In the following 20 years, some scholars also proposed their own GMPEs. The forms of these GMPEs are simple, and the influencing factors considered are relatively few (Housner 1975; Trifunac and Brady 1975; Dobry et al. 1978; McGurie and Barnhard 1979; Kamiyama 1984). Abrahamson and Silva (1996) first expressed the source duration (Dsource) as a function of the moment magnitude (Mw) and stress drop (Δσ) based on the Brune source model, and the influence of rupture distance (Rrup) and site type (S) were added to their GMPEs of DS (AS96). Hernandez and Cotton (2000) developed a GMPE for DS5-95 based on ground-motion databases in California and Italy, but the coupling effect of magnitude and path terms was not included. Reinoso and Ordaz (2001) proposed a GMPE for DS2.5-97.5 using ground-motion data of normal faults in the subduction zone of Mexico and combined the duration with random vibration theory to predict the response spectrum. On the basis of the work of Abrahamson and Silva, Kempton and Stewart (2006) established a GMPE (KS06) for DS5-75 and DS5-95 using the ground-motion data of global shallow crustal earthquakes near active plate margins. Different from the previous GMPEs, KS06 takes the time-averaged shear-wave velocity to 30 m (VS30) as the site effect parameter and considers the influence of the depth to shear wave velocity of 1500 m/s for the site (Z1.5) on DS. Bommer et al. (2009) used the NGA-West1 database to put forward GMPEs for DB (bracketed duration), DU and DS (BSA09). In addition to Mw, Rrup and VS30, the influence of the fault type Frv was added to the GMPEs of DB and DU, and the influence of the depth to the top of rupture (Ztor) was incorporated into the GMPE. BSA09 considered the coupling of magnitude and path, but the basin depth was not included. Yaghmaei-Sabegh et al. (2014) proposed a GMPE of DS for Iran. Due to the lack of fault type, stress drop, depth of rupture surface and other data in Iran, their GMPE includes only the magnitude, hypocentral distance and site type. Lee and Green (2014) proposed a GMPE of DS (LG14) in stable continental regions and compared and analyzed its difference with active shallow crustal region motions. Afshari and Stewart (2016) used the NGA-West2 database to develop GMPEs (AS16) for DS20-80, DS5-75, and DS5-95, and AS16 was found to be an improvement over KS06. Du and Wang (2017) also developed a new GMPE (DW17) based on the NGA-West2 database, which improved the model prediction for small- and medium-magnitude earthquakes and far-field earthquakes. Anbazhagan et al. (2017) used intraslab regions from Canada, Australia, the Indian Peninsula and the central and southern United States to present GMPEs for DB and DS, and Xu and Wen (2018) established a GMPE of DS for mainland China based on strong earthquake data in China. López-Castañeda and Reinoso (2021) and Jaimes and García-Soto (2021) successively proposed GMPEs (JG21) for DS of ground motion for interface and intraslab earthquakes in the Mexican subduction zone. The results showed that DS of subduction interface and intraslab earthquakes is significantly different from those in other parts of Mexico and that these two types of earthquakes need to be considered separately when establishing prediction models. Bahrampouri et al. (2021) proposed GPMEs of DS (BRG21) for intraslab, interface, and shallow crustal earthquakes based on the KiK-net database, which compensated for the lack of research on the GMPEs of duration in subduction zones, but the effect of depth was not incorporated in these equations. In addition, the magnitude term in the exponential form in the equation may not easily accommodate regression analysis under different data sets, and the residuals have prominent trends when Mw < 5 and Rrup < 100 km.

As the GMPEs of ground-motion parameters have been successively proposed, the spatial correlation of ground-motion parameters has gradually attracted attention. In terms of seismic hazard analysis, the traditional single-site analysis method cannot evaluate the probability of simultaneous earthquakes at multiple sites, and the spatial correlation model of ground motions is beneficial to develop the seismic hazard analysis from a single site to multiple sites (Albarello and Peruzza 2017). Therefore, scholars have begun to analyze the spatial correlation characteristics of ground-motion parameters through the residuals of GMPEs. Boore et al. (2003) analyzed the spatial variability of the PGA using the main shock records of the 1994 Northridge earthquake in the United States. The results showed that the spatial variability in PGA decreases with increasing distance between the two sites. Goda and Hong (2008) studied the spatial correlation of the PGA, PGV and pseudo-spectral acceleration p-Sa(T = 0.3 s, 1.0 s, 3.0 s) using the records of Southern California in the United States and Chi-Chi in Taiwan, China. The results showed that the spatial correlation decreases with increasing separation distance. The spatial correlation of ground motion is still one of the key research topics of scholars, and an increasing number of spatial correlation models have been proposed (Wang and Takada 2005; Hong et al. 2009; Jayaram and Baker 2009; Goda and Atkinson 2010; Du and Wang 2013; Bradley 2014; Foulser-Piggott and Goda 2015), but the spatial correlation curves of individual earthquake events are more variable. However, to date, research on the spatial correlation of ground-motion duration has been very rare, especially for the Japanese subduction zone.

This study analyzes the effect of Mw, Rrup, VS30, δZ1 (the difference between the depth to shear wave velocity of 1000 m/s for the site and a median depth conditional on VS30) and other factors on DS (the geometric mean of significant durations in two horizontal directions) by developing new GMPEs of Japanese subduction intraslab and subduction interface earthquakes based on K-NET and KiK-net databases. The magnitude term is simplified, the magnitude is calibrated by a bilinear function (Xing and Zhao 2021), and the depth (Ztor) term is added in the new GMPEs. Through residual analysis and comparison with existing prediction models, the rationality and reliability of the prediction model proposed in this study are proven. Finally, according to the intra-event residuals of the new GMPEs, the spatial correlation of DS is analyzed, the spatial correlation coefficient of DS is calculated by using semivariograms, and the corresponding continuous spatial correlation function is obtained by fitting an exponential model. This work provides a reference for the development of next-generation GMPEs based on nonergodic assumptions, the spatially distributed lifeline engineering, and regional seismic hazard and loss assessment.

2 Strong ground-motion databases

The strong ground-motion data in this study come from the K-NET and KiK-net databases developed by the National Research Institute for Earth Science and Disaster Prevention (NIED) of Japan. The ground-motion records of nearly the past 25 years from January 1997 to July 2021 are included and selected according to the following standards: (a) select earthquake events with 4 ≤ Mw ≤ 9 and Rrup ≤ 300 km; (b) identify PGA ≥ 10 gal (both horizontal components); (c) discard records with abnormal waveforms, including clearly aberrant waveforms, incomplete waveforms (mainly manifested as missing P waves), and multiple earthquake events included in one waveform; (d) discard records with incomplete information, including records with missing site information and focal mechanism solutions; and (e) the number of records contained in a single earthquake event is not less than 10.

Since the ground-motion records obtained from K-NET and KiK-net are unprocessed and there are different degrees of baseline wander and noise pollution, the records must be baseline corrected and filtered before used. The baseline correction method proposed by Ohsaki (2008) is used. Before filtering, P-wave arrival picking is performed on each record to extract the pre-event noise, the Fourier amplitude spectrum of the ground-motion signal and the pre-event noise signal is calculated, and the frequency range [fc1, fc2] of the signal-to-noise ratio (SNR) ≥ 3 is determined (Bozorgnia 2020). This study stipulates that the high-pass cutoff frequency fc1 should be between 0.05 and 0.5 Hz and that the low-pass cutoff frequency fc2 should not be lower than 25 Hz (Bahrampouri et al. 2020). For the two horizontal components of one record, fc1 takes the maximum high-pass cutoff frequency of the two components, and fc2 takes the minimum low-pass cutoff frequency of the two components. After determining the filtering frequency range, zero-padding is performed on each record, and a 4th-order Butterworth acausal filter is used for filtering (Boore 2005; Boore and Akkar 2003).

The latitude and longitude coordinates, depth, and Mw of the sources in this study are from the Japan Meteorological Agency (JMA) Unified Hypocenter Catalog and F-net Mechanism Research. The source information for the events in 2005 and later are taken from the JMA Unified Hypocenter Catalog with K precision, and other events are given in F-net mechanism research. The latitude and longitude coordinates of the stations are derived from the site list given by K-NET and KiK-net, and we use the method proposed by Boore et al. (2011) to calculate VS30. The fault models provided by the United States Geological Survey (USGS), K-NET, SRCMOD database, and published papers (see electronic supplement) are used in this study, allowing Rrup and Ztor to be calculated. For earthquake events without a fault model, an empirical formula is used to estimate Rrup and Ztor (Scherbaum et al. 2004; Mai et al. 2005; Kaklamanos et al. 2011). The global subduction-zone geometry model Slab 2.0 provided by Hayes et al. (2018) is used to consider the location of the hypocenter relative to the depth of the surface of the subducting plate, and the earthquake events are classified according to the earthquake classification method of the Japanese subduction zone proposed by Zhao et al. (2015). It should be pointed out that the classification of some earthquake events is based on the research results of NGA-Subduction Project (Stewart 2020).

According to the abovementioned ground-motion records selection and data processing, 68,256 records are finally selected, including 27,086 subduction intraslab records and 10,209 subduction interface records. The distribution of Mw versus Rrup is shown in Fig. 1, and the distribution of the number of stations versus VS30 is shown in Fig. 2. An interesting phenomenon in Fig. 2b is that two large-magnitude earthquakes are limited to data with Rrup less than 250 km, shorter than smaller-magnitude earthquakes. There are two reasons to explain this phenomenon: on the one hand, the distribution of stations is not always uniform, which leads to a lack of records over some distance ranges; on the other hand, the records with large distance have a relatively low SNR and poor data quality. To ensure the high quality of the database in this study, these low-quality data have been removed.

Fig. 1
figure 1

Distribution of Mw versus Rrup: a subduction intraslab earthquakes and b subduction interface earthquakes

Fig. 2
figure 2

Histogram of the number of stations versus VS30: a subduction intraslab earthquakes and b subduction interface earthquakes

Two components (E-W and N-S) of the accelerograms in the database and their Husid plots are shown in Fig. 3. Even though long pre-event and post-event noise (approximately 20–30 s) signals appear in the ground-motion acceleration records, the slope of the Husid value is small and has almost no effect on DS.

Fig. 3
figure 3

Accelerograms (TCG0142107270519) in the database and their Husid plots: a, b accelerograms and c, d Husid plots

3 Prediction models

The ground-motion duration D is usually written as the sum of the magnitude term FM, the path term Fpath and the site term FVS30 (Kempton and Stewart 2006), and the expression is as follows:

$$\ln D = F_{M} + F_{{{\text{path}}}} + F_{VS30}$$
(1)

With the continuous improvement in GMPEs, an increasing number of influencing terms have been considered, including basin and focal depth. The influence of the basin effect on the ground motion is very significant. This is because when the seismic wave enters the basin from the soil, the difference in the impedance of the medium amplifies the ground motion, and at the same time, the body waves reaching the critical incident angle are completely reflected in the basin. Surface waves propagating inside the basin are generated. After complex superposition and interference of seismic waves, the ground motion is amplified, and the duration is also significantly increased. Depth is also an indispensable factor in the relationship with ground-motion attenuation. Zhao et al. (2016a, b, c) noted that the effect of depth on ground motion in the subduction zone is strong. There are many different types of earthquakes that can be generated over a wide range of depths, so changes in depth can have a significant effect on duration. Based on the K-NET and KiK-net databases, the influence of the magnitude, path, VS30, basin and depth to the top of rupture on DS of the ground motion are deeply explored, and a new GMPE of DS is proposed in this study:

$$\ln D_{S} = F_{{{\text{source}}}} + F_{{{\text{path}}}} + F_{{{\text{site}}}} + \delta B + \delta W = F_{M} + F_{{{\text{path}}}} + F_{VS30} + F_{{{\text{basin}}}} + F_{{{\text{depth}}}} + \delta B + \delta W$$
(2)

where, the source term Fsource includes the magnitude term FM and the depth to the top of the rupture term Fdepth, the site term Fsite includes the VS30 term FVS30 and the basin term Fbasin, δB is the inter-event residual, and δW is the intra-event residual. The coefficients in the GMPE are regressed using random effects regression analysis (Abrahamson and Youngs 1992), and the t-distribution statistics of each coefficient and the AIC value of the GMPE are calculated (Akaike 1974). When a new explanatory variable is added to the GMPE, only when |tα/2|≥ 1.96 (significance level α = 0.05) and the AIC value decreases upon adding the new explanatory variable can the new explanatory variable be significant, and the goodness of fit of the GMPE is higher; otherwise, the new explanatory variable needs to be removed.

3.1 Path term

By considering the coupling relationship between magnitude and distance, the path term cannot be obtained directly from the distribution of DS and Rrup. Therefore, the distribution of DS on Rrup is grouped according to different Mw ranges, and the path term under different Mw ranges is given. Figure 4 shows the Rrup distribution of DS5-95 under each Mw range. The Rrup distribution of DS5-75 is similar to that of DS5-95 and is given in the electronic supplement due to space limitations (the results for DS5-75 in later sections are also included in the electronic supplement). By fitting the data with a piecewise linear function, it can be found that when Mw < M4*, there is a significant difference in the slope of the piecewise linear function on both side of the reference distance R*, and the path term is as follows:

$$F_{{{\text{path}}}} = \left\{ {\begin{array}{ll} {C_{1} \ln R_{{{\text{rup}}}} ,} &\quad {R_{{{\text{rup}}}} \le R^{*} } \\ {C_{1} \left[ {\ln R^{*} + C_{2} \left( {\ln R_{{{\text{rup}}}} - \ln R^{*} } \right)} \right],} &\quad {R_{{{\text{rup}}}} > R^{*} } \\ \end{array} } \right.$$
(3)
$$C_{2} = \left\{ \begin{array}{ll} 0,&\quad M_{{\text{w}}} < M_{3}^{*} \hfill \\ \frac{{M_{{\text{w}}} - M_{3}^{*} }}{{M_{4}^{*} - M_{3}^{*} }} ,&\quad M_{3}^{*} \le M_{{\text{w}}} < M_{4}^{*} \hfill \\ 1 ,&\quad M_{{\text{w}}} \ge M_{4}^{*} \hfill \\ \end{array} \right.$$
(4)

where C1 and C2 are model parameters, M3* and M4* are reference magnitudes, and C2 is determined by the relationship between Mw, M3* and M4*.

Fig. 4
figure 4

Distribution of DS5-95 versus Rrup under different Mw: a subduction intraslab earthquakes; b subduction interface earthquakes

The red lines in Fig. 4 are the regression lines after determining the source and path terms and do not represent the final results. The same is true for the red solid line in Figs. 5, 6, 8 and 10. The specific form of the GMPEs is subject to the final result obtained by regression analysis.

Fig. 5
figure 5

Distribution of CM,e and C1,e versus Mw: a, b subduction intraslab earthquakes; c, d subduction interface earthquakes

Fig. 6
figure 6

Distribution of δWes versus VS30: a subduction intraslab earthquakes; b subduction interface earthquakes

3.2 Magnitude term

Abrahamson and Silva (1996), Kempton and Stewart (2006), and Afshari and Stewart (2016) used the Brune source model to express the source duration (i.e., magnitude term) as a function of the stress drop Δσ as follows:

$$F_{M} = \frac{{\left( {\Delta \sigma /M_{0} } \right)^{{ - \frac{1}{3}}} }}{{4.9 \cdot 10^{6} \beta }} = \frac{1}{{4.9 \cdot 10^{6} \beta }}\left\{ {\frac{{\exp \left[ {b_{1} + b_{2} \left( {M_{{\text{w}}} - M^{*} } \right)} \right]}}{{10^{{1.5M_{{\text{w}}} + 16.05}} }}} \right\}^{{ - \frac{1}{3}}}$$
(5)

where M0 is the seismic moment, β is the shear wave velocity at the source (usually taken as 3.2 km/s), b1 and b2 are regression coefficients, and M* is the reference magnitude. Although there is theoretical support for expressing the source effect of duration in this form, the stress drop is difficult to obtain accurately in practice. Afshari and Stewart (2016) reported that the duration and distance of each earthquake can be linearly fitted to obtain a linear function relationship, and then the duration when the distance is 0 is taken as the source duration. However, we have concluded that the source duration and stress drop obtained in this way are inaccurate, as after calculating, it is found that the source duration and stress drop of some earthquake events are negative (Afshari and Stewart pointed this out in their paper as well). The reason is that the relationship between duration and distance is not strictly linear within the whole range of Rrup but approximate linear or piecewise approximate linear; thus, using a linear function to fit the source duration leads to large errors. Moreover, the magnitude term given by Eq. (5) is relatively complex. Although Bahrampouri et al. (2021) adopted a variant of Eq. (5), the magnitude term is still in exponential form, which may not converge in regression analysis, and Bommer et al. (2009) concluded that the magnitude term should be equivalent to a simpler form to facilitate regression. Therefore, the present study adopts the variant of the magnitude term given by Bommer et al. (2009) and uses the method of Xing and Zhao (2021) to calibrate the magnitude. First, we assume that the GMPE is of the following form:

$$\ln D_{S,es} = C_{M,e} + F_{{{\text{path}}}}$$
(6)

where the subscripts e and s represent the number of earthquake event and the station, respectively. The site and depth are not considered for now, and the magnitude term is assumed to be a constant CM; that is, the magnitude term of the eth earthquake event is CM,e, and then the magnitude term is calibrated. In addition, to study whether there is a coupling phenomenon between the magnitude and the distance, the path term coefficient C1,e of the eth earthquake event will also be calibrated. Figure 5 shows the distribution of CM,e and C1,e versus Mw for DS5-95, CM,e is piecewise linear versus Mw and C1,e is also related to Mw. CM,e shows an upward trend with increasing Mw, while C1,e shows a decreasing trend. Therefore, the coefficients of the magnitude and path terms can be written as:

$$F_{M} = \left\{ {\begin{array}{ll} {a_{1} + a_{2} M_{{\text{w}}} ,} &\quad {4 \le M_{{\text{w}}} < M_{1}^{*} } \\ {a_{1} + a_{2} M_{1}^{*} + a_{3} \left( {M_{{\text{w}}} - M_{1}^{*} } \right),} &\quad {M_{{\text{w}}} \ge M_{1}^{*} } \\ \end{array} } \right.$$
(7)
$$C_{1} = \left\{ {\begin{array}{ll} {a_{4} + a_{5} M_{{\text{w}}} ,} &\quad {M_{{\text{w}}} \le M_{2}^{*} } \\ {a_{4} + a_{5} M_{2}^{*} + a_{6} \left( {M_{{\text{w}}} - M_{2}^{*} } \right),} &\quad {M_{{\text{w}}} > M_{2}^{*} } \\ \end{array} } \right.$$
(8)

where C1 is a model parameter in Eq. (3), a1 to a6 are regression coefficients, and M1* and M2* are reference magnitudes obtained from Fig. 5a, c and b, d by piecewise linear fitting, respectively.

3.3 V S30 term

The early GMPEs expressed the site effect as the site type, but in recent years, they usually incorporate VS30 to represent the site effect. VS30 is a smooth and continuous parameter that can provide a smooth transition from a rock site to a soft soil site when describing site effects (Kamai et al. 2016). Kempton and Stewart (2006), Afshari and Stewart (2016), and Bahrampouri et al. (2021) chose VS30 as the site effect parameter, so the present paper also uses VS30 to describe the impact of the site effect (i.e., VS30 term) on DS; the basin term is presented in the next section.

Figure 6 shows the distribution of VS30 versus intra-event residual δWes of various types of earthquakes after determining the magnitude and path terms. With increasing VS30, δWes gradually decreases, and the slope of the trend line changes when VS30 is greater than VS30*, where VS30* is a reference value calculated from Fig. 6 by piecewise linear fitting. Therefore, the expression of the VS30 term is as follows:

$$F_{VS30} = \left\{ {\begin{array}{ll} {a_{7} \ln V_{S30} ,} &\quad {V_{S30} \le V_{S30}^{*} } \\ {a_{7} \ln V_{S30}^{*} + a_{8} \left( {\ln V_{S30} - \ln V_{S30}^{*} } \right),} &\quad {V_{S30} > V_{S30}^{*} } \\ \end{array} } \right.$$
(9)

Figure 7 shows the scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by VS30. As VS30 decreases, the slope of scatter points becomes larger, and the peak value of the probability distribution histogram gradually moves to longer durations, which indicates that the duration has a negative correlation with VS30.

Fig. 7
figure 7

Scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by VS30: ah subduction intraslab earthquakes and ip subduction interface earthquakes

3.4 Basin term

It is difficult to quantify the basin effect due to the complexity. At present, only the basin depth has been considered in GMPEs: Zn (the depth to shear wave velocity of n km/s for the site, e.g., Z1, Z1.5 and Z2.5) and δZn (the difference between the depth to shear wave velocity of n km/s for the site and a median depth conditional on VS30, e.g., δZ1 δZ1.5, and δZ2.5). Zn can be obtained from the J-SHIS website. Note that J-SHIS directly provides Z1 and Z1.5, and Z2.5 is calculated by modified Akima cubic Hermite interpolation.

In this study, it is found that δZ1 and Z1 are selected as the basin depth indicators to have the best fitting effect (the AIC value of the GMPE is small). Considering that both Afshari and Stewart (2016) and Bahrampouri et al. (2021) used δZ1 as the basin depth index, we also choose δZ1 to describe the basin effect. Figure 8 shows the distribution of the intra-event residual δWes versus δZ1 after considering the magnitude, path and VS30 terms. When δZ1 is less than δZ1*, δWes increases with increasing δZ1, and greater than δZ1*, δWes is almost constant. Therefore, the basin effect term is written as follows:

$$F_{{{\text{basin}}}} = a_{9} \min \left( {\delta Z_{1} ,\delta Z_{1}^{*} } \right)$$
(10)

where δZ1* is a reference value calculated from Fig. 8 by piecewise linear fitting.

Fig. 8
figure 8

Distribution of δWes versus δZ1: a subduction intraslab earthquakes; b subduction interface earthquakes

Figure 9 shows a scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by δZ1. As δZ1 decreases, the slope of scatter points becomes smaller, and the peak value of the probability distribution histogram gradually moves to shorter durations, which indicates that the duration has a positive correlation with δZ1.

Fig. 9
figure 9

Scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by δZ1: ah subduction intraslab earthquakes and ip subduction interface earthquakes

3.5 Depth term

The influence of depth on ground-motion DS is not reflected in most studies, but Ztor is used in BSA09 and DW17, so we also consider the depth term in the GMPE. Figure 10 shows the distribution of the inter-event residual δBe versus Ztor after considering the magnitude, path, VS30 and basin terms. It can be found that δBe basically shows a decreasing trend with increasing Ztor. For subduction interface earthquakes, δBe does not change with increasing Ztor within the range of 50 km. When Ztor is greater than 50 km, δBe shows a decreasing trend. The expression of the depth term is as follows:

$$F_{{{\text{depth}}}} = a_{10} \max \left( {Z_{{{\text{tor}}}} ,Z_{{{\text{tor}}}}^{*} } \right)$$
(11)
Fig. 10
figure 10

Distribution of δBe versus Ztor: a subduction intraslab earthquakes; b subduction interface earthquakes

here, \(Z_{{\rm tor}}^{*}\) equals 50 km for interface earthquakes and 0 for intraslab earthquakes.

Figure 11 shows a scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by Ztor. As Ztor decreases, the slope of scatter points becomes larger, and the peak value of the probability distribution histogram gradually moves to longer durations, which indicates that the duration has a negative correlation with Ztor.

Fig. 11
figure 11

Scatter plot of DS5-95 versus Rrup and probability distribution of DS5-95, which are grouped by Ztor: ah subduction intraslab earthquakes and ip subduction interface earthquakes

3.6 Model performance

The intra-event residual δWes is further decomposed into the site-to-site residual δS2Ss and the intra-event single-site residual δWSes (Baltay et al. 2017; Chen and Faccioli 2013). Thus, Eq. (2) is rewritten as:

$$\ln D_{S,es} = \ln \hat{D}_{S,es} + \delta B_{e} + \delta W_{es} = \ln \hat{D}_{S,es} + \delta B_{e} + \delta S2S_{s} + \delta WS_{es}$$
(12)

where DS,es is the observation value of the duration, \(\hat{D}_{S,es}\) represents the predicted value calculated by the GMPE, the standard deviation of δWes is σ, the standard deviation of δBe is τ, and the corresponding standard deviations of δS2Ss and δWSes are ϕs2s and ϕss, respectively. The random effect model is used to regress the GMPE proposed in this study. The regression results of each coefficient, standard deviation and t-distribution statistics are shown in Tables 1 and 2. Table 3 shows the AIC values of the GMPEs considering different explanatory variables. From the t-distribution statistics of explanatory variables given in Tables 1 and 2, the values are all greater than 1.96, and the AIC values gradually decrease as the number of explanatory variables increases, which proves that each explanatory variable in the GMPE is significant and that the fitting effect of the prediction equation gradually tends to be optimal. In the GMPE, M1*, M2*, M3*, M4*, R*, VS30*, δZ1* and Ztor* are obtained by observation and trial and error, so these parameters have no standard deviation.

Table 1 Regression results of the GMPE for DS of subduction intraslab earthquakes
Table 2 Regression results of the GMPE for DS of subduction interface earthquakes
Table 3 AIC values of the GMPE considering different explanatory variables

Figures 12 and 13 show the distribution of residual of each explanatory variable in the GMPEs for DS5-95. The scatters of residuals are relatively evenly distributed on both sides of the zero line, and the trend line basically coincides with the zero line. The distribution of residuals in some explanatory variables is biased, which may be due to two reasons: (1) the data corresponding to the residual migration part are relatively sparse, which cannot provide sufficient data support for the regression analysis, and (2) the lack of accurate site information and fault models makes the data have deviations compared with the actual situation. However, in general, the deviation of the residuals is not large and in most cases is unbiased. Figure 14 shows the DS prediction values of DS with different magnitudes and different site conditions, the DS increases with increasing Mw and Rrup, and decreases with increasing VS30. Due to the influence of δZ1 and Ztor is not as significant as other parameters on DS, thus, this study does not give the prediction curves of DS under different δZ1 and Ztor. However, from the regression results, DS increases with increasing δZ1, and decreases with increasing Ztor. The above rules are consistent with the existing research results. Figure 15 shows the variation in DS of various types of earthquakes with the Rrup under different magnitudes. The prediction values of DS of subduction intraslab and interface earthquakes are significantly different, the prediction values of subduction interface earthquakes are generally longer than those of intraslab earthquakes. Figure 14a, c and Fig. 15 show that at Mw5, there is a prominent abrupt change in the slope of DS of subduction intraslab earthquakes, and at Mw7, the slope of DS of subduction intraslab earthquakes is slightly larger than that of interface earthquakes. Therefore, it can be concluded that the differences in earthquake types should be considered when discussing issues related to the duration, such as seismic risk assessment and structural response.

Fig. 12
figure 12

Distribution of residuals of the GMPE for DS5-95 of subduction intraslab earthquakes versus explanatory variables (the blue dots are residual values, the red solid line is the trend line, and the red area is the 95% confidence interval)

Fig. 13
figure 13

Distribution of residuals of the GMPE for DS5-95 of subduction interface earthquakes versus explanatory variables (the blue dots are residual values, the red solid line is the trend line, and the red area is the 95% confidence interval)

Fig. 14
figure 14

Predicted values of DS5-95 (δZ1 = 100 m, Ztor = 25 km): a subduction intraslab, VS30 = 760 m/s; b subduction intraslab, Mw = 6; c subduction interface, VS30 = 760 m/s; d subduction interface, Mw = 6

Fig. 15
figure 15

Predicted values of DS under different magnitudes: a DS5-75; b DS5-95 (VS30 = 760 m/s, δZ1 = 100 m, Ztor = 25 km)

3.7 Comparison to prior models

Figure 16 shows the comparison between the DS prediction model in this paper and the results of previous work. The prediction results of the subduction intraslab and interface earthquake models in this paper are basically between those of the BRG21 model and the JG21 model. However, the predicted values of the models proposed in this study are smaller than those of the BRG21 model. The reasons may be as follows: (1) in this study, we select records with PGA ≥ 10 gal, but BRG21 does not limit the PGA. (2) Rrup of the data set in this paper is within 300 km, and the Rrup of the subduction intraslab and interface earthquakes of BRG21 reaches 500 km. Since the discreteness of DS increases with increasing distance, there is great uncertainty in a prediction of DS at large distance, and when the distance exceeds 300 km, the PGA attenuates to a very low level, which has little impact on the structure. Data beyond 300 km are unnecessary for the follow-up work of this paper. (3) The selection of records in this paper is relatively strict. BRG21 requires SNR ≥ 2, while this paper requires SNR ≥ 3 to ensure higher data quality. BRG21 does not have a clear limit on the number of records for a single earthquake event, but to ensure the statistical significance of the data, it is required that the records of a single earthquake event should not be less than 10 in this study. These constraints cause some data to be ignored, so that the number of records used in this paper is less than that of BRG21, but the database in this study has higher quality than that of BRG21. In addition, the way the focal mechanism solution is obtained can also affect the dip angles of the fault. The criteria for judging fault types, earthquake types, and the selection of the geometry model of the subduction zone are different, resulting in differences in the final earthquake classification. In this study, the geometry model of the subduction zone is selected as the latest Slab 2.0, and the criteria for judging fault types and earthquake types are also the most commonly used and convincing.

Fig. 16
figure 16

Comparison of predictions of different models for significant duration with results from this study: a, b, c, g, h, i subduction intraslab earthquakes and d, e, f, j, k, l subduction interface earthquakes (VS30 = 760 m/s, δZ1 = 100 m, Ztor = 25 km)

Although there are differences between BRG21 and the models in this study, at present, there are very few GMPEs for DS of the ground motion in the subduction zone of Japan. The results and conclusions of this paper need to be tested after the research results in this area are continuously enriched and improved. However, we simplifie the traditional magnitude term and directly uses Mw to recalibrate it, which reduces the error caused by calculating the source duration from the stress drop. We also consider a focal depth term in the GMPEs, which helps improve the prediction accuracy of the model. In addition, the authors also investigated the influence of fault type, volcanic prearc and postarc distances and volcanic area distance (Zhao et al. 2016a, b, c) on DS, but the results show that the above factors have no significant influence on DS of this database. Therefore, the terms corresponding to the above factors are not added to the GMPE. With the continuous expansion of the database, the GMPE in this study is planned to be updated, and the authors plan to improve the database and the GMPEs in future work.

Note that although Rrup of the database used in this study reaches 300 km, Rrup of most earthquake events (especially for Mw < 5.5) is much less than 300 km and basically reaches only 200 km; the data beyond 200 km are relatively scarce. Moreover, with increasing distance, the discreteness of duration also gradually increases, especially when Mw ≥ 6 and Rrup ≥ 200 km, and the distribution range of duration can reach tens of seconds. Therefore, we suggest that the distance range of the GMPEs proposed in this study be within 200 km. The distance range of the database should reach 350 km or even 400 km to ensure higher accuracy in predicting DS of Rrup close to 300 km.

4 Spatial correlation of intra-event residuals

4.1 Semivariogram and spatial correlation coefficient

As discussed, the GMPE is a ground-motion probability distribution model established based on parameters such as magnitude, distance, and site. The inter-event residual in the GMPE describe the average deviation away from the median GMPE for any one event, while the intra-event residual is the remaining variation of a single record about the event-specific average GMPE. For the same event, closer sites show stronger correlation of the ground-motion parameters, constituting spatial correlation of ground motion (Chen et al. 2020; Goda and Hong 2008; Jayaram and Baker 2009).

Evaluating the correlation of intra-event residuals is usually divided into the following steps: (a) calculating the intra-event residuals of the prediction model; (b) fitting the semivariogram of the residual in the event; and (c) calculating the spatial correlation coefficient of the residual in the event. Fitting the semivariogram (step (b)) is the most critical step. A semivariogram is one of the indicators describing spatial correlation in geostatistics. If the semivariogram depends only on the separation distance h rather than the actual position, the semivariogram is considered to be second-order stationary, and the expression is as follows:

$$\hat{\gamma }(h) = \frac{1}{2N(h)}\sum\limits_{i = 1}^{N(h)} {\left( {\varepsilon_{ij} - \varepsilon_{ik} } \right)^{2} }$$
(13)

where εij and εik represent the intra-event residuals of the jth and kth sites of the ith earthquake event, respectively, i = 1, 2, …, N(h), and N(h) is the number of stations [ai, bi] satisfying h–Δh/2 ≤|aibi|≤ h + Δh/2. To ensure the reliability of the statistics, Δh = 2 km. The above formula is the method of moments proposed by Matheron (1962). Cressie (1985) proposed a more robust estimator, which is relatively insensitive to discrete values:

$$\hat{\gamma }(h) = \frac{1}{2}\left\{ {\frac{{\left[ {\frac{1}{|N(h)|}\sum\limits_{i = 1}^{N(h)} {|\varepsilon_{ij} - \varepsilon_{ik} |^{0.5} } } \right]^{4} }}{{0.457 + \frac{0.494}{{|N(h)|}}}}} \right\}$$
(14)

Therefore, Eq. (14) is selected as the estimator of the semivariogram in this paper. N(h) should be as large as possible. Esposito and Iervolino (2012) and Du and Wang (2013) suggested that N(h) should be at least 30, while Wagener et al. (2016) suggested that N(h) should be at least 100 to ensure more reliable and representative estimation results. The parameter should not be too small, at least not less than 30. However, when is N(h) too large, it is difficult to accurately evaluate the spatial correlation coefficients because there are too few data records that meet the requirements. Based on repeated reductions in N(h) and corresponding evaluation, we suggest that N(h) = 50 is more appropriate.

The continuous semivariogram can be fitted from the estimated value of Eq. (14). There are many commonly used semivariogram fitting models based on 2nd-order stationarity and isotropy. In this study, the exponential model is selected because the exponential model is considered to be the model with the best fitting effect (Jayaram and Baker 2009). The exponential model selected is represented by the following formula:

$$\gamma (h) = a\left[ {1 - \exp \left( { - \frac{3h}{b}} \right)} \right]$$
(15)

where a and b are the sill and the range of the semivariogram model, respectively. The sill a is equal to the variance in the intra-event residual ε, and the range b is the separation distance corresponding to the semivariogram equal to 0.95 times the sill. The intra-event residual is normalized; that is, the intra-event residual ε is divided by its standard deviation φ. Then, Eq. (15) can be simplified as:

$$\gamma (h) = 1 - \exp \left( { - \frac{3h}{b}} \right)$$
(16)

The choice of the standard deviation φ is extremely important here, and Esposito and Iervolino (2012) use the standard deviation of the intra-event residuals regressed from the GMPEs to be normalized as φ. However, since the standard deviation of intra-event residual of each earthquake event is different, using a fixed standard deviation leads to a biased intra-event correlation. Therefore, each earthquake event should use its own standard deviation (Foulser-Piggott and Goda 2015; Schiappapietra and Douglas 2020). After normalization, the spatial correlation coefficient can be calculated:

$$\rho (h) = 1 - \gamma (h) = \exp \left( { - \frac{3h}{b}} \right)$$
(17)

Since the intra-event residual δW can be decomposed into the site-to-site residual δS2S and the intra-event single-site residual δWS according to Eq. (12), the semivariogram can be used not only to measure the spatial correlation of intra-event residual but also to analyze the spatial correlation of intra-event single-site residual. Therefore, this study also gives the spatial correlation analysis results based on the intra-event residual δW and the intra-event single-site residual δWS.

4.2 Results of spatial correlation models

The semivariogram values at each separation distance of various types of earthquakes are fitted, referring to Foulser-Piggott and Goda (2015), and the fitting value of range b is shown in Table 4. Figures 17 and 18 show the fitting results of the semivariogram and spatial correlation coefficient for DS5-95. The spatial correlation coefficient of DS decreases rapidly with increasing separation distance; when the separation distance reaches 100 km, the spatial correlation coefficient is reduced to approximately 0. In addition, the spatial correlation curve of DS of a single earthquake event is basically the same as that of the combined earthquake event, but the spatial correlation curve of a single event has a large variability, especially when the separation distance is large. Both the spatial correlation coefficient based on δWS and that based on δW show the characteristics of decay with increasing separation distance, indicating that the spatial correlation is related not only to the site effect but also to factors unrelated to site conditions such as path effects. The above conclusions are consistent with the results of the spatial correlation analysis of the Arias intensity by Foulser-Piggott and Goda (2015). Figure 19 shows the comparison of spatial correlation curves for DS of different types of earthquakes. With increasing separation distance, the attenuation rate of the spatial correlation coefficient of subduction interface earthquakes grows slower than that of subduction intraslab earthquakes.

Table 4 Fitting values of ranges for semivariogram exponential models
Fig. 17
figure 17

Fitting results of the semivariogram and spatial correlation coefficient for DS5-95 of subduction intraslab earthquakes: a fitting results of the semivariogram (based on δWS); b spatial correlations (based on δWS); c fitting results of the semivariogram (based on δW); d spatial correlations (based on δW)

Fig. 18
figure 18

Fitting results of the semivariogram and spatial correlation coefficient for DS5-95 of subduction interface earthquakes: a fitting results of the semivariogram (based on δWS); b spatial correlations (based on δWS); c fitting results of the semivariogram (based on δW); d spatial correlations (based on δW)

Fig. 19
figure 19

Comparison of DS spatial correlation curves: a, b Spatial correlations based on δWS and c, d spatial correlations based on δW

5 Conclusions

In this study, for subduction intraslab and interface earthquakes in Japan, we first consider the effects of Mw, Rrup, VS30, δZ1, and Ztor on DS. Then, GMPEs for DS5-75 and DS5-95 are developed, and the rationality and reliability of these GMPEs are proven by residual analysis and comparison with previous models. Finally, based on the intra-event residuals, the spatial correlation of DS is briefly analyzed. The results show that (1) DS increases with increasing Mw, Rrup and δZ1 and decreases with increasing VS30 and Ztor, and the path term is dependent on Mw. (2) DS of subduction interface earthquakes is greater than that of subduction intraslab earthquakes on the whole; and under large magnitudes, the slope of DS of subduction intraslab earthquakes is slightly greater than that of subduction interface earthquakes; the differences of the earthquake types should be taken into account when discussing issues related to the duration of ground motions such as seismic risk assessment and structural response. (3) The spatial correlation of DS is related not only to site effects but also to factors unrelated to site conditions, such as path effects, and the spatial correlation coefficient decreases with increasing separation distance. When the separation distance reaches 100 km, the spatial correlation coefficient decreases to approximately 0, but the spatial correlation coefficient attenuation rate of the subduction interface earthquake is slower than that of the intraslab subduction earthquake. (4) The variation trend of the spatial correlation curve of a single earthquake event is basically consistent with the overall average spatial correlation curve of earthquake events, but the spatial correlation curve of a single earthquake event has great variability, especially for large separation distances.