1 Introduction

Site amplification, or the increase in amplitudes of seismic waves as they traverse soft soil layers near the Earth’s surface, is a major factor influencing the extent of earthquake damage to structures (Borcherdt 1970; Nakamura 1989; Lermo and Chávez-García 1994; Field and Jacob 1995). Thus understanding of site-specific amplification effects and their role in determining ground motions is important for the design of engineered structures. The methods utilized for site effect analysis from recorded motions can be divided into three general approaches. A standard spectral ratio (SSR) approach has been applied to determine the ratio of the motions recorded on a soil site to those recorded on a nearby reference rock site (Borcherdt 1970; Hartzell 1992; Field and Jacob 1995; Kato et al. 1995; Field 1996; Hartzell et al. 1996; Su et al. 1996; Bonilla et al. 1997). In this method, it is assumed that the reference rock site does not amplify ground motion. However, it is difficult to find a true reference site due to the prevalence of a weathered layer near the surface everywhere (Steidl et al. 1996); the reference rock site may also have significant amplification due to impedance effects associated with its shear-wave velocity gradient. Moreover, if the reference rock site is not very close to the target soil site, then the incident wave field may not have the same characteristics. Other variants of the spectral ratio approach include the use of cross-spectral ratios (Safak 1991; Steidl 1993), response spectral ratios (Kitagawa et al. 1992), ratios of peak RMS (root-mean-square) or effective accelerations (Borcherdt and Wentworth 1995), and blind deconvolution techniques (Zerva et al. 1995).

The second method uses generalized inversion techniques to characterize site amplification. In this method, the source, path, and site characteristics are identified simultaneously (Andrews 1986; Boatwright et al. 1991; Hartzell 1992); reviews of this method can be found in Bard (1994) and Field and Jacob (1995). The advantage of this method is in providing the absolute amplification/de-amplification of body waves (S-wave) at a site. However, like the SSR technique, the generalized inversion method also suffers from the need to define a reference site.

The third common method of quantifying the site effects is known as the single-station method (Langston 1979; Nakamura 1989). This technique is based on the horizontal-to-vertical spectral ratio of the motions at a site, which is convenient in cases where no reference site is available. Reviews of this method can be found in Lachet and Bard (1994), Field and Jacob (1995), and Atakan (1995).

In this study, we use a technique based on cross-spectral ratios, a more direct variant of the SSR approach, in which the reference site is at the bottom of a borehole, directly below the soil site. Borehole data are useful in measuring site effects. For example, borehole measurements have provided direct in situ evidence of nonlinearity (Seed and Idriss 1970; Wen et al. 1994; Zeghal and Elgamal 1994; Iai et al. 1995; Sato et al. 1996; Aguirre and Irikura 1997; Satoh et al. 2001a); for surface-rock recordings they can be used as input motion to soil columns (Satoh et al. 1995; Steidl et al. 1996; Boore and Joyner 1997); and they have provided information about scaling of earthquakes of different magnitudes (Kinoshita 1992; Abercrombie 1997). The problems associated with using borehole data as the input to a site transfer function were discussed by Cadet et al. (2012). The most significant effect is the interference between the incident wave and the reflection from the surface and any other velocity contrasts (discontinuities) in the soil column. The destructive interference of the incident wave field and the down-going waves may produce holes in the ground-motion spectral ratio (Steidl et al. 1996). Consequently, a direct spectral ratio between the surface and the total motion at depth generally produces pseudo-resonances where these holes are present. These phenomena are referred to as the depth effect (Cadet et al. 2012). These effects should be taken into account when using surface-to-borehole spectral ratios (S/B) for computing the transfer function of a site, though the best means of doing so is a controversial issue.

Site amplification effects were profound for the 2011 M9.0 Tohoku event. As an illustration, Fig. 1 compares surface and borehole ground motions at station MYGH04; the location of this station relative to the fault plane is indicated by a black square in Fig. 2. Amplitudes of ground motions at the borehole level (PGA[borehole] \(\approx 100\,\text{ cm}/\text{ s}^{2})\) are amplified by a factor of 5 at the surface (PGA[surface]); this large amplification is due to the presence of a shallow low-velocity soil layer, about 4 m thick, that overlies stiff bedrock.

In this study, we perform a thorough analysis of site amplification during the Tohoku event (as well as other Japanese events of M \(>\) 5.5). We take advantage of the vertical array of the KiK-net (KIBAN kyoshin network: http://www.kik.bosai.go.jp/) including the enriched database of the NIED (National Research Institute for Earth Science and Disaster Prevention: http://www.bosai.go.jp/e/) to estimate the site transfer functions for all sites, considering both linear and nonlinear site effects. The applicability of the H/V spectral ratio technique as an alternative tool to estimate site response is evaluated by modeling the relationship between H/V ratios and S/B ratios, as a function of physical site properties. We examine several approaches for site amplification estimation, as a means of checking the robustness of the functions we determined based on S/B ratios. We develop a suite of simple, useful, and reliable models for prediction of site amplification effects based on readily-obtainable site parameters. Finally, we develop an empirical model of ground motions for the Tohoku earthquake (including site effects) and compare it to the predictions of several existing ground motion prediction equations (GMPEs) commonly used in seismic hazard analysis applications.

Fig. 1
figure 1

Comparison of observed surface and borehole ground motions for MYGH04 station of the KiK-net. The values at the end of each trace list the peak ground acceleration in \(\text{ cm}/\text{ s}^{2}\) for East-West (EW), North–South (NS), and Vertical (UD) components, respectively. All the records are plotted on the same scale. Time series in black are recorded on the surface, and those in green are recorded in the borehole. The plot on the right is the shear-wave velocity profile of the corresponding site. The site is at \(\sim 91\,\text{ km}\) from the fault plane and categorized as the NEHRP site class C (very dense soil and soft rock), with \(\text{ V}_\mathrm{S30} = 850 \text{ m}/\text{ s}\). Seismometer is installed at depth of 100 m at this station

2 Strong ground motion data and record processing

The strong-motion data used in this study were collected from the KiK-net of the NIED of Japan. The KiK-net consists of 687 strong-motion observation stations installed both on the ground surface and at the bottom of boreholes (directly below the surface site). Figure 2 shows the distribution of the KiK-net stations triggered by the Tohoku event. Detailed descriptions of the other strong motion networks that recorded ground-motions of the 2011 Tohoku earthquake can be found in Aoi et al. (2011) and Midorikawa et al. (2012).

To enable a statistically-robust and meaningful analysis of site effects, we supplement the Tohoku-event data by adding other events of \(\mathbf{M }\ge 5.5\) that were recorded on all KiK-net stations from 1998 to 2009. For the 687 KiK-net stations, we have processed and analyzed 30,453 records from 258 earthquakes in total. The number of events for each station varies from 4 to 150. The processing procedure for all records includes windowing, correction for baseline trend, and band-pass filtering. We have applied non-causal, band-pass Butterworth filters with an order of 4. The selected frequency range of analysis is 0.1–15 Hz. The lower frequency limit was selected by inspecting many records, and concluding that this value is appropriate to produce well-shaped displacement time series, with a flat portion of the displacement spectrum at low frequencies. The upper limit is chosen by considering the cut-off frequency of the seismograph response spectrum (15 Hz). For each record, we compute the instrument-corrected Fourier amplitude spectrum (FAS) of accelerograms.

3 Calculation of site response using surface-to-borehole spectral ratio (S/B)

Surface-to-borehole spectral ratios (S/B) are used to provide a direct measure of site response. However, destructive interference between the up-going incident wave field and down-going reflected waves from the surface at specific frequencies can produce a notch in the FAS of the borehole recording (Shearer and Orcutt 1987; Steidl et al. 1996). For an undamped single layer over an elastic half space, this notch can be interpreted as the infinite value of the transfer function (Zhao 1996) between the total displacement at the surface soil site and the total displacement at the bottom of the soil layer (borehole record) (J. Zhao, personal communication, 2012). The existence of this hole in the borehole spectrum could produce peaks in the spectral ratios that might be misinterpreted as site-response peaks. Figure 3 schematically illustrates the problem. In this figure, the amplitude spectrum of incident waves is denoted “A”, while that of the reflections is denoted “B”. It is assumed that the soil profile consists of n layers overlying bedrock. As the incident and reflected waves always constructively interfere at the surface, the total amplitude for the outcrop rock would be 2A\(_{n}\), while for the station over the soil column it would be 2A\(_{1}\). The ratio of motions recorded at the surface on soil to those on bedrock can be defined as:

$$\begin{aligned} \frac{S}{B} = \frac{2A_1 }{\left( {A_n + B_n } \right)} \end{aligned}$$
(1)

The factor of two in the numerator represents the free-surface amplification. At the borehole level, the down-going reflected waves are interfering destructively and will reduce the apparent amplitude of the incident waves, thus obscuring the actual site response.

Fig. 2
figure 2

The spatial distribution of all KiK-net stations (black dots) and those stations recorded the Tohoku event (green dots). The star is the epicenter. The major tectonic boundaries—the trench and the volcanic front—are represented by dashed black and red lines, respectively. A blue rectangle shows the fault plane obtained from the GPS Earth Observation Network System (GEONET) data analysis (http://www.gsi.go.jp/)

Fig. 3
figure 3

Schematic representation of site amplification. Soil profile consists of n layers where incident and reflected waves for each layer are denoted as \(\text{ A}_\mathrm{i}\) and \(\text{ B}_\mathrm{i }\) (i = 1 to n), respectively. Amplification can be defined as the surface-to-borehole (S/B) spectral ratio [Eq. (1)] or surface-to-outcrop (standard) spectral ratio (SSR: \(\text{2A}_{1}/\text{2A}_\mathrm{n})\)

A possible solution to the destructive interference problem involves the use of cross-spectrum techniques (Safak 1991; Field et al. 1992; Steidl 1993; Bendat and Piersol 2006), which consider the coherency of the waveforms at the surface and down-hole. Taking into account the effect of the reflected wave-field, S/B ratios are multiplied by the coherence (\(C ^{2})\) between surface and borehole recordings (Steidl et al. 1996). The coherence is defined as:

$$\begin{aligned} C^{2} = \frac{\left| {S_{12} \left( f \right)} \right|^{2}}{S_{11} \left( f \right)S_{22} \left( f\right)^{\prime }} \end{aligned}$$
(2)

and thus the depth-corrected cross spectral ratios are given by:

$$\begin{aligned} S/B^{{\prime }} = C^{2}\left( {S/B} \right), \end{aligned}$$
(3)

where \(S _{11}(f)\) and \(S _{22}(f)\) are the power spectral densities of the seismograms recorded at the surface and down-hole, respectively, \(S _{12}(f)\) is the cross-power-spectral density function (Steidl et al. 1996), and \(\text{ S}/\text{ B}^{\prime }\) is the S/B ratio corrected for the depth effect. It should be mentioned that a loss in coherence could be due to the site response itself, and not just to the destructive interference effect.

Another possible way to correct the S/B ratio for the depth effect is to use empirical correction factors as proposed by Cadet et al. (2012). The correction function in their paper is an empirical version of an analytical solution provided by Zhao (1996; 1997) for a multiple-layer 1-D model. The key parameter in the correction factors is the fundamental resonance frequency (\(\text{ f}_{0})\) of a site. In their paper, the fundamental resonance frequency was defined as the frequency corresponding to the first (i.e., lowest frequency) peak of the S/B ratio. A “peak” is defined as a specific local maximum with amplitude larger than 2 in a statistical sense (Cadet et al. 2012). Knowing the average shear-wave velocity (\(\text{ V}_\mathrm{S})\) over the depth of the borehole (\(\text{ D}_\mathrm{b})\), the so-called first destructive interference frequency (\(\text{ f}_\mathrm{d1})\) of the soil column can also be estimated using \(\text{ f}_\mathrm{d1} = \text{ V}_\mathrm{S}/4\text{ D}_\mathrm{b}\).

To illustrate the issues, Fig. 4 shows recorded FAS and S/B ratios for two representative sites (ABSH10 and AICH12). Based on the definition of \(\text{ f}_{0}\) given by Cadet et al. (2012) (i.e. the lowest frequency peak with amplitude \(>\) 2), we might choose \(\text{ f}_{0} \sim 2.0\,\text{ Hz}\) for both stations. Referring to Fig. 4, we can see that the increase in the amplitude of S/B for ABSH10 near 2.0 Hz is due to a corresponding drop of FAS at the borehole level because of downward reflections. However, the significant amplification at the surface in AICH12 near 2.0 Hz is clearly due to actual site amplification; it is not due to interference of incident and reflected waves at the borehole, and should therefore not be reduced using correction factors. We found that, in general, empirical correction factors and cross-spectral ratio techniques produced similar results in terms of eliminating the depth effect. However, we adopt the cross-spectral ratio techniques to calculate \(\text{ S}/\text{ B}^{\prime }\) (as defined by Eq. (3)) as it is easier to apply and less subject to misinterpretations (Fig. 4). We acknowledge that there is some controversy in the best approach to correct S/B ratios for the depth effect, and even in the meaning of the spectral holes that can be seen in S/B ratios (J. Zhao, personal communication, 2012). Therefore, we have used several lines of reasoning to investigate the robustness of our \(\text{ S}/\text{ B}^{\prime }\) ratios as a measure of the site transfer function.

One check on the amplifications derived using \(\text{ S}/\text{ B}^{\prime }\) ratios is to compare them to those computed theoretically based on the site soil profile. In order to estimate the theoretical site response we used the program Nrattle (written by C. Mueller with modification by R. Herrmann), which calculates the 1-D transfer function of a layered, damped soil over an elastic bedrock, for a vertically propagating shear-wave (SH), using a Thomson–Haskell approach (Thomson 1950; Haskell 1960). The Nrattle solution was found to be exactly equivalent to the solution computed by the equivalent-linear site response program SHAKE (Schnabel et al. 1972) for linear modulus reduction and damping curves (Thompson et al. 2009). The input data for Nrattle are the layered velocity model, specifying the thickness, density, shear-wave velocity, and Quality factor for each layer. For the density and Quality factor, we used values reported by Cadet et al. (2012). The other input parameters are the velocity and density of the half-space, incident angle, and the depth with respect to which the transfer function is calculated. This reference depth is the depth of installation for borehole sensors. We compared the fundamental frequencies and amplitude of amplification as obtained from the theoretical transfer function with the corresponding values as deduced from the \(\text{ S}/\text{ B}^{\prime }\) ratios (after correction for the depth effect), for 167 sites where there is a clear, statistically-significant single peak in the H/V ratios. We found close correlation between the theoretical transfer functions and observations, in agreement with similar results from other studies (Steidl et al. 1996; Cadet et al. 2012). Observed and predicted fundamental frequencies are strongly correlated (correlation coefficient of 0.95), with standard deviation of residuals (empirical \(\text{ S}/\text{ B}^{\prime }\)—theoretical \(\text{ S}/\text{ B}^{\prime }\)) of \(\sigma _\mathrm{residual} = 0.08\) (log base 10). The agreement between \(\text{ S}/\text{ B}^{\prime }\) and theoretical amplification amplitude is not as close as that for the fundamental frequencies, with a mean residual of almost zero and \(\sigma _\mathrm{residual} = 0.23\). The greater scatter between observed and theoretical site response amplitudes could be the result of errors in the parameters specifying the velocity profiles, densities, and Quality factors used for computing the transfer functions.

Fig. 4
figure 4

Comparing the amplification functions at two different stations. The top row is showing S/B ratio (horizontal component) for the Tohoku event data (green line) along with the corresponding average for all KiK-net data (solid black line), and its \(\pm \) 1\(\sigma \) bounds (shaded gray). The bottom row plots the FAS of horizontal components (geomean) for Tohoku, for both surface and borehole ground motions. The solid lines are the smoothed spectra

Good agreement with the results obtained by the cross-spectral ratios (\(\text{ S}/\text{ B}^{\prime }\)) from this study and the empirical correction factors using the techniques of Cadet et al. (2012) further indicates the robustness of our calculated site response functions; correlation coefficients at almost all frequencies are greater than 0.8 (for frequencies \(>\) 0.4 Hz, correlation coefficients are greater than 0.9). It should be mentioned that \(\text{ S}/\text{ B}^{\prime }\) ratios may not represent the desired transfer function for sites with significant feedback at the borehole sensor (Safak 1991, 1997). Nonetheless, this is not the case for many sites as shown in Fig. 4. On the other hand, Safak (1997) illustrated that \(\text{ S}/\text{ B}^{\prime }\) ratios provide probabilistically more accurate estimates than the traditional SSR ratios by proving that they represent the least-squares estimate of site amplification if noise signals are assumed to be Gaussian random processes (Assimaki et al. 2008).

Finally, we checked key aspects of the transfer function using empirical regression (as in a traditional GMPE) to determine the amplification term for \(\text{ V}_\mathrm{S30}\) (relative to a reference of 760 m/s). This confirmed that the amplifications via \(\text{ S}/\text{ B}^{\prime }\), which applies to stations with shear-wave velocity at the bottom of the borehole (\(\text{ V}_\mathrm{S}\)[depth]) \(\ge 760\,\text{ m}/\text{ s}\), and via regression for the same condition give similar peak and frequency.

4 Relationship between amplification and site parameters

4.1 Linear site amplification

In this section, we examine the relationship between site amplification (as given by \(\text{ S}/\text{ B}^{\prime }\) ratios) and site variables describing the depth and stiffness of the deposit. As a commonly-used index parameter for the shear-wave velocity profile, we use the average shear-wave velocity in the uppermost 30 m (Borcherdt 1992, 1994):

$$\begin{aligned} V_{S30} = {30}\left.{\sum _{i = 1}^N {\left( {\frac{d_i }{v_i }} \right)} }\right., \end{aligned}$$
(4)

where \(\text{ d}_{i}\) and \(\text{ v}_{i}\) denote the thickness (in meters) and shear-wave velocity of the \(i\)th layer respectively, and N is the total number of layers. To calculate \(\text{ V}_\mathrm{S30}\) for each site, we use the site velocity profile of the KiK-net stations, extended to the depth of 100–200 m as recommended by Boore et al. (2011).

Figure 5 explores the relationship between site amplification and \(\text{ V}_\mathrm{S30}\), considering all KiK-net data. In this figure, \(\text{ V}_\mathrm{S}\)[depth] is the value at the depth of installation; we assume that, for holes that extend to hard-rock conditions (\(\text{ V}_\mathrm{S} > 1,500\,\text{ m}/\text{ s}\)), the vertical component at that depth is essentially unamplified, and the horizontal component has only a small potential amplification (as represented by the H/V ratios at the bottom). As seen in Fig. 5, the average \(\text{ S}/\text{ B}^{\prime }\) ratio increases with frequency, as does the scatter among data points. We see some evidence for greater amplification, for the same \(\text{ V}_\mathrm{S30}\), if \(\text{ V}_\mathrm{S}\)[depth] \(\ge 760\,\text{ m}/\text{ s}\), due to impedance effects. Furthermore, the amplification for the data in the range 760–1,500 m/s appears to be similar to that for \(\text{ V}_\mathrm{S}\)[depth] \(> 1,500\,\text{ m}/\text{ s}\). We therefore conclude that for characterizing the overall site amplification, we should use the \(\text{ S}/\text{ B}^{\prime }\) ratio data for \(\text{ V}_\mathrm{S}\)[depth] \(\ge 760\,\text{ m}/\text{ s}\). We perform a simple least-squares regression to determine the amplification for each frequency as a function of \(\text{ V}_\mathrm{S30}\):

$$\begin{aligned} \log _{10} \left( {S/{B^{{\prime }}}} \right) = m.\log _{10} \left( {{V_{S30} }/{V_{ref} }} \right) + b, \end{aligned}$$
(5)

where \(\text{ V}_\mathrm{ref} = 760\,\text{ m}/\text{ s}\). This regression will give us an estimate of the site term to remove from the data, if we wish to correct them to equivalent values for bedrock, \(\text{ V}_\mathrm{S30} = 760\,\text{ m}/\text{ s}\) (B/C boundary site conditions according to the National Earthquake Hazard Reduction Program, NEHRP, provisions) in a generic way. The regression coefficients (amplification factors) are tabulated in Table 1 for the frequency range of 0.1–15 Hz. Table 1 contains site amplification coefficients using just the Tohoku event and those obtained using all KiK-net data (M \(\ge \) 5.5; 1998–2009).

Fig. 5
figure 5

Amplification \(\text{ S}/\text{ B}^{\prime }\) (horizontal components) for the KiK-net stations relative to \(\text{ V}_\mathrm{S30}\), using all events from 1998 to 2009. Sites are categorized into four groups based on their \(\text{ V}_\mathrm{S}\)[depth] which is shear-wave velocity at the depth of installation. The amplification of the vertical component \(\text{ S}/\text{ B}^{\prime }\)(vert) is also plotted versus \(\text{ V}_\mathrm{S30}\)

Table 1 Coefficients for site correction factors, horizontal component

Another important site parameter, in addition to site stiffness, is the fundamental resonance frequency. The fundamental frequency depends on both layer depth and its stiffness. The mapping of predominant frequency of amplification facilitates the micro-zonation (Lachet and Bard 1994). Furthermore, the fundamental frequency may carry information on deeper part of the soil column, in comparison to \(\text{ V}_\mathrm{S30}\) which considers only the top 30 m. The fundamental frequency is obtained for the KiK-net data using the peak of the H/V spectral ratios of the recorded strong motions at the sites (Lermo and Chávez-García 1993;Theodulidis and Bard 1995; Zhao et al. 2006a; Fukushima et al. 2007), considering all events of M \(\ge \) 5.5 recorded at each station from 1998 to 2009. Only those stations that recorded at least three events are considered. The peak of the H/V ratios may be ambiguous as an indicator of fundamental frequency in the case of close proximity to sources or in the absence of a marked impedance contrast at some depth. However, when averaged over multiple events, the peak H/V ratio provides a stable estimate of fundamental frequency that is well-correlated with that obtained from the peak of the \(\text{ S}/\text{ B}^{\prime }\) ratio.

Considering all events recorded at each station, we have calculated the mean value of horizontal-to-vertical spectral ratios at both surface H/V[surface] and at the bottom of the borehole H/V[depth]. The latter is helpful in detecting the possible presence of a large impedance contrast at deeper sections of the borehole. To determine the H/V ratios, the following procedure was applied:

  1. 1)

    Apply a 5 % cosine taper function to the signal window;

  2. 2)

    Calculate FAS of the three components (NS, EW, UD), for both surface and borehole ground-motions;

  3. 3)

    Smooth the FAS using a Konno-Ohmachi window with a bandwidth parameter, b, of 20 (Konno and Ohmachi 1998);

  4. 4)

    Obtain the geometric mean of the two horizontal components ((\(\text{ FAS}_\mathrm{EW }\times \text{ FAS}_\mathrm{NS}) ^{0.5})\);

  5. 5)

    Calculate the ratio of FAS(H) to FAS(V) at the surface and the borehole (H/V[surface] and H/V[depth]).

Despite varying amplitudes and predominant frequencies, the H/V[surface] ratios can be grouped based on their overall shape. About 60 % of the stations show a single clear H/V “peak” which is associated with the fundamental frequency of the site; a peak is defined as a local maximum where the amplitude is greater than twice the mean level of H/V[surface]. The rest of the spectral ratios can be divided into groups having two peaks (20 %), more than two peaks (jagged shape), or no significant peak.

At the bottom of the borehole, we would expect H/V[depth] to be unity, if this is at bedrock level (Nakamura 1989; Theodulidis et al. 1996). However, for the KiK-net stations the borehole sensor is not always placed at bedrock. By inspection of H/V[depth] compared to H/V[surface], there are often peaks at very low-frequencies, which could be due to significant velocity contrast beneath the down-hole sensor (Cadet et al. 2012).

We would expect the fundamental frequency to be related to the depth-to-bedrock (i.e. \(\text{ f}_{0} = \text{ V}_\mathrm{S}/4\text{ H}_\mathrm{B}\); Dobry et al. 2000). The depth-to-bedrock, defined by the depth of a layer with \(\text{ V}_{S \ge } 760\,\text{ m}/\text{ s}\), or to a significant impedance contrast between surface soil deposits and material with \(\text{ V}_\mathrm{S} \quad \cong 760\,\text{ m}/\text{ s}\), is obtained for each site from the velocity profile. Figure 6 plots the fundamental site frequency as a function of \(\text{ V}_\mathrm{S30}\). As \(\text{ V}_\mathrm{S30}\) is a common site parameter in engineering seismology, its relationship with other site parameters is important and useful. It is interesting that \(\text{ V}_\mathrm{S30}\) appears to be a good proxy to estimate the natural frequency of a site. Furthermore, the fundamental frequency inferred from the H/V ratios matches well with that derived from the theoretical relation (\(\text{ f}_{0} = \text{ V}_\mathrm{S} /4\text{ H}_\mathrm{B})\). Strictly speaking, this expectation is only true for a soil layer with a constant \(\text{ V}_\mathrm{S}\) profile over a thickness of H\(_{B}\). The frequency \(\text{ f}_{0} = \text{ V}_\mathrm{S}\)/4H\(_{B}\) is also close to the first modal frequency for a layered 1-D site. It is worth pointing out that the correlation between \(\text{ V}_\mathrm{S30}\) and \(\text{ f}_{0}\) is not applicable for deep low-frequency sites (Zhao and Xu 2012b). In Fig. 6, the upper limit on site frequency at 10 Hz may be artificial due to the limitations of the instruments and filtering - we would not be able to see peaks at shorter periods clearly.

Fig. 6
figure 6

Fundamental site frequency as a function of \(\text{ V}_\mathrm{S30}\). Gray squares are observations for which a clear peak frequency is indicated by the H/V ratio. The green triangles are fundamental site frequencies calculated using the equation \(\text{ f}_{0} = \text{ V}_\mathrm{S}/4\text{ H}_{B}\), where H\(_{B}\) is the known depth-to-bedrock. The blue circles are observed fundamental frequencies of NGA data. Lines are the best linear fit to \(\text{ f}_{0}\) as a function of \(\text{ V}_\mathrm{S30}\) for each dataset

To check if the \(\text{ V}_\mathrm{S30}\)-\(\text{ f}_{0}\) correlation is region-specific, or the observed relation may be valid in other tectonic regions, we use the 2005 Pacific Earthquake Engineering Research Center (PEER) ground motion database which was developed for the 2008 NGA-West project (http://peer.berkeley.edu/smcat/). The PEER-NGA database is the most complete source of high-quality ground-motion data compiled from active tectonic regions. The database consists of 3,551 multi-component records from 173 shallow crustal earthquakes (M4.2–7.9). We calculate the H/V ratios for all sites in the database and pick the fundamental frequencies, considering just those stations that show a clear single peak and that recorded at least 3 events. As there are relatively few NGA stations having multiple recordings, the slope of the \(\text{ V}_\mathrm{S30}\)-\(\text{ f}_{0}\) relation is less significant (\(0.64 \pm 0.10\)) than that for Japan (1.33 \(\pm \) 0.12). The systematically-lower values of \(\text{ f}_{0}\) for the NGA data, in comparison to the Japanese data, can be attributed to deeper bedrock (for the same \(\text{ V}_\mathrm{S30})\) for most regions that comprise the NGA database; about 75 % of the NGA data we used in this study are from the 1999 Chi–Chi, Taiwan earthquake. It should be noted that the KiK-net stations are intentionally sited on harder sites, not deeper soil sites.

Figure 7 plots the fundamental site frequency as a function of depth-to-bedrock, which is interrelated to \(\text{ V}_\mathrm{S30}\). We also show in Fig. 7 the corresponding relationship for the PEER-NGA strong motion dataset; for the PEER-NGA data, the “depth to bedrock” is assumed to be the depth to \(\text{ V}_\mathrm{S} = 1.0\,\text{ km}/\text{ s}\) (Z1.0), as it is the closest proxy to our selected \(\text{ V}_\mathrm{S} = 760\,\text{ m}/\text{ s}\). The Japanese and PEER-NGA data show similar trends, but there is a greater intercept for the PEER-NGA data, probably because they are referenced to stiffer, deeper bedrock. We also show the estimated Z1.0 from the velocity profiles of the KiK-net stations in Fig. 7. We find that with an approximate adjustment, one can estimate Z1.0 in Japan and California by using the \(\text{ V}_\mathrm{S30 }\)and depth-to-bedrock relation for Japan. The correlations for fundamental site frequency and depth-to-bedrock (\(\text{ H}_\mathrm{B})\) with \(\text{ V}_\mathrm{S30}\) can be described for Japan by the following linear regressions:

$$\begin{aligned} \log _{10} \left( {f_0 } \right) = 1.331\log _{10} \left( {\frac{V_{S30} }{V_{ref} }} \right) + 0.9066,\end{aligned}$$
(6)
$$\begin{aligned} \log _{10} \left( {H_B } \right) = -1.729\log _{10} \left( {\frac{V_{S30} }{V_{ref} }} \right) + 0.9136 \end{aligned}$$
(7)

In Fig. 8, we examine the factors influencing site amplification, specifically depth-to-bedrock, and \(\text{ V}_\mathrm{S30}\). At low frequencies, as we would expect, the large wavelengths see (sample) only deep deposits. At higher frequencies, we are sampling more of the near-surface materials. Note that shallow sites are relatively stiff in terms of \(\text{ V}_\mathrm{S30}\), and generally show less amplification (except for shallow soil sites at high frequency). We developed empirical relationships between site amplification (\(\text{ S}/\text{ B}^{\prime }\)), depth-to-bedrock and fundamental site frequency, of the following form:

$$\begin{aligned} \log _{10} \left( {S/{B^{{\prime }}}} \right) = a.\log _{10} \left( {f_0 } \right) + b,\end{aligned}$$
(8)
$$\begin{aligned} \log _{10} \left( {S/{B^{{\prime }}}} \right) = c.\log _{10} \left( {H_B } \right) + d, \end{aligned}$$
(9)

where a, b, c, and d are the regression coefficients. The regression coefficients are summarized in Table 2.

Table 2 Regression coefficients of corrected site amplifications relative to the fundamental site frequency (a and b) and depth-to-bedrock (c and d), respectively
Fig. 7
figure 7

Depth-to-bedrock as a function of \(\text{ V}_\mathrm{S30}\). Gray squares are the depth to \(\text{ V}_\mathrm{S} = 760\,\text{ m}/\text{ s}\) and purple circles are Z1.0 (depth to \(\text{ V}_\mathrm{S} = 1,000\,\text{ m}/\text{ s}\)) from velocity profiles of the KiK-net stations. The dotted line is the best fit to the Z1.0 values from the PEER-NGA database. The dashed line is the estimated model for predicting Z1.0 in Japan. Note deeper bedrock for the PEER-NGA database

Fig. 8
figure 8

Corrected site amplifications (horizontal components) versus depth-to-bedrock, at frequency = 0.34, 1.1, and 3.7 Hz. The symbols are binned into different groups based on \(\text{ V}_\mathrm{S30}\)

4.1.1 Regional site amplification factors

Figure 9 plots the average site amplification factors for sites with \(\text{ V}_\mathrm{S30}\) of 180, 360, and 600 m/s, respectively, where the amplifications are derived using the coefficients of Table 1 (all KiK-net data). We also show our estimate of a representative site amplification curve for the reference velocity of 760 m/s. Table 3 shows the amplifications that come from the regression of log(H/V[depth]) vs. log(\(\text{ V}_\mathrm{S}\)[depth]), evaluated at \(\text{ V}_\mathrm{S} = 760\,\text{ m}/\text{ s}\) (black line in Fig. 9).

Table 3 Ratio of horizontal to vertical component of ground-motion at depth, evaluated for the reference B/C boundary site condition (\(\text{ V}_\mathrm{S}\text{[depth]} = 760\,\text{ m}/\text{ s}\))
Fig. 9
figure 9

Amplification (\(\text{ S}/\text{ B}^{\prime }\)) for different NEHRP site classes (soft soil profile in solid black; stiff soil profile in red; and very dense soil and soft rock in green). The estimated amplification for a reference velocity of 760 m/s is shown in black (dashed line)

4.1.2 Additional site factor: kappa filter

The attenuation of site amplifications at high frequencies, as seen in Fig. 9, is often represented by the high-frequency attenuation operator, kappa \(\kappa \) (Anderson and Hough 1984). This high-frequency decay of ground motions can be modeled by multiplying the spectrum by the factor P(f):

$$\begin{aligned} P\left( f \right) = \exp \left( {-\pi f\kappa } \right), \end{aligned}$$
(10)

where \(\kappa \) is a region-dependent or site-dependent parameter. In general, kappa will increase with distance due to path effects; its zero-distance intercept, sometimes referred to as \(\kappa _{0}\), is considered to represent path-independent near-surface attenuation of seismic waves.

After correcting ground motions from the Tohoku mainshock for regional site effects using the coefficients given in Table 1, we performed a simple regression analysis to estimate the anelastic attenuation factor, assuming a fixed geometrical spreading with slope of -1 over all distance ranges:

$$\begin{aligned} \log _{10} \left( {FAS} \right) = c_0 - \log _{10} \left( {R_{eff} } \right) +c_1 R_{cd} , \end{aligned}$$
(11)

where \(R_{eff} = \sqrt{R_{cd}^2 + 10^{2}}\) and R\(_{cd}\) is the closest-distance to the rupture surface, which is calculated based on the geometrical parameters of the background fault plane including fault length, fault width, strike, dip, and depth to the top of the fault plane (GSI 2011). Based on FAS regressions, we evaluated and plotted out the spectral shape on a log-linear plot for a few values of near-source distances, as shown in Fig. 10. We infer \(\kappa = 0.044\) from these spectral shapes.

Fig. 10
figure 10

Spectral shapes from initial regression of FAS (Tohoku mainshock) after site correction in log-linear scale. The shape is consistent for different effective distances (R\(_\mathrm{eff})\) of 10, 20, and 50 km. The slope of the fitted lines (dashed lines) for frequencies \(> 2\,\text{ Hz}\) provides an estimated kappa = 0.044

4.1.3 Using H/V[surface] as an extra parameter to estimate site amplification function

The H/V spectral ratio has been widely used to provide a preliminary estimate of site amplification (Kanai and Tanaka 1961; Nogoshi and Igarashi 1971; Nakamura 1989). The H/V ratios on soft soil sites generally exhibit a clear, stable peak that is correlated with the fundamental resonant frequency (Ohmachi et al. 1991; Field and Jacob 1993, 1995; Lermo and Chávez-García 1994; Lachet et al. 1996; Bonnefoy-Claudet et al. 2006). However, they typically underestimate the site amplification factor (Field and Jacob 1995; Bonilla et al. 1997; Satoh et al. 2001b). Considering these findings, we explore the use of the H/V ratios as an amplification function for the KiK-net data. This will be useful for other applications where the H/V ratios are known but borehole data are not available to constrain site amplification.

We performed a linear regression using:

$$\begin{aligned} \log _{10} \left( Y \right) = a_1 .\log _{10} \left( {{V_{S30} }/{760}} \right) + a_2 .\log _{10} \left( {f_0 } \right) + a_3 , \end{aligned}$$
(12)

in which Y is the ratio of the “true” site amplification (\(\text{ S}/\text{ B}^{\prime }\)) to an estimate obtained as the average of H/V[surface] over many events, and \(\text{ f}_{0}\) is the fundamental frequency of a site as measured from its average H/V peak frequency. Figure 11 shows the performance of Eq. (12) as a predictor of site amplification; Table 4 summarizes the regression coefficients for 23 logarithmically spaced frequencies. By using the H/V ratios and \(\text{ V}_\mathrm{S30}\) with Eq. (12), we obtain an excellent match to the observed site transfer functions for the whole range of frequencies. Applying this new model, over all KiK-net stations, the residuals (log\(_{10}\)(obs) – log\(_{10}\)(pred)) of site amplification as a function of \(\text{ V}_\mathrm{S30}\) is plotted in Fig. 12. Using the combination of \(\text{ f}_{0}\) and \(\text{ V}_\mathrm{S30}\) gives near-zero average residuals for prediction of site amplification for a wide range of site classes. The advantage of this method could be significant for assessing the site effects by using a single station or in an area where a rock reference site cannot be found.

Table 4 Regression coefficients for predicting site amplification (\(\text{ S}/\text{ B}^{\prime }\)) using Eq. (12)
Fig. 11
figure 11

Comparison of \(\text{ S}/\text{ B}^{\prime }\) ratios using cross-spectral ratios (solid black line) for sample sites (NGNH11, NGNH14, and NGNH20) with mean H/V ratios (dashed black line). Transfer functions are overlaid by prediction model using \(\text{ V}_\mathrm{S30}\) and \(\text{ f}_{0}\) as predictor parameters and adding H/V as an extra parameter (green line)

Fig. 12
figure 12

Residuals of predicting corrected observed amplifications at different site classes (varying \(\text{ V}_\mathrm{S30})\) using \(\text{ f}_{0}\), depth-to-bedrock, and H/V as an extra parameter [Eq. (12)]. Bars are indicating \(\pm \)1\(\sigma \) around the mean

4.2 Non-linear site amplification

To this point, we have assumed linear soil behavior in assessing average site amplification. Under weak motions, the stress-strain relationship of soil is linear, i.e. stress = G \(\times \) strain, where G is the shear modulus. Under strong shaking, soil shows nonlinear and hysteretic behavior, with the effective modulus G decreasing at high strain (Beresnev and Wen 1996a). Since the shear-wave velocity is given by \(V_\mathrm{s} =\sqrt{\text{ G}/\uprho }\) (where \(\uprho \) is density), the effective shear-wave velocity decreases as the strain increases. On the other hand, amplification decreases due to damping (loss of energy) in hysteresis. Overall, nonlinearity will result in a shift of the resonance frequency to lower values, and the reduction in amplification, as the amplitude of motions increases (Silva 1986; Beresnev and Wen 1996b; Field et al. 1997; Dimitriu et al. 2000).

Figure 13 shows the nonlinear behavior of a site subjected to strong shaking (PGA \(\sim 530 \text{ cm}/\text{ s}^{2})\) during the Tohoku event. The spectral ratio of the station (MYGH04 from the KiK-net, with \(\text{ V}_\mathrm{S30 }= 850\,\text{ m}/\text{ s}\)) is compared for the strongest part of the signal and the coda-window, noting that the shaking during the coda is a representative of weak-motion (Chin and Aki 1991). We hypothesize that the linear-elastic soil response is restored after the termination of the strong shaking portion. We have assumed that the coda window starts at the lapse time later than seven times the S-wave arrival time. If nonlinear behavior exists, it should appear as a discrepancy in site response characteristics between weak and strong motions. From Fig. 13, a clear shift of the peak amplitude frequency (fundamental frequency) to lower frequencies during the strong shaking portion of the record can be seen. Also, the amplitudes are reduced for the S-window spectral ratios in comparison with the coda-window spectral ratios. However, it should be mentioned that the coda-window does not necessarily behave in a completely linear fashion. If soil behaves nonlinearly, some residual deformation may remain, and it takes at least some time to reconsolidate to the original state (Wu et al. 2010). For Tohoku, at some sites the dissipation of pore pressure due to ground shaking might have taken a long time because of the long duration of shaking. Extensive liquefaction effects are hypothesized to be related to the number of cycles that soil underwent during the Tohoku event (Bhattacharya et al. 2011).

Fig. 13
figure 13

Amplification (S/B) of the East–West (EW) and North–South (NS) components at MYGH04 for the S-window of the first arrival (tick solid red), S-window of the second arrivals (dashed light red), and coda-window (blue)

By contrast to the nonlinear behavior noted in Fig. 13, we observe linear behavior in Fig. 14, which shows similar amplification of S-window and coda-window for TCGH16. \(\text{ V}_\mathrm{S30}\) for this site is 213 m/s and the borehole is at the depth of 112 m, in a layer with shear-wave velocity 680 m/s. The ground-motions reached PGA of \(1,197 \text{ cm}/\text{ s}^{2}\) in the EW component on the surface, and the PGA of the vertical component at the borehole level is \(137\,\text{ cm}/\text{ s}^{2}\); this is above a common threshold level of \(\sim \!\!100 \text{ cm}/\text{ s}^{2}\) often cited for nonlinearity for surface ground-motions (Beresnev and Wen 1996b).

Fig. 14
figure 14

Amplification (S/B) of the East-West (EW) and North–South (NS) components at TCGH16 for the S-window (red) and coda-window (blue)

4.2.1 Time-frequency analysis of borehole data to assess nonlinearity

We apply a moving-window method of Sawazaki et al. (2006; 2009) and Wu et al. (2009; 2011) to detect the PGA threshold of nonlinearity for each station. This technique uses Welch’s periodogram method of spectral estimation. The short-time Fourier transform (STFT) of a moving window is calculated for both surface and borehole time series. Then, the spectra are plotted versus evolutionary time. This way, a decrease of fundamental frequency and amplitude due to the nonlinear behavior of a site can be detected within the strong part of the signal, relative to the coda window and background noise (representing weak motions). A sample application of the technique is depicted in Fig. 15.

Fig. 15
figure 15

Temporal evolution of \(\text{ S}/\text{ B}^{\prime }\) for the MYGH04 station. The drop of amplitude and shifting of the fundamental frequency to lower-frequencies during strong shaking can be seen clearly in this figure

In our implementation, we used a moving window with a length of 6 s to examine all Tohoku waveforms recorded by the surface and borehole stations. Successive windows have 67 % overlap (windows are moved forward by 2 s intervals). The employed 6 s window appears to offer a good balance between temporal resolution and stability (Wu et al. 2009, 2010). We remove any linear trend and apply a 5 % cosine-taper to both ends of each window, and compute the FAS of the two horizontal components. The spectra is then smoothed using a Konno–Ohmachi window (\(b = 20\); where \(b\) is the bandwidth coefficient). The spectral ratios (\(\text{ S}/\text{ B}^{\prime }\)) are obtained by dividing the geometric mean of horizontal FAS spectra of a surface station by the corresponding spectra of a borehole station, tabulated for 50 frequencies logarithmically spaced between 0.1 and 25 Hz; we extended the upper-bound frequency as the \(\text{ S}/\text{ B}^{\prime }\) ratio is a dimensionless quantity and instrument correction can be ignored. Estimating a threshold for nonlinear behavior, we plot the \(\text{ S}/\text{ B}^{\prime }\) ratios as a function of sorted PGA values obtained from each record at the borehole level. PGA values are binned into 100 classes.

Figure 16 illustrates the changes of \(\text{ S}/\text{ B}^{\prime }\) relative to PGA for MYGH04. Note the reduction in the fundamental frequency and the amplification with increasing PGA. The threshold for nonlinearity at this station is \(\sim \)25 cm/s\(^{2}\). For this station, soil response at the surface starts to deviate from linear response at 25 cm/s\(^{2}\), becoming significant at \(100 \text{ cm}/\text{ s}^{2}\). MYGH04 has a 4 m thick soil layer with shear-wave velocity of 220 m/s over a 10 m thick of very stiff soil or soft rock layer with a shear-wave velocity of 960 m/s. For this site, it may take a large surface acceleration before the soil develops strong nonlinear response: the ratio of inertial force (i.e. accelerations times mass) over the yielding shear-stress of the soil above the base can be quite small even when the acceleration is large, due to the small thickness of the soil layer (J. Zhao, personal communication, 2012). The threshold values for nonlinearity based on PGA[surface] and PGA[borehole] are estimated and tabulated for 49 KiK-net stations in “Appendix 2”.

Fig. 16
figure 16

Spectral ratios versus recorded PGA for MYGH04. The threshold ground motion for nonlinear behavior is PGA[surface] \(\sim 25 \text{ cm}/\text{ s}^{2}\)

To examine the relative degree of nonlinearity during the Tohoku mainshock, in comparison to that exhibited during smaller earthquakes, we calculate the average amplification (\(\text{ S}/\text{ B}^{\prime }\)) at each station over all events (\(A_\mathrm{ref})\) and the frequency of that peak (\(f_\mathrm{ref})\). The same procedure is also applied for just the Tohoku mainshock ground motions. In this case, the maximum amplitude is called \(A _\mathrm{MS}\) and the fundamental frequency is called \(f _\mathrm{MS}\). We consider stations that show a clear-single peak (with amplitude \(>\) 2) in the H/V ratios at the surface (or \(\text{ S}/\text{ B}^{\prime }\) ratio). In total, 225 out of 475 stations passed these criteria and among them only 42 stations showed nonlinear behavior (shifting of the fundamental frequency to lower frequencies, and a decrease in amplitude). To quantify the degree of nonlinearity for these 42 stations, we plot the ratio A\(_\mathrm{MS}\)/A\(_\mathrm{ref }\)and \(\text{ f}_\mathrm{MS/} \text{ f}_\mathrm{ref }\)as a function of \(\text{ PGA}_\mathrm{ref}\), the predicted median PGA by the Tohoku regression Eq. (13) (derived below), for \(\text{ V}_\mathrm{S30 }= 760\,\text{ m}/\text{ s}\) (reference), in Fig. 17 (values for other sites, not identified as being nonlinear, as shown in the background). It is interesting that nonlinear effects are seen for all soil types. Probably this is because the NEHRP C sites are typically soft shallow soil over rock. The plots show that there is a clear, steady trend of increasing nonlinearity with increasing PGA. We fit a linear trend line (for the sites showing nonlinearity) and obtain the significance of slope and intercept for A\(_\mathrm{MS}\)/A\(_\mathrm{ref}\) and \(\text{ f}_\mathrm{MS}/\text{ f}_\mathrm{ref}\) ratios. The p-value of the 2-tailed t-test for the slope (Fig. 17a) is 0.22, which is significantly different from 0 at the 95 % confidence level. The standard error (SE) on the slope for A\(_\mathrm{MS}\)/A\(_\mathrm{ref }\) is 0.0014. For the \(\text{ f}_\mathrm{MS}/\text{ f}_\mathrm{ref}\) ratio, the p-value of the slope is 0.0048, which indicates significant difference from 0 (with \(\text{ SE} = 9.3\times 10^{-4}\)).

Fig. 17
figure 17

Nonlinearity symptoms: decrease in the predominant frequency (\(\text{ f}_\mathrm{MS}/\text{ f}_\mathrm{ref})\) and/or amplification amplitude (\(\text{ A}_\mathrm{MS}/\text{ A}_\mathrm{ref})\) as a function of \(\text{ PGA}_\mathrm{ref}\) (predicted PGA for \(\text{ V}_\mathrm{S30} = 760\,\text{ m}/\text{ s}\)). NEHRP site classes are shown with different colors (a and b) and trend lines are shown as solid black lines. Grey dots in background show values for sites that did not exhibit nonlinearity symptoms

In summary, we found some localized nonlinear soil behavior (such as stations MYGH04 and IBRH18), and the degree of nonlinearity increases with the intensity of shaking as would be expected (Fig. 17). However, nonlinearity was not a pervasive phenomenon during the Tohoku event, with only a small fraction of the sites showing significant changes in amplification and its peak frequency. On the other hand, it should be mentioned that there are many cases of liquefaction-related damage during the Tohoku earthquake, and that some nonlinearity was also observed at the K-NET sites (Tokimatsu et al. 2011).

5 Overall characteristics of Tohoku ground motions

Having evaluated site amplification for the Tohoku motions, we can now examine what the underlying motions would be without the site effects. We fit the Tohoku motions to:

$$\begin{aligned} \log _{10} \left( Y \right) + \log _{10} \left( {R_{eff} } \right) - site_{factor} = c_0 + c_1 .F.R_{cd} + c_2 .B.R_{cd} \end{aligned}$$
(13)

For stations in the forearc region, F = 1 and B = 0, otherwise (for backarc stations) F = 0 and B = 1 (Ghofrani and Atkinson 2011). The site\(_{factor}\) is \(m\).log(\(V _{S30}/V_\mathrm{ref})+b\) using coefficients in Table 1. The regression coefficients are given for both FAS and pseudo-spectral acceleration (PSA) in Table 5.

Table 5 Regression coefficients for FAS and PSA (geometric mean of horizontal components)

A comparison between the event-specific Tohoku prediction equation and the motions prescribed by regional GMPEs for Japan (Zhao et al. 2006b; Kanno et al. 2006; Atkinson and Macias 2009) is shown in Fig. 18. In this figure, AM09 is in fact a GMPE for Cascadia which is adjusted for Japan by multiplying the Cascadia motions by the ratio of Japan/Cascadia site factors, as given by Macias et al. (2008) and Atkinson and Macias (2009), respectively. The observed PSA values are corrected for site amplification using coefficients from Table 1, before comparison with GMPEs for a reference condition of B/C. The Zae06 (Zhao et al. 2006b) and Kan06 (Kanno et al. 2006) GMPEs are over-predicting Tohoku ground motions at 1 Hz, while AM09 (Atkinson and Macias 2009) is similar to the new Tohoku equation for the back-arc stations. The magnitude scaling for very large megathrust events is not empirically-constrained, and thus extrapolation of empirical GMPEs may cause unknown biases. For example, the over-prediction of ground-motions using Zae06 and Kan06 could be the result of inadequacy of the magnitude squared term for magnitude scaling of subduction interface earthquakes in their relations (Zhao and Xu 2012a). The relatively-good agreement of the site-corrected data with the AM09 predictions, while encouraging, should be interpreted with caution as this result pertains to a single event, at a specific magnitude. It is clear from Fig.18 that the attenuation (slope and curvature) of single Q-factor GMPEs is controlled mostly by the back-arc stations.

Fig. 18
figure 18

Comparing event-specific prediction equation for the site corrected Tohoku ground-motions (B/C) with other GMPEs (Kan06 = Kanno et al. 2006; Zea06 = Zhao et al. 2006b; AM09 = Atkinson and Macias 2009) at four frequencies. Fore-arc stations are shown with blue circles and back-arc stations are in magenta

6 Conclusions

The M9.0 2011 Tohoku earthquake has provided important new quantitative information on site response that is invaluable in refining seismic hazard analysis and mitigation efforts. Conclusions from our study of site effects are as follows:

  1. 1.

    Site amplification effects in Japan are very large at f \(>\) 2 Hz, with amplification factors of 4 to 8, even for relatively strong shaking. At high frequencies, median peak ground accelerations from the Tohoku event were near 300 cm/s\(^{2}\) (\(\sim \)0.30g) at 100 km, for the NEHRP C sites, largely due to high-frequency site effects. The site amplifications were much greater than those adopted in standard building code based on the NEHRP site class. In part, the large amplitudes at high frequencies are due to the prevalence of shallow soil conditions over a harder layer in Japan; this is especially applicable for the KiK-net sites. Accounting for site amplification is critical to interpretation of motions.

  2. 2.

    Analyzing spectral ratios of motions recorded on the surface to those at depth in boreholes are a good way of obtaining site response at stations, but should be corrected for the depth effect (interference of up-going and down-going waves). A cross-spectral ratio technique for correcting the depth effect of the surface-to-borehole ratios is effective.

  3. 3.

    We estimated the frequency-dependent site amplification factors that relate the recorded motions to equivalent values for the B/C boundary site conditions. The regression coefficients (amplification factors) are derived for the Tohoku event and for all KiK-net data (M \(\ge \) 5.5; 1998–2009) for the frequency range of 0.1–15 Hz (Table 1).

  4. 4.

    We developed empirical relationships between parameters that characterize site amplification, depth-to-bedrock and fundamental site frequency. It is interesting that \(\text{ V}_\mathrm{S30}\) is a good proxy to estimate the natural frequency of a site. Furthermore, the fundamental frequency inferred from H/V matches well with that derived from the theoretical relation (\(\text{ f}_{0} = \text{ V}_\mathrm{S}\) /4H).

  5. 5.

    Nonlinear site response was not pervasive during the 2011 M9.0 Tohoku earthquake at the KiK-net stations. No specific dependence of nonlinearity on the site properties (e.g. \(\text{ V}_\mathrm{S30})\) was observed. This may be because \(\text{ V}_\mathrm{S30}\) is not providing a good measure of soil stiffness in Japan, as the stiffer sites are mostly just shallow soft soil.

  6. 6.

    We can obtain an excellent match to the observed site transfer functions for the whole range of frequencies by using H/V and \(\text{ V}_\mathrm{S30,}\) with Eq. (12). Using the combination of \(\text{ f}_{0}\) and \(\text{ V}_\mathrm{S30}\) gives near-zero average residuals for prediction of site amplification for a wide range of site classes. The advantage of this method could be significant for assessing the site effects by using a single station or in an area where a rock reference site cannot be found.

  7. 7.

    Generic GMPEs developed for subduction regions appear to under-estimate the Tohoku motions if soil amplification effects are not removed. However, once site effects are taken into account the agreement is improved.