1 Introduction

The statistical nonlinear regression relationships of the fundamental resonant frequency (\(f\)) and S-wave velocity (\({V}_{s}\)) versus bedrock depth (\(h\)) are usually used to estimate the depth to the bedrock half-space. The general form of the relationship of \(f\) versus \(h\) could be defined as

$$h=a{f}^{-b}$$
(1)

where \(a\) and \(b\) are correlation coefficients.

Based on microtremor observation, the characteristics of ground motions could be estimated using the pioneering horizontal-to-vertical spectral ratio (HVSR) technique by Nakamura (1989). Henceforward, Lermo and Chavez-Garcia (1993) and Wen et al. (2006) estimated the fundamental resonant frequency using HVSR of weak and strong earthquakes. Recently, Sánchez-Sesma et al. (2011) back-calculated the 1D velocity structure applying the diffuse field theory on HVSRs of microtremors, whereas Kawase et al. (2011) and Nagashima et al. (2014) used the HVSRs of earthquake recordings. Long-period microtremor measurements by Yamanaka et al. (1994) concluded that the resulted Rayleigh waves ellipticities are resembling those resulted from earthquakes. Their study was conducted in the northwestern part of the Kanto Plain, Japan. Moreover, the HVSRs of ambient vibrations fit well with the resulted fundamental frequencies of standard spectral ratios of earthquakes (Bard and SESAME team 2004).

In the literature, previous existing relationships of \(f\) versus \(h\) were determined based on HVSRs of microtremor investigations (e.g., Ibs-von Seht and Wohlenberg 1999, Delgado et al. 2000a and 2000b, Parolai et al. 2002, Scherbaum et al. 2003, Hinzen et al. 2004, García–Jerez et al. 2006, D’Amico et al. 2008, Harutoonian et al. 2013, Tün et al. 2016, and Moon et al. 2019). Using earthquakes of KiK-net to determine regression relationships of \(f\) versus \(h\) is examined by Thabet (2019). These statistical nonlinear regression relationships are highly site dependent.

Early introduction of the empirical transfer function (ETF) in the valuable study conducted by Borcherdt (1970) is considered as the base of other later studies. Calculating the ETF is usually done using a standard spectral ratio (SSR), and requires a pair of instruments, one located at a surface soil site under investigation (generally on alluvium) and the other at a nearby surface reference rock site. The ETF represents resonant frequencies of the soil column. These resonant frequencies are only descriptive of that portion of the soil column between the two sensors and are not necessarily representative of the site’s global behavior experienced by an earthquake event. Pioneering study of Kagami et al. (1982) examined the usefulness of long-period microtremor observations in Niigata plain, Japan, and in Los Angeles, California. Their results showed that Fourier amplitude of microtremors systematically increase from shallow to deep sites resembling observation from strong-motion earthquake records during the Niigata earthquake of 1964 and the San Fernando earthquake of 1971. Satoh et al. (1995) conducted their study to evaluate the local site effects due to surface layers overlying the engineering bedrock and to remove them by using one-dimensional (1D) soil models. They could estimate the engineering bedrock waves, which are supposed to be observed on the outcrop of the engineering bedrock, from borehole records. They concluded that the resonant frequency determined from SBR at a borehole site should not be treated as the resonant frequency of the site. Microtremor measurements were carried in the western lower Rhine embayment (Germany) by Ibs-von Seht and Wohlenberg (1999) to estimate the resonance frequency using the classical spectral ratios and taking advantage of hard rock basement sites used as reference stations. Horike et al. (2001) compared the relative amplification factors inferred from HVSRs of microtremor and earthquake motions, and they found reasonable agreement between the resulted spectral ratios. Satoh et al. (2001a) conducted 48 singe stations and 12 arrays of microtremor measurements in Taichung Basin, Taiwan. They estimated the S-wave velocity structure using the Rayleigh wave inversion technique by matching the calculated Rayleigh wave HVSR to observed HVSR.

In the present study, the surface-to-borehole spectral ratios (SBRs) are calculated using database of the Kiban Kyoshin network (KiK-net) in Japan. Seismic downhole array sites of KiK-net are invaluable tools in the present attempts to understand and accurately estimate the fundamental resonant frequency (\(f\)). Ground motions recorded at various depths within these seismic vertical arrays are used to calculate the surface-to-borehole spectral ratios (SBRs), and consequently estimate the peak frequencies as the seismic waves travel from bedrock half-space to the ground surface. When using recorded ground motions from vertical seismic arrays (i.e., surface and borehole records), it is nearly inevitable to separate between the down-going and up-going seismic wave effects. Tao and Rathje (2020) showed clearly that both effects are not negligible. They examined the down-going wave effect and the appearance of pseudoresonances in downhole array data for sites with a shallow velocity contrast or with little to no velocity contrast. In contradict to this study, Rong et al. (2016) compared HVSRs of the S-wave with one dimensional equivalent-linear numerical simulations on 21 sites in western China and suggested that the HVSR from observed earthquake ground motion resembles the SBR of nonlinear site response. Therefore, it is recommended to carefully select sites, which have similar SBR and HVSR, to avoid the down-going and up-going seismic wave effects.

Thabet (2019) determined site-dependent regression relationships of \(h\) versus \(f\) using HVSRs of earthquakes at KiK-net sites, Japan. Although the resulted HVSR peak frequencies are clear, unique, and sharp, there is a dominated strong scattering in the relationships of \(h\) versus \(f\) up to bedrock depths of 100 m. Some KiK-net sites are excluded from the study conducted by Thabet (2019) due to their strong non-stationary behavior, which lead to high scattering and perturbations in their resulted HVSRs. According to Thabet (2019), there are 192 having their bedrock depths < 20 m, which corresponds to 59.3% of the accepted 324 KiK-net sites. This means that bedrock depths < 20 m are significantly responsible for the high scattering, which is dominated in the relationships of \(h\) versus \(f\) up to bedrock depths of 100 m. Moreover, these 192 sites constitute approximately 27.5% of the whole 698 KiK-net sites. So that, it is expected to obtain ≈ 287 accepted K-NET sites out of the whole 1045 K-NET sites in the present HVSR analyses. For that reason, both KiK-net and K-NET sites in Japan are used in the present study. The fundamental resonant frequency is determined using HVSR and SBR analyses (i.e., \({f}_{HVSR}\) and \({f}_{SBR}\), respectively) at KiK-net sites, whereas only \({f}_{HVSR}\) is determined at K-NET sites. In total, all the available earthquake records from 1997 to 2019 at 698 KiK-net and 1045 K-NET seismic sites are used. Borcherdt (1994) stated that time average S-wave velocity of the upper 30 m (\({V}_{s30}\)) provides accurate site characterization and permits seismic site classification unambiguously. S-wave velocity of the upper 30 m (\({V}_{s30}\)) is the main factor for the seismic site classification in terms of seismic response by many organizations, particularly the National Earthquake Hazard Reduction Program (NEHRP). In this study, site-dependent relationships of \({f}_{HVSR}\), \({f}_{SBR}\), and S-wave velocity (\({V}_{S}\)) versus \(h\) in Japan are provided for more improvement to reveal the influence of different physical and geological parameters on these statistical site-specific nonlinear regression relationships. Additionally, the applicability and discrepancy between HVSR and SBR are revealed and discussed.

2 Data set

The data from the KiK-net and K-NET in (last accessed, January 2020) Japan are used to establish the proposed statistical nonlinear regression relationships based on the seismic site classification. The KiK-net consists of 698 seismic vertical array sites equipped with a pair of surface and borehole accelerometers, and the K-NET consists of 1045 sites equipped with seismic surface accelerometers (refer to Fig. 1).

Fig. 1
figure 1

Location map of KiK-net and K-NET seismic stations. Blue triangles are the 698 KiK-net seismic sites. Red circles are the 1045 K-NET seismic sites

KiK-net site data have high variety of stratigraphic and lithologic columns and \({V}_{P}\) and \({V}_{S}\) structures up to depths between 100 and 3500 m. Around 88% of KiK-net sites have borehole depths up to 250 m, whereas 12% have their borehole depths (250–500 m) and higher than 500 m, equally. As a result, it will be possible to define the bedrock half-space.

The K-NET consists of 1045 strong-motion observation sites with a spacing of 25 km. This spacing of the K-NET is essential to record strong motions in the epicentral region of a crustal earthquake with more than magnitude of 7 anywhere in Japan. Beneath each K-NET site, the observed detailed N-value (i.e., blow count of standard penetration test), \({V}_{P}\) and \({V}_{S}\) structures, bulk densities (\(\rho\)), and lithologic column are provided down to a depth of 20 m. The vertical resolution of these observed data is 1 m. These observed data are a valuable opportunity to test and experience the present strong scattering that is dominated in the regression relationships of \(h\) versus \(f\) at bedrock depths of < 30 m. Those KiK-net and K-NET sites show very wide lithological variation.

All KiK-net and K-NET sites are classified according to the definition of National Earthquake Hazards Reduction Program (NEHRP Provision, 2000) site classes. According to Boore (2004), there are common three extrapolation methods to estimate the \({V}_{S30}\), and consequently it will be possible to classify K-NET sites having shallow velocity models of depths < 30 m according to the NEHRP seismic site class. These extrapolation methods are (1) assuming constant velocity, (2) using the correlation between \({V}_{S30}\) and \(\overline{{V }_{S}}(d)\), and (3) based on velocity statistics to determine site class. In the present paper, the simplest method is adapted, which is assuming that the lowermost velocity of the model extends to 30 m (see Eq. 2).

$${V}_{S30}=30/(tt\left(d\right)+\frac{30-d}{{V}_{eff}})$$
(2)

where \({V}_{eff}\) is the assumed effective velocity (i.e., velocity at the bottom of the velocity model) from the bottom depth of K-NET site (\(d)\) to 30 m. \(tt\left(d\right)\) is the travel time to a depth \(d\).

The most of KiK-net and K-NET sites have a class of C or D. Minority of the sites has a class of B or E, whereas sites have a class of A are not involved. In the present study, the KiK-net and K-NET enormous earthquake database collected covers the period between 1997 and 2019. Waveforms are recorded with a sampling frequency of 100 Hz for most of the used events and 200 Hz for some events. All the available events with PGAs (i.e., Peak Ground Accelerations) of ≤ 10 cm/s2 are used in the present analyses in accordance with Régnier et al. (2013) in order to characterize the linear behavior at each site. This implied that linear behavior is prevailing avoiding later modification due to nonlinear response (Thabet et al. 2008 and Thabet 2021). This enormous earthquake database for each KiK-net and K-NET site is prepared using multiple earthquakes approach (Thabet 2019), and is simply assuming several consequent earthquake records as a continuous time series to lengthen the total time window of each record. Between each two adjacent earthquake records, the effect due to conjunction points is negligible because the frequencies of interest are low (i.e., ≤ 50 Hz). Figure 2 shows an example of 737-earthquake surface records at IBRH15 site. Adopting multiple earthquake approach enable us to exclude sites having recorded earthquakes with strong non-stationary and may undergo high scattering and perturbations, which may significantly affect the physical meaning of the HVSR and SBR peak frequencies. As a result, the included or accepted sites in the present study have the lowest level of scattering (i.e., acceptable low standard deviation values) indicating that propagation path effects, which are represented in shallow or deep earthquakes and far-field or near-field earthquakes, could not control the resulted HVSR or SBR curves.

Fig. 2
figure 2

Example of more than 1-day time window of the conjugated three components (i.e., Z, N, and E are UD, NS, and EW components, respectively) of 737 events recorded at IBRH15 site prepared for processing in GEOPSY software suite. (Note: the shown 737 earthquake events are all the recorded events at IBRH15 site from 1997 to 2019 with amplitudes less than 10 cm/s2)

3 Methods of analyses

The HVSR technique (Nogoshi and Igarashi 1970 and 1971; Nakamura 1989 and 2000) has been extensively studied with earthquake recordings to quantify the site effect produced by the sedimentary covering in specific frequency bands (Field and Jacob 1993; Lachet and Bard 1994; Lermo and Chavez-Garcia 1994; Bindi et al. 2000; Fäh et al. 2001). The HVSR is defined as the ratio of the quadratic mean of horizontal Fourier amplitude of ground motion (i.e., EW and NS components) and the vertical Fourier amplitude of ground motion at the free surface. The calculation is illustrated in

$$HVSR=\frac{\sqrt{{(EW}^{2}+ {NS}^{2})/2}}{V}$$
(3)

where EW, NS, and V are the acceleration Fourier amplitude spectra of the east–west, north–south, and vertical components of the ground motion, respectively.

GEOPSY software suite (http://www.geopsy.org) is used for HVSR processing the prepared conjugated three-component events at the surface of each KiK-net and K-NET site. These lengthy time-window conjugated records provide fulfillment of the widely accepted framework guidelines of the European research project SESAME (Bard and SESAME team 2004). These guidelines have been consulted and followed concerning HVSR processing. Therefore, the reliability and quality of these HVSR calculations are assessed depending on three basic requirements. They are the following: (a) the expected fundamental resonant frequency of interest must be more than 10 significant cycles in each time window, (b) the total number of the significant cycles must be more than 200 and it is recommended to be raised around two times at low fundamental resonant frequencies, and (c) the acceptable low standard deviation values that are calculated for the amplitudes of the time windows, which is recommended to be less than 2 for peak frequencies higher than 0.5 Hz or 3 for peak frequencies lower than 0.5 Hz. The identification of the clear peak frequency (f0) is met when the HVSR curve exhibits two quantitative criteria for a clear HVSR peak frequency as proposed by Bard and SESAME team (2004). Those criteria are the amplitude and stability conditions. Table 1 is summarizing the threshold values of amplitude and frequency. Amplitude conditions require that the HVSR amplitude of the peak frequency must be higher than 2 (i.e., A0 > 2). The stability conditions require the following: (1) the peak frequency should appear at the same frequency on the HVSR curves corresponding to mean + and − one standard deviation (i.e., σf is within a percentage ± 5%) of the peak frequencies estimated from individual time windows of the orthogonal components at one KiK-net or K-NET site, (2) σf (i.e., standard deviation of f0 estimated from individual time windows) must be lower than a frequency-dependent threshold ε(f), and (3) σA(f0) (i.e., standard deviation of the HVSR amplitudes of the peak frequencies estimated from individual time windows) must be lower than a frequency-dependent threshold θ(f).

Table 1 Threshold parameters for the identification of a clear peak frequency according to guidelines by Bard and SESAME team (2004)

A seismogram can be visualized as the convolution of the source effect, propagation path effect, site effect, and instrument response as

$${A}_{ij}(t)={S}_{i}\left(t\right)*{P}_{ij}(t)*{G}_{j}\left(t\right){*I}_{j}\left(t\right)$$
(4)

where \({S}_{i}\left(t\right)\) is the source effect, \({P}_{ij}\left(t\right)\) is the propagation path effect, \({G}_{j}\left(t\right)\) is the site effect, \({I}_{j}\left(t\right)\) is the instrument response, of the ith event, and for the jth station at free surface. “\(*\)” denotes the convolution operator.

Denoting Fourier transform, Eq. 4 can be written as in Eq. 5. The spectral ratio is obtained by dividing the Fourier spectrum of the acceleration at the jth station at free surface by the spectrum at the kth station at borehole as in Eq. 6.

$${A}_{ij}(f)={S}_{i}\left(f\right){P}_{ij}(f){G}_{j}(f){I}_{j}(f)$$
(5)
$$\frac{{A}_{ij}(f)}{{A}_{ik}(f)}=\frac{{S}_{i}\left(f\right){P}_{ij}(f){G}_{j}(f){I}_{j}(f)}{{S}_{i}\left(f\right){P}_{ik}(f){G}_{k}(f){I}_{k}(f)}=\frac{{G}_{j}(f)}{{G}_{k}(f)}$$
(6)

If the distance between stations j and k is much less than their hypocentral distances from the source, the source and path effects would be eliminated. The instrument response can be removed assuming similar instrument units at surface and in borehole. The exact site response can thus be obtained from Eq. 6.

In this study, SBR is defined as spectral ratios between the acceleration horizontal Fourier amplitude motions recorded on the free surface and in the borehole at each KiK-net site. Equations 7 and 8 show the SBR of both horizontal components (i.e., EW and NS, respectively).

$${SBR}_{ew}={H}_{s.ew}/{H}_{b.ew}$$
(7)
$${SBR}_{ns}={H}_{s.ns}/{H}_{b.ns}$$
(8)

where “s” and “b” terms are referred to motions recorded on the free surface and in the borehole, respectively.

The Fourier amplitude spectra for surface and borehole horizontal components at each KiK-net site are calculated using GEOPSY software suite (http://www.geopsy.org). Fulfilling Bard and SESAME team (2004) guidelines, the reliability and quality of the SBR calculations are also assessed depending on the previously explained three basic requirements. As a result, \({f}_{SBR}\) could be estimated for both horizontal components (EW and NS) and defined as \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\), respectively. Calculating the \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\) is carried out for two reasons. The first reason is to estimate the effects of lateral or horizontal variation in the lithology. The second reason is to estimate the directional or structural control dependency on site resonance.

Previous work by Ibs-von Seht and Wohlenberg (1999) defined the bedrock depths based on the interface between the soft sedimentary covers of different thicknesses of Tertiary and Quaternary ages (VS < 1000 m/s), and Paleozoic hard rock basement (VS ≥ 2500 m/s). Delgado et al. 2000a, 2000b(a) defined the bedrock depths based on the interface between the Upper Miocene sedimentary fill (i.e., conglomerates, sandstones, and marls) of Bajo Segura basin (VS ≥ 85 m/s), and the basement of the basin which is composed of limestones and marls of Triassic to Cretaceous age (VS ≥ 200 m/s). Özalaybey et al. (2011) depended on geological evaluation and gravity data to estimate the bedrock depths in Izmit basin. García–Jerez et al. (2006) estimated the sedimentary cover thicknesses (i.e., depth to bedrock half-space) in Zafarraya basin by means of close geoelectrical surveys. Thabet (2019) estimated \(h\) at a site based on the highest seismic impedance ratio between each adjacent underlying and overlying layers. In this study, \(h\) is estimated at each KiK-net and K-NET site according to the highest seismic reflection coefficient of VP and VS (i.e., ICP and ICS) between each adjacent two layers, which are calculated based on the concept of seismic reflection coefficient (see Eqs. 9 and 10). So that, it will be possible to identify the presence of high velocity layer, which has negative ICP and/or ICS. Consequently, bedrock presence could be clearly judged and differentiated from the presence of high velocity layer.

$${IC}_{P}= \frac{{(\sigma {V}_{P})}_{i+1}-{(\sigma {V}_{P})}_{i}}{{(\sigma {V}_{P})}_{i+1}+{(\sigma {V}_{P})}_{i}}$$
(9)
$${IC}_{S}= \frac{{(\sigma {V}_{S})}_{i+1}-{(\sigma {V}_{S})}_{i}}{{{(\sigma {V}_{S})}_{i+1}+(\sigma {V}_{S})}_{i}}$$
(10)

where \(\sigma\) is the density of each layer i (note: 1 is the surficial layer and ith for underlying layers).

The appropriate \(h\) is assigned to its \({f}_{HVSR}\) and \({f}_{SBR}\), whereas the unreasonable \(h\) is excluded from any further analyses due to difficulties or suspicion in assigning the accurate and proper bedrock half-space. However, exclusion of KiK-net and K-NET sites from further analyses is executed depending on designed filtering process as illustrated in Fig. 3 and Fig. 4, respectively. The rejected or excluded sites from further analyses are due to many reasons. They are organized on a base of step-by-step process as follows: (1) the unavailability of velocity structures, geotechnical, or seismological database at some KiK-net and K-NET sites, (2) K-NET sites exhibiting peak resonance frequencies < 2.5 Hz, (3) the clarity criteria according to SESAME guidelines (Bard and SESAME team 2004) is not satisfied, (4) presence of unclear peak frequency such as broad peak frequency or a multiplicity of local maxima, and (5) difficulties in relating the peak resonance frequency with its proper seismic reflection coefficient bedrock half-space. The present PS logging seismic velocity structures of KiK-net and K-NET are considered as reliable and applicable (Thabet 2019).

Fig. 3
figure 3

Flowchart showing the designed filtering process to discard or accept KiK-net site in the present regression relationship

Fig. 4
figure 4

Flowchart showing the designed filtering process to discard or accept K-NET site in the present regression relationship

4 Results and discussion

Out of 698 KiK-net sites processed with HVSR and SBR methods and 1045 K-NET sites processed with HVSR method, 366 and 483 KiK-net sites processed with HVSR and SBR, respectively, and 339 K-NET sites processed with HVSR method could be accepted and included in the further analyses. Table 2 shows the ranges of \(h\) and \(f\) of the accepted sites in the present study. Interesting investigation is carried out by Zhu et al. (2020) at 90 KiK-net sites, which have no large velocity contrast. They perform novel empirical correction to preserve the earthquake HVSR with that of one-dimensional S-wave amplification spectrum. As a result, the observed ETF, which could be taken as SBR, could be reproduced. Figure 5 is showing two comparisons between \({f}_{HVSR-KiK}\) versus \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\). Both comparisons are based on the same KiK-net sites having accepted \({f}_{HVSR-KiK}\), \({f}_{SBR-EW}\), and \({f}_{SBR-NS}\). \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\) tend to be equal to or slightly higher than their corresponding \({f}_{HVSR-KiK}\). This indicates that \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\) are comparable to their corresponding \({f}_{HVSR-KiK}\). Moreover, the variability between \({SBR}_{ew}\) and \({SBR}_{ns}\) is tested through comparison between \({f}_{SBR-EW}\) and \({f}_{SBR-NS}\) as shown in Fig. 5. This indicates that the lateral variation between EW and NS components considering the resonance peak frequencies are insignificant, particularly for sites having peak frequencies < 1 Hz. For that reason, the arithmetic average between \({SBR}_{ew}\) and \({SBR}_{ns}\) is calculated and the \({f}_{SBR}\) is used henceforward. Figure 6 shows that the class of C and D are the dominant site classes as depicted from the accepted KiK-net and K-NET sites.

Table 2 Resulted ranges of \(h\) and \(f\) in the present study
Fig. 5
figure 5

Comparison between fundamental resonant frequencies resulted from SBR, of both horizontal EW and NS components, and HVSR at KiK-net sites

Fig. 6
figure 6

Statistics of the accepted sites resulted from HVSR and SBR classified according to NEHRP Provision (2000)

The engineering bedrock has S-wave velocity of 400 to 700 m/s according to seismic microzonation study by Nath (2007), while Miller et al. (1999) defined the bedrock by considering S-wave velocity of ≥ 244 m/s using MASW survey. Delgado et al. (2000b) defined the geotechnical bedrock by considering S-wave velocity > 250 m/s of the Triassic carbonate rocks and Triassic to Cretaceous limestones. However, the engineering bedrock has S-wave velocity of ≥ 350 m/s, whereas the seismological or seismic bedrock corresponds to S-wave velocity of ≥ 3000 m/s (Satoh et al. 2001b, Kawase et al. 2011 and Kawase et al. 2018). In the present paper, 302 and 37 out of the 339 K-NET sites processed with HVSR method have S-wave velocity of bedrock > 350–2000 m/s and < 350 m/s, respectively. 348 and 18 out of the 366 KiK-net sites processed with HVSR method have S-wave velocity of bedrock > 350–3000 m/s and < 350 m/s, respectively, whereas 453 and 30 out of the 483 KiK-net sites processed with transfer function method have S-wave velocity of bedrock > 350–3000 m/s and < 350 m/s, respectively. As a result, those minor sites with S-wave velocity of bedrock < 350 m/s have intermediate layer within the sediments (i.e., highest seismic reflection coefficient interface) and responsible for producing peak resonance frequency.

Figure 7 shows three examples of KiK-net sites that are producing clear sharp and unique single HVSR and SBR fundamental resonant frequencies of > 3 Hz, 1–3 Hz, and < 1 Hz that are processed at SZOH34, YMTH15, and ISKH06 sites, respectively. In the supplementary material, Figures S1 through S10 are showing various examples of KiK-net sites processed with HVSR and SBR and K-NET sites processed with HVSR. The examples exhibit clear, sharp, and unique single HVSR or SBR peak frequencies. Figures S11, S12, and S13 are showing various examples of discarded KiK-net sites processed with HVSR and SBR and K-NET sites processed with HVSR. Tables S1, S2, and S3 are showing the whole discarded KiK-net and K-NET sites with their discard codes. There are 23 KiK-net sites and 34 K-NET sites with no available velocity structure or any geotechnical database. Four K-NET sites have their soil condition data to a depth of only 5 m. Due to the depth limitation in K-NET database to ≤ 20 m, so that, 388 K-NET sites are discarded because they have peak resonance frequency < 2.5 Hz which is corresponding to > 20 m depth. There are 291, 106, and 197 KiK-net sites processed with HVSR and SBR and K-NET sites processed with HVSR, respectively, not satisfying the clarity criteria according to SESAME guidelines (Bard and SESAME team 2004). Discarded 7, 62, and 21 KiK-net sites processed with HVSR and SBR and K-NET sites processed with HVSR, respectively, have unclear peak frequency such as broad peak frequency or a multiplicity of local maxima. Due to difficulties in relating the peak resonance frequency with its proper seismic reflection coefficient bedrock half-space, 11, 24, and 62 KiK-net sites processed with HVSR and SBR and K-NET sites processed with HVSR, respectively, are discarded. It is important to know that Thabet (2019) used 324 KiK-net sites processed with HVSR method, whereas the present study could success to increase the accepted or included sites up to 366 KiK-net sites processed with HVSR method. This improvement is due to adopting multiple earthquake approach, which enable us to include all the available ground motion records with PGAs ≤ 10 cm/s2. Consequently, the effect of strong non-stationary nature of waveforms, which may significantly affect the physical meaning of the HVSR and SBR, could be minimized or eliminated in the resulted peak frequencies. The other improvement in the present study is the accepted or included 483 KiK-net sites processed with SBR. As depicted from Eq. 6, SBR proves its efficiency in assessing the site response based on surface-to-borehole KiK-net sites. This indicates the applicability and productivity of SBR method in establishing regression relationships of \(h\) versus \(f\).

Fig. 7
figure 7

Examples of peak frequencies at ISKH06, YMTH15, and SZOH34 KiK-net sites resulted from HVSR (upper curves) and SBR of both horizontal components (middle and lower curves). (Note: red curves are the average HVSR or SBR curves, whereas the pale gray curves are the minimum and maximum)

Figure 8 shows an example of assigning the HVSR fundamental and secondary resonant frequencies to their deep and shallow bedrock half-spaces, respectively, at KYTH08 KiK-net site. In this example, the first highest \({IC}_{P}\) and \({IC}_{S}\) are directly indicating the deep bedrock half-space at 350 m depth, whereas the second highest \({IC}_{P}\) and \({IC}_{S}\) are directly indicating the shallow bedrock half-space at 10 m depth. As shown in Eqs. 8 and 9, \({IC}_{P}\) and \({IC}_{S}\) are the main basics for inferring the reliable bedrock half-space to its corresponding peak frequency.

Fig. 8
figure 8

Example of bedrock depth assignment at KYTH08 site

Generalized nonlinear regression relationships of the \({f}_{HVSR}\), estimated using KiK-net sites (\({f}_{HVSR-KiK}\)) and K-NET sites (\({f}_{HVSR-K}\)), and \({f}_{SBR}\) versus their corresponding \(h\) are superimposed as shown in Fig. 9. Significant scattering in the fundamental resonant frequencies at bedrock depths < 100 m is dominant. These strong deviations of the data points would be analyzed based on seismic site classification and bedrock lithology. Applying the quarter wavelength theory in Fig. 9 will result in identification of shallow and deep sites with average overlying S-wave velocities of 100–200 m/s and 400–800 m/s, respectively. This is a theoretical interpretation of the observed relationships of \(f\) versus \(h\), whereas Fig. 10 is showing the delineated average overlying S-wave velocities obtained from the PS logging seismic velocity structures of KiK-net and K-NET. The most interesting point is that these nonlinear regression relationships of \(f\) versus \(h\) are strongly corresponding to the average S-wave velocity overlying the bedrock half-space (\(\overline{{V }_{S}}\), is illustrated in Eq. 11) versus \(h\) (Fig. 10), whereas the bedrock \({V}_{S}\) has insignificant relation with their corresponding \(f\) versus \(h\) (Fig. 10). These results indicate the importance to establish nonlinear regression relationships of \(f\) versus \(h\) depending on seismic site classification, because there is higher dependency on the \(\overline{{V }_{S}}\) than the bedrock \({V}_{S}\).

$$\overline{{V }_{S}}=h/\sum_{i=1}^{n}\frac{{h}_{i}}{{V}_{si}}$$
(11)

where \(h\) is the total thickness of the layers above the bedrock half-space. \({h}_{i}\), \({V}_{si}\), and \({V}_{pi}\) are the thickness, and S-wave velocity, respectively, of each individual layer. \(n\) is the number of these layers.

Fig. 9
figure 9

Fundamental resonant frequency versus bedrock depth obtained from HVSR and SBR of KiK-net sites and from HVSR of K-NET sites. (Nonlinear regression equations from top to bottom are corresponding to HVSR and SBR of KiK-net, and HVSR of K-NET, respectively)

Fig. 10
figure 10

Average overlying and bedrock (Vs) versus h corresponding to f versus h in Fig. 9

Figure 9 shows the highest scattering in the \(f\) versus \(h\) at depths < 100 m. Therefore, this generalized \(f\) versus \(h\) relationship is rebuilt according to different seismic site classifications. Figure 11 shows the \(f\) versus \(h\) relationships in the main four seismic site class of B, C, D, and E. Reasonable fit could be achieved in the \({f}_{HVSR-KiK}\), \({f}_{HVSR-K}\), and \({f}_{SBR}\) versus their corresponding \(h\) relationships. Significant differences are present in the \(f\) versus \(h\) relationships produced from HVSR and SBR analyses.

Fig. 11
figure 11figure 11figure 11

The f versus h and their corresponding average overlying and bedrock VS versus h in B, C, D, and E site class

Remarkable correlations could be seen in the \(\overline{{V }_{S}}\) versus \(h\) and \(f\) relationships as shown in Fig. 11. Poor fitting could be clearly seen in the bedrock \({V}_{S}\) versus \(h\) relationships as shown in Fig. 11. This reflects the strong dependency of the resulted \({f}_{HVSR-KiK}\), and \({f}_{SBR}\) on average S-wave velocity structure of the layers over the bedrock half-space. The bedrock \({V}_{S}\) has insignificant influence on the resulted fundamental resonant frequency. This interesting point concludes that the presence of the bedrock half-space is determined based on the highest \({IC}_{P}\) and \({IC}_{S}\) as explained earlier, and not based on the bedrock \({V}_{S}\).

Table 3 summarizes the fitting curve parameters (\(a\) and \(b\)) with their characterizing maximum \(\overline{{V }_{S}}\) and the bedrock \({V}_{S}\) as depicted from Fig. 11. Among the different lithological groups of different site classes, the high majority of the maximum \(\overline{{V }_{S}}\) is not exceeding 1262 m/s. Moreover, a general gradual decreasing in the maximum \(\overline{{V }_{S}}\) is observed thorough site class of B, C, D, and E. Depending on these observations, different gradual ranges for each site class are proposed as shown in Table 4. These gradual ranges and their ± errors are calculated based on the average values and their standard deviations of the maximum \(\overline{{V }_{S}}\) in Table 3. Moreover, these gradual ranges of the maximum \(\overline{{V }_{S}}\) resemble the ranges of \({V}_{s30}\) in the NEHRP Provision (2000), because the high majority of \({f}_{HVSR-KiK}\), \({f}_{HVSR-K}\), and \({f}_{SBR}\) are corresponding to \(h\) of ≤ 30 m (refer to Fig. 9). This explains the high importance of studying these nonlinear regression relationships considering the seismic site classification.

Table 3 Summary of correlation coefficients (i.e., \(a\) and \(b\)) with their characterizing maximum \(\overline{{V }_{S}}\) and bedrock \({V}_{S}\) results through site class of B, C, D, and E
Table 4 Proposed ranges of the maximum \(\overline{{V }_{S}}\) (refer to Table 3) overlying bedrock half-space according to the present study

It was hoped that the HVSR processing results at K-NET sites would interpret and solve the strong scattering in the \(f\) versus \(h\) relationships at \(h\) of < 20 m, but it is obvious that \({f}_{HVSR-K}\) versus \(h\) relationship has strong scattering too (Fig. 9). The resulted parameters of \(a\) and \(b\) are generally lower than those resulted from the \({f}_{HVSR-KiK}\), \({f}_{HVSR-K}\), and \({f}_{SBR}\) versus \(h\) relationships (Table 3). This underestimation indicates that the ranges of \(h\) and \(f\) have strong influence on the resulted \(a\) and \(b\) coefficients. Also, it is an indication that deep bedrock half-spaces are governing and guiding the overall \(f\) versus \(h\) relationships.

Ibs-von Seht and Wohlenberg (1999) studied sand formations of class C in western Lower Rhine Embayment, Germany. The ranges of \(h\) and \(f\) in this previous study were 15–1600 m and 5–0.1 Hz, respectively, and investigated with application of Nakamura’s technique and transfer function technique using microtremor measurements. Scherbaum et al. (2003) presented relationships of \(\overline{{V }_{S}}\) versus \(h\) of sand formations of class C at three arrays near Pulheim, Chorweiler, and Lülsdorf in Cologne city area, Germany. Their results consider the uppermost 200–300 m analyzing the fundamental mode Rayleigh wavefield in the frequency range of 0.7–2.2 Hz. Therefore, both studies are used in fair comparison with class C of the present study as shown in Table 5.

Table 5 Existing correlation coefficients (i.e., \(a\) and \(b\)) with their characterizing ranges of \(\overline{{V }_{S}}\) and bedrock \({V}_{S}\) results (refer to Table 3)

D’Amico et al. (2008) mapped the \(f\) of the shale sedimentary cover (\(h\)) of class C in the city of Florence, Italy. Their ranges of \(h\) and \(f\) are < 5–153 m and 0.1–20 Hz, respectively. The parameters of \(a\) and \(b\) and their characterizing ranges of \(\overline{{V }_{S}}\) and bedrock \({V}_{S}\) are compared with class C in the present study.

Harutoonian et al. (2013) conducted HVSR microtremor measurements in a dynamically compacted gravelly fill area in Western Sydney, Australia. The ranges of \(h\) and \(f\) are 13.3–1.2 m and 4.2–27 Hz, respectively. Therefore, their parameters of \(a\) and \(b\) and the characterizing ranges of \(\overline{{V }_{S}}\) and bedrock \({V}_{S}\) are fairly compared with those resulted from HVSR processing of K-NET sites belonging to class D.

Finally, resulted ranges of \(h\) and \(f\) from Moon et al. (2019) study are 10–45 m and 2–9 Hz, respectively. They studied the bedrock depths of weathered Bukit Timah granite formation in Singapore using microtremor measurements. The sites used in this study belong to site class of D.

Because of similarities in the site information and measurement parameters of these previous studies and the present study, fair comparison could be achieved considering the site-dependent variability, as shown in Table 5. Interestingly, remarkable similarities could be seen in the parameters of \(a\) and \(b\) and their characterizing ranges of \(\overline{{V }_{S}}\) and bedrock \({V}_{S}\). It is important to note that these previous and present relationships are site-specific and highly dependent on each region’s geologic conditions. Therefore, the target of these comparisons is only to introduce the seismic site classification as a crucial controlling factor in establishing the previous and the present nonlinear regression relationships.

5 Conclusions

With the results discussed earlier, it is possible to retrieve the following conclusions. The present study introduces additional improvements to the study conducted by Thabet (2019). These improvements consist of SBR and HVSR processing using database at the 698 KiK-net and the 1045 K-NET sites. The seismic database includes all the available earthquakes with PGAs of ≤ 10 cm/s2 at each site. Additionally, HVSR and SBR processing using the adopted conjugated records could increase the number of accepted sites due to enhancement in the requirements of SESAME guidelines (Bard and SESAME team 2004). Moreover, the relationships of \(\overline{{V }_{S}}\) versus \(h\) are inversely resembling the relationships of \(f\) versus \(h\), whereas the relationships of bedrock \({V}_{S}\) versus \(h\) have insignificant importance. The shallow and limited ranges of \(h\) and \(f\) underestimate the parameters of \(a\) and \(b\) resulted from \({f}_{HVSR-K}\) versus \(h\) relationship whenever compared with those resulted from the \({f}_{HVSR-KiK}\), and \({f}_{SBR}\) versus \(h\) relationships. This indicates that deep bedrock half-spaces are governing and guiding the overall \(f\) versus \(h\) relationships. Relationships of and \({f}_{SBR}\) versus their corresponding \(h\) are strongly resembling relationships of \({f}_{HVSR-KiK}\) versus \(h\) indicating that SBR analysis is productive and efficient tool to delineate the fundamental resonant frequency. Overall, rebuilding the \({f}_{SBR}\) and \({f}_{HVSR-KiK}\) versus their corresponding \(h\) based on site-dependent variability in different lithologies could strongly decrease scattering and deviations of data points. Remarkable correlations could be seen in the \(\overline{{V }_{S}}\) versus \(h\) relationships reflecting the strong dependency of the \({f}_{HVSR-KiK}\), \({f}_{HVSR-K}\), and \({f}_{SBR}\) on the \(\overline{{V }_{S}}\). The high majority of the maximum \(\overline{{V }_{S}}\) is not exceeding 1262 m/s. The ranges of maximum \(\overline{{V }_{S}}\) are decreasing gradually through site classes of B, C, D, and E resembling the ranges of the Vs30 of the seismic site classification according to NEHRP Provision (2000). Therefore, the presence of the bedrock half-space is determined based on the highest \({IC}_{P}\) and \(h\) without respect to the bedrock \({V}_{S}\). HVSR inversions based on the diffuse field approach could interpret the presence of the bedrock half-space using the criteria of the highest V S and h. Interestingly, fair comparisons based on site information and site-dependent variability of different bedrock lithologies between previous relationships of F versus H in the literature and the present study relationships are showing remarkable and reasonable similarities.

It is important to note that these previous and present relationships are site-specific and highly dependent on each region’s geologic conditions. The main achievement of the present study is only to introduce the seismic site classification as a crucial controlling factor in establishing the previous and the present nonlinear regression relationships of the depth of high seismic reflection coefficient interface versus the peak resonance frequency and the average overlying S-wave velocity. Practically, inverting the layered S-wave velocity structure is possible using known resonance frequency from HVSR analyses, particularly in regions with deployed strong-motion observation system, and consequently reliable inference of the bedrock depth, average overlying S-wave velocity, and seismic site classification. As a result, the usefulness of these present nonlinear regression relationships is the possibility to quickly obtain a general idea of the subsurface lithology. Moreover, using these present nonlinear regression relationships based on known subsurface lithology and seismic site classification could be used to provide an initial guess and to constrain ranges of the depths and average overlying S-wave velocity to feed into the HVSR inversion process.