1 Introduction

Ambient seismic noise contains surface waves (Shapiro and Campillo 2004) and body waves (e.g., Roux et al. 2005; Wang et al. 2014a). It is more convenient to extract surface waves from ambient noise, however, because surface waves possess higher energy than body waves. Nowadays, ambient noise is widely used to extract surface waves in research (Shapiro et al. 2005) and engineering (Park et al. 2007; Lin et al. 2013). Rayleigh waves could be extracted in two main methods: spatial autocorrelation method (SPAC) and seismic interferometry. The common features of the two methods have been studied (Prieto et al. 2009; Tsai and Moschetti 2010).

SPAC method was firstly proposed by Aki (1957) with sensors arrayed in a circle. Sensor-array was simplified into line (Chávez-García et al. 2006; Margaryan et al. 2009) and even only two sensors (Morikawa et al. 2004; Chávez-García et al. 2005). It has been demonstrated that reliable Rayleigh-wave phase velocities would be calculated with two sensors using SPAC method (Boschi et al. 2013).

Seismic interferometry, including cross-correlation and cross-coherence, has been used to generate empirical Green’s functions from ambient seismic noise (Weaver and Lobkis 2001; Campillo and Paul 2003). Rayleigh-wave group velocities and phase velocities can be calculated through the empirical Green’s function between two sensors (Shapiro et al. 2005; Yao et al. 2006). Xu et al. (2013) used multichannel analysis of surface waves (MASW) (e.g., Song et al. 1989; Xia 2014) in imaging surface waves by processing the empirical Green’s functions while investigated the near surface.

Love waves could also be extracted from three-component data or transverse component data using seismic interferometry (Nakata et al. 2011; Behm and Snieder 2013). While analyzing the transverse component of empirical Green’s functions, people used to focus on Love waves and neglect Rayleigh waves. In real ambient noise, however, there are not only Love waves but also Rayleigh waves propagating in horizontal directions (Aki 1957). Noise sources were assumed distributed randomly (Arai and Tokimatsu 2004). Because some noise sources are distributed away from the radial direction, the horizontal components of Rayleigh waves which are stimulated by the noise sources could be decomposed to the transverse component. As a result, we could observe both Love waves and Rayleigh waves in the transverse direction (Okada 2003). Through analyzing the transverse component of empirical Green’s functions, both Rayleigh waves and Love waves would be distinguished in the frequency–velocity (f-v) domain.

In this study, we process broadband three-component ambient noise acquired in Xinjiang Province, China, using SPAC and seismic interferometry. Rayleigh-wave phase velocities are calculated through the two methods. We then focus on analyzing transverse component of empirical Green’s functions in theory and practice. The transverse component of empirical Green’s functions is analyzed in the f-v domain using the MASW method. Both Rayleigh waves and Love waves are found in the f-v domain as theory indicates. At last, we discuss influence of Rayleigh waves existing in the transverse component data.

2 Data Acquisition

Ambient seismic noise data were recorded in western Junggar Basin near Karamay city, China, in this study for reconnaissance of mineral resources and delineating deep-seated structures (Liu et al. 2014; He et al. 2015). The study site (Fig. 1) includes deserts, a river, roads, and Karamay city. Deserts are the main observation site, where there are few human activities. A river and several roads cross the survey line, which is plotted with red triangles (Fig. 1). Although ambient noises may be due to the flowing river and moving cars on the roads, the main noise source is from the city where people’s activities and traffic noise are strong enough to produce seismic noise in 1–5 Hz (Wang et al. 2014b).

Fig. 1
figure 1

A site map of the study field. Each red triangle represents an observation point. A river and roads are pointed out with black line. The insects show the geographical location of the study site in China with the black star

Karamay downtown is located 5 km away from the east end of the survey line. The survey line is 70 km long and about 76° from the north to the east (NE). The entire survey line was covered by 35 broadband three-component seismometers (120 s to 50 Hz) with a 1 km interval in three rounds. The 71 observation points in total were named W00, W01 to W70 (red triangles in Fig. 1). Each round was overlapped with a distance about 17 km and lasted around 16 days. Seismometers recorded ambient seismic noise in vertical, north and east directions at a 100 Hz sampling rate.

3 Calculating Rayleigh-Wave Phase Velocities

Vertical component of ambient noise is dominated by Rayleigh-wave energy. Rayleigh-wave phase velocities have always been calculated by processing the vertical component data (Aki 1957; Lin et al. 2008). We process the vertical component data by referring to the procedures proposed by Bensen et al. (2007) in three steps: data preparation, cross-coherence, and phase velocity measurement.

3.1 Data Preparation

Means and trends of raw data are removed in data preparation. To remove waves outside the target frequency band (0.5–5 Hz), band-pass filter between 0.1 and 15 Hz is applied to the data. Then the data are cut into same window-length (1 h) sections. Large events (such as earthquakes) are removed using temporal normalization with an iterative water-level normalization algorithm discussed by Bensen et al. (2007). We set three times of standard deviation as the water-level, and down-weight amplitude above it iteratively until the entire waveform is below the water-level. Using this method, the amplitude could be fully preserved if there are no big earthquakes in raw data (Fig. 2). Even if earthquakes are obvious in raw data (Fig. 3a) and after band-pass filtering (Fig. 3b), they will disappear through the temporal normalization (Fig. 3c, d).

Fig. 2
figure 2

Data before and after temporal normalization. a After a mean and a trend are removed from raw data, the data are band-pass filtered between 0.1 and 15 Hz. b The data in a are temporal normalized

Fig. 3
figure 3

The record containing earthquakes before and after temporal normalization. a The record is processed as Fig. 2a. b The data in a are band-pass filtered between 0.5 and 5 Hz. The signal of earthquakes still remains in the band. c The data in a are temporal normalized. The signal of earthquakes disappears. d The data in c are band-pass filtered between 0.5 and 5 Hz. The earthquakes are still invisible. So the signal of earthquake is removed fully

3.2 Cross-Coherence

Cross-coherence is calculated between two sensors (for example, A and B). We calculate the spectral cross-coherence H AB,i of ith time-window section with following equation:

$$H_{AB,i} = \frac{{u_{A,i} u_{B,i}^{ * } }}{{\left| {u_{A,i} } \right|\left| {u_{B,i} } \right|}}$$
(1)

where u A,i is spectrum of sensor A’s ith time-window record and asterisk denotes a complex conjugate. Here we adopt cross-coherence instead of cross-correlation, because cross-coherence possesses better ability to reduce the influence of random noise (Nakata et al. 2011). Equation 1 indicates that cross-coherence could be recognized as pre-whitening plus cross-correlation in the frequency domain (Prieto et al. 2009). The cross-coherences H AB,i are stacked to obtain final results H AB with following equation:

$$H_{AB} = \sum\limits_{i}^{N} {H_{AB,i} }$$
(2)

where N is the total number of time-window sections.

3.3 Phase Velocity Measurement

Phase velocity could be calculated from the final result H AB in the frequency domain and the time domain. In the frequency domain, the real part of averaged cross-coherence is SPAC coefficient \(\overline{\rho } \left( {\omega ,r} \right)\) (Fig. 4a) (Chávez-García et al. 2005; Ekström et al. 2009) and the SPAC coefficient is equal to \(J_{0} \left( {\frac{\omega r}{c\left( \omega \right)}} \right)\) as following equation (Aki 1957; Okada 2003):

$$\overline{\rho } \left( {\omega ,r} \right) = {\text{Real}}\left( {\frac{1}{N}H_{AB} \left( \omega \right)} \right) = J_{0} \left( {\frac{\omega r}{c\left( \omega \right)}} \right)$$
(3)

where J 0 is the Bessel function of the first kind, zero order, r is distance between two sensors (A and B), and c(ω) is phase velocity at frequency ω. Phase velocity could be calculated using the frequency of zero crossings (Ekström et al. 2009; Luo et al. 2012), crests and troughs (Aki 1957) in the SPAC coefficients with equation

$$c(\omega_{n} )= {\frac{{\omega_{n} r}}{{z_{n} }}}$$
(4)

where ω n denotes the frequency of the nth observed zero crossing, crest or trough and z n denotes the nth observed zero crossing, crest or trough of J 0. In practice, because random noise in spectrum could cause missed or extra zero crossings, crests or troughs, a set of estimated c m (ω n ) based on Eq. 4 are calculated

$$c_{m} (\omega_{n} )= {\frac{{\omega_{n} r}}{{z_{n + 2m} }}}$$
(5)

where m equals 0, ± 1, ± 2,…, indicating the number of the missed or extra ones (Ekström et al. 2009).

Fig. 4
figure 4

Calculating Rayleigh-wave phase velocities through SPAC method. a The real part of averaged spectral cross-coherence between W37 and W43. b Triangles represent calculated results of Eq. 5. Upward triangles represent crests in a, and downward triangles represent troughs in a. According to the range of Rayleigh-wave phase velocity (from 3 to 2.5 km/s, between 1 and 5 Hz in this area), triangles near the blue line are appropriate and selected as Rayleigh-wave phase velocities. The blue line represents the smooth results of the selected phase velocities

We choose crests and troughs instead of zero crossings to calculate phase velocity, because crests and troughs could be detected from random noise with their high amplitude. Using Eq. 5, a set of phase velocities could be calculated (Fig. 4b). Because Rayleigh waves dominate in the vertical component data, the calculated phase velocities are mainly due to Rayleigh waves. We identify fundamental mode Rayleigh-wave phase velocities (triangles near the blue line in Fig. 4b) within a reasonable range (from 3 to 2.5 km/s, from 1 to 5 Hz in this area).

Stacked cross-coherence H AB (t) in the time domain could be used to generate empirical Green’s function G AB (t) (Snieder 2004; Lin et al. 2008) with

$$G_{AB} (t) = - \frac{{{\text{d}}H_{AB} (t)}}{{{\text{d}}t}}0 \le t < \infty .$$
(6)

This step could also enhance signal at high frequencies (Sabra et al. 2005a, b).

Green’s function between two sensors could be recognized as shooting at one sensor, recording at the other. The vertical component of empirical Green’s functions between a certain and several other sensors could be formed as a common-shot gather (Fig. 5a), where surface waves can be observed clearly. The pseudo shot-gather is transformed into the f-v domain using high-resolution linear Radon transform (Luo et al. 2008). A continuous energy trend can be seen in the f-v domain (Fig. 5b), and phase velocities could be picked by following the energy peak.

Fig. 5
figure 5

a The vertical component of empirical Green’s functions (band-pass filtered between 0.5 and 5 Hz) between W30 and W30 to W45 are arrayed. b The pseudo shot-gather in a (where W30 is a virtual shooting point, W37 to W43 are receivers) is analyzed in the f-v domain. The energy trend is calculated using high-resolution linear Radon transform (Luo et al. 2008) and represents Rayleigh waves. Black dots represent results calculated through SPAC method and gray squares represent results calculated by Yao’s method (Yao et al. 2006)

It is well known that the imaginary part of spectral Green’s function for Rayleigh waves has the shape of a Bessel function in a homogeneous medium (Tsai and Moschetti 2010; Boschi et al. 2013). So seismic interferometry and SPAC could generate similar phase velocity in a homogeneous medium. Particularly, in this study, Rayleigh-wave phase velocities obtained through the two methods show good consistency (Fig. 5b).

The reliability of the obtained Rayleigh-wave phase velocities is further ensured by comparing the obtained and results of another method (Yao et al. 2006). The Rayleigh-wave phase velocities estimated using Yao’s method also agree well with results of seismic interferometry and SPAC (Fig. 5b). It is obvious that we could obtain similar Rayleigh-wave phase velocities through these three different methods.

4 Calculating Love-Wave Phase Velocities

4.1 Theory

People usually extract Love waves from the transverse component that is perpendicular to the direction of a great-circle path or a line in small scale through observation sensors in ambient seismic noise (Lin et al. 2008; Nakata et al. 2011). The transverse component data contain Rayleigh waves and Love waves (Okada 2003). If the transverse component data of two sensors (for example, A and B) are processed with seismic interferometry, the equation of seismic interferometry in the frequency domain (here we use cross-correlation as example) is as followings:

$$\begin{aligned} & \sum\limits_{i}^{N} {u_{A,i} u_{B,i}^{*} } \\ & \quad = \sum\limits_{i}^{N} {\left[ {(u_{A,R,i} + u_{A,L,i} )(u_{B,R,i}^{*} + u_{B,L,i}^{*} )} \right]} \\ & \quad = \sum\limits_{i}^{N} {(u_{A,R,i} u_{B,R,i}^{*} + u_{A,R,i} u_{B,L,i}^{*} \,+ u_{A,L,i} u_{B,R,i}^{*} + u_{A,L,i} u_{B,L,i}^{*} )} \\ \end{aligned}$$
(7)

where u A,R,i and u A,L,i indicates the spectrum of Rayleigh waves and Love waves in the transverse record within sensor A’s ith time-window section. Here we only deal with Rayleigh waves and Love waves included in the transverse component data and neglect other irregular or weak noise. Rayleigh waves and Love waves are assumed to be mutually uncorrelated (Okada 2003; Cho et al. 2006). So Eq. 7 can be further derived as

$$\begin{aligned} & \sum\limits_{i}^{N} {\left( {u_{A,R,i} u_{B,R,i}^{*} + u_{A,R,i} u_{B,L,i}^{*} + u_{A,L,i} u_{B,R,i}^{*} + u_{A,L,i} u_{B,L,i}^{*} } \right)} \\ & \quad \approx \sum\limits_{i}^{N} {\left( {u_{A,R,i} u_{B,R,i}^{*} + u_{A,L,i} u_{B,L,i}^{*} } \right)} \\ \end{aligned}$$
(8)

Rayleigh waves and Love waves exist in the cross-correlation of the transverse component data (Eq. 8). The two kinds of waves also exist in cross-coherence, because whitened cross-correlation is equivalent to cross-coherence (Prieto et al. 2009); it has been proven that there are the two kinds of waves in the real part of spectral cross-coherence for the transverse component (Aki 1957; Okada 2003; Cho et al. 2006). So the transverse component of empirical Green’s function also includes the two kinds of waves.

4.2 Practice

To demonstrate this phenomenon, we analyze the transverse component of empirical Green’s functions in the f-v domain and distinguish Rayleigh waves and Love waves.

The transverse component of ambient seismic noises is acquired through observation in transverse direction (Nakata et al. 2011) or recovering from north and east components (Lin et al. 2008; Behm and Snieder 2013). Here we rotate north and east components (N and E) into the transverse component (T) with following equation:

$$T = N\cos \theta - E\sin \theta$$
(9)

where θ represents the angle between direction of the survey line and east direction (Fig. 6).

Fig. 6
figure 6

Illustration of how transverse (T) component is defined between two sensors (A and B)

By processing the transverse component data with the same procedures of seismic interferometry discussed in the previous sections, we could achieve empirical Green’s functions. The empirical Green’s functions are arrayed to form a pseudo shot-gather (Fig. 7a). The dispersion image in the f-v domain presents two dispersion energy trends. The two energy trends suggest two kinds of dispersive waves which possess different velocities at each frequency (Fig. 7b). The two kinds of surface waves will be demonstrated to be Rayleigh waves and Love waves with determined Rayleigh-wave and Love-wave phase velocities in the followings.

Fig. 7
figure 7

a The transverse component of empirical Green’s functions (band-pass filtered between 0.5 and 5 Hz) between W33 and W33 to W48 are arrayed. b The pseudo shot-gather in a (where W33 is a virtual shooting point, W37 to W43 are receivers) is analyzed in the f-v domain using high-resolution linear Radon transform (Luo et al. 2008). Black dots represent the Rayleigh-wave phase velocities calculated in Fig. 4b; gray squares represent the forward calculated Love-wave phase velocities discussed in Sect. 4.2

Rayleigh-wave phase velocities obtained using vertical component data through SPAC (black dots in Fig. 7b) are very close to the lower-velocity energy trend. It indicates that the lower-velocity energy trend represents Rayleigh waves. So the energy of Rayleigh waves exists in the transverse component of empirical Green’s functions.

With the Rayleigh-wave phase velocities (Fig. 5b), we could obtain a multi-layer shear-wave velocity model (Xia et al. 1999) as the inversion result, then Love-wave phase velocities (gray squares in Fig. 7b) can be calculated through forward modeling (Schwab and Knopoff 1970). The forward calculated Love-wave phase velocities agree with velocity of the higher-velocity energy trend, indicating that the higher-velocity energy trend is due to Love waves.

As Fig. 7b shows, the two energy trends are close to each other. So these two trends would merge and probably be misidentified as one trend (the energy trend in Fig. 8) with methods (such as Yao et al.’s method 2006) that neglect Rayleigh waves exist. The misidentified energy trend would be the mixture of Love waves and Rayleigh waves, instead of true Love waves. As a result, slightly lower Love-wave phase velocities and an inversion result with lower shear-wave velocity would be obtained.

Fig. 8
figure 8

Love waves imaging using Yao’s method (2006) with the empirical Green’s function between W37 and W43. Black dots represent the Rayleigh-wave phase velocities calculated in Fig. 4b; gray squares represent the forward calculated Love-wave phase velocities discussed in Sect. 4.2

5 Discussion

5.1 Assessing Bias in Phase Velocities

There would be bias in measured surface-wave phase velocities from the real ones due to uneven ambient energy noise distribution (Yang and Ritzwoller 2008). The bias is found to be slight and below 5 % (Yao and van Der Hilst 2009). The bias could be smaller if more noise sources were within the Fresenel zone where plane waves propagate with an angle aside the direction of a survey line (the angle is from 0° to 28.9° and from 331.1° to 360° in a homogeneous media), because energy of noise sources in the Fresenel zone controls recovering empirical Green’s function (Yao and van Der Hilst 2009). Besides noise sources distributed in the Fresenel zone, long time recording and 2D array could also reduce the bias in measured surface-wave phase velocities (Aki 1957; Yao and van Der Hilst 2009).

In our study site, the main noise source is found to be the Karamay city based on the results of beamforming analysis (Wang et al. 2014b). Because the city is distributed in the Fresenel zone of the survey line (Fig. 1), the obtained Rayleigh-wave and Love-wave phase velocities are reliable. Inverted multi-layer shear-wave velocity models are also reliable due to the stability of inversion (Xia et al. 1999).

5.2 Calculating Real Love-Wave Phase Velocities

Methods that neglect Rayleigh waves exist would misidentify Love-wave phase velocities dealing with the transverse component of ambient seismic noise in this paper. Those methods are based on one empirical Green’s function between two sensors. Waveforms of surface waves in the empirical Green’s function are important for calculating phase velocity of the surface waves (Yao et al. 2006; Lin et al. 2008). Because the phase velocities of Rayleigh and Love waves are close at high frequencies (>0.1 Hz) and distance between two sensors in this paper is short (e.g., 6 km), arrival times of the two kinds of waves are very close. As a result, the waveforms of Love waves and Rayleigh waves would be merged (Fig. 9). There is only one peak of waveforms (red line in Fig. 9) in the transverse component near the peak of Rayleigh waves in the vertical component (black line in Fig. 9). So the waveforms in the transverse component must include the two kinds of waves. Information of Love waves may not be identified in the waveforms of single empirical Green’s function so that true Love-wave phase velocities could hardly be calculated. Love-wave phase velocities, which are slightly lower than the real ones, could be obtained using those methods at high frequencies in this study.

Fig. 9
figure 9

The transverse component (red line) of empirical Green’s function extracted from the transverse component data and the vertical component (black line) of empirical Green’s function extracted from the vertical component data between W37 and W43. Both are band-pass filtered between 0.5 and 5 Hz, and normalized

Those methods (Yao et al. 2006; Lin et al. 2008) based on one Green’s function could only calculate fundamental Rayleigh-wave or Love-wave phase velocities, but MASW is based on several Green’s functions. And MASW could image different surface waves, like fundamental and higher-mode surface waves, in the f-v domain simultaneously (Xia et al. 2003). So we adopt MASW to obtain Love-wave phase velocities in the frequency band (0.5–5 Hz). As a result, Rayleigh waves and Love waves could both be imaged (Fig. 7b). Love waves at high frequencies are important in investigating the shallow geology while the distance between sensors is limited to several hundred meters, even meters. MASW is suitable for calculating Love-wave phase velocities in investigating the near surface. MASW does require dense networks of sensors, and this requirement could be easily met in investigating the shallow geology.

It does not mean that true Love-wave phase velocity could not be calculated using those methods (Yao et al. 2006; Lin et al. 2008) in other situations. When ambient seismic noise in a low frequency band (e.g., 0.01–0.1 Hz) is researched and the distance between sensors is several hundred kilometers, the velocities of Rayleigh and Love waves are in large difference. In this situation, the arrival times and waveforms of the two kinds of waves could be distinctively separated from each other.

6 Conclusions

Rayleigh waves and Love waves both could be extracted from the transverse component data using seismic interferometry, because horizontal components of ambient seismic noise possess the two kinds of waves. We demonstrate it in both theory and practice.

Rayleigh waves and Love waves can be imaged in the f-v domain after processing the transverse component data in this paper. Rayleigh-wave and Love-wave phase velocities are so close at high frequencies that the energy of two waves could merge in the f-v domain. If we neglect Rayleigh waves, the merged energy trend may be misidentified as Love waves. Consequently, we would calculate Love-wave phase velocities which are slightly lower than the real ones in this study, which results in a multi-layer model with lower shear-wave velocities after inversion.

We recommend, in investigating the shallow geology, using MASW with the high-resolution linear Radon transform (Luo et al. 2008) to calculate real Love-wave phase velocities from the results of seismic interferometry.