Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

1.1 Why Single Station/Sparse Network?

Regional and local seismic moment tensors are an important observation for various applications, such as spatio-temporal analysis of earthquake sequences, volcanic studies, monitoring of geothermal facilities, and tomographic studies. In order to avoid errors, the moment tensors must be determined precisely. However, this objective can be challenging, especially in the regional/local distance range, where waveforms are more complex than in the teleseismic distance range.

Usually, translational ground motion observations from a network of stations are used to determine the seismic moment tensor. The more data are available and the better the stations are distributed around the source in distance and azimuth (thus sampling the entire radiation pattern), the better is the resolution of the determined moment tensor. However, in inaccessible environments such as deserts or arctic regions it is not always possible to install a large network. Even worse, in planetary seismology often only single observation sites can be instrumented (e.g. InSight: Banerdt et al. 2013). The examination and understanding of the structure of other planets such as Mars and Moon are hampered because most seismological methods are based on station networks. Establishing entire networks on other planets is logistically and economically impossible, at least for now. Therefore, the question remains if and to what extent the full seismic moment tensor is resolvable from data of just one single station or a sparse network of maximum three stations.

1.2 Difficulties in the Single Station Approach

Theoretically, the three components of translational motion recorded at a single location should be sufficient to unambiguously determine the full seismic moment tensor in the teleseismic distance range (Ekstöm et al. 1986). Converted into an event-centric coordinate system (ZRT) the different wave types P, SV, and SH can be separated on the three components. Their amplitude ratios then provide the necessary information about the mechanism. However, in the regional distance range waveforms are more complex and more strongly influenced by noise. In addition, the inversion is performed on surface waves rather than on body waves. Especially for shallow sources this approach results in difficulties for the resolution of specific moment tensor components (e.g. Bukchin 2006; Bukchin et al. 2010). Therefore, the single station approach almost always fails in the regional distance range.

1.3 Rotational Ground Motions and Outline

Rotational ground motions are a relatively new observable, providing an additional three out of twelve components that completely describe the motion of a measurement point. So far, this observable has often been ignored, since compared to the traditional three components of translation it is much smaller (up to a factor \(10^{-9}\)) and suitable measurement devices were not available. Only recently, sensitive portable broadband measurement devices based on fibre-optic techniques are available (Bernauer et al. 2018).

Rotation \(\vec {\omega }\) is the curl of the wave field defined through a linear combination of the space derivatives of the translation vector \(\vec {u}\):

$$\begin{aligned} \begin{pmatrix} \omega _x \\ \omega _y \\ \omega _z \end{pmatrix} = \frac{1}{2} \begin{pmatrix} \partial _x \\ \partial _y \\ \partial _z \end{pmatrix} \times \begin{pmatrix} u_x \\ u_y \\ u_z \end{pmatrix} = \frac{1}{2} \begin{pmatrix} \partial _y u_z - \partial _z u_y \\ \partial _z u_x - \partial _x u_z \\ \partial _x u_y - \partial _y u_x \end{pmatrix} \end{aligned}$$
(1)

where \(\times \) denotes a vector product. Measured at the Earth’s surface, the horizontal components of rotation are commonly known as tilt. Rotational ground motions provide us with information about the vertical displacement gradient which is not available from conventional seismic stations on the Earth’s surface.

In an earlier study, we could numerically show that data of a network based on six component measurement devices (3 translations + 3 rotations) considerably improves the resolution of the full moment tensor when performing a Bayesian (i.e. probabilistic) waveform inversion while keeping the total number of components constant (Donner et al. 2016). Motivated by the promising results of this earlier study, here, we evaluate the benefits of rotational ground motions for single or sparse station configurations.

In addition to the Bayesian inversion using the set-up of the former study, we also show the analytical solution for a double-couple moment tensor in a homogeneous space for translational and rotational ground motion. We will use the analytical solution to discuss under which geometries the reliable retrieval of the seismic moment tensor has the highest probability.

2 The Test Case Scenario and Applied Method

In an earlier study, we designed a synthetic earthquake scenario in Iran to analyse the performance of an entire network for waveform inversion to retrieve seismic moment tensors (Donner et al. 2016). The motivation was to examine the benefits of rotational ground motion measurements for the moment tensor retrieval. For this purpose, we have calculated Greens functions for two scenarios: one based on the real station distribution of the Iranian broadband network and one based on a fictional grid of stations covering the whole country. In this study, we continue with the fictive grid of stations and evaluate the cases when only one, two, or three stations are available. In the following, we briefly review the set-up of the experiment and the Bayesian, i.e. probabilistic, inversion approach.

2.1 Set-Up of the Synthetic Test Case

In Fig. 1 the blue star marks the location of the synthetic earthquake in north-central Iran. It is an almost vertical strike-slip event with a magnitude \(M_w4.0\) at a depth of 6 km. Its double-couple (DC) and isotropic (ISO) parts have 80% and 15%, respectively. The full moment tensor is given by

$$\begin{aligned} M_{kj} = \begin{pmatrix} -0.75 &{} -0.16 &{} 0.41 \\ &{} 1.17 &{} 0.26 \\ &{} &{} 0.02 \\ \end{pmatrix} \end{aligned}$$
(2)

The theoretically observed waveforms as well as the Greens functions are calculated using a spectral-element solver based on a 1-D structural model for the Alborz mountains (Fichtner et al. 2009; Donner et al. 2013). Gaussian noise was added to the synthetic seismograms to render them more realistic. For this study, since only the relative noise levels are relevant, we applied a noise of 10% of the maximum amplitude and evaluate translation and rotation data separately.

The fictional stations are distributed equally covering almost the whole surface of Iran (Fig. 1). There is a grid of 21 by 31 stations along latitude and longitude, respectively. The Greens functions are calculated for depths between 2 and 20 km in 2 km steps for frequencies up to 0.1 Hz to include surface waves, which are usually used for inversion in the regional distance range.

Fig. 1
figure 1

Study area in Iran. The blue star marks the position of the synthetic \(M_w4.0\) earthquake. The mechanism is shown in the lower left with black lines and grey shaded areas being the nodal planes of the double-couple part and the full moment tensor solution, respectively. Black dots show the grid of theoretical stations

2.2 Probabilistic Waveform Inversion for Seismic Moment Tensors

Probabilistic, i.e. Bayesian, inversion has the advantage that it results in a solution class instead of one single solution. A selection of “the best” solution according to a more or less subjective criterion is omitted. Thus, Bayesian inversion provides an unbiased measure of resolution and possible trade-offs.

As in our former studies, we use the Bayesian inversion approach based on equations from Tarantola (2005):

$$\begin{aligned} \sigma (\mathbf{m}) = k \rho (\mathbf{m}) L(\mathbf{m}) \end{aligned}$$
(3)

where \(\sigma (\mathbf{m})\) and \(\rho (\mathbf{m})\) are the posterior and prior probability density functions (pdf) of the model parameters, respectively, k is a normalization constant, and \(L(\mathbf{m})\) is the likelihood function providing a measure of how well a model \(\mathbf{m}\) fits the data.

The inversion is performed for seven parameters, i.e. the six moment tensor components and centroid depth, in the time domain according the forward problem for the seismic point-source moment tensor (Aki and Richards 2002):

$$\begin{aligned} u_n(\mathbf{x},t) = M_{kj} \cdot G_{nk,j}(\mathbf{x},t-\tau ) \end{aligned}$$
(4)

with \(u_n\) is the ground motion at observation point \(\mathbf{x}\) and time t, \(M_{kj}\) the components of the seismic moment tensor, and \(G_{nk,j}\) the spatial derivatives of the Green’s function components. \(\tau \) is the origin time of the source. Einstein summation convention was applied to facilitate readability of the equation.

The posterior pdf is approximated from the homogeneous prior pdf by testing one million start models by applying the Metropolis-Hastings algorithm (e.g. Metropolis et al. 1953; Hastings 1970). Intuitively spoken, this algorithm performs a random walk through the model space. Based on Eq. 4 waveforms for the model candidate are calculated and compared to the theoretical observations. Depending on whether or not the model candidate improves the data fit, it will be added to the solution class or not.

For an objective estimate on the information content of the posterior relative to the prior pdf we use Shannon’s measure of information gain:

$$\begin{aligned} I(\rho ,\sigma ) = \int \rho (x) \log \left[ \frac{\rho (x)}{\sigma (x)} \right] dx . \end{aligned}$$
(5)

Due to the logarithm base 2 in Eq. 5, the unit of the information gain is termed a bit. For further details on the set-up, on the method and Shannon’s measure of information gain, we refer to our earlier studies (Bernauer et al. 2014; Donner et al. 2016; Reinwald et al. 2016) and to (Tarantola 2005).

3 Results

In the following, we show the results of the inversion of the theoretically observed waveforms for the test case shown in Fig. 1 and described in Sect. 2. We tested three scenarios with only one, two and three stations. For each scenario, we performed 670 inversions where we first inverted only translational ground motions and then repeated the 670 inversions inverting translational and rotational ground motions. For each inversion, the station(s) are randomly selected from the grid of possible stations. For each scenario, we show the generalised result over all 670 inversions along with the result of the inversion with the best outcome.

3.1 Single Station

Figure 2 shows the generalised result of the 670 inversions based on data of only one station. Each sub-plot represents one of the seven parameters inverted for. The x-axes show the value of Shannon’s measure of information gain, while the y-axes show the Gaussian kernel density estimations (kde), i.e. the marginal probability density function over the information gain. The higher the information gain, the better is the resolution of the parameters inverted for. Coral and green functions give the results for inversion of translational data only and translational with rotational data, respectively. The same colour-coding is used for all following figures.

Fig. 2
figure 2

Gaussian kernel density estimations (kde) for the information gain corresponding to 670 inversions for the single station case. Coral and green distributions show the results for inverting translational ground motions only and translational with rotational ground motions, respectively. The higher the information gain, the better is the resolution of the parameters

No matter which of the stations within the grid is used, four out of the seven parameters almost never benefit from the inclusion of rotational ground motions. These are the centroid depth and the moment tensor components \(M_{xy}\), \(M_{xz}\), and \(M_{yz}\). In contrast, the other three components \(M_{xx}\), \(M_{yy}\), and \(M_{zz}\) can benefit remarkably. Especially the component \(M_{zz}\) almost always benefits from the inclusion of rotational ground motions, no matter which station is used for the inversion.

Fig. 3
figure 3

Inversion result for the single station case for the inversion with highest information gain (0.72/1.48 bit for 3/6 components). The station used for this inversion is marked by a red dot on the map. The red bars show the real value of the target solution. Blue dotted lines are the homogeneous prior pdf. Coral and green curves show the posterior pdf for inversion of translations only and translations plus rotations, respectively. Numbers in the upper right corner give increase of information content relative to the prior according to Shannon’s measure

The question remains whether the real moment tensor solution was determined. Figure 3 shows the result of only one of the 670 inversions, the one with the highest resulting information gain. For each of the parameters inverted for the posterior pdf kde’s are plotted over the parameter space. In the upper right corners the information gains are given. The vertical red bars show the real values of the target solution. Most of the parameters benefit only slightly from the rotational ground motions, except for the moment tensor component \(M_{yz}\) which shows a substantial improvement. For the components \(M_{yy}\), \(M_{xy}\), and the centroid depth the highest probability is not at the real value of the target model. When considering the rotational ground motions, the highest probability for centroid depth is different from the true value. Despite this, the overall benefit of the rotational ground motion for inversion based on a single station, i.e. the increase in information gain, is 105% (Fig. 3, \((1.48-0.72)*100/0.72\)).

3.2 Two Stations

Figure 4 shows the generalised result of the 670 inversions based on data from two stations. Now, compared to the single station case, the green kde functions (6-component data) of the components \(M_{xy}\), \(M_{xz}\), and \(M_{yz}\) are slightly shifted to higher values for information gain compared to the coral kde functions. These shifts indicate that these components have a higher probability to be determined reliably when including rotational ground motions to the inversion based on two stations. In addition, the probability to estimate the correct centroid depth is increased. For the remaining three moment tensor components the benefits due to rotations are also more pronounced.

Fig. 4
figure 4

Generalized inversion results for the two station case. Colour-coding is the same as in Fig. 2

Fig. 5
figure 5

Inversion result for the two station case for the inversion with highest information gain (0.98/2.73 bit for 3/6 components). The stations used for this inversion are marked by red dots on the map. Colour-coding is the same as in Fig. 3

The result of the inversion with the highest resulting information gain (0.98/2.73 bit for 3/6 components) based on data from two stations is shown in Fig. 5. Almost all moment tensor components show a much higher probability to be determined correctly when including rotational ground motions, except the component \(M_{xy}\). By far the biggest improvement can be seen for the moment tensor component \(M_{xz}\). The most critical parameter is still the centroid depth, which again shows a low probability to be resolved. The overall increase in information gain is 178%, almost 75% more than for the single station case.

3.3 Three Stations

Figure 6 shows the generalised result of the 670 inversions based on data from three stations. As expected, the shift to values of higher information gain of the green kde’s (6-component data) compared to the coral kde’s (3-component data) is much more distinct for all moment tensor components compared to the two station case. The overlap between coral and green curves is greatly reduced, implying that for almost all station distributions, the inclusion of the rotational ground motion data results in a much higher probability of producing in the correct solution.

Fig. 6
figure 6

Generalized inversion results for the three station case. Colour-coding is the same as in Figs. 2 and 4

Fig. 7
figure 7

Inversion result for the three station case for the inversion with highest information gain (1.20/3.38 bit for 3/6 components). The stations used for this inversion are marked by red dots on the map. Colour-coding is the same as in Figs. 3 and 5

The result of only one inversion showing the best result (i.e. highest information gain) based on three stations is shown in Fig. 7. Now, there is even a significant probability to retrieve the correct moment tensor from translational data alone (coral functions). However, the probability is 181% higher when including the rotational ground motions. Especially the components including depth dependencies (\(M_{xz}\), \(M_{yz}\), and \(M_{zz}\)) and the component \(M_{xy}\) benefit most from including rotational data into the inversion. The probability of retrieving the correct centroid depth is still low.

4 Discussion and Conclusion

In this study, we evaluated the retrieval of seismic moment tensors under the assumption of having data from only one, two, or three stations with both translational and rotational ground motions. The study is based on a theoretical scenario in Iran already used in an earlier study, where we evaluated the performance of a network under consideration of rotational ground motions (Donner et al. 2016). For each station case, we have performed a Bayesian, i.e. probabilistic, inversion once only using translational and once using translational plus rotational ground motions. We have repeated these inversions 670 times, always randomly selecting station(s) out of a grid of theoretical station locations (Fig. 1). Next to the generalized results based on these 670 inversions (Figs. 2, 4 and 6), we also showed the results of the inversion with the highest information gain (Figs. 3, 5 and 7).

The results show that it might be difficult to fully retrieve the moment tensor from one station alone. Figure 3 shows that even when including the rotations, the probability to retrieve the true values for the inversion parameters is not very high. Nevertheless, the increase of information gain due to the rotations is still 105%. Here, we tested the inversion for the full moment tensor. There might be a chance to retrieve at least the mechanism of an earthquake from data of a single station when applying constraints to the inversion. Finally, when including data of a second and third station the probability to retrieve the correct full moment tensor is very high. The overall increase of information gain is 178% for two and 181% for three 6-component stations. The generalisations of the two and three station case in the Figs. 4 and 6 show that this is the case for a wide range of station distributions. Especially for planetary seismology, this is an important result. It might be logistically and economically possible to bring two or three instruments to another planet such as Mars or Moon, even though the locations for these instruments would be determined according to practical rather than scientific criteria.

Centroid depth is the only parameter which in all station cases has a very low probability of being resolved. This effect is a general problem in waveform inversion for moment tensors. Especially, in the regional distance range, where inversion is done on surface waves, and for shallow earthquakes depth is a very critical parameter (e.g. Bukchin 2006; Bukchin et al. 2010). Our test case scenario includes exactly these difficulties. Therefore, the unsatisfying results for centroid depths are to be expected.

The question remains what the cause for the benefit from rotational data is. As can be seen from Eq. 1, rotational ground motions are linear combinations of the space derivatives of the translational ground motions. Measured at the Earth surface, their horizontal components correspond to tilt. Therefore, rotational ground motions include information about the velocity gradient with depth which is not available from translational ground motions. However, the comparison is not completely fair. When including the rotational ground motions, we have doubled the number of data (3 versus 6 traces for the single station case). Thus, the improved inversion results may be partly due to higher amount of information. However, in our former study based on an entire network, we kept the number of data in both cases the same (by cutting the number of stations in half when including rotational ground motions) and still received greatly improved inversion results. Therefore we argue that the increased amount of data is responsible only for a small portion of the improvement, if at all.

Already in 1986, when waveform inversion for moment tensors was newly established as a standard in the teleseismic distance range, Ekstöm et al. successfully inverted waveforms of the 1985 \(M\,8.1\) Michoacan, Mexico, earthquake from only three components of translational data for the deviatoric moment tensor (i.e. isotropical part is constrained; \(ISO={{\mathrm{tr}}}\,M_{ij}=0\)). They emphasised the necessity of a very good signal-to-noise ratio (SNR), a recording system with a broad frequency response and large dynamic range (not standard at that time), a well known velocity model, and exclusion of antipodal event-station geometry. Nowadays, three out of these four criteria are fulfilled in the teleseismic distance range and strong enough earthquakes result in a good SNR. Due to the well developed global seismic network it is now also possible to find an appropriate station, which meets the geometry criterion. However, in the regional distance range this approach is hampered due to more complex waveforms, less well known velocity models, and a limited number of stations. Jiménez et al. (1989) have extended the approach of Ekstöm et al. (1986) to regional and moderate-sized (i.e. \(M_S\approx 4.5 - 5.5\)) earthquakes. They further limited the inversion to the double-couple part of the moment tensor (i.e. tectonic part; \(DC=\det M^{dev}_{ij}=0\)) and emphasized the need for path corrections using the dispersion properties of Love and Rayleigh waves. Fan and Wallace (1991) used a similar approach and found a trade-off between structure and depth in that a reliable mechanism might come with a “wrong” depth due to inaccuracies in the assumed structural model. Additionally, they stated that the noise level should be 10% as a maximum to retrieve a reliable mechanism. The results of these studies based on only three components of translation combined with the results of the Bayesian inversion in this study are promising. We are confident that having six components available there are chances to at least determine the deviatoric moment tensor reliably from a single station measurement, if not the full moment tensor.

The velocity model needed to calculate the Greens functions used for inversion is the more crucial point in the entire process. In the regional distance range, a larger proportion of the seismic energy travels through the crust than in the teleseismic distance range (where the more homogeneous deeper structure of the Earth is more important). Thus, the structural details of the crust have a stronger influence on the waveforms and therefore cause the waveform complexities. In planetary seismology, where the structures are less well known than on Earth, the same difficulties will be faced. Sampling an additional part of the seismic wave field by including rotational ground motions into the inversion can compensate for these issues, at least partly.

Fig. 8
figure 8

Radiation pattern for the synthetic earthquake scenario in Iran (Fig. 1). The plots are in the event-centric cartesian coordinate system (ZRT). Top row shows radial, transverse, and vertical component of the velocity energy, respectively, while the bottom row shows the same for rotation. The mechanism of the theoretical earthquake is shown at its location. Blue, green, and turquoise dots show the station(s) used for the inversions based on one, two, and three stations, respectively; inversion results are shown in Figs. 3, 5 and 7

Next to the velocity model, the event-station geometry is of great importance when restricted to only a few stations. In Fig. 8 we show the radiation pattern of the theoretical earthquake for the six components of translation and rotation. Each sub-plot shows the maximum energy for this specific earthquake mechanism and for each of the theoretical stations of the grid for the component mentioned above the sub-plot. Blue, green, and turquoise dots show the station(s) used for the best inversions for the single, two, and three station case shown in Figs. 3, 5, and 7. The energy is calculated as the square root of the sum of squared amplitudes divided by the maximum amplitude of the trace: \(E = \sqrt{\sum {tr_i^2}}/A_{max}(tr)\).

When comparing the locations of the stations used within the radiation pattern of the different components, it is clearly visible that they lead to significant amplitude ratios between the different components. Inverting only waveforms of the three translational components results in only three amplitude ratios. The additional three components of rotation provide further three amplitude ratios of rotation plus further nine amplitude ratios of translation versus rotation. Therefore, the position within the radiation pattern at which the station(s) are located is better constrained. Not the single traces themselves but rather their amplitude ratios contribute to the resolution of the moment tensor solution. Therefore, we think that it might be possible to at least derive the pure deviatoric mechanism from only one station, i.e. adding constraints to the waveform inversion. This argumentation is supported by the evaluation of the analytical solution for the radiation pattern of a double-couple source in a homogeneous medium in Appendix A.

In summary, there are two main crucial points when inverting waveforms for moment tensors from sparse networks: a limited knowledge of the underlying velocity model and the question where in the radiation pattern the station(s) is/are located. Including rotational ground motion data can at least partly compensate for these limitations. First, by sampling another property of the wave field, and second, by adding more input data, in the form of more components or local wavefield gradients. We see potential to greatly improve the performance of existing seismic networks with respect to waveform inversion for moment tensors by adding just one single rotational sensor to the network. In regions where the same mechanism occurs regularly, it is possible to find the most advantageous station position with respect to the radiation pattern.