1 Introduction

In a recent report from the London Workshop on the Future of Statistical Sciences (Madigan et al. 2014), the responsibilities of a statistician were defined as:

  • to design the acquisition of data in a way that minimizes bias and confounding factors and maximizes information content;

  • to verify the quality of the data after those are collected; and

  • to analyse data in a way that produces insight or information to support decision-making

As statistical seismologists, our work tends to emphasize the third point, with little thought given to the others: we often treat earthquake catalogues as collections of perfect measurements, forgetting (or at least neglecting) that a hypocentre location is the uncertain result of an unsolved inversion problem; and we are almost never involved in planning data collection or seismic network design. With this article, we address the second responsibility, which directly supports the third responsibility, future data analyses. In particular, we consider the quality of the data in the earthquake catalogue generated by the SIL seismic network in Iceland.

The SIL network was planned and developed in the framework of the Nordic SIL project in 1988–1994 (Stefánsson et al. 1993; Böðvarsson et al. 1996, 1999). The network was originally installed and operated in South Iceland in 1990 to monitor seismicity in the South Iceland Seismic Zone (SISZ), a sinistral transform zone that crosses southern Iceland. In 1993, it was expanded to the seismic zone in northern Iceland, thereby becoming a national network for Iceland, covering the country’s two main seismic zones. In the following years, the network was gradually expanded along the rift zones that cross Iceland from southwest to northeast (pink colour in Fig. 1a). In 1994, SIL included 18 stations and by the end of 2013 the seismic network had grown to 68 stations (Fig. 1a).

Fig. 1
figure 1

a Schematic representation of the plate boundary in Iceland (modified from Angelier et al. 2004). b Map of the study area. Black dots events in the selected catalogue. EVZ Eastern Volcanic Zone, NVZ Northern Volcanic Zone, KR Kolbeinsey Ridge, RP Reykjanes Peninsula, RR Reykjanes Ridge, TFZ Tjörnes Fracture Zone, SISZ South Iceland Seismic Zone, WVZ Western Volcanic Zone

South Iceland is a rather densely populated farming area with many small towns and several critical infrastructures, such as hydropower plants and associated reservoirs, geothermal power plants, industrial plants and transportation infrastructures. More hydropower plants and reservoirs are planned in the area in coming years. Because of the presence of human population and critical infrastructures, southern Iceland has the highest seismic risk in the country. We would like to build models to mitigate the seismic risk; almost any such model one can imagine requires a seismicity catalogue as input and, therefore, we study the SIL catalogue to obtain a first-order understanding of seismicity in this region and to understand the quality of the catalogue. Moreover, since the year 2000 a major earthquake sequence is ongoing in the study area, where three earthquakes of M W > 6 have already occurred and more events of up to moment magnitude (M W) 7.0 can be expected in the coming years to decades (Einarsson et al. 1981; Decriem et al. 2010). The historical seismicity in this region is well known (Einarsson et al. 1981; Ambraseys and Sigbjörnsson 2000) and many major faults have been mapped on the surface (Einarsson et al. 1981; Bergerat and Angelier 2000; Clifton and Einarsson 2005) and at depth by high-precision locations (HjaltadÓttir et al. 2005; Hjaltadóttir 2009).

Quality check of the seismic catalogue is important because results obtained from a contaminated dataset may be misleading (Gulia et al. 2012). To make such a check, careful identification of man-made changes should represent the first step in any statistical analysis of seismicity. Such changes, for example, redefining the magnitude scale in a catalogue may introduce errors in statistical analyses of seismicity patterns and they mislead researchers by generating spurious apparent variations in the observed seismicity rate (Habermann 1987; Tormann et al. 2010). Our catalogue check emphasizes removing events with very high or even unknown location uncertainty (filtering), removing quarry blasts (cleaning), and re-estimating magnitudes (correcting). After this filtering and corrections to the reported magnitudes, we report a spatio-temporal analysis of South Iceland seismicity based on b value and magnitude of completeness. The revised SIL catalogue is available from us upon request, and it should be used as the starting point for future analyses and modelling of South Iceland seismicity.

2 Tectonic Setting

In this section, we give only a brief summary of the regional tectonics (for more details, see Einarsson 1991).

The complex tectonic setting of Iceland is linked to the eastward shifting of the Mid-Atlantic Ridge axis, a consequence of its westward migration away from the hotspot currently situated under the Vatnajökull glacier (Einarsson 2008). As illustrated in Fig. 1a, this movement gives rise in southern Iceland to two sub-parallel volcanic zones. The first is the continuation of the Atlantic Ridge that comes on shore on the Reykjanes Peninsula (RP) and continues as the Western Volcanic Zone (WVZ). The second, the Eastern Volcanic Zone (EVZ), which continues in northern Iceland as the Northern Volcanic Zone (NVZ), is shifted eastward along the SISZ transform zone. Finally, in northern Iceland the rift shifts back westward along the Tjörnes Fracture Zone (TFZ) to connect with the Kolbeinsey Ridge (KR).

The SISZ is a 10–20-km-wide, EW striking sinistral shear zone, but faulting occurs on sub-parallel, NS striking dextral faults (Hackman et al. 1990; Roth 2004), as demonstrated by the shape of the damage zones of historical earthquakes (Einarsson et al. 1981; Ambraseys and Sigbjörnsson 2000), their mapped surface traces (Einarsson 1991; Clifton and Einarsson 2005) and subsurface fault mapping by high-precision earthquake relocations (Hjaltadóttir 2009).

The RP is a highly oblique spreading segment of the Mid-Atlantic Ridge, oriented about 30° from the direction of absolute plate motion (Clifton and Kattenhorn 2006). There are four distinct volcanic fissure swarms on the peninsula (Jakobsson et al. 1978), with an average strike of 40°N. Similar to the SISZ, large earthquakes on Reykjanes peninsula occur on NS striking faults that intersect the volcanic fissure swarms (Árnadóttir et al. 2004; Keiding et al. 2008; Clifton et al 2003; Antonioli et al. 2006).

3 Data Collection and Magnitude Estimation

The initial earthquake catalogue considered for this study spans the period 1991–2013 and contains 205,016 earthquakes with depth ranging between 0 and 50 km and with M L ≥ −2.0. The selected area includes both the SISZ and the RP and is within the limits 63.75° to 64.15° in latitude and −22.8° to −19.6° in longitude (Fig. 1b). Originally, the incoming seismic data were pre-processed by a computer at the site of each seismic station and transients were detected and defined by onset time, amplitude, and duration. Apparent velocity, azimuth, and spectral parameters were calculated, and this information was packed into short messages and sent to the data centre at the Icelandic Meteorological Office in Reykjavík, where an automatic phase association process defined events and sent requests for waveform data to the stations (Böðvarsson et al. 1996). In 2013–2014, transmission of continuous waveforms was initiated for all the sites and the analysis was moved to the data centre at IMO. Automatic locations and magnitudes of earthquakes are usually available within 1–2 min of their occurrence. These are manually reviewed by analysts at the SIL data centre. This includes readjusting or adding phase arrivals where necessary, relocating events and recalculating magnitudes. This process is a routine operation and special evaluation of larger events is not part of the analysis.

The SIL system uses two methods to estimate each earthquake’s magnitude. The first is based on an empirical local magnitude relationship:

$$M_{\text{L}} = \log_{10} \left( A \right) + 2.1 \, \log_{10} \left( D \right) + 4.8$$
(1)

where A is the maximum velocity amplitude in m/s of high-pass filtered waveforms with a corner frequency (f) at 1.5 Hz and scaled to the response of Lennartz 1.0 Hz sensor and Nanometrics RD3 digitizer, and D is the epicentral distance in km (Gudmundsson et al. 2006).

The other magnitude scale is a “local” moment magnitude scale, M LW that was originally constructed by Slunga et al. (1984) to agree with local magnitude scales in Sweden:

$$M_{\text{LW}} = \log_{10} \left( {m_{0} } \right){-}10,\quad {\text{for}}\, \, M_{\text{LW}} \le \, 2.0$$
(2)

where m 0 is the seismic moment in Nm (Rögnvaldsson and Slunga 1993). For magnitudes above 2.0, the formula is slightly modified so that the slope decreases with increasing magnitude (Pétursson and Vogfjörd 2009). In particular, considering the factor:

$$m = \log_{10} \left( {m_{0} } \right){-}10$$
(3)

the M LW formula becomes:

$$M_{\text{LW}} = m\quad {\text{for}}\,m \le 2.00$$
(4a)
$$M_{\text{LW}} = 2.0 + \left( {m - 2.00} \right) \cdot 0.9\quad {\text{for}}\,2.00 < m \le 3.11$$
(4b)
$$M_{\text{LW}} = 3.0 + \left( {m - 0.89} \right) \cdot 0.8\quad {\text{for}}\,3.11 < m \le 5.11$$
(4c)
$$M_{\text{LW}} = 4.6 + \left( {m + 1.1} \right) \cdot 0.7\quad {\text{for}}\,5.11 < m \le 6.25$$
(4d)
$$M_{\text{LW}} = 5.4 + \left( {m + 2.25} \right) \cdot 0.5\quad {\text{for}}\,6.25 < m \le 7.25$$
(4e)
$$M_{\text{LW}} = 5.9 + \left( {m + 3.25} \right) \cdot 0.4\quad {\text{for}}\,7.25 < m \le 9.25$$
(4f)
$$M_{\text{LW}} = 6.3 \, + \left( {m + 5.25} \right)\cdot0.35\quad {\text{for}}\,9.25 < m$$
(4g)

4 Verifying the Quality of the Data and Revising the Catalogue

In this section, we consider the quality of the SIL catalogue and revise it in three stages: we remove events with very large or unknown location uncertainty (filtering); we identify and delete quarry blasts that were misidentified as earthquakes (cleaning); and we re-estimate magnitudes (correcting). The procedure is illustrated in Fig. 2, including the number of events affected at each stage, and the three stages are described in the following three subsections.

Fig. 2
figure 2

Flowchart that summarizes the filtering, cleaning, and correcting used to produce the revised catalogue. Boxes in the right column the number of events affected by each step

4.1 Earthquake Location Precision: Filtering the Catalogue

Despite all the checks in the SIL system, often earthquakes stored in the catalogue are not well located, have large azimuthal GAP (the largest angle between any two stations that recorded the earthquake), and large uncertainties in latitude, longitude, and depth (blue histograms in Fig. 3). Workers have suggested many methods to filter a catalogue so that those events that are not well located are removed. The most popular of these are the network criteria methods (NCM) discussed by Gomberg et al. (1990), Bondar et al. (2004), and Husen and Hardebeck (2010). These authors proposed to keep only the events with GAP smaller than 180° and with at least 8 arrival times for P and S waves, of which at least one is an S wave arrival time. We applied this filter to the SIL catalogue (red histograms in Fig. 3), but the SIL network is less dense than regional networks elsewhere and, therefore, these criteria seem to be too restrictive, especially for microseismicity.

Fig. 3
figure 3

Histograms showing the distribution of longitude, latitude, and depth errors as well as azimuthal gap before (blue) and after the filtering by the network criteria method (NCM, red) and the Mahalanobis distance method (MDM, green). Note that for the first 3 columns the scale is semi-logarithmic, and in the third row a different scale for error histograms is used

Uncertainties in earthquake locations arise from (unknown) errors in seismic arrival times, network geometry, signal-to-noise ratio, dominant frequency of the arriving phase, and the velocity model used for the location, so any catalogue should include uncertainty estimates for each earthquake’s latitude, longitude, and depth. We used these estimates directly as an alternative to filter the catalogue. As in the NCM, we removed all the earthquakes with GAP higher than 180°. Noting that a histogram of the depth distribution (Fig. 4; left panel) shows spikes at exact depths 0, 1, 3, and 5 km, we also removed all earthquakes without any estimate of depth error, i.e. D E = 0. These represent earthquake locations for which the location program did not converge to a solution, and thus the SIL analyst fixed the depth at 0, 1.3, or 5 km.

Fig. 4
figure 4

Distribution in depth before filtering (blue) and after removing GAP > 180° and D E = 0 (red)

After removing these events with large GAP and artificially fixed depths, we applied a technique commonly used in multivariate analysis to identify outliers: filtering based on Mahalanobis distance (e.g. Everitt 2005, Section 1.4). We treat the uncertainties in latitude, longitude, and depth as multivariate data and compute the Mahalanobis distance for each earthquake; this is the distance of a point x(x 1; x 2; x 3) from the population mean μ(μ 1; μ 2; μ 3):

$$d_{\text{m}} \left( {\vec{x}} \right) = \sqrt {\left( {\vec{x} - \vec{\mu }} \right)^{T} S^{ - 1} \left( {\vec{x} - \vec{\mu }} \right)}$$
(5)

where S −1 is the inverse covariance matrix of the independent variables. Then, if the data distribution is multivariate normal, the squared Mahalanobis distances should follow a Chi-squared (χ2) distribution with p degrees of freedom satisfying:

$$\left( {\vec{x} - \vec{\mu }} \right)^{T} S^{ - 1} \left( {\vec{x} - \vec{\mu }} \right) \le \chi_{p}^{2} (\alpha )$$
(6)

where p is the dimension of the data (in our case, 3) and α is the probability of an equal or larger value. We take α = 0.05, and the corresponding χ2 critical value is 7.82. Consequently any earthquake that has a squared Mahalanobis distances in the tail of the corresponding χ2 distribution (i.e. with value higher than 7.82) is considered an outlier and removed. We call this the MDM (Mahalanobis Distance Method).

The NCM did reduce the mean error in the catalogue (Table 1), but many earthquakes with large uncertainties in latitude, longitude and depth remained (see the second row of Fig. 3 and note the different scale for the second and third rows). On the other hand, the MDM results in a catalogue of earthquakes with a much narrower distribution of location uncertainties. Moreover, the MDM-filtered catalogue contains 153,385 earthquakes compared to the 118,823 using NCM. Because it contains more, and more precise, information, we proceed with the MDM-filtered catalogue. The MDM method is appropriate for an area like South Iceland, where the network is optimized to record microearthquakes. These events are well located despite not being recorded by several seismic stations (e.g. 4–5 seismic stations). The MDM filter could be applied also in areas that are not optimized for microearthquakes detection because it filters based on the uncertainties directly, whereas the NCM filter uses network criteria as a proxy.

Table 1 Range of GAP, and mean and standard deviation of the latitude, longitude, and depth errors during the filtering process

4.2 Analysis of Explosion Contamination–Cleaning the Catalogue

For analysis of seismicity, the SIL catalogue should only contain earthquakes, but sometimes quarry blasts are erroneously identified as earthquakes. These explosion events have low magnitudes and produce an enrichment in the number of small earthquakes; such an increase of microseismicity may be misinterpreted as a change in the natural phenomena (Gulia et al. 2012).

A histogram of the number of seismic events during each hour of the day is one way to identify the presence of blasts. In a contaminated catalogue, the plot is characterized by a peak during daytime hours (Wiemer and Baer 2000). As noted by Gulia et al. (2012), this behaviour is due to the fact that during daytime ambient noise interferes with the detection of earthquakes and a decrease in the number of detected events is generally observed, whereas the quarry-rich areas show the opposite trend. The SIL catalogue, after filtering by Mahalanobis distance, seems to be only slightly contaminated by the presence of explosions. A small peak observed in the activity around noon is likely due to less human noise during lunch hour, when stations are not affected by traffic and industrial activity (Fig. 5a). Moreover, contamination by explosions is found when areas around the cities are analysed in more detail.

Fig. 5
figure 5

a Histogram of the number of events per hour of the day for the catalogue after the application of the Mahalanobis distance filter and example of noise amplitude versus time at the station san, which is located near a quarry. b Map of the ratio between daily and nightly events and histograms of the number of events per hour of the day for suspected areas

To identify blasts, in Fig. 5b we present a map of the ratio of the number of daytime events to the number of nighttime events using the Wiemer and Baer (2000) algorithm. This normalized ratio, Rq, is defined as:

$$Rq = \frac{NdLn}{NnLd}$$
(7)

where Nd is the total number of events in the daytime, Nn in the nighttime, Ld is the number of hours in the daytime period and Ln in the nighttime period. The ratio is calculated using a regularly spaced grid covering the studied area. We used a spatial grid of 0.03° in latitude and 0.07° in longitude that gives approximately equal distance in km, and sampled the 100 nearest events to each node. The quarry contamination map is then obtained using the nearest neighbour gridding method. It is important to highlight that interpolation algorithms may show edge effects, especially when the data are not homogenously distributed. To reduce edge effects, we only identify anomalous behaviour where earthquakes have occurred (see Fig. 1b for earthquakes location).

In the obtained map, as suggested by Gulia et al. (2012), areas with Rq ≥ 1.5 are identified as areas contaminated by explosions. In particular, using interactively selected polygons all the areas with Rq higher than 1.5 were selected and removed. The resulting map shows that the study area is only contaminated by explosions along the Reykjanes peninsula coast, at Helguvík, Hafjarfjördur, east Reykjavík, and in the south near Thorlákshöfn. This result is also supported by the hourly distribution of events for such areas (Fig. 5b).

This analysis led to removal of 502 events that were recognized as explosions, resulting in a catalogue of 152,883 earthquakes.

4.3 Problems with Magnitudes: Correcting the Catalogue

In the left panel of Fig. 6 are the M L versus M LW magnitude values from the filtered and cleaned catalogue. Many events fall above the 1:1 line, indicating that M LW gives magnitude values higher than those obtained using M L. The difference between the two magnitude scales seem to increase for larger earthquakes. This behaviour is probably due to the fact that M L is a better estimate for small earthquakes, but for large earthquakes it underestimates the magnitude because short-period seismometers and high-pass filtering (f > 1.5 Hz) are used. We also note that the theoretical M LW values derived from Slunga et al. (1984) for the magnitude range 1.8 < M LW < 6.6, are larger than global M W, while outside this range M LW is smaller (Fig. 6; right panel).

Fig. 6
figure 6

Left panel M L versus M LW scatter plot in which the red line is a line with slope one and yellow ellipse contains outliers due to erroneous magnitude estimates. Right panel the comparison between the M LW and M W magnitudes

In addition to these systematic discrepancies—i.e. the M L underestimation of large events and the non-linear behaviour of the M LW scale—there are other problems in the SIL magnitude estimates. In particular the two main concerns are: (1) the underestimation of the m 0 of large earthquakes in the routine processing performed to obtain M LW in the SIL catalogue (Rögnvaldsson and Slunga 1993) and (2) the overestimation of magnitude assigned to aftershocks of big earthquakes, due to contamination by the mainshocks.

The main reason for the underestimated m 0 is that for large earthquakes, the routine analysis window around the P and S waves is too short to contain all of the source time function, and the frequency band within which the corner frequency of the source spectrum is searched does not extend to low enough frequencies. Additionally, most of the SIL seismic stations have short-period seismometers (0.2 and 1 Hz) that do not extend to sufficiently low frequencies to allow for undisturbed calculation of seismic moment for the largest earthquakes. The underestimation of m 0 becomes evident for M LW greater than 3.0, which corresponds to an M L of about 2.5.

Table 2 shows that the moment magnitude of larger earthquakes obtained using m 0 in the SIL catalogue (M W-SIL) is severely underestimated compared to the magnitudes (M W-GCMT) reported in the Global Centroid Moment Tensor catalogue (GCMT; Dziewonski et al. 1981; Ekström et al. 2012). Consequently it is not possible to use the seismic moment (m 0 ), which is used for estimating M LW, to directly estimate M W. Another evidence of the problem concerning m 0 estimation is shown in the left panel of Fig. 6 (yellow ellipse), where we highlight outliers characterized by surprisingly large values of M LW given their M L. These are erroneous M LW values assigned to aftershocks of big earthquakes, and the errors are caused by contamination of the waveform from the mainshock. Usually, the overestimation of aftershock magnitudes occurs in the first few minutes after the mainshock. We suspect that the low-frequency component of the mainshock coda waves make the corner frequency appear to be lower than it should be.

Table 2 List of earthquakes in South Iceland for which the GCMT reports a moment magnitude. In the table M W-SIL are the underestimated moment magnitude values obtained by using the seismic moment in the SIL catalogue, named m 0 (SIL), whereas MW-GCMT and m 0 (GCMT) are corresponding values in Global Centroid Moment Tensor catalogue

For all these reasons the magnitude values in the catalogue must be corrected before any statistical analysis. Although the M L estimate is influenced by the high-pass filtering, we prefer to use these estimates to find a method for magnitude correction, because they are more stable than the M LW estimates. Moreover, there are 902 earthquakes in the catalogue with M L higher than 2.5 for which it is necessary to make the correction, and it would be very time-consuming to re-compute the seismic moment for all of these earthquakes.

To correct magnitudes, we used a technique based on peak ground velocity magnitude (M PGV) estimates using an attenuation relationship for South Iceland (Pétursson and Vogfjörd 2009). This relationship uses an empirical form for the decay of peak ground velocity and acceleration (PGV and PGA) with distance based on waveforms from 45 events in southern Iceland in the magnitude range 3.0 ≤ M W ≤ 6.5 and for distances in the range 5–300 km. The relations were scaled with MW and include a non-linear, near-source term. A log-linear approximation to the relation, valid outside the near-source region, is a robust real-time measure of magnitude for events of M w ≥ 3.0. The log-linear part of the equation for velocity is:

$$\log_{10} \left( {\text{PGV}} \right) = - 1.63\log_{10} \left( r \right) + M - 4.88$$
(8)

where r is the epicentral distance in km and M is the moment magnitude. For our study we used Eq. (8) to estimate M PGV through the reverse formula:

$$M_{\text{PGV}} = \log_{10} \left( {PGV} \right) + 1.63\log_{10} \left( r \right) + 4.88$$
(9)

This equation is also used in the Icelandic real-time ShakeMap system (Vogfjörd et al. 2012; Icelandic Meteorological Office website: http://hraun.vedur.is/ja/alert/shake/). This type of approach was used with good results by several authors in others part of the world (e.g. Lin and Wu 2010; Eshaghi et al. 2013).

To calibrate a conversion between M L with M PGV, we used the 45 events used by Pétursson and Vogfjörd (2009) and additional 59 events taken from the Icelandic ShakeMap website (all events are listed in Table 3).

Table 3 List of M L and M PGV used to obtain the conversion law in Eq. (10). MPGV of 45 events was taken from Pétursson and Vogfjörd (2009), while the additional 59 events are computed using Eq. (8)

We fit a generalized orthogonal regression (GOR) model to the data in Table 3 to describe the relationship between M PGV and M L (Fig. 7, left panel). We used GOR rather than standard least-squares because the two magnitude scales have similar uncertainties (Castellaro et al. 2006). The resulting GOR relationship is:

Fig. 7
figure 7

Left panel M L versus M PGV, red line the best fit obtained using generalized orthogonal regression. Right panel residual values of predicted M PGV using Eq. (10) for the earthquakes listed in Table 2

$$M_{\text{PGV}} = \left( {1.215 \, \pm \, 0.031} \right) \, M_{\text{L}} + \, \left( { - 0.373 \, \pm \, 0.110} \right),\quad \, M_{\text{L}} \ge \, 2.5$$
(10)

We used Eq. (10) to correct estimates of M PGV ≈ M W for all the earthquakes with M L higher than 2.5, since those are events for which the seismic moment is thought to be underestimated with respect to the GCMT catalogue. For earthquakes with M L lower than 2.5 we used the SIL seismic moment to estimate M W. After these corrections, we performed comparisons among the derived M PGV, M W-SIL, M W-GCMT and body wave magnitude (m b) from the International Seismological Centre (ISC) catalogue (Storchak et al. 2013). In Fig. 8a (left panel) we present M PGV versus M W-GCMT for 14 earthquakes in South Iceland (Table 4) reported in the GCMT catalogue. We noted that the relationship is nearly 1:1 between M PGV and M W-GCMT. Since 14 events are too few and the M W magnitude range is short, for a better comparison we used also m b for 38 events from the ISC catalogue with magnitudes in the range 3.5–6.0 (Table 4). We fitted a GOR model using M PGV and m b (Fig. 8a, right panel) and then we compared the obtained model with that proposed by Wason et al. (2012). Although the number of events used and the range of magnitudes considered by Wason et al. (2012) are different from those of our analysis, there seems to be a good agreement between the two models (Fig. 8a, right panel). On the contrary, as already discussed concerning the problems with the SIL magnitude estimates, Fig. 8b highlights a poor correlation among the M W-SIL, M W-GCMT and mb listed in Table 4.

Fig. 8
figure 8

Comparisons of the derived M PGV, M W-SIL, M W-GCMT and mb for the earthquakes listed in Table 4. a M PGV versus M W-GCMT left panel and mb vs MPGV right panel, respectively. b M W-SIL versus M W-GCMT left panel and mb vs MW-SIL right panel, respectively. The black lines in the figure are 1:1 lines showing how closely the values match. Blue lines the best fit obtained using GOR, whereas green lines the model proposed by Wason et al. (2012)

Table 4 List of earthquakes plotted in Fig. 8

This procedure allowed us to overcome the problem of underestimated seismic moment for events with M L ≥ 2.5. However, below M L 2.5 there are several earthquakes with poorly estimated local moment magnitude (yellow ellipse in the left panel of Fig. 6), and we also seek to correct these. To correct the erroneous moment magnitudes of earthquakes with M L < 2.5, we divided the earthquakes by ML into bins of ± 0.1 (see Table 5) and for each bin we calculated the mean value and two standard deviations (2σ) of MW (see Table 5), obtained from the SIL seismic moment. For each bin any earthquake having an M W more than 2σ from the mean were considered outliers. For these outliers, we used a linear relationship between the M L central values and M W mean values in Table 5 to estimate a corrected M W:

$$M_{\text{W}} = 0.7654 + 0.6944\, M_{\text{L}} ,\quad \sigma = 0.0326, \quad 0.0\, \le M_{\text{L}} < 2.5$$
(11)

Following this procedure, we corrected the moment magnitude estimates of 5645 earthquakes.

Table 5 M L central value, bin amplitude, mean values of M W and two standard deviations (σ)

After these magnitude re-estimations, we have a revised catalogue that has been filtered, cleaned, and corrected. Again, we refer you to Fig. 2 for a graphical summary. In the next section, we describe an analysis of the revised catalogue, which contains 152,833 earthquakes.

5 Analysis of Seismicity

5.1 Magnitude of Completeness and b value

Estimating the magnitude of completeness, M C, is important for analysing earthquake rate changes, mapping seismicity parameters, forecasting seismicity, and assessing seismic hazard. It is defined as the magnitude above which all events in a given space and time are expected to be detected by a seismic network (Woessner and Wiemer 2005; Schorlemmer and Woessner 2008; Mignan and Woessner 2012).

Most methods for assessing the magnitude of completeness rely on the Gutenberg–Richter (GR; Gutenberg and Richter 1944) relation that describes the frequency of earthquake magnitudes:

$$\log_{10} N\left( { \ge \, M} \right) = a - bM,$$
(12)

where N(≥M) is the number of earthquakes with magnitude at least as large as M. Consequently, M C can be identified from data as the minimum magnitude at which the cumulative frequency magnitude distribution departs from the exponential decay (e.g. Zuniga and Wyss 1995). In this study, we used the maximum curvature method (Wiemer and Wyss 2000) to estimate M C, and we estimated the b value using maximum likelihood (Utsu 1965). Figure 9a shows the results before and after the quality check of the catalogue. There are no strong differences in M C, b value and a values using the two considered catalogues, but the linear part of the GR relation appears to fit the corrected catalogue better (right panel in Fig. 9a). For the investigated area we estimate M C as 0.79 ± 0.05, which demonstrates the SIL system’s ability to reliably detect small earthquakes; the estimated b value is 0.85 ± 0.03.

Fig. 9
figure 9

a Magnitude distribution of the original catalogue (left panel) and of the revised catalogue (right panel). b M C (top panel) and b value (lower panel) as a function of time. Red line the M C and b value estimated from the GR relationship of all events during the considered time period and the black arrows the onset of the June 2000 and May 2008 earthquake sequences

In case of an aftershock sequence or changes to the seismic network, one would expect to see changes in M C and b value. To investigate this, we checked for temporal variations in M C and b value using the maximum curvature technique, a window size of 500 events, a moving window of 50 events, and a 25 % smoothing function (Fig. 9b).

The results show that both the M C and the b value are affected by large earthquake sequences, such as the June 2000 and May 2008 sequences, which both included M W > 6.0 events. The mainshock and the subsequent moderate magnitude earthquakes (M W > 4.0) in the following days, indeed, introduce the largest variation in M C when it reaches values of about 1.4–1.5 (Fig. 9b, upper panel). In the case of b value, strange variations could be seen as for instance starting from 2000 until 2005 for the occurrence of a long aftershocks sequence following the 17 and 21 June 2000 earthquakes (Fig. 9b, lower panel). In particular, in this case the b value increased steadily reaching 1.4 after those events. We also note that the b value exceeds in many periods the value estimated from the GR relationship of all events during the considered time period (Fig. 9b). These results could be related to the behaviour of the GR curve (Fig. 9a). The GR curve has a high slope in the magnitude range 0.79–2.6, whereas the slope is low at magnitudes greater than 2.6. A possible explanation could be an interaction between tectonic and volcanic activity, which in Iceland are tightly linked. A similar result was obtained in Guagua Pichincha volcano (Ecuador) by Legrand et al. (2004). The authors suggested that a non-linear Frequency Magnitude Distribution (FMD) of GR may be understood as the superposition of various processes, such as classic elastic rupture (b value ≈ 1.00) and hydraulic fracturing (b value up to 2.0). Therefore, deviations from linearity in the FMD may be related to temporal and spatial variations in b value as observed in our results.

The spatial variations of the M C and b values for South Iceland were achieved using the common catalogue-based mapping approach of Wyss et al. (1999) and Wiemer and Wyss (2000). The method consists of computing M C and b value through FMD, using a combination of a fixed number of events, N in each node of a spatial grid and the constant radius method, which consists of selecting all earthquakes within a certain distance. This means that if a node in the grid does not meet the requirement of N earthquakes, then those events within distance R from the node are included, to fulfil the requirement. In particular, we used a spatial grid of 0.03° in latitude and 0.07 in longitude and R = 15 km. The fixed number of earthquakes, N to compute the GR was 300 and the minimum number of earthquakes required to estimate M C was 30. Finally the maps were smoothed by using a Gaussian kernel filter with sigma equal to 3.

In Fig. 10a we show the results for the original catalogue; these results seem to be strongly influenced by the explosions near Reykjavík and Reykjanesbaer with very high b values and M C (>2.0). The b value for the revised catalogue is in the range 0.8–1.0 except for the eastern zone, which is characterized by high b value, probably due to a small number of earthquakes (Fig. 10b). The central area of the South Iceland Seismic Zone is characterized by low values of M C (≈0.6–0.8) whereas the eastern and western areas show higher values (>1.0). In particular, the detection capability of the seismic network decreases near the Reykjanes peninsula coast, probably due to the proximity to the city, and in the eastern SISZ where the seismic network configuration is more spread out.

Fig. 10
figure 10

b value and M C maps for the a the original catalogue and b revised catalogue. Black triangle and rectangle the position of Reykjavík and Reykjanesbaer, respectively

5.2 Seismic Rate Analysis

In Fig. 11a, we plot the number of earthquakes as a function of time to explore possible changes in seismicity rate, considering only the earthquakes with magnitude greater than M C (0.79). The trend of the revised catalogue is not markedly different from the original catalogue, although of course the cumulative number of earthquakes is different. Three prominent steps in the curve’s slope are identified corresponding to the April 1994, June 2000, and May 2008 earthquake sequences in the investigated area (dashed line in Fig. 11a). There is a change in slope (i.e. increased average seismicity rate) between mid-1994 and 1999, which is due to an intense volcano-tectonic interaction episode in the Hengill region (Feigl et al. 2000). The effect on the overall trend, caused by the Hengill swarm, is evident by removing all the seismicity around this region (black curve in Fig. 11a) and verifying that the slope becomes fairly regular for the studied period (1991–2013). It is also interesting to observe that the seismicity filter demonstrates more clearly another step in the cumulative curve in addition to those related to the occurrence of strong earthquakes (June 2000 and May 2008). In particular, the step linked to the two earthquakes of November 1998, previously hidden in the Hengill swarm, appears evident (Fig. 11a).

Fig. 11
figure 11

a Number of earthquakes versus time before and after revising the catalogue. b Earthquake density in which stars indicate the areas with high earthquake activities

There is also a noticeable change around 2011. This change in rate may be explained by the fact that the seismic network was improved in the eastern part of the investigated area in late 2010. Distribution of earthquakes in the studied area was also investigated by drawing an earthquake density map (Fig. 11b), as the logarithm of earthquakes per square km, obtained using a spatial grid of 0.03° in latitude and 0.07 in longitude. From this map we can identify the zones with major occurrence of earthquakes. The seismicity is not uniformly distributed, but rather it is concentrated around the major active faults, geothermal centres in the Krísuvík region and the geothermal/volcanic centre in Hengill (Fig. 11b).

6 Concluding Remarks

Twenty years after the installation of the SIL system, we have analysed the South Iceland earthquake catalogue to understand its strengths and weaknesses. In a sense, this is necessary preliminary work for any future study on seismicity forecasting or regional seismic hazard assessment. The revised catalogue is available upon request to the authors.

The main results of this study can be summarized as follows:

  • We explored the consequences of using the network criteria method to filter the SIL catalogue in Iceland. In this region, where the emphasis is on recording microseismicity, but the network is not as dense as in many other parts of the world, we found the filter to be too strict: almost all microearthquakes were filtered out because they did not have enough arrival time picks. As an alternative, we suggest a filter based on the multivariate distribution of hypocentral uncertainties. This filter, based on Mahalanobis distance, results in a catalogue with smaller uncertainties and more events than the network criteria method (75 % of events pass the filter).

  • We searched the catalogue for quarry blasts that were incorrectly identified as earthquakes. We found and removed 502 such events (~0.4 % of earthquakes in the filtered catalogue).

  • We noted and corrected several problems with the local magnitude and so-called local moment magnitude estimates reported in the SIL catalogue. In particular, the magnitude of the largest events is often underestimated, and the magnitude of smaller events following large events was often overestimated.

  • We compared estimates of the magnitude of completeness, b value, and seismicity rate based on the original catalogue and the filtered, corrected catalogue and identified possible explanations for the differences.

This work emphasizes the importance of carefully assessing an earthquake catalogue before proceeding with further analysis, and as such we hope that our revised catalogue will be used for future studies and models of seismicity in South Iceland.