Introduction

The study of short-term spatiotemporal seismicity clustering properties constitutes a powerful tool for earthquake forecasting. These properties are being investigated via the development of statistical models, combining well known laws of seismology, such as the Omori (1894) and Gutenberg and Richter (1949) laws. The Epidemic Type Aftershock Sequence (ETAS) is the clustering model applied to large extent for describing short-term seismicity. Introduced by Ogata (1988), this model considers the temporal seismicity properties only, and later it was also extended in their spatial clustering features (Ogata 1998). Although ETAS is the most frequently used clustering model, alternative formulations also exist that describe and study the spatiotemporal clustering seismicity properties (e.g., Console and Murru 2001; Rhoades and Evison 2004; Marzocchi and Lombardi 2008).

The basic assumption upon which these models are formulated is that each earthquake, even if it is considered as foreshock, mainshock or aftershock, is capable to produce its own offspring events, depending on the previous earthquake into a time and space window, according to certain scaling relations (e.g., the aftershock productivity law).This capability has led to a large number of practical applications focusing on the characteristics of the short-term seismicity clustering properties worldwide, such as in Japan (Ogata 2011), California (Helmstetter et al. 2006; Field et al. 2017), Taiwan (Zhuang et al. 2005) and Italy (Lombardi et al. 2010; Zhuang et al. 2018).

Over the years a large number of studies were also focused on the retrospective and/or the prospective forecast of both the aftershock evolution soon after the occurrence of large main shocks and the large earthquake occurrence using small to moderate past earthquakes (Zhuang et al. 2008; Console et al. 2007; Marzocchi and Lombardi 2009; Murru et al. 2009, 2014; Ogata et al. 2013; Savran et al. 2020).The wide applicability of the candidate short-term clustering models has allowed the objective testing of their performance by the international infrastructure of Collaboratory for the Study of Earthquake Predictability (CSEP; Jordan 2006). CSEP is using reproducible prospective testing experiments to assess the performance of the candidate forecast models, in terms of the expected number of earthquakes above a specific magnitude threshold in a certain spatiotemporal window, among other metrics (e.g., Zechar et al. 2010; Rhoades et al. 2011; Schorlemmer et al. 2018).

The present study aims at the examination of the short-term seismicity clustering pattern in the Eastern Aegean Sea area via the application of the spatiotemporal model proposed by Console and Murru (2001), namely the Epidemic Type Earthquake Sequence (ETES) model, in an earthquake catalog covering the period (2008–2016; learning period) and the consistent checking of the final estimated model by the accomplishment of retrospective forecast tests in the last period since 2017 (testing period), in which the sequences of 2017 Mw = 6.4 Lesvos, 2017 Mw = 6.6 Kos and 2020 Mw = 7.0 Samos earthquakes are included. The verification process is performed by comparing the occurring earthquakes above certain magnitude thresholds and those that are expected according to the model.

The study area constitutes part of the back arc Aegean area (Fig. 1), exhibiting high seismic moment rates (Papazachos et al. 1997a), and frequent occurrence of large earthquakes. These characteristics offer the opportunity for thorough studies concerning the short-term clustering features of seismicity. Console et al. (2006) applied a retrospective performance test using a spatiotemporal clustering models (both short and long term) using an earthquake catalog for the period 1981–2002 and showed that both models were performing substantially better than time-independent forecast models. Gospodinov et al. (2015) and Mangira et al. (2020) applied the temporal ETAS model and a spatiotemporal clustering model, respectively, in the central Ionian Islands area, to fit and later on to test the performance of their respective models in a retrospective way. Additional recent studies were focusing either only on temporal properties of specific seismic excitations (e.g., Mesimeri et al. 2018) or both the time and space clustering features of the Greek seismicity (Kourouklas et al. 2020).

Fig. 1
figure 1

The active boundaries (solid yellow lines) and their relative motions (red arrows) in the broader Aegean Sea area. The study area is denoted with the red rectangle. The epicenters of the 12 June 2017 Mw = 6.4 (Event 1) and the 20 July 2017 Mw = 6.6 (Event 2) earthquakes occurred in Lesvos and Kos islands are depicted by yellow stars, while the one of the Mw = 7.0 30 October 2020 Samos earthquake by a red star (Event 3)

Study area and data

The subduction of the oceanic lithosphere of Eastern Mediterranean under the continental Aegean microplate is the leading mechanism of the active deformation of the Aegean region (Papazachos and Comninakis 1971). This latter process forms the Hellenic Arc, along with the extensional back arc region of the Aegean Sea due to the roll back of the subducted plate (LePichon and Angelier 1979). The North Aegean Trough (NAT) is at the northern boundary of the Aegean microplate, representing the continuation of the westward prolongation of North Anatolian Fault (NAF) into the Aegean Sea (Fig. 1). The study area (marked by the red box in Fig. 1) constitutes the easternmost part of the Aegean Sea. The active deformation is mainly expressed by complex normal fault populations with a total extensional rate equal to 6 mm/yr (McClusky et al. 2000).

The earthquake catalog used in the current study is taken from the regional catalog of Geophysics Department of the Aristotle University of Thessaloniki (Aristotle University of Thessaloniki 1981; http://geophysics.geo.auth.gr/ss), by considering the recordings of the Hellenic Unified Seismological Network (HUSN) and includes crustal (h ≤ 40 km) earthquakes with Mw ≥ 2.5 that occurred from 1 January 2008 to 10 November 2020. The magnitudes of the catalog are expressed in moment magnitude scale (Mw) or equivalent Mw based on scaling relations proposed by Papazachos et al. (1997b). The area covered by the catalog is slightly larger than the study area (at least 0.2° at all dimensions), for avoiding any possible boundary effect in the application of the clustering model.

The completeness magnitude is investigated with the Goodness-of-Fit method (GFT; Wiemer and Wyss 2000), considering the 95% confidence level of residuals (Fig. 2a). Even though the first magnitude bin with less than 5% of residuals was found equal to M = 2.9 (residuals = 4.54%), the completeness magnitude, Mc, was considered equal to 3.1 (Mc = 3.1), with the minimum value of residuals equal to 2.75%, resulting to a data set with 2332 earthquakes of M ≥ 3.1. The selection of the magnitude bin of 3.1, which performs the least residual percentage according to the GFT method, is made to ensure the most reliable and secure data sample for the model application. This is clearer from Fig. 2b, focusing on to the incremental number of earthquakes (orange circles), where between the magnitude bins of M = 2.9 and M = 3.1 the earthquake frequency is not increasing steadily. From the Mc = 3.1 and beyond the magnitude distribution appears consistent.

Fig. 2
figure 2

Magnitude of completeness, Mc, calculation via the Goodness-of-Fit (GFT) method for the period 1 January 2008 to 10 November 2020. a The percentage of residuals (100-R) between the observed frequency-magnitude distributions and the ideal synthetic power law as a function of the minimum magnitude cut-off of the catalog. The red triangle indicates the magnitude bin with the least residual value (2.75%). b The Magnitude-Frequency Distribution (MFD) of the incremental and the cumulative number of events (orange circles and blue squares, respectively). The red straight line represents the MFD part above the magnitude of completeness

The b-value is estimated via the maximum likelihood method (Aki 1965) and takes a value of b = 1.10 ± 0.005, and the total seismicity rate above Mc, a, is found equal to 6.77 (a = 6.77) after applying the Gutenberg–Richter law to the data above Mc. The b-value standard error was computed with the method proposed by Shi and Bolt (1982). The fit of the Gutenberg–Richter law using the estimated seismic parameters b and a to the final dataset (Fig. 2b) reveals a deficit of moderate to large earthquakes, namely in the magnitude range 5.5 ≤ Mw < 6.4, during the period 2008–2020.

The study area is characterized by moderate seismicity rates with small to moderate magnitudes during the period since 2008 until the May of 2017 (Figs. 3 and 4), with only some swarm excitations like the one occurred southeastern of the Samos Island during 2009 with largest earthquake magnitude equal to Mw = 5.1 (Tan et al. 2014). During June and July of 2017, two large (Mw ≥ 6.0) earthquakes occurred in the northern and southern borders of the study area, respectively. Specifically, on the 12th of June 2017 a large (Mw = 6.4) main shock (Event 1 in Fig. 1 and Table 1) occurred offshore the south-east coast of Lesvos Island. The mainshock was followed by many aftershocks (about 900 events, with M ranging from 1.2 to 5.2), with the largest of them being equal to Mw = 5.2 (Papadimitriou et al. 2018). Aftershocks concentrated in an area striking NW–SE and extending about 40 km, parallel to the south-east coasts of Lesvos Island. A second large main shock with Mw = 6.6 occurred on the 20th of July 2017 (Event 2 in Fig. 1 and Table 1) offshore, north of the Kos Island, strongly felt and damaged cities on the Island and the town of Bodrum in the western Turkey coastline. The aftershock activity extended on both sides of the mainshock epicenter with many aftershocks within two weeks after its occurrence (Ganas et al. 2019).

Fig. 3
figure 3

Cumulative number of earthquakes (left y axis; blue line) and daily earthquake rate (right y axis; orange line) with magnitudes above Mc (Mc = 3.1) during the period 1 January 2008 to 10 November 2020 versus time

Fig. 4
figure 4

Epicentral distribution of the crustal (0 km ≤ h ≤ 40 km) earthquakes occurred in the study area with magnitudes above Mc (Mc = 3.1) during the period 1 January 2008 to 10 November 2020. Small magenta, moderate green and large orange circles represent the 3.1 ≤ Mw < 4.0, 4.0 ≤ Mw < 5.0 and 5.0 ≤ Mw < 6.0 earthquakes. Yellow stars are depicting the Mw ≥ 6.0 earthquakes, whereas the red star the Mw = 7.0 earthquake

Table 1 Strong (Mw ≥ 6.0) earthquakes focal parameters occurred in the Eastern Aegean Sea since 2017

Earthquake rates decreased within the study area soon after the extinction of the second 2017 strong earthquake sequence to values like the pre 2017 excitations level during the years 2018–2020, until the occurrence of the 2020 Samos (Mw = 7.0) main shock (Fig. 3), which occurred offshore the northern coasts of Samos island (Event 3 in Fig. 1 and Table 1). The daily earthquake rates were significantly increased (Karakostas et al. 2020), exhibiting the largest values observed during the whole study period. A large number of aftershocks occurred immediately after the main shock, with 6 of them having magnitudes 4.0 ≤ Mw < 5.0 within only 15 min. Aftershocks activity occupied an area extending more than 50 km in an E–W direction offshore the northern coasts of Samos island (Fig. 4).

In summary, Eastern Aegean seismicity could be divided into two distinctive periods of activity, one with moderate occurrence rates until the first half of 2017 and a second one dominated by the occurrence of the seismic sequences in 2017 and 2020. These two periods constitute the duration of the two subcatalogs of our investigation. The first subcatalog comprises earthquakes that occurred from 2008/01/01 to 2016/12/31, accounting for the learning period (N = 1197 earthquakes), and will be used for the parameters estimation of the clustering model. The seismic sequences are included into the second subcatalog (testing period catalog), lasting from 2017/01/01 to 2020/11/10 and including 1135 earthquakes, which will be later used for the retrospective forecasting tests using the estimated model parameters of the learning period.

ETES model formulation

The short-term clustering features of seismicity are investigated with the application of the Epidemic Type Earthquake Sequence (ETES) stochastic model. The ETES model had been formulated by Console and Murru (2001) and further improved and discussed by Console et al. (2003). In this section, a brief outline including its main principles is presented.

Seismicity on short-scale is considered clustered in time and space. On short scale oftentimes the terms foreshock, main shock and aftershock are arbitrarily used, and their distinction is difficult, therefore, the adoption of an appropriate model where events are not labeled is necessary. Earthquakes are considered as elements in a self-exciting point process called Hawkes’ process. Each one of them corresponds to a point with coordinates (x, y, t, m) corresponding to the event’s location, time, and magnitude, respectively. Depth is ignored not only for simplicity but also due to the uncertainties associated with its computations.

The fundamental idea of the model is the potential triggering of every event by the previous ones and its capability to trigger subsequent events in line with their relative time and space distance. Seismicity is the summation of background spontaneous and offspring events as it is depicted in the form of the expected rate density, λ,

$$\lambda \left( {x,y,t,m} \right) = f_{r} \lambda_{0} \left( {x,y,m} \right) + \mathop \sum \limits_{i = 1}^{N} H\left( {t - t_{i} } \right)\lambda_{i} \left( {x,y,t,m} \right)$$
(1)

with \(H\left( t \right) = \left\{ {\begin{array}{*{20}c} {0,\quad {\text{if}} t \le 0} \\ {1,\quad {\text{if}} t > 0} \\ \end{array} } \right.\)


where \({f}_{r}\) is the ratio of the earthquakes that are considered independent, defined as the “failure rate”, \({\lambda }_{0}\left(x,y,m\right)\) is the background seismicity, \({t}_{i}\) is the occurrence time of ith event and N is the total number of earthquakes and \({\lambda }_{i}\left(x,y,t,m\right)\) is the function expressing the contribution of the previous events according to the magnitude, distance and time of the triggering earthquake (Console and Murru 2001; Console et al. 2003).

The first term of the summation refers to the time-independent seismicity and the second term to the time-dependent “induced” seismic activity. This relation reveals the concept of the ETES model, i.e., the assumption that no earthquake should be considered absolutely independent or dependent on a particular previous event. On the contrary, every earthquake is connected based on different weights to all previous ones as well as to the background seismicity.

The long-term background seismicity, \({\lambda }_{0}\left(x,y,m\right)\) is computed when ignoring the interactions between earthquakes under the assumption that the Gutenberg–Richter law holds. It takes the form

$${\lambda }_{0}\left(x,y,m\right)={\mu }_{0}\left(x,y\right)\beta {e}^{-\beta \left(m-{m}_{0}\right)},$$
(2)

where \({\mu }_{0}\left(x,y\right)\) is the background seismicity spatial kernel density of events with magnitude m ≥ mo and \(\beta =bln(10)\). The background spatial density \({\mu }_{0}(x,y)\) is computed with the method of Frankel (1995) by means of a smoothing algorithm developed by Console et al. (2010). According to this algorithm, earthquakes are weighted by the probability of being independent, as it is also suggested by Zhuang et al. (2002). Then, the weights are adapted through an iterative process.

Regarding the single contribution of the previous events, the term \({\lambda }_{i}\left(x,y,t,m\right)\) comprises the time, magnitude and space distribution, as follows

$${\lambda }_{i}\left(x,y,t,m\right)=Kh\left(t-{t}_{i}\right)\beta {e}^{-\beta \left(m-{m}_{i}\right)}f\left(x-{x}_{i},y-{y}_{i}\right),$$
(3)

where K is a constant parameter, namely the productivity coefficient, \(h(t)\) and \(f(x,y)\) are the temporal and spatial functions. The modified Omori law (Ogata 1983) describes the time dependence

$$h\left( t \right) = \left( {p - 1} \right)c^{p - 1} \left( {t - c} \right)^{ - p} ,\quad p > 1,$$
(4)

where c and p are the parameters of the point process. For the spatial component of the triggered earthquakes several forms have been proposed (Zhuang et al. 2002, 2004, 2005; Ogata & Zhuang 2006; Console et al. 2006).

A function \(f(x-{x}_{i},y-{y}_{i})\) with circular symmetry around the point \(\left({x}_{i}, {y}_{i}\right)\) is adopted in this study following Console et al. (2003). It can be written in polar coordinates (r, θ)

$$f\left( {r, \theta } \right) = \frac{{\left( {q - 1} \right)}}{\pi }\frac{{d^{{\prime 2\left( {q - 1} \right)}} }}{{\left( {r^{2} + d^{\prime 2} } \right)^{q} }},$$
(5)

where r is the distance between points \(\left(x, y\right)\) and \(\left({x}_{i}, {y}_{i}\right)\), q is a free parameter, \(d\prime = d_{0} e^{{\alpha \left( {m_{i} - m_{0} } \right)}}\) where \({d}_{0}\) is the characteristic triggering distance for an event with magnitude \({m}_{i}\) and \(\alpha\) is a free parameter.

The ETES model parameters (K, \({d}_{0}\), q, α, c and p) are estimated through the maximum-likelihood estimation method. It should be noted that the parameter \({f}_{r}\), i.e., the fraction of the events that are considered independent is not free. It is restricted by the fact that the two parts of Eq. (1), the observed and expected number of events are equal.

Application of ETES model for the learning period (2008/01/01 to 2016/12/31)

The initial step, which is a prerequisite for the estimation of the model’s parameters, is the estimation of the correlation distance, C, of the epicentral distribution of the earthquakes in the learning period. The correlation distance could be considered as an indicator of the spatial variability of seismicity, i.e., showing the spatial interrelation of the earthquakes. The correlation distance is then used to estimate the smoothed distribution of the background seismicity. This estimation is made by the application of a smoothed seismicity algorithm proposed by Console et al. (2010) using the spatial kernel method of Frankel (1995). In this way, the learning period catalog is divided into two almost equal parts, the first part with 601 earthquakes and the second part with 596 earthquakes, and various correlation distances are tested assuming the Poisson model using the second part over the first part and vice versa (Fig. 5). The optimal correlation distances, C1 and C2, are then selected in accordance with their respective log likelihood values and found equal to C1 = 15 km and C2 = 17 km. The final correlation distance, C, for the entire learning period is the average value of C1 and C2, namely equal to 16 km (C = 16 km).

Fig. 5
figure 5

a Likelihood of the Poisson model against the correlation distance, C, in km of the second part of the catalog of the learning period over the first part as obtained from the smoothing approach. b Same as (a) for the first part of the catalog of the learning period over the second part

The estimation of the ETES model parameters is implemented via an iterative procedure described in detail by Console et al. (2010). As a first step the maximum likelihood estimation method is applied to the initial smoothed seismicity distribution, which is obtained from the average correlation distance of 16 km (Fig. 6a). These estimates (K, \({d}_{0}\), q, α, c and p, along with fr; Table 2) are then used for the calculation of the probability of an event to belong either to the time-independent or the triggered seismicity and a new weighted smoothed distribution is estimated based on these probabilities (Fig. 6b). A new set of parameters is estimated by means of the weighted distribution and the procedure was repeated until the optimal set is found (i.e. the set which has the maximum likelihood value over all the iterations).

Fig. 6
figure 6

Smoothed seismicity of the Eastern Aegean Sea during the learning period (1 January 2008 to 31 December 31) as obtained from the smoothing approach using the average correlation distance (C = 16 km) for the initial catalog (a) and the weighted one (b) after the parameters estimation of the model

Table 2 ETES parameters estimate for the Eastern Aegean Sea using the learning period (2008/01/01 to 2016/12/31) dataset

In our case the optimal set of the ETES parameters is found at once, since the second iteration results in a lower likelihood value. However, a third round of the iterative procedure is made to ensure the convergence of the parameters’ estimation. The optimal set of parameters is shown in Table 2. The proportion of the expected background events over the total number of events is found equal to 0.502 (fr = 0.502), showing that half of the learning period earthquakes belong to the background seismicity. Productivity coefficient K (K = 0.105) is found to be rather high compared with previous studies (e.g., Mangira et al. 2020), showing that small earthquakes with magnitudes above the Mc are capable to produce their own triggered events. The estimated productivity parameter α (α = 0.419) value is about half of the typical one (α≈0.8) as found for Mainshock–Aftershock sequences (de Arcangelis et al. 2016, among others) This is probably attributed to earthquakes included in the learning period, which is dominated by small to moderate magnitude earthquakes rather than by strong (e.g., Mw ≥ 6.0) ones. This implies that the ability of the offspring events production is also controlled by moderate magnitude events and not only by the stronger ones (Fig. 7a).

Fig. 7
figure 7

Number of direct offsprings versus the corresponding triggering magnitude (a) and the temporal aftershocks decay (b) versus time, using the obtained parameters of the ETES model

The estimated temporal parameters of the modified Omori law, the c and p for the learning period (c = 8.893e−3 and p = 1.012) result in an effective time interval of about 10 days in which the aftershock occurrence rate is high. After this time, the occurrence probability of aftershocks significantly diminishes in relation to their rate being almost equal to zero (Fig. 7b). The value of the characteristic triggering distance, d0, (d0 = 2.541 km) of an event with M = Mc combined with the productivity parameter, α, which included a spatial triggering response function, scaled to an event magnitude, indicates that a M = 5.0 earthquake will affect a circular area with a radius of about 16 km, whereas a Mw = 6.0 earthquake affects an area of almost 40 km radius with direct aftershocks. The value of spatial decay of offspring events, q, is found equal to 1.932 (q = 1.932).

Although the estimated model parameters are based on a learning period catalog dominated by small to moderate events, which exhibit a more swarm like behavior rather than a Mainshock–Aftershock one, the validation of the model using the testing period data is an interesting task that will be investigated in the next section.

Retrospective testing of the model with the 2017 M w = 6.4 Lesvos, 2017 M w = 6.6 Kos and 2020 M w = 7.0 Samos earthquake sequences

As previously mentioned, the duration of the learning period lasts from the 1st of January 2008 until the 31st of December 2016, while the testing period starts on the 1st of January 2017 and terminates on the 10th of November 2020. The testing period, includes the 2017 Mw = 6.4 Lesvos and the 2017 Mw = 6.6 Kos aftershock sequences and terminates soon after the initiation of the 2020 Mw = 7.0 Samos sequence, for which a retrospective test will be performed on the model capability to forecast the daily aftershock rate. For a direct comparison of the model performance, it is necessary to use the same set of parameters.

The testing period for the 2020 Samos seismic sequence is chosen to be between the 1st of September and the 10th of November 2020. For every day of the testing period daily occurrence probabilities are calculated for aftershocks with various magnitude thresholds. The absence of foreshocks is reflected in the calculations since, before the occurrence of the mainshock, the values of the occurrence probabilities for earthquakes with M ≥ 3.5 are almost stable and about 5 × 10–2 and for events with M ≥ 4.5 about 3 × 10–3. For this reason at the midnight of the last day, approximately 12 h before the occurrence of the Mw = 7.0 main shock, the occurrence probability for an event with M ≥ 6.5 in the entire study area was found equal to 1.1 × 10–5. The daily occurrence probabilities for events with M ≥ 3.5 and M ≥ 4.5 for several days before and after the mainshock are presented in Fig. 8a. A large difference is observed between the calculations before and after the main shock occurrence (red dotted line). Particularly, earthquakes with M ≥ 3.5 are expected to occur with a probability almost equal to 1 for the days after the mainshock. The daily probabilities of an event with M > 4.5 decrease gradually after the 31st of October since zero events in that magnitude range are observed after that date.

Fig. 8
figure 8

Occurrence probability per day of at least one event with M ≥ 3.5 and M ≥ 4.5 with magenta and orange lines, respectively, for several days before and after a the 2020 Mw = 7.0 Samos earthquake, b the 2017 Mw = 6.6 Kos earthquake, c the 2017 Mw = 6.4 Lesvos earthquake. The red dotted lines coincide with the mainshocks

A large discrepancy is also observed when comparing the expected and the observed number of earthquakes that occurred on the 30th of October. The calculations are performed at the midnight of the last day before the mainshock occurrence and refer to the earthquake occurrence of the entire day. Yet, the model is quickly adjusted from the second day and the number of observed toward the expected events are well coincided for earthquakes with M ≥ 3.5 (Fig. 9b) and those with M ≥ 4.5 (Table 3). Particularly on the 31st of October they are remarkably similar (32.47 expected events Vs 33 observed events with M ≥ 3.5 and 1.89 expected events Vs 1 observed events with M ≥ 4.5).

Fig. 9
figure 9

Expected versus observed number of events with M ≥ 3.5 under the clustering model for a the 2020 Mw = 7.0 Samos sequence, b the 2017 Mw = 6.6 Kos sequence, c the 2017 Mw = 6.4 Lesvos sequence

Table 3 Expected and observed number of events with M ≥ 3.5 and M ≥ 4.5 per day under the clustering model for Samos sequence

The location of the observed earthquakes and their relationship with the expected ones is investigated through occurrence rate density time-dependent maps which illustrate the spatial pattern of the expected events. The computations are daily and are performed in cells of 0.1° × 0.1°. Again, good agreement between the forecast and the observations is established soon after the mainshock occurrence (Fig. 10).

Fig. 10
figure 10

Expected number of events with M ≥ 3.5 related to the Samos sequence per day per cell of 0.1° × 0.1° from 31 October until 03 November 2020. The starting time for the daily computations is at midnight. The epicenters of the occurring shocks are represented with white circles. The red star represents the epicenter of the Mw = 7.0 Samos earthquake that occurred on 30 October 2020

The evolution of the 2017 Kos seismic sequence, located at the southern border of the study area, is also tested retrospectively. The verification period covers the 31 days of July 2017, with the main shock occurred on the 21st of July. Twelve (12) events with M ≥ Mc occurred in the testing period before the Mw = 6.6 Kos main shock. For this reason, the daily occurrence probabilities of M ≥ 3.5 events are slightly larger than in the previous case, ranging from 0.05 to 0.16. The occurrence probabilities for an event with M ≥ 4.5 range between 0.003 and 0.01. After the 21st of July, like in the case of Samos sequence, the daily occurrence probability of an earthquake with M ≥ 3.5 is almost equal to 1. For events with M ≥ 3.5 the occurrence probabilities drop relatively fast and reach the value of 0.1 (Fig. 8b).

On the 20th of July, the occurrence day of the mainshock, 18 events with M ≥ 3.5 occurred. The aftershock sequence is not as rich as in the previous examined case, but this is also happening because the mainshock occurred approximately 2 h before the beginning of the computations of the day after. In the day after the occurrence of the mainshock the difference between the observed and the expected number of events is marginal (45.28 expected toward 44 observed events) (Fig. 9b, Table 4). The spatial pattern of the expected earthquakes rates agrees well with the aftershocks locations (Fig. 11).

Table 4 Expected and observed number of events with M ≥ 3.5 and M ≥ 4.5 per day under the clustering model for Kos sequence
Fig. 11
figure 11

Expected number of events with M ≥ 3.5 related to the Kos sequence per day per cell of 0.1° × 0.1° 21 July until 24 July 2017. The starting time for the daily computations is at midnight. The epicenters of the occurring shocks are represented with white circles. The red star represents the epicenter of the Mw = 6.6 Kos earthquake that occurred on 20 July 2017

The 2017 Mw = 6.4 Lesvos sequence is located in the northern part of the study area. The testing period lasts from the 1st of June until the 30th of June 2017. Before the mainshock occurrence only 2 events with M ≥ Mc are included in the verification period. This is the reason that at midnight, approximately 12 h prior to the Mw = 6.4 event the occurrence probability for an M ≥ 6.0 earthquake is relatively low, 4.8 × 10–5. The daily occurrence probabilities of events with M ≥ 3.5 and M ≥ 4.5, are shown in Fig. 8c with magenta and orange lines, respectively. A Mw = 5.3 event on the 17th of June and its own aftershocks is responsible for the increase that is observed in the daily probability calculations on the 18th of June. This fact is also reflected when comparing the expected and observed number of events (Fig. 9c). At first, the expected number of events is larger than the observed one, but the ongoing seismic activity is responsible for the increase in the expected rates. The expected number of events per day per cell of 0.1° × 0.1° is in good agreement with the observations (Fig. 12, Table 5).

Table 5 Expected and observed number of events with M ≥ 3.5 and M ≥ 4.5 per day under the clustering model for Lesvos sequence
Fig. 12
figure 12

Expected number of events with M ≥ 3.5 related to the Lesvos sequence per day per cell of 0.1 × 0.1from 13 to 16 June 2017. The starting time for the daily computations is at midnight. The epicenters of the occurring shocks are represented with white circles. The red star represents the epicenter of the Mw = 6.4 Lesvos earthquake that occurred on 12 June 2017

Discussion and conclusions

The short-term clustering features of the Eastern Aegean Sea (Greece) seismicity are studied by applying an epidemic type stochastic model, namely the ETES model. The application is performed using an earthquake catalog covering the period 2008–2016. The learning phase (2008–2016), is characterized by moderate seismicity rates of earthquakes with small to moderate magnitudes up to Mw = 5.1. The estimation of the ETES model parameters reveals a property of moderate magnitude events to produce their own triggered events.

The retrospective testing of the ETES model performance is motivated by the recently occurred 2020 Mw = 7.0 Samos earthquake sequence. The remarkably close temporal proximity of two sequences in the northern and southern borders of the study area, near Lesvos and Kos islands, during 2017 has provided the opportunity to compare the fitting and investigate the adjustment of the model in the spatiotemporal behavior of the three sequences. The performed tests also focused on the reliability of the forecasts regarding the mainshocks.

For all examined cases, the absence of foreshocks has been the reason for the low predictability performance for the mainshocks occurrence, with the occurrence probabilities to be around 10–5. The model though soon adjusts, from the second day of the sequences and computations, with notable resemblance between the number of observed earthquakes and those obtained from the retrospective forecast of the model M ≥ 3.5. Among the three sequences, the one located near Lesvos Island, presents the largest discrepancy in terms of the number of events. This may be due to the relatively high value of the magnitude threshold considered, according to which only events with M ≥ 3.1 are capable to produce subsequent aftershocks while events with lower magnitude are not considered at all.

The results and the frequent occurrence of strong (Mw ≥ 6.0) earthquakes in the study area highlight the need for the application of the model in real time. Daily forecasts would potentially allow us to investigate the variations of seismicity that will lead to the detection of increased potential foreshock activity a few hours before the occurrence of a strong event. The evolution of the aftershock activity could also be successfully tracked and become a powerful tool for decision makers.