1 Introduction

One of the most significant aspects of seismicity is its spatiotemporally varying and mostly implicit potential for creating instantaneous structural damage. Among several strong ground-motion amplitudes, the horizontal component of peak ground acceleration (PGA) has widely been used for both structural vulnerability assessment and design of earthquake-resistant structures. Therefore, a significant fraction of seismic hazard assessments is carried out with a focus on PGA. The aim of seismic hazard analysis, done by purpose-oriented statistical methodologies mostly based on the contents of known parameters, is to determine the annual frequency of exceedance or rather the annual frequency of non-exceedance belonging to specific amplitude, having specific intensity, considering a given future time interval. Contrary to frequency-based statistics, the Bayesian perspective aims to define the probability of an occurrence based on a priori information on the pertinent states relating to the primary phenomena. Besides, the Bayesian probability theory supplies several flashing opportunities and versatile tools for estimating characteristic seismicity parameters that can easily be used to determine seismic hazard potential. The Bayesian perspective is rather based on a posterior belief. In this sense, one of the genuine advantages of Bayesian approaches is that it provides users with an excellent opportunity to take into account the uncertainty of each parameter used in probabilistic relationships as well as a priori knowledge (Campbell 1982; Campbell 1983; Galanis et al. 2002; Kelly and Smith 2011; Mortgat and Shah 1979).

In the present paper, we implemented a Bayesian methodology, which was first postulated by Pisarenko et al. (1996) and then improved and generalized by Pisarenko and Lyubushin (1997) and Pisarenko and Lyubushin (1999), in order to illuminate the PGA-driven seismic hazard of Turkey. The focus of the paper is on the delineation of one of the most essential seismic hazard parameters, namely the expected maximum values of PGA and quantiles of the pertinent probabilistic distributions taken into account for the given future intervals for the whole mainland of Turkey.

In a very short history, the first attempt to use the method of Pisarenko et al. (1996) in zone-based hazard analysis, or in other words grid-based computations, which is similar but far beyond the single-site-based analysis, was given by Lyubushin and Parvez (2010). Later on, another example was recently done by Salahshoor et al. (2018) for Iran. To the best of our knowledge, on the one hand, no further implementation of the same methodology has as yet been done for any other place. On the other hand, using Pisarenko et al. (1996), several site-based analyses were conducted to estimate seismic hazard parameters in multifarious regions of the world to date. These implementations conducted to date can be classified into three different categories, to wit (i) the maximum possible magnitude estimations (M) and its related statistical parameters (β, λ) (Bayrak and Türker 2016; Bayrak and Türker 2017; Mohammadi et al. 2016; Pisarenko and Lyubushin 1999; Pisarenko et al. 1996; Ruzhich et al. 1998; Tsapanos 2003; Tsapanos and Christova 2003; Tsapanos et al. 2001; Yadav et al. 2013a), (ii) peak ground acceleration (PGA or Amax) (Lyubushin and Parvez 2010; Lyubushin et al. 2002; Pisarenko and Lyubushin 1997), and (iii) tsunami intensity (I) (Yadav et al. 2013b).

The present paper focuses on delineating the maximum horizontal peak ground acceleration (PGA) estimates for Turkey using an updated earthquake catalog and the Bayesian methodology aforementioned. Of several outputs of this study, the new national PGA hazard map got prepared. Besides, various return periods were used in several site-based analyses. After having performed consecutive analyses, through available information and GIS tools, we compared some outputs of this study with the last two official seismic hazard maps (upon 475-year return period) of Turkey. Of two maps, the predecessor hazard map is known to be one of the several results of the study made by Gülkan et al. (1993) (hereafter referred to as G93). G93 was created in the form of a zonation map, which has long been known as a long-sought-after mapping concept in earthquake engineering practices. Having said that, the successor hazard map (AFAD 2018) is a core part of the results of the comprehensive study officially entitled “the revised national probabilistic seismic hazard maps project” (hereafter T-SHM project). Some components of the T-SHM project (Akkar et al. 2018b; Demircioğlu et al. 2018; Duman et al. 2018; Emre et al. 2018; Eroglu Azak et al. 2017; Kadirioğlu et al. 2016; Şeşetyan et al. 2018) are now available in the literature. In addition to map-based comparisons, we further compared the new hazard estimates belonging to ten earthquake-prone cities with those figures taken from the last two national seismic hazard maps of Turkey.

2 Brief seismotectonic setup of Turkey

Turkey has actively been undergoing an explicit deformation as a result of the characteristically distinguishable continental collision between the plates of Africa, Arabia, and Eurasia, which is even originated from the Mesozoic split-up of Gondwana. Because of that, Turkey is thenceforth known to be one of the extraordinarily active regions of the world in terms of seismic activity (Harrison 2008; Jackson 1994; Şengör et al. 1984). Due to particular deployment in the Alpine Himalayan orogenic belt, a great variety of evolutionary geologic structures and destructive seismic events originate from the active neotectonics of Turkey. Highly seismically active Anatolian plate has been moving obliquely since the Miocene. The movement is suggested to be related both to subduction processes that take place in the Aegean–Cyprus Arc (AA-CA) and to the explicit collision between Arabia and Eurasia (Royden 1993; Şengör et al. 1984). The convergence between the Eurasian and Arabian plates creates immense coercion into the plate boundaries, which specifically makes each part of the country an emblematic area of the modern global seismology.

The main neotectonic structures of Turkey, which are of great potential to produce consecutive destructive earthquakes, are briefly categorized into significant major zones, namely North Anatolian Fault Zone (NAFZ), East Anatolian Fault Zone (EAFZ), Dead Sea Fault Zone (DSFZ), and Aegean–Cyprus Arc (Bozkurt 2001). These fault zones are thought to have inherited from the structures that were active even before Miocene time (Yılmaz et al. 1997). Tectonic evidence-based judgments of McKenzie (1972) denote and broadly justify that the Anatolian plate is in a relative westward movement to the Eurasian plate. Since the Eurasian plate is pretending to be stable, the ongoing convergence of Arabian and African plates, which is in northward movement, creates the most seismically active transform zones of the Earth called North Anatolian Fault Zone (NAFZ). One of the main characteristics of North Anatolian Fault (NAF) is that it delineates the transform plate boundary between the Anatolian and Eurasian plates, which extends for a length of 1200 km (Şengör et al. 2005). It is mostly interpreted by making some sort of affinity-based simulations with the San Andreas Fault of California (Allen 1982). In point of fact, each of them is in dextral strike-slip motion generating large-scale crustal deformations, and thus large earthquakes (Hussain et al. 2018; McClusky et al. 2000; Reilinger et al. 2006; Reilinger et al. 1997; Şengör et al. 2005). In addition, NAFZ is of multiple synthetic and antithetic faults just like the Northeastern Anatolian Fault Zone (NEAFZ) (Badgley 1965; Vialon et al. 1976; Wilcox et al. 1973). NEAFZ, which is about 800 km in length, is a northeasterly trending dextral strike-slip fault. It has a north-dipping thrust fault, which is about 250-km length (Koçyiğit et al. 2001; Westaway 1994). Besides, the connection between NEAFZ and NAF gets integrated by 400-km-long northeast-trending and south-dipping Black Sea Fault (BSF), which has a ramp-flat structure depicted in Fig. 1 (Softa et al. 2018; Softa et al. 2019). As is well known from past events, most of the major large earthquakes occurred at the nodes of these active structures and they should again be expected to happen thereabout (Tuyen and Lu 2012).

Fig. 1
figure 1

Brief amalgamated geologic and tectonic map of the Anatolia–Caucasus region, after Sosson et al. (2016) and Hässig et al. (2016) with modifications. NAFZ, North Anatolian Fault Zone; CAFZ, Central Anatolian Fault Zone; EAFZ, East Anatolian Fault Zone; NEAFZ, Northeast Anatolian Fault Zone; BSF, Black Sea Fault; DSFZ, Dead Sea Fault Zone; PT, Plino Trench; ST, Strabo Trench; AA-CP, Aegean–Cyprus Arc; GC, Greater Caucasus; LC, Lesser Caucasus; T-C, Trans Caucasus; KM, Kirsehir Massif; MM, Menderes Massif; SM, Sakarya Massif; CACC, Central Anatolian Crystalline Complex; EAP, East Anatolian Block; R-K, Rioni Kura Basin; TA, Tavsanli and Afyon Zones. 1, European margin including magmatic arc; Pontides/Somkheto-Karabakh; 2, Ophiolites; 3, Sakarya accreted terrane; 4, Peri-Arabic units (Lycian nappes); 5, Taurides–Anatolides, East Anatolian accreted terrane. Solid yellow arrows with numbers represent plate movement (mm a−1) with respect to a stable Eurasia (Reilinger et al. 2006)

Principally, the seismic activity of the Eastern Anatolian region originates from the amalgamated convergence of the Eurasian, African, and Arabian plates that exposes the region to explicit northward motion (Dewey et al. 1986; McKenzie 1976). The zone of Aegean–Cyprian Arc, which is of a great potential to produce rather deep earthquakes, shows specific tectonic characteristics that stemmed from plunging motion into the oceanic lithosphere beneath the Anatolian plate due to the northward motion of the African plate (Jackson 1994; Pichon and Kreemer 2010; Reilinger et al. 2006) (for in-depth review and discussions, we recommend readers to examine the following resources: Barka 1992; Delph et al. 2015; Duman and Emre 2013; Saroglu et al. 1992; Şengör et al. 1985; Şengör and Yilmaz 1981; Tatar et al. 2013; Turkelli et al. 2003). An illustrative map explaining the brief amalgamated geologic and tectonic map of the Anatolia–Caucasus region is set out in Fig. 1 and the map of the brief seismotectonic setup of Turkey with the earthquake occurrences experienced in the last 118-year is given in Fig. 2.

Fig. 2
figure 2

a Illustrative map of the brief seismotectonic setup with earthquakes occurred between 1900 and 2018 in and around Turkey used in this study. b Site location map of the surveyed ten earthquake-prone cities of Turkey in this study

3 Data and resources used

The first and the foremost input of the Bayesian method (Pisarenko et al. 1996) is the spatial data of seismicity for a given enclosed region; namely, it is the earthquake catalog of Turkey. For Turkey and adjacent areas, aside from the old ones, recently compiled different instrumental earthquake catalogs (Akkar et al. 2010; Duman et al. 2016; Kadirioğlu et al. 2016; Kalafat et al. 2011; Leptokaropoulos et al. 2013) were published for several purposes by various researchers to date. In addition, several studies partially or totally covering Turkey (Ambraseys and Jackson 2000; Guidoboni et al. 1994; Shebalin et al. 1998; Soysal et al. 1981; Stucchi et al. 2013; Tan et al. 2008) were conducted to compile and merge the historical earthquakes that happened before the instrumental era of seismology.

Of all the available catalogs, Kadirioğlu et al. (2016) recently published one of the sound examples of the unified earthquake (1900 to 2012) catalogs homogenized into a moment magnitude scale (Mw) ever created for Turkey. This catalog is not only the core catalog of the present study, which was particularly updated for our hazard assessments, but also it is the same catalog with that used in the T-SHM project. Initially, we updated the core catalog at the outset. To this end, two online and officially published earthquake databases (#1 KOERI, http://udim.koeri.boun.edu.tr/zeqdb/, last accessed on 07 March 2018; #2 TR-NSMN, http://kyhdata.deprem.gov.tr/2K/kyhdata_v4.php, last accessed on 07 March 2018) were used to collect further events to make the core catalog update. The further events were selected from those events that occurred between 25 December 2012 and 07 March 2018. Before merging two datasets (core and further) into a single catalog, it was necessary to bring further data to the same scale as the core one. To homogenize the further events, moment magnitude-oriented conversion relations provided by Kadirioğlu and Kartal (2016) were then used when necessary as fully in line with the methodology adopted in Kadirioğlu et al. (2016).

In the meantime, no historical event that occurred before the year 1900 was added to the catalog. The primary reason for this preference is that the method used in this paper requires a continuous catalog so that it can statistically be handled. Namely, the catalog must not be of incomplete data; otherwise, the results of the Bayesian perspective might shift from what exactly could be. The specific example of this fact was showed in a comparative hazard study recently conducted by Salahshoor et al. (2018) for Iran. The spatial distribution of the fully merged catalog (hereafter catalog) over Turkey is shown in Fig. 2. The catalog comprises 13,564 events in total being dependents and independents events altogether.

Besides, to estimate PGA values upon a given future period, all data were retested for the responsiveness level of the method postulated by Pisarenko et al. (1996). As with the seismic hazard analysis pathway, the adopted methodology requires mounting the proper ground-motion prediction equations (GMPEs) to the logic tree to be used as a second input to the method, and then all of which are submitted to the compound hazard integral. It is crucial to any seismic hazard analysis for which GMPEs are to be used. No matter which GMPEs are used, the raising epistemic uncertainty is inevitable. Epistemic uncertainty is a disruptor factor that increases the potential bias in seismic hazard analyses (Bommer 2012; Scherbaum et al. 2005; Scherbaum and Kuehn 2011).

As is stated by Kale and Akkar (2013), of several goodness-of-fit measures, the most prevalent way of testing and ranking the GMPEs depending on their performance is the residual-driven methods. In that respect, several ranking and selection techniques have so far been developed and used. Even though it was particularly designed for different scientific disciplines, the concept of the Nash–Sutcliffe efficiency (NSE) coefficient (Nash and Sutcliffe 1970) and its modified way of implementation (McCuen et al. 2006) are used as a tool of goodness-of-fit measure for analyzing the GMPEs. Scherbaum et al. (2004) developed a new goodness-of-fit measure based on the concept of likelihood, LH, and Scherbaum et al. (2009) designed a log-likelihood differences technique, LLH. With an effective and progressive perspective, Yaghmaei-Sabegh (2012) put forward to an alternative and rather advanced technique for ranking and weighting of GMPEs based on artificial neural networks (ANN). Kale and Akkar (2013) developed a new approach called the Euclidean distance-based ranking (EDR) method. In this study, regarding specifying and weighting GMPEs to be used in the logic tree, we adopted the results of Akkar et al. (2018b) used in the computations of the recent earthquake hazard map of Turkey (AFAD 2018). The main reason for this is that the earthquake catalog used in the present study is diametrically the same as the one used in the T-SHM project, except for the addition of earthquake data concerning the preceding few years to update it. In the T-SHM project, Akkar et al. (2018b) first selected a bunch of relations from several national and global GMPEs as candidate models and all models were then assessed by data-driven testing techniques not only for delineating which models are the best result-giving ones according to earthquake database but also for what the sound weight must be assigned to each model. After having tested several GMPEs, in consequence, they decided to mount the selected successful ground-motion models to the complete hazard model. As is mentioned above, the body of the seismic source model of the T-SHM project is attributed to three kinds of sources; howbeit, the ground-motion models were divided into two categories. On the one hand, the first group is for the representations of the shallow crustal seismicity modeled with the GMPEs of Akkar et al. (2014) (hereafter referred to as AK14), Akkar and Çağnan (2010) (hereafter AC10), and Chiou and Youngs (2008) (hereafter CY08) by assigning a weight of 1/3 to each relation. On the other hand, the second group that epitomizes the subduction and interface zones is modeled by GMPE models of Megawati and Pan (2010) for interface regions and through the model of García et al. (2005) for in-slab regions with an individually unique weight of 1.0. Because the focus of the paper is not only on the generation of national seismic hazard map of Turkey but also on ensuring affinity between the past two studies to the extent possible, the GMPEs used in the T-SHM project for the shallow crustal seismicity of Turkey were adopted verbatim in our modeling phase.

Of several distance metrics, epicentral distance and Joyner–Boore distance (Joyner and Boore 1981) (hereafter JB distance) genuinely dominate the realm of the GMPEs. When using different metrics at the same time, a distance adjustment, which inevitably creates a somewhat escalation in the aleatory uncertainty of the hazard model, is crucially required to stabilize the distance metrics. To that end, the Electric Power Research Institute (EPRI 2004) suggested a series of distance conversion pathways depending on several modeling assumptions with respect to the epicenter location of an earthquake relative to the fault rupture. The modeling is done through either random epicenters approach, which is based on the assumption that the epicenter of an earthquake is uniformly distributed along the full length of the rupture or hypothetically centered epicenters, which is based on the assumption that the epicenter of an earthquake is expressly centered on the earthquake rupture. In the present study, now that the adopted ground-motion model of the logic tree is of three components and a great majority of which use the JB distance, we used the JB distance of each earthquake. This expedient is useful both for the homogenization of the distances and for tamping down the total amount of aleatory uncertainty to the highest possible extent. To this end, we adopted the random epicenter model of EPRI (2004) for the distance adjustments on the grounds that it is thought to be more consistent with the perplexing and amalgamated seismotectonic setup of Turkey as briefly explained above.

Besides, as is well known, local site conditions of the ground-motion recording sites profoundly affect the measured PGA values. Each GMPE that we used in this study includes a VS30 term, and it inevitably makes the result of PGA calculations (the spectral period is zero) site-dependent. In this perspective, the PGA values got computed by taking into account the reference-rock site condition for VS30 = 760 m/s seeing that this value was applicable to all GMPEs used in the present study, and it denotes B/C boundary value in keeping with the classification of National Earthquake Hazards Program of the USA (NEHRP) (BSSC 2001). Furthermore, the results obtained through the fixed VS30 approach provided a great convenience in that it made our results comparable with the outcomes of the successor and predecessor hazard maps of Turkey.

Upon the completion of the T-SHM project, the hazard map and its index list were given in an annex to a decree in a form of read-only PDF file, https://deprem.afad.gov.tr/images/depbolge/deprem-tehlike-haritasi.pdf, last accessed on 01 May 2020. We used the official seismic hazard map of Turkey (AFAD 2018) in our comparative calculations whereupon we created a digital map upon retyping the pertinent parts of this rather long document. Furthermore, using coordinate-based data picking, the results of the present study got compared with both those of G93 and the T-SHM project regarding ten earthquake-prone cities surveyed. The data and materials used in this research are available from the corresponding author upon reasonable request.

4 Method and limitations

In this part of the paper, for the sake of brevity, the full details of the Bayesian procedure used in this study are not fully reported. Instead, only the basics of the theory and its pertinent formulations are solely given to illustrate what the methodology aims to represent. For an in-depth description of the theory, the following papers that refer to the original sources are strongly recommended to readers. Besides, to the best of our knowledge, only two examples (Lyubushin and Parvez 2010; Salahshoor et al. 2018) used the way of Bayesian estimation philosophy implemented here exist in the literature, and it is worth noting that these early examples will benefit the reader as well. With this in mind, the summary of the Bayesian theory is given hereunder.

We call “R” as an acronym that denotes a value referring to either estimated or measured quantity observed in an apparent sequence happened in a particular period commonly named as the past time interval (− τ, 0):

$$ {\overset{\rightharpoonup }{R}}^{(n)}=\left({R}_1,\dots, {R}_n\right),{R}_i\ge {R}_0,{R}_{\tau },{R}_{\tau }=\genfrac{}{}{0pt}{}{\max }{1\le i\le \mathrm{n}}\left({R}_1,\dots, {R}_n\right) $$
(1)

This sequence (1) is nothing but a time series and each member of which could hypothetically be found in a random space posed by a kind of coincidence-based physical nature. In other words, no discretionary event is thusly considered.

The sequence of \( {\overset{\rightharpoonup }{R}}^{(n)} \) is considered to be either a bin of the magnitudes of seismic events that occurred in an enclosed region or the logarithm (to the base 10) of the peak ground accelerations (Amax or hereafter referred to as PGA) of those seismic events measured within the scope of the specified area.

R0 is a specific quantity called minimal cutoff value, which is either determined by physical possibilities of strong ground-motion recording network or else it is picked out as an ordinary but statistically representative value from those events of the sequence (1). As a fundamental and constructive assumption for the whole theory, the sequential values of (1) follow the Gutenberg–Richter (G-R) power law for magnitude-recurrence (Gutenberg and Richter 1944). Now then, we can write the parametric equality below.

$$ \Pr \left\{R<x\right\}=F\left(x|{R}_0,\rho, \beta \right)=\frac{e^{-\beta .{R}_0}-{e}^{-\beta .x}}{e^{-\beta .{R}_0}-{e}^{-\beta .\rho }},{R}_0\le x\le \rho $$
(2)

The upper limit of (2), ρ, denotes the maximal possible value of R that has an unknown parametric quantity. In similar ways, another parameter is β, which denotes the slope of the G-R law computed from a doubly logarithmic plot.

The second assumption of the theory, which is a rather requisite thought in explaining the sequence of the earthquake occurrence both in time and space, is about the sequence (1) that it expresses a Poissonian statistical process driven by λ. In the meanwhile, λ is a distribution-oriented well-known intensity parameter and then it comes out as another unknown parameter to the theory.

If we gather the unknown parameters up here, the full vector of the unknown parameters can be written as follows:

$$ \theta =\left(\rho, \beta, \lambda \right) $$
(3)

For brevity, all explanatory expressions can be summed up regarding the given sequence (1), and then the argument R0 is omitted and we can express them in nature of ∙(∙ | θ), namely, for (2), the expression turns to F(x| θ). Considering the law (2), the probabilistic density function of the distribution can then be written as

$$ f\left(x|\theta \right)={F}^{\prime}\left(x|\theta \right)=\frac{\beta .{e}^{-\beta .x}}{e^{-\beta .{R}_0}-{e}^{-\beta .\rho }} $$
(4)

Let ε be an error term for the known values of sequence (1). That is to say, the magnitudes of the earthquakes in the sequence (1) do not include the true values; in fact, they are nothing but the apparent values of R. Hence, they, apparent values, can be represented as follows:

$$ \overset{\sim }{R}=R+\varepsilon $$
(5)

Now, then, we can introduce the n(x| δ) explaining the error’s (ε) probabilistic distribution density and δ is a set scale parameter regarding the density. At this juncture, we can thence start using the below distribution density in a uniform nature,

$$ n\left(x|\delta \right)=\left\{\begin{array}{c}1/2\delta, \mid x\mid \le \delta \\ {}0,\mid x\mid >\delta \end{array}\right. $$
(6)

We can also assign ∏ as a priori uncertainty to each parameter of θ, considering that a priori density of θ is of uniform nature:

$$ \prod =\Big\{{\lambda}_{\mathrm{min}}\le \lambda \le {\lambda}_{\mathrm{max}},{\beta}_{\mathrm{min}}\le \beta \le {\beta}_{\mathrm{max}},{\rho}_{\mathrm{min}}\le \rho \le {\rho}_{\mathrm{max}} $$
(7)

The future time interval [0, T] is a temporal array that becomes our working space to be used to delineate ρ and its quantiles.

To the definition of conditional probability, a posterior density of distribution of θ can be represented below by the Bayes formula (Rao 1965);

$$ f\left(\theta |{\overrightarrow{R}}^{(n)},\delta \right)=\frac{f\left(\theta, {\overrightarrow{R}}^{(n)}|\delta \right)}{f\left({\overrightarrow{R}}^{(n)}|\delta \right)} $$
(8)

Formula (8) is the principal estimation tool for a posterior parametric density of θ.

The Bayesian estimate of vector θ is then given below:

$$ \hat{\theta}\left({\overrightarrow{R}}^{(n)}|\delta \right)=\underset{\Pi}{\int}\vartheta .f\left(\vartheta |{\overrightarrow{R}}^{(n)},\delta \right) d\vartheta . $$
(9)

In this implementation of the theory, ρmin = Rτ − δρmax is set up by the user subjecting to the nature of the series (1). Maximal and minimal boundary values can be expressed in below forms:

$$ {\beta}_{\mathrm{min}}={\beta}_0.\left(1-\gamma \right),{\beta}_{\mathrm{max}}={\beta}_0.\left(1+\gamma \right),0<\gamma \le 1 $$
(10)

In formula (10), β0 is the median value computed from the G-R law through the maximum likelihood estimation technique:

$$ {\sum}_{i=1}^n\ln \left\{\frac{\beta .{e}^{-\beta .{R}_{\tau }}}{e^{-\beta .{R}_0}-{e}^{-\beta .{R}_{\tau }}}\right\}\to \underset{\beta, \beta \upepsilon \left(0,{\beta}_{\delta}\right)}{\max } $$
(11)

βs is a fairly large value, e.g., 10, γ is a parameter of the method taken to be 0.5.

As with the Poissonian process particularly for significantly large n (Cox and Lewis 1966), the variance of λτ tends to have approximate value; \( \sqrt{n}\approx \sqrt{\lambda \tau} \).

If we draw the boundaries for our estimations (upon ±3σ), we can procure the below expression for the intensities denoted in (15):

$$ {\lambda}_{\mathrm{min}}={\lambda}_0.\left(1-\frac{3}{\sqrt{\lambda_0\tau }}\right),{\lambda}_{\mathrm{max}}={\lambda}_0.\left(1+\frac{3}{\sqrt{\lambda_0\tau }}\right) $$
(12)

where \( {\lambda}_0=\frac{{\overline{\lambda}}_0}{c_f\left({\beta}_0,\delta \right)},{\overline{\lambda}}_0=\frac{n}{\tau } \).

It is particularly presented to the reader’s attention that the fundamental philosophy of Pisarenko et al. (1996) conceptually uses a commonly known fundamental approach postulated by Cornell (1968) and McGuire (2004) to seismic hazard analysis. The significant difference between the Bayesian theory used in the current study and the Cornell–McGuire version is attributed to both the arrangement of earthquake sources and the computation of the source parameters. Namely, in the Bayesian approach of Pisarenko et al. (1996), the corresponding parameters to the theory (ρ, β, λ) are thought to have a succession denoting logarithm of acceleration values from adjoining events.

Using the GMPEs, β and λ values, which are of similar nature with the other seismicity parameters to the theory, are calculated through the acceleration values. More precisely, magnitude values are replaced with the peak ground acceleration (to the logarithm scale) values calculated by replacing the magnitude and distance in GMPEs. Thus, this method indeed offers a significant convenience to the frequently rising question of whether or not the seismogenic zonation, in other words, any zone-dependent assumption is needed. That is to say, neither local nor regional seismic sources are used to obtain PGA estimates.

Right then, according to the Bayesian methodology briefly outlined above, we estimated the parameter “ρ” that epitomizes the maximal anticipated value of PGA upon a probability (α = 0.90) for a predefined future time interval (T = 475 years) in a spatial area covering both Turkey and its proximity gridded into 200 × 200 nodes. The corner limits of the spatial boundaries of the catalog are 32 °  ≤ Lat ≤ 45 ° ; 23 °  ≤ Lon ≤ 48°. However, the boundaries of the study area covering mainland Turkey are 35.81 °  ≤ Lat ≤ 42.10 ° ; 25.67 °  ≤ Lon ≤ 44.82°. In that sense, the spatial boundaries of the Turkish hazard map are considered to be of a clipped version of the outer limits of the catalog. As is inferred from the limits above, the spatial coverage of the catalog, which particularly elongates to the off places from the boundaries of Turkey, gives us a useful advantage over the particular use of in-field data only for Turkey. Seeing that possible sharp interruption of the spatial and temporal data in the proximity of any study area possibly causes significant errors in geostatistical correlations and, in the strict sense, it could yield erroneous numerical extrapolations. As for computation practices, the path that we followed for the estimations can be summarized as follows. For each grid, the PGA sequence was computed through the catalog of earthquakes and specified GMPEs (AC10, AK14, and CY08 with weights of 1/3). As is aforementioned, these relationships and weights are the same as those used by the T-SHM project.

For making the Bayesian estimates of PGA possible for a given future time interval according to Lyubushin et al. (2002), the solely major independent events of each node are needed to get included to the Bayesian framework, which is crucial for satisfying the characteristic Poissonian behavior assumption of the sequence. In this vein, the dependent events are needed to be first identified and then removed from the catalog to achieve the fortuitous sequential characteristics of time moments. Surprisingly, very little work has as yet been done on the issue of which declustered method is the most applicable one and rather is of better result-giving performance regarding the seismicity of Turkey, albeit the number of studies is in the ascendant. To the best of our knowledge, in this regard, two studies are now only available in the literature. On the one hand, in a sensitivity-based study, Eroglu Azak et al. (2018) investigated the effects of different mainshock catalogs derived from several seismic declustering methods. On the other hand, through the agency of simulation envelopes and Monte Carlo tests particularly to keep track of the variations in the temporal and spatial point patterns, Nas et al. (2019) made a spatiotemporal comparison of different mainshock catalogs derived from several declustered methods. Apart from many other findings, the results of each study revealed that the Gardner and Knopoff (1974) method was found to be fairly effective in giving a better-refined catalog. In this respect, the full earthquake catalog used in this study got then declustered by the Gardner and Knopoff (1974) method with a purpose-oriented modification to a slight extent. Gardner and Knopoff (1974) put forward to a kind of windowing method that mainly classifies earthquakes according to their spatial and temporal characteristics. Each window, either spatial or temporal, is based on magnitude. Namely, the method progresses based on magnitudes and presents magnitude data. The modification is nothing but an extra query on deciding by which event should be adopted as a mainshock only to be used in the computations of PGA values. More precisely, being the first step of declustering, all of the events were divided into two distinct classes—mainshocks and their fore- or aftershocks—with the help of the Gardner and Knopoff (1974) method. Now that the aim of the modification is first to find and then leave one event only in a reciprocal earthquake sequence, the magnitude of this single event must have characteristics to deliver the highest acceleration values depending on the GMPE used. Hence, only one event was left from each mainshock and aftershock sequence that generates the largest PGA. Thanks to this application, the possibility of auto-eliminating a large aftershock that could produce larger PGA values than its hypothetical mainshock got eliminated. Early examples of this implementation are available in Lyubushin et al. (2002) and Lyubushin and Parvez (2010). On the upshot, the resultant catalog of independent events consisted of 7275 earthquakes. For each node of the grid, only 30 events of “mainshocks” with maximum PGA values were specified for the subsequent analyses. The choice of 30 events has so far been checked a lot of times and to have been judged as quite sufficient enough numbers for estimating statistics of maximum events (Bayrak and Türker 2016; Bayrak and Türker 2017; Mohammadi et al. 2016; Pisarenko and Lyubushin 1997; Pisarenko et al. 1996; Salahshoor et al. 2018; Tsapanos et al. 2001; Yadav et al. 2013b). The method that we used here is principally intended for estimating maximum events, all other events (not maximum) are necessary only for estimating the b value. The convergence of the Bayesian method is presented in the prominent book of Rao (1965). In our example, the volume of the priory domain was even too much and according to the theorem of convergence of the Bayesian method, the influence of a priory method is tending to zero with the increasing number of observations for which case of N = 30 is quite enough. All things considered, we ultimately achieved the quadrat value (n = 30) in relation (1), albeit with different values of R0 and\( {R}_{\tau }=\genfrac{}{}{0pt}{}{\max }{1\le i\le n}{R}_i \), namely the a priori threshold value for ρ was adopted as ρmax = Rτ + 0.5. Furthermore, in this methodology, if there are no 30 events that produce the peak ground acceleration in the grid point that exceeds a given threshold, then this grid point, which scilicet epitomizes a sparse seismicity characteristic, is first omitted from the analysis. But, be that as it may, the value of PGA statistics nevertheless is estimated in this grid point by extrapolating from nearest other grid points using a smoothing method called Gaussian kernel function with radius 1° using 100 nearest neighbors’ values. In a more succinct sense, on the basis thereof, by dividing all study area into 200 × 200 orthogonal regular grids, it is possible to make a map that depends upon stationary quadrat counts as specified and explained above. Upon these node-based calculations, the grid values got converted to the maps with the help of a non-parametric smoothing process depending on the Gaussian kernel functions (Härdle 1990). The World Geodetic System (WGS84) was used as a reference coordinates system for all coordinate-based products produced in the context of the present study.

5 Results and discussion

Using the GMPEs mentioned above weighted equally with the ratio of 1/3, Fig. 3 shows the seismic hazard map of 90% quantile of the distribution of PGA values (in g) for the 475-year return period, which is another expression of 10% probability of exceedance in 50 years according to Poisson distribution. Figure 3 is an unclipped but processed map and it covers both the inland and seas encompassing Turkey, which constitutes the full orthogonal space of the research area. After having clipped the inland of Turkey from the total map of hazard, we created the bare PGA hazard map of the inland of Turkey (Fig. 4) through the methodology of which is the same with that of T-SHM project. On the one hand, the map depicted in Fig. 4a is a smoothed contour map, and it was identically made through the same philosophy that the T-SHM project used regarding the mapping concept. On the other hand, Fig. 4b shows the same hazard map, namely that was created with the same data of Fig. 4a, albeit using a zonation methodology in line with a technique that the 1° radius used. The active fault map of Turkey depicted as a layer in the pertinent hazard maps below was taken from Emre et al. (2013).

Fig. 3
figure 3

Unclipped version of Bayesian Turkish seismic hazard map (B-TSHM) with spatial zonation (upon 90% quantile, VS30 = 760 m/s, 475-year return period, in g) covering both seas and lands

Fig. 4
figure 4

Bayesian Turkish seismic hazard map (B-TSHM) covering the inland areas only (upon 90% quantile, in g). a The map of smoothed contours. b The map of spatial zonation. The estimations are subject to reference-rock site assumption (VS30 = 760 m/s) and are made through a three-pronged logic tree, which comprises AK14, AC10, and CY08, depending on the 475-year return period

In examining Fig. 4, regarding the full extent of Turkey, the maximum PGA values were observed in some specific cities (Erzincan, Düzce, Sakarya, and Kocaeli), all of which are located in the NAFZ. The PGA values estimated for throughout NAFZ were observed greater than 0.2g overall. It is worth noting that the locations for which high PGA values estimated are those places where the most destructive earthquakes occurred in Turkey to date, for instance, 27 December 1939 Great Erzincan Earthquake (M7.6, Ms7.9), 17 August 1999 Kocaeli (M7.6), and 12 November 1999 Düzce (M7.1) earthquakes. The estimated PGA values for the eastern Anatolia zone varied from 0.1 to 0.2g. In particular, the estimated PGA values along with the EAFZ were found to generally be less than 0.4g and the largest values in this region were observed in and around the Karliova region that comprises a non-subductable intercontinental junction commonly called “Karliova triple junction.” More so, it is a spectacularly engrossing geologic formation, which gets the Anatolian, the Eurasian, and the Arabian plates to come together (Aktug et al. 2013; Barka 1992; Şengör 2014). In addition, a significant hot spot was observed on the east coast of Lake Van where is of a recently delineated rupture process (Altiner et al. 2013; Elliott et al. 2013; Konca 2015; Utkucu 2013) due to the 23 October 2011 Van earthquake (M7.1). The PGA values calculated for the western Anatolia region generally were found to be between 0.3 and 0.4g. Moreover, the values in the border region among the cities of Burdur, Isparta, and Denizli were apparently observed to be larger than 0.4g. This hot spot location was found to be consistent with the particular location where the 03 October 1914 Burdur earthquake (M6.9) happened. The seismic hazard potential of the Central Anatolia region was observed indeed low (< 0.2g). Also, when moving away from the middle of the Central Anatolia (< 0.1g) to the places having significantly active seismic zones, which encompass thereto, estimated PGA values increased and finally reached to the largest value (0.586g) to the scale. Of the cities located in the central part of Turkey, in the western tip of the Kirsehir and the southern tip of the Osmaniye, the sharp PGA increase was observed. These locations were found to be consistent with those of the 19 April 1938 Kirsehir earthquake (M6.5) and several other moderate earthquakes happened on the southern tip of the EAFZ. Southernmost border cities of Turkey were found to be the lowest seismic hazard zones, which are similar to those cities most of which are located in the northern cities of Turkey. To the best of our knowledge, the main reason for these low PGA values, as is seen in Fig. 4, is thought to be not only ascribed to the relative tectonic activity of particular geological structures but also the fact that these structures have long been undergoing creep-driven movements. Furthermore, it is worth noting that both the locations and the number of active recording stations belonging to the seismograph networks of Turkey are crucial elements in the sound analysis of strong ground-motion parameters of each earthquake. Upon the increases in these instruments, the sensitivity of the observations and straightforwardness of calculations increase as well. Therefore, the uncertainty created by the increased sensitivity over time affects this kind of seismic hazard studies one way or another. That is to say, this effect will inevitably be observed in future studies. At exactly this point, the issue of why we did not use historical earthquakes (prior to 1900) can clearly be explained. First of all, as is explained above, Salahshoor et al. (2018) signified that the Bayesian method of Pisarenko et al. (1996) gives more healthy and straightforward outputs if the earthquake catalog to be used as a main input to the method is spatially and temporally complete. Instrumental catalogs are of an apparent spatial and temporal continuity, although there are some data-related problems in itself, for instance, the sensitivity of magnitude precision and lack of the number of measurements. However, combining historical earthquake catalogs with instrumental catalogs in order to use in hazard estimates is a rather problematic issue nevertheless, because the magnitudes of these historical earthquakes were totally anticipated through several non-objective historical records and few if any geologic data. It is a well-known fact that a consistent and unbiased earthquake catalog is a must-have factor for the success of the hazard estimates. Namely, due to indispensable conversions of the macroseismic intensities to a kind of instrumental magnitudes, they obviously increase the rate of the uncertainty of the instrumental catalog to be used in hazard estimates (Rong et al. 2011).

In fact, all of the estimated PGA values and hazard maps were made taking into account the reference-rock site condition characteristics, which made our results comparable to the predecessor and successor hazard maps of Turkey. Namely, Akkar et al. (2018a) stated that the PGA hazard maps of the T-SHM project were created in view of the reference-rock site conditions. Also, they expound with the justification that because the G93 solely used the GMPE of Joyner and Boore (1981), which does not include the site-specific term, G93 compulsorily disregarded site effect in the calculations. However, Akkar et al. (2018a) also stated that the result of G93 should be ascribed to the statement of G93 that denotes the use of apparent stiff soil characteristics in computations. Consequently, they signify that the maps generated by G93 should be interpreted to have been prepared with respect to the boundary of stiff soil and rock site condition. This way of thinking is attributed to the B/C boundary of NEHRP (BSSC 2001) classification of soil sites, which is consistent with the adoption of reference rock site condition both in the present study and in the T-SHM project. In a nutshell, this joint feature helps us compare them with each other.

The contour map of PGA ratios [PGA(this study)Long, Lat/PGA(G93 and T-SHM)Long, Lat] shown in Fig. 5 denotes that the Bayesian seismic hazard map of the present study makes larger PGA estimates, which have been up to 20–45% with regard to G93, in several particular places (Erzincan, Kocaeli, Sakarya) where the major earthquakes of Turkey occurred to date and further where the main active faults are abundant. Besides, these locations represent the highest acceleration values of the B-TSHM (see Figs. 3 and 4b). As moving away from these hot spots, the calculated PGA ratios decreased following the significant contours, which are available at the approximately same distance to these points that epitomize the explicit attenuation. Of the PGA ratios expressed in Fig. 5, in several places, such as Erzurum, Erzincan, Yozgat–Sivas border, Tekirdag, the northern tip of Edirne, and in a spatial band of the Asian side of Istanbul city to Duzce, the computed ratios were found to be 20% larger than those of G93 and they mostly observed in the proximity of the hot spot places aforementioned. Besides, in the Kocaeli–Sakarya band, and in the cities of Bilecik, Gumushane, and Van, B-TSHM gave much higher acceleration values than those of T-SHM. As moving away from the center of Kocaeli–Bilecik cities, in the middle of the triangle formed by Karabuk, Kastamonu, and Bartin cities in parallel to the NAF, in the city center of Kirsehir and Erzincan, on the axis of Gumushane–Bayburt, between eastern Erzurum and western Kars, between Van and Agri city center, B-TSHM gave higher acceleration values than those of T-SHM. When moving away from these areas, a certain attenuation pattern appeared and the PGA values of B-TSHM gradually fell below the values of T-SHM. Meanwhile, an interesting finding is that the PGA values of B-TSHM were rather in line with the acceleration values given by T-SHM and those of G93 too. In Fig. 5, according to both maps, it is observed that the results of this study are compatible in the ± 20% band in the cities of Tekirdag–Canakkale, Kutahya, Burdur, Kastamonu, Tokat, Yozgat, Gumushane, and Erzurum. Right after the pertinent PGA ratios observed in these regions, the ratios decreased by up to 20%. This is a strong indication of the fact that the estimates of the B-TSHM reasonably corresponded to the expected attenuation characteristics of the peak ground acceleration against increasing JB distance. Furthermore, almost a large part of the Aegean region, some part of Eastern Anatolia, particularly in the East Anatolian Fault Zone, the PGA ratios decreased by 50–80% relative to G93. When it comes to T-SHM, B-TSHM gave much closer results to it in these regions, especially in active seismic zones with a large number of fault bundles. In the rest of the country, most of which cover the cities located in the northern and southern border of Turkey and Central Anatolia, where the large earthquakes either not realized or rarely happened to date, the PGA ratios of B-TSHM decreased by 50% or less than that of G93. To put it in another way, these areas might specifically have corresponded to the regions that have not so far produced any large earthquake due to either veiled fault creep or the very sparingly placed or even the absence of recording stations, as in Northeast Turkey (Softa et al. 2019). As can be seen above, the interpretations were mostly made by taking into consideration the places where the significant and densely flocculated earthquakes happened. The main reason for this is that the method used in this study is not based on any kind seismogenic fault; that is, in point of fact, it is rather based on the use of earthquakes themselves. In the strictest sense, because the T-SHM project was conducted using several kinds of areal sources for crustal events and active faults (Akkar et al. 2018a), it is reasonably expected that the certain ratios are thence observed in or around the specified fault areas. However, in the present study, as is ideally and distinctively expected, the self-similar ratios are observed and found to have intensified at the specific locations where the earthquakes occurred with similar PGA values. Succinctly, the reasons for low PGA values expressed in B-TSHM can somewhat be attributed to several factors reviewed in above paragraphs such as the lacking seismometer, aseismic creep, inherited geological structures, and/or multifarious combinations of these factors. In addition, the main reason for the moderate to much higher values of B-TSHM cannot be attributed to a single factor; howbeit, it could be related not only to explicit instrumental seismicity hereby covered but also whether the historical seismicity got evaluated or not. An individual sensitivity study might also be required to illuminate this, which could also be considered as a pioneering example for all other future studies.

Fig. 5
figure 5

Smoothed contour maps of the normalization ratios a for AFAD (2018) and b for G93 (each map evaluated in this plot is based on the 475-year return period and comprises the reference rock (VS30 = 760 m/s) site-oriented PGA estimates)

The query on whether or not the similarity of the empirical tail probability function of the estimated PGA values to the G-R recurrence law exists is one other collateral objective of this paper. From this standpoint, Fig. 6 shows the individual empirical tail functions of distribution of calculated PGA values (in the form of Log(g)) according to three GMPEs of AK14, AC10, and CY08 considering city center coordinates of ten earthquake-prone cities of Turkey. Regarding the city center–based computations, these coordinates were taken as geographical coordinates of the locations where the governor’s office is located. As can be seen in Fig. 6, while GMPEs of AK14 and CY08 have a very similar and mostly overlapped increment pattern, the AC10 generally proceeds with some less but approximately the same slope.

Fig. 6
figure 6

aj Empirical tail probability functions of PGA distributions (for 475-year RP and VS30 = 760 m/s, in g) for ten earthquake-prone city centers concerning AK14, AC10, and CY08 respectively

In the context of this paper, it is, of course, not possible to provide all the maps that could be created depending on several different return periods using hereby adopted Bayesian technique regarding both for the purpose of the study and the brevity of thereto. But, be that as it may, Fig. 7 denotes a panel plot of the individual quantiles of expected PGA values (in g) created for 43, 72, 140, and 475 years of return period for surveyed ten cities of Turkey respectively. In some regions with very high seismicity (e.g., Bingol, Erzincan, Izmit, Tokat), the trends of estimated PGA values were distinctive to the eyes that the PGA values increased rapidly with the increasing return period. Besides, some cities (Izmir, Istanbul, Agri) followed a slight PGA increment against the increased return period. Having said that, the estimated PGA values for the 475-year return period were very close to those values estimated for the rather extended return periods. This pattern can also be attributed to the decreased possibility of making a straightforward projection with the catalog when the PGA estimations are particularly made taking into account a very long return period. Now that the catalog, which was used in the T-SHM project and used in this study with a somewhat improvement, covers the earthquakes that happened in slightly more than a century (1900–2018), this problem has distinctly been available in both studies.

Fig. 7
figure 7

aj The quantiles of the distribution function of estimated PGA values (in g, upon 90% quantile) for ten earthquake-prone cities of Turkey (the curves represent the expected PGA values depending on 43, 72, 140, and 475 years of return period and VS30 = 760 m/s)

Figure 8 shows the maximum expected PGA values (in g, upon 90% quantile) determined in this study for selected ten cities by taking into account the 475-year return period comparing with the values picked from the maps of G93 and T-SHM project regarding the same spatial coordinates. From this point on, the PGA values estimated for Istanbul City by each study were found to be very close to each other. In general, it can, therefore, be said that the T-SHM project overall tends to make higher estimates against those estimated by G93 and this study. Furthermore, for some cities, the results of the present study exceeded the PGA values suggested by the G93, while, on the other hand, for some cities, it was the opposite. Therefore, it is evident that even if the expected values contain somewhat differences, similar estimates got observed in the same region, excluding the city of Izmit.

Fig. 8
figure 8

Comparison of the maximum expected PGA (upon 90% quantile, in g) for ten earthquake-prone cities of Turkey (all of the PGA values are calculated by taking into account the reference-rock site condition (VS30 = 760 m/s) and a return period of 475 years)

6 Conclusions

With causalities and loss of property in the headlines almost every year, the issue of the intrinsically ever-changing seismic hazard potential of countries is taking on increasing importance, especially for the survival of the future of nations. Of several different methodologies developed and implemented to date, the present study is the first example of the Bayesian method of Pisarenko et al. (1996), which directly uses the PGA values to estimate the seismic hazard, for Turkey. In that sense, we aimed at determining PGA values with the context of different return periods and then comparing them with the results of the antecedent and current national seismic hazard projects of Turkey. In essence, comparing with the T-SHM project, which is a large-scale study ever conducted for this purpose in Turkey, the scope and the details of the current study remain relatively limited though. Besides, this study fundamentally tests whether the Bayesian method of Pisarenko et al. (1996) is suitable for Turkish seismicity and what the differences and similarities between the outcomes of the present study and those of early ones are. However, if we pay attention to the spatial distribution characteristics of PGA values over the maps compared here, the boundaries of the spatial distribution of PGA values determined in the present study are found to be quite close to those of G93 and T-SHM project. It is not possible for a certain amount of difference or departure between any different studies not to occur. This fact is clearly seen when the maps of the G93 and the T-SHM get compared.

Another conclusion that can be drawn from the present paper is that the Bayesian method of Pisarenko et al. (1996) is thereby referred to as an alternative and straightforward tool for further seismic hazard analyses for Turkey. One of the several pros of this method is that the Bayesian method does not require the use of areal or line faults. As is known, unavoidable uncertainty in defining the space and time of an earthquake is likely. Therefore, the use of a distribution function is a genuine convenience of the method in the estimation of ground-motion amplitudes. One of the few if any cons of this method is that the method of Pisarenko et al. (1996) requires seismic catalog and GMPEs only. Considering that the method is not zone-dependent but catalog-driven, it is possible to specify in conclusion that the earthquakes themselves are a reasonably sufficient and versatile tool to elucidate the potential seismic hazard peculiar to a given region. Thus, much more precise detection of earthquake locations, magnitudes, and other related parameters can ultimately help us to make a less biased and more accurate evaluation of potential seismic hazard for the zones where significant and intensified seismicity is available. To provide reliable data for all kinds of seismic hazard studies, it is highly recommended that such studies should be supported and their findings should particularly be compared with those of paleoseismology studies to the utmost extent possible. In this sense, Paleoseismological and seismic studies should be carried out concurrently in terms of the quality and integrity of the information to be obtained, and thereby, total and inevitable bias on the hazard estimates can be reduced to the lowest possible levels.

It is evident and should particularly be emphasized here that the recent earthquake hazard map of Turkey (AFAD 2018), which is one of the most prominent figures of T-SHM project, will anywise not be superseded by the outputs of the present study, nor does our study have a motivation like this. However, the compendium of all the remarks of the current study clearly transpires that the method of Pisarenko et al. (1996) is genuinely applicable and feasible in estimating the potential seismic hazard of Turkey, while the number of the possible seismogenic zones is numerous, albeit self-amalgamated and contentious. With this study, it becomes clear that the holistic process of PSHA, which innately requires a long and tiresome effort, can instantaneously be performed against the changing of earthquake catalog over time and prompt evaluations can thus be made.