1 Introduction

Pakistan lies in a seismically active and earthquake-prone region of the world. The country and its surrounding region possess a complex seismo-tectonic environment where Arabian, Indian, and Eurasian tectonic plates are interacting with different rates of movement [1]. In the northern part of Pakistan, the Indian plate is in a state of a slow head-on collision (spanning over 50–55 million years) with the Eurasian plate at a rate of 3.7–4.2 cm/year [2]. The country is located on the north-west edge of the Indian plate as shown in Fig. 1 [3]. The convergent boundary between two continental plates, also referred to as North Collision Boundary, NCB (Fig. 1), has resulted in the development of Himalayan orogenic belt which includes some folds and thrust faults. The Himalaya Arc, Hazara Arc, Himalayan Frontal Thrust (HFT), Indus Kohistan Seismic Zone (IKSZ), and the Salt Range Thrust (SRT) are part of the NCB [1, 4,5,6]. The 2005 Kashmir earthquake shows a NW–SE thrust dominant mechanism, this earthquake occurred at IKSZ [7]. It has also resulted in the development of two complex subduction zones with deep brittle seismological structures; the Hindukush and the Pamir ranges [8]. In the west, the convergence of Indian and Eurasian plates forms an oblique collision zone called as West Collision Boundary (WCB) (Fig. 1). A well-known strike-slip fault (Chaman fault) and Sulaiman–Kirthar mountain ranges are the results of this inclined collision. The rate of left lateral shear between these two plates is approximately 3.0 cm/year [3]. Another complex tectonic structure, called Quetta Transverse Zone (QTZ), is located in the east of WCB as depicted in Fig. 1. The QTZ is comprised of several thrust faults (e.g. Ghazaband fault and Ornach-Nal fault) and folds that are associated with the Indian–Eurasian plate boundary [9]. In the south of Pakistan, the Arabian plate is subducting under the Eurasian plate at the rate of 2.8–3.3 cm/year resulting in the development of Makran Subduction Zone (MSZ) [10]. The MSZ is shown in Fig. 1 [11,12,13]. It is located in the south-east of Iran and southern Pakistan that extends for almost 900 km along the Eurasian–Arabian plate boundary [14]. This complex geo-tectonic environment has posed a high level of seismic hazard to Pakistan and its surrounding region.

Fig. 1
figure 1

The tectonic setting of Pakistan. a shows the plate boundary between Indian, Eurasian, and Arabian Plates with their respective slip rates. b shows the tectonic features of the country (courtesy: Geological survey of Pakistan). The velocities of plates are extracted from Bird [64], Jade [65], Sella et al. [66], Vernant et al. [67]

In the last one hundred years, the country has been hit by several major earthquakes [15, 16]. These include the 2005 Kashmir earthquake \(({M}_{w}\) = 7.6) and the 1945 Makran earthquake \({(M}_{w}\) = 8.1) and tsunami [17]. Pakistan has severely suffered in the past from dangerous earthquakes and is still facing high seismic risk due to high seismic hazard and high vulnerabilities of communities which may lead to huge socio-economic losses in the future. With such a high level of seismic hazard and associated risk, there is a dire need for an accurate and reliable seismic hazard assessment study of the country to develop seismic hazard maps.

In the last two decades, several studies have compiled and presented the catalogue of earthquake events for seismic hazard assessment of Pakistan [18,19,20]. In one of the pioneering studies, NESPAK (2007) compiled an earthquake catalogue for the probabilistic seismic hazard assessment conducted during the development of the Building Code of Pakistan [21]. This catalogue included earthquake events from 1668 to 2006 CE (see Table 1). There are several limitations attributed to this catalogue. A comprehensive set of international databases was not considered while developing this catalogue. Secondly, this catalogue includes earthquake events up to 2006. Similarly, Zare et al. [20] developed an earthquake catalogue for the Earthquake Model of Middle East (EMME) project which included the whole region of Pakistan. The earthquake events in this catalogue were reported up to 2014 CE. More recently, Khan et al. [18] have developed an earthquake catalogue which contains seismic events up to 2016 CE. To develop an updated seismic hazard map, we need our earthquake catalogue to be updated as much as possible. So, for this purpose, all of the previously compiled earthquake catalogues are combined. The recent events which were missing in the previous catalogues are included in the combined catalogue. Finally, the earthquake catalogue is critically analysed for duplicated events, dependent events, and completeness.

Table 1 The earthquake catalogues compiled by different studies for the region

In this study, a relatively more updated and complete earthquake catalogue was compiled, and a seismic source model was proposed for the seismic hazard assessment studies in Pakistan. The geographical boundary was extended to 300 km from the territory of Pakistan to include all the earthquake events which may influence the hazard in the case study region. The geographical area that is bounded to longitude 20°–40° N and latitude 58°–83° E (Fig. 1) contains this composite earthquake catalogue. The earthquake catalogue was compiled using multiple earthquake catalogue sources for the period of AD 10–2018 CE. Based on this updated catalogue, a new seismic source model is also proposed. Both the conventional area source model and spatially smoothed gridded seismicity model were developed, and their recurrence parameters were reported for the seismic hazard studies in Pakistan.

2 Development of Updated Earthquake Catalogue

For data collection, the earthquake events with epicentres in Pakistan and surrounding areas, bounded by geographical limits 20°–40° N and 58°–83° E, are considered. The data of historically reported (AD 10 to 1900 CE) and instrumentally recorded (1900 CE to December 2018 CE) earthquake events are compiled using multiple international databases and local sources. The international sources include South Asian Catalogue (SACAT), International Seismological Centre [22], National Earthquake Information Centre (NEIC) [23], National Geophysical Data Centre (NGDC) [24], Global Centroid Moment Tensor (GCMT) [25, 26], and the United States Geological Survey (USGS, 2019) while the local sources include Pakistan Meteorological Department (PMD) (http://seismic.pmd.gov.pk/) and Water & Power Development Authority (WAPDA) (pers. comm.). The historical data were obtained from the published literature [18, 20, 27,28,29,30]. A total of 63,273 events are collected for a period of AD 10 to 2018 CE from the aforementioned earthquake databases for the region under consideration. A summary of longitude, latitude, time, depth, magnitude, and reporting agency was specified for each event in the catalogue. Table 2 shows some important data sources and the number of events reported by them for this study area. Since earthquake data were collected from multiple sources, some events were reported by more than one source. There were duplicated events in the combined catalogue. All the duplicated events were excluded from the combined catalogue based on priority order (see Table 2) which reduced the events to 34,104. The priority order is based on the comprehensiveness of earthquake catalogues which is given in Table 2.

Table 2 Earthquake databases used as sources

2.1 Magnitude Homogenization

The international databases used in this study report the earthquake events in different magnitude scales (\({M}_{w}\),\({M}_{\mathrm{S}}\),\(mb\),\({M}_{\mathrm{L}}\), and \({M}_{\mathrm{D}}\)). For example, the PMD and NEIC report the events mostly in body-wave magnitude (mb), whereas the ISC database provides the events in other magnitude scales. The procedure used for the seismic hazard assessment requires the earthquake catalogue to be homogenized in terms of a single magnitude scale. For this purpose, several researchers have developed magnitude conversion equations using the regression analysis of the data which are recorded and reported in two different magnitude scales. In this study, all magnitude scales for the developed earthquake catalogue are converted to the moment magnitude (\({M}_{w}\)) scale. Apart from using several established conversion relationships, the collected data are also used to develop new empirical equations for magnitude homogenization. These relationships are compared with those developed in a few selected studies including Khan et al. [18], Rafi et al. [31], Scordilis [32], and Zare et al. [20]. The study conducted by Scordilis [32] used the global earthquake event dataset for deriving magnitude conversion relations, while the others have used the regional earthquake datasets. The relationships developed for a particular seismo-tectonic region may result in a significant deviation from those derived for global earthquake data [33]. Therefore, in this study, new empirical relations are developed on the basis of regional dataset collected as part of developing the earthquake catalogue for the study region.

For converting body-wave magnitude \((mb)\) to moment magnitude \(({M}_{w})\), an empirical relationship is developed based on the standard linear regression analysis of data shown in Fig. 2(a). Although several studies have proposed the use of nonlinear magnitude conversion relationships, e.g. Lolli et al. [34], a linear relation is developed in this study for simplicity and is presented as Eq. (1). It is developed for a total of 490 paired events having both the magnitude types (\(mb\) and \({M}_{w}\)) collected from all the databases mentioned above. The minimum and maximum magnitude observed for those paired events are 4.0 and 6.2, respectively. Therefore, the equation is valid for the magnitude range of \(4.0 \le mb\le 6.2\) with \({R}^{2}=0.7211\) and \(\sigma =0.24\). Here, \({R}^{2}\) is the coefficient of determination, and \(\sigma\) is the standard deviation.

$$M_{w} = 0.967mb + 0.1989\quad \left( {4.0 \le mb \le 6.2} \right)$$
(1)
Fig. 2
figure 2

a and b show the relationships developed for converting \(\mathrm{mb}\) and \({M}_{\mathrm{S}}\) to \({M}_{w}\) using data from all the databases mentioned above. c and d show a comparison of the magnitude conversion relationships with the previous studies

Similarly, for the conversion of the surface wave magnitude scale (\({M}_{\mathrm{S}}\)) to moment magnitude (\({M}_{w}\)), two linear standard regression equations are developed using the data presented in Fig. 2(b). The data are separated into two magnitude ranges (i.e. \({M}_{\mathrm{S}}=3.0-6.1\) and \({M}_{\mathrm{S}}=6.2-8.2\)) since a different trend for events with \({M}_{\mathrm{S}}>\) 6.2 is observed. The bilinear regression equations (derived for this conversion) are represented in Eqs. (2) and (3). Eq. (2) is developed using 728 paired events (having both magnitude scales \({M}_{\mathrm{S}}\) and \({M}_{w}\) in the catalogue) for magnitude range \(3.0\le {M}_{\mathrm{S}}\le 6.1\) with correlation coefficient \({R}^{2}= 0.69\) and standard deviation \(\sigma =0.24\), whereas Eq. (3) is derived based on 76 paired events for magnitude range \(6.2\le {M}_{\mathrm{S}}\le 8.2\) with \({R}^{2}= 0.73\) and \(\sigma =0.28\).

$$M_{w} = 0.5396M_{{\text{S}}} + 2.7051\quad 3.0 \le M_{{\text{S}}} \le 6.1$$
(2)
$$M_{w} = 0.9336M_{{\text{S}}} + 0.3781\quad 6.2 \le M_{{\text{S}}} \le 8.2$$
(3)

The relationships developed in this study are graphically compared with those developed by Khan et al. [18], Rafi et al. [31], Scordilis [32], and Zare et al. [20] in Fig. 2(c and d). Also, a tabular comparison of all selected studies is shown in Table 3.

Table 3 Comparison of magnitude conversion relations

For conversion of local magnitude \(({M}_{\mathrm{L}})\) scale to moment magnitude \(({M}_{w})\), the common data included a total of 510 identified paired events. However, these data were scattered, and the correlation coefficient \(({R}^{2})\) obtained for magnitude conversion relationship was \(0.20\). Therefore, the empirical relationship for this conversion is not incorporated in this study. Additionally, several studies have proposed that the \({M}_{\mathrm{L}}\) and \({M}_{w}\) are almost equal for magnitudes smaller than \({M}_{w}=6.5\) [18, 20, 31, 32]. Therefore, the magnitude conversion equation by Heaton, Tajima, and Mori (1986) is employed for the conversion of local magnitude (\({M}_{\mathrm{L}}\)) to moment magnitude (\({M}_{w}\)).

$$M_{{\text{L}}} = M_{w} \quad \left( {M_{{\text{L}}} \le 6} \right)$$
(4)

Some of the earthquake events were reported in the duration magnitude scale \(({M}_{\mathrm{D}})\) by ISC and USGS. In this current study, the duration magnitude was converted to \({M}_{w}\) by using the relationship developed by Akkar et al. [35] as presented in Eq. (5) as follows.

$$M_{w} = 0.764 M_{{\text{D}}} + 1.379\quad 3.7 < M_{{\text{D}}} < 6.0$$
(5)

Finally, a priority order for earthquake events based on the magnitude scale was set to collect the data. The priority order for \({M}_{w}\),\(mb\), \({M}_{\mathrm{S}}\), \({M}_{\mathrm{L}}\), and \({M}_{\mathrm{D}}\) is set as 1, 2, 3, 4, and 5, respectively. After the homogenization of earthquake events, the catalogue had a total of 63,273 events. The identification of same/duplicated events (reported by different constituent catalogues) was performed after homogenizing the magnitude. The events having identical year, month, day, hour, minute and having almost the same coordinates are considered as same/duplicated events. All the duplicated events were manually excluded from the combined catalogue, based on priority order as shown in Table 2. This process reduced the total number of events to 34,104.

2.2 Declustering of Earthquake Events

One of the basic assumptions of the probabilistic seismic hazard assessment (PSHA) is that the earthquake events are statistically independent and occur randomly in time. Such process is mathematically described as a Poissonian stochastic process or a process having Poissonian distribution. Based on this assumption, several methodologies have been proposed to identify the dependent events in a time–space window around the mainshock event. The removal of such dependent events from the earthquake catalogue results in a Poissonian distribution of seismicity in a region. This process in which the earthquake events (mainshocks) are segregated from the dependent events is called declustering [36]. For this purpose, four studies including Gardner, Knopoff [36], Reasenberg [37], Uhrhammer [38], and Gruenthal (Zare et al. [20]) are selected in this study. These studies propose different methods for declustering the earthquake catalogue. They are selected after the review of available literature. More importantly, their algorithms and source codes were also easily available.

The results obtained from these methods generally suggest different sizes of the time and space windows for declustering. The size of the time and space window is directly proportional to the magnitude of the mainshock. The collection of earthquake events that occur in that time and space window is called a seismic cluster. The earthquake event with a higher magnitude in the seismic cluster is considered as the mainshock, whereas the events within that spatial window of space and time frame with reference to the mainshock are regarded as dependent events [37]. The values suggested by Reasenberg [37] algorithm for the default standard parameter are represented in Table 4. Foreshocks and aftershocks are the dependent events that occur before and after the mainshock, respectively. Therefore, the foreshocks and aftershocks are required to be eliminated from the catalogue as these events are considered to be dependent events over the mainshock.

Table 4 Declustering input parameters recommended by Reasenberg [37]

Table 5 shows the empirical relationships for determining the size of time and space window proposed by Gardner, Knopoff [36], Uhrhammer [38], and Gruenthal (Zare et al. [20]). The catalogue developed in this current study was declustered separately by employing these four algorithms using the ZMAP software package [39]. The total number of clusters and remaining events after the application of each method of declustering is shown in Table 6. In this study, the catalogue obtained after the declustering method proposed by Gardner, Knopoff [36] is used for determining the seismic hazard parameters. Several similar studies focussed on the hazard analysis of this region including Zare et al. [20] and Khan et al. [18] have also preferred the use of declustering method developed by Gardner, Knopoff [36]. The spatial distribution of final declustered earthquake events is presented in Fig. 3.

Table 5 The windows size approximation by Gardner and Knopoff (1974), Uhrhammer (1986), and Gruenthal (Zare et al. [20])
Table 6 Number of events after declustering
Fig. 3
figure 3

Temporal and spatial distribution of final declustered earthquake catalogue consisting of 7845 earthquake events

2.3 Data Completeness

Incompleteness in the catalogue is the difference in recorded and real seismicity of the region [40]. Catalogue incompleteness is considered as an important uncertainty of an earthquake catalogue in seismic hazard analysis. The earthquake catalogue must be complete to be used in any seismic hazard assessment study. Since in the past the earthquake recording instruments were insufficient to record every possible earthquake event. Therefore, the available data of past earthquakes are very scarce. This scenario brings forward the uncertainty of completeness in catalogue. The temporal distribution of earthquake events from 10 to 2018 CE is shown in Fig. 4.

Fig. 4
figure 4

Plot showing the temporal distribution of events for Pakistan and surrounding areas bounded by geographical limits 20°–40° N and 58°–83° E. Events are plotted from 10 to 2018 CE

There are several methods to measure the completeness of the catalogue with respect to magnitude and time. In this study, two techniques; Visual Cumulative Method (CUVI) [41] and Stepp [42] method; are employed. The CUVI method [41] is a simple graphical procedure. It is based on the visual observation in which complete earthquakes of a given magnitude are assumed to follow a stationary occurrence process (straight line). Furthermore, the declustered catalogue is divided into nine magnitude classes. The magnitude classes are incremented with a 0.5 magnitude bin, i.e. \(4-4.5\) and \(4.5-5\), etc. (Table 7). For every magnitude class, a graph is constructed between the cumulative number of events and the time period in years. So, for each magnitude class, the period of completeness is the approximate straight line starting from the point where the trend gets stabilized. The initial point is called magnitude of completeness (\({M}_{\mathrm{C}}\)). Similarly, Woessner, Wiemer [43] defined magnitude of completeness, \({M}_{\mathrm{C}}\), as the lowest recorded magnitude at which approximately all the events are detected. It is also known as threshold magnitude. The magnitude of completeness (\({M}_{\mathrm{C}}\)) has an important role in the calculation of seismicity parameters (‘a’ and ‘b’ values) for seismic sources using Gutenberg, Richter [44] relationship. In this current study, the magnitude frequency distribution (MFD) by Gutenberg, Richter [44] is used for calculating the magnitude of completeness (\({M}_{\mathrm{C}}\)) within each seismic source. A graph between the magnitudes and logarithm of the cumulative number of events is drawn. The maximum curvature of the graph gives the magnitude of completeness for each set of events, as shown in Tables 9 and 10. The method is applied using the ZMAP software package [39].

Table 7 Completeness ranges for every magnitude class

The Stepp [42] method is based on the mean rate of recurrence of earthquakes \((\lambda )\) in a given time and magnitude class, this method classifies the earthquake catalogue on the basis of magnitude and time (years) intervals as represented in Eq. 6. For the completeness analysis applying the Stepp [42] method, the OpenQuake engine [45] is used.

In this study, both the methods were employed and the results from both methods (CUVI and Stepp (1973)) yielded similar completeness periods. The completeness periods for different magnitude classes are shown in Table 7. It is a common observation that larger magnitude earthquakes are rarely missed as compared to the smaller magnitude events in history. This is the reason that the completeness range of larger magnitudes is higher as compared to the smaller magnitudes, which are complete only within the instrumental period.

2.4 Seismogenic and Focal Depths

The maximum observed depth in any seismogenic zone at which most of the earthquakes occur is called seismogenic depth (\({D}_{\mathrm{seis}}\)) [46]. In geophysics, the seismogenic layer includes all the depths of earthquakes which mostly occur in the lithosphere or crust of the earth [47]. To get a better insight into the tectonics of the region and to determine the seismic hazard, the accurate determination of the focal depths of earthquakes is extremely important [48]. For this reason, the final catalogue was mapped, and Fig. 5 represents the earthquakes based on focal depths. Based on focal depths, the seismic area sources are classified into shallow and deep seismic sources. Figure 5 is depicting that the seismicity of Pakistan is spread all over the country. However, deeper earthquakes are focussed in the northern areas of Pakistan.

Fig. 5
figure 5

Earthquake events in the catalogue with different focal depths

The depth of seismogenic layers is related to the size of earthquakes which are generated by active faults. This is the reason that it’s a significant parameter for the assessment of seismic hazard [49]. Some intermediate-depth events at about 70° E 24° N where only shallow seismicity seems to occur shows a pattern are actually the area of Kachchh mainland fault and Nigarparkar fault (Fig. 5). Another important pattern of earthquakes at an area approximately 65°–68° E 25°–28° N shows shallow seismicity with some intermediate-depth events is the area of Chamman, Ghazaband, and Ornachnal-Nal faults. Similarly, the deeper seismicity of Hindukush region can be seen in the northern areas of Pakistan and Afghanistan (Fig. 5).

Therefore, the ultimate declustered catalogue (Fig. 3) is further divided into shallow- (depth \(<\) 50 km) and intermediate-depth earthquakes (depth \(<\) 50 km) as shown in Figs. 6 and 7, respectively. The data in these figures are displaying that most of the seismicity of Pakistan is due to shallow earthquakes. Shallow earthquakes contribute 83%, while the deep earthquakes have only a 17% contribution. Most of the deeper earthquakes are located in the northern part of Pakistan and around the plate boundary, while some intermediate-depth earthquakes are also observed in the south-western part of the country. The research carried out by the Pakistan Meteorological Department and Norwegian Seismic Array [50] has concluded similarly. The report presented the variation in past seismicity of Pakistan based on focal depths. Additionally, PMD & NORSAR (2007) reported that 80% of past seismicity is shallow (depth \(<\) 40 km) and only 20% of previous earthquake events have focal depths between 50 and 320 km.

Fig. 6
figure 6

Spatial distribution of shallow earthquakes (depth < 50 km); blue: 0–15 km depth, green: 15–30 km depth, red: 30–50 km depth

Fig. 7
figure 7

Spatial distribution of intermediate-depth earthquakes (50 km < depth < 250 km); blue: 50–100 km depth, green: 100–150 km depth, red: 150–250 km depth

Makran subduction zone shows intermediate to low seismicity except a few large earthquakes. The 1945 earthquake (\({M}_{w}\) 8.2) is the maximum earthquake observed in this region. In this region, the Arabian plate is subducting under the Eurasian plate with a dip angle of 10 degrees extending 400–500 km towards the north [51]. The figure shows most of the events are having depths ranging between 5 and 55 km (Fig. 8).

Fig. 8
figure 8

Cross sections of the Makran subduction zone (MSZ) along longitude 57° E covering the historical events from 57° E to 66° E and latitude between 24° N and 27.5° N

3 Development of Updated Seismic Source Model

The development of a reliable seismic source model is a primary and important step for any seismic hazard assessment study [52]. The source model comprises of delineated area source zones and characterization of other seismogenic sources like faults and background seismicity. Based on the updated earthquake catalogue, two seismic source models (area source and spatially smoothed background seismicity) are presented for the seismic hazard assessment studies in Pakistan.

3.1 Seismic Area Source Model

In probabilistic seismic hazard analysis, area sources are used for the representation of regions having homogenous seismicity. These area sources are often used for the modelling of seismicity pattern for those regions where the tectonic evidence are very rare. In the literature, various studies have delineated area sources for Pakistan that include Danciu et al. [53], Rafi et al. [31], Zhang et al. [54], Khan et al. [18], and NESPAK [19]. In this current study, Pakistan and the surrounding areas are divided into twenty-three (23) shallow crustal and five (5) deep source zones (Figs. 9 and 10). Small area sources are delineated and preferred over large source because the seismic hazard is reduced when large area sources are selected. This phenomenon is called spatial smearing [55]. The delineation of area sources is performed by considering the seismicity pattern, active crustal faults of the region, and principles of the Global Seismic Hazard Assessment Program [56] and Earthquake Model Middle East [53]. The area sources proposed by the above-mentioned researchers were digitized/reproduced and combined to get an insight of the area sources of the past studies. In addition to this, the area sources in this current study are based on more updated catalogue which has higher number of earthquake events. The magnitude completeness periods for different magnitude ranges are greater than the previous studies. Number of area sources are comparatively higher as shown in Table 8. Furthermore, the seismo-tectonic of the region is accurately and precisely studied and taken into consideration while delineating area sources. These are the reasons which make this study more peculiar than the previous ones.

Fig. 9
figure 9

The shallow seismic source zones divided based on active crustal faults and historical seismicity of that region

Fig. 10
figure 10

The deep source zones delineated to count for the intermediate-depth earthquakes (< 50 km depth < 250 km)

Table 8 Comparison of various studies to the current study

The recurrence rates for shallow and deep area sources are calculated by using the maximum likelihood method [57] and Gutenberg, Richter [44] magnitude distribution formula.

$${\text{Log}} \lambda_{M} = a - b*M$$
(6)

where \({\lambda }_{M}\) is the rate of earthquakes with magnitudes greater than and equal to moment magnitude \((M)\), ‘a’ is the y-intercept and ‘b’ is negative slope of the exponential curve as shown in Figs. 11 and 12. The ‘a’ value indicates the overall rate of earthquakes in a region and ‘b’ value indicates the relative ratio of small and large magnitudes [18]. It is assumed that the earthquake events with magnitude lower than \({M}_{w} 4\) may not result in any significant damage to the structures. Therefore, \({M}_{w} 4\) is selected as the lower bound magnitude (\({M}_{w}^{\mathrm{min}}\)). On the other hand, the maximum magnitude (\({M}_{w}^{\mathrm{max}}\)) in each area source is selected as the maximum observed magnitude plus 0.5. This margin of 0.5 is selected to account for any uncertainty in \({M}_{w}^{\mathrm{max}}\) estimation.

Fig. 11
figure 11figure 11figure 11

Magnitude frequency distribution (MFD) curve for shallow seismic sources. The maximum likelihood method of Aki [57] is employed for obtaining seismicity parameters, i.e. a and b values

Fig. 12
figure 12

Magnitude frequency distribution (MFD) curve for deep seismic sources. The maximum likelihood method of Aki [57] is employed for obtaining seismicity parameters, i.e. a and b values

The seismicity parameters are extracted from the graphs which are plotted by using the updated version of the ZMAP tool [39]. The values of ‘a’ and ‘b’ for shallow and deep seismic sources were calculated and are shown in Tables 9 and 10, respectively. The seismogenic depth (\({D}_{\mathrm{seis}}\)) for all seismic sources is also determined and is presented along with other parameters. In shallow seismic sources, the variation in ‘b’ value is from 0.529 to 1.23, while in deep seismic sources, the value ‘b’ varies from 0.63 to 1.05. This variation in ‘b’ is indicating that several major earthquakes (\({M}_{w}>7\)) have focal depths less than 50 km. The value of ‘b’ is much lower for area sources located in the Hindukush region. The graph is being flattened due to several large earthquakes in this region. The final catalogue has 112 earthquake events that are considered as major earthquakes (\({M}_{w}>7\)). 79% of those earthquakes have focal depths less than or equal to 50 km, while the remaining 21% were intermediate-depth earthquakes. Similarly, the value of \({\lambda }_{M}\) in shallow seismic source varies from 0.36 to 13.18, while it varies from 0.68 to 40.08 in deep zones. The shallow zone 21 showed a minimum value of \({\lambda }_{M}\), i.e. 0.36 indicating least active seismic region having 23 number of events while the deep zone 5 depicted the maximum value of \({\lambda }_{M}\), i.e. 40.08 among all the seismic sources indicating the most active zone of the region. This seismic source has a ‘b’ value of 0.93 and a maximum moment magnitude of 7.5.

Table 9 Seismicity parameters for 23 shallow area sources using the maximum likelihood method (Aki 1965)
Table 10 Seismicity parameters for five deep area sources using the maximum likelihood method [57]

3.2 Spatially Smoothed Seismicity Model

A recent trend in the seismic hazard analysis is to explicitly model the crustal faults along with the background seismicity of the region. For this purpose, generally, the earthquake events are divided into two categories for the modelling of crustal faults and background seismicity. In such studies, the use of area source model alone is not considered sufficient to accurately predict the seismic hazard of an area. Therefore, besides the explicit modelling of faults, the background seismicity is also modelled as a spatial smoothed gridded seismicity model. Currently, such seismicity models are mostly being used in the probabilistic seismic hazard assessment studies [20, 53, 58,59,60]. These studies assume that future earthquakes will occur near the small and moderate size events [60]. Ideally, the earthquake record for a thousand years is required to predict the future hazard of a region. However, the available seismic data of the recorded earthquakes are too short to be relied upon for the spatio-temporal pattern of future earthquakes. Due to the unavailability of data, the pattern of available past seismicity can be used as the predictor of the earthquake occurrence in the intra-plate region [61]. In the current study, spatially smoothed gridded seismicity rates have been calculated by using the compiled earthquake catalogue. The smoothed activity rate, i.e. \({10}^{a}\) values are representing the earthquakes in mapped and unmapped fault areas as shown in Figs. 13 and 14. Besides dividing the region into small area zones, the region bounded by latitude 20°–40° N and longitude 58°–83° E is considered as a single large seismic source zone. The earthquake events reported in the region are divided into two layers based on their depths ranging from 0 to 50 km and 50 to 250 km. The region is divided into a grid of 0.1° in latitude and 0.1° in longitude. The variable catalogue completeness is taken into account by using the maximum likelihood method [62] for both shallow- and intermediate-depth seismicity for every grid cell. In this method, the earthquake’s data are classified into sets of magnitude ranges. The completion period is measured for each magnitude range to count the number of events for every range. Similarly, the seismicity rates are determined by counting the number of events having a magnitude greater than the threshold value (\({M}_{w}=4\)) in each grid cell. The number of events represents the estimate of maximum likelihood \({10}^{a}\) value for each cell. The resulting \({10}^{a}\) values are the Gutenberg, Richter [44] annual rate of occurrence of each cell.

Fig. 13
figure 13

Smoothed activity rate \({10}^{a}\) the value derived for seismicity from 0–50 km depth

Fig. 14
figure 14

Smoothed activity rate \({10}^{a}\) the value derived for seismicity from \(50-250\) km depth

One constant value of ‘\(b\)’ is required to be used for spatial smoothed seismicity model which should be calculated by considering the whole study region as one seismic source [59]. The same methodology was adopted for this current study. Consequently, a uniform value of \(b=0.9\) is used, which was calculated from the magnitude-frequency distribution of complete and declustered earthquake catalogue. The seismicity rates are spatially smoothed using the two-dimensional Gaussian function with a bandwidth of 50 km [58]. The bandwidth value is selected based on the catalogue location uncertainty and personal judgement.

4 Conclusions

An updated earthquake catalogue is compiled for Pakistan and surrounding areas bounded by coordinates latitude 20°–40° N and longitude 58°–83° E. It provides a comprehensive seismic data available for the seismic hazard assessment of the region. This catalogue is compiled by combining all the historical, pre-instrumental, and instrumentally recorded events from the available literature and online sources. The empirical expressions for the conversion of data from other magnitude scales to the moment magnitude scale are developed for homogenizing all the events. For removing aftershocks and foreshocks, the declustering of this catalogue is performed by a computer package ZMAP [39]. The total number of events that remained after declustering is 7845 (\(\mathrm{AD} 10-2018 \mathrm{CE}\)). Using this catalogue, several shallow and deep seismic area sources are also delineated based on the seismicity pattern, seismotectonic, and faulting mechanism. The declustered catalogue is further processed for each seismic source zone to compute seismic parameters (i.e. ‘a’ and ‘b’ value). The catalogue contains six events with \({M}_{w}\)> 8, while 93 events are between \({M}_{w}\) 7 and \({M}_{w}\) 8. The completeness analysis is performed using the visual cumulative method [41] and [42] method. For this purpose, the OpenQuake engine [45] is used. The magnitude of completeness is determined using the Gutenberg, Richter [44] magnitude-frequency distribution method for all seismic sources. Finally, using the developed catalogue, the smoothed activity rates for spatially smoothed gridded seismicity model are also determined for two layers of depths (from 0 to 50 km and 50 to 250 km). The developed catalogue, seismic source models, and recurrence parameters can be effectively used in the probabilistic seismic hazard analysis (PSHA) of Pakistan and its surroundings.