1 Introduction

The Himalayan–Tibetan region has long been regarded as one of the world’s biggest and highest seismotectonic active regimes resulting from ongoing collision between the Indian and Eurasian plates since Tertiary period (Avouac and Tapponnier 1993; Bai et al. 2017; Tapponnier et al. 2001; Zhao et al. 2012). This region encompasses the intra-continental active crustal Tibetan plateau in the center, the Himalaya orogen in the south, Pamir–Hindu Kush deep subduction zone in the west, and Tien Shan mountain belt in the north. Historical documents (Table 1) state that this region has produced many catastrophic earthquakes. The most recent damaging earthquake affecting central Himalaya is the 25 April 2015, M w 7.8 Gorkha, Nepal earthquake which caused nearly 9000 casualties, destroyed over fifty thousands of buildings, and gave rise to huge economic losses (Bilham 2015). In the current century, the largest earthquake affecting the China mainland, the eastern margin of Tibetan plateau in particular, was the 12 May 2008, M w 7.9 Wenchuan earthquake. It was occurred in Sichuan region resulting more than 80,000 human lives losses with ~U.S. $110 billion damage (Wang et al. 2012). Thus, the entire Himalayan–Tibetan region is facing a significant seismic hazard and consequent seismic risk.

Table 1 List of historical and major earthquakes (M w ≥ 7.5) in and around the study region since 1000 AD

Since the pioneer work by Cornell (1968), the probabilistic seismic hazard assessment (PSHA) has widely been used as a critical element for the determination of appropriate seismic design provisions for buildings and infrastructure (e.g., Bhatia et al. 1999; Kolathayar and Sitharam 2012; Ordaz et al. 2014; Ornthammarath et al. 2011; Sawires et al. 2016; Zhang et al. 1999). Implementing good seismic design provisions for buildings and other infrastructure is an effective way to reduce the seismic risk and thereby evading the earthquake-related disasters. The PSHA studies are primarily based on the earthquake catalogs (both instrumental and historical ones). In terms of time span, the duration of the complete part of the instrumental earthquake catalog is very short (e.g., around 55–60 years for this study region) compared to the duration of the tectonic processes required for earthquake generation (Bilham 2013). Using the short spanning earthquake catalog in PSHA may yield low hazard values for some regions, where great earthquakes have actually occurred and evident from the 2008 Sichuan M w 7.9, the 2010 Haiti M w 7.0, and 2011 Japan M w 9.0 earthquakes (Wang et al. 2012). We, therefore, compile all available historical and instrumental earthquake catalogs, which include more data than the existing studies.

The seismic hazard maps for the Himalayan–Tibetan area have been previously studied since the 1990s (e.g., Bhatia et al. 1999; Zhang et al. 1999; Gao 2003). The GEM-faulted earth project carried out studies on the characterization of the Himalayan Frontal Thrust (MFT) active fault in appraising the attributes for seismic hazard and seismic risk for Himalaya region (Berryman et al. 2014). The occurrence of the major historical earthquakes in the Himalayan–Tibetan region (Table 1) varies remarkably on the basis of the seismotectonic settings of the study area. The frequency of earthquake occurrence in the Himalayan orogenic belt and the Tien Shan orogenic belt is much higher than that in the interior of the Tibetan plateau (Bai et al. 2017).

In this study, we use a methodology recently developed by Kijko et al. (2016) for the assessment of the key seismic hazard parameters, because it uses both historical and instrumental earthquakes for the estimation of spatio-temporal hazard variation. The seismicity parameters (e.g., γ, b value, Mmax) are assumed to be constant over time and space by most of the PSHA procedures. However, the closer examination of earthquake catalogs indicates the significant variations of seismic activity rate and Gutenberg–Richter b value. The parameters are computed assuming the incompleteness of the event catalogs by accounting for historical records since the tenth century and complete instrumental records with different levels of completeness. In addition, the uncertainty of the earthquake-occurrence model and the determination of seismicity parameters are taken into account. Besides, three seismogenic source models (i.e., the smoothed-gridded seismicity, the linear source model, and areal source model) are combined by the use of a standard logic tree structure for capturing the model based epistemic uncertainties. In addition, two sets of ground motion prediction equations (GMPEs) are also selected for active shallow crustal and subduction interface earthquake regimes. This analysis produces a set of probabilistic seismic hazard maps for peak ground acceleration (PGA) and spectral acceleration (SA) for a better understanding about the spatial variation of seismic hazard value.

2 Data and Methods

2.1 Earthquake Catalogs

Earthquake catalog provides the fundamental basis for delineating the seismogenic sources and for assessing the key seismicity parameters, including the mean seismic activity λ, the b value of the frequency–magnitude Gutenberg–Richter relation, and M max, the maximum expected earthquake magnitude of the study area. The incompleteness of the earthquake catalog is considered by accounting both historical earthquakes and instrumental catalog with different levels of completeness (Kijko and Smit 2012; Kijko et al. 2016). To prepare a composite earthquake catalog, we have collected historical earthquake archives for Tibetan–Himalayan region as much as possible. Lee et al. (1976) is one of the biggest historical earthquake repositories for China mainland, which even contains the information about pre-historic earthquakes. For Himalaya and adjacent region, Ambraseys and Douglas (2004) and Szeliga et al. (2010) are the prime sources for the historical major earthquakes. Besides, the historical earthquake records from the National Oceanic and Atmospheric Administration (NOAA), GEM Global Historical Earthquake Archive (GEM-GHEA), and the National Seismological Centre of Nepal (NSC) are used. For instrumental seismic-event catalogs, the earthquake records from the U.S. Geological Survey (USGS), International Seismological Centre (ISC), and China Seismograph Network (CSN) between 1906 and 2016 are utilized.

To obtain the comprehensive catalog, the catalogs from different sources are then assembled together and checked manually for removing the duplicate earthquakes on the basis of location, time, and magnitude. Different types of magnitude scaling (e.g., body wave magnitude, surface wave magnitude, duration magnitude, and local magnitude) contained in the catalog are converted into moment magnitude (M w) scaling using the formulas, as shown in Table 2. About 30,000 events are recorded in the comprehensive catalog with magnitude M w ≥ 4.0 including the historical earthquakes. Since the seismicity model assumes that the occurrence of an earthquake is independent, a declustering process following the algorithm of Gardner and Knopoff (1974) is performed to exclude the effects of foreshocks and aftershocks. A total of 12,000 events are included in the final catalog (Fig. 1).

Table 2 Magnitude scaling relations used for conversion of different types of magnitude scaling into moment magnitude scale
Fig. 1
figure 1

Location map showing the distribution of instrumental (black open circles) and major historical (grey filled circles) earthquake epicenters. Active faults are adopted from Styron et al. (2010). Dotted white lines are suture zones: IYS Indus Yalu suture zone, BNS Bangong Nujiang suture zone, JS Jinsha suture zone, SSZ Shyok suture zone, TS Tanymas suture zone, AMS Anyimaqen–Kunlun–Muztagh suture zone. Black rectangles are seismic source zones. Insert is the map of East Asia and surrounding countries, while the marked rectangle within the insert map is the study region

For instrumentally recorded earthquake catalog, it is very important to assess the different levels of completeness of various sub-catalogs, because the large earthquake records are generally complete rather for longer periods than the smaller earthquakes. The completeness of earthquake catalog is usually associated with the socio-economic and historical aspects, the demographic variations, and the seismic station coverage. If the incompleteness of the catalog is not taken into account, the recurrence rates would be overestimated and underestimated for small and large earthquakes, respectively. To estimate the completeness of the earthquake catalog, we use visual cumulative method (Tinti and Mulargia 1985) and assume a threshold magnitude (M 0) of M w 4.0. The catalog is found to be complete from 1964 onwards for Himalaya and Pamir–Hindu Kush regions (Fig. 2a) and from 1960 onwards for Tibetan and Tien Shan regions. This implies that the catalog before the modern instrumental period is incomplete typically for small magnitude earthquakes. To incorporate the entire instrumental earthquake catalog since 1906, different complete time windows for various magnitude intervals (e.g., 4.0–5.0; 5.0–6.0; and 6.0–7.0) which corresponds to different complete sub-catalogs that are assessed employing the statistical method of Stepp (1972). For example, in the Himalayan region, the different levels of completeness are of 4.0, 5.0, and 6.0 with the corresponding complete time windows that are of 1964 onwards, 1925–1964, and 1906–1925, respectively (Fig. 2b).

Fig. 2
figure 2

Example of a completeness verification for M 0 = 4.0 and b different complete time windows for the corresponding magnitude intervals (solid tangent lines show the constant standard deviation and vertical dashed lines show the length of the time interval) for Himalayan orogenic belt

2.2 Earthquake Source Models

Proper seismic source zone identification and delineation is very important in PSHA. Seismic zones are usually defined by considering on the tectonic environment, seismological and geological attributes. The study region is regarded as complex tectonic environment in producing the earthquakes. This analysis thus used a model built from a combination of three different seismic source materials: areal, linear, and smoothed-gridded sources.

2.2.1 Areal Seismic Sources

The basic principle used in delineating the area of seismic source zones requires that seismicity within a single source zone must remain uniform and homogenous. Therefore, every point within the source is assumed to have an equal potential for the generation of an earthquake in the future. The standard Cornell approach (Cornell 1968) is primarily based on seismic areal sources. Seismicity distribution, geological behaviors, active fault characteristics, seismotectonic settings, and focal mechanism of earthquakes are the principal criteria to delineate the seismic areal source zone. Based on these criteria and zones on the previous studies (e.g., Bhatia et al. 1999; Nath and Thingbaijam 2012; Zhang et al. 1999; Zhangming 1992), 63 areal source zones are identified for the whole study area. High seismicity clusters are present in Pamir–Hindu Kush region, western Nepal, east-central Nepal, eastern Himalayan syntaxis, and central to west of Tien Shan. Interior of Tibet shows moderate seismicity, while the Tarim basin shows very sparse seismicity (Fig. 1).

2.2.2 Linear Seismic Sources

The major active faults are considered as linear seismic sources, because major earthquakes are mostly associated with these faults. The active faults identified from the remote sensing imagery, aerial photographic interpretation, paleoseismic studies, and field investigations (Styron et al. 2010) are used in this study. For the whole study region, the identified numbers of normal, strike slip, and thrust faults are of 50, 69, and 119, respectively (Fig. 1). The empirical relationships are applied to characterize the parameters of faults if it is not well constrained by other studies. For magnitude–area and magnitude–length scaling relationships, two different regression relationships are adopted: Yen and Ma (2011) and Wells and Coppersmith (1994). Stirling and Goded (2012) carried out a compilation and evaluation on 72 magnitude–area and magnitude–length scaling relationships. They found that the quality of the relationships of Yen and Ma (2011) is better than that of Wells and Coppersmith (1994) because of updated wide-spread data sets with large magnitude coverage, and the assigned quality score is of 1 (best available). On the other hand, they recommended the quality score of 2 (good) for Wells and Coppersmith (1994) relationships because of old data sets and relatively narrow magnitude coverage. Accordingly, in this analysis, two-third and one-third weightages are assigned to Yen and Ma (2011) and Wells and Coppersmith (1994), respectively. The occurrence rate of large magnitude earthquakes along these faults is determined using long-term slip rates as

$${\text{Recurrence Interval}} = M_{0} / \mu \dot{u}LW,$$
(1)

where μ is shear modulus, 3.0 × 1011 dyne/cm2, L is rupture length, W is rupture width, \(\dot{u}\) is the fault slip rate, and M 0 is the seismic moment which is directly related to magnitude (M) as Hanks and Kanamori (1979):

$$Log\left( {M_{0} } \right) = 1.5M + 16.05.$$
(2)

As the attributes (e.g., dip, slip, and slip rate) of small-scale faults are not well-documented, we assign the same values for the above-mentioned parameters taken from the nearby known faults to small-scale faults. The Main Frontal Thrust (MFT) is one of the well-constrained active faults in the study area. The slip rate of the MFT is studied from the GPS observations (Ader et al. 2012). The attributes of the MFT at three different segments are derived by the GEM-faulted earth project (Berryman et al. 2014); which are directly used in this study. The Characteristic-earthquake magnitude–frequency distribution model for the empirically estimated earthquakes with magnitude equal to or higher than M w 6.5 and Gutenberg–Richter magnitude–frequency distribution model for instrumentally recorded earthquakes are used in linear source model. Table 3 shows an example illustrating the linear source parameters (e.g., seismic moment, slip rates, rupture length, and rupture) for Himalayan region.

Table 3 Examples showing parameters of active faults in Himalayan region estimated in this study

2.2.3 Smoothed-Gridded Seismicity

Due to the wide scattering of earthquake hypocenters in relation with fault locations, a zone-free approach (Frankel 1995; Woo 1996) is used for the spatial smoothing of seismicity to avoid any subjectivity for delineating areal seismic sources. This zone-free approach is also called the kernel estimation method, because it implements the spatial smoothing of seismicity and depicts the non-uniform spatial distribution of seismicity assuming that it varies from place to place. For smoothed gridded, the study area is divided into thousands of grids with the size of 0.1° × 0.1°. Earthquakes of M w ≥ 4.0 are counted in each grid square for different magnitude intervals to estimate the occurrence rates for the corresponding magnitude intervals. The calculated seismicity rates are then smoothed to get the final activity rate adopting the multivariate Kernel probability density function using the equation (Woo 1996):

$$\hat{n}_{i} = \frac{{\mathop \sum \nolimits_{j} n_{j} e^{{ - \Delta_{ij}^{2} /c^{2} }} }}{{\mathop \sum \nolimits_{j} e^{{ - \Delta_{ij}^{2} /c^{2} }} }},$$
(3)

where n j is the rate of seismicity in the jth grid cell, \(\hat{n}_{i}\) is the smoothed rate of seismicity in the ith cell, c is the correlation distance accounting for location uncertainties, and Δ ij is the distance between the ith and jth cells. The sum is taken over cells j within a distance of 3c of cell i. At present, there are no complete rules or guidelines in determining the appropriate c value. For very small c value, the resulting smoothed seismicity will be concentrated around the epicenters of the recorded earthquakes, while larger c value spreads out the seismicity and does not reflect the true spatial variation of seismicity as assumed in zone-free approach. Therefore, we assign c value as 0.5° according to Frankel (1995).

2.3 Maximum Magnitude and Focal Depths of Earthquakes

The maximum expected earthquake magnitude (M max) for every seismic source zone is computed using historical and recent earthquake data, as well as the largest magnitude earthquakes that occurred in nearby seismic zones with similar tectonic settings following the algorithm of Kijko and Singh (2011) by taking into account the uncertainty in the earthquake magnitude determination. The maximum expected magnitude of earthquake along its uncertainty for all areal source zones is given in Table 4. It is found that most of the zones along the central Himalaya are with high M max values and reach as high as 8.6, while the dominant portion in the interior of Tibetan plateau shows low (as low as 6.5)-to-medium value. The computed M max value varies between 8.6 and 6.5. For smoothed-gridded seismicity, the same range of M max is used.

Table 4 Computed seismicity parameters for areal source zones

Accurate determination of earthquake locations for the study area is usually difficult because of the insufficient station coverage. Earthquakes listed in the ISC catalog appear to be distributed throughout the thickness of the Tibetan and Indian crust range between 0 and 90 km. However, recent studies have shown that most of the earthquakes in Himalaya are concentrated at depths ranging between 10 and 20 km (Pandey et al. 1999; Priestley et al. 2008). In addition, the average focal depth for the Gorkha earthquake sequence is ~15 km estimated using waveforms from a nearby broadband seismic array (Bai et al. 2016). In Tibetan plateau, the focal depths for most of the events range from 0 to 40 km (Bai et al. 2012) and 0 to 50 km (Sloan et al. 2011). In particular, moderate-to-large earthquakes are mostly shallower than 30 km in the northern Tibet and shallower than 20 km in the central and southern Tibet (Bai et al. 2017). The earthquakes occurred in Tien Shan are mostly limited within the depths ranging between 0 and 30 km (Khan et al. 2017; Sloan et al. 2011). In this analysis, the average focal depth (e.g., 15 km in Himalaya and Tien Shan) of each seismic belt is assigned as a constant depth for all source models.

2.4 Seismicity Model and Parameters

To calculate earthquake magnitude exceedance rates, a Modified Gutenberg–Richter (MGR)–Poisson model is selected as a seismicity model for all seismogenetic sources except characteristic earthquakes. Herein, the seismicity is expressed as (Cornell and Van Marke 1969)

$$\lambda \left( M \right) = \lambda_{0} \frac{{\exp \left( { - \beta M} \right) - \exp \left( { - \beta M_{\hbox{max} } } \right)}}{{\exp \left( { - \beta M_{0} } \right) - \exp \left( { - \beta M_{\hbox{max} } } \right)}},\quad M_{0} \le M \le M_{\hbox{max} } ,$$
(4)

where λ 0 is the rate of earthquake activity with threshold magnitude M 0 (here, M 0 = 4.0), β is a parameter equivalent to the b value (β = 2.303 × b), and M max is the maximum expected magnitude for the sources. The uncertainty of both M max and β is taken into account in computing the \(\lambda \left( M \right)\) value.

Traditionally, the seismicity parameters are calculated based on the minimum magnitude of completeness (M c). The evaluation of M c is conducted using the Maximum Curvature (MAXC) method (Wiemer and Wyss 2000). The estimated M c value is ~M w 4.0 and this value is taken as threshold magnitude (M 0), because earthquakes smaller than M w 4.0 are not likely to cause any damage to infrastructure from engineering point of view. In this study, the essential seismicity parameters are computed taking into account the incompleteness of the instrumental catalog and the historical earthquake records since the tenth century. The Gutenberg–Richter b value and its standard error for every seismic belt and areal source zone are computed using the joint likelihood function of Kijko et al. (2016) by accounting the incomplete historical and complete parts of the catalog. This newly developed method takes into account both incompleteness of the seismic catalogs and temporal variation of seismicity. This procedure applies a Bayesian Poisson–gamma distribution as a model of earthquake occurrence in time and a Bayesian Exponential–gamma distribution as a model of earthquake magnitude. We create the Poisson–gamma compound distribution to obtain the probability for the observed n seismic events, within a time interval t, for temporal varying seismic activity λ (Benjamin 1968), as follows:

$$P_{n} \left( {\bar{\lambda },t,v_{\lambda } } \right) = \mathop \int \limits_{0}^{\infty } P_{n} \left( {\lambda , t} \right) f_{\Delta } \left( \lambda \right){\text{d}}\lambda = \frac{{\varGamma (n + q_{\lambda } )}}{{n!\varGamma (q_{\lambda } )}}\left( {\frac{{p_{\lambda } }}{{t + p_{\lambda } }}} \right)^{{q_{\lambda } }} \left( {\frac{t}{{t + p_{\lambda } }}} \right)^{n} ,$$
(5)

in which \(\varGamma (n)\) is the gamma function, \(p_{\lambda }\) and \(q_{\lambda }\) are the parameters of gamma distribution, such that \(p_{\lambda } = \bar{\lambda }/\sigma_{\lambda }^{2} ,\) \(q_{\lambda } = \bar{\lambda }^{2} /\sigma_{\lambda }^{2}\), and \(\bar{\lambda }\) denote the mean value of the activity rate λ.

The applied models make it possible to incorporate the uncertainty associated with randomly, time-varying seismicity. It is observed that the overall b value using this algorithm is rather lower than that of traditionally used algorithm of Aki (1965). For Himalaya, Pamir–Hindu Kush, Tien Shan and Tibet regions, the estimates of β value are of 1.50, 1.53, 1.49, and 1.48, respectively. In Table 4, the β values and their standard errors are given for all the areal source zones. Worth mentioning that in smooth-gridded seismicity model, the exact same β value is assigned to all grids within the each seismic belt. The average annual rate of seismicity (λ 0) at threshold magnitude for each seismic source zone is computed using the method of likelihood function (Kijko et al. 2016) assuming the Poisson–gamma compound distribution of the earthquake magnitude–frequency as

$$L_{i} \left( {\bar{\lambda }|n_{i} ,t_{i} } \right) = \left( { \bar{\lambda }^{\left( i \right)} t_{i} + q_{\lambda } } \right)^{{ - q_{\lambda } }} \left( {\frac{{\bar{\lambda }^{\left( i \right)} t_{i} }}{{\bar{\lambda }^{\left( i \right)} t_{i} + q_{\lambda } }}} \right)^{{n_{i} }} ,$$
(6)

where \(\bar{\lambda }^{\left( i \right)}\) is the mean earthquake activity rate of the ith sub-catalog, corresponding to the magnitude level of completeness \(m_{\hbox{min} }^{(i)}\).

2.5 Selection of Ground Motion Prediction Equations

Attenuation relations, i.e., the ground motion prediction equations (GMPEs), are required for the estimation of ground motion parameters (e.g., PGA and SA). These can be used to calculate the probability distribution of ground motion intensity as a function of various variables such as earthquake magnitude, source-to-site distance, and soil condition. These relationships are usually obtained empirically using a statistical regression method on a particular set of strong-motion parameters.

The seismotectonic setting of the Himalayan–Tibetan region is very complex. This study area is a tectonically active interplate region (Nath and Thingbaijam 2012), where the earthquake-occurrence environment is not simple and rather even compound in combination. According to the study of Nath and Thingbaijam (2012), the Himalaya orogenic belt, Tien Shan and Tibet regions are belonged to the active shallow crustal regime (ACR) with many subduction interface zone (SIZ) earthquakes across the Himalaya. Studies on focal depths and focal mechanisms of earthquakes (Bai et al. 2017) stated that earthquake occurrence in this region is fairly associated with subduction interface. Most of the zones in Pamir–Hindu Kush region are regarded as SIZ. Moreover, some previous seismic hazard studies assumed that the Himalaya and its surrounding region belong to either SIZ (e.g., Pandey et al. 2002) or ACR (e.g., Ram and Wang 2013), or both (e.g., Chaulagain et al. 2015). To avoid the controversy, in this study, we consider that Himalaya and Pamir–Hindu Kush regions are characterized with both ACR and SZI environments, while the Tibetan plateau and Tien Shan are belonged to ACR. For Tibetan–Himalayan region, there is no adequate number of strong-motion data and consequently no particular GMPEs yet and even not enough to check the validity and compatibility of the GMPEs from the other regions.

The usual practice to resolve the problem is to select the GMPEs developed in other regions with similar seismotectonic settings which could represent adequately the ground motion in this region. Therefore, following the criteria (e.g., tectonic regime, journal types, data sets, frequency range of the model, functional form, regression coefficients, etc.) of Cotton et al. (2006), and performing a number of trellis plots [i.e., predicted spectral acceleration (PSA) versus magnitude, PSA versus distance, and PSA versus structural periods] according to Stewart et al. (2013), a total of eight GMPEs are selected in this analysis. Four of them are for the ACR and the other four for the SZI. For ACR, the selected models include two Next-Generation Attenuation (NGA-west2) models developed by Abrahamson et al. (2014) and Chiou and Youngs (2014) using the global data sets, one model applicable for Europe and Middle East by Ambraseys et al. (2005), and Zhao et al. (2006) mostly using Japanese data. Due to scarcity of strong-motion records in Himalayan–Tibetan region, the weight assessment and, thereby, the ranking of different GMPEs are not computed in this study. Many of the seismic hazard studies for Asian countries (e.g., Chaulagain et al. 2015; Kolathayar and Sitharam 2012; Nath and Thingbaijam 2012; Ornthammarath et al. 2011) used equal weight for all models due to insufficient observed data. Therefore, equal probabilistic weights (1/4) are assigned to each of these four models. On the other hand, for SIZ with the large distance, we employed three models from global data sets, namely, the BC Hydro 2016 known as Abrahamson et al. (2016), Atkinson and Boore (2003), and the Youngs et al. (1997), and another one from local data sets is of Zhao et al. (2006). Similar to ACR, equal probabilities (1/4) to each of the models are used.

Compatibility among these selected GMPEs is checked when applied in the CRISIS2015 (Ordaz et al. 2015) program. This program allows the use of different distance definitions (e.g., Joyner–Boore, rupture, and hypocentral distance) for different GMPEs. Moreover, the focal depth is taken into account as an independent variable. The assessment is carried out assuming rock-site condition (where the average V s30 is equal to 760 m/s). However, for Himalaya and Pamir–Hindu Kush regions, we combine two sets of GMPEs assigning equal weighting factor (0.5) in the logic tree. Besides, GMPEs for ACR are employed for the rest part of the study area. In the similar way, GMPEs are employed for all three source models.

2.6 Logic Tree Structure

Uncertainties are usually associated with the models used in PSHA. The use of logic tree allows capturing the epistemic uncertainties in different input models (Bommer et al. 2005; Sabetta et al. 2005) by including alternative models in the hazard computation. In this study, three types of seismogenic source models and two sets of GMPEs (each consisted of four different GMPEs) are considered. Equal weights (1/3) are assigned to each of the source models, as there is no substantial observed data for ranking the weight of the models. For linear source model, similarly, equal weights (1/2) are used for Characteristic-earthquake and Gutenberg–Richter seismicity models, because there is no specific reason to prefer one over another. Figure 3 shows the representation of the standard logic tree framework used for linear source model in the Himalaya, and Pamir–Hindu Kush regions.

Fig. 3
figure 3

Logic tree structure for active linear seismogenic source model

2.7 Seismic Hazard Estimation

To predict the seismic hazards for future earthquakes, the seismic hazard module CRISIS2015 (Ordaz et al. 2015) is used because of its high efficiency of calculation and flexibility in model selection (Danciu et al. 2010). For areal and linear seismic sources, CRISIS computes the seismic hazard based on the standard Cornell approach (Cornell 1968). It assumes that within a source, seismicity is evenly distributed by unit area and all points could be a potential earthquake focus. To obtain accurate hazard values, CRISIS performs a spatial integration by subdividing the original sources. Once a source is subdivided into sub-sources, the acceleration exceedance rate due to the ith single source can be computed using the equation:

$$\nu_{i(a)} = \mathop \sum \limits_{j} W_{ij} \mathop \int \limits_{{M_{0} }}^{{M_{\hbox{max} } }} \left( { - \frac{{{\text{d}}\lambda \left( M \right)}}{{{\text{d}}M}}} \right)\Pr \left( {A > a|M, R_{ij} } \right){\text{d}}M,$$
(7)

where M 0 and M max are the threshold and maximum magnitudes, respectively; \(\Pr \left( {A > a|M, R_{ij} } \right)\) is the probability that the acceleration exceeds a certain value a at the site across distance R ij for an earthquake with magnitude M, and W ij is the weight of each sub-element. The above expression also assumes that ∑W ij  = 1. The total average acceleration exceedance rate at the site due to the contributions of all the sources, N, within 300 km is calculated as

$$\nu (a) = \mathop \sum \limits_{n = 1}^{N} \mathop \int \limits_{{M_{0} }}^{{M_{\hbox{max} } }} \left( { - \frac{{{\text{d}}\lambda \left( M \right)}}{{{\text{d}}M}}} \right)\Pr \left( {A > a|M, R_{ij} } \right){\text{d}}M.$$
(8)

For smoothed-gridded seismicity, once the seismicity parameters and the GMPEs are known for each of the nodes of the grids, the hazard at a given node is calculated based on the effects of the totality of the nodes and their corresponding distances to the site of interest, as shown in Eqs. (7) and (8).

3 Results and Discussion

For seismic hazard prediction, the entire study region is divided into small grids with the size of 0.1° × 0.1°. At the center of each grid cell, the PGA and SA values are assessed based on a referenced bedrock-site condition. The contribution to hazard value of all the potential sources within the radial distance of 300 km from the center of each gird cell is taken into account. This hazard estimation is performed by disaggregating the hypocentral distance into small intervals of 1 km and the magnitude range (between the minimum and maximum magnitudes) into small incremental values of 0.1. This analysis produces different seismic hazard maps of PGA and SA at short (0.2 s) and long (1.0 s) natural periods at 5% critical damping factor for 10 and 2% probabilities of exceedance in 50 years, corresponding to 475 and 2475 year return periods, respectively. Figure 4a, b depicts the hazard maps of PGA value for the probabilities of exceedance of 10 and 2%, respectively, in 50 years. The spatial variations of SA values obtained at 0.2 and 1.0 s natural periods for 2% probability of exceedance are presented in Fig. 5a, b, respectively.

Fig. 4
figure 4

Spatial variation of PGA values for a 10% probability of exceedance and b 2% probability of exceedance in 50 years

Fig. 5
figure 5

Hazard maps of spectral acceleration at a 0.2 s natural period and b 1.0 s natural period for 2% probability of exceedance in 50 years

The predicted hazard parameters in terms of both PGA and SA show a non-uniform spatial variation. In addition, a significant hazard-level variation is observed from 10% probability of exceedance to 2% probability of exceedance. Due to the heterogeneous nature of the seismotectonic settings of the study region, the produced seismic hazard maps show many low-and-high hazardous patches. The hazard distribution pattern thus nearly follows the background seismicity distribution over the study area and the areas of highest potential hazard are likely distributed along the Indian–Eurasian plate collision zones. However, for 10% probability of exceedance at 50 years, the minimum and maximum PGA values are found as low as 0.04 g and as high as 0.87 g, respectively (Fig. 4a). The Pamir–Hindu Kush region, where both shallow and intermediate seismic activities are high (Bai and Zhang 2015) and many earthquakes of M w ≥ 7 have occurred during the last century, is likely to be experienced the highest seismic hazard potential which is in the order of 0.67–0.87 g.

Along the entire Himalayan orogen specially the central segment, the resulting seismic hazard value is comparatively high (as high as 0.67 g) from existing studies and does agree with the results of some geodetic and paleoseismic studies (e.g., Ader et al. 2012; Avouac et al. 2015; Bilham 2015; Bollinger et al. 2016; Rajendran et al. 2015; Sapkota et al. 2013). All of these studies expected high seismic hazard potential in this region which has also raised the concerns of high seismic risk. It is essential to note that the MFT is the most potential source for future devastating earthquakes in Himalaya orogen, but the predicted hazard value is not as high as in central segment. In PSHA, hazards are computed considering the influence from earthquakes with possible magnitudes at all significant distances from the site of target. Besides, the level of hazard also depends on the rate of earthquake occurrence. Along the MFT, the recurrence interval of the great earthquake is ~700 years (Rajendran et al. 2015; Sapkota et al. 2013), whereas, in central Himalaya, the rate of seismicity is very high, although most of the earthquakes are medium to small in size. Therefore, these might be the plausible reasons for having comparatively lower values along MFT.

Hazard distribution pattern for Tien Shan region nearly agrees with that of the ground motion parameter zonation map of China (Gao 2003). The highest hazard value for this region is in the order of 0.60–0.67 g and is supported by the very active crustal deformation with high rate of seismicity, which is also evident from the historical and instrumental earthquake records (Gao 2003; Kaban and Yuanda 2014). Because of very low rate of seismicity and rigidness of the crust, the Tarim basin and Inner Mongolia is clearly identified as a low seismic hazardous region whereas the hazard value down as low as 0.04 g. The interior of the Tibetan plateau excluding the isolated zone near to Lhasa is likely to be exposed in medium-to-low hazard value, whereas medium-to-high hazard is found at the plateau flank, particularly in the northwest plateau and in the eastern Himalayan syntaxis.

Predicted seismic hazard value increases significantly with decreasing the probability of exceedance. For example, at 2% probability of exceedance in 50 years, the expected level of hazard value is found to be a factor of 2 higher than that of 10% probability of exceedance in 50 years (Fig. 4b). In Fig. 4b, the estimates of minimum and maximum PGA values are observed as 0.07 and 1.50 g, respectively. The seismic hazard maps for SA values (Fig. 5a, b), which are produced as per the requirement of the modern building design provisions, show fairly similar distribution patterns as that of PGA hazard maps. The levels of SA value at 0.2 and 1.0 s natural periods for 2% probability of exceedance in 50 years are obtained in the order of 0.10–2.80 g and 0.06–1.10 g, respectively.

For making comparison, we also conducted PSHA using the short duration earthquake catalog since 1960 and estimating the Gutenberg–Richter parameter b value by the method of Aki (1965). Figure 6a shows the illustration of PGA value for 10% probability of exceedance from earthquake catalog since 1960. It is observed from the comparison (Figs. 4a, 6a) that the newly developed method of Kijko et al. (2016) which accounts the wide span of earthquake history (i.e., incomplete historical earthquakes records and complete instrumental seismic-event catalogs); uncertainties of the rate of seismicity, b value and M max provide better results in terms of both hazard level and its spatial distribution. In Fig. 4a, the maximum hazard value is up to 0.87 g, which is about 32% higher than that of 0.67 g, as shown in Fig. 6a. The result obtained by applying the new approach is comparatively higher and rather flat than that of current seismic hazard maps for this region (e.g., Fig. 6b, c).

Fig. 6
figure 6

PGA hazard maps for 10% probability of exceedance in 50 years: a in this study computed from instrumental earthquake catalog; b Global Seismic Hazard Assessment Program (Zhang et al. 1999); and c Seismic Ground Motion Parameter Zonation Map of China (GB 18306-2001 2001)

The hazard maps of this analysis show high values at some regions, where current existing hazard maps showed very low value. For example, near the location of 12 May 2008, M w 7.9 Wenchuan earthquake, the ground motion parameter is significantly underestimated in the China ground motion zoning map (Liu et al. 2013) and most of the area is placed in zone II with very low zoning factor of as high as 0.15 g (Gao 2003; Fig. 6c). From Fig. 4a, we can find that at the location of the Wenchuan earthquake, the resulting hazard value of this analysis is higher (>0.43 g) than that predicted by China seismic zoning hazard maps (Fig. 6c), but consistent with the result of Liu et al. (2013) who used the historical intensity catalog. In the context of regional seismic hazard maps, the level of maximum seismic hazard value for 10% probability of exceedance in this analysis is found to be increased by ~70 and 52% in Pamir–Hindu Kush region, and 34 and 20% in central Himalayan region from the results of Bhatia et al. (1999) and Zhang et al. (1999), respectively.

In addition, another comparison is made to illustrate the effect of different source models to the predicted hazard value. For convenience, a small part of the study area (Nepal) is considered, and illustrated on the basis of the result obtained using the conventional PSHA method. It seems that the contribution of different source models to the seismic hazard value is not same at every point and thereby varied spatially and temporarily. The deviation of hazard value with respect to the various source models is described by computing the seismic hazard curves for different cities of Nepal (Fig. 7). In the southern and central-western Nepal (e.g., nearby the cities of Bara, and Rukum), linear source contributes much higher seismic hazards than those of other two source models and consequently augments the levels of combined hazard value. For the central portion of Nepal around the Kathmandu city, the contribution is quite similar for all the models. In the southwest region (near Kailali city), the highest contribution is from smoothed-gridded seismicity. Besides, the level of maximum hazard value is found to be as high as 0.54, 0.58, and 0.66 g for areal, linear, and smoothed-gridded source models, respectively, for 10% probability of exceedance. It is noted that the hazard values for areal source model are nearly similar to the previous hazard maps of Nepal, while the maximum values are in the order of 0.50–0.54, 0.52–0.62, 0.45–0.50, and 0.48–0.56 g for this study, shown by Ram and Wang (2013), Chaulagain et al. (2015), and Zhang et al. (1999), respectively. However, the combined hazard value in this case is reasonably higher than that of the value for traditional areal source model and consistent with the results of similar seismogenic source models in India (e.g., Kolathayar and Sitharam 2012; Nath and Thingbaijam 2012). The incorporation of the past seismicity data (e.g., historical catalog and paleoseismic results) and instrumental catalog in the combination of multiple source models provides more comprehensive and explainable constraints to the predicted values of the ground motions.

Fig. 7
figure 7

a Seismic hazard (PGA) curves for different cites of Nepal; b location map of these cities

4 Conclusion

We incorporated the latest data sets, multiple seismogenic source considerations, updated GMPEs, well-constrained focal depth, and newly developed methodology (Kijko et al. 2016) for the estimate of seismicity parameters for Himalayan–Tibetan region. All the uncertainties associated with the essential seismicity parameters (e.g., β, γ, and M max) and the epistemic uncertainties of different models are taken into account. An unified declustered instrumental earthquake catalog and historical earthquake catalog since the tenth century are utilized in this hazard assessment. A total of 63 areal source zones, 228 active linear sources, and thousands of smoothed-gridded sources are identified and employed using a logic tree structure. The seismicity parameters for each of the seismic belts and areal source zones are computed by taking into account the incompleteness of the catalog, the uncertainty in the earthquake magnitude determination, as well as the uncertainty associated with the applied earthquake-occurrence models. To get more precise estimates of the earthquake parameters, different sub-sets of the catalog with different levels of completeness are assessed. The modified Gutenberg–Richter and Characteristic-earthquake magnitude–frequency models and two sets of GMPEs (four GMPEs for the active shallow crust and four GMPEs for the subduction interface) are used.

The hazard distribution is computed in terms of PGA for 10 and 2% probabilities of exceedance on bedrock level in 50 years. The spectral acceleration distribution maps at short and long natural periods (i.e., 0.2 and 1.0 s) are also produced for 2% probability of exceedance. To illustrate the resulting hazard value, two different comparisons are made: one is between the results from the newly adopted and conventional methods; and the other is between the inter-seismogenic source models. The present result depicts that the level of hazard value is rather higher than that of the previous hazard maps. The region of low background seismicity shows relatively higher value than the previous studies. The resulting hazard maps would provide new insights in improving the national building design provisions for Himalayan–Tibetan region. For better constraints of seismogenic sources, detail paleoseismic and geodetic studies are necessary. Strong ground motion records are essential in developing attenuation equations to make a better choice of GMPEs for this complex tectonic regime.