1 Introduction

Uncertainty, in different forms, is an inevitable part of engineering analysis. Uncertainties are responsible for seismic risk evaluation of structural systems and installations. The uncertainties are categorized as either aleatory or epistemic. Aleatory uncertainty, which is not reducible, is defined randomly or as a natural congenital variable. Epistemic uncertainty stems from a lack of knowledge or incomplete information and, unlike aleatory uncertainty, is reducible. In fact, epistemic uncertainty can be reduced by increasing data and information. With all this, distinguishing between aleatory and epistemic uncertainty is not a straightforward task and from a scientific point of view, uncertainty type identification is the main capability of the analyst in reducing of uncertainties (Der Kiureghian and Ditlevsen 2009). Epistemic uncertainties associated with probability seismic hazard analysis (PSHA) can be classified into two categories: (i) earthquake rupture prediction and (ii) ground motion characteristics (Bradley 2009).

In the first category where uncertainties are related to earthquake prediction, estimating the occurrence frequency of earthquake for a fault/faults of a region and its magnitude is very difficult and dependent on assumptions (Murray and Segall 2002). In conventional PSHA, earthquake occurrence is considered randomly with respect to time. Assuming random occurrence, the probability of an earthquake in the specified time intervals is constant and does not change over time which is inconsistent with elastic rebound theory (Reid 1911). The uncertainty of recurrence model can be categorized into the uncertainty in magnitude and time of occurrence (Kramer 1996). In determining the magnitude of an earthquake, Gutenberg and Richter (1944) obtained a linear relationship between the magnitude and total number of earthquakes in any given region and the time period of at least that magnitude, which is known as Gutenberg–Richter law. Then this relationship was transformed into the modified Gutenberg–Richter law by controlling the magnitude of earthquakes to specific values (McGuire and Arabasz 1990). Many studies have been carried out in this subject area to reduce the uncertainties. The main focus of these studies was frequently on determination of maximum magnitudes, b-value prediction and illustration of seismicity of the study area (Smit and Kijko 2016; Kijko et al. 2016; Beirlant et al. 2018). In the case of time uncertainty, several recursive models have been proposed by researchers. For this case, Markov’s recurrence model presented by Knopoff (1971) using Markov chain can be instanced. In addition, Patwardhan et al. (1980) introduced the pseudo-Markov model. Most of time-dependent recurrence models are based on the renewal process. In these models, the recurrence interval of an earthquake can have different distributions and the probability of an earthquake within a given interval depends on the elapsed time since the last event. Hagiwara (1974), Utsu (1984) and Nishenko and Buland (1987) utilized Weibull, Gamma and log-normal distributions in recursive models, respectively.

The second category of uncertainty is related to modeling ground motion uncertainty. For a specific earthquake, ground motion amplitude is described by GMMs, which is a function of earthquake magnitude, source-to-site distance, site conditions and occasionally some other parameters (Bommer et al. 2010; Wang 2011; Tsang et al. 2011; Yaghmaei-Sabegh 2012; Stewart et al. 2015; Douglas 2017). GMMs are obtained using statistical regression on data recorded at the stations. Despite the expansion of the global seismic network, there is no adequate data on near field and large magnitude earthquakes which necessitate the presentation of new models according to updated data (Beven et al. 2018). Due to the assumptions and simplifications in mathematical modeling, GMMs contain epistemic uncertainty. The uncertainty can be observed in the model’s inaccurate form, selection of particular data, error in input data, and statistical error in the estimation of constants (Foulser-Piggott 2014). Youngs (2006) and Stafford et al. (2009) showed that the uncertainty caused by the inaccurate form and the selection of particular data can be calculated by the logic tree. Using the logic tree, several predictions of seismic intensity measurements are made through a set of GMMs, for a given scenario. It is also shown that although the use of a set of GMMs could estimate epistemic uncertainties (Yuongs 2006; Stafford et al. 2009), yet part of the variables remain as epistemic uncertainty (Foulser-Piggott 2014). Foulser-Piggott (2014) provides a quantitative approach for modeling of the input data uncertainty in the median prediction of GMMs by adopting the MC approach for the three variables: moment magnitude (\( M_{w} \)), shear wave velocity at a depth of 30 meters (\( V_{{S_{30} }} \)), and the closest distance to fault rupture plane (\( R_{\text{rup}} \)).

Source-to-site distance measurement is difficult in areas with low seismicity or where the dimensions and geometry of the faults are unclear. In these regions such as Australia, Peninsular Malaysia, and the island of Sri Lanka, using predefined areal source zones is still the standard of practice. The number of the historical earthquake and seismicity data are sparse; therefore, the amount of information available is not enough to have individual faults to be modeled as precisely in a PSHA (Lam et al. 2016; Abrahamson 2006). Therefore, in this paper, a method has been developed to quantify the distance uncertainties where the geometry of the faults is unknown. A simple algorithm based on the MC simulation method which quantifies the uncertainties in distance metric (\( R_{\text{epi}} \), \( R_{\text{rup}} \), and \( R_{jb} \)) and geometric characteristics of seismic sources is proposed and errors in results of the GMMs from these uncertainties have been calculated. Briefly, this paper main focus is the distance metrics uncertainty measurement and its quantification. Various factors engender source-to-site distance measurement uncertainty, which include the following:

  1. 1.

    The difference in fault rupture geometry.

  2. 2.

    When using a GMM, depending on the size of the project site, a question may arise about the selection of a point from which the distance to the source ought to be measured. This is more common in large-scale projects located at a close distance to a fault. For instance, the Oakland bridge project in San Francisco is several kilometers long. Hence, the earthquake waves along the course of this project will have different characteristics at the beginning and the end of the bridge (Moss 2009).

  3. 3.

    Different definitions for distance measurement are used in GMMs. Among the most common definitions, the closest horizontal distance to the vertical projection of fault rupture (\( R_{jb} \)), the closest distance to fault rupture plane (\( R_{\text{rup}} \)), the closest distance to the epicenter (\( R_{\text{epi}} \)), and the distance to the hypocenter (\( R_{\text{hypo}} \)) can be mentioned (Abrahamson and Shedlock 1997). There is also no agreement on the most appropriate definition of distance measurement, which indicates uncertainty about the comparative accuracy of these definitions (Moss 2009).

MC simulation method as a powerful uncertainty modeling tool is a computational algorithm that, by the random selection of input parameters, solves the problem and stores the results. Ultimately, using the stored outcomes, the overall result is reported by the method. Production of random numbers, based on the probability distribution assigned for integrating various uncertainties, is one of the main tasks of the MC method (Yazdani et al. 2012). The main advantages of the MC method are its ability to control the uncertainty of the parameters involved in the PSHA model and to clearly identify the effects of each parameter in the final result (Musson 1999). Also, this method has a simple concept and has been used by several researchers to evaluate PSHA results (Yazdani et al. 2017; Pavel and Vacareanu 2017; Assatourians and Atkinson 2013).

In the following, a simple MC-based simulation method is presented to evaluate the distance measurement uncertainties in Sect. 2. To quantify the uncertainties of the defined input data and the assumptions made, including fault geometry and the considered sites, we propounded an example in Sect. 3 of the paper and the GMMs that used in this paper are introduced in this section. Section 4 consists of two subsections; the outcomes of three sites and the standard deviation maps are presented in the first and second subsections, respectively. Simulation results of the sites are presented for small, medium and large magnitude earthquake. In Sect. 5, we utilized the simulated data of Sect. 4 and investigated their impacts upon the GMMs. Section 5 consists of two subsections; the output results of different GMMs are obtained for each point and subsequently, their distributions are depicted for the three sites. Finally, Sect. 6 summarizes the outcomes of the paper.

2 Source-to-site distance simulation algorithm

The geometric of fault is an important factor for determining the source-to-site distance that is usually not known. The uncertainty of the fault position and the determination of the input parameters of the GMM are adopted as the epistemic uncertainty which influences the results of PSHA analysis. Among the effective geometrical factors which influence measuring the source-to-site distance with respect to the definition of distance, the following factors can be mentioned:

  1. 1.

    Start point of rupture on the fault plane (hypocenter location).

  2. 2.

    Length and width of fault rupture.

  3. 3.

    Rupture depth.

In the proposed algorithm, uncertainty of the various definitions of source-to-site distance is simulated. The detail of the algorithm used in this paper is shown in Fig. 1. The proposed algorithm is very simple and consists of several parallel loops which are considered the uncertainties of the mentioned factors to determine of distance metric.

Fig. 1
figure 1

Source-to-site distance simulation process

In the first loop, uncertainty length and width of fault are considered. These two parameters are related to the moment magnitude scale by the Wells and Coppersmith (1994) model (see Eqs. 1, 2). Therefore, several numbers of the length and width are generated in this loop. The generated length and width have a log-normal distribution with standard deviations 0.16 and 0.26, respectively (\( \sigma_{{\log_{10} L}} = 0.16,\quad \sigma_{{\log_{10} W}} = 0.26 \).

$$ \log_{10} W = - 1.01 + 0.32M_{w} $$
(1)
$$ \log_{10} L = - 2.44 + 0.59M_{w} $$
(2)

where L and W are length and width of the rupture fault, respectively, and \( M_{w} \) is moment magnitude.

In the second loop, uncertainty quantification of the fault dip (\( \delta \) is considered. In this path, the uncertainty of fault dip results proposed by Kilb and Hardebeck (2006), Hayes and Wald (2009) and Foulser-Piggott (2014) was used, and dip variation was considered 10°. In this loop, the random desired number of fault dip angle with the same probability is generated. For instance, if the fault dip is 70°, the angles which are generated in this loop are between 60° and 80° (i.e., 70° ± 10°) with the same probability.

One of the effective factors in the estimation of source-to-site distance \( R_{\text{epi}} \) and \( R_{\text{hyp}} \) is the hypocenter position on the fault plane. In order to determine \( R_{\text{hyp}} \), the realization of hypocenter proposed by Mai et al. (2005) is utilized. Mai et al. (2005) represented that the relative position of hypocenter along the fault width (\( {\text{hyp }}Z \)) has a normal distribution with mean of 0.5 and standard deviation of 0.23, and the relative position of hypocenter along the fault length (\( {\text{hyp }}X \)) follows a Weibull distribution with a scale parameter of 0.626 and shape parameter of 3.921. Regarding the length and width of the fault, the desired number of hypocenter position with the aforementioned distribution is generated. In addition to the above realizations, the depth of hypocenter (\( Z_{\text{hyp}} \)) should be determined. The depth of hypocenter according to Eq. (3), which is presented by Scherbaum et al. (2004), is a function of the moment magnitude and its standard deviation is equal to 4 km. The minimum and maximum values of hypocenter depth (\( Z_{\text{hyp}} \)) are considered 4 and 16 km, respectively (Scherbaum et al. 2004).

$$ Z_{\text{hyp}} = 5.63 + 0.68M_{w} $$
(3)

With regarding standard deviation, the upper and lower limit, the desired number of \( Z_{\text{hyp}} \) is generated with the same probability.

In order to identify the realizations mentioned in the previous loop, we can also calculate the other parameters of the fault, including depth-to-top of rupture (\( Z_{\text{tor}} \)). Based on the modified Kaklamanos et al., (2011) relation, \( Z_{\text{tor}} \) can be calculated through Eq. (4):

$$ Z_{\text{tor}} = \hbox{max} ((Z_{\text{hyp}} - {\text{hyp}}Z. \, W. \, \sin (\delta )),0) $$
(4)

After realizations of the fault geometry, we can calculate different definitions of source-to-site distance metrics.

In sum, the simulation algorithm for each moment magnitude can be carried out in the following steps

  1. 1.

    Loop 1: n number of length and width which is generated by Eqs. 1 and 2 (\( w_{n \times 1} \quad {\text{and}}\quad L_{n \times 1} \)).

  2. 2.

    Loop 2: n number of the dip angle with uniform probability is generated (\( \delta_{n \times 1} \)).

  3. 3.

    Loop 3: n number of hypZ and hypX regarding the probability distribution of each hypocenter is generated (\( \text{hyp}X_{n \times 1} \quad {\text{and}}\quad \text{hyp}Z_{n \times 1} \)).

  4. 4.

    Loop 4: n number of \( Z_{\text{hyp}} \) and \( Z_{\text{tor}} \) regarding existing distribution is generated (\( Z_{n \times 1} \), \( Z_{{{\text{tor}}_{n \times 1} }} \)).

By the combination of four parallel loops, n number of the fault geometry is simulated. The process of simulation is shown in Fig. 1.

3 Example set up

To quantify the epistemic uncertainty of GMM due to the uncertainty of source-to-site distance, a strike-slip fault with the length of 60 km is considered according to Fig. 2. It is assumed that the selected sites in this example have been located on the bedrock; hence not soil response is relevant. The results of the MC simulation will be presented in two parts. In the first part, the simulation results of distance determinations are presented and discussed and in the second part, simulated data in the first part is used as GMM input to determine the uncertainties in the GMM outputs.

Fig. 2
figure 2

Strike slip fault and sites configuration

Nine GMMs have been used in this paper: the four NGA08 GMMs, Abrahamson and Silva (2008), Boore and Atkinson (2008), Campbell and Bozorgnia (2008), and Chiou and Youngs (2008), in which they are abbreviated as AS08, BA08, CB08, and CY08, respectively. Moreover, three GMMs: Danciu and Tselentis (2007), Nowroozi (2005), Ambraseys et al. (2005), which are based on epicenter distance and abbreviated as DT07, N05, ADSS05 in this paper, are used. Finally, two event-based GMMs named isotropic (Iso) and non-isotropic (Non-iso) are used in the analysis. The main characteristics of Iso and Non-iso GMMs are that: (i) They are a random function that means they have an event-specific parameter. The GMMs are fitted to a given event based on corresponding records. (ii) The level of ground motion is not the same in all directions in the non-isotropic GMM. The general form of these GMMs that are used to nonlinear regression as follows (Raschke 2013):

$$ {\text{Ln}}({\text{PGA}}) = \theta_{0} - \theta_{1} \ln (R) - \theta_{2} R\quad {\text{and}}\quad R^{2} = r^{2} + h^{2} $$
(5)

where PGA is peak ground motion acceleration, h is hypocenter depth, and r is the horizontal distance between the site and the epicenter. \( \theta_{0} \), \( \theta_{1} \), \( \theta_{2} \) are the constant coefficients obtained from a nonlinear regression. To consider the non-isotropic conditions, R is calculated utilizing Eq. 6 (Raschke 2013):

$$ R = \sqrt {{\raise0.7ex\hbox{${r^{2} }$} \!\mathord{\left/ {\vphantom {{r^{2} } {d_{\text{unit}}^{2} \left( \varphi \right)}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${d_{\text{unit}}^{2} \left( \varphi \right)}$}}} + h^{2} $$
(6)

The radius function dunit (φ) determines the unit-isoline with an azimuth \( \varphi \) of local polar coordinates. There are many types of unit-isoline that can be applied in the non-isotropic GMM which include area π equal to the unit circle of angle function. Different unit-isolines can be combined by the sum \( d_{\text{unit}}^{2} (\varphi ) = \sum {a_{i} } d_{{i,{\text{unit}}}}^{2} (\varphi ) \) with weighting \( 0 \le a_{i} \le 1 \) and \( \sum {a_{i} } = 1 \). Circle with eccentricity has been considered herein as the unit-isoline in the non-isotropic GMM which is shown in Fig. 3. If the circle with eccentricity is used, parameters of eccentricity (\( a_{c} \) and \( b_{c} \)), along with \( \theta_{0} \), \( \theta_{1} \), and \( \theta_{2} \), should be estimated. The main characteristics of the GMMs are listed in Table 1. The selected events to represent small, medium, and large earthquakes are Big Bear 02, California 2001 (Mw = 4.56), Sierra Madre, California 1991 (Mw = 5.61) and Loma Prieta, California 1989 (Mw = 6.93). For the three selected events, isotropic and non-isotropic GMMs parameters are presented in Tables 2 and 3, respectively, which obtained from the nonlinear regression.

Fig. 3
figure 3

(Reconstructed from Raschke 2013)

Eccentric circle unit-isoline

Table 1 Summary of GMMs used in this study
Table 2 Parameters of isotropic GMM
Table 3 Parameters of non-isotropic GMM

4 Simulated results

Using the proposed algorithm, 100,000 realizations for the fault geometry are generated. According to the fault geometry, the desired distance metrics will be calculated as well. Fault rupture could occur in all parts of the fault; however, to reduce the simulated variables only one position of the fault rupture is considered so that the center of the fault rupture length corresponds to the fault center. In fact, it is symmetric with regard to the fault center. The maximum rupture length due to limitations of the fault length is 60 km.

4.1 Simulation results for site S 1, S 2, and S 3

Using the simulated algorithm, the different source-to-sites distance metric distributions (\( R_{\text{epi}} \), \( R_{jb} \), \( R_{\text{rup}} \)) are calculated for three S1, S2, and S3 sites. To quantify and compare the variation of distance distribution, the coefficient of variation is used in accordance with Eq. (7) (Everitt 1998).

$$ C_{v} = \frac{\sigma }{\mu } $$
(7)

In this regard, \( C_{v} \) is the coefficient of variation, \( \sigma \) and \( \mu \) are the standard deviation and the mean of the simulated data, respectively.

Figure 4 illustrates the values of \( C_{v} \) for each magnitude in the semi-logarithmic scale for the three sites, S1, S2 and S3. The \( C_{v} \) values for \( R_{\text{epi}} \), \( R_{jb} \), and \( R_{\text{rup}} \) depend on the magnitude and the site position with respect to the fault. \( C_{v} \) coefficient of all distance measures will increase by increasing magnitude and for a given magnitude, distance increasing will reduce \( C_{v} \).

Fig. 4
figure 4

\( C_{v} \) of different source-to-site distance metrics (\( R_{jb} \), \( R_{\text{rup}} \), \( R_{\text{epi}} \)) at site \( S_{1} \), \( S_{2} \) and \( S_{3} \)

For small and medium magnitude levels, \( C_{v} \) coefficient of the three distances metrics is closed to each other in all of the sites. The trend of changes is the same in these sites, \( R_{jb} \) and \( R_{\text{rup}} \) included the least and the largest \( C_{v} \), respectively.

The \( C_{v} \) coefficient has a different trend in the large magnitude, and it is also sensitive to measure distance. For the S1 site, \( C_{v} \) coefficient of different source-to-site distance metrics is very close to each other, and by increasing the distance (i.e., in the S2 and S3 sites), the \( C_{v} \) coefficient takes large values for the \( R_{\text{epi}} \) distance measures. The mean values estimated for \( R_{jb} \) and \( R_{\text{rup}} \) in the S1 site (\( \mu \) in Eq. 7) are lower than the mean value of \( R_{\text{epi}} \), and due to this, their \( C_{v} \) coefficients are greater than \( R_{\text{epi}} \) so that by increasing the distances, the mean value of these three distance metrics approaches each other, and the \( C_{v} \) coefficient of the \( R_{\text{epi}} \) is larger than others.

As mentioned before, by increasing magnitude, \( C_{v} \) would be increased either; however, the rate of increasing is not the same for all distance definitions. The increasing rate of \( R_{\text{epi}} \) is higher than that of other definitions with increased magnitude. By increasing magnitude, the length and width of the fault rupture and the points that can be the epicenter will increase; due to this process, the variation coefficient of the rupture increases as well.

The increasing process of the \( C_{v} \) for \( R_{\text{rup}} \) in large magnitude in contrary to \( R_{\text{epi}} \) will reduce and the reason for this is reaching the rupture level to ground’s surface in large magnitude. In this case, most \( Z_{\text{tor}} \) values in the simulation algorithm are equal to zero and thus the \( C_{v} \) of \( R_{\text{rup}} \) is lower than other distance definitions. The increasing process of \( R_{jb} \) is also constant and \( C_{v} \) increases with increasing magnitude.

Figure 5 shows the distribution of simulated data for \( R_{jb} \), \( R_{\text{rup}} \), and \( R_{\text{epi}} \) considering different magnitudes at site S3. In this case, \( R_{jb} \) and \( R_{\text{epi}} \) are bounded to the minimum and maximum of simulated data, respectively. The range of simulated data depends on the number of parameters involved in the definition of distance type. In the proposed algorithm, \( R_{jb} \) depends only on the length, width and dip angle of the fault; however, in addition to the mentioned parameters for \( R_{jb} \), the epicenter location on the fault plane will additionally affect the measure of \( R_{\text{epi}} \). Uncertainties in the length, width, dip and epicenter location on the fault plane are considered for \( R_{\text{epi}} \). Also, length, width, dip and the \( Z_{\text{tor}} \) are the parameters that are considered in the calculation of \( R_{\text{rup}} \). By increasing magnitude, the length and width of the rupture will increase and the uncertainty associated with the geometry of the fault increases and the extent of variation in all three types of distances increases.

Fig. 5
figure 5

Distance simulation distribution (\( R_{jb} \),\( R_{\text{rup}} \), \( R_{\text{epi}} \)) in different magnitude at site \( S_{3} \)

4.2 Distance uncertainty map

The proposed algorithm is accomplished for all sites around the fault. The standard deviation values associated with three distance metrics (\( R_{\text{epi}} \), \( R_{\text{rup}} \), and \( R_{jb} \)) at the Mw6.93 event are illustrated in Fig. 6. For this case, the standard deviation trend depends on the distance definitions (\( R_{\text{epi}} \), \( R_{jb} \),\( R_{jb} \)) and the main assumption of this example as well. (The rupture has symmetric evolution with respect to fault plane.) Therefore, the standard deviation of simulated data is the lowest for all distance definitions in the middle of the fault.

Fig. 6
figure 6

The standard deviations contour of different source-to-site distance metrics for the Mw6.93 event

The maximum variation of the standard deviation is related to \( R_{\text{epi}} \); by changing the angle to the fault center, the standard deviation values are changed and the highest standard deviation is about 9 km, which is along the fault strike.

The variation of the standard deviation of two distances (\( R_{\text{rup}} \) and \( R_{jb} \)) is similar to another, and its lowest amount occurs in the middle of the fault. Because in the defined scenario, fault rupture is located in the middle of the fault and due to Eq. 2, the mean length of the rupture with the magnitude scale of 6.93 is about 45 km; thus, in the middle of fault region, the amount of \( R_{jb} \) is constant. The small standard deviation difference in the hanging wall and footwall is due to the 10° of measurement error on the fault dip that is considered randomly in the simulated algorithm. Because the dip variation has no effect on the footwall part, the standard deviation of \( R_{jb} \) is equal to zero. The maximum standard deviation for \( R_{jb} \) is calculated on the top and bottom of the fault strike. Because rupture level has reached the ground surface, contour variation in \( R_{\text{rup}} \) is similar to \( R_{jb} \).

5 Distance uncertainty effect on GMMs output

5.1 GMMs simulation results for site \( S_{1} \), \( S_{2} \) and \( S_{3} \)

In this section, simulated data in the previous section are used as input in GMMs. According to Eq. (7), the \( C_{v} \) for all GMMs is calculated for different magnitudes. Figure 7 shows the PGA distribution of the GMMs at the site \( S_{3} \). According to Table 4, Non-iso, Iso, ADSS05, N05, and DT07, which are dependent on \( R_{\text{epi}} \) have the same behavior and also the suffering of their changes is the same. Maximum and minimum of \( C_{v} \) are DT07 (\( C_{v} = 0.1266 \)) and non-isotropic (\( C_{v} = 0.0796 \)) GMMs at site \( S_{3} \), respectively. As shown in Fig. 7, the PGA distribution for AS08 and CY08 is different when compared with other GMMs. According to sensitivity analysis for the input values of the NGA08 GMMs, \( Z_{\text{tor}} \) uncertainty has a more effect on the outputs of these GMMs.Footnote 1 By considering a constant value for \( Z_{\text{tor}} \), the behavior of these GMMs is similar to each other. Figure 8 shows that after eliminating the \( Z_{\text{tor}} \) uncertainty in the simulation process, the distribution of these two GMMs (AS08 and CY08) is similar to other GMMs and \( C_{v} \) coefficient of these GMMs is reduced.

Fig. 7
figure 7

GMMs outputs distribution (PGA) at site \( S_{3} \) for the Mw6.93 event

Table 4 Considered different distance definition in GMMs
Fig. 8
figure 8

GMMs outputs distribution (PGA) at site \( S_{3} \) for the Mw6.93 without considering \( Z_{\text{tor}} \) uncertainty

Figure 9 shows the \( C_{v} \) of the NGA08 GMMs for two cases: one with \( Z_{\text{tor}} \) uncertainty and other without \( Z_{\text{tor}} \) uncertainty. For the first case, \( C_{v} \) of AS08 and CY08 is large for all sizes and converges to another GMMs’ \( C_{v} \) with an increase in magnitude. This trend depends on the variation of \( Z_{\text{tor}} \) that is large for small magnitude and small in large magnitude. As shown in Fig. 9b when \( Z_{\text{tor}} \) uncertainty is excluding, results decrease in \( C_{v} \) for small and medium magnitude. In both cases, \( C_{v} \) of PGA increases with increasing the magnitude size and decreases with increasing distance from the rupture, for all GMMs.

Fig. 9
figure 9

\( C_{v} \) of GMMs in different magnitude at site \( S_{1} \), \( S_{2} \) and \( S_{3} \)a with \( Z_{tor} \) uncertainty b without \( Z_{tor} \) uncertainty

5.2 GMMs uncertainty map

Simulated data in Sect. 4.2 are used as input for all sites around the fault. The median and standard deviation of all GMMs output are computed for all sites, and \( C_{v} \) is calculated using Eq. (7). Figure 10 illustrates the \( C_{v} \) of PGA for the Mw6.93. As shown in Fig. 10, \( C_{v} \) takes large values in the near-fault region and decreases by distance increasing. The \( C_{v} \) is large along strike in comparison with perpendicular direction to the fault. Due to the assumption behind this example (rupture is located in the middle of the fault), \( R_{\text{epi}} \) has smaller variation in this region which results small variation for PGA. The \( C_{v} \) of Iso and Non-iso GMMs are close to each other and three GMMs of N05, DT07 and ADSS05 are showing similar results. Maximum and minimum of \( C_{v} \) are N05 (\( C_{v} = 0.4 \)) and Non-iso (\( C_{v} = 0.2 \)) GMMs, respectively.

Fig. 10
figure 10

The \( C_{v} \) contour of GMMs for the Mw6.93 event

The \( C_{v} \) of NGA08 models have a different trend when compared to the other GMMs and have an asymmetrical pattern with unequal values for hanging wall and footwall. BA08 which depends on \( R_{jb} \) has a small \( C_{v} \) and in the perpendicular direction to rupture, \( R_{jb} \) has a small variation which results small variation in \( C_{v} \). Similar to other GMMs \( C_{v} \) has the largest value along strike and decrease with an increasing distance. This GMM has the largest \( C_{v} \) of 0.32.

CB08 GMM which depends on \( R_{\text{rup}} \), \( R_{jb} \), \( Z_{\text{tor}} \), and fault wide has a different trend within comparison to BA08 GMM. In this GMM \( R_{\text{rup}} \) has the largest contribution to the variation in the results and in regions with smaller \( R_{\text{rup}} \) variation, the \( C_{v} \) is also small. \( C_{v} \) does not have a symmetrical trend with respect to the strike direction due to the hanging wall and footwall effect. In this GMM, the maximum \( C_{v} \) equal to 0.32 in along the strike.

AS08 and CY08 GMMs are similar to CB08 GMM and the considerable variation in the results are coming variety of \( R_{\text{rup}} \) and \( Z_{\text{tor}} \). These GMMs have different trends with respect to other GMMs as \( C_{v} \) is greater at large distances with respect to the other GMMs. In these GMMs the maximum \( C_{v} \) equal to 0.32 similar to other NGA08 GMMs.

6 Conclusions

In this paper, we investigated the uncertainty of source-to-site distance and its effects on the outputs of GMMs. A simple framework was also proposed based on MC simulation method. The results of the analysis demonstrated that the two factors of magnitude and distance to the fault affect the values of \( C_{v} \). It was observed that in general, the value of this coefficient increases with the increase in magnitude and decreases with the increase in distance to the fault. Significant differences were observed in the standard deviations of different distance definitions in different directions. The highest standard deviation of all distance definitions was along and near the fault line, whereas the least standard deviations belonged to the direction perpendicular to fault rupture.

In GMMs that are only a function of \( R_{epi} \), the distribution of the outputs of GMMs closely resembled that of the \( R_{epi} \). In the NGA08 GMMs whose inputs are more than one variable, the output distributions of the GMMs were not the same as inputs distribution. The BA08 model which is a function of \( R_{jb} \) had the lowest and the AS08 and CY08 GMMs had the highest \( C_{v} \). The GMMs’ \( C_{v} \) maps indicated that the variations (\( \sigma \) in Eq. 7) in the vicinity of the fault fluctuate between 20 and 40 percent of the average value of the GMM output, which can have a significant impact on the determination of risk level in the region. The \( C_{v} \) of the GMMs decreased with the increase in distance to the fault, where the amount of decrease depended on the rupture location with respect to the fault. On average, the variations were less than 10% at distances farther than 30 km.

We showed the utilization of empirical relationships to determine the dimension of rupture have uncertainty. Employing the simulation algorithm proposed in this paper, this uncertainty was quantified. It was also depicted that, depending on the GMMs, the uncertainty can alter the results up to 40%. This amount of discrepancy can theoretically reach zero with the increase in precise information on the dimensions of fault rupture. However, with the available information, the outputs of the GMMs may increase up to 40%, which can have a significant impact on the seismic hazard determination of the region. This paper explains that distance uncertainty is one of the parameters that can have a great impact on the final results of PSHA and in the design of the important structures such as power plants, bridge, schools, etc.; this uncertainty can have a significant impact on engineering decisions to ensure the safety of the structure and its economic viability.