1 Introduction

Carbon dioxide (CO2) capture and storage (CCS) is considered as an option for reducing the atmospheric emissions of CO2 and mitigating the greenhouse effects on the environment, as well as enhancing energy and resources recovery (e.g., IPCC 2005; Liu and Wu 2015; Mathias et al. 2009). CO2 geosequestration is widely regarded as an effective technology for the reduction in greenhouse gas emissions to alleviate climate change (Deng et al. 2012; Li et al. 2014c; Wei et al. 2015). However, the considerable quantities of supercritical CO2 that are injected into deep geological formations can create large overpressures and subsequent changes in the state of stress that may jeopardize the caprock mechanical stability and could generate a leakage path for CO2 (Bachu 2000; Song and Zhang 2013). This overpressure may induce microseismic events if the caprock yields, which could also open a leakage path for CO2 (Vilarrasa 2014). The leaked CO2 could affect both human health and groundwater and even threaten the ecosystem balance (Li et al. 2014b). Thus, it is necessary to have sufficient knowledge of the caprock and its long-term integrity to ensure the long-term storage of CO2 in deep saline formations or depleted hydrocarbon reservoirs.

In recent years, the issues involved in underground CO2 sequestration have been discussed in the scientific literature mainly from fluid property (Hosein and Alshakh 2013), fluid interaction (Gysi and Stefánsson 2012; Lei and Ma 2013), and geochemical viewpoints (Dethlefsen et al. 2011). Regarding CO2 injection, many attempts have been made to examine the related geomechanical processes, particularly caprock integrity, during CO2 injection (Karimnezhad et al. 2014; Li et al. 2014a; Vidal-Gilbert et al. 2009; Vilarrasa 2014). The following concerns have recently been raised: (1) CO2 injection at a scale that is necessary to curb increases in the atmospheric CO2 will inevitably lead to the reactivation of moderately sized faults that compromise caprock integrity (Seebeck et al. 2015; Zhou and Burbey 2014) and may even trigger notable seismic events; and (2) it is unclear how such events could impact the long-term integrity of a CO2 repository (Cappa and Rutqvist 2011). Zoback and Gorelick (2012) reported a high probability that earthquakes would be triggered by the injection of large volumes of CO2 into the brittle rocks that are commonly observed in continental interiors, and because even small to moderately sized earthquakes threaten the seal integrity of CO2 repositories, large-scale CCS would be a risky and likely unsuccessful strategy for significantly reducing greenhouse gas emissions into the atmosphere (Lary et al. 2015; Wei et al. 2015). Studies of the actual CO2 injection sites, such as the In Salah in Algeria, have demonstrated that significant geomechanical changes may indeed occur depending on the injection pressure and the site-specific geomechanical conditions (Rutqvist 2012; Zhou et al. 2010). It is clear that modeling the magnitude of injection-related stresses and the associated property changes in the caprock, reservoir, and fault are crucial to understanding and surmounting this particular challenge facing CCS (Dempsey et al. 2014; Rohmer et al. 2014; Tenthorey et al. 2014).

In this study, we performed coupled flow and stress numerical modeling of CO2 injection to investigate the potential for caprock instability and induced seismicity (Rohmer 2014). If the seismicity occurs, the operator needs to do the enhanced monitoring, to do the detailed geomechanical assessment, and to check related emergence management measures. This research is important not only concerning artificial reservoir problems such as CO2 geosequestration or underground energy storage but also for researchers of subsurface fluid flow triggered natured earthquake processes. This study provides novelty in three aspects: (1) the distributions of Coulomb failure stress (CFS) that describe the fault stability and (2) in case of reactivation the moment magnitude of the associated seismic event. (3) Response surface methodology with a Box–Behnken experimental design was used to investigate the statistical significance of indicators of CFS as well as the moment magnitude of the induced seismicity and predicted the optimal design of the subsurface uncertainties. Our recent published paper in Engineering Geology (Wei et al. 2015) is to investigate an uncertainty analysis of nine impact indicators. It proved the Box–Behnken design and response surface methodology could efficiently be applied in uncertainty analyses of caprock integrity and induced seismicity in CO2 geosequestration. On this basis, we focus on a more complex situation that we usually encountered in site selection stage. Compared with our previous work, this study provides novelty in three aspects: (1) A 200-m fault with a 60° dip angle penetrates the caprock into the reservoir, and contact between fault and surrounding rock and fault slip have to be taken into consideration. (2) Based on a large number of 22-input tested parameters that exert major effects, which are beyond the upper limitation of the Box–Behnken design, to settle this problem, before the application of Box–Behnken design and response surface mapping, we used a tornado analysis to determine which tested parameters have the greatest influence on the simulated results, and only the parameters that show observable sensitivity go to the next step. (3) In case of reactivation, the moment magnitude of the associated seismic event was evaluated. Response surface methodology with a Box–Behnken experimental design was used to investigate the statistical significance of indicators of the moment magnitude of the induced seismicity, and the optimal design of the subsurface uncertainties was predicted. The results are more constructive in the site selection process. In Sect. 2, the geometry and the failure properties are described in detail. Section 3 details the study methodology, including a description of the response surface mapping that was used to investigate the statistical significance of the different tested parameters (Sect. 3.1), and the statistical experimental design for the tests performed, including tornado analysis and Box–Behnken experimental design (Sect. 3.2). In Sect. 4, we describe the statistical significance and optimal design of tested parameters for both Coulomb failure stress and the moment magnitude of the induced seismicity (Sects. 4.1 and 4.2). The primary findings and conclusions of this study are then summarized in Sect. 5.

2 Geomechanical model

2.1 Geometry and mesh of the model

The application of geomechanical models contributes significantly to the calculation of the stress distribution after CO2 injection (Aruffo et al. 2014; Dethlefsen et al. 2011; Xing et al. 2015; Yang et al. 2012). To solve the coupling between fluid flow and geomechanical problems, different approaches can be used (Fei et al. 2015; Jeannin et al. 2007; Rutqvist et al. 2002). The fully coupled approach simultaneously solves the whole set of equations that govern the hydromechanical problem. And the partially coupled approach is based on an external coupling between conventional reservoir and geomechanical simulators. One-way coupled approach is the simplest partially coupled approach in which the pore pressure history issued from a conventional reservoir simulation is introduced as input into the geomechanical equilibrium equation. In practice, the pore pressure computed by reservoir simulation is introduced in poroelasticity equations to deduce stress and deformation. This coupling is easy to be implemented and still includes interesting physics (Fei et al. 2015; Vidal-Gilbert et al. 2009).

In this study, we focus on the effect of fluid injection with low pore pressure variation, and the effect of deformation on porosity is expected to be insignificant and therefore a fully coupled model is not necessary. Hence, the one-way coupling technique has been chosen without multiphase flow processes, and a two-dimensional (2D) numerical model was used to consider the geomechanical aspects of the injection of CO2 into the reservoir to simulate the storage of CO2 through injections from a single well into the deep saline formations in the Ordos Basin, China. Rock masses are discontinuous and have heterogeneous and anisotropic properties, and their mechanical properties are greatly depending on the types of rock, buried depths, geological, and diagenetic processes (Jaeger et al. 2009; Zhou and Burbey 2014). For evaluating the coupled fluid and mechanical behavior of these coupled systems, we use the theory of poroelasticity as a very practical and powerful mathematical approach (Biot 1956; Coussy 2004).

As the most complete and stable structural unit in China, the Ordos Basin is an ideal site for a CCS project (Liu et al. 2009). The Shenhua demonstration CCS project is located in Inner Mongolia, and the target zone is located in the backfill area of the Shenhua open pit coal mine in the western riverbed of the Ulan Moron River. This area is approximately 45 km southeast of the Ordos city, Inner Mongolia, China, and 1.3 km east of the Shenhua Ordos Coal to Oil Branch (Li et al. 2014a). Figure 1 shows a schematic view of the model geometry, where only the main geological units are represented: Ordovician, Carboniferous, Permian, and Triassic (Li et al. 2013). The target layer is 1620 m thick. Based on the lithology histogram, the geometric model was obtained by ignoring the thin layers of the reservoir and caprock.

Fig. 1
figure 1

Geological model and numerical model of the target layers in the Ordos Basin. A Location of Ordos Basin and the study area (purple square). B, C Reservoir–seal combinations and the corresponding numerical model, respectively. The injection well located on the left boundary of the numerical model. And the boundary conditions used in the numerical model are illustrated in D

The geometry of the 2D model is rectangular with a length of 3000 m; therefore, the outer boundary does not significantly affect the mechanical behavior of the model. According to the geological depth, the numerical model extends vertically from −780 to −2400 m, and the formations below 780 m can be divided into three primary reservoir–seal combinations (Fig. 1). Fluid is injected into the storage reservoir from a vertical injection well located on the left of the model at a constant injection pressure of 4.5 MPa for 15 days. A 200-m fault with a 60° dip angle penetrates the caprock into the reservoir. The coefficient of friction on the fault was set to 0.6, and the distance from the injection well to the left boundary of the fault is 1000 m. To enhance the computational accuracy and efficiency, a structured mesh with 5990 elements was adopted in the numerical model. And four-node reduced integration and pore pressure elements were used to mesh the model. Figure 2 presents the pore pressure nephograms in the targeted reservoir and the fault at different time. The pore pressure rises from the leftmost boundary from the injection point and spreads to right with time.

Fig. 2
figure 2

AC Pore pressure nephograms in the targeted reservoir and the fault at 1, 24, and 120 h, respectively

2.2 Failure properties

To determine whether the preexisting fault affecting the caprock is stable during CO2 injection, a failure criterion must be defined. A failure criterion defines a domain in the stress space outside of which the rock cannot withstand the load (Vidal-Gilbert et al. 2009). A failure criterion is commonly addressed by Mohr–Coulomb’s approach:

$$\tau = c + \sigma^{{\prime }} \tan \varphi$$
(1)

where \(\tau\) and \(\sigma^{{\prime }}\) are the shear stress and normal effective stress, respectively, on the fault through which material failure occurs, and c and \(\varphi\) are the cohesion and the angle of the internal friction of the fault, respectively. In this study, the Mohr–Coulomb failure criterion was included to simulate the effects of rock failure. According to the Mohr–Coulomb failure criterion, a necessary condition for fault instability is the Coulomb failure stress (CFS) reaching or exceeding its breaking strength (S):

$${\text{CFS}} = \tau + \tan \varphi \left( {\sigma - P_{\text{f}} } \right) \ge S$$
(2)

where \(\tau\) is the shear stress that causes sliding, \(\sigma\) is the normal effective stress, and \(P_{\text{f}}\) is the pore pressure. In the presence of a fault, we focused on the CFS in the fault generated with the fluid injection.

3 Methodology

3.1 Response surface methodology

Response surface methodology (RSM), which was developed by Box and Hunter in the 1950s (Box and Hunter 1957), is a collection of statistical and mathematical methods that are useful for the modeling and analysis of engineering problems (Aslan and Cebeci 2007). RSM consists of a group of mathematical and statistical techniques based on the fit of empirical models to experimental data obtained in relation to the statistical experimental design. More specifically, RSM quantifies the relationship between the controllable input indicators, as well as their interactions, and the obtained response surfaces (Khuri and Mukhopadhyay 2010). In addition, the direct and interactive effects of the input indicators can be represented through two- and three-dimensional plots (Myers et al. 2009).

For this objective, linear or square polynomial functions were used to describe the studied system and consequently explore the experimental conditions until the system’s optimization to achieve the minimalized Coulomb failure stress in the fault (Khuri and Mukhopadhyay 2010). Some of the stages in the application of RSM as an optimization technique are the following: (1) the selection of independent variables that exert major effects on the system through screening studies and the delimitation of the statistical experimental region based on the experience of the researcher; (2) the choice of the experimental design and the performance of the experiments according to the selected experimental matrix; (3) the mathematic–statistical treatment of the obtained experimental data through fitting to a polynomial function; (4) the evaluation of model’s fitness; (5) the verification of the necessity and possibility of performing a displacement in the direction toward the optimal region; and (6) the identification of the optimal values for each studied variable (Myers et al. 2009).

In most cases, tested parameters are not measurable. With the goal of optimizing the response variable, the independent variables were assumed to be controllable by experiments with negligible errors. Such a relationship between a response variable and the selection of independent tested parameters that exert major effects is unknown but can be approximated by a low-degree polynomial model of the form in which the response surface can be expressed as follows:

$$Y = K + \mathop \sum \limits_{i = 1}^{n} K_{i} x_{i} + \mathop \sum \limits_{i = 1}^{n} K_{ii} x_{i}^{2} + \mathop \sum \limits_{i = 1}^{n - 1} \mathop \sum \limits_{j = 1}^{n} K_{ij} x_{i}x_{j} + \varepsilon$$
(3)

where Y is the response variable; \(x_{i}\) and \(x_{j}\) are the tested parameters; \(K, K_{i} , K_{ii} ,\,\,{\text{and}}\, \,K_{ij}\) are parameters that should be determined in a second-order model; and \(\varepsilon\) is a random error (Khuri and Mukhopadhyay 2010).

Optimizing refers to improving the performance of a system, a process, or a product in order to obtain the maximum benefit from it. The surfaces generated by linear models can be used to indicate the direction in which the original design must be displaced in order to attain the optimal conditions.

For quadratic models, the critical point can be characterized as maximum, minimum, or saddle. It is possible to calculate the coordinates of the critical point through the first derivate of the mathematical function, which describes the response surface and equates it to zero. The quadratic function obtained for two variables as described below (Eq. 4) is used to illustrate the example:

$$y = b_{0} + b_{1} x_{1} + b_{2}x_{2} + b_{11} x_{1}^{2} + b_{22} x_{2}^{2} + b_{12} x_{1} x_{2}$$
(4)
$$\frac{\partial y}{{\partial x_{1} }} = b_{1} + 2b_{11} x_{1} + b_{12} x_{2} = 0$$
(5)
$$\frac{\partial y}{{\partial x_{2} }} = b_{2} + 2b_{22} x_{2} + b_{12} x_{1} = 0$$
(6)

Thus, to calculate the coordinate of the critical point, it is necessary to solve the first grade system formed by Eqs. (5) and (6) to find the \(x_{1}\) and \(x_{2}\) values. The visualization of the predicted model equation can be obtained by the surface response mapping. This graphical representation is an N-dimensional surface in the (N + 1)-dimensional space. Usually, a two-dimensional representation of a three-dimensional (3D) plot can be drawn. For the quadratic response surface plot in the optimization of two parameters, the location of the maximum point (inside the experimental region or outside the experimental region) determines the affection of parameter variation to the studied system. As a result, it is possible to find the optimum region through visual inspection of the surfaces.

3.2 Statistical experimental design for tests

The classical approach of changing one variable at a time and studying the effect of the variable on the response is a complicated technique, particularly in a multivariate system or in cases in which more than one response is important. Statistical experimental designs are statistical techniques that can be used to optimize such multivariable systems. Using a statistical experimental design based on response surface methodology, the aggregate mix proportions with the lowest void content can be determined with the minimum number of experiments without requiring the experimental study of all of the possible combinations. Furthermore, the input levels of the different variables for a particular response level can also be determined.

Response surface methodology can be applied when a response or a set of responses of interest is influenced by several variables. The objective is to simultaneously optimize the levels of these variables to attain the best system performance (Myers et al. 2009). However, before applying the RSM, it is first necessary to select a suitable statistical experimental design that will define which experiments should be performed in the experimental region being studied.

There are some experimental matrices for this purpose. In this study, based on the large number of 22-input input variables that exert major effects, we used a tornado analysis to determine which tested parameters have the greatest influence on the simulated results. A Box–Behnken experimental design for quadratic response surfaces can then be used to approximate a response function for the experimental data that cannot be described by linear functions.

3.2.1 Tornado analysis

A tornado analysis was used to determine which tested parameters have the greatest influence on the simulated resulting CFS generated with the fluid injection (O’Dell and Lindsey 2010). A tornado analysis was performed on the numerical model, which represented a column of five levels. Because only one simulated input indicator is changed at a time, 22-input tested parameters were varied to five levels, namely 60, 80, 100, 120, and 140 %, in the tornado analysis, as listed in Table 1. The ranges of the values for the subsurface uncertainties are based on observations and variations known for the caprock, fault, and reservoir properties observed in other CO2 injection projects (Rohmer et al. 2014).

Table 1 Tested parameters and ranges in the tornado analysis

Based on the results listed in Appendix A in Supplementary Material, Fig. 3 shows the relative change in the CFS to the benchmark (100 % level) obtained in the tornado analysis. After eliminating the tested parameters that do not exert any significance (e.g., K c and e r) or do not show any observable sensitivity at a lower-value level (e.g., f and K f), nine (P, \(\varphi_{\text{f}}\), \(\mu_{\text{c}}\), \(\varphi_{\text{r}}\), E c, \(\varphi_{\text{c}}\), K r, μ f, and E f) out of the 22-input tested parameters were found to exert the greatest influence on the simulated CFS results. Moreover, it should be noted that the CFS value increases with increases in all of the nine tested parameters with the exception of the angle of friction in the reservoir (\(\varphi_{\text{r}}\)).

Fig. 3
figure 3

Relative change in the CFS obtained in the tornado analysis

3.2.2 Box–Behnken experimental design

A statistical experimental design is widely used to control the effects of tested parameters in many processes. Its usage decreases the number of experiments and the use of time and material resources (Ferreira et al. 2007). A Box–Behnken statistical screening design was used to statistically optimize the formulation tested parameters and evaluate their main effects \(K_{i}\), interaction effects \(K_{ij}\), and quadratic effects \(K_{ii}\) in Eq. (3) to the Coulomb failure stress and moment magnitude of the induced seismicity (Rohmer 2014). This cubic design is characterized by a set of points lying at the midpoint of each edge of a multidimensional cube and center point replicates, and the ‘missing corners’ help the experimenter avoid the combined factor extremes. This property prevents a potential loss of data in these cases (Box and Behnken 1960).

In this study, based on statistical diversity, the full factorial statistical experimental design of the nine tested parameters, which were varied at the three levels listed in Table 2, would result in a detailed response surface after considerable computational work. With the expectation of a detailed response, note that a full factorial design requires the evaluation of all of the possible combinations of the nine tested parameters at the high, middle, and low levels and requires 39 = 19,683 simulation runs to implement. Thus, a huge workload and long working hours would be required to complete these simulation runs before the application of an efficient statistical experimental design.

Table 2 Nine impact indicators varied in three levels

The Box–Behnken experimental design (BBD) is a class of rotatable or nearly rotatable second-order designs based on three-level incomplete factorial designs. For three factors, its graphical representation can be seen in two forms: a cube that consists of the central point and the middle points of the edges and a figure of three interlocking 22 factorial designs and a central point (Zhou and Burbey 2014). The number of experiments (N) required for the development of BBD is defined as N = 2 k(k − 1) + C 0, (where k is number of factors and C 0 is the number of central points). For comparison, the number of experiments for a central composite design is N = 2k + 2k + C 0. BBD is slightly more efficient than the central composite design but much more efficient than the three-level full factorial designs where the efficiency of one experimental design is defined as the number of coefficients in the estimated model divided by the number of experiments. The Box–Behnken is a good design for response surface methodology because it permits the following: (1) estimation of the tested parameters of the quadratic model; (2) building of sequential designs; (3) detection of a lack of fit of the model; and (4) the use of blocks (Ferreira et al. 2007). Another advantage of the BBD is that it does not contain combinations for which all factors are simultaneously at their highest or lowest levels. So these designs are useful in avoiding experiments performed under extreme conditions, for which unsatisfactory results might occur (Coussy 2004).

To decrease the number of simulation runs, a spreadsheet implementation of the Box–Behnken design, which creates an evenly spaced lattice of the input variable combinations, is available (Ferreira et al. 2007). Therefore, the Box–Behnken experimental design was chosen to determine the relationship between the response functions and the variables. For a three-level nine-factorial Box–Behnken experimental design, a total of 130 experimental runs are needed. The runs were generated using simulation software (Shen 2010). The response functions can be directly calculated with a user-defined subroutine, and selected simulation results were gathered from each of the runs and used for the statistical analysis process (Appendix B in Supplementary Material).

4 Results and discussion

4.1 Uncertainty analysis of the Coulomb failure stress

4.1.1 Statistical significance of the indicators of Coulomb failure stress

In geosequestration, the injection pressure in conjunction with the upward pressure exerted by the injected CO2 (due to buoyant forces) leads to perturbation of the stress field in the reservoir. The change in Coulomb failure stress of the reservoir formation rock and caprock caused by the perturbation can lead to strength reduction and failure of the caprock. For this reason, the level of Coulomb failure stress was raised as a predicted response of variables to evaluate the caprock integrity, and the present investigation was carried out to minimize the Coulomb failure stress to a practical lower level and to optimize the mix proportion of the individual variables (Shukla et al. 2010).

The present investigation was conducted to minimize the Coulomb failure stress to a practical lower level and to optimize the mix proportion of the individual variables. As shown in Appendix B in Supplementary Material, the Coulomb failure stress (response) of the 130 statistically designed combinations (inputs) suggested by the Box–Behnken design of experiments for the nine variables was experimentally determined, and the minimum and maximum CFS values obtained were 3.412 MPa and 5.404 MPa, respectively. The application of the response surface methodology yielded a regression equation from an analysis of the variance that gave the level of Coulomb failure stress as a predicted response of the variables and fits the experimental data with a correlation coefficient (R 2) of 0.9757, indicating that the fitness of the selected model is good and that the model could be used for further investigation.

$$\begin{aligned} {\text{CFS }} & = - \,4316260 - 531465*P + 202750*\varphi_{\text{r}} - 317650*\varphi_{\text{f}} \\ & \quad - 337475*\mu_{\text{c}} - 176610*\varphi_{\text{c}} - 185765*\mu_{\text{f}} - 458450*P*\varphi_{\text{c}} \\ & \quad + 425950*P*\mu_{\text{f}} - 586999*\mu_{\text{c}} *E_{\text{c}} + 457499*\mu_{\text{c}} *\varphi_{\text{c}} \\ & \quad - 480999*E_{\text{c}} *\varphi_{\text{c}} - 529950*\varphi_{\text{c}} *\mu_{\text{f}} \\ \end{aligned}$$
(7)

The main effects \(\left( {K_{i} } \right)\) and reciprocal actions \((K_{ii} \,\, {\text{and}}\,\,K_{ij} )\) in Eq. (7) were estimated from the experimental results. All of the terms, in consideration of their significance, were included in the equation above. The practical application of this statistical model is to link the results obtained from Box–Behnken experimental design to response surface methodology. It concluded the relation between the 130 combinations of the 3 leveled 9 parameters and their corresponding Coulomb failure stress in the first place, and the statistical significance of indicators on Coulomb failure stress as well as optimum design for minimum Coulomb failure stress are all based on this model.

An analysis of variance (ANOVA) of the response surface quadratic model gives the squares and degrees of freedom for the regression Eq. (4). In statistics, the P value (not variable P in Eq. (7)) is a function of the observed sample results and is used to test a statistical hypothesis. Before performing the test, a threshold value is chosen, called the significance level of the test, which was 5 % in this study and denoted as α (Nuzzo 2014). A P value equal to or smaller than the significance level (α) suggests that the observed data are inconsistent with the assumption that the null hypothesis is true and thus that the hypothesis must be rejected, resulting in the alternative hypothesis being accepted as true (Iversen and Norpoth 1987). When the P value is calculated correctly, such a test is guaranteed to maintain the error rate at a value no greater than α. Therefore, the P value level indicates the significant model terms, and a model P value less than 0.0001 indicates that the tested parameters are significant. In contrast, a model P value greater than 0.05 indicates that the tested parameters are not significant. Moreover, the F test is used to compare the components of the total deviation because statistical significance is tested by comparing the F test statistic, the F value of the quadratic model, and individual model terms. An ANOVA of the response surface quadratic model gives the squares and degrees of freedom for the 130 combinations and the corresponding Coulomb failure stress, as listed in Appendix C in Supplementary Material. Moreover, the F test is used to compare the components of the total deviation because statistical significance is tested by comparing the F test statistic, the F value of the quadratic model, and individual model terms. Based on the ANOVA results, the sensitivity and reciprocal action of the impact indicators are shown in Fig. 4.

Fig. 4
figure 4

Sensitivity and reciprocal action of the tested parameters. The color depth of the four partitions (I, II, III and IV) in the histogram indicates their significance level from high to low. In each partition, the sensitivity and reciprocal action of the tested parameters are listed by their F value as follows: tested parameters in (I) are highly significant to the level of Coulomb failure stress with a P value at the 0.001 level; the P values in (II) and (III) are at the 0.01 and 0.05 levels, respectively; and P values greater than 0.05 in (IV) indicate no significance

As shown in Fig. 4, the injection pressure, Poisson’s ratio in the caprock, and angle of friction in the fault are highly significant to the level of Coulomb failure stress with a P value at the 0.001 level; Poisson’s ratio in the caprock * Young’s modulus in the caprock, angle of friction in the caprock * Poisson’s ratio in the fault, Young’s modulus in the caprock * angle of friction in the caprock, and injection pressure * angle of friction present significance at the 0.01 level; angle of friction in the reservoir, Poisson’s ratio in the fault and angle of friction in the caprock exhibit significance at the 0.05 level; and none of the other tested parameters are significant (P > 0.05) to the predicted response. Additional tested parameters with P value greater than 1 are not listed in the histogram.

4.1.2 Optimal design of indicators of Coulomb failure stress

To better understand the results, the predicted models are presented in Fig. 5 as 3D response surface plots. Figure 5 shows the reciprocal action of the highly significant indicators injection pressure (P), Poisson’s ratio in the caprock (μ c), and angle of friction in the fault (φ f) to the Coulomb failure stress.

Fig. 5
figure 5

Predicted models of highly significant indicators of CFS. The 3D response surface plot showing the effect of injection pressure and Poisson’s ratio in the caprock in A indicates that increases in the coded values of the proportion of injection pressure and Poisson’s ratio in the caprock from −1 to 1 increase the Coulomb failure stress. The effect of injection pressure and angle of friction in the fault is shown in B, and the effect of Poisson’s ratio in the caprock and angle of friction in the fault is shown in C. Increases in the values of the proportion of injection pressure, Poisson’s ratio in the caprock, and angle of friction in the fault from −1 to 1 in Fig. 5A, B, C increase the Coulomb failure stress to −5.404 MPa and thereafter continues to increase. This result indicates that the injection pressure, Poisson’s ratio in the caprock, and angle of friction in the fault maximize the Coulomb failure stress

As shown in Fig. 5A, B, C, as the coded values of the proportion of injection pressure, Poisson’s ratio in the caprock, and the angle of friction in the fault increase from −1 to 1, the Coulomb failure stress increases. This result indicates that the injection pressure, Poisson’s ratio in the caprock, and angle of friction in the fault significantly increase the Coulomb failure stress. In particular, as the most sensitive indicator, an increase in the injection pressure within the range results in a greater increase in Coulomb failure stress.

To minimize the Coulomb failure stress to a practical lower level and to optimize the mix proportion of the individual variables, optimization was accomplished by obtaining the individual optimal values for each response and by combining the individual optima to obtain a combined or composite optimum and later maximizing the composite optimum and identifying the optimal tested parameters settings. The predicted Coulomb failure stress was obtained from the regression equation using the experimentally determined values for the 130 combinations. Upon optimization, a minimization target was assigned to the resolution factor response CFS value in numerical optimization. Accordingly, the minimum Coulomb failure stress given by the computation software was −3.39331 MPa. The optimal levels of the coded individual variables to minimize the CFS are shown in Fig. 6.

Fig. 6
figure 6

Optimal levels of the coded individual variables to minimize the CFS

As the proportion of the tested parameters reached their individual optimal coded values, the CFS decreased to −3.39331 MPa, corresponding to the optimal value. In particular, a desirability value of 1.000 indicates a good fit of the predictive model. Furthermore, the optimized combination was significantly different from the experimentally studied combinations and showed less CFS. This result demonstrates the usefulness of statistical techniques in model exploitation and empirical model building. The final optimal design for the subsurface uncertainties in terms of actual indicator values is shown in Table 3.

Table 3 Actual tested parameters’ values in the optimal design for the subsurface uncertainties

4.2 Uncertainty analysis of moment magnitude

4.2.1 Seismic moment and moment magnitude

The scalar seismic moment M 0, as defined for a ruptured patch on a fault by Hanks and Kanamori (1979), is as follows:

$$M_{0} = GAd$$
(8)

where G is the shear modulus, A is the rupture area, and d is the average slip in the rupture area. Second, the moment magnitude M w of an earthquake is given by Kanamori and Anderson (1975) as follows:

$$M_{\text{w}} = \frac{2}{3}\left( {\log_{10} \left( {M_{0} } \right) - 9.1} \right)$$
(9)

Note that our simulation results were achieved with a 2D plane-strain model, and to calculate the seismic moment and magnitude, the length of the rupture area A was 200 m, but the depth of the rupture area A was assumed. In this model, the shear modulus G can be easily calculated with the Young’s modulus and Poisson’s ratio provided above by the equation \(G = E/2(1 + \mu ) = 7.69E + 09\).

4.2.2 Statistical significance of the indicators of moment magnitude

From the application of the Box–Behnken experimental design and response surface methodology, the minimum and maximum moment magnitude (M w) values obtained were 4.17986 and 4.36532. The regression equation obtained after the analysis of variance gave the level of M w as a predicted response of the variables. This equation fitted the experimental data best with a correlation coefficient (the value of R 2) of 0.9946. The main effects \(\left( {K_{i} } \right)\) and reciprocal actions \((K_{ii} ,K_{ij} )\) in Eq. (3) were estimated from the experimental results. All of the terms, regardless of their significance, were included in the following equation:

$$\begin{aligned} M_{\text{w}} & = 4.270933965 + 0.007172216*P + 0.043785571*\varphi_{\text{r}} + 0.00986756*\varphi_{\text{f}} \\ & \quad + 0.028086374*\mu_{\text{c}} + 0.021237674*E_{\text{c}} + 0.017992779*\varphi_{\text{c}} + 0.032056121*K_{\text{r}} \\ & \quad + 0.00095415*\mu_{\text{f}} + 0.00291713*P*\varphi_{\text{r}} + 0.00166024*P*K_{\text{r}} \\ & \quad + 0.00409613*\varphi_{\text{r}} *K_{\text{r}} - 0.001646151*\varphi_{\text{r}}^{2} - 0.002296648*\varphi_{\text{f}}^{2} \\ & \quad + 0.001731633*\mu_{\text{c}}^{2} - 0.001241751*E_{\text{c}}^{2} - 0.001714995*K_{\text{r}}^{2} \\ \end{aligned}$$
(10)

An analysis of variance (ANOVA) for the response surface quadratic model gives the squares and degrees of freedom for the 130 combinations and corresponding M w values listed in Appendix B in Supplementary Material. Based on the ANOVA results, the sensitivity and reciprocal action of the tested parameters are shown in Fig. 7. The injection pressure, Poisson’s ratio in the caprock, and angle of friction in the fault are highly significant to the M w level with a P value at the 0.001 level; Poisson’s ratio in the caprock * Young’s modulus in the caprock, angle of friction in the caprock * Poisson’s ratio in the fault, Young’s modulus in the caprock * angle of friction in the caprock, and injection pressure * angle of friction present significance at the 0.01 level; the angle of friction in the reservoir, Poisson’s ratio in the fault, and angle of friction in the caprock are significant tested parameters at the 0.05 level; and none of the other indicators are significant (P > 0.05) for the predicted response. Additional tested parameters with a P value greater than 1 are not listed in the histogram. The detailed ANOVA results are listed in Appendix D in Supplementary Material.

Fig. 7
figure 7

Sensitivity and reciprocal action of the impact indicators. The color depth of the four partitions I, II, III and IV in the histogram indicates their significance level from high to low. In each partition, the sensitivity and reciprocal action of the tested parameters are listed by their F value as follows: the tested parameters in (I) are highly significant to the M w level with a P value at the 0.001 level; the P values in (II) and (III) are at the 0.01 and 0.05 levels, respectively; and the P values greater than 0.05 in (IV) indicate no significance

4.2.3 Optimal design of indicators of the moment magnitude

The effect of the three highly significant tested parameters, namely angle of friction in the reservoir (φ r), permeability in the reservoir (K r) and Poisson’s ratio in the caprock (μ c), on the M w is shown in Fig. 8A–C. Increases in the coded values of the proportion of the angle of friction in the reservoir, the permeability in the reservoir, and Poisson’s ratio in the caprock from −1 to 1 increase the M w. This result indicates that the friction in the reservoir, the permeability in the reservoir, and Poisson’s ratio in the caprock significantly increase the M w. In particular, as the most sensitive indicator, the angle of friction in the reservoir has the largest of the chosen sizes, and higher input values will lessen the values required for other grades to fill the high M w level.

Fig. 8
figure 8

Predicted models of highly significant indicators of M w. The 3D response surface plot showing the effect of injection pressure and Poisson’s ratio in the caprock in A indicates that increases in the coded values of the proportion of the angle of friction in the reservoir (φ r) and the permeability in the reservoir (K r) from −1 to 1 increase the M w. The effect of the permeability in the reservoir (K r) and Poisson’s ratio in the caprock (μ c) is shown in B, and the effect of the angle of friction in the reservoir (φ r) and Poisson’s ratio in the caprock (μ c) is shown in C Increases in the proportion of the injection pressure, Poisson’s ratio in the caprock, and the angle of friction in the fault from −1 to 1 in A–C increase the M w, which continues to increase thereafter. This result indicates that the angle of friction in the reservoir, permeability in the reservoir, and Poisson’s ratio in the caprock maximize the M w

The present investigation was conducted to minimize the M w to a practical lower level and to optimize the mix proportion of the individual variables. The predicted M w was obtained from the regression equation using the experimentally determined values for the 130 combinations. A minimization target was assigned to the resolution factor response M w in the numerical optimization. Accordingly, the minimum M w given by the software was 4.17339. The optimal levels of the coded individual variables for minimizing the M w are shown in Fig. 9.

Fig. 9
figure 9

Optimal levels of the coded tested parameters for minimizing the M w

The M w decreased to 4.17339 when the proportion of tested parameters reached their individual optimal coded values, and the desirability value of 1.000 shows a good fit of the predictive model. Furthermore, the difference between the optimized combination and the studied combinations showed a lower M w, demonstrating the usefulness of statistical techniques in model exploitation and empirical model building. The final optimal design for the subsurface uncertainties in terms of the actual tested parameters values is shown in Table 4.

Table 4 Actual tested parameters’ values in the optimal design for the subsurface uncertainties

5 Conclusions

The optimization of tested impact parameters is a complex process that requires one to consider a large number of variables and their interactions. The present study conclusively demonstrates the application of response surface methodology, and the use of the Box–Behnken design from the point of tested impact parameters for caprock integrity during CO2 injection was discussed.

A suitable approximation for the true functional relationships between independent tested parameters and the response surfaces was obtained from a limited number of experimental runs and was in good agreement with the experimental values of both Coulomb failure stress and moment magnitude of the induced seismicity. An analysis of variance of this response surface quadratic model gives the sum of squares and degrees of freedom for the model for estimating the statistical significance of tested parameters of the Coulomb failure stress and moment magnitude of the corresponding seismicity. In addition, response surface mapping aids the prediction of the values of selected independent variables for the preparation of optimal formulations with desired properties. We therefore reached the following conclusions:

  1. 1.

    The results of an analysis of variance of the response surface quadratic model demonstrated that the injection pressure, angle of friction in the fault, Poisson’s ratio in the caprock, angle of friction in the reservoir, Young’s modulus in the caprock, angle of friction in the caprock, Poisson’s ratio in the fault, and the quadratic term angle of friction in the fault * angle of friction in the caprock are highly significant to the level of Coulomb failure stress with a P value at the 0.001 level. In addition, the angle of friction in the reservoir, permeability in the reservoir, Poisson’s ratio in the caprock, Young’s modulus in the caprock, angle of friction in the caprock, angle of friction in the fault, injection pressure, and the quadratic terms angle of friction in the reservoir * permeability in the reservoir, Poisson’s ratio in the caprock * Poisson’s ratio in the caprock, Young’s modulus in the caprock * Poisson’s ratio in the fault, and Poisson’s ratio in the caprock * Poisson’s ratio in the fault are highly significant to the M w level of the induced seismicity with a P value at the 0.001 level.

  2. 2.

    The highly significant tested parameters of the Coulomb failure stress and moment magnitude of the induced seismicity are listed in an inconsistent sequence. For Coulomb failure stress, injection pressure is the most sensitive indicator, and variations in the value of the injection pressure lead to significant changes in the Coulomb failure stress. However, for the M w of the induced seismicity, the tested parameters in the reservoir and the caprock play a key role. Due to the above conclusions, we should focus on controlling the magnitude of the injection pressure to avoid caprock failure in most CCS projects, but for the projects that have demonstrated that induced seismicity may indeed occur, such as Weyburn in Saskatchewan, Canada, In Salah in Algeria, and Sleipner in the Norwegian sector of the North Sea, we should not only evaluate the injection pressure but also choose the optimal geomechanical conditions. In addition to both Coulomb failure stress and the moment magnitude of the induced seismicity, the tested parameters in the fault are not as significant as those in the reservoir and the caprock due to the relatively small volume of the fault in the whole model.

  3. 3.

    Both the Coulomb failure stress and the moment magnitude of the induced seismicity for different combinations of subsurface uncertainties were optimized based on the experimental data. The optimal combinations had a predicated value of less than the experimentally determined model with a desirability value of 1.000, which demonstrated the fitness of the statistical model for analyzing the experimental data. Furthermore, this study demonstrated that the Box–Behnken design and response surface methodology can efficiently be applied for the modeling of caprock integrity during CO2 injection and that this approach is an economical method for obtaining the maximal amount of information in a short period of time and with the fewest number of experimental runs. However, due to the problem of convergence, we have to compromise the uncertainty range (90–110 %) to ensure our results are usable and accurate in this study. We can account for larger levels of uncertainty for future lines of research and application.