Introduction

Uncertainties influencing both structural parameters and imposed loads are important in the prediction of behaviour of the structure. Many approaches have been used to deal with the uncertainties for studying the system responses. These approaches demand a mathematical representation of uncertainties on the basis of available information. Probability theory is the most customary technique to describe uncertainties as random variables characterised by the probability density functions (PDF). If the data are, however, incomplete, ambiguous, and of poor quality, it is difficult to form the PDF. Hence, uncertainty is inadequate to analyse the real structural behaviour by probabilistic theory; therefore, expected structural response may diverge due to the presence of imprecise uncertainties.

Imprecise quantities are defined/expressed as set of possible values that a parameter under consideration varies. Imprecise probability is an extension of traditional probability theory when uncertainties are bounded by lower and upper values for an event. In the past, many researchers considered the uncertainties by representing in many ways based on their sources. For example, evidence theory [1,2,3], fuzzy sets and possibility theory [4,5,6,7], convex models [8,9,10], Bayesian theory [11,12,13], interval analysis [14,15,16], and probability-box (p-box) [17,18,19,20,21] can be found in many engineering applications. The representations are different from one another by the way the incomplete knowledge is interpreted/described mathematically. Also, these theories need the description of variables in bounds rather than precise information about the probability distribution. Structural reliability-based methods were developed with uncertainties; mainly, direct reliability [4,5,6,7, 14, 15, 18, 20, 22] and inverse reliability [23,24,25] are the two methods which were studied and algorithms were formulated for finding the solution. The lower and upper ranges of structural responses have been extracted by adopting interval finite element method [26]. Matrix decomposition strategy and fixed point formulation were used [19] so as to reduce the overestimation of bounds for uncertain variables, in estimating the structural reliability. Fuzzy [5, 6] and interval [15, 16] parameters are used to quantify the uncertainties with insufficient data in respective studies for the estimation of reliability for complex structures. In the context of imprecise probability, reliability bounds have been evaluated [27], considering imprecise measurements of the members of the structure. To limit the dependency overestimation, interval computation was adopted [28]. Intervals are employed to represent spatial nondeterministic response of structural systems under various loading conditions [22]. Reinforced concrete framed structure was considered for an earthquake loading in the presence of uncertainties in the design of the structure [29]. Probabilistic risk is identified for a retaining wall on considering sensitivity of the variables [30]; also, the study had been extended for the reliability estimation of retaining wall subjected to blast loading with different geotechnical uncertainties [31].

\(\underline{P} (E)\) and \(\bar{P}(E)\) are a generalised representation of lower and upper probabilities of an event E, respectively, with \(0 \le \underline{P} (E) \le \bar{P}(E) \le 1\) for any imprecise variable. To handle imprecise uncertainties, interval analysis can be unified with traditional probability theory, called probability bounds analysis, which is generally represented as p-box. It is a general representation of an imprecise variable with lower bound (LB) and upper bound (UB) on its cumulative distribution function (CDF). Computation of reliability index or failure probability becomes burdensome and impractical for the modern multiplex engineering structures. Therefore, simulation methods [e.g. Monte Carlo simulation, (MCS)] and approximate methods like first- and second-order reliability methods (FORM and SORM) are used. These are robust and easy for application, but impractical to generate more samples for real-life examples, in which a complex analysis procedure may be required for the analysis. Hence, for nonlinear systems with more number of uncertainties having imprecise information, traditional simulation-based methods become impractical.

Moreover, design of the system involves repeated analysis to satisfy the design constraints. The complex high-dimensional systems are computationally cumbersome. Hence, approximate models which map the input and output, commonly called response surface methods (RSM), become inevitable. Some of the RSM techniques include sensitivity analysis [11], Kriging meta-models [9, 18], gradient projection method, and Rackwitz–Fiessler algorithms [32]. The application of antecedently mentioned theories is restricted to simple models since they are computationally expensive to adopt for high-dimensional complex structural models. In order to surmount this rigorousness, high-dimensional model representation (HDMR) was introduced by [33]. A concept of support vector regression (SVR) was utilised in computing the sensitivity indices with an adequate number of sample points in accordance with HDMR [34]. The application of HDMR has also been incorporated to uncertainty analysis when the input variables are modelled as interval variables and fuzzy membership functions [5, 6, 14, 34].

The main objective of this work is to propose a methodology for estimating failure probability of structural systems with imprecise uncertainties defined as p-box variables. The HDMR techniques are used for response surface generation, and interval Monte Carlo simulations (IMCSs) are performed for failure probability calculation. The paper is organised as follows. Section 2 presents a brief description of the uncertainties considered and their mathematical operations. Section 3 presents the failure probability assessment and response surface generation in the presence of imprecise uncertainties using HDMR. Both explicit and implicit numerical examples are solved in Sect. 4 to substantiate the proposed methodology. Summary and conclusion are presented in the last section.

Probability-box (P-box)

P-box approach merges conventional probability theory with the concept of intervals. The p-box gives bounds on CDF for an uncertain variable. Let \(x = [\underline{x} , \, \bar{x}]\) be a p-box, in which \(\underline{x}\) and \(\bar{x}\) are the LB and UB, respectively. In this paper, the notations followed are as follows: \({\mathbf{X}}\) for a vector of random variables, \(X\) for a random variable, \({\mathbf{x}}\) for a realisation vector, and \(x\) for a realisation of a random variable.

The possible value of \(F(x)\) lies in between \(\underline{F} \left( x \right)\) and \(\overline{F} \left( x \right)\) for every value of x, and it represents the p-box. The expression for failure probability from MCS is given by:

$$P_{f} \approx \frac{1}{{N_{s} }}\sum\limits_{k = 1}^{{N_{s} }} {I\left[ {g(x_{k} ) \le 0} \right]} \, = \left\{ \begin{aligned} 1,{\text{ if }}I[ \cdot ]{\text{ is true}} \hfill \\ 0,{\text{ if }}I[ \cdot ]{\text{ is false}} \hfill \\ \end{aligned} \right\}$$
(1)

where \(N_{s}\) is the number of samples, \(I[ \cdot ]\) is the indicator function, and \(x_{k}\) is the kth simulated sample of X, which can be generated using inverse transform method.

$$x_{k} = F_{X}^{ - 1} \left( {v_{j} } \right),\quad j = 1,2, \ldots N$$
(2)

where \(v_{j}\) is the sample of random variable. If the variable X is not precisely defined with known PDF, then it may be assumed to fall in between two extreme ranges for the PDF. In this context, failure probability also varies in ranges \(\left[ {\underline{P} , \, \bar{P}} \right]\). This interval of failure probability can be evaluated using IMCS [26] as expressed in Eqs. (3) and (4) which represent bounds of failure probability for all possible values of \(F(x)\).

$$\underline{P}_{f} = \hbox{min} \left\{ {\frac{1}{{N_{s} }}\sum\limits_{k = 1}^{{N_{s} }} {I\left[ {g\left( {F_{X}^{ - 1} (v_{j} )} \right) \le 0} \right]} } \right\}$$
(3)

and

$$\bar{P}_{f} = \hbox{max} \left\{ {\frac{1}{{N_{s} }}\sum\limits_{k = 1}^{{N_{s} }} {I\left[ {g\left( {F_{X}^{ - 1} (v_{j} )} \right) \le 0} \right]} } \right\}$$
(4)

Similarly, for \(g({\mathbf{x}})\), CDF intervals are defined as \(\left[ {\underline{g} , \, \bar{g}} \right]\). The interval bounds are expressed as Eqs. (5) and (6):

$$\underline{g} = \mathop {\hbox{min} }\limits_{{x \in \left[ {\underline{x} ,\overline{x} } \right]}} \left\{ {g({\mathbf{x}})} \right\}$$
(5)

and

$$\bar{g} = \mathop {\hbox{max} }\limits_{{x \in \left[ {\underline{x} ,\overline{x} } \right]}} \left\{ {g({\mathbf{x}})} \right\}$$
(6)

If X is a Gaussian variable with standard deviation of 0.5, and due to limited resource, mean of the variable is not precise and lies in an interval between [2, 3]; therefore, the variable is modelled as p-box. This imprecise information on the variable is shown in Fig. 1.

Fig. 1
figure 1

P-box with a mean [2, 3] and standard deviation 0.5

Failure Probability Assessment in the Presence of Imprecise Uncertainties

Failure probability \(\it \left( {P_{\text{f}} } \right)\) is the assessment of the probability of occurrence of an extreme event related to a given structure. In the usual setting, limit state function (LSF) describes the safety level of the structure for a given input vector \({\mathbf{X}} \in R\). The failure domain (FD), i.e. \(F = \, \{ {\mathbf{X}} \in R|g({\mathbf{X}}) \le 0\}\), corresponds to the set of inputs for which the performance function \(g({\mathbf{X}}) \le 0\).\(P_{f}\) evaluation demands calculation of the probability that the response oversteps a threshold limit, defined by a LSF, mathematically represented as:

$$P_{\text{f}} = P({g({\mathbf{X}}) \le 0}) =\int\limits_{{g({\mathbf{x}}) \le 0}} {p_{{\mathbf{X}}} ({\mathbf{x}})} {\text{d}}x$$
(7)

where \({\mathbf{X}} = \left\{ {X_{1} , \, X_{2} , \ldots , \, X_{N} } \right\}\) is the N-dimensional vector of random variables of the system under observation; \(g({\mathbf{X}})\) is the LSF such that \(g({\mathbf{X}}) \le 0\) represents the FD; and \(\ p_{{\mathbf{X}}} ({\mathbf{x}})\) is the joint PDF.

HDMR is a correlated function expansion which maps input–output in an orderly manner. Let \(g({\mathbf{X}})\) be the response function with N input variables. The first-order HDMR expansion is defined as:

$$g\left( {\mathbf{X}} \right) = g_{0} + \mathop \sum \limits_{i = 1}^{N} g_{i} \left( {x_{i} } \right) + \mathop \sum \limits_{1 \le i \le j \le N} g_{ij} \left( {x_{i} , x_{j} } \right) + \mathop \sum \limits_{1 \le i \le j \le k \le N} g_{ijk} \left( {x_{i} , x_{j} ,x_{k} } \right) + \cdots + g_{12 \ldots N} \left( {x_{1} , x_{2} , \ldots , x_{N} } \right)$$
(8)

where \(g_{0}\) is a constant term representing the response at c. Here, \(g_{i} (x_{i} )\) is a first-order term indicating the effect of \(x_{i}\) acting alone. The function \(g_{{i_{1} i_{2} }} (x_{{i_{1} }} , \, x_{{i_{2} }} )\) is a second-order term which defines the interdependent effects of the variables \(x_{{i_{1} }}\) and \(x_{{i_{2} }}\), and \(g_{1 \, 2, \ldots , \, N} \left( {x_{1} , \, x_{2} , \ldots , \, x_{N} } \right)\) represents any residual dependence of all the input variables which shows the impact on the output \(g({\mathbf{X}})\).

In contrast to ANOVA–HDMR, which is one of the forms of HDMR, the cut-HDMR specifically exhibits the output in the hyper-plane passing through a reference point c defined in the variable space [14, 33]. After determining each component in Eq. (8), the resulting equation gives HDMR function by replacing the original computationally exorbitant model. The behaviour of the output \(g({\mathbf{X}})\) is collectively extracted from lower-order correlations up to second order of the input variables [5, 6, 23, 35], by ignoring higher-order terms in Eq. (8). The steps involved for the response surface generation by HDMR are shown in Fig. 2. Here, N and n are number of variables present in the structural system under consideration and number of sample points considered on the variable axes. \(\upphi_{j} (x_{i} )\) is Lagrange’s interpolation term for first-order HDMR.

Fig. 2
figure 2

Flowchart for generating the first-order HDMR-based function

The sampling system for first-order HDMR for a function having one variable \((X)\) and two variables \((X_{1} {\text{ and }}X_{2} )\) is shown in Fig. 3a, b, respectively.

Fig. 3
figure 3

Sampling scheme for first-order HDMR: a for a function having one variable \((X)\) and b for a function having two variables \(\left( {X_{1} {\text{ and }}X_{2} } \right)\)

Based on the previous studies [5, 6, 14, 23, 34, 35], it is witnessed that up to second-order expansions are likely sufficient to characterise outputs of varied realistic systems. In this paper, first-order HDMR expansions are utilised for obtaining \(\tilde{g}({\mathbf{X}})\) as shown in Fig. 2. For all the values of N and n, the function has to be evaluated \(N \times \left( {n - 1} \right) + 1\) number of times for first-order HDMR [35].

Numerical Examples

In this paper, four numerical examples are illustrated with proposed HDMR-based uncertainty analysis. Both explicit and implicit structural problems are presented to demonstrate the efficiency and applicability of the proposed method. An exact continuous and approximated function obtained from the HDMR technique is simulated with IMCS for different function evaluations, and variation in the bounds of response of the models and failure probability is studied.

Creep–Fatigue Interaction

A nonlinear creep–failure criterion is considered on the basis of creep and fatigue damage accumulation. The initial explicit model is defined as:

$$g\left( {N_{\text{c}} ,N_{\text{f}} ,n_{\text{c}} ,n_{\text{f}} ,\theta_{1} ,\theta_{2} } \right) = 2 - \exp \left( {\theta_{1} D_{\text{c}} } \right) + \frac{{\exp \left( {\theta_{1} } \right) - 2}}{{\exp \left( {\theta_{2} } \right) - 1}}\left( {\exp \left( { - \theta_{2} D_{\text{c}} } \right) - 1} \right) - D_{\text{f}}$$
(9)

where \(D_{\text{c}} {\text{ and }}D_{\text{f}}\) represent the creep damage and fatigue damage, such that, \(D_{\text{c}} = n_{\text{c}} /N_{\text{c}}\) and \(D_{\text{f}} = n_{\text{f}} /N_{\text{f}}\). \(n_{\text{c}} {\text{ and }}N_{\text{c}}\) are a number of loading cycles and life of creep, and \(n_{\text{f}} {\text{ and }}N_{\text{f}}\) are a number of loading cycles and life of fatigue. \(\theta_{1} {\text{ and }}\theta_{2}\) are the experimental parameters. The input parameters are listed in Table 1.

Table 1 Input parameters for creep–fatigue interaction

The function in Eq. (9) is analysed using the present method by deploying three samples along each of the variable axis. The original function was evaluated only at the selected sample point locations, and the corresponding component functions are developed; then, the HDMR response function was obtained. The IMCS was applied on the developed HDMR function by simulating the input variables based on their characterisation. In this process, 13 number of function evaluations resulted for the construction of the HDMR function. To study the impreciseness of the failure probability, the CDF was drawn for the LB and UB, as shown in Fig. 4. The function was analysed using other available methods like direct IMCS, FORM, and SORM. Here, the direct IMCS, which requires 500,000 function evaluations, is taken as the reference for the comparison. The proposed technique predicts the failure probability with least computational effort. However, the bounds are wider compared to IMCS. Therefore, the chosen number of sample points along each of the axis was varied from 3 to 7 for seeking improvement in the constructed HDMR model, and the results are presented in Table 2 and Fig. 4. The CDFs for \(n = 5\) and \(n = 7\) are overlapping exactly with the original function, wherein the pattern of CDFs for \(n = 3\) is significantly showing the deviation, and the curves are wide covering ample of bounds.

Fig. 4
figure 4

CDFs of limit state function of creep–fatigue interaction

Table 2 Bounds of failure probability of creep–fatigue interaction

Fracture Strength of Turbine Blade and Disc

The fracture strength of a turbine blade and disc shown in Fig. 5 is considered in this example. The radius of crack of the turbine disc exposed to cyclic loading \(N_{\text{c}}\) is defined as:

$$a = \left( {\pi^{m/2} (2 - m)/2 \cdot c \cdot N_{\text{c}} (2F\sigma_{\hbox{max} } /\pi )^{m} + a_{0}^{1 - m/2} } \right)^{2/(2 - m)}$$
(10)

where c, \(\sigma_{\hbox{max} }\), and \(a_{0}\) represent the crack propagation, maximum stress, and the radius of initial crack on the surface of the material. The correction factor F and the crack propagation index m are taken as 1.122 and 3.285, respectively. The stress intensity factor for maximum stress near crack tip is defined as \(K_{\hbox{max} } = 2F\sqrt {\pi a} /\pi \cdot \sigma_{\hbox{max} }\). The LSF is expressed as the difference of critical fracture toughness \(K_{IC}\) and stress intensity factor.

$$g = K_{IC} - K_{\hbox{max} } - 73$$
(11)
Fig. 5
figure 5

Turbine blade and turbine disc

The input p-box variables considered for this study are listed in Table 3. Practically, the number of loading cycles \(N_{\text{c}}\) can be accurately controlled as a constant in a small amount of the fatigue test; similarly, the crack propagation (c) as well as the initial surface crack radius (\(a_{0}\)) can also be simply treated as constants when the crack is clear.

Table 3 Input parameters of turbine blade and turbine disc

The first-order HDMR function is established for the fracture strength of turbine disc by considering n sample points along each of the variable axes, considering c as the mean of p-box variables. Also, the function in Eq. (11) is evaluated using crude IMCS for comparing the computational effort of the HDMR uncertainty analysis which is appreciably lesser than the original function evaluation. Only five function evaluations were required for obtaining the responses, which is computationally very less intensive compared to the direct IMCS (i.e. 500,000 function evaluations). Bounds of failure probability of the function are presented in Table 4, and the corresponding CDFs of the responses are shown in Fig. 6.

Table 4 Bounds of failure probability of turbine blade and turbine disc
Fig. 6
figure 6

CDFs of limit state function of turbine blade and turbine disc

Further, n is varied from 3 to 7 to examine the accuracy. It is evident from Table 4 that the failure probability bounds for the LSF obtained using HDMR (\(n = 3\)) have a slight variation with reference to the original model. However, for \(n = 5,{\text{ and }}7\), the CDFs are exact. The effort taken for getting zero error is (nine evaluations) much lesser than that of the direct IMCS. However, as there is no error resulted for the higher values of n, the respective bounds are overlapping in Fig. 6.

Portal Frame

Figure 7 shows the portal frame taken for our study as the implicit problem [35]. The cross-sectional areas \(A_{1} {\text{ and }}A_{2}\) (log-normal) and horizontal load P (normal) are modelled as imprecise uncertainties, and their parameters are given in Table 5.

Fig. 7
figure 7

Portal frame

Table 5 Input parameters for portal frame

The values of the second moment of areas are expressed as \(I_{i} = \alpha_{i} A_{i}^{2} \left( {i = 1,2;\alpha_{1} = 0.08333,\alpha_{2} = 0.16670} \right)\). The LSF is defined for deflection in the horizontal direction at the top of the frame.

$$g\left( {A_{1} ,A_{2} ,P} \right) = \Delta_{h} - \Delta_{\lim }$$
(12)

where \(\Delta_{\lim } = 6{\text{ mm}}\) and \(\Delta_{h}\) is the horizontal deflection. The LSF for the displacement of the portal frame is derived using the first-order HDMR technique by distributing n sample points along each of the variable axis and taking, respectively, the mean values of the p-box variables as reference point c.

The failure probability is evaluated for all the values of n = 3. Only seven function evaluations were required for obtaining the responses, which is computationally very less intensive compared to the direct IMCS (i.e. 500,000 function evaluations). Bounds of failure probability of the function are presented in Table 6, and the corresponding CDFs of the responses are shown in Fig. 8. Table 6 also presents the bounds of failure probability for sample points \(n = 5{\text{ and }}7\), and FE results by direct IMCS without adopting HDMR for evaluating the efficiency of the methodology. Figure 8 shows CDFs of LSF for different sample points.

Table 6 Bounds of failure probability of portal frame
Fig. 8
figure 8

CDFs of limit state function of portal frame

The results of the frame for three different variables show the different bounds for all sample points when compared with the direct FE analysis using IMCS without adopting HDMR. From Table 6, for sample points 3 and 5, bounds are nearer; however, \(n = 7\) significantly reduces the error with minimum computational effort compared to direct IMCS.

Plane Truss

A 15-bar steel truss structure [26] is studied with small modification, as shown in Fig. 9. The cross-sectional areas (normal) are \({\mathbf{A}} = \left\{ {A_{1} , \, A_{2} , \ldots , \, A_{15} } \right\}\), and three loads (log-normal) \({\mathbf{P}} = \left\{ {P_{1} , \, P_{2} , \, P_{3} } \right\}\) are considered as p-box variables for the study. First and second moments are listed in Table 7. The LSF is defined for vertical deflection at the node-5 with limit of 0.06 m.

$$g\left( {A_{1} ,A_{2} , \ldots A_{15} ,P_{1} ,P_{2} ,P_{3} } \right) = \delta_{v} - \delta_{\lim }$$
(13)

where \(\delta_{\lim } = 0.06{\text{ m}}\) and \(\delta_{v}\) is the vertical deflection obtained by the proposed method. From FE analysis, vertical deflection at node-5 is calculated for all the mean values of variables for deriving the explicit approximated LSF. The failure probability is evaluated for all the values of \(n = 3\). As the number of input variables is high compared to other examples, 37 function evaluations were required for obtaining the responses, which is still computationally very less intensive compared to the direct IMCS (i.e. 500,000 function evaluations). Bounds of failure probability of the function are presented in Table 8, and the corresponding CDFs of the responses are shown in Fig. 10.

Fig. 9
figure 9

Plane truss

Table 7 Input parameters for plane truss
Table 8 Bounds of failure probability of plane truss
Fig. 10
figure 10

CDFs of limit state function of plane truss

Similar to the other examples, a parametric study has been carried out for \(n = 3, \, 5,{\text{ and }}7\). Figure 10 shows CDFs of failure probability for the vertical deflection at node-5 for different sample points. The exact ranges are presented in Table 8 as LB and UB, along with the computational efficiency. The errors in the results are compared with that of FE analysis using IMCS. Sample point \(n = 7\) is showing more accurate results compared to \(n = 3{\text{ and }}5\). The effort taken for FE analysis with IMCS is way greater than the effort taken for sample point \(n = 7\) (i.e. 109).

Summary and Conclusions

Uncertainties are inherent to most of the physical processes, and therefore, these uncertainties should be quantified efficiently, in which, the characterisation of uncertainty in the input variables is crucial. It is encouraged to consider different kinds of uncertainties in the system, when the information of the uncertainties is imprecise in real-time systems. In this context, a computationally efficient uncertainty analysis method to estimate the failure probability of structural system by modelling the input variables using probability-box is presented in this paper. The behaviour of the system is modelled by the concepts of HDMR, and interval Monte Carlo simulations are implemented to predict the responses in the numerical examples.

Distributions of all the variables are predefined, and no assumptions are introduced in the numerical examples. The proposed method holds good for all kinds of distributions of parameters with inadequate statistical data. The numerical examples, which possess the explicit nonlinear functions, are directly used for simulation; then, the function is approximated using first-order HDMR in which the polynomial-type function replaces the original nonlinear function. The number of sample points in each of the variable axis is varied from 3 to 7 so that the improvement in the accuracy of the approximated HDMR function is witnessed. The simulation of the original nonlinear function over million times is evidently cumbersome as compared to the HDMR-based polynomial function in terms of computation effort. However, in case of implicit examples, the FE analysis carried out for original function evaluation with IMCS is very tedious, as each run takes more time. Therefore, first-order HDMR is adopted to derive the performance function, and the simulation is performed on the HDMR function. The CDFs are presented for all the bounds of the LSF, as the input variables are represented by imprecise probability distributions. In all four examples, the bounds of the failure probabilities obtained from the proposed method are closer to the failure probability bounds of the original model evaluation from the interval Monte Carlo method with lesser effort and high efficiency. In the example of portal frame, it is evident that sample point \(n = 7\) shows the lesser percentage of error with the lesser effort of 19 evaluations compared to direct IMCS, whereas in the example of the plane truss, \(n = 7\) shows more accurate results as compared to \(n = 3{\text{ and }}5\) with a lesser percentage of error. The effort taken for FE analysis with IMCS (i.e. 5, 00,000) is way greater than the effort taken for \(n = 7\) (i.e. 109). From the results obtained using the proposed methodology, the application of HDMR makes the uncertainty quantification more efficient when the imprecise uncertainties are characterised as p-box variables in the systems. It is recommended that the proposed method can be applied to any kind of imprecise uncertainties with less computational effort without compromising on the accuracy.