Reliability measure approach for confidence-based design optimization under insufficient input data

Jung, Yongsu; Cho, Hyunkyoo; Lee, Ikjin

doi:10.1007/s00158-019-02299-3

Reliability measure approach for confidence-based design optimization under insufficient input data

Research Paper
Published: 11 June 2019

Volume 60, pages 1967–1982, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

Reliability measure approach for confidence-based design optimization under insufficient input data

Download PDF

755 Accesses
23 Citations
Explore all metrics

Abstract

In most of the reliability-based design optimization (RBDO) researches, accurate input statistical model has been assumed to concentrate on the variability of random variables; however, only a limited number of data are available to quantify the input statistical model in many practical engineering applications. In other words, irreducible variability and uncertainty due to lack of knowledge exist simultaneously in random design variables, which may result in uncertainty of reliability. Therefore, the uncertainty induced by insufficient data has to be accounted for RBDO to guarantee the confidence of reliability. Using the Bayesian approach, the uncertainty of input distributions is successfully propagated to a cumulative distribution function (CDF) of reliability under reasonable assumptions, but it requires a number of function evaluations in double-loop Monte Carlo simulation (MCS). To tackle this challenge, the reliability measure approach (RMA) in confidence-based design optimization (CBDO) is proposed to handle the uncertainty of reliability following the idea of performance measure approach (PMA) in RBDO. Input distribution parameters are transformed to random variables following the standard normal distribution for the most probable point (MPP) search based on the proposed stochastic sensitivity analysis of reliability. Therefore, the reliability is approximated at MPP with respect to input distribution parameters. The proposed CBDO can treat confidence constraints employing the reliability value at the target confidence level that is approximated by MPP in standard normal space. In conclusion, CBDO can be performed in a probabilistic space of input distribution parameters corresponding to the conventional U-space in RBDO to yield the probability (confidence) that reliability is larger than the target reliability. The proposed method can significantly reduce the number of function evaluations by eliminating outer-loop MCS while maintaining acceptable accuracy. Numerical examples are used to demonstrate the effectiveness of the developed sensitivity analysis and RMA to estimate the confidence of reliability in CBDO.

Probabilistic sensitivity analysis for novel second-order reliability method (SORM) using generalized chi-squared distribution

Article 01 October 2014

Conservative reliability-based design optimization method with insufficient input data

Article 06 June 2016

Extending SORA method for reliability-based design optimization using probability and convex set mixed models

Article 31 October 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Reliability has been utilized to measure the safety of the system in the presence of various uncertainties that can be classified as inherent variability (i.e., aleatory uncertainty) and reducible uncertainty due to a lack of knowledge (i.e., epistemic uncertainty). These two uncertainties are propagated to the uncertainty of the system performances that may lead to unexpected failure of the system, so that accurate reliability analysis with exhaustive representation and quantification of the uncertainties is indispensable for design optimization.

There have been numerous researches on reliability-based design optimization (RBDO) to obtain a reliable optimum design employing probabilistic quantification of uncertain performance. It can be typically categorized into sampling approaches (Dubourg et al. 2011; Lee et al. 2011a; Zhao et al. 2013; Wang and Wang 2014) and most probable point (MPP)-based approaches (Tu et al. 1999; Youn et al. 2003; Li et al. 2015; Choi et al. 2018). These extensive RBDO researches have generally assumed that statistical information of input random variables is fully accessible. However, obtaining true input statistical information from a set of finite samples is difficult due to the limitation of resource for testing samples to quantify the variability. Therefore, inaccurate input model is a much practical condition in real engineering problems. Epistemic uncertainty in simulation-based design optimization can be comprehensively defined including simulation model uncertainty due to the uncertainty of model parameters and model bias (Der Kiureghian and Ditlevsen 2009; Liang and Mahadevan 2011; Nannapaneni and Mahadevan 2016; Moon et al. 2017; Peng et al. 2017; Bae et al. 2017, 2018); hence, the uncertainty of input statistical models will be mainly treated in this paper.

There have been a few researches to quantify the uncertainty of input random variables induced by insufficient input dataset. In the early stage, possibility-based approaches (Du et al. 2006a, b; Lee et al. 2013) and interval uncertainty approaches (Rao and Cao 2002; Sankararaman and Mahadevan 2011; Yoo and Lee 2014; Muscolino et al. 2016) have been adopted, but the Bayesian approach is a more direct and intuitive way to use insufficient input data statistically. Gunawan and Papalambros (2006) assumed that the variability of reliability could be exhibited as the beta distribution and performed RBDO while maximizing the confidence of reliability. Youn and Wang (2008) proposed a similar approach focusing on the extreme case of the beta distribution to make a sufficiency, in which Bayesian reliability has to be smaller than exact reliability. On the other hand, the uncertainty of the input random variable can be handled by adjusting the standard deviation and correlation coefficient (Noh et al. 2011a, b). In addition, likelihood-based approaches have been proposed that the distribution of the parameters can be quantified through the maximum likelihood estimation (MLE) and Gaussian process interpolation (Sankararaman and Mahadevan 2011, 2013a, b). Instead of Bayes’ theorem, the researches utilizing the bootstrap method without the assumption that data follow certain distribution have been proposed (Picheny et al. 2010; Noh et al. 2011b). Ito et al. (2018) propose a conservative reliability index (CRI) decomposed into the aleatory part and the epistemic part for Gaussian input distribution. Meng et al. (2015b) and Hao et al. (2017) developed non-probabilistic RBDO to treat lack of input data by employing a convex model to describe uncertain-but-bounded parameters rather than the stochastic model. Recently, Cho et al. (2016a) and Moon et al. (2018) proposed conservative RBDO (CRBDO) to fully model the probability of reliability in the presence of the uncertainty of input distribution types and parameters. Employing Bayes’ theorem and Monte Carlo simulation (MCS) (Rubinstein and Kroese 2016), accurate confidence of reliability under given dataset can be obtained. However, estimating a cumulative distribution function (CDF) of reliability is very computationally demanding.

Therefore, the motivation of our research is to reduce the excessive amount of function evaluations to obtain the confidence of reliability. It is noted that the confidence of reliability has to be assessed for design optimization rather than the whole CDF of reliability. In this paper, we propose a reliability measure approach (RMA) and confidence-based design optimization (CBDO) which have constraints for the confidence of reliability. The reliability will be treated as a function of input distribution types and parameters. Due to the discreteness of input distribution type, it is divided into two approaches depending on prior knowledge of input distribution types, and kernel density estimation (KDE) (Silverman 2018) is used to quantify the variability of a random variable when its distribution type is totally unknown. Once the input distribution types are given or identified through any model identification method (Kang et al. 2016), the probabilistic domain that consists of input distribution parameters can be defined. The process has been inspired from performance measure approach (PMA) (Tu et al. 1999), but random design variables, performance functions, and reliability are replaced with input distribution parameters, reliability, and confidence of reliability, respectively. In other words, the input distribution parameter space is regarded as the upper-level space of input random variables where RMA is performed. The proposed method handles the confidence of reliability by searching MPP in the probabilistic space denoted as P-space transformed from input distribution parameter space. Consequently, the double-loop MCS can be reduced to a nested RMA-MCS loop leading to a significant reduction of function evaluations for MCS. Moreover, sensitivity analysis in P-space is derived through the first-order score functions that are an essential concept to facilitate stochastic sensitivity analysis in sampling-based RBDO (Lee et al. 2011b) and explicit expression of uncertainties.

The paper is organized as follows. PMA for RBDO and confidence of reliability in CRBDO are explained in Section 2. In Section 3, general RMA for CBDO and stochastic sensitivity analysis is formulated, and two approaches depending on the prior knowledge of input distribution types are given in Section 4. Then, the feasibility and effectiveness of the developed sensitivity analysis, RMA, and CBDO are verified in Section 5. The conclusion and future research will be given in Section 6.

2 Overview of reliability analysis and confidence of reliability

2.1 Performance measure approach for reliability-based design optimization

Reliability analysis is to obtain the probability of failure which is the probability that a performance function is larger than zero since G(X) > 0 means failure in this paper. The probability of failure is in general computed through the multi-dimensional integration as

$$ {P}_F\equiv \Pr \left[G\left(\mathbf{X}\right)>0\right]={\int}_{G\left(\mathbf{X}\right)>0}{f}_{\mathbf{X}}\left(\mathbf{x}\right)d\mathbf{x} $$

(1)

where G(X) is the performance function, and X is the vector of random design variables following the joint probability density function (PDF) of f_X(x).

Reliability-based design optimization (RBDO) with probabilistic constraints is formulated to

$$ {\displaystyle \begin{array}{l}\underset{\mathbf{d}}{\operatorname{minimize}\;}\kern0.48em \mathrm{cost}\left(\mathbf{d}\right)\\ {}\mathrm{subject}\ \mathrm{to}\kern0.48em \Pr \left[{G}_i\left(\mathbf{X}\right)>0\right]\le {P}_{F_i}^{\mathrm{Target}},\kern0.6em i=1,..., nc\\ {}\kern3.119999em {\mathbf{d}}^L\le \mathbf{d}\le {\mathbf{d}}^U\end{array}} $$

(2)

where X is the vector of random variables, d is the design vector defined as the mean of X, $ {P}_{F_i}^{\mathrm{Target}} $ is the target probability of failure for the ith constraint, and nc is the number of probabilistic constraints. Exact estimation of the probabilistic constraints in (2) defined by (1) is very difficult since it requires a large number of simulations. Therefore, performance measure approach (PMA) using MPP has been proposed (Tu et al. 1999; Youn et al. 2003, 2005; Yang and Yi 2009; Meng et al. 2015a; Keshtegar and Lee 2016). In PMA for RBDO, the MPP has the largest performance function value on the hypersphere with a radius of the given target reliability index β_t. If the performance function value at MPP is smaller than zero, the probabilistic constraint of PMA is satisfied. Thus, MPP in PMA can be obtained by solving the following optimization as

$$ {\displaystyle \begin{array}{l}\underset{\mathbf{u}}{\operatorname{maximize}}\kern0.75em G\left(\mathbf{x}\left(\mathbf{u}\right)\right)\\ {}\;\mathrm{subject}\ \mathrm{to}\kern0.74em \left\Vert \mathbf{u}\right\Vert ={\beta}_t\\ {}\kern1.08em \mathrm{where}\kern0.73em \mathbf{u}={\Phi}^{-1}\left({F}_{\mathbf{X}}\left(\mathbf{x}\right)\right)\end{array}} $$

(3)

where u is the random variable vector following the standard normal distribution transformed from original X-space by the Rosenblatt transformation (Rosenblatt 1952) where cumulative distribution function (CDF) of X is F_X(x). MPP, the optimum of (3), is denoted as u^∗. Hence, the probabilistic constraint in (2) can be rewritten using PMA as

$$ g\left({\mathbf{u}}^{\ast}\right)\le 0 $$

(4)

which is identical with G(x(u^∗)) ≤ 0 as a function at MPP in the original domain.

2.2 Confidence of reliability with insufficient input data

Confidence of reliability is defined as the probability that given target reliability is satisfied at a design point. Uncertainty of reliability is originated from input model uncertainty, distinguished from input variability, which means that insufficient input data induce uncertain input statistical models such as distribution types and parameters. Using Bayes’ theorem, joint PDF of the reliability under given dataset ^∗x can be obtained as (Cho et al. 2016a; Moon et al. 2018)

$$ f\left(\operatorname{Re},\boldsymbol{\upzeta}, \boldsymbol{\uppsi} |\ast \mathbf{x}\right)=f\left(\operatorname{Re}|\boldsymbol{\upzeta}, \boldsymbol{\uppsi}, \ast \mathbf{x}\right)P\left(\boldsymbol{\upzeta} |\boldsymbol{\uppsi}, \ast \mathbf{x}\right)f\left(\boldsymbol{\uppsi} |\ast \mathbf{x}\right) $$

(5)

where ζ and ψ are input distribution types and parameters, respectively. Since the distribution types are not continuous variables, the probability mass function (PMF) obtained from Bayes’ theorem is used in (5). Therefore, the confidence level of the given target reliability is obtained using CDF of the reliability as (Cho et al. 2016a)

$$ \mathrm{CL}\left(\operatorname{Re}|\ast \mathbf{x}\right)\equiv 1-{F}_{\mathrm{Re}}\left(\operatorname{Re}|\ast \mathbf{x}\right)=1-{\int}_0^{\mathrm{Re}}{\int}_{\varOmega_{\boldsymbol{\upzeta}}}{\int}_{\varOmega_{\boldsymbol{\Psi}}}f\left(\phi, \boldsymbol{\upzeta}, \boldsymbol{\uppsi} |\ast \mathbf{x}\right)d\boldsymbol{\uppsi} d\boldsymbol{\upzeta} d\phi $$

(6)

where ϕ is a variable for reliability bounded from 0 to 1.

The main difficulty to evaluate (6) is its large computational demands since double-loop MCS with a large number of samples has to be used: inner-loop MCS for reliability estimation and outer-loop MCS for estimation of confidence of reliability caused by the uncertainty of input distribution types and parameters. Using the double-loop MCS, CDF of the reliability in (6) is expressed as (Cho et al. 2016a)

$$ {F}_{\mathrm{Re}}\left(\operatorname{Re}|\ast \mathbf{x}\right)\cong \frac{1}{NMC{S}_{\mathbf{Z}} NMC{S}_{\boldsymbol{\Psi}}}\sum \limits_{n=1}^{NMC{S}_{\boldsymbol{\Psi}}}\sum \limits_{m=1}^{NMC{S}_{\mathbf{Z}}}{I}_{\left[0,\operatorname{Re}\right]}\left(\operatorname{Re}\left({\zeta}^{(m)},{\psi}^{(n)}\right)\right) $$

(7)

where I_[0, Re](ϕ) is an indicator function in which the value is 1 if ϕ is between 0 and Re, and 0 otherwise. It should be noted that the number of samples for the double-loop MCS is NMCS_Re × NMCS_Ζ × NMCS_Ψ where NMCS_Re, NMCS_Ζ, and NMCS_Ψ are the number of samples for estimation of Re(ζ^(m), ψ⁽ⁿ⁾), input distribution types, and input distribution parameters, respectively. Reliability is a function of input distribution types and parameters using indicator function due to the insufficient input dataset. So, the reliability is estimated through the sampling method as (Rubinstein and Kroese 2016)

$$ \operatorname{Re}\left(\boldsymbol{\upzeta}, \boldsymbol{\uppsi} \right)={\int}_{{\mathbb{R}}^N}I\left(G\left(\mathbf{x}\right)\right){f}_{\mathbf{X}}\left(\mathbf{x};\boldsymbol{\uppsi}, \boldsymbol{\upzeta} \right)d\mathbf{x} $$

(8)

where ψ is the input distribution parameter vector, ζ is the input distribution types, and I(G(x)) is the indicator function defined as

$$ I\left(G\left(\mathbf{x}\right)\right)=\Big\{{\displaystyle \begin{array}{l}\;1,\kern1.44em G\left(\mathbf{x}\right)\le 0\\ {}\;0,\kern1.32em \mathrm{otherwise}\end{array}} $$

(9)

2.3 Conservative reliability-based design optimization

General formulation of CRBDO considering the confidence of reliability as constraints can be formulated as

$$ {\displaystyle \begin{array}{l}\kern0.96em \mathrm{given}\kern0.36em \ast \mathbf{x}\\ {}\underset{\mathbf{d}}{\operatorname{minimize}}\kern0.36em \mathrm{cost}\left(\mathbf{d}\right)\\ {}\mathrm{subject}\ \mathrm{to}\kern0.36em \Pr \left[\Pr \left[{G}_i\left(\mathbf{X}\right)\le 0\right]\ge {\operatorname{Re}}_i^{\mathrm{Target}}\right]\ge {\mathrm{CL}}_i^{\mathrm{Target}}\kern0.36em \mathrm{for}\kern0.24em i=1,2,..., nc\end{array}} $$

(10)

which uses (7) to estimate the confidence of reliability at a design point under given dataset x. As can be seen from (10), CRBDO yields an optimum considering epistemic uncertainty caused by insufficient input data but requires a huge amount of function evaluations. Uncertainty quantification of input distribution parameters and types used to evaluate the confidence of reliability in (10) will be explained in the next section.

3 Confidence-based design optimization using a reliability measure approach

In this section, confidence-based design optimization (CBDO) using a reliability measure approach (RMA) is proposed to resolve the efficiency issue of CRBDO. Decomposition of input dataset can be referred to Section 2.2 of the literature (Cho et al. 2016a). An underlying condition in CBDO is that there are random variables in which statistical information is not given and should be estimated using only a few samples. Accordingly, the confidence of reliability will be mainly treated as a probabilistic constraint instead of reliability used in conventional RBDO since the reliability of a system has also uncertainty propagated from the uncertainty on the input statistical model.

There are two sources of uncertainty in the statistical model of input random variable: input distribution types and parameters. Since the input distribution type is not a continuous random variable, the proposed method takes variability of input distribution parameters into consideration in the probabilistic domain to obtain confidence of reliability under two circumstances: 1) input distribution types are known, but input distribution parameters are unknown and 2) both input distribution types and parameters are unknown. In this study, uncertainties of input distribution parameters are quantified based on the normality condition, and unknown distribution types are described by kernel density estimation (KDE).

Under the aforementioned configurations, the confidence of reliability defined as a probability that the reliability of a performance function is larger than the user-specified target reliability needs to be computed in CBDO. However, it is very difficult to be calculated even if a surrogate model is used because double-loop MCS is necessary for repetitive reliability computations with respect to uncertain parameters. Therefore, RMA is proposed to efficiently estimate the confidence of reliability in parameter space by utilizing the framework of PMA. Reliability and input distribution parameters correspond to a performance function and random variables in PMA, respectively. Therefore, RMA finds MPP for the reliability with the target confidence in the input distribution parameter space and thus requires several MCSs only for searching MPP, unlike CRBDO. After finding the MPP, the confidence of reliability is obtained using a linear approximation of the reliability function in parameter space. Consequently, RMA can significantly reduce the computational burden in CRBDO since it estimates the confidence of reliability in an analytic way rather than sampling methods.

The uncertainty quantification of input distribution model is presented in Section 3.1. The formulation of the proposed RMA in P-space and stochastic sensitivity analysis for reliability based on uncertainty quantifications of input distribution parameters are explained in Section 3.2. Two cases for known and unknown input distribution types are discussed in Section 4.

3.1 Uncertainty of input distribution model

Variabilities of input distribution parameters that arise due to insufficient input data have to be explicitly quantified to define probabilistic space for reliability estimation. Under the normality assumption described in the literature (Gelman et al. 2013; Cho et al. 2016a), variabilities of mean and variance are expressed as normal and inverse-gamma distributions, respectively. The normality assumption does not mean that the true distribution type is a normal distribution, but it is the intermediate assumption for uncertainty quantifications of input distribution parameters without causing a significant error. Uncertainties of variance and mean of the ith random variable can be described as

$$ {\sigma}_i^2\mid \ast {\mathbf{x}}_i\sim \mathrm{IG}\left(\frac{ND-1}{2},\frac{\left( ND-1\right){s}_i^2}{2}\right) $$

(11)

and

$$ {\mu}_i\mid {\sigma}_i,{{}^{\ast }{}_{\mathbf{x}}{}^{\ast}\mathbf{x}}_i\sim N\left(\ast {\overline {\mathbf{x}}}_i,\sqrt{\frac{\sigma_i^2}{ND}}\right), $$

(12)

respectively, where ^∗x_i is given input data for the ith random variable; ND is the number of input data; $ {s}_i^2 $ is the sample variance of the ith random variable; and $ \ast {\overline {\mathbf{x}}}_i $ is the sample mean of the ith random variable. It is noted that the normal distribution of the mean in (12) is conditional to the variance in (11). The sample mean in (12) will change during design optimization since it is used as a design vector d in (10). In other words, the dataset is used to identify the initial design and sample variance which is assumed to be constant during the optimization, and design movement can be achieved by shifting the data. Obviously, the sample mean of random parameter is invariant during the optimization

The PDF of reliability under given input distribution parameters and types becomes a Dirac delta function expressed by

$$ f\left(\operatorname{Re}|\boldsymbol{\uppsi}, \boldsymbol{\upzeta}, \ast \mathbf{x}\right)=\delta \left(\operatorname{Re}-\operatorname{Re}\left(\boldsymbol{\uppsi}, \boldsymbol{\upzeta} \right)\right) $$

(13)

where ψ is the input distribution parameters, and ζ is the input distribution types that can be either specific parametric distribution or nonparametric KDE depending on prior knowledge as will be explained in Section 4. Consequently, when ψ and ζ are given, the reliability is considered as a deterministic value obtained from MCS, so that PDF of reliability has an infinity value only at the deterministic reliability, Re(ψ, ζ).

3.2 Reliability measure approach

In this section, RMA to explore MPP in the standard normal parameter space (P-space) transformed from input distribution parameter space is explained. In RMA, MPP search is performed in P-space to find the smallest reliability on the given hypersphere with a radius β_CL derived from the target confidence level, CL_Target. Note that the smallest reliability should be explored unlike the maximization problem in (3). Figure 1 illustrates the concept of RMA in P-space and design history for searching MPP where N is the number of random variables with insufficient samples. The initial point is set to origin in the standard normal P-space in general, and optimizer finds the next MPP candidate point based on the gradient of reliability in P-space which can be obtained from sensitivity analysis of reliability with respect to each parameter employing the first-order score functions. Once the MPP is identified as black filled circle in Fig. 1, the reliability at the MPP calculated from transformed input distribution parameters denoted as ψ including mean and variance is used to judge the violation of the confidence constraint. In Fig. 1, the reliability at the MPP is estimated as 0.87 that has to be larger than the target reliability. If the target confidence is set to 0.87, RMA for CBDO will decide that the current design point satisfies the confidence constraint even though the yellow region is the true unconfident region where reliability is less than the target reliability. Details of formulation and sensitivity analysis will be explained in Sections 3.2.1, 3.2.2, and 3.2.3.

3.2.1 Formulation of RMA

The optimization to find MPP in P-space can be formulated to

$$ {\displaystyle \begin{array}{l}\underset{\mathbf{p}}{\operatorname{minimize}}\kern0.62em \operatorname{Re}\left(G\left(\mathbf{X}\right)|\boldsymbol{\upmu}, {\boldsymbol{\upsigma}}^2\right)\\ {}\;\mathrm{subject}\ \mathrm{to}\kern0.5em \left\Vert \mathbf{p}\right\Vert ={\Phi}^{-1}\left({\mathrm{CL}}_{\mathrm{Target}}\right)\\ {}\kern1.08em \mathrm{where}\kern0.49em \operatorname{Re}\left(G\left(\mathbf{X}\right)|\boldsymbol{\upmu}, {\boldsymbol{\upsigma}}^2\right)=\int {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right){f}_{\mathbf{X}}\left(\mathbf{x};\boldsymbol{\upmu}, {\boldsymbol{\upsigma}}^2\right)d\mathbf{x}\cong \frac{1}{N_{MCS}}\sum \limits_{j=1}^{N_{MCS}}{I}_{\varOmega_R}\left(G\left({\mathbf{x}}^{(j)}\right)\right)\\ {}\kern3.119999em {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right)=\Big\{\begin{array}{l}1,\kern0.96em G\left(\mathbf{x}\right)\le 0\\ {}0,\kern0.84em \mathrm{otherwise}\end{array}\kern2.879999em \\ {}\kern3.239999em \Phi \left({p}_{2i-1}\right)={F}_{\sigma_i^2}\left({\sigma}_i^2|\ast {\mathbf{x}}_i\right)\\ {}\kern3.239999em \Phi \left({p}_{2i}\right)={F}_{\mu_i}\left({\mu}_i|{\sigma}_i^2,\ast {\mathbf{x}}_i\right)\kern0.84em for\kern0.24em i=1,2,...N\;\end{array}} $$

(14)

In Eq. (14) Re(G(X)| μ, σ²) is reliability for the given performance function G(X) and realizations of input distribution parameters. It means that uncertainty of performance function propagated from a random vector X is conditional to input distribution parameters. CL_Target is the target confidence level to be satisfied in CBDO. The indicator function for a reliable domain, $ {I}_{\varOmega_R}\left(G\left({\mathbf{x}}^{(j)}\right)\right) $, is to judge whether the jth sampling point is in the reliable region or not. N_MCS means the number of samples for reliability estimation, and N is the number of random variables which are assumed to be independent in this study. It can be seen that two input distribution parameters for the ith random variable following $ {F}_{\mu_i}\left({\mu}_i|{\sigma}_i^2,\ast {\mathbf{x}}_i\right) $ and $ {F}_{\sigma_i^2}\left({\sigma}_i^2|\ast {\mathbf{x}}_i\right) $ are transformed to P-space following the standard normal distribution. The MPP that has minimal reliability as a function of input distribution parameters in P-space would be explored, and the number of random variables in P-space becomes 2N since a two-parameter distribution is assumed for the input random variables.

3.2.2 Stochastic sensitivity analysis in RMA

Sensitivity analysis of reliability with respect to mean and variance is essential for efficient and accurate gradient-based optimization. The stochastic sensitivity analysis has been discussed in the previous studies to develop the sampling-based RBDO (Lee et al. 2011a, b; Cho et al. 2016b). The sensitivity analysis with respect to the ith mean can be described using the first-order score function and chain rule as

$$ \frac{\partial \operatorname{Re}}{\partial {P}_{\mu_i}}=\frac{\partial \operatorname{Re}}{\partial {\mu}_i}\frac{\partial {\mu}_i}{\partial {P}_{\mu_i}}=\int {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right)\frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\mu}_i}{f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)d\mathbf{x}\times \frac{\partial {\mu}_i}{\partial {P}_{\mu_i}} $$

(15)

where the PDF of mean is given in (12). The derivative of mean with respect to a corresponding $ {P}_{\mu_i} $ can be obtained through the following relationship as

$$ \frac{\partial {\mu}_i}{\partial {P}_{\mu_i}}=\frac{\mathrm{\partial \Phi}\left({P}_{\mu_i}\right)}{\partial {P}_{\mu_i}}/\frac{\partial {F}_{\mu_i}\left({\mu}_i\right)}{\partial {\mu}_i}=\frac{\phi \left({P}_{\mu_i}\right)}{f_{\mu_i}\left({\mu}_i\right)} $$

(16)

where $ {F}_{\mu_i}\left({\mu}_i\right)=\Phi \left({P}_{\mu_i}\right) $ and $ {P}_{\mu_i} $ is a random variable following the standard normal distribution transformed from the mean of i-th random variable denoted as μ_i. ϕ and Φ are PDF and CDF of the standard normal distribution, respectively. On the other hand, the sensitivity analysis with respect to the variance of i-th random variable, which seems to be more complicated since the mean is conditional to the variance, is derived as

$$ {\displaystyle \begin{array}{l}\frac{\partial \operatorname{Re}}{\partial {P}_{\sigma_i^2}}=\frac{\partial \operatorname{Re}}{\partial {\sigma}_i^2}\frac{\partial {\sigma}_i^2}{\partial {P}_{\sigma_i^2}}=\int {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right)\frac{d\ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{d{\sigma}_i^2}{f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)d\mathbf{x}\times \frac{\partial {\sigma}_i^2}{\partial {P}_{\sigma_i^2}}\\ {}\\ {}\kern5.039997em =\int {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right)\left(\frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\sigma}_i^2}+\frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\mu}_i}\frac{d{\mu}_i}{d{\sigma}_i^2}\right){f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)d\mathbf{x}\times \frac{\phi \left({P}_{\sigma_i^2}\right)}{f_{\sigma_i^2}\left({\sigma}_i^2\right)}\end{array}} $$

(17)

where $ {P}_{\sigma_i^2} $ is a random variable following the standard normal distribution corresponding to $ {\sigma}_i^2 $. The PDF of variance is given in (11). The derivatives of the random variable with respect to distribution parameters can be shown in the literature using the relationship of the Rosenblatt transformation written as (Cho et al. 2017)

$$ \frac{d{x}_i}{d{p}_i}=-\frac{1}{f_{X_i}\left({x}_i;{a}_i,{b}_i\right)}\frac{\partial }{\partial {p}_i}{F}_{X_i}\left({x}_i;{a}_i,{b}_i\right) $$

(18)

where p_i corresponds to either mean or variance of i-th random variable, and a_i and b_i are general distribution parameters as functions of mean and variance. Therefore, the derivative of mean with respect to variance in (17) can be derived as

$$ {\displaystyle \begin{array}{l}\frac{d{\mu}_i}{d{\sigma}_i^2}=-\frac{1}{f_{\mu_i}\left({\mu}_i;{{}^{\ast}\overline{x}}_i,\sqrt{\sigma_i^2/ ND}\right)}\times \frac{-\left({\mu}_i-{{}^{\ast}\overline{x}}_i\right)}{2{\sigma}_i^3/\sqrt{ND}}\times \phi \left(\frac{\mu_i-{{}^{\ast}\overline{x}}_i}{\sqrt{\sigma_i^2/ ND}}\right)\\ {}\kern1.44em =\frac{\mu_i-{{}^{\ast}\overline{x}}_i}{2{\sigma}_i^2}\end{array}} $$

(19)

where $ {{}^{\ast}\overline{x}}_i $ is the sample mean of data which equivalent to design point of ith random variable. In addition, the first-order score functions of various parametric distribution, $ \frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\mu}_i} $ and $ \frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\sigma}_i^2} $ used in (15–17), are derived analytically in the literature (Lee et al. 2011a, b; Cho et al. 2016b).

Searching MPP with any algorithms of PMA demands the gradient vector at the current point, and thus the gradient vector of reliability with respect to input distribution parameters is necessary. Employing developed sensitivity analysis in this section, searching MPP in P-space can be performed without any additional MCSs to obtain the gradient vector. Therefore, only one MCS can provide the gradient vector to any optimizer in searching MPP.

3.2.3 Sensitivity analysis in CBDO

Like conventional PMA, sensitivity analysis with respect to a design variable vector—mean vector of random variables—in CBDO can be described as (Lee et al. 2010)

$$ \frac{\partial \operatorname{Re}}{\partial {d}_i}{\left|{}_{\mathbf{p}={\mathbf{p}}^{\ast }}=\frac{\partial \operatorname{Re}}{\partial {\mu}_i}\right|}_{\mathbf{p}={\mathbf{p}}^{\ast }}=\int {I}_{\varOmega_R}\left(G\left(\mathbf{x}\right)\right)\frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)}{\partial {\mu}_i}{f}_{\mathbf{x}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right)d\mathbf{x} $$

(20)

Equation (20) can be easily obtained since $ \frac{\partial \ln {f}_{\mathbf{x}}\left(\mathbf{x};\boldsymbol{\uppsi} \right)}{\partial {\mu}_i} $ is already used in (15) and (17). Unlike sensitivity analysis in searching MPP described in Section 3.2.2, sensitivity analysis of reliability with respect to mean vector is used to find the optimal design point rather than MPP. In other words, CBDO has a double-loop algorithm composed of MPP search as an inner loop and optimal design search as an outer loop. Sensitivity analyses in Sections 3.2.2 and 3.2.3 are for the inner loop and outer loop, respectively. Note that (20) is equivalent to (15), except that the probabilistic space has to be transformed to P-space.

Using RMA, MPP in P-space is obtained, and reliability at the MPP is used as an index of violation under the given confidence of a reliability constraint. Since the function approximation method is the same as the conventional MPP-based reliability analysis, the second-order reliability method (SORM) (Lee et al. 2012; Park and Lee 2018), and MPP-based dimension reduction method (DRM) (Rahman and Xu 2004; Jung et al. 2019) can be utilized for more accurate confidence estimation.

4 Two approaches for input distribution type uncertainty

As mentioned earlier, there are two sources of uncertainties in input statistical models: input distribution parameters and distribution types. In Section 3, only uncertainties of input distribution parameters are treated for RMA and their sensitivity analysis. In this section, two approaches for input distribution type uncertainties—known input distribution type and unknown input distribution type—are explained for estimation of reliability and its sensitivity for CBDO.

4.1 Known input distribution type

Knowing input distribution types means that a designer has set the distribution types based on any prior knowledge or has figured out the most likely distribution type. For instance, model selection methods can determine the best distribution for the given data among multiple candidate distributions, and acceptance or rejection of the hypothesis with respect to the adequacy of given distribution type can be determined through goodness-of-fit (GOF) tests. Thus, the designer is able to use the model identification methods before the proposed CBDO as a preprocessing if there is no particular clue about the distribution type. Since the identification of a distribution type is beyond the purpose of this research, readers can refer to recent research on distribution type identification (Kang et al. 2016). The reason for separating uncertainty of a distribution type and its parameters is that it is difficult to include discrete variables for MPP-based approaches, and even if it is possible, it requires additional assumption and can become too complicated since two uncertainties are correlated.

When input distribution types are known as parametric distributions, the input joint PDF can be expressed as

$$ {f}_{\mathbf{X}}\left(\mathbf{x};\boldsymbol{\uppsi} \right)=\prod \limits_{i=1}^N{f}_{X_i}\left({x}_i;\boldsymbol{\uppsi} \right) $$

(21)

Since input random variables are assumed to be independent in this study. The PDF of the ith random variable can be described as (Cho et al. 2016b)

$$ {f}_{X_i}\left({x}_i;\boldsymbol{\uppsi} \right)={f}_{X_i}\left({x}_i;{\mu}_i,{\sigma}_i^2\right) $$

(22)

Assuming that the PDF is a two-parameter distribution with parameters of μ_i(a_i, b_i) and σ_i(a_i, b_i) as a function of two general distribution parameters. Under this configuration, RMA and its sensitivity analysis can share the sampling-based RBDO framework, and the relationship between mean and variance and general distribution parameters can be found in the literature (Lee et al. 2011a, b; Cho et al. 2016b). Note that any two-distribution parameters need to be expressed as mean and variance and vice versa to utilize the framework of RMA. Furthermore, the first-order score functions with respect to mean and variance for various parametric distributions can be found.

Once input distribution types are decided, epistemic uncertainty induced by an insufficient number of data is merely quantified as uncertainties of input distribution parameters. The reliability of target performance is a function of input distribution parameters so that the proposed MPP-search in P-space is available. The sensitivity analysis of reliability in (15) and (17) can be performed since $ {f}_{\mathbf{X}}\left(\mathbf{x};{\mu}_i,{\sigma}_i^2\right) $ is given as an explicit function of mean and variance. However, when input distribution types are still unknown, PDFs should be defined using KDE prior to RMA, which will be presented in the next section.

4.2 Unknown input distribution type

4.2.1 Uncertainty quantification using kernel density estimation

When input distribution types are unknown, it is necessary to describe f_X(x; ψ) without using any parametric distribution. In this research, kernel density estimation (KDE) is implemented for the purpose. KDE under given input data is described as (Silverman 2018)

$$ \hat{f}(x)=\frac{1}{nh}\sum \limits_{i=1}^nk\left(\frac{X_i-x}{h}\right) $$

(23)

where n is the number of samples; X₁, X₂, ..., X_n are independent and identically distributed random samples; k(u) is a kernel function satisfying $ {\int}_{-\infty}^{\infty }k(u) du=1 $; h is a smoothing parameter of KDE. In this paper, the standard normal density function is employed for the kernel function.

It should be noted that the shape of the KDE expression in (23) does not change during design optimization. Only mean, which is the sample mean initially, changes in the optimization process. Thus, during the design optimization, a shifted KDE of the independent jth random design variable with n samples is used as an input distribution given by

$$ {\overset{\wedge }{f}}_{X_j}\left(x;{{}^{\ast}\mathbf{x}}_j,{\mu}_j\right)=\frac{1}{n{h}_j}\sum \limits_{i=1}^nk\left(\frac{s_i\left({{}^{\ast}\mathbf{x}}_j,{\mu}_j\right)-x}{h_j}\right) $$

(24)

where $ {s}_i\left({{}^{\ast}\mathbf{x}}_j,{\mu}_j\right)={{}^{\ast }x}_{j,i}-{{}^{\ast}\overline{x}}_j+{\mu}_j $ is the ith shifted sample of the jth random variable, and μ_j is the given mean of the jth random variable. On the other hand, the uncertainty of variance cannot be directly reflected in (24) since the variance of KDE is always larger than the sample variance (Hansen 2009). Therefore, using the same normality condition used in Section 3.1, the bandwidth in (24) can be determined using the normal reference rule as (Silverman 2018)

$$ {h}_j={\left(\frac{4{\sigma_j}^5}{3n}\right)}^{\frac{1}{5}}\approx 1.06{\sigma}_j{n}^{-\frac{1}{5}} $$

(25)

where $ {\sigma}_j^2 $ is the given variance of the jth random variable. The bandwidth of normal reference rule is to choose the optimal bandwidth that minimizes the mean integrated squared error when underlying density function is a normal distribution. Despite the normality assumption, (24) can express a plausible PDF for any input distribution. Through (24) and (25), KDE can reflect the given mean and variance to compute the reliability in P-space. Note that given mean and variance as μ_j and $ {\sigma}_j^2 $ indicate the input distribution parameters to specify the uncertain input distribution since mean and variance are changed during MPP search in P-space, and KDE has to reflect to difference mean and variance following the current MPP candidate.

4.2.2 Stochastic sensitivity analysis for kernel density estimation

To utilize the stochastic sensitivity analysis in Section 3.2.2 for unknown input distribution cases, the first-order score function for KDE needs to be obtained. Using (24) and (25), the first-order score function with respect to mean for KDE of the jth independent random variable is derived as

$$ {\displaystyle \begin{array}{l}\frac{\partial \ln {\overset{\wedge }{f}}_{X_j}\left(x;h\right)}{\partial {\mu}_j}\kern0.36em =\frac{\partial \ln {\overset{\wedge }{f}}_{X_j}}{\partial^{\ast }{x}_{j,1}}\frac{\partial^{\ast }{x}_{j,1}}{\partial {\mu}_j}+...+\frac{\partial \ln {\overset{\wedge }{f}}_{X_j}}{\partial^{\ast }{x}_{j,n}}\frac{\partial^{\ast }{x}_{j,n}}{\partial {\mu}_j}\\ {}\kern3.839998em =\sum \limits_{i=1}^n\frac{x-{{}^{\ast }x}_{j,i}}{\sqrt{2\pi }n{h}_j^3{\overset{\wedge }{f}}_{X_j}(x)}{e}^{-\frac{1}{2}{\left(\frac{x-{{}^{\ast }x}_{j,i}}{h_j}\right)}^2}\end{array}} $$

(26)

where ^∗x_{j, i} is ith sample of jth random variable, $ {\overset{\wedge }{f}}_{X_j}(x)=\frac{1}{n{h}_j}\sum \limits_{i=1}^n\phi \left(\frac{x{-}^{\ast }{x}_{j,i}}{h_j}\right) $, and $ \phi (z)=\frac{1}{\sqrt{2\pi }}{e}^{-\frac{z^2}{2}} $which is the standard normal density function. Similarly, the first-order score function with respect to the variance for KDE of the jth independent random variable is derived as

$$ {\displaystyle \begin{array}{l}\frac{\partial \ln {\overset{\wedge }{f}}_{X_j}\left(x;h\right)}{\partial {\sigma}_j^2}=\frac{\partial \ln {\overset{\wedge }{f}}_{X_j}}{\partial {h}_j}\frac{\partial {h}_j}{\partial {\sigma}_j^2}\\ {}\kern3.839998em =-\frac{1}{h_j}+\frac{\sum \limits_{i=1}^n{\left(x-{{}^{\ast }x}_{j,i}\right)}^2\phi \left(\frac{x-{{}^{\ast }x}_{j,i}}{h_j}\right)}{h_j^3\sum \limits_{i=1}^n\phi \left(\frac{x-{{}^{\ast }x}_{j,i}}{h_j}\right)}\frac{{\left(1.06{n}^{-1/5}\right)}^2}{2{h}_j}\end{array}} $$

(27)

Hence, (26) and (27) are substituted in (15–17) and (20) for the stochastic sensitivity analysis for CBDO when there is no prior knowledge on input distribution types.

4.3 Flowchart and practical issues

Figure 2 shows the flowchart of the proposed CBDO with RMA. Starting at the initial design point, variabilities of mean and variance are quantified as a normal and inverse-gamma distribution using the given input dataset. Before MPP search, input distribution types have to be specified according to which method is used. If the distribution types are parametrically identified, the parametric distributions are directly used for random sampling with respect to realizations of mean and variance. Otherwise, KDE is used to describe input distributions based on the given input dataset. To check violation of confidence constraints, uncertain input distribution parameters are transformed to P-space to find MPP using the stochastic sensitivity analysis for reliability. After finding the MPP, the current design point is moved to the next point using any nonlinear optimizer (e.g., sequential quadratic programming).

There are two remarks compared to common MPP searches used in PMA. Firstly, since reliability is always bounded between 0 and 1, sensitivity may become zero in case of totally violated or safe constraints. Therefore, it is recommended to check reliability values before MPP search in P-space to prevent numerical instability due to zero sensitivity. Secondly, characteristics of reliability with respect to mean and variance may result in multiple MPPs. This happens when a performance function has multiple failure regions in the probabilistic domain. It can occur in conventional reliability analysis as well, but may be more frequent in RMA since reliability is a much more complex function with respect to input distribution parameters. The issue in the second remark will be discussed in further works.

5 Numerical examples

Two mathematical examples and one engineering example are introduced in this section for demonstration of the proposed CBDO with RMA. The proposed sensitivity analysis with respect to each input distribution parameter is validated, and MPP search in P-space is visualized to show its feasibility by checking contours of reliability functions for the 1-D mathematical function. In the 2-D mathematical example, the results of proposed CBDO under various circumstances and its effectiveness have been analyzed. Finally, the 11-D side impact example is used to test the applicability of the proposed CBDO to practical and complex engineering problems.

5.1 One-dimensional mathematical function for RMA

To demonstrate the proposed RMA and its sensitivity analysis, 1-D mathematical performance function given by

$$ G(X)=-0.35+X\sin (X)-0.5{X}^2 $$

(28)

is introduced. Even though the performance function has only one random design variable, its probabilistic space of input distribution parameters is expressed in 2-D space since a two-parameter distribution will be used for the random variable X. There are 10 input data for X randomly drawn from normal, lognormal, Weibull, and Gumbel distributions where the true variance is set to 0.5², respectively.

5.1.1 Verification of sensitivity analysis in RMA

The proposed stochastic sensitivity analysis with respect to mean and variance in P-space is verified in this section by comparing with the finite difference method (FDM). Although FDM could be used for sensitivity analysis, it requires additional function evaluations, and proper perturbation is unknown in general due to truncation and round-off error in computation. Therefore, analytical sensitivity analysis is critical in design optimization in terms of both efficiency and accuracy. The FDM results are merely used as a reference to validate the derivation on analytical sensitivity analysis. For the test, a design point of p^∗ = {1.1897, 1.1357}^T which is the MPP in P-space with given dataset drawn from the normal distribution is utilized. Sensitivities of reliability with respect to distribution parameters for parametric distributions and nonparametric distribution given in (15) and (17) and FDM sensitivities with 0.5% perturbation are listed in Tables 1 and 2. Agreements between the proposed and FDM sensitivities are shown in the parenthesis. From the tables, it is shown that accurate sensitivities in P-space are obtained even when a distribution of a random variable is expressed using KDE.

Table 1 Sensitivity analysis results using known parametric distribution

Full size table

Table 2 Sensitivity analysis results using KDE

Full size table

5.1.2 Results of RMA

In this section, the proposed RMA is explained using the same 1-D performance function. Figure 3 shows the contours of reliability and hyperspheres for 95% target confidence in P-space assuming input distributions are known. Figure 3 also shows MPP search history which is obtained by hybrid mean value method (HMV) (Youn et al. 2003), and all the reliability values at converged points are the smallest on the target confidence circle. Hence, it can be seen that the MPP search has been successfully performed based on accurate sensitivity information. Each axis in Fig. 3 represents input distribution parameters transformed to P-space.

In case of no prior knowledge on input distribution types, contours of reliability are shown in Fig. 4 where the dataset of the previous case is reused for KDE. Even though the same dataset is used, the variability of random design variables changes resulting in different reliability contours. This is because only 10 samples are used for input statistical modeling.

Results of the proposed RMA for 95% target confidence level are listed in Table 3. For validation, MCS using original parametric distributions is performed at MPP obtained using RMA and agreements between two results are written in the parentheses. Table 3 shows that RMA is more accurate when input distributions are known and KDE shows slightly higher error especially in non-normal cases. It can be seen from Fig. 3 that limit-state reliability functions near MPP seem to be linear so that the RMA results using parametric distributions are accurate. On the other hand, in case of unknown distribution as shown in Fig. 4, it should be addressed that KDE cannot accurately approximate the original input distribution of the random variable with 10 samples only. Therefore, the larger errors in KDE are mainly caused by inexact estimations of the original PDF.

Table 3 MCS results at MPP obtained using RMA with a 95% confidence level using 10 data

Full size table

Table 4 shows RMA results when the number of input data is 30. Accuracy is generally improved using more data in two ways. First, increasing the number of data will reduce the variability of each input distribution parameter. Therefore, the limit-state reliability function near MPP in P-space becomes more linear, thereby reducing error due to the linearization. Second, when input distribution types are unknown, increasing the number of data can reduce error due to inaccurate estimation of input distributions by KDE as well. Therefore, increasing the number of data improves the accuracy of RMA with KDE more as shown in Table 4.

Table 4 MCS results at MPP obtained using RMA with a 95% confidence level using 30 data

Full size table

5.2 Two-dimensional confidence-based design optimization

Efficiency and accuracy of the proposed CBDO with RMA are verified through a 2-D mathematical example formulated as

$$ {\displaystyle \begin{array}{l}\kern0.48em \min \kern0.96em -\frac{{\left({d}_1+{d}_2-10\right)}^2}{30}-\frac{{\left({d}_1-{d}_2+10\right)}^2}{120}\\ {}\kern0.96em \mathrm{s}.\mathrm{t}\kern1.08em \Pr \left[\Pr \left[{G}_j\left(\mathbf{X}\left(\mathbf{d}\right)\right)\le 0\right]\ge 0.9772\right]\ge 0.9000,j=1,2,3\\ {}\kern2.639999em 0\le {d}_i\le 10,i=1,2\\ {}\mathrm{where}\kern0.96em {G}_1\left(\mathbf{X}\right)=1-\frac{X_1^2{X}_2}{20},\\ {}\kern2.639999em {G}_2\left(\mathbf{X}\right)=-1+{\left(0.9063{X}_1+0.4226{X}_2-6\right)}^2+{\left(0.9063{X}_1+0.4226{X}_2-6\right)}^3\\ {}\kern4.679997em -0.6{\left(0.9063{X}_1+0.4226{X}_2-6\right)}^4-\left(-0.4226{X}_1+0.9063{X}_2\right),\\ {}\kern2.639999em {G}_3\left(\mathbf{X}\right)=1-\frac{80}{X_1^2+8{X}_2+5}\end{array}} $$

(29)

where the target reliability and target confidence of reliability are set to 97.72% and 90.00%, respectively. In other words, the optimization in Eq. (29) explores a conservative optimum satisfying the confidence constraints with 90.00 % target confidence and 97.79 % target reliability. The number of MCS samples for reliability computation is 10⁶, and the number of MCS samples for input distribution parameters in double-loop CRBDO is 10⁴.

The design history of CBDO is illustrated in Fig. 5 when 10 input data are given. For efficiency, the sampling-based RBDO is performed from the initial point, and then CBDO is carried out from the RBDO optimum which is supposed to be near the CBDO optimum compared to arbitrary initial points. It is reasonable that the CBDO optimum is farther from each limit-state function than the RBDO optimum to compensate the input uncertainties. Note that the sampling-based RBDO is performed only with the inherent randomness of input variables using sample mean and variance.

Tables 5, 6, and 7 show design optimization results obtained using the proposed CBDO for known and unknown input distribution cases and conventional double-loop CRBDO. The confidence in each table means exact confidence for each constraint obtained using MCS with true input distributions. When input distribution types are known, the CBDO optimum gradually approaches the double-loop CRBDO optimum as the number of data increases. On the other hand, in case of unknown input distribution type, the CBDO optimum becomes conservative especially for the second constraint as the number of data increases. This is because the reliability for the second constraint obtained using KDE is overestimated due to its nonlinearity and inaccurate density estimation. The number of MCSs in the last columns of Tables 5, 6, and 7 means the number of reliability calculations using MCS. Since the third constraint is perfectly safe during the design optimization, only one MCS is required to check feasibility in each iteration of CBDO. However, in the case of CRBDO, MCS with 10⁴ samples for input distribution parameters is performed for a constraint in each iteration. Thus, the total number of MCSs in CRBDO becomes the number of design iterations × the number of constraints × the number of MCS samples for input distribution parameters. As a result, the total number of MCSs is reduced from 180,000 in CRBDO to 44 and 36 in CBDO for a case of 100 input samples meaning that efficiency of CBDO is significantly improved maintaining the acceptable level of accuracy compared to CRBDO. Notably, confidence at the optimum is slightly overestimated or underestimated because the proposed method is an MPP-based approximation method inducing approximation error. In this paper, we cannot estimate nonlinearity of reliability with respect to input distribution parameters, but the accuracy will be definitely improved using higher-order MPP-based approximation methods.

Table 5 Results of CBDO when input distribution types are known

Full size table

Table 6 Results of CBDO when input distribution types are KDE

Full size table

Table 7 Results of double-loop CRBDO

Full size table

On the other hand, true reliability, reliability at mean value (i.e., sample variance), and reliability at the target confidence level can be compared to see how much uncertainty should be allowed in Table 8. The differences between reliabilities can be indices to quantify the impact of epistemic uncertainty. In Table 8, all results are taken from the CBDO optimum when distribution types are known. Certainly, reliability at MPP has to be the same with the target reliability of 97.72% for the first and second constraints which are active. Reliability at mean value indicates the reliability when variances of random variables are set to sample variances. It should be gradually closer to the true reliability in which the exact variances of random variables are known. Unlike overall tendency of results, it seems that the reliability at mean value is very similar to the true reliability when the number of input data is 10 since the sample variance of drawn data from a population is coincidently very close to the true variance.

Table 8 Reliability under various input distribution parameters with known distribution types

Full size table

5.3 Eleven-dimensional side impact problem

In this section, the proposed CBDO is applied to the vehicle side impact crashworthiness problem with nine random variables and two random parameters (Youn et al. 2004). It is assumed that statistical information of nine random variables is fully known and two random parameters have uncertainty induced by insufficient input with various numbers of data. Design optimization for the vehicle side impact problem is formulated as

$$ {\displaystyle \begin{array}{l}\kern0.48em \min \kern0.72em \mathrm{cost}\left(\mathbf{d}\right)=1.98+4.9{d}_1+6.67{d}_2+6.98{d}_3+4.01{d}_4+1.78{d}_5+2.73{d}_7\\ {}\kern0.96em \mathrm{s}.\mathrm{t}\kern0.72em \Pr \left[\Pr \left[{G}_j\left(\mathbf{X}\left(\mathbf{d}\right)\right)\le 0\right]\ge 0.95\right]\ge 0.95,j=1,2,3\\ {}\kern2.28em {\mathbf{d}}^L\le \mathbf{d}\le {\mathbf{d}}^U\\ {}\mathrm{where}\kern0.6em {G}_1\left(\mathbf{X}\right)=14.36-9.9{X}_2-12.9{X}_1{X}_8+0.1107{X}_3{X}_{10}\\ {}\kern2.28em {G}_2\left(\mathbf{X}\right)=0.72-0.5{X}_4-0.19{X}_2{X}_3-0.0122{X}_4{X}_{10}\\ {}\kern4.799997em +0.009325{X}_6{X}_{10}+0.000191{X}_{11}^2\\ {}\kern2.28em {G}_3\left(\mathbf{X}\right)=1.35-0.489{X}_3{X}_7-0.843{X}_5{X}_6+0.0432{X}_9{X}_{10}\\ {}\kern4.799997em -0.0556{X}_9{X}_{11}-0.000786{X}_{11}^2\end{array}} $$

(30)

where the objective is the vehicle weight, both the target confidence of reliability and target reliability are set to 95%, and the statistical information of random variables is listed in Table 9. The original side impact problem contains 10 constraints among which only three are active, and they are included in (30). Note that X₁₀ and X₁₁ are random parameters which are invariant during the optimization, and samples are randomly drawn from the normal distribution as shown in Table 9.

Table 9 Properties of random input (9 variables and 2 parameters) for side impact crashworthiness problem

Full size table

CBDO results with various numbers of input data are listed in Tables 10 and 11. In addition, CBDO results using KDE are listed in Tables 12 and 13. Overall results show that as the number of data increases, the CBDO optimum gradually approaches the RBDO optimum where true input distributions of two parameters are used. It is natural that an insufficient number of samples may provide large uncertainty in distribution parameters, and thus the optima become more conservative to compensate for the lack of knowledge on input statistical models. As discussed in the previous example, the accuracy of KDE is slightly worse than known distribution cases due to inaccurate input distribution estimation and propagation of variance to the smoothing parameter in KDE.

Table 10 Optima under various numbers of data when input distribution types are known

Full size table

Table 11 Confidences at optima when input distribution types are known

Full size table

Table 12 Optima under various numbers of data when input distribution types are KDE

Full size table

Table 13 Confidences at optima when input distribution types are KDE

Full size table

6 Conclusion

This paper proposes RMA of CBDO that can be defined as an MPP-based approximation method in P-space to eliminate outer-loop MCS of CRBDO to efficiently obtain the confidence of reliability. RMA to deal with the uncertainty of input distributions searches MPP in P-space and approximates the reliability based on MPP, resulting in significant improvement in efficiency with an acceptable level of accuracy. For the MPP search in P-space, two approaches using input distribution parameters quantified through the normality assumption are proposed according to the uncertainty of input distribution types. In addition, stochastic sensitivity analysis for reliability with respect to input distribution parameters is derived employing the first-order score function to reduce the number of MCSs and provides accurate search direction to an optimizer. Finally, CBDO with RMA accounting for the confidence of reliability is proposed adopting the framework of RBDO using PMA when the uncertainty of input statistical models exists. As a result, a conservative optimum in the presence of the uncertainty of input statistical models can be obtained using much-reduced computations compared to the previous CRBDO. Moreover, other existing MPP-based approximation methods can be directly combined with the proposed CBDO for more accurate results.

In future work, reweighting samples of MCS to estimate the reliability will be investigated to further reduce the number of MCSs. Samples to compute the reliability at a design point in P-space can be reused in the next point during MPP search using the scheme of the importance sampling. It can be effective in RMA in CBDO because input distribution parameters vary locally in P-space. In addition, the cost to add input samples will be treated as development cost to find the balanced optimum between system cost and development cost satisfying confidence constraints.

7 Replication of Results

Matlab codes for the mathematical examples in Section 5 are uploaded on https://github.com/Yongsu-Jung/SMO_CBDO.git. Overall concepts and algorithms can be validated through the mathematical example.

References

Bae S, Kim NH, Park C, Kim Z (2017) Confidence interval of Bayesian network and global sensitivity analysis. AIAA J:3916–3924
Article Google Scholar
Bae S, Kim NH, Jang SG (2018) Reliability-based design optimization under sampling uncertainty: shifting design versus shaping uncertainty. Struct Multidiscip Optim 57(5):1845–1855
Article MathSciNet Google Scholar
Cho H, Choi KK, Gaul NJ, Lee I, Lamb D, Gorsich D (2016a) Conservative reliability-based design optimization method with insufficient input data. Struct Multidiscip Optim 54(6):1609–1630
Article MathSciNet Google Scholar
Cho H, Choi KK, Lee I, Lamb D (2016b) Design sensitivity method for sampling-based RBDO with varying standard deviation. J Mech Des 138(1):011405
Article Google Scholar
Cho H, Choi KK, Lamb D (2017) Sensitivity developments for RBDO with dependent input variable and varying input standard deviation. J Mech Des 139(7):071402
Article Google Scholar
Choi SH, Lee G, Lee I (2018) Adaptive single-loop reliability-based design optimization and post optimization using constraint boundary sampling. J Mech Sci Technol 32(7):3249–3262
Article Google Scholar
Der Kiureghian A, Ditlevsen O (2009) Aleatory or epistemic? Does it matter? Struct Saf 31(2):105–112
Article Google Scholar
Du L, Choi KK, Youn BD (2006a) Inverse possibility analysis method for possibility-based design optimization. AIAA J 44(11):2682–2690
Article Google Scholar
Du L, Choi KK, Youn BD, Gorsich D (2006b) Possibility-based design optimization method for design problems with both statistical and fuzzy input data. J Mech Des 128(4):928–935
Article Google Scholar
Dubourg V, Sudret B, Bourinet JM (2011) Reliability-based design optimization using kriging surrogates and subset simulation. Struct Multidiscip Optim 44(5):673–690
Article Google Scholar
Gelman A, Stern HS, Carlin JB, Dunson DB, Vehtari A, Rubin DB (2013) Bayesian data analysis, 3rd edn. Chapman and Hall/CRC
Gunawan S, Papalambros PY (2006) A Bayesian approach to reliability-based optimization with incomplete information. J Mech Des 128(4):909–918
Article Google Scholar
Hansen BE (2009) Lecture notes on nonparametrics. Lecture notes
Hao P, Wang Y, Liu C, Wang B, Wu H (2017) A novel non-probabilistic reliability-based design optimization algorithm using enhanced chaos control method. Comput Methods Appl Mech Eng 318:572–593
Article MathSciNet Google Scholar
Ito M, Kim NH, Kogiso N (2018) Conservative reliability index for epistemic uncertainty in reliability-based design optimization. Struct Multidiscip Optim 57(5):1919–1935
Article MathSciNet Google Scholar
Jung Y, Cho H, Lee I (2019) MPP-based approximated DRM (ADRM) using simplified bivariate approximation with linear regression. Struct Multidiscip Optim 59(5):1761–1773
Article MathSciNet Google Scholar
Kang YJ, Lim OK, Noh Y (2016) Sequential statistical modeling method for distribution type identification. Struct Multidiscip Optim 54(6):1587–1607
Article Google Scholar
Keshtegar B, Lee I (2016) Relaxed performance measure approach for reliability-based design optimization. Struct Multidiscip Optim 54(6):1439–1454
Article MathSciNet Google Scholar
Lee I, Choi KK, Gorsich D (2010) Sensitivity analyses of FORM-based and DRM-based performance measure approach (PMA) for reliability-based design optimization (RBDO). Int J Numer Methods Eng 82(1):26–46
MATH Google Scholar
Lee I, Choi KK, Zhao L (2011a) Sampling-based RBDO using the stochastic sensitivity analysis and dynamic Kriging method. Struct Multidiscip Optim 44(3):299–317
Article MathSciNet MATH Google Scholar
Lee I, Choi KK, Noh Y, Zhao L, Gorsich D (2011b) Sampling-based stochastic sensitivity analysis using score functions for RBDO problems with correlated random variables. J Mech Des 133(2):021003
Article Google Scholar
Lee I, Noh Y, Yoo D (2012) A novel second-order reliability method (SORM) using noncentral or generalized chi-squared distributions. J Mech Des 134(10):100912
Article Google Scholar
Lee I, Choi KK, Noh Y, Lamb D (2013) Comparison study between probabilistic and possibilistic methods for problems under a lack of correlated input statistical information. Struct Multidiscip Optim 47(2):175–189
Article MathSciNet MATH Google Scholar
Li G, Meng Z, Hu H (2015) An adaptive hybrid approach for reliability-based design optimization. Struct Multidiscip Optim 51(5):1051–1065
Article MathSciNet Google Scholar
Liang B, Mahadevan S (2011) Error and uncertainty quantification and sensitivity analysis in mechanics computational models. Int J Uncertain Quantif 1(2)
Meng Z, Li G, Wang BP, Hao P (2015a) A hybrid chaos control approach of the performance measure functions for reliability-based design optimization. Comput Struct 146:32–43
Article Google Scholar
Meng Z, Hao P, Li G, Wang B, Zhang K (2015b) Non-probabilistic reliability-based design optimization of stiffened shells under buckling constraint. Thin-Walled Struct 94:325–333
Article Google Scholar
Moon MY, Choi KK, Cho H, Gaul N, Lamb D, Gorsich D (2017) Reliability-based design optimization using confidence-based model validation for insufficient experimental data. J Mech Des 139(3):031404
Article Google Scholar
Moon MY, Cho H, Choi KK, Gaul N, Lamb D, Gorsich D (2018) Confidence-based reliability assessment considering limited numbers of both input and output test data. Struct Multidiscip Optim 57(5):2027–2043
Article MathSciNet Google Scholar
Muscolino G, Santoro R, Sofi A (2016) Reliability analysis of structures with interval uncertainties under stationary stochastic excitations. Comput Methods Appl Mech Eng 300:47–69
Article MathSciNet MATH Google Scholar
Nannapaneni S, Mahadevan S (2016) Reliability analysis under epistemic uncertainty. Reliab Eng Syst Saf 155:9–20
Article Google Scholar
Noh Y, Choi KK, Lee I, Gorsich D, Lamb D (2011a) Reliability-based design optimization with confidence level under input model uncertainty due to limited test data. Struct Multidiscip Optim 43(4):443–458
Article MathSciNet MATH Google Scholar
Noh Y, Choi KK, Lee I, Gorsich D, Lamb D (2011b) Reliability-based design optimization with confidence level for non-Gaussian distributions using bootstrap method. J Mech Des 133(9):091001
Article Google Scholar
Park JW, Lee I (2018) A study on computational efficiency improvement of novel SORM using the convolution integration. J Mech Des 140(2):024501
Article Google Scholar
Peng X, Li J, Jiang S (2017) Unified uncertainty representation and quantification based on insufficient input data. Struct Multidiscip Optim 56(6):1305–1317
Article Google Scholar
Picheny V, Kim NH, Haftka RT (2010) Application of bootstrap method in conservative estimation of reliability with limited samples. Struct Multidiscip Optim 41(2):205–217
Article MathSciNet MATH Google Scholar
Rahman S, Xu H (2004) A univariate dimension-reduction method for multi-dimensional integration in stochastic mechanics. Probab Eng Mech 19(4):393–408
Article Google Scholar
Rao SS, Cao L (2002) Optimum design of mechanical systems involving interval parameters. J Mech Des 124(3):465–472
Article Google Scholar
Rosenblatt M (1952) Remarks on a multivariate transformation. Ann Math Stat 23(3):470–472
Article MathSciNet MATH Google Scholar
Rubinstein RY, Kroese DP (2016) Simulation and the Monte Carlo method (Vol. 10). John Wiley & Sons
Sankararaman S, Mahadevan S (2011) Likelihood-based representation of epistemic uncertainty due to sparse point data and/or interval data. Reliab Eng Syst Saf 96(7):814–824
Article Google Scholar
Sankararaman S, Mahadevan S (2013a) Distribution type uncertainty due to sparse and imprecise data. Mech Syst Signal Process 37(1–2):182–198
Article Google Scholar
Sankararaman S, Mahadevan S (2013b) Separating the contributions of variability and parameter uncertainty in probability distributions. Reliab Eng Syst Saf 112:187–199
Article Google Scholar
Silverman BW (2018) Density estimation for statistics and data analysis. Routledge
Tu J, Choi KK, Park YH (1999) A new study on reliability-based design optimization. J Mech Des 121(4):557–564
Article Google Scholar
Wang Z, Wang P (2014) A maximum confidence enhancement based sequential sampling scheme for simulation-based design. J Mech Des 136(2):021006
Article Google Scholar
Yang D, Yi P (2009) Chaos control of performance measure approach for evaluation of probabilistic constraints. Struct Multidiscip Optim 38(1):83
Article Google Scholar
Yoo D, Lee I (2014) Sampling-based approach for design optimization in the presence of interval variables. Struct Multidiscip Optim 49(2):253–266
Article MathSciNet Google Scholar
Youn BD, Wang P (2008) Bayesian reliability-based design optimization using eigenvector dimension reduction (EDR) method. Struct Multidiscip Optim 36(2):107–123
Article Google Scholar
Youn BD, Choi KK, Park YH (2003) Hybrid analysis method for reliability-based design optimization. J Mech Des 125(2):221–232
Article Google Scholar
Youn BD, Choi KK, Yang RJ, Gu L (2004) Reliability-based design optimization for crashworthiness of vehicle side impact. Struct Multidiscip Optim 26(3–4):272–283
Article Google Scholar
Youn BD, Choi KK, Du L (2005) Enriched performance measure approach for reliability-based design optimization. AIAA J 43(4):874–884
Article Google Scholar
Zhao L, Choi KK, Lee I, Gorsich D (2013) Conservative surrogate model using weighted Kriging variance for sampling-based RBDO. J Mech Des 135(9):091003
Article Google Scholar

Download references

Funding

This research was supported by the development of thermoelectric power generation system and business model utilizing non-use heat of industry funded by the Korea Institute of Energy Technology Evaluation and Planning (KETEP) and the Ministry of Trade Industry & Energy (MOTIE) of the Republic of Korea (No. 20172010000830).

Author information

Authors and Affiliations

Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, 34141, South Korea
Yongsu Jung & Ikjin Lee
Department of Mechanical Engineering, Mokpo National University, Muan-gun, 58554, South Korea
Hyunkyoo Cho

Authors

Yongsu Jung
View author publications
You can also search for this author in PubMed Google Scholar
Hyunkyoo Cho
View author publications
You can also search for this author in PubMed Google Scholar
Ikjin Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ikjin Lee.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Responsible Editor: Nam Ho Kim

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jung, Y., Cho, H. & Lee, I. Reliability measure approach for confidence-based design optimization under insufficient input data. Struct Multidisc Optim 60, 1967–1982 (2019). https://doi.org/10.1007/s00158-019-02299-3

Download citation

Received: 18 December 2018
Revised: 01 May 2019
Accepted: 02 May 2019
Published: 11 June 2019
Issue Date: November 2019
DOI: https://doi.org/10.1007/s00158-019-02299-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Reliability measure approach for confidence-based design optimization under insufficient input data

Abstract

Similar content being viewed by others

Probabilistic sensitivity analysis for novel second-order reliability method (SORM) using generalized chi-squared distribution

Conservative reliability-based design optimization method with insufficient input data

Extending SORA method for reliability-based design optimization using probability and convex set mixed models

1 Introduction

2 Overview of reliability analysis and confidence of reliability

2.1 Performance measure approach for reliability-based design optimization

2.2 Confidence of reliability with insufficient input data

2.3 Conservative reliability-based design optimization

3 Confidence-based design optimization using a reliability measure approach

3.1 Uncertainty of input distribution model

3.2 Reliability measure approach

3.2.1 Formulation of RMA

3.2.2 Stochastic sensitivity analysis in RMA

3.2.3 Sensitivity analysis in CBDO

4 Two approaches for input distribution type uncertainty

4.1 Known input distribution type

4.2 Unknown input distribution type

4.2.1 Uncertainty quantification using kernel density estimation

4.2.2 Stochastic sensitivity analysis for kernel density estimation

4.3 Flowchart and practical issues

5 Numerical examples

5.1 One-dimensional mathematical function for RMA

5.1.1 Verification of sensitivity analysis in RMA

5.1.2 Results of RMA

5.2 Two-dimensional confidence-based design optimization

5.3 Eleven-dimensional side impact problem

6 Conclusion

7 Replication of Results

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation