1 Introduction

Uncertainty propagation (UP) methods, which quantify the uncertainty in system output performance induced by random or noisy inputs, are of great importance for design under uncertainty, such as robust design and reliability-based design (Du and Chen 2002; Chen et al. 2015). A wide variety of UP techniques have been developed (Lee and Chen 2009), among which the polynomial chaos (PC) technique is one of the most popular approaches due to its mathematically rigorous concept, strong theoretical basis, and inherent ability to converge to computer calculation precision (Eldred 2009). With PC, a stochastic quantity can be represented as a polynomial chaos expansion, from which the statistical moments and reliability can be conveniently obtained. Oftentimes, the analysis models in practical engineering are highly nonlinear and computationally expensive, such as computational fluid dynamics (CFD) for aerodynamic analysis, resulting in intensive computational cost when implementing UP via PC; this becomes more severe for high-dimensional problems. Therefore, the straightforward application of PC to the expensive model for UP might be too costly and infeasible in practical applications.

Generally, a complicated physical process can be modeled using several methods with different levels of fidelity, or a computer code for a complex problem can be run at different levels of fidelity. For example, aircraft aerodynamic analysis can be simulated with different reduced-order physics (e.g., Euler model vs. potential flow model) or different numerical solvers (e.g., finite difference method vs. finite element analysis). A high-fidelity (HF) model takes more computational time but offers higher accuracy, whereas a low-fidelity (LF) model is faster at the cost of accuracy. Exploiting the availability of multiple models within a hierarchy of fidelity is popular in assisting optimization processes in engineering (Gratiet and Cannamela 2012; Huang et al. 2006; Gratiet et al. 2014). Recently, this scenario has been extended to UP via PC to improve computational efficiency, and it has received considerable interest (Shah et al. 2015; Ng and Eldred 2012; Zhu et al. 2017; Zhu et al. 2014). The earliest work on the multi-fidelity PC method was proposed by Ng and Eldred (Ng and Eldred 2012), in which the stochastic collocation technique was employed to construct the PC model, and the LF and correction PC expansions are integrated into a single expansion in an additive, multiplicative, or combined form to match the HF model values. As stated by Ng and Eldred (Ng and Eldred 2012), for the multiplicative and combined correction forms, the calculation of the multi-fidelity polynomial coefficients is much more complicated and the accuracy is generally worse or only comparable; thus, the additive form is widely studied and applied. The method with the additive form has been applied to UP for a vertical axis wind turbine under extreme gusts (Santiago Padron et al. 2014; Palar et al. 2018). Another similar technique is the multi-fidelity stochastic collocation that relies on Lagrange-polynomial interpolation, in which a greedy procedure based on information from the LF model is used to select "important" sample points for the HF simulations (Zhu et al. 2014, 2017).

In the widely studied method proposed by Ng and Eldred, the roots of the orthogonal polynomials are employed as the collocation points for the PC expansion. Therefore, the number and locations of the collocation points cannot be arbitrary, resulting in less flexibility in performing UP for a user with a limited computational budget. To address this issue, a multi-fidelity PC approach using regression has been developed based on the work of Ng and Eldred (Pramudita et al. 2016), and has been applied to multidisciplinary design optimization under uncertainty (West and Gumbert 2017) and aerodynamic robust optimization (Palar et al. 2015). All the above PC-based multi-fidelity modeling approaches for UP require the HF sample points to be a subset of the LF ones (i.e., nested sample points). To relax this constraint and improve flexibility, Berchier proposed to calculate the correction expansion term using the low-fidelity PC model rather than the LF sample points (Matteo 2016).

In the deterministic case, the most well-known multi-fidelity modeling method is the multi-level co-kriging approach proposed by Kennedy and O’Hagan (KOH for short in this work), which employs a discrepancy-based autoregressive multi-fidelity modeling formulation and the Gaussian process (GP) modeling technique (Kennedy and O’Hagan 2000). It has been widely recognized that KOH is more accurate and flexible than the classic additive and multiplicative correction forms for multi-fidelity modeling (Laurenceau and Sagaut 2008; Toal et al. 2011; Han et al. 2012; Huang et al. 2013; Toal and Keane 2015). One main reason is that the KOH framework employs the Gaussian process modeling method, which can flexibly capture the nonlinearity of the model, together with a scaling factor on the LF model, which helps improve the accuracy of the correction term and avoid the bumpiness issue (Fernández-Godino et al. 2016). This prompts the question of whether the KOH multi-fidelity modeling framework can be extended to the stochastic domain for UP, within which multi-fidelity PC can be implemented to improve the performance of UP. As is well known, the basic theoretical foundation of KOH is the GP modeling method, and the predicted response in KOH can be represented as a GP. Recently, the PC method has been extended to polynomial-chaos-kriging (PC-Kriging) by adding a GP term to the original PC model to more accurately capture the local variability of the response model (Schobi et al. 2015). It has been demonstrated that the PC-Kriging approach is more accurate than both PC and kriging in performing UP (Schobi et al. 2015; Kersaudy et al. 2015). Clearly, the success of introducing GP into PC for UP lays the foundation for extending the KOH framework to multi-fidelity PC. Therefore, a multi-fidelity PC approach is developed and studied using the KOH framework in this paper.

In addition, almost all works on multi-fidelity UP focus on model fusion within a hierarchy of model fidelities. However, in many applications it is not possible to rank models by their levels of fidelity a priori, i.e., the fidelities are non-hierarchical. For example, climate system models are developed by different research groups to understand and predict the system's behavior, based on disparate theories or mechanisms to incorporate the physics and chemistry of the atmosphere, ocean, and land surface (Allaire and Willcox 2012). A non-hierarchical multi-fidelity modeling approach from the deterministic point of view has been developed by Chen et al. using a spatial random process (Chen et al. 2016), which is extended to multi-fidelity PC with non-hierarchical fidelities in this work.

The objective of this work is to explore the applicability and effectiveness of Gaussian process modeling theory for multi-fidelity UP via the PC technique. For hierarchical fidelities, the well-known KOH framework is extended to multi-fidelity UP, in which the lowest-fidelity model and all the correction terms are each represented as a PC-Kriging model. For non-hierarchical fidelities, the weighted summation method proposed by Chen et al. is extended, in which all the lower-fidelity models and the correction term are each represented as a PC-Kriging model. Meanwhile, for high-dimensional problems, the hyperbolic truncation scheme is employed to reduce the number of orthogonal polynomials during the construction of the PC term in the PC-Kriging model, and thus to reduce the computational cost.

The remainder of this paper is organized as follows. A brief review of the PC-Kriging method combining PC and Gaussian process modeling for UP is given in Section 2. The proposed multi-fidelity UP method using PC and the Gaussian process modeling technique is presented in Section 3, in which the multi-fidelity UP strategies for hierarchical and non-hierarchical fidelities are explained in detail. Comparative studies on numerical problems are presented in Section 4, where the commonly used co-kriging method (Kennedy and O’Hagan 2000) and the multi-fidelity PC method (Ng and Eldred 2012) are also tested for comparison. In Section 5, the proposed method is applied to an aerodynamic robust optimization problem to further verify its effectiveness and applicability to practical problems. Conclusions are drawn in Section 6.

2 Review of PC-Kriging

The polynomial-chaos-kriging (PC-Kriging) method is a recently developed approach for UP that adds a GP term to the PC model. As is well known, the PC model, formulated as a weighted sum of a set of orthogonal polynomials, can efficiently capture the global behavior of the analysis model. The introduction of the GP term helps to capture the local variability and thus improves the accuracy of PC for UP. With PC-Kriging, a stochastic response y = g(x) with a d-dimensional input vector x = [x1, ..., xd] can be represented as follows:

$$ y\approx {M}^{(PCK)}\left(\mathbf{x}\right)=\sum \limits_{i=0}^P{b}_i{\varPhi}_i\left(\mathbf{x}\left(\xi \right)\right)+{\sigma}^2Z\left(\mathbf{x}\right) $$
(1)

where \( \sum \limits_{i=0}^P{b}_i{\varPhi}_i\left(\mathbf{x}\left(\xi \right)\right) \) is a PC model of order p describing the mean value of the Gaussian process, ξ is a standard random vector generated by mapping the original random vector x to the standard random space according to the distribution parameters of x, bi is the ith coefficient of the PC model, σ is the prior standard deviation of the Gaussian process, and Z(x) is a zero-mean, unit-variance stationary Gaussian process with autocorrelation function R.

The commonly used formulation of R in the literature is:

$$ R\left(\mathbf{x},{\mathbf{x}}^{\prime },\boldsymbol{\theta}, h\right)=\exp \left(-{\sum}_{k=1}^d{\theta}_k{\left|{x}_k-{x}_k^{\prime}\right|}^h\right) $$
(2)
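
To make this notation concrete, the following is a minimal Python sketch (not the authors' code) of the correlation function in (2) and of the resulting autocorrelation matrix; the function names and the plain double loop are illustrative assumptions only.

```python
# Minimal sketch of the correlation function (2); numpy arrays are assumed for
# x, x', and theta (length d), and h is a scalar exponent.
import numpy as np

def correlation(x, x_prime, theta, h):
    """R(x, x', theta, h) = exp(-sum_k theta_k |x_k - x'_k|^h)."""
    return np.exp(-np.sum(theta * np.abs(x - x_prime) ** h))

def correlation_matrix(X, theta, h):
    """Autocorrelation matrix R with R_ij = R(x_i, x_j, theta, h) over the rows of X."""
    n = X.shape[0]
    R = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            R[i, j] = correlation(X[i], X[j], theta, h)
    return R
```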

Building a PC-Kriging stochastic metamodel consists of two parts: (i) the construction of Φi(x(ξ)) in the PC term, and (ii) the estimation of the hyper-parameters (θ, h, σ) and b = [b0, ..., bP]T. For the first part, the commonly used method is the direct tensor product technique. For high-dimensional problems, the least angle regression method (Wang et al. 2016) or the hyperbolic truncation scheme (Blatman and Sudret 2011) shown in (3) can be employed to remove unimportant orthogonal polynomials and thus reduce the computational cost of PC:

$$ {A}_q^{d,\omega }=\left\{\boldsymbol{\upalpha} \in {\mathbb{N}}^d:{\left\Vert \boldsymbol{\upalpha} \right\Vert}_q={\left(\sum \limits_{i=1}^d{\alpha}_i^q\right)}^{\frac{1}{q}}\le \omega \right\} $$
(3)

where \( {A}_q^{d,\omega } \) is the truncation set of multi-indices α for PC, q is the sparse factor, ω is the highest order of the polynomials Φ(x(ξ)), and αi is the degree of the ith random variable in Φ(x(ξ)).
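
As an illustration of the truncation in (3), the short Python sketch below enumerates the retained multi-indices for given d, ω, and q; the brute-force enumeration over the full tensor grid is an assumption made only to keep the example compact.

```python
# Minimal sketch of the hyperbolic truncation set (3): keep every multi-index
# alpha in N^d whose q-quasi-norm does not exceed the maximum order omega.
from itertools import product

def hyperbolic_index_set(d, omega, q):
    """Return the multi-indices alpha with ||alpha||_q <= omega."""
    kept = []
    for alpha in product(range(omega + 1), repeat=d):
        if sum(a ** q for a in alpha) ** (1.0 / q) <= omega + 1e-12:
            kept.append(alpha)
    return kept

# For instance, d = 6, omega = 5, q = 0.5 (values similar to those used in
# Section 4.1) retains far fewer basis terms than the full total-degree basis.
print(len(hyperbolic_index_set(6, 5, 0.5)))
```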

For the second part, the maximum likelihood estimation (MLE) method or the cross validation (CV) method can be employed (Schobi et al. 2015), in which the hyper-parameters are obtained by minimizing the following criteria, respectively:

$$ {G}_{ML}\left(\boldsymbol{\theta}, h\right)=\underset{\theta, h}{\arg\ \min}\left[\frac{1}{N}{\left(\boldsymbol{y}-\mathbf{F}\boldsymbol{b}\right)}^T{\mathbf{R}}^{-1}\left(\boldsymbol{y}-\mathbf{F}\boldsymbol{b}\right){\left(\det \mathbf{R}\right)}^{\frac{1}{N}}\right] $$
(4)
$$ {G}_{CV}\left(\boldsymbol{\theta}, h\right)=\underset{\theta, h}{\arg\ \min}\left[{\boldsymbol{y}}^T{\mathbf{R}}^{-1}\mathit{\operatorname{diag}}{\left({\mathbf{R}}^{-1}\right)}^{-2}{\mathbf{R}}^{-1}\boldsymbol{y}\right] $$
(5)

where N is the number of sample points, y = [y1, ..., yN]T is the response vector at the input sample points, R is the autocorrelation matrix whose element in the ith row and jth column is Rij = R(xi, xj, θ, h), and F is an N × (P + 1) matrix with Fij = Φj(xi(ξ)) (i = 1, ..., N; j = 0, 1, ..., P); the PC coefficient vector b can be represented as follows:

$$ \boldsymbol{b}\left(\boldsymbol{\theta}, h\right)={\left({\mathbf{F}}^T{\mathbf{R}}^{-1}\mathbf{F}\right)}^{-1}{\mathbf{F}}^T{\mathbf{R}}^{-1}\boldsymbol{y} $$
(6)

Once all the parameters are obtained, the predicted response at any new input site xp can be calculated by (Rasmussen and Williams 2006):

$$ {y}_{(PCK)}\left({\mathbf{x}}_p\right)=\boldsymbol{\Phi} {\left({\mathbf{x}}_p\left(\boldsymbol{\xi} \right)\right)}^T\boldsymbol{b}+R\left({\mathbf{x}}_p,\mathbf{X},\boldsymbol{\theta}, h\right){\mathbf{R}}^{-1}\left(\boldsymbol{y}-\mathbf{F}\boldsymbol{b}\right) $$
(7)

where X is the matrix stacking all the collected input sample points.
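
As a small illustration of how (6) and (7) can be evaluated numerically, the following Python sketch computes the generalized least-squares coefficients and the PC-Kriging prediction for fixed (θ, h); build_basis (returning the row of PC basis values at a point) is an assumed helper, not part of the original formulation.

```python
# Minimal sketch of (6)-(7) for fixed hyper-parameters; F is the N x (P+1)
# matrix of PC basis evaluations at the training inputs X, y the responses.
import numpy as np

def correlation(x, x_prime, theta, h):           # same kernel as in (2)
    return np.exp(-np.sum(theta * np.abs(x - x_prime) ** h))

def fit_pck_coefficients(F, R, y):
    """Generalized least-squares estimate b = (F^T R^-1 F)^-1 F^T R^-1 y, cf. (6)."""
    Rinv_F = np.linalg.solve(R, F)
    Rinv_y = np.linalg.solve(R, y)
    return np.linalg.solve(F.T @ Rinv_F, F.T @ Rinv_y)

def predict_pck(x_new, X, F, R, y, b, build_basis, theta, h):
    """PC-Kriging prediction (7): PC trend plus GP interpolation of the residuals."""
    phi = build_basis(x_new)                     # [Phi_0(x_new), ..., Phi_P(x_new)]
    r = np.array([correlation(x_new, xi, theta, h) for xi in X])
    return phi @ b + r @ np.linalg.solve(R, y - F @ b)
```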

Then, Monte Carlo simulation (MCS) can be directly employed on the PC-Kriging metamodel to obtain the statistical moments and probabilistic distribution of random response y.

3 The proposed multi-fidelity UP method

3.1 Multi-fidelity UP for hierarchical fidelity

It is assumed that there exist s analysis models with responses yt(x), t = 1, ..., s, where a larger t corresponds to a higher level of fidelity and a larger computational cost. Therefore, y1(x) is the lowest-fidelity and cheapest model, while ys(x) is the highest-fidelity and most expensive one. x = [x1, x2, …, xd] ∈ ℝd represents a d-dimensional random input vector. A step-by-step description of the proposed multi-fidelity UP method using PC and the Gaussian process modeling technique for the hierarchical fidelity scenario is presented as follows.

  1. Step 1.

    According to the distribution information of the random input vector x, generate input sample points using methods such as Latin hypercube sampling or the Gaussian quadrature rule, and calculate the corresponding model responses at the different levels of fidelity.

    For the tth-level model (t = 1, ..., s), it is supposed that a set of response observations \( {\mathbf{d}}_t={\left[{y}^t\left({\mathbf{x}}_1^t\right),...,{y}^t\left({\mathbf{x}}_{n_t}^t\right)\right]}^T \) at input sites \( {\mathbf{D}}_t={\left[{\left({\mathbf{x}}_1^t\right)}^T,{\left({\mathbf{x}}_2^t\right)}^T,...,{\left({\mathbf{x}}_{n_t}^t\right)}^T\right]}^T \) has been collected. Let \( \mathbf{d}={\left[{\mathbf{d}}_1^T,...,{\mathbf{d}}_s^T\right]}^T \) denote all of the collected response data from all models at the input sites Γ = [D1; D2; ...; Ds]. Generally, the number of sample points nt decreases as t increases, considering the computational cost.

  2. Step 2.

    Construct the multi-fidelity PC-Kriging metamodel to replace the highest-fidelity model ys(x) by extending the KOH framework to UP.

The KOH formulation in constructing multi-level co-kriging (Kennedy and O’Hagan 2000) is:

$$ {y}^t\left(\mathbf{x}\right)={\rho}_{t-1}{y}^{t-1}\left(\mathbf{x}\right)+{\delta}^t\left(\mathbf{x}\right),t=2,\dots, s $$
(8)

where ρt − 1 represents the scaling factor between the model responses yt(x) and yt − 1(x), and the correction function δt(x) is a Gaussian process denoting the discrepancy between yt(x) and ρt − 1yt − 1(x).

Accordingly, the highest-fidelity output response ys(x) can be expressed as below based on (8):

$$ {y}^s\left(\mathbf{x}\right)=\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right){y}^1(x)+\left(\prod \limits_{i=2}^{s-1}{\rho}_i\right){\delta}^2\left(\mathbf{x}\right)+\dots +{\rho}_{s-1}{\delta}^{s-1}\left(\mathbf{x}\right)+{\delta}^s\left(\mathbf{x}\right) $$
(9)

It is assumed that y1(x), δ2(x), ..., δs(x) can each be modeled by a GP, which are represented as PC-Kriging metamodels as follows:

$$ \Big\{{\displaystyle \begin{array}{c}{y}^1\left(\mathbf{x}\right)=\sum \limits_{i=0}^P{b}_i^1{\boldsymbol{\Phi}}_i^1\left(\mathbf{x}\left(\boldsymbol{\xi} \right)\right)+{\sigma}_1^2{Z}^1\left(\mathbf{x}\right)\sim \mathcal{GP}\left({M}^1\left(\mathbf{x}\right),{V}^1\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)\right)\\ {}{\delta}^2\left(\mathbf{x}\right)=\sum \limits_{i=0}^P{b}_i^2{\boldsymbol{\Phi}}_i^2\left(\mathbf{x}\left(\boldsymbol{\xi} \right)\right)+{\sigma}_2^2{Z}^2\left(\mathbf{x}\right)\sim \mathcal{GP}\left({M}^2\left(\mathbf{x}\right),{V}^2\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)\right)\\ {}\vdots \\ {}{\delta}^s\left(\mathbf{x}\right)=\sum \limits_{i=0}^P{b}_i^s{\boldsymbol{\Phi}}_i^s\left(\mathbf{x}\left(\boldsymbol{\xi} \right)\right)+{\sigma}_s^2{Z}^s\left(\mathbf{x}\right)\sim \mathcal{GP}\left({M}^s\left(\mathbf{x}\right),{V}^s\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)\right)\end{array}} $$
(10)

where \( M\left(\mathbf{x}\right)=\sum \limits_{i=0}^P{b}_i{\boldsymbol{\Phi}}_i\left(\mathbf{x}\left(\boldsymbol{\xi} \right)\right) \) is the mean function expressed as the weighted sum of Φi(x(ξ)) (i.e., the PC term), and V(x, x′) = σ2R(x, x′, θ, h) is the covariance function, representing the spatial covariance between any two inputs x and x′ of the GP.

Then, based on (9) and (10), the highest-fidelity model ys(x) can be further expressed as a GP:

$$ {\displaystyle \begin{array}{l}{y}^s\left(\mathbf{x}\right)\sim \mathcal{GP}\Big({M}^1\left(\mathbf{x}\right)\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right)+{M}^2\left(\mathbf{x}\right)\left(\prod \limits_{i=2}^{s-1}{\rho}_i\right)+\dots +{M}^s\left(\mathbf{x}\right),\\ {}{\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right)}^2{V}^1\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)+{\left(\prod \limits_{i=2}^{s-1}{\rho}_i\right)}^2{V}^2\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)+\dots +{V}^s\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)\Big)\end{array}} $$
(11)
  3. Step 3.

    Estimate all the unknown hyper-parameters by the maximum likelihood estimation method.

The unknown hyper-parameters in (11) are Δ = {B, σ, Θ, ρ, h}, where B = [(b1)T, …, (bi)T, ..., (bs)T]T is the polynomial coefficient vector with each element bi = [b0, b1, ..., bP]T (i = 1, ..., s), and the rest are σ = [σ1, …, σs]T, Θ = [θ1, …, θs]T, ρ = [ρ1, …, ρs − 1]T, and h = [h1, …, hs]T.

In this work, the method considering the full correlation of all the response models (Liu et al. 2018) is employed for parameter estimation. Based on the GP assumption, all the collected data d then follow a multivariate normal distribution, i.e.:

$$ \mathbf{d}\sim \mathcal{N}\left(\mathbf{HB},{\mathbf{V}}_d\right) $$
(12)

where H is defined as:

$$ \mathbf{H}=\left[\begin{array}{cccc}{\boldsymbol{\Phi}}^1\left({\mathbf{D}}_1\left(\boldsymbol{\upxi} \right)\right)& \mathbf{0}& \cdots & \mathbf{0}\\ {}{\rho}_1{\boldsymbol{\Phi}}^1\left({\mathbf{D}}_2\left(\boldsymbol{\upxi} \right)\right)& {\boldsymbol{\Phi}}^2\left({\mathbf{D}}_2\left(\boldsymbol{\upxi} \right)\right)& \mathbf{0}& \mathbf{0}\\ {}{\rho}_1{\rho}_2{\boldsymbol{\Phi}}^1\left({\mathbf{D}}_3\left(\boldsymbol{\upxi} \right)\right)& {\rho}_2{\boldsymbol{\Phi}}^2\left({\mathbf{D}}_3\left(\boldsymbol{\upxi} \right)\right)& {\boldsymbol{\Phi}}^3\left({\mathbf{D}}_3\left(\boldsymbol{\upxi} \right)\right)& \vdots \\ {}\vdots & \vdots & \vdots & \mathbf{0}\\ {}\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right){\boldsymbol{\Phi}}^1\left({\mathbf{D}}_s\left(\boldsymbol{\upxi} \right)\right)& \left(\prod \limits_{i=2}^{s-1}{\rho}_i\right){\boldsymbol{\Phi}}^2\left({\mathbf{D}}_s\left(\boldsymbol{\upxi} \right)\right)& \cdots & {\boldsymbol{\Phi}}^s\left({\mathbf{D}}_s\left(\boldsymbol{\upxi} \right)\right)\end{array}\right] $$
(13)

and Φt(Dj(ξ)) is a matrix of size nj × (P + 1) (j, t = 1, 2, …, s), formulated as:

$$ {\boldsymbol{\Phi}}^t\left({\mathbf{D}}_j\left(\boldsymbol{\xi} \right)\right)=\left[{\varPhi}_0^t\left({\mathbf{D}}_j\left(\boldsymbol{\xi} \right)\right),{\varPhi}_1^t\left({\mathbf{D}}_j\left(\boldsymbol{\xi} \right)\right),\dots, {\varPhi}_P^t\left({\mathbf{D}}_j\left(\boldsymbol{\xi} \right)\right)\right] $$
(14)

The matrix Vd of size (n1 + ⋯ + ns) × (n1 + ⋯ + ns) is given as:

$$ {\mathbf{V}}_d=\left(\begin{array}{ccc}{V}_{1,1}& \cdots & {V}_{1,s}\\ {}\vdots & \ddots & \vdots \\ {}{V}_{s,1}& \cdots & {V}_{s,s}\end{array}\right) $$
(15)

where the tth diagonal block (nt × nt) is defined as:

$$ {V}_{t,t}={\sigma}_t^2{R}_t\left({\mathbf{D}}_t\right)+{\sigma}_{t-1}^2{\rho}_{t-1}^2{R}_{t-1}\left({\mathbf{D}}_t\right)+\dots +{\sigma}_1^2\left(\prod \limits_{i=1}^{t-1}{\rho}_i^2\right){R}_1\left({\mathbf{D}}_t\right) $$
(16)

with Ri(Dt) = Ri(Dt, Dt, θi, hi), and the off-diagonal block of size \( {n}_t\times {n}_{t^{\prime }} \) is given by:

$$ {V}_{t,{t}^{\prime }}=\underset{1\le t<{t}^{\prime}\le s}{\left(\prod \limits_{i=t}^{t^{\prime }-1}{\rho}_i\right)}\left({\sigma}_t^2{R}_t\left({\mathbf{D}}_t,{\mathbf{D}}_{t^{\prime }}\right)+\dots +{\sigma}_1^2\left(\prod \limits_{i=1}^{t-1}{\rho}_i^2\right){R}_1\left({\mathbf{D}}_t,{\mathbf{D}}_{t^{\prime }}\right)\right) $$
(17)
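
For readers who prefer code, the sketch below assembles H in (13) and V_d in (15)-(17) for the simplest two-level case (s = 2); the precomputed PC basis matrices and correlation blocks passed in as arguments are assumptions of this illustration.

```python
# Minimal sketch of (13) and (15)-(17) for s = 2; all PC basis matrices
# (Phi blocks) and correlation matrices (R blocks) are assumed precomputed.
import numpy as np

def assemble_H(Phi1_D1, Phi1_D2, Phi2_D2, rho1):
    """H of (13) for s = 2: block lower-triangular in the PC basis matrices."""
    zero_block = np.zeros((Phi1_D1.shape[0], Phi2_D2.shape[1]))
    top = np.hstack([Phi1_D1, zero_block])
    bottom = np.hstack([rho1 * Phi1_D2, Phi2_D2])
    return np.vstack([top, bottom])

def assemble_Vd(R1_D1D1, R1_D1D2, R1_D2D2, R2_D2D2, sigma1, sigma2, rho1):
    """V_d of (15): diagonal blocks from (16), off-diagonal block from (17)."""
    V11 = sigma1**2 * R1_D1D1
    V12 = rho1 * sigma1**2 * R1_D1D2
    V22 = sigma2**2 * R2_D2D2 + rho1**2 * sigma1**2 * R1_D2D2
    return np.block([[V11, V12], [V12.T, V22]])
```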

The maximum likelihood function is defined as follows:

$$ \mathcal{L}\left(\varDelta |\mathbf{d}\right)\propto {\left|{\mathbf{V}}_d\right|}^{-1/2}{\left|\mathbf{W}\right|}^{1/2}\mathit{\exp}\left\{-\frac{1}{2}{\left(\mathbf{d}-\mathbf{HB}\right)}^T{\mathbf{V}}_d^{-1}\left(\mathbf{d}-\mathbf{HB}\right)\right\} $$
(18)

where \( \mathbf{W}={\left({\mathbf{H}}^T{\mathbf{V}}_d^{-1}\mathbf{H}\right)}^{-1} \), and the PC coefficient matrix B in the PC-Kriging model can be derived using the first-order optimality condition:

$$ \overset{\frown }{\mathbf{B}}=\mathbf{W}{\mathbf{H}}^T{\mathbf{V}}_d^{-1}\mathbf{d} $$
(19)

The remaining parameters σ, Θ, ρ, and h can be obtained with a genetic algorithm or a simulated annealing algorithm by maximizing (18).
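
A minimal sketch of this estimation step is given below: B is concentrated out via (19), and the remaining hyper-parameters are searched globally by minimizing the negative log of (18) (here with scipy's differential evolution as a stand-in for the genetic algorithm mentioned above); the packed parameter vector, the build_H_Vd helper, and the bounds are illustrative assumptions rather than the authors' implementation.

```python
# Minimal sketch of maximizing (18): minimize the negative log-likelihood with
# B eliminated through (19). build_H_Vd is an assumed helper that assembles
# H of (13) and V_d of (15)-(17) from the packed hyper-parameter vector.
import numpy as np
from scipy.optimize import differential_evolution

def neg_log_likelihood(params, d, build_H_Vd):
    H, Vd = build_H_Vd(params)
    Vd_inv_d = np.linalg.solve(Vd, d)
    Vd_inv_H = np.linalg.solve(Vd, H)
    W = np.linalg.inv(H.T @ Vd_inv_H)
    B = W @ H.T @ Vd_inv_d                        # (19)
    resid = d - H @ B
    _, logdet_Vd = np.linalg.slogdet(Vd)
    _, logdet_W = np.linalg.slogdet(W)
    # -log L up to an additive constant, cf. (18)
    return 0.5 * (logdet_Vd - logdet_W + resid @ np.linalg.solve(Vd, resid))

# Global search over (sigma, theta, rho, h); the bounds below are placeholders.
# result = differential_evolution(neg_log_likelihood, bounds, args=(d, build_H_Vd))
```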

  4. Step 4.

    MCS is conducted on the multi-fidelity PC-Kriging metamodel constructed above to obtain the stochastic properties of the random output y.

Based on the GP modeling theory, all the collected data d together with the to-be-predicted responses \( \boldsymbol{y}\left({\mathbf{x}}_p\right)={\left[y\left({\mathbf{x}}_1\right),...,y\left({\mathbf{x}}_{n_p}\right)\right]}^T \) follow a multivariate normal distribution:

$$ \left[\begin{array}{c}\mathbf{d}\\ {}y\left({\mathbf{x}}_p\right)\end{array}\right]\sim \mathcal{N}\left(\left[\begin{array}{c}\mathbf{H}\\ {}{\mathbf{H}}_p\end{array}\right]\mathbf{B},\left[\begin{array}{cc}{\mathbf{V}}_d& {\mathbf{T}}_p^T\\ {}{\mathbf{T}}_p& {\mathbf{V}}_p\end{array}\right]\right) $$
(20)

Then, the final prediction of y(xp) can be calculated by:

$$ \overset{\frown }{\boldsymbol{y}}\left({\mathbf{x}}_p\right)={\mathbf{H}}_p\overset{\frown }{\mathbf{B}}+{\mathbf{T}}_p{\mathbf{V}}_d^{-1}\left(\mathbf{d}-\mathbf{H}\overset{\frown }{\mathbf{B}}\right) $$
(21)

where:

$$ {\mathbf{H}}_p=\left(\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right){\boldsymbol{\Phi}}^1\left({\mathbf{x}}_p\left(\xi \right)\right),\left(\prod \limits_{i=2}^{s-1}{\rho}_i\right){\boldsymbol{\Phi}}^2\left({\mathbf{x}}_p\left(\xi \right)\right),\dots, {\rho}_{s-1}{\boldsymbol{\Phi}}^{s-1}\left({\mathbf{x}}_p\left(\xi \right)\right),{\boldsymbol{\Phi}}^s\left({\mathbf{x}}_p\left(\xi \right)\right)\right) $$
(22)
$$ {\mathbf{T}}_p={\left({t}_1{\left({x}_p,{\mathbf{D}}_1\right)}^T,\dots, {t}_s{\left({x}_p,{\mathbf{D}}_s\right)}^T\right)}^T $$
(23)
$$ {t}_1\left({\mathbf{x}}_p,{\mathbf{D}}_1\right)=\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right){\sigma}_1^2{R}_1{\left({\mathbf{x}}_p,{\mathbf{D}}_1\right)}^T $$
(24)
$$ {t}_t\left({\mathbf{x}}_p,{\mathbf{D}}_t\right)={\rho}_{t-1}{t}_{t-1}\left({\mathbf{x}}_p,{\mathbf{D}}_t\right)+\left(\prod \limits_{i=t}^{s-1}{\rho}_i\right){\sigma}_t^2{R}_t{\left({\mathbf{x}}_p,{\mathbf{D}}_t\right)}^T,t=2,\dots, s $$
(25)
$$ {\mathbf{V}}_p={V}^s\left({\mathbf{x}}_p,{\mathbf{x}}_p\right)+{\rho}_{s-1}^2{V}^{s-1}\left({\mathbf{x}}_p,{\mathbf{x}}_p\right)+{\rho}_{s-1}^2{\rho}_{s-2}^2{V}^{s-2}\left({\mathbf{x}}_p,{\mathbf{x}}_p\right)+\dots +{\left(\prod \limits_{i=1}^{s-1}{\rho}_i\right)}^2{V}^1\left({\mathbf{x}}_p,{\mathbf{x}}_p\right) $$
(26)

MCS is employed directly on (21) to calculate the mean, standard deviation and probabilistic distribution, etc., of the random response y.
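
A minimal sketch of this last step is shown below: draw Monte Carlo samples of the random inputs, evaluate the predictor (21), and estimate the first four moments; predict_mf and sample_inputs are placeholders for a vectorized implementation of (21)-(26) and for the input-distribution sampler.

```python
# Minimal sketch of Step 4: MCS on the multi-fidelity predictor (21).
import numpy as np
from scipy import stats

def mcs_moments(predict_mf, sample_inputs, n_mc=10**5, seed=0):
    rng = np.random.default_rng(seed)
    Xp = sample_inputs(n_mc, rng)          # n_mc x d matrix of random input samples
    y = predict_mf(Xp)                     # predictions from (21) at the samples
    return {"mean": np.mean(y),
            "variance": np.var(y),
            "skewness": stats.skew(y),
            "kurtosis": stats.kurtosis(y, fisher=False)}   # non-excess kurtosis
```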

3.2 Multi-fidelity UP for non-hierarchical fidelity

For the non-hierarchical fidelity case, it is assumed that there are m lower-fidelity models with responses yq(x), q = 1, ..., m, whose fidelity cannot be ranked in advance, and one high-fidelity model yH(x).

  1. Step 1.

    Similar to step 1 of the hierarchical multi-fidelity modeling in Section 3.1, collect input data \( {\mathbf{D}}_q={\left[{\left({\mathbf{x}}_1^q\right)}^T,{\left({\mathbf{x}}_2^q\right)}^T,...,{\left({\mathbf{x}}_{n_q}^q\right)}^T\right]}^T \) and output response data \( {\mathbf{d}}_q={\left[{y}^q\left({\mathbf{x}}_1^q\right),...,{y}^q\left({\mathbf{x}}_{n_q}^q\right)\right]}^T \) for each lower-fidelity model. Let \( {\mathbf{d}}_L={\left[{\mathbf{d}}_1^T,...,{\mathbf{d}}_m^T\right]}^T \) denote the collected response data from all lower-fidelity models at the input sites ΓL = [D1; D2; ...; Dm], and \( {\mathbf{d}}_H={\left[{y}^H\left({\mathbf{x}}_1^H\right),...,{y}^H\left({\mathbf{x}}_{n_H}^H\right)\right]}^T \) denote the collected response data from the high-fidelity model at the input sites \( {\mathbf{D}}_H={\left[{\left({\mathbf{x}}_1^H\right)}^T,...,{\left({\mathbf{x}}_{n_H}^H\right)}^T\right]}^T \).

  2. Step 2.

    Construct the multi-fidelity PC-Kriging metamodel to replace yH(x).

According to the weighted summation method (Chen et al. 2016), yH(x) is represented as the weighted summation of all the lower-fidelity models yq(x), q = 1, ..., m, plus a residual discrepancy function δ(x):

$$ {y}^H\left(\mathbf{x}\right)=\sum \limits_{q=1}^m{\rho}^q{y}^q\left(\mathbf{x}\right)+\delta \left(\mathbf{x}\right) $$
(27)

where ρq denotes the weighting coefficient of model yq(x).

It is assumed that all the lower-fidelity models yq(x), q = 1, ..., m, and δ(x) are a priori independent and can each be represented as a GP to simplify the model fusion process. Construct the stochastic metamodel for each lower-fidelity model yq(x) using the PC-Kriging method, during which the same Gaussian correlation function R(x, x′, θ, h) is employed considering that all the lower-fidelity models describe the same physical process. Similarly, construct the PC-Kriging metamodel for δ(x) with the Gaussian correlation function Rδ(x, x′, θδ, hδ).

Based on (27), yH(x) can then be further expressed as a GP:

$$ {y}^H\left(\mathbf{x}\right)\sim \mathcal{GP}\left(M\left(\mathbf{x}\right),V\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)\right) $$
(28)

where \( M\left(\mathbf{x}\right)=\sum \limits_{q=1}^m{\boldsymbol{\Phi}}^q{\left(\mathbf{x}\right)}^T{\boldsymbol{b}}^q{\rho}^q+{\boldsymbol{\Phi}}^{\delta }{\left(\mathbf{x}\right)}^T{\boldsymbol{b}}^{\delta } \), \( V\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)={\boldsymbol{\rho}}^T\mathbf{E}\boldsymbol{\rho } R\left(\mathbf{x},{\mathbf{x}}^{\prime}\right)+{\sigma}_{\delta}^2{R}^{\delta}\left(\mathbf{x},{\mathbf{x}}^{\prime}\right) \), ρ = [ρ1,  … , ρm]T, \( \mathbf{E}=\left[\begin{array}{ccc}{\mathbf{E}}_{1,1}& \cdots & {\mathbf{E}}_{1,m}\\ {}\vdots & & \vdots \\ {}{\mathbf{E}}_{m,1}& \cdots & {\mathbf{E}}_{m,m}\end{array}\right] \), Ei,j is the unknown covariance between lower-fidelity models yi(x) and yj(x) calculated by \( {\mathbf{E}}_{i,j}={c}_{i,j}\sqrt{{\mathbf{E}}_{i,i}{\mathbf{E}}_{j,j}} \), and ci, j ∈ [−1, 1] is the unknown correlation coefficient.

  3. Step 3.

    Estimate the hyper-parameters by the maximum likelihood estimation method.

The hyper-parameters to be estimated are Δ = {b1, …, bm, bδ, Ε, ρ, θ, θδ, h, hδ}. Similar to step 3 of the hierarchical multi-fidelity modeling, all the collected response data d = [dL; dH] follow a multivariate Gaussian distribution as shown in (12), with some related matrices re-defined as follows:

$$ \mathbf{B}={\left[{\left({\boldsymbol{b}}^1\right)}^T,\dots, {\left({\boldsymbol{b}}^m\right)}^T,{\left({\boldsymbol{b}}^{\delta}\right)}^T\right]}^T $$
(29)
$$ \mathbf{H}=\left[\begin{array}{cccc}{\boldsymbol{\Phi}}^1\left({\mathbf{D}}_1\left(\xi \right)\right)& \cdots & \mathbf{0}& \mathbf{0}\\ {}\vdots & \ddots & \vdots & \vdots \\ {}\mathbf{0}& \cdots & {\boldsymbol{\Phi}}^m\left({\mathbf{D}}_m\left(\xi \right)\right)& \mathbf{0}\\ {}{\rho}^1{\boldsymbol{\Phi}}^1\left({\mathbf{D}}_H\left(\xi \right)\right)& \cdots & {\rho}^m{\boldsymbol{\Phi}}^m\left({\mathbf{D}}_H\left(\xi \right)\right)& {\boldsymbol{\Phi}}^{\delta}\left({\mathbf{D}}_H\left(\xi \right)\right)\end{array}\right] $$
(30)
$$ {\mathbf{V}}_d=\left[\begin{array}{cccc}{\boldsymbol{e}}_1^T\mathbf{E}{\boldsymbol{e}}_1R\left({\mathbf{D}}_1,{\mathbf{D}}_1\right)& \cdots & {\boldsymbol{e}}_1^T\mathbf{E}{\boldsymbol{e}}_mR\left({\mathbf{D}}_1,{\mathbf{D}}_m\right)& {\boldsymbol{e}}_1^T\mathbf{E}\boldsymbol{\rho } R\left({\mathbf{D}}_1,{\mathbf{D}}_H\right)\\ {}\vdots & \ddots & \vdots & \vdots \\ {}{\boldsymbol{e}}_m^T\mathbf{E}{\boldsymbol{e}}_1R\left({\mathbf{D}}_m,{\mathbf{D}}_1\right)& \cdots & {\boldsymbol{e}}_m^T\mathbf{E}{\boldsymbol{e}}_mR\left({\mathbf{D}}_m,{\mathbf{D}}_m\right)& {\boldsymbol{e}}_m^T\mathbf{E}\boldsymbol{\rho } R\left({\mathbf{D}}_m,{\mathbf{D}}_H\right)\\ {}{\boldsymbol{\rho}}^T\mathbf{E}{\boldsymbol{e}}_1R\left({\mathbf{D}}_H,{\mathbf{D}}_1\right)& \cdots & {\boldsymbol{\rho}}^T\mathbf{E}{\boldsymbol{e}}_mR\left({\mathbf{D}}_H,{\mathbf{D}}_m\right)& \begin{array}{l}{\boldsymbol{\rho}}^T\mathbf{E}\boldsymbol{\rho } R\left({\mathbf{D}}_H,{\mathbf{D}}_H\right)\\ {}+{\sigma}_{\delta}^2{R}^{\delta}\left({\mathbf{D}}_H,{\mathbf{D}}_H\right)\end{array}\end{array}\right] $$
(31)

where ei is an m-dimensional unit column vector whose ith element is 1 and whose other elements are 0.

Then, using the same MLE method in Section 3.1, the hyper-parameters can be estimated.
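
As an illustration of how (31) can be assembled, the sketch below covers the case of m = 2 lower-fidelity models; the dictionary of precomputed correlation blocks and the argument names are assumptions of this example.

```python
# Minimal sketch of (31) for m = 2 lower-fidelity models plus one HF model.
import numpy as np

def assemble_Vd_nonhier(R, R_delta_HH, E, rho, sigma_delta):
    """R: dict of correlation blocks keyed by pairs from {'1', '2', 'H'};
    E: 2 x 2 covariance matrix of the LF models; rho: [rho1, rho2]."""
    e = np.eye(2)
    rho = np.asarray(rho)
    def coeff(i, j):                 # e_i^T E e_j, or e_i^T E rho when j == 'H'
        return e[i] @ E @ (rho if j == "H" else e[j])
    V11 = coeff(0, 0) * R[("1", "1")]
    V12 = coeff(0, 1) * R[("1", "2")]
    V1H = coeff(0, "H") * R[("1", "H")]
    V22 = coeff(1, 1) * R[("2", "2")]
    V2H = coeff(1, "H") * R[("2", "H")]
    VHH = rho @ E @ rho * R[("H", "H")] + sigma_delta**2 * R_delta_HH
    return np.block([[V11, V12, V1H],
                     [V12.T, V22, V2H],
                     [V1H.T, V2H.T, VHH]])
```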

  4. Step 4.

    Predict the response values at the sample points xp similarly to step 4 of the hierarchical multi-fidelity modeling in Section 3.1. Meanwhile, some matrices are re-defined as follows:

$$ {\mathbf{H}}_p=\left[{\rho}^1{\boldsymbol{\Phi}}^1\left({\mathbf{x}}_p\left(\xi \right)\right),\dots, {\rho}^m{\boldsymbol{\Phi}}^m\left({\mathbf{x}}_p\left(\xi \right)\right),{\boldsymbol{\Phi}}^{\delta}\left({\mathbf{x}}_p\left(\xi \right)\right)\right] $$
(32)
$$ {\mathbf{T}}_p=\left[{\boldsymbol{\rho}}^T\mathbf{E}{e}_1R\left({\mathbf{x}}_p,{\mathbf{D}}_1\right),\dots, {\boldsymbol{\rho}}^T\mathbf{E}{e}_mR\left({\mathbf{x}}_p,{\mathbf{D}}_m\right),{\boldsymbol{\rho}}^T\mathbf{E}\boldsymbol{\rho } R\left({\mathbf{x}}_p,{\mathbf{D}}_H\right)+{\sigma}_{\delta}^2{R}^{\delta}\left({\mathbf{x}}_p,{\mathbf{D}}_H\right)\right] $$
(33)
$$ {\mathbf{V}}_p={\boldsymbol{\rho}}^T\mathbf{E}\boldsymbol{\rho } R\left({\mathbf{x}}_p,{\mathbf{x}}_p\right)+{\sigma}_{\delta}^2{R}^{\delta}\left({\mathbf{x}}_p,{\mathbf{x}}_p\right) $$
(34)

4 Comparative studies

In this section, the effectiveness of the two proposed multi-fidelity modeling approaches is tested on several mathematical examples with different nonlinearities and random input dimensions for UP.

4.1 Test for hierarchical fidelity

For hierarchical fidelity, the existing multi-fidelity PC method with the additive form proposed by Ng and Eldred (Ng and Eldred 2012) (denoted as MF-PC) and co-kriging, which has been commonly employed for metamodeling in the deterministic domain (Kennedy and O'Hagan 2000), are also employed for UP and compared to the proposed method (denoted as MF-PCK). The tested examples are shown in Table 1, in which \( \mathcal{B} \), \( \mathcal{U} \), and \( \mathcal{N} \) respectively represent the beta, uniform, and normal distributions. Examples 1 and 3–6 are adopted from the multi-fidelity modeling literature, while example 2 is created by the authors with a simple correction term to explore the effect of the scale factor in MF-PCK. Meanwhile, cases with nested (Di + 1 ⊆ Di, i = 1, …, s-1) and non-nested (Di + 1 ⊄ Di, i = 1, …, s-1) sample points are both tested to explore the impact of the sampling scheme on the accuracy of multi-fidelity modeling. The results generated by conducting MCS on the original high-fidelity response function (denoted as direct MCS for simplicity) are employed as the benchmark to validate the effectiveness of the proposed methods.

Table 1 Tested examples with hierarchical fidelity

Considering that high-fidelity sample points are often much fewer than low-fidelity ones, for examples 1, 2, 5, and 6 with two fidelity levels, the orders of the PC terms for the low-fidelity model y1(x) and the correction term between the two models are set to 5 and 3 (for δ2(x)), respectively. For examples 3 and 4 with three fidelity levels, the orders of the PC terms for the lowest-fidelity model y1(x) and the correction terms between the three models are set to 5, 3 (for δ2(x)), and 2 (for δ3(x)), respectively. The number of sample points employed for each response model during multi-fidelity modeling is listed in Table 3 for all the examples, considering the varying nonlinearity and complexity of each example.

Table 2 Random variables and distributions for example 6
Table 3 The number of sample points for each response model

Meanwhile, for the relatively high-dimensional problems (d = 6), the sparse index in (3) is set to q = 0.5 for example 5 and q = 0.25 for example 6 during the construction of all PC models/terms for both MF-PCK and MF-PC to save computational cost.

The first four statistical moments (mean, variance, skewness, and kurtosis) of the function response for the three multi-fidelity UP methods (MF-PCK, co-kriging, and MF-PC) are calculated by conducting MCS on the constructed multi-fidelity metamodels and are then compared to those obtained by direct MCS. Because the sample points for each model are generated randomly during the tests, the UP results may vary from run to run. Therefore, the simulation is repeated 20 times on each example to reduce the impact of sample randomness on UP. Taking one of the 20 simulations for illustration, the curves/surfaces of the function responses produced by the three methods for examples 1–4 (one- or two-dimensional problems) are shown in Figs. 1, 3, 5, 7, and 9, respectively, in which HF, MF-PCK, Co-kri, and MF-PC respectively denote the curves generated by the high-fidelity model, the proposed MF-PCK method, co-kriging, and the existing MF-PC method. The errors of the first four statistical moments relative to those of direct MCS are calculated as well, and their mean values over the 20 simulations for all the examples are shown in Tables 4, 5, 6, 7, 8, 9, and 10, respectively.

Fig. 1 Function response curves of example 1 (case 1)

Table 4 Relative errors of statistical moments for example 1 (case 1)
Table 5 Relative errors of statistical moments for example 1 (case 2)
Table 6 Relative errors of statistical moments for example 2
Table 7 Relative errors of statistical moments for example 3
Table 8 Relative errors of statistical moments for example 4
Table 9 Relative errors of statistical moments for example 5
Table 10 Relative errors of statistical moments for example 6
Fig. 2 Boxplots of statistical moments for example 1 (case 1)

Fig. 3 Function response curves of example 1 (case 2)

Fig. 4 Boxplots of statistical moments for example 1 (case 2)

Fig. 5 Function response curves for example 2

Fig. 6 Boxplots of statistical moments for example 2

Fig. 7 Function response curves of example 3

Fig. 8 Boxplots of statistical moments for example 3

Fig. 9 Function response curves of example 4

During the tests, it is found that the proposed MF-PCK method produces results that are very close to those of direct MCS, exhibiting high accuracy. However, with the same sample points for all three methods, the errors of the existing MF-PC method are evidently much larger than those of the other two approaches (MF-PCK and co-kriging) for most examples. Therefore, another UP test is conducted on MF-PC with an increased number of high-fidelity sample points; considering the space limit of this paper, only examples 1 and 2 are tested, with the number of high-fidelity sample points increased to 10 and 12, respectively. Correspondingly, the orders of the PC terms for the low-fidelity model and the correction term are increased to 10 and 8, respectively. The results of this test are shown in Tables 4, 5, and 6, in which "A," "B," "C," and "CI" respectively denote the results produced by MF-PCK, co-kriging, MF-PC, and MF-PC with increased high-fidelity sample points; "1" and "2" respectively denote the cases with nested and non-nested sample points; and em, ev, es, and ek respectively denote the relative errors of the mean, variance, skewness, and kurtosis of the output response with respect to MCS.

To more clearly show the robustness of the three multi-fidelity UP methods, the calculated statistical moments for all the examples are shown as boxplots in Figs. 2, 4, 6, 8, 10, 11, and 12. In the boxplots, the results of direct MCS are marked as MCS, the black dot represents the mean value of the data obtained from the 20 repeated UP runs, and the red star represents outliers (Figs. 6, 8, and 12). The top and bottom edges of each box are the upper and lower quartiles of the data from the 20 simulations, respectively, and the line within the box represents the median. Generally, the smaller the distance between the top and bottom edges of the box, the more concentrated the data and the more robust the method. From these tables (relative errors of statistical moments) and figures (function response curves/surfaces and boxplots of statistical moments), some noteworthy observations can be made.

Fig. 10 Boxplots of statistical moments for example 4

Fig. 11 Boxplots of statistical moments for example 5

Fig. 12 Boxplots of statistical moments for example 6

Firstly, it is noticed that for both nested and non-nested sample points with the same computational cost, the proposed MF-PCK method is generally the most accurate, followed by co-kriging and then MF-PC, with MF-PC producing relatively large errors compared to the other two approaches.

MF-PCK vs. co-kriging

MF-PCK is generally more accurate than co-kriging, and evidently so for examples 1, 4, 5, and 6. The interpretation is that although both approaches employ the GP modeling theory within the KOH framework to construct the metamodel for UP, a PC model and a constant are adopted to capture the global trend of the stochastic output response for MF-PCK and co-kriging, respectively. As is well known, the PC model can deal with various random input distribution types, including symmetric and asymmetric ones, as well as highly nonlinear output responses. Therefore, the conjunction of PC and GP in MF-PCK is more accurate for UP. Specifically, for example 1 (case 2), a beta (asymmetric) distribution is considered, which is difficult for co-kriging to handle, so the results of the proposed MF-PCK method are much more accurate. Example 6 is a high-dimensional problem with a large output variation, which is very difficult for co-kriging to approximate accurately; co-kriging therefore produces very large errors (see ev, es, and ek). However, it is observed that for example 2, although both approaches produce accurate results that are very close to those of direct MCS, some errors (es and ek) of co-kriging are slightly smaller than those of MF-PCK (see the bold, underlined numbers in Table 6). The reason is that example 2 involves a normal (symmetric) distribution and a relatively small output variation, so it is easier for co-kriging to describe the output response function with high accuracy. Meanwhile, because MF-PCK includes the PC term, more parameters must be estimated, which may introduce some numerical error during parameter estimation.

MF-PCK vs. MF-PC

The proposed MF-PCK method is clearly more accurate than the existing MF-PC method. By fusing some lower-fidelity data, the accuracy of the predicted response curves/surfaces, as well as of UP, can be evidently improved with only a few HF data within the proposed MF-PCK multi-fidelity framework. Although MF-PC can be expected to produce comparable results to MF-PCK (see the errors of CI-1 and CI-2 with increased HF sample points in Tables 4, 5, and 6), the number of HF sample points must clearly be increased. The interpretation is that the highly efficient KOH framework is employed for model fusion in MF-PCK, in which the scale factor ρ is optimized. The introduction of ρ can clearly reduce the nonlinearity and bumpiness of the correction term δt(x) (Park et al. 2016; Ren et al. 2016), which improves the accuracy of the multi-fidelity metamodel. For MF-PC, however, ρ is effectively fixed at ρ = 1. Taking examples 1 and 2 for illustration, as the ρ calculated by MF-PCK is ρ = 2 for example 1, the correction term δt(x) is completely linear and easy for PC-Kriging to approximate. For MF-PC, in contrast, ρ = 1 produces a nonlinear correction term δt(x), which evidently increases the difficulty of approximation. For example 2, since the ρ employed by MF-PCK and MF-PC are both ρ = 1, the correction terms δt(x) approximated by the two approaches are close to each other; therefore, the accuracy of MF-PC is clearly improved compared to that of example 1. In addition, in the proposed MF-PCK method a GP term is added to the PC model to enhance its local approximation ability, which can also improve the accuracy compared to MF-PC.

Secondly, from the boxplots shown in Figs. 2, 4, 6, 8, 10, and 11, it is observed that for MF-PCK and co-kriging with nested and non-nested sample points, the top and bottom edges of the boxes are generally very close to each other (some even appear to overlap, see Figs. 6 and 8), and the distances between the two edges are much smaller than those of MF-PC with the same sample points. This indicates that MF-PCK and co-kriging are more robust and stable than MF-PC for UP, as the data from the 20 simulations for the two approaches are more concentrated, which is attributed to the employment of the KOH framework in both approaches. Although MF-PC can also produce concentrated and comparable results, the order of PC and the number of high-fidelity sample points must be increased greatly (see the boxplots of CI-1 and CI-2 for examples 1 and 2). Meanwhile, it is also observed that the distances between the two edges for MF-PCK are generally smaller than those of co-kriging, indicating that MF-PCK is more robust and stable than co-kriging. However, for example 2, this distance for co-kriging is smaller than for MF-PCK (see Fig. 6). As stated above, this example is relatively easy to approximate, so the impact of the instability in parameter estimation for MF-PCK, which is weak in common cases, becomes more evident.

Thirdly, compared to the results with non-nested sample points, the accuracy with nested ones is generally slightly better for MF-PCK and co-kriging, while it is much better for MF-PC. The interpretation lies in two aspects. For MF-PC, the direct additive framework is employed, in which the PC model of the correction term δ(x) is directly constructed from the high-fidelity input sample points xH and the corresponding response difference values yH(xH) − yL(xH). For the nested case, yL(xH) is the exact response from the low-fidelity model, while for the non-nested one, yL(xH) is predicted by PC at xH, which inevitably introduces error; this error may be large when the PC model of the low-fidelity model is inaccurate. For the other two approaches, the KOH framework is employed, in which the PC-Kriging/kriging model of the correction term δ(x) is not directly and explicitly constructed from the response difference values as in MF-PC, but estimated by MLE based on all the collected response data from all the simulation models. Therefore, the prediction error is avoided, which helps ensure accuracy. In addition, the covariance among data with different fidelities is fully considered during the hyper-parameter estimation for MF-PCK and co-kriging (Liu et al. 2018), which can reduce the adverse impact of non-nested sample points to some extent. For MF-PC, by contrast, the PC coefficients of the LF model and the additive correction terms are directly calculated by regression, based on which the PC model of the HF model is constructed. It is also noticed that the results of MF-PC with nested sample points are much more accurate than those with non-nested ones for examples 3 and 4. The reason is that three multi-fidelity models are fused, involving two correction terms, which amplifies the impact of the prediction error on UP for MF-PC.

Fourthly, it is found from Figs. 1, 5, and 7 that the response curves generated by MF-PC are far from the HF response at the first and last HF points. The explanation is as follows. The weighted stochastic response surface method (WSRSM) (Xiong et al. 2011) is employed to construct the PC model for MF-PC, in which the Latin hypercube sampling method is used to generate sample points and the values of the joint probability density function at the sample points are used as the weights. It has been demonstrated that WSRSM, which considers the sample weights, is more accurate than the stochastic response surface method (SRSM) for UP. For example 1 (case 1, Fig. 1), example 2 (Fig. 5), and example 3 (Fig. 7), the random input follows a normal distribution, so the smallest weights are assigned to the first and last sample points, as they are furthest from the mean point. Therefore, the improvement in accuracy of the stochastic response surface at the first and last sample points is relatively smaller than at the points in the middle region.

As can be seen from Fig. 7, this phenomenon is more obvious for example 3. The reason is that example 3 involves three multi-fidelity models with normally distributed random inputs and three weighted stochastic response surfaces (two for the correction terms and one for the lowest-fidelity model), which amplifies the impact of the above prediction errors. For problems with uniformly distributed random inputs, this phenomenon is not very obvious. For the proposed MF-PCK method, the PC coefficients are calculated by the maximum likelihood estimation method rather than by WSRSM, so this phenomenon does not occur.

4.2 Test for non-hierarchical fidelity

For models with non-hierarchical fidelity, cases with nested (DH ⊆ Di, i = 1, …, m) and non-nested (DH ⊄ Di, i = 1, …, m) sample points are also tested to explore the impact of the sampling scheme on the accuracy of multi-fidelity modeling. The tested examples are displayed in Table 11. The MF-PC method can only be used when the accuracy of the multi-fidelity models can be ranked (the hierarchical-fidelity case), since it constructs the metamodel level by level, representing each higher-fidelity model as the sum of the adjacent lower-fidelity PC model and a PC correction model of the difference between them. For the non-hierarchical case, in which the accuracy of the multi-fidelity models cannot be ranked, MF-PC is no longer applicable; therefore, only the proposed MF-PCK method is tested.

Table 11 Tested examples with non-hierarchical fidelity

Similar to the hierarchical-fidelity cases, 20 repeated simulations are conducted. For all the examples, the orders of the PC terms for the lower-fidelity models and the correction term are set to 5 and 3, respectively. For example 1, the numbers of input sample points for the three models y1(x), y2(x), and y3(x) are set to n1 = 15, n2 = 15, and n3 = 8, and for example 2, n1 = 20, n2 = 20, and n3 = 10.

The response function curves, the errors of the first four statistical moments relative to those of MCS, and the boxplots of the statistical moments are shown in Figs. 13 and 15, Tables 12 and 13, and Figs. 14 and 16, respectively. From these results, it is observed that for the non-hierarchical case, the proposed non-hierarchical MF-PCK approach also produces sufficiently accurate and robust results. The response function curves approximated by MF-PCK are very close to those of the high-fidelity model, and the statistical moments produced by MF-PCK agree well with those of MCS while exhibiting high robustness. The existing MF-PC method, by contrast, cannot handle these cases.

Fig. 13 Function response curves of example 1

Table 12 Relative errors of statistical moments for example 1
Table 13 Relative errors of statistical moments for example 2
Fig. 14 Boxplots of statistical moments for example 1

Fig. 15 Function response curves of example 2

Fig. 16 Boxplots of statistical moments for example 2

In this work, the sample sizes for the multi-fidelity response models {n1, n2, …} in each example are determined by testing the accuracy of UP for different combinations of sample sizes; the combination that yields accurate UP results for MF-PCK at the minimum computational cost is selected. The other approaches (co-kriging and MF-PC) then use the same sample points for UP for comparison. As the tested examples are simple mathematical problems, it is relatively easy to obtain a good combination of sample sizes. However, this approach is impractical in real applications, especially for black-box problems. This paper aims to develop a new multi-fidelity modeling approach for UP to reduce the computational cost, and the focus lies in how to fuse models with different fidelities. How to efficiently determine the sample size for each response model is another research topic and is beyond the scope of this work. In the literature, many works related to this topic have been conducted. Some aim to develop sequential sampling strategies that obtain an optimal combination of sample sizes for a given computational budget and improve the accuracy of the multi-fidelity metamodel as much as possible (Guo et al. 2018; Zhang et al. 2018). Others propose resource allocation schemes to determine such a combination, so as to reduce the epistemic uncertainty of the model as much as possible (Jiang et al. 2016; Hu and Mahadevan 2018).

5 Application to airfoil optimization

The proposed MF-PCK method is applied to a benchmark aerodynamic design problem involving inviscid and viscous transonic flow past airfoil shapes, developed by the AIAA aerodynamic design optimization discussion group (Farin 1993). It aims at maximizing the lift-to-drag ratio of the modified NACA 0012 airfoil section at a free-stream Mach number of Ma = 0.7 and an angle of attack α = 3, subject to a thickness constraint.

In this work, a B-spline curve (Guo et al. 2018) with 10 control points is employed for the shape parameterization of the airfoil, where the horizontal locations of the 10 control points are fixed at x = [0.1 0.3 0.5 0.7 0.9 0.9 0.7 0.5 0.3 0.1] and the vertical locations y are free to move, as shown in Fig. 17.

Fig. 17 B-spline parameterization for the airfoil

The deterministic optimization (DO) problem is formulated as:

$$ {\displaystyle \begin{array}{l}\underset{y}{\mathit{\max}}\ f=L(y)/D(y)\\ {}\mathrm{s}.\mathrm{t}.\kern1em {t}_{\mathrm{max}}(y)\ge 0.1043\end{array}} $$
(35)

where y is the vector of design variables; L(y) and D(y) are the lift and drag, respectively; and tmax is the maximum airfoil thickness, which equals 0.1043 for the original baseline airfoil.
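
To make the setup concrete, a possible way to pose (35) with an off-the-shelf optimizer is sketched below; lift_drag_ratio and max_thickness are placeholders standing in for the CFD analysis and the B-spline geometry routine, and are not part of the benchmark code.

```python
# Minimal sketch of the deterministic optimization (35) with a generic
# gradient-free optimizer; the analysis functions are placeholders.
import numpy as np
from scipy.optimize import NonlinearConstraint, differential_evolution

def lift_drag_ratio(y):      # placeholder for L(y)/D(y) from the CFD analysis
    raise NotImplementedError

def max_thickness(y):        # placeholder for t_max(y) from the airfoil geometry
    raise NotImplementedError

# thickness_con = NonlinearConstraint(max_thickness, 0.1043, np.inf)
# bounds = [(-0.1, 0.1)] * 10          # illustrative bounds on the control points
# result = differential_evolution(lambda y: -lift_drag_ratio(y), bounds,
#                                 constraints=(thickness_con,))
```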

Robust optimization (RO) is performed on this problem considering uncertainties in the flight condition, i.e., the Mach number Ma and the angle of attack α. Ma and α are assumed to follow uniform distributions with variations of ±0.1 and ±1 around their nominal values, respectively. The robust airfoil optimization is formulated as follows:

$$ {\displaystyle \begin{array}{l}\underset{y}{\mathit{\max}}\ F={\mu}_f-k{\sigma}_f\\ {}\mathrm{s}.\mathrm{t}.\kern1em {t}_{\mathrm{max}}(y)\ge 0.1043\end{array}} $$
(36)

where μf and σf are the mean and standard deviation of the lift-to-drag ratio of the airfoil, and the weighting factor k is set to k = 3.
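
A minimal sketch of evaluating this robust objective with the MF-PCK surrogate is given below; mfpck_predict is a placeholder for a vectorized surrogate of the lift-to-drag ratio, the uniform ranges of Ma and α follow the description above, and the sample size is an illustrative choice. An objective of this form could then replace the deterministic objective in the optimizer sketch after (35).

```python
# Minimal sketch of the robust objective F = mu_f - k*sigma_f in (36), estimated
# by MCS on an assumed MF-PCK surrogate of the lift-to-drag ratio.
import numpy as np

def robust_objective(y_design, mfpck_predict, k=3.0, n_mc=10**4, seed=0):
    rng = np.random.default_rng(seed)
    Ma = rng.uniform(0.7 - 0.1, 0.7 + 0.1, n_mc)       # Mach number uncertainty
    alpha = rng.uniform(3.0 - 1.0, 3.0 + 1.0, n_mc)    # angle-of-attack uncertainty
    f = mfpck_predict(y_design, Ma, alpha)             # lift-to-drag ratio samples
    return np.mean(f) - k * np.std(f)
```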

During the optimization, the computational fluid dynamics (CFD) flow field solver Fluent 17.0 is employed to obtain the aerodynamic data, using the two-equation k-omega turbulence model and the steady-state density-based solver with Roe-FDS. Two models with different fidelities are considered according to the number of grid nodes employed in the CFD analysis. Table 14 shows the convergence results with different grid densities for the NACA0012 airfoil, from which it is found that the mesh density significantly impacts both the simulation time and the accuracy of the CFD results. When the mesh density is sufficiently high, the lift-drag ratio remains essentially unchanged (see the last two rows of Table 14). Therefore, in this work, the CFD analysis with 150 nodes in the far field of the grid and 300 on the airfoil boundary is considered as the HF model, while the CFD analysis with 100 nodes in the far field and 200 on the airfoil boundary is employed as the LF model.

Table 14 Convergence condition with different grid density for the NACA0012 airfoil

For the MF-PCK model, 5 HF sample points and 10 LF sample points are generated using the Latin hypercube sampling technique to construct the MF-PCK model for uncertainty propagation, and the orders of the PC terms for the LF model and the correction term are both set to 2. To test the accuracy of the MF-PCK method for UP with these sample points, an MF-PC model based on 8 HF sample points and 10 LF sample points generated by Latin hypercube sampling is also constructed for comparison, with the PC order set to 3. The results of MCS are used as the reference values. Table 15 shows the mean and standard deviation of the lift-drag ratio produced by the proposed MF-PCK method, MF-PC, and MCS for the original NACA0012 airfoil considering uncertainties in Ma and α. It is observed that the results of MF-PCK are very close to those of MCS, demonstrating its accuracy and effectiveness. Moreover, MF-PCK is essentially as accurate as MF-PC, while the number of HF model calls is clearly reduced. Therefore, in the following robust optimization, the MF-PCK model constructed with 5 HF and 10 LF sample points is employed for UP at each optimization iteration.

Table 15 Uncertainty propagation results of the NACA0012 airfoil

Robust optimization (RO) with MF-PCK and with MF-PC, as well as deterministic optimization (DO) without considering any uncertainties, are conducted for comparison; the results are shown in Figs. 18 and 19 and Table 16. For simplicity, the results of RO using the proposed MF-PCK and the existing MF-PC are denoted as RO-P and RO-E, respectively.

Fig. 18 Comparison of three optimized and baseline airfoils

Fig. 19 Comparison of static pressure contours

Table 16 Optimal results of different methods

Figure 18 shows the airfoils obtained by the two ROs and DO, as well as the baseline. It is observed that the two ROs, using the proposed MF-PCK UP method and the MF-PC UP method, produce very similar airfoils; their leading edges almost overlap. Meanwhile, the leading-edge thickness of the optimized designs is clearly reduced compared with the original baseline airfoil, which increases the critical Mach number and weakens the shock-wave region, thus reducing the drag coefficient of the airfoil. In addition, it is noticed that the trailing edge produced by DO bends downward, exhibiting the evident characteristics of a supercritical airfoil, which increases the lift coefficient. Therefore, compared to the two ROs, the lift-drag ratio obtained by DO is larger, which is verified in the following analysis.

Figure 19 illustrates the static pressure contours obtained by the two ROs and DO, as well as the baseline. It is observed that there is a strong shock-wave region (i.e., the junction of the dark blue triangular area and the light blue area on its right) on the upper surface of the baseline airfoil. The intensity of the shock wave is proportional to the pressure difference across this junction: the stronger the shock, the larger the wave drag and the smaller the lift-drag ratio. Clearly, the pressure differences across the junction for the airfoils obtained by all the optimized designs are smaller than that of the baseline, especially for the ROs. Therefore, the drag coefficient is clearly reduced through optimization, and it is reduced more for RO than for DO. From the results of Figs. 18 and 19, it is concluded that the increase in the lift-drag ratio for DO is mainly caused by the increase in lift, while for RO it is mainly caused by the decrease in drag.

The optimal results and computational cost of the optimized airfoils obtained by the different methods are shown in Table 16, from which it is found that the lift-drag ratio is increased after optimization compared to that of the baseline airfoil. Compared to DO, RO clearly improves the robustness of the design (smaller σf), making it less sensitive to uncertainties, at the cost of some nominal performance (smaller μf). RO with the proposed MF-PCK method produces results very close to those of RO with the existing MF-PC approach, while clearly reducing the computational time (415.29 vs. 489.72). These results agree well with the observations above and demonstrate the effectiveness and advantages of the proposed multi-fidelity UP method.

6 Conclusions

In this paper, a multi-fidelity PC approach using the Gaussian process modeling theory is developed to make the PC method more efficient and applicable to practical problems. With the proposed approach, the classic multi-level co-kriging modeling framework is extended from the deterministic domain to the stochastic one for UP, and it can deal with analysis models with both hierarchical and non-hierarchical fidelities. In comparative studies on several numerical examples for UP with the same computational cost, it is noticed that, compared to the commonly used additive-correction-based multi-fidelity PC method, the proposed approach can consistently reduce the error to at least 5%. Compared to co-kriging, the error generally can be reduced to about 50 to 12%, and to 10 to 0.1% for problems with asymmetrically distributed random inputs or large output variation. Meanwhile, compared to both existing methods, the proposed approach evidently enhances the robustness of UP. These results demonstrate the effectiveness and advantage of the proposed multi-fidelity PCK method. The application of the proposed multi-fidelity PCK approach to an engineering robust aerodynamic optimization problem further verifies its effectiveness and applicability in dealing with practical problems.

7 Replication of results

The results shown in the manuscript can be reproduced. Considering the size limit of the uploaded supplementary material, the code for one of the mathematical examples (example 1) is uploaded as supplementary material. The remaining examples can easily be reproduced by changing the response functions and sample points in the provided code to obtain the results shown in the manuscript.