1 Introduction

Practical engineering design optimization problems usually contain several objectives, at least two of which are conflicting in nature. Solving these problems therefore generally results in multiple optimal solutions, termed Pareto-optimal solutions. Multi-objective genetic algorithms (MOGAs) have been widely used for obtaining these Pareto-optimal solutions because of their advantages, including ease of implementation, no need for gradient information of the objective functions or constraints, and a strong ability to handle problems with nonconvex Pareto fronts. Despite these advantages, it is still impractical for them to solve engineering problems that involve time-consuming simulations, because MOGAs usually require a large number of fitness evaluations to locate near-optimal solutions. A promising way to improve the efficiency of such algorithms is to incorporate a surrogate model, also referred to as a metamodel or approximation model, into the evolutionary computation to reduce the number of computationally expensive exact fitness evaluations [1, 2]. Surrogate models, e.g., the Gaussian process (GP) model [3], neural networks (NN) [4], and radial basis functions (RBF) [5, 6], can be incorporated into almost all elements of the evolutionary computation, and the incorporation strategies can be classified into three types [7]. The first strategy is surrogate model-assisted migration, in which individuals approximated with different levels of accuracy can migrate from one subpopulation to another. The second is surrogate model-assisted initialization and genetic operations [8]. Since initialization, crossover, and mutation are usually implemented randomly, using the surrogate model to initialize the populations and guide the crossover and mutation is believed to be beneficial for accelerating the convergence rate. The last is surrogate model-assisted fitness evaluation, in which the surrogate model is constructed to replace the time-consuming simulations with the aim of reducing the number of fitness calculations [9,10,11]. Since surrogate model-assisted fitness evaluation yields the best performance among the above three strategies, there has been widespread interest in such approaches [12,13,14,15].

Surrogate model-assisted fitness evaluation approaches can be divided into two distinct modes, the off-line (non-adaptive) mode and the on-line (adaptive) mode [16,17,18,19,20,21]. In the off-line mode, a pre-specified number of sample points is employed to build a surrogate model, which is subsequently used for the fitness evaluations in the evolutionary computation. The main shortcoming of the off-line mode is that it is difficult to predetermine the proper sample size for obtaining an accurate surrogate model, so the reduction in the number of fitness evaluations may not be significant [22]. The on-line mode, on the other hand, generates an initial surrogate model first and then adaptively updates it during the evolutionary computation following some model management strategy. Compared with the off-line mode, the on-line mode can make use of the knowledge from previous iterations and is reported to be more efficient for evolutionary algorithms [23]. The core factor that determines the success of on-line surrogate model-assisted fitness evaluation is the model management strategy, i.e., deciding which individuals should be evaluated with the exact fitness functions to update the surrogate model during the evolutionary computation. The most straightforward idea is to evaluate the individuals with potentially the best fitness values, the largest degree of prediction uncertainty, or the best space-filling characteristic, or individuals that make a trade-off between fitness values and surrogate model accuracy. Randomly selecting individuals in each generation to evaluate with the real fitness function for updating the surrogate model has also been studied. Preliminary efforts have demonstrated that these management strategies with a pre-defined updating number may cause oscillation, because the prediction accuracy of the surrogate model may vary significantly during optimization. To address this issue, Li et al. [8] proposed an effective kriging surrogate model-assisted MOGA, in which an objective measure is developed to select the individuals whose domination status may change because of the prediction error of the surrogate model. Furthermore, Li [17] improved this method by introducing an enhanced quantitative switching criterion.

Although previous work demonstrated the obvious merits of surrogate-assisted MOGAs, for computationally expensive high-fidelity (HF) models, even the number of simulations needed to construct a surrogate model can be prohibitive [24,25,26,27,28,29,30,31,32]. To further alleviate the high computational cost of HF analyses, an efficient alternative termed multi-fidelity surrogate (MFS) modeling has been recognized recently. MFS modeling assumes that there exists a low-fidelity (LF) model, which is less accurate than the corresponding HF model but considerably less computationally demanding [33,34,35,36]. By integrating the information from both LF and HF models, i.e., using the LF model to provide the trend of the quantity of interest (QoI) and a small number of HF simulations to guarantee the prediction accuracy in the critical subspaces, MFS can make a trade-off between high accuracy and low computational expense [28, 37,38,39,40]. The most notable MFS is the cokriging surrogate model developed by Kennedy and O’Hagan [41], in which a Bayesian method was proposed for predicting the responses of the HF model with the assistance of several LF models. Han et al. [36] developed an extended cokriging surrogate model in which the cokriging weights were refined and a scaling factor was introduced to account for the effects of the LF data on the prediction of the HF model. Furthermore, Zhou et al. [42] developed an uncertainty quantification approach to concurrently treat the effects of uncertainties from design variables and the MFS in robust optimization. Nguyen et al. [43] combined the MFS with the multidisciplinary feasible approach to ease the computational burden caused by high-fidelity analyses in multidisciplinary design optimization (MDO).

Although MFS has been reported in engineering design, its combination with multi-objective evolutionary algorithms is scarce. For the case in which the computational cost of the LF model must be taken into consideration, Shu et al. [44] recently proposed an on-line MFS-assisted MOGA that decides whether the LF model or the HF model should be used to analyze a sample point. For the case in which the computational cost of the LF model can be ignored compared with that of the HF model, Liu et al. [45] combined an MFS, constructed with a multiplicative scaling function, with MOGA and successfully applied it to the light-weight design of a stiffened panel. However, the prediction uncertainty introduced by the MFS, which can have a significant effect on the accuracy of the obtained Pareto set, was ignored in that approach [46]. Therefore, in this work, a two-stage adaptive MFS-assisted MOGA (AMFS-MOGA) is developed to improve the accuracy of combining the MFS with MOGA. In the warm-up stage, a preliminary Pareto set is obtained relying only on data from the LF model. In the second stage, an initial MFS is constructed from the data of both LF and HF models at samples selected from the preliminary Pareto set according to the crowding distance in the objective space. The fitness values of the individuals are then evaluated by the MFS, which is adaptively updated according to two developed strategies, an individual-based updating strategy and a generation-based updating strategy. The former considers the prediction uncertainty of the MFS, while the latter takes the dispersion of the population into consideration. The performance of the proposed AMFS-MOGA approach is illustrated using three benchmark test functions and the design optimization of the hull of an autonomous underwater vehicle. Comparisons between the proposed AMFS-MOGA approach and several existing approaches are made in terms of the quality of the obtained Pareto frontiers and the computational efficiency, and the merits of the AMFS-MOGA approach are analyzed and summarized.

The remainder of this paper is organized as follows. In Sect. 2, the background and terminology of the multi-objective optimization and multi-fidelity surrogate model are presented. Details of the proposed AMFS-MOGA are introduced in Sect. 3. In Sect. 4, the comparison results between the proposed approach and some existing approaches on three benchmark test functions and a real-world engineering design problem are presented. In Sect. 5, the concluding remarks and possible future work are given.

2 Background and terminology

2.1 Multi-objective optimization problems (MOPs)

Generally, MOPs can be formulated as,

$$\begin{aligned} & {\text{minimize}}\quad F(\varvec{x}) = \left\{ {f_{1} (\varvec{x}),f_{2} (\varvec{x}), \ldots ,f_{m} (\varvec{x})} \right\} \\ & {\text{subject to}}\quad g_{j} (\varvec{x}) \le 0,\quad j = 1,2, \ldots ,J \\ & \qquad \qquad \quad \varvec{x}_{\text{lb}} \le \varvec{x} \le \varvec{x}_{\text{ub}} , \end{aligned}$$
(1)

where \(F(\varvec{x})\) denotes the objective function vector, \(m\) denotes the number of objective functions, and \(\varvec{x}\) denotes the design variable vector with lower and upper bounds \(\varvec{x}_{\text{lb}}\) and \(\varvec{x}_{\text{ub}}\), respectively. \(g_{j} (\varvec{x})\) is the jth constraint. Since at least two of the objective functions are conflicting, solving Eq. (1) generally results in multiple optimal solutions termed Pareto-optimal solutions. The objective function values at these Pareto-optimal solutions form the Pareto frontier. A large number of MOGAs can be used to obtain the Pareto frontier. In particular, the non-dominated sorting genetic algorithm II (NSGA-II) proposed by Deb et al. [47] is used in this work.
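Since the rest of the paper builds on the notion of Pareto dominance used by NSGA-II, a minimal sketch of the dominance test and of extracting the non-dominated subset of a population is given below (assuming all objectives are minimized). The function names `dominates` and `non_dominated` are ours, for illustration only, and are not tied to any particular NSGA-II implementation.

```python
import numpy as np


def dominates(f_a, f_b):
    """Return True if objective vector f_a Pareto-dominates f_b (minimization).

    f_a dominates f_b if it is no worse in every objective and strictly
    better in at least one.
    """
    f_a, f_b = np.asarray(f_a), np.asarray(f_b)
    return bool(np.all(f_a <= f_b) and np.any(f_a < f_b))


def non_dominated(front):
    """Extract the non-dominated subset of a set of objective vectors."""
    front = np.asarray(front)
    keep = [i for i, fi in enumerate(front)
            if not any(dominates(fj, fi)
                       for j, fj in enumerate(front) if j != i)]
    return front[keep]
```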

2.2 Multi-fidelity surrogate model

The motivation of MFS modeling is that many cheap LF sample points are adopted to reduce the computational cost, while a limited number of expensive HF sample points are used to ensure the prediction accuracy of the surrogate model. Note that this motivation is based on the assumption that the LF model can provide the general trend of the QoI. Three common ways to obtain a LF model are [33, 46, 48]: (a) simplifying the analysis model (e.g., by using a coarse finite element mesh instead of a refined mesh); (b) simplifying the modeling concept or domain [e.g., by using a two-dimensional (2D) model instead of a three-dimensional (3D) one]; and (c) simplifying the mathematical or physical description (e.g., by using the inviscid Euler equations instead of the viscous Navier–Stokes equations). It should be noted that LF and HF models are relative concepts. Take the design of an airfoil, an aerodynamic component, as an example. For obtaining the aerodynamic coefficients, a 3D computational fluid dynamics (CFD) simulation with the Navier–Stokes equations could be termed a HF model when compared with a 2D CFD simulation with the Euler equations, whereas it would be treated as a LF model when wind tunnel experiments are available.

Generally, the MFS based on the interaction of the HF model and the LF model can be expressed as [49],

$$\hat{F}(\varvec{x},\varvec{a}) \equiv \hat{F}(f^{l} (\varvec{x}),\varvec{a}) \approx F(\varvec{x}),$$
(2)

where \(\varvec{x}\) is the design vector, \(\hat{F}(\varvec{x},\varvec{a})\) denotes the MFS that is used to replace the actual HF model, \(f^{l} (\varvec{x})\) represents the response of the LF model, \(F(\varvec{x})\) represents the actual response of the HF model, and \(\varvec{a}\) is a vector of tuning parameters used to minimize the discrepancy between the LF and HF models. From the above definition, the MFS \(\hat{F}(\varvec{x},\varvec{a})\) aims to approach the high accuracy of the HF model at considerably less computational effort.

3 The proposed approach

The goal of the proposed AMFS-MOGA approach is to improve the efficiency of the MOGA by adopting a two-stage adaptive MFS approach. In the warm-up stage, a preliminary Pareto front is obtained with the LF model or a LF surrogate model. In the second stage, a set of individuals from the preliminary Pareto front is selected to be simulated by the HF model. These HF sample data are then fused with the LF model or LF surrogate model to construct the MFS model. The preliminary Pareto set is re-evaluated and used as the initial population, and the MFS model is used for the fitness evaluations in the second stage to obtain the final Pareto front. During the evolutionary process, two model management strategies, an individual-based updating strategy and a generation-based updating strategy, are developed to further improve the efficiency and convergence of the proposed approach. In the following subsections, the core ideas of the AMFS-MOGA approach are presented in two parts: (1) the multi-fidelity surrogate model approach and (2) the model management strategies. The flowchart of the proposed method is plotted in Fig. 1.

Fig. 1

The flowchart of the proposed method

3.1 Multi-fidelity surrogate model approach

Commonly used MFS approaches are scaling methods, in which the MFS is obtained by tuning the LF model with a scaling function according to the responses of the HF model [37, 50]. In this work, the MFS approach proposed in our previous work [51] is adopted, where the LF model or a tuned LF surrogate model is taken as a base model and is mapped to the studied HF model using a kriging surrogate model. For completeness, a brief review of this MFS approach is presented here; a more detailed description can be found in Ref. [51]. In the MFS approach, if the relationships between the input variables and the corresponding responses of the LF model can be expressed explicitly, the LF model is used directly without constructing a surrogate model. Otherwise, a LF surrogate model needs to be built, e.g., using kriging, for the MFS. To tune the LF surrogate model and make it as close to the HF model as possible, the following optimization problem is solved,

$$\begin{aligned} & \hbox{min} :\;L(a_{0} ,a_{1} ) = \sum\limits_{i = 1}^{m_{h}} {\left[ {\left( {a_{0} + a_{1} \hat{f}^{l} (x_{i}^{h} )} \right) - f^{h} (x_{i}^{h} )} \right]^{2} } \\ & {\text{s}} . {\text{t}} .\quad l_{0} \le a_{0} \le u_{0} ,\quad l_{1} \le a_{1} \le u_{1} , \end{aligned}$$
(3)

where \(L(a_{0} ,a_{1} )\) represents the loss function in the least-square-error sense, \(x_{i}^{h}\) is the ith of the \(m_{h}\) sample points of the HF model, and \(f^{h}\) and \(\hat{f}^{l}\) denote the HF response and the LF surrogate prediction, respectively. The bounds on the tuning parameters represent the prior knowledge of the global constant bias and the multiplicative scaling between the LF and HF models, respectively.
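As a small illustration of this tuning step, the sketch below solves Eq. (3) with SciPy's bounded minimizer, assuming the LF surrogate \(\hat{f}^{l}\) is available as a Python callable. The function name, default bounds, and starting point are illustrative assumptions, not part of the original method description.

```python
import numpy as np
from scipy.optimize import minimize


def tune_lf_surrogate(f_l_hat, x_h, f_h, bounds=((-1.0, 1.0), (0.5, 2.0))):
    """Find (a0*, a1*) minimizing the squared error between the scaled LF
    surrogate and the HF responses at the HF sample points (Eq. 3).

    `bounds` encodes the prior knowledge on the additive bias a0 and the
    multiplicative scaling a1; the values given here are placeholders.
    """
    f_l_at_xh = np.array([f_l_hat(x) for x in x_h])
    f_h = np.asarray(f_h)

    def loss(a):
        a0, a1 = a
        return np.sum((a0 + a1 * f_l_at_xh - f_h) ** 2)

    res = minimize(loss, x0=np.array([0.0, 1.0]),
                   bounds=bounds, method="L-BFGS-B")
    return res.x  # (a0*, a1*)
```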

Once the LF model or the tuned LF surrogate model is obtained, the scaling process is implemented as follows. For a given HF sample set \(\varvec{x}^{h} = \left\{ {\varvec{x}_{1}^{h} ,\varvec{x}_{2}^{h} , \ldots ,\varvec{x}_{{m_{h} }}^{h} } \right\}\) with \(m_{h}\) HF sample points and the corresponding responses \(\varvec{f}^{h} = \left\{ {f_{1}^{h} ,f_{2}^{h} , \ldots ,f_{{m_{h} }}^{h} } \right\}\), the discrepancies \(\varvec{C}\left( \varvec{x} \right) = \left\{ {c(\varvec{x}_{1}^{h} ),c(\varvec{x}_{2}^{h} ), \ldots ,c(\varvec{x}_{{m_{h} }}^{h} )} \right\}\) between the HF model and the LF model/surrogate at a HF sample point \(x_{i}^{h}\) are calculated as,

$$c(x_{i}^{h} ) = \left\{ \begin{aligned} & f^{h} (x_{i}^{h} ) - f^{l} (x_{i}^{h} ) \quad ({\text{if the LF model is used}}) \\ & f^{h} (x_{i}^{h} ) - a_{0}^{*} - a_{1}^{*} \hat{f}^{l} (x_{i}^{h} ) \quad ({\text{if the LF surrogate is used}}), \end{aligned} \right.$$
(4)

where \(f^{h} (x_{i}^{h} )\) is the actual response of the HF model at \(x_{i}^{h}\), \(f^{l} (x_{i}^{h} )\) is the real response of the LF model for \(x_{i}^{h}\), and \(\hat{f}^{l} (x_{i}^{h} )\) is the predicted value of the LF surrogate model for \(x_{i}^{h}\).

Based on the HF sample set \(\varvec{x}^{h} = \left\{ {\varvec{x}_{1}^{h} ,\varvec{x}_{2}^{h} , \ldots ,\varvec{x}_{{m_{h} }}^{h} } \right\}\) and corresponding discrepancy or scaling data \(\varvec{C}\left( \varvec{x} \right) = \left\{ {c(\varvec{x}_{1}^{h} ),c(\varvec{x}_{2}^{h} ), \ldots ,c(\varvec{x}_{{m_{h} }}^{h} )} \right\}\), the scaling function \(\varvec{C}\left( \varvec{x} \right)\) modeled using the kriging can be expressed as,

$$\begin{aligned} & \left\{ \begin{aligned} \hat{C}\left( \varvec{x} \right) & = \hat{\beta }_{h} + \varvec{r}_{h}^{\text{T}} \varvec{R}_{h}^{ - 1} \left( {\varvec{f}^{h} \left( \varvec{x} \right) - \varvec{f}^{l} \left( \varvec{x} \right) - \hat{\beta }_{h} \varvec{p}} \right) \\ \hat{\beta }_{h} & = \left( {\varvec{p}^{\text{T}} \varvec{R}_{h}^{ - 1} \varvec{p}} \right)^{ - 1} \varvec{p}^{\text{T}} \varvec{R}_{h}^{ - 1} \left( {\varvec{f}^{h} \left( \varvec{x} \right) - \varvec{f}^{l} \left( \varvec{x} \right)} \right) \end{aligned} \right. \quad ({\text{if the LF model is used}}) \\ & \left\{ \begin{aligned} \hat{C}\left( \varvec{x} \right) & = \hat{\beta }_{h} + \varvec{r}_{h}^{\text{T}} \varvec{R}_{h}^{ - 1} \left( {\varvec{f}^{h} \left( \varvec{x} \right) - \left( {a_{0}^{*} + a_{1}^{*} \hat{\varvec{f}}^{l} \left( \varvec{x} \right)} \right) - \hat{\beta }_{h} \varvec{p}} \right) \\ \hat{\beta }_{h} & = \left( {\varvec{p}^{\text{T}} \varvec{R}_{h}^{ - 1} \varvec{p}} \right)^{ - 1} \varvec{p}^{\text{T}} \varvec{R}_{h}^{ - 1} \left( {\varvec{f}^{h} \left( \varvec{x} \right) - \left( {a_{0}^{*} + a_{1}^{*} \hat{\varvec{f}}^{l} \left( \varvec{x} \right)} \right)} \right) \end{aligned} \right. \quad ({\text{if the LF surrogate is used}}), \end{aligned}$$
(5)

where \(\varvec{r}_{h} \in \mathbb{R}^{m_{h}}\) and \(\varvec{R}_{h} \in \mathbb{R}^{m_{h} \times m_{h}}\) denote the correlation vector and correlation matrix, respectively, and \(\varvec{p}\) is a column vector of length \(m_{h}\) filled with ones.

After the LF model or LF surrogate model and the scaling function are constructed, the MFS that is used to approximate the HF model can be expressed as,

$$\hat{\varvec{f}}_{\text{vf}} \left( \varvec{x} \right) = \left\{ \begin{aligned} & \varvec{f}^{l} \left( \varvec{x} \right) + \hat{\varvec{C}}\left( \varvec{x} \right) \quad ({\text{if the LF model is used}}) \\ & a_{0}^{*} + a_{1}^{*} \hat{\varvec{f}}^{l} \left( \varvec{x} \right) + \hat{\varvec{C}}\left( \varvec{x} \right) \quad ({\text{if the LF surrogate is used}}). \end{aligned} \right.$$
(6)
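For concreteness, a minimal sketch of Eqs. (4)–(6) for the case where the LF model is cheap enough to call directly is given below. scikit-learn's `GaussianProcessRegressor` is used here only as a stand-in for the kriging scaling model of Eq. (5); the function name `build_mfs` and the kernel choice are illustrative assumptions rather than the implementation of Ref. [51].

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ConstantKernel


def build_mfs(f_l, x_h, f_h):
    """Fit a GP to the HF-LF discrepancy (Eq. 4) and add it back to the
    LF prediction (Eq. 6).  f_l is the cheap LF model as a callable,
    x_h the HF sample points, f_h the HF responses."""
    x_h = np.atleast_2d(x_h)
    c = np.asarray(f_h) - np.array([f_l(x) for x in x_h])      # Eq. (4)
    gp_c = GaussianProcessRegressor(kernel=ConstantKernel() * RBF(),
                                    normalize_y=True).fit(x_h, c)

    def predict(x, return_std=False):
        """MFS prediction; optionally return the scaling-function std."""
        x = np.atleast_2d(x)
        c_hat, c_std = gp_c.predict(x, return_std=True)         # Eq. (5)
        f_vf = np.array([f_l(xi) for xi in x]) + c_hat          # Eq. (6)
        return (f_vf, c_std) if return_std else f_vf

    return predict
```

When a LF surrogate is used instead, the same construction applies with \(a_{0}^{*} + a_{1}^{*}\hat{f}^{l}(\varvec{x})\) in place of \(f^{l}(\varvec{x})\).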

3.2 Model management strategies

Model management strategies have a significant impact on the success of surrogate model-based MOGA methods. The core of a model management strategy is to identify which sample points should be selected from the current population to improve the prediction accuracy of the surrogate model. In the proposed AMFS-MOGA, two model management strategies are adopted: an individual-based updating strategy, which takes the interpolation uncertainty of the MFS model into consideration, and a generation-based updating strategy, which improves the dispersion of the population. In the following subsections, these two strategies are described in more detail.

3.2.1 Individual-based updating strategy

The fitness values of the individuals predicted by the MFS model carry prediction uncertainty, which may mislead the evolutionary process. Therefore, an individual-based updating strategy is developed to take the interpolation uncertainty of the MFS model into consideration. As mentioned, although this work focuses on the case in which the computational cost of the LF model can be ignored compared with that of the HF model, either the LF model or the LF surrogate model may be used in the MFS. This leads to two different ways of quantifying the interpolation uncertainty of the MFS. On the one hand, when the LF model is directly used in the MFS, the interpolation uncertainty of the MFS comes only from the scaling function \(\hat{\varvec{C}}\left( \varvec{x} \right)\). The predicted variance of the scaling function \(\hat{\varvec{C}}\left( \varvec{x} \right)\) can be calculated as,

$$\sigma_{\text{c}}^{2} (x_{\text{o}} ) = \hat{\sigma }_{\text{c}}^{2} \left[ {1 - (\varvec{r}_{\text{c}} )^{\text{T}} (\varvec{R}_{\text{c}} )^{ - 1} \varvec{r}_{\text{c}} + \frac{{(1 - \varvec{p}^{\text{T}} (\varvec{R}_{\text{c}} )^{ - 1} \varvec{r}_{\text{c}} )^{2} }}{{\varvec{p}^{\text{T}} (\varvec{R}_{\text{c}} )^{ - 1} \varvec{p}}}} \right].$$
(7)

Therefore, the prediction interval \(I(x_{\text{o}} )\) for an individual \(x_{\text{o}}\) from the MFS can be modeled as an interval with a 95.5% confidence level. The bounds of the prediction interval \(I_{95.5\% } (x_{\text{o}} )\) are then two standard deviations (\(2\sigma_{\text{c}} (x_{\text{o}} )\)) on each side of the mean.

On the other hand, when the LF surrogate model is used, the predicted variance of the MFS is quantified as the sum of the predicted variances of the two surrogate models, one for the LF surrogate model \(\hat{\varvec{f}}^{l} \left( \varvec{x} \right)\) and the other for the scaling function \(\hat{\varvec{C}}\left( \varvec{x} \right)\). It can be calculated by

$$\sigma_{\text{mfs}}^{2} (x_{\text{o}} ) = (a_{1}^{*} )^{2} \sigma_{\text{l}}^{2} (x_{\text{o}} ) + \sigma_{\text{c}}^{2} (x_{\text{o}} ),$$
(8)

where \(\sigma_{\text{l}}^{2} (x_{\text{o}} )\) is the predicted variance of the LF surrogate model, which can be calculated by

$$\sigma_{\text{l}}^{2} (x_{\text{o}} ) = \hat{\sigma }_{\text{l}}^{2} \left[ {1 - (\varvec{r}_{\text{l}} )^{\text{T}} (\varvec{R}_{\text{l}} )^{ - 1} \varvec{r}_{\text{l}} + \frac{{(1 - \varvec{p}^{\text{T}} (\varvec{R}_{\text{l}} )^{ - 1} \varvec{r}_{\text{l}} )^{2} }}{{\varvec{p}^{\text{T}} (\varvec{R}_{\text{l}} )^{ - 1} \varvec{p}}}} \right].$$
(9)
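Assuming the standard deviations \(\sigma_{\text{c}}\) and \(\sigma_{\text{l}}\) are available (e.g., returned by the kriging/GP models above), Eq. (8) reduces to a one-line computation. The helper name below is ours.

```python
def mfs_variance(sigma_c, sigma_l=None, a1_star=1.0):
    """Prediction variance of the MFS at a point (Eqs. 7-9).

    If the LF model is used directly, only the scaling-function variance
    sigma_c**2 contributes; with a LF surrogate, the scaled LF surrogate
    variance is added (Eq. 8)."""
    if sigma_l is None:          # LF model used directly
        return sigma_c ** 2
    return (a1_star ** 2) * sigma_l ** 2 + sigma_c ** 2   # Eq. (8)
```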

Note that as long as the domination status of an individual cannot change due to the interpolation uncertainty of the MFS model, its fitness value can be predicted by the MFS model instead of the expensive simulation models. However, if the domination status of the individual may change, its fitness value should be evaluated with the simulation models. In this work, the objective switching criterion [52], which relates the minimum of minimum distance (MMD) to the prediction interval, is extended to the MFS scenario to determine whether the simulation models or the MFS model should be used to evaluate the fitness of individuals. In each generation, the MMD, which measures the lower bound of the distance between the points in the dominated and non-dominated sets, is calculated. The calculation process for the MMD is illustrated in Fig. 2, in which the distance between two points, e.g., \(d(A,a)\), is the Euclidean distance.

Fig. 2

The illustration of the calculation process for MMD
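A sketch of the MMD computation of Fig. 2 is given below, assuming the current population has already been split into non-dominated and dominated objective vectors (e.g., with the dominance helpers in Sect. 2.1). The function name is illustrative.

```python
import numpy as np


def minimum_of_minimum_distance(non_dominated_f, dominated_f):
    """Lower bound of the Euclidean distance between the non-dominated set
    and the dominated set in objective space (MMD)."""
    nd = np.atleast_2d(non_dominated_f)
    d = np.atleast_2d(dominated_f)
    # pairwise Euclidean distances, then take the smallest one
    dists = np.linalg.norm(nd[:, None, :] - d[None, :, :], axis=-1)
    return dists.min()
```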

The different relationships between the MMD and the prediction interval result in two different scenarios for the individuals. For simplicity, the two scenarios are illustrated in one dimension of the objective space in Figs. 3 and 4, respectively.

Fig. 3

Two scenarios for the individuals when the LF model is used

Fig. 4

Two scenarios for the individuals when LF surrogate model is used

As can be seen in Figs. 3a and 4a, no matter how the individuals “A” and “a” move within the corresponding prediction intervals \(I_{95.5\% } (A)\) and \(I_{95.5\% } (a)\), the individual “a” is always dominated by the individual “A”. In these scenarios, although the MFS model has interpolation uncertainty at the individuals “A” and “a”, this uncertainty does not affect the domination status of the two individuals at the given confidence level. Therefore, there is no need to add these individuals as new sample points for updating the MFS model in the subsequent evolutionary process. On the contrary, as seen in Figs. 3b and 4b, the prediction interval of the non-dominated individual “A” overlaps with that of the dominated individual “a”. This means that the domination status of the individual “A” could change because of the interpolation uncertainty of the MFS model. These two individuals, “A” and “a”, should then be sent to HF analysis to avoid misleading the search. In summary, the individuals that satisfy the following condition are sent for HF analysis,

$$\frac{1}{2}I_{95.5\% } (A) + \frac{1}{2}I_{95.5\% } (a) \ge {\text{MMD}}(f_{1} ),$$
(10)

where

$$\left\{ \begin{aligned} & \frac{1}{2}I_{95.5\% } (A) = 2\sigma_{\text{c}} (A),\quad \frac{1}{2}I_{95.5\% } (a) = 2\sigma_{\text{c}} (a) \quad ({\text{if the LF model is used}}) \\ & \frac{1}{2}I_{95.5\% } (A) = 2\sigma_{\text{mfs}} (A),\quad \frac{1}{2}I_{95.5\% } (a) = 2\sigma_{\text{mfs}} (a) \quad ({\text{if the LF surrogate is used}}). \end{aligned} \right.$$
(11)

Figure 5 depicts the schematic of this developed individual-based updating strategy. Notice that the MFS will be updated after each generation by adding the individuals according to the developed individual-based updating strategy. Table 1 provides the algorithm of the developed individual-based updating strategy.
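Putting Eqs. (10) and (11) together, the individual-based check can be sketched as follows; `sigma_A` and `sigma_a` stand for either \(\sigma_{\text{c}}\) or \(\sigma_{\text{mfs}}\), depending on whether the LF model or the LF surrogate is used, and the function name is illustrative.

```python
def needs_hf_evaluation(sigma_A, sigma_a, mmd):
    """Eq. (10): send the pair (A, a) to the HF model if their 95.5%
    half-intervals (two standard deviations each) reach across the MMD."""
    return 2.0 * sigma_A + 2.0 * sigma_a >= mmd
```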

Fig. 5

The schematic plot of the individual-based updating strategy

Table 1 The algorithm of individual-based updating strategy

3.2.2 Generation-based updating strategy

The uniformity of the Pareto set may not be good if the surrogate is updated only according to the above individual-based updating strategy. To improve the diversity of the Pareto set, a generation-based updating strategy is developed, in which the point with the maximum degree of diversity is selected for HF analysis after a fixed number of generations. Such a point is obtained by solving the following problem

$$\begin{aligned} & {\text{Find}}:\;x \\ & {\text{Maximize}}:\;\mathop {\hbox{min} }\limits_{1 \le m \le l} \,d(x,x_{m}^{0} ),\quad x_{m}^{0} \in X_{{P_{m} }} , \end{aligned}$$
(12)

where \(d(x,x_{m}^{0} )\) is defined to be the Euclidean distance between \(\varvec{x}\) and the mth training point in the current HF sampling set \(X_{{P_{m} }}\).
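A sketch of this maximin selection over a candidate population, with the existing HF sample set as reference, is given below; the names are illustrative.

```python
import numpy as np


def select_most_diverse(candidates, hf_samples):
    """Pick the candidate whose nearest HF training point is farthest away
    (Eq. 12), i.e. the point with the best space-filling contribution."""
    cand = np.atleast_2d(candidates)
    hf = np.atleast_2d(hf_samples)
    # distance from each candidate to its nearest HF sample point
    nearest = np.linalg.norm(cand[:, None, :] - hf[None, :, :],
                             axis=-1).min(axis=1)
    return cand[np.argmax(nearest)]
```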

Figure 6 depicts a schematic of the developed generation-based updating strategy. The algorithm for selecting such individuals is listed in Table 2.

Fig. 6

The schematic plot of the generation-based updating strategy

Table 2 The algorithm of generation-based updating strategy

3.3 Steps for the proposed AMFS-MOGA approach

As a supplement to the flowchart of the proposed approach depicted in Fig. 1, detailed steps of the proposed approach are presented as follows:

Stage I

Step 1::

Initialize the population of NSGA-II.

Step 2::

Obtain a preliminary Pareto frontier by NSGA-II, in which the fitness values for each individual are evaluated by the LF model or LF surrogate model.

Stage II

Step 3::

Select a pre-specified number of individuals from the preliminary Pareto frontier based on the crowding distance in the objective space. These individuals are then sent to the HF model for analysis.

Step 4::

Construct the MFS according to the approach described in Sect. 3.1.

Step 5::

Initialize the generation counter \(N = 1\), evaluate the fitness values of each individual using the constructed MFS, and obtain the Pareto front by NSGA-II.

Step 6::

Update the MFS using the two model management strategies described in Sect. 3.2. Notice that the generation-based updating strategy is implemented only when the current generation number is a multiple of the pre-defined generation updating interval k.

Step 7::

Update the generation counter \(N = N + 1.\)

Step 8::

Check whether the stopping criterion is satisfied. If yes, go to Step 9; otherwise, go back to Step 6.

Step 9::

Output the obtained Pareto set.
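To summarize the second-stage loop (Steps 5–9), a high-level sketch is given below. Every component is passed in as a callable, so the names `mfs_predict`, `mfs_update`, `pick_uncertain`, `pick_diverse`, and `nsga2_generation` are placeholders for the pieces described in Sects. 3.1 and 3.2, not a reference implementation; the stopping criterion is simplified to a maximum generation count.

```python
def amfs_moga_stage_two(population, mfs_predict, mfs_update, hf_model,
                        nsga2_generation, pick_uncertain, pick_diverse,
                        k=5, max_gen=100):
    """High-level sketch of Stage II.

    mfs_predict(pop) -> (fitness, std); mfs_update(x, f) refits the MFS with
    a new HF point; pick_uncertain implements Eq. (10); pick_diverse
    implements Eq. (12); nsga2_generation performs one NSGA-II generation
    with the supplied fitness function.
    """
    for gen in range(1, max_gen + 1):
        fitness, std = mfs_predict(population)
        # individual-based updating: re-evaluate individuals whose
        # domination status may flip because of the MFS prediction interval
        for x in pick_uncertain(population, fitness, std):
            mfs_update(x, hf_model(x))
        # generation-based updating: every k generations add the most
        # space-filling point to the HF sample set
        if gen % k == 0:
            x_div = pick_diverse(population)
            mfs_update(x_div, hf_model(x_div))
        population = nsga2_generation(population,
                                      lambda x: mfs_predict(x)[0])
    return population
```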

4 Examples and results

4.1 Numerical examples

In this section, three widely used numerical benchmarks (ZDT1, ZDT2, and ZDT3) with different degrees of complexity are used to illustrate the applicability and efficiency of the proposed AMFS-MOGA approach. In these three numerical examples, the original mathematical functions [53], described in Eqs. (13)–(15), are taken as the HF models. The LF models are assumed to be Taylor expansions of the HF models. To test the applicability of the proposed approach in different situations, it is assumed that a LF surrogate model needs to be constructed for ZDT3, while for ZDT1 and ZDT2 the LF models are used directly in the MFS without fitting a surrogate model. The settings of NSGA-II for these benchmarks are given in Table 3.

Table 3 The settings of NSGA-II in the numerical benchmarks

ZDT1

$$\begin{aligned} & {\text{minimize}}\quad f_{1} (x) = x_{1} \\ & \qquad \qquad \quad f_{2} (x) = g(x) \times h(x) \\ & {\text{where}}\quad g(x) = 1 + \frac{9}{n - 1}\sum\limits_{i = 2}^{n} {x_{i} } \\ & \qquad \quad\;\; h(x) = 1 - \sqrt {f_{1} (x)/g(x)} \\ & \qquad \quad\;\; n = 3 \\ & \qquad \quad\;\; 0 \le x_{i} \le 1,\quad i = 1, \ldots ,n. \end{aligned}$$
(13)

ZDT2

$$\begin{aligned} & {\text{minimize}}\quad f_{1} (x) = x_{1} \\ & \qquad \qquad \quad f_{2} (x) = g(x) \times h(x) \\ & {\text{where}}\quad g(x) = 1 + \frac{9}{n - 1}\sum\limits_{i = 2}^{n} {x_{i} } \\ & \qquad \quad\;\; h(x) = 1 - (f_{1} (x)/g(x))^{2} \\ & \qquad \quad\;\; n = 3 \\ & \qquad \quad\;\; 0 \le x_{i} \le 1,\quad i = 1, \ldots ,n. \end{aligned}$$
(14)

ZDT3

$$\begin{aligned} & {\text{minimize}}\quad f_{1} (x) = x_{1} \\ & \qquad \qquad \quad f_{2} (x) = g(x) \times h(x) \\ & {\text{where}}\quad g(x) = 1 + \frac{9}{n - 1}\sum\limits_{i = 2}^{n} {x_{i} } \\ & \qquad \quad\;\; h(x) = 1 - \sqrt {f_{1} (x)/g(x)} - (f_{1} (x)/g(x))\sin (10\pi f_{1} ) \\ & \qquad \quad\;\; n = 3 \\ & \qquad \quad\;\; 0 \le x_{i} \le 1,\quad i = 1, \ldots ,n. \end{aligned}$$
(15)
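The three HF benchmarks can be coded directly from Eqs. (13)–(15); a compact sketch with n = 3, as used in the experiments, is given below. (The Taylor-expansion LF models are not reproduced here, since their exact form is not stated in the text.)

```python
import numpy as np


def zdt(x, variant=1):
    """HF objectives of ZDT1-3 (Eqs. 13-15) for a design vector x, n = len(x)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    f1 = x[0]
    g = 1.0 + 9.0 / (n - 1) * np.sum(x[1:])
    if variant == 1:        # ZDT1, Eq. (13)
        h = 1.0 - np.sqrt(f1 / g)
    elif variant == 2:      # ZDT2, Eq. (14)
        h = 1.0 - (f1 / g) ** 2
    else:                   # ZDT3, Eq. (15)
        h = 1.0 - np.sqrt(f1 / g) - (f1 / g) * np.sin(10.0 * np.pi * f1)
    return np.array([f1, g * h])
```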

4.2 Quality metrics

To compare the ability of different approaches to obtain desirable Pareto frontiers and their efficiency, two metrics for measuring the quality of Pareto frontiers [54, 55], i.e., the relative hyperarea difference (RHD) and the overall spread (OS), as well as the number of HF function calls (FC) for measuring the efficiency, are calculated. The RHD and OS quantify the convergence and diversity of the obtained Pareto frontier, respectively. The smaller the value of RHD, the better the convergence of the Pareto frontier, while a larger value of OS indicates a more diverse Pareto frontier.

Figure 7 illustrates these two metrics geometrically for a 2D case. Let the current Pareto set be \(P = \left\{ {a,b,c,d} \right\}\), and let \(p_{\text{good}}\) and \(p_{\text{bad}}\) be the extreme “good” and “bad” points, respectively. The quantity RHD, shown in Fig. 7a, is defined as the relative difference between the area bounded by \(p_{\text{good}}\) and \(p_{\text{bad}}\) and the shaded area covered from \(p_{\text{bad}}\) to the current Pareto set \(P\). The quantity OS, shown in Fig. 7b, is defined as the ratio between the area bounded by the two extreme points \(a\) and \(d\) of the current Pareto set \(P\) and the area bounded by \(p_{\text{good}}\) and \(p_{\text{bad}}\).
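For the 2D minimization case of Fig. 7, the two metrics can be computed as sketched below. The hyperarea covered by the Pareto set is accumulated as a staircase of rectangles toward \(p_{\text{bad}}\), which is one common discretization and not necessarily the exact procedure of Refs. [54, 55]; the function name is ours.

```python
import numpy as np


def rhd_and_os(pareto, p_good, p_bad):
    """Relative hyperarea difference (RHD) and overall spread (OS) for a
    2-objective minimization problem.  `pareto` is an (n, 2) array of
    non-dominated objective vectors."""
    pareto = np.asarray(pareto, dtype=float)
    p_good = np.asarray(p_good, dtype=float)
    p_bad = np.asarray(p_bad, dtype=float)
    total_area = np.prod(p_bad - p_good)

    # hyperarea between the Pareto set and p_bad: sort by f1 and sum the
    # non-overlapping rectangles of the resulting staircase
    pts = pareto[np.argsort(pareto[:, 0])]
    covered = 0.0
    for i, (f1, f2) in enumerate(pts):
        next_f1 = pts[i + 1, 0] if i + 1 < len(pts) else p_bad[0]
        covered += (next_f1 - f1) * (p_bad[1] - f2)
    rhd = (total_area - covered) / total_area

    # OS: rectangle spanned by the two extreme Pareto points, relative to
    # the rectangle spanned by p_good and p_bad
    extreme_area = abs((pts[-1, 0] - pts[0, 0]) * (pts[0, 1] - pts[-1, 1]))
    os_metric = extreme_area / total_area
    return rhd, os_metric
```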

Fig. 7

Quality metrics a RHD and b OS [46]

4.3 Results and comparisons

For comparison, four other methods are considered: (1) MOGA with the LF model, (2) MOGA with the HF model, (3) the kriging surrogate model-assisted MOGA (K-MOGA) proposed by Li et al. [52], and (4) the multiplicative-scaling multi-fidelity surrogate model-assisted MOGA (MMFS-MOGA) proposed by Zhu et al. [56]. For each approach, 15 runs are conducted for all examples to account for the influence of randomness. Figure 8 shows the typical Pareto frontiers obtained from one of the 15 runs of these approaches.

Fig. 8

The obtained Pareto frontiers for numerical cases using different approaches

As illustrated in Fig. 8, the Pareto frontiers from the three surrogate model-assisted MOGAs, i.e., K-MOGA, MMFS-MOGA, and AMFS-MOGA, are consistent with that of the MOGA with the HF model, while only a small portion of the Pareto frontier of the MOGA with the LF model overlaps with it. This indicates that it is difficult to obtain the true Pareto frontier by incorporating only the LF model into the MOGA. It is worth mentioning that, compared with K-MOGA and MMFS-MOGA, the proposed AMFS-MOGA does not lose the boundary points that lie on the Pareto frontier of the MOGA with the HF model. This is attributed to the generation-based updating strategy developed in the AMFS-MOGA, which is very helpful for improving the dispersion of the population.

To further demonstrate the superiority of the proposed approach, the quality of convergence, the diversity of the Pareto optima, and the computational effort of the MOGA with the HF model are summarized in Table 4. The comparison results of the proposed AMFS-MOGA, K-MOGA, and MMFS-MOGA are summarized in Table 5, in which “FC” denotes the required HF function calls. Notice that one function call in Tables 4 and 5 refers to the calculation of the objectives and constraints together for a single individual.

Table 4 Quantity metrics of the NSGA-II with HF model for numerical cases
Table 5 Comparison of different approaches for numerical cases

As illustrated in Tables 4 and 5, the average values of RHD and OS from K-MOGA, MMFS-MOGA, and AMFS-MOGA are close to those of the MOGA with the HF model in all numerical examples. This means that these three surrogate model-assisted MOGA approaches obtain Pareto sets whose convergence and diversity are comparable to those of the MOGA with the HF model. Another observation is that, considering the standard deviation (STD) values of RHD and OS, K-MOGA and MMFS-MOGA perform worse than the proposed AMFS-MOGA approach.

Regarding the computational efficiency of these four approaches, the number of function calls for AMFS-MOGA is nearly 100 times smaller than that of the MOGA with the HF model. Meanwhile, the average number of function calls is reduced by 45–60% using AMFS-MOGA compared with K-MOGA, and by 30% for ZDT1 and ZDT2 compared with MMFS-MOGA. Figure 9 shows the HF function calls for ZDT2 in all 15 runs of each approach. As illustrated in Fig. 9, the proposed AMFS-MOGA requires at most 160 HF calls, about 100 times fewer than the minimum of \(1.9 \times 10^{4}\) required by the MOGA with the HF model. All individuals have to be evaluated by the HF model for their fitness values in the MOGA with the HF model, whereas only a small portion of them need to be analyzed with the HF model in AMFS-MOGA. It is worth mentioning that, although K-MOGA and MMFS-MOGA also do not need to analyze all individuals with the HF model during the evolution, they still require a larger number of function calls than AMFS-MOGA. This is because the proposed AMFS-MOGA makes full use not only of the uncertainty information from the MFS but also of the data from both LF and HF models for updating the MFS.

Fig. 9

No. of HF function evaluations versus run number for ZDT2

Figure 10 depicts the number of HF function calls in the first fifteen generations in one of the 15 runs for the different approaches. As can be seen in Fig. 10, K-MOGA needs to evaluate more individuals with the HF model than the proposed AMFS-MOGA to reduce the prediction uncertainty of the surrogate model in the early stage of the evolution process, while MMFS-MOGA needs to evaluate more individuals in the later generations. As a result, the total number of HF function calls in the proposed AMFS-MOGA is smaller than those of K-MOGA and MMFS-MOGA. Note that although the merits of AMFS-MOGA in computational efficiency are detailed here for ZDT2, similar results are obtained for ZDT1 and ZDT3.

Fig. 10

No. of HF function evaluations in the early evolution process for ZDT2

4.4 Engineering case

In this section, the developed approach is applied to the design optimization of a stiffened cylindrical shell with variable ribs. The structural profile of the stiffened cylindrical shell with variable ribs is shown in Fig. 11, and Fig. 12 depicts the schematic of the big and small ribs of the cylindrical shell. The objective of this problem is to minimize the weight and improve the stability of the stiffened cylindrical shell under the constraints of the relevant regulations. The design variables are the spacing of the ring-ribs l and the sizes of the webs and face panels of the large and small ribs. Other parameters are fixed during the optimization process. The ranges and values of the design variables and parameters are listed in Table 6, and the material properties are listed in Table 7.

Fig. 11

The structural profile of the stiffened cylindrical shell with variable ribs

Fig. 12

The schematic plot of the stiffened cylindrical shell

Table 6 Ranges and values of the design variables and parameters
Table 7 Material properties

Therefore, the optimization problem can be defined as,

$$\begin{aligned} & {\text{Minimize [}}M ,- p_{\text{cr2}} ]\\ & {\text{Subject to }}g_{1} = \frac{{\sigma_{1} }}{{0.85\sigma_{\text{S}} }} - 1 \le 0, \, g_{2} = \frac{{\sigma_{2} }}{{1.15\sigma_{\text{S}} }} - 1 \le 0, \\ & g_{3} = \frac{{\sigma_{3} }}{{0.60\sigma_{\text{S}} }} - 1 \le 0, \, \;g_{4} = 1 - \frac{{p_{\text{cr1}} }}{{p_{j} }} \le 0, \\ & g_{5} = 1 - \frac{{p_{\text{cr2}} }}{{1.2p_{j} }} \le 0,\;g_{6} = \frac{{h_{1} }}{{23t_{2} }} - 1 \le 0, \\ & g_{7} = \frac{{b_{1} }}{{6t_{1} }} - 1 \le 0,\;\;g_{8} = \frac{{h_{2} }}{{23t_{4} }} - 1 \le 0 \\ & g_{ 9} = \frac{{b_{2} }}{{6t_{3} }} - 1 \le 0, \\ \end{aligned}$$
(16)

where \(M\) is the total weight of the stiffened cylindrical shell, \(\sigma_{1}\) is the mid-plane circumferential stress of the shell, \(\sigma_{2}\) denotes the longitudinal stress of the outer face of the shell at the rib, \(\sigma_{3}\) is the rib stress, \(P_{\text{cr1}}\) represents the local buckling pressure, and \(P_{\text{cr2}}\) represents the global buckling pressure. Intuitively, as the mass increases, the load required to destabilize the structure increases.

In this work, two levels of fidelity, a LF empirical model and a HF simulation model, are used to obtain the objective and constraint values.

In the LF empirical model, \(\sigma_{1}\), \(\sigma_{ 2}\), \(\sigma_{ 3}\), \(P_{\text{cr1}}\), and \(P_{\text{cr2}}\) are computed by the following formulas

$$\sigma_{1} = \frac{{k_{1} p_{j} R}}{t},$$
(17)
$$\sigma_{ 2} = \frac{{k_{2} p_{j} R}}{t},$$
(18)
$$\sigma_{3} = \frac{{k_{3} p_{j} R}}{t},$$
(19)
$$\begin{aligned} & P_{\text{E1}} = E\left( {\frac{t}{R}} \right)^{2} \left[ {\frac{0.6}{(u - 0.37)}} \right]\;u \ge 1 \\ & P_{\text{E1}} = 1.21E\left( {\frac{t}{R}} \right)^{2} \;u \le 1 \\ & P_{\text{cr1}} = k_{4} P_{\text{E1}} , \\ \end{aligned}$$
(20)
$$\begin{aligned} & P_{\text{E2}} = \frac{E}{{n^{2} - 1 + 0.5\alpha^{2} }}\left[ {\frac{t}{R}\frac{{\alpha^{4} }}{{(\alpha^{2} + n^{2} )^{2} }} + \frac{{I(n^{2} - 1)^{2} }}{{R^{3} l}}} \right] \\ & P_{\text{cr2}} = k_{5} P_{\text{E2}} , \\ \end{aligned}$$
(21)

where \(k_{1} ,k_{2} ,k_{3} ,k_{4} ,k_{5} ,u,n,\alpha\) are coefficients determined according to the rules of the China Classification Society (CCS), and \(I\) is the moment of inertia of the rib.

The finite element analysis (FEA) model is taken as the HF model and is solved with the ANSYS 18.0 simulation tool. The boundary conditions are as follows: (1) all translational degrees of freedom at the right end are fixed; (2) all translational degrees of freedom except the axial one are fixed at the left end. Meanwhile, a load of 3 MPa is applied to the outer shell of the structure to simulate the pressure at a water depth of 300 m. Beam 188 elements are used to model the face panels of the ribs, and Shell 181 elements are used to model the shell and the webs of the ribs. More than 30,000 elements are used to ensure a desirable accuracy level of the simulation results. The FEA model and one simulation result are shown in Fig. 13.

Fig. 13

The FEA model and one simulation result of the UUV

Since running the HF FEA model is computationally expensive, the proposed AMFS-MOGA approach is used to solve this optimization problem. The settings of NSGA-II in this example are the same as those for ZDT1. The two quality metrics RHD and OS are again used to compare the Pareto frontiers obtained by the different approaches, with \(p_{\text{good}} = [1 \times 10^{4} , - 30]\) and \(p_{\text{bad}} = [2 \times 10^{5} , - 3]\) set in the objective function space. Figure 14 depicts the typical Pareto frontiers obtained by the four approaches, where the x-axis represents the mass \(M\) and the y-axis the global buckling pressure objective \(-p_{\text{cr2}}\), as defined in Eq. (16). As observed from Fig. 14, the Pareto frontier from the MOGA with the LF model is dominated by those from the proposed AMFS-MOGA and MMFS-MOGA. This is expected because relying only on the LF empirical model can result in an unreliable Pareto frontier. It is noted that K-MOGA cannot obtain a desirable Pareto frontier for this engineering case. A possible reason is that the relationships between the design variables and the objectives and constraints are highly nonlinear, which leads to a distorted kriging surrogate model for the objective functions and constraints under limited HF sample points. Therefore, only the comparison results of the quality of convergence, the diversity of the Pareto optima, and the computational effort for the proposed AMFS-MOGA and MMFS-MOGA are summarized in Table 8. Since direct optimization of this problem is not affordable, the accuracy of the final fronts is evaluated by comparing the predicted results with the actual simulation results at the obtained Pareto solutions. The comparison illustrates that the average relative errors of AMFS-MOGA and MMFS-MOGA at the obtained Pareto solutions are less than 6%, while, as observed from Table 8, the proposed AMFS-MOGA saves 12.5% of the simulation calls compared with MMFS-MOGA.

Fig. 14

The typical Pareto frontiers from different approaches for the engineering case

Table 8 Comparison of MMFS-MOGA and AMFS-MOGA for the engineering case

5 Conclusion

In this study, a two-stage adaptive MFS model-assisted MOGA is proposed, in which the information from models of different fidelity is integrated to improve the computational efficiency of the MOGA. In the first stage, the fitness values of the individuals are evaluated with the LF model or the LF surrogate model to obtain a preliminary Pareto frontier. In the second stage, an initial MFS model is constructed from the LF model data and the HF sample points selected from the preliminary Pareto set. This MFS model is then used for the fitness evaluations and is adaptively updated according to the developed individual-based and generation-based updating strategies. Numerical and engineering cases with different levels of complexity are tested to demonstrate the applicability and efficiency of the proposed AMFS-MOGA approach. The observations are summarized as follows: (1) relying only on a simplified HF model (i.e., the LF model) may result in an unreliable Pareto frontier, while using a single-fidelity HF model/surrogate model in MOGA is time-consuming or even computationally prohibitive; and (2) the proposed AMFS-MOGA approach can significantly reduce the number of evaluations of the expensive HF model while obtaining convergence and diversity of the Pareto frontier comparable to those obtained by the MOGA with the HF model.

As part of future work, the proposed AMFS-MOGA approach will be further tested on engineering design problems with higher dimensions. In addition, since practical engineering design problems always involve uncertainties, extending the AMFS-MOGA approach to robust optimization problems would also broaden its applicability.