1 Introduction

The design of modern engineering systems often relies on high-fidelity numerical simulations (e.g., finite element analysis) that are computationally expensive. In the optimum design of an engineering system, these simulations are performed many times, so the computational cost often becomes excessive and unaffordable. To reduce this cost, metamodels are used in the optimization as surrogates for the high-fidelity numerical simulations.

During the past decades, a large number of metamodeling methods have been proposed and are available in the literature. The commonly used metamodeling methods include polynomial regression (PR) (Box et al. 1978; Myers and Montgomery 2002), radial basis function (RBF) (Dyn et al. 1986; Buhmann 2003), Kriging (KRG) (Sacks et al. 1989; Simpson et al. 2001), Gaussian process (GP) (MacKay 1998; Rasmussen and Williams 2006), neural networks (Bishop 1995; Smith 1993), support vector regression (SVR) (Clarke et al. 2005; Gunn 1997) and multivariate adaptive regression splines (MARS) (Jerome 1990; Diana et al. 2015). Hou et al. (2007) compared PR models with different polynomial functions in the design optimization of hexagonal thin-walled structures and found that quadratic polynomials provided the best approximations to the specific energy absorption (SEA) and maximum peak load. Fang et al. (2005) compared PR and several RBF models on crashworthiness optimization problems and showed that the RBF models were more suitable than the PR models for modeling highly nonlinear responses. Song et al. (2013) carried out a comparative study on four commonly used metamodels, PR, RBF, KRG and SVR, for the design optimization of foam-filled tapered structures and found that no single model was the best for approximating all objective functions in the considered problems. Yin et al. (2011) employed PR, RBF, KRG, MARS and SVR models to approximate the SEA and peak crushing stress of aluminum honeycomb structures and found that the best metamodels differed for the two responses. Forrester and Keane (2009) reviewed the metamodeling methods used in surrogate-based optimization and suggested that the choice of surrogate should be based on the problem size, the expected complexity, and the cost of the analyses.
From the available studies, the general consensus is that no single metamodel is the most effective for all problems: different metamodels are suitable for fitting different functions, and each has its own advantages and drawbacks (Forrester and Keane 2009; Queipo et al. 2005; Wang and Shan 2007).

To improve the prediction accuracy of surrogate models, a number of studies have combined multiple metamodels into a single ensemble using a weighted-sum approach (Goel et al. 2007; Zerpa et al. 2005; Sanchez et al. 2008; Acar and Rais-Rohani 2009; Acar 2010; Acar 2015; Lee and Dong-Hoon 2014; Fang et al. 2017; Ferreira and Serpa 2016; Zhou and Jiang 2016; Zhou et al. 2011; Zhang et al. 2012). Since an RBF can accurately capture the nonlinear aspects of a response and a PR model can capture its overall trend, an ensemble of metamodels (EM) combining RBF and PR models may have better prediction ability than the stand-alone models. Gu et al. (2015) established an EM of PR, RBF, KRG and SVR for the responses of an occupant protection system and showed that the EM gave better predictions than all the individual metamodels.

In an EM, the weight factors have a significant effect on the prediction accuracy, and a number of methods exist for determining them, such as simple average weights (Goel et al. 2007), prediction-sum-of-squares-based average weights (Goel et al. 2007), optimized weights (Acar and Rais-Rohani 2009) and weights that vary with the location of the prediction point (Zerpa et al. 2005; Sanchez et al. 2008; Lee and Dong-Hoon 2014). Based on the method of determining the weight factors, EMs can be classified into two types: average EMs and pointwise EMs (Lee and Dong-Hoon 2014). An average EM has fixed weight factors over the entire design space, while a pointwise EM has weight factors that change with the location of the prediction point. The EMs proposed by Goel et al. (2007) and Acar and Rais-Rohani (2009) are average EMs, and those proposed by Zerpa et al. (2005), Sanchez et al. (2008) and Lee and Dong-Hoon (2014) are typical pointwise EMs. The weight factors of the stand-alone metamodels in an average EM are determined from their average accuracies over the entire design space, while those in a pointwise EM are determined from their accuracies at each prediction point. A pointwise EM was found to be more accurate than an average EM because its weight factors vary with the prediction point (Lee and Dong-Hoon 2014); however, it has a much higher computational cost, especially when used as the surrogate in design optimization.

In most cases, an EM is employed as the surrogate of a computationally expensive high-fidelity simulation (e.g., finite element analysis) in engineering design optimization (Hou et al. 2007; Fang et al. 2005; Song et al. 2013; Yin et al. 2011; Forrester and Keane 2009; Queipo et al. 2005; Wang and Shan 2007). Generally, the EM is invoked tens of thousands of times in the optimization process, so the computational time of a pointwise EM may be unacceptable in practical design optimization, and the average EM is relatively efficient. The average EM proposed by Acar and Rais-Rohani (2009) was found to be more accurate than the other existing average EMs. However, it has only one set of weight factors across the entire design space; this approach is often too "rigid" and may not give good predictions in some areas of the design space.

To deal with this issue and improve the accuracy of the EM proposed by Acar et al., a new EM with multiple regional optimized weight factors (EM-MROWF) was proposed and investigated in this study. In this new EM, the design space was divided into multiple subdomains, each of which had its own set of optimized weight factors. The optimized weight factors in each subdomain were determined by minimizing the error metric of the training points in that subdomain. The EM-MROWF technique was evaluated using ten benchmark problems and two engineering application problems. The results showed that the EM-MROWF had better prediction accuracy and robustness than the other two considered average EMs and the individual metamodels.

2 Ensemble of metamodels

Metamodels are widely used in simulation-based design optimization to reduce the computational cost of a large number of expensive simulations. The most commonly used ensemble method is the weighted average ensemble (Goel et al. 2007) given by

$$ {\widehat{y}}_{\mathrm{ens}}\left(\mathbf{x}\right)=\sum \limits_{i=1}^{n_{\mathrm{M}}}{w}_i{\widehat{y}}_i\left(\mathbf{x}\right) $$
(1)

where ŷens is the prediction by the EM, x is the vector of design variables, nM is the number of metamodels used in the ensemble, wi is the weight factor of the ith basis metamodel in the ensemble, and ŷi is the predicted value of the ith metamodel. To obtain an unbiased response estimation, the weight factors should satisfy

$$ \sum \limits_{i=1}^{n_{\mathrm{M}}}{w}_i=1 $$
(2)

A metamodel that is deemed more accurate should be assigned a larger weight factor, while a less accurate model should have less influence on the prediction. There are many possible strategies for determining the weight factors (Lee and Dong-Hoon 2014).
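As a concrete illustration of (1) and (2), the weighted-sum prediction can be sketched as follows. This is a minimal Python sketch; the `ensemble_predict` helper and the toy basis models standing in for PR, RBF and KRG are illustrative assumptions, not the authors' code.

```python
import numpy as np

def ensemble_predict(x, models, weights):
    """Weighted-sum ensemble of eq. (1): y_ens(x) = sum_i w_i * y_i(x).

    `models` is a list of fitted surrogates, each callable at a point x;
    the weights must sum to one (eq. (2)) for an unbiased estimate.
    """
    weights = np.asarray(weights, dtype=float)
    assert np.isclose(weights.sum(), 1.0), "weight factors must sum to 1"
    preds = np.array([m(x) for m in models])  # predictions of the basis metamodels
    return float(weights @ preds)

# toy basis metamodels standing in for the PR, RBF and KRG predictors
models = [lambda x: x**2, lambda x: x**2 + 0.2, lambda x: x**2 - 0.1]
print(ensemble_predict(2.0, models, [0.5, 0.3, 0.2]))  # ≈ 4.04
```

The assertion enforces the unbiasedness constraint of (2) before any prediction is made.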

Existing EMs fall mainly into two types: pointwise EMs (Zerpa et al. 2005; Sanchez et al. 2008; Lee and Dong-Hoon 2014) and average EMs (Goel et al. 2007; Acar and Rais-Rohani 2009). Generally, a pointwise EM is more accurate than an average EM, but it is also much more time-consuming. As a metamodel, an EM is often used as the surrogate of a numerical simulation (e.g., finite element analysis) in engineering design optimization, where it is invoked many times; with a pointwise EM, the computational time becomes excessive. To keep the numerical cost of the EM acceptable, only average EMs are considered in this study.

2.1 The EM proposed by Goel

Goel et al. (2007) proposed an average EM using the basis metamodels of PR, KRG and RBF, with the weight factors determined as:

$$ {w}_i=\frac{w_i^{\ast }}{\sum \limits_{i=1}^M{w}_i^{\ast }} $$
(3)
$$ {w}_i^{\ast }={\left({E}_i+\alpha {E}_{avg}\right)}^{\beta },\beta <0,\alpha <1 $$
(4)
$$ {E}_{avg}=\frac{\sum \limits_{i=1}^M{E}_i}{M} $$
(5)
$$ {E}_i={GMSE}_i=\frac{1}{ndes}\sum \limits_{k=1}^{ndes}{\left[y\left({\mathbf{x}}_k\right)-{{\widehat{y}}_i}^{\left(-k\right)}\left({\mathbf{x}}_k\right)\right]}^2 $$
(6)

where M is the number of individual metamodels, ndes is the number of design points, xk is the kth design point, y(xk) is the real response value at xk, and ŷi(−k)(xk) is the predicted response value at xk of the ith basis metamodel constructed using the ndes − 1 design points excluding the kth point. In this study, α = 0.05 and β = −1 were used, as Goel et al. suggested. In their method, the generalized mean square cross-validation error (GMSE) is used to calculate the prediction errors of the basis metamodels.
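The heuristic weights of (3)–(5) can be computed directly from the GMSE values of the basis metamodels. A minimal sketch, assuming the GMSE values are already available (the function name is illustrative):

```python
import numpy as np

def goel_weights(E, alpha=0.05, beta=-1.0):
    """Heuristic average weights of Goel et al. (2007), eqs. (3)-(5).

    E[i] is the GMSE of the i-th basis metamodel (eq. (6)); since beta < 0,
    more accurate models (smaller E) receive larger weights.
    """
    E = np.asarray(E, dtype=float)
    E_avg = E.mean()                       # eq. (5)
    w_star = (E + alpha * E_avg) ** beta   # eq. (4)
    return w_star / w_star.sum()           # eq. (3): normalize to sum to 1

# toy GMSE values for three basis metamodels
w = goel_weights([0.1, 0.2, 0.4])
print(w.round(3), float(w.sum()))  # weights decrease as GMSE grows; sum is 1
```

The α term guards against division blow-up when one model's GMSE is near zero.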

2.2 The EM proposed by Acar

An error-based minimization method (Acar and Rais-Rohani 2009; Acar 2010, 2015) is recognized as an effective method for determining the weight factors of the EM. Acar and Rais-Rohani (2009) proposed an ensemble of metamodels with optimized weight factors, in which they determined the weight factors w i by solving an optimization problem as

$$ \left\{\begin{array}{l}\operatorname{Minimize}\ \mathrm{Error}\left({\widehat{y}}_{\mathrm{ens}}\right)\\ {}\mathrm{s}.\mathrm{t}.\kern0.5em \sum \limits_{i=1}^{n_{\mathrm{M}}}{w}_i=1\end{array}\right. $$
(7)

where Error(ŷens) is the selected error metric that measures the accuracy of the ensemble-predicted response ŷens. In Acar and Rais-Rohani's study, GMSE was selected as the error metric (Acar 2015), so the Error(ŷens) of (7) is written as

$$ \mathrm{Error}\left({\widehat{y}}_{\mathrm{ens}}\right)={GMSE}_{\mathrm{ens}}=\frac{1}{ndes}\sum \limits_{k=1}^{ndes}{\left[y\left({\mathbf{x}}_k\right)-{\widehat{y}}_{\mathrm{ens}}^{\left(-k\right)}\left({\mathbf{x}}_k\right)\right]}^2 $$
(8)

where ndes is the number of design points, xk is the kth design point, y(xk) is the real response value at xk, and ŷens(−k)(xk) is the predicted response value at xk of the ensemble metamodel constructed using the ndes − 1 design points excluding the kth point.
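The constrained minimization of (7)–(8) can be sketched as follows. The paper uses MATLAB's fmincon (SQP); here SciPy's SLSQP is used as an equivalent stand-in, and the `acar_weights` name and the toy cross-validation data are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def acar_weights(cv_preds, y):
    """Optimized weights in the spirit of Acar and Rais-Rohani (2009), eqs. (7)-(8).

    cv_preds[i, k] is the leave-one-out prediction of the i-th basis
    metamodel at design point x_k; the ensemble GMSE of eq. (8) is
    minimized subject to sum(w) = 1.
    """
    cv_preds, y = np.asarray(cv_preds, float), np.asarray(y, float)
    n_m = cv_preds.shape[0]

    def gmse(w):                           # eq. (8)
        resid = y - w @ cv_preds
        return np.mean(resid ** 2)

    cons = {"type": "eq", "fun": lambda w: w.sum() - 1.0}
    res = minimize(gmse, np.full(n_m, 1.0 / n_m), method="SLSQP", constraints=cons)
    return res.x

# toy example: model 0 reproduces y exactly, model 1 carries a constant bias
y = np.array([1.0, 2.0, 3.0, 4.0])
cv = np.vstack([y, y + 0.5])
w = acar_weights(cv, y)
print(w.round(3))  # nearly all weight goes to the accurate model
```

Note that a single weight vector is found for the whole design space, which is exactly the "rigidity" the proposed EM-MROWF addresses.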

3 New proposed ensemble of metamodels

In this study, a new EM approach was investigated that divides the design space into multiple subdomains and uses a separate set of optimized weight factors for each subdomain. The basic idea was to allow each subdomain to have an independently determined set of weight factors so that it neither affects nor is affected by those of the other subdomains. With multiple regional optimized weight factors, the accuracy of the EM can be improved in each subdomain and thus across the entire design space.

The first step of constructing the EM with multiple regional optimized weight factors was to divide the design space into m = n1 × n2 × ⋯ × np subdomains or regions (see Fig. 1), where ni is the number of sections of the ith variable and p is the number of variables. In each region, one set of weight factors is obtained by minimizing the GMSE on the design points in that region. The flowchart of the process to establish the EM-MROWF is shown in Fig. 2 and a detailed description is given as follows.

  • STEP 1: Generate ndes design points using the optimal Latin hypercube sampling and ntest test points using the regular Latin hypercube sampling. The design points are used to create the individual metamodels (i.e., PR, RBF and KRG) and to calculate the weight factors of the ensemble of metamodels. The test points are used to calculate the root mean square error (RMSE) and maximum absolute relative error (MARE) of the ensemble of metamodels.

  • STEP 2: Divide the design space into m = n1 × n2 × ⋯ × np regions, as illustrated in Fig. 1.

  • STEP 3: Create the individual metamodels, i.e., PR, RBF and KRG, and construct the EM based on the design points. The functions and parameters of these metamodels are given in Table 1.

  • STEP 4: Calculate the error metric GMSE of the EM in each subdomain using the design points in that region.

  • STEP 5: Obtain the optimized weight factors for each region by minimizing the GMSE. In this study, the sequential quadratic programming (SQP) optimizer of MATLAB, the “fmincon” function, was used to solve the optimization problem of (7). A total of m = n1 × n2 × ⋯ × np sets of optimized weight factors were obtained, one for each region.

  • STEP 6: Obtain the weight factors on the regional boundaries by averaging the weight factors of the adjacent regions, as illustrated in Fig. 3. For example, if the weight factors of region A and region B are (w11, w12, w13) and (w21, w22, w23), respectively, the weight factors on their shared boundary are (w11/2 + w21/2, w12/2 + w22/2, w13/2 + w23/2).

  • STEP 7: Obtain one EM for the whole design space and calculate the RMSE and MARE of the test points. RMSE and MARE can be calculated as:

Fig. 1

Illustration of the divided regions in the design space: (a) two dimensions and (b) three dimensions

Fig. 2

The flowchart of the process of establishing the EM-MROWF

Table 1 User chosen functions and parameters of four individual metamodels (Yin et al. 2014)
Fig. 3

Illustration of the weight calculation for the adjacent regions

$$ RMSE=\sqrt{\frac{1}{ntest}\sum \limits_{i=1}^{ntest}{\left[y\left({\mathbf{x}}_i\right)-\widehat{y}\left({\mathbf{x}}_i\right)\right]}^2}, $$
(9)
$$ MARE=\max \left(\left|\frac{y\left({\mathbf{x}}_i\right)-\widehat{y}\left({\mathbf{x}}_i\right)}{y\left({\mathbf{x}}_i\right)}\right|\right), $$
(10)

where ntest is the number of test points, xi is the ith test point, y(xi) is the real response value at xi, and ŷ(xi) is the predicted response value of the ensemble metamodel at xi. Generally, RMSE and MARE are used to evaluate the global and local errors of the metamodel, respectively (Jin et al. 2001).

  • STEP 8: Repeat STEPs 2–7 with differently divided regions to obtain nmax^p EMs for the whole design space, with each ni varying from 1 to nmax. In this study, to save computational cost, nmax was chosen as four.

  • STEP 9: Find the best EM with minimum RMSE of the test points and set it as the final EM-MROWF.
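The steps above can be sketched for the prediction stage: mapping a prediction point to its subdomain (STEP 2) and averaging the optimized weights on a shared boundary (STEPs 5–6). This is a minimal Python sketch with hypothetical names (`regional_weights`, `w_table`); the weight table is assumed to have been produced by the per-region GMSE minimization.

```python
import numpy as np

def regional_weights(x, lower, upper, n_sec, w_table, tol=1e-9):
    """Return the optimized weights of the region containing x; on a shared
    interior boundary, average the weights of the adjacent regions (Fig. 3)."""
    x, lower, upper, n_sec = (np.asarray(a, float) for a in (x, lower, upper, n_sec))
    t = (x - lower) / (upper - lower) * n_sec          # position in "section" units
    base = np.clip(np.floor(t).astype(int), 0, n_sec.astype(int) - 1)
    regions = [base]
    for d in range(x.size):
        b = round(float(t[d]))
        if abs(t[d] - b) < tol and 0 < b < n_sec[d]:
            # the point lies on an interior boundary along dimension d:
            # also include the neighboring region on the other side
            extra = [r.copy() for r in regions]
            for r in extra:
                r[d] = b - 1
            regions += extra
    return np.mean([w_table[tuple(r)] for r in regions], axis=0)

# 1-D toy problem: [0, 4] split into two regions with pre-optimized weight sets
w_table = {(0,): np.array([0.7, 0.2, 0.1]), (1,): np.array([0.3, 0.5, 0.2])}
lo, up, ns = [0.0], [4.0], [2]
print(regional_weights([1.0], lo, up, ns, w_table))  # interior of region 0
print(regional_weights([2.0], lo, up, ns, w_table))  # boundary: averaged weights
```

On the boundary x = 2, the returned weights are the arithmetic mean of the two adjacent regions' weight sets, matching the boundary rule of STEP 6.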

4 Example problems

Ten mathematical benchmark problems and two engineering application problems were used to test the performance of the proposed EM-MROWF technique. The first engineering application was a cylindrical thin-walled column crushed by a dynamic axial load, and the second application was an airbag cushion impacted by a dummy head. For both application problems, the responses were obtained using nonlinear finite element (FE) simulations.

4.1 Benchmark problems

The ten benchmark problems (Acar 2015; Lee and Dong-Hoon 2014; Wikipedia 2015) used in this study are given as follows.

  (1) Branin-Hoo function

$$ y\left({x}_1,{x}_2\right)={\left({x}_2-\frac{5.1{x}_1^2}{4{\pi}^2}+\frac{5{x}_1}{\pi }-6\right)}^2+10\left(1-\frac{1}{8\pi}\right)\cos \left({x}_1\right)+10 $$
(11)

where x1∈[−5, 10] and x2∈[0, 15].
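For reference, (11) can be implemented directly; the function name is illustrative. The Branin-Hoo function is a standard surrogate-modeling benchmark with three global minima of value about 0.3979, one of which lies at (π, 2.275).

```python
import numpy as np

def branin_hoo(x1, x2):
    """Branin-Hoo test function of eq. (11); x1 in [-5, 10], x2 in [0, 15]."""
    return ((x2 - 5.1 * x1**2 / (4 * np.pi**2) + 5 * x1 / np.pi - 6) ** 2
            + 10 * (1 - 1 / (8 * np.pi)) * np.cos(x1) + 10)

print(round(float(branin_hoo(np.pi, 2.275)), 4))  # ≈ 0.3979 (a global minimum)
```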

  (2) Camelback function

$$ y\left({x}_1,{x}_2\right)=\left(4-2.1{x}_1^2+\frac{x_1^4}{3}\right){x}_1^2+{x}_1{x}_2+\left(-4+4{x}_2^2\right){x}_2^2 $$
(12)

where x1, x2∈[−2, 2].

  (3) Goldstein-price function

$$ {\displaystyle \begin{array}{l}y\left({x}_1,{x}_2\right)=\left[1+{\left({x}_1+{x}_2+1\right)}^2\times \left(19-4{x}_1+3{x}_1^2-14{x}_2+6{x}_1{x}_2+3{x}_2^2\right)\right]\\ {}\kern4.25em \times \left[30+{\left(2{x}_1-3{x}_2\right)}^2\times \left(18-32{x}_1+12{x}_1^2+48{x}_2-36{x}_1{x}_2+27{x}_2^2\right)\right]\end{array}} $$
(13)

where x1, x2∈[−2, 2].

  (4) 2D multi-modal function

$$ y\left({x}_1,{x}_2\right)={x}_1{x}_2\sin \left({x}_1\right)+\left({x}_1^2/10\right)+{x}_1-1.5{x}_2 $$
(14)

where x1, x2∈[−2, 2].

  (5) Haupt function

$$ y\left({x}_1,{x}_2\right)={x}_1\sin \left(4{x}_1\right)+1.1{x}_2\sin \left(2{x}_2\right) $$
(15)

where x1, x2∈[0, 4].

  (6) Crane function

$$ y\left({x}_1,{x}_2\right)={e}^{\cos \left({x}_1-{x}_2\right)}\sin \left\{\left[{\cos}^2\left({x}_1-{x}_2\right)+{x}_1+{x}_2\right]/\left[1+{\left({x}_1-{x}_2\right)}^2\right]\right\} $$
(16)

where x1, x2∈[−4, 4].

  (7) Peak function

$$ y\left({x}_1,{x}_2\right)=3\left(1-{x}_1^2\right){e}^{-{x}_1^2-{\left({x}_2+1\right)}^2}-10\left(\frac{x_1}{5}-{x}_1^3-{x}_2^5\right){e}^{-{x}_1^2-{x}_2^2}-\frac{1}{3}{e}^{-{\left({x}_1+1\right)}^2-{x}_2^2} $$
(17)

where x1, x2∈[−3, 3].

  (8) Waving function

$$ y\left({x}_1,{x}_2\right)=2+0.01{\left({x}_2-{x}_1^2\right)}^2+{\left(1-{x}_1\right)}^2+2\left(2-{x}_2^2\right)+7\sin \left(0.5{x}_1\right)\sin \left(0.7{x}_1{x}_2\right) $$
(18)

where x1, x2∈[0, 5].

  (9) Rastrigin function

$$ y\left({x}_1,{x}_2\right)={x}_1^2+{x}_2^2-\cos \left(18{x}_1\right)-\cos \left(18{x}_2\right) $$
(19)

where x1, x2∈[−1, 1].

  (10) Rosenbrock function

$$ y\left(\mathbf{x}\right)=\sum \limits_{i=1}^{n-1}\left[100{\left({x}_{i+1}-{x}_i^2\right)}^2+{\left({x}_i-1\right)}^2\right] $$
(20)

where x i ∈[−2, 2], i = 1, 2.
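Since (20) is also used in Test 2 with nvar ranging from 3 to 8, a dimension-independent implementation is useful; the function name is illustrative.

```python
import numpy as np

def rosenbrock(x):
    """Rosenbrock function of eq. (20) for any dimension n >= 2."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(100.0 * (x[1:] - x[:-1] ** 2) ** 2 + (x[:-1] - 1.0) ** 2))

print(rosenbrock([1.0, 1.0, 1.0]))  # 0.0 (global minimum at x = (1, ..., 1))
print(rosenbrock([0.0, 0.0]))       # 1.0
```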

4.2 Engineering application problems

4.2.1 Thin-walled column crushing problem

Thin-walled columns are widely used as energy absorbers in engineering applications. In this study, the FE model of a cylindrical thin-walled column, as shown in Fig. 4a, was created for LS-DYNA (Hallquist 1998a, b) and used to simulate axial crushing of the column under dynamic loading conditions. As an energy absorber, the energy absorption (EA), peak crushing force (PCF), and specific energy absorption (SEA) are three important indicators for evaluating the energy absorption capacity and efficiency. The EA is calculated by

$$ \mathrm{EA}={\int}_0^dF(x) dx $$
(21)

where d is the crushing distance and F is the crushing force. The SEA is defined as (Kim 2002)

$$ \mathrm{SEA}=\frac{\mathrm{EA}}{M} $$
(22)

where M is the mass of the structure.
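In practice, F(x) is available only as sampled force-displacement data from the FE simulation, so the integral of (21) is evaluated numerically. A minimal sketch using trapezoidal integration (the function names and the constant-force toy history are illustrative assumptions):

```python
import numpy as np

def energy_absorption(d, F):
    """EA of eq. (21): trapezoidal integration of the sampled crushing
    force F (N) over the crushing distance samples d (m)."""
    d, F = np.asarray(d, float), np.asarray(F, float)
    return float(np.sum(0.5 * (F[1:] + F[:-1]) * np.diff(d)))

def specific_energy_absorption(d, F, mass):
    """SEA of eq. (22): absorbed energy per unit structural mass (J/kg)."""
    return energy_absorption(d, F) / mass

# toy force-displacement history: constant 10 kN force over a 0.1 m crush
d = np.linspace(0.0, 0.1, 101)
F = np.full_like(d, 10e3)
print(energy_absorption(d, F))                     # 1000.0 J
print(specific_energy_absorption(d, F, mass=2.0))  # 500.0 J/kg
```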

Fig. 4

FE model and illustration of design variables of the cylindrical thin-walled column: (a) FE model and (b) illustration of design variables

The three performance indicators do not have explicit mathematical formulas, and thus metamodels needed to be established using FE simulation results. The design variables of the cylindrical thin-walled column were the radius of the cross section (R) and the wall thickness (t), as illustrated in Fig. 4b. In this study, the ranges of the design variables were chosen as [60 mm, 100 mm] for R and [1 mm, 3 mm] for t. Using numerical simulations, the three responses, EA, PCF and SEA, were obtained for the design points and test points, as given in Tables 2 and 3, respectively. The metamodels of the three responses are established in the next section.

Table 2 Responses of the design points for the thin-walled column crash problem
Table 3 Responses of the test points for the thin-walled column crash problem

4.2.2 Airbag cushion problem

The airbag is recognized as an effective cushioning device and is now widely used for occupant protection in vehicles. In this study, the FE model of a driver’s airbag, as shown in Fig. 5, was created in LS-DYNA (Hallquist 1998a, b) and used to simulate the impact between the driver’s head and the airbag. In a driver’s airbag system, the mass of the inflation gas and the vent area have a large effect on the cushioning behavior. To investigate this behavior, the peak acceleration apeak of the head was employed as the indicator, and the input mass of the inflation gas and the vent area were chosen as the design variables. In this work, the input mass of the inflation gas of a standard driver’s airbag (LS-PrePost Online Document 2012) was scaled by a parameter λ ranging from 0.5 to 1.5, and the vent area A ranged from 0 to 628 mm2 (i.e., the radii of the two vent holes of the airbag ranged from 0 to 10 mm). Using FE simulations, the responses apeak were obtained for the design points and test points, as given in Tables 4 and 5, respectively. The metamodel of the response is established in the next section.

Fig. 5

FE models of the airbag cushion system: (a) before airbag deployment and (b) after airbag deployment

Table 4 Responses of the design points for the airbag cushion problem
Table 5 Responses of the test points for the airbag cushion problem

5 Results and discussion

In this section, the EMs and the individual metamodels (i.e., PR, RBF and KRG) were constructed for the benchmark problems, the thin-walled column crash problem and the airbag cushion problem. To evaluate the prediction performance, RMSE and MARE values were calculated using the test points.

5.1 Benchmark problems

5.1.1 Test 1: Prediction performance with various number of design points

In order to compare the prediction performance of the EMs and the individual metamodels, the normalized RMSE and MARE values of these metamodels with different numbers of design points for the ten benchmark problems were calculated using (9) and (10), respectively. The numbers of design points and test points generated for these benchmark problems are summarized in Table 6. In this test, the number of design points ndes was set to 20, 30, 40, 50 and 60; the corresponding normalized RMSE results are listed in Tables 7, 8, 9, 10 and 11, respectively, and the normalized average RMSE values for the ten benchmark problems are summarized in Table 12. All the RMSE values listed in these tables were normalized by setting the best (the lowest) value to 1. In each table, the overall performance for all ten test problems was estimated using the average normalized RMSE value shown in the penultimate row; the smaller the average normalized RMSE, the better the overall performance of the metamodel. In addition, the standard deviations of the three kinds of EMs were calculated and given in the last row of each table; the smaller the standard deviation, the more robust the performance of the metamodel. The normalized MARE results for the cases of ndes = 20, 30, 40, 50 and 60 are given in Tables 13, 14, 15, 16 and 17, respectively, and the normalized average MARE values for the ten benchmark problems are summarized in Table 18.
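The error metrics of (9) and (10) and the per-table normalization described above can be sketched as follows (the helper names are illustrative):

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error, eq. (9): global accuracy over the test points."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def mare(y_true, y_pred):
    """Maximum absolute relative error, eq. (10): worst-case local accuracy."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.max(np.abs((y_true - y_pred) / y_true)))

def normalize_best_to_one(errors):
    """Table normalization: divide each error by the best (lowest) value."""
    errors = np.asarray(errors, float)
    return errors / errors.min()

y  = np.array([1.0, 2.0, 4.0])
yp = np.array([1.0, 2.5, 4.0])
print(rmse(y, yp))                              # sqrt(0.25/3)
print(mare(y, yp))                              # 0.25
print(normalize_best_to_one([0.2, 0.1, 0.4]))   # [2. 1. 4.]
```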

Table 6 Summary of the ndes, ntest and ntotal* for the benchmark problems
Table 7 Normalized RMSE values with ndes = 20 for Test 1
Table 8 Normalized RMSE values with ndes = 30 for Test 1
Table 9 Normalized RMSE values with ndes = 40 for Test 1
Table 10 Normalized RMSE values with ndes = 50 for Test 1
Table 11 Normalized RMSE values with ndes = 60 for Test 1
Table 12 Summarized normalized average RMSE values for all benchmark problems
Table 13 Normalized MARE values with ndes = 20 for Test 1
Table 14 Normalized MARE values with ndes = 30 for Test 1
Table 15 Normalized MARE values with ndes = 40 for Test 1
Table 16 Normalized MARE values with ndes = 50 for Test 1
Table 17 Normalized MARE values with ndes = 60 for Test 1
Table 18 Summarized normalized average MARE values for all benchmark problems

From Tables 7 and 13, it can be found that, for the case of ndes = 20, the average value and the standard deviation of EM-MROWF were both the second-smallest for the normalized RMSE and MARE. From Tables 8 and 14, 9 and 15, 10 and 16, and 11 and 17, the average value and the standard deviation of EM-MROWF for the cases of ndes = 30, 40, 50 and 60 were both the smallest for the normalized RMSE and MARE. From the summarized results in Tables 12 and 18, EM-MROWF performed best in 4 of the 5 test cases over all ten benchmark problems, and its average value and standard deviation were both the smallest for the normalized RMSE and MARE. Overall, the comparison results of Test 1 indicate that the proposed EM-MROWF was the most accurate and robust metamodel for the ten benchmark problems.

It should be pointed out that the EM-MROWF is not strictly continuous across adjacent regions; instead, a transition is provided in this study by setting the weight factors on each regional boundary to the average of the weight factors of the two adjacent regions, which makes the transition relatively smooth. Figure 6 shows the plots of the real functions and the response surfaces of the proposed EMs for the benchmark functions (with ndes chosen as the value at which the metamodel was sufficiently accurate). In each figure, the response surface of the EM-MROWF agrees well with the real function and the transition between adjacent regions is smooth. Thus, the treatment of adjacent regions in the EM-MROWF performed well.

Fig. 6

Plots of benchmark functions: (a) Branin-Hoo, (b) Camelback, (c) Goldstein-price, (d) 2-D Multi-modal, (e) Rosenbrock, (f) Haupt, (g) Crane, (h) Peak, (i) Waving, (j) Rastrigin

5.1.2 Test 2: Prediction performance with various number of design variables

In order to compare the prediction performance of the EMs described in Sections 2 and 3 as nvar varied, the normalized RMSE and MARE values of the EMs and the individual metamodels for the Rosenbrock function (20) with different nvar were calculated using (9) and (10). In this test, nvar was set to 3, 4, 5, 6, 7 and 8, and the number of design points ndes was set to 30nvar. The results of Test 2 are summarized in Tables 19 and 20; all the RMSE and MARE values listed there were normalized by setting the best (the lowest) value to one. From Tables 19 and 20, it can be seen that the proposed EM-MROWF was the most accurate and robust metamodel in Test 2.

Table 19 Normalized RMSE values for Test 2
Table 20 Normalized MARE values for Test 2

5.2 Engineering application problems

Tables 21 and 22 summarize the normalized RMSE and MARE values of the EMs and the individual metamodels for the three responses (SEA, EA and PCF) of the thin-walled column crash problem and the one response (the peak head acceleration apeak) of the airbag cushion problem. From Table 21, EM-MROWF had the best prediction performance in all 4 cases, and from Table 22, it had the best prediction performance in 3 of the 4 cases. Based on the average normalized RMSE and MARE shown in the penultimate rows of Tables 21 and 22, the overall prediction performance of EM-MROWF was the best for the two engineering application problems. As shown in the last rows of Tables 21 and 22, the standard deviation of EM-MROWF was the smallest, indicating that it was the most robust of the investigated metamodels. Therefore, based on the comparison results of the two engineering application problems, EM-MROWF was the best in both accuracy and robustness.

Table 21 Normalized RMSE values for the engineering application problems
Table 22 Normalized MARE values for the engineering application problems

In summary, EM-MROWF was the most accurate and robust EM among the three average EMs and the three individual metamodels for both the benchmark problems and the engineering application problems.

6 Conclusions

In this study, a new ensemble of metamodels (EM), i.e., the EM with multiple regional optimized weight factors (EM-MROWF), was proposed and evaluated. This new EM differs from the commonly used EMs in that it divides the design space into subdomains and assigns each subdomain its own set of optimized weight factors. In each of the divided regions, the set of optimized weight factors is determined by minimizing the GMSE in that region. Ten benchmark problems and two engineering application problems (i.e., a thin-walled column crash problem and an airbag cushion problem) were employed to evaluate the prediction performance of the EM-MROWF.

To show the usefulness of the EM-MROWF, its prediction performance was compared to those of two existing EMs and the individual metamodels. All the comparison results for the ten benchmark problems and two engineering application problems showed that EM-MROWF had the best prediction performance in both accuracy and robustness. Thus, EM-MROWF is an effective metamodeling method and can be used in practical engineering design.