Multi-objective optimization of process parameters in plastic injection molding using a differential sensitivity fusion method

Zhou, Huifang; Zhang, Shuyou; Wang, Zili

doi:10.1007/s00170-021-06762-8

ORIGINAL ARTICLE
Published: 16 March 2021

Volume 114, pages 423–449, (2021)

The International Journal of Advanced Manufacturing Technology

Abstract

Product quality, productivity, and cost are the main considerations when making a manufacturing plan in plastic injection molding (PIM). The process parameters in PIM play a crucial role in determining all three, and these three properties conflict with one another. Therefore, it is difficult to quickly and accurately obtain a process parameter setting that meets the product quality requirement under the premise of acceptable productivity and cost. In this paper, a differential sensitivity fusion method (DSFM) is proposed to perform the multi-objective optimization of process parameters in PIM for product quality and productivity improvement and cost saving. The method integrates a sampling strategy, numerical simulation, a metamodeling method, and a multi-objective optimization algorithm. The sampling strategy is utilized to generate sampling points from the design space at different parameter levels. For the sampling points, the numerical simulation is implemented to calculate the objective responses. Based on the sampling points and their corresponding responses, the metamodeling method is applied to construct the response predictors, which calculate the objective responses for any point in the global design space. The multi-objective optimization algorithm is executed to locate the Pareto-optimal solutions, where the response predictors are taken as the fitness functions. An automobile front bumper is taken as the case study to verify the proposed method. The numerical results demonstrate that the proposed metamodeling method has better prediction accuracy and performance than some classical methods (e.g., response surface model, Kriging) and that the multiple objectives cannot reach their optima simultaneously. Moreover, the trade-off analysis identifies the better solution for decision-making, which helps to quickly and effectively select the optimal process parameter setting.

1 Introduction

Plastic injection molding (PIM) is widely used to manufacture a wide variety of plastic products owing to its high productivity, low cost, and good flexibility to various complex geometries. PIM is a nonlinear coupled system with multiple inputs and multiple outputs. The outputs include product quality, manufacturing cost, and molding efficiency, whereas the inputs include machine characteristics, mold design pattern, process parameters, polymer material characteristics, and product geometric characteristics. The machine, mold, material, and product are determined before manufacturing, which means that the process parameters should be carefully set to avoid or reduce quality defects, improve productivity, and decrease energy consumption and cost.

To reduce product defects (e.g., warpage, shrinkage, weldline), processing conditions, material properties, product design, and mold design have been studied by many researchers. Ozcelik and Sonat [1] used the Taguchi method to study the effect of process parameters on the warpage of parts with different thicknesses. Oktem et al. [2] conducted a series of experiments and used the signal-to-noise (S/N) ratio method and analysis of variance (ANOVA) to study the effects of process parameters on warpage and shrinkage of products. Tang et al. [3] performed warpage experiments on a thin plate to study the impact factors of the warpage problem. Kurt et al. [4] investigated the influence of the cavity pressure and mold surface temperature on the quality of the final parts. Masato et al. [5] concluded that the shrinkage of thin-wall parts was caused by fiber orientation and could be reduced by higher melt temperature and packing pressure. Wang et al. [6] investigated the structural carbon emissions and injection molding process carbon emissions and implemented multi-objective optimization to realize a low-carbon design for an injection molding machine.

Different from the constant process parameter optimization, the variable process parameter profile (e.g., packing pressure profile) has been considered to improve product quality and efficiency. Li et al. [7] performed the optimization of variable packing pressure profile for the shrinkage evenness of a slab. Gao and Wang [8] applied the sequential approximate optimization (SAO) to determine the optimal packing profile and the optimal process parameters for warpage minimization. For minimizing warpage and cycle time, Kitayama et al. [9] investigated the multi-objective optimization of variable pressure profile and process parameters. Hashimoto et al. [10] applied SAO to realize the simultaneous optimization of variable injection velocity profile and process parameters for minimizing weldline and cycle time.

Due to the advance of computer technology, computer-aided simulation software (e.g., Moldflow, Moldex3D) coupled with design optimization is recognized as an alternative approach for determining the optimal process parameters. The numerical simulation in PIM is generally so computationally intensive that the surrogate-based or metamodeling approach, which establishes a mathematical relationship between the process parameters and the optimization objectives, is an effective way to determine the optimal process parameters with a small number of simulations. Ozcelik and Erzurumlu [11] integrated finite element analysis (FEA), statistical design of experiments (DOE) methods, the response surface model (RSM), and the genetic algorithm (GA) to minimize the warpage of thin-walled parts. Kurtaran et al. [12] integrated FEA, statistical DOE methods, an artificial neural network (ANN), and GA to minimize the warpage of automotive ceiling lamps. Gao and Wang [13] adopted the Kriging model for determining the optimal process parameters to minimize the warpage of a cellular phone cover. Li et al. [7] used the radial basis function (RBF) and the expected improvement (EI) to optimize the process parameters for achieving uniform shrinkage. Xia et al. [14] adopted the Gaussian process (GP) model for the warpage optimization of a front grille. The above studies handle single-objective optimization, but there are generally many objectives to optimize for high product quality and productivity in PIM. Chen and Kurniawan [15] proposed a two-stage multi-objective optimization system. Zhao et al. [16] proposed a two-stage optimization system to optimize the warpage, shrinkage, and sink marks of injection molded parts simultaneously. Zhao and Cheng [17] proposed a hybrid multi-objective optimization system to simultaneously optimize the warpage and cycle time of the PIM process. Cheng et al. [18] developed a novel method to find the optimal solution set of constrained multi-objective optimization problems that integrated variable complexity methods (VCMs), the constrained non-dominated sorting genetic algorithm (CNSGA), a back-propagation neural network (BPNN), and Moldflow analysis. Liu et al. [19] proposed a multi-objective optimization method for process parameters of PIM with the haze ratio (HR) and peak valley 20 (PV20) of an optical lens as the optimization targets. Xu et al. [20, 21] performed the process parameter optimization for minimizing the weight, the flash, and the volume shrinkage of a thin-walled plastic product.

In particular, sequential approximate optimization (SAO), in which the surrogate model is repeatedly constructed and optimized by adding several new sampling points, has gradually become a popular approach to improve the optimization result. The general framework of process parameter optimization in PIM using SAO is summarized in [22]. Several representative papers using the SAO approach are briefly reviewed here. Gao and Wang [8, 13] adopted the modified rectangular grid (MRG) sampling strategy and the expected improvement (EI) sampling criterion, respectively, to improve the accuracy of the Kriging model. Xia et al. [14] used an enhanced probability of improvement criterion to find the direction of adding training samples and optimize the surrogate model. Shi et al. [23] used a parametric sampling evaluation (PSE) strategy to improve the accuracy of the ANN model and speed up the convergence of the optimization process to the global optimum. Deng et al. [24] used the mode-pursuing sampling (MPS) strategy to obtain new sample points, thereby improving the accuracy of the Kriging model.

In addition to the optimization of process parameters, the conformal cooling channel is applied to improve the product quality and efficiency of PIM. Dimla et al. [25] reported that the conformal cooling channel could drastically reduce cycle time. Au and Yu [26] designed various scaffold cooling channels, evaluated their cooling performance, and found that the conformal cooling channel could offer a more uniform thermal distribution. Wang et al. [27] introduced an approach to generate conformal spiral cooling channels, which helped improve the uniformity of mold cooling. Kitayama et al. [10, 28–30] investigated the cooling performance of conventional straight-type cooling channels and conformal cooling channels numerically and experimentally, where the process parameter optimization for warpage, cycle time, weldline, and clamping force reduction was performed.

Here, the motivation for this paper is summarized as follows:

1. Multi-objective optimization of process parameters in PIM is a crucial issue. Several process parameters are optimized for warpage and weldline reduction, and clamping force and cycle time minimization.
2. Weldlines are one of the major defects and cannot be completely eliminated. A low weldline temperature causes quick solidification, which generates long weldlines. The minimum weldline temperature is therefore maximized for weldline reduction.
3. The sensitivity information of sampling points is used to improve the prediction accuracy of the response predictor. Therefore, the performance of multi-objective optimization can be improved.
4. In general, warpage, weldline, clamping force, and cycle time cannot reach their optima at the same time. Therefore, the trade-off analysis is implemented to make the decision for the multi-objective optimization.

To realize the high product quality, high productivity, and low energy consumption and cost in PIM, this paper proposes a differential sensitivity fusion method (DSFM) to perform the multi-objective optimization of process parameters in PIM for minimizing warpage, weldlines, clamping force, and cycle time. It integrates the sampling strategy, numerical simulation, metamodeling method, and multi-objective optimization algorithm. The sampling strategy, Latin hypercube sampling (LHS), is utilized to generate stratified and uniformly distributed sampling points from the design space. For sampling points, the numerical simulation based on Moldflow is implemented to calculate the responses. Based on the sampling points and their corresponding responses, the metamodeling method is applied to construct the response predictors to calculate the responses for any sampling point in the global design space. The gradient-enhanced response surface model (GERSM) combined with the moving least-squares method (MLSM) is applied to construct the response predictor for each objective, which simultaneously utilizes the response and sensitivity information of the sampling point to improve the accuracy of the response predictors. For the capture of the sensitivity information, an adaptive sensitivity generation method (ASGM) is proposed to calculate the gradient vector for each design variable of the sampling point. The multi-objective optimization algorithm, non-dominated sorting genetic algorithm-III (NSGA-III), is executed to locate the Pareto-optimal solutions, where the response predictors are taken as the fitness functions. The trade-off analysis based on the spider-web chart is applied to make the decision for the optimal process parameters. Moreover, for the structural features of the product, the valve hot runner system is applied to reduce the weldlines and improve the product surface quality. 
The automobile front bumper is taken as a case study to verify the proposed method DSFM. The numerical results show that the proposed method can help the manufacturers to quickly and accurately select the optimal process parameters setting.

The remainder of this paper is organized as follows. In Section 2, the multi-objective optimization problem and the trade-off analysis based on the spider-web chart are described. In Section 3, the optimization methodologies and the detailed process for realizing the multi-objective optimization of the process parameters in PIM are described. To verify the proposed method, the automobile front bumper is taken as the case study and the numerical results are shown in Section 4. Finally, the conclusion is drawn in Section 5.

2 Overview of multi-objective optimization problem

2.1 Multi-objective optimization model

The goal of the injection molding process parameters (IMPP) optimization is to improve the product quality and productivity, while reducing cost. It is a typical multi-objective optimization problem. A multi-objective design optimization problem can be generally formulated as follows:

$$ \begin{array}{ll}\mathrm{find}: & \boldsymbol{x}={\left[{x}_1,{x}_2,\cdots, {x}_D\right]}^T,\\ \mathrm{minimize}: & F\left(\boldsymbol{x}\right)=\left({f}_1\left(\boldsymbol{x}\right),{f}_2\left(\boldsymbol{x}\right),\cdots, {f}_K\left(\boldsymbol{x}\right)\right),\\ \mathrm{subject}\ \mathrm{to}: & {x}_i^L\le {x}_i\le {x}_i^H,\quad i=1,2,\cdots, D\end{array} $$

(1)

where x is the column vector of the design variables, and x_i (i = 1, 2, ⋯, D) represents the i-th design variable; D is the number of the design variables; f_j(x) (j = 1, 2, ⋯, K) is the j-th objective function to be minimized, and K is the number of the objective functions; $ {x}_i^L $ and $ {x}_i^H $ are the lower and upper bounds of the i-th design variable, respectively.

2.2 Objective functions

Warpage is one of the major defects in PIM. It causes the actual dimensions of the product to deviate from the design requirement, which should be minimized for high product quality. The first objective function f₁(x) is taken as the warpage.

Weldlines influence both the appearance and the strength of the product, so it is important to reduce them. A weldline forms when two or more flow fronts meet. At a low weldline temperature the melted plastic solidifies quickly, which generates long weldlines. The weldline temperature is therefore one of the important factors for weldline reduction [29, 30], and the minimum weldline temperature (MinT_weld) needs to be maximized. Accordingly, the negative of the minimum weldline temperature is taken as the second objective function f₂(x) and is minimized.

Energy consumption and cost are also important issues in PIM. When the melted plastic is injected into the cavity, an opposing pressure is generated, and a clamping force must be applied to keep the mold closed. A small clamping force reduces the energy consumption and saves cost. Therefore, the clamping force is taken as the third objective function f₃(x) for energy consumption reduction.

Cycle time directly influences the molding efficiency and should be minimized for high productivity, so it is taken as the fourth objective function f₄(x). The sum of the injection time, the packing time, and the cooling time can be evaluated as the cycle time, which is expressed as:

$$ {f}_4\left(\boldsymbol{x}\right)={t}_{inj}+{t}_p+{t}_c, $$

(2)

where t_inj is the injection time; t_p is the packing time; t_c is the cooling time.

2.3 Design variables

The melt temperature (T_melt), the mold temperature (T_mold), the injection time (t_inj), the packing pressure, the packing time, and the cooling time (t_c) are taken as the design variables. To improve the dimensional accuracy of the product and the consistency of its shrinkage, a variable packing pressure profile is applied instead of a constant packing pressure, as shown in Fig. 1.

The packing pressure profile consists of four parameters (packing pressures P_p1, P_p2 and packing times t_p1, t_p2 at points A and B in Fig. 1). Therefore, the design variables are x = [T_melt, T_mold, t_inj, P_p1, P_p2, t_p1, t_p2, t_c]^T. The lower and upper bounds of the design variables are shown in Table 1; they are determined from the recommended values in Moldflow and the manufacturer's recommendation.

Table 1 Design variables and their lower/upper bounds

2.4 Trade-off analysis

For the multi-objective optimization problem expressed in Eq. (1), we want to minimize all the objectives simultaneously. Because of the contradiction between the objectives and their possible incommensurability, it is impossible to find a solution to ensure that all the objectives are simultaneously optimal. Thus, there is no single optimal solution but rather a set of compromise solutions named Pareto-optimal solutions or non-dominated solutions to such an optimization problem with multiple conflicting objectives.
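The notion of a non-dominated solution can be illustrated with a minimal sketch, assuming all objectives are minimized as in Eq. (1). The function names and the candidate values below are hypothetical and serve only as an illustration; they are not results from the case study, and the brute-force filter shown here is not the NSGA-III procedure.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if objective vector a dominates b (all objectives minimized)."""
    no_worse = all(ai <= bi for ai, bi in zip(a, b))
    strictly_better = any(ai < bi for ai, bi in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Sequence[float]]) -> List[Sequence[float]]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical two-objective values (illustration only, not case-study data).
candidates = [(1.0, 9.0), (2.0, 7.0), (3.0, 8.0), (4.0, 4.0), (5.0, 5.0)]
front = pareto_front(candidates)
print(front)
```

In this example, (3.0, 8.0) is dominated by (2.0, 7.0) and (5.0, 5.0) is dominated by (4.0, 4.0), so three compromise solutions remain.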

After the Pareto-optimal solution set has been generated, the decision-maker should perform the trade-off analysis to select the most preferred solution, or a few solutions, among the alternatives for producing the product. The spider-web chart (radar chart) makes it possible to visually understand the trade-off among more than three objectives, and is one of the useful tools for the trade-off analysis [31]. An illustrative example of a spider-web chart for visualizing the Pareto-optimal solutions is shown in Fig. 2, in which four objective functions to be minimized are handled. Each apex of a polygon in Fig. 2 represents one objective. The outermost polygon shows the nadir solution, the innermost polygon represents the ideal solution, and the middle polygons present the alternatives.

Because of the difference of value range between the objectives, each apex is normalized using Eq. (3) to draw the spider-web chart.

$$ \overline{f_j}\left({\boldsymbol{x}}_p^{(i)}\right)=\frac{f_j\left({\boldsymbol{x}}_p^{(i)}\right)-{f}_j^I}{f_j^N-{f}_j^I},\quad i=1,2,\cdots, {N}_{alt},\quad j=1,2,\cdots, K $$

(3)

where $ {\boldsymbol{x}}_p^{(i)} $ denotes the i-th alternative (Pareto-optimal solution); N_alt is the number of the Pareto-optimal solutions; $ {f}_j\left({\boldsymbol{x}}_p^{(i)}\right) $ and $ \overline{f_j}\left({\boldsymbol{x}}_p^{(i)}\right) $ represent the unnormalized and normalized value of the j-th objective of $ {\boldsymbol{x}}_p^{(i)} $, respectively; K is the number of the objectives; $ {f}_j^I $ and $ {f}_j^N $ denote the ideal and nadir value of the j-th objective.

The area of each polygon in the spider-web chart is used to compare the alternatives. The Pareto-optimal solutions minimizing and maximizing the polygon area in the spider-web chart are taken as the better solution and the worse solution, respectively.
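The normalization in Eq. (3) and the area comparison can be sketched as follows. Two assumptions are made that the text does not state: the ideal and nadir values are taken as the minimum and maximum of each objective over the Pareto set, and the K axes of the chart are equally spaced, so that the polygon area is ½·sin(2π/K)·Σ r_j r_{j+1}. With this formula the area depends on the order of the axes. The objective values are hypothetical.

```python
import math
from typing import List, Sequence


def normalize(front: List[Sequence[float]]) -> List[List[float]]:
    """Eq. (3): scale each objective to [0, 1] between its ideal and nadir value.

    Assumes ideal/nadir are the min/max over the Pareto set and that they differ.
    """
    k = len(front[0])
    ideal = [min(p[j] for p in front) for j in range(k)]
    nadir = [max(p[j] for p in front) for j in range(k)]
    return [[(p[j] - ideal[j]) / (nadir[j] - ideal[j]) for j in range(k)]
            for p in front]


def polygon_area(radii: Sequence[float]) -> float:
    """Area of a radar-chart polygon with equally spaced axes (assumed layout)."""
    k = len(radii)
    pair_sum = sum(radii[j] * radii[(j + 1) % k] for j in range(k))
    return 0.5 * math.sin(2.0 * math.pi / k) * pair_sum


# Hypothetical four-objective alternatives (illustration only).
alternatives = [
    (2.0, 15.0, 300.0, 40.0),
    (4.0, 10.0, 200.0, 50.0),
    (3.0, 20.0, 100.0, 30.0),
]
areas = [polygon_area(r) for r in normalize(alternatives)]
better = areas.index(min(areas))  # alternative with the smallest polygon
print(areas, better)
```

Under these assumptions the third alternative encloses the smallest area and would be taken as the better solution.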

3 Optimization methodologies

3.1 Multi-objective optimization process

The method, DSFM, is proposed to realize the process parameter optimization in PIM for warpage and weldlines reduction and clamping force and cycle time minimization, which integrates the sampling strategy, numerical simulation, metamodeling method, and multi-objective optimization algorithm. The sampling strategy is utilized to generate sampling points from the design space at different parameter levels. For the sampling points, the numerical simulation based on Moldflow is implemented to calculate the responses (warpage, minimum weldline temperature, clamping force, and cycle time). Based on the sampling points and their corresponding response, the metamodeling method is applied to approximately represent the mathematical relationship between the design variables (process parameters) and the responses, which constructs the response predictors to calculate the responses for any sampling point in the global design space. The multi-objective optimization algorithm, NSGA-III, is executed to locate the Pareto-optimal solutions, where the response predictors are taken as the fitness functions. The flowchart of our proposed method to realize multi-objective process parameter optimization in PIM is shown in Fig. 3.

As shown in Fig. 3, the main steps of the proposed method include the acquisition and processing of the sampling points, the response predictor modeling, and the multi-objective optimization. The specific process can be described as:

Step 1: Identify the responses as the objectives and the process parameters related to the selected responses as the design variables of the optimization problem. Determine the value range of each design variable as the constraints of the optimization problem.
Step 2: Implement the sampling strategy, LHS, to generate stratified and uniformly distributed sampling points in the global design space, and execute the numerical simulation in Moldflow to obtain the responses for each sampling point.
Step 3: Apply the sensitivity analysis among the design variables and the responses to identify the most important design variables that have a significant influence on the selected objectives.
Step 4: Response predictor modeling based on the metamodeling method. Taking the sampling points obtained in Step 2 as the training set (displayed in Tables 11 and 12 in the Appendix), the proposed method, ASGM, is implemented to calculate the gradient vector for each design variable of each sampling point in the training set as the sensitivity information of the training set. Then, based on the response and sensitivity information of the sampling points, the GERSM combined with the MLSM is constructed as the response predictor for each selected objective.
Step 5: Multi-objective process parameter optimization based on the NSGA-III algorithm. Set the initial parameters of the NSGA-III algorithm, and then take the response predictors constructed in Step 4 as the fitness functions to perform multi-objective global optimization and locate the Pareto-optimal solution set. For the Pareto-optimal solutions, the trade-off analysis is implemented to locate a better solution.
Step 6: Organize the confirmation experiments (as shown in Tables 13 and 14 in the Appendix) to verify the effectiveness of the proposed method.
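The sampling in Step 2 can be illustrated with a minimal Latin hypercube sketch that uses only the Python standard library. Each variable range is split into n equal strata, one random value is drawn per stratum, and the strata are shuffled independently per variable. The function name, the seed, and the two bounds are placeholders chosen for illustration; they are not the values of Table 1, and the sketch is not necessarily the LHS variant used in the study.

```python
import random
from typing import List, Sequence, Tuple


def latin_hypercube(n: int, bounds: Sequence[Tuple[float, float]],
                    seed: int = 0) -> List[List[float]]:
    """Generate n sampling points with one point per stratum in every dimension."""
    rng = random.Random(seed)
    columns = []
    for low, high in bounds:
        # One random value inside each of the n equal-width strata of [0, 1).
        strata = [(k + rng.random()) / n for k in range(n)]
        rng.shuffle(strata)  # decouple the strata order between dimensions
        columns.append([low + u * (high - low) for u in strata])
    # Transpose the per-variable columns into n sampling points.
    return [list(point) for point in zip(*columns)]


# Placeholder bounds for two variables (hypothetical, not from Table 1).
bounds = [(200.0, 240.0), (30.0, 60.0)]
points = latin_hypercube(10, bounds, seed=1)
for p in points:
    print(p)
```

Each generated point would then be passed to the numerical simulation to obtain its responses.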

3.2 Response predictor modeling

3.2.1 GERSM metamodeling

Compared with the traditional response surface model (RSM), the GERSM utilizes not only the response information but also the sensitivity information of the sampling points to construct the response surface [32]. The total error consists of response error and gradient error. In addition, the MLSM is a local approximation method, which is essentially a weighted least-squares method, and it uses weighting factors and local approximation to improve the accuracy of least-squares fitting when constructing a surrogate model [33].

When combined with the MLSM, the surface constructed by the GERSM changes with the position of the point in the design space. Therefore, the total error and the regression coefficients are functions of the sampling point position, which can be defined as:

$$ \boldsymbol{\beta} \left(\boldsymbol{x}\right)=\underset{\boldsymbol{\beta}}{\arg\ \min }{E}_{\mathrm{total}}\left(\boldsymbol{x}\right)=\underset{\boldsymbol{\beta}}{\arg\ \min}\left[\left(1-\mathrm{swg}\right)\cdotp {E}_y\left(\boldsymbol{x}\right)+\mathrm{swg}\cdotp {E}_g\left(\boldsymbol{x}\right)\right] $$

(4)

$$ \boldsymbol{x}={\left[{x}_1,{x}_2,\cdots, {x}_D\right]}^{\mathrm{T}}\kern3pt \in \kern3pt {\mathbb{R}}^D $$

(5)

where β(x) is the regression coefficient vector; E_total(x) is the total error function; x is the general sampling point; D is the number of the design variables; ℝ^D is the design space; swg is the sensitivity control factor for measuring the effect of the response information and the gradient information on the total fitting error; E_y(x) and E_g(x) represent the response error function and the gradient error function, respectively.

The second order polynomial is used to construct the response surface, which can be defined as:

$$ \hat{y}\left(\boldsymbol{x}\right)={b}_0+\sum \limits_{i=1}^D{b}_i\cdotp {x}_i+\sum \limits_{i=1}^D{b}_{ii}\cdotp {x_i}^2+\sum \limits_{i=1}^D\sum \limits_{j>i}^D{b}_{ij}\cdotp {x}_i{x}_j $$

(6)

where $ \hat{y}\left(\boldsymbol{x}\right) $ is the response surface; the mappings b₀, b_i, b_ii, b_ij : x → ℝ project a sampling point to its corresponding regression coefficients; x_i and x_j denote the i-th and j-th design variables, respectively.

According to Eq. (6), the error functions E_y(x) and E_g(x) are formulated as follows:

$$ {E}_y\left(\boldsymbol{x}\right)={\left(\boldsymbol{y}-\mathbf{X}\bullet \boldsymbol{\beta} \left(\boldsymbol{x}\right)\right)}^{\mathrm{T}}{\mathbf{W}}_y\left(\boldsymbol{x}\right)\left(\boldsymbol{y}-\mathbf{X}\bullet \boldsymbol{\beta} \left(\boldsymbol{x}\right)\right) $$

(7)

$$ {E}_g\left(\boldsymbol{x}\right)=\sum \limits_{i=1}^D{\left({\boldsymbol{g}}_{x_i}-{\mathbf{T}}_{x_i}\bullet \boldsymbol{\beta} \left(\boldsymbol{x}\right)\right)}^{\mathrm{T}}{\mathbf{W}}_g\left(\boldsymbol{x}\right)\left({\boldsymbol{g}}_{x_i}-{\mathbf{T}}_{x_i}\bullet \boldsymbol{\beta} \left(\boldsymbol{x}\right)\right) $$

(8)

where y is the true response vector; $ {\boldsymbol{g}}_{x_i} $ is the true gradient vector for the i-th design variable of x; X denotes the response design matrix; $ {\mathbf{T}}_{x_i} $ denotes the gradient design matrix of the i-th design variable, which is the partial derivative of X with respect to the i-th design variable; the diagonal matrices W_y(x) and W_g(x) represent the response and gradient weight matrices, respectively.

The vectors y and β(x) and the matrices X, $ {\mathbf{T}}_{x_i} $, W_y(x), and W_g(x) can be formulated as follows:

$$ \boldsymbol{y}={\left[{y}^{(1)}\kern0.5em {y}^{(2)}\kern0.5em \mathbf{\cdots}\kern0.5em {y}^{(k)}\kern0.5em \cdots \kern0.5em {y}^{(N)}\right]}^{\mathrm{T}} $$

(9)

$$ \boldsymbol{\beta} \left(\boldsymbol{x}\right)={\left[{b}_0\kern0.5em {b}_1\kern0.5em \mathbf{\cdots}\kern0.5em {b}_i\kern0.5em \cdots \kern0.5em {b}_D\kern0.5em {b}_{11}\kern0.5em \mathbf{\cdots}\kern0.5em {b}_{ii}\kern0.5em \cdots \kern0.5em {b}_{ij}\kern0.5em \cdots \kern0.5em {b}_{\left(D-1\right)D}\right]}^{\mathrm{T}} $$

(10)

$$ \mathbf{X}=\left[\begin{array}{ccccccccccccc}1& {x}_1^{(1)}& \cdots & {x}_i^{(1)}& \cdots & {x}_D^{(1)}& {x_1^{(1)}}^2& \cdots & {x_i^{(1)}}^2& \cdots & {x}_i^{(1)}{x}_j^{(1)}& \cdots & {x}_{D-1}^{(1)}{x}_D^{(1)}\\ {}\vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \ddots & \vdots \\ {}1& {x}_1^{(k)}& \cdots & {x}_i^{(k)}& \cdots & {x}_D^{(k)}& {x_1^{(k)}}^2& \cdots & {x_i^{(k)}}^2& \cdots & {x}_i^{(k)}{x}_j^{(k)}& \cdots & {x}_{D-1}^{(k)}{x}_D^{(k)}\\ {}\vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \ddots & \vdots \\ {}1& {x}_1^{(N)}& \cdots & {x}_i^{(N)}& \cdots & {x}_D^{(N)}& {x_1^{(N)}}^2& \cdots & {x_i^{(N)}}^2& \cdots & {x}_i^{(N)}{x}_j^{(N)}& \cdots & {x}_{D-1}^{(N)}{x}_D^{(N)}\end{array}\right] $$

(11)

$$ {\mathbf{T}}_{x_i}=\left[\begin{array}{ccccccccccccc}0& 0& \cdots & 1& \cdots & 0& 0& \cdots & 2{x}_i^{(1)}& \cdots & {x}_j^{(1)}& \cdots & 0\\ {}\vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \ddots & \vdots \\ {}0& 0& \cdots & 1& \cdots & 0& 0& \cdots & 2{x}_i^{(k)}& \cdots & {x}_j^{(k)}& \cdots & 0\\ {}\vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \vdots & \ddots & \vdots & \ddots & \vdots & \ddots & \vdots \\ {}0& 0& \cdots & 1& \cdots & 0& 0& \cdots & 2{x}_i^{(N)}& \cdots & {x}_j^{(N)}& \cdots & 0\end{array}\right] $$

(12)

$$ {\mathbf{W}}_y\left(\boldsymbol{x}\right)=\left[\begin{array}{ccccc}{\omega}_y\left(\boldsymbol{x}-{\boldsymbol{x}}^{(1)}\right)& \cdots & 0& \cdots & 0\\ {}\vdots & \ddots & \vdots & \vdots & \vdots \\ {}0& \cdots & {\omega}_y\left(\boldsymbol{x}-{\boldsymbol{x}}^{(k)}\right)& \cdots & 0\\ {}\vdots & \vdots & \vdots & \ddots & \vdots \\ {}0& \cdots & 0& \cdots & {\omega}_y\left(\boldsymbol{x}-{\boldsymbol{x}}^{(N)}\right)\end{array}\right] $$

(13)

$$ {\mathbf{W}}_g\left(\boldsymbol{x}\right)=\left[\begin{array}{ccccc}{\omega}_g\left(\boldsymbol{x}-{\boldsymbol{x}}^{(1)}\right)& \cdots & 0& \cdots & 0\\ {}\vdots & \ddots & \vdots & \vdots & \vdots \\ {}0& \cdots & {\omega}_g\left(\boldsymbol{x}-{\boldsymbol{x}}^{(k)}\right)& \cdots & 0\\ {}\vdots & \vdots & \vdots & \ddots & \vdots \\ {}0& \cdots & 0& \cdots & {\omega}_g\left(\boldsymbol{x}-{\boldsymbol{x}}^{(N)}\right)\end{array}\right] $$

(14)

where y^(k) denotes the true response of the k-th sampling point; N is the number of sampling points; x^(k) denotes the k-th sampling point; $ {x}_i^{(k)} $ denotes the i-th design variable of the k-th sampling point. The diagonal elements ω_y(x − x^(k)) and ω_g(x − x^(k)) denote the weight of x^(k) in the calculation of the response error and the gradient error, respectively. The exponential function is applied as the weight function to calculate ω_y(x − x^(k)) and ω_g(x − x^(k)). The formulas are as follows:

$$ {\omega}_y\left(\boldsymbol{x}-{\boldsymbol{x}}^{(k)}\right)={\omega}_g\left(\boldsymbol{x}-{\boldsymbol{x}}^{(k)}\right)=\omega (d)=\left\{\begin{array}{r}\exp \left(-\frac{d}{\mathrm{RI}}\right),\frac{d}{\mathrm{RI}}\le 1\ \\ {}0\kern1.5em ,\frac{d}{\mathrm{RI}}>1\end{array}\right. $$

(15)

$$ d={\left\Vert \boldsymbol{x}-{\boldsymbol{x}}^{(k)}\right\Vert}_2={\left[\sum \limits_{i=1}^D{\left({x}_i-{x}_i^{(k)}\right)}^2\right]}^{1/2} $$

(16)

where ω_y(∙) and ω_g(∙) denote the response and gradient weight function, respectively; RI is the size of the support region, which means that only the sampling point located in the support region will have an impact on the prediction of x; d is the Euclidean distance between x and x^(k).
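As a minimal sketch of Eqs. (15) and (16), the weight of a sampling point can be computed as below. The function name `mls_weight` is hypothetical. The sketch uses the raw Euclidean distance; whether the design variables are scaled before the distance is computed is not stated here, so in practice some scaling may be needed when the variables have very different ranges.

```python
import math
from typing import Sequence


def mls_weight(x: Sequence[float], x_k: Sequence[float], ri: float) -> float:
    """Exponential weight of sampling point x_k at x with support size ri."""
    # Eq. (16): Euclidean distance between x and the k-th sampling point.
    d = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, x_k)))
    ratio = d / ri
    # Eq. (15): points outside the support region receive zero weight.
    return math.exp(-ratio) if ratio <= 1.0 else 0.0


print(mls_weight((0.0, 0.0), (0.0, 0.0), 10.0))  # coincident point
print(mls_weight((0.0, 0.0), (3.0, 4.0), 10.0))  # d = 5, inside the support
print(mls_weight((0.0, 0.0), (3.0, 4.0), 4.0))   # d = 5, outside the support
```

A larger RI lets more sampling points influence the local fit, whereas a smaller RI makes the approximation more local.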

According to Eq. (4), the regression coefficient vector β(x) of the GERSM can be obtained from the following set of equations:

$$ \frac{\partial {E}_{\mathrm{total}}\left(\boldsymbol{x}\right)}{\partial \boldsymbol{\beta}}=\left(1-\mathrm{swg}\right)\cdotp \frac{\partial {E}_y\left(\boldsymbol{x}\right)}{\partial \boldsymbol{\beta}}+\mathrm{swg}\cdotp \frac{\partial {E}_g\left(\boldsymbol{x}\right)}{\partial \boldsymbol{\beta}}=0 $$

(17)

$$ \boldsymbol{\beta} \left(\boldsymbol{x}\right)=A{\left(\boldsymbol{x}\right)}^{-1}B\left(\boldsymbol{x}\right) $$

(18)

$$ A\left(\boldsymbol{x}\right)=\left(1-\mathrm{swg}\right)\cdotp {\mathbf{X}}^{\mathrm{T}}{\mathbf{W}}_y\left(\boldsymbol{x}\right)\mathbf{X}+\mathrm{swg}\cdotp \sum \limits_{i=1}^D{{\mathbf{T}}_{x_i}}^{\mathrm{T}}{\mathbf{W}}_g\left(\boldsymbol{x}\right){\mathbf{T}}_{x_i} $$

(19)

$$ B\left(\boldsymbol{x}\right)=\left(1-\mathrm{swg}\right)\cdotp {\mathbf{X}}^{\mathrm{T}}{\mathbf{W}}_y\left(\boldsymbol{x}\right)\boldsymbol{y}+\mathrm{swg}\cdotp \sum \limits_{i=1}^D{{\mathbf{T}}_{x_i}}^{\mathrm{T}}{\mathbf{W}}_g\left(\boldsymbol{x}\right){\boldsymbol{g}}_{x_i} $$

(20)

Based on the GERSM, the response of x can be predicted as:

$$ \overset{\sim }{y}=\left(1-\mathrm{swg}\right)\cdotp {\boldsymbol{x}}^p\boldsymbol{\beta} \left(\boldsymbol{x}\right)+\mathrm{swg}\cdotp \sum \limits_{i=1}^D{\boldsymbol{t}}_{x_i}\boldsymbol{\beta} \left(\boldsymbol{x}\right) $$

(21)

$$ {\boldsymbol{x}}^p=\left[1\kern0.5em {x}_1\kern0.5em \cdots \kern0.5em {x}_i\kern0.5em \cdots \kern0.5em {x}_D\kern0.5em {x_1}^2\kern0.5em \cdots \kern0.5em {x_i}^2\kern0.5em \cdots \kern0.5em {x}_i{x}_j\kern0.5em \cdots \kern0.5em {x}_{D-1}{x}_D\right] $$

(22)

$$ {\boldsymbol{t}}_{x_i}=\left[0\kern0.5em 0\kern0.5em \cdots \kern0.5em 1\kern0.5em \cdots \kern0.5em 0\kern0.5em 0\kern0.5em \cdots \kern0.5em 2{x}_i\kern0.5em \cdots \kern0.5em {x}_j\kern0.5em \cdots \kern0.5em 0\right] $$

(23)

where x^p denotes the response design vector of x; $ {\boldsymbol{t}}_{x_i} $ denotes the gradient design vector for the i-th design variable of x.
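The coefficient solve of Eqs. (18)-(20) can be sketched with NumPy as follows. This is an illustration under stated assumptions, not the authors' implementation: the function names, the default values of `swg` and `ri`, and the test data are hypothetical, and ω_y = ω_g is taken from Eq. (15). The sketch stops at β(x) and evaluates only the response term x^p β(x).

```python
import numpy as np

def response_row(x):
    """Quadratic basis x^p of Eq. (22): [1, x_i, x_i^2, x_i*x_j (i<j)]."""
    x = np.asarray(x, dtype=float)
    D = x.size
    cross = [x[i] * x[j] for i in range(D) for j in range(i + 1, D)]
    return np.concatenate(([1.0], x, x ** 2, cross))

def gradient_row(x, i):
    """Derivative of the basis with respect to x_i, as in Eq. (23)."""
    x = np.asarray(x, dtype=float)
    D = x.size
    row = np.zeros(1 + 2 * D + D * (D - 1) // 2)
    row[1 + i] = 1.0                 # d(x_i)/dx_i
    row[1 + D + i] = 2.0 * x[i]      # d(x_i^2)/dx_i
    col = 1 + 2 * D
    for a in range(D):
        for b in range(a + 1, D):
            if a == i:
                row[col] = x[b]      # d(x_i*x_b)/dx_i
            elif b == i:
                row[col] = x[a]      # d(x_a*x_i)/dx_i
            col += 1
    return row

def gersm_beta(x, samples, y, grads, swg=0.5, ri=2.0):
    """Solve A(x) beta = B(x), Eqs. (18)-(20)."""
    samples = np.asarray(samples, dtype=float)
    y = np.asarray(y, dtype=float)
    grads = np.asarray(grads, dtype=float)
    d = np.linalg.norm(samples - np.asarray(x, dtype=float), axis=1)
    W = np.diag(np.where(d / ri <= 1.0, np.exp(-d / ri), 0.0))
    X = np.array([response_row(s) for s in samples])
    A = (1.0 - swg) * X.T @ W @ X
    B = (1.0 - swg) * X.T @ W @ y
    for i in range(samples.shape[1]):
        T = np.array([gradient_row(s, i) for s in samples])
        A += swg * T.T @ W @ T
        B += swg * T.T @ W @ grads[:, i]
    return np.linalg.solve(A, B)

# Hypothetical data: a known quadratic on a 3 x 3 grid with exact gradients
grid = [(a, b) for a in (0.0, 0.5, 1.0) for b in (0.0, 0.5, 1.0)]
y = [1 + 2 * a - b + 0.5 * a * a + a * b for a, b in grid]
grads = [(2 + a + b, -1 + a) for a, b in grid]
beta = gersm_beta([0.5, 0.5], grid, y, grads)
y_hat = response_row([0.5, 0.5]) @ beta
```

Because the test function is itself quadratic and the gradients are exact, the solve should recover the generating coefficients, which makes the example convenient for checking the basis ordering.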

3.2.2 Adaptive sensitivity generating

The construction of the GERSM simultaneously requires the response and sensitivity information of the sampling points. Through numerical or practical experiment, only the response information can be obtained. Therefore, based on the finite difference (FD), ASGM is proposed to obtain the sensitivity information of the sampling points. For the general sampling point x in the design space, the general process of capturing the sensitivity information of x is as follows: Firstly, an adaptive response surface model (AdaRSM) is constructed as the strong predictor, which is proposed to explore the compact neighborhood of x. Then, the response of the new sampling point in the compact neighborhood of x can be given by the strong predictor AdaRSM. Finally, the FD is used to calculate the gradient vector of x.

The AdaRSM is a linear combination of a series of base response surface models (BRSMs), which are generated by iteratively updating the weights of the sampling points. The AdaRSM can be expressed as:

$$ \hat{\boldsymbol{f}}\left(\boldsymbol{x}\right)=\sum \limits_{m=1}^M{\alpha}^{(m)}\cdotp {\mathrm{RSM}}^{(m)}\left(\boldsymbol{x}\right) $$

(24)

where $ \hat{\boldsymbol{f}}\left(\boldsymbol{x}\right) $ is the adaptive response surface; α^(m) denotes the weight of the m-th BRSM; RSM^(m)(x) is the m-th BRSM; x is the general sampling point, as shown in Eq. (5); M is the number of the BRSM.

The construction of AdaRSM includes the following steps: (1) initialize the weights of the sampling points and construct the first BRSM; (2) calculate the total error of the BRSM over the entire sample set; (3) update the weight of each sampling point according to the prediction error of the BRSM at that point; (4) construct the next BRSM based on the updated weights of the sampling points, repeating steps (2)-(4) until the target number of BRSMs has been reached; (5) calculate the weight of each BRSM according to its total error. The m-th BRSM can be expressed as:

$$ {\mathrm{RSM}}^{(m)}\left(\boldsymbol{x}\right)={a}_0^{(m)}+\sum \limits_{i=1}^D{a}_i^{(m)}\cdotp {x}_i+\sum \limits_{i=1}^D{a}_{ii}^{(m)}\cdotp {x_i}^2+\sum \limits_{i=1}^D\sum \limits_{j>i}^D{a}_{ij}^{(m)}\cdotp {x}_i{x}_j={\boldsymbol{x}}^p{\boldsymbol{\beta}}^{(m)} $$

(25)

$$ {\boldsymbol{\beta}}^{(m)}={\left({\mathbf{X}}^{\mathrm{T}}{\mathbf{W}}^{(m)}\mathbf{X}\right)}^{-1}{\mathbf{X}}^{\mathrm{T}}{\mathbf{W}}^{(m)}\boldsymbol{y} $$

(26)

where $ {a}_0^{(m)} $, $ {a}_i^{(m)} $, $ {a}_{ii}^{(m)}, $ and $ {a}_{ij}^{(m)} $ are the regression coefficients of the m-th BRSM; β^(m) denotes the regression coefficient vector of the m-th BRSM; x^p, X, and y are as shown in Eq. (22), (11), and (9); W^(m) represents the weight matrix of the sampling points when constructing the m-th BRSM, which can be formulated as:

$$ {\mathbf{W}}^{(m)}=\left[\begin{array}{ccccc}{\upomega}_1^{(m)}& \cdots & 0& \cdots & 0\\ {}\vdots & \ddots & \vdots & \vdots & \vdots \\ {}0& \cdots & {\upomega}_k^{(m)}& \cdots & 0\\ {}\vdots & \vdots & \vdots & \ddots & \vdots \\ {}0& \cdots & 0& \cdots & {\upomega}_N^{(m)}\end{array}\right] $$

(27)

where $ {\upomega}_k^{(m)} $ denotes the weight of the k-th sampling point when constructing the m-th BRSM, which can be evaluated by the following equations.

$$ {\upomega}_k^{(m)}=\left\{\begin{array}{r}1\kern4em ,m=1\kern1.75em \\ {}\frac{\upomega_k^{\left(m-1\right)}}{Z^{\left(m-1\right)}}\cdotp \left[1+\exp \left(\lambda {\varepsilon}_k^{\left(m-1\right)}\right)\right],m=2,3,\cdots, M\end{array}\right. $$

(28)

$$ {\varepsilon}_k^{\left(m-1\right)}=\frac{\left|{\mathrm{RSM}}^{\left(m-1\right)}\left({\boldsymbol{x}}^{(k)}\right)-{y}^{(k)}\right|}{y^{(k)}} $$

(29)

$$ {Z}^{\left(m-1\right)}=\frac{1}{N}\sum \limits_{k=1}^N{\upomega}_k^{\left(m-1\right)}\cdotp \left[1+\exp \left(\lambda {\varepsilon}_k^{\left(m-1\right)}\right)\right] $$

(30)

where $ {\upomega}_k^{\left(m-1\right)} $ and $ {\upomega}_k^{(m)} $ denote the weight of x^(k) when constructing the (m-1)-th and the m-th BRSM, respectively; $ {\varepsilon}_k^{\left(m-1\right)} $ denotes the relative error of the (m-1)-th BRSM over x^(k); RSM^{(m − 1)}(x^(k)) denotes the predicted response of the (m-1)-th BRSM for x^(k); Z^{(m − 1)} is the normalization factor of sampling point weights; λ is the scaling factor.

The weight of each BRSM depends on the total error of each BRSM over the entire sampling points. The smaller the total error, the larger the weight of the BRSM. The weight of the m-th BRSM is defined as follows:

$$ {\varepsilon}^{(m)}=\sqrt{\frac{1}{N}\cdotp \sum \limits_{k=1}^N{\left[{\mathrm{RSM}}^{(m)}\left({\boldsymbol{x}}^{(k)}\right)-{y}^{(k)}\right]}^2} $$

(31)

$$ {\alpha}^{(m)}=\frac{\exp \left(\tau \cdotp {\varepsilon}^{(m)}\right)}{\sum_{l=1}^M\exp \left(\tau \cdotp {\varepsilon}^{(l)}\right)} $$

(32)

where ε^(m) is the root mean square error of the m-th BRSM over the entire samples; RSM^(m)(x^(k)) denotes the predicted response of the m-th BRSM for x^(k); τ is the scaling factor; α^(m) denotes the weight of the m-th BRSM.
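The AdaRSM construction of Eqs. (24)-(32) can be sketched as below. This is a sketch under assumptions rather than the authors' code: the function names and the values of M, λ, and τ are hypothetical. τ is taken as negative so that a smaller RMSE gives a larger BRSM weight, as the text requires, and the relative error of Eq. (29) is divided by |y^(k)| to keep it non-negative, which assumes nonzero responses.

```python
import numpy as np

def quad_basis(X):
    """Rows of the quadratic design matrix, ordered as in Eq. (22)."""
    X = np.atleast_2d(np.asarray(X, dtype=float))
    D = X.shape[1]
    cols = [np.ones(len(X))] + [X[:, i] for i in range(D)]
    cols += [X[:, i] ** 2 for i in range(D)]
    cols += [X[:, i] * X[:, j] for i in range(D) for j in range(i + 1, D)]
    return np.column_stack(cols)

def fit_adarsm(X, y, M=5, lam=1.0, tau=-1.0):
    """Return (betas, alphas) of the M base models and their weights."""
    P = quad_basis(X)
    y = np.asarray(y, dtype=float)
    w = np.ones(len(y))                          # Eq. (28), m = 1
    betas, rmses = [], []
    for _ in range(M):
        PW = P.T * w                             # X^T W with diagonal W
        beta = np.linalg.solve(PW @ P, PW @ y)   # Eq. (26)
        pred = P @ beta
        betas.append(beta)
        rmses.append(np.sqrt(np.mean((pred - y) ** 2)))   # Eq. (31)
        eps = np.abs(pred - y) / np.abs(y)                # Eq. (29)
        boost = w * (1.0 + np.exp(lam * eps))
        w = boost / boost.mean()                          # Eqs. (28), (30)
    scores = np.exp(tau * np.array(rmses))
    alphas = scores / scores.sum()                        # Eq. (32)
    return np.array(betas), alphas

def predict_adarsm(model, X):
    """Weighted sum of the base models, Eq. (24)."""
    betas, alphas = model
    return quad_basis(X) @ (alphas @ betas)

# Hypothetical training data with strictly positive responses
rng = np.random.default_rng(0)
X_train = rng.uniform(0.0, 1.0, size=(30, 2))
y_train = 2.0 + np.sin(3.0 * X_train[:, 0]) + X_train[:, 1] ** 2
model = fit_adarsm(X_train, y_train)
y_fit = predict_adarsm(model, X_train)
```

Since every BRSM shares the same quadratic basis, the weighted sum in Eq. (24) reduces to a single quadratic model with coefficients Σ α^(m) β^(m), which is what `predict_adarsm` exploits.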

The sensitivity information (gradient vector) of x^(k) is approximated by the FD, which can be defined as follows:

$$ {\mathbf{g}}_{{\boldsymbol{x}}_{\boldsymbol{i}}}^{(k)}=\frac{\hat{f}\left({\boldsymbol{x}}_{\boldsymbol{i}+}^{(k)}\right)-\hat{f}\left({\boldsymbol{x}}_{\boldsymbol{i}-}^{(k)}\right)}{2h} $$

(33)

$$ {\boldsymbol{x}}_{\boldsymbol{i}+}^{(k)}={\left[{x}_1,\cdots, {x}_{i-1},{x}_i+h,{x}_{i+1},\cdots, {x}_D\right]}^{\mathrm{T}} $$

(34)

$$ {\boldsymbol{x}}_{\boldsymbol{i}-}^{(k)}={\left[{x}_1,\cdots, {x}_{i-1},{x}_i-h,{x}_{i+1},\cdots, {x}_D\right]}^{\mathrm{T}} $$

(35)

where $ {\mathbf{g}}_{{\boldsymbol{x}}_{\boldsymbol{i}}}^{(k)} $ denotes the gradient vector of x^(k) for the i-th design variable; $ {\boldsymbol{x}}_{\boldsymbol{i}+}^{(k)} $ and $ {\boldsymbol{x}}_{\boldsymbol{i}-}^{(k)} $ are the new sampling points in the compact neighborhood of x^(k), which are generated by adding h to the i-th design variable of x^(k) and reducing the i-th design variable of x^(k) by h, respectively; $ \hat{f}\left({\boldsymbol{x}}_{\boldsymbol{i}+}^{(k)}\right) $ and $ \hat{f}\left({\boldsymbol{x}}_{\boldsymbol{i}-}^{(k)}\right) $ denote the predicted response of the strong predictor AdaRSM for $ {\boldsymbol{x}}_{\boldsymbol{i}+}^{(k)} $ and $ {\boldsymbol{x}}_{\boldsymbol{i}-}^{(k)} $, respectively.
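The central difference of Eqs. (33)-(35) can be sketched as follows. The function `surrogate` is a hypothetical stand-in for the AdaRSM strong predictor, and the step size h is an illustrative choice.

```python
def fd_gradient(predictor, x, h=1e-3):
    """Central-difference gradient of a predictor at x, Eqs. (33)-(35)."""
    grad = []
    for i in range(len(x)):
        x_plus = list(x)
        x_plus[i] += h           # Eq. (34): add h to the i-th variable
        x_minus = list(x)
        x_minus[i] -= h          # Eq. (35): subtract h from the i-th variable
        grad.append((predictor(x_plus) - predictor(x_minus)) / (2.0 * h))
    return grad

# Hypothetical stand-in for the AdaRSM strong predictor
def surrogate(x):
    return 1.0 + 2.0 * x[0] - x[1] + 0.5 * x[0] ** 2 + x[0] * x[1]

grad = fd_gradient(surrogate, [0.2, 0.7])
```

For a quadratic predictor the central difference has no truncation error, so the result should match the analytic gradient (2 + x_1 + x_2, −1 + x_1) up to rounding.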

3.3 NSGA-III for locating Pareto-optimal solutions

The NSGA-III algorithm [34, 35] uses a reference point-based selection mechanism to select the population, and it performs well on optimization problems with four or more objectives. By introducing a set of reference points, individuals are selected according to their distances to the reference points, which guides the population search toward the reference points. Because the reference points are evenly distributed over the reference hyperplane, the reference point-based selection mechanism distributes the population more evenly along the Pareto frontier and helps the optimization process converge to the Pareto-optimal solution set.

The process of locating the Pareto-optimal solution set of multi-objective optimization problem by the NSGA-III is as follows:

Step 1: The initial population containing N individuals is randomly generated, where N is the size of the population.
Step 2: The population of the current generation is set as the parent population. The parent population undergoes random selection, simulated binary crossover, and polynomial mutation to generate the offspring population, whose size is also N.
Step 3: The parent population and the offspring population are combined to form a new population with a population size of 2N. Then, the fitness function is used to evaluate each individual in the new combined population by calculating the responses (objectives).
Step 4: According to the fitness of each individual, the non-dominated sorting algorithm and the reference point-based selection mechanism are used to select the N best individuals from the combined population of Step 3 to enter the next generation.
Step 5: Check whether the convergence condition is satisfied (e.g., the number of generations reaches the preset maximum). If it is satisfied, the N individuals selected in Step 4 form the Pareto-optimal solution set and the optimization process ends; otherwise, they are treated as the population of the next generation and the process returns to Step 2.
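The non-dominated sorting used in Step 4 rests on the Pareto dominance test, which can be sketched as below. This is a minimal illustration that extracts only the first non-dominated front for minimized objectives; the reference point-based niching of NSGA-III is not shown, and the population values are hypothetical.

```python
def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all objectives minimized)."""
    no_worse = all(ai <= bi for ai, bi in zip(a, b))
    strictly_better = any(ai < bi for ai, bi in zip(a, b))
    return no_worse and strictly_better

def first_front(objectives):
    """Indices of the individuals that no other individual dominates."""
    return [
        i for i, a in enumerate(objectives)
        if not any(dominates(b, a) for j, b in enumerate(objectives) if j != i)
    ]

# Hypothetical two-objective population
population = [(1.0, 4.0), (2.0, 2.0), (3.0, 3.0), (4.0, 1.0)]
front = first_front(population)
```

Here the individual (3.0, 3.0) is dominated by (2.0, 2.0), so it is excluded from the first front.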

3.4 Data analysis

Sensitivity analysis among process parameters and responses

Sensitivity analysis is applied to identify the important process parameters (design variables) which have significant influences on responses (objectives). It can exclude irrelevant parameters to reduce the dimension of the design space.

The correlation coefficient is used to indicate the sensitivity between process parameters and responses, and it is calculated by the following formula.

$$ \rho =\frac{\mathrm{Cov}\left(X,Y\right)}{\sqrt{\mathrm{Var}(X)\mathrm{Var}(Y)}}=\frac{E\left[\left(X-{\mu}_X\right)\left(Y-{\mu}_Y\right)\right]}{\sigma_X{\sigma}_Y} $$

(36)

where Cov(X, Y) is the covariance of vector X and vector Y; Var(X) and Var(Y) are the variance of X and Y respectively; μ_X and μ_Y are the mean of X and Y respectively; σ_X and σ_Y are the standard deviation of X and Y respectively; ρ is the correlation coefficient, which is in the range of [-1, 1]. If ρ = 0, it indicates that there is no correlation between the two vectors; if ρ < 0, there is a negative correlation between the two vectors; if ρ > 0, there is a positive correlation between the two vectors.
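Eq. (36) can be evaluated directly from paired samples, as in the following sketch. The injection time and warpage values are hypothetical and serve only to show a negative correlation.

```python
import math

def pearson(x, y):
    """Correlation coefficient of Eq. (36) for two equal-length samples."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/n factors of covariance and standard deviations cancel out
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    dev_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)

# Hypothetical samples of one process parameter and one response
injection_time = [1.0, 1.5, 2.0, 2.5, 3.0]
warpage = [22.0, 19.5, 18.0, 15.0, 14.5]
rho = pearson(injection_time, warpage)
```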

Prediction accuracy analysis

The response predictor is used as the fitness function of the NSGA-III algorithm in the multi-objective optimization process. Its prediction accuracy influences the optimization results, and a poor predictor may prevent the optimization algorithm from converging. Therefore, the prediction accuracy is verified by evaluating the root mean squared error (RMSE), mean absolute error (MAE), and coefficient of determination (R²) between the responses calculated by simulation experiments and the responses predicted by the response predictor. These evaluation criteria can be formulated as follows:

$$ \mathrm{R}{\mathrm{MSE}}_j=\sqrt{\frac{1}{N_V}\sum \limits_{k=1}^{N_V}{\left({\hat{y}}_j^{(k)}-{y}_j^{(k)}\right)}^2}\ j=1,2,\cdots, K $$

(37)

$$ {\mathrm{MAE}}_j=\frac{1}{N_V}\sum \limits_{k=1}^{N_V}\left|{\hat{y}}_j^{(k)}-{y}_j^{(k)}\right|\ j=1,2,\cdots, K $$

(38)

$$ {R^2}_j=1-\frac{\sum_{k=1}^{N_V}{\left({\hat{y}}_j^{(k)}-{y}_j^{(k)}\right)}^2}{\sum_{k=1}^{N_V}{\left({y}_j^{(k)}-{\overline{y}}_j\right)}^2}\ j=1,2,\cdots, K $$

(39)

where RMSE_j, MAE_j, and R²_j denote the RMSE, MAE, and R² of the j-th response (objective), respectively; K denotes the number of the objectives; N_V denotes the number of the sampling points for verification; $ {\hat{y}}_j^{(k)} $ and $ {y}_j^{(k)} $ denote the j-th response of the k-th sampling point predicted by the corresponding response predictor and calculated by simulation experiment, respectively; $ {\overline{y}}_j $ denotes the mean of the j-th response.
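The three criteria of Eqs. (37)-(39) can be computed for one response as in the sketch below. The function name and the verification values are hypothetical.

```python
import math

def accuracy_metrics(y_true, y_pred):
    """Return (RMSE, MAE, R^2) for one response, Eqs. (37)-(39)."""
    n = len(y_true)
    errors = [p - t for p, t in zip(y_pred, y_true)]
    sse = sum(e ** 2 for e in errors)
    mean_true = sum(y_true) / n
    sst = sum((t - mean_true) ** 2 for t in y_true)
    rmse = math.sqrt(sse / n)
    mae = sum(abs(e) for e in errors) / n
    r2 = 1.0 - sse / sst
    return rmse, mae, r2

# Hypothetical simulated and predicted responses at four verification points
rmse, mae, r2 = accuracy_metrics([1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8])
```

For these values the squared errors sum to 0.10 and the total sum of squares is 5, so R² = 1 − 0.10/5 = 0.98.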

4 Case study

Because they are lightweight, customizable, adaptable to complex and variable structures, low in cost, and efficient to produce, plastic injection-molded products such as bumpers, doors, and dashboards are widely used in the automobile industry. The automobile bumper should meet mechanical and geometric requirements to ensure the necessary protective function as well as shape consistency and an attractive appearance. Therefore, the automobile front bumper is taken as a case study to verify the proposed method.

4.1 Finite element simulation model

An injection-molded automobile front bumper, shown in Fig. 4, is considered. The maximum size of this product is 1800 mm × 430 mm × 725 mm, with a maximum thickness of 4 mm and a minimum thickness of 2.5 mm.

Considering the complex structure of the product, the multiple gates can achieve better filling. However, the utilization of multiple gates can inevitably cause the generation of weldlines. The valve hot runner system adopts the valve gate controllers to control the opening and closing of each gate, which can realize the sequential filling of the whole cavity. Therefore, it can reduce the stress concentration at the intersection of the melt flow front and effectively eliminate weldlines. The overview of the conventional and valve hot runner system is shown in Fig. 5. The gate locations, the number of gates, and the gate type are determined by the combination of the “gate location” analysis and the “fill” analysis. As shown in Fig. 5, three new gates and corresponding hot runners are added to the valve hot runner system compared to the conventional runner system in order to balance the runners. For the valve hot runner system, the flow path of the melted plastic in each runner is as follows: hot sprue → hot runner → hot runner → hot gate → cold runner → cold runner → cold gate.

The details of the numerical simulation are shown in Table 2. The numerical simulations are executed with the Autodesk® Moldflow® software (version 2018) [36] on a computer with an Intel® Core™ i9-9900K CPU @ 3.60 GHz. The product is manufactured from acrylonitrile butadiene styrene (ABS), whose properties are listed in Table 3.

Table 2 Detailed settings of the numerical simulations in Moldflow


Table 3 Material property of acrylonitrile butadiene styrene (ABS)


4.2 Numerical result analysis

4.2.1 Sensitivity analysis for process parameters

The correlation coefficient can show sensitivity among the design variables and responses, which has been calculated and displayed in Fig. 6. As shown in Fig. 6, the value in each sub-box is the correlation coefficient of the two crossed variables. The positive value indicates the positive correlation, while the negative value indicates the negative correlation, and zero value indicates irrelevance. The bigger the absolute value of the correlation coefficient is, the stronger the relationship between the two variables is.

According to the heat map in Fig. 6, t_inj, P_p1, and t_c have a negative influence on warpage, with correlation coefficients of −0.86, −0.5, and −0.28, respectively, while T_melt has a positive influence on warpage, with a correlation coefficient of 0.18. Similarly, T_melt and t_p1 have a positive influence on the minimum weldline temperature, with correlation coefficients of 0.96 and 0.27, respectively, while t_inj, P_p1, and t_p2 have a negative influence, with correlation coefficients of −0.12, −0.19, and −0.27, respectively. P_p1, T_melt, t_p1, and t_p2 have a positive influence on clamping force, with correlation coefficients of 0.92, 0.2, 0.15, and 0.1, respectively, while t_inj and t_c have a negative influence, with correlation coefficients of −0.17 and −0.22, respectively. Because cycle time is the sum of t_inj, t_p1, t_p2, and t_c, each of these parameters has a significant positive influence on cycle time. In addition, the correlation coefficient between warpage and cycle time is −0.48, which indicates that cycle time and warpage change in opposite directions.

4.2.2 Accuracy analysis for response prediction

Fifteen verification experiments, distinct from the thirty-six initial sampling points and likewise generated by Latin hypercube sampling (LHS), are carried out to verify the prediction accuracy of the response predictors. The cycle time is the sum of the injection time, packing time, and cooling time, so it can be calculated exactly from the design variables of the four-objective optimization problem. The proposed GERSM, classical RSM, Kriging, support vector regression (SVR), and Gaussian process regression (GPR) are applied to construct the response surfaces relating the design variables to the responses (warpage, minimum weldline temperature, and clamping force). The prediction results are displayed in Fig. 7, which shows that Kriging has the largest prediction error, while the proposed model performs best.
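The Latin hypercube sampling used for the initial and verification points can be sketched on the unit hypercube as below. The function name, the seed, and the choice of three dimensions are hypothetical; mapping the unit-interval samples to the actual parameter ranges is left out.

```python
import random

def latin_hypercube(n_points, n_dims, seed=0):
    """Draw n_points in [0, 1)^n_dims with one point per stratum in every dimension."""
    rng = random.Random(seed)
    columns = []
    for _ in range(n_dims):
        strata = list(range(n_points))
        rng.shuffle(strata)      # random pairing of strata across dimensions
        # One uniform draw inside each of the n_points equal-width strata
        columns.append([(s + rng.random()) / n_points for s in strata])
    return [list(row) for row in zip(*columns)]

# Thirty-six points, as for the initial sampling; three dimensions for illustration
design = latin_hypercube(36, 3)
```

Each dimension is divided into 36 equal intervals and every interval is sampled exactly once, which is the property that distinguishes LHS from plain random sampling.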

As shown in Fig. 7, the predictions and experiments of the responses are qualitatively compared. In order to quantitatively compare the performance of different prediction models, MSE, RMSE, MAE, and R² are applied to evaluate the performance of the prediction models, which are calculated and displayed in Tables 4, 5, and 6.

Table 4 Comparison of different prediction models (warpage)


Table 5 Comparison of different prediction models (minimum weldline temperature)


Table 6 Comparison of different prediction models (clamping force)


The smaller the MAE and RMSE, the higher the prediction accuracy of the model. The closer R² ∈ [0, 1] is to 1, the higher the accuracy of the model. As shown in Tables 4, 5, and 6, the proposed model has the highest prediction accuracy for the warpage, minimum weldline temperature, and clamping force, compared with RSM, SVR, GPR, and Kriging.

4.2.3 Multi-objective optimization results

NSGA-III algorithm is implemented to locate the Pareto-optimal solutions for the four-objective optimization problem. The parameters of this algorithm are set as follows:

$$ \left.\begin{array}{lll}\mathrm{Population}\ \mathrm{size}=220& \mathrm{Generation}=600& \mathrm{Reference}\ \mathrm{point}\ \mathrm{division}=9\\ {}\mathrm{Crossover}\ \mathrm{probability}=1.0& \mathrm{Distribution}\ \mathrm{index}\ \mathrm{of}\ \mathrm{crossover}=30&\ \\ {}\mathrm{Mutation}\ \mathrm{probability}=0.2& \mathrm{Distribution}\ \mathrm{index}\ \mathrm{of}\ \mathrm{mutation}=20&\ \end{array}\right\} $$
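The text does not state how the reference points are generated. Assuming the Das-Dennis simplex-lattice scheme that is commonly paired with NSGA-III, four objectives with nine divisions give C(4 + 9 − 1, 9) = 220 reference points, which coincides with the population size above. The sketch below enumerates them; the function name is hypothetical.

```python
from itertools import combinations
from math import comb

def das_dennis(n_obj, divisions):
    """Points on the unit simplex whose coordinates are multiples of 1/divisions."""
    points = []
    # Stars and bars: place n_obj - 1 bars among divisions + n_obj - 1 slots
    for bars in combinations(range(divisions + n_obj - 1), n_obj - 1):
        prev, counts = -1, []
        for b in bars:
            counts.append(b - prev - 1)      # stars between consecutive bars
            prev = b
        counts.append(divisions + n_obj - 2 - prev)   # stars after the last bar
        points.append([c / divisions for c in counts])
    return points

reference_points = das_dennis(n_obj=4, divisions=9)
n_reference = len(reference_points)
```

Every generated point has non-negative coordinates that sum to one, so the set lies on the normalized reference hyperplane.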

The triple-objective Pareto frontiers among warpage, minimum weldline temperature, clamping force, and cycle time are shown in Fig. 8, and the pair-wise Pareto frontiers are shown in Fig. 9. The triple-objective Pareto frontiers in Fig. 8 show that no three of the four objectives can reach their optima at the same time, because the process parameters influence the objectives differently. Consequently, the four objectives cannot be optimal simultaneously and the ideal solution of the optimization problem cannot be located, which leads to a trade-off.

The pair-wise Pareto frontier shown in Fig. 9a indicates that there is no obvious trade-off between warpage and MinT_weld (minimum weldline temperature). Observing Fig. 6, warpage is strongly influenced by t_inj and P_p1, while MinT_weld is strongly influenced by T_melt.

The pair-wise Pareto frontier shown in Fig. 9b indicates a trade-off between warpage and clamping force. According to Fig. 6, the correlation coefficient between warpage and clamping force equals −0.27, which means that they are negatively correlated and change in opposite directions. As shown in Fig. 6, P_p1 has opposite effects on warpage and clamping force, with correlation coefficients of −0.5 and 0.92, respectively. This means that a smaller P_p1 leads to a smaller clamping force but a larger warpage, so the two objectives cannot reach their optima at the same time.

The pair-wise Pareto frontier shown in Fig. 9c indicates an obvious trade-off between warpage and cycle time. According to Fig. 6, the correlation coefficient between warpage and cycle time equals −0.48, which means that they are negatively correlated and change in opposite directions. As shown in Fig. 6, t_c and t_inj have opposite effects on warpage and cycle time. This means that shorter t_c and t_inj lead to a shorter cycle time but a larger warpage, so the two objectives cannot reach their optima at the same time.

The pair-wise Pareto frontier shown in Fig. 9d indicates a trade-off between MinT_weld and clamping force. According to Fig. 6, the correlation coefficient between MinT_weld and clamping force equals 0.14, which means that they are positively correlated and change in the same direction. Therefore, clamping force and the negative of the minimum weldline temperature (−MinT_weld) change in opposite directions. As shown in Fig. 6, T_melt, t_inj, and t_p1 affect MinT_weld and clamping force in the same direction: a smaller T_melt, a larger t_inj, and a smaller t_p1 lead to a smaller clamping force and a smaller MinT_weld. However, the melted plastic solidifies quickly at a low weldline temperature, which produces long weldlines, so a smaller MinT_weld generates longer weldlines. Hence, the weldlines and the clamping force cannot be minimized at the same time.

The pair-wise Pareto frontier shown in Fig. 9e indicates that there is no obvious trade-off between MinT_weld and cycle time. Observing Fig. 6, MinT_weld is strongly influenced by T_melt, while cycle time is strongly influenced by t_inj, t_p1, t_p2, and t_c.

The pair-wise Pareto frontier shown in Fig. 9f indicates that there is no obvious trade-off between clamping force and cycle time. Observing Fig. 6, clamping force is strongly influenced by P_p1, while cycle time is strongly influenced by t_inj, t_p1, t_p2, and t_c.

The Pareto-optimal solutions located by NSGA-III are listed in Table 7. The trade-off analysis is performed to determine the better and worse solutions for decision-making based on the spider-web chart. The ideal and nadir values of the objectives in Eq. (3) are set as follows: $ \left[{f}_1^I,{f}_2^I,{f}_3^I,{f}_4^I\right]=\left[0,-255,5400,75\right];\left[{f}_1^N,{f}_2^N,{f}_3^N,{f}_4^N\right]=\left[25,-225,7000,115\right] $. The areas of the Pareto-optimal solutions in the spider-web chart are calculated and displayed in Table 8, where no preference is given to any objective. The smaller the area of an alternative solution, the better the solution.
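One way such a spider-web area could be computed is sketched below. This is an assumption-laden illustration, not the formula of Eq. (3): each objective is assumed to be normalized as (f − f^I)/(f^N − f^I), the four axes are assumed to be equally spaced, and the area is taken as that of the polygon joining neighboring axes. The objective order is presumed to be warpage, −MinT_weld, clamping force, and cycle time, and the sample solution is hypothetical.

```python
import math

# Ideal and nadir values quoted in the text
IDEAL = [0.0, -255.0, 5400.0, 75.0]
NADIR = [25.0, -225.0, 7000.0, 115.0]

def spider_area(objectives, ideal=IDEAL, nadir=NADIR):
    """Polygon area of a solution on equally spaced radar axes (assumed form)."""
    # Assumed normalization: 0 at the ideal value, 1 at the nadir value
    r = [(f - lo) / (hi - lo) for f, lo, hi in zip(objectives, ideal, nadir)]
    k = len(r)
    # Sum of the triangle areas spanned by neighboring axes
    return 0.5 * math.sin(2.0 * math.pi / k) * sum(
        r[i] * r[(i + 1) % k] for i in range(k)
    )

# Hypothetical Pareto-optimal solution
area = spider_area([10.0, -240.0, 6200.0, 95.0])
```

Under these assumptions the ideal point has zero area and the nadir point has the largest area, which agrees with the rule that a smaller area indicates a better solution. Note that a polygon area of this kind depends on the order in which the objectives are placed around the chart.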

Table 7 Pareto-optimal solutions located by NSGA-III


Table 8 Areas of Pareto-optimal solutions in spider-web chart


The spider-web chart is shown in Fig. 10, in which the red, blue, and black lines represent the better solution, worse solution, and alternative solutions, respectively. As shown in Fig. 10, the area of the better solution is smaller than that of the worse solution. In addition, the warpage and the weldline temperature distributions at the better and worse solutions are compared in Figs. 11 and 12, respectively. Figure 11 shows that warpage is considerably reduced at the better solution compared with the worse solution, and Fig. 12 shows that the weldlines are likewise reduced. From the perspective of the process parameters, this indicates that a larger packing pressure and a longer injection time can reduce warpage.

To verify the effect of the valve hot runner system on weldline reduction, the weldlines under the conventional hot runner system and under the valve hot runner system at the better solution are compared in Fig. 13. When the valve hot runner system is used, the weldlines are generated in inconspicuous positions such as grilles, lamp holes, and product edges and are effectively eliminated, compared with the conventional hot runner system. The surface quality of the automobile front bumper is higher when the valve hot runner system is applied.

According to the result of the trade-off analysis, the optimized setting of the process parameters is shown in Table 9 for the product quality and productivity improvement and the cost-saving. The warpage and the weldlines of the automobile front bumper after the optimization are compared with those before the optimization, as shown in Fig. 14. Moreover, the setting of the process parameters is determined by the recommendation of Moldflow and the result of the “molding window” analysis.

Table 9 The optimized setting of the process parameters based on the trade-off analysis


The verification of the prediction accuracy for the four objectives at the optimized solution is summarized in Table 10, which shows that the relative absolute errors for all objectives at the optimized solution are below 2.5%. Together with the result of the accuracy analysis in Section 4.2.2, this confirms the prediction accuracy.

Table 10 Verification of the optimized results


The proposed multi-objective optimization method analyzes the correlation between the process parameters and the objectives and conducts the trade-off analysis to balance the multiple conflicting objectives. It can effectively locate the optimized process parameters setting to achieve the product quality and productivity improvement and the cost-saving, which can provide a theoretical basis and reference for the actual injection molding process.

5 Conclusion

In this paper, the multi-objective optimization of process parameters in PIM for minimizing the warpage, weldlines, clamping force, and cycle time is performed to realize high product quality, high productivity, and low energy consumption. With a low weldline temperature, the melted plastic solidifies quickly and long weldlines are generated. To shorten the weldlines, the minimum weldline temperature is therefore maximized. To this end, we propose a differential sensitivity fusion method (DSFM). The conclusions are as follows:

(1) The variable packing pressure profile, melt temperature, mold temperature, injection time, and cooling time are taken as design variables and optimized. The genetic algorithm NSGA-III is applied to locate the Pareto-optimal solutions. The Pareto frontier shows that the four selected objectives cannot simultaneously reach their optima, which leads to a trade-off. The spider-web chart is used to perform the trade-off analysis among the four objectives, and the better and worse solutions are identified for decision-making.
(2) The metamodeling method, GERSM coupled with the MLSM, is used to construct the response predictors, which fit the mathematical relationship between the design variables and the responses and are taken as the fitness functions in the multi-objective optimization process. This model simultaneously uses the response and sensitivity information of the sampling points to improve the accuracy of the response predictors. To capture the sensitivity information, ASGM is proposed to calculate the gradient vector for each design variable of the sampling point. The results of the accuracy analysis for the response predictors show that the proposed model has the highest prediction accuracy for the warpage, minimum weldline temperature, and clamping force, compared with RSM, SVR, GPR, and Kriging.
(3) The automobile bumper is taken as the case study, where the valve hot runner system is used. The numerical simulation results show that the weldlines are generated in inconspicuous positions such as grilles, lamp holes, and product edges and are effectively eliminated, compared with the conventional hot runner system. The valve hot runner system can effectively improve the product quality.

References

Ozcelik B, Sonat I (2009) Warpage and structural analysis of thin shell plastic in the plastic injection molding. Mater Des 30(2):367–375
Oktem H, Erzurumlu T, Uzman I (2007) Application of Taguchi optimization technique in determining plastic injection molding process parameters for a thin-shell part. Mater Des 28(4):1271–1278
Tang SH, Tan YJ, Sapuan SM, Sulaiman S, Ismail N, Samin R (2007) The use of Taguchi method in the design of plastic injection mould for reducing warpage. J Mater Process Technol 182(1-3):418–426
Kurt M, Kamber OS, Kaynak Y, Atakok G, Girit O (2009) Experimental investigation of plastic injection molding: assessment of the effects of cavity pressure and mold temperature on the quality of the final products. Mater Des 30(8):3217–3224
Masato D, Rathore J, Sorgato M, Carmignato S, Lucchetta G (2017) Analysis of the shrinkage of injection-molded fiber-reinforced thin-wall parts. Mater Des 132:496–504
Wang Z, Zhang S, Qiu L, Liu X, Li H (2019) A low-carbon design method integrating structure design and injection process design for injection molding machines. Math Probl Eng 2019(11):1–19
Li C, Wang F, Chang Y, Liu Y (2010) A modified global optimization method based on surrogate model and its application in packing profile optimization of injection molding process. Int J Adv Manuf Technol 48(5-8):505–511
Gao YH, Wang XC (2009) Surrogate-based process optimization for reducing warpage in injection molding. J Mater Process Technol 209(3):1302–1309
Kitayama S, Yokoyama M, Takano M, Aiba S (2017) Multi-objective optimization of variable packing pressure profile and process parameters in plastic injection molding for minimizing warpage and cycle time. Int J Adv Manuf Technol 92(9-12):3991–3999
Hashimoto S, Kitayama S, Takano M, Kubo Y, Aiba S (2020) Simultaneous optimization of variable injection velocity profile and process parameters in plastic injection molding for minimizing weldline and cycle time. J Adv Mech Des Syst Manuf 14(3):JAMDSM0029
Ozcelik B, Erzurumlu T (2005) Determination of effecting dimensional parameters on warpage of thin shell plastic parts using integrated response surface method and genetic algorithm. Int Commun Heat Mass Transfer 32(8):1085–1094
Kurtaran H, Erzurumlu T (2006) Efficient warpage optimization of thin shell plastic parts using response surface methodology and genetic algorithm. Int J Adv Manuf Technol 27(5-6):468–472
Gao Y, Wang X (2008) An effective warpage optimization method in injection molding based on the Kriging model. Int J Adv Manuf Technol 37(9-10):953–960
Xia W, Luo B, Liao XP (2011) An enhanced optimization approach based on Gaussian process surrogate model for process control in injection molding. Int J Adv Manuf Technol 56(9-12):929–942
Article Google Scholar
Chen W, Kurniawan D (2014) Process parameters optimization for multiple quality characteristics in plastic injection molding using Taguchi method, BPNN, GA, and Hybrid PSO-GA. Int J Precis Eng Manuf 15(8):1583–1593
Article Google Scholar
Zhao J, Cheng G, Ruan S, Li Z (2015) Multi-objective optimization design of injection molding process parameters based on the improved efficient global optimization algorithm and non-dominated sorting-based genetic algorithm. Int J Adv Manuf Technol 78(9-12):1813–1826
Article Google Scholar
Zhao J, Cheng G (2016) An innovative surrogate-based searching method for reducing warpage and cycle time in injection molding. Adv Polym Technol 35(3):288–297
Article Google Scholar
Cheng J, Liu Z, Tan J (2013) Multiobjective optimization of injection molding parameters based on soft computing and variable complexity method. Int J Adv Manuf Technol 66(5-8):907–916
Article Google Scholar
Liu J, Chen X, Lin Z, Diao S (2017) Multiobjective optimization of injection molding process parameters for the precision manufacturing of plastic optical lens. Math Probl Eng 2017:1–13
Google Scholar
Xu G, Yang Z, Long G (2012) Multi-objective optimization of MIMO plastic injection molding process conditions based on particle swarm optimization. Int J Adv Manuf Technol 58(5-8):521–531
Article Google Scholar
Xu G, Yang Z (2015) Multiobjective optimization of process parameters for plastic injection molding via soft computing and grey correction analysis. Int J Adv Manuf Technol 78(1-4):525–536
Article Google Scholar
Dang XP (2014) General frameworks for optimization of plastic injection molding process parameters. Simul Model Pract Theory 41:15–27
Article Google Scholar
Shi H, Xie S, Wang X (2013) A warpage optimization method for injection molding using artificial neural network with parametric sampling evaluation strategy. Int J Adv Manuf Technol 65(1-4):343–353
Article Google Scholar
Deng YM, Zhang Y, Lam YC (2010) A hybrid of mode-pursuing sampling method and genetic algorithm for minimization of injection molding warpage. Mater Des 31(4):2118–2123
Article Google Scholar
Dimla DE, Camilotto M, Miani F (2005) Design and optimisation of conformal cooling channels in injection moulding tools. J Mater Process Technol 164:1294–1300
Article Google Scholar
Au KM, Yu KM (2007) A scaffolding architecture for conformal cooling design in rapid plastic injection moulding. Int J Adv Manuf Technol 34(5-6):496–515
Article Google Scholar
Wang Y, Yu KM, Wang CC (2015) Spiral and conformal cooling in plastic injection molding. Comput Aided Des 63:1–11
Article Google Scholar
Kitayama S, Yamazaki Y, Takano M, Aiba S (2018) Numerical and experimental investigation of process parameters optimization in plastic injection molding using multi-criteria decision making. Simul Model Pract Theory 85:95–105
Article Google Scholar
Kitayama S, Tamada K, Takano M, Aiba S (2018) Numerical and experimental investigation on process parameters optimization in plastic injection molding for weldlines reduction and clamping force minimization. Int J Adv Manuf Technol 97(5-8):2087–2098
Article Google Scholar
Kitayama S, Tamada K, Takano M, Aiba S (2018) Numerical optimization of process parameters in plastic injection molding for minimizing weldlines and clamping force using conformal cooling channel. J Manuf Process 32:782–790
Article Google Scholar
Miettinen K (2014) Survey of methods to visualize alternatives in multiple criteria decision making problems. OR Spectr 36(1):3–37
Article MathSciNet Google Scholar
Kim C, Wang S, Choi KK (2005) Efficient response surface modeling by using moving least-squares method and sensitivity. AIAA J 43(11):2404–2411
Article Google Scholar
Lancaster P, Salkauskas K (1981) Surfaces generated by moving least squares methods. Math Comput 37(155):141–158
Article MathSciNet Google Scholar
Deb K, Jain H (2013) An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, Part I: solving problems with box constraints. IEEE Trans Evol Comput 18(4):577–601
Article Google Scholar
Jain H, Deb K (2013) An evolutionary many-objective optimization algorithm using reference-point based nondominated sorting approach, Part II: handling constraints and extending to an adaptive approach. IEEE Trans Evol Comput 18(4):602–622
Article Google Scholar
Autodesk Moldflow Insight User’s Guide, 2018. [Online]. Available: http:// https://knowledge.autodesk.com/support/moldflow-insight/learn-explore/caas/CloudHelp/cloudhelp/2018/ENU/MoldflowInsight/files/GUID-66B3B0E8-DB05-4DC5-8E8F-CCA29A11A7ED-htm.html. [Accessed 28 March 2017]

Download references

Funding

This work has been funded by the National Natural Science Foundation of China (51905476).

Author information

Authors and Affiliations

State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, Hangzhou, 310027, People’s Republic of China
Huifang Zhou, Shuyou Zhang & Zili Wang

Contributions

Huifang Zhou: conceptualization, writing (original draft), methodology, formal analysis, investigation, and project administration. Shuyou Zhang: investigation, resources, and validation. Zili Wang: funding acquisition, project administration, methodology, formal analysis, supervision, visualization, and writing (review and editing).

Corresponding author

Correspondence to Zili Wang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors consent to the publication of this article.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

The sampling points used to construct and train the response predictor are listed in Table 11, and their objective responses are listed in Table 12. The sampling points used for validation are listed in Table 13, and their objective responses are listed in Table 14.

Table 11 Sampling points for training generated by LHS

Table 12 Numerical simulation results of responses for sampling points in Table 11

Table 13 Sampling points for validation generated by LHS

Table 14 Numerical simulation results of responses for sampling points in Table 13
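The captions of Tables 11 and 13 state that the training and validation points were generated by LHS (Latin hypercube sampling). The table contents are not reproduced here, so the following is only a minimal sketch of how such point sets could be drawn. The function name `latin_hypercube`, the two parameter ranges, the sample counts, and the seeds are all illustrative assumptions and are not taken from the paper.

```python
import random


def latin_hypercube(n_samples, bounds, seed=None):
    """Draw n_samples points by Latin hypercube sampling.

    Each variable's range is split into n_samples equal strata, and
    every stratum is used exactly once per variable.
    """
    rng = random.Random(seed)
    columns = []
    for low, high in bounds:
        # One uniformly drawn value inside each of the n_samples strata.
        unit = [(i + rng.random()) / n_samples for i in range(n_samples)]
        # Shuffle so that strata are paired at random across variables.
        rng.shuffle(unit)
        # Scale from the unit interval to the variable's own range.
        columns.append([low + u * (high - low) for u in unit])
    # Transpose the per-variable columns into a list of sample points.
    return [list(point) for point in zip(*columns)]


# Hypothetical ranges for two process parameters (assumed values).
bounds = [(200.0, 260.0), (30.0, 70.0)]

# Separate draws for the training set and the validation set.
training = latin_hypercube(20, bounds, seed=1)
validation = latin_hypercube(5, bounds, seed=2)
```

Drawing the validation set independently of the training set, as sketched here, means the predictor is checked at points it was not fitted to. Whether the authors used separate seeds or a different LHS variant is not stated in this section.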

About this article

Cite this article

Zhou, H., Zhang, S. & Wang, Z. Multi-objective optimization of process parameters in plastic injection molding using a differential sensitivity fusion method. Int J Adv Manuf Technol 114, 423–449 (2021). https://doi.org/10.1007/s00170-021-06762-8

Received: 16 October 2020
Accepted: 02 February 2021
Published: 16 March 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s00170-021-06762-8
