1 Introduction

Injection molding is a widely used process to shape thermoplastic materials into molded products of intricate shapes. It is particularly suitable for mass production due to its short cycle time and high tooling cost. The qualities of injection moldings are affected by many factors such as plastic part design, mold design, materials, and molding parameters. One of common quality issues for injection moldings is part shrinkage. Part shrinkage is natural due to thermal contraction, as the plastic material changes from its high temperature molten state to low temperature solid state, to form the desired shape in the mold cavity. A packing phase is thus required to force more plastic melt into the mold cavity to compensate for material shrinkage during mold cooling. Large non-uniform shrinkages, due to poor part or tooling design and improper molding conditions, often create internal or residual stresses that could lead to part warpage. Therefore, it is highly desirable to minimize the shrinkage of injection moldings by optimizing processing conditions, and to predict the shrinkage under different molding parameters. This will help reduce the setup time with proper molding parameters and improve the quality of injection moldings.

Shrinkage prediction for plastics injection molding has been studied extensively. Han et al. [1] developed a simulation program based on crystallization kinetics to predict the shrinkage of slowly-crystallizing thermoplastic polymers in injection molding. The material properties such as viscosity, thermal conductivity, heat capacity, PVT (pressure–volume-temperature) relation and crystallization kinetics were considered in their study. Jansen et al. [2] studied the effect of processing conditions on shrinkage for seven thermoplastic polymers, and concluded that a simple thermoelastic model used in the study could predict amorphous polymers, but it over-predicted for semi-crystalline polymers. Choi et al. [3] conducted a numerical analysis of shrinkage for amorphous polymers in consideration of the residual stresses produced during the packing and cooling stages of injection molding. These shrinkage predictions with conservation equations (mass, momentum and energy) are associated with assumptions and simplifications to obtain the solutions. Some commercial simulation programs such as Moldflow were used for predicting the shrinkage. Lucyshyn et al. [4] used a differential scanning calorimeter to identify the transition temperatures at different cooling rates, and applied the transition temperatures in calculating the shrinkage with Moldflow. Pomerleau et al. [5] investigated the injection molding shrinkage of polypropylene with a factorial design of experiment, and examined the effects of holding pressure and injection velocity on shrinkages.

Artificial neural network (ANN) is a modeling technique that is capable of dealing with highly nonlinear problems with multi-input and multi-output systems. It requires no assumption of the problem and has a strong adaptive learning capability. ANN handles information in a way similar to the operation mechanism of neurons within the human brain. It has been applied to various applications such as design optimization, pattern recognition, etc. In the literature, there are a few studies that applied ANN in injection molding to predict part shrinkage. Shen et al. [6] used the combination of ANN and genetic algorithm (GA) to optimize the injection molding process for minimizing the volumetric shrinkage variation. Lee et al. [7] developed a neural network which was trained by the data from a numerical flow analysis, to predict shrinkage of the injection molded polypropylene parts. Liao et al. [8] demonstrated the successful use of back-propagation ANN in predicting the shrinkage and warpage of injection-molded thin-wall parts, and amorphous plastics polycarbonate and acrylonitrile butadiene styrene were used in their study. Wang et al. [9] evaluated the effect of injection molding process parameters on shrinkage based on neural network simulation, however, the data input was from Moldflow simulation results and no real experiments were conducted. Altan et al. [10] reduced shrinkage in injection moldings via the Taguchi, ANOVA and neural network methods for PP and polystyrene (PS). The same network was used to predict both PP and PS. Tsai et al. [11] combined the ANN with a genetic algorithm (GA) to establish an inverse model of injection molding for optical lens form accuracy.

High density polyethylene (HDPE) is one of the most widely used thermoplastics in injection molding, because it offers many benefits such as low processing temperature, low cost, good stiffness and toughness, compared to other thermoplastics. As a semi-crystalline polymer, HDPE typically has a higher shrinkage than amorphous plastics, due to its higher crystallinity. The Taguchi approach has been used to optimize injection molding conditions to achieve desirable product qualities [10,11,12,13]. Compared with a factorial design, the Taguchi approach is more efficient with a fewer resources required.

To our knowledge, few studies have been conducted to investigate the shrinkage prediction accuracy of injection molded HDPE parts by using the combination of ANN and the Taguchi approach. In this study, back propagation ANN is used as data-driven modeling technique in predicting the minimum shrinkages along the flow direction (i.e., the length) and the cross-flow direction (i.e., the width), under the injection molding conditions optimized by the Taguchi approach. Three important molding parameters studied include injection speed, holding time, and cooling time. A Taguchi orthogonal array (L27) was designed for conducting experiments, to identify the optimal molding conditions for minimum shrinkages. Experimental data were used to train, test, and validate the ANN model. The main objective is to predict the minimal shrinkage under the optimized molding parameters. The second objective is to facilitate the molding setup with proper molding parameters and reduce the need for trial and error.

2 Model description

2.1 ANN

Figure 1 shows the typical structure of ANN. It consists of an input layer, hidden layer(s), and an output layer. Each layer has several elements which are called nodes or neurons. The links between elements carry values (or weights), and they usually determine the network function. ANN is trained by adjusting the link weights to perform a particular function for reaching the target outputs. Many input/target data pairs are required to train a network [14]. In this study, we have three inputs (injection speed, holding time, and cooling time) and two outputs (length and width shrinkages).

Fig. 1
figure 1

Typical structure of an ANN

2.2 Multi-layer ANN and backpropagation training

For the feedforward model adopted in this study, there are three neurons in its input layer, which are the dimensionless injection speed, cooling time and holding time. The output layer has two neurons, which are the length shrinkage and width shrinkage. Experimental data are randomly separated into a training set, a testing set, and a validation set. The training set is used to train the backpropagation ANN to identify the relationship between the inputs and the outputs. The testing set is used to test the prediction performance of the ANN. There were 27 experimental data points used to train the ANN. The number of hidden layers, the transfer function of hidden layer, the learning rate, the minimum gradient and momentum are considered in this study. The training process for the ANN is stopped when one of two following criteria is met: (1) the sum squared error (SSE) should be smaller than 0.0001, and (2) the iteration epoch reaches 10,000 times. The training quality is determined by the root mean squared (RMS) error. The definitions of SSE and root mean squared error are [8]:

$$ \begin{aligned} & SSE = \mathop \sum \limits_{j = 1}^{m} \mathop \sum \limits_{i = 1}^{n} \left( {Tij - Pij} \right)\, ^{\wedge} 2 \\ & RMS_{q} = \sqrt {\frac{SSEq}{m \times n}} \\ \end{aligned} $$
(1)

where Tij is the target value, and Pij is the predicted value from the model.

2.3 Steps for the backpropagation ANN design process

The multilayer feedforward neural network is widely used for function fitting, pattern recognition, and prediction problems. In this study, the work flow for designing the feedforward ANN involves the following six primary steps:

  1. 1.

    Performing injection molding experiments and collecting the data set of inputs and outputs

  2. 2.

    Providing these input and output data pairs to the software (i.e., MATLAB-Mathworks)

  3. 3.

    Tagging the data as inputs, outputs, training, and testing

  4. 4.

    Creating a custom network of the required structure (check for the network with the least RMSE value)

  5. 5.

    Training and testing the network, checking for the least RMSE value. The data tagged as “train” will be used for training;

  6. 6.

    Validating the network with the validating data set for post-training analysis

  7. 7.

    Using the network for predicting the minimum shrinkages under the optimal molding condition.

3 Experimental

3.1 Molding machine and experimental material

The molding machine used is Engel E-victory 30 with a clamping force of 30 ton, and it has the screw diameter of 22 mm and the L/D ratio of 30. HDPE (Proline 2053), supplied by Shannon Industrial Corporation, has a density of 0.953 g/cm3 and a melt flow index of 18 g/10 min.

3.2 Mold design

A two-part family mold was designed as shown in Fig. 2. The fan gate was selected to achieve a uniform material flow, minimize backfilling and part warpage, and keep the cross sectional area constant. The steel insert mold base, from DME Company (Model 08/09 U Style Frame), was cut with a CNC machining center. The original length and width of the mold cavity are 116.56 mm and 19.25 mm, respectively, measured with a coordinate measuring machine (CMM, Mitutoyo). The resolution of CMM is 0.0005 mm. All the injection molded samples were measured with the CMM.

Fig. 2
figure 2

The mold used in the study

The length and width shrinkages were calculated as follows.

$$ \begin{aligned} & {\text{S}}_{\text{L}} = \left( {{\text{L}}_{\text{m}} {-}{\text{L}}_{\text{p}} } \right)/{\text{L}}_{\text{m}} \times 100\% \\ & {\text{S}}_{\text{W}} = \left( {{\text{W}}_{\text{m}} {-}{\text{W}}_{\text{p}} } \right)/{\text{W}}_{\text{m}} \times 100\% \\ \end{aligned} $$
(2)

where SL = length shrinkage, Lm = mold length, Lp = part length, Sw = width shrinkage, Wm = mold width, Wp = part width

3.3 Baseline study

The baseline study was conducted with the injection speed 10 cm3/s, the injection pressure of 1000 bar, the cooling time of 24 s, and the holding time of 14 s. The obtained average length shrinkage and width shrinkage from the baseline study were 2.64% and 3.56%, respectively.

4 Results and discussions

4.1 Taguchi experimental design

The Taguchi method was applied to identify the effects of the molding parameters on the flow direction shrinkage (i.e., length shrinkage) and the cross flow shrinkage (i.e., width shrinkage). The optimum molding condition for the minimum shrinkages can be obtained through the Taguchi experiments. Similar to the ANN, the three significant controllable molding parameters with three levels were selected, as shown in Table 1. The output variables are the length and width shrinkage. The environment temperatures, the room temperature without ventilation (70 °F) and the room temperature with ventilation (65 °F), were selected as the noise factor. The measured shrinkage values and the signal-to-noise (S/N) results are given in Table 1. Each row in the table represents an experiment with different combination of molding parameters and their levels. However, these experiments were conducted randomly. The S/N ratio is a quality indicator that is used to evaluate the effects of the molding parameters on the shrinkages of the injection moldings. In this study, ‘‘the smaller the better” criterion was selected when calculating the S/N ratios, as minimizing the shrinkages is the goal.

Table 1 Taguchi experimental design and results for length and width shrinkage (%)
$$ {{\eta}} = \frac{{\text{S}}}{{\text{N}}} = - 10{\text{Log}}\left( {{\raise0.7ex\hbox{${\sum\limits_{{i = 1}}^{n} {Yi^{2} } }$} \!\mathord{\left/ {\vphantom {{\sum\limits_{{i = 1}}^{n} {Yi^{2} } } n}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$n$}}} \right) $$
(3)

where η is the S/N ratio, Yi is the individual shrinkage measurement, and n is the number of measurements for each run.

4.2 Analysis of noise factor and S/N ratio for the length and width shrinkages

Table 2 is the response table for the mean length and width shrinkages, as well as their associated S/N ratios. The table is used to determine the optimal combination of the controllable factors that leads to the minimum shrinkage. Figure 3 shows the graphical representation of the effects of molding parameters on the shrinkage and S/N ratio. Since the shrinkage is the-smaller-the-better, the optimal setting combination for the minimum length shrinkage is A2-B1-C3, which is interpreted as the 2nd level of injection speed, the first level of holding time, and the third level of cooling time. For the S/N ratio, we are looking for the largest value. Therefore, the optimal setting combination for the S/N ratio of the length shrinkage is A2-B3-C3. Similarly, the optimal setting combination for the minimum width shrinkage is A1-B2-C1, and the optimal setting combination for S/N ratio of the width shrinkage is A3-B3-C2.

Table 2 Response table for the mean length and width shrinkages and S/N ratios
Fig. 3
figure 3

The main effect plots for a the length shrinkage; b the width shrinkage

4.3 Hypothesis testing for the length and width shrinkage

Table 3 shows the t-tests with 99% confidence level were conducted to determine if the environment temperature had a significant effect on the length and width shrinkages.

Table 3 T-test for the environment temperature effect on the length and width shrinkages

Based on the results, we fail to reject the null hypothesis, as the T value obtained is less than the T- critical value. Therefore, the length shrinkage and the width shrinkage are not significantly affected by the environment temperature.

4.4 Training and testing ANN

The training function (TRAINGDX), the adaptive learning function (LEARNGD), the performance function (MSE), and the transfer function (LOGSIG), are used in the back propagation ANN. They can be found in neural network toolbox [14]. In this study, 135 experimental data sets (see Appendix A) were used for training and testing the ANN. The inputs of the ANN include injection speed, holding time and cooling time, and the outputs are the width shrinkage and the length shrinkage.

4.5 ANN network architecture

An optimum number of hidden layer neurons needs to be designed to accurately predict the outputs via the ANN. The best approach in seeking the optimum number of neurons is to first take a small number of neurons and then slightly increase it until a considerable improvement is observed. At the starting point of the process, 9 neurons was selected for the hidden layer. Then, the number of neurons increased every step. For the network structure, the selection of hidden layers is based on trial and error, and this study used one hidden layer. Finally, it was found that the 3-9-2-2 network structure has the least RMS error among all selected network structures. Thus, the 3-9-2-2 network structure, as shown in Fig. 4, was used in this study.

Fig. 4
figure 4

The ANN network architecture

To train a multilayer network, the experimental data used is usually divided into three subsets. The first subset is the training set used for computing the gradient, the network weights, and the biases. The second subset is the validation set. The network weights and biases are saved at the minimum validation error. The third subset is the testing set used to compare different models. In this study, a total experimental set of 135 data samples, were used for the training, testing, and validation phases of the network models. The data subsets of training, testing, and validation were achieved by dividing roughly the available data set into 80, 15, and 5%, respectively.

4.6 Analyze neural network performance after training

Figure 5 shows the regression plots. It displays the network outputs (shrinkage) with respect to targets for training, validation, and test sets. If the R-value is equal to 1, that means the output is equal to the target. A larger R-value indicates a smaller difference between the output and the target. For this study, all the R-values are larger than 0.91, which means the fit is realistically good for all data sets. The minimal shrinkage was achieved with the highest R-value based on the training, validation, and testing errors. The analysis shows that the designed network (3-9-2-2) for the shrinkage parameter is sufficiently accurate. If more accurate results are required, the network can be retrained, or increased the number of hidden neurons, or applied different training functions, or improved with additional training data.

Fig. 5
figure 5

R-value for testing, training, and validation

The validation set is used for tuning the model parameters. To make sure the network is not overfit, the validation set is used to check if the error is within some range.When the error in the measurement begins to increase, the training process must be stopped by the independent validation set. Figure 6 indicates the iteration at which the best validation performance reached a minimum at epoch 37. Figure 6 does not indicate any major problems with the training.

Fig. 6
figure 6

Schematic representation of validation performance

4.7 Comparison of prediction performance of ANN and the Taguchi methodology

Table 4 show the prediction performance of the ANN and the Taguchi experimental data for the length and width shrinkage under the optimal molding conditions. It indicates that the ANN model can predict the shrinkages that are very close to the Taguchi experimental data. Figure 7 shows the comparison of the experimental data and the ANN predicted data for randomly selected 27 test pairs. It demonstrates that the ANN model has a high prediction accuracy.

Table 4 Length and width shrinkage comparison: experimental versus ANN prediction
Fig. 7
figure 7

Width and length shrinkages: experimental versus predicted values

5 Conclusion

In this study, the Taguchi approach and the ANN model were utilized to investigate the effects of injection speed, holding time, and cooling time on the length shrinkage and the width shrinkage of HDPE injection moldings. The experimental results indicate that all these selected factors have significant effects on the shrinkages. The best combination of molding parameters for the minimum shrinkages was identified through the Taguchi experiments. Compared with the shrinkages obtained from the baseline experiments, the length shrinkage and the width shrinkage under the optimal molding conditions were reduced by 5.06% and 20.4%, respectively. A multi-layer feedforward ANN was designed, backpropagation trained, tested and validated with the experimental results. It turned out that the ANN model has a similar prediction performance under the optimal molding conditions for minimal shrinkages, compared with that of the Taguchi approach. In addition, the comparison of the experimental data and the ANN predicted data for randomly selected 27 test pairs under different molding conditions indicates that the ANN model has a high prediction accuracy. The combined use of the Taguchi approach and the ANN model, demonstrated in this study, would provide an efficient and effective way for injection molders to predict and minimize the shrinkages along the flow direction and the cross-flow direction under the optimal molding conditions, and to facilitate the molding setup and improve the quality of injection moldings.