1 Introduction

As with most machining processes, such as end milling, the multiple quality characteristics measured are highly correlated and with different optimization objectives. The relationship between roughness and material removal rate responses can be taken as one example; while roughness has to be minimized, removal rate has to be maximized. In these cases, the task is to find a vector of decision variables that satisfies, at the same time, more than one of the objective functions and the constraints, and provides an acceptable value for each response [1].

However, choosing parameters that provide global optimization for more than one characteristic of interest is not an easy task [2], once it requires complete understanding of process mechanism.

According to Moshat et al. [3] and Montgomery [4], when wanting to obtain the simultaneous variation of factors to build forecasting models for all relevant outputs, generally employ design of experiments (DOE), such as Taguchi arrays or Response surface methodology (RSM) or, regression techniques. However, as pointed out by many researchers [5, 6], these techniques can be greatly influenced by the correlations, causing model instability, over fitting, prediction errors, and inaccuracy on the regression coefficients. Another aspect is that the individual analysis of each response may lead to a conflicting optimum, since the factor levels that improve one response can, otherwise, degrade another [6]. In this case, the regression equations or the use of optimization methods that do not consider correlation are not adequate to represent an objective function without considering the variance–covariance structure among the multiple responses [1, 2, 58].

The optimization literature frequently reveals correlated responses with conflicting objectives in several end milling studies. Some of these works have been summarized in Table 1. In all these works, a strong or moderated correlation with a significant statistic among the responses was observed (this analysis was carried out by us, using the coefficient of Pearson). Although the results have been coherent, the correlation structure was not considered in these works. Just three works have assumed that correlated responses could deviate the optimization results and produce unreal solutions [1, 8, 9]. To resolve this drawback, these authors have presented a hybrid approach considering a combination of methods as PCA and RSM with multivariate mean square error (MMSE) [1], PCA and RSM [8], and PCA and grey relational analysis (GRA) [9], respectively. Moshat et al. [3] have also assumed that correlated responses could deviate the optimization results.

Table 1 Background studies of end milling process

In addressing the correlation influence in multiobjective optimization problems, some researchers have employed the principal component analysis (PCA) as an alternative approach [1, 58, 1019]. This multivariate technique summarizes, in a few uncorrelated components, common patterns of variation among response variables. While effective, this approach presents a drawback, when the first principal component score is not enough to explain most of the correlation (or variance–covariance) structure [2, 5, 6]. To overcome these shortcomings, some researchers proposed methods based on weighted principal component analysis (WPCA) [2, 2025]. This strategy aggregates all the principal component scores weighted by their respective eigenvalues, fully explaining variations in all responses.

However, when multiple correlated characteristics also have conflicting objectives, maximizing or minimizing the weighted principal components can favor some variables and harm others. In this case, WPCA method—though capable of eliminating the correlation’s effect—is unable to maximize and minimize the wanted parameters, simultaneously. Thus, the WPCA method can conduct the results to inadequate optimum solutions.

In this way, when the multiple responses present conflicting objectives and significant correlation, applying the Taguchi’s Signal-to-noise ratio (SNR) before principal components analysis will solve the problem [7, 25]. In recent times, the Taguchi’s signal-to-noise ratio combined with principal component analysis has been the subject of studies by many authors, such as [3, 7, 9, 1116, 2528]. This method has obtained noteworthy results, given that it reduces the data set, considers the correlation between the responses, avoids the production of inappropriate optimal points in the optimization problem, and considers the conflicting objectives between the multiple responses.

By analysing these works, it can be noted that only in Costa et al. [7], PCA was conducted combined with signal-to-noise ratio for RSM. In all the other publications, Taguchi design has been used to model the responses. Furthermore, only two works have combined WPCA with SNR [27, 29]. However, in both works, the Taguchi array was used. According to Paiva, Ferreira e Balestrassi [6], the main reason why PCA is more widely used with Taguchi than with response surface designs is related to the type of optimization that the multivariate objective functions must follow. The analysis in Taguchi designs is made employing the concept of loss function. Specifically, that means each kind of optimization (maximization, minimization, or normalization) can be represented by a proper signal-to-noise relation applied to the original responses. Due to the mathematical nature of this relation, the signal-to-noise ratio must always be maximized. In RSM, however, the approach is very different.

To that end, this paper proposes a hybrid multiobjective optimization approach able to standardize the optimization objectives of the original responses, reduce dimensions and, at the same time, eliminate the correlation between the multiple responses. This approach proposed is called WPCA-SNR method. A systematic procedure was developed for this very purpose. To model and optimize the problem, the WPCA-SNR combines Response Surface Methodology and Generalized Reduced Gradient (GRG) algorithm.

To demonstrate its applicability, the dry end milling process of the AISI 1045 steel was considered for a case study. Four input parameters (feed per tooth, cutting speed, axial, and radial depth of cut) and six response variables, average surface roughness, maximum surface roughness, root-mean-square roughness, ten-point height, maximum peak to valley, and material removal rate were considered.

Although the hybrid approach between Principal Components Analysis and Taguchi’s signal-to-noise ratio has already been extremely disseminated, this study differs from most approaches already proposed. First, in this study, more than two response variables, with conflicting objectives and significantly correlated, were considered in the optimization problem. Second, unlike most in the published literature studies, the mathematical models of Principal Components have been created to predict the results of different input parameters using Response Surface. Third, more than one principal component was considered. It can be noted that in the most cases, studied and published have had one eigenvalue larger than one; and thus, only the first principal component has been considered in the analyses. However, this does not occur in the majority of cases in today’s complex manufacturing processes we have. Finally, it was not found another work using a weighted approach of Principal Component combined with Taguchi’s signal-to-noise ratio for RSM in end milling process.

In the following sections, this paper compares the results obtained by the WPCA method and SNR-WPCA method applied on the dry end milling process.

2 Theoretical fundamentals

2.1 Weighted principal component analysis method

Principal component analysis is a multivariate analysis technique, which, in summary, is able to represent the original responses in a few number of uncorrelated latent variables, without significant loss of information [5, 6]. According to Paiva et al. [6] and Bertolini and Schiozer [28], the reduced data set consists of single or multiple components called Principal Components (PC) and are sorted from the highest variance to the lowest. Zhang et al. [30] also mentioned that, based on the variance–covariance matrix, the PCA method proceeds in such a way that the first principal component has the highest variance, and each succeeding component, in turn, has the highest possible variance within the constraint.

PCA uses the factorization of a variance–covariance Σ or correlation matrix R associated with the random vector \(Y^{T} = \left[ {Y_{1} ,Y_{2} , \ldots ,Y_{p} } \right]^{{}}\), to produce pairs of eigenvalues–eigenvectors \(\left( {\lambda_{i} ,e_{i} } \right), \ldots , \ge \left( {\lambda_{p} ,e_{p} } \right)\), where \(\lambda_{1} \ge \lambda_{2} \ge \cdots \ge \lambda_{p} \ge 0\) and an uncorrelated linear combination \({\text{PC}}_{i} = e_{i}^{T} Y = e_{1i} Y_{1} + e_{2i} Y_{2} + \cdots + e_{pi} Y_{p}\), \(i = 1,2, \ldots ,p\). The original data set may be then replaced by the uncorrelated linear combinations in the form of principal components score \(( {\text{PC}}_{\text{score}} )\). \({\text{PC}}_{\text{score}}\) can be written as \({\text{PC}}_{\text{score}} = \left[ {\mathbf{Z}} \right] \times \left[ {\mathbf{E}} \right]\) [2], where Z is the standardized data matrix and E is the eigenvectors matrix of the multivariate set [7]:

$${\text{PC}}_{k} = {\mathbf{Z}}^{{\mathbf{T}}} {\mathbf{E}} = \left[ {\begin{array}{*{20}c} {\left( {\frac{{x_{11} - \bar{x}_{1} }}{{\sqrt {s_{11} } }}} \right)} & {\left( {\frac{{x_{21} - \bar{x}_{2} }}{{\sqrt {s_{22} } }}} \right)} & \cdots & {\left( {\frac{{x_{p1} - \bar{x}_{p} }}{{\sqrt {s_{pp} } }}} \right)} \\ {\left( {\frac{{x_{12} - \bar{x}_{1} }}{{\sqrt {s_{11} } }}} \right)} & {\left( {\frac{{x_{22} - \bar{x}_{2} }}{{\sqrt {s_{22} } }}} \right)} & \cdots & {\left( {\frac{{x_{p2} - \bar{x}_{p} }}{{\sqrt {s_{pp} } }}} \right)} \\ \vdots & \vdots & \ddots & \vdots \\ {\left( {\frac{{x_{1n} - \bar{x}_{1} }}{{\sqrt {s_{11} } }}} \right)} & {\left( {\frac{{x_{2n} - \bar{x}_{2} }}{{\sqrt {s_{22} } }}} \right)} & \cdots & {\left( {\frac{{x_{pn} - \bar{x}_{p} }}{{\sqrt {s_{pp} } }}} \right)} \\ \end{array} } \right]^{\text{T}} \, \times \, \left[ {\begin{array}{*{20}c} {e_{11} } & {e_{12} } & \cdots & {e_{1p} } \\ {e_{21} } & {e_{22} } & \cdots & {e_{2p} } \\ \vdots & \vdots & \ddots & \vdots \\ {e_{1p} } & {e_{2p} } & \cdots & {e_{pp} } \\ \end{array} } \right].$$
(1)

Many methods have been proposed to determine the number of principal components. Methods include (among others): the Kaiser’s criteria, Log-Eigenvalue (LEV) diagram, Velicer’s Partial Correlation Procedure, Cattell’s SCREE test, cross-validation, bootstrapping techniques, cumulative percentage of total of variance, and Bartlett’s test for equality of eigenvalues. Kaiser’s criteria (1958) will be used in this paper, since is the most popular method [6].

According to Kaiser’s criteria, only the principal components, whose eigenvalues are greater than one, and whose explained cumulative variance are greater than 80 %, can be used to replace the original responses. This criterion is adequate when it is used with the correlation matrix [6].

However, while effective, the multivariate technique PCA presents a drawback, when the optimization of only one principal component is not adequate to represent all the process data set [2, 5, 6]. In some cases, studied in the published literature, for example, the first principal component has been the only component extracted [8, 10, 11, 19]. This does not occur in the majority of cases in today’s complex manufacturing processes we have [18]. In particular, it is doubtful whether the factor/level combination, determined by the first principal component only, is optimal or not [6, 8, 18].

In response to these concerns, some researchers proposed methods based on Weighted Principal Component Analysis (WPCA) [2, 2025], based especially on Taguchi and Response Surface design.

The Eq. (2) is based on Paiva et al. [2]. approach, in which the WPCA index combines RSM and PCA and, works as following: the original responses, obtained through a central composite design (CCD-RSM), are replaced by the resulting significant principal component scores. Subsequently, considering the eigenvalues of the correlation matrix as a set of weights, a new response index can be written as such:

$${\text{WPCA}} = \sum\limits_{p = 1}^{r} {\left[ {\lambda_{p} ({\text{PC}}_{p} )} \right]}$$
(2)

where, \(\lambda_{p}\) are the obtained eigenvalues of significant principal component and \(PC_{p}\) are the significant principal component scores that were extracted and stored from the original responses.

The ordinary least-squares (OLS) algorithm can be applying to WPCA index for obtain the second-order polynomial. With the RSM model established, the optimal point of the multiobjective problem can be found by locating the stationary point according to Eq. (3), only subjected to the constraint of the experimental region [2]. Other constraints may be added to the system according to the needs of the problem.

$$\begin{aligned} {\text{Maximize WPCA}} = \sum\limits_{p = 1}^{r} {\left[ {\lambda_{p} ({\text{PC}}_{p} )} \right]} \hfill \\ s.t:g\left( x \right) = x^{T} x \le \rho^{2} . \hfill \\ \end{aligned}$$
(3)

To solve the non-linear programming problem (NLP) described in the form of Eq. (3), the GRG can be used. GRG is considered one of the most robust and most efficient methods for constrained NLP [2].

The assumption of maximization described in Eq. (3) is established supposing that the desired optimization direction for each original response is positively correlated with the multivariate index WPCA. In this respect, Paiva el al. [2] have assumed which direction of optimization of the WPCA index can be established by analysing the correlation between WPCA and each original response. If there is a positive correlation between a WPCA index and certain original response, then they will have the same optimization direction. In this case, maximizing or minimizing WPCA will imply on the maximization or minimization of each original response variable. On the other hand, if the correlation is negative, the optimization senses will be inverse. For example, if WPCA index maintains a negative correlation with certain variables (whose goal is maximization), then the maximization of the WPCA will lead to their minimization. Likewise, it could be argued that if the correlation between the two output variables is negative, the maximization of the WPCA will imply on the minimization of the other and vice versa. For this, the inspection of the eigenvectors reveals the relationship that exists between the ith WPCA index and the original responses. More details about the WPCA approach can be seen in [2]. Thus, a potential difficulty in optimizing a multivariate response as WPCA occurs due to conflicting minima and maxima in a group of variables that, for instance, must be simultaneously maximized.

When there is a positive correlation with some variables and negative correlation with other simultaneously, multiplying the original response by a negative constant will solve the problem. This multiplication should be done before proceeding to the analysis of the principal component. However, when the responses have different optimization objectives, another likely alternative is to apply the Taguchi’s signal-to-noise ratio [7], since it is considered more robust, simple, and may be efficiently applied for continuous quality improvement and quality control of process not only end milling but also any other machining operations, where multiple objectives come under consideration [3, 7].

Taguchi uses SNR to measure the quality characteristic deviating from the desired value and these characteristics vary depending on the type of problem under study, which may be classified as ‘‘smaller-the-better’’, ‘‘bigger-the-better’’, and ‘‘nominal-is-best” [38]. Their mathematical expressions are represented by the Eqs. (4), (5), and (6), respectively, where y denotes the performance indicator, subscript i is the experiment number, and N is the number of replicates of the experiment i. After the transformation, SNR must always be maximized, which makes it possible to standardize the optimization objectives for the individual responses [6].

$${\text{SNR}} = - 10\log_{10} \left( {\frac{{\sum\nolimits_{i = 1}^{N} {y_{i}^{2} } }}{N}} \right)$$
(4)
$${\text{SNR}} = - 10\log_{10} \left( {\frac{{\sum\nolimits_{i = 1}^{N} {1/y_{i}^{2} } }}{N}} \right)$$
(5)
$${\text{SNR}} = 10\log_{10} \left[ {\left( {\frac{{\bar{y}}}{s}} \right)^{2} } \right],\bar{y} = \frac{{y_{1} + y_{2} + , \cdots , + y_{N} }}{N},{\text{and }}s = \frac{{\sum\nolimits_{i = 1}^{N} {(y_{i} - \bar{y})^{2} } }}{N - 1}.$$
(6)

2.2 The weighted principal component analysis and signal-to-noise ratio applied to the multiobjective optimization

Given the discussion above, this paper proposes a hybrid approach able to standardize the optimization objectives of the original responses, to reduce dimensions, and, at the same time, to eliminate the correlation between the multiple responses. This approach, called WPCA-SNR method, is based on Taguchi’s signal-to-noise ratio and Principal Component Analysis weighted by their respective eigenvalues.

The SNR-WPCA proposes that before constñructing the objective function and before carrying out the Principal Component Analysis, the signal-to-noise ratio must be applied to the original responses. A four-step process was developed to better understand the proposal approach, as it follows:

Step A: Apply the Taguchi’s signal-to-noise ratio for roughness responses according to Eq. (4) and for MRR according to Eq. (5). Then, establish the Response Surface models on the normalized responses.

Step B: Perform the Principal Component Analysis on the normalized responses by Taguchi’s signal-to-noise ratio using the correlation matrix. Store their respective scores, eigenvalues, and eigenvectors. Then, using the scores obtained, establish Response Surface models for the significant principal components.

Step C: Based on Eq. (2), develop the SNR-WPCA index. In this work, SNR-WPCA was developed as such Eq. (7), where, \(\rm{SNRPC_{p}}\) are the principal component scores that were extracted from the signal-to-noise ratio responses. Then, establish the Response Surface models on SNR-WPCA index. This model is the new response of the optimization problem.

$${\text{SNR-}}{\text{WPCA}} = \sum\limits_{p = 1}^{r} {\left[ {\lambda_{p} ({\text{SNRPC}}_{p} )} \right]} .$$
(7)

Step D: Based on Eq. (3), promote the optimization of the dry end milling process according to Eq. (8). In WPCA-SNR approach, the optimal points were identified using the GRG algorithm, available from Microsoft Excel’s Solver®, on the respective presented formulation. The set of constraints \(g\left( x \right) = x^{T} x \le \rho^{2}\) represents the experimental region, but other constraints can be added if it is necessary.

$$\begin{aligned} {\text{Maximize SNR-WPCA}} = \sum\limits_{p = 1}^{r} {\left[ {\lambda_{p} ({\text{SNRPC}}_{p} )} \right]} \hfill \\ {\text{s}} . {\text{t}}:g\left( x \right) = x^{T} x \le \rho^{2} . \hfill \\ \end{aligned}$$
(8)

In terms of design factors, this proposal establishes the empirical models for multiple responses of the dry end milling process, but can be applied to any manufacturing process.

3 Experimental method

The multiobjective optimization of dry end milling process employing to WPCA-SNR approach can be described at three stages. In the first stage, the Response Surface Methodology (RSM) was employed to determine the objective functions for the original responses. RSM combines statistical experimental design principles and mathematical techniques which are useful for the modelling and analysis of problems, in which responses of interest are not known and are influenced by several variables [4]. In the second stage, the correlation structure among the responses was analysed and the process was optimized by the WPCA method, thereby obtaining a primary optimal solution; it characterized the necessity of standardization the original response. Therefore, Taguchi’s signal-to-noise ratio was applied for the original responses, and then, the third stage was initiated. In the third stage, the proposal SNR-WPCA approach was applied, identifying a new optimal solution based on Taguchi’s signal-to-noise ratio and Principal Component Analysis weighted by their respective eigenvalues. At the end, the results obtained by the WPCA method and SNR-WPCA method applied on the dry end milling process were compared.

3.1 Experimental procedure and data analysis

In this work, the end milling parameters defined as input variables were feed per tooth (f z), axial depth of cut (a p), Cutting speed (V c), and radial depth of cut (a e). These parameters were designed according to a central composite design (CCD), created for four parameters at two levels totalling 16 factorials \(\left( {2^{k} = 2^{4} = 16} \right)\), eight axial points \(\left( {2k = 8} \right)\), and six center points. The additional centre point can be used to check for curvature. Furthermore, multiplying the centre points is strongly recommended, since they can improve the estimates of the quadratic effects and allow additional degrees of freedom for error. In coded units, the centre points present values 0 [4] (Table 4). To specify the parameter levels, previous studies and preliminary tests were considered. The parameter levels were fixed, as shown in Table 2. In the CCD matrix, a coded distance (\(\rho\)) of 2.0 was adopted for the centre points to the axial points. The experiments were conducted in a random order, but the data were arranged in standard order for easier viewing of the experimental designing. The software Minitab® was used to build the experimental matrix and to perform the statistical analysis from the experimental data.

Table 2 Cutting parameters and respective levels

The set of the outputs variables included average surface roughness (R a), maximum surface roughness (R y), root-mean-square roughness (R q), ten-point height (R z), maximum peak to valley (R t), and material removal rate (MRR). We selected five roughness’s responses, because although they are considered similar in evaluating the machined surface roughness, these parameters have not the same individual optimal points. Furthermore, they are highly correlated objective functions and have conflicting objectives with the MRR. Overall, since roughness and MRR exhibit positive correlation and considering that these objective functions are conflicting, there is a trade-off, where the WPCA-SNR approach could satisfactorily solve the problem.

MRR (cm3/min) in end milling process was calculated as follows:

$${\text{MRR}}_{i} = \frac{{V_{{{\text{f}}_{i} }} \times a_{{{\text{p}}_{i} }} \times a_{{{\text{e}}_{i} }} }}{1000} = \frac{{f_{z} \times N \times Z \times a_{\text{p}} \times a_{\text{e}} }}{1000}$$
(9)

with V f (table feed, mm/min), N (spindle speed, RPM), Z (number of effective teeth of the tool, Z = 3), and i (number of experiments). N can be calculated as \(N_{{_{i} }} = \frac{{V_{{{\text{c}}_{i} }} \times 1000}}{{\pi \times {\text{DC}}_{\text{ap}} }}\), where DCap (cutting diameter at cutting depth, of the tool, DCap = 25 mm).

The experiments were conducted on an FADAL, vertical machining center, model VMC 15, Maximum power of 15 kW, and maximum rotation speed of 7500 RPM. The tool overhang was 60 mm. The tool used was a positive end mill, code R390-025A25-11M with a 25 mm diameter, entering angle of \(\chi_{r} - 90{^\circ }\), and a medium step with three inserts (Fig. 1a). Three rectangular inserts were used with edge lengths of 11 mm each, code R390-11T308M-PM GC 1025 (Fig. 1b). The insert material used was cemented carbide ISO P10 coated with TiCN and TiN by the PVD process. The coating hardness was around 3000 HV3 and the substrate hardness 1650 HV3 with a grain size smaller than one. The surface roughness of the machined workpiece was measured using a Mitutoyo portable roughness meter, model Surftest SJ 201, with a cut-off length of 0.25 mm. All the milling experiments were performed under dry conditions.

Fig. 1
figure 1

a End milling tool, b cutting inserts, c workpiece

The workpiece material was AISI 1045 steel (Fig. 1c), with hardness of approximately 180 HB. The dimensions of the workpiece were rectangular blocks square sections of 100 × 100 mm and lengths of 300 mm. In the present days, AISI 1045 is widely used for all industrial applications requiring more wear resistance and strength. This material shows good machinability in normalized as well as the hot rolled condition. Based on the recommendations given by the machine manufacturers, operations, such as tapping, milling, broaching, drilling, turning, and sawing, etc, can be carried out on AISI 1045 steel using suitable feeds, tool type, and speeds. The chemical composition of this steel is given in Table 3.

Table 3 Chemical composition of AISI 1045 steel

Once all the responses had been measured, they were assembled to compose the experimental matrix presented in Table 4 and used as a data source for the modelling and optimization of the process.

Table 4 Experimental design–central composite design matrix

Experimentally, two least-squares-based models can be obtained: coded and uncoded based parameters. Montgomery [4] considers that the statistical analyses of the results are usually carried out using the coded variables. However, the same author says that it is often necessary to convert coded units into the original responses. For quantitative variables, uncoded variables can be achieved by the inverse of the following:

$${\rm X}_{i} = \frac{{x_{i} - \bar{x}_{i} }}{{\frac{1}{2}\Delta x_{i} }}$$
(10)

where \({\rm X}_{i}\) stands for the coded value of the ith variable, \(x_{1}\) stands for the original value of the ith variable, \(\bar{x}_{1}\) stands for the center point value of the original ith variable, and \(\Delta x_{i}\) stands for the difference of the original two values of the ith variable. The half value of the difference is called the step size [4].

The coded units approach is better than uncoded, because it provides resources to eliminate any spurious statistical results due to different measurement scales for the factors. In addition, uncoded units often lead to collinearity among the terms in the model. This inflates the variability in the coefficients estimates and makes them difficult to interpret. For these reasons, it was employed the model based on the coded units [4, 6].

4 Results and discussion

4.1 Development of mathematical model

Considering the goal to analyse the relation among end milling input parameters and responses, response surface models for the six end milling characteristics were developed. According to RSM, a lower order polynomial is usually employed. However, if the experimental space is in a region of curvature, then the process is fine-tuned by a second-order polynomial, such as described by the following [4]:

$$\begin{aligned} \hat{y}(x) = \beta_{0} + \beta_{1} f_{\text{z}} + \beta_{2} a_{\text{p}} + \beta_{3} V_{\text{c}} + \beta_{4} a_{\text{e}} + \beta_{11} f_{\text{z}}^{2} + \beta_{22} a_{\text{p}}^{2} + \beta_{33} V_{\text{c}}^{2} + \beta_{44} a_{\text{e}}^{2} + \hfill \\ \beta_{12} f_{\text{z}} a_{\text{p}} + \beta_{13} f_{\text{z}} V_{\text{c}} + \beta_{14} f_{\text{z}} a_{\text{e}} + \beta_{23} a_{\text{p}} V_{\text{c}} + \beta_{24} a_{\text{p}} a_{\text{e}} + \beta_{34} V_{\text{c}} a_{\text{e}} + \varepsilon \hfill \\ \end{aligned}$$
(11)

where \(\hat{y}(x)\) represents the responses variables R a, R y, R q, R z, R t, and MRR, f z, a p, V c, and a e are the cutting parameters, \(\beta_{ij}\) are coefficients to be estimated, and \(\varepsilon\) is the error observed in the response.

The values of the coefficients of the polynomials were estimated by Ordinary Least-Squares (OLS) algorithm. Then, the ANOVA procedure was applied to check the adequacies of the models as well as their adjustments and to remove non-significant terms. Table 5 presents the obtained coefficients for the final full quadratic models and the main results of the ANOVA.

Table 5 Estimated coefficients for the final quadratic models and ANOVA results

Regarding the adequacy, all regressions presented lower error S values. Furthermore, all regression p value presented less than 5 % significance. These results indicate that all expressions are adequate. Another important aspect in the statistical model-building process is associated with the amount of explicability of the dependent variables (y) by the predictors (x). It can be noted that the models found in the present work are adequate, once all of them exhibit Adj.R 2 above 85 % [17]. The results of the normality test for the residuals of the RSM models showed that the residuals are normal, since all Anderson–Darling coefficients (AD) were less than 1.000 (with p values higher than 5 % of significance). The Variance Inflation Factor (VIF) was also assessed. VIF values close to one imply that the predictors are not correlated. On the other hand, VIF values greater than five suggest that the regression coefficients were poorly estimated. All models showed VIF values close to one, indicating that the predictors were not correlated and were well estimated.

Table 5 also shows the calculated curvature p values for the responses. It indicates that the experimental space for the roughness responses falls within the curvature region (p values less than 5 % of significance). As MRR is not an experimental response, there is no guarantee that the MRR equation will present curvature.

Although some non-significant terms were found, their exclusion from the complete model did not imply in prediction variance reduction of the error S and reduced Adj.R 2.

Therefore, the ANOVA results have showed that the developed models are reliable and can be used in predicting, controlling, and optimizing this dry end milling process of the AISI 1045 steel.

The analysis of variance for Response surface models and the main effects of process parameters on mean response characteristic are presented in Fig. 2. The results have shown that feed per tooth is the most influential factor to explain the behaviour of roughness responses, followed by axial depth of cut, whereas cutting speed and radial depth of cut have no impact on the roughness. On the other hand, MRR has been strongly influenced by feed per tooth and axial depth of cut, while cutting speed and radial depth of cut have a slight impact on MRR. These same results can also be found in Brito et al. [34], Charnjeet et al. [35], Costa et al. [1, 7], and Lopes et al. [8].

Fig. 2
figure 2

Effects of process parameters on mean response characteristic

In fact, the authors described in Table 1 for end milling process have also published similar results. In Ginta et al. [31], for example, R a is also more affected by the feed, followed by cutting speed and axial depth of cut. According to authors, this is related to the effect of vibration, built-up edge, and other phenomena that occur during machining. Thangarasu, Devaraj, and Sivasubramanian [32], on the other hand, have affirmed that R a is more affected by the depth of cut and feed rate than that of the spindle speed, so that the depth of cut is the most critical factor for attaining the MRR while reducing the value of R a. At the same time, in Chahal et al. [33], R a has been significantly influenced by table feed rate and step over followed by spindle speed and depth of cut. Coolant pressure has a slight impact on surface roughness. On the other hand, MRR has been significantly influenced by depth of cut and step over followed by feed rate and spindle speed, whereas coolant pressure has no impact on MRR. Finally, in Kumar e Davis [36], the only significant factor found for the end milling operation of AISI Steel 410 was speed, whose effect on the surface roughness has to be considered. While none of the factors were found to be significant for the milling operation of Aluminum 6061.

Therefore, the results above-mentioned prove that feed per tooth, axial depth of cut, and cutting speed appear to have been the most important factors on the roughness and the MRR in the end milling process.

4.2 Correlation analysis

Using correlation analysis, the existence of a strong, positive correlation, with statistical significance, between the roughness responses, and a moderate, positive correlation between roughness responses and MRR can be observed (Table 6).

Table 6 Correlation structure between the responses

Since the responses of the dry end milling process are multi-correlated, using PCA method to optimize is thus justified. In the following sections, these components are optimized using the weighted approach of Principal Component, and weighted Principal Component with Taguchi’s signal-to-noise ratio approach, to decide optimal factor levels.

4.3 Optimization by the weighted principal component analysis method (WPCA)

The Principal Component Analysis was first performed to find the uncorrelated principal components needed to represent the original responses. Thus, using the correlation matrix, the principal components scores were extracted from the original responses and stored (Table 8) with their respective eigenvalues and eigenvectors (Table 7). The first and second principal components are responsible for of 99 % of the variation structure of the six dry end milling responses (Table 7).

Table 7 Principal component analysis: WPCA method

By Kaiser’s criteria, only PC1 could be use in the optimization. However, it can be noted that, while PC1 represents the roughness responses, since the coefficients of these terms have the same signal and are not close to zero, PC2 represents the MRR response. In other words, there is a strong positive correlation between PC1 and the quality characteristics studied, and a strong positive correlation between PC2 and MRR. Thus, it is necessary to select both PC’s. Thus, most of the data structure can be captured in two underlying dimensions. The remaining principal components account for a very small proportion of the variability and are considered unimportant.

To performing PC1 and PC2 as a valid method, it is important to check for outliers, because they can significantly influence your results. The observations’ squared Mahalanobis distance can be used for this type of situation. The squared Mahalanobis distances accounts for the distance between and a group and an observation to identify outliers in the multivariate space. A point above the line reference represents an unusual observation [39]. According to Groot, Postma, and Melssen [39], it is a more powerful multivariate method for detecting outliers than examining one variable at a time, because it considers the different scales between variables and the correlations among them.

Thus, an outlier identification plot was included to assess eventual observations that could influence the inflation of regression coefficients (Fig. 3). All points are located below the reference line. This implies that the multivariate approach is an adequate option for the end milling data and that only the first two principal components are necessary to form the WPCA index.

Fig. 3
figure 3

Multivariate outlier’s detection for WPCA method

Considering the scores calculated in the PCA (Table 8) and modelling them according to RSM, Eqs. (12) and (13) were obtained. To OLS algorithm was employed to estimate the coefficients.

Table 8 Results for WPCA method and SNR-WPCA method
$$\begin{aligned} {\text{PC}}_{1} &= 0.510 + 2.390f_{\text{z}} - 0.114a_{\text{p}} + 0.086V_{\text{c}} + 0.100a_{\text{e}} - 0.473f_{\text{z}}^{2} -0.255a_{\text{p}}^{2} + 0.061V_{\text{c}}^{2} + \\ & 0.029 a_{\text{e}}^{2} - 0.187f_{z} a_{\text{p}} + 0.134f_{\text{z}} V_{\text{c}} - 0.052f_{\text{z}} a_{\text{e}} - 0.044a_{\text{p}} V_{\text{c}} + 0.319a_{\text{p}} a_{\text{e}} + 0.073V_{\text{c}} a_{\text{e}} \\ {\text{Adj}} - R^{2} = 95.26\;\% \\ \end{aligned}$$
(12)
$$\begin{aligned} {\text{PC}}_{2}& = -0.165 + 0.106f_{\text{z}} + 0.801a_{\text{p}} + 0.137V_{\text{c}} + 0.183a_{\text{e}} + 0.149f_{\text{z}}^{2} + 0.078a_{\text{p}}^{2} - 0.013V_{\text{c}}^{2} - \\ & 0.008a_{\text{e}}^{2} + 0.321f_{\text{z}} a_{\text{p}} + 0.017f_{\text{z}} V_{\text{c}} + 0.073f_{\text{z}} a_{\text{e}} + 0.084a_{\text{p}} V_{\text{c}} - 0.023a_{\text{p}} a_{\text{e}} - 0.005V_{\text{c}} a_{\text{e}} \\ {\text{Adj}} - R^{2} = 97.38\;\% . \\ \end{aligned}$$
(13)

The WPCA values reported in Table 8, obtained according to Eq. (2), turn into \({\text{WPCA}} = \lambda_{1} ({\text{PC}}_{1} ) + \lambda_{2} ({\text{PC}}_{2} )\), and has considered the first eigenvalue as \((\lambda_{1} = 5.254)\) and second eigenvalue as \((\lambda_{2} = 0. 6 8 3)\), according to Table 7. Applying the OLS algorithm, the multivariate objective function can be written as:

$$\begin{aligned} \rm{WPCA} & = 2.567 + 12.619f_{\rm{z}} - 0.0.054a_{\rm{p}} + 0.544V_{\rm{c}} + 0.650a_{\rm{e}} - 2.382f_{\rm{z}}^{2} - 1.2285a_{\rm{p}}^{2} + 0.310V_{\rm{c}}^{2} + \\ & + 0.149a_{\rm{e}}^{2} - 0.763f_{\rm{z}} a_{\rm{p}} + 0.716f_{\rm{z}} V_{\rm{c}} - 0.223f_{\rm{z}} a_{\rm{e}} - 0.175a_{\rm{p}} V_{\rm{c}} + 1.657a_{\rm{p}} a_{\rm{e}} + 0.381V_{\rm{c}} a_{\rm{e}} \\ \rm{Adj} - R^{2} = 95.58\;\% . \\ \end{aligned}$$
(14)

The ANOVA results for PC1, PC2, and WPCA expressions presented lower error S values, and the regression p value represented less than 5 % of significance. The models presented Adj.R 2 value above 85 %. These results indicate that all expressions are adequate and the full quadratic models of PC1, PC2, and WPCA were considered. Figure 4 represents the WPCA multivariate objective function as a function of the variables f z and a p, given that feed per tooth (f z) and axial depth of cut (a p) were the most significant for all expressions.

Fig. 4
figure 4

WPCA surface plot as a function of a p and f z

As previously mentioned, the direction of optimization of the WPCA index can be established by analysing the correlation between WPCA and each original response. The inspection of the eigenvectors reveals the kind of relationship that exists between the ith WPCA index and the original responses.

Examining the eigenvectors (Table 7), it is possible to observe that although there is a reasonable explanation observed between PC1 and MRR response, roughness is mainly composed by the first principal component. On the other hand, MRR is mostly composed by PC2. Furthermore, the direction of optimization (or direction of correlation) between PC1 and all the responses is positive. On the other hand, the direction of optimization (or direction of correlation) between roughness and PC2 is negative, while the direction of optimization is positive for PC2 and MRR.

Primarily, when PC1 is maximized, all the responses will also be maximized, and minimizing PC1 implies in minimizing all responses. When PC2 is maximized, roughness will be minimized and MRR values will be maximized. On the other hand, minimizing PC2 both the roughness and MRR responses will be maximized. Analogously, maximizing WPCA will maximize all responses, while minimizing it will minimize all responses.

To improve the end milling process, the roughness responses must be minimized, while material removal rate must be maximized. Thus, the study of the eigenvectors shows that it is impossible to minimize the roughness responses and maximize MRR simultaneously.

In this case, to comply with the objectives of the second part of this paper, two strategies will be proposed and compared: (a) to maximize WPCA and (b) to minimize WPCA. Probably, these multivariate optimization approaches by the WPCA method can lead to MRR minimization, or roughness surface maximization.

In both cases, using the GRG algorithm to solve the system of Eq. (3), the maximization and minimization of WPCA can be established according to Eqs. (15) and (16), respectively. The constraint of the experimental region \({\mathbf{x}}^{{\mathbf{T}}} {\mathbf{x}} \le 2^{2}\) was considered, since \(\rho\) for CCD with control factors \(k = 4\) is 2. Table 9 exhibits the optimal solution found.

Table 9 Optimal solutions for the dry end milling process obtained with WPCA method
$$\begin{aligned} \text{Maximize WPCA} = \lambda_{1} {\rm(PC_{1}} ) + \lambda_{2} {\rm(PC_{2}} ) \hfill \\ s.t:g\left( x \right) = x^{T} x \le 2^{2} \hfill \\ \end{aligned}$$
(15)

and,

$$\begin{aligned} \text{Minimize WPCA} = \lambda_{1} {\rm(PC_{1}} ) + \lambda_{2} {\rm(PC_{2}} ) \hfill \\ s.t:g\left( x \right) = x^{T} x \le 2^{2} . \hfill \\ \end{aligned}$$
(16)

In Table 9, Target and Nadir values of each original response were determined by individual constraint minimization (for roughness responses) and individual constraint maximization (for MRR response), such as \(\zeta_{{y_{i} }} = \mathop {\text{Min}}\limits_{x \in \varOmega } \left[ {\hat{y}_{j} \left( x \right)} \right]\) and \(\zeta_{{y_{i} }} = \mathop {\text{Max}}\limits_{x \in \varOmega } \left[ {\hat{y}_{j} \left( x \right)} \right]\), respectively. The target value corresponds to the best possible values, while Nadir value corresponds to the worst values. These values are considered as specification limits for the optimization problem.

As expected, the achieved optimum solutions by the WPCA method do not satisfy the end milling process improvement. It can be observed that when WPCA is maximized, the method tends to the MRR optimal values, while, at the same time, obtains high roughness values which are outside the specifications limits. On the other hand, when WPCA is minimized, the WPCA method tends to the roughness optimal values, in detriment of productivity.

4.4 Optimization by the weighted principal component analysis combined with Taguchi’s signal-to-noise ratio method (SNR-WPCA)

Although the WPCA method can be very useful to eliminating correlated responses that will be optimized, they may conduct the results to inadequate optimum solutions if there are variables in the set with inverse directions of optimization. The results previously reported by the WPCA method showed that the maximization or minimization of the principal components favored some end milling characteristics and not others. Therefore, the WPCA method was applied to a new optimization to improve these results through standardization of optimization objectives of the original responses. For this, the Taguchi’s signal-to-noise ratio (SNR) was applied before principal components analysis. In this part of the experiment, the steps previously described were developed.

4.4.1 Step A. Taguchi’s signal-to-noise ratio and RSM models

The Taguchi’s signal-to-noise ratio was calculated according to Eq. (4) for average surface roughness (R a), maximum surface roughness (R y), root-mean-square roughness (R q), ten-point height (R z), maximum peak to valley (R t), and according to Eq. (5) for material removal rate (MRR). The calculated values for SNR/R a, SNR/R y SNR/R z SNR/R q SNR/R t, and SNR/MRR are presented in Table 8. Then, the OLS algorithm was applied on normalized responses, and the results can be observed in Table 10. It can be noted that all expressions presented lower error S values and the regression p-value were less than 5 % of significance. The Adj.R 2 value is above 85 %. All the residuals are normal. Thus, the ANOVA results show that the final full quadratic models are reliable and can be used for the optimization of this process.

Table 10 Estimated coefficients for the final quadratic models and ANOVA results

4.4.2 Step B. Principal component analysis for the Taguchi’s SNR responses and RSM models

Once the original responses were normalized by Taguchi’s signal-to-noise ratio, PCA was performed on the data, considering the correlation matrix. The principal components scores were extracted and stored (Table 8) with their respective eigenvalues and eigenvectors (Table 11). PC1 and PC2 are responsible for the explanation of 99.1 % of the variation structure within the six SNR responses (Table 11). The multivariate outlier’s detection (squared Mahalanobis distances) showed no unusual points (Fig. 5). Furthermore, it can be noted that PC1 is mostly related to standardized roughness and that PC2 relate to SNR-MRR (Table 11). These analyses imply that both PC1 and PC2 are necessary to form the SNR-WPCA index.

Table 11 Principal component analysis: SNR-WPCA method
Fig. 5
figure 5

Multivariate outlier’s detection for SNR-WPCA method

The OLS algorithm applied on PC1 and PC2 yielded the results that can be observed in Table 10. The ANOVA results show that the final full quadratic models are reliable and can be used for the optimization of this process.

4.4.3 Step C. SNR-WPCA index and RSM models

Finally, the SNR-WPCA multivariate objective function was developed considering the first and second principal component scores and their respective eigenvalues, \((\lambda_{1} = 5. 1 7 8)\) and \((\lambda_{2} = 0. 7 6 2)\) as weights, yielding the following:

$${\text{SNR-WPCA = }}\lambda_{ 1} ( {\text{SNPC}}_{ 1} ) + \lambda_{ 2} ( {\text{SNPC}}_{ 2} ).$$
(18)

The SNR-WPCA values were stored in Table 8.

Considering the observations calculated in the SNR-PCA formulation, the values of the coefficients were estimated by the OLS algorithm, and the second-order mathematical model was developed as follows:

$$\begin{aligned} {\text{SNR-WPCA}} = - 2.54 - 10.217f_{\text{z}} + 2.190a_{\text{p}} + 0.542v_{\text{c}} - 1.252a_{\text{e}} - 0.837f_{{{\text{z}}^{2} }} + 2.751a{}_{{{\text{p}}^{2} }} + \, \hfill \\ \, 0.527v_{{{\text{c}}^{ 2} }} + 0.714a_{{{\text{e}}^{2} }} + 0.185f_{\text{z}} \times a_{\text{p}} - 1.908f_{\text{z}} \times v_{\text{c}} + 1.528f_{\text{z}} \times a_{\text{e}} - 0.820a_{\text{p}} \times v_{\text{c}} - \hfill \\ \, 0.516a_{\text{p}} \times a_{\text{e}} + 0.520v_{\text{c}} \times a_{\text{e}} \, .\hfill \\ \end{aligned}$$
(19)

Then, the ANOVA procedure was applied to check for adequacies of the model as well as their adjustment. Table 10 presents the obtained coefficients for the final full quadratic model and the main results of the ANOVA. Since the regression p value presented less than 5 % of significance and lower error S value was presented, it can be seen that the expression SNR-WPCA is adequate. Regarding adjustments, the model presented Adj.R 2 values above 85 %, indicating a good adjustment. Furthermore, the calculated curvature p values show that the experimental space for the SNR-WPCA multivariate objective response falls within the curvature region. The result of the normality test for the residuals demonstrates that the residuals are normal and uncorrelated for SNR-WPCA model (p value >5 % of significance). Even though some non-significant terms were found, their exclusion from the complete model increased the error S and reduced Adj.R 2. A full second-order model was then considered to surpass this problem. Therefore, the ANOVA results showed that the developed model is reliable and can be used in optimizing the dry end milling process.

Figure 6 represents the SNR-WPCA multivariate objective function as a function of the parameters feed per tooth (f z), axial depth of cut (a p), cutting speed (V c), and radial depth of cut (a e).

Fig. 6
figure 6

SNR-WPCA surface plot as a function of the end milling parameters

4.4.4 Step D. SNR-WPCA index and RSM models

Employing the GRG algorithm available from Microsoft Excel’s Solver® routine for the system of Eq. (8), the maximization of the set of pre-processed response can be established according to Eq. (20). The non-linear constraint \({\mathbf{x}}^{{\mathbf{T}}} {\mathbf{x}} \le 2^{2}\) was also considered.

$$\begin{aligned} {\text{Maximize SNR-WPCA}} = \lambda_{1} ({\text{SNRPC}}_{1} ) + \lambda_{2} ({\text{SNRPC}}_{2} ) \hfill \\ s.t:g\left( {\mathbf{x}} \right) = {\mathbf{x}}^{{\mathbf{T}}} {\mathbf{x}} \le 2^{2} . \hfill \\ \end{aligned}$$
(20)

The maximization of the SNR-WPCA was used due to the mathematical nature of the signal-to-noise ratio; it must be always maximized. In this case, it is not a problem. Examining the eigenvectors (Table 11), it is possible to observe that the direction of optimization (or direction of correlation) between the PC1 and all the roughness responses is positive, while the direction of optimization (or direction of correlation) between the PC1 and MRR is negative. On the other hand, the direction of optimization between PC2 and all the responses is negative.

Since the first principal component consists mostly by the roughness and that the PC2 consists mostly by the MRR, it can be find out that when PC1 and PC2 are maximized, both roughness and MRR will be maximized. Therefore, all the objectives of the SNR responses will be achieved.

The optimization results are presented in Table 12 and can be compared with the Targets (\(\zeta_{yi}\)) and Nadir values of the original responses.

Table 12 Optimal solutions for the dry end milling process obtained with the SNR-WPCA method

According to the results present in Table 12, it can be observed that to maximize the MRR, while minimizing surface quality simultaneously, f z = 0.08 mm/tooth, a p = 1.62 mm, Vc = 331 m/min, and a e = 15.49 mm are the values that attained the desired quality conditions using the SNR-WPCA method for dry end milling process.

Similar results were also obtained by Costa et al. [1]. The authors analysed the optimal parameters in a multiobjective optimization problem, in end milling process, to find at the same time, the maximum volume of removed material, and minimum surface roughness. In this article, f z = 0.09 mm/tooth, a p = 1.73 mm, V c = 333.78 m/min, and a e = 16.20 mm have been considered as the optimal cutting parameters to obtain values of 0.91, 4.34, 3.95, 1.06, and 4.44 μm (R a, R y, R z, R q, R t, respectively), while a value of 33.88 cm3/min has been obtained for MRR.

There are some physical explanations of the AISI 1045 end milling process that can justify the results found from this study. For example, the lower feed per tooth (f z) minimizes the roughness of the part, because it promotes a geometric effect of the inserts on the tips of peaks of the milled surface texture irregularities. The depth of cut (a p) obtained, near the level (+1) of DOE design, allows the mill to work with the main cutting edge and not just the nose radius (r = 0.8 mm). This makes it easier to shear the workpiece material and prevents the formation of a lateral flow on the chip, which could harm the finish and increase the roughness [40]. The larger the depth of cut the greater the productivity. The radial depth of cut (a e) obtained, near the level (−1) of DOE design, enables the mill to work with its centre within the workpiece, with a ratio a e/Dc of around 62 %. This ratio of radial depth of cut with cutting diameter (Dc) is considered optimal in terms of tool-workpiece engagement for asymmetrical down cut end mill, which causes it less prone to the vibration process [40]. The smaller the vibration the lower the surface roughness. The cutting speed (V c) obtained per optimization method, near to level (0) of DOE design, enables the mill to work on a medium rotation. This makes it less likely that a vibration is brought to the rotation system of the machine tool and workpiece, since the machine used in the tests has longer than 10 years.

Thangarasu, Devaraj, and Sivasubramanian [32], Chahal et al. [33] and Singh et al. [35] have put forward the same arguments, as the ones referred to above when roughness and MRR have been analysed in the end milling process. Ginta et al. [31] have also affirmed that an increase in cutting speed, axial depth of cut, and feed per tooth leads to increase in the surface roughness.

Furthermore, all optimized responses were established within the specification limits and relatively close to their Utopia point, which suggests that this approach gives a better solution than the WPCA method.

The numerical results indicate that the solutions found by WPCA-SNR approach were characterized as more appropriate optimal point in relation to one obtained with the WPCA method. Thus, the SNR-WPCA method showed itself to be a good technique to the optimization of the AISI 1045 dry end milling process.

5 Confirmation experiments

To verify the reproducibility of the results, a series of three confirmation experiments were run with the optimal combination of the dry end milling parameters, i.e., f z = 0.08 mm/tooth, a p = 1.62 mm, V c = 331 m/min, and a e = 15.49 mm. Table 13 presents these results and shows that most of the responses presented real values close to the predicted ones. It can be observed here that the SNR-WPCA method successfully conducted the process to a compatible result with the expected objectives.

Table 13 Optimal solutions for the dry end milling process obtained with the SNR-WPCA method

6 Conclusion

This paper presents an approach, called weighted principal component analysis, combined with Taguchi’s signal-to-noise ratio (SNR-WPCA). SNR-WPCA was developed to optimize multiple correlated responses presenting different objectives optimization. Although capable of considering the correlation between the responses, the weighted principal component analysis (WPCA) can conduct the results to inadequate optimum solutions when the multiple responses present conflicting objectives of optimization. We proposed here a standardization of the optimization objectives of the original responses, by Taguchi’s signal-to-noise, before principal component analysis. Response Surface Methodology, applied to model the dry end milling characteristics, developed statistically significant mathematical models.

The numerical results indicate that the solution found by SNR-WPCA approach was characterized as a more appropriate optimal point in relation to one obtained with the WPCA. Considering the SNR-WPCA method, f z = 0.08 mm/tooth, a p = 1.62 mm, V c = 331 m/min, and a e = 15.49 mm are the optimal parameters for minimize roughness, and maximize material removal rate, simultaneously.

On the other hand, the results reported by the WPCA method showed that the maximization or minimization of this function favors some characteristics and not others. I.e., when WPCA is maximized, the optimized MRR is established within the specification limits and relatively close to his target. However, this approach presents high roughness values. The opposite is also true. When WPCA is minimized, the optimized roughness responses are established very close to their targets and within the specification limits.

Therefore, the SNR-WPCA method showed itself capable to surpass the drawbacks of the WPCA method. The SNR-WPCA method showed itself to be a good technique to the optimization of the AISI 1045 dry end milling process when the multiple quality characteristics measured are highly correlated and with different optimization objectives.

The models’ capability of predicting the results was verified by the confirmation experiments; low errors were observed between the theoretical and the real values.

Finally, the results are particularly useful for scientists and engineers to determine which parameters of the end milling process without cutting fluids are able to achieve, at the same time, the maximum rate of removed material, and minimum surface roughness.