1 Introduction

Aluminium and its alloys commonly exhibit notable multi-property characteristics, including a high strength-to-weight ratio, excellent heat conductivity, and effective resistance against corrosion, under typical circumstances. As a consequence, these materials may be implemented across a wide range of industries. Notwithstanding their notable attributes, these materials have notable drawbacks like diminished mechanical strength, inadequate wear resistance, and subpar corrosion resistance when exposed to submerged water environments [1,2,3,4]. The utilization of aluminium-based metal matrix composites (MMCs) is a viable solution to mitigate these limitations. In order to acquire the requisite mechanical, physical, chemical, or thermal characteristics, metal matrix composites (MMCs) fundamentally amalgamate two or more components at the macroscopic level. They are widely utilized in many industries such as manufacturing electrical equipment, automotive, marine, and aerospace, owing to their distinctive characteristics [5,6,7]. Ceramic materials are commonly employed as reinforcement in composites due to their ability to increase the essential characteristics of the composites [8,9,10]. The Al-6082 alloy (Si) is primarily composed of two prominent alloying elements, namely magnesium (Mg) and silicon. The alloy has moderate mechanical strength and high resistance to air corrosion. It is commonly utilized as a construction material for architectural elements necessitating substantial structural integrity, such as cranes, bridges, and trusses [11,12,13]. Likewise, Al-6061 is a widely recognized aluminium alloy renowned for its exceptional ratio of strength to weight, commendable resistance to corrosion, and favorable weldability. When the combination of titanium dioxide (TiO2) and graphene is utilized, it introduces novel opportunities and augments its characteristics for diverse applications. Several uses of this material may be observed in various industries such as Aerospace and Aviation, Automotive Industry, Marine and Offshore Industries, Sports Equipment, Medical Devices, and other sectors [14,15,16].

To evaluate the performance efficiencies and mechanical properties of aluminium and aluminium-based matrix composites, numerous research-related activities and evaluations have been conducted [17,18,19,20]. The literature demonstrates that the creation of composite materials can enhance the hardness, wear, and corrosion characteristics of MMCs based on aluminium. By using ceramic components, aluminium composites were created. The results of the literature review revealed that adding ceramic components not only increased the composite's macro and micro hardness but also considerably increased its wear and corrosion resistance. The stir-casting procedure is one of the most used methods for creating composites. Ceramics made of Titanium Dioxide (TiO2), and Graphene powder, which are tough, can be utilized as reinforcement for composite materials. Additionally, it has good qualities for resisting corrosion [21,22,23]. Researchers are actively working on research on the impacts of TiO2 and graphene powder in composites for corrosion resistance and enhancing thermal and electrical conductivity. However, TiO2 and graphene powder has a well-established history of usage as a hardening reinforcement, and it has been studied to determine the effect of produced metal composites on various properties [24,25,26]. With the use of SEM examination, the microstructure of the TiO2 and Graphene powder-reinforced MMC is thoroughly examined. The results have revealed numerous modifications in the mechanical characteristics of TiO2 and Graphene powder-enhanced MMCs [27,28,29]. This reinforcement has generally improved the mechanical properties of the produced composites, which is a good thing. According to studies, TiO2 and Graphene powder has a more favourable impact on mechanical characteristics including hardness and compressive strength. Studies have revealed that the increased hardness of TiO2 and Graphene powder reinforced composites also enhances their tribological characteristics [30, 31]. According to research, TiO2 and Graphene powder are largely utilized to enhance the mechanical and tribological characteristics of composite materials [32, 33].

It is necessary to investigate the extent of the post-processing technique's operational parameters [34,35,36]. The needed surface finish can be produced by choosing the correct operating parameter values. Numerous tests and testing of the specimens may be necessary due to the wide variation in the number of parameters and their range [37,38,39]. Taguchi-Design of experiments (T-DoE), an optimization technique, is used to get around this laborious process [40,41,42]. By using this technique, it is possible to identify the post-processing settings that are most effective at giving coated surfaces the desired surface finish [42,43,44]. Experimentation and surface roughness testing can be used to validate the optimal values. A surface coating with a high-quality surface finish is produced as a result of using the best post-processing settings [45,46,47].

The ideal values of the process parameters were obtained using optimization techniques. Researchers have used a variety of techniques for process parameter optimization, including turning of AISI 5140 for flank wear, cutting forces and vibration [48, 49], plasma arc cutting [50], welding of similar and dissimilar metals [51,52,53,54], fused filament fabrication for part geometry accuracy [55], dry sliding wear characteristics [56, 57], forming process [58], and cold rolling for improved surface characteristics [59]. According to the literature, a variety of methodologies can be used for process parameter optimization, however, Taguchi analysis is the most effective in terms of the outcomes.

In their study, Fang and Hong [60] explored various transformations of the response variable and subsequently developed estimation and confidence intervals for the resulting non-linear model's uncertainty. In the context of the Box-Cox model with general uncertainty, the utilization of the uncertain least squares method for estimation may result in a flawed estimate due to the transformation parameter approaching negative infinity and zero. This issue arises due to the tendency of the aforementioned parameters towards these limiting values. Liu et al. [61,62,63] proposed the rescaled least squares estimation (RLSE) method to obtain results for the uncertainty Box-Cox regression model. However, it should be noted that this method is only applicable to the Box-Cox transformation and cannot be used for a general uncertain model [64]. An alternative methodology is the uncertain maximum likelihood estimation (MLE), which has been proposed in recent literature. The authors Lio [61] and Liu [62] introduced the maximum likelihood estimation (MLE) approach for the continuous uncertainty model and implemented it in the context of the uncertainty linear regression model.

The present study aims to enhance the surface finish and minimize the surface roughness of the Al alloy composite subjected to turning operation by optimizing the CNC turning operating parameters. The optimization of operating parameters, namely Depth of Cut (DoC), Spindle Speed (SS), and Feed Rate (FR), will be carried out through the utilization of Taguchi analysis in combination with the Single-Response Performance Index (SPI). Acquiring the regression equation that is fine-tuned with hyperparameters to accurately predict the surface roughness at a high level. The nature of the data plot is heteroscedastic, indicating that the variables are unbiased.

The utilization of analysis of variance (ANOVA) is a prevalent method for optimizing process parameters since it aids in the creation of regression equations. The primary objective of this study is to enhance optimization by employing the Box-Cox transformation technique and determining the maximum likelihood value (lambda ‘λ’) to assess statistical significance (P-value). Following this, the variance inflation factor (VIF) is calculated, which functions as a quantitative measure for evaluating the existence of multicollinearity in regression analysis.

2 Materials and methods

The composite material consists of the matrix material and the matrix material selected for this study was the aluminium alloy Al-6061. Of all the aluminium alloys, Al-6061 alloy offers the highest strength and ductility as well as outstanding machinability, good bearing, and wear characteristics. Additionally, it has a low weight, a good strength-to-weight ratio, and great open-air corrosion resistance. The automobile, aerospace, and marine sectors all make extensive use of it. Its lack of mechanical strength restricts its application. In order to make the use of aluminium alloy in high-strength bearing applications in the automotive and aerospace sectors simpler, extensive research on the material has been done through the creation of MMCs [11]. The composition of Al-6061 is shown in Table 1, and its mechanical properties are shown in Table 2.

Table 1 Composition of Al-6061
Table 2 Mechanical properties of Al-6061

Based on the existing body of research, ongoing efforts are being made to develop metal matrix composites (MMCs) with aluminium as the primary constituent, with the aim of catering to a diverse array of applications. The utilization of titanium dioxide (TiO2) as reinforcement has been proven to enhance the necessary strength [21]. Graphene powder (C) is also used as a reinforcement as it is a good conductor of electricity and a good conductor of heat, due to the presence of free electrons. Obtaining the hyperparameter-tuned regression equation that predicts the surface roughness with high-level accuracy. The data plot is to be heteroscedasticity in nature, so the variables are not biased [27]. The mechanical properties of reinforcements are shown in Table 3. A flow chart to express the research work is shown in Fig. 1.

Table 3 Mechanical properties of Titanium Dioxide and Graphene
Fig. 1
figure 1

Research study flowchart

2.1 CNC turning operation

The present investigation aims to explore the optimization of CNC turning operation parameters that influence the diminution of surface roughness and enhancement of surface finish in machined composites. Figure 2 depicts the workpiece mounted on the CNC machine for machining. The machining operations were carried out using the CNC lathe CLT100. This CNC machine is flatbed type, with a Fanuc emulated keyboard. The chuck is manual with a maximum size of 100 mm dia. The CNC controller is ‘Cutviewer’, operating software which is a PLC-based controlled system. The CNC machine is automatic with a stepper motor with stepper drives. The Taguchi analysis was utilized together with the Single-Response Performance Index (SPI) to optimize the Depth of Cut (DoC), Spindle Speed (SS), and Feed Rate (FR) operating parameters. The optimization of operating parameters was carried out to achieve the desired surface finish.

Fig. 2
figure 2

CNC Turning tool and specimen

2.2 Design of experiments using Taguchi method (T-DoE)

The turning process parameters of the CNC lathe must be optimized in order to improve the surface quality finish of composite materials. The present investigation involved the utilization of Taguchi analysis for the optimization of process parameters. Figure 3 depicts the flow chart of the Taguchi Design.

Fig. 3
figure 3

T-DoE flow chart

During the preliminary phase, the T-DoE methodology relies on the number of process parameters associated with turning operations and the number of levels at which each of these factors is evaluated. Table 4 depicts the process parameters and their respective levels for the turning operation, which have been determined in accordance with the capacity of the CNC machine that has been selected for the purposes of this investigation.

Table 4 CNC Turning Operation Process Parameters and their Levels

The determination of the optimal combination of process parameters for turning operations, which results in the minimum surface roughness (Ra), is contingent upon the number of parameters and their respective levels. The present investigation involved the consideration of three distinct process parameters, each with six corresponding levels. Equation 1 [59] was utilized to establish the required number of trials.

$$\mathrm{No}.\mathrm{ \,of \,Experiments}=\mathrm{P }\times \left(L-1\right) +1$$
(1)

where L depicts levels and P represents parameters.

The present investigation utilized the T-DoE methodology and determined that a total of 9 experimental trials have been necessary. Orthogonal arrays of L9 and L27 are the available options in the T-DoE, of which L9 is chosen as shown in Table 5. The experimental design involved the consideration of two replications in each run, with the response being determined as the average of the two replications.

Table 5 Turning operation process parameters and their levels

2.3 Experimental setup and result measurement

The machining operation was conducted using a single-point cutting tool on a CNC turning centre. A rudimentary program for text conversion has been created with the aim of scholarly investigation. The surface roughness test was conducted using the Mitutoyo roughness tester as shown in Fig. 4. The other components for the surface roughness tests include the test specimen and an anvil. The specimen is kept over the v-groove of the anvil so that the lateral degrees of movement are arrested. The probe is set to be in contact with the workpiece and the probe movement was along the length of the workpiece. The measured results were recorded as presented in Table 6. Signal to noise ratio was calculated by using a smaller-is-better condition.

Fig. 4
figure 4

Setup for measuring the surface roughness

Table 6 Experimental Results with signal to noise ratio data

3 Results and discussion

By applying the 'smaller is better' criterion to the signal-to-noise ratio of the experimental findings, the influence of noise has been identified. The results of the signal-to-noise ratio (SNR) are presented in Table 6 along with the respective surface roughness values. The main effects plot of SRN is shown in Fig. 5. The significant combination of parameters identified from the experimental studies using the L9 orthogonal array was the DoC = 1 mm/cycle, SS measured 1000 rpm, and FR of 50 mm/min. The plot was obtained from the Minitab platform. The surface roughness values constantly increased as and when the DoC, SS, and FR were raised. This could be due to the built-up edges at the cutting edge of the single-point HSS tool.

Fig. 5
figure 5

Main effect plot of Signal noise ratio

3.1 Mathematical modelling

For the mathematical modelling, a three-step process is used. The Box-Cox transformation and calculation of the maximum likelihood value (lambda) for statistical significance (P-value) are described in detail in the first step [65,66,67]. The variance inflation factor (VIF), a metric for evaluating the presence of multi-collinearity in regression analysis, is determined in the second step [68,69,70]. The general linear regression is discussed in the third phase.

3.1.1 Box-Cox transformation

The utilization of the Box-Cox transformation methodology within the field of machine learning is implemented in order to achieve a normal distribution of the coefficients. The variable that is influenced by the independent variable is commonly represented as Y, while the variable that is being manipulated or controlled is typically denoted as X = (1, x1, x2, x3,…..xk) [71]. The Box-Cox technique presents a model that facilitates the transformation of a non-normal distribution of data for variables A, B, and C, without reliance on the original scale. This method aims to achieve a normal distribution of the data [72].

$${{\varvec{Y}}}_{{\varvec{B}}{\varvec{C}}}\boldsymbol{ }\left({\varvec{Y}},\boldsymbol{ }{\varvec{\lambda}}\right)\boldsymbol{ }={\varvec{X}}{\varvec{\beta}}+\boldsymbol{ }{\varvec{\sigma}}{\varvec{e}}$$
(2)

where,

If \({\varvec{\lambda}}\boldsymbol{ }\ne 0\)

$${{\varvec{Y}}}_{{\varvec{B}}{\varvec{C}}}\boldsymbol{ }\left({\varvec{Y}},\boldsymbol{ }{\varvec{\lambda}}\right)=\boldsymbol{ }{{\varvec{Y}}}^{{\varvec{\lambda}}}-1$$
(3)

If \({\varvec{\lambda}}\boldsymbol{ }=0\)

$${{\varvec{Y}}}_{{\varvec{B}}{\varvec{C}}}\boldsymbol{ }\left({\varvec{Y}},\boldsymbol{ }{\varvec{\lambda}}\right)={\varvec{L}}{\varvec{o}}{\varvec{g}}\boldsymbol{ }\left({\varvec{Y}}\right)$$
(4)

X is the covariate matrix including intercept.

β = (β0, β1, β3, ……, βk) = vector of regression coefficients.

The random error's variance is denoted by the symbol σ. The stochastic error term "e" follows the norm of the standard distribution. The Box-Cox transformation is used in Eq. 3, where the logarithm of Y reflects a particular case in which lambda (λ) equals zero.

For estimating the parameter lambda (λ), the method of maximum likelihood is frequently used [73]. Multiple lambda values are used to apply the model to the altered data; the best lambda is then chosen since it produces the highest likelihood value. The fact that changing the covariates alters the likelihood value is one problem with this approach. The lambda argument will change as a result. Thus, the use of Minitab software to optimize the value in order to handle this problem was considered since it effectively uses the fit statistic approach. Equation (2) and Eq. (3) only apply to positive numbers, namely when Y is greater than 0. The Yeo-Johnson distribution family should be used when the dataset contains negative values since it can be used without placing any restrictions on the response variable (including negative replies). The ideal lambda value can be easily calculated in Minitab software to improve the correlation between the variables. This improves the equation's accuracy as well [74].

The coefficient values for the intercept (constant), DoC, SS, and FR are displayed in Table 7. On either side of the scale, the values are evenly dispersed [62]. A negative number denotes the coefficients' direction.

Table 7 Coefficient values for different λ (hyper-parameter) value

Figure 6(a) and (b) depict the pattern of the coefficient values. The values are much closer to the mean, which reflects the tuning of the model is proceeding in the right direction [61]. Both positive and negative correlations were noticed in the coefficient plot. Figure 6(a) represents the lambda value of 3. Figure 6(b) depicts the p-value trend for different values of lambda.

Fig. 6
figure 6

Coefficient values and their P-value trend. a Coefficient values of the variables. b P-value trend for different values of lambda

Figure 6(a) and (b) shows the coefficient plot for the variables as shown in Table 8. It was noticed that the coefficients were decreasing with the λ value reaching the optimum value. This signifies that the model is highly accurate and more significant [63]. The residual plot for all the lambda values is shown in Fig. 7(a), (b), (c), and (d).

Table 8 P-values for different λ (hyper-parameter) value
Fig. 7
figure 7figure 7

Residual plots. a Residual plot of the variables λ = 0.0. b Residual plot of the variables λ = 0.5. c Residual plot of the variables λ = 0.75. d Residual plot of the variables λ = 2.0

Histogram in the residual plot shows the frequency distribution. The histogram [Fig. 7 (d)] for the λ value of 2.0, is distributed uniformly across both sides partially. Although the histogram is not perfect it is still within the acceptable range. Heteroscedasticity data was noticed in the verses fit, which is a clear sign of being unbiased. The linearity in the normal probability plot passes over all the residual points with minimum error, which is also a good fit. A two-way residual distribution was observed in the versus order.

3.1.2 Variance inflation factor

Variance Inflation Factor (VIF) determines the strength of the correlation between the independent variables. It is predicted by taking a variable and regressing it against every other variable. The R2 value is determined to find out how well an independent variable is described by the other independent variables. A high value of R2 means that the variable is highly correlated with the other variables [68, 69, 75]. This is captured by the VIF, denoted by the Equation,

The Variance Inflation Factor (VIF) is a statistical measure used to assess the degree of correlation among independent variables. Performing a regression analysis of a variable against all other variables results in the prediction. The R2 value is utilized to assess the degree to which one independent variable is explained by the remaining independent variables. A strong correlation between a variable and its peers is indicated by a high R2 [65, 66, 76]. The aforementioned concept is quantified by the Variance Inflation Factor (VIF), represented by the following equation:

$${\varvec{V}}{\varvec{I}}{\varvec{F}}=\frac{1\boldsymbol{ }}{1-{{\varvec{R}}}^{2}}$$
(5)

If the VIF value is lesser than 5, then the model is said to have low multicollinearity between the independent variables. Similarly, if the VIF value is between 5 to 10, then the model has high collinearity, and the variables can be further tuned to fit the data. However, if the VIF value is above 10, it is removed or deleted from the data set as they are highly multicollinear. As shown in Table 9, the regression model exhibits the VIF value to be lesser than 5, which indicates that the factors are acceptable.

Table 9 Variance Inflation Factor

3.1.3 Linear regression

The regression equation denotes the statistical association between different independent variables and a response variable. Equation 5 represents the general regression equation.

$$Y= {\beta }_{0}+ {\beta }_{1} \left(A\right)+ {\beta }_{2} \left(B\right)+ {\beta }_{3} \left(C\right)$$
(6)

The aforementioned equation pertains to a linear regression model where Y represents the response variable, which in this case is surface roughness. β0 denotes the intercept or constant, while β1, β2, and β3 are the regression coefficients. The independent variables in question are denoted as "DoC," "SS," and "FR," represented by the letters A, B, and C, respectively. The linear regression equations for the variable 'Surface Roughness (SR)' were derived and expressed as Eqs. 6, 7, 8, and 9 for the lambda values of 0, 0.5, 0.75, and 2.0.

$$ \begin{aligned} SR & =0.2066+0.3028 \left(DC\right)+0.000197 \left(SS\right)\\ &\quad +0.000287 \left(FR\right) \end{aligned}$$
(7)
$$ \begin{aligned} {SR}^{0.5} & =0.6010+0.1407 \left(DC\right)+0.000092\left(SS\right)\\ &\quad +0.000440 \left(FR\right) \end{aligned}$$
(8)
$$ \begin{aligned}{SR}^{0.75}&=0.4042+0.2186 \left(DC\right)+0.000143 \left(SS\right)\\ &\quad +0.000442 \left(FR\right) \end{aligned}$$
(9)
$$ \begin{aligned} {SR}^{2} &=-0.634+0.7253 \left(DC\right)+0.000472 \left(SS\right)\\ &\quad - 0.00218 \left(FR\right) \end{aligned}$$
(10)

The model summary of the predicted values is expressed in Table 6. The surface roughness values predicted after substituting in Eq. (7) are very much close to the actual experimental values and also well within the defined upper limit and lower limit. The predicted surface roughness values are fitted by assuming a 95% Confidence Interval (CI) for each variable. Table 10 displays the surface roughness values that were obtained through experimentation and prediction, while also taking into account defined boundaries and a 95% confidence interval. The graphical representation in Fig. 8 depicts the predicted value of surface roughness within the range of the upper and lower limits.

Table 10 Experimental and predicted Surface roughness value with defined boundaries using 95% CI
Fig. 8
figure 8

Predicted value between upper and lower limit

4 Conclusions

The experimental process parameter combination is done by Taguchi design of experiment. L9 Orthogonal array was selected, and the experimental combinations were framed by using Minitab. The predicted surface roughness was very much aligned with experimental results. This high-level accuracy was able to be established only after optimizing the hyperparameter lambda (λ). The following conclusions are drawn:

  • The optimum λ value = 2 was determined using the box-cox transformation machine learning technique. This equation was further used for experimental data validation. Using a Bix-Cox transformation and an optimal lambda value can help improve the performance of statistical models and enhance the validity of their predictions or estimations, particularly when the assumptions of those models are violated due to skewed or non-normal data distributions. By applying the Box-Cox transformation with the optimal λ value of 2, improved performance was achieved in the statistical model and enhanced the validity of its predictions or estimations. This transformation effectively addressed any issues related to skewed or non-normal data distributions, allowing the statistical model to make more accurate predictions and estimations.

  • The coefficients also have equal contributions and were not biased. This proves that the regression equation built by using the optimum lambda value using box-cox transformation techniques has higher accuracy and high-reliability confidence. This approach of using the Box-Cox transformation technique with the optimal lambda value has not only enhanced the accuracy of the regression model but has also boosted its reliability and confidence. This demonstrates the effectiveness of experimental design, data preprocessing, and model-building processes.

  • Obtained regression equation (Eq. 9) had a mean error of 0.03%, which was closer to the measured value. Having a mean error of only 0.03% in the obtained regression equation is a remarkable realization. A low mean error indicates that the predictions made by the regression model are very close to the actual measured values. This level of accuracy is desirable and suggests that your regression model is performing exceptionally well in capturing the relationships between the predictors and the response variable.