Introduction

Surface roughness (arithmetic average, R a ) is commonly used as one of the principal methods to assess high-quality products in the manufacturing industries.

The ability of a workpiece to distribute and hold a lubricant as well as to accept a coating, resist fatigue, friction, and wear is related to its surface roughness. Also the interactions between the workpiece material-cutting tool-machining system have an impact on surface roughness, surface texture, and dimensional deviations of the product being machined.

The idea of predicting the surface roughness prior to machining has attracted a lot of attention, being the main goal of a number of research efforts.

An artificial neural network (ANN) is a mathematical model of a physical process which comprises a large number of processing elements organized into layers.

Before a network can perform a useful task, it has to be trained using a set of inputs and known outputs. Once trained, the network is able to give a particular answer for a given set of inputs.

In this article, several ANNs are developed to predict surface roughness on Al 7075-T7351 after face milling process. Five inputs (cutting speed, feed per tooth, axial depth of cut, chip’s width, and chip’s thickness) were considered for the development of the networks (Radial Base, Feed Forward and General Regression). The networks were compared between each other, selecting the one that achieved the best performance when comparing the predicted and measured surface roughness value.

Literature Review

Engineered components must satisfy surface texture requirements and, traditionally, surface roughness (arithmetic average, R a ) has been used as one of the principal methods to assess quality. The surface roughness value is a result of the cutting parameters and tool wear. Moreover, surface finish influences mechanical properties such as fatigue behavior, wear, corrosion, lubrication, and electrical conductivity. Thus, measuring and characterizing surface finish can be considered for predicting machining performance.

Luong and Spedding (Ref 1) applied neural network technology for the prediction of machining performance in metal cutting. Their network was trained using data from a machining data handbook. They concluded that the network was able to determine conditions for a given material and required depth of cut, and to predict the performance of the process in terms of cutting forces, surface finish, and tool life. Also they recognized that there is a lack of guidance on network design.

For the prediction of surface roughness, Benardos and Vosniakos (Ref 2) used a feed forward ANN. The experimental data was obtained after face milling Al alloy normally used in aerospace applications. They concluded that an ANN can be used reliably, successfully, and very accurately for the modeling of surface roughness formation mechanism and the prediction of its value in face milling.

In 2005, Bisht et al. (Ref 3) developed a back propagation neural network for the prediction of flank wear in turning operations. In their case, they included the chip width in addition to the existing inputs (cutting speed, feed rate, depth of cut, and cutting forces). They concluded that the back propagation neural network could be trained for the effective prediction of flank wear during turning operations.

Pal and Chakraborty (Ref 4) predicted the surface roughness in a turning process by using a back propagation neural network. A large number of experiments were performed on mild steel using a high-speed cutting tool. They showed the efficacy of a back propagation neural network for predicting surface roughness in turning.

In 2006, Basak et al. (Ref 5) developed radial basis neural network models when turning AISI D2 cold-worked tool steel with ceramic tool. They identified the best values of cutting parameters for a desired value of surface roughness.

In 2006, Zhong et al. (Ref 6) predicted surface roughness heights R a and R t of turned surfaces using a neural network. The experiments were conducted using aluminum and copper rods of 19 mm diameter. Their study showed the effect of the neural network and hyperbolic tangent and sigmoid activation functions on the accuracy of the network.

The determination of best cutting parameters leading to a minimum surface roughness in end milling mold surfaces used in biomedical applications was done by Oktem et al. (Ref 7). For their research, they coupled a neural network and a genetic algorithm (GA) providing good results to solve the optimization of the problem.

In 2007, Lin et al. (Ref 8) developed a surface prediction model for high-speed machining 304L stainless steel, Al 6061-T6, SKD11, and Ti-4Al-4V. For this purpose, the finite element method and neural network were coupled, and they concluded that surface roughness may quickly be determined from the prediction model developed when the process parameters are set.

In 2007, Jesuthanam et al. (Ref 9) proposed the development of a novel hybrid neural network trained with GA and particle swarm optimization for the prediction of surface roughness. The experiments were carried out for end milling operations, and they found that the proposed hybrid neural network is competent in terms of computational speed and efficiency over the neural network model.

As it is observed once completing the literature review, most of the ANN has been developed for turning process giving less attention to face milling operations. Due to this fact and considering the importance of the face milling process in many industries, for example, the mold and die industry, the radial base (RBNN), feed forward (FFNN), and generalized regression (GRNN) neural networks architecture are developed in this research to help engineers in the manufacturing field to predict the desire surface roughness for a specific environment by selecting the optimum combination of cutting parameters.

Experimental Procedure

Workpiece Characteristics

Aluminum alloy 7075-T7351 square bars, 333 × 76.2 × 31.75 mm3 as suggested by ISO Standard 8688-1 (Ref 10), were used for the experiments. This material was selected for the experiments because it is widely used where low weight is needed such as in plates, sheet, and extrusions of airframes. Also, this Al alloy has some advantages such as good resistance, high strength, heat treatable, and high toughness. Table 1 and 2 show the chemical composition and the mechanical properties of the Al 7075-T7351 alloy, respectively.

Table 1 Chemical composition of Al 7075-T7351
Table 2 Mechanical Properties of Al 7075-T7351

Tool and Tip Characteristics

A standard insert holder of ØTool = 32 mm with two cutting edges (inserts) Z = 2 were used for the experiments. As tool insert, SDHT 120508FR-ALP CWK26 (tool nose radius of 0.8 mm), was used as recommended by the tool suppliers for Al alloy under a wet cutting operation.

The ØTool = 32 mm was selected to machine the whole width of the workpiece in a single pass since the tools diameter is bigger than the workpiece’s width (ØTool = 32 > 31.75 mm).

This condition (ØTool > W) as well as a symmetric position between the tool and the workpiece will achieve a better performance of the tool (longer tool life) as suggested by Diniz and Filho (Ref 11). Figure 1 shows a scheme of the cutting process.

Fig. 1
figure 1

Scheme of the cutting process

Cutting Parameters

Cutting speed, feed per tooth, and the axial depth of cut were the variables chosen for the study because from previous research it was observed that these variables had the most influence on surface roughness and tools life. A low, medium, and high level was selected for each of the cutting parameter to have a wide range of combinations. Also the selected values are recommended from the tool supplier when cutting Al alloy under wet cutting conditions.

Selected cutting parameters are shown in Table 3.

Table 3 Selected cutting parameters

Equipment Characteristics

A Deckel Maho, DMV 50 eVolution, high-speed vertical machine center with a maximum spindle speed of 18,000 rpm was used for the face milling operations.

The tests were conducted under MQL (minimum quantity of lubricant) since tool wear was not considered as a criterion that will affect the result of the cutting process due to the fact that Al 7075-T7351 is a very soft material and the cutting process was conducted under wet cutting conditions.

Design of Experiments: Taguchi Method

An efficient method of experimental planning is design of experiments (DoE), which incorporates the orthogonal arrays (OAs), developed by Taguchi, to successfully design and conduct fractional factorial experiments that can collect all the statistically significant data with the minimum possible number of repetition.

The selection of the appropriate OA is based on the following criteria: the numbers of factors and interactions of interest, the numbers of levels for the factors of interest, and the desired experimental resolution or cost limitation (Ref 12).

Although a three-level factor is considered as a small OA, due to cost and time saving the L9 array was selected, and Table 4 shows the OA for the three cutting parameters selected. The values 1 to 3 indicate the levels of the three cutting parameters as these are defined in Table 3.

Table 4 L9 orthogonal array for the experiments

Roughness Measurements

A Profilometer, ProScan 2000, using a white lamp was used for the measurement of R a values. The sample size used for this case was 4 mm in the X direction and 4 mm in the Y direction, and 1335 steps with a size of 0.003 mm each to cover this 4 × 4 mm2 area was used in the experiments as recommended in the ProScan manufacturer’s guide for face milling operations.

Three measurements were taken in each sample after cutting a length of 333 mm as shown in Fig. 2.

Fig. 2
figure 2

Scheme indicating the areas where surface roughness measurements were taken

Because there was no difference between each of the three surface roughness measurements made in each sample, an average of them was taken and considered as the roughness of the specimen for a specific cutting condition.

Chip Measurement

Due to the importance of the chips geometry on the surface roughness, since it’s related to the feed and axial depth of cut, the chips generated from each trial were collected and measured to include these variables as one of the inputs of the networks.

The chip’s width (C w) was measured with an Olympus SZ61 optical magnifier at ×12, and for the chip’s thickness (C t) measurement, a TESSA micrometer with a range of 0 to 25 mm with a resolution of 0.01 mm was used. Each dimension (width and thickness) was measured three times, reporting only the average values for each of them.

The Artificial Neural Network

An ANN consists of a number of elementary units called neurons. A neuron is a simple processor which takes one or more inputs and produces an output. Each input has an associated weight that determines the intensity of the input. A network can be trained to perform certain tasks where the data is fed into the network through an input layer. This is processed through one or more intermediate hidden layers and finally it is fed out to the network through an output layer as shown in Fig. 3.

Fig. 3
figure 3

Inputs and outputs of the network

The MATLAB 6.5 software, Neural Network Toolbox function, was used to create, train, validate, and predict the different ANN reported in this research.

It must be highlighted that the best network architecture is reached by trial and error after considering different combinations of the number of neurons in the hidden layer, number of hidden layers, spread parameter, and learning rate, depending on the type of neural network being used.

From the nine trials, six of them were picked randomly (1, 3, 5, 6, 7, and 8) and were used for training and validating the network, and the other three trials (2, 4, and 9) were used for predicting.

During the training of the network, the calculated output is compared to the target output, and then the mean square error percentage (%MSE) is calculated by using Eq 1.

$$ {\text{\%}} {\text{MSE}_{X}} = \left| \frac{{{R_{a} - R_{{a}_{X}}}}}{R_{a}} \right| \cdot 100, $$
(1)

where %MSE X is the Mean Square Error in %, R a is the measured surface roughness (μm). X is either trained or predicted depending on the case.

Radial Base Neural Network

This network architecture can be designed in a very short period of time since it consists of three layers: (1) input layer, (2) hidden layer, and (3) output linear layer.

There are only two types of radial base networks, the newrbe (exact design) and the newrb (more efficient design). For this study, the newrb was selected since this architecture interactively creates one neuron at a time. Neurons are added to the network until the sum-squared error falls beneath an error goal or a maximum numbers of neurons have been reached (MATLAB user’s guide, Ref 13).

Feed Forward Neural Network

This network is one of the most popular multi-layer architectures proving to be an excellent universal approximation of nonlinear functions. Its ability to map complex input-to-output relationships with acceptable error best demonstrates its suitability. There are many variations of feed forward networks and in this case, the Lavenberg-Marquardt algorithm was selected. This algorithm is designed to approach second-order training speed without having to compute the Hessian Matrix, being the fastest method for training moderate-sized FFNNs.

Generalized Regression Neural Network

This type of network is often used for function approximation. It has a radial basis layer and a special linear layer. The first layer is just like the newrbe network and it has as many neurons as there are input/target vectors.

Experimental Setup

Figure 4 shows a scheme of the experiment setup used in this research. As it is observed once each trial was concluded, experimental surface roughness was measured and chip was collected for further measurement. The ANNs were developed considering cutting parameters and chips geometry as input variables and the surface roughness as the output variable. The predicted surface roughness obtained from each developed ANN was compared with experimental (measured) values of surface roughness.

Fig. 4
figure 4

Diagram of the experimental setup

Figure 5 shows the flow chart used for the three networks studied in this research to reach the minimum %MSE for trained and predicted values of surface roughness.

Fig. 5
figure 5

Flow chart to reach a minimum %MSE at the same time for trained and predicted values of surface roughness

Pearson Correlation Coefficient

To clarify the matching between surface roughness and the inputs studied (cutting speed, feed per tooth, axial depth of cut, chip’s width, and chip’s thickness), a correlation among these parameters is determined.

A correlation is a statistical technique which can show whether and how strongly pairs of variables are related. The main result of a correlation is called correlation coefficient (or r). It ranges from−1.0 to +1.0. The closer the r is to +1 or −1, the more closely the two variables are related.

If r is close to 0, it means that there is no relationship between the variables. If r is positive, it means that as one variable gets larger, the other gets larger. If r is negative, it means that as one gets larger, the other gets smaller (often called an inverse correlation). There are several different correlation techniques but the most common one is the Pearson correlation and this is the one that has been selected for this study.

Results and Discussions

Once all the experiments were concluded, the following results were obtained and divided in five subsections for better understanding.

Values of Surface Roughness (R a ) Measured with the ProScan 2000 Profilometer

Table 5 shows the surface roughness average values obtained once the face milling process was concluded for specific cutting conditions.

Table 5 Surface roughness and average surface roughness obtained from face milling operations under different cutting conditions

Chip Measurement

Table 6 shows the average value of the chip’s width and thickness once the face milling process was concluded for a specific cutting condition.

Table 6 Chip’s width (C w) and thickness (C t) obtained from face milling operations under different cutting conditions

As reported in section 3.7, chip’s area (width and thickness) depends on the axial depth of cut and the feed per tooth. As it can be observed, chip’s width values obtained and reported in Table 6 are almost the same as the axial depth of cut values selected for the experiments, but there are some differences in the values of the chip’s thickness reported in Table 6 when compared to the feed per tooth values used in this work. This result is probably due to the fact that since new tips are screwed into the tool holder for each trial, generating a discrepancy in the location of the cutting tools teeth and the cutter axis inclination with regards to the direction that must be maintained during the cutting process. This is what Franco et al. (Ref 14) called tool errors or radial and axial runouts.

Artificial Neural Network

Radial Base Neural Network

To start the training of this neural network, five inputs (cutting speed, feed per tooth, axial depth of cut, chip’s width, and chip’s thickness), six neurons in the hidden layer, a goal of 0.01, and a spread parameter of 0.8 varying it every 0.05 were considered.

In Fig. 6, goal’s performance for this radial base network is observed. Also, this figure shows how a goal of 0.01 is reached around 2.6 epochs.

Fig. 6
figure 6

Goal performance for radial base network

Since the goal value of 0.01 was reached so soon (before reaching 3 epochs), then three neurons instead of six were considered in the hidden layer.

Figure 7 shows the architecture of the RBNN selected and used in this study, where number 5 represents the number of inputs, number 3 the number of neurons in the hidden layer, and number 1 the number of outputs, in our case the surface roughness value.

Fig. 7
figure 7

Radial base network architecture used in this study

Table 7 shows the surface roughness (R a T) obtained by training the network for a goal of 0.01 and different spread parameter. Also, measured surface roughness (R a ) is shown.

Table 7 Spread parameter, measured and trained surface roughness (R a , R a T), and %MSE obtained by training the RBNN with a goal of 0.01 and 3 neurons in the hidden layer

Due to the small amount of values for training the network, it is observed that a spread parameter from 0.05 to 0.3 gave smaller %MSE, but as it was consulted in the literature (MATLAB user’s guide), the spread parameter must be large enough that the radial base neurons respond to overlapping regions of the input space, but not so large that all the neurons will respond in essentially the same manner. For this reason, the network was trained again, but in this case, the network was stopped when the calculated surface roughness values obtained when training the network (R a T) and the predicted surface roughness values (R a P) showed at the same time the minimum %MSE when comparing them to the target output which is the measured surface roughness (R a ). In our case, this value was reached when using a spread parameter of 0.6.

Table 8 shows the measured and predicted surface roughness value (R a and R a P) obtained when using three neurons in the hidden layer, a goal of 0.01, and a spread parameter of 0.6.

Table 8 Measured and predicted surface roughness and mean square error when training the RBNN with a goal of 0.01 and a spread of 0.6

As it can be observed in Table 8, a %MSE of 17.32 between the measured (R a ) and the predicted values of surface roughness (R a P) was obtained by using the selected RBNN.

Figure 8 shows the results obtained with the measured and trained surface roughness, and Fig. 9 shows the results of the measured and predicted values of surface roughness.

Fig. 8
figure 8

Measured and trained values of surface roughness for different trials using the selected RBNN architecture

Fig. 9
figure 9

Measured and predicted values of surface roughness for different trials using the selected RBNN architecture

Feed Forward Network

For this neural network, same as the RBNN, five inputs (cutting speed, feed per tooth, depth of cut, chip width, and chip thickness) were considered.

The Lavenberg-Marquardt algorithm was selected and from trial and error, a maximum of two hidden layers and a maximum of nine neurons were used. The activation function for the three layers architecture network (input layer, hidden layer, and output layer) were tansig and purelin, and the activation function for the four layers architecture network (input layer, hidden layer 1, hidden layer 2, and output layer) were tansig, logsig, and purelin. Also, a goal of 0.01, 100 epochs, and a maximum of 400 iterations were used. This number of iterations corresponds to the minimum %MSE obtained by the trained and predicted values of surface roughness at the same time.

Table 9 shows just few numbers of the total iterations, the FFNN architecture, the trained surface roughness (R a T), and the trained and predicted mean square error, %MSET and %MSEP, respectively.

Table 9 FFNN architecture, trained surface roughness (R a T), MSET, and MSEP

As it is observed in Table 9, the FFNN architecture that achieved the minimum %MSE for both trained and predicted surface roughness value is 5-3-7-1. Also it observed an error of 8.5% between the measured roughness (R a ) and the trained surface roughness (R a T), and an error of 5.5.% between the measured roughness (R a ) and the predicted surface roughness (R a P), was obtained.

Figure 10 shows the architecture of the selected feed forward network.

Fig. 10
figure 10

Feed forward architecture network selected for the study

Table 10 shows the measured and predicted surface roughness values (R a and R a P, respectively) obtained when using this 5-3-7-1 FFNN architecture on trials 2, 4, and 9.

Table 10 Measured (R a ) and predicted (R a P) surface roughness and %MSE when using a feed forward architecture network 5-3-7-1

Figure 11 shows the results obtained with the measured and trained surface roughness, and Fig. 12 shows the results of the measured and predicted values of surface roughness for the 5-3-7-1 FFNN architecture.

Fig. 11
figure 11

Measured and trained surface roughness for a 5-3-7-1 feed forward architecture network

Fig. 12
figure 12

Measured and predicted surface roughness for a 5-3-7-1 feed forward architecture network

Generalized Regression Neural Network

To start the training of this GRNN, a spread parameter of 0.8 varying it every 0.05 was selected, and from this result, it was observed that the minimum % MSE was obtained with a spread parameter of 0.3.

Due to this fact, an extra training was done with a spread parameter of 0.3 but varying this last one every 0.005. Also, the algorithm was built to show the minimum %MSE between the measured and the trained and predicted value of surface roughness at the same time.

The selected GRNN architecture showed a minimum %MSE with a spread parameter of 0.295.

Figure 13 shows the architecture of the GRNN, with five inputs, six neurons in the hidden layer, and 1 output.

Fig. 13
figure 13

Generalized regression architecture network selected for the study

Table 11 shows the result of measured, trained, and predicted surface roughness as well as the minimum %MSE obtained with the selected GRNN architecture.

Table 11 Measured (R a ) and predicted (R a P) surface roughness, %MSET, and %MSEP when using a GRNN with a spread of 0.295

Performance Assessment

The performance of each ANN used in this research was measured with the mean square error (%MSE).

Table 12 shows the minimum %MSE obtained for each selected architecture, and Fig. 14 and 15 illustrates this performance for trained and predicted values of surface roughness, respectively.

Table 12 Minimum %MSE predicted obtained for the different selected ANN architecture
Fig. 14
figure 14

Comparison of trained and measured surface roughness obtained for each selected network for different trials

Fig. 15
figure 15

Comparison of predicted surface roughness obtained for each selected network for different trials

When comparing the %MSE of the three selected studied architecture shown in Table 12, it is observed that the FFNN shows the closest and minimum result when training the network and when predicting the values of roughness. In this case, a 5.5% of difference between the predicted and the measured values of surface roughness can be interpreted as a very good approach.

Also, it is observed that a low %MSET = 0.012 is obtained when training the GRNN and this is due to the fact that this type of net uses a maximum number of neurons equal to the maximum numbers of trials used to train the net; with this fact and an adequate spread parameter, the difference between the measured and the trained roughness is null since an exact RBNN (newrb) was used.

Pearson Correlation Coefficient Analysis

Table 13 shows the Pearson correlation coefficient obtained for each parameter.

Table 13 Pearson correlation coefficient among surface roughness and inputs studied

From Table 13 it can be observed that there is a strong correlation between the chip thickness (C t) and the chip width (C w) on the surface roughness since a correlation coefficient value of 0.53 and 0.50 was obtained, respectively. From the cutting parameters, the strongest correlation is between the cutting speed and the surface roughness (−0.49), but in this case, it is a negative correlation, which means that the magnitude of surface roughness is increased as the cutting speed is decreased. This result is in agreement with previous researches (Ref 15, 16).

Conclusions

In this study, ANNs with five inputs have been developed to predict arithmetic surface roughness (R a ) when face milling Al 7075-T7351.

FFNN showed the best results (%MSEP = 5.5) but it took more time training this network compared to the RBNN and the GRNN.

Due to the fact that the outcome of the training greatly depends on the initialization of the weights, which is done randomly by the MATLAB software, each ANN must be trained few times until the minimum %MSE between the trained and predicted surface roughness against the measured surface roughness is achieved.

From Pearson correlation analysis, the cutting speed is the most important cutting parameter affecting the machined surface quality.