1 Introduction

Among various advanced machining processes, laser cutting is one of the most widely used thermal-based processes applied for processing a wide range of materials. Laser cutting technology finds many applications in various manufacturing industries due to its convenience of operation, high precision, small heat-affected zone, minimum deformity [24], low cost, high quality of end product [19], high processing speed, and low waste [25]. Additionally, as a noncontact, it is well suited for advanced engineering materials, such as difficult-to-cut materials, brittle materials, electric and non-electric conductors, and soft and thin materials [6]. Numerous advantages and possibilities of laser cutting technology motivated considerable theoretical and experimental research aimed at better understanding of the laser cutting process. Comprehensive review papers about laser cutting are available [7, 8, 16].

The laser cutting process is characterized by a number of controllable and uncontrollable process parameters and their interactions, which in turn determine the efficiency of the whole process in terms of productivity, quality, and costs. Maximization of productivity and quality along with cost minimization are of particular interest to manufacturers. Each of these goals often requires “optimal” selection of cutting parameter settings. With a limited theoretical and practical background to assist in the systematic selection, these parameters are usually set by previous experience in time consuming trial and error procedure. Consequently, it is of great importance to exactly quantify the relationships between laser cutting parameters and cutting performances through mathematical modeling. Different methodologies were employed for modeling of laser cutting, such as analytical methods, statistical experimental design, and artificial intelligence (AI)-based methods. Few researchers concentrated on modeling and optimization of laser cutting through AI-based techniques, such as artificial neural networks (ANNs) [1, 4, 19, 26] and fuzzy logic [22]. Chen et al. [6] presented an approach for optimization of laser cutting process with multiple performance characteristics based on Taguchi method and grey relational analysis.

ANNs as one of the most used AI methods have proved to be one of the most powerful modeling tools based on the statistical approach. The ability of ANN to capture any complex input–output relationships from limited data is very valuable in manufacturing processes where huge experimental data for the process modeling are difficult and expensive to obtain [13]. In addition, they are more efficient modeling tool when compared with other classical prediction methods [2]. They are especially suitable in situations where mathematical formulae and prior knowledge on the functional relationship between process parameters are unknown. Among different ANN types, single hidden layer back propagation (BP) ANN is considered as most popular variant of multilayer feed forward ANN for modeling of complex nonlinear relationships between multiple input and the output variables [4]. The most common training algorithm for ANNs is the BP algorithm and its variants because it is stable and easy to implement. Conventional BP and faster algorithms like Levenberg–Marquardt have been already used for input/output modeling in the field of laser cutting [19, 26]. However, because ANNs generate complex error surfaces with multiple local optima, even for simple functions being estimated, BP tends to become trapped in local solutions that are not global [9]. In addition, the basic BP algorithm is often very slow to converge in real practice [10]. Considering the disadvantages of the BP, researchers observed that genetic algorithms (GAs) might prevent ANN from falling into a local optimum during training [9, 14]. Gupta and Sexton [9] showed that the GA is superior to the BP in effectiveness, ease of use, and efficiency for training. Furthermore, as noted by [3], real coding is the most suitable coding for continuous domains, and it is beneficial to use real coded genetic algorithm (RCGA) for the ANN training.

As the present study deals with relatively small number data obtained from Taguchi’s experimental design, training of ANNs using the RCGA is thought to be an appropriate algorithm for the purpose of developing accurate and robust ANN models for the prediction of cut quality in CO2 laser cutting.

2 Modeling methodology

Relationship between laser cutting parameters (cutting speed, laser power, and assist gas pressure) and the output parameters (kerf width and surface roughness) was obtained from the ANN models. Two ANN models with the same architecture were designed for each of the output parameters. Experimental results, obtained from Taguchi’s experimental design, were used for developing predictive models. The experimental data were randomly divided into a data subset for the ANN training (N tr = 20), and data subset for testing the prediction accuracy of the ANN models (N ts = 5). Among the various kinds of ANN models, feed-forward multilayer perceptron (MLP) was selected due to its robustness and ability to approximate any nonlinear relationship. To overcome the local convergence problem of the BP, RCGA was chosen as an appropriate method to be used for the ANN training. The aim of RCGA is to simultaneously search for the near optimal set of weights and biases of the ANNs so that ANNs are able to accurately model the CO2 laser cutting process.

3 Experimental setup

Experiment trials were conducted using a 2.2 kW CO2 ByVention 3015 laser cutting machine provided by Bystronic Inc. The cuts were performed with a Gaussian distribution beam mode (TEM00) on 2-mm-thick commercial mild steel sheet. Oxygen gas with a purity of 99.95 % was employed as an assist gas. The laser beam was focused on the surface of the samples using a focusing lens with a focal length of 5 in. (127 mm). The conical shape nozzle (HK10) with 1-mm nozzle diameter was used. The nozzle–work piece stand-off distance was controlled at 0.7 mm. Focusing lens, focus position, nozzle diameter, nozzle stand-off distance, and sheet thickness were kept constant throughout the experimentation.

In the present experimental study, three input parameters, namely: cutting speed (v), laser power (P), and assist gas pressure (p), were considered. The parameter ranges were varied by about 40 % above and below their normal operating level as recommended by the machine manufacturer (Table 1).

Table 1 Laser cutting parameters and their levels

To study the entire experimental region with minimum cost and time of the experiment, Taguchi experimental design was applied. Based on the selected parameters and parameter levels, a design matrix was constructed in accordance with the standard L 25 Taguchi’s orthogonal array (Table 2).

Table 2 Experimental design and results

Two cuts, 60-mm length each, were made in every experimental trial. The laser cut quality was assessed by measuring kerf width and surface roughness. Surface roughness on the cut edge was measured in terms of the average surface roughness (R a) using Surftest SJ-301 (Mitutoyo) profilometer. Cutoff length was 0.8 mm, and evaluation length was 4 mm. Each measurement was taken along the cut at approximately the middle of the thickness, and the measurements were repeated three times to obtain averaged values. The kerf width (K w) of each cut was measured at three equally distanced locations along the length of cut by means of Eschenbach scaled magnifier (10×) with illumination and digital micrometer (Mitutoyo) with accuracy of 0.01 mm.

4 Kerf width and surface roughness ANN models

4.1 ANNs functioning

The MLP represents fully connected feed-forward network architecture composed of many interconnected neurons grouped into input, hidden, and output layer. The functioning of a three-layer feed-forward ANN with n input neurons, m hidden neurons, and one output neuron can be expressed by the following mathematical relation:

$$\hat{y} = g\left( {\sum\limits_{m} {{\bf W} \cdot f} \left( {\sum\limits_{n} {{\bf V} \cdot {\bf X} + {\bf B}} } \right) + b_{\text{ok}} } \right) $$
(1)

where V and W are weight matrices between input and hidden layer and hidden and output layer, respectively; B is the bias matrix of hidden neurons; b ok is the bias of output neuron; X is the vector of input parameters; f and g are transfer functions in hidden and output layer, respectively; and \( \hat{y} \) is the ANN prediction.

Determining weights and biases, in which the “knowledge” of the ANN is kept, is done during training process where examples (input–output pairs) are presented to the ANN. Training is a continuous process, which is repeated until the differences between predicted and the target (experimental) values are below previously defined threshold. The trained ANN should be tested on selected set of input–output data (testing data) in order to assess its ability to predict and make generalization on the basis of the acquired “knowledge”. Finally, the trained and tested ANN can be used for modeling and prediction of outputs when it is presented with new (original) input data.

4.2 ANNs design

The ANN models were aimed to predict the kerf width (K w) and average surface roughness (R a) considering three process parameters, namely: cutting speed (v), laser power (P), and assist gas pressure (p). To this aim, two ANN models were developed:

  • Model 1—which relates v, P, p, and K w , and

  • Model 2—which relates v, P, p, and R a .

In order to increase prediction accuracy, stabilize and enhance ANN training, the input and output data were normalized between −1 and 1 by the following equation:

$$ x_{i}^{n} = 2 \cdot \left( {x_{i} - x_{\min } } \right)/\left( {x_{\max } - x_{\min } } \right) - 1;\,\,i = 1, \ldots ,n $$
(2)

where \( x_{i}^{n} \) is the normalized value for the variable, and x min and x max are the minimum and maximum of each variable x i .

Linear transfer function and hyperbolic tangent sigmoid transfer function were used in the output and hidden layer, respectively. These transfer functions were used, since it was assumed that there exists nonlinear relationship between input and output process parameters.

It has been widely reported that ANN with a single hidden layer are able to approximate any arbitrary function to a given accuracy. Therefore, the selection of ANN architecture can be reduced to finding the “optimal” number of hidden neurons. Too few neurons in hidden layer can lead to under-fitting, whereas too many neurons can contribute to over-fitting [13].

For the single hidden layer ANN architecture with n input neurons, m hidden neurons and k output neurons, the total number of weights and biases can be expressed as

$$ T = m \cdot \left( {n + k + 1} \right) + k $$
(3)

The upper limit of number of hidden neurons can be determined considering that the total number of weights and biases in the ANN does not exceed the number of data for training (N tr). Though the ANN can still be trained, the case is mathematically undetermined [20]. For ANNs with single output neuron (k = 1), the upper limit of number of hidden neurons can be determined by

$$ m_{\text{up}} \le \left( {N_{\text{tr}} - 1} \right)/\left( {n + 2} \right) $$
(4)

In order to take full modeling potential of the ANN, the 3-3-1 ANN architecture was selected for both models.

4.3 ANNs training

The ANN models were trained with the RCGA using the available training data. The RCGA itself is not discussed, and the details are available elsewhere along with numerous examples of applications [3, 5, 15, 18]. A review related to RCGA is also available [11]. Prior to using the RCGA for ANN training, the explicit mathematical model of the 3-3-1 ANN architecture was created using MATLAB package. The aim of the RCGA was to explore the search space to find optimal or near-optimal weights and biases on the ANNs. These include weights between the input layer and the hidden layer, weights between the hidden layer and the output layer, biases of the hidden neurons and bias of the output neuron. Therefore, the optimization problem involved determining of total 16 weight and bias values, that is,

$$ \begin{gathered} {\bf V} = \left[ \begin{gathered} v_{11} \quad v_{12} \quad v_{13} \hfill \\ v_{21} \quad v_{22} \quad v_{23} \hfill \\ v_{31} \quad v_{32} \quad v_{33} \hfill \\ \end{gathered} \right]; \hfill \\ {\bf W} = \left[ {w_{1} \quad w_{2} \quad w_{3} } \right]; \quad {\bf B} = \left[ {b_{1} \quad b_{2} \quad b_{3} } \right]; \quad b_{\text{ok}} ; \hfill \\ \end{gathered} $$
(5)

The objective of the RCGA application was to approximate connection weights and biases given in Eq. (5) such that to minimize the following fitness function:

$$ E = \sum\limits_{i = 1}^{{N_{tr} }} {\left| {d_{i} - \hat{y}_{i} } \right|} $$
(6)

where d i and y i are the i-th target (experimental) and ANN predicted values, respectively.

Basically, obtaining the best optimal results depends on some features related to the RCGA parameters. Although some general guidelines about such selections exist in relevant literature, it was reported that optimal setting is strongly related to the design problem under consideration [1]. After few trials were conducted, the parameters of the RCGA’s for both models were set as shown in Table 3 since with these settings minimum error was observed.

Table 3 RCGA parameter values used for ANN training

Figure 1 shows the best function value in each generation versus iteration number and convergence of the optimization problem.

Fig. 1
figure 1

Plot functions of the best fitness for a Model 1 and b Model 2

After an iterative calculus, the RCGA provided the (near) optimal values for weights (V, W) and biases (B , b ok ) for the ANN models (Table 4).

Table 4 Experimental design and results

5 Results and discussion

5.1 Prediction performance of the ANN models

Regarding the data normalization, transfer functions used in the ANNs and using the weights and biases from Table 4, one can predict the kerf width and surface roughness using Eq. (1). The prediction performance of the ANN models was examined based on the correlation coefficient between the ANN predictions and the experimental values using both training and testing data. Figure 2a shows good agreement of the results for kerf width and the experimental results given in Table 2. Also, Fig. 2b shows good accuracy of calculation for the surface roughness, compared with the experimental values of the average surface roughness.

Fig. 2
figure 2

Comparison of measured and predicted values for a K w and b R a

To get a better picture of the ANN models’ prediction performance, the statistical methods of correlation coefficient (R) and mean absolute percent error (MAPE) were calculated separately for both training and testing data (Table 5).

Table 5 Experimental design and results

The results from Fig. 2 and Table 5 indicate that the developed ANN models can be used for the prediction of the kerf width and surface roughness for arbitrarily chosen values of cutting parameters with good accuracy. Thus, the influence of the cutting parameters on the kerf width and surface roughness can be studied using the developed ANN models. The effects of the cutting parameters were examined at different combinations of input parameters. Kerf width and surface roughness variation are shown in Figs. 3 and 4 with two parameters in interaction, keeping other parameter constant.

Fig. 3
figure 3

Variation of kerf width with a cutting speed and assist gas pressure, b laser power and cutting speed, and c assist gas pressure and laser power

Fig. 4
figure 4

Variation of surface roughness with a cutting speed and assist gas pressure, b laser power and cutting speed, and c assist gas pressure and laser power

5.2 Effects of cutting parameters on the kerf width

As seen from Fig. 3a, the effect of the cutting speed on the kerf width is nonlinear and variable. For assist gas pressure below 5 bar, in the first stage the kerf width increases with the increase of the cutting speed, but after a certain limit, the increase of the cutting speed adversely affects the kerf width, and the kerf width decreases with the increase of the cutting speed. Laser cutting is a complicated process because melting and evaporation of material take place simultaneously under laser beam power and exothermal reaction by the action of oxygen. For a given laser power level, reducing the laser cutting speed increases the duration for which the high temperature oxidation reaction takes place at the workpiece surface. In this case, the influence of the assist gas pressure is obvious, i.e., the combination of high assist gas pressure with low cutting speed enhances the energy coupling at the workpiece surface and increases the kerf width considerably (Fig. 3a). In general, it was observed that a decrease in the kerf width occurs with an increase in the cutting speed. These results are in agreement with those reported in references [17, 23, 25]. This is due to the fact that an increase in cutting speed reduces the rate of energy transfer from the laser source to the workpiece material [23] which results in decrease in side burning [21].

As seen from Fig. 3b, generally, increasing the laser power enhances the kerf width size. Moreover, as the cutting speed increases the kerf width reduces which is particularly evident at higher laser power levels. When comparing the present predictions with the previous findings [23, 25], it can be noted that both results agree. For cutting speed above 5,000 mm/min, the kerf width is decreasing slowly until a certain limit with the increase of the laser power but it begins increasing with the increase of the laser power after about 1,200 W. On the other side, for lower cutting speeds, the kerf width is increasing rapidly. These findings are in agreement with previously reported [25]. At higher laser power levels, the ignition zone is expected to be wider because of the higher heat input [17]. As the interaction effect of the laser power and the cutting speed determines the amount of heat that enters the workpiece during processing, combination of the lowest laser power level and the highest cutting speed level resulted in the lower heat input during machining and consequently, the lowest kerf width was observed.

For cutting speed of 5,000 mm/min, the increase in oxygen gas pressure produces small changes in the kerf width (Fig. 3c). However, the interaction effect of the assist gas pressure and the laser power of 1,500 W results in continual increase of the kerf width. When the high power laser beam is absorbed by the workpiece material, it results in solid-state heating, melting and evaporation of the workpiece material. The melted and the evaporated material together with the oxygen from the impinging gas initiate a high temperature exothermic reaction [25]. This ionizes the evaporation front slightly and generates a partially ionized surface plasma [12] which acts as a heat source, enlarging the size of the melt in the kerf. This effect was particularly pronounced when using high laser power (1,500 W) and high assist gas pressure (7 bar) when wide kerf is produced (Fig. 3c).

5.3 Effects of cutting parameters on the surface roughness

As seen from Fig. 4a, for laser power of 1100 W, an increase in the cutting speed decreases the surface roughness for assist gas pressures ranging from 3 to 7 bar. This is due to the fact that as the cutting speed increases, the interaction time between the laser beam and workpiece material decreases, hence the thermal energy available at the workpiece surface decreases, which results in minimum side burning of the cut edge. On the other hand, increasing the heat input (low cutting speed and high laser power) increases the kerf width and surface roughness.

An increase in laser power improves surface roughness in the 700–1,000 W laser power range (Fig. 4b). This is because laser cutting is less stable at low power levels [17, 21]. Above this range, the effect of the laser power is variable depending on the cutting speed.

At lower cutting speeds, there is sufficient time for heat diffusion and melting wider grooves which cause surface deterioration. Actually, for a given cutting speed and assist gas pressure, there exists an optimum laser power which provides good surface finish.

An increase in the assist gas pressure increases the surface roughness, and this functional dependence follows the same trend apart from the values of the laser power (Fig. 4c). By increasing the oxygen pressure, the exothermic-induced burning of the cut surface is increased and also drag force is enhanced which results in higher surface roughness.

It should be noted that the variations of the kerf width and surface roughness with laser cutting parameters were not consistent probably because of the interaction effects between the laser cutting parameters and/or presence of impurities and inclusions (pockets of phosphorus and sulfur) within mild steel workpiece.

With respect to the various combinations of laser cutting parameters used in the experimental design, the dross formation has been observed. In high-speed CO2 laser cutting of mild steel (experimental trials 21, 22 and 23 in Table 2), the different combinations of the laser power and assist gas pressure levels resulted in dross formation. This was also observed when using low cutting speeds (experimental trials 4 and 5 in Table 2) wherein high laser power and increased oxygen pressure led to excessive exothermal reaction resulting in wide irregular kerf and dross formation.

For most of the cutting conditions performed in the study, the top kerf width was larger than the bottom kerf width indicating the tapered nature of CO2 laser cutting as caused by the loss of laser beam intensity and gas pressure across the thickness of the cut.

6 Conclusions

This paper proposes an approach for the development of predictive models for a CO2 laser cutting using the ANN models trained by the RCGA. Experimental results from the Taguchi’s experimental design, where three cutting parameters (cutting speed, laser power and assist gas pressure) were arranged, were used to develop ANN prediction models for the kerf width and surface roughness. The ANN models’ prediction results were compared with the experimental results and were statistically assessed. Considering the ANN model architectures and available data for training, it can be concluded that the RCGA offers reliable ANN training along with good prediction results of ANNs. In addition, fast and easy implementation in MATLAB software package makes applied approach an alternative that can be effectively used in the prediction of CO2 laser cutting process. It should be noted that the prediction performance may be more enhanced by exploiting the full potential of the RCGA through fine tuning of the RCGA parameters.

It can be concluded that the proposed approach can be efficiently used for the mathematical modeling and analysis of the effects of process parameters on the cut quality characteristics with the ultimate aim of better understanding of the CO2 laser cutting process.

The practical application of the developed models derived from the study can provide a fundamental foundation for the manufacturers using laser cutting technology. Integration of the developed models with an optimization method will lead to improvement of product quality via appropriate selection of the process parameters.