Introduction

Friction stir welding process (FSW) found its implementation in many industrial applications [1, 2]; surely necessity arises in developing different techniques for monitoring the outcome of the process in terms of quality of the welded joints. But only few researchers had attempted monitoring of FSW process [3]. A mathematical model was proposed by  Mehta, et al [4] for real time measurement of traverse force and torque in terms of input current and power from the spindle and feed motors in order to monitor the FSW process. Acoustic emission signals were analyzed to study the effect of pin profiles on FSW process by Subramanian, et al [5] and boiled down to a conclusion that square pin profile adds more towards the strength of the joints.

Modeling of weld quality has become significant nowadays to offer better control over the process. Different approaches had been evolved for predicting the weld quality. Support vector machine (SVM) learning technique for classification as well as prediction in different welding processes has been attempted by researchers. Prediction of residual stresses in gas tungsten arc welding of dissimilar metals was attempted by Na, et al [6] using SVR models. They found that the SVR models are highly accurate in predicting the experimental results. Classification of defective and defect free welds was attempted by Wang, et al [7] from the X-ray image features of line welds. It was presented that SVM accurately classify the defective features and the defect free features. Application of SVM in material design was demonstrated in the work by Lu, et al [8]. Electric resistance of material, thickness control of semiconductor film were attempted to predict using SVM models and concluded that the developed models can be effective in material designing. Apart from the SVM approach, researchers mostly used artificial neural network (ANN) models for prediction of weld qualities in FSW and other processes [9,10,11].

From the aforementioned literature survey, it is realized that prediction of weld quality in FSW process was targeted by few researchers, and most of them considered only process parameters in their prediction mechanisms. For the complex process like FSW, where precise physics based formulation is lacking and the outcome of the process is governed by too many influencing factors, depending solely on selective process parameters for the development of more accurate monitoring system for strength prediction will not suffice. Inclusion of current signal features to the monitoring systems will surely enhance the accuracy. To demonstrate this, in the current research work, prediction of weld quality in terms of UTS values is proposed using main spindle motor current signal features. Time domain statistical features namely, root mean square (RMS), skewness, variance and kurtosis are computed from the signals and presented to SVR model developed for the prediction of UTS of the welds. The prediction performances of the SVR models are compared with the ANN models.

Experimental Procedures

In the present work, two aluminum alloy (AA1100) plates with dimension of 160 × 110 × 6 mm are used as the workpiece material to perform the FSW operation in butt joint configuration. The composition and mechanical properties of base material is shown in Table 1. After the welding, the welded samples are cut into specific dimensions provided in ASTM E8 manual for tensile testing. The tensile test is performed using a universal testing machine. A converted milling machine developed indigenously for friction stir welding process, is used in the research work. The welding setup has a three phase ac induction motor (maximum current rating of 19 A, 440 V supply voltage and 50 Hz) responsible for tool rotational speed. Current signals from main spindle motor are acquired using a Hall Effect current transducer. All the signals are acquired using a high speed data acquisition system at a sampling rate of 10 kHz.

Table 1 Composition and mechanical properties of AA1100

In FSW process, tool rotational speed and welding speed are the two most influencing process parameters responsible for rate of heat generation and mixing of the material [1]. Apart from these two parameters, shoulder diameter also plays an important role in FSW process [4]. Therefore, these three parameters are considered to study the FSW process behaviour. All these process parameters are chosen at four levels and a full factorial design method is used to obtain the design matrix. The levels of each parameter are listed in Table 2. A total of 43, that is, 64 experimental runs are obtained from the full factorial design. The design matrix is shown in Table 3 with responses. Experiments are performed randomly to reduce bias or experimental error. Straight cylindrical tool pin profile is used in all the experiments with fixed pin length of 5.7 mm, pin diameter of 6 mm and plunge depth of 0.06 mm.

Table 2 Levels of process parameters
Table 3 Design matrix with responses

Analysis of Current Signal

In this study, four statistical features namely RMS, skewness, kurtosis and variance of current signal are computed and listed in Table 3 to analyze the signal in time domain. The magnified view of signals acquired during Exp. No. 36 and Exp. No 45 are shown in Fig. 1. It is to note that, these two experiments correspond to minimum and maximum UTS, respectively, and performed against same tool rotational speed. But from the figure, the difference in the current signal magnitude for these two cases can be clearly seen and is supported by the change in different statistical domain features tabulated in Table 3. This entails the fact that the process parameters may go inconsistent in the presence of other unknown influencing factors, the cumulative effects of which will bring some change in the process and thus in the weld quality. Therefore, monitoring based on signal feature will provide a rich way for the evaluation of the process performance.

Fig. 1
figure 1

Magnified view of main spindle motor current signal

The variation of UTS with the statistics domain features are explored in this study to check for existence of correlation between computed features and UTS. From the investigation it is observed that all the statistical features follow an increasing trend with the increase in the UTS. It manifests that there is intelligible correlation between these signal features and the strength of the welded joints. So, these signal features along with the process parameters are used in the input space of the SVR model as well as ANN models for predicting UTS of the joints.

Modelling of UTS using SVM

SVMs are very specific class of algorithm characterized by usage of kernels, absence of local minima, sparseness of the solution and capacity controlled by acting on margin or on support vectors, etc. As the same way with classification approach there is motivation to seek and optimize the generalization bounds for support vector regression (SVR). The detail of the SVR is out of the scope of this article and can be found in relative technical article [12]. SVM generalization performance (estimation accuracy) depends on setting of meta parameters C, ε and kernel parameter (γ). Different algorithms are available for hyperparameter optimization and grid search algorithm is one of those [13, 14]. For SVMs, C and γ are the two hyperparameter that need to be optimized and initialized in pairs. Grid search trains a SVM with each pair (C, γ) in the Cartesian product of these two sets and evaluate their performance on held-out validation set. Finally the grid search outputs the settings that achieved highest score in the validation procedure.

To find the optimum network parameters, the value of C is varied in the range of 1 to 100 in steps of 1. Whereas, γ and ε values are varied in the range of 0.01 to 0.1 in steps of 0.01. From the grid search method, optimum set of C and γ are found to be 90 and 0.06, respectively, when the input space of the model contains signal features along with the process parameters. On the otherhand, optimum set of C and γ values for the model when the input space contains only the process parameters are 60 and 0.07, respectively. Optimum values of ε are found to be 0.03 and 0.06 for the models, respectively. The models are trained using 57 data sets and remaining 7 data sets are used for testing the developed models. Prediction performance of the developed models are shown in Fig. 2. Output predictions of the developed models for the testing cases are listed in Table 4. From the table it is observed that absolute average percentage error of 2.15 and 6.03 are obtained from the models. Moreover, it is demonstrated that inclusion of signal features in the model input space, increases the prediction accuracy by 64% which indicates that signal features in the input space of the model can be quite effective to accurately predict the quality of the weld. The prediction performance of the SVR model is compared with perfomance obtained from ANN models to find the better candidate for accurate prediction of weld quality.

Fig. 2
figure 2

Performance of support vector regression model a with signal features b without signal features

Table 4 Performance of support vector regression models

Modelling of UTS using ANN

In this work, a standard multi-layer feed forward neural network trained using back propagation algorithm; known as back propagation neural network (BPNN) and a radial basis function neural network (RBFNN) are developed to model the UTS of the welded samples. Computer programs for ANN models are developed using the C programming language. As neural networks are prone to overfitting the data, validation datasets are used to monitor the behaviour of the network so that it does not move towards overfitting [15, 16]. In general, it is found that around 15% of datasets are used for validation, 10% are used for testing the developed neural network model [17]. Remaining datasets are used for training the network. Among the full factorial datasets, 47 and 10 patterns are selected randomly for training and validating the models, respectively. Data set used for testing the SVR model is used for testing the developed neural networks.

The prediction performance of BPNN model depends on the architecture of the network, learning rate (η) and momentum coefficient (α). The number of neurons in the hidden layer is varied from 5 to 39 in steps of 1. Learning rate is varied in between 0.1  to 1 in the steps of 0.04 and momentum coefficient is varied in between 0.01  to 1 in the steps of 0.04. Initial weight values are chosen randomly between ±0.9 and the bias values at the input layer is taken as 0 and that for hidden and output layer as 1.0 respectively. All the inputs and output variables are normalized between 0.1 and 0.9 which ensures that the back propagation algorithm does not drive some of the connections weights to infinity and thus slow down the training [18, 19]. The activation functions for both the hidden and output layers neurons are log-sigmoid. The objective of the training is to minimize the mean square error (MSE) by updating the weights through gradient descent method [18].

The best network architecture from the optimization framework is found to be 7-21-1 with η = 0.7 and α = 0.41 with signal features and the process parameters in the input space. The comparison between the actual UTS values and BPNN predicted UTS values are shown in Fig. 3. In the figure it is reflected that out of 64 number of patterns, only six predictions deviate from the ±5% error line and among which, only two are from the testing data set. The absolute average percentage error for the testing cases is 4.89.

Fig. 3
figure 3

Prediction performance of BPNN model

Radial basis function neural networks are different than BPNN in the way that it has some special activation functions in the hidden layers called as the radial basis functions. Detail description of training algorithm is out of the scope of this article; interested readers may refer to the relevant technical article [20]. Number of hidden layer neurons is varied from 5 to 29 in steps of 1 to find out the optimal network structure. Learning rate for updating weights is varied in the interval of 0.1 to 1 and that for center updation and Gaussian functions spread updation were varied in the interval of 0.01 to 1 in steps of 0.04, respectively.

The best RBFNN is found to be 7-7-1 with 0.46, 0.05 and 0.05 as learning rate for weight, center and spread updating, respectively, with signal features along with the process parameters in the input space. In Fig. 4, comparison between the actual UTS and RBFNN predicted UTS is shown. Compared to BPNN prediction, the prediction of RBFNN is inferior and out of 64 datasets, 23 predictions are out of ±5% error line. The absolute average percentage error for the testing cases is 7.33.

Fig. 4
figure 4

Prediction performance of RBFNN model

The prediction performance of SVR models with BPNN and RBFNN models developed in this research work is compared and comparative analysis is presented in Fig. 5. From the figure it is observed that, SVR model performance for prediction of UTS is better than ANN models. An improvement of 55% is experienced in SVR model prediction accuracy when compared with BPNN model predictions. When compared with RBFNN predictions, there is an improvement of 70% in prediction of SVR model. This implies that SVR models, coupled with current signal features can lead to more accurate predictions of UTS of friction stir welded joints and can be implemented in real time in-process monitoring of FSW process.

Fig. 5
figure 5

Comparison of SVR and ANN model performance a with signal features b without signal features

Conclusions

Main spindle motor current signal against each experiment is acquired and time domain features of the signal are computed. SVR and ANN models are developed for the prediction of UTS of the welded joints using the process parameters along with the extracted signal features. Prediction performances of the developed models are compared to find the best candidate for the accurate prediction of UTS of the joints. Following conclusions are drawn from this research work.

  • Current signal of main spindle motor is a potential candidate for real time monitoring of FSW process.

  • The performance of the SVR model is compared with the output from two ANN models. Among the developed models, SVR model yields better prediction accuracy as compared to developed neural network models. An increase of 55% in accuracy is observed for SVR prediction UTS compared to BPNN model prediction UTS for the case when input space of both the models contain the signal features along with the process parameters. Whereas, an increase of 70% in prediction accuracy is observed for SVR model when compared with the performance of RBFNN model.

The presented work can be modified towards the development of online monitoring techniques for FSW process to achieve accurate prediction of the joint strength without destroying the welded components. This will offer precise control over the process to get the desired output in terms of weld quality.