1 Introduction

Aerospace industry is looking at developing novel materials by incorporating features like superior strength-to-weight ratio, superior mechanical strength, excellent corrosion resistance, etc. in alloys. Nickel titanium (Nitinol) alloy is one of the upcoming novel materials with properties such as better strength-to-weight ratio, high ductility, more resistance to wear and corrosion, excellent biocompatibility, etc. [1]. The addition of a ternary alloying element like copper leads to increase in transformation temperatures, quick actuating response, fatigue and damping properties [2]. In addition to this, this material also acquires two outstanding distinct properties like shape memory and super elasticity, leading to several applications in automotive and medical industries. Nitinol alloy belongs to difficult-to-cut materials due to low thermal conductivity and modulus of elasticity, high strain hardening and is exceptionally hard to machine using regular machining forms like turning, boring, milling and so on, to avoid extreme tool wear as well as chips adhering to the tool, etc. [3] researchers are looking at non-traditional machining processes such as electro-chemical machining (ECM), electro-discharge machining (EDM), abrasive jet machining (AJM) wire electro-discharge machining (WEDM), laser machining, hybrid machining processes, etc. wire electro-discharge machining is a standout process amongst the most versatile un-traditional machining procedures to machine hard metals and combinations of metals like nickel-based alloys, titanium-based alloys, shape memory alloys, strain hardened materials, etc. In WEDM, metal expulsion is influenced by start disintegration, since the wire is bolstered through work piece [4] where ordinarily used wires are made of copper, brass, aluminium, molybdenum and graphite. The use of copper as wire leads to increase in surface hardness because of the development of titanium carbide. Graphite has high melting point due to which its material removal rate is greater while aluminium gives good surface finish [5, 6]. Worked on WEDM characteristics of input attributes on MRR and SR of Ti50Ni42.4Cu7.6 shape memory alloys. Conjunction of small peak current with little Ton is more advantageous to maximize MRR while minimizing SR high Ton with small current is essential [7] and reports the impact of WEDM process attributes on machined surface of Ti50Ni49Co1 shape memory alloy. Indigent surface quality was noticed at greater Ton while superior surface roughness was obtained at higher Servo Voltage [8]. Suggested a fuzzy adaptive controller and used a fuzzy imperialist competitive algorithm to handle the complexity in resolving typical optimization problems in manufacturing. Artificial neural network along with genetic algorithm was applied for improving surface texturing properties of EN 31 steel during WEDM process [9]. Responses like MRR and surface roughness are predicted and analysed parametrically using ANFIS modelling during turning of 202 stainless steel [10, 11]. It is proposed that comparative analytical models rely on neural network back-propagation along with ANFIS and RSM for predicting MRR and SR all along with helium-assisted electro-discharge machining of die steel. An ANFIS modified system was proposed to optimize the active parameters and grinding effectiveness in bearing manufacturing to achieve better productivity compared with other practices [12, 13]. Determined the impact of WEDM attributes on performance attributes to enhance the efficiency with superior quality of titanium alloy (Ti–6Al–4V) utilizing ANFIS and grey relation analysis (GRA) approach. Artificial intelligence techniques like ANFIS and artificial neural networks are implemented to anticipate tool wear during conventional milling of aluminium alloy composites [14]. ANFIS combined with Gaussian regression function and Taguchi analysis was adopted for online prophecy of SR and grinding wheel wear all through the grinding of Ti–6Al–4V [15]. Neural networks and fuzzy inference system (FIS) optimized models were evolved for determining ultimate tensile strength of friction stir welded aluminium alloy joints [16]. ANN models were developed for predicting WEDM responses of aluminium-based metal matrix composites [17]. Hybrid techniques of artificial neuro-fuzzy and ANN were adopted for the prediction of machining characteristics of Ti–6Al–4V alloy [18, 19] attempted back-propagation neural network model for anticipation of SR during the turning of mild steel work pieces. A multi-stage decision algorithm was designed based on ANN and ANFIS models for detection and diagnosis of bearing faults [20]. Prediction of machining responses like tool wear and SR was done during the turning of steel under minimum quantity of lubrication using artificial neural network [21]. Various neural network models on MRR were compared during electrical discharge machining process and it was concluded that ANFIS with bell-shaped function gives the best predicted model [22]. For obtaining greater productivity and good product quality during industrial automation, tool wear prediction was made through a combination of radial basis functions and neural fuzzy function [23]. Experimental investigations was carried out to examine the effect of input attributes on SR and cutting force components during hard turning of American Iron and Steel Institute (AISI) 420, which also predicted machining responses using RSM and ANN [24]. Laser cutting roughness is predicted using ANFIS intelligent model which combines adaptive learning ability with FIS for improving the quality of laser cutting [25]. A hybrid method of orthogonal-based ANN and multi-genetic algorithm is adopted for optimization during wire electro-discharge machining of Ti-48Al intermetallic alloy [26, 27] compared ANN and ANFIS models for estimating the performance of solar-assisted ground source heat pump system [28] demonstrated the usefulness of adaptive neuro-fuzzy inference system, especially hybrid learning algorithm used for modelling the ground coupled heat pump system more accurately [29] has made a comparative study of ANN and ANFIS models for modelling and predicting a ground coupled heat pump system [30] used ANN with statistical weighted preprocessing technique for predicting and improving the performance of ground source heat pump systems. Application of artificial neural networks was adopted for predicting the horizontal ground coupled heat pump systems [31]. Performance of a ground source heat pump prediction based on ANFIS with a fuzzy weighted preprocessing technique was adopted. Finally, based on the results, the proposed fuzzy weighted preprocessing-based ANFIS gives better accuracy than standard ANFIS results [32, 33]. A modelling study of solar air heater using ANN and wavelet neural network modelling approaches for evaluating the performance of a new solar air heater is reported. Application of ANN model with three different learning algorithms, Levenberg–Marquardt (LM), Scaled Conjugate Gradient (SCG) and Pola–Ribiere Conjugate Gradient (CGP) algorithms and ANFIS model were developed using the same input variables for predicting the performance of vertical ground source heat pump [34, 35] developed ANN model for prediction of surface roughness in WEDM of Inconel 825 aerospace alloy. The performance of solar chimney power plants were identified and modelled by using ANN and ANFIS models [36]. ANN and ANFIS models were compared for predicting surface roughness of assisted electrical discharge machining of D3 steel [37]. Application of artificial neural network computing technique was adopted for predicting the depth and surface roughness in laser milling of poly-methyl-methacrylate [38]. Hence, it is apparent from earlier studies that only limited studies on the prediction of WEDM output attributes of a Nitinol alloy through artificial intelligent techniques have been carried out. The present article is therefore an attempt to predict WEDM response attributes like MRR and SR of Nitinol alloy while comparing two models based on statistical errors and accuracy. In this study, experiments were designed using central composite face-centred design by considering pulse on time (Ton), pulse off time (Toff), peak current (Ipeak) and gap voltage (V) as input attributes, while SR and MRR were output attributes for WEDM of a Nitinol alloy. Two artificial adaptive models-neural networks and adaptive neuro-fuzzy inference system are explored to predict output attributes of a work piece for a variety of input attributes.

2 Materials and methods

Nitinol alloy (nickel titanium alloy) is commercially available and is supplied by Kellogg’s Research Labs, United States of America (USA), and was used as work piece material for machining. It has a melting point of around 1300 ± 50 °C and density of 6.45 gm/cm3.

2.1 Energy-dispersive X-ray fluorescence (EDXRF) of a Nitinol alloy

The chemical constitution of the workpiece is provided by the supplier and confirmed by energy-dispersive X-ray fluorescence (EDXRF) analyser (VEGA 3LMU.TESCAN, Czech Republic) shown in Table 1, as illustrated in Fig. 1. It shows the presence of different chemical elements such as carbon (C), oxygen (O), aluminium (Al), silicon (Si), sulphur (S) and iron (Fe).

Table 1 Chemical constitution of Nitinol alloy
Fig. 1
figure 1

EDXRF analysis for Nitinol alloy

2.2 Microstructure of Nitinol alloy

Figure 2 shows the microstructure of Nitinol alloy. In this technique, the surface of the specimen should be cleaned with polishing techniques to obtain mirror surface. Etchants were finally adopted for Nitinol alloy to obtain different phases of crystal structure. From this analysis, it is observed that the dark phase is identified as nickel titanium (NiTi) and bright phase as Ti2Ni and copper and Fe atoms were also present in the matrix of both phases.

Fig. 2
figure 2

Microstructure of Nitinol alloy before machining

2.3 Differential scanning calorimetry (DSC) of a Nitinol alloy

Differential scanning calorimetry (DSC) analysis is the most commonly adopted thermal analysis technique for measuring transformation temperatures. It measures the amount of energy absorbed or released by a sample when it is heated or cooled. DSC curve was observed for Nitinol alloy and the transformation temperatures were analysed using tangent method as shown in Fig. 3.

Fig. 3
figure 3

DSC curve of a Nitinol alloy

From DSC Curve, we analyse the transformation temperatures using tangent method. Martensite start temperature (Ms) = 42.64 °C; martensite finish temperature (Mf) = 13.97 °C; austenite start temperature (As) = 49.126 °C; austenite finish temperature (Af) = 78.21 °C.

2.4 Artificial neural networks

Artificial neural network is the most extensively adopted model for addressing complex and nonlinear problems. It is the preferred model to explain complex behaviour between input and output in WEDM process. It has been used extensively in numerous fields such as process modelling, robotics, pattern recognition, forecasting, etc. [39, 40]. A basic model of ANN comprises three layers: input, hidden and output. The pairs of input–output patterns are stocked in input and output layers, the hidden layer relates to the distinguished strength of data from the input to output patterns, through so-called weights. The weights are altered in the learning process in which all the layers of input–output are presented in the learning phase frequently. For training multi-layer network, a lot of algorithms are available, the most commonly accepted learning algorithm being back-propagation algorithm. The general architecture of back-propagation neural network constitutes input, hidden and output layers, as shown in Fig. 4. The input layer takes data from exterior sources and advances this data to the network for processing. The hidden layer takes data from the input layer, and does whole data processing, and the output layer takes treated data from the network, and transmit the results out to an exterior receptor. Cluster type of superintend learning has been used in the current case, where, whole input–output pattern sets are furnished to the neural network step by step, and again modified using average gradient data. During training, the evaluated output is correlated with the target output, and the mean square error is determined. If the mean square error is greater than the recommended restricting value of the error, it is back-propagated to change the concatenation weights and the process continues until the recommended error rein is arrived.

Fig. 4
figure 4

Multi-layer neural network architecture for both MRR and SR

Mean square error, E, is calculated by Eq. (1)

$$E = \frac{1}{P}\mathop \sum \limits_{1}^{P} \mathop \sum \limits_{k = 1}^{n} \left( {d_{k}^{p} - c_{k}^{p} } \right)^{2}$$
(1)

where p = number of pattern, n = number of node in output layer, d pk  = desired output of kth node of pth pattern, cp k = calculated output of kth node of pth pattern.

Figure 5 shows the framework of ANN model.

Fig. 5
figure 5

Framework of ANN model

To accomplish ANN model for prediction of MRR and SR, the following points are followed in the sequence below

  1. 1.

    Collect experimental data.

  2. 2.

    Divide the experimental data into training and testing data sets.

  3. 3.

    Create a network by giving training and testing sample.

  4. 4.

    Configure the network by choosing the number of hidden layers and required training, transfer and learning functions.

  5. 5.

    Train the ANN model by giving it the required parameters to achieve MSE goal.

  6. 6.

    If the attempt results in failure, go back to change the hidden layers or weights and generate the network again and repeat the cycle until the required goal is achieved.

2.5 Adaptive neuro-fuzzy inference system

ANFIS is a hybrid conjecturing model which combines neural network and fuzzy logic to develop mapping connection between inputs and outputs [41]. In the hybrid approach, the neural network is trained by data while fuzzy logic is based on linguistic rules called If–then rules. If–then rules are incorporated along with trained data and are called inference of fuzzy inference system. Fuzzy systems and neural networks are amongst the most important soft computing methods. The connection between neural networks and linguistic knowledge-based system is bidirectional and has been discussed extensively [42]. The ANFIS model gives overall behaviour of Input or output obtained by a collection of variables [43]. The goal of ANN and fuzzy systems are to follow the actions of an expert for resolving a complex problem through instruction or learning. If one has data instruction or can learn from simulation or real problem, ANN is highly suitable. If one gains learning from knowledge expressed in the form of linguistic rules, one can build a fuzzy system. The advantages of both fuzzy and ANN systems are combined in a neuro-fuzzy approach called ANFIS [44].

The goal is to establish an adaptive neuro-fuzzy inference model to correlate the inputs to the output attributes perfectly. They are assessed on the basis of testing accomplishment. An ANFIS model is developed for predicting the wire electro-discharge machining responses like material removal rate and SR. In this research, a total of 100 data patterns were used for the prediction of ANFIS model. In general, total data instances are divided into training and testing data sets. The training data set is mainly adopted for building ANFIS model as long as the testing data sets are used for validating the predicted model.

ANFIS structure is categorized into five layers in which each layer is erected by various nodes as in a neural network. (1) Input data fuzzification, (2) fuzzy database structure, (3) structure of fuzzy rule base, (4) preparation for decision and (5) output defuzzification [45]-simplified ANFIS illustration are shown in Fig. 6.

Fig. 6
figure 6

Simplified ANFIS structure [47]

To illustrate this model clearly, the five layers of ANFIS network contain two inputs (x and y) with two fuzzy rules and output attribute f. The inputs of individual layers in the structure are acquired by the nodes from prior layer. ANFIS rule base constitutes If–then rules of the fuzzy Sugeno type. For a first-order Sugeno fuzzy inference system (FIS), two rules may be given as

  • Rule 1 If x is A1 and y is B1, then f is f1 (x, y)

  • Rule 2 If x is A2 and y is B2, then f is f2 (x, y),

x, y are ANFIS inputs, Ai, Bi are the fuzzy sets, f1 (x, y) are the output of the first-order Sugeno FIS. The simplified ANFIS structure is shown in Fig. 6. The attributes sets that are flexible in these nodes are given by adoptive nodes indicated by squares, while firm nodes are indicated by circles [46].

ANFIS model for both MRR and SR comprises five layers of adaptive network which consists of four inputs with eight fuzzy rules with one output Inference system is assembled by five layers, where every layer constitutes various nodes characterized by node function. The current layer inputs are obtained from earlier layers. The ANFIS rule base generally contains If–then rules of Sugeno type [48].

Primary layer is the fuzzy layer, which gives information about adaptive nodes with node functions i. The membership connection between the output and input functions of this layer are given by:

$$Q_{1,i} = \mu_{Ai} \left( x \right)\quad \left( {i = 1, \, 2} \right)$$
(2)
$$Q_{1ij} = \mu_{Bi } \left( y \right)\quad \left( {i = 3,4} \right)$$
(3)

Q1i and Q1ij stand as output functions and µAi and µBj are designates of membership functions. There are various membership functions such as trigonometric, triangular, trapezoidal, bell mf, gauss mf types, which are generally adopted. Amongst all types gauss membership function is preferred to show the linguistic labels because the connection between processing time and make span is nonlinear and this gives a smooth transition.

It may be written as.

Initial parameter membership function

$$\mu_{Ai } \left( x \right) = \exp \left[ { - \,0.5\left( {\frac{{\left( {x - a_{i1} } \right)}}{{b_{i1} }}} \right)2} \right]$$
(4)
$$\mu_{Bi} \left( y \right) = \exp \left[ { - \,0.5 \left( {\frac{{x - a_{i2} }}{{b_{i2} }}} \right)2} \right]$$
(5)

Layer 2 consists of fixed nodes indicated by a circle. A node function multiplied by input gives output. For each node, this type of layer is called rule node. It evaluates the firing strength of the correlated rule.

$$Q_{2,i} = w_{i} = \mu_{Ai} \left( x \right) \mu_{Bi} \left( y \right)$$
(6)

here Q2,i denotes the output of the second layer.

In the third layer, every node i is considered firm node and is indicated by circles and labelled N which gives the normalization of the firing levels. Its purpose is to normalize the weight function in the following process:

$$Q_{3,i} = \bar{w}\frac{{w_{i} }}{{\sum w_{i} }}\quad \left( {i = 1, \, 2} \right)$$
(7)

where Q3,i signifies the output of the third layer

Layer 4 is called de-fuzzy layer, whose node i is adaptive and indicated by a square. If the output equation is \(\bar{w}\), the de-fuzzy connection between the input and output of this layer may be defined as:

$$Q_{4} , \, i \, = \bar{w}_{i} \times f_{i} \quad \left( {i = 1, \, 2} \right)$$
(8)

where Q4, i signifies the output of the fourth layer, where f1 and f2 are the fuzzy If–then rules are as follows

  • Rule 1 If x is A1 and y is B1, then f1 = p1x+q1y+r1

  • Rule 2 If x is A2 and y is B2, then f2=p2x+q2y+r2

where \(\overline{{w_{i} }}\) is firing strength of the earlier normalized layer and pi qi ri are the resultant attribute sets.

Layer 5 is gross output layer, whose node is marked as Σ. In this layer we have only one node called fixed node, indicated by a circle, which gives inclusive output by adding up all approaching signals

$$Q_{5} ,i = {\text{fout}} = \varSigma \bar{w}_{i} \times f_{i} .$$
(9)

In ANFIS simplified diagram, the first and fourth layers with two adaptive nodes are very crucial. There are two flexible premise attributes [ai, bi] in the first layer which connect two membership functions. In layer 4, we observe three flexible consequent attributes [pi qi ri] belonging to a single-order polynomial. Figure 7 gives the framework of ANFIS model

Fig. 7
figure 7

Framework of the ANFIS model

To accomplish adaptive neuro-fuzzy inference system model for prediction of MRR and SR, the following points are taken into consideration:

  1. 1.

    Initial training data is defined.

  2. 2.

    Attributes for input and output membership function are set.

  3. 3.

    Membership function structure is defined.

  4. 4.

    ANFIS model is trained to extract data.

  5. 5.

    Testing data is defined.

  6. 6.

    ANFIS model is tested.

3 Experimental details

In this study, all experiments were conducted on a Computer Numerical Control (CNC) WEDM (“Ezeecut plus” manufactured by Ratnapar-khi Electronics) as shown in Fig. 8. A molybdenum wire of 0.18 mm diameter was used as wire electrode material as it is more advantageous in terms of greater melting point and larger tensile strength, which is very helpful during machining of intricate shapes, especially in aerospace and missile components compared with other wire materials. During this process, the diameter of the wire and the pressure of de ionized water as dielectric fluid (2.6 kg/cm2) were kept constant.

Fig. 8
figure 8

Schematic diagram of WEDM process of a Nitinol alloy

Table 2 demonstrates the four processing input attributes used for ANFIS modelling for training and testing.

Table 2 Input attributes used in training and testing

3.1 Measurement of MRR and SR

The performance of WEDM machine depends mainly on MRR and SR [49]

$${\text{MRR}}\,\left( {{\text{mm}}^{3} / {\text{min}}} \right) = \frac{\text{volume of material removed}}{\text{machining time}}$$
(10)

volume of material removed = length of cut (mm) × width of cut (mm) × thickness of the sample, where width of cut (mm) = 2 × clearance + diameter of the wire.

Machining time is obtained by means of stop watch for each trial if time is measured with a stop watch. Surface roughness is measured using surface roughness tester (Taylor Hobson, Sutronic-3+) expressed in micrometres. Surface roughness is measured at three distinct areas of the work surface and the average value of surface roughness is then calculated. Table 3 shows the experimental design for WEDM of Nitinol alloy developed by response surface methodology (RSM) face-centred central composite design (CCD).

Table 3 Experimental matrix design and their results

4 Data analysis: prediction-modelling for material removal rate and SR using artificial neural networks and adaptive neuro-fuzzy inference system

4.1 Artificial neural network modelling

Neural network modelling is commonly adopted soft computing technique to solve intricate nonlinear problems. The neural network constitutes extensive interrelated neural computing elements. In this research article, MATLAB software was employed for designing ANN architecture. The input layer related to Ton, Toff, Ipeak and Gap voltage whereas the output layer related to material removal rate or surface roughness. In this model, the input layer is interrelated with a hidden layer of neuron and the hidden layer is related to output layers. After thorough trials and from operating the network, ANN models for material removal rate and the surface roughness were developed.

ANN model, back-propagation with three different learning algorithms Levenberg–Marquardt (LM), Elman regression neural network and generalized regression neural network (GRNN) were used for predicting WEDM attributes. The most suitable algorithm and neuron number in the hidden layer was found to be Levenberg–Marquardt (LM) with 10 neurons for ANN models. The attributes chosen for predicting ANN model for MRR and SR are shown in Table 4. For swift and superintended learning, Levenberg–Marquardt back-propagation neural network algorithm was used throughout the time of training the network [50,51,52]. The accomplishment of the network is expressed in terms of mean square error (MSE) and average percentage error (APE). MSE can be calculated as:

$${\text{MSE}} = \frac{1}{x \times y}\mathop \sum \limits_{i = 1}^{X} \mathop \sum \limits_{j = 1}^{Y} \left( {p_{j} - q_{j} } \right)$$
(11)

where X is the number of output nodes, Y is the total number of training data, pj is output of jth neuron and qj is the predicted value of jth neuron [53]. In the current neural network model for MRR and surface roughness throughout simulation, the correlation coefficient (R) values obtained for MRR and surface roughness are 0.9815 and 0.9878 as represented in Figs. 9 and 10. From statistical perspective, a network can accurately compare the input attributes with output attributes. If the correlation of coefficients is close to 1, which exhibits the collation of experimental and anticipated values of MRR and surface roughness by feed forward back-propagation neural network (FFBP).

Table 4 Selected ANN attributes for MRR and SR
Fig. 9
figure 9

Regression analysis between the actual and predicted values by FFBP-neural network of a Nitinol alloy for training, testing, validation, and overall for MRR

Fig. 10
figure 10

Regression analysis between the actual and predicted values by FFBP-neural network of a Nitinol alloy for training, testing, validation, and overall for SR

4.2 ANFIS modelling

In this section, an ANFIS model with three different membership functions (gauss mf, gauss2mf, g bell mf) were employed for anticipating both MRR and SR during WEDM of Nitinol alloy using MATLAB. The entire experimental data set was split into training and testing datasets. Graphical user interface (GUI) was generally adopted for training and testing ANFIS model. The training data sets include 70 observations whereas testing includes 30 observations. Testing 30 data sets was considered for predicting WEDM responses.

The parameters chosen for predicting and developing ANFIS model are illustrated in Table 5. Two Gaussian-type membership functions (gauss mf) have been preferred for input variables while constant-type membership function (MF) has been used for MRR and SR as shown in Fig. 11. Grid partition method is generally applied for generating an fuzzy inference system (FIS) when we test for fewer data sets. During training of ANFIS, least squares combined with back-propagation were applied for better trained optimized FIS. It includes Ton, Toff, Ipeak, V and MRR, SR. The deviation of actual and predicted data sets is illustrated in Fig. 12. The blue dots represent experimental output data and pink dots represent ANFIS anticipated data. The conspiracy shows distribution of data is unity in nature. Figure 13 gives the gauss initial membership functions of one of the input attribute (i.e. 2 2 2 1) with respect to SR, and Fig. 14 gives the FIS rule Sugeno-type architecture when Gaussian membership is mixed with constant function. Grid partition method is generally adopted in ANFIS model structure for both MRR and SR as illustrated in Fig. 15. The effect of each individual input attributes on WEDM responses is indicated by 3D contour plots as represented in Figs. 16 and 17. ANFIS architecture with four input attributes and one output with 2 2 2 1 gauss membership function, we get 8 fuzzy rules as illustrated in Fig. 18. The variation of ANFIS-predicted WEDM response attributes with actual experimental attributes in the form of error and accuracy percentages is given in Table 6. Correlation coefficient is a statistical process for determining the relation between the testing input data sets and the predicted WEDM output attributes.

Table 5 Selected ANFIS attributes for training of ANFIS model for MRR and SR
Fig. 11
figure 11

The training of data for both MRR and SR

Fig. 12
figure 12

Distribution of experimental and predicted data of MRR and SR (testing)

Fig. 13
figure 13

Membership function for pulse on time (Ton) on SR

Fig. 14
figure 14

Sugeno-type structure (FIS) for MRR and SR

Fig. 15
figure 15

ANFIS model structure for MRR and SR

Fig. 16
figure 16

Exhibits surface contour plot of predicting MRR

Fig. 17
figure 17

Exhibits surface contour plot of predicting SR

Fig. 18
figure 18

A sample set of FIS rules for prediction of MRR and surface roughness

Table 6 Experimental and predicted values of material removal rate and SR using ANN and ANFIS test data sets

5 Results and discussion

ANN models, back-propagation with three different algorithms Levenberg–Marquardt (LM), Elman regression neural network and generalized regression neural network (GRNN) were used for anticipating the WEDM responses. The statistical measures like MSE, RMSE, MAE and standard deviation of all three neural networks are illustrated in Table 7. LM back-propagation algorithm with 10 neurons gives fewer statistical errors and is more accurate than Elman RNN and GRNN. ANFIS models with three different membership functions (gauss MF, gauss2MF, gbell MF) were trained and tested for predicting WEDM responses. Statistical errors were fewer compared to the remaining two MF for the same input variables as shown in Table 8. As per the statistical perspective ANFIS model will give more accuracy than ANN models. This exhibits variation of experimental responses with the predicted responses being a validation of both ANN and adaptive neuro-FIS model. Both may then be utilized to predict the MRR and surface roughness.

Table 7 Statistical values for WEDM responses of a Nitinol alloy, ANN models
Table 8 Statistical values for WEDM responses of a Nitinol alloy, ANFIS models

The absolute percentage error (Ei), mean absolute percentage error (Eav), root mean square error (RMSE) of both MRR and SR are calculated by Eqs. (12)–(14)

$$E_{i} = \frac{{\left| {{\text{actual output value}} - {\text{predicted ANFIS output}}\,{\text{value}} } \right|}}{{{\text{actual output}}\,{\text{value}}}} \times 100$$
(12)
$$E_{\text{avg}} = \frac{1}{M }\mathop \sum \limits_{i = 1}^{M} E_{i}$$
(13)

where Ei is the % error of the data set i; Eavg is the average percentage error; M is the number of datasets

$${\text{RMSE}} = \sqrt {\frac{1}{M }} \mathop \sum \limits_{I = 1}^{M} \left( {x_{i} - y_{i} } \right)^{2}$$
(14)

where M is the number of testing datasets, xi is the experimental value and yi is the value, anticipated by artificial neural networks and adaptive neuro-FIS model.

Figures 19 and 20 show the coefficient of determination of ANN model (R2) for MRR, and surface roughness was 0.9491, i.e. 94.91 and 0.9758, i.e. 97.58, respectively, whereas for ANFIS-predicted model (R2) for MRR and SR it was 0.9667, i.e. 96.67 and 0.9891, i.e. 98.91 percentage of the variance. The adjusted R-square value for MRR and SR of ANN model is 0.9473 and 0.9749 which is very close to R-square which makes our neural networks and adaptive neuro-FIS model more accurate. The root mean square error (RMSE) of MRR and SR of ANN model is 0.088286 and 0.04369 whereas for ANFIS model it is 0.070627 and 0.038525; this is used to anticipate the variance between the experimental and predicted value of the model. From Figs. 21 and 22, we observe that output of ANFIS-predicted model for MRR and SR is very close to experimental values compared to ANN anticipated models. The mean absolute error (MAE) obtained from ANFIS-predicted model for MRR and SR is 0.0611 and 0.0285 which is very low compared to ANN MAE, i.e. 0.0652 and 0.04086.

Fig. 19
figure 19

Coefficient of determination (R2) of a Nitinol alloy for ANN model. a MRR, b SR

Fig. 20
figure 20

Coefficient of determination (R2) of a Nitinol alloy for ANFIS model, a MRR, b SR

Fig. 21
figure 21

Collation of experimental and predicted values by different FFBP-ANN models of a Nitinol alloy. a Material removal rate, b SR

Fig. 22
figure 22

Collation of experimental and predicted values by different ANFIS membership functions of a Nitinol alloy. a Material removal rate, b SR

5.1 Comparison of the artificial neural networks and ANFIS models

A collation of anticipated values of WEDM output responses of a Nitinol alloy by ANN and ANFIS models is achieved. ANN model, back-propagation with three different algorithms Levenberg–Marquardt (LM), Elman regression neural network (RNN) and generalized regression neural network (GRNN) and ANFIS model were developed using the same input variables. The most suitable algorithm and neuron number in the hidden layer were found as Levenberg–Marquardt (LM) with 10 neurons for ANN models whereas the most suitable membership functions and number of membership functions are found to be gauss and two, respectively. The precision of anticipated models for both artificial neural network and ANFIS was determined by using mean square error (MSE), root mean square error (RMSE), mean absolute percentage error (MAPE), as given in Tables 6 and 7. From these values it may be concluded that the adaptive neuro-FIS model offers more reliable and accurate prediction in combination with artificial neural network models. A comparison of experimental and predicted values of MRR and surface roughness of Nitinol alloy by ANN and ANFIS models is represented in Figs. 23 and 24.

Fig. 23
figure 23

Experimental and predicted values collation of MRR for ANN and adaptive neuro-FIS models

Fig. 24
figure 24

Experimental and predicted values collation of SR for ANN and adaptive neuro-FIS models

6 Conclusions

This article provides insights into the features of both ANN and adaptive neuro-fuzzy inference models for better prediction of output attributes as well as a comparison of both models during WEDM machining of Nitinol alloy. The study is carried initially with the ANN model with three different algorithms and the best algorithm has chosen according to the statistical validation measures. Similarly the ANFIS with three different membership functions and the best has chosen with minimum statistical validation measures. Then, the ANN model will be compared with the ANFIS model and the results obtained will be resumed as follows:

  1. 1.

    Generalized regression neural network is the slowest algorithm in the back-propagation compared with remaining two neural networks (LM and Elman RNN). Statistical validation measures like MSE, RMSE, MAE SD are more in generalized regression neural network.

  2. 2.

    LM training algorithm is the fastest back-propagation algorithm compared to the remaining two algorithms. LM with 5 neurons appeared to be the best ANN model with low statistical validation errors and good accuracy.

  3. 3.

    Elman regression neural network is another algorithm with better accuracy than generalized regression neural network and low accuracy than LM back-propagation neural network in terms of statistical measures perspective for anticipating WEDM responses.

  4. 4.

    ANFIS model with two Gaussian-type membership functions (gauss mf) has been preferred for input attributes while constant-type membership function (MF) which is used for MRR and SR gives best results compared to gauss2mf and gbell MF.

  5. 5.

    ANFIS model with two Gaussian MF was the best model amongst all neuro-fuzzy inference techniques (LM, Elman RNN, GRNN, gauss2MF, gbell MF) which was used for comparative purpose for predicting WEDM responses of Nitinol alloy.

From this investigation, it was concluded that ANFIS model gives more precise and useful soft computing approach when compared to ANN model for better prediction of WEDM machining responses like MRR and SR of Nitinol alloy.