1 Introduction

Machining of micron-size holes in difficult-to-cut materials remains a complex task though the present manufacturing industries are equipped with advanced machining and tooling technologies due to the material's mechanical properties and size of the hole. Inconel 718 (Ni718) superalloy is an important material being used today for manufacturing a wide range of aero-engine, rocket, gas turbine, and submarine components that are functioning between 450 and 700 °C temperatures [1]. Superior thermal and corrosion resistance and high yield and ultimate strength, high hot hardness with Ni718 alloy are attractive properties for a widespread applications in the electronics, chemical, and marine industries [2, 3]. However, the properties such as poor thermal conductivity and work hardening tendency of Ni718 alloy made it classify as the difficult-to-machine material [4] using traditional machining practices. Since, there is no mechanical contact in nontraditional machining approaches between the workpiece and tool, the chip formation in metal removal is independent of the workpiece hardness [5]. Hence these approaches have been proved as efficient for machine materials irrespective of their hardness [6].

Electro-discharge machining (EDM) is a popularly commercialized nontraditional machining process in manufacturing industries for machining precision components that require a high degree of surface quality and dimensional accuracy. Due to high tool wear and chatter, machining of micro-features on difficult-to-cut and heat resistance materials such as Inconel alloys to achieve the required surface quality and drill accuracy through conventional drilling process become very difficult and involved with a huge expenses [7]. Micro-EDM (µEDM) as a version of the EDM process has been identified as an appropriate non-conventional machining process for machining micron-size holes on Ni718 alloy within the limits of the desired tolerance [8]. Also, µEDM allows to generate micro-textures and devices for a wide range of industrial, aeronautical, electronics and the biomedical applications. In the µ-EDM process, due to the generation of high-intensity heat by the rapid recurring electric sparks [9], a small portion of the material in the form of debris is removed from the workpiece by melting and evaporation. The debris formed by melting are flushed from the workpiece electrode interface by the continuous supply of dielectric fluid. Despite of several advantages with the µEDM process, radial overcut, tool wear rate, and material removal rate are pointed out as its limitations. Hence, several research investigations have been reported on the fundamental understanding of various process characteristics for better control of its performance towards the machining efficiency improvement. Type, size and shape of the electrode, electrode feed rate, polarity, current density, gap voltage, pulse-on time, pulse-off time, duty cycle, type of dielectric and its pressure are some of the frequently considered process control variables of µEDM. While the material removal rate (MRR), tool wear rate (TWR), surface roughness, diameter radial deviation, hole taper angle, recast layer, and radial overcut are the µEDM performance indicating characteristics. Higher feed rates increase the MRR as it accelerates the sparking rate at the workpiece-tool gap, but it also increases the formation of the recast layer on the machined surface which reduces the surface quality [10]. MRR is increased by increasing the peak current as it increases the material melting rate by increasing the discharge energy. Increasing the peak current simultaneously increases the TWR [11] which is an adversely effect the machine hole dimensional accuracy. The use of negative polarity increases the density of electrons reaching the workpiece which simultaneously improve the MRR and reduce the TWR compared to positive polarity [12, 13]. De-ionized water gives better MRR and TWR than kerosene as dielectric [14]. The larger electrode diameter increases the TWR due to the continuously changing discharge conditions and higher discharge frequency [15].

The occurrence of secondary discharge at the electrode-drilled home gap due to the movement of debris generate progressively tapered hole [16]. Lim et al. [17] investigated the influence of voltage, gap control algorithm, and capacitance and resistance values in µEDM on TWR, MRR, and the stability of the machining. Ay et al. [18] noted better micro hole quality at low pulse duration and low discharge current. The tool orbital actuation reduces the tool wear and improves the machined surface quality by accelerating the electrode flushing [19]. Natarajan et al. [20] observed better micro hole topographical conditions at lower values of pulse-on time and discharge current. A better surface finish in µEDM can be obtained by sacrificing the MRR while the acceleration of MRR results in higher surface roughness and TWR. Therefore, Muhammad et al. [12] proposed an ideal set of process parameters to deliver better µEDM performance for machining WC–Co. Rao et al. [21] derived an optimal processing conditions for µEDM Ni718 alloy based on the experimental observations pertaining to the MRR, TWR, and recast layer by the conduction of design of experimentation and gray relational analysis integrated Taguchi’s statistical approach.

Likewise, several authors have performed experimental studies to understand the significance of many adjustable process parameters of µEDM on its response characteristics. However, the knowledge of these investigations may not be enough to evaluate the µEDM process performance for real-time machining on the shop floor. In general, the selection of µEDM process parameters is done by using either the data manual provided by the machine tool manufacturer and/or the machine tool operators’ experience. But such criteria neither guarantee the desired drilled hole dimensional characteristics nor the surface quality due to the presence of highly conflicted process responses. Hence, the choice of a suitable set of machining conditions meeting the desired quality characteristics become a complicated task. Such circumstances made selection of optimal µEDM processing conditions quite essential through a reliable approach to enhance the machining performance. But considering the confliction among µEDM process responses for a large number of process control parameters, optimization of any one response alone cannot represent the complete performance of the process in real practice. Dealing with such conflicted responses required a method of simultaneous optimization of multiple responses. Hence, statistical and intelligent modeling and optimization of the process responses gained more attention for simultaneous optimization of multiple conflicted responses to enhance the machining process efficiency [5].

Deepak et al. [10] considered the process parameters: voltage, electrode rotation speed, and feed rate of µEDM to optimize its responses such as MRR, overcut (OC), and taper angle using the genetic algorithm proposed sum of weighted objectives technique for machining Ni718 alloy. Somashekhar et al. [22] shown artificial neural network (ANN) model as effective for MRR prediction in µEDM with the develop a parametric regression model. Anthony et al. [23] employed the ANN model to obtain optimal tool profiles instantaneously and their results showed considerably very little deviation from the experimentally measured results. The prediction accuracies of ANN and adaptive neuro-fuzzy interference system (ANFIS) models for µEDM process performance characteristics were compared by Suganthi et al. [24] and noted superior predictability with the ANFIS model. Recently, by Ashish et al. [25] implemented response surface methodology (RSM) to predict the significance of various EDM process parameters on MRR, SR, and TWR and ANFIS to derive the optimal processing conditions for machining Inconel alloy. The results of their validation experiments are observed with a good agreement with the predicted results. Pathak et al. [26] presented the studies on the circularity and surface roughness against various control parameters of powder-EDM process for machining Inconel 718 alloy and determined their optimal values by implementing fuzzy logic and ANFIS models. They reported better prediction with ANFIS model compared with the fuzzy model. Singh et al. [27] integrated gray relational analysis (GRA) with ANFIS method to derive the optimal EDM conditions for machining WC alloy. In gas-assisted µEDM of D3 die-steel, Nishant et al. [28] employed an integrated neuro‑fuzzy model and GA optimization approach to predict and analyze the responses such as MRR and surface roughness. GA has been used to improve the model architecture of the ANFIS as well as to determine the optimal set of machining conditions. Compared with the results of ANN and ANFIS models, ANFIS integrated GA has shown better performance for the prediction of the machining conditions. To adopt the dynamic realistic non-linear relationships among the response, and to treat all the responses based on their degree of significance among them in statistical analysis, Ishwar et al. [29] used principal component analysis (PCA) to normalize the response data of MRR, TWR, and OC of µEDM process performed on a silver plate and the normalized data was fed as input to ANFIS model for process optimization. Lingxuan et al. [30] integrated multi-objective GA with supporting vector machine (SVM) prediction model for prediction and optimization of TWR and machining time in µEDM process. Assarzadeh et al. [31] employed ANN model for prediction and augmented Lagrange multiplier algorithm for optimization of µEDM conditions.

Owing to the above-conferred investigations on the implementation of various predictive modeling and parametric optimization approaches for improvement of the µEDM process, it has been noted that, a handful number of investigations have been used ANN modeling technique. This is due to the reason that, ANN is a flexible process modeling tool inspired by the biological nervous system [32] having the capability of mapping a nonlinear mathematical relationship between several process input variables and a response [31]. However, the application of the ANN modeling technique is limited by the problem of overfitting the response values because it follows the “black-box models” trained by sampling data of input variables and output response rather than the machining process mechanism. Moreover, a large number of data samples should be provided train the network as a well-trained ANN model. Determination of a reliable network architecture is also a complex task and is depending on experiences. Therefore, Saffaran et al. [33] optimized the network architecture with the help of the PSO (particle swarm optimization) algorithm, and the performance of the determined architecture was verified by SA (simulated annealing) algorithm. Recently, the ANFIS modeling technique has been proved as one of the potential predictive modeling techniques for complex nonlinear machining processes. ANFIS evolved with the integration of two popular soft-computing techniques of ANN and fuzzy logic. ANN in ANFIS has the capability of self-understanding the relation between the input and response data while fuzzy logic holds the capability of dealing with imprecision and uncertainty present with the data using if–then fuzzy rules [34].

Based on the available literature, efficient approaches are vital for exact prediction of the precise machining processes to minimize the model’s uncertainty. The stat-of-affairs result in the hybridization of the suggested predictive models in the literature for the development of efficient predictive models for the µEDM process is motive of this investigation. Nevertheless, being implemented µEDM for manufacturing a wide range of components in many fields, the literature review shows that no research work is published on the comparative analysis of the reliability of ANN, ANFIS, ANFIS-GA, and ANFIS-PSO algorithms for prediction of µEDM responses.

2 Research significance and novelty

Since, micro-meter range material removal is an indispensable requirement in today’s industrial scenario for manufacturing precision components for medical, optical, automobile, and aerospace industries. A high degree of dimensional accuracy and surface quality of the machined micro-holes are the desired functional performance characteristics, and the material removal rate reflects the production rate of the components in µEDM process. However, the dimensional accuracy and material removal rate are highly correlated and important drilled micro-hole characteristics in processing micro-features by µEDM process. While producing quality micro-features by µEDM is involved with the control of a large number of process parameters. Conduction of the trail machining experiments for identification of set optimal machining conditions is relatively time-consuming and costly. Therefore, efficient predictive and optimization approaches are vital for a more precise prediction of the machining conditions to enhance the machining efficiency. Owing to the proposed predictive and optimization approaches in the literature aiming with various machining process, the selection of a reliable approach is still ambiguous as every approach may not efficiently predict every machining process. Recently, the application of machining learning algorithms has been popularly used for prediction of various machining process. However, the prediction efficiency of a machine learning algorithm in turn need an appropriate selection of the algorithm-centric control variable. While still there are no stipulated guidelines or constructive formulations to determine the necessary number of MFs in the literature. Hence this investigation presented the evaluation of a reliable machine learning algorithms and allied intelligent hybrid approaches for µEDM performance prediction and experimentally proved for their shop-floor implementation. Also, the results of this investigation explained and compared the precision of ANFIS-PSO, ANFIS-GA, ANFIS, ANN, and regression models for predicting the µEDM performance characteristics. This kind of investigation has been not reported yet in the literature in application to the µEDM process performance evaluation in terms of the chosen micro-hole characteristics such as recast layer thickness, diametral deviation, and MRR.

3 Model development

3.1 Adaptive network-based fuzzy inference system (ANFIS)

A Neuro-fuzzy system (NFS) was proposed by Jang in 1983 based on the integration of ANN and Fuzzy-rule based interface system [34, 35]. This interface system allows the flexible estimation with the capability of exploring interpretable Takagi–Sugeno type fuzzy If–then rules. ANFIS correlates the output and input data by using an integrated learning approach for obtaining the optimal spreading of different membership functions (MF’s). The rationalization of MF’s in ANFIS is done in two ways such as back and hybrid propagation. ANFIS architecture is designed with 5 layers integration and each layer is defined by a node function as shown in Fig. 1: “input fuzzification as layer 1”, “fuzzy set database creation as layer 2”, “fuzzy rule base structure as layer 3”, “decision-making as layer 4”, and “output defuzzification as Layer 5” [36, 37].

Fig. 1
figure 1

ANFIS architecture based on Sugeno fuzzy model

The five layers of the ANFIS model are described as:

Layer 1 The set of input values is converted into a set of if–then fuzzy rules by the selection of appropriate MFs for each input set. For instance, the general bell-shaped membership function (gbellmf’s) of the adaptive node is defined as:

Membership function of the first parameter:

$$ \mu A_{i} \left( {I_{1} } \right) = \frac{1}{{1 + \left[ {\left( {\frac{{I_{1} - c_{i} }}{{a_{i} }}} \right)^{2} } \right] \times b_{i} }} $$
(1)

Membership function of the second parameter:

$$ \mu B_{i} \left( {I_{2} } \right) = \frac{1}{{1 + \left[ {\left( {\frac{{I_{2} - c_{i} }}{{a_{i} }}} \right)^{2} } \right] \times b_{i} }} $$
(2)

Membership function of the third parameter:

$$ \mu C_{i} \left( {I_{3} } \right) = \frac{1}{{1 + \left[ {\left( {\frac{{I_{3} - c_{i} }}{{a_{i} }}} \right)^{2} } \right] \times b_{i} }} $$
(3)

Membership function of the fourth parameter:

$$ \mu D_{i} \left( {I_{4} } \right) = \frac{1}{{1 + \left[ {\left( {\frac{{I_{4} - c_{i} }}{{a_{i} }}} \right)^{2} } \right] \times b_{i} }} $$
(4)

Here, \(\mu A_{i}\), \(\mu B_{i}\), \(\mu C_{i}\) and \(\mu D_{i}\) are the MFs of the input variables \(I_{1}\), \(I_{2}\), \(I_{3}\) and \(I_{4}\) respectively. The parameter sets \(a_{i}\), \(b_{i}\), and \(c_{i}\), for which the fuzzy membership values vary between 0 and 1. The membership functions change according to the varying values of the parameters and are not constant. The parameters in this layer are called “principle parameters”.

Layer 2 The output signal in this layer is calculated by the multiplication of the input signal with the fuzzy operator as:

$$ w_{i} = \mu A_{i} \left( {I_{1} } \right) \times \mu B_{i} \left( {I_{2} } \right) \times \mu C_{i} \left( {I_{3} } \right) \times \mu D_{i} \left( {I_{4} } \right); i = 1,2,3,4, \ldots $$
(5)

Here, \({w}_{i}\) signifies the “strength” of the fuzzy rule setup. The nodes in this layer are termed “rule nodes” and are fixed.

Layer 3 In this layer, the normalized node strength for every neuron is calculated as:

$$ N_{i} = \frac{{w_{i} }}{{\mathop \sum \nolimits_{i} w_{i} }};{ }i = 1,2,3,4, \ldots . $$
(6)

Layer 4 Here the fuzzy quantity is de-fuzzified. Every node is an adaptive node holding a node function. The combined membership functions and fuzzy rules are represented as:

$$ N_{i} \times F_{i} = N_{i} \cdot(p_{i} \cdot\left( {I_{1} } \right) + q_{i} \cdot\left( {I_{2} } \right) + r_{i} \cdot\left( {I_{3} } \right) + s_{i} \cdot\left( {I_{4} } \right);{ }i = 1,2,3, \ldots $$
(7)

where, \(p_{i}\), \(q_{i}\), \(r_{i}\) and \(s_{i}\) are the parameter set at each node, \(N_{i}\) is the normalized node strength and \(F_{i}\) is the fuzzy rule.

Layer 5 This layer gives the overall output of the system by summation of all incoming signals as:

$$ Overall\;output = { }\mathop \sum \limits_{i} N_{i} \times F_{i} = \frac{{\mathop \sum \nolimits_{i} N_{i} \times F_{i} }}{{\mathop \sum \nolimits_{i} N_{i} }} $$
(8)

3.2 Integrated ANFIS (ANFIS-GA & ANFIS-PSO) model

The ANFIS modelling variables are generally adjusted by steep descend error or least square error methods. However, these two optimization approaches cannot result in the global optimal solution and hence the solution may be trapped in local optima. Also, the algorithmic parameters of ANFIS such as the number of neurons, hidden layers, initial weights, membership functions, etc. are derived by the trial–error approach. To conquer with such limitations, the researchers proposed an integrated approach of ANFIS-GA to enhance the predictability of the ANFIS model to find the global optimal value. In the ANFIS-GA approach, the important parameters such as population size, Maximum number of generations, mutation percentage, crossover percentage, selection, mutation rate, number of fuzzy rules, type and number of membership functions are algorithmic parameter inputs of simulation to estimate the optimal fitness function. While as per the discussions reported in the literature, the most significant ANFIS-PSO algorithmic parameters are the maximum number of iterations, the maximum number of particles, initial inertia weight, inertia weight damping ratio, personal learning coefficient, global learning coefficient, number of fuzzy rules. The main intention of GA and PSO algorithms integration with ANFIS is to adjust such ANN and fuzzy algorithmic parameters to minimize the specified objective function. ANFIS integrated GA approach has been popularly used for solving a variety of engineering problems [38,39,40].

Similarly, the PSO integrated ANFIS model has been also implemented to solve many manufacturing problems in recent investigations [37, 41]. The flowchart of ANFIS-GA integrated algorithm and ANFIS-PSO integrated algorithm are respectively presented in Figs. 2 and 3.

Fig. 2
figure 2

The scheme of ANFIS-GA hybrid algorithm

Fig. 3
figure 3

The scheme of ANFIS-PSO hybrid algorithm

4 Experimental details and measurement of µ-EDM responses

The micro-through holes by µEDM process for the present investigation have been processed on a 3 mm thick aerospace Ni718 superalloy plate by using a 3 axis CNC EDM machine (model: ZNC S430, Acro Machinery & Electric Co., Ltd, Taiwan). The commercially available cylindrical copper electrode of 800 µm diameter was used as the tool electrode. TOTAL EDM3 oil was used as the dielectric fluid and supplied to the machining zone continuously at a flushing pressure of 0.4 kg/cm2 to wash away the formed debris molten at the workpiece-tool gap. The details of the experimental parameters and other machining conditions which are used to perform µEDM holes are given in Table 1. While the chemical composition of the as received workpiece determined by EDS analysis is presented in Fig. 4. The experimental design consists of a total of 36 experimental runs based on the Taguchi’s orthogonal mixed-level design and 3 repetitive and unnecessary tests were not considered for the experimentation [5]. The machine-dependent permissible ranges of µEDM input parameters such as Pulse on time, Current, Voltage and Duty cycle were determined by conducting the pilot experimental runs prior to the conduction of actual machining experiments. Each experiment in the design matrix was conducted for a depth of 3 mm.

Table 1 Details of the experimental parameters
Fig. 4
figure 4

The chemical composition of as-received Ni718 alloy workpiece determined by EDS analysis

Production rate, surface quality, and dimensional accuracy are the most important µEDM process performance evaluation parameters concerned with the quality cut. Therefore, the machining experiments in this study are conducted to measure the material removal rate (MRR), recast layer thickness (RLT), and radial overcut (ROC) as µEDM process responses to evaluate them and predict the process performance. MRR represents the production rate of µEDMed components. The formation of the recast layer on the machined surface alters the metallurgical properties and hardness of surface and subsurface regions which predominantly influence the corrosion and fatigue characteristics of the machined part. Therefore, the RLT represents the quality of µEDMed hole surface integrity. While the ROC represents the dimensional accuracy of the machined hole. MRR (in mg/min) was estimated as the difference in weight of the workpiece before and after making drill per machining time. A high-precision digital weighing machine was used for accurately measuring the weights of the workpiece samples before and after the conduction of machining experiment. The recast layer in the radial direction and machined micro-hole diameter was measured on scanning electron micrographs using an image analyzer software. A total of six readings were taken on each machined hole to estimate recast layer thickness and machined hole diameter individually and their average was considered as the final response values respectively. ROC was estimated as the difference between the initial diameter of the tool electrode and the diameter of the machined hole measured. The readings of RLT and ROC are taken on the top surface of the machined micro-hole. In Fig. 5, the measurement of RLT and ROC on machined micro-hole SEM images is shown. The measured experiment results are reported in Table 2.

Fig. 5
figure 5

Measurement of µEDMed a hole diameter and b recast layer thickness

Table 2 Experimental design and measured response values

5 Results and discussion

Initially, the adequacy of the experimentally measured µEDM process response values was evaluated by ANOVA. Table 3 present the ANOVA of the process responses. The p-value less than 0.05 in Table 3 estimated using ANOVA represent that the postulated models and selected variables are significant to predict the process responses statistically at a confidence level of 95% [42]. Accordingly, based on the results presented in Table 3, all the response models postulated for this investigation are statistically significant. The multiple regression coefficient values (R2 values) in Table 3 for the responses MRR, RLT, and ROC are 0.9898, 0.9992, and 0.9947 respectively. These values indicate that the respective model of MRR, RLT, and ROC can estimate their corresponding measured response with 98.98%, 99.92%, and 99.47% accuracy.

Table 3 Significance of µEDMcontrol variables and fit statistics of response models

5.1 Prediction of µEDM responses by ANN

ANN procedure map the process response between the input process control variables to solve complex problems [43]. The ANN consists of interconnected constituent neural components that are capable learn, discrete, and prepare the data for prediction [44]. The ANN architecture was constructed with input, hidden, and output layers. The neural components in ANN architecture can mimic a human in learning from the provided social event data for prediction. The application of ANN in the recent past investigation has been considered very significant for modeling and prediction of several machining process quality characteristics. In consideration of its computational capabilities, in this investigation, ANN has been used for modeling and prediction of µEDM performance characteristics. The number of neurons in the input layer represents the chosen process control variables and the output layer neurons are the µEDM responses. While the number of hidden layers and the number of neurons in each hidden layer must be derived through trial and error. To construct a perfect ANN architecture for the present problem, MATLAB programming was used. The input layer consists of the chosen four µEDM variables linked to the concealed neuron layer, and the output layers of µEDM responses considered in this work are connected to the shrouded layer. From several trials execution of the program, the determined ANN architecture model for the responses is shown in Fig. 6.

Fig. 6
figure 6

The determined ANN structure 4-19-1 for µEDM performance characteristics modelling and analysis

Each hidden layer in the developed model comprises 19 neurons, four input, and one output neuron. Levenberg–Marquardt feed-forward with back-propagation learning (FFBP) architecture was used for the training of the ANN system for fast and intensive learning [44, 45] and to predict the response values. The estimated value of mean square relative error (MSRE) between the experimental data and the estimated output response values is the performance measure of the constructed FFBP network. MSRE is the average square between the network output values and experimental targeted values. The zero value of MSRE represents the zero error between the targeted values and the predicted values and therefore, a lower value of MSRE is preferred. The MSRE between the experimental and ANN predicted response values corresponding to MRR, RLT, and ROC are 0.0077, 0.0211, and 0.0012 respectively. On the other hand, the regression value (R-value) is another measure of network efficiency. R values estimate the correlation between the targeted (experimentally measured) values and predicted output values by the network. An R-value of 1 represents accurate predictability of the response values by developed network, whereas, the random relationship between the predictive model and targeted values for R value is zero. The simulated correlation coefficients (R values) for this investigation corresponding to the chosen µEDM responses: MRR, RLT, and ROC are 0.9914, 0.9105 and 0.9481 respectively as shown in Fig. 7.

Fig. 7
figure 7

Linear regression analysis between the experimental values and predicted values by FFBP-ANN for training, validation, testing and overall a of MRR, b of RLT and (b) of ROC

The regression coefficient (R2) values estimated for the chosen µEDM responses: MRR, RLT, and ROC are closer to 1 is the indication for that the constructed network mapped the chosen µEDM inputs to its outputs closely and can be used for prediction precisely. Accordingly, the prediction capability of the constructed FFBP network can be evaluated and determined its scope applicability for a machining process prediction [46]. In ANN predictability for the experimental data collected in this investigation is presented in Fig. 8a–c by correlating the experimentally measured data with the ANN predicted data. The figure shows that the predicted response values are in close correlation with their measured values.

Fig. 8
figure 8

Comparison of experimental and ANN-predicted data for a) MRR, b) RLT and c) ROC

5.2 Prediction of µEDM responses by ANFIS

The ANFIS model is integration of ANN learning and fuzzy if–then rule-based decision-making rationale. It develops the mapping relationships among the process response and input data to generate the finest distribution for membership functions. In this section, prognostication of MRR, RLT, and ROC as µEDM output responses by ANFIS is aimed under its operating conditions such as pulse-on time, current, gap voltage, and duty cycle. There are two significant stages in the implementation of the ANFIS model, the creation of pattern vectors is the initial one, while the generation of mapping relationship between the target vectors and their corresponding input condition vectors is the other one. According to the measured experimental data listed in Table 2 for this investigation, out of 33 experimental runs, 24 were randomly selected for training the network, and the rest 9 were selected for testing. In developing the ANFIS model, the input variables of the model such as type and number of MFs and the number of network iterations are essential parameters. However, for a nonlinear process modeling, the appropriate section of these essential parameters is considerably a challenging characteristic. There are no stipulated guidelines or constructive formulations to determine the necessary number of MFs in the literature [47]. Therefore, the number of MFs and their type are determined through the trial-and-error method. Figure 9 shows the derived MFs of the input variables based on trail-error method. The training parameters used for the formulation of the ANFIS predictive model are given in Table 4. Generally, the predictive accuracy of the ANFIS model is evaluated using the root-mean-square error (RMSE). Figure 10a–c shows a good correlation between the developed ANFIS model predicted values based on the selected model training parameters for its formulation and the experimentally measured values for MRR, RLT, and ROC respectively. The results reveal that a good predictive model is constructed based on the employment of appropriate learning and decision-making rationale with the minimum test error to predict the µEDM parameters.

Fig. 9
figure 9

The MFs for a pulse-on time, b current, c gap voltage, and d duty cycle for ANFIS predictive model

Table 4 Training parameters ANFIS predictive models
Fig. 10
figure 10

Comparison of experimental and ANFIS-predicted data for a MRR, b RLT and c ROC

5.3 Prediction of µEDM responses by ANFIS trained by GA and ANFIS trained by PSO

Rather than selecting the ANFIS model input parameters randomly, it is proposed their optimal conditions derivation by the integration of evolutionary based optimization algorithms such as GA and PSO towards the prediction accuracy improvement of the ANFIS model. As evolutionary optimization approaches with their random search, GA and PSO are integrated with ANFIS as the most viable approaches [47]. However, here the appropriate selection of GA and PSO algorithm-centric parameters intern is another significant task an efficient hybrid ANFIS model development.

The scheme of the ANFIS-GA hybrid algorithm is shown in Fig. 2. Table 5 lists the GA algorithm control parameters used for the simulation of the ANFIS-GA algorithm. The values of GA parameters for this investigation are derived based on the several pilot simulations trials and the identified values are the most appropriate for the present problem. According to the available experimentally measured data listed in Table 2 for this investigation, out of 33 experimental data, 24 were randomly selected for the training of the ANFIS-GA algorithm, and the rest 9 was selected for testing. To establish the ANFIS-GA predictive model, code was generated in MATLAB software. The Gaussian MFs are used for the simulation of the ANFIS-GA algorithm as suggested by many authors. Additionally, in the development of ANFIS-PSO, the number of particles, weight damping inertia ratio, initial inertia weight, the number of iterations, social and cognitive acceleration are the important parameters of PSO to optimize [41]. To establish the ANFIS-PSO predictive model, code was generated in MATLAB software.

Table 5 GA and PSO algorithm control parameters used for simulation of ANFIS-GA algorithm

The Gaussian-shaped MFs are considered for this work. According to the available experimentally measured data listed in Table 2 for this investigation, out of 33 experimental data, 24 were randomly selected for the training of the ANFIS-PSO algorithm, and the rest 9 was selected for testing. Table 5 lists the PSO algorithm control parameters used for the simulation of the ANFIS-PSO algorithm. The PSO parameter values were identified as the most appropriate for the present problem and were derived based on several pilot simulation runs and the identified ideal values for PSO parameters such as the number of particles, weight damping inertia ratio, initial inertia weight, the number of iterations, social and cognitive acceleration respectively are 35, 2000, 1, 2, 0.5, and 0.99. The MFs structure corresponding to the ANFIS-GA and ANFIS-PSO are presented in Figs. 11 and 12, respectively. Figure 13a–c and Fig. 14a–c present the correlation of the hybrid ANFIS models such as ANFIS-GA and ANFIS-PSO predicted data with the experimentally measured data of µEDM responses respectively. A closer correlation between the predicted values and their corresponding experimental values can be observed from these figures.

Fig. 11
figure 11

The MFs for a pulse-on time, b current, c gap voltage, and d duty cycle for ANFIS-GA model

Fig. 12
figure 12

The MFs for a pulse-on time, b current, c gap voltage, and d duty cycle for ANFIS-PSO model

Fig. 13
figure 13

Comparison of experimental and ANFIS-GA predicted data for a MRR, b RLT and c ROC

Fig. 14
figure 14

Comparison of experimental and ANFIS-PSO predicted data for a MRR, b RLT and c ROC

5.4 Predictability comparison between ANN, ANFIS, ANFIS-GA, and ANFIS-PSO for µEDM responses

Figure 15a, b is the histograms that show the errors distributed in the training and testing data of predicted and experimental values for MRR predictive models of ANN, ANFIS, ANFIS-GA, and ANFIS-PSO. Similarly, Fig. 16a, b and 17a, b are the histograms that show the errors distributed in the training and testing data of predicted and experimental values for RLT and ROC predictive models of ANN, ANFIS, ANFIS-GA, and ANFIS-PSO respectively. The error distribution in these histograms holds the normal distribution.

Fig. 15
figure 15

Histogram of errors for the selectivity of MRR prediction by ANN, ANFIS, ANFIS-GA, and ANFIS-PSO models: a training data set and b testing data set

Fig. 16
figure 16

Histogram of errors for the selectivity of RLT prediction by ANN, ANFIS, ANFIS-GA, and ANFIS-PSO models: a training data set and b testing data set

Fig. 17
figure 17

Histogram of errors for the selectivity of ROC prediction by ANN, ANFIS, ANFIS-GA, and ANFIS-PSO models: a training data set and b testing data set

The performance of neural network model formulation is generally assessed based on the statistical parameters such as the mean squared relative error (MSRE), root-mean-square error (RMSE), average absolute relative deviation (AARD), and determination coefficient (R2) evaluated between experimentally measured and the predictive model output data [28, 40]. Computation of these statistical parameters is as follows:

$$ {\text{MSRE}} = { }\frac{1}{N}\mathop \sum \limits_{N} \left( {\frac{{X_{pred} - X_{exp} }}{{X_{exp} }}} \right)^{2} $$
(9)
$$ {\text{RSME}} = \sqrt {{ }\frac{1}{N}\mathop \sum \limits_{N} \left( {X_{exp} - X_{pred} } \right)^{2} } $$
(10)
$$ {\text{\% AARD}} = { }\frac{100}{N}\mathop \sum \limits_{N} \left( {\frac{{X_{pred} - X_{exp} }}{{X_{exp} }}} \right) $$
(11)
$$ {\text{R}}^{2} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left[ {X_{pred} - X_{exp} } \right]}}{{\mathop \sum \nolimits_{i = 1}^{N} \left[ {X_{pred} - X_{m} } \right]}},{ }X_{m} = \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left[ {X_{pred} } \right]}}{N} $$
(12)

where, Xpred is the predictive model output value, Xexp is the targeted experimentally measured value, and N is the total number of experimental runs.

The values of MSRE, RMSE and AARD are closer to 0 for the accurate predictive model, whereas the value of R2 is closer to 1. The calculated values of statistical parameters representing the predictability accuracy of the predictive network models proposed in this work for the prediction of the chosen µEDM responses are listed in Table 6. The values of statistical parameters in Table 6 show that the predicted values by using ANFIS-PSO model are closely correlated to the experimentally measured values of the µEDM responses compared with the predicted values by using ANFIS-GA, ANFIS, regression, and ANN models. Accordingly, ANFIS-PSO model is recommended as the more suitable algorithm for the prediction of µEDM responses compared with the other models considered in this investigation.

Table 6 Performances comparison between the developed predictive models

The scatter plots between the data predicted through the developed predictive models and the experimentally measured data are shown in Fig. 18a–c for detailed understanding. The scattered data show a reasonable correlation between the predicted data through all predictive models and experimental data. Considering the three responses, the data scatter of ANFIS-PSO predictive model with experimental data is very less and closer to the line compared to the rest of the predictive models for the training as-well-as testing stage. This explains the precision of the ANFIS-PSO model as superior in comparison with ANFIS-GA, ANFIS, ANN, and regression models for predicting the µEDM performance characteristics.

Fig. 18
figure 18

Comparison of predicted selectivity data against experimental selectivity data for a MRR, b RLT, and c ROC

6 Conclusions

The research work presented the use of a novel hybrid intelligent predictive modeling approach that uses bio-inspired algorithms such as an improved ANFIS with the integration of GA and PSO to predict MRR, RLT, and ROC in the µEDM process. The developed predictive models were assessed statistically to understand their accurate predictability.

The derived conclusions through this investigation are:

  1. 1.

    ANOVA on the experimentally measured date presents the chosen µEDM process control variables are significant for their responses.

  2. 2.

    The postulated regression models for MRR, RLT, and ROC can predict their measured value with an accuracy of up to 98.98%, 99.92%, and 99.47% respectively.

  3. 3.

    The predicted training and testing values for the responses with ANN, ANFIS, ANFIS-GA, and ANFIS-PSO were reasonably in good agreement with their corresponding experimentally measured values.

  4. 4.

    Comparison plots of predicted data with ANN, ANFIS, ANFIS-GA, and ANFIS-PSO and experimental values for all the responses are shown with a good correlation.

  5. 5.

    The histograms show the normal distribution of the errors in training and testing data of predicted data with ANN, ANFIS, ANFIS-GA, and ANFIS-PSO and experimental values for all the responses.

  6. 6.

    The estimated MSRE, RMSE, %AARD, and R2 values of ANFIS-PSO were found to be superior compared with regression, ANFIS-GA, ANFIS, and ANN models.

  7. 7.

    The scatter plots between the data predicted through the developed predictive models and the experimentally measured data also explain the precision of the ANFIS-PSO model as superior in comparison with regression, ANFIS-GA, ANFIS, and ANN models for predicting the µEDM performance characteristics.

  8. 8.

    The computation time noted for the simulation of ANN and ANFIS models was too much lower (less than 1 min) while ANFIS-GA and ANFIS-PSO models were about 45 min.