1 Introduction

Titanium alloy (Cp-Ti grade 2) is used extensively in the biomedical industry for fabricating medical implants, filters, and micro-catheters due to bio-compatibility [1]. In these applications, machining of micro-holes of desired quality is essential [2, 3].

Micro-holes are machined using both conventional and non-conventional micromachining processes. The non-conventional processes include electro-discharge machining, electrochemical machining, abrasive jet machining, laser beam machining, and electron beam machining. Among conventional micro-machining processes, micro-holes are machined using a rotating micro-drill on a stationary workpiece by direct contact in Mechanical Micro Drilling (MMD) process. MMD process is preferred over other non-conventional processes because the better quality holes can be machined in less time [4].

Dry micro-drilling is gaining much importance in many biomedical applications, as there is a requirement for cleaner production [5], and cutting fluids can cause contamination. The biomedical implants make direct contact with the human tissues during their usage [6], and contamination has to be minimized to avoid infections [7].

However executing MMD under dry conditions on Titanium alloy (Cp-Ti grade 2) poses significant challenges as it is a difficult to machine material because of low thermal conductivity, affinity with cutting tools, and strain hardening behaviour. Due to low thermal conductivity high temperatures are generated at chip-tool interface resulting in an accelerated tool wear [8]. Moreover micro-drills used in MMD are slender with a higher flute length to diameter ratio and susceptible to breakages [9]. Therefore, selecting a suitable drilling strategy, process parameters, and tools becomes vital.

Continuous and peck drilling strategies are mainly used for machining in MMD. The temperature at the chip-tool interface decreases with the peck drilling strategy as holes are machined with intermittent feed-in steps [10]. In MMD, peck drilling is observed better than continuous drilling in improving hole dimensional accuracy and roundness due to better chip clearance [11]. Peck drilling is also found effective in improving tool life when machining micro-holes of smaller aspect ratios [12, 13]. Even though the machining time is higher in peck drilling than continuous drilling; it improves tool life by avoiding frequent drill breakages and minimizes tooling costs.

For increasing tool life, selection of pecking depth is critical. In literature, 10% of the drill diameter is suggested for MMD of stainless steel [14], whereas in another study on nickel-based super alloy was varied from 0.05 to 0.5 mm with Ø 0.5 mm drill [15]. Therefore pecking depth selection mainly depends on the combination of tool, work-piece, and machining condition.

The coated micro-drills are used effectively in the dry machining of various engineering materials. The effect of chromium-based multi-layered coatings on tungsten carbide micro-drills was investigated in MMD of Printed Circuit Board (PCB), Cr-Si -Al-N coating with 8.7% silicon content effectively reduced tool wear and improved hole quality [16]. Similarly, another study on MMD of PCB investigated the effect of nitrogen percentages in Zirconium chromium nitride-based coatings on tungsten carbide micro drills. Coating with 17% nitrogen effectively improved tool life and hole quality [17]. Among the various coated tools, the Ti–Al–N coated micro-drills are used successfully in MMD of hard-to-cut materials. Imran et al. [18] investigated the tool wear mechanism in MMD of Inconel 718. They optimized the cutting conditions for increasing tool life using Ti–Al–N coated micro-drills under wet conditions. The tool wear mechanism was analysed, and significant causes were identified as abrasion, microchipping, and adhesion. In another study, Ti–Al–N coated micro-drills were also found effective in the reduction of entry burr height in MMD of Nickel-based super-alloy Nimonic 80A [19].

Many studies in the past were carried out on MMD of titanium alloys on the tool, process parameter selection and optimization of responses. Giorleo et al. [20] evaluated the performance of Al2O3 coated HSS micro-drill of Ø 0.5 mm in MMD of Titanium grade 2 material using direct drilling strategy under dry conditions with tool life and hole roundness as measured responses. The Al2O3 coating effectively enhanced the tool life, whereas it had a negligible effect on hole roundness error.

Guu et al. [21] studied effect of process parameters spindle speed (5000, 12,000, 20,000) rpm, feed (1, 2, 3) mm/min, and tool holder length (10, 15, 20) on responses stress concentration and hole quality in MMD of Ti–6Al–4 V. Experiments were conducted under wet conditions with three different coolants with an uncoated Tungsten carbide micro-drill of Ø 0.2 mm using a direct drilling strategy. A finite element analysis model was developed to determine stress concentration. An increase in stress concentration influenced the hole quality adversely.

Prasanna et al. [22] optimized process parameters such as spindle speed (2000, 3500, 5000) rpm, feed rate (5, 10, 15) mm/min, under air blow condition by varying air pressure (2, 4, 6) bars for output responses like thrust force, radial overcut, circularity, and taper angle in MMD of Ti–6Al–4 V. Air blow condition was selected for experimentation using Ø 0.4 mm uncoated carbide micro-drill adopting direct drilling strategy. Thrust force was influenced mainly by spindle speed and feed, whereas spindle speed and air pressure significantly impacted hole quality.

The effect of different cooling conditions like dry, wet, Minimum Quantity Lubrication (MQL), and cryogenic cooling on responses like thrust force, torque, exit burr height, tool wear, and surface roughness in MMD of Ti–6Al–4 V with tungsten carbide micro-drill of Ø 0.7 mm [23]. Experiments were performed by direct drilling strategy with varying process parameters spindle speed: (1000, 25,000, 5000, and 7500) rpm and feed rates (5, 10, 15, 30, 50) mm/min. Among various cooling conditions, cryogenic was influential in reducing both tool wear and exit burr height. Similarly, the effect of nano-diamond particle-based Minimum Quantity Lubrication (MQL) on responses thrust force, torque, tool wear, and circularity in MMD of Ti–6Al–4 V were investigated [24]. The experiments were carried out using uncoated tungsten carbide micro-drills of Ø 0.3 mm by direct drilling strategy. Nano-particle size and concentration were optimized for responses, and a significant reduction in all the responses was observed with particle size of 35nano-meters, 0.4% weight concentration and feed rate of 10 mm/min. Mechanistic models for responses like exit burr height and thrust force were developed in MMD of Ti–6Al–4 V [25]. The experiments were carried out by direct drilling in wet conditions using an uncoated tungsten carbide micro-drill of Ø 0.3 mm. Experimental results were in good agreement with the results predicted by the model.

Even though much information is available on the MMD of titanium alloys, whenever there is a change in tool type, workpiece material, drilling strategy, and machining conditions, the process parameters must be selected optimally to enhance hole quality and minimize thrust forces. In regular practice, the operator has to perform numerous experiments to identify the optimal parameters. This trial and error approach extends the time needed for preparation before processing and results in resource wastages. Therefore to avoid the drawbacks of the trial and error approach, it becomes necessary to develop appropriate models for predicting output responses quickly and accurately when there is a change in process parameters [26].

Regression analysis is a statistical modelling technique that establishes the relationship between input variables and output responses. It is ideally suited to develop models to predict output responses when many independent variables are present. However, the major disadvantage of adopting statistical modelling techniques is their greater dependence on model-structural assumptions, whether linear or nonlinear, which results in uncertainty in the model's prediction performance [27]. Therefore Artificial Intelligence (AI) based techniques like Adaptive Neural Networks (ANN) and Fuzzy Logic (FL) are adopted for developing predictive models to overcome these disadvantages.

ANN are mathematical simulations of the biological nervous system that mimic its behaviour. They have distributed, simultaneous, and adaptive computing that can map complicated, nonlinear systems, which are difficult to map using regression methods [28]. An ANN is a potent mathematical tool that has demonstrated its effectiveness in various domains [29]. The sample size has a significant impact on the traditional ANN. Usually, a large number of samples are needed to ensure the accuracy of the prediction. Many variants of ANN like GRNN and ANFIS are developed to overcome these limitations.

GRNN is a probabilistic neural network developed by Specht in 1991, mainly applied for regression and function approximation. GRNN is advantageous compared to traditional ANN as only a smaller proportion of the training data is necessary. Because a probabilistic neural network can connect to the underlying function of the data even when using a limited number of training samples [27]. GRNN is based on a one-pass training algorithm; therefore, it speeds up the training process and trains the network more quickly. Unlike traditional feed-forward networks, GRNN estimation can always converge to a global solution [30]. The GRNN is also found very effective in modelling many non-conventional machining process [27, 31].

ANFIS is a hybrid technique that combines the fuzzy behaviour of a fuzzy inference system with the high-learning capabilities of an ANN developed by Jang [32]. The ANN can self-understand the data patterns, and fuzzy logic applies if–then rules to manage existing uncertainty and imprecision. Since it combines both neural networks and fuzzy logic principles, it can capture both benefits in a single frame. In the ANFIS technique, a fuzzy model is developed initially based on rules extracted from the system's input and output data. The ANN is later utilized to fine-tune the fuzzy rules to obtain the ANFIS model. ANFIS models are also used successfully in other fields of mechanical engineering due to its advantages [32,33,34]

In past studies on MMD, statistical and Artificial Intelligence (AI) based methods are used to develop predictive models for responses as presented in Table 1.

Table 1 Statistical and AI based predictive modelling in MMD process

Past studies on predictive modelling in the MMD process reveal that uncoated carbide micro-drill is used. Using a Ti–Al–N coated carbide micro-drill in the MMD process, researchers have not attempted to develop predictive models for thrust force and quality features like burr height and radial overcut. Even though several AI models have been tried in the MMD process, the GRNN modelling technique has not been attempted in spite of its advantages. There are very few research studies that compare the performance of AI-based models and regression models in conventional micro-machining processes.

Therefore, the current study aims to develop GRNN, ANFIS, and MRA models for thrust force, burr height, and radial overcut in MMD of Titanium alloy (Cp-Ti grade 2) using Ti–Al–N coated carbide micro-drills under dry conditions, with cutting speed, feed, and pecking depth as process parameters.

The study has been extended to determine process parameter settings in order to simultaneously minimize responses by multi-response optimization. The experiments are also carried out using an uncoated carbide micro-drill at optimal settings, and the results are compared with Ti-Al –N coated carbide drill.

2 Materials and methods

In the current study, Titanium alloy (CP-Ti grade 2) is selected as workpiece material, its chemical composition is tested using optical emission spectroscopy and presented in Table 2.

Table 2 Chemical composition of Titanium alloy—(Cp-Ti grade 2)

Workpieces of size 40 × 20 × 1 mm are prepared for experimentation.

Commercially available Ti–Al–N coated carbide micro-drills of Ø 0.4 mm are used for experiments, and the specification is shown in Table 3. Figure 1a depicts the image of the micro-drill used in the study. The coating on micro-drill is confirmed using Energy Dispersive Spectroscopy (EDS), and results are depicted in Fig. 1b.

Table 3 Tool specification
Fig. 1
figure 1

Details of Ti–Al–N coated micro-drill used for experimentation

2.1 Experimental design and setup

A full factorial design presented in Table 4 by varying process parameters like cutting speed, feed, and pecking depth in three levels is adopted for experimentation. The levels of process parameters are selected based on preliminary experimentation and the literature survey.

Table 4 Experimental design

Experiments are carried out on a 3-axis vertical machining centre (KMI118 of JYOTI make) and the specification of machine is presented in Table 5. The work-pieces are secured firmly on vice and mounted on a multi-component dynamometer of Kistler make (9257B). Finally, the dynamometer is connected to a charge amplifier type 5070 of Kistler make and data acquisition system compatible with dynoware software. The schematic of the experimental setup is presented in Fig. 2.

Table 5 Specification of machine used for experimentation
Fig. 2
figure 2

Experimental setup

2.2 Measurement procedure for responses

The thrust force, exit burr height, and radial overcut are the three responses measured in this study, and the procedure for measuring them is discussed in this section.

2.2.1 Thrust force

Thrust forces are relatively higher when machined with micro-drills than the larger diameter drills due to longer chisel edge length compared with its diameter [41]. Moreover, the micro-drills are slender with larger flute length to diameter, and larger thrust forces can reduce tool life due to breakages. Therefore it is necessary to minimize thrust forces for increasing tool life.

The thrust force signals are captured at a 1500 Hz/s sampling rate using a multi-component dynamometer. The signals are further amplified using a charge amplifier and transferred to a data acquisition system, and a low pass filter is applied to eliminate noise.

The maximum value of thrust force is considered for each hole [4]. A typical recorded thrust force signal is presented in Fig. 3, and three distinct stages can be identified [42]. In stage 1, a gradual increase in thrust force is observed as the micro-drill contacts the work-piece during entry. As the drill penetrates, thrust force becomes steady, as observed in stage 2. A decrease in thrust force can be observed in stage 3, as the drill exits the work-piece material. Therefore the maximum value is considered for each hole from the thrust force signal for analysis.

Fig. 3
figure 3

Thrust force variation at V = 18.8 m/min, F = 4 µm/rev and P = 75 µm

2.2.2 Burr height

Burrs are undesirable material projections generated on both micro-drill entry and exit surfaces of work-piece. In the drilling of titanium alloys, the exit burrs are larger and more challenging to remove than entry burr. Therefore exit burr formation is of significant interest in the current study, as burr elimination or reduction minimizes manufacturing time and costs.

The burr formation is mainly characterized by burr height and burr thickness; among them, burr thickness remains stable and does not vary much with process parameters [43] compared to burr height. Therefore burr height is characterized and measured using a non-contact 3D profiler of VEECO make. The hole is scanned, and maximum burr height is considered [44] as indicated on the digital scale for analysis, Fig. 4 shows a typical burr height measurement.

Fig. 4
figure 4

3D profiler burr height measurement

2.2.3 Radial overcut

The machined holes are characterized for dimensional accuracy by measuring radial overcut. The diameter of machined holes on the entry side is larger than the exit side due to wandering motion [21, 24] and therefore considered for analysis. The images of machined holes are captured using a stereomicroscope, and diameter is measured by selecting 8 points on the hole periphery using image processing software Image–j [45]. The radial overcut is determined using Eq. 1, based on the measured hole diameter on entry side Dentry and the tool Dtool [46]

$$ Radial\;overcut \left( {\mu m} \right) = \left( {\frac{{D_{entry } - D_{tool} }}{2}} \right) $$
(1)

2.3 Generalised regression neural network (GRNN)

GRNN model mainly consists of four different layers such as input layer, pattern layer, summation layer, and output layer. The input layer feeds the input to the subsequent layer. The pattern layer determines the Euclidean distance and activation function. The number of neurons in this layer will be similar to the number of experiments used to train the network. The summation layer is made up of two subparts (that is the numerator and a denominator part of Eq. 2). The output layer consists of a single neuron that calculates the output by dividing the numerator and denominator parts of the summation layer. The typical structure of GRNN model developed in the current study is shown in Fig. 5.

Fig. 5
figure 5

Block diagram of GRNN architecture

The GRNN model predicts based on the Eq. 2.

$$ \hat{Y}\left( X \right) = \left( {\frac{{\mathop \sum \nolimits_{{\begin{array}{*{20}c} {i = 1} \\ \end{array} }}^{{\begin{array}{*{20}c} n \\ \end{array} }} Y_{i} exp\left( { - \frac{{D_{i}^{2} }}{{2\sigma^{2} }}} \right)}}{{\mathop \sum \nolimits_{{\begin{array}{*{20}c} {i = 1} \\ \end{array} }}^{{\begin{array}{*{20}c} n \\ \end{array} }} \exp \left( { - \frac{{D_{i}^{2} }}{{2\sigma^{2} }}} \right)}}} \right) $$
(2)

where \(\hat{Y}\left( X \right)\) = output predicted, X = input, Yi = Output of the input sample I, D i2 = Euclidean distance from the X and Xi, n = number of experiments, σ = spread parameter

The GRNN model output is calculated using the weighted average of the training dataset's outputs. The weights are computed based on the Euclidean distance calculated between the training data and test data. The Euclidean distances are calculated from training sample Xi using Eq. 3.

$$ D_{i}^{2} = \left( {X - X_{i } } \right)^{T} \left( {X - X_{i} } \right) $$
(3)

2.4 Adaptive neuro fuzzy inference system (ANFIS)

The general structure of the ANFIS model with three inputs (cutting speed, feed, and pecking depth) and single-output (thrust force) is presented in Fig. 6.

Fig. 6
figure 6

Block diagram of ANFIS architecture

The ANFIS model consists of five layers; the initial layer is the fuzzification layer and consists of adaptable nodes. In order to obtain fuzzy datasets from input values, the fuzzification layer employs the Membership function (MF).

Many MF are available in ANFIS modelling, such as Trapezoidal and triangle MF, which are linear MF. Similarly, the Gaussian MF function processes non-linear input data. The combinational MF that combines different MF’s is also available. The type of MF affects the model prediction accuracy [47] and therefore, a suitable selection must be made.

Pi-membership function, abbreviated as Pi-MF, is selected for model development in the current study. Pi-MF is spline-based, pi-shaped, and combines an S-shaped and a Z-shaped MF and is shown in Eq. 4.

$$ \begin{array}{*{20}l} {\mu_{Ai} \left( x \right) = { }PiMF{ }\left( {{\text{x}};{\text{ a}},{\text{ b}},{\text{ c}},{\text{d}}} \right) = { }0,} \hfill & {{\text{x}} \le {\text{a}}} \hfill \\ {2\frac{{\left( {x - a} \right)}}{{\left( {b - a^{2} } \right)}},} \hfill & {a \le x \le \frac{a + b}{2}} \hfill \\ {1 - 2 \left( {\frac{a - b}{{b - a}}} \right)^{2} ,} \hfill & {\frac{a + b}{2} \le x \le b} \hfill \\ {1,} \hfill & {b \le x \le c} \hfill \\ { 1 - 2 \left( {\frac{x - c}{{d - c}}} \right)^{2} ,} \hfill & {c \le x \le \frac{c + d}{2} } \hfill \\ { 2 \left( {\frac{x - d}{{d - c}}} \right)^{2} ,} \hfill & {\frac{c + d}{2} \le x \le d} \hfill \\ {0,} \hfill & {x \ge d} \hfill \\ \end{array} $$
(4)

where a, b, c, d terms are called premise parameters, and the learning algorithm minimizes the training error by optimizing these terms.

The second layer in the ANFIS structure is referred to as the rule layer and it consists of a fixed number of nodes. This layer is mainly responsible for generating firing strength (Wi) for the rules using membership values computed in the previous layer. The values Wi are determined by multiplying the membership values as shown in the Eq. 5.

$$ W_{i} = \mu_{Ai} \left( x \right).\mu_{Ai} \left( y \right).\mu_{Ai} \left( z \right) i = 1,2,3 \ldots .27 $$
(5)

where x, y, z are inputs µAi(x), µAi(y) and µAi (z) are membership functions and i is the number of rules generated (27 in the current study).

The third layer is the normalization layer, which determines each rule's normalized firing strengths. The normalized value for the ith rule is calculated as shown in Eq. 6.

$$ \overline{{W_{i} }} = \frac{{W_{i} }}{{\mathop \sum \nolimits_{i} W_{i} }} i = 1,2,3 \ldots .27 $$
(6)

The activation function layer is the fourth layer and each node of this layer determines the weighted values of rules by multiplying with a first-order polynomial as shown in Eq. 7.

$$ \overline{{W_{i} }} . f_{i} = W_{i} \left( {p_{i } x + q_{i } y + r_{i } z } \right) $$
(7)

where \(\overline{{W_{i} }}\) is normalisation layer output and fi is activation function.

The nodes in this layer are adaptive and where constants in equation pi, qi and ri (called as consequence parameters) have to be adjusted by the learning algorithm.

Finally, the fifth layer is referred to as the summation layer or overall output layer, and the final output of ANFIS is determined by summing the individual outputs of each rule of defuzzification layer as shown in Eq. 8.

$$ \mathop \sum \limits_{i = 1}^{27} \overline{{W_{i} }} f_{i} = \frac{{\mathop \sum \nolimits_{i = 1}^{27} W_{i } * f_{i} }}{{\mathop \sum \nolimits_{i = 1}^{27} W_{i} }} $$
(8)

2.5 Multiple regression analysis

The Multiple Regression Analysis (MRA) combines statistical and mathematical techniques to analyse the effects of many input process parameters on output responses. In MRA, the quadratic models of second order are found effective in modelling responses in MMD compared to bilinear models [35]. The second order quadratic models are also effective in predicting responses in others domains of mechanical engineering [32]. Therefore quadratic models of second-order are developed for the responses based on the Eq. 9 [48]

$$ Y = \beta_{{0{ }}} + \mathop \sum \limits_{i = 1}^{k} \beta_{i} x_{i} + \mathop \sum \limits_{i = 1}^{k} \beta_{ii} x_{i}^{2} + \mathop \sum \limits_{j > 1}^{k} \beta_{ij} x_{i} x_{j} $$
(9)

where y = output response, k = number of input process parameters, xi, xj input or process parameters, β0 is constant, βi, βii, βij are coefficients of linear, square, and interaction terms.

3 Results and discussion

The experiments are carried out in three trials. The average value of measured responses is determined for each experiment and presented in Table 6.

Table 6 Experimental results –Training data

The stereo microscope images of machined holes on the entry side of the workpiece under different experimental conditions are depicted in Fig. 7.

Fig. 7
figure 7

Stereo microscope images of machined holes on the entry side

The data in Table 6 is used for training the GRNN and ANFIS models. The additional 10 data sets are generated by performing experiments at random points (not considered in the initial experimental design) to test and validate models and are presented in Table 7.

Table 7 Experimental results additional –testing and validation data

The prediction accuracy of the developed model is influenced by the spread parameter (σ) value in the GRNN model and the type of MF in the ANFIS model. Therefore to minimize prediction errors, the datasets from experiments 1–5 in Table 7 are utilized to determine the optimal value of spread parameter in GRNN models and the best type of MF in ANFIS models. The developed models are validated using the datasets from experiments 6–10.

3.1 Predictive models

In the current study three modelling techniques are used to establish relationship between process parameters and responses. The performance of AI based GRNN, ANFIS models are compared with regression models developed by MRA.

3.1.1 GRNN modelling

In the initial step, the spread parameter “σ” is selected, and from Eq. 2 used for prediction in GRNN modelling, “σ” is the only unknown parameter. The selection of “σ” influences the prediction accuracy of GRNN models. The high value of “σ” can result in under fitting, and the low-value results in overfitting. Therefore an optimal value has to be selected for the best performance of GRNN models.

No intuitive methods are available to choose the optimal spread parameter, and a trial and method must be applied [27, 30]. Since many computations are needed to calculate one set of variables, a Matlab spread parameter selection function is used for the implementation. The training data set is used as input (experimental data in Table 6), and a trial error method is applied to predict responses at different “σ” and compared with the test data set (from experiments 1–5 in Table 7) for calculating mean absolute errors. The variation of mean absolute error with spread parameter for thrust force, burr height, and radial overcut models is depicted in Fig. 8.

Fig. 8
figure 8

Mean absolute error variation with spread parameter for test data

The spread parameter value (σ) at which minimum mean absolute error is observed represents the optimal value and is selected for developing the GRNN model. Therefore σ = 1.5 is selected for developing the thrust force model, and similarly, σ = 1.35 and σ = 0.15 are selected for developing burr height, and radial overcut models.

GRNN models are developed for thrust force, burr height and radial overcut using selected “σ” values. The correlation between experimental values and their corresponding GRNN model predicted values for training data are depicted in Fig. 9.

Fig. 9
figure 9

Correlation between experimental values and GRNN model predicted values for training data

From Fig. 9a, b, and c, it is observed that R2–values are 1 for thrust force, burr height, and radial overcut models. Because the R2-value of data is equal to a perfect fit value, the trained network model's accuracy is high. Finally developed GRNN models are validated using validation data set (from experiments 6–10 in Table 7).

3.1.2 ANFIS modelling

ANFIS models are developed using Matlab 2021a based on experimental data in Table 6. In the initial step, the input data are normalized to get values ranging from 0 to 1 to compute ANFIS models.

In the next step, the Fuzzy Inference System (FIS) is selected, and in the ANFIS package, Sugeno and Mamdani FIS are commonly used to develop models. In Sugeno-FIS, the weighted average is used to de-fuzzify output, which reduces processing time and also improves computational accuracy [49]. Hence the Sugeno FIS is selected in the current study, and later developed models are trained by selecting parameters and optimization method.

For every input, three membership functions are selected, 27 rules are generated, and 550 epochs are used to train models [50]. Back-propagation and hybrid methods are currently available in the ANFIS package for training. The back-propagation method uses a gradient descent algorithm, which means that errors from each epoch are sent back to the network; this serves as feedback in self-correcting and self-learning [51]. Therefore, in current study back-propagation method is adopted for training.

In the final step, ANFIS models are developed by selecting a suitable Membership Function (MF). The current study evaluates different MF to select an MF that minimizes the prediction errors. After preliminary screening, trapezoidal-MF, Pi-MF, P-sigma-MF, and D-sigma-MF are considered for evaluation among the various MF provided in the ANFIS package. ANFIS models are developed for output responses such as thrust force, burr height, and radial overcut by providing the model’s training data set as input (experimental data in Table 6). The type of MF is selected by comparing model predictions from various types of MF with actual experiment values in the test data set (from experiment 1–5 in Table 7). The prediction error is calculated using Eq. 10.

$$ \begin{aligned} & Prediction\;error \left( \% \right)\\ & \quad = \frac{Experimental\;value - predicted\;value}{{Experimental\;value}} \end{aligned} $$
(10)

The calculated absolute prediction error for thrust force, burr height and radial overcut models for different MF’s is shown in Table 8. From Table 8, the minimum value of error is observed with Pi-MF.

Table 8 Prediction errors in ANFIS models with different membership functions

The Fig. 10a shows the training process and selected parameters for thrust force model. The rule viewer in Fig. 10b depicts the prediction for experiment 1 in the test data set with the Pi-MF type in thrust force model.

Fig. 10
figure 10

ANFIS model of thrust force

The ANFIS models developed for thrust force, burr height and radial overcut by selecting PI- MF is considered for prediction. The developed models are validated using validation data set (from experiment 6–10 in Table 7).

3.1.3 Multiple regression analysis (MRA) modelling

The quadratic second-order regression models are developed using Minitab 17 statistical software. The actual values of the process parameters cutting speed (V), feed (F), and pecking depth (P) is transformed to coded units using Eqs. 11, 12 and 13.

$$ X_{1} = \left( {\frac{{V - V_{0} }}{\Delta V}} \right) $$
(11)
$$ X_{2} = \left( {\frac{{F - F_{0} }}{\Delta F}} \right) $$
(12)
$$ X_{3} = \left( {\frac{{P - P_{o} }}{\Delta P}} \right) $$
(13)

where X1, X2 and X3 are coded cutting speed V, feed F and pecking depth P, V0, F0 and Po represent values of V, F and P at zero levels. ΔV, ΔF and ΔP are the interval of variation of V, F, and P respectively.The models developed for thrust force, burr height, and radial overcut in coded units is presented as Eqs. 14, 15, and 16.

$$ \begin{aligned} Thrust\;force &= 11.592 - 0.904X_{1} + 1.334X_{2}\\ & \quad + 0.787X_{3} - 0.118X_{1} *X_{1} \\ &\quad - 0.207X_{2} *X_{2} - 0.518X_{3} *X_{3}\\ & \quad -0.417X_{1} *X_{2} + 0.006X_{1} *X_{3} \\ & \quad + 0.118X_{2} *X_{3} \\ \end{aligned} $$
(14)
$$ \begin{aligned} Burr\;height &= 32.380 - 0.916X_{1} + 1.315X_{2}\\ & \quad + 1.612X_{3} - 0.584X_{1} *X_{1} \\ &\quad + 0.314X_{2} *X_{2} - 0.129X_{3} *X_{3}\\ & \quad -0.033X_{1} *X_{2} + 0.053X_{1} *X_{3} \\ & \quad + 0.184X_{2} *X_{3} \\ \end{aligned} $$
(15)
$$ \begin{aligned} Radial\;overcut &= 14.184 + 1.081X_{1} + 0.717X_{2} \\ & \quad + 0.580X_{3} - 1.219X_{1} *X_{1} \\ &\quad + 0.115X_{2} *X_{2} - 0.159X_{3} *X_{3} \\ & \quad + 0.914X_{1} *X_{2} - 0.235X_{1} *X_{3} \\ & \quad - 0.073X_{2} *X_{3} \\ \end{aligned} $$
(16)

3.2 Comparison of developed models

The GRNN, ANFIS, and regression models developed for output responses thrust force, burr height, and radial overcut are used to predict at random points considered for validation (experiments 6–10 in Table 7). The prediction errors are determined by comparing GRNN. ANFIS and MRA model predicted values with actual experiment results using Eq. 10. The predicted responses from developed thrust force, burr height, and radial overcut models for testing and validation data are presented in Tables 9, 10, and 11 respectively.

Table 9 Predicted thrust force in testing and validation
Table 10 Predicted burr height in testing and validation
Table 11 Predicted radial overcut in testing and validation

The calculated MAPE for the GRNN, ANFIS and MRA models for testing and validation data is depicted in Fig. 11.

Fig. 11
figure 11

Comparison between GRNN, ANFIS and MRA predicted values for testing and validation data

In thrust force model from Fig. 11a, the Maximum Absolute Prediction Error (MAPE) of 4.92%, 5.26%, and 11.69% with GRNN, ANFIS, and regression models respectively is observed. Similarly in burr height model in Fig. 11b, MAPE of 4.41%, 5.43%, and 9.27% are observed respectively with GRNN, ANFIS, and MRA models. Figure 11c depicts MAPE in radial overcut model of 4.68%, 5.22%, and 11.71% respectively with GRNN, ANFIS, and MRA models.

To the best of our knowledge, the literature is not available on the simultaneous development of GRNN and ANFIS and MRA prediction models to predict responses like thrust force, burr height, and radial overcut in the MMD process. Very few research works have been reported on applying AI tools like ANN and ANFIS in the MMD process, and the prediction results for responses of past studies are presented in Table 12.

Table 12 Prediction results of ANN and ANFIS models from past studies on MMD process

In past studies on GRNN modelling in other machining processes, a mean prediction errors of 2.479% and 1.75% for kerf width and surface roughness in laser machining of stainless steels [26]. The mean prediction errors of 1.34% is observed for radial overcut in electro discharge machining of AISI D2 tool steel [21].

In past studies on ANFIS modelling in other machining processes, the mean prediction errors of 8.67% for material removal rate, 3.20% for tool wear rate, and 13.44% for diametric overcut in micro-electro discharge machining of silver [51]. The mean absolute prediction errors for material removal rate, surface roughness, and tool wear rate of 4.33%, 7.49%, and 6.37% in micro-electro discharge machining of stainless steel [52].

In the current study, mean prediction errors of 3.40% and 3.96% are observed in the prediction of thrust force in GRNN and ANFIS models for testing and validation data, respectively. The mean prediction errors of 2.67% and 3.19% are observed in the prediction of burr height in GRNN and ANFIS models for testing and validation data, respectively. Similarly, mean prediction errors of 2.53% and 2.84% are observed in the prediction of radial overcut in GRNN and ANFIS models for testing and validation data, respectively. The mean prediction errors observed in GRNN and ANFIS are in similar ranges of less than 5%, as observed in the literature.

3.3 Influence of process parameters on responses

The ANOVA is performed at the desired confidence level of 95% to identify the significant process parameters and their contribution towards responses. The results of ANOVA are shown in Table 13.

Table 13 ANOVA results

The bar charts are constructed using the average values of thrust force at each level of process parameters and depicted in Fig. 12.

Fig. 12
figure 12

Mean variation of thrust force with cutting speed, feed and pecking depth

From Fig. 12a illustrates a change in cutting speed from a lower to a higher level results in a decrease of 12.91% in thrust force. Higher cutting speeds cause thermal softening of the work material, which requires less force for chip removal, resulting in a lower thrust force. On the contrary, increasing the feed from a lower to a higher level resulted in a 27.72% increase in thrust force. This trend can be attributed to increased chip load at the cutting edge due to increasing feed. On the other hand, the thrust force increased by 15.63% as the pecking depth increased from lower to higher levels. Chip clogging increases as pecking depth increases, resulting in more friction between drill flutes and hole wall, resulting in higher thrust force.

The ANOVA results in Table 13a feed; cutting speed, pecking depth, and the interaction term of cutting speed and feed are significant towards thrust force with a P-value of less than 0.05. Feed had a significant contribution of 46.52% towards thrust force followed by cutting speed, pecking depth, and interaction of cutting speed-feed at 21.34%, 16.19%, and 3.02%.

The bar chart in Fig. 13a indicates a 4.16% decrease in exit burr height as cutting speed increases from lower to higher levels. With higher cutting speeds, chip height decreases, leading to a reduction in burr heights [23]. On the contrary, an increase in burr height with a rise in feed and pecking depth is observed in Fig. 13b and c. This increasing trend in burr height can be attributed to the elevation of thrust forces during tool exit at higher feed and pecking depth. An increase in the feed from lower to higher levels results in an 8.51% increase in exit burr height. Similarly, a rise in pecking depth from lower to higher levels causes a 10.58% rise in exit burr height.

Fig. 13
figure 13

Mean variation of burr height with cutting speed, feed and pecking depth

From the ANOVA results in Table 13b, linear terms of pecking depth, cutting speed, and square term of cutting speed are significant towards exit burr height with P-value less than 0.05. The pecking depth significantly contributed with 45.12% towards exit burr height followed by feed, linear cutting speed, and square term of cutting speed at 30.04%, 14.56%, and 1.98%, respectively.

From Fig. 14a, a more significant radial overcut is observed with higher cutting speed due to increased drill wandering motion during entry. An increase of 19.21% in radial overcut is observed due to a rise in cutting speed from lower to higher levels.

Fig. 14
figure 14

Mean variation of radial overcut with cutting speed, feed, and pecking depth

A similar trend is also observed with a change in feed and pecking depth in Fig. 14b and c. With the increase in feed, chip size also becomes larger [5], and due to this, chip evacuation becomes difficult, resulting increase of lateral forces causing more overcut (similar results were observed in MMD of nickel-based super-alloy by Zhou et al. [53]. An increase of 11.32% is observed in radial overcut by a rise in the feed from lower to higher levels.

With higher pecking depth, chip length also increases, resulting in more wandering motion due to chip entangling with the flute, and therefore more radial overcut is observed on the hole entry side. The increase of 9.13% in radial overcut is observed due to a rise in pecking depth from lower to higher levels.

From the ANOVA results in Table 13c, the linear terms of cutting speed, feed, pecking depth, square term of cutting speed, and the interaction term of cutting speed and feed are significant towards radial overcut with P-value less than 0.05. Linear cutting speed has significantly contributed with 35.19% followed by interaction term of cutting speed and feed, linear term of feed, square term of cutting speed, and linear term of pecking depth at 16.77%, 15.48%, 14.91%, and 10.14%, respectively.

3.4 Multi-response optimization

From the previous section, it is observed that a unique optimal solution to optimize all responses is not available; therefore multi-response optimization is carried out to obtain a single optimal solution. The Desirability Function Analysis (DFA) method is implemented for multi-response optimization. In DFA, desirability values ranging from 0 to 1 are calculated using desirability functions. In the current study, responses like thrust force, exit burr height, and radial overcut should be minimized; hence smaller the better desirability function is used. A composite desirability value is determined by assigning equal weights to the desirability values of individual responses. The process parameter setting where the maximum composite desirability value is observed represents an optimal condition.

The results of DFA are presented in Fig. 15. The optimal process parameter setting is obtained at cutting speed of 18.8 m/min, feed of 4 µm/rev, and pecking depth of 25 µm.

Fig. 15
figure 15

Multi response optimization plot

3.5 Comparison of Ti–Al–N coated and uncoated tools

In this section, 20 holes are machined at fixed condition of process parameters, at optimal cutting speed 18.8 m/min, feed 4 µm/rev, and pecking depth 25 µm obtained previously from DFA. An uncoated carbide micro-drill of same length and similar geometry is used to machine 20 holes at optimal conditions. The performance of both the drills are analysed in terms of thrust force, exit burr height and radial overcut. This provides an insight on the influence of tool wear progress towards the responses while machining at a fixed condition (optimal process parameter settings).

The Fig. 16 shows an increase in thrust force with the number of holes machined for both the micro-drills. A similar trend can also be observed in exit burr height, and radial overcut.

Fig. 16
figure 16

Variation of responses with number of holes machined

The coefficient of variation is calculated for comparing the performances of micro-drills based on the variability in measured response values of twenty holes machined. The Coefficient of Variation (CV) is a statistical measure is used; CV represents variability in sample of measured values of individual responses. CV is calculated using Eq. 17 in percentage [54].

$$ CV \left( \% \right) = \frac{standard\; deviation\;of\;sample}{{Mean\;of\;sample}} \times 100 $$
(17)

The CV of 6.59% and 5.43% in thrust force, 13.67% and 11.65% in burr height, 10.04% and 9.72% in radial overcut with uncoated and Ti–Al–N micro-drill are observed, respectively. The Ti–Al–N coated micro-drill displayed a better performance as the lower coefficient of variation is observed compared to the uncoated tool for all responses.

3.5.1 Tool wear

The tool wear after machining twenty holes using both uncoated and the Ti–Al–N coated micro-drills is also compared and image of the tooltip before and after machining is shown in Fig. 17.

Fig. 17
figure 17

Stereo microscope images of tool tip

From Fig. 17, the dominant wear mechanism observed is adhesion and abrasion. The adhesion of work material due to affinity with the tool is observed more with the uncoated tool than Ti–Al–N coated tool. The low thermal conductivity of Titanium alloy (Cp-Ti grade 2) and machining under dry conditions also contributes to high chip tool interface temperature causing adhesion wear [8].

The wear on the flank surface is observed in both the tools due to material adhesion and abrasion during machining. The flank wear is measured after machining twenty holes using a stereomicroscope, the maximum flank wear value of 28.09 µm for uncoated micro-drill and 19.30 µm for Ti–Al–N coated micro-drill are observed. The magnitude of flank wear is found more in the uncoated micro-drill, and clear evidence of chipping of cutting edges is observed.

The Fig. 17 also reveals the chisel edge wear in both the micro-drills after machining, caused mainly due to abrasion and blunting of chisel edge is also noticed.

3.5.2 Chip morphology

The current study is concluded with an analysis of chip morphology. The morphology of the formed chips determines the ease with which the drilling process is carried out. In orthogonal cutting processes such as turning and milling formation of chips ends as they leave the cutting edges, whereas in drilling chips flows along the flutes even after they are formed. Due to the complex shape of the drill with spiral flutes, the chips are not uniformly generated. The chips formed during holes machined at optimal settings to study tool wear are collected. The SEM images of typical continuous spiral chips generated are presented in Fig. 18.

Fig. 18
figure 18

Chips formed at V = 18.8 m/min, F = 4 µm/rev and P = 25 µm

In the current study, continuous spiral chips are formed with uncoated micro-drill whereas squeezed continuous spiral chips are formed with Ti–Al–N coated micro-drill. As shown in Fig. 18a, squeezed spiral chips are observed with the uncoated micro-drill can be attributed to high temperature at the chip tool interface during machining. Therefore chips formed get welded to the cutting edge, thus obstructing the flow resulting in squeezed spiral chips entangling between the flutes and work-piece surface [55]. The pitch between the spirals is also lesser due to the squeezing; hence the chips do not break easily. These reasons eventually cause an increase in thrust forces due to difficulties in chip evacuation and also elevate wandering motion causing more radial overcut.

On the contrary, with the Ti–Al–N coated micro-drill, chips formed due to deformation flow easily in a continuous spiral form, as shown in the Fig. 18b, which is easier to evacuate. Therefore, the coating on the tool is found effective in reducing the temperature at the chip tool interface. Moreover, the chips have a larger pitch between spirals and breaks, easily indicating ease of machining hence generates lesser thrust forces than uncoated tool.

4 Conclusion

The mechanical micro-drilling process is carried out in the current study under dry conditions to machine micro-holes on Titanium: Cp-Ti grade-2 using Ø 0.4 mm Ti–Al–N coated micro-drills. The current study develops predictive models for thrust force, burr height, and radial overcut using Generalised Regression Neural Network (GRNN), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Multiple Regression Analysis (MRA) techniques. The experimental data generated based on the full-factorial design by varying cutting speed, feed, and pecking depth is used to train models. The model's performance is evaluated based on data generated by additional experiments for testing and validation. The major findings of the study are:

  • Both GRNN and ANFIS models are more effective in predicting all responses than MRA models. Among them, GRNN models showed better performance with Maximum Absolute Prediction Error (MAPE) of 4.92%, 4.41%, and 4.68% in predicting thrust force, burr height, and radial overcut compared to the ANFIS model with MAPE of 5.27%, 5.17%, and 5.22%, Whereas MRA models predicted with MAPE of 11.72%, 9.27%, and 11.71% respectively.

  • The effects of process parameters on output responses is analyzed using ANOVA. The results suggest that feed and pecking depth played a significant role in increasing thrust force and burr height, respectively while cutting speed increases the radial overcut considerably.

  • The multi-response optimization is carried out using DFA and optimal process parameter settings to simultaneously minimize responses is obtained at cutting speed of 18.8 m/min, feed of 4 µm/rev, and pecking depth of 25 µm. The average thrust force of 6.65 N, average burr height of 32.40 µm and radial overcut of 12.60 µm.

  • At optimal parameters, the performance of the Ti–Al–N coated, uncoated micro-drills is evaluated by carrying out additional experiments. Compared to an uncoated micro-drill, the Ti–Al–N coated micro-drill showed a reduction in average thrust force from 8.83 to 6.65 N (24.68%), average burr height from 36.33 to 32.40 µm (10.81%), and average radial overcut from 15.64 to 12.61 µm (19.36%).

  • In the tool wear analysis and chip formation, Ti–Al–N coated micro-drill effectively reduced flank wear from 28.09 to 19.30 µm (31.29%) compared to the uncoated micro-drill. Continuous spiral chips are observed with Ti–Al–N coated micro-drill, indicating ease of machining compared to squeeze continuous spiral chips formed with uncoated micro-drill.

The current research can be extended to model other quality attributes like cylindricity and surface roughness under minimum quantity lubrication and other near-dry machining conditions.