1 Introduction

Power system protection is a crucial problem in modern power system for secured and reliable operation of the system. The electrical utilities have been adopting numerous techniques to reduce the number of line outages due to short circuit. In this context, many researches were carried out to detect and classify the type of fault which leads to location and isolation of fault, thereby reducing the outages in both transmission and distribution system. One of the troublesome abnormalities that occur in the system is HIF. This type of fault occurs mostly in the distributor feeder network, when the energized conductor has in contact with high resistance surface such as tree limb, wet and dry sand, and wet and dry asphalt. It exhibits the asymmetry, intermittence and nonlinear arcing characteristics with the magnitude of fault current of about 0–75 A in an grounded system [1,2,3,4]. Normally, the over current relay fails to detect the HIF in the system because of the lower current magnitude, leading to the cascading failure of the system and causing risk to the people and their properties [5]. A mathematical model for designing the HIF model in the form of nonlinear partial differential equation for studying the effect of HIF in distribution system is presented in [6]. In addition, many works were carried out for designing the HIF model [7]. Another approach of live tree-related HIF model is suggested [8], to study the effect of environmental conditions in real time and biological classification using finite element method (FEM). A least square approach with advanced numerical techniques is used to locate the HIF in the distribution system [9].

Linking with the previously published work, several studies were carried out to discriminate and identify the power system disturbances such as capacitor switching, symmetrical, unsymmetrical HIF and another transient phenomenon. Nevertheless, significant importance is given to detect the HIF that occurs in the system by the utilities. The conventional scheme of detection of single-phase HIF in low-resistance grounded distribution network (LRGDN) with the underlying perception of variation of composite power angle and its magnitude, voltage–current characteristics profiles during normal and fault condition is studied [2, 10]. In some cases, the conventional numerical methods fail to identify the HIF due to selection of improper step size. It can be overcome by the powerful signal processing techniques. Short-time Fourier transform (STFT) based on lower-order harmonics represented by its magnitude and phase was used to identify the HIF from other system disturbances such as capacitor switching and feeder energizing [11]. A mathematical morphology and WT were used to extract the high-frequency component of the three-phase current signal to locate the power system faults from the other transients using the energy index [1, 12]. The detection of HIF using DWT with the mother wavelet of Daubechies wavelet db4 from the signal comprising of another non-fault transient event is presented in [13, 14]. The continuous wavelet transform (CWT) to extract the feature for learning the extreme learning machine (ELM) type of neural network to identify the HIF and current transformer saturation using the high-speed communication techniques for the co-ordination of the relay has also been worked out [15]. Moreover, WT and other time–frequency analysis in combination with pattern classifier, ANN and power line communication techniques were used to locate the fault [16,17,18]. ANFIS is one of the major trade-offs among ANNs and fuzzy logic systems, offering smoothness due to the fuzzy control interpolation and adaptability due to the ANN for classification problem [19,20,21,22,23]. On the flipside, SVM-based intelligence classifier is more efficient than neural network method because of its classification with few samples and uses the raw data without pre-processing [24,25,26].

Among the various feature extraction methods, wavelets are gaining the popularity in many works [27,28,29,30,31] due to its capability of reliable feature decomposition in frequency and localization in time even in the presence of noise. Therefore, in this work DWT analysis was used to extract the features and the classifiers such as ANFIS and SVM were used to detect the occurrence of HIF in the system and to discriminate them from other transients.

The prime contribution of this paper is as follows:


HIF detection In this paper, the detection of HIF is carried out by analysing the three-phase current signal of the power system using DWT analysis to extract the features such as SD for training the classifiers.


HIF identification In this step, the SD features drawn were used to train the computational intelligence-based classifiers with different possibilities of power system disturbances, also considering the time-varying fault resistance to prove the effectiveness of the classifier.

The paper is organized as follows: Sect. 2 presents the system model studied and proposed methodology to detect the HIF in the system. Section 3 deals with the data acquisition of current signal using DWT analysis to extract the features for classification. Section 4 explains the various computational intelligence-based classifiers such as fuzzy, Bayes, MLP, ANFIS and SVM for classifying the fault. Results and discussion is portrayed in Sect. 5. Finally, the conclusion and future scope are made in the last part of the paper.

2 System studied and methodology

In order to validate the proffered method of classifier technique, it is necessary to accumulate the field data signals from the power system to train the classifier with different operating conditions of the system. To obtain the field data in real time for power system is quite expensive, so a real-time medium-voltage (MV) distribution feeder network in Spain [17] is considered for validating the proposed classifiers simulated in MATLAB for obtaining the fault current data under different possible disturbances.

2.1 Distribution network model

The MV feeder network configured in Spain consists of substation transformer with 5 feeder lines. But in the proposed work only 4-feeder network (F1, F2, F3 and F4) is considered for the simulation assuming the power system disturbances occur in each of the feeder as represented in Fig. 1.

Fig. 1
figure 1

Single-line diagram of radial distribution feeder

The system parameters are as follows: a generator with voltage rating of 30 kV coupled with substation transformer with the turns ratio of 30/13.8 kV, 12 MVA rating. Assuming that F1 is connected to the nonlinear load, F2 feeding the linear load in which HIF occurs and the balanced and unbalanced fault occurs in the third feeder with the normal condition in F4 and the model description is discussed in [17].

2.2 Proposed methodology to detect HIF

This section describes the steps for detection and identification of HIF in the MV distribution power system as portrayed in Fig. 2 and also discussed as follows.

Fig. 2
figure 2

Flow chart of the proposed method of fault classification

  • Step 1 Pre-processing: Design the MATLAB/Simulink model of MV network and apply the disturbances for acquisition of fault current data.

  • Step 2 Feature extraction: The fault current signal is sampled using db9 mother wavelet of DWT and extracting the features, namely SD values, of all decomposed signal to train the classifiers.

  • Step 3 Training phase: The data obtained (SD) from the DWT under various power system disturbances were used to train the computational intelligence-based classifiers.

  • Step 4 HIF identification: The trained classifiers are tested with different power system disturbances to discriminate the HIF from other disturbances such as three-phase fault, line-to-ground (LG), line-to-line (LL) and double line-to-ground (LLG) fault. This process continues for every cycle of operation of the system for reliable operation and complete protection of the system.

3 Data acquisition using DWT analysis

Wavelet transform (WT) is a significant tool for decomposing the transient signal into several components and represents in time–frequency domain rather than the time domain [31, 32]. The fundamental idea behind this is to analyse the signal by means of dilation and translation process. In general, WT exists in two forms: CWT and DWT, in which continuous signal f(t) has been defined as given in Eq. (1)

$$ {\text{CWT}} \left( {a,b} \right) = \left[ {\frac{1}{\sqrt a }\mathop \int \limits_{ - \infty }^{\infty } f\left( t \right)*h\left( {t - \frac{b}{a}} \right){\text{d}}t} \right] $$
(1)

where a and b are the scaling and translational parameters and h is the mother wavelet function. The mother wavelet is the prototype for generating wavelet function. CWT is an alternative approach to overcome the problem of resolution as in STFT. But, this method also has low redundancy during reconstruction of signal compared to the DWT method. Therefore, in this paper the DWT analysis was used to extract the features for training the classifiers.

The DWT is a powerful signal processing information tool which allows the signal to be sampled with localized transients. In recent decades, such powerful advanced tool has been used for operating the protective relays [27,28,29,30,31]. The time and frequency information can also be calculated using the following techniques such as fast Fourier transform (FFT), STFT and CWT, but the DWT has been used because of its fast computation speed and accuracy [4]. Thus, DWT is defined as Eq. (2)

$$ {\text{DWT}}\, \left( {m,k} \right) = \left[ {\frac{1}{{\sqrt {a_{0}^{m} } }}\mathop \sum \limits_{n} f\left( n \right)*h\left( {\frac{{k - na_{0}^{m} }}{{a_{0}^{m} }}} \right)} \right] $$
(2)

where the parameter a m0 and na m0 are scaling and translation constant, respectively, n and m are integer variables and h is the wavelet function. Figure 3 shows the timescale representation of digital signal achieved by digital filtering techniques, and this representation is called subband coding algorithm. The various decomposition levels of the signal X(n) are shown below [14].

Fig. 3
figure 3

DWT decomposition of signal

Steps for decomposing the signal are as follows:

  • Step 1 Decompose the original X(n) into levels, i.e. denoising process.

  • Step 2 Choose the certain levels for reconstruction of desired signal.

  • Step 3 Reconstruct the signal with selected levels.

  • Step 4 Specify the following parameters: sampling frequency, window length, levels of decomposition and mother wavelet.

The original signal is divided into low- g(n) and high-frequency h(n) components by DWT, which are called approximation and detailed coefficients. The decomposed signal is further iterated with successive approximation, so that the signal is broken down into many lower-resolution components. This is called multi-resolution analysis (MRA).

In this work, Daubechies 9 (db9) is selected as mother wavelet for the detection of fault, because it is a better frequency extractor compared to the Haar wavelet. It also satisfies the Parsevall’s theorem due to orthogonality and reduces the redundancy in the signal compared to Coiflet and Meyer wavelets [40, 41].

The optimal decomposition of L levels is given by the condition as represented in Eq. (3)

$$ N = 2^{L} $$
(3)

where N is the length of decomposition level and L is the level of decomposition. During the signal decomposition, at each level the signal has been divided into different frequency bands as defined in Eq. (4)

$$ B = \frac{F}{{2^{L + 1} }} $$
(4)

where B is the bandwidth of each level in Hz and F is the sampling frequency in Hz.

The sampling frequency of 20 kHz is considered for decomposing the signal into different levels with 5000 points in length and 800 samples for each phase of current signal. The band frequencies captured for each level are varied and calculated using Eq. (4) and are as follows: 5–2.5 kHz, 2.5–1.25 kHz, 1.25–0.625 kHz, 0.625–0.3125 kHz and 0.3125–0.15625 kHz are represented as detailed coefficient of d1, d2, d3, d4 and d5, respectively, and the approximation is made from detailed coefficient d5 [14]. Hence, the wavelet analysis is performed for the proposed work using the mother wavelet of db9 with detailed coefficients of 5 levels, for different fault current signal recorded for each cycle.

3.1 Feature extraction

Standard deviation (SD) is calculated for feature extraction to identify the type of fault. It is observed that some useful information can be extracted from SD of approximation signal (a5) with detailed coefficient level of d1 to d5 for each phase. The SD values of current signal are calculated for each phase under different fault conditions. The SD values are obtained using Eq. (5) as shown below:

$$ {\text{SD}} = \sqrt {\frac{1}{n - 1}\left[ {\mathop \sum \limits_{i = 1}^{n} \left( {x_{i} - \bar{x}} \right)^{2} } \right]} $$
(5)

where \( \bar{x} = \frac{1}{n}\sum\nolimits_{i = 1}^{n} {x_{i} } \). \( x \) is the data vector, and n is the number of elements in that data vector.

4 Computational intelligence-based classifiers

This section describes the miscellaneous classifiers such as fuzzy, Bayes, MLP, ANFIS and SVM classifier approach to classify and discriminate the HIF from other faults that occur in the distribution system. Here, the classes for various faults are considered as C1—normal, C2—three-phase fault, C3—LG fault, C4—LLG, C5—LL and C6—HIF.

4.1 Fuzzy inference system (FIS)

FIS is a classical set theory employed in detection of HIF in the system and has the flexibility in handling the data during uncertainties. In few cases such as fault or other disturbances in the system, the boundaries of the membership function may overlap and greatly reduce the accuracy of the crisp classifiers. Therefore, the performance of FIS is not appreciable in the case of classifier problem involving noisy, imprecise or incomplete data [32,33,34]. In this work, the features extracted (SD values) from DWT analysis of each phase A, B and C are considered as SDA, SDB and SDC. Then, these values are assigned as input with 4 input triangular membership functions and fed into the Sugeno FIS engine to determine the status of the system as described in Fig. 4. The boundary values for the input membership functions are as follows: 17–24 for normal, 25–35 for ground, 8–16 for HIF and 27–47 for fault. The developed FIS model in MATLAB is portrayed in Fig. 5 with its rules for classifications represented in Fig. 6.

Fig. 4
figure 4

Fuzzy inference system

Fig. 5
figure 5

MATLAB model of Sugeno-based FIS

Fig. 6
figure 6

Rule viewer to classify the fault for FIS system in MATLAB

4.2 Multi-layer perceptron (MLP) network

Multi-layer perceptron network is the feed forward structure of ANN with fully connected nodes using weights. The calculation of output at every node of the network is same which is given by:

$$ {\text{Net}}_{i} = W_{{i_{o} }} + \mathop \sum \limits_{{j \in {\text{pred}}\left( i \right)}} \left( {W_{ij} a_{j} } \right) $$
(6)

where aj is the output of the previous layer neuron, Wij is the weight connecting ith and jth neuron and \( W_{{i_{o} }} \) represents the input bias of the neuron. The MLP is trained using the back-propagation algorithm which calculates the error and update the weight at each trial run [31, 35, 38].

4.3 Bayesian neural network

The Bayesian neural network (BNN) uses the principle approach to deal with the complex network problem and also determines the input variables which are relevant to the output to be classified. This is done by proper prediction of test cases through the probability produced by the network with several set of parameters that are averaged are called predicted probabilities P(C). This prior information is used for training the network about the system and then calculates the probability of occurrence of event or fault in the system by comparing the likelihood probability, P(L) with the P(C). Let C denote the classes C1 to C6 of a likelihood of data set L. To predict the class of likelihood L = {X1, X2Xn} by using the Bayes rule, the highest probability is,

$$ P\left( {C |X_{1} ,X_{2} , \ldots X_{n} } \right) = \frac{{P\left( C \right)P(X_{1} ,X_{2} , \ldots X_{n} |C)}}{{P\left( {X_{1} ,X_{2} , \ldots X_{n} } \right)}} $$
(7)

The simplified form of the above equation is represented as follows:

$$ P\left( {C |L} \right) = \frac{P\left( C \right)P(L|C)}{P\left( L \right)} $$
(8)

It is seen that from Eq. (7) the number of parameters in the likelihood term \( P(X_{1} ,X_{2} , \ldots X_{n} |C ), \) increases exponentially with the number of attributes leading to the computational burden of the classifier [36, 37].

4.4 Support vector machine (SVM)

SVM is a statistical theory-based popular intelligent soft computing classifier to classify the data without pre-processing the data. It is a nonlinear kernel-based classifier which maps the data from the one space region to other form, and this is done during training of data and also assumes each fault or disturbance as a class labelled in Sect. 4. During the testing phase, the classifier identifies the type of fault by predicting the class label of the test input data and detects the disturbance occurred in the system. The examples of input and decision space are shown in Figs. 7 and 8, respectively [24,25,26].

Fig. 7
figure 7

Support vector machine

Fig. 8
figure 8

Input space of nonlinear separable classifier

The hyperplane f(x) as shown in Fig. 7 which groups the input data after training of classifier is depicted as:

$$ \begin{aligned} f\left( x \right) & = {\text{sign}} \left( {\mathop \sum \limits_{i \in N} \alpha_{i}^{*} y_{i} \cdot x_{i} + b^{*} } \right) \\ & x \in \left\{ {\begin{array}{*{20}l} {{\text{class}}\;{\text{I}},} \hfill & {{\text{if}}\;f\left( x \right) = + 1} \hfill \\ {{\text{class}}\;{\text{II}},} \hfill & {{\text{if}}\;f\left( x \right) = - 1} \hfill \\ \end{array} } \right. \\ \end{aligned} $$
(9)

where x is the input data set for classification with N samples, \( \alpha_{i}^{*} \) is the optimal value of Lagrangian multiplier, \( y_{i} \in \left\{ { + \,1, - \,1} \right\} \) which decides the class of input data sample x and \( b^{*} \) is the solution of weight vector as given in [24].

A nonlinear kernel-based SVM is used which consists of 3 SVMs, namely SDA, SDB and SDC, to detect the fault which occurs in three phases A, B and C, respectively. Figure 9 shows the decision space vector of SVM for SDA after training phase, and this is done using a kernel function for mapping the data in higher-dimensional space. Therefore, rewriting Eq. (9) using a kernel function results in [38, 39]

$$ f\left( x \right) = {\text{sign}}\left( {\mathop \sum \limits_{i \in N} \alpha_{i}^{*} y_{i} \cdot k\left( {x_{i} ,x_{N} } \right) + b^{*} } \right) $$
(10)

where \( k\left( {x_{i} ,x_{N} } \right) \) is the kernel function and is defined by radial basis function (RBF) because of reliability than other kernel function and is represented by [30].

Fig. 9
figure 9

Decision space of nonlinear separable classifier

$$ k\left( {x_{i} ,x_{N} } \right) = { \exp }\left( { - \frac{{\left\| {x - y} \right\|^{2} }}{{2\sigma^{2} }}} \right) $$
(11)

The SVM detects the class as 1 if the fault occurs in the respective phase during the testing phase, otherwise it gives − 1. The multiple logics for SVM under different type of fault of the system are represented in Table 1.

Table 1 Multiple SVM logic for fault classification

4.5 Adaptive neuro-fuzzy inference system

ANFIS is an intelligent adaptive data learning method in which the fuzzy inference system is optimized via the ANN training. It maps the input and output through the input and output member function. From the input–output data, ANFIS adjusts the membership function using least square method or back-propagation descent method for linear and nonlinear system. The Sugeno fuzzy model has been proposed for creating the fuzzy rules from a given input–output data set. A typical Sugeno fuzzy rule is expressed in the following form [19,20,21,22,23]:

  • Rule 1 If (x1 is A1) and (x2 is B1), then f1 = (u1x1 + v1x2 + r1)

  • Rule 2 If (x1 is A2) and (x2 is B2), then f2 = (u2x1 + v2x2 + r2)

where the vector X = {x1, x2,… xm} is the input data to be classified. Ai, Bi and Ci are the fuzzy sets with output fi specified within the rule-based fuzzy system. ui, vi and ri are design parameters which are obtained during the training process. The architecture of ANFIS system consists of 6 layers: They are the input layer, fuzzification layer, rule layer, normalization layer, defuzzification layer and output layer, respectively as given in Fig. 10.

Fig. 10
figure 10

ANFIS structure

Layer 1 is the input layer which passes the external crisp signals to Layer 2 called fuzzification layer which has the bell activation function and is defined by [19]:

$$ \mu_{{A_{i} }} \left( x \right) = \frac{1}{{1 + \left\{ {\left( {\frac{{x - c_{i} }}{{a_{i} }}} \right)^{2} } \right\}^{{b_{i} }} }} $$
(12)

where ai, bi and ci are the design parameters of bell-shaped membership function. Layer 3 is the rule layer; here each neuron corresponds to a single Sugeno-type fuzzy rule. The neuron gets inputs from the corresponding fuzzification neurons and calculates the firing strength of the rule. In ANFIS, the conjunction of the rule antecedents is evaluated by the operator product. Thus, the output of neuron ‘i’ in Layer 3 is obtained in terms of membership function (µ) using Eq. (12).

$$ y_{i}^{3} = w_{i} = \mu_{{A_{i} }} \left( {x_{i} } \right) \mu_{{B_{i} }} \left( {x_{j} } \right) $$
(13)

Layer 4 is the normalization layer. Individual neuron in this layer obtains inputs from all neurons in the Layer 3 and calculates the normalized firing strength of a given rule as the ratio of the firing strength of a specified rule to the sum of firing strengths of all rules. It signifies the influence of a given rule to the final result. Thus, the output of neuron ‘i’ in Layer 4 is determined as given by Eq. (14).

$$ y_{i}^{4} = \bar{w}_{i} = \frac{{w_{i} }}{{w_{1} + w_{2} }},\quad i \in 1,2 $$
(14)

Layer 5 is the defuzzification layer. Respective neuron in this layer is linked to the individual normalization neuron and also receives initial inputs x1 and x2. A defuzzification neuron calculates the weighted consequent value of a given rule as given in Eq. (15)

$$ y_{i}^{5} = \bar{w}_{i} f_{i} = \bar{w}_{i} \left( {u_{i} x_{1} + v_{i} x_{2} + r_{i} } \right),\quad i \in 1,2 $$
(15)

where xi is the input and yi is the output of defuzzification neuron in Layer 5, ui, vi and ri are set of consequent parameters of rule ‘i’ and Layer 6 is represented by a single summation neuron. This neuron calculates the sum of outputs of all defuzzification neurons and produces the overall ANFIS output ‘y’ which is given by Eq. (16).

$$ y_{i}^{6} = \mathop \sum \limits_{i = 1}^{2} \bar{w}_{i} f_{i} = \frac{{\mathop \sum \nolimits_{i = 1}^{2} w_{i} f_{i} }}{{w_{1} + w_{2} }} $$
(16)

The SD values obtained for different conditions of the system using DWT analysis are fed as input to the ANFIS model with its output representing the type of fault occurred in the system. To achieve this, the training for input data set is done using the MATLAB ANFIS by utilizing the mathematical properties of ANN in tuning the rule-based fuzzy system and that exploits the best qualities of these two approaches efficiently. Thus, the ANFIS system possesses the following advantages than fuzzy approach [38, 39]:

  • Revising the IF–THEN rules to describe the characteristics of complex system.

  • Knowledge of human expertise for training is not required as like that of FI approach.

  • Have better choice of membership function (MFs) to use.

  • Faster convergence speed than fuzzy approach.

  • Adaptive through mathematical learning gives accurate and efficient results.

The network is trained for the input–output data set with MATLAB ANFIS editor, which adjusts the MFs directly based on the data set. Here for each phase of input, SDA, SDB and SDC are assumed with 4 triangular membership functions and the output being constant for Sugeno model ANFIS. Fourteen rules were framed with 45 input–output data sets for training the ANFIS model. The trained constant output assumed for various cases of the system is as follows: 0—no fault, 0.2—HIF in phase C, 0.3—HIF in phase B, 0.4—HIF in phase C, 0.5—LLLG, 0.6—AG, 0.7—BG, 0.8—CG, 0.9—AB, 1—BC, 1.1—CA, 1.2—ABG, 1.3—BCG and 1.4—CAG. The output model of ANFIS structure developed in MATLAB/Simulink is shown in Fig. 11, which represents that 0.2 indicates the HIF in phase C of the system.

Fig. 11
figure 11

ANFIS results for distribution network

4.6 Performance indices for classifiers

4.6.1 Kappa statistic

It is the statistical measure of classifiers that calculate the consistency among the predicted and observed data sets and is defined as follows:

$$ K = \frac{{P\left( {\text{OF}} \right) - P\left( {\text{EF}} \right)}}{{\left( {1 - P\left( {\text{EF}} \right)} \right)}} $$
(17)

where P(OF) is the probability of observed fault in the system and P(EF) is the probability of expected type of fault by chance that occurs in the system. Its value ranges from 0 to 1. If the classifier is excellent, then the kappa statistics is 1; if it is between 0.4 and 0.75, then it is good; and if it is less than 0.4, then the performance is poor [21, 38, 39]. This measurement is important to calculate the overall accuracy of the classifier.

4.6.2 Mean absolute error (MAE) and root mean square error (RMSE)

MAE is the absolute average measure of error between the predicted and observed value of the classifier, and it is depicted as follows [21, 38, 39]:

$$ {\text{MAE}} = \frac{{\left| {\mathop \sum \nolimits_{i = 1}^{n} \left( {E_{\text{P}} - E_{\text{O}} } \right)} \right|}}{n} $$
(18)

RMSE is the square root of average measure of variance between the predicted and observed fault by the classifiers and is given by:

$$ {\text{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {E_{\text{P}} - E_{\text{O}} } \right)^{2} }}{n}} $$
(19)

where EP is the predicted value of classifier and EO is the observed value of the classifiers over ‘n’ data sample.

5 Results and discussion

This section presents the simulation of proposed approach for detection and identification of HIF in MV distribution network. To validate the performance of the proffered method, the simulation is carried out using MATLAB/Simulink for the distribution model portrayed in Fig. 1. The time-varying current signal for the normal condition of feeder for the time period of 0.25 s is captured and is represented in Fig. 12. Moreover, the various faults that occur in the power system were also simulated in addition with the study of HIF to prove the effectiveness of the proposed method. The three-phase current waveform observed for the period of 0.02–0.08 s in case of HIF fault and single line to ground (LG) is shown in Figs. 13 and 14, respectively. Figure 15 shows that the fault current magnitude in case of HIF fault in phase C of three-phase system is low as like the normal phase current signal. But the amplitude of current signal in case of LG fault in phase A of three-phase system is very high as shown in Fig. 14, and because of this, the detection of HIF in power system is a major challenge till date as compared to other conventional type of fault that occurs in the system. To overcome these difficulties in real time, DWT analysis was used to extract the features as it decomposes the signal both in time and in frequency domain.

Fig. 12
figure 12

Three-phase current waveform for normal case

Fig. 13
figure 13

Three-phase current waveform for HIF at phase C

Fig. 14
figure 14

Three-phase current waveform for LG fault at phase A

Fig. 15
figure 15

High-impedance fault current waveform

The DWT analysis of each phase current signal which is sampled for ever cycle with 800 numbers of samples is performed with five decomposition levels. Each level represents with different bands of frequencies as discussed in Sect. 3. The approximate coefficient obtained for the final decomposed level (d5) and the various detailed coefficient levels d1, d2, d3, d4 and d5 are used for calculating SD values as shown in Table 2. The DWT analysis of each phase A, B and C of the system under normal case is portrayed in Figs. 16, 17 and 18, respectively. In addition, the DWT analysis for the faulty phase during the occurrence of HIF in phase C in one of the feeder and the occurrence of LG fault in phase A on another feeder network is also represented in Figs. 19 and 20, respectively, for better understanding of the proposed method. Similarly, the DWT analysis for other faults such as LL, LLG and three-phase fault for varying fault resistance is done and the extracted SD features for training the classifiers to detect the HIF in the system are given in Table 2.

Table 2 Fault detection in case of varying fault resistance by different intelligence classifiers
Fig. 16
figure 16

DWT analysis of phase A under normal case

Fig. 17
figure 17

DWT analysis of phase B under normal case

Fig. 18
figure 18

DWT analysis of phase C under normal case

Fig. 19
figure 19

DWT waveform of HIF fault at phase C

Fig. 20
figure 20

DWT waveform of LG fault at phase A

The performance indices such as MAE, RMSE and kappa statistic of the proposed ANFIS and SVM method of classifiers are compared with the fuzzy, MLP and Bayes classifier approach. Figures 21 and 22 represent that the proposed classifiers performance is significantly improved compared to the conventional classifiers and the results are illustrated in Tables 2 and 3. It is observed that the kappa statistics is 1 for all the classifier approaches than fuzzy method due to human error introduced during the selection of ranges for membership functions. On the other hand, all the neural-based and SVM classifiers are learned mathematically and there is no possibility of human error. Thus, the performance of ANFIS and SVM method is substantially predominant than any other method as depicted in Fig. 22.

Fig. 21
figure 21

Accuracy and effectiveness of different intelligence classifiers

Fig. 22
figure 22

Comparison of MAE, RMSE and kappa statistic of various intelligence classifiers

Table 3 Performance comparison of various intelligence classifiers

In addition with evaluation indices, the other performance indices are also tested further to validate the effectiveness of the proposed classifiers for classification of fault and are defined as below:

$$ {\text{Success}}\;{\text{rate}} = \frac{{{\text{Number}}\;{\text{of}}\;{\text{HIF}}\;{\text{detected}}}}{{{\text{Total}}\;{\text{number}}\;{\text{of}}\;{\text{HIF}}\;{\text{events}}}}*100\% $$
(20)
$$ {\text{Discrimination}}\;{\text{rate}} = \left[ {1 - \frac{{{\text{Number}}\;{\text{of}}\;{\text{events}}\;{\text{in}}\;{\text{correctly}}\;{\text{diagnosed}}}}{{{\text{Total}}\;{\text{number}}\;{\text{of}}\;{\text{events}}}}} \right]* 100\% $$
(21)

Figure 21 shows the success and discrimination rate of all the classifier methods are 100% except the fuzzy-based approach which is 66.67% and 85%, respectively, and is presented in Table 3.

6 Comparison of literature work

The comparative performance of accuracy for various classifiers to identify different types of fault in the system by proffered method is made with all other existing methods and presented in Table 4. The result reveals that all the intelligence-based classifier requires the training of data obtained from the signal processing techniques and their accuracy lies in the range of 70–100% for detection of either conventional fault (LG, LL, LLG and three-phase fault) [42,43,44,45,46,47,48,49,50,51] or HIF [12, 13, 15, 34, 52]. Nevertheless, these works fail to identify conventional and HIF faults in the system, whereas the proposed work considers both faults. Moreover, to test the effectiveness of the proposed classifiers the performance evaluation indices such as kappa statistics, MAE and RMSE are evaluated. However, the literature work portrayed in Table 4 fails to evaluate these performance indices that demonstrate the robustness of the classifier in addition with the accuracy of classification. It can be concluded that from the results given in Tables 2 and 3 the proposed ANFIS and SVM classifier performance is substantially predominant for identifying various types of fault in the system.

Table 4 Performance comparison with literature work

7 Conclusion

In this paper, the detection and identification of HIF in MV distribution power system network is presented using computational intelligence-based classifiers such as fuzzy, MLP, Bayes, ANFIS and SVM. The distribution feeder network of 13.8 kV is simulated using MATLAB/Simulink, and various power system disturbances such as HIF, LG, LL, LLG and three-phase fault are studied with time-varying fault resistance for testing the effectiveness of the proposed classifiers. The DWT analysis of three-phase current signal using a mother wavelet of db9 with detailed coefficient of 5 levels is done to extract the SD features for various types of fault. The features obtained were used to train the computational intelligence-based classifiers to identify the HIF in the system. The results showed that the performance of ANFIS and SVM classifiers is superior in terms of success rate, discrimination rate, MAE, RMSE and kappa statistics. Furthermore, it is observed that the SVM-based intelligence classifier performance is remarkable than other classifiers. Nonetheless, a more depth analysis on identification of HIF in the distribution network comprising of renewable energy resources will be the future goal of research integrating the field of data mining and internet of things (IoT) for rapid recovery of system from fault conditions.