1 Introduction

Electrical power systems are more frequently operated close to their technical limits due to the increase of renewable energy systems and distributed generation. Therefore they become more prone to fault occurrences. Especially the medium voltage grid is affected since it experiences the most significant changes. Therefore fault detection identification and localization are of great concern for the distribution grid operators. Fault identification and localization mainly includes the following aspects: type, direction, and distance. It is generally independent from protective relaying. The purpose is not to protect assets through a short-term generation of a trip signal to de energize a faulted section as fast as possible. The objective is to supply the grid operator with information about the fault prior to an on-site investigation. Transmission lines are most prone to occurrence of fault. Fault detection, direction estimation and faulty phase selection play a critical role in the protection for a transmission line. Accurate and fast fault detection and classification under a variety of fault conditions are important requirements of any protective relaying scheme [1].

This paper aims to provide a contribution toward more effective identification and classification of earth faults in 10 kV isolated network. The investigated network modelled in accordance with realistic distribution network data in Bosnia and Herzegovina. The main characteristic of Bosnian 10 kV distribution network is lightly loaded, long overhead distribution lines which run through the harsh mountain terrain and supply electricity to remote communities. These feeders are often run from the same substation, which is used to supply electricity to urban areas and industrial customers via shorter, but more heavily loaded underground cables. Most of 10 kV distribution system in Bosnia and Herzegovina still operates with an isolated neutral point. This mode of operation has numerous advantages (as well as disadvantages) over the other methods of neutral point treatment, as shown in Table 1. The main advantage of an isolated system is the ability of uninterrupted operation under the fault conditions which is good for systems in the early stages of their development (such as the Bosnian system in 1960s and 1970s). However, as the system develops and as the new sections and feeders are added, the earth fault currents increase and overvoltage during fault becomes a serious issue, causing an ongoing debate on the benefits and challenges of the existing configuration.

Table 1. Overview of advantages and disadvantages of isolated system

2 Overview of Earth Fault Fundamentals

In the past 20 years, there has been a rapid development in various fields concerning the detection, classification and location of faults in power systems. The advances in signal processing techniques, artificial intelligence and machine learning, global positioning system (GPS) and communications have enabled more and more researchers to carry out studies with high breadth and depth in that the limits of traditional fault protection techniques can be stretched [2]. Earth fault is a type of power system fault which occurs when contact is established between conducting phase and the earth in the system with isolated neutral point of a transformer. Since in the isolated system, there is no direct connection with the earth, the only connection that can be established is due to the ground level capacities of transmission lines. When the earth fault is developed in an isolated network, currents flow through earth capacities of healthy phases and the earth fault current is dependent on the fault resistance and network capacitance [3]. Figure 1 represents an illustration of basic distribution network parameters during the earth fault. With reference to Fig. 2 and defining the V0 as the neutral point voltage, the general representation of the earth fault current can be derived as follows:

Fig. 1.
figure 1

Illustration of basic power distribution network parameters during the earth fault

Fig. 2.
figure 2

Phasor diagram during earth fault in an isolated system

$$\underline{{{\text{V}}_{1} }} = \sqrt 3 {\text{V}}_{\text{n}} {\text{e}}^{{{\text{j}}150^{{^\circ }} }}$$
(1)
$$\underline{{{\text{V}}_{2} }} = \sqrt 3 {\text{V}}_{\text{n}} {\text{e}}^{{{\text{j}}210^{{^\circ }} }}$$
(2)
$$\underline{{{\text{I}}_{1} }} = {\text{V}}_{1} {\text{j}}\upomega\,{\text{C}}_{0} = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}150^{{^\circ }} }} {\text{e}}^{{{\text{j}}90^{{^\circ }} }} = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}240^{{^\circ }} }}$$
(3)
$$\underline{{{\text{I}}_{2} }} = {\text{V}}_{2} {\text{j}}\upomega\,{\text{C}}_{0} = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}210^{{^\circ }} }} {\text{e}}^{{{\text{j}}90^{{^\circ }} }} = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}300^{{^\circ }} }}$$
(4)
$$\begin{aligned} \underline{{{\text{I}}_{\text{f}} }} & = \underline{{{\text{I}}_{1} }} + \underline{{{\text{I}}_{2} }} = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}240^{{^\circ }} }} + \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} {\text{e}}^{{{\text{j}}300^{{^\circ }} }} \\ & = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} ({\text{e}}^{{{\text{j}}240^{{^\circ }} }} + {\text{e}}^{{{\text{j}}300^{{^\circ }} }} ) \\ & = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} (\cos 240^{{^\circ }} + {\text{j}}\,\sin 240^{{^\circ }} + \cos 300^{{^\circ }} + {\text{j}}\,\sin 300^{{^\circ }} ) \\ & = \sqrt 3 {\text{V}}_{\text{n}}\upomega\,{\text{C}}_{0} (\frac{1}{2} - {\text{j}}\frac{\sqrt 3 }{2} - \frac{1}{2} - {\text{j}}\frac{\sqrt 3 }{2}) = - {\text{jV}}_{\text{n}} 3\,{\rm \omega} \,{\text{C}}_{0} \\ \end{aligned}$$
(5)

Earth fault modulus can be represented as:

$$\left| {{\text{I}}_{\text{f}} } \right| = {\text{V}}_{\text{n}} 3\upomega{\text{C}}_{\text{tot}}$$
(6)

3 Method and Data Collection

The input data (voltage waveforms) are generated in simulations performed on the model of a realistic electric power distribution system in Bosnia and Herzegovina. The single line diagram of power distribution network used in this study is shown in Fig. 3. Figures 4 and 5 show the relevant electrical data of the underground cable and overhead line conductors used to create the power distribution network model, respectively. This is a typical 35/10 kV zone substation with eight, 10 kV feeders. The first four are underground cables, while feeders 5–8 are modelled as overhead lines. Normal mode (steady state) operations and fault conditions, for each of the ten shunt fault types are simulated and voltage waveforms for three line voltages at 10 kV substation bus bar are reported. The types of faults are shown in Table 2. The value of an earth fault current in an isolated network is independent from location. As shown in Eq. 5. earth fault depends only on voltage and total line capacitance (which, obviously is related to line lengths). Notwithstanding this, simulations are performed for faults in different parts of the network in order to provide same diversity of input data. It was observed that some difference in voltage at the faulted line measure at 10 kV substations is present as a function of fault locations. For example, voltage in phase 1 drops to zero at the fault locations, and increases to some 10% of nominal value as we move away from the network.

Fig. 3.
figure 3

Single line diagram of power distribution system used in this study

Fig. 4.
figure 4

Relevant electrical data for underground cables used in the model

Fig. 5.
figure 5

Relevant electrical data for the overhead line conductor used in the model

Table 2. Number of generated samples, and binary and decimal binary code

3.1 Development of Fault Classification and Identification System

Figure 6 represents schematic representation of the proposed system for fault identification and classification. It is conveniently divided into four logical parts which might be considered as stages of the proposed system. The first stage is concerned with signal generation and the obtained txt files contain signal time series are stored in the database. The following stage is named signal processing and is used to store values of each line phase as a separate vector. The next step in the signal processing stage is the calculation of root mean square value for each vector. The obtained values are fuzzified and used as inputs into a fuzzy inference engine. The output of a fuzzy inference procedure is a fuzzy variable which needs to be defuzziffied in order to give a final, single valued quantity which represents the fault code. The final values of the system are represented by two vectors a and b, where a contains the information on fault code and b on the number of repetitions for each fault code [4]. The proposed system procedure is concluded by the output formatting and reporting stage which is used to graphically represent the output data. Following subsections provide some more details.

Fig. 6.
figure 6

Schematic representation of the system for fault identification and classification

3.1.1 Data Collection

The generated waveforms are sampled at a rate of 1000 Hz and saved in database as txt files. The total of 900 simulations was run, which produced 900 text files. Table 2 provides a summary of simulations performed for each type of fault and normal operation conditions.

3.1.2 Fuzzification

If define \(\upmu_{{{\tilde{\text{A}}}}} \left( {\text{x}} \right) \in \left[ {0,1} \right]\) as membership function and X as a Universe or universe of discourse, a fuzzy set \({\tilde{\text{A}}}\) in X is expressed as a set of ordered pairs [5]:

$${\tilde{\text{A}}} = \left\{ {\left. {\left( {{\text{x}},\upmu_{{{\tilde{\text{A}}}}} \left( {\text{x}} \right)} \right)\left| {{\text{x}} \in {\text{X}}} \right.} \right\}} \right.$$
(7)

Fuzzy logic models are particularly useful in applications which require the description of imprecise and complex processes. Such descriptions are performed with the use of fuzzy sets used in the process of fuzzy logic control [6]. In the proposed fuzzy system, input and output linguistic variables are described by an expert knowledge and represented by sets \({\tilde{\text{A}}}\), \({\tilde{\text{B}}}\), \({\tilde{\text{C}}}\) and \({\tilde{\text{O}}}\) which contain single terms Ai, Bj, Cl i Ok [6]:

$$\begin{aligned} {\tilde{\text{A}}} & = \left\{ {{\text{A}}_{1} ,{\text{A}}_{2} , \ldots ,{\text{A}}_{\text{i}} ,{\text{A}}_{{{\text{i}} + 1}} , \ldots ,{\text{A}}_{\text{n}} } \right\} \\ {\tilde{\text{B}}} & = \left\{ {{\text{B}}_{1} ,{\text{B}}_{2} , \ldots ,{\text{B}}_{\text{j}} ,{\text{B}}_{{{\text{j}} + 1}} , \ldots ,{\text{B}}_{\text{m}} } \right\} \\ {\tilde{\text{C}}} & = \left\{ {{\text{C}}_{1} ,{\text{C}}_{2} , \ldots ,{\text{C}}_{\text{l}} ,{\text{C}}_{{{\text{l}} + 1}} , \ldots ,{\text{C}}_{\text{z}} } \right\} \\ {\tilde{\text{O}}} & = \left\{ {{\text{O}}_{1} ,{\text{O}}_{2} , \ldots ,{\text{O}}_{\text{k}} ,{\text{O}}_{{{\text{k}} + 1}} , \ldots ,{\text{O}}_{\text{f}} } \right\} \\ \end{aligned}$$
(8)

Terms Ai, Bj, Cl and Ok are fuzzy sets defined as [6]:

$$\begin{aligned} \widetilde{{\text{A}}}_{{\text{i}}} & = \left\{ {\left. {\left( {{\text{x}},{\upmu }_{{\widetilde{{\text{A}}}_{{\text{i}}} }} ({\text{x}})} \right)} \right|{\text{x}}\, \in \,\widetilde{{\text{A}}}_{{\text{i}}} \, \subset \,{\text{U}}_{{\text{1}}} } \right\},\,{\text{i}} = 1, \ldots ,{\text{n}} \\ \widetilde{{\text{B}}}_{{\text{j}}} & = \left\{ {\left. {\left( {{\text{y}},{\upmu }_{{\widetilde{{\text{B}}}_{{\text{j}}} }} ({\text{y}})} \right)} \right|{\text{y}}\, \in \,\widetilde{{\text{B}}}_{{\text{j}}} \, \subset \,{\text{U}}_{{\text{2}}} } \right\},\,{\text{j}} = 1, \ldots ,{\text{m}} \\ \widetilde{{\text{C}}}_{{\text{1}}} & = \left\{ {\left. {\left( {{\text{z}},{\upmu }_{{\widetilde{{\text{C}}}_{{\text{j}}} }} ({\text{z}})} \right)} \right|{\text{z}}\, \in \,\widetilde{{\text{C}}}_{{\text{1}}} \, \subset \,{\text{U}}_{{\text{3}}} } \right\},\,{\text{1}} = 1, \ldots ,{\text{z}} \\ \widetilde{{\text{O}}}_{{\text{k}}} & = \left\{ {\left. {\left( {{\text{w}},{\upmu }_{{\widetilde{{\text{O}}}_{{\text{k}}} }} ({\text{z}})} \right)} \right|{\text{w}}\, \in \,\widetilde{{\text{O}}}_{{\text{k}}} \, \subset \,{\text{U}}_{{\text{4}}} } \right\},\,{\text{k}} = 1, \ldots ,{\text{f}} \\ \end{aligned}$$
(9)

In particular, in the proposed model input variables are values the root mean square of three line voltage, represented by Eqs. 1016. Graphical representation of input variables is presented in Fig. 7 while the output variable is presented in Fig. 8. Some of the values are sub normalized in order to improve the classification accuracy.

Fig. 7.
figure 7

Graphical representation of fuzzy to represent input variables

Fig. 8.
figure 8

Graphical representation of fuzzy set used to represent output variable

$${\tilde{\text{V}}}_{1} = \left\{ {{\text{A}}_{1} ,{\text{A}}_{2} ,{\text{A}}_{3} ,{\text{A}}_{4} } \right\}$$
(10)
$${\tilde{\text{V}}}_{1} = \left\{ {{\text{verylow}},{\text{low}},{\text{normal}},{\text{high}}} \right\}$$
(11)
$${\tilde{\text{V}}}_{2} = \left\{ {{\text{B}}_{1} ,{\text{B}}_{2} ,{\text{B}}_{3} ,{\text{B}}_{4} } \right\}$$
(12)
$${\tilde{\text{V}}}_{2} = \left\{ {{\text{verylow}},{\text{low}},{\text{normal}},{\text{high}}} \right\}$$
(13)
$${\tilde{\text{V}}}_{3} = \left\{ {{\text{C}}_{1} ,{\text{C}}_{2} ,{\text{C}}_{3} ,{\text{C}}_{4} } \right\}$$
(14)
$${\tilde{\text{V}}}_{3} = \left\{ {{\text{verylow}},{\text{low}},{\text{normal}},{\text{high}}} \right\}$$
(15)
$${\tilde{\text{F}}} = \left\{ {{\text{O}}_{1} ,{\text{O}}_{2} ,{\text{O}}_{3} ,{\text{O}}_{4} ,{\text{O}}_{5} ,{\text{O}}_{6} ,{\text{O}}_{7} ,{\text{O}}_{8} ,{\text{O}}_{9} ,{\text{O}}_{10} ,{\text{O}}_{11} } \right\}$$
(16)
$${\tilde{\text{F}}} = \left\{ {{\text{none}},{\text{ag}},{\text{bg}},{\text{cg}},{\text{abc}},{\text{abg}},{\text{acg}},{\text{bcg}},{\text{ab}},{\text{ac}},{\text{bc}}} \right\}$$
(17)

3.1.3 Inference

The inference model used in this paper is Mamdani type inference with three inputs (root mean square of three line voltage) and one output (fault code) variable [7]. Schematic representation of the proposed inference model is shown in Fig. 9.

Fig. 9.
figure 9

Schematic representation of fuzzy inference system with three inputs and one output variable

3.1.4 Rule Base

A total of eleven IF…AND…THEN rules are created on the bases of expert knowledge. They form a rule base and can be represented as shown in Table 3.

Table 3 Rule base

3.1.5 Defuzzification

The last component of fuzzy system is defuzzification stage, sometimes also called decoding. The purpose of this stage is to produce a non-fuzzy control output which adequately represents the membership function \(\mu_{agg} ({\text{z}})\)[6]. There are numerous defuzzification methods and in this paper, center of gravity method is chosen as defuzzification method [8]. It is important to note that one can refrain from computerized defuzzification, which means that one characterizes the faulty situation by a gradual representation rather than by a yes-no statement [9].

4 Results and Discussion

The output values of the proposed identification and classifications systems are two vectors a and b which contains data on fault type and occurrence (in the form of decimal fault code). Figure 10 show a graphical representation of these vectors. Results shown in Fig. 10 are the output result of the proposed system, which for each fault type gives occurrence rate. It can be seen, by comparing results in Fig. 10 with results presented in the second column of the Table 2 that the proposed system is capable to identify and classify the events with 100% accuracy.

Fig. 10.
figure 10

Output results of the proposed system—fault types and occurrence

5 Limitations and Future Work

This paper proposed a promising system for identification and classification of power distribution network faults. The study focused signal processing of voltage waveforms, used as input variables in the fuzzy system for identification and classification of power distribution network faults. Proposed system proved to be very efficient for identification and classification of low impedance faults. Further investigation is required to take into consideration fault resistance. The proposed system could be extended to include identification and classification algorithms that are capable to account for different network configurations (neutral point treatment). Particularly interesting would be investigation of hardware implementation possibilities of the proposed system.

6 Conclusion

This paper proposed a fuzzy logic based approach to classification and identification of isolated power distribution network. It was demonstrated that the proposed system is able to identify the phase(s) involved in all ten types of shunt faults that may occur in an electric power distribution system and to use this information to perform fault classification. The proposed method uses three line voltages, available at the substation as input variable and returns the fault code as the output variable. The input data (voltage waveforms) are generated in simulations performed on a model of realistic distribution system in Bosnia and Herzegovina.