Keywords

1 Introduction

One big problem, appearing on cap & pin insulators (Fig. 1) with surface pollution, is the flashover phenomenon. Briefly, the flashover phenomenon, has different occurrence process (depending on the material of the insulator), is presented at a particular operating voltage, called critical flashover voltage and can cause the partial or total evacuation of an insulator until the collapse of a power line [1].

Fig. 1.
figure 1

Basic design parameters of Cap & Pin Insulator

This paper tries to balance between accurate and comprehensible results, through the modelling effort of the flashover phenomenon using Artificial Neural Networks (ANNs), Genetic Programming (GP) and Inductive Decision Trees (IDT) with reference point, a standard data analysis approach (Multiple Linear Regression-MLR). Finally, using an evolutionary algorithm (Gravitational Search Algorithm-GSA) we try to optimize the proposed model given from genetic programming.

The database was common to all four techniques applied to the problem and consists of 168 cases (i.e. series of related measurements) represented by six (6) numeric variables, namely the insulator’s maximum diameter (Dm), height (H), creepage distance (L), insulator’s manufacturing constant (F), and the insulator’s pollution (C) and the critical flashover voltage (Uc). The critical flashover voltage of a polluted insulator is expressed in relation to these variables. A part of these application data (140 cases) are simulated data derived from a specialized model corresponding to incidents of flashover voltage on polluted Cap & Pin insulators, while the rest of the data (28 cases) consist of real experimental observations [1].

The main database was divided into two parts. The first part, which is called Training-Validation Set (130 cases of simulated data and 20 cases of actual data), was used for the development of the predictive models and the second part, which is called Test Set (10 cases of actual data and 8 cases of simulated data), was used to test the final predictive models.

2 Modelling Approaches

The MLR model (MLR) for the estimation of flashover voltage is described by Eq. 1 and its performance through different measures is given in Table 1.

Table 1. Aggregation and comparative scoreboard of MLR, ANN-2 GP-1, GP-2 and GSA-1
$$ {{\text{U}}_{\text{C}} = 1.0368 \cdot {\text{D}}_{\text{m}} + 0.468 \cdot {\text{H}} + 1.8342 \cdot {\text{L}} + 0.4292 \cdot {\text{F}} - 6.8969 \cdot {\text{C}} + 13.0605} $$
(1)

Its performance is clearly lower than all other intelligent approaches applied, as it can also be seen in Table 1.

The ANN model (ANN-2), presented in Fig. 2, is a Multilayer Perceptron (MLP) with 1 hidden layer (20 nodes) and activation function of hidden nodes was the logistic function. ANN-2 training was supervised, into batch mode and was performed using the back propagation algorithm (BP). Moreover, the validation was held together with the process of training using the 10-fold cross validation method based on the same dataset (Training-Validation Set). The performances are depicted in Table 1 [2, 3].

Fig. 2.
figure 2

The ANN-2 model

The GP models (donate as GP-1 and GP-2), emerged using the Training-validation Set and following the procedure of 10-fold cross validation. Specifically, for each fold, 30 independent runs were made within the same modulation. All runs trained their solution with k-1 subsets and all were compared in the rest testing data. So, ten (10) candidate models resulted, which were compared in the second set of 18 cases, and the best model in terms of correlation coefficient was chosen. The GP-1 and GP-2 are presented, respectively in Eqs. 2 and 7. Their performances are depicted in Table 1 [4, 5].

$$ {\text{U}}_{\text{C}} = {\text{A}} \cdot \left( {{\text{B}} \cdot {\text{K}} + {\text{J}}} \right) + 3.254 $$
(2)

where:

$$ \begin{aligned} {\text{A}} = & 2.159 \cdot \frac{{10^{ - 6} }}{{0.998 \cdot {\text{F}} + \log \left( {1.908 \cdot {\text{C}}} \right) + 7.182}} \cdot (\log \left( {0.531 \cdot {\text{L}}} \right) - 13.224) \cdot \\ & ( - 0.741 \cdot {\text{F}} - \log \left( {0.106 \cdot {\text{L}}} \right) + 11.943) \\ \end{aligned} $$
(3)
$$ \begin{aligned} {\text{B}} = & ( - 0.604 \cdot {\text{D}}_{\text{m}} \cdot {\text{F}} + 0.016 \cdot {\text{H}} \cdot \left( {2.267 \cdot {\text{L}} + 277.2} \right) \\ & - \,24.76 \cdot {\text{L}} \cdot ( - { \log }(0.106 \cdot {\text{L}}) + \frac{{\log \left( {1.022 \cdot {\text{C}}} \right)}}{{ - 0.741 \cdot {\text{F}} - \log \left( {0.352 \cdot {\text{F}}} \right) + 11.943}}) \\ \end{aligned} $$
(4)
$$ {\text{K}} = 0.604 \cdot {\text{D}}_{\text{m}} \cdot {\text{F}} - 21.253 \cdot \,\log \left( {1.908 \cdot {\text{C}}} \right) - 21.253 $$
(5)
$${ {\begin{aligned} {\text{J}} = & -151.67 \cdot {\text{L}} \cdot \left( {0.604 \cdot D_{m} \cdot {\text{F}} - 21.253 \cdot \,\log \left( {1.908 \cdot {\text{C}}} \right) + 15.311} \right) \\ & + \log \left( {1.908 \cdot {\text{C}}} \right) + \frac{{\log \left( {1.908 \cdot {\text{C}}} \right)}}{{0.741 \cdot {\text{F}} - 24.76 \cdot {\text{L}} - 4.761}} - \frac{{6.813 \cdot 10^{ - 5} }}{{1.145 \cdot D_{m} + \frac{{\log \left( {1.908 \cdot {\text{C}}} \right)}}{{0.083 \cdot {\text{C}} + 3020702}}}} \\ \end{aligned}} }$$
(6)
$$ {\text{U}}_{\text{c}} = \left( {\log \left( {0.054012 \cdot {\text{C}}} \right)} \right)^{2} \cdot \left( {1.298 \cdot {\text{L}} + 1.4616 \cdot {\text{D}}_{\text{m}} } \right) \cdot 0.0053419 + 1.9647 $$
(7)

The GSA model (GSA-1) arose during the optimization effort of the performances of GP models. The GSA-1 (Eq. 8) uses Harmony Search for initialization, is based on GP-2 and its performance are depicted in Table 1 [6, 7].

$$ {\text{U}}_{\text{c}} = {\text{E}}\, \cdot \,{\text{Z}} + {\text{G}} $$
(8)

where:

$$ {\text{E}} = { \log }^{2} (0.0544783402089791 \cdot {\text{C}}) $$
(9)
$$ {\text{Z}} = 0.0053419 \cdot \left( {1.29462312655721 \cdot {\text{L}} + 1.46957461951861 \cdot {\text{D}}_{\text{m}} } \right) $$
(10)
$$ {\text{G}} = 1.95794336098125 $$
(11)

Finally, the IDT model (IDT-1) presented in this work, emerged after of several tests, which were related to the number of the data cases, the classification of used data cases (according to the values of Uc and with the modify of key operating parameters, such as the training-validation process. Specifically, for comparison reasons with the other AI approaches used in this paper, a 10-fold cross validation and no pruning was selected for the entire data set (i.e. 150 cases for training and 18 cases for testing the produced model in new, unseen cases) according of the classification of 5 kV of Uc (i.e. six classes ranging within [5,10) kV, [10,15) kV, [15,20) kV, [20,25) kV, [25,30) kV, [30,35] kV) [8,9,10,11].

The resulting decision tree contains 32 internal nodes and is translated into 30 different rules (the detailed outcome is not given in the paper due to space limitations). Seven of the rules produced, are handy and accurate as they cover a considerable number of cases from the entire data set and no negative classifications and the probability of correct classification of new unseen cases in the future is higher than 95%, i.e. statistical tests can take place for the verification of the acquired rule-based knowledge. In total, 13 rules are interesting (“cover” means how many cases from the dataset verify the produced rule and the number in brackets following each rule corresponds to the probability of correct classification of new cases in the future using the specific rule):

3 Comparative Results and Conclusions

In this paper four different intelligent techniques were employed to estimate the value of critical flashover voltage for polluted insulators. Results obtained cannot be compared always to exactly the same datasets and experimental conditions, but can be comparable in a somewhat fair manner, in order to draw general conclusions.

The results show that methods like ANNs and GP, which have inner procedures, not fully understandable by humans, produce accurate models. ANNs prove the strongest approach in terms of accuracy. Furthermore, GP may be programmed in such a way that can aim to shorter equations without any discount on accuracy. Its comprehensibility sometimes can be considered higher than that of the ANNs’ structured, as standard mathematical formulas are some times more common to people. In addition, under proper encoding, grammar guided GP approaches can also produce fully comprehensible rule-based systems (IF-THEN rules or even Fuzzy Rule Based Systems). Models derived by GP-approaches can be optimized further by Nature Inspired Evolutionary Algorithms to give more accurate results, as it has been shown in this work. Inductive machine learning techniques presuppose the formation of reasonable decision classes when the target variable is of numerical nature (selecting a proper discretization). Its performance in terms of accuracy is lower, while there is need for larger collections of experimental data in order to be able to draw firm conclusions on the value of the technique. Nevertheless, experts find the acquired rules comprehensible and easy to use in the inspection process, while, as observed in the indicative results given in this paper, their comprehensibility is indeed high, in terms of measuring the number of resulting rules acquired and the conditions contained within these rules.

Further work includes experimentation to other competitive hybrid intelligent schemes based on nature inspired optimization approaches. The enrichment of the dataset is also a priority for the research team, as results seem encouraging.