Abstract
Partial discharge is one of the major causes that accelerates the deterioration of insulation in high-voltage electrical equipment, leading to insulation breakdown and causing significant damage to power systems such as power outages and fires. Partial discharge can occur both inside electrical equipment and on its surfaces, with various types. This paper proposes an artificial intelligence model capable of classifying patterns of various partial discharges. To analyze the designed model, pattern classification training data for each type of partial discharge, generated through UHF sensors, were collected. These data were transformed into 2D data using the Phase Resolved Partial Discharge. The proposed models were individually designed based on deep learning algorithms, namely VGG and ResNet. Additionally, Grad-CAM was used to visualize the learning areas of the pattern classification algorithms. Experimental result shows that each model can effectively improve the accuracy of partial discharge pattern classification. For the VGG model, the classification accuracies for DI, FE, and PE patterns were 99%, respectively. Regarding ResNet, the classification accuracy for the Noise pattern was 93%. Especially, Because Grad CAM provides class-discriminative and high-resolution visualization, it can effectively prove the weight of training data.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
The increasing incidents of fires and power outages due to the aging of power equipment highlight the importance of an efficient management system for electrical equipment. Partial discharge (PD), which often occurs in high-voltage power equipment such as switchboards, transformers, and switchgear, is the main cause of shortening the lifespan of insulators and causing dielectric breakdown [1, 2]. Feature extraction such as statistical, fractal and moment parameters based on phase resolved partial discharge (PRPD) are widely used [3, 4]. However, the randomness of partial discharges complicates the characterization of discharges into a single type with a typical graph. Partial discharges can manifest in various forms depending on factors such as the material properties, geometry, and operating conditions of the insulation system such as Protrusion Electrode (PE), Defective Insulator (DI), Floating Electrode (FE) and Noise (NS) patterns [5]. Consequently, a single type of discharge may not exhibit a consistent pattern or signature, making it challenging to apply traditional feature extraction methods designed for typical discharges. Moreover, traditional feature extraction methods are often tailored to detect and analyze specific patterns or characteristics commonly associated with typical discharges. These methods may not be sufficiently robust or adaptable to effectively diagnose the diverse range of atypical discharges encountered in real time scenarios [6].
Among the various Artificial Intelligence (AI) model, The significant advancement in Convolutional Neural Network (CNN) performance has propelled their widespread adoption in the field of image recognition and processing. It is credited ability to extract meaningful features from complex image data and effectively learn the local characteristics of images, thereby facilitating precise classification. Additionally, the classification of partial discharge patterns holds great importance, as it enables the identification of specific defects or the prevention of accidents in advance. Consequently, recent studies are increasingly centered on the application of AI model based on CNN for accurate recognition and classification of partial discharge patterns [7]. In this paper, we proposed two models which are Visual Geometry Group (VGG) and Residual neural network (ResNet) based on the CNN for PD classification in PRPD patterns and utilized the Gradient Weighted Class Activation Mapping (Grad-CAM) model [5, 8] among VGG and ResNet to propose a method for humans to understand the reasons for the results more effectively.
2 Background Theory
This paper proposes two models which are Visual Geometry Group (VGG) and Residual neural network (ResNet) based on the CNN for PD classification in PRPD patterns. Basically, CNN architecture described consists of layers for feature extraction (convolutional and pooling layers), feature processing and dimensionality reduction (fully connected layer), and classification (softmax layer). This architecture has proven effective in various computer vision tasks, including image classification, object detection, and segmentation. The description of each layer is as follows:
-
(1)
Convolution layer
-
Convolutional layers apply filters to input image data to extract features. These filters detect patterns such as edges, textures, or shapes within the image.
-
The output of a convolutional layer consists of feature maps, which represent the presence of specific features across the input image.
-
-
(2)
Pooling layer
-
Pooling layers sample the feature maps produced by convolutional layers, reducing their spatial dimensions.
-
Common pooling operations include max pooling and average pooling, which retain the most salient features while discarding redundant information.
-
-
(3)
Fully Connected layer
-
It re-extracts features and performs dimensionality reduction by transforming the high-dimensional feature representation into a lower-dimensional space.
-
The fully connected layer, also known as the dense layer, operates on the flattened output of the preceding layers.
-
-
(4)
Softmax Layer
-
The softmax layer is typically the final layer in a CNN and is used for classification tasks.
-
It applies the softmax function to the output of the preceding layer, producing a probability distribution over multiple classes.
-
Each output neuron corresponds to a class, and the softmax function ensures that the probabilities sum to one, making it suitable for multi-class classification (Fig. 1).
-
2.1 Overview of Proposed Models
VGG is a CNN, comprising multiple convolutional layers stacked on top of each other. The depth of the network, with 19 layers in total, allows it to learn complex hierarchical representations of input images. The model consists of 16 convolutional layers, where each layer is followed by a rectified linear unit (ReLU) activation function, contributing to the non-linearity of the model. Where the x is input data, activation function (ReLU) can be expressed as (1):
VGG utilizes max-pooling layers after certain convolutional blocks to downsample the spatial dimensions of the feature maps while preserving important features. Following the convolutional layers, there are 3 fully connected layers that perform high-level feature extraction and classification. The final fully connected layer typically outputs the class probabilities using a softmax activation function. VGG uses small 3 × 3 convolution filters with a stride of 1 and zero-padding to maintain the spatial resolution of the feature maps. Also, ResNet applied a concept called residual blocks to solve the problem of disappearing gradients by using skip connections that link layers to subsequent ones through an addition operation. This forms a residual block, and the ResNets model is created by stacking these residual blocks together, utilizing ReLU activation functions and 2D convolutions. Figure 2 Shows overall flowchart for deep learning.
2.2 Grad-CAM
Figure 3 shows that the Architecture of Proposed models. Grad-CAM is a technique that bypasses the necessity of Global Average Pooling (GAP) and instead generates a heatmap by weighting each feature map with the gradient. The effectiveness of Grad-CAM can be substantiated by comparing the formulas and heatmap generation process of both traditional CAM and Grad-CAM.
In contrast, in traditional CAM calculation, instead of the flattening process typically following the last convolution layer, a GAP (Global Average Pooling) layer is employed. This entails computing the average value of each feature map.
\(f_{k} (i,j)\) from the final convolutional layer, yielding a single numerical output. The association between the last convolution layer and the class is depicted by weight (ω) which are then multiplied by \(f_{k} (i,j)\) to generate k heatmaps. These heatmaps are subsequently summed to yield the final image of the CAM. The formula can be expressed as (2):
where the Z is sum of the feature map, \(A_{ij}^{k}\) is kth feature map and \(y^{c}\) is the score for class c, \(a_{k}^{c}\) can be expressed as below (3):
By examination of the formula, it was noted that a ReLU function was integrated, and the weights were substituted with gradient \(a_{k}\) This illustrates that Grad-CAM, devoid of a GAP layer, holds applicability across a spectrum of CNN architectures. Moreover, Grad-CAM can extend its application beyond solely the final convolution layer to intermediate layers, facilitating the scrutiny of the model's information processing at different stages.
3 Data Preparation
3.1 Data Collection
To collect data for learning proposed models, Ultra-High Frequency (UHF) sensor used for measuring PD activity in gas-insulated substations (GIS). Unlike traditional ultrasonic sensors or current sensors typically employed for PD measurement [9], the UHF sensor offers greater resistance to ambient noise, potentially enhancing the accuracy and reliability of PD detection in challenging environments. The process involves receiving pulses generated from the GIS through a Radio Frequency (RF) receiver connected to the UHF sensor. These pulses are then measured at a sampling rate of 128BIN within a frequency band ranging from 300 to 1500 MHz. This frequency band is chosen to capture the UHF signals associated with partial discharge activity within the GIS.
3.2 Data Preprocessing
In this paper, four partial discharge defect patterns were considered, including PE, DI, FE and NS patterns. The data was preprocessed into 2D data using the PRPD. As shown in Table 1, it was confirmed that it occurred in a unique pattern according to the PD defect mode. And in the case of NS, it shows a pattern broadly distributed at the bottom of the y-axis. An experiment was conducted by configuring data sets for each pattern, including 70% learning data sets and 30% test data sets based on preprocessed data.
4 Result and Discussion
4.1 Results of Classification for PRPD Patterns
Table 2 shows that traditional CNN models used for class classification yielded insufficient results for accurate analysis. However, by applying Grad-CAM with the original CNN model, a more comprehensive analysis of class classification results became possible using activation images.
As a result of analyzing the data for each PRPD pattern by each proposed model, Accuracy of VGG was 96.5% on average, and it showed high classification accuracy for most of the three patterns except the NS pattern. On the other hand, in the case of ResNet, the average was 93.13%, and the classification rate for NS was 93%, showing higher accuracy than VGG (Table 3).
4.2 Epoch Plot
Accuracy and loss compared to learning rate were analyzed using the python-based matplotlib library. These metrics are typically plotted on the y-axis, while the number of epochs is plotted on the x-axis. The plot allows practitioners to visualize how the model's performance changes over time and whether it's converging or diverging. It helps in determining the optimal number of epochs for training and diagnosing issues such as overfitting or under fitting. Where the \(y_{i}\) is predicted value, \(t_{i}\) is data label and k is number of data, loss function can be expressed as (4):
Figure 4a, b shows the epoch plot outcomes for the VGG and ResNet models, respectively. Accuracy, defined as the convergence towards 1 by minimizing the error rate through a function, represents the model's capability to correctly classify instances. Conversely, loss rate, aiming to converge towards 0, signifies the reduction in errors. Hence, upon inspecting the two plot results, it is evident that the pattern classification learning accuracy of each model is improved.
4.3 Discussion
To compare the performance of two different deep learning models, VGG and ResNet, in classifying patterns associated with partial discharge. The summaries are as follows:
-
(1)
The average accuracy of VGG was 96.5%.
-
(2)
VGG demonstrated high classification accuracy for most of the three patterns (Protrusion Electrode, Defective Insulator, and Floating Electrode), indicating that it effectively identified these patterns.
-
(3)
The average accuracy of ResNet was 93.13%, showing a higher classification rate compared to the NS despite the lower average accuracy compared to VGG.
PRPD patterns are primarily characterized by signal features that occur in specific frequency bands in the frequency domain. However, typical convolutional neural networks such as VGG and ResNet focus on extracting features from images. This can make it difficult to adequately extract features in the frequency domain. VGG and ResNet use a fixed 3 × 3 convolutional filter size, which can be insufficient to capture the different features and scales of partial discharge patterns. Since partial discharge patterns vary in size and have features that are strongly influenced by the surrounding environment, a more extended convolution filter size or filters of different scales may be needed. Therefore, more noise data should be acquired for further validation. The presence of some vertically aligned points in the noise data resulted in a misclassification as surface discharges. It is believed that noise was introduced during the measurement. To build an enhanced CNN model, we need to increase the number of data samples and train on partially noisy cases.
5 Conclusion
In this paper, a UHF sensor was installed in GIS to classify four types of partial discharge. The existing initial model showed low accuracy, but higher accuracy was shown through VGG and ResNet based on the CNN deep learning model. Grad-CAM classified PRPD patterns in the proposed learning model and used them to verify the results, which was effective in deriving inconsistencies and directions for improvement of the learning model. This study shows high accuracy in classifying the partial discharge characteristics of electric facilities, but it can be used to diagnose facilities using instantaneous values, but it is insufficient to predict the prognosis of facilities, and it is necessary to derive improvements through continuous research in the future.
References
Maheswari RV, Perumal S, Vigneshwaran B, Mariasiluvairaj WI (2014) Partial discharge signal denoising using adaptive translation invariant wavelet transform-online measurement. J Electr Eng Technol 9(2):695–706. https://doi.org/10.5370/JEET.2014.9.2.695
Hosseini SMH, Baravati PR (2015) Partial discharge localization based on detailed models of transformer and wavelet transform techniques. J Electr Eng Technol 10(3):1093–1101. https://doi.org/10.5370/JEET.2015.10.3.1093
Arvind Shriram RK, Chandrasekar S, Karthik B (2018) PD Signal time-frequency map and PRPD pattern analysis of nano SiO₂ modified palm oil for transformer insulation applications. J Electr Eng Technol 13(2):902–910. https://doi.org/10.5370/JEET.2018.13.2.902
Li J, Han X, Liu Z, Yao X (2016) A novel GIS partial discharge detection sensor with integrated optical and UHF methods. IEEE Trans Power Deliv 33(4):2047–2049. https://doi.org/10.1109/TPWRD.2016.2635382
Hudon C, Belec M (2005) Partial discharge signal interpretation for generator diagnostics. IEEE Trans Dielectr Electr Insul 12(2):297–319. https://doi.org/10.1109/TDEI.2005.1430399
Majidi M, Fadali MS, Etezadi-Amoli M, Oskuoee M (2015) Partial discharge pattern recognition via sparse representation and ANN. IEEE Trans Dielectr Electr Insul 22(2):1061–1070. https://doi.org/10.1109/TDEI.2005.1430399
Janani H, Kordi B (2018) Towards automated statistical partial discharge source classification using pattern recognition techniques. IET High Voltage 3(3):162–169. https://doi.org/10.1049/hve.2018.5048
Vigneshwaran B, Maheswari RV, Kalaivani L, Vimal S, Seungmin R et al (2021) Recognition of pollution layer location in 11 kV polymer insulators used in smart power grid using dual-input VGG convolutional neural network. Energy Rep. https://doi.org/10.1016/j.egyr.2020.12.044
Do TD, Tuyet-Doan VN, Cho YS, Sun JH, Kim YH (2020) Convolutional-neural-network-based partial discharge diagnosis for power transformer using UHF sensor. IEEE Access. https://doi.org/10.1109/ACCESS.2020.3038386
Acknowledgements
This paper was supported by the ‘Development of Electrical Facilities Uninterruptible Diagnostic Technology / Safety Standards and Real-Time Risk Prediction System’ (No. 20215910100080) of the Korea Institute of Energy Technology Evaluation and Planning(KETEP) grant funded by the Korea government Ministry of Trade, Industry and Energy.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jung, H., Kim, YT., Lee, SK. et al. Study on Deep-Learning Model for Phase Resolved Partial Discharge Pattern Classification Based on Convolutional Neural Network Algorithm. J. Electr. Eng. Technol. (2024). https://doi.org/10.1007/s42835-024-01967-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42835-024-01967-9