Abstract
The transformer is one of the most important units in the power grid. Due to the potential failures and costs of the power system, it is necessary to pay attention to the fault diagnosis of power transformers. This paper proposes a fault diagnosis method based on Canonical Variate Analysis and Support Vector Machine (CVA-SVM). As a system identification method, CVA is widely used for fault detection because of its ability to identify multivariate state space models using experimental data. The support vector machine is a new machine learning method and is a powerful tool for solving problems with nonlinear and non-Gaussian distributed data. Dissolved gas analysis (DGA) has shown great potential for detecting faults in power transformers. For fault diagnosis based on DGA, a CVA model is first constructed for the process variables to generate a series of feature vectors, and then the fault types are classified using SVM. A real power transformer process is employed to verify the effectiveness of the proposed method.
This work was supported by the key technology project of the State Grid Corporation of China under Grant 522821200090.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
As an important piece of equipment in a power system, an interruption of power supply can be caused by any failure in the power transformer. Therefore, it is vital to detect transformer faults [1,2,3]. Dissolved gas analysis (DGA) has been widely recognized as an effective diagnostic technique for fault detection in power transformers. The analysis of specific dissolved gas concentrations in transformer insulating oil yields knowledge about the condition of the transformer and allows necessary preventive measures to be taken based on the results of the process [4,5,6]. However, due to the variability of gas data and the nature of the operation, fault detection by conventional methods is not always an easy task.
To develop more accurate diagnostic tools based on DGA data, scholars have developed a number of artificial intelligence methods [7, 8]. With the development of machine learning, the fault diagnosis of power transformers has also been enhanced. To cope with the uncertainty in fault diagnosis, Huang et al. proposed a fuzzy logic-based fault diagnosis method for power transformers, where the technique can diagnose multiple faults in a transformer and quantitatively indicate the severity of each fault [9]. To reduce the redundant information of the data, Kari et al. proposed to reduce the dimensionality of the data with principal component analysis and detect power transformer faults using fuzzy C-means method [10].
To develop more accurate diagnostic tools based on DGA data, scholars have developed many artificial intelligence methods [7, 8]. With the development of machine learning, the fault diagnosis of power transformers has also been enhanced. To cope with the uncertainty in fault diagnosis, Huang et al. proposed a fuzzy logic-based fault diagnosis method for power transformers, where the technique can diagnose multiple faults in a transformer and quantitatively indicate the severity of each fault [9]. To reduce the redundant information of the data, Kari et al. proposed to reduce the dimensionality of the data with principal component analysis and detect power transformer faults using the fuzzy C-means method [10].
However, the above methods do not take into account the dynamic nature of power transformer data, and for continuously operating systems, may fail to explore valuable dynamic information for the process and lead to some misleading monitoring results [11]. CVA is widely used in a dynamic process to generate a state-space model from data by maximizing the correlation between the constructed “past” and “future” matrices [12]. To the best of the authors’ knowledge, the CVA method has not been used in the data processing of power transformers for fault diagnosis.
Motivated by the above discussion, considering the characteristics of CVA and SVM, a new fault diagnosis method is proposed by combining CVA and SVM for the power transformer process. First, CVA extracts the dynamic features of the process data. Based on the extracted features. And then SVM is employed to classify the fault types to address the issues of no-Gaussian assumption and nonlinearity. For the parameter optimization problem in SVM, this paper uses the random grid search cross-validation method to improve the accuracy of the model.
This paper is organized as follows. Section 2 briefly reviews the CVA and SVM. Section 3 is devoted to describing the proposed CVA-SVM method. Section 4 presents the application of the proposed method in the real power transformer data. Finally, conclusions are given in Sect. 5.
2 Review of CVA and SVM
2.1 CVA
CVA is based on the so-called subspace identification, where process measurements are stacked to form the past and future spaces [13]. Denote \(\mathbf {x_k}\in \mathbb {R}^m\) (\(k=1,2,\dots ,N\)) as the normalized stacked vector at time instant k. For each k, the past data vector \(\textbf{x}_{p,k}\) and future data vector \(\textbf{x}_{f,k}\) are collected as
where l is the number of time lag. For a finite sequence with N samples, the past and future Hankel matrices \(\textbf{X}_p\) and \(\textbf{X}_f\) are constructed,
where \(M=N-2l+1\). The estimates of the sample covariance and cross-covariance of the past and future vector are expressed below,
In CVA, the projection matrices \(\textbf{J}\) and \(\textbf{L}\) can be computed through performing singular value decomposition (SVD) on the Hankel matrix \(\textbf{H}\),
Here, \(\textbf{U}\) and \(\textbf{V}\) are the left and right singular matrices of the matrix \(\textbf{H}\), respectively. \(\varLambda =diag[\sigma _1,\sigma _2,\dots ,\sigma _q]\) is the diagonal matrix containing all singular values, and q is the rank of \(\textbf{H}\).
From the result of SVD, the projection matrices \(\textbf{J}\) and \(\textbf{L}\) can be calculated. The first r columns of \(\textbf{V}\) can be considered to have the highest pairwise correlation with the first r columns of \(\textbf{U}\) [14]. It produces a pair of new matrices \(\textbf{U}_r\in \mathbb {R}^{ml\times r}\) and \(\textbf{V}_r\in \mathbb {R}^{ml\times r}\) with smaller dimensionality.
Finally, two matrices contain the state and residual vectors are derived below,
2.2 SVM
As illustrated in Fig. 1, a Support Vector Machine aims to find an optimal hyperplane by maximally separating the margins between the hyperplane and the data [15, 16].
Given a data set \(F=\left\{ x_i, y_i\right\} ^m_{i=1}\), where m is the sample number, \(x_i\in R^n\) stands for the input vectors, \(y_i\in {+1,-1}\) denotes two classes. The hyperplane \(f(x) = 0\) that separates the given data can be determined when the two classes are linearly differentiable.
where w denotes the weight vector and b denotes the bias term. The separation hyperplane should satisfy the following constraints,
For solving the linear indivisibility problem, the relaxation variable \(\zeta _i\) is introduced, and thus the constraint becomes as,
where C is the error penalty.
The above optimization problem is transformed into a pairwise quadratic optimization problem by introducing the Lagrange multiplier \(\alpha _i\) , i.e.
The linear decision function is therefore created by solving a pairwise optimization problem defined as,
SVM can be used for nonlinear classification. By using a nonlinear mapping function, the original data x is mapped to a high-dimensional feature space in which linear classification can be performed. Then the decision function is transformed into,
In this paper, the Gaussian Kernel is selected as kernel function,
where \(\phi \) is a nonlinear mapping that maps data points to the high-dimensional feature space. To obtain a tighter boundary, an appropriate width parameter h of the Gaussian kernel function is selected.
3 CVA-SVM Based Fault Diagnosis
In the proposed CVA-SVM method, the space of canonical variables can be divided into the state space and the residual space. The state space is then used as target objects for developing SVM hypersphere layers. Finally, The SVM faults classification is performed. The procedure of the CVA-SVM based fault detection method is depicted (Table 1).
As shown in Fig. 2, two phases are included, offline training and online diagnosis. Specifically, the procedure of the CVA-SVM based fault diagnosis is described in detail as follows,
Offline training:
-
Step 1. Standardize the collected faulty measurements.
-
Step 2. Construct the Hankel matrices \(\textbf{X}_f\) and \(\textbf{X}_p\) with the determined time-lag l.
-
Step 3. Obtain the projection matrices \(\textbf{J}\) and \(\textbf{L}\) according to Eq.(5)
-
Step 4. Determine the state and residual matrices \(\textbf{Z}\) and \(\textbf{E}\) using Eq.(6).
-
Step 5. Build SVM model for \(\textbf{Z}\) with the determined C and h.
-
Step 6. The SVM classifier is trained using the appropriate values of parameters.
Online diagnosis:
-
Step 1. Obtain and Standardize the test sample \(\textbf{x}^t_{k}\).
-
Step 2. Construct stacked vectors and calculate the state and residual vectors from \(\textbf{J}\) and \(\textbf{L}\),
$$\begin{aligned} \begin{aligned} \begin{array}{l} \textbf{z}_k=\textbf{J} \textbf{x}^t_{p,k} \\ \textbf{e}_k=\textbf{L} \textbf{x}^t_{p,k} \end{array} \end{aligned} \end{aligned}$$(14) -
Step 3.Input the state space \(\textbf{z}_k\) into the SVM classifier.
-
Step 4. Obtain the diagnostic results.
4 Case Study
To verify the effectiveness of the CVA-SVM method proposed in this paper, 188 power transformer oil dissolved gas content faulty data were collected for the experiment. This data has 6 types of fault states and 5 components of dissolved gas content in oil, some of which are shown in Table 2. For computational convenience, we coded and labeled the fault types of the dataset, as shown in Table 3. After that, the data were divided into a training set and a test set, and the data in the test set and the training set each accounted for \(50\%\) of the original data.
By subjecting the gas data to the CVA algorithm analysis, the five data variables of transformer faults can be dimensionally reduced to four data variables. This will facilitate the linear partitioning of the data by the SVM classifier and can also improve the computational speed of the fault diagnosis system. In addition, to compare the superior performance of the proposed methods, we compare the traditional SVM method, PCA and SVM combined algorithms. The classification result plots of the three methods are shown in Fig. 3.
The final classification accuracy of each model is summarized in Table 4. From Fig. 3, it is obtained that the SVM algorithm is less effective in identifying normal samples and has a lower detection rate for the medium to low temperature overheating and the high temperature overheating faults. The SVM method also has some false detection. The detection effectiveness of the PCA-SVM method is improved. Table 3 quantifies the detection effect of each model, and from the Table, we get that CVA-SVM has the highest accuracy for both the test set and the training set, and achieves the best classification effect.
5 Conclusion
This paper proposes a design of a power transformer fault diagnosis system based on the optimized SVM kernel function model algorithm and optimizes the SVM model by collecting the data of five gases generated by oil fission when a transformer fault occurs. Compared with the traditional SVM and PCA-SVM methods, it can be seen that the CVA-SVM method can significantly improve the accuracy of transformer fault diagnosis. CVA can extract dynamic information from the data, so the optimized SVM model algorithm is more suitable for transformer fault diagnosis systems with high practicality. Further investigation is strongly recommended to extend power transformer fault identification methods.
References
Fei, S.-W., Zhang, X.-B.: Fault diagnosis of power transformer based on support vector machine with genetic algorithm. Expert Syst. Appl. 36(8), 11352–11357 (2009)
Wang, M.-H.: A novel extension method for transformer fault diagnosis. IEEE Trans. Power Delivery 18(1), 164–169 (2003)
Kari, T., et al.: Hybrid feature selection approach for power transformer fault diagnosis based on support vector machine and genetic algorithm. IET Gen. Transm. Distrib. 12(21), 5672–5680 (2018)
Bacha, K., Souahlia, S., Gossa, M.: Power transformer fault diagnosis based on dissolved gas analysis by support vector machine. Electric Power Syst. Res. 83(1), 73–79 (2012)
Fang, J., Zheng, H., Liu, J., Zhao, J., Zhang, Y., Wang, K.: A transformer fault diagnosis model using an optimal hybrid dissolved gas analysis features subset with improved social group optimization-support vector machine classifier. Energies 11(8), 1922 (2018)
Zhang, Y., et al.: A fault diagnosis model of power transformers based on dissolved gas analysis features selection and improved Krill Herd Algorithm optimized support vector machine. IEEE Access 7, 102803–102811 (2019)
Sun, H.-C., Huang, Y.-C., Huang, C.-M.: Fault diagnosis of power transformers using computational intelligence: a review. Energy Procedia 14, 1226–1231 (2012)
Bakar, N.A., Abu-Siada, A., Islam, S.: A review of dissolved gas analysis measurement and interpretation techniques. IEEE Electr. Insul. Mag. 30(3), 39–49 (2014)
Huang, Y.-C., Sun, H.-C.: Dissolved gas analysis of mineral oil for power transformer fault diagnosis using fuzzy logic. IEEE Trans. Dielectr. Electr. Insul. 20(3), 974–981 (2013)
Kari, T., Gao, W.: Power transformer fault diagnosis using FCM and improved PCA. J. Eng. 2017(14), 2605–2608 (2017)
Zheng, J., Zhao, C.: Enhanced canonical variate analysis with slow feature for dynamic process status analytics. J. Process Control 95, 10–31 (2020)
Jiang, B., Braatz, R.D.: Fault detection of process correlation structure using canonical variate analysis-based correlation features. J. Process Control 58, 131–138 (2017)
Samuel, R.T., Cao, Y.: Kernel canonical variate analysis for nonlinear dynamic process monitoring. IFAC-PapersOnLine 48(8), 605–610 (2015)
Odiowei, P.-E.P., Cao, Y.: Nonlinear dynamic process monitoring using canonical variate analysis and Kernel density estimations. IEEE Trans. Industr. Inform. 6(1), 36–45 (2009)
Widodo, A., Yang, B.-S.: Support vector machine in machine condition monitoring and fault diagnosis. Mech. Syst. Signal Process. 21(6), 2560–2574 (2007)
Zhang, X., Chen, W., Wang, B., Chen, X.: Intelligent fault diagnosis of rotating machinery using support vector machine with ant colony algorithm for synchronous feature selection and parameter optimization. Neurocomputing 167, 260–279 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Luo, L. et al. (2023). A Fault Diagnosis Method for Power Transformer Using Canonical Variate Analysis and Support Vector Machine. In: Liang, Q., Wang, W., Liu, X., Na, Z., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2022. Lecture Notes in Electrical Engineering, vol 874. Springer, Singapore. https://doi.org/10.1007/978-981-99-2362-5_18
Download citation
DOI: https://doi.org/10.1007/978-981-99-2362-5_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2361-8
Online ISBN: 978-981-99-2362-5
eBook Packages: EngineeringEngineering (R0)