Keywords

1 Introduction

In recent years, the high-speed trains have gradually become one of the convenient ways for people to travel [1]. Traction control systems is of great significance to ensure the efficient operation of high-speed trains. Once any fault occurs, it may cause great loss of personnel and property [2]. Therefore, effective fault detection of tractions systems is a hotspot in recent years.

Recently, because of the better applicability and high accuracy, much attention has been paid to data-driven fault detection methods [3]. At present, data-driven fault detection methods for industrial systems can be divided into three categories: multivariate statistical analysis-based methods, deep learning-based methods, and subspace identification-based methods. Commonly adopted multivariate statistical analysis-based fault detection methods such as principal component analysis (PCA) [4], slow feature analysis (SFA) [5], and canonical variate analysis (CVA) [6], depend on different variables to design detection methods directly. In [7], an improved dynamic kernel PCA is proposed based on local preserving projections for dynamic nonlinear systems. In addition, A weighted probabilistic SFA and improved CVA are constructed for process monitoring of dynamic systems in [8] and [9]. Although this type of method has been successfully extended to dynamic systems, it is difficult to obtain robust detection results due to the complexity of noise and disturbance. Benefiting from the rapid development of neural networks, deep learning-based methods occupy the mainstream of current research. Its low complexity of model designing brings great convenience for researchers and engineers. For example, Zhang et al. [10] proposed a full feedback dynamic neural network with exogenous input. The noise of collected data is reduced to obtain accurate results. Besides optimizing the network model, improving the training method can also enhance the dynamic performance of the algorithm [11]. However, it is also difficult to deal with the disturbance and noise. In the multi-sensor monitoring environment, enhancing the interpretability of deep learning methods is still a big problem to deal with. The subspace identification-based strategy not only realizes the analysis of dynamic characteristics, but detects faults through residual generator by data-driven observer. Motivated by this, this paper proposed a joint data-driven observer for the fault detection task of traction systems. The main contributions and innovations of this study include:

  1. (1)

    A joint observer based on the copula function is constructed for traction systems.

  2. (2)

    The fault detection strategy based on data-driven designs is introduced.

  3. (3)

    Different experiments including motor platform and traction systems are used for verification.

The paper is organized as follows. In Sect. 2, the proposed data-driven fault detection scheme is introduced. Section 3 shows the experiment results by different aspects. In Sect. 4, the discussions about the open problems and perspectives are given. Section 5 is the conclusions and future works of this study.

2 The Proposed Data-Driven Fault Detection Scheme

The purpose of this paper is to design a joint data-driven fault detection method for traction systems in high-speed train. The framework of the proposed method is shown in Fig. 1.

Fig. 1.
figure 1

The framework of proposed joint observer model

2.1 Systems Description

Consider the LTI model of systems as

$$ \begin{gathered} x(k + 1) = Ax(k) + Bu(k) + \omega (k) \hfill \\ y(k) = Cx(k) + Du(k) + \upsilon (k) \hfill \\ \end{gathered} $$
(1)

where \(x \in \mathcal{R}^{{k_{x} }}\) represents the state of systems; \(u \in \mathcal{R}^{{k_{u} }}\) and \(y \in \mathcal{R}^{{k_{y} }}\) are input and output of systems, respectively; \(A\), \(B\), \(C\), and \(D\) stand for the system matrices with known dimensions; \(\omega \in \mathcal{R}^{{k_{x} }}\) and \(y \in \mathcal{R}^{{k_{y} }}\) are white noises with the assumption that \(\omega \sim \left( {0,\sum_{\omega } } \right)\) and \(\upsilon \sim \left( {0,\sum_{\upsilon } } \right)\).

The full-order observer introduced in [12] is given below.

$$ \begin{gathered} \hat{x}(k + 1) = A\hat{x}(k) + Bu(k) + L(r(k)) \hfill \\ \hat{y}(k) = C\hat{x}(k) + Du(k) \hfill \\ \end{gathered} $$
(2)

where \(r(k)\) stands for the residual of system output.

In data-driven scheme, the residual generator can be obtained by using the kernel representation \(\mathcal{K}\) and left coprime factorization introduced in [13]. These definitions are given as

$$ \mathcal{K}\left[ \begin{gathered} u(z) \hfill \\ y(z) \hfill \\ \end{gathered} \right] = 0,\mathcal{K}:{ = }\left[ {\begin{array}{*{20}c} { - \hat{N}(z)} & {\hat{M}(z)} \\ \end{array} } \right] $$
(3)

According to the definitions above, the residual generator (4) can be used to fault detection.

$$ r(z) = \hat{M}(z)y(z) - \hat{N}(z)u(z) $$
(4)

2.2 Data-Driven Joint Observer Based on Correlation Measure

To design a joint model, the important lemma [14] is given as follows.

Lemma 1

(Sklars theorem): For a random vector \(X\) with cumulative distribution function \(F\) and univariate marginal cumulative distribution functions \(F_{1} , \cdots ,F_{d}\). There exists a copula \(C\) such that

$$ F(x_{1} , \cdots ,x_{d} ) = C(F_{1} (x_{1} ), \cdots ,F_{d} (x_{d} )) $$
(5)

If \(X\) is continuous, then such a copula \(C\) is unique. Conversely, if we know the joint cumulative distribution junction \(F\) and the marginals \(F_{1} , \cdots ,F_{d}\), we can find the copula via

$$ C(u_{1} , \cdots ,u_{d} ) = F(F_{1}^{ - 1} (u_{1} ), \cdots ,F_{d}^{ - 1} (u_{d} )) $$
(6)

where \(F_{j}^{ - 1} (t) = \inf \{ s:F_{j} (s) \ge t\}\).

Lemma 1 illustrates the existence of copula functions. For computing the weight of different marginal distributions, Kendall tau, one of the correlation measures of the copula function, is used in this section. The specific definition is given as

$$ \xi = \Pr \left[ {\left( {X_{1} - X_{2} } \right)\left( {Y_{1} - Y_{2} } \right) > 0} \right] - \Pr \left[ {\left( {X_{1} - X_{2} } \right)\left( {Y_{1} - Y_{2} } \right) < 0} \right] $$
(7)

where \((X_{1} ,Y_{1} ),(X_{2} ,Y_{2} )\) are independent identically distributed binary random vectors.

The copula function used in this paper is Gaussian copula, which belongs to a family of elliptic copula function. The cumulative distribution function and probability density function are given below.

$$ C(u,v;\rho ) = \int_{ - \infty }^{{\psi^{ - 1} (u)}} {\int_{ - \infty }^{{\psi^{ - 1} (v)}} {\frac{1}{{2\pi \sqrt {1 - \rho^{2} } }}\exp \left( {\frac{{ - r^{2} - s^{2} + 2\rho rs}}{{2 - 2\rho^{2} }}} \right)drds} } $$
(8)
$$ \begin{gathered} p(u,v;\rho ) \hfill \\ = \frac{1}{{\sqrt {1 - \rho ^{2} } }}\exp \left( { - \frac{{\psi ^{{ - 1}} (u)^{2} + \psi ^{{ - 1}} (v)^{2} - 2\rho \psi ^{{ - 1}} (u)\psi ^{{ - 1}} (v)}}{{2 - 2\rho ^{2} }}} \right)\exp \left( { - \frac{{\psi ^{{ - 1}} (u)^{2} \psi ^{{ - 1}} (v)^{2} }}{2}} \right) \hfill \\ \end{gathered} $$
(9)

where \(\psi^{ - 1} ( \cdot )\) is the inverse function of unary normal distribution, and \(\rho\) is the linear correlation coefficient.

According to the copula function above, the weight \(\tau_{i}\) of the \(i\) variable used to obtain the proportion of correlation can be defined as

$$ \tau_{i} = \frac{{\xi_{i} }}{{\xi_{1} + \cdots + \xi_{n} }} $$
(10)

where \(n\) represents the number of variables.

To analyze the performance of dynamic systems, the data model can be expressed as

$$ \rho_{s} (k) = \left[ {\begin{array}{*{20}c} {\rho^{T} (k)} & \cdots & {\rho^{T} (k + s)} \\ \end{array} } \right]^{T} $$
(11)

where \(k\) stands for the instant,\(s\) is the stacked size. It is worth to mention that \(\rho\) can represent any variables in systems. Therefore, the data-driven observer based on deep learning technology for dynamic systems can be constructed by full connection neural network as follows:

$$ \mathcal{N}\mathcal{N} = \min \sum\limits_{k = s + 1}^{N - s} {\frac{1}{2}\left\| {y_{s} (k) - \mathcal{A}\left( \begin{gathered} u_{s - 1} (k - s) \hfill \\ \begin{array}{*{20}c} {\;\;\;\;u_{s} (k)\;\;\;\;\;\;;} & \Theta \\ \end{array} \hfill \\ y_{s - 1} (k - s) \hfill \\ \end{gathered} \right)} \right\|_{2}^{2} } $$
(12)

According to the setting of network model above, the observer can be defined as

$$ \hat{y}(k) = \mathcal{A}\left( {u(k);\Theta } \right) $$
(13)

Based on (8) and (11), the joint data-driven observer for traction systems can be obtained as

$$ \hat{y}_{\_joint} (k) = \frac{1}{n}\sum\limits_{i = 1}^{n} {\tau_{i} \hat{y}_{i} (k)} $$
(14)

where \(\hat{y}_{\_joint}\) represents the estimation of joint observer output. Fault detection can be accomplished by errors below.

$$ e_{\_joint} (k) = y_{\_joint} (k) - \hat{y}_{\_joint} (k) $$
(15)

2.3 Implementation Procedures of Fault Detection

Following the proposed fault detection method in this study, the implementation procedure is formulated in Algorithm 1.

Algorithm 1 Fault detection using joint observer

1: Load the normal data of \(y_{i}\) and fault data \(y_{i}^{f}\)

2: Calculate the copula function by (6) and obtain the Kendall tau by (7)

3: Calculate the joint weight by (10)

4: Construct the data-driven observer by (12)

5: Estimate the joint output of systems by (14)

6: Detect the fault by using errors in (15)

3 Verifications

For illustrating the practicality of the proposed method, two different experiments developed by traction systems are designed. Then, the experimental environment and experimental results are given below.

3.1 Experiment on Pilot-Scale Platform

The traction systems shown in Fig. 2 contains the cabinet, the traction motor, and sensors. The input voltage is 220V and the power of module is 10KW. The parameters of traction systems in this experiment are given in Table 1. Figure 3(a) illustrates the probability density function of copula. It is different with the Archimedean copula function because of the symmetric tail correlation. Figure 3(b) shows the fault detection result using the proposed method. The system fault is happened in the 100th sample. When the fault occurs, the residual has a significant step.

Fig. 2.
figure 2

Traction control systems in laboratory

Table 1. Parameters of motor
Fig. 3.
figure 3

Experimental results of traction control systems in laboratory

3.2 Verification on Traction Systems of High-Speed Trains

For illustrating the practicality, the proposed method is verified by the running data of traction system in high-speed trains. The traction motor equipped in trains is shown in Fig. 4. All the running data are collected from monitoring systems in high-speed trains. Figure 5(a) illustrates the probability density function of copula for actual data. It also has symmetric tail correlation. Figure 5(b) shows the fault detection result using the running data of train system. The system fault happened in the 35th sample. Although, due to the complexity of the actual systems, false alarm appears at the 9th and 13th sample points, the results of fault detection are still satisfactory.

Fig. 4.
figure 4

Traction systems of high-speed trains

Fig. 5.
figure 5

Experimental results of traction control systems in high-speed trains

4 Discussions

Although data-driven fault detection technology has some limitations in its current development stage, it still has significant potential for industrial systems. Especially for multi-sensor monitoring systems, the observer-based modeling approach can describe the system's dynamic behavior and directly construct residual generators to detect faults. Most importantly, when sufficient data is available, data-driven methods can greatly reduce modeling complexity and achieve satisfactory detection results. However, for the traction system of high-speed trains, there are still some issues that need further consideration:

  1. (1)

    In engineering applications, the real-time performance of algorithms is always a hot topic. How to further improve the computing efficiency of high frequency sampling systems deserves more attention.

  2. (2)

    While nonlinear observer models are effective in describing system characteristics, the design of such models for distributed systems poses significant challenges.

  3. (3)

    Besides the characteristics of initial variables, the correlation between process variables and noise should be considered.

5 Conclusions

In this article, a data-driven joint observer is proposed for traction systems in high-speed trains. To obtain the joint distribution of system state, marginal distributions of all variables are considered in this scheme. In the end, the proposed method is verified by different experiments. Results shows the superiority of this method. Benefiting from the joint model, future research consists of fault configuration and health prediction for other complex systems is possible.