Keywords

1 Introduction

Gearbox is commonly used in industrial production [1]. Gearbox is an important part of the power transmission of many mechanical devices. It is used to change the transmission ratio, direction of change, and power transmission. In the actual production life, the operating state of the gearbox is closely related to the operating state of the equipment, while the gears, shafts and other components in the gearbox work under high load for a long time, and the probability of mechanical failure is higher [2,3,4]. According to relevant statistics, the gear failure is about 60% of the failure of the gearbox components. Therefore, research on gear fault diagnosis in gearboxes has become very valuable and necessary [5, 6].

As early as 1901, PCA was first proposed by Pearson. The basic idea of the method is to compress multiple linear correlation variables into a multivariate statistical method of a few unrelated variables [7, 8]. Ding et al. reduced the amount of data dimension by PCA [9]. PCA reduced the multidimensional correlated variable to be low dimensional independent eigenvector. PCA has advantages in the simple concept, convenient calculation. It has been widely used in numerous areas such as data compression, feature extraction, image processing, signal analysis etc. al [10].

In 1947, Wald introduced SPRT in his book. Because of its simplicity and high efficiency, this method has been widely used in the field of fault diagnosis in recent years. Ray et al. applied SPRT to fault detection and identification in nuclear power plants and aircraft [11]. Chen et al. used SPRT for gearbox fault diagnosis [12, 13]. Compared with the traditional method, Compared with the traditional method, sequential probability ratio test differs from the fixed sample test in that this algorithm requires a smaller average sample size, but it has a higher test efficiency [14].

This paper proposes a new gear multi-fault monitoring method based on PCA and SPRT. The gear vibration signal is denoised by wavelet packet transform, and then PCA is taken to remove the characteristic data of the signal and select the appropriate principal element as the substitute data of SPRT. Finally, SPRT algorithm is used to identify the fault state of the gearbox. With the aim for corroborating the potency of SPRT algorithm for fault diagnosis of the gearbox, the equipment fault diagnosis of the gearbox is diagnosed by jointing the root mean square error (RMSE) and the binary SPRT. The final consequences show that the strategy is effective and dependable.

2 SPRT Algorithm of Gearbox

In the likelihood ratio test, the change in mean and standard deviation has a greater impact on the results. The test sequence selected after pre-processing and PCA basically conforms to the Gaussian distribution, and the mean value is \( c\mu \), and the standard deviation is \( \sigma \). Suppose that the probability distribution of one of the groups to be tested satisfies the initial hypothesis: \( H_{j} :\mu = \mu_{j} \); the probability distribution of another group of sequences to be tested satisfies the alternative hypothesis: \( H_{j} :\mu = \mu_{j} \).The standard deviation \( \sigma \) does not change. When the both hypotheses are reality, the joint probability density function (PDF) of the two sets of sequences is as follows:

$$ P_{ik} \left( {y_{k} } \right)\, = \,\frac{1}{{\sigma \sqrt {2\pi } }}\exp \left( { - \frac{1}{{2\sigma^{2} }}\left( {y_{k} - \mu_{i} } \right)^{2} } \right) $$
(1)
$$ P_{jk} \left( {y_{k} } \right)\, = \,\frac{1}{{\sigma \sqrt {2\pi } }}\exp \left( { - \frac{1}{{2\sigma^{2} }}\left( {y_{k} - \mu_{j} } \right)^{2} } \right) $$
(2)

Where \( P_{jk} \left( {y_{k} } \right) \) represents PDF under selective hypothesis; \( P_{ik} \left( {y_{k} } \right) \) represents PDF under null hypothesis.

We could calculated the likelihood ratio of SPRT as follows:

$$ \lambda_{i,j} \left( {Y_{Sm} } \right)\, = \,\frac{{\prod\limits_{k = 1}^{n} {p_{jk} } }}{{\prod\limits_{k = 1}^{n} {p_{ik} } }} = \frac{{p_{j1} \left( {y_{1} } \right)p_{j2} \left( {y_{2} } \right) \cdots p_{jk} \left( {y_{k} } \right)}}{{p_{i1} \left( {y_{1} } \right)p_{i2} \left( {y_{2} } \right) \cdots p_{ik} \left( {y_{k} } \right)}} \times \frac{{p_{j0} }}{{p_{i0} }} $$
(3)

Where \( S_{m} \,{ = }\,\left\{ {S_{1} ,S_{2} ,S_{3} ,S_{4} } \right\} \), \( P_{j0} \) and \( P_{i0} \) are prior probability functions under selective and null hypothesis. SPRT probability ratio \( \lambda_{i,j} \left( {Y_{Si} } \right) \) and \( \lambda_{i,j} \left( {Y_{Sj} } \right) \) are computed by inputting the testing data \( \left( {Y_{Si} ,Y_{Sj} } \right) \) of the signal waveform for gearbox \( Si \) and \( Sj \) conditions to Eq. (3). SPRT probability ratios \( \lambda_{i,j} \left( {Y_{Si} } \right) \) and \( \lambda_{i,j} \left( {Y_{Sj} } \right) \) are expressed in the following section. The Eq. (3) can be simplified:

$$ \Delta_{i,j} \left( {Y_{Sm} } \right) = \ln \lambda_{i,j} \left( {Y_{Sm} } \right) = \ln \frac{{\prod\limits_{k = 1}^{n} {p_{jk} } }}{{\prod\limits_{k = 1}^{n} {p_{ik} } }} = \sum\limits_{k = 1}^{n} {\ln \frac{{p_{jk} }}{{p_{ik} }}} $$
(4)
$$ E_{i,j} \left( {Y_{Sm} } \right) = \left( {\frac{1}{k}\sum\limits_{t = 1}^{k} {\left( {\lambda_{i,j} \left( {Y_{Sm} } \right) - \lambda_{i,j} \left( {Y_{S} } \right)} \right)^{2} } } \right)^{{\frac{1}{2}}} \,m = i,j $$
(5)

3 Experiment System

The normal gear F1 and the cracked fault gears F2, F3, and F4 were selected for the experiment. The full depth of the gear crack is \( a\, = \,2.4\,\text{mm} \), the full width of the crack is \( b\, = \,25\,\text{mm} \), the thickness of the gear crack is 0.4 mm, and the angle of the gear crack is 45°. In the experiment, the gearbox was in no-load state and the speed was 800 r/min. The parameters of the three sets of gears are shown in Table 1.

Table 1. Four gear failure modes

Figure 1 is a structural diagram of a gearbox. Vibration is generated between the running gears 3 and 4, so one of them is selected to simulate the fault. In this experiment, the gear 3 was selected for the simulation experiment.

Fig. 1.
figure 1

Gearox structure

The experimental signal acquisition is obtained from the gearbox SpectraQuest dynamic simulator. A PCB352C67 accelerator is fixed in the vertical and horizontal directions of the gearbox. The collected initial vibration signals are input into the PC and stored by the DSP20-42 signal analyzer. This paper only studies the analysis of signals in the horizontal direction. S1, S2, S3, and S4 are used to represent the signals under normal conditions and three fault conditions.

4 Multi-fault State Recognition of Gearbox Based on Three-Layer SPRT

4.1 Diagnosis of Gearbox in Fault State F1

The Eqs. (13) are used to calculate the variable \( \lambda_{1,2} \left( {Y_{S1} } \right) \). The parameters \( \mu_{0} \) and \( \mu_{1} \) are the means of the waveform under \( \left( {F1,F2} \right) \) condition. The variables \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) \), \( \lambda_{1,2} \left( {Y_{S2} \left( N \right)} \right) \), \( \lambda_{1,2} \left( {Y_{S3} \left( N \right)} \right) \), \( \lambda_{1,2} \left( {Y_{S4} \left( N \right)} \right) \) are calculated by Eqs. (45) by using ten waveforms \( \left( {Y_{S1} } \right) \) under \( F1 \) and one testing waveform of the \( F2,F3,F4 \).

Figure 2(a) shows the establishment of the unequal relationship \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) < b \). Figure 2(b) shows the variables \( \left( {E_{1,2} \left( {Y_{S1} } \right)} \right) \) for \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) \), \( N = 1, \cdots ,10 \) are compared with \( \left( {E_{1,2} \left( {Y_{Sj} } \right)} \right) \), \( j = 2,3,4 \) for \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) \) with \( \lambda_{1,2} \left( {Y_{Sj} } \right) \), \( j = 2,3,4 \). The parameters \( \left( {E_{1,2} \left( {Y_{Sj} } \right)} \right),j = 2,3,4 \) are bigger than \( \left( {E_{1,2} \left( {Y_{S1} } \right)} \right) \).

Fig. 2.
figure 2

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) \), \( \lambda_{1,2} \left( {Y_{S2} } \right) \), \( \lambda_{1,2} \left( {Y_{S3} } \right) \), \( \lambda_{1,2} \left( {Y_{S4} } \right) \) versus sequential test iteration number. (b) RMSEs \( E_{1,2} \left( {Y_{S1} \left( N \right)} \right) \), \( E_{1,2} \left( {Y_{S2} } \right) \), \( E_{1,2} \left( {Y_{S3} } \right) \), \( E_{1,2} \left( {Y_{S4} } \right) \).

Figure 3(a) shows the establishment of the unequal relationship \( \lambda_{1,2} \left( {Y_{S1} \left( N \right)} \right) < b \). Figure 3(b) shows the unequal relationship are established, which is \( E_{1,3} \left( {Y_{S1} \left( N \right)} \right) < E_{1,3} \left( {Y_{Sj} } \right),\,j = 2,3,4 \).

Fig. 3.
figure 3

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{1,3} \left( {Y_{S1} \left( N \right)} \right) \), \( \lambda_{1,3} \left( {Y_{S2} } \right) \), \( \lambda_{1,3} \left( {Y_{S3} } \right) \), \( \lambda_{1,3} \left( {Y_{S4} } \right) \) versus sequential test iteration number. (b) RMSEs \( E_{1,3} \left( {Y_{S1} \left( N \right)} \right) \), \( E_{1,3} \left( {Y_{S2} } \right) \), \( E_{1,3} \left( {Y_{S3} } \right) \), \( E_{1,3} \left( {Y_{S4} } \right) \).

Figure 4(a), the unequal relationship \( \lambda_{1,4} \left( {Y_{S1} \left( N \right)} \right) < b \) is established. In Fig. 4(b), the parameters \( E_{1,4} \left( {Y_{S1} } \right) \), \( E_{1,4} \left( {Y_{S3} } \right) \) and \( E_{1,4} \left( {Y_{S4} } \right) \) are less than the threshold \( E_{C1} \) and RMSEs \( E_{1,4} \left( {Y_{S2} } \right) \) are more than \( E_{C} \).

Fig. 4.
figure 4

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{1,4} \left( {Y_{S1} \left( N \right)} \right) \), \( \lambda_{1,3} \left( {Y_{S2} } \right) \), \( \lambda_{1,3} \left( {Y_{S3} } \right) \), \( \lambda_{1,3} \left( {Y_{S4} } \right) \) versus sequential test iteration number. (b) RMSEs \( E_{1,3} \left( {Y_{S1} \left( N \right)} \right) \), \( E_{1,3} \left( {Y_{S2} } \right) \), \( E_{1,3} \left( {Y_{S3} } \right) \), \( E_{1,3} \left( {Y_{S4} } \right) \).

As shown in Figs. (2, 3 and 4), if \( E_{1,2} \left( {Y_{S1} } \right) < E_{C} \) and \( \lambda_{1,2} \left( {Y_{S1} } \right)\, < \,b \) or \( E_{1,3} \left( {Y_{S1} } \right)\, < \,E_{C} \) and \( \lambda_{1,3} \left( {Y_{S1} } \right)\, < \,b \) are satisfied, then \( F1 \) could be diagnosed.

4.2 Diagnosis of Gearbox in Fault State F2

If the waveform is not \( S1 \) and falls into one of the other three waveforms that is \( \left( {S2,S3,S4} \right) \), the variables \( \lambda_{2,3} \) and \( \lambda_{2,4} \) are used to identify \( F2 \) by classifying \( \left( {S2,S3,S4} \right) \).

As is shown in Fig. 5(a), the inequality that \( \lambda_{2,3} \left( {Y_{S2} } \right) < b \) is satisfied. Figure 5(b) shows that \( E_{2,3} \left( {Y_{S2} \left( M \right)} \right) < E_{C} \) and \( E_{2,3} \left( {Y_{S4} \left( M \right)} \right) > E_{C} \).The inequality \( E_{2,3} \left( {Y_{S} } \right) < E_{C} \) is satisfied means the condition is \( F2 \) or \( F3 \). The variable \( \lambda_{2,4} \) is used to classify the waveform \( S2 \) and \( S4 \). Figure 6(a) shows \( \lambda_{2,4} \left( {Y_{S2} \left( M \right)} \right) < b \) is satisfied. Figure 6(b) shows RMSEs \( E_{2,4} \left( {Y_{S2} \left( M \right)} \right) < E_{C} \). The condition \( F2 \) is classified among the three conditions by the variable \( \lambda_{2,4} \). The inequalities \( E_{2,4} \left( {Y_{S2} } \right) < E_{C} \) and \( \lambda_{2,4} \left( {Y_{S2} } \right) < b \) are the index to diagnose \( S2 \) from \( \left( {S2,S3,S4} \right) \), which is effective to classify \( F2 \) among \( \left( {F2,F3,F4} \right) \).

Fig. 5.
figure 5

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{2,3} \left( {Y_{S2} \left( N \right)} \right) \), \( \lambda_{2,3} \left( {Y_{S3} } \right) \), \( \lambda_{2,3} \left( {Y_{S4} } \right) \) versus sequential test iteration number. (b) RMSEs \( E_{2,3} \left( {Y_{S2} \left( N \right)} \right) \), \( E_{2,3} \left( {Y_{S3} } \right) \) and \( E_{2,3} \left( {Y_{S4} } \right) \).

Fig. 6.
figure 6

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{2,4} \left( {Y_{S2} \left( N \right)} \right) \), \( \lambda_{2,4} \left( {Y_{S3} } \right) \), \( \lambda_{2,4} \left( {Y_{S4} } \right) \) versus sequential test iteration number. (b) RMSEs \( E_{2,4} \left( {Y_{S2} \left( N \right)} \right) \), \( E_{2,4} \left( {Y_{S3} } \right) \) and \( E_{2,4} \left( {Y_{S4} } \right) \).

4.3 Diagnosis of Gearbox in Fault State F3

Calculate the likelihood ratios of the sequences to be tested according to the formulas (1) to (5), labeled \( \lambda_{3,4} \left( {Y_{S1} \left( 1 \right)} \right) \cdots \lambda_{3,4} \left( {Y_{S3} \left( {10} \right)} \right) \) and \( \lambda_{3,4} \left( {Y_{S4} } \right) \), and the test results are shown in Fig. 7(a). It can be seen from the figure that the likelihood ratio \( \lambda_{3,4} \left( {Y_{S3} \left( M \right)} \right) < b \) of the vibration signal \( S3 \) in the fault state \( F3 \) and the likelihood ratio \( \lambda_{3,4} \left( {Y_{S4} } \right) < b \) of the vibration signal \( S4 \). Figure 7(b) shows that if \( E_{3,4} \left( {Y_{S3} } \right) < E_{3,4} \left( {Y_{S4} } \right) \) and \( E_{3,4} \left( {Y_{S3} } \right) < E_{C} \) are satisfied, the inequalities \( \lambda_{3,4} \left( {Y_{S3} \left( M \right)} \right) < b \) and \( E_{3,4} \left( {Y_{S3} } \right) < E_{C} \) are used to categorize the waveform \( S3 \) from the waveform \( S4 \). The gearbox condition \( F3 \) is diagnosed from the gearbox condition \( F4 \).

Fig. 7.
figure 7

Three-layer SPRT results: (a) likelihood ratio \( \lambda_{3,4} \left( {Y_{S3} \left( N \right)} \right) \), \( \lambda_{3,4} \left( {Y_{S4} } \right) \) versus sequential test iteration number; (b) RMSEs \( E_{3,4} \left( {Y_{S3} \left( N \right)} \right) \) and \( E_{3,4} \left( {Y_{S4} } \right) \).

5 Conclusions

In this paper, a gearbox fault diagnosis method based on PCA and SPRT is proposed. Firstly, WPT is used to denoise the initial vibration signal, and the time domain analysis method is utilized to extract the characteristic parameters of the signal. Then PCA is used to reduce the extracted feature parameters. The principal component with the biggest contribution rate is chosen as the monitoring sequence. Finally, SPRT algorithm is used to analyze the running state of the gearbox and combine the root mean square algorithm to classify the fault. Subsequently, detailed fault diagnosis experiments and analysis were carried out on the gearbox. SPRT was used to identify different faults and combined with the root mean square error to further give the identification criteria. The experimental results show that the fault diagnosis of the gearbox is accurate and effective.