1 Introduction

As a very important part of transmission equipment, bearings are widely found in various power transmissions [1]. Bearing failure is an important factor in inducing malfunctions in mechanical transmissions, and periodic friction or impact is generated when bearings fail. Since its failure is difficult to diagnose by appearance, it is generally diagnosed by the vibration signal extracted from the acceleration sensor. However, the acquired signals are typically non-smooth, nonlinear modulated signals that tend to exhibit modulation of the slew frequency on the engagement frequency [2]. Therefore, the extraction and analysis of modulated components of these vibration signals have always been a difficult task for bearing fault diagnosis.

In recent years, signal decomposition algorithms have made good developments in gear fault feature extraction. Yang et al. performed local mean decomposition (LMD) on the collected vibration signals and extracted the characteristics of diaphragm pump check valve fault signals by Hilbert envelope spectral analysis [3]. Han et al. used the empirical mode decomposition algorithm (EMD) to decompose the vibration signal, and the time, frequency, energy characteristic parameters, and box dimensions were calculated from the time domain, frequency domain, energy domain, and fractal domain to obtain the gear fault characteristics under different load excitations [4]. Li et al. proposed a rolling bearing fault feature extraction method combining ensemble empirical mode decomposition (EEMD) and improved frequency band entropy (IFBE). A bandpass filter designed based on the frequency band entropy is used to optimize the bandwidth parameters according to the maximum envelope kurtosis principle, and a sensitive IMF reflecting the fault characteristics is selected [5]. Wang et al. combined complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) with time–frequency representation demodulation analysis and proposed a strategy to reweight and reconstruct the decomposed eigenmodal functions to ensure fault feature extraction under Gaussian and non-Gaussian noise [6]. LMD, EMD, and their improved methods are adaptively decomposed, but the modal mixing problem is still difficult to solve.

The advent of variational modal decomposition has brought signal decomposition techniques to new heights [7], and it has achieved success in reducing the complexity of non-stationary signals and minimizing the effectiveness of modal aliasing. Xing et al. proposed a particle swarm algorithm based on the joint fitness function of the crag and power spectral entropy to optimize the number of decomposition layers and the penalty factor of the variational modal decomposition, select the mode with the highest energy for signal reconstruction, and apply the Hilbert transform to obtain the time difference of the echo signal [8]. Ji et al. proposed the energy difference and correlation coefficient as the criterion for the number of modal decompositions in VMD and improved the interval sampling energy operator to quickly extract the instantaneous amplitude and frequency of harmonics, realizing the fast and accurate measurements of non-stationary harmonic time–frequency parameters [9]. In feature extraction, the envelope entropy of the vibration signal is an important indicator for determining bearing faults. Tan et al. proposed a hybrid framework based on Multi-Envelope Teaching Optimization (METLBO) by combining parametric optimization variational modal decomposition with an improved support vector machine (ISVM) [10]. An improved parametric adaptive variational modal decomposition is proposed by Miao et al. A new integrated kurtosis index is constructed by combining kurtosis and envelope spectral kurtosis. The mean value of the ensemble kurtosis of all modes is selected as the objective function, and an iterative algorithm is used to extract all potential fault information [11]. However, there is room for improvement in the determination of the optimal number of decomposition layers K for modal decomposition.

To address the problem of fault feature enhancement, in 2012, Mcdonald et al. proposed maximum correlation cliff deconvolution based on minimum entropy inverse fold product, which can significantly enhance the shock component of the fault signal [12]. In recent years, MCKD has been widely used for fault diagnosis of gear, bearing, and other rotating machinery components [13, 14]. Chen et al. used the MCKD technique for signal noise reduction to enhance the energy amplitude of components related to planetary bearing faults and improved the feature optimization map to improve the envelope spectrum to effectively extract bearing fault vibration components [15]. He et al. proposed the maximum correlation kurtosis inverse fold product as an effective means to identify periodic pulses of fault signals and filter out bearing fault information using the envelope spectrum generated by the correlation kurtosis index [16]. Meanwhile, for the parameter selection problem in MCKD, some intelligent optimization algorithms, including gray wolf optimizer (GWO) [17], sparrow search algorithm (SSA) [18], and dung beetle optimizer (DBO) [19], play an important role. Jun et al. used an artificial fish swarm algorithm to optimize the filter length L and shift number M in MCKD to achieve a multi-fault diagnosis of bearings [20]. Song et al. combined the sample entropy and crag index to construct the minimum entropy crag ratio and used genetic algorithms to search for the minimum value of the minimum entropy crag ratio to determine the optimal parameter combinations of the number of shifts, filter lengths, and periods, and to realize the effective extraction of weak fault features [21].

Therefore, to solve the problem that the determination of parameter K in VMD is too dependent on the a priori knowledge, an optimal decomposition layer calculation method based on SVD is proposed, and for the problem that it is difficult to identify the weak fault features in the background of strong noise, DBO is used to optimize the L, M parameter of MCKD to enhance the fault features of vibration signals. Determine the optimal number of decomposition layers of VMD to decompose the vibration signal using the singular value kurtosis difference spectrum. The IMF reconstructed signals are screened according to the correlation coefficient criterion and the kurtosis index, and the weak pulse features are enhanced using the MCKD optimized by the DBO algorithm. Fault diagnosis is carried out by analyzing the fault frequency of the envelope spectrum of the bearing signal, and simulation and experimental analysis verify the effectiveness of the method in bearing fault diagnosis.

2 Basic theory

2.1 Optimal K-value algorithm based on SVD

VMD is based on the adaptive property of signal decomposition, which depends on the frequency information of the signal. The real-valued signal is decomposed into K eigenmode functions by VMD, and the choice of the value of K has a great influence on the decomposition result. When the value of K is too small, it results in incomplete decomposition. And when the value of K is too large, it results in over-decomposition. Therefore, the distance between the center frequencies of adjacent IMF components is calculated using the following equation:

$$ \min \left( {\left| {\frac{{\omega_{i + 1} - \omega_{i} }}{{\left( {\omega_{i + 1} - \omega_{i} } \right)/2}}} \right|} \right) \le \gamma $$
(1)

where \(\omega_{i}\), \(\omega_{i + 1}\) are two similar center frequencies, \(\gamma\) is a constant and is set to 0.1.

Singular value decomposition has the ability of noise filtering, interference insensitivity, and high-resolution spectral decomposition. Singular values in a signal often carry important information, and it is one of the important features of the signal. The difference between neighboring singular values in a singular value matrix can be defined by the singular value difference spectrum, and the formula for neighboring singular value difference is as follows:

$$ b_{i} = \sigma_{i} - \sigma_{i + 1} ;\left( {i = 1,2, \cdots \left( {n - 1} \right)} \right) $$
(2)

The sequence composed of \(b_{i}\) is called the singular value kurtosis difference spectrum. When the neighboring singular values are different, a peak appears in the singular value difference spectrum, i.e., a mutation occurs. The location of the mutation is used to find out where the original signal is converted from a useful signal to a noise signal. Kurtosis is very sensitive to the shock characteristics of the signal, when the rolling bearing occurs local failure, the vibration signal will have shock characteristics, so kurtosis can be used as the basis for selecting the effective singularity.

In summary, an improved SVD-based method for calculating the optimal K value is proposed in Fig. 1.

Fig. 1
figure 1

Flow chart for calculating the optimal number of decomposition layers

2.2 Correlation and kurtosis metrics

After the original signal is decomposed into K IMFs, the IMFs are often filtered using the correlation principle. The strength of the correlation between the IMF and the original signal is determined by calculating the correlation coefficient between the IMF and the original signal. Assuming that x and y are two time-domain signals, the correlation coefficient \(r_{xy}\) is calculated as follows:

$$ r_{xy} = \frac{{\sum\nolimits_{i - 1}^{N} {\left( {x_{i} - \overline{x}} \right)\left( {y_{i} - \overline{y}} \right)} }}{{\sqrt {\sum\nolimits_{i = 1}^{N} {\left( {x_{i} - \overline{x}} \right)^{2} } \sum\nolimits_{i = 1}^{N} {\left( {y_{i} - \overline{y}} \right)^{2} } } }} $$
(3)

where N is the length of the signal; \(\overline{x}\) and \(\overline{y}\) are the mean values of signal x and y.

Since the correlation coefficient is greatly affected by noise, the kurtosis index is added to the screening. According to experience, the signal kurtosis is close to 3 when the normal bearing is running, and when the bearing fails, the signal kurtosis will be larger than 3. In this paper, we calculate the correlation coefficient and kurtosis value between each IMF and the original signal and filter out the IMF components that satisfy the correlation coefficient larger than the threshold value and the kurtosis larger than 3 at the same time for the reconstruction of the signal.

2.3 Optimization of DBO-based signal enhancement algorithm

The MCKD algorithm can effectively extract the periodic shock pulse components, and its solution process is a process of solving the optimal solution of the FIR filter. To verify that the signal recovered by this filter satisfies the periodic shock characteristics, it is necessary to utilize the associated cliff metric. The signal is filtered and the filter solved is the one that satisfies the requirements when the correlation cliff \(CK_{M} \left( T \right)\) is maximized. The algorithm has 3 important parameters T, L, and M.

In this paper, we use the DBO algorithm to optimize the L and M parameters of MCKD, using the sample entropy as the objective function. Set the population size N = 30, the number of iterations n = 50, the range of values of L is [100, 500], and the range of values of M is [1, 7] in the DBO algorithm. The DBO optimization of the MCKD algorithm is performed to find the best combination of (L, M) by taking the minimum value of the sample entropy of the effective component as the objective function. Where the sample entropy is calculated as

$$ SampEn\left( {m,r} \right) = \ln B^{m} \left( r \right) - \ln A^{m} \left( r \right) $$
(4)

where \(B^{m} \left( r \right)\) is the average value of the ratio of the approximate number of signal sequences to the total number, and \(A^{m} \left( r \right)\) is the average value of the ratio of the approximate number of signal sequences to the total number after adding 1 to the dimension.

2.4 Fault diagnosis process

This paper proposes a bearing fault diagnosis method based on improved variational modal decomposition and envelope spectral analysis, and the structural flowchart of the fault diagnosis method is shown in Fig. 2. The main algorithmic process consists of the following steps: Firstly, the vibration signals are acquired; the original signals are denoised based on the singular value kurtosis difference spectrum, and the optimal number of signal decomposition layers K is determined; the vibration signals are decomposed by the VMD and screened; the IMFs that meet the correlation and kurtosis indexes are used for signal reconstruction; the reconstructed signals are augmented with the MCKDs optimized by the dung beetle optimization algorithm; the augmented signals are demodulated by the envelopes and analyzed for the frequency of the faults.

Fig. 2
figure 2

Bearing fault diagnosis flow chart

3 Simulation signal analysis

To verify the effectiveness of the method, simulated signals are created and analyzed in this paper. To simulate the strong noise operating environment of the bearing, a Gaussian noise signal with a signal-to-noise ratio of − 14 dB is added to the pulse signal to form the analog signal. The final analog signal is shown in Eq. (5):

$$ \left\{ {\begin{array}{*{20}l} {x\left( t \right) = s\left( t \right) + n\left( t \right) = \sum\limits_{i} {A_{i} h\left( {t - iT} \right) + n\left( t \right)} } \hfill \\ {h\left( t \right) = \exp \left( { - Ct} \right)\cos \left( {2\pi f_{n} t} \right)} \hfill \\ {A_{i} = 1 + A_{0} \cos \left( {2\pi f_{r} t} \right)} \hfill \\ \end{array} } \right. $$
(5)

where \(s\left( t \right)\) is the periodic shock part; the amplitude \(A_{0}\) is 0.3; the rotation frequency \(f_{{\text{r}}}\) is 30 Hz; the attenuation coefficient C is 700; the resonance frequency \(f_{n}\) is 4 kHz; the inner ring fault characteristic frequency \(f_{i}\) = 1/T = 120 Hz; \(n\left( t \right)\) is the Gaussian white noise component; the sampling frequency \(f_{{\text{s}}}\) is 16 kHz; and the number of analysis points is 4096.

Figure 3a shows the time domain diagram of the simulated signal, and it can be seen from the figure that the simulated signal shows an obvious periodic variation.

Fig. 3
figure 3

Analog signal waveform diagram

The time domain plot and spectrum of the signal after adding − 14 dB of random Gaussian white noise are shown in (b) and (c) in Fig. 3. The signal characteristics are almost completely drowned out after noise contamination, and the pulse frequencies are hidden by the noise, the resonant frequencies are also vaguely visible. Figure 3d shows the envelope spectrum of the simulated signal after noise staining. The fault frequencies and their multiples in the figure are masked by noise and are indistinguishable.

(e) is a time–frequency plot of the dyed-noise signal, with a large number of noise points in the plot.

The optimal number of decomposition layers K = 10 for the analog signal is determined using the method proposed in Sect. 2.1. IMF3, IMF6, IMF7, IMF9, and IMF10 that satisfy a crag greater than 3 and a correlation coefficient greater than the mean value of 0.33 were screened for signal reconstruction.

The time domain plot spectrum and the time–frequency analysis spectrum of the reconstructed signal are shown in Fig. 4a and b. Compared with the original analog signal, the burr and noise bands in the reconstructed signal are much reduced, and the noise reduction effect is obvious. To visualize the noise reduction effect, a time–frequency analysis of the signal is shown in Fig. 4c. Comparing with Fig. 3e, it is found that the noise in high and low frequencies has been eliminated substantially. That is, most of the high and low-frequency noise is eliminated, but a small amount of noise still exists. Figure 4d shows the envelope spectrum of the reconstructed signal. The fault frequencies and their multiples can already be seen in the figure, but they cannot be clearly extracted and there are many interfering spectral lines.

Fig. 4
figure 4

Reconstructed signal waveform diagram

The time-domain and envelope spectra of the signals processed by the method of this paper are shown in Fig. 5a and b. The frequency at the maximum peak in the plot is chosen as the characteristic frequency \(f_{i}\), so \(f_{i} = 120\). The DBO algorithm is used to optimize the parameters L and M of MCKD. The population number of the DBO algorithm is set to 100, the maximum number of iterations is 10, and the objective function of parameter optimization is the minimum value of sample entropy. At the 6th iteration, the optimal parameters are found to be [243, 1]. Therefore, the parameters L = 243, M = 1, and T = 75 are set for the MCKD algorithm. The characteristic spectral lines in the figure are very prominent, and the characteristic frequency \(f_{i}\) of the fault and its multiples can be easily identified, thus confirming the effectiveness of the proposed diagnosis method.

Fig. 5
figure 5

Time domain and envelope spectra of the signal processed by this method

To verify the rationality of using the DBO algorithm to optimize the parameters L and M of MCKD, we now change the parameters of M and keep L constant to observe the results. The signal envelope spectrum analyzed by the MCKD algorithm with random order M = 7 and L = 243 is shown in Fig. 6a. As can be seen in Fig. 6a, compared to the optimal combination of L and M, when only L is guaranteed to be at the optimal value, the amplitude of the envelope spectrum of the signal processed by the method in this paper is small and reduced to about one percent of the original one. Then keeping the value of M constant and changing the value of L, that is, taking L = 300 and M = 1 randomly, the signal envelope spectrum analyzed by the MCKD algorithm is shown in Fig. 6b. In Fig. 6b, there are numerous interference spectral lines, which make it difficult to identify the fault frequency and its multiplication information. It shows the reasonableness of the results of parameters L and M of MCKD optimized by DBO in this paper.

Fig. 6
figure 6

Envelope spectrum after changing MCKD parameters

4 Measured signal analysis

4.1 Data source

To verify the effectiveness of the method in this paper, the test data from the publicly available rolling bearing failure simulation platform at Case Western Reserve University were used for analysis. The EDM technique was used to set the fault, the motor speed was 1797 r/min, the sampling frequency was 12 kHz, and 7800 points were selected for analysis in this paper. The structural parameters of the rolling bearing are shown in Table 1.

Table 1 Bearing failure information

4.2 Signal decomposition and screening

Taking the inner ring fault data as an example, the waveforms of the fault signal are shown in Fig. 7a–d. In the time-domain diagram, the signal waveform is relatively noisy, and there is no obvious shock signal. In the spectrogram, there is an obvious modulation phenomenon in the range of 2000–5000 Hz. The time–frequency diagram of the fault signal is densely populated with noise, indicating that it contains more noise. In the envelope spectrum, the characteristic frequency and its multiplier are covered by the rotary frequency and its multiplier as well as the noise interference band, and the weak pulse characteristics are almost completely ignored, which makes it impossible to carry out fault diagnosis.

Fig. 7
figure 7

Bearing failure signal waveform diagram

Using the method proposed in this paper, the signal is first denoised by SVD, and the optimal number of decomposition layers K = 8 is determined based on the singular value kurtosis difference spectrum. The histograms of correlation coefficient r and kurtosis bk for each IMF are shown in Fig. 8. The mean value of the correlation coefficient is 0.36, and the sub-signals IMF3, IMF4, IMF5, IMF6, and IMF7 with r greater than 0.36 and bk greater than 3 are screened for reconstruction.

Fig. 8
figure 8

Correlation coefficients and kurtosis values for IMFs

4.3 Signal reconstruction and enhancement

The waveforms of the reconstructed signals are shown in Fig. 9a–d. In the figure, it can be observed that the shock signal features covered by noise have been revealed after processing, and the noise in the time–frequency diagram has been reduced significantly, indicating that the proposed method is effective in noise reduction, and it can be seen from the envelope spectrum that the octave of the eigenfrequency is difficult to distinguish.

Fig. 9
figure 9

Bearing reconfiguration signal waveform diagram

MCKD is optimized using DBO, the population size of the DBO algorithm is set to 100, and the maximum number of iterations is 15. The fitness curve of DBO is shown in Fig. 10, and the parameters reach the optimum at the 12th iteration, and the optimal parameter combination is [132, 1]. The reconstructed signal is subjected to the MCKD algorithm, and the optimal parameters are set L = 132 and M = 1 according to the optimal parameters.

Fig. 10
figure 10

DBO optimization curve

4.4 Envelope comparison

The envelope spectrum of the signal processed by the MCKD algorithm with optimized parameters is shown in Fig. 11a, and the time-domain diagram shows that the frequency of the largest value in the diagram is the eigenfrequency of \(f_{i}\) = 161.538 Hz, which can be seen in the figure, the eigenfrequency of the fault and its multiplier frequency is obvious and \(f_{i}\) corresponds to the theoretically calculated fault frequency of the inner ring of the bearing, therefore, it can be determined that the fault location of the rolling bearing is in the inner ring of the bearing. To emphasize the effect of noise reduction, random white noise with a signal-to-noise ratio of − 3 dB is added to the fault signal of the inner ring of the bearing \(f_{{{\text{ir}}}}\), and the envelope spectrum processed by the method of this paper is shown in Fig. 11b, and it can be seen that the characteristic frequency and its multiplicative frequency are also clearly extracted.

Fig. 11
figure 11

The envelope spectrum of the inner ring fault signal processed by this method

The envelope spectra for the outer bearing signals using the method proposed in this paper are shown in Fig. 12. Where Fig. 12a shows the unstained outer bearing signal and (b) shows the stained signal with − 3 dB random white noise added. The MCKD optimized by DBO enhances the weak impulse fault, and the optimization combinations are [235, 1] and [309, 1], respectively, and the frequency opposite to the largest magnitude in the figure is taken as the eigenfrequency \(f_{i}\) = 107.629 Hz. The eigenfrequency and its multiplier frequency in the figure are extracted in an orderly manner, with almost no sideband effects, and correspond to the theoretical value of the bearing outer ring fault frequency \(f_{{{\text{or}}}}\), so it is possible to determine the location of rolling bearing faults in the bearing outer ring.

Fig. 12
figure 12

The envelope spectrum of the outer ring fault signal processed by this method

4.5 Comparison with the PE-MCKD methodology

Liu et al. used an improved MCKD method using the maximum alignment entropy value to determine the optimal filter length L and the optimal fault period T [22]. To verify the superiority of the method in this paper, their proposed PE-MCKD method was used to diagnose the inner ring faults of Case Western Reserve University bearings. Figure 13a shows the untainted noise envelope spectrum processed by the method, and the optimization combination of L and T is [279, 74], in which the fault frequency \(f_{i}\) = 161.538 Hz, and its 2–5 octave frequencies can be seen; (b) shows the tainted noise envelope spectrum with the addition of a − 3 dB Gaussian white noise, and the optimization combination of L and T is [264, 11], and in which the fault frequency and its octave frequencies have been masked by the noise and are not recognizable. The fault frequency and its octave have been obscured by noise and cannot be recognized in the graph. This indirectly proves the superiority of the method proposed in this paper.

Fig. 13
figure 13

PE-MCKD processing inner ring fault signal envelope

5 Conclusion

The algorithm proposed in this paper realizes the fault diagnosis of rolling bearing, introduces the SVD to get the singular value matrix of the vibration signal, locates the position of the mutation of the singular value peak difference spectrum to determine the effective order of the singular value matrix, to denoise the vibration signal. In addition, the peak singular value difference spectrum mutation is set as the initial K-value of VMD, and a reasonable K-value is searched according to the optimal K-value algorithm proposed in this paper, which improves the status quo of the K-value setting relying too much on empirical values. The IMF components are screened and signal reconstruction is performed using the principles of kurtosis and correlation. For the first time, the DBO algorithm and the MCKD algorithm are combined to search for the optimal values of the parameters L and M in the MCKD, and the optimized MCKD algorithm is used to enhance the reconstruction of weak pulses in the signal. The experimental results show that the proposed method can effectively remove the noise in the vibration signal, and the K value calculated by the optimal K value algorithm can effectively decompose the signal under the premise of ensuring that no modal aliasing occurs. The DBO optimization algorithm can jump out of the local optimum to find the optimal combination of L and M of the MCKD algorithm, and the experiments verified the reasonableness and validity of the optimization of the MCKD algorithm using the DBO algorithm. Comparison with the PE-MCKD algorithm also confirms that the algorithm proposed in this paper has good noise immunity, effectively extracts the bearing fault characteristics, and determines the fault location in a strong background noise environment.