1 Introduction

As the core component of rotating machinery, the rotor system plays a vital role in the high-speed operation of equipment [1, 2]. As the requirements for the speed and efficiency of mechanical equipment are more demanding, the clearance between components is increasingly smaller, which renders rub-impact a more common fault for the rotor system [3]. The rubbing fault may affect the service life of the machine and even lead to catastrophic accidents. Therefore, it is of great engineering and practical significance to research the methods of detecting and diagnosing the rubbing fault of the rotor system.

It is well known that the frequency response function (FRF) is a fundamental concept of linear systems and widely applied in engineering practice [4]. Some systems exhibit strong nonlinear characteristics, such as rotor systems with rub-impact faults [5,6,7]. An extension of the FRF theory to the nonlinear domain is known as the generalized frequency response functions (GFRFs), which were derived based on the Volterra series theory to describe the nonlinear characteristics of systems in the frequency domain. However, the GFRFs are rarely used to resolve complex engineering problems because it is tedious to calculate due to its multi-dimensional nature [8,9,10].

Lang and Billings proposed the concept of the nonlinear output frequency response functions (NOFRFs) based on the GFRFs to tackle this problem. The NOFRFs well explains the generation mechanism of nonlinear phenomena and provides a theoretical foundation for frequency domain analysis of nonlinear systems [11,12,13,14]. Therefore, it is extensively used to resolve some engineering problems such as structural health monitoring and fault diagnosis of complicated structures [15,16,17,18,19].

Peng [15, 16] employed the NOFRFs to detect cracks in beams and the position of nonlinear components in periodic structures. Bayma [17] designed a complete set of fatigue crack detection methods for beam structures by combining the NARX model with the NOFRFs. Based on the NOFRFs and the super-harmonic responses, Yao [18] proposed a detection strategy to locate the rub-impact fault in the rotor system. Cao [19] proposed a novel fault detection and diagnosis method for complicated nonlinear systems by combining the NOFRFs and evidence theory and employed this approach to diagnose the fault of the transmission system of a numerical control machine tool.

In order to improve the performance of the evaluated NOFRFs for nonlinear systems, scholars proposed associated indexes to reveal the system’s dynamic characteristics [20,21,22,23,24,25]. Peng [20] proposed a simple NOFRFs-based index, Fe, to identify the damage of aluminum plates with holes and cracks. The experimental results demonstrated that the values of Fe of the inspected structure and those of a damage-free structure are quite distinct. Huang [21] introduced the information entropy to integrate the first four-order of NOFRFs and proposed a new index Ne, which is applied to detect the cumulative fatigue damage of fatigued plate specimens and used connecting rod parts. The obtained results revealed that Ne is more sensitive to different degrees of structural damage and can detect working time. Based on indexes of Fe and Ne, Liu [22,23,24] proposed an evaluated method named the NOFRFs weighted contribution rate and proposed an associated index, the second-order optimal weighted contribution rate denoted by Rm. The simulation and experimental results verified its sensitivity and superiority comparing to Fe and Ne in the fault diagnosis of rotor systems. Based on the concepts of KL divergence and NOFRFs, Mao [25] proposed the NKL detection index. And through the hammering frequency response experiment of the train wheelset, it was verified that NKL can effectively identify the fatigue damage and be proportional to the service time of the wheelset.

During the in-depth research of NOFRFs weighted contribution rate in Refs. [22,23,24], it is found that only the second-order weighted contribution rate is considered in this method but ignoring the influence of the system’s nonlinearity on the other orders. Besides, as the severity of the fault increases, the values of this index Rm and its increments are relatively small and the sensitivity to faults is slightly lower. So it is necessary to propose a new index that can more fully characterize the system's nonlinearity and has greater sensitivity. Moreover, Rm is more easily affected by stiffness and friction coefficient. So this method can be improved, as well as the index of the second-order optimal weighted contribution rate.

In order to solve these problems, a new fault detection method named the improved NOFRFs weighted contribution rate based on KL divergence (NWKL) is proposed in this paper, which is applied to detect the rub-impact fault of the rotor system. And the associated index of the improved NOFRFs optimal weighted contribution rate based on KL divergence (KLRm) shows high sensitivity to rotor rub-impact both in simulation and experimental results. Compared to the previous method, the performance of the improved method is optimal. Finally, a novel evaluation method known as the feature dynamic stability analysis is proposed, which is applied to study the influence of the friction coefficients and rub-impact stiffness of the rubbing rotor system on the detection method. And the simulation results verified the feature dynamic stability of the proposed method.

2 Related theory of the method of NWKL

2.1 The NOFRFs of nonlinear systems

The relationship between the input signal and the output signal of a nonlinear system in the time domain can be described as

$$ y(t) = \sum\limits_{n = 1}^{N} {\int_{ - \infty }^{\infty } \cdots \int_{ - \infty }^{\infty } {h_{n} \left( {\tau_{1} , \ldots ,\tau_{{_{n} }} } \right)\prod\limits_{i = 1}^{n} u \left( {t - \tau_{i} } \right)\text{d}\tau_{i} } } $$
(1)

where \(u(t)\) and \(y(t)\) are the input and output, respectively; \(h_{n} (\tau_{1} , \cdots ,\tau_{n} )\) is the nth Volterra kernel functions of the nonlinear system; N denotes the maximum order; \(\tau\) is the time delay.

In the frequency domain, \(U(j\omega_{i} )\) is the Fourier transform of \(u(t)\). Then, the Fourier transform of \(u^{n} (t)\) is expressed as

$$ U_{n} (j\omega ) = \frac{1/\sqrt n }{{(2\pi )^{n - 1} }} \times \int_{{\omega_{1} + \cdot \cdot \cdot + \omega_{n} = \omega }} {\prod\limits_{i = 1}^{n} {U(j\omega_{i} )\text{d}\sigma_{n\omega } } } $$
(2)

The expression for the output frequency response of the nonlinear system under general excitation is

$$ Y(j\omega ) = \sum\limits_{n = 1}^{N} {Y_{n} (j\omega )} \, $$
(3)

where \(Y(j\omega )\) represents the Fourier transform of output signal; \(Y_{n} (j\omega )\) denotes the nth-order output frequency response of the nonlinear system and is defined by

$$ Y_{n} (j\omega ) = \frac{1/\sqrt n }{{(2\pi )^{n - 1} }} \times \int_{{\omega_{1} + \cdot \cdot \cdot + \omega_{n} = \omega }} {H_{n} (j\omega_{1} , \cdot \cdot \cdot ,j\omega_{n} )} \prod\limits_{i = 1}^{n} {U(j\omega_{i} )\text{d}\sigma_{n\omega } } $$
(4)

where \(\int_{{\omega_{1} + \cdot \cdot \cdot + \omega_{n} = \omega }} {H_{n} (j\omega_{1} , \cdot \cdot \cdot ,j\omega_{n} )} \prod\limits_{i = 1}^{n} {U(j\omega_{i} )\text{d}\sigma_{n\omega } }\) denotes the integral of \(H_{n} (j\omega_{1} , \cdots ,j\omega_{n} )\prod\nolimits_{i = 1}^{n} {U(j\omega_{i} )}\) under the condition of \(\omega_{1} + \cdots + \omega_{n} = \omega\) in the n-dimensional hyperplane. \(H_{n} (j\omega_{1} , \cdots ,j\omega_{n} )\) is the Fourier transform of \(h_{n} (j\omega_{1} , \cdots ,j\omega_{n} )\) and denotes the nth GFRFs of the nonlinear systems. Its expression is

$$ H_{n} \left( {j\omega_{1} , \ldots ,j\omega_{n} } \right) = \int_{ - \infty }^{\infty } \ldots \int_{ - \infty }^{\infty } {h_{n} } \left( {\tau_{1} , \ldots ,\tau_{n} } \right) \times e^{{ - \left( {\omega_{1} \tau_{1} + \cdots + \omega_{n} \tau_{n} } \right)j}} \text{d}\tau_{1} \ldots \text{d}\tau_{n} $$
(5)

Under the condition that \(U_{n} (j\omega ) \ne 0\), the NOFRFs are defined as

$$ G_{n} (j\omega ) = \frac{{\int_{{\omega_{1} + \cdots + \omega_{n} = \omega }} {H_{n} (j\omega_{1} , \cdots ,j\omega_{n} )} \prod\limits_{i = 1}^{n} {U(j\omega_{i} )} \text{d}\sigma_{n\omega } }}{{\int_{{\omega_{1} + \cdots + \omega_{n} = \omega }} {\prod\limits_{i = 1}^{n} {U(j\omega_{i} )} } \text{d}\sigma_{n\omega } }} $$
(6)

Substituting Eq. (6) into Eq. (4), the output spectrum of the nth nonlinear system can be represented as

$$ \begin{gathered} Y_{n} (j\omega ) = \frac{{\int_{{\omega_{1} + \cdots + \omega_{n} = \omega }} {H_{n} (j\omega_{1} , \cdots ,j\omega_{n} )} \prod\limits_{i = 1}^{n} {U(j\omega_{i} )} \text{d}\sigma_{n\omega } }}{{\int_{{\omega_{1} + \cdots + \omega_{n} = \omega }} {\prod\limits_{i = 1}^{n} {U(j\omega_{i} )} } \text{d}\sigma_{n\omega } }} \times \frac{1/\sqrt n }{{(2\pi )^{n - 1} }}\int_{{\omega_{1} + \cdot \cdot \cdot + \omega_{n} = \omega }} {\prod\limits_{i = 1}^{n} {U(j\omega_{i} )\text{d}\sigma_{n\omega } } } \hfill \\ \, = G_{n} (j\omega )U_{n} (j\omega ) \hfill \\ \end{gathered} $$
(7)

So the relationship between the input and output in the frequency domain can be rewritten by

$$ Y(j\omega ) = \sum\limits_{n = 1}^{N} {Y_{n} (j\omega )} = \sum\limits_{n = 1}^{N} {G_{n} (j\omega )} U_{n} (j\omega ) $$
(8)

Considering that the first four-order output frequency response function of a nonlinear system can almost characterize its most nonlinear characteristics, only the first four orders of NOFRFs are calculated so as to simplify the calculation process and improve calculation efficiency [15].

Therefore, Eq. (8) can be further expressed as

$$ Y(j\omega ) = G_{1} (j\omega )U_{1} (j\omega ) + G_{3} (j\omega )U_{3} (j\omega ) $$
(9)
$$ Y(j2\omega ) = G_{2} (j2\omega )U_{2} (j2\omega ) + G_{4} (j2\omega )U_{4} (j2\omega ) $$
(10)
$$ Y(j3\omega ) = G_{3} (j3\omega )U_{3} (j3\omega ) $$
(11)
$$ Y(j4\omega ) = G_{4} (j4\omega )U_{4} (j4\omega ) $$
(12)

Through the analysis of Eqs. (9-12), it is found that when a nonlinear system is excited by two harmonic signals with different amplitudes but the same frequency, its first four orders of NOFRFs can be obtained by using the least square method.

When two different sinusoidal signals with different amplitudes but the same frequency are employed to excite a rotor system, the input spectra of the system are, respectively, expressed by \(A_{i}^{(1)} (j\omega_{F} )\) and \(A_{i}^{(2)} (j\omega_{F} )\), \(i = 1,2,3,4\). And the corresponding output spectra of the system \(Y^{(1)} (j\omega_{F} )\) and \(Y^{(2)} (j\omega_{F} )\) are expressed as

$$ \left( {\begin{array}{*{20}c} {Y^{(1)} (j\omega_{F} )} \\ {Y^{(2)} (j\omega_{F} )} \\ \end{array} } \right){ = }\left( {\begin{array}{*{20}c} {A_{1}^{(1)} (j\omega_{F} )} & {A_{3}^{(1)} (j\omega_{F} )} \\ {A_{1}^{(2)} (j\omega_{F} )} & {A_{3}^{(2)} (j\omega_{F} )} \\ \end{array} } \right)\left( {\begin{array}{*{20}c} {G_{{^{1} }} (j\omega_{F} )} \\ {G_{{^{3} }} (j\omega_{F} )} \\ \end{array} } \right) $$
(13)

\(G_{1} (j\omega_{F} )\) and \(G_{3} (j\omega_{F} )\) can be calculated by

$$ \left( {\begin{array}{*{20}c} {G_{1} (j\omega_{F} )} \\ {G_{3} (j\omega_{F} )} \\ \end{array} } \right){ = }\left( {\begin{array}{*{20}c} {A_{1}^{(1)} (j\omega_{F} )} & {A_{3}^{(1)} (j\omega_{F} )} \\ {A_{1}^{(2)} (j\omega_{F} )} & {A_{3}^{(2)} (j\omega_{F} )} \\ \end{array} } \right)^{ - 1} \left( {\begin{array}{*{20}c} {Y^{(1)} (j\omega_{F} )} \\ {Y^{(2)} (j\omega_{F} )} \\ \end{array} } \right) $$
(14)

Similarly, the other orders of NOFRFs can be obtained.

2.2 NWKL detection method

Based on the concepts of contribution rate and feature weighting, Liu [22,23,24,25] integrated them with the NOFRFs and proposed the method of NOFRFs weighted contribution rate, which is defined by

$$ Rn(n)(\rho ) = \frac{{\int_{ - \infty }^{ + \infty } { \, \left| {\frac{{G_{n} (j\omega )}}{{n^{\rho } }}} \right|\text{d}\omega } }}{{\sum\nolimits_{i = 1}^{N} {\int_{ - \infty }^{ + \infty } {\left| {\frac{{G_{i} (j\omega )}}{{i^{\rho } }}} \right|\text{d}\omega } } }}, \, 1 \le n \le N, \, \rho \in ( - \infty , + \infty ) $$
(15)

In the definition of the weighted contribution rate, the weighted coefficient \(n^{\rho }\) is applied to increase the contribution rate of high-order NOFRFs for the nonlinear system, which solves the problem of small values of high-order NOFRFs, as well as that of NOFRFs-based indexes.

In the fault diagnosis of the rotor system, it was found through deep research that the second-order weighted contribution rate can better indicate the severity of the failure. And its expression is

$$ Rn2(\rho ) = \frac{{\int_{ - \infty }^{ + \infty } {\left| {\frac{{G_{2} (j\omega )}}{{2^{\rho } }}} \right|\text{d}\omega } }}{{\sum\nolimits_{i = 1}^{N} {\int_{ - \infty }^{ + \infty } {\left| {\frac{{G_{i} (j\omega )}}{{i^{\rho } }}} \right|\text{d}\omega } } }}\; $$
(16)

For the curve of \(Rn2(\rho )\) changing with \(\rho\), the abscissa of the curve at the peak is named as the optimal fitness factor \(\rho_{\max }\). And the corresponding weighted contribution rate at \(\rho = \rho_{\max }\) is defined as the second-order optimal weighted contribution rate, the equation of which is as

$$ Rm = \frac{{\int_{ - \infty }^{ + \infty } {\left| {\frac{{G_{2} (j\omega )}}{{2^{{\rho_{\max } }} }}} \right|\text{d}omega } }}{{\sum\nolimits_{i = 1}^{N} {\int_{ - \infty }^{ + \infty } {\left| {\frac{{G_{i} (j\omega )}}{{i^{{\rho_{\max } }} }}} \right|\text{d}\omega } } }}\; $$
(17)

In the rotor experiment, the severity of the rub-impact fault can be distinguished by Rm. This NOFRFs-based index only considers the second-order weighted contribution rate while ignoring the influence of the other orders. So it does not fully reflect the degree of nonlinearities of the system. Moreover, it is found that as fault severity increases, the values of the Rm do not change significantly, and its sensitivity to faults is poor. And this index is easily affected by the friction coefficient and stiffness of the system.

Therefore, in order to construct a new index that can comprehensively reveal the nonlinearity of the system and better distinguish the severity of faults, as well as having better robustness, the concept of KL divergence is introduced to the NOFRFs weighted contribution rate method.

For a random variable, the KL divergence is used to distinguish its two continuous probability density distributions \(f(x_{i} ),i = 1, \cdots ,N\) and \(g(x_{i} ),i = 1, \cdots ,N\). And the definition of the KL divergence is defined by

$$ D_{KL} (f\parallel g) = \sum\limits_{i = 1}^{N} {f(x_{i} )\log \frac{{f(x_{i} )}}{{g(x_{i} )}}} $$
(18)

For example, the distribution difference of second-order weighted contribution rate, for healthy rotor system and fault rotor system, can be represented by their KL divergence. The difference between two distributions is larger, the KL divergence will be larger. Therefore, to better measure the distribution difference between the healthy rotor system and the detested-to-be rotor system with unknown fault conditions, an improved NOFRFs weighted contribution rate based on the KL divergence method (KLRn) is proposed, which is defined as

$$ \begin{gathered} KLRn(\rho ) = KLRn1(\rho ){ + } \cdots { + }KLRn{\text{N}}(\rho ) \hfill \\ { = }\sum\limits_{n = 1}^{N} {\left| {Rn_{h} (n)(\rho )\log \frac{{Rn_{h} (n)(\rho )}}{{Rn_{t} (n)(\rho )}}} \right|} ,1 \le n \le N \hfill \\ \end{gathered} $$
(19)

In Eq. (19), \(Rn_{h} (n)(\rho )\) denotes the distribution of the nth-order weighted contribution rate changing with \(\rho\) of a healthy rotor system, while \(Rn_{t} (n)(\rho )\) represents the distribution of the nth-order weighted contribution rate changing with \(\rho \) of the detested-to-be rotor system.

When the detested-to-be rotor system is a healthy rotor system, \(Rn_{t} (n)(\rho )\) is almost equal to \(Rn_{h} (n)(\rho )\) so \(\log \frac{{Rn_{h} (n)(\rho )}}{{Rn_{t} (n)(\rho )}}\) is close to zero. Therefore, \(KLRn(\rho )\) is approximately coincident with the x-axis.

If the rotor system to be tested is a faulty rotor system, for a specific \(\rho = \rho_{0}\), the values of \(KLRn(\rho_{0} )\) gradually increase with the fault of the rotor system becoming more serious, namely, \(KLRn(\rho )\) is proportional to the system’s nonlinearity. Therefore, this approach can comprehensively represent the nonlinear characteristics of a nonlinear system by fusing all orders of NOFRFs weighted contribution rate.

3 Simulation analysis

3.1 Finite element model of the rotor system

Jeffcott rotor system with a rub-impact fault between the rotor and the stator was adopted for simulation analysis. The shaft is divided into 11 shaft segments by 12 nodes. The specific parameters of segments are given in Table 1. Each shaft segment is modeled by the Timoshenko beam. The rub-impact rotor system is simplified by assuming that the rubbing between the rotor and the stator is a local elastic collision and the deformation is elastic deformation. The finite element model of the rub-impact rotor system studied in this paper is illustrated in Fig. 1. The shaft segments are denoted as the numbers, dots as the black marks. And the rub-impact occurs at the red mark (at node 9).

Table 1 Specific parameters of the segments
Fig. 1
figure 1

Finite element model of the rub-impact rotor system

In order to ensure that the disk and the elastic rod meet the contact conditions and do not penetrate each other during the rubbing process, the Kuhn–Tucker condition of Coulomb friction is employed to detect the contact between the rotor and the stator. Contact constraint conditions are handled by the augmented Lagrangian method of friction. When a rub-impact fault occurs, the friction torque caused by it is ignored and only the normal and tangential rubbing forces, \(P_{n}\) and \(P_{t}\), are considered. Their specific expressions are as follows.

$$ P_{n} = \left\{ {\begin{array}{*{20}c} {0,} & {~r < \delta _{0} } \\ {\left( {r - \delta _{0} } \right)k_{c} + \lambda _{r}^{{\left( {i + 1} \right)}} ,~~} & {~r < \delta _{0} } \\ \end{array} } \right.P_{t} = f_{c} P_{n}$$
(20)

where the specific expression of \(\lambda_{r}^{(i + 1)}\) is

$$ \lambda _{r}^{{\left( {i + 1} \right)}} = \left\{ {\begin{array}{*{20}c} {\lambda _{r}^{{\left( i \right)}} + k_{c} \delta _{r} ,} & {~\left| {\delta _{r} } \right| \ge \varepsilon } \\ {\lambda _{r}^{{\left( i \right)}} ,~} & {~\left| {\delta _{r} } \right| \ge \varepsilon } \\ \end{array} } \right. $$
(21)

where \(\varepsilon \) is the specific penetration tolerance; \(\delta_{r} = r - \delta_0\). In the iterative process, the contact stiffness of the contact unit is expanded through \(\lambda_{r}^{(i + 1)}\) until the penetration is less than the initial preset value.

The friction force of the rotor system can be expressed by

$$ \left\{ {\begin{array}{*{20}c} {F_{rubx} } \\ {F_{ruby} } \\ \end{array} } \right\} = - {\text{H}}\left( {\delta_{r} } \right)\delta_{r} \frac{{\delta_{r} k_{c} + \lambda_{r}^{{\left( {i + 1} \right)}} }}{r}\left( {\begin{array}{*{20}c} 1 & { - f_{c} } \\ {f_{c} } & 1 \\ \end{array} } \right)\left\{ {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right\} $$
(22)

where the Heaviside function is defined as

$$ H\left( {\delta_{r} } \right) = \left\{ {\begin{array}{*{20}c} {1, \delta_{r} \ge 0} \\ {0, \delta_{r} < 0} \\ \end{array} } \right. $$
(23)

Therefore, the control equation of the rubbing rotor system is described as

$$ \left\{ {\begin{array}{*{20}c} {M\mathop x\limits + \left( {C + G} \right)\dot{x} + Kx = F_{ex} + F_{rubx} } \\ {M\mathop y\limits + \left( {C + G} \right)\dot{y} + Ky = F_{ey} + F_{ruby} } \\ \end{array} } \right. $$
(24)

where M, C, G, and K represent mass matrix, damping matrix, gyro matrix, and stiffness matrix of the rotor system. And Fex and Fey are unbalanced force vectors of x and y directions.

The method of Newmark-\({\upbeta }\) is applied to solve Eq. (24). The simulation information is given in Table 2. When the rub-impact gap is 120 μm, 80 μm, 40 μm, respectively, that is, the rotor system is, respectively, in the condition of normal, slight, moderate, and severe rub-impact fault, the dynamic responses of the rotor system at node 8 along the y-direction at a rotational speed of 1500 rpm are as shown in Fig. 2. Figure 2a shows the time responses and Fig. 2b shows the frequency responses. The x-axis represents frequency and the y-axis represents the clearance of the rub-impact. And the z-axis represents the amplitude of the frequency response.

Table 2 Simulation parameters of the finite element model
Fig. 2
figure 2

Dynamic responses with three different rub-impact conditions at 1500 r/min (a-b)

It can be seen from Fig. 2a that when there is no rub-impact fault, that is, the rotor system is in normal working condition, the time domain waveform is a standard sine wave. As the rubbing clearance decreases from 120 μm to 40 μm, the standard sine wave increasingly deforms. The amplitude at the peak gradually increases and peak clipping at the trough is more sever. Figure 2b indicates that as the rub-impact clearance gradually decreases, the rub-impact fault renders super-harmonic components in the frequency domain response of the system. And the more serious the rub-impact fault degree is, the stronger nonlinearity of the system is so the larger the corresponding amplitude of the super-harmonic component is. Therefore, traditional linear methods are difficult to analyze characteristics of nonlinear systems.

3.2 NWKL for the detection of rub-impact

Based on the related theory of NWKL and the dynamic response analysis of the rub-impact rotor system, this section employs the method of NWKL to analyze the simulation model of the rub-impact rotor system by assuming that system’s nonlinearities can be fully represented when the maximum nonlinearity order N = 4. The working conditions of the simulation model are set to normal and the rub-impact fault with the rubbing clearance of 160 μm, 140 μm, 120 μm, 100 μm, 80 μm, 60 μm, 40 μm, and 20 μm, respectively. And vibration response signals at node 8 of the rotor system under different working conditions are obtained for subsequent analysis. The changing curves of the weighted contributed rate of the NOFRFs (Rn) with the adaptation factor \(\rho\) from -30 to 10 with a step of 1.0 under different rub-impact conditions are illustrated in Fig. 3.

Fig. 3
figure 3

Each order of Rn at different rub-impact conditions (a-d)

It is obvious that changing trends of curves Rn1-Rn4 are quite distinct. It can be seen from Fig. 3a that curve Rn1 monotonically increases in the range of [0,1]. And for a certain \(\rho\), the values of Rn1 under different conditions only slightly decrease as rub-impact faults are more sever. As shown in Fig. 3b, Rn2 increases from zero to the maximum value and then decreases to zero. And maximum values show an increasing trend along with the decrease of the rubbing clearance. However, the difference between the maximum value of Rn2 under the normal condition and the most sever rubbing fault is less than 0.05, which is a bit small. Figure 3c indicates that Rn3 follows the same changing trend with Rn2 but maximum values show a decreasing trend with the increase of the fault severity. It is obvious from Fig. 3d that curve Rn4 monotonically decreases in the range of [0,1]. And for a certain \(\rho\), the values of Rn4 only slightly increase with the severity of the rub-impact fault. But for the weakest rub-impact fault condition where the rub-impact gap is 160um, its value of Rn4 is less than that of the normal rotor system for a certain \(\rho\), which is undesirable. In reference [25], the maximum value of Rn2 is defined as a new index Rm for the detection of rub-impact. On this basis, the authors conducted the following in-depth research.

The concept of KL divergence was introduced to the NOFRFs weighted contribution rate. After obtaining values of Rn1-Rn4, the corresponding values of each order of the improved NOFRFs weighted contributed rate based on KL divergence (respectively, denoted by KLRn1, KLRn2, KLRn3 and KLRn4) were further calculated through Eq. (19). And the changing curves of KLRn1-KLRn4 with the adaptation factor \(\rho\) from -30 to 10 with a step of 1.0 under different rub-impact conditions are illustrated as Fig. 4.

Fig. 4
figure 4

Each order of KLRn at different rub-impact conditions (a-d)

In Fig. 4, the changing trend of KLRn1, KLRn2, KLRn3, and KLRn4 increases from zero to the maximum value and then decreases to zero. And maximum values of each order of KLRn show an increasing trend as the severity of the rub-impact fault increases. Under the same condition, the values of KLRn1, KLRn2, KLRn3 and KLRn4 are different because each order KLRn has different sensitivity to nonlinearities for a nonlinear system.

In order to comprehensively consider the influence of the system’s nonlinearity on KLRn1, KLRn2, KLRn3, and KLRn4, the value of KLRn is obtained by summing KLRn1-KLRn4. Figure 5 shows the changing curves of KLRn with the adaptation factor \(\rho\) under different rub-impact conditions.

Fig. 5
figure 5

Changing curve of KLRn under different rub-impact conditions

In Fig. 5, the changing trend of KLRn increases from zero to the maximum value and then decreases to zero with the adaptation factor \(\rho\) changing from −30 to 10. And maximum values of KLRn under different conditions show an increasing trend starting from zero as the severity of the rub-impact fault increases.

3.3 The index of KLRm

Based on the analysis in Sect. 3.2, for the curve of KLRn(ρ) changing with ρ, the abscissa of the curve at the peak is named as the optimal fitness factor ρmax. The corresponding maximum value of the curve at ρ = ρmax is defined as a new indicator named improved NOFRFs optimal weighted contribution rate based on KL divergence (KLRm), the equation of which is as

$$ KLRm = \max KLRn(\rho ) $$
(25)

For showing the advantages of the proposed index, values of KLRm under different rub-impact conditions and their related optimal adaptation factor (ρm2) are compared with the values of Rm and their related optimal adaptation factor (ρm1), as given in Table 3. And comparison charts of KLRm with Rm at different rubbing gaps and their increments (△KLRm and △Rm) are shown as Fig. 6.

Table 3 Comparison of KLRm and Rm
Fig. 6
figure 6

Comparison charts of KLRm with Rm (ab)

It is evident from Table 3 and Fig. 6 that values of Rm and KLRm both show an increasing trend as the severity of rubbing increases. And indexes Rm and KLRm can distinguish clearances of rub-impact so they can be employed to detect the rub-impact fault. However, values of KLRm start from zero and are larger than that of Rm under fault working conditions. Also, △KLRm is more obvious than △Rm under the same fault condition. Comparing optimal adaptation factors ρm1 and ρm2, it is seen that the change of ρm2 is smaller for different rubbing gaps. Therefore, the indicator KLRm is more sensitive and stable to the rub-impact fault of rotor systems.

3.4 Feature dynamic stability analysis

When the rub-impact occurs, friction conditions and rub-impact stiffness will significantly affect the dynamic responses of the rotor system [26]. Therefore, studying the influence of the friction coefficients and rub-impact stiffness on the proposed detection method is necessary. So authors proposed a new evaluation method named the feature dynamic stability analysis of the rubbing rotor system. This section will explore the influence of the change in the friction coefficient and rub-impact stiffness between the rotor and the stator on the values of KLRm.

In one case, the rub-impact stiffness is set to k = 1 × 104 N/m and the friction coefficients are c1 = 0.1, c2 = 0.2 and c3 = 0.3, respectively. The variation curves of KLRn and Rn with ρ under different conditions are shown in Figs.7, 8, 9. And the values of KLRm and Rm under different friction coefficients are shown in Fig. 10.

Fig. 7
figure 7

Comparison of KLRn and Rn curves when friction coefficient is 0.1 (a-b)

Fig. 8
figure 8

Comparison of KLRn and Rn curves when friction coefficient is 0.2 (a-b)

Fig. 9
figure 9

Comparison of KLRn and Rn curves when friction coefficient is 0.3 (a-b)

Fig. 10
figure 10

Comparison of KLRm and Rm under different friction coefficients (a-b)

By comprehensive comparison of Figs.7, 8, 9, it can be seen that when the rub-impact stiffness is constant and the friction coefficient changes, the trend of the KLRn and Rn curves is almost unchanged but the maximum of the curve changes slightly. As shown in Fig. 10, in contrast, for the same type of rub-impact clearance, the change of the index KLRm is significantly smaller than that of Rm when the friction coefficient is different.

In another case, the friction coefficient is set to c = 0.1 and the rub-impact stiffness is k1 = 1 × 104 N/m, k2 = 2 × 104 N/m and k3 = 3 × 104 N/m, respectively. The values of KLRm and Rm under different rub-impact stiffness are shown in Fig. 11.

Fig. 11
figure 11

Comparison of KLRm and Rm under different rub-impact stiffness (a-b)

Similarly, in Fig. 11, when the friction coefficient is constant and the rub-impact stiffness changes, the change of the index KLRm is also significantly smaller than that of Rm for the same type of rub-impact clearance.

Therefore, it is concluded that, compared to the method of NOFRFs weighted contribution rate, the improved NOFRFs weighted contribution rate method based on KL divergence has more robust feature dynamic stability.

3.5 Fault detection procedure

Based on the previous introduction, the method of improved NOFRFs weighted contribution rate based on the KL divergence for the rotor system’s rub-impact fault detection procedure is as follows.

  • Step 1: Excite a healthy or a faulty rotor system twice using the signals with different amplitudes but the same frequency and collect the input and output signals.

  • Step 2: Calculate the first four orders of NOFRFs, \(G_{n} (jw),n = 1, \cdots ,4\), through Eq. (9)—Eq. (12) from the data obtained in Step 1.

  • Step 3: Calculate the distribution of the NOFRFs weighted contribution rate (Rn) with the change of \(\rho\) through Eq. (16).

  • Step 4: Calculate the distribution of the improved NOFRFs weighted contribution rate based on KL divergence (KLRn) with the change of \(\rho\) through Eq. (19).

  • Step 5: Calculate the value of the proposed index (KLRm) through Eq. (25) and then determine the condition of the system qualitatively and quantitatively.

4 Experimental verification

The experimental test bench is illustrated in Fig. 12. The eccentric quality of the disk can be changed by arranging different numbers of bolts in even bolt holes. And the rotor system is excited twice with the same frequency but different intensities, namely the same speed but two different unbalances. Different working conditions (normal, weak rub-impact, moderate rub-impact, serious rub-impact, and very serious rub-impact) are implemented by changing the tightening amount of the rub screw. The vibration signals of the rotor system at 1600 rpm under healthy and four different rubbing fault conditions are measured by the eddy current sensor.

Fig. 12
figure 12

Experimental test bench of rub-impact rotor system

Then, each order of the improved NOFRFs weighted contributed rate (KLRn1, KLRn2, KLRn3, KLRn4) with the adaptation factor \(\rho \) from -40 to 10 with a step of 1.0 under the healthy and the fault conditions are, respectively, calculated. The results are illustrated in Fig. 13.

Fig. 13
figure 13

Each order of KLRn at different rub-impact conditions (a-d)

It can be found from Fig. 13 that the experimental results are similar to the simulation results in Fig. 4. Each order of improved weighted contribution rate, KLRn1-KLRn4, generally increases first and then decreases. And the peaks of the curves are proportional to the severity of the rubbing fault.

As shown in Fig. 14, it is a comparison of the improved weighted contribution rate, KLRn, and the second-order weighted contribution rate, Rn, with the change of ρ. The curves of KLRn and Rn also have a similar trend with that of KLRn1-KLRn4.

Fig. 14
figure 14

Comparison of KLRn and Rn at different rub-impact conditions (a-b)

Figure 15 shows a comparison chart of the maximum value of the indexes, KLRm and Rm. The values of Rm increase from 0.16145 to 0.52287 and the values of KLRm increase from 0 to 1.13811. Obviously, as the degree of rubbing increases, the change of KLRm is significantly greater than that of Rm, indicating that the sensitivity of the index KLRm to rotor rubbing faults is greater than Rm, which is consistent with the simulation results. Therefore, the improved NOFRFs optimal weighted contribution rate based on KL divergence (KLRm) is superior to the second-order optimal weighted contribution rate (Rm) in diagnosing the rubbing fault.

Fig. 15
figure 15

Comparison of KLRm and Rm at different rub-impact conditions (a-b)

5 Conclusion

A method for detecting the rubbing fault of a rotor system was presented in this paper. Based on the method of NOFRFs weighted contribution rate, the KL divergence was introduced and a novel method named improved NOFRFs weighted contribution rate based on KL divergence (NWKL) has been proposed. A new index was also proposed to diagnose the rub-impact fault severity of the rotor system, which is named the improved NOFRFs optimal weighted contribution rate based on KL divergence (KLRm). Furthermore, an evaluation method named feature dynamic stability analysis was proposed to explore the effects of the friction coefficients and rub-impact stiffness on associated indexes. It provides a new research idea for the rubbing fault of the rotor system.

The simulation and experimental results have demonstrated the effectiveness and better superiority of the improved method. Compared with the second-order optimal weighted contribution rate (Rm), the newly proposed index (KLRm) has higher sensitivity and more robust feature dynamic stability to rub-impact faults of the rotor system. The proposed new approach can comprehensively represent the nonlinear characteristics of a nonlinear system by fusing four orders of NOFRFs weighted contribution rate. It shows a great application prospect in structural monitoring and fault diagnosis.

Future works will be concerned with the influence of the system nonlinearities on each order of improved NOFRFs weighted contribution rate based on KL divergence. The effectiveness of the proposed index in detecting other fault types of the rotor system needs to be further researched, as well as faults of other nonlinear engineering structures.