1 Introduction

Over the past decade, robotic machining has gained increasingly attention because of its high flexibility and large working envelope [1, 2]. For instance, in the aviation manufacturing, industrial robots have shown great application prospects, which can efficiently accomplish drilling works of thousands of aircraft intersection holes [3]. However, the application of robotic machining is still very limited in actual production at present. Compared with the conventional machine tools, serial robot manipulators are more likely to chatter due to the lower stiffness [4, 5]. It is known that chatter brings various negative impacts on the machining process, including accelerating the tool wear, lowering the machining quality, and even affecting the safety of the robot. For these reasons, chatter has been a major obstacle to realize stable and efficient robotic drilling [6]. To avoid the detriments caused by the chatter, researches on the stability analysis, detection and suppression of such detrimental instability have become a heated topic in academia and industry [7,8,9,10,11,12,13,14,15].

So far, researchers have conducted in-depth researches on chatter suppression and proposed many effective suppression methods, including selection of chatter-free cutting parameters, variable spindle speed, passive chatter control, and active chatter control. Theoretically speaking, the occurrence of chatter can be prevented by adopting appropriate cutting parameters based on the stability lobe diagrams [15]. However, due to the heterogeneous stiffness of robot within its working envelope, it is difficult to guarantee the robotic drilling process without chatter by the selected fixed machining parameters [4]. Fortunately, there is a certain time interval from chatter onset to mature chatter, and accurate detection and timely suppression provide an alternative feasible approach to avoid the hazards of chatter. If chatter can be identified accurately at an early stage, there will be enough time to implement appropriate measures to suppress its further development. Hence, accurate and timely detection of chatter is of vital significance to avoid the adverse effects of chatter on the machining system.

In order to automatically detect chatter, various chatter-related signals can be acquired, including vibration signal [16,17,18,19,20], torque signal [21, 22], motor current signal [23,24,25], and sound signal [26,27,28]. In industrial applications, vibration signal stands out from these signals owing to its information usability, low cost, and easy access [29], which has been widely adopted for chatter detection. Chatter vibration is a typical kind of self-excited vibration between the cutter and the workpiece. Due to the regenerative effect, the occurrence of chatter will usually cause changes in energy distribution and frequency components. However, during the early stage of chatter development, the chatter features are extremely weak. Consequently, it is necessary to choose an appropriate signal processing tool to extract the required chatter features from the signal. In the last decades, time domain analysis and spectral analysis methods have been employed to detect chatter [30, 31]. In view of the nonlinear and nonstationary properties of chatter signal, time-frequency analysis (TFA) methods that preserve both time-domain and frequency-domain information have gradually become the prior choice approaches for chatter signal analysis. So far, short-time Fourier transform (STFT), wavelet transform (WT), wavelet packet transform (WPT), and empirical mode decomposition (EMD) have been used for chatter detection [32,33,34,35]. For instance, Tangjitsitcharoen et al. [34] adopted wavelet transform to process the cutting forces signals in ball end milling process for chatter detection. In [23], WPT was utilized to monitor the energy change of motor current signal to identify chatter in gear grinding process. Liu et al. [35] investigated a milling chatter identification method, in which EMD and wavelet packet decomposition (WPD) was combined to eliminate the influence of modal aliasing. Although these methods based on the conventional TFA methods achieved certain effects, they are without exception limited by the Heisenberg uncertainty principle, which stipulates that a signal cannot be localized with an arbitrary precision both in time and frequency.

To improve the performance of classical TFA methods, researchers have proposed some advanced methods in the past few years. The reassignment method (RM) is a kind of high-resolution TFA methods which aims to sharpen the time-frequency (TF) representation while keeping the temporal localization, but it is very limited in engineering applications because it cannot reconstruct the signal [36]. To overcome these drawbacks, Daubechies et al. [37] developed the synchrosqueezing transform (SST) which can achieve high-resolution TF representation and meanwhile retain the signal reconstruction ability. After that, many improved methods have been proposed, e.g., synchroextracting transform (SET) [38], demodulated SST (DSST) [39], and high-order SST [40]. These advanced TFA methods have been utilized for chatter detection and achieved better performance. For instance, Cao et al. [26] introduced a chatter detection approach in milling process based on SST of sound signals. Also, Tao et al. [41] employed the SET to analyze the vibration signal and realized the early chatter identification in robotic drilling process. However, the SST is very sensitive to signal noise and has poor noise robustness, while SET may suffer large reconstruction errors when processing strong nonstationary signals due to lack of perfect signal reconstruction. More recently, Yu et al. [42] proposed a novel TFA method termed multi-synchrosqueezing transform (MSST), which concentrated the energy of the TF result by iteratively employing multiple SST operations. Compared with the other methods mentioned above, MSST can obtain higher resolution TF representation when dealing with the strongly time-varying signals, and meanwhile retain excellent signal reconstruction ability. Therefore, MSST is an ideal signal processing tool for chatter detection.

During the robotic drilling process, the energy distribution and frequency components of vibration signal changes with the machining state. Under the stable machining state, rotation frequency and its harmonics are dominant. When the machining state transitions to chatter, new dominant frequency components will appear and the energy will be transferred to the chatter frequency gradually [41]. Hence, the change of energy distribution can be used to characterize chatter. To quantify the change of the system energy distribution, energy entropy is introduced as the chatter indicator. It is worth noting that chatter frequency is easy to be merged by rotation frequency and its harmonics in the early stages of chatter [43]. Therefore, it is desperately needed to eliminate the disturbance of rotational-related frequencies to the measured signal. Based on the analysis above, this paper develops a novel approach for chatter detection based on MSST and energy entropy, in which weak chatter features during early development stage can be keenly captured. The remainder of this paper is structured as follows. In the next section, the proposed chatter detection method is developed in detail. Section 3 presents the experimental verification for the proposed method, including the experimental setup, the experimental results, and the analysis. Finally, the conclusion is drawn in Sect. 4.

2 Multi-synchrosqueezing-based method

2.1 Matrix notch filter

It is well recognized that the spindle rotating frequency and its harmonics always exist throughout the machining process. In particular, during the early stage of chatter, because the forced vibration is dominant, the rotational-related frequencies still prevail while the chatter components are exceedingly weak. If the weak chatter frequencies are submerged by the rotational-related frequencies, the chatter indicator extracted from the vibration signal may have no distinct change, resulting in a time lag in the chatter detection. Consequently, the rotational-related components should be discarded to ensure that the chatter can be recognized as early as possible.

As an effective filtering tool, notch filter is capable of rejecting just one specific frequency from the signal. It is typically adopted to eliminate the sinusoidal or narrowband interference while keeping the broadband signal unchanged. Moreover, considering that the data used for analysis in the chatter detection belongs to the short data records, the selected method should have good filtering performance for short data. However, when processing short data records, the conventional IIR notch filters show unsatisfactory performance due to the occurrence of the transient state while the FIR notch filters have higher orders under the same requirement [44]. To achieve better transient performance when filtering short data records, the key to designing the notch filter is obtaining the transfer function with frequency response as close as possible to the ideal notch filter [45]. The specific design algorithm for the optimal matrix notch filter is described below.

Define \( {\mathbf{H}}_{\omega_s} \) as a matrix notch filter with the notch frequency ωs. Let x = [x(0), x(1),…, x(N-1)]T be input signal to the matrix notch filter whose output is y = [y(0), y(1),…, y(N-1)]T. Then, the filtering operation can be expressed as \( \mathbf{y}={\mathbf{H}}_{\omega_s}\mathbf{x} \). Define a vector of the complex exponentials ν(ω) as ν(ω) = [1, e, ej2ω,…, ej(N-1)ω]T , and the output of the ideal matrix notch filter should satisfy

$$ {\mathbf{H}}_{\omega_s}\boldsymbol{v}\left(\omega \right)=\left\{\begin{array}{cc}0& \omega ={\omega}_s\\ {}\boldsymbol{v}\left(\omega \right)& \omega \ne {\omega}_s\end{array}\right. $$
(1)

The passband of the notch filter is P = [0, ωs-ε]∪[ωs+ε, π], where ε is a small positive number. By inserting K uniformly spaced points in the passband P, one can obtain

$$ {\omega}_i=\frac{\pi -2\varepsilon }{K},i=1,2,\dots, K $$
(2)

where ωi are uniform dense grid frequency points in the passband P.

Substitute ωi and ωs in the vector ν(ω) and then split ν(ω) into the real part νR(ω) and the imaginary part νI(ω). Define two matrices as follows

$$ \mathrm{V}=\left[{\boldsymbol{\upnu}}_R,{\boldsymbol{\upnu}}_I\right] $$
(3)
$$ \mathrm{W}=\left[{\boldsymbol{\upnu}}_R\left({\omega}_S\right),{\boldsymbol{\upnu}}_I\left({\omega}_S\right)\right] $$
(4)

where VR = [νR(ω1), νR(ω2),…, νR(ωK)] and VI = [νI(ω1), νI(ω2),…, νI(ωK)].

The ideal notch filter \( {\mathbf{H}}_{\omega_s} \) should meet the conditions \( {\mathbf{H}}_{\omega_s}\mathbf{V}=\mathbf{V} \) and \( {\mathbf{H}}_{\omega_s}\mathbf{W}=0 \). Let hl and νl denote the lth column vectors of matrices \( {\left({\mathbf{H}}_{\omega_s}\right)}^T \)and VT, respectively. According to the least squares method, the hl is determined by minimizing the cost function

$$ J={\left\Vert {\mathrm{V}}^T{\mathrm{h}}_l-{\mathrm{h}}_l\right\Vert}^2 $$
(5)

whose constraint is Whl = 0. This optimization problem can be solved by Lagrange multiplier method, and its closed form solution is

$$ {\mathbf{h}}_l={\mathbf{Q}}^{-1}{\mathbf{V}\boldsymbol{\upnu}}_l-{\mathbf{Q}}^{-1}{\mathbf{W}}^T{\left({\mathbf{W}\mathbf{Q}}^{-1}{\mathbf{W}}^T\right)}^{-1}{\mathbf{W}\mathbf{Q}}^{-1}{\boldsymbol{\upnu}}_l $$
(6)

where Q = (VVT)-1. Compose all column vectors hl column by column and the optimal matrix notch filter can be finally written as

$$ {\mathbf{H}}_{\omega_s}=\mathbf{I}-\mathbf{W}{\left({\mathbf{W}}^T\mathbf{QW}\right)}^{-1}{\mathbf{W}}^T{\mathbf{Q}}^T $$
(7)

2.2 Multi-synchrosqueezing transform

The essence of MSST is to iteratively execute multiple SST operations. For clear description of the MSST principle, SST is reviewed briefly at first. SST is a post-processing procedure on the conventional TFA methods and it is introduced based on an STFT framework in this section. The STFT of the signal x(t) is represented as

$$ {F}_x\left(\tau, \omega \right)={\int}_{-\infty}^{\infty }x(t)g\left(t-\tau \right){e}^{- j\omega \left(t-\tau \right)} dt $$
(8)

where g(t) is the compactly supported window function. The regular definition of STFT differs from Eq. (8) by a phase shift factor ejωt. Define x(t) as a mono-component signal, i.e., x(t)=A(t)e(t). According to Parseval’s theorem, the STFT of x(t) is given by

$$ {F}_x\left(\tau, \omega \right)=A(t)\cdot \hat{g}\left(\omega -\phi \hbox{'}\left(\tau \right)\right)\cdot {e}^{j\phi \left(\tau \right)} $$
(9)

where \( \hat{g}\left(\omega -\phi \hbox{'}\left(\tau \right)\right) \) denotes the Fourier transform of the window function. If Fx(τ, ω) ≠ 0, the instantaneous frequency estimate \( \overset{\sim }{\omega}\left(\tau, \omega \right) \) for the STFT result can be approximated by

$$ \tilde{\omega}\left(\tau, \omega \right)=-j\frac{\partial_t{F}_x\left(\tau, \omega \right)}{F_x\left(\tau, \omega \right)} $$
(10)

where ∂tFx (τ, ω) means the partial derivative of the variable t, namely, ∂tFx (τ, ω) = ∂Fx (τ, ω)/∂t.

The SST employs a frequency reallocation operator to gather all FT coefficients with the same instantaneous frequency, which is expressed as

$$ TF\left(\tau, \psi \right)={\int}_{-\infty}^{+\infty }{F}_x\left(\tau, \omega \right)\cdot \delta \left(\psi -\tilde{\omega}\left(\tau, \omega \right)\right) d\omega $$
(11)

where δ denotes the Dirac function.

The basic idea of MSST is to repeatedly apply another SST operation to the already acquired SST result to obtain a much sharper TF result than previous result. Through multiple iterations of SST, the energy of the TF representation can be concentrated in a step-wise manner. Assuming that the iteration times is N (N ≥ 2), MSST can be formulated as

$$ {\displaystyle \begin{array}{c}{TF}^{\left[2\right]}\left(\tau, \psi \right)={\int}_{-\infty}^{+\infty }{TF}^{\left[1\right]}\left(\tau, \omega \right)\cdot \delta \left(\psi -\tilde{\omega}\left(\tau, \omega \right)\right) d\omega \\ {}{TF}^{\left[3\right]}\left(\tau, \psi \right)={\int}_{-\infty}^{+\infty }{TF}^{\left[2\right]}\left(\tau, \omega \right)\cdot \delta \left(\psi -\tilde{\omega}\left(\tau, \omega \right)\right) d\omega \\ {}\vdots \\ {}{TF}^{\left[N\right]}\left(\tau, \psi \right)={\int}_{-\infty}^{+\infty }{TF}^{\left[N-1\right]}\left(\tau, \omega \right)\cdot \delta \left(\psi -\tilde{\omega}\left(\tau, \omega \right)\right) d\omega \end{array}} $$
(12)

where TF(τ, ψ) in Eq. (11) is denoted by TF [1](τ, ω). Combining Eq. (11) and Eq. (12), the formula of the MSST can be rewritten as

$$ {TF}^{\left[N\right]}\left(\tau, \psi \right)={\int}_{-\infty}^{+\infty }{F}_x\left(\tau, \omega \right)\cdot \delta \left(\psi -{\tilde{\omega}}^{\left[N\right]}\left(\tau, \omega \right)\right) d\omega $$
(13)

where \( {\tilde{\boldsymbol{\omega}}}^{\left[\boldsymbol{N}\right]}\left(\tau, \omega \right) \) denotes the instantaneous frequency estimate of MSST. It can be obtained by a recursive algorithm, namely

(14)

in which \( \overset{\sim }{\boldsymbol{\omega}}\left(\tau, \omega \right) \) can be calculated by Eq. (10).

MSST allows for the perfect signal reconstruction. Let ψk and ψk+1 be the lower and upper limits of the kth frequency band, and then the subsignal corresponding to the frequency band can be recovered as follows

$$ {x}_k(t)=\frac{1}{2\pi g(0)}{\int}_{\psi_k}^{\psi_{k+1}}{TF}^{\left[N\right]}\left(t,\psi \right) d\psi $$
(15)

2.3 The proposed detection algorithm

Chatter is a kind of self-excited vibration phenomenon generated by the robot when it absorbs the energy from periodic cutting forces during the drilling process. At the beginning of the chatter, chatter components are not sufficiently obvious and have the smaller amplitude than the rotational-related frequency. Consequently, in order to identify the chatter effectively, it is necessary to obtain the high-resolution TF representation and seek a sensitive indicator that can accurately reflect the energy distribution change of the measured signal. Meanwhile, the rotational-related components need to be removed to prevent them from submerging the chatter frequency before the chatter characteristic extraction.

On the basis of discussion above, a novel chatter detection approach based on MSST and energy entropy is proposed in this paper. The flowchart of the proposed scheme is illustrated in Fig. 1. As the basic work, the accelerometer is first installed to measure the vibration signals during the robotic drilling process. Based on the measured vibration signals, the chatter identification algorithm is developed and it is implemented in four steps. Firstly, the rotational-related frequencies are filtered out from the measured vibration signal by using the designed optimal matrix notch filters. Secondly, the MSST is utilized to address the filtered signal and acquire the concentrated TF representation. Then, the entire TF representation is divided into several equally spaced frequency bands and the reconstruction operation is performed to generate the subsignal in each frequency band. At last, the energy entropy of the whole signal is calculated in time for chatter detection.

Fig. 1
figure 1

The flowchart of the proposed algorithm for chatter detection

Specifically, in the first step of the proposed algorithm, matrix notch filters are designed to remove the rotation frequency and its harmonics. Suppose ωs1, ωs2,…, ωsk represent spindle rotating frequency and its harmonics, and \( {\mathbf{H}}_{\omega_{s1}},{\mathbf{H}}_{\omega_{s2}},\dots, {\mathbf{H}}_{\omega_{sk}} \) are the corresponding optimal matrix notch filters. Given the measured vibration signal si = [si(1), si(2),…, si(L)]T, the filtered output signal so = [so(1), so(2),…, so(L)]T can be obtained as

$$ {\mathbf{s}}_o={\mathbf{H}\mathbf{s}}_i=\left({\mathbf{H}}_{\omega_{sk}}\cdots {\mathbf{H}}_{\omega_{s2}}\cdot {\mathbf{H}}_{\omega_{s1}}\right){\mathbf{s}}_i $$
(16)

where \( {\mathbf{H}}_{\omega_{s1}},{\mathbf{H}}_{\omega_{s2}},\dots, {\mathbf{H}}_{\omega_{sk}} \) can be calculated by Eq. (7). To obtain the energy-concentrated TF results, the filtered signal so is analyzed by the MSST, which can reconstruct the signal perfectly. Considering that so is discrete, the discrete STFT is presented as

$$ {F}_x\left[d,m\right]=\sum \limits_{l=1}^L{\mathbf{s}}_o\left[l\right]g\left[l-d\right]{e}^{-j\frac{2\pi }{L}m\left[l-d\right]} $$
(17)

According to Eq. (10), the discrete form of instantaneous frequency estimate \( \hat{\boldsymbol{\omega}}\left[d,m\right] \) can be deduced as

$$ \hat{\omega}\left[d,m\right]\left\{\begin{array}{cc}\mathrm{Ro}\left[\operatorname{Re}\left[\frac{1}{2\pi j}\ln \left[\frac{F_{x\left[d+1,m\right]}}{F_{x\left[d,m\right]}}\right]\right]\right]&, {F}_x\left[d,m\right]\ne 0\\ {}0&, {F}_x\left[d,m\right]=0\end{array}\right. $$
(18)

where Re[] denotes taking real part. And Ro[] represents the round operation, which rounds the variable to the nearest integer. For instance, Ro[3.1] = 3 and Ro[3.8] = 4.

Then, according to Eq. (13), the TF distribution of so can be written as

$$ {TF}^{\left[N\right]}\left[d,\eta \right]=\sum \limits_{m=1}^L{F}_x\left[d,m\right]\delta \left[\eta -{\hat{\omega}}^{\left[N\right]}\left[d,m\right]\right] $$
(19)

where \( {\hat{\boldsymbol{\omega}}}^{\left[N\right]}\left[d,m\right] \) can be obtained by combining Eq. (14) and Eq. (18).

In theory, the chatter can be identified by observing whether there are significant chatter frequencies in the obtained high-resolution TF representation. However, this method cannot achieve the automatic chatter detection and it is difficult to identify the chatter at the early stage. Thus, the energy entropy is chosen as the chatter indicator to depict the energy distribution changes of the vibration signal. To calculate the energy entropy of the signal, it is necessary to divide the whole signal s(t) into a finite number of frequency bands and recover the subsignals corresponding to each frequency band. Let Δη be the bandwidth of each frequency band. According to the reconstruction formula Eq. (15), the corresponding subsignal can be expressed as

$$ {S}_k\left[d\right]\approx \frac{1}{2\pi g(0)}\sum \limits_{\eta \in \left[{\eta}_k,{\eta}_k+\varDelta \eta \right]}{TF}^{\left[N\right]}\left[d,\eta \right] $$
(20)

For each reconstructed subsignal, their energy is calculated as

$$ {E}_k=\sum \limits_{j=1}{s}_k{\left[{d}_j\right]}^2,\kern0.5em k=1,2,\cdots, n $$
(21)

Finally, the energy entropy MSSTE of the whole signal can be acquired as

$$ \mathrm{MSSTE}=\sum \limits_{k=1}^n{E}_k\ln \left({E}_k\right) $$
(22)

The energy entropy MSSTE is an indicator with good performance, which is quite sensitive to the chatter. With the occurrence of chatter, the energy of the system is transferred to the frequency band where chatter frequency is located, and the energy entropy MSSTE will dramatically increase. Accordingly, during the robotic drilling process, timely chatter detection can be perfectly performed by calculating the energy entropy of the vibration signal and comparing it with the chatter threshold in real time. If the energy entropy exceeds the threshold, it is indicated that chatter is detected and appropriate vibration suppression measures should be taken. If not, the robotic drilling process is stable. It can be seen that the chatter threshold plays a critical role in the proposed algorithm. In consideration of the versatility and effectiveness, the chatter threshold should be determined on the basis of numerous robotic drilling experiments.

3 Verification and comparisons

3.1 Experimental setup

In order to verify the actual effect of the proposed algorithm, a series of drilling experiments with different cutting conditions were carried out on a robot drilling machining experimental platform, as shown in Fig. 2. The platform consists mainly of a KUKA industrial robot, a drilling end actuator, the workpiece and the corresponding fixture, sensors and a data acquisition system. The drilling end actuator is mounted on the end of the robot through flanges, which is composed of a spindle module, a feed module, a visual measurement module, a normal detection module, and a pressure foot. The spindle module and the feed module are used to realize the drilling operation, and the visual measurement module provides the precise positioning of the reference hole. The strict verticality of the drilling hole is ensured through the normal detection module, by which the normal line of the workpiece can be measured accurately. To improve the stiffness of the system, a relatively large pressing force is applied to the workpiece by the pressure foot during the robotic drilling process.

Fig. 2
figure 2

The experimental platform for robotic drilling machining

To measure the vibration signals of the spindle of the robotic drilling process, we selected the PCB 356A24 accelerometer and attached it to the nonrotating part of the spindle closest to the tool with adhesive, as shown in Fig. 3. It is a triaxial, ceramic shear accelerometer and has a voltage output. The frequency range of PCB 356A24 is from 0.5 to 12,000 Hz, and the sensitivity of the X-, Y-, and Z-axes is 1.043 mv/m/s2, 1.035 mv/m/s2, and 1.064 mv/m/s2, respectively. Since the vibration signal of the X-axis is more sensitive to chatter than that of the Y-axis and the Z-axis (parallel to the feed direction), it is used for chatter detection. Taking the high-frequency properties of chatter components into account, the data acquisition system (the Crystal Instruments CoCo-80) collected acceleration signals at a sampling frequency of 10,240 Hz. Two different uncoated hard alloy steel drilling-countersinking tools with different geometrical shapes were used. Specifically, the tool A has a diameter of 5 mm, 130° point angle, and 100° countersink angle, while the tool B is characterized by the 6 mm diameter, 120° point angle, and 100° countersink angle.

Fig. 3
figure 3

Illustration of accelerometer installation

3.2 Results and comparisons

Robotic drilling experiments under different cutting parameters were performed to validate the proposed method. Two kinds of aerospace aluminum alloy workpiece, i.e., AL7075 and AL606, were used in the experiments. To avoid accidental factors affecting the results, all the experiments with the same machining parameters were repeated three times. The drilling mode was dry drilling without lubricants and coolants. To better observe the surface quality of holes, the drilling holes were all blind holes. Notice that whether chatter occurs during robotic drilling process is evaluated by combining the surface quality of the drilling holes, the amplitude, and the spectrum of the acceleration signal. The surface quality of chattered hole and normal hole is illustrated in Fig. 4. It is clearly seen that chatter left obvious vibration marks on the surface of the hole, while the normal hole has smooth surface.

Fig. 4
figure 4

The surface quality of chattered hole and normal hole. a Chatter occurs. b Without chatter

During the robotic drilling process, the proposed chatter detection algorithm is carried out every 25 ms. Time length of the sample signal for a calculation is 50 ms, which corresponds to 512 sample data at the sampling frequency of 10,240 Hz. The calculated energy entropy is compared with the threshold to determine whether chatter occurs. Once the energy entropy value exceeds the threshold, it means that chatter starts. The energy entropy at different time points constitutes the trend map of the energy entropy change. At the same time, to verify the effectiveness and superiority of the proposed method, comparisons with the synchroextracting-based method in [41] is conducted. It should be noted that all calculations were done with MATLAB and no library was involved.

The first group of robotic drilling experiments were conducted with tool A. The workpiece material is aluminum alloy AL7075. Figure 5 illustrates the experimental results at spindle speed 3000 rpm and feed rate 3.0 mm/s. The energy entropy change of vibration signal in the whole drilling process is shown in Fig. 5b. At the beginning, the energy entropy is relatively small and remains unchanged basically, which corresponds to the stable robotic drilling stage. It can also be demonstrated from Fig. 5a, where the amplitude of vibration is small during the corresponding period. After t = 0.198 s, the energy entropy begins to increase gradually and exceeds the preset threshold at the moment t1 = 0.470 s, which means that the chatter is detected by the proposed method. Then, the energy entropy continues to increase for a short time, but the growth rate slows down gradually. Eventually, the energy entropy stabilizes at a higher value. It can be found that the energy entropy increases dramatically and eventually far exceeds the initial value when the machining state transitions from stable to unstable. Therefore, the energy entropy can effectively characterize the machining state of the robotic drilling and is quite sensitive to the appearance of chatter. According to Fig. 5c, the relatively distinct chatter frequency emerges in the spectrogram of the MSST at about the moment t = 0.593 s. At the same time, the synchroextracting-based method detects chatter at the moment t2 = 0.554 s, which is 84 ms latter than the proposed method. As a consequence, the results demonstrate that the presented algorithm can effectively detect the chatter at an early stage during the robotic drilling process.

Fig. 5
figure 5

Chatter test with tool A at spindle speed 3000 rpm, feed rate 3.0 mm/s. Workpiece material AL7075. a The vibration signal. b The energy entropy. c Time-frequency spectrum of MSST

Figure 6 shows the chatter detection results for the vibration signal and the corresponding MSST spectrogram at spindle speed 4500 rpm and feed rate 3.2 mm/s. The workpiece material is also aluminum alloy AL7075. The trend map of the energy entropy change, e.g., Fig. 6b, illustrates that the energy entropy remains at a relatively low range and has a trend of slight decline before t = 0.108 s. As can be seen from Fig. 6a, the amplitude of the vibration is also small and relatively steady during the corresponding period, demonstrating that the robotic drilling process is stable at this moment. After t = 0.145 s, the energy entropy begins to rise and the rate of increase becomes faster. Then, the chatter is detected around t1 = 0.232 s by the proposed method and t2 = 0.243 s by the synchroextracting-based method. After a short period of rapid growth, the energy entropy begins to level off, indicating that the chatter has become mature. It verifies that the energy entropy can keenly depict the development of chatter. Meanwhile, according to Fig. 6c, it is at about the moment t = 0.256 s that the chatter frequency can be observed relatively clearly from the time-frequency spectrum of MSST. Consequently, the result verifies the effectiveness and superiority of the presented chatter detection method for robotic drilling operations.

Fig. 6
figure 6

Chatter test with tool A at spindle speed 4500 rpm, feed rate 3.2 mm/s. Workpiece material AL7075. a The vibration signal. b The energy entropy. c Time-frequency spectrum of MSST

Moreover, the effectiveness and universality of the proposed algorithm are further testified by more experiments with tool B and aluminum alloy 6061 under different cutting conditions. Figure 7 illustrates the chatter detection results with spindle speed 3000 rpm and feed rate 3.5 mm/s. As shown in Fig. 7b, the energy entropy stays at a low level and has a very slight fluctuation in the initial stage, and it increases rapidly after t = 0.125 s. It is at the moment t1 = 0.289 s that the chatter is identified by the proposed method. In contrast, the synchroextracting-based method detects chatter at the moment t2 = 0.367 s. It demonstrates that the proposed method achieves a better performance than the synchroextracting-based method. Subsequently, the energy entropy gradually grows to a higher stable value, which indicates that the chatter has been fully developed. According to Fig. 7a, the amplitude of the measured vibration signal has the similar change tendency, which validates that the energy entropy can fully reflect the development process of chatter. Figure 7 c presents the time-frequency spectrum of the vibration signal acquired by MSST, where the relatively obvious chatter frequency can be observed at approximately t = 0.384 s. Even though the occurrence of the chatter can be detected from the Fig. 7c, it is difficult to identify the chatter at the early stage. By comparison, the proposed method can detect the chatter earlier.

Fig. 7
figure 7

Chatter test with tool B at spindle speed 3000 rpm, feed rate 3.5 mm/s. Workpiece material AL6061. a The vibration signal. b The energy entropy. c Time-frequency spectrum of MSST

The chatter detection results with spindle speed 3000 rpm and feed rate 4.5 mm/s are presented in Fig. 8. The workpiece material is aluminum alloy AL6061 as well. During the initial stage, the energy entropy almost stays constant, as shown in Fig. 8b. However, a considerable increase occurs after t = 0.151 s and shortly after the energy entropy exceeds the threshold value. Specifically, the chatter is detected around t1 = 0.169 s by the proposed method and t2 = 0.178 s by the synchroextracting-based method. It validates that the proposed algorithm can detect chatter earlier than the synchroextracting-based method. After that, the energy entropy continues to rise and reaches a higher stable value. As shown in Fig. 8a, there is a same trend in the amplitude of the vibration signal, but it does not increase significantly until approximately t = 0.197 s. It is seen that the energy entropy is very sensitive to the chatter development. It can been seen from Fig. 8c that the fundamental chatter frequency was about 1673 Hz and its second and third harmonics (3347 Hz and 5030 Hz) also appeared when chatter occurred. Actually, other experiments also had similar results, as shown in Figs. 5c, 6c, and 7c. Figure 8 c suggests that the chatter frequency becomes relatively apparent at about t = 0.197 s in the time-frequency spectrum. In conclusion, the result shows that the proposed method has a good performance for chatter detection and is applicable for different drilling conditions.

Fig. 8
figure 8

Chatter test with tool B at spindle speed 3000 rpm, feed rate 4.5 mm/s. Workpiece material AL6061. a The vibration signal. b The energy entropy. c Time-frequency spectrum of MSST

4 Conclusion

In this study, we develop a chatter detection method combining MSST and energy entropy for the robotic drilling process, in which weak chatter features in the early stage can be captured accurately. The spindle vibration signals are measured for chatter detection by accelerometer. In view of the principle that the chatter should be identified as early as possible, optimal matrix notch filters with good computational efficiency are designed to eliminate spindle rotating frequency and its harmonics from the measured vibration signals so as to highlight the early weak chatter components. In addition, to ensure the accuracy of chatter recognition, the MSST is adopted to analyze the filtered signals and generate a high-resolution TF representation. Then, the high-resolution TF plane is divided equally into several frequency bands and the corresponding subsignal can be recovered accurately by utilizing perfect signal reconstruction ability of MSST. Since the development of chatter is always accompanied by the change of frequency and energy distribution in vibration signal, the energy entropy of the whole signal is calculated to indicate the chatter, which has remarkable sensitivity to chatter. Robotic drilling experiments and comparisons with the synchroextracting-based method have been conducted to verify the proposed algorithm, and the results demonstrate that the proposed method can detect the weak chatter at an early stage with high effectiveness and is suitable for different drilling conditions. In the future work, more efforts will be devoted to the construction of online chatter suppression system for robotic drilling, and the research of chatter suppression method based on sinusoidal spindle speed variation.