1 Introduction

Due to the design errors, material defects, process defects, machining errors, improper installation and debugging, improper operation, etc. [1], CNC machine tools fail frequently within half a year of use [2, 3], which greatly reduces their reliability. At the same time, the initial stage of product use is a critical time for users to judge the quality of the product psychologically. In this stage, the frequent failure will seriously damage the image of the enterprise [4, 5]. Therefore, it is of great significance to study the failures in this period and try to eliminate them in the product manufacturing enterprises to improve the reliability and market competitiveness of products [6].

The definition of CNC machine tool early failures in the current research is not accurate enough, and the characteristics of different types of failures are not considered when establishing the failure model. The accurate establishment of the early failure model of CNC machine tools is of great significance for the subsequent early failure analysis and elimination measures, so it is necessary to select the appropriate method to analyze the failure data of CNC machine tools. The current failure modeling methods of machine tools are mostly based on the failure time intervals. They assume that the failure data of products are independent identical distribution and consider that the repair of the products is a process of “repair as new” and the repaired products can reach the level of just leaving the factory, such as literature [7, 8]. However, the occurrence of CNC machine tool failure is a dynamic process. It is obviously inappropriate to use the time interval between failures as the analysis data. In addition, the theory of “repair as new” is generally for non-repairable products, while machine tools are repairable systems [9, 10]. Moreover, the theory also ignores the degradation of non-faulty parts of machine tools [11], and the quality of repaired products cannot be restored to its state of delivery. In fact, when CNC machine tools fail, it is generally only necessary to repair or replace the failed parts [12], which can make the reliability of the repaired products basically restore to the level before it was repaired, so it is more reasonable to regard the maintenance process as “repair as old” [13, 14]. In conclusion, it is more accurate to use the time of failure as the analysis data and use the random point process to describe the characteristics of CNC machine tool failure. The occurrence of machine tool failure is a random process. At present, there are three main methods currently used for random point process modeling, namely HPP (homogeneous Poisson process), NHPP, and RP (renewal process). HPP assumes that the failure intensity of the product is constant [15, 16], and it is proven that the failure intensity of machine tools is not constant [3], so the modeling method is not suitable for machine tools. RP is mainly used to describe the situation that product repair is “repair as new” [17, 18], and it is not suitable for failure modeling of machine tools. NHPP meets the minimum maintenance assumption of “repair as old” [10, 19], and it is considered that the repaired product can reach the level before its failure. NHPP can reflect the randomness and dynamics of machine tool failure, which is widely applied, as shown in literature [20, 21]. So, it is reasonable to select this method to establish the failure model of CNC machine tools. In addition, the former failure of products also has a certain impact on the latter failure, and this relationship has an important impact on the failure modeling of machine tools, which should also be valued and analyzed [22].

The rationality and integrity of failure data collection are the basis for the accurate establishment of the failure model and early failure cause tracing. The failure data should mainly come from the experimental data of manufacturing enterprises, the maintenance data of after-sales departments, the failure data of user self-repair, and the failure data of similar products. The collected failure data should include product model, production date, failure time, failure location, failure mode, failure cause, failure handling, and maintenance start and end time, etc. After the early failure model of CNC machine tools is established by the collected failure data and the selected method, the fitting effect of the established model to the failure data should be verified to determine the accuracy of the selected model. Then, the early failure analysis and prediction can be carried out according to the established model.

In this paper, the early failure of CNC machine tools is clearly defined, and the classification and collection methods of early failure are also introduced. The respective analysis models are established for different types of early failure, and the model is solved and the goodness of fit test is carried out. The appropriate indicators are selected to evaluate the reliability of CNC machine tools. The correlation between the former early failure and the latter early failure of CNC machine tools is also analyzed. The accuracy and rationality of the proposed method are verified by an example.

2 Early failure definition and classification

2.1 Early failure definition

A large number of researches and practices have shown that under the specified operating environment, operation, and maintenance, the relationship between the failure characteristic indexes of machine tools and time in its life cycle is generally shown as the shape of “bathtub”, which is commonly called as the bathtub curve, as shown in Fig. 1 [23, 24].

Fig. 1
figure 1

Failure rate bath curve

In Fig. 1, t0 is the moment when machine tools are installed and debugged; t1 is the transition point between early failure period and accidental failure period of machine tools, which is called the early failure time inflection point; t2 is the transition point between accidental failure period and depletion failure period of machine tools, which is called the accidental failure time inflection point; t3 is the moment when the machine tools are scrapped. The early failure of CNC machine tools is defined as follows:

CNC machine tool early failures (CMTEF) refer to the failures that occur during the period from the completion of machine tool installation and commissioning to the rapid decline of failure rate in the early stage of use, specifically the failures occurring during the period t0~t1 in Fig. 1.

In early failure period, the failure rate of machine tools will decrease rapidly with time. In accidental failure period, the failure rate of machine tools will basically stabilize. When products are used for a certain period of time, the failure rate will increase rapidly with time until they are scrapped. The failures in early failure period of CNC machine tools often have the following characteristics: failure occurrence rate is high but its decline speed is fast, large proportion of early failures in the total number of failures in the whole life cycle of the products, failure mode and failure causes are concentrated, etc. [25].

2.2 Early failure classification

In this paper, the early failures of CNC machine tools are divided into two types: sudden early failures and progressive early failures. The relevant definitions and analysis are as follows:

Sudden early failures (SEF) refer to those failures caused by “accidental causes” in the early use stage of machine tools. These failures occur irregularly, and they cannot be established with probability models and calculated.

The “accidental causes” here refer to those factors that cause sudden failures of machine tools, such as foreign matter intrusion, processing fluid pollution, excessive temperature, excessive humidity, noise pollution, large vibration, unstable voltage, improper operation, interference of purchased parts, and self-made parts, etc. These factors will cause sudden failures of the parts, which can reduce the stability of the performance of CNC machine tools. In severe cases, the function of machine tools will be lost. The occurrence of sudden early failures is irregular, and it is impossible to use quantitative probability model to model them [26], which is the important reason to restrict the improvement of machine tool reliability.

Sudden early failures can be regarded as a random impulse failure function. Due to the early failure causes of this type, the operating state of machine tools will suddenly fluctuate greatly. When the fluctuation reaches or exceeds the acceptable critical value, failures will occur. Assuming that the normal operation state fluctuation function of machine tools is F(t), which will be impacted by pulse function δ(t) at tt0 and tt1, and the maximum fluctuation amplitude that machine tools can bear is A, then the schematic diagram of sudden early failures of machine tools is shown in Fig. 2, and the mathematical model is shown in formula (1).

Fig. 2
figure 2

Sudden early failure mathematical model

$$ {\displaystyle \begin{array}{l}F\left({t}_{\mathrm{t}0}\right)+\delta \left({t}_{\mathrm{t}0}\right)\ge A\\ {}\mathrm{or}\\ {}F\left({t}_{\mathrm{t}1}\right)+\delta \left({t}_{\mathrm{t}1}\right)\le -A\end{array}} $$
(1)

Progressive early failures (PEF) refer to the failures caused by “essential cause” in the early failure period of machine tools. This kind of failures is subject to a certain probability distribution, and it can be predicted by building a mathematical model.

The “essential causes” here refer to those failure factors that can be modeled or quantified. The changes of parts quality characteristics caused by them are traceable and can be quantified by a certain statistical model. Progressive early failures obey a probability distribution, which can be analyzed by building a probability statistical model [27].

The failure generation process of machine tools can generally be described by NHPP [28,29,30]. According to the failure model, the early failure period of machine tools can be calculated, and the early failures of products can be obtained, which lays a foundation for subsequent failure analysis.

3 Early failure modeling

3.1 Sudden early failure modeling

The occurrence of sudden early failures is closely related to the capability and quality of the operators, the maintenance status of equipment, the coordination status among the parts with different materials, the process flow, the operating environment, and the parameters detection, etc., which is the 5M1E.

Fishbone diagram, also known as Ishikawa diagram and cause effect diagram, is an effective method to explore the “essential causes” of problems, which is simple, intuitive, convenient, and quick. In this paper, Fishbone diagram and 5M1E are used to discuss the sudden early failures, and the failure model is shown in Fig. 3.

Fig. 3
figure 3

Sudden early failure model

Figure 3 is only a schematic diagram to illustrate how to establish the model from man, machine, material, method, measurement, and environment. In the specific case analysis, the required failure influencing factors can be added according to the process. When the collected failure data is less, the historical failure data of similar products can be used for analysis.

3.2 Progressive early failure modeling

The modeling process of progressive early failures of CNC machine tools in this paper is shown in Fig. 4.

Fig. 4
figure 4

Progressive early failure modeling process

The detailed analysis of each modeling step in Fig. 4 is as follows.

  1. (1)

    Failure data collection

There are three main sources of failure data in this paper: the user’s self-maintenance record, the after-sales maintenance record of products manufacturer, and the on-site failure detection record. The specific failure data collection process is as follows:

  1. (a)

    For the failures that can be repaired by users themselves, the maintenance personnel shall promptly repair them in time after the failures occur and record the product maintenance information;

  2. (b)

    For the failures that users cannot repair by themselves, they should contact with the after-sales department of the product manufacturer in time. After receiving the users’ maintenance requirements, the after-sales department should promptly repair the products and record the maintenance process;

  3. (c)

    Regularly track the investigated products, so as to know the products status and deal with the problems encountered in time.

  1. (2)

    Failure data preprocessing and trend test

    1. (a)

      Failure data preprocessing

Generally, the failure data collected from the products use site cannot be used directly and needs to be transformed in advance. TTT (total test time) method is the most commonly used data transformation method [31]. This paper selects this method to preprocess the collected data, and the transformation process is as follows.

According to the principle of timing censoring, the failure data of m machine tools are statistically analyzed. Assuming that the timing censoring time of the i-th (0 < i < m) product is Ti, ni failure data are collected in the time of (0, Ti), and the moment when the j-th failure of i-th product occurs is tij. The failure occurrence times of the m machine tools are arranged in order of small to large, then 0 < S1 ≤ S2 ≤ … ≤ SK ≤ … ≤ SN, where SK represents the moment when the K-th failure occurs in the time series, and N = n1 + n2 + … + nm is the total number of m product failures in the statistical time.

TTT method is used to process the collected failure data, and the transformed time series can be obtained as formula (2):

$$ T\left({S}_{\mathrm{K}}\right)={\int}_0^{S_{\mathrm{K}}}p(u){\mathrm{d}}_{\mathrm{u}} $$
(2)

where u represents the time of failure occurrence; p(u) is the number of machine tools observed at time u, and when all failure processes are observed, p(u) = m [30].

  1. (b)

    Failure trend test

In order to improve the accuracy of reliability model selection, it is necessary to judge the trend of product failure data in advance. TTT graph method is particularly suitable for the trend judgment of failure data from multiple products [32]. The values of abscissa and ordinate of TTT graph can be calculated from formula (3):

$$ \left\{\begin{array}{l}{y}_{\mathrm{K}}=T\left({S}_{\mathrm{K}}\right)/T(S)={\int}_0^{S_{\mathrm{K}}}p(u){\mathrm{d}}_{\mathrm{u}}/{\int}_0^Tp(u){\mathrm{d}}_{\mathrm{u}}\\ {}{x}_{\mathrm{K}}=K/N\end{array}\right. $$
(3)

where T = max(T1, T2,…, Tm), K is the number of failures observed at u = SK.

Substituting the processed failure data into formula (3), a TTT graph of the product failures can be obtained, and then, the trend of product failure intensity can be judged. In general, there are four representations of TTT graphs, as shown in Fig. 5.

Fig. 5
figure 5

Typical shape of TTT graphs. a System with stable failure rate. b System with reduced failure rate. c System with increased failure rate. d System with reduced first and increased then failure rate

In Fig. 5, the scatter points in figure (a) are approximately a straight line, indicating that there is no tendency for product failures, and the product failure intensity is almost unchanged, which means that the system is relatively stable. The scattered points in figure (b) are concave, indicating that the product failure intensity is getting smaller and smaller. As time goes by, the number of product failures is less and less, and the product performance tends to be stable. Figure (c) has the opposite meaning to figure (b). The scatter points in figure (d) are S-shaped and show “concave before convex”, which indicates that the product failure intensity first decreases with time and then increases with time, that is, the number of product failures decreases first and then increases with time, which is a typical “bathtub curve” trend.

  1. (3)

    Model selection

    1. (a)

      Modeling method primary selection

Traditional reliability modeling methods believe that the reliability model established must be the same as long as the product failure time intervals are the same [33]. However, in engineering practice, for different CNC machine tools of the same type with the same failure time intervals but different failure sequence, the reliability at the same time is not the same, as shown in Fig. 6.

Fig. 6
figure 6

CNC machine tool failure sequence

In Fig. 6, product 1-1, product 1-2, and product 1-3, respectively, represent different CNC machine tools of the same model; 1, 2, 3, and 4, respectively, represent the serial number of failure; and △t, 2△t, and 3△t, respectively, represent the failure interval time. It can be seen from Fig. 6 that products 1-1, 1-2, and 1-3 have the same failure time interval, then the reliability models established by traditional methods are the same, and the reliability of the three products at the same time is also the same. However, the failure rules of these three products are obviously different, and their reliability level at the same time is not the same. It can be seen that the reliability model based on the failure time interval is not consistent with the engineering practice. Many studies have proved that the maintenance of machine tools belong to the “minimum maintenance” of repairable products, and it can be considered that the reliability of repaired products can reach the level before failure, that is, the maintenance of CNC machine tools is a “repair as old” process [34].

Compared with the operation time of machine tools, the failure maintenance time can generally be ignored, so that the failure occurrence process of machine tools can be described by random point process. NHPP is the most commonly used method in random point process, and it is also the most suitable method for reliability modeling of CNC machine tools [35]. Therefore, this paper uses NHPP method to model the reliability of CNC machine tools.

  1. (b)

    NHPP process basic principles

If a counting process {N(t), t ⩾ 0} satisfies the following conditions, it is called an NHPP process with failure intensity function of λ(t):

  • N(0) = 0;

  • ② {N(t), t ⩾ 0} has independent increments;

  • P{N(t + Δt) − N(t) = 1} = λ(t)Δt + ot);

  • P{N(t + Δt) − N(t) ≥ 2} = ot).

where λ(t)Δt represents the failure probability of product within time Δt; cumulative failure intensity function ω(t) is the average failure number E[N(t)] in time (0,t], which is shown in formula (4):

$$ E\left[N(t)\right]=\omega (t)={\int}_0^t\uplambda (t) dt $$
(4)

Then

$$ {\displaystyle \begin{array}{l}P\left\{N\left(t+\Delta t\right)-N(t)=k\right\}\\ {}=\frac{{\left[\omega \left(t+\Delta t\right)-\omega (t)\right]}^{\mathrm{k}}}{k!}{e}^{-\left[\omega \left(t+\Delta t\right)-\omega (t)\right]}\end{array}} $$
(5)

or

$$ P\left\{N(t)=k\right\}=\frac{{\left[\omega (t)\right]}^{\mathrm{k}}}{k!}{e}^{-\omega (t)} $$
(6)

It can be seen from formulas (5) and (6) that N(t + Δt) − N(t) obeys the Poisson distribution with a mean of ω(t + Δt) − ω(t), or N(t) obeys the Poisson distribution with a mean of ω(t).

  1. (c)

    Model comparison and selection

Figure 1 is obtained from the failure analysis of electronic products. Generally speaking, the failure intensity of machine tools does not strictly follow the curve trend shown in Fig. 1. Many test data show that the failure of CNC machine tools generally follows the law shown in Fig. 7 [3].

Fig. 7
figure 7

CNC machine tool failure intensity curve

It can be seen from Fig. 7 that the failure intensity curve of CNC machine tools can be roughly divided into two parts: early failure period and depletion failure period. The failure process with the shape shown in Fig. 7 can be described by superposition of two independent NHPPs [35], and the failure intensity function is shown in formula (7).

$$ \lambda (t)={\lambda}_1(t)+{\lambda}_2(t) $$
(7)

where λ1(t) and λ2(t), respectively, represent the early failure process and the depletion failure process of products. According to the properties of NHPP, the superposed function λ (t) is still a NHPP.

For products whose failure intensity curve is “bathtub” shape, the commonly used NHPP modeling methods are mainly as follows: S-PLP (superposed power law process) method proposed by Pulcini [36], S-LLP (superposed log-linear process) method proposed by Byeong [37], four-parameter method proposed by Ren (referred to as four-parameter method) [29], five-parameter method proposed by Wang (referred to as five-parameter method) [38], and the BBIP method proposed by Pulcini, which are, respectively, introduced as follows.

  • ① S-PLP method model

S-PLP model is superimposed by two PLP models, and its failure intensity function is shown in formula (8):

$$ {\displaystyle \begin{array}{l}\lambda (t)={\lambda}_1{\beta}_1{t}^{\beta_1-1}+{\lambda}_2{\beta}_2{t}^{\beta_2-1}\kern1.1em \\ {}{\lambda}_1,{\lambda}_2>0,0<{\beta}_1<1,{\beta}_1>1,t\ge 0\end{array}} $$
(8)

where λ1, β1, λ2, and β2 are the model parameters of S-PLP model, respectively.

From formula (8), we can see that the first half of S-PLP model is a monotonically decreasing function, and the second half is a monotonically increasing function. However, the failure intensity function will have a large mutation when t → 0+, which is quite different from the actual [39].

  • ② S-LLP method model

S-LLP model is superimposed by two LLP models, and its failure intensity function is shown in formula (9):

$$ {\displaystyle \begin{array}{l}\uplambda (t)={\alpha}_1{e}^{-{\gamma}_1t}+{\alpha}_2{e}^{\gamma_2t}\\ {}\kern0.1em {\alpha}_1,{\alpha}_2,{\gamma}_1,{\gamma}_2>0,t\ge 0\end{array}} $$
(9)

where α1, γ1, α2, and γ2 are the model parameters of S-LLP model, respectively.

It can be seen from formula (9) that S-LLP model is also composed of a monotone decreasing function and a monotone increasing function. λ(t) = α1 + α2 at t → 0+, and λ(t)|t →  + ∞ =  + ∞ at t →  + ∞, which indicate that failure intensity of products will increase infinitely with time, which is not in line with the actual use process of the product. This phenomenon is not consistent with the actual.

  • ③ Four-parameter method model

The four-parameter method is superimposed by two independent NHPP processes, and its failure intensity function is shown in formula (10):

$$ {\displaystyle \begin{array}{l}\uplambda (t)=\alpha \beta /\left(t+\alpha \right)+\eta \left[1-\theta /\left(t+\theta \right)\right]\kern0.2em \\ {}\alpha, \beta, \eta, \theta >0,t\ge 0\end{array}} $$
(10)

where α, β, η, and θ are the model parameters of four-parameter method model, respectively.

In this modeling method, λ(t) = β at t → 0+, and λ(t) = η at t →  + ∞, which can better simulate the occurrence process of product failures.

  • ④ Five-parameter method model

Five-parameter modeling method is a combination of bounded Burr XII failure intensity function and bounded intensity process. It is mainly used to describe the failure intensity change of the minimum maintenance system with the shape of “bathtub”, and its failure intensity function is shown in formula (11):

$$ {\displaystyle \begin{array}{l}\uplambda (t)={a}_1+{kb}_1{t}^{b_1-1}/\left(1+{t}^{c_1}\right)+{c}_1\left[1-\exp \left(-t/{d}_1\right)\right]\kern0.3em \\ {}{a}_1,k,{c}_1,{d}_1>0,{b}_1>1,t\ge 0\end{array}} $$
(11)

where a1, b1, c1, d1, and k are the model parameters of five-parameter method model, respectively.

Although formula (11) can better simulate the process of product failures, the increase of model unknown parameters will inevitably lead to the increase in computational difficulty compared with the four-parameter modeling method. Generally speaking, on the premise that solution accuracy can be achieved, the fewer the parameters, the better.

  • ⑤ BBIP method model

BBIP model is based on the Drenick theory, which consists of an LLP model and a bounded intensity process model, and its failure intensity function can be expressed as formula (12):

$$ {\displaystyle \begin{array}{l}\uplambda (t)=a\times \exp \left(-t/b\right)+c\left[1-\exp \left(-t/d\right)\right]\kern0.1em \\ {}a,b,c,d>0,t\ge 0\end{array}} $$
(12)

where a, b, c, and d are the model parameters of BBIP model, respectively.

When t = 0, λ(t)|t = 0 = a, and a represents the initial value of the failure intensity function λ(t). When t →  + ∞, λ(t)|t → ∞ = c, and c represents the progressive value of failure intensity function λ(t). This model not only has the form of “bathtub” in function shape, but also has the boundedness of the boundary intensity function, and its form is simple and easy to calculate.

When the characteristics of product failure rate and the properties of analyzed data are not known, the best and the most suitable model can be obtained directly by hypothesis testing. Instead, we can first compare the properties of the alternative models with the characteristics of product failure, so as to eliminate some inappropriate alternative models. If there are still some alternative models suitable for the analyzed data, the most suitable model can be selected from the remaining alternative models by hypothesis testing.

In summary, four-parameter method and BBIP method are the most suitable NHPP methods for failure intensity modeling. Among these two methods, BBIP is more advantageous in terms of model structure, calculation process, and applicability [40]. Therefore, the BBIP method is used to model and analyze the failures of CNC machine tools.

  1. (d)

    Model property analysis

Formula (12) calculates the first partial derivative of time t, and formula (13) can be obtained as follows:

$$ {\lambda}^{\prime }(t)=-\frac{a}{b}{\mathrm{e}}^{\left(-\frac{t}{b}\right)}+\frac{c}{d}{\mathrm{e}}^{\left(-\frac{t}{d}\right)} $$
(13)

When t = 0, λ(t)|t = 0 =  − a/b + c/d. Because λ(t) is a function with the shape of “bathtub”, so λ(t)|t = 0 < 0, that is, there must be formula (14) as follows:

$$ bc< ad $$
(14)

It can be seen from the value of λ(t) at t = 0 that the smaller b is, the faster λ(t) decreases, so b represents the decline rate of failure intensity. The larger d is, the slower λ(t) approaches c.

Let λ(t) = 0, the only standing point τ0 of λ(t) can be obtained as shown in formula (15):

$$ {\tau}_0=\frac{bd}{d-b}\ln \left(\frac{ad}{bc}\right)\kern1.5em b\ne d $$
(15)

It is known from the meaning of τ0 that it is a positive number, and b < d should be satisfied.

Let formula (13) calculate the partial derivative of time t, and formula (16) can be obtained as follows:

$$ {\lambda}^{{\prime\prime} }(t)=\frac{a}{b^2}{\mathrm{e}}^{\left(-\frac{t}{b}\right)}-\frac{c}{d^2}{\mathrm{e}}^{\left(-\frac{t}{d}\right)} $$
(16)

Substituting the value of τ0 into formula (16), formula (17) can be obtained as follows:

$$ {\left.{\lambda}^{{\prime\prime} }(t)\right|}_{t={\tau}_0}={\left(\frac{ad}{bc}\right)}^{\frac{d}{b-d}}\times \frac{a\left(d-b\right)}{b^2d} $$
(17)

As can be seen from formula (17), \( {\left.{\lambda}^{{\prime\prime} }(t)\right|}_{t={\tau}_0}>0 \). Since there are \( {\left.{\lambda}^{\prime }(t)\right|}_{t={\tau}_0}=0 \) and \( {\left.{\lambda}^{{\prime\prime} }(t)\right|}_{t={\tau}_0}\ne 0 \) when t = τ0, τ0 must be the only extreme value point of λ(t), and τ0 is the minimum value of λ(t) when bc < ad and b < d are satisfied. The minimum value of λ(t) can be obtained by substituting τ0 into formula (12) as shown in formula (18):

$$ {\left.\lambda (t)\right|}_{t={\tau}_0}=c+\frac{a\left(b-d\right)}{b}\times {\left(\frac{ad}{bc}\right)}^{\frac{d}{b-d}} $$
(18)

To ensure that λ(t) is a single valley function with the “bathtub” shape, formula (19) must be satisfied.

$$ \left\{\begin{array}{l}a,b,c,d>0\\ {} bc< ad\\ {}b<d\end{array}\right. $$
(19)

Let λ(t) = 0, the only standing point of λ(t) can be obtained as shown in formula (20):

$$ {\tau}_1=\frac{bd}{d-b}\ln \left(\frac{ad^2}{b^2c}\right) $$
(20)

Comparing formulas (15) and (20), since b < d, there must be τ1 > τ0 > 0.

Let formula (16) calculate the partial derivative of time t, and formula (21) can be obtained as follows:

$$ {\lambda}^{{\prime\prime\prime} }(t)=-\frac{a}{b^3}{\mathrm{e}}^{\left(-\frac{t}{b}\right)}-\frac{c}{d^3}{\mathrm{e}}^{\left(-\frac{t}{d}\right)} $$
(21)

Substituting the value of τ1 into formula (21), and formula (22) can be obtained as follows:

$$ {\left.{\lambda}^{{\prime\prime\prime} }(t)\right|}_{t={\tau}_1}={\left(\frac{ad^2}{b^2c}\right)}^{\frac{d}{b-d}}\times \frac{a\left(b-d\right)}{b^3d}<0 $$
(22)

It can be seen from \( {\left.{\lambda}^{{\prime\prime} }(t)\right|}_{t={\tau}_1}=0 \) and formula (22) that τ1 must be an inflection point of λ(t), and the slope of λ(t) is the largest at t = τ0. From the above analysis, we can also know that λ(t) decreases monotonically in the interval [0, τ0], increases monotonically in the interval [τ0, +∞], and τ0 is the early failure inflection point. λ(t) is a concave function in the interval [0, τ1], and a convex function in the interval [τ1, +∞].

To illustrate the influence of different parameters on λ(t), Fig. 8 is obtained by control variable method.

Fig. 8
figure 8

Influence of model parameters on failure intensity

In Fig. 8, comparing line ② and line ①, it can be seen that the value of a determines the starting point of λ(t), and the larger the value of a, the higher the initial failure rate of products. Comparing line ③ and ①, we can know that the value of b determines the decline speed of failure intensity in the early failure period of products, and the smaller the b value, the faster the λ(t) decreases, the shorter the early failure period is. As far as product manufacturing companies and users are concerned, the smaller the b value, the better. Comparing line ④ and line ①, the value of c determines the asymptotic value of failure intensity. It can be seen from the figure that the smaller the c value, the more gentle the λ(t) is, and the better the performance of products. Comparing line ⑤ and line ①, the larger the value of d is, the slower the failure intensity curve approaches to its gradual value, and the longer the products is in the accidental failure period. From the above analysis, we can see that for the BBIP model, the smaller the values of a, b, and c, and the larger the value of d, the better the reliability of products.

  1. (4)

    Model parameter estimation

Substituting formula (12) into formula (4), the average failure number of CNC machine tools in (0, t] time can be obtained as shown in formula (23):

$$ {\displaystyle \begin{array}{c}E\left[N(t)\right]=\omega (t)={\int}_0^t\lambda (t) dt\\ {}= ab- cd- ab\times \exp \left(-t/b\right)\\ {}+c\left[t+d\times \exp \left(-t/d\right)\right]\end{array}} $$
(23)

It can be seen from formula (23) that E[N(t)]|t = 0 = 0 at t = 0, which is consistent with the engineering practice.

The joint probability density likelihood function of m CNC machine tools’ failures can be obtained by formula (12) as shown in formula (24):

$$ {\displaystyle \begin{array}{c}L=\prod \limits_{i=1}^m\Big\{\prod \limits_{j=1}^{n_{\mathrm{i}}}\left\{a\times \exp \left(-{t}_{\mathrm{i}\mathrm{j}}/b\right)+c\left[1-\exp \left(-{t}_{\mathrm{i}\mathrm{j}}/d\right)\right]\kern0.2em \right\}\\ {}\times \exp \Big\{- ab+ cd+ ab\times \exp \left(-{T}_{\mathrm{i}}/b\right)\\ {}-c\left[{T}_{\mathrm{i}}+d\times \exp \left(-{T}_{\mathrm{i}}/d\right)\right]\left\}\right\}\end{array}} $$
(24)

Formula (25) can be obtained by taking the logarithm of both sides of formula (24) as follows:

$$ {\displaystyle \begin{array}{c}l=\ln L=\sum \limits_{i=1}^m\Big\{\sum \limits_{j=1}^{n_i}\ln \left\{a\times \exp \left(-{t}_{ij}/b\right)+c\left[1-\exp \left(-{t}_{ij}/d\right)\right]\kern0.20em \right\}\\ {}-\left\{ ab- cd- ab\times \exp \left(-{T}_i/b\right)+c\left[{T}_i+d\times \exp \left(-{T}_i/d\right)\right]\right\}\Big\}\\ {}=\sum \limits_{i=1}^m\sum \limits_{j=1}^{n_i}\ln \left\{a\times \exp \left(-{t}_{ij}/b\right)+c\left[1-\exp \left(-{t}_{ij}/d\right)\right]\kern0.20em \right\}\\ {}-\sum \limits_{i=1}^m\left\{ ab- cd- ab\times \exp \left(-{T}_i/b\right)+c\left[{T}_i+d\times \exp \left(-{T}_i/d\right)\right]\right\}\end{array}} $$
(25)

It can be seen from formula (23) that the total failure number N of the m machine tools in (0, T] can be expressed as formula (26):

$$ {\displaystyle \begin{array}{l}N=\sum \limits_{i=1}^m{n}_{\mathrm{i}}\\ {}=\sum \limits_{i=1}^m\left\{ ab- cd- ab\times \exp \left(-{T}_{\mathrm{i}}/b\right)+c\left[{T}_{\mathrm{i}}+d\times \exp \left(-{T}_{\mathrm{i}}/d\right)\right]\right\}\end{array}} $$
(26)

Formula (27) can be obtained by transforming formula (26) as follows:

$$ a=\frac{\sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-{de}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]}{b\sum \limits_{i=1}^m\left(1-{e}^{-\frac{T_{\mathrm{i}}}{b}}\right)} $$
(27)

Substituting formulas (26) and (27) into formula (25), a function with three parameters can be obtained as shown in formula (28):

$$ {\displaystyle \begin{array}{l}l=\ln L\\ {}=\sum \limits_{i=1}^m\sum \limits_{j=1}^{n_{\mathrm{i}}}\kern2pt \ln \left\{\frac{\sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-{de}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]}{b\sum \limits_{i=1}^m\left(1-{e}^{-\frac{T_{\mathrm{i}}}{b}}\right)}\times \exp \left(-{t}_{\mathrm{i}\mathrm{j}}/b\right)+c\left[1-\exp \left(-{t}_{\mathrm{i}\mathrm{j}}/d\right)\right]\kern0.2em \right\}\\ {}-N\end{array}} $$
(28)

In order to ensure the meaning of formula (28), there must be formula (29).

$$ \frac{\sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-{de}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]}{b\sum \limits_{i=1}^m\left(1-{e}^{-\frac{T_{\mathrm{i}}}{b}}\right)}>0 $$
(29)

As can be seen from formula (12), b > 0. It is known from the meaning of Ti that Ti > 0. At this time, \( b\sum \limits_{i=1}^m\left(1-{e}^{-\frac{T_{\mathrm{i}}}{b}}\right) \), then formula (29) can be equivalent to formula (30).

$$ \sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-{de}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]>0 $$
(30)

The parameter estimation problem of λ(t) can be finally transformed into the minimization problem of formula (28) under the nonlinear constraint, and the solving model can be shown in formula (31).

$$ {\displaystyle \begin{array}{c}\min -\sum \limits_{i=1}^m\sum \limits_{j=1}^{n_{\mathrm{i}}}\kern2pt \ln \left\{\frac{\sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-d{e}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]}{b\sum \limits_{i=1}^m\left(1-{e}^{-\frac{T_{\mathrm{i}}}{b}}\right)}\times \exp \left(-{t}_{\mathrm{i}\mathrm{j}}/b\right)+c\left[1-\exp \left(-{t}_{\mathrm{i}\mathrm{j}}/d\right)\right]\kern0.20em \right\}+N\\ {}s.t.\kern1.8em \left\{\begin{array}{c}-\sum \limits_{i=1}^m\left[{n}_{\mathrm{i}}+c\left(d-{T}_{\mathrm{i}}-d{e}^{-\frac{T_{\mathrm{i}}}{d}}\right)\right]<0\\ {}-b<0\\ {}-c<0\\ {}-d<0\end{array}\right.\end{array}} $$
(31)

Solving formula (31), we can get the estimated values of b, c, and d as \( \hat{b} \), \( \hat{c} \), and \( \hat{d} \), respectively. Substituting them into formula (27), we can get the estimated value of a as \( \hat{a} \).

  1. (5)

    Goodness of fit test

The purpose of goodness of fit test is to find out the degree of conformity between the selected failure model and the failure data. According to the reference [41], the goodness of fit evaluation index P of the failure model of CNC machine tools is set as formula (32).

$$ P=1-\sqrt{\frac{\sum \limits_{S_{\mathrm{K}}=0}^T{\left({N}_{S_{\mathrm{K}}}-{\overline{N}}_{S_{\mathrm{K}}}\right)}^2}{\sum \limits_{S_{\mathrm{K}}=0}^T{N}_{S_{\mathrm{K}}}^2}} $$
(32)

where \( {N}_{S_{\mathrm{K}}} \) is the actual cumulative failure number of CNC machine tools observed at time SK; \( {\overline{N}}_{S_{\mathrm{K}}} \) is the expected failure number at time SK.

Generally speaking, when P > 0.9, it can be considered that the fitting degree between the assumed model and the failure data is acceptable [40], and the larger the value of P, the higher the fitting degree between the assumed model and the failure data, that is, the more appropriate the selected model is.

4 Product reliability evaluation and early failure correlation analysis

4.1 Reliability evaluation

Generally speaking, the commonly used reliability evaluation indexes can be divided into instantaneous reliability indexes and cumulative reliability indexes. Instantaneous reliability evaluation indexes mainly include instantaneous failure intensity function λ(t) and MTBFs (instantaneous mean time between failure). Cumulative reliability indexes mainly include average cumulative failure number ω(t), cumulative failure intensity function λc(t), MTBFc (cumulative mean time between failure), and reliability function R(t). The meanings and calculation formulas of λ(t) and ω(t) have been given above, and other reliability evaluation indexes are introduced as follows:

  1. (1)

    Instantaneous mean time between failure

MTBFs refers to the time taken for a failure of products, which can be obtained by formula (33).

$$ {MTBF}_{\mathrm{s}}=1/\lambda (t)=1/\left\{\hat{a}\times \exp \left(-t/\hat{b}\right)+\hat{c}\left[1-\exp \left(-t/\hat{d}\right)\right]\right\} $$
(33)
  1. (2)

    Cumulative mean time between failure

MTBFc represents the reliability level of CNC machine tools in time [0, t], which can be obtained by formula (34).

$$ {\displaystyle \begin{array}{c}{MTBF}_{\mathrm{c}}=\frac{t_2-{t}_1}{\int_{t_1}^{t_2}\lambda (t){d}_{\mathrm{t}}}=\frac{t_2-{t}_1}{{\left.\omega (t)\right|}_{t_1}^{t_2}}\\ {}=\frac{t_2-{t}_1}{{\left.\left\{\hat{a}\hat{b}-\hat{c}\hat{d}-\hat{a}\hat{b}\times \exp \left(-t/\hat{b}\right)+\hat{c}\left[t+\hat{d}\times \exp \left(-t/\hat{d}\right)\right]\right\}\right|}_{t_1}^{t_2}}\end{array}} $$
(34)
  1. (3)

    Cumulative failure intensity function

λc(t) can be directly given by formula (35) as follows [42]:

$$ {\displaystyle \begin{array}{c}{\lambda}_{\mathrm{c}}(t)=\frac{1}{MTBF_{\mathrm{c}}}\\ {}=\frac{{\left.\left\{\hat{a}\hat{b}-\hat{c}\hat{d}-\hat{a}\hat{b}\times \exp \left(-t/\hat{b}\right)+\hat{c}\left[t+\hat{d}\times \exp \left(-t/\hat{d}\right)\right]\right\}\right|}_{t_1}^{t_2}}{t_2-{t}_1}\end{array}} $$
(35)
  1. (4)

    Reliability function

R(t) is mainly used to reflect the probability that products can operate normally at time t, which can be obtained by formula (36).

$$ {\displaystyle \begin{array}{c}R(t)=\exp \left(-{\int}_0^t\lambda (t) dt\right)\\ {}=\exp \Big(-\hat{a}\hat{b}+\hat{c}\hat{d}+\hat{a}\hat{b}\times \exp \left(-t/\hat{b}\right)\\ {}-\hat{c}\left[t+\hat{d}\times \exp \left(-t/\hat{d}\right)\right]\Big)\end{array}} $$
(36)

4.2 Early failure correlation analysis

Generally speaking, there is a strong or weak connection between the former failure and the latter failure of products. Exploring this connection can lay a foundation for failure prediction and preventive maintenance. Considering that the functional relationship between the failures of products is weak, the correlation analysis method can be used to study the relationship. When the relationship between failure time data is obvious, the data can be analyzed by Pearson correlation coefficient method, Spearman’s rank correlation coefficient method and Kendall tau rank correlation coefficient method. The calculation methods are as follows.

Pearson correlation coefficient ρpxy can be obtained by formula (37).

$$ {\rho}_{\mathrm{pxy}}=\frac{\operatorname{cov}\left(x,y\right)}{\sigma_{\mathrm{x}}{\sigma}_{\mathrm{y}}}=\frac{E(xy)-E(x)E(y)}{\sqrt{E\left({x}^2\right)-{E}^2(x)}\sqrt{E\left({y}^2\right)-{E}^2(y)}} $$
(37)

where cov(x, y) represents the covariance between two sets of failure time data of x and y; σx and σy are the standard deviation of two sets of failure time data of x and y, respectively.

Spearman’s rank correlation coefficient ρsxy can be obtained by formula (38).

$$ {\rho}_{\mathrm{sxy}}=1-\frac{6\sum \limits_{K=1}^N{d}_{\mathrm{K}}^2}{N\left({N}^2-1\right)} $$
(38)

where dK represents the difference between the ranks of two sets of failure time data of x and y; N is the total failure number; K is the number of failures observed at u = SK.

Kendall tau rank correlation coefficient ρkxy can be obtained by formula (39).

$$ {\rho}_{\mathrm{kxy}}=\frac{2\left(C-D\right)}{N\left(N-1\right)} $$
(39)

where C represents the number of elements with consistency in x and y failure time data; D represents the number of elements with inconsistencies in x and y failure time data; N is the total failure number.

The correlation judgment criteria between the two sets of data represented by ρpxy, ρsxy, and ρkxy are shown in Table 1.

Table 1 Simple correlation judgment criteria

When the correlation of failure time data obtained from formulas (37) to (39) is moderate or weaker, the complex correlation of these data should be analyzed. At this time, we can use polynomial, one or multiple regression equations, information entropy and mutual information, Fourier series, and other methods to analyze and solve the data correlation.

5 Application

In this paper, a CNC machine tool made in China is taken as an example for early failure modeling and analysis. Based on the historical failure data of the products to be tested and similar products, the time threshold for failure data collection is set as 9500 h by the reference [39]. During this time, 138 failure data of 8 CNC machine tools were obtained from after-sales department, user maintenance department, and irregular investigation, and 106 failure data were left after eliminating the sudden early failures, as shown in Table 2.

Table 2 CNC machine tool failure time

5.1 Sudden early failure modeling

Among the failure data collected, there are 32 sudden failures, the failure frequency is shown in Table 3, and the failure proportion is shown in Fig. 9.

Table 3 Failure modes and frequencies of sudden early failure
Fig. 9
figure 9

Sudden early failure proportion

It can be seen from Table 3 and Fig. 9 that in the sudden early failures of the CNC machine tools, component function loss, reduced machining accuracy, parts damage and liquid, gas, oil leakage are the main failure modes, which account for 45.28% of the total failures.

In this paper, the failure number of component function loss is the most, so its sudden early failure model is built by the proposed method, as shown in Fig. 10.

Fig. 10
figure 10

Component function loss early failure model

5.2 Progressive early failure modeling

5.2.1 Failure trend test

Analyzing the failure data in Table 2, we can get the change trend of machine tool failure number and cumulative average failure number with time, as shown in Fig. 11.

Fig. 11
figure 11

CNC machine tool failure trend and cumulative failure change trend

As can be seen from Fig. 11, the cumulative average failure number of machine tools shows a trend of non-monotonic change, specifically, the failure rate first decreases and then increases, and the corresponding failure interval time increases first and then decreases. In the early failure period, the number of machine tool failures is obviously more, and the time between two adjacent failures is smaller, which indicates that the machine failures occur frequently during this period.

In order to further make a clear judgment on the change of machine tool failure with time, the failure data in Table 2 is preprocessed by formula (2), and a TTT diagram is obtained by formula (3) as shown in Fig. 12.

Fig. 12
figure 12

Progressive early failure TTT diagram

It can be seen from the comparison between Figs. 5 and 12 that the failure intensity of the machine tools conform to the “bathtub curve”, so BBIP model can be used to describe the failure occurrence process.

5.2.2 Parameter estimation and test

Assume that the failure process of these eight machine tools can be described by the BBIP model with the same parameters, and the maximum value of its log-likelihood function is l0. Under this assumption, the corresponding model parameters can be obtained by Table 2, formula (27) and formula (31) as shown in Table 4.

Table 4 Estimation of BBIP model with same parameters

In fact, the failure process of these eight machine tools may not be completely the same considering the different manufacturing process and using environment of machine tools, and the different technology and quality of operators.

Suppose that the eight machine tools have different failure model parameters, and the maximum value of their respective log-likelihood function is li (i = 1, 2, …, 8). The parameters of each machine tool can be obtained by Table 2, formula (27), and formula (31), as shown in Table 5.

Table 5 Estimation of BBIP model with different parameters

In order to verify which of the above two hypotheses is more suitable to describe the failure process of machine tools, this paper uses the method proposed in literature [43] to test the two hypotheses. The test process is as follows:

  • H0 (zero hypothesis): The BBIP model parameters of each machine are the same;

  • H1 (alternative hypothesis): The BBIP model parameters of each machine are different;

If H0 is correct, the likelihood ratio statistic should obey the chi-square distribution with 4 freedom degrees approximately. The likelihood ratio statistic can be obtained from formula (40) as follows:

$$ \varLambda =-2\left({l}_0-\sum \limits_{i=1}^8{l}_{\mathrm{i}}\right) $$
(40)

Substituting the data in Tables 4 and 5 into formula (40), Λ = 10.40. According to the quantile table of chi-square distribution, the critical value p = 9.4877 when the significant level is 0.05 and the freedom degree is 4. There is Λ > p, so H0 should be rejected, which means that the BBIP model parameters of different machine tools are different.

5.2.3 Goodness of fit test

Substituting the data in Tables 2 and 5 into formula (32), the evaluation values of the goodness of fit for each machine tool can be obtained as shown in Table 6.

Table 6 Machine tool goodness of fit evaluation index value

It can be seen from Table 6 that Pi > 0.9 (i = 1, 2, ..., 8), which indicates that the parameter models obtained are better for fitting the failure data. Therefore, the model can be used to describe and analyze the failure process of the machine tools.

5.2.4 Early failure period

It can be seen from the above analysis that the failure process of 8 machine tools cannot be described by the BBIP model with the same parameters. In order to reflect the error between the early failure period obtained by the same parameter model and the early failure period obtained by different parameter models, the data in Tables 4 and 5 are substituted into formula (15), then the early failure period of each machine tool can be obtained as shown in Table 7 and the visual representation of the error can be obtained shown in Fig. 13.

Table 7 Early failure period of machine tools
Fig. 13
figure 13

Progressive early failure TTT diagram

The “-” in the error column in Table 7 indicates that the early failure period of machine tool obtained by the same parameter model is smaller than the early failure period obtained by the different parameter model.

It can be seen from Table 7 and Fig. 13 that the early failure periods of different machine tools are not the same. If the same parameter model is used to calculate the early failure period of each machine tool, it will inevitably bring errors in the calculation results, which will adversely affect the subsequent early failure analysis. It can also be seen from Table 7 and Fig. 13 that the early failure period calculated by the same parameter model and different parameter models are quite different, which also proves that the H0 hypothesis should be rejected.

5.3 Reliability evaluation and correlation analysis

5.3.1 Reliability evaluation

The data in Table 5 is sequentially substituted into formula (12), formula (22), and formula (33) to formula (36), and the reliability index function of each machine tool can be obtained as shown in Figs. 14, 15, 16, 17, 18, 19, 20, and 21, respectively.

Fig. 14
figure 14

Reliability index function of machine tool 1. a Failure intensity function of machine tool 1. b Average cumulative failure number function of machine tool 1. c Instantaneous mean time between failure function of machine tool 1. d Cumulative mean time between failure function of machine tool 1. e Cumulative failure intensity function of machine tool 1. f Reliability function of machine tool 1

Fig. 15
figure 15

Reliability index function of machine tool 2. a Failure intensity function of machine tool 2. b Average cumulative failure number function of machine tool 2. c Instantaneous mean time between failure function of machine tool 2. d Cumulative mean time between failure function of machine tool 2. e Cumulative failure intensity function of machine tool 2. f Reliability function of machine tool 2

Fig. 16
figure 16

Reliability index function of machine tool 3. a Failure intensity function of machine tool 3. b Average cumulative failure number function of machine tool 3. c Instantaneous mean time between failure function of machine tool 3. d Cumulative mean time between failure function of machine tool 3. e Cumulative failure intensity function of machine tool 3. (f) Reliability function of machine tool 3

Fig. 17
figure 17

Reliability index function of machine tool 4. a Failure intensity function of machine tool 4. b Average cumulative failure number function of machine tool 4. c Instantaneous mean time between failure function of machine tool 4. d Cumulative mean time between failure function of machine tool 4. e Cumulative failure intensity function of machine tool 4. f Reliability function of machine tool 4

Fig. 18
figure 18

Reliability index function of machine tool 5. a Failure intensity function of machine tool 5. b Average cumulative failure number function of machine tool 5. c Instantaneous mean time between failure function of machine tool 5. d Cumulative mean time between failure function of machine tool 5. e Cumulative failure intensity function of machine tool 5. f Reliability function of machine tool 5

Fig. 19
figure 19

Reliability index function of machine tool 6. a Failure intensity function of machine tool 6. b Average cumulative failure number function of machine tool 6. c Instantaneous mean time between failure function of machine tool 6. d Cumulative mean time between failure function of machine tool 6. e Cumulative failure intensity function of machine tool 6. f Reliability function of machine tool 6

Fig. 20
figure 20

Reliability index function of machine tool 7. (a) Failure intensity function of machine tool 7. (b) Average cumulative failure number function of machine tool 7 c Instantaneous mean time between failure function of machine tool 7. d Cumulative mean time between failure function of machine tool 7. e Cumulative failure intensity function of machine tool 7. f Reliability function of machine tool 7

Fig. 21
figure 21

Reliability index function of machine tool 8. a Failure intensity function of machine tool 8. b Average cumulative failure number function of machine tool 8. c Instantaneous mean time between failure function of machine tool 8. d Cumulative mean time between failure function of machine tool 8. e Cumulative failure intensity function of machine tool 8. f Reliability function of machine tool 8

From Figs. 14, 15, 16, 17, 18, 19, 20, and 21, it can be seen that the reliability indexes of each machine tool change with time. We can know that in the early failure period, the overall trend of the failure intensity is decreased, and then increases with time from Figure (a), Figure (c), Figure (d), and Figure (e). It can also be seen from the comparison between Figure (c) and Figure (d) that the value of MTBFc is lower than the value of MTBFs at the same time due to the early failure.

5.3.2 Failure correlation analysis

For CNC machine tools, when a failure occurs, its reliability will inevitably change, and the performance of products cannot be “restored as new”, which will affect the occurrence of its latter failure.

In order to find the relationship between the former failure and the latter failure, the correlation between them should be analyzed. In this paper, only the first three failures of the tested machine tools are analyzed as an example. The data in Table 2 are substituted into formula (37) to formula (39) in turn, and then ρpxy, ρsxy, and ρkxy are obtained as shown in Table 8.

Table 8 Correlation coefficient between failures

In Table 8, ρ(i, j) represents the correlation coefficient between the i-th failure and the j-th failure of the machine tools.

It can be seen from Tables 1 and 8 that the relationship between the first failure and the second failure of the machine tools is moderate correlation, and the relationship between the second failure and the third failure is significant correlation, so it can be considered that the second failure of the machine tools has a significant impact on its third failure, while the relationship between the first failure and the second failure may be a more complex nonlinear relationship that needs to be further analyzed.

The relationship between the first failure and the second failure ρxy can be obtained as follows:

$$ {\rho}_{\mathrm{xy}}={a}_0+\sum \limits_{k=1}^2\left[{a}_{\mathrm{k}}\cos \left( k\omega t\right)+{b}_{\mathrm{k}}\sin \left( k\omega t\right)\right] $$
(41)

The coefficients in formula (41) and the fitting effect can be obtained by MATLAB, as shown in Table 9.

Table 9 Failure data fitting parameters and effect

It can be seen from Tables 1 and 9 that there is a strong correlation between the first failure and the second failure.

The relationship between the fitting function and the failure data can be obtained by formula (41) and Table 9 as shown in Fig. 22.

Fig. 22
figure 22

Relevance between the first failure and second failure of machine tools

It can be seen from Table 9 and Fig. 22 that there is a simple or complex correlation between the former failure and the latter failure of machine tools, and the occurrence of the former failure will have an important impact on the latter failure. When establishing a failure model for CNC machine tools, the correlation between failures should be taken into account so as not to affect the accuracy of the analysis results.

6 Conclusion

Early failure seriously restricts the improvement of the quality of CNC machine tools. The existing research on early failure of CNC machine tools lacks a clear definition of early failure and does not classify early failures according to their different characteristics and assumes that all early failures obey the unified distribution. At the same time, the characteristics of early failure of CNC machine tools are not considered when establishing the early failure model. When analyzing multiple CNC machine tools, it is assumed that they have the same model parameters. In addition, the relationship between the former failure and latter failure is not considered when establishing the early failure model of CNC machine tools. The above reasons make the final early failure model of CNC machine tools not appropriate and accurate.

To address those problems, the early failures of CNC machine tools are divided into sudden early failures and progressive early failures herein, and they are strictly defined. Considering the unpredictability of sudden early failures, the analysis model of this kind of early failure is established by 5M1E method.

According to the randomness of progressive early failures, the failure occurrence time rather than failure interval time is taken as the analysis data, and the mathematical model of progressive early failures is established by BBIP method.

Machine tools made in China are analyzed as an example, and the conclusion that different product failures of the same model could not be analyzed by the same parameter model is obtained. The reliability of the product is also evaluated by its instantaneous reliability indexes and cumulative reliability indexes. The relationship between the former failure and the latter failure of CNC machine tools is discussed by simple correlation and complex correlation. It is concluded that the occurrence of the former failure will have an important impact on the latter failure.

The research results lay a foundation for the accurate establishment of CNC machine tool early failure model. The accuracy of the early failure model is the premise of the early failure mechanism analysis and early failure elimination measures establishment.

Therefore, the method proposed in this paper not only enriches the existing early failure analysis methods, but also provides a basis for shortening the early failure period of CNC machine tools and improving their reliability.