1 Introduction

Machinery health monitoring is gaining more importance in the industry because of the need to improve reliability and reduce the risk of production loss due to machinery breakdowns. The use of acoustic and vibration signals is common in the domain of condition monitoring of both structures and rotating machinery. By comparing the vibration or acoustic signals of a machine running in healthy or normal and faulty conditions, identification of faults like rotor rub, unbalance of mass, failure of gears, misalignment of the shaft, and bearing faults is possible. These signal data can also be used to identify the incipient failures of the machinery components, using the online monitoring system, decreasing the prospect of catastrophic failure and downtime. For the detection of machinery-bearing faults, a variety of modern signal processing approaches are available. These approaches are utilized to examine bearing defect signal data utilizing time domain [1,2,3], frequency domain [4,5,6,7], and a combination of the two time–frequency-domain methods [8]. These approaches each have their own set of benefits and drawbacks. The presence of noise can distort fault data recorded in the form of vibration or acoustic signals for bearing problems. To detect early onset failures, a suitable signal processing approach is required [9, 10]. Tang et al. [11] explained the spatial localization of acoustic technique to identify the defect localization on rolling element bearing stationary outer race. Maczak and Jasinski [12] investigated local faults in gears using a model-based approach. Inturi et al. [13] described the experimental analysis of the three-stage gearbox under constant and fluctuating operating conditions with bearing faults using wavelet transform to predict the different statistical parameters. Further, based on the fault severity the identified fault is classified using a support vector machine. In continuation to this, the support vector machine is used to extract the features from a rotor vibration signal and acoustic emission to classify the misalignment presented in the paper [14]. The advantage of this method was the direct use of the time signal processing algorithms for the implementation of the diagnostic systems working online simply and faster.

Vibration-based analysis has been increasingly popular for diagnosing bearing faults in recent years. The rotating machine's characteristics are reflected in the vibration data, which is based on the premise that all elements of the system influence the vibration signal. As a result, analyzing vibration signal data is critical for ensuring that the equipment runs without failure. Unfortunately, these methods are not able to provide inherent data related to non-stationary signals. These methods give only a partial performance for machinery diagnostics. To overcome this problem, a signal processing technique called Wavelet analysis also called the time–frequency method has been introduced. The wavelet transform method is a variable window method, which uses a time-domain signal interval to find out the respective frequency components of the same signal. When compared to a raw vibration signal, it has been proven in recent years that the statistical measure of kurtosis detects the fault at an early stage. A value of kurtosis > 3 indicates the presence of defects [15]. In the identification and estimation of faults, the wavelet transform technique [16, 17] has been widely employed. The non-stationary signal is analyzed using the wavelet technique. The fixed window size in wavelet analysis is a drawback. A high-frequency component may be detected effectively in advance (with less relative error) than a low-frequency component by modifying the window size, and a low-frequency component can be identified well in comparison to a high-frequency component by adjusting the window size. In many cases, the wavelet method is used in combination with additional features, such as Gaussian/ exponential enveloped functions and support vector regressions [18, 19]. Tabrizi et al. [20] identified the small defects in the roller bearings by using the wavelet packet decomposition (WPD) and empirical mode decomposition (EMD) methods. De-noising techniques [21] are used to enrich the fault detection process. The wavelet packet transform method (WPT) is a simplification of the wavelet transform method and has been used in many signal processing applications such as de-noising and compression [22,23,24]. Spectral Kurtosis analysis and combined ensemble empirical mode decomposition (EEMD) is used for fault identification of rolling bearing [25]. The method of empirical mode decomposition (EMD) along with Hilbert–Huang transform (HHT) is used to obtain the features data of bearing defect signal, which can understand the condition monitoring of fault signal [26]. Li et al. [27] demonstrated twelve sensitive features of bearing fault signals for early fault identification. Both empirical mode decomposition and wavelet packet analysis are together used to get the features and are then given as inputs for Radial Basis Function Networks (RBFNs) to categorize different faults [28]. In the work, three different types of neural network models were used for fault analysis of bearing elements. The effectiveness of fault detection in the bearings using the deep neural network (DNN) technique and convolution neural network over ANN was studied [29,30,31] successfully and demonstrated the intelligent fault detection method of rotating machinery with huge test data. This study also highlighted the advantages of using deep neural networks over shallow networks. The same method was validated using data captured from the bearing and gearbox. Mutra et al. [32, 33] used a neural network-based surrogate model to explain the parametric identification of a high-speed rotor system supported on oil/oil-free bearings. An impulse is created whenever a rolling element passes through a local fault in a bearing. Because the impulse period is so short in comparison to the time between pulses, the induced energy will be spread out across a large frequency range. As a result, the impact pressures will excite numerous resonances of the bearing and the entire structure, and the FFT technique will be unable to determine the problem frequencies. Envelope detection is a technique for converting a bipolar input signal to a unipolar signal using a smoothing circuit for FFT analysis of system frequencies. [34]. Zhou et al. [35] demonstrated the identification of the fault characteristics of the motor base screw loosening through online vibration by using the similarity measurement theory. The corresponding amplitude and frequency characteristics of the curve were obtained from the FFT technique. Cai et al. [36] explained a new method for fault diagnosis for rolling bearings by combining the instantaneous spectrum estimation with fractional Fourier instantaneous spectrum (FRFT). Here the maximum kurtosis coefficient method is used to estimate the spectrum of the optimal fractional domain. Moshrefzadeh and Fasana [37] explained the lumped parameter model to identify the frequency response of the planetary gearbox with different faults in the bearings. Ho and Randall [38] proposed a method of envelope analysis using the Hilbert–Huang transform. Hilbert transform is applied in frequency bands where the signal-to-noise ratio is maximum. Rai and Mohanty [39] demonstrated bearing fault detection using Hilbert–Huang transform (HHT) method to capture characteristic defect frequencies of a rotating bearing. HHT provides multi-resolution in various frequency scales as it takes signal frequency with its variation in the time domain for consideration. However, it has an error in the estimation of defect frequencies of the bearings. To enhance the resolution of the Hilbert transforming frequency domain, the researchers used FFT of intrinsic mode functions (IMFs). Mutra and Srinivas [40] explained the identification of floating ring bearing coefficients from the vibration response using a modified particle swarm optimization scheme. Ali et al. [41] employed a soft computing method for both empirical mode decomposition and the artificial neural network (ANN) technique in automatic fault detection and Diagnosis (FDD) of bearings elements using vibration data is proposed. Presently, artificial neural networks [42, 43] have gained more attention in industrial automation applications. Also, a neural network is used for data processing and classification technique. Correspondingly, an artificial intelligence self-adaptive FDD system inspired by the genetic algorithm (GA) and nearest neighbor (NN) was studied in [44]. Literature studies show that online and automated monitoring can be achieved by incorporating the expert systems such as ANN Support Vector Machine (SVM) and k-nearest neighbor to detect faults in the machine. Even though roller bearing identification is not new, the proposed methodology simplifies the approach with the use of experimental data. The reliability and speed of identification are the main targets in the present work. Because of the high efficiency to identify similarities among large data matrices, ANN has gained popularity over its competitors. For hyperspectral image classification, a feature extraction method integrating principal component analysis and local binary pattern is presented in [45]. Vibration amplitude feature extraction from spectral imaging Convolution Deep Believe Network (CDBN) is proposed in [46]. To learn the extracted features CDBN along with Gaussian distribution is utilized. To minimize the sum of the economic cost and maximize customer satisfaction, a time-dependent split delivery green vehicle routing problem with multiple time windows is presented in [47]. To accelerate the convergence, a dynamic parameter adjustment mechanism by particle swarm optimization and the fuzzy is developed in [48]. To solve the large-scale traveling salesman problems the proposed method worked satisfactorily. Adaptive mathematical morphology spectrum [49] used in scale selection methods that depends on experimental parameters like noise ratio, fault frequencies, etc.

Despite all the above fault identification research, there are limited works that make use of experimental data directly to predict the fault locations in rotor-bearing systems accurately. The current research focuses on dynamic modeling and experimental research of a rotor-bearing system with common bearing defects. Extraction of fault features using the EMD method is developed and neural network model is employed to predict the parameters and an Adaptive Fuzzy-Neuro inference system is used to identify the bearing fault type. The contribution of this paper mainly includes (i) a multi-stage-prediction model for fault identification in roller bearings from non-stationary time series. (ii) Demonstrated prediction accuracy of various parameters using statistical features. (iii) Illustrated the efficiency of the proposed model to predict different types of fault types. (iv) A finite element model is developed to simulate fault signals in roller bearings. (v) Post-processing of time series data using statistical parameters is presented. (vi) Inverse model is developed to identify the operating parameters from the vibration response and the fault classification using ANFIS is the prime novelty in this paper.

The following is how the rest of the article is organized:

The mathematical model of the rotor-bearing system is described in Sect. 2. The architecture of a suggested system is discussed in Sect. 3. Section 4 explains the experimentation performed to collect the vibration data of bearings at various runtimes and the empirical mode decomposition (EMD) method, which results in the feature extraction and selection procedure. Section 5 presents the conclusion drawn with the future direction.

2 Mathematical modeling of the rotor-bearing system

The rotor model is simplified using the finite element (FE) method as shown in Fig. 1, with four degrees of freedom per node, which are two bending deflections (Ux, Uy) and their corresponding slopes (θx,θy).

Fig. 1
figure 1

FE model considered

The rotor system's equations of motion are as follows:

$$[M]\{ \ddot{q}\} + \left( {[C] - \omega [G]} \right)\{ \dot{q}\} + [K]\{ q\} = \{ F_{B} \}$$
(1)

where [M], [C], [K] are the effective mass, damping, and square stiffness matrices, and [G] is a skew-symmetric gyroscopic matrix, ω is rotational speed while {q} is a vector representing displacements. \(\{ F_{B} \} = \left[ {F_{{\text{u}}} + F_{x} F_{y} } \right]^{T} = \{ F_{{\text{u}}} \} + \{ F_{{\text{b}}} \}\) is the resultant force vector component. \({F}_{\mathrm{u}}\) is the unbalance force along with gravity force vector and \({F}_{\mathrm{b}}\) is the vector of roller bearing forces.

3 Methodology

3.1 Empirical mode decomposition (EMD)

Empirical mode decomposition is a nonlinear, non-stationary signal analysis technique that decomposes a signal into a series of full and nearly orthogonal intrinsic mode functions (IMFs). The IMF reflects oscillatory modes contained in the signal and functions as fundamental functions defined by the signal rather than predetermined kernels. A finite number of IMFs could be decomposed from the signal. The following definitions must be met by each IMF.

(1) The number of extreme and zero-crossings in the whole data set must be equal or deviate by no more than one.

(2) The mean value of the upper and lower envelopes is zero at any point.

The procedure for EMD and its algorithm is clearly explained in Fig. 2. The original signal is decomposed as follows after a series of computations.

$$x\left( t \right) = \mathop \sum \limits_{i = 0}^{n} c_{i} \left( t \right) + r_{n} \left( t \right)$$
(2)

where ci(t) is the sum of IMF levels and rn(t) is the final residue.

Fig.2
figure 2

Flow diagram of EMD process

EMD provides better diagnostic information about the system by analyzing its vibration response, sound, and acoustic emission signals than the conventional methods. IMFs use statistical characteristics to identify defects in machine elements' early stages.

3.2 Identification procedure via counter propagation neural network

Once the features are extracted, a machine learning approach can be conveniently adapted to identify the system parameters including faults. Several machine learning techniques such as support vector machines, neural networks, and neuro-fuzzy systems have been employed in this line. In the present work, the instar–outstar counter propagation neural network model is exploited. The concept of the Counter Propagation Neural Network (CPNN) was introduced by Robert Hecht-Nielsen[50]. This architecture is a novel combination of existing network types that provides versatility in using it. Figure 3 shows the network with inputs and outputs. The algorithm is provided with inputs in vector pairs as (x1,y1). CPNN learns to link an x vector on the input layer to a y vector on the output layer in this example. The CPNN will learn to approximate this mapping for every value of x in the range provided by the collection of training vectors if the connection between x and y can be represented by a continuous function, phi, such that y = Φ(x). Furthermore, if phi has an inverse, such that x is a function of y, the CPNN will learn the inverse mapping, x = Φ−1(y). The first and second layers, that is, the input layer and Kohonen layer form an instar model, and the second and third layers, the Kohonen layer and output layer constitute an outstar model [51]. In this network, the second and third layers perform Kohonen and Grossberg learnings, respectively. Thus it combines unsupervised Kohonen learning and supervised Grossberg learning. The CPNN is much faster than BPNN, and there is no chance of its weights getting trapped in the local minima. However, it may be inferior to the BPNN in mapping applications.

Fig. 3
figure 3

Architecture of CPNN

3.3 Adaptive neuro-fuzzy inference system (ANFIS)

This section provides a brief description and capabilities of the Adaptive Neuro-Fuzzy Inference System (ANFIS). ANFIS systems use the capabilities of fuzzy logic and neural networks. Fuzzy logic maps the input data to desired output with highly connected neural network weighted processing elements. The parameters of fuzzy inference systems are tuned by neural network learning methods. With the application of neural networks, ANFIS refines the fuzzy IF–THEN rules. Different rules cannot share the same output membership function. The basic fuzzy inference system consists of fuzzy rules, data-driven membership functions, and fuzzy reasoning to derive the outputs. Out of many FIS systems, the Takagi–Sugeno Fuzzy system has many applications due to its linguistic interpretable capability.

The learning algorithm for ANFIS is a hybrid algorithm that is a combination of gradient descent and least-squares methods. A hybrid learning algorithm extracts rules from input–output data of the systems that are being modeled. The neural network of ANFIS fine-tunes the rules with updating weights of the rules. Many research articles have a greater explanation of ANFIS. Identification of an optimal fuzzy model concerning the training data reduces to a linear least-squares estimation problem. The method follows in two steps: (i) the First step involves the extraction of an initial fuzzy model from input–output data by using a cluster estimation method incorporating all possible input variables. (ii) In the next step the important input variables are identified by testing the significance of each variable in the initial fuzzy model. This initial fuzzy model can be selected based on the fuzzy rules framed by either using the subtractive clustering technique or the grid partitioning method. In the current analysis, the subtractive clustering technique is used to identify the type of fault occurring in the rotor-bearing system. The cluster estimation technique helps in locating the cluster centers of the input–output data pairs. This in turn helps in the determination of the rules which are scattered in input–output space, as each cluster center is an indication of the presence of a rule.

3.4 Fault classification through ANFIS

The methodology for fault type identification by ANFIS is given in the below flow diagram. The experimental data collected from EMD and CPNN is divided into three data sets. The training data set is used for training the ANFIS, the Checking data set is used for checking the performance of ANFIS for variation in the data set and Validation data is used to find the outputs, i.e., type of fault from ANFIS. For fault type identification Sugeno type FIS with a subtractive clustering system is generated by providing input membership functions, Range of influence, and accept and reject ratios. The generated FIS is then trained with training data set. A training data set contains the first few columns as inputs and the last column as output (preidentified). A hybrid optimization method is used with an error tolerance is less than 1 × 10–2 and a max epoch of 100. If error tolerance is greater than 1 × 10–2 ANFIS needs to be re-trained with modifications in input membership functions, and range of influence for each input.

Once the training error is within the predefined error limit, ANFIS is checked for model data overfitting by training routine with checking data set. Generally, data overfit is identified by the trend of Root Mean Square Error (RMSE) curves. The model data overfit is identified by an increase in the error of checking data. If overfitting is observed then input parameters for the training of the ANFIS are altered, else ANFIS is validated with random input data. The percentage error in the output of ANFIS is calculated with the Actual output of random data. Figure 4 shows the flowchart for the fault classification through ANFIS.

Fig. 4
figure 4

Methodology of ANFIS for fault type identification

Figure 5 shows the methodology of identification with feature extraction from signal and learning process using input–output training data. The computational FE model is developed to validate the response of the present experimental model in terms of frequency response at different speeds and different bearing fault conditions. This comparison is explained in Sect. 4.5. The statistical parameters were extracted from the raw response signals obtained from the experimental test rig in different operating conditions. Further, an inverse model is developed using CPNN and identified operating parameters from the vibration response. Later, ANFIS is used to classify the fault condition in the system.

Fig. 5
figure 5

Flowchart of diagnostic procedure

4 Results and discussion

4.1 Construction of the experimental test rig and its components

The experimental test rig consists of a rotor-bearing system, driving unit, and hydraulic loading unit as shown in Fig. 6. The rig comprises a stainless steel shaft held between two single-row cylindrical roller bearings of SKFDPI N-304 type which are supported in bearing housings each at the end of the shaft. The single-phase motor is connected to the shaft with flexible coupling the specifications of the single-phase motor are depicted in Table 1. The speed of rotation using variable frequency drive from 500 to 3000 RPM can be achieved. A digital non-contact type tachometer is used to measure the rotational speed of the rotor. On the test bearing a radial load is applied with the help of the hydraulic loading arrangement. A data acquisition system is selected to monitor and acquire the vibration and acoustic signals. An OROS of 34 series, 4 channel, and 24-bit compact analyzer is employed in this research. It was employed to record the vibration signals at desired sampling rates. The hardware is comprised of 3 parts: (1) Piezoelectric accelerometer (2) Charge amplifier (3) Analyzer. An accelerometer was connected to the compact analyzer through a charge amplifier. The analyzer was further linked to a computer. The amplifier is very much essential to amplify the signals from an accelerometer that are generally weak. The amplifier also separates the sensor from the analyzer and display apparatus. A piezoelectric accelerometer with a sensitivity of 105 mV/g is positioned near the test bearing to capture vibration signals. This acceleration transducer takes measurements in the zone of highest load, and the frequency-domain signal is presented when linked to a computer through a Fast Fourier Transform (FFT) analyzer. Because faults on the inner races or rolling components provide the greatest risk of exposure, the current experimental study includes these forms of defects as well. With the use of an Electrical Discharge Machine, faults are recreated by creating flaws on the races and rollers of the bearing. Thermal sensors are used to measure temperature change. The tests were carried out on artificially damaged bearings with the flaws depicted in Fig. 7. For ease of understanding let us consider Fault-1 is the surface fault on Bearing Roller, Fault-2 is the surface fault on the bearing inner race and Fault-3 is the surface fault on the bearing outer race. Different operating parameters such as radial load, temperature, and unbalanced phase angle are used as variables in addition to bearing faults, and response is recorded at various operating speeds of interest. The data is then familiarized with several fault situations that are reproduced at 500, 1500, and 2500 rpm. The vibration response was recorded at 500 Hz in all cases; with the frequency band, three to five kHz chosen for simplicity of spectra of bearing vibrations.

Fig. 6
figure 6

Schematic diagram of an experimental setup

Table 1 Specification of the motor
Fig. 7
figure 7

Artificially generated defect on a roller bearing with 30% roller fault, b roller bearing with 60% inner race fault, and c roller bearing with 100% outer race fault

DPI N-304 roller testing bearing whose specifications are presented in Table 2.

Table 2 Specifications of the DPI testing bearing

Initially, the raw signals of the time domain and frequency response of the system from the experimental test rig are identified. Only in one case (100% of inner race fault) the raw signal of the time domain and frequency response of the system is shown in Fig. 8.

Fig. 8
figure 8

Time and frequency response of the experimental test rig (100% of inner race fault): a time domain and b frequency response

4.2 Time-domain analysis of the signal

Intrinsic mode functions for healthy and defective vibration signals are first obtained. Decomposition of time-domain signals for healthy and fault rotor-bearing systems are processed through the EMD algorithm and obtained ten intrinsic mode functions as shown in Fig. 9a–f.

Fig. 9
figure 9figure 9figure 9

The decomposed results of the vibration signals with EMD. a Healthy bearing signal, b Fault 1(roller bearing with 30% roller fault), c Fault 2 (roller bearing with 60% inner race fault), d Fault 3 (roller bearing with 100% outer race fault), e Fault 4 (roller bearing with 60% roller fault) and f Fault 5 (roller bearing with 100% inner race fault)

It is observed that the faults in the inner race and on the roller are much inflecting the response of the system compared to a fault in the outer race. From these plots, it is very difficult to identify the type of fault and expertise required in analyzing this signal data. It is discussed earlier operating conditions can vary in addition to a fault in bearings. This makes the identification methods to be robust.

4.3 Frequency-domain analysis of signal

It is a well-known fact that the first parameter to predict any fault is by analyzing the frequency content of the signal. The time-domain signal shown in Fig. 8 and IMFs shown in Fig. 9 as such does not provide any information about the fault. Further, Fig. 10a–f shows frequency response IMF1 for healthy and faulty condition bearings.

Fig. 10
figure 10

Frequency spectrum of IMFs for 0 to 500 Hz range. a No-fault (healthy bearing), b Fault 1(roller bearing with 30% roller fault), c Fault 2 (roller bearing with 60% inner race fault), d Fault 3(roller bearing with 100% outer race fault), e Fault 4 (roller bearing with 60% roller fault) and f Fault 5 (roller bearing with 100% inner race fault)

The primary peak frequency for a healthy bearing is 45 Hz, but numerous peaks emerge under various defective situations, and these peaks are caused by bearing defects. The bearing inner race and roller faults are much influencing the response of the system compared to outer race fault. As per the methodology, CPNN is used to extract the operating conditions from the time domain/IMF response statistical data. An inverse neural network model is developed to identify the different parameters such as radial load, temperature, fault number, and unbalanced phase angle from the known system response at different rotor speeds. Table 3 shows the experimental response data for various parameters. Training data consists of the signal characteristics namely statical parameters like mean, kurtosis, skewness, and peak to a peak value [52] as inputs, and corresponding operating parameters (radial load, temperature, fault type and unbalance phase angle) as outputs are employed a counter propagation neural network. A separate neural network identification model is used for different shaft speeds (Tables 4, 5).

Table 3 Experimental data at a rotor speed of 500 rpm
Table 4 Experimental data at a rotor speed of 1500 rpm
Table 5 Experimental data at a rotor speed of 2500 rpm

Initially, training data consists of the central moments such as mean, kurtosis, skewness, and peak to peak as the inputs, and corresponding operating parameters are provided as outputs to train the network. After training the full CPNN with the instar–outstar learning rule, using adaptive learning rates, α, and β, the weights of hidden and output layers are obtained for an error tolerance of 1e−4. Here learning rate α = 0.4 and momentum factor β = 0.01 is employed throughout. Figure 11 shows the convergence trend with different hidden nodes to estimate the correct number of hidden nodes at a rotor speed of 500 rpm. The mean square error is low and converges at a faster rate when the number of hidden nodes is equal to five.

Fig. 11
figure 11

Convergence trend

Further, to show the capabilities of CPNN over BPNN a comparison was made for the convergence trend at three rotor speeds (500 rpm, 1500 rpm, and 2500 rpm) with five hidden nodes shown in Figs. 12, 13, and 14, respectively. BPNN is a common method and is reliable in the identification of random data analysis. Initially, training data consists of the central moments such as mean, kurtosis, skewness, and peak to peak as the inputs, and corresponding operating parameters are provided as outputs to train the network. The parameters such as learning rate of 0.6, momentum rate of 0.9, and activation function are sigmoid. The parameters are verified for the error tolerance of 1e−4.

Fig. 12
figure 12

Convergence trend (speed 500 rpm)

Fig. 13
figure 13

Convergence trend (speed 1500 rpm)

Fig. 14
figure 14

Convergence trend (speed 2500 rpm)

It is observed that in all three cases the CPNN is converging faster with less RMSE compared to BPNN. This reduces the computational time for the designers and helps in predicting accurate data. Table 6 depicts the test data, which includes three scenarios in which the targets are provided. The neural network's projected outcomes are compared to the measured values. In all three situations, the difference between anticipated and actual parameter values is less than three percent.

Table 6 Identified results

4.4 Fault identification by ANFIS

In the current work, the ANFIS methodology is described in the flow diagram shown in Fig. 4. MATLAB R2018b software is used for ANFIS simulations. Operating parameters like radial Load, bearing temperature, and unbalance phase angle are collected from CPNN output; statistical parameters of IMF like mean, kurtosis, skewness, and peak-to-peak variation are collected from EMD for different faults and speeds of the rotor-bearing system are provided as inputs and fault type is provided as output to ANFIS. A total of 81 samples are collected from experimental simulations of know faults. Out of 81 samples, that were used for training, 27 samples were used for checking and 27 were used for testing. 40 samples are collected randomly for validation of ANFIS (only input provided to ANFIS) for identification of fault type. Genfis2 surgeon type ANFIS generates FIS structure from data using subtractive clustering. The number of the input membership function is 26 for each input, which generates 27 fuzzy if–then rules, connected by T-norm (Fuzzy AND) operators. A hybrid learning algorithm is used in training ANFIS. The training of ANFIS is done by minimal error tolerance with a designated epoch number. The training process stops whenever the designated epoch number is reached or the training error goal is achieved. In the current case, we considered 1 × 10–2 errors with 100 epochs. The Trained ANFIS system can be evaluated with any other input data. The results of outputs from ANFIS are compared with checking and validated data. Table 7 shows the parameters used for ANFIS simulation. Figure 15 shows the data collected for training.

Table 7 ANFIS parameters
Fig. 15
figure 15

Training data set inputs and output

The data for training is collected randomly from the experimental results. All the 7 input and their know outputs are shown in the figure. From the figure, it is clear that the collected data is very random and is not structured. Similar data is collected for checking, validation, and random data to identify the capabilities of ANFIS for fault type.

The Generated ANFIS is trained with training and validation data. The Error for the 100 epoch is shown in Fig. 16. The Trend of both the error curves is decreasing trend with a training error of 0.0005913 and a validation error of 0.0004508. With these outputs, it can be concluded that the error is minimal than the target and show a better result even with limited data of 81 samples. Figure 17 shows the error curves for training and checking data.

Fig. 16
figure 16

Training error versus validation error

Fig. 17
figure 17

ANFIS model overfitting curves (training and checking data)

To identify the model overfitting of the ANFIS system, the ANFIS system is trained with training and checking data sets as input arguments. At close to the 55th epoch the checking data error shows an abnormal peak. This could be due to the number of data used for training being smaller than the number of modifiable parameters. The error before and after this peak is close by and maintains the same trend. So considering the model parameters before or after the peak does not have much variation. From the plot, it can be interpreted that the trained ANFIS system does not possess model over or data overfitting. As the results of error curves are promising, ANFIS is used to validate with random data. Forty samples of random experimental data are collected and only input parameters are provided with trained ANFIS to identify the fault type. Figure 18 shows the output comparison of ANFIS with experimental output for all three types of faults. Table 8 presents the error from the experimental output or actual output. The Error values are rounded off to the second digit. An error less than 0.05 is considered as matching output, an error between 0.05 to 0.1 is considered an alarming output, and an error above 0.1 is considered as strongly not matching or caution outputs. Out of 40 random sample inputs, only one prediction is strongly not matching with experimental or actual output. The Trained ANFIS is 100% accurate in predicting Fault type-1 (12 samples out of 12 are matching), 87.5% in predicting Fault type-2 (14 samples out of 16 are matching, one sample is strongly not matching), and 96% in predicting Fault type-3 (10 samples out of 12 are matching, two are close by).

Fig. 18
figure 18

ANFIS OUTPUT prediction

Table 8 Error in prediction of fault type by ANFIS

A mathematical model to simulate bearing fault is developed using finite element methods. Using EMD raw time series data from both finite element models and experiments are decomposed to their intrinsic functions. EMD natural processes the statistical features from the time series data. In this article, statistical features such as mean, kurtosis, skewness, peak to peak, variance, and standard deviation are employed in fault identification. Further, these statistical data are used in CPNN and ANFIS schemes for the effective detection of faults.

There are some limitations also presented in the above study while conducting the experiments the temperature change, misalignments, and rotor–stator rubbing. Selection and decision of statistical parameters used to train the neural network and number of trained data sets.

4.5 Validation

For the healthy bearing condition at a rotor speed of 2500 rpm, a comparison of the frequency response from the finite element (FE) model and the experimental model is performed. The frequency response from FE and the experimental model is shown in Fig. 19.

Fig. 19
figure 19

Frequency response at a rotor speed of 2500 rpm. a FE model and b experimental

The first dominating peak frequency from FE and experimental are 47.3 Hz and 46 Hz, respectively. The values are compared in terms of frequency these frequency values are reasonably coming close. The comparison of the dominant peak frequency for the healthy and different bearing fault conditions from the FE model and Experimental at various speeds are shown in Fig. 20.

Fig. 20
figure 20

Comparison of FE and experimental model. a Rotor speed 500 rpm, b rotor speed 1500 rpm, and c rotor speed 2500 rpm

It is observed that the identified dominant peak frequencies from FE and experimental are closer.

Further, with the finite element model, the time and frequency response with different bearing faults are identified. An interactive MATLAB code is developed to identify these responses. At a constant rotor speed of 15,000 rpm and a bearing inner defect, Fig. 21 illustrates the time and frequency response at the bearing location.

Fig. 21
figure 21

Time and frequency response at bearing node with inner race fault. a Time domain, b bearing vibration spectra, and c envelope spectrum

It is observed from the bearing vibration spectra, that for the faulty condition the initial two dominant peak frequencies are 5.99 Hz and 18.28 Hz. whereas the speed-dependent peak frequency is 184.7 Hz. It is seen in the amplitude difference for the healthy and damaged conditions from the envelope spectrum. Figure 22 depicts the time and frequency response at a bearing location with a roller defect at a constant rotor speed of 15,000 rpm.

Fig. 22
figure 22

Time and frequency response at bearing node with roller fault. a Time domain, b bearing vibration spectra, and c envelope spectrum

It is observed from the bearing vibration spectra, that for the faulty condition the initial two dominant peak frequencies are 6.94 Hz and 18.28 Hz. whereas the speed-dependent peak frequency is 240.8 Hz. Compare to the inner race faulty condition the first and third dominant peak frequencies are increased slightly. It is seen in the amplitude difference for the healthy and damaged conditions from the envelope spectrum. Figure 23 shows the time and frequency response at bearing location at a constant rotor speed of 15,000 rpm with outer race fault.

Fig. 23
figure 23

Time and frequency response at bearing node with outer race fault. a Time domain, b bearing vibration spectra, and c envelope spectrum

It is observed from the bearing vibration spectra, that for the faulty condition the initial two dominant peak frequencies are 8.59 Hz and 22.31 Hz. whereas the speed-dependent peak frequency is 265.9 Hz. Compare to inner race and roller faulty condition the dominant peak frequencies are increasing. It indicates the faults on the inner race and roller are much influencing the response of the system compared to the outer race fault. It is seen in the amplitude difference for the healthy and damaged conditions from the envelope spectrum.

5 Conclusions

This research paper proposed a methodology for parameter and fault type identification from vibration response through empirical mode decomposition (EMD), counter propagation neural network (CPNN), and adaptive neuro-fuzzy inference system (ANFIS). Initially, an experimental rotor-bearing model was established to study the dynamics of the system under different operating conditions. The experimental results are validated with FEM simulation results for a healthy rotor-bearing system. Both time-domain and frequency-domain results are compared and found that the results are with minimal deviation. For the healthy and defective circumstances, the empirical mode decomposition approach was utilized to break down the extracted signal into intrinsic mode functions. CPNN was used to predict the operating parameters and ANFIS Classifies and predicts the faults in roller bearing of rotor systems. The following were the concluding remarks of the present work:

  • It was observed from Intrinsic Mode Functions (IMF) that the faults on the inner race and the roller are much more influencing the response of the system compared to the fault on the outer race.

  • It was observed from frequency-domain analysis of IMF that the dominant peak frequency for the healthy bearing was 45 Hz, whereas for various faulty conditions the multiple peaks forming these peaks are due to bearing faults.

  • It was observed that the first dominant peak frequency in a healthy system from the finite element model and experimental are 47.3 Hz and 46 Hz. The values are compared in terms of frequency these frequency values are reasonably coming close.

  • CPNN with five hidden layer neurons perfectly generalize the input–output relationship in extraction or prediction of operating conditions.

  • CPNN was superior and faster compared to BPNN for feature prediction and extraction.

  • It was observed that the trained ANFIS is 100% accurate in predicting Fault type-1 87.5% in predicting Fault type-2 and 96% in predicting Fault type-3.

  • From the FE model also the inference from the frequency response was identified that the faults on the inner race and the roller were much influencing the response of the system compared to faults on the outer race.

  • The amplitude change also clearly demonstrates the faulty conditions.