An Improved Fault Diagnosis Scheme Based on a Type-2 Fuzzy Classification Algorithms

Rodríguez-Ramos, Adrián; da Silva Neto, Antônio J.; Llanes-Santiago, Orestes

doi:10.1007/978-3-031-49552-6_8

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14335)

Included in the following conference series:

International Workshop on Artificial Intelligence and Pattern Recognition

Abstract

The Industry 4.0 paradigm aims to obtain high levels of productivity and efficiency, more competitive final products, and compliance with demanding industrial safety regulations. To achieve these objectives, industrial systems must be equipped with condition monitoring systems for the detection and isolation of faults. This paper presents the design of a fault diagnosis system with robust behavior for industrial plants using Type-2 fuzzy algorithms. To improve the classification, a kernel variant of the proposed algorithms is implemented to achieve better differentiation between classes. Several experiments were conducted (without noise, and with 2% and 5% noise levels) using the T2FCM, IT2FCM, KT2FCM, and KIT2FCM algorithms on the DAMADICS benchmark, obtaining excellent results.

1 Introduction

A main premise of the Industry 4.0 paradigm is to obtain high production levels with low operating expenses, improving the benefit-cost ratio [6, 10]. An important cause of increased operating expenses and reduced productivity in industrial plants is the occurrence of faults [3, 17].

Many research results on fault diagnosis in industrial systems have been published in the last two decades under two main approaches: model-based and data-based fault diagnosis [14, 15]. However, advances in the Internet of Things (IoT) and Big Data technologies have recently drawn more attention to the latter approach and produced more results with it [2, 11].

Several computational tools have been presented in scientific papers and books to improve the performance of industrial fault diagnosis systems [7, 8]. However, the need to develop new strategies remains open because the results depend on the type of industrial plant analyzed.

The training stage of a data-based supervised fault diagnosis system is decisive for achieving the best online performance. To obtain better results in training, the different classes that represent the operation of the industrial plant have to be very well identified [16]. However, this is a very complex task because industrial measurements are affected by uncertainties arising from external disturbances and noise [18].

Type-2 fuzzy sets are used to overcome some of the difficulties that type-1 fuzzy sets have in dealing with the uncertainty that characterizes industrial processes due to noise and external disturbances. In type-1 fuzzy sets, the membership degree is a crisp number, whereas in type-2 fuzzy sets, the membership degree is itself a type-1 fuzzy number. The goal is that higher membership values contribute more than smaller ones when the cluster centers are updated [19, 20]. In this paper, a fault diagnosis methodology based on type-2 fuzzy classification algorithms is presented.

The main contribution of this paper is a condition monitoring scheme that is robust to external disturbances and noise. To this end, a scheme based on Type-2 fuzzy sets is presented. To reduce misclassification, a kernel variant of the proposed algorithms is implemented to achieve better differentiation between classes. The proposal exhibits high performance in the presence of noisy observations.

2 Materials and Methods

2.1 Type-2 Fuzzy C-Means Algorithm (T2FCM) and Kernelized T2FCM (KT2FCM)

To update the cluster centers in T2FCM, the weighted mean of all observations is used [19]. The type-2 membership values are obtained as follows:

$$\begin{aligned} a_{ik} = u_{ik} - \frac{1-u_{ik}}{2} \end{aligned}$$

(1)

where $a_{ik}$ and $u_{ik}$ are the type-2 and type-1 memberships, respectively. The cluster centers are updated as in traditional FCM, but using the new type-2 fuzzy membership. Although T2FCM has proven effective for spherical data, it fails when the structure of the input patterns is non-spherical. One way to increase the accuracy of T2FCM is to use a kernel function to calculate the distance between data points and cluster centers, i.e., to map the data points from the input space to a high-dimensional feature space. This yields better separability among classes and improves the classification results. The KT2FCM algorithm minimizes the following objective function:

$$\begin{aligned} J_{KT2FCM} \,=\, \sum _{i=1}^{l}\sum _{k=1}^{N}a_{ik}^{*m}\left\| \mathbf {\Psi (z_{k})}-\mathbf {\Psi (v_{i})}\right\| ^{2} \end{aligned}$$

(2)

where $\left\| \mathbf {\Psi (z_{k})}-\mathbf {\Psi (v_{i})}\right\| ^{2}$ is the squared distance between $\mathbf {\Psi (z_{k})}$ and $\mathbf {\Psi (v_{i})}$ in the feature space. This distance is computed through the kernel in the input space as:

$$\begin{aligned} \left\| \mathbf {\Psi (z_{k})}-\mathbf {\Psi (v_{i})}\right\| ^{2} = & {} \mathbf {K(z_{k},z_{k})}- \mathbf {2K(z_{k},v_{i})}\nonumber \\ {} & {} + \mathbf {K(v_{i},v_{i})} \end{aligned}$$

(3)

Many kernel functions can be found in the literature, and the most appropriate one depends on the application [13]. Nonetheless, the most widely used is the Gaussian Kernel Function (GKF).

If the GKF is used, then $\mathbf {K(z,z) = 1}$ and $\left\| \mathbf {\Psi (z_{k})}-\mathbf {\Psi (v_{i})}\right\| ^{2} = \mathbf {2\left( 1-K(z_{k},v_{i})\right) }$. So, Eq. (2) can be expressed as:

$$\begin{aligned} J_{KT2FCM} = 2\sum _{i=1}^{l}\sum _{k=1}^{N}a _{ik}^{*m}\left( 1-\mathbf {K(z_{k},v_{i})}\right) \end{aligned}$$

(4)

where,

$$\begin{aligned} \mathbf {K(z_{k},v_{i})} = e^{-\left\| \textbf{z}_{k}-\textbf{v}_{i}\right\| ^{2}/\delta ^{2}} \end{aligned}$$

(5)

where $\delta $ is the bandwidth, which controls the smoothness of the GKF. Minimizing Eq. (4) yields:

$$\begin{aligned} a _{ik}^{*} = \frac{1}{\sum _{j=1}^{l}\left( \frac{1-\mathbf {K(z_{k},v_{i})}}{1-\mathbf {K(z_{k},v_{j})}}\right) ^{1/\left( m-1\right) }} \end{aligned}$$

(6)

$$\begin{aligned} \textbf{q}_{i} = \frac{\sum _{k=1}^{N}\left( a_{ik}^{*m}\mathbf {K(z_{k},v_{i})z_{k}}\right) }{\sum _{k=1}^{N}a_{ik}^{*m}\mathbf {K(z_{k},v_{i})}} \end{aligned}$$

(7)
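As a minimal sketch of how Eqs. (1) and (5)-(7) could be evaluated, the following pure-Python code computes the Gaussian kernel, the memberships of one observation, and one update of the cluster centers. The function names, the small guard `eps`, and the toy data are illustrative assumptions and are not taken from the paper; the center symbols $\mathbf{v}_{i}$ and $\textbf{q}_{i}$ are treated here as the same quantity.

```python
import math


def gaussian_kernel(z, v, delta):
    """Eq. (5): K(z, v) = exp(-||z - v||^2 / delta^2)."""
    sq_dist = sum((zi - vi) ** 2 for zi, vi in zip(z, v))
    return math.exp(-sq_dist / delta ** 2)


def type2_membership(u):
    """Eq. (1): type-2 membership a obtained from a type-1 membership u."""
    return u - (1.0 - u) / 2.0


def kernel_memberships(z, centers, delta, m=2.0, eps=1e-12):
    """Eq. (6): memberships of one observation z to every cluster center."""
    # Kernel-induced distance term 1 - K(z, v_i). The eps guard (an assumption,
    # not part of the paper) avoids division by zero when z equals a center.
    d = [max(1.0 - gaussian_kernel(z, v, delta), eps) for v in centers]
    p = 1.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_j) ** p for d_j in d) for d_i in d]


def update_centers(data, centers, delta, m=2.0):
    """Eq. (7): kernel-weighted mean of the observations for each center."""
    memberships = [kernel_memberships(z, centers, delta, m) for z in data]
    new_centers = []
    for i, v in enumerate(centers):
        weights = [memberships[k][i] ** m * gaussian_kernel(z, v, delta)
                   for k, z in enumerate(data)]
        total = sum(weights)
        new_centers.append([
            sum(w * z[j] for w, z in zip(weights, data)) / total
            for j in range(len(v))
        ])
    return new_centers


# Toy data (hypothetical): two well-separated groups in two dimensions.
data = [[0.0, 0.0], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9]]
centers = [[0.5, 0.5], [4.5, 4.5]]
for _ in range(20):
    centers = update_centers(data, centers, delta=2.0)
```

The memberships returned by `kernel_memberships` sum to one over the clusters, as in standard FCM. A stopping rule based on the change of the centers could replace the fixed number of iterations.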

2.2 Interval Type-2 Fuzzy C-Means Algorithm (IT2FCM) and Kernelized IT2FCM (KIT2FCM)

The parameter m is crucial in fuzzy clustering algorithms because it determines the uncertainty of the partition matrix. Nevertheless, it is not easy to decide the value of m in advance. IT2FCM treats the fuzzification coefficient as an interval [$m_{1}$, $m_{2}$] and minimizes the objective function [20]:

$$\begin{aligned} J_{IT2FCM} \,=\, \sum _{i=1}^{l}\sum _{k=1}^{N}u _{ik}^{*m}d_{ik}^{2} \end{aligned}$$

(8)

where the parameter m is replaced by $m_{1}$ and $m_{2}$, which represent different fuzzy degrees and yield objective functions different from that of FCM. Minimizing the objective function gives the upper and lower memberships [20]:

$$\begin{aligned} \overline{u_{i}}(k) \,=\, \max \left( 1/\sum _{j=1}^{l}(d _{ik}/d _{jk})^{2/(m_{1}-1)}, 1/\sum _{j=1}^{l}(d _{ik}/d _{jk})^{2/(m_{2}-1)} \right) \end{aligned}$$

(9)

$$\begin{aligned} \underline{u_{i}}(k) \,=\, \min \left( 1/\sum _{j=1}^{l}(d _{ik}/d _{jk})^{2/(m_{1}-1)}, 1/\sum _{j=1}^{l}(d _{ik}/d _{jk})^{2/(m_{2}-1)} \right) \end{aligned}$$

(10)

where $d_{ik} = \left\| z_{k} - q_{i}\right\| $ is the distance between input pattern $z_{k}$ and cluster center $q_{i}$, and $\overline{u_{i}}(k)$ ($\underline{u_{i}}(k)$) is the upper (lower) membership of $z_{k}$ in the cluster with center $q_{i}$.

Unlike FCM, the output of the IT2FCM algorithm is an interval type-2 fuzzy set, which cannot be converted to a crisp set directly by a defuzzification operation. Type reduction is therefore executed as the first step of output processing: it computes the centroid of the type-2 fuzzy set and reduces it to a type-1 fuzzy set [9]. The interval-valued cluster centers are calculated as:

$$\begin{aligned} \widetilde{\textbf{q}_{i}} = [\widetilde{q}_{i,1}, \widetilde{q}_{i,2}]= \sum _{u_{i1}}\cdots \sum _{u_{iN}} 1 \Big / \frac{\sum _{k=1}^{N}u_{ik}^{m^{*}}z_{k}}{\sum _{k=1}^{N}u_{ik}^{m^{*}}} \end{aligned}$$

(11)

where the sums run over the type-2 memberships defined above, $m^{*}$ switches between $m_{1}$ and $m_{2}$, and $\widetilde{q}_{i,1}$ and $\widetilde{q}_{i,2}$ are usually obtained with the Karnik-Mendel algorithm [5]. The kernel version of IT2FCM (KIT2FCM) is obtained in the same way as for T2FCM: the distance is calculated through the Gaussian Kernel Function (GKF).
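The upper and lower memberships of Eqs. (9) and (10) can be illustrated with a short sketch. It assumes that the distances $d_{ik}$ from one observation to all cluster centers are already available and strictly positive; the function names and the example values are hypothetical, and the type-reduction step of Eq. (11) is not covered.

```python
def fcm_membership(dists, i, m):
    """Type-1 FCM membership of one observation in cluster i for fuzzifier m."""
    p = 2.0 / (m - 1.0)
    return 1.0 / sum((dists[i] / d_j) ** p for d_j in dists)


def interval_memberships(dists, m1, m2):
    """Eqs. (9)-(10): (lower, upper) membership of one observation per cluster.

    dists holds the strictly positive distances d_ik to every cluster center.
    """
    intervals = []
    for i in range(len(dists)):
        u1 = fcm_membership(dists, i, m1)
        u2 = fcm_membership(dists, i, m2)
        intervals.append((min(u1, u2), max(u1, u2)))
    return intervals


# Hypothetical example: one observation at distances 1 and 2 from two centers.
intervals = interval_memberships([1.0, 2.0], m1=1.5, m2=3.0)
```

For the closer center the interval is bounded above by the membership computed with $m_{1} = 1.5$, and for the farther center by the one computed with $m_{2} = 3$, which shows why both the maximum and the minimum are needed.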

2.3 Proposed Methodology

The proposed classification scheme for Fault Detection and Isolation (FDI) is shown in Fig. 1. It comprises an offline training phase and an online recognition phase. In the first phase, the fuzzy classifier is trained using a training database built from historical process data. In the online phase, the classifier analyzes each observation collected from the process, and the result informs the operator about the state of the system in real time. Training is the most important stage, since it determines the center of each class that represents the operation of the process, whether in normal operation or in the presence of faults.

Offline Training Phase. In this phase, the FDI system is trained with a set of historical data that contains the necessary information on each known operating state or class of the industrial plant (normal operation condition (NOC) and fault states). The main aim of the training process is to determine the centers of the known classes $\textbf{Q} = \{\textbf{q}_{1},\textbf{q}_{2}, \ldots , \textbf{q}_{c}\}$, which are then used in the online recognition stage.

On-Line Recognition Phase. In this phase, the class to which each observation k belongs is determined at each time instant. First, the distance between the observation and the class centers determined in the offline stage is computed. Subsequently, the degree of membership of observation k in each class is obtained. The observation is assigned to the class with the highest degree of membership (see Algorithm 1).
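The three steps of the recognition phase can be sketched as follows. This is not a reproduction of Algorithm 1: it assumes Euclidean distances and a type-1 FCM-style membership for simplicity, whereas the variants described above would use the type-2 or kernel-based memberships instead. The function name, labels, and centers are hypothetical.

```python
import math


def classify(observation, centers, labels, m=2.0, eps=1e-12):
    """Assign one observation to the class with the highest membership."""
    # Step 1: distance from the observation to every class center
    # (eps is an assumed guard against a zero distance).
    dists = [max(math.dist(observation, q), eps) for q in centers]
    # Step 2: degree of membership of the observation in each class.
    p = 2.0 / (m - 1.0)
    memberships = [1.0 / sum((d_i / d_j) ** p for d_j in dists)
                   for d_i in dists]
    # Step 3: pick the class with the highest degree of membership.
    best = max(range(len(centers)), key=memberships.__getitem__)
    return labels[best], memberships


# Hypothetical centers for two operation modes.
label, memberships = classify([0.1, 0.0], [[0.0, 0.0], [5.0, 5.0]],
                              ["NOC", "F1"])
```

`math.dist` requires Python 3.8 or later.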

2.4 Case Study: DAMADICS

To validate the proposed methodology, it was applied to the DAMADICS benchmark, which represents an intelligent electro-pneumatic actuator widely used in industry [1]. The diagram of this actuator is shown in Fig. 2. Table 1 and Fig. 3 (with 300 observations per class) show the operation modes evaluated in the actuator and the measured variables used. The selected faults occur in different parts of the actuator and were chosen to test the robustness of the diagnostic system.

Table 1. Operation modes and measured variables in DAMADICS.

2.5 Design of Experiments

Table 2 shows the characteristics of the training database used, which is free of outliers, noise, and missing variables. The values of the parameters used for the applied algorithms were: $\epsilon $ = $10^{-5}$, m = 2, $\sigma $ = 50. The parameters were taken from [12].

Table 2. Characteristics of the training database.

K-fold cross-validation with K = 5 was used for training (800 observations) and validation (200 observations). In the experiments of the online phase, 2400 observations were used (400 new observations of each operation mode, not used in training). Each experiment was replicated 100 times to ensure repeatability, and the average of the 100 results was taken as the final result. To evaluate the robustness of the proposal, three experiments were carried out:

1. Observations without noise.
2. Observations with 2% of noise level.
3. Observations with 5% of noise level.
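The 5-fold cross-validation split described above can be sketched as follows. The total of 1000 observations is inferred from the reported 800 training and 200 validation observations; the function name, the shuffling, and the seed are assumptions, since the paper does not describe how the folds were drawn.

```python
import random


def k_fold_indices(n_obs, k=5, seed=0):
    """Yield (train, validation) index lists for K-fold cross-validation."""
    indices = list(range(n_obs))
    random.Random(seed).shuffle(indices)
    # Every k-th index goes to the same fold, so fold sizes differ by at most 1.
    folds = [indices[i::k] for i in range(k)]
    for held_out in range(k):
        validation = folds[held_out]
        train = [idx for f in range(k) if f != held_out for idx in folds[f]]
        yield train, validation


splits = list(k_fold_indices(1000, k=5))
```

Each of the five splits uses a different fold for validation, so every observation is validated exactly once.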

3 Discussion of Results

4 Online Recognition Stage

The confusion matrix (CM) was used to evaluate the performance of the proposed FDI system. The values $CM_{rs}$ for $ r \ne s$ show the number of observations of operation mode r that the classifier misclassifies as operation mode s.

Table 3 shows the confusion matrix (without noise in the measurements) for the operation states Normal Operation Condition (NOC), Fault 1 (F1), Fault 7 (F7), Fault 12 (F12), Fault 15 (F15) and Fault 19 (F19). The main diagonal contains the number of correctly classified observations. The accuracy of the classification process is obtained as TA = correctly classified observations / total observations. The average (AVE) of TA is displayed in the last row.
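The accuracy measures defined above can be computed from a confusion matrix as in the following sketch. Rows are taken to be the true operation mode r and columns the assigned mode s. The matrix values are invented for illustration and are not the results of Table 3; the function names are hypothetical.

```python
def per_class_accuracy(cm):
    """TA of each operation mode: diagonal entry divided by its row total."""
    return [row[r] / sum(row) for r, row in enumerate(cm)]


def average_accuracy(cm):
    """AVE: mean of the per-class TA values."""
    accuracies = per_class_accuracy(cm)
    return sum(accuracies) / len(accuracies)


# Illustrative two-class matrix with 400 observations per class.
cm = [[390, 10],
      [20, 380]]
ta = per_class_accuracy(cm)
ave = average_accuracy(cm)
```

With equal class sizes, as in the experiments described here, the mean of the per-class values equals the overall fraction of correctly classified observations.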

Figure 4 shows the classification results for the different operation modes (NOC and faults 1, 7, 12, 15, 19) using the T2FCM, IT2FCM, KT2FCM and KIT2FCM algorithms on the DAMADICS process, as the classification percentage obtained for each data set. Figure 5 displays the global classification percentage obtained for each algorithm (without noise, and with 2% and 5% noise levels).

Table 3. Confusion matrix for the DAMADICS process (NOC: 400, F1: 400, F7: 400, F12: 400, F15: 400, F19: 400)

4.1 Statistical Tests

Since several algorithms are used, statistical tests should be applied to compare their performance [4]. The Friedman test can be used to establish whether the differences among the obtained performances are significant. If significant differences are found, pairwise comparisons should be carried out to find the best classifier. In this case, the Wilcoxon test was used.

Friedman Test. Applying the test for $k = 4$ algorithms and $N = 10$ datasets, the value obtained for the Friedman statistic is $F_{F} = 241$. $F_{F}$ is distributed according to the F distribution with $k-1=3$ and $(k-1)\times (N-1)=27$ degrees of freedom. From the F distribution table, the critical value F(3,27) for $\alpha =0.05$ is 2.9604. Since $F_{F}$ exceeds this critical value, the null hypothesis is rejected, which means that there are significant differences among the obtained performances.
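For reference, a statistic with these degrees of freedom is commonly computed from the average ranks of the algorithms with the Iman-Davenport correction of the Friedman chi-square. The sketch below assumes that this is the form used; the paper does not state the formula, and the average ranks in the example are hypothetical rather than the ones behind $F_{F} = 241$.

```python
def friedman_ff(avg_ranks, n_datasets):
    """Iman-Davenport F_F statistic from the average ranks of k algorithms."""
    k = len(avg_ranks)
    n = n_datasets
    # Friedman chi-square computed from the average ranks R_j.
    chi2 = 12.0 * n / (k * (k + 1)) * (
        sum(r ** 2 for r in avg_ranks) - k * (k + 1) ** 2 / 4.0
    )
    # Correction distributed as F with (k - 1) and (k - 1)(n - 1) d.o.f.
    # The denominator is zero if all datasets rank the algorithms identically.
    return (n - 1) * chi2 / (n * (k - 1) - chi2)


# Hypothetical average ranks of 4 algorithms over 10 datasets.
ff = friedman_ff([3.5, 3.5, 1.5, 1.5], n_datasets=10)
reject_null = ff > 2.9604  # critical value F(3, 27) at alpha = 0.05
```

For these ranks the chi-square term is 24 and the corrected statistic is 36, which is above the critical value quoted in the text.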

Wilcoxon Test. Table 4 shows the results of applying the Wilcoxon test (A1: T2FCM, A2: IT2FCM, A3: KT2FCM, A4: KIT2FCM). The first row displays the sum of the positive ranks $R^{+}$, and the second row displays the sum of the negative ranks $R^{-}$ obtained from each comparison. The values of the T statistic and its critical values for a significance level $\alpha =0.05$ are shown below them. Finally, the winning algorithm is shown for each comparison. Table 5 shows that KT2FCM and KIT2FCM obtain the best results.
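The quantities $R^{+}$, $R^{-}$ and T for one pairwise comparison can be obtained as in the following sketch. It assumes that positive differences are those where the first algorithm scores higher, that zero differences are discarded, and that tied absolute differences receive their average rank; the paper does not specify these conventions. The scores are hypothetical.

```python
def wilcoxon_rank_sums(scores_a, scores_b):
    """Return (R+, R-, T) for paired scores of two algorithms."""
    # Zero differences are dropped (an assumed convention).
    diffs = [a - b for a, b in zip(scores_a, scores_b) if a != b]
    order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    pos = 0
    while pos < len(order):
        # Find the run of tied absolute differences starting at pos.
        end = pos
        while (end + 1 < len(order)
               and abs(diffs[order[end + 1]]) == abs(diffs[order[pos]])):
            end += 1
        average_rank = (pos + end) / 2.0 + 1.0
        for j in range(pos, end + 1):
            ranks[order[j]] = average_rank
        pos = end + 1
    r_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    r_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return r_plus, r_minus, min(r_plus, r_minus)


# Hypothetical paired scores on four datasets.
r_plus, r_minus, t_stat = wilcoxon_rank_sums([5, 7, 9, 1], [4, 5, 6, 5])
```

The null hypothesis of equal performance is rejected when T is less than or equal to the critical value for the number of compared datasets.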

Table 4. Results of the Wilcoxon test

Table 5. Algorithm comparison summary

5 Conclusions

This paper presented the design of a fault diagnosis system with robust behavior using type-2 fuzzy classification algorithms. The main contribution of the proposal was the application of the theory of Type-2 fuzzy sets to overcome the effect of the uncertainties that characterize industrial processes due to noisy observations and external disturbances.

The capacity of kernel functions to discriminate better among the operation modes, reducing misclassification, was demonstrated in the experiments. The proposed FDI scheme was successfully validated using the DAMADICS process benchmark.

References

1. Bartys, M., Patton, R., Syfert, M., de las Heras, S., Quevedo, J.: Introduction to the DAMADICS actuator FDI benchmark study. Control Eng. Pract. 14, 577–596 (2006)
2. Chi, Y., Dong, Y., Wang, Z., Yu, F., Leung, V.: Knowledge-based fault diagnosis in industrial internet of things: a survey. IEEE Internet Things J. 9(15), 12886–12900 (2022). https://doi.org/10.1109/JIOT.2022.3163606
3. Fernandes, M., Corchado, J., Marreiros, G.: Machine learning techniques applied to mechanical fault diagnosis and fault prognosis in the context of real industrial manufacturing use-cases: a systematic literature review. Appl. Intell. 52, 14246–14280 (2022). https://doi.org/10.1007/s10489-022-03344-3
4. García, S., Molina, D., Lozano, M., Herrera, F.: A study on the use of non-parametric tests for analyzing the evolutionary algorithms behavior: a case study on the CEC’2005 special session on real parameter optimization. J. Heuristics 15, 617–644 (2009)
5. Karnik, N., Mendel, J.M.: Centroid of a type-2 fuzzy set. Inf. Sci. 132, 195–220 (2001)
6. Lasi, H., Fettke, P., Kemper, H.: Industry 4.0. Bus. Inf. Syst. Eng. 6, 239–242 (2014). https://doi.org/10.1007/s12599-014-0334-4
7. Li, W., et al.: A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: theories, applications and challenges. Mech. Syst. Signal Process. 167, 108487 (2022). https://doi.org/10.1016/j.ymssp.2021.108487
8. Lv, H., Chen, J., Pan, T., Zhang, T., Feng, Y., Liu, S.: Attention mechanism in intelligent fault diagnosis of machinery: a review of technique and application. Measurement 199, 111594 (2022). https://doi.org/10.1016/j.measurement.2022.111594
9. Mendel, J.M., Liu, F.: Super-exponential convergence of the Karnik-Mendel algorithms for computing the centroid of an interval type-2 fuzzy set. IEEE Trans. Fuzzy Syst. 15(2), 309–320 (2007)
10. Popkova, E.G., Ragulina, Y.V., Bogoviz, A.V. (eds.): Industry 4.0: Industrial Revolution of the 21st Century. SSDC, vol. 169. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-94310-7
11. Quiñones-Grueiro, M., Verde, C., Prieto-Moreno, A., Llanes-Santiago, O.: An unsupervised approach to leak detection and location in water distribution networks. Int. J. Appl. Math. Comput. Sci. 28(2), 283–295 (2018). https://doi.org/10.2478/amcs-2018-0020
12. Rodríguez-Ramos, A., Javier-Ortiz, F., Llanes-Santiago, O.: A proposal of robust condition monitoring scheme for industrial systems. Computación y Sistemas 27(1), 223–235 (2023)
13. Rodríguez-Ramos, A., de Lázaro, J.B., Cruz-Corona, C., Neto, A.S., Llanes-Santiago, O.: An approach to robust condition monitoring in industrial processes using pythagorean memberships grades. Ann. Braz. Acad. Sci. 94(4), 1–22 (2022)
14. Rodríguez-Ramos, A., de Lázaro, J.B., Prieto-Moreno, A., Neto, A.S., Llanes-Santiago, O.: An approach to robust fault diagnosis in mechanical systems using computational intelligence. J. Intell. Manuf. 30(4), 1601–1615 (2019). https://doi.org/10.1007/s10845-017-1343-1
15. Torres, P.R., Mercado, E.S., Llanes-Santiago, O., Rifón, L.A.: Modeling preventive maintenance of manufacturing processes with probabilistic boolean networks with interventions. J. Intell. Manuf. 29, 1941–1952 (2018). https://doi.org/10.1007/s10845-016-1226-x
16. Verron, S., Tiplica, T., Kobi, A.: New features for fault diagnosis by supervised classification. In: 18th Mediterranean Conference on Control and Automation (MED’10) (2010)
17. Webert, H., Döß, T.D., Kaupp, L., Simons, S.: Fault handling in industry 4.0: definition, process and applications. Sensors (Basel) 22(6), 2205 (2022). https://doi.org/10.3390/s22062205
18. Wolpert, D., Macready, W.: No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1(1), 67–82 (1997). https://doi.org/10.1109/4235.585893
19. Yang, X., Yu, F., Pedrycz, W.: Typical characteristic-based type-2 fuzzy c-means algorithm. IEEE Trans. Fuzzy Syst. 29, 1173–1187 (2021)
20. Yin, Y., Sheng, Y., Qin, J.: Interval type-2 fuzzy c-means forecasting model for fuzzy time series. Appl. Soft Comput. 129, 1–7 (2022)

Acknowledgements

The authors acknowledge the financial support provided by FAPERJ, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro; CNPq, Conselho Nacional de Desenvolvimento Científico e Tecnológico; CAPES, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, research supporting agencies from Brazil; and the project PN223LH004-23 from the Science and Technology National Program in Automation, Robotic and Artificial Intelligence (ARIA) of the Ministry of Science, Technology and Environment (CITMA) of Cuba.

Author information

Authors and Affiliations

Universidad Tecnológica de la Habana José Antonio Echeverría, CUJAE, La Habana, Cuba
Adrián Rodríguez-Ramos & Orestes Llanes-Santiago
Instituto-Politécnico - Universidade do Estado do Rio de Janeiro, Nova Friburgo, RJ, Brazil
Antônio J. da Silva Neto

Corresponding author

Correspondence to Orestes Llanes-Santiago.

Editor information

Editors and Affiliations

Universidad de las Ciencias Informáticas, Havana, Cuba
Yanio Hernández Heredia
Universidad de las Ciencias Informáticas, Havana, Cuba
Vladimir Milián Núñez
Universidad de las Ciencias Informáticas, Havana, Cuba
José Ruiz Shulcloper

Cite this paper

Rodríguez-Ramos, A., da Silva Neto, A.J., Llanes-Santiago, O. (2024). An Improved Fault Diagnosis Scheme Based on a Type-2 Fuzzy Classification Algorithms. In: Hernández Heredia, Y., Milián Núñez, V., Ruiz Shulcloper, J. (eds) Progress in Artificial Intelligence and Pattern Recognition. IWAIPR 2023. Lecture Notes in Computer Science, vol 14335. Springer, Cham. https://doi.org/10.1007/978-3-031-49552-6_8

DOI: https://doi.org/10.1007/978-3-031-49552-6_8
Published: 20 December 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49551-9
Online ISBN: 978-3-031-49552-6
eBook Packages: Computer Science, Computer Science (R0)
