A Robust Contextual Fuzzy C-Means Clustering Algorithm for Noisy Image Segmentation

Kalti, Karim; Touil, Asma

doi:10.1007/s00357-023-09443-1


Original Research
Published: 09 August 2023

Volume 40, pages 488–512, (2023)

Journal of Classification


Abstract

In this paper, we address the problem of the fuzzy c-means (FCM) algorithm sensitivity to noise when clustering image pixels. We propose in this regard an improved FCM algorithm that incorporates contextual information at the membership degrees updating stage. For that aim, we introduce two novel parameters: the contextual similarity degree and the intrinsic similarity degree which are used to estimate each pixel’s nature (normal or noisy), according respectively to its context and to its specific features. Based on this estimation, we propose a modified membership degrees updating strategy that proceeds by adaptively reinforcing the assignment of a pixel to its context’s cluster when this pixel is detected as noisy. Experiments performed on synthetic and real-world images proved that our approach achieves competitive performance compared to state-of-the-art FCM-based methods.


1 Introduction

Segmentation is a fundamental processing step in image analysis systems. Over the years, various segmentation approaches have been proposed in the literature. These approaches can be classified into three main categories: region-oriented approaches, contour-oriented approaches, and clustering approaches. Region- and contour-oriented approaches are based on techniques that have been developed specifically for image segmentation. They therefore benefit from image data properties, namely the spatial correlation. Segmentation by clustering, in contrast, is based on techniques that were originally designed for the analysis of any type of data and not specifically images. Even if these techniques give acceptable results when applied to image segmentation, their performance can be further improved if they are adapted to this particular task. Segmentation based on the FCM clustering algorithm (Bezdek, Ehrlich, and Full (1984)) is no exception. This algorithm has been widely used in the literature for grouping image pixels into regions. In its basic form, the assignment of a pixel to a cluster is based exclusively on the pixel’s inherent characteristics. The information related to the spatial correlation in the local neighborhood of each pixel is not taken into consideration in this process. Incorporating this important feature of the image signal can be of great interest, especially when dealing with noisy image segmentation (Choudhry, and Kapoor (2016); Liew, Leung, and Lau (2000)). Indeed, noise appears as pixels that look visually different from their neighbors. Despite this visual difference, they belong semantically to the same region as their neighbors. This configuration creates an ambiguity for classification algorithms, making it difficult for them to classify noisy pixels correctly. To overcome this issue, improved versions of FCM have been proposed in the literature.
These versions incorporate spatial information at different levels of the FCM algorithm: the objective function, the dissimilarity distance, and the membership degrees updating. In this paper, we propose a modified FCM algorithm that belongs to the last of these types of approaches, where the spatial information is incorporated at the membership degrees updating stage. Our proposal, called robust contextual fuzzy c-means (RCFCM), defines three types of local spatial contexts and introduces a novel readjustment factor that adaptively amends the membership degrees according to these contexts. Most existing approaches introduce spatial information through a uniform parameterized mathematical model. The adaptation to normal or noisy pixels is implemented via the model parameters, which are dynamically calculated depending on the context data. In contrast, our proposal uses rule-based modeling in which we define a specific processing strategy for each context type. This offers a greater adaptation ability that lets us cope better with non-linearly separable data, since normal and noisy pixels are processed differently, each with a dedicated model. Moreover, the RCFCM algorithm is easy to configure: apart from the size of the neighborhood to be considered, all the other parameters are estimated automatically.

The rest of the paper is organized as follows: The next section reviews the state of the art relevant to this work. Section 3 presents the conventional FCM algorithm and details its limits when dealing with noisy images. The novel approach is introduced in Section 4. Section 5 describes the experiments and the assessment of our proposal. Finally, the discussion of the obtained results and the conclusions are reported in Sections 6 and 7, respectively.

2 Related Work

To overcome the FCM sensitivity to noise, most algorithms try to incorporate spatial contextual information into the clustering process. The review of literature shows that such incorporation was addressed at three main levels.

The first one is the level of the objective function to optimize. The main idea in such approaches is to modify the conventional objective function of the FCM algorithm by adding a regularization term that quantifies the dissimilarity between a pixel and its neighbors in the feature space (Ahmed, Yamany, Mohamed, Farag and Moriarty (2002); Lei, Jia, Zhang, He, Meng, and Nandi (2018); Wang, Song, Soh, and Sim (2013); Wang, Wang, Fang, and Yang (2020c); Wang, Pedrycz, Li, and Zhou (2020a); Wang, Pedrycz, Zhou, and Li (2020b); Wang, Wang, Fang, and Jiao (2021); Zhang, Pan, Wu, Chen, Mao, and Wu (2020)). Pixels’ memberships and clusters’ centers expressions are obtained by optimizing this new objective function using the Lagrange multiplier technique. Most of these methods consider a single objective function. They tend to be effective for well-separated spherical clusters, but their performances decrease with more complicated cluster structures (Zhao, Liu, and Fan (2015)).

The second level of the spatial contextual information incorporation is that of the distance used to measure the dissimilarity between each pixel and the clusters’ centers (Ayech, El Kalti, and El Ayeb (2010); Despotovic, Vansteenkiste, and Philips (2013); Liew, Leung, and Lau (2000, 2003); Mohamed, Ahmed, and Farag (1998)). Most of the proposals made in this context use a weighted two-term distance, where the first term expresses the conventional distance of the pixel to a given cluster while the second term expresses a spatial distance. The latter represents the distance separating the pixel’s neighbors from the considered cluster. Weights control the influence given to each term. They can be statically fixed for all the pixels or dynamically calculated according to the local context of each pixel. The main difference between the proposals of this category lies in the way in which the neighbors are taken into account. We note here that the updating of the membership degrees and of the clusters’ centers remains identical to that of the standard FCM algorithm. It is also noteworthy that the spatial distances of the above approaches have been constructed around the Euclidean distance. These approaches lose performance when data are not linearly separable. To cope with this issue, kernel distance-based methods are used to project data into a higher-dimensional space and thus make them linearly separable. In Chang-Chien, Nataliani, and Yang (2021); Elhedda, Mehri, and Mahjoub (2020); Yang, Zhang, Lu, and Ma (2010), the authors introduced various kernel distances in the FCM algorithm. Other works, such as Elazab, Wang, Jia, Wu, Li, and Hu (2015); Yang, and Tsai (2008); Zhao, Jiao, and Liu (2013), combined the use of a kernel distance with the incorporation of spatial information into the cost function. The main shortcoming of these techniques is their high computational time.

The third level of spatial contextual information incorporation is that of the membership degrees updating function. In that case, the partition matrix is updated in such a way as to take into account the neighbors’ membership degrees. Two main approaches can be distinguished. In the first approach, the membership degrees of each pixel are replaced by new values obtained by combining them with those of its neighbors through a linear spatial function. The membership degrees of all the pixels are updated uniformly using this spatial function (Alruwaili, Siddiqi, and Javed (2020); Adhikari, Sing, Basu, and Nasipuri (2015); Shamsi, and Seyedarabi (2012); Li, and Li (2006); Chuang, Tzeng, Chen, Wu, and Chen (year)). In the second approach, the spatial context of each pixel is first analyzed. Then, according to this analysis, its membership degrees are amended using a readjustment factor. Conditional rules are often used to model the knowledge related to the amendment (Fan, Zhen, and Xie (2003); Tian, Yu, and Shen (2012); Tolias, and Panas (1998)).

It should be noted that, apart from the FCM framework, other fuzzy clustering techniques incorporating spatial information have been proposed in the literature. In Zhao, Liu, and Fan (2015), fuzzy clustering is modeled as a multi-objective optimization problem to satisfy multiple segmentation requirements. The authors proposed a multi-objective spatial fuzzy clustering algorithm for image segmentation that optimizes two objective functions. The first expresses the fuzzy compactness with spatial information, and the second expresses the fuzzy separation. In Zhao, Liu, Li, Liu, Lan, and Fan (2021), the authors proposed the use of two membership functions expressing, respectively, the local and non-local spatial information. The multi-objective optimization is implemented using evolutionary algorithms. In Kalaiselvi, and Gomathi (2020), the authors proposed a fuzzy deep neural network (FDNN) for change detection between multi-temporal images. The fuzzification layer of the FDNN retains spatial information and the variation of neighbor pixels in order to reduce the effect of speckle when classifying pixels as “changed” or “unchanged”.

3 Fuzzy C-Means Clustering

3.1 FCM Algorithm

The fuzzy c-means algorithm (FCM) is a popular technique used for unsupervised clustering of multivariate data. It extends the hard clustering K-means algorithm (Jain (2010)) by introducing fuzzy logic. Unlike the K-means algorithm, which assigns each data sample to one cluster, FCM assigns each sample to all the clusters with fuzzy membership degrees. The clustering is performed by iteratively minimizing a cost function based on a quadratic criterion that represents the weighted distance separating each sample from the clusters’ centers. This cost function, denoted $J_{\text {FCM}}$, is given by the following equation:

$$\begin{aligned} J_{\text {FCM}}(U,V) = \sum _{i=1}^{C}\sum _{j=1}^{N} \mu _{ij}^m \Vert x_j-v_i\Vert ^2, \end{aligned}$$

(1)

where

$x_j \in X$ refers to the feature vector of the $j^{th}$ sample,

$X=\{x_1, x_2, \dots , x_N\} \subset \mathbb {R}^p$ is the dataset of samples, N the size of X and p the dimension of the feature space,

$V = (v_1, v_2, \dots , v_C)$ is the vector of the clusters’ centers,

$v_i$ represents the $i^{th}$ cluster center and C is the number of clusters,

m is a fuzzification factor,

$\mu _{ij}$ refers to the membership degree of the sample j to the cluster represented by the center $v_i$,

$U=[\mu _{ij}]$ is the partition matrix. This matrix fulfills the two following constraints:

$$\begin{aligned} \mu _{ij} \in [0,1] \text { } \forall \text { } i,j, \end{aligned}$$

(2)

$$\begin{aligned} \sum _{i=1}^{C}\mu _{ij} = 1 \text { } \forall \text { } j . \end{aligned}$$

(3)

The FCM algorithm proceeds as follows: first, samples are assigned to the different clusters with random membership degrees. Clusters’ centers are then calculated using Eq. 4.

$$\begin{aligned} v_i = \frac{\displaystyle \sum _{k=1}^{N} \left( \mu _{ik}\right) ^m x_k}{\displaystyle \sum _{k=1}^{N} \left( \mu _{ik}\right) ^m} . \end{aligned}$$

(4)

Considering the newly calculated centers, the samples’ membership degrees are updated using Eq. 5.

$$\begin{aligned} \mu _{ij} = \frac{1}{\displaystyle \sum _{k=1}^{C} \left( \frac{\Vert x_j-v_i\Vert }{\Vert x_j-v_k\Vert }\right) ^\frac{2}{m-1}} . \end{aligned}$$

(5)

This iterative process is repeated until convergence which is reached when the change in the objective function between two consecutive iterations (t) and ($t+1$) is smaller than a given threshold $\epsilon $.

$$\begin{aligned} \Vert J_{\text {FCM}} (U^{(t+1)}, V^{(t+1)}) - J_{\text {FCM}} (U^{(t)}, V^{(t)}) \Vert < \epsilon . \end{aligned}$$

(6)
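The iteration described by Eqs. 1 and 4 to 6 can be sketched in a few lines of NumPy. This is only a minimal illustration under stated assumptions, not the authors' implementation: the function name `fcm`, the `max_iter` and `seed` arguments, and the small floor applied to the distances are all hypothetical choices made here.

```python
import numpy as np

def fcm(X, C, m=2.0, eps=1e-4, max_iter=100, seed=0):
    """Standard FCM on samples X of shape (N, p).

    Returns the partition matrix U of shape (C, N) and the centers V of shape (C, p).
    """
    rng = np.random.default_rng(seed)
    N = X.shape[0]
    # Random initial memberships; each column is normalised so that Eq. 3 holds.
    U = rng.random((C, N))
    U /= U.sum(axis=0, keepdims=True)
    J_prev = np.inf
    for _ in range(max_iter):
        Um = U ** m
        # Eq. 4: each center is the membership-weighted mean of the samples.
        V = (Um @ X) / Um.sum(axis=1, keepdims=True)
        # d2[i, j] = ||x_j - v_i||^2; the floor (an assumption) avoids division by zero.
        d2 = ((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)
        # Eq. 5 rewritten with squared distances:
        # mu_ij = d2_ij^(-1/(m-1)) / sum_k d2_kj^(-1/(m-1)).
        inv = d2 ** (-1.0 / (m - 1.0))
        U = inv / inv.sum(axis=0, keepdims=True)
        # Eq. 1 evaluated on the current (U, V), then the stopping test of Eq. 6.
        J = float(((U ** m) * d2).sum())
        if abs(J_prev - J) < eps:
            break
        J_prev = J
    return U, V
```

For a gray-level image `img`, the samples would be `img.reshape(-1, 1).astype(float)`, and the hard label of each pixel is `U.argmax(axis=0)`.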

3.2 Limitation of the FCM Algorithm in Noisy Pixel Clustering

The main drawback of the conventional FCM algorithm when dealing with image segmentation lies in its use of the individual pixels’ data without taking into consideration the local context represented by the spatial neighborhood. In images, data are spatially correlated. Hence, the incorporation of the context becomes advantageous to reduce FCM sensitivity to noise.

Figure 1 depicts an illustrative case of such a situation. It represents an image with two regions: black (0) and white (255). Applying the FCM algorithm to segment the two regions produces two well-separated clusters, one cluster per region. However, the FCM fails to correctly assign the noisy pixel denoted NP on the image. Because of its gray level value, this pixel is assigned to the white region cluster although it belongs spatially to the black region cluster.

Figure 2 shows another example of a noisy image with three samples of noisy pixels and their corresponding neighborhoods (marked by the red squares).

Figure 3 shows these neighborhoods in more detail (column a), the result of the clustering of the image into three classes using the standard FCM (column b), and the expected outcome when the local context is incorporated (column c). We note that when more than one cluster is present in the local context, noisy pixels take the label of the dominant cluster.

4 Proposed Approach

To overcome the above-mentioned limitations of the standard FCM algorithm when dealing with noisy images, we propose in this paper a novel approach that incorporates contextual information in the clustering process. This approach proceeds by readjusting the membership degrees after each iteration in such a way as to take into consideration the local context of each pixel. To set out our approach, we need to introduce first some key notions we have defined.

4.1 Key Notions

Our approach is mainly based on the notion of spatial context. A spatial context of a pixel j, denoted SC$_j$ is defined by the neighboring pixels belonging to a $L \times L$ square window centered on j. Given the membership degrees of j to the different clusters $\{\mu _{ij}, i:1..C\}$, we define two particular types of clusters: the pixel cluster and the context cluster.

Definition 1

: Pixel cluster

The pixel cluster of a pixel j, denoted P$_j$, is defined as the cluster to which j has the highest membership degree.

$$\begin{aligned} P_j = \arg \max _{i=1..C}(\mu _{ij}) . \end{aligned}$$

(7)

Definition 2

: Context cluster

The context cluster of a pixel j, denoted C$_j$, is defined as the cluster that is the pixel cluster of the largest number of neighbors of j in the local context.

$$\begin{aligned} C_j = \arg \max _{i=1..C}(f(i,j)) , \end{aligned}$$

(8)

where

$$\begin{aligned} f(i,j) = \sum _{k \in SC_j} w_{ik} \text {, where } {\left\{ \begin{array}{ll} w_{ik} = 1 &{} \text { if } P_k = i , \\ w_{ik} = 0 &{} \text { else} . \end{array}\right. } \end{aligned}$$

(9)
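Definitions 1 and 2 can be illustrated on a whole image at once. The sketch below assumes a membership array `U` of shape (C, H, W); the function name, the replicated-border padding, the inclusion of the central pixel in the vote (a literal reading of the sum over SC$_j$ in Eq. 9), and the tie-breaking toward the lowest cluster index are assumptions made here, since the text does not fix them at this point.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def pixel_and_context_clusters(U, L=3):
    """U: memberships of shape (C, H, W). Returns the label maps (P, Cctx)."""
    C = U.shape[0]
    # Eq. 7: pixel cluster = cluster with the highest membership degree.
    P = U.argmax(axis=0)
    # Assumption: image borders are handled by replicating edge pixels.
    r = L // 2
    P_pad = np.pad(P, r, mode="edge")
    win = sliding_window_view(P_pad, (L, L))  # shape (H, W, L, L)
    # Eq. 9: f(i, j) counts the window pixels whose pixel cluster is i.
    f = np.stack([(win == i).sum(axis=(2, 3)) for i in range(C)])
    # Eq. 8: context cluster = most represented cluster in the window
    # (argmax resolves ties toward the lowest index, an assumption).
    Cctx = f.argmax(axis=0)
    return P, Cctx
```

On an image where a single pixel prefers cluster 1 while all its neighbors prefer cluster 0, the isolated pixel gets pixel cluster 1 but context cluster 0, which is the configuration the noisy pixel rule relies on.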

Given these two particular types of clusters, we define two other key parameters: the contextual similarity degree and the intrinsic similarity degree.

Definition 3

: Contextual similarity degree

The contextual similarity degree of a pixel j, denoted $\alpha _j$, is defined as the proportion of neighbors of j having the same context cluster as j.

$$\begin{aligned} \alpha _j = \frac{\text {card}\left( k \in SC_j / k \ne j, C_j=C_k \right) }{L^2-1} . \end{aligned}$$

(10)

Definition 4

: Intrinsic similarity degree

The intrinsic similarity degree of a sample j denoted ($\beta _j$) is defined as the proportion of neighbors of j having the same pixel cluster as j.

$$\begin{aligned} \beta _j = \frac{\text {card}\left( k \in SC_j / k \ne j, P_j=P_k \right) }{L^2-1} . \end{aligned}$$

(11)
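Equations 10 and 11 share the same form: the fraction of the $L^2-1$ neighbors whose label equals that of the central pixel. A single helper can therefore serve both, applied to the context-cluster map for $\alpha$ and to the pixel-cluster map for $\beta$. The helper name and the replicated-border padding are assumptions of this sketch.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def similarity_degree(labels, L=3):
    """Fraction of the L*L - 1 neighbors sharing the central pixel's label.

    labels: integer label map of shape (H, W).
    Pass the context-cluster map to obtain alpha (Eq. 10),
    or the pixel-cluster map to obtain beta (Eq. 11).
    """
    r = L // 2
    # Assumption: borders are handled by replicating edge pixels.
    pad = np.pad(labels, r, mode="edge")
    win = sliding_window_view(pad, (L, L))  # shape (H, W, L, L)
    # Count matches in the window, then subtract 1 to exclude the pixel itself.
    same = (win == labels[:, :, None, None]).sum(axis=(2, 3)) - 1
    return same / (L * L - 1)
```

For an isolated pixel whose label differs from all eight neighbors in a $3 \times 3$ window, the helper returns 0, and for a pixel inside a uniform area it returns 1.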

4.2 Types of Pixels

In our approach, we distinguish three types of pixels, each requiring an adapted processing during clustering: noisy pixel, contour pixel, and region pixel. The distinction between these three types is mainly based on the analysis of the spatial context.

A noisy pixel (denoted NP) is a pixel that represents noise. It generally belongs to a cluster that is different from those of its neighbors. Based on the notions we have introduced, a noisy pixel will have a pixel cluster P$_{\text {NP}}$ that is different from the pixel clusters P$_k$ of its neighbors k. Hence, if we consider its spatial context, its context cluster will be different from its pixel cluster, and its $\beta _{\text {NP}}$ factor will be zero or very close to zero. For a given pixel j, the noisy pixel estimation rule can be expressed as follows:

$$\begin{aligned} j \text { IS NP} \text { IF } P_j \ne C_j \text { AND } \beta _j \approx 0 . \end{aligned}$$

A region pixel (denoted RP) is a normal pixel that belongs to a region. Its features are correlated to those of its neighbors, and consequently, it has the same pixel cluster P$_{\text {RP}}$ as its neighbors. Hence, its context cluster and its pixel cluster are the same. As a result, its contextual and intrinsic similarity degrees $\alpha _{\text {RP}}$ and $\beta _{\text {RP}}$ will be equal and both close to 1. For a given pixel j, the region pixel estimation rule can be expressed as follows:

$$\begin{aligned} j \text { IS RP} \text { IF } P_j = C_j \text { AND } \alpha _j \approx \beta _j \approx 1 . \end{aligned}$$

A contour pixel (denoted CP) is a pixel situated on a contour separating two or more regions. Its context generally contains pixels that are similar to it (same cluster) and others that belong to the cluster(s) of the bordering region(s). Its P$_{\text {CP}}$ and C$_{\text {CP}}$ may be equal or different depending on its position with respect to the contour. Unlike NP and RP, for a CP, the $\alpha _{\text {CP}}$ and $\beta _{\text {CP}}$ factors are close to neither 0 nor 1 but take values around 0.5. For a given pixel j, the contour pixel estimation rule can be expressed as follows:

$$\begin{aligned} j \text { IS CP} \text { IF } \alpha _j \approx 0.5 \text { AND } \beta _j \approx 0.5 . \end{aligned}$$
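The three estimation rules can be written as vectorised tests once a tolerance is chosen for the "$\approx$" comparisons. The text does not give such a tolerance, so the value `tol=0.25` below is purely hypothetical, as are the function name, the integer codes, the priority order (NP, then RP, then CP) and the extra `UNCLASSIFIED` code for pixels matching none of the rules.

```python
import numpy as np

# Hypothetical integer codes for the pixel types.
UNCLASSIFIED, NOISY, REGION, CONTOUR = 0, 1, 2, 3

def classify_pixels(P, Cctx, alpha, beta, tol=0.25):
    """Apply the NP / RP / CP rules element-wise.

    P, Cctx: pixel-cluster and context-cluster label arrays.
    alpha, beta: contextual and intrinsic similarity degrees.
    tol: hypothetical tolerance standing in for the "approximately equal" tests.
    """
    # j IS NP IF P_j != C_j AND beta_j ~ 0
    is_np = (P != Cctx) & (beta <= tol)
    # j IS RP IF P_j = C_j AND alpha_j ~ beta_j ~ 1
    is_rp = (P == Cctx) & (alpha >= 1 - tol) & (beta >= 1 - tol)
    # j IS CP IF alpha_j ~ 0.5 AND beta_j ~ 0.5
    is_cp = (np.abs(alpha - 0.5) <= tol) & (np.abs(beta - 0.5) <= tol)
    # np.select keeps the first matching rule, which fixes the priority order.
    return np.select([is_np, is_rp, is_cp], [NOISY, REGION, CONTOUR],
                     default=UNCLASSIFIED)
```

With a $3 \times 3$ context, $\alpha$ and $\beta$ are multiples of 1/8, so the chosen tolerance directly determines how many agreeing neighbors each rule requires.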

Figure 4 represents two portions of an image that has two regions, one black (label B) and the other white (label W). It gives some illustrations of the three types of pixels that we consider in this work (one RP, one NP, and four examples of CP). Table 1 gives the values taken by the parameters we defined for each of these pixels. For a sample j, these parameters concern, respectively, the pixel cluster P$_j$, the context cluster C$_j$, the contextual similarity degree $\alpha _j$, and the intrinsic similarity degree $\beta _j$. These values are consistent with our characterization of the various types of pixels. Note that in the case where $\alpha _j= \beta _j$ (case of CP1 in Fig. 4), C$_j$ takes the label of the central pixel (C$_j=$P$_j$).

4.3 Integration of the Contextual Information

The integration of the contextual information that we propose is performed at the level of the membership degrees updating. Hence, after the computation of the new membership degrees according to the standard FCM algorithm, these degrees are analyzed in order to estimate the type of each pixel (NP, RP, or CP). This analysis is based on the $\alpha _j$ and $\beta _j$ parameters as presented in the previous section. According to the type of the pixel in process, we propose a readjustment strategy that tries to compensate for the weakness of the standard FCM at this level.

Table 1 Pixel cluster, context cluster, intrinsic, and contextual similarity degrees of the illustrative pixels of Fig. 4


4.4 Updating Strategy for NP Pixels

If the pixel being processed is an NP, then its membership degree to its context cluster should be increased, while its membership degrees to the other clusters, including its pixel cluster, should be decreased. This updating strategy can be expressed for a given pixel j by the following rule:

$$\begin{aligned} \text {IF } j \text { is NP} \text { THEN } increase \text { } \mu _{c_jj} \text { AND } decrease \text { } \mu _{kj} . \end{aligned}$$

where $\mu _{c_jj}$ refers to the membership degree of the pixel j to its context cluster C$_j$ and $\mu _{kj}$ refers to the membership degrees of the pixel j to the other k clusters.

To implement this rule, we propose a readjustment that brings the membership degree of the pixel NP to an estimation of the context cluster membership degree. This estimation is given by the average of the membership degrees of a selection of representative neighbors which are not noisy and which belong to the context cluster of NP. A neighbor is representative when it has a higher membership degree to the context cluster than that of the pixel NP. Formally, for a given noisy pixel j and an iteration (t), this update is made using the readjustment factor $\eta _j$:

$$\begin{aligned} \eta _j^{(t)} = \left| \left( \frac{1}{N_c} \sum _{k \in V_{cj}} \mu _{c_kk}^{(t)}\right) - \mu _{c_jj}^{(t)}\right| , \end{aligned}$$

(12)

where

$$\begin{aligned} V_{cj} = \{x_k \in sc_j /p_k = c_j \text { and } \mu _{c_jk} > \mu _{c_jj}\} \text { and } N_c = \text {card}(V_{cj}) . \end{aligned}$$

The membership degree $\mu _{c_jj}$ is increased as follows:

$$\begin{aligned} \mu _{c_jj}^{(t+1)} = \mu _{c_jj}^{(t)}+ \eta _j^{(t)} . \end{aligned}$$

(13)

To fulfill the condition given by Eq. 3, the membership degrees of j to the other clusters are decreased proportionally to their importance as follows:

$$\begin{aligned} \mu _{lj}^{(t+1)} = \mu _{lj}^{(t)} - \frac{\mu _{lj}}{\displaystyle \sum _{\begin{array}{c} m=1..c \\ m\ne c_j \end{array}} \mu _{mj}} \eta _j^{(t)}, \text { } l: 1..C \text { and } l \ne c_j . \end{aligned}$$

(14)
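A single-pixel sketch of Eqs. 12 to 14 is given below. It reads the averaged quantity in Eq. 12 as the membership of each representative neighbor to the context cluster of j, which matches the definition of $V_{cj}$; the function name, the argument layout, and the choice to leave the pixel unchanged when $V_{cj}$ is empty are assumptions of this sketch.

```python
import numpy as np

def readjust(mu_j, target, mu_nbrs):
    """Readjust the memberships of one pixel toward the cluster `target`.

    mu_j:    memberships of pixel j, shape (C,), summing to 1.
    target:  index of the cluster to reinforce (the context cluster for an NP).
    mu_nbrs: memberships of the K neighbors of j, shape (K, C).
    """
    P_nbrs = mu_nbrs.argmax(axis=1)
    # Representative neighbors: pixel cluster equal to `target` and a higher
    # membership to `target` than pixel j itself.
    rep = (P_nbrs == target) & (mu_nbrs[:, target] > mu_j[target])
    if not rep.any():
        # Assumption: with no representative neighbor, nothing is changed.
        return mu_j.copy()
    # Eq. 12: gap between the neighbors' mean membership and that of pixel j.
    eta = abs(mu_nbrs[rep, target].mean() - mu_j[target])
    new = mu_j.copy()
    others = np.arange(mu_j.size) != target
    # Eq. 13: reinforce the target cluster.
    new[target] = mu_j[target] + eta
    # Eq. 14: remove eta from the other clusters, proportionally to their size.
    new[others] = mu_j[others] - mu_j[others] / mu_j[others].sum() * eta
    return new
```

As a worked case with three clusters, take `mu_j = [0.1, 0.6, 0.3]` and eight neighbors with memberships `[0.8, 0.1, 0.1]`. Then $\eta_j = |0.8 - 0.1| = 0.7$, the updated memberships are approximately `[0.8, 0.133, 0.067]`, and they still sum to 1 because the amount added to the target cluster is exactly the total removed from the others.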

4.5 Updating Strategy for CP Pixels

If the pixel being processed is a CP, its membership degree to its pixel cluster should be increased. To satisfy the constraint given by Eq. 3, its membership degrees to the other clusters should be decreased. This updating strategy can be expressed for a given pixel j by the following rule:

$$\begin{aligned} \text {IF } j \text { is CP} \text { THEN } increase \text { } \mu _{p_jj} \text { AND } decrease \text { } \mu _{kj} , \end{aligned}$$

where $\mu _{p_jj}$ refers to the membership degree of the pixel j to its pixel cluster $P_j$ and $\mu _{kj}$ refers to the membership degree of the pixel j to the remaining k clusters.

The readjustment factor that we propose for this updating strategy is given as follows:

$$\begin{aligned} \eta _j^{(t)} = \left| \left( \frac{1}{N_p}\sum _{k \in V_{pj}} \mu _{p_jk}^{(t)}\right) - \mu _{p_jj}^{(t)}\right| , \end{aligned}$$

(15)

$$\begin{aligned} V_{pj} = \{x_k \in sc_j /p_k = p_j \text { and } \mu _{p_kk} > \mu _{p_jj}\} , \end{aligned}$$

$$\begin{aligned} N_p = \text {card}(V_{pj}) . \end{aligned}$$

Then, the membership degrees are updated according to the following equations:

$$\begin{aligned} \mu _{p_jj}^{(t+1)} = \mu _{p_jj}^{(t)}+ \eta _j^{(t)} , \end{aligned}$$

(16)

$$\begin{aligned} \mu _{lj}^{(t+1)} = \mu _{lj}^{(t)} - \frac{\mu _{lj}}{\displaystyle \sum _{\begin{array}{c} m=1..c \\ m\ne p_j \end{array}} \mu _{mj}} \eta _j^{(t)}, \text { } l: 1..C \text { and } l \ne p_j . \end{aligned}$$

(17)

4.6 Updating Strategy for RP Pixels

For an RP, the context cluster and the pixel cluster are the same (C$_{\text {RP}}$ $ = $P$_{\text {RP}}$). This means that its assignment by the standard FCM is going in the right direction. For such pixels, our approach preserves their membership degrees without amendment.

4.7 Proposed Clustering Algorithm

Considering the updating strategies described in the above sections, the clustering algorithm that we propose is summed up in Algorithm 1. The main principle is identical to the standard FCM algorithm. However, at the updating stage, our algorithm starts by estimating the type of the pixel to cluster. Then, depending on its type, the appropriate updating strategy is applied according to the rules introduced in the previous sections.

5 Experiments

To assess the effectiveness of our approach, we carried out experiments on both synthetic and real images corrupted by three types of noise at different rates: Gaussian noise (Gauss), salt-and-pepper noise (S&P) and mixed noise (Mixed). The latter is created by mixing salt-and-pepper noise and Gaussian noise at the same rate.

The proposed robust contextual fuzzy c-means algorithm (RCFCM) is compared with the following state-of-the-art FCM-based algorithms: FCM Jain (2010), FCMS1 Chen, and Zhang (2004), FCMS2 Chen, and Zhang (2004), EnFCM Szilagyi, Benyo, Szilágyi, and Adam (2003), FGFCM Cai, Chen, and Zhang (2007), FLICM Krinidis, and Chatzis (2010), FRFCM Lei, Jia, Zhang, He, Meng, and Nandi (2018), WRFCM Wang, Pedrycz, Li, and Zhou (2020a), and FCM_SICM Wang, Wang, Fang, and Yang (2020c).

5.1 Parameters Setting

All these algorithms were applied with a fuzzification parameter $m = 2$ and minimum error $\epsilon = 10^{-4}$. The $\alpha $ parameter used in FCMS1, FCMS2, and EnFCM to control the effect of the spatial context is set to 0.85. The spatial and gray level scale factors used in FGFCM are respectively set to $\lambda _s = 3$ and $\lambda _g = 6$, and the size of the neighborhood is $3 \times 3$. For the FRFCM algorithm, a $3 \times 3$ window is used for the structuring element used to produce the marker image and for the kernel of the median filter used to update the membership degrees. For the WRFCM algorithm, the parameters are set as follows: $\xi =0.0008$, $\phi =5$, and the neighborhood size = $3 \times 3$. The geometric and photometric spread parameters of the bilateral filter used by the FCM_SICM are respectively set to $\sigma _d = 3.5$ and $\sigma _r = 2$. The eps parameter of this algorithm is set to 0.000001. Finally, for our algorithm RCFCM, the only parameter is the size of the context, and it was set to $3 \times 3$.

5.2 Performance Metrics

Performances are evaluated using three metrics which are accuracy, Dice index, and peak signal-to-noise ratio.

5.2.1 Accuracy

The accuracy (ACC) is defined as the sum of the ratios of the correctly classified pixels for each cluster to the total number of pixels. It is given by the following equation:

$$\begin{aligned} ACC = \sum _{k=1}^C \frac{A_k \cap C_k}{\sum _{j=1}^C C_j} , \end{aligned}$$

(18)

where C is the number of clusters, and $A_k$ and $C_k$ denote the sets of pixels of the cluster k detected by the clustering technique and given by the ground truth, respectively. The set expressions in the fraction stand for the number of pixels they contain.

5.2.2 Dice Index

The Dice index (DI) gives the degree of similarity between the segmented image and the ground truth. Using the same notations as in Eq. 18, this index is defined as follows:

$$\begin{aligned} DI = \frac{1}{C}\sum _{k=1}^C \frac{A_k \cap C_k}{A_k \cup C_k} . \end{aligned}$$

(19)
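Both metrics reduce to counting pixels per cluster, as the following sketch shows. It implements Eqs. 18 and 19 exactly as written above and assumes that the cluster indices of the segmentation have already been matched to those of the ground truth, a step the text does not describe. The function name and the handling of a cluster that is absent from both images are hypothetical.

```python
import numpy as np

def accuracy_and_dice(seg, gt, C):
    """seg, gt: integer label maps of identical shape with labels in 0..C-1.

    Assumption: label k in `seg` denotes the same class as label k in `gt`.
    Returns (ACC, DI) following Eqs. 18 and 19 as written in the text.
    """
    acc = 0.0
    di = 0.0
    for k in range(C):
        A_k = seg == k   # pixels assigned to cluster k by the algorithm
        C_k = gt == k    # pixels of cluster k in the ground truth
        inter = np.logical_and(A_k, C_k).sum()
        union = np.logical_or(A_k, C_k).sum()
        # Eq. 18: the denominator sum_j |C_j| is the total number of pixels.
        acc += inter / gt.size
        # Eq. 19: per-cluster ratio of intersection to union.
        # Assumption: a cluster absent from both maps counts as full agreement.
        di += inter / union if union > 0 else 1.0
    return acc, di / C
```

For a $2 \times 2$ ground truth `[[0, 0], [1, 1]]` and a segmentation `[[0, 1], [1, 1]]`, three of the four pixels are correct, so ACC is 0.75, and DI is $(1/2 + 2/3)/2 = 7/12$.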

5.2.3 Peak Signal-to-Noise Ratio

The peak signal-to-noise ratio (PSNR) is a metric that expresses the quality of a reconstruction of an image compared to the original image. In our case, the reconstructed image is the one obtained as the output of the studied clustering algorithms while the original image is the ground truth. This metric is interpreted as follows: the higher the PSNR, the better the quality. PSNR is expressed in dB and formulated as in Eq. 20.

$$\begin{aligned} PSNR = 10 \log _{10} \left( MAX^{2} \left[ \frac{1}{N} \sum _{i=1}^N (s_{i} - o_{i})^{2} \right] ^{-1} \right) , \end{aligned}$$

(20)

where $s_i$ denotes the segmented image, $o_i$ the original image, N the number of pixels and MAX refers to the maximum value that can be taken by a pixel.
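Equation 20 translates directly into code. In this sketch, the function name, the default `max_val=255` (8-bit images) and the choice to return infinity for identical images are assumptions.

```python
import numpy as np

def psnr(seg, orig, max_val=255.0):
    """Peak signal-to-noise ratio in dB between two images of identical shape."""
    # Convert to float first so that differences of uint8 images cannot wrap around.
    diff = seg.astype(np.float64) - orig.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        # Assumption: identical images are reported as infinite PSNR.
        return float("inf")
    # Eq. 20: 10 * log10(MAX^2 / MSE)
    return float(10.0 * np.log10(max_val ** 2 / mse))
```

For example, if one pixel out of four differs by the full range of 255, the mean squared error is $255^2/4$ and the PSNR is $10 \log_{10} 4 \approx 6.02$ dB.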

5.3 Results on Synthetic Images

Figure 5 shows two synthetic images among those used in experiments. The first image (Fig. 5a) contains three classes and presents both straight and curved contours separating regions. The second image (Fig. 5b) contains some thin structures (plant branches) that should be preserved as much as possible by the clustering techniques.

Figures 6, 7, and 8 show the results obtained after the application of the different FCM-based algorithms on the image in Fig. 5a. For illustrative purposes, we present the results for one rate for each type of noise. Visual observation of the results shows that the RCFCM algorithm ranks among the best techniques in terms of quality. The contours of the geometric shapes (square, circle, and triangle) have been well preserved, even at the corners. Table 2 lists the quantitative results obtained for this image for the three considered metrics. The values obtained show that our proposal gives the best performance for the salt-and-pepper noise. The RCFCM is ranked second for the Gaussian and mixed noise at 2%. However, when the rate of these noises increases, its performance drops compared to FLICM, WRFCM, and FCM_SICM. The PSNR results are globally consistent with those of the accuracy and the Dice index.

Table 2 Quantitative results obtained for the image given by Fig. 5a


The second synthetic image (Fig. 5b) is segmented into three classes: black (value 0), gray (value 127), and white (value 255). Branches and leaves are within the gray class. They represent thin graphic elements that are sensitive to spatial clustering.

Figures 9, 10, and 11 show the outcomes of the segmentation of this image for various types of noise. The details of the quantitative evaluation are given in Table 3. These findings show that the RCFCM algorithm achieves the best evaluation results for most types of noise, except for the mixed noise at 4%, where it is outperformed by the FRFCM algorithm. They also reveal that it achieves the best preservation of thin structures, while some other techniques considerably deteriorate them.

5.4 Results on Real-World Images

Figure 12 gives two examples of real images among those used in experiments. These images were also corrupted by the three types of noises at different rates.

Table 3 Quantitative results obtained for the image given by Fig. 5b


The image given by Fig. 12a was segmented into two clusters in order to separate rice grains from the non-uniformly illuminated background. Figures 13, 14, and 15 show the visual comparison between the result of the RCFCM algorithm and those of its peers, while Table 4 gives the quantitative comparison. We notice that the main competitor of our technique is the WRFCM algorithm. The best results are shared between this technique and ours, with a slight superiority for WRFCM.

Figure 12b shows a brain magnetic resonance image (MRI). The accurate segmentation of brain tissues such as gray matter, white matter, and cerebrospinal fluid is an important step in the detection of many diseases. Figures 16, 17, and 18 show the results obtained by clustering the brain MRI image into three clusters to isolate each type of tissue. They show that, for salt-and-pepper noise, RCFCM achieves a good delineation of the different tissue types while removing noise. However, Gaussian and mixed noises caused a classification ambiguity, especially between gray matter and white matter. The quantitative evaluation of these results is reported in Table 5. As in most other experiments, RCFCM outperformed the other algorithms when dealing with salt-and-pepper noise. For Gaussian and mixed noises, however, the best performances are globally achieved by the FLICM algorithm.

Table 4 Quantitative results obtained for the image given by Fig. 12a


6 Discussion

All the experiments carried out showed that our algorithm produces very competitive results, placing it globally among the top studied techniques. It achieved the best performance for salt-and-pepper noise, and it gave acceptable results for Gaussian and mixed noise at low rates, where it was slightly outperformed, mainly by the WRFCM algorithm (often with a deviation of around $1\%$) for some images and by the FLICM algorithm for others. However, as the noise became more severe, the performance of RCFCM decreased compared to the top techniques. This point will be studied further in our future work. It should be noted that RCFCM obtains these performances without resorting to filtering, unlike a number of the studied techniques, which exploit filtered versions of the image in the classification process (mean and median filtering for FCMS1 and FCMS2, morphological filtering for FRFCM, and bilateral filtering for FCM_SICM). Moreover, in the majority of works, the integration of spatial information has led to the use of additional parameters that are difficult to configure. For some techniques, these parameters are set experimentally, sometimes depending on the image under study (as is the case for WRFCM). Such techniques need to be extensively tested on each image to find the appropriate parameter value; otherwise, their performance may fluctuate. By comparison, RCFCM has the advantage of being easily configurable since, apart from the standard FCM parameters, it requires only one additional, easy-to-set parameter: the context size.

Table 5 Quantitative results obtained for the image given by Fig. 12b


7 Conclusion

In this paper, a novel robust contextual clustering algorithm is proposed to address the limitations of the conventional FCM algorithm in the classification of noisy pixels in images. Contextual clustering is performed by incorporating the information coming from the contexts of pixels into the clustering process. For each pixel, the context is defined by its surrounding neighbors, delimited by a square window. Contextual information can be incorporated at different levels. In the proposed approach, it is incorporated at the membership degrees updating level and is based on two new parameters: the contextual similarity degree and the intrinsic similarity degree. These parameters are used to estimate the type of the pixel being processed. In this regard, three types of pixels were identified: noisy pixels, contour pixels, and region pixels. For each of these types, a specific strategy for updating the membership degrees was established. The reported experimental results demonstrate the effectiveness of the approach on two levels. The first level is related to the main objective of this work, which is the improvement of the clustering quality by reducing the misclassification of noisy pixels. The second level concerns the preservation of contours and details, where the proposed approach achieves a satisfactory tradeoff between noise removal and detail preservation. This tradeoff is achieved thanks to the adaptive updating strategy adopted in this work.
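To make the two building blocks named above concrete, the sketch below shows the standard FCM membership update and the extraction of a square-window context around a pixel. It is not the RCFCM update itself: the contextual and intrinsic similarity degrees and the per-type strategies defined in Section 4 are not reproduced. The function names, the use of gray levels as features, the exclusion of the center pixel from its own context, and the clipping of the window at image borders are all assumptions made for illustration.

```python
import numpy as np

def fcm_memberships(pixels, centers, m=2.0, eps=1e-12):
    """Standard FCM update: u[k, i] = 1 / sum_j (d_ki / d_kj)^(2 / (m - 1))."""
    x = np.asarray(pixels, dtype=np.float64).reshape(-1, 1)
    c = np.asarray(centers, dtype=np.float64).reshape(1, -1)
    d = np.abs(x - c) + eps                    # (n_pixels, n_clusters) distances
    inv = d ** (-2.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)  # each row sums to 1

def context_window(img, row, col, size=3):
    """Neighbors of (row, col) inside a size x size window, center excluded."""
    img = np.asarray(img)
    h = size // 2
    r0, r1 = max(row - h, 0), min(row + h + 1, img.shape[0])
    c0, c1 = max(col - h, 0), min(col + h + 1, img.shape[1])
    window = img[r0:r1, c0:c1]                 # window clipped at the borders
    mask = np.ones(window.shape, dtype=bool)
    mask[row - r0, col - c0] = False           # drop the pixel itself
    return window[mask]

if __name__ == "__main__":
    img = np.array([[0, 0, 0], [0, 255, 0], [0, 0, 0]])   # isolated bright pixel
    u = fcm_memberships(img.ravel(), centers=[0.0, 255.0])
    neighbors = context_window(img, 1, 1, size=3)
    # The center pixel is assigned to the bright cluster by plain FCM,
    # although every neighbor in its context belongs to the dark one.
    print(u[4].argmax(), neighbors)
```

A contextual method can compare a pixel's own memberships with those of its context to decide whether the pixel looks like an outlier; how RCFCM does this is defined by the similarity degrees of Section 4.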

Data Availability

The data analyzed during the current study are available from the corresponding author upon request.

Code Availability

The code used in this work will be publicly shared once the paper is accepted for publication.

References

Adhikari, S. K., Sing, J. K., Basu, D. K., & Nasipuri, M. (2015). Conditional spatial fuzzy c-means clustering algorithm for segmentation of MRI images. Applied Soft Computing, 34, 758–769.
Ahmed, M. N., Yamany, S. M., Mohamed, N., Farag, A. A., & Moriarty, T. (2002). A modified fuzzy c-means algorithm for bias field estimation and segmentation of MRI data. IEEE Transactions on Medical Imaging, 21(3), 193–199.
Alruwaili, M., Siddiqi, M. H., & Javed, M. A. (2020). A robust clustering algorithm using spatial fuzzy c-means for brain MR images. Egyptian Informatics Journal, 21(1), 51–66.
Ayech, M. W., El Kalti, K., & El Ayeb, B. (2010). Image segmentation based on adaptive fuzzy-c-means clustering. In 2010 20th International Conference on Pattern Recognition, 2306–2309. IEEE.
Bezdek, J. C., Ehrlich, R., & Full, W. (1984). FCM: The fuzzy c-means clustering algorithm. Computers & Geosciences, 10(2–3), 191–203.
Cai, W., Chen, S., & Zhang, D. (2007). Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation. Pattern Recognition, 40(3), 825–838.
Chang-Chien, S.-J., Nataliani, Y., & Yang, M.-S. (2021). Gaussian-kernel c-means clustering algorithms. Soft Computing, 25(3), 1699–1716.
Chen, S., & Zhang, D. (2004). Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 34(4), 1907–1916.
Choudhry, M. S., & Kapoor, R. (2016). Performance analysis of fuzzy c-means clustering methods for MRI image segmentation. Procedia Computer Science, 89, 749–758.
Chuang, K.-S., Tzeng, H.-L., Chen, S., Wu, J., & Chen, T.-J. (2006). Fuzzy c-means clustering with spatial information for image segmentation. Computerized Medical Imaging and Graphics, 30(1), 9–15.
Despotovic, I., Vansteenkiste, E., & Philips, W. (2013). Spatially coherent fuzzy clustering for accurate and noise-robust image segmentation. IEEE Signal Processing Letters, 20(4), 295–298.
Elazab, A., Wang, C., Jia, F., Wu, J., Li, G., & Hu, Q. (2015). Segmentation of brain tissues from magnetic resonance images using adaptively regularized kernel-based fuzzy c-means clustering. Computational and Mathematical Methods in Medicine, 2015.
Elhedda, W., Mehri, M., & Mahjoub, M. A. (2020). Hyperkernel-based intuitionistic fuzzy c-means for denoising color archival document images. International Journal on Document Analysis and Recognition (IJDAR), 23(3), 161–181.
Fan, J.-L., Zhen, W.-Z., & Xie, W.-X. (2003). Suppressed fuzzy c-means clustering algorithm. Pattern Recognition Letters, 24(9–10), 1607–1612.
Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666.
Kalaiselvi, S., & Gomathi, V. (2020). $\alpha$-cut induced fuzzy deep neural network for change detection of SAR images. Applied Soft Computing, 95, 106510.
Krinidis, S., & Chatzis, V. (2010). A robust fuzzy local information c-means clustering algorithm. IEEE Transactions on Image Processing, 19(5), 1328–1337.
Lei, T., Jia, X., Zhang, Y., He, L., Meng, H., & Nandi, A. K. (2018). Significantly fast and robust fuzzy c-means clustering algorithm based on morphological reconstruction and membership filtering. IEEE Transactions on Fuzzy Systems, 26(5), 3027–3041.
Li, M., & Li, Y.-S. (2006). Fuzzy-c-means clustering based on the gray and spatial feature for image segmentation. In 2006 International Conference on Computational Intelligence and Security, 2, 1641–1646. IEEE.
Liew, A., Leung, S., & Lau, W. (2000). Fuzzy image clustering incorporating spatial continuity. IEE Proceedings-Vision, Image and Signal Processing, 147(2), 185–192.
Liew, A.-C., Leung, S. H., & Lau, W. H. (2003). Segmentation of color lip images by spatial fuzzy clustering. IEEE Transactions on Fuzzy Systems, 11(4), 542–549.
Mohamed, N. A., Ahmed, M., & Farag, A. (1998). Modified fuzzy c-mean in medical image segmentation. In Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Vol. 20 Biomedical Engineering Towards the Year 2000 and Beyond (Cat. No. 98CH36286), 3, 1377–1380. IEEE.
Shamsi, H., & Seyedarabi, H. (2012). A modified fuzzy c-means clustering with spatial information for image segmentation. International Journal of Computer Theory and Engineering, 4(5), 762.
Szilagyi, L., Benyo, Z., Szilágyi, S. M., & Adam, H. (2003). MR brain image segmentation using an enhanced fuzzy c-means algorithm. In Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No. 03CH37439), 1, 724–726. IEEE.
Tian, J. W., Yu, Y. L., & Shen, T. (2012). FCM clustering segmentation algorithms based on spatial constraint. In Advanced Materials Research, 411, 497–500. Trans Tech Publ.
Tolias, Y. A., & Panas, S. M. (1998). On applying spatial constraints in fuzzy image clustering using a fuzzy rule-based system. IEEE Signal Processing Letters, 5(10), 245–247.
Wang, C., Pedrycz, W., Li, Z., & Zhou, M. (2020). Residual-driven fuzzy c-means clustering for image segmentation. IEEE/CAA Journal of Automatica Sinica, 8(4), 876–889.
Wang, C., Pedrycz, W., Zhou, M., & Li, Z. (2020). Sparse regularization-based fuzzy c-means clustering incorporating morphological grayscale reconstruction and wavelet frames. IEEE Transactions on Fuzzy Systems, 29(7), 1826–1840.
Wang, Q., Wang, X., Fang, C., & Jiao, J. (2021). Fuzzy image clustering incorporating local and region-level information with median memberships. Applied Soft Computing, 105, 107245.
Wang, Q., Wang, X., Fang, C., & Yang, W. (2020). Robust fuzzy c-means clustering algorithm with adaptive spatial & intensity constraint and membership linking for noise image segmentation. Applied Soft Computing, 92, 106318.
Wang, Z., Song, Q., Soh, Y. C., & Sim, K. (2013). An adaptive spatial information-theoretic fuzzy clustering algorithm for image segmentation. Computer Vision and Image Understanding, 117(10), 1412–1420.
Yang, M.-S., & Tsai, H.-S. (2008). A Gaussian kernel-based fuzzy c-means algorithm with a spatial bias correction. Pattern Recognition Letters, 29(12), 1713–1725.
Yang, X., Zhang, G., Lu, J., & Ma, J. (2010). A kernel fuzzy c-means clustering-based fuzzy support vector machine algorithm for classification problems with outliers or noises. IEEE Transactions on Fuzzy Systems, 19, 105–115.
Zhang, X., Pan, W., Wu, Z., Chen, J., Mao, Y., & Wu, R. (2020). Robust image segmentation using fuzzy c-means clustering with spatial information based on total generalized variation. IEEE Access, 8, 95681–95697.
Zhao, F., Jiao, L., & Liu, H. (2013). Kernel generalized fuzzy c-means clustering with spatial information for image segmentation. Digital Signal Processing, 23(1), 184–199.
Zhao, F., Liu, F., Li, C., Liu, H., Lan, R., & Fan, J. (2021). Coarse-fine surrogate model driven multiobjective evolutionary fuzzy clustering algorithm with dual memberships for noisy image segmentation. Applied Soft Computing, 112, 107778.
Zhao, F., Liu, H., & Fan, J. (2015). A multiobjective spatial fuzzy clustering algorithm for image segmentation. Applied Soft Computing, 30, 48–57.


Acknowledgements

The authors thank the editor, the associate editor, and the referees for their constructive comments that helped to improve this manuscript.

Author information

Authors and Affiliations

Université de Sousse, Ecole Nationale d’Ingénieurs de Sousse, LATIS-Laboratory of Advanced Technology and Intelligent Systems, Sousse, 4023, Tunisia
Karim Kalti & Asma Touil
Computer Science Department, Faculty of Science, University of Monastir, Monastir, 5019, Tunisia
Karim Kalti

Authors

Karim Kalti
Asma Touil

Corresponding author

Correspondence to Karim Kalti.

Ethics declarations

Ethical Approval

The paper has been prepared in compliance with ethical standards.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article

Cite this article

Kalti, K., Touil, A. A Robust Contextual Fuzzy C-Means Clustering Algorithm for Noisy Image Segmentation. J Classif 40, 488–512 (2023). https://doi.org/10.1007/s00357-023-09443-1


Accepted: 14 June 2023
Published: 09 August 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00357-023-09443-1
