1 Introduction

Real datasets are generally defined in a fuzzy context: there is no sharp boundary between classes, so fuzzy clustering is better suited to data classification. Classifying such data is therefore a very challenging task (Ayadi et al. 2007, 2010, 2011; Alimi 2000, 2003; Hamdani et al. 2011a; Dhahri and Alimi 2005; Kohonen 1998; Denaïa et al. 2007; Chang and Liao 2006). Clustering is an unsupervised classification mechanism in which a set of entities is grouped into a number of homogeneous clusters with respect to a given similarity measure. Due to the fuzzy nature of many classification problems, a number of fuzzy clustering methods have been developed (Ayadi et al. 2007, 2010, 2011; Alimi 2003; Hamdani et al. 2008, 2011a, b; El Malek et al. 2002; Pascual et al. 2000; Kohonen 1998, 2001; Denaïa et al. 2007; Fritzke 1994; Petterssona et al. 2007; Abraham and Nath 2001; Aguilera and Frenich 2001; Chang and Liao 2006). Fuzzy clustering can be applied as an unsupervised learning technique to classify patterns (Alimi 2000, 2003; Alimi et al. 2003).

In unsupervised training, the network learns to form its own classes from the training data without external adjustment. Unsupervised neural network systems use a competitive learning process based on similarity comparisons in a continuous space; the resulting system places similar inputs close to each other on a one- or two-dimensional grid. Among the architectures suggested for unsupervised neural networks, the self-organizing map (SOM) (Kohonen 1982) is the most effective at creating organized internal representations of input vectors and their abstractions. SOM is an unsupervised learning model used to visualise and explore linear and nonlinear relationships in high-dimensional data; it was first applied to speech recognition by Kohonen (1998, 2001). In the literature, SOM has proved successful in visualising rich and complex phenomena, such as pattern recognition tasks involving different criteria, especially in clustering (Denaïa et al. 2007; Fritzke 1994; Hsu and Halgarmuge 2001). SOM algorithms have since been applied in various research areas: classification (Kohonen 1998), managing ecosystems (Chang and Liao 2006) and GIS for addressing remediation (Petterssona et al. 2007). SOM approaches have also been applied to pattern recognition in big data (El Malek et al. 2002; Tlili et al. 2012; Amarasiri et al. 2004).

Generally, SOM approaches try to preserve the geometric form of the data, called the topology, with the minimum quantization error during training (Alimi et al. 2003; Hamdani et al. 2008; Pascual et al. 2000; Ayadi et al. 2012). However, SOM algorithms do not preserve the topology perfectly because of their predefined structure. Dynamic variants of SOM have therefore been proposed, which define the structure dynamically during the training process.

In this paper we present a new fuzzy dynamic learning method called FMIG (Fuzzy Multilevel Interior Growing Self-Organizing Maps). This algorithm is a hybrid approach integrating the MIGSOM technique (Multilevel Interior Growing Self-Organizing Maps) (Ayadi et al. 2010, 2012) and the FKCN algorithm (Fuzzy Kohonen Clustering Network) (Pascual et al. 2000). Our approach introduces a new form of fuzzy growing process and attempts to overcome some clustering difficulties by combining the best features of the multilevel structure of MIGSOM with the fuzzy clustering model of FKCN. MIGSOM develops its growing process from the node that accumulates the highest quantization error (Fig. 1) (Amarasiri et al. 2004; Ayadi et al. 2012), while FKCN presents a fuzzy clustering model based on Kohonen's Self-Organizing Feature Maps (SOFM) (Kohonen 1997) and Fuzzy C-Means (FCM) (Bezdek 1981, 1987). Both SOFM and FCM have been successfully applied in classification.

Fig. 1 New nodes insertion with the MIGSOM growing process: a from the boundary and b from the interior

One advantage of our approach is a new Growing Threshold (GT) used to control the growth of the map. We observed that the GT structure used in other algorithms such as MIGSOM becomes unwieldy and takes large values as the training process advances (Ayadi et al. 2010; Alimi 2000, 2003; Alimi et al. 2003). In addition, when the dataset is large, the computation becomes more complicated in later iterations and takes more time as the GT values grow. For these reasons, we created a new growing threshold for our FMIG methodology for big data clustering. The result is a new fuzzy dynamic SOM method that maintains the topology of the patterns and considerably decreases the quantization error.

The rationale underlying the presented FMIG algorithm is to combine the advantages of fuzzy clustering and of the MIGSOM approach. It is worth mentioning that FMIG serves two purposes: first, it can produce the best fuzzy distribution of the data by finding the topology corresponding to the fuzzy form of the inputs; second, it gives the minimum quantization error, providing the best cluster structure. FMIG is thus used to produce the best approximation to the real distribution of a given dataset. The algorithm finds a set of codebook vectors that compactly code all the input patterns, and the topological FMIG maps introduce an additional relation between the codebook vectors. This topological relationship is a constraint that has been shown to produce robust results with the FMIG algorithm (Alimi 2000; Alimi et al. 2003; Hamdani et al. 2011a).

The classification of high-dimensional datasets poses a computational problem that grows with the data size. To reduce the computational cost, a two-step procedure has been proposed: first, the datasets are trained with a neural map to find the quantization prototypes; then, the generated prototypes are clustered with a clustering algorithm such as FCM (Bezdek 1981, 1987). The choice of FCM is based on its better clustering compared to SOM and hierarchical algorithms (Mingoti and Lima 2006). However, the challenge of big data clustering is how to evaluate the performance of the resulting partition and find the optimal number of clusters. Several validity indexes have been developed that focus on the geometric structure of the data subject to unsupervised classification. The Dunn validity function (Dunn 1974) is based on the inter-cluster distance and the diameter of a hyperspheric cluster. Inspired by the Xie-Beni validity function, Kim and Ramakrishna developed in 2005 the Xie-Beni Index (XBI), based on a fuzzy compactness criterion (Mingoti and Lima 2006). In 2004, Tsekouras and Sarimveis proposed an index based on a fuzzy measure of separation called the Separation Validity Index (SVI) (Wu and Yang 2005). Lung and Yang proposed in 2004 a fuzzy index evaluating noisy points in the dataset, called Partition Coefficient And Exponential Separation (PCAES) (Hu et al. 2004), later normalized by Tlili et al. as PCAESN (2014). Wang and Lee presented a method that calculates the fuzzy degree of overlap between clusters, called Validity Overlap Separation (VOS) (Pal and Bezdek 1992). In 2009, Tlili et al. presented a new cluster validity function called the Fuzzy Validity Index with Noise-Overlap Separation (FVINOS) for evaluating fuzzy and crisp clustering (Tlili et al. 2014).
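To make this two-step procedure concrete, the sketch below shows the second step as a minimal numpy implementation of FCM applied to the prototypes of a trained map. The function name fcm and its defaults are illustrative assumptions, not taken from the cited implementations.

```python
import numpy as np

def fcm(prototypes, c, m=2.0, n_iter=100, seed=0):
    """Minimal Fuzzy C-Means (Bezdek 1981) over map prototypes (a sketch).

    prototypes: (N, D) array of codebook vectors from the trained neural map.
    Returns (centers, U), where U[i, k] is the membership of prototype k in
    cluster i and each column of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    N = len(prototypes)
    U = rng.random((c, N))
    U /= U.sum(axis=0)                                   # normalize memberships
    for _ in range(n_iter):
        Um = U ** m
        centers = (Um @ prototypes) / Um.sum(axis=1, keepdims=True)
        # distance of every cluster center to every prototype
        d = np.linalg.norm(centers[:, None, :] - prototypes[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)                            # guard against division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=0, keepdims=True)         # standard FCM membership update
    return centers, U
```

The prototypes would come from the trained neural map of the first step, e.g. `centers, U = fcm(map_weights, c=4)` for a four-cluster partition (`map_weights` is a hypothetical name).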

Our analysis is based on the cluster validity indexes FVINOS, PCAESN, VOS, SVI, Dunn and XBI, which we studied in our previous research (Tlili et al. 2014) (see Appendix).

In a first step, we undertook extensive performance comparisons with MIGSOM, GSOM and FKCN to establish the robustness of FMIG in mapping several synthetic as well as real-world datasets. We then clustered the generated prototypes with the FCM algorithm. Finally, we applied the validity indexes to the resulting partitions to evaluate FMIG's clustering performance.

This paper is organized as follows: Sect. 2 reviews the MIGSOM and FKCN methods. Section 3 introduces our new FMIG approach and details its steps. Section 4 evaluates the FMIG method against the MIGSOM, GSOM and FKCN algorithms in terms of quantization and topological errors on a variety of synthetic and real datasets. In Sect. 5, we focus on the robustness of our approach in data clustering by validating the obtained partitions with the validity indexes described above. Finally, Sect. 6 summarises our remarks and concludes.

2 Review of crisp and fuzzy SOM algorithms

2.1 Multilevel interior growing self-organizing maps algorithm (MIGSOM)

Proposed by Ayadi et al. (2010, 2012), this SOM variant introduces a new growing process. The MIGSOM algorithm comprises three phases:

  1. The initialization phase, which sets the Growing Threshold (GT) used to control the growth of the map and initializes the weight vectors.

  2. The growing phase, which increases the map size by adding new nodes from the unit presenting the highest quantization error.

  3. The smoothing phase, which refines the map resulting from the previous phase.

The MIGSOM algorithm proceeds as follows (a code sketch of the growing computations follows the list):

  1. Initialize the map grid with (2 \(\times \) 2) or (3 \(\times \) 3) nodes defined randomly and calculate the growth threshold (GT):

    $$\begin{aligned} \mathrm{GT} = -\ln ( n ) \times \ln ( \mathrm{SF} ) \end{aligned}$$
    (1)

    The GT is used to decide when to initiate new node growth; it specifies the spread of the feature map to be generated. The GT value depends on the required map growth: if only a very abstract picture of the data is needed, a large GT yields a map with fewer nodes, while a smaller GT makes the map spread out more. For these reasons, our new NGT form (Sect. 3) also integrates the characteristics of the dataset in order to represent big data faithfully. SF is the Spread Factor, taken in [0, 1], used to control the growth of the map, and n is the size of the given dataset.

  2. The growing phase comprises six steps. Step 1: present the input patterns to the network. Step 2: find the best-matching weight vectors (winners) closest to the input data using the Euclidean distance. Step 3: update the weight vectors according to the following formula:

    $$\begin{aligned} w_i ( {t+1})=\frac{\sum \limits _{j=1}^n {h_{ci} ( t)\times x_j } }{\sum \limits _{j=1}^n {h_{ci} ( t)} } \end{aligned}$$
    (2)

    with \(w_i (t)\) the weight vector of unit \(i\) and \(x_j\) the input pattern (\(j=1 \ldots n)\). \(h_{ci} ( t)\) represents the neighbourhood kernel centred on the BMU (Best Matching Unit), given by

    $$\begin{aligned} h_{ci} ( t)=\exp \left( {-\frac{Ud^2}{2\sigma ^2( t)}}\right) \times l( {\sigma ( t)-Ud}), \end{aligned}$$
    (3)

    where \(Ud\) is the distance between neuron \(i\) and the winning node \(c\) on the map, and \(\sigma ( t)\) represents the neighbourhood radius, i.e., the extent to which neurons in the vicinity of the winning neuron are excited. \(l( {\sigma _i -Ud})\) is an indicator function taking two values:

    $$\begin{aligned} l( {\sigma _i -Ud})=\left\{ {\begin{array}{l@{\quad }l} 1 &{} \text{ if } \sigma _i \ge Ud, \\ 0 &{} \text{ otherwise } \end{array}} \right. \end{aligned}$$
    (4)

    Step 4: compute the error of each node and identify the node \(q\) with the highest quantization error, with \(k\) units mapped to it:

    $$\begin{aligned} E_{rr} =\sum \limits _{j=1}^{k} {\left\| {x_j -w_q } \right\| } \end{aligned}$$
    (5)

    Step 5: if \(Err_q > \mathrm{GT}\), generate new nodes from \(q\), then initialize the weight vectors of the new nodes to match the weights of the neighbouring nodes. Step 6: repeat Steps 1–5 until the iteration limit is reached.

  3. Smoothing phase. Step 1: fix the neighbourhood radius to one. Step 2: train the map as in the growing phase, without adding new nodes.
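The growing-phase computations (Eqs. 1–5) can be summarized in a short numpy sketch. The node-insertion geometry itself (boundary vs. interior, Fig. 1) is omitted, and all names and signatures are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np

def growth_threshold(n, sf):
    """Eq. (1): GT = -ln(n) * ln(SF), with SF in (0, 1)."""
    return -np.log(n) * np.log(sf)

def neighbourhood(grid, c, sigma):
    """Eqs. (3)-(4): Gaussian kernel centred on unit c, cut off at radius sigma."""
    ud = np.linalg.norm(grid - grid[c], axis=1)          # map-space distance Ud
    return np.exp(-ud ** 2 / (2.0 * sigma ** 2)) * (ud <= sigma)

def growing_iteration(X, W, grid, sigma, gt):
    """One batch iteration of the growing phase (Steps 2-5), as a sketch.

    X: (n, D) inputs; W: (units, D) weights; grid: (units, 2) map coordinates.
    Returns updated weights, the highest-error node q, and a grow flag.
    """
    # Step 2: best-matching unit of every input (Euclidean distance)
    bmus = np.argmin(np.linalg.norm(X[:, None, :] - W[None, :, :], axis=2), axis=1)
    # Step 3 / Eq. (2): batch weight update weighted by the neighbourhood kernel
    H = np.stack([neighbourhood(grid, c, sigma) for c in bmus])   # (n, units)
    W = (H.T @ X) / np.fmax(H.sum(axis=0), 1e-12)[:, None]
    # Step 4 / Eq. (5): quantization error accumulated on each unit
    err = np.zeros(len(W))
    np.add.at(err, bmus, np.linalg.norm(X - W[bmus], axis=1))
    q = int(np.argmax(err))
    # Step 5: grow new nodes from q (boundary or interior) when Err_q > GT
    return W, q, err[q] > gt
```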

2.2 The fuzzy Kohonen clustering network: FKCN

Proposed by Pascual et al. (2000), FKCN is a fuzzy SOM heuristic. It consists of two layers: the input layer is formed by n nodes, with n the number of input patterns, and the output layer is composed of c nodes, where c is the number of output map vectors. Each input node is connected to all output units through a weight vector \(\nu _i \) (\(i=1 \ldots c)\). Based on a pre-defined learning rate, the neurons in the output layer update their weights for an input vector \(x_k \) (\(k=1 \ldots n)\). The FKCN approach integrates the fuzzy membership degree \(u_{ik,t} \) introduced by the fuzzy c-means algorithm (Bezdek 1981). The weight vector \(\nu _{i,t} \) at iteration \(t\) is given by

$$\begin{aligned} \nu _{i,t} =\nu _{i,t-1} +\alpha _{ik,t} \times ( {x_k -\nu _{i,t-1} }) \end{aligned}$$
(6)

The learning rate \(\alpha \) is defined as

$$\begin{aligned}&\alpha _{ik,t} =\left( {u_{ik,t} }\right) ^{m_t }\end{aligned}$$
(7)
$$\begin{aligned}&m_t =m_0 -t\times \Delta m\end{aligned}$$
(8)
$$\begin{aligned}&\Delta m=\frac{\left( {m_0 -1}\right) }{t_{\max } }, \end{aligned}$$
(9)

where \(t_{\max }\) is the value of the iteration limit and \(m_0 \) is a positive constant greater than 1.

The membership degree \(u_{ik,t} \) is given by

$$\begin{aligned} u_{ik,t} =\left[ {\sum \limits _{j=1}^c {\left[ {\frac{\left\| {x_k -\nu _{i,t} } \right\| }{\left\| {x_k -\nu _{j,t} } \right\| }} \right] } ^{2/(m_{t}-1)}} \right] ^{-1} \end{aligned}$$
(10)
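The FKCN updates (Eqs. 6–10) fit in a few lines of numpy. The sketch below is a minimal reading of the equations, with the fuzzifier floored slightly above 1 to avoid numerical blow-up as \(m_t\) approaches 1; the function name and defaults are assumptions for illustration.

```python
import numpy as np

def fkcn(X, c, m0=2.0, t_max=50, seed=0):
    """Sketch of FKCN training (Eqs. 6-10); names and defaults are illustrative."""
    rng = np.random.default_rng(seed)
    V = X[rng.choice(len(X), size=c, replace=False)].astype(float)  # initial weights
    dm = (m0 - 1.0) / t_max                              # Eq. (9)
    for t in range(1, t_max + 1):
        mt = max(m0 - t * dm, 1.05)                      # Eq. (8), floored near 1
        d = np.linalg.norm(X[:, None, :] - V[None, :, :], axis=2)   # (n, c)
        d = np.fmax(d, 1e-12)
        inv = d ** (-2.0 / (mt - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)         # Eq. (10) memberships
        alpha = U ** mt                                  # Eq. (7) learning rates
        for k in range(len(X)):                          # Eq. (6), one input at a time
            V += alpha[k][:, None] * (X[k] - V)
    return V
```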

3 FMIG: fuzzy multilevel interior GSOMs

We present a new form of fuzzy self-organizing algorithm called FMIG (Fuzzy Multilevel Interior GSOMs). Our approach is based on the MIGSOM algorithm (Ayadi et al. 2010, 2012), which adapts its structure according to the form of the data. The fuzzy aspect of our method is inspired by the FKCN algorithm (Pascual et al. 2000).

As in MIGSOM, the training process of FMIG is composed of three steps: an initialization phase, a growing phase and a smoothing phase. The main idea of FMIG is to introduce a new growing threshold to improve the control of map growth and the training process. In fact, the GT form used in the MIGSOM algorithm becomes unwieldy as the map grows; in addition, with big datasets, the computation becomes more complex and takes more time as GT values increase.

Our new approach FMIG shows two interesting properties:

  • It produces the best distribution of data by finding out the topology corresponding to the form of inputs.

  • It gives the minimum quantization error, providing the best cluster structure.

The FMIG algorithm consists of the following processes:

3.1 Initialisation

  • Initialize the pre-defined values of the Spread Factor (SF), taken in [0, 1] and used to control the growth of the map, and the fuzzifier factor \(m_{0}\). The spread factor lets the data analyst control the growth of the training process and is independent of the dimensionality of the given dataset. The system uses this factor to calculate the Growing Threshold (GT), which acts as a threshold value for initiating node generation: a high GT value results in a less spread-out map, while a low GT produces a well-distributed map.

  • Taking into account the size of the training dataset (n) and the number of its characteristics (D), the growing threshold gives a faithful image of the specific aspects of the data; mapping such data therefore produces the best distribution and topology. Our New Growing Threshold (NGT) is defined as follows:

    $$\begin{aligned} \mathrm{NGT} =\frac{-\ln ( D )\times \ln ( \mathrm{SF} )}{n} \end{aligned}$$
    (11)
  • Initialize the map grid with \(c_{0}\) nodes, (2 \(\times \) 2) or (3 \(\times \) 3), placed randomly, together with the weight vectors \(\nu _{0}= (\nu _{1,0}, \nu _{2,0}, \ldots , \nu _{c_0,0})\). Initialize the membership matrix randomly, with n the number of data inputs:

    $$\begin{aligned} U=\left[ {u_{ij} } \right] _{({i} =1 \ldots {c}_0,\; {j}=1 \ldots {n})} \end{aligned}$$
    (12)

    Fix \(t_{\max }\), the limit of iterations. (A numpy sketch of this initialisation is given below.)
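A minimal numpy sketch of this initialisation, under the assumption that the initial weight vectors are drawn from the data, could read:

```python
import numpy as np

def fmig_init(X, sf=0.05, c0=9, seed=0):
    """FMIG initialisation sketch: NGT (Eq. 11), initial nodes, memberships (Eq. 12)."""
    rng = np.random.default_rng(seed)
    n, D = X.shape
    ngt = -np.log(D) * np.log(sf) / n                    # Eq. (11)
    # c0 = 4 or 9 corresponds to the (2 x 2) or (3 x 3) initial grid;
    # here the weights are initialized from random data points for convenience
    V = X[rng.choice(n, size=c0, replace=False)].astype(float)
    U = rng.random((c0, n))
    U /= U.sum(axis=0)                                   # Eq. (12), columns sum to 1
    return ngt, V, U
```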

3.2 Growing process

For t=1... \(t_{\max }\):

  • Calculate the fuzzifier parameter \(m_t\) as shown in (8).

  • Determine the membership degree \(u_{ik,t} \) for the iteration t using the Euclidean distance between the input pattern \(x_k\) and the weight vector \(\nu _i \), with \(c_{t}\) being the number of weight vectors at the iteration \(t\) as defined in (10).

  • Compute the learning rate \(\alpha _{ik,t} \) as defined in (7).

  • The weight vectors are updated by the following formula (Pascual et al. 2000):

    $$\begin{aligned} \nu _{i,t} =\nu _{i,t-1} +\frac{\sum \limits _{k=1}^n {\alpha _{ik,t} \left( {x_k -\nu _{i,t-1} }\right) } }{\sum \limits _{s=1}^n {\alpha _{is,t} } } \end{aligned}$$
    (13)
  • Calculate the quantization error of each node \(i\) as in Ayadi et al. (2010):

    $$\begin{aligned} QE_i =\sum \limits _{j=1}^{\mathrm{nbu}} {\left\| {x_j -\nu _i} \right\| } \end{aligned}$$
    (14)

    with nbu the number of units mapped to node \(i\). The Quantization Error (QE) is a measure that completely disregards map topology and alignment; it is computed as the average distance between the sample vectors and the prototype vectors that represent them.

  • Identify the node \(q\) presenting the highest quantization error \(QE_{\max}\), with \(k\) units mapped to it, as in (5).

  • If \(QE_{\mathrm{max}} > \mathrm{NGT}\), generate new nodes from \(q\) as in Ayadi et al. (2010). (A sketch of one growing iteration follows the list.)
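Putting these bullets together, one growing iteration can be sketched as follows; the fuzzifier is floored slightly above 1 for numerical stability, and all names are illustrative.

```python
import numpy as np

def fmig_growing_iteration(X, V, t, m0, t_max, ngt):
    """One FMIG growing iteration (Eqs. 7, 8, 10, 13, 14), as a sketch."""
    mt = max(m0 - t * (m0 - 1.0) / t_max, 1.05)          # Eq. (8), floored near 1
    d = np.fmax(np.linalg.norm(X[:, None, :] - V[None, :, :], axis=2), 1e-12)
    inv = d ** (-2.0 / (mt - 1.0))
    U = (inv / inv.sum(axis=1, keepdims=True)).T         # Eq. (10), shape (c, n)
    alpha = U ** mt                                      # Eq. (7)
    s = alpha.sum(axis=1, keepdims=True)
    V = V + (alpha @ X - s * V) / s                      # Eq. (13), batch fuzzy update
    # Eq. (14): quantization error of each node over the inputs it maps
    bmus = np.argmin(d, axis=1)
    qe = np.zeros(len(V))
    np.add.at(qe, bmus, np.linalg.norm(X - V[bmus], axis=1))
    q = int(np.argmax(qe))
    return V, U, q, qe[q] > ngt                          # grow from node q when True
```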

3.3 Smoothing process

  • Initialise the limit of smoothing iterations \(t_{s\mathrm{{max}}}\).

  • Train the map as in the growing process, without adding new nodes (Ayadi et al. 2010).

4 FMIG map quality preservation

To assess the performance of our technique, comparative studies are conducted in terms of quantization and topological errors. These comparisons involve the proposed FMIG algorithm, MIGSOM, FKCN and a variant of the GSOM method (Denaïa et al. 2007).

We first executed FMIG with the two forms of growing threshold, NGT (Eq. 11) and GT (Eq. 1), to demonstrate the performance of the new NGT structure.

4.1 Experimental settings

The experiments run the FMIG, MIGSOM [...], FKCN and GSOM algorithms with the same test parameters: the Euclidean distance function, fuzzy factor \(m_0 = 2\) and Spread Factor SF = 0.05. The iteration limit of the growing process \(t_{\mathrm{max}}\) is set to 30, 50 and 100; the smoothing process uses a maximum of \(t_{s\mathrm{max}} = 10\) iterations.

The algorithms are compared in terms of the quantization and topological errors, which are defined as follows (a computational sketch is given after the list):

  • Topological Error: the Final Topologic Error (FTE) is computed as in Ayadi et al. (2010):

    $$\begin{aligned} \mathrm{{FTE}}=\frac{1}{c_t }\sum \limits _{i=1}^{c_t } {mu( {\nu _i })} \end{aligned}$$
    (15)

    \(mu( {\nu _i })\) equals 1 if the first and the second Best Matching Units (BMUs) of \(\nu _i \) are adjacent, and 0 otherwise.

  • The Final Quantization Error (FQE) is computed as the normalized average distance between each input data point and its BMU:

    $$\begin{aligned} \mathrm{FQE}=\left\{ {\begin{array}{ll} 1 &{} \text{ if } \text{ no } \text{ data } \text{ point } \text{ matches } \text{ the } \text{ unit } \\ \frac{1}{c_t }\sum \limits _{i=1}^{c_t } {\left[ {\frac{\frac{1}{\mathrm{nbu}}\sum \limits _{j=1}^{\mathrm{nbu}} {\left\| {x_j -\nu _{\mathrm{BMU},t}} \right\| } }{\mathrm{norm}( {\nu _i })}} \right] } &{} \text{ otherwise } \end{array}} \right. \end{aligned}$$
    (16)

    \(\nu _{\mathrm{BMU},t} \) is the weight vector of the BMU at iteration t.
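The two measures can be computed as sketched below. Eq. (15) is read here per matched input, as is usual for topographic error, and norm(\(\nu _i\)) is taken as the Euclidean norm of the unit's weight vector; both readings are assumptions where the text leaves room for interpretation.

```python
import numpy as np

def map_quality(X, V, grid):
    """Sketch of the FTE (Eq. 15) and FQE (Eq. 16) computations.

    X: (n, D) inputs; V: (c, D) unit weights; grid: (c, 2) map coordinates.
    """
    d = np.linalg.norm(X[:, None, :] - V[None, :, :], axis=2)    # (n, c)
    order = np.argsort(d, axis=1)
    b1, b2 = order[:, 0], order[:, 1]                    # first and second BMUs
    # Eq. (15): proportion of cases whose two BMUs are adjacent on the grid
    fte = float((np.linalg.norm(grid[b1] - grid[b2], axis=1) <= 1.0).mean())
    # Eq. (16): per-unit normalized average distance to the matched inputs
    terms = []
    for i in range(len(V)):
        mask = b1 == i
        if not mask.any():
            terms.append(1.0)                            # no data point matches unit i
        else:
            avg = np.linalg.norm(X[mask] - V[i], axis=1).mean()
            terms.append(avg / np.linalg.norm(V[i]))
    fqe = float(np.mean(terms))
    return fqe, fte
```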

Table 1 Datasets description

4.2 Datasets description

To test the effectiveness of the proposed FMIG algorithm, we performed a sensitivity analysis on synthetic datasets of different dimensions (Dataset1, Dataset2 and Dataset3) and on real datasets (IRIS, Ionosphere, Letter, MNIST, DNA and NE) (Blake and Merz 1998). NE (Theodoridis 1996) consists of postal addresses of three metropolitan cities in the US (New York, Philadelphia and Boston). All datasets are described in Table 1.

  • Dataset1 is composed of 1,600 two-dimensional points distributed in 4 overlapping clusters.

  • Dataset2 presents 1,300 two-dimensional points distributed in 13 noisy clusters. (An illustrative generator for comparable synthetic data is sketched below.)
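The paper does not specify the exact generators for Dataset1 and Dataset2; the following hypothetical sketch produces comparable data (Gaussian clusters, optionally with uniform outliers) for readers who wish to reproduce similar settings.

```python
import numpy as np

def make_clusters(n, k, spread, noise_frac=0.0, seed=0):
    """Generate n two-dimensional points in k Gaussian clusters, plus optional noise."""
    rng = np.random.default_rng(seed)
    centers = rng.uniform(0.0, 10.0, size=(k, 2))
    X = np.vstack([c + spread * rng.standard_normal((n // k, 2)) for c in centers])
    if noise_frac > 0.0:                                 # scattered outliers
        X = np.vstack([X, rng.uniform(-2.0, 12.0, size=(int(noise_frac * n), 2))])
    return X

dataset1_like = make_clusters(1600, 4, spread=1.5)                     # overlapping
dataset2_like = make_clusters(1300, 13, spread=0.4, noise_frac=0.05)   # noisy
```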

4.3 Results and analyses

4.3.1 Multilevel map structure with FMIG

Figure 2 illustrates the labelled FMIG map structures for Dataset1, IRIS and NE. These maps result from executing FMIG with 50 growing-phase iterations. As shown, FMIG is able to produce multilevel oriented grid structures; in addition, the maps present homogeneous clusters.

Fig. 2 Grid structure of the FMIG algorithm for a Dataset1, b IRIS and c NE datasets

4.3.2 New growing threshold NGT

To demonstrate the improvement in data quantization and topology preservation introduced by the new NGT structure (11), we performed tests on different datasets. We executed the FMIG algorithm in two versions, using the GT and NGT formulas respectively, to assess the effect of the new NGT parameter. Table 2 shows the FQE and FTE results of executing FMIG 100 times with each of the two forms.

Table 2 Comparison of growing thresholds NGT and GT

Table 2 shows that using NGT yields the minimum quantization error: the FQE obtained with the proposed NGT form is clearly smaller than the FQE resulting from the classic GT form for all datasets, especially for big data (Letter and NE).

The results in Table 2 also show that the new growing threshold improves the topological error: FTE with NGT gives minimum values on the tested databases compared to FTE with GT. Minimum values of the topological error lead to a well-distributed map.

This experimental study confirms that in iterative processes the number of iterations matters for evaluating the results, because more iterations increase the chance of reaching the global solution of the problem. If the stopping condition is chosen correctly, the iterative process can run up to the maximum number of iterations and produce the algorithm's optimal values. For these reasons, we designed the new growing parameter of the FMIG algorithm as the stopping condition of the growing process:

If \(QE_{\max} < \mathrm{NGT}\), stop growing; otherwise continue generating new nodes.

With a small NGT, the process is likely to continue. A high threshold value results in a less spread-out map, whereas a low threshold produces a well-distributed map.

Since our NGT is smaller than the original GT, the iterative process continues until the maximum number of iterations, leading to a well-spread map.
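As a worked illustration with the Sect. 4.1 settings (SF = 0.05) and Dataset1 (n = 1600, D = 2), the two thresholds differ by roughly four orders of magnitude:

$$\begin{aligned} \mathrm{GT} = -\ln (1600) \times \ln (0.05) \approx 22.10, \qquad \mathrm{NGT} = \frac{-\ln (2) \times \ln (0.05)}{1600} \approx 0.0013. \end{aligned}$$

With NGT, the growing condition \(QE_{\max} > \mathrm{NGT}\) is therefore satisfied almost every iteration, so node generation continues up to \(t_{\max}\).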

4.3.3 Comparison of FMIG with MIGSOM, GSOM and FKCN

In this study, we executed the proposed FMIG algorithm on different databases with different numbers of iterations. The experimental results of the comparison with MIGSOM, GSOM and FKCN are presented in Tables 3, 4 and 5, for 100, 200 and 500 iterations respectively.

Table 3 Experimental results with 100 iterations

Table 3 shows that at 100 iterations the quantization and topographic quality of the two methods FMIG and MIGSOM are similar on the synthetic databases; in fact, the two methods give the same quantization error for Dataset3, which is a small dataset. On real data presenting varied geometric aspects, however, FMIG produces the best values compared to MIGSOM at similar map sizes. This improvement can be explained by the combination of MIGSOM's multilevel maps with the fuzzy aspect of FKCN, which lets FMIG produce a well-distributed map that accounts for the characteristics of the dataset. The results given by GSOM and FKCN are far from the optimal values found by our method on both synthetic and real datasets.

Table 4 Experimental results with 200 iterations
Table 5 Experimental results with 500 iterations

Executing the algorithms for 200 and 500 iterations shows that FMIG achieves better quantization and topological errors than MIGSOM, GSOM and FKCN (Tables 4, 5). Our fuzzy algorithm generates the best distribution, producing the minimum FTE values, especially on big real datasets. With the new NGT structure, which takes the data size and its characteristics into account, together with the updated predefined parameters, FQE and FTE reach their minimum values, improving the quantization quality of the map and preserving the topographic aspect of the datasets.

Analysing each database of Tables 4 and 5 separately, we discuss the results given by FMIG compared to those generated by MIGSOM, GSOM and FKCN. The map structure produced by FMIG preserves the distribution of Dataset1 with the minimum quantization and topological errors; the same holds for Datasets 2 and 3.

The real databases are even better preserved by our new algorithm in terms of quantization and topological errors; in fact, FMIG reaches its minimum values at 100 and 200 iterations compared to MIGSOM. We note that GSOM and FKCN become inefficient at training big data and reach their limits at 200 and 500 iterations, leading to unacceptable errors.

For example, MNIST is a big dataset with a single aspect of data distribution, which is why the results given by FMIG are more explicit and faithful to reality.

As shown in Tables 4 and 5, the difference between the crisp and fuzzy algorithms becomes significant as the growing iterations increase: 200 and 500 growing iterations of FMIG generate the best maps in terms of FQE and FTE compared with the crisp MIGSOM and GSOM. These optimal results follow from the new fuzzy form of the weight vectors (13), which takes the fuzzy membership degrees (10) into account. It is worth mentioning that GSOM and FKCN fail to map the databases while preserving the topology criterion; furthermore, these algorithms require considerable execution time on big data to complete their learning phase. The maps resulting from GSOM and FKCN training present large errors compared to FMIG, which can be explained by the limitations of GSOM (Ayadi et al. 2007; Amarasiri et al. 2004) and FKCN in learning big datasets. It is therefore clear that the results shown in the tables above are influenced by the number of iterations in the growing phase of the FMIG, MIGSOM, GSOM and FKCN algorithms.

5 Evaluation of FMIG clustering using validity indexes

In this study, we applied a three-stage clustering process to compare the performance of FMIG with MIGSOM, GSOM and FKCN. We first trained the synthetic and real datasets with the four algorithms for 100 growing iterations. The resulting prototypes were then clustered with the FCM algorithm. Finally, we applied six validity indexes: FVINOS (Tlili et al. 2014), PCAESN (Hu et al. 2004), SVI (Wu and Yang 2005), VOS (Pal and Bezdek 1992), Dunn (Dunn 1974) and XBI (Mingoti and Lima 2006). Table 6 presents the clustering evaluation of FMIG compared with the other algorithms.

Table 6 Clustering evaluation with validity indexes

The results in Table 6 show that the FCM partitions obtained from FMIG neural maps are optimal and yield the best number of clusters, as evaluated by the validity indexes. On databases presenting the multiple geometric aspects described in Table 1, FVINOS detects the optimal number of clusters from FMIG training as well as from MIGSOM, GSOM and FKCN training. In addition, the majority of indexes give their best results with FMIG prototypes, which can be explained by FMIG's ability to produce well-distributed maps that preserve the data characteristics. Moreover, on the noisy Dataset2, FMIG maps generate the best partition: all the indexes detect the outliers and give a good estimate of 13 clusters. The MIGSOM algorithm produces the best partition with FVINOS, PCAESN and VOS, but the other indexes are unable to detect the optimal number of clusters. As presented in Table 6, the GSOM and FKCN methods return the best partition only with FVINOS and PCAESN, because these indexes are able to detect noise (Tlili et al. 2014).

Overlapping datasets such as Dataset1, IRIS and MNIST are well clustered from FMIG maps. As shown in Table 6, all the indexes discover the optimal number of clusters for Dataset1. For IRIS, only XBI fails to give a good estimate with FMIG; the other indexes produce the best partition. MNIST, a big dataset, is well trained with FMIG, which yields 10 clusters in the resulting partition; the FVINOS, VOS, Dunn and PCAESN indexes are able to detect the overlap aspect of MNIST (Tlili et al. 2014). MIGSOM, GSOM and FKCN produce the best partition only with FVINOS and fail to detect the optimal number of clusters in the other cases.

The heterogeneous DNA dataset is well clustered by the FMIG partitioning, which produces the optimal number of clusters as evaluated by each of the indexes. MIGSOM, GSOM and FKCN give a good estimate with FVINOS and VOS, but the other indexes fail to detect the optimal result.

The clustering of huge databases such as Letter and NE proves that FMIG is able to produce the best distribution of prototypes for FCM clustering: with FMIG, the FVINOS, PCAESN, VOS and SVI indexes detect the best partition. MIGSOM, GSOM and FKCN give the optimal number of clusters with FVINOS but fail to produce good results with the other indexes, because they are limited in mapping huge data.

6 Conclusion

In this paper, we have presented a new form of fuzzy dynamic self-organizing map algorithm called FMIG. The new method improves the topological and quantization errors and generates its structure according to the form of the data; its fuzzy aspect adapts the growing process to real problems. We presented a New Growing Threshold (NGT) to improve the control of map growth and the training process, and we showed that the quantization and topological error values are minimized by applying it. FMIG achieves the goals of producing the best distribution of data by finding the topology corresponding to the form of the inputs, and of giving the minimum quantization error, thereby providing the best cluster structure. Furthermore, FMIG is able to generate good representations of big data.

Future research should focus on improving the performance of FMIG algorithm over high-dimensional datasets by incorporating some evolutionary techniques and providing a new form of Fuzzy evolutionary SOM.