Keywords

1 Introduction

Multiple sclerosis (MS) is a chronic inflammatory demyelinating disease of the central nervous system. Magnetic resonance imaging (MRI) detects lesions in MS patients with high sensitivity but low specificity, and is used for diagnosis, prognosis and as a surrogate marker in MS trials [1, 2]. In this article, we are interested in the brain MRI analysis within the context of following up the patients suffering from Multiple Sclerosis (MS).

The segmentation of various tissues and structures in medical images in a robust and efficient manner is of crucial significance in many applications, such as the identification of brain pathologies in Magnetic Resonance (MR) images [3]. The Magnetic Resonance Imaging (MRI) has widely contributed in the establishment of new knowledge about the Multiple sclerosis that allowed the clinicians to significantly improve effective therapeutic approaches. The Magnetic Resonance Imaging (MRI) is one of the complementary examinations in this disease’s diagnosis approach. It plays also a key role in the patient’s state follow-up and the quantification of a response after having taken medicines. So, the automatic extraction of quantifiers for the Multiple Sclerosis has many potential applications, in the clinical as well as pharmaceutical fields. On the other hand, reading those images is difficult due to the variability in size, contrast and lesions’ localization.

The appearance of new lesions or the raising of ancient patches detected by MRI constitutes one of the acknowledged criteria for definitive diagnosis. If the MRI provides essential information on the appearance of lesions of the white substance (Multiple sclerosis), the evolution of the lesions and their consequences on the clinical state of the patient remain weakly correlated. This observation has underlined the concept of “the clinico-radiological paradox” [4]. Besides, the paradigm of a disease primarily inflammatory is contested at present by the hypothesis of a neurodegenerative pathogen which is reinforced by the observation, that the bouts do not influence the progression of an irreversible handicap. In fact, some recent histological works [5] have clearly shown that, besides focal inflammatory and demyelinating lesions disseminated in the white substance, there is a spread and progressive attack of the whole CNS, in both of the white substance of normal appearance and the grey substance, expressed in microscopic manner by the axonal loss and the tissue atrophy.

In this paper, we focus our studies on Brain MR imaging of the brain and we propose an automatic method of segmentation to detect the lesions of MS. Our algorithm of segmentation is composed of three stages: segmentation of the brain into regions using the algorithm FCM (Fuzzy C-Means) in order to obtain the characterization of the different healthy tissues (White matter, grey matter and cerebrospinal fluid (CSF)). In the second stage, we eliminate the atypical data of the white matter by the Optimization algorithm PSOBC (Particle Swarm Optimization-Based image Clustering). Finally in the third stage, a decision is made to use of a Mamdani-type fuzzy model composed of a group of fuzzy rules “if…. then” to extract the MS lesions among all the absurd data.

We present in the second section the related work. We will present the steps of the automatic method of segmentation of the proposed approach to detect the lesions of MS in the third section. The fourth section will present the results obtained on the MRI images. Finally, we will finish by a conclusion and future work in the fifth section.

2 Related Work

A variety of approaches to MS lesion segmentation have been proposed in the literature. Generally speaking, they can be classified into two groups: outlier-based and class- based methods.

In outlier-based methods [610], MS lesions are treated and detected as the outliers to the normal brain tissue distribution, which is usually modelled with a Finite Gaussian Mixture (FGM) of CSF, GM and WM classes. Van Leemput et al. [6] pioneered this approach. Under their framework, MR field inhomogeneities, parameters of the Gaussian distribution and membership are computed iteratively, with the contextual information being incorporated using a Markov random field. Observed intensity values whose Mahalanobis-distances exceed a predefined threshold are marked as lesions. The thresholds are empirically set in this work. Bricq et al. [11] applied neighborhood information during the inference process using a Hidden Markov Chain model and outliers were extracted using the Trimmed Likelihood Estimator (TLE) [12]. This approach was evaluated on real data including MS lesions using T1 and FLAIR MR images.

Class-based methods [1318] model the lesions as an independent class to be extracted. In [14], a combination of intensity-based k-nearest neighbor classification (k-NN) and a template-driven segmentation (TDS+) was designed to segment different types of brain tissue. Lesions are modeled as one of the expected tissue types, and the class parameters are obtained through an operator supervised voxel sampling on two randomly selected scans. Since the manual training step is highly data-dependent, it is expected be conducted for each study or data set. A similar approach was proposed in [7]. The segmentation method determines for each voxel in the image the probability of being part of MS-lesion tissue, and the classification is conducted also based on K-NN algorithm. Voxel intensities and spatial information are integrated as discriminative features, and voxels are classified based on their proximity to the pre-classified samples in the feature space. It should be noted that manual or semiautomatic training is normally a required step in k-NN based methods, and the value of k (number of classes) has to be determined in advance, either interactively [14] or empirically [7].

One should note that in the class-based approaches [7, 1416, 18], a training procedure, to either calibrate the classifier parameters or to choose the tissue class representatives, is normally required. In order to obtain desired segmentation results, the testing data sets are also expected to be highly similar to the training sets, ideally from the same group. Outlier-based models [610] relax the training requirement, but they usually subsume a thresholding step. Those thresholds, critical for segmentation performance and system reproducibility, usually require certain prior to be set up precisely, which are often difficult to be determined.

Image segmentation is the process of partitioning a digital image into non-overlapped homogeneous regions with respect to some characteristics, such as gray value, motion, texture, etc. Image segmentation is used in various applications like medical imaging, locating objects in satellite images, face recognition, traffic control systems, and machine vision, etc. [19]. Several techniques for image segmentation have been proposed [20]. They can be classified into region based approaches [21, 22] and edge detection based approaches [23]. In the present work, we are focused on the region based approach using fuzzy clustering algorithm (soft clustering), instead of hard clustering strategies. In the latter, each data point is assigned to only one cluster, while in soft clustering each data point belongs to all clusters with different degrees of membership, thus taking a better account for poor contrast, overlapping regions, noises, and intensity inhomogeneities [24].

3 Proposed Approach

In classical methods, each voxel of the brain is assigned to one of four following classes: WM, GM, CSF, or MS lesions [10]. The Fig. 1 shows the processing sequence proposed for the segmentation of MS lesions. The images are noisy, the inhomogeneities are corrected and all images are registered in the same space. So, our segmentation algorithm can be decomposed into three main steps:

Fig. 1
figure 1

General architecture of the steps of the automatic segmentation of MS lesions

  1. 1.

    Segmentation of brain tissues into three classes (WM, GM, CSF)

  2. 2.

    Segmentation of the white matter to extract the atypical data

  3. 3.

    Decision-making

3.1 Segmentation of the Brain Tissues

The segmentation of the brain tissues into different compartments (white matter (WM), gray matter (SG) and cerebrospinal fluid (CSF)) is a key step in our study. The outcome of this segmentation serves as the basis for implementing lesion-handling based strategies.

The first question we faced in order to achieve this task is whether a supervised or a non-supervised algorithm will be employed for this purpose.

The use of a supervised algorithm requires a learning database for each class and each patient, which restricts applicability of no such labelled data is fully available as reported in [25]. Given the context of our study, we have chosen a non-supervised based approach.

Fuzzy c-means algorithm

Motivation grounds. Motivated by its reported success in various fields, i.e., agricultural engineering, astronomy, chemistry, geology, image analysis, medical diagnosis [26, 27], its reduced complexity, easy implementation, especially for large data but also its blurring (integration degree of membership) [28], we advocate in this paper the use of fuzzy c-means (FCM) approach for clustering, which consisted in our case in separating the three classes (white matter (WM), gray matter (SG) and cerebrospinal fluid (CSF)). On the other hand, the FCM algorithm has been widely used for segmentation brain images, regardless of the modality and the type of acquisition (mono or multimodal) and many studies have been performed including imaging magnetic resonance [24, 29, 30].

Formulating of FCM algorithm. FCM is a fuzzy clustering method based on the minimization of a quadratic criterion where clusters are represented by their respective centers [31]. More specifically, for a set of data patterns X = {x 1 , x 2 , …, x N } the fuzzy c-means clustering algorithm allows us to partition the data space, by calculating the centers of classes (c i ) and the membership matrix (U), by minimizing an objective function J with respect to these centers and membership degrees

$$ J = \sum\limits_{i = 1}^{C} {\sum\limits_{j = 1}^{N} {\left( {u_{ij} } \right)} }^{m} d^{2} \left( {x_{j} ,c_{i} } \right) $$
(1)

Under constraints:

$$ \forall j \in \left[ {1,N} \right]:\,\sum\limits_{i = 1}^{C} {u_{ij} = 1} \,\forall i \in \left[ {1,C} \right],\forall j \in \left[ {1,N} \right]:\,u_{ij} \in \left[ {0,1} \right] $$
(2)

where

U = [uij] C×N is the membership function matrix,

d(x j , c i ) is the metric which calculates the distance between the element x j and the center of cluster c i ,

C is the number of clusters,

N is the number of data,

m is the degree of fuzziness (m > 1).

The problem of minimizing the objective function (1) under the constraints (2) is solved by converting the problem to an unconstrained one using the Lagrange multiplier. Both centers of classes and membership degrees cannot be found directly at the same time, so an alternating procedure is used. Firstly, the prototype of classes are fixed arbitrary to find the membership degrees, secondly, the membership degrees are fixed to find the centers corrected. These two steps are alternatively repeated until convergence is attained.

The fuzzy c-means clustering algorithm proceeds according to Algorithm 1.

In image segmentation, x i can represent the gray value of the ith pixel, N is the number of pixels of the image, C is the number of the regions (clusters), d 2(x i , c j ) is the Euclidean distance between the pixel x i and the center c j and u ij is the membership degree of pixel x i in the jth cluster.

3.2 Segmentation of the White Matter

Although it is not always correlated with the clinical disability as shown in other studies [32, 33], the load lesional constitutes the primary indicator of inflammatory phenomena. However, the infringement predominantly inflammatory present in the WM is probably in relationship with the mechanisms of degeneration and achievement, axonal. As well, the measurement of the load lesional informs us about the degree of achievement of the WM in the course of the disease.

Particle swarm optimization algorithm. The next stage in our methodology is the segmentation of the Multiple Sclerosis lesions. although the segmentation of the multiple sclerosis lesions provides an excellent contrast for the different tissues of the brain parenchyma (white matter, grey matter, CSF). The lesions of the multiple sclerosis are not always well contrasted and their segmentation has become more difficult due to the partial volume with the surrounding tissues. In order to elaborate a segmentation system which accommodates our large variety of images, we consider the segmentation as an optimization problem. The latter makes use of particle swarm optimization (PSO) based algorithm. This is motivated by: its simplicity and proven efficiency in similar other segmentation tasks as reported in [34]. Besides, its rapid convergence and ability to deal with high dimension dataset, which enable to fly around the solution space effectively, have been pointed out in [35, 36].

More formally, particle swarm optimization (PSO) is a population-based stochastic optimization algorithm proposed for the first time by Kennedy and Eberhart [37], inspired by bird flocking and fish schooling. The problem is tackled by considering a population (particles), where each particle is a potential solution to the problem. Initial positions and velocities of the particles are chosen randomly. In the commonly used standard PSO, each particle’s position is updated at each iteration step according to its own personal best position and the best solution of the swarm. The evolution of the swarm is governed by the following equations:

$$ V^{{\left( {k + 1} \right)}} = w.V^{\left( k \right)} + c_{1} .rand_{1} .\left( {Pbest^{\left( k \right)} - X^{\left( k \right)} } \right) + c_{2} .rand_{2} .\left( {Gbest^{\left( k \right)} - X^{\left( k \right)} } \right) $$
(3)
$$ X^{{\left( {k + 1} \right)}} = X^{\left( k \right)} + V^{{\left( {k + 1} \right)}} $$
(4)

where

  • X is the position of the particle,

  • V is the velocity of the particle,

  • W is the inertia weight,

  • Pbest is the best position of the particle,

  • Gbest is the global best position of the swarm,

  • rand 1 , rand 2 are random values between 0 and 1,

  • c 1 , c 2 are positive constants which determine the impact of the person al best solution and the global best solution on the search process, respectively,

  • k is the iteration number.

PSO-Based Image Clustering. In the context of clustering, a single particle represents the C cluster centroid vectors. That is, each particle xi is constructed as follows:

$$ X_{i} = (x_{i1} , \ldots ,x_{ij} , \ldots ,x_{{iC_{{}} }} ) $$
(5)

where \( x_{ij} \) refers to the j-th cluster centroid vector of the i-th particle in cluster Cij. Therefore, a swarm represents a number of candidate clustering’s for the current data vectors. The fitness of particles is easily measured as the quantization error,

$$ J_{e} = \frac{{\sum\limits_{j = 1}^{C} {\left[ {\sum\limits_{{\forall x_{i} \in C_{j} }} {d\left( {x_{i} ,c_{j} } \right)} } \right]/} N_{j} }}{C} $$
(6)

where

$$ d(x_{i} ,c_{j} ) = \sqrt {\sum\limits_{k = 1}^{{N_{d} }} {\left( {x_{ik} - c_{jk} } \right)^{2} } } $$
(7)

And

  • N d denotes the input dimension, i.e. the number of parameters of each data vector

  • N denotes the number of WM image pixels

  • C denotes the number of cluster centroids (as provided by the user)

  • x i denotes the coordinates of pixel i

  • c j denote the means of cluster j.

  • \( N_{j} \) is the number of pixels in \( C_{j} \)

The PSO based image clustering algorithm is summarized below

where tmax is the maximum number of iterations.

3.3 Decision-Making

The last step consists in the decision-making concerning the belonging of a voxel of the white matter to a class of the MS disease. The MS lesions appear in hypo and hyper-signal in comparison to the WM according to the MRI methods. The weighted images in T2 and PD underline the myelin component in the lesions characterized by the edemas with hyper-intense appearance in comparison to the white matter. Furthermore, the method T1 underlines the irreversible destruction of the tissues with the appearance in the white matter of persistent “black holes” (Hypo-signal) [38]. For this purpose, we propose the use of the Mamdani’s fuzzy inference process. This fuzzy model is composed of:

Fuzzification stage (Fig. 2). The system has two inputs, the weighting contrast of MRI and the type of the voxels’ signal. And a signal of output which diagnose the Multiple Sclerosis disease.

The steps of fuzzification consist in setting the belonging functions, the steps of which are:

  1. 1

    To set the linguistic variables

  2. 2

    To set the fuzzy quantifiers (Number of the linguistic values);

  3. 3

    To assign a digital signification to each fuzzy quantifier: belonging function

For the fuzzification of the contrast, we choose three fuzzy intervals and belonging functions of the Gaussian types. Figure 3 shows the fuzzy repartition of contrast input variable.

Fig. 2
figure 2

Diagram of fuzzy system of the MS disease

For the fuzzification of signal’s type, we choose two fuzzy intervals and belonging functions of Gaussian types. Figure 4 shows the fuzzy repartition of the input variable of signal’s type.

Fig. 3
figure 3

Fuzzy repartition of contrast input variable

For the output variable, we choose three fuzzy intervals and Gaussian membership functions, which define predicates: low, normal and high of the MS disease in comparison to the white matter. Figure 5 shows the fuzzy repartition of the output variable of the decision of the MS disease.

Fig. 4
figure 4

Fuzzy repartition of input variable of signal’s type

Fuzzy rule base. In the view of intuitive interpretation of the input variables, some fuzzy rules can be set up manually, for instance:

Here are ten examples of the rules’ base represented in a linguistic form:

  1. 1.

    If [(the contrast weighted at T1) and (the zones are of high intensity)] then (the MS is low).

  2. 2.

    If [(the contrast weighted at T1) and (the zones are of high intensity)] then (the MS is normal in comparison to the white matter).

  3. 3.

    If [(the contrast weighted at T2) and (the zones are of high intensity)] then (the MS is high).

  4. 4.

    If [(the contrast weighted at PD) and (the zones are of high intensity)] then (the MS is high).

  5. 5.

    If [(the contrast weighted at T1) and (the zones are of low intensity)] then (the MS is low).

  6. 6.

    If [(the contrast weighted at T2) and (the zones are of low intensity)] then (the MS is high).

  7. 7.

    If [(the contrast weighted at PD) and (the zones are of low intensity)] then (the MS is high).

  8. 8.

    If [(the contrast weighted at T1) and (the zones are of high intensity after injection of gadolinium)] then (the MS is normal in comparison to the WM).

  9. 9.

    If [(the contrast weighted at T2) and (the zones are of high intensity after injection of gadolinium)] then (the MS is high).

  10. 10.

    If [(the contrast weighted at PD) and (the zones are of high intensity after injection of gadolinium)] Then (the MS is high).

Table 1 summarizes the set of exhibited fuzzy rules.

Table 1 Fuzzy rule base

The selected inference method is Mamdani’s method. Consequently, the operator is realized by the calculation of the minimum, whiles the operator OR is realized by the calculation of the maximum.

The defuzzification step is done using the method of calculating the center of attraction.

4 Results and Discussion

To validate the developed method, we relied on the BrainWeb database (http://www.bic.mni.mcgill.ca/brainweb/). This database was chosen as it is frequently used in the literature and thus allows providing an easier comparison with alternative methods proposed in literature.

The actual images on which we worked were acquired as part of the collaboration between LSI laboratory (Laboratory Intelligent Systems: image and signal team) Ferhat Abbas University of Setif and LAMIH UMR CNRS 8201 (Laboratory of Industrial and Human Automation control, Mechanical engineering and Computer Science) University of Valenciennes Cedex 9 France (Figs. 5 and 6).

Fig. 5
figure 5

Fuzzy repartition of the output variable giving the decision of the MS disease

Fig. 6
figure 6

Original Images

4.1 Analysis of the Results

It concerns the images weighted at T1, T2 and Proton density (Pd) for different old patients (Pixel size = 1 mm, matrix size 512 × 512). The images are in the form of DICOM (Digital Imaging and Communications in Medicine). The selection of the proposed noise on site BrainWeb is between 0 and 9 % and that of the heterogeneity of the setting values between 0 and 40 %. The brain segmentation was successfully applied on some real images, the results are shown in the following figures

Step 1: segmentation of tissues (WM, GM and CSF)

The Fig. 7 show the segmentation by the FCM algorithm for the T2 weighted image in order to obtain a characterization of the different healthy tissues (White matter, Grey matter and cerebrospinal fluid)

Fig. 7
figure 7

Image T2 segmented by FCM

To compare the performance of these images, we compute different coefficients reflecting how well two segmented volumes match. We used different performance measures:

$$ Overlap\quad \quad (ovrl) = \frac{TP}{TP + FN + FP} $$
(8)
$$ Similarity\quad \quad (SI) = \frac{2.TP}{2.TP + FN + FP} $$
(9)

where TP and FP stand for true positive and false positive, which were defined as the number of voxels correctly and incorrectly classified as brain tissue by the automated algorithm. TN and FN stand for true negative and false negative, which were defined as the number of voxels correctly and incorrectly classified as non-brain tissue by the automated algorithm. The comparative results are presented in Table 2 below:

Table 2 Comparison of segmentation results

The results in Table 2 show a considerable improvement for all tissues using T2 than T1 and PD.

Step 2: segmentation of the white matter by PSOBC

The use of PSOBC allows us to eliminate the atypical data of the white matter for each image (T1, T2, PD) as exhibited in Fig. 8 for image T2.

Fig. 8
figure 8

WM of the image T2 segmented by PSOBC

Table 3 summarizes the segmentation outcome by PSOBC

Table 3 Comparative results

The results obtained by PSOBC are very satisfactory and well confirm the validity of the algorithm, its ease of implementation gives it a substantial advantage. We have made an improvement in optimizing the white matter and atypical data (Table 4).

Table 4 Result of the defuzzification of the atypical data of the different sequences

Step 3: Decision-making

However, in complex structure, we can not make a final decision because of the blur. For this, the selected inference method is Mamdani’s method. Consequently, the operator is realized by the calculation of the minimum, whiles the operator OR is realized by the calculation of the maximum.

The following table presents the results of defuzzification:

The decision-making depends always on the expertise, the patient suffers from the multiple sclerosis and the MS lesions are detected in all the sequences by a normal or a high characterization.

4.2 Experimental Results

The Fig. 9 shows the results obtained after segmentation of the images (a) weighted T2 on axial plane. The images (b), (c), (d) and (e) are the results of segmentation realized by the expert, FCM, PSOBC and the approach successively proposed.

Fig. 9
figure 9

Pathological image (a) and its segmentation gotten by (b) segmentation by the expert; c FCM; d PSOBC and e Proposed approaches

The results of each stage of the segmentation are presented on a sectional level (Fig. 6) in which the localization allows distinguishing three separated classes of the tissue:

  • GM (Pallidum, Putamen, caudate nucleus, thalamus and cortex)

  • WM (brain parenchyma).

  • CSF (subarachnoid space, lateral ventricles and V3).

The interpretation of our results is done by an expert (hospital center of Ain Naadja Algiers) on simulated and real images. By analyzing the images of the Fig. 9, the expert has established the following statement:

  • Image (b): The interpretation of the classes is totally improved in relation to (FCM, PSOBC), we notice the distinction between the 03 classes of the brain and the class of the pathology SEP.

  • Image (c): The class CSF does not conform to the class of the original image. The lack of information about the small grooves (image (a)) and the poor discrimination CSF/GM make that the segmented CSF class does not well represent the fluid distribution. The distributions of the WM and GM get closer to those given by the original image. The detection of the pathology is indicated according to the expert but the details are not well expressed.

  • Image (d): PSOBC is unsuitable in this segmentation in relation to the image (o).

  • Image (e): the proposed approach brings a great performance to the segmentation for the three classes and especially for the fourth one which is the pathology that specifies well the size and the details about this later.

We compare the T2 weighted RM Images between the segmentation by the segmentation realized by the expert, FCM and PSOBC for a given time of acquisition and the segmentation by the proposed approach.

The results of the Table 5 and the Fig. 10 underline the advantages of the proposed approach in comparison to the segmentation by FCM and PSOBC for all tissues CSF, WM, GM and MS lesions.

Table 5 Comparison of the results gotten by different algorithms
Fig. 10
figure 10

Performance measures of the results gotten by different algorithms

5 Conclusion

In this article, we have proposed a new automatic approach of segmentation of the MS lesions’ images. Two reasons explain their difficulties:

  1. 1.

    The first is that there is a very wide variety of abnormal tissues which are different in their size, shape, position and composition.

  2. 2.

    The second reason is that the datum issued from the MRI acquisition is sensitive to the noise background noise and to the sampling collection.

We have firstly split up the process of automatic segmentation of the MS lesions into three fundamental stages:

Firstly, we segmented the brain into regions by using the algorithm FCM (Fuzzy C-Means) in order to obtain the characterization of the different healthy tissues (White matter, Grey matter and cerebrospinal fluid (CSF). Secondly, we eliminated of the atypical data of the white matter by the optimization algorithm PSOBC (Particle Swarm Optimization -Based image Clustering). Finally, in the framework of our application on the MS disease, we used a Mamdani-type fuzzy model to make decision of the MS disease. We presented the results of our work consisting in the use of an algorithm for the segmentation if medical images in order to improve the quality of the MS lesions’ detection. The good quality of our solutions depends on the fact that:

  • It is a method totally automatic due to the modeling of the prior knowledge of the neuroradiology experts. the fuzzy theory is important for modeling the human knowledge using the mathematical functions and to solve the effect of the partial volume of the MRI.

  • It satisfies the application’s constraints due to the automaticity and the different final results which may be provided by the fuzzy 3D reconstruction.

  • Its performance is better than the performance of the supervised method.

  • It is a system based on fuzzy and optimization theory

  • It is efficient on 2 types of tissues at least.