Keywords

1 Introduction

Cardiovascular disease is one of the main reasons of death over the past decades in the world. Cardiac magnetic resonance imaging (MRI) has proven to be a versatile and noninvasive imaging modality. Some techniques are used for the diagnoses of heart diseases. The MRI of the left ventricle (LV) is very important for the assessment of stroke volume, ejection fraction, and myocardial mass, as well as regional function parameters such as wall motion and wall thickening [1]. To perform a quantitative analysis of a LV, doctors need an accurate segmentation of the LV which can provide the anatomical and functional information of a heart, so it can be widely applied in clinical diagnoses [2, 3]. The segmentation of cardiac MRI images is one of the most critical prerequisites for quantitative study of the LV.

So far, in clinical practice, the segmentation of the LV is almost completed manually. This workload, however, is too heavy and time-consuming, subjective and irreproducible. Therefore, it is attractive to develop accurate and automatic segmentation algorithms for clinical applications.

This paper is organized as follows. Section 2 briefly reviews the related works on the segmentation of LV. In Sect. 3, this paper introduces the basic theory of ELM. In Sect. 4, the image segmentation methods are introduced in detail. The experimental results of the segmentation of LV based on the ELM are presented in Sect. 5. In Sect. 6, the conclusions are offered.

2 Related Works

In recent years, many methods have been proposed for LV segmentation. They can be classified into two types: deformable models and image-based methods. A complete review of recent literature is given in [3].

Deformable models include snakes [46], level set [7, 8], and their variants [911]. Deforming curves are derived iteratively in accordance with the minimization of an energy function which is composed of a data-driven term. A random active contour scheme for automatic image segmentation was proposed in [12].

Image-based approaches include thresholding [13], pixel-based classification [1416], region-based and edge-based approaches. Otsu approaches [17] were employed by Lu [18] and Huang [19] in LV segmentation of cardiac MRI. However, the Otsu approach, which is based on the histogram of objects in the image, could be biased from the optimal threshold [20]. Dynamic programming (DP) approach is used to find boundaries of the LV in MRI images [21, 22]. However, due to the complex boundaries of the myocardium [23], the performance of the DP approach sometimes is poor in boundary extraction.

3 A Brief Introduction to the Extreme Learning Machine

The extreme learning machine (ELM) is a learning algorithm, whose speed can be thousands of times faster than traditional feedforward network learning algorithms, and which has better generalization performance [24].

Given N arbitrary different samples \( ({\text{X}}_{i} ,{\text{t}}_{i} ) \), where \( {\text{X}}_{i} = [{\text{x}}_{i1} ,{\text{x}}_{i2,} \ldots ,{\text{x}}_{{i{\text{n}}}} ]^{\text{T}} \in {\text{R}}^{\text{n}} \), and \( \text{t}_{i} = [t_{i1} ,t_{i2,} \ldots ,t_{{i\text{m}}} ]^{\text{T}} \in \text{R}^{\text{m}} \), standard SLFNs with M hidden nodes and activation function \( \text{g}(\text{x}) \) are modeled as

$$ \sum\limits_{i = 1}^{M} {\beta_{i} } \text{g}_{i} (\text{x}_{j} ) = \sum\limits_{i = 1}^{M} {\beta_{i} } \text{g}(\text{w}_{i} \cdot \text{x}_{j} + \text{b}_{i} ) = \text{o}_{j}\quad \left( {j = 1, \ldots ,N} \right) $$
(1)

where M is the number of the hidden layer nodes,\( \text{w}_{i} = [w_{{i{1}}} ,w_{i2} , \ldots ,w_{in} ]^{\text{T}} \) is the input weight vector, \( \beta_{i} = [\beta_{{i{1}}} ,\beta_{{i{2}}} , \ldots ,\beta_{im} ]^{\text{T}} \) is the output weight vector, and \( b_{i} \) is the threshold of the ith hidden node. \( \text{w}_{i} \cdot \text{x}_{j} \) is the inner product of \( \text{w}_{i} \) and \( \text{x}_{j} \) [24].

If only the activation function is infinitely differentiable, the input weights and hidden layer biases can be randomly generated [24]. All the parameters of SLFNs need to be adjusted; training an SLFN is simply equivalent to finding a least squares solution \( \hat{\beta } \) of the linear system \( \text{H}\beta = \text{T}{:} \)

$$ \left\| {\text{H}(\text{w}_{1} , \ldots ,\text{w}_{M} ,b_{1} , \ldots ,b_{M} )\hat{\beta } - \text{T}} \right\| = \hbox{min} \left\| {\text{H}(\text{w}_{1} , \ldots ,\text{w}_{{\tilde{N}}} ,b_{1} , \ldots ,b_{{\tilde{N}}} )\beta - \text{T}} \right\|. $$
(2)

If the number \( M \) of the hidden nodes equals the number N of distinct training samples, matrix H is square and invertible, and SLFNs can approximate these training samples with zero error. However, in most cases the number of hidden nodes is much less than the number of distinct training samples, \( M \ll N \), H is a non-square matrix and there may not exist \( \text{w}_{i} ,b_{i} ,\beta_{i} \) such that \( \text{H}\beta = \text{T} \)where

$$ {\text{H}}\left( {{\text{w}}_{1} , \ldots ,{\text{w}}_{M} ,b_{1} , \ldots ,b_{M} ,{\text{x}}_{1} , \ldots ,{\text{x}}_{N} } \right) = \left[ {\begin{array}{*{20}c} {{\text{g}}\left( {{\text{w}}_{1} \cdot {\text{x}}_{1} + b_{1} } \right)} & \cdots & {{\text{g}}\left( {{\text{w}}_{M} \cdot {\text{x}}_{1} + b_{M} } \right)} \\ \vdots & \cdots & \vdots \\ {{\text{g}}\left( {{\text{w}}_{1} \cdot {\text{x}}_{N} + b_{1} } \right)} & \cdots & {{\text{g}}\left( {{\text{w}}_{M} \cdot {\text{x}}_{N} + b_{M} } \right)} \\ \end{array} } \right]_{N \times M} , $$
(3)
$$ \beta = \left[ {\begin{array}{*{20}c} {\beta_{1}^{\text{T}} } \\ \vdots \\ {\beta_{M}^{\text{T}} } \\ \end{array} } \right]_{M \times m} \quad {\text{and}}\quad {\text{T}} = \left[ {\begin{array}{*{20}c} {{\text{t}}_{1}^{\text{T}} } \\ \vdots \\ {{\text{t}}_{N}^{\text{T}} } \\ \end{array} } \right]_{N \times m} $$
(4)
$$ \hat{\beta } = \text{H}^{\dag } \text{T}, $$
(5)

where \( H^{\dag } \) is the Moore–Penrose generalized inverse of matrix H. The ELM algorithm [24] can be described as follows [25].

Algorithm: ELM

1: for i = 1 to M do

2: randomly assign input weight \( \text{w}_{i} \)

3: randomly assign bias \( b_{i} \)

4: calculate H

5: calculate \( \hat{\beta } = \text{H}^{\dag } \text{T} \)

4 Methods

The whole algorithm of this image segmentation includes pre-processing techniques, training ELM, classification and post-processing as shown in Fig. 1.

Fig. 1
figure 1

Workflow of the proposed segmentation algorithm

4.1 Pre-processing Training Data and Training ELM

The pre-processing procedure of training data consists of the following steps:

  1. 1.

    Three typical images in cardiac MRI were selected as sample images.

  2. 2.

    All the pixels were selected from the LV of the ground truth, which were labeled as 1. The region of the LV was extended, and then all the pixels were selected from the extended region, which were labeled as 0 [26].

  3. 3.

    In addition to gray level value, gray mean value and gray median, representative features related to gray level co-occurrence matrix(GLCM) [27] such as energy, contrast, correlation entropy and inverse from 4 directions via a 5 × 5 window were used to represent a pixel, amounting to 23-dimentional features. Feature vectors of all pixels of an image were concentrated to generate a feature matrix.

  4. 4.

    Pre-processing procedure of each image includes steps 2–4, three feature matrices were merged into a feature matrix at last, whose values are normalized to [0, 1].

The training ELM is to find optimal parameters. The ELM kernel used in the pro-posed algorithm is Sigmoid function and the number of hidden nodes is 100, which are selected through multiple trials and means the optimal performance. Owing to the randomness of the input weights and hidden layer biases, in order to find the optimal input weights and hidden layer biases to gain optimal segmentation performance, therefore, the segmentations of images were performed over and over.

4.2 Pre-processing Testing Data

The pre-processing procedure of testing data consists of the following steps:

  1. 1.

    A fitting threshold of the image was found using the Otsu method, and then the original image was converted into a binary image.

  2. 2.

    Little objects in the binary image were removed, whose areas are less than 300 pixels. The contours of the remaining objects were depicted.

  3. 3.

    Due to the shape of LV approximating a circle, the roundness of each remaining object was calculated, and then the object which owned the biggest roundness is the LV, the LV was located approximately.

  4. 4.

    Usually, the errors of the contour of the LV aren’t satisfactory, therefore, the morphological methods were used to process the contour, and as a result, the processed LV con tour included almost all the pixels of the LV regarded as the testing pixels set.

  5. 5.

    By the same methods (introduced in Sect. 4.1) 23-dimentional features of each pixel of the testing pixels set were extracted to generate a feature matrix.

4.3 Classification and Post-processing

All pixels were classified into two classes by using the trained ELM, namely one class belongs to the LV and the other one belongs to the non-LV. The LV contour was depicted and smoothed using the open-close operation in mathematical morphology, the segmentation results are shown in Fig. 2.

Fig. 2
figure 2

ELM segmentation results of images (ad). Green and red contours are obtained with manual and automatic segmentation, respectively

5 Results

5.1 Evaluation Measures

In this paper, a convenient tool for researchers is adopted to test and compare the segmentation results of the proposed method; the level set method and the SVM method easily and objectively. From the point of view of accuracy, several measures are used in our experiments, including mean absolute deviation (MAD), maximum absolute deviation (MAXD) and Dice Similarity Coefficient (DSC).

5.2 Performance

In this section, the performance of LV segmentation based on an ELM is studied through evaluating its efficiency and effectiveness. The algorithms are coded in MATLAB. All experiments are conducted on a 3.2-GHz PC with 4G memory running window 7. Real datasets are used in the experiments. The dataset includes a total of 1000 images whose sizes are 216 × 256 or 256 × 216.

Figures 2, 3 and 4 illustrate the segmentation results of the proposed method, the level set method and the SVM method of images (a), (b), (c), (d), respectively. The segmented LVs are delineated by red pixels. Green contours are obtained by manual segmentation. Many experiments were conducted on image segmentation based on the ELM. Among these experiments, the optimal performance is obtained, when the number of hidden layer nodes equals 100, and activate function is Sigmoid function.

Fig. 3
figure 3

Level set segmentation results of images (ad). Green and red contours are obtained with manual and automatic segmentation, respectively

Fig. 4
figure 4

SVM segmentation results of images (ad). Green and red contours are obtained with manual and automatic segmentation, respectively

Table 1 lists mad, maxd, dice, and time of our proposed method, the level set method and the SVM method about 24 images, respectively. From Table 1, it can be seen that the mean mad of proposed method is about 0.64 pixels, its standard deviation is about 0.18 pixels, the mean maxd of the proposed method is about 2.25 pixels, its standard deviation is about 0.71 pixels and the mean dice of the proposed method can reach up to 92 %, its standard deviation is about 2.67 %. The mean computation time of the proposed method is 0.36 s.The proposed method runs about 25 times faster than the level set method, about 7 times faster than the SVM method. The dice of image segmentation based on the ELM is about 8 and 2 % higher than that of the image segmentation based on the level set and the SVM, respectively. The standard deviation of our proposed method is the lowest in all three methods.

Table 1 Segmentation accuracy and speed on the ELM, the level set, the SVM

In order to further evaluate the performance of the proposed method, the local distributions of segmentation errors and the similarity between the segmentation and the ground truth have been illustrated in Fig. 5, respectively. The boxplots indicate the median, lower and upper quartiles of mad, maxd, dice of the above three methods directly. It can be seen from Fig. 5 that the proposed method outperforms the other two methods since it obtains the higher dice and the lower mad and maxd than the other two methods.

Fig. 5
figure 5

Box plots of mad (a), maxd (b), and dice (c) between the ELM method, the level set method and the SVM method

6 Conclusions

This paper describes a new automatic LV segmentation method based on an ELM in cardiac MRI images in short-axis MRI view. This method takes into account the intensity inhomogeneity which often occurs in the left ventricular cavity and may cause many difficulties in image segmentation. Some images processing methods such as the morphological dilatation and the open-close operation in mathematical morphology have been used in the procedures of segmenting MRI images. Firstly, in accordance with background and foreground all the sample pixels were divided into two classes and then 23-dimensional features of each pixel were extracted to generate a feature matrix. Secondly, an ELM was trained using the feature matrix. Finally, the LV image was segmented using the trained ELM. Experimental results show that the mean speed of LV segmentation based on the ELM is about 25 times faster than that of the level set method, about 7 times faster than that of the SVM method, and in terms of the mad metric, maxd metric and dice metric, the image segmentation based on the proposed method is slightly better than those of the level set method and the SVM method. The results of this study prove that the proposed method is efficient and satisfactory for segmentation of LV.