1 Introduction

Cardiovascular disease was the leading cause of death among chronic non-communicable diseases in 2012, responsible for 17.5 million deaths, or 31% of all global deaths. Of these, an estimated 7.4 million were due to coronary heart disease and 6.7 million to stroke [1]. Cardiac magnetic resonance imaging (MRI) has proven to be a versatile and noninvasive imaging modality. MRI of the left ventricle (LV) is important for assessing stroke volume, ejection fraction, and myocardial mass, as well as regional function parameters such as wall motion and wall thickening [2]. To perform a quantitative analysis of the LV, clinicians need an accurate segmentation, which provides the anatomical and functional information of the heart and can therefore be widely applied in clinical diagnosis [3]. LV segmentation in cardiac MRI images is thus one of the most critical prerequisites for quantitative study of the LV.

So far, in clinical practice, LV segmentation is still largely performed manually. Manual segmentation, however, is heavy and time-consuming work, subjective, and prone to intra- and inter-observer variability. It is therefore attractive to develop accurate, automatic segmentation algorithms for clinical diagnosis and treatment. However, automatic LV segmentation from cardiac MRI images faces several challenges: blood flow causes heterogeneous brightness within the heart chambers; the intensities of papillary muscles are similar to those of the myocardium; apical and basal slices are difficult to segment; the limited resolution of cardiac MRI introduces partial volume effects; cardiac MRI carries intrinsic noise; the heart moves and intensity is inhomogeneous; and the shape and intensity of the heart chambers vary considerably across patients, notably in pathological cases [4,5,6,7]. Owing to these technical barriers, automatic LV segmentation from cardiac MRI remains a challenging problem. Shortcomings of classical LV segmentation methods, i.e., shrinkage, leakage, and sensitivity to initialization, still need to be solved. Furthermore, existing LV segmentation methods focus mainly on segmentation accuracy; in terms of accuracy, our proposed method is comparable to those reported in previous studies. In the context of big data, however, improving accuracy alone is not enough: the training and segmentation times of our proposed method are also extremely short.

Machine learning methods for medical image analysis have addressed this issue by estimating more complex shape and appearance models from annotated training data [8,9,10]. However, the accuracy and time requirements of clinical applications usually mean that these models must be quite complex so that they can learn all appearance and shape variations found in the annotated training data; as a result, the training data have to be large and rich. But acquiring comprehensive annotated training data is a particularly difficult task [11, 12]. Therefore, to reduce both the model complexity and the requirement for large and rich training data, we propose an automatic, accurate and robust LV segmentation method that combines a Hierarchical Extreme Learning Machine (H-ELM) algorithm [13] with an automatic LV location technique for cardiac MRI. In recent years, the Extreme Learning Machine (ELM) [14, 15] has attracted considerable attention. As a powerful classification algorithm, it offers faster learning speed and better generalization performance than traditional feedforward network learning algorithms. The ELM and its variants [16,17,18] have been widely applied in many fields. However, due to its shallow architecture, its feature learning may not be effective for natural signals (e.g., images/videos) [13]. To tackle this problem, the H-ELM framework [13] was proposed, which is composed of two main parts, (1) self-taught feature extraction followed by (2) supervised feature classification, bridged by randomly initialized hidden weights. The H-ELM produces more compact and meaningful feature representations than the original ELM and achieves better generalization with faster learning speed [13]. Meanwhile, the automatic location method makes the best of the spatiotemporal continuity of MRI images to improve segmentation accuracy and reduce segmentation cost. The contributions of the proposed work are as follows.

  1. A new automatic LV location method is proposed. Once the segmentation result of the previous slice is obtained, the segmentation scope of the current slice is determined by applying morphological dilation with a \(3\times 3\) disk structuring element (empirically selected based upon 20 trials), proceeding from the mid-slice image of the LV to the apical and basal slice images, respectively. The same method is applied to the end diastole (ED) and end systole (ES) slices, respectively [19].

  2. A new automatic LV segmentation method based on an H-ELM is developed. To the best of our knowledge, this is the first time the H-ELM, which has been proven to offer better generalization and classification performance with faster learning speed, has been utilized for segmenting LV MRI images.

  3. The average computation time of our LV segmentation is far shorter than that of existing methods.

The remainder of this paper is organized as follows. Section 2 briefly reviews related work on LV segmentation. Section 3 introduces the basic theory of the ELM and the H-ELM. Section 4 describes the image segmentation method in detail. The experimental results of LV segmentation based on the H-ELM are presented in Sect. 5. Section 6 gives the discussion, and Sect. 7 concludes the paper.

2 Related works

In recent years, many methods have been proposed for LV segmentation. They can be classified into two types, according to whether they rely on no or weak prior knowledge, or on strong prior knowledge [5].

2.1 LV segmentation without or with weak prior

LV segmentation with weak or no prior includes image-based methods, pixel (voxel) classification-based methods, region-based methods, edge-based methods, a combined deep-learning and deformable-model method [4], a combined deep-learning and level-set method [11], and deformable models.

Image-based methods include thresholding [20], dynamic programming (DP) [2, 21,22,23,24], and methods based on spatiotemporal continuity and myocardium information [19]. However, Otsu methods [25, 26] can deviate from the optimal threshold, and the DP method sometimes performs poorly in boundary extraction [23, 24].

Pixel (voxel) classification-based methods include statistical models and artificial intelligence-based methods. Statistical models [27, 28] take full advantage of the characteristics of the image gray-level histogram and fit an approximating distribution function, but establishing the distribution function and estimating its parameters are challenging problems. Artificial intelligence-based methods comprise clustering and classification methods. Clustering methods are unsupervised, but clustering may yield non-optimal solutions. Classification methods usually employ artificial neural network (ANN) algorithms such as back-propagation (BP) networks and support vector machines (SVM) [29], but their performance relies on the selection of samples and the extraction of features.

Region-based methods include region growing, splitting, the watershed algorithm, and so on [30]. Region growing depends on the selection of seed points, and the watershed algorithm offers stability and speed, but it is difficult to define stopping criteria for region-based methods.

Edge-based methods exploit gray-level differences between organs to find edges. Their limitation is that performance is affected by noise, spurious edges and weak edges.

Deformable models include snakes [20, 31,32,33], level sets [34,35,36,37], and their variants [38,39,40]. A random active contour scheme for automatic image segmentation has been proposed [41]; it utilizes a parametric shape prior and integrates region and boundary information into a generalized energy function to be minimized. This method, however, requires prior knowledge.

2.2 LV segmentation with strong prior

This category consists of shape-prior-based deformable models, active shape models (ASM) and active appearance models (AAM), and atlas-based methods.

Deformable models with a strong prior adopt the variational framework and modify the energy functional to be minimized by introducing a new term that embeds an anatomical constraint on the deforming contour [42].

The ASM consists of a statistical shape model, called the Point Distribution Model (PDM), and a method for searching for the model in an image [8, 43]. Combinations of the AAM and the ASM have also been used [44].

In atlas-based methods, an atlas can be generated by manually segmenting an image or by integrating information from multiple segmented images of different individuals [45,46,47]. This approach ensures that the segmentation does not leak excessively, but it also limits flexibility.

3 Brief introduction to H-ELM

3.1 ELM theory

The ELM is a learning algorithm whose speed can be thousands of times faster than that of traditional feed-forward network learning algorithms, and which achieves better generalization performance [48].

Given N arbitrary distinct samples \(({\rm X}_{\rm i},{\rm t}_{\rm i}), i=1,\dots ,N,\) where \({\rm X}_{\rm i} =\left[ x_{i1},x_{i2},\dots ,x_{in}\right] ^{\rm T} \in R^n\) and \({\rm t}_{\rm i}=\left[ t_{i1},t_{i2},\dots ,t_{im} \right] ^{\rm T} \in R^m,\) standard SLFNs with M hidden nodes and activation function g(x) are modeled as

$$\begin{aligned} \sum \limits _{i=1}^M \beta _i g_i (\rm {X}_j)=\sum \limits _{i=1}^M\beta _i g(\rm {W}_i \cdot \rm {X}_j + b_i)=o_j,\quad (j=1,\dots ,N) \end{aligned}$$
(1)

where M is the number of hidden layer nodes, \(\rm {W}_i=\left[ w_{i1},w_{i2},\dots ,w_{in} \right]\) is the input weight vector, \(\beta _i=\left[ \beta _{i1},\beta _{i2},\dots ,\beta _{im} \right] ^{\rm T}\) is the output weight vector, and \(b_i\) is the threshold of the ith hidden node. \(\rm {W}_i \cdot \rm {X}_j\) is the inner product of \(\rm {W}_i\) and \(\rm {X}_j\) [49]. The output of the ELM can be written compactly as

$$\begin{aligned} f(\rm {X}_j)=\sum \limits _{i=1}^M \beta _i g(\rm {W}_i \cdot \rm {X}_j+b_i) \end{aligned}$$
(2)

These N equations can be written compactly as \(\rm {H}\beta =\rm {T},\) where

$$\begin{aligned} & \rm{H}(\rm {W}_1,\dots ,\rm {W}_M,b_1,\dots ,b_M,\rm {X}_1,\dots ,\rm {X}_N) \\ & \quad = \begin{bmatrix} g(\rm {W}_1 \cdot \rm {X}_1+b_1)&\dots&g(\rm {W}_M \cdot \rm {X}_1+b_M)\\ \vdots&\dots&\vdots \\ g(\rm {W}_1 \cdot \rm {X}_N+b_1)&\dots&g(\rm {W}_M \cdot \rm {X}_N+b_M) \end{bmatrix} _{N \times M} \end{aligned}$$
(3)
$$\begin{aligned} \beta = \begin{bmatrix} \beta _1^{\rm T}\\ \vdots \\ \beta _M^{\rm T} \end{bmatrix} _{M \times m}\quad \rm{and}\quad {\rm T}= \begin{bmatrix} \rm {t}_1^{\rm T}\\ \vdots \\ \rm {t}_N^{\rm T} \end{bmatrix} _{N \times m} \end{aligned}$$
(4)

As long as the activation function is infinitely differentiable, the input weights and hidden layer biases can be randomly generated [49]. They then require no further adjustment, and training an SLFN is simply equivalent to finding a least-squares solution \(\hat{\beta }\) of the linear system \(\rm {H}\beta =\rm {T}:\)

$$\begin{aligned}&\left\| \rm {H}(\rm {W}_1,\dots ,\rm {W}_M,b_1,\dots ,b_M)\hat{\beta }-\rm {T} \right\| \nonumber \\&\quad =\min \limits _{\beta } \left\| \rm {H}(\rm {W}_1,\dots ,\rm {W}_M, b_1,\dots ,b_M)\beta -\rm {T} \right\| \end{aligned}$$
(5)

If the number M of hidden nodes equals the number N of distinct training samples, the matrix \(\rm {H}\) is square and invertible, and the SLFN can approximate the training samples with zero error. In most cases, however, the number of hidden nodes is much smaller than the number of distinct training samples, \(M\ll N;\) then \(\rm {H}\) is a non-square matrix and there may not exist \(\rm {W}_i,b_i,\beta _i\) such that \(\rm {H} \beta = \rm {T}.\) The smallest-norm least-squares solution of the linear system is then

$$\begin{aligned} \hat{\beta }=\rm {H}^\dagger \rm {T} \end{aligned}$$
(6)

where \(\rm {H}^{\dagger }\) is the Moore–Penrose generalized inverse of matrix \(\rm {H}.\)
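To make this concrete, the following is a minimal NumPy sketch of ELM training and prediction under the equations above; it is an illustrative sketch, not the authors' MATLAB implementation.

```python
import numpy as np

def elm_train(X, T, M, seed=0):
    """X: (N, n) inputs; T: (N, m) targets; M: hidden nodes. Returns (W, b, beta)."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], M))   # random input weights, never tuned
    b = rng.standard_normal(M)                 # random hidden biases
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))     # hidden layer output matrix, Eq. (3)
    beta = np.linalg.pinv(H) @ T               # Moore-Penrose solution, Eq. (6)
    return W, b, beta

def elm_predict(X, W, b, beta):
    """Evaluate the trained SLFN, Eq. (2)."""
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
    return H @ beta
```

Because W and b are fixed once randomly generated, training reduces to the single pseudoinverse of Eq. (6), which is the source of the ELM's speed.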

3.2 H-ELM theory

The H-ELM training architecture is structurally separated into two independent phases: (1) unsupervised hierarchical feature representation and (2) supervised feature classification. In the first phase, a new ELM-based autoencoder is designed to extract multilayer sparse features of the input data; in the second, the original ELM-based regression is applied for the final decision [13].

3.2.1 Unsupervised feature learning

First, the original input data are transformed into an ELM random feature space, which helps exploit hidden information among the training data. Then unsupervised learning is performed to obtain high-level sparse features [50]. The output of the ith hidden layer can be represented mathematically as

$$\begin{aligned} \rm {H}_i= g (\rm {H}_{i-1}\cdot \beta ) \end{aligned}$$
(7)

where \(\rm {H}_i\) denotes the output of the ith hidden layer (\(i\in {[1, \rm {K}]}\)), \(\rm {H}_{i-1}\) is the output of the (i-1)th layer, \(g(\cdot )\) is the activation function of the hidden layers, and \(\beta\) is the output weight matrix. Note that each hidden layer of the H-ELM is independent of the others and acts as a separate feature extractor; the more layers, the more compact the resulting features. Unlike deep learning frameworks, in which all the hidden layers are gathered together as a whole after unsupervised initialization and the whole system is retrained iteratively by BP-based NNs, the H-ELM requires no such fine-tuning. After unsupervised hierarchical training, the outputs of the Kth layer, i.e., \(\rm {H}_K,\) are regarded as the high-level features extracted from the input data. Before classification, they are randomly projected and then used as the inputs of the supervised ELM-based regression to obtain the final classification result. To accelerate learning, the H-ELM framework is constructed on random mapping and makes full use of the universal approximation capability of the ELM in both phases of the framework. According to [13, Theorem 2.1], using randomly mapped features as inputs, the H-ELM can approximate or classify any input data.
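As a concrete illustration of Eq. (7), the following minimal Python sketch propagates features forward through the stacked layers; each weight matrix `beta` is assumed to come from an ELM sparse autoencoder as described in Sect. 3.2.2 and is applied transposed so that the matrix shapes match. This is an illustrative assumption, not the authors' code.

```python
import numpy as np

def helm_features(X, betas):
    """Forward pass of Eq. (7): H_i = g(H_{i-1} . beta_i), with sigmoid g."""
    H = X
    for beta in betas:                            # each beta learned independently,
        H = 1.0 / (1.0 + np.exp(-(H @ beta.T)))   # no BP fine-tuning between layers
    return H                                      # H_K: inputs to the final ELM
```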

3.2.2 ELM sparse autoencoder

As mentioned above, the H-ELM comprises two independent phases: (1) unsupervised training and (2) supervised training. Since the latter phase is performed by the original ELM, we focus here on the former, the autoencoder. The autoencoder aims to approximate the input data by making the reconstructed outputs as similar to the inputs as possible [13].

The universal approximation capability of the ELM is used to construct the autoencoder [51]; meanwhile, a sparsity constraint is imposed on the autoencoder optimization, hence the name ELM sparse autoencoder. Following ELM theory [52], the autoencoder is initialized without fine-tuning. To obtain sparser and more compact features of the inputs, the optimization model of the ELM sparse autoencoder is expressed as follows:

$$\begin{aligned} O_\beta =\rm {argmin}\{\left\| H\beta -X\right\| ^2+\left\| \beta \right\| _{\ell _1}\} \end{aligned}$$
(8)

where \(\rm {X}\) is the input data, \(\rm {H}\) is the random-mapping output, which need not be optimized [15], and \(\beta\) denotes the hidden layer weights. \(\ell _1\) optimization has been proven to be a better solution for data recovery and other applications [53, 54].
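A minimal sketch of one way to solve Eq. (8) is the iterative shrinkage-thresholding algorithm (ISTA), shown below; the original work uses a fast \(\ell _1\) solver, and the 1/2 scaling of the data term here is an implementation convenience, not taken from the paper.

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of the l1 norm."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def elm_sparse_autoencoder(X, M, lam=1e-3, iters=200, seed=0):
    """ISTA for beta = argmin (1/2)||H beta - X||^2 + lam*||beta||_1, cf. Eq. (8)."""
    rng = np.random.default_rng(seed)
    N, n = X.shape
    W = rng.standard_normal((n, M))            # random mapping: never optimized [15]
    b = rng.standard_normal(M)
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))     # random feature space, shape (N, M)
    L = np.linalg.norm(H, 2) ** 2              # Lipschitz constant of the gradient
    beta = np.zeros((M, n))
    for _ in range(iters):
        grad = H.T @ (H @ beta - X)            # gradient of the data term
        beta = soft_threshold(beta - grad / L, lam / L)
    return beta                                # sparse weights; applied as beta.T in Eq. (7)
```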

4 Methods

The whole segmentation algorithm comprises pre-processing the training data, training the H-ELM, pre-processing the testing data, classification, and post-processing, as shown in Fig. 1.

Fig. 1

Workflow of the proposed segmentation algorithm. a The workflow of the training procedure. b The workflow of the testing procedure

4.1 Pre-processing training data

The procedure of pre-processing training data consists of the following steps (as shown in Fig. 1a):

  1. Based on image clarity and on whether varying amounts of endocardial trabeculae and papillary muscles were included, 186 images from 10 cardiac MRI cases were selected as sample images, for which the ground truth had been acquired.

  2. For each sample image, all pixels within the LV region of the ground truth were selected and labeled as 1.

  3. The LV region was extended by morphological dilation with a \(5\times 5\) disk structuring element (empirically selected based upon 20 trials); all pixels in the new region adjacent to the LV of the ground truth were then selected and labeled as 0.

  4. Four kinds of features were extracted for each pixel (see the sketch after this list): 3-dimensional gray-level features (gray-level value, gray mean value and gray median); 20-dimensional gray-level co-occurrence matrix features [55] (energy, contrast, correlation, entropy and inverse moment in four directions) computed over an \(11\times 11\) window; 9-dimensional histogram-of-oriented-gradients features calculated within \(17 \times 17\) cell blocks with nine histogram bins, similar to [56]; and 18-dimensional local binary pattern features [57, 58] over a \(5\times 5\) neighborhood chosen empirically, amounting to 50-dimensional features in total [59].

  5. The feature vectors of all selected pixels of an image were concatenated to generate a feature matrix.

  6. Steps 2–5 were applied to every sample image, and all the resulting feature matrices were merged into a single feature matrix, each value of which was normalized to [0, 1].
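The following Python sketch illustrates the feature extraction of step 4 for a single pixel using scikit-image. The window sizes and feature counts follow the text, but the exact statistics and settings (e.g., taking the GLCM "inverse" feature as homogeneity, and a 16-neighbor uniform LBP giving exactly 18 codes) are our assumptions, not the authors' MATLAB code.

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops, hog, local_binary_pattern

def pixel_features(img, r, c):
    """50-dimensional feature vector for pixel (r, c); img is a uint8 grayscale
    image and (r, c) is assumed to lie at least 8 pixels from the border."""
    feats = []
    # 3 gray-level features from an 11x11 neighborhood
    patch = img[r - 5:r + 6, c - 5:c + 6].astype(float)
    feats += [float(img[r, c]), patch.mean(), float(np.median(patch))]
    # 20 GLCM features: 5 statistics x 4 directions over the same 11x11 window
    glcm = graycomatrix(img[r - 5:r + 6, c - 5:c + 6], distances=[1],
                        angles=[0, np.pi / 4, np.pi / 2, 3 * np.pi / 4],
                        levels=256, symmetric=True, normed=True)
    for prop in ('energy', 'contrast', 'correlation', 'homogeneity'):
        feats += list(graycoprops(glcm, prop).ravel())  # homogeneity ~ inverse moment
    p = glcm[:, :, 0, :]                                # entropy is not in graycoprops
    feats += list(-np.sum(p * np.log2(p + 1e-12), axis=(0, 1)))
    # 9 HOG features: a single 17x17 cell with nine orientation bins
    cell = img[r - 8:r + 9, c - 8:c + 9].astype(float)
    feats += list(hog(cell, orientations=9, pixels_per_cell=(17, 17),
                      cells_per_block=(1, 1)))
    # 18 LBP features: uniform LBP with 16 neighbors has 18 distinct codes;
    # the histogram is taken over the central 5x5 neighborhood
    lbp = local_binary_pattern(img[r - 6:r + 7, c - 6:c + 7], P=16, R=2,
                               method='uniform')
    feats += list(np.histogram(lbp[4:9, 4:9], bins=18, range=(0, 18))[0])
    return np.asarray(feats, dtype=float)
```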

4.2 Training H-ELM

Training the H-ELM aims to find optimal parameters using the obtained feature matrix. The \(\ell _1\) penalty of the last-layer ELM was \(2^{-50}\) and the scaling factor was \(S =0.8.\) The ELM kernel used in the proposed algorithm was the sigmoid function and the number of hidden nodes was 100; these values were selected empirically based upon 20 trials, owing to the randomness of the input weights and hidden layer biases.
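Combining the sketches from Sects. 3.1 and 3.2.2, the training phase could look roughly as follows. The hyperparameters (penalty \(2^{-50}\), scaling factor 0.8, sigmoid activation, 100 hidden nodes) follow the text, while the two-layer depth and the names `feature_matrix` and `labels_one_hot` are illustrative assumptions.

```python
import numpy as np

# feature_matrix: (num_pixels, 50) training features, normalized to [0, 1] (Sect. 4.1)
# labels_one_hot: (num_pixels, 2) one-hot labels, LV vs. non-LV  (hypothetical names)
X_train, T_train = feature_matrix, labels_one_hot

betas = []
H = 0.8 * X_train                                    # scaling factor S = 0.8
for _ in range(2):                                   # assumed two autoencoder layers
    beta = elm_sparse_autoencoder(H, M=100, lam=2**-50)
    H = 1.0 / (1.0 + np.exp(-(H @ beta.T)))          # Eq. (7)
    betas.append(beta)
W, b, beta_out = elm_train(H, T_train, M=100)        # final supervised ELM
```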

Fig. 2

The images within the procedure of segmentation. a The original cardiac MRI mid-slice image. b A circle centered on the MRI image with a radius of 50 pixels. c The binary image of the original image. d The initial location image of the LV. e The final location image of the LV after dilation; the red contour includes almost all pixels of the LV. f The segmentation result, with LV pixels in red

4.3 Pre-processing testing data

The procedure of pre-processing testing data consists of the following steps (as shown in Fig. 1b):

  1. At the same temporal phase, the LV in the mid-slice image (Fig. 2a) is always the biggest and roundest, so it was segmented first. To reduce computational complexity and time [4], a circle centered on the MRI image with a radius of 50 pixels was drawn, and all pixels outside the circle were set to 0 (Fig. 2b). A fitting threshold was then found using the Otsu method, and the image (Fig. 2b) was converted into a binary image (Fig. 2c).

  2. The roundness, area and centroid of each object were calculated; based on an overall consideration of these properties, the LV was located approximately (Fig. 2d; a code sketch of steps 1–2 follows this list).

  3. As shown in Fig. 2d, owing to intensity inhomogeneity and the presence of endocardial trabeculae and papillary muscles in the LV cavity, not all LV contours are satisfactory; therefore, the LV was extended by the same method as in step 3 of Sect. 4.1. The extended region included almost all pixels of the LV and was regarded as the testing pixel set, as shown in Fig. 2e.

  4. By the same method as in step 4 of Sect. 4.1, 50-dimensional features were extracted for each pixel of the testing pixel set to generate a feature matrix.
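The sketch below illustrates steps 1–2 in Python with scikit-image. The circle radius and the use of Otsu thresholding follow the text, while the scoring that trades off roundness, area and centroid distance is an assumption standing in for the paper's "overall consideration".

```python
import numpy as np
from skimage.filters import threshold_otsu
from skimage.measure import label, regionprops

def locate_lv(img, radius=50):
    """Approximate LV location in a mid-slice image (Fig. 2a-d)."""
    h, w = img.shape
    yy, xx = np.ogrid[:h, :w]
    circle = (yy - h / 2) ** 2 + (xx - w / 2) ** 2 <= radius ** 2
    masked = np.where(circle, img, 0)                 # Fig. 2b
    binary = masked > threshold_otsu(masked[circle])  # Fig. 2c
    best, best_score = None, -np.inf
    for region in regionprops(label(binary)):
        # roundness = 4*pi*area / perimeter^2 (1 for a perfect disk)
        roundness = 4 * np.pi * region.area / (region.perimeter ** 2 + 1e-12)
        dist = np.hypot(region.centroid[0] - h / 2, region.centroid[1] - w / 2)
        score = roundness + 0.01 * region.area - 0.02 * dist  # assumed weighting
        if score > best_score:
            best, best_score = region, score
    return best                                       # approximate LV object, Fig. 2d
```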

4.4 Classification

The feature matrix was input into the trained H-ELM, and all pixels were classified into two classes: one belonging to the LV and the other to the non-LV area.
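Continuing the sketches above, classification of the test pixels could look as follows; `roi_pixels` (the coordinates inside the dilated location mask) and the reuse of `pixel_features`, `betas`, `W`, `b` and `beta_out` from the earlier sketches are illustrative assumptions.

```python
import numpy as np

# roi_pixels: list of (row, col) coordinates inside the dilated LV location mask
feats = np.vstack([pixel_features(img, r, c) for (r, c) in roi_pixels])
H = 0.8 * feats                                   # same scaling as in training
for beta in betas:
    H = 1.0 / (1.0 + np.exp(-(H @ beta.T)))       # hierarchical features, Eq. (7)
scores = elm_predict(H, W, b, beta_out)
is_lv = scores.argmax(axis=1) == 1                # class 1 = LV, class 0 = non-LV
```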

4.5 Post-processing

The contour of the LV was depicted and smoothed using open-close operations from mathematical morphology; the segmentation result is shown in Fig. 2f. To segment the adjacent slice image in the superior and/or inferior direction, the contour of the LV was extended by the same method as in step 3 of Sect. 4.1, and the extended LV was used as a binary mask to locate the LV in the next image. The next image was then segmented by the methods of Sects. 4.3, 4.4 and 4.5. That is, the contour region derived from the preceding slice image was dilated and used to locate the LV of the next slice image up to the apical slice, and likewise from the mid-slice to the basal slice. In the dataset, all slices of each case were divided into two classes, the end diastole (ED) and end systole (ES) slices, and the above method was applied to each class separately.
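A minimal sketch of this post-processing and slice-to-slice propagation using scipy.ndimage follows; the square structuring elements stand in for the \(3\times 3\) and \(5\times 5\) disks mentioned in the text, which is an approximation on our part.

```python
import numpy as np
from scipy import ndimage

disk3 = np.ones((3, 3), dtype=bool)   # square stand-in for the 3x3 disk
disk5 = np.ones((5, 5), dtype=bool)   # square stand-in for the 5x5 disk

def postprocess(mask):
    """Smooth the LV contour with morphological open-close (Fig. 2f)."""
    mask = ndimage.binary_opening(mask, structure=disk3)
    return ndimage.binary_closing(mask, structure=disk3)

def next_slice_roi(prev_mask):
    """Dilate the previous result into the search mask for the adjacent slice."""
    return ndimage.binary_dilation(prev_mask, structure=disk5)

def apex_fallback(prev_mask):
    """If classification fails near the apex (segmented area == 0), fall back
    to the eroded previous segmentation (see Sect. 6)."""
    return ndimage.binary_erosion(prev_mask, structure=disk3)
```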

5 Results

5.1 Data set

In this paper, the dataset consists of cardiac cine MRI short-axis images from the General Hospital of Shenyang Military. There are a total of 30 cases (19 males, 11 females, mean age 48.4\(\pm {15.6}\) years), including heart failure, coronary heart disease, hypertrophy and normal cases. Cine CMR image data were acquired with a 2D SSFP pulse sequence on 1.5 T (General Electric) scanners. In each case, the LV was imaged in 8–16 short-axis slices from the level of the mitral valve annulus through the LV apex. Typical CMR parameters were: repetition time (TR) 3.5 ms, echo time (TE) 1.6 ms, flip angle 60°, matrix size \(256\times 256,\) image dimensions \(256\times 256,\) receiver bandwidth 125 kHz, FOV 360 mm, slice thickness 6–8 mm, slice gap 2–4 mm.

5.2 Evaluation measures

In this paper, several measures were used in our experiments to test and compare the segmentation results of our proposed method, the level set method, the SVM method and Hu's method [27]: mean absolute deviation (Mad), maximum absolute deviation (Maxd), Dice similarity coefficient (Dsc) and segmentation time.
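For reference, these measures are commonly defined as follows (standard definitions, not restated in the original):

$$\begin{aligned} \rm{Dsc}=\frac{2\left| A\cap B \right| }{\left| A \right| +\left| B \right| },\quad \rm{Mad}=\frac{1}{N}\sum \limits _{i=1}^{N} d(p_i, C),\quad \rm{Maxd}=\max \limits _{i} d(p_i, C) \end{aligned}$$

where A and B denote the automatically segmented and ground-truth LV regions, \(p_i\) (\(i=1,\dots ,N\)) are the points of the automatic contour, C is the ground-truth contour, and d(p, C) is the minimum distance from point p to contour C.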

5.3 Performance evaluation

In this section, the performance of LV segmentation based on an H-ELM was studied by evaluating its efficiency and effectiveness. The algorithm was coded in MATLAB 2014a. All experiments were conducted on a 2.0-GHz PC with 16 GB of memory running Windows 7. The same pre-processing and post-processing methods were used for the proposed method and the SVM method, and a similar location method was used for the level set method [36] and Hu's method [27].

Table 1 lists the Mad, Maxd and Dsc of our proposed method, the level set, the SVM and Hu's method over 20 patients. From Table 1, the mean Mad of images segmented by our proposed method was about 67.9, 81.3 and 98.7% of that of the level set, the SVM and Hu's method, respectively, and the Mad standard deviation (Std) of our proposed method was the lowest of the four methods. The mean Maxd of our proposed method was about 63.5, 77.3 and 98.0% of that of the level set, the SVM and Hu's method, respectively, and its Maxd Std was likewise the lowest of the four. The mean Dsc of our proposed method was about 13.7, 9.3 and 0.5% higher than that of the level set, the SVM and Hu's method, respectively, and its Dsc Std was again the lowest. Similar results are seen in Table 2: the mean speed of our proposed method was about 38.3, 6.7 and 23.8 times faster than that of the level set, the SVM and Hu's method, respectively, and the Std of its segmentation time was the lowest of the four methods.

Fig. 3

Box plots of evaluation measures. a Box plots of Mad for our proposed method, the level set, the SVM and Hu's method. b Box plots of Maxd for our proposed method, the level set, the SVM and Hu's method. c Box plots of Dsc for our proposed method, the level set, the SVM and Hu's method. d Box plots of segmentation time for our proposed method, the level set, the SVM and Hu's method; vertical coordinates are logarithmic

Table 1 Mean segmentation accuracy of the H-ELM method (H), the level set method (L), the SVM method (S) and Hu's method (U) (a total of 20 patients)
Table 2 Mean segmentation time of the H-ELM method (H), the level set method (L), the SVM method (S) and Hu’s method (U) (a total of 20 patients)

To further evaluate the performance of our proposed method, the local distributions of segmentation errors and the similarity between the segmentation results and the ground truth are illustrated in Fig. 3. The boxplots indicate the median and the lower and upper quartiles of the Mad, Maxd, Dsc and time of our proposed method, the level set, the SVM and Hu's method. It is notable from Fig. 3 that our proposed method outperformed the level set, the SVM and Hu's method, obtaining higher Dsc and lower Mad, Maxd and time.

6 Discussion

In this study, an automatic LV segmentation method based on an H-ELM model is developed and validated. The whole framework is divided into two components: training an H-ELM, and segmenting LVs from cardiac MRI images with the trained H-ELM. The former is composed of self-taught feature extraction and supervised feature classification [13]: the self-taught feature extraction yields more compact and meaningful feature representations than the original ELM; the hierarchically encoded outputs are then randomly projected to achieve better generalization with faster learning speed; the supervised feature classification is performed by an original ELM; and the hidden layers of this framework are trained in a forward manner. The latter component consists of LV location followed by LV segmentation. Owing to the complexity of cardiac MRI, LV location is a prerequisite for H-ELM-based LV segmentation, and the segmentation result is directly affected by the location accuracy. First, the mid-slice, in which the LV is biggest and roundest, is located using a fitting threshold from the Otsu method. Second, the LV is segmented by the trained H-ELM; the segmentation result is dilated with a \(5\times 5\) disk structuring element (empirically selected), and the contour of the dilated region is used as a binary mask to locate the LV of the adjacent image, from the mid-slice to the apical slice and to the basal slice, respectively. The same method is performed for the end diastole (ED) and end systole (ES) slices, respectively. When the segmentation approaches the most apical slice, an unsuccessful classification leads to failure not only of the current segmentation but also of subsequent segmentations, for example when the area of the currently segmented LV equals 0. Accordingly, the current segmentation result is then replaced by the previous segmentation result eroded with a \(3\times 3\) disk structuring element (empirically selected). The previous segmentation result is utilized as the segmentation mask of the current slice, which may prevent leakage into surrounding areas to some extent. In a word, the segmentation result of the previous slice is crucial to the segmentation of the current slice. Because all parameters in our proposed method are empirically selected, they cannot be guaranteed to be optimal for feature extraction and classification in all cases.

In the future, several points can be explored to improve the LV segmentation results. First, the LV should be segmented from a 3D point of view, which may make the best of the features of each voxel while reducing problem complexity and computational time. Second, all parameters of the H-ELM are currently selected empirically; ideally, a systematic analytic approach is needed to obtain optimal values for all parameters. Finally, as the training data grow, all parameters of the H-ELM could be updated incrementally to further produce more accurate segmentation results.

7 Conclusions

In summary, a new method for automatic LV segmentation from cardiac MRI images is proposed. The method takes into account the intensity inhomogeneity that often occurs in the LV cavity and causes many difficulties in image segmentation, as well as cardiac MRI image scaling, shifts and spatiotemporal continuity. Experimental results show that our proposed method outperforms the level set, the SVM and Hu's method. The results of this study demonstrate that our proposed method is fast, robust, efficient and satisfactory for LV segmentation.