Keywords

1 Introduction

Despite continuous progresses both in clinical research and practice, cardiovascular diseases (CVDs) remain the leading cause of mortality and morbidity in the world [1]. In this context, cardiac imaging such as cine-MRI (Magnetic Resonance Imaging) is expected to play an important role due to its ability to quantify in detail structural and functional properties of the beating heart [2]. However, visual assessment of CVDs using cine-MRI remains challenging and labor-intensive due to the complexity of these diseases, in particular when the structural and functional disorders are subtle [3]. Moreover, quantitative assessment through existing clinical indices such as volumetric measures, ejection fraction, and thickening measures can be suboptimal for borderline cases. Consequently, more advanced automated techniques are needed to exploit the richness of the cardiac data to estimate diagnosis, as well as severity of the phenotype which often is associated with prognosis. Over the years, several methods have been proposed based on eigendecomposition of the moving cardiac shapes [4,5,6,7,8,9,10] but they have mostly used geometrical information.

In this paper, we propose instead a radiomics approach to automated image-based diagnosis of complex CVDs. Radiomics is the task of calculating a large number of imaging descriptors from delineated images, which has been widely used in cancer imaging [11]. In the context of cine-MRI, radiomics describing changes in image appearence due to CVDs have not been exploited. The proposed method estimates a large number of radiomic features including statistical, shape, and textural descriptors and assess their ability to discriminate between different CVDs automtically and robustly within a machine learning framework based on SVMs. Training and cross-validation are carried out in this study based on a database of 100 cine-MRI cases corrresponding to five different subclasses, from the ACDC challenge of MICCAI 2017.

2 Method

2.1 Data Description

This study was conducted in the context of the MICCAI 2017 challenge on Automated Cardiac Diagnosis (the ACDC challenge). The database consists of 100 cases comprising cine-MRI data, height and weight information, as well as the diastolic and systolic phase instants for each subject. Five subclasses were included, namely (see examples in Fig. 1):

  1. (1)

    Normal subjects (NOR).

  2. (2)

    Patients with dilated cardiomyopathy (DCM).

  3. (3)

    Hypertrophic cardiomyopathy (HCM).

  4. (4)

    Abnormal right ventricle (RV)

  5. (5)

    Myocardial infarction (MINF).

The 100 images were acquired at the University Hospital of Dijon (France) by using 1.5T or 3T MRI scans (Siemens Medical Solutions, Germany) with the following parameters depending on the examination: image sequence = SSFP cine-MRI, slice thickness = 5 mm or 8 mm, inter-slice gaps = 5 mm or 10 mm, spatial resolution = 1.37 to 1.68 mm2/pixel, number of frames = 28 to 40. This training dataset was then manually segmented for the left ventricle, myocardium and right ventricle by an experienced manual observer at both end-diastole and end-systole time frames. The test data of the challenge will be segmented as explained in the next section.

Fig. 1.
figure 1

Examples of cine-MRI images for the four abnormalities classified in this study (Top: ED, bottom: ES)

2.2 Semi-automatic Segmentation

To segment the test datasets semi-automatically, we propose an atlas-based approach using the publicly available cardiac atlas [12]. To this end, we firstly define manually six anatomical landmarks on each cine-MRI case, more specifically at:

  1. (1)

    Mid-ventricular slice: RV insertion point next to the liver.

  2. (2)

    Mid-ventricular slice: A point on the RV free wall.

  3. (3)

    Mid-ventricular slice: RV insertion point next to the lung.

  4. (4)

    Mid-ventricular slice: A point on the LV free wall.

  5. (5)

    Apical slice: Apex.

  6. (6)

    Basal slice: Center of the base.

We then use the atlas-based technique described in [13] to extract the cardiac structures of interest, namely the LV, RV and myocardium. This is followed by user-friendly manual correction of the segmented contours to correct for potential errors using the ITK-SNAP toolFootnote 1. Note that this segmentation approach will be only used to segment the test data of the challenge and that this paper focuses only on the classification part of the ACDC challenge.

2.3 Radiomics Features for Cardiac Diagnosis

As mentioned in the introduction, most existing techniques included in clinical practice use shape and motion indices such as ejection fraction, ventricular volumes and myocardial thickening to classify the subjects under investigation. This means that a lot of information produced by the image is lost during this operation, in particular imaging evidence in relation to the tissue appearence in the blood pool, myocardium and right ventricle, as well as more complex morphological and functional information. But it is unclear which advanced indices could contribute to improved classification of cardiovascular cases. To address these issues, we propose a radiomics approach for computer-aided diagosis in cine-MRI. Radiomic features have been used so far mostly for cancer image quantification [11, 13], such as for the estimation of patient prognosis and treatment response based on the characteristics of the tumors as encoded by the image data. The suffix -omics attached to radiology refers to the use of large amounts of imaging features, from which the most relevant set can be selected for the specific task in question. In this paper, we estimate a large pool of new radiomic features from the segmented cine-MRI images, which will be then analyzed to extract the most powerful features for classification.

In other words, we augment the set of indices to be leveraged for cardiac diagnosis by considering more complex shape/motion radiomic features, as well as advanced textural radiomic features. Specifically, we use 567 features (height, weight, ED-ES duration, plus 188 features per structure: LV, MYO, RV at ED and ES) based on five categories using the PyRadiomics library [14], namely:

  1. (1)

    Shape based (Volume, surface area, sphericity, compactness, diameters, elongation, etc.).

  2. (2)

    Intensity first order statistics (e.g. mean, standard deviation, energy, entropy, etc.).

  3. (3)

    Gray level cooccurence matrix (GLCM) (autocorrelation, contrast, dissimilarity, homogeneity, inverse difference moment, maximum probability, etc.).

  4. (4)

    Gray level run length matrix (GLRLM) (short/long run emphasis, gray-level/run-length non-uniformity, etc.).

  5. (5)

    Gray level size zone matrix (GLSZM) (smal/large area emphasis, zone percentage, etc.).

Note that the first group of radiomics consist of pure shape information, while the four remaining groups are intensity-based features, describing the intensity variations inside the cardiac structures, as well as the complexity and repeatability of the tissue texture. Our hypothesis is that some of these radiomic values will be modified in the presence of cardiac abnormality in a way that is unique to each subgroup of patients when compared to normal individuals.

2.4 Classification Method

The next step of our approach is to combine the heterogeneous radiomic features within a classification scheme that will learn to discriminate between the different patient subgroups and normal individuals. In this paper, we choose to use Support Vector Machines (SVMs) [15] due to well-known performance when classifying image data, in particular in the case of small sample size. An SVM model corresponds to a transformation of the examples to a hyperspace where a good separation is achieved by the hyperplanes that have the largest distance to the nearest training-data point of any class (so-called functional margin). This ensures that the examples belonging to the different classes are separated as clearly as possible. New cases are then mapped onto that same hyperspace and classified based on their location with respect to the hyperplanes separating the different classes. As such, it is suitable for cardiovascular disease classification as the challenge is precisely to identify subtle changes and differences between normal cardiac characteristics and those of pathological cases.

2.5 Radiomic Feature Selection

Due to the large number of radiomic features, radiomic-based classification can easily suffer from over-fitting due to the limited number of examples that can be realistically collected for training. As a result, it is of paramount importance to identify a smaller subset of radiomic features that is optimal for the cardiac diagnosis task. In this paper, we do this by using sequential forward feature selection [16], through which radiomic features will be added to the final subset one at a time until the classification becomes negatively impacted as a result of adding new radiomic features.

3 Results

For all experiments, we used leave-one-out tests to evaluate the proposed method and measured accuracy as proportion correct classifications. Firstly, we evaluated the accuracy of the CVD classifications by using only intensity radiomics or only shape radiomic features. For intensity radiomics, we obtained a maximal accuracy of 0.98 (two misclassifications) when using 13 optimal features. For shape radiomics, we obtained an accuracy of 1.0 (all cases correctly classified) but by using a total of 32 features. Subsequently, we combined intensity, shape and patient information (height and weight) all together and the forward feature selection results are provided in Fig. 2. It can be seen that the best single feature only achieves a 0.62 accuracy. However, after adding three selected features to the classification task, the accuracy is improved beyond the 0.90 accuracy line to reach 0.91 and even 0.94 after five selected features. A maximum accuracy of 1.0 (all cases correctly classified) is reached by combining 10 features only when combining intensity, shape and patient information.

Fig. 2.
figure 2

Accuracy of the proposed CVD classification as a function of the number of radiomic features trained in the model.

The shape of the curve in Fig. 2 indicates the importance of feature selection, as after reaching the maximum accuracy, incorporating additionnal features leads to model over-fitting and reduced accuracy. While these preliminary results are obtained on a small and controlled study, they are encouraging. In comparison, we obtained an accuracy of 0.84 when using all radiomic features and 0.86 when combining conventional clinical indices only such as ejection fraction, cavity volumes, and body-mass index.

To understand the behavior of the model, we evaluated the precision, recall and confusion matrix after selecting five features, at an accuracy of 0.94. It can be seen that the least accurately detected classes are for the HCM and DCM patients. However, after adding all optimal features, we finally reach a maximal accuracy of 1.0 (Fig. 2).

Table 1. Precision, recall and confusion matrix obtained by using the five first optimal radiomic features at accuracy of 0.94.

This selected list of optimal features is given in Table 2, which include one conventional shape index (volume), seven advanced shape radiomic features (e.g. compactness, least axis, surface area), one patient information (height) and one textural radiomic feature (GLCM inverse difference). This shows how mutiple radiomics of different nature can be complimentary to each other, which enables to identify correctly all cases. Also, the table shows that the features are well distributed among the three cardiac structures (LV, myocardium, RV), as well as for the ED and ES frames.

To show the relevance of the selected features, we have added to the table the accuracy results by removing each feature from the SVM model (column W/O). It can be seen that the removal of each of these features negatively affects the final accuracy, which is reduced from 1.0 to 0.88 by removing the Surface Area to Volume feature, and to 0.96 by removing the Inverse Difference Intensity (GLCM) feature. This shows how these features can play an role in discriminating some of the challenging and ambiguous cases.

To further show the relevance of combining all of the selected, we have also added to the table in the last column the accuracy by using a single radiomic feature. It can be seen that their on their own, these features do not enable a satisfactory classification, with the accuracy values vaying between 0.18 (Height) and 0.62 (Volume). In particular, the Height variable is not capable of producing any meaninful classification on its own, but contributes to the overall accuracy of the multi-radiomic model by normalizing with respect to size.

Table 2. List of 10 selected radiomic features as selected by the proposed technique for CVD classification. W/O: Accuracy without the feature. Alone: Accuracy using only this feature.

4 Conclusions

In this paper, we proposed the use of large amounts of radiomic features, integrating advanced shape and textural descriptors, to predict cardiac subgroups. The obtained results suggest that radiomics are indeed capable to encode alterations in the anatomy and tissues of the affected cardiac structures. Furthermore, the feature selection results indicate that shape and intensity descriptors compliment each other and their combinations enable to enhance the prediction power of the system in particular for uncertain cases situated close to the boundary between two disease classes. However, the high accuracy of 1.0 suggests that further evaluations with additional datasets are required to test this radiomics model in larger and more variable data samples. In particular, inter-subject variability due to semi-automatic segmentation of the boundaries will need to be assessed. Future work also includes the testing of additional radiomic features (e.g. fractals, wavelets) and clincal interpretation of the features and results.