Introduction

Cardiac amyloidosis (CA) is caused by the accumulation of amyloid fibrils in cardiac tissue, which induces diastolic dysfunction [1,2,3]. Immunoglobulin light chain (AL) amyloidosis is the most common type of CA [4]. Up to 50% of patients with AL amyloidosis have cardiac involvement, which is the leading cause of morbidity and mortality in these patients [5,6,7]. If untreated, the median survival time is less than 6 months [2, 8]. Therefore, timely monitoring of indicators related to the prognosis of CA is essential for prognostic improvement of CA patients.

Clinically, Mayo stage based on N-terminal pro B-type natriuretic peptide (NT-proBNP, ≥ 1800 pg/mL) and cardiac troponin T (cTNT, ≥ 0.025 ng/mL) and difference in serum-free light chains (≥ 18 mg/dL) has been prognostically validated for risk stratification of CA patients [9, 10]. However, these serum biomarkers lack specificity and are influenced by renal impairment [6]. Thus, to explore other prognostic indicators of CA becomes an important task. With the rapid development of cardiac magnetic resonance (CMR) technology, the prognostic indicators of CA have increased, significantly improving the prediction power of adverse events in CA. CMR is an accurate and highly reproducible measurement that offers comprehensive cardiac structure and function evaluation as well as myocardial tissue characterization [11]. CMR with the late gadolinium enhancement (LGE) technique is considered the imaging modality of choice for the identification of CA with high diagnostic and prognostic value [12,13,14]. However, because the identification of LGE is based on the difference between abnormal and normal tissues, it may underestimate the extent of disease progression when lesions are not significantly enhanced in early stages or are diffusely enhanced in late stages, thus affecting the assessment of prognosis [11].

Since LGE is closely related to clinical, morphological, and functional markers [14], there is an urgent need to mine more prognostic information from LGE images. Radiomics can reveal vital information invisible to the human eye by extracting quantitative information from imaging data [15,16,17]. Previous studies [18,19,20] have reported the potential of CMR radiomics in predicting important clinical outcomes in cardiovascular disease, such as providing incremental information about adverse cardiac events in patients with myocardial infarction [21]. However, no prognostic role of radiomics in CA has been explored. We hypothesize that radiomics analysis will abstract important invisible information of LGE to predict the future outcome of CA patients. Therefore, the aim of this exploratory study was to extract radiomics features of LGE-CMR images of CA patients in order to assess their potential prognostic performance and further risk stratify patients.

Methods

Patients

We collected 200 patients with AL amyloidosis undergoing CMR between 2012 and 2020 from 3 tertiary medical centers of three different regions of China. All patients had histological evidence of systemic AL amyloidosis based on positive Congo red stain and light chain deposition. ATTR form of the disease was not included. Only patients with myocardial involvement were included in this study, while patients without myocardial involvement were excluded. CA was managed and diagnosed based on the current guidelines [22, 23]. Cardiac involvement was confirmed according to positive extracardiac biopsy accompanied either by echocardiography evidence of left ventricular (LV) wall thickening ≥ 12 mm in the absence of any other known causes, or NT-proBNP > 332 ng/L in the absence of renal insufficiency and atrial fibrillation. Severity of CA was classified into four stages according to the revised Mayo stage published in 2012 [9]. The Mayo stage details and Mayo stage of each patient in different centers and different groups are shown in Supplemental Appendix, Supplemental Table 1, and Table 1.

Table 1 Clinical characteristics, CMR parameters, and radiomics parameter in the CA patients

CMR acquisition and LGE analysis

The details of CMR scanners and sequence are provided in Supplemental Appendix. Myocardial enhancement was evaluated using a semi-quantitative score system (QALE) [24]. Each LV level is scored according to the degree of LGE. The range for each level in each patient is from 0 (no detectable LGE) to 4 (circumferential and transmural LV LGE), plus 6 if RV LGE is present [24, 25].

LV volumes, mass, ejection fraction (EF), and LGE extent were quantified using CVI 42 version 5.13.5 (Circle Cardiovascular Imaging lnc). For LGE quantification, a gray-scale threshold of 5 standard deviations (SD) above the mean signal intensity of myocardium was used [26]. Values of quantitative measurement of LGE were reported as a percentage of the LV enhanced area, the total enhanced mass, and the total enhanced volume.

Radiomics analysis

Segmentation

LGE segmentation, feature extraction, and model building were based on Dr. Wise Multimodal Research Platform (https://keyan.deepwise.com,V2.0).

Segmentation was performed on the base-to-apex slices of the short-axis LGE images by one cardiovascular radiologist (X.Y.Z. with 3 years of experience in CMR). For testing the reproducibility, image segmentation was repeated in 50 patients by another reader (J.H.L. with 3 years of experience in CMR) for interobserver analysis.

Feature extraction and dimension reduction

Radiomics features were extracted on the short-axis images for the entire LV, basal, middle, and apical portions, respectively. Image preprocessing and radiomics feature calculation methods are seen in Supplemental Appendix. To reduce the high dimensionality of the extracted texture features, the interobserver reproducibility was evaluated for all features using intra-class correlation coefficient (ICC), and those with high stability (ICC > 0.80) were kept for further analysis. To increase the robustness of the features, the included features were subjected to correlation analysis. When the linear correlation coefficient between any two independent variables on the training set was greater than 0.90, one of the features was removed to alleviate the redundancy between the features, making each feature independent of each other. Subsequently, the features correlated with the endpoint event were selected for further dimensionality reduction using a one-way analysis of variance feature screening method, and 5% of features with significant differences were retained. Logistic regression algorithm conducted by fivefold cross-validation was then used to construct a radiomics signature. A radiomics signature was created by a linear combination of selected features weighted by their respective coefficients, and the corresponding radiomics score (Rad-score) was calculated for each patient. The radiomics workflow is shown in Fig. 1 and Supplemental Appendix.

Fig. 1
figure 1

The flow chart of LV segmentation and quantitative texture feature analysis

Clinical outcomes

Follow-up of CA patients was completed independently in each center. All patients were followed up via clinic visit or telephone interview. The primary outcome was defined as all-cause mortality. Duration of follow-up was determined from the CMR study date to the occurrence of an endpoint or the date of the last clinical follow-up. Unless the outcome was death from any cause, patients were censored at the end of the study follow-up time. If patients were lost to follow-up, their last clinic visit record date was used.

Statistical analysis

Statistical analysis was performed using SPSS 26.0 (IBM Inc), MedCalc 19.6 (MedCalc Software Ltd), and X-tile 3.6.1(X-tile software). All continuous data were given as mean ± SD or median and 25–75th percentiles, while categorical data were presented as frequencies. NT-proBNP and troponin T were log transformed to achieve normality. Numerical variables were compared by means of the t-test or Mann–Whitney U test, and categorical variables were compared using the χ2 test.

The performance of CMR parameters and radiomics in predicting endpoint events was evaluated using the area under the curve (AUC) and compared using the DeLong test. The optimal cut-point for continuous prognostic markers was obtained by using X-tile software to dichotomize feature. Survival curves for different Rad-score groups were depicted using the Kaplan–Meier method and compared using the log-rank test. The clinical utility of the model was evaluated with decision curve analysis. p < 0.05 was considered to indicate statistical significance.

Results

Patients

A total of 80 CA-negative patients were excluded, and 120 CA-positive patients (79 patients from Jinling Hospital, 19 patients from Peking Union Medical College Hospital, and 22 patients from the Second West China Hospital) were finally included. Forty endpoint events occurred during a median follow-up of 12.9 months (interquartile range 6.8, 22.4 months), including 1 sudden cardiac death, 1 heart failure, and 38 deaths of unknown cause. CA patients were divided into event-positive and event-negative groups according to the presence or absence of all-cause death. Patients in the event-positive group had higher levels of cTNT and NT-proBNP (all p < 0.001); thus, Mayo stage based on the above markers was also significantly higher. The two groups did not show statistically significant differences in the quantitative LGE parameters (total enhancement volume, total enhancement mass, and LGE extent). However, Rad-score in the event-positive group showed significantly higher at the base-LV than that of the event-negative group (0.64 vs. 0.39, p < 0.001). Other patients’ characteristics and CMR parameters are reported in Table 1.

Radiomics features selection and construction

A total of 1906 radiomics features were extracted from the global, basal, middle, and apical segments of LGE images, respectively. After assessing the reproducibility and further dimensionality reduction, 16, 7, 9, and 22 radiomics features in the global, basal, middle, and apical sections were finally selected for model building. Logistic regression algorithm was used to rank the weight coefficients of the selected features and build the radiomics model. The weight coefficients of the 7 most relevant features at the base-LV selected from the training set included one original, one wavelet-based, two Laplacian of Gaussian–based, and three logarithm-based features (Supplemental Fig. 1).

Predictive performance and clinical utility of radiomics models

Table 2 shows the predictive performance of the radiomics model for different sections. In the validation set, the AUC was 0.75, 0.77, 0.76, and 0.77 in the global LV, base, mid-ventricle, and apex, respectively (Fig. 2A). The DeLong test showed no statistical difference in AUC between different sections (all p > 0.05). Basal radiomics model was selected for further prognostic analysis.

Table 2 Predictive performance of radiomics models for different sections
Fig. 2
figure 2

ROC curve analysis for radiomics model of different sections in the validation set (A); ROC curves analysis for comparison of predictive performance of LV base-radiomics with semi-quantitative and quantitative LGE parameters (B); Rad-score for each patient in the training and validation sets (C); decision curve for the radiomics model in the training and validation sets (D)

The AUC of the basal Rad-score was higher than that of LGE total enhancement mass (0.77 vs. 0.54, p < 0.001) and LGE extent (0.77 vs. 0.53, p = 0.004) (Fig. 2B). Supplemental Table 2 shows the predictive performance of cardiac functional and LGE semi-quantitative and quantitative parameters in detail. The basal Rad-score of each patient in the training and validation sets is shown in Fig. 2C, which further confirms that radiomics can predict the adverse event of CA.

To estimate the clinical utility of established models, decision curve analysis (DCA) was performed by quantifying the net benefits at different threshold probabilities in the validation dataset and showed that both the training and validation sets to predict all-cause mortality had more benefit than either the “treat-all” model or the “treat-none” model (Fig. 2D).

Survival analysis and risk stratification

During a median follow-up duration of 12.9 months, a total of 40 CA patients died. Univariate survival analysis showed sex, age, diabetes, Mayo stage, QALE, LVEF, LVEDV, LVESV, and Rad-score were considered as factors for risk stratification of all-cause mortality in CA. ROC curves of the six factors with higher AUC are shown in Fig. 3A. Although the AUC of Mayo stage was higher than Rad-score, there was no statistically significant difference (0.81 vs. 0.77, p = 0.40). We further evaluated the predictive performance of Rad-score combined with Mayo stage and found that the AUC of their combination was higher than that of Mayo stage alone (0.86 vs. 0.81, p = 0.03) or Rad-score alone (0.86 vs. 0.77, p = 0.01), as shown in Fig. 3B. Other imaging indices such as EF, LGE semi-quantitative, and quantitative parameters combined with Mayo stage had no incremental value.

Fig. 3
figure 3

ROC curve of single factor (A) and Rad-score combined with Mayo stage (B). Graphs show AUC of the Rad-score combined with Mayo stage is higher than those of the Mayo stage alone and the Rad-score alone. Asterisk represents a statistical difference

Survival of patients was stratified by the risk classification according to the Rad-score cut-off value of 0.66, which was the best predictor of survival (HR 8.91; 95% CI, 3.89–20.44; p < 0.001) (Fig. 4). The survival curve indicates that there is an approximately 84% chance of survival at 12 months in patients with the Rad-score < 0.66 compared with 33% for patients with Rad-score ≥ 0.66. Figure 5 shows the radiomics analysis and prognosis correlation from LGE images of three CA patients, which suggests that radiomics model is a better prognostic assessment for CA.

Fig. 4
figure 4

Kaplan–Meier survival curves of Rad-score. Survival probability differs significantly categorized by Rad-score cut-off values of 0.66

Fig. 5
figure 5

Representative LGE images of CA patients. Panel A Base, mid, and apex—LV short-axis LGE MR images depict a focal, patchy LGE pattern in a 62-year-old woman with CA. This patient had no adverse events during 3 years follow-up. Panel B A 36-year-old male patient had transmural LGE at the base and subendocardial LGE in the middle and apical sections of LV. This patient died suddenly after 1 year of follow-up. Panel C A 59-year-old man had a low LGE extent but a high Rad-score of 0.83 (> 0.66), who died of cardiac arrest during 1 year of follow-up

Discussion

To the best of our knowledge, this is the first study focusing on the prognostic value of radiomics analysis based on LGE images in CA. The results of this study suggest that LGE-based radiomics features are associated with all-cause mortality in CA patients, which can be a helpful risk stratification tool in CA patients. The main findings of this study include the following: (1) The predictive performance of the radiomics model was similar in different sections of LV in the validation set, and the predictive performance of the basal radiomics model was superior to the LGE semi-quantitative and quantitative parameters; (2) Rad-score combined with Mayo stage further improved the predictive performance of adverse events in CA, suggesting that radiomics features have incremental value for clinical stage; (3) Rad-score ≥ 0.66 was associated with a significantly increased risk of all-cause mortality in CA patients.

The development of CMR technology has proposed numerous new imaging biomarkers for CA prognosis. Transmural LGE, T1 elevation, and extracellular volume all correlate with amyloid load and provide incremental information of clinical outcome [12, 14, 27]. For example, Kotecha et al [28] found that the T2 value was a predictor of prognosis in AL amyloidosis. Some new CMR techniques, such as CMR tagging, are considered essential to measure regional myocardial function and contribute to the quantitative assessment of local myocardial motion and strain, thus reflecting the severity of cardiac amyloid deposition [29, 30]. However, these indicators of CA on CMR imaging including the presence of LGE and more quantitative parameters do not reflect the microscopic information of the disease and may overlook some important prognostic information. Developments in radiomics have changed the way of interpreting images, thereby potentially improving diagnostic, prognostic, and predictive accuracy. A study by Kotu et al [20] demonstrated that texture features extracted from LGE images improved arrhythmic risk stratification in post-myocardial infarction patients. Amano et al [31] found that texture analysis may provide information about LGE related to ventricular tachyarrhythmias in hypertrophic cardiomyopathy. Our study confirmed the association of radiomics features with all-cause mortality in CA patients using LGE images, highlighting the value of radiomics in the prognosis of cardiovascular disease.

The Mayo staging system is an important clinical tool for risk stratification in patients with AL amyloidosis [9]. Our previous study [32] demonstrated that radiomics features were moderately correlated with Mayo stage, suggesting that radiomics features may reflect prognosis of CA. This study confirmed that the risk of death was significantly increased when Rad-score ≥ 0.66. Therefore, Rad-score can be a new imaging indicator for CA risk stratification. Moreover, radiomics features can provide incremental value relative to clinical prognostic predictors.

Our previous research demonstrated no significant differences in the diagnostic performance of radiomics models for different sections of the LV [32]. In this study, radiomics models were still constructed based on these sections, and we further found that their predictive performance differed in the training set, but did not show statistically significant differences in the validation set. This was similar to the results of our previous study [32], indicating that there was no significant difference in the diagnosis and prognosis of CA between radiomics features extracted from different sections, further confirming the rationality of selecting only base-LV for analysis. This finding did not seem to support the regional variation of LGE distribution in patients with CA. It is speculated that although amyloid load shows a significant base-to-apex gradient, all myocardium has already undergone at least subtle pathological changes, which can be easily detected by radiomics.

LGE-CMR is a classic imaging tool to analyze CA. However, accurate quantification of amyloid infiltration on LGE is challenging, because CA is typically a diffuse disease with mixing of the normal and the diseased myocardium. Therefore, the true severity of the lesion can be underestimated in patients with mild or diffuse lesions. Patients in the event-positive and event-negative groups in this study did not show statistically significant differences in quantitative LGE parameters. This may be because patients in the positive group had more severe myocardial involvement, making it difficult to outline the normal myocardial regions, and thus the quantitative LGE measurements were underestimated. The underlying mechanism remains to deserve further study.

Radiomics analysis is of clinical importance in CA. The radiomics score is expected to be incorporated into the Mayo staging system to create a new staging system and will help develop risk-adapted therapies for AL amyloidosis. Radiomics score combined with Mayo stage allows better classification of patients and helps to develop and test treatment strategies for specific patient groups in terms of clinical outcome.

There are several limitations in this preliminary study. First, it was a retrospective study and potential bias was not avoided; for example, we did not include ATTR form of the disease in this study. Second, the sample size of study subjects was limited and the follow-up duration was short with inconsistent time of follow-up endpoints across three centers. Third, no patients had undergone endomyocardial biopsy, thus lacking pathological evidence of myocardial involvement. Fourth, only all-cause mortality was used as primary outcome, data regarding cardiovascular outcomes were not available. Fifth, the population of this study was entirely composed of patients with AL CA, so the prognosis of radiomics in patients with different subtypes of CA still requires further studies. Sixth, different MRI parameters were used in this study. Finally, this study was only based on LGE images for radiomics analysis, and future studies on radiomics from other sequences such as T1 or T2 mapping are needed.

In conclusion, this study shows radiomics has better predictive performance than LGE semi-quantitative and quantitative parameters for poor prognosis in CA and may provide incremental value and improve risk stratification for all-cause mortality in patients with AL amyloidosis.