Introduction

Breast cancer has become the most commonly diagnosed cancer, as well as the leading cause of cancer-related deaths in women worldwide [1]. In these patients, it is the metastases at distant sites, rather than the primary tumor, that are the main cause of death [2]. One of the key factors of metastases is lymphovascular invasion (LVI), which was defined as the infiltration of neoplastic cells within blood vessels or lymphatic lumens [3]. Notably, it is one of the critical steps in the metastasis, and it has been proved to be an independent prognostic factor for local recurrence (LR) or distant metastasis (DM) of breast cancer [4]. At present, LVI can only be diagnosed with a surgical specimen postoperatively [5]. Preoperative knowledge of LVI in breast cancer may assist clinicians in making more reliable clinical decisions, including type of surgery, surgical margin determination, and guiding more aggressive neoadjuvant treatment protocols. However, preoperative prediction of LVI is still a main challenge [6].

Clinicopathologic risk indicators related to LVI of breast cancer have been studied, but the results are inconsistent [7, 8]. MRI is an important diagnostic tool for tumor detection and prediction of neoadjuvant therapy or prognosis in patients with breast cancer, owing to the advantages of high-resolution and multiparameter scanning [9]. Previous studies have shown that some MRI features, such as peritumoral edema, adjacent vessel sign (AVS), and tumor apparent diffusion coefficient (ADC) values were associated with LVI [10,11,12,13,14]. However, these features have not been extensively utilized for prediction of LVI, and they should be further assessed clinically.

Radiomics can capture underlying pathophysiological information, and it has noticeably attracted scholars’ attention [15, 16]. The association between intratumor heterogeneity and LVI has been previously reported [17,18,19,20]. To date, few studies have evaluated the application of radiomics to predict LVI in patients with breast cancer [21], and no study has specifically concentrated on the potential value of multiparametric MRI-based radiomics to reliably predict LVI and outcomes.

Therefore, the present study aimed to investigate whether the radiomics nomogram, integrating clinical factors, MRI features, and radiomics features extracted from multiparametric MRI could be used to predict LVI and outcomes in patients with invasive breast cancer.

Materials and methods

Patients

This retrospective study was approved by the institutional ethics review board of our hospital and the requirement of informed consent was waived. A total of 538 consecutive female patients with breast invasive ductal carcinoma (IDC), who underwent mastectomy or breast-conserving surgery from March 2014 to December 2017, were recruited. Patients were excluded if one of the following criteria was met: LVI status was not explicitly reported pathologically, patients with a history of other malignant tumors, undergoing neoadjuvant chemotherapy preoperatively, undergoing vacuum-assisted excisional biopsy before the MRI examination, poor quality of imaging, or unavailable follow-up data. Finally, a total of 160 patients with IDC were included and were randomly assigned into training dataset (n = 112) and validation dataset (n = 48) at a ratio of 7:3.

Histopathology

All patients underwent surgical treatment within 2 weeks after MRI examination. The pathological type, grade, and LVI status were recorded. LVI was defined as positive when there were cancer cells in lymphatic vessels or small blood vessels outside the tumor. Immunohistochemistry (IHC) was performed to detect the expression levels of estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER-2), and Ki-67 expression status. The cutoff value for ER and PR was set at 1%, and the cutoff value for Ki-67 was 20%. Concerning the HER-2 status, according to the American Society of Clinical Oncology (ASCO)/College of American Pathologists (CAP) guidelines [22], tumors with an IHC staining score of 0 or 1 + were defined as HER-2-negative and those with an IHC staining score of 3 + as HER-2-positive. For tumors with an IHC staining score of 2 + , fluorescence in situ hybridization (FISH) was further performed to confirm the HER-2 status. A non-amplified FISH result denotes the HER-2 status as negative, and an amplified result denotes the HER-2 status as positive. Breast cancer can be classified into four main immunohistochemical subtypes, namely luminal A, luminal B, HER2-enriched, and triple negative subtypes according to the 2013 St. Gallen Consensus Conference [23].

MRI protocol and radiologic evaluation

All MRI examinations were performed on Achieva 3.0-T system (Philips Healthcare) with an eight-channel phased array breast coil. The routine protocols were composed of turbo spin-echo T1-weighted imaging (T1WI), fat-suppressed spin-echo T2-weighted imaging (T2WI), echo-planar diffusion-weighted imaging (DWI), and dynamic contrast-enhanced MRI (DCE-MRI). Imaging acquisition parameters were detailed in Appendix E1. For patients with multiple lesions, the largest lesion was selected for radiomics analysis.

MR images were reviewed independently by two breast radiologists with 15 and 10 years of MR experience respectively, and consensus was reached by discussion in event of disagreement. Both radiologists were blinded to all histopathological data. Those radiologists assessed the following imaging features of tumors: tumor size, lesion type, tumor margin, time-intensity curve (TIC) pattern, peritumoral edema, AVS, and MRI-reported axillary lymph node (mrALN) status. Peritumoral edema positive was defined as the higher signal intensity surrounding the tumor compared with that of surrounding breast tissue or almost equal to that of water by visual evaluation on T2W images [24]. The presence of the AVS was defined as the presence of vessel either entering the lesion or contacting the edge of the lesion on the contrast T1W images [10]. Positive-MRI findings of ALN were accompanied with a short-axis diameter exceeding 10 mm, a ratio of the longest axis to the shortest axis was < 1.5, loss of fatty hilum, and eccentric cortical thickening [25].

MRI segmentation and radiomics features extraction

Regions of interests (ROIs) were manually drawn on the transverse T2WI, DWI, and cT1WI (the early enhanced phase at 60 s after contrast injection), along the edge of the lesion slice-by-slice for each patient, by the above two radiologists using ITK-SNAP 3.8 software (http://www.itksnap.org). The ROIs on DWI were automatically mapped to ADC map.

Image preprocessing and feature extraction were performed using open-source Pyradiomics software (http://PyRadiomics.readthedocs.io/en/latest/). Details of the feature extraction algorithms are presented in Appendix E2.

Radiomics feature selection and radiomics signature building

We devised a four-step procedure for selecting robust radiomics features. Firstly, the intraclass correlation coefficient (ICC) was calculated to explore the stability of radiomics features, and features with an ICC < 0.80 were excluded. Secondly, the univariate analysis was performed to select LVI-related features (with p < 0.01). Thirdly, the Spearman’s correlation analysis with r ≥ 0.90 was used to eliminate the redundancy. Finally, we used the least absolute shrinkage and selection operator (LASSO) method to select the most significant features in the training dataset. Multivariate logistic regression analysis was then conducted to develop a radiomics signature using the Akaike information criterion (AIC) as the stopping rule for predicting LVI.

Development, performance, and validation of the radiomics nomogram

Univariate analysis was used to assess the association between the clinico-radiological (clinicopathological and radiological) characteristics and LVI status, and significant risk factors were considered for the multivariate logistic regression analysis to develop the clinico-radiological model. Similarly, the radiomics signature and the abovementioned clinico-radiological signatures were then tested using univariate and multivariable logistic regression analyses to construct the radiomics nomogram for prediction of LVI in the training dataset.

Receiver operating characteristic (ROC) curves and the area under the curve (AUC) values were used to assess the discrimination performance of the established models for LVI prediction. Calibration curve was adopted to investigate the calibration of the radiomics nomogram in the training and validation datasets. The Hosmer–Lemeshow test was used to assess the goodness-of-fit of the radiomics nomogram. Decision curve analysis (DCA) was conducted to estimate the clinical usefulness of the radiomics nomogram by calculating the net benefits at different values of threshold probability.

Statistical analysis

The statistical analysis was conducted using the SPSS 22.0 software (IBM Corp.) and R software version 3.6.3 (http://www.R-project.org, Appendix E3). Clinico-radiological features of patients in LVI-positive and LVI-negative groups and in the training and validation datasets were compared by the Student t-test or the Mann–Whitney U test for continuous variables and Chi-squared or the Fisher exact test for categorical variables. Disease-free survival (DFS) was the outcome of interest, which was defined as the time from the date of surgery to that of the first LR or DM or death from any cause. The association of the radiomics nomogram with DFS was analyzed by using the Kaplan–Meier method and compared by the log-rank test. The optimum cutoff values to classify patients into high-risk and low-risk groups, according to the radiomics signature or nomogram, were identified using the maximally selected rank statistical method. Univariate and multivariate logistic regression analyses with the Cox proportional-hazards regression determined the predictors of DFS. Variables that reached a statistical significance in the univariate analysis were considered for the multivariate logistic regression analysis. A two-sided p < 0.05 was considered statistically significant.

Results

Clinico-radiological characteristics

No significant difference was found in all the clinico-radiological features between the training and validation datasets (p = 0.056–0.988) (Table S1). The clinico-radiological features of patients in LVI-positive and LVI-negative groups in the training and validation datasets are shown in Table 1. The rates of LVI were 36.7% (41 of 112) and 31.3% (15 of 48) in the training and validation datasets, respectively.

Table 1 Comparisons of clinico-radiological features between patients with LVI and without LVI in the training and validation datasets

The univariate analysis showed that tumor size, lesion type, tumor margin, peritumoral edema, AVS, and mrALN status were significantly correlated to LVI (p < 0.05) (Table S2). The multivariate logistic regression analysis revealed that peritumoral edema (odds ratio [OR] = 3.500; 95%confidence interval [CI]: 1.611–7.604; p = 0.002), AVS (OR = 2.266; 95%CI: 1.025–5.010; p = 0.043), and mrALN status (OR = 3.503; 95%CI: 1.625–7.552; p = 0.001) remained independent predictors of LVI (Table 2). A clinico-radiological model was then established based on the abovementioned independent predictors of LVI. Representative MR images of LVI-positive and LVI-negative of IDC are presented in Figure S1.

Table 2 Multivariate logistic analysis of the risk factors for LVI

Radiomics feature selection, radiomics signature building, and validation

In this study, a total of 3111 imaging features for each patient were extracted, 1037 features were from each of the ADC, cT1WI, and T2WI sequences. We selected 930, 897, and 976 features with high stability and reproducibility (ICC > 0.80) from these features for the above sequences, respectively. Afterward, a total of 1228 features were selected by the univariate analysis. After eliminating redundancy by applying the Pearson correlation analysis, 96 features remained, and were then subjected to the LASSO method. Finally, 3 ADC features, 2 cT1WI features, and 7 T2WI features were selected. The Rad-score calculation formulas of each sequence were constructed with the corresponding selected features, respectively (Appendix E4).

For the fusion radiomics signature derived from the abovementioned sequences, eight radiomics features were selected after the multivariate logistic regression analysis, including two wavelet features and six Laplacian of Gaussian (LoG) features. All these eight features were significantly different between the LVI-positive and LVI-negative groups (all p < 0.05; Fig. 1). The Rad-score calculation formula of the fusion radiomics signature model was constructed (Appendix E4). The distributions of the Rad-scores for each patient in the training and validation datasets are shown in Fig. 2a and b. The Rad-scores in the LVI-positive group were significantly higher than those in the LVI-negative group in the training and validation datasets (Fig. 2c and d).

Fig. 1
figure 1

Plots (ah) illustrate the eight radiomics features with a significant difference between the LVI-positive and LVI-negative groups in the training dataset

Fig. 2
figure 2

Plots (a, b) show the Rad-score for each patient, and plots (c, d) depict the Rad-score in the training and validation datasets, respectively

The prediction performances of the single radiomics signature and the fusion radiomics signature for predicting LVI status are presented in Fig. 3 and Table 3. The radiomics signature derived from ADC, cT1WI, and T2WI sequences yielded AUC values of 0.804 (95%CI: 0.719–0.889), 0.800 (95%CI: 0.717–0.883), and 0.860 (95%CI: 0.790–0.930) in the training dataset, which were then confirmed in the validation dataset with the AUC values of 0.762 (95%CI: 0.600–0.924), 0.713 (95%CI: 0.558–0.868), and 0.844 (95%CI: 0.705–0.984), respectively. The fusion radiomics signature yielded the highest AUC values of 0.905 (95%CI: 0.850–0.960) and 0.848 (95%CI: 0.712–0.985) in the training and validation datasets, respectively.

Fig. 3
figure 3

Plots (a, b) display the ROC curves of the radiomics signatures, the clinico-radiological model, and the radiomics nomogram for LVI status in the training and validation datasets, respectively

Table 3 Diagnostic performance of the radiomics signatures, clinico-radiological model, and the radiomics nomogram for LVI status

Development, performance, and validation of the radiomics nomogram

On multivariate logistic regression analysis, the radiomics signature and peritumoral edema were identified as independent risk factors for predicting LVI status. The radiomics nomogram incorporating the abovementioned independent predictors showed the highest discrimination ability for predicting LVI, with AUC values of 0.919 (95%CI:0.871–0.967) and 0.863 (95%CI: 0.726–0.999) in the training and validation datasets, respectively (Figs. 3 and 4, Figure S2, and Table 3).

Fig. 4
figure 4

The developed radiomics nomogram for the prediction of LVI status in the training dataset

The calibration curve of the radiomics nomogram showed a good agreement between the predicted LVI probabilities and the actual LVI status in both training and validation datasets (Fig. 5a and b). The DCA showed that the radiomics nomogram had higher net benefit than the radiomics signature and the clinico-radiological model in predicting LVI in the two datasets (Fig. 5c and d).

Fig. 5
figure 5

Plots (a, b) illustrate the calibration curves of the radiomics nomogram in the training and validation datasets, respectively. Plots (c, d) show the results of DCA for the radiomics nomogram in the training and validation datasets, respectively. The Y-axis represents the net benefit. The X-axis indicates the threshold probability

Predictors of survival

The median follow-up period was 41 months (range, 5–75 months). LR, DM, or deaths from any cause occurred in 25 patients (15.6%) at a median follow-up period of 16 months (range, 5–36 months).

We identified the optimal radiomics signature or nomogram-defined score for prediction of DFS as 0.711 and 0.581, respectively. Accordingly, all patients were divided into high-risk and low-risk groups, and the Kaplan–Meier survival curves showed that higher radiomics signature or nomogram-defined scores were significantly associated with a worse DFS in the entire cohort (hazard ratio [HR], 2.950; 95%CI: 1.339–6.501, p = 0.005 versus HR, 3.160; 95%CI: 1.434–6.963, p = 0.003) (Fig. 6). Table S3 summarizes the results of the univariate Cox regression analysis of the predictors of DFS in the entire cohort. The results of the multivariate Cox regression analysis showed that the Rad-score was an independent predictor of DFS (HR, 4.535; 95%CI: 1.428–14.404, p = 0.010), following the peritumoral edema (Table 4).

Fig. 6
figure 6

Kaplan–Meier survival analysis according to the radiomics signature (a) and the nomogram (b) in the entire cohort

Table 4 Multivariable Cox regression analysis of predictors of DFS in the entire cohort

Discussion

In the present study, we developed and validated a radiomics signature derived from multiparametric MRI, which was capable of predicting LVI in patients with breast IDC. Furthermore, the proposed radiomics nomogram that integrated MRI features and the radiomics signature exhibited a high accuracy for predicting LVI preoperatively, providing a straightforward and noninvasive approach for personalized prediction of LVI preoperatively. Additionally, the nomogram was significantly associated with DFS, playing an important role in predicting the prognosis in patients with breast IDC.

A recent study considered that DCE-MRI-based radiomics signature was an independent risk factor for predicting LVI status of IDC patients [21]. In their study, the researchers only evaluated the clinical value of radiomics derived from contrast-enhanced MRI. While in the present study, multiparametric MRI-based radiomics features, including T2WI, ADC maps, and cT1WI, were extracted and used for LVI prediction. The results demonstrated that the fusion radiomics signature from the combination of all the sequences achieved a better predictive efficacy than either of them alone for LVI status of breast cancer. Different sequences can reflect different characteristics of tumors, such as tumor intensity, histologic cellularity, and microvessel density. Therefore, multiparametric MRI features can make full use of the advantages of each sequence to reflect more comprehensive tumor information. Additionally, seven of the eight selected features were derived from T2-weighted images, while the DCE-MRI-based radiomics signature was not included. This is consistent with previous speculation that T2-weighted images capture better tumor heterogeneity than contrast-enhanced T1-weighted images [26, 27].

For the fusion radiomics signature, eight radiomics features were selected after the multivariate logistic regression analysis, including two wavelet features and six LoG features. The LoG filter employs a Gaussian filter to smooth the images, which may help decrease the influence of noise. It can also enhance textural details, which will help improve the efficiency of capturing phenotypic features mapped to tumoral heterogeneity [28, 29]. The wavelet transformation can help us quantify high-dimensional multi-frequency tumor information that are difficult to explain intuitively [30]. Wavelet features mirror multi-frequency information of the tumor in a different wavelet-transformed frequency range [30]. LoG and wavelet features are extracted from multiple frequency and spatial domain, which are difficult to explain intuitively in clinic. Nevertheless, through radiomics analysis, these high-dimensional image features were detected and successfully used in the prediction of efficacy of neoadjuvant therapy, lymph node metastasis, and gene expression [28,29,30]. In our study, all these high-dimensional features were significantly higher in the LVI-positive group than those in the LVI-negative group.

In addition, peritumoral edema, AVS, and mrALN status were independent predictors of LVI in the clinico-radiological model in this study. However, only peritumoral edema showed a sufficient predictive power with the adjunction of radiomics signature into the multivariable logistic regression model. Then, peritumoral edema was incorporated to develop a radiomics nomogram model, which showed a satisfactory discrimination ability for predicting LVI, with an AUC value of 0.863 (95% CI: 0.726–0.999) in the validation dataset. The biological mechanism of the association between peritumoral edema and LVI remains elusive. Peritumoral edema may be caused by the increased vascular permeability and peritumoral injection of cytokines [1024]. Uematsu et al. pointed out that edema may be caused by obstruction and dilation of lymphatic drainage massage by tumor emboli, and a high degree of LVI is significantly associated with edema [12, 13].

Histological LVI has been reported to be associated with a poorer prognosis of breast cancer in numerous studies [431,32,33]. Additionally, there is a significant difference in survival outcomes between the high- and low-risk groups defined by our radiomics-based models, in which patients with high-risk LVI had a significantly poorer DFS, compared with those with low-risk LVI. As the outcomes in the LVI high-risk group were not satisfactory postoperatively, neoadjuvant therapy may be a better choice for patients in the high-risk group to obtain drug sensitivity in vivo and to kill potential metastatic cells as early as possible. Therefore, based on our radiomics model, it is feasible to guide the treatment plan and implement personalized treatment.

Several limitations of this study should be pointed out. Firstly, this was a retrospective single-center study, indicating the necessity to a large-scale prospective multicenter study to further validate the effectiveness of the proposed radiomics nomogram. Secondly, multi-phase DCE-MRI is commonly used in breast contrast-enhanced scan of breast, while only the early images after enhancement were used to extract radiomics features in this study. Finally, the DCE-MRI sequence of our institution was originally designed for the analysis of quantitative pharmacokinetic parameters. The spatial resolution was slightly inferior to the conventional contrast T1W sequence, which may affect the robustness of these radiomics features. Moreover, extracting radiomics features of quantitative parameter (ktrance, kep, and Ve) of DCE-MRI may provide more information for identification of LVI status, so that further studies are needed to verify this aspect as well.

In conclusion, radiomics features derived from multiparametric MRI are potential biomarkers for predicting LVI. The proposed radiomics nomogram, incorporating multiparametric MRI-based radiomics signature and MR-reported peritumoral edema, exhibited a promising predictive efficacy for LVI status and clinical outcomes, and may provide valuable guidance for individualized treatment in patients with breast IDC.