Introduction

Breast cancer is the most commonly diagnosed cancer among women [1, 2]. Breast-conserving surgery (BCS) followed by radiotherapy is now established as the “standard of care” for early-stage invasive breast cancer (IBC) [3, 4]. Emerging pieces of evidence suggest that positive surgical margins are correlated with the increased locoregional recurrence after BCS [5,6,7]. Therefore, the success of BCS relies on the clear surgical margins during initial tumor resection in a cosmetically acceptable manner [8].

The presence of an intraductal component (ductal carcinoma in situ, DCIS) in invasive breast cancer (IBC-IC) is a commonly mentioned risk factor that drives positive margin [9,10,11], re-operation rate [12, 13], and local-regional recurrence [7, 14] in BCS. Lack of imaging biomarkers for preoperatively identifying IBC-IC is the main cause for positive surgical margin and repetitive surgery [13]. MRI is an important imaging modality in locoregional management of breast cancer [15] and has been proven to be more sensitive and accurate than mammography and ultrasonography in detecting and depicting the extent of a given invasive cancer [16] and DCIS [17, 18]. Previous studies also indicated that preoperative MRI improved surgical planning and outcomes of patients with DCIS by depicting additional malignancy [18] and was associated with a lower risk of positive surgical margins in patients with IBC eligible for BCS [19].

Preoperative prediction of the presence of an intraductal component in a given tumor is crucial for surgical planning and outcome in breast cancer [20]. To date, some researchers found that the MRI could improve the detection and depiction of the IBC-IC before surgery, compared with the other imaging modalities [21, 22]. Together with MRI-guided needle biopsy and MRI-guided surgery, MRI could contribute to improved surgical outcomes by reducing the positive margin during BCS [21]. However, evidence of the utility of radiomics based on dynamic contrast-enhanced MRI (DCE-MRI) to improve the preoperative prediction of IBC-IC, to our knowledge, is scarce. Radiomics refers to the science of converting medical images into high-throughput quantitative features, followed by subsequent data analysis and model building for clinical decision-making support [23]. MRI-based radiomics unravel an increasing number of mineable quantitative imaging biomarkers [24] and have been applied in breast cancer diagnosis [25, 26], molecular subtypes classification [27], therapeutic evaluation [28,29,30], early prognostic prediction [31,32,33], and recurrence risk prediction [34, 35], to stratify patients for precise and personalized care [36].

Therefore, we hypothesized that the intratumoral and peritumoral radiomic features derived from DCE-MRI might predict IBC-IC. The purpose of this retrospective study was to develop and validate radiomic models from two independent DCE-MRI protocols for preoperative prediction of IBC-IC.

Materials and methods

Patients

The retrospective analysis was approved by our institutional review board and the informed consent was waived. A total of 374 consecutive female patients between September 2017 and January 2020 were preliminarily enrolled in this study. The inclusion criteria were (1) patients with unilateral IBC confirmed by surgical pathology and (2) with DCE-MRI examination, which was the routine tool to aid breast surgeons for preoperative consulting and surgery planning in our institution. The exclusion criteria were (1) patients with previous breast surgery (n = 47), (2) with previous neoadjuvant or induction chemotherapy (n = 25), (3) with occult breast cancer (n = 2), and (4) without qualified image data (n = 6). Finally, a total of 294 patients were remained and divided into primary cohort (September 2017 to December 2018) and validation cohort (February 2019 to January 2020) according to two different MRI acquisition protocols (Fig. 1).

Fig. 1
figure 1

Flowchart for patient recruitment in this study. IBC-IC, intraductal component in invasive breast cancer; IBC, invasive breast cancer; DCE-MRI, dynamic contrast-enhanced MRI

Histopathological analyses

Histopathology and immunohistochemistry data were directly obtained from the surgical pathology report. The existence and status of the intraductal component were reviewed by two dedicated breast pathologists with 5 and 30 years of experience. All cases were thoroughly evaluated to identify the existence of the intraductal component from hematoxylin-eosin staining sections. Additional immunohistochemical staining (cytokeratin 5/6 and p63) was performed to differentiate intraductal components from invasive cancer. The status of the intraductal component including fraction and nuclear grade [37] was measured and documented as its ratio of the entire lesion (i.e., invasive plus intraductal part) [21]. In this study, IBC-IC was defined as the cases with DCIS component in or around invasive cancer (i.e., index cancer), while invasive cancer without DCIS component was defined as IBC. Estrogen receptor (ER) and progesterone receptor (PR) positivity was defined as the presence of 10% or more positively stained nuclei in 10 high-power fields. Human epidermal growth factor receptor 2 (HER2) expression was scored from 0 to 3: tumors with a score of 3 were classified as positive, tumors with a score of 0 or 1 were classified as negative, and tumors with scores of 2 were subjected to HER2 gene amplification by using chromogenic in situ hybridization to further confirm [31]. The threshold of high or low Ki-67 was defined as 14% [38].

MRI acquisition

All included patients underwent DCE-MRI on the 3.0 T scanner (Skyra, Siemens Healthcare) with a 16-channel breast coil. Gadodiamide (0.1 mmol/kg; Omniscan, GE Healthcare) was intravenously administered using a power injector (rate, 2.5 mL/s) before DCE-MRI. Then, a 20-mL saline flush was injected (rate, 2.5 mL/s). Two different ultrafast DCE-MRI protocols for radiomic model development and validation respectively were described as follows.

DCE-MRI protocol 1

The ultrafast DCE-MRI protocol for the primary cohort was performed between September 2017 and December 2018 which has been described in our previous studies [39, 40]. It included 26 consecutive phases using the commercially available CAIPIRINHA-Dixon-TWIST-VIBE sequence (Siemens Healthcare): repetition time (TR), 5.64 ms; echo time (TE), 2.46 ms; echo train length (ETL), 32; number of excitations (NEX), 1; field of view (FOV), 360 × 360 mm; slice thickness, 2.5 mm; no gap; matrix, 269 × 384; flip angle, 10°; acquisition type, 3D; temporal resolution, 12.0 s/phase; and total acquisition time (TA), 5 min 12 s. The images were reconstructed from the k-space using the inverse Fourier transform with interleaved stochastic trajectories method.

DCE-MRI protocol 2

The ultrafast DCE-MRI protocol for the validation cohort was performed between February 2019 and January 2020. It included 20 consecutive phases using the homemade sequence: TR, 5.06 ms; TE, 1.83 ms; ETL, 32; NEX, 1; FOV, 340 × 255 mm; slice thickness, 2.5 mm; no gap; matrix, 320 × 240; flip angle, 10°; acquisition type, 3D; temporal resolution, 16.6 s/phase; and TA, 5 min 33 s. The images were reconstructed from the k-space using the inverse Fourier transform with the linear filling method.

Image processing and segmentation

The 5th phase images of 26 dynamic phases in the primary cohort and the 4th phase images of 20 dynamic phases in the validation cohort were selected for processing and segmentation, which were equivalent to the first post-contrast images at the early arterial phase [41]. Then these images were preprocessed using the Artificial Intelligence Kit (A.K., version 3.2.0, GE Healthcare) software before segmentation. All images were firstly resampled to 1 × 1 × 1 mm3 isotropic voxels with a trilinear interpolation algorithm. The position of the interpolation grid was aligned by the center and the dimensions of the interpolation grid were rounded to the nearest integer. Gaussian filter was used for anti-aliasing when down-sampling and the bias-field correction was also conducted. Then voxel intensity values were normalized with a z-score to eliminate the influence of the different ranges of gray values.

The region of interest (ROI) that covers the entire MRI-visible tumor (i.e., index tumor) was manually contoured slice by slice in axial plane using ITK-SNAP (version 3.8.0, http://www.itksnap.org). The largest tumor was selected as the index tumor when there were multiple lesions. The peritumoral regions were then obtained using the A.K. software by equidistant 3-dimensional dilation of the intratumoral regions with 4 mm [42,43,44]. The segmentations for all cases were performed by one radiologist with 5 years of experience in breast imaging. To assess inter-observer agreement, 30 randomly selected cases were segmented again by another radiologist with 15 years of experience in breast imaging. Both radiologists were blind to the clinical and histopathological data. The reliability was calculated using the intraclass correlation coefficient (ICC). Features with ICCs greater than 0.75 indicated satisfactory reproducibility and were reserved for further analysis. The representative DCE-MRI images and the intratumoral and peritumoral ROIs are shown in Fig. 2.

Fig. 2
figure 2

The representative dynamic contrast-enhanced MRI images and the intratumoral and peritumoral regions of interest. a, b A 63-year-old female with invasive breast cancer. c, d A 46-year-old female with intraductal component in invasive breast cancer

Radiomic feature extraction

A total of 208 original features including shape, first order, and texture features were extracted from both the intratumoral and peritumoral ROIs (104 features for each) after image preprocessing and segmentation using an open-source Python package (PyRadiomics, version 3.0, https://pyradiomics.readthedocs.io) [45], which was basically compliant with the image biomarker standardization initiative (IBSI) standard [46]. A fixed bin number of 32 was used to discretize voxel intensity values before extracting texture features according to its good performance after z-score normalization in MRI [47]. Further details of radiomic features are provided in Supplementary Materials and Table S1-2.

Radiomic feature selection and radiomic signature construction

Radiomic features were standardized using z-score normalization and then selected to build radiomic signatures based on intratumoral radiomic features, peritumoral radiomic features, and the combined intratumoral and peritumoral radiomic features respectively in the primary cohort. Then, a four-step feature selection methodology was performed. Firstly, the Mann-Whitney U test was conducted to select the features that were statistically different between patients with IBC-IC and IBC. The p value threshold for the remaining significant features was set at 0.05. Secondly, the Spearman correlation analysis was sequentially conducted to exclude redundant radiomic features. Highly correlated features (Spearman correlation coefficient > 0.9) were excluded. Thirdly, the least absolute shrinkage and selection operator (LASSO) method was used to select the most predictive features associated with IBC-IC in the primary cohort [48]. The radiomic signatures were calculated via a linear combination of the selected features and weighted by the respective coefficients. Spearman correlations were performed among the features in radiomic signatures to assess their multicollinearity. Feature selection and radiomic signature building were all performed using R software (version 4.0.0, http://www.r-project.org).

Model development and validation

Independent factors for differentiating IBC-IC from IBC among radiomic signatures and clinical variables were identified by inputting significant variables found in the primary cohort using univariate logistic regression analysis. Then a multivariable logistic regression analysis was applied to build a radiomic nomogram.

The discrimination performance of each model was assessed by the area under the curve (AUC) of receiver-operator characteristic (ROC) curve in both the primary and validation cohorts. The optimal threshold was determined by maximizing the Youden index. The sensitivity and specificity were also calculated. The binomial exact method was used to determine the confidence intervals (CIs) of the reported performance metrics. The Hosmer-Lemeshow (H-L) test was performed to assess the goodness-of-fit of predictive models. The calibration curve was also plotted and its corresponding calibration slope and calibration-in-the-large were measured [49].

MRI image analysis

All MRI images were retrospectively analyzed by two radiologists according to the American College of Radiology Breast Imaging Reporting and Data System (ACR BI-RADS). Both radiologists were blinded to the pathological outcomes and assessed the following MRI morphologic features of each index tumor on the first post-contrast phase of DCE-MRI, including the lesion type (mass or non-mass enhancement, NME), lesion internal enhancement (homogeneous or heterogeneous), shape (oval/round or irregular), and margin (circumscribed or irregular/speculated) of masses. The disagreements between the two radiologists were resolved by consulting a third radiologist with 23 years of experience in breast imaging.

TRIPOD and statistical analysis

As a prediction model study for diagnostic purposes, this study followed the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement [50].

Statistical analyses were performed using MedCalc (version 18.2.1, https://www.medcalc.org) and SPSS software (version 26.0, https://www.ibm.com). The independent sample t-test was used to compare the age between IBC-IC and IBC groups. The chi-square test or Fisher’s exact test was conducted to analyze categorical variables including menopausal status, MRI morphologic features, and histopathological characteristics. The comparisons of AUCs between the predictive radiomic model with the highest AUC and the others were performed using the DeLong test in the primary and validation cohorts [51]. Further comparisons of sensitivity and specificity were performed using the McNemar test [52]. A two-tailed p value of <0.05 was considered statistically significant. The multiple comparisons were corrected using the Bonferroni method.

The association between IBC-IC and MRI morphologic features was further assessed using univariate logistic regression analysis. The prediction performance of the MRI morphologic features was assessed by the AUC of ROC curve and compared with that of the best radiomic model using the DeLong test [51].

Results

Demographic characteristics

A total of 294 patients including 72 (24.5%) IBC-IC patients and 222 (75.5%) IBC patients were recruited in this study. The MRI morphologic features and clinical and histopathological characteristics of patients in the primary and validation cohorts are summarized in Table 1. Significant differences were found in lesion type between the IBC-IC group and the IBC group in both primary and validation cohorts (p < 0.001 and p = 0.009). The IBC-IC patients were younger than IBC patients (p = 0.012) and the proportion of irregular or speculated mass margin and HER2 positive in the IBC-IC group was higher than that in the IBC group (p = 0.013 and 0.044) in the primary cohort. But there were no significant differences in age, mass margin, and HER2 status in the validation cohort (p = 0.336, 0.820, and 0.726). Additionally, there were no significant differences in menopausal status, lesion internal enhancement, mass shape, histological type, molecular subtype, ER status, PR status, and Ki-67 between the IBC-IC and IBC groups in the primary and validation cohorts (all p > 0.05).

Table 1 MRI morphologic features and clinical and histopathological characteristics of patients in the primary and validation cohorts

Radiomic feature selection and radiomic signature construction

Of the 208 extracted original radiomic features, 89 features from the intratumoral region and 55 features from the peritumoral region showed good inter-observer agreement (ICCs > 0.75). The process of radiomic features selection is presented in Supplementary Fig. S1-2. Finally, 4, 4, and 7 features were selected to construct intratumoral, peritumoral, and combined intratumoral and peritumoral radiomic signatures respectively. The mean and standard deviation of these features are provided in Supplementary Table S3 to conduct z-score normalization before calculation.

The calculation formula for intratumoral radiomic signature

ln (P/1-P) = -1.357 - 0.0095 × shape_Flatness - 0. 8371 × shape_Sphericity + 0.6066 × shape_SurfaceVolumeRatio - 0.7562 × GLDM_LowGrayLevelEmphasis, where P is the probability of IBC-IC (cutoff > 0.1817).

The calculation formula for peritumoral radiomic signature

ln (P/1-P) = -1.2048 - 0.4029 × shape_Flatness + 0.2186 × firstorder_10Percentile + 0.3995 × firstorder_RootMeanSquared - 0.2168 × firstorder_TotalEnergy, where P is the probability of IBC-IC (cutoff > 0.2755).

The calculation formula for combined intratumoral and peritumoral radiomic signature

ln (P/1-P) = -1.4034 - 0.9322 × intratumoral_shape_Sphericity + 0.5822 × intratumoral_shape_SurfaceVolumeRatio - 0.5794 × intratumoral_GLDM_LowGrayLevelEmphasis + 0.2619 × peritumoral_firstorder_10Percentile + 0.1189 × peritumoral_firstorder_Range + 0.3665 × peritumoral_firstorder_RootMeanSquared - 0.3933 peritumoral_firstorder_TotalEnergy, where P is the probability of IBC-IC (cutoff > 0.2496).

The distributions of these radiomic signatures in the primary and validation cohorts are shown in Supplementary Fig. S3. The Spearman correlations among the features in these radiomic signatures are presented in Supplementary Table S4.

Model development and validation

Univariate logistic regression analysis found that IBC-IC was associated with age (OR = 0.954, p = 0.013). Further multivariate logistic regression analysis found that only age in the peritumoral dataset was associated with IBC-IC (p = 0.041) (Table 2). Therefore, the peritumoral radiomic nomogram was developed by integrating the peritumoral radiomic signature and age (Supplementary Fig. S4). The calculation formula for peritumoral radiomic nomogram is as follows: ln (P/1-P) = 2.0973 - 0.0410 × age + 1.1666 × peritumoral radiomic signature, where P is the probability of IBC-IC (cutoff > 0.2374).

Table 2 Univariate and multivariate logistic regression analyses for predictive factors of IBC-IC in the primary cohort

Finally, four radiomic models for prediction of IBC-IC were built including intratumoral radiomic signature, peritumoral radiomic signature, peritumoral radiomic nomogram, and combined intratumoral and peritumoral radiomic signature. The ROC curves of these radiomic models in the primary and validation cohorts are shown in Fig. 3. The AUC, sensitivity, and specificity of each radiomic model in the primary and validation cohorts are summarized in Table 3.

Fig. 3
figure 3

Receiver operating characteristic curves of radiomic models for prediction of intraductal component in invasive breast cancer. a Primary cohort. b Validation cohort

Table 3 Diagnostic performance of radiomic models in the primary and validation cohorts

The calibration curves of intratumoral radiomic signature, peritumoral radiomic signature, peritumoral radiomic nomogram, and combined intratumoral and peritumoral radiomic signature showed good agreement between prediction and observation in both primary and validation cohorts (Supplementary Fig. S5), and the results of the Hosmer-Lemeshow test were insignificant (p = 0.324, 0.177, 0.219, 0.256 in the primary cohort and p = 0.873, 0.265, 0.800, 0.281 in the validation cohort, respectively), which suggested no departure from a perfect fit.

Univariate logistic regression analysis revealed that lesion type was associated with IBC-IC (OR = 7.118, 95% CI = 2.457–20.622, p < 0.001). The lesion type yielded AUCs of 0.600 (0.526–0.672) in the primary cohort and 0.581 (0.483–0.674) in the validation cohort (Supplementary Fig. S6).

Comparison of models and TRIPOD

The combined intratumoral and peritumoral radiomic signature yielded the highest AUC among the radiomic models in both the primary and validation cohorts (AUC = 0.821, 95% CI = 0.758–0.874; AUC = 0.815, 95% CI = 0.730–0.882, respectively). According to the DeLong test, the AUC of the combined intratumoral and peritumoral radiomic signature was higher than that of the peritumoral radiomic signature and nomogram in the primary cohort (p < 0.001 and p = 0.006), but not in the validation cohort (p = 0.930 and 0.626). There were no significant differences in the AUCs between the combined intratumoral and peritumoral radiomic signature and the intratumoral radiomic signature in the primary (p = 0.080) or validation cohort (p = 0.056) (Table 4). The McNemar test showed the specificities of the combined intratumoral and peritumoral radiomic signature in both the primary and validation cohorts were higher than those of the intratumoral radiomic signature (both p < 0.001). The sensitivity of the combined intratumoral and peritumoral radiomic signature was higher than that of the peritumoral radiomic signature in the primary cohort (p = 0.006), but not in the validation cohort (p = 0.727) (Table 4). The discriminating performances of the combined intratumoral and peritumoral radiomic signature in the primary and validation cohorts were superior to that of the lesion type (0.821 vs. 0.600, p < 0.001; 0.815 vs. 0.581, p < 0.001, respectively). The representative images of the lesion type as mass and NME in IBC-IC and IBC groups are shown in Fig. 4.

Table 4 Comparisons of AUC, sensitivity, and specificity between the combined intratumoral and peritumoral radiomic signature and the other radiomic models in the primary and validation cohorts
Fig. 4
figure 4

a–d The representative images of the lesion type as mass and NME in IBC-IC and IBC groups. The radiomic signature was calculated from the combined intratumoral and peritumoral model and the cutoff probability for prediction of IBC-IC was 0.2496. NME, non-mass enhancement; IBC-IC, intraductal component in invasive breast cancer; IBC, invasive breast cancer

We concluded the type of this study could be type 2b, and the TRIPOD checklist is presented in Supplementary Table S5.

Discussion

In this study, we developed and validated radiomic models incorporating intratumoral and peritumoral radiomic features for preoperative prediction of IBC-IC based on two different DCE-MRI protocols. The result showed the combined intratumoral and peritumoral radiomic signature had the optimal ability to preoperatively predict IBC-IC.

Preoperative prediction of IBC-IC based on DCE-MRI can help breast surgeons to optimize surgical planning before BCS. The incidence of IBC-IC is not rare in clinical practice. It was 24.5% (72/294) in our study, which was similar to previous literatures (20.6 to 52.6%) [21, 22, 53, 54]. IBC-IC is also a well-known risk factor that drives positive margin [9,10,11] and re-operation rate [13] in BCS. The published positive surgical margin rates in patients with IBC-IC ranged from 19.3 to 38.3% [55, 56] and the average re-operation rate was about 29.5% [13], which were both much higher than that in patients with pure IBC. Therefore, anticipating the IBC-IC before surgery can help surgeons appropriately define resection margins during BCS and reduce the rate of positive surgical margins [21] and incomplete resections [20]. Our results showed that the radiomic models based on DCE-MRI could non-invasively predict the presence of an intraductal component in patients with IBC before BCS, which might facilitate the personalized surgical planning for these patients. Together with MRI-guided needle biopsy and MRI-guided surgery, previous research demonstrated that the rate of positive surgical margins in IBC-IC patients could be reduced to 3.7% [21].

Traditional imaging researches found that the extensive IBC-IC often presented as ductal or linear enhancement, long spicules, a regional enhancing area, or nodules adjacent to a mass on MRI [22]. Compared the MRI findings with the histopathological examination, Van Goethem et al found that the peritumoral enhancement on MRI corresponded in 89.1% of cases with malignant extension by surgery pathology, and more than half of these correlated with DCIS [57]. Therefore, utilization of the peritumoral radiomic features may be helpful in the detection of IBC-IC. Previous studies showed that the early and overall peritumoral enhancement and amount of fibroglandular tissue around the MRI-visible tumor were associated with extensive IBC-IC, and the AUC of the prediction model based on these features was 0.79 [20]. These results are concordant with our results, suggesting that the peritumoral characteristics of breast tumors on MRI are of great help for the detection of IBC-IC. Recent studies have also demonstrated that the peritumoral radiomic features contained crucial information for breast cancer diagnosis [58], prognosis prediction [42, 44], and therapeutic evaluation [59]. As compared with the above studies, our study revealed that the combined intratumoral and peritumoral radiomic signature was the most optimal model among the radiomic models in predicting IBC-IC with the highest AUCs of 0.821 and 0.815 in both the primary and validation cohorts.

Our results also showed that the predictive performance of radiomic features was superior to the morphological features. Traditionally, morphological features of index tumor on MRI are the only predictors for IBC-IC prediction, which are subject to visual interpretation. The traditional diagnostic mode is highly affected by the experience of radiologist. Thus, the performance for the prediction of IBC-IC based on morphological features ranged from 68 to 84.9% in previous literature [21, 22].

In the combined intratumoral and peritumoral radiomic signature, the intratumoral shape_Sphericity was negatively associated with IBC-IC and the intratumoral shape_SurfaceVolumeRatio was positively associated with IBC-IC, indicating that the masses of IBC-IC had less spherical and compact shape than that of IBC. The GLDM_LowGrayLevelEmphasis from the intratumoral region was negatively associated with IBC-IC, indicating that the masses of IBC-IC had less homogeneous textures with less concentration of low gray-level values. Besides, 4 first-order features from the peritumoral region including 10Percentile, Range, RootMeanSquared, and TotalEnergy were selected to construct the combined intratumoral and peritumoral radiomic signature. Our results suggested that the IBC-IC had a higher 10th percentile and range of gray level intensity, a higher square root of the mean of all the squared intensity values, and a lower sum of the squares of voxel values within the peritumoral region than IBC.

This study had several limitations. First, the retrospective design of this study had inherent selection bias. Second, the radiomic models in the study were developed and validated from two ultrafast DCE-MRI protocols using the same MRI scanner. Thus, the variance of radiomic features caused by different MRI scanners was not investigated. In addition, this was a single-center study. Hence, the radiomic models in this study need to be verified in a multicenter study with different imaging equipment in the future.

In conclusion, we developed and validated the radiomic model for preoperative prediction of IBC-IC based on the combined intratumoral and peritumoral features from DCE-MRI. This radiomic model provided the individualized probability of IBC-IC and might help to optimize surgical planning for patients with breast cancer before BCS.