Introduction

Breast cancer is one of the most common malignancies and the second leading cause of cancer death among women [1]. MR imaging (MRI) is not only helpful in screening but also in defining the primary tumour. It is also better than other methods in assessing the response to chemotherapy treatment [2]. The main advantage of MRI over mammography or ultrasonography is its higher sensitivity [3, 4]. It can detect cancers missed by mammography or ultrasound in women who have dense breasts and can influence the surgical staging of breast cancer by enabling identification of multifocal and multicentric cancers in the ipsilateral and/or contralateral breast [5]. Conventional breast MRI includes morphological assessment, enhancement and time signal intensity curve (TIC) analysis [6,7,8,9]. Quantitative dynamic contrast-enhanced MRI (qDCE-MRI) is promising, because it can provide multiple pharmacokinetic parameters including semi-quantitative parameters (model-free parameters) and quantitative parameters (model-based parameters derived from pharmacokinetic model calculation), which can fundamentally describe the tumour vascularity and permeability. Previously, qDCE-MRI has been shown to improve the diagnostic specificity of MRI in many cancers [10,11,12,13]. Compared with semi-quantitative parameters, quantitative parameters are less affected by a wide variability in MRI scanners, scanning sequence, temporal resolution, injection of contrast media, and image post-processing calculation [14,15,16]. To ensure the repeatability and reproducibility of the parameter calculation, three fundamental entities are required for qDCE-MRI: (1) Pre-contrast T1 mapping to convert signal intensity into contrast agent concentration, (2) measurements based on both arterial input function and regions of interest tissue, and (3) a suitable pharmacokinetic model to analyse such data [17]. Nonetheless, the use of qDCE-MRI in differentiation between benign and malignant breast lesions has not been well evaluated. Previous studies on qDCE-MRI in breast cancer differed widely in terms of temporal resolution and sampling duration of data acquisition, and pharmacokinetic models used for calculation, and had a limited number of included patients [18,19,20].

In this study, we aimed to find if and which pharmacokinetic parameters derived from DCE data using a two-compartment extended Tofts model and three-dimensional volume of interest (3D-VOI) would be suitable for the differentiation between malignant and benign breast lesions. Furthermore, we assessed the specificity of the found parameters.

Materials and Methods

Patients

Between April 2016 and July 2016, 180 consecutive patients who were suspected to have solid breast tumour on mammography and ultrasound examination were enrolled. The study was approved by the institutional review board of Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University. Written informed consent was obtained from all patients. All patients underwent biopsy before MRI examination. All patients with histopathologically proven malignant lesions underwent surgery as well as the patients with high-risk lesions including papilloma, radial scars and benign phyllodes tumour. Patients were excluded if they underwent neoadjuvant chemotherapy, or if have had a recent surgical procedure in the breast before the MRI examination. Three patients with obvious motion artefacts on MRI were also excluded. Finally, 124 female patients, aged from 25-74 years with a mean of 47.6±11.9 years, were included in this study.

MRI

DCE-MRI was performed on a 1.5 T unit (Magnetom Avanto; Siemens Medical Solutions, Erlangen, Germany). The body coil was used as the transmitter, and an 8-channel phased-array breast coil (Siemens Medical Solutions, Erlangen, Germany) was used as the receiver. For all pre-menopausal women, the examination was performed between the seventh and 14th day of the menstrual cycle to reduce hormonal variations and minimize the enhancement on the fibroglandular tissue. The patients were placed in the prone position with their breasts placed in a holder to reduce motion. The acquisition sequences included T1- and T2-weighted imaging, and DCE imaging. After axial and sagittal T2-weighted imaging using inversion recovery for fat-suppression (repetition time/echo time [TR/TE], 3920/68 ms; inversion time, 170 ms; flip angle, 150°; field of view [FOV], 350×350 mm2; voxel size, 0.5×0.5×5.0 mm3;section thickness/gap, 5.00/0 mm) and Dixon T1-weighted imaging (TR/TE, 6.86/2.39 ms; flip angle, 15°; FOV, 360×360 mm2; voxel size, 0.9×0.9×2.0 mm3, section thickness, 2.00 mm and overlapping gap, 0.1 mm) were obtained, axial DCE imaging was acquired by using a 3D T1-weighted volume interpolated body examination (T1W-VIBE) sequence (TR/TE, 4.87/2.4 ms; flip angle, 10°; FOV, 380×380 mm2; voxel size, 0.5×0.5×3.0 mm3; slice thickness, 3.00 mm and overlapping gap, 1.5 mm). The DCE acquisition consisted of 40 measurements with a temporal resolution of 8 sec. After the initial two dynamic measurements, Gd-DTPA-BMA (Omniscan, GE Healthcare, Ireland) at the dosage of 0.1 mmol/kg was administered by intravenous bolus injection using an MR-compatible power injector (Spectris, Medrad, Pittsburgh, PA) at a flow rate of 3.5ml/sec, followed by a 20 ml saline flush. Before DCE acquisition, multiple flip angle images (2°, 4°, 6°, 8°, 10° and 12°) were obtained for the calculation of T1 maps using the same sequence and parameters except for the flip angle.

Image analysis

The morphology (size, margin and internal enhancement pattern) and time-signal intensity curve (TIC) of the lesions were assessed in isolation, and in a blinded manner by one radiologist (J.S. with 5 years of experience with DCE-MRI). The size was measured in the greatest diameter in either transverse or sagittal planes. The margin was categorised as smooth, irregular, or spiculated; and the internal enhancement pattern was categorised as homogeneous, heterogeneous or rim [19]. The TIC types were categorised as persistent enhancing (type I), plateau (type II) or washout (type III) according to the previously reported method [19].

DCE-MRI data were transferred to an off-line workstation and analysed by using a specialized quantitative analysis software (Omni Kinetics, GE Healthcare). Before parameter calculation, a nonlinear registration framework utilising the Free Form Deformation algorithm was used to correct misalignment caused by body motion between consecutive DCE scans, and the signal intensity was converted into gadolinium concentration using the variable flip angle method [17, 21]. Patient-specific arterial input function was drawn on the thoracic aorta. A two-compartment extended Tofts model was used to obtain quantitative parameters. To obtain perfusion parameters in a 3D-VOI, regions of interest (ROIs) were manually drawn on each slice to cover the whole lesion. DCE-MRI data were analysed by two radiologists (Z.W., and Z.C. with 5 years and 3 years of experience with breast MRI), who were blinded for histology. Necrotic or cystic areas of the lesions, if presented, were excluded from the evaluation. Quantitative parameters including Ktrans (volume transfer constant), Kep (reverse reflux rate constant), Ve (volume fraction of extravascular extracellular space), and Vp (volume fraction of plasma), and semi-quantitative parameters including TTP (time to peak), MaxCon (maximum concentration), MaxSlope (maximal slope) and AUC (area under curve), were calculated. The detailed calculation of the extended Tofts model has been reported previously [22,23,24].

Statistical analysis

The agreement of the evaluation of qDCE-MRI parameters between the two readers was calculated using the intra-class correlation coefficient (ICC). An ICC value >0.75 represents well to excellent agreement, and a value between 0.4 and 0.75 represents fair to moderate agreement. Data from the two readers were averaged for analysis. The normality of distribution of qDCE-MRI parameters was determined by using the Shapiro-Wilk test. All numerical data were presented as mean ± standard deviation. The Mann-Whitney U test and χ 2 test were respectively used to compare the numerical and categorical variables between benign and malignant lesions. Univariate binary logistic regression analysis was used to determine if a single parameter was able to differentiate between malignant and benign lesions. Receiver operating characteristics (ROC) analysis was used to evaluate the diagnostic ability of the parameters. The area under the ROC curve (AUROC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and accuracy were calculated for each parameter and the optimal threshold was determined by the Youden index. Those variables with p value less than 0.05, as determined by the univariate binary logistic regression analysis or by χ 2 test, were chosen for multivariate binary logistic regression analysis to determine the parameters that can best predict the diagnosis of malignancy. A collinearity diagnosis was made on the logistic regression model to test the independence of predictors. If there was multicollinearity, a least absolute shrinkage and selection operator (LASSO) model was used to select the potential predictors to be used by shrinking the coefficients toward zero through setting a constraint on the sum of the absolute standardized coefficients. Then, multivariate binary logistic regression analyses were performed to generate the coefficients of all independent predictors from LASSO selection. Odds ratios (OR) as estimates of relative risk with 95% confidence intervals (CI) were obtained for each independent predictor. For multivariate logistic regression, three models were developed: model 1 in which qDCE parameters were chosen as the independent risk factors; model 2 in which morphological parameters and TIC type were chosen as the independent risk factors; model 3 in which qDCE parameters, morphological parameters and TIC type were chosen as the independent risk factors. The diagnostic performance of three models was evaluated by ROC analysis and their AUROCs were compared by using the Hanley and McNeill test. The sensitivity, specificity, PPV, NPV and accuracy were calculated for each model. A two-sided p value of less than 0.05 was considered to indicate a significant difference. All statistical tests were performed by using R3.1.1 (www.r-project.org) and SPSS (version 21.0; SPSS, Chicago, Ill, USA).

Results

Study population

There was a total of 136 lesions in the 124 patients, including 105 malignant lesions and 31 benign lesions. 83 of 105 malignant lesions were diagnosed as infiltrating ductal carcinomas (IDC), eight were diagnosed as ductal carcinomas in situ (DCIS), four were diagnosed as infiltrating lobular carcinomas (ILC), six were diagnosed as papillary carcinomas, one was diagnosed as carcinosarcoma, one was diagnosed as adenoid cystic carcinoma (ACC), and two were diagnosed as mucinous carcinomas. Twenty-one of 31 benign lesions were diagnosed as fibroadenomas (FA), three were diagnosed as papillomas, five were diagnosed as radial scars, and two were diagnosed as benign phyllodes tumours.

Morphology and TIC

The averages of lesion size, margin, internal enhancement pattern and TIC, and the comparison between malignant and benign lesions are shown in Table 1. The malignant lesions had a greater size than the benign lesions. Smooth margin, homogeneous enhancement and type I TIC were more often found in the benign lesions (Fig. 1A), while irregular or spiculated margin, heterogeneous or rim enhancement, and type II or III TICs were predominant features of the malignant lesions (Fig. 2A). ROCs of each parameter are shown in Fig. 3A. The diagnostic performance of these parameters for discrimination between malignant and benign breast lesions is shown in Table 2. TIC type had the highest accuracy of 0.890 for the diagnosis of malignancy.

Table 1 Morphological parameters, TIC types and DCE-MRI parameters for malignant and benign breast lesions (n = 136)
Fig. 1
figure 1

Benign breast tumour. (A) A 35-year-old woman has a benign phyllodes tumour in the right breast, which is seen as a 27-mm mass (arrows) with hypointense signal on T1WI and hyperintense signal on fat-suppressed T2WI, with a smooth margin and heterogeneous enhancement on DCE-MRI (2 min after contrast injection) and a time-signal intensity curve (TIC) of type I; (B) DCE-MRI parameters are evaluated by a 3-D volume of interest (arrow); (C) Pseudo-colorized maps show individual parameters derived from DCE-MRI. This tumour has low MaxCon (0.1253), MaxSlope (0.1573), AUC (0.6175), Ktrans (0.0192min-1), Kep (0.1976 min-1), Ve (0.0976) and Vp (0.0070), but high TTP (5.6178 min).

Fig. 2
figure 2

Malignant breast tumour. (A) A 63-year-old woman has an infiltrating ductal carcinoma in the right breast, which is seen as a 36-mm mass (arrows) with isointense signal on T1WI and hyperintense signal on fat-suppressed T2WI, with a spiculated margin and rim enhancement on DCE-MRI (2 min after contrast injection) and a time-signal intensity curve (TIC) of type II; (B) DCE-MRI parameters are evaluated by a 3-D volume of interest (arrow); (C) Pseudo-colorized maps show individual parameters derived from DCE-MRI. This tumour has high MaxCon (0.1814), MaxSlope (0.4188), AUC (0.8897), Ktrans (0.2112 min-1), Kep (1.3795 min-1), Ve (0.1595) and Vp (0.0283), but low TTP (2.1685 min)

Fig. 3
figure 3

Graph shows the ROC curves of morphological parameters, time-signal intensity (TIC) type (A), qDCE-MRI semi-quantitative (B) and quantitative parameters C), and the three models established by multivariate logistic regression (D) for differentiation between malignant and benign breast lesions. The AUROC, sensitivity, specificity, PPV, NPV and accuracy are listed in Table 3

Table 2 ROC analyses of morphological parameters, TIC type and qDCE-MRI parameters significant for discrimination between malignant and benign breast lesions (n=136)

qDCE-MRI parameters

The ICC of qDCE parameters between the two readers was 0.862 for Ktrans (95% CI: 0.809-0.900), 0.836 for Kep (95% CI: 0.778-0.880), 0.901 for Ve (95% CI: 0.864-0.929), 0.988 for Vp (95% CI: 0.983-0.991), 0.897 for TTP (95% CI: 0.858-0.925), 0.707 for MaxCon (95% CI: 0.590-0.791), 0.782 for MaxSlope (95% CI: 0.705-0.840), and 0.730 for AUC (95% CI: 0.631-0.805). Therefore, the agreement between the two readers in the evaluation of the qDCE parameters was good.

The averages of the qDCE parameters and their comparison between malignant and benign lesions are shown in Table 1. The mean Ktrans, Kep, Vp, MaxCon, MaxSlope and AUC values were significantly higher in malignant lesions than those in benign lesions (p≤0.001). The mean TTP in malignant lesions was significantly lower than that in benign lesions (p<0.001). There was no significant difference in the Ve value between malignant and benign breast lesions (p=0.726) (Fig. 1B, C and 2B, C). ROCs of each parameter are shown in Fig. 3B, C. The diagnostic performances of these parameters are shown in Table 2. Ktrans had the highest specificity of 0.935 and highest accuracy of 0.890 for diagnosis of malignancy.

Multivariate binary logistic regression analysis

Ktrans, Kep, and MaxSlope were independent predictors for malignancy in model 1, as determined by univariate analysis, collinearity diagnosis, LASSO selection and multivariate analysis. For model 2, the lesion size, margin and TIC were independent predictors for malignancy. For model 3, the lesion size, margin, TIC, Ktrans and MaxSlope were independent predictors for malignancy. The diagnostic performance of the three models is shown in Table 3. Model 3 had the highest diagnostic accuracy among the three models. ROCs of the three models are shown in Fig. 3D. There was a significant difference in the AUROC between model 2 and model 3 (p=0.006), while no difference was found between model 1 and model 2 (p>0.05), as determined by the Hanley and McNeill test.

Table 3 ROC analyses of the three models established by logistic multiunivariate binary logistic analysis

Discussion

In this study, we used a 3-D-VOI and a two-compartment extended Tofts model to derive qDCE parameters of breast lesions. Our results showed that Ktrans, Kep, Vp, MaxCon, MaxSlope, TTP and AUC of malignant lesions were different from those of benign lesions. The multivariate binary logistic regression analysis showed that Ktrans, Kep and MaxSlope were independent predictors of malignancy. The logistic model including these three parameters combined with lesion size, margin and TIC had a higher diagnostic accuracy than the logistic model including only lesion size, margin and TIC for the discrimination of malignant from benign breast lesions. Therefore, the diagnostic performance was improved when morphological parameters and TIC were combined with qDCE parameters.

To date, only a few studies have used 3-D ROI to derive quantitative and/or semi-quantitative parameters from DCE-MRI for the differential diagnosis of breast lesions [19, 25]. Using a 3-D ROI, volumetric parameters can be obtained from the whole tumour. As regards tumour heterogeneity, volumetric parameters would more truly reflect the physiological characteristics of lesions in theory compared with using a two-dimensional ROI (2-D ROI) [26]. Previously, malignant breast tumours were found to have higher Ktrans than benign tumours by using 3-D ROIs [19]. This is consistent with our study results. However, it was also found that Ktrans was significantly higher in benign breast tumours compared to malignant tumours [25]. This discrepancy might be related to the different temporal resolution of DCE-MRI. It was reported that 12 sec or higher temporal resolution is desired to avoid bias and allow for accurate qDCE-MRI parameters calculation [28]. The temporal resolution of DCE-MRI in our study was 8 sec, which is higher than that in the above two studies, which was 15-20 sec [19, 25].

A multitude of semi-quantitative parameters calculated from the time-signal intensity curve derived from DCE-MRI were previously investigated to determine the nature of tumours [6, 9]. Compared with semi-quantitative parameters, quantitative parameters have the advantage to be less sensitive to the MR machine set-up and data acquisition protocols, which makes inter-patient and inter-system comparisons more accurate [27, 28]. Our study showed that AUC, MaxCon, TTP and MaxSlope had a higher sensitivity (81.0%-87.6%), specificity (77.4%-90.3%) and accuracy (80.1%-86.8%) for the differentiation of malignant from benign breast lesions than previously reported [6, 29]. However, the diagnostic accuracy of semi-quantitative parameters was lower than that of quantitative parameters, in particular, Ktrans and Kep. Quantitative parameters can improve diagnostic accuracy and provide an insight into the underlying biological characteristics of breast lesions [30, 31]. Previous studies showed that Ktrans and Kep could discriminate between malignant and benign breast lesions [18,19,20]. Our results showed that Ktrans, Kep combined with MaxSlope achieved a high diagnostic accuracy of 94.9%, which was higher than if each individual parameter would be used. This is reasonable because Ktrans and MaxSlope represents a wash-in process of contrast agent in the tumour and Kep represents a wash-out process [19].

Ktrans is determined by microvascular blood flow, vessel permeability and vessel density [22], while Kep represents vessel permeability. Vp was positively correlated with microvessel density [32]; Ve was influenced by cellular density and tumour interstitium [33]. In general, malignant breast lesions have higher microvascular blood flow, microvessel density, vessel permeability, cell density, contractible extracellular space and increased interstitial fluid pressure than a benign one. In our study, Ktrans, Kep and Vp were higher in malignant than benign breast lesions. These findings are consistent with previous reports [18, 19]. However, there was no difference in Ve. This reason might be assumed by the fact that cell proliferation in benign tumours may be active in some period, which would cause high cellular density and contractible tumour intersitium resulting in low Ve [34, 35].

Clinically, lesion morphology and kinetic TIC from DCE-MRI are regularly used to characterise breast lesions. In our study, the lesion size, margin and TIC were found to be independent predictors for malignancy. The diagnostic accuracy of lesion size, margin and TIC combined together was 89.0%, which was lower than that of quantitative parameters including Ktrans, Kep, and MaxSlope. Furthermore, the incorporating of morphology and TIC into qDCE parameters improved the diagnostic accuracy from 94.9% to 95.6%. Only a marginal benefit was observed for the incorporation of morphology and TIC. These results indicate that quantitative parameters are more important than morphological aspects including regular kinetic analysis of the lesions.

Notably, an underestimated T1 value from T1 mapping can cause the final overestimation of Ktrans [28]. It has been reported that the dual flip angle method of T1 mapping results in errors in Ktrans measurements compared to the multiple flip angle method; therefore, we chose to use a multiple flip angle to ensure correct derivation of Ktrans value [36]. Moreover, we found that Vp has a high diagnostic performance with a higher sensitivity and specificity than reported previously [20]. In that study, however, a much lower temporal resolution of 56 sec was applied to acquire DCE-MRI data, which might have caused the underestimation of its performance [20]. It has been suggested to use a temporal resolution up to 8 sec, after which the errors grow rapidly with the decrease of temporal resolution [37]. To estimate well the Vp, a temporal resolution of 4 sec was recommended for the tracer kinetics model, which is mathematically identical to the Extended Tofts-Kety model [38]. Whereas, a trade-off should then be reached between temporal resolution and spatial resolution. In our study, we used a 3-D fast scanning sequence to ensure appropriate spatial resolution and high temporal resolution of 8 sec. Such temporal resolution is already higher than that used in similar studies on breast tumours [19, 20, 39].

There are several limitations in our study. First, our study population includes several types of malignant and benign breast lesions. To determine the diagnostic value of qDCE-MRI, we simply categorised them as two groups; malignant versus benign tumours. A previous report suggested that pharmacokinetic parameters are continuous variables, and cut-off values that match a binary classification of malignant versus benign are unlikely [19]. Other reports demonstrated that pharmacokinetic parameters did not differ in different immunohistochemical subtypes of breast cancer [31, 40, 41]. Future studies with a larger number and a different subtype of breast tumours should be warranted to verify the value of qDCE-MRI in the differential diagnosis of breast lesions. Second, only mass lesions were investigated. Mass lesions can be easily defined when a volumetric ROI is applied for evaluation of qDCE parameters. Future investigations are needed to determine whether this method could be used on non-mass enhancement lesions.

Conclusion

In summary, qDCE-MRI parameters have a good performance in differentiating malignant from benign breast lesions, in particular, Ktrans, Kep and MaxSlope. The qDCE-MRI parameters combined with morphological parameters and kinetic curve types could achieve a high diagnostic accuracy. The qDCE-MRI can be used as an important tool for preoperative differential diagnosis between malignant and benign breast lesions.