Introduction

Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has become an important imaging tool for breast cancer staging [1,2,3] because the technique has high sensitivity, which allows detection of additional lesions that are undetectable by mammography or ultrasound (US) in 6%–34% of cases [4]. However, the reported specificity of DCE-MRI ranges from 37% to 97%, so additional lesions in breast cancer patients should always be confirmed histopathologically to rule out multifocal or multicentric cancers [5, 6]. Indeed, histologically confirmed multifocal or multicentric breast cancers occur in 21%–63% of affected breasts [7, 8]. Although the American Joint Committee on Cancer (AJCC) staging system does not consider whether the disease is multifocal or multicentric [9], such findings are known to be associated with worse breast cancer-specific and relapse-free survival [10]. Furthermore, multifocality and multicentricity in breast cancer affect the decisions surrounding surgical management, such as whether to perform mastectomy or breast-conserving surgery [10].

Multiparametric MRI, which derives diagnostic information from both morphologic and functional MRI parameters, has increased the accuracy of breast cancer diagnosis [11, 12]. Among the functional MRI parameters used, diffusion-weighted imaging (DWI) has become important in clinical practice; DWI provides apparent diffusion coefficient (ADC) values, which differ significantly between benign and malignant lesions [13, 14]. Moreover, combining DWI with conventional DCE-MRI improves diagnostic accuracy and reduces the number of false diagnoses [11, 12, 15]. However, no previous investigations have validated whether DWI could also be used to evaluate additional suspicious lesions detected using DCE-MRI in patients with breast cancer, thus revealing multifocality or multicentricity. Therefore, the purpose of this study was to investigate whether DWI adds value to pre-operative DCE-MRI in the evaluation of additional lesions in patients with breast cancer.

Materials and methods

This prospective study was approved by the institutional review board (IRB number: ED 14341). Since 2013, our hospital has offered DWI, in addition to DCE-MRI, to women scheduled for breast cancer surgery.

Study populations and breast lesions

Development cohort

Between July 2014 and February 2015, 96 consecutive women with newly diagnosed breast cancer underwent pre-operative DCE-MRI and DWI. Among them, we excluded women who (1) were not to undergo surgery in our institution (n = 13), (2) had motion artefacts (n = 5), and (3) had undergone excisional biopsy prior to MRI (n = 2). Ultimately, the development set consisted of 76 women (mean age: 54 years, range: 32–83 years) with 131 breast lesions (mean diameter: 1.7 cm, range: 0.5–7.0 cm), all of which had final pathology results available. Of the 131 breast lesions, 106 (80.9%) were malignant and 25 (19.1%) were benign. The apparent diffusion coefficient (ADC) of each lesion was measured, and a cut-off value for differentiation between malignant and benign lesions was calculated using receiver-operating characteristic (ROC) curve analysis. The ADC value from the development set was applied prospectively to a validation set.

Validation cohort

After an interval of 1 month, we prospectively enrolled patients with breast cancer who were to undergo pre-operative DCE-MRI and DWI between March and October 2015. In this way, we investigated the diagnostic performance of the combined protocol by combining the ADC cut-off value with DCE-MRI. Among 102 consecutive women with breast cancer who underwent pre-operative DCE-MRI, we excluded 25 who fulfilled the above-mentioned criteria. The remaining 77 women had 117 additional lesions—detected using pre-operative breast MRI. Of these 117 additional lesions, 10 (8.5%) were excluded from this study because they were not precisely found or biopsied. Ultimately, 107 additional lesions in 77 women (mean age: 48.8 years; range: 32–76 years) were included, and all lesions were verified by performing pathological analysis.

Magnetic resonance imaging

A 3.0-T system (Achieva 3.0 T TX; Philips Healthcare) with a dedicated, phased-array breast coil was used for DCE-MRI; patients were placed in the prone position. DWI was acquired in the transverse plane. A spin-echo, single-shot, echo-planar-imaging sequence, with diffusion-sensitising gradients, was applied along the orthogonal directions. The images thus obtained were used to synthesise isotopic transverse images [repetition time (ms)/echo time (ms): 12,500/58; flip angle 90°; SPAIR fat suppression; in-plane resolution 1.15 mm × 1.15 mm; 50 slices; b-values: 0, 600 and 1000 s/mm2; image matrix: 128 × 128; field of view: 320 × 320 mm; section thickness: 3 mm; section gap: 0 mm; SENSE factor = 2.3, two averages; three signals acquired; acquisition time: 80 s]. After DWI, T2-weighted, fast spin-echo, transverse images were obtained [repetition time (ms)/echo time (ms): 5727/70; flip angle: 90°; image matrix: 620 × 309; field of view: 581 × 342 mm; section thickness: 3 mm; section gap: 0 mm]. For DCE-MRI, one pre-contrast and six post-contrast dynamic series were performed using a three-dimensional, T1-weighted, fast spoiled gradient-echo sequence (6/3; flip angle: 10°: 100 slices: image matrix: 436 × 436; field of view: 330 × 340 mm; section thickness: 1.5 mm; no gap) immediately after and at 60, 120, 180, 240, and 300 s after the contrast injection.

Magnetic resonance image analysis

Two breast radiologists (S.E.S., E.K.P.) with 6 and 3 years of experience in breast radiology, respectively, prospectively assessed the characteristics of each lesion, reaching a consensus in each case. Their analysis was based on morphological and kinetic criteria and it used the breast imaging reporting and data system (BI-RADS) MRI atlas of the American College of Radiology [16]. In accordance with the BI-RADS atlas, the following descriptors were used to analyse the masses: shape, margin, and internal enhancement; similarly, the following descriptors were used to analyse non-mass enhancement (NME): distribution and internal enhancement pattern. The enhancement kinetic curve was evaluated using a computer-aided detection (CAD) system (CADstream software, version 5.2.8.591; Merge Healthcare, Chicago, IL, USA). The type of curve was defined in terms of the single most suspicious enhancement: type I (low persistent), type II (medium, fast persistent, or plateau), and type III (medium or fast washout). Based on previous positive predictive value (PPV) results suggesting malignancy on the basis of MRI findings [16,17,18], the final assessment category of each lesion was defined as follows: 3, probably benign enhancement; 4, suspicious; 5, highly suggestive of malignancy.

Diffusion-weighted imaging and ADC analysis

The ADC maps were constructed using a commercially available CAD system using b-values of 0 and 1000 s/mm2. When each additional breast lesion with a BI-RADS category of 3, 4, or 5 was detected on the colour map using the CAD system, the ADC map was placed beside the colour map. One breast radiologist (E.K.P.), with 3 years’ experience in breast radiology, manually outlined the regions of interest (ROIs) and measured the ADC value, using the CAD system to cover the entire tumour in its three largest cross-sections. The measurements were performed three times, and the average value was defined as the ADC of each lesion.

Combined analysis using DCE-MRI and DWI

To carry out the combined analysis of DCE-MRI and DWI, we used an algorithm adapted from the BI-RADS atlas [11, 12, 16], in which the ADC cut-off values acquired from the development set were applied to the validation set to estimate the likelihood of malignancy in any additional lesions with a given BI-RADS assessment category. For example, in the case of BI-RADS assessment category 3, a lower ADC cut-off value was required in the development set to upgrade a lesion to malignant status. In the case of BI-RADS assessment categories 4 or 5, a higher ADC cut-off value was required in the development set to downgrade a lesion to “probably benign” status.

Histopathological Analysis

To carry out histopathological diagnosis, image-guided core biopsies, or image-guided needle-localisation and excisional biopsies, were performed on all lesions. Pathological specimens were obtained within 1–2 weeks of the pre-operative MRI scans. In the case of breast-conserving surgery, specimens were prepared as serial 5-mm slices; mastectomy specimens were prepared as 5–10-mm slices [19]. To determine whether pathological features corresponded with the findings of breast imaging, pathologists, breast surgeons, and radiologists reviewed all specimens together. The pathologists were then asked to assess the additional suspicious lesions separately to evaluate multifocality and multicentricity.

Data analysis

In the development cohort, a receiver-operating characteristic (ROC) curve was constructed to determine the ADC cut-off value, which was used to differentiate benign from malignant lesions. In the validation cohort, the combined protocol, which considered both the ADC cut-off value and DCE-MRI, was evaluated. Breast lesions with an ADC value above the cut-off were reclassified as negative, and those with an ADC value below the cut-off were reclassified as positive. Sensitivity, specificity, PPV, negative predictive value (NPV), and accuracy of both DCE-MRI and the combined protocol were calculated; sensitivity, specificity, and accuracy were compared using McNemar’s test. In the validation cohort, ROC curve analysis was also performed. All statistical analyses were performed using statistical software (SPSS, version 20.0; SPSS Chicago, IL, USA), and a P-value < 0.05 denoted significance.

Results

DWI and ROC analysis in the development cohort

Of the 131 breast lesions in the development cohort, 106 (80.9%) were malignant and 25 (19.1%) were benign. Among the 106 malignant lesions, 92 (86.8%) were invasive carcinomas and 14 (13.2%) were ductal carcinomas in situ (DCISs). The mean ADC of the malignant lesions was 0.93 ± 0.16 × 10-3 mm2/s and that of benign lesions was 1.20 ± 0.26 × 10-3 mm2/s (Fig. 1). The ADC values were significantly lower in the malignant lesions than in the benign lesions (P < 0.01). The best ADC cut-off to differentiate benign from malignant lesions, as determined using ROC-curve analysis, was 1.11×10-3 mm2/s, which yielded a sensitivity of 87.0% and a specificity of 67.2%. The area under the ROC curve for differentiating benign from malignant lesions was 0.810 (95% confidence interval: 0.766–0.898).

Fig. 1
figure 1

ADC values from the development cohort. (a) Graph showing the mean ADC values of benign versus malignant breast lesions. The mean ADC value of the malignant lesions (0.93 ± 0.16 × 10-3 mm2/s) was significantly lower than those of benign lesions (1.20 ± 0.26 × 10-3 mm2/s, P <0.01). (b) The best ADC cut-off value to differentiate benign from malignant lesions, as determined using ROC curve analysis, was 1.11 × 10-3 mm2/s. The area under the ROC curve to differentiate benign from malignant lesions was 0.810

DCE-MRI in the validation cohort

Among the 107 additional lesions in the validation cohort, histopathological analysis revealed 70 malignant lesions (mean diameter: 1.0 cm, range: 0.5–4.2 cm) and 37 benign lesions (mean diameter: 0.9 cm, range: 0.5–2.3 cm). The malignant lesions comprised 61 invasive cancers (50 invasive ductal carcinomas, 6 invasive micropapillary carcinomas, two invasive medullary carcinomas, two invasive cribriform carcinomas, and one invasive mucinous carcinoma) and 9 DCISs. The benign lesions comprised 26 cases of fibrocystic change, three fibroadenomas, three cases of sclerosing adenosis, three cases of usual ductal hyperplasia, and two benign intraductal papillomas.

There were 95 masses (mean diameter: 0.9 cm, range: 0.5–2.4 cm) and 12 NME lesions (mean: 1.7 cm, range: 0.7–4.2 cm). Eight lesions (7.5%) were assigned the BI-RADS final assessment category 3; 73 lesions (68.2%) were defined as category 4, and 26 lesions (24.3%) were placed in category 5 (Table 1). All benign lesions that were not excised but biopsied showed stability after a median 15-month follow-up period. DCE-MRI showed 98.6% sensitivity, 18.9% specificity, 69.6% PPV, 87.5% NPV, and 61.7% diagnostic accuracy (Table 2).

Table 1 Lesion characteristics of 107 additional lesions in the validation cohort
Table 2 Diagnostic performance of combined DCE-MRI and MRI protocol in 107 additional lesions

Combined analysis using DCE-MRI and DWI in the validation set

When DWI was applied to DCE-MRI using the best ADC cut-off value (1.11 × 10-3 mm2/s), as derived from the development cohort, 21 false-positive DCE-MRI findings with BI-RADS category 4 became true-negative findings (Fig. 2; Table 3) and 3 true-negative DCE-MRI findings with BI-RADS category 3 became false-positive findings. With application of DWI, 3 of 26 lesions with BI-RADS category 5 and 4 of 43 lesions with BI-RADS category 4 in DCE-MRI were downgraded to probably benign status and became false-negative findings (Table 4). Specifically, the combined MRI protocol conferred 90.0% sensitivity, 82.2% specificity, 92.0% PPV, 78.1% NPV, and 82.2% diagnostic accuracy. The addition of DWI to DCE-MRI conferred a 63.3% increase in the specificity (P < 0.001) and a 20.5% increase in the diagnostic accuracy (P = 0.05), without significantly decreasing the sensitivity (P = 0.07).

Fig. 2
figure 2

A 47-year-old female with left-breast cancer. (a) Contrast-enhanced, fat-suppressed, T1-weighted, axial MR image showing an irregular-shaped and marginated mass with heterogeneous enhancement in the left upper outer breast. (b) A regional non-mass enhancement 2.2 cm in diameter in the lower portion of the same quadrant as known cancer. (c) Colour map created using CAD showing the same non-mass enhancement with a predominantly blue overlay, indicating that the kinetic pattern was persistent. (d) To measure the ADC value, the regions of interest are manually outlined, ensuring that the entire tumour is covered in the three largest cross-sections. (e) The average ADC is 1.33 × 10-3 mm2/s, and the lesion appears as fibrocystic change after excision

Table 3 Reduction of false-positive cases by combining the ADC value of diffusion-weighted imaging and DCE-MRI
Table 4 False-negative cases by combining the ADC value of diffusion-weighted imaging and DCE-MRI

Malignancy rate and performance in multifocal and multicentric breast cancer

Of the 107 additional lesions, 60 (56.0%) were located in the same quadrant of the affected breast; 47 (44.0%) were located in a different quadrant. Of the 60 lesions in the same quadrant, 52 (86.6%) were malignant (47 invasive cancers, 5 DCISs). Of the 47 lesions in a different quadrant, 18 (48.6%) were malignant (14 invasive cancers, 4 DCISs; Fig. 3). The malignancy rate of the multifocal lesions was 86.6%, while that of the multicentric additional lesions was 48.6%. In the case of lesions in the same quadrant, DCE-MRI showed 100% sensitivity, 21.4% specificity, 82.5% PPV, 100% NPV, and 83.3% overall accuracy. The combined protocol conferred 90.4% sensitivity, 78.5% specificity, 94.0% PPV, 68.7% NPV, and 87.8% overall accuracy; the specificity was significantly higher in the combined protocol (P = 0.008). However, NPV decreased from 100% to 68.7%. Five of 16 (31.3%) multifocal lesions were missed by the use of the combined protocol (Table 4). In the case of lesions in a different quadrant, DCE-MRI showed 94.4% sensitivity, 17.3% specificity, 47.2% PPV, 80% NPV, and 51.2% overall accuracy. The combined protocol conferred 88.8% sensitivity, 60.8% specificity, 64.0% PPV, 87.5% NPV, and 73.1% overall accuracy; the specificity was significantly higher in the combined protocol (P = 0.021).

Fig. 3
figure 3

A 46-year-old female with right-breast cancer (a) Contrast-enhanced, fat-suppressed, T1-weighted axial MR image showing an irregular-shaped, spiculated-marginated mass with heterogeneous enhancement in the right upper outer breast. (b) Oval, circumscribed, heterogeneously enhancing mass, 0.9 cm in diameter, is seen in the right upper inner breast. (c) Colour map created using CAD showing the mass with a predominantly red overlay and indicating the washout kinetic pattern. (d) To measure the ADC value, the regions of interest are manually outlined, ensuring that the entire tumour is covered in the three largest cross-sections. (e) The average ADC is 0.94 × 10-3 mm2/s, and the lesion is revealed as an invasive ductal carcinoma after excision

Comparison of performance according to lesion size

Of the 107 lesions, 41 (38.3%) were large (≥1 cm in diameter) and 66 (61.7%) were small (<1 cm in diameter). In the case of large lesions, DCE-MRI showed 100% sensitivity, 16.6% specificity, 74.3% PPV, 100% NPV, and 75.6% overall accuracy. The combined protocol conferred 82.7% sensitivity, 91.6% specificity, 96.0% PPV, 68.7% NPV, and 85.4% overall accuracy; the specificity was significantly higher in the combined protocol (P = 0.004). However, NPV decreased from 100% to 68.7%. Five of 16 (31.3%) large lesions were missed precisely because DWI was applied (Table 4). In the case of small lesions, DCE-MRI showed 97.6% sensitivity, 20.0% specificity, 66.6% PPV, 83.3% NPV, and 68.1% overall accuracy. The combined protocol showed 95.1% sensitivity, 56.0% specificity, 78.0% PPV, 87.5% NPV, and 80.3% overall accuracy; the specificity was significantly higher in the combined protocol (P = 0.035).

Discussion

Not only has breast MRI improved cancer detection, but it has also yielded substantial numbers of unpredictable lesions with low PPV [5, 6, 20]. Several researchers have suggested combining DWI with DCE-MRI to help distinguish malignant lesions from benign lesions and to reduce the number of false positives [11,12,13,14,15]. Furthermore, some recent studies have reported improved specificity in the characterisation of breast tumours—from 13.5% to 89% [21] and from 22% to 78% [11]—when the ADC cut-off was applied to the DCE-MRI classification. Spick et al. [15] also reported that applying DWI to suspicious lesions visible on DCE-MRI may prevent unnecessary biopsies in up to 34.5% of cases. However, to our knowledge, the present study was the first to evaluate the usefulness of DWI in pre-operative breast cancer staging. Overall diagnostic accuracy and specificity were significantly improved in our combined protocol, without significant loss of sensitivity, when additional lesions that showed an ADC value of <1.11×10-3 mm2/s were considered malignant. Recommended ADC cut-off values for differentiation between benign and malignant breast lesions varied from 0.90 to 1.76 × 10−3 mm2/s [22]. Another meta-analysis of 12 articles using a b value of 0, 1000 s/mm2 on 1.5 T recommended an ADC cut-off value of 1.23 × 10−3 mm2/s [23]. Cakir et al. [24] using a b value of 0, 1000 s/mm2 on 3.0-T suggested an ADC value of 1.12 × 10−3 mm2/s with diagnostic accuracy of 82%, which was similar to our ADC value of 1.11 × 10−3 mm2/s and diagnostic accuracy of 82.2%. The main strength of our study was that the results were not derived from a single cohort; instead, they were validated using another cohort. Moreover, the cut-off values in both cohorts were very similar (1.11×10-3 mm2/s from the development cohort and 1.05×10-3 mm2/s from the validation cohort). In the present study, DCE-MRI led to 30 false-positive findings; the combined protocol helped to reduce this number to only 12 false-positive findings. Therefore, if DWI is combined with pre-operative MRI when breast cancer patients show additional lesions, unnecessary biopsies or extensive surgery could be avoided.

Patients with multifocal or multicentric cancers should be treated with a mastectomy rather than with wide excisions because there is a higher risk of local recurrence and because a worse cosmetic outcome is expected [25]. In one study, treatment was converted from breast conserving therapy to mastectomy in 8.1% of cases because of multifocality or multicentricity [8]. For this reason, histopathological confirmation of additional lesions should be performed in patients with breast cancer. In a meta-analysis by Houssami et al. [8], DCE-MRI of additional lesions in patients with breast cancer had a PPV of 66%. In the present study, the PPV of DCE-MRI alone was higher (82.5%) in the case of multifocal lesions, but lower (47.2%) in the case of multicentric lesions. In the combined protocol, both the PPV (47.2% to 64.0%) and NPV (80.0% to 87.5%) had increased in the case of multicentric lesions; however, in the case of multifocal cancers, the NPV had decreased from 100% to 68.7%, while the PPV had increased from 82.5% to 94.0%. Five of 16 (31.3%) multifocal lesions were missed by the use of the combined protocol. Considering the malignancy rates of multifocal and multicentric lesions in the present study were 86.6% and 48.6%, respectively, DWI could be used alongside pre-operative MRI to inform decisions about whether multicentric lesions should be excised or biopsied; however, the same does not apply in the case of multifocal lesions.

We further compared the diagnostic performance of the protocols according to lesion size. The combined protocol consistently increased the PPV, regardless of lesion size, although the PPV of small lesions was lower than that of large lesions (78.0% vs. 96.0%). Another important finding in the present study was that the NPV of larger lesions (≥1 cm in diameter) had decreased from 100% when using DCE-MRI to 68.7% when using the combined protocol, while the NPV of small lesions (<1 cm in diameter) had increased from 83.3% to 87.5%. In the present study, 5 of 16 (31.3%) large lesions were missed precisely because DWI was applied. Therefore, when applying the ADC value to DCE-MRI, physicians should pay close attention to lesions ≥1 cm in diameter to avoid missing cancers on pre-operative breast MRI.

There were several limitations in this study. First, lesions that were probably benign, but that lacked pathological diagnosis, were excluded, as were several additional lesions that were detected using MRI, but were not found or biopsied. This may have led to selection bias. Second, the number of benign lesions within the total sample size was small. Third, the ADC values of benign lesions in our study seem to be low compared with those in the published literature. Methods for selecting a suitable ROI within a lesion for measuring of the ADC values are varied and rather subjective. So, we used a CAD system. The reason for low ADC values in our study was assumed to be that we measured the ADC values by outlining the ROIs to cover the entire tumour with a colour map above the set enhancement threshold of a CAD. It enabled measuring the lesions without including surrounding breast tissues, resulting in low ADC values. Fourth, the DWI acquisition resolution was 1.15 m ×1.15 mm, and the smallest lesions in our study were 5-6 mm in diameter; thus we measured the ADC values on the basis of 4-5 acquisition pixels, which was a risky undertaking. Fifth, we did not assess inter- or intra-observer agreement with regard to ADC measurement.

In conclusion, MRI staging leads to unnecessarily extensive breast surgery in a considerable proportion of patients because it allows the detection of additional lesions. Therefore, it is necessary to reduce the number of false-positive additional lesions detected using pre-operative MRI. Our current findings indicate that, with the implementation of DWI to additional lesions in patients with breast cancer, unnecessary biopsies or extensive surgery could be avoided. However, when we applied the ADC value to DCE-MRI, we found that, to avoid missing cancers, physicians should pay close attention to additional lesions located in the same quadrant as the confirmed cancer, as well as to those >1 cm in diameter, because such lesions have reduced NPVs and because the malignancy rate of lesions in the same quadrant as the original cancer is 86.6%.