Introduction

Contrast-enhanced MRI (CE-MRI) is a well-established imaging modality for the detection, characterization and staging of breast tumours [1]. The sensitivity of CE-MRI for the depiction of breast carcinoma has been reported to be in the range of 89 – 100 %. Although CE-MRI is extremely sensitive, specificity is limited, ranging from 67 % to 80 %, which leads to additional work-up and biopsies of benign lesions [1]. Several studies have investigated the use of higher magnet field strengths (3 T) and high-resolution CE-MRI protocols, which allow a more detailed depiction of lesion morphology and enhancement kinetics, and have demonstrated an improvement in diagnostic accuracy [26].

Recently, 18F-FDG PET/CT has emerged as a valuable tool in breast imaging, particularly when examinations are performed in the prone position [710]. 18F-FDG PET/CT has been shown to be a valuable technique, with sensitivities ranging from 80 % to 87 % and specificities ranging from 73 % to 100 % [10, 11]. Furthermore, it seems that prone 18F-FDG PET/CT is more specific than CE-MRI. However, a direct comparison with state-of-the-art CE-MRI at 3 T has not yet been performed. If 18F-FDG PET/CT achieves results comparable to those with CE-MRI at 3 T, it could then be used as a substitute in those patients who are not candidates for MRI [12] and vice versa.

We hypothesized that 18F-FDG PET/CT could achieve results that are at least comparable to those with CE-MRI at 3 T. The aim of this study was threefold:

  1. 1.

    To compare the diagnostic accuracy of prone 18F-FDG PET/CT with that of CE-MRI at 3 T in breast lesions

  2. 2.

    To evaluate whether the diagnostic accuracy is influenced by lesion size

  3. 3.

    To determine whether quantitative assessment of PET data using maximum standardized uptake values (SUVMAX) is helpful in the characterization of malignant and benign breast lesions, compared to qualitative visual assessment.

Materials and methods

Patients

Of 198 consecutive patients, 172 (age range 18 – 87 years, mean 57 years) who fulfilled the following inclusion criteria were enrolled from September 2009 to May 2013 in this prospective, institutional review board-approved study: age >18 years, suspicious findings on mammography or breast ultrasonography (BI-RADS® category 0, 4 and 5), not pregnant or breastfeeding, no previous treatment (e.g. breast biopsy, neoadjuvant chemotherapy), and no contraindications to MRI or MRI contrast agents. The remaining 28 patients were excluded because of incomplete examinations or prior treatment. Written informed consent was obtained from all patients before the examinations. Regardless of the results of either prone 18F-FDG PET/CT or CE-MRI at 3 T, histopathological verification of all suspicious lesions was performed and was used as the gold standard. The initial BI-RADS® categories of lesions before the study examinations were: 0 for 18 lesions, 4 for 63 lesions, and 5 for 91 lesions.

Imaging

All patients underwent prone 18F-FDG PET/CT and CE-MRI of the breast at 3 T. Examinations were scheduled no longer than 7 days apart (same day in 60 patients, 1 day in 28, 2 days in 13, 3 days in 8, 4 days in 11, 5 days in 7, 6 days in 4, 7 days in 41; range 0 – 7 days, mean 3 days).

18F-FDG PET/CT

Prone 18F-FDG PET/CT of the breast was performed using a combined whole-body PET/CT in-line system (Biograph 64 TruePoint®; Siemens, Erlangen, Germany) equipped with a high-resolution PET scanner and a 64-row detector CT scanner. Patients were required to fast for at least 5 h before injection of approximately 200 – 350 MBq 18F-FDG based on body weight and to have a blood glucose level <150 mg/dl (8.3 mmol/l). Scanning was started 60 min after intravenous injection of 18F-FDG as a bolus. Initially, a prone low-dose CT scan without contrast agent from the base of the skull to the upper abdominal region was acquired for attenuation correction. To enable the same patient geometry as with CE-MRI a commercially available cushion (GE Healthcare) for supporting a patient in a prone position during the PET scan was used. Subsequently, a high-resolution prone PET dataset over the same region was recorded. PET images were reconstructed using the iterative TrueX algorithm (Siemens, Erlangen, Germany). This algorithm incorporates a particular correction for the point-spread function in addition to usually used correction factors [13, 14]. Four iterations per 21 subsets were used with a matrix size of 168 × 168, a transaxial FOV of 605 mm (pixel size 3.6 mm) and a section thickness of 5 mm.

CE-MRI

All patients were examined with a 3 T MRI scanner (Tim Trio®; Siemens, Erlangen, Germany) in the prone position using a dedicated, four-channel breast coil (InVivo®; Orlando, FL). In premenopausal women, the MRI examination was performed during the second week of the menstrual cycle to reduce breast parenchymal background enhancement [1]. A T2-weighted turbo inversion recovery magnitude (TIRM) sequence with fat suppression was used (TR/TE 4,800/59 ms, FOV 340 mm, 48 slices at SI 3 mm, flip angle 120°, matrix 384 × 512, acquisition time 2 min 35 s) together with a split dynamics, contrast-enhanced MRI protocol [4] with the following parameters: T1-weighted volume-interpolated breathhold examination (VIBE) sequences (TR/TE 3.61/1.4 ms, FOV 320 mm, 72 slices, 1.7 mm isotropic, matrix 192 × 192, one average, 13.2 s per volume), and T1-weighted turbo fast-low-angle shot (FLASH) 3D sequences with selective water excitation (TR/TE 877/3.82 ms, FOV 320 mm, 96 slices, 1 mm isotropic, matrix 320 × 134, one average, 2 min). The total acquisition time was approximately 15 min 20 s.

The contrast agent used was gadoterate meglumine (Dotarem®; Guerbet, France), which was administered intravenously as a bolus (0.1 mmol/kg body weight) using a power injector (Spectris Solaris EP®; Medrad, Pittsburgh, PA).

Data analysis

Two experienced nuclear medicine specialists (reader 1 with 7 years experience, reader 2 with 15 years experience) assessed the 18F-FDG PET/CT data independently. Two experienced breast radiologists (reader 3 with 7 years experience, reader 4 with 11 years experience) evaluated the CE-MRI data independently. The readers were aware of the inclusion criteria, but were not provided with clinical histories or previous imaging findings. Lesions were assessed on a patient-by-patient basis. In patients with multiple lesions, the readers were asked to assess the most suspicious lesion. Lesion locations were recorded clockwise in quadrants. After data analysis, the recorded lesion locations were compared to identify potential lesion mismatches.

18F-FDG PET/CT

Two readers qualitatively evaluated all prone 18F-FDG PET/CT images visually for increased tracer uptake by breast lesions. Lesions with focal uptake above background activity were classified as malignant. If no focal 18F-FDG uptake was apparent, 18F-FDG PET/CT was considered negative for malignancy. Fatty and normal fibroglandular tissue within the breast served as background tissue. If there was disagreement between the readers, a consensus decision was reached and used for statistical analysis. SUVMAX was calculated, but no threshold for malignancy was applied [15, 16]. To measure SUVMAX, a region of interest was drawn over the area of focal 18F-FDG uptake by both readers. If no focal 18F-FDG uptake was apparent, the lesion was localized with CE-MRI and a corresponding region of interest was placed on the PET images.

CE-MRI

Lesions were classified as masses or non-mass-enhancing (NME) lesions. Criteria for distinguishing benign from malignant contrast-enhancing lesions were based on the morphology and enhancement kinetics defined by the revised American College of Radiology (ACR) BI-RADS® atlas [17]. The probability of malignancy was determined by assigning a final BI-RADS® assessment category [17]. If findings were discordant between readers, a consensus decision was reached and used for statistical analysis. For the assessment of the enhancement characteristics of masses, an automated, semiquantitative, curve-type analysis was performed using dedicated software (Syngo BreVis; Siemens, Erlangen, Germany). For NME lesions, kinetics were not considered [18, 19]. The size of the index lesion was assessed using the largest diameter in the axial plane on CE-MRI.

Histopathology

The histopathological result was considered the final diagnosis in all patients. Using image-guided needle biopsy, surgical biopsy, mastectomy or lumpectomy, the histopathological diagnosis was established by one pathologist (M.R., with >20 years experience). If both the image-guided needle biopsy and imaging findings were benign, the final diagnosis was considered benign (33 patients). If the findings were discordant between image-guided needle biopsy and imaging, the final diagnosis was established by surgery (7 patients). If a lesion was high risk or had an uncertain potential for malignancy on image-guided needle biopsy, the final diagnosis was established by surgery (7 patients) [20]. All patients with malignant lesions underwent surgery after image-guided needle biopsy (132 patients).

Statistical methods

Prior to data analysis, the recorded lesion locations were compared for potential lesion mismatch, and no mismatches were found. Statistical tests were performed by a statistician (M.W., 14 years of experience) using SPSS 19.0 and CIA version 2.2.0. For statistical analysis, the assigned BI-RADS® assessment categories for CE-MRI data were dichotomized. BI-RADS® 2 and 3 were considered benign, and 4 and 5 malignant. Sensitivity, specificity, diagnostic accuracy, positive predictive value and negative predictive value for 18F-FDG PET/CT and CE-MRI were calculated. Differences in sensitivity, specificity and diagnostic accuracy were assessed by McNemar and Cochran Q tests. Receiver operating characteristic (ROC) curves were calculated, the area under the curve was determined, and the significance of differences was assessed using the DeLong method. SUVMAX was evaluated for its potential to differentiate benign from malignant lesions using ROC analysis and Welch-corrected t tests. Interreader variability for diagnosis with 18F-FDG PET/CT and CE-MRI was assessed using κ-statistics. A p value ≤0.05 was considered significant.

Results

The mean size of the 172 breast tumours was 26.02 mm (range 5 – 97 mm). Of these 172 lesions, 40 (23 %) were benign and 132 (77 %) were malignant. Twenty three lesions (13 %) were <10 mm, of which 9 were malignant and 14 were benign. Histopathological diagnoses are summarized in Table 1.

Table 1 Histopathological results of all breast lesions

The diagnostic values for 18F-FDG PET/CT and CE-MRI at 3 T for all lesions are summarized in Table 2, and for lesions <10 mm in Table 3. There were no significant differences in sensitivity, specificity or diagnostic accuracy between 18F-FDG PET/CT and CE-MRI at 3 T for all lesions (Fig. 1, Table 2) or for lesions <10 mm (Table 3).

Table 2 Sensitivity, specificity, positive predictive value, negative predictive value, diagnostic accuracy and area under the curve of 18F-FDG PET/CT and CE-MRI together with p values and 95 % confidence intervals (in parentheses) for all 172 lesions
Table 3 Sensitivity, specificity, positive predictive value, negative predictive value, diagnostic accuracy and area under the curve of 18F-FDG PET/CT and CE-MRI together with p values and 95 % confidence intervals (in parentheses) for 23 lesions <10 mm
Fig. 1
figure 1

ROC analysis shows that the diagnostic value of 18F-FDG PET/CT is not significantly different from that of CE-MRI at 3 T

18F-FDG PET/CT classified 136 lesions as malignant (Fig. 2) and 36 lesions as benign (Figs. 3 and 4). Diagnostic accuracy was influenced by lesion size and decreased from 93 % to 91 % in lesions <10 mm. There was excellent interreader agreement for 18F-FDG PET/CT (κ = 0.85).

Fig. 2
figure 2

Invasive ductal carcinoma (high-grade) laterally in the left breast in a 40-year-old woman. a, b The round, irregularly marginated mass lesion (a) shows a heterogeneous initial strong enhancement followed by a washout phase (b), and was classified by CE-MRI as BI-RADS® 5 (suspicious). c On the 18F-FDG PET/CT image, the lesion is strongly 18F-FDG-avid, with an SUVMAX of 16.83. The lesion was true-positive by both modalities

Fig. 3
figure 3

Invasive lobular carcinoma (intermediate grade) laterally in the left breast in a 53-year-old woman. a, b The irregularly shaped and marginated mass lesion (a) shows a heterogeneous medium initial enhancement followed by a plateau (b), and was classified by CE-MRI as BI-RADS® 4 (suspicious). c On the 18F-FDG PET/CT image, the mass is not significantly 18F-FDG-avid, with an SUVMAX of 1.9. The lesion was a true-positive finding by DCE-MRI at 3 T and false-negative by 18F-FDG PET/CT

Fig. 4
figure 4

A high-risk retroareolar lesion (atypical ductal hyperplasia) medially in the right breast in a 32-year-old woman. a, b The irregularly shaped and spiculated mass lesion (a) shows persistent enhancement (b), and was classified by CE-MRI as BI-RADS® 4 (suspicious). c On the 18F-FDG PET/CT image, the mass is not significantly 18F-FDG-avid, with an SUVMAX of 2.8. The lesion was true-negative by 18F-FDG PET/CT and false-positive by CE-MRI at 3 T

On 18F-FDG PET/CT, SUVMAX ranged from 1.03 to 6.52 in benign lesions (mean SUVMAX 2.49, SD 1.13) and from 1.1 to 36.02 in malignant lesions (mean SUVMAX 6.11, SD 5.9). There was a significant difference in SUVMAX between malignant and benign tumours (p < 0.001; Fig. 5). However, quantitative assessment using an SUVMAX threshold was not helpful for differentiating benign from malignant breast tumours (Table 4). With 18F-FDG PET/CT, there were eight false-positives (FP) and four false-negatives (FN) (Table 5).

Fig. 5
figure 5

Distribution of SUVMAX values for benign and malignant breast lesions. Quantitative assessment using an SUVMAX threshold was not helpful for differentiating benign from malignant breast tumours

Table 4 Sensitivity and false-positive rate of SUVMAX thresholds for differentiating benign from malignant breast tumours using 18F-FDG PET/CT
Table 5 False-positive and false-negative lesions on 18F-FDG PET/CT and contrast-enhanced MRI at 3 T on the basis of histopathological diagnosis

On CE-MRI, there were 152 enhancing masses (mean size 24.7 mm, range 5 – 94 mm) and 20 NME lesions (mean size 35.9 mm, range 3 – 97 mm). CE-MRI classified 144 lesions as malignant (Figs. 2, 3, and 4) and 28 lesions as benign.

Of 30 benign mass lesions, 14 (47 %) showed a type 1 curve, 13 (43 %) a type 2 curve, and 3 (10 %) a type 3 curve. Of 122 malignant mass lesions, 5 (4 %) showed a type 1 curve, 42 (34 %) a type 2 curve, and 75 (62 %) a type 3 curve. There was excellent inter-reader agreement for CE-MRI at 3 T (κ = 0.92). With CE-MRI there were 12 FP (Fig. 4) and no FN (Table 5).

Discussion

Our results indicate that both 18F-FDG PET/CT and CE-MRI at 3 T provide equally high diagnostic accuracy with excellent interreader agreement. Although the results were influenced by lesion size, there were no significant differences in diagnostic accuracy between 18F-FDG PET/CT and CE-MRI. Our results demonstrate that quantitative assessment of 18F-FDG PET/CT using SUVMAX was not helpful in differentiating benign from malignant lesions. To date, few studies have investigated the ability of 18F-FDG PET or 18F-FDG PET/CT to differentiate malignant from benign breast lesions, and have shown lower diagnostic accuracies. The divergent results compared to this study may be explained by differences in study design and hardware, such as patient positioning (prone vs. supine), qualitative visual versus quantitative interpretation of data, and the use of different equipment (PET vs. PET/CT vs. dedicated breast PET scanners) [11, 2123]. Adequate patient positioning is of importance for any imaging examination. It has been demonstrated that prone rather than supine patient position during 18F-FDG PET or 18F-FDG PET/CT examination improves breast cancer detection due to better delineation of lesions by expansion of the breast parenchyma, as well as higher influx of 18F-FDG to the breast [79, 24, 25]. Previous studies have investigated 18F-FDG PET and 18F-FDG PET/CT in breast cancer patients and shown lower sensitivities (approximately 90 %) using supine patient positioning [26, 27]. In this study, we used prone patient positioning and a state-of-the-art combined PET/CT scanner, which explains the higher sensitivity and diagnostic accuracy achieved.

The approach to the interpretation of imaging data also influences diagnostic accuracy. Studies have investigated the quantitative assessment of 18F-FDG PET/CT data using SUVMAX thresholds in breast cancer diagnosis and have shown lower diagnostic accuracies of about 85 % [22, 23]. The results of our study confirm that quantitative data interpretation using an SUVMAX threshold does not help in differentiating benign from malignant tumours, compared to qualitative visual assessment and thus should be omitted. The qualitative visual assessment approach yielded diagnostic accuracies of 93 % for all lesions and 91 % for lesions <10 mm. Thus, the data support the notion that 18F-FDG PET/CT is a valuable tool in detecting breast cancer in patients with BI-RADS® 4, 5 or 0 lesions.

In this study, we employed the above-mentioned strategies and used state-of the art equipment, which contributed to the excellent results of 18F-FDG PET/CT of the breast. However, due to the limited spatial resolution of 18F-FDG PET/CT, its diagnostic value was lower in lesions <10 mm [10, 11, 22, 27]. Whereas with CE-MRI the excellent sensitivity of 100 % was sustained in lesions <10 mm, the sensitivity of 18F-FDG PET/CT was reduced to 78 %, indicating that smaller lesions may be missed when not highly metabolically active. However, due to excellent specificity, the diagnostic accuracies of 18F-FDG PET/CT and CE-MRI were equal in lesions <10 mm.

18F-FDG PET/CT seems to be as useful as CE-MRI at 3 T for the assessment of breast tumours, but has some disadvantages, which limit its overall application in breast imaging. One inherent limitation is the whole-body radiation exposure [28]. Nevertheless, in patients with a suspicious finding, the modest radiation exposure seems justified. Due to the necessary delay of at least 45 min after tracer administration [29], 18F-FDG PET/CT is more time-consuming than MRI of the breast by approximately 20 min [4]. The imaging accuracy of 18F-FDG PET/CT is affected by tumour histology. Diffuse or slowly growing cancers, such as lobular carcinoma, or low-grade ductal carcinoma in situ (DCIS), can be missed with 18F-FDG PET/CT [10], but this also applies to CE-MRI, and the performance of the two modalities in these lesions seems to be equal. In addition, it has been demonstrated that the performance of 18F-FDG PET/CT in these subtypes can be improved by dual time-point acquisition [27]. An advantage of 18F-FDG PET/CT is that, in patients with a high level of suspicion for breast malignancy, lymph node, or even distant metastases, an additional supine whole-body 18F-FDG PET/CT scan for staging can be performed in a single session. It has been demonstrated that such a combined prone and supine whole-body 18F-FDG PET/CT scan may influence further therapeutic decisions by detecting a greater number of distant metastases than conventional staging algorithms [24]. The radiotracer used in the current study was 18F-FDG. It has been demonstrated that 18F-FDG has good sensitivity but limited specificity, as several types of benign breast diseases can be 18FDG-avid and mimic malignancy [10]. In this study the majority of FP findings were metabolically active fibroadenomas/fibroadenomatous hyperplasia together with a high-risk lesion (atypical ductal hyperplasia), mastitis and a scar.

CE-MRI of the breast is an established imaging modality routinely used for the assessment of breast tumours, with reported excellent sensitivities (98 – 100 %) and good specificities (72 – 94 %) [1]. In the current study, CE-MRI at 3 T achieved similar results with excellent diagnostic accuracy (93 %). With the use of a high-resolution MR imaging protocol at 3 T, no cancers were missed, but there were 12 FP lesions, which consisted of five NME lesions and seven mass lesions, and which exhibited suspicious morphological and/or kinetic features. These findings are in good agreement with previously reported data showing that there are limits to what is achievable with CE-MRI alone [2, 19, 3032]. Several studies have demonstrated that, with the use of additional MR parameters, such as diffusion-weighted MR imaging or proton MR spectroscopy, an increase in diagnostic accuracy is achievable [2, 19]. However, in patients with certain contraindications to MRI [12] who cannot benefit from multiparametric MRI, imaging modalities other than MRI would be of more value. The results of our study show that 18F-FDG PET/CT may be such a modality in patients who are not candidates for MRI.

The current study had some limitations. The study included only a small number of patients with DCIS or invasive lobular carcinoma (ILC). Therefore, the findings regarding the performance of both 18F-FDG PET/CT and CE-MRI in these subgroups should be regarded with caution. Nevertheless, the results in DCIS [33] and ILC [34] are in good agreement with previously published data. In addition, this was a prospective study enrolling consecutive patients with suspicious breast lesions, and therefore disease prevalence was high, and does not reflect the prevalence in a screening population. However, in this setting the distribution of malignant histopathological subtypes was as expected indicating that our patients were a representative patient population. Furthermore, the aim of the study was not to determine the indications for prone breast PET/CT, but merely to assess its diagnostic accuracy and the factors that influence accuracy. Further studies are necessary to elucidate the potential application of prone breast PET/CT.

In conclusion, both 18F-FDG PET/CT and CE-MRI at 3 T are useful in the assessment of breast tumours. Both imaging modalities achieve a high diagnostic accuracy. In breast lesions <10 mm, the diagnostic accuracy of both modalities deteriorates. In contrast to visual PET/CT assessment, SUVMAX is not helpful in differentiating malignant from benign breast lesions. 18F-FDG PET/CT can be considered an alternative imaging modality in patients who are not candidates for CE-MRI.