Introduction

Breast cancer is the second leading cause of death among women in the USA (https://www.cancer.net/cancer-types/breast-cancer/statistics). Multiple randomized controlled trials and observational studies have shown that breast screening reduces the mortality rates of breast cancer by 30% or more [1,2,3,4,5]. Screening of dense breasts remains a major challenge despite the implementation of full-field digital mammography (FFDM) screening programs since 2005 [6]. Up to 15–30% of cancers are not identified at standard screening [7], and an even higher percentage of cancers are undetected in women younger than 50 years and in women with dense breasts [8, 9]. Because FFDM is a two-dimensional (2D) rendering of the three-dimensional breast, the resultant overlapping of fibroglandular densities may either mimic a tumor (false positive) or may mask one (false negative). The reduced sensitivity of 2D mammography with its associated false-positive recalls and negative biopsies, as well as its inherent radiation, have raised criticism [10, 11]. Whether breast density increases the risk of breast cancer by its masking effect or whether it is a primary risk factor in itself [12,13,14], overcoming the challenge of imaging dense breasts is a main factor in developing imaging technologies to improve diagnostic performance. The evolution of a three-dimensional derivative of digital mammography, known as digital breast tomosynthesis (DBT) has been a major advance in addressing this issue.

Tomosynthesis Technique

Digital breast tomosynthesis is a technique where image acquisition occurs as an x-ray tube moves in an arc over a limited angle (15–50°), creating multiple low dose projection images of the stationary compressed breast. These images are subsequently reconstructed into thin slices, usually 1 mm in thickness, allowing the radiologist to scroll through the breast to evaluate each plane without superimposition of adjacent structures [15, 16].

Historical Overview

Tomosynthesis as a concept was initially established in the 1930s [17]. Miller et al. again investigated tomosynthesis in the 1970s [18]; however, due to the high radiation doses required, tomographic techniques were not implemented clinically at that time. With the development of digital detectors in the 1990s, the interest in tomosynthesis resurfaced. In 1997, Nikalson et al. introduced digital breast tomosynthesis, which in a reader study of mastectomy specimens, revealed improved depiction of lesions over 2D mammograms [19]. As interest grew for this new technology, many different small reader studies were performed in the 2000s, supporting its potential as a breast imaging modality with variable reports of decreased recall rates and/or increased lesion detection [20,21,22,23].

In 2011, the Food and Drug Administration (FDA) approved DBT for its use in clinical practice. Since then, multiple studies have been performed to assess its value in both the diagnostic and screening settings.

DBT in the Screening Setting

Once DBT was approved by the FDA, it was immediately incorporated in the clinical setting at many breast centers in the USA; first experience publications started to appear in the literature by 2013. However, in Europe, large prospective studies were done, looking at the value of DBT in the screening population. The three largest trials were the Oslo trial, the STORM (Screening with Tomosynthesis OR standard Mammography) trial, and the Malmö Breast Tomosynthesis Screening trial [24••, 25•, 26•]. The Oslo trial and the STORM trial compared 2D FFDM alone with 2D FFDM combined with DBT. They found that interpreting mammography using standard 2D FFDM in combination with DBT increased their breast cancer detection rates, and reduced their false-positive rates. Cancer detection rates in the Oslo trial were 6.1 per 1000 examinations for mammography alone, and 8.0 per 1000 examinations for mammography plus tomosynthesis (27% increase) [24••]. In the STORM trial, cancer detection rates were 5.3 cancers per 1000 screens for 2D only, and 8.1 cancers per 1000 screens for integrated 2D and 3D screening. The incremental cancer detection rate attributable for integrated 2D and 3D mammography was 2.7 cancers per 1000 screens [25•].

The Malmö trial uses a different approach, assessing the performance of single view DBT as a stand-alone in comparison to 2D FFDM. The authors plan to accrue 15,000 women, but an exploratory analysis of the first 7500 women showed an increase in detection rate of 43%, with 6.3 cancers per 1000 screens detected with 2D FFDM, and 8.9 cancers per 1000 screens with single view DBT. The recall rate increased from 2.6% with 2D-FFDM to 3.8% with DBT (still well below US standards), which the authors attributed to an increase in detection of stellate distortions, some of which were mammographically occult invasive cancers, while others were radial scars or post-operative scars. However, they noted a downward trend of the recall rate over the first 1.5 years of the trial, implying improvement with increased experience [26•].

The first large retrospective analysis in the USA was published by Freidewald et al. in 2014, and compared 2D FFDM with 2D FFDM and DBT at 13 academic and nonacademic medical centers, totaling 454,850 patients. Reported model-adjusted recall rates were reduced by 16 per 1000 screens with digital mammography plus tomosynthesis, and the reported incremental cancer detection rate increased by 1.2 per 1000 women screened [27•]. This was slightly lower than the European population-based screening trials of DBT, which revealed an incremental increase in cancer detection of 2–3 additional cancers per 1000 women screened. Of note, both Friedewald et al. and Skaane et al. reported that the 40–41% increase in cancer detection using DBT was for invasive cancers, with no significant increase in the detection of ductal carcinoma in situ (DCIS) [24••, 27•].

Multiple additional studies have since been published, with the two main parameters assessed being recall rates and cancer detection rates. The majority of small and large-scale trials have shown a significant reduction in recall rates when DBT was added to 2D FFDM, ranging from 15 to 37% (Table 1) [24••, 25•, 27•, 28,29,30,31,32,33,34,35, 36•, 37]. Asymmetry was the most common mammographic finding associated with a reduction in recall rates, due to decreased summation artifact from superimposition of tissues [30, 33]. In addition, when patients were recalled, a greater percentage underwent ultrasound alone (without additional mammographic views), due to the improved evaluation of lesion margins with DBT [33]. The majority of studies have shown an increase in cancer detection rate with the addition of DBT, ranging from 20 to 54% [24••, 25•, 27•, 29, 31, 32, 3435, 36•, 37]. Although three studies reported no statistically significant difference in cancer detection [28, 30, 33], they all noted improved recall rates and the importance of decreasing anxiety and cost associated with screening recalls.

Table 1 Screening data of 2D full-field digital mammography (FFDM) alone compared with 2D FFDM combined with digital breast tomosynthesis (DBT)

Some groups further assessed the impact of DBT in screening by stratifying their analysis into subgroups to address which, if any, breast density and age groups were more affected. A study by Haas et al. revealed that the addition of tomosynthesis reduced recall rates for all breast density and patient age groups, though the greatest reductions were for those younger than 50 years and those with dense breasts [28]. McDonald et al. noted a more pronounced reduction in recall rates in women younger than 50 years. In their study, DBT showed a reduction in recalls of 24.1% in women younger than 50 years versus a reduction of 17.9% in women 50 years or older. In the same study, recall rate reduction for dense breasts was 17.2% versus 24.1% in non-dense breasts [34].

A recent analysis by Rafferty et al. also addressed the screening performance of FFDM alone versus FFDM in combination with DBT as a function of breast density [37]. The primary analysis compared the performance of DBT among dense (BI-RADS C and D) versus non-dense breasts (BI-RADS A and B). The addition of DBT caused a reduction in recall rates and an increase in cancer detection rates for both groups, though this was slightly more pronounced in the dense breast group. However, the exploratory subgroup analysis revealed that the improvements were greatest for heterogeneously dense breasts (BI-RADS C) and scattered fibroglandular densities (BI-RADS B). Differences were not statistically significant for the almost entirely fatty (BI-RADS A) and extremely dense breasts (BI-RADS D).

Skaane et al. also noted the most marked improvement in lesion detection in the BI-RADS B and C breasts, i.e., those with scattered fibroglandular densities or heterogeneously dense parenchyma [24••].

Radiation Dose and Synthetic 2D Images

Radiation dose from yearly mammograms has been a point of criticism of breast cancer screening. Kopans et al. noted that despite millions of women having undergone mammography since the 1990s, no increased risk of breast cancer from radiation exposure during screening has been observed [15]. In addition, radiation risk to the breast is age related, and by age 40, the breast is mature and relatively resistant to radiation. Thus, the benefits of mammography are felt to outweigh the minimal radiation risks [38]. However, it is agreed that for any radiologic modality to replace digital mammography, it should have a dose that does not exceed that of conventional full-field digital mammography. As each projection of DBT requires only a fraction of the total dose of a 2D mammogram, DBT can be performed at a radiation dose similar or even less than the combined dose for the standard two-view FFDM [15]. However, DBT is often used in combination with 2D FFDM, as two-dimensional images provide an overview, and are used to compare with prior non-DBT studies, and to evaluate calcifications (which may not be perceived as grouped when scattered over several DBT slices). The combination of 2D and DBT doubles the radiation dose, which is equivalent to 1–2 months of annual background radiation in the USA, though remains below the FDA safety limits of 3 mGy/view [39, 40]. This limitation was addressed with the development of synthesized 2D images (s2D). s2D images are two-dimensional mammographic images that are reconstructed from data acquired during tomosynthesis, with the intent of negating the need for a separate 2D-FFDM image acquisition. Several studies have compared mammographic interpretation of DBT with synthetic 2D images versus DBT and 2D-FFDM images.

Houssami, in a recent review, summarized the prospective and retrospective studies comparing s2D/DBT and 2D/DBT breast imaging [41]. He noted that cancer detection rates were not significantly different across the studies (though improved over 2D alone), and that the radiation dose of s2D/DBT was 55–58% of that for 2D/DBT. Although overall similar, some heterogeneity in recall rates was noted, with lower recall rates using s2D reported by Aujero et al. [42] attributed to more experience (having transitioned to s2D/DBT after gaining experience with 2D/DBT). In contrast, Bernardi et al. attributed their increased recall rates with synthesized images to study design (sequential readers recalling without double reading consensus or arbitration), as well as to lack of experience with interpretation of synthetic images, which enhance lesion detail and parenchymal structures [43]. They noted that with further experience and prior s2D images for comparison in the future, recall rates would likely decline.

DBT in the Diagnostic Setting

Although many studies have documented the merits of DBT in the screening population, its value is also appreciated in the diagnostic setting. As noted previously, the ability to accurately assess lesion margins with tomosynthesis has reduced the need for extra views in mammography [33, 44, 45]. Additional images are not completely obviated, as magnification views are necessary to evaluate calcifications and spot compression images may be necessary to evaluate subtle DBT findings including questionable architectural distortion.

The improved lesion characterization of DBT also results in better differentiation of benign from malignant lesions. This improved specificity has resulted in fewer examinations being categorized as BI-RADS 3 (probably benign), which in turn has decreased unnecessary follow-ups [40, 46].

Studies have shown that DBT provides equal accuracy and greater conspicuity in the evaluation of non-calcified breast lesions when compared to spot magnification 2D FFDM views [20, 44, 45, 47,48,49,50]. Two published studies evaluating the role of DBT in assessing calcifications revealed mixed results [51, 52]. 2D FFDM may be superior in detecting calcifications, yet once detected, DBT shows similar accuracy to 2D-FFDM in the evaluation of calcifications.

DBT Limitations

DBT is uniquely sensitive in the detection of subtle architectural distortion (a potential presentation of malignancy), which is in part responsible for its improved cancer detection rates. However, architectural distortion can also be due to non-malignant causes, most notably post-surgical changes or complex sclerosing lesions (i.e., radial scars). As noted previously, Lång et al. attributed their increased recall rate with DBT to the increased sensitivity for radial scars and post-operative scars [26•]. Partyka et al. have suggested that DBT-detected architectural distortion in the absence of a sonographic correlate has a higher likelihood of benignity [53], though others have disagreed. Freer et al., in their study assessing feasibility and accuracy of DBT-guided needle localization, reported that DBT-detected suspicious architectural distortion that is mammographically or sonographically occult has a 47% positive predictive value for malignancy [54]. Thus, at present, most sites agree that any architectural distortion that is visible solely with DBT should undergo DBT-guided biopsy (Fig. 1).

Fig. 1
figure 1

Fifty-year-old female with a history of left breast conservation therapy (lumpectomy and radiation) for invasive ductal carcinoma 9 years ago presents for annual screening mammogram. a 2D FFDM image reveals heterogeneously dense parenchyma, with several rim-calcified cysts (short arrow). b DBT plane reveals architectural distortion (long arrow), which is more apparent on magnification (c). No ultrasound correlate was identified. The patient underwent DBT-guided core biopsy, revealing invasive lobular carcinoma

Another limitation of DBT is the added interpretation time. Several studies have reported an increase in the reading time over 2D FFDM alone ranging from 33 to 50% [24••, 45, 55]. Although DBT does increase the reading time in the screening setting (up to two-fold at initiation), this is partly compensated by the reduction in recalls for diagnostic images, as well as the increase in patients recalled for ultrasound only (not requiring additional mammographic imaging) [33].

Supplemental Screening Modalities

Whole-Breast Screening Ultrasound

Given the promising improved performance in screening with the addition of DBT, the question arises as to whether additional supplemental imaging is necessary. Multiple studies have shown that whole-breast sonography has an incremental cancer detection rate ranging from 0.8 to 10 per 1000 women when used in the supplemental screening of mammography-negative dense breasts [56,57,58,59]. As yet, no one has directly compared DBT to ultrasound for supplemental screening, though this is currently being evaluated in the ASTOUND prospective multicenter comparative trial [60]. An interim report published in 2016 by Tagliafico et al. revealed that among 3231 mammography-negative screening participants with dense breasts, 24 additional cancers were detected, of which 13 were tomosynthesis-detected versus 23 ultrasound-detected. False-positive recall rates were similar. Although the authors caution that these are only interval results, they report that ultrasound has a better incremental breast cancer detection than tomosynthesis in mammography-negative dense breasts, though suggest that DBT should potentially replace FFDM as the primary screening modality.

Although screening ultrasound in women with dense breasts is very effective in detecting mammographically occult breast cancer, the examination has its limitations. The majority of studies have shown an overall low PPV (positive predictive value) of supplemental ultrasound, with its decreased specificity and decreased PPV3 [40, 58, 61]. Screening ultrasound is also a time-consuming examination, with a handheld ultrasound examination of the breasts requiring an average of 20 min, regardless of whether performed by a radiologist or technologist. Automated breast ultrasound (ABUS) has reduced the time of exam, though research regarding its efficacy as a supplemental screening modality is somewhat limited and requires further assessment [61, 62].

Breast MRI

Supplemental breast MR imaging is most widely used for supplemental screening in high-risk women, most commonly those with greater than 20% lifetime risk of developing breast cancer based on risk-assessment models, BRCA and PTEN genetic mutation carriers or their first-degree untested relatives, and patients with history of chest radiation between the ages of 10–30 years. Although many institutions also include patients with a personal history of breast cancer or pre-malignant breast lesions, patients with a family history of malignancy, and women with dense breasts, its widespread use is limited due to expense, availability, and personal contraindications (including incompatible surgical implants, claustrophobia, contrast allergy, or risk of nephrogenic systemic fibrosis in patients with renal insufficiency that receive gadolinium contrast), as well as high false-positive rates. However, a recent study by Kuhl et al. reported a total supplemental cancer detection rate with MRI of 15.5 per 1000 cases in average risk woman, regardless of breast density [63]. They advocate that MRI replace mammography in screening the average risk woman, given its apparent improved sensitivity for detecting biologically relevant cancers.

Kuhl et al. have also been strong proponents of an abbreviated MRI protocol (AB-MRI) for breast screening [63, 64]. The protocol consists of only one pre- and one post-contrast acquisition, and their derived images (the first post-contrast subtracted [FAST] and maximum-intensity projection [MIP] images). Study acquisition time was reduced from 17 to 3 min, and radiologist reading time was 2.8 s for interpretation of the MIP image (deciding upon presence or absence of significant enhancement) and 28 s for interpretation of the complete abbreviated study, with a reported NPV (negative predictive value) of 99.8 [64]. These values are competitive with batch reading of screening mammograms and are shorter than the time to review DBT images [64]. Further studies to evaluate the performance of AB-MRI are being performed, and although scan and interpretation times in the USA have shortened, they remain longer than those reported by Kuhl. The EA1141 trial is an ongoing prospective multicenter diagnostic accuracy trial sponsored by ECOG-ACRIN in an aim to assess the performance of abbreviated breast MRI and digital breast tomosynthesis in breast cancer screening in women with dense breasts. The trial will assess AB-MRI as both a supplemental screening modality and a stand-alone [65••].

Molecular Breast Imaging

Molecular breast imaging (MBI) is a functional imaging modality that utilizes a short-lived radiotracer (99mTc-sestamibi) to detect cancer. Rhodes et al., in a prospective clinical trial comparing mammography with MBI in patients with dense breasts demonstrated that MBI has a substantially higher cancer detection rate than 2D mammography, with a supplemental cancer detection rate of 8.8 per 1000 woman, when added to FFDM [66•]. Shermis et al. reported a similar 7.7% incremental cancer detection rate when employed in a large, community-based practice [67]. The associated low false-positive rate makes MBI an intriguing candidate for supplemental screening to mammography in dense breasts, allowing the detection of cancer regardless of whether a concordant structural abnormality is identified on DBT or 2D-FFDM. MBI has also shown added value in the supplemental assessment of the dense breast with certain practices entirely substituting it for whole-breast ultrasonography [68, 69]. MBI may also be indicated when a physiologic supplemental examination is needed and there are contraindications to MRI use.

A limitation to MBI is its added radiation dose. Rhodes et al. reported an effective whole body dose of 2.4 mSv. Although higher than the average effective dose from digital mammography (~ 0.5 mSv) and the effective dose from digital mammography combined with DBT (1.2 mSv), it is still below natural background radiation levels (US annual average, 3 mSv) [66•].

Contrast-Enhanced Mammography and Contrast-Enhanced Tomography

Various groups have evaluated the role of contrast-enhanced mammography (CEM) over the years, with increased interest of late due to the adoption of digital mammography. This enhanced examination has the potential to detect vascularization and physiological activity of the breast at a lower cost than MRI. As with MBI, it may be of benefit to women who have a contraindication to MRI, women with dense breasts, or even women of intermediate to high risk of breast cancer. Studies have documented that breast MRI has the highest sensitivity for cancer detection, when compared to MBI, ultrasound, and DBT [70, 71]. However, recent studies have shown that the sensitivity of CEM may approach that of MRI [72, 73]. Some groups have combined CEM with tomosynthesis, to develop contrast-enhanced tomography (CET). A study by Chou et al. reported that CET is superior to non-enhanced breast imaging tools. However, no statistically significant difference in the AUC-value was observed when comparing it to CEM [74].

Neither CEM nor CET has found widespread acceptance. Many opponents disagree with the utilization of IV non-iodinated contrast, with its inherent risks and morbidities (including allergy, anaphylaxis, and potential contrast-induced nephropathy) in the mammography suite.

Conclusion

The ultimate goal of breast cancer mortality rate reduction has led to the implementation of multiple advanced mammographic imaging techniques, the most notable and widely used being digital breast tomosynthesis. Numerous studies have confirmed its added value with increased cancer detection of 1–3 additional cancers per 1000 screens, and in most cases, its decreased recall rates, leading to decreased patient anxiety and diminished cost of screening evaluations. Given these added benefits, DBT is gradually replacing 2D FFDM as the primary screening modality. Despite this incremental improvement in screening, DBT remains less sensitive than other supplemental screening modalities. Multiple studies have shown that MRI has the greatest sensitivity for supplemental cancer detection, though given its high false-positive rate, in addition to cost, often limited availability, and personal contraindications, screening MRI is predominantly reserved for high-risk patients. Supplemental whole-breast ultrasound is the most widely used supplemental screening tool, due to its low cost, absence of radiation, and ease of accessibility. False-positive rates remain high, though proponents argue that with increased experience, these will decrease over time. MBI has a higher sensitivity than ultrasound and higher specificity, but is less widely utilized, in part due to accessibility and radiation dose. With further study and potential decreased radiation, MBI may be a promising supplemental screening modality in patients with dense breasts. CEM and CET are being investigated, though are less likely to gain widespread acceptance, given their reliance on intravenous contrast with its inherent risks and necessary precautions in the mammography suite.