Background

Screening women at increased risk of breast cancer (BC) has been an area of intense research and has evolved in practice with MRI screening as an adjunct to mammography for some high-risk groups [1, 2]. Research in screening high-risk women has largely focused on women with BC gene mutations and/or those with a family history of BC [1, 2]. Women with biopsy-proven lobular carcinoma in situ (LCIS), atypical lobular hyperplasia (ALH), or atypical ductal hyperplasia (ADH) have an increased risk of BC [38]. Risk in women with LCIS is approximately 2–10 times that of women without proliferative lesions on biopsy [3, 7], and for atypical hyperplasia (AH) that includes ADH or ALH the risk is 3–6 times higher than women without proliferative breast disease history [46, 9]. BC risk associated with LCIS or AH may be modified by factors such as family history [5] but is independent of mammographic density [4]. Despite the long-documented increased BC risk in women with a history of proliferative breast lesions, accuracy of mammography screening in women with previous LCIS or AH has received little research attention.

A few studies have examined MRI accuracy in screening women with previous LCIS [10, 11], and LCIS or AH [12]—these have focused on the incremental detection of MRI but have also indicated low mammography sensitivity [1012]. In the largest study of screening women with LCIS (reporting 17 cancers in 14 women), mammography sensitivity approximated 30 % [10]. However, it is unknown whether this represents mammography screening accuracy in LCIS women, or whether it represents mammography accuracy in those selected to adjunct MRI due to other factors associated with reduced mammography sensitivity. For example, Port et al. [12] examined MRI screening in women with LCIS or AH history, and found that those selected to MRI screening were more likely to be younger women or to have a family history of BC.

Because of their increased risk of BC, and given the lack of evidence on the outcomes of mammography screening in women with a history of LCIS or AH as well as the interest in adjunct screening for these women, we examined the accuracy and outcomes of screening mammography in women with LCIS or AH history relative to matched women without such history.

Methods

We identified all women with a history of histology-proven LCIS, ALH, ADH, or AH (not further specified) who participated in screening mammography in facilities affiliated with one of the seven mammography registries forming the National Cancer Institute-funded Breast Cancer Surveillance Consortium (BCSC) [13]. BCSC registries collect demographic and mammography information linked with state or Surveillance Epidemiology and End Results (SEER) cancer registries to ascertain BC diagnoses; five registries additionally collect pathology data. Each registry and BCSC Statistical Coordinating Center (SCC) received institutional review board approval for active or passive consenting processes or consent waiver to enroll women, link data, and perform analytic studies. All procedures are Health Insurance Portability and Accountability Act compliant and all registries and SCC received a Federal Certificate of Confidentiality and other protections for identities of women, physicians, and facilities who are subjects of this research

Screening mammograms (1996–2010) from women with LCIS, ALH, ADH, or AH based on surgery, excision biopsy, or core-needle biopsy, were included except for diagnoses with subsequent BC within 12 months. This exclusion avoids core-needle histology diagnoses of atypical lesions that represented underestimates of BC [14]. We also excluded women having mammograms for symptom evaluation, based on information from the radiologist or self-reported symptoms, and women with a personal history of BC. Definition of screening mammography was based on standard BCSC definition [15, 16], except that unilateral screens from women with LCIS/ALH or ADH/AH (and without BC history) who received mastectomy for their high-risk histology, were included (see “Sensitivity analysis” section).

Based on the above-defined eligibility criteria we assembled two cohorts of women at increased BC risk defined by histology: one cohort combined LCIS and ALH (“lobular neoplasias”), and another cohort comprising ADH and other AHs (‘ADH/AH cohort’) that included predominantly ADH but also mixed ADH/ALH, AH not further specified, and rare forms of AH.

Comparator cohorts

We assembled two cohorts of asymptomatic women without a history of LCIS, ALH, ADH, AH, or surgical biopsy (to ensure that comparison screens did not have a history of atypical lesions), matched at the mammogram level on a 5:1 ratio to each screen from our high-risk cohorts. Screens were matched for 5-year age-groups, breast density category, BC family history, year of screen, and mammography registry.

Demographic and mammogram characteristics

Age, self-reported race/ethnicity, first-degree family history of BC, menopausal status, time since last mammogram, and self-reported use of HRT or use of chemoprevention agents (such as tamoxifen) were collected at time of screening. BI-RADS [17] breast density was routinely recorded.

Outcomes

A positive screening mammogram was an initial BI-RADS assessment 0, 4, 5, or 3 with recommendation for immediate follow-up. A negative mammogram was BI-RADS 1, 2, or 3 with no recommendation for immediate follow-up. For each cohort, we determined screening accuracy [sensitivity, specificity, recall rate, positive predictive value (PPV)], and screening outcomes (cancer detection and interval cancer rates). Frequency of recommendation for surgical consult/biopsy was based on final assessments and recommendations made within 90 days of initial assessment. Outcomes were ascertained at 1 year from screening or prior to the next screen (whichever occurred first) through linkage to SEER registries or regional cancer registries and also to pathology records to determine (invasive or in situ) cancer status based on clinical pathology reports. Central pathology review was not performed. LCIS was not considered a malignant outcome.

Statistical analyses

Characteristics of screening mammograms and BCs (total observed, total invasive) were summarized for each cohort. We calculated accuracy measures using BCSC definitions [15, 16], and estimated cancer incidence rates, cancer detection rate (CDR), and interval cancer rates (per 1,000 screens) at 1-year follow-up, for each cohort. Accuracy and outcome measures were compared between the LCIS/ALH and the ADH/AH cohorts with the matched comparator group. Odds ratios (OR) and 95 % confidence intervals (CI) were calculated for each cohort relative to its comparator, using logistic regression, both unadjusted and adjusted for year of screen, age-group, BC family history, breast density, and time since last mammogram. For specificity, we accounted for potential correlation among mammograms on the same woman using GEE with an independent correlation structure and empirical standard errors.

Sensitivity analysis

For the ADH/AH cohort, sensitivity analysis for outcome measures excluded screens from women with unspecified or rare types of AH. We also examined outcomes after excluding 36 screens from women with unilateral mastectomy for atypical lesions.

Results

Table 1 summarizes the characteristics of the 52,380 included screening mammograms and 359 cancers (277 invasive), by history of LCIS/ALH, ADH/AH, and matched cohorts. Given that we matched screens for several variables, the main notable difference in distribution of characteristics was that women in the high-risk (“atypia”) cohorts were more likely to have a shorter time-interval since the last mammogram than their comparator cohort. In the LCIS/ALH group, 19 cancers (17 invasive) occurred on the same-side as the previously affected breast, 25 cancers (18 invasive) affected the contralateral breast, and 2 were bilateral invasive BC (in women with unilateral LCIS/ALH history). In the ADH/AH group, 49 cancers (33 invasive) occurred on the same-side as that previously affected, 28 cancers (19 invasive) were in the contralateral breast, and 7 were bilateral invasive BC (in women with unilateral ADH/AH history).

Table 1 Distribution of characteristics of screening mammograms and cancers within 1 year of screen, in women with a history of LCIS/ALH, or ADH/AH, and matched screens without such history

LCIS/ALH cohort

Table 2 reports outcomes for the 2,505 LCIS/ALH screens, and matched cohort (12,525 screens). Cancer rates and CDR were significantly higher (P < 0.0001) for LCIS/ALH screens than comparator screens: cancer rates indicate an approximate four-time increased risk of BC for the LCIS/ALH cohort. Interval cancer rates were 4.4/1,000 screens for LCIS/ALH cohort compared with 0.9/1,000 for matched screens (P = 0.0003); however, the proportion of cancers that were interval cancers did not significantly differ between the two cohorts (P = 0.43). Screening sensitivity of 76.1 % in the LCIS/ALH cohort was not significantly different from a sensitivity of 82.3 % in the matched cohort (P = 0.43) but a specificity of 85.1 % in the LCIS/ALH cohort was significantly lower than that of 90.7 % in the matched cohort (P < 0.0001). Recall rates, PPV, and the percentage recommended for surgical consultation were all higher for LCIS/ALH screens compared with matched screens (P ≤ 0.001). Table 2 shows the (unadjusted or simple) ORs for the LCIS/ALH cohort relative to matched cohort, and also the adjusted ORs (see “Methods” section for variables included in adjustment). The adjusted ORs did not substantially differ from the simple ORs and did not alter the statistical associations shown in Table 2, except that the adjusted OR for DCIS detection rate showed weaker evidence of a difference compared to the matched cohort (P = 0.10). Other changes noted from the adjusted analysis were a slightly lower OR for interval cancer rates (adjusted OR = 4.47; P = 0.013) and a higher OR for sensitivity (adjusted OR = 1.10; P = 0.90) relative to the simple ORs. The latter reflects that there is less difference in mammography sensitivity between the LCIS/ALH and comparator cohorts after the adjustment, possibly due to allowing for differences in screening intervals (time-interval since last mammogram) between the groups.

Table 2 Accuracy and outcomes of mammography screening in women with a history of LCIS or ALH relative to matched mammograms in women without such history

ADH/AH cohort

Table 3 reports outcomes for the 6,225 ADH/AH screens and matched comparator (31,125 screens). Cancer rates and CDR were significantly higher (P < 0.0001) for the ADH/AH screens relative to matched screens: cancer rates indicate more than double the risk of BC for the ADH/AH cohort. Interval cancer rates were 2.6/1,000 screens for ADH/AH screens compared with 0.9/1,000 for matched screens (P = 0.002); however, the proportion of cancers that were interval cancers did not significantly differ between the two cohorts (P = 0.74). Screening sensitivity of 81.0 % in the ADH/AH cohort was similar to sensitivity of 82.6 % in the matched cohort (P = 0.74) but a specificity of 86.2 % in the ADH/AH cohort was significantly lower than that of 90.2 % in the comparator cohort (P < 0.0001). Recall rates, PPV, and the percentage recommended for surgical consultation were all higher for ADH/AH screens than matched screens (P < 0.001). A sensitivity analysis for the ADH/AH cohort that excluded 1,188 screens from women with unspecified or rare forms of AH had little to no effect on estimates of screening accuracy and outcomes. Table 3 also shows the (unadjusted/simple) ORs for the ADH/AH cohort relative to its comparator, and the adjusted ORs; the latter did not substantially differ from the (simple) ORs, and did not alter statistical associations.

Table 3 Accuracy and outcomes of mammography screening in women with a history of ADH or AH relative to matched mammograms in women without such history

Discussion

We report the first evaluation to date of the accuracy and outcomes of mammography screening in women at increased risk of BC defined by LCIS, ALH, ADH, or AH history. Our findings from screening these women are presented in comparison to screens from women without such history, matched for variables known to affect mammography screening accuracy. Our work confirms that women with a history of these atypical lesions are at substantially increased risk of BC (relative to matched cohorts) even though risk was estimated only for 1-year follow-up from screening given that the study focused on screening accuracy. Our findings also highlight that these atypical lesions are markers of generalized (bilateral) BC risk, particularly in the LCIS/ALH cohort, as evidenced by our data on BC laterality (see “Results” section) and in keeping with the findings from other researchers [5]. The bilaterality of the observed cancers also reinforces the potential value of considering chemoprevention for management of these women, particularly the LCIS/ALH cohort, as recommended in guidelines [18]. We noted that although women in our high-risk cohorts were much more likely to report current use of chemoprevention agents (including tamoxifen, Table 1) than those in the matched cohorts, overall only a minority reported receiving chemo-prevention.

In the LCIS/ALH cohort, cancer rates, CDR, and interval cancer rates indicate approximately four-times increased risk of BC relative to the matched cohort. Despite the higher cancer rates, mammography screening did not have significantly lower sensitivity in these women relative to matched screens. Our estimated screening sensitivity of 76.1 % is inconsistent with the low mammography sensitivity (<50 %) from small studies restricted to subjects selected to adjunct screening (described in our “Background” section) [1012]. High interval cancer rates in the LCIS/ALH cohort reflect the higher underlying cancer rates in these women, and raise the possibility that some interval cancers (conventionally defined as cancers arising after a negative mammographic screen and before the next routine screen) may have been identified through adjunct (MRI or ultrasound) screening. Whereas most interval cancers emerge as symptomatic or clinically-detected cancers, if these women are being referred for adjunct screening on the basis of their history of atypical lesions, then some cancers may have been detected through more intensive screening and would be identified as ‘interval cancers’ even though they may not have emerged as true interval cases in the absence of adjunct screening. Because we did not have the data on adjunct screening we cannot determine whether or not some of these interval cancers were detected through adjunct screening. We found that mammography specificity was significantly lower in the LCIS/ALH cohort than comparator cohort; other measures of screening accuracy (recalls, PPV, recommendation for surgical assessment) were also significantly higher in the LCIS/ALH cohort due in part to lower specificity but also due to higher cancer rates. We cannot compare our findings to those from other studies because we did not identify any studies investigating the accuracy of mammography screening in unselected women with a history of these atypical lesions. Of note, a recent study of 776 women with a history of LCIS reported that there were no differences in crude CDRs among women who had annual mammography with clinical examination for screening and the subgroup that also had adjunct MRI screening [19].

The pattern of findings for the ADH/AH cohort relative to its matched comparator was generally similar to that outlined above for the LCIS/ALH cohort, except that the underlying risk of BC in the ADH/AH cohort was approximately 2.5-times that of its comparator cohort. CDR and interval cancer rates were significantly higher in the ADH/AH cohort relative to its comparator, reflecting higher underlying BC rates in the ADH/AH cohort and again raising the possibility that adjunct screening may have contributed to higher interval cancer rates. Mammography screening in the ADH/AH cohort had similar sensitivity to that estimated for the matched cohort; here our findings again contradict the low mammography sensitivity suggested in studies of subjects selected to adjunct screening [1012, 19]. Mammography specificity was significantly lower in the ADH/AH cohort (relative to comparator) and recall rates, PPV, and recommendation for surgical review were also higher in the ADH/AH cohort, due to the combination of lower specificity and higher underlying cancer rates.

The lower mammography screening specificity in the LCIS/ALH and ADH/AH cohorts relative to matched screens may be partly due to the vast majority of women with these atypical lesions having had surgical biopsy, whereas the comparator group included women without a biopsy history. A history of surgical biopsy is associated with lower mammography specificity [20], hence a limitation of our study is that we cannot differentiate the effect from these histological markers, as opposed to that from surgical biopsy history, in contributing to relatively lower screening specificity in the cohorts with a history of atypical lesions. It is also possible that knowledge of the woman’s history of an atypical lesion leads radiologists to adopt a lower threshold for recommending further (potentially unnecessary) testing and biopsy hence leading to lower specificity. Another study limitation (outlined above) is that we did not have information on whether some interval cancers were detected through adjunct screening of women with history of atypical lesions. Some might argue that 1-year follow-up from screening could lead to relatively modest risk estimates in our LCIS/ALH and ADH/AH women; however, we emphasize that our study focused on screening outcomes (not risk calculation) hence 1-year follow-up was appropriate for the aim of our research.

Although the relatively high interval cancer rates in the LCIS/ALH and ADH/AH cohorts raise concern, our findings should not be taken as inference of recommendation for or against adjunct screening of women with a history of these lesions. Our study primarily establishes mammography screening accuracy in women with a history of LCIS/ALH or ADH/AH using methods that avoid selection bias, and reports these estimates in the context of matched screening participants. The high interval cancer rates in both LCIS/ALH and ADH/AH cohorts appear to be predominantly due to higher underlying cancer rates, because the proportion of cancers that were interval BC did not statistically differ from that in the matched cohorts, and screening sensitivity was not significantly lower in our high-risk cohorts. Importantly, our results suggest that incremental BC detection from adjunct screening for other high-risk cohorts (for example gene mutation carriers) where mammography sensitivity is very low [1, 2] cannot be extrapolated to LCIS/ALH or ADH/AH women in whom mammography screening has adequate sensitivity evidenced by our study findings. Given the high interval cancer rates, we suggest that adjunct screening may be of value in women with LCIS/ALH or ADH/AH history if it can be shown to reduce interval cancer rates. Because of the relatively lower specificity and higher recall rates in the LCIS/ALH and the ADH/AH cohorts, it will be particularly important to examine the impact on specificity and on recall and biopsy rates of introducing adjunct screening in these women.