Introduction

Barrett’s esophagus is a preneoplastic condition in which the lining of the distal esophagus changes from normal squamous epithelium to SIM, a precursor for progression to EA. The incidence of EA has increased over the past few decades, with prognosis linked to the stage of the tumor [14]. Individuals diagnosed in the early stages of EA can have 5-year survival rates as high as 95%, whereas those diagnosed with more advanced disease have only a 10–15% 5-year survival rate [5]. As a result, major emphasis has been placed upon the early detection of dysplasia and EA in order to eradicate these high-grade lesions and improve overall survival.

The current standard of practice is periodic four-quadrant surveillance biopsies in each centimeter of the involved esophageal segment if dysplasia is present. This allows for stratification of individuals based upon degree of dysplasia and subsequently helps determine likelihood of progressing to EA. However, adherence to these guidelines appears to be low, thus limiting the detection of high-grade dysplastic lesions [6]. Additionally, this method is limited in two substantial ways: (1) sampling error and (2) high pathologist inter-observer variability [7]. Sampling error can occur in tissue collection because early or subtle mucosal changes in dysplastic tissue may be patchy, and offer little macroscopic evidence of its premalignant potential [8]. After collection, diagnosis by a pathologist can have significant variability. This is most pronounced in distinguishing between the lower-grade dysplastic lesions, but notably only approximately 85% inter-observer agreement is measured when diagnosing high-grade dysplasia and EA versus all other forms of dysplasia (including no dysplasia) [9, 10]. Thus, improved more objective methods of risk stratification for BE patients are needed.

Several studies demonstrate that an increasing number of genetic alterations occur as SIM progresses to EA, and exploration of these markers to diagnose dysplasia is an active area of research [1113]. In 2006, FISH probes comprising four loci, 8q24 (MYC), 9p21 (CDKN2A), 17q11.2 (ERBB2), and 20q13.2 (ZNF217), showed high sensitivity and specificity for identifying HGD and EA, and this method was shown to be superior to digital image analysis (DIA) and cytology [14, 15]. Furthermore, polysomy, as detected by FISH, was the most commonly detected abnormality in areas of dysplasia and EA, and the most likely predictor of progression to HGD/EA [15, 16]. This multicenter study aims to validate the measurement of polysomy by FISH as a way to objectively discriminate degrees of dysplasia of BE biopsy specimens.

Methods

Clinical Ascertainment

Forty-five patients with known BE and no prior eradication therapy were enrolled at four different institutions: University of Michigan Health System, Brigham and Women’s Hospital, University of Texas MD Anderson Cancer Center, and St. Michael’s Hospital in Toronto. BE tissue specimens were collected during standard of care surveillance endoscopies. A random subset of the specimens collected from each consented patient were then used for analysis. A “Site diagnosis” was read by that institution’s pathology department. The “Review diagnosis” was determined by re-sectioning the paraffin-embedded tissue and having these slides read by a single gastrointestinal pathologist at the University of Michigan Health System (“central pathologist”). This central pathologist was blinded to the Site diagnosis read. The same slides read by the central pathologist were then sent to the Mayo Clinic for FISH analysis. The potential diagnoses in order of increasing severity were: no evidence of BE (normal cardia and squamous epithelium), SIM (specialized intestinal metaplasia), IGD (indefinite-grade dysplasia), LGD (low-grade dysplasia), HGD (high-grade dysplasia), and EA (esophageal adenocarcinoma). Data were analyzed using the Site diagnosis, the Review diagnosis, and the highest grade diagnosis within the entire BE segment of each patient (“Overall diagnosis”). The Overall diagnosis is clinically applicable because the patient’s management is determined by the highest grade lesion within the esophagus. No baseline demographic data were collected as all tissue specimens were subsequently de-identified for analysis. A total of 46 non-BE, 42 SIM, 23 IGD, 10 LGD, 29 HGD, and 42 EA specimens were analyzed. The institutional review boards of each participating institution approved this study.

FISH

Unstained formalin-fixed paraffin-embedded slides were pretreated and hybridized with probes provided by Abbott Molecular (DesPlaines, IL) directed to 8q24 (MYC), 9p21 (CDKN2A), 17q11.2 (ERBB2), and 20q13.2 (ZNF217), as previously described [14]. FISH signal patterns for 50 nuclei within the histologic area of interest were enumerated by a technologist experienced in FISH analysis. Each cell analyzed was categorized as FISH negative (having the anticipated number of two FISH signals per probe) or FISH abnormal (having more or less than the anticipated number of FISH signals). Gain of a locus was defined as a single locus having greater than two signals, and loss of a locus was defined a single locus having less than two signals. FISH abnormality types included polysomy (gain of at least two loci), single locus gain (3–9 signals of a single locus and two copies of other loci), amplification of a single locus (≥10 signals of a single locus and two copies of other loci), and 9p21 loss (0–1 signals of 9p21 and two copies of other loci). The predominant FISH abnormality was defined as the most common abnormality (i.e., highest percentage) detected within a specimen.

Statistical Analysis

Statistical analyses were performed with SAS version 9.4 and IBM SPSS version 22. The kappa statistic was calculated as a measure of agreement between Site diagnosis and Review diagnosis for varying degrees of dysplasia. Receiver operator curves were used in order to compare different FISH chromosomal abnormalities as diagnostic criteria for determining degree of dysplasia. In order to determine the best predictor of dysplasia, the area under the receiver operator curves (AUCs) for various FISH abnormality types were plotted using a bootstrap analysis. P values were then computed using the method appropriate for comparing correlated AUCs [17]. Given that prior single-institution data found polysomy to be the most commonly detected abnormality in areas of dysplasia and EA, we focused our analysis on the validation of polysomy as a sensitive and specific marker for detecting varying degrees of dysplasia and EA.

Results

A total of 192 specimens were analyzed by a site pathologist, central pathologist, and FISH analysis using probes directed to 8q24 (MYC), 9p21 (CDKN2A), 17q11.2 (ERBB2), and 20q13.2 (ZNF217). The histologic breakdown of the specimens as recorded by the central pathologist was 46 non-BE, 42 SIM, 23 IGD, 10 LGD, 29 HGD, and 42 EA specimens. Inter-observer agreement was calculated between the central and site pathologists. The kappa statistic for agreement on identification of EA/HGD versus rest was .79, with a value of 1.0 indicating 100% agreement between the different reads. In accordance with previously published studies, higher inter-observer agreement was seen when distinguishing EA and HGD from other degrees of dysplasia (Table 1).

Table 1 Pathologist inter-observer variability

To validate FISH polysomy as a marker for detection of dysplasia in these specimens, the area under the receiver operating curves were calculated to compare different FISH abnormalities as predictors for the central pathologist’s Review diagnosis, our gold standard. Table 2 shows the AUC values for both cells with polysomy and cells with 9p21 loss as the predominant FISH abnormality, and the comparative p values. Previous studies used positive criteria for FISH to include polysomy, gain of a single locus and 9p21 loss [13]. In our data, polysomy had the highest discriminatory power for most comparisons, with an AUC of .83 in discriminating EA/HGD from all other diagnoses. Interestingly, similar values were seen when looking at the AUC for percentage of cells with polysomy as a predictor of both the Site diagnosis and Overall diagnosis. Of note, adding SIM to the comparison causes 9p21 loss to become the predominant FISH abnormality, which lessens the AUC for polysomy. The AUC for the percentage of cells with a predominant 9p21 loss for EA/HGD/LGD/IGD/SIM versus rest was .90, whereas the AUC for the percentage of cells with polysomy for the same comparison was .76. An additional analysis that included any FISH abnormality (polysomy, single gain, amplification, and 9p21 loss) resulted in an AUC of .75 for HGD/EA samples versus rest.

Table 2 AUC for chromosomal abnormalities

Although many types of chromosomal abnormalities are present for each respective histologic classification, polysomy is most commonly observed with increasing degrees of dysplasia (Fig. 1). Furthermore, as SIM progresses to EA, the percent of specimens with polysomy as the most common FISH abnormality also increased from approximately 5 to 80%, respectively. Of note, all 46 specimens that were negative for BE had no cells with polysomy.

Fig. 1
figure 1

Predominant FISH abnormality per histologic classification. If more than one FISH abnormality was detected in a specimen, the most predominant (highest percentage) was used for this analysis

We examined the receiver operator curve for polysomy to select an optimal cutoff value of 10%. When compared to the Overall diagnosis, if at least 10% cells in a specimen were positive for polysomy, we were able to obtain a sensitivity and specificity of 88 and 75% for EA versus rest and a sensitivity and specificity of 80 and 88% for EA/HGD versus rest, respectively. When LGD and IGD were added, specificity increased to 96% but sensitivity decreased to 74% (Table 3).

Table 3 Sensitivity and specificity of polysomy using overall diagnosis

Since early or subtle mucosal changes in dysplastic tissue may be small and frequently not identifiable by gross examination, we explored whether high-grade lesions elsewhere in the esophagus could lead to a higher number of cells having amplified loci in the surrounding non-HGD/EA tissue, a so-called field effect. From our analyses, we found 7/7 (100%) of IGD/LGD specimens to have evidence of amplification or polysomy when HGD was present elsewhere in the same esophageal segment (Table 4).

Table 4 The role of FISH in the detection of a field effect

Discussion

Current methods of monitoring BE patients for progression to EA rely on random sampling of mucosa and have high inter-observer variability among pathologists [7]. Several studies have examined alternative ways to measure degree of dysplasia, specifically comparing FISH to both DIA (digital image analysis) and cytology [15]. Using a four-probe combination developed in 2006, FISH was found to have superior sensitivity for determining LGD, HGD, and EA as compared to DIA and cytology. Since then, studies have used different criteria for FISH positivity and have focused upon revealing the cytogenetic changes as normal mucosa progresses to EA [1416, 18, 19]. Polysomy is the most commonly detected abnormality in areas of dysplasia and EA, and our study focused on using polysomy by FISH to analyze the degree of dysplasia in order to limit pathologist inter-observer variability and validate a more objective measure of dysplasia [15].

This multicenter study builds upon single-institution data that suggest that FISH may be useful for the detection of dysplasia and EA in patients with BE. In concordance with previous findings, our study demonstrated an increasing number of specimens with polysomy as the predominant chromosomal abnormality detected by FISH as the degree of dysplasia increased (Fig. 1). This indicates that cellular progression toward adenocarcinoma coincides with chromosomal aberrations at these probe sites resulting in gain of loci. This is true with the exception of the 9p21 locus, which is the site of a tumor suppressor gene (CDKN2A) that is usually lost as the cell progresses from normal mucosa to BE [16, 20].

In comparing the AUC for each FISH abnormality, we focused on 9p21 loss, since it is an early and pervasive alteration in the change to metaplasia. Using the Review diagnosis, we found the AUC for polysomy to be superior for most histologic comparisons (Table 2). Similar AUC values were noted when using the Overall diagnosis, indicating that the percentage of cells with polysomy was able to predict the patient’s highest grade pathologic diagnosis with comparable accuracy. Of note, for both the Review and Overall diagnosis, the AUC for the percentage of cells with 9p21 loss as the predominant abnormality for EA/HGD/LGD/IGD/SIM versus rest was greater than the AUC for the percentage of cells with polysomy for the same histologic comparison. We expect this result because 9p21 deletion is an early marker often found in metaplasia, whereas polysomy is more prevalent in dysplasia and EA.

We found that polysomy by FISH had excellent diagnostic capabilities in discriminating HGD/EA from the remaining histologic diagnoses, a clinically important distinction, as these diagnoses frequently lead to endoscopic treatment and oncologic evaluation. As a result, FISH may potentially be utilized in conjunction with pathology, as current guidelines state that any diagnosis of dysplasia must be confirmed. Traditionally, this has resulted in a second pathologist with expertise in BE reviewing the specimen. However, given the high diagnostic accuracy of FISH in detecting HGD/EA, FISH analysis may be used to confirm pathologic diagnoses of HGD and obviate the need for a second pathologist’s review. When LGD and IGD were added, specificity greatly increased at the expense of sensitivity. In a screening setting, a high sensitivity would be preferable to maximize the number of cases identified. Of note, our results demonstrate improved sensitivity and specificity in detecting HGD/EA as compared to cytology (both brush and balloon), and digital image analysis [15, 21].

Our study also suggests the presence of a “field effect,” potentially enhancing the use of FISH as a screening tool during endoscopy. Our data, although limited in power, suggest that genetic changes may occur in nearby tissue when HGD is present elsewhere in the segment. More specifically, we found that all esophageal segments read as IGD/LGD had evidence of amplification or polysomy only when HGD was found elsewhere in the patient’s esophageal column. Additionally, recent evidence has suggested that FISH may play a role in risk-stratifying individuals with dysplasia. A retrospective analysis of patients with HGD found that if more than 4 of 100 cells had evidence of polysomy, there was a higher likelihood of progression to EA [22]. Although further verification is needed, these findings suggest that genetic changes detected by FISH can both predict HGD lesions elsewhere in the esophageal column as well as progression to EA. As a screening tool, this has the potential to reduce the effect of sampling bias by limiting the number of HGD lesions missed, as well as guide future management through further risk stratification of patients with dysplasia.

One potential factor limiting the widespread use of FISH is its availability. In practice, biopsies taken during a surveillance endoscopy would first be read by an experienced pathologist, re-sectioned, and then sent for FISH analysis. Currently, although many commercial laboratories and academic centers have the necessary equipment and technologists required to run FISH, it is not widely available. Analogous to sending a specimen to a center with an experienced pathologist, the tissue would need to be sent to a center with FISH capabilities. Although the FISH results would be ready in only a few days, sending the specimens for processing would incur an additional cost that is on the order of a few hundred dollars.

One limitation of our study is that our kappa statistic appears to be higher than those previously reported [10]. In our analysis, we focused on binary comparisons, and we evaluated agreement between two specialized pathologists. This may inflate our observed kappa statistic. Additionally, it should be noted that re-sectioning tissue may contribute to differences in the Site and Review diagnoses, although this is likely negligible given that only a few microns were used to cut each tissue block.

In conclusion, this study demonstrated a high detection rate for HGD and EA using a FISH panel with probes to 8q24 (MYC), 9p21 (CDKN2A), 17q12 (ERBB2), and 20q13 (ZNF217). Additionally, FISH analysis of LGD/IGD specimens was able to accurately predict the presence of high-grade dysplastic lesions elsewhere within the patient’s BE segment. Given its high objective diagnostic accuracy for identifying HGD/EA and potential to detect higher-grade histology at other levels in the BE segment, FISH may be useful in conjunction with traditional histologic analysis, both as a diagnostic marker and screening tool to identify those at higher risk of progression to EA. Future prospective studies specifically examining its use as an adjunctive marker are needed in order to further investigate its promising and clinically relevant potential.