Introduction

Neuroendocrine neoplasms can broadly be divided into well-differentiated neuroendocrine tumors (NETs) and poorly differentiated neuroendocrine carcinomas (NECs), with the latter made up of small cell neuroendocrine carcinoma and large cell neuroendocrine carcinoma (LCNEC). The two most common organ systems for development of neuroendocrine neoplasms are the digestive system and the respiratory system [1]. In the former, well-differentiated tumors are more common in most organs, though NEC can occur. In the latter, well-differentiated tumors (carcinoids and atypical carcinoids) are less common than NEC due to the high prevalence of small cell lung carcinoma (SCLC); LCNEC can also occur in the lung but is far less common.

Gastrointestinal NECs (GI-NECs), though rare, are very aggressive; 60–80% patients with GI-NEC have distant metastatic disease at the time of presentation [2,3,4,5]. Median survival is only 1 month without chemotherapy, compared to 11–14 months with palliative chemotherapy [3, 5,6,7].

SCLC, which is seen almost solely in smokers, is the preferred term for pulmonary small cell NEC and currently makes up 15–25% of invasive lung cancer worldwide; approximately 45,000 new cases are diagnosed annually in the USA alone [8, 9]. It has a rapid doubling time [10], with a more aggressive clinical course than non-SCLC [11, 12]. Like GI-NEC, it is often metastatic at presentation [13, 14]. LCNEC, in contrast, accounts for approximately 3% of lung malignancies in the USA [15]; it also has a higher incidence in smokers [16, 17]. Overall, the lungs are the most common sites for primary NEC to occur, with the digestive system coming in second [18].

Histologically, NEC from the gastrointestinal tract and the lung appear essentially identical, with sheets of high-grade neuroendocrine cells demonstrating abundant mitoses and areas of necrosis. Furthermore, as noted, both GI-NECs and pulmonary NECs (P-NEC) metastasize early. As a result, some patients may present with the metastatic foci representing the first-encountered or most obvious site of disease. In situations where these foci are sampled for histologic evaluation, pathologists may initiate workup to attempt to identify site of origin. This is becoming increasingly important, as site of origin is emerging as a relevant factor in selecting oncologic treatment regimens for NECs. However, relatively few immunohistochemical (IHC) markers have been interrogated for their ability to reliably distinguish between GI-NECs and P-NECs. Indeed, it has been stated that “immunohistochemistry has a more limited role in assigning NEC site of origin” [19]. Prior reports have indicated that TTF1 is a reliable, though variable, marker for P-NEC [20], while CDX2 and SATB2 can reportedly be useful to diagnose GI-NEC [17, 21]. Our study sought to confirm and expand these findings by performing and comparing a large number of IHC stains on a series of GI-NEC and P-NEC.

Material and methods

Following approval by the Institutional Review Board (Dana Farber Cancer Institute, Office for Human Research Studies, 19-085, 27/2/2019), primary GI-NECs and P-NECs were identified from the surgical pathology archives of the Beth Israel Deaconess Medical Center. Mixed neuroendocrine–non-neuroendocrine neoplasms were excluded from this search. Original glass slides were reviewed, the diagnoses were confirmed (based on histologic appearance, IHC expression profile, and lack of other potential primary site), and the most representative tumor block was identified from each case. Cases excluded after review included multiple grade 3 pancreatic NETs, two pancreas NEC biopsies with insufficient remaining tissue, one SCLC metastatic to the pancreas, and one colonic medullary carcinoma with spurious synaptophysin positivity (negative upon repeat staining).

Two tissue microarrays (TMAs) were constructed with the EZ-TMA Manual Tissue Microarray Kit 3—3 mm × 24 Core (IHC World, LLC, Ellicott City, MD), one containing 13 confirmed GI-NEC and the other containing 20 confirmed P-NEC. As NEC cases are often sampled via biopsy, resulting in a small amount of diagnostic tissue (as were many cases in this study), we included one 3-mm core from each case in the TMAs; however, in 4 GI-NECs and 1 P-NEC with relatively scant available tissue, two cores were included in order to ensure lesional tissue remained present in the resultant TMA sections.

In order to identify potential IHC markers that could help distinguish GI-NEC from P-NEC, the following 21 IHC stains were performed on 4-micron sections cut from both TMAs: AE1/AE3 cocktail (expressed in most carcinomas [22]); CK7 and CK20 (useful in carcinoma localization [22]); synaptophysin, chromogranin, CD56, and INSM1 (neuroendocrine markers [22]); SSTR2A (variably expressed in NETs [22]); CDX2, SATB2, TTF1, Napsin A, PR, GATA3, PAX8, AFP, and ISL1 (often considered site- or system-specific markers [22]); and beta-catenin, SMAD4, Rb, and p53 (often correlating to genetic mutations [22, 23]). Antibody details are provided in Table 1. Staining protocol was as follows:

  • Staining for AE1/AE3, CK7, CK20, synaptophysin, chromogranin, CD56, CDX2 (initial stain), TTF1, PR, GATA3, Rb, and p53 was performed using the Leica Bond III automated staining platform with the Leica Refine detection kit (Leica Biosystems, Wetzlar, Germany).

  • Staining for INSM1, SSTR2A, SATB2, PAX8, ISL1, and SMAD4 was performed using the Dako Envision Plus detection system (Agilent Technologies, Santa Clara, CA, USA).

  • Staining for CDX2 (repeat stain), Napsin A, beta-catenin, and AFP was performed using the Dako Envision Flex HRP detection system (Agilent).

Table 1 Manufacturer and staining details for immunohistochemical stains performed

AE1/AE3, CK7, CK20, synaptophysin, chromogranin, CD56, INSM1, SSTR2A, CDX2, SATB2, TTF1, Napsin A, PR, GATA3, PAX8, ISL1, beta-catenin (nuclear only), and AFP expression were evaluated for intensity (0–3+) and percent of corresponding intensity in each TMA core, and an overall H-score was calculated for each: 3*(percent 3+) + 2*(percent 2+) + 1*(percent 1+). While there is no generally accepted universal H-score cutoffs for positivity and negativity, due to variation among stains and among labs (Dr. Andrew M. Bellizzi, personal communication), for the purposes of this study, a cutoff of 50 was used (namely, 0–49 was interpreted as negative). SMAD4 and Rb were assessed for complete loss of nuclear staining, whereas p53 was assessed for mutant-pattern (strong and diffuse, or null) staining. H-scores were compared between GI-NEC and P-NEC using the t-test, and rates of normal versus abnormal staining for SMAD4, Rb, and p53 were compared using Fisher’s exact test on a 2 × 2 contingency table, with statistical significance set at p < 0.05. All statistical tests were performed using GraphPad online (http://graphpad.com/quickcalcs, GraphPad Software, San Diego, CA, USA, last accessed 3/9/2021).

Once potentially useful markers were identified in the cohort above, 20 additional cases of metastatic NEC foci were identified to create a second, validation cohort. These included 10 metastatic GI-NEC and 10 metastatic P-NEC (based on clinical history and imaging). Four-micron whole-section unstained tissue slides from these cases were stained for CK7, TTF1, beta-catenin, CDX2 (Agilent), and SSTR2A as above, and an H-score was determined for each slide. Statistical analysis was performed on this cohort in the same manner as for the first cohort.

Results

The GI-NECs included 6 small cell and 7 large cell NECs, from the esophagus (n = 2), stomach (n = 1), small bowel (n = 4), and colorectum (n = 6). The P-NECs included 7 small cell carcinomas and 13 large cell NECs. All 33 cases expressed at least one neuroendocrine marker (synaptophysin, chromogranin, CD56, INSM1); among these, synaptophysin had the strongest staining and INSM1 the weakest staining overall. For the five cases with two cores in the TMA, the higher H-score expression was included, but the two numbers were almost always very similar within a case. Only one GI case yielded a situation where the two core H-scores were < 50 and > 50 (for AE1/AE3 and CD56; the “negative” core showed very little lesional tissue).

IHC expression for the 13 primary GI-NEC and 20 primary P-NEC is summarized in Table 2 (leftmost columns). The most strongly expressed IHC markers among the 13 GI-NECs (Fig. 1a) included synaptophysin (Fig. 1b), chromogranin, AE1/AE3, CD56, SSTR2A (Fig. 1c), SATB2 (Fig. 1d), and nuclear beta-catenin (Fig. 1e). CDX2 expression was low in this cohort (mean H-score 43; Fig. 1f), on both on the original CDX2 stain and on a confirmatory CDX2 stain using a different antibody. No cases were positive for TTF1, PR, Napsin A, or AFP. Only one case showed abnormal loss of SMAD4 expression, compared to 6 with abnormal Rb expression and 10 with abnormal p53 expression. The most strongly expressed IHC markers among the 20 P-NECs (Fig. 2a) included AE1/AE3 (Fig. 2b), CK7 (Fig. 2c), TTF1 (Fig. 2d), synaptophysin, chromogranin, and CD56. Napsin A was only convincingly positive in one case. No cases were positive for PR, CK20, CDX2, or AFP. Only one case showed abnormal loss of SMAD4 expression, compared to 11 with abnormal Rb expression and 18 with abnormal p53 expression.

Table 2 GI-NEC and P-NEC staining results in primary and metastatic cohorts
Fig. 1
figure 1

a This colonic neoplasm showed H&E features compatible with a large cell neuroendocrine carcinoma. b All GI-NEC were positive for at least one neuroendocrine marker, confirming the diagnosis. Synaptophysin is shown here. c SSTR2A was positive in most GI-NEC in our study. d Similarly, SATB2 was positive in most GI-NEC. e Nuclear beta-catenin staining was often seen in GI-NEC. f CDX2 was positive in this GI-NEC, but overall was negative in most

Fig. 2
figure 2

a This lung neoplasm was readily diagnosable as a small cell neuroendocrine carcinoma on H&E. Neuroendocrine markers (e.g., synaptophysin) were positive to confirm. b AE1/AE3 cocktail was strikingly positive in this and most P-NEC. GI-NEC was on average less strikingly positive. c CK7 was positive in this and most P-NEC. d Similarly, TTF1 was positive in this and most P-NEC

Markers with significantly different mean H-scores between primary GI-NEC and primary P-NEC included AE1/AE3 (149 vs. 230, p = 0.011), CK7 (30 vs. 209, p < 0.0001), TTF1 (1 vs. 177, p < 0.0001), SSTR2A (123 vs. 14, p = 0.0021), SATB2 (106 vs. 35, p = 0.018), CDX2 (43 vs. 0, p = 0.019), and nuclear beta-catenin (106 vs. 23, p = 0.029).

SSTR2A expression was highest in foregut GI-NEC (n=3), with a mean H-score of 257, which was significantly higher than that in midgut (n = 3; mean H-score of 13, p = 0.002). SSTR2A levels were comparable between foregut and hindgut (n = 7), and between midgut and hindgut. There were no significant differences in SATB2 expression levels across foregut, midgut, and hindgut. We also compared H-score expression of all the markers in small cell vs. large cell GI-NEC and in small cell vs. large cell P-NEC. SATB2 demonstrated higher expression in large cell carcinoma in both the GI tract (median H-score 176, vs. 8 for small cell, p = 0.0017) and the lung (median H-score 54, vs. 0 for small cell, p = 0.0498); other markers showed no significant difference (all p > 0.05).

Based on the above findings and on relative antibody availability (i.e., most labs offer CK7, TTF1, CDX2, and beta-catenin, while SSTR2A is offered in fewer labs), we created a three-step algorithmic approach to IHC workup of NEC (Fig. 3). In step 1, a case is stained for CK7 and TTF1. Positive staining for both suggested P-NEC, with 65% sensitivity and 100% specificity in our initial cohort of primary cases. If either stain is negative, then in step 2, a case is stained for beta-catenin and CDX2. Positive staining for either suggested GI-NEC, with 62% sensitivity and 86% specificity in our initial cohort of primary cases. If both stains are negative, then in step 3, a case is stained for SSTR2A. A positive result suggested GI-NEC, with 60% specificity and 100% sensitivity in our initial cohort of primary cases; leftover cases are most likely P-NEC. This algorithm correctly localized 30 of the 33 primary cases (accuracy of 91%). The 3 mis-localized cases included one small cell GI-NEC negative for all 5 stains, one large cell GI-NEC negative for all 5 stains, and one large cell P-NEC negative for everything but nuclear beta-catenin. While AE1/3 and SATB2 showed significant differences in staining between GI-NEC and P-NEC, the former was not included in the algorithm because the mean H-score was high (i.e., positive) in both malignancy types, and the latter was not included because it is not widely available and because pairing it with SSTR2A in the final step did not improve overall accuracy of the algorithm.

Fig. 3
figure 3

An algorithmic approach for differentiating between P-NECs and GI-NECs using IHC markers based on study findings. Sensitivities and specificities provided apply to the entire study cohort of 53 cases, not the smaller cohorts of only primary or only metastatic cases. * indicates only nuclear staining for beta-catenin should be considered a positive result

The second cohort of 20 metastatic cases (10 GI-NEC and 10 P-NEC) was assessed for CK7, TTF1, CDX2, nuclear beta-catenin, and SSTR2A. Not all five markers showed significantly different expression in GI-NEC vs. P-NEC in this cohort (Table 2, middle columns), though when the H-score results of the primary and metastatic lesions were combined, all five markers retained statistical significance across the entire cohort of 53 cases (Table 2, rightmost columns). The above algorithm correctly localized 17 of the 20 metastatic cases (accuracy of 85%). The 3 mis-localized cases included a large cell GI-NEC that was positive for CK7 and TTF1, a small cell GI-NEC positive for CK7 and negative for everything else, and a small cell GI-NEC negative for all 5 stains.

When considering the entire cohort of 53 cases, our algorithm properly localized 47 (accuracy of 89%). As provided in Fig. 3, the initial step of CK7 and TTF1 staining had a sensitivity of 67% and specificity of 96% for P-NEC overall, the second step of beta-catenin and CDX2 staining had a sensitivity of 54% and specificity of 90% for GI-NEC overall, and the third step of SSTR2A staining had a sensitivity of 56% and specificity of 100% for GI-NEC overall. Of note, three of the six cases mis-localized by this algorithm were GI-NEC negative for all 5 markers. Therefore, if the stepwise algorithm indicates a case is likely P-NEC at the third step, a GI-NEC should still be considered if both CK7 and TTF1 were negative.

Discussion

Several prior studies have investigated the utility of IHC in determining site of origin for NEC [20]. Reported positivity rates have varied widely for most proteins, likely due to different cutoff thresholds and different laboratory conditions (including different antibodies). As NEC is rare in most organ systems, many studies have also been relatively small in size (as is ours), which may also contribute to variation in reported positivity rates. For example, reported TTF1 positivity rates in studies examining at least 20 SCLC have ranged from 57 [24] to 100% [25]; our rate in this study was 71% (5 of 7). TTF1 has also been reported as positive in small cell NEC from other sites, including prostate and cervix [26, 27]. Reported rates are generally lower for pulmonary LCNEC, ranging from 24 [25] to 48% [28]. La Rosa et al. [29] and Masai et al. [30] reported different TTF1 positivity rates in LCNEC with different antibody clones (SPT24 giving higher rates than 8G7G3/1; our study used the latter). Napsin A is rarely positive in P-NEC, as observed in our study and others [31, 32].

CDX2 is usually used to confirm gastrointestinal origin in malignancies, and some authors have reported high rates of positivity in GI-NEC. This includes Barbareschi et al. [33], who reported CDX2 positivity in 13 of 16 (81%) GI-NECs. However, La Rosa et al. [34] reported weak CDX2 staining in 3 of 27 (11%) GI-NECs, along with one of 14 (7%) P-NECs. Similarly, only one of our GI-NEC expressed strong CDX2 IHC; the stain was repeated with a different antibody, yielding similar results. Our metastatic GI-NEC similarly showed minimal CDX2 expression. CDX2 is reportedly expressed in half of NECs of the uterine cervix as well [35].

Along these lines, Lee et al. [36] performed TTF1, CDX2, and ISL1 IHC on 38 gastroenteropancreatic NECs and 36 P-NECs. They found that the former were more likely to express CDX2, and the latter were more likely to express TTF1 and ISL1. (We had analogous results for TTF1 and CDX2, though we found no significant difference in ISL1 expression between GI-NEC and P-NEC.) Cheuk et al. also reported that 43 of 52 (83%) P-NECs in their study were TTF1 positive, though 8 of 15 (53%) GI-NECs were as well [37].

SATB2 has relatively recently come into use as an IHC marker indicating gastrointestinal origin [38, 39]. Li et al. [40] reported positive SATB2 staining in 17% of foregut, 12% of midgut, and 90% of hindgut NETs; our study did not find significant SATB2 staining differences among NECs in these sites. Bellizzi recently reported that SATB2 was expressed in 60% of extrapulmonary visceral NECs, compared to 33% of P-NECs, though highest expression was seen in Merkel cell carcinoma (79%) [19, 21]. Similarly, we found SATB2 positivity in 54% of our GI-NECs and only 25% of our P-NECs (p = 0.018). It was therefore more reliable than CDX2 in our study to indicate a diagnosis of GI-NEC, though the stain is less widely available. We also observed that SATB2 was the only marker we tested that stained significantly stronger in one form of NEC than the other (namely, more common in large cell than small cell, for both GI-NEC and P-NEC).

TTF1, CDX2, and SATB2 are members of the superfamily of homeobox genes. Other examples utilized in our study include ISL1 and PAX8. ISL1 was originally described as a marker for pancreatic NETs but was subsequently demonstrated in NETs and NECs of various sites, including Merkel cell carcinoma, SCLC, and head and neck NECs [41]. As above, Lee et al. showed that ISL1 was more frequently expressed in P-NECs than in GI-NECs [36]. The data on PAX8 expression in NECs appear limited and inconsistent.

CK7 was preferentially expressed in P-NEC in our study, though this has undergone little investigation in the literature. Only two of our GI-NECs were positive for CK7; one was an ampullary LCNEC that also expressed CK20 weakly (H-score of 40). Accordingly, Nassar et al. [42] reported that 87% of ampullary NECs express CK7, and 38% express CK20. To our knowledge, these keratin markers have not been systematically evaluated in GI-NECs of other sites. While most of our GI-NECs were from the colorectum (including the other CK7-positive case) or the small bowel, both esophagus cases and the gastric case were CK7-negative. Nearly all NECs expressed AE1/AE3 in our study, though mean H-score was higher among P-NECs. AE1/AE3 expression has not been well studied in GI-NEC, though it has been reported as positive in 88% of SCLCs [43]. It is unclear why P-NECs expressed AE1/AE3 more strongly than GI-NECs in our study, as well as why some cases demonstrated weak or no staining.

It has been suggested that poorly differentiated GI-NECs exhibit reduced SSTR2A expression [44]. However, we found IHC positivity for SSTR2A in nearly half (6/13, 46%) of our GI-NECs. One other study has evaluated SSTR2A in GI-NEC, finding positivity in 2 of 9 (22%) colorectal NECs [45]. Only one of our P-NECs (5%) was positive for SSTR2A, a significantly lower rate than for our GI-NEC (p = 0.0021). Other authors have reported higher rates (36% [46] and 68% [47]) of SSTR2A positivity in P-NECs.

A few other markers we investigated have not been thoroughly studied in NEC in the literature. These include INSM1, a somewhat recently publicized neuroendocrine marker [22] that was moderately positive in both GI-NECs and P-NECs in our study, and beta-catenin, which often shows nuclear positivity in colorectal adenocarcinomas [48] and accordingly was a useful marker in our study to distinguish GI-NEC (sometimes positive) from P-NEC (rarely positive). We are not aware of other studies evaluating beta-catenin for NEC localization.

Finally, mutations in TP53 and RB1 [49,50,51] have frequently been reported in P-NECs, and accordingly, p53 and Rb IHC expression was frequently abnormal in the P-NECs in our study and in others [52]. This aberrancy was observed at a similar rate in our GI-NECs.

It should be noted that some prior studies were performed when a different definition of NEC was generally accepted outside the respiratory system; prior to the 4th Edition of the WHO Classification of Tumours of Endocrine Organs, published in 2017 [53], any digestive tract neuroendocrine neoplasm that was grade 3 by mitotic rate or Ki67 index was considered a poorly differentiated NEC, even when resembling a well-differentiated NET histologically [54]. Such tumors are now considered grade 3 NET. Some earlier studies may have included both in their cohorts, introducing a small but potentially confounding factor into their results. All cases in our study were carefully confirmed as NEC using current criteria.

Based on our findings, CK7 and TTF1 combined have a high specificity for P-NECs, and CDX2, nuclear beta-catenin, and SSTR2A staining are decently specific for GI-NECs. Using this information, we designed an algorithm for differentiation between P-NECs and GI-NECs. This had a high accuracy both in the original cohort of 33 primary lesions and in a confirmatory cohort of 20 metastatic lesions. While there were some outliers in both cohorts, this approach remains practical, as 4 of the 5 markers are readily available in most IHC labs.

Traditional therapy for NECs of any site (most notably SCLC) consists of platinum/etoposide [55]. However, site of origin is becoming increasingly relevant in treatment of NECs (Dr. Andrew M. Bellizzi, personal communication). This is most notable for Merkel cell carcinomas, where checkpoint inhibitor monotherapy is considered the preferred first-line treatment [56, 57]. However, both the National Comprehensive Cancer Network and the European Neuroendocrine Tumor Society recommend therapies such as FOLFOX and FOLFIRI (typically employed in colorectal adenocarcinoma) and capecitabine/temozolomide (typically employed in pancreatic NETs) as potential treatment options for extrapulmonary NEC but not for pulmonary NEC [58, 59]. Such approaches are bolstered by recent molecular discoveries and classifications, which indicate that NECs are often genetically similar to adenocarcinomas of the same site [60]. These recommendations and categorizations are evolving, and a handful of pertinent clinical trials are ongoing, but determining NEC site of origin is necessary in order to select patients appropriately for such clinical trials in order to further explore this frontier of treatment for NEC patients.

Our study was designed to be narrow and focused in scope, which resulted in several self-imposed limitations. First, we excluded NECs at other, less common, sites (e.g., endometrium) in order to perform a clean, direct comparison between GI-NEC and P-NEC. We also did not include NEC of the skin (namely, Merkel cell carcinoma) or well-differentiated NETs of any site. Including such cases would likely have yielded more convoluted results, and the additional subgroups would have been small as well, since NECs in general are rare.

Second, we used a TMA to evaluate the primary NEC cases. As a result, lesions with significant immunohistochemical heterogeneity could have been misrepresented. This approach was undertaken to identify key potential markers, which were then validated on a cohort of metastatic NEC; we used whole tissue sections to evaluate this second cohort, and our proposed algorithm performed similarly in those cases. Therefore, the use of a TMA does not appear to have significantly skewed our findings.

Third, our study utilized the H-score, which has been used to semi-quantitate expression of protein biomarkers and which offers a broad dynamic range against which clinical outcomes might be measured [61]. H-scores are admittedly utilized more in research settings than in daily diagnostic practice, where markers with clear and striking differences in positivity and negativity are the most practical for use. Fortunately, such differences were found in three of the five markers in our algorithm (TTF1, beta-catenin, SSTR2A), wherein NECs from one location were usually strikingly positive and NECs from the other location totally negative. CDX2 was not often positive in GI-NEC but was often striking in positive cases, and it was totally negative in all P-NECs. CK7 was usually negative in GI-NEC but was strongly positive in six of them, and for this reason it was paired with TTF1 in our algorithm. Therefore, the H-score helped in devising our algorithm but is not required when employing it.

Finally, we included small cell and large cell NECs from both the GI tract and the lung in our cohorts, as GI-NECs are sufficiently rare that we could not have otherwise compiled a cohort of reasonable size. As shown above, the H-scores for the IHC markers used in our algorithm did not significantly differ between small cell and large cell GI-NEC or P-NEC.

While most of the IHC information provided in this manuscript is not novel in isolation, we feel our algorithm has utility given the tools currently available to practicing pathologists, and hopefully it lays the groundwork for additional investigation into an increasingly clinically relevant matter. Improved localization of metastatic NECs will likely require additional markers that are currently unknown or uninvestigated in this setting.

In conclusion, we found that IHC for AE1/AE3, CK7, TTF1, CDX2, beta-catenin, SSTR2A, and SATB2 can be utilized to help distinguish GI-NEC from P-NEC, which may be helpful in the workup of metastatic lesions. Given the varied rates of positivity reported in the literature for these markers in GI-NEC and P-NEC, it appears generally advisable to use a panel containing multiple markers rather than rely on one result for diagnosis, as demonstrated in our algorithm utilizing most of these markers. If possible, individual labs may also wish to test available in-house cases to determine how these markers perform under local conditions.