Introduction

One of the essential issues in oncology is to predict tumor progression and patient survival. Traditionally, tumor type, stage, and mitotic activity are the main prognostic criteria. Tumor growth is correlated with mitotic activity; however, because of technical problems (e.g., section thickness, staining, sample size), it is often difficult to determine the mitotic count of a tumor exactly. This is much easier using the proliferation marker Ki67, which has therefore gained increasing significance in recent years, particularly in slowly growing tumors such as gastroenteropancreatic neuroendocrine neoplasms (GEP-NENs). The predictive value of Ki67 depends very much on the precise assessment of the marker’s immunohistochemical expression in tumor tissues. This review briefly reflects on the history of Ki67 and its inclusion into the GEP-NEN classifications and focuses then on the various issues in conjunction with the accurate and reproducible determination of Ki67 as predictive indicator of tumor growth and prognosis.

Short history of Ki67

In 1983, Gerdes, Schwab, Lemke, and Stein reported upon a new monoclonal antibody called Ki67, which was generated by immunizing mice with nuclei of the Hodgkin lymphoma cell line L428. This antibody was found to label nuclei of proliferating cells including tumor cells and recognizes a nuclear protein (encoded by the MKI 67 gene) which is involved in nuclear remodeling during proliferation and is thought to control heterochromatin organization and enable mitotic chromosomes to move independently from each other [11, 52]. However, as the exact function of the protein is still not fully understood, the initial name Ki67 is kept. This name derived from the city of Kiel (Ki) where Gerdes and his group worked in the university departments of pathology and biochemistry, and the number 67 referred to the original clone in the 96-well plate. In their report, the authors concluded that this antibody may be a potent tool for easy and quick assessment of the proportion of proliferating cells in a tumor [12]. The original antibody worked only on frozen tissues and was therefore subsequently replaced by antibodies, such as the monoclonal MIB-1, which were found to react with Ki67 in formalin-fixed and paraffin-embedded tissues. Using the MIB-1 antibody, it was possible to perform large-scale studies in archive material to assess the proliferative activity in different neoplasms [21, 49].

The systematic assessment of the proliferative activity of pancreatic neuroendocrine neoplasms (PanNENs) started in the 1990s [23, 38, 62], and the first two studies using the MIB-1 antibody on formalin-fixed and paraffin-embedded PanNENs were published in 1996 by two independent Italian groups [23, 38]. As it became apparent that the Ki67 index, along with the mitotic count, may become a determinant of the prognosis of GEP-NENs, the Ki67 index was included as prognostic parameter into the WHO classifications of the gastrointestinal (in 2000) and pancreatic NENs (in 2004). Both classifications originated from a classification concept proposed by Capella, Solcia, Heitz, Höfler, and Klöppel in 1995 [6, 19]. The subsequent WHO classification of GEP-NENs [43], that was published in 2010, included a three-tiered grading system, mainly based on the Ki67 categories, defined by the European Neuroendocrine Tumor Society (ENETS) in 2006 and 2007 [45, 46] (Fig. 1). Since then, the Ki67 index has become essential for grading GEP-NENs and has been firmly established in the reporting protocols and guidelines used by ENETS [33], the North American Neuroendocrine Tumor Society (NANETS) [60], and the American Joint Committee on Cancer (AJCC) [28]).

Fig. 1
figure 1

Timeline of the evolution of the classifications of the gastroenteropancreatic neuroendocrine neoplasms and the introduction of a Ki67 proliferative index as a prognostic marker. PanNET/NEN pancreatic neuroendocrine tumor/neoplasm

Calculation of the Ki67 index

The monoclonal antibody Ki67 identifies a 359-kD non-histone nuclear protein that is involved in the control and timing of cell proliferation and expressed in all phases of the cell cycle, with a maximum in the G2 and M phases. This implicates that the mitotic counts are distinctly lower than the Ki67 labeling, depending on the number of cells which are in the G1 phase. Exact figures do not yet exist, but the ratio may be about 1:10 in a given area. During its expression, the Ki67 antigen is redistributed from the interior of the nucleus or the nucleolus to the perichromosomal layer and heterochromatin in the nucleus [39] (Fig. 2). For the calculation of the Ki67 index, all the immunohistochemically labeled nuclei, regardless of the staining intensity or whether the nuclei show a speckled expression pattern or are diffusely stained, need to be counted for the scoring process [1]. The number of stained nuclei is then expressed as a percentage (index) of immunoreactive cells. It is recommended to count between 500 and 1000 tumor cells in the highest labeled area (hot spot). Among the methods used to determine the Ki67 index, a manual count of a camera-captured, printed image (CCPI) has appeared to be the most reliable procedure [41, 57] showing good reproducibility, although the whole evaluation process takes on the average between 10 and 15 min (Fig. 3). This recommendation is based on the results of a study comparing CCPI with three other counting methods. “Eye-ball” estimation was the fastest method (average time < 1 min) but with the poorest reliability and reproducibility. Manual eye count proved to be a rather quick way to determine the Ki67 index, averaging between 5 and 8 min but had also a poor reproducibility. Automated count was the most expensive and least practical method with major impact on turnaround time (limited by the accessibility of machine and personnel) but, more importantly, had significant inaccuracies in over-counting nonendocrine cells such as lymphocytes, endothelial cells, and stromal cells. Also molding of tumor nuclei, overly thick sections and background pigment may contribute to miscalculations [41]. However, these statements are probably not the “last word,” since in a paper by Tang, automated counting was shown to have comparable accuracy as counting on camera-captured printed images [57].

Fig. 2
figure 2

Ki67 immunostaining of nuclei in a pancreatic neuroendocrine carcinoma: during cell cycle, Ki67 labels initially the nucleolus and is then redistributed to the entire nuclear area

Fig. 3
figure 3

Precise assessment of the Ki67 index is achieved by manually counting unlabeled and labeled nuclei on a camera-captured, printed image. In this picture, a total of 1581 cells were counted and divided by 81 labeled cells, resulting in a Ki67 index of 19.6%

Problems in assessing Ki67 labeling

During recent years, immunostaining procedures for Ki67 have generally been improved and standardized by the wide use of automated staining machines. However, there may be still differences between laboratories, since it was recently shown that interlaboratory variability in tissue processing and fixation, including the use of different reagents and pretreatments, may be the reason for interlaboratory differences [5], especially in low-proliferating tumors. It can be expected that commercially available staining optimizers and external standardization of the staining protocols by quality measures may help to further improve reliability and reproducibility.

Other problems in assessing Ki67 staining and calculating a reproducible Ki67 index are loss of antigenicity over time, intratumoral heterogeneity, and the interpretation of “pale brown” tumor nuclei as positive. Ki67 may lose its antigenicity with tissue age in paraffin-embedded formalin-fixed blocks, with a 10% signal loss in staining intensity over a period of 4.5 years [9]. Intratumoral heterogeneity may be observed in an individual tumor or among different metastatic sites, or may become obvious in higher grade metastasis developing in the course of disease progression (Fig. 4). This problem is endogenous to tumors and can only be solved by the availability of sufficient tumor tissue for evaluation. The problem of interpreting “pale staining” nuclei as positive is difficult to solve, as it will always remain to some extent a subjective issue. However, if in a slide with a Ki67 immunostaining of high quality, all nuclei with sharp borders are counted that display an intense or less intense, but homogeneous or speckled staining pattern, the counts have an excellent reproducibility in our experience.

Fig. 4
figure 4

Ki67 immunostaining reveals intratumoral proliferative heterogeneity in a lymph node metastasis from a G1 ileal neuroendocrine tumor, with a G1 area in the upper half of the picture and G3 area at the bottom (Courtesy of Dr. Silvia Uccella, Department of Medicine and Surgery, University of Insubria, Varese, Italy)

Intratumoral heterogeneity is another issue of the reliability of a Ki67 index in liver metastases from PanNETs when diagnosed by a radiologically guided core needle biopsy. However, it has been shown that Ki67 staining of core biopsies usually provides an adequately reliable method of proliferation assessment for prognosis of metastatic NETs to the liver [64].

Ki67 index and its prognostic significance in pancreatic NENs

In recent years, it has been shown that Ki67 index is the most significant factor in the prognostic evaluation of PanNENs. This has been revealed by a number of studies using the 2010 WHO classification [34, 44, 50] which, on the basis of the proliferative activity, grades well-differentiated PanNENs, called pancreatic neuroendocrine tumors (PanNETs), into either G1 PanNETs (Ki67 index ≤ 2%) or G2 PanNETs (Ki67 index 2–20%), and poorly differentiated PanNENs, called pancreatic neuroendocrine carcinomas (PanNECs), into G3 category with a Ki67 index > 20% [43]. In one of these studies, it was suggested to use a cut-off of 5% for the G1 PanNETs, since a higher risk of progression was observed when 5% was used instead of 2% [50]. However, there is so far not sufficient evidence of differences in clinical management based on this higher cut-point to justify changing it.

Recently, the distinction between well and poorly differentiated PanNENs applying the 2010 WHO classification has become difficult for those few well-differentiated PanNENs whose Ki67-index exceeds 20%. Although these tumors retain their well-differentiated neuroendocrine growth pattern, their Ki67 index greater than 20% classifies them into the G3 category of poorly differentiated neuroendocrine neoplasms (NEC) [4]. Though the prognosis of these tumors is worse than that of G2 PanNETs, it is still better than that of PanNECs. Moreover, they retain the features (i.e., hormone expression and hormonal syndromes) characterizing PanNETs and appear to lack genetic abnormalities (i.e., changes in expression and mutation of TP53 and RB1) associated with PanNECs [56, 63]. In the new 2017 WHO classification, these PanNENs therefore constitute a new category, called PanNET G3 (Table 1). Their clear distinction from PanNECs is currently the topic of a number of articles [4, 20, 29, 56].

Table 1 2017 WHO classification of pancreatic neuroendocrine neoplasms

Ki67 index and its prognostic significance in gastric NENs

Gastric NENs are a heterogeneous group of tumors showing different clinicopathological features and behavior [25]. They include NETs arising in both the oxyntic and antral mucosa as well as NECs and mixed adenoneuroendocrine carcinomas (MANECs) which have been recently proposed to be renamed as mixed neuroendocrine-nonneuroendocrine neoplasms (MiNENs) [24]. With the exception of very rare antral NETs, that produce gastrin, somatostatin, or serotonin, all other gastric NETs arise in the oxyntic (corpus) mucosa and are composed of histamine-producing enterochromaffin-like (ECL) cells [25]. ECL cell NETs are a heterogeneous group of neoplasms in terms of pathogenesis, biology, and prognosis. In 1993, the group of Rindi, Capella, and Solcia, considering the morphology of the peritumoral oxyntic mucosa, the presence of antral G cell hyperplasia, the presence of hypergastrinemia, the setting of MEN1, and the presence of hypo/achlorhydria, divided ECL cell NETs into the following three categories: type 1, type 2, and type 3 [47]. These different types of ECL-NETs are known to show per se different clinical and, more importantly, prognostic features [22, 42], with type 1 NETs showing the best prognosis and type 3 the worst. As for other GEP sites, gastric NENs are graded using Ki67 labeling and mitotic count. However, it has been demonstrated that Ki67 is more sensitive in identifying intermediate grade gastric NENs than mitotic count [22], since counting of mitoses has its limits in small biopsy specimens, that contain insufficient tissue to evaluate 10HPF. For this reason, tumor grading is mainly based on Ki67 index and this correlates well with prognosis. An even better Ki67 cut-off to distinguish G1 from G2 neoplasms seems to be 3% [22]. Further improvement of the prognostic stratification of patients is achieved, if grading is combined with Rindi’s clinicopathological classification [22, 36, 47]. Both approaches should therefore be used for proper diagnostic assessment of a gastric ECL cell NET, especially in a small biopsy.

The majority of G1-NETs of gastric type 1 have a good prognosis and require only endoscopic treatment, when they are small, confined to the mucosa, and arising in the setting of type A chronic atrophic gastritis. The clinical relevance of the few type 1 ECL cell NETs showing a Ki67 index > 3% is still not clear. Grozinsky-Glasberg demonstrated that metastatic type 1 ECL cell gastric NETs showed an average Ki67 labeling index of 6.8% which was statistically higher than the average Ki67 index (1.9%) of the whole series used as control [14]. However, the Ki67 index alone does not seem to be the most important parameter determining the metastatic potential, as this tumor property is also closely related to tumor size and depth of tumor infiltration of the gastric wall [14, 21].

Gastric type 2 NETs that arise in MEN1-ZES patients, and especially gastric type 3 NETs, that occur sporadically in normal gastric mucosa, show a more aggressive biology and often need, depending on tumor stage, further treatment including gastric resection [51].

The stomach, like the pancreas, also harbors NETs whose Ki67 index exceeds 20%. These peculiar tumors, which in adaptation of the recently proposed nomenclature for PanNETs may be called gastric G3-NETs, were only found among gastric type 3 (sporadic) ECL cell NETs and showed a prognosis that was worse than in “conventional” G2-NETs, but still better than in gastric poorly differentiated NENs that have been called NECs [22] or gastric type 4 NENs [18]. NECs and MiNENs (MANECs) arise more frequently in the cardial or antral region [25] and show similar morphological, immunohistochemical, and prognostic features to those of their counterparts arising in other organs of the digestive system.

Ki67 index and its prognostic significance in duodenal NENs

Duodenal NENs are a heterogeneous group of tumors including well-differentiated and poorly differentiated neoplasms that differ in terms of clinical presentation, localization, and immunophenotype [18]. In a large series of duodenal NENs, recently published by Vanoli and collaborators, four different groups were distinguished among the well-differentiated neoplasms including gastrinomas (associated with the Zollinger-Ellison syndrome [48]), nonfunctioning NETs (lacking any association with endocrine symptoms but expressing gastrin or somatostatin [48]), ampullary nonfunctioning somatostatin-producing NETs, and gangliocytic paraganliomas [18, 59]. Using a cut-off of 2.5% for distinguishing G1 and G2 tumors, the Ki67 grading failed to separate the four tumor groups but was found to be a predictor of lymph node metastasis. Moreover, at univariate analysis, the Ki67 index was well associated with disease-specific survival, but at multivariate analysis failed to discriminate between the disease-free survival of G1 and G2 tumors. From this study, it appeared that a multiparametric approach including NET size, site, and proliferative activity is useful to identify the metastatic cases but has limited significance for disease-specific survival.

Ki67 index and its prognostic significance in ileal and appendiceal NENs

The vast majority of ileal NENs are serotonin-producing NETs, while NECs are extremely rare. Ileal NETs are peculiar tumors because of their ability to metastasize early to regional lymph nodes and/or the liver despite a low proliferation index (most tumors are G1). It is therefore conceivable that tumor grading fails to predict the metastatic potential of these tumors [7, 8, 13, 26, 32, 35]. However, tumor grading, mainly based on Ki67 proliferation, has been demonstrated to be correlated with prognosis either by using the cut-offs proposed by ENETS/WHO or the cut-offs of 1% [2, 10]. Panzuto has demonstrated that Ki67 grading was statistically associated with tumor progression and patients’ survival, but at multivariate analysis, the best predictive Ki67 cut-off in discriminating G1 versus G2 tumors was 5% [35]. Interestingly, these authors also calculated the increasing risk for tumor progression and tumor death for each increasing Ki67 unit which was 14 and 18%, respectively. This underlines the biological concept that Ki67 should be considered as a continuous variable and the calculation of prognostic risk for each increasing Ki67 unit may be superior to the separation of tumors using fixed Ki67 categories. Although the prognostic classification of ileal NETs based on individual parameters such proliferative activity alone has been demonstrated to have a clinical power [3, 35, 54], a new multiparametric approach including a NET nomogram seems more promising for improved prognostic stratification of patients [8, 30].

Appendiceal NETs are the most peculiar tumors among the intestinal NENs. Despite frequent infiltrative growth into the sub serosal tissues, lymph node metastases are rare and liver metastases are virtually absent. Patients with these tumors, among them many children, have an excellent outcome after appendectomy. The Ki67 labeling index which is generally low has no predictive power regarding stage and outcome [61]. For this reason, therapeutic strategies exceeding appendectomy and including hemicolectomy should not be based on tumor grading but rather on tumor stage [37].

Ki67 index and its prognostic significance in rectal NENs

Rectal NENs include both well and poorly differentiated neoplasms. Their number increased significantly in recent decades, probably due to the increased use of diagnostic colonoscopy [40, 51]. Rectal NECs are highly aggressive cancers with dismal prognosis and show the same clinicopathologic features of NECs at other sites. Rectal NETs show a broad range of clinical features ranging from indolent and asymptomatic to aggressive and metastatic cases. Most rectal NETs are composed of L cells producing glicentin and/or PP or PYY [18]. Several papers demonstrated that tumor size and level of wall invasion are prognostic factors, so that small (< 10 mm) mucosal/submucosal rectal NETs can be treated by endoscopic polypectomy alone [40]. Grading rectal NETs on the basis of their Ki67 index reveals that about 90% of the tumors falls into the G1 category [15,16,17, 27, 31, 53, 58] and shows a better survival than G2 NETs [16]. Recently, a Ki67 cut-off of 3% was proposed, as it seems to be better in predicting metastatic dissemination than a cut-off of 2% [55]. Tumor grading has been proved to be a prognostic marker in univariate analysis [16, 53], but it was not an independent factor at the multivariate analysis [53]. It has therefore been suggested that the best approach for stratifying patients into different prognostic categories is a multiparametric evaluation considering grade together with tumor size, lympho-vascular invasion, level of wall infiltration, and immunophenotype (L cell versus EC cell NET) [16, 53]. G1 rectal NETs with a size less than 10 mm, absence of lympho-vascular and muscular layer infiltration, and L cell phenotype require only endoscopic resection, while larger G2 NETS, especially when of EC cell type and deeply infiltrating the rectal wall in the presence of lympho-vascular invasion, need surgical resection.

Conclusion and perspectives

During the last 20 years, Ki67 labeling and grading have become essential for prognostic assessment of many GEP-NENs (Table 2). The cut-offs currently proposed by ENETS/WHO to separate the G1, G2, and G3 NEN-categories work well and are used together with tumor type, site, and stage, to stratify patients in different prognostic categories. In future, however, it can be anticipated that the Ki67 cut-offs will be refined in relation to tumor type, its location, and its response to therapy. For foregut and hindgut NENs, including gastric, duodenal, pancreatic, and rectal neoplasms, the best Ki67 cut-off in distinguishing G1 and G2 NETs seems to be 3%, while for ileal NETs it is 5%. In addition, it may be worthwhile to consider the use of multiparametric approaches which include the Ki67 index and combine it with other clinicopathologic parameters. Finally, the biological view of Ki67 expression as a continuous variable in the proliferation process should be taken into account and may lead to a new approach to predict the outcome of the patients by calculating the prognostic risk for each increasing Ki67 unit. This approach could then replace the separation of tumor categories on the basis of “a priori” fixed Ki67 cut-offs that are currently in use.

Table 2 Prognostic relevance of the Ki67 index in gastroenteropancreatic neuroendocrine neoplasms (NENs), based on the grading systems proposed by the 2010 and 2017 WHO classifications