Introduction

Neuroendocrine tumors (NETs) have a relatively low incidence but an increasing frequency and prevalence. Their often indolent clinical course coupled with delayed detection can culminate in a late diagnosis, with metastatic progression [1, 2]. Limitations in imaging resolution as well as in the sensitivity and specificity of current blood and tissue biomarkers accentuate the clinical problem [3]. As a consequence, treatment outcomes have remained disappointing [4, 5]. Response to therapy is principally based on radiological criteria, which are challenging in “indolent” lesions since early measurable changes in tumor size are rare [68]. Alternative criteria, like attenuation measurements on CT (Choi criteria) initially proposed for gastrointestinal stromal tumors (GISTs), have been evaluated in NETs [9] but remain to be validated. Combinations of morphological and functional techniques, with 68Ga-somatostatin analog (SSA) positron emission tomography (PET)/CT, are also under consideration [10, 11].

Circulating biomarkers, e.g., chromogranin A (CgA), are currently used despite the lack of reproducibility and high variability [3, 12]. The prognostic value of histopathological parameters, e.g., Ki-67 index, has been validated in a number of studies and is generally accepted as the best currently available histopathological parameter for assessment of tumor grading [13]. However, the intratumoral heterogeneity in proliferative rates and the clinical difficulties related to repeated assessments remain as limiting factors for Ki-67 [14]. An alternative to the measurement of single analytes is the novel strategy of multianalyte assays with algorithmic analyses (MAAA) [15]. In this respect, a polymerase chain reaction (PCR)-based blood biomarker test that captures the expression levels of the spectrum of transcripts has been described [16]. This assay defines the circulating “fingerprint” of a NET [17] and exhibits a high sensitivity and specificity (98 and 97 %, respectively) for identifying NETs. The assay is standardized and highly reproducible (inter- and intra-assay coefficient of variation <2 %) and is independent of tumor heterogeneity [18].

An amalgamation of biomarker and imaging technologies may provide an opportunity to optimize accurate detection and early determination of disease progress [19, 4]. We hypothesized that imaging and biomarkers may capture similar biological information and that combinations of this information could be of clinical utility. We therefore evaluated the concordance of two of the most sensitive and specific methods of NET diagnosis, 68Ga-SSA PET/CT and the blood-based PCR assay (NETest) in NET disease and assessed whether they correlated with tissue and blood biomarkers. Our aims were to: (1) evaluate whether the circulating gene transcripts correlated with imaging parameters (the maximum standardized uptake value, SUVmax), (2) identify the relationship between gene expression levels or SUVmax and circulating CgA, and (3) examine if the NETest was a surrogate marker for tumor proliferation (Ki-67). Additionally, we determined whether a combination of imaging and the NETest provided an index that had clinical utility for the assessment of disease progression.

Materials and methods

Patients

Two independent groups of metastatic NETs that demonstrated measurable disease by 68Ga-SSA PET/CT were studied. Data set 1 (n = 27) included patients who were referred before (0–30 months, median 3) peptide receptor radionuclide therapy (PRRT), which was performed as primary or salvage treatment (IEO Milan, IRST Meldola; Table 1A). Data set 2 (n = 22) included patients who were referred for initial disease staging or for restaging after various therapies (Charité University, Berlin; Table 1B). Information in Table 1 included Ki-67, the tissue and method utilized for Ki-67 assessment (surgery, biopsy, or cytology), grading, tumor functionality, disease duration, and performance status according to the Eastern Cooperative Oncology Group (ECOG) [20]. All patients provided informed consent for the translational analysis, which was authorized by the local Ethics Committees [Authorization IRST-70/12 (data set 1) and EA2_064_09 (data set 2)]. Status at baseline was assessed according to RECIST1.1 criteria [21]. The demographics and disease characteristics of the entire group are included in Table 2.

Table 1 Patients’ characteristics
Table 2 Baseline demographic and disease characteristics of the entire group

Somatostatin receptor (SSTR) imaging

68Ga was eluted from 68Ge/68Ga generators and labeled with DOTATATE, DOTATOC, or DOTANOC. 68Ga-SSA PET/CT was performed following the European Association of Nuclear Medicine (EANM) Guidelines [22].

Image acquisition

For Data set 1 (68Ga-DOTATOC), a GE Discovery 600 (#1), Discovery ST (#2), Discovery LS (#3), or Discovery STE (#4) PET/CT scanner was used (GE Healthcare, Milwaukee, WI, USA). The acquisition protocol included a low-dose CT (120 kV, 80 mA, 0.8 s/rotation, 1.35 pitch, 3.75-mm slice thickness) for attenuation correction followed by the whole-body PET scan (5–6 beds/3 min each). Scanners #1, #2, and #4 acquired in 3-D mode (256×256 matrix, VUE point attenuation-weighted ordered subset expectation maximization (OSEM) algorithm, smoothing Gaussian filter), while scanner #3 acquired in 2-D mode (128×128 matrix, same reconstruction). For data set 2 (17 68Ga-DOTATATE, 4 68Ga-DOTATOC) two different scanners were used. A Biograph 16 (Siemens AG, Erlangen, Germany; 120 kV, 230 mAs, 5–6 bed position/3 min, 0.75 mm collimation, and 0.75-mm slice thickness) was used in 2008–2009 (scanner A). Thereafter, (2010 to present) a 3-D Gemini TF-16 PET/CT (Philips Medical Systems, scanner B) was used. The 3-D line-of-response (LOR) algorithm of the system software was used to reconstruct the images (transaxial slices: 144×144 voxels, 4.0×4.0×4.0 mm3). A pre-scan low-dose CT was used for attenuation correction (120 kVp, 30 mAs).

Quantitative image analyses

PET images were interpreted by two nuclear medicine physicians (VP, LB) with >10 years of experience. The SSR expression in tumor was assessed using a semiquantitative method (SUVmax). These were measured using a spherical region of interest (ROI) in a transaxial attenuation-corrected PET slice in those lesions that were positive on the visual assessment, i.e., uptake more than the immediate normal surrounding tissue. The uptake in the normal liver was used as a reference value for tumor SUVmax normalization [23]. To avoid the partial volume effect, only lesions >1.5 cm, based on the coregistered CT, were considered.

Multianalyte algorithm analysis (MAAA) PCR-based test (NETest)

A two-step protocol (RNA isolation, cDNA production, and PCR) was used as described [16]. Transcripts (mRNA) were isolated from 1 ml ethylenediaminetetraacetic acid (EDTA)-collected whole blood samples using the Blood Mini Kit (Qiagen, Valencia, CA, USA). Real-time PCR (50 °C 2 min, 95 °C 10 min, then 95 °C 15 s, 60 °C, 60s for 40 cycles) as described [16, 18] was performed (384-well plate, HT-7900 machine) with 200 ng/μl of cDNA and 16 μl of reagents/well (Fast Universal PCR Master Mix, Life Technologies, Carlsbad, CA, USA). All primers used were exon spanning and are <150 base pairs recognition sequence (bprs). PCR values were normalized to housekeeping genes and expression was quantified against a population control (calibrator sample). Four different learning algorithms trained on the internal training set using upregulated features were used and resulted in a consensus categorization of samples into different groups using “majority vote” methodology. A NET score (0–8) was derived from the PCR data using MATLAB (R2011a, MathWorks, Natick, MA, USA) [17]; a value ≥2 is a positive tumor score [17, 18].

CgA assay

CgA was measured using the DAKO CgA enzyme-linked immunosorbent assay (ELISA) kit (DAKO, Carpinteria, CA, USA). A cutoff of 19 units/l defined the upper limit of normal [16].

Ki-67 proliferation index

The Ki-67 value was obtained from the original histopathological reports. In some cases the classification of NETs was revised. The WHO 2010 classification was taken into account for gastroenteropancreatic (GEP) tumors while the 2004 Travis classification was used for bronchial forms [24, 25].

Statistical analyses

Comparative analysis of SUVmax, CgA, and Ki-67 across data sets 1 and 2 was undertaken using the Kolmogorov-Smirnov test of equality of continuous one-dimensional distributions to determine if values of each parameter were drawn from the same underlying distribution, regardless of the data set origin. A two-sample test (null hypothesis: x and y drawn from the same continuous distribution) was performed; p values approaching 1 signified statistically identical distributions. Numerical predictions of Ki-67 and SUVmax using gene expression profiles were produced by fitting a generalized linear model via penalized maximum likelihood (“glmnet” [26]) to the training data set (set 1), implemented in the “caret” R package [27]. Model tuning parameters were estimated in caret by minimizing root mean square error (RMSE). Variable importance was calculated using the caret “varImp” function specific to the glmnet package. Regression models were tested on data set 2. In the instances where t variables (e.g., Ki-67 and CgA) were identified as noncompatible (significantly different) between data sets 1 and 2, the regression model was trained on the combined data set (data set 1 + data set 2) and model performance was estimated using five repeats of 10-tenfold cross-validation. Combinatorial assessments were undertaken using the SUVmax and MORF4L2 expression, Ki-6,7 and CgA. Sensitivity comparisons were undertaken using χ2, nonparametric measurements and receiver-operating characteristic (ROC)/AUC (continuous variables) analysis. Both Prism 6.0 for Windows (GraphPad Software, La Jolla, CA, USA, www.graphpad.com) and MedCalc Statistical Software version 12.7.7 (MedCalc Software, Ostend, Belgium, http://www.medcalc.org; 2013) were utilized. AUCs were compared and the Z-statistic derived (MedCalc) [28].

Results

Positive 68Ga-SSA PET/CT scans were available in all patients. Of 49 patients, 47 (96 %) exhibited a positive NETest, while 26 (54 %) had elevated CgA levels (χ2 = 20.1, p < 2.5×10−6). The MAAA score was significantly associated with image positivity (>95 % concordance), while CgA levels were non-predictive.

Quality control and data processing

We initially examined whether the SUVmax, the circulating 51-gene expression profiles, plasma CgA levels, and tissue Ki-67 were statistically comparable between the two data sets. Principal component analysis of 51 marker gene expression profiles identified that one sample in data set 1 was an outlier. This sample, a bronchial tumor—small cell lung carcinoma (SCLC)—was removed (Fig. 1a). Results of the two-sample Kolmogorov-Smirnov test of equality indicated that SUVmax were comparable (p = 0.546) between the two data sets (Fig. 1b), while neither CgA (p = 0.165) nor Ki-67 values (p = 0.0535) were mathematically comparable (Fig. 1c, d). Based on the mathematically defined acceptable comparability, we then merged the two data sets to mathematically amplify data analysis.

Fig. 1
figure 1

Principal component analysis of data set 1 and data set 2 (a) using expression profiles of 51 marker genes. One outlier sample was excluded. Distribution densities of SUV (b), CgA (c), and Ki-67 (d) in data sets 1 and 2 confirmed the comparability of SUVmax but not of Ki-67 or CgA

Chromogranin A

Although CgA levels were poorly reproducible (Kolmogorov-Smirnov p = 0.165) between the two data sets, we examined if the marker genes could predict these values. For this analysis, the regression model was trained on data set 1 and validated on data set 2. Utilizing the mathematical model of “feature importance selection,” SLC18A2 and LED1 were identified as circulating marker transcripts that could be used to predict CgA levels (Fig. 2a, b). The overall regression model was R 2 = 0.13 and RMSE = 1.47 (Fig. 3a).

Fig. 2
figure 2

Bar plots of top contributing genes to prediction of CgA and SUVmax levels (a, c). Scatter plots of expression levels of top contributing genes and CgA and SUVmax levels in data set 1 (b, d). SLC18A2 and LED1 were identified as key marker genes for CgA, while MORF4L2 was the most effective predictor of SUVmax. Linear regression lines for each scatter plot are shown in blue, while 95 % confidence regions are shaded in gray

Fig. 3
figure 3

Scatter plots of predicted and actual CgA and SUVmax values (a, b). A good regression was identified for both. Linear regression lines for each scatter plot are shown in blue, while the 95 % confidence regions are shaded in gray

Histopathology and Ki-67

Regression analysis of Ki-67 values and circulating marker gene expression (combined data sets) failed to identify any significant relationships. In addition, Ki-67 levels could not be extrapolated by generalized linear regression modeling. A similar result was arrived at when data sets 1 and 2 were interrogated individually. No relationships could be identified between Ki-67 and CgA or between the SUVmax and Ki-67 using the same modeling approaches.

Somatostatin receptor imaging

We next examined whether circulating marker gene expression could be used to predict the SUVmax. Given the SUVmax were comparable between the two data sets (see Fig. 1), data set 1 was used to train the generalized linear regression model, while data set 2 was used as a test. In the training model, the transcript MORF4L2 was the most effective predictor of the SUVmax (Fig. 2c, d). A regression of R 2 = 0.31 and RMSE = 9.4 was identified (Fig. 3b). An assessment of the combined data set indicated that among expression levels of SSTR1, 3, and 5, the latter was the single most important predictor of SUVmax (R 2 = 0.15 and RMSE = 9.5). Neither CgA levels nor Ki-67 were predictive of SUVmax (Fig. 4).

Fig. 4
figure 4

Bar plot of top contributing genes to predicting of CgA and SUVmax levels in the combined data set (data sets 1 and 2) (a, c). Scatter plot of predicted and actual CgA and SUVmax values in the combined data set (b, d). Linear regression lines for each scatter plot are shown in blue, while the 95 % confidence regions are shaded in gray

Combinatorial assessments

To examine whether circulating marker levels and imaging could be used in combination, we evaluated different combinations of gene expression levels and the SUVmax to predict disease status, e.g., stable disease according to RECIST1.1 criteria [21]. While neither a CgA-based nor Ki-67-weighted score correlated with the clinical status, the MORF4L2-calculated quotient (a combination of circulating transcript expression and the SUVmax) was significantly elevated in progressive disease (p < 0.03). This quotient also exhibited significant ROC-derived AUCs (R 2 = 0.7, p < 0.05) when combined with SUVmax (Fig. 5, Table 3).

Fig. 5
figure 5

Expression differences and ROC curves for the four different quotients. a The MORF4L2 quotient value was significantly increased in progressive disease (n = 24) patients compared to those with stable disease (n = 25). b) The AUC for the SUVmax MORF4L2 quotient was 0.71 compared to quotients assessing averaged SSR expression and either Ki-67 index or CgA. *p < 0.05 vs stable disease. MORF MORF4L2 quotient, SSR averaged SSR expression quotient, Ki-67 Ki-67 indexed quotient, CgA chromogranin A-calculated quotient

Table 3 Performance metrics for the combinatorial quotients

Discussion

We evaluated whether SSR-based imaging and blood or tissue biomarker measurements in NETs provided complementary information by assessing the degree of the correlation between each of these parameters. We also examined whether combinations of imaging and these biomarkers provided clinically useful information. Our results demonstrated that the SSR imaging parameter, SUVmax, and the NET marker gene panel were well correlated, capturing similar biological data. Moreover, a combination of circulating transcript levels, particularly MORF4L2, and imaging effectively differentiated progressive from stable disease. No statistical relationship could be identified between imaging parameters and either CgA or Ki-67 or between the NETest and these two measurements.

This effort to examine the correlations between imaging parameters and circulating transcripts has a number of strengths and weaknesses. The principal strength is the demonstration that it is feasible to combine information from two separate sources, namely, SSR-based imaging and molecular information. Three-dimensional lesion localization and 68Ga-SSA PET quantification of SSTR density can be related to the multidimensional data from the measurements of specific circulating tumor transcripts. Thus, two objectively quantifiable parameters obtained in two different compartments (blood and tumor tissue) exhibit a mathematically definable relationship directly related to the biological parameters of disease. The clinical implication is that it identifies the possibility of developing sensitive bidimensional diagnostic protocols for the management of NETs.

A weakness of the study is the use of a retrospective patient series and the limited number of cases available for evaluation and the lack of homogeneity of the histopathological assessment of some of the cases, which may have diminished the effectiveness of the evaluation of Ki-67. Nevertheless, the observations derived from this study support undertaking a larger prospective series. This should prospectively evaluate a combination of 68Ga-SSA PET and NET transcripts tested against the long-term outcome. An important caveat would be to ensure comparability of SSR imaging. In this series, it was evident that imaging parameters obtained from 68Ga-DOTANOC were not optimally mathematically comparable with 68Ga-DOTATOC and 68Ga-DOTATATE (Supplemental Fig. 1). Although this may represent the different receptor binding affinities of the individual peptides, it may also reflect the limited number of patients available for evaluation and would require a more detailed, prospective assessment.

68Ga-SSA PET is used for localization, staging, and selection for SSR-based therapies. The pooled sensitivity and specificity is >90 % [29]. The resolution sensitivity (~5-6 mm) and a threshold for partial volume effect ≥1 cm with the newer PET/CT systems, however, limit the ability to detect small changes in tumors [30, 22]. This is a concern when seeking to identify a therapeutic response in an often indolent disease process. A prognostic use for 68Ga-SSA PET/CT, particularly measurements of the SUVmax, has been suggested for predicting responses to SSA or PRRT [31, 32] or for predicting progression-free survival (PFS). These data need to be both standardized, i.e., similar measurements at different institutions, and would require prospective confirmation in larger series [33]. Similar information has also been obtained from use of the OctreoScan, where the uptake classified according to the “Krenning scale” generally correlates with the response to PRRT. In this case, a grade 4 uptake is predictive of response to PRRT in 60 % of individuals [34]. An improvement in these measures is clearly required. Imaging also provides prognostic data because it identifies SSTR expression. The latter, however, does not directly reflect the proliferation characteristics of the tumor cells. This is based upon the individual unique proliferome of a tumor, which is a complex multiplex of genomic regulators [35], and is important for predicting therapeutic responsiveness, e.g., low proliferative tumor does not respond to chemotherapeutics [1].

The SUV is a widely accepted metric for assessing tissue accumulation of tracers. In the current study, we considered the SUVmax per body weight since there is no differential biodistribution of SSA based on lean body mass or body surface. We identified that the SUVmax could be predicted by measurements of circulating gene markers (R 2 > 0.3, RSME = 9.4). The detection of specific NET transcripts in circulating blood therefore correlates with the level of uptake at 68Ga-SSA PET. Rather than implying the possibility of substituting either technique with the other (the overlap in biological data captured is ~30 %), this observation rather suggests an “information complementarity.” Developing the appropriate tools to define this prospectively could be used to both facilitate diagnostic assessment as well as improve patient selection for therapy of SSR-expressing tumors.

The 68Ga-DOTATOC SUVmax is biologically related to expression of transcript and protein for SSTR2 in tumors, and a close correlation has been noted between SUVmax and immunohistochemical SSTR scores in NET tissue (p < 0.001 and p < 0.05 for SSTR2A and SSTR5, respectively) [36]. The NETest gene panel does not include measurements of circulating expression of SSTR2. This gene is ubiquitously expressed in normal blood cell populations, e.g., leukocytes [37], and circulating levels from tumor cells cannot be consistently differentiated because of the preexisting expression in whole blood (Kidd, Modlin: unpublished data). Expressions, however, of SSTR1, 3, and 5 are transcript components of the NETest and contribute to the regression model that predicts the SUVmax (Fig. 2a). In this respect it appears that the SSTR5 subtype is of significance. Irrespective of the SSR subtype target, 68Ga-DOTATOC and 68Ga-DOTATATE have been reported to exhibit a comparable diagnostic accuracy for NET detection [38]. These observations are consistent with our identification that expression of clinically relevant SSR genes, particularly SSTR5, is biologically informative regarding the receptor density of the tumor in situ.

We noted that expression levels of MORF4L2 were an important component of the predictive model for the SUVmax. This gene is involved in telomere homeostasis via the regulation of cellular transcription through chromatin remodeling [39]. Since telomere “health” has been related in pancreatic NETs to tumor differentiation and therefore prognosis [40], there exists sound biological rationale for considering that MORF4L2 expression may be related to SSR density and radiolabeled peptide uptake. Higher uptake of SSR peptides has been demonstrated as predictive of well-differentiated NET response to SSR-based therapy [41, 31]. In this study, assessments of a candidate quotient—expression of MORF4L2 and SUVmax—demonstrated potential utility as a measure of disease progression, independent of the comparison with the previous or subsequent exam. A possibility worth examining is that a combination of clinical imaging methods with validated molecular biomarkers could provide added value for the more accurate prediction of the treatment efficacy by using a single imaging exam with a blood transcript value [31, 32, 42].

While the uptake per se is considered to be of prognostic significance, measurements of the SUVmax are dependent on a number of variables. These, particularly in indolent tumors, often fail to accurately reflect the response to therapy. Although multiple consecutive assessments with visual changes are required, the diverse patient characteristics (SSA treatment, splenic uptake), the PET model (resolution specifics, acquisition mode, acquisition time per bed position, reconstruction method, attenuation correction, and image analysis) as well as image analysis (ROI reconstruction algorithms) can all affect image interpretation [11, 30]. A circulating tumor MAAA can be easily acquired (simple blood draw) at multiple time points in the intervals between sequential receptor PET assessments. Such information can provide added information regarding tumor behavior (proliferative gene expression levels) as well as responses to therapy (decrease in transcript levels) [17].

Monoanalytes, such as CgA, have been used in conjunction with clinical information to aid the interpretation of imaging with varying degrees of clinical efficacy [19, 15]. Many NET patients (~30–50 %) do not have elevated CgA [43], and CgA exhibits low specific metrics with a wide variation in values depending upon the assay type, the laboratory, and numerous coexisting factors (renal insufficiency, proton pump inhibitors) [12]. In our study, CgA was elevated in <60 % of patients, was not correlated with SUVmax or Ki-67, and was only related to the secretory component of the circulating gene signature. The genes identified to be most significantly related to CgA levels were SLC18A2 (VMAT2), well-known as a transporter gene involved in vesicular amine uptake and secretion [44], and LED1, involved in transcriptional regulation [45]. As a combinatorial quotient, imaging and CgA, were only poorly correlated with outcome (AUC = 0.5, Table 3).

Given the information provided by imaging and Ki-67, inductive logic would suggest that a combinatorial quotient of the two parameters would provide information predictive of outcome. Ki-67 is used as a prognostic clinicopathological surrogate for tumor cell proliferation, forms the basis for the current grading classification, and generally has been used to guide therapeutic strategy [24] despite some reservations (vide European Society for Medical Oncology [46] guidelines). In this study, Ki-67 was not correlated with imaging or the NETest and as a combination with the SUVmax was only poorly correlated with outcome. This likely reflects the fact that Ki-67 measures are typically historical (usually measured in the primary early in the course of the disease) and do not provide a real-time evaluation of a dynamic tumor and the well-described evolving proliferative heterogeneity of neoplasia [14]. In the current study, the distribution of Ki-67 was inconsistent between the data sets and could not be extrapolated by generalized linear regression modeling from the circulating gene expression levels. This might reflect the known limitations noted for Ki-67 measurements (different centers; staining protocols, antibodies, and interpreters) but also the time difference between the blood sampling and the histology. Data set 1 had a median interval of 28 months (range 1–176 months) between the blood collection and the histology, while this was 34 months (range 1–194 months) for data set 2. Another factor that may have influenced the Ki-67 results is the “real-world” situation of patient referral and management, where surgical samples were available for histopathological analysis in 12/27 patients, whereas biopsies were obtained in 7 and cytology in 1, and not repeated in all sites of disease for the majority of subjects. Added value could be provided by real-time dynamic information of tumor status using a circulating signature as a “liquid biopsy.” Timed biopsies and blood sampling therefore are required to define whether a circulating measurement could provide a real-time measure of tumor growth.

The successful integration and validation of imaging and biomarker approaches could guide and better define criteria for clinical decisions. In terms of clinical benefit, apart from accurately confirming the presence and status of disease, the most immediate added value would be in the assessment of treatment efficacy. The dimensionality of an MAAA test offers insight into the biological pathways that comprise neoplastic transformation as well as targets for therapy and can provide information for a more accurate assessment of response. A strategy of fusion of not only imaging types (68Ga PET and CT) but imaging and molecular information of disease status is likely to provide added information regarding tumor behavior and response to therapy. Such data promise to be of clinical utility by making available a dynamic measurement which will facilitate a more precise delineation of evolving NET disease in an individual patient.