Abstract
Metabolomics as one of the most rapidly growing technologies in the “-omics” field denotes the comprehensive analysis of low molecular-weight compounds and their pathways. Cancer-specific alterations of the metabolome can be detected by high-throughput mass-spectrometric metabolite profiling and serve as a considerable source of new markers for the early differentiation of malignant diseases as well as their distinction from benign states. However, a comprehensive framework for the statistical evaluation of marker panels in a multi-class setting has not yet been established. We collected serum samples of 40 pancreatic carcinoma patients, 40 controls, and 23 pancreatitis patients according to standard protocols and generated amino acid profiles by routine mass-spectrometry. In an intrinsic three-class bioinformatic approach we compared these profiles, evaluated their selectivity and computed multi-marker panels combined with the conventional tumor marker CA 19-9. Additionally, we tested for non-inferiority and superiority to determine the diagnostic surplus value of our multi-metabolite marker panels. Compared to CA 19-9 alone, the combined amino acid-based metabolite panel had a superior selectivity for the discrimination of healthy controls, pancreatitis, and pancreatic carcinoma patients \( [ {\text{volume under ROC surface}}\;\left( {\text{VUS}} \right) = 0. 8 9 1 { }\left( { 9 5\,\% {\text{ CI }}0. 7 9 4- 0. 9 6 8} \right)]. \) We combined highly standardized samples, a three-class study design, a high-throughput mass-spectrometric technique, and a comprehensive bioinformatic framework to identify metabolite panels selective for all three groups in a single approach. Our results suggest that metabolomic profiling necessitates appropriate evaluation strategies and—despite all its current limitations—can deliver marker panels with high selectivity even in multi-class settings.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Pancreatic cancer is the fourth leading cause of cancer death in the United States, and most patients diagnosed with pancreatic cancer develop clinical symptoms usually late in the course of the disease (Lowenfels and Maisonneuve 2006). Therefore, only 20 % of patients can be treated with a potentially curative therapy and only about 3–5 % survive at least 5 years (Michl et al. 2006). For these patients, time, especially the so called ‘biomarker lead time’ between the onset of asymptomatic cancer still localized to the organ of origin and clinical diagnosis (Konforte and Diamandis 2012), is crucially important (Hazelton and Luebeck 2011). Although recent modeling studies have illustrated that blood-based biomarkers might provide a successful tool for the early detection and differentiation of premalignant lesions, substantial methodological enhancements of unanticipated extent (Burgess 2012) are still required. Yachida et al. (2010) demonstrated a latency of about 17 years from the initiating mutation to pancreatic cancer death. Similarly, Hori and Gambhir (2011) stated “that shedding rates of current clinical blood biomarkers are likely 104-fold too low to enable detection of a developing tumor within the first decade of tumor growth” and suggested to increase sensitivity and specificity by introducing multi-marker panels of up to 10 biomarkers. In a proof-of-principle study for evaluating the utility of multiplexed circulating biomarkers, Brand et al. (2011) investigated the selectivity of 83 proteins and their combinations. Two panels consisting of CA 19-9, ICAM-1, and OPG, as well as CA 19-9, CEA, and TIMP-1 were found to discriminate pancreatic cancer patients from healthy control subjects and from patients with benign pancreatic conditions, respectively. Since the cohorts in this study were compared separately, an integral model encompassing all three disease states was not obtained. Whereas Brand et al. (2011) focused on known tumor markers, tumor-associated peptides, etc., other studies have employed several of the emerging “-omics” subspecialties, such as proteomics (Fiedler et al. 2009), transcriptomics (Zhang et al. 2010), and—as the probably closest to the “bedside” (Van and Veenstra 2009)—metabolomics (Bathe et al. 2011; Ceglarek et al. 2009; Nishiumi et al. 2010; OuYang et al. 2011; Urayama et al. 2010; Zhang et al. 2011). The latter bears the chance to learn from the intricacies that have plagued “-omics” researchers over the last years, standardization (Van and Veenstra 2009), data processing (Blekherman et al. 2011) and data interpretation (Kholodenko et al. 2012), amongst others.
In this study, we addressed these challenges by using a three-class study design. We collected highly standardized samples of pancreatic cancer patients, subjects with pancreatitis, and healthy controls. Following tandem mass spectrometric metabolite profiling, we evaluated the differences between groups and applied Bayesian methodology to identify multi-metabolite models as “meta-markers”, which are selective for each of the three study groups and provide improved diagnostic performance compared to CA 19-9, the conventional tumor marker.
2 Materials and methods
2.1 Patients and samples
We recruited patients suffering from pancreatic cancer (\( n = 40 \)), healthy controls (\( n = 40 \)), and patients hospitalized due to acute pancreatitis (\( n = 2 6 \)) at the University Hospital of Leipzig in the context of previously published studies (Fiedler et al. 2009; Leichtle et al. 2012). We collected cubital vein fasting samples of cancer patients and healthy controls in two independent sets. Additionally, we collected fasting serum samples of 26 patients with pancreatitis as inflammatory control group (A, C, and D, \( n_{\text{total}} = 10 6 \); Table 1). We adjusted subjects according to age and gender and performed blood sampling from patients before the initiation of specific therapy. Diagnosis of pancreatic cancer was confirmed by histologic examination in all cases. Healthy controls showed no evidence of actual disease in physical examination and routine laboratory testing [alkaline phosphatase, bilirubin, C-reactive protein, CA 19-9, CEA, creatinine, γ-glutamyltransferase, transaminases (Roche Modular, Germany)]. Pancreatitis patients were diagnosed clinically without proof of pancreatic carcinoma during the study period. Serum samples were collected and stored (at −80 °C) using standardized techniques and protocols (Baumann et al. 2005).
2.2 Chemicals, standards and consumables
Methanol and isopropanol (gradient grade) were purchased from Merck (Darmstadt, Germany). The amino acid isotope labelled standard kit (NSK-A, Cambridge Isotope Laboratories, Andover, USA) was used as internal standard. Water (HPLC grade) was obtained from J. T. Baker (Deventer, Netherlands). The derivatization reagent 3n butanolic HCl was made in-house by mixing 4:1 v/v of 1-butanol (for spectroscopy) from Merck (Darmstadt, Germany) and acetyl chloride (p.a.) from Sigma-Aldrich (Steinheim, Germany). 96-well polypropylene microtiter plates were purchased from Greiner Bio-One (Frickenhausen, Germany). Sampling material was obtained from Sarstedt (Nümbrecht, Germany). For sample storage 450 μL CryoTubes™ were purchased from Sarstedt.
2.3 Sample pretreatment
Sample derivatization was performed according to our previously described protocols (Brauer et al. 2011). Shortly, serum samples were diluted 1:10 with methanol for protein precipitation. After centrifugation we placed 10 μL of the supernatant into 96-well polypropylene microtiter plates and diluted it with 100 μL of the internal standard solution. Following evaporation at 70 °C for 40 min, we added 60 μL of 3n butanolic-HCl for derivatization at 65 °C for 18 min. The residual solution was again evaporated at 70 °C for 40 min and then reconstituted with 150 μL of the mobile phase (1/1 v/v isopropanol/water). After 15 min of gentle shaking of the microtiter plate at room temperature, we analyzed the samples by flow injection analysis (FIA)-tandem mass spectrometry (MS/MS).
2.4 Tandem mass spectrometry
An API 3000 MS/MS (Applied Biosystems, Germany) equipped with a Turbo Ion Spray Source (TIS) in combination with an HTC Pal autosampler and a PE 200 microgradient pump was used for flow injection analysis (FIA). 25 μL of the sample were directly injected at a flow rate of 80 μL/min in an analysis time of 1.5 min. We detected amino acids by a neutral loss scan of 102 in the mass range of 130–280 or multiple reaction monitoring (MRM). Quantitative analysis using internal standards for 26 amino acids was performed using ChemoView™ 1.4.2 (Applied Biosystems, Germany). A comprehensive overview of mass transitions, internal standards, and performance data for the different amino acids is summarized in Brauer et al. (2011).
2.5 Bioinformatic analysis
Statistical analyses were conducted (unless otherwise stated) using R for Windows (Version 2.14.2) and its related CRAN packages (http://cran.r-project.org/). We tested data for normality by the Anderson–Darling test (nortest) and the gender distribution in the subgroups by the binomial test (stats). The homogeneity of variances of the quantitative routine laboratory data was evaluated with the approximative Fligner–Killeen test (stats), whereas the paired differences were investigated by Games–Howell testing (script source: http://aoki2.si.gunma-u.ac.jp/R/src/tukey.R). Three missing CA 19-9 values were imputed by multiple imputation (MI) with 3 chains of imputation (which were averaged thereafter), a \(\hat{\text{R}}\) value of 1.1, and bootstrap as random imputation method until conversion (after 7111 iterations). Three pancreatitis samples with non-random missing data as a consequence of insufficient sample volume were excluded from further analysis. The selectivity of single amino acid concentrations was assessed in all disease states simultaneously via the volume under ROC surface (VUS), which is the three-dimensional analogue of AUROC analysis (Nakas and Yiannoutsos 2004). The VUS’ and their associated confidence intervals were calculated nonparametrically using \( B = 2 ,000 \) bootstraps and 50 k subdivisions on the amino acid concentrations (DiagTest3Grp). As we assumed a high degree of collinearity in the amino acid concentrations, we computed Kendall’s τ as well as its Hochberg-adjusted significance (ltm, corrplot) and plotted the correlation matrix (Fig. 1). Based on our previous results indicating that marker models comprising combinations of different amino acids and/or CA 19-9 might be superior to single amino acids and/or CA 19-9 with respect to their selectivity (Leichtle et al. 2012), we also evaluated combined models. These models consisted on the one hand of the conventional tumor marker CA 19-9 combined with different principal components (PC1, PC2, …) of the different amino acid concentrations to control for multicollinearity, which is a significant constraint on variable selection (Leigh 1988). On the other hand we also combined CA 19-9 with mere amino acid concentrations to avoid potentially unnecessary transformation steps prior to modeling. After Yeo–Johnson transformation (car) of the amino acid concentrations, we generated PCs (princomp and factoMineR), from which the first six PCs had eigenvalues >1.0 and cumulatively covered 78.7 % of the variance. For the modeling in a three-state design, we merged sample set A with C and obtained one tumor group, one control group and one pancreatitis group (set D). In the second step we used CA 19-9 alone and combined with the PCs of the amino acid concentrations as well as with mere concentrations for Bayes-averaged multinomial logit modeling [mlogitBMA, bic.mlogit(mlogitBMA)] using Begg and Gray approximation. We validated the latter model by CAR [‘Correlation-Adjusted (marginal) coRrelation’] scores (care) assuming empirical values of 1.0, 0.3, and 0.1 as responses of the pancreatic carcinoma, the healthy control, and the pancreatitis groups, respectively, in a CAR model truncated to a number of variables comparable to the penalized multinomial logit model. We computed the VUS for the four predictors, namely CA 19-9, the PCA-based mlogitBMA-predictor (PCA), the amino acid concentration-based mlogitBMA-predictor (AA), and the amino acid concentration-based CAR-model-predictor (CAR) analogously to the VUS values of the amino acid concentrations. The ROC surface plots (Fig. 2) were drawn using MATLAB (The MathWorks, Natick, MA, USA). Since significant differences of the VUS between predictors and CA 19-9 do not imply inferiority or superiority, we performed non-inferiority and superiority testing applying bootstrap techniques on ΔVUS (\( B_{\text{outer}} = 1 ,000 \), boot, \( B_{\text{inner}} = 1 ,000 \), DiagTest3Grp, UBELIX Cluster of the University of Bern). We constructed the corresponding CIs and tested for the overlap of Δ0 and the ΔVUS’ CI according to the methods proposed by Liu et al. (2006), Tunes da Silva et al. (2009), and Lesaffre (2008) at a predefined δ level of 5 % which was considered to be medically reasonable designing this and a previous study (Leichtle et al. 2012). We visualized the performance data in a forest plot [Fig. 3, forestplot(rmeta), R version 2.15.0, cf. Mascha (2010)].
2.6 Ethics
The study was approved by the Ethics Committee of the Medical Faculty of the University of Leipzig (Reg. No. 013-2005) and it fulfills the requirements of the Helsinki declaration. All study subjects gave written informed consent to participate in the study.
3 Results and discussion
3.1 Descriptives
We collected serum samples of 40 (20 males/20 females) pancreatic carcinoma patients, 26 (22 males/4 females, \( P_{\text{binomial}} = 0.000 5 \)) pancreatitis patients, and 40 (20 males/20 females) healthy controls. Table 1 displays the distributions of age, BMI, UICC cancer staging of the patients, the CA 19-9, bilirubin, and HbA1c concentrations in the three different sample sets. Of 26 amino acid concentrations, we found only four (arginine, glutamic acid, phenylalanine, and tryptophan) unaltered between the study groups (Table 2). Several amino acid concentrations were non-normally distributed. In order to detect sample set-specific alterations in the values, we also compared the amino acid concentrations of the tumor patients and the healthy control group between the two sample sets and found no significant differences (Table 2).
3.2 Correlations
We evaluated the multicollinearity of the amino acid concentrations by generating their correlation matrix (Kendall’s τ and its Hochberg-adjusted significance). Kendall’s τ values ranged between −0.516 (aspartic acid with glutamine) and 0.709 (threonine with glutamine), the P values between 0.001 and 0.97 (Fig. 1).
3.3 Modeling
We hypothesized that combinatory markers including several amino acids and/or CA19-9 might have additive or even multiplicative effects and thus be superior to single amino acids and/or CA 19-9 in diagnostics of pancreatic cancer (Brand et al. 2011). Therefore, in addition to evaluating the single VUS of the amino acid concentrations, we also generated models based on CA 19-9 combined with PCs and Bayesian multinomial logit model averaging (mlogitBMA) as well as on CA 19-9 conjoined with amino acid concentrations. Furthermore, we used models based on CAR scores to evaluate their three-class selectivity in comparison with that of single amino acid concentrations and CA 19-9 as a validation method for the mlogitBMA models. The best PC-based mlogitBMA-model comprised CA 19-9 and PC2. The best amino acid concentration-based mlogitBMA-model contained CA 19-9 and aspartic acid, which both were also contained in the truncated amino acid concentration-based CAR score-model. To evaluate the selectivity of both, amino acid concentrations and modeled predictors, we computed their volume under the ROC surface (VUS) (Table 3). The VUS of the amino acid concentrations spanned from 0.180 (arginine, 95 % CI 0.106–0.270) to 0.850 (glutamine, 95 % CI 0.761–0.929). The VUS of CA19-9 was 0.528 (95 % CI 0.400–0.654), and the predictors PCA, AA, and CAR had VUSs of 0.604 (95 % CI 0.446–0.745), 0.891 (95 % CI 0.794–0.968), and 0.871 (95 % CI 0.776–0.952), respectively. For a random classifier, the VUS could be geometrically determined (Landgrebe and Duin 2007) with a value of \( 1.\bar{6}. \) To illustrate the selectivity of CA 19-9 and the predictors, we generated true class rate-plots depicting the ROC surface (Fig. 2).
3.4 Non-inferiority and superiority testing
Since it is generally accepted that significant difference alone might not be an adequate measure of non-inferiority or superiority, we sequentially tested for both with an a priori-defined acceptance criterion (equivalence limit) of \( \delta = 5\,\% \) ΔVUS. We computed the lower and upper limits of the (100 − 2δ) % bootstrap confidence interval of the estimated ΔVUS as proposed by Liu et al. (2006). The results are shown in Fig. 3. Following the criteria by Mascha (2010) we deduced non-inferiority for predictors AA and CAR compared to CA 19-9. Furthermore, superiority of the AA and CAR predictors over CA 19-9 alone was derived in a second step, since the lower CI of ΔVUS was positive.
“The ideal biological marker(s) for cancer risk assessment and early detection must have high sensitivity and specificity, be found in a biosample obtained using minimally invasive procedures, and be analyzed using a highthroughput, cost-effective assay.” These requirements stated by Van and Veenstra (2009) are challenging to fulfill. Particularly, in the case of pancreatic cancer diagnostics this challenge is even bigger due to the number of differential diagnoses, which are difficult to discern from malignant disease even for experienced clinicians (Gong et al. 2012). Furthermore, chronic pancreatitis patients also have a 15-fold higher risk than the general population to develop pancreatic cancer (Huggett and Pereira 2011). In order to identify biomarkers capable of discriminating different disease states, we designed a study including pancreatic cancer patients and healthy controls of two independently collected sample sets as well as an additional group of pancreatitis patients, since the principal feasibility of the metabolomic approach to pancreatic cancer was recently shown (Bathe et al. 2011; Tesiram et al. 2012; Zhang et al. 2012). Samples were processed following highly standardized preanalytical protocols and applying a routinely used tandem mass spectrometric technique. By comparing the sample groups, we found 22 of 26 amino acids altered in at least one out of ten possible comparisons. The number of different metabolites is comparable to that given by Bathe et al. (2011) who found 22 of 58 metabolites significantly different between malignant and benign pancreatic disease applying 1H NMR and 2D NMR spectroscopy, with OuYang et al.’s (2011) 1H NMR spectroscopy results showing significant alterations of at least 8 metabolites between only 17 pancreatic carcinoma patients and 25 healthy controls. It is consistent with Urajama et al.’s (2010) combined GC/TOF-MS, LC/ESI-MS, and LC/LTQ-Orbitrap study revealing 26 significantly different metabolites in a comparison of 5 pancreatic cancer samples and 5 mixed pancreatitis/healthy controls, and with Nishiumi et al.’s (2010) GC–MS investigations based on 21 pancreatic cancer patients and 9 healthy volunteers identifying 18 of 60 metabolites as significantly different. In addition to the inter-class comparisons of the different sample sets, we also evaluated the inter-sample set differences in the respective classes (cancerA − cancerC and controlA − controlC) and found no significant differences. Since the sample groups were homogeneous, we preferred a joint analysis in the modeling approach over a split-half design to keep the degree of random error as low as possible (Knottnerus and Muris 2003; Ransohoff and Gourlay 2010). Although the previously published metabolome profiling studies of pancreatic carcinoma are heterogeneous regarding the used mass-spectrometric techniques and the studied metabolites, they all share canonical variance-based evaluation strategies with two-class comparisons. Additionally, only one of the studies (Bathe et al. 2011) assessed the selectivity (e.g. AUROC or VUS analyses) of the marker metabolites. Our aim was to perform a comprehensive data analysis that also allows a clear interpretation of the diagnostic value of the markers (Leichtle et al. 2012). To this end, we implemented four unexampled features in our bioinformatic pipeline: (1) The computation of three-class VUS values of the single amino acid concentrations as an integral selectivity measure, (2) a Bayesian multinomial logit model averaging procedure to extend the previously used binomial logistic regression modeling (Leichtle et al. 2012) on the three-class study design to generate multi-marker panels (including CA 19-9), (3) the VUS-based analysis of the panel predictors, and, finally, (4) their non-inferiority and superiority determination. The VUS values of the single amino acid concentrations ranged from 0.18 slightly above a random classifier to 0.85 (glutamine), which is close to the best panel predictors. As none of the previous metabolite profiling studies on pancreatic cancer performed VUS analysis, we can only rely on the utterly inconsistent P values they present, in Urayama et al.’s (2010) case 0.000021, or in Nishiumi et al.’s (2010) 0.97 for glutamine. CA 19-9 alone reached the 10th rank, which is probably attributable to its low selectivity between benign and malign pancreatic diseases (Fig. 2a). Since our previous investigations (Leichtle et al. 2012) indicated a high degree of multicollinearity in the amino acid concentrations, which is known to impede many feature selection techniques (Jesneck et al. 2009; Leigh 1988), we set up a Kendall’s correlation matrix to visualize the multicollinearity and its significance. As expected, the full range of correlation spanned from −0.516 to 0.709, which supported the inclusion of the frequently recommended PC-based analysis approach, albeit it has been shown that variance-based techniques might not always yield optimal predictors (Leichtle et al. 2012). To compute robust predictive multi-metabolite marker panels, we combined CA 19-9 and the PCs as well as CA 19-9 and the mere amino acid concentrations and used a Bayesian multinomial logit model averaging procedure for our categorical three-class study design (Robin et al. 2009). The first model consisted of CA 19-9 and PC2 providing a two-marker “panel” predictor (PCA) with a VUS of 0.604. The omission of PC1 and preference of PC2 with less contribution to explained variance during the mlogitBMA procedure is an astounding finding possibly reflecting a predilection of variables sharing covariance with CA 19-9. The second model based on amino acid concentrations included CA 19-9 and aspartic acid providing a two-marker “panel” predictor (AA) with a VUS of 0.891. Our results indicate that CA 19-9 provides the selectivity mainly for the discrimination between healthy controls and pancreatic cancer patients (Table 2), whilst aspartic acid predominantly contributed to the identification of pancreatitis patients. Nishiumi et al. (2010) reported a borderline significant P value of 0.075 for aspartic acid, whereas Urayama et al. (2010), OuYang et al. (2011) and Bathe et al. (2011) did not mention significant differences. Our results and panel predictors, however, require extremely cautious interpretation since in a previous study an analytical variability >25 % was observed for aspartic acid (Brauer et al. 2011). On the other hand, regarding the substantial impact of especially pancreatic disease on nutrition, it was not unexpected to find models different to those of our previous study on colorectal cancer (Leichtle et al. 2012). The mechanisms disturbing amino acid homeostasis and enabling the discrimination of pancreatic cancer patients from pancreatitis patients on the basis of metabolite profiles are not entirely elucidated. Schrader et al. (2009) suggested—apart from malnutrition—mainly inflammatory effects and pointed at the inverse relationship between the circulating amino acid concentrations and the degree of inflammation present e.g. in hemodialysis patients. Whether increased tumor-associated proteolytic activity (Findeisen and Neumaier 2012) contributes not only to the generation of specific peptide decay profiles, but also to the specificity of the amino acid profiles is still unknown. To validate our results and the Bayesian modeling approach, we also applied model selection techniques based on CAR scores (Zuber and Strimmer 2011) as a non-Bayesian linear alternative. Since our study covered three—more or less—independent classes, we could neither rely on a binary (CAT score) nor on a metric (CAR score) response. Therefore we assumed empirical values of 1.0, 0.3, and 0.1 as “responses” of the respective groups while acknowledging that such a procedure might be somewhat artificial and not necessarily justified by the underlying pathophysiology. To gain a comparable number of model variables as in the penalized mlogitBMA-model and thereby an at least limited comparability, we used a two-predictor CAR model including CA 19-9 and aspartic acid. The CAR panel predictor had a VUS of 0.871 similar to the value obtained with the Bayesian modeling approach. As the final evaluation step, we performed a two-step non-inferiority and superiority testing based on the bootstrapped ΔVUS and on a ± δ equivalence range of 5 % as outlined in a previous study (Leichtle et al. 2012). CA 19-9’s VUS ± δ served as reference we tested the other predictors’ ΔVUS against. In the first step, we observed non-inferiority only for the AA and CAR panel predictors, but not for the PCA panel predictor, whereas in the second step, we determined superiority of AA and CAR panel predictors (Fig. 3). These encouraging results indicate an improved selectivity of the models compared to CA 19-9 alone. Our study has several limitations to be considered. First, we merged the sample sets A and C to keep the degree of random error as low as possible in our modeling analysis (Knottnerus and Muris 2003). However, the “external” validity of the results could not thus be evaluated (Ransohoff and Gourlay 2010). Therefore, subsequent studies are necessary in order to assess the generalizability of our predictor models. Second, due to high preanalytical standardization and refinement of our bioinformatic methodology, the variability of the analytical method itself might have become the main source of bias. With our study design and evaluation strategy, we probably have reached an analytical boundary, that still requires substantial improvements (Hori and Gambhir 2011). Therefore, new analytical techniques are necessary to reach both, superior sensitivity and stability at the same time. The third limitation of the study originates from the strong penalization of our Bayesian modeling approach: The predictor panels generated by the mlogitBMA procedure were both two-component panels consisting of CA 19-9 and another variable. Especially in the case of PC-based modeling and the selection of the second PC while leaving out the first, a considerable amount of selectivity might have been lost. On the other hand, the amino acid concentration-based model was superior in selectivity [without taking misclassification costs into account (Klawonn et al. 2011)], suggesting that PCs might not serve as optimal modeling variables when Occam’s razor is strictly availed. Finally, rather than proposing a superior diagnostic metabolite model or “meta-marker” our results suggest that our bioinformatic framework combined with a methodology refined to sufficient sensitivity and stability might provide a valuable diagnostic tool for metabolic profiling even in the three-class differentiation dilemma of health, inflammation, and malignancy.
4 Short summary
Multi-marker panels have been suggested to improve the selectivity of pancreatic cancer diagnostics and its differentiation from various benign lesions. However, a comprehensive framework for the statistical evaluation of marker panels in a multi-class setting has not yet been established.
Using a disease model encompassing pancreatic cancer, pancreatitis, and healthy controls, 106 standardized serum samples, and metabolic profiling, we generated models to discriminate between the three study groups.
Multi-marker models are superior to the conventional tumor marker CA 19-9 in simultaneously differentiating between pancreatic cancer, pancreatitis, and healthy controls.
Our comprehensive bioinformatic approach provides a novel framework to address a common diagnostic challenge, and thus paves the way for biomarker validation in a clinical three-class setting.
Abbreviations
- 1H NMR:
-
Proton nuclear magnetic resonance (spectrometry)
- 2D NMR:
-
2-Dimensional nuclear magnetic resonance (spectrometry)
- AU(RO)C:
-
Area under the (receiver operator characteristics) curve
- mlogitBMA:
-
Bayesian multinomial logit model averaging
- CA 19-9:
-
Carbohydrate antigen 19-9
- CAR scores:
-
‘Correlation-adjusted (marginal) correlation’ scores
- CEA:
-
Carcinoembryonic antigen
- GC–MS:
-
Gas chromatography mass spectrometry
- GC/TOF–MS:
-
Gas chromatography/time of flight mass spectrometry
- ICAM-1:
-
Intracellular adhesion molecule 1
- LC/ESI-MS:
-
Liquid chromatography/electrospray ionization mass spectrometry
- LC/LTQ-Orbitrap:
-
Liquid chromatography/linear ion trap combined with an orbitrap (mass spectrometry)
- OPG:
-
Osteoprotegerin
- PC(A):
-
Principal component (analysis)
- TIMP-1:
-
TIMP metallopeptidase inhibitor 1
- VUS:
-
Volume under (ROC) surface
References
Bathe, O. F., Shaykhutdinov, R., Kopciuk, K., Weljie, A. M., McKay, A., Sutherland, F. R., et al. (2011). Feasibility of identifying pancreatic cancer based on serum metabolomics. Cancer Epidemiology Biomarkers & Prevention, 20, 140–147.
Baumann, S., Ceglarek, U., Fiedler, G. M., Lembcke, J., Leichtle, A., & Thiery, J. (2005). Standardized approach to proteome profiling of human serum based on magnetic bead separation and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Clinical Chemistry, 51, 973–980.
Blekherman, G., Laubenbacher, R., Cortes, D. F., Mendes, P., Torti, F. M., Akman, S., et al. (2011). Bioinformatics tools for cancer metabolomics. Metabolomics : Official Journal of the Metabolomic Society, 7, 329–343.
Brand, R. E., Nolen, B. M., Zeh, H. J., Allen, P. J., Eloubeidi, M. A., Goldberg, M., et al. (2011). Serum biomarker panels for the detection of pancreatic cancer. Clinical Cancer Research : an Official Journal of the American Association for Cancer Research, 17, 805–816.
Brauer, R., Leichtle, A., Fiedler, G. M., Thiery, J., & Ceglarek, U. (2011). Preanalytical standardization of amino acid and acylcarnitine metabolite profiling in human blood using tandem mass spectrometry. Metabolomics : Official Journal of the Metabolomic Society, 7, 344–352.
Burgess, D. J. (2012). Biomarkers: Major mathematical hurdles for biomarker-based screening. Nature Reviews Cancer, 12, 3.
Ceglarek, U., Leichtle, A., Brugel, M., Kortz, L., Brauer, R., Bresler, K., et al. (2009). Challenges and developments in tandem mass spectrometry based clinical metabolomics. Molecular and Cellular Endocrinology, 301, 266–271.
Fiedler, G. M., Leichtle, A. B., Kase, J., Baumann, S., Ceglarek, U., Felix, K., et al. (2009). Serum peptidome profiling revealed platelet factor 4 as a potential discriminating Peptide associated with pancreatic cancer. Clinical Cancer Research, 15, 3812–3819.
Findeisen, P., & Neumaier, M. (2012). Functional protease profiling for diagnosis of malignant disease. Proteomics. Clinical Applications, 6, 60–78.
Gong, P. L., Liu, T. T., & Shen, X. Z. (2012). Differentiation of autoimmune pancreatitis with pancreatic carcinoma remains a challenge to physicians. Journal of Digestive Diseases, 13, 267–273.
Hazelton, W. D., & Luebeck, E. G. (2011). Biomarker-based early cancer detection: Is it achievable? Science Translational Medicine, 3, 109fs9.
Hori, S. S., & Gambhir, S. S. (2011). Mathematical model identifies blood biomarker-based early cancer detection strategies and limitations. Science Translational Medicine, 3, 109ra116.
Huggett, M. T., & Pereira, S. P. (2011). Diagnosing and managing pancreatic cancer. The Practitioner, 255(21–5), 2–3.
Jesneck, J. L., Mukherjee, S., Yurkovetsky, Z., Clyde, M., Marks, J. R., Lokshin, A. E., et al. (2009). Do serum biomarkers really measure breast cancer? BMC cancer, 9, 164.
Kholodenko, B., Yaffe, M. B., & Kolch, W. (2012). Computational approaches for analyzing information flow in biological networks. Science Signaling, 5, re1.
Klawonn, F., Höppner, F., May, S. (2011) An alternative to ROC and AUC analysis of classifiers advances in intelligent data analysis X in gama, In J. E. Bradley, J. Hollmén (Eds.). Berlin/Heidelberg: Springer. pp. 210–221.
Knottnerus, J. A., & Muris, J. W. (2003). Assessment of the accuracy of diagnostic tests: the cross-sectional study. Journal of Clinical Epidemiology, 56, 1118–1128.
Konforte, D., & Diamandis, E. P. (2012). Is early detection of cancer with circulating biomarkers feasible? Clinical Chemistry.
Landgrebe, T., & Duin, R. P. W. (2007). A simplified volume under the ROC hypersurface. SAIEE Africa Research Journal, 98, 94–100.
Leichtle, A., Nuoffer, J.-M., Ceglarek, U., Kase, J., Conrad, T., Witzigmann, H., et al. (2012). Serum amino acid profiles and their alterations in colorectal cancer. Metabolomics, 8, 643–653.
Leigh, J. P. (1988). Assessing the importance of an independent variable in multiple regression: is stepwise unwise? Journal of Clinical Epidemiology, 41, 669–677.
Lesaffre, E. (2008). Superiority, equivalence, and non-inferiority trials. Bulletin of the NYU Hospital for Joint Diseases, 66, 150–154.
Liu, J. P., Ma, M. C., Wu, C. Y., & Tai, J. Y. (2006). Tests of equivalence and non-inferiority for diagnostic accuracy based on the paired areas under ROC curves. Statistics in Medicine, 25, 1219–1238.
Lowenfels, A. B., & Maisonneuve, P. (2006). Epidemiology and risk factors for pancreatic cancer. Best Practice & Research Clinical Gastroenterology, 20, 197–209.
Mascha, E. J. (2010). Equivalence and noninferiority testing in anesthesiology research. Anesthesiology, 113, 779–781.
Michl, P., Pauls, S., & Gress, T. M. (2006). Evidence-based diagnosis and staging of pancreatic cancer. Best Practice & Research Clinical Gastroenterology, 20, 227–251.
Murdoch, D. J., & Chow, E. D. (1996). A graphical display of large correlation matrices. The American Statistician, 50, 178–180.
Nakas, C. T., & Yiannoutsos, C. T. (2004). Ordered multiple-class ROC analysis with continuous measurements. Statistics in Medicine, 23, 3437–3449.
Nishiumi, S., Shinohara, M., Ikeda, A., Yoshie, T., Hatano, N., Kakuyama, S., et al. (2010). Serum metabolomics as a novel diagnostic approach for pancreatic cancer. Metabolomics: Official Journal of the Metabolomic Society, 6, 518–528.
OuYang, D., Xu, J., Huang, H., & Chen, Z. (2011). Metabolomic profiling of serum from human pancreatic cancer patients using 1H NMR spectroscopy and principal component analysis. Applied Biochemistry and Biotechnology, 165, 148–154.
Ransohoff, D. F., & Gourlay, M. L. (2010). Sources of bias in specimens for research about molecular markers for cancer. Journal of Clinical Oncology : Official Journal of the American Society of Clinical Oncology, 28, 698–704.
Robin, X., Turck, N., Hainard, A., Lisacek, F., Sanchez, J. C., & Muller, M. (2009). Bioinformatics for protein biomarker panel classification: What is needed to bring biomarker panels into in vitro diagnostics? Expert Review of Proteomics, 6, 675–689.
Schrader, H., Menge, B. A., Belyaev, O., Uhl, W., Schmidt, W. E., & Meier, J. J. (2009). Amino acid malnutrition in patients with chronic pancreatitis and pancreatic carcinoma. Pancreas, 38, 416–421.
Tesiram, Y. A., Lerner, M., Stewart, C., Njoku, C., & Brackett, D. J. (2012). Utility of nuclear magnetic resonance spectroscopy for pancreatic cancer studies. Pancreas, 41, 474–480.
Tunes da Silva, G., Logan, B. R., & Klein, J. P. (2009). Methods for equivalence and noninferiority testing. Biology of Blood and Marrow Transplantation: Journal of the American Society for Blood and Marrow Transplantation, 15, 120–127.
Urayama, S., Zou, W., Brooks, K., & Tolstikov, V. (2010). Comprehensive mass spectrometry based metabolic profiling of blood plasma reveals potent discriminatory classifiers of pancreatic cancer. Rapid Communications in Mass Spectrometry: RCM, 24, 613–620.
Van, Q. N., & Veenstra, T. D. (2009). How close is the bench to the bedside? Metabolic profiling in cancer research. Genome Medicine, 1, 5.
Yachida, S., Jones, S., Bozic, I., Antal, T., Leary, R., Fu, B., et al. (2010). Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature, 467, 1114–1117.
Zhang, L., Farrell, J. J., Zhou, H., Elashoff, D., Akin, D., Park, N. H., et al. (2010). Salivary transcriptomic biomarkers for detection of resectable pancreatic cancer. Gastroenterology, 138(949–57), e1–e7.
Zhang, H., Wang, Y., Gu, X., Zhou, J., & Yan, C. (2011). Metabolomic profiling of human plasma in pancreatic cancer using pressurized capillary electrochromatography. Electrophoresis, 32, 340–347.
Zhang, L., Jin, H., Guo, X., Yang, Z., Zhao, L., Tang, S., et al. (2012). Distinguishing pancreatic cancer from chronic pancreatitis and healthy individuals by (1)H nuclear magnetic resonance-based metabonomic profiles. Clinical Biochemistry, 45, 1064–1069.
Zuber, V., & Strimmer, K. (2011). High-dimensional regression and variable selection using CAR scores. Statistical Applications in Genetics and Molecular Biology, 10, 34.
Acknowledgments
The authors would like to thank David Gurtner from the UBELIX Linux High Performance Computing Cluster of the University of Bern for his outstanding technical support, Xavier Robin from the Department of Structural Biology and Bioinformatics of the Biomedical Proteomics Research Group at the University of Geneva for inspiring discussions, and Johanna Sistonen from the Pharmacogenomics and Drug Metabolism Group of the Center of Laboratory Medicine at the Inselspital Bern for her thorough revision of the manuscript.
Conflict of interest
None.
Open Access
This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
Author information
Authors and Affiliations
Corresponding author
Additional information
Alexander Benedikt Leichtle and Uta Ceglarek have contributed equally to this study.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Leichtle, A.B., Ceglarek, U., Weinert, P. et al. Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma. Metabolomics 9, 677–687 (2013). https://doi.org/10.1007/s11306-012-0476-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11306-012-0476-7