Background

Tumor genetic alterations are increasingly being used to identify targeted therapies for individual patients. Conventional methods use sequencing of DNA extracted from tumor tissues, which may be archived or newly acquired by biopsy or surgical excision. Since tumors genetically evolve, particularly in response to treatment with targeted therapies [1,2,3,4], profiling of recently biopsied tumors is preferable over archived tumors. However, not all tumors are amenable to biopsy, and intratumoral and intertumoral heterogeneity raises concern over biopsy bias [5,6,7,8]. If a tumor biopsy contains genetic lesions present at different allelic frequencies (AF) compared to the bulk of the tumor, or if the biopsied tumor is not genetically reflective of most tumors within a patient, then the optimal targeted therapy may be overlooked. Identifying targetable genetic changes using minimally invasive procedures will be an increasing challenge for oncologists. One way to potentially overcome this issue is through analysis of cell-free circulating tumor DNA (ctDNA), which is detectable in the bloodstream [9,10,11]; this method provides the ability to identify targetable mutations through DNA sequencing, and the required blood draw is far less invasive than traditional biopsies.

Quantification and mutational profiling of ctDNA have shown promise for several clinical applications. In patients with early-stage breast cancer that has been surgically resected, detection of ctDNA in plasma is associated with higher likelihood of recurrence, suggesting that the presence of ctDNA reflects residual disease. In addition, the mutational profile of post-surgical ctDNA better reflects the mutational profile of the impending recurrent tumor than the primary breast tumor, suggesting that such ctDNA can be used to identify pre-emptive therapies to target micrometastatic disease [12, 13]. Genetic profiling of plasma DNA may be useful to identify targeted therapies in patients with advanced disease [14, 15], and such profiling is accepted in lieu of tumor genetic profiling in ongoing clinical trials (e.g., TAPUR study, clinicaltrials.gov identifier NCT02693535; NCI-MATCH trial, clinicaltrials.gov identifier NCT 02465060). ctDNA levels also correlate with disease burden and can serve as an early marker of therapeutic response in cases of advanced disease [16].

Several technologies have been developed to analyze ctDNA from plasma [17,18,19], and service providers have begun offering FDA-approved genetic profiling of DNA from patient plasma. Although tumor-derived DNA is detectable in plasma, particularly in patients with heavy cancer burden [16], it remains unclear whether plasma DNA captures all of the clinically actionable mutations present in tumors in patients with advanced disease. Addressing this issue is critical to assignment of the weight of plasma DNA profiling in treatment decision-making.

Methods

Patients and study design

Tissue and blood specimens were obtained from patients under protocols approved by the Dartmouth College Institutional Review Board. All patients provided signed informed consent. This study was conducted in accordance with good clinical practice and all applicable regulatory requirements, including the 1996 version of the Declaration of Helsinki.

DNA sequencing

Blood was separated in plasma and buffy coat fractions. Tissues were obtained by biopsy or excision. Extracted DNA was used for library preparation, capture, and amplification per Illumina TruSight Tumor 170, which covers exons of 149 cancer-associated genes. DNA was sequenced using Illumina NextSeq 500. DNA sequencing data are available at NCBI Sequencing Read Archive (accession # PRJNA604239). Further details on DNA extraction, quantification, and bioinformatics procedures are described in Supplemental Methods and Fig. S1.

Results

Patient characteristics

Blood and tissue specimens from 4 patients were evaluated (Fig. S2). Patient #9 donated her body to research upon her death, which occurred 7 months after initial acquisition of blood and tumor specimens; she provided a second blood sample 3 months prior to death, and additional tumor specimens were obtained post-mortem (denoted as Patient #9A). Blood and tumor specimens were obtained from one patient (#2) recently diagnosed with oligometastatic disease who had not received anti-cancer therapy in the preceding 3 years, and from 3 patients who received systemic or tumor-directed radiation therapy within the preceding 4 months. Patient disease and treatment histories are detailed in Figs. S3 and S4.

DNA sequencing

DNA extracted from plasma gave the characteristic dominant ~ 167-base-pair fragments reported previously [20], as well as ~ 310-base-pair fragments in some samples (Fig. S5). DNA from plasma, tumor, and buffy coat specimens (normal liver sample was used for Patient #2) was used for capture-based sequencing of all exons in 149 cancer-associated genes of the Illumina TST170 panel. Plasma DNA samples were sequenced to an average depth of 719 fold (range of 129–1150 fold), tumor DNA samples were sequenced to an average depth of 1479 fold (range of 444–2411 fold), and buffy coat and liver DNA samples were sequenced to an average depth of 1498 fold (range of 224–1506 fold) (Table S1).

Plasma DNA incompletely reflects mutational profiles of tumor DNA

Comparisons between mutations identified in plasma versus those found in tumors revealed that plasma DNA identified between 11.1% (ovary metastasis in Patient #9) and 100% (liver and omental metastases in Patient #9A) of the mutations found in tumors; Fig. 1 shows overlapping mutations identified in each patient. Overall, the ratio of mutations found in plasma to those found in tumors varied greatly: plasma from Patient #2 revealed only 6/36 (16.7%) total mutations detected in the 3 tumors sampled (1 in-breast recurrence and 2 liver metastases); plasma from Patient #10 revealed 8/17 (47.1%) total mutations found in the 3 liver metastases sampled among 5 lesions observed.

Fig. 1
figure 1

Sequencing of plasma DNA identifies tumor-derived mutations. DNA from plasma, tumors, and buffy coat (or normal liver sample) from 4 patients with metastatic breast cancer were used for sequencing of all exons in 149 genes. Plasma and tumor DNA sequences were compared to buffy coat or normal liver to identify somatic mutations in each patient. Mutations present in both plasma (≥ 5% AF) and at least one tumor specimen (≥ 10% AF) are indicated in heatmaps. Samples are labeled along y-axes with number of mutations found in plasma out of number of mutations called in parentheses. Mutations are indicated along x-axes. Locations of known tumors in each patient at the time of biopsy are indicated with boxes: green indicates a sequenced lesion; black indicates a non-sequenced lesion

Among mutations detected in both plasma and at least one tumor sample within a patient, we found significant positive correlations (p ≤ 0.05) between plasma AF and tumor AF in 7/12 tumors across all 4 patients, but not in Patient #9A in which tumors were sampled post-mortem (likely due to the limited number of overlapping mutations found) (Table S2).

This analysis also revealed a large degree of intertumoral genetic heterogeneity (Fig. 1). For example, analysis of 3 metastases in Patient #9 revealed a total of 58 unique mutations (≥ 10% AF), only 1 of which was a “stem” mutation common to all 3 tumors sampled; the other 57 mutations were “private” (found in only 1 tumor) or “clade” (found in > 1 but not all tumors) (Fig. 2a). Despite not occurring often, stem mutations were found at higher frequencies than private/clade mutations for most patients, with the exception of Patient #8 (Fig. 2b, c). In addition to not identifying the sole stem mutation in Patient #9, plasma samples identified 3/11 stem mutations in Patient #2, 0/8 stem mutations in Patient #8, 4/4 stem mutations in Patient #10, and 1/1 stem mutations in Patient #9A (Fig. 2c).

Fig. 2
figure 2

Plasma DNA sequencing identifies proportions of stem and non-stem mutations. a Proportions of stem mutations (found in all tumors) versus non-stem mutations (found in at least one but not all tumors) are shown for each patient. b, c Proportions of non-stem (private/clade) (b) and stem (c) mutations identified in plasma DNA are shown

Considering the low incidence of detection of tumor-derived mutations in plasma when plasma AF was ≥ 5%, we sought to determine if lowering the AF threshold in plasma to ≥ 1% captured more tumor-derived mutations (≥ 10% AF). The increase in numbers of mutations called in plasma samples at ≥ 1% AF can be seen in Table 1. Several samples saw minimal increases in the number of mutations called, with plasma from Patient #8 gaining only two mutations, and plasma from Patient #2 gained only one mutation. Lowering the threshold of plasma AF had drastic effects on the numbers of mutations called in plasma samples of Patients #9, #10, and #9A, which increased by as much as 14 fold. Despite these drastic increases in plasma mutation calls, lowering the threshold to ≥ 1% AF failed to identify any further mutations found in the sampled tumors, suggesting that the gaps in mutational overlap between tumors and plasma are likely not due to exclusion of rare (< 1% AF) plasma mutations. Increasing the plasma DNA mutation threshold to ≥ 10% AF would further improve concordance with tumor mutations, as most concordant mutation calls can be characterized by greater AF in plasma (Fig. 3). Likewise, only one mutation < 20% AF in any tumor sample was detected in a corresponding plasma sample (Fig. 4). We conclude that plasma DNA provides a snapshot of tumor mutational profiles, but the picture is incomplete and the majority of mutations identified in plasma are not attributable to sampled tumors.

Table 1 Decreasing plasma AF threshold does not improve concordance with tumor samples
Fig. 3
figure 3

Plasma DNA mutations with high allelic frequency tend to be detected in tumors. Each vertical bar indicates one mutation. Plasma AFs of mutations detected in plasma at ≥ 5% AF are noted for each patient along y-axes. Red bars indicate variants present in both plasma and ≥ 1 tumor (≥ 10% AF). Black bars indicate variants present in plasma only. Numbers of tumors evaluated from each patient are noted

Fig. 4
figure 4

Tumor DNA mutations at high allelic frequency are often detected in plasma DNA. Each vertical bar indicates one mutation. Tumor AFs of mutations detected in a tumor(s) at ≥ 10% AF are noted for each tumor along y-axes. Red bars indicate variants present in both a tumor and plasma (≥ 5% AF). Black bars indicate variants present only in a tumor(s)

Plasma DNA reflects clinically occult and evident disease

Patient #9 donated her body (post-mortem) to research: a second plasma sample was acquired 3 months before her death due to widespread metastatic disease; samples from 7 metastatic tumors were acquired at autopsy; these specimens were labeled “Patient #9A” (Fig. S4). More mutations (≥ 5% AF) were detected in the second plasma sample compared to the plasma sample acquired 4 months earlier (Table 1), with very little overlap between the two plasma samples; increasing the threshold to ≥ 10% AF or decreasing to ≥ 1% AF had little to no effect on the number of overlapping mutation calls (Fig. S6). Contrary to other patient cases (Fig. 1), we found increased overlap between mutational profiles of plasma and tumors in Patient #9A: the one stem mutation found in Patient #9 was the only stem mutation found in Patient #9A, and tumor mutations were identified in the plasma of Patient #9A at a rate of concordance as high as 100% (Fig. 5a). Despite the increased frequency of tumor-associated mutations identified in plasma, there were no significant correlations between plasma AF and tumor AF in samples from Patient #9A (Table S2).

Fig. 5
figure 5

Plasma DNA reflects clinically occult and evident tumors in Patient #9A. a DNA from plasma acquired 3 months before death, buffy coat, and samples from 7 tumors acquired post-mortem from Patient #9 were used for DNA sequencing as in Fig. 1. Tumor sites sampled for DNA sequencing are indicated by green boxes. Additional tumor sites noted at autopsy (but not sampled) are indicated by black boxes. b Mutant AFs were plotted according to organ site of tumor. For organ sites with multiple tumors, data were pooled. Data were analyzed by linear regression. c, d Mutant AFs in plasma were compared to tumor size (mean from each organ site) determined from CT scans (c) and relative blood perfusion (mL/min) of tumor-containing organs (d) using Spearman correlation and linear regression analyses

Patient #9A provided tissue samples from metastases found in the heart, omentum (n = 2), kidneys (one from each), and liver (n = 2). In patients with advanced breast cancer, metastases to the heart and peritoneum (including omentum) occur in 15.5% (evaluated at autopsy) and 7.6% (evaluated by imaging) of cases, respectively [21, 22]. The heart is typically not evaluated during routine follow-up due to variability in cardiac uptake of [18F]FDG in PET imaging [23], and heart motion precluding measurement by CT. Omentum is often not evaluable by CT because it contains primarily fat that blends with subperitoneal fat upon anatomical imaging, but CT remains the best routine imaging modality to detect peritoneal metastasis [24, 25]. Thus, heart and omentum are two sites of disease that are difficult to follow clinically, but metastases in these organs were potentially represented (FGFR2 c.*303G > A and FLT3 p.D324N) in the pool of ctDNA in plasma in Patient #9A (Fig. 5a).

We then combined data from all time points of sample acquisition from Patient #9 and #9A. Since some tumors sampled at autopsy were not detected or measurable in radiographs, mutational profiles of multiple tumors acquired from the same organ site (n = 6) were combined for this analysis. Comparison of AF of mutations in plasma DNA vs. organ-specific tumor DNA revealed that all tumor samples aside from the lymph node metastasis exhibited high correlation with plasma (Spearman R2 > 0.8, Fig. 5b), indicating that most tumor samples are well represented in the plasma. As the AF of mutations in tumor and plasma increased, concordance of mutually detected mutations also increased, lending confidence to higher-AF mutation calls in plasma. We considered whether tumor size (Fig. S7) was associated with contribution of DNA to plasma, and thus represented by mutation AF in plasma; however, we found a non-significant slightly negative correlation between tumor size (using mean of tumor sizes from a given organ site) and median AF of mutations in plasma (Fig. 5c).

Since a heart metastasis was potentially represented in plasma DNA (Fig. 5a, b), and a source of ctDNA is leakage of tumor DNA into the bloodstream, we considered whether tumor location and access to the bloodstream were associated with contribution of tumor DNA to plasma. As a surrogate for tumor access to the bloodstream, we used blood perfusion (mL/min) of an organ site [26,27,28] for each tumor-containing organ. Organ blood perfusion rate, relative to the heart, was not significantly correlated with AF of mutations in plasma DNA (Fig. 5d). These data suggest that neither tumor location and access to the bloodstream nor tumor size are strongly associated with contribution of DNA to plasma.

Timing or sampling bias may affect mutation detection in plasma

As previously shown in Fig. S6, mutational profiles of plasma samples can differ between time points within a patient; this difference may be due to disease evolution and/or plasma sampling bias. To examine the effect that this could have on mutation calls of tumor samples for Patient #9/9A, we tested whether the near-autopsy plasma sample (#9A) revealed mutations found in the first set of tumor biopsy samples (#9; acquired 4 months earlier) that the first plasma sample (#9) did not; conversely, we tested whether the first plasma sample (#9) revealed mutations found in the tumor samples taken at autopsy (#9A; acquired 7 months later) that the near-autopsy plasma sample (#9A) did not (Table 2).

Table 2 Timing of plasma sample acquisition identifies new mutations

In the tumor samples acquired at autopsy, using the first plasma sample would have identified one additional mutation found in the heart metastasis, and one additional mutation found in the right kidney metastasis. In the earlier set of tumor biopsies, using the near-autopsy plasma sample identified several new mutations in each of the tumors. This might have proved useful in the case of identifying mutations in the ovarian metastasis, where the first plasma sample only identified one mutation. The near-autopsy plasma sample identifies four additional mutations, including missense variants BRCA1 p.S1634G and FGFR4 p.G388R, the latter of which has been associated with progression and treatment resistance in breast cancer [29,30,31]. This suggests that identifying variants in plasma samples in a time course manner may provide a more complete picture of the mutational profile of the patient and allow for better identification of therapeutic targets, as has also been suggested previously [32].

Discussion

Studies have shown that tumor-derived DNA is detectable in plasma, prompting the development of diagnostic tests using plasma as a “liquid biopsy”. Although liquid biopsy is undoubtedly useful for detection of residual disease and cancer progression [12, 16], it remains unclear whether plasma DNA is suitable for genetic profiling to inform selection of targeted therapies. The results of our study were varied, as we found that sequencing of plasma DNA captured as many as 100% (Patient #9A) to as few as 11.1% (Patient #9) of mutations (≥ 5% AF in plasma) detected in ≥ 1 tumor at ≥ 10% AF in patients with oligometastatic breast cancer. The majority (mean 79.3% ± SD 13.8%) of mutations found in plasma DNA at ≥ 5% AF were not found in corresponding sampled tumors; this result may have been affected by the sequencing depth of the plasma samples. Chae et al. similarly reported 31.7% of all tumor mutations were found in plasma samples of 45 patients with metastatic breast cancer using the Guardant360 sequencing panel [33].

There was a negative correlation (Spearman R2 = − 0.53) between the number of plasma samples also found in corresponding tumors and sequencing depth of the plasma sample, suggesting that the proportion of plasma mutations found in tumor samples decreases as sequencing depth of plasma increases; this is likely due to increased sequencing depth leading to more low-AF mutation calls, which have already been shown to have no impact on concordance with tumor mutation calls (Table 1). The majority (68.5%) of mutations found in both plasma and tumors had a plasma AF ≥ 0.8 (Fig. 3), giving confidence in the tumor origin of high-AF mutations found in plasma.

Studies have used several methodologies to evaluate the utility of plasma DNA sequencing as a surrogate for tumor DNA sequencing. Several studies used the amplicon-based Ion Torrent AmpliSeq platform that covers 2855 cataloged mutations across 50 genes. Rothé et al. found 0 to 2 mutations in five genes (TP53, PIK3CA, PTEN, AKT1, and IDH2) in plasma or tumor specimens among 17 patients with metastatic breast cancer [34]. Their study found that plasma samples identified mutations in tumors in 13/17 (76%) patients, yet overall correlation of AF reported was lower than our study; the relatively good concordance may have been a result of the low number of mutations in detected in the few genes reported. Frenel et al. evaluated specimens from patients with late-stage metastatic solid tumors, and identified 1–5 (median = 2) mutations in each tumor [35]. Sequencing of plasma DNA identified ≥ 1 tumor-derived mutation in 23/39 (59%) patients. In patients with advanced solid tumors, Jovelet et al. found 0 to 4 (median = 1) mutations per tumor, and plasma DNA provided mutational concordance with tumors in 145/283 (51%) patients [36]. A major limitation of such amplicon-based platforms is the short amplicon length (mean = 154 bp for AmpliSeq), where most plasma DNA fragments (~ 167 bp) would not provide template for amplification. More akin to the capture-based method that we used herein (which is not subject to bias due to DNA fragment size), Dietz et al. performed capture-based whole-exome sequencing (Agilent High Sensitivity Kit, sequenced on Illumina HiSeq 2000) of DNA from serum and primary tumor specimens from 6 patients with early-stage lung cancer, achieving 68.5-fold mean depth of coverage for serum DNA [37]. Their analysis identified averages of 2557 (range: 1892–3322) and 1416 (range: 589–4105) mutations among tumor specimens (20–80% AF) and serum specimens (> 0% AF and ≥ 10 mutant reads), respectively. Comparing patient-matched specimens, Dietz et al. detected 5–57% (mean = 23%) of tumor mutations in serum, and 22–41% (mean = 31%) of serum mutations in tumors. A limitation of their study was a lack of normal DNA (e.g., buffy coat) controls, which may have resulted in the inclusion of many false positives, including germline mutations and clustered mutation events [38, 39]. More recently, Wyatt et al. reported an impressive 93.6% concordance among metastatic prostate cancer patients using targeted sequencing of 72 prostate cancer-related genes, finding 109 unique somatic mutations within those 72 genes [40]. In comparison, we found a total of 272 (range: 2–85) unique mutations within ≥ 1 tumor (≥ 10% AF) when considering all 4 patients among 149 cancer-related genes. These ranges in mutational burden are similar to those reported by Shah et al. [41], who reported that in cases of triple-negative breast cancer, numbers of somatic mutations ranged from as few as 5 to as many as 100 using whole-exome sequencing and RNA-seq. The mutational overlap between tumors and plasma was lower in the current study possibly due to the advanced stage of disease (advanced breast cancer vs. early-stage lung cancer) and increased genetic intra- and inter-tumor heterogeneity in patients with advanced disease [42]; however, in the case of Patient #2, all 3 clinically evident tumors were sampled, helping to rule out the variable of inter-tumor heterogeneity. We found that decreasing the plasma DNA mutation threshold to ≥ 1% AF had no effect on overlap with groups of tumor mutations, suggesting that low-AF mutations found in plasma were mostly (A) below the threshold used for calling mutations in tumors, and/or (B) sequencing errors. Differences in DNA sequencing platforms (Illumina and Ion Torrent have error rates of 1.78% vs. 0.4% [43]) and variant callers [37, 44,45,46,47,48] may have also contributed to disparate results.

The clinical importance of the discrepancy between tumor and plasma mutational profiles is exemplified by the lack of detection of a TP53 p.P72R mutation found in all tumor samples in Patient #2, which was not detected in the corresponding plasma sample. This mutation has previously been reported as a strong indicator of toxicity due to cyclophosphamide-based neoadjuvant chemotherapy in breast cancer patients [49, 50]. Patient #2 did not receive cyclophosphamide as part of their treatment regimen, but an accurate plasma sample denoting this mutation could have been invaluable in making treatment decisions for this patient.

As liquid biopsy profiling assays have been developed, a recurring concern has been whether plasma DNA reflects the clinically important (i.e., life threatening) tumor(s) in a patient. Results of our analysis of tumor specimens acquired at autopsy from Patient #9 highlight an inherent disconnect between tumors in vital organs and plasma DNA. Among the 7 tumors evaluated, the mutational profile of a heart metastasis was potentially represented in plasma DNA; this was confounded by the detection of the same mutations in kidney and/or omental metastases (Fig. 2). If Patient #9 had diagnostic plasma-only DNA sequencing, a genetically informed therapy could have been selected based on a private or clade mutation(s) found in the heart tumor. In this scenario, the heart tumor (which was not detected by routine imaging) may have responded to treatment, while other (detectable) tumors that did not harbor the same mutation(s) could have progressed. As a result, the patient would have been taken off treatment due to disease progression, which would be a failure of genetically informed therapy.

Conclusions

Plasma contains a mixture of DNA from tumors in patients with oligometastatic disease, so liquid biopsy creates a gap between tumor identity and tumor genetic profile: mutations identified in plasma DNA cannot be attributed to a specific tumor, so response of a given tumor to therapy may not be attributable to mutations identified in plasma DNA. In the case of Patient #9, only 13/48 (27%) mutations found in plasma were also found in tumor samples. This low rate of concordance could be due to several factors, most notably the presence of other tumors in the body that were occult or not biopsied.

The oncology community must ultimately determine the value of plasma DNA profiling for selecting targeted therapies. Our findings indicate that plasma DNA sequencing is a weak surrogate for tumor DNA sequencing, sporadically detecting a minority of tumor mutations. We conclude that liquid biopsy should not yet replace, but may complement, tumor biopsy for genetic profiling.