Abstract
Defining genetic factors impacting chemotherapy failure can help to better predict response and identify drug resistance mechanisms. However, there is limited understanding of the contribution of inherited noncoding genetic variation on inter-individual differences in chemotherapy response in childhood acute lymphoblastic leukemia (ALL). Here we map inherited noncoding variants associated with treatment outcome and/or chemotherapeutic drug resistance to ALL cis-regulatory elements and investigate their gene regulatory potential and target gene connectivity using massively parallel reporter assays and three-dimensional chromatin looping assays, respectively. We identify 54 variants with transcriptional effects and high-confidence gene connectivity. Additionally, functional interrogation of the top variant, rs1247117, reveals changes in chromatin accessibility, PU.1 binding affinity and gene expression, and deletion of the genomic interval containing rs1247117 sensitizes cells to vincristine. Together, these data demonstrate that noncoding regulatory variants associated with diverse pharmacological traits harbor significant effects on allele-specific transcriptional activity and impact sensitivity to antileukemic agents.
Similar content being viewed by others
Introduction
Due to continual advances in treatment protocol over the last 60 years, the survival rate of the most common malignancy in children, acute lymphoblastic leukemia (ALL), has dramatically improved to over 90% in high-income countries1. Despite these advances, survival rates of pediatric patients experiencing refractory or relapsed ALL were only 30–50%, and those of adults were especially low (~10%)2. Thus, improving the understanding of the underlying genetic risk factors impacting response to ALL chemotherapy is a major step in improving outcomes for patients with refractory or relapsed ALL.
Genome-wide association studies (GWAS) have identified numerous inherited DNA sequence variants associated with treatment outcome in childhood ALL from clinical trials carried out by St. Jude Children’s Research Hospital and the Children’s Oncology group3,4,5. This includes GWAS analyses that identified inherited genetic contributors associated with patient relapse4,5 and persistence of minimal residual disease (MRD) after induction chemotherapy3, which is an early indicator of treatment failure6,7,8,9. In addition, ex vivo chemotherapeutic drug sensitivity testing using primary ALL cells from patients serves as an informative pharmacological phenotype10. When integrated with genotype profiling for GWAS, these analyses identify variants contributing to antileukemic drug resistance that reflects in vivo and ex vivo resistance and is, therefore, predictive of treatment outcomes in patients10,11,12,13,14,15,16,17,18,19,20,21,22.
Because most GWAS variants, including pharmacogenomic variants23,24, lie in noncoding sequences in the human genome, their connection to gene regulation and cellular biology has yet to be established. Moreover, given that dozens of variants are typically in strong linkage disequilibrium (LD) with an associated sentinel variant, pinpointing causal variants at GWAS loci has been challenging. Noncoding GWAS variants have been consistently linked to disruption of cis-regulatory element (CRE) activity and gene regulation25. As a result, the functional evaluation of these regulatory variants involves an examination of their allele-specific activities on transcriptional output, which has traditionally been a low-throughput endeavor. Therefore, the functional investigation of all associated regulatory variations at GWAS loci (sentinel and LD) proved to be an intractable hurdle to investigators. Recent technological advances, however, have ameliorated these challenges through the advent of massively parallel reporter assays (MPRAs) where the reporter aspect is often a self-transcribed barcode in the 3′-UTR of a reporter gene that is detected using next-generation sequencing. MPRAs allow the simultaneous, rapid, and robust detection of differences in transcriptional output from a library of cis-regulatory sequences of interest26,27,28. MPRAs have since been applied to the study of regulatory variation at GWAS loci through an examination of allele-specific effects on reporter gene expression29,30,31,32,33,34,35.
Another challenge is connecting regulatory variation at promoter-distal CREs to a target gene, as the closest gene may not be the target gene36. To circumvent these challenges, regulatory variation can be coupled to transcriptomics to identify variants impacting the expression of a candidate target gene through expression quantitative trait locus (eQTL) mapping36,37. Functional genomics offers additional solutions through the mapping of three-dimensional (3D) genomic interactions38. Because promoter-distal CREs (e.g., enhancers) regulate gene expression through long-range 3D looping to the promoters of target genes25, an attractive assay to identify gene targets of promoter-distal regulatory variants is promoter capture Hi-C (promoter CHiC)39. Promoter capture Hi-C, and related 3D chromatin interaction assays have been implemented at multiple GWAS loci to identify gene targets of promoter-distal regulatory variation29,36,40,41,42,43.
In this work, to better understand the underlying genetic and gene regulatory factors that impact diverse pharmacological traits in ALL, we performed a comprehensive functional interrogation of GWAS regulatory variants that map to ALL accessible chromatin sites and that are associated with ALL treatment outcome (i.e., relapse and persistence of MRD after induction chemotherapy) in patients and/or ex vivo chemotherapeutic drug resistance in primary ALL cells from patients using MPRA. We coupled these results with promoter CHiC to identify candidate target genes of functional regulatory variants with significant allele-specific effects on reporter gene expression. Finally, we functionally investigated the impact of the top regulatory variants chemotherapeutic drug resistance in ALL cell lines. Collectively, this study implements a comprehensive functional investigation of the allele-specific, gene-regulatory effects of noncoding variants associated with the pharmacogenomics of ALL treatment, and, therefore, fills an unmet need for large-scale functional examinations of regulatory GWAS variants associated with ALL pharmacological traits.
Results
Identification of noncoding regulatory variants impacting the pharmacogenomics of ALL treatment
Single-nucleotide variants (SNVs) impacting diverse pharmacological traits in ALL were identified for functional interrogation. We chose SNVs associated with relapse or persistence of MRD after induction chemotherapy in childhood ALL patients to investigate the role of inherited noncoding regulatory variants impacting clinical phenotypes (i.e., treatment outcome). These SNVs were identified from published GWAS of ALL patients enrolled in St. Jude Children’s Research Hospital and the Children’s Oncology Group clinical protocols3,4,5 (see Methods for variant selection criteria). Variant selection also included prioritization for treatment outcome SNVs associated with drug resistance phenotypes in primary ALL cells to enrich for variation impacting ALL cell biology (see Methods for variant selection criteria). These treatment outcome-associated variants, as well as all variants in high LD (r2 > 0.8) with the sentinel GWAS variants, were further evaluated (Fig. 1a, b).
We also identified variants directly associated with ex vivo chemotherapeutic drug resistance in primary ALL cells from patients by performing GWAS analyses using SNV genotype information and ex vivo drug resistance assay results for six antileukemic agents (prednisolone, dexamethasone, vincristine, L-asparaginase, 6-mercaptopurine [6MP] and 6-thioguanine [6TG]) in primary ALL cells from 312–344 patients (not all patients were tested for all drugs) enrolled in the Total Therapy XVI clinical protocol at St. Jude Children’s Research Hospital (see Methods). We further prioritized functional ex vivo drug resistance SNVs by determining if they were eQTLs in primary ALL cells or related cell types (i.e., whole blood and EBV-transformed lymphocytes) from the Genotype-Tissue Expression (GTEx) consortium37 (see Methods for variant selection criteria). Ex vivo drug resistance variants that were also identified as eQTLs, as well as variants in high LD (r2 > 0.8) with these sentinel GWAS variants, were further evaluated (Fig. 1a, b).
GWAS have also been performed for childhood ALL disease susceptibility and identified several GWAS loci harboring variants with genome-wide significance44,45,46,47,48,49,50. Several follow-up studies of these GWAS loci have identified candidate causal noncoding variants and mechanisms involving gene regulatory disruptions51,52,53. As a result, we used ALL disease susceptibility variants (n = 11), as well as variants in high LD (r2 > 0.8) with them, for further analysis as positive controls in our study (Fig. 1a, b).
Because most of these variants map to noncoding portions of the human genome, these data point to disruptions in gene regulation as the underlying mechanism of how these variants impact ALL cell biology. We therefore utilized assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq)54 chromatin accessibility data in 161 ALL cell models, comprised of primary ALL cells (cryopreserved, n = 2455; fresh, n = 12056), ALL cell lines (n = 14) and ALL patient-derived xenografts (PDXs, n = 3), to uncover which variants map to putative CREs in ALL cells57 (i.e., regulatory variants; Fig. 1c). Although we detected variation in ATAC-seq TSS enrichment scores and peak counts that is to be expected from such a large, mixed cohort of ALL cell models, the peaks called were largely reproducible (found in >3 samples) within each group (Supplementary Fig. 1a–c). ATAC-seq data from primary ALL cells, ALL cell lines, and PDXs were combined and identified 1696 regulatory variants at accessible chromatin sites in ALL cells for functional investigation (Fig. 1c and Supplementary Data 1).
Assessing the impact of regulatory variation on transcriptional output using MPRA
To examine the functional effects of these 1696 regulatory variants on transcriptional output in a high-throughput manner we utilized a barcode-based MPRA platform29,32 to measure differences in allele-specific transcriptional output (Fig. 2a). Oligonucleotides containing 175-bp of genomic sequence centered on each reference (ref) or alternative (alt) variant allele, a restriction site, and a unique 10-bp barcode sequence were cloned into plasmids. An open reading frame containing a minimal promoter driving GFP was then inserted at the restriction site between the alleles of interest and their unique barcodes (Fig. 2a). We utilized 28 unique 3′UTR DNA barcodes per variant allele (56 barcodes per regulatory variant), and variants near bidirectional promoters (47 total variants) were tested using both sequence orientations. In total, 97,608 variant-harboring oligonucleotides were evaluated for allele-specific differences in gene regulatory activity (Fig. 2a).
Following transfection into 7 different B-cell precursor ALL (B-ALL; 697, BALL1, Nalm6, REH, RS411, SEM, SUPB15) and 3 T-cell ALL (T-ALL; CEM, Jurkat, P12-Ichikawa) human cell lines (n = 4 transfections per cell line; 40 total), the transcriptional activity of each allele variant was measured by high-throughput sequencing to determine the barcode representation in reporter mRNA and compared to DNA counts obtained from high-throughput sequencing of the MPRA plasmid pool (Fig. 2a). In the 10 cell lines MPRA detected 4633 instances of significant differential activity between alleles across 91% (1538/1696) of regulatory variants tested (Fig. 2b, c, Supplementary Data 2). The 10 ALL cell lines showed substantial differences in the total number of regulatory variants harboring significant allele-specific activity, which we suspect largely stems from differences in transfection efficiency (Fig. 2c). Importantly, when comparing changes in allele-specific MPRA activity for each regulatory variant we found that significant changes in activity (adj. p < 0.05) were highly correlated between ALL cell lines, with 87% concordance in allelic-specific activity, suggesting that significant MPRA hits were likely to be robust and reproducible between cell lines (Fig. 2d). Allele-specific MPRA activities were also correlated using all pairwise cell line comparisons for each regulatory variant, irrespective of significance (Supplementary Fig. 2a). Importantly, 31 of the 35 positive control variants (i.e., ALL disease susceptibility-associated variants and variants in high LD) showed significant allelic effects in at least 1 cell line, and 10 showed significant and concordant allelic effects in at least three ALL cell lines, including two variants (rs3824662 at GATA3 locus and rs75777619 at 8q24.21) directly associated with ALL susceptibility44,49,52 (Supplementary Data 2). The risk A allele at rs3824662 was associated with higher GATA3 expression and chromatin accessibility and demonstrated significantly higher allele-specific activity in our MPRA44,52, thereby demonstrating that the MPRA could detect allelic effects previously identified by others.
To further validate MPRA hits in an ex vivo model, we performed MPRA using two B-ALL PDX samples that were freshly harvested from mice. These samples detected 26 and 67 significant gene regulatory variants, respectively, and showed significant correlation with the cell line MPRA data (Supplementary Fig. 2b, c, Supplementary Data 3). We attribute the detection of relatively lower numbers of variants in PDXs to technical effects stemming from poor transfection efficiency and limited cell survival ex vivo. Overall, our data suggest that the cohort of SNVs tested contained functional regulatory variants with the potential to impact gene regulation.
Identification of functional regulatory variants showing reproducible and concordant changes in allele-specific gene expression
To further focus on regulatory variants most likely to broadly impact gene regulation in ALL cells, we prioritized 556 variants with significant (adj. p < 0.05) and concordant allele-specific activities in at least three ALL cell lines (i.e., functional regulatory variants; Fig. 3a–d, Supplementary Data 4). Most of these functional regulatory variants (318/556) mapped to accessible chromatin found only in primary ALL cell samples, underscoring the importance of incorporating chromatin architecture from primary ALL cells, and 54 functional regulatory variants mapped to transcription factor footprints in primary ALL cells (Supplementary Fig. 3). Additionally, we used Genomic Regions Enrichment of Annotations Tool (GREAT) to associate these SNVs with their nearby genes and search for enrichment in gene ontology biological processes pathways58. Although GREAT identified gene associations for nearly all SNVs, we found no significant pathway associations (Supplementary Data 4 and 5). Because further functional investigation of variants in primary ALL cells or PDXs ex vivo is largely intractable, we focused on 210 functional regulatory variants that were detected in open chromatin in one of the 14 ALL cell lines that we had generated ATAC-seq data (Fig. 3d). Most of these variants (159/210; 76%) were also found in accessible chromatin in PDXs and/or in primary ALL cells from patients (Fig. 3d).
For additional validation using traditional luciferase reporter assays, we prioritized these 210 functional regulatory variants based on allele-specific effect size and selected high-ranking SNVs. Dual-luciferase reporter assays showed similar allele-specific changes in activity to that which was detected by MPRA for 7 SNVs tested (Supplementary Fig. 4a–k). In fact, a significant positive correlation (p = 0.0017) was observed between the allelic effects detected by MPRA and luciferase reporter assays (Supplementary Fig. 4l). Together, these analyses assessed the robustness of our MPRA screen of functional regulatory variants and identified 556 SNVs with reproducible and concordant allele-specific effects on gene regulation. Importantly, 210 of the 556 significant hits that were concordant in at least three cell lines were found in open chromatin sites in ALL cell lines and, therefore, warranted further exploration.
Association of functional regulatory variants with putative gene targets
To better understand how these variants impact cellular phenotypes, we first determined if the 210 functional regulatory variants found in accessible chromatin sites in ALL cell lines could be directly associated with a target gene. While 35 functional regulatory variants were localized close (±2.5 kb) to nearby promoters (Fig. 4a, Supplementary Data 4 and 6), 175 variants were promoter-distal (>2.5 kb), and therefore likely to map to CREs with unclear gene targets (Fig. 4a). While CREs are often associated with the nearest genes, 3D chromatin looping methods are a more reliable method to associate a CRE with its target gene promoter. In pursuit of evidence-based association of promoters and specific CREs, we performed two related chromatin looping methods, H3K27Ac HiChIP59 and promoter capture HiC (CHiC)39, in 8 of 10 ALL cell lines used in MPRA and determined that 19 of the 175 non-promoter functional regulatory variants showed connectivity to distal promoters in the same cell line where allele-specific MPRA activity and chromatin accessibility were detected (Fig. 4a, Supplementary Data 6). Interestingly, H3K27Ac HiChIP and promoter CHiC called similar numbers of loops across all 8 cell lines (690,579 versus 660,313, respectively), but promoter CHiC loop calling was more consistent per cell line (Supplementary Fig. 5, Supplementary Data 7). HiChIP detected no looping at any of the 556 reproducible and concordant SNVs from the MPRA, and the 19 SNVs showing connectivity to a promoter were solely detected by promoter CHiC, further highlighting the utility of this method in GWAS-oriented studies41,60,61,62,63.
In prioritizing functional regulatory variants, we were interested in the gene regulatory impact of variants at TSS-proximal promoter-associated versus TSS-distal promoter-connected CREs as measured by MPRA. Interestingly, we found that SNVs found at TSS-distal open chromatin sites, promoter-associated or not, showed higher allele-specific changes in MPRA activity than those at promoters (Fig. 4b). While we acknowledge that many of the 156 variants for which we did not detect a relationship with a promoter are likely to have meaningful gene targets, we focused on CREs containing variants with known gene targets in ALL cells for functional validation. Amongst the TSS-distal promoter-connected functional regulatory variants, we found that distal intergenic and intronic SNVs showed significantly higher allele-specific activity than those in UTRs (Fig. 4c). These data suggest that the most robust allelic effects attributable to these regulatory variants are likely to occur at distal intergenic and intronic sites >2.5 kb from the TSS of the target gene.
Next, we ranked TSS-proximal promoter-associated and TSS-distal promoter-connected functional regulatory variants by the geometric mean of their significant MPRA data to account for the magnitude of allele-specific activity and the reproducibility of a significant change across ALL cell lines (Fig. 4d, e). This analysis identified rs1247117 as the most robust functional regulatory variants, which we then pursued for mechanistic understanding (Fig. 4e).
rs1247117 determines genomic accessibility, PU.1 binding, and EIF3A expression
We pursued functional validation of rs1247117 based on its highest-ranking geometric mean of MPRA allelic effect. rs1247117 is in high LD with two GWAS sentinel variants (rs1312895, r2 = 0.99; rs1247118, r2 = 1) that are associated with persistence of MRD after induction chemotherapy3. This functional regulatory variant maps to a distal intergenic region harboring chromatin accessibility downstream of the CACUL1 gene, for which it is an eQTL in EBV-transformed lymphocytes37. However, we found that rs1247117 loops to the EIF3A promoter in Nalm6 B-ALL cells (Fig. 5a). We, therefore, explored how this accessible chromatin site might recruit transcriptional regulators that would depend on the allele present at rs1247117. For this, we first performed ChIP-seq for RNA pol II and H3K27Ac, which confirmed RNA Pol II occupancy and H3K27Ac enrichment in Nalm6 cells, indicating that rs1247117 is associated with an active CRE (Fig. 5a). Through an examination of the underlying DNA sequence spanning rs1247117, we found that the reference guanine (G) risk allele at rs1247117 resides in a PU.1 transcription factor binding motif that is disrupted by the alternative adenine (A) allele (Fig. 5b). Although the risk G allele is the reference allele, the alternative A allele is more common in human populations. Supporting PU.1 binding at this location, accessible chromatin profiling in primary ALL cells identified an accessible chromatin site and PU.1 footprint spanning rs1247117 in diverse ALL samples (Supplementary Fig. 6a, b). Significantly greater chromatin accessibility at rs1247117 was also observed in heterozygous (GA) patient samples compared to patient samples homozygous for the alternative A allele (Supplementary Fig. 6c), and the G allele at rs1247117 harbored significantly greater ATAC-seq read count compared to the A allele (Supplementary Fig. 6d). Importantly, we determined that PU.1 was bound at this site in Nalm6 cells using CUT and RUN64 (Fig. 5a).
Nalm6 cells contain the alternative A allele that disrupts the PU.1 motif at rs1247117, yet our data suggests that this site still binds PU.1 (Fig. 5a, b). This led us to hypothesize that PU.1 binding affinity for the PU.1 motif surrounding rs1247117 would be strengthened by the risk G allele. Therefore, we designed biotinylated DNA probes containing two tandem 25-bp regions centered on reference G or alternative A allele-containing rs1247117 to test this hypothesis (Fig. 5c). Using biotinylated probes, we performed an in vitro DNA-affinity pulldown from Nalm6 nuclear lysate and found that while PU.1 was indeed bound to the alternative A allele, PU.1 was more robustly bound to the reference G allele at rs1247117 (Fig. 5d). To further assess the impact of the rs1247117 allele on PU.1 binding, we changed the Nalm6 allele from A to G using CRISPR/Cas9 (Fig. 5e; AA = parental genotype, GG = mutated genotype). We used ChIP-PCR to determine that PU.1 binding was increased with the G allele relative to the A allele at the CRE containing rs1247117 in two A > G Nalm6 clones across 3 unique primer sets within the PU.1 peak at rs1247117 that was detected in Nalm6 cells (Fig. 5f). We then asked if transposase accessibility was also increased at the CRE containing rs1247117 when the G allele was present. Using ATAC-seq, we found that accessibility was indeed increased at rs1247117 in mutated Nalm6 cells with the G allele when compared to the parental Nalm6 cells containing the A allele (Fig. 5g). These data suggest that the risk G allele increases genomic accessibility and the affinity of PU.1 binding at rs1247117 relative to the alternative A allele.
We were next interested in how allele-specific PU.1 binding at rs1247117 was related to the expression of the protein encoded by the connected gene, EIF3A. We found that the G allele, which increased recruitment of PU.1, resulted in decreased expression of EIF3A when compared to Nalm6 cells with the A allele (Fig. 5h). These data suggest that PU.1 recruitment to the CRE containing rs1247117 results in a net-repressive effect on EIF3A protein levels, and that less PU.1 recruitment with the A allele results in greater EIF3A expression.
Deletion of CREs containing top MPRA SNVs demonstrates their impact on drug sensitivity
Clonal selection can lead to the accumulation of random SNVs and even larger structural variations65 that can confound functional interpretation of more complex trans phenotypic effects. Therefore, to examine the connection between rs1247117 and the persistence of MRD after induction chemotherapy, we decided to use CRISPR/Cas9 to delete the CRE containing rs1247117 in heterogeneous cell pools of Nalm6 and SUPB15 cells (rs1247117 del) to avoid clonal selection (Fig. 6a, b, Supplementary Fig. 7a). Given that loss of the CRE containing rs1247117 would abolish PU.1 recruitment at this region, we hypothesized that rs1247117 del would result in increased EIF3A expression. Accordingly, we found that EIF3A expression was elevated in rs1247117 del cells relative to parental Nalm6 and SUPB15 cells, respectively (Fig. 6c, d, Supplementary Fig. 7b), further supporting an inverse relationship between PU.1 binding at rs1247117 and EIF3A expression.
Because the risk G allele at rs1247117 was also associated with vincristine resistance in primary ALL cells from patients, we additionally sought to determine the impact of the CRE deletion containing rs1247117 on cellular response to vincristine treatment. We hypothesized that because the risk G allele is associated with enhanced PU.1 binding and resistance to vincristine, complete disruption of PU.1 binding in Nalm6 cells harboring the CRE deletion would show increased sensitivity to vincristine relative to parental Nalm6 cells. As predicted, Nalm6 cells with the CRE deletion exhibited significantly increased sensitivity to vincristine across a range of concentrations after 24, 48, and 72 hours of treatment (Fig. 6e–g), and we found consistent effects on cell viability in SUPB15 cells (Supplementary Fig. 7c). Consistent with enhanced sensitivity to vincristine, we also found increased caspase 3/7 activity in rs1247117 del Nalm6 cells relative to parental Nalm6 cells after 72hrs and across a range of vincristine concentrations (Fig. 6h). These data suggest that a functional regulatory variant alters the binding affinity of a key transcription factor, PU.1, and disruption of this locus impacts EIF3A expression and vincristine sensitivity in ALL cells. To further validate our methodology utilizing CRISPR/Cas9 to delete CREs, we deleted CREs spanning two additional top variants, rs7426865 and rs12660691 (see Fig. 4e), that was associated with the ex vivo resistance to 6-mercaptopurine and dexamethasone, respectively, in primary ALL cells. Deletion of these CREs also impacted protein expression and sensitivity to the associated chemotherapeutic agent, thereby supporting our functional approach (Supplementary Figs. 8 and 9).
We next wanted to connect EIF3A directly to vincristine resistance. Given that EIF3A is an essential gene per the Broad Institute’s DepMap, we opted to test the hypothesis EIF3A overexpression alone was sufficient to impact the Nalm6 cell response to vincristine. We, therefore, used lentiviral transduction to overexpress EIF3A in Nalm6 cells and compared EIF3A overexpression (EIF3A OE) cells to control infected cells (Nalm6 WT, Supplementary Fig. 10a). Using two independent infections of EIF3A OE, we found that at 48 hr and 72 hr, EIF3A OE cells were more sensitive to vincristine than Nalm6 WT cells (Supplementary Fig. 10b). These data suggest that EIF3A expression impacts the ALL cell response to vincristine, with higher expression sensitizing cells to the drug, and further establishes this gene as the likely target of the association.
Discussion
Using MPRA, we systematically interrogated the functional effects of inherited noncoding variation associated with relapse, persistence of MRD after induction chemotherapy, and/or ex vivo chemotherapeutic drug resistance in childhood ALL. We refined our search to regulatory variants that were found in accessible chromatin sites in 161 ALL cell models, including primary ALL cells from patients, PDXs, and ALL cell lines, as those noncoding regions were likely to be participating in transcriptional regulation. Using MPRA we identified 556 functional regulatory variants showing reproducible and concordant changes in an allele-specific gene regulatory activity. To further explore the impact of these variants on gene regulation in ALL cell lines, we selected a subset of functional regulatory variants from MPRA that were within an accessible chromatin site in an ALL cell line. We overcame difficulties in associating promoter-distal functional regulatory variants with gene targets using promoter CHiC and found 19 variants with robust looping to a distal promoter, as well as 35 promoter-associated functional regulatory variants.
We identified rs1247117 as the top functional regulatory variant, showing the highest geometric mean of differential transcription activity, which was identified in 9 of 10 ALL cell lines assayed by MPRA. We found that the allele present at rs1247117 was a determinant of PU.1 transcription factor binding, with the risk G allele leading to greater chromatin accessibility and PU.1 binding affinity. Interestingly, the allele-specific activities as measured by MPRA and traditional dual-luciferase reporter assays suggest that the reference G allele at rs1247117 stimulates transcription more than the alternative A allele, and we suspect this is driven by greater PU.1 binding affinity in these episomal assays. However, our endogenous genetic manipulation that disrupted PU.1 binding at this locus in Nalm6 cells suggests that EIF3A expression is driven inversely to PU.1 binding. Moreover, complete disruption of PU.1 binding resulted in greater sensitivity to vincristine, which is consistent with the risk G allele contributing to both greater PU.1 binding affinity and vincristine resistance in primary ALL cells from patients. This discrepancy may be due to the ability of PU.1 to act in an activating or repressing manner on gene expression dependent on its genomic context and other transcriptional regulators present66,67. While not addressed within the scope of this work, our hypothesis is that MPRA and luciferase reporter assays, which are episomal and utilize a non-native minimal promoter, detected transcription-activating PU.1 activities rather than the PU.1 repressive activities we detect within the endogenous locus. Collectively, these observations stress the importance of performing subsequent functional follow-up experimentation within an endogenous sequence context.
The rs1247117 risk G allele frequency is ~20-30% in African (33%), American (29%), and East Asian (23%) populations, and only ~10% in Europeans (10%) and South Asians (11%). Notably, vincristine-associated neurotoxicity is more common in Europeans compared to African Americans68. However, this effect has been attributed to greater cytochrome P450 3A5 (CYP3A5) allele expression in African Americans compared to Europeans. Importantly, this variant was originally associated with the persistence of MRD after induction chemotherapy3, and numerous reports demonstrate poorer outcomes in patients of African and American ancestries compared to White populations69,70,71,72,73. Overall, our results are consistent with these studies and suggest that greater resistance to vincristine, which is typically given during induction chemotherapy, is a potential link between the risk G allele at rs1247117 and the persistence of MRD in patients after induction chemotherapy.
Although the risk G allele at rs1247117 is associated with decreased EIF3A expression and increased risk of MRD after induction chemotherapy and vincristine resistance, it remains unclear why EIF3A expression might impact vincristine efficacy. EIF3A expression has been correlated with cell cycle progression, and others have shown that EIF3A knockdown leads to decreased cell proliferation74. However, vincristine can act on microtubules to rapidly kill cells during G1 and later the mitotic spindle to arrest cells during metaphase, so increased EIF3A expression may facilitate cell cycle progression, thus increasing the rate at which metaphase mitotic spindles are disrupted by vincristine75. EIF3A expression has also been previously linked to chemotherapeutic sensitivity in both melanoma and lung cancer76,77. EIF3A expression led to decreased phosphorylation of ERK, supporting the effect of vemurafenib-induced MAP kinase signaling blockade, while EIF3A loss led to sustained activation of ERK and therapeutic resistance76. Interestingly, ERK activation is important in G1/S progression, and therefore, it follows that EIF3A-dependent inhibition of ERK may support the rapid killing of cells in the G1 phase shortly after initial vincristine treatment75,78. This notion is supported by significantly greater sensitivity to vincristine of ALL cells harboring greater EIF3A expression through disruption of a distal CRE after just 24 hours of treatment.
The regulatory variants assayed in this study were originally discovered from GWAS in patient samples, and most of our functional regulatory variant hits from MPRA were present in accessible chromatin sites found only in primary ALL cells from patients. Consequently, these data highlight both substantial differences in the chromatin landscape between immortalized cell lines and primary cells and a limitation of our study that relied on the functional exploration of top regulatory variants in ALL cell line models. An optimal approach would be to validate top functional regulatory variants in patient samples; however, this is not currently feasible due to the limited duration of patient sample viability in culture for genetic manipulation. Nonetheless, future implementation of promoter CHiC in patient samples can be used to map gene connectivity of promoter-distal functional regulatory variants found only in primary cells, and these gene targets can then be genetically disrupted in ALL cell line models for functional validation.
We acknowledge that our MPRA analysis only provides information on the allele-specific gene regulatory effects of these variants, and any connection with pharmacological effects would require additional validation, as we have provided and demonstrated with experimentation. Additionally, because genotyping was performed in primary ALL cells for variants associated with ex vivo drug resistance, we cannot exclude the possibility that some of our tested variants are somatic and not germline. Nonetheless, because we were comprehensive and provided extensive functional data across diverse ALL cell samples for this large set of variants, we believe that our rich dataset can additionally be utilized as an important resource for future GWAS studies on ALL pharmacogenomics. Namely, as future clinical trials are performed and additional GWAS variants impacting clinical phenotypes are discovered, our dataset can be queried to determine if newly discovered variants harbor functional effects on gene expression.
This translational work represents the largest functional investigation of inherited noncoding variation that is associated with diverse pharmacological traits in ALL to date. Our study identified hundreds of functional regulatory variants with significant, reproducible, and concordant allele-specific effects on gene expression, and further connected gene regulatory disruptions to differences in chemotherapy response through alterations in antileukemic drug sensitivity in ALL cells. Collectively, these data support the importance of noncoding, gene regulatory disruptions in the pharmacogenomics of ALL treatments. The further functional investigation of these regulatory variants and the discovery of additional inherited variants impacting therapeutic outcomes can be used by clinicians to tailor therapies based on a patient’s unique genetic makeup through precision or personalized medicine.
Methods
Ethics
This study complies with all relevant ethical regulations, and the relevant study protocols were approved by the institutional review boards at St. Jude Children’s Research Hospital and Children’s Hospital of Philadelphia, as well as the Institutional Animal Care and Use Committee of St. Jude.
Patient samples and consent
All patients or their legal guardians provided written informed consent. The use of these samples was approved by the institutional review board at St. Jude Children’s Research Hospital. Ex vivo drug sensitivity assay and genotyping datasets were obtained from primary ALL cells of patients enrolled in St. Jude Children’s Research Hospital (Memphis, Tennessee) Total Therapy XV (TOTXV, NCT00137111) and Total Therapy XVI (TOTXVI, NCT00549848) protocols10,21. RNA-seq data was obtained from primary ALL cells of patients enrolled on TOTXVI. Patient sample ATAC-seq data was obtained from published studies55,56. No associations between survival or treatment response and next-generation sequencing data were performed on samples from patients who are enrolled in ongoing clinical trials. No considerations for sex and/or gender were made for this study, and the pharmacogenomic traits were identified regardless of sex and or gender.
Selection of SNVs impacting treatment outcome in patients from published GWAS
We chose 13 relapse SNVs with p < 1 × 10−5 from Yang et al.4, 19 ancestry-specific SNVs relapse SNPs associated with relapse in both discovery and replication ALL patient cohorts (p < 0.05) from Karol et al.5 and 3 SNVs associated with persistence of MRD with p < 1 × 10−6 from Yang et al.3 (n = 35). In addition, we chose all SNVs (n = 126; Karol et al.5 n = 104, Yang et al.4 n = 10 and Yang et al.3 n = 12) from these GWAS with nominal genome-wide association (p < 0.05) but that were additionally associated with ex vivo drug resistance (p < 0.05) in primary ALL cells from patients. Relapse SNVs additionally associated with ex vivo drug resistance were already identified by Karol et al.5. GWAS for ex vivo drug resistance (p < 0.05) using primary ALL cells from patients enrolled on TOTXV and TOTXVI protocols were performed on SNVs identified in Yang et al.4 and Yang et al.3 (see Methods below).
Ex vivo drug sensitivity assays in primary ALL cells
Ex vivo drug sensitivity assay data was previously published10,21. The ex vivo drug sensitivity assays are previously described21. Briefly, primary leukemia cells were isolated from the bone marrow or peripheral blood of newly diagnosed ALL patients from St. Jude Total Therapy XVI protocol (TOTXVI, NCT00549848) and tested for antileukemic drug sensitivity by a 96-hour MTT assay using a range of drug concentrations. Primary ALL cells were treated with prednisolone (n = 320), dexamethasone (n = 312), bacterially derived L-asparaginase (n = 335), vincristine (n = 323), 6-mercaptopurine (n = 344) and 6-thioguanine (n = 325). Following drug treatment, the lethal concentration resulting in 50% viability (LC50) was calculated for each patient sample.
Genotyping in primary ALL cells
Genotyping data was previously published21. Briefly, genomic DNA was collected from primary ALL cells of patients enrolled on TOTXV and TOTXVI protocols using Blood and Cell Culture DNA kit (Qiagen), and SNP genotyping was performed using the Affymetrix Gene ChIP Human Mapping 500 K array or Affymetrix Genome-Wide Human SNP 6.0 array. Genotypes were called BRLMM algorithm in the Affymetrix GTYPE software (http://www.affymetrix.com/products/software/specific/gtype.affx) as previously described79. We excluded SNVs for call rates <95% among patients or minor allele frequencies <1%. SNPs were additionally filtered based on call rate and Hardy Weinberg equilibrium (p > 1e-4). Genome-wide genotypes were imputed using Topmed imputation server.
Ex vivo drug resistance GWAS
Imputed genotypes with r2 ≥ 0.6 were tested for their association with LC50. To test the association between LC50 and genotype, LC50 were first log10 transformed and treated as a continuous dependent variable and genotype as independent variable. Genotypes were coded as 0, 1, and 2. The association between LC50 and genotype was tested using a general linear regression model for additive genotypic effect using PLINK. Two-tailed p-values were generated using a Wald test. Top 5 principal components were included as covariates in the regression model to control for population stratification. Ex vivo drug resistance GWAS was performed using primary ALL cells from patients enrolled on TOTXVI protocol. All statistical analysis was performed in R v4.0.2. SNVs that were nominally associated with ex vivo drug resistance (p < 0.05) and that were also identified as eQTLs in the same patient cohort (p < 0.05; see Methods below) were included in the MPRA analysis. For ex vivo drug resistance associations with treatment outcome SNVs (see Methods above), GWAS was performed from patients enrolled on TOTXV and TOTXVI protocols separately, and the GWAS results were then combined through meta-analysis using METAL.
Cell lines, cell culture, and authentication
Leukemia cell lines (Nalm6 DSMZ ACC128, Jurkat DSMZ ACC282, B-ALL-1 DSMZ ACC742, 697 DSMZ ACC42, CEM DSMZ ACC240, REH DSMZ ACC22, P12-Ichikawa DSMZ ACC34, SEM DSMZ ACC546, Loucy DSMZ ACC394, DND41 DSMZ ACC525, HSB2 DSMZ ACC435, MOLT16 DSMZ ACC29, RS411 DSMZ ACC508, SUPB15 DSMZ ACC389) were cultured in RPMI 1640 + 10% fetal bovine serum. Cells were not cultured beyond 25 passages after we received them for experiments in this work. PCR-based mycoplasma testing determined that the cell lines were negative for mycoplasma contamination. STR profiling was confirmed before freezing down cell aliquots before passage 5 after we received them.
eQTL mapping in primary ALL cells
RNA-seq data was obtained from a previous publication21. Briefly, total RNA from primary ALL cells was isolated using RNAeasy Mini kit (Qiagen), and mRNA sequencing using an Illumina HiSeq platform was performed by the Hartwell Center for Bioinformatics and Biotechnology at St. Jude Children’s Research Hospital. Genotype information and RNA expression data from patients enrolled on TOTXVI protocol were correlated to identify eQTLs. QTL mapping was performed using multiple linear regression and log-transformed FPKM gene expression as the dependent variable and genotype as the independent variable. Genotypes were coded as 0, 1, and 2. Patient genetic ancestry was included in the linear model as covariates. Two-tailed p-values were generated using a Wald test. All statistical analysis was performed in R v4.0.2.
Quantitative real-time PCR (qPCR)
See Supplemental Methods for more details.
Dual-luciferase reporter assays
See Supplemental Methods for more details.
MPRAs
MPRA oligo design, plasmid library construction, plasmid transfection, sequencing, and analysis are provided in Supplemental Methods.
Epigenomic profiling in ALL cells
See Supplemental Methods for more details.
PU.1 in vitro binding affinity assay
DNA pulldown assay was adopted from the previous article and performed using a modified protocol80. See Supplemental Methods for more details.
CRISPR/Cas9 deletion
rs1247117 deletions in Nalm6 were generated using CRISPR-Cas9 technology. See Supplemental Methods for more details.
Nalm6 vincristine sensitivity assays
Drug viability assays were performed as previously with slight modification22. Additional information is provided in Supplemental Methods.
Statistical analysis
See Supplemental Methods for more details on statistical analyses performed.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The ATAC-seq, promoter CHiC, HiChIP, RNA Pol II, and PU.1 genomic binding data generated in this study are publicly available in the GEO database under accession code GSE224204. The fresh patient sample ATAC-seq data used in this study are publicly available in the GEO database under accession code GSE226400. The frozen patient sample ATAC-seq data used in this study are publicly available in the GEO database under accession code GSE161501. H3K27Ac ChIP-seq data, “GSE175482 Nalm6 H3K27ac 0 hr merged.bw”, used in this study are publicly available in the GEO database under accession code GSE175482. All other data generated in this study are provided in Supplementary Information and Source Data file. Source data are provided with this paper.
References
Hunger, S. P. & Mullighan, C. G. Acute lymphoblastic leukemia in children. N. Engl. J. Med. 373, 1541–1552 (2015).
Oriol, A. et al. Outcome after relapse of acute lymphoblastic leukemia in adult patients included in four consecutive risk-adapted trials by the PETHEMA Study Group. Haematologica 95, 589–596 (2010).
Yang, J. J. et al. Genome-wide interrogation of germline genetic variation associated with treatment response in childhood acute lymphoblastic leukemia. JAMA 301, 393–403 (2009).
Yang, J. J. et al. Genome-wide association study identifies germline polymorphisms associated with relapse of childhood acute lymphoblastic leukemia. Blood 120, 4197–4204 (2012).
Karol, S. E. et al. Genetics of ancestry-specific risk for relapse in acute lymphoblastic leukemia. Leukemia 31, 1325–1332 (2017).
Cave, H. et al. Clinical significance of minimal residual disease in childhood acute lymphoblastic leukemia. European Organization for Research and Treatment of Cancer–Childhood Leukemia Cooperative Group. N. Engl. J. Med. 339, 591–598 (1998).
Borowitz, M. J. et al. Minimal residual disease detection in childhood precursor-B-cell acute lymphoblastic leukemia: relation to other risk factors. A Children’s Oncology Group study. Leukemia 17, 1566–1572 (2003).
Coustan-Smith, E. et al. Clinical importance of minimal residual disease in childhood acute lymphoblastic leukemia. Blood 96, 2691–2696 (2000).
Zhou, J. et al. Quantitative analysis of minimal residual disease predicts relapse in children with B-lineage acute lymphoblastic leukemia in DFCI ALL Consortium Protocol 95-01. Blood 110, 1607–1611 (2007).
Lee, S. H. R. et al. Pharmacotypes across the genomic landscape of pediatric acute lymphoblastic leukemia and impact on treatment response. Nat. Med. 29, 170–179 (2023).
Pieters, R. et al. Relation of cellular drug resistance to long-term clinical outcome in childhood acute lymphoblastic leukaemia. Lancet 338, 399–403 (1991).
Bosanquet, A. G. Correlations between therapeutic response of leukaemias and in-vitro drug-sensitivity assay. Lancet 337, 711–714 (1991).
Hwang, W. S., Chen, L. M., Huang, S. H., Wang, C. C. & Tseng, M. T. Prediction of chemotherapy response in human leukemia using in vitro chemosensitivity test. Leuk. Res. 17, 685–688 (1993).
Pieters, R., Kaspers, G. J., Klumper, E. & Veerman, A. J. Clinical relevance of in vitro drug resistance testing in childhood acute lymphoblastic leukemia: the state of the art. Med Pediatr. Oncol. 22, 299–308 (1994).
Klumper, E. et al. In vitro cellular drug resistance in children with relapsed/refractory acute lymphoblastic leukemia. Blood 86, 3861–3868 (1995).
Hongo, T., Yajima, S., Sakurai, M., Horikoshi, Y. & Hanada, R. In vitro drug sensitivity testing can predict induction failure and early relapse of childhood acute lymphoblastic leukemia. Blood 89, 2959–2965 (1997).
Kaspers, G. J. et al. In vitro cellular drug resistance and prognosis in newly diagnosed childhood acute lymphoblastic leukemia. Blood 90, 2723–2729 (1997).
Den Boer, M. L. et al. Patient stratification based on prednisolone-vincristine-asparaginase resistance profiles in children with acute lymphoblastic leukemia. J. Clin. Oncol. 21, 3262–3268 (2003).
Frost, B. M. et al. Increased in vitro cellular drug resistance is related to poor outcome in high-risk childhood acute lymphoblastic leukaemia. Br. J. Haematol. 122, 376–385 (2003).
Holleman, A. et al. Gene-expression patterns in drug-resistant acute lymphoblastic leukemia cells and response to treatment. N. Engl. J. Med 351, 533–542 (2004).
Autry, R. J. et al. Integrative genomic analyses reveal mechanisms of glucocorticoid resistance in acute lymphoblastic leukemia. Nat. Cancer 1, 329–344 (2020).
Bergeron, B. P. et al. Epigenomic profiling of glucocorticoid responses identifies cis-regulatory disruptions impacting steroid resistance in childhood acute lymphoblastic leukemia. Leukemia 36, 2374–2383 (2022).
Smith, R. P., Lam, E. T., Markova, S., Yee, S. W. & Ahituv, N. Pharmacogene regulatory elements: from discovery to applications. Genome Med. 4, 45 (2012).
Luizon, M. R. & Ahituv, N. Uncovering drug-responsive regulatory elements. Pharmacogenomics 16, 1829–1841 (2015).
Sakabe, N. J., Savic, D. & Nobrega, M. A. Transcriptional enhancers in development and disease. Genome Biol. 13, 238 (2012).
White, M. A., Myers, C. A., Corbo, J. C. & Cohen, B. A. Massively parallel in vivo enhancer assay reveals that highly local features determine the cis-regulatory function of ChIP-seq peaks. Proc. Natl Acad. Sci. USA 110, 11952–11957 (2013).
Kheradpour, P. et al. Systematic dissection of regulatory motifs in 2000 predicted human enhancers using a massively parallel reporter assay. Genome Res. 23, 800–811 (2013).
Savic, D. et al. Promoter-distal RNA polymerase II binding discriminates active from inactive CCAAT/ enhancer-binding protein beta binding sites. Genome Res. 25, 1791–1800 (2015).
Ulirsch, J. C. et al. Systematic functional dissection of common genetic variation affecting red blood cell traits. Cell 165, 1530–1545 (2016).
Kircher, M. et al. Saturation mutagenesis of twenty disease-associated regulatory elements at single base-pair resolution. Nat. Commun. 10, 3583 (2019).
Choi, J. et al. Massively parallel reporter assays of melanoma risk variants identify MX2 as a gene promoting melanoma. Nat. Commun. 11, 2718 (2020).
Joslin, A. C. et al. A functional genomics pipeline identifies pleiotropy and cross-tissue effects within obesity-associated GWAS loci. Nat. Commun. 12, 5253 (2021).
Ajore, R. et al. Functional dissection of inherited non-coding variation influencing multiple myeloma risk. Nat. Commun. 13, 151 (2022).
Long, E. et al. Massively parallel reporter assays and variant scoring identified functional variants and target genes for melanoma loci and highlighted cell-type specificity. Am. J. Hum. Genet. 109, 2210–2229 (2022).
Klein, J. C. et al. Functional testing of thousands of osteoarthritis-associated variants for regulatory activity. Nat. Commun. 10, 2434 (2019).
Smemo, S. et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature 507, 371–375 (2014).
Consortium, G. T. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
van Steensel, B. & Dekker, J. Genomics tools for unraveling chromosome architecture. Nat. Biotechnol. 28, 1089–1095 (2010).
Mifsud, B. et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598–606 (2015).
Montefiori, L. E. et al. A promoter interaction map for cardiovascular disease genetics. Elife 7 https://doi.org/10.7554/eLife.35788 (2018).
Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384.e1319 (2016).
Thulson, E. et al. 3D chromatin structure in chondrocytes identifies putative osteoarthritis risk genes. Genetics 222 https://doi.org/10.1093/genetics/iyac141 (2022).
Shi, C. et al. Chromatin looping links target genes with genetic risk loci for dermatological traits. J. Invest. Dermatol. 141, 1975–1984 (2021).
Perez-Andreu, V. et al. Inherited GATA3 variants are associated with Ph-like childhood acute lymphoblastic leukemia and risk of relapse. Nat. Genet. 45, 1494–1498 (2013).
Perez-Andreu, V. et al. A genome-wide association study of susceptibility to acute lymphoblastic leukemia in adolescents and young adults. Blood 125, 680–686 (2015).
Xu, H. et al. Inherited coding variants at the CDKN2A locus influence susceptibility to acute lymphoblastic leukaemia in children. Nat. Commun. 6, 7553 (2015).
Hungate, E. A. et al. A variant at 9p21.3 functionally implicates CDKN2B in paediatric B-cell precursor acute lymphoblastic leukaemia aetiology. Nat. Commun. 7, 10635 (2016).
Vijayakrishnan, J. et al. Genome-wide association study identifies susceptibility loci for B-cell childhood acute lymphoblastic leukemia. Nat. Commun. 9, 1340 (2018).
Vijayakrishnan, J. et al. Identification of four novel associations for B-cell acute lymphoblastic leukaemia risk. Nat. Commun. 10, 5348 (2019).
Vijayakrishnan, J. et al. The 9p21.3 risk of childhood acute lymphoblastic leukaemia is explained by a rare high-impact variant in CDKN2A. Sci. Rep. 5, 15065 (2015).
Studd, J. B. et al. Genetic and regulatory mechanism of susceptibility to high-hyperdiploid acute lymphoblastic leukaemia at 10p21.2. Nat. Commun. 8, 14616 (2017).
Yang, H. et al. Noncoding genetic variation in GATA3 increases acute lymphoblastic leukemia risk through local and global changes in chromatin conformation. Nat. Genet. 54, 170–179 (2022).
Zhao, X. et al. Molecular mechanisms of ARID5B-mediated genetic susceptibility to acute lymphoblastic leukemia. J. Natl. Cancer Inst. 114, 1287–1295 (2022).
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
Diedrich, J. D. et al. Profiling chromatin accessibility in pediatric acute lymphoblastic leukemia identifies subtype-specific chromatin landscapes and gene regulatory networks. Leukemia 35, 3078–3091 (2021).
Barnett, K. R. et al. Epigenomic mapping reveals distinct B cell acute lymphoblastic leukemia chromatin architectures and regulators. Cell Genom. 3, 100442 (2023).
Klemm, S. L., Shipony, Z. & Greenleaf, W. J. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 20, 207–220 (2019).
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 919–922 (2016).
Dryden, N. H. et al. Unbiased analysis of potential targets of breast cancer susceptibility loci by Capture Hi-C. Genome Res. 24, 1854–1868 (2014).
Jager, R. et al. Capture Hi-C identifies the chromatin interactome of colorectal cancer risk loci. Nat. Commun. 6, 6178 (2015).
Martin, P. et al. Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci. Nat. Commun. 6, 10069 (2015).
McGovern, A. et al. Capture Hi-C identifies a novel causal gene, IL20RA, in the pan-autoimmune genetic susceptibility region 6q23. Genome Biol. 17, 212 (2016).
Skene, P. J. & Henikoff, S. An efficient targeted nuclease strategy for high-resolution mapping of DNA binding sites. Elife 6 https://doi.org/10.7554/eLife.21856 (2017).
Panda, A. et al. Clonally selected lines after CRISPR-Cas editing are not isogenic. CRISPR J. 6, 176–182 (2023).
Hosokawa, H. et al. Transcription factor PU.1 represses and activates gene expression in early T cells by redirecting partner transcription factor binding. Immunity 49, 782 (2018).
Carey, H. A. et al. Enhancer variants reveal a conserved transcription factor network governed by PU.1 during osteoclast differentiation. Bone Res. 6, 8 (2018).
Renbarger, J. L., McCammack, K. C., Rouse, C. E. & Hall, S. D. Effect of race on vincristine-associated neurotoxicity in pediatric acute lymphoblastic leukemia patients. Pediatr. Blood Cancer 50, 769–771 (2008).
Yang, J. J. et al. Ancestry and pharmacogenomics of relapse in acute lymphoblastic leukemia. Nat. Genet 43, 237–241 (2011).
Kadan-Lottick, N. S., Ness, K. K., Bhatia, S. & Gurney, J. G. Survival variability by race and ethnicity in childhood acute lymphoblastic leukemia. JAMA 290, 2008–2014 (2003).
Bhatia, S. et al. Racial and ethnic differences in survival of children with acute lymphoblastic leukemia. Blood 100, 1957–1964 (2002).
Pollock, B. H. et al. Racial differences in the survival of childhood B-precursor acute lymphoblastic leukemia: a Pediatric Oncology Group Study. J. Clin. Oncol. 18, 813–823 (2000).
Pui, C. H. et al. Results of therapy for acute lymphoblastic leukemia in black and white children. JAMA 290, 2001–2007 (2003).
Dong, Z. et al. Role of eIF3a in regulating cell cycle progression. Exp. Cell Res. 315, 1889–1894 (2009).
Kothari, A., Hittelman, W. N. & Chambers, T. C. Cell cycle–dependent mechanisms underlie vincristine-induced death of primary acute lymphoblastic leukemia cells. Cancer Res. 76, 3553–3561 (2016).
Jiang, S.-L. et al. The downregulation of eIF3a contributes to vemurafenib resistance in melanoma by activating ERK via PPP2R1B. Front. Pharmacol. 12 https://doi.org/10.3389/fphar.2021.720619 (2021).
Yin, J.-Y. et al. Effect of eIF3a on response of lung cancer patients to platinum-based chemotherapy by regulating DNA Repair. Clin. Cancer Res. 17, 4600–4609 (2011).
Torii, S., Yamamoto, T., Tsuchiya, Y. & Nishida, E. ERK MAP kinase in G1 cell cycle progression and cancer. Cancer Sci. 97, 697–702 (2006).
French, D. et al. Acquired variation outweighs inherited variation in whole genome analysis of methotrexate polyglutamate accumulation in leukemia. Blood 113, 4512–4520 (2009).
Makowski, M. M. et al. An interaction proteomics survey of transcription factor binding at recurrent TERT promoter mutations. Proteomics 16, 417–426 (2016).
Acknowledgements
We would like to thank the Hartwell Center at St. Jude for ATAC-seq, ChIP-seq, Cut & Run, promoter CHiC, HiChIP, and MPRA next-generation sequencing. We would also like to thank Jeremy Hunt and Brandon Smart for their technical support. This work was performed at St. Jude Children’s Research Hospital and was also conducted in part by the ECOG-ACRIN Cancer Research Group. This work is supported by the National Cancer Institute (R01CA234490—Daniel Savic, P30CA021765—St. Jude Children’s Research Hospital, U10CA180820—ECOG-ACRIN Medical Research Foundation, UG1CA232760—MAYO Clinic, Rochester and UG1CA189859—Montefiore Medical Center), the National Institute of General Medical Studies (P50GM115279—St. Jude Children’s Research Hospital) and the American Lebanese Syrian Associated Charities (ALSAC). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author information
Authors and Affiliations
Contributions
Kashi R.B., R.J.M.: investigation, writing—original draft, review, and editing, visualization, supervision, methodology, conceptualization, Kelly R.B.: formal analysis, data curation, D.C.F., B.S.H., J.D.D., B.P.B., S.Y.: investigation, W.Y., C.C.: formal analysis, K.R.C., C.S.M., E.J., S.M.K., W.S., H.I., S.J., C.P., J.J.Y.: resources, E.P., M.R.L., M.V.R., W.E.E.: resources, funding acquisition, S.P.-M.: methodology, D.S.: conceptualization, methodology, supervision, project administration, funding acquisition, writing—original draft, review, and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Martin Walsh, Andrew DeWan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bhattarai, K.R., Mobley, R.J., Barnett, K.R. et al. Investigation of inherited noncoding genetic variation impacting the pharmacogenomics of childhood acute lymphoblastic leukemia treatment. Nat Commun 15, 3681 (2024). https://doi.org/10.1038/s41467-024-48124-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-48124-4
- Springer Nature Limited