Abstract
Despite the large public health toll of smoking, genetic studies of smoking cessation have been limited with few discoveries of risk or protective loci. We investigated common and rare variant associations with success in quitting smoking using a cohort from 8 randomized controlled trials involving 2231 participants and a total of 10,020 common and 24,147 rare variants. We identified 14 novel markers including 6 mapping to genes previously related to psychiatric and substance use disorders, 4 of which were protective (CYP2B6 (rs1175607105), HTR3B (rs1413172952; rs1204720503), rs80210037 on chr15), and 2 of which were associated with reduced cessation (PARP15 (rs2173763), SCL18A2 (rs363222)). The others mapped to areas associated with cancer including FOXP1 (rs1288980) and ZEB1 (rs7349). Network analysis identified significant canonical pathways for the serotonin receptor signaling pathway, nicotine and bupropion metabolism, and several related to tumor suppression. Two novel markers (rs6749438; rs6718083) on chr2 are flanked by genes associated with regulation of bodyweight. The identification of novel loci in this study can provide new targets of pharmacotherapy and inform efforts to develop personalized treatments based on genetic profiles.
Similar content being viewed by others
Introduction
Smoking is a well-established primary risk factor for several types of cancer1, cardiac disease2 and many other chronic illnesses3. It is responsible for nearly 8 million premature deaths each year worldwide (including 1.2 million deaths from second-hand smoke)4, and is the cause of substantial loss in productivity and increased healthcare expenditures in the US5. Several studies6,7,8,9,10,11,12 have shown that multiple aspects of smoking behavior are moderately heritable (~ 50%), including smoking cessation (~ 54%)11, and that this relationship may have increased over time13.
The overwhelming majority of studies relating genetic factors to smoking behavior have utilized large-scale epidemiological cross sectional and cohort samples and have concentrated on behavioral phenotypes that can be readily assessed through questionnaires and single item surveys, such as nicotine dependence, cigarettes per day, heaviness of smoking index, age of initiation and quitting status (current/former smoker). Multiple studies of this type has shown that non-overlapping SNPs from chr15q25.1, within the CHRNA3-CHRNA5 -CHRNB4 (nicotinic acetylcholine receptor) gene cluster, are consistently related to nicotine dependence14,15,16, with rs16969968 (within CHRNA5) having substantial influence, and a second signal tagged by rs680244 (CHRNA3)17. CHRNA3 SNP rs1051730 also shows some of the strongest associations with nicotine dependence (cigarettes per day)18, as do the intronic SNPs rs588765 and rs578776, all of which are highly correlated with rs1696996814,19,20. The rs578776 SNP has demonstrated a protective effect in relation to nicotine dependence (minor allele more frequent in controls than dependent smokers). Joint analyses of rs16969968 and rs3743078 (highly correlated with rs578776) representing the risk and protective haplotypes at the cluster, resulted in a 2.4-fold increase in risk of heavy versus light smoking21. Other SNPs in this region have demonstrated nicotine dependence susceptible and protective haplotypes, and the relationship of these loci with heaviness of smoking is supported in meta-analysis of 34 datasets20. Another meta-analysis, involving 38,602 smokers with European and African origins across 15 studies, re-confirmed the association between smoking and SNPs in this gene cluster but also found that the SNP rs910083 C allele in the DNA methyltransferase 3 beta gene DNMT3B was associated with increased risk of nicotine dependence22. Other cohort studies have identified genetic markers of: 1) tobacco use and nicotine dependence23,24,25,26,27,28 including several from a cross-ancestry analysis of smokers of European and African descent (rs16969968 at CHRNA5, rs13284520 at DBH, rs151176846 at CHRNA4, rs2714700b between MAGI2 and GNAI1, rs1862416 at TENM2)29 and of European and Asian decent (rs6474414 at CHRNB3 and rs1072003 at CHRNA6); 2) number of self-reported quit-attempts, including SNPs rs6298, rs834829 and rs8192729 from HTR1B, NR4A2, and CYP2A6 respectively30; 3) other addictive behaviors, such as alcohol use10,29,31; and 4) psychiatric disorders29,32. Most recently, in a large sample of both current and former smokers (~ 110K) and never smokers (~ 375K), an exome-wide association study (ExWAS) showed that rare predicted loss-of-function and likely deleterious missense variants in CHRNB2 in aggregate were associated with a 35% decreased odds (protective) for smoking more than 10 cigarettes per day. An independent common variant of CHRNB2, rs2072659, also showed a protective effect for heavy smoking33.
GWAS studies of nicotine metabolism and clearance in European ancestry cohorts have shown significant associations between SNPs on chromosome 19 (including CYP2A6, MAP3K10, ADCK4, and CYP2B6) and on chromosome 4 (including TMPRSS11E) and the nicotine metabolism ratio (trans-3’-hydroxycotinine/cotinine or NMR) and between SNPs on chromosomes 9, 4 and 15 (including CHRNB4, CHRNA3, and CHRNA5) and measures of nicotine clearance (cotinine and the sum of cotinine and trans-3’-hydroxcotinine) 34,35. GWAS analysis in smokers of African American ancestry identified multiple independent SNPs at the CYP2A6 chromosome 19 locus and two SNPs on chromosome 2 associated with the NMR; most of these SNPs were not previously identified in European ancestry cohorts.
While important for understanding population level associations between genetics and smoking, the type of studies noted above do not directly assess genetic factors that may drive quitting success during an actual quit attempt, as a smoker makes the transition from smoking to abstinence. The genetic mechanisms underlying the process of smoking cessation and relapse are poorly understood36. National surveys indicate nearly 70% of smokers want to quit smoking37, but despite over 50% of smokers attempting to quit each year, only about 7.5% achieve success annually38. Genetic studies that use a prospective sample of smokers trying to quit may more directly address biological substrates associated with cessation success and help improve treatment outcomes by providing new targets for pharmacotherapy and/or informing efforts at precision medicine that attempt to assign treatments to smokers based on a genetic profile39.
Although fewer in number, prospective studies of smokers attempting to quit have shown some promise in realizing these goals. For example, haplotypes of rs16969968 (CHRNA5) and rs680244 (CHRNA3) have demonstrated an association with abstinence among smokers receiving a placebo vs active pharmacotherapy for smoking cessation40. Moreover, an analysis of eight clinical trials41 found that minor alleles of CHRNA3 rs1051730 and CHRNA5 rs588765 were associated with increased abstinence among smokers receiving nicotine replacement therapy (NRT) but reduced abstinence among those receiving placebo, though these findings were not replicated in later studies42,43. The CHRNB2 SNP rs2072661 has been associated with reduced cessation and the tryptophan 2,3-dioxygease (TDO2) SNP rs10517626 with enhanced cessation in a trial including NRT, placebo and bupropion, with the most pronounced effect of rs2072661 on the bupropion treated smokers44. Similarly in a clinical trial involving varenicline, bupropion or placebo, King and colleagues45 found that CHRNB2 SNPs, most notably rs3811450, and SNPs in the CHRNA5-CHRNA3-CHRNB4 region, e.g., rs7164594, were associated with increased abstinence among varenicline treated smokers; several SNPs from CYP2B6, including rs8109525, were associated with an enhanced response to bupropion specifically, and to overall cessation among all treated smokers.
In a first of its kind genetically informed treatment trial, Chen and colleagues46 examined the treatment response to combined NRT (patch plus lozenge) vs varenicline among smokers stratified by the CHRNA5 SNP rs16969968 (GG vs. AA/GA alleles) at treatment onset. Results showed that among African American smokers, compared with placebo, those with the GG genotype quit significantly more often with NRT but not varenicline, while those with the AA/GA genotype quit significantly more often with varenicline and not NRT. No treatment by genotype interactions were observed for European descent smokers. This group also observed that polygenic risk scores for age of smoking initiation (older) and smoking persistence (past failed attempts) were predictive of abstinence across two prospective treatment trials, though treatment specific interactions were not reported.
In another pioneering trial Lerman and colleagues47 randomly assigned smokers to patch NRT, varenicline or placebo, stratifying by the NMR, and showed that smokers classified as normal metabolizers were more likely to quit using varenicline vs. NRT, while slow metabolizers quit equally often on both medications. NMR is a genetically informed biomarker that encompasses multiple SNPs, particularly within CYP2A6.
While a highly desirable approach for addressing questions related to precision and development of pharmacotherapies, the limitation of studying genetic predictors of smoking cessation in prospective clinical trials is that such studies typically involve much smaller samples than the large-scale epidemiological studies noted above. Moreover, when multiple trials are combined to increase sample size, harmonization of measurements and time points across trials can be problematic. In the current study, we address these potential limitations by combining smoking cessation outcomes from a cohort of 2231 smokers across 8 smoking cessation studies, which shared several common instruments and measurement points and the baseline collection of DNA. To our knowledge, this is the largest prospective sample of its kind. In this paper, we present novel findings relating common and rare variants to cessation success at 6-month post-treatment follow-up, a commonly used standard for measuring long term treatment outcome48. This contrasts with several of the trials reviewed above that focused on abstinence at the end of treatment, typically 12 weeks. The availability of cessation data of all studies at the 6-month time-point and the measurement of abstinence using both self-report and biochemical verification enables us to examine the relationship between key smoking-related genes in an integrated, well-phenotyped, ethnically diverse sample and to evaluate the findings for these traits more systematically than has previously been possible. This study can significantly improve our understanding of the etiology and pathophysiology of this complex phenotype, and aid in prevention and treatment efforts.
Methods
Subjects
The study included smokers who participated in 7 NIH- and 1 CPRIT- (Cancer Prevention Institute of Texas) funded studies of smoking cessation awarded to Drs. Paul Cinciripini and David Wetter, conducted at the University of Texas MD Anderson Cancer Center. Details of the design, recruitment, and inclusion/exclusion criteria for each of the studies have been described in detail elsewhere and include: Breakfree49, CARE50, CASSI51, MIND52, PNS53, QuitRx54, STEPS55 and Two2Quit56 (Grant numbers and ClinicalTrials.gov registration numbers, where required, are provided in the acknowledgements). Participants were recruited from the Houston metropolitan area from a wide variety of sources including local print media, flyers, and collaborations with local healthcare institutions. All studies were prospective smoking cessation clinical trials which shared at a minimum, recruitment of current smokers wanting to quit, exposure to smoking cessation guideline based treatment57 involving behavioral counseling for smoking cessation and pharmacotherapy, common measures of abstinence and 6 month outcome information. Regardless of treatment type, 6-month post-treatment outcome was the primary outcome variable for this analysis. All participants signed an informed consent form that permitted us to collect buccal swab samples and demographic and phenotypic data shared across studies. The study protocols were approved by the Institutional Review Board of MD Anderson in accordance with tenets of the Declaration of Helsinki.
Outcome of interest
Abstinence status was measured at the end of six months using the self-reported 7-day point prevalence (no smoking even a puff in last 7 days) verified by exhaled carbon monoxide (CO) at or below a cutoff of 4 parts per million (ppm). Such a cutoff was recommended to verify smoking abstinence and has been shown to be more accurate than cutoffs of 8 or 10 ppm58,59,60. Based on the affirmative self-report of no smoking in the last 7 days plus CO ≤ 4 ppm, participants were classified as abstainers (i.e., successfully quit smoking). Individuals who reported smoking or had a CO > 4 ppm were classified as nonabstainers.
Covariates
Demographic information (age, gender, race/ethnicity), type of cessation treatment received, and baseline smoking information (e.g., numbers of cigarettes smoked per day), were used as covariates in the analyses. The participants were treated using different medications for smoking cessation, including bupropion, nortriptyline, varenicline, NRT, combination of varenicline and bupropion, and placebo. Because the smoking cessation counseling duration and the pharmacotherapies differed across some of the trials, a study ID and a designator for medication type were included in all analyses as covariates. In addition, covariates for population structure, as described below, were included in the model.
Sequencing and genotyping
The final sample consisted of 2231 participants prospectively enrolled in smoking cessation trials across the 8 studies shown in Table 1, for which both genetic and phenotypic (abstinence) information was available. For each of these individuals we collected DNA samples using buccal swabs, a 30″ mouth rinse using standard travel size bottle of Scope mouthwash (~ 2.5 oz) and processed using genomic DNA purification kits from Qiagen. The sample included a total of 10,020 genetic markers that were derived from both sequencing and genotyping procedures as described below.
Sequencing was carried out using the Illumina Hiseq2000 sequencing system at the Human Genome Sequencing Center, Baylor College of Medicine. We sequenced 55 candidate genes (Supplementary Material Table S1) primarily covering exon regions. Candidate genes were selected based upon literature survey of markers previously associated with smoking phenotypes (including dependence and cessation), other substance abuse and psychiatric disorders. The short sequence reads were filtered by Illumina CASAVA analysis software (v 13.10.01) and mapped to the reference genome using Burrows-Wheeler Aligner (BWA)61 to create a .bam file. Variants were called by Atlas-SNP2 (v1.4.3 r171, includes Atlas-Indel)62 to create a VCF file and these in turn were annotated using the Cassandra pipeline63. The average coverage of the target bases for the samples was 221x. Standard quality assurance and quality control procedures were conducted to detect problems with initial DNA quality, library construction methods, emulsion and bead quality, instrument chemistry and performance during the run and final sequence metrics after completion of the run. The initial result of the sequencing identified a total of 45,365 variants. Those not having a quality score of “PASS”, lower genotyping rate (< 0.95) and failing the Hardy–Weinberg proportion (HWP) test (p < 10–6) were removed, resulting in a total of 5138 common variants with a minor allele frequency (MAF) ≥ 0.05 and 24,197 rare variants (analyzed separately) with a MAF < 0.01 but ≥ 0.0001.
For the genotyping, we used Illumina’s Infinium iSelect Custom Genotyping Chip that included 6839 tagging SNPs. The tagging SNPs selected for this analysis, were derived from the literature identifying genetic markers associated with tobacco use, substance abuse and psychiatric disorders, plus those derived from our sequenced SNPs and included 169 ancestral informative markers. A set of 75 duplicate samples were genotyped to ensure genotyping quality. Illumina GenomeStudio was used for genotype calling based on the GenTrain clustering algorithm64. Cluster boundaries were determined using samples from the study. SNPs were filtered according to GenCall Score (GC score) > 0.15 and a median score of 0.88 using GenomeStudio (v1.9.4). Furthermore, we removed SNPs with a MAF < 0.05, and those that failed HWP test (P value < 10–6). The Genome Reference Consortium Human Build 37 (GRCh37) was used to map the genetic variants.
PLINK65 software (v1.90b3) was used to convert sequencing VCF and GenomeStudio files, and to process basic quality controls. The final analytic sample following quality control and availability of 6-month smoking cessation outcome data (the phenotypic of interest here) was 2231, which included 5138 variants from sequencing and 4882 additional markers from genotyping for a total of 10,020 markers used in the analyses (see Supplementary Information 2). The final composition of genetic information for the 2231 participants used in this analysis, included 1169 that had both sequencing and genotyping data, 439 and 623 with sequencing or genotyping alone, respectively. The mix of subjects in the datasets of two phases were proportional across these subsets.
Population structure
Population structure was assessed using the Structure (v.2.3.4) program66. For assurance of the results, analysis using Admixture (v.1.3.0)67,68 was done on the same data sets. Population reference data from 1000 Genome Project (Phase 3 release, 2504 individuals)69 were used. The reference data contains 2478 unrelated individuals and over 84 million SNPs from 5 major racial groups from 26 geographic locations, African (7 locations), Latin American (4 locations), European (5 locations), East Asian (5 locations), and South Asian (5 locations)70. We first extracted 5317 markers with a minor allele frequency (MAF) ≥ 0.05 from the reference data set, which were overlapping with our data, and had a P value ≥ 1 × 10–6 for the HWP test. A general measurement of informativeness for assignment71 was calculated for each of the 5317 markers using the reference data. A set of 935 SNPs (including 169 ancestry informative markers) with a measurement of informativeness > 0.05 were selected for assessing population structure in our study.
The number of ancestries (K) was estimated with the use of the admixture model, based on the set of 935 SNPs. We considered a range of 5 to 15 for the number of ancestries K. For each of the value’s K, the Markov chain Monte Carlo (MCMC) process ran for 15,000 iterations, among which the first 5000 iterations were used as burn-in process. The likelihoods of the data given different K values (i.e., posterior probabilities) were calculated and the K value which maximized the posterior probability was selected as the number of ancestries in the study population. We thus obtained K = 11 ancestries for our study.
Given 11 ancestries, for each individual, STRUCTURE provided the probability of this individual belonging to each of the ancestry group (i.e., 11 probabilities per individual). One can assign the individual to one of the ancestries based on the highest probability. In our study, we created the population cluster score for each individual based on his/her ancestries corresponding to the three highest probabilities, which provided a higher resolution to classify individuals into different ancestries. The population structure scores created in this way were included in the statistical analysis as a covariate.
Statistical analyses
Statistical analyses were conducted using PLINK65 (v1.90), SKAT-O72,73, R74, SAS 9.3 (SAS Institute, Cary NC) and KING75 (v1.4) software. We used the genotypic data to identify individuals with discordant gender information, duplicates, and closely related individuals. We identified genetically related individuals by estimating the pairwise kinship coefficients using KING (v1.4) software. For any pair of individuals which were duplicates or related (i.e., with allele sharing of > 80%), we excluded the individual with lower call rate. Deviation from HWP for each genetic variant was assessed by 1 degree-of-freedom χ2 test or Fisher’s exact test where an expected cell count was less than five76.
Association analyses for common variants (SNPs; MAF ≥ 0.05) were conducted using multivariable unconditional logistic regression based on a two-sided Wald test implemented in the software PLINK65. We tested each common variant assuming an additive genetic model. Age, gender, study ID, medication type and population cluster were included in the analyses as covariates. For the smoking cessation (abstinence) phenotype, participants with missing information (14% of the sample) were imputed as “smoking” as it is the common practice in smoking cessation studies.
The study data were randomly divided into phase 1 data (70% of the participants) and phase 2 data (30% of the participants). For the joint analysis with pooled data from both phases, we included a fixed indicator as a covariate for the phases to control for the possible confounding effects of phases. We used the standard established threshold of genome-wide significance level of P value 5 × 10–8 to declare statistical significance.
For the association analyses of rare variants (MAF ≤ 0.01), we conducted the gene-based analysis using the optimal Sequence Kernel Association Test (SKAT-O)72,73 , which uses the collapse method to test the joint effect of multiple rare variants within a gene region on a phenotype. Same covariates, including age, sex, study ID, medication type, population cluster, and the indicator for phases (phase 1/phase 2), were included in the analyses as covariates. To account for multiple testing issues, we used the significance level of 9.1 × 10–4 (i.e., 0.05/55) for the gene-based rare-variants genetic association analysis.
Ingenuity pathway analysis
Ingenuity Pathway Analysis (IPA; Ingenuity® Systems, www.ingenuity.com) 77 is a software program employed to connect molecules based on the scientific data in the Ingenuity Knowledge Base, including information on biological interactions and functional annotations78. In this study, we used IPA to further explore the biological mechanism/insight of the genes that harbor the genetic variants identified to be significantly associated with the abstinence phenotype in the association analysis. These genes of interest are denoted as focus genes in IPA. The IPA core analysis function was employed to determine biological functions, search for signaling and metabolic canonical pathways, and generate relevant molecular networks on the basis of the focus genes79,80. IPA creates the biological functions and canonical pathways from the literature, independent of focus genes. Specifically, IPA core analysis compares the focus genes with all build-in canonical pathways and biological functions in the IPA database and identifies the canonical pathways/biological functions, which include genes that overlap with the focus genes. The molecular network is constructed based on the focus genes and the connections in which they function, based on the main assumption that the biological function involves locally dense interactions. The details regarding the network generation algorithm have been described (summarized in the Supplementary Materials)81,82. Importantly, when generating a network, the iterative algorithm attempts to connect additional non-focus genes from its entire database to any of the genes which have already involved in the gene network (focus or non-focus genes) if such genes are more likely to have connections (i.e., biological relationships) with the network. As these non-focus genes are from a background consisting of all genes in the database, the resulting relevant networks may potentially identify additional genes that interact with the focus genes associated with abstinence. These additional genes emerge as potential candidate genes of interest for future investigations of abstinence. The resulting network also presents a bigger view of the genes likely to be interacting and directly or indirectly associated with abstinence. To evaluate the resulting functions, pathways, or networks, P values are calculated using a right-tailed Fisher's exact test, which measures the likelihood that the association between the set of focus genes and a given function/pathway/network is due to random chance81,82.
Results
Characteristics of study populations
The study included 1571 participants (275 abstainers and 1296 non-abstainers) in the phase 1 data; and 660 participants (120 abstainers and 540 non-abstainers) in the phase 2 data (Table 1). In the phase 1 dataset, the distributions of age and cigarettes smoked per day were very similar in the abstainers and non-abstainers: mean age 45.7 (standard deviation [SD] = 11.4) for the abstainers and 43.3 (SD = 10.9) for the non-abstainers; mean cigarettes smoked per day, 19.0 (SD = 9.1) for the abstainers and 20.8 (SD = 10.2) for the non-abstainers. Approximately half of the participants were male (56% for abstainers and 51.8% for non-abstainers). There were more White (both Hispanic and non-Hispanic) participants who were abstainers (70.9%) than non-abstainers (53.5%), while more Black (Hispanic and non-Hispanic) participants were non-abstainers (41.2%) than abstainers (24.8%). More participants were employed in the abstainer group (79.3%) compared with the non-abstainer group (63.6%). More participants had a high school/GED or less education in the non-abstainers (37%); while more participants had a bachelor’s degree or some post-graduate work or above in the abstainers (29.1%). The majority of the participants were treated using NRT for smoking cessation (57.1% for abstainers and 72.7% for non-abstainers).
The population characteristics in the phase 2 dataset were very similar to those in the phase 1 dataset. The abstainer group had similar distributions of age (mean 44.4 [SD = 10.2]), cigarettes smoked per day (mean 19.9 [SD = 10.1]), and sex (53.3% male), compared with the non-abstainer group (mean age 43.3 [SD = 10.9], mean cigarettes per day 20.7 [SD = 10.2], 53.1% male; Table 1). Similarly, in the abstainers, more participants were Hispanic and non-Hispanic White (65%), employed (77.1%) and had a bachelor’s degree or some post-graduate work or above (30.3%). The majority of the participants were treated with NRT for smoking cession (62.5% for abstainers and 72.2% for non-abstainers).
Analyses of common variants
We found 14 genetic variants associated with the abstinence phenotype that met the genome-wide statistical significance threshold (that is, P < 5 × 10–08; Table 2). A Manhattan plot for the joint analysis using data merged from both phases is shown in Supplementary Fig. S1.
Based on the P values using the meta-analysis of data from both phases, the variant rs1175607105 was found to be the strongest statistically significant signal protective for smoking cessation behavior (i.e., OR > 1; likely to quit smoking) (OR = 2.34, 95% CI: 1.83–2.98; P = 9.06 × 10–12). The rs1175607105 localizes to 19q13.2 (41,520,210 bp; Fig. 1A and Table 2) and maps to the gene CYP2B6 (cytochrome P450 family 2 subfamily B member 6).
There were three additional genetic variants identified to be protective for smoking cessation behavior, including rs1413172952 (OR = 1.98, 95% CI: 1.57–2.49; P = 6.1 × 10–9), rs1204720503 (OR = 2.17, 95% CI: 1.65–2.85; P = 2.72 × 10–8) and rs80210037 (OR = 1.91, 95% CI: 1.52–2.40; P = 3.49 × 10–8). The variants rs1413172952 (113,792,339 bp; Fig. 1B and Table 2) and rs1204720503 (113,781,550 bp) localize to 11q23.2 and maps to the gene HTR3B (5-hydroxytryptamine receptor 3B).
In addition, ten risk genetic variants were identified as being significantly associated with the abstinence phenotype (i.e., OR < 1; less likely to quit smoking), including rs2173763 (OR = 0.64, 95% CI: 0.55–0.74; P = 5.88 × 10–10), rs6749438 (OR = 0.55, 95% CI: 0.45–0.67; P = 9.91 × 10–10), rs6718083 (OR = 0.58, 95% CI: 0.49–0.69; P = 1.82 × 10–9), rs7349 (OR = 0.69, 95% CI: 0.61–0.78; P = 2.28 × 10–9), rs6869603 (OR = 0.67, 95% CI: 0.58–0.76; P = 2.35 × 10–9), rs363222 (OR = 0.67, 95% CI: 0.59–0.77; P = 3.73 × 10–9), rs1288980 (OR = 0.65, 95% CI: 0.57–0.76; P = 6.71 × 10–9), rs992528 (OR = 0.70, 95% CI: 0.61–0.79; P = 3.22 × 10–8), rs11064432 (OR = 0.71, 95% CI: 0.63–0.80; P = 4.14 × 10–8) and rs1333758 (OR = 0.61, 95% CI: 0.51–0.73; P = 4.4 × 10–8).
Among the ten significant signals, two variants (rs2173763 and rs1288980) were located on chromosome 3. In particular, the variant rs2173763 localizes to 3q21.1 (122,329,160 bp; Fig. 1C and Table 2) and maps to the intron of the gene PARP15 (poly(ADP-ribose) polymerase family member 15). The variant rs1288980 localizes to 3p13 (71,105,863 bp; Fig. 1D) and maps to the gene FOXP1 (forkhead box P1).
Three variants (rs7349, rs363222, rs992528) were located on chromosome 10. The variant rs7349 localizes to 10p11.22 (31,817,905 bp; Fig. 1E) and maps to the 3’ untranslated region of the gene ZEB1 (zinc finger E-box binding homeobox 1), which has been associated with lung cancer83,84,85. The variant rs363222 localizes to 10q25.3 (119,019,448 bp; Fig. 1F) and maps to the gene SLC18A2 (solute carrier family 18 member A2).
Two variants (rs6749438 and rs6718083) were located on chromosome 2 and are close to each other. The variant rs6749438 localizes to 2p23.3 (25,190,127 bp; Fig. 1G) and maps to gene DNAJC27 (DnaJ heat shock protein family (Hsp40) member C27). The variant rs6718083 localizes to 2p23.3 (25,362,194 bp; Fig. 1G) and maps to the gene EFR3B (EFR3 homolog B). The variant rs11064432 localizes to 12p13.31 (6,968,741 bp; Fig. 1H) and maps to the intron of the gene USP5 (ubiquitin specific peptidase 5). The variant rs1333758 localizes to 13q32.3-q33.1 (101,897,883 bp; Fig. 1I) and maps to the gene NALCN, a gene that belongs to a family of voltage-gated sodium and calcium channels expressed throughout the nervous system86.
The linkage disequilibrium (LD) was assessed for the significant genetic variants (Table 2) in close proximity. No strong LD was observed between pairs of significant variants in proximity: rs6749438 and rs6718083 (\({r}^{2}=0.07\)), rs363222 and rs992528 (\({r}^{2}=0.104\)), and rs1413172952 and rs1204720503 (\({r}^{2}=0.015\)).
We further investigated the LD between the 14 significant genetic variants (Table 2) and the variants associated with smoking cessation from the literature. Specifically, we extracted SNPs associated with smoking cessation from the EBI/NHGRI GWAS Catalog87 as of January 30, 2024, selecting entries referencing "smoking cessation" as the disease/trait. We omitted SNPs from studies that compared current smokers to former smokers because such comparisons do not align with the methodology of our current study. Consequently, we identified three SNPs associated with smoking cessation89,90. We employed LDlink, a comprehensive web-based platform, to investigate the LD92. LDlink utilizes the genomic data from the 1000 Genomes Project, offering a rich repository of human genetic variation across diverse populations. The resulting \({r}^{2}\) values are reported in Supplementary Material Table S2. No strong LD was noted, with \({r}^{2}\) values ranging from < 0.001 to 0.027.
Ingenuity pathway analysis (IPA)
From the analysis of common variants, we identified 10 genes that harbor the genetic variants significantly associated with smoking cessation (Table 2). The 10 identified genes were employed as the focus genes in the IPA core analysis. Significant canonical pathways and biological functions were identified based on the focus genes. As described in the Methods section, the core analysis provides a measure of the association of focus genes of interest with the built-in canonical pathways and biological functions. In particular, the most significant canonical pathway identified was the Serotonin Receptor signaling pathway (P = 1.78 × 10–4), which is relevant in the etiology of neuropsychiatric and mood disorders88. The focus genes were also shown to be potentially related to Bupropion Degradation and Nicotine Degradation pathways (P = 1.15 × 10–2 and 2.33 × 10–2, respectively), which are, in general, related to smoking cessation. The significant P values imply over-representation of focus genes in these pathways, and that the association between focus genes and pathways is non-random.
Furthermore, the IPA core analysis generated a network showing the additional molecules that directly or indirectly relate to or interact with the genes identified through the association analyses (Supplementary Figure S2). The molecules with most interconnections are of interest since the highly connected molecules are considered to be most likely associated with diseases or biological functions79,81. Fourteen molecules with 15 or more interactions, as indicated by the numbers of edges connected to other molecules in the network, were identified and are highlighted in the figure. The most highly connected molecules including several related to cancer etiology (see Discussion) such as: TP53 (tumor protein p53)91; RB1 (RB transcriptional corepressor)93; CDKN2A (cyclin dependent kinase inhibitor 2A)94 and EGFR (epidermal growth factor receptor)95, plus RELA, a subunit of the heterodimeric transcription factor called NF-Kappa-B, related to substance abuse96.
Analyses of rare variants
Based on the gene-based association analyses for rare variants (MAF ≤ 0.01), we observed marginal associations of ADCY5 (P = 1.16 × 10–2) and SLC6A2 (P = 1.72 × 10–2) for smoking abstinence phenotype using data combined from both phases. Gene ADCY5 localizes to 3q21.1 and has been associated with low birth weight and type 2 diabetes97,98. Gene SLC6A2 localizes to 16q12.2 which is associated with norepinephrine transport and bipolar disorder, depression and ADHD99,100,101. Note that these signals were not statistically significant after adjusting for multiple comparisons (a significance threshold of P value ≤ 9.1 × 10–4 corresponding to 0.05/55 candidate genes).
Discussion
This study examined genetic predictors of long-term treatment success (6-months) for smoking cessation that used a prospective sample of 2331 smokers undergoing standard smoking cessation therapy including behavioral counseling and pharmacotherapy. Genotyping involved sequencing of 55 candidate genes previously associated with smoking behavior, other substance abuse and psychiatric disorders, and the 6606 tagging SNPs and 233 AIMS, yielding 10,020 common and 24,147 rare variants of sufficient quality for analysis. We took the approach of engaging in two phases using 70% and 30% of the sample, respectively and present the combined results of all markers exceeding GWAS defined significance levels, associating common and rare variants with the cessation phenotype, while controlling for multiple factors including genomic ancestry, study related factors, and demographics.
Our analysis revealed 14 novel markers not previously identified with smoking cessation phenotype of interest defined in the manuscript (P < 5 × 10–08). When mapping these SNPs to specific genes and regions, two major themes emerged. The first theme highlights shared genetic substrates between abstinence from smoking and selected psychiatric and substance use disorders among 6 of these markers, four of which were protective (OR > 1 favoring smoking cessation). Among them, the variant rs1175607105 produced the strongest signal and maps to the gene CYP2B6 which is the primary enzyme responsible for metabolism of the smoking cessation and antidepressant drug bupropion102,103 but has also been implicated in nicotine metabolism34,35,104,105. While a modest inhibitor of norepinephrine and dopamine reuptake, which may account for its antidepressant effects, bupropion also acts as an antagonist of several nicotine cholinergic receptor subtypes106. The other 3 protective variants include rs1413172952, and rs1204720503 which map to the HTR3B (5-hydroxytryptamine-serotonin receptor 3B) gene and variant rs80210037 on chromosome 15. The HTR3B serotonergic receptor gene has been implicated in longer time to relapse following treatment in a combined analysis of bupropion, varenicline and placebo treated smokers45 and nicotine dependence in a mixed ancestry sample of African and European decent107. This suggests that loci on this gene may be predictive of smoking cessation treatment regardless of the type of pharmacotherapy given and of dependence on nicotine in a mixed ancestry sample. Interestingly, other polymorphisms on this gene have been related to a protective effect for obsessive compulsive disorder108 and major depression109 which, like other psychiatric disorders, have been associated with increased prevalence of smoking110 and a shared causal genetic basis111,112.
Two other novel variants, associated with reduced likelihood of quitting, also mapped to genes with previously noted markers for psychiatric and substance use disorders: rs2173763 maps to PARP15 on which several locations have been associated with a broad mood disorder phenotype (Major Depression and Bipolar disorder)113; and rs363222 maps to SLC18A2 which is associated with monoamine neurotransmitter transport (dopamine, norepinephrine, serotonin). Varenicline is one of our most effective smoking cessation medications114 and acts as a dopamine partial agonist115. Several other loci on this gene have been related to alcohol116, opioid117 and nicotine dependence118,119, and PTSD120. Moreover, while our analyses of rare variants did not yield significant associations that survived correction for multiple comparisons, a strong signal was present for SLC6A2 (norepinephrine transporter) which has been implicated in mood disorders and ADHD99,100,101, both of which are more prevalent among smokers110,121.
Consistent with the relationship between smoking cessation and psychiatric disorders described above for individual markers mapped to specific genes, our IPA of the 10 significant common variants that mapped to specific genes (see Table 2) showed significant canonical pathways for the serotonin receptor signaling pathway and for nicotine and bupropion metabolism. Serotonin reuptake inhibitors are used routinely in the treatment of depression122, while as noted previously, differences in nicotine metabolism47 have been associated with a differential response to NRT, varenicline47, and bupropion123. Interestingly, the drug venlafaxine, a norepinephrine and serotonin reuptake inhibitor, has been associated with increased smoking cessation when combined with NRT124, which is commensurate with the findings noted above for SLC18A2 and SCL6A2. Our IPA network analysis of related molecules also revealed relationship between these genes and RELA, a transcription factor involved in NFkB heterodimer formation, nuclear translocation and activation and previously implicated in drug addiction96.
The second theme among our results points to genetic regions associated with both smoking cessation and cancer pathophysiology (note smokers in this sample did not have a current cancer diagnosis, though past-history is unknown). For example, we found significant associations between abstinence and the variant rs1288980 mapping to the gene FOXP1 containing regions reported to act as a tumor suppressor125,126. The variant rs7349 on chromosome 10 maps to the gene ZEB1 which has been associated with invasiveness, metastasis and poor prognosis of lung cancer83,84,85. While previous studies have found associations between lung cancer127,128,129,130 and COPD131,132 and the CHRNA5 SNP rs16969968, noted for its relationship to nicotine dependence14, our findings suggest that additional markers associated with poor cessation outcome may also be related to lung cancer pathophysiology.
These findings were extended by our IPA network analysis of molecules related to the genes identified above in our association analyses. The results further highlight the connection between genetic predictors of smoking cessation and cancer, most likely attributable to tobacco smoke exposure. Relations with several tumor suppressors were noted, including: TP5391 which is associated with tobacco related mutations133,134 and several cancer types including breast, leukemia, cervical135,136,137 and lung133,138,139; RB1, is related to several cancers including childhood retinoblastoma, osteogenic sarcoma, bladder93 and lung, specifically with regard to smoking behavior140,141; and CDKN2A, which has been associated with a wide variety of cancers,94 including those that are tobacco-related, such as head and neck squamous cell carcinoma, oral and lung cancer142,143,144,145. Numerous interactions were also noted for MYC, a proto-oncogene146 and EGFR. Down regulation of c-Myc is associated with invasion/migration capacity of bronchial epithelial cells exposed to cigarette smoke extract147. EGFR95 mutations act an oncogenic driver of lung cancer in non-smokers and light smokers148,149,150.
Other findings include two variants, rs6749438 and rs6718083 mapped to genes DNAJC27 and EFR3B, on chromosome 2, respectively. While there is no specific information for rs6749438, that region is flanked by ERF3B and ADCY3. Multiple studies have linked SNPs in this area to regulation of body weight, obesity and BMI151,152. Interactions between body weight and smoking have been reported for rs16969968-rs1051730 in the CHRNA5-A3-B4 cluster which are associated with reduced bodyweight in smokers but increased body weight in nonsmokers153. Of the remaining variants, no relevant information is available for rs11064432 which maps to the intron of the gene USP or rs1333758 mapping to NALCN.
Finally, we did not find associations with rare variants that survived correction for multiple comparisons, although the strongest signal was noted for SLC6A2, associated with norepinephrine transport: a finding consistent with our other observations associating smoking cessation with regulation of monoamine neurotransmitters especially serotonergic as noted above.
Conclusions
In this study of over 2000 smokers attempting to quit smoking from a multiple ancestry sample, we found 14 novel markers, not previously identified with smoking cessation. When mapped to specific genes and regions, shared genetic substrates between abstinence from smoking and selected psychiatric and substance use disorders were noted among 6 of these markers, four of which were protective. Strong signals were observed for CYP2B6; HTR3B; PARP15; SCL18A2; and SLC6A2. Loci within the HTR3B gene may be of particular interest as they may be predictive of smoking cessation regardless of the type of pharmacotherapy administered. Our network analysis also showed significant canonical pathways for the serotonin receptor signaling pathway and for nicotine and bupropion metabolism. We also found several markers of smoking cessation among genes previously implicated in the development of cancer. These included FOXP1 and ZEB1; and through our network analysis, TP53; RB1; CDKN2; MYC and EGFR. Two novel markers (rs6749438; rs6718083) on chromosome 2 are flanked by genes associated with regulation of bodyweight. Overall, our results identified several novel genetic markers of smoking cessation, both protective and at-risk, both individually and in combination. Larger studies are needed to identify future targets for smoking cessation pharmacotherapy and personalized treatment based on genetic profiles.
Data availability
The dataset generated and analyzed in this study is available from the corresponding author on a reasonable request.
References
US Department of Health and Human Services. The Health Consequences of Smoking—50 Years of Progress: A Report of the Surgeon General. Atlanta, GA: Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Office on Smoking and Health (2014).
Ambrose, J. A. & Barua, R. S. The pathophysiology of cigarette smoking and cardiovascular disease. An update. J. Am. Coll. Cardiol. 43(10), 1731–1737. https://doi.org/10.1016/j.jacc.2003.12.047 (2004).
Yanbaeva, D. G., Dentener, M. A., Creutzberg, E. C., Wesseling, G. & Wouters, E. F. M. Systemic effects of smoking. Chest. 131(5), 1557–1566. https://doi.org/10.1378/chest.06-2179 (2007).
World Health Organization. Global report on trends in prevalence of tobacco use 2000–2025. Accessed February 24 (2022).
US Department of Health and Human Services. Smoking Cessation: A Report of the Surgeon General. Washington (DC) (2020).
Kendler, K. S., Schmitt, E., Aggen, S. H. & Prescott, C. A. Genetic and environmental influences on alcohol, caffeine, cannabis, and nicotine use from early adolescence to middle adulthood. Arch. Gen. Psychiatry. 65(6), 674–682. https://doi.org/10.1001/archpsyc.65.6.674 (2008).
Kim, Y.-K. Handbook of behavior genetics 1st edn. (Springer, 2010).
Zeiger, J. S. et al. The neuronal nicotinic receptor subunit genes (CHRNA6 and CHRNB3) are associated with subjective responses to tobacco. Hum. Mol. Gen. 17(5), 724–734 (2008).
Haberstick, B. C. et al. Common and drug-specific genetic influences on subjective effects to alcohol, tobacco and marijuana use. Addiction. 106(1), 215–224. https://doi.org/10.1111/j.1360-0443.2010.03129.x (2011).
Liu, M. et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat. Genet. 51(2), 237–244. https://doi.org/10.1038/s41588-018-0307-5 (2019).
Xian, H. et al. The heritability of failed smoking cessation and nicotine withdrawal in twins who smoked and attempted to quit. Nicotine Tob. Res. 5, 245–254 (2003).
Evans, L. M. et al. Genetic architecture of four smoking behaviors using partitioned SNP heritability. Addiction. 116(9), 2498–2508. https://doi.org/10.1111/add.15450 (2021).
Domingue, B. W., Conley, D., Fletcher, J. & Boardman, J. D. Cohort effects in the genetic influence on smoking. Behav. Genet. 46(1), 31–42. https://doi.org/10.1007/s10519-015-9731-9 (2016).
Bierut, L. J. et al. Variants in nicotinic receptors and risk for nicotine dependence. Am. J. Psychiatry. 165(9), 1163–1171. https://doi.org/10.1176/appi.ajp.2008.07111711 (2008).
Lee, S.-H., Ahn, W.-Y., Seweryn, M. & Sadee, W. Combined genetic influence of the nicotinic receptor gene cluster CHRNA5/A3/B4 on nicotine dependence. BMC Genom. 19(1), 42185. https://doi.org/10.1186/s12864-018-5219-3 (2018).
Xu, K. et al. Genome-wide association study of smoking trajectory and meta-analysis of smoking status in 842,000 individuals. Nat. Commun. 11(1), 5302. https://doi.org/10.1038/s41467-020-18489-3 (2020).
Weiss, R. B. et al. A candidate gene approach identifies the CHRNA5-A3-B4 region as a risk factor for age-dependent nicotine addiction. PLoS Genet. 4(7), e1000125–e1000125. https://doi.org/10.1371/journal.pgen.1000125 (2008).
Pérez-Morales, R. et al. CHRNA3 rs1051730 and CHRNA5 rs16969968 polymorphisms are associated with heavy smoking, lung cancer, and chronic obstructive pulmonary disease in a mexican population. Ann. Hum. Genet. 82(6), 415–424. https://doi.org/10.1111/ahg.12264 (2018).
Agrawal, A. & Bierut, L. J. Identifying genetic variation for alcohol dependence. Alcohol Res. 34(3), 274–281 (2012).
Saccone, N. L. et al. Multiple independent loci at chromosome 15q25.1 affect smoking quantity: A meta-analysis and comparison with lung cancer and COPD. PLoS Genet. 6(8), 1–16. https://doi.org/10.1371/journal.pgen.1001053 (2010).
Stevens, V. L. et al. Nicotinic receptor gene variants influence susceptibility to heavy smoking. Cancer Epidemiol. Biomarkers Prev. 17(12), 3517–3525. https://doi.org/10.1158/1055-9965.EPI-08-0585 (2008).
Hancock, D. B. et al. Genome-wide association study across European and African American ancestries identifies a SNP in DNMT3B contributing to nicotine dependence. Mol. Psychiatry. https://doi.org/10.1038/mp.2017.193 (2017).
Tobacco and Genetics Consortium. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat. Genet. 42(5), 441–447. https://doi.org/10.1038/ng.571 (2010).
Erzurumluoglu, A. M. et al. Meta-analysis of up to 622,409 individuals identifies 40 novel smoking behaviour associated genetic loci. Mol. Psychiatry. 25(10), 2392–2409. https://doi.org/10.1038/s41380-018-0313-0 (2020).
Loukola, A. et al. Genome-wide association study on detailed profiles of smoking behavior and nicotine dependence in a twin sample. Mol. Psychiatry. 19(5), 615–624. https://doi.org/10.1038/mp.2013.72 (2014).
Hancock, D. B. et al. A multiancestry study identifies novel genetic associations with CHRNA5 methylation in human brain and risk of nicotine dependence. Hum. Mol. Genet. 24(20), 5940–5954. https://doi.org/10.1093/hmg/ddv303 (2015).
Saccone, N. L. et al. Genome-wide association study of heavy smoking and daily/nondaily smoking in the Hispanic community health study/Study of Latinos (HCHS/SOL). Nicotine Tob. Res. 20(4), 448–457. https://doi.org/10.1093/ntr/ntx107 (2018).
Wessel, J. et al. Resequencing of nicotinic acetylcholine receptor genes and association of common and rare variants with the Fagerström test for nicotine dependence. Neuropsychopharmacol. 35(12), 2392–2402. https://doi.org/10.1038/npp.2010.120 (2010).
Quach, B. C. et al. Expanding the genetic architecture of nicotine dependence and its shared genetics with multiple traits. Nat. Commun. 11(1), 5562. https://doi.org/10.1038/s41467-020-19265-z (2020).
Stevens, V. L. et al. Evaluation of a novel difficulty of smoking cessation phenotype based on number of quit attempts. Nicotine Tob. Res. 19(4), 435–441. https://doi.org/10.1093/ntr/ntw234 (2017).
Matoba, N. et al. GWAS of 165,084 Japanese individuals identified nine loci associated with dietary habits. Nat. Hum. Behav. 4(3), 308–316. https://doi.org/10.1038/s41562-019-0557-y (2020).
Taylor, A. E. et al. Investigating the possible causal association of smoking with depression and anxiety using Mendelian randomisation meta-analysis. The CARTA consortium. BMJ Open. 4(10), e006141. https://doi.org/10.1136/bmjopen-2014-006141 (2014).
Rajagopal, V. M. et al. Rare coding variants in CHRNB2 reduce the likelihood of smoking. medRxiv https://doi.org/10.1101/2022.10.27.22281470 (2022).
Buchwald, J. et al. Genome-wide association meta-analysis of nicotine metabolism and cigarette consumption measures in smokers of European descent. Mol. Psychiatry. 26(6), 2212–2223. https://doi.org/10.1109/TVCG.2011.56 (2021).
Chenoweth, M. J. et al. Genome-wide association study of a nicotine metabolism biomarker in African American smokers. Impact of chromosome 19 genetic influences. Addiction. 113(3), 509–523. https://doi.org/10.1111/add.14032 (2018).
Hancock, D. B., Markunas, C. A., Bierut, L. J. & Johnson, E. O. Human genetics of addiction. New insights and future directions. Curr. Psychiatry Rep. 20(2), 8. https://doi.org/10.1007/s11920-018-0873-3 (2018).
Babb, S., Malarcher, A., Schauer, G., Asman, K. & Jamal, A. Quitting smoking among adults—United States, 2000-2015. MMWR Morb. Mortal. Wkly. Rep. 65(52), 1457–1464 (2017).
Creamer, M. R. et al. Tobacco product use and cessation indicators among adults—United States, 2018. MMWR Morb. Mortal. Wkly. Rep. 68(45), 1013–1019 (2019).
Chen, L.-S. et al. Leveraging genomic data in smoking cessation trials in the era of precision medicine. Why and How. Nicotine Tob. Res. 20(4), 414–424. https://doi.org/10.1093/ntr/ntx097 (2018).
Chen, L. S. et al. Interplay of genetic risk factors (CHRNA5-CHRNA3-CHRNB4) and cessation treatments in smoking cessation success. Am. J. Psychiatry. 22(7), 551–554 (2012).
Bergen, A. W. et al. Nicotinic acetylcholine receptor variation and response to smoking cessation therapies. Pharmacogenet. Genom. 23(2), 94–103 (2013).
Leung, T. et al. Effect of the rs1051730–rs16969968 variant and smoking cessation treatment. A meta-analysis. Pharmacogenomics. 16(7), 713–720. https://doi.org/10.2217/pgs.15.34 (2015).
Tyndale, R. F. et al. Lack of associations of CHRNA5-A3-B4 genetic variants with smoking cessation treatment outcomes in caucasian smokers despite associations with baseline smoking. PloS one. 10(5), e0128109. https://doi.org/10.1371/journal.pone.0128109 (2015).
Conti, D. V. et al. Nicotinic acetylcholine receptor beta2 subunit gene implicated in a systems-based candidate gene study of smoking cessation. Hum. Mol. Genet. 17(18), 2834–2848 (2008).
King, D. P. et al. Smoking cessation pharmacogenetics: analysis of varenicline and bupropion in placebo-controlled clinical trials. Neuropsychopharmacol. 37(3), 641–650 (2012).
Chen, L.-S. et al. Genetic variant in CHRNA5 and response to varenicline and combination nicotine replacement in a randomized placebo-controlled trial. Clin. Pharmacol. Ther. 108(6), 1315–1325. https://doi.org/10.1002/cpt.1971 (2020).
Lerman, C. et al. Use of the nicotine metabolite ratio as a genetically informed biomarker of response to nicotine patch or varenicline for smoking cessation: A randomised, double-blind placebo-controlled trial. Lancet Respir. Med. 3(2), 131–138. https://doi.org/10.1016/S2213-2600(14)70294-2 (2015).
Piper, M. E. et al. Defining and measuring abstinence in clinical trials of smoking cessation interventions. An updated review. Nicotine Tob. Res. 22(7), 1098–1106. https://doi.org/10.1093/ntr/ntz110 (2020).
Heppner, W. L. et al. Dispositional mindfulness predicts enhanced smoking cessation and smoking lapse recovery. Ann. Behav. Med. 50(3), 337–347. https://doi.org/10.1007/s12160-015-9759-3 (2016).
Cambron, C. et al. Momentary precipitants connecting stress and smoking lapse during a quit attempt. Health. Psychol. 38(12), 1049–1058. https://doi.org/10.1037/hea0000797 (2019).
Cinciripini, P. M. et al. The effects of scheduled smoking reduction and Precessation nicotine replacement therapy on smoking cessation. Randomized controlled trial with compliance. JMIR Form. Res. 7(1), e39487. https://doi.org/10.2196/39487 (2023).
Spears, C. A. et al. Mechanisms underlying mindfulness-based addiction treatment versus cognitive behavioral therapy and usual care for smoking cessation. J. Consult. Clin. Psychol. 85(11), 1029–1040. https://doi.org/10.1037/ccp0000229 (2017).
Haslam, A. K. et al. Anhedonia and smoking cessation among Spanish-speaking Mexican-Americans. Health. Psychol. 37(9), 814–819. https://doi.org/10.1037/hea0000633 (2018).
Cinciripini, P. M. et al. Effects of varenicline and bupropion sustained-release use plus intensive smoking cessation counseling on prolonged abstinence from smoking and on depression, negative affect, and other symptoms of nicotine withdrawal. JAMA Psychiatry. 70(5), 522–533. https://doi.org/10.1001/jamapsychiatry.2013.678 (2013).
Kendzor, D. E. et al. The influence of discrimination on smoking cessation among Latinos. Drug Alcohol. Depend. 136, 143–148. https://doi.org/10.1016/j.drugalcdep.2014.01.003 (2014).
Cinciripini, P. M. et al. An RCT with the combination of varenicline and bupropion for smoking cessation. Clinical implications for front line use. Addiction. 113(9), 1673–1682. https://doi.org/10.1111/add.14250 (2018).
Fiore, M. C., Jaen, C. R., Baker, T. B., Bailey, W. C., Benowitz, N. L., Curry, S. J., Wewers, M. E. Treating tobacco use and dependence: Treating Tobacco Use and Dependence: 2008 Update, Clinical Practice Guideline. [Rockville, Md.]: U.S. Dept. of Health and Human Services, Public Health Service (2008).
Cropsey, K. L. et al. How low should you go? Determining the optimal cutoff for exhaled carbon monoxide to confirm smoking abstinence when using cotinine as reference. Nicotine Tob. Res. 16(10), 1348–1355. https://doi.org/10.1093/ntr/ntu085 (2014).
Raiff, B. R., Faix, C., Turturici, M. & Dallery, J. Breath carbon monoxide output is affected by speed of emptying the lungs. Implications for laboratory and smoking cessation research. Nicotine Tob. Res. 12(8), 834–838. https://doi.org/10.1093/ntr/ntq090 (2010).
Perkins, K. A., Karelitz, J. L. & Jao, N. C. Optimal carbon monoxide criteria to confirm 24-hr smoking abstinence. Nicotine Tob. Res. 15(5), 978–982 (2013).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 26(5), 589–595. https://doi.org/10.1093/bioinformatics/btp698 (2010).
Challis, D. et al. An integrative variant analysis suite for whole exome next-generation sequencing data. BMC Bioinf. 13(1), 1–12 (2012).
Baylor College of Medicine. Human Genome Sequencing Center. https://www.hgsc.bcm.edu/software/cassandra.
Illumina. GenomeStudio. San Diego, CA: Illumina Inc.
Purcell, S. et al. PLINK. A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81(3), 559–575. https://doi.org/10.1086/519795 (2007).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics. 155(2), 945–959 (2000).
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19(9), 1655–1664. https://doi.org/10.1006/tpbi.2001.1543 (2009).
Alexander DH, Shringarpure SS, Novembre J, Lange K. Admixture 1.3 software manual. Los Angeles: UCLA Human Genetics Software Distribution (2015).
Consortium 1GP. A map of human genome variation from population scale sequencing. Nature. 467(7319), 1061 (2010).
Fairley, S., Lowy-Gallego, E., Perry, E. & Flicek, P. The international genome sample resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res. 48(D1), D941–D947. https://doi.org/10.1093/nar/gkz836 (2020).
Rosenberg, N. A., Li, L. M., Ward, R. & Pritchard, J. K. Informativeness of genetic markers for inference of ancestry. Am. J. Hum. Genet. 73(6), 1402–1422. https://doi.org/10.1086/380416 (2003).
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89(1), 82–93. https://doi.org/10.1016/j.ajhg.2011.05.029 (2011).
Lee, S., Wu, M. C. & Lin, X. Optimal tests for rare variant effects in sequencing association studies. Biostatistics. 13(4), 762–775. https://doi.org/10.1093/biostatistics/kxs014 (2012).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org. Accessed January 26 (2022).
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics. 26(22), 2867–2873. https://doi.org/10.1093/bioinformatics/btq559 (2010).
Wang, J. & Shete, S. Testing departure from Hardy-Weinberg proportions. Methods Mol. Biol. 850, 77–102 (2012).
Ingenuity Pathway Analysis. Ingenuity Pathway Analysis Software. www.ingenuity.com. Updated December 21 (2023).
Muurling, T. & Stankovic, K. M. Metabolomic and network analysis of pharmacotherapies for sensorineural hearing loss. Otol. Neurotol. 35(1), 1–6. https://doi.org/10.1097/MAO.0000000000000254 (2014).
Reyes-Gibby, C. C., Wang, J., Yeung, S.-C.J. & Shete, S. Informative gene network for chemotherapy-induced peripheral neuropathy. BioData Min. 8(1), 1–23 (2015).
Ingenuity Systems. Ingenuity Pathways Analysis (IPA) of Large Datasets. http://www.usc.edu/hsc/nml/assets/bioinfo/IPA/Data%20Analysis%20training%20Handouts.pdf. Accessed May (2014).
Reyes-Gibby, C. C. et al. MAPK1/ERK2 as novel target genes for pain in head and neck cancer patients. BMC Genet. 17(1), 1–13 (2016).
Ingenuity Systems. Calculating and interpreting the p‐values for functions, pathways, and lists in IPA (2010).
Liu, L. et al. Notch3 is important for TGF-β-induced epithelial-mesenchymal transition in non-small cell lung cancer bone metastasis by regulating ZEB-1. Cancer Gene Ther. 21(9), 364–372. https://doi.org/10.1038/cgt.2014.39 (2014).
Merikallio, H. et al. Zeb1 and twist are more commonly expressed in metastatic than primary lung tumours and show inverse associations with claudins. J. Clin. Pathol. 64(2), 136–140. https://doi.org/10.1136/jcp.2010.086678 (2011).
Vu, T., Jin, L. & Datta, P. K. Effect of cigarette smoking on epithelial to mesenchymal transition (EMT) in Lung Cancer. J. Clin. Med. https://doi.org/10.3390/jcm5040044 (2016).
Cochet-Bissuel, M., Lory, P. & Monteil, A. The sodium leak channel, NALCN, in health and disease. Front. Cell Neurosci. 8, 132. https://doi.org/10.1007/s11906-013-0388-6 (2014).
Sollis, E. et al. The NHGRI-EBI GWAS Catalog. Knowledgebase and deposition resource. Nucleic Acids Res. 51(D1), D977–D985. https://doi.org/10.1093/nar/gkac1010 (2023).
Sahu, A. et al. The 5-Hydroxytryptamine signaling map. An overview of serotonin-serotonin receptor mediated signaling network. J. Cell Commun. Signal. 12(4), 731–735. https://doi.org/10.1164/rccm.200302-264OC (2018).
Uhl, G. R. et al. Genome-wide association for smoking cessation success in a trial of precessation nicotine replacement. Mol. Med. 16(11–12), 513–526. https://doi.org/10.2119/molmed.2010.00052 (2010).
M’e, O. et al. The genetics of smoking in individuals with chronic obstructive pulmonary disease. Respir. Res. 19(1), 59. https://doi.org/10.1038/mp.2016.48 (2018).
Kern, S. E. et al. Identification of p53 as a sequence-specific DNA-binding protein. Science. 252(5013), 1708–1711. https://doi.org/10.1126/science.2047879 (1991).
Machiela MJ, Chanock SJ. LDlink. A web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics. 31(21), 3555–3557 (2015). https://doi.org/10.1093/bioinformatics/btv402.
Nennig, S. E. & Schank, J. The role of NFkB in drug addiction. Beyond inflammation. Alcohol Alcohol. 52(2), 172–179 (2017).
Freathy, R. M. et al. Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight. Nat. Genet. 42(5), 430–435. https://doi.org/10.1038/ng.567 (2010).
Roman, T. S. et al. A type 2 diabetes-associated functional regulatory variant in a pancreatic islet enhancer at the ADCY5 locus. Diabetes. 66(9), 2521–2530. https://doi.org/10.2337/db17-0464 (2017).
Kim, S.-Y. et al. Association between genetic variants of the norepinephrine transporter gene (SLC6A2) and bipolar I disorder. Prog. Neuropsychopharmacol. Biol. Psychiatry. 107, 110227. https://doi.org/10.1016/j.pnpbp.2020.110227 (2021).
Angyal, N. et al. Association analysis of norepinephrine transporter polymorphisms and methylphenidate response in ADHD patients. Prog. Neuropsychopharmacol. Biol. Psychiatry. 84(Pt A), 122–128. https://doi.org/10.1016/j.pnpbp.2018.01.013 (2018).
Kim, Y.-K. et al. Association between norepinephrine transporter gene (SLC6A2) polymorphisms and suicide in patients with major depressive disorder. J. Affect. Disord. 158, 127–132. https://doi.org/10.1016/j.jad.2014.01.018 (2014).
Zhu, A. Z. X. et al. CYP2B6 and bupropion’s smoking-cessation pharmacology. The role of hydroxybupropion. Clin. Pharmacol. Ther. 92(6), 771–777. https://doi.org/10.1038/clpt.2012.186 (2012).
Lerman, C. & Berrettini, W. Elucidating the role of genetic factors in smoking behavior and nicotine dependence. Am. J. Med. Genet. B Neuropsychiatr. Genet. 118B(1), 48–54. https://doi.org/10.1002/ajmg.b.10003 (2003).
Bloom, J. et al. The contribution of common CYP2A6 alleles to variation in nicotine metabolism among European-Americans. Pharmacogenet. Genomics. 21(7), 403–416 (2011).
Bergen, A. W. et al. Multiethnic prediction of nicotine biomarkers and association with nicotine dependence. Nicotine & Tobacco Res. 23(12), 2162–2169. https://doi.org/10.1093/ntr/ntab124 (2021).
Costa, R., Oliveira, N. G. & Dinis-Oliveira, R. J. Pharmacokinetic and pharmacodynamic of bupropion. Integrative overview of relevant clinical and forensic aspects. Drug Metab. Rev. 51(3), 293–313. https://doi.org/10.1080/03602532.2019.1620763 (2019).
Yang, Z. et al. Serotonin transporter and receptor genes significantly impact nicotine dependence through genetic interactions in both European American and African American smokers. Drug Alcohol. Depend. 129(3), 217–225. https://doi.org/10.1016/j.drugalcdep.2012.12.007 (2013).
Kim, H. W. et al. Common variants of HTR3 genes are associated with obsessive-compulsive disorder and its phenotypic expression. Sci. Rep. 6, 32564. https://doi.org/10.1038/srep32564 (2016).
Krzywkowski, K., Davies, P. A., Feinberg-Zadek, P. L., Bräuner-Osborne, H. & Jensen, A. A. High-frequency HTR3B variant associated with major depression dramatically augments the signaling of the human 5-HT3AB receptor. P. Natl. Acad. Sci. USA 105(2), 722–727. https://doi.org/10.1073/pnas.0708454105 (2008).
Lipari, R.N., van Horn, S. The CBHSQ Report: Smoking and Mental Illness among Adults in the United States. Rockville (MD) (2013).
Yuan, S., Yao, H. & Larsson, S. C. Associations of cigarette smoking with psychiatric disorders. Evidence from a two-sample Mendelian randomization study. Sci. Rep. 10(1), 1–9. https://doi.org/10.1038/s41598-020-70458-4 (2020).
Barkhuizen, W., Dudbridge, F. & Ronald, A. Genetic overlap and causal associations between smoking behaviours and mental health. Sci. Rep. 11(1), 14871. https://doi.org/10.1038/s41598-021-93962-7 (2021).
Ripke, S. et al. A mega-analysis of genome-wide association studies for major depressive disorder. Mol. Psychiatry. 18(4), 497–511. https://doi.org/10.1038/mp.2012.21 (2013).
Mills, E. J. et al. Comparisons of high-dose and combination nicotine replacement therapy, varenicline, and bupropion for smoking cessation: a systematic review and multiple treatment meta-analysis. Ann. Med. 44(6), 588–597. https://doi.org/10.3109/07853890.2012.705016 (2012).
Jimenez-Ruiz, C., Berlin, I. & Hering, T. Varenicline: a novel pharmacotherapy for smoking cessation. Drugs. 69(10), 1319–1338 (2009).
Lin, Z. et al. SLC18A2 promoter haplotypes and identification of a novel protective factor against alcoholism. Hum. Mol. Genet. 14(10), 1393–1404. https://doi.org/10.1093/hmg/ddi148 (2005).
Randesi, M. et al. VMAT2 gene (SLC18A2) variants associated with a greater risk for developing opioid dependence. Pharmacogenomics. 20(5), 331–341. https://doi.org/10.2217/pgs-2018-0137 (2019).
Schwab, S. G. et al. Association of DNA polymorphisms in the synaptic vesicular amine transporter gene (SLC18A2) with alcohol and nicotine dependence. Neuropsychopharmacol. 30(12), 2263–2268. https://doi.org/10.1038/sj.npp.1300809 (2005).
Sullivan, P. F. et al. Candidate genes for nicotine dependence via linkage, epistasis, and bioinformatics. Am. J. Med. Genet. B Neuropsychiatr. Genet. 126B(1), 23–36. https://doi.org/10.1002/ajmg.b.20138 (2004).
Solovieff, N. et al. Genetic association analysis of 300 genes identifies a risk haplotype in SLC18A2 for post-traumatic stress disorder in two independent samples. Neuropsychopharmacol. 39(8), 1872–1879. https://doi.org/10.1038/npp.2014.34 (2014).
McClernon, F. J. & Kollins, S. H. ADHD and smoking From genes to brain to behavior. Ann. NY Acad. Sci. 1141, 131–147. https://doi.org/10.1196/annals.1441.016 (2008).
Mace, S. & Taylor, D. Selective serotonin reuptake inhibitors. A review of efficacy and tolerability in depression. Exp. Opin. Pharmacother. 1(5), 917–933. https://doi.org/10.1517/14656566.1.5.917 (2000).
David, S. P. et al. Pharmacogenetic clinical trial of sustained-release bupropion for smoking cessation. Nicotine Tob. Res. 9(8), 821–833 (2007).
Cinciripini, P. M. et al. Combined effects of venlafaxine, nicotine replacement & brief counseling on smoking cessation. Exp. Clin. Psychopharmacol. 13(4), 282–292. https://doi.org/10.1037/1064-1297.13.4.282 (2005).
Kaminskiy, Y., Kuznetsova, V., Kudriaeva, A., Zmievskaya, E. & Bulatov, E. Neglected, yet significant role of FOXP1 in T-cell quiescence, differentiation and exhaustion. Front. 13, 971045. https://doi.org/10.3389/fimmu.2022.971045 (2022).
National Library of Medicine National Center for Biotechnology Information. FOXP1 Forkhead Box P1. https://www.ncbi.nlm.nih.gov/gene/27086. Updated October 2022. Accessed November 18 (2022).
Amos, C. I. et al. Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat. Genet. 40(5), 616–622 (2008).
Thorgeirsson, T. E. et al. A variant associated with nicotine dependence, lung cancer and peripheral arterial disease. Nature. 452(7187), 638–642 (2008).
Lips, E. H. et al. Association between a 15q25 gene variant, smoking quantity and tobacco-related cancers among 17 000 individuals. Int. J. Epidemiol. 39(2), 563–577 (2010).
Truong, T. et al. Replication of lung cancer susceptibility loci at chromosomes 15q25, 5p15, and 6p21: A pooled analysis from the International Lung Cancer Consortium. J. Natl. Cancer Inst. 102(13), 959–971 (2010).
Pillai, S. G. et al. A genome-wide association study in chronic obstructive pulmonary disease (COPD) identification of two major susceptibility loci. PLoS Genet. 5(3), e1000421. https://doi.org/10.1371/journal.pgen.1000421 (2009).
Gabrielsen, M. E., Romundstad, P., Langhammer, A., Krokan, H. E. & Skorpen, F. Association between a 15q25 gene variant nicotine-related habits lung cancer and COPD among 56 307 individuals from the HUNT study in Norway. Eur. J. Hum. Genet. 21(11), 1293. https://doi.org/10.1038/ejhg.2013.26 (2013).
Gibbons, D. L., Byers, L. A. & Kurie, J. M. Smoking, p53 mutation, and lung cancer. Mol. Cancer Res. 12(1), 3–13 (2014).
Halvorsen, A. R. et al. TP53 mutation spectrum in smokers and never smoking lung cancer patients. Front. Genet. 7, 85. https://doi.org/10.3389/fgene.2016.00085 (2016).
Diakite, B. et al. p.Arg72Pro polymorphism of P53 and breast cancer risk. A meta-analysis of case-control studies. BMC Med. Genet. 21(1), 206. https://doi.org/10.1186/s12881-020-01133-8 (2020).
Bories, P. et al. Impact of TP53 mutations in acute myeloid leukemia patients treated with azacitidine. PLoS ONE. 15(10), e0238795 (2020).
Khan, M. A. et al. Exploring the p53 connection of cervical cancer pathogenesis involving north-east Indian patients. PLoS ONE. 15(9), e0238500 (2020).
Piao, J.-M. et al. p53 codon 72 polymorphism and the risk of lung cancer in a Korean population. Lung Cancer. 73(3), 264–267 (2011).
Toyooka, S., Tsuda, T. & Gazdar, A. F. The TP53 gene, tobacco exposure, and lung cancer. Hum. Mutat. 21(3), 229–239 (2003).
Cardona, A. F. et al. Multigene mutation profiling and clinical characteristics of small-cell lung cancer in never-smokers vs. heavy smokers (Geno1.3-CLICaP). Front Oncol. 9, 254 (2019).
Du, X., Qi, F., Lu, S., Li, Y. & Han, W. Nicotine upregulates FGFR3 and RB1 expression and promotes non-small cell lung cancer cell proliferation and epithelial-to-mesenchymal transition via downregulation of miR-99b and miR-192. Biomed. Pharmacotherapy. 101, 656–662 (2018).
El-Naggar, A. K. et al. Methylation, a major mechanism of p16/CDKN2 gene inactivation in head and neck squamous carcinoma Am. J. Pathol. 151(6), 1767 (1997).
Asokan, G. S., Jeelani, S. & Gnanasundaram, N. Promoter hypermethylation profile of tumour suppressor genes in oral leukoplakia and oral squamous cell carcinoma. J. Clin. Diagn. Res.: JCDR. 8(10), ZC09 (2014).
Marchetti, A. et al. Alterations of P16 (MTS1) in node-positive non-small cell lung carcinomas. J. Pathol.: J. Pathol. Soc. G.B. Irel. 181(2), 178–182 (1997).
Huang, T. et al. Meta-analyses of gene methylation and smoking behavior in non-small cell lung cancer patients. Sci. Rep. 5(1), 1–8 (2015).
Lu, L. et al. Feedback circuitry via let-7c between lncRNA CCAT1 and c-Myc is involved in cigarette smoke extract-induced malignant transformation of HBE cells. Oncotarget. 8(12), 19285 (2017).
Takeda, K. et al. Frequency of Epidermal Growth Factor Receptor mutation in smokers with lung cancer without pulmonary emphysema. Anticancer Res. 37(2), 765–771 (2017).
Ren, J.-H. et al. EGFR mutations in non-small-cell lung cancer among smokers and non-smokers A meta-analysis. Environ. Mol. Mutagenesis. 53(1), 78–82 (2012).
Li, H. et al. Frequency of well-identified oncogenic driver mutations in lung adenocarcinoma of smokers varies with histological subtypes and graduated smoking dose. Lung Cancer. 79(1), 8–13 (2013).
Gene Cards. DNAJC27 Gene - DnaJ Heat Shock Protein Family (Hsp40) Member C27. https://www.genecards.org/cgi-bin/carddisp.pl?gene=DNAJC27#function-gwas. Updated November 09, 2022. Accessed November 20 (2022).
Gene Cards. EFR3B Gene—EFR3 Homolog B. https://www.genecards.org/cgi-bin/carddisp.pl?gene=EFR3B#diseases. Updated November 09, 2022. Accessed November 20 (2022).
Taylor, A. E. et al. Stratification by smoking status reveals an association of CHRNA5-A3-B4 genotype with body mass index in never smokers. PLoS Genet. 10(12), e1004799. https://doi.org/10.1371/journal.pgen.1004799 (2014).
Acknowledgements
The DNA and behavioral data collection was supported by 7 NIH and one CPRIT grant to Dr. Paul Cinciripini including CASSI (R01DA011822); QUITRX (R01DA017073; NCT00507728); Two2QUIT (R01DA24709; NCT00943618); and to Dr. David Wetter including: BREAKFREE (R01CA094826; NCT00310141); CARE (R01DA014818); MIND (R01DA018875; NCT00297470); PNS (P60MD000503; NCT00507728); and STEPS (CPRIT RP110025; NCT011310842). The genotyping and analytic work for this manuscript was supported by the Cancer Prevention Institute of Texas grant RP170259 to Drs. Cinciripini and Shete and by the generous philanthropic contributions to The University of Texas MD Anderson Cancer Center Lung Cancer Moonshot Program. Drs. Shete and Cinciripini are partially supported by the Betty B. Marcus Chair in Cancer Prevention (SS), and the Margaret & Ben Love Chair in Clinical Cancer Care (PC) and the National Cancer Support Grant P30CA016672 to MD Anderson. The authors wish to acknowledge the contributions of numerous individuals in the Human Genome Sequencing Center (HGSC) at Baylor College of Medicine including Xiang Qin, Richard Gibbs and Donna Muzny.
Author information
Authors and Affiliations
Contributions
All authors contributed to the final paper. S.S. and P.M.C. conceptualized the study, guided the analysis plan and contributed to the preparation and writing of this manuscript. R.Y. performed the statistical analyses. S.S., J.W., R.Y., G.K., Y.C. and P.M.C. contributed to drafting the manuscript. J.D.R., D.W.W., T.R.K., A.W.B., C.E.G. and S.S. reviewed and edited the manuscript and T.R.K. and S.E.S. conducted initial DNA analysis.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cinciripini, P.M., Wetter, D.W., Wang, J. et al. Deep sequencing of candidate genes identified 14 variants associated with smoking abstinence in an ethnically diverse sample. Sci Rep 14, 6385 (2024). https://doi.org/10.1038/s41598-024-56750-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-56750-7
- Springer Nature Limited
Keywords
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.