Introduction

Colorectal cancer (CRC) is the third most common cancer in the United States and is associated with risk factors including physical inactivity dietary patterns, and increased body mass index (BMI) [1]. Although the observed association between increased BMI and CRC risk has been fairly consistent in prospective cohort studies, the association may vary among CRC subtypes and by biological sex. For example, in observational studies, BMI appears to be more strongly associated with CRC risk in men than women [1,2,3]. Recently, distinct subtypes of CRC have been established, and these subtypes are thought to arise from distinct precursor lesions: conventional adenomas and serrated polyps. In one study, BMI showed a much stronger association with serrated polyps than with conventional adenomas [4]. Other studies suggest that BMI is more strongly associated with microsatellite-instable CRC (compared to microsatellite-stable), which is more likely to develop from serrated polyps than conventional adenomas [5].

Evidence for the association between increased BMI and polyp subtypes is limited to observational studies. Mendelian randomization provides insight on disentangling the effects of predisposed obesity versus acquired obesity on CRC risk. We previously reported that genetic risk of obesity, derived from 97 established adult BMI-associated variants, correlated with BMI across all ages [6]. Genetic risk of BMI has been associated with risk of CRC using Mendelian randomization [7]. When stratified by sex, genetic risk of BMI was associated with CRC in women only, in contrast to studies of measured BMI [7]. The association between genetic risk of adulthood obesity and CRC precursor lesions has not yet been assessed and has the potential to improve the understanding of the influence of increased adiposity on heterogeneous CRC pathways.

The objective of this study was to evaluate the relationship between genetically predicted BMI and two distinct CRC precursor lesions (conventional adenomas and serrated polyps). We created a genetic risk score (GRS) using 97 previously identified SNPs that were associated with adult BMI. The association between GRS and risk of conventional adenomas or serrated polyps was evaluated among participants with genetic data in three large cohort studies: Nurses’ Health Study (NHS), Nurses’ Health Study II (NHS2), and Health Professionals Follow-up Study (HPFS). For comparison, we also assessed the associations for measured BMI in the same set of samples. To test for potential sex difference, we also performed the analysis in men and women separately.

Materials and Methods

Study Population

The NHS enrolled 121,700 registered female nurses, aged 30–55 (at enrollment), in 1976; the NHS2 enrolled 116,686 registered female nurses, aged 25–42, in 1989; and the HPFS enrolled 51,529 male health professionals, aged 40–75, in 1986. A subset of participants provided blood specimens: 32,826 NHS participants, between 1989 and 1990; 29,611 NHS2 participants, between 1996 and 1999; and 18,225 HPFS participants, between 1993 and 1995. Blood specimens were returned on ice packs by overnight courier.

The present study was limited to 37,661 participants with genetic information available from previous, nested genome-wide association studies (NHS, 18,498; NHS2, 8,274; HPFS, 10,889). After excluding participants who were of non-European origin; had a history of cancer (except non-melanoma skin cancer), colorectal polyps, or inflammatory bowel disease; or who had no reported endoscopies of the lower GI tract; a total of 27,436 participants (NHS, 13,103; NHS2, 6,494; HPFS, 7,839) were included in the present analysis.

The collection of detailed histological information on colorectal polyps began in 1992 for the NHS and HPFS, and 1991 for the NHS2; these years were used as baseline for the present study. Eligible participants were followed until first colorectal polyp diagnosis, death, or end of follow-up for the present study: June 1, 2012 for NHS, June 1, 2011 for NHS2, and January 1, 2010 for the HPFS.

Computation of GRS

Genotype data were derived from various nested studies within the NHS, NHS2, and HPFS studies. Details of genotyping and imputation of SNPs included in the GRS have been described elsewhere in detail [8]. Weighted GRS was calculated from 97 SNPs and relative effect size (β coefficient) identified as associated with adult BMI in the most recent genome-wide association study (GWAS) [9]. GRS was calculated using established methods, with an assigned value of 0, 1, or 2 for the number of risk alleles and the following equation: GRS=\(\left({\sum }_{i=1}^{97}{\beta }_{i} * {SNP}_{i}\right)*\left(97/{\sum }_{i=1}^{97}{\beta }_{i}\right),\) where βi is the regression coefficient identified in the GWAS. Sex-specific GRS was calculated using the sex-specific βi [9].

Assessment of Exposure Variables

Measured BMI was calculated as the cumulative average of BMI based on self-reported height at baseline and self-reported weight from enrollment to polyp diagnosis or end of follow-up. Weight reported on the NHS questionnaires was validated among 140 NHS participants; self-reported and measured weights were highly correlated (r = 0.97), indicating that self-reported weight measurements are reasonably valid. Lifestyle characteristics were assessed by questionnaire at baseline and biennially via follow-up questionnaires. Food frequency questionnaires were administered every four years to assess dietary risk factors. Missing data for covariates during follow-up questionnaires were carried forward from the most recent available data.

Assessment of Outcomes

Every two years via follow-up questionnaire, participants reported whether they had undergone a colonoscopy or sigmoidoscopy, and whether any colorectal polyps had been diagnosed, during that two-year period. If a polyp diagnosis was reported, we requested permission to obtain the endoscopy and pathology reports. Investigators who were blinded to participants’ exposures and genetic data reviewed the medical records and confirmed polyp diagnoses. Relevant clinical and pathological data were also extracted from the medical records [4].

In the present study, conventional adenomas included tubular, tubulovillous, and villous adenomas and adenomas with high-grade dysplasia. Serrated polyps included hyperplastic polyps and mixed/serrated adenomas. Mixed/serrated adenomas consisted of both mixed polyps (those with both adenomatous and hyperplastic changes in histology) and polyps with any serrated diagnosis (e.g., serrated adenomas, serrated polyps, and sessile serrated adenoma/polyp). Diagnosis of serrated polyp and conventional adenoma at the same endoscopy was considered a synchronous polyp.

Statistical Analyses

All analyses were conducted using three pooled cohorts and repeated separately in women (NHS, NHS2) and men (HPFS) to assess potential differences in the relationship between BMI and CRC risk by biologic sex. We evaluated measured BMI and genetically predicted BMI in relation to risk of polyp type (non-polyps, conventional adenoma only, serrated polyp only, and synchronous polyps). GRS was regressed on measured BMI to derive the change in GRS associated with a 2 kg/m2 change in measured BMI.

Multivariable logistic regression was used to evaluate the risk of conventional, serrated, or synchronous polyps in relation to genetically predicted and measured BMI, compared to non-polyps. We also modeled the risk of conventional adenoma subtypes (non-advanced or advanced) and serrated polyp subtypes (small, < 10 mm; large, ≥ 10 mm) compared to non-polyps. Advanced conventional adenomas were defined as having at least 1 conventional adenoma of 10 mm or greater in diameter or with advanced histology (tubulovillous/villous histologic features or high-grade or severe dysplasia).

Effect modification by biologic sex (i.e., study cohort) was evaluated using Wald test for the product term. Heterogeneity among the polyp subtypes was assessed using multivariate regression in case-only analyses. To account for multiple records per participant and to handle time-varying covariates efficiently, we used an Andersen-Gill data structure with a new record for each 2-year follow-up period during which a participant underwent an endoscopy.

In a secondary analysis, the models were repeated with conventional adenomas, serrated polyps, and their subtypes with the inclusion of synchronous polyps in each subtype grouping. We also modeled the risk of polyp location (proximal colon, distal colon, or rectum) for conventional adenomas and serrated polyps. Polyps were classified as proximal if they were removed from the cecum to the transverse colon, distal if they were removed from the splenic flexure to the sigmoid colon, and rectal if they were removed from the rectosigmoid junction to the anal canal (excluding anal squamous cell carcinoma). Groupings by polyp location were not mutually exclusive; for example, one participant could have contributed both a proximal and distal conventional adenoma, or a proximal conventional adenoma and distal serrated polyp. In another secondary analysis, we assessed the polyp associations for each of the 97 individual SNPs included in the GRS. Bonferroni correction for multiple testing (adjusted α = 0.05/97 = 5.2 × 10–4) was applied in this analysis.

Genetically predicted BMI models were adjusted for the following characteristics: age, study cohort (NHS, NHS2, HPFS; in pooled models only), time period of endoscopy (in 2-year intervals), reason for endoscopy (screening or symptoms), number of previous endoscopies, time in years since most recent endoscopy, and top three principal components for population structure (in order to account for systematic ancestry differences) [10]. Measured BMI models were adjusted for age, study cohort, time period of endoscopy, reason for endoscopy, number of previous endoscopies, time in years since most recent endoscopy, family history of colorectal cancer (yes or no), height, alcohol intake, regular aspirin use (yes or no), and physical activity. All covariates were treated as continuous variables unless otherwise noted.

Results

Among 27,436 participants with genetic data and outcomes available, there were a total of 2,952 participants with conventional adenomas only, 1,589 with serrated polyps only, and 790 with synchronous polyps. Genetically predicted BMI, measured BMI, and endoscopy-related characteristics are shown in Table 1. Participants with serrated polyps were more likely to be women. Serrated polyps and synchronous polyps both tracked with higher measured BMI. Participants with no polyps had a higher average number of endoscopies, longer time since most recent previous period endoscopy, and were less likely to report symptoms as the reason for endoscopy.

Table 1 BMI and endoscopy characteristics of pooled study participants from three cohort studies (NHS, NHS2, HPFS) by polyp diagnosis

The median measured BMI was 1.7 kg/m2 greater at the highest vs. lowest quintile of BMI genetic risk score (26.3 kg/m2 compared to 24.6 kg/m2), with a wide interval of measured BMI at each quintile (Fig. 1). An 18-unit increase in GRS was correlated with 2 kg/m2 increase in measured BMI (R-squared = 0.02).

Fig. 1
figure 1

Distribution of measured BMI according to quintiles of genetic risk score in three cohorts (NHS, NHS2, HPFS). Median value of measured BMI for each quintile is shown. BMI: body mass index, NHS: Nurses’ Health Study, NHS2: Nurses’ Health Study II, HPFS: Health Professionals Follow-up Study

Measured BMI was significantly associated with risk of conventional adenomas (OR per 2 kg/m2 increase = 1.03, 95% CI: 1.01–1.04), serrated polyps (OR = 1.06, 95% CI: 1.04–1.08), and synchronous polyps (OR = 1.10, 95% CI: 1.07–1.13) (Fig. 2, Online Resource 1). Among conventional adenoma subtypes, measured BMI was associated with increased risk of non-advanced adenomas (OR = 1.03, 95% CI: 1.01–1.05) and advanced adenomas (OR = 1.02, 95% CI: 0.99–1.04). Among serrated polyp subtypes, measured BMI was associated with increased risk of small polyps (OR = 1.06, 95% CI: 1.04–1.08) and large polyps (OR = 1.03, 95% CI: 0.96–1.11), although the large serrated polyps group had a smaller sample size and confidence interval included unity (Online Resource 1).

Fig. 2
figure 2

Association between genetically predicted a,b or measured BMI c and risk of colorectal polyp subtypes in three cohort studies (NHS, NHS2, HPFS). BMI: body mass index, NHS: Nurses’ Health Study, NHS2: Nurses’ Health Study II, HPFS: Health Professionals Follow-up Study. a An 18-unit change in genetic risk score is equivalent to a 2 kg/m2 change in measured BMI, per regression analysis. b Genetically predicted BMI multivariable logistic regression model adjusted for age, study cohort (NHS, NHS2, HPFS), time period of endoscopy (in 2-year intervals), reason for endoscopy (screening or symptoms), number of previous endoscopies, time in years since most recent endoscopy, and top three principal components for population structure. All covariates were treated as continuous variables unless otherwise noted. c Measured BMI multivariable logistic regression model adjusted for age, study cohort (NHS, NHS2, HPFS), time period of endoscopy (in 2-year intervals), reason for endoscopy (screening or symptoms), number of previous endoscopies, time in years since most recent endoscopy, family history of colorectal cancer (yes or no), height, alcohol intake, regular aspirin use (yes or no), and physical activity. All covariates were treated as continuous variables unless otherwise noted

The association between genetically predicted BMI and polyp risk in the pooled cohorts was the same as measured BMI for serrated polyps (OR per 18 unit [equivalent to 2 kg/m2] increase = 1.04, 95% CI: 0.90, 1.20) and synchronous polyps (OR = 1.09, 95% CI: 0.89, 1.34), although confidence intervals were wide and included unity (Fig. 2 and Online Resource 1). Test for heterogeneity showed that measured BMI was associated with a significantly higher risk of serrated polyps compared to conventional adenomas and significantly higher risk of synchronous polyps compared to conventional adenomas or serrated polyps alone. All other tests for heterogeneity were null. Including synchronous polyps in the conventional adenoma and serrated polyp groups did not change the association with genetically predicted or measured BMI (Online Resource 2).

Table 2 shows the association between genetically predicted BMI or measured BMI and risk of colorectal polyp subtypes by polyp location, in the three pooled cohorts (NHS, NHS2, HPFS). Measured BMI was associated with increased risk of conventional adenomas and serrated polyps at all locations. Of the subtypes and locations, measured BMI was associated with serrated polyps of the proximal colon (OR = 1.07, CI: 1.04–1.10), distal colon (OR per 2 kg/m2 increase = 1.07, 95% CI: 1.05–1.10), and rectum (OR = 1.08, 95% CI: 1.05–1.11). Genetically predicted BMI was also associated with increased risk of serrated polyps of the distal colon (OR per 18 unit [equivalent to 2 kg/m2] increase = 1.07, CI: 0.90–1.28) and rectum (OR = 1.10, 95% CI: 0.91–1.34), although the confidence intervals were wide and included unity. All tests for heterogeneity were null.

Table 2 Association between genetically predicted or measured BMI and risk of colorectal polyps, by location, in three cohort studies (NHS, NHS2, HPFS)

No differences were found between men and women, with the exception of genetically predicted BMI and risk of synchronous polyps (P value for interaction = 0.04). Both genetically predicted BMI and measured BMI were associated with increased risk of synchronous polyps in women (genetically predicted BMI, OR per 18 unit [equivalent to 2 kg/m2] increase = 1.37, 95% CI: 1.05–1.79; measured BMI, OR = 1.11, 95% CI: 1.07–1.14). Measured BMI, but not genetically predicted BMI, was associated with increased risk of synchronous polyps in men (genetically predicted BMI, OR = 0.88, 95% CI: 0.65–1.20; measured BMI, OR = 1.09, 95% CI: 1.03–1.16).

Tests for association between individual SNPs and risk of polyp subtypes at the adjusted α level revealed two SNPs that were inversely associated with synchronous polyp: rs11727676 (OR per 1-allele = 0.72, 95% CI: 0.61–0.84) and rs10132280 (OR = 0.83, 95% CI: 0.74–0.92). No individual SNPs were significantly associated with risk of conventional adenomas or serrated polyps. There were no significant associations between individual SNPs and polyps in men or women separately.

Discussion

In three cohorts of men and women, genetically predicted BMI was correlated with measured adulthood BMI. Measured BMI showed a positive association with risk of conventional adenoma, serrated polyp, and synchronous polyps. These associations tended to be stronger for serrated polyp and synchronous polyps compared to conventional adenoma. Genetically predicted BMI showed similar positive associations with risk of serrated and synchronous polyps; however, confidence intervals for genetically predicted BMI were wide and included unity. When stratified by sex, genetically predicted BMI was associated with increased risk of synchronous polyps in women only.

This is the first study to our knowledge to assess risk of CRC polyp subtypes in relation to BMI genetic risk score; thus, it is difficult to put these findings in the context of the literature. Several studies analyzed genetic BMI in relation to CRC diagnosis: all three found that genetic BMI was associated with increased risk of CRC overall [11, 12] and in women only, when stratified by sex [7]. Our finding that genetically predicted BMI was not significantly associated with CRC precursor lesions could be due to limited power in the present study, given that these other studies investigating CRC risk included 9,254 to 51,537 cases. It is not possible to draw conclusions from the present study regarding whether genetically predicted BMI is more strongly associated with CRC than CRC precursor lesions.

Our finding that measured BMI was associated most strongly with risk of serrated polyp and synchronous polyps is consistent with one other large study which analyzed measured BMI in relation to polyp subtypes in the same three cohorts (NHS, NHS2, HPFS), and found that BMI was most strongly associated with synchronous polyps compared to serrated polyp or conventional adenoma [4]. Prior studies have demonstrated an association between BMI and conventional adenoma and serrated polyp independently, but no other studies to our knowledge have compared the association between BMI and various polyp subtypes [13, 14]. The association between increased BMI and serrated polyp risk is reasonable given that serrated polyps tend to be associated with other environmental factors related to inflammation, including smoking and alcohol consumption [4]. Inflammatory cytokines, which are increased in a state of excess adiposity, are hypothesized to contribute to the development of microsatellite-instable colorectal cancer [15, 16]. These microsatellite-instable cancers are more commonly developed in serrated polyps and more commonly found in women; thus, the inflammatory state of obesity may explain the association of increased adiposity with serrated as well as synchronous polyps, especially in women [5].

The significance of the relationship between BMI and synchronous polyps is unclear. Synchronous polyps were not associated with greater risk of progression to CRC, compared to conventional adenoma only, in a previous study, suggesting that synchronous polyp diagnosis is not necessarily predictive of a more advanced lesion [17]. In contrast to observational studies, which found increased CRC risk in association with BMI in men only, but consistent with Mendelian randomization studies, which found increased CRC risk in association with BMI in women only, we observed an association between genetically predicted BMI and synchronous polyps in women only [3, 12]. Genetically predicted BMI has been associated with adiposity across the lifespan, particularly in early life and therefore may capture the influence of early life adiposity [6, 18]. Interestingly, early life body fatness has been associated with higher CRC risk in women but not in men [19].

Our analysis of individual obesity-related SNPs showed no statistically significant associations of individual SNPs and conventional adenoma or serrated polyp. Another study evaluated the association between five individual FTO gene polymorphisms and colorectal neoplasia, and found no significant associations in Caucasians, consistent with our findings for individual BMI alleles in relation to serrated polyp or conventional adenoma [20]. Interestingly, we identified two SNPs which were inversely related to synchronous polyp risk: rs11727676 and rs10132280. SNP rs11727676 is associated with the HHIP (Hedgehog interacting protein) gene, which is hypothesized to be associated with increased subcutaneous fat with more favorable metabolic markers [9]. SNP rs10132280 is related to the STXBP6 (syntaxin-binding protein 6) gene, which has previously been shown to be negatively correlated with lung adenocarcinoma [21]. No other studies to our knowledge have investigated individual obesity-related SNPs in relation to CRC precursor lesions or CRC risk.

The strengths of this study include the large cohort with prospective assessment of lifestyle factors, polyp documentation, and detailed histopathological information, which allowed for stratification of CRC precursor lesions by histology and location. We used a genetic risk score derived from 97 SNPs that have been associated with adult BMI to date. Despite the large sample size, the number of polyp cases was still relatively small for a Mendelian randomization study, and the non-significant findings for genetically predicted BMI and polyp subtypes may have been due to limited power.

In conclusion, in three large prospective cohorts of men and women, we found similar associations between measured or genetic BMI and CRC precursor lesions. Associations tended to be stronger for serrated and synchronous polyps, compared to conventional adenoma, in congruence with previous studies. Larger cohorts and meta-analyses would be helpful in elucidating the relationship between genetic risk of BMI and distinct CRC precursor lesions.