Introduction

Genome wide association studies (GWAS) detect large numbers of common variants showing statistically significant association with obesity although it can be difficult to interpret the biological processes underlying these signals [1]. In addition, a small number of genes have been identified in which very rare variants can have a major effect on body mass index (BMI) and their contribution and mechanisms have recently been reviewed [2]. In some of these, such as LEP, LEPR, PCSK1 and SIM1, recessively acting variants cause deficiency of the gene product and this can result in obesity. In others, including POMC and MC4R, heterozygous variants have been reported to be causative. Dominantly and recessively acting MC4R variants together constitute the commonest causes of inherited early-onset obesity, with a prevalence of 0.5–0.6%. It is also recognised that other nonsynonymous variants in MC4R can be associated with lower BMI and can be protective against obesity [3, 4].

As sequence data becomes available for larger numbers of subjects it is possible to explore the contribution of rare genetic variants to traits in the general population and we recently reported results obtained from analysing the association between rare variants and BMI in 50,000 exome-sequenced UK Biobank subjects [5]. Although no gene was exome wide significant, the analysis did highlight some which were potentially of interest, including LYPLAL1 and NSDHL. Since then, additional data has been released meaning that exome sequence data is now available for 200,000 of the 500,000 UK Biobank subjects to approved researchers [6]. Analyses of this larger dataset shows that it is better powered to detect rare variant effects and such analyses were successful in implicating, at exome-wide significance, genes previously recognised as risk factors for both hyperlipidaemia and type 2 diabetes [7, 8]. Here, we apply the same approach as previously, using BMI as the phenotype in the enlarged sample.

Early access to exome sequence data from the remaining UK Biobank subjects was granted to Regeneron Pharmaceuticals Inc. and their collaborators and a study using data from 429,000 UK Biobank subjects of European origin along with 217,000 from other samples has recently been published [9]. This study of over 640,000 exomes used BMI as a phenotype and performed burden analyses of rare variants to implicate 16 genes at exome-wide significance: UHMK1, GPR75, ROBO1, KIAA1109, PCSK1, GPR151, SPARC, UBR2, CALCR, PDE3B, ANO4, KIAA0586, MC4R, DPP9, ANKRD27 and GIPR. The approach used in the present study differs in a number of ways. The 640 K exome study excluded UK Biobank subjects of non-European ancestry, whereas we have previously shown that the methods used here are robust against population stratification, allowing the inclusion of subjects of all ancestries without inflation of the test statistic [5]. The 640 K exome study applied a simple burden analysis whereby counts of different categories of variant within each gene, such as predicted loss of function and missense predicted to be deleterious, were totalled together to test for association with BMI. Seven different variant selection models were used, requiring an additional correction for multiple testing. By contrast, we apply a weighted burden analysis which incorporates all variants in a single analysis but with higher weights assigned to those expected to have a larger effect. For example, predicted loss of function variants are given higher weights than missense variants predicted to be deleterious, which in turn have a higher weight than other missense variants. This removes the requirement to correct for multiple testing and, more importantly, is expected to yield higher power. From a statistical point of view, this is because likelihood ratio tests have higher power when the model for the alternative hypothesis more closely resembles the real situation. Thus, if predicted loss of function variants do in fact have a larger effect on the phenotype than missense variants, then one expects to gain power by weighting them differently. We have previously shown that predicted loss of function variants in LDLR are associated with a very high odds ratio for developing hyperlipidaemia whereas variants annotated as deleterious have a much more modest, though still statistically significant, effect [7]. In addition, we assign higher weights to variants which are rarer, under the assumption that variants which are extremely rare may have larger effect sizes. Work on real world data confirms that weighting on variant annotation and allele frequency in this way does indeed yield increased power [10].

The aims of this study were to identify genes in which rare sequence variants had an effect on BMI and to characterise the effect sizes of different categories of variant. Attention was focused on rare variants because the effects of common variants would have been well established from previous GWASs.

Methods

The UK Biobank dataset was downloaded along with the variant call files for 200,632 subjects who had undergone exome-sequencing and genotyping by the UK Biobank Exome Sequencing Consortium using the GRCh38 assembly with coverage 20X at 95.6% of sites on average [6]. UK Biobank had obtained ethics approval from the North West Multi-centre Research Ethics Committee which covers the UK (approval number: 11/NW/0382) and had obtained informed consent from all participants. The UK Biobank approved an application for use of the data (ID 51119) and ethics approval for the analyses was obtained from the UCL Research Ethics Committee (11527/001). All variants were annotated using the standard software packages VEP, PolyPhen and SIFT [11,12,13]. To obtain population principal components reflecting ancestry, version 2.0 of plink (https://www.cog-genomics.org/plink/2.0/) was run with the options --maf 0.1 --pca 20 approx [14, 15]. The phenotype was obtained from data field 21001-0.0, which records BMI at first assessment. Although BMI is an imperfect indicator of adiposity it has been widely used in similar investigations and also has the advantage that this data field is missing in very few subjects.

Using the same approach as described previously, the SCOREASSOC program was used to carry out a weighted burden analysis to test whether, in each gene, the weighted burden of sequence variants which were rarer and/or predicted to have more severe functional effects correlated with BMI [5]. Attention was restricted to rare variants with minor allele frequency (MAF) < = 0.01. As previously described, variants were weighted by overall MAF so that variants with MAF = 0.01 were given a weight of 1 while very rare variants with MAF close to zero were given a weight of 10 [5]. Variants were also weighted according to their functional annotation using the GENEVARASSOC program, which was used to generate input files for weighted burden analysis by SCOREASSOC [16, 17]. The weights allocated are to some extent arbitrary but took account of the analysis of the effects of different categories of variant in LDLR on hyperlipidaemia risk [7]. A systematic exploration of different weighting schemes has shown than no one scheme is optimal in all circumstances but that the one used here has reasonable performance [10]. Variants predicted to cause complete loss of function (LOF) of the gene were assigned a weight of 100. Nonsynonymous variants were assigned a weight of 5 but if PolyPhen annotated them as possibly or probably damaging then 5 or 10 was added to this and if SIFT annotated them as deleterious then 20 was added. In order to allow exploration of the effects of different types of variant on disease risk the variants were also grouped into broader categories to be used in multivariate analyses as described below. The full set of weights and categories is displayed in Table 1. As described previously, the weight due to MAF and the weight due to functional annotation were multiplied together to provide an overall weight for each variant. Variants were excluded if there were >10% of genotypes missing or if the heterozygote count was smaller than both homozygote counts. If a subject was not genotyped for a variant then they were assigned the subject-wise average score for that variant. For each subject a gene-wise weighted burden score was derived as the sum of the variant-wise weights, each multiplied by the number of alleles of the variant which the given subject possessed. For variants on the X chromosome, hemizygous males were treated as homozygotes.

Table 1 The table shows the weight which was assigned to each type of variant as annotated by VEP, Polyphen and SIFT as well as the broad categories which were used for multivariate analyses of variant effects [11,12,13].

For each gene, multiple linear regression analysis was carried out including the first 20 population principal components and sex as covariates and a likelihood ratio test was performed comparing the likelihoods of the models with and without the gene-wise burden score. For convenience, the statistical significance is expressed as a signed log p value (SLP), which is the log base 10 of the p value given a positive sign if the score is positively correlated with BMI. This means strongly positive or negative values for the SLP indicate results which are statistically significant, while the sign indicates whether impaired functioning of the gene is positively or negatively associated with BMI.

Gene set analyses were carried out as before using the 1454 “all GO gene sets, gene symbols” pathways as listed in the file c5.all.v5.0.symbols.gmt downloaded from the Molecular Signatures Database at http://www.broadinstitute.org/gsea/msigdb/collections.jsp [18]. For each set of genes, the natural logs of the gene-wise p values were summed according to Fisher’s method to produce a chi-squared statistic with degrees of freedom equal to twice the number of genes in the set. The p value associated with this chi-squared statistic was expressed as a minus log10 p (MLP) as a test of association of the set with BMI.

For selected genes, additional analyses were carried out to clarify the contribution of different categories of variant. As described previously, multiple linear regression analyses were performed on the counts of the separate categories of variant as listed in Table 1, again including principal components and sex as covariates, to estimate the effect size for each category [7]. The mean effect on BMI for each category was estimated along with the standard error and the Wald statistic was used to obtain a p value. The associated p value was converted to an SLP, again with the sign being positive if the mean count was positively correlated with BMI. In these analyses, stop variants and frameshift variants were considered jointly as “disruptive variants” and splice site variants were considered separately, although all three types of variant might generally be expected to have a similar LOF effect.

Data manipulation and statistical analyses were performed using GENEVARASSOC, SCOREASSOC and R [19]. Code availability: Software and scripts used to carry out the analyses are available at https://github.com/davenomiddlenamecurtis.

Results

There were 199,807 exome sequenced subjects for whom BMI was recorded. There were 110,092 male subjects with mean age 56.3 (SD = 8.0) and mean BMI 27.0 (SD = 5.1). There were 89,715 female subjects with mean age 56.7 (SD = 8.2) and mean BMI 27.8 (SD = 4.2). There were 20,384 genes for which there were qualifying variants, meaning that the critical threshold for the absolute value of the SLP to declare a result as formally statistically significant is -log10(0.05/20384) = 5.61. This threshold was met by two genes, MC4R (SLP = 15.79) and PCSK1 (SLP = 6.61). The quantile-quantile (QQ) plot for the SLPs obtained for all genes except MCR4 is shown in Fig. 1. This shows that the test appears to be well-behaved and conforms fairly well with the expected distribution. Omitting the genes with the 100 highest and 100 lowest SLPs, which might be capturing a real biological effect, the gradient for positive SLPs is 1.23 with intercept at −0.0005 and the gradient for negative SLPs is 1.03 with intercept at 0.02, indicating only moderate inflation of the test statistic for those genes showing a positive correlation.

Fig. 1: QQ plot of gene-level SLPs testing association with BMI.
figure 1

QQ plot of SLPs obtained for weighted burden analysis of association with BMI showing observed against expected SLP for each gene, omitting results for MC4R, which has SLP = 15.79.

For the two exome-wide significant genes, MC4R (SLP = 15.79) and PCSK1 (SLP = 6.61), logistic regression analysis of different categories of variants was carried out to elucidate their relative contributions. The results are shown in Table 2, which shows differences between the genes relating to the implicated pattern of variants. In MC4R, disruptive variants (stop and frameshift) are associated with a highly significant (SLP = 6.55) increase in BMI by 2.72 units, equivalent to about 8 kg for somebody of average height, and carriers have an average BMI of 30.16. These variants occur a total of 80 times at 19 separate positions. There are no splice site variants. In addition, nonsynonymous variants annotated by PolyPhen as probably damaging are also significantly (SLP = 4.29) associated with an average increase in BMI of 2.02 units. These occur in total 425 times at 55 positions. By contrast, other variants, including those annotated as deleterious by SIFT, are not associated with BMI changes. The estimated effect of the probably damaging variants represents an average across all the variants in this category and of course it is possible that some have major effects whereas other do not. However inspection of the detailed results showed that all of these variants were very rare (MAF < 0.001) and so it was not possible to reliably assess the effect of any individual variant. In PCSK1, disruptive variants are also significantly (SLP = 3.28) associated with an increase in BMI of 2.29 units and carriers have a mean BMI of 29.66. The estimated effect of splice site variants, which are also predicted to cause LOF, is similar, an increase of 2.01 units, but they only occur 8 times and this effect is not statistically significant. In contrast with MC4R, there is no suggestion that variants in PCSK1 annotated as probably damaging have any effect on BMI. However the much larger general category of protein-altering variants is associated with a modest (0.34 units) but statistically significant (SLP = 2.74) increase in BMI. In total these occur 2,970 times, meaning that there is an average burden per subject of 0.015.

Table 2 Results from regression analysis showing the effects on BMI of different categories of variant within the two exome-wide significant genes, MC4R and PCSK1.

One would expect that by chance 20 genes would produce SLPs with absolute value greater than 3, equivalent to p < 0.001, whereas in fact there are 68, suggesting that some might have an effect on BMI while failing to reach exome-wide significance after correction for multiple testing. These genes are listed in Table 3 and the SLPs for all genes are listed in Supplementary Table S1. Variant category analyses were carried out for those which seemed biologically plausible as well as for genes previously reported to be causative of obesity as listed in the introduction. These analyses yielded some findings of possible interest, discussed as follows.

Table 3 Genes with absolute value of SLP exceeding 3 or more (equivalent to p < 0.001) for test of association of weighted burden score with BMI.

It is perhaps striking that two similar genes, GALNT14 (SLP = 4.72) and GALNT9 (SLP = 4.01), fall within the top 13 genes. These enzymes catalyze the transfer of N-acetyl-D-galactosamine (GalNAc) to the hydroxyl groups on serines and threonines in target peptides. The GALNT9 intronic SNP rs11247009-A has been reported to be associated with BMI (p = 6 × 10–9) [20]. A study of broiler chickens claimed that in unpublished data one of the six most highly significant variants in a genome-wide study of abdominal fat was in GALNT9 and reported that GALNT9 expression in liver differed between lean and fat lines [21]. However, overall there seems to be little prior evidence to implicate these genes as affecting BMI and they have mostly been studied in the context of cancer progression, although there is also a report of a homozygous frameshift variant of GALNT14 being found in a patient with nonsyndromic keratoconus. The results of variant-wise analysis of these two genes are shown in Table 4A, B. This shows that GALNT14 there are 302 disruptive variants associated with a significant (SLP = 2.89) increase in BMI of 0.88 units, while in GALNT9 there are 12 splice site variants associated with an increase in BMI of 3.97 units (SLP = 2.44) and 9 indels associated with an increase in BMI of 4.84 units (SLP = 2.65). 36 disruptive variants in GALNT9 are also associated with an increase in BMI of 1.14 units but this is not statistically significant (SLP = 0.86).

Table 4 Results from variant category regression analyses for other genes of possible interest.

The results for SIRT1 (SLP = 3.16) are potentially of interest because SIRT1 and other sirtuins have effects similar to calorie restriction and reduced expression of SIRT1 and SIRT2 promotes adipogenesis and accumulation of visceral fat [22, 23]. From these findings one might well predict that genetic variants damaging SIRT1 might lead to increased BMI. The results from variant-wise analysis are shown in Table 4C, which shows only weakly significant effects from disruptive (SLP = 1.34) and possibly damaging (SLP = 1.61) variants.

ZBED6 (SLP = −3.33) codes for a transcriptional inhibitor of IGF2 which has a major impact on muscle development in placental mammals and CRISPR/Cas9 disruption of its binding site is being used commercially to produce strains of pigs which are leaner and have enhanced muscle development [24, 25]. The results for variant-wise analysis are shown in Table 4D, showing that disruptive variants are associated with a reduction in BMI of 1.59 units (SLP = −2.48) and deleterious nonsynonymous variants with a reduction of 0.37 units (SLP = −1.49).

The gene with the most negative SLP, BAIAP3 (SLP = −5.01), may have some role in insulin secretion but does not in general seem to be an obvious candidate to have effects on BMI [26]. Splice site variants are associated with a reduction in BMI of 1.41 units (SLP = −3.47).

It is well established that variants in LEP (SLP = 0.61) and LEPR (SLP = 0.13) can cause obesity but the gene-based analyses produced no evidence to implicate them. The results of variant-wise analyses are shown in Table 5A, B. It can be seen that disruptive and splice site variants in LEP do indeed have substantially higher BMIs but because there are only 6 of them this does not produce a statistically significant effect, at least if one corrects for the numbers of categories tested. There is no suggestion that any other type of variant has an effect. By contrast, in LEPR there are a total of 88 disruptive and splice site variants but their effect on mean BMI is negligible, as is also the case for other types of variant.

Table 5 Results from variant category regression analyses for LEP and LEPR.

A common nonsynonymous variant BDNF, rs6265, causes a Val66Met substitution which was originally reported to be associated with anorexia nervosa and minimum BMI in anorexia nervosa patients and whose effect on BMI was subsequently confirmed in large GWAS samples [27, 28]. This variant shows highly significant association in the current sample (SLP = −21.86). The number of subjects with Val/Val, Val/Met and Met/Met genotypes is 132,003, 60,639 and 7165 with uncorrected mean BMIs of 27.47, 27.22 and 26.96. The per-allele effect size on BMI as estimated from multiple linear regression analysis including principal components and sex as covariates is −0.19 (−0.23 to −0.15). However the gene-wise weighted burden analysis of BDNF using rare variants produced no evidence for association (SLP = 0.41) and variant-wise analyses likewise failed to show any effect from any category of rare variant. The mean effect size for protein-altering variants was 0.23 but there were only 1910 of these in total and the result does not approach statistical significance.

Of the remaining genes implicated by the analysis of the 640 K exome study, some produced some evidence for association which did not survive correction for multiple testing consisting of UHMK1 (SLP = −1.53), GPR75 (SLP = −2.98), ROBO1 (SLP = 2.21), KIAA1109 (SLP = 1.84), UBR2 (SLP = 2.69), PDE3B (SLP = 2.06), ANO4 (SLP = 1.50), DPP9 (SLP = −2.09) and GIPR (SLP = −2.63). However other genes showed no overall evidence for association, consisting of GPR151 (SLP = −0.80), SPARC (SLP = 0.14), CALCR (SLP = 0.41), KIAA0586 (SLP = −0.84) and ANKRD27 (SLP = −0.37). Detailed variant category analyses for these genes are presented in Supplementary Table S2. For some genes, it was possible to identify particular variant categories which appeared to be associated with BMJ. These consisted of disruptive variants in GPR75 (SLP = −4.87), ROBO1 (SLP = 2.91), KIAA1109 (SLP = 3.20), GPR151 (SLP = −2.17) and ANO4 (SLP = 1.31) whereas the broad category of protein-altering variants produced the strongest signal in two other genes, SPARC (SLP = 4.93) and GIPR (SLP = −4.29). For other genes, no category of variant was associated.

Other genes previously implicated in obesity which likewise failed to show evidence of association in either gene-wise analyses or variant category analyses include SIM1 (SLP = 0.89), NTRK2 (SLP = 0.88), KSR2 (SLP = 0.17), CPE (SLP = −0.35), SH2B1 (SLP = 0.78), TUB (SLP = −0.08) and FTO (SLP = 1.02). Variant category analyses for all genes of interest are presented in Supplementary Table S3.

In order to see if any additional genes were highlighted by analysing gene sets, gene set analysis was performed as described above after first removing all genes with absolute SLP value greater than 3. In order to correct for the observed inflation of the positive SLPs, the absolute value of each SLP was divided by an average inflation factor of 1.13 before being utilised to contribute to the set-wise chi-squared statistic. Following this adjustment, no gene set produced a result significant after correction for multiple testing. The highest MLP was 2.45, achieved by the set Specific Transcriptional Repressor Activity. Out of 1454 sets, the fifth highest ranked was Regulation Of Lipid Metabolic Process (MLP = 1.93). This contains 12 genes including NPC2 (SLP = 2.80), which is involved in cholesterol transport and recessively acting variants in NPC2 are a cause of Niemann-Pick C disease in which lipid accumulation causes neurodegeneration [29]. NPC2 presents cholesterol to NPC1 and rare LOF variants in NPC1 are known to cause obesity although NPC1 does not demonstrate association with BMI in the current sample (SLP = 0.15) [30]. In a GWAS of obesity in F2 pigs a variant within NPC2, rs81396056, produced the most highly significant result (p = 10−16) [31]. The results of variant category analysis of NPC2 are shown in Table 4E and it can be seen that there is significant (SLP = 3.10) association of 3119 splice site variants, occurring at 3 different positions, with an average increase in BMI of 0.28. Disruptive variants are also associated with higher BMI but there are only 111 of them and this result is not statistically significant. Results for all gene sets are presented in Supplementary Table S4.

Discussion

These analyses help to elucidate the impact of rare genetic variants on a complex phenotype such as BMI and also illustrate some of the challenges of dealing with exome sequence data. The gene-wise weighted burden analyses successfully identify two genes already known to impact BMI, MC4R and PCSK1, but fail to detect effects of other known obesity genes. In due course sequence data will become available for all 500,000 UK Biobank participants and it is reasonable to expect that this larger dataset will produce additional results. For example, the subjects with LOF variants in LEP do have notably higher BMIs but there are so few of them that they do not produce a statistically significant result in this sample. Obviously, the power to detect association depends both on the effect size and the frequency of variants, and power will improve with increased sample size. To take another example of this issue, although the results for the BDNF Val66Met variant are highly statistically significant, other protein altering variants in BDNF are associated with a larger average effect size but do not produce a statistically significant result because they are cumulatively so much rarer.

The results provide some indication about the quantitative effects of sequence variants but we should first note that the UK Biobank is not completely representative. It consists of volunteer participants who are on average older and healthier than the population as a whole. One implication of this is that subjects with more severe phenotypes will be less likely to be included and an overall effect of this will be to underestimate the effect size of rare variants which can cause morbidity and premature mortality. For example, we can observe that LOF variants in MC4R and PCSK1 are associated with an average increase of 2 or more BMI units but that this estimate may well represent a floor for the real effect size, and indeed much larger effects have been reported in a birth cohort characterised at age 18 [32].

The public health impact of genetic variants depends on their effect and on how many people carry them. For those categories of variant which are rare, the proportion of people carrying such a variant will be approximated by the average variant load because few people will have more than one variant. Thus, we may say that 0.04% of this sample has a LOF variant in MC4R associated with an increase of 2.7 in BMI while 0.2% have a variant annotated as probably damaging by PolyPhen associated with an average BMI increase of 2.0. Likewise, <0.03% of the sample has a LOF variant in PCSK1 which tends to increase BMI by 2.3 units whereas 1.5% carry a protein altering variant associated with an average BMI increase of 0.3.

The analyses fail to conclusively implicate novel genes as influencing BMI. The three which are arguably biologically the most plausible are SIRT1, ZBED6 and NPC2 but it must be acknowledged that the statistical evidence supporting their involvement is fairly weak. Conversely, there are other genes with higher statistical significance but whose function, as far as it is known, does not immediately suggest that they would have a prominent role in influencing BMI. It is clear that additional data will be needed to arrive at definitive solutions, whether it be from the remaining UK Biobank subjects or from alternative sources.

The results from these analyses would seem to point to very rare variants in a fairly small number of genes as having detectable effects on BMI but there are some caveats which are worth stating. First, the approach used makes the assumption that when variants are considered jointly then they will tend to have the same direction of effect on the phenotype. This seems a reasonable assumption for LOF variants, expected to reduce the functioning of a gene, but the method would fail if some non-synonymous variants reduced function but were balanced out by others which produced gain of function. While we may expect that on average a non-synonymous change, especially one annotated as damaging or deleterious, will be more likely to impair than improve function it is important to acknowledge that if there is a good deal of heterogeneity of effect then genes and classes of variant will fail to achieve statistical significance. Thus, these results should not be taken to exclude the possibility that there may be very large numbers of individually rare variants in many genes which might cumulatively make a substantial contribution to the overall variance of BMI in the population.

Another point to make is that association studies such as this, especially those based on population samples, are not expected to necessarily identify genes which affect BMI but rather genes in which naturally occurring variation affects BMI. For example, there are large variations in the frequency with which LOF variants are observed in different genes, reflecting partly the size of the gene but also selection pressures. Only 6 subjects have LOF variants in LEP compared to thousands in NPC2 and so LEP does not produce a detectable signal. However it may well be that by recognising LEP as potentially having a major and direct impact on BMI, functional studies will yield useful understanding of the underlying physiology. It should be noted that the selection pressures reducing variation in a particular gene may relate to the phenotype under consideration, here BMI, but may also involve other biological processes impacting on fitness.

To conclude, the study of very large, exome-sequenced samples such as the UK Biobank can afford us further insights into the relationship between genetic variation and a quantitative, health-related phenotype such as BMI.