Assessment of the potential role of natural selection in type 2 diabetes and related traits across human continental ancestry groups: comparison of phenotypic with genotypic divergence

Hanson, Robert L.; Van Hout, Cristopher V.; Hsueh, Wen-Chi; Shuldiner, Alan R.; Kobes, Sayuko; Sinha, Madhumita; Baier, Leslie J.; Knowler, William C.

doi:10.1007/s00125-020-05272-8

Assessment of the potential role of natural selection in type 2 diabetes and related traits across human continental ancestry groups: comparison of phenotypic with genotypic divergence

Article
Published: 04 September 2020

Volume 63, pages 2616–2627, (2020)
Cite this article

Download PDF

Diabetologia Aims and scope Submit manuscript

Assessment of the potential role of natural selection in type 2 diabetes and related traits across human continental ancestry groups: comparison of phenotypic with genotypic divergence

Download PDF

Robert L. Hanson ORCID: orcid.org/0000-0002-4252-7068¹,
Cristopher V. Van Hout²,
Wen-Chi Hsueh¹,
Alan R. Shuldiner²,
Sayuko Kobes¹,
Madhumita Sinha¹,
Leslie J. Baier¹,
Regeneron Genetics Center &
…
William C. Knowler¹

1361 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

Aims/hypothesis

Prevalence of type 2 diabetes differs among human ancestry groups, and many hypotheses invoke differential natural selection to account for these differences. We sought to assess the potential role of differential natural selection across major continental ancestry groups for diabetes and related traits, by comparison of genetic and phenotypic differences.

Methods

This was a cross-sectional comparison among 734 individuals from an urban sample (none of whom was more closely related to another than third-degree relatives), including 83 African Americans, 523 American Indians and 128 European Americans. Participants were not recruited based on diabetes status or other traits. BMI was calculated, and diabetes was diagnosed by a 75 g oral glucose tolerance test. In those with normal glucose tolerance (n = 434), fasting insulin and 30 min post-load insulin, adjusted for 30 min glucose, were taken as measures of insulin resistance and secretion, respectively. Whole exome sequencing was performed, resulting in 97,388 common (minor allele frequency ≥ 5%) variants; the coancestry coefficient (F_ST) was calculated across all markers as a measure of genetic divergence among ancestry groups. The phenotypic divergence index (P_ST) was also calculated from the phenotypic differences and heritability (which was estimated from genetic relatedness calculated empirically across all markers in 761 American Indian participants prior to the exclusion of close relatives). Under evolutionary neutrality, the expectation is P_ST = F_ST, while for traits under differential selection P_ST is expected to be significantly greater than F_ST. A bootstrap procedure was used to test the hypothesis P_ST = F_ST.

Results

With adjustment for age and sex, prevalence of type 2 diabetes was 34.0% in American Indians, 12.4% in African Americans and 10.4% in European Americans (p = 2.9 × 10⁻¹⁰ for difference among groups). Mean BMI was 36.3, 33.4 and 33.0 kg/m², respectively (p = 1.9 × 10⁻⁷). Mean fasting insulin was 63.8, 48.4 and 45.2 pmol/l (p = 9.2 × 10⁻⁵), while mean 30 min insulin was 559.8, 553.5 and 358.8 pmol/l, respectively (p = 5.7 × 10⁻⁸). F_ST across all markers was 0.130, while P_ST for liability to diabetes, adjusted for age and sex, was 0.149 (p = 0.35 for difference with F_ST). P_ST was 0.094 for BMI (p = 0.54), 0.095 for fasting insulin (p = 0.54) and 0.216 (p = 0.18) for 30 min insulin. For type 2 diabetes and BMI, the maximum divergence between populations was observed between American Indians and European Americans (P_ST-MAX = 0.22, p = 0.37, and P_ST-MAX = 0.14, p = 0.61), which suggests that a relatively modest 22% or 14% of the genetic variance, respectively, can potentially be explained by differential selection (assuming the absence of neutral drift).

Conclusions/interpretation

These analyses suggest that while type 2 diabetes and related traits differ significantly among continental ancestry groups, the differences are consistent with neutral expectations based on heritability and genetic distances. While these analyses do not exclude a modest role for natural selection, they do not support the hypothesis that differential natural selection is necessary to explain the phenotypic differences among these ancestry groups.

Analysis of genetic selection at insulin receptor substrate-2 gene loci

Article 23 January 2021

Complex interplay between neutral and adaptive evolution shaped differential genomic background and disease susceptibility along the Italian peninsula

Article Open access 01 September 2016

Evidence for natural selection at the melanocortin-3 receptor gene in European and African populations

Article 24 February 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Prevalence of type 2 diabetes varies among human continental ancestry groups, as does obesity, which is a strong risk factor for diabetes. In the USA, prevalence of diabetes and obesity is particularly high in American Indians, whereas prevalence is low in European Americans and intermediate in African Americans [1, 2]. Both type 2 diabetes and obesity are highly heritable [3, 4], and several hypotheses have invoked differences in natural selection across ancestry groups to explain differences in prevalence [5,6,7,8,9,10,11,12,13]. Recent genome-wide association studies have identified many variants reproducibly associated with both type 2 diabetes and obesity [14, 15]. Several investigators have analysed these established susceptibility loci for evidence of natural selection. Such studies have generally involved assessment of genetic signatures of recent selection or comparison of allele frequencies among ancestry groups [16,17,18,19,20,21,22]. Results of these studies are largely equivocal; however, both approaches are limited in their ability to detect selection on polygenic traits. An alternative approach involves comparison of genetic components of variance for the trait, among and within ancestry groups, with corresponding genotypic variance components across representative genomic markers [23, 24]. These variance components methods are well suited for detection of polygenic selection that differs in magnitude across groups, and they do not require knowledge of specific susceptibility variants. They do require comparably measured phenotypic data, along with genotypic data, across diverse ancestry groups. In the present study, we compare phenotypic divergence for type 2 diabetes and related traits with genotypic divergence in a cohort that includes African American, American Indian and European American individuals who had undergone whole exome sequencing.

Methods

Participants and measures

Participants were derived from a multiethnic study, conducted in urban Phoenix, Arizona, designed to identify determinants of diabetes and related traits; the methods have been previously described [25]. In brief, individuals were ≥18 years old, of any ethnicity, and participants were not recruited based on diabetes or other conditions. A large proportion of participants were American Indian, primarily from tribes of the Southwestern United States. The study was approved by the institutional review boards of the National Institute of Diabetes and Digestive and Kidney Diseases and the Phoenix Area Indian Health Service, and all participants gave informed consent. The present cross-sectional sample was derived from 1389 participants, examined in 2011–2016, who had relevant phenotypic data available, and data from whole exome sequencing. After exclusion of ten individuals who did not cluster with their primary self-reported ancestry group in principal components analyses, there were 88 individuals who were full-heritage African American by self-report, 761 individuals who were full-heritage American Indian and 129 individuals who were full-heritage European American. Since some analyses may be influenced by presence of closely related individuals, genetic relatedness was calculated between pairs of individuals using PREST (version 3.02) [26], and a set of ‘unrelated’ individuals was selected by randomly excluding one member of each pair in whom the observed proportion of alleles shared identical by descent was >0.14. (This excludes individuals who are second-degree relatives or closer to another individual in the sample.) This resulted in 83 African Americans, 523 American Indians and 128 European Americans. Characteristics of individuals are shown in electronic supplementary material (ESM) Table 1, and a principal components plot is shown in ESM Fig. 1.

Fasting plasma glucose and HbA_1c were measured, and a 75 g oral glucose tolerance test was administered to those without a previous diagnosis of diabetes, with glucose concentrations measured 30 min and 2 h after the oral glucose load. Individuals were classified as having diabetes if they had a previous diagnosis by self-report, fasting plasma glucose ≥7.0 mmol/l, 2 h plasma glucose ≥11.1 mmol/l or HbA_1c ≥ 6.5% (48 mmol/l) [27]. Serum insulin concentrations were measured by immunoassay (Tosoh Bioscience, Tokyo, Japan); fasting serum insulin and 30 min serum insulin, adjusted for 30 min glucose level, were taken as measures of insulin resistance and insulin secretion, respectively. Analyses of insulin measures were restricted to those with normal glucose tolerance (nondiabetic, and 2 h glucose <7.8 mmol/l), constituting 59 African Americans, 281 American Indians and 94 European Americans. Height and weight were measured for calculation of BMI. The maximum weight and contemporaneous height were also obtained by self-report. Analyses of BMI are generally shown based on self-reported maximum weight, as this was more strongly associated with diabetes.

Genotypes

Whole exome sequencing in DNA derived from peripheral blood was conducted at Regeneron Genetics Center, as previously described [28, 29]. Sequencing was conducted using a Hi-Seq 2500 sequencer (Illumina, San Diego, CA, USA). Sequencing was part of a larger project involving 8137 individuals, 43 of whom were excluded for low-quality sequence data. In 98% of samples, at least 90% of the exome achieved at least 20x coverage. Analysis was restricted to variants with <10% missing genotype calls, that were within Hardy–Weinberg equilibrium (p > 0.0001 in full-heritage American Indians), that had concordance rates >97.5% in 100 duplicate samples and for which average minor allele frequency across ancestry groups was ≥5%. This resulted in 97,388 autosomal markers. Missing genotypes were imputed from phased haplotypic data in each ancestry group using BEAGLE (version 3.2.2) [30].

Measures of divergence

The coancestry coefficient (F_ST) was calculated as a measure of genotypic divergence among ancestry groups. F_ST represents the proportion of variance in allele frequency in the total population that is explained by group membership. For each marker, we calculated F_ST by the method of moments [31, 32]. Across r ancestry groups, the mean squares among and within groups for a given allele u are, respectively:

$$ MSA=\frac{1}{r-1}{\sum}_{i=1}^r2{n}_i{\left({p}_{iu}-{\overline{p}}_u\right)}^2 $$

$$ MSW=\frac{1}{\sum_{i=1}^r\left(2{n}_i-1\right)}{\sum}_{i=1}^r2{n}_i{p}_{iu}\left(1-{p}_{iu}\right) $$

where 2n_i is the total number of alleles measured in the ith group (twice the number of individuals), p_iu is the frequency of allele u in the ith group and $ {\overline{p}}_u $ is the mean frequency of the u allele across groups. F_ST for a single marker with m alleles is:

$$ {F}_{ST-M}=\frac{\sum_{u=1}^m\left({MSA}_u-{MSW}_u\right)}{\sum_{u=1}^m\left[{MSA}_u+\left(2{n}_c-1\right){MSW}_u\right]} $$

where $ 2{n}_c=\frac{1}{\left(r-1\right)}{\sum}_{i=1}^r2{n}_{ic} $, and $ 2{n}_{ic}=2{n}_i-4{n}_i^2/{\sum}_{i=1}^r2{n}_i $. The mean value of F_ST-M over all markers was taken as the overall F_ST. This mean marker-wise F_ST is comparable to the phenotypic divergence measures described below and, thus, represents the expected value under neutrality [24, 33]. However, it tends to modestly underestimate the evolutionary distance, so we also report F_ST calculated by the ‘ratio of averages’ method, which provides a better estimate of this distance [34]. F_ST calculated from exome sequence data is generally comparable to that calculated from whole genome sequence data [35].

The quantitative genetic divergence index (Q_ST) is a measure of phenotypic divergence that is analogous to F_ST [23, 24]. For diploid organisms Q_ST is calculated as:

$$ {Q}_{ST=}\frac{\sigma_{Ga}^2}{\sigma_{Ga}^2+2{\sigma}_{Gw}^2} $$

where $ {\upsigma}_{\mathrm{Ga}}^2 $ is the variance among ancestry groups attributable to additive genetic effects and $ {\upsigma}_{\mathrm{Gw}}^2 $ is the genetic variance within groups. Under evolutionary neutrality, the expectation is that Q_ST = F_ST, whereas with diversifying selection (when differences in direction or magnitude of natural selection across groups drive phenotypic divergence), the expectation is Q_ST > F_ST [24, 33]. With stabilising selection (when selection is of similar direction and magnitude across groups), then Q_ST < F_ST. Variance components for calculation of Q_ST are typically estimated by ‘common garden’ controlled breeding experiments.

In humans and other natural populations where controlled breeding experiments are not feasible, Q_ST can be approximated by the phenotypic divergence index (P_ST). This uses the total phenotypic components of variance among and within ancestry groups, $ {\upsigma}_{Pa}^2 $ and $ {\upsigma}_{Pw}^2 $, respectively, rather than the genetic variance components. A general formula for P_ST is:

$$ {P}_{ST=}\frac{{c\upsigma}_{Pa}^2}{{c\upsigma}_{Pa}^2+2{h}^2{\upsigma}_{Pw}^2} $$

(1)

where h² represents the proportion of the within-group phenotypic variance due to additive genetic effects (i.e., heritability) and c represents the proportion of the among-group variance due to genetic factors [36]. When h² and c are known from representative populations, then P_ST, calculated from eq. 1, is an unbiased estimate of Q_ST.

In the present study we estimate h² in genetically related individuals (i.e., in pedigree data without exclusion of close relatives), but c is unknown, as is often the case. In this situation, there are two widely used formulae for P_ST, which make different assumptions about c. The formula of Leinonen et al. is [37]:

$$ {P}_{ST=}\frac{\upsigma_{Pa}^2}{\upsigma_{Pa}^2+2{h}^2{\upsigma}_{Pw}^2} $$

(2)

This assumes that c = 1, i.e., that all phenotypic differences among ancestry groups are due to genetic factors, and this estimate represents the maximum possible value of Q_ST for a given h². This can be justified by the notion that P_ST is a screen for identifying traits potentially under differential natural selection. Other investigators, however, consider it more prudent to assume that c = h² [38], and this leads to:

$$ {P}_{ST=}\frac{\upsigma_{Pa}^2}{\upsigma_{Pa}^2+2{\upsigma}_{Pw}^2} $$

(3)

This is more stringent in that it gives lower values of P_ST than eq. 2 (unless h² = 1). We calculate P_ST under both equations, and we present analyses under eq. 2 as the primary results with the recognition that these represent maximal estimates of P_ST. Results calculated under eq. 3 are presented in ESM Tables 5 and 8, and we conduct sensitivity analyses across a range of values for h² and c (including situations with c < h²) to evaluate effects on the conclusions.

Statistical analyses

Analyses were conducted in SAS (version 9.4; SAS Institute, Cary, NC, USA). Kernel density estimation (PROC KDE in SAS) was used to estimate nonparametric density functions. Phenotypic differences for continuous traits among ancestry groups were assessed using linear regression models with control for age and sex (and 30 min glucose, for analyses of 30 min insulin). A logistic regression model was used for analyses of diabetes. Heritability was assessed in the 761 American Indian participants (without exclusion of close relatives) using a linear mixed model. The total phenotypic variance was modelled as:

$$ {\upsigma}_P^2={\Phi \upsigma}_G^2+\mathrm{I} {\upsigma}_E^2 $$

where $ {\upsigma}_G^2 $ is the variance potentially attributable to genetic factors, $ {\upsigma}_E^2 $ is the variance attributable to individual-level environmental factors, Φ is a matrix of the proportion of alleles shared identical by descent between pairs of individuals (estimated by PREST [26]) and І is an identity matrix. Heritability was calculated as $ {h}^2={\upsigma}_G^2 $/$ {\upsigma}_P^2 $. These analyses were conducted in SOLAR (version 8.1.1) with adjustment for age, sex and the first genetic principal component in American Indians (to account for potential population stratification), and a probit model was used to analyse liability to diabetes [39]. Confidence intervals were calculated with a likelihood-based method [40].

For continuous traits, variance components for calculation of P_ST were taken from the mean squares among and within ancestry groups, derived from the regression model with ancestry group as a fixed effect [38, 41]. Thus,

$$ {\upsigma}_{Pw}^2= MSW= SSE/\left[{\sum}_{i=1}^r{n}_i-\left(r-1\right)-1\right] $$

$$ {\upsigma}_{Pa}^2=\frac{MSA- MSW}{n_0} $$

where SSE is the sums of squares error from the regression, and n_i is the number of individuals in the ith ancestry group; if μ_i is the trait mean in the ith ancestry group, predicted by the regression model, and $ \overline{\upmu} $is the total sample mean:

$$ {n}_0=\frac{1}{\left(r-1\right)}\ \left({\sum}_{i=1}^r{n}_i-\frac{\sum_{i=1}^r{n}_i^2}{\sum_{i=1}^r{n}_i}\right) $$

$$ MSA=\frac{1}{\left(r-1\right)}{\sum}_{i=1}^r{n}_i{\left({\upmu}_i-\overline{\upmu}\right)}^2 $$

For liability to diabetes, parameters were inferred from estimates of variance components derived from a probit mixed model in which ancestry group was a random effect (fit with PROC GLIMMIX in SAS). If $ {s}_{grp}^2 $ is the variance attributed to ancestry group and $ {s}_{rsd}^2 $ is the residual variance (from the Pearson χ² fit of the model), then $ SSE={s}_{rsd}^2{\sum}_{i=1}^r{n}_i $ and $ MSA=\frac{s_{grp}^2}{\left(r-1\right)}{\sum}_{i=1}^r{n}_i $. Similar approaches have been used elsewhere [42].

Parameters were estimated using a bootstrap procedure with 1000 iterations; 90% CIs were generated from centiles of the bootstrap distribution. Estimates of F_ST and P_ST depend on sample sizes, and their interpretation as measures of population divergence is most straightforward if sample sizes are equal in each ancestry group [32]. Therefore, in each bootstrap iteration, a sample size equal to that of the ancestry group with the smallest sample size was selected for each group. Although the mean marker-wise F_ST represents the expectation of P_ST under neutrality, there is substantial biological variability; despite polygenicity, the distribution is approximately that of F_ST for individual markers [24, 33]. Thus, comparison of P_ST with F_ST is most appropriately made across the single-marker distribution. Following Guo et al. [43], we generated this comparison from the distribution of F_ST-M shown in ESM Fig. 2. The proportion of markers for which the difference with the mean (F_ST-M − F_ST) was greater than the observed value of P_ST − F_ST was taken as the empirical one-sided p value for the null hypothesis P_ST = F_ST against the alternative P_ST > F_ST. The distribution of F_ST-M had a thick lower tail, with ~5% of markers having a value <0.01; thus, we did not test for the alternative P_ST < F_ST. Diversifying selection may act primarily on one ancestry group, and, in this situation, it may be most powerful to consider the maximum divergence of P_ST from F_ST across all pairwise comparisons; we report this value as P_ST-MAX (and assess its p value with correction for three pairwise comparisons). We also conduct a multivariable test of the null hypothesis of P_ST = F_ST across any of the five major traits (diabetes, BMI, height, fasting insulin, 30 min insulin) using a method that combines p values with allowance for the correlations between traits [44].

F_ST outlier analyses

For primary analyses, we used the ‘robust’ approach of comparing P_ST with F_ST taken across all available markers [45]. However, some of the markers themselves may have been subject to natural selection, and, to examine their potential influence, we conducted Bayesian outlier analysis of the F_ST-M distribution using BAYESCAN2 (version 2.01) [46]. This models the allele frequency differences among groups as a function of a population-specific F_ST component and a locus-specific F_ST component, subject to selection. Markers for which the locus-specific component is necessary are candidates for being under natural selection. As results depend on the specified prior odds of selection vs neutrality, we varied this parameter over a range of values and designated markers for which posterior odds were >1:1 as potentially under selection. Then, p values were calculated using the remaining, putatively neutral, markers.

Genetic admixture estimates

We used ADMIXTURE (version 1.3.0) to obtain estimates of genetic admixture proportions for each individual, assuming three ancestral populations [47]. In these analyses we included data from individuals in the 1000 Genomes project to improve resolution (ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/); data from the HapMap Yoruba in Ibadan, Nigeria, population and the Centre d’Etude du Polymorphism Humain Utah population were used as representative of African and European ancestry groups, respectively. To reduce the influence of linkage disequilibrium, we selected markers ~100 kb apart, after exclusion of markers that did not have consistent reference and alternative alleles between our exome sequence data and the 1000 Genomes data (and excluding A/T and C/G polymorphisms); this resulted in 14,672 markers.

Additional population data

To assess applicability of our results in more general population data, we obtained data for individuals ≥18 years of age from a population-based study from a high-risk southwest American Indian (SWAI) population (4032 full-heritage American Indians) [48], and from the oral glucose tolerance subset of the 2005–2010 National Health and Nutrition Examination Survey (NHANES) (https://wwwn.cdc.gov/nchs/nhanes/Default.aspx), which is representative of the general US population. This included 1271 individuals of non-Hispanic black ancestry, taken as representative of African Americans, and 2905 individuals of non-Hispanic white ancestry, taken as representative of European Americans. We calculated P_ST across these three populations for diabetes and BMI. In these analyses, diabetes was diagnosed based on self-report, fasting plasma glucose or 2 h plasma glucose (as HbA_1c was not available in all participants). Exome sequencing data were available from 3435 SWAI participants, but, as such data were not available for NHANES participants, we could not directly compare P_ST with F_ST. For an indirect comparison, we used genotypic data from the 1000 Genomes project, including the HapMap African Americans from the American Southwest population, as representative of African Americans, and the Centre d’Etude du Polymorphism Humain Utah population, as representative of European Americans. F_ST was calculated across these populations for 81,700 markers that had alleles called consistently between the exome sequence and the 1000 Genomes data.

Results

Phenotypic differences

Phenotypic differences among ancestry groups are shown in Fig. 1. Age- and sex-adjusted prevalence of diabetes was highest in American Indians (34.0%) and lower in African Americans (12.4%) and European Americans (10.4%, p = 2.9 × 10⁻¹⁰ for difference among groups). Similarly, mean age- and sex-adjusted maximum BMI was 36.3 kg/m² in American Indians, 33.4 kg/m² in African Americans and 33.0 kg/m² in European Americans (p = 1.9 × 10⁻⁷). Height was also significantly different among ancestry groups, with American Indians being shorter than African Americans and European Americans (p = 1.9 × 10⁻¹⁸, ESM Fig. 3). Among those with normal glucose tolerance, fasting serum insulin was higher in American Indians (geometric mean = 63.8 pmol/l adjusted for age and sex) than in African Americans (48.4 pmol/l) or European Americans (45.2 pmol/l, p = 9.2 × 10⁻⁵). The 30 min insulin, adjusted for age, sex and 30 min glucose, was lower in European Americans (358.8 pmol/l) than in African Americans (553.5 pmol/l) and American Indians (559.8 pmol/l, p = 5.7 × 10⁻⁸). With additional adjustment for BMI, differences in fasting insulin were largely attenuated (p = 0.25), while with additional adjustment for BMI and fasting insulin, differences in 30 min insulin remained statistically significant (p = 2.3 × 10⁻⁵).

Heritability

In 761 American Indian participants, type 2 diabetes was highly familial; 77% of the liability was potentially due to genetic factors (h² = 0.77; 90% CI 0.34, 1.00; p = 0.0017). Similarly, significant familial aggregation was observed for maximum BMI (h² = 0.36; 0.19, 0.53; p = 2.5 × 10⁻⁴) and height (h² = 0.71; 0.54, 0.85; p = 4.4 × 10⁻¹¹). In 434 American Indian participants with normal glucose tolerance, significant heritability was observed for fasting insulin (h² = 0.35; 0.04, 0.65; p = 0.035) and 30 min insulin adjusted for 30 min glucose (h² = 0.31; 0.03, 0.66; p = 0.035). These estimates were made with adjustment for the first genetic principal component, which captures the major source of stratification in this population; additional sources of population stratification may be captured with additional principal components, but, given the small number of relative pairs, at the risk of model overspecification. To assess the robustness of the h² estimates, we repeated the analysis with adjustment for the first five genetic principal components. For most traits, the h² estimates were only modestly attenuated (ESM Table 2). The exception was 30 min insulin, for which h² approached 0 (where P_ST is of questionable meaning as a measure of selection), so the h² estimate for this trait is not robust.

Comparison of genotypic and phenotypic divergence

Mean F_ST-M among all three ancestry groups across all 97,388 markers was 0.130 (Table 1). Estimates of P_ST for each phenotype are shown in Table 1; none of these were significantly higher than F_ST. For type 2 diabetes P_ST = 0.149 (90% CI 0.038, 0.272; p = 0.35 for comparison with F_ST). For maximum BMI, P_ST = 0.094 (0.017, 0.184; p = 0.54), and for height P_ST = 0.116 (0.053, 0.189; p = 0.46). Among those with normal glucose tolerance, P_ST for fasting insulin was 0.095 (0.001, 0.214; p = 0.54), while for 30 min insulin P_ST = 0.216 (0.082, 0.358; p = 0.18). The multivariable test across all five traits was not significant (p = 0.46). The largest departures from neutral expectations for pairs of ancestry groups generally occurred between American Indians and European Americans, but were not statistically significant; for diabetes P_ST-MAX = 0.22 (p = 0.37), while for BMI P_ST-MAX = 0.14 (p = 0.61). Similar results were obtained with directly measured BMI, with fasting insulin adjusted for BMI and with 30 min insulin adjusted for BMI and fasting insulin (ESM Table 3). Similar results were also obtained when men and women were analysed separately (ESM Fig. 4, ESM Table 4). When P_ST was calculated according to eq. 3, P_ST values tended to be lower than F_ST (ESM Table 5). A summary of the primary analyses is shown in Fig. 2.

Table 1 Genotypic divergence (F_ST) and phenotypic divergence (P_ST) for type 2 diabetes and related traits among ancestry groups

Full size table

In Bayesian outlier analyses, the number of markers potentially under selection ranged from 416 with prior odds for selection vs neutrality of 1:10, to 61,195 with prior odds of 4:3. Statistical significance levels were similar when restricted to putatively neutral markers regardless of prior odds (ESM Table 6). By analysis of individual admixture proportions, we estimated that, on average, 80% of the ancestry of African American participants derived from African sources, 96% of the ancestry of American Indian participants derived from Amerindian sources and 99% of the ancestry of European American participants derived from European sources. We repeated the divergence analyses with restriction to those whose genetic ancestry derived ≥85% from the continent corresponding to their stated ancestry group, constituting 31 African Americans, 475 American Indians and 126 European Americans. Similar results were obtained (ESM Table 7); for diabetes P_ST = 0.143 (p = 0.41), and, for BMI, P_ST = 0.098 (p = 0.57), while F_ST = 0.144.

Sensitivity analyses

Results of sensitivity analyses, which calculate P_ST for different values of h² and c, are shown in Fig. 3. For diabetes, P_ST was generally less than the 95th centile of marker-wise F_ST (0.340), except when h² was low and c was high (e.g., h² = 0.20 and c > 0.80). For BMI, fasting insulin and 30 min insulin, P_ST did not exceed the critical value of 0.340 for any value of c, for any h² ≥ 0.20.

Analyses of additional population data

Results of analyses comparing African Americans and European Americans from NHANES, and SWAI, are shown in Fig. 4. Age- and sex-adjusted prevalence of diabetes was highest in SWAI (40.6%), and lower in African Americans (14.2%) and European Americans (8.0%, p = 1.1 × 10⁻²¹⁴); similarly, mean BMI was highest in SWAI (35.1 kg/m²), and lower in African Americans (30.5 kg/m²) and European Americans (29.0 kg/m², p = 2.1 × 10⁻²⁸⁶). F_ST comparing SWAI with African Americans and European Americans from the HapMap populations was 0.134; the 95th centile of marker-wise F_ST was 0.354. Although P_ST values were modestly higher among these three populations than among the Phoenix cohort, they were well within the expected distribution of F_ST-M; for diabetes P_ST = 0.195 (90% CI 0.166, 0.224), while for BMI P_ST = 0.252 (0.223, 0.282) (ESM Table 8). When calculated by eq. 3, P_ST = 0.157 (0.133, 0.182) for diabetes and P_ST = 0.109 (0.094, 0.124) for BMI.

Discussion

Prevalence of type 2 diabetes and obesity differs across human continental ancestry groups, and there has been considerable speculation about the role of natural selection in these differences. The ‘thrifty genotype’ hypothesis posits that greater efficiency in using energy from food conferred a selective advantage in time of famine but that this predisposes to diabetes and obesity in modern environments [49, 50]. Differences in exposure and response to famine could, thus, have resulted in differences in prevalence of diabetes and obesity across ancestry groups [5,6,7]. Alternatively, it has been proposed that release from predation freed humans from selective pressure against obesity, and that high prevalence of obesity in humans is the result of neutral genetic drift [51]. Others have proposed that agriculture introduced a high load of carbohydrate into human diets, and that populations that adopted agriculture early, such as Europeans, have experienced greater selection for carbohydrate tolerance than other populations, resulting in protection from diabetes [8, 9]. Others have hypothesised that differences in diabetes and obesity across ancestry groups reflect adaptation to different climates or to different infectious diseases [10,11,12,13]. Such hypotheses have sometimes been discussed on the basis of the differences among ancestry groups, without consideration of whether these differences could arise neutrally.

There are few empirical genetic data supporting hypotheses that natural selection across ancestry groups contributes to risk of diabetes or obesity. Some studies have analysed established type 2 diabetes and obesity variants for molecular signatures of recent natural selection, such as extended haplotypic homozygosity. These studies have generally not found greater evidence for selection at established variants in comparison with suitably matched genomic variants [16,17,18], although one study did find a modest excess of evidence for selection at protective alleles for diabetes [22]. These methods are most powerful for detecting classic ‘sweeps’ where a previously rare allele at a single locus rapidly increases in frequency, and they are not well suited for detecting selection on polygenic traits. Others have used established variants to estimate a polygenic Q_ST analogue, and they found little evidence for differential selection for BMI or type 2 diabetes variants [52]. Other studies have compared allele frequencies for established variants across ancestry groups. Although these have found considerable differences in allele frequencies, the pattern of differences has not corresponded to epidemiologic risk [2, 19,20,21]; type 2 diabetes risk allele frequencies for established variants tend to be high in Africans but low in American Indians. However, the causal variants that contribute to diabetes and obesity are incompletely known and linkage disequilibrium patterns vary across populations, and this can introduce unpredictable biases into these comparisons [53]. The variance components methods used in the present study are designed to detect polygenic selection which differs across ancestry groups, and they do not require knowledge of specific causal variants. Our analyses show that phenotypic differences for type 2 diabetes and related traits among African Americans, American Indians and European Americans are consistent with expectations based on heritability and genetic divergence. Thus, strong diversifying selection is not necessary to explain the phenotypic differences.

Recent genetic admixture among groups could attenuate the phenotypic differences. However, we obtained similar results when analyses were restricted to those whose genetic ancestry derived ≥85% from the continent corresponding to their stated ancestry, and this suggests that our results are not unduly influenced by admixture. Phenotypic measures were made identically across the ancestry groups in our cohort. Although the Phoenix cohort is a relatively small convenience sample, we observed significant differences in diabetes and obesity risk among groups, which replicate known epidemiologic associations [1, 2]. Furthermore, P_ST estimates were only modestly higher when calculated among NHANES samples representative of the US African American and European American populations and an SWAI population with high prevalence of obesity and diabetes. Thus, results from these larger population samples are generally consistent with those from the Phoenix cohort.

Comparisons of phenotypic with genotypic divergence are optimally made using genetic components of variance for the trait (i.e., Q_ST), while we have used the total phenotypic components of variance (P_ST). The extent to which P_ST approximates Q_ST depends on heritability (h²) and the proportion of the among-population phenotypic variance explained by genetic factors (c). Our estimates of h² are based on a relatively small sample of related individuals, but they are comparable to those reported in large meta-analyses of twin studies for BMI and diabetes [3, 4]. Although our primary analyses were conducted under the assumption that c = 1, environmental differences in determinants of these traits among populations would result in lower values of c, and would tend toward lower values of P_ST. Our sensitivity analyses suggest that our findings are consistent over a large range of values of h² and c. For BMI and diabetes, the largest pairwise differences in phenotypic and genotypic divergence were observed between American Indians and European Americans, and these differences suggest that a relatively modest 14% or 22% of the genetic variance in each trait, respectively, is potentially attributable to differential selection (this is under the assumption that there is no effect of neutral genetic drift, although the genotypic divergence suggests these values are well within neutral expectations).

With comparisons over only three ancestry groups representing a large portion of modern human genetic history, the present approach has limited power to detect modest degrees of diversifying selection—fairly strong or sustained selection is required for the phenotypic divergence of a single trait to exceed expectations based on genotypic divergence. We estimate by simulation that P_ST ≈ 0.40 is required to detect p < 0.05 with 80% power (ESM Fig. 5; this corresponds to an average difference in phenotypic mean of ~0.9 SD between pairs of ancestry groups). This degree of divergence is somewhat less than observed for traits under established diversifying selection, such as skin pigmentation or craniofacial morphometry ([38, 43]; we did not directly measure skin pigmentation in the present study, but, for skin pigmentation predicted genetically [54], P_ST = [Q_ST] = 0.831, p = 1.0 × 10⁻⁵, ESM Fig. 6). The power of these variance component methods to detect diversifying selection depends on the number of ancestry groups included, as well as the number of individuals in each ancestry group [45]. While we obtained similar P_ST estimates among the larger population cohorts as in the Phoenix cohort with the same three ancestry groups, inclusion of additional ancestry groups may be required to detect more subtle selection. The present results do not exclude more complex models of selection, such as selection on a suite of complex traits, including some diabetes-related traits, with the overall differentiation constrained by pleiotropy, nor do they exclude modest diversifying selection on diabetes-related traits too weak to be detected by the present methods primarily affecting the American Indian group. However, our results were obtained in major continental ancestry groups at diverse risk for diabetes and obesity, and they suggest that differences in natural selection across these groups are not necessary to explain the phenotypic differences. Investigations of the causes of differences in diabetes risk across these groups would do well to consider alternative explanations.

Data availability

Data for consenting individuals will be made available through the Database of Genotype and Phenotype (https://www.ncbi.nlm.nih.gov/gap) pending Institutional Review Board approval.

Abbreviations

F _ST :: Coancestry coefficient
NHANES:: National Health and Nutrition Examination Survey
P _ST :: Phenotypic divergence index
Q _ST :: Quantitative genetic divergence index
SWAI:: Southwest American Indian

References

Cowie CC, Casagrande SS, Geiss LS (2018) Prevalence and incidence of type 2 diabetes and prediabetes. In: Cowie CC, Casagrande SS, Menke A et al (eds) Diabetes in America, 3rd edn. National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, NIH Publication, 17–1468, pp 3–1 to 3–32.
Hanson RL, Rong R, Kobes S et al (2015) Role of established type 2 diabetes-susceptibility genetic variants in a high prevalence American Indian population. Diabetes 64:2646–2657. https://doi.org/10.2337/db14-1715
Article CAS Google Scholar
Willemsen G, Ward KJ, Bell CG et al (2015) The concordance and heritability of type 2 diabetes in 34,166 twin pairs from international twin registers: The discordant twin (DISCOTWIN) consortium. Twin Res Hum Genet 18:762–771. https://doi.org/10.1017/thg.2015.83
Article Google Scholar
Elks CE, den Hoed M, Zhao JH et al (2012) Variability in the heritability of body mass index: A systematic review and meta-regression. Front Endocrinol (Lausanne) 3:29. https://doi.org/10.3389/fendo.2012.00029
Article Google Scholar
Joffe B, Zimmet P (1998) The thrifty genotype in type 2 diabetes: An unfinished symphony moving to its finale? Endocrine 9:139–141. https://doi.org/10.1385/ENDO:9:2:139
Article CAS Google Scholar
Wendorf M, Goldfine ID (1991) Archaeology of NIDDM. Excavation of the "thrifty" genotype. Diabetes 40:161–165. https://doi.org/10.2337/diab.40.2.161
Article CAS Google Scholar
Gerstein HC, Waltman L (2006) Why don't pigs get diabetes? Explanations for variations in diabetes susceptibility in human populations living in a diabetogenic environment. CMAJ 174:25–26. https://doi.org/10.1503/cmaj.050649
Article Google Scholar
Miller JC, Colagiuri S (1994) The carnivore connection: Dietary carbohydrate in the evolution of NIDDM. Diabetologia 37:1280–1286. https://doi.org/10.1007/BF00399803
Article CAS Google Scholar
Corbett SJ, McMichael AJ, Prentice AM (2009) Type 2 diabetes, cardiovascular disease, and the evolutionary paradox of the polycystic ovary syndrome: A fertility first hypothesis. Am J Hum Biol 21:587–598. https://doi.org/10.1002/ajhb.20937
Article Google Scholar
Sellayah D, Cagampang FR, Cox RD (2014) On the evolutionary origins of obesity: a new hypothesis. Endocrinology 155:1573–1588. https://doi.org/10.1210/en.2013-2103
Article CAS Google Scholar
Fridlyand LE, Philipson LH (2006) Cold climate genes and the prevalence of type 2 diabetes mellitus. Med Hypotheses 67:1034–1041. https://doi.org/10.1016/j.mehy.2006.04.057
Article CAS Google Scholar
Dayaratne DA (2010) Impact of ecology on development of NIDDM. Med Hypotheses 74:986–988. https://doi.org/10.1016/j.mehy.2009.12.017
Article CAS Google Scholar
Wells JC (2009) Ethnic variability in adiposity and cardiovascular risk: the variable disease selection hypothesis. Int J Epidemiol 38:63–71. https://doi.org/10.1093/ije/dyn183
Article Google Scholar
Mahajan A, Taliun D, Thurner M et al (2018) Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat Genet 50:1505–1513. https://doi.org/10.1038/s41588-018-0241-6
Article CAS Google Scholar
Locke AE, Kahali B, Berndt SI et al (2015) Genetic studies of body mass index yield new insights for obesity biology. Nature 518:197–206. https://doi.org/10.1038/nature14177
Article CAS Google Scholar
Southam L, Soranzo N, Montgomery SB et al (2009) Is the thrifty genotype hypothesis supported by evidence based on confirmed type 2 diabetes- and obesity-susceptibility variants? Diabetologia 52:1846–1851. https://doi.org/10.1007/s00125-009-1419-3
Article CAS Google Scholar
Ayub Q, Moutsianas L, Chen Y et al (2014) Revisiting the thrifty gene hypothesis via 65 loci associated with susceptibility to type 2 diabetes. Am J Hum Genet 94:176–185. https://doi.org/10.1016/j.ajhg.2013.12.010
Article CAS Google Scholar
Wang G, Speakman JR (2016) Analysis of positive selection at single nucleotide polymorphisms associated with body mass index does not support the "thrifty gene" hypothesis. Cell Metab 24:531–541. https://doi.org/10.1016/j.cmet.2016.08.014
Article CAS Google Scholar
Klimentidis YC, Abrams M, Wang J, Fernandez JR, Allison DB (2011) Natural selection at genomic regions associated with obesity and type-2 diabetes: East Asians and sub-Saharan Africans exhibit high levels of differentiation at type-2 diabetes regions. Hum Genet 129:407–418. https://doi.org/10.1007/s00439-010-0935-z
Article Google Scholar
Chen R, Corona E, Sikora M et al (2012) Type 2 diabetes risk alleles demonstrate extreme directional differentiation among human populations, compared to other diseases. PLoS Genet 8:e1002621. https://doi.org/10.1371/journal.pgen.1002621
Article CAS Google Scholar
Corona E, Chen R, Sikora M et al (2013) Analysis of the genetic basis of disease in the context of worldwide human relationships and migration. PLoS Genet 9:e1003447. https://doi.org/10.1371/journal.pgen.1003447
Article CAS Google Scholar
Segurel L, Austerlitz F, Toupance B et al (2013) Positive selection of protective variants for type 2 diabetes from the Neolithic onward: a case study in Central Asia. Eur J Hum Genet 21:1146–1151. https://doi.org/10.1038/ejhg.2012.295
Article CAS Google Scholar
Spitze K (1993) Population structure in Daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135:367–374
CAS Google Scholar
Leinonen T, McCairns RJ, O'Hara RB, Merila J (2013) Q(ST)-F(ST) comparisons: evolutionary and ecological insights from genomic heterogeneity. Nat Rev Genet 14:179–190
Article CAS Google Scholar
Olaiya MT, Hanson RL, Kavena KG et al (2019) Use of graded Semmes Weinstein monofilament testing for ascertaining peripheral neuropathy in people with and without diabetes. Diabetes Res Clin Pract 151:1–10. https://doi.org/10.1016/j.diabres.2019.03.029
Article Google Scholar
Sun L, Wilder K, McPeek MS (2002) Enhanced pedigree error detection. Hum Hered 54:99–110. https://doi.org/10.1159/000067666
Article Google Scholar
American Diabetes Association (2010) Diagnosis and classification of diabetes. Diabetes Care 33(Suppl 1):S62–S69. https://doi.org/10.2337/dc10-S062
Article Google Scholar
Dewey FE, Murray MF, Overton JD et al (2016) Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study. Science 354:aaf6814. https://doi.org/10.1126/science.aaf6814
Article CAS Google Scholar
Kim HI, Ye B, Gosalia N et al (2020) Characterization of exome variants and their metabolic impact in 6,716 American Indians from southwest US. Am J Hum Genet 107:251–264. https://doi.org/10.1016/j.ajhg.2020.06.009
Article CAS Google Scholar
Browning SR, Browning BL (2007) Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet 81:1084–1097. https://doi.org/10.1086/521987
Article CAS Google Scholar
Weir BS, Cockerham CC (1984) Estimating F-statistics for the analysis of population structure. Evolution 38:1358–1370. https://doi.org/10.1111/j.1558-5646.1984.tb05657.x
Article CAS Google Scholar
Weir BS, Hill WG (2002) Estimating F-statistics. Annu Rev Genet 36:721–750. https://doi.org/10.1146/annurev.genet.36.050802.093940
Article CAS Google Scholar
Whitlock MC (2008) Evolutionary inference from Q_ST. Mol Ecol 17:1885–1896. https://doi.org/10.1111/j.1365-294X.2008.03712.x
Article Google Scholar
Bhatia G, Patterson N, Sankararaman S, Price AL (2013) Estimating and interpreting F_ST: The impact of rare variants. Genome Res 23:1514–1521. https://doi.org/10.1101/gr.154831.113
Article CAS Google Scholar
Maroti Z, Boldogkoi Z, Tombacz D, Snyder M, Kalmar T (2018) Evaluation of whole exome sequencing as an alternative to BeadChip and whole genome sequencing in human population genetic analysis. BMC Genomics 19:778
Article CAS Google Scholar
Brommer JE (2011) Whither P_ST? The approximation of Q_ST by P_ST in evolutionary and conservation biology. J Evol Biol 24:1160–1168. https://doi.org/10.1111/j.1420-9101.2011.02268.x
Article CAS Google Scholar
Leinonen T, Cano JM, Makinen H, Merila J (2006) Contrasting patterns of body shape and neutral genetic divergence in marine and lake populations of threespine sticklebacks. J Evol Biol 19:1803–1812. https://doi.org/10.1111/j.1420-9101.2006.01182.x
Article CAS Google Scholar
Zaidi AA, Mattern BC, Claes P, McEvoy B, Hughes C, Shriver MD (2017) Investigating the case of human nose shape and climate adaptation. PLoS Genet 13:e1006616. https://doi.org/10.1371/journal.pgen.1006616
Article CAS Google Scholar
Almasy L, Blangero J (1998) Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet 62:1198–1211. https://doi.org/10.1086/301844
Article CAS Google Scholar
Neale MC, Miller MB (1997) The use of likelihood-based confidence intervals in genetic models. Behav Genet 27:113–120. https://doi.org/10.1023/a:1025681223921
Article CAS Google Scholar
Sokal RR, Rohlf FJ (1969) Single-classification analysis of variance. In: Emerson R, Kennedy D, Park RB (eds) Biometry: The principles and practice of statistics in biological research. W.H. Freeman and Company, San Francisco, pp 204–252
Google Scholar
Hangartner S, Laurila A, Rasanen K (2012) Adaptive divergence in moor frog (Rana arvalis) populations along an acidification gradient: Inferences from Q_ST–F_ST correlations. Evolution 66:867–881. https://doi.org/10.1111/j.1558-5646.2011.01472.x
Article Google Scholar
Guo J, Tan J, Yang Y et al (2014) Variation and signatures of selection on the human face. J Hum Evol 75:143–152. https://doi.org/10.1016/j.jhevol.2014.08.001
Article Google Scholar
Kost JT, McDermott MP (2002) Combining dependent P-values. Stat Prob Lett 60:183–190. https://doi.org/10.1016/S0167-7152(02)00310-3
Article Google Scholar
Whitlock MC, Guillaume F (2009) Testing for spatially divergent selection: comparing Q_ST to F_ST. Genetics 183:1055–1063. https://doi.org/10.1534/genetics.108.099812
Article Google Scholar
Fischer MC, Foll M, Excoffier L, Heckel G (2011) Enhanced AFLP genome scans detect local adaptation in high-altitude populations of a small rodent (Microtus arvalis). Mol Ecol 20:1450–1462. https://doi.org/10.1111/j.1365-294X.2011.05015.x
Article Google Scholar
Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664. https://doi.org/10.1101/gr.094052.109
Article CAS Google Scholar
Traurig M, Hanson RL, Marinelarena A et al (2016) Analysis of SLC16A11 variants in 12,811 American Indians: genotype-obesity interaction for type 2 diabetes and an association with RNASEK expression. Diabetes 65:510–519. https://doi.org/10.2337/db15-0571
Article CAS Google Scholar
Neel JV (1962) Diabetes mellitus: a "thrifty" genotype rendered detrimental by "progress"? Am J Hum Genet 14:353–362
CAS Google Scholar
Neel JV, Weder AB, Julius S (1998) Type II diabetes, essential hypertension, and obesity as "syndromes of impaired genetic homeostasis": the "thrifty genotype" hypothesis enters the 21st century. Perspect Biol Med 42:44–74. https://doi.org/10.1353/pbm.1998.0060
Article CAS Google Scholar
Speakman JR (2008) Thrifty genes for obesity, an attractive but flawed idea, and an alternative perspective: the 'drifty gene' hypothesis. Int J Obes 32:1611–1617. https://doi.org/10.1038/ijo.2008.161
Article CAS Google Scholar
Berg JJ, Coop G (2014) A population genetic signal of polygenic adaptation. PLoS Genet 10:e1004412. https://doi.org/10.1371/journal.pgen.1004412
Article CAS Google Scholar
Martin AR, Gignoux CR, Walters RK et al (2017) Human demographic history impacts genetic risk prediction across diverse populations. Am J Hum Genet 100:635–649. https://doi.org/10.1016/j.ajhg.2017.03.004
Article CAS Google Scholar
Maroñas O, Phillips C, Söchtig J et al (2014) Development of a forensic skin colour predictive test. Forensic Sci Int Genet 13:34–44. https://doi.org/10.1016/j.fsigen.2014.06.017
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the staff of the Phoenix Epidemiology and Clinical Research Branch who assisted with this study. The opinions expressed in this paper are those of the authors, and do not necessarily reflect the views of the Indian Health Service. This work was presented in part at the American Society of Human Genetics Annual Meeting, 15–19 October 2019, Houston, TX, USA.

Authors’ relationships and activities

ARS and CVH are employed by Regeneron Genetics Center. The authors declare that there are no other relationships or activities that might bias, or be perceived to bias, their work.

Funding

This work was supported in part by the intramural research program of the National Institute of Diabetes and Digestive and Kidney Diseases, and in part by the Regeneron Genetics Center. The study funders were not involved in the design of the study; the collection, analysis, and interpretation of data; writing the report; and did not impose any restrictions regarding the publication of the report.

Author information

Authors and Affiliations

Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, Phoenix, AZ, USA
Robert L. Hanson, Wen-Chi Hsueh, Sayuko Kobes, Madhumita Sinha, Leslie J. Baier & William C. Knowler
Regeneron Genetics Center, Tarrytown, NY, USA
Cristopher V. Van Hout & Alan R. Shuldiner

Authors

Robert L. Hanson
View author publications
You can also search for this author in PubMed Google Scholar
Cristopher V. Van Hout
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Chi Hsueh
View author publications
You can also search for this author in PubMed Google Scholar
Alan R. Shuldiner
View author publications
You can also search for this author in PubMed Google Scholar
Sayuko Kobes
View author publications
You can also search for this author in PubMed Google Scholar
Madhumita Sinha
View author publications
You can also search for this author in PubMed Google Scholar
Leslie J. Baier
View author publications
You can also search for this author in PubMed Google Scholar
William C. Knowler
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Regeneron Genetics Center

Contributions

RLH contributed to study conception and design, data acquisition, analysis and interpretation of data and drafting of the manuscript. ARS, LJB, W-CH and WCK contributed to study conception and design, data acquisition, analysis and interpretation of data and revising the draft for intellectual content. CVH, SK and MS contributed to data acquisition, analysis and interpretation of data and revising the draft for intellectual content. Contributions of authors in the Regeneron Genetics Center are listed in ESM Text. All authors read and approved the final manuscript. RLH is the guarantor of the integrity of the work.

Corresponding author

Correspondence to Robert L. Hanson.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A list of Regeneron Genetics Center contributors is included in the electronic supplementary material (ESM).

Electronic supplementary material

ESM

(PDF 689 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hanson, R.L., Van Hout, C.V., Hsueh, WC. et al. Assessment of the potential role of natural selection in type 2 diabetes and related traits across human continental ancestry groups: comparison of phenotypic with genotypic divergence. Diabetologia 63, 2616–2627 (2020). https://doi.org/10.1007/s00125-020-05272-8

Download citation

Received: 29 February 2020
Accepted: 22 July 2020
Published: 04 September 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s00125-020-05272-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Assessment of the potential role of natural selection in type 2 diabetes and related traits across human continental ancestry groups: comparison of phenotypic with genotypic divergence

Abstract

Aims/hypothesis

Methods

Results

Conclusions/interpretation

Similar content being viewed by others

Analysis of genetic selection at insulin receptor substrate-2 gene loci

Complex interplay between neutral and adaptive evolution shaped differential genomic background and disease susceptibility along the Italian peninsula

Evidence for natural selection at the melanocortin-3 receptor gene in European and African populations

Introduction

Methods

Participants and measures

Genotypes

Measures of divergence

Statistical analyses

FST outlier analyses

Genetic admixture estimates

Additional population data

Results

Phenotypic differences

Heritability

Comparison of genotypic and phenotypic divergence

Sensitivity analyses

Analyses of additional population data

Discussion

Data availability

Abbreviations

References

Acknowledgements

Authors’ relationships and activities

Funding

Author information

Authors and Affiliations

Consortia

Regeneron Genetics Center

Contributions

Corresponding author

Additional information

Publisher’s note

Electronic supplementary material

ESM

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

F_ST outlier analyses