Abstract
To explore novel molecular mechanisms underlying obesity, we applied a systems genetics framework to integrate risk genetic loci from the largest body mass index (BMI) genome-wide association studies (GWAS) meta-analysis with mRNA and microRNA profiling in adipose tissue from 200 subjects. One module was identified to be most significantly associated with obesity and other metabolic traits. We identified eight hub genes which likely play important roles in obesity metabolism and identified microRNAs that significantly negatively correlated with hub genes. This module was preserved in other three test gene expression datasets, and all hub genes were consistently downregulated in obese subjects through the meta-analysis. Gene GPD1L had the highest connectivity and was identified a key causal regulator in the module. Gene GPD1L was significantly negatively correlated with the expression of miR-210, which was experimentally validated that miR-210 regulated GPD1L protein level through direct interaction with its mRNA three prime untranslated region (3′-UTR). GPD1L was found to be upregulated during weight loss and weight maintenance induced by low calorie diet (LCD), while downregulated during weight gain induced by high-fat diet (HFD). The results indicated that increased GPD1L in adipose tissue may have a significant therapeutic potential in reducing obesity and insulin resistance.
Similar content being viewed by others
Introduction
Obesity is a complex condition characterized by excessive storage of energy mainly as lipids in white adipocytes, as a result of a positive energy balance. Epidemic of obesity has become an exponentially growing public health problem associated with several severe diseases such as type 2 diabetes (T2D), cardiovascular diseases (CVD) and various types of cancers1. According to a World Health Organization (WHO) report, obesity has reached epidemic proportions globally, with at least 2.8 million people dying each year from obesity.
Body mass index (BMI) is the most widely used measure for assessing obesity, and has a strong genetic component (40% to 70% heritability)2, 3. The largest genome-wide association study (GWAS) meta-analysis for BMI, including a total of 339,224 individuals, identified 97 obesity-associated loci (P < 5 × 10−8), which accounted for only 2.7% of BMI variation4. A large proportion of the genetic variance remained to be discovered. It is extremely difficult to follow up on every gene in the functional experiment among a large number of loci identified by GWAS. Therefore, prioritization of the promising genes was of great interest by placing them in a broad context such as implicated tissues/cell-types, pathways and inter-connected networks, as gene functions are commonly dependent on their tissue context and the disordered interplay of tissue/cell-type with specific processes5. Transcriptomic analysis on thousands of genes simultaneously can provide further insight into the molecular mechanisms of complex traits, as gene expression provides an important link between genetic variations and their corresponding phenotypic alterations. It is well recognized that network analysis is capable of revealing comprehensive transcriptional regulation for complex diseases, since genes interact with each other in complex regulatory networks. For example, gene coexpression networks comprised of modules of genes demonstrating high levels of co-regulation6.
We implemented a systems genetics framework to integrate genetic loci identified from the largest BMI GWAS meta-analysis with gene expression profiles in adipose tissue from 200 subjects to explore underlying molecular mechanisms for obesity (Fig. 1). Genetic loci associated with diseases from GWAS can perturb specific parts of gene networks, whose overall dysregulation may shift homeostatic processes and lead to disease4, 7, 8. Using weighted gene coexpression network analysis (WGCNA), we identified modules composed of genes that were highly interconnected with one another and displayed significant associations with obesity and related metabolic traits. Through characterization of module content and topology, we identified hub genes which likely play an important role in obesity metabolism. Next, we applied the causal network inference and key driver analysis (KDA) to identify the key causal regulators in the obesity-associated module. As adipose microRNAs (miRNAs) have been linked to adipogeneisis in vitro and in animal obesity models, they play an important role in the development of obesity9. The cooperative and combinatorial targeting ability of miRNA allow precise and robust gene regulation at both the single-gene and gene-network levels10. In order to assess the role of miRNA in regulating gene expression in pathological processes, we studied the regulation of human adipose miRNAs expression and their relationships with obesity-associated modules and hub genes in the same 200 subjects. Furthermore, we utilized four publicly available experiments in adipose tissues from obese (BMI > 30) and non-obese subjects to evaluate preservation of module topology. In addition, we further utilized five studies including four longitudinal dietary intervention studies (one in-house clinical trial, The POUND LOST Trial) in human abdominal subcutaneous adipose tissue to provide supportive biological information for future functional validation of hub genes’ involvement in adipose tissue metabolism.
Our goal is to better understand the biological mechanisms underlying obesity risk and to identify most important candidate causal genes for obesity by integration of genetic loci, transcriptome in adipose tissue, phenotypic data and experiments relevant to adipose tissue metabolism. A most promising candidate causal gene GPD1L was highlighted. Our findings provided strong support for the important role of GPD1L and also shed light on its molecular functional mechanisms in the etiology of obesity and insulin resistance.
Materials and Methods
Discovery Stage (I)
Gene-based method on summary statistics of BMI meta-analysis GWAS
We downloaded publicly available summary statistics from a meta-analysis of 125 studies of BMI performed by the Genetic Investigation of ANthropometric Traits (GIANT) Consortium, a total sample of 339,224 individuals (http://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files)4. GWAS meta-analysis results were corrected for genomic control using all the SNPs11.
All the SNPs in the BMI meta-analysis GWAS were converted to gene lists using ProxyGeneLD, which calculated gene-wide p-value for each gene by reading the SNP list and BMI association statistics12. Based on CEU HapMap samples, ProxyGeneLD identified clusters of GWAS SNPs in high LD (r 2 ≤ 0.80). Genes were defined as the transcript plus a 1-kbp extension upstream to include promoter regions. Gene-wide p-values were then calculated as the minimum of any SNP in the gene, either as a singleton or member of a proxy cluster per gene. Genes with gene-wide P ≤ 0.05 for BMI were referred to as the nominally significant GWAS genes for subsequent analyses.
Gene expression data processing
To generate gene coexpression networks, we used a microarray data set from 200 random human subjects who were part of population-based cross-sectional Metabolic Syndrome in Men (METSIM) study13. Participants of the METSIM study had been extensively characterized for physical and metabolic traits, including BMI, fat mass (FM), waist-to-hip ratio (WHR), total cholesterol (TC), high-density lipoprotein (HDL), low-density lipoprotein (LDL), total triglyceride (TG) levels, adiponectin, C-reactive protein (CRP), Matsuda index of insulin sensitivity14 and the homeostasis model assessment-estimated insulin resistance (HOMA-IR)15. Total RNA from the METSIM subjects was isolated from abdominal subcutaneous adipose tissue, which was obtained with needle biopsies. High-quality RNA samples from each METSIM subject were hybridized to Illumina Human HT-12 v3 Expression Bead-Chips (San Diego, CA, USA). Data were downloaded from the National Center for Biotechnology Information Gene Expression Omnibus (GEO) (accession number: GSE32512) and subjected to a stringent quality control analysis, using non-parametric background correction followed by quantile normalization with both control and expression probes utilizing the neqc function in the Linear Models for Microarray Analysis (LIMMA) package in R/Bioconductor16, 17. Probes with detection p-value ≥0.01 in ≥90% of 200 subjects were removed, which were not significantly expressed above the background level. The remaining missing data were imputed using the k-nearest neighbor procedure, as implemented in the impute R package18. As part of quality-control protocol, a clustering and principal-components analysis (PCA) was performed to identify potential outliers in the dataset by using functions hclust and prcomp in R. After clustering samples based on global expression values and PC1, one sample was identified as an outlier and therefore removed from subsequent analyses.
Construction of coexpression networks and identification of obesity-associated gene coexpression modules
Gene coexpression networks were constructed using WGCNA6, 9. WGCNA is a well-established tool for studying biological networks based on pair wise correlation between variables in high-dimensional microarray data sets. First, we identified probes assaying the expression of nominally significant GWAS genes. For these genes, we then calculated Pearson correlation coefficients for all gene-gene comparisons across microarray samples. The matrix of correlations was then converted to an adjacency matrix of connection strengths. The adjacencies were defined as \({a}_{ij}={|cor({x}_{i},{x}_{j})|}^{\beta }\), where x i and x j are the ith and jth gene expression traits. The parameter β of the power function was defined in such a way that the resulting coexpression network (adjacency matrix) satisfies approximate scale-free topology, as previously outlined by Zhang and Horvath19. We chose a power of 6, which resulted in an approximate scale-free topology network with the scale-free fitting index R2 greater than 0.8.
Modules were defined as sets of genes with high topological overlap. The topological overlap measure (TOM) \(TO{M}_{ij}=\frac{{\sum }_{u\ne i}{a}_{iu}{a}_{uj}+{a}_{ij}}{\min ({k}_{i},{k}_{j})+1-{a}_{ij}}\), where \({k}_{i}={\sum }_{u\ne i}{a}_{ui}\) denotes the network connectivity, calculated as the sum of the connection strengths with all other module genes. A TOM-based dissimilarity measure (1 − TOM) was used for hierarchical clustering. Gene modules corresponded to the branches of the resulting dendogram and were precisely defined using the “dynamic hybrid” branch-cutting algorithm20. Module eigengenes (MEs) were defined as the first principal component calculated using PCA, which can summarize modules’ behavior. These MEs were tested for association with obesity and other phenotypes adjusted for age using a linear regression model. Gene significance (GS) for the each module gene was defined as the absolute value of its Pearson correlation with obesity. The module membership (MM) (also referred as intramodular connectivity) of gene i in module q, kME value, was defined as the absolute value of Pearson correlation between its gene expression and the module eigengene. Specifically, \(kM{E}_{q}(i)=|cor({x}_{i},M{E}_{q})|\), where ME q is the module eigengene of the module q. Note that it specifies how close gene i is to module q and does not require that gene i to be a member of the module q. Both GS and MM can be combined in a systems biological screening method for finding hub genes, which are highly interconnected nodes within gene coexpression modules and have been shown to play key roles in controlling module behavior and disease mechanisms6, 21, 22. The module hub gene was defined as the gene in the module with the highest connectivity, or based on a high intra-modular connectivity (MM > 0.8) and high GS (>0.2).
To assess the importance of hub genes, we used the Mouse Genome Informatics (MGI) web tool (http://www.informatics.jax.org/)23 to analyze the reported endogenous gene expression during mouse development. To determine whether the obesity-associated modules were biologically meaningful, GO enrichment analyses were conducted using the WebGestalt web site (http://genereg.ornl.gov/webgestalt)24. We examined enrichment of Biological Process GO terms using hypergeometric test and the p-value was adjusted by Bonferroni correction.
Inference of causal network structure and identification of key causal regulators
In order to establish a causal relationship or dependency between nodes in the network, we constructed causal structures from the gene expression data. A popular method for constructing causal structures is probabilistic graphical models, which are used to analyze and visualize the conditional independence relationships between variables25. The structure of conditional independence among variables is usually represented as a directed acyclic graph (DAG), where each node represents a variable (expression of a gene) and each directed edge represents a direct cause. The PC algorithm (PC stands for the initials of its inventors Peter Spirtes and Clark Glymour) was used to learn causal structure in R package pcalg25, setting the significant level of the conditional independence test (partial correlation) α = 0.01. The PC algorithm starts from a complete undirected graph and deletes recursively edges based on conditional independence decisions. This yields an undirected graph that can then be partially directed. With high-dimensional datasets when the distribution of variables is multivariate normal, the partial correlation test is the most efficient tool to implement the conditional independence test in the PC algorithm26.
We intersected the full causal network with the Yellow module to identify the causal regulators (i.e., key drivers) that primarily regulated the expression of the module through key driver analysis (KDA). KDA was developed to identify key causal regulators for a particular gene set of interest with respect to a given regulatory network and recently has been successfully applied to identify local hubs as potential key drivers of pathological perturbations in disease-associated modules in multiple complex diseases8, 27, 28. Briefly, the KDA took as input a set of genes (G) and a gene causal (directed) network N. We calculated the size (μ) of the h-layer neighborhood (HLN) downstream and the out-degrees (d) for each node in the subnetwork resulting from the intersection between causal network and the Yellow module. Genes with the size of their HLN greater than \(\overline{\mu }+\sigma (\mu )\) were nominated as causal regulators. The regulators with degree above \(\overline{d}+2\sigma (d)\), become key causal regulators of a corresponding network module27.
Integrative analysis of miRNA and mRNA expression data
In order to assess the role of miRNA in regulating obesity-associated module genes in pathological processes, we integrated mRNA and miRNA expression profiles from the same 200 subjects using miRComb R package (http://mircomb.sourceforge.net). MiRNA abundance was quantified using next-generation sequencing. After quality control and normalization on the miRNA expression levels, detailed in the study by Civelek, et al.13, 356 miRNA species were reliably quantified that were expressed in human adipose tissue. The log2 of the normalized expression values for the 356 miRNAs were used for subsequent analysis.
Firstly, miRComb package calculated Pearson correlation of the expressions of all possible combinations of significantly differentially expressed mRNAs and miRNAs. Secondly, miRNA-mRNA pairs were identified by screening databases for known and predicted miRNA-mRNA target pairs in the MicroCosm database29. The package assigned a p-value to each miRNA-mRNA pair according to the MicroCosm database (p_database) which predicted mRNA-miRNA interactions based on miRNA and mRNA 3′UTR alignment sequences29. Finally, only those pairs with significant negative correlation and predicted by the MicroCosm database as potential targets were selected as potential functional miRNA-mRNA interactions.
Replication Stage (II)
We studied module preservation and gene association with obesity in other four test gene expression profiles in adipose tissue from obese (BMI > 30) and non-obese subjects. Gene expression data sets were downloaded from GEO (accession number: GSE25401, GSE64567), and from ArrayExpress (accession number: E-MTAB-54), which contained whole-genome expression profiling on abdominal (ABD) and gluteal (GLU) adipose tissue. Each data set was subjected to stringent quality control. Potential outliers were identified by clustering and PCA and then removed.
Module preservation
Network module preservation statistics assess whether modules identified in the discovery network remain connected in the test network (density), and whether node connections are similar between the discovery and the test network (connectivity). To assess the preservation of modules discovered from GSE32512 in other four test expression data sets, we used the network module preservation statistics implemented in the function modulePreservation (100 permutations) in the WGCNA R package30, 31. Module preservation statistics were calculated without the need to define modules in the test dataset.
Although there are many statistics to determine which module properties are preserved, we focused on the Zsummary statistic which summarizes evidence that a module is preserved more significantly than a random sample of all network genes30. The following thresholds were suggested by simulation results in Langfelder et al.’s study30: Zsummary < 2 implies no evidence for module preservation, 2 < Zsummary < 10 implies weak to moderate evidence, and Zsummary ≥ 10 implies strong evidence for module preservation.
Meta-analysis of module genes
In GSE32512 and other four test gene expression profiles, the associations between genes of the Yellow module and obesity were tested by a linear model adjusting for age using LIMMA package17. For each coefficient in the linear model, empirical Bayes moderated t-statistics and their associated p-values were calculated to assess the significance of the observed expression changes17. P-values were adjusted for multiple testing by controlling for false discovery rate (FDR) using the Benjamini-Hochberg (BH) method32. In order to identify differentially expressed genes (DEGs) associated with obesity consistently across these five gene expression datasets, two meta-analysis methods were applied. First, a nonparametric meta-analysis method, Fisher’s method was performed to combine p-values from individual data sets to identify DEGs with large effect sizes across these five gene expression datasets. Fisher’s method was used to combine p-values by LIMMA from individual data sets. The combined Fisher’s statistic \({\chi }^{2}=-2\sum _{i=1}^{k}\mathrm{ln}({p}_{i})\) followed a χ 2 distribution with 2k degrees of freedom (where k is the number of datasets) under the null hypothesis. Note that smaller p-values contributed larger scores to the Fisher’s χ 2 statistic. Second, because Fisher’s method could not distinguish direction of effect sizes, a second meta-analysis method was used to combine effect sizes across all datasets into a meta-effect size to estimate the magnitude of gene expression change. Cochran’s Q statistic and I 2 were calculated as measures of between-study heterogeneity for each gene. Genes with Q test’s p-value < 0.05 or I 2 > 50% were assessed by a random effects model that allows heterogeneity in the effect sizes between different datasets33; the remaining genes were assessed by a fixed effects model, which assumed that the standardized effect sizes can be combined across different data sets and that the variations in observed effects were due only to random error34. Differences in each gene’s expression between obese and normal groups were expressed as the standardized mean difference (SMD). The z-statistic for each gene was computed as a ratio of the pooled SMD to its standard error, and the result was compared with 1000 permutations to obtain a nominal p-value using R MetaDE package35. P-values were corrected for multiple testing using the BH method.
Validation Stage (III)
We utilized five experiments including four longitudinal dietary intervention studies in human abdominal subcutaneous adipose tissue to assess supportive biological information for future functional validation of hub genes’ involvement in adipose tissue metabolism. These experiments included the gene expression data in adipose tissue from obese or overweight human subjects in The Preventing Overweight Using Novel Dietary Strategies (POUNDS LOST) Trial (experiment 1)36, gene expression profile in adipose tissue from obese human subjects during weight loss and weight maintenance induced by LCD (experiment 2)37, real-time quantitative PCR (RT-qPCR) analysis of the effects of a longitudinal dietary intervention on human adipose tissue in the Diet, Obesity and Genes (Diogenes) intervention trial (experiment 3)38, 39, gene expression profile of metabolic responses and changes in adipose tissue from lean and overweight subjects during HFD overfeeding (experiment 4)40 and gene expression profile in vitro differentiated adipocytes from healthy subjects (experiment 5)41.
Experiment 1: Differential gene expression in adipose tissue from obese or overweight human subjects in the POUNDS LOST trial
The POUNDS LOST Trial (clinical trial reg. no. NCT00072995) is a 2-year randomized intervention trial which tested the effect of energy-reduced diets on weight loss in overweight or obese subjects36. We examined the differential gene expression of hub genes in human adipose tissue between baseline and 6-month weight loss intervention.
The POUNDS LOST Trial was conducted at Boston, MA, and Baton Rouge, LA, in 2004–2007. The study was approved by the human subjects committee at the Harvard School of Public Health and Brigham and Women’s Hospital, Boston, MA; the Pennington Biomedical Research Center of the Louisiana State University, Baton Rouge, LA; and a data and safety monitoring board appointed by the National Heart, Lung, and Blood Institute. All participants gave written informed consent. Detailed information on the The POUNDS LOST Trial study design and methods was previously described36. Major criteria for study exclusion were the presence of diabetes or unstable cardiovascular disease, the use of medications that affect body weight, and insufficient motivation.
A total of 811 overweight or obese subjects (BMI ≥ 25 and ≤ 40 kg/m2), age 30–70 years, were randomly assigned to one of weight loss diets. Each participant’s caloric prescription represented a deficit of 750 kcal per day from baseline, as calculated from the person’s resting energy expenditure and activity level. After an overnight fast, volunteers were biopsied by using Bergstrom technique with suction (Micrins Inc., Lake Forest, IL, USA) from abdominal subcutaneous adipose tissue (~500 mg). Total RNA was extracted by Trizol (Invitrogen, Carlsbad, CA, USA). The quantity and integrity of the RNA was analyzed by spectrophotometry and gel electrophoresis. Gene expression was measured by direct hypbridization using the Illumina HT-12 v3 expression beadchip (Illumina, San Diego, CA, USA). Gene expression profiles were assessed at baseline and at six months from 102 participants. After background correction and normalization, 13,948 of 48,803 probes had detection p-value < 0.01 (unpublished data). The differential expression of hub genes in the Yellow module between baseline and after 6 months intervention was conducted in LIMMA package and p-values were corrected for multiple testing using the BH method.
Experiment 2: Differential gene expression in adipose tissue from obese human subjects during weight loss and weight maintenance
To examine the differential gene expression of hub genes in human adipose tissue during diet-induced weight loss and weight maintenance after weight loss, we downloaded and analyzed the publicly available data set from GEO (accession number: GSE35411)37. RNA from abdominal subcutaneous adipose tissue from nine obese subjects was obtained and analyzed at baseline, after weight reduction on a low calorie diet (LCD), and after a period of group therapy in order to maintain weight stability. RNA profiling was performed using Affymetrix Human HG U133 Plus 2.0 arrays. Expression values were normalized by RMA algorithms42, implemented in R within the Affy package43. Differential expression analysis across three time points was conducted in LIMMA package. We focused on differences at the later time points by investigating contrasts of differences between each time point. LIMMA gave a moderated F statistic which can be used to test whether all contrasts were zero simultaneously, that is, whether there was no difference between the three time points. P-values were corrected for multiple testing using the BH method.
Experiment 3: Real-time quantitative PCR analysis of the effects of a longitudinal dietary intervention on human adipose tissue in Diogenes intervention trial
In order to decipher the adipose tissue response to diet-induced weight changes, another study focused on 221 genes during a longitudinal dietary intervention, which was part of the Diogenes intervention trial. A detailed description of the trial can be found in previous publications38, 39. Abdominal subcutaneous adipose tissue was obtained from 135 obese individuals at baseline, after a weight loss phase of 8-week LCD and after an ad libitum 26-week weight maintenance diet (WMD). After RNA extraction the mRNA levels of a panel of 221 genes selected from previous published and unpublished DNA microarray analyses was assessed using high throughput real-time quantitative PCR39. The processed RT-qPCR data files were downloaded from GEO (accession number: GSE60946). The differential expression analysis and multiple testing were the same with GSE35411 (Experiment 2).
Experiment 4: Differential expression in adipose tissue development and growth during overfeeding in human subjects
To further characterize the metabolic responses and changes of hub genes in gene expression that take place in adipose tissue during the early adaptive response to weight gain, we downloaded and analyzed the publicly available data set from GEO (accession number: GSE28005)40. In total, 13 subjects (7 overweight and 6 lean subjects) were involved in a high fat diet (HFD) for 56 days. Adipose tissue biopsies were taken at Day 0, Day 14 and Day 56. RNA profiling in adipose tissue was performed using Affymetrix Human HG U133 Plus 2.0 arrays. Expression values were normalized by Robust Multi-array Analysis (RMA) algorithms42, implemented by the Affy package in R43. Statistical analysis was also performed in LIMMA package. There were two groups (overweight and lean) and each subject was measured at Day 0, Day 14 and Day 56. Determining the differentially expressed genes meant studying contrasts of the various group × time levels. We were mainly interested in differences between the overweight and lean group at each time point. So a possible set of contrasts to investigate was “overweight0-lean0”, “overweight14-lean14”, “overweight56-lean56”. Similarly, a moderated F statistic was used to test whether all contrasts were zero simultaneously, that is, whether there was no difference between the two groups at any of the three time points. P-values were corrected for multiple testing using the BH method.
Experiment 5: Differential expression during differentiation of primary human adipocytes
To determine whether the hub genes were differentially expressed during in vitro differentiation of primary human adipocytes, we analyzed gene expression profiles of subcutaneous adipose tissue obtained from healthy subjects undergoing cosmetic liposuction (n = 12)41. The gene expression dataset was publicly accessible via GEO (accession number: GSE25910). Preadipocytes were isolated from the adipose tissue and in vitro differentiated to adipocytes. The cells were lysed at day 4/5 (early), 8 (middle) and 12 (late) of differentiation. From the samples, biotinylated complementary RNA was prepared and hybridized to Affymetrix GeneChip Human Gene 1.0 ST Array (Affymetrix Inc., Santa Clara, CA). Pre-processing and quality control was performed using the Affymetrix Expression Console version 1.1 by summarization, background correction and normalization41. Differential expression analysis across three time points was the same with GSE35411 (Experiment 2).
Results
Our systems genetics framework for underlying molecular mechanisms of obesity was shown in Fig. 1. There were three stages in the framework. For each gene expression dataset in the present study, the detailed characteristics of samples, type of gene chip, tissue and number of outliers were shown in Supplementary Table S1.
Construction of coexpression networks and identification of obesity-associated gene coexpression modules
In total, 3035 genes, which were nominally significant GWAS genes with gene-wide p-value ≤ 0.05 for BMI and measured in GSE32512, were used to construct the coexpression networks in WGCNA package. Shown in Supplementary Fig. S1, 11 modules were identified by WGCNA and module sizes ranged from 41 (Purple module) to 492 genes (Turquoise module). Five modules (Pink, Turquoise, Yellow, Black, and Green) were significantly associated with both obesity and BMI at FDR < 0.001. Especially, the Yellow module (module size = 119 genes) showed the most significant result (p-value = 6.4E-20 for BMI and p-value = 7.3E-05 for obesity). We observed that ME of the Yellow module explained 7.0% of obesity variation and 34.2% of BMI variation, the latter of which was the largest among all the modules and relatively high considering BMI was a complex trait. We also observed that FM, CRP, TG, WHR, and HOMA-IR were significantly negatively correlated with the Yellow module, while adiponectin, HDL and Matsuda index were significantly positively correlated with the Yellow module (Fig. 2). Similarly, the Yellow module ME accounted for 17.5% of the variation of FM, 36.8% of HOMA-IR, 14.0% of adiponectin, 22.0% of HDL, 34.3% of Matsuda index, 24.8% of TG and 28.3% of WHR.
Based on the biological significance, the Yellow module appeared to play an important biological role in metabolism. We found that there was a significant positive correlation between module membership (MM) and Gene significance (GS) in the Yellow module (r = 0.62, p-value = 5.5E-14) (Supplementary Fig. S2). In total, there were eight hub genes (GPD1L, CCDC50, NAALAD2, ALDH1L, ADH1B, ADH1A, PCCA and ORMDL3) identified by the criteria in the Yellow module. Out of 97 loci identified with genome-wide significance by GIANT4, 32 genes were measured in GSE32512 and 18 of them failed to fit within a distinct group and were assigned to the “grey” module. Only one gene EPB41L4B was contained in the Yellow module. Although EPB41L4B was not identified as a hub gene (MM = 0.67), it reached the genome-wide significance and therefore was included into the subsequent analyses together with the hub genes.
Except for LDL and TC, all other traits had significant gene expression findings at FDR < 0.05 (Supplementary Table S2). We then conducted GO enrichment analysis for gene content of the Yellow module using the WebGestalt website (http://genereg.ornl.gov/webgestalt)24. The Yellow module genes showed six significantly enriched GO terms at Bonferroni-corrected p-value < 0.05 (Supplementary Table S3), which included cytoplasm, protein complex binding, cytoplasmic part, intracellular part, pore complex and alcohol dehydrogenase activity, zinc-dependent. Interestingly, ADH1A and ADH1B, two hub genes in the Yellow module, were enriched for GO term alcohol dehydrogenase activity, zinc-dependent (Fold enrichment = 71.42, Bonferroni-correct p-value = 4.77E-02).
The reported endogenous gene expression of hub genes during mouse development from MGI web tool was shown in Supplementary Table S4. In particular, all hub genes were reported to express in nervous system. Especially, GPD1L was reported to associate with Brugada syndrome44. ADH1A and ADH1B were associated with alcohol dependence45.
Inference of causal network structure and identification of key causal regulators
The causal structure based on the conditional independence test (partial correlation, detailed in Methods) contained 3843 significant causal relationships among 3035 genes in GSE32512. We intersected the causal network with the Yellow module to identify the key drivers which primarily regulated the expression of the module. 7 key drivers were identified including ADH1A, ADH1B, ALDH1L1, GPD1L, CCDC50, CCDC80 and COL16A1, shown in Fig. 3. Not surprisingly, the first five key drivers were hub genes defined above by GS and MM, as KDA relied on precomputed gene-gene interaction networks and topology-based gene ranking. Especially, based on the significant level of the conditional independence test α = 0.01, the causal inference revealed the causal links among all eight hubs and the top GWAS loci EPB41L4B (Fig. 3).
Integrative analysis of miRNA and mRNA expression data
For the 119 genes in the Yellow module, 35 out of 852 miRNA-mRNA interactions were significant (FDR < 0.05). The network of miRNA-mRNA interactions with FDR < 0.05 was shown in Supplementary Fig. S3. For the hub genes, shown in Table 1, four pairs were identified at FDR < 0.05, including ALDH1L1 and hsa-miR-342-5p (r = −0.27, FDR = 1.76E-03), NAALAD2 and hsa-miR-140-3p (r = −0.29, FDR = 7.98E-04), ORMDL3 and hsa-miR-362-3p (r = −0.20, FDR = 3.23E-02), and PCCA and hsa-miR-301b (r = −0.25, FDR = 5.42E-03). The hub gene GPD1L was negatively correlated with hsa-miR-210 (r = −0.17, FDR = 7.91E-02), which was experimentally validated and confirmed that the predicted hsa-miR-210 binding site on GPD1L 3′ UTR is a bona fide target and that has-miR-210 regulated GPD1L protein levels through direct interaction with the mRNA 3′ UTR46. There was no predicted miRNA-mRNA target pair in the MicroCosm database for gene ADH1B.
Preservation of Yellow module in other four datasets
The Yellow module showed strong evidence for preservation in three datasets, including E-MTAB-54 ABD, GLU and GSE25401, with Zsummary = 14, 10 and 14, respectively. However, in GSE64567, the Yellow module showed no evidence for module preservation (Zsummary = 1.6).
Meta-analysis of module genes
In the meta-analysis of combining p-values across individual gene expression profiles using Fisher’s method, 73 genes in the Yellow module showed significant results with obesity (FDR < 0.05) (Supplementary Table S5). The second meta-analysis approach combined effect sizes across individual gene expression profiles, resulting in the identification of genes that were upregulated and downregulated. 84 genes in the Yellow module were identified to differentially express between obese and non-obese subjects with FDR < 0.05 (Supplementary Table S6). Specifically, 66 overlapped genes showed significant results in both meta-analysis methods. The results of meta-analysis for the hub genes were showed in Table 2. All hub genes and gene EPB41L4B had negative SMDs with 95% CIs not including 0, which indicated that they were significantly downregulated in obese subjects.
Expression profiling experiments in human adipose tissue
In experiment 1 (The POUND LOST Trial)36, we investigated the effects of a 6-month weight-loss diet intervention on hub genes expression in 102 obese or overweight subjects. Although all the hub genes were upregulated during the 6-month weight loss (Fold change >1), five hub genes (GPD1L, ADH1B, ADH1A, ORMDL3 and EPB41L4B) reached the statistical significance with FDR < 0.05 (Table 3).
In experiment 2 (GSE35411)37, we examined the differential gene expression of hub genes in adipose tissue from nine obese subjects during diet-induced weight loss and weight maintenance after weight loss. After multiple testing correction, GPD1L, ADH1B, ADH1A and PCCA showed significant differential expression across the time course of baseline, weight-loss and weight maintenance (FDR < 0.05) (Table 3). Experiment 3 (GSE60946) had a study design similar to experiment 2. The sample size of experiment 3 was 13538, 39, much larger than experiment 2, although it used RT-qPCR to quantify mRNA levels of only 221 genes, in which two hub genes (GPD1L and PCCA) in the Yellow module were measured. Interestingly, hub gene GPD1L and PCCA both showed significant differential expression over three time points (baseline, weight-loss and weight maintenance), with FDR = 3.27E-02 and 1.92E-4, respectively (Table 3).
In experiment 4 (GSE28005)40, we examined changes of hub genes in gene expression of adipose tissue during the early adaptive response to weight gain induced by overfeeding. We observed that the expression of all the hub genes were significantly different between lean and overweight subjects over three time points (Day 0, 14 and 56), even after correction for multiple testing (FDR < 0.05) (Table 3).
In experiment 5 (GSE25910)41, we characterized the gene expression patterns during in vitro differentiation of primary human adipocytes. Apart from ADH1A and ADH1B, the other six hub genes and gene EPB41L4B were significantly differentially expressed across a time course of early, middle and late of adipocyte development (FDR < 0.05). Especially, gene ALDH1L1 showed the most significant result even after multiple testing (FDR = 7.90E-17) (Table 3).
The most promising candidate gene for obesity in adipose tissue
We sought to characterize hub genes as a way to prioritize and identify the most promising and potential causal genes for obesity. Gene GPD1L had the highest connectivity in the Yellow module and was identified a key causal regulator in the Yellow module. It was significantly negative association with BMI and obesity. And GPD1L expression was negatively correlated with miR-210, which was experimentally validated and confirmed to regulate GPD1L levels through direct interaction with the mRNA 3′ UTR. In the meta-analysis, GPD1L was identified to be significantly associated with obesity by Fisher’s method, as well as by combining effect size method (Supplementary Fig. S4). With age adjusted, GPD1L was also significantly positively associated with adiponectin, HDL and Matsuda index, negatively associated with FM, CRP, HOMA-IR, total TGs and WHR (Supplementary Table S2). Especially, the highly significant negative correlation of GPD1L with WHR and FM provided evidence for a potential role in central obesity which was highly related to insulin resistance. The negative association of GPD1L with the measure of insulin resistance HOMA-IR (β = −0.83, FDR = 1.99E-14) was shown in Supplementary Fig. S5. Moreover, GPD1L was upregulated in response to weight loss in the POUND LOST trial (experiment 1)36. Interestingly, in both experiments 2 and 3, GPD1L also showed a similarly upregulated expression pattern from baseline to weight maintenance induced by LCD (Fig. 4a and b)37,38,39. In experiment 4 (GSE28005), on the contrary, GPD1L was downregulated in response to weight gain induced by HFD overfeeding in lean and overweight subjects (Fig. 4c)40. In experiment 5 (GSE25910)41, GPD1L had significantly differential expression across a time course of early, middle and late of adipocyte development. Taken together, these findings in various metabolic measures provided strong supporting evidence for an important role for GPD1L in obesity.
Discussion
Our systems genetics approach not only identified a Yellow module most significantly associated with obesity and other metabolic traits, but also identified eight hub genes and five of them were key causal regulators in the module. We identified miRNAs that significantly negatively correlated with the expression of hub genes, providing an understanding of the role of miRNA in regulating hub genes in pathological processes. A comprehensive characterization of module connectivity and hub genes’ association with obesity provided with critical insights into the underlying mechanisms. This novel approach resulted in the discovery of GPD1L as the most important candidate causal gene for obesity that may serve as an effective target for therapeutic intervention.
Gene GPD1L on chromosome 3p22.3 encodes the protein glycerol 3-phosphate dehydrogenase 1-like (GPD1L), which catalyzes the conversion of sn-glycerol 3-phosphate to glycerone phosphate. The encoded protein was found in the cytoplasm, associated with the plasma membrane47. Although the biological function of the gene GPD1L in human adipose tissue and obesity requires further investigation, our findings and other evidence supported that GPD1L in adipose tissue may play a pivotal role in the molecular mechanism of obesity. First, several lines of evidence suggest that adipose tissue is poorly oxygenated in obese human subjects, even in the early course of HFD feeding and mouse models of obesity, indicating that adipose tissue expansion in response to chronic and excessive nutrient consumption increased oxygen consumption and created a state of relative hypoxia due to mitochondrial dysfunction48, 49. Second, in mammals the response to low oxygen conditions was mediated primarily by gene expression changes induced by the hypoxia-inducible factor (HIF) family of transcription factors. In addition, it has been reported that adipocyte-specific HIF-1α overexpression in mice led to more severe obesity with increased white adipose tissue mass50, and insulin resistance with increased adipose tissue inflammation51. Several studies showed that adipocyte-specific HIF-1α knockout mice exhibited reduced fat formation, protection from HFD-induced obesity and insulin resistance compared with similarly fed knockout mice and wild-type controls49, 52, 53. Although the mechanism of HIF-1α effects on adiposity and obesity requires further investigation, compounds that inhibit HIF-1α activity in adipose tissue might have a significant therapeutic potential in reducing obesity and insulin resistance53. Third, interestingly, Kelly et al. identified the enzyme GPD1L as a novel regulator of HIF-1α stability and a direct target of miR-210 in its 3′ UTR46. miR-210 was regarded as “master miRNA” of hypoxic response as it has been found to be upregulated by hypoxia in virtually all cell types tested to date54. Kelly showed a model of a hypoxia-induced feedback loop, which was subsequently found and confirmed by another study55. Overexpression or knockdown of GPD1L levels resulted in a decrease or increase in HIF-1α stability and transcriptional activity, respectively, by regulating proline hydroxylation of HIF-1α at proline 56446. As oxygen levels decreased (hypoxia), HIF-1α protein and transcriptional activity increased, triggering accumulation of miR-210. Increased miR-210 directly repressed GPD1L protein expression through binding to 3′UTR of GPD1L gene and further repressed prolyl hydroxylases (PHDs) activity, subsequently promoting HIF-1α protein expression46. And the overexpression of HIF-1α led to insulin resistance, adipose tissue inflammation and obesity.
Consistent with the above feedback loop activity, the present study showed that GPD1L was negatively correlated with miR-210 and consistently downregulated in obese subjects. This is possibly because in obese adipose tissue the increased stability of HIF-1α was caused by decreased GPD1L due to the accumulation of miR-210. More importantly, in the present study, GPD1L was found to be downregulated in response to weight gain induced by HFD. Based on the hypoxia-induced feedback loop, HFD induced a state of relative hypoxia, resulting in the accumulation of miR-210 and subsequently the decreased GPD1L. On the contrary, GPD1L was found to be upregulated in obese subjects in response to weight loss and maintenance induced by LCD, because during LCD the state of relative hypoxia was attenuated, resulting in the decreased miR-210 and subsequently the increased GPD1L. The increased GPD1L inhibited HIF-1α activity in adipose tissue, leading to reduction in obesity and insulin resistance. Consistently, in the present study GPD1L was found to be significantly negatively correlated with the measure of insulin resistance, HOMA-IR.
Note that the associations of the Yellow module and key driver genes with obesity observed in the discovery stage alone did not support causal relationships, as the METSIM Study is cross-sectional. However, together with the findings from four clinical studies with longitudinal weight-loss and weight-gain interventions, we were able to infer causal links among diet interventions, miR-210, gene GPD1L and weight change in the mechanism of HFD-induced obesity (Fig. 5), which was worthy of further validation by functional experiments. The present study linked the regulation of miR-210 on GPD1L, the biological importance of GPD1L in adipose tissue, obesity and insulin resistance with the hypoxia-induced feedback loop. It indicated that increased GPD1L which can inhibit HIF-1α activity in adipose tissue might have significant therapeutic potential in reducing obesity and insulin resistance.
Two hub genes we identified, ADH1A and ADH1B, were also found to be candidate genes for obesity and insulin resistance by Winnier et al. in a study of human abdominal subcutaneous adipose tissue in 75 Mexican Americans56. Winnier’s study provided strong experimental support for the potential roles for ADH1A and ADH1B in obesity, insulin resistance and T2D. Both genes had almost identical tertiary structures and enzymatic mechanisms, which metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products57. According to a microarray assay of 76 normal human tissues (http://BioGPS.org), ADH1A and ADH1B were most highly expressed in the liver and adipose tissue, respectively. One possible mechanism of ADH1A and ADH1B in adipose tissue and obesity has been proposed to explain their negative association with obesity-related traits and insulin resistance. They are involved in the reversible conversion of alcohol to fatty acids, subsequently producing acetyl-CoA via β-oxidation56. Therefore, elevated levels of ADH1A and ADH1B in human adipocytes could plausibly promote efficient metabolism of energy from alcohol, moderating its storage as fat56. Another hub gene ALDH1L1, which also plays an important role in ethanol metabolism, was found to be significantly downregulated in the obese compared with lean co-twins58. ALDH1L1 appeared to be a chief regulator of cellular metabolism as it was strongly downregulated in certain physiological and pathological conditions, while its upregulation can produce drastic antiproliferative effects59.
Other hub genes, including CCDC50, ORMDL3, PCCA and NAALAD2, have diverse cellular functions and have not been previously implicated as obesity candidate genes. However, given their relationships with obesity and metabolic traits, and their expression patterns in the five experiments of validation stage, their functional characterization and specific roles in adipocyte metabolism in obesity are worthy of further investigation.
Although gnome-wide significant SNPs in GWAS explain only a small proportion of the heritability, known as “missing heritability”60, recent studies have shown that the total variance explained by all common variants has a large proportion of the heritability for complex traits61. Much of the missing heritability that could not pass the stringent genome-wide significance threshold can be found by using more advanced integrative methods. GWAS still may provide useful information about the putative causal relationship between genetic loci and traits, as genetic loci can impose disease risk via the products of the genome, such as gene expression, miRNA and protein level. Functional mechanisms of complex traits can be inferred when the risk loci are placed in a greater context, such as implicated tissues/cell-types and inter-connected networks, as gene functions are commonly dependent on their tissue context and the disordered interplay of tissue–specific processes5. Our system genetics approach demonstrated the power of discovering more prominent driver genes influencing complex traits and diseases by integrating risk genetic loci with gene expression profiling in disease-related tissues. Our results contextualized in human abdominal subcutaneous adipose tissue discrete BMI GWAS genes, which were functionally linked via gene regulatory networks important for obesity.
Gene EPB41L4B was identified as one of the top BMI loci by the GIANT study and we were able to follow up this gene as a Yellow module gene in our systems genetics framework. It was consistently downregulated in obese subjects and had a significantly differential pattern in adipose tissue during in vitro differentiation of primary human adipocytes. Besides, it was significantly downregulated and upregulated in response to weight gain and weight loss, respectively in the validation stage. The role of EPB41L4B gene in BMI regulation deserves further investigation.
Tissue and cell-type identity lie at the core of human physiology and disease etiology. It is very crucial to understand the genetic underpinnings of complex tissues and the role of genetic variants in disease specific tissues for disease etiology, in order to develop improved diagnostics and therapeutics5. Pathologically, obesity is mainly caused by dysfunctional adipose tissue62. Adipose tissue is one of the largest tissues in the human body and essential for normal energy homeostasis. Adipose tissue plays a key role in the control of energy balance by secreting e.g. hormones, cytokines and growth factors. During the time of excessive energy intake, insulin stimulates the uptake of glucose in fat, resulting in excessive accumulation of triacylglycerol in multiple fat depots. Therefore, adipose tissue may enlarge by increasing the size (hypertrophy) and number (hyperplasia) of adipocytes. Enlarged adipocytes lead to dysregulated secretion of adipokines and increased release of free fatty acids14. Dysfunction of normally insulin-sensitive adipose tissue has been repeatedly shown to be involved in important metabolic pathologies, such as insulin resistance, T2D, obesity and CVD63. It is suggested that adipose tissue can serve as a very important in vivo model to better understand complex molecular mechanisms underlying obesity64, 65. Therefore, all gene expression datasets in the present study were from adipose tissue.
In summary, our systems genetics approach revealed important candidate causal genes and mechanisms for obesity. Our approach showed the efficiency of mining and prioritizing GWAS results in publicly available gene expression profiling in disease-related tissues. More importantly, the most promising candidate causal gene GPD1L and its role in the etiology of HFD-induced obesity were highlighted. Our findings indicated that increased GPD1L which can inhibit HIF-1α activity in adipose tissue might have a significant therapeutic potential in reducing obesity and insulin resistance. Our novel and comprehensive systems genetics study provided strongly supportive biological information for further functional validation of the mechanism underlying obesity.
References
Bener, A., Zirie, M. & Al-Rikabi, A. Genetics, obesity, and environmental risk factors associated with type 2 diabetes. Croatian medical journal 46, 302–307 (2005).
Bell, C. G., Walley, A. J. & Froguel, P. The genetics of human obesity. Nature reviews. Genetics 6, 221–234, doi:10.1038/nrg1556 (2005).
Stunkard, A. J., Foch, T. T. & Hrubec, Z. A twin study of human obesity. Jama 256, 51–54, doi:10.1001/jama.1986.03380010055024 (1986).
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206, doi:10.1038/nature14177 (2015).
Greene, C. S. et al. Understanding multicellular function and disease with human tissue-specific networks. Nature genetics 47, 569–576, doi:10.1038/ng.3259 (2015).
Zhang, B. & Horvath, S. A general framework for weighted gene co-expression network analysis. Statistical applications in genetics and molecular biology 4, Article 17, doi:10.2202/1544-6115.1128 (2005).
Civelek, M. & Lusis, A. J. Systems genetics approaches to understand complex traits. Nature reviews. Genetics 15, 34–48, doi:10.1038/nrg3575 (2014).
Makinen, V. P. et al. Integrative genomics reveals novel molecular pathways and gene networks for coronary artery disease. PLoS genetics 10, e1004502, doi:10.1371/journal.pgen.1004502 (2014).
Heneghan, H. M., Miller, N. & Kerin, M. J. Role of microRNAs in obesity and the metabolic syndrome. Obesity reviews: an official journal of the International Association for the Study of Obesity 11, 354–361, doi:10.1111/j.1467-789X.2009.00659.x (2010).
Friedman, Y., Balaga, O. & Linial, M. Working together: combinatorial regulation by microRNAs. Advances in experimental medicine and biology 774, 317–337, doi:10.1007/978-94-007-5590-1_16 (2013).
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004, doi:10.1111/j.0006-341X.1999.00997.x (1999).
Hong, M. G., Pawitan, Y., Magnusson, P. K. & Prince, J. A. Strategies and issues in the detection of pathway enrichment in genome-wide association studies. Human genetics 126, 289–301, doi:10.1007/s00439-009-0676-z (2009).
Civelek, M. et al. Genetic regulation of human adipose microRNA expression and its consequences for metabolic traits. Human molecular genetics 22, 3023–3037, doi:10.1093/hmg/ddt159 (2013).
Jung, U. J. & Choi, M. S. Obesity and its metabolic complications: the role of adipokines and the relationship between obesity, inflammation, insulin resistance, dyslipidemia and nonalcoholic fatty liver disease. International journal of molecular sciences 15, 6184–6223, doi:10.3390/ijms15046184 (2014).
Matthews, D. R. et al. Homeostasis model assessment: insulin resistance and beta-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia 28, 412–419, doi:10.1007/BF00280883 (1985).
Ritchie, M. E. et al. A comparison of background correction methods for two-colour microarrays. Bioinformatics 23, 2700–2707, doi:10.1093/bioinformatics/btm412 (2007).
Smyth, G. K. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Statistical applications in genetics and molecular biology 3, Article 3, doi:10.2202/1544-6115.1027 (2004).
Troyanskaya, O. et al. Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525, doi:10.1093/bioinformatics/17.6.520 (2001).
Hagg, S. et al. Gene-based meta-analysis of genome-wide association studies implicates new loci involved in obesity. Human molecular genetics 24, 6849–6860, doi:10.1093/hmg/ddv379 (2015).
Langfelder, P., Zhang, B. & Horvath, S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics 24, 719–720, doi:10.1093/bioinformatics/btm563 (2008).
Ghazalpour, A. et al. Integrating genetic and network analysis to characterize genes related to mouse weight. PLoS genetics 2, e130, doi:10.1371/journal.pgen.0020130 (2006).
Gargalovic, P. S. et al. Identification of inflammatory gene modules based on variations of human endothelial cell responses to oxidized lipids. Proceedings of the National Academy of Sciences of the United States of America 103, 12741–12746, doi:10.1073/pnas.0605457103 (2006).
Blake, J. A., Richardson, J. E., Bult, C. J., Kadin, J. A. & Eppig, J. T. MGD: the Mouse Genome Database. Nucleic acids research 31, 193–195, doi:10.1093/nar/gkg047 (2003).
Zhang, B., Kirov, S. & Snoddy, J. WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic acids research 33, W741–748, doi:10.1093/nar/gki475 (2005).
Kalisch, M., Mächler, M., Colombo, D., Maathuis, M. H. & Bühlmann, P. Causal Inference Using Graphical Models with the R Package pcalg. 2012 47, 26, doi:10.18637/jss.v047.i11 (2012).
Le, T. D. et al. Inferring microRNA-mRNA causal regulatory relationships from expression data. Bioinformatics 29, 765–771, doi:10.1093/bioinformatics/btt048 (2013).
Zhang, B. et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell 153, 707–720, doi:10.1016/j.cell.2013.03.030 (2013).
Huan, T. et al. Integrative network analysis reveals molecular mechanisms of blood pressure regulation. Molecular systems biology 11, 799–799, doi:10.15252/msb.20145399 (2015).
Griffiths-Jones, S., Saini, H. K., van Dongen, S. & Enright, A. J. miRBase: tools for microRNA genomics. Nucleic acids research 36, D154–158, doi:10.1093/nar/gkm952 (2008).
Langfelder, P., Luo, R., Oldham, M. C. & Horvath, S. Is my network module preserved and reproducible? PLoS computational biology 7, e1001057, doi:10.1371/journal.pcbi.1001057 (2011).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC bioinformatics 9, 559, doi:10.1186/1471-2105-9-559 (2008).
Ferreira, J. A. The Benjamini-Hochberg method in the case of discrete test statistics. The international journal of biostatistics 3, Article 11 (2007).
Choi, J. K., Yu, U., Kim, S. & Yoo, O. J. Combining multiple microarray studies and modeling interstudy variation. Bioinformatics 19(Suppl 1), i84–90, doi:10.1093/bioinformatics/btg1010 (2003).
Tseng, G. C., Ghosh, D. & Feingold, E. Comprehensive literature review and statistical considerations for microarray meta-analysis. Nucleic acids research 40, 3785–3799, doi:10.1093/nar/gkr1265 (2012).
Wang, X. et al. An R package suite for microarray meta-analysis in quality control, differentially expressed gene analysis and pathway enrichment detection. Bioinformatics 28, 2534–2536, doi:10.1093/bioinformatics/bts485 (2012).
Sacks, F. M. et al. Comparison of weight-loss diets with different compositions of fat, protein, and carbohydrates. The New England journal of medicine 360, 859–873, doi:10.1056/NEJMoa0804748 (2009).
Johansson, L. E. et al. Differential gene expression in adipose tissue from obese human subjects during weight loss and weight maintenance. Am J Clin Nutr 96, 196–207, doi:10.3945/ajcn.111.020578 (2012).
Larsen, T. M. et al. The Diet, Obesity and Genes (Diogenes) Dietary Study in eight European countries–a comprehensive design for long-term intervention. Obesity reviews: an official journal of the International Association for the Study of Obesity 11, 76–91, doi:10.1111/j.1467-789X.2009.00603.x (2010).
Montastier, E. et al. System model network for adipose tissue signatures related to weight changes in response to calorie restriction and subsequent weight maintenance. PLoS computational biology 11, e1004047, doi:10.1371/journal.pcbi.1004047 (2015).
Alligier, M. et al. Subcutaneous adipose tissue remodeling during the initial phase of weight gain induced by overfeeding in humans. The Journal of clinical endocrinology and metabolism 97, E183–192, doi:10.1210/jc.2011-2314 (2012).
Arner, E. et al. Adipose tissue microRNAs as regulators of CCL2 production in human obesity. Diabetes 61, 1986–1993, doi:10.2337/db11-1508 (2012).
Irizarry, R. A. et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249–264, doi:10.1093/biostatistics/4.2.249 (2003).
Gautier, L., Cope, L., Bolstad, B. M. & Irizarry, R. A. affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315, doi:10.1093/bioinformatics/btg405 (2004).
Van Norstrand, D. W. et al. Molecular and functional characterization of novel glycerol-3-phosphate dehydrogenase 1 like gene (GPD1-L) mutations in sudden infant death syndrome. Circulation 116, 2253–2259, doi:10.1161/CIRCULATIONAHA.107.704627 (2007).
Birley, A. J. et al. ADH single nucleotide polymorphism associations with alcohol metabolism in vivo. Human molecular genetics 18, 1533–1542, doi:10.1093/hmg/ddp060 (2009).
Kelly, T. J., Souza, A. L., Clish, C. B. & Puigserver, P. A hypoxia-induced positive feedback loop promotes hypoxia-inducible factor 1alpha stability through miR-210 suppression of glycerol-3-phosphate dehydrogenase 1-like. Molecular and cellular biology 31, 2696–2706, doi:10.1128/MCB.01242-10 (2011).
Valdivia, C. R., Ueda, K., Ackerman, M. J. & Makielski, J. C. GPD1L links redox state to cardiac excitability by PKC-dependent phosphorylation of the sodium channel SCN5A. American journal of physiology. Heart and circulatory physiology 297, H1446–1452, doi:10.1152/ajpheart.00513.2009 (2009).
Wood, I. S., de Heredia, F. P., Wang, B. & Trayhurn, P. Cellular hypoxia and adipose tissue dysfunction in obesity. The Proceedings of the Nutrition Society 68, 370–377, doi:10.1017/S0029665109990206 (2009).
Lee, Y. S. et al. Increased adipocyte O2 consumption triggers HIF-1alpha, causing inflammation and insulin resistance in obesity. Cell 157, 1339–1352, doi:10.1016/j.cell.2014.05.012 (2014).
Zhang, X. et al. Adipose tissue-specific inhibition of hypoxia-inducible factor 1{alpha} induces obesity and glucose intolerance by impeding energy expenditure in mice. The Journal of biological chemistry 285, 32869–32877, doi:10.1074/jbc.M110.135509 (2010).
Halberg, N. et al. Hypoxia-inducible factor 1alpha induces fibrosis and insulin resistance in white adipose tissue. Molecular and cellular biology 29, 4467–4483, doi:10.1128/MCB.00192-09 (2009).
Krishnan, J. et al. Dietary obesity-associated Hif1alpha activation in adipocytes restricts fatty acid oxidation and energy expenditure via suppression of the Sirt2-NAD+ system. Genes & development 26, 259–270, doi:10.1101/gad.180406.111 (2012).
Jiang, C. et al. Disruption of hypoxia-inducible factor 1 in adipocytes improves insulin sensitivity and decreases adiposity in high-fat diet-fed mice. Diabetes 60, 2484–2495, doi:10.2337/db11-0174 (2011).
Magenta, A., Greco, S., Gaetano, C. & Martelli, F. Oxidative stress and microRNAs in vascular diseases. International journal of molecular sciences 14, 17319–17346, doi:10.3390/ijms140917319 (2013).
Liu, S. C. et al. CTGF increases vascular endothelial growth factor-dependent angiogenesis in human synovial fibroblasts by increasing miR-210 expression. Cell death & disease 5, e1485, doi:10.1038/cddis.2014.453 (2014).
Winnier, D. A. et al. Transcriptomic identification of ADH1B as a novel candidate gene for obesity and insulin resistance in human adipose tissue in Mexican Americans from the Veterans Administration Genetic Epidemiology Study (VAGES). PloS one 10, e0119941, doi:10.1371/journal.pone.0119941 (2015).
Levey, D. J. The evolutionary ecology of ethanol production and alcoholism. Integrative and comparative biology 44, 284–289, doi:10.1093/icb/44.4.284 (2004).
Heinonen, S. et al. Impaired Mitochondrial Biogenesis in Adipose Tissue in Acquired Obesity. Diabetes 64, 3135–3145, doi:10.2337/db14-1937 (2015).
Hwang, P. H., Lian, L. & Zavras, A. I. Alcohol intake and folate antagonism via CYP2E1 and ALDH1: effects on oral carcinogenesis. Medical hypotheses 78, 197–202, doi:10.1016/j.mehy.2011.10.023 (2012).
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753, doi:10.1038/nature08494 (2009).
Lee, S. H., Wray, N. R., Goddard, M. E. & Visscher, P. M. Estimating missing heritability for disease from genome-wide association studies. American journal of human genetics 88, 294–305, doi:10.1016/j.ajhg.2011.02.002 (2011).
Despres, J. P. & Lemieux, I. Abdominal obesity and metabolic syndrome. Nature 444, 881–887, doi:10.1038/nature05488 (2006).
Guilherme, A., Virbasius, J. V., Puri, V. & Czech, M. P. Adipocyte dysfunctions linking obesity to insulin resistance and type 2 diabetes. Nature reviews. Molecular cell biology 9, 367–377, doi:10.1038/nrm2391 (2008).
Greenberg, A. S. & Obin, M. S. Obesity and the role of adipose tissue in inflammation and metabolism. The American journal of clinical nutrition 83, 461S–465S (2006).
Wellen, K. E. & Hotamisligil, G. S. Obesity-induced inflammatory changes in adipose tissue. The Journal of clinical investigation 112, 1785–1788, doi:10.1172/JCI20514 (2003).
Acknowledgements
This work was partially supported by grants from the National Institutes of Health [P50AR055081, R01AG026564, R01AR050496, R01AR057049, and R01AR059781], and startup funds from Tulane University and Edward G. Schlieder Endowed. The POUNDS LOST Trial was supported by grants from the National Heart, Lung, and Blood Institute (HL-071981), the Boston Obesity Nutrition Research Center (DK-46200), the National Institute of Diabetes and Digestive and Kidney Diseases (DK-091718), the National Natural Science Foundation of China (NNSFC30972453), and the Program for New Century Excellent Talents in University (NCET- 10-0420).
Author information
Authors and Affiliations
Contributions
H.H. contributed to the study concept and design; the acquisition, analysis, and interpretation of the data; the statistical analysis; and the drafting and critical revision of the manuscript. D.S., Y.Z., R.F.W., W.Z. and S.L.C. contributed to the data analysis. G.A.B., F.M.S. and L.Q. contributed to the study concept and data acquisition. H.S., W.C. and L.Q. provided comments and suggestions for the manuscript. H.H. and H.W.D. contributed to study concept and design, the critical revision of the manuscript, and administration, material support, and study supervision. All authors contributed to interpretations of data and to revisions of the manuscript. H.H. and H.W.D. are the guarantors of this work and had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Accession Codes: The Gene Expression Omnibus (GEO) accession numbers in this paper are GSE32512, GSE25401, GSE64567, GSE25910, GSE28005, GSE35411, and GSE60946. The ArrayExpress accession numbers in this paper is E-MTAB-54. The POUNDS LOST Trial (clinical trial reg. no. NCT00072995).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
He, H., Sun, D., Zeng, Y. et al. A Systems Genetics Approach Identified GPD1L and its Molecular Mechanism for Obesity in Human Adipose Tissue. Sci Rep 7, 1799 (2017). https://doi.org/10.1038/s41598-017-01517-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-017-01517-6
- Springer Nature Limited
This article is cited by
-
Unveiling genetic variants for age-related sarcopenia by conducting a genome-wide association study on Korean cohorts
Scientific Reports (2022)
-
Tissue and circulating microRNAs as biomarkers of response to obesity treatment strategies
Journal of Endocrinological Investigation (2021)
-
Harnessing Qatar Biobank to understand type 2 diabetes and obesity in adult Qataris from the First Qatar Biobank Project
Journal of Translational Medicine (2018)