Abstract
Endometriosis is a complex condition that is caused by both multiple genetic and environmental factors. The heritable component for endometriosis is estimated at ~50%. Here we provide a summary of genome-wide associations studies (GWAS) performed to date to identify common genetic variants for endometriosis. Largest GWAS meta-analysis conducted by the International Endogene Genomics Consortium (IEGC) included a total of 58,115 endometriosis cases and 733,480 controls and has identified 27 loci genome-wide significantly associated with endometriosis. Positionally, the lead SNPs for the identified genetic loci reside near genes that are involved in sex-steroid hormone, WNT signaling, cell adhesion/migration, cell growth/carcinogenesis, and inflammation-related pathways. Furthermore, genetic variants associated with subtypes of endometriosis are described. In particular, eight genome-wide significant signals were associated with stage III/IV disease and 21 of 27 loci had larger effect sizes in stage III/IV disease compared to stage I/II disease suggesting that particular variants may confer risk for different subtypes of endometriosis through different pathways. Together, the 27 loci explained 2.15% of variance for overall endometriosis and 3.83% for stage III/IV disease which highlight that there remain many more genetic loci to be revealed for endometriosis in larger, deeply phenotyped datasets.
Furthermore, future research needs to focus on fine-mapping of the identified loci to reveal the causal variants for each of the 27 loci and functional follow-up examining their effects on multiomics data in tissues and cells relevant to endometriosis, i.e., endometrium and its cellular components.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
1 Introduction
Endometriosis is a common complex condition that is caused by the interplay of multiple genetic and environmental factors. The genetic risk variants for the condition only present part of the disease risk, and environmental factors also a play an important role in disease pathogenesis either independently or through interaction with genetic factors [1]. The heritability that is the proportion of disease risk due to genetic factors for endometriosis has been estimated in two large twin studies [2, 3] that arrived at very similar estimates (49–51%). A separate study estimated 26% to be due to common genetic variation (DNA variants with a frequency >1% in the population) [4]. As the underlying pathology of endometriosis is not well understood, one way to explore underlying mechanisms is to investigate the genetic factors and their functions that are causal for the disease. For complex diseases such as endometriosis, the most powerful and appropriate study design to detect genetic risk factors is that of a genetic association study, in which the frequencies of variants are compared between cases and controls, similar to an epidemiological case control study in which the frequency of risk-factor exposures is compared. For situations in which a disease shows a very strong pattern of familial inheritance (e.g., “monogenic” familial breast or ovarian cancer), family-based approaches are more appropriate, which we do not cover here.
2 Discovery of Endometriosis Genetic Susceptibility Variants
In population-based study designs, genetic variants can be investigated using hypothesis-driven or hypothesis-free association methods. The hypothesis-driven approach, candidate gene association studies, relies on prior biological understanding of the condition and testing for association in these regions that are prioritized based on previous knowledge. Similar to other complex diseases, candidate gene association studies have not generally been successful in identifying robust results for endometriosis [5]. For the results to be robust, identified associations need to be replicated in an independent study in individuals of similar ancestral background. The reason for general failure of candidate gene association studies is manyfold: (1) The prior biological knowledge on the tested regions for association may not be relevant to the disease in question; (2) the coverage of common genetic variation in candidate gene regions is often limited and does not allow the testing of all potential common genetic risk variants in these regions (either directly, or indirectly through linkage disequilibrium with other variants); (3) the number of genes included in the study are often limited to a few that make up only a small part of a potentially causal underlying pathway; (4) and the sample sizes of candidate gene studies have often been insufficient to detect common genetic variants for common complex conditions. The standard approach now to identify common genetic variants for common complex conditions is a hypothesis-free method, namely the genome-wide association study (GWAS).
3 Genome-Wide Association Studies
GWAS have been very successful in the identification of common genetic variants underlying complex conditions. In a GWAS, typically at least 2000 cases and 2000 controls are genotyped at a genome-wide level using an “off the shelf” microarray containing probes that capture 100,000s of single nucleotide polymorphisms (SNPs) – single base-pair DNA variants. After extensive quality control, the genotypes of SNPs nearby that are not directly genotyped can be imputed, using a reference panel that includes a comprehensive catalogue of common genetic variants in the relevant ancestry population. Subsequently, the frequency of common SNPs is tested for differences between the case and control groups. Owing to the millions of statistical tests conducted across the genome, a stringent significance threshold needs to be adopted to reduce the number of false positive findings. The standard threshold used for genome-wide significance is p < 5 × 10−8. A detailed overview of GWAS design is given in Zondervan and Cardon [6]. All common genome-wide significant variants identified for common complex diseases and traits through GWAS are documented in the National Human Genome Research Institute (NHGRI) GWA Catalogue (www.genome.gov/GWAStudies). This catalogue demonstrates how successful the GWAS approach has been in identifying common variants underlying complex diseases and traits: To date, the catalogue includes data on 255,015 SNP-disease associations (25 April 2021).
To date, 10 GWAS in women of European and East Asian ancestry have been published for endometriosis, varying from 171 to 58,115 included cases (Table 6.1). The largest is a meta-analysis led by the International Endogene Genomics Consortium (IEGC), for which interim results were released in 2018, comprising of 15 GWAS and a replication analysis including a total of 58,115 cases and 733,480 controls [16]. An early GWAS had analyzed the effect of all SNPs combined by rASRM stage, showing a significantly higher genetic contribution to rASRM stage III/IV versus stage I/II disease (Proportion of endometriosis variation explained by common SNPs = 0.34, SD: 0.04 vs. 0.15, SD = 0.15) [9]. Therefore, subsequent GWAS meta-analyses were conducted separately for stage III/IV disease; the largest IEGC-led GWAS meta-analysis (2018) investigated association with rASRM stage III/IV disease, rASRM stage I/II disease (for the first time), and infertility-associated endometriosis subphenotypes, in addition to overall endometriosis. This study revealed 27 loci genome-wide significantly associated with endometriosis, 13 of which were novel (Table 6.2). Positionally, the lead SNPs for the identified genetic loci reside near genes that are involved in sex-steroid hormone, WNT signaling, cell adhesion/migration, cell growth/carcinogenesis, and inflammation-related pathways.
In subphenotype genome-wide association analyses, eight genome-wide significant signals were associated with stage III/IV disease and one genome-wide significant signal with infertility-associated endometriosis. Moreover, 21 of the 27 loci had larger effect sizes for stage III/IV compared to stage I/II disease (Table 6.2) suggesting that specific variants may confer risk for different subtypes of endometriosis through distinct pathways. Further studies with more detailed phenotypic data on endometriosis are needed to decipher the genetic variants that may be associated with different subtypes of the disease, and the identity of these subtypes beyond ASRM staging.
4 Conclusions and Future Work
The variance explained by the 27 loci together is 2.15% for overall endometriosis and 3.83% for rASRM stage III/IV disease [16], which shows that there are many more genetic susceptibility loci to be uncovered for endometriosis in larger, deeply phenotyped datasets. The most up-to-date findings show that genetic mechanisms underlying endometriosis implicate metabolic, reproductive, inflammatory, and pain-related pathways, although these are based on “nearest gene” assumptions (the notion that the gene nearest the risk variant is affected by the risk variant in terms of expression). Furthermore, the stronger associations observed with infertile endometriosis or stage III/IV endometriosis strengthen the fact that specific variants may confer risk for different subtypes of endometriosis through distinct pathways. Fine-mapping analyses are needed to identify the causal variants for each of the 27 loci. In particular, functional follow-up of identified variants is vitally important, examining their effects on transcriptomic, proteomic, metabolomic, and epigenomic data in tissues and cells relevant to endometriosis, i.e., endometrium and its cellular components.
As an example, WNT4/1p36.12 is a well-established locus associated with endometriosis, and the gene that sits nearest to the identified genome-wide significant variant is the WNT4 gene. However, this positional evidence is not enough to determine whether this is the gene that involved functionally in endometriosis pathology. Powell et al. investigated the gene expression profile around this 1p36.12 cytoband and identified that the endometriosis associated variant is a significant eQTL in whole blood decreasing expression of LINC00339 and increasing expression of CDC42. The eQTL for LINC00339 was also observed in endometrium tissue with same direction of effect. However, no evidence for eQTL effects of WNT4 was identified highlighting the importance and need for these functional studies to understand the disease-relevant mechanisms of the identified genetic risk variants [17].
Tissue-based molecular phenotyping data (transcriptomics, proteomics, and metabolomics) are not available for endometrium or its relevant cellular components in sufficiently large sample sizes from publicly available databases (e.g., the Genotype-Tissue Expression (GTEx) project [18, 19]). Two recent studies investigated the whole-transcriptome profiles utilizing RNA-sequencing (N = 206) and microarray-based gene expression (N = 123) in endometrium tissue and generated expression-quantitative trait loci (eQTL) maps to determine the genetic variants that regulate gene expression in endometrium tissue [20, 21]. The microarray-based and RNAseq-based eQTL maps identified variants that regulate expression of 198 and 327 unique genes, respectively. Such studies are very important to better understand the effect genetic risk variants have on gene expression in endometrium; however, similar profiling studies need to be conducted using other “omics” data (epigenomics, proteomics, and metabolomics). There is also need for collection of these tissue and cell types utilizing standardized protocols that will allow for collaboration between study centers to reach samples size needed for these functional investigations. The Endometriosis Phenome and Biobanking Harmonisation Project of the World Endometriosis Research Foundation has provided globally standardized protocols for data and sample collection in studies of endometriosis [22,23,24,25]. At the time of writing, 47 centers are using the standards for data and/or sample collection, with many 10,000s of samples already stored for research purposes in local study repositories. More large-scale integrated omics studies in deeply phenotyped patients are needed to understand the underlying causal mechanisms for endometriosis and dissect subtypes of this complex condition, leading to the discovery of novel, better targeted treatments.
References
Craig J. Complex diseases: research and applications. Nat Educ. 2008;1(1):184.
Saha R, Pettersson HJ, Svedberg P, Olovsson M, Bergqvist A, Marions L, Tornvall P, Kuja-Halkola R. Heritability of endometriosis. Fertil Steril. 2015;104:947–52.
Treloar SA, O’Connor DT, O’Connor VM, Martin NG. Genetic influences on endometriosis in an Australian twin sample. sueT@qimr.edu.au. Fertil Steril. 1999;71:701–10.
Lee SH, Harold D, Nyholt DR, Goddard ME, Zondervan KT, Williams J; Consortium, A.N., International Endogene, C., Genetic, Environmental Risk for Alzheimer’s disease, C., et al. Estimation and partitioning of polygenic variation captured by common SNPs for Alzheimer’s disease, multiple sclerosis and endometriosis. Hum Mol Genet. 2013;22:832–41.
Rahmioglu N, Missmer SA, Montgomery GW, Zondervan KT. Insights into assessing the genetics of endometriosis. Curr Obstet Gynecol Rep. 2012;1:124–37.
Zondervan KT, Cardon LR. Designing candidate gene and genome-wide case-control association studies. Nat Protoc. 2007;2:2492–501.
Adachi S, Tajima A, Quan J, Haino K, Yoshihara K, Masuzaki H, Katabuchi H, Ikuma K, Suginami H, Nishida N, et al. Meta-analysis of genome-wide association scans for genetic susceptibility to endometriosis in Japanese population. J Hum Genet. 2010;55:816–21.
Uno S, Zembutsu H, Hirasawa A, Takahashi A, Kubo M, Akahane T, Aoki D, Kamatani N, Hirata K, Nakamura Y. A genome-wide association study identifies genetic variants in the CDKN2BAS locus associated with endometriosis in Japanese. Nat Genet. 2010;42:707–10.
Painter JN, Anderson CA, Nyholt DR, Macgregor S, Lin J, Lee SH, Lambert A, Zhao ZZ, Roseman F, Guo Q, et al. Genome-wide association study identifies a locus at 7p15.2 associated with endometriosis. Nat Genet. 2011;43:51–4.
Albertsen HM, Chettier R, Farrington P, Ward K. Genome-wide association study link novel loci to endometriosis. PLoS One. 2013;8:e58257.
Nyholt DR, Low SK, Anderson CA, Painter JN, Uno S, Morris AP, MacGregor S, Gordon SD, Henders AK, Martin NG, et al. Genome-wide association meta-analysis identifies new endometriosis risk loci. Nat Genet. 2012;44:1355–9.
Steinthorsdottir V, Thorleifsson G, Aradottir K, Feenstra B, Sigurdsson A, Stefansdottir L, Kristinsdottir AM, Zink F, Halldorsson GH, Munk Nielsen N, et al. Common variants upstream of KDR encoding VEGFR2 and in TTC39B associate with endometriosis. Nat Commun. 2016;7:12350.
Sapkota Y, Steinthorsdottir V, Morris AP, Fassbender A, Rahmioglu N, De Vivo I, Buring JE, Zhang F, Edwards TL, Jones S, et al. Meta-analysis identifies five novel loci associated with endometriosis highlighting key genes involved in hormone metabolism. Nat Commun. 2017;8:15539.
Sobalska-Kwapis M, Smolarz B, Slomka M, Szaflik T, Kepka E, Kulig B, Siewierska-Gorska A, Polak G, Romanowicz H, Strapagiel D, et al. New variants near RHOJ and C2, HLA-DRA region and susceptibility to endometriosis in the Polish population-The genome-wide association study. Eur J Obstet Gynecol Reprod Biol. 2017;217:106–12.
Galarneau G, Fontanillas P, Clementi C, Hu-Seliger T, Parfitt DE, Tung JY, Yurttas-Beim P; the Celmatix Research Team; the 23andMe Research Team. Genome-wide association studies on endometriosis and endometriosis related infertility. BioRxiv. 2018.
Rahmioglu N, Banasik K, Christofidou P, Danning R, Galarneau G, Giri A, MacGregor S, Mortlock S, Sapkota Y, Schork JA, et al. Large-scale genome-wide association meta-analysis of endometriosis reveals 13 novel loci and genetically-associated comorbidity with other pain conditions. BioRxiv. 2018.
Powell JE, Fung JN, Shakhbazov K, Sapkota Y, Cloonan N, Hemani G, Hillman KM, Kaufmann S, Luong HT, Bowdler L, et al. Endometriosis risk alleles at 1p36.12 act through inverse regulation of CDC42 and LINC00339. Hum Mol Genet. 2016;25:5046–58.
Consortium, G.T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science. 2015;348:648–60.
Consortium, G.T., Laboratory, D.A., Coordinating Center-Analysis Working, G., Statistical Methods Groups-Analysis Working, G., Enhancing, G.g., Fund, N.I.H.C., NIH/NCI, NIH/NHGRI, NIH/NIMH, NIH/NIDA, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–13.
Fung JN, Mortlock S, Girling JE, Holdsworth-Carson SJ, Teh WT, Zhu Z, Lukowski SW, McKinnon BD, McRae A, Yang J, et al. Genetic regulation of disease risk and endometrial gene expression highlights potential target genes for endometriosis and polycystic ovarian syndrome. Sci Rep. 2018;8:11424.
Mortlock S, Kendarsari RI, Fung JN, Gibson G, Yang F, Restuadi R, Girling JE, Holdsworth-Carson SJ, Teh WT, Lukowski SW, et al. Tissue specific regulation of transcription in endometrium and association with disease. Hum Reprod. 2020;35:377–93.
Becker CM, Laufer MR, Stratton P, Hummelshoj L, Missmer SA, Zondervan KT, Adamson GD, Group, W.E.W. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonisation Project: I. Surgical phenotype data collection in endometriosis research. Fertil Steril. 2014;102:1213–22.
Fassbender A, Rahmioglu N, Vitonis AF, Vigano P, Giudice LC, D’Hooghe TM, Hummelshoj L, Adamson GD, Becker CM, Missmer SA, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonisation Project: IV. Tissue collection, processing, and storage in endometriosis research. Fertil Steril. 2014;102:1244–53.
Rahmioglu N, Fassbender A, Vitonis AF, Tworoger SS, Hummelshoj L, D’Hooghe TM, Adamson GD, Giudice LC, Becker CM, Zondervan KT, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonization Project: III. Fluid biospecimen collection, processing, and storage in endometriosis research. Fertil Steril. 2014;102:1233–43.
Vitonis AF, Vincent K, Rahmioglu N, Fassbender A, Buck Louis GM, Hummelshoj L, Giudice LC, Stratton P, Adamson GD, Becker CM, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonization Project: II. Clinical and covariate phenotype data collection in endometriosis research. Fertil Steril. 2014;102:1223–32.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Rahmioglu, N., Zondervan, K.T. (2022). Pathogenesis of Endometriosis: Genetics. In: Oral, E. (eds) Endometriosis and Adenomyosis. Springer, Cham. https://doi.org/10.1007/978-3-030-97236-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-97236-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97235-6
Online ISBN: 978-3-030-97236-3
eBook Packages: MedicineMedicine (R0)