GWAS for systemic sclerosis identifies multiple risk loci and highlights fibrotic and vasculopathy pathways

López-Isac, Elena; Acosta-Herrera, Marialbert; Kerick, Martin; Assassi, Shervin; Satpathy, Ansuman T.; Granja, Jeffrey; Mumbach, Maxwell R.; Beretta, Lorenzo; Simeón, Carmen P.; Carreira, Patricia; Ortego-Centeno, Norberto; Castellvi, Ivan; Bossini-Castillo, Lara; Carmona, F. David; Orozco, Gisela; Hunzelmann, Nicolas; Distler, Jörg H. W.; Franke, Andre; Lunardi, Claudio; Moroncini, Gianluca; Gabrielli, Armando; de Vries-Bouwstra, Jeska; Wijmenga, Cisca; Koeleman, Bobby P. C.; Nordin, Annika; Padyukov, Leonid; Hoffmann-Vold, Anna-Maria; Lie, Benedicte; Proudman, Susanna; Stevens, Wendy; Nikpour, Mandana; Vyse, Timothy; Herrick, Ariane L.; Worthington, Jane; Denton, Christopher P.; Allanore, Yannick; Brown, Matthew A.; Radstake, Timothy R. D. J.; Fonseca, Carmen; Chang, Howard Y.; Mayes, Maureen D.; Martin, Javier

doi:10.1038/s41467-019-12760-y

GWAS for systemic sclerosis identifies multiple risk loci and highlights fibrotic and vasculopathy pathways

Article
Open access
Published: 31 October 2019

Volume 10, article number 4955, (2019)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

GWAS for systemic sclerosis identifies multiple risk loci and highlights fibrotic and vasculopathy pathways

Download PDF

Elena López-Isac¹,
Marialbert Acosta-Herrera¹^na1,
Martin Kerick¹^na1,
Shervin Assassi²,
Ansuman T. Satpathy ORCID: orcid.org/0000-0002-5167-537X^3,4,
Jeffrey Granja^3,4,
Maxwell R. Mumbach^3,4,
Lorenzo Beretta⁵,
Carmen P. Simeón⁶,
Patricia Carreira⁷,
Norberto Ortego-Centeno⁸,
Ivan Castellvi⁹,
Lara Bossini-Castillo¹⁰,
F. David Carmona ORCID: orcid.org/0000-0002-1427-7639¹¹,
Gisela Orozco ORCID: orcid.org/0000-0002-3479-0448¹²,
Nicolas Hunzelmann¹³,
Jörg H. W. Distler¹⁴,
Andre Franke ORCID: orcid.org/0000-0003-1530-5811¹⁵,
Claudio Lunardi¹⁶,
Gianluca Moroncini¹⁷,
Armando Gabrielli¹⁷,
Jeska de Vries-Bouwstra¹⁸,
Cisca Wijmenga¹⁹,
Bobby P. C. Koeleman²⁰,
Annika Nordin²¹,
Leonid Padyukov ORCID: orcid.org/0000-0003-2950-5670²¹,
Anna-Maria Hoffmann-Vold²²,
Benedicte Lie²³,
European Scleroderma Group†,
Susanna Proudman²⁴,
Wendy Stevens²⁵,
Mandana Nikpour²⁶,
Australian Scleroderma Interest Group (ASIG),
Timothy Vyse ORCID: orcid.org/0000-0003-1123-1464²⁷,
Ariane L. Herrick^28,29,
Jane Worthington¹²,
Christopher P. Denton³⁰,
Yannick Allanore ORCID: orcid.org/0000-0002-6149-0002³¹,
Matthew A. Brown ORCID: orcid.org/0000-0003-0538-8211³²,
Timothy R. D. J. Radstake³³,
Carmen Fonseca³⁰,
Howard Y. Chang ORCID: orcid.org/0000-0002-9459-4393^3,4,
Maureen D. Mayes ORCID: orcid.org/0000-0001-5070-2535² &
…
Javier Martin¹

17k Accesses
104 Citations
35 Altmetric
1 Mention
Explore all metrics

Abstract

Systemic sclerosis (SSc) is an autoimmune disease that shows one of the highest mortality rates among rheumatic diseases. We perform a large genome-wide association study (GWAS), and meta-analysis with previous GWASs, in 26,679 individuals and identify 27 independent genome-wide associated signals, including 13 new risk loci. The novel associations nearly double the number of genome-wide hits reported for SSc thus far. We define 95% credible sets of less than 5 likely causal variants in 12 loci. Additionally, we identify specific SSc subtype-associated signals. Functional analysis of high-priority variants shows the potential function of SSc signals, with the identification of 43 robust target genes through HiChIP. Our results point towards molecular pathways potentially involved in vasculopathy and fibrosis, two main hallmarks in SSc, and highlight the spectrum of critical cell types for the disease. This work supports a better understanding of the genetic basis of SSc and provides directions for future functional experiments.

Genome-wide association study meta-analysis identifies five new loci for systemic lupus erythematosus

Article Open access 30 May 2018

Updates on genetics in systemic sclerosis

Article Open access 15 June 2021

GWAS for systemic sclerosis identifies six novel susceptibility loci including one in the Fcγ receptor region

Article Open access 31 January 2024

Introduction

Rheumatic diseases are one of the main causes of physical disability of non-mental origin in the Western world according to the World Health Organization. Rheumatic diseases have a marked impact on the quality of life of patients. Among them, systemic sclerosis (SSc) has one of the highest mortality rates¹. SSc is a chronic autoimmune disease (AD) that affects the connective tissue, with very heterogeneous clinical manifestations. The pathogenesis of the disease involves extensive fibrosis of the skin and internal organs, vascular damage, and immune imbalance, including autoantibody production^2,3. Lung involvement—both pulmonary hypertension and/or pulmonary fibrosis—is the leading cause of death⁴.

As most ADs, SSc has a complex genetic component and its etiology is poorly understood. Genome-wide association studies (GWASs) have been successful in the identification of thousands of genetic variants associated with the susceptibility of complex traits. Moreover, GWASs provide invaluable information on disease aetiopathogenesis and contribute to drug discovery and repurposing^5,6. Several GWASs of SSc have been published, which greatly contributed to the understanding of SSc pathogenesis, and pointed out to relevant pathways for the disease, such as the interferon pathway, the interleukin 12 pathway, and apoptosis^{7,8,9,10,11,12}. Nonetheless, the rate of discovery of previous studies was limited owing to the relatively small sample sizes of the study cohorts.

To continue unraveling the partially known genetic background of SSc, we perform a powerful meta-GWAS in European population that includes ~ 10,000 patients. We also hypothesize that an integrative approach combining all SSc association signals, fine-mapping, and the identification of target genes based on chromatin contacts would provide further insights into the biology of the disease.

Results

Twenty-seven signals independently associated with SSc

We performed genome-wide association analyses in 14 independent European cohorts comprising a total of 26,679 individuals (9,095 SSc patients and 17,584 healthy controls). Nine out of the 14 SSc GWAS cohorts were so far unreported, whereas 5 had been previously published^7,8 (Supplementary Data 1). After correcting for sex and the first five principal components (PCs) (Methods), we did not observe genomic inflation in any of the independent GWAS cohorts, with the exception of the Italian cohort, which remained with residual inflation (Supplementary Data 1 and Supplementary Fig. 1). Overall, the meta-analysis showed a genomic inflation factor (λ) of 1.10, with a rescaled λ₁₀₀₀ of 1.008 for an equivalent study of 1000 cases/1000 controls (Supplementary Data 1).

We undertook an inverse variance-weighted meta-analysis with a high-density genotyped and imputed SNP panel (4.72 million SNPs) to combine all independent GWASs. We considered all the SNPs that were shared by at least two data sets to avoid SNP data loss. This approach yielded 431 significantly associated SNPs (association test p value ≤ 5 × 10⁻⁸) excluding the well-known HLA region. Significant signals involved 23 genomic regions, of which 13 were new genome-wide significant loci for SSc and 10 corresponded to previously reported GWAS signals (Table 1, Fig. 1).

Table 1 Twenty-seven signals independently associated with systemic sclerosis in the meta-GWAS

Full size table

The presence of independent signals in the genomic regions that showed significant associations was investigated by stepwise conditional analysis using summary statistics from the meta-analysis (Methods). Four genomic regions—TNFSF4 (1q25.1), STAT4 (2q32.2-q32.3), DNASE1L3 (3p14.3), and IRF5-TNPO3 (7q32.1)—showed additional significant signals after conditioning on the lead SNP of each locus (conditional association test p value (Pcond) < 5 × 10⁻⁶) (Table 1, Fig. 1, Supplementary Fig. 2). Hence, a total of 27 independent signals associated with SSc were identified.

The two independent signals identified in the DNASE1L3 genomic region (3p14.3) (rs4076852, rs7355798) were intronic variants at PXK and FLNB, respectively. PXK-rs4076852 is in high linkage disequilibrium (LD) with PXK-rs2176082 (r² = 0.92), which was reported to be associated with SSc in Martin et al.¹³ However, two Immunochip studies conducted by Mayes et al.⁹ and Zochling et al.¹² showed that the primary association in this genomic region was with the nonsynonymous SNP DNASE1L3-rs35677470 (R206C), not present in our SNP panel. Mayes et al.⁹ showed that the PXK-rs2176082 association was dependent on the rs35677470 (R206C). Therefore, although we could not analyze the dependence in our GWAS data, we presumed that PXK-rs4076852 signal was also dependent on DNASE1L3-rs35677470 on the basis of previous evidence. Regarding the intronic signal in FLNB (rs7355798), we could not estimate whether it was dependent on DNASE1L3-rs35677470 or not. However, given its role in vascular injury repair¹⁴, FLNB may be an interesting SSc locus and should be the object of future research. In the case of the STAT4 genomic region (2q32.2-q32.3), we observed three independent signals, of which the third was an intronic variant at NAB1 (rs16832798). This finding—added to further functional evidence provided below—revealed NAB1 as a new SSc risk locus. We also observed that the genome-wide signal in GSDMB (17q21.1; rs883770) was independent (Pcond = 1.27 × 10⁻⁷) from the recently reported signal at GSDMA (rs3894194), which is located in the same genomic region¹⁰.

Fine-mapping of SSc-associated loci in a Bayesian framework

The identification of the causal SNPs driving the association signals remains an open question after completion of a GWAS. To address this question, Bayesian fine-mapping was performed to define 95% credible sets (the smallest set of variants that summed together at least a 95% probability of including the likely causal variant) in each of the independently associated loci (the two independent signals in IRF5-TNPO3 were excluded as fine-mapping was not feasible). To improve SNP prioritization accuracy, the probabilistic method integrated association strength with functional annotation data (Methods). Eighteen (72%) and 12 (48%) out of the 25 loci were fine-mapped to ≤10 and to <5 plausible causal variants, respectively (Table 2, Supplementary Data 2). In six loci, the 95% credible set comprised a single variant (ARHGAP31, BLK, CD247, TNIP1, CSK, STAT4-a), and for four others the credible set contained two SNPs (DGKQ, NUP85-GRB2, STAT4-b, IL12RB1). Moreover, in 64% of the credible sets, the index SNP showed the maximum posterior probability (PP_max) of being causal. The SNPs with PP_max were intergenic, intronic, or noncoding RNA intronic (ncRNA intronic) variants, although the remaining credible set SNPs involved additional SNP categories, namely: UTR3′, downstream, exonic synonymous, and exonic nonsynonymous (Table 2).

Table 2 Posterior probabilities of systemic sclerosis fine-mapped loci

Full size table

Functional annotation of SNPs from credible sets

Since most of the likely causal variants were linked to regulatory functions rather than affecting the function of proteins encoded by surrounding genes, we further explored their regulatory effects. For this purpose, we performed functional annotation of SNPs from credible sets through eQTL analysis (Methods). In addition, we explored overlap with histone marks of active promoters and active enhancers (H3K9ac, H3K4me1, and H3K27ac) of cell types relevant to the disease using data from the Roadmap Epigenomics Project¹⁵ (Supplementary Table 1) (Methods).

Supplementary Figure 3 summarizes the results of the functional characterization of credible set SNPs. When the 95% credible set was not well resolved (credible sets that contained > 15 likely causal variants), we selected the SNP with PP_max and the index SNP. In the case of IRF5-TNPO3, where the credible set was not feasible, we selected the two independent signals identified at this locus. We obtained a final reduced list of credible set SNPs containing a total of 81 variants. As it can be observed in Supplementary Fig. 3, the vast majority of the likely causal variants overlapped with promoter and enhancer histone marks in the cell types interrogated. These observations suggest that most of the genetic variations involved in the susceptibility to SSc modulate transcriptional regulatory mechanisms. In this regard, we found that 61 out of the 81 interrogated variants (75.31% of the 81 listed credible set SNPs) represent eQTLs, thus altering gene expression in different tissues and cell types (Supplementary Data 3). In fact, the credible set SNPs were significantly enriched for eQTLs in blood and non-blood tissues (odds ratio (OR) = 3.05, Fisher’s exact test P = 5.65 × 10⁻⁶; OR = 1.61, Fisher’s exact test P = 4.48 × 10⁻², respectively) (Supplementary Table 2). Many SNPs were shown to impact the expression of the closest gene (a priori candidate gene) (Supplementary Data 3). In addition, we also found genetic variants affecting the expression of a priori candidate genes and other genes. However, some SNPs only showed eQTL signals for genes other than the closest one. As an example, the SNP rs9884090—which is an intronic variant at ARHGAP31—was found to alter the expression of POGLUT1 and TIMMDC1 in several tissues (Supplementary Data 3). These results highlight that assigning association signals to the nearby gene is not always the most appropriate strategy and the functional role of certain SSc signals may expand to different target genes.

Five 95% credible sets comprised exonic nonsynonymous variants or contained SNPs in high-to-moderate LD with exonic nonsynonymous variants (r² ≥ 0.8, r² ≥ 0.6, respectively) (ARHGAP31, IRF7, GSDMB, NUP85-GRB2, and IL12RB1) (Supplementary Fig. 3, Supplementary Data 4). However, based on SIFT and PolyPhen, none of these exonic variants showed a clear consensus to be deleterious^16,17 (Supplementary Data 4).

Finally, we assessed pleiotropic effect of our signals by determining whether the likely causal SNPs were also risk factors for other diseases. The results showed extensive overlap especially with other two ADs: systemic lupus erythematosus and primary biliary cholangitis (Supplementary Data 5). These findings were consistent with previous reports that identified shared risk loci for SSc and other immune-mediated diseases^11,13.

H3K27ac HiChIP in T cells expands and refines target genes

As stated above, assigning disease-associated variants to the closest gene is not always an appropriate strategy to determine the potential mechanistic effect of association signals. With the aim of identifying the putative drivers of SSc association hits on the basis of functional evidence, we performed an analysis of experimentally derived high-resolution maps of enhancer-promoter interactions generated by H3K27ac HiChIP experiments in human CD4 + T cells¹⁸ (Methods).

HiChIP interactions were detected in 18 out of the 27 (66.67%) independently associated loci, using the SNPs with PP_max as anchor points (Table 3). Several intronic variants were linked to the target gene promoter in which they are mapped. This was the case of CD247-rs2056626, which showed a strong H3K27ac HiChIP signal to the CD247 promoter (Fig. 2). Other relevant examples of this type of interactions were found in IL12RB2 and NFKB1. The intronic variants in STAT4, rs3821236 and rs4853458, showed strong normalized HiChIP signal to STAT4 and STAT1 promoters (Fig. 2). We also observed HiChIP contacts that linked intergenic SNPs to the closest genes. For example, rs11117422, located ~40 kb downstream of IRF8 transcriptional start site, showed interactions with the promoter region of IRF8 (Fig. 2). In addition, several other enhancer-promoter interactions linked intronic and intergenic SNPs to distant genes. In total, H3K27ac HiChIP signals nominated 155 target genes from 18 SSc likely causal variants (~8 genes per SNP on average) (Table 3).

Table 3 H3K27ac HiChIP target genes and nominated target genes for the 27 systemic sclerosis association signals

Full size table

Subsequently, we further validated the functional relevance of H3K27ac HiChIP results by investigating whether the explored SNPs were eQTLs for the nominated target genes. Forty enhancer-target gene relationships showed overlap with SSc eQTL genes (eGenes) (OR = 10.1, Fisher’s exact test P = 2.92 × 10⁻¹⁹, Supplementary Table 3). Although no eQTL to IRF8, STAT4, and STAT1 signals were found, the enrichment of the HiChIP signal observed at these loci (q value < 1e-60, Methods) over a global background of distance-matched interactions, and the crucial role of these genes in the immune response, provided evidence to prioritize them as candidate genes. Remarkably, the third independent association signal observed in the STAT4 genomic region (2q32.2-q32.3)—mapped in a NAB1 intron—was linked to NAB1 promoter by our H3K27ac HiChIP analysis. The interaction was validated by an eQTL signal (Supplementary Data 3). These results supported NAB1 as a new SSc risk locus.

In total, we provided strong evidence to nominate 43 genes as robust SSc target genes in CD4 + T cells (Table 3). Interestingly, some of them pinpointed to new mechanistic insights relevant for the diseases (see Discussion).

Chromatin interaction analyses in other relevant cell types

It is noteworthy that the epigenomic profiles are cell type specific^19,20. Considering that the HiChIP analyses were performed in CD4⁺ T cells, and that the pathogenesis of SSc is not only mediated by T cells, we also explored chromatin interaction maps derived from promoter capture Hi-C experiments in additional immune cell types^21,22 (Methods). These analyses identified promoter interactions not observed in CD4⁺ T cells, which targeted new genes (Table 3, Supplementary Data 6). For example, we observed that the FLNB-intronic variant rs7355798 interacted with FLNB and PXK promoters in B cells and macrophages. Moreover, some of the interactions observed with H3K27ac HiChIP in CD4⁺ T cells were also found in other immune cell types.

The functional relevance of the observed chromatin interactions by promoter capture Hi-C analyses was also validated by eQTLs analysis. In total, these analyses nominated 25 additional target genes for SSc (Table 3).

Candidate genes prioritized by DEPICT

In addition, we also conducted gene prioritization by means of DEPICT (Data-driven Expression-Prioritized Integration for Complex Traits) (http://www.broadinstitute.org/mpg/depict/index.html), which pinpoints the most likely candidate gene/s in associated loci based on predicted gene functions²³. Significant gene prioritization p values were found in 19 of the 27 queried loci (Table 3, Supplementary Data 7). Most of the prioritized genes were previously nominated by the chromatin interaction analyses. In addition, this method nominated TNFSF4, TMEM194B, FLNB-AS1, CD80, ARHGAP31, C8orf14, TSPAN32, and MAST3.

Tissue-specific enrichment of SSc loci in epigenetic marks

The majority of credible set SNPs overlapped with epigenetic marks related to active regions (Supplementary Fig. 3). To quantify the extent of this overlap, we investigated whether SSc associations were non-randomly distributed in histone marks of active promoters (H3K9ac, H3K4me2, H3K4me3, H3K4ac), active enhancers (H3K27ac, H3K4me1, H2BK20ac), and active (or at least accessible) genes (H3K79me1, H2BK15ac) across the 127 reference epigenomes available from the Roadmap Epigenomics Consortium and the Encyclopedia of DNA Elements (ENCODE) projects^15,24. We used a nonparametric approach (GARFIELD^25,26) to compute ORs and estimate the significance of functional enrichment at various GWAS p value cutoffs (5 × 10⁻⁶, 5 × 10⁻⁷, and 5 × 10⁻⁸) (Methods).

Our results showed 363 significant enrichments (p value < 1.25 × 10⁻⁴) in 59 out of the 127 cell and tissue types analyzed (Fig. 3, Supplementary Data 8). Most of the significant enrichments were found in immune cells. SSc-associated variants displayed the most significant enrichments in H2BK15ac and H2BK20ac marks in the GM12878 lymphoblastoid cell line (OR = 37.33, enrichment p value (P_enr) = 4.84 × 10⁻¹⁴; OR = 12.79, P_enr = 2.40 × 10⁻¹⁰, respectively), followed by H3K79me1 in primary Natural Killer (NK) cells (BLD.CD56.PC) (OR = 12.23, P_enr = 5.73 × 10⁻⁰⁹) and primary T cells (BLD.CD3.PPC) (OR = 12.58, P_enr = 9.78 × 10⁻⁰⁹). The spleen also showed a strong functional enrichment in H2BK15ac (OR = 14.69, P_enr = 9.81 × 10⁻⁰⁹). There were significant enrichments of associations with SSc within H3K27ac, H3K4me1, H3K4me2, and H3K9ac marks—among others—of several CD4⁺ T cells (T helper, T regulatory, etc), CD8⁺ T cells, primary B cells, monocytes, primary neutrophils, and thymus (Fig. 3, Supplementary Data 8). Moreover, the SSc association signals showed different epigenetic enrichment patterns in non-immune cell/tissue types, such as lung, fibroblasts, chondrocytes, keratinocytes, osteoblasts, intestinal mucosa, and esophagus, among others (Fig. 3, Supplementary Data 8).

Interestingly, our large panel of reference epigenomes allowed us to identify cell type specific patterns of enrichment. This specificity was especially relevant for some tissue/cell types that showed enrichment for a single histone mark. For example, dermal fibroblast primary cells (SKIN.NHDFAD) only showed significant enrichment for H3K4me2 (OR = 7.91, P_enr = 6.54 × 10⁻⁶).

Specific patterns of associations for the main SSc subtypes

We performed GWAS stratified analyses considering the main SSc clinical subtypes (limited cutaneous SSc (lcSSc) or diffuse cutaneous SSc (dcSSc)) and autoantibody status according to the presence of anticentromere (ACA), antitopoisomerase (ATA), and anti-RNA polymerase III (ARA) autoantibodies (Supplementary Table 4) (Methods).

A total of 18 and 5 non-HLA significant signals were identified for lcSSc and dcSSc, respectively, representing 15 new genome-wide significant signals for lcSSc and 3 for dcSSc (Supplementary Data 9). Among the associations, there were loci that yielded stronger associations and larger effect size in the subtype analyses than in the global meta-analysis; all despite the reduction of the sample and, consequently, of statistical power. To assess whether the more powerful genetic signals were randomly observed, we performed 10,000 permutation analyses (Methods) and computed empirical p values (p*) taking into account the proportion of permuted genetic signals that were at least as extreme as the observed signals. As an example, DNASE1L3 genomic region (3p14.3, rs7652027) was associated with the global disease with an OR of 1.15, whereas the OR observed for the lcSSc subtype was 1.20. Permutation analysis showed significant empirical p value (p* = 1.9 × 10⁻³) for DNASE1L3-rs7652027 thus confirming a larger effect of this risk factor in lcSSc patients (Supplementary Data 9, Supplementary Fig. 4).

Remarkably, we observed two subtype-specific signals that did not show statistical significance in the global analysis. In the case of lcSSc, there was a significant association in the MERTK genomic region (2q13) (association test p value = 1.04 × 10⁻⁰⁸, OR = 1.15) that was not significant in the global meta-analysis (association test p value = 3.49 × 10⁻⁰⁵, OR = 1.09) nor in dcSSc sub-analysis (association test p value = 0.503, OR = 1.02) (Supplementary Data 9 and Supplementary Fig. 5) (Supplementary Fig. 4, p* = 1.0 × 10⁻⁴). MERTK is a tyrosine kinase member of the MER/AXL/TYRO3 receptor kinase family that is associated with multiple sclerosis²⁷ and hepatitis C-induced liver fibrosis²⁸. Moreover, the associated variant (rs3761700), affects the expression of MERTK in whole blood (p value = 1.22 × 10⁻³⁵).

Regarding dcSSc, we found an association signal in the ANKRD12 genomic region (18p11.22) (association test p value = 3.97 × 10⁻⁰⁸, OR = 1.22). This locus did not show significant associations in the global meta-analysis (association test p value = 2.30 × 10⁻⁰⁵, OR = 1.10) nor in the lcSSc subtype (association test p value = 0.034, OR = 1.06) (Supplementary Data 9 and Supplementary Fig. 5) (Supplementary Fig. 4, p* = 4.0 × 10⁻⁴). ANKRD12 encodes a member of the ankyrin repeats-containing cofactor family involved in the modulation of gene transcription. The associated variant—rs4798783—is an eQTL for TWSG1 (p value = 6.44 × 10⁻⁰⁶) in transformed fibroblasts. Interestingly, TWSG1 enhances TGF-beta signaling (which profibrotic role is well-known) in activated T lymphocytes²⁹. These findings and the specific association with dcSSc are of utmost importance, given the more aggressive and rapidly progressing fibrosis observed in this clinical subtype^2,3.

When data were stratified by patients positive for ACA, nine genome-wide significant signals were found, of which two had been previously reported (IRF5-TNPO3, and DNASE1L3) and seven were novel associations (Supplementary Data 10). Notably, the locus CDHR5-IRF7 was strongly associated with this autoantibody presentation (association test p value = 5.32 × 10⁻⁰⁹, OR = 0.73) and showed stronger effect as compared to the global meta-analysis (association test p value = 1.97 × 10⁻⁰⁸, OR = 0.80) (Supplementary Fig. 4, p* = 2.1 × 10⁻³). Regarding the ATA-positive subgroup, we replicated the previously reported association with IRF5-TNPO3 (Supplementary Data 10)³⁰. Finally, in the case of the RNA pol III-positive SSc patients, we did not observe any signal at the genome-wide significance level outside the HLA region. Nonetheless, we found suggestive associations in FAM167A-BLK, GUSBP1-CDH12, and STEAP2 (Supplementary Data 10).

Overall, our findings highlights how performing GWASs in more homogeneous group of patients can increase the success of case–control studies by improving association strengths, thus avoiding reduction of statistical power owing to phenotypic heterogeneity³¹. Moreover, consistent with previous studies, suggesting genetic differences in the susceptibility to SSc subtypes^9,32 the identification of specific patterns of association in each SSc subphenotype emphasizes the importance of classification biomarkers to predict more accurately the best therapeutic approach in each group of patients.

Drug target enrichment analysis

The advantage of using human genetic evidence in drug discovery and repurposing has been comprehensively addressed in the last years^5,6. In this line, we assessed whether any of the 78 SSc target genes identified in the present study (genes from the last three columns of Table 3) encode proteins that are drug targets in any phase of development. Seven out of the 78 genes (9%) overlapped with pharmacological active targets (CD80, BLK, TNFSF4, IL12A, DRD4, PSMD3, FDFT1) in the Open Targets Platform³³ (Supplementary Data 11). Among them, CD80 and BLK were targets of drugs for SSc in any phase of clinical trial (i.e., abatacept and dasatinib, respectively). We assessed the significance of the overlap and found that our SSc target genes were significantly enriched in pharmacological active targets for SSc (OR = 6.0; Fisher’s exact test P = 4.7 × 10⁻²) (Supplementary Table 5).

Discussion

The large cohort of SSc included in the present study allowed us to identify 13 new risk loci for the disease, almost doubling the number of genome-wide association signals reported for SSc, bringing the total number of SSc risk loci up to 28.

In the present study, some of the cohorts were genotyped using different genotyping platforms between cases and controls. To control for the potential spurious associations that this fact may lead owing to the possibility of differential imputation quality for the SNPs, we applied additional steps along with the standard QC procedures. Prior to the imputation, we carefully excluded multi-allelic, or A/T-C/G variants with MAF > 0.4 from all the data sets. After imputation, we applied an in-house Perl script that compares the genotypic frequencies between cases and controls and excluded all SNPs showing genotypic inconsistencies. In addition, manual inspection of the individual Manhattan plots from the 14 independent cohorts was performed and any suspicious false positive signal was carefully analyzed and removed, if necessary. In addition, the genome-wide significant signals identified in the present study were the results of combining the effect of the signals across several independent cohorts. Moreover, no significant heterogeneity of the ORs was observed.

Applying a statistical fine-mapping approach, we reduced associated signals to 95% credible sets of 10 likely causal SNPs or fewer for 18 loci (72%). Notably, 95% credible sets comprised a single variant in 6 loci (ARHGAP31, BLK, CD247, TNIP1, CSK, STAT4-a). In other four loci, the credible sets contained two SNPs (DGKQ, NUP85-GRB2, STAT4-b, IL12RB1). Functional annotation of likely causal variants from credible sets revealed that all variants with PP_max were intronic, intergenic, or ncRNA intronic SNPs. These observations suggest that most of the genetic variations underlying SSc susceptibility are related to transcriptional regulatory mechanisms, including mRNA processing or stability mediated by ncRNAs. Our results are in accordance with emerging evidence that suggest a role of ncRNAs in autoimmunity³⁴. Moreover, the exonic nonsynonymous variants included in the 95% credible sets or in high LD to credible set SNPs did not show clear evidence of being damaging mutations, as in the case of CDHR5-rs2740375 located close to IRF7. As an exception, it is worth mentioning that the nonsynonymous variant rs35677470 (R206C) in DNASE1L3, not present in our SNP panel, was reported to impact the DNase activity of the encoded protein in in vitro studies³⁵. This fact is consistent with the result of our statistical fine-mapping since the 95% credible set for this locus was not well resolved (27 likely causal variants comprised the credible set). Further functional studies will be needed to confirm that rs35677470 (R206C) is the actual causal variant underlying the association or whether there are secondary signals that also influence the role of this genomic region in SSc susceptibility.

As expected, the results from gene expression data (eQTLs) suggested that the functional role of certain SSc signals may expand to several target genes. This hypothesis was confirmed through the experimentally derived high-resolution maps of enhancer-promoter interactions generated by H3K27ac HiChIP in human CD4⁺ T cells. On average, HiChIP results found physical interactions for approximately eight genes per SNP across the 18 SSc likely causal variants that were mapped in the H3K27ac HiChIP analysis, consistent with previous findings for other ADs²⁰. Strong interactions were observed in relation to some SNPs. For example, CD247-rs2056626 (intronic SNP with PP_max = 0.99 in the fine-mapping) showed a strong normalized signal of HiChIP interaction to the CD247 promoter, suggesting that the SNP affects an intronic regulatory element that controls gene expression. This interaction between the SSc risk SNP and the promoter of the CD247 gene in CD4 + T-cells was also observed by the promoter capture Hi-C technique^21,36, further supporting that the SNP may be involved in the transcriptional regulation of this gene. In fact, rs2056626 is a cis-eQTL for CD247 (p value = 2.411 × 10⁻⁴⁸; FDR = 0) in whole blood.

The identification of target genes for GWAS signals is one of the most challenging questions. In the present study, aggregate analysis of chromatin interaction maps in a wide spectrum of immune cell lines and eQTLs provided strong support to nominate 68 genes as robust SSc target genes (Table 3). Interestingly, the function of some of these experimentally nominated target genes are related to relevant pathways or biological processes in SSc. For example, DDX6 encodes a RNA helicase essential for efficient miRNA-induced gene silencing. De Vries et al. demonstrated the role of DDX6 in the regulation of vascular endothelial growth factor under hypoxic conditions³⁷. Therefore, this hit may establish a link between vasculopathy and SSc unknown so far.

Other two new loci providing relevant mechanistic insights are RAB2A and GSDMB. RAB2A belongs to the Rab family, a group of membrane-bound proteins, involved in vesicular fusion and trafficking. Specifically, RAB2A has been proposed to be a key factor in autophagosome clearance³⁸, thus it is another SSc risk locus involved in autophagy apart from the previously described ATG5⁹. These results reinforce the role of autophagy in SSc pathogenesis. In regard to GSDMB, it encodes a member of the gasdermin-domain containing protein family. The functional mechanism of gasdermin proteins is not clearly understood yet. However, recent evidence demonstrated that some gasdermin-N domains—including GSDMB—play a role in the induction of pyroptosis^39,40, an inflammatory form of cell death that is crucial for the immune response. In line of these observations, our results also suggest a role of defective pyroptosis in SSc.

Enrichment analyses of SSc loci in epigenetic marks of active gene regulation showed a strong immune signature. We identified relevant cell types and tissues for disease pathogenesis. Noteworthy, primary NK cells represented one of the highest enrichment signals across almost the entire panel. Our results are consistent with previous reports linking NK cells to SSc⁴¹. In a very recent publication, Benyamine et al.⁴² reported a particular expression profile of NK cells in SSc and showed that this cell type induced endothelial activation⁴². These findings may provide a link between vascular damage and the immune imbalance in SSc.

The inclusion of a wide panel of tissue and cell types captured cell type-specific patterns of enrichment. For example, there were some cell types that showed enrichment for a single histone mark. It is important to note that these results add valuable information to design future functional studies on the basis of accurately and well-chosen cell types or tissues, thus increasing the rate of success of the experiment.

Finally, it has been demonstrated that human genetic evidence positively impacts the success rate in clinical development⁵. The drug target enrichment found in the identified SSc target genes supports that our results might also be informative in drug repurposing. As an example, the present study supports the possibility to consider ustekinumab for SSc treatment, which is a drug currently approved for related diseases, such as psoriasis, active psoriatic arthritis, and Crohn’s disease.

Methods

Study cohorts and GWAS quality control

This study included 14 independent epidemiological cohorts comprising a total of 28,179 unrelated and genome-wide genotyped individuals (9846 SSc) patients and 18,333 healthy controls), after genotyping quality control (QC) steps. In brief, nine new SSc GWAS cohorts and five previously published SSc GWAS cohorts^7,8 of European ancestry were included (Spain 1, Germany 1, The Netherlands 1, USA 1, France, Spain 2, Germany 2, The Netherlands 2, USA 2, Italy, UK, Sweden, Norway and Australia/UK) (Supplementary Table 1). SSc patients fulfilled the 1980 American College of Rheumatology classification criteria for this disease or the criteria proposed by LeRoy and Medsger for early-SSc^43,44. In addition, patients were classified as having lcSSc or dcSSc, as described in LeRoy et al.⁴⁵ Patients were also subdivided by autoantibody status according to the presence of ACA, ATA, or ARA autoantibodies. The main clinical features are shown in Supplementary Table 4. This study complied with all relevant ethical regulations. CSIC’s Ethics Committee approved the study protocol, and written informed consent was obtained in accordance with the tenets of the Declaration of Helsinki.

Genome-wide genotyping was undertaken using the arrays specified in detail in Supplementary Table 1. Stringent QC measures were applied to all GWAS data sets as follows: SNPs with call rates < 0.98; minor allele frequencies (MAFs) < 0.01; and those that deviated from Hardy-Weinberg equilibrium (HWE; p < 0.001 in both case and control subjects) were filtered out from further analysis; samples with call rates < 0.95 were removed. The presence of relatives and/or duplicates was assessed by computing identity-by-descent (IBD) estimation using PLINK⁴⁶. An individual from each pair of relatives (Pi_Hat > 0.45) or duplicates (Pi_Hat > 0.99) was removed. Additionally, duplicate/relatedness testing was also performed between different GWAS data sets with the same country origin.

PC analysis and identification of outliers

To identify ancestry outliers, ~100,000 quality-filtered independent SNPs were selected from each case-control GWAS cohort. PC analysis was performed using PLINK and GCTA64 and R-base software under GNU Public license v.2. The first ten PCs per individual were calculated and plotted. Samples showing > 4 standard deviations from the cluster centroids of each cohort were considered outliers and removed from further analyses.

The total number of individuals that remained in the final filtered data sets after this procedure was 26,679 (9095 SSc patients and 17,584 healthy controls).

Imputation

QC-filtered GWAS data sets were subjected to whole-genome genotype imputation using IMPUTE2⁴⁷ and the 1000 Genome Project Phase III (1KGPh3) data as reference panel⁴⁸. GTOOL was used to convert data sets into the file format used by IMPUTE2. SNPs that were duplicated, multi-allelic, or A/T-C/G with MAF > 0.4 were excluded. Imputation was done separately for each independent study. A probability threshold of 0.9 was set for merging genotypes using GTOOL. Imputed data sets were also QC-filtered by removing SNPs with call rates < 0.98, with MAFs < 0.01 and those that deviated from HWE (p < 0.001). In addition, singleton SNPs (which are not informative for phasing) and those that showed genotypic inconsistency between cases and controls were also excluded from analysis using an in house Perl script.

Genome-wide association analysis

Genome-wide association analyses were performed in PLINK⁴⁶ using a logistic regression model of additive effects, including sex and the five first PCs as covariates in each of the 14 independent European cohorts. Genomic inflation factor (λ) was calculated by cohort and rescaled for an equivalent study of 1000 cases and 1000 controls when necessary (λ₁₀₀₀). Quantile–quantile (Q–Q) plots were generated and plotted with an in house R script to compare genome-wide distribution of the test statistic with the expected null distribution (Supplementary Fig. 1). We conducted a fixed effects inverse variance meta-analysis in PLINK⁴⁶ to combine the ORs obtained in each independent GWAS study. Heterogeneity values (I² and Q) were calculated with PLINK⁴⁶ to evaluate possible OR heterogeneity across the 14 individuals cohorts. Novel signals of associations were defined as the genome-wide significant associations (p value ≤ 5 × 10⁻⁸) that did not overlap with previously SSc reported signals at the genome-wide significance level of association.

Stratified analysis in clinical and serological SSc subtypes

Stratification of patients according to SSc subtype (lcSSc, dcSSc) or autoantibody status (ACA, ATA, and ARA) was performed to conduct stratified genome-wide association analyses using the same procedure as for global analysis. All sub-analyses included the 14 independent cohorts, with the exception of the GWAS analysis for ARA-positive patients, which included Spain 2, USA 2, Italy, and UK cohorts according to data availability.

Permutation analysis for subphenotype hits

A number of loci exhibited stronger genetic signals in stratified analysis (lcSSc, dcSSc, ACA) as compared with SSc as a whole despite the loss of statistical power caused by smaller numbers of the subphenotypes. To investigate whether these outcomes could have occurred by chance, we randomly shuffled 10,000 times a number of cases from each cohort (while keeping controls constant) and reran association testing and subsequent meta-analysis on the reshuffled data sets. The p values were converted to z scores to generate a null distribution of this test statistic. In detail, for each subtype, the number of cases randomly selected was determined by the prevalence of the subtype observed in the present study: we selected 62.52% of cases for lcSSc, 27.75% for dcSSc, and 36.77% for ACA +. The empirical p value (p*) was calculated as the number of permuted z scores that were at least as extreme as the actual z score + 1 divided by the number of permutations + 1³¹.

Stepwise conditional analysis in SSc-associated loci

The presence of independent signals in the genomic regions with significant signals in the meta-analysis was assessed by joint conditional analysis by GCTA⁴⁹. This method uses summary-level statistics from meta-analysis and applies LD correction between SNPs estimated from a reference sample set. Conditional analysis of each associated locus was performed within a standard region of 1.5 Mb-window centered on the most associated SNP (index or lead SNP), with the exception of DNASE1L3 region, where we explored the locus to 2 Mb owing to the extent of the haplotype block. LD patterns were estimated using genotype data from the 14 individual cohorts as reference. Conditional association analysis was performed including the lead SNP as covariate. Any SNP showing a conditional association p value < 5 × 10⁻⁶ was considered as independent signal and was further included in a new round of conditional analysis. This process was repeated until no SNP with p value < 5 × 10⁻⁶ remained in any of the genomic regions explored. The observed independent signals were confirmed using PLINK⁴⁶ by dependence analysis at cohort level scans through stepwise logistic regression with adjustment for the most associated signals in each locus, followed by inverse variance weighted meta-analysis under a fixed effects model.

Fine-mapping of SSc-associated loci in a Bayesian framework

After the assessment of independent signals in significant loci from the meta-analysis, statistical fine-mapping was carried out using PAINTOR (Probabilistic Annotation INTegratOR) v3.0^50,51 searching for one causal SNP per independent associated region. PAINTOR performs probabilistic inference and computes posterior probabilities (PP) for SNPs to be casual considering the strength of association (Z score) and the LD pattern across genomic regions. The association strength was quantified using Wald statistic (“Equation (1)”) from ref. ⁵⁰, and the LD information was provided by a LD matrix containing pairwise Pearson correlations coefficients between each SNP. In addition, PAINTOR leverages functional annotation data as a prior probability to improve SNP prioritization. Finally, the method uses Bayes theorem to obtain PP for SNPs to be casual, which in turn were used to generate 95% credible sets (the smallest list of variants that jointly have a probability of including the causal variant ≥ 95%). Associated regions that contained more than one independent signal were split to obtain regions containing only one independent signal by integrating local LD information as well as the recombination rates using the online-tool LDlink (https://ldlink.nci.nih.gov/)⁵².

The selection of functional annotations for PAINTOR fine-mapping was carried out by stratified information enrichment calculations using GARFIELD^25,26 (http://www.ebi.ac.uk/birney-srv/GARFIELD/) (method explained in more detail in ‘Enrichment analysis of SSc risk loci in epigenetic marks and cell types’) with the annotation panel distributed in GARFIELD package. The purpose was to systematically select annotations relevant to SSc on the basis of functional enrichment analysis. GARFIELD tests its robustness by calculating functional enrichment for at least four significance cutoffs (p value < 1e −5/−6/−7/−8) applied to the variants. GARFIELD analysis was carried out in our genome-wide SNP panel by setting default parameters and omitting SNPs from chromosome 6 between Mb25 and Mb34. We determined a set of nine annotations to be used for fine-mapping that showed: A significant enrichment (FDR < 0.05) of GWAS SNPs for at least two out of the four significance cutoffs analyzed (p value < 1e −5/−6/−7/−8); and b) A low inter-annotation correlation as suggested by PAINTOR (median inter-annotation Pearson correlation < 0.35) (Supplementary Data 12).

Functional annotation of SNPs from credible sets

Functional characterization of the SNPs included in credible sets was performed by assessing SNP functional categories by means of wANNOVAR using default parameters⁵³. Then we explored overlap with eQTLs, epigenetic histone marks of active promoters and active enhancers (H3K9ac, H3K4me1, and H3K27ac), and the presence of exonic nonsynonymous variants in high or moderate LD (r² ≥ 0.8, r² ≥ 0.6, respectively) using HaploReg v4.1⁵⁴. For eQTL interrogation, we used blood eQTL from Westra et al.⁵⁵, the Geuvadis data set⁵⁶—which contains expression data from lymphoblastoid cell lines—and the Genotype–Tissue Expression (GTEx) project⁵⁷—which provides RNA sequencing-based eQTL for a wide range of human tissues. Overlap of SNPs with chromatin marks was interrogated in selected cell lines from the Roadmap Epigenomics Project¹⁵ (Supplementary Table 1). Cell lines were selected according to the results of the functional enrichment analysis from ‘Enrichment analysis of SSc risk loci in epigenetic marks and cell types’ for H3K9ac, H3K4me1, and H3K27ac histone marks.

Finally, we assessed pleiotropic effect of our signals by determining whether the SNPs included in the credible sets had been reported to be associated with other ADs. For this, we interrogated the new NHGRI-EBI GWAS Catalog (https://www.ebi.ac.uk/gwas/)⁵⁸ through the web tool FUMA GWAS (http://fuma.ctglab.nl/)⁵⁹.

H3K27ac HiChIP analysis in human CD4⁺ T cells

Experimentally derived high-resolution maps of enhancer-promoter interactions generated by H3K27ac HiChIP experiments²⁰ were explored to identify target genes of SSc-associated variants. HiChIP was developed by Mumbach et al.¹⁸ for the analysis of protein-directed chromosome conformation in a very efficient and sensitive way. The H3K27ac HiChIP experiments were performed by Mumbach et al. in human CD4 + T cells from healthy donors: Primary human naïve T cells (CD4⁺CD45RA⁺CD25⁻CD127^hi), regulatory T (Treg) cells (CD4⁺CD25⁺CD127^lo) and T helper 17 (Th17) cells (CD4⁺CD45RA⁻CD25⁻CD127^hiCCR6⁺CXCR5⁻)²⁰. Virtual 4 C plots were generated from dumped matrices generated with Juicebox. The Juicebox tools dump command was used to extract the chromosome of interest from the.hic file^60,61. The interaction profile of a specific 5 kb or 10 kb bin containing the anchor was then plotted in R. Replicate reproducibility was visualized with the mean profile shown as a line and the shading surrounding the mean representing the standard deviation between replicates. We explored chromatin interactions of the most likely causal variants by setting as anchor points the SNPs with maximum PPs in each of the independent associated loci. To identify the connectivity of candidate SNPs to target genes, we called interactions by manual inspection of individual SNP virtual 4 C interaction files and subset these interactions to those containing a transcription start site and SNP^18,20. Fit-Hi-C algorithm was used to identify statistically significant (q value ≤ 1e-60) distance-matched enrichment of interaction over background^18,62.

The functional relevance of the H3K27ac HiChIP findings was further validated by evaluating whether the explored SNPs were eQTLs for the HiChIP nominated target genes.

Promoter capture Hi-C analysis

Chromatin interaction maps obtained by promoter capture Hi-C experiments in a wide spectrum of immune cell types^21,22 were assessed using the web-based tool Capture Hi-C Plotter (CHiCP) (https://www.chicp.org/)⁶³. The SNPs with maximum PPs in each of the independent associated loci were used as anchor points to explore physical interactions between restriction fragments containing the variants and gene promoters.

Enrichment of SSc loci in epigenetic marks and cell types

To assess whether our SSc GWAS SNPs were not randomly distributed among functional or regulatory elements in the genome, we performed functional enrichment analysis of non-MHC SNPs using GARFIELD v2.0^25,26. This method estimates enrichment of overlap on functional information computing ORs at various GWAS p value cutoffs, and tests the significance of the enrichment under a generalized linear model. GARFIELD accounts for major sources of confounding factors by incorporating high-LD proxies (r² > 0.8), MAF, and transcription start site distance as categorical covariates in the logistic regression model. Enrichment was tested on independent SNPs after pruning of GWAS SNPs (r² > 0.1). We omitted SNPs of chromosome 6 between Mb25 and Mb34 to avoid bias.

GARFIELD provides an annotation panel that includes 1005 annotations (genetic annotations, chromatin states, histone modifications, DNase I hypersensitive sites and transcription factor binding sites in different cell lines) from ENCODE, GENCODE and Roadmap Epigenomics projects^15,24,64. Moreover, GARFIELD can be run using a custom annotation panel. The second option was selected for our enrichment analysis using annotations for 127 reference epigenomes (Supplementary Data 13) and 9 epigenetic marks (H2BK20ac, H3K27ac, H3K4me1, H3K4me2, H3K4me3, H3K9ac, H3K4ac, H3K79me1, H2BK15ac) obtained from the Roadmap Epigenomics Consortium and the Encyclopedia of DNA Elements (ENCODE) projects^15,24. Annotations used were Imputed Narrow Peaks as generated by the software Chrom-Impute⁶⁵ and obtained from https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidatedImputed/narrowPeak/. The estimated ORs were computed at various GWAS p value cutoffs (5 × 10⁻⁶, 5 × 10⁻⁷, 5 × 10⁻⁸) and the R code Garfield-Meff-Padj.R provided by GARFIELD was used to calculate an enrichment p value threshold adjusted for multiple testing (P value = 1.25 × 10⁻⁴) on the effective number of annotations (Meff = 400.4454).

Drug-target gene enrichment analysis

Target genes nominated in the present study were used to query the Open Target Platform³³ in order to assess whether any of the genes encode proteins that are drug targets in any phase of clinical trial (phase I–IV). Enrichment of overlap between SSc target genes with pharmacological active targets for the diseases was calculated by Fisher’s exact test.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Summary statistics of the meta-GWAS analyzed in the current study will be made available through the NHGRI-EBI GWAS Catalog (https://www.ebi.ac.uk/gwas/downloads/summary-statistics) (please use ‘Systemic Sclerosis’ and/or ‘Lopez-Isac/Martin’ as search terms). Individual-level genotype data are not publicly available owing to them containing information that could compromise research participant privacy or informed consent. All other data are contained in the article file and its supplementary information or available upon reasonable request to the corresponding authors. Epigenetic annotation panel used in this study were Imputed Narrow Peaks obtained from https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidatedImputed/narrowPeak/.

References

Barnes, J. & Mayes, M. D. Epidemiology of systemic sclerosis: incidence, prevalence, survival, risk factors, malignancy, and environmental triggers. Curr. Opin. Rheumatol. 24, 165–170 (2012).
Article PubMed Google Scholar
Gabrielli, A., Avvedimento, E. V. & Krieg, T. Scleroderma. N. Engl. J. Med. 360, 1989–2003 (2009).
Article CAS PubMed Google Scholar
Denton, C. P. & Khanna, D. Systemic sclerosis. Lancet 390, 1685–1699 (2017).
Article PubMed Google Scholar
Steen, V. D. & Medsger, T. A. Changes in causes of death in systemic sclerosis, 1972–2002. Ann. Rheum. Dis. 66, 940–944 (2007).
Article PubMed PubMed Central Google Scholar
Nelson, M. R. et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 47, 856–860 (2015).
Article CAS PubMed Google Scholar
Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
Article ADS CAS PubMed Google Scholar
Radstake, T. R. et al. Genome-wide association study of systemic sclerosis identifies CD247 as a new susceptibility locus. Nat. Genet. 42, 426–429 (2010).
Article CAS PubMed PubMed Central Google Scholar
Allanore, Y. et al. Genome-wide scan identifies TNIP1, PSORS1C1, and RHOB as novel risk loci for systemic sclerosis. PLoS Genet. 7, e1002091 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mayes, M. D. et al. Immunochip analysis identifies multiple susceptibility loci for systemic sclerosis. Am. J. Hum. Genet. 94, 47–61 (2014).
Article CAS PubMed PubMed Central Google Scholar
Terao, C. et al. Transethnic meta-analysis identifies GSDMA and PRDM1 as susceptibility genes to systemic sclerosis. Ann. Rheum. Dis. 76, 1150–1158 (2017).
Article CAS PubMed Google Scholar
Lopez-Isac, E. et al. Brief report: IRF4 newly identified as a common susceptibility locus for systemic sclerosis and rheumatoid arthritis in a cross-disease meta-analysis of genome-wide association studies. Arthritis Rheumatol. 68, 2338–2344 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zochling, J. et al. An Immunochip-based interrogation of scleroderma susceptibility variants identifies a novel association at DNASE1L3. Arthritis Res. Ther. 16, 438 (2014).
Article CAS PubMed PubMed Central Google Scholar
Martin, J. E. et al. A systemic sclerosis and systemic lupus erythematosus pan-meta-GWAS reveals new shared susceptibility loci. Hum. Mol. Genet. 22, 4021–4029 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zhou, X. et al. Filamin B deficiency in mice results in skeletal malformations and impaired microvascular development. Proc. Natl. Acad. Sci. U S A 104, 3919–3924 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Roadmap Epigenomics, C. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
Article CAS Google Scholar
Sim, N. L. et al. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 40, W452–W457 (2012).
Article CAS PubMed PubMed Central Google Scholar
Adzhubei, I., Jordan, D. M. & Sunyaev, S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. Chapter 7, Unit 7.20 (2013).
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods. 13, 919–922 (2016).
Article CAS PubMed PubMed Central Google Scholar
Trynka, G. et al. Chromatin marks identify critical cell types for fine mapping complex trait variants. Nat. Genet. 45, 124–130 (2013).
Article CAS PubMed Google Scholar
Mumbach, M. R. et al. Enhancer connectome in primary human cells identifies target genes of disease-associated DNA elements. Nat. Genet. 49, 1602–1612 (2017).
Article CAS PubMed PubMed Central Google Scholar
Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384 e19 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mifsud, B. et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598–606 (2015).
Article CAS PubMed Google Scholar
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Article CAS PubMed Google Scholar
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Iotchkova, V. et al. GARFIELD - GWAS Analysis of Regulatory or Functional Information Enrichment with LD correction. bioRxiv. https://doi.org/10.1101/085738 (2016).
Article Google Scholar
Iotchkova, V. et al. Discovery and refinement of genetic loci associated with cardiometabolic risk using dense imputation maps. Nat. Genet. 48, 1303–1312 (2016).
Article CAS PubMed PubMed Central Google Scholar
International Multiple Sclerosis Genetics, C. et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 476, 214–219 (2011).
Article ADS CAS Google Scholar
Patin, E. et al. Genome-wide association study identifies variants associated with progression of liver fibrosis from HCV infection. Gastroenterology 143, 1244–1252 e12 (2012).
Article CAS PubMed Google Scholar
Tzachanis, D. et al. Twisted gastrulation (Tsg) is regulated by Tob and enhances TGF-beta signaling in activated T lymphocytes. Blood 109, 2944–2952 (2007).
Article CAS PubMed PubMed Central Google Scholar
Bossini-Castillo, L., Lopez-Isac, E. & Martin, J. Immunogenetics of systemic sclerosis: defining heritability, functional variants and shared-autoimmunity pathways. J Autoimmun 64, 53–65 (2015).
Article CAS PubMed Google Scholar
Sham, P. C. & Purcell, S. M. Statistical power and significance testing in large-scale genetic studies. Nat. Rev. Genet. 15, 335–346 (2014).
Article CAS PubMed Google Scholar
Gorlova, O. et al. Identification of novel genetic markers associated with clinical phenotypes of systemic sclerosis through a genome-wide association strategy. PLoS Genet. 7, e1002178 (2011).
Article CAS PubMed PubMed Central Google Scholar
Carvalho-Silva, D. et al. Open Targets Platform: new developments and updates two years on. Nucleic Acids Res. 47, D1056–D1065 (2018).
Gupta, B. & Hawkins, R. D. Epigenomics of autoimmune diseases. Immunol Cell Biol. 93, 271–276 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ueki, M. et al. Caucasian-specific allele in non-synonymous single nucleotide polymorphisms of the gene encoding deoxyribonuclease I-like 3, potentially relevant to autoimmunity, produces an inactive enzyme. Clin. Chim. Acta. 407, 20–24 (2009).
Article CAS PubMed Google Scholar
Martin, P. et al. Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci. Nat. Commun. 6, 10069 (2015).
Article ADS CAS PubMed Google Scholar
de Vries, S. et al. Identification of DEAD-box RNA helicase 6 (DDX6) as a cellular modulator of vascular endothelial growth factor expression under hypoxia. J. Biol. Chem. 288, 5815–5827 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lorincz, P. et al. Rab2 promotes autophagic and endocytic lysosomal degradation. J. Cell Biol. 216, 1937–1947 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ding, J. et al. Pore-forming activity and structural autoinhibition of the gasdermin family. Nature 535, 111–116 (2016).
Article ADS CAS PubMed Google Scholar
Chao, K. L., Kulakova, L. & Herzberg, O. Gene polymorphism linked to increased asthma and IBD risk alters gasdermin-B structure, a sulfatide and phosphoinositide binding protein. Proc. Natl. Acad. Sci. USA 114, E1128–E1137 (2017).
Article CAS PubMed PubMed Central Google Scholar
Barranco, C. Systemic sclerosis: the future is CD56-bright. Nat. Rev. Rheumatol. 12, 624 (2016).
Article PubMed Google Scholar
Benyamine, A. et al. Natural killer cells exhibit a peculiar phenotypic profile in systemic sclerosis and are potent inducers of endothelial microparticles release. Front. Immunol. 9, 1665 (2018).
Article CAS PubMed PubMed Central Google Scholar
Anon, M. C. Preliminary criteria for the classification of systemic sclerosis (scleroderma). Subcommittee for scleroderma criteria of the American Rheumatism Association Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum. 23, 581–590 (1980).
Article Google Scholar
LeRoy, E. C. & Medsger, T. A. Criteria for the classification of early systemic sclerosis. J. Rheumatol. 28, 1573–1576 (2001).
CAS PubMed Google Scholar
LeRoy, E. C. et al. Scleroderma (systemic sclerosis): classification, subsets and pathogenesis. J. Rheumatol. 15, 202–205 (1988).
CAS PubMed Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article CAS PubMed PubMed Central Google Scholar
Genomes Project, C. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article ADS CAS Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 S1-3. (2012).
Article CAS PubMed PubMed Central Google Scholar
Kichaev, G. et al. Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics 33, 248–255 (2017).
Article CAS PubMed Google Scholar
Kichaev, G. & Pasaniuc, B. Leveraging functional-annotation data in trans-ethnic fine-mapping studies. Am. J. Hum. Genet. 97, 260–271 (2015).
Article CAS PubMed PubMed Central Google Scholar
Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557 (2015).
Article CAS PubMed PubMed Central Google Scholar
Chang, X. & Wang, K. wANNOVAR: annotating genetic variants for personal genomes via the web. J. Med. Genet. 49, 433–436 (2012).
Article PubMed Google Scholar
Ward, L. D. & Kellis, M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 40, D930–D934 (2012).
Article CAS PubMed Google Scholar
Westra, H. J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238–1243 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lappalainen, T. et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature 501, 506–511 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Consortium, G. T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article CAS Google Scholar
MacArthur, J. et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 45, D896–D901 (2017).
Article CAS PubMed Google Scholar
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS PubMed PubMed Central Google Scholar
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ay, F., Bailey, T. L. & Noble, W. S. Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 24, 999–1011 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schofield, E. C. et al. CHiCP: a web-based tool for the integrative and interactive visualization of promoter capture Hi-C datasets. Bioinformatics 32, 2511–2513 (2016).
Article CAS PubMed PubMed Central Google Scholar
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ernst, J. & Kellis, M. Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues. Nat. Biotechnol. 33, 364–376 (2015).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Sofia Vargas, Sonia García, and Gema Robledo for their excellent technical assistance and all the patients and control donors for their essential collaboration. We thank National DNA Bank Carlos III (University of Salamanca, Spain) that supplied part of the control DNA samples from Spain, WTCCC and EIRA Consortiums, and PopGen 2.0 network. This work was supported by Spanish Ministry of Economy and Competitiveness (grant ref. SAF2015-66761-P), Consejeria de Innovacion, Ciencia y Tecnologia, Junta de Andalucía (P12-BIO-1395), Ministerio de Educación, Cultura y Deporte through the program FPU, Juan de la Cierva fellowship (FJCI-2015-24028), Red de Investigación en Inflamación y Enfermadades Reumaticas (RIER) from Instituto de Salud Carlos III (RD16/0012/0013), and Scleroderma Research Foundation and NIH P50-HG007735 (to H.Y.C.). H.Y.C. is an Investigator of the Howard Hughes Medical Institute. PopGen 2.0 is supported by a grant from the German Ministry for Education and Research (01EY1103). M.D.M and S.A. are supported by grant DoD W81XWH-18-1-0423 and DoD W81XWH-16-1-0296, respectively.

Author information

These authors contributed equally: Marialbert Acosta-Herrera and Martin Kerick.

Authors and Affiliations

Institute of Parasitology and Biomedicine López-Neyra, IPBLN-CSIC, Granada, Spain
Elena López-Isac, Marialbert Acosta-Herrera, Martin Kerick & Javier Martin
The University of Texas Health Science Center–Houston, Houston, USA
Shervin Assassi & Maureen D. Mayes
Center for Personal Dynamic Regulomes, Stanford University School of Medicine, Stanford, CA, USA
Ansuman T. Satpathy, Jeffrey Granja, Maxwell R. Mumbach & Howard Y. Chang
Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
Ansuman T. Satpathy, Jeffrey Granja, Maxwell R. Mumbach & Howard Y. Chang
Referral Center for Systemic Autoimmune Diseases, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico di Milano, Milan, Italy
Lorenzo Beretta
Department of Internal Medicine, Valle de Hebrón Hospital, Barcelona, Spain
Carmen P. Simeón, Fonollosa V & A. Guillén
Department of Rheumatology, 12 de Octubre University Hospital, Madrid, Spain
Patricia Carreira
Department of Internal Medicine, San Cecilio Clinic University Hospital, Granada, Spain
Norberto Ortego-Centeno, R. Ríos & J. L. Callejas
Department of Rheumatology, Santa Creu i Sant Pau University Hospital, Barcelona, Spain
Ivan Castellvi
Wellcome Trust Sanger Institute, Hinxton, UK
Lara Bossini-Castillo
Department of Genetics and Institute of Biotechnology, University of Granada, Granada, Spain
F. David Carmona
Arthritis Research UK Centre for Genetics and Genomics, Centre for Musculoskeletal Research, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Oxford Road, Manchester, UK
Gisela Orozco & Jane Worthington
Department of Dermatology, University of Cologne, Cologne, Germany
Nicolas Hunzelmann
Department of Internal Medicine 3, Institute for Clinical Immunology, University of Erlangen-Nuremberg, Erlangen, Germany
Jörg H. W. Distler
Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
Andre Franke
Department of Medicine, Università degli Studi di Verona, Verona, Italy
Claudio Lunardi
Clinica Medica, Department of Clinical and Molecular Science, Università Politecnica delle Marche and Ospedali Riuniti, Ancona, Italy
Gianluca Moroncini & Armando Gabrielli
Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands
Jeska de Vries-Bouwstra & C. Magro
Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
Cisca Wijmenga
University Medical Center Utrecht, Utrecht, The Netherlands
Bobby P. C. Koeleman
Division of Rheumatology, Department of Medicine, Karolinska University Hospital, Karolinska Institute, Stockholm, Sweden
Annika Nordin & Leonid Padyukov
Department of Rheumatology, Oslo University Hospital, Oslo, Norway
Anna-Maria Hoffmann-Vold
Department of Medical Genetics, and the Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Benedicte Lie
Royal Adelaide Hospital and University of Adelaide, Adelaide, SA, Australia
Susanna Proudman
St. Vincent’s Hospital, Melbourne, VIC, Australia
Wendy Stevens
The University of Melbourne at St. Vincent’s Hospital, Melbourne, VIC, Australia
Mandana Nikpour
Department of Medical and Molecular Genetics, King’s College London, London, UK
Timothy Vyse
Centre for Musculoskeletal Research, The University of Manchester, Salford Royal NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
Ariane L. Herrick
NIHR Manchester Biomedical Research Centre, Manchester, UK
Ariane L. Herrick
Centre for Rheumatology, Royal Free and University College Medical School, London, United Kingdom
Christopher P. Denton & Carmen Fonseca
Department of Rheumatology A, Cochin Hospital, INSERM U1016, Paris Descartes University, Paris, France
Yannick Allanore
Institute of Health and Biomedical Innovation, Queensland University of Technology, Translational Research Institute, Princess Alexandra Hospital, Brisbane, QLD, Australia
Matthew A. Brown
Department of Rheumatology & Clinical Immunology, Laboratory of Translational Immunology, department of Immunology, University Medical Center Utrecht, Utrecht, The Netherlands
Timothy R. D. J. Radstake
Department of Internal Medicine, Virgen de las Nieves Hospital, Granada, Spain
J. A. Vargas-Hitos
Department of Rheumatology, Virgen de la Victoria Hospital, Málaga, Spain
R. García-Portales
Department of Internal Medicine, Carlos Haya Hospital, Málaga, Spain
M. T. Camps
Department of Rheumatology, Carlos Haya Hospital, Málaga, Spain
A. Fernández-Nebro
Department of Immunology, Virgen del Rocío Hospital, Sevilla, Spain
M. F. González-Escribano
Department of Internal Medicine, Virgen del Rocío Hospital, Sevilla, Spain
F. J. García-Hernández & M. J. Castillo
Department of Rheumatology, Reina Sofía/IMIBIC Hospital, Córdoba, Spain
M. A. Aguirre & I. Gómez-Gracia
Department of Rheumatology, San Carlos Clinic Hospital, Madrid, Spain
B. Fernández-Gutiérrez & L. Rodríguez-Rodríguez
Department of Rheumatology, Madrid Norte Sanchinarro Hospital, Madrid, Spain
P. García de la Peña
Department of Rheumatology, La Princesa Hospital, Madrid, Spain
E. Vicente
Department of Rheumatology, Puerta de Hierro Hospital-Majadahonda, Madrid, Spain
J. L. Andreu & M Fernández de Castro
Department of Rheumatology, Gregorio Marañón University Hospital, Madrid, Spain
F. J. López-Longo & L. Martínez
Department of Internal Medicine, Clinic Hospital, Barcelona, Spain
G. Espinosa
Department of Internal Medicine, Parc Tauli Hospital, Sabadell, Spain
C. Tolosa
Department of Rheumatology, Hospital Del Mar, Barcelona, Spain
A. Pros
Department of Internal Medicine, Hospital Universitari Mútua Terrasa, Barcelona, Spain
M. Rodríguez-Carballeira
Department of Rheumatology, Bellvitge University Hospital, Barcelona, Spain
F. J. Narváez
Department of Internal Medicine, Bellvitge University Hospital, Barcelona, Spain
M. Rubio-Rivas
Department of Rheumatology, Granollers Hospital, Granollers, Spain
Ortiz-Santamaría V
Department of Internal Medicine, Hospital General San Jorge, Huesca, Spain
A. B. Madroñero
Epidemiology, Genetics and Atherosclerosis Research Group on Systemic Inflammatory Diseases, DIVAL, University of Cantabria, Santander, Spain
M. A. González-Gay
Department of Internal Medicine, Hospital Central de Asturias, Oviedo, Spain
B. Díaz & L. Trapiella
Infectious Diseases Unit, Department of Internal Medicine, Hospital Xeral-Complexo Hospitalario Universitario de Vigo, Vigo, Spain
A. Sousa
Department of Internal Medicine, Hospital Universitario Cruces, Barakaldo, Spain
M. V. Egurbide
Department of Internal Medicine, Hospital Virgen del Camino, Pamplona, Spain
P. Fanlo-Mateo
Department of Internal Medicine, Hospital Universitario Miguel Servet, Zaragoza, Spain
L. Sáez-Comet
Department of Rheumatology, Hospital Universitario de Canarias, Tenerife, Spain
F. Díaz & Hernández V
Department of Rheumatology, Hospital General Universitario de Valencia, Valencia, Spain
E. Beltrán
Department of Rheumatology, Hospital Universitari i Politecnic La Fe, Valencia, Spain
J. A. Román-Ivorra & E. Grau
Department of Rheumatology, Hospital Universitari Doctor Peset, Valencia, Spain
J. J. Alegre-Sancho
Department of Internal Medicine, Thrombosis and Vasculitis Unit, Complexo Hospitalario Universitario de Vigo, Vigo, Spain
M. Freire
Department of Rheumatology, INIBIC-Hospital Universitario A Coruña, La Coruña, Spain
F. J. Blanco-García & N. Oreiro
Department of Clinical Immunology, Hannover Medical School, Hannover, Germany
T. Witte
Department of Dermatology, Josefs-Hospital, Ruhr University Bochum, Bochum, Germany
A. Kreuter
Clinic of Rheumatology, University of Lübeck, Lübeck, Germany
G. Riemekasten
Service of Rheumatology and Clinic Immunology Spedali Civili, Brescia, Italy
P. Airó
Department of Rheumatology, VU University Medical Center, Amsterdam, The Netherlands
A. E. Voskuyl
Department of Rheumatology, Radboud University Nijmegen Medical Center, Nijmegen, Netherlands
M. C. Vonk
Department of Rheumatology, Lund University, Lund, Sweden
R. Hesselstrand
Menzies Research Institute Tasmania, University of Tasmania, Hobart, TAS, Australia
J. Zochling
Department Rheumatology, Monash Medical Centre, Melbourne, VIC, Australia
J. Sahhar
Rheumatology, Royal Perth Hospital, Perth, WA, Australia
J. Roddy
Research Unit, Sunshine Coast Rheumatology, Maroochydore, QLD, Australia
P. Nash
Canberra Rheumatology, Canberra, ACT, Australia
K. Tymms
Department Rheumatology, The Queen Elizabeth Hospital, Woodville, SA, Australia
M. Rischmueller & S. Lester

Authors

Elena López-Isac
View author publications
You can also search for this author in PubMed Google Scholar
Marialbert Acosta-Herrera
View author publications
You can also search for this author in PubMed Google Scholar
Martin Kerick
View author publications
You can also search for this author in PubMed Google Scholar
Shervin Assassi
View author publications
You can also search for this author in PubMed Google Scholar
Ansuman T. Satpathy
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Granja
View author publications
You can also search for this author in PubMed Google Scholar
Maxwell R. Mumbach
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Beretta
View author publications
You can also search for this author in PubMed Google Scholar
Carmen P. Simeón
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Carreira
View author publications
You can also search for this author in PubMed Google Scholar
Norberto Ortego-Centeno
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Castellvi
View author publications
You can also search for this author in PubMed Google Scholar
Lara Bossini-Castillo
View author publications
You can also search for this author in PubMed Google Scholar
F. David Carmona
View author publications
You can also search for this author in PubMed Google Scholar
Gisela Orozco
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Hunzelmann
View author publications
You can also search for this author in PubMed Google Scholar
Jörg H. W. Distler
View author publications
You can also search for this author in PubMed Google Scholar
Andre Franke
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Lunardi
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Moroncini
View author publications
You can also search for this author in PubMed Google Scholar
Armando Gabrielli
View author publications
You can also search for this author in PubMed Google Scholar
Jeska de Vries-Bouwstra
View author publications
You can also search for this author in PubMed Google Scholar
Cisca Wijmenga
View author publications
You can also search for this author in PubMed Google Scholar
Bobby P. C. Koeleman
View author publications
You can also search for this author in PubMed Google Scholar
Annika Nordin
View author publications
You can also search for this author in PubMed Google Scholar
Leonid Padyukov
View author publications
You can also search for this author in PubMed Google Scholar
Anna-Maria Hoffmann-Vold
View author publications
You can also search for this author in PubMed Google Scholar
Benedicte Lie
View author publications
You can also search for this author in PubMed Google Scholar
Susanna Proudman
View author publications
You can also search for this author in PubMed Google Scholar
Wendy Stevens
View author publications
You can also search for this author in PubMed Google Scholar
Mandana Nikpour
View author publications
You can also search for this author in PubMed Google Scholar
Timothy Vyse
View author publications
You can also search for this author in PubMed Google Scholar
Ariane L. Herrick
View author publications
You can also search for this author in PubMed Google Scholar
Jane Worthington
View author publications
You can also search for this author in PubMed Google Scholar
Christopher P. Denton
View author publications
You can also search for this author in PubMed Google Scholar
Yannick Allanore
View author publications
You can also search for this author in PubMed Google Scholar
Matthew A. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Timothy R. D. J. Radstake
View author publications
You can also search for this author in PubMed Google Scholar
Carmen Fonseca
View author publications
You can also search for this author in PubMed Google Scholar
Howard Y. Chang
View author publications
You can also search for this author in PubMed Google Scholar
Maureen D. Mayes
View author publications
You can also search for this author in PubMed Google Scholar
Javier Martin
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

European Scleroderma Group†

R. Ríos
, J. L. Callejas
, J. A. Vargas-Hitos
, R. García-Portales
, M. T. Camps
, A. Fernández-Nebro
, M. F. González-Escribano
, F. J. García-Hernández
, M. J. Castillo
, M. A. Aguirre
, I. Gómez-Gracia
, B. Fernández-Gutiérrez
, L. Rodríguez-Rodríguez
, P. García de la Peña
, E. Vicente
, J. L. Andreu
, M Fernández de Castro
, F. J. López-Longo
, L. Martínez
, Fonollosa V
, A. Guillén
, G. Espinosa
, C. Tolosa
, A. Pros
, M. Rodríguez-Carballeira
, F. J. Narváez
, M. Rubio-Rivas
, Ortiz-Santamaría V
, A. B. Madroñero
, M. A. González-Gay
, B. Díaz
, L. Trapiella
, A. Sousa
, M. V. Egurbide
, P. Fanlo-Mateo
, L. Sáez-Comet
, F. Díaz
, Hernández V
, E. Beltrán
, J. A. Román-Ivorra
, E. Grau
, J. J. Alegre-Sancho
, M. Freire
, F. J. Blanco-García
, N. Oreiro
, T. Witte
, A. Kreuter
, G. Riemekasten
, P. Airó
, C. Magro
, A. E. Voskuyl
, M. C. Vonk
& R. Hesselstrand

Australian Scleroderma Interest Group (ASIG)

J. Zochling
, J. Sahhar
, J. Roddy
, P. Nash
, K. Tymms
, M. Rischmueller
& S. Lester

Contributions

E.L.I., L.B.C., S.A., M.D.M. and J.M. contributed to the conception and study design. ELI., M.A.H., M.K., F.D.C. and G.O. contributed to data collection, QC, and imputation. A.F., C.W., T.V., Y.A., M.A.B. and T.R.D.J.R contributed to control and/or case GWAS data collection. E.L.I., M.A.H., M.K., A.T.S., J.G., M.R.M. and H.Y.C. contributed to data analysis. All co-authors made substantial contributions to data acquisition, data interpretation, and revised the work critically for important intellectual content.

Corresponding authors

Correspondence to Elena López-Isac or Javier Martin.

Ethics declarations

Competing interests

H.Y.C. is a co-founder of Accent Therapeutics and advisor to 10x Genomics and Spring Discovery. All other authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Wilson Liao and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Dataset 5

Dataset 6

Dataset 7

Dataset 8

Dataset 9

Dataset 10

Dataset 11

Dataset 12

Dataset 13

Reporting Summary

Description of Additional Supplementary Files

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

López-Isac, E., Acosta-Herrera, M., Kerick, M. et al. GWAS for systemic sclerosis identifies multiple risk loci and highlights fibrotic and vasculopathy pathways. Nat Commun 10, 4955 (2019). https://doi.org/10.1038/s41467-019-12760-y

Download citation

Received: 28 February 2019
Accepted: 30 September 2019
Published: 31 October 2019
DOI: https://doi.org/10.1038/s41467-019-12760-y
Springer Nature Limited

This article is cited by

Shared genetic architecture between autoimmune disorders and B-cell acute lymphoblastic leukemia: insights from large-scale genome-wide cross-trait analysis
- Xinghao Yu
- Yiyin Chen
- Yang Xu
BMC Medicine (2024)
A genome-wide association meta-analysis implicates Hedgehog and Notch signaling in Dupuytren’s disease
- Sophie A. Riesmeijer
- Zoha Kamali
- Ilja M. Nolte
Nature Communications (2024)
Genetically transitional disease: conceptual understanding and applicability to rheumatic disease
- Timothy B. Niewold
- Ivona Aksentijevich
- Qingping Yao
Nature Reviews Rheumatology (2024)
tauFisher predicts circadian time from a single sample of bulk and single-cell pseudobulk transcriptomic data
- Junyan Duan
- Michelle N. Ngo
- Bogi Andersen
Nature Communications (2024)
Effects of genetically predicted posttraumatic stress disorder on autoimmune phenotypes
- Adam X. Maihofer
- Andrew Ratanatharathorn
- Caroline M. Nievergelt
Translational Psychiatry (2024)

GWAS for systemic sclerosis identifies multiple risk loci and highlights fibrotic and vasculopathy pathways

Abstract

Similar content being viewed by others

Introduction

Results

Twenty-seven signals independently associated with SSc

Fine-mapping of SSc-associated loci in a Bayesian framework

Functional annotation of SNPs from credible sets

H3K27ac HiChIP in T cells expands and refines target genes

Chromatin interaction analyses in other relevant cell types

Candidate genes prioritized by DEPICT

Tissue-specific enrichment of SSc loci in epigenetic marks

Specific patterns of associations for the main SSc subtypes

Drug target enrichment analysis

Discussion

Methods

Study cohorts and GWAS quality control

PC analysis and identification of outliers

Imputation

Genome-wide association analysis

Stratified analysis in clinical and serological SSc subtypes

Permutation analysis for subphenotype hits

Stepwise conditional analysis in SSc-associated loci

Fine-mapping of SSc-associated loci in a Bayesian framework

Functional annotation of SNPs from credible sets

H3K27ac HiChIP analysis in human CD4+ T cells

Promoter capture Hi-C analysis

Enrichment of SSc loci in epigenetic marks and cell types

Drug-target gene enrichment analysis

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

European Scleroderma Group†

Australian Scleroderma Interest Group (ASIG)

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation

H3K27ac HiChIP analysis in human CD4⁺ T cells