Introduction

Thoracic aortic aneurysm (TAA) is an aortopathy characterized by dilation of the proximal aorta and risk of life-threatening complications such as aortic dissection and sudden cardiac death. TAA accounts for at least 10,000 deaths per year in the USA [1]. When a diagnosis of TAA is made, the ability to predict complications of disease is limited. Guidelines for surgery incorporate aortic diameter, genetic diagnosis, and family history, but clinical prediction models are imperfect [2, 3]. There is a critical need to improve methods for stratifying a patient’s risk in order to optimize clinical decisions such as indications for aortic replacement surgery (ARS), medical therapy, activity restrictions, and frequency of follow-up. Broadening the use of individual genotype data is one promising strategy to improve risk stratification.

TAA is clinically and genetically heterogeneous. TAA is associated with autosomal dominant connective tissue disorders (CTDs) including Marfan syndrome (MFS) (FBN1 mutations), Loeys-Dietz syndrome (TGFBR1/TGFBR2), and vascular type Ehlers-Danlos syndrome (COL3A1) [4,5,6]. Non-syndromic TAA is also frequently genetic and may occur as autosomal dominant familial TAA, which in 15–20% of cases is associated with mutations in genes encoding vascular smooth muscle cell proteins such as ACTA2 and MYH11 [7]. Thus, existing knowledge of the genetic basis of TAA indicates that pathogenesis is mediated by the extracellular matrix (ECM), transforming growth factor beta (TGFβ) signaling, and smooth muscle contraction.

Disease severity and the risk for complications are variable not only between individuals with different mutations in the same gene but also between relatives with the same mutation, making prognosis challenging [8, 9]. The severity of non-cardiovascular phenotypic features is also variable and may correlate with TAA severity [5, 10]. The genetic mechanisms impacting the degree of TAA severity have not been established. For this study, we hypothesized that whole exome sequencing (WES) in subjects with extreme TAA phenotypes would identify genetic variants that modify TAA severity. We tested this hypothesis by complementary analyses using family-based and case-control approaches.

Methods

Detailed methods are available in the Online Resource Methods.

Study Cohort and Classification of TAA Phenotypes

Subjects were prospectively enrolled from the Cincinnati Children’s Hospital Medical Center Cardiovascular Genetics clinic from January 2012 to December 2014. This study was approved by the local Institutional Review Board, and all subjects gave informed consent. Subjects eligible for this study had syndromic or familial TAA or carried the same mutation as a first-degree relative with confirmed TAA. Genetic rare variant analyses in this study were restricted to Caucasian subjects. Clinical data, including clinical genetic testing results, were collected by review of the electronic medical record. Subjects were defined as mutation positive if they had a pathogenic or likely pathogenic variant identified by clinical testing in one of 24 genes associated with TAA (Online Resource Table 1). Cardiovascular imaging was performed clinically and included echocardiography, magnetic resonance imaging (MRI), or computed tomography. Published nomograms were used to calculate z-scores for subjects aged 18 years or younger [11]. To divide the cohort into extreme phenotype groups, we identified two groups: severe and no/mild TAA. Severe TAA was defined by a history of aortic dissection, need for ARS according to guidelines [2], aortic diameter ≥5 cm, or z-score ≥+6. No/mild TAA was defined by a maximum aortic root or ascending aorta diameter ≤4.5 cm or z-score ≤+4 and no history of dissection or ARS. The no/mild TAA group included mutation-positive subjects without TAA (aortic diameter <4 cm or z-score ≤+2) [2, 12].
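As an illustration only, the phenotype thresholds described above can be sketched as a small classification routine. The `Subject` record, its field names, the "intermediate" and "unclassified" labels, and the choice to apply z-scores at age 18 or younger and absolute diameters otherwise are assumptions of this sketch and are not part of the study's stated methods.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Subject:
    """Hypothetical record holding the fields needed for classification."""
    age_years: float
    max_diameter_cm: Optional[float] = None  # largest root/ascending aorta diameter
    z_score: Optional[float] = None          # nomogram z-score (age <= 18)
    dissection: bool = False                 # history of aortic dissection
    ars: bool = False                        # history of, or guideline need for, ARS


def classify_taa(s: Subject) -> str:
    """Assign an extreme-phenotype group using the thresholds in the text."""
    # Dissection or ARS places a subject in the severe group regardless of size.
    if s.dissection or s.ars:
        return "severe"
    if s.age_years <= 18:
        # Assumption: z-scores are the size measure for subjects aged <= 18.
        if s.z_score is None:
            return "unclassified"
        if s.z_score >= 6:
            return "severe"
        if s.z_score <= 4:
            return "no/mild"
        return "intermediate"
    if s.max_diameter_cm is None:
        return "unclassified"
    if s.max_diameter_cm >= 5.0:
        return "severe"
    if s.max_diameter_cm <= 4.5:
        return "no/mild"
    return "intermediate"


print(classify_taa(Subject(age_years=9, z_score=6.4)))          # severe
print(classify_taa(Subject(age_years=32, max_diameter_cm=4.0)))  # no/mild
```

Subjects falling between the two thresholds are labeled "intermediate" here simply to make explicit that an extreme-phenotype design leaves a gap between groups.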

WES and Variant Inclusion Criteria

Sequencing was performed in the Cincinnati Children’s Genetic Variation and Gene Discovery Core. After quality filtering, there were in total 176,885 variant calls (average 6551 per subject). Rare variants were defined as variants with minor allele frequency <0.01 or absent in public databases from the 1000 Genomes Project and the National Heart, Lung, and Blood Institute Grand Opportunity Exome Sequencing Project. Variants were filtered for those that were predicted to result in a protein change.
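The variant inclusion criteria above amount to a two-part filter, sketched below. The consequence labels, the function names, and the interpretation that a variant must be rare or absent in both public databases are assumptions made for illustration; the actual annotation pipeline is not described in this section.

```python
from typing import Optional

MAF_THRESHOLD = 0.01  # rare: minor allele frequency strictly below 1%

# Assumed consequence labels for protein-changing variants (hypothetical naming).
PROTEIN_CHANGING = frozenset({
    "missense", "nonsense", "frameshift", "stop_loss",
    "splice_site", "initiation_codon", "inframe_indel",
})


def is_rare(maf_1000g: Optional[float], maf_esp: Optional[float]) -> bool:
    """True if the variant is absent (None) or below threshold in each database."""
    return all(m is None or m < MAF_THRESHOLD for m in (maf_1000g, maf_esp))


def keep_variant(consequence: str,
                 maf_1000g: Optional[float],
                 maf_esp: Optional[float]) -> bool:
    """Apply both inclusion criteria: rare and predicted protein-changing."""
    return consequence in PROTEIN_CHANGING and is_rare(maf_1000g, maf_esp)


# A missense variant with a control allele frequency of 0.005987 passes the filter.
print(keep_variant("missense", 0.005987, None))
```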

Family-Based Analysis for Divergent Extreme TAA Phenotypes

Pairs of first-degree relatives within the study cohort who were mutation positive by clinical genetic testing but demonstrated divergent extreme phenotypes were identified from three families. For each pedigree, the rare protein-changing variants unshared between first-degree relatives with divergent phenotypes were selected. These unshared variants and genes were then analyzed across the pedigrees. To aid interpretation of the unshared rare variants within each pedigree, genes were ranked according to functional similarity to known TAA-causing genes using the prioritization algorithm of ToppGene (http://toppgene.cchmc.org) [13]. Four bioinformatics damage prediction programs were also used to annotate missense variants. To investigate pathway-level associations, unshared gene lists for each pedigree were analyzed for functional enrichment of Gene Ontology terms and pathways using ToppGene.
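The selection of unshared variants and their comparison across pedigrees can be expressed with set operations, as in the following minimal sketch. The variant representation as a (gene, protein change) pair, the function names, and the placeholder gene labels are hypothetical and serve only to illustrate the logic.

```python
from collections import defaultdict
from typing import Dict, Set, Tuple

Variant = Tuple[str, str]  # (gene symbol, protein change); hypothetical layout


def unshared(relative_a: Set[Variant], relative_b: Set[Variant]) -> Set[Variant]:
    """Variants carried by exactly one of the two relatives."""
    return relative_a ^ relative_b  # symmetric difference


def genes_overlapping(unshared_by_pedigree: Dict[str, Set[Variant]],
                      min_pedigrees: int = 2) -> Dict[str, Set[str]]:
    """Map each gene to the pedigrees in which it carries an unshared variant,
    keeping only genes seen in at least `min_pedigrees` pedigrees."""
    pedigrees_by_gene: Dict[str, Set[str]] = defaultdict(set)
    for pedigree, variants in unshared_by_pedigree.items():
        for gene, _change in variants:
            pedigrees_by_gene[gene].add(pedigree)
    return {g: p for g, p in pedigrees_by_gene.items() if len(p) >= min_pedigrees}


# Toy data with placeholder names (not study results).
severe = {("GENE_A", "p.1"), ("GENE_S", "p.9")}
mild = {("GENE_S", "p.9"), ("GENE_C", "p.3")}
print(sorted(unshared(severe, mild)))
```

Overlap is computed at the gene level, so two pedigrees with different variants in the same gene still count as overlapping; comparing the full (gene, change) pairs instead would identify identical variants.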

Case-Control Design for Rare Variant Association Testing Between Extreme TAA Phenotype Groups

We used the optimal unified sequence kernel association test (SKAT-O) to perform burden and non-burden association tests using rare variants [14]. The two phenotype groups were coded as a dichotomous variable. Because power to test rare variants individually was limited in the study cohort, variants were collapsed (i.e., grouped) onto genetic regions. To investigate family-based findings within the overall cohort, each gene that segregated with TAA severity across two or more families was individually tested using SKAT-O in all 27 samples. The significant pathways that were identified in family-based ToppGene enrichment analyses were also tested in the overall cohort, excluding the family members in whom the enrichment was identified. These samples were excluded because SKAT-O incorporates a non-burden test, and these samples, which are by definition enriched for the tested pathways, would skew the model and limit the effects of the other samples in the cohort. In a separate exploratory global analysis, all genes containing at least two variants in the overall cohort were tested using gene-level SKAT-O. Pathway-level associations were also globally investigated by collapsing variants within Gene Ontology biological process (GO BP) terms.
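The collapsing step that precedes gene-level testing can be sketched as follows. This is not SKAT-O itself, which was run with the R package "SKAT"; it only shows how variant calls might be grouped by gene and how genes with at least two variants could be selected. The call layout, the function name, and the reading of "at least two variants" as two distinct variants are assumptions.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

# One row per rare variant call: (subject id, gene, variant id). Hypothetical layout.
Call = Tuple[str, str, str]


def collapse_by_gene(calls: Iterable[Call],
                     min_variants: int = 2) -> Dict[str, Dict[str, int]]:
    """Group calls by gene and return, for each retained gene, the number of
    rare variant calls per subject. Genes with fewer than `min_variants`
    distinct variants in the cohort are dropped."""
    variants_by_gene: Dict[str, Set[str]] = defaultdict(set)
    counts: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for subject, gene, variant in calls:
        variants_by_gene[gene].add(variant)
        counts[gene][subject] += 1
    return {gene: dict(counts[gene])
            for gene, variants in variants_by_gene.items()
            if len(variants) >= min_variants}


# Toy calls with placeholder identifiers (not study data).
calls = [
    ("s1", "GENE_A", "v1"),
    ("s2", "GENE_A", "v2"),
    ("s3", "GENE_B", "v3"),
    ("s4", "GENE_B", "v3"),
]
print(collapse_by_gene(calls))
```

The per-subject counts correspond to the input of a simple burden-style score; a kernel test such as SKAT-O would instead take the full genotype matrix for each gene.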

Statistical Analysis

Demographic information, clinical genetic data, additional cardiovascular characteristics, and overall number of autosomal rare variants were compared between TAA extreme phenotype groups with 2 × 2 Fisher’s exact test (categorical variables) or Student’s t test (continuous variables). A p value <0.05 was used to define statistical significance.
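For readers unfamiliar with the 2 × 2 Fisher's exact test, a minimal standard-library sketch follows. It uses the common two-sided definition (summing the probabilities of all tables no more likely than the observed one), which is an assumption about the variant of the test applied here. The example counts are the hypertension counts reported in the Results (1 of 15 severe and 1 of 12 no/mild subjects); the function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum tables at least as extreme; small tolerance guards float ties.
    total = sum(p for p in (prob(x) for x in range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


# Hypertension: 1 of 15 severe vs 1 of 12 no/mild subjects.
print(fisher_exact_two_sided(1, 14, 1, 11))
```

With these counts the observed table is the most probable one under the null hypothesis, so the two-sided p value is 1.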

In family-based variant segregation analysis, simulation was used to estimate the significance of observing any variant overlapping all three pedigrees. A detailed description of the simulation method is in the Online Resource Methods. For family-based functional enrichment analysis using ToppGene, a p value <0.05 after Bonferroni correction was used to define statistical significance.

SKAT-O analyses were performed using the statistical program R (version 3.2.1) and package “SKAT” (version 1.21). The uniform variant weighting scheme was used. Age and gender were included as covariates. When investigating the targeted subset of genes and pathways identified in family-based analyses, a p value <0.05 was used to define statistical significance [15]. For exome-wide gene-level SKAT-O, simulation was used to estimate significance. Details are provided in the Online Resource Methods.

Results

Genetic and Cardiovascular Features of Cohort with Extreme TAA Phenotypes

Genetics Evaluation

Demographic and clinical data for the 27 subjects in this study are shown in Table 1. Clinical genetic testing for TAA was previously performed in 26 of 27 subjects. Overall, 20 subjects were positive for TAA-causing mutations in FBN1 (n = 11), TGFBR1 (3), TGFBR2 (3), and ACTA2 (3) (Online Resource Table 2). A likely pathogenic variant in TGFB2 was incidentally identified with WES in one subject. The remaining six subjects had TAA and autosomal dominant family history of TAA but no identified disease-causing mutation through clinical testing or WES. Four of these six genotype-negative subjects were examined by a geneticist experienced in CTD evaluation while two were referred exclusively for genetic counseling. All four subjects examined were found to have signs of CTD, including one who met clinical criteria for MFS based on systemic features and TAA (Online Resource Table 3) [16]. One subject without a formal examination by a geneticist has a brother with TAA and clinically documented signs of CTD.

Table 1 Cohort description and TAA severity groups

Upon WES, there was no significant difference in the overall number of protein-changing rare variants between the severe TAA phenotype (334 ± 18 variants/subject) and no/mild TAA phenotype (328 ± 35) groups (p = 0.55). In addition to the variants in TAA genes that were interpreted as pathogenic or likely pathogenic through clinical genetic testing, 13 other rare missense variants in TAA genes that were not classified as pathogenic or likely pathogenic were identified (Table 2). Population allele frequency data, bioinformatics predictions, and previous submissions to ClinVar were used to classify these variants in accordance with the 2015 guidelines set forth by the American College of Medical Genetics and Genomics [17]. Most of these additional non-disease-causing variants in TAA genes were found in mutation-positive subjects. The MFAP5 or MYLK variant identified in a genotype-negative subject with mild TAA may be disease-causing based on strong damage predictions, as we have previously described [18].

Table 2 Heterozygous rare variants in TAA genes not classified as disease-causing within the study cohort

Extreme TAA Phenotype Groups

The severe TAA group (n = 15 subjects) included 13 with history of ARS, which was performed emergently for type A aortic dissection (4) or electively (9). The aortic diameter was documented to be at least 5.0 cm in seven of eight adults undergoing elective ARS. Pre-operative imaging data were not available for one adult, but ARS was performed soon after TAA diagnosis. The one pediatric subject undergoing elective ARS had an aortic root z-score of +9.5. The no/mild TAA group (n = 12) included six subjects with mild TAA and six who had no clinical evidence of TAA but are at risk because they are mutation positive and have at least one first-degree relative with TAA who carries the same mutation. All mutation-negative subjects had at least mild TAA. None in the mild TAA group who carried a TGFβ receptor mutation had an aortic diameter greater than 4.0 cm. Only one severe TAA subject and one no/mild TAA subject had clinical hypertension.

Family-Based Analyses of Pedigrees with Extreme TAA Phenotypes

Identification of Unshared Rare Variants Within Pedigrees

To compare subjects who have the same pathogenic mutation but clearly different TAA severity, one pair of first-degree relatives from each of three pedigrees was studied (Fig. 1). Pedigree I includes a 9-year-old boy with MFS (maternally inherited FBN1 mutation; p.Cys1806Tyr) and severe aortic root dilation (z-score = +6.4), whose mother has no evidence of TAA at age 26. This boy’s brother reportedly died secondary to a neonatal aortic dissection. Pedigree II includes a subject with a TGFBR1 mutation (p.Asn478Ser) who underwent elective ARS at age 59 years for ascending aorta diameter of 5.1 cm. In contrast, his sister carries the mutation but has no evidence of TAA at age 55. Neither has findings of Loeys-Dietz syndrome or other CTD, leading to the diagnosis of familial TAA due to the TGFBR1 mutation. Finally, pedigree III includes a subject with Loeys-Dietz syndrome (TGFBR2 mutation; p.Arg229Pro) resulting in aortic dissection at age 27, whose mutation-positive sister only has evidence of mild TAA (aortic root 4 × 3.4 × 3.3 cm on cardiac MRI) at age 32. In each of these families, the subject with severe TAA was younger or very close in age to the included relative with no/mild TAA.

Fig. 1

Three pedigrees containing first-degree relatives with divergent extreme TAA phenotypes. Red asterisks indicate subjects sequenced for family-based variant analyses. ARS aortic replacement surgery, Asc ascending aorta, BAV bicuspid aortic valve, MVA motor vehicle accident

Genes and Variants Associate with TAA Severity Across Pedigrees

For each pedigree, a list of the rare variants that were unshared between first-degree relatives was tabulated. The numbers of unshared rare variants in each pedigree were similar (Fig. 2). In each pedigree, the subject with severe TAA had more rare variants overall and more with strong bioinformatics damage predictions (Online Resource Table 4). Lists of unshared rare variants and genes were compared between pedigrees. Those overlapping across at least two pedigrees were selected for further analysis.

Fig. 2

Distribution of unshared rare variants between first-degree relatives with divergent TAA phenotypes. Cross-hatched areas represent the number of variants predicted to be damaging by four of four bioinformatics programs or frameshift, nonsense, stop loss, splice site, or initiation codon variants

In total, there were 50 genes with unshared rare variants overlapping across pedigrees. These are organized in Fig. 3 to convey variant- and gene-level annotation data, including damage predictions and functional similarity to TAA genes based on ToppGene ranking. To our knowledge, none of these genes is reported to cause TAA independently. For most genes, the rare variants were family specific. However, there were six genes (marked by asterisks in Fig. 3) for which the identical variant was identified in at least two pedigrees. These include a heterozygous variant in ADCK4 (c.187C>T, p.Arg63Trp) identified in individuals with mild TAA in all three pedigrees. This variant is predicted to affect function by all four bioinformatics programs. Based upon the reported minor allele frequency for this variant in control populations (0.005987), simulations using 5000 sequencing data sets estimated the probability of this occurring by chance as 0.009, strongly supporting the significance of this observation. The remaining five identical variants, in GATA2, KLC4, THSD7B, MCL1, and ZNF98, each overlapped at least two pedigrees and were identified in subjects with concordant TAA phenotypes (Fig. 3). None of these six variants, including the ADCK4 variant, was otherwise identified in the overall cohort.

Fig. 3

Genes and specific variants overlap across pedigrees. Variants that were unshared between first-degree relatives were collected for each pedigree. Two genes overlapped all three pedigrees, and 14 to 18 genes overlapped any two pedigrees. These included ADCK4 for which the same variant was identified in subjects with mild TAA in all three pedigrees. Five other specific variants overlapped at least two of the three pedigrees (asterisks). The predicted functional impact of variants was stratified as high (missense variants predicted damaging by all four programs or frameshift/stop gain/splice variant) or low (missense variants predicted damaging in 0–3 programs or inframe insertions or deletions) and displayed by font size. The genes are ordered based on ToppGene ranking for similarity to TAA genes, reading from top to bottom and left to right within each overlap segment

In order to further investigate the significance of the 50 overlapping genes, each was tested for association with TAA severity in the overall cohort using SKAT-O. Among these prioritized genes, COL15A1 was most significantly associated with TAA severity (p = 0.025). The COL15A1 variants (p.Phe851Leu, p.Ile1304Met) segregating with mild TAA in pedigrees are each predicted damaging by at least two prediction programs. The COL15A1 variant (p.Phe851Leu) identified in pedigree III was also identified within the overall cohort in a subject with mild TAA who carries a TGFB2 mutation. Thus, in total three subjects with mild TAA carried a rare coding variant in COL15A1.

Functional Enrichment Analysis Identified Significant Pathways in Families

Enrichment analysis of each pedigree’s list of unshared rare variants using ToppGene identified seven significant GO terms or pathways (Fig. 4). Similar to our gene-level, targeted investigation of family-based findings in the overall cohort, we tested whether the gene sets identified by enrichment analysis in families could be validated in the remainder of the cohort using SKAT-O. Interestingly, the retina homeostasis genes that were enriched in pedigree I were also significantly associated with TAA severity (p = 0.035) (Online Resource Table 5). In contrast, variants in known TAA genes were not collectively associated with TAA severity using SKAT-O (p = 0.54). Replication of a family-based finding within the overall cohort strongly supports these retina genes as candidate modifiers. Thus, our tiered approach identified significant candidate modifiers at the variant, gene, and pathway levels, as summarized in Fig. 5.

Fig. 4

Rare variants unshared between first-degree relatives were enriched for GO and pathway annotations. Pedigrees (circles) are connected to significantly enriched annotations (squares) by edges that pass through the genes contributing to the enrichment (hexagons). Genes found in severe TAA subjects are red, genes in mild TAA subjects are green, and genes in both severe and mild TAA subjects are gray. None of the enrichments overlapped multiple pedigrees, but some genes within the significant enrichments overlapped pedigrees as indicated by edges connecting a gene to more than one pedigree (e.g., COL15A1)

Fig. 5

Identifying candidate modifiers of TAA severity: summary of study design and primary results. Rare coding variants among subjects with extreme TAA phenotypes (severe vs no/mild) were analyzed using a tiered approach. First, family-based analyses were performed using a subset of the cohort in families with divergent extreme phenotypes to identify segregation with TAA severity at the level of variant, gene, and pathway. Second, case-control analyses were performed in the entire cohort for the family-based gene and pathway-level findings and data simulation for variant level findings. Strong candidate modifiers of TAA severity were identified at each level, including a variant in ADCK4 (p.Arg63Trp), the gene COL15A1, and a pathway important for retina homeostasis. SKAT-O optimal unified sequence kernel association test

Exome-Wide Case-Control Analysis of Extreme TAA Phenotype Groups

Exome-Wide SKAT-O Identifies a Candidate Modifier Gene and Pathways

In addition to the targeted investigation of family-based findings, we performed exome-wide rare variant analysis. Based on simulated data sets, the only individual gene to achieve exome-wide empirical significance was PADI3 (p = 7.5 × 10⁻⁴) (Online Resource Table 6). Simulations found 243 out of 5000 data sets (4.9%) that contained genes with p values less than 7.5 × 10⁻⁴, supporting the significance of this finding. Three mild TAA subjects carried a missense PADI3 variant (rs142129409, rs144080386, rs144944758), each predicted damaging by all four programs. After collapsing variants within annotated GO BPs, the most significant SKAT-O pathways associated with TAA severity were endosome transport, lipoprotein metabolism, apoptosis, and photoreceptor homeostasis (Table 3). Thus, exome-wide SKAT-O analyses identify additional modifier candidate genes and pathways that may indirectly support family-based findings.
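The empirical exome-wide significance reported above is the fraction of simulated data sets containing at least one gene with a p value below the observed one. A minimal sketch of that calculation follows; summarizing each simulated data set by its smallest gene-level p value, and the function name, are assumptions of this illustration, and the simulation procedure itself is described only in the Online Resource Methods.

```python
from typing import Sequence


def empirical_exome_wide_p(observed_p: float,
                           min_p_per_simulation: Sequence[float]) -> float:
    """Fraction of simulated data sets whose smallest gene-level p value
    is below the observed p value."""
    if not min_p_per_simulation:
        raise ValueError("at least one simulated data set is required")
    hits = sum(1 for p in min_p_per_simulation if p < observed_p)
    return hits / len(min_p_per_simulation)


# Reproducing the reported proportion: 243 of 5000 simulated data sets.
simulated = [1e-4] * 243 + [0.5] * 4757  # placeholder values, not study output
print(empirical_exome_wide_p(7.5e-4, simulated))
```

With 243 of 5000 data sets below the threshold, the proportion is 0.0486, which rounds to the 4.9% given in the text and falls just under the 0.05 significance level.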

Table 3 Gene Ontology biological processes most associated with TAA severity using SKAT-O

Discussion

Reduced penetrance and variable expression complicate the clinical management of syndromic and familial TAA. To test the hypothesis that genetic modifiers contribute to phenotype severity, we have studied subjects with extreme phenotypes using WES (Fig. 5). TAA provides an opportunity to study genetic modifiers because syndromic and familial TAAs are usually autosomal dominant conditions with reduced penetrance, and reproductive fitness is adequate to allow the study of family members. The identification of TAA modifiers will improve methods for predicting the risk of progression to severe TAA or dissection and identify novel targets for medical therapy. Comprehensive prediction models will facilitate precise family-specific and individual-based clinical recommendations, such as frequency of surveillance, activity restriction, medical therapy, and timing for ARS.

TAA features histopathological findings of collagen fibril dysregulation in the context of different disease-causing mutations [19,20,21]. Therefore, the collagen genes and pathways identified in this study are plausible candidate modifiers across different genetic causes of heritable TAA. Among these candidates, COL15A1 presented the strongest independent evidence of association, hypothetically related to its role in basement membrane structure or control of angiogenesis [22]. It is also notable that variants in COL3A1, COL5A1, and COL5A2 comprised 5 of the 13 variants identified in TAA genes that were not independently causative mutations (Table 2). Dysregulated TGFβ signaling is clearly associated with TAA, but, somewhat unexpectedly, annotated TGFβ signaling pathway gene sets were not associated with TAA severity in these analyses. This does not contradict established mechanisms because TGFβ signaling intersects with other signaling pathways and has diverse pleiotropic downstream effects that are incompletely defined but include ECM homeostasis. We speculate that the effects of certain heterozygous variants in collagen-related genes are subclinical in isolation but impact pathogenesis when combined with a TAA-causative mutation. This may contrast with the essential TGFβ signaling pathway that may be more likely to cause disease independently.

We observed compelling convergence between family-based findings and SKAT-O analyses for genes important for retina homeostasis. There is indirect evidence to suggest common mechanisms between ocular and aortic homeostasis. We have previously reported that (1) ocular findings may precede progression of TAA and (2) ocular findings correlate with TAA severity in pediatric patients [10, 23]. It is interesting that two top candidate genes identified in this study, ADCK4 and COL15A1, are also associated with retinal disorders [24, 25]. However, the possible association between different types of ocular abnormalities and TAA severity remains unclear. Detailed ocular phenotyping in heritable TAA is indicated and may identify clinical features useful for TAA risk prediction. Functional studies of these retina homeostasis genes (e.g., RP1L1) within the aorta are needed.

Our findings also implicate genes important for energy metabolism, oxidative stress, and apoptosis as candidate modifiers. There is prior evidence that these pathways are associated with the development of TAA in diverse genetic contexts [26,27,28,29,30,31]. The protein product of ADCK4 is AarF Domain Containing Kinase 4, which translocates to the inner mitochondrial membrane to participate in the synthesis of coenzyme Q10 and electron transport [32]. ADCK4 is associated with autosomal recessive nephrotic syndrome but is also expressed in the thoracic aorta (www.gtexportal.org) and in cultured vascular smooth muscle cells derived directly from human ascending aorta (data not shown). Tissues and cells that have high energy requirements are known to be affected by mitochondrial dysfunction. Thus, the ADCK4 variant identified in this study may mediate aortic smooth muscle function or survival through mitochondrial energy metabolism and oxidative stress pathways. Identification of apoptosis and lipoprotein metabolic pathways in our exome-wide SKAT-O analyses further supports a role for apoptosis and lipid metabolism pathways in TAA pathogenesis. Together, these findings strongly warrant further investigation.

Throughout this study, we identified candidate modifiers in subjects with severe TAA (i.e., modifiers that worsen disease) and mild TAA (i.e., modifiers that lessen disease). For instance, the variant in ADCK4 segregated with mild TAA, suggesting a protective effect. The role of protective rare variants is relatively understudied but essential for the development of novel therapies [33]. Genomic analyses and functional studies must increasingly focus on protective genetic mechanisms. It is likely that combinations of rare variants interact to affect phenotype [33]. Variants within the same pathway may have opposing biological effects. Our observation of collagen-related variants in both severe and mild TAA demonstrates this complexity. These challenges are partially addressed by existing statistical methods such as SKAT-O, which has non-burden and epistatic variant-variant interaction functions. However, there remains a need to develop novel statistical and experimental platforms to define how specific variants interact to influence phenotype. This is particularly challenging for autosomal dominant diseases and will require low- and high-throughput experimental approaches [34, 35]. Ultimately, these methods will define mechanisms by which aggregated variants correlate with specific phenotypes and clinical outcomes.

Limitations to this study include the small sample size. We aimed to optimize the power of rare variant analysis by studying subjects with extreme phenotypes and employing complementary family-based and case-control designs. Pathway-based analysis may overcome sample size limitations by reducing dimensionality while optimizing biological plausibility. However, this approach is dependent on existing knowledge and curation methods. This study cohort is genetically heterogeneous. Certain modifier genes may depend on the primary disease-causing genotype, but shared mechanisms, and therefore shared modifiers, likely exist. Our replication of family-based analysis findings in the overall cohort supports the latter. Nevertheless, validation of all identified candidates is needed in a larger cohort. Common variants and non-coding variants in regulatory regions were not studied but also may impact phenotype. For example, differential expression of mutant versus non-mutant alleles in TAA genes due to genetic variation in their regulatory regions could affect disease penetrance. The determinants of TAA severity are likely multifactorial, but the study of genetic modifiers is a first step to understanding complex multifactorial etiologies, including factors impacting possible differences in TAA severity between men and women. Because patients were recruited in a subspecialty Cardiovascular Genetics clinic, there may be ascertainment bias in the cohort. Finally, TAA is a progressive condition and therefore phenotype assignment may change as the patient ages. We limited this theoretical risk by primarily studying adult-aged subjects and by comparing extreme phenotypes, recognizing that the study of young patients is necessary to predict early disease progression.

In conclusion, we have identified strong candidates for modifying TAA severity. This study identifies candidates consistent with existing knowledge of TAA pathogenesis as well as new genes and pathways suggesting novel mechanisms. Together, these hypothesis-generating findings initiate a path toward risk stratification through genetic testing at an early stage of disease and identifying novel therapeutic targets.