Introduction

Most human traits and disorders have a complex etiology, which is influenced by multiple environmental and genetic factors. While some phenotypes follow simple patterns of Mendelian inheritance, large-scale genome-wide association studies (GWAS) conducted during the last decade have shown that most phenotypes have a complex polygenic architecture, in which genetic risk is accounted for by a large number of genetic variants, each with small effect (Visscher et al. 2017). Accumulating evidence from GWAS demonstrates that many genetic variants influence more than one phenotype, i.e., they exhibit allelic pleiotropy (Sivakumaran et al. 2011; Solovieff et al. 2013). Identification of shared genetic influences between human traits and disorders can be highly valuable to inform disease nosology, epidemiological associations, and diagnostic classification systems, improve treatment strategies, provide biological insights and uncover shared biological underpinnings (Sivakumaran et al. 2011; Solovieff et al. 2013; Visscher et al. 2017). For example, it is now evident that psychiatric disorders share a large proportion of their genetic architecture (Brainstorm et al. 2018; Cross-Disorder Group of the Psychiatric Genomics C et al. 2013), suggesting that their etiologies are not fully distinct and hence challenging existing diagnostic guidelines (Smoller et al. 2018).

GWAS typically consist of genome-wide scans of millions of common genetic variants (tag single-nucleotide polymorphisms [SNPs]), estimating the strength of their association with the phenotype of interest in massively univariate regression analyses. Given the large numbers of SNPs tested, a GWAS must correct for multiple testing and applies a genome-wide significance threshold of p < 5 × 10−8 to avoid false positive findings. The consequence is that only a subset of all involved genetic variants is revealed (i.e., many false negative findings) with a large fraction of the polygenic architecture remaining to be uncovered. This phenomenon was previously labeled “the missing heritability” (Manolio et al. 2009). With increasing GWAS sample sizes, statistical power has improved and more genetic variants have been uncovered (Visscher et al. 2017). However, despite the assembly of very large GWAS samples, often involving hundreds of thousands of participants, most of the polygenic architecture underlying complex human phenotypes remain undetected (Holland et al. 2019). The number of participants needed for a GWAS to fully uncover all genetic variants influencing a given phenotype depends on the unique polygenic architecture underlying that phenotype, which is determined by the number of causal variants involved and the distribution of effect sizes (Holland et al. 2019). For example, it has been estimated that to uncover most of the genetic variants influencing the complex disorders schizophrenia and bipolar disorder, genotypes from more than one million individuals are required (Holland et al. 2019).

Improved discovery of shared loci using conditional false discovery rate

Although the successive incremental increases in GWAS sample sizes have effectively improved the discovery of trait-associated loci, an alternative and more cost-efficient approach is to apply statistical tools that improve the yield of existing GWAS. The conditional false discovery rate (condFDR) is such an approach, which boosts GWAS discovery by leveraging auxiliary genetic information to re-adjust the GWAS test statistics in a primary phenotype (Andreassen et al. 2013b; Schork et al. 2016). The condFDR method is a model-free strategy for analysis of GWAS summary statistics inspired by the Empirical Bayes statistical framework, which is designed for situations with dense elements, such as the large number of small genetic effects seen in polygenic traits and disorders. Most commonly, the condFDR method has been applied for cross-trait analysis, by leveraging overlapping SNP associations (i.e., cross-trait enrichment) between separate GWAS to re-rank the test statistics in a primary phenotype conditional on the associations in a secondary phenotype (Andreassen et al. 2013b; Schork et al. 2016). Other auxiliary enrichment sources, such as genomic annotations (Schork et al. 2013), can also be leveraged using condFDR (Lo et al. 2017; Wang et al. 2016b). Since its introduction in 2013 (Andreassen et al. 2013a), the condFDR method has increased genetic discovery in a wide spectrum of complex human traits and disorders, including psychiatric, cardiovascular and neurological disorders, as well as metabolic, psychological and cognitive traits, among others (see Table 1 for a selection of cross-trait condFDR studies) (Andreassen et al. 2013a, c, 2014a, b, c; Broce et al. 2018, 2019; Desikan et al. 2015; Drange et al. 2019; Ferrari et al. 2017; Hu et al. 2018; Karch et al. 2018; Le Hellard et al. 2017; LeBlanc et al. 2015; Liu et al. 2013; Lv et al. 2017; McLaughlin et al. 2017; Mufford et al. 2019; Shadrin et al. 2018; Smeland et al. 2017a, b, 2018; 2019; van der Meer et al. 2018; Wang et al. 2016a; Winsvold et al. 2017; Witoelar et al. 2017; Yokoyama et al. 2016, 2017; Zuber et al. 2018).

Table 1 Selected cross-trait conditional false discovery rate studies

The present review focuses on the cross-trait condFDR approach, which returns a condFDR value for each SNP, defined as the probability that a SNP is null in the first phenotype (i.e., that it has no association with the phenotype) given that the p values in the first and second phenotypes are as small as or smaller than the observed ones. The condFDR estimates are obtained for each nominal SNP p value in the primary phenotype after computing the stratified empirical cumulative distribution functions (cdfs) of the nominal p values (Sun et al. 2006; Yoo et al. 2009). The separate strata are determined by the relative enrichment of SNP associations as a function of increased nominal SNP p values in a secondary phenotype. The standard FDR framework derives from a model that assumes that the distribution of test statistics in a GWAS can be formulated as a mixture of null and non-null effects, with true associations (non-null effects) having more extreme test statistics than false associations (null effects) on average. Given a statistical genetic relationship between two phenotypes, stratification of the test statistics in a primary phenotype based on the genetic associations with a secondary phenotype will result in a reduction in the FDR at a given nominal p value relative to the FDR computed from the unstratified distribution of the primary phenotype p values alone, and thus, re-rank the test statistics.

The first step in the condFDR procedure is to construct conditional quantile–quantile (Q–Q) plots, which extends the standard Q–Q plots commonly applied in GWAS. Standard Q–Q plots visualize the enrichment of statistical association relative to that expected under the global null hypothesis by plotting the nominal − log10p values of the single SNP association statistics versus their empirical distribution. Conditional Q–Q plots help visualize the cross-trait enrichment between two phenotypes and are constructed by creating subsets of SNPs based on the level of association with the secondary phenotype. Under the global null hypothesis, the nominal p values will form a straight line plotted as a function of their empirical distribution. Under polygenic association, standard Q–Q plots will be deflected leftwards, while cross-trait enrichment can be seen as successive leftward deflections in conditional Q–Q plots as levels of SNP associations with the secondary phenotype increase. Figure 1a presents a conditional Q–Q plot demonstrating SNP enrichment for the psychiatric disorder bipolar disorder (n = 51,710) (Stahl et al. 2019) as a function of the association with intelligence (n = 269,867) (Savage et al. 2018), adapted from Smeland et al. (2019). A complementary way to assess for cross-trait enrichment is to construct fold-enrichment plots, which provide a more direct visualization of the polygenic enrichment (Fig. 1b). The fold enrichment is calculated as the ratio between the -log10(p) cumulative distribution for a given stratum and the cumulative distribution for all SNPs. Figure 1b shows that for SNPs with p values below 0.001 in intelligence, there was up to 60-fold enrichment of stronger SNP associations with bipolar disorder in comparison to all SNPs. The enrichment seen in conditional Q–Q plots and fold-enrichment plots reflects increased tail probabilities in the distribution of test statistics and an overabundance of low p values compared to that expected by chance, which can be directly interpreted in terms of a Bayesian interpretation of the true discovery rate (TDR = 1 − FDR; see Box 1 for mathematical framework) (Efron 2010). This is illustrated in Fig. 1c.

Fig. 1
figure 1

Cross-trait enrichment between bipolar disorder (BD; n = 51,710) (Stahl et al. 2019) and intelligence (n = 269,867) (Savage et al. 2018), adapted from Smeland et al. (2019). a Conditional Q–Q plot displaying the nominal −log10p values of the single SNP association statistics versus their empirical distribution in BD below the standard GWAS threshold of p < 5×10−8 as a function of significance of association with intelligence at the level of p ≤ 0.1, p ≤ 0.01, p ≤ 0.001, respectively. The blue line indicates all SNPs. The dashed line indicates the null hypothesis. b Fold-enrichment plot of enrichment versus nominal −log10p values in BD as a function of association with intelligence. c Conditional true discovery rate (TDR) plot illustrating the increase in TDR associated with increased enrichment in BD conditioned on intelligence. The test statistics were corrected for genomic inflation, SNPs were randomly pruned across 500 iterations using a linkage disequilibrium r2 threshold of 0.1, and the extended major histocompatibility complex region and chromosomal region 8p.23.1 were excluded (Smeland et al. 2019)

To control for spurious (i.e., non-generalizable) enrichment due to population stratification or cryptic relatedness (Devlin and Roeder 1999), all test statistics are corrected using a genomic inflation control procedure leveraging intergenic SNPs, which are relatively depleted for true associations (Schork et al. 2013). Conditional Q–Q plots and the condFDR computation are conducted after random pruning to approximate independence, by selecting one random SNP per LD block (defined by an r2 > 0.1) averaged over at least 100 iterations (Andreassen et al. 2013b; Schork et al. 2016). Similar to previously described stratified FDR procedures (Sun et al. 2006; Yoo et al. 2009), the condFDR value is then determined for each SNP by constructing a two-dimensional FDR look-up table where the FDR for SNP associations with the primary phenotype is computed conditionally on the nominal p values for SNP associations with the secondary phenotype (Box 1). Figure 2a presents the respective condFDR look-up table for bipolar disorder conditional on intelligence, corresponding to the cross-trait enrichment observed in Fig. 1.

Fig. 2
figure 2

a Conditional false discovery rate (condFDR) 2D look-up table for SNP associations with bipolar disorder (BD) conditional on SNP associations with intelligence, corresponding to the cross-trait enrichment observed in Fig. 1. The FDR in BD SNPs are computed conditionally on the nominal intelligence p values. b condFDR 2D look-up table for SNP associations with intelligence conditional on SNP associations with BD. c Corresponding conjunctional FDR (conjFDR) 2D look-up table for SNP associations shared between BD and intelligence. The color refers to the FDR values

The conjunctional FDR (conjFDR) is an extension of the condFDR, which allows for discovery of SNPs significantly associated with two phenotypes simultaneously (Andreassen et al. 2013a; Schork et al. 2016). The conjFDR is determined after inverting the roles of the primary and secondary phenotypes and repeating the condFDR procedure. Based on previous conjunction tests for p value statistics (Nichols et al. 2005), the conjFDR is defined as the maximum of the two condFDR values, providing a conservative estimate of the FDR for an SNP association with both phenotypes jointly (Fig. 2c). Thus, in combination, the condFDR/conjFDR approaches both improve SNP discovery rates (condFDR) and enable detection of shared genomic loci (conjFDR), respectively. Since the condFDR/conjFDR estimates are based on nominal p values only, these methods are agnostic to the effect directions of the individual SNPs, and can detect overlapping SNP associations irrespective of the genome-wide genetic correlation between phenotypes. However, after detecting likely overlapping SNPs, the directional SNP effects in the loci can be determined post hoc by comparing the effect sizes (z scores or odds ratios) between the phenotypes.

The condFDR/conjFDR approaches have some limitations. Although all SNPs are randomly pruned using an LD r2 threshold of 0.1, complex correlations among the test statistics may bias the condFDR estimates (Schwartzman and Lin 2011). Hence, given strong SNP associations within long-range LD regions, such as the extended major histocompatibility complex (MHC) region, chromosomal region 8p.23.1, the microtubule-associated tau protein (MAPT) region or the APOE region (Price et al. 2008), these regions should be excluded to avoid artificially inflated genetic enrichment. The condFDR/conjFDR procedures are agnostic about the specific causal variants underlying the overlapping genomic associations, which could arise from both shared and separate causal variants, or “mediated pleiotropy”, where one phenotype is causative of the other (Solovieff et al. 2013). Given that the cross-trait enrichment both reflects the extent of polygenic overlap between the phenotypes and the power of the two GWAS analyzed, cross-trait enrichment will be harder to detect if one or both investigated GWAS are inadequately powered. Another important limitation of the condFDR method is that a large fraction of overlapping participants between the investigated GWAS may inflate the cross-trait enrichment, and shared participants should, therefore, be reduced to a minimum. An extension of condFDR, allowing shared controls, has been proposed (Liley and Wallace 2015).

Comparison to other cross-trait analytical tools

A large number of tools for cross-trait analysis using GWAS data have been developed in recent years, which have been reviewed in detail elsewhere (Gratten and Visscher 2016; Hackinger and Zeggini 2017; Pasaniuc and Price 2017; Schork et al. 2016). In short, the methods differentiate in terms of the data analyzed (summary statistics versus individual genotype data), the underlying mathematical framework and assumptions, whether they are bivariate or multivariate in nature, and whether they measure overlap at the genome-wide level or across individual SNPs or loci/regions. Here, we compare the condFDR/conjFDR approach to a selection of relevant cross-trait analytical tools.

The most common approaches for evaluating genetic overlap at the genome-wide level include tools such as polygenic risk scores (Purcell et al. 2009), mixed-model approaches (Cross-Disorder Group of the Psychiatric Genomics C et al. 2013; Lee et al. 2012) and LD score regression (Bulik-Sullivan et al. 2015a), which return a single estimate of shared genetic risk between phenotypes. Polygenic risk scores are per-individual risk profiles based on the sum of alleles associated with a phenotype weighted by their effect sizes (Purcell et al. 2009). The polygenic risk score approach uses summary statistics as training data and requires individual genotype data in an independent target sample to test how well the polygenic risk score explains phenotypic variation in the target phenotype. Another traditional measure that estimates the degree of pleiotropy is the genetic correlation, which is defined as the correlation between the genetic influences for a pair of traits, thus indicating the proportion of variance that the two traits share due to genetic causes. Mixed-model approaches (Lee et al. 2012), originally implemented in the Genome-wide Complex Trait Analysis software (GCTA), obtained unbiased estimates of the genetic correlation using individual genotype data, relaxing several limitations of traditional studies based on pedigree data. Estimates of genetic correlation can also be quantified from GWAS summary statistics, using cross-trait LD score regression (Bulik-Sullivan et al. 2015a) and its multivariate extension Genomic SEM (Grotzinger et al. 2019). LD score regression aims to distinguish confounding from polygenicity by regressing the association statistics of SNPs on their ‘LD scores’, which is a measure of the amount of genetic variation the SNP represents (Bulik-Sullivan et al. 2015b). Application of LD score regression to the bivariate framework estimates the co-variance in the SNP heritability between two phenotypes, allowing sample overlap (Bulik-Sullivan et al. 2015a). An alternative approach estimating local genetic correlations based on the fixed effects model is also available (Shi et al. 2017). The condFDR approach is fundamentally different to these tools by aiming for discovery of specific genomic loci. However, the condFDR approach similarly focuses on the polygenic fraction that did not reach genome-wide significance to uncover cross-trait enrichment. To fully disentangle the genetic relationship between complex phenotypes, it is necessary to complement measures of genetic overlap at the genome-wide level with cross-trait analytical tools allowing detection of individual shared loci regardless of their directional effects. For instance, a recent condFDR study demonstrated substantial cross-trait enrichment between bipolar disorder (Stahl et al. 2019) and intelligence (Savage et al. 2018) (Fig. 1) and uncovered a balanced pattern of concordant and discordant directional effects among 79 shared loci identified at conjFDR < 0.05 (Fig. 3) (Smeland et al. 2019). These findings extend and comply with prior genetic studies reporting no significant genome-wide genetic correlation between the phenotypes (Brainstorm et al. 2018; Davies et al. 2018; Hill et al. 2016; Lencz et al. 2014; Savage et al. 2018; Sniekers et al. 2017; Stahl et al. 2019).

Fig. 3
figure 3

Common genetic variants jointly associated with bipolar disorder (BD; n = 51,710) and intelligence (n = 269,867) at conjunctional false discovery rate (conjFDR) < 0.05, adapted from Smeland et al. (2019). Manhattan plot showing the – log10 transformed conjFDR values for each SNP on the y-axis and chromosomal positions along the x-axis. The dotted horizontal line represents the threshold for significant shared associations (conjFDR < 0.05, i.e., –log10 (conjFDR) > 1.3). Independent lead SNPs are encircled in black. For details, see Supplementary Table 9 in Smeland et al. (2019)

There is a large class of cross-trait methods aiming to discover specific genomic loci unique or shared between phenotypes inspired by the meta-analysis technique (Willer et al. 2010) and its extensions dealing with sample overlap (Han et al. 2016; Lin and Sullivan 2009). For example, the COMBINE approach (Ellinghaus et al. 2012) consists of two separate runs of a same-effect and opposite-effect meta-analysis, both using the inverse variance-weighted procedure. In the opposite-effect meta-analysis, the minor and major alleles are flipped in the second dataset to capture bi-allelic SNPs with opposite-effect directions in the two phenotypes investigated. This method was later refined and extended to multiple heterogeneous traits using restricted and weighted subset search (ASSET) (Bhattacharjee et al. 2012), which exhaustively explore subsets of studies to achieve the best possible trade-off between specificity and sample size. Its successor, compare-and-contrast meta-analysis (CCMA) (Baurecht et al. 2015), further improved the power to discover associations by combining the subset search approach with trans-ethnic meta-analysis (MANTRA) (Morris 2011). Several alternative approaches explore additional information, including individual-level genotypes (MultiPhen) (O’Reilly et al. 2012), phenotypic correlations (TATES) (van der Sluis et al. 2013) or estimated genetic correlations (MTAG) (Turley et al. 2018). A common feature of all techniques based on a meta-analysis framework is that the analysis is performed independently for each SNP, thus requiring a follow-up mechanism to control for multiple testing, such as Bonferroni correction, to avoid false positive findings. The condFDR analysis, on the other hand, directly works with the entire original set of p values from the two GWAS and intrinsically incorporates multiple testing via the FDR framework (Efron 2010).

Another class of methods aims at disentangling LD structure to reveal underlying causal genetic mechanisms. Mendelian randomization aims to distinguish true pleiotropy from mediated pleiotropy by investigating whether one phenotype is causative to the other (Hernan and Robins 2006; Lawlor et al. 2008; Smith and Ebrahim 2003; Zhu et al. 2018). Mendelian randomization assigns genetic variants, which are expected to be independent of confounding factors, as instrumental variables to test for causality. Several available Bayesian approaches (Giambartolomei et al. 2014; Pickrell et al. 2016) explore whether two association signals in the same genomic region obtained from two different GWAS share a single causal variant or multiple causal variants. Frei and colleagues performed a similar analysis at the genome-wide level, estimating the proportion of phenotype-specific causal variants and shared variants between complex phenotypes using GWAS summary data, while controlling for shared participants (Frei et al. 2019). The analysis demonstrates how the shared polygenic component may constitute a large fraction of the genetic architecture of one phenotype, while constituting a smaller fraction of the architecture of a phenotype with larger polygenicity. While the condFDR/conjFDR approach is agnostic about the causal variants underlying the identified associations, it complements these methods by improving the discovery of genomic loci, which can be used to prioritize down-stream analysis.

Conclusion

Accumulating evidence has shown that genetic pleiotropy is pervasive among complex human traits and disorders, providing important insights into etiological relationships. Since its introduction in 2013, application of the condFDR/conjFDR approach has increased yield of existing GWAS and aided the discovery of overlapping genomic loci between polygenic phenotypes. Given that large fractions of the polygenic architecture underlying most complex phenotypes still remain undetected, the condFDR/conjFDR approach represents a cost-effective powerful strategy useful for improving GWAS discovery and help elucidating shared genetic etiologies.