Introduction

Preeclampsia is a major pregnancy-related disorder that causes maternal and fetal morbidity and mortality (Lachmeijer et al. 2002a). Family-based studies have shown that genetic factors play a role in preeclampsia but the exact inheritance pattern is still unknown. Candidate gene studies on preeclampsia have produced inconsistent results so far. As an alternative to the single-gene approach, genome search can provide useful information on chromosomal loci that may be linked to complex diseases (Risch 1990). However, genome scans have also produced inconclusive inferences because linkage signals tend to be rather weak, the number of families and affected sibpairs are relatively small, and the individual genome scans identify linkage in different chromosomal regions (Wise et al. 1999; Dempfle et al. 2004). In an effort to generate more conclusive evidence, several investigators have proposed methods for summing the data from diverse genome scans through meta-analysis (Wise et al. 1999; Fisher et al. 2003).

Genome search meta-analysis (GSMA) is one of the best established methods for meta-analysis of such data (Wise et al. 1999; Levinson et al. 2003; Zintzaras et al. 2005a). GSMA, has already been applied to genome scans of several diseases (Fisher et al. 2003; Levinson et al. 2003; Chiodini et al. 2003; Demenais et al. 2003; Lewis et al. 2003; Segurado et al. 2003; van Heel et al. 2004; Koivukoski et al. 2004; Trikalinos et al. 2005) The methodology has been extended for evaluating also heterogeneity between different genome scans in the context of GSMA (HEGESMA: heterogeneity-based genome-scan meta-analysis). Heterogeneity metrics may offer additional useful insights besides summary estimates (Lau et al. 1998; Engels et al. 2000). In the present meta-analysis, these two methods were applied to the available genome scan data on preeclampsia (Zintzaras and Ioannidis 2005a, b).

Materials and methods

Eligible genome scans

We considered all preeclampsia genome scans published until December 2005. The studies were identified by search of the PubMed database. We used the combination of the following terms as the search criterion: “genome search” or “genome scan” and “preeclampsia”. Scans restricted to specific individual chromosomes were excluded.

Databases

The following information was extracted for each suitable study: first author, journal, year of publication, country of recruitment, racial descent of study population, criteria of disease diagnosis, number of families, number of affected sibpairs, number of microsatellite markers, linkage statistic, software of linkage analysis. The corresponding investigators of each genome scan were then contacted and asked to provide high quality data figures and details on each genome scan so that information could be standardized similarly across genome scans. Data were separately extracted for two phenotypes: generally-defined preeclampsia (“general preeclampsia”, including gestational hypertension, proteinuric preeclampsia, and eclampsia); and severe preeclampsia, defined to include only proteinuric preeclampsia, and eclampsia.

Genome search meta-analysis and heterogeneity testing

The GSMA starts by splitting the chromosomes into bins of approximately equal length; usually each bin has a width of 30 centiMorgan (cM) giving 120 bins in total for the whole genome (Wise et al. 1999; Levinson et al. 2003; Zintzaras and Ioannidis 2005a). For nomenclature purposes, bin c.n is the nth bin on chromosome c. For each genome scan, the most significant result of the test statistic obtained within the bin is recorded. Then, for each scan the bins are ranked according to their significance of results and the ranks for each bin are summed across scans. The significance of the average rank of each bin is assessed empirically against the distribution of average ranks. Under the null hypothesis of no linkage in any chromosomal bin, the ranks are randomly assigned from each study, then, the probability that the ranks X i (where i = 1 − m studies) from a specific bin sum to R is:

$$ P{\left( {{\sum\limits_{i = 1}^m {X_{i} = R} }} \right)} = 0\quad {\text{for}}\;R < m, $$
$$ P{\left( {{\sum\limits_{i{\text{ = 1}}}^m {X_{i} = R} }} \right)} = \frac{1} {{n^{m} }}{\sum\limits_{k = 0}^{\text{int} [(R - m)/n]} {( - 1)^{k} } }{\left( {\begin{array}{*{20}c} {{R - KN - 1}} \\ {{M - 1}} \\ \end{array} } \right)}{\left( {\begin{array}{*{20}c} {m} \\ {k} \\ \end{array} } \right)}\quad {\text{for}}\;m = R = nm, $$
$$ P{\left( {{\sum\limits_{{\text{i = 1}}}^m {X_{i} = R} }} \right)} = 0\quad {\text{for}}\;R > nm{\text{.}} $$

When a bin has a high summed rank, then this is considered as evidence for linkage. If several bins within a study had equal test statistics, they were assigned tied ranks, and negative linkage scores were ranked as zero.

The Q statistic measures heterogeneity in the ranks between studies for each bin. It is the sum of the squared deviations of each study’s bin rank from the mean of ranks:

$$ Q = {\sum\limits_{i = 1}^m {w_{i} (R_{i} - \ifmmode\expandafter\bar\else\expandafter\=\fi{R})} }^{2} , $$

where R i is the rank of the bin under investigation for study i (i = 1 − m studies), \( {\text{\ifmmode\expandafter\bar\else\expandafter\=\fi{R}}} \) is the mean rank across the available studies, and w i is the weighting factor for study i. Significantly low between-study heterogeneity is an indication that the results of different studies for the same bin are very consistent among themselves. The presence of low heterogeneity for a specific bin with high ranks in all studies can be interpreted as further supportive evidence for the importance of this bin. Conversely, significantly high between-study heterogeneity would suggest that there may be major differences in genetic effect, design, population, or other factors across the combined studies. Besides the Q statistic, two other heterogeneity metrics (B and Ha) have been proposed, but inferences with these metrics were similar to Q metric (not shown in detail) (Zintzaras and Ioannidis 2005a).

The statistical significance of the average rank and the Q metric were assessed using a Monte Carlo method. The ranks of each study are randomly permuted and the simulated average rank and Q metric is calculated; then the procedure is repeated 50,000 times and a null distribution for the average rank and the Q metric is constructed. The significance level (P rank) of the average rank of bins is the percentage of simulated average ranks greater than or equal to the observed. The statistical significance level (P Q ) for low heterogeneity is the percentage of simulated metrics less than the observed (left-sided p value) (Zintzaras and Ioannidis 2005a). For high between-study heterogeneity, we examined the right-sided p values. Moreover, we performed a Monte Carlo test that generates null distributions separately for each bin, considering only the simulated distributions of the Q metric (Q adjusted) for bins having similar simulated average ranks (± 2) as the bin being considered each time (Zintzaras and Ioannidis 2005a). This analysis takes into account that the Q statistic may be influenced by the average rank.

Finally, we calculated the probability of observing a given average rank for a bin by chance in bins with the same “place” in the ascending order of average ranks in the runs (ordered ranks) (P order) (Levinson et al. 2003). P rank assesses the significance of each bin independently, whereas P order is based on the distribution of average ranks across all bins simultaneously (Fisher et al. 2003).

We performed both unweighted and weighted analyses. In weighted analysis, the ranks of the bins in each study were weighted by

$$ w_{i} = \surd {\left[ {({\text{number}}\;{\text{of}}\;{\text{individuals}})_{i} \times ({\text{number}}\;{\text{of}}\;{\text{markers}})_{i} } \right]} $$

where i denotes the study, and the weights were scaled to sum up to one.

The main analysis considered only genome scans where all bins were represented. When a scan had evaluated all chromosomes but with fewer markers than those required to generate information for all 120 bins, then, we considered this scan only in a secondary analysis. The ranks of this scan were adjusted to give a maximum rank of 120 and the meta-analysis was then restricted only to common bins represented across all scans (referred to as “conditional” analysis).

Analyses used the HEGESMA software (http://www.biomath.med.uth.gr) (Zintzaras and Ioannidis 2005b). Nominal significance corresponds to one-sided p values <0.05 that are presented here without any adjustment for multiple comparisons.

Results

Databases

The literature review identified 17 titles in PubMed. The full articles of the retrieved studies were read to assess their suitability for meta-analysis. Five studies met the inclusion criteria (Harrison et al. 1997; Lachmeijer et al. 2001; Arngrimsson et al. 1998; Moses et al. 2000; Laivuori et al. 2003). All studies used similar diagnosis criteria for general preeclampsia and severe preeclampsia, apart from the study by Lachmeijer et al. (2001), which was included only in the analysis for severe preeclampsia. In two studies (Harrison et al. 1997; Laivuori et al. 2003), the authors provided LOD scores and NPL scores, respectively, for each marker used in the genome scans. In the remaining studies, the data were extracted from the published graphs showing linkage scores per distance for all chromosomes. This information was also occasionally complemented by specific data on observed peaks presented in tables and/or text. Graphs were electronically digitized using appropriate software (Engauge Digitizer, ver 2.12, Mark Mitchell, 2002). In the study by Arngrimsson et al. (1998), we extracted the data from the genome scan that includes two families with a large effect to the LOD score at the 2p13 locus and this may cause some bias towards this locus.

Details on the analyzed studies are shown in Table 1. Briefly, the Finnish study consisted of 381 microsatellite markers genotyped in 15 families with 50 affected cases with severe preeclampsia and 63 cases with general preeclampsia (Laivuori et al. 2003). In the Australian and New Zealand study, the genome scan was performed by genotyping 400 microsatellite markers in 34 families with 87 affected cases for severe preeclampsia and 121 cases for general preeclampsia (Moses et al. 2000). In the Icelandic study for severe preeclampsia, 72 families containing 186 cases were evaluated using 440 microsatellite markers (Arngrimsson et al. 1998). For general preeclampsia, 124 families containing 343 cases were analyzed. All of the above-mentioned studies had used multi-point analysis to assess genome scans. In the Dutch study, the genome scan was performed with 293 microsatellite markers in 38 families with 77 cases for severe defined preeclampsia only (Lachmeijer et al. 2001). Finally, the Australian study consisted of 15 families with 40 and 50 severe and general preeclampsia cases, respectively (Harrison et al. 1997). This study used two-point analysis and only 90 markers, thus only 57 of the 120 bins were represented. A few families in this study partly overlapped with the larger Australian–New Zealand study. The exact number of families that were possibly double-counted was not clarified, the data from the Harrison et al. (1997). Therefore, the data of the smaller Australian study were only considered in the secondary “conditional” analysis. All genome scans have been conducted by considering the inheritance of the maternal genotype. The genetic and chromosomal location of the markers used in the original publications were not re-assigned.

Table 1 Characteristics and major results of preeclampsia genome-scans

LOD score was used as linkage statistics in all studies, except the Finnish study where NPL score was used instead. The Australian study had the least weight with a weight factor equal to w = 0.08 or 0.07 for general or severe preeclampsia, respectively, and the Iceland study had the most weight (w = 0.46 or 0.34 for general or severe preeclampsia, respectively). The other studies had weights ranging from 0.18 to 0.26. The chromosome regions with suggestive linkage identified from each individual genome-scan are shown in Table 1.

General preeclampsia

Figure 1 shows the average ranks for each bin from three general preeclampsia genome scans, (Arngrimsson et al. 1998; Moses et al. 2000; Laivuori et al. 2003) and Fig. 2 shows the respective ranks from four severe preeclampsia genome scans (Lachmeijer et al. 2001; Arngrimsson et al. 1998; Moses et al. 2000; Laivuori et al. 2003). The bins with significant P rank < 0.05 in unweighted or weighted analysis are indicated on the figure plots. The significant bins in the main and conditional analyses, the observed ranks, the GSMA and heterogeneity statistics for each study are shown in Table 2.

Fig. 1
figure 1

Unweighted (open circle) and weighted (filled circle) average ranks from three (Iceland, Australia–New Zealand, Finland) general preeclampsia genome-scans. Bins (c.n: nth bin on chromosome c) with significant P rank < 0.05 in unweighted or weighted analysis are showed above the solid line

Fig. 2
figure 2

Unweighted (open circle) and weighted (filled circle) average ranks from four (Iceland, Australia–New Zealand, Netherlands, Finland) severe preeclampsia genome-scans. Bins (c.n: nth bin on chromosome c) with significant P rank < 0.05 in unweighted or weighted analysis are showed above the solid line

Table 2 General preeclampsia genome-scan meta-analysis results in unweighted and (in parentheses) weighted analyses. Results based on three scans (4 scans on conditional analysisb)

In general preeclampsia, 12 bins were found to have P rank < 0.05 by either unweighted or weighted analyses; four of them (bins 2.5, 9.4, 2.4, 2.8) had P rank < 0.05 with both methods (Table 2). Conditional analysis revealed an additional bin (bin 4.7). These bins were not significant in the order statistics for the unweighted or weighted analysis (P order > 0.05). Bin 2.8 showed low heterogeneity that was statistically significant in both unweighted and weighted analysis (P Q  = 0.006 and 0.007, respectively). Even when the Monte Carlo distributions were adjusted for the average rank of each bin, significantly low heterogeneity was seen in bin 2.8 for the unweighted and weighted analysis (P Qadjusted = 0.027 and 0.032, respectively). Thus, bin 2.8 provided evidence of linkage to preeclampsia in terms of both high average rank and significantly low heterogeneity between genome scans. Two bins (2.5 and 2.4) showed evidence for significantly high between study heterogeneity (right-sided P Qadjusted < 0.05): these bins ranked at the top ranks in the Icelandic and Australian–New Zealand scans, but did not rank that high in the Finnish scan.

Severe preeclampsia

In the main analysis for severe preeclampsia, eight bins were found to have P rank ≤ 0.05 by either unweighted or weighted analyses and five of them (bins 3.5, 2.10, 18.1, 2.4, 7.6) had P rank < 0.05 with both methods (Table 3). Another two bins (bins 22.2, 4.7) were significant in the conditional analysis. In all bins, the order statistics was non-significant for the unweighted or weighted analysis (P order > 0.05). Bin 2.10 produced marginally low heterogeneity in unweighted analysis (P Q  = 0.056). However, heterogeneity was not significantly low, when analyses were adjusted for average rank, P Qadjusted = 0.177. No bins showed significantly high between study heterogeneity.

Table 3 Severe preeclampsia genome-scan meta-analysis results in unweighted and (in parentheses) weighted analyses. Results based on four scans (5 scans on conditional analysisb)

Correlation of results for the two phenotypes

Three bins (2.4, 2.6, 4.7) had significant average ranks for both general and severe preeclampsia, although this was not seen in both weighted and unweighted analyses for all of them. Analyses showed consistently significant high average ranks only for bin 2.4, regardless of the outcome definition and regardless of whether weighting was used or not. There was modestly high correlation of the average bin ranks between general and severe preeclampsia (Spearman’s r = 0.50, p < 0.00001 based on unweighted analyses; r = 0.48, p < 0.00001 based on weighted analyses).

Discussion

The present meta-analysis has identified several chromosomal regions with some evidence for linkage for general and severe preeclampsia. However, only four (bins 2.5, 9.4, 2.4, 2.8) and five (bins 3.5, 2.10, 18.1, 2.4, 7.6) regions were formally significant in both unweighted and weighted analysis for general and severe preeclampsia, respectively. There was modest overlap in the regions (bin 2.4) identified for each phenotype suggesting that the two phenotypes are likely to share some common genetic basis, but may also have phenotype-specific genetic effects. However, the Icelandic study contributed mostly to the significance of bin 2.4 (2p13 locus). Heterogeneity testing revealed low heterogeneity at two regions (bin 2.8 for general preeclampsia and bin 2.10 for severe preeclampsia) identified by the HEGESMA in both weighted and unweighted analysis. Heterogeneity testing adjusting for average rank retained the formal significance for low between scan heterogeneity for bin 2.8 suggesting that results are very consistently pointing to this chromosomal region across diverse scans.

A recent paper by Lewis and Levinson (2006) showed that the Q metric can be underpowered for moderate sibling relative risks, while the restricted test (Q adjusted) may provide a better power to detect low heterogeneity. However, more extensive simulation studies are required to draw definitive conclusions. In addition, the low power of heterogeneity statistics is well known phenomenon in meta-analysis (Lau et al. 1998). Nevertheless, quantification of heterogeneity is an important metric in a meta-analysis, and currently, HEGESMA is the only available approach for heterogeneity testing in GSMA.

For general preeclampsia, the present meta-analysis replicated the significance of seven chromosomal regions that were also reported as significant by the individual genome scans (2p11.2–2q21.1, 2p15–2p11.2, 9q31.2–9q34.11, 15q11.2–15q14, 4q28.3–4q32.1, 2q21.1–2q24.1, 4q32.1–4q35.1). Six novel candidate chromosomal regions for general preeclampsia were identified: 9q21.32–9q31.2, 2q32.1–2q35, 6p22.3–6p21.1, 1q32.3–1q43, 13q33.1–13q34 and 15q22.32–15q26.1. For severe preeclampsia, there was a replication for six already reported chromosomal regions: 18p11.32–18p11.22, 11q13.3–11q22.1, 2p15–2p11.2, 2q21.1–2q24.1, 22q12.3–22q13.3 and 4q32.1–4q35.1. Four new candidate regions were identified: 3q11.1–3q21.2, 2q37.1–2q37.3, 9q34.11–9q34.3 and 7q34–7q36.3.

Genes at several of the identified regions may confer susceptibility to disease. Bin 2.8 includes at least two genes investigated for susceptibility to preeclampsia: the INHA and CTLA-4 genes. CTLA-4 has been implicated as a candidate gene for severe preeclampsia in a recent study (Samsami Dehaghani et al. 2005). CTLA-4 is a well established candidate gene also for several diseases with autoimmune pathophysiology and in particular for diabetes mellitus which may share with preeclampsia a pro-thrombotic tendency and relationship to hypertension (Ueda et al. 2003; Kavvoura et al. 2005). Inhibin levels are affected in preeclampsia, but it is unclear whether polymorphisms of the INHA gene affect susceptibility to preeclampsia (Ciarmela et al. 2005). In bin 3.5, the region that had the highest average rank for severe preeclampsia, there is the AGT1R gene, while bin 2.5 contains the IL1B gene, both of them associated to preeclampsia in some studies (Bouba et al. 2003; Lachmeijer et al. 2002b). In bins 2.4 and 2.6 there are no known genes associated with preeclampsia. However, based on the results of the Iceland and Australian–New Zealand study, Fitzpatrick et al. have designated this 74 cM region, overlapping bins 2.4, 2.5 and 2.6 as PREG1 (preeclampsia/eclampsia gene 1) locus. Two positional candidate genes have been analyzed in bin 2.4 (TACR1, TCF7L1) (Fitzpatrick et al. 2004). Bin 1.9 is the region where various known candidate genes are located such as AGT, REN and MTR (Bouba et al. 2003; Also Rallo et al. 2005). Bin 13.4 is the location of factor VII gene promoter (F7), and bin 7.6 contains the NOS3 and TBXAS1 genes which may also be implicated in the development of preeclampsia (Laasanen et al.2002; Bashford et al. 2001).

The populations involved in the meta-analysis are of Caucasian origin. Therefore any heterogeneity should be attributed more to genuine inconsistency of genetic effects and to differences in the studies’ design and conduct, and less to ethnic descent differences across populations. However, we should acknowledge that Finnish and Icelandic populations may have some of the characteristics of genetic isolates.

The Dutch study, under the strict criteria, presented data for 67 families including the diagnosis of HELLP syndrome and for 38 families excluding HELLP syndrome. The authors of this study analysed the data separately, because in the previous scans no HELLP syndrome cases had been recruited (Lachmeijer et al. 2001). The most recent genome scan following the Dutch study is the Finnish study, which did not include HELLP syndrome cases as well (Laivuori et al. 2003). Consequently, our meta-analysis included the data from the 38 families without HELLP syndrome.

When preeclampsia starts early, i.e. before week 32, it is frequently associated with intrauterine growth retardation due to deterioration of placental function. The potential placental contribution requires special consideration. The fetal gene load may also influence a mother’s susceptibility to preeclampsia. A recent analysis indicated that both maternal and fetal genes play an etiological role and estimated the heritability conferred by maternal genes as 35% and that due to fetal genes as 20% (Pawitan et al. 2004). Another recent study (GOPEC trial) with tightly phenotyped mothers with preeclampsia together with suitable family structures (maternal and fetal trios) found that none of the genetic variants tested conferred a high risk of disease (GOPEC Consortium 2005). Even if the meta-analysis could be restricted to the early-onset, severe phenotype, preeclampsia is still likely to be very heterogenous, and placental contribution (through maternally or paternally inherited genes) could vary in different pregnancies. Most typically preeclampsia affects first pregnancies, but in familial preeclampsia it occurs also more often in subsequent pregnancies and the genetic contribution is likely to be stronger.

Conventionally HEGESMA is based on bins with width of 30 cM. An analysis with bin width less than 30 cM was not performed given that marker data would be sparse for this purpose. Moreover, decisions on multiple tests have not been made in this analysis because heterogeneity-based genome scan meta-analysis is an exploratory non-parametric procedure interested in the relative significance of the regions (Rothman 1990). The statistical significance values that we report are nominal and they should be interpreted with caution given the multiple bins tested, even if these bin results are correlated. Another limitation of the analysis is the use of the variable map density within and between studies (Lewis et al. 2003).

Given these limitations, we should caution that the proportion of type I and II errors may be considerable. Type II errors are expected to be less common in the meta-analysis than in single studies, but the total evidence is not extensive enough to be deemed conclusive even in the combined database. It is possible that some important regions may have been missed, while some of the identified regions may result from type I errors. Genetic effects of single genes are likely to be small. Candidate gene studies highlight that single-gene effects may be variable across populations, may differ between generally-defined preeclampsia and severe preeclampsia or postulated effects may be subject to bias (Kosmas et al. 2003, 2004). Obtaining large-scale evidence with collaborative meta-analyses is a step towards bypassing these problems (Ioannidis et al. 2006).

In conclusion, allowing for these caveats, the genome scan meta-analysis provides some evidence of linkage for several regions in general and severe preeclampsia. Further investigation of these regions by genotyping with additional markers and families may direct the identification of candidate genes for association studies in preeclampsia.