Introduction

Venous thromboembolism (VTE) is a complex disease with a yearly incidence rate of 100 per 100,000 adults in the United States. The risk of VTE rises exponentially from <5 cases per 100,000 persons <15 years old, to 500 cases per 100,000 persons at age 80 years [1]. VTE includes the common manifestation of deep venous thrombosis (DVT) of the lower extremities, and the less frequent life-threatening pulmonary embolism (PE) [2]. Epidemiological studies clearly indicate that genetic and environmental factors interact in determining the risk of VTE, and strong differences in the incidence have been shown between different ethnicities [3]. Factor V Leiden G1691A (FVL), prothrombin G2021A (PT20210A), and methylenetethraydrofolate reductase C677T (C677T MTHFR) are the most common polymorphic variants investigated in the past decades as putative genetic risk factors for VTE. While it is apparent that FVL and PT20210A represent by themselves significant risk factors for VTE, more controversial is the role of C677T MTHFR. According to a recently published meta-analysis [4], heterozygosity for FVL significantly increases the risk of VTE by 9.45-fold, and PT20210A by 3.17-fold compared with wild-types. As for C677T MTHFR, a significantly 57 % increased risk of VTE was shown in a Chinese population for homozygotes compared with heterozygotes and homozygous wild-types, whereas in Caucasians no significant association was found.

Between 3 and 8 percent of the Caucasian US and European populations carry one copy of the FVL variant. The variant is less common in Hispanics and it is extremely rare in people of African or Asian descent [5]. Heterozygous carriers of the PT20210A variant are found in about 2 % of the US white population. The mutation is uncommon in African Americans (approximately 0.5 %) and is rare in Asians, Africans, and Native Americans [6].

The homozygous variant of C677T MTHFR ranges from less than 1 % among African Americans to 20 % and more among some Caucasian populations and Hispanics. Asian populations have a prevalence of around 11 % [7].

Due to the relatively low prevalence of FVL and PT20210A variants, large studies of VTE are needed to achieve enough power to provide reliable risk estimates, and even more to explore gene–gene and gene-environment interaction. The effect of these three genes, in fact, seems to be influenced by modifiable risk factors such as oral contraceptives, pregnancy, surgery and trauma [8, 9]. Additionally, the combined effect of more than one genetic variant can double or triple the risk from a single variant. To examine to what extent FVL, PT2120A, and C677T MTHFR alone, and in combination with each other and with several environmental risk factors, affect the risk of VTE, we conducted an individual patient (IPD) data analysis by pooling data from 36 published studies. We also evaluated the association of each of the polymorphic variants with the occurrence of thromboembolic events stratified by type and site, and among individuals at higher risk such as women using oral contraceptives.

Methods

Literature search

A detailed literature search on the association of VTE occurrence and presence of the FVL, PT20210A and C677T MTHFR polymorphic variants was conducted on Medline and Embase databases, and on Google. The Medline query was structured using a string made of two parts, with the first was repeated for the three polymorphic variants considered: “(“Venous Thrombosis”[Mesh] OR Venous Thrombosis OR “deep vein thrombosis” OR “thrombosis” OR thrombosis OR Venous Thromboembolism OR Thromboembolism OR DVT OR pulmonary embolism OR VTE)”, followed by the second part, specific for each of the three polymorphisms as detailed below:

  • FVL: “(“factor V Leiden”[Substance Name] OR “factor V Leiden” OR “factor V G1691A”)”.

  • PT20210A: “(“Prothrombin”[Mesh] OR “factor II”) AND (“G20210A”[Mesh] OR G20210A OR “20210”[Mesh] OR 20210 OR 20210GA OR 20210G/A OR 20210A)”.

  • C677T MTHFR: “(“Methylenetetrahydrofolate Reductase (NADPH2)”[Mesh] OR “Methylenetetrahydrofolate reductase “ OR “MTHFR” OR Methylenetetrahydrofolate Reductase OR MTHFR OR “MTHFR-677” OR MTHFR677”)”.

The two parts were searched using the Boolean operator “AND”.

The literature search was repeated on EMBASE using a generic part “(“venous thrombosis” or “deep vein thrombosis” or “venous thromboembolism” or thromboembolism or DVT or “pulmonary embolism” or PE or VTE).tea. [mp = title, abstract, and full text, caption text]” and the following keywords for each polymorphism:

  • FVL: “(FVL or “leiden”).mp”

  • PT20210A: “(prothrombin or “factor II”).ti, ab. and (20210$ or g20210$).mp”

  • C677T MTHFR: “(MTHFR or “tetrahydrofolate reductase”).mp”

Finally, the search was completed by consulting Google as a non-specific search engine and by reading of the references of the eligible articles retrieved.

Language restrictions were not given, and the search was updated on January 30th 2010.

Inclusion criteria

The studies identified by the search were considered eligible when they met all inclusion criteria:

  • Case-controls or cohort studies reporting the occurrence of at least one of the three polymorphic variants considered, both in the population of cases and in that of the healthy controls;

  • Clear definition of cases (individuals affected by VTE) and of controls (individual not affected by VTE). To avoid overlaps, studies retrieved with each of the three strings were cross-referenced and duplicates were excluded.

Exclusion criteria

Studies were not considered eligible when at least one of the following situations was present:

  • Studies focusing solely on the association between the polymorphic variants and VTE occurrence in individuals affected by other conditions that are known risk factors for thromboembolic events (tumors, inflammatory bowel diseases, autoimmune conditions, Behçet disease, transplant);

  • Studies focusing solely on individuals younger than 16 years old;

  • Case-only and case-series studies, with no control population;

  • Studies solely on families or on recurrences;

  • Studies where the allelic frequencies of the polymorphic variants considered in the control population did not follow Hardy–Weinberg equilibrium (p ≤ 0.10 using the χ2 test).

Responders

Authors of the eligible studies were invited to send their datasets via email. Formal invitations were sent to either first or last authors of the papers. In case of no response, or when the email addresses were no longer in use, other co-authors were contacted. Also, authors were asked to state whether they had used their data on more than one publication, and, if that was the case, to merge the data and provide only the larger dataset. Authors were asked to hand in the most updated information available, including unpublished data.

Non responders

Four reminders were sent to non-responders. Authors who did not want to collaborate, or could not, were asked to explicitly decline the invitation and, if possible, state the reasons for non-participation. We kept a record of missed contributions to the study and collected available data (polymorphic variants considered, number of exposed cases/controls, number of non exposed cases/controls, general notes) from the relative publications.

Case definition

Most studies included validated their endpoints via physicians review of medical records. In some instances the International Classification of Diseases, 9th Edition or 10th Edition (ICD-9, ICD-10) discharge codes were used without further validation.

For the purpose of this pooled analysis, venous thromboembolism cases were grouped as venous thrombosis with no evidence of pulmonary embolism; venous thrombosis with pulmonary embolism; cerebral venous sinus thrombosis; splanchnic venous thrombosis; other types of events, i.e. multiple, undetermined, unspecified site.

Individual data collection

For the pooled analysis, participating authors were asked to send their dataset, coding the variables in a standard format. For each case enrolled we asked the status of the polymorphic variants of FVL, PT20210A and C677T MTHFR, (wild type homozygous, heterozygous carrier, homozygous carrier) and several potential confounders\effect modifiers:

  • Age at the first thromboembolic event

  • Gender

  • Body mass index (BMI)

  • Ethnic group

  • Type of VTE [10]

  • Known VTE family history (defined as having one or more first-degree relatives with a history of venous thrombosis) [11]

  • VTE recurrence

  • Plasma or serum concentration of folates [12]

  • Plasma or serum vitamin B12 levels [12]

  • Homocysteinemia or homocysteine levels [13]

  • Diagnosis of Diabetes Mellitus (DM)

  • Smoking habits (current smoker, former smoker, never smoker; pack/years)

  • Pregnancy status

  • Occurrence of adverse pregnancy events (preeclampsia, defined as hypertension and proteinuria detected for the first time after 20 weeks’ gestation with systolic blood pressure ≥140 mm; spontaneous recurrent abortion, defined as ≥3 unexplained foetal loss in the first trimester; intrauterine growth restriction, defined as birth weight <10th centile for gestational age; placental abruption, defined as presence of tender, hypertonic uterus and disseminated intravascular coagulation and/or retroplacental haematoma with/without signs of infarction) [14]

  • Use of oral contraceptives (OCs) in women at a reproductive age and of hormonal replacement therapy in post-menopausal women

  • Other VTE predisposing factors, such as major trauma, long journeys, prolonged bed rest and leg cast

  • Oncologic diagnosis

  • Presence of autoimmune diseases

  • Organ transplant

  • Diagnosis of Behçet disease.

The same data were requested for controls, except for: age at thromboembolic event (age at enrollment in the study was requested instead), type of VTE and VTE recurrence.

After all the individual data were collected, some subjects were excluded on an individual basis:

  • Occurrence of known conditions associated with highly increased risk of thromboembolism (tumors, inflammatory bowel diseases, autoimmune conditions, Behçet disease, transplant)

  • Age under 16 years old

Where the studies included were multi-centric, we considered the database from each centre separately.

The databases received were extracted and merged in one global database.

Statistical analysis

Data from cohort studies were extracted and used as if they were nested case–control samples. All individuals followed in the cohort who developed a DVT, at the time when the database was sent, were considered cases. Individuals who had not developed a DVT were considered controls.

For each study included, we reported the p-values of the Hardy–Weinberg equilibrium (HWE) for the whole population and for the controls; and the value of the minor allelic frequency (MAF).

The association between the three polymorphic variants and VTE was estimated from each study included by unconditional logistic regression models to estimate odds ratios (ORs) and 95 % confidence intervals (CIs). Adjusted models controlled for age and gender. Summary ORs and 95 % CI were estimated by pooling study-specific ORs using a random effects model for taking into account the heterogeneity between studies. Multivariate-adjusted models among all subjects were further stratified by age groups (<45 and ≥45 years old: this treshold was chosen based on the studies included in the pooled analysis), gender, use of OCs, recurrence of thromboembolic events, family history of VTE, and type of VTE event (venous thrombosis with no evidence of pulmonary embolism; pulmonary embolism; cerebral venous sinus thrombosis; splanchnic thrombosis; and other types of events). A test of interaction was then performed to assess for statistically significant differences among strata estimates (data not reported in the tables, only in the text if p-value for interaction <0.05). Lastly, a gene–gene interaction analysis was performed.

Based on the relative frequencies of heterozygous and homozygous variants of each polymorphism in populations, from previous examples in literature, and on a biological plausibility criterion, we applied a dominant model to determine the effect of the polymorphic variants of FVL and PT20210A on the occurrence of VTE (carriers [both heterozygous and homozygous] versus non-carriers). Conversely, a recessive model was used with regards to C677T MTHFR (homozygous carriers versus heterozygous and non-carriers) [15]. These assumptions were also used when performing gene–gene interaction analyses and stratified analyses.

Publication bias was evaluated both graphically and statistically, using Beggs and Egger tests and with funnel plots. Presence of inter-study heterogeneity in the overall and stratified analyses was assessed with the I 2 method. When high heterogeneity (I 2 ≥ 50 %) was found, a sensitivity analysis was performed to single out the accountable studies and reasons for heterogeneity described.

Additional analyses of the pooled data

All the statistical analyses described above were performed after exclusion of individuals with known triggering factors or at increased risk, that is: long journeys, immobilisation, trauma, major surgery, pregnancy, use of OCs or hormonal replacement therapy.

Studies not included

In order to measure bias due to non responders, we performed a tabular meta-analysis of the studies not included.

We compared the overall measures of association yielded by the analysis of studies not included with those yielded by the pooled analysis, using a heterogeneity test.

All calculations were performed using the Stata 11.0 ® software package.

Results

Study selection

The literature search yielded over 4,000 results. Of these, after application of the inclusion and exclusion criteria and deletion of duplicates, 115 were deemed eligible. Once the authors of these papers were contacted, 31 papers were included [9, 1645]. Of the remaining 84 [46129], 20 papers were excluded on the grounds that authors could not provide the information required. Most common reasons for non-participation were: data no longer available, scarcity of time available for the retrieval and codification of the data, and laboratory no longer existed. Fifty-nine papers could not be included because the authors never replied (Fig. 1).

Fig. 1
figure 1

Flow chart of the selection process for including articles

Because in several cases the same database was used to produce more than one article on the topic, authors were asked to send in their largest database available, including unpublished data, and to communicate whether their data were used for more than one publication. Two studies were multi-centric [9, 27, 28], so each database from these studies was considered independently from the others. For these two reasons, the number of databases included does not coincide with the number of papers considered: 37 databases were included in the pooled analysis (Table 1).

Table 1 Characteristics of the databases included in the analysis and crude and adjusted Odds Ratio (OR) for Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T polymorphisms

Odds ratios (ORs) and 95 % confidence intervals (CIs) for each database included are reported in Table 1. It is worth mentioning that crude ORs are not the same as those of the referenced articles, as new data were provided, and some subjects were excluded on an individual basis as detailed in the methods section.

The IPD analysis was performed on an overall sample of 11,239 cases and 21,521 controls. 28 databases out of 37 (76 %) came from European countries [9, 16, 19, 20, 2326, 2937, 3943, 45], 3 from Latin America [9, 21, 44], 4 from North America [22, 27, 28, 38] and 2 from Asia [17, 18].

The p-values of the HWE for each study overall and for the control population, as well as the MAF, are reported in Table 2.

Table 2 Minor allelic frequency and p-value for the Hardy–Weinberg Equilibrium for the control population and for the overall sample of the databases included in the analysis for Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T polymorphisms

Synthesis of results

FVL

Based on 9,081 cases and 17,513 controls, FVL was associated with an elevated risk of developing VTE, with an overall OR: 4.38 (CI: 3.48-5.51; I-squared = 70.3 %, 95 % CI: 54.6 % to 80.5 %; Table 3, Fig. 2). The risk was significantly higher in younger individuals (<45 years old, OR: 5.43; ≥45 years old, OR: 3.71; p-value for interaction = 0.036). Men had a higher, but not statistically significant, risk of developing VTE with FVL compared to women (male population, OR: 5.06; female population, OR 3.82), with no significant difference between strata estimates.

Table 3 Overall and stratum-specific adjusted odds ratios (OR) by polymorphic variants: Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T
Fig. 2
figure 2

Forrest plot: association between Factor V Leiden and risk of venous thromboembolism (odds ratios are represented on log scale)

As expected, the VTE OR for FVL was higher in women using OCs (OR: 6.10) than women not using OCs (OR: 3.91), but this difference was not significant. No significant differences in ORs for FVL emerged when stratifying by presence of triggering conditions. Complete data on triggering conditions, however, were only available for 20 % of the databases included. Family history did not seem to interact with FVL in risk of developing VTE. On the other hand, FVL was associated with recurrences (recurrent VTE event, OR: 5.81; no recurrence reported, OR: 3.95).

The stratification by type of outcome showed that the polymorphic variant was more strongly associated with the risk of developing venous thrombosis without evidence of pulmonary embolism (OR: 4.49) or a cerebral venous sinus thrombosis (OR: 4.14); the risk of developing a splanchnic thrombosis was not statistically significant (Table 4).

Table 4 Risk of venous thromboembolic accidents stratified by the type of event in subjects carriers of Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T polymorphism

Lastly, the OR for FVL homozygotes was 11.45 (CI 95 % 6.79–19.29), including 116 subjects among cases and 24 controls (data shown in Appendix 1).

PT20210A

Based on 9,134 cases and 17,606 controls, PT20210A was associated with an increased risk of developing VTE (overall OR 2.80, CI 2.25-3.48; I-squared = 46.1 %, 95 % CI: 11.2 % to 67.3 %), albeit not as strongly as FVL (Table 3, Fig. 3). Again, younger carriers of the polymorphic variant were at higher risk from PT20210A than their older counterparts (<45 years old, OR: 3.19; ≥45 years old, OR: 2.57), although the difference between the two groups was not as remarkable as for FVL and not significantly different. Gender did not seem to affect the risk of developing VTE in the presence of the polymorphic variant: the risks for men (OR: 2.91) and women (OR: 2.88) were similar. As expected, the use of OCs affected risk of PT20210A for developing VTE (women not using OCs, OR: 2.73; women using OCs, OR: 5.58; p-value for interaction = 0.045) was confirmed in the present study. As with FVL, the presence of triggering conditions did not affect risk of developing VTE in the presence of PT20210A. Complete data on triggering conditions, again, were only available for 20 % of the databases included. Individuals who reported a family history of VTE and individuals with recurrences had a higher risk of VTE in the presence of PT20210A (positive family history, OR: 4.49; negative family history: 3.86; recurrence, OR: 4.38; no recurrence: OR, 3.08), although in neither instance was the difference statistically significant.

Fig. 3
figure 3

Forrest plot: association between Prothrombin 20210A and risk of venous thromboembolism (odds ratios are represented on log scale)

The stratification by outcome showed a very strong association of the polymorphic PT20210A variant with regard to the risk of developing a thrombosis of the cerebral venous sinus (OR: 4.52). The polymorphic PT20210A variant was significantly associated also with the other types of thrombotic events considered, with ORs ranging from 2.10 to 3.53, as shown in Table 4. Lastly, the OR for PT20210A homozygotes was 6.74 (CI 95 % 2.19–20.72), including 28 subjects among cases and 4 controls (data shown in Appendix 1).

C677T MTHFR

The overall analysis (OR: 1.38, CI: 0.98–1.93; I-squared = 69.6 %, 95 % CI: 46.2–82.8 %. Table 4, Fig. 4) and the stratification analyses indicated that the C677T MTHFR variant was associated with no consistent risk of developing a VTE (see Table 3). From the stratified analysis, only men with the polymorphic variant had a significantly increased risk of developing VTE (OR: 1.84, CI: 1.24–2.74).

Fig. 4
figure 4

Forrest plot: association between Methylenetetrahydrofolate reductase C677T and venous thromboembolism (odds ratios are represented on log scale)

Women using OCs had elevated risk associated with the C677T MTHFR variant (OR: 2.05). The C677T MTHFR variant carried a borderline significantly increased risk to develop a venous thrombosis without evidence of pulmonary embolism (OR: 1.33, CI: 1.03–1.72). No other thromboembolic event type was significantly associated with the presence of the polymorphic variant (Table 4).

Gene–Gene interactions

Notably, the presence of both FVL and PT20210A did not seem to confer additional risk for first VTE beyond the sole presence of FVL. However, given the scarcity of carriers of both variants, the confidence intervals were broad (OR: 3.423, CI: 1.65–7.13) (Table 5).

Table 5 Overall and stratum-specific adjusted odds ratios (OR) in the presence of gene–gene interaction: Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T

There was no interaction of the C677T MTHFR variant with FVL and with PT20210A in risk of developing VTE, confirming the notion that C677T MTHFR is not a risk factor for VTE. (Table 6) Because of the rarity of the variants in the general population (especially when combined), and in spite of the ample sample size considered, the analysis yielded very wide confidence intervals.

Table 6 Risk of cardiovascular thromboembolic accidents stratified by the type of event in the presence of gene–gene interaction: Factor V Leiden (FVL), prothrombin (PT) 20210A and methylenetetraydrofolate reductase (MTHFR) C677T

The stratification by outcome showed particularly increased risks for individuals with both FVL and PT20210A variants to develop a cerebral venous sinus thrombosis, although not statistically significant (OR: 7.38, CI: 0.99–55.06).

Only 5 cases and no control carried a triple combination of the gene variants investigated (double heterozygosity for FVL and PT20210A with homozygous C677T MTHFR), producing a high OR but with quite unstable confidence intervals due to the rarity of combination (Table 6).

Risk of bias within studies

Statistical tests for publication bias were not statistically significant for FVL (p-values for Beggs test: 0.492, Egger: 0.706), PT20210A (p-values for Beggs test: 0.822, Egger: 0.470) and C677T MTHFR (p-values for Beggs test: 0.360, Egger 0.373). This was also confirmed graphically by the Funnel plots for the three polymorphic variants (Figs. 5, 6, 7).

Fig. 5
figure 5

Funnel plot: publication bias for the studies on Factor V Leiden

Fig. 6
figure 6

Funnel plot: publication bias for the studies on Prothrombin 202010A

Fig. 7
figure 7

Funnel plot: publication bias for the studies on Methylenetetrahydrofolate reductase C677T

Inter-study heterogeneity was high for the FVL and C677T MTHFR. Sensitivity analysis singled out the study by Almawi et al. [17] as the main source of heterogeneity for C677T MTHFR (excluding Almawi, OR: 1.25; CI: 0.98–1.59; I 2 = 31 %, p = 0.144). As for FVL, the heterogeneity remained still high even after exclusion of the two greater contributors of heterogeneity (Emmerich et al., database 4 [9], Frederiksen et al. [29]: OR: 4.88; 95 % CI: 4.00–5.96; I 2 = 52 %, p = 0.003).

Additional analyses of the pooled data

Results did not change significantly after the exclusion of homozygous carriers of the FVL and PT20210A variants from the analyses (FVL, OR: 4.22; CI: 3.35–5.32; I 2 = 70 %, p < 0.001. PT20210A, OR: 2.79; CI: 2.25–3.46; I 2 = 44 %, p = 0.016). If it is true that both FVL and PT20210A homozygous carriers are at a higher risk of developing VTE than the heterozygous counterparts, the frequencies of homozygosity in the general population are so low that the results do not change whether they are included or excluded.

Analysis of the studies not included

Results from the tabular meta-analyses of the studies not included [46129] did not differ significantly from those yielded by the pooled analysis (Forrest plots reported in Appendixes 2, 3, 4).

Carriers of FVL had a 5-fold increase in the odds of developing VTE (OR: 4.23; CI: 4.23-5.59). This result is in line with that from the pooled analysis (p-value for heterogeneity: 0.448).

Similarly, PT20210A yielded an OR of 3.07 (CI: 2.72-3.48; p-value for heterogeneity: 0.472) and MTHFR an OR of 1.10 (CI: 0.84-1.45; p-value for heterogeneity: 0.308).

Discussion

Inherited deficiency of antithrombin (AT), protein C (PC) and its co-factor, protein S (PS), were the first identified causes of increased risk for DVT. More recently, two common gene polymorphisms were recognized as additional causes of hypercoagulability: FVL, resistant to the anticoagulant action of activated protein C, and PT20210A, associated with increased levels of circulating prothrombin. Mild hyperhomocysteinemia is also an established risk factor for thrombophilia [130].

Overall, the rare deficiencies of natural coagulation inhibitors (AT, PC, and PS) are detectable in less than 1 % of the general population and in less than 10 % of unselected patients with VTE [130]. FVL is present almost exclusively among Caucasians, with a prevalence of 5 % in the general population with European ancestry and 18 % among patients with VTE. In some European areas (Sweden, Alsace, Cyprus) the prevalence of FVL in the general population has been reported to be 10–15 % [131]. Finally, the PT20210A allele is present in 2 % of healthy individuals and in 7 % of patients with VTE [132]. Acquired factors (low intake of pyridoxine, cobalamin, folate) can produce mild hyperhocysteinemia interacting with gene factors, such as the C677T polymorphism in the MTHFR gene. Homozygous carriers can develop hyperhomocysteinemia especially in the presence of low folate levels. Among Caucasians the prevalence of the TT genotype is 13.7 %, quite similar to that found among patients with VTE, suggesting that search for this genotype is not useful per se [133]. Such figures have been recently confirmed in a large case–control Dutch study (Multiple Environmental and Genetic Assessment, MEGA, of risk factors for VTE study), recruiting 4,375 patients and 4,856 control subjects. Among the controls, the frequency of FVL, PT20210A, heterozygous MTHFR C677T, and homozygous MTHFR C677T was 5, 2, 43, and 11 %, respectively. In the patients the distribution of FVL, PT20210A, heterozygous MTHFR C677T, and homozygous MTHFR C677T was 16, 5, 43, and 10 %. Accordingly, FVL and PT20210A were confirmed to be risk factors for VTE, whereas the carriership of the MTHFR C677T polymorphism had no effect on the risk for VTE [134].

VTE is a common complex (multifactorial) disease, being the resultant of gene–gene and gene-environment interactions. Unfortunately, a simple model expressing the presence or the absence of two dichotomous factors (high-risk allele and exposure to an environmental risk factor) is not sufficient. This is due to incomplete clinical penetrance of genotypes, since not all carriers develop VTE during life, and to variable expressivity of severity and age of onset of the disease; moreover, the onset of disease is modulated also by gene–gene interactions, in the large majority of cases still obscure, and by multiple effects of environmental risk factors: these act on the genotype in a additive or multiplicative way.

The influence of genetics is supported by the fact that family history of VTE has been consistenly reported to be a risk factor for VTE independent of the presence of known thrombophilic abnormalities [11, 135, 136]. Moreover, the carriers of FVL with a family history of VTE have been reported to be more prone to VTE than those without [11].

Acquired risk factors are far more common than inherited thrombophilias. Advanced age, cancer, immobility, and recent trauma, surgery, or hospitalization are well-recognized risk factors [137139]. Important gender risk factors for VTE are pregnancy and hormone treatment. Compared to age-matched nonpregnant women, the risk of VTE is increased 5- to 10-fold during the antepartum period and 15- to 35-fold after delivery [140]; current oral contraceptive use or hormonal replacement therapy are associated with a 2 to 6-fold increased risk of VTE [141].

Our analysis confirmed a significant risk of VTE associated with the common polymorphisms FVL and PT20210A, with an overall odds ratio of 4.4 and 2.8, respectively. These figures are consistent with the large meta-analysis of Gohil et al. [4], who reported an overall odds ratio of 4.9 for FVL and 3.2 for PT20210A. Homozygous carriers of FVL were confirmed to be at higher risk of VTE (11.4-fold), consistently with the results obtained by Gohil et al. [4] (odds ratio for VTE 9.4 for FVL homozygotes. Moreover, we were able to estimate the risk for VTE associated with homozygous PT20210A, (6.7-fold), which was unexplored by Gohil et al. [4]. In agreement with current knowledge, the association of the MTHFR C677T homozygous variant with VTE was negligible, substantially confirming the results of Gohil et al. [4]. Our meta-analysis included fewer cases and controls than that of Gohil et al. [4]: 9,081/17,513 cases/controls versus 22,225/37,566 for FVL, 9,134/17,606 cases/controls versus 21,605/27,947 for PT20210A, 2,501/11,409 versus 13,570/20,935 for MTHFR C677T homozygous variant. This could be due to some differences in the selection criteria adopted in our meta-analysis in respect to those of Gohil et al. [4]: in fact, we excluded the studies focusing on individuals younger than 16 years, on recurrent thrombotic events, and family studies. More importantly, we invited the authors of the eligible studies to furnish the last version of their database. Although this policy reduced by two-thirds the number of studies included in our analysis, we are highly confident that the cases and the controls recruited in the final pooled database did non include duplicated groups of cases or controls. Contrary to Gohil et al. [4], we included cases with post-surgical or pregnancy-related VTE, to allow an evaluation of the impact of common thrombophilia on the risk of VTE more representative of real life. Finally, we also included cases with cerebral or splanchnic vein thrombosis not related to overt cancer or myeloproliferative neoplasms.

The strength of our analysis in respect to previous studies consists in the application of a multivariate model having as covariates age, gender, and presence of FVL, PT20210A and MTHFR 677.

We confirmed that the use of OCs further increases the risk of VTE associated with FVL or PT20210A, with a greater effect with the former, as previously reported [142, 143]. No extensive data are available on the interaction between the MTHFR 677 TT genotype and the use of OCs; in our analysis the use of OC produced a small increase in risk which seems not different from that expected in non-thrombophilic women. The occurrence of other triggering situations did not significantly affect the overall risk of VTE associated with FVL or PT20210A.

Our study is the largest meta-analysis so far carried out on individuals with combined abnormalities. In a previous multicenter study (included in the present meta-analysis) recruiting 51 cases of VTE with double heterozygosity for FVL and PT20210A the risk of VTE was 20-fold increased compared to the controls, none of them was a double heterozygote [144]. In the present analysis we collected the data of 187 cases and 121 controls carriers of both FVL and PT20210A (among 4,343 cases and 3,725 controls investigated for both genes) allowing a more reliable statistical analysis. The risk of first VTE resulted 3.4-fold increased, quite similar to the risk associated with either heterozygous abnormality (OR = 4.2 for FVL and 2.8 for PT20210A). Risk of first VTE associated with double heterozygosity not greatly exceeding that associated with heterozygous FVL alone could be interpreted by a weak or null additional risk due to the presence of PT20210A, as recently suggested in the clinical settings of pregnancy-related VTE [144]. Consistently with this finding, a pooled analysis of appropriate family studies showed that double heterozygotes for FVL and PT20210A, who are relatives of VTE index patients, had an increased risk for an initial episode of VTE not much higher than that estimated for those relatives who were only FVL heterozygous (pooled OR 6.7 vs. 3.5, respectively) [145].

Consistently with the null effect of the MTHFR 677 TT genotype on the risk of VTE, the combination of MTHFR 677 TT with either FVL or PT20210A, did not produce any increase in the thrombotic risk. This is in agreement with the results obtained in the aforementioned MEGA Study [134].

Despite the large sample size of this study, there are several limitations that need to be taken into account in the interpretation of results. Firstly, the studies included in the analyses come from very different populations. Sources of controls and methods for data collection were different. All these factors account, at least in part, for the observed heterogeneity among studies.

Data relative to potential confounders/effect modifiers were also collected using different methods (self-administered questionnaires, clinical assessments, interviews), and possibly different wordings. Furthermore, some potential confounders/effect modifiers could not be considered in the analyses for lack or incompleteness of the data retrieved (pregnancy status and pregnancy-related risk factors, homocysteinaemia, body mass index, diagnosis of diabetes mellitus).

Missing articles and non-responding authors are a potential source of bias. However, our results are in line with the more inclusive meta-analysis produced by Gohil et al. [4], and with the meta-analyses of Marjot et al. [146], focusing on CVTs only, and Gouveia et al. [147], relative to MTHFR only. The comparability of results is suggestive that the bias due to high proportion of non-responders is unlikely to have affected our estimates systematically. Table 7 compares the meta-analyses available in literature on this topic, in terms of endpoints considered, population included, number of studies and sample size, inter-study heterogeneity (I-squared test,  %). The statistical assessment also does not show a significant publication bias for our study.

Table 7 Comparative table of tabular meta-analyses published on association between polymorphic variants of VFL, PT202010A, MTHFR and risk of first VTE

Furthermore, as confirmed by the tabular analysis of the studies not included, the results yielded by our pooled analysis are, in all likelihood, reliable in spite of the high proportion of studies that could not be included. This might be also due to the fact that we were able to include all the biggest studies on the topic, and that most of the non responders were authors of relatively smaller studies. Our pooled analysis yielded more conservative results for FVL and PT20210A then Gohil [4] and then those from our meta-analysis of studies not included—albeit this difference is not statistically significant. Again, this could be due to the fact that bigger studies tend to be, generally, more conservative than smaller ones, and to the fact that, for each polymorphic variant, in the pooled analysis we adjusted by the presence of the other two polymorphic variants.

One limitation of this study is that ethnicity could not be analysed at an individual patient level, given the paucity of responses on this. Most of the studies in the analysis are from Western European countries, and in some of the studies it is specified that all–or most–subjects enrolled are white Caucasians. However, especially for what concerns studies from the Americas, or from European countries with high prevalence of non-Caucasian groups (such as France, the United Kingdom or the Netherlands), we felt that using the country of residence to assume ethnicity would have been too big of an assumption.

In conclusion, FVL and PT20210A are confirmed in a large meta-analysis to be moderate risk factors for VTE in the adult population, with an associated risk further doubled in the homozygotes in comparison with the heterozygotes, suggesting a gene–gene additive effect; in contrast, the MTHFR 677 TT homozygous genotype is not a risk factor. Finally, the double carriership of FVL and PT20210A seems to produce an impact on the risk of VTE weaker than previously thought.