Introduction

After several decades, warfarin continues to be the most prescribed oral anticoagulant worldwide [1, 2], notwithstanding the increasing use of new oral anticoagulants (DOAC) [3, 4]. However the clinical management of warfarin may be challenging because of its narrow therapeutic window along with within- and between- subject variability of dose required.

Individual warfarin response variability has been found to be associated with clinical factors (e.g., age, gender, compliance, nutritional status, presence of comorbidity, and concomitant medications), and polymorphisms involved in the metabolic route and warfarin effector/response pathway. Therefore, a number of models, which include both clinical and genetic factors, have been developed to predict warfarin dose requirements, but their usefulness is still under debate [5].

In 2009 the International Warfarin Pharmacogenetics Consortium (IWPC) proposed an algorithm that includes both clinical and genetic data [6]. The IWPC algorithm has been externally validated in other countries, mostly in small single-center studies that included adult patients irrespective of age [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. A recent meta-analysis showed that the proportion of warfarin doses that were systematically underpredicted by the IWPC algorithm in patients requiring higher than average doses was as high as 95.8% (95% confidence interval 92.4–97.9%) [35].

The predictive ability of the IWPC algorithm is particularly questionable in high-risk elderly people, who represent a highly vulnerable population because of increased risk of (i) bleeding during treatment with vitamin K antagonists, (ii) reduced metabolic clearance, and (iii) possible interactions with drugs used for comorbidities. In 2012 we planned the ‘Validation of IWPC algorithm in high-risk elderly patients (VIALE)’ study to validate the IWPC algorithm in a prospective cohort of high-risk elderly people (≥65 years) with at least one comorbid condition (clinicaltrials.gov identifier NCT02069132).

Materials and methods

Study design and participants

All patients aged 65 years or more, referring to six public hospitals, who first required treatment with warfarin because of either medical (non valvular atrial fibrillation) or surgical conditions (heart valve replacement), who had at least one comorbid condition, and regularly used two or more other drugs besides warfarin were prospectively recruited. Patients with systemic coagulopathies and malignancies needing chemotherapy were excluded.

Intervention

The investigators were allowed to adjust the dose of Warfarin or discontinue the treatment according to clinical practice, blind to genotype assessment. No additional interventional procedures were required.

IWPC algorithm

The IWPC pharmacogenetic algorithm [6] includes both clinical and genetic data: age, height, weight, race, VKORC1 genotype, CYP2C9 genotype, and use of drugs that are CYP2C9-inducer (rifampicin, phenytoin, and carbamazepine) or inhibitor (amiodarone). A different version of the algorithm that also included smoking and target INR was proposed in 2012 within the CoumaGen-II trial [36]. IWPC dosing algorithms 2009 and 2012 are reported in the Supplementary Table 1.

Variables

Baseline variables included demographic and clinical information, primary indication for warfarin treatment, risk factors such as smoking and drinking status, baseline and target INR, medical/surgical history, starting warfarin dose, comorbidity according to the Cumulative Index Rating Scale (CIRS) [37] and use of concomitant medications.

Follow-up (FU) visits were performed according to the clinical practice of each center and patients’ need. Maximum length of FU was set equal to 1 year. At each FU visit data on warfarin dose changes, INR levels, occurrence of cardiovascular, and cerebro-vascular (CCV) events and assumed drugs were recorded.

To assess the genetic variants included in the IWPC model (CYP2C9*2 [rs1799853], CYP2C9*3 [rs1057910] and VKORC1–1639G>A [rs9923231]), peripheral blood samples (4–5 ml) were collected in BD Vacutainers, containing EDTA as anticoagulant and stored at temperature of −80 °C, or −20 °C, for a period of maximum 3 months before shipping (in dry ice, to prevent defrost) to the central laboratory in Salerno. Genomic DNA was extracted using E.Z.N.A.® Blood DNA Kit (Omega Bio-Tek). Quantification and quality analysis of DNA was performed using a NanoDrop 2000c spettrophotometer. A BeadXpress Reader using Illumina VeraCode GoldenGate Assay Kit was used for genotyping. A total of 500 ng of DNA was used per assay.

Outcomes

Consistently with the 2009 IWPC paper [6], the primary outcome was the percentage of patients whose dose of warfarin, predicted by IWPC algorithm, was within 20% of the actual stable maintenance dose. The stable dose provided by centers was used for analyses, as defined in the Section 2 of Supplementary Appendix 1 of IWPC original paper [6]. All doses are reported as weekly doses.

Clinical and laboratory secondary endpoints were also evaluated:

  • overall incidence of CCV events either in the first year of warfarin treatment or in the first four weeks of warfarin treatment, defined as the occurrence of any one of death for any cause, hospitalization for CCV events, major bleeding, or thromboembolism;

  • overall incidence of major bleeding in the first year of treatment, defined as the occurrence of fatal bleeding, or symptomatic bleeding in a critical area or organ, or bleeding causing a fall in hemoglobin level of ≥2 g/dL or leading to transfusion of two or more units of blood or red cells [38];

  • overall incidence of thromboembolism in the first year of treatment, defined as the occurrence of cerebral infarction, or myocardial infarction, or peripheral arterial embolism;

  • time to therapeutic INR, defined as the time of first achieving INR measurement within the individual’s target range, providing that INR was also within the target range at the subsequent clinic visit;

  • percentage time (%TIR) in the therapeutic INR range during the first 3 months of treatment;

  • percentage time (%TIR) in the therapeutic INR range during the first four weeks of treatment.

As for patients lost at laboratory FU clinical information was retrieved through phone calls to patients’ home or from administrative registry offices of the patients’ towns of residence.

Sample size

Primary endpoint was the percentage of subjects who had a predicted dose within a range of ±20% of the stable dose [6], and sample size was calculated to achieve a predefined precision of the estimate. Initially a precision of ±3% of the 95% confidence interval (CI) was desired and 1067 subjects were required. Eventually the recruitment rate was much lower, and the actual sample size lead to a precision of CI of about ±5% of the estimate.

Statistical analysis

All statistical analyses were performed using both the 2009 [6] and the 2012 [36] version of IWPC algorithm in the whole sample and separately by primary indication (medical or surgical).

Warfarin dose distributions were graphically depicted using the box-and-whiskers plots. A scatter diagram showed the relationship between the actual and predicted warfarin therapeutic doses. Differences of percentages between groups were evaluated by means of chi-square test or Fisher’s exact test if needed. Mean differences between groups were tested with Student’s t test or Mann–Whitney test when normality assumption was unmet. The accuracy of the IWPC prediction algorithm was evaluated using linear regression models with the actual dose as dependent variable and the predicted dose as independent variable. Validity of the predictive algorithm was confirmed when the composite null hypothesis α = 0 (intercept) and β = 1 (slope) was not rejected at the two-tailed α level of 0.05. The determination coefficient (R2) measured the proportion of total variation of the actual therapeutic dose explained by the predicted dose. As measures of accuracy we also calculated the mean prediction error (MPE), defined as the average of the differences between the predicted and the actual dose, and the mean absolute error (MAE), defined as the average of the absolute value (in the mathematical sense) of the difference between the predicted and the actual doses. MAE is usually reported as a measure of predictive accuracy, but rather is a measure of variability of the difference distribution, with a similar interpretation of root mean square error [28].

The clinical usefulness of the IWPC algorithm was further quantified by the percentages of correct identification within subgroups of increasing levels of the warfarin actual dose.

Statistical analyses were performed with R software version 3.0.1 (R Foundation for Statistical Computing).

Results

Three hundred and seventy-six patients were enrolled between March 2013 and May 2016 (142 with medical and 234 with surgical indication). Baseline patients’ characteristics are shown in Table 1. All patients were Caucasian. Overall, most subjects (84%) had INR target between 2 and 3. However 55 patients (14%), almost exclusively with surgical indications, had target INR between 2.5 and 3.5. As expected most patients received a warfarin starting dose of 35 mg/week, but higher starting doses were also observed in medical patients. Relevant differences between medical and surgical subjects were found for age, NYHA class, comorbidity and INR target. The allelic frequencies of the CYP2C9*2, CYP2C9*3 and VKORC1-1639G>A SNPs were similar to those reported in other Caucasian populations [39, 40]. Comorbidity distribution as assessed by the CIRS index is reported in the Supplementary Table 2.

Table 1 Baseline characteristics of study patients.

Stable maintenance dose was achieved in 283 (75%) patients without differences according to indication: 105/142 (74%) patients with medical indication and 178/234 (76%) with surgical indication. Overall characteristics of patients who achieved or did not achieve maintenance dose were similar (Supplementary Table 3), even though the latter ones were slightly older (76.5 vs 74.2 years on average) and had a higher prevalence of previous myocardial infarction (15.1% vs 8.1%). Twenty out of 93 (21.5%) patients, who did not achieve maintenance dose, discontinued warfarin treatment to move to DOAC or other anticoagulant, while 25/93 (26.9%) withdrew consent, possibly for the same reason. Consistently patients who did not achieve stable dose stayed less on warfarin therapy.

Distributions of warfarin maintenance doses reported by centers and predicted by the IWPC algorithm, using both the original version of 2009 [6] and the updated version of 2012 [36], are reported in Fig. 1. Median actual dose was equal to 20.3 mg/week (interquartile range, IQR, 14.1–27.7), much lower than the starting dose of 35 mg/week; median values in patients with medical and surgical indications were 17.5 mg/week (IQR 8.8–26.3) and 21.4 mg/week (IQR 15.9–28.9), respectively. On average, the IWPC 2009 predicted doses were higher than the actual therapeutic doses: overall median indeed was equal to 23.8 mg/week (IQR 19.0–28.7), and median values in patients with medical and surgical indication were 23.8 mg/week (IQR 17.8–28.2) and 23.9 mg/week (IQR 19.3–29.2), respectively. The analogous values predicted by IWPC 2012 algorithm were still higher.

Fig. 1
figure 1

Box-and-whiskers plots of the actual and IWPC predicted doses, by primary indication.

The relationship between actual and predicted doses according to the 2009 (upper panel) and 2012 (lower panel) version of the algorithm is shown in Fig. 2. In the whole sample (Fig. 2, left panel) both intercept and regression coefficients were significantly far from the null values, and R2 was as small as 0.31. In medical patients (Fig. 2, middle panel) fitted values diverged strongly from the perfect prediction with very low R2 value (0.15 and 0.17), while a slightly better fitting was found in surgical patients (Fig. 2, right panel), with R2 values being equal to 0.41 and 0.39. In the whole sample the MPE was equal to 1.8 and 3.3 mg/week with the 2009 and 2012 version of the algorithm, respectively, slightly lower than corresponding median values (2.3 and 3.7 mg/week). Overprediction bias was greater in medical than in surgical subjects; MPE indeed was equal to 3.4 mg/week and 0.9 mg/week in medical and surgical patients, respectively, with IWPC 2009. Overprediction increased further with IWPC 2012, MPE being equal to 4.7 and 2.5 mg/week in medical and surgical patients, respectively. The MAE was equal to 7.64 mg/week in the whole sample and to 9.31 and 6.65 mg/week in medical and surgical patients, respectively, with IWPC 2009, and increased further with IWPC 2012.

Fig. 2
figure 2

Scatterplots of the actual and IWPC predicted doses, by primary indication.

The percentage of patients whose dose of warfarin, predicted by IWPC 2009 algorithm, was within the 20% of the actual dose was equal to 37.5%, (95% CI 32.0–43.3%), better in surgical (43.3%, 95% CI 36.1–50.7%) than in medical patients (27.6%, 95% CI 19.8–37.1%) (Table 2). Overprediction was always 2–3 times higher than underprediction. Similar results were found with IWPC 2012.

Table 2 Accuracy of IWPC prediction with a relative errora threshold of 20%, by primary indication.

A deeper investigation of the ability of the pharmacogenetic algorithm to correctly identify patients requiring lower or higher therapeutic dose is reported in Table 3. The IWPC algorithm largely overestimated the dose in patients requiring very small doses, while underestimated the dose in the few patients requiring large doses. Results did not substantially change when analyses were repeated with the 2012 version of the IWPC algorithm (data not shown).

Table 3 Accuracy of IWPC 2009 predicted doses according to increasing levels of warfarin actual dose in the whole sample (A) and separately by medical (B) and surgical (C) indication. Absolute figures (%) are reported.

The occurrence of clinical and laboratory secondary outcomes in all patients is reported in Table 4. Forty-two (11.2%) cardio- and cerebro-vascular (CCV) events were observed in the first year of treatment, 14 (3.7%) of which in the first 4 weeks of treatment. The risk of bleeding was higher than the risk of thromboembolism (3.7% and 2.4%, respectively). Median time to first INR in target was nearly double for medical patients (12 days) with respect to surgical subjects (7 days). Overall, the percentage time in the therapeutic INR range was 52% in the first 3 months and 39% in the first 4 weeks of therapy, larger for medical than for surgical patients. About a third of subjects experienced at least one INR value ≥4.

Table 4 Clinical and laboratory secondary outcomes, by primary indication, in all patients.

Discussion

To our knowledge VIALE study is the first prospective study that assessed the predictive accuracy of the IWPC algorithm in a cohort of high-risk elderly people with comorbidity. In such a vulnerable population maintenance doses of warfarin were very low on average, particularly in medical subjects, and the IWPC algorithm substantially overestimated them. Only a third of patients had a predicted dose within a relative error of 20%. Further the pharmacogenetic algorithm only explained 30–40% of the warfarin dose variability (R2), and this value declined to only 15% in medical patients. Results did not change substantially when the 2012 version of the IWPC algorithm was used.

Anticoagulant doses were particularly low in medical subjects possibly as a consequence of the reduced metabolic clearance in elderly people [41] and prudence of doctors in the management of high-risk elderly patients. Conversely, higher doses were found in patients with surgical indications, leading to a slightly improved prediction. In these subjects higher doses might be expected as a consequence of higher target INR, but we cannot exclude that more accurate choice of surgical patients could have induced a selection of patients with less comorbidities.

In our study a quarter of subjects were unable to achieve a stable maintenance dose without differences between medical and surgical patients. As usually in the literature we did not take them into account when evaluating the predictive accuracy of the algorithm, but, from a clinical viewpoint they should contribute to get it worse. These percentages are lower than other figures previously reported in the literature [42,43,44]. Chappell et al. [42, 43] found that nearly 50% of subjects were unable to achieve the defined INR target during titration to a stable warfarin dose, whereas in Roberts et al. [44] about 55% of patients were ineligible due to maintenance INR outside the acceptable range. Different definitions of stable maintenance dose, characteristics of population, and setting of recruitment could explain discrepancies, that could heavily affect the clinical usefulness of pharmacogenetic algorithms.

In the only previous study that specifically addressed the predictive accuracy of the IWPC algorithm in elderly people, the algorithm also failed to identify older patients requiring low daily doses of warfarin; however, this study included a small number of subjects and in principle was prone to selection bias because of the retrospective design [12]. Most of the studies that validated the IWPC algorithm did not have age limits, and either excluded patients with comorbidity or did not report at all this information (Table 5). Moreover, almost all studies had a retrospective design and included only patients who achieved stable dose with possible selection biases.

Table 5 Studies who externally validated the IWPC algorithm.

Yan et al. [45] also report that more than 86% of patients receiving lower doses of warfarin (<13.16 mg/week) were overpredicted by the algorithm, while Saffian et al. [35] showed a systematic underprediction in patients requiring higher than average doses. The pharmacogenetic algorithm indeed predicts quite accurately warfarin dosing requirements on average, but possibly both higher- and lower-dose patients do not benefit substantially from this information. This is elucidated by the low percentage of warfarin dose variation explained by the model that in our study was as low as 31% and declined to a very low value of 15% in medical patients. Since published pharmacogenetic algorithms mainly include variables associated to a reduction in warfarin dose possible improvements of predictive accuracy in high-risk elderly people might be attained either encompassing comorbidity information into the algorithm or evaluating more complex non-linear models.

Our clinical findings were in line with literature results. Overall incidence of major bleedings was 3.7%, similar to the value of 3.5% reported by Burmester et al. [46] in relation to standard clinical warfarin therapy management. On the other hand, our incidence of thromboembolism was slightly higher than 2% reported by Verhoef et al. [47] in elderly subjects with atrial fibrillation. In purely surgical series, the rate of heart valve prosthesis-related thromboembolism ranged between 0 and 4 per 100 patients/year, whereas the rate of hemorrhagic events ranged between 0.2 and 7.2 per 100 patients/year [48].

Our study has several strengths. To our knowledge, it is the first prospective trial that attempted to externally validate the IWPC algorithm on vulnerable elderly people (≥65 years) only. Secondly, only incident cases were prospectively recruited, and we were able to identify patients unable to achieve dose stability, who are typically excluded from retrospective studies. Thirdly, we included patients with both medical and/or surgical indications in the attempt to provide a more complete picture of warfarin use in elderly population. Lastly, to our knowledge, this is the first study that assessed generalizability of the 2012 version of the IWPC algorithm.

Our study only included Caucasian subjects, although the predicted dose should not be affected, since the IWPC algorithm entails different coefficients for non Caucasian people. A further limitation of our study is sample size, that was smaller than the planned one; accordingly the precision of our estimates was reduced from the planned ±3% to about ±5%. However our study still remains larger than most previous trials, and our estimate was quite as precise as that observed in the original paper [6]. The reduced recruitment was mainly due to the increasing use of DOAC in clinical practice, which prevented the recruitment of elderly warfarin-naive patients with medical indication and favored withdrawal from the study. Sample size was further reduced because of subjects who did not achieve the maintenance dose. About half of them withdrew from the study, likely to move from warfarin to DOAC or other anticoagulant; another possible explanation could be the clinical complexity of the study population. Unfortunately, almost none of the studies reported the percentage of unstable patients thus interpretation is difficult.

The IWPC algorithm overpredicted warfarin maintenance doses in high-risk elderly people, mainly in patients with medical indication. More tailored models, possibly including comorbidity information, are to be evaluated in such a vulnerable population.