Background

Lost to follow-up is a problem in long-term follow-up studies concerning total knee arthroplasties (TKA). It implies patients who were participating in a clinical research trial but became lost at the time of follow-up. Lost to follow-up is inevitable in most cohort studies [1].

Discussion exists whether patients being lost to follow-up have a different outcome in comparison with the patients who do attend follow-up appointments [24]. Murray et al. showed that lost to follow-up matters [2]. In the likely case that this occurs, a worst case scenario analysis should be carried out [5]. The traditional worst case scenario analysis based on the Kaplan-Meier method considers all patients to have undergone revision surgery at the time of lost to follow-up. However, this assumption is rather conservative and unrealistic.

First, the timing of revision surgery in the worst case scenario is unrealistic considering the notion that the assumed revision could take place at any time in the period between the time the patient becomes lost to follow-up and when the patient dies.

Second, another shortcoming of the traditional worst case scenario is its application of the Kaplan-Meier (KM) method. Although the KM method is a well-known tool for estimating survival probabilities of total knee arthroplasties, it leads to incorrect and biased results, in situations where competing events are involved [611]. This is because the KM estimator was developed to estimate one single endpoint, the probability of death that occurs to everyone given a long enough follow-up [12]. Death is, obviously, an event which eventually will happen to all patients, so there is no competing event that prevents death from happening.

Contrary to death, failure of a prosthesis is an event that may be prevented from occurring due to competing events. Patients are likely to experience events other than loosening of the prosthesis (e.g. death), because the survival of TKA at ten years follow-up generally exceeds 90 %. Consider for example that a patient dies before developing loosening of the arthroplasty. In that case death prevents a revision for loosening from being performed, i.e. after a patient has deceased the prosthesis will never be revised. Therefore death is a competing event for revision of TKA. The cumulative incidence estimator has been developed to overcome the problems with KM in studies where competing events are involved [69, 13].

Therefore in this study we propose a novel methodology for dealing with lost to follow-up patients without making extreme and unrealistic assumptions (i.e. all prostheses lost to follow-up being revised at the moment of lost to follow-up) and employ for the analysis the competing risk approach.

Methods

A cohort of 112 patients (93 female/19 male) forms the basis for this study. All patients underwent primary cemented total knee arthroplasty (TKA) of the total condylar type-I (Zimmer, Warsaw, Indiana) featuring an all-polyethylene semi-constrained tibial component. The 143 procedures were performed between 1979 and 1990. The follow-up was up to 25 years (mean ten; range one to 25). The mean age at operation was 68 (range 30–82) years. Rheumatoid arthritis was the diagnosis in 73 % of all patients; the other 27 % of the patients suffered from osteoarthritis. The endpoint of interest was revision of the tibial component for any reason.

A glossary is available for definitions of terminology used in this article.

Ethics

This study was conducted after approval by the local institutional review board (Reference number P09.244).

Competing risks in the worst case scenario

Competing risks models concern the situation where more than one cause of failure is possible. In our setting death and failure of a prosthesis are the two competing events. To estimate the probability of each competing event, including the one of interest, one has to use a cumulative incidence estimator [13]. The cumulative incidence function of cause k is defined as the probability of failing from cause k before time t. In the competing risks framework when a patient experiences one of the competing events, it is taken into account that he or she does not have the same probability of revision surgery (implant failure) as other individuals, who have not failed from either causes [1419].

There are many situations in orthopaedics in which competing risks are likely and for which the cumulative incidence estimator must be considered. In our setting death and prosthesis failure are the two competing events.

For more technical details about competing risks see the Appendix.

The lost to follow-up gap

The traditional worst case scenario shows the effect of lost to follow-up on the estimated survival probability by choosing the time of last known follow-up as the moment of revision [5]. However, this assumed revision could have taken place at any time between the moment a patient is lost to follow-up and the moment a patient has died or the end of study. We define this period as the "lost to follow-up gap" (LFU gap). In order to determine the LFU gap we checked the date of death by consulting the community registry.

Revision during the LFU gap could have taken place at any possible moment. We consider three different scenarios:

  1. 1.

    Worst-worst case scenario’ considers time to revision at time of lost to follow-up as described by Murray et al. [5].

  2. 2.

    Medium-worst case scenario’ considers time to revision in the middle between time of lost to follow-up and time of death.

  3. 3.

    Best-worst case scenario’ considers time to revision just (seconds) before date of death.

We plot the cumulative incidence of the estimated revision probability corresponding to the “three worst case scenarios” to describe the effect of the lost to follow-up.

Since it is unlikely for the patient lost to follow-up to be revised just after the last follow-up or just before death, the medium-worst case scenario may therefore be considered to give on average the best estimate [20].

The new method

In TKA studies not every patient will be revised, i.e. some patients die during follow-up, some stay alive without being revised, and some will be lost to follow-up. Consequently, there are six different situations possible in studies of TKA with endpoint revision. The competing risks model depicts the situations described below:

  • Situation A: Patients with known revision of the arthroplasty during follow-up. Time to event is time to revision of the TKA.

  • Situation B: Patients without revision of the arthroplasty during follow-up and alive at end of study. Time to event is time to end of study.

  • Situation C: Patients without revision of the arthroplasty during follow-up and who have died during the study. Time to event is time to date of death.

  • Situation D: Patients lost to follow-up and alive at end of study. For estimating time to event we use the medium-worst case scenario. In this situation the end of the LFU gap is the end of the study. Time to event is time to halfway of the LFU gap.

  • Situation E: Patients lost to follow-up and who have died during the study. For estimating time to event we use the medium-worst case scenario. In this situation end of the LFU gap is the date of death. Time to event is time to halfway of the LFU gap.

  • Situation F: Patients lost to follow-up and for whom it is uncertain if they are alive at the end of study. Due to lack of knowledge of (possible) date of death in this situation, the LFU gap cannot directly be reconstructed. However, it is possible to estimate this date of (assumed) death with life expectancy based on patient’s gender and age at time of LFU using national mortality statistics. (In the Netherlands information of Statistics Netherlands can be used [21].) For estimating time to event we use the medium-worst case scenario. In this situation the end of the LFU gap is date of the estimated date of death. Time to event is time to halfway the LFU gap. Note: the estimated time of revision should always be within the study follow-up limits.

In all six situations time to event is estimated using the cumulative incidence estimator with the Mstate R library [22].

Results

Traditional Kaplan-Meier estimator

Two revisions for aseptic loosening were performed during the 25-years follow-up: one at 3.8 years and one at 20 years postoperative. One revision for septic loosening was performed at 3.7 years postoperative. Mean survivorship of the all-polyethylene tibial component with revision for any reason as endpoint according to the traditional Kaplan-Meier analysis was 98.3 % (95%CI 95.9–100 %) at ten years and 92.5 % (95%CI 81.3–100 %) at 25 years.

Competing risks in the worst-worst case scenario

In Fig. 1 the cumulative incidence for death and implant failure are shown; even in the worst case scenario patients are more likely to decease then to be revised.

Fig. 1
figure 1

Plot depicting cumulative incidences of the competing events in the worst case scenario

In our study the KM method overestimates the probability of revision of the all-polyethylene tibial component with revision for any reason as endpoint in the worst-worst case scenario by 30 %, 55 %, 64 % and 73 % at ten, 15, 20, and 25 years, respectively.

Lost to follow-up gap

For 77 patients (97 TKA) there was a complete follow-up with a mean of 10.6 years (SD 8.0). The remaining 35 patients (46 TKA) were considered lost to follow-up at a mean follow-up of 9.6 years (SD 4.7). For 25 of these patients (33 TKA) it was possible to determine the date of death and therefore the LFU gap. The LFU gap is illustrated in Fig. 2. The average time span of the LFU gap was 6.3 years (SD 4.0).

Fig. 2
figure 2

Illustrates the lost to follow-up gap (LFU gap) in a Gant chart. The blue lines show the known period of the follow-up of the 33 patients lost to follow. The red lines show the period of the LFU gap of the 33 patients lost to follow-up

Figure 3 shows that the revision rate is different throughout the study time for each of the three worst case scenarios (worst-worst case, medium-worst case and best-worst case). The relative difference of the revision rate in the three worst case scenarios (best-worst case, medium-worst case, worst-worst case) was 33 %. This figure shows timing of the assumed revision is vital and the medium-worst case can be considered to give on average the best estimate.

Fig. 3
figure 3

Figure showing the cumulative probability of implant failure for all three worst case scenarios (worst-worst case, medium-worst case and best-worst case) using the cumulative incidence estimator. The time of the revision matters

The new method

Six different situations were identified covering all possible situations in long-term follow-up for total knee arthroplasty (Fig. 4). In our study patients ended up in five of the six situations (see Table 1). Estimating time to event of all patients in these six situations while using the cumulative incidence estimator gives the cumulative probability of implant failure with the new method. The new method—considering all patients lost to follow-up as revised—shows a twofold reduction in revision rate compared to the worst case scenario with Kaplan-Meier as shown in Fig. 5.

Fig. 4
figure 4

The six different possible situations in long-term follow-up for total knee arthroplasty (TKA) for revision for any reason of the tibial component

Table 1 Number of total knee arthroplasties (TKAs) in all six possible situations
Fig. 5
figure 5

Cumulative probability of revision of the all-polyethylene tibial component for revision for any reason as endpoint estimated with the traditional worst case scenario using Kaplan-Meier (KM) and the new method using the cumulative incidence estimator (CR)

Discussion

The aim of our study was to illustrate how to deal with patients lost to follow-up in total knee replacement studies. Our newly developed method based on the competing risk approach shows a twofold reduction of the revision rate in comparison with the traditional worst case scenario estimated by Kaplan-Meier. Our method comprises the identification of competing risks and the correct estimation of the survival using the cumulative incidence estimator. As in the traditional worst case scenario, in our new method all patients lost to follow-up are still considered as being revised.

This is the first study since Murray et al. that gives a pragmatic method to deal with patients lost to follow-up [5]. While lost to follow-up should be avoided, this is not always possible. For this reason, we proposed an instrument that can deal with these patients. Our new method provides a statistically correct and more realistic tool in comparison with the traditional worst case scenario. The only additional requirement is some additional information of patients, including date of death, gender, and age at the time of lost to follow-up.

Six different situations were identified covering all possible situations in long-term follow-up for total knee arthroplasty. In our study there was no patient in situation D (patients lost to follow-up and alive at end of study). However, it could be possible to have patients in this situation. For example, a patient is unwilling to participate in the follow-up of the study and does not give consent to use his medical information. Therefore the patient is alive, but revision cannot be ruled out.

We used the cumulative incidence estimator to accurately estimate the probability of implant failure [7, 13]. Although this method was first introduced in 1978, it has only recently been proposed in orthopaedic literature. In accordance with Biau et al., Fennema et al., and Gillam et al. our study shows an overestimation of KM in an orthopaedic risks setting [6, 811].

While our method concerning lost to follow-up is developed in a cohort of total knee arthroplasties, it may be also applicable for other types of joint replacement studies and other endpoints such as aseptic loosening. Nelissen et al. showed that endpoint definitions have impact on the outcome of studies concerning total knee arthroplasties [23]. In our study we focused on the endpoint revision for any reason. However, when the endpoint is revision for aseptic loosening, one should be aware of the competing risk framework, i.e. several competing events can appear. Not only the competing risk ‘death’ can prevent the event of interest ‘aseptic loosening’ to occur, but also the event ‘septic loosening’ of the prosthesis.

Although the best measure is to prevent lost to follow-up, this may be unavoidable, especially for long-term follow-up studies. In situations where lost to follow-up does occur, the new method proposed in this study offers an efficient and elegant approach to deal with this problem.