Introduction

Total knee replacement is a highly successful procedure for the treatment of end-stage knee arthritis. It has achieved this by relieving the debilitating pain associated with this condition and the procedure has resulted in restoring functional independence and quality of life [1, 2].

Total knee replacement as currently performed uses manufacturer provided instrumentation that (co-opts) took cognizance of the anatomy of the patient’s knees [3]. The instrumentation enables reproducible cuts on the bone, thereby ensuring a more accurate prosthetic sizing and (implantation) and fitting.

Refinements in metallurgy and component manufacture, improvement in pre-operative management, and surgical techniques have enabled total knee replacement to evolve into a reliable and reproducible procedure with high success rate with a 95% survival at 10–15 years [4].

However, despite the successes of knee arthroplasty, there is a small but constant number of patients whose knee replacements fail early. This is partly attributed to some areas of imprecision in the performance of total knee replacement using the conventional (manual) instrumented technique [3]. Furthermore, up to 20% of patients are dissatisfied with their ‘new’ knee following total knee replacement [5].

Various studies have consistently shown that with the use of the conventional instrumented techniques, a failure to achieve a post-operative mechanical alignment within the optimal range of 0–3 degrees of valgus occurs in 10–38% of total knee replacements [6, 7].

Computer navigation, on the other hand, with its novel kinematic support has delivered improved precision of technique in knee arthroplasty surgery with subsequent reduction in the error rates observed with the conventional instrumented technique [8,9,10]. The question that is often asked is whether this improvement in precision will translate to improvement in functional outcome and reduction in revision rates.

The aim of this systematic review, therefore, is to assess the functional outcome of imageless computer navigation-assisted total knee replacement compared with conventional instrumented knee replacement at mid-term follow-up using prospective randomised control studies and prospective non-randomised cohort studies. Furthermore, our aims were to also assess other aspects of treatment outcome including complication rates and survival of implanted knees.

Methodology

This study is carried out according to PRISMA guidelines [11], and was limited to prospective randomised control studies and prospective cohort studies. The intervention of interest is imageless computer navigation modalities in total knee replacement. Inclusion criteria are as per the PICO format. The population is patients with end-stage knee osteoarthritis undergoing total knee replacement. Intervention is imageless computer-navigated total knee replacement and the comparator is the conventional instrumented total knee replacement. Primary outcome measure is mid-term functional outcome following total knee replacement with a minimum follow-up period of 4–5 years post-operation. Secondary outcomes reviewed included peri-operative and post-operative complications, patient satisfaction, and implant survival.

Excluded from this review were Non-English language studies, animal studies, retrospective studies, cases including femoral mal-union, cadaveric studies, case series, case reports, systematic reviews, revision surgeries, and review articles. Other types of navigation apart from imageless navigation were also excluded as are robot-assisted total knee replacement procedures. In addition, outcomes less than 4 years were equally excluded.

Search Strategy

The searches were carried out independently by authors (CA, TA, and ON) and the selected abstracts agreed by consensus. The databases searched included Medline OVID (1997–January 2015), The Cochrane database of systematic reviews and randomised control trials (1997–January 2015), EMBASE (1997–January 2015), Cochrane and DARE (Database of Abstracts of Reviews of Effectiveness), Scopus (1997–January 2015), and Web of Science (1997–January 2015).

To complement the searches, screening reference lists of relevant articles, systematic reviews, and meta-analysis and other studies were carried out on relevant papers that may have been missed by the more formalised database searches.

Search terms included MESH (Medical Subject Headings) and ‘Free text’ keywords within the title and abstract fields. Similar terms were combined with OR, while concepts were combined with AND. The search terms used included; arthritis, osteoarthritis, knee arthritis, knee osteoarthritis, arthroplasty, replacement, knee, computer-assisted surgery, computer navigation, navigation, computed, treatment outcome, and functional outcome.

Study Selection

The papers were screened for eligibility by reading titles and abstracts of the studies. This was carried out independently by the three authors. Selected studies are further screened against the inclusion criteria for eligibility for inclusion in the systematic review by reading the full-text articles (Fig. 1). Those rejected at the full-text stage are noted and reasons for exclusion at this stage provided. Any disagreement was resolved by consensus. Figure 1 shows flowchart of the study selection process.

Fig. 1
figure 1

Flowchart of the study selection process

Data Extraction

Data extracted included: study characteristics including author and year of publication, type of publication, participant numbers and numbers per category, and patient demographic data including age (mean and standard deviation) and gender. The other data extracted included: the type of intervention and control, the details of intervention including type of navigation, and the duration of the surgery (Table 1). The post operative management of both groups (imageless computer navigation and conventional instrumented total knee replacement) was noted. Outcome data extracted included peri-operative complications, post-operative outcomes including range of motion, functional outcome, satisfaction rates, and revision rates. Data extraction tool was adapted from the Cochrane data extraction form and was piloted on three papers by two of the authors (CA, TA).

Table 1 Characteristics of included studies

Study Quality and Risk of Bias

Cochrane risk of bias tool was used to rate the internal validity of the selected studies. The Cochrane risk of bias tool is grouped into five risk groupings with seven corresponding domains. The assessment of risk of bias for each domain was determined as low risk of bias, unclear risk of bias, and high risk of bias [12].

Statistics

R version 3.1.2 (2014-10-31) [13] was used for all analyses. Meta package [14] was used for R to produce the DerSimonian–Laird estimates. Global estimates and 50% and 95% confidence intervals (CI) were obtained for each analysis. Raw data for meta-analysis are in Appendix C.

Results

A search of the electronic databases revealed a total of 479 studies. Medline revealed 292 studies, Embase revealed 39 studies, Scopus revealed 89 studies, Web of Science revealed 100 studies, and Cochrane database search revealed 133 studies. Grey literature search revealed a further 15 studies. Thirty-nine (39) duplicate studies were identified and removed leaving 440 studies. Screening of these studies by titles and abstract led to the exclusion of 389 studies. The full-text articles of the remaining 51 studies were assessed by the authors with 40 studies excluded for the following reasons; 30 had short-term follow-up/alignment outcome only, 2 were laboratory studies, 2 were robotic-assisted navigation, and 4 were retrospective studies with 5-year follow-up. This left 11 papers which are the subject of this review.

The studies included involve 2206 participants with 1099 allocated to the imageless navigation-assisted total knee replacement arm and 1107 assigned to the conventional instrumented total knee replacement arm of the study. Two studies involved bilateral total knee replacement with one knee assigned to one technique and the other knee to the alternative technique of knee replacement [19, 24].

Total knee replacement was carried out in all cases allocated to the navigation arm of the study using imageless computer navigation assistance. Optical/infrared referencing was used in ten studies with a study [21] using electromagnetic referencing for the navigation.

Two studies used the navigation system for the femoral cuts only and conventional surgical procedure to conduct the tibia cuts of the knee replacement [20, 23]. In the conventional arm of the study, manufacturer supplied instruments and jigs were used according to the manufacturer’s specification. All surgical exposures were carried out identically in each arm of the study and ten studies involved standard length parapatellar exposure, and in one study [21], minimal incision surgery was utilised in both arms of their study. A total of nine different implants by different manufacturers were used in the 11 studies with 3 studies using Nexgen implant, 1 study using PFC implant, and 1 study using both PFC and Nexgen implants. A study used a standard Nexgen and a high flex Nexgen.

Outcomes

Five different functional outcome scores were used to assess knee function following total knee replacement in the 11 studies. Functional outcome was assessed using the Knee Society Score in seven studies, and the Western Ontario and McMaster Universities Osteoarthritic index (WOMAC) was used in two studies, whilst the Oxford Knee Score was used in one study as was the International Knee Society (IKS) score and the Hospital for Special Surgery knee scores (HSS). Knee Society Score (KSS) is a well-validated scoring system that has an objective (surgeon assessed) knee joint score of 100 points and a subjective (patient assessed) knee function score of 100 points. The objective knee score assesses range of knee motion, pain relief, knee alignment, and stability. The subjective function score assesses use of walking aids, walking distance, and stair climbing ability. Total score is a summation of the two scores. The higher the score, the better the outcome. A score of 80–100 excellent; 70–79 good; 60–69 Fair; Below 60 poor.

Four studies showed statistically significant improvement in functional outcome at mid-term follow-up in favour of computer-navigated total knee replacement [15, 17, 18, 20]. In three of the studies [15, 17, 18], the improved functional outcome at 5-year follow-up was correlated with improvement in coronal alignment within the optimal three degrees of varus/valgus mechanical alignment. One study [20] demonstrated improvement in functional outcome at 5-year follow-up with computer-navigated knee replacement with no difference in alignment outcome between the two arms of the study. The reason given for this finding was that differences in rotational alignment outcome between the two techniques may be responsible for the difference in functional outcome, even though this was not measured in that study.

Four studies showed no significant difference in both alignment and functional outcomes between the two techniques [16, 19, 21, 24]. All studies, however, showed a high level of accuracy in achieving implant alignment within the optimal three degrees mechanical alignment in the use of either technique. This reflects the experience of the surgeons with the use of either technique and explains the similarity in functional outcomes achieved. Three studies demonstrated improvement in radiographic alignment but no improvement in functional outcome at 5-year follow-up [22, 23, 25].

Quantitative Analysis

We present the results graphically for each of the Total Score, Objective Score, and Functional Score. Six studies [16, 18, 20, 21, 23, 25] provided total, objective, and functional score, while a seventh Havie et al. [22] had only the total score.

Total Score

Figure 2 contains the point estimates, 50% confidence intervals (the heavy lines), and 95% confidence intervals (the thinner lines).

Fig. 2
figure 2

Point estimates with 50% and 95% confidence intervals for Total Score

For Total Score, the global estimate is 3.19 and its 95% confidence interval contains zero quite comfortably. This suggests that there is modest evidence of improvement in outcomes when computer-assisted navigation is used. Using DerSimonian–Laird random effects approach, point estimate is 1.45 with 95% confidence interval (0.11, 2.799).

Objective Score

We now perform the analysis for the Objective Score component of the Total Score (see Fig. 3). For Objective Score, the global estimate is 1.5 and its 95% confidence interval contains zero, suggesting that there is modest evidence of a small improvement in outcomes when computer-assisted navigation is used. Using DerSimonian–Laird random effects approach, point estimate is 1.45 with 95% confidence interval (0.11, 2.799).

Fig. 3
figure 3

Point estimates with 50% and 95% confidence intervals for Objective Score

Functional Score

The results of running the analysis on the Functional Score component appear in Fig. 4. For Functional Score, the global estimate is 2.12 and its 95% confidence interval comfortably contains zero, suggesting that there is weak evidence of a small improvement in Functional Score when computer-assisted navigation is used. Using the DerSimonian–Laird random effects approach, the point estimate is 2.4 with 95% confidence interval (− 1.174, 5.977).

Fig. 4
figure 4

Point estimates with 50% and 95% confidence intervals for Functional Score

The summary of the primary outcome shows modest evidence of improvement in outcomes when computer-assisted navigation is used, but the size of the difference is small: Total score, point estimate 3.36 CI (− 0.583, 7.298), objective score, point estimate 1.45 CI (0.11, 2.799), and function score, point estimate 2.4, CI (− 1.174, 5.977).

The Secondary Outcome is as outlined below:

Technical complications including anterior femoral notching and excessive tibia resections necessitating the use of large plastic inserts were more in the computer-navigated arm of the study compared with the conventional group. These complications were thought to be due to a failure of registration, but may also reflect the learning curve in the use of computer navigation [19]. Infection rate was similar in both groups (Table 2).

Table 2 Complication rates

Risk of Bias Assessment

The assessment of bias was carried out using the Cochrane collaborations tool for assessing risk of bias [11]. A summary Table of Risk of assessment of individual studies in the review is presented below in Table 3.

Table 3 Risk of bias table of included studies

Whilst six studies [15, 17, 19, 21, 22, 25] demonstrated adequate randomisation, three studies, on the other hand, clearly described the method of allocation concealment [15, 17, 18]. Although, in nine studies, the blinding of the assessor was well elucidated, blinding of the surgeon was not documented in most studies. A problem with most studies was the loss to follow-up with death of the participants on account of the age group involved being the most common cause. Only two studies accounted for incomplete outcome data statistically on an intention-to-treat basis [17, 20]. In addition, five studies described the experience of the surgeon with navigation, whilst, in six studies, this was not specified.

Publication Bias

Funnel plots of all three components of the Knee Society Score are shown below in Figs. 5, 6 and 7.

Fig. 5
figure 5

Funnel plot for Total Score Component of the Knee Society (KSS) Score

Fig. 6
figure 6

Funnel plot for the Objective Score Component of the KSS Score

Fig. 7
figure 7

Funnel plot for the Function Score Component of the KSS Score

The vertical dotted line is the estimate of the global mean from the random effects model with the horizontal axis representing the confidence interval of the estimate. Publication bias is indicated by an absence of points at the lower quadrant of the plot.

The funnel plot shows the study by Cip et al. [18] as an outlier with other studies within the funnel plot. There are only seven studies included in the analysis and so interpretation is subjective. There was, however, no evidence of publication bias.

The funnel plot for the knee (Objective) score did not show any evidence of publication bias. The numbers of included studies, however, are small making for subjective interpretation.

The funnel plot shows an outlier in the study by Cip et al. [18]. There were no studies in the lower right quadrant suggesting some evidence of publication bias with some asymmetry in the distribution of the studies. There were only seven studies included in the analysis and so interpretation is subjective.

Discussion

Summary of Evidence

The result of analysis showed modest evidence of improvement, however, small in functional outcome, when total knee replacement was performed using computer navigation assistance compared with the conventional instrumented surgery. The improvement is noted in the total scores, the objective knee, and the function subscales of the knee society scores.

Blinded outcome assessment was the norm in all studies and the randomised studies were in the high quality studies with adequate randomisation, although most studies did not clearly specify how allocations were concealed. Many of the studies noted the experience of their surgeons, an important variable that can increase the risk of bias. Only two studies indicated that patients were blinded, but did not provide the details of how this was carried out. There were four prospective cohort studies which apart from the risk of bias due to study design had low risk of bias on other parameters. Loss to follow-up was a major factor in most studies with death of participants an important cause; there was no attempt to account for loss to follow-up in the recruitment process or by intention-to-treat analysis.

A complication that seemed higher with computer navigation-assisted knee replacement group was stiffness and arthrofibrosis. This may be because the use of this procedure resulted in longer operating time and more trauma to the soft tissues. It could also be due to the placement of pins in the quadriceps tendon/muscle away from the surgical wound, resulting in quadriceps adhesion and peri-articular soft-tissue trauma.

Study Findings in Relation to Current Literature

Recent well-conducted meta-analysis has shown that computer navigation assistance in the conduct of total knee replacement results in more precise placement of the implant compared with conventionally replaced knee [8, 10]. This is particularly true when lower limb mechanical axis within the optimal three degrees varus/valgus is assessed. This parameter has been shown in various studies to be associated with improved survival with values outside this range associated with failure [6, 26, 27].

There is, however, controversy regarding whether the improved limb alignment and implant position obtained using imageless navigation assistance results in improvement in functional outcome. A recent meta-analysis of solely randomised control trials comparing computer navigation-assisted total knee replacement and conventional knee replacement demonstrated statistically significant improvement in functional outcome in favour of computer navigation group at 2-year follow-up [8]. The explanation given by Rebal et al. [8] for the positive result in favour of navigation was their inclusion of only well randomised control trials and other stringent inclusion/exclusion criteria. Previous meta-analyses have been unable to demonstrate any such improvement with the use of navigation assistance [9, 28, 29].

In the present study, a mid-term review and meta-analysis with 4–5-year follow-up data was able to demonstrate modest evidence of small improvement in functional outcome in favour of computer navigation-assisted knee replacement. This is consonant with the results of other studies that demonstrated improved outcomes with computer-assisted navigation [8, 30]. There is also evidence from the Australian arthroplasty registry of reduced risk of revision in younger patients (under 65 years) by more than one-third when computer navigation assistance is used in total knee arthroplasty compared to conventionally instrumented total knee replacement [4].

It is, therefore, likely that the younger arthroplasty patients would benefit the most from computer navigation-assisted knee replacement as the improvement in functional outcome and significant reduction in risk of revision is most beneficial in an age group with historic poor functional outcomes and survival [31]. We, therefore, recommend routine use of imageless computer navigation assistance when undertaking total knee replacement in young patients under 65 years of age.

Study and Outcome-Level Limitations

The study is a mixture of randomised and cohort studies. Better designed studies, preferably prospective randomised studies adequately powered to adjust for attrition due to death of participants on follow-up, with appropriate blinding of patient and outcome assessors will provide definitive answers with respect to the role of this new technology in total knee replacement surgery. Data for this study were extracted a few years ago, and although, to the best of our knowledge, there has not been additions to the literature that will substantially alter the conclusion to warrant a redraft, we perceive that it may well constitute another limitation of this review. Finally, a further limitation of this paper is the exclusion of Non-English literature.