Introduction

Total knee arthroplasty (TKA) is the gold standard for patients with end-stage knee osteoarthritis (OA), with excellent clinical results. It provides pain relief, enhanced mobility and improved quality of life in patients with OA [1]. In the literature, implant survival rates between 81% and 92% at 15-years follow-up were reported [2, 3]. Accurate implant alignment and soft tissue balancing is an important factor for a successful outcome [4, 5]. Well-known reasons for early failure are instability, implant malpositioning and limb malalignment [6]. The typical objective of TKA is to obtain a limb alignment within 2°–3° around a neutral mechanical axis (MA) of the leg in the coronal plane [7, 8]. Deviation of more than 3° from a neutral MA is associated with increased failure rates due to off-axis loading, polyethylene wear and subsequent implant loosening [4, 7,8,9].

As with most techniques in modern medicine, TKA has been developed significantly over the past decades with improvement of both implant survival and daily function. Computer-assisted surgery (CAS) for TKA was introduced in the late 1990s to improve alignment of TKA and reduce the incidence of outliers. Pins are placed on the femur and tibia and with help of a computer platform, tracking system and a rigid body marker, anatomical landmarks around the knee are registered and prosthetic sizing and bone resection level can be analyzed. Studies showed the percentage of outliers in the coronal plane was 9–13% with CAS and up to 32% with conventional instruments (CI) [10, 11].

A concern that arises with any new technique is whether it will achieve satisfactory outcome. Published results on CAS are contradictory. Despite improving mechanical alignment, CAS failed to improve functional results when compared to conventional techniques [12,13,14].

Recently, the next generation of robotics was introduced to further reduce the amount of outliers. An image-free handheld robotic sculpting system enables the surgeon to plan the implant position in six degrees of freedom peroperatively without the need for preoperative imaging. Intraoperative adjustments allow the surgeon to optimize soft tissue balancing and bony alignment. Implant placement accuracy of this image-free approach has been demonstrated in a cadaveric study [15]. Published clinical results on this image-free handheld robot system are scarce. To the best of our knowledge, no clinical studies have yet been published comparing the precision freehand robotic sculpting system with a matched cohort of conventional TKAs. This matched cohort study on robotics and conventional technique for the implantation of TKA was designed to address the following research question: Is there a difference in the alignment accuracy between robotic and conventional TKA? Does the robot achieve the desired MA as planned by the surgeon’s preference? We hypothesize that the robot will be more accurate when compared to conventional TKA and will achieve the desired MA with few outliers.

Methods

This retrospective case-controlled study included a consecutive series of patients operated with an image-free handheld robot system for TKA (NAVIO Surgical System, Smith & Nephew, Memphis, USA) between May 2018 and March 2019 in the St. Trudo Hospital (St. Truiden, Belgium). A total of 79 patients were operated by means of robotic-assisted TKA (Robot-group) for end-stage knee OA. Two patients were excluded because of technical difficulties with the robot system and were switched to conventional technique peroperatively. The remaining 77 patients were matched on gender, age, type of implant (Journey II/Genesis/Legion) and follow-up time with seventy-seven patients who had been operated with the conventional technique using intramedular rods (Control group), forming a total cohort of 154 patients. All TKA procedures were performed by 2 experienced knee surgeons (PB and JM), both performing a minimum of 150 TKA procedures annually. Six patients were excluded because of missing data. Baseline demographics and operative data are shown in Table 1.

Table 1 Baseline and operative data of the included patients are presented as mean (SD) or absolute numbers (%) for both groups

Operating technique

In all procedures, a tourniquet and a standard medial parapatellar approach was used. In the Robot-group, two percutaneous bicortical partially threaded pins were placed with help of a pointer instrument into the proximal tibia and into the distal femur for the tracking arrays of the robot system. Standard exposure of both the femur and tibia was carried out with patella eversion. The anterior and posterior cruciate ligaments were sacrificed and osteophytes were excised before gathering data with the robot. Mechanical and rotational axes of the limb were determined peroperatively by establishing the hip, knee and ankle centers. The kinematic axis of the knee was identified and selected to determine the rotational position of the femoral component. The morphology of the knee was determined by mapping out the condylar anatomy by “painting” the surfaces with the probe. A software program of the robot creates a virtual 3D model of the knee. The implant size that best fits on the virtual model is then selected by the surgeon, including the position of the implant in coronal, sagittal and rotational planes.

Thereafter, valgus and varus stress was applied during full range of motion to tension the medial and lateral structures for balancing the dynamic soft tissue. The robot software creates a graphical gap space through the full range of motion by which the surgeon can adjust the desired MA and the planned position of the femoral and tibial components to optimize the soft tissue balancing. Adjustments of the implant size and position in all planes were made to optimize soft tissue balance and component tracking before bone preparation. After planning the components, the distal femur bone was prepared with a high-speed 5-mm burr by continuously moving a hand piece and switched on and off as the motorized burr moved in and out of the mapped cutting space. The pin holes for the saw guides were prepared with a 5-mm burr for the proximal tibia and with a 2-mm burr for the distal femur. The saw guides were then fixed, with the feedback of the robot, in an optimal position to make the bone cuts of the proximal tibia and anterior and posterior femur.

After putting in the trial components, balancing and component tracking was checked again with valgus and varus stress during full range of motion with graphical representation of gap spacing. TKA were cemented (Gentamicin Palacos, Heraeus Kulzer GmbH., Hanau, Germany) in all procedures.

The operative procedure in the Control group was identical to that in the Robot-group, except for pin placement for the cutting blocks, which was done using conventional intramedullary alignment rods. Soft tissue balancing and patella resurfacing were performed when deemed necessary.

Clinical outcome

All patients were seen on standard outpatient visits, and all information was obtained from the patient’s records. Standing, full weight-bearing, AP long-leg and lateral digital radiographs were taken preoperatively and 6 weeks postoperatively in both groups. Radiographic measurements were made with the use of a calibrated protocol on digital images using the Impax system for Windows (AGFA HealthCare). All radiographic evaluation was performed twice for each radiograph separately by 1 observer (AB) to obtain intra-observer reliability and once by an independent observer (SK) to calculate the inter-observer reliability.

The MA was determined according to Tigani et al. [16]. The position of the femoral (frontal femoral component, FFC) and tibial (frontal tibial component, FTC) components perpendicular to the MA was measured on the same standing, weight-bearing, AP long-leg radiographs. On the standard lateral radiographs, the flexion and slope for, respectively, the femoral (lateral femoral component, LFC) and tibial (lateral tibial component, LTC) components were measured according to Tigani et al. [16]. The LFC was measured as the anterior angle between the femoral component and the anterior cortex of the femur. The LTC was measured as the anterior angle between the tibial component and the posterior cortex of the tibia. Proportions of deviation > 3° were calculated for the MA and individual components. These were considered as outliers.

Further perioperative conditions including operating time (incision to closure in minutes) and type of implant (Journey II/Genesis/Legion) were assessed from the patients operative records.

The study was approved by the Local Review Board (Sint-Truiden, Belgium; approved on February 20th, 2019; IRB-Nr PEC-V-HJ2019-02-20) and conducted according to the principles of the Declaration of Helsinki (2008) and in accordance with the Medical Research Involving Human Subjects Act [17].

Statistical analysis

Statistical Package for the Social Sciences version 21.0 (SPSS, Inc., Chicago, IL) for Windows was used. All radiographic evaluation was performed by 2 independent assessors (AB and SK). Intra- and inter-class correlation coefficients (ICCs) were calculated to check for intra- and inter-observer reliabilities. An ICC ≥ 0.7 was considered as good correlation. Statistically significant differences for the baseline conditions and radiographs between the Robot-group and the matched Control group were analyzed with a Student’s t test. Fisher’s exact test was used to test differences of proportions. No a priori sample size or power was calculated for this retrospective case-controlled study. It was carried out in an attempt to predict an appropriate sample size to design a full-scale research project. Therefore, effect size (i.e., Cohen’s d) for the primary endpoint, the postoperative mechanical axis, was calculated as the standardized difference between two means divided by the standard deviation of both groups. An effect size of 1.0 is equivalent to a change of one SD in the sample and considered very large. An effect size of 0.8 is large, 0.5 is moderate and ≤ 0.3 is small [18]. For all analyses, a p-value was considered to be statistically significant at p ≤ 0.05. Results are presented as mean (SD), proportions (%) and 95% confidence interval (CI).

Results

Patients from the Control group were perfectly matched with patients from the Robot-group, and no differences in baseline characteristics were reported. Additional perioperative information shows a significant longer operation (OR) time (p = 0.000) in the Robot-group. Baseline and operative data are presented in Table 1.

The difference in the postoperative MA was statistically significant (p = 0.028) between the Robot-group and Control group with a mean of 180.1° (1.9°) and 179.1° (3.1°), respectively. The postoperative MA in the Robot-group improved significantly (p = 0.005) when compared to the preoperative MA. Difference between the pre- and postoperative MA for both the Robot-group and Control group is visualized in Fig. 1.

Fig. 1
figure 1

Boxplots visualizing the pre-and postoperative mechanical axis (MA) for both the Navio TKA and Conventional TKA groups

A moderate effect size of 0.4 was calculated for the MA. The frontal and lateral radiographic outcome for both the femur and tibia are summarized in Table 2.

Table 2 Mean (SD) radiographic values compared between the robot TKA and the Conventional TKA groups

The MA was restored accurately in all patients, but with accompanying numbers of outliers in the Robot-group and Control group. All radiographic measurements of both observers were reliable, and ICCs were excellent except for the inter-observer agreement (0.411) regarding the LFC. Only the number of outliers for the varus–valgus position of the tibial component showed a significant difference (p = 0.038) with more outliers in the Control group (n = 6, 8%) than in the Robot-group (0%). Outliers and corresponding percentage, including the intra- and inter-observer agreement, are summarized in Table 3.

Table 3 Amount of outliers and percentage (%) compared between the robot TKA and the Conventional TKA groups

Regarding the preoperative planned and postoperative achieved mechanical leg alignment as measured with the robot, data of 71 patients (92%) from the Robot-group were available. When compared with the postoperative standing, weight-bearing, AP long-leg radiographs, 6 (8%) and 11 (15%) outliers were found for the planned and achieved MA, respectively, as obtained with the robot.

Discussion

The most important finding of the present study is the ability of a novel image-free handheld robot sculpting system to accurately achieve the planned mechanical axis (MA) in TKA with few outliers. There were significant but no clinically relevant differences between the two groups regarding the radiographic outcome measures. The present study investigated the alignment and component positioning of TKA performed with an image-free handheld robotic sculpting system and compared these results with a matched control group of TKAs performed with conventional intramedullary technique, without focusing on learning curve, clinical outcome and complication rate of the robot.

Restoring a neutral MA within 3° of varus/valgus remains one of the most fundamental principles in TKA [8, 19, 20]. Recent studies have raised doubts about the benefits of a neutral MA. A study with a 20-year follow-up revealed that there was no difference in survival between TKA that were mechanically aligned or those outside the 3° of varus/valgus [21]. Furthermore, this objective to obtain a neutral MA has been questioned by the introduction of kinematic alignment (KA) in TKA [22]. The philosophy of KA is to restore the physiological kinematics by allowing a certain degree of constitutional varus/valgus in TKA. In the present study, KA robot TKA was compared with MA conventional TKA. Due to the retrospective nature of this study, we did not report the degree of the planned femoral cut in the Control group. In the Control group, bony cuts were measured on preoperative X-rays and made in order to achieve a neutral MA. In the Robotic-group, all TKAs’ were kinematically aligned to achieve optimal soft tissue balancing, and an alignment of ± 2° from neutral was tolerated. Despite the ongoing debate on optimal limb alignment, it remains clear that a well-balanced TKA is paramount to ensure long-term survival by avoiding off-axis loading, polyethylene wear and subsequent implant loosening [4, 7,8,9]. The robotic-assisted system provides a useful tool for the surgeon to peroperatively change implant positioning and alignment in order to achieve a well-balanced TKA.

In literature, conventional TKA have a risk of outliers of up to 30% [10, 23]. Thienpont et al. observed 20% outliers in a meta-analysis on patient-specific instrumentation (PSI) for TKA [23]. A meta-analysis studying computer-assisted surgery (CAS) for TKA showed that outliers occurred in 9% of the cases [10]. The most recent robotic-assisted TKA established outliers in just 3% of all cases [24]. The outliers (6%) found in the present study with the robotic system are in line with the other robotic systems.

Interestingly, study data reveals a difference in outliers between the achieved MA, as measured by the robotic-assisted system, and the postoperative MA, which was measured on long-leg standing X-rays. Based on our experience, we believe this is due to the extra amount of force of a weight-bearing radiograph in comparison with the robotic-assisted system measurements, where the patient is lying flat on the operating table. This has previously been shown in the literature too [25, 26]. Despite these preliminary data, we should therefore consider the possibility of inaccurate measurements of the robotic-assisted system.

In the present study, no outliers of the frontal tibia component were observed in the Robot-group, which was significantly better compared to the Control group with 8% outliers (p = 0.038). Tibial component malalignment had been shown to be a well-known cause of early failure in TKA [27]. The accuracy of the robotic-assisted system in implant positioning has been previously validated in unicondylar knee arthroplasty [15, 28, 29]. Casper et al. validated its accuracy in TKA in a recent cadaveric study [30]. Mason et al. previously reported frontal femoral and tibial malalignment in 5% and 4%, respectively, in CAS TKA [10]. We expected fractions of outliers to be comparable, or better, than CAS TKA, though a higher fraction of femoral outliers was found. We believe this is due to a learning curve. However, we should consider low accuracy of the planning software.

There may be some concerns regarding the radiological measurements in the sagittal plane. The present study observed 17% and 86% outliers in the Robot- group for, respectively, the tibia slope and flexion of the femoral component as measured on lateral X-rays. These large amounts of outliers were also observed in the Control group, without significant differences between both groups. This worse outcome can partly be explained by the suboptimal quality of the lateral X-ray measurements as found between both observers (inter-observer agreement: 0.411) and was therefore not reliable. This low inter-observer agreement was disappointing that it should be considered invalid. A wide variety of different analyses in the literature are used to objectively determine the postoperative position for both the femur and tibia implants. Most of the literature studying postoperative implant position use long-standing radiographs. Determining sagittal component alignment using plain radiographs is challenging due to the effects of difficulty in establishing a true sagittal axis using standard lateral X-rays [31, 32]. Despite a good ICC for the evaluation of the frontal position of both femur and tibia implants, rotational alignment was not examined. Postoperative evaluation on 3D-CT has shown to be a valuable tool to measure position and orientation of both the femur and tibia components, including the implant rotation [33, 34]. Unfortunately, a postoperative 3D-CT is not routinely performed in our clinic. Future studies on the accuracy of the robotic-assisted system are needed with accurate CT-based measurements, to assess this observation.

A valuable strength of this study was the well-described patient cohorts due to the extensive information available in the patient’s medical records. This allowed matching of patients from the Robot-group with suitable patients from the Control group. Another particular strength of this study was that both surgeons operated with the robot and conventional alignment tools and with use of the different designs of implants. These strengths provide less sensitivity for selection bias in this study.

In this study, 2 patients were excluded because of technical issues of the Navio tracking system. Both cases were switched to conventional instrumentation peroperatively. Here we report setup issues in the early experience with the Navio robot. As mentioned before, a learning curve is to be expected with the use of a novel technology.

Together with the retrospective design of the study, the most important limitation was the limited number of patients. The study was designed for radiographic analysis of the postoperative MA with the use of long-leg standing X-rays. Therefore, a type II error cannot be ruled out. A post hoc power analysis revealed a power of 0.669 (1− β). Based on these results, we performed a power analyses with effect size of 0.4, power (1− β) set at 0.80 and a = 0.05 (two-tailed). Approximately 105 patients in each group are needed to obtain statistical power at the recommended 0.80 level.

Finally, despite the retrospective nature of this study, with a limited number of patients, this is the first study comparing an image-free handheld robot with conventional alignment for TKA giving valuable insight about the use of a promising knee alignment tool addressing a relevant outcome. The results illustrate that an image-free handheld robot for TKA can help to restore the desired limb alignment and accurate tibial implant position.

Conclusion

The present study illustrates that this simplified surgical robot technique for TKA is effective with acceptable radiological outcome. The Robot-group showed significantly less mechanical axis outliers compared to the conventional technique. Whether these differences are clinically relevant is questionable and should be investigated on the long term. Randomized controlled trials with larger patient series will be needed in the future to confirm our preliminary results.