Introduction

Osteoarthritis, including knee osteoarthritis, has become an increasing global challenge, with its burden growing notably in recent years [1,2,3]. Total knee arthroplasty (TKA) is one of the most frequently performed orthopaedic surgeries, with an estimated 1,065,000 patients undergoing the procedure in the United States in 2020. This number is anticipated to reach 3,416,000 by 2040 [4]. While TKA proves to be an effective and viable treatment, approximately 20% of patients express dissatisfaction post-surgery. This dissatisfaction is influenced by various factors, including patients’ expectations before the surgery, the extent of improvement in knee function, and the relief of pain following the procedure [5].

Recently, the introduction of robotic technologies has led to significant improvements in the accuracy and predictability of bone cuts performed during TKA with the intention of accurately restoring alignment, reducing the extent of soft tissue releases required and ultimately improving patient satisfaction and long term outcomes [6]. Currently, there are controversial findings regarding the differences between conventional total knee arthroplasty (cTKA) and rTKA in terms of functional outcomes and patient satisfaction [7, 8]. Some studies have reported a significantly higher proportion of satisfied or very satisfied patients in rTKA compared to conventional methods [9,10,11]. However, other studies have found no significant difference in patient satisfaction between rTKA and cTKA [12, 13]. Additionally, longer operative duration, higher intraoperative costs, and iatrogenic injuries associated with robotic surgeries could potentially have a negative impact on patient satisfaction [14].

The aim of the present study was to systematically review the existing literature on satisfaction rates following rTKA and compare patient satisfaction between rTKA and cTKA.

Methods

This study adhered to the PRISMA guidelines. The study protocol was registered with the International Prospective Register of Systematic Reviews (PROSPERO) under the code [CRD42023479878].

Search strategy

A search was conducted across the following databases: PubMed, Scopus, Web of Science (WOS), and the Cochrane Online Library. The search utilized the keywords "Knee replacement," "Total knee arthroplasty," "Robotic," and "Patient satisfaction." Additionally, a manual search of the reference lists of identified articles was performed to uncover any additional relevant studies. The search spanned from the inception of each database to August 2nd, 2023. The complete search strategy was provided in supplementary file (Table S1).

Inclusion and exclusion criteria

Studies were included if they met the following criteria: (1) randomized controlled trials (RCTs), case–control studies, and prospective or retrospective cohorts investigating either unilateral or bilateral primary knee arthroplasty using robotic arm-assisted techniques; (2) provided information about patient satisfaction and preferences; (3) were published in English. The exclusion criteria comprised (1) case reports, letters, correspondents, pilot studies, reviews, editorials, and commentaries; (2) studies with incomplete data on patient satisfaction and (3) revision TKA.

Study selection and data extraction

Two independent reviewers screened the titles and abstracts of all retrieved articles. Full texts were obtained for potentially eligible studies. The two reviewers (SE, AHG), independently assessed the eligibility of each full-text article using the inclusion and exclusion criteria. Disagreements were resolved by consulting a third independent reviewer (AHH).

Data were extracted from the included studies by two reviewers (SE, AHG). The extracted data included: year of publication, study design, number of participants, mean age in years, Body Mass Index (BMI), robot brand, mean length of follow-up, patient satisfaction outcome measures, Knee Society Score (KSS), Oxford Knee Score (OKS), Forgotten Joint Score (FJS), Short Form (SF-36) Hospital for Special Surgery (HSS), and Knee Injury and Osteoarthritis Outcome Score (KOOS).

Quality assessment

Two individual reviewers [SE, AHG] assessed the risk of bias. The Joanna Briggs Institute Critical Appraisal Tool (JBI) [15] was employed for estimating the risk of bias in cohort and case series studies. This tool assesses studies with meticulous attention to key factors, including participant selection bias, accuracy of outcome measurement, appropriateness of statistical analysis and reported results selection, and potential bias arising from missing data.

Data synthesis and statistical analysis

Patient satisfaction disparities between rTKA and cTKA were assessed utilizing R software, version 4.2.2, provided by the R Foundation for Statistical Computing in Vienna, Austria (http://www.R-project.org). The analysis involved the Mantel–Haenszel model, presenting the odds ratio (OR) and its corresponding 95% confidence interval (CI). Additionally, proportion analysis was employed, utilizing the inverse variance method to calculate the overall patient satisfaction rate following rTKA. Heterogeneity was determined using Cochrane’s Q test, generating I2. If the P-value of the heterogeneity test was below 0.05 or I2 exceeded 50%, the random-effects model was applied; otherwise, the common-effects model was used. Egger's Regression test and Begg's funnel plot were utilized to evaluate potential publication bias.

Results

Study selection

While searching across four databases, including PubMed, Scopus, Web of Science (WOS), and the Cochrane Online Library, a total of 203 articles were identified after eliminating duplicates. The initial assessment focused on titles and abstracts, resulting in 41 articles progressing to the next stage, which entailed a thorough full-text review. Subsequently, a comprehensive examination of the full texts resulted in the inclusion of 17 articles in this systematic review. Among these, nine articles involved a two-group comparison [9,10,11, 16,17,18,19,20,21], focusing on patient satisfaction between cTKA and rTKA, while the remaining eight single-group articles exclusively examined patient satisfaction following rTKA [22,23,24,25,26,27,28,29]. The Prisma chart in Fig. 1 visually illustrates the various stages of the screening process.

Fig. 1
figure 1

PRISMA graph

Quality of assessment

Seven studies utilized a prospective design [9, 11, 17, 18, 21, 24, 26], while the remainder employed a retrospective approach. The JBI critical appraisal tool for cohort and case series studies was employed to assess the quality of the studies. Detailed results of the qualitative assessment for the articles are available in the supplementary file (Tables S2 and S3).

Baseline characteristics

This review analyzed a total of nine comparative studies, involving 1148 patients (571 in the rTKA group and 577 in the cTKA group) [9,10,11, 16,17,18,19,20,21]. Furthermore, eight single-group studies comprised 1533 patients who underwent rTKA [22,23,24,25,26,27,28,29]. Among the eight comparative studies that reported gender distribution, 345 were male, and 743 were female [9,10,11, 16,17,18,19,20]. Across all the articles included, pertaining to patients who underwent rTKA, there were 786 males and 1311 females. The mean age in the comparative studies ranged from 65 to 70.1 years, and among rTKA patients in all included studies, mean age fell within the same range of 65 to 70 years. Mean BMI in the comparative studies ranged from 26.7 to 34.7, while among rTKA patients in all included studies, it ranged from 26.4 to 33.2. The minimum follow-up duration in the included studies was three months, and the maximum was extended to 24 months (Table 1).

Table 1 Baseline characteristics of the eligible studies

Among the studies, the MAKO (Stryker) robotic system was the most frequently employed, and featured in seven studies [11, 18, 20, 23, 24, 27, 28]. Following this, the NAVIO (Smith & Nephew) system was used in three studies [16, 19, 22], the ROBODOC (Curexo) [17, 21] and OMNIBotics (Corin) [25, 26] systems were each utilized in two studies, and the ROSA (Zimmer Biomet) system was used in one study [9] (Table 1).

Overall patient satisfaction rate after rTKA

After performing a proportion analysis on patient satisfaction across 11 articles (comprising 6 single-group studies and five double-group comparative studies) following rTKA, the results indicated that 94% of patients expressed satisfaction with rTKA, with a confidence interval between 92 and 96% (I2 = 54%) (Fig. 2). Notably, Egger's test revealed the presence of publication bias in this analysis (p = 0.0145, Fig. 3). Following a Trim-and-fill analysis, incorporating three additional studies, the patient satisfaction rate was adjusted to 93.2% with a confidence interval spanning from 90.4% to 95.2%.

Fig. 2
figure 2

Forest plot displaying the proportion analysis of satisfaction rates among patients who underwent rTKA

Fig. 3
figure 3

Funnel plot of the proportion analysis of satisfaction rates for rTKA

In single-group research, Abhari et al., Daffara et al., and Richards et al. evaluated satisfaction levels using the Likert scale, obtaining mean scores of 4.6, 4.5, and 4.7 out of a possible five, respectively [23, 28, 29]. Additionally, Winnock de Grave et al. documented an average VAS satisfaction score of 8.9, and Turan et al. indicated a score of 8.3 ± 1.5 [16, 22] (Table 2).

Table 2 Patient satisfaction following robotic TKA

Comparing patient satisfaction following rTKA and cTKA

An proportions analysis of comparative studies reveals that the satisfaction rate among patients who underwent rTKA was 95% ([95% CI: 88–98%]; Fig. 4A), while for cTKA patients, it was 91% ([95% CI: 85–95%]; Fig. 4B). A meta-analysis examining patient satisfaction rates after TKA indicates that there is no statistically significant difference between the two groups of rTKA and cTKA (OR [95% CI] = 2.62 [7.73–0.89], I2 = 0%) (Fig. 4C).

Fig. 4
figure 4

Forest plots illustrating proportion analysis of satisfaction rates among patients who underwent rTKA (A) and cTKA (B), along with meta-analysis comparing patient satisfaction rates between rTKA and cTKA (C)

Choi et al. observed no significant disparity in KSS satisfaction and expectation scores among patients who underwent rTKA and cTKA at the three-month postoperative stage. However, at the one-year and two-year follow-up points, the mean scores were higher in the rTKA group [20]. Khlopas et al. reported no difference in the three-month mean of KSS satisfaction and expectation scores between the two groups [18]. Also, Smith et al.'s study noted that the difference in KSS satisfaction score was statistically nonsignificant [11]. Moreover, Turan et al. reported the mean satisfaction score on the Visual Analog Scale (VAS), revealing no significant difference between the two groups [16] (Table 3).

Table 3 Patient satisfaction following robotic and manual TKA

Mulpur et al. divided satisfaction levels into four categories: very satisfied, satisfied, dissatisfied, and very dissatisfied. Their findings showed a notably higher proportion of satisfied patients following rTKA compared to those who underwent cTKA [10]. Likewise, Smith et al., who categorized satisfaction into three levels, observed a significantly greater ratio of very satisfied and satisfied individuals among patients who received rTKA [11] (Table 3).

Another metric for assessing patient satisfaction involves their willingness to undergo surgery again, a topic investigated in studies conducted by Eerens et al. and Kenanidis et al. In Eerens et al.'s study, a larger percentage of patients in the cTKA group expressed a willingness for repeat surgery, suggesting higher satisfaction [19]. In contrast, in Kenanidis et al.'s publication, this percentage was notably elevated in the rTKA group [9] (Table 3).

Liow et al. found that there was no statistically significant difference in the ratio of patients with fulfilled expectations for TKA, both six months and two years after the operation [21] (Table 3).

Song et al. conducted a study involving patients who underwent simultaneous bilateral TKA using both robotic and manual techniques. Before the operation, most patients expressed a preference for cTKA. However, at the three-month follow-up 33.3% favored cTKA, 33.3% preferred rTKA, and 33.3% found both methods equally preferable. In the final follow-up, 20% preferred cTKA, 40% favored rTKA, and 40% found both methods equally appealing [17] (Table 3).

Patient-Reported Outcome Measures (PROMs)

We also examined PROMs as a secondary outcome. Both Song et al. and Choi et al. found no significant disparity in mean WOMAC scores among patients who underwent rTKA and cTKA at three months and one year postoperative stages [17, 20]. However, in Choi et al.'s study, a notable difference emerged at the two-year postoperative follow-up, with the mean score being lower in the rTKA group [20] (Table 4).

Table 4 Patient-reported outcomes following robotic and manual TKA

In Choi et al.'s study, the initial mean KSS knee score varied between the two groups, but this distinction lost significance by the final follow-up [20]. Also, Liow et al. noted no significant difference. In contrast, Smith et al. found a notably higher mean score in the rTKA group [21]. The average KSS function score in Choi et al.'s study consistently showed significant superiority in the rTKA group across all follow-up periods, a pattern that is similarly evident in Smith et al.'s research [11, 20]. In contrast to these two investigations, the studies conducted by Liow and Khlopas did not reveal any notable differences in this regard [18, 21] (Table 4).

Four studies reported the OKS [9, 10, 16, 21], and among them, only Kenanidis et al.'s study showed a significant difference (in favor of rTKA) [9]. The FJS was also documented in four studies [10, 16, 19, 20], and except for Turan et al.'s study [22], all others indicated a significant advantage for rTKA.

In Kenanidis et al.'s study, the mean SF-36 Physical Component Summary (PCS) was notably higher in the rTKA group [9]. Conversely, Liow et al.'s study did not indicate any significant differences in either PCS or Mental Component Summary (MCS) [21] (Table 4).

At three months and one year postoperatively, Song et al. found no significant difference in the mean HSS scores between the two groups [17]. Likewise, in Turan et al.'s study, none of the components of the KOOS exhibited significant differences between the two groups [16] (Table 4).

Discussion

Patient satisfaction is an important component of decision-making in orthopaedic surgery [30, 31]. While significant focus has been placed on comparing the radiological and clinical outcomes of rTKA with cTKA, patient satisfaction and patient-reported outcomes stand as critical metrics in surgical decision-making [32, 33]. Results of this study suggest that rTKA is a reliable method with high levels of patient satisfaction. Additionally, the level of satisfaction in cTKA was high and close to that in rTKA. The results of this study suggest that rTKA and cTKA achieve comparable results in these domains.

rTKA is a reliable and safe method in total knee replacement that has achieved satisfactory results in radiological and clinical outcomes [34,35,36,37]. Literature shows that utilizing robotic assistance in TKA can achieve more precise radiological outcomes. rTKA comes with significantly fewer outliers in component positioning in the femoral coronal and tibial sagittal axes and mechanical alignment [34]. Such outcome measures reflect the success of the operation in achieving the desired component position and alignment. Although proper alignment and soft tissue balancing are crucial for maintaining knee function, the extent to which observed differences in these measures affect the final clinical outcomes and patient satisfaction is debatable [38, 39].

Several studies indicate that functional outcomes and implant survivorship can be affected by improved accuracy in alignment and implant positioning. However, a long-term follow-up, seems to be needed in order to notice differences in these outcomes [38]. Most current evidence comparing rTKA and cTKA for patient satisfaction consisted of a short follow-up length, with a maximum duration of two years in our included studies. Additionally, newer robotic systems, claiming improved technical properties, have not been in use long enough to provide medium or long term results.

Most studies which examine the results of rTKA and cTKA show similar results, with rTKA showing slightly better functional and range of motion goals than cTKA [34, 35, 37]. The majority of the studies comparing the two techniques argue that improved radiological outcomes following rTKA, may not be clinically meaningful in the short- to mid-term follow-up [34, 35, 37]. Also, no remarkable difference seems to exist regarding the rate of complications associated with both techniques [34]. Aside from slight differences in blood loss, post-surgery drainage, operative time, and periprosthetic joint infection [8, 21, 36, 40], evidence comparing complication rates between these methods does not robustly favor one over the other.

The previous systematic review and meta-analysis performed by Zhang et al. showed better PROMs for rTKA based on KSS and WOMAC scores, while still arguing that no minimal clinically important difference (MCID) was notable [8]. Since there is a great heterogeneity in the tools used to evaluate PROMs in the included studies, a meta-analysis was not feasible in our study. On the other hand, while some studies reported significantly better outcomes for rTKA [9,10,11, 19, 20], others suggested comparable results [16,17,18, 21]. Also, no significant difference in PROMs between the two techniques were observed in the current study.

An essential consideration when comparing rTKA with cTKA could be the diversity and lack of consistent reporting in alignment and balancing techniques across various studies. Other factors such as surgeon experience, pre operative discussions with the patients and biases based on funding sources for studies need to be considered as these all potentially influence results.

The idea of implementing a uniform alignment approach in all cases of TKA has been challenged recently [41, 42]. While mechanical alignment is the most utilized surgical alignment goal for TKA, a more personalized goal accounting for the constitutional alignment of each patient may be the key to addressing patient dissatisfaction [43]. Since rTKA has been successful in fulfilling the objectives of mechanical alignment with high precision, improving patient satisfaction may involve understanding the significance of presurgical knee morphology. The true potential of technological advancements such as robotic and computer assistance [44] may emerge by defining more personalized surgical goals rather than solely attempting to optimize the precision of the currently defined single target alignment.

In contrast to TKA, robotic-assisted uni-compartment arthroplasty (rUKA) was associated with better patient satisfaction in our previous review [45]. Several factors may contribute to this difference. Firstly, the importance of highly optimizing alignment and soft tissue balance may differ between procedures. A successful outcome in UKA largely depends on both precision of implant alignment and soft tissue balance [46, 47], while TKA seems to tolerate variability in component alignment to some extent [38, 48, 49]. Importantly, studies comparing rUKA and conventional manual uni-compartment arthroplasty (mUKA) mostly consisted of longer follow-up durations [45], which could unravel the clinical effect of radiological precision achieved by robotic assistance. However, such a hypothesis could be easily challenged with findings regarding the long-term studies on PROMs following robotic-assisted total hip arthroplasty (THA), which still indicate comparable results with the conventional method [50]. The potential effect of variety in the recruited robotic systems in these operations should also be kept in mind.

Our study was subject to some limitations. This review only encompassed the short- to mid-term patient satisfaction outcomes. Therefore, studies with longer durations may help reveal any notable differences among these techniques. Second, many key aspects of the procedures other than the use of robotic assistance (e.g., alignment and balancing methods, surgeon experience both in terms of years of practice and volume of procedures) were not considered in most studies, limiting the interpretability of such comparisons. Achieving optimal outcomes in rTKA may necessitate relatively different alignment approaches than those available in conventional methods. Additionally, due to the high degree of heterogeneity of included studies in the tools used for assessing PROMs, a meta-analysis was not feasible to draw robust conclusions. Finally, the extent to which utilizing different robotic systems could impact the outcome could not be determined. Although the majority of the studies reporting better outcomes for rTKA had utilized either MAKO or NAVIO robotic systems, other studies reporting similar results to cTKA with the same systems were also present.

In conclusion, rTKA achieves reliable and acceptable patient satisfaction in short- to mid-term follow-up, comparable to the outcomes of conventional TKA. rTKA has been associated with improved precision of bone cuts and radiological outcomes. Patient satisfaction is a complex construct, and uniform measurements among studies are crucial for sound judgement [51, 52].