Introduction

In patients with moderate symptomatic tibiofemoral osteoarthritis and varus alignment of the leg, an opening-wedge High-Tibial Osteotomy (owHTO) is a joint sparing operative option [2, 3, 8, 18]. Over the last decade, the outcomes have improved in owHTO by redefining the indications, improving pre-operative planning and by making the surgical technique more accurate, safer and reproducible [3, 10, 14, 27].

The use of patient-reported outcome measures (PROMs) such as Knee injury and Osteoarthritis Outcome Score (KOOS) [5, 26] and Knee Society Knee Score (KSS) [22], or the SF-12 [31, 32] are common and essential tools for a structured and standardized assessment of patient-perceived pain, function and life quality after owHTO surgery. There is a need of an appropriate tool to analyse the clinical relevance of the improvement in subjective outcomes after owHTO.

The Minimal Clinically Important Difference (MCID) firstly described by Jaeschle et al. [15]. as “the smallest difference in score in the domain of interest which patients perceive as beneficial and which would mandate, in the absence of troublesome side effects and excessive cost, a change in the patient’s management,” is a useful benchmark to determine whether patients improve enough clinically to notice a real clinical difference.

Therefore, defining a MCID value for a specific population and specific procedure is crucial for determining the effectiveness of the procedure and calculating the sample size for clinical trial planning.

To date, the MCID values ​​have never been defined for the KOOS, KSS and SF-12 scoring system in the owHTO procedure.

The purpose of this study was to determine the MCID of several PROMs in patients who underwent owHTO with Patient Specific Cutting Guides (PSCGs), using anchor-based methods.

Materials and methods

All patients aged from 18 to 80 years old undergoing medial owHTO with PSCGs between January 2013 and January 2017 were enrolled in this single-center, prospective, observational study (n = 238).

Indications for owHTO included [14]: isolated medial knee osteoarthritis Ahlbäck grade < 3 [1] with a preserved status of the patellofemoral and lateral tibio-femoral joints assessed using clinical and radiological (x-rays and MRI-scan) methods, a significant metaphyseal tibial varus [Medial plateau tibial angle (MPTA) < 84°], a stable knee in sagittal and coronal plane and the failure of all non-surgical treatments.

Absolute contraindications included inflammatory joint disease, unresolved or recent septic arthritis, or deficient soft-tissue coverage.

Exclusion criteria for the present study included: concomitant cartilage surgery (n = 8) and preoperatively planned modification of the angulation of the tibial slope (n = 11).

All the surgeries were performed by two senior surgeons in a single center.

In the preoperative planning stage, the planned correction was first calculated by the surgeon using conventional radiographs (weight bearing long-leg, A/P and lateral views) following Miniaci’s Method [9]. Subsequently all patients underwent a CT-scan and a Patient-Specific Cutting Guide was designed to achieve the osteotomy.

Surgical technique [3]

After performing the standard medial surgical approach, a PSCG was used following the technique described by Chaouche et al. [3]. The plate (Activmotion ®, Newclip Technics, Haute-Goulaine, France) was secured using six locking screws. The osteotomy gap was filled with femoral head allograft.

The postoperative rehabilitation protocol has been previously described [10] with full weight bearing and unlimited range of motion as tolerated. The patient was advanced to full weight bearing, closed chain strengthening, and cycling by 3 weeks. Progression to sport-specific activities and full training was permitted by 12 weeks post- operatively.

In case of Lateral Hinge fractures (LHF) [30]:

For Takeuchi Type 1 and 2 diagnosed intraoperatively; toe touch weight bearing for 6 weeks with the aid of crutches was recommended. Full weight bearing was allowed after 6 weeks.

For Takeuchi 3 diagnosed intraoperatively: the fracture was treated intraoperatively by two additional compression screw and non-weight bearing for 6 weeks followed by toe touch weight bearing for 6 weeks with the aid of crutches was recommended. Full weight bearing was allowed after 12 weeks.

If a hinge fracture was diagnosed at 6 weeks’ follow-up, for the type 1 and 2, toe touch weight bearing was not extended. For the type 3 a toe touch weight bearing for 6 weeks was recommended before full weight bearing.

Patients’ evaluation and follow-up

Patients were prospectively evaluated at months 1, 3, 6, 12, 18 and 24 months follow-up. Radiographic outcomes were evaluated with intra- and postoperative fluoroscopic imaging and with CT imaging at 6 weeks post owHTO surgery. Every radiograph and CT-scan was reviewed by two senior clinicians (one radiologist and one surgeon). In instances of different interpretation, a third evaluation was performed by the two clinicians to reach a definitive interpretation. The Kappa coefficient of the first evaluation was 0.95. Lateral hinge fractures were classified following the Takeuchi classification [30].

Postoperative infections and bone union were also evaluated during the survey. With regards to postoperative infections, superficial infections requiring only antibiotic therapy were separated from the deep infections treated by a surgical debridement associated with antibiotic therapy.

A total of three outcome scores were collected pre-operatively and at the 2 years follow-up evaluation: Knee Society Knee Score (KSS), SF-12 Score, and Knee injury and Osteoarthritis Score (KOOS).

KOOS comprises a 42-item self-reported questionnaire of subscales that include pain (nine items), other symptoms (seven items), function in daily living (17 items), function in sport and recreation (five items), and knee-related quality of life (QOL) (four items), which are scored individually from 0 (extreme knee problems) to 100 (no knee problems).

The SF-12 score, which is derived from the SF-36 score, comprises a 12-item questionnaire evaluating specific factors of general health-related QOL that are divided into the physical (PCS) and mental (MCS) component summaries. The mean score of the general population is 50, with an SD of 10. Higher scores demonstrate better health-related QOL.

The KSS was developed and published in 2012 and has been validated to better characterize the expectations, satisfaction, and physical activities of the younger and more diverse population of patients undergoing total knee arthroplasty [28]. The internal properties and responses of each patient can be captured in every domain of the subjective score, including symptoms (/25), satisfaction (/40), and expectation (/15). Various kind of activities (/100), including essential activities, higher-level activities, and discretionary/recreational activities are also evaluated by KSS [22]. For this study, two sub-scales were included: KSS symptoms (/25) and KSS activity (/100). Indeed, the sub-scale KSS satisfaction items are similar to those of the anchor method and could introduce a bias in the case of discordant results. For the sub-scale KSS expectation, it is not possible to perform a MCID estimation since the pre-operative questionnaire for expectation is different from that used post-operatively.

Patients without a 2-year follow-up evaluation were excluded from the analysis (n = 23; 10.5%) (Fig. 1: Flow Chart).

Fig. 1
figure 1

Flow chart

Calculation of minimal clinically important differences

The MCIDs were calculated using four anchor-based methods [17]. Patients were given the anchor question at 2 years postoperatively: “Compared with before surgery, how would you rate each operated joint now?” The responses were recorded using a 5-point scale: “much better,” “somewhat better,” “about the same,” “somewhat worse,” and “much worse.” Patients who answered “about the same” or “somewhat worse” were classified into the no change group, while those who answered “somewhat better” were classified into the minimal change group. Patients who answered “much better” or “much worse” were not included in the analysis because they experienced more than minimal change. Four anchor-based methods were used to calculate the MCID. The average change corresponded to the mean change in the score of the minimal change group. The minimum detectable change (MDC) approach defines minimal change as the smallest change that can be considered above the measurement error with a given level of confidence (95%). Therefore, the MCID is equal to the upper value of the 95% confidence interval for the average change in score that is seen in the no change group. The difference in change was defined as the difference in the average change in score between the minimal change and no change groups. A receiver operating characteristic (ROC) curve was used to define the cutoff point that best discriminated between the minimal change and no change groups. The optimal cutoff point was estimated using the point that maximized both specificity and sensitivity. The Area Under the ROC Curve (AUC) was calculated to assess reliability. An AUC value of 0.7–0.8 was considered acceptable and an AUC value of 0.8 to 0.9 was considered excellent [6].

Local Ethical Committee approval was obtained prior to study’s initiation [Comité Informatique et Liberté (CIL)/Assistance Publique des Hopitaux de Marseille (AP-HM)/Registration Number 2019-17-129)].

Statistical analysis

The descriptive statistics were presented as mean ± standard deviation for quantitative variables. Normal (Gaussian) distributions were verified to determine adequate statistical testing method (either parametric or non-parametric) to estimate the difference between groups in an univariate analysis.

Mann–Whitney U test for two independent samples with 95% confidence interval (95%CI) was used to evaluate the differences between two variables, one-way ANOVA between more than two variables and multiple linear regressions for relationships. Paired student’s t test was used to estimate the evolution of functional outcomes during follow-up. The differences in proportions between the two samples were estimated with z test with 95% CI. PASW Statistics version 20 (SPSS, IBM Inc., Chicago, Illinois) was used for statistical analysis. The threshold for statistical significance was set at p < 0.05.

Results

One hundred and ninety-six patients were evaluated at 2-years post-operatively and finally included in this study. A significant improvement was observed for all the cohort between the preoperative functional scores (KOOS, KSS and SF-12) and the 24 months postoperative functional scores (Table 1).

Table 1 Pre- and postoperative (24 months follow-up) PROMs of the 196 patients

Two non-unions were seen at 9 months and 10 months after owHTO, union was achieved at 5 and 7 months, respectively, after a revision procedure. No patients had owHTO conversion to knee arthroplasty during the 2-years follow-up.

Ninety (45,9%) patients responded that their knees were “much better” than they were before surgery, 75 (38.3%) patients responded that their knees were “somewhat better”, 14 (7.1%) responded that they were”about the same”, ten (5.1%) reported that they were “somewhat worse”, and seven (3.6%) reported that their knees were “much worse”. 75 (somewhat better) and 24 patients (about the same and somewhat worse) were, respectively, assigned to the “no change” and “minimal change” groups for the MCID calculation.

There was no significant difference between these two groups in terms of baseline characteristics and postoperative complications (Table 2).

Table 2 Patient’s characteristics and post-operative complications following Anchor distribution

At 24 months follow-up all the PROMs (KOOS, KSS and SF-12) were significantly better for the “minimal change” group compared to the “no change” group (Table 3).

Table 3 Delta (pre-operative-24 months postoperative) PROMs following Anchor classification

The MCIDs for each PROM calculated with the four anchor-based methods were presented in Table 4.

Table 4 MCID values for the PROMs

The AUC defined by the ROC were considered acceptable (> 0.7, < 0.8) for the all the subscale of the KOOS and SF-12 score and excellent (> 0.8) for the two subscales of the KSS.

Discussion

The most important finding of this study was the identification of the MCID regarding the KOOS, KSS and SF12 scores in patients undergoing owHTO for primary and isolated medial knee osteoarthritis (OA). For this procedure, MCID values were 15.4 for KOOS pain, 15.1 for KOOS symptoms, 17 for KOOS ADL, 11.2 for KOOS sports/recreation, 16.5 for KOOS QQL, 3 for KSS symptoms, 5.6 for KSS activity, 7.2 for SF-12 physical component and 6.3 for PCS mental component.

Although some of the literature reporting on outcomes after owHTO estimated long-term clinical success as it relates to HTO survivorship, with failure defined by the need for revision procedure or conversion to total knee replacement [7, 11, 16, 24] much of publications focus on short and mid-term outcomes through knee-specific PROMs, without providing MCIDs [4, 12, 20, 29]. Yet, the target population is most often young (less than 60 years-old), able to deal with pain and to be more likely to prefer non-operative option [13]. Consequently, it is useful to draw upon reliable definitions of clinically meaningful improvement after owHTO.

While it is common to provide a statistically significant difference in subjective knee scores, it can be more difficult to assess the clinical meaning of this difference for the patient. Therefore, the MCID determination is relevant in the analysis of the literature’s outcomes, especially since taking into account patient perspectives is more widely established, allowing to standardize this type of statistical instrument [21, 23]. Moreover, information about MCID is helpful to calculate power analysis and determine the number of subjects required.

While there is a high prevalence of MCID investigations after total knee arthroplasty [21], the present study is the only one to have investigated MCID for subjective outcomes after owHTO. Even if no consensus has been reached on the approach used to quantify MCID [19] the four anchor-based method remains the most reliable one as it involves patients [25]. The current study used the same methodology as other recent studies [23] which have already highlighted its effectiveness as a key instrument for MCID estimation. The other strengths were a large cohort with almost 200 patients enrolled prospectively and operated in a single-center, with the same selection, surgical procedure and postoperative rehabilitation.

However, there are several limitations to this study. First, the anchor-based method, despite its advantages, can lead to a memory bias. Second, some patients’ characteristics that might affect survey responses have not been taken into account, such as gender, body mass index (BMI), comorbidity and mental health status. Finally, MCID values quantified in this study can only be applied to cases of owHTO performed in primary and isolated medial OA and in a stable knee.

Conclusion

In conclusion, our study successfully quantified the MCIDs of three frequently used PROMs such as KOOS, SF-12 and KSS scores after OWHTO. Moreover, these MCID values can be helpful to guide clinical decision-making for surgeons considering OWHTO and to handle preoperative patient expectations.