Introduction

The majority of patients undergoing TKA have excellent clinical results [14, 18, 19, 21]. A minority of patients may experience knee pain, stiffness, and/or patellar instability postoperatively [2, 3, 20]. Some authors have suggested that rotational malalignment of the femoral and/or tibial components causes these problems, and because of this belief, malrotation is a major cause of revision TKA [4, 21]. Others have written that rotational component alignment can cause rotational knee instability and even posterior dislocations of the knee after TKA [1, 17]. If revision surgery is proposed, the indication for surgery such as malrotation should be identified. Also, if poor outcomes are attributed to malrotation, it is important for surgeons to have a reliable and reproducible way to measure it to confirm the diagnosis before recommending revision surgery.

Before the wide adoption of CT, component rotation was measured clinically or on plain radiography [6, 10, 11, 13, 16]. Although these methods could demonstrate gross malalignment, it was difficult to quantify and probably not adequate to identify more subtle cases of malrotation. Jazwari et al. [15] demonstrated the accuracy of CT in measuring rotation of TKA components on cadavers. Berger et al. [4] described a CT protocol to measure component rotation in patients undergoing TKA and proposed that the amount of combined malrotation is directly correlated with the severity of patellofemoral complications. Specifically, Berger et al. describe that patients with lateral patella tracking and tilting have combined component excessive internal rotation of 1° to 4°. Those with patella subluxation have 3° to 8° of excessive internal rotation, patella dislocation 7° to 16°, and component failure at 8° to 17° [4]. However, to attribute these complications to component malrotation with any confidence, in particular if the result of these measurements might cause a surgeon to recommend revision arthroplasty, one must know the reliability of the measurement. Unfortunately, there is a lack of adequate intra- and interobserver reliability data in the published literature regarding two-dimensional (2-D) CT scan measurements of TKA components.

The primary purpose of our study, therefore, was to determine the inter- and intraobserver reliability of 2-D CT scan measurements of TKA component rotation. The second purpose was to determine the repeatability coefficient of this test, which is defined as the difference in measurements that include 95% of the values using 2 SDs from the mean. This measurement identifies the magnitude of variability by the same reviewer in degrees of rotation measured.

Patients and Methods

Fifty patients (52 knees) had 2-D CT scans using the McKesson PACS system (McKesson Corporate Headquarters, San Francisco, CA, USA). There were 22 men and 28 women. Average age at time of CT scan was 64 years with a range of 46 to 88 years. All patients were being evaluated for possible revision as the indication for ordering the CT scan. These patients were being evaluated for one or more of the following diagnoses: patellar subluxation and anterior knee pain, patellar instability, knee instability with suspected malalignment, and knee stiffness (ROM < 70° of flexion) [9]. All 52 scans were measured according to the protocol Berger et al. described [4]. This protocol uses 1.5-mm thick axial images obtained at four locations of the knee: through the femoral epicondylar axis, through the tibial component tray, the proximal tibial plateau, and the tibial tubercle. The femoral component rotation is determined by the angle comparing the anatomic epicondylar axis with the posterior condylar surface of the component. Native rotation from the posterior condylar angle is 0.3º (± 1.2º) internal rotation for females and 3.5º (± 1.2º) internal rotation for males relative to the anatomic epicondylar axis [4] (Fig. 1). The tibial component rotation is first calculated by identifying the geometric center of the tibial plateau. Next, the lines for the tibial component axis and the tibial anatomic axis (the line from the geometric center to the center of the tibial tubercle) are drawn. The angle between the two lines is the rotation of the tibial component. Normal rotation using this method is 18º (± 2.5º) [4] (Figs. 2, 3, 4). The evaluators included an adult reconstruction surgeon (BK), an orthopaedic chief resident (RH), and a musculoskeletal radiologist (MM) to determine interobserver reliability. The scans were measured twice, more than 2 weeks apart, by the orthopaedic resident and attending surgeon to determine intraobserver reliability. A sample size was conducted to ensure adequate power for the study. We determined that 50 CT scans read twice by the same observer will achieve 80% power to detect an intraclass correlation coefficient (ICC) of 0.78 compared with an ICC of 0.60 under the null hypothesis with a significance level of 0.05. For interobserver reliability with 50 CT scans read by each of the reviewers, we had 80% power to detect a ICC of 0.51 compared with an ICC 0.29 [12] under the null hypothesis. To assess intraobserver reliability, ICC values were calculated using a one-way analysis of variance (ANOVA) model for agreement and for interobserver reliability, we calculated ICC values with two-way mixed-effects ANOVA models using SPSS Version 13.0 (SPSS Inc, Chicago, IL, USA); 95% confidence intervals for ICC values are provided. An ICC value of 1 indicates perfect reliability, 0.8 to 1 is very good, 0.61 to 0.8 is good, 0.41 to 0.6 is moderate, and less than or 0.4 is poor [12] (Table 1). We also calculated the average difference in degrees between the measurements we expect 95% of the differences to fall within 2 SDs of zero, which is the repeatability coefficient [7, 8]. As an additional metric, we calculated the proportion of the time that the same measurement by different observers and by the same observer would fall within 5° of each other, called the ± margin of equivalency [7, 8].

Fig. 1
figure 1

The CT scan is an image at the level of the femoral medial and lateral epicondyles. Line A represents the epicondylar axis of the femur and Line B represents the femoral component posterior condylar surface and the component rotational axis. In this patient, the femoral component is 1.03º internally rotated.

Fig. 2
figure 2

The intersection of the lines shown on the tibia represents the geometric center of the knee (GC).

Fig. 3
figure 3

The line is to the center of the tibial tubercle (T).

Fig. 4
figure 4

Line “TR” represents the tibial anatomic axis for rotation and line “CR” represents the tibial component axis for rotation. The angle formed by the intersection of these lines represents the component rotation. In this patient, the tibial component is internally rotated 27º.

Table 1 Intraclass correlation coefficient

Results

The interobserver reliability of rotational measurement ranged from fair to good. The overall ICC measurement for interobserver reliability was 0.39 for the femoral component, 0.67 for the tibial component, and 0.62 for the combined rotation. The 95% confidence intervals were 0.21 to 0.56 for the femoral component, 0.47 to 0.80 for the tibial component, and 0.43 to 0.76 for the combined measurement (Table 2).

Table 2 Interobserver reliability

The intraobserver reliability of rotational measurement ranged from good to very good. The overall intraobserver ICC for the orthopaedic resident and faculty member was 0.61 for the femoral component, 0.81 for the tibial component, and 0.75 for the combined measurement. The respective 95% confidence intervals were 0.46 to 0.71 for the femoral component, 0.73 to 0.86 for the tibial component, and 0.66 to 0.83 for combined rotation (Table 3).

Table 3 Intraobserver reliability

The intraobserver overall ± 5° margin of equivalency for the femoral component was 95%; for the tibial component, it was 69%; and for the combined rotation, there was a 54% chance that all the measurements taken would fall within 5° of each other (Table 4). The intraobserver repeatability coefficient for the femoral component was 5°, tibial component 11°, and the combined rotation 12° (Table 4).

Table 4 Intraobserver agreement and test repeatability

Discussion

Attention to implant rotation after TKA has increased following the contribution of Berger et al. [4]. Additionally, Bedard et al. [3] demonstrated the benefit of revision surgery to correct malrotation. In this study, the authors showed improved Knee Society score, pain scores, and ROM with revision surgery to correct internal rotation of TKA components and confirmed the correction with pre- and postrevision 2-D CT scans. Unfortunately, there is a lack of established literature evaluating the reliability of measuring rotation by 2-D CT scan. Our study evaluated the 2-D CT scans of TKA patients’ components to determine the inter- and intraobserver reliability and repeatability coefficient of our measurements for component rotation. The interobserver reliability measurement was good (0.62 combined) but much better for the tibial component than the femur. The 95% confidence intervals were rather wide for all measurements and better for the tibia than the femur. We found that the intraobserver reliability ICC measurements were good (0.75 combined) with the tibial component again having greater reliability than the femur. Also important were the results of the ± 5° equivalency calculations. Although it was not consistent for the measurement of the femoral component to be the same when done twice by the same reviewer, or another, the measurements were closely grouped within a few degrees. Therefore, it is likely that a given measurement will be within 5° of another on the femoral side. This is much less likely on the tibial side or for the combined rotation. According to this study, there is only a 54% chance that if the measurements are done twice, the combined rotations will be within 5° of each other. Perhaps most interesting were the results of the repeatability coefficient. The magnitude of variability by the same reviewer was high for all three measurements and significantly higher for the tibial component (11°) than the femur (5°). With the same reviewer reading the same scan twice, there was a 12° difference in combined rotation. This variability could significantly alter a surgeon’s evaluation and treatment plan. There are two limitations to our study worth considering. First, this is a relatively small sample (52 knees) and we used only three reviewers. Although our statistical power was adequate, larger studies with more patients or reviewers, and perhaps other practice settings, would help to determine how well our findings might generalize to others. Second, there is no validation with known controls to these measurements as would be done in most studies. It is possible that our measurements are significantly different from others and a known control value would identify this bias. Hirschmann et al. evaluated 30 knees in 29 patients who had undergone TKA with 2-D and three-dimensional (3-D) CT to determine which method would be more accurate [12]. They found the measurements on 3-D CT (intraobserver reliability ICC 0.91) were statistically better compared with 2-D (intraobserver ICC 0.29). Based on their findings, they recommended 3-D CT scans for determining component orientation of TKA.

The strength of this study is that the measurements are objective readings by three blinded reviewers using an established technique and protocol [5]. The technique by Berger and Rubash remains the common practice for measuring TKA component malrotation, and its contribution to our understanding of this complication should not be overlooked. A technique for using a 3-D CT scan has been described and may gain wide acceptance [12]. Our study demonstrates that the 2-D CT protocol most commonly used to assess rotation of TKA components has good but not excellent intra- and interobserver reliability and that there can be a significant magnitude of variability in degrees measured by the same reviewer. Our findings highlight the difficulty measuring component malrotation with results that are precise and repeatable. The clinical use of this technique must be carefully considered therefore if the attribution of symptoms or recommendation for revision surgery is based on the CT measurement of malrotation.