Introduction

Hip resurfacing arthroplasty (HRA) has gained renewed interest since the application of metal-on-metal bearing surfaces in the 1990s [13]. The procedure was revisited to address the high failure rates in young patients with standard stemmed total hip arthroplasty (THA). Many reports have documented promising mid-term outcomes of this surgical technique. In an effort to improve the results of HRA, various factors have been identified that may lead to a higher risk of failure. Initially, in a small group of 119 patients, Beaule identified femoral head cysts, previous hip surgery, low patient weight (<82 kg) and high activity (UCLA score ≥6) as risk factors for HRA. This was called the surface arthroplasty risk index (SARI) [4]. Soon, other studies found that older age and female gender were associated with more complications, particularly femoral neck fracture [2, 5]. Recently two studies have implicated smaller component size, rather than low weight or female gender, as the factors truly responsible for higher failure rates in HRA [6, 7] .

Dysplasia is well known to be a high-risk group in stemmed THA [8, 9]. The primary causes of failure have been instability and failure of socket fixation. Theoretically, larger bearings in HRA should decrease the problem of instability, but limited fixation options with acetabular components in HRA may increase the socket fixation problems. In addition, adverse wear has been associated with dysplasia in HRA in one study [10]. Dysplasia, female gender and small component size have all been identified as risk factors for HRA [6, 11]. These factors are interrelated: dysplasia is more common in young women presenting for HRA, while small component size is more common in women and in dysplastic patients. Our hypothesis is that difficulty in gaining component fixation in deformed dysplastic sockets is the primary risk factor for HRA, underlying the reported risk factors of female gender and small component size. The purpose of this study was to compare the clinical and radiological outcomes of a group of osteoarthritis (OA) and dysplasia patients from our database in order to determine whether our hypothesis is correct.

Materials and methods

This study was approved by the institutional review board (IRB). From January 2002 to July 2008, the senior author (T.P.G.) performed 1,374 metal-on-metal HRAs. We selected all patients with the preoperative diagnosis of OA (1,082 cases) and compared them to all cases with a diagnosis of dysplasia (134 cases). The 158 hips with other diagnoses were excluded for this analysis. In total 1,216 HRA cases in 1,041 patients were included in this study. Corin Cormet 2000 hip resurfacing prostheses (Corin Group PLC, Cotswolds, UK) were used in 280 cases, Biomet Recap/Magnum hybrid hip resurfacing prostheses (Biomet, Warsaw, IN, USA) in 668 cases and Biomet Recap/Magnum fully porous coated hip resurfacing prostheses (Biomet, Warsaw, IN, USA) in 268 cases.

The Corin Cormet 2000 acetabular device has a dual plasma spray coating of titanium and hydroxyl apatite. The Biomet Magnum acetabular component has only a titanium plasma spray layer that is thicker. The arc of coverage in both components varies with implant size and is similar. Both are high carbon cast cobalt chrome with similar surface finishes, except that the Corin device is heat-treated. The undersurface of the Corin femoral device is a chamfer cylinder, while the Biomet femoral recap is a hemisphere on a cylinder. The Corin stem is tapered, while the Recap stem is cylindrical. Both cemented femoral components are grit blasted. The uncemented Recap also has a layer of titanium plasma spray on the component undersurface (but not the stem), similar to the Magnum acetabular component.

The first group of implants in this study were hybrid Corin Cormet 2000 implants as part of a US Food and Drug Administration (FDA) Investigational Device Exemption (IDE) study that the senior author led [12]. However, because the senior author wished to use an uncemented system and Corin was unable to provide this, he embarked on a development project with Biomet. The Biomet Recap /Magnum system became available as a hybrid system in 2005 and finally as an uncemented system in 2007 [13]. We now exclusively use uncemented components in all cases.

There were 867 (71%) cases implanted in 740 men and 349 (29%) cases implanted in 301 women. In men, the primary diagnosis was dysplasia in 38/867 (4%) cases and OA in 829/867 (96%) cases; in women, the diagnosis was dysplasia in 96/349 (28%) cases and OA in 253/349 (72%) cases (P < 0.001). Of all dysplasia patients in our series of young patients presenting for HRA 72% (96/134) were women, while men comprised only 29% of all cases. The average age for the whole group was 51 ± 7 years (range 20–78), the average body mass index was 27 ± 4 (range 17–55).

All operations were performed using the posterior approach. Over time, the approach was modified using minimally invasive techniques which were previously described [14]. All patients with Crowe I and II dysplasia were treated with a HRA. No structural bone grafts were used. To gain maximum component coverage, often an attempt was made to deepen the socket. In these cases, we drilled and measured the medial wall thickness with a depth gauge to avoid decreasing the medial wall thickness below 6 mm. The rare patients that we encountered with Crowe III and IV dysplasia were treated with stemmed THA and were therefore not included in this study. Often in dysplasia cases, the acetabular component was uncovered in the anterior-superior quadrant. We accepted up to 30% of lack of coverage, but did not document the actual amount intraoperatively. No supplemental fixation was available for the acetabular component at the time of this study.

The average American Society of Anesthesiologists (ASA) score was 2 ± 1 (range 1–4). The average femoral component size was 51 ± 4 mm (range 40–64 mm) and the average acetabular component size was 57 ± 4 mm (range 46–68 mm). There were 205 (17%) cases with the femoral component size <48 mm in the entire group. There were ten (1.2%) cases in men and 195 (55.9%) cases in women. The average size of femoral component was 46 ± 2.5 mm (range 40–56 mm) for the women and 53 ± 2.9 mm (range 40–64 mm) for the men (P < 0.001); the average size of femoral component was 48 ± 3.6 mm (range 40–58 mm) for the dysplasia patients and 51 ± 3.9 mm (range 44–64 mm) for the OA patients (P < 0.001). The average operation time was 113 ± 18 minutes (range 76–278 minutes), and the average hospital stay was 3 ± 1 days (range 1–11 days). The average estimated blood loss was 245 ± 120 cc (range 50–1000 cc). The cell saver was used in 98 cases, returning an average of 125 ± 69 cc (range 50–440 cc); 375 cc of autologous blood was used in one bilateral patient (2/1216, 0.2%). Post-operative follow-ups were requested at six weeks, one year, two years and every other year thereafter. Local patients were seen in our office. Out-of-state patients were requested to return to our office for follow-up, but often they submitted their questionnaires online, by mail or email, or had a phone interview. The physical examination data were obtained from a local physical therapist and radiographs were mailed to us. We defined failures as all cases that were revised either elsewhere or by us. We analysed the data using two methods: comparing failure rates and also survivorship curves. Failure rates were calculated without considering the variable of time, while survivorship curves take time into account.

Paired t tests were used to compare the numerical outcomes between pre-operative and post-operative visits; standard t tests were used to compare the differences in the numerical outcomes among different groups [15]. Chi-square tests were calculated when comparing categorical outcomes. Chi-square tests were also used to compare the differences of failure rates between groups without considering the time variable. Kaplan-Meier curves were plotted to evaluate the survivorship rates using revision as the end point. Cox proportional hazard regression models were used to evaluate the significance of each variable using both univariate and multivariate analyses. For all tests in this study, α = 0.05 was used as the level of significance. OrthoTrack (Midlands Orthopaedics, p. a., Columbia, SC) and JMP (SAS, Cary, NC, USA) were used for the statistical analyses in this study.

Results

In total, 1092 (1092/1216; 89.8%) cases completed their latest follow-up with latest X-rays available in 759 (62.4%) cases. The average length of follow-up was 5 ± 2 years (range 3–9 years). Eleven patients died due to causes unrelated to their hip surgery. Failures occurred in 39 of 1,216 cases (3.2%). Failure rates analysed by gender and diagnosis were listed in Table 1. Complications and failures were also tabulated individually for the three different prostheses (Table 2). Ten cases (0.8%) were revised due to femoral neck fracture and all occurred within six months post-operatively; 12 (0.9%) were revised due to acetabular component loosening (Fig. 1), ten (0.8%) were revised due to femoral component loosening, two (0.2%) were revised for deep infection, one (0.1%) traumatic intertrochanteric fracture was revised for psoas tendonitis, and three cases (0.3%) in two women were revised for adverse wear related failure. The three cases that were revised for adverse wear were all in women with acetabular inclination angles of greater than 60 degrees on standing anteroposterior pelvis X-rays who presented with pain, swelling, elevated metal ion levels, three dimensional imaging studies that showed fluid accumulation and metallosis at the time of surgery.

Table 1 Failure modes related to gender and diagnosis
Table 2 Complication and failure modes among three different hip resurfacing arthroplasty (HRA) prostheses groups
Fig. 1
figure 1

X-ray of female dysplasia patients before metal-on-metal hip resurfacing arthroplasty (a), after hip resurfacing arthroplasty (b), and failure of acetabular fixation (c)

The overall failure rate for dysplasia cases was 10/134 (7.4%). Fifty percent of these were due to a failure of acetabular fixation, and 20% were due to acetabular component malposition resulting in an adverse wear reaction. Therefore, 70% of our failures in dysplasia were due to problems with the acetabular component. Overall, there was a 0.4% (5/1216) rate of dislocation, there were no revisions required for instability, and only one dislocation occurred in 134 dysplasia patients (0.7%).

At eight year follow-up post-operatively, women with the primary diagnosis of dysplasia had a survivorship rate of only 75% (Table 3). The survivorship rate was worse when dysplasia was compared to OA, when women were compared to men, but not when components < 48 mm were compared to components ≥ 48 mm, at nine year follow-up post-operatively (Fig. 2).

Table 3 Statistical significance of survivorship curves
Fig. 2
figure 2

Time-weighted Kaplan-Meier survivorship curves for the study group up to nine years post-operatively and time independent failure rates. a Revision for any reason taken as the end point for the female group and the male group. b Revision for any reason taken as the end point for the group with dysplasia and the group with osteoarthritis (OA). c Revision for any reason taken as the end point for the group with the femoral component size < 48 mm and with the femoral component size ≥ 48 mm

Based on Cox proportional hazard models, the primary diagnosis, the component size (grouped by 48 mm), and the gender all had a significant effect on the survivorship rate (Table 4). However, only the primary diagnosis showed a significant effect in the multivariate model with a P-value of 0.05 and a hazard ratio of 2.31. Size of femoral components had a much stronger, though not statistically significant, effect on the survivorship rate than gender in the multivariate model.

Table 4 Results of univariate analyses and multivariate analysis with use of Cox proportional hazard regression models

Other complications included dislocation in four cases, deep vein thrombosis (DVT) in one case, sciatic nerve palsy in two cases, and deep infection in two cases as well as one traumatic intertrochanteric fracture repaired at two years after the index procedure. Excluding the revised cases, the average post-operative HHS score for the whole group improved significantly from a preoperative score of 54 ± 13 to 95 ± 9 at the latest follow-up (P < 0.001). The average postoperative UCLA activity score was 7 ± 2, and the average visual analog scale (VAS) pain scale score was 0 ± 1 on regular days and 2 ± 2 on the worst days.

Radiological analysis of the nonrevised hips revealed three cases of femoral stem radiolucency. There were also 11 cases with a reactive femoral shadow, 19 cases of focal femoral neck narrowing presumed to be due to posterior-lateral impingement. There were four cases with partial radiolucency around the acetabulum. There were 20 cases with Brooker I [16], five cases with Brooker II, and one case with Brooker III heterotopic bone.

Discussion

Our data indicate that the higher prevalence of dysplasia in young women is the primary reason that women have a higher risk of failure after HRA. Although others have indicated that small component size and female gender are important negative risk factors for HRA, our results indicate that the high incidence of dysplasia in young women presenting for HRA is the main reason that women fare worse. Furthermore, 70% of our failures in dysplasia are related to acetabular component fixation or positioning. In contrast to a widely reported study by a UK group [17], failure due to adverse wear (pseudotumuor) has been seen in only three cases in women in our study (3/1216, 0.2%).

Dysplasia is more common in young women presenting for HRA. In our study group 89% of cases were due to OA and only 11% of cases were due to dysplasia. Only 4% of men had dysplasia, compared to 28% of women. Although men outnumbered women by more than 2:1 in this series, 72% of dysplasia cases were in women.

Although the results of HRA for dysplasia patients is worse than for OA, the same can be said for stemmed THA where the rate of dislocation rate in dysplasia cases is reported from zero to 11% [18] and the failure rate is also significantly higher than for OA [1921].

Smaller femoral component size has previously been implicated as the risk factor accounting for the higher failure rate seen among women undergoing HRA [7, 22, 23]. None of these studies have included diagnosis in their multivariable analysis. Our data is in agreement with these studies—we find that small component size is a more important risk factor than female gender; however, when the additional variable of diagnosis (OA or dysplasia) was added into the multivariable analysis model, femoral component size became an insignificant risk factor (P = 0.09). Component size was found to be significantly smaller in dysplasia than in OA (P < 0.001).

Others have reported that the outcomes for HRA are worse for dysplasia than for OA patients [24, 25]. In Amstutz’s early study of dysplasia [18], there were more problems on the femoral side with a femoral neck fracture rate of 8.5%. Also, he noted that it was difficult to restore equality of leg length with HRA. Our study confirms the findings in McBryde’s case-control study [25] that acetabular problems are more common in dysplasia.

It is well known that dysplastic sockets are shallow and are often described as anteverted. We have noticed that they are always oval. This has been described in the literature, but is rarely emphasised. In our experience, the long axis of this oval is always from posterior/inferior to anterior/superior. We believe that this fact is responsible for the appearance of a more “anteverted” socket. It is also the reason that after reaming the socket, a wall deficiency sometimes remains in the anterior/superior quadrant of the socket. When the acetabular component is placed in the correct orientation, a variable degree of component “uncoverage” occurs that is difficult to quantify on postoperative plane X-ray (two dimensional). However, if the surgeon visually aligns the component with this deficient anterior superior corner, less coverage deficiency results; but instead the component may then be too anteverted and too steeply inclined. A dilemma exists—either place the component more anteverted and inclined and obtain good coverage of the porous coated wall (worse for wear, better for fixation), or place the component more horizontal and less anteverted and have a greater portion of the cup uncovered (better for wear, worse for fixation). The solution would seem to be using components with supplemental fixation placed in lower inclination and less anteversion, while grafting the defect that results. Mid-term studies have shown that the BHR using dysplasia components (with flanges and bolts), had fewer acetabular loosenings [26].

McMinn et al. have suggested [11] that another reason for higher wear-related failure in dysplasia might be due to excessive femoral component anteversion in dysplasia. This can only be measured using CT scans. We only had three adverse wear related failures (0.2%) that were related to acetabular malposition and performed no CT component measurements. We are therefore unable to evaluate whether femoral component anteversion played a role in our wear failures. In a CT based study, Hart has shown that while acetabular component inclination is strongly correlated with adverse wear, combined anteversion is only weakly correlated [27].

A limitation of this study is that a single experienced surgeon operated on all the cases in this study. Previous studies have shown that there is an extended learning curve for HRA [1, 28]. Therefore, the complications that are reported by a large single surgeon series may not reflect the same complications seen in national registries, which include HRAs performed by both experienced and inexperienced surgeons. We only evaluated the effect of gender, component size and diagnosis on the failure rate in HRA. Other factors may also be found to be significant upon closer analysis.

In summary, we have demonstrated that the higher prevalence of dysplasia in young women is the primary reason why women and patients with small component sizes have been found to have a higher failure rate with HRA. While others have had more problems with adverse wear in this patient population, we have had more problems with acetabular component fixation. Both of these problems are related to acetabular component implantation. Dysplastic patients also have a higher failure rate with stemmed THA. No direct comparison study exists. Therefore, we question the common recommendation to avoid HRA in young women. We suggest that it may be more productive to concentrate on developing more accurate implantation techniques (to avoid wear) together with the use of supplemental acetabular fixation such as flanges or spikes (to avoid fixation failure) in cases with significant defects. This may allow dysplasia patients, who are at a high risk for dislocation due to their inherent tissue laxity, to benefit from the enhanced stability of large metal bearings.