Introduction

Total hip replacement is one of the most performed elective orthopaedic surgical procedures. Western civilisation is continuously aging, with the elderly population of Europe (> 65 years old) expected to double within the next 10–20 years [1]. Annually, over 800,000 new hip replacements are performed worldwide. In 2014, about 3.2% of the population aged > 55 years had an artificial joint, while another 2% required one [2]. The increasing life expectancy and number of hip arthroplasties performed—especially in younger, more active patients—elevate the statistical chance of falls, and occurrence of periprosthetic fractures [3]. Periprosthetic femoral fractures (PFFs) are becoming increasingly common and challenging, usually concerning individuals older than 78 years; naturally, their number of comorbidities is higher than in the younger population [4].

The incidence of PFFs after total hip replacement is reportedly 0.1–18% [5]; the Mayo Clinique Joint Replacement Database reports periprosthetic fractures in 1% of 23,980 primary hip replacements. Lindahl et al. previously reported a PFF rate of up to 9.5%, taken from the Swedish Hip Registry [3]. Analysing 1751 patients from the same registry, Chatziagorou et al. reported that the incidence of fractures around the femoral stem had increased from 1/1000 patients in 2001, to 1.4/1000 patients in 2011; the rate of fracture in patients aged > 80 years increased from 1.3 to 2.3 per 1000 primary total hip arthroplasties (THA) between 2001 and 2011 [6].

The Vancouver classification, described in 1995, is the most widely accepted classification for periprosthetic fractures around the femoral stem [7]. It has demonstrated high reliability and inter- and intra-observer agreement, and has been validated in numerous studies, with several advantages over other classifications; the most important being its value in establishing a surgical strategy [8,9,10,11]. Fractures with loose stems are most common; B2 and B3 type fractures are reported in 75–85% of all periprosthetic hip fractures [2, 7]. If the femoral stem is loose, classic treatment involves revision arthroplasty using cerclage, plates, strut allografts, or a combination of thereof [12].

Recent literature reports open reduction and internal fixation (ORIF) without stem revision for the treatment of type B2 PFFs, particularly in the aged population, or in patients with multiple comorbidities [13, 14]. Some advocate ORIF use due to its lesser surgical complexity, others suggest that properly re-fixed stems can osseointegrate with good results; most importantly, fewer surgery-related risks are involved in ORIF, compared with stem revision surgery [13,14,15].

We observed that surgeons within our institution tended to prefer ORIF over femoral stem revision for Vancouver B2 fractures, even though the classic approach indicates otherwise. It is unclear whether their decision to choose ORIF was intuitive, or related to a lack of experience, particularly regarding younger specialists; alternatively, it may be based on recently emerging literature supporting this treatment modality for B2 fractures.

All patients treated for Vancouver B2 PFFs were divided into two groups: those treated via stem revision, and those treated via ORIF. Our primary objective was to retrospectively analyse the parameters that were significantly different between ORIF and stem revision groups. We then examined the pre-operative parameters, including patient age, BMI, and ASA score. The intra-operative parameters—anaesthesia type, blood loss, operating time, and surgeon’s experience—were compared between the two groups. Finally, post-operative parameters including weight-bearing, length of hospital stay and radiological outcome were analysed. Our null hypothesis was that there was no difference in these parameters between the two groups.

Materials and methods

After obtaining institutional review board approval, a retrospective medical chart and radiograph review was performed on 2476 patients who underwent hemi- or total hip arthroplasty, followed up at two university hospitals. All revisions that occurred between 1st January 2010, and 15th November 2019, were studied. After excluding all other reasons for hip replacement failure, we outlined 105 periprosthetic hip fractures. All femoral fractures, except for Vancouver B2, were excluded; a total of 76 patients with 81 periprosthetic Vancouver B2 fractures around the cementless stem were included in this study. The average age of patients at the index operation was 70.7 (18–94) years, the average age at the time of PFF surgery was 76.6 (29–98) years, and the median follow-up times from the index and revision procedures were 134 (12–312) and 60 (6–111) months, respectively. Patients were divided into two groups: the revision (n = 27) and ORIF (n = 54) groups.

The revision group included patients whose primary implant was removed; a revision stem, bypassing the fracture by at least two diaphyseal diameters, was implanted. Additional hardware, such as cerclage wires or plates, was used at the surgeon’s discretion. In ORIF group, the fractures were treated by cerclage alone (n = 10), plate with screws and cerclage (n = 14), hook plate with screws and cerclage (n = 26) or by a combination of plate, strut allograft, screws and cerclage wires (n = 4). In all ORIF cases, the original stem was preserved.

Medical records

The dates of index arthroplasty and reoperation (ORIF or stem revision), and blood loss during the revision procedure, both from the surgical protocol and from the anaesthetist reports, were carefully recorded. Patients’ BMI, ASA score, and type of anaesthesia during revision surgery were analysed. The duration of the procedure, from skin incision until wound closure, was precisely calculated, and the length of hospital stay before discharge or referral to the rehabilitation centre was calculated. Furthermore, the level of experience (years) of the surgeon who performed the second intervention was analysed; all surgeons included perform at least 21 hip replacements per year.

The correlation between the surgeon’s level of experience and their tendency to choose ORIF or revision was analysed, hypothesizing that younger surgeons would favour an ORIF procedure. Following the operation, the weight-bearing protocol was reviewed. Patients were divided into seven groups: immediate full weight-bearing; 50% weight-bearing; and 2, 4, 6, 8, 12 weeks of no weight-bearing on the operated side.

Radiographic analysis

The pre- and post-operative images of the index arthroplasties were evaluated as a reference. The post-traumatic radiographs were analysed to define the extension of the fracture, level of bone comminution, subsidence, rotational displacement, and angular migration of the stem. After operation, anteroposterior, lateral, and oblique radiographs were evaluated; the AGORA Roentgenographic Assessment (ARA) score was used to evaluate fractures around the stems [16]. Published in 1995, this is a numeric score with an initial value of six, that subtracts points for each parameter that reduces the stability of the stem; results may be interpreted as excellent (5–6), good (4), fair (3), and poor (≤ 2). The ARA score correlates with the more familiar Engh score; however, the former is presumably better adapted to modern-type stems, with better discriminative capacity in the upper spectrum of the radiographic images [17].

Statistical analysis

A t test was used for independent variables, whereas categorical variables were compared using the Chi-squared test through cross-tabulation; McNemar’s test was used for the surgeon’s experience. The Mann–Whitney test was used for non-normally distributed variables; a p value < 0.05 was considered statistically significant.

Results

Medical record

Of the 2476 hip replacements followed up in our clinic, 297 (12%) arthroplasty failures were surgically treated between 1st January 2010 and 15th November 2019; 37% of failures were due to aseptic loosening, 9% due to infections, 18% due to dislocations, and ~ 1% due to technical or other reasons. A total of 105 patients were admitted to the emergency department for periprosthetic hip fracture, representing 35% of all revised failures, and 4.2% of overall hip replacements.

Of the 105 periprosthetic hip fractures, one was around the acetabulum without femoral fracture, two were Vancouver type AG, eight were type B1, 81 were type B2, nine were type B3, and 4 were type C.

Seventy-six patients were treated for 81 Vancouver B2 PFFs (male: n = 22, female: n = 59). The ORIF and revision groups included 54 and 27 fractures, respectively. The average age of patients at the index procedure was 70.7 (SD = 14.4, range: 24–88) years for both groups, and 74.3 (SD = 11.2) and 62.6 (SD = 17.2) years for the ORIF and revision groups, respectively; the average age at the second surgery was 76.6 (SD = 13.4; range: 29–98) years for both groups, and 85.4 (SD = 12.1) and 75.1 (SD = 16.6) years for the ORIF and revision groups, respectively. The ratio between the left and right hips was 44:37.

Comparing the mean age at the second surgery in the ORIF (85.4 years) and revision (75.1 years) groups, we found that the ORIF group was significantly older (p = 0.002). Patients’ BMI (p = 0.89) and ASA score (p = 0.19) did not differ significantly between the two groups, and were therefore not considered determining criteria for either ORIF (23.9) or revision (26.1). The mean lengths of hospital stay were 19.4 and 21.7 days in the ORIF and revision groups, respectively; no significant difference in hospitalization time was found between the two groups (p = 0.28).

Regarding types of anaesthesia, four possibilities were observed: epidural, epidural with femoral bloc, general, and general with femoral bloc. Although the revision group tended toward general anaesthesia, this difference was not significant (p = 0.23). The average blood loss in the ORIF group was significantly less than in the revision group, 390.7 and 1141.6 ml, respectively, p < 0.0001; (Fig. 1). The mean operating times were 225 and 134.5 min in the revision and ORIF groups, respectively (Fig. 2), which was significant (p < 0.0001).

Fig. 1
figure 1

Blood loss (ml) in the revision and ORIF groups

Fig. 2
figure 2

The duration of surgery in minutes for revision and ORIF groups

Surgeons’ experience was calculated based on the number of years performing hip replacement surgeries. Sixteen surgeons were divided in four clusters: 1–5, 5–10, 15–20, and 25–30 years of hip arthroplasty experience; each cluster included four surgeons. These four clusters were compared to examine the correlation between level of experience and the choice between ORIF and revision surgery; no significant correlation was determined (p = 0.79).

Cross-tabulating the two groups with seven post-operative weight-bearing categories indicated significantly earlier weight-bearing in the revision group (p = 0.002).

Radiographic analysis

The pre-fracture and follow-up images, for at least 12 (mean: 91.3; range: 12–262.3) months after the index arthroplasty, were scored using ARA assessment. The ARA score following the second procedure, either ORIF or revision, was calculated based on X-rays performed 12 (mean: 36.3; range: 6–58.9) months after the procedure. Exceptions were made for three patients who died before their 12-month follow-up (images analysed six months after the procedure).

In the ORIF group, the mean ARA scores before periprosthetic fracture and after PFF fixation with ORIF were 5.25 (SD = 0.73) and 5.08 (SD = 0.92), respectively. In the revision group, the mean scores were 4.87 (SD = 1.01) after the index procedure and 4.96 (SD = 1.1) after revision.

The most frequently seen negative radiographic ARA score parameters, before periprosthetic fractures, were calcar remodelling and hypertrophy (− 2 points), initial osteolysis (− 2 points), and to a lesser extent, lucent lines in the smooth zones (− 2 points) and reactive lines in < 50% of the porous zones (− 2 points). Pedestal formation, cortical hypertrophy, migration, and interface deterioration were seen much less frequently.

After ORIF or revision procedures, the most commonly seen ARA score parameters were calcar remodelling (− 2 points), lucent lines in smooth and/or porous zones (− 2 and − 3 points, respectively), initial or progressive osteolysis (− 2 and − 4 points, respectively), and moderate migration or varus (− 2 points). Less often, severe migration (− 4 points) was observed; pedestal formation and cortical hypertrophy were rarely observed. All those parameters were present almost equally in ORIF and revision groups. The 10 cases in ORIF group treated by cerclage wires only, showed a slightly worse ARA score compared to other modalities (p = 0.35).

Case examples

Case 1

A 70-year-old male patient with periprosthetic Vancouver type B2 fracture around cementless stem. (Fig. 3). The index procedure was done 8 years prior to the fracture. At the time of fracture his BMI was 32 and ASA score 3. The patient had general anaesthesia. The blood loss was 600 ml and the operating time was 150 min. He was operated by a surgeon with 15 years of experience. The patient spent 4 days in the orthopaedic ward and 16 days in the rehabilitation centre. The patient had 8 weeks of partial weight-bearing. After the index procedure his ARA score was 6 and it remained the same at the follow-up of one and 4.5 years. The patient reported Forgotten Joint score (FJS) of 92 and Oxford Hip score (OHS) of 42 at the last visit.

Fig. 3
figure 3

Periprosthetic fracture Vancouver type B2 (a), ORIF with two plates and cerclage wires (b), The X-ray at 4.5 years after the ORIF surgery AP view (c) and lateral view (d)

Case 2

A female patient of 94 years was hospitalised with Vancouver B2 fracture around a cementless stem after a fall (Fig. 4). The index procedure was done when she was 84 years old. At the time of ORIF procedure, the patient had a BMI 22 and ASA score 3. She was operated under spinal anaesthesia. The blood loss was 450 cc and the operating time was 140 min. She was operated by a surgeon with more than 25 years of experience. The patient spent 7 days in the orthopaedic ward, then 24 days in a specialized centre for convalescence. The patient was asked to walk with toe-touch on the right side for 6 weeks. Her ARA score was a constant 6 after the index arthroplasty and at 1 and 5 years after the ORIF surgery. At 5 years’ follow-up, the FJS was 82 and the OHS was 34, she walked into the office by herself without a walking aid.

Fig. 4
figure 4

Periprosthetic fracture Vancouver type B2 (a), Post-operative radiograph showing a hook plate with screws and cerclage wires (b), A radiograph at 5 years, note a slight radiolucency of the lateral cortex and greater trochanter, but an excellent overall hardware position

Case 3

A female patient whose hip had been replaced at the age of 82, suffered a B2 type periprosthetic fracture 19 months later (Fig. 5). The BMI was 19, ASA score 2. The patient lost 550 ml of blood during a 140-min-long surgery under general anaesthesia. She was operated by a surgeon who had finished his residency two years earlier. The patient stayed 6 days in the surgical ward and 17 days in the rehabilitation. The weight-bearing was not allowed for 12 weeks after surgery. The ARA score was 6 after the index THA. After the ORIF procedure, it decreased to 4 due to subsidence and varus migration at 1 and 3 years’ follow-up. At her latest visit, she presented FJS of 72 and OHS of 32.

Fig. 5
figure 5

Periprosthetic fracture Vancouver type B2 (a), Immediate post-operative radiograph—note the unsatisfactory reduction and malrotation (b), 3-year post-operative radiograph—note the subsidence of the stem and the proximal cerclage wires cut through the medial cortex

Discussion

The 4.2% of PFFs in our study was within the 0.1–18% range reported in the literature [5, 6]. The percentage of revision due to aseptic loosening, dislocation, or infection was also comparable to the rates reported; however, the rate of periprosthetic fractures was 35% of all revised failures, which was somewhat higher than reported by other researchers [18]. (Table 1)

Table 1 Studies comparing indications for revision of hip arthroplasties

We observed a proportion of Vancouver B2 fractures comparable to those reported in the literature, representing 78% of all PFFs in our study [19]. Remarkably, all surgeons between the two university hospitals tended to prefer ORIF to revision when patients were older. Even in the relatively small cohort included in this study, we showed that the average age in the ORIF group was over 10 years higher than in the revision group (85.4 vs 75.1 years). We speculated that ORIF was chosen as the older population with PFF presents a higher number of comorbidities, often precluding more complex procedures, such as stem revision. However, the ASA score, designed to serve as an indicator for a patient’s general state, was not significantly different between the ORIF and revision groups, which does not support our speculation.

Along with the difference in age in the ORIF and revision groups, we found a significant difference in blood loss and operating time. As demonstrated by Manara et al., these two parameters are inter-related; the probability of massive blood loss increases by 3% for every minute surgery is prolonged [20]. A three times lesser blood loss and twice shorter operating time in the ORIF group cannot be overlooked; these are crucial factors in a patient’s ability to tolerate the surgery and allow for better post-operative recovery. We observed that weight-bearing was allowed earlier in the revision group, which may favour stem revision surgery. However, the weight-bearing protocol after PFF varies significantly between institutions [21].

Regarding surgical experience, the hypothesis that younger surgeons prefer ORIF due to its lesser complexity was rejected; no correlations were found between years of experience and the choice of treatment modality for type B2 PFFs. We intentionally presented the third case (Fig. 5) which showed a migration of the stem, rarely seen in our ORIF cohort. Despite the stem subsidence, the patient was satisfied with the outcome. Possibly the ORIF was a poor choice for this stem design or in this particular case, surgeon’s experience was a disadvantageous factor.

Regarding radiographic evaluation, the ARA score was a convenient, efficient, and precise tool to evaluate the radiographs of the cementless stems after ORIF or revision surgery.

We found no studies using ARA score for periprosthetic fractures in the literature. Comparing results between the index arthroplasty and second procedure, we observed a slight decline in ARA score in the ORIF group, and a small improvement in the revision group; no significant difference was observed between the two groups, probably due to the small sample size. An important observation is that patients who underwent ORIF in our study showed neither major hardware failure, nor stem migration or displacement; these potential complications are often used as an argument for stem revision. Recent data suggest that ORIF can offer results equivalent to stem revision [15]; our radiological results propose the same. Moreover, less blood loss and a shorter operating time suggest that ORIF is a viable treatment option for Vancouver B2 PFFs [2, 14].

Some points may strengthen this study; first, it would be beneficial to perform an in-depth analysis of implant type and model, and exact modality of internal fixation used, rather than simply dividing patients into ORIF and revision groups. Second, a mortality analysis would be useful. Third, comorbidities could be further explored, using a Charlson score, for example. Despite these weaknesses, we encourage further, prospective, larger sample size investigations regarding the role of ORIF in Vancouver B2 fracture treatment. Considering that the radiographic results are comparable between the two groups, the null hypothesis is rejected, indicating that ORIF could be beneficial in patients older than 75 years, particularly from the operation duration and blood loss standpoint.