Introduction

With the introduction of Charnley’s low friction hip arthroplasty in the 1960’s, total hip arthroplasty (THA) has reshaped the natural history of hip arthritis. A disease formerly managed with soft-tissue interposition yielding meager results could suddenly be addressed with cement, metal and polyethylene with high rates of success [1]. THA has continued to gain popularity and it is estimated that 635,000 THAs will be performed annually by the year 2030 [2]. Over the last several decades, technological advancements have been made in surgical technique, implant design, and material science. In the United States cementless designs have generally replaced traditional cement fixation and several iterations of bearing surfaces have been developed to combat wear and osteolysis [3]. As implant longevity has significantly improved, younger patients are being considered candidates for THA.

Although THA is primarily used to address the effects of end stage osteoarthritis in older, lower demand individuals, an increasing number of extremely young patients are undergoing THA to address end stage hip disease [4]. Historically, the literature has shown decreased survivability and increased revision rates in THAs performed on young patients, but more recent reviews have begun to demonstrate significantly improved implant survivability [Wangen et al.; Chmell et al.; Lee et al.; Taheriazam and Saeidinia, “Short-Term Outcomes of One-Stage Bilateral Total Hip Arthroplasty in Young Patients (< 30 Years Old)”]. While one may conclude that improvements in THA outcomes and longevity may be attributable to advances in technique and technology, no study has provided conclusive evidence to support this assumption.

The goal of this systematic review and meta-analysis is to track the evolution of THAs performed patients less than 30 years of age to determine if any conclusions can be drawn comparing newer techniques and technologies to previous generations. We specifically evaluated: (1) changes in surgical indications, (2) implant related differences, (3) implant survivability, and (4) objective clinical and radiographic outcomes.

Materials and methods

Search strategy and criteria

A systematic review was conducted according to the Preferred Reported Items for Systematic Reviews and Meta-Analysis Statement for Individual Patient Data (PRISMA) [9]. All included studies were retrospective in nature, thereby making this review Level III evidence. Comprehensive database queries of MEDLINE, EMBASE, The Cochrane Library, and Google Scholar were performed including all articles published up to May of 2020. Articles were filtered using the following search terms: “Total hip arthroplasty” OR “Total hip replacement” OR “Hip Prosthesis” in combination with “Juvenile” OR “Immature” OR “Adolescent” OR “Young” OR “Under 30” OR “Under thirty.” Unpublished data and conference proceedings were excluded from the data set. Once a final list of articles had been obtained according to the inclusion and exclusion criteria listed below, the reference section from each study was reviewed to capture any article which may have been missed in our initial database search.

Inclusion and exclusion criteria

This review is specifically aimed at analyzing total hip arthroplasty performed on patients under 30 years of age; therefore, studies looking at hip resurfacing or hemiarthroplasty were excluded. Studies analyzing total knee arthroplasty in combination with THA were excluded from this review. All etiologies of end stage hip disease were considered.

Original articles written in English were considered and were included if the following criteria were met: (1) All patients in the study were under the age of 30; (2) All patients underwent THA; (3) Study population was greater than 5 patients; (4) Minimum of 2 years clinical follow–up; (5) Indications for THA were reported; (6) Radiographic outcomes reported; (7) Report of revision rates and indications; (8) Reporting of objective clinical outcome data. Studies were excluded if any of these criteria were not met. Review articles were also excluded.

Search results

A total of 20,816 articles were identified and exported to citation management software (Mendeley, London, United Kingdome) where 10,291 duplicate articles were removed. The remaining 10,525 articles were manually processed to remove 6501 not primarily focusing on total hip arthroplasty and another 3907 which included patients > 30 years old. Abstracts for the remaining 118 articles were reviewed independently by two authors to select studies with the inclusion criteria listed above. Where disagreement between authors existed, a third reviewer was consulted. Figure 1 provides a summary of the review process according to PRISMA guidelines [9]. Full length manuscripts of the remaining articles were reviewed by a single author and the bibliographies were cross referenced for completeness. In total, thirty-four articles met our inclusion criteria (Table 1) [5, 7, 8, 10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42].

Fig. 1
figure 1

PRISMA flow chart outlining article review process

Table 1 Authors, publication year, journal, type of study, and MINORS score for each paper included

The years during which patients were collected for each study was recorded and the midpoint was calculated. To show trends in indications, technique, and outcomes over time, we split the included articles as evenly as possible into three groups. The timeframe of surgery was collected for each study and the midpoint of this timeframe was calculated. The articles were then evenly distributed into two groups of 11 and one group of 12. Patients in Group I underwent THA at study midpoints from 1971 to 1992, Group II from 1993 to 1999, and Group III from 2000 to 2015. Studies in Group I had an average of 31 patients undergoing 42 THAs with a total of 459 THAs performed, Group II studies averaged 54 patients undergoing 71 THAs with a total of 780 THAs performed and Group III studies had an average of 53 patients undergoing an average of 77 THAs with a total of 918 THAs performed.

For each manuscript, the following data were recorded where possible: history of previous hip surgery, indication for surgery, surgical details including fixation technique and bearing surface selection, radiographic and clinical outcome data, revision rates, and indications for revision.

Assessment of study quality

Studies were independently reviewed each study and the Methodological Index for Non-randomized Studies (MINORS) scoring system for non-randomized studies was used to quantify the quality of each study (Table 10 of the appendix) [43]. The average MINORS score for Groups I, II, and III were 10.4 (9–11), 10.7 (10–12), and 10.8 (10–12), respectively.

Data collection and analysis

Data points were retrieved from each of the 34 studies including history of previous hip surgery, indications for total hip arthroplasty, surgical technique, bearing surface, assessment of follow-up radiographic studies, revision rates, and indications as well as clinical outcome measures. Information on previous hip surgery was not available for the majority of included studies (21 of 34 studies not reporting history of previous intervention). All but one study included indications for THA and all studies reported surgical technique, bearing surface, radiographic outcome, survival, and clinical outcome measures.

Data analysis was performed through Excel (Microsoft, Redmond, WA) and GraphPad (PRISM, San Diego, CA) under a fixed effect model. Categorical variables were compared with either Chi squared or Fisher’s exact test where appropriate. We used the Cochrane–Mantel–Haenszel adjustment in subgroup analysis of categorical variables to address the use repeated 2 × 2 contingency tables. Continuous variables were verified for normal distribution using Kolmogorov–Smirnov, Shapiro–Wilk or D'Agostino & Pearson tests. Frequency weighted means were reported, and Welch’s Analysis of Variance (ANOVA) test was performed on normally distributed data followed by Turkey–Kramer subgroup analysis. Kruskal–Wallis intergroup analysis followed by Nemenyi test subgroup analysis was performed for non-normally distributed continuous data. For each category, all three groups were analyzed together (intergroup) and subgroup analysis (intra-group) was performed reflexively if a significant p value was obtained. A p value of < 0.05 was considered significant.

Results

Age demographics

Each study provided mean age at the time of index total hip. All but one study provided age ranges. The mean age at index THA for Group I was 20.5 years, Group II was 22.1 years and Group III was 21.5. A summary of this data can be found in Table 2.

Table 2 Summary of age demographic data for each study group

Indications and history of previous hip surgery

The indications for THA significantly changed over time. In Group I, juvenile rheumatoid arthritis (JRA) was the most common indication for arthroplasty (40.8%) followed by congenital hip dislocation and post-traumatic arthritis. JRA remained the most common indication for THA in Group II (22.5%), with iatrogenic avascular necrosis (IAVN) being the most common indication for THA in Group III (41.5%). Overall, there was a decrease in the number of THAs performed for post-traumatic arthritis while the number of THAs performed for post-treatment AVN increased over time. Though congenital hip dislocations/developmental hip dysplasia was a common indication in all groupings, there was no significant change in rates of THAs performed for this condition (Table 3). Thirteen studies reported on prior history of hip surgery. As time progressed, proportionally more patients had undergone previous hip surgery prior to receiving THA (Group I—21.7%; Group II—30.8%, Group III—48.7%).

Table 3 Indications for THA in patients 30 years of age and younger

The average follow-up duration in Group I was 11.4 years, 11.5 years in Group II, and 6 years in Group III (Table 12 of the appendix).

Surgical fixation technique and bearing surface

There were significant differences in fixation technique between groups. The majority of patients in Group I underwent cementation of both the femoral and cup components (60.4%), Group II received primarily cementless components (65.6%) and Group III had nearly ubiquitous use of cementless fixation (95.9%) (Table 4).

Table 4 Implant fixation methods for THA in patients 30 years of age and younger

A myriad of bearing surfaces were employed and various trends were recognized. The most common bearing surface in Group I was metal on ultra-high molecular weight polyethylene (UHMWPE) (71.7%) which was supplanted by ceramic-on-ceramic bearings in Group II (34.5%) and then ceramic on highly cross-linked polyethylene (HXLPE) in Group III (42.5%) (Table 5).

Table 5 Bearing surfaces used in THA for patients 30 years of age and younger

Implant survival and indications for revision

Overall, there was a significant decrease in revision rates over time. Nearly one-third of patients in Group I required a future revision THA (29.8%), dropping to 15.5% in Group II and further falling to 4.5% in Group III (Table 6). There was no significant difference in the average time to revision (Table 11 of the appendix).

Table 6 Mean revision rates for THA in patients age 30 years and younger

The main indication for revision THA in all groups was aseptic loosening, though there was significant reduction in the rates of aseptic loosening over time (Group I—22.4%, Group II—9.9%, Group III—1.7%). Recurrent instability was the second most common cause for revision in Groups I and II (2.6% and 2.1%, respectively) but was not a frequent cause for revision in Group III. Lastly, there was a significant increase in revision THA performed secondary to wear in Group III compared to groups I and II (Table 7).

Table 7 Indications for revision THA in patients age 30 years and younger

Radiographic and clinical outcome

Radiographic evidence of loosening was reported as either isolated femoral stem loosening, isolated cup loosening or combined cup and stem loosening. In all categories, there was a significant decrease in reported loosening over time (Table 8).

Table 8 Radiographic loosening seen in THA performed in patients age 30 years and younger

Clinical outcome data were reported via several different protocols. A total of seven studies in Groups I and II implemented the modified Merle d’Aubigne (modMdA) score to assess pre-operative and post-operative function. Improvement in modMdA score between Groups I and II were similar (7.9 and 7.7, respectively). The majority of included studies used Harris Hip Score (HHS) to assess clinical outcome. In total, 18 studies reported pre-operative HHS and 24 reported post-operative scores. Analysis showed no significant difference in HHS score improvement amongst the three groups of studies (Table 9).

Table 9 Improvement in Harris Hip Scores in THA for patients age 30 years and younger

Discussion

Total hip arthroplasty has served as an excellent long-term solution to end stage hip arthritis in older, lower demand patients but has historically demonstrated variable success in extremely young patients. While THA has undergone several iterations over the last 5 decades, it is unknown whether these advances have translated to improvements in extremely young patients requiring arthroplasty. A few systematic reviews and meta-analyses have been published investigating overall outcomes of THA in patients under 30 years old, but no review has established whether modern surgical techniques offer any improvement over previous generations [44,45,46]. The goal of this study was to track changes in surgical indications, technique, technology, survivability, and outcomes over time to determine if modern methods have resulted in patients benefit.

Regarding our first study objective on operative indications, over time we found that proportionally fewer patients were undergoing THA for a primary diagnosis of juvenile rheumatoid arthritis (JRA). This is consistent with the literature describing the overall rate of THA performed for rheumatic patients. In a large database review by Mertelsmann-Voss et al., the rate of arthroplasty performed in patients with JRA decreased at a rate of 3.6% per year from 1991 to 2005, and the mean age to arthroplasty rose substantially from 30.9 to 36.7 years. The authors primarily attributed this to the introduction of disease modifying antirheumatic drugs (DMARDS) beginning with methotrexate in the early 1990’s followed by the biologic DMARDS targeting tumor necrosis factor-alpha in 1998 [47].

Our review of the literature further demonstrated that the proportion of patients undergoing THA for idiopathic AVN had significantly increased over time. Reports have shown that the rates of idiopathic AVN have remained relatively stable, so this increase is likely a proportional and reciprocal response to the significant decrease in rates of THA for JRA [48, 49]. We also showed an increase in the proportion of patients undergoing THA as a result of treatment induced osteonecrosis of the femoral head, specifically resulting from glucocorticoid use. Studies have shown that the rates of post-treatment osteonecrosis of the femoral head doubled from 1989 to 2003, but it is thought to be the result of increased use of MRI in evaluating hip pathology rather than an actual increase in disease prevalance [50]. Lastly, we found that the proportion of THAs performed for post-traumatic arthritis was significantly lower in Group III compared to Groups I and II. This may be the result of improved management of fractures about the hip, but literature supporting this claim is limited.

In evaluating surgical technique, we assessed both the method of fixation as well as choice in bearing surface. As expected, Group I demonstrated the heaviest utilization of cement fixation for both the femoral and acetabular components. Cement fixation was supplanted by press fit technology in Group II, although there was still frequent reliance on cement fixation for one or both components. Modern implants are almost exclusively cementless designs. Literature comparing cemented to cementless technology suggests that cementless components may be less prone to late aseptic loosening compared their cemented counterparts [51,52,53]. Bearing surface selection is a controversial topic and this is reflected in the literature by the myriad of bearing surface combinations reported. Early studies were limited to metal heads articulating with monobloc ultra-high molecular weight polyethylene (UHMWPE) cups which were quickly supplanted with metal shells containing UHMWPE liners. The timeframe of Group II coincides with the development of highly cross-linked polyethylene (HXLPE) which exhibited improved wear properties over conventional UHMWPE and decreased osteolysis secondary to particulate wear products [54]. This period also demonstrated the highest proportions of ceramic-on-ceramic and metal-on-metal bearing interfaces in a continued search to combat issues with wear, durability and corrosion. Unfortunately, ceramic liners have been prone to fracture and metal-on-metal implants tend to demonstrate high levels of failure secondary to metal ion production [55,56,57]. While no consensus exists on bearing surface selection for younger patients, the most common combination (as demonstrated in Group III) is a ceramic head and a HXLPE liner which combines low wear rates with decreased fretting corrosion, virtually eliminates issues with metal ion production, and does not suffer from problems related to brittleness.

Implant survivorship was shown to improve significantly over time likely due to a synthesis of the factors mentioned above. It has been demonstrated that patients undergoing THA for JRA have worse implant survivorship compared to their primary osteoarthritis counterparts [6, 11, 12]. Thus, as the proportion of patients indicated for THA secondary to JRA decreased, implant survivorship reciprocally increased. Similarly, transitioning from cemented to cementless components has shown a decrease in late aseptic loosening as mentioned above. Lastly, as material science evolved, more stable bearing surfaces were developed which decreased the amount of peri-implant osteolysis secondary to particulate byproducts. Interestingly, our data show a proportional increase in the number of revisions performed secondary to wear in Group III compared with Groups I and II. The cause for this is likely twofold. First, the revision rate in Group III was extremely low compared to the other groups which magnifies the effect of individual patients. Second, patients in Group III may not have experienced early complications such as aseptic loosening or recurrent instability in their course of care thus allowing for a long-term mechanism (i.e. wear) to be a more prominent indication for revision.

In regard to radiographic outcomes, literature demonstrates a significant decrease in implant radiolucency over time. This is supported by the discussion above on indications, fixation strategy and bearing surface. Fewer patients with JRA, decreased use of cemented fixation and avoidance of bearing surfaces prone to peri-implant osteolysis all combine to improve prevalence of radiographic loosening. Interestingly, the literature did not show a significant difference in overall clinical improvement, measured by Harris Hip Score (HHS), over time. This indicates that despite improvements in implant survivability and decreased radiolucency as technology advanced, patients are receiving the same level of pain relief and functional improvement as they were 5 decades ago. This result must be taken with a word of caution as several early studies did not report pre-operative HHS or employed a completely different clinical outcome scoring system (Merle d’Aubigne, hip disability and osteoarthritis outcome score, etc.).

Though this is an exhaustive review of the current literature, it has several limitations. First, none of the studies meeting inclusion criteria were prospective, randomized controlled trials (RCTs) which limits the value of data pooling. Compounding the issue was a lack of standardized reporting of various categories of data. For instance, earlier studies frequently reported Merle d’Aubigne scores to assess clinical outcome while later studies employed Harris Hip Scores as standard. Other inconsistencies included failure to obtain pre-operative clinical scores in some studies, variability in radiographic reporting methods, and a majority of studies not including rates of previous hip surgery. Furthermore, splitting the studies into separate groups precluded performance of heterogeneity studies such as an I2 statistic. That being said, this collection of studies is the best available and conclusions should be viewed cautiously. There were several articles excluded from the final review as they were missing key data points. While inclusion of these studies could have bolstered support for other categories, we did not want to compromise on study quality especially in light of the fact that all included articles were retrospective. A final point to consider is the method used to group the studies. In an effort to ensure that each defined group had enough data, we decided to split the 34 studies as evenly as possible according to study midpoint. While this succeeded in providing a fairly consistent data volume across groupings, this method of grouping is susceptible to variability in publication rates on the topic of THA in patients under 30 years of age. This is illustrated by the fact that the studies included in Group I span several decades while Group II merely spans 8 years. Nevertheless, we believe that the frequency of published articles pertaining to this topic directly mirrors changes in industry and surgical techniques, thus justifying our method of study grouping.

In conclusion, this systematic review and meta-analysis has shown that indications for THA in patients younger than 30 years of age have changed over time transitioning from JRA to IAVN, though several etiologies can contribute to end stage hip disease. Fixation strategies have moved away from cemented implants and are nearly exclusively cementless. This shift in technique along with advances in bearing surface technology have significantly improved rates of revision and radiographic loosening. Nevertheless, post-operative improvement in clinical outcome scores have remained stable over the decades.