With an aging population, the number of older patients with hip fractures has substantially increased. It is expected that by 2050, about six million older individuals worldwide will experience a hip fracture each year.1 Hip fractures can cause severe pain and disability and even shorten life expectancy, which places a burden on patients, their families, and the social medical security system.2 Generally, nearly all patients with hip fractures undergo surgical treatment, which requires general anesthesia (GA) or regional anesthesia (RA), which includes spinal anesthesia, epidural anesthesia, and peripheral nerve blockade. Given the typical patient cohort’s prevalent comorbidities, such cardiopulmonary diseases, cerebrovascular diseases, and osteoporosis, surgery is associated with a high risk of developing perioperative complications, potentially culminating in mortality.3,4 The type of anesthesia may influence outcomes. Some studies have indicated that GA is associated with higher in-hospital mortality than RA is.5,6 Nevertheless, two recent large randomized controlled trials (RCTs) did not find a difference in 30-day or 60-day mortality between RA and GA.7,8 Recently, some published meta-analyses have studied postoperative mortality and other perioperative outcomes in patients undergoing hip fracture surgery with different anesthesia techniques. Some only focused on RCTs or limited research articles and showed conflicting conclusions.9,10,11,12,13 In general, meta-analyses that only look at RCTs may have a stronger certainty of evidence. Nevertheless, considering that RCTs are generally unable to include large numbers of patients as well as the low incidence of postoperative mortality, the results of such meta-analyses should be interpreted with caution. In particular, limited by incomplete follow-up, few comparable studies have examined longer-term mortality, such as 90-day mortality.14,15 The optimal anesthesia technique for hip fracture surgery is, therefore, still controversial.16

In this updated systematic review and meta-analysis, we sought to include sufficient recent data to conduct a comparatively comprehensive and systematic study to assess the short-term to long-term mortality and other perioperative outcomes of RA vs GA in patients undergoing hip fracture surgery. We chose 30-day mortality as the primary outcome and 90-day mortality, in-hospital mortality, perioperative complications, length of hospital stay, and length of surgery as the secondary outcomes.

Methods

Search strategy and selection criteria

This review is reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.17 The study has been registered in the prospective register of systematic reviews, PROSPERO (CRD42023411854). Two independent authors systematically searched the electronic databases including MEDLINE, Embase, Web of Science and the Cochrane Library for articles published from the construction of the library to 15 May 2023. Search terms were applied to both subject headings and as keywords and the language of publication was restricted to English. Search terms, used both alone and in combination, included but were not limited to “spinal anesthesia” OR “regional anesthesia” OR “general anesthesia” AND “hip fractures” OR “femoral neck fractures” OR “arthroplasty, replacement, hip” OR “intertrochanteric fractures” OR “trochanteric fractures.” References to relevant reviews and the identified articles were also manually searched. Two authors independently screened the titles and abstracts after expurgating the duplicates. Subsequently, full texts of the identified studies were screened by the same two reviewers working independently and in duplicate to assess whether the studies met the inclusion criteria, and any disagreements were discussed with a third author.

Both RCTs and observational studies were eligible for this review. We included all studies that reported perioperative outcomes on RA compared with GA in patients undergoing hip fracture surgery. The following outcome measures were included: postoperative 30-day mortality, postoperative in-hospital mortality, postoperative 90-day mortality, perioperative complications (including postoperative delirium, pneumonia, myocardial infarction, intraoperative hypotension, and venous thromboembolism), length of hospital stay, and length of surgery. The following exclusion criteria were used: 1) articles published before 2000; 2) summary studies; 3) case reports or case series reports; 4) meeting summaries; and 5) inability to obtain the relevant information after contacting the author.

Data extraction

A structured table was designed to extract all the relevant data from each study that met the inclusion criteria by two independent reviewers. Extracted data were the first author’s name, publication year, country, sample size, age, American Society of Anesthesiologists (ASA) Physical Status, anesthesia technique, and study outcome measures. Any outstanding disagreements were resolved by consensus.

Methodological quality assessment

Two authors independently assessed the methodological quality of included RCTs and observational studies. Any disagreements were discussed with a third author. We used the Cochrane Collaboration risk of bias tool for randomized studies to assess the reporting quality and risk of bias of the included RCTs.18 This tool evaluates the following seven possible sources of bias: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other biases. The methodology for each study was classified into low, unclear, and high risk of bias. The Newcastle–Ottawa Scale (NOS) was used for observational studies to assess the risk of bias in individual studies.19 Each observational study was judged on three aspects: the selection of the study groups, the comparability of the groups, and the exposure or outcome of interest for the groups. Articles with NOS scores ≥ 6 were regarded as high-quality observational studies.

Statistical analyses

We performed a meta-analysis for postoperative mortality (including 30-day mortality, in-hospital mortality, and 90-day mortality), perioperative complications (including postoperative delirium, pneumonia, myocardial infarction, intraoperative hypotension, and venous thromboembolism), length of hospital stay, and length of surgery. We used Review Manager version 5.4.1 (The Cochrane Collaboration, London, UK) and Stata version 18 (StataCorp LLC, College Station, TX, USA) to perform the meta-analyses. Dichotomous data were analyzed as odds ratios (ORs) with 95% confidence intervals (CIs), and continuous data are presented as mean differences (MDs) and 95% CIs. For length of hospital stay, some articles provided the median and quartiles or ranges. In these cases, we detected the skewness of data using the method proposed by Shi et al.20 If no skewness was seen in the data, we converted the data into means and standard deviations according to the method proposed by Shi et al.,21,22 Luo et al.,23 and Wan et al.24 For log-normally distributed variables like length of surgery,25,26 we included only those that reported mean with SD in the meta-analysis. The P value with the Cochrane Q test was tested to estimate the extent of statistical heterogeneity among the studies. A random effect model was performed because of differences in patients and interventions.15 For analyses with few studies (< 10), a DerSimonian–Laird test combined with Knapp–Hartung adjustment was performed.27,28 Subgroup analysis was conducted according to the study design. We conducted sensitivity analyses by leaving out one study at a time to investigate whether the removal of a particular article had any effect on the overall results to assess the stability.14

Results

Study selection

We identified 1,102 studies through the Web of Science, MEDLINE, Embase, and Cochrane Library. After checking for duplicates, 312 studies were excluded. A total of 697 articles were eliminated after screening the titles and abstracts. Ninety-three additional studies were assessed by perusing full texts; however, 61 of these studies were excluded for not meeting the inclusion criteria. A further 23 articles that met the inclusion criteria from references of included articles and other systematic reviews were included. Overall, 55 studies were included in this systematic review (Fig. 1).

Fig. 1
figure 1

Flow diagram of study selection

Study characteristics

Twelve RCTs and 43 observational studies were included in the systematic review.5,6,7,8,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79 Ten of the studies included patients of any age, while the majority of the other studies focused on elderly individuals. The largest study included 124,960 patients, and the smallest one included 30 patients. The 30-day mortality was the outcome most frequently evaluated in the included studies. The descriptive characteristics of the included studies are shown in the Table.

Table Characteristics of included studies

Risk of bias assessment

The Cochrane Risk of Bias tool (The Cochrane Collaboration, London, UK) was used to evaluate the quality of RCTs (Fig. 2A and B). Only one trial was considered as a high risk of selection bias, and the others were at low or unclear risk. Considering the blinding of the patients is challenging when comparing RA and GA, we judged performance bias to be at high risk of bias for eight studies and five studies were found to have a high risk of detection bias. For attrition bias, we classified two studies as having a high risk of bias. Two studies were deemed to have a high risk of reporting bias, and three were considered to have a high risk of other bias. Observational studies with NOS scores ≥ 6 were regarded as high quality and all studies that we selected met the criteria for high quality (Electronic Supplementary Material [ESM] eAppendix 1). Publication bias of the outcomes was visualized by a funnel plot (ESM eAppendix 2).

Fig. 2
figure 2

(A) Risk of bias summary; (B) risk of bias graph

Primary outcome

30-day mortality

Thirty-one studies, six of which were RCTs8,40,44,48,57,66 and 26 of which were observational studies29,34,35,36,37,39,41,43,45,46,50,54,55,56,58,62,64,65,67,68,70,71,74,75,77,79 compared the 30-day mortality between the RA and GA groups. One hundred and eighty-nine thousand, nine hundred and twenty-three patients were included in the RA group, and 242,206 patients were included in the GA group (Fig. 3A). After Knapp–Hartung adjustment, meta-analysis showed no significant difference between the RA and GA groups in six RCTs (OR, 0.88; 95% CI, 0.44 to 1.74; I2 = 0%). Twenty-six observational studies also reported no significant difference (OR, 0.95; 95% CI, 0.88 to 1.03; I2 = 63%).

Fig. 3
figure 3figure 3

Meta-analysis of 30-day mortality (A), in-hospital mortality (B), and 90-day mortality (C) in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Secondary outcomes

In-hospital mortality

Eighteen studies reported different in-hospital mortality rates in the RA group and GA group. Three RCTs7,30,48 and 15 observational studies5,6,29,31,32,33,35,36,38,43,50,58,63,65,69 were included. One hundred and six thousand, one hundred and twenty patients in the RA group and 185,292 patients in the GA group were involved (Fig. 3B). Because of the small sample size, the RCT results showed a wide CI (OR, 1.96; 95% CI, 0.02 to 171.66; I2 = 71%). Fifteen observational studies showed a lower in-hospital mortality in the RA group (OR, 0.84; 95% CI, 0.77 to 0.91; I2 = 17%).

90-day mortality

The 90-day mortality was examined by six studies, two of which were RCTs40,48 and four of which were observational studies6,41,46,55 (Fig. 3C). A total of 16,043 patients received RA and 19,850 patients received GA. After Knapp–Hartung adjustment, a meta-analysis of RCTs indicated a wide CI in 90-day mortality between the RA group and GA group (OR, 1.08; 95% CI, 0.43 to 2.72; I2 = 0%). Observational studies also reported no significant difference (OR, 0.99; 95% CI, 0.77 to 1.28; I2 = 69%).

Delirium

Sixteen studies, including seven RCTs7,8,40,44,48,49,51 and ten observational studies,35,42,45,53,54,55,60,63,68,79 analyzed the incidence of delirium after GA and RA in patients with hip fracture (Fig. 4). There were 99,647 patients in the GA group and 89,171 patients in the RA group. After Knapp–Hartung adjustment, a meta-analysis of seven RCTs could not find a difference (OR, 0.87; 95% CI, 0.36 to 2.12; I2 = 67%). Ten observational studies indicated that RA significantly reduced postoperative delirium (OR, 0.89; 95% CI, 0.76 to 1.03; I2 = 82%).

Fig. 4
figure 4

Meta-analysis of postoperative delirium in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Pneumonia

Pneumonia incidence was assessed by 22 studies with 56,197 patients in the RA group and 79,744 patients in the GA group (Fig. 5). From the merging data of RCTs,7,8,30,40,48,51 no significant difference between the two groups was found in the incidence of pneumonia after Knapp–Hartung adjustment (OR, 0.65; 95% CI, 0.42 to 1.03; I2 = 0%). Sixteen observational studies6,29,31,35,37,39,50,52,53,55,56,63,65,68,70,75 revealed no significant difference between the RA group and GA group (OR, 1.03; 95% CI, 0.93 to 1.15; I2 = 40%).

Fig. 5
figure 5

Meta-analysis of pneumonia in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Myocardial infarction

Five RCTs7,8,30,40,66 and 16 observational studies6,29,31,35,37,38,39,45,47,50,56,65,68,70,75,79 assessed the incidence of myocardial infarction (Fig. 6). This analysis contained 135,682 patients in the RA group and 159,989 patients in the GA group. After Knapp–Hartung adjustment, meta-analysis of RCTs indicated no significant difference in myocardial infarction between the two groups (OR, 0.83; 95% CI, 0.47 to 1.44; I2 = 0%). Observational studies also showed no statistically significant difference between the two groups (OR, 0.95; 95% CI, 0.89 to 1.02; I2 = 0%).

Fig. 6
figure 6

Meta-analysis of myocardial infarction in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Intraoperative hypotension

Nine studies provided the outcome of intraoperative hypotension rate, including four RCTs8,40,57,78 and five observational studies29,43,50,59,64 (Fig. 7). Four thousand, five hundred and seventy-six patients were included in the RA group and 7,550 patients in the GA group. After Knapp–Hartung adjustment, the meta-analysis of RCTs showed a significant reduction in intraoperative hypotension with RA (OR, 0.52; 95% CI, 0.38 to 0.72; I2 = 0%). Nevertheless, five observational studies indicated no significant difference between the two groups (OR, 0.87; 95% CI, 0.41 to 1.84; I2 = 88%).

Fig. 7
figure 7

Meta-analysis of intraoperative hypotension in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Venous thromboembolism

Fourteen studies that examined the incidence of deep venous thromboembolism and/or pulmonary thromboembolism were included in this meta-analysis (Fig. 8), including three RCTs30,40,51 and 12 observational studies.6,33,35,37,39,47,53,56,69,70,75,79 Seventy-three thousand, nine hundred and thirty-six patients belonged to the RA group, and 87,481 patients belonged to GA group. After Knapp–Hartung adjustment, the pooled analysis of RCTs revealed a wide CI between two groups in postoperative venous thromboembolism (OR, 1.19; 95% CI, 0.18 to 8.03; I2 = 0%). For observational studies, no significant difference was found between the two groups (OR, 0.77; 95% CI, 0.58 to 1.02; I2 = 51%).

Fig. 8
figure 8

Meta-analysis of venous thromboembolism in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Length of hospital stay (in days)

The length of hospital stay was recorded in 26 studies, seven of which were RCTs8,30,40,44,48,51,57 and 20 of which were observational studies32,33,34,35,37,38,43,45,46,50,54,55,59,63,64,65,70,73,74,79 (Fig. 9). There were 152,476 in the RA group and 195,467 participants in the GA group. No significant difference was found between two groups of RCTs (MD, 0.22 days; 95% CI, −0.22 to 0.66; I2 = 54%). For observational studies, there was no significant difference between the RA and GA groups (MD, −0.36 day; 95% CI, −0.87 to 0.14; I2 = 100%).

Fig. 9
figure 9

Meta-analysis of length of hospital stay in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Duration of surgery (in minutes)

A total of four RCTs30,44,49,76 and eight observational studies33,35,37,39,53,61,65,79 were included in the analysis of length of surgery (Fig. 10). After Knapp–Hartung adjustment, there was no significant difference in the duration of surgery between the two groups reported in RCTs (MD, −8.60 min; 95% CI, −20.48 to 3.28; I2 = 54%). The observational studies showed a slight reduction in the length of surgery in the RA group (MD, −6.68 min; 95% CI, −11.30, −2.06; I2 = 77%).

Fig. 10
figure 10

Meta-analysis of length of surgery in patients receiving general anesthesia versus regional anesthesia

CI = confidence interval; GA = general anesthesia; RA = regional anesthesia

Sensitivity analysis

Sensitivity analyses showed that most results were stable and consistent with the main analysis. Nevertheless, when we removed the study of Morgan et al.47 from the analysis of venous thromboembolism, the OR value changed distinctly (Fig. 11). Therefore, the results of venous thromboembolism were unstable.

Fig. 11
figure 11

Sensitivity analysis of venous thromboembolism

CI = confidence interval

Discussion

The present systematic review and meta-analysis aimed to compare the postoperative 30-day mortality and other perioperative outcomes of RA vs GA in patients undergoing surgery for hip fracture. Our study was a large systematic review with 55 studies. Nevertheless, some observational studies we included only reported unadjusted data. Pooling all studies without consideration of adjustment was not considered methodologically appropriate. Therefore, we considered the results of the meta-analyses of RCTs to be reliable. The findings of this study suggested that RA might have advantages over GA in terms of intraoperative hypotension. Nevertheless, intraoperative hypotension per se is a surrogate outcome, and there were no significant differences between the two groups regarding 30-day mortality, 90-day mortality, in-hospital mortality, postoperative delirium, pneumonia, myocardial infarction, venous thromboembolism, or length of hospital stay.

At present, there is no consensus in the literature regarding whether RA can reduce mortality in patients after hip fracture surgery. Our meta-analysis showed no significant difference in 30-day mortality between the RA and GA groups. These findings are consistent with previous studies that have reported similar outcomes between the two groups.9,10,11,12,13,14,15,80,81,82 Recently, the REGAIN and RAGA trials also indicated that RA did not reduce 30-day or 60-day mortality in patients after hip fracture surgery.7,8 Due to the small sample size and wide CI, our current data are insufficient to know the effect of RA vs GA on in-hospital mortality (OR, 1.96; 95% CI, 0.02 to 171.66; I2 = 72%) and 90-day mortality (OR, 1.08; 95% CI, 0.43 to 2.72; I2 = 0%). Previous meta-analyses have found differences in in-hospital mortality between the RA and GA groups.9,14,82 A large propensity score-matched study that evaluated 52,044 paired patients indicated that those undergoing RA had a significantly lower incidence of in-hospital mortality.38 A meta-analysis including only two RCTs could not find a significant difference between the two groups.13 The sample size in RCTs may not be large enough to show a difference in mortality endpoints between GA and RA.

Several previous meta-analyses reported 30-day mortality, but few reported 90-day mortality. Nevertheless, as reported, the mortality after hip fractures remained high for several months and up to a year.83 In our meta-analysis, we did not find a difference in 90-day mortality between the RA and GA groups. A meta-analysis included only three articles and suggested that those receiving RA had a lower 90-day mortality.82 Desai et al. retrospectively identified 16,695 patients, indicating that RA was associated with a lower likelihood of overall 90-day mortality.6 Nevertheless, the difference was not significant from hospital discharge to 90 days postoperatively. In a large propensity score-matched cohort, researchers found no significant difference in 90-day mortality between RA and GA groups after adjusting for confounding factors.46

In the context of patients undergoing surgical intervention for hip fractures, no significant association was found between the use of RA and short- and long-term mortality. The recent RAGA trial8 and previous meta-analyses deemed no significant difference in terms of postoperative delirium between the two groups.11,12,14,15 In our meta-analysis, we found similar results. It is possible that the sample size of RCTs was not large enough to detect a significant difference in this outcome. The mechanism behind postoperative delirium is complex and multifactorial, and age, sex, medical illness, and biochemical abnormalities are considered to be the risk factors for postoperative delirium.84 The choice of anesthesia may play a role in its development, and this finding supports the use of RA in patients undergoing hip fracture surgery to reduce the risk of postoperative delirium. The confusion assessment method (CAM) and confusion assessment method for the intensive care unit (CAM-ICU) were the most common tools used to diagnose delirium.85 There is no consensus between the studies regarding which tool should be the gold standard.86 Among the included studies, different methods were used to evaluate delirium, which may have led to the heterogeneity of the results.

We also investigated the incidence of pneumonia, myocardial infarction, intraoperative hypotension, and venous thromboembolism between the GA and RA groups. We found that there was no significant difference between the two groups in the incidence of postoperative pneumonia, myocardial infarction, and venous thromboembolism. This is consistent with the conclusions of two recent meta-analyses, which only included RCTs.13,87

The present meta-analysis revealed a significantly higher incidence of intraoperative hypotension in the GA group. This was similar to the results of most included studies.8,50,57,78 Recently, two reviews that included only RCTs suggested no significant difference in the incidence of intraoperative hypotension between the two anesthesia techniques.12,13 This might be related to the fact that the two reviews analyzed limited research.

The meta-analysis of the incidence of venous thromboembolism indicated a significant difference between the two groups. Nevertheless, the CI was too wide to know the effect of RA vs GA on the outcome. The sensitivity analysis also showed that the meta-analysis results were not robust and must be interpreted cautiously.

Similar to the results of some meta-analyses,10,11,12 our meta-analysis found no significant difference in the length of hospital and length of surgery between the two groups. Two previous reviews considered a longer hospital stay in the GA group than in the RA anesthesia group.14,15 In our review, we included more studies to analyze length of hospital stay, making it more representative.

Strengths and limitations

We systematically analyzed short-term and long-term mortality after hip fracture surgery under two different anesthesia techniques. In addition, we included several new RCTs in our meta-analysis.

Our current study also has the following potential limitations, which were primarily inherent: 1) among eligible studies, some outcome definitions and time points are not precisely the same; 2) the small sample size and wide CI make it insufficient to know the effect of RA vs GA on some outcomes; 3) the sample size of the included studies varied widely; 4) like in all meta-analyses, publication bias was an inevitable flaw; 5) the inclusion of both RCTs and observational studies may have led to confounding and other bias, and may have magnified the problems seen in observational studies; 6) subgroup analysis by ASA Physical Status classifications or age of patients could not be performed to make the results more reliable because data acquisition was challenging; and 7) research88 has indicated that the surgeon may be an important covariate for observational studies, but this was not studied as a covariate in the included studies. Greater attention should be paid to the role of surgeons as covariates affecting outcomes in future studies.

Conclusion

In conclusion, RA may have advantages over GA in reducing intraoperative hypotension. Our findings indicate no difference in in-hospital mortality, 30-day mortality, 90-day mortality, postoperative delirium, pneumonia, myocardial infarction, venous thromboembolism, length of hospital stay, and length of surgery between RA and GA for hip fracture surgery.