Introduction

The frequency of cervical disc disease (CDD) increases with age, with over 85 % of the population aged over 60 expected to have severe degeneration of at least one cervical level [1]. Additionally, one study observed symptomatic CDD in 25 % of young adults [2]. Multi-level cervical disc disease (mCDD) is a condition most commonly seen in the elderly that can present with neurologic deficit and impact on quality of life [3].

Anterior cervical discectomy and fusion (ACDF) has been the gold standard for treating mCDD for many years [4]. Although ACDF preserves neurological function of the spinal cord effectively, this approach has been associated with significant decrease in range of motion (ROM) and increase in long-term complications such as adjacent segment disease [5]. As a result, a variety of alternative motion-sparing surgical procedures have been developed [69]. These include artificial disc replacement (ADR) by arthroplasty and anterior cervical corpectomy and fusion (ACCF). In the context of multi-level surgery, ADR [10, 11] has been proven to be beneficial in terms of cervical spine motion preservation after surgery and ACCF [12] is associated with high fusion rates.

When a surgical approach consists of elements of ACDF, ADR and ACCF in varying proportions, it can be considered as a hybrid of these approaches, termed hybrid surgery (HS). HS is seen as a novel option in treating mCDD where not all disease levels may meet acceptable criteria for one particular approach [13]. The use of HS in the treatment of mCCD is relatively novel and thus its multi-level effectiveness is not thoroughly established. This is because the different approaches that constitute HS for mCDD may each possess unique indication criteria, exclusive to the other HS components. Rather than to challenge the indications for the different elements that constituted the HS, the aim of this study was to compare the outcomes of mCCD surgery treated by novel HS, a collection of elements from multiple approaches, to that of traditional ACDF, a single, controlled approach.

Methods

Literature search strategy

The present review was conducted according to PRISMA guidelines and recommendations [14, 15]. Electronic searches were performed using Ovid Medline, PubMed, Cochrane Central Register of Controlled Trials (CCTR), Cochrane Database of Systematic Reviews (CDSR), ACP Journal Club and Database of Abstracts of Review of Effectiveness (DARE) from their dates of inception to January 20 [1] 6. To achieve maximum sensitivity of the search strategy and identify all studies, we combined the terms: “hybrid”, “replacement or arthroplasty”, “fusion” and “cervical” as either keywords or MeSH terms. The reference lists of all retrieved articles were reviewed for further identification of potentially relevant studies. All identified articles were systematically assessed using the inclusion and exclusion criteria.

Selection criteria

Eligible comparative studies for the present systematic review and meta-analysis included those in which: (1) study designs were comparative studies, either RCTs or non-RCTs, (2) population including patients with multi-level cervical degenerative disc disease, (3) intervention group hybrid surgery (ADR with another approach), (4) control group being ACDF, (5) outcomes reporting at least one of the following measurements: surgical outcomes, intraoperative blood loss; operative time; length of stay; complications; and/or preoperaitve and postoperative functional outcomes, C2–C7 range of motion (ROM); subjective pain perception Visual Analogue Scale (VAS); neck disability index (NDI); superior and/or inferior adjacent segment ROM; and functional Japanese Orthopaedic Association (JOA) scale. Measurements were standardised across studies after analysing for heterogeneity and reported methodology. Excluded studies included those where patients had a neoplastic indication (metastasis or myeloma), infection, trauma, spondylolisthesis, or serious osteoporosis. When institutions published duplicate studies with accumulating numbers of patients or increased lengths of follow-up, only the most complete reports were included for quantitative assessment at each time interval. All publications were limited to those involving human subjects and in the English language. Abstracts, case reports, conference presentations, editorials and expert opinions were excluded. Review articles were omitted because of potential publication bias and duplication of results.

Data extraction and critical appraisal

All data were extracted from article texts, tables and figures. Two investigators independently reviewed each retrieved article (K.P., L.Z.) Discrepancies between the two reviewers were resolved by discussion and consensus with a third reviewer (V.M.L.).

Because quality scoring is controversial in meta-analyses of observational studies, two reviewers independently appraised each article included in our analysis according to a critical review checklist of the Dutch Cochrane Centre proposed by MOOSE [16]. The key points of this checklist include: (1) clear definition of study population; (2) clear definition of outcomes and outcome assessment; (3) independent assessment of outcome parameters; (4) sufficient duration of follow-up; (5) no selective loss during follow-up; and (6) important confounders and prognostic factors identified. Discrepancies between the two reviewers were resolved by discussion and consensus.

Statistical analysis

The odds ratio (OR) or weighted mean difference (WMD) was used as a summary statistic. In the present study, both fixed- and random-effect models were tested. In the fixed-effects model, it was assumed that treatment effect in each study was the same, whereas in a random-effects model, it was assumed that there were variations between studies. χ2 tests were used to study heterogeneity between trials. I 2 statistic was used to estimate the percentage of total variation across studies, owing to heterogeneity rather than chance, with values greater than 50 % considered as substantial heterogeneity. I 2 can be calculated as: I 2 = 100 % × (Q − df)/Q, with Q defined as Cochrane’s heterogeneity statistics and df defined as degree of freedom. If there was substantial heterogeneity, the possible clinical and methodological reasons for this were explored qualitatively. In the present meta-analysis, the results using the random-effects model were presented to take into account the possible clinical diversity and methodological variation between studies. Specific analyses considering confounding factors were not possible because raw data were not available. All P values were two-sided. All statistical analyses were conducted with Review Manager Version 5.3.3 (Cochrane Collaboration, Software Update, Oxford, United Kingdom).

Results

Literature search

A total of 521 references were identified through six electronic database searches (Fig. 1). After exclusion of duplicate or irrelevant references, 505 potentially relevant articles were retrieved. After detailed evaluation of these articles, 27 studies remained for assessment. Manual search of reference lists yielded one new study. After applying the selection criteria, eight comparative studies were selected for analysis. The study characteristics of these trials are summarised in Table 1. Baseline patient characteristics and risk factors are summarised in Table 2. The quality assessment of each included study is presented in Table 3.

Fig. 1
figure 1

PRISMA flowchart for systematic review and meta-analysis comparing hybrid surgery vs anterior cervical discectomy and fusion for multi-level cervical disc disease

Table 1 Study characteristics. HS, hybrid surgery; ACDF, anterior cervical discectomy and fusion; P, prospective; R, retrospective; OS, observational study; RCT, randomised controlled trial; NR, not reported; n, number of patients
Table 2 Study measurements used in analysis
Table 3 Assessment of the quality of included studies by MOOSE criteria

Demographics

The included studies describe a total of 360 patients treated for mCDD, with 169 (46.9 %) and 193 (53.1 %) treated by HS and ACDF, respectively. Their demographic features are reported in Table 1. Mean age ranged from 44.2 to 53.6 years in the HS group and from 47.3 to 55.3 years in the ACDF group. Proportion of males ranged from 42.8 to 66.7 % in the HS group and 45.0 to 67.2 % in the ACDF group. Average number of levels treated was 2.46 and 2.44 in the HS and ACDF groups, respectively. There was no statistically significant difference in these features between the HS and ACDF groups.

Operative time

Data were extracted from six studies, three 2-level and three 3-level studies (Fig. 2a). Overall operative time was significantly longer in HS than ACDF (MD 41.82 min; 95 % CI 40.60–43.03; p < 0.00001) with pooled means of 135 vs 122 min, respectively. The difference was significant in 2-level studies (MD 44.30 min; 95 % CI 4306–45.55; p < 0.00001), however, not 3-level studies (p = 0.09). Subgroup difference was significant (p < 0.00001).

Fig. 2
figure 2

Forest plots comparing a operative time (mins); b intraoperative blood loss (mL); c length of stay (days) for hybrid surgery (HS) versus anterior cervical discectomy and fusion (ACDF) in terms of. SD standard deviation, IV inverse variance, CI confidence interval

Intraoperative blood loss

Data were extracted from four studies, one 2-level and three 3-level studies (Fig. 2b). Overall intraoperative blood loss was significantly lower after HS than ACDF (MD 25.53 mL; 95 % CI −36.13 to −14.93; p < 0.00001) with pooled means of 167 vs 202 millilitres, respectively. The difference was significant in the 2-level study (MD −46.00 mL; 95 % CI −62.57 to −29.43; p < 0.00001), however, not 3-level studies (p = 0.11). Subgroup difference was significant (p = 0.002).

Length of stay

Data were extracted from two studies, one 2-level and one 3-level studies (Fig. 2c). Overall length of stay was not significantly different after HS or ACDF (p = 0.24). The difference was not significant in the 2-level study (p = 0.14) or 3-level study (p = 0.59). Subgroup difference was not significant (p = 0.30).

Complications

Data were extracted from eight studies, where three 2-level and four 3-level studies observed complications (Table 2; Fig. 3). Overall complication rates were not significantly associated with either HS or ACDF (p = 0.18). Complications were more associated with HS than ACDF in 2-level (OR 2.08; 95 % CI 1.13–6.96; p = 0.03), however, this was not observed in 3-level studies (p = 0.63). Subgroup difference was not significant (p = 0.06).

Fig. 3
figure 3

Forest plot comparing complications after hybrid surgery (HS) versus anterior cervical discectomy and fusion (ACDF). M–H Mantel–Haenszel, CI confidence interval

Return to Work Time

Data were extracted from two 2-level studies (Fig. 4). Overall return to work time was significantly earlier after HS than ACDF (MD −32.01 d; 95 % CI −33.13 to −30.90; p < 0.00001) with pooled means of 27 vs 74 days, respectively.

Fig. 4
figure 4

Forest plot comparing return to work (days) after hybrid surgery (HS) vs anterior cervical discectomy and fusion (ACDF). SD standard deviation, IV inverse variance, CI confidence interval

C2–C7 range of motion (ROM)

Pre- and postoperative C2–C7 ROM measurements were extracted from seven studies, three 2-level and four 3-level studies. Overall postoperative C2–C7 measurements were significantly greater after HS than ACDF (MD, 9.22; 95 % CI 6.97–11.46; p < 0.00001) with pooled means of 41.9 vs 32.5, respectively (Fig. 5a). Further comparisons are reported in Table 4.

Fig. 5
figure 5

Forest plot comparing postoperative a C2–C7 range of motion (ROM); b visual analogue scale (VAS); c neck disability index (NDI) after hybrid surgery (HS) versus anterior cervical discectomy and fusion (ACDF) in terms of. SD standard deviation, IV inverse variance, CI confidence interval

Table 4 Complete analysis of functional outcomes C2–C7 range of motion (ROM), visual analogue scale (VAS) and neck disability index (NDI) with subgroup analysis of 2- and 3-level studies

Visual analogue scale (VAS)

Pre- and postoperative VAS measurements were extracted from six studies, three 2-level and three 3-level studies. Overall postoperative VAS measurements were not significantly different after HS than ACDF (p = 0.12, Fig. 5b). Further comparisons are reported in Table 4.

Neck disability index (NDI)

Pre- and postoperative NDI measurements were extracted from four studies, two 2-level and two 3-level studies. Overall postoperative NDI measurements were significantly lower after HS than ACDF (MD −1.33; 95 % CI −2.32 to −0.34; p = 0.008) with pooled means of 8.67 vs 10.22, respectively (Fig. 5c). Further comparisons are reported in Table 4.

Superior adjacent segment range of motion (ROM)

Pre- and postoperative superior adjacent segment ROM measurements were extracted from three studies, one 2-level and two 3-level studies. Overall postoperative superior adjacent ROM measurements were significantly lower after HS than ACDF (MD −3.75; 95 % CI −5.97 to −1.52; p = 0.0009) with pooled means of 14.77 vs 19.17, respectively (Supplementary Fig. 1a). Further comparisons are reported in Supplementary Table 1.

Inferior adjacent segment range of motion (ROM)

Pre- and postoperative inferior adjacent segment ROM measurements were extracted from three studies, one 2-level and two 3-level studies. Overall postoperative inferior adjacent ROM measurements were significantly lower after HS than ACDF (MD −3.05; 95 % CI −4.75 to −1.34; p = 0.0005) with pooled means of 13.31 vs 16.47, respectively (Supplementary Fig. 1b). Further comparisons are reported in Supplementary Table 1.

Japanese orthopaedic association (JOA) scale

Pre- and postoperative JOA measurements were extracted from four studies, one 2-level and three 3-level studies. Overall postoperative JOA measurements were significantly greater after HS than ACDF (MD 0.56; 95 % CI 0.26–0.86; p = 0.0002) with pooled means of 14.20 vs 13.80, respectively (Supplementary Fig. 1c). Further comparisons are reported in Supplementary Table 1.

Discussion

The surgical rationale behind HS in treating mCDD is to achieve the benefits of ACDF as well as those of the other approaches. Overall, HS showed greater operative time, less intraoperative blood loss and earlier return to work compared to ACDF. Length of stay and postoperative complications between HS and ACDF were not significantly different. After HS, postoperative C2–C7 ROM and functional NDI outcomes were improved, while pain VAS outcomes were comparable to ACDF. Lower postoperative superior and inferior segment ROM measurements were observed after HS.

Given the novelty of HS, it is not necessarily surprising a longer operative time than ACDF has been demonstrated. During the advent of cervical ADR, a large Food and Drugs Administration (FDA) trial [17] showed operative time compared to ACDF was significantly longer. This was attributed to the time required to learn the new technique. The current case for HS is remarkably similar given its early stage of use. Given that the operative time would include any significant amounts of teaching and clarification, as well as operating, it may explain the observation that HS involves significantly less intraoperative blood loss despite a longer operative time.

From a postoperative standpoint, it has been posited that both complication rates and length of stay would be comparable between HS and ACDF due to the use of similar surgical approaches and nerve decompression procedures [18, 19]. What is interesting is that HS patients seem to return to work earlier than ACDF patients. This may be due to greater fusion rates and reduced postoperative immobilisation with cervical instrumentation insertion during ACDF which can only be observed after discharge [5, 20].

In terms of HS and its components, it would seem that the relationship between number of levels treated with HS and clinical outcomes is not clearly linear. As a reference, Bohlman et al. reported fusion rates after ACDF at one-, two-, three- and four-level operations were 89, 73, 73 and 0 %, respectively [4]. Other studies comparing treatment for single-level cervical disc disease and mCDD showed no major clinical differences in outcomes between them in with either ACDF [21] or ADR [11] individually.

There is currently a lack within the literature of comparative outcomes between 2- and 3-level HS operations and they are mostly only inferable from analyses of individual HS components. A comparison of 2-level ACCF and 3-level ACDF showed no significant difference in outcomes [22]. Studies of ACDF only have established that a direct correlation exists between the number of construct fusion levels and the occurrence of pseudarthrosis [4, 23, 24]. Should such a tendency for complicated recovery extend beyond pseudarthrosis, it could then be associated with the subgroup differences seen in the follow-up parameters of this review. Technically ADR inclusion within HS should circumvent the concern for non-union and theoretical decrease overall pseudarthrosis rates when implemented in multi-levels [10]. Future comparative studies are needed to investigate the differences between 2- and 3-level HS outcomes more fully.

It is worth noting the difference between clinical outcomes of NDI and VAS between 2- and 3-level studies, with the former subgroup not detecting significance for either outcome while the latter subgroup detecting significance in both outcomes. It is difficult to determine the exact reason for this observation as there are many potential confounding factors. These include the nature of 3-level surgery which involves a vertebral body at both ends and any bias in operation based on the same geographical location.

Limitations

This study was able to extensively search the current literature for all examples of HS–this includes ACDF, ADR, and ACCF. However, a consequence of this inherent variation of HS components may weaken the implications derived which may be dependent on each component to different degrees. This would be overcome with larger studies. Furthermore, this study remains severely limited by the small size and number of studies that compare the clinical outcomes of HS to the more traditional ACDF. While this currently renders analysis difficult to ascertain significance in more outcomes, it does illustrate the need to continue to evaluate novel approaches such as HS to fully evaluate their applicability.

Another constraint on this study was the low quality of evidence afforded by the currently available literature. Although the challenges of designing higher quality comparative studies involving novel neurosurgical techniques must be recognised, the accuracy and precision of these studies are compromised by many factors. First, the entire cohort of 362 cases was studied over an accumulative 35-year study period, indicating a very low accrual rate per study which is reflective of the small cohort sizes as well. Second, the retrospective or ambispective nature of most studies poorly controls for selection and observation biases. And third, most studies were observational in nature which limits the validity of reported outcomes. The credibility of the evidence in the future would be strengthened by larger, perhaps multi-institutional, more prospective and randomised controlled trials.

In addition, the validity of the comparative postoperative courses following ACDF vs HS needs to be clarified with long-term studies. This is due to the long-term nature of complications such as adjacent segment disease which only manifest many years after surgery [5]. There is a paucity of comparative data in the literature for long-term follow-up studies investigating ACDF vs HS, as well as studies reporting adjacent segment disease. Only Kan et al. [25] and Kang et al. [26] reported adjacent segment disease with the former being the only paper to report nonzero outcomes for both ACDF and HS. Given it is also the reduction in these long-term complications associated with ACDF which provide impetus to explore alternative treatment strategies for mCDD, the true effectiveness of such strategies including HS can only be warranted once their effects on these long-term complications are elucidated as well.

Finally, the novel and variable nature of HS currently limits the significance of these early findings. It is the unstandardized mixture of different approaches in the HS between studies that challenges the homogeneity of the HS operation for comparative purposes. Although, the exact indications and surgical detail for each HS performed in each study is beyond the scope of this study, every effort was made to clarify the components involved. The extent to which including various HS components from study to study affects the validity of the outcomes is difficult to ascertain at this moment, due to the aforementioned limitations. Nonetheless, these findings indicate that the concept of HS is a comparable concept in mCDD treatment and future investigations into the impact of its makeup are warranted. This particular consideration serves as an important reminder that the term ‘hybrid surgery’ remains more of a theoretical concept rather than that a single, exact practice.

Conclusion

Based on the currently available literature, HS appears to be a safe and feasible alternative surgical option in the treatment of mCDD in select patients. Future large, prospective, long-term randomised controlled trials are needed to fully elucidate the effectiveness of this surgery in treating mCDD compared to the current gold standard of ACDF.