Introduction

Anterior approach to the cervical spine allows for decompression of the spinal cord and preserves neurological function when the primary compressive pathology lies anterior to the spinal cord [1]. To date, anterior cervical discectomy and fusion (ACDF) is the most accepted procedure with satisfactory clinical outcome and proven radiological fusion rates from 90 to 100 % [2, 3].

Lately, artificial disc replacement (ADR) of the cervical spine has gained popularity. Although the results of ADR are less established than that of ACDF, it has proven to be beneficial in terms of preserving motion of the cervical spine and possibly adjacent level degeneration [8, 9]. These benefits are more apparent in the context of multilevel surgeries [14, 15]. However, indications for ADR are more stringent and the procedure may be longer, leading to the possibility of increased blood loss and wound complications [10, 13, 22]. Given this trade-off, controversy still exists as to the ideal surgical technique(s) that could benefit patients with multilevel cervical disc diseases.

Hybrid surgery (HS) incorporates ACDF and ADR at different levels. The purpose is to combine the advantages in both techniques, thus showing preservation of cervical spine motion without prolonging operative duration and increasing complications. To our knowledge, the existing literature consists of a few studies looking independently at ACDF versus HS, or ADR versus HS. These studies do not allow direct comparison of the results between ACDF, HS and ADR. Therefore, the primary aim of this study was to explore the peri-operative parameters and outcome differences in all three types of surgery. The results of this study will provide grounds for a large, prospective randomised controlled trial.

Materials and methods

Seven consecutive patients who underwent HS by a single surgeon were prospectively enrolled into the study. Using the same hospital electronic database, we did a retrospective random selection of another 7 ACDF and 7 ADR patients matched to the same levels of operation as controls. Selection of appropriate controls was carried out by an independent party from a pool of computer-generated database without having prior knowledge of their outcomes. After this step, a check was performed to ensure that all patients had surgeries performed for prolapsed intervertebral discs, had no obvious degeneration at non-operated levels, and all matched patients had surgeries done within 1 year of each other. One control in the ACDF group and two in the ADR group had to be replaced with newly selected random subjects to meet the above criteria. In our study, spinal levels in which X-rays or MRI scans suggest facet joint arthropathy or hypermobility are treated with fusion. Patients with multi-level disease will receive HS if some levels require replacement and others fusion.

Following ethics board approval, prospective data including patient demographics (age, gender), presence of pre-operative myelopathy, intra-operative details (duration of operation, level of operation, types and sizes of implant, somatosensory evoked potentials (SSEP) and motor-evoked potentials (MEP), complications), post-operative details (drop in haemoglobin, length of hospitalization, and time to return to work), post-operative radiological findings (fusion, cervical spine mobility), functional scores (Visual analogue scale (VAS), Neck disability index (NDI), EQ-5D health score, and EQ-5D index) and surgery related-complications were collected.

In this study, the functional scores were obtained using self-administered questionnaires in the English language during routine follow-up visit at 2 years after the surgery. The evaluation of interbody fusion was according to Bridwell classification [4]. Cervical spine range of movement during flexion and extension was interpreted using lateral projections of the cervical spine obtained at the 2-year follow-up visit by two independent spine surgeons not directly involved in the management of the patient. Measurements were made directly on our radiographic software [Centricity Web V2.0, GE Medical Systems Information Technologies, Milwaukee, USA] using two methods. The first method employs lines placed on the superior endplate of C3 and the inferior endplate of C7 to measure global cervical spine flexion–extension, and lines placed on the superior endplate of the upper vertebra and the inferior endplate of the lower vertebra to measure flexion and extension of the ADR functional spinal unit. The second method uses lines projected from the posterior border of the dens and posterior vertebral line from C7 to measure global cervical spine flexion–extension (see Fig. 1 for method of measurement).

Fig. 1
figure 1

a Angle subtended from superior endplate of C3 and inferior endplate of C7 used to measure global cervical spine movement (method 1). b Angle subtended from posterior border of C2 dens and posterior vertebral line of C7 used to measure global cervical spine movement (method 2)

Statistical analysis was performed using StataCorp 2011. Stata Statistical Software: Release 12. College Station, TX: StataCorp LP. Categorical variables were analysed using Fisher’s exact test. Continuous variables were summarized using median and standard deviations and analysed using Mann–Whitney U test. Type I error was set at 5 %.

Technique of ACDF and ADR

All three procedures were performed using a left horizontal incision 4–9 cm long, at the projected level of surgery. Blunt dissection was performed down to the anterior border of the spine taking care not to injure the trachea and oesophagus. A spinal needle was placed in the intervertebral disc to identify the correct level using image intensifier, followed by a complete discectomy and preparation of the endplates. For ACDF, an intervertebral cage packed with bone auto graft mixed with demineralised bone matrix was placed between the vertebras to act as a spacer. For ADR, an artificial disc was inserted under fluoroscopic guidance (see Fig. 2a, b). One drain was inserted and the platysma muscle was repaired. Closure was performed using continuous vicryl suturing for the subcutaneous and dermal layer and the skin using monocryl 3–0 sutures. Post-operatively, all patients with ACDF had 1 month of aspen collar. ADR and HS had 1 month of soft collar.

Fig. 2
figure 2

Anteroposterior and lateral X-rays of the cervical spine post hybrid surgery (C5/6 ADR and C6/7 ACDF)

Results

The baseline patient characteristics were comparable in all three groups (Table 1). The median age was 51 years for HS, 48 years for ACDF and 46 years for ADR. There is no significant difference in gender ratio between the three groups of patient. Comparable number of patients displayed myelopathy features pre-operatively (4 in HS group, 3 in ACDF group, and 2 in ADR group). The levels of index surgery were matched in the three groups. In each arm of comparison, four patients underwent 2-level surgery, and three patients underwent 3-level surgery. The preoperative functional scores were similar in all groups. Minimum follow-up was for 24 months (range 24–45 months).

Table 1 Baseline patient characteristics

Disc arthroplasty implants used in the study were cobalt-chrome, UHMWPE, semi constrained disc, i.e. Prodisc-C (Synthes Spine, Paoli, PA, USA). Fusion constructs were performed using Cervios cages (Synthes spine, West Chester, PA). Iliac crest autograft was used for fusion.

The median duration of operation was shortest for ACDF (135 min) followed by HS (195 min) and ADR (197 min). This observation is statistically significant with p = 0.025 when ACDF is being compared with both HS and ADR (see Table 2). Intra-operative SSEP and MEP monitoring did not show any deterioration for all patients in the three groups. Although improvements were noticed for SSEP (3 in HS, 1 in ACDF, 2 in ADR group) and MEP (1 in ADR group), this was not statistically significant.

Table 2 Results

Post-operative haemoglobin drop, used as a surrogate marker of peri-operative blood loss showed significant difference between ACDF and ADR group (p = 0.036). The ACDF group experienced greater blood loss with a drop of post-operative haemoglobin of 1.2 g/L when compared with ADR group of 0 g/L. Drop in haemoglobin for the HS group (0.7 g/L) lies between both ACDF and ADR groups. No significant difference in blood loss was observed when the hybrid group was compared with both ACDF and ADR groups (p > 0.05).

The length of hospitalization showed statistically significant difference across the three groups. The ADR group had shortest stay of 3 days, followed by HS group of 5 days. The ACDF group required longest hospital stay of 7 days. This is likely to be directly related to donor site morbidity secondary to iliac crest bone graft harvesting.

Patients from HS group returned to work fastest (54 days) when compared with both ACDF (107 days) and ADR (73 days). Although statistical significance was not observed between HS and ADR, it was seen between HS and ACDF (p = 0.035).

At 2-year follow-up, fusion was successfully achieved in all ACDF and HS patients based on Bridwell classification [4] using plain X-rays. No patients undergoing HS or ADR have heterotopic ossification at the disc replacement levels. The three groups shared similar functional improvement at 2 years. Both global and ADR functional spinal units have similar range of movements. No patients needed revision surgery.

In our small series, only four patients demonstrated complications. Three patients (one in each group) had residual limb symptoms after the operation. This was transient in all patients and they improved 6 weeks post-operatively after several courses of outpatient physiotherapy and analgesics. No significant difference in the complication rates was noted across the three groups (see Table 3).

Table 3 Post-operative complications

One patient had dysphagia after the operation which resolved during the first follow-up visit at 2 weeks after surgery. No other complications of dura/spinal cord injury, hoarseness of voice, wound infection or implant-related failures were observed at 2-year follow-up.

Discussion

The functional spinal unit (FSU) is mobile and mechanically stable in a normal person. Although ACDF can provide neurological decompression and stability to the FSU, it has the disadvantage of rendering it immobile, resulting in adjacent segment hypermobility and accelerated degeneration [57]. However, ACDF remains the gold standard in surgical management of cervical spondylotic myelopathy and has a high level of patient satisfaction [1]. Yet, it is known that the clinical outcomes and fusion rates of instrumented cervical fusion deteriorate as the number of vertebral levels of involvement increases. With the same capability at neurological decompression, ADR is able to preserve both motion and stability at the operated level [8, 9]. This also leads to reduction in adjacent level degeneration. The benefits of ADR in multilevel surgery are thus more obvious with preservation of spinal mobility, avoidance of pseudarthosis and reduction in adjacent segment disease (ASD).

Earlier result has found arthroplasty to be a safe alternative to ACDF for single-level disease with lower reoperation rates [2, 1012]. With the newly released result of 4-year follow-up on patients with cervical arthroplasty [13], ADR is increasingly being recognized as a realistic surgical option in managing single-level disc disease between C3 and C7. Sasso et al. [13] concluded with level I evidence that the results of arthroplasty are superior to ACDF at 4 years’ follow-up. However, long-term study is needed to further determine its safety in other aspects, e.g. biology of wear debris, effects on the facet joints, ASD, device-fatigue and durability.

The evidence in multilevel ADR though promising is not as well established as its role in single level disease. There are positive level II/III evidence on the early results of multilevel arthroplasty [8, 14, 15]. In terms of reoperation rates and serious adverse events, similar results between the single-level and multilevel arthroplasty groups were found [11]. Moreover, several studies have shown the duration of operation for ADR to be much longer than ACDF [10, 13, 22]. At the same time, the study by Murrey et al. [10] showed no significant difference in the length of hospital stay between Prodisc-C and ACDF.

When selecting patients for HS, several factors may dissuade surgeons to perform ADR over fusion. These include degenerative diseases of the functional spinal unit that may render motion at that level unpredictable—facet joint arthropathy and spinal hypermobility. Since the spondylotic spine is often associated with multiple-level disease, and not all diseased levels may meet acceptable criteria for ADR [16], some surgeons turned to HS as an alternative to multi-level ACDF or ADR [17]. The need to show results of HS in carefully selected patients is important as it gives the surgeon the armamentarium in the treatment of degenerative cervical spinal conditions.

By simply combining ACDF and ADR in HS, the main objective lies at restoring or maintaining motion where appropriate, or favouring bony fusion when indicated by the extent of the degenerative changes and hypermobility [18, 19]. Theoretically, a HS should reach a balance between both ACDF and ADR in terms of both intra-operative and post-operative results. The existing literature either compares ACDF with HS or ADR with HS, thus causing difficulty when drawing conclusions as to where HS stands between ACDF and ADR. To our best knowledge, this is the first study that directly compared the three different techniques for multilevel cervical disc disease. By standardizing multiple factors including surgeon, technique and the protocol of treatment for our patients, we therefore provide a more direct and valid comparison of all three groups in a single-surgeon series performed by a surgeon skilled in both ADR and ACDF. The success of this pilot study will push off larger studies in the future that directly compare these three surgical procedures.

Our study has demonstrated that hybrid construct is a feasible alternative to ADR and ACDF in managing multilevel disc disease in carefully selected patients. In many aspects, the results of HS were between ACDF and ADR. In terms of duration of surgery, HS was similar to multilevel ADR. In terms of blood loss and duration of hospitalization, the results of HS were a fine balance between ACDF and ADR.

With regard to 2-year follow-up, our study showed no significant differences in functional outcome and cervical spine range of movement. Shin et al. [17] found HS to be superior to double-level ACDF by having a better functional outcome and less adjacent range of motion (ROM). Favourable results have also been shown in biomechanical analyses. Cho et al. [20] found that HS produced combined ROM at the operative levels comparable to the intact spine. Similar conclusions were also drawn by Lee et al.’s [21] cadaveric biomechanical study.

However, we noted earlier return to work in the HS group, as compared with both ADR and ACDF groups. This finding was not shown in earlier reported studies. The superiority of HS over ACDF and ADR could not be clearly explained, especially since HS would also involve iliac crest bone grafting similar in ACDF. There is a possibility that patients who received HS truly benefited from the advantages of both techniques. The inability of functional scores to demonstrate similar findings could be secondary to a small sample size. The current literature only compares ACDF with ADR. Mummaneni et al. [2] showed that patients with ADR returned to work in significantly fewer days (range 14–16 days) than patients treated with ACDF. Riew et al. [23] showed that at 6 weeks, significantly more patients (49.2 %) returned to work in the arthroplasty group than fusion group (39.4 %). No statistically significant differences in the percentage of patients who returned to work at 24 and 48 months were found in other studies [10, 13, 24].

With regard to complications, McAfee et al. [25] demonstrated a lower incidence of dysphagia with ADR in single level disease in their study of 251 patients. Our study showed only one patient in the HS group who developed dysphagia post-surgery. Although the literature showed heterotrophic ossification ranging from 50 to 70 % with Prodisc-C up to a minimum of 1-year follow-up, no patients in our study who underwent ADR or HS developed heterotopic ossification [2628].

Finally, we recognize that this pilot study is limited by a relatively small study population and may not be powered to detect smaller differences. As the 95 % confidence interval estimate of median is not accurate, it should be interpreted carefully and be used to generate hypothesis only. Nevertheless, this study has some strengths being a single-surgeon series that uses a standardized treatment protocol for all patients. This allows valid comparison among all three groups. Given our findings that HS appears comparable to ACDF and ADR in terms of safety and feasibility, and superior in terms of earlier return to work, HS seems an attractive option in carefully selected patients. This should be further explored in large, randomised controlled trials.