Introduction

Anterior cervical discectomy and interbody fusion (ACDF) remains the gold standard for treatment for degenerative spine conditions in western countries. Aside from decompression of the spinal cord and nerve roots, the aim of surgery is to achieve fusion of index segments and correction of sagittal cervical alignment. The assessments of fusion and alignment, however, are not standardized among professionals and there is a variety of measurement methods to evaluate these aspects of radiographic outcome [1, 23]. In general, plain radiographs in anterior-posterior as well as lateral projections with additional dynamic imaging are utilized for most measurement methods. Subsidence, fusion, angle of fused segments, global cervical lordosis and adjacent disc mid-height are the most commonly reported variables in studies concerning radiographic outcome following ACDF among others.

Aim of this study was to compare various proposed measurement methods for fusion, alignment and subsidence and correlate these with clinical outcome.

Material and methods

Patient population and procedure

We retrospectively analyzed a consecutive cohort of patients who underwent one- or two-level ACDF for degenerative cervical spine disease between 2007 and 2013. The patient cohort was selected to include those with predominant neck pain attributable to osteochondrotic degeneration of the intervertebral discs, facet joint degeneration and degeneration of the osteoligamentous apparatus, or a combination of these. Patients undergoing ACDF for clear radiculopathy without neck pain or long-standing neck pain exceeding a duration of 12 months were not included in the analysis.

Evidence of tumor, trauma and infection and patients with previous surgery on the cervical spine were not eligible. For the ACDF, surgical approach and preparation were done in the standard fashion as has been extensively described in the literature [7, 30, 39]. After discectomy, we implanted a PEEK cage (NUBIC® Cage, Signus Medizintechnik GmbH, Alzenau, Germany) packed with demineralized bone matrix in 124 patients (59.6%) and a stand-alone cage in 84 patients (40.4%; Table 1).

Table 1 Baseline characteristics of retrospective cohort. No., number; DBM, demineralized bone matrix

We have already published our results focusing on clinical outcome following ACDF with empty PEEK cages in comparison with PEEK cages filled with demineralized bone matrix [36, 37].

In all patients, visual analogue scale (VAS) scores for neck pain were polled prior to surgery and on final follow-up.

Radiographic outcome

Regional sagittal alignment

Patients received a plain radiograph imaging on the first postoperative day and on final follow-up with additional flexion and extension radiographs. Sagittal alignment measurements were assessed by three techniques. First, we calculated Cobb’s angle between the endplates of C2 and C7, as depicted in Fig. 1 [8, 12, 35].

Fig. 1
figure 1

Assessment of Cobb angle in the fused segment during flexion (a) and extension (b)

Secondly, we used the Gore method, also called Jackson physiological stress line method (JPSLM), which aligns tangents to the posterior walls of C2 and C7 each for their respective intersecting angle (Fig. 2) [1, 19, 22, 25, 34].

Fig. 2
figure 2

Assessment of sagittal alignment via the Gore method. Angle of tangent lines to C2 and C7 designated by θ

The Laing method provides simple assessment of alignment of the cervical spine by connecting a line in parallel to the posterior walls of C2 and C7 (Fig. 3) [28]. Kyphotic and straight alignments of the cervical spine were interpreted as unfavorable outcome. We subtracted the angle of the final follow-up compared with the first postoperative radiographs to distinguish two groups. The group with a “negative” alignment (designated by a minus sign) was allotted to the “deterioration” group. Patients that gained in lordosis or did not experience change in alignment were allotted to the “stable” group.

Fig. 3
figure 3

Laing method for the assessment of sagittal alignment on plain radiographs in lateral projection

Fusion

To determine the fusion rate and identify pseudarthrosis after ACDF, we used the three most commonly reported measurements obtained from flexion-extension radiographs (Table 2): the amount of angular motion in the fused segment derived from the difference of local Cobb angles (Fig. 1) and the distance between the tips of the spinous processes as well as the distance between their bases in the operated segment(s). A difference less than 2 mm between these distances in flexion and extension radiographs indicates radiographic fusion (Fig. 4). The presence of bridging bone was detected in lateral radiographs, also indicating bony fusion (Fig. 5) [38].

Table 2 Criteria for bony fusion via separate radiographic assessments
Fig. 4
figure 4

Assessment of distances between spinous processes in the fused segment during flexion (a) and extension (b)

Fig. 5
figure 5

Assessment of presence of bridging bone in the operated segment. Absent intervertebral fusion (a) and full bony fusion (b)

The difference of distances between the spinous processes on dynamic radiographs was indicative of segmental fusion when it amounted to less than 2 mm, as recommended by the joint guidelines committee of the AANS/CNS [5, 23]. For the Cobb measurements, several studies define a rather strict cut-off with angles less than 2° advocating fusion and thus radiographic success, whereas others accept a threshold of 4° [9, 21, 23, 42]. We assessed both definitions independently for correlation with clinical outcome.

Subsidence

Ventral and dorsal segmental height reduction was measured and compared between first postoperative imaging and final follow-up imaging; height reduction of more than 3 mm ventrally or dorsally was interpreted as subsidence [15, 27]. In addition, the Mochida method was used with two different thresholds of height loss of 10 and 20%, respectively (Fig. 6) [27, 31].

Fig. 6
figure 6

Assessment of subsidence after fusion via direct measurement of ventral and dorsal segment (a) and the Mochida method (b). Δ, difference between pre- and postoperative; Pre, preoperative

For the aforementioned Mochida method, the ratio between the vertebral body height and the adjacent disc height is calculated through the following sequence upon studying the plain radiograph: First, diagonals between the corners of two neighbouring vertebral bodies are constructed. A straight line through the hereby produced intersections allows for measurement of vertebral and disc heights. A quota of vertebral body height divided by disc height can now be used for comparison on follow-up (Fig. 6).

Statistical analysis

Statistics were calculated with IBM SPSS Statistics version 23. Values of α below 0.05 were indicative of a statistically significant difference between mean values with a 95% confidence interval.

Results

Study population

Of 218 patients included initially, 10 patients were excluded post hoc because of incomplete postoperative imaging. All other patients had received pre- and postoperative radiographs in anterior-posterior and lateral projections.

A hundred and one women (48.6%) and 107 men (51.4%) were operated on. Of these, a single segment procedure was done in 123 (59.1%) and two segments in 85 (40.9%). The median follow-up was 30 months (range 11–61; Table 1). Data on VAS scores preoperatively and on the final follow-up was complete in 176 cases (84.6%). The preoperative mean value for neck pain was 6.45 (± 2.87). On final follow-up, the mean VAS score amounted to 2.30 (± 2.57), improving by a mean 4.15 across the entire cohort.

Sagittal alignment

Assessments of the Gore and Cobb angles were possible in all but one patient (0.5%) where effacement of C7 margins due to overlying soft tissue precluded radiographic evaluation. The absolute difference between measurements by Gore and Cobb amounted to 1.03° on the first postoperative day and 0.72° on final follow-up. The Pearson coefficient showed statistically significant and strong correlation between the Gore and Cobb measurements on the first postoperative day (r = 0.856, p < 0.001) as well as on last follow-up (r = 0.828, p < 0.001).

In Table 3, information about sagittal alignment of the cervical spine on final follow-up is summarized. In general, measurements of alignment corresponded well between Gore and Cobb methods, but deviated for the Laing method.

Table 3 Rates of sagittal alignments of retrospective cohort as measured by three separate methods

To determine the correlation between clinical outcome and radiographic alignment on follow-up, we used linear regression with neck pain as a dependent variable (Table 4). None of the measurements showed significant correlation of neck pain with alignment as judged by the Gore and Cobb angles. Distortion of sagittal alignment was seen equally in both groups of improved and deteriorated neck pain (Table 5). For the Laing method, we found significant inverse correlation between postoperative changes of alignment and neck pain (chi-square; p = 0.049). Patients who developed radiographic deterioration of alignment postoperatively improved on the VAS more frequently than patients with stable or improved cervical alignment.

Table 4 Correlation of sagittal alignment with clinical outcome
Table 5 Clinical outcome in relation to change of sagittal alignment as assessed by the Laing method

Fusion

Two hundred and two cases were evaluated for fusion; in the remaining six cases, suboptimal quality of dynamic imaging rendered any measurements inconclusive.

The Cobb angles as well as distances between the tips and bases of the spinous processes of the fused segments were distributed normally according to Shapiro-Wilk testing (W = 0.797 p = 0.000; W = 0.796, p < 0.001; W = 0.698, p < 0.000).

The highest fusion rate of 85.6% was obtained by evaluation of bridging bone in lateral plain radiographs (Fig. 7). Only a third of patients had bony fusion with present bridging bone in addition to a Cobb angle of 2° or less. Evaluating fusion by measurement of interspinous distance close to the vertebral body yielded higher fusion rates than at the tips of the interspinous processes (Fig. 7).

Fig. 7
figure 7

Fusion rates in percentages according to various assessment methods

Linear regression was employed to evaluate correlation between the method of fusion assessment and its predictive value regarding intensity of neck pain. Patients that were assigned to the fused group by definition of a Cobb angle of 4° or less showed a significant positive correlation (r = 0.04) with clinical improvement, defined as an increase of one or more points on the VAS (p = 0.035; Tables 6 and 7).

Table 6 Correlation of fusion rates with clinical outcome
Table 7 Correlation coefficients of fusion assessment measures with clinical outcome. Italic facetype indicates statistical significance

None of the other measurements showed any significant correlation between neck pain and fusion rates. The proportion of patients who experienced clinical satisfaction after surgery was 80%, irrespective of fusion rate as evaluated by any of the three methods (Table 7).

Subsidence

Subsidence was evaluated in 201 cases. Its occurrence rate varied between 8 and 51% depending on the applied method (Table 8).

Table 8 Rates of subsidence as measured via different procedures. Cut-off thresholds for Mochida method predefined as more than 10% and 20% height reduction

In cases of subsidence, loss of height in the dorsal part of the vertebra amounted to less than in the ventral part, resulting in bias towards a higher subsidence rate for the ventral method. The Mochida method had the lowest subsidence rate when a cut-off of more than 20% height loss was predefined (Table 8). Patients with subsidence in the follow-up did not report higher scores on the VAS for neck pain and no significant correlation was found (Tables 9 and 10).

Table 9 Correlation of subsidence rates with clinical outcome
Table 10 Correlation coefficient of subsidence rates with clinical outcome

For the correlation between method of assessment and clinical outcome, no statistically significant relations for either ventral (p = 0.788) or dorsal (p = 0.445) measurements were found. Chi-square testing revealed no significant differences when employing the Mochida method (10%: p = 0.294; 20%: p = 0.270).

Discussion

Regional sagittal alignment

For comparison of postoperative regional sagittal alignment to preoperative imaging, one may find several different methods described in the literature, although there exists no “gold standard” for alignment measurement. It remains unclear whether different techniques produce corresponding and valid values or a meaningful impact on clinical outcome. Therefore, we investigated the most commonly used measurements for radiographic interpretation of sagittal alignment and fusion as well as the comparability of the results.

In general, the Cobb angle method is widely used in published works due to its simple application and high interrater reliability. Alternatively, the Gore method is an often reported measurement with comparable characteristics [1, 25, 34]. Laing et al. implemented a less popular but considerably more straightforward technique for the evaluation of alignment, prompting us to include it in this study [28]. Several other techniques, such as the Harrison method, have been described, but these seem to be rarely applied for their more intricate nature [1, 34].

Our own mean measurements of sagittal alignment as per Cobb and Gore methods were well within the range of pertinent published works [1, 34]. Both methods produced similar results, whereas our rates of kyphotic and straight malalignments were fairly high with the Laing method. This is concordant with other studies with a similar modus operandi [28, 44].

Paradoxically, patients who improved on the VAS scale had a higher proportion of deterioration of sagittal alignment according to the Laing method. Surely, this represents an unprecedented novelty and leaves us in doubt about that correlation, particularly when considering the weak correlation coefficients. In none of the patients with improved or deteriorated alignment according to Cobb angle or Gore, there was any correlation with change in VAS score.

The majority of comparable studies reported similar results, with weak correlation of sagittal alignment in imaging and development of postoperative VAS [10, 24, 32, 43]. In a group of 57 patients, Wu et al. described a moderate correlation between Japanese Orthopaedic Association score for cervical myelopathy and Cobb angle after 5 years of follow-up [44]. None of the three methods revealed any advantages and benefit in regard to interpretation of alignment and its relation to the clinical outcome. The Gore and Cobb angle methods are comparable tools with congruent results, whereas the Laing method differed. Thus, premature comparison of studies utilizing these measurements is not recommended, as it has to be questioned whether cervical alignment on its own may serve as a reliable indicator for clinical success after ACDF surgery. The evaluation of clinical and radiographic findings is more so challenged by the notion that a large proportion of asymptomatic patients develop physiological alterations of their sagittal profile during their lifetime, with pathomorphogenic changes commonly affecting several of the weight-bearing structures without consequences for the patients [16, 17]. Vice versa, radiographic phenomena may not always be pertinent to the patient’s neck pain, putting into question as to how clinical improvement in such a patient cohort best be gauged. By contrast, a clinically manifest radiculopathy regularly originates from some form of pathomorphogenic degeneration that exerts a clear neural compression, allowing for clearer clinical reasoning [18].

The majority of published literature failed to demonstrate any significant correlation between neck pain and deteriorating alignment, which may however be owed to the heterogeneity of employed methods of measurement and general scarcity of studies [11, 14, 18, 26].

Fusion

It is common practice to evaluate for radiographic fusion after ACDF surgery. Several techniques for fusion assessment have been devised, but are not employed uniformly in available literature [23]. We therefore focused our efforts on the validity of the different fusion assessment methods and their respective relation to the clinical outcome.

The first method of measuring distances between the spinous processes of the operated segment represents an often reported instrument though without clear definition as to the exact reference point [31]. By choice, we hence applied measurements at two different and extreme reference points and found that with increasing distance between these reference points and the vertebral body, values of measurements will increase correspondingly. As such, we had to consider the necessity of defining a single reference point for the spinous processes. Aside from this, the Cobb angle and presence of bridging bone were used as fusion criteria.

The applicability of the Cobb angle was limited by poor quality in the presence of extensive degenerative changes. Whilst all cases could be assessed with both other methods, assessment of the Cobb angle was not clearly evaluable in 6 cases out of 208. Sudhakar et al. reported similar problems in their analysis and discuss this as a possible factor for inferior interrater comparability [5, 40].

As expected, there was high correlation of interspinous distances in contrast to the Cobb angle method evaluated by absolute measurements which were not provided by the method of bridging bone. The estimated fusion rates varied to a considerable extent between the three measurements. A Cobb angle of less than 2° showed the lowest, whereas screening for bridging bone showed the highest fusion rates. Unsurprisingly, we acquired more fused cases with measurement of the distance between the bases of the spinous processes than with the distance that was measured further away from the vertebral body.

Even when considering our divergent definition of Cobb angle reference values, our results were comparable with other studies to a certain degree. Hipp et al. reported a rise in-fusion rates when admitting a cut-off for the Cobb angle of 1° rather than 4° [20]. Beyond that, comparability with other studies is always confounded by discrete variations of surgical techniques, types of implants and timing of follow-up of which each may produce a considerable impact on fusion rates [6, 9, 38, 40]. Our analysis of comparability showed no significant case-related correlation between the three described measurements. Regarding the measurement of Cobb’s angle, the defined cut-off value is the most crucial determinant for evaluation in addition to a clearly defined point on the spinous processes for the assessment of their distances.

Several authors sought to evaluate the reliability of different measurements for the assessment of fusion rates. Cannada et al. investigated 27 cases which were considered fused cases when bridging bone was present [5]. Nevertheless, there was no evidence for superiority of this method, which was repeated by Tuli et al. [41]

Depending on the used implants and imaging quality, the comparability and feasibility of evaluating plain radiographs for bone bridges may be impaired [23, 40].

On follow-up, we found no significant correlation between VAS scores and presence of fusion for any of the three methods, which leads us to the assumption that there is no impact of bony fusion on clinical outcome. There is no agreement in the literature resolving this supposed contradiction. Cauthen et al. report a significantly higher rate of 80% satisfaction for patients with radiographic fusion, in contrast to 68% that report clinical improvement without fusion [6]. Conversely, the studies of Moon et al. as well as Löfgren et al. found no correlation between fusion rates and clinical outcome when defining fusion in cases of an interspinous distance of less than 2 mm or using the Cobb angle and bridging bone criteria [30, 32]. Our results merely demonstrate a significant but weak correlation between a Cobb angle of 4° or less with our definition of clinical improvement, which is further undermined by the questionable relevance of a 1-point increase on the VAS for the individual.

In essence, we concur with the recommendation of the CNS/AANS joint section guidelines that dynamic radiographs be considered for evaluation of fusion [23]. Absence of motion may be indicative of fusion despite clear absence of bridging bone, necessitating computed tomography (CT) scanning for confirmation of fusion. In light of this, Buchowski et al. and Feng et al. independently employed CT scanning in their postoperative follow-up, although again no positive correlation between clinical and radiographic outcome was found [3, 13].

Subsidence

Conventional criteria for evaluating subsidence after ACDF procedure are still lacking. One of the most often reported methods is the determination of ventral and dorsal segmental height reduction.

The Mochida method, initially devised for assessment of subsidence on magnetic resonance imaging (MRI), provides a reproducible, yet more complicated alternative to the other methods [31]. According to our results, there was a strong correlation between ventral and dorsal height reduction. Expectedly, the Mochida measurement did not correlate with the other two methods. The rate of subsidence within the group of ventral height reduction was almost twice as high as in the group of dorsal height reduction, although intra-individual correlation was only moderate. Statistical analysis could not show any significant correlation of subsidence and VAS scores. This is in accordance with other studies that reported similar results [2, 4, 29, 33, 45].

In conclusion, subsidence rates are clearly dependent on the measurement that is used and lead to incomparable results between studies. More so, the correlation between the occurrence of subsidence and clinical satisfaction for the individual after ACDF remains obscure.

Study limitations

There are two shortcomings limiting the validity of this study.

First, the surrogate parameter for clinical outcome was represented by the intensity of neck pain, which was present in all included patients. This in turn neglects other hallmarks of clinical success, being the improvement of myelopathic and radiculopathic signs such as gait ataxia or pain radiating to the upper extremities. We specifically chose a common denominator as representative for clinical outcome so as to sustain comparability within a rather large cohort that reports a plethora of clinical signs and symptoms. Albeit this seems to diminish the value of determining a surgery to be successful, it allows us to reference the various radiographic assessments to one definite variable. As was outlined in this and other published works, there is hardly any consensus on what clinical parameter to attribute to radiographic outcome [14, 16, 18]. We maintain the intensity of neck pain to be the most suitable parameter representative of cervical radiographic dimensions in combination with its duration to a lesser extent, much more so than either radiating arm pain or myelopathic signs and symptoms, all of which are primarily evoked by anatomically evident neural compression. Seeking association between neck pain and sagittal alignment, fusion and subsidence remains elusive, but perhaps the most promising for lack of a better alternative.

The other shortcoming stems from the retrospective nature of this study and the absence of preoperative complete diagnostics in a high number of patients, which precluded comparisons of pre- and postoperative imaging.

Conclusion

The radiographic outcome after ACDF for degenerative cervical spine disease rests on a multitude of parameters, for which no standardized assessment or measurement methods exist. The available methods vary greatly and apart from one do not demonstrate a significant correlation between radiographic and clinical outcome.