Introduction

Spinal fusion is a common treatment for degenerative disorders of the lumbar spine providing adequate clinical results in terms of pain relief and high fusion rates [1]. However, spinal fusion has been associated with adjacent segment degeneration as a potential long-term sequel [2, 3]. A recent meta-analysis reported a radiographic prevalence of adjacent segment degeneration from 4.8 to 92.2 % [3]. Radiographic adjacent segment degeneration is poorly defined and does not correlate very well with symptomatic adjacent segment degeneration [4], which has also been termed adjacent segment disease [5]. In a study by Cheh et al. [6], 43 % of patients showed radiographic adjacent segment degeneration, and 24 % symptomatic adjacent segment disease, whereas 6.3 % had clinical signs without radiographic evidence. The reported rates for adjacent segment disease in the literature range from 2.6 to 30.3 % [2, 6, 7]. Throughout the present article, the term adjacent segment disease is used for symptomatic adjacent segment degeneration.

Based on the rates of occurrence, adjacent segment degeneration and disease have a significant clinical impact and numerous studies have aimed at identifying risk factors. It has been indicated that patient factors such as age, gender, obesity, pre-existing degeneration, and facet tropism may contribute to adjacent segment degeneration [2, 79]. Lee et al. [7] reported in their study of 1,069 patients after lumbosacral fusion that mainly pre-existing degeneration of the facet joints may be a predisposing factor. Other authors have attributed adjacent segment degeneration and disease to surgical factors, such as the numbers of segments fused [10], increased postoperative disc height [11], or a low postoperative lordotic angle [12] with often conflicting findings among the different studies. However, risk factors which have relatively clear implications on subsequent surgical management have not been identified.

It has been suggested before that sagittal alignment may contribute to adjacent segment degeneration. Kumar et al. [13] concluded in their study that patients with a normal postoperative C7 plumbline and sacral inclination had the lowest risk of adjacent segment degeneration. A relationship between adjacent segment degeneration after lumbar fusion and spino-pelvic parameters, such as lumbar lordosis, pelvic incidence, pelvic tilt, and sacral slope has never been described, however. Legaye et al. [14] and Duval-Beaupère et al. [15] described a chain of correlations between positional parameters of upright posture and pelvic incidence. Boulay et al. [16] demonstrated an equation for predicting lumbar lordosis based on pelvic incidence and explained that within confidence limits the standing position is within the conditions of an economic posture and that out of confidence limits, the adaptation potential of the spine and pelvis is exceeded and may evoke pathological positions and loading patterns. Pelvic incidence has subsequently been generally acknowledged as a predictor of the amount of lumbar lordosis required to assume a balanced sagittal posture [15, 17, 18]. As the relationship between pelvic incidence as a morphologic parameter and lumbar lordosis seems important for the sagittal profile of the spine, it may also account for different loading patterns in the lumbar spine which may be relevant for the development of adjacent segment degeneration and disease.

In the present study, patients that underwent revision surgery for symptomatic adjacent segment disease were compared to a control group, to investigate whether differences in spino-pelvic parameters and especially spino-pelvic alignment could be associated with a higher risk for developing adjacent segment disease.

Materials and methods

For analysis of the relation of spino-pelvic parameters and adjacent segment disease, a retrospective case–control study was carried out. Approval of the research ethics committee of the state of Zurich has been granted for non-invasive, retrospective studies using patient data only. Patients were included into the adjacent segment disease group (ASDis) if they underwent primary lumbar fusion of one, two, or three segments between L2 and S1 and had surgery for symptomatic adjacent segment disease during follow-up. Indications for surgery were degenerative lumbar spondylosis or spondylolisthesis with leg pain or claudication. Patients that lacked complete preoperative radiographic documentation consisting of plain films and MRI, which had prior spine surgery or showed lumbar deformity, such as degenerative scoliosis or isthmic spondylolisthesis were excluded from further analysis. A control group (CTRL) was randomly selected out of the patient pool of lumbar fusion procedures that met the above inclusion criteria except for revision surgery to be similar in the distribution of age and gender to the ASDis group with the same exclusion criteria. In addition, patients were included in the CTRL group only if they had a minimal follow-up of 5 years, no signs of symptomatic adjacent segment disease upon last follow-up, similar distribution of levels and number of segments fused and comparable degree of disc degeneration in the prospective adjacent segment before the surgical procedure as assessed on MRI. All patients had standing radiographs of the lumbar spine with inclusion of the femoral heads; whole spine radiographs were not available.

In total, 84 patients were included which demonstrated adequate follow-up and radiographic documentation with preoperative MRI of the lumbar spine. All included patients had a posterolateral instrumented fusion with pedicle screws and showed union during follow-up. Forty-five patients demonstrated adjacent segment disease (ASDis group), all of which were in the epifusional segments, and underwent revision surgery after a mean of 49 months (7–125 months). The average follow-up after the initial surgery was 71 months (15–149 months). The control group consists of 39 patients with a mean follow-up of 84 months (61–142 months). The average body mass index (BMI) in the ASDis group is 26.2 ± 4.4 vs. 26.8 ± 4.0 in the CTRL group. Details on the patients’ characteristics are given in Table 1. The patients in the control group were selected to be similar in the distribution of the levels treated and number of segments fused (Table 2). To rule out that the ASDis group had more preoperative degenerative changes of the lumbar spine resulting in adjacent segment disease, the patients in the control group were selected to have similar preoperative grade of disc degeneration on MRI (Table 3). Selecting similar groups in the distribution of their matching variables was carried out before measurements were performed on radiographs to minimize selection bias.

Table 1 Patients
Table 2 Levels and number of segments fused
Table 3 Preoperative Pfirrmann grade of the adjacent disc

Analysis of radiographs consisted of measurements of spino-pelvic parameters, such as pelvic incidence (PI), sacral slope (SS), pelvic tilt (PT), lumbar lordosis (LL), and L1 plumb line. Figure 1 depicts how measurements were taken. Measurements were taken on the preoperative radiographs and at last follow-up in the CTRL group or before revision surgery in the ASDis group. LL was measured from L1–L5 and L1–S1 for assessment of their sensitivity as described below. In addition, measurements of lordosis of the segments to be fused and the postoperative fusion angle (fusion angle) as well as the pre- and postoperative angles of the epifusional (upper AS angle) and subjacent segments (lower AS angle) were measured by sagittal Cobb measurements in lateral radiographs. PILL mismatch (∆PILL) was calculated as the difference between PI and LL (∆PILL = PI−LL). Measurements were performed by two raters and Bland–Altman analysis performed to indicate the accuracy of measurement [19]. For subsequent analyses, the mean of the two measurements was used. The inter-rater bias for PI, LL, and ∆PILL was 0.78 ± 2.2, 0.34 ± 1.45, and 0.56 ± 1.89, indicating less measurement accuracy for PI. The 95 % limits of agreement were calculated to range from 5.1 to −3.5, 3.2 to −2.5, and 4.3 to −3.1, respectively. Disc degeneration was graded on preoperative MRI of the lumbar spine according to Pfirrmann et al. [20]. Degeneration of facet joints could not be fully assessed as not all MRI included axial images of all segments of the lumbar spine and, therefore, had to be excluded.

Fig. 1
figure 1

Measurements were carried out as shown: lumbar lordosis is measured as the sagittal Cobb angle between the superior endplate of L1 and the sacral plateau S1. Pelvic incidence is the angle of a line perpendicular to the S1 endplate at its midpoint and a line connecting to the midpoint of the line connecting the centers of the femoral heads. Pelvic tilt is measured as the angle between a vertical line trough the midpoint of the centers of the femoral head and a line connecting to the midpoint of the endplate of S1. Finally, sacral slope is measured as the angle between a horizontal line and the endplate of S1

Statistics

Analysis of all data was carried out using JMP 9.0 for Mac (SAS Institute Inc., Cary, North Carolina, USA). Normality of distribution was checked by the Shapiro–Wilk test. For the variables lumbar lordosis (p = 0.56), pelvic incidence (p = 0.82), pelvic tilt (p = 0.71), sacral slope (p = 0.09), ∆PILL (p = 0.92), and PT/SS (p = 0.06), a normal distribution could be assumed and therefore parametric testing used. A dependent and independent t test statistic was employed for intra-group and inter-group comparisons, respectively. For the variables L1 plumb line, fusion angle, upper AS angle, and lower AS angle, a normal distribution could not be confirmed. Intra-group comparisons were carried out by the Wilcoxon signed-rank test and between group comparisons by the Wilcoxon rank-sum/Mann–Whitney tests. All p values reported are 2-tailed and considered significant if p < 0.05. Only significant p values are reported. Classification was performed using logistic regression and ROC curve analysis. For ROC curve graphing, the pROC library for the R statistics package was used [21].

Results

Comparison of the pre- and postoperative values within the groups showed significant differences in the ASDis as well as CTRL group for lumbar lordosis (ASDis p = 0.012 and CTRL p = 0.027) and pelvic incidence (ASDis p = 0.017 and CTRL p = 0.002), for pelvic tilt only in the CTRL group (p = 0.011) and for sacral slope only in the ASDis group (p = 0.011). The differences between pre- and postoperative measurements are within a few degrees and can be regarded as largely unchanged although significant, as the differences are within the limits of measurement accuracy as indicated by the Bland–Altman analysis above. All measurement results are summarized in Table 4. Fusion did not change the preoperative lordosis angle of the segments to be fused in both groups (ASDis: 26.5 ± 11.3° vs. 24.4 ± 9.3°, p = 0.162; CTRL:23.6 ± 12.6° vs. 22.7 ± 8.6°, p = 0.535). Likewise, the segmental angle in the upper and lower adjacent segments did not change from pre- to postoperative in both groups, indicating that they unlikely contribute to adjacent segment degeneration and disease. Subsequent analyses focused on the postoperative measurements only to link postfusion alignment to adjacent segment disease.

Table 4 Radiographic measurements

Comparison of the radiographic parameters between the groups reveals that patients in the ASDis group have a significantly higher pelvic incidence than in the CTRL group (60.9 ± 10.0° vs. 51.7 ± 10.4, p = 0.001) and correspondingly also a higher pelvic tilt (22.4 ± 7.0° vs. 18.8 ± 6.5°, p = 0.012) and slightly higher sacral slope (37.2 ± 8.0° vs. 35.3 ± 7.7°, p = 0.004). In addition, lumbar lordosis was significantly lower in the ASDis group compared to the CTRL group (48.1 ± 12.5° vs. 53.8 ± 10.8°, p = 0.012). To assess alignment, pelvic incidence was put into relation to lumbar lordosis by calculating the difference between pelvic incidence and lumbar lordosis measured to S1 (∆PILL = PI−LL) following the notion that lumbar lordosis should more or less match pelvic incidence. ∆PILL showed a large significant difference in spino-pelvic alignment between the ASDis and CTRL group (12.5 ± 16.7° vs. 3.4 ± 12.1°, p = 0.001), indicating that it may be a potentially contributing factor to adjacent segment disease. It was further found that the L1 plumb line, measured as the horizontal distance between the posterior endplate of S1 and the plumb line from the center of L1, was slightly shifted anteriorly in the ASDis group (39.4 ± 68.2 mm vs. 23.6 ± 61.4 mm, n.s.), which corresponds to the lower lordosis in this group compared to the CTRL group. Despite the large difference for the L1 plumb line between ASDis and CTRL, it is not significant, which may be explained by the large distribution of values as indicated by the standard deviation. In addition, the relationship of the L1 plumb line to global balance is not defined and conclusions on the overall balance can therefore not be made from this measure.

Classification of alignment according to PILL mismatch in relation to adjacent segment disease

Logistic regression revealed that ∆PILL as a measure of spino-pelvic alignment can be used to classify patients into the two groups (Chi-square = 12.5, p < 0.0001), while other radiographic factors such as pelvic incidence or lumbar lordosis alone were significant but showed less sensitivity on subsequent analyses (Chi-square = 11.82, p < 0.0001 and Chi square = 6.53, p = 0.0106, respectively). Sacral slope did not show any discrimination between the two groups (Chi-square = 3.06, p = 0.18). For ∆PILL a cutoff value of 9.8° was determined to discriminate between the two groups with the highest sensitivity and specificity by ROC analysis with an area under the curve AUC = 0.73 (95 % confidence interval 0.62–0.84; Fig. 2). The same analysis for measurements of LL from L1 to L5 reveals a cutoff value of 15.2° and AUC of the ROC curve equals 0.87. For scientific purposes, this measurement was used for classification in a subsequent biomechanical study [22]; however, for clinical use, ∆PILL used in this study signifies pelvic incidence-lumbar lordosis mismatch with LL measured from L1 to S1 as internationally widely agreed. Based on the cutoff value of ∆PILL of 10°, patients were classified into two types of alignment according to pelvic incidence-lumbar lordosis (PILL) mismatch. In type A alignment, the difference between pelvic incidence and lumbar lordosis (∆PILL) is below 10° and in type B alignment, ∆PILL is above or equal to 10°. Forty-one patients were classified as type A alignment, 29 belonged to the CTRL and 12 to the ASDis group. In type B alignment, 35 out of 43 patients belong to the ASDis and only 8 to the CTRL group. In type A alignment, 25.5 % of patients underwent revision surgery for adjacent segment disease, whereas 78.3 % of patients classified as type B alignment had revision surgery. Classification of patients according to ∆PILL <10° yields a sensitivity for predicting adjacent segment disease of 71 % with a specificity of 81 %. The odds ratio as an estimate of the relative risk for developing adjacent segment disease is 10.6, indicating a 10-times higher risk for patients with a type B alignment and a high degree of PILL mismatch. The characteristics of patients according to their classification are given in Table 5. Comparison of ASDis and CTRL within type A and type B alignments, i.e., false compared to true negatives and false compared to true positives with reference to adjacent segment disease, does not show any significant differences. On the other hand, comparison between type A and type B not distinguishing between ASDis and CTRL showed significant differences for all variables except for values of L1 plumb line, indicating two distinct morphologies of spino-pelvic alignment (Table 5, Fig. 3). In type B alignment, the pelvis assumes a compensatory position with increased pelvic tilt as indicated by the ratio PT/SS which was different between type A and B alignments (0.53 ± 0.19 vs. 0.67 ± 0.23, p = 0.004).

Fig. 2
figure 2

Logistic regression and ROC analysis revealed a cutoff value for ∆PILL of 9.8° at which classification according to ∆PILL yields a sensitivity of 71 % and specificity of 81 %. The area under the curve AUC is 0.73 with a 95 % confidence interval from 0.62 to 0.84

Table 5 Classification of spino-pelvic alignment into type A and B according to ∆PILL
Fig. 3
figure 3

The two distinct morphologies of pelvic incidence-lumbar lordosis mismatch are shown. In a type A alignment, pelvic incidence is in the normal range with a corresponding lumbar lordosis and balanced lumbar spine, consequently ∆PILL is below 10°. In a type B alignment, the pelvic incidence is greater than in type A with slight pelvic back tilt. If this pelvic morphology is not paralleled by a corresponding lumbar lordosis, the resulting mismatch predisposes to a higher risk for adjacent segment disease

Discussion

This investigation was designed and the control group selected to be similar in the distribution of the matching variables to investigate spino-pelvic alignment in relation to adjacent segment disease irrespective of preoperative degenerative changes, levels and number of segments fused. In accordance with the previously established relationship of pelvic incidence and lumbar lordosis and the notion that the two should match within certain limits [15, 17, 18], the difference ∆PILL was calculated as a measure of spino-pelvic alignment. ∆PILL was significantly different in the ASDis and CTRL group and could be shown to be useful for classifying patients by logistic regression at a cutoff value of 10°. Classification of PILL mismatch into type A (∆PILL <10°) and type B (∆PILL ≥10°) shows adequate sensitivity and reliability for predicting the risk of adjacent segment disease and patients with ∆PILL greater than 10° have a 10-times higher risk of developing adjacent segment disease. The relationship between local alignment described as PILL mismatch and global sagittal balance, which may be a potential contributing factor, could not be assessed due to the retrospective nature of the study as whole spine radiographs were not obtained routinely for investigating the lumbar spine and were therefore not available for analysis.

The etiology of adjacent segment degeneration and disease has been related to biomechanical alterations following spinal fusion [2326] or regarded simply as progression of the degenerative process not influenced by the fusion procedure [12, 27]. As such it is likely to be multifactorial and disc degeneration and therefore degeneration of the motion segment might be related to genetic influences and has been associated with for example polymorphisms of the vitamin D receptor and collagen IX genes [28]. Studies on altered kinematics and biomechanics after lumbar spinal fusion have not focused on spino-pelvic alignment, but investigated hypermobility and increased loads in the adjacent segment [24, 26]. Our findings indicate that PILL mismatch describes different morphologies with expected consequences on spinal biomechanics (Fig. 3). Our data show that in some cases a high pelvic incidence is not matched by a correspondingly high lordosis as would be expected in a balanced spine [29, 30] and therefore optimal congruence between pelvic and spinal parameters for spinal balancing and for an economical posture not present [31]. It seems likely and is expected from a biomechanical point of view that differences in ∆PILL as a measure of spino-pelvic alignment have consequences on the load patterns in the lumbar spine and even more so in the adjacent segment after spinal fusions, which is in accordance with findings of previous studies [13, 32]. Whether adjacent segment disease was evoked by the fusion procedure or whether more degenerative changes would have been seen in a non-fusion group over time as a result of a type B alignment remains to be investigated. However, a subsequent modeling study has shown that segmental joint reaction forces acting on the motion segment are unfavorable in patients with PILL mismatch before fusion [22], indicating that alignment may contribute to the natural history. The study of segmental biomechanics in the background of fusion should therefore not be decoupled from alignment and the resulting forces thereof.

The relevance of pelvic incidence-lumbar lordosis mismatch in adult deformity has just very recently been reported in a study by authors of the International Spine Study Group [33]. In their analysis of 492 patients with adult spinal deformity in a prospective database, they identified the sagittal vertical axis (SVA), pelvic tilt, and PILL mismatch to correlate with disability and quality of life as defined by an Oswestry Disability Index (ODI) of more than 40. PILL mismatch of more than 11° was found to have the strongest correlation with disability, but also correlations with the SVA and pelvic tilt. Therefore, PILL mismatch of 11° or more indicated that patients are either likely to be unbalanced or compensating. Their values compare favorably to our cutoff value of 10° for adjacent segment disease. Schwab et al. [33] concluded from their analysis that PILL mismatch is an intrinsic part of the adult deformity and restoration of PILL mismatch should be the primary objective in the surgical management of adult deformity. Their study indicates that about a third of patients with PILL mismatch of more than 11° did not have global sagittal imbalance as defined by an SVA greater than 47 mm [33]. While in the present study global sagittal balance could not be measured, the patients reported here were treated for lumbar degeneration and not for global sagittal imbalance. PILL mismatch brings the deformity world into the degenerative lumbar spine and indicates an intrinsic deformity whereby global sagittal imbalance does not necessarily have to be present. If PILL mismatch and, therefore, the intrinsic deformity of the degenerative spine is not addressed in a fusion procedure and fusion is carried out by maintaining the same extent of lordosis, then these patients are at higher risk for adjacent segment disease as demonstrated here.