Introduction

In a 1985 review, Cloward [4], who had been the first to develop the use of autologous iliac bone grafts (IBGs) for posterior lumbar interbody fusion (PLIF), asserted that PLIF is the answer to the treatment of diseases of the lumbar spine and may be the operation of the future. During the past 10 years, less invasive surgery using local bone grafts (LBGs) obtained from the laminectomy has been used instead of IBG for the interbody fusion [10, 13, 19, 24], achieving fusion outcomes similar to those with IBG. However, various materials, such as allografts [1], platelet gels [3, 25], ceramic synthetic bone-graft substitutes [5, 25], demineralized bone matrix [18], and recombinant human bone morphogenetic proteins 2 [2, 6, 9] and 7 [23], have been used to obtain more complete interbody fusion. Therefore, fusion outcome may be related not only to the type of bone graft, but also to the types of cage and posterior instrumentation, or to surgical skill. This retrospective cohort study compared fusion and clinical outcomes achieved at more than two postsurgical years by the same surgeon who did L4–L5 single-level PLIFs using the same posterior instrumentation with four combinations of one of three types of interbody cage with one of two sources of bone graft (iliac and local, or only local).

Materials and methods

Case selection

Records of patients who had undergone L4–L5 single-level PLIF performed by the same surgeon (M.K.) at the university hospital from April 2004 through December 2009 were reviewed. The indication for PLIF was based on a diagnosis of lumbar spinal stenosis with spondylolisthesis from clinical and radiographic assessments. The Japanese Orthopaedic Association (JOA) score [14, 20] was used for clinical assessment. The visual analogue scale (VAS) [12] graded on a 100-mm scale to evaluate low back pain and leg pain was also used for clinical assessment.

Radiologic assessment was based on reviewing plain radiographs. A finding of greater than 10 mm sagittal translation in the lateral radiographic view of the patient in the lumbar neutral position, or of greater than 5° local kyphosis in the lateral radiographic view of the patient in the lumbar flexion position, constituted diagnosis of lumbar instability. Cases of trauma, infection, previous lumbar surgery, and surgery at other levels were excluded.

PLIF procedure and postsurgical treatment

Two thirds of the upper lamina toward bony lesion was removed while maintaining integrity of the interspinous and supraspinous ligaments. A pair of TSRH-RP pedicle screws (Texas Scottish Rite Hospital spinal system, Medtronic Sofamor Danek, Memphis, TN, USA) was inserted from the entry point at the lateral border of the superior facet where it intersected the midportion of the transverse process at the bilateral L4 and L5, and two screws, another pair of TSRH-RP pedicle screws, were inserted on the other side. The pairs of screws were connected by a titanium rod [17]. Disc material and cartilaginous endplates were excised to make the graft bed. Bone grafts were harvested from the posterior iliac crest or were removed from the lamina and crushed by bone milling. Two cages of the same type were filled with crushed bone graft (Fig. 1a). The extent of the volume of the cages filled with bone graft was determined according to the cage manufacturer’s instructions. Then, the cages were implanted into the lateral and anterior portions of the interbody space (Fig. 1b), then the remaining bone chips were inserted outside of the cage (Fig. 1c). Additional materials, such as allografts, platelet gels, ceramics, demineralized bone matrix, or bone morphogenetic proteins were not used.

Fig. 1
figure 1

Bone-chip grafts were inserted inside the cage (a, arrow), and the cages were implanted into the lateral and anterior portions of the interbody space (b). Then, the remaining bone chips were inserted outside of the cage (c, arrows)

Postsurgically, patients were required to wear a fitted, rigid brace for at least 3 months.

Postsurgical radiologic and clinical assessments

For radiologic assessment, anteroposterior (AP) and lateral radiographic views with the patient in flexion and extension positions were examined by two independent observers who had not participated in PLIF surgery and who were blinded at the more than 2-year postsurgical assessment. Any movement detected between the vertebral bodies or lucency with a diameter greater than 1 mm observed within the cage or at the cage-bone interface on lateral radiographs was considered a sign of non-fusion.

CT scanning for sagittal and coronal image reconstruction of the involved lumbar segments and the bony structure was done to evaluate the success of fusion at the more than 2-year postsurgical assessment. Classification of interbody fusion success was evaluated according to criteria based on Brantigan, Steffee, and Fraser (BSF) criteria; BSF-1: Radiographic pseudarthrosis (non-fusion) is indicated by collapse of the construct, loss of disc height, vertebral slip, broken screws, displacement of the carbon cage, or significant resorption of the bone graft or lucency visible around the periphery of the graft or cage. BSF-2: Radiographic locked pseudarthrosis (questionable fusion) is indicated by lucency visible in the middle of the cages with solid bone growing into the cage from each vertebral endplate. BSF-3: Radiographic fusion is indicated by bone bridges at least half of the fusion area with at least the density originally achieved at surgery. Radiographic fusion through one cage (half of the fusion area) is considered to be mechanically solid fusion even if there is lucency on the opposite side [8]. At the same time, the VAS and the JOA score were for repeated clinical assessment.

Statistical analysis

Statistical tests used to compare differences among treatment groups are identified on the tables and the figure presenting results. For incomplete fusion group, to assess the factors affecting fusion, multivariate logistic regression models were used to estimate odds ratios (ORs), P values, and associated 95 % confidence intervals, and these following factors were examined: smoking (positive), sex (woman), surgical time and blood loss, and the inner volumes of various cage sizes. The kappa statistic was used to determine interrater reliability for the postsurgical radiographic assessments [16].

Data were analyzed with SPSS (version 16; SPSS, Chicago, Illinois, USA). Alpha was set at 0.05.

Study approval, informed consent, and funding

The study was approved by the institutional review board of the university hospital, and study subjects provided informed consent. The study did not receive any external funding, and authors do not have any disclosures to declare.

Results

Demographics and surgical details

Review of records showed that 67 patients (24 men and 43 women) underwent L4–L5 single-level PLIF by the same surgeon (M.K.) during April 2004 through December 2009. Their median (range) age was 67 (36 to 80) years, and the median (range) duration of follow-up was 34 (24 to 62) months. The records were sorted into four groups according to cage type and bone graft (Table 1). Age, sex, follow-up times, smoker or nonsmoker, and surgical time and blood loss were not statistically significantly different among the four groups (Table 2).

Table 1 Group characteristics according to cage type and bone graft
Table 2 Presurgical demographic characteristics and surgical details of the four groups of PLIF patients

Fusion outcomes

Outcomes were compared by groups taken in two different ways: by the four groups according to the four different combinations of cages and bone graft and by the two groups according to different sources of bone graft (iliac and local or only local) regardless of the type of cage.

As evaluated at final follow-up by observing AP and lateral flexion-extension radiographic images, a patient in the BI (Brantigan I/F cage; iliac and local bone graft) group had no fusion. Based on BSF classification, fusion for the TL (TELAMON C cage; local bone graft) group was statistically significantly less than those for the BI (P = 0.02, χ 2 test) and TI (TELAMON C cage; iliac and local bone graft) groups (P = 0.03, χ 2 test) and OL (OIC PEEK Interbody; local bone graft) group (P = 0.03, χ 2 test) (Tables 3). Fusions in the iliac and local bone graft (IBG) and local bone graft (LBG) groups on the BSF criteria after surgery were not significantly different (P = 0.09, χ 2 test, Table 4). The concordance of interrater reliability for these evaluations was 81 %, which was almost perfect [16].

Table 3 Final postsurgical outcomes according to BSF classification
Table 4 Source of bone graft and 2-year postsurgical outcome by BSF classification

Clinical assessment at final follow-up

Presurgically, the JOA score, the VAS (low back pain), and the VAS (leg pain) scores were not significantly different when compared by the four combinations of source of bone graft. At final follow-up, the JOA score and the VAS (leg pain) scores were also not significantly different. However, the VAS (low back pain) score was significantly higher for the TL group than for the other three groups (P = 0.01, ANOVA, Table 5). In the TL group, the change of the JOA score and the VAS score for low back pain and the VAS for leg pain score between presurgical and final follow-up time were not significantly among BSF fusion classification (JOA score; P = 0.51, VAS for low back pain; P = 0.59, VAS for leg pain; P = 0.50, two-way ANOVA).

Table 5 Surgical group and mean (SD) JOA and VAS scores at presurgical and final follow-ups

Multivariate logistic regression models for the TL group

In the factors affecting poor fusion and poor clinical outcome, multivariate logistic regression analysis in the TL group showed that the inside volume of cage of ≥2.0 mL was the only significant factor for incomplete fusion 34 months after PLIF (Table 6).

Table 6 Multivariate logistic regression models of incomplete fusion in the TL group at the final follow-up

Discussion

We found that at a 34-month follow-up, fusion outcomes were the same for iliac bone grafts and local bone grafts used for L4–L5 single-level PLIF. However, the fusion outcomes at a 34-month follow-up for the TL group were statistically significantly less than those of the BI and TI groups and the OL group. As for clinical assessment, we found that the VAS (low back pain) score was significantly higher for the TL group compared with the other three groups. Therefore, it is very likely that the TELAMON C cage with local bone graft lead to incomplete fusion or inadequate clinical outcome.

We considered two reasons why the TELAMON C cage with local bone graft led incomplete interbody fusion. First, there might be a relationship between bone fusion and the amount of bone grafts, because the fusion rate in the TELAMON C cage with local bone graft was significantly less than that for the TELAMON C cage with iliac and local bone graft. Second, inside volume of cages might also affect the bone fusion. Because by using different inside volumes of cages with same local bone graft, there was a significant worse bone fusion in TL group compared with OL group. The volume inside the cage we used from catalogues was not significant between BI, TI, and TL; however, the cages in the OL group had significantly less volume than the cages of the other three groups (Fig. 2). Moreover, in our result, multivariate logistic regression models of incomplete fusion in the TL group showed that larger inside volume of the cage was statistically significantly related to incomplete fusion. Quantities of laminar bone chips for local bone grafts (LBG) were limited, and we conjectured that the larger volume of the TELAMON C cage compared with that of the OIC cage meant that less LBG was available to transplant around the TELAMON C cage. Shar et al. indicated that bony fusion is most likely be observed at the lateral zone in the intervertebral disc space where PLIF is performed [22]. They argued that for successful outcomes of PLIF, the amount of bone transplanted needs to be sufficient for the insertion into the intervertebral disc space posterior and lateral to the cages. For all of these reasons, we speculated that large volume inside the cage with only local bone graft led incomplete interbody fusion because of a lack of the bone transplantation around the cage.

Fig. 2
figure 2

Mean (range) inside volume of cages. *P = 0.01 (ANOVA)

As to clinical outcomes at the 34-month follow-up, among the four combinations of cage and source of bone graft, the improvement of the VAS (low back pain) was significantly less in the TL group than in the other three groups. Thus, the TL group, which had more incomplete fusion than the other three groups, also had more back pain at final follow-up. Some previous reports presented that non-union did not affect the clinical result in the short term [7, 11]. By contrast, other previous reports noted that some patients with incomplete fusion have persistent or recurrent symptoms after surgery [15, 21]. The relationship between incomplete bone fusion and clinical outcomes has been controversial. In our results, it is probable that the large volume cages (TELAMON C cage) with local bone graft leading the incomplete fusion will be related to residual postsurgical low back pain. Thus, we believe that the large volume cages with local bone graft may be the cause of the residual postsurgical low back pain, and the type of the graft with amount is important for fusion rates for PLIF.

This study had limitations. It was a retrospective comparison, not a prospective, randomized controlled trial. Only three types of cages were used. If more types had been used, stronger results might have been obtained.

Conclusions

The type of cage or the amount of grafts used in PLIF could lead to incomplete fusion or inadequate clinical outcome. We believe that iliac bone grafts should be added to local bone grafts if the volume inside the cage is ≥2.0 mL. We recommend that a cage with a small inside volume be used to transplant the bone chips around the cage if local bone grafting is used. Incomplete interbody bone fusion after PLIF may be related to residual postsurgical low back pain.