Introduction

Posterior lumbar interbody fusion (PLIF) is performed to decompress the neural tissues and fuse the affected motion segments. Although pedicle screw fixation has dramatically reduced the incidence of pseudoarthrosis (1.7–10 %) [14], delayed union or pseudoarthrosis remains a major radiographic complication after PLIF.

The clinical importance of fusion status has been underestimated on the basis of reports in which pseudoarthrosis did not affect the short-term outcome after lumbar arthrodesis [3, 59]. However, the clinical evaluations in the majority of those studies were primarily physician-based assessments. The effect of fusion status after PLIF on patient-based quality-of-life (QOL) outcomes remains unknown. We hypothesized that non-fusion after PLIF negatively affects QOL outcomes.

The purpose of this study was to investigate the effect of fusion status after PLIF on patient-based QOL outcomes by using the Japanese Orthopaedic Association Back Pain Evaluation Questionnaire (JOABPEQ) [10].

Methods

This study was a retrospective review of prospectively collected data and approved by the research ethics committee of National Hospital Organization Osaka Minami Medical Center (No. 24-11).

Potential subjects were 117 consecutive patients who underwent one-level PLIF (including single-level PLIF combined with laminectomy at other levels) for treatment of lumbar spinal canal stenosis between November 2010 and April 2013 at National Hospital Organization Osaka Minami Medical Center. The indications of PLIF were as follows: spondylolisthesis with slippage greater than 3 mm and/or posterior opening greater than 5° on dynamic lateral plain radiograph; foraminal stenosis requiring total fecetectomy for decompression. Out of the total, 17 patients were excluded: 12 patients with rheumatoid arthritis and 5 patients undergoing dialysis for chronic renal failure. Finally, 100 patients [mean age, 66.8 years (range 40–89 years)] were evaluated 6 months after surgery for fusion status by examining anteroposterior and dynamic lateral plain radiographs using Ito’s method [2] by the first author (T.M.), blindly. In brief, the bony fusion as apparent on the radiographs was classified into 4 grades: grade 1, complete fusion achieved with formation of a bone bridge between the upper and lower vertebral bodies; grade 2, bone bridge not formed, but formation of a thick fusion mass with no translucency observed around the cages; grade 3, fusion not achieved and translucency apparent around the cages; grade 4, cage sinking into the vertebral body or bone resorption around the cages. Fusion was defined as grade 1 or 2 with a flexion–extension angle of less than 5°. On the basis of these criteria, incomplete fusion was observed in 29 of the 100 patients, who were designated the incomplete fusion group. This group included 15 men and 14 women with a mean age of 71.1 years (range 49–84 years) at the time of surgery. Of the 71 patients in whom fusion had occurred, 29 patients whose age (±2 years) and sex matched those of the incomplete fusion group were designated as the fusion group. The fusion group included 15 men and 14 women with a mean age of 70.9 years (range 49–86 years) at the time of surgery.

Surgical procedure

In all patients, PLIF was performed through conventional open surgery (without minimally invasive surgery techniques) using carbon-polyetheretherketone cages and titanium pedicle screws and rods. The disc and cartilaginous endplates were removed to prepare the graft bed, and morselized local bone chips were implanted in the anterior and lateral portions of the interbody space. Two cages filled with local bone chips were inserted into the intervertebral space, and two bone blocks were inserted lateral to the cages. Finally, pedicle screws were inserted and connected with rods. Partial laminectomy at other levels was added if required.

Demographic and clinical characteristics

Age at the time of surgery, gender, number of levels of additional laminectomy, and level of the fusion segments were obtained from the medical charts and operative notes.

Outcome assessment

An overall clinical evaluation was made before and 6 months after surgery using the JOA Score for Low Back Pain (JOA score) [11]. The patients also answered the JOABPEQ and reported visual analogue scale (VAS) values for low back pain, pain in the buttocks and lower limb(s), and numbness in the buttocks and lower limb(s) at the same time points. The recovery rate based on the JOA score was calculated according to Hirabayashi’s method [(postoperative score − preoperative score)/(29 − preoperative score) × 100 (%)] [12]. The changes in the scores (postoperative score − preoperative score) in all 5 subdomains of the JOABPEQ (low back pain, lumbar function, walking ability, social life function, and mental health) and in the VAS values (postoperative VAS value − preoperative VAS value) for all categories were also calculated.

Statistical analysis

We compared the above parameters between the fusion and incomplete fusion groups. The Mann–Whitney U test was used to compare the age, number of levels of additional laminectomy, pre- and post-operative JOA scores, JOA score-based recovery rate, pre- and post-operative JOABPEQ score for each subdomain, change in the score for each JOABPEQ subdomain, pre- and post-operative VAS values for each category, and change in the VAS value for each category. Fisher’s exact probability test was used to compare genders. The chi-square test was used to compare the level of the fusion segments. Intraobserver and interobserver agreement for evaluation of fusion status were assessed with Cohen’s kappa coefficient in the first 25 patients blindly. For the analysis of intraobserver reliability, the first author (T.M.) evaluated the fusion status twice, with a week’s interval. For the analysis of interobserver reliability, the two spinal surgeons (T.M. and Y.M.) evaluated fusion status. Statistical analysis was performed using StatView Version 5.0 (SAS Institute Inc., Cary, NC, USA) except Cohen’s kappa coefficient. Cohen’s kappa coefficient was analyzed using R version 2.8.1 (The R Foundation for Statistical Computing). Differences were considered statistically significant at p < 0.05.

Results

Fusion was achieved within 6 months after surgery in 71 of overall 100 patients (71.0 %). No surgical site infection or instrumentation failure was observed. The age, gender, number of levels of additional laminectomy, and fusion level did not differ between the groups (Table 1).

Table 1 Patients’ demographic and clinical data

The preoperative JOA score, JOABPEQ scores for all subdomains, and VAS values for all categories did not differ between the groups (Tables 2, 3, 4).

Table 2 Pre- and post-operative Japanese Orthopaedic Association Score for Low Back Pain (JOA score) scores and associated recovery rate
Table 3 Pre- and post-operative Japanese Orthopaedic Association Back Pain Evaluation Questionnaire (JOABPEQ) scores and changes (postoperative score − preoperative score) therein
Table 4 Pre- and post-operative visual analogue scale (VAS) values and changes (postoperative value − preoperative value) therein

The postoperative JOA score and the associated recovery rate were higher in the fusion group than in the incomplete fusion group (JOA score, p = 0.0202; recovery rate, p = 0.0185) (Table 2). The postoperative scores of JOABPEQ for low back pain, walking ability, and social life function were significantly higher in the fusion group than in the incomplete fusion group (low back pain, p = 0.0234; walking ability, p = 0.0104; social life function, p = 0.0043) (Table 3). The changes in the JOABPEQ subdomain scores for low back pain, lumbar function, and mental health did not differ between the groups (Table 3). However, the changes in the JOABPEQ subdomain scores for walking ability and social life function were significantly greater in the fusion group than in the incomplete fusion group (walking ability, p = 0.0172; social life function, p = 0.0191) (Table 3). The postoperative VAS values and the changes in the VAS values for all categories did not differ between the groups (Table 4).

Intraobserver agreement for evaluation of fusion status had a kappa value of 0.884 and interobserver agreement had a kappa value of 0.750.

Discussion

This study revealed that fusion status affected not only clinical outcome in terms of the physician’s assessment (JOA score) but also patient-based QOL outcomes assessed by the JOABPEQ, though it did not affect VAS values for all categories. In particular, the JOABPEQ revealed that incomplete fusion after PLIF was associated with patients’ functional disability in walking ability and social life function. The JOABPEQ could reveal what kinds of disability the patients under incomplete fusion suffered from. Furthermore, the JOABPEQ was more sensitive than VAS scales for detecting postoperative patient QOL changes. To the best of our knowledge, this is the first report demonstrating an effect of fusion status after PLIF on the patient’s QOL outcome by using the JOABPEQ.

The effect of fusion status after lumbar arthrodesis on the patient’s outcome has been controversial. Reports with long-term follow-up have suggested that pseudoarthrosis adversely affects the patient’s outcome [9, 13]. In contrast, many authors have reported that pseudoarthrosis after lumbar arthrodesis did not affect the short-term outcome [3, 59]. Tsutsumimoto et al. [9] reported that clinical results 5 years after posterolateral lumbar fusion were better in patients with fusion than in those without fusion, although there was no significant difference in the clinical results after 1 and 3 years’ follow-up. Lee et al. [3] reported no difference in outcome in terms of the Korean Oswestry Disability Index, SF-36, and VAS 1 year after PLIF between patients who achieved fusion and those with pseudoarthrosis. However, the interpretation of these reports is complicated by their outcome evaluation methods (using predominantly physician-based outcome measurements), inhomogeneous fusion techniques and numbers of fusion levels, and different definitions of fusion. In this study, patients treated with the same surgical technique and one-level PLIF were evaluated by using patient-based outcome measures.

A number of trends in patient evaluation have resulted in the development and growing use of patient-based outcome measures to assess functional status and health-related QOL. In the field of lumbar spine disease, VASs or numerical rating scales have been used to quantify symptoms, the Roland-Morris Disability Questionnaire (RDQ) and Oswestry Disability Index have been used as lumbago-specific QOL measures, and the SF-36 has been used as a comprehensive measure of QOL. The JOABPEQ is a newly developed patient-based evaluation questionnaire drawn from the SF-36 and RDQ [10]. Its reproducibility, validity, and sensitivity have been verified previously [10, 1416]. The JOABPEQ consists of 25 questions and five functional scores (low back pain, lumbar function, walking ability, social life function, and mental health) that are calculated as weighted sums of the corresponding domains according to the formulas provided [10]. The minimally clinical important difference is 20 points between two time points [10]. The advantage of the JOABPEQ is that it allows independent evaluation of 5 subdomains of low back pain-related disability, whereas the RDQ and ODI both result in a single overall score. The detailed evaluation provided by the JOABPEQ may have contributed to the clarification of the effect of fusion status on the patient’s outcome in this study.

We chose 6 months postoperatively as the time for final evaluation. One reason for this is because maximum recovery in terms of the JOABPEQ results is expected to occur between 6 and 12 months after lumbar decompression surgery and PLIF [17, 18]. The other is because the fusion rate after PLIF is reportedly over 90 % 1 year postoperatively [14]. Furthermore, several factors such as the emergence of adjacent segment disease and degenerative changes would make it difficult to evaluate the true effect of fusion status on patient outcome after a longer follow-up period. In order to evaluate the true effect of fusion status, we chose 6 months after surgery as the final evaluation time. Moreover, we used an age-matched control group to exclude any effect of an age difference between the groups, as the mean age of the overall fusion group (n = 71, 65.0 years) was about 6 years younger than that of the incomplete fusion group. This was important because age has previously been shown to affect JOABPEQ subdomain scores [19].

We speculate about several reasons why incomplete fusion caused the poor improvement in walking ability and social life function. One possible reason is low back pain by endplate injury. It has been reported that lumbar endplate lesions associated with disc degeneration could induce low back pain [20]. The cages at the incomplete fusion segment could also injure the adjacent vertebral endplates by repetitive micro-motion. The other possible reason is patients’ fear-avoidance behavior caused by incomplete fusion. If patients were informed of incomplete fusion, they might restrain their daily activities to prevent undesirable results such as instrumentation failure. However, we neither took magnetic resonance images or computed tomographies for all patients, nor evaluated fear-avoidance behavior, and so further investigation is needed to clarify the reason why the fusion status affects patients’ QOL.

In conclusion, we investigated patients’ QOL outcomes after PLIF by using the JOABPEQ, a patient-based QOL outcome evaluation. This assessment revealed that QOL outcomes were in fact better in the fusion group than in the incomplete fusion group. Specifically, the JOABPEQ revealed that incomplete fusion after PLIF correlated with poor improvement in walking ability and social life function. This result indicates that achievement of fusion after PLIF is essential for not only long-term but also short-term QOL outcomes. We plan to perform additional studies to determine whether the QOL outcomes of patients in the incomplete fusion group can improve if fusion is later achieved, or even if fusion is not achieved.