Introduction

Interbody fusion is generally performed for degenerative lumbar spondylolisthesis (DLS) [18, 28]. However, there are many satisfactory long-term outcomes after posterolateral fusion (PLF) [2, 14, 16, 32]. Some authors have reported no significant differences in clinical outcomes and fusion rates between PLF and interbody fusion, thereby questioning the need for anterior support in short lumbar fusions [6, 13, 15, 20]. Furthermore, even nonunion cases after PLF achieved good clinical outcomes [8, 25, 31]. Miyashita et al. found that facet fusion (FF) using a percutaneous pedicle screw (PS) system, as a minimally invasive evolution of PLF, also resulted in good clinical outcomes [22]. In this study, we assessed the clinical outcomes and course of nonunion cases after FF and determined whether the nonunion cases achieved good clinical outcomes as well as those after PLF.

Methods

Patient population

This was a retrospective study using prospectively collected data. We reviewed 136 out of 143 consecutive patients (77 women, 59 men; average age, 68.3 years; age range, 41–91 years) who underwent FF for single-level DLS between June 2009 and September 2015, after at least 1 year of follow-up. Seven patients were excluded; two were lost to follow-up, and five were not assessed using computed tomography (CT) up to 1 year postoperatively until adequate fusion was confirmed. The average postoperative follow-up period was 72.8 months (16–120 months). The slipped vertebra occurred at L4 in 126 patients and L3 in 10 patients. All included patients presented with intractable radiculopathy (either bilateral or unilateral) and intermittent claudication caused by spinal stenosis after several months of conservative treatment. Patients were excluded if they had low back pain (LBP) alone, foraminal stenosis, and spondylolytic spondylolisthesis. There were two indications for lumbar fusion: (1) sagittal translation of ≥ 8% on a flexion-extension lateral radiograph or (2) anterior wedging of ≥ 5° on a flexion radiograph and a disc range of motion (ROM) of ≥ 10°. The indication for fusion was not done according to lateral disc space height. When lumbar fusion was indicated, we did not use other approaches, such as interbody fusion or slippage reduction, and all DLS patients were treated with FF regardless of the spondylolisthesis grade (translation degree of all patients was grade I or II).

Operative technique

The FF procedure was performed according to Miyashita’s method [22]. Thus, the central canal and lateral recess were decompressed using laminar fenestration and bilateral undercutting of the medial part of the facet joints. Bone chips harvested from the spinous process and laminae were pushed into the thoroughly decorticated facet joint spaces (Fig. 1a). Percutaneous PSs were then inserted through the fascia.

Fig. 1
figure 1

Axial computed tomography of facet fusion. a Bone chips pushed into the facet joint spaces are observed immediately after surgery. b Complete bony continuity. c Bony continuity cannot be confirmed at any facet joint portion

Postoperative protocol

Patients were allowed to sit on bed with a soft brace on the first postoperative day and were mobilized with physiotherapy after the removal of the drain and urine catheter on the second postoperative day. The soft brace was worn for 3 months postoperatively. If a patient had osteoporosis, a hard brace was used for the first 3 months, and a soft brace was used for another 3 months.

Radiological assessment

ROM at the fused level was measured on a flexion-extension lateral radiograph preoperatively and 1 year postoperatively. Jiang et al. reported a mean difference of 1.9° in the Cobb angle between readings on midsagittal CT images of thoracolumbar burst fractures, with intra- and inter-observer reliability using intraclass correlation coefficients (ICCs) of 0.969–0.986 and 0.941–0.953, respectively [12]. Accordingly, ROM of less than 2° was defined as no motion taking measurement error into consideration. CT scans were performed repeatedly until adequate fusion was confirmed at 3, 6, and 12 months postoperatively and once a few years thereafter. Fusion was assessed according to Miyashita’s grading system for FF [22]. Complete (Grade I) or partial (Grade II) bony continuity on the facet joint was considered adequate fusion (Fig. 1b), and nonunion was determined when bony continuity could not be confirmed at any portion of the facet joint (grade III) (Fig. 1c) or when there was obvious FF nonunion (grade IV). Three experienced spine surgeons assessed three to five axial CT planes through the center of the facet. Bony continuity on at least one of the planes was judged as fusion. The planes were blindly evaluated on two separate occasions. When judgment differed between observers, discussions were conducted to reach a consensus. Based on the findings of 1-year postoperative CT scans, patients were classified into Fusion or Unconfirmed Fusion group. Furthermore, Unconfirmed Fusion group patients were classified into Delayed Union or Nonunion group depending on whether FF was confirmed on the following CT examination (Fig. 2). For the Nonunion group, the joint space proportion in the bilateral facets was output using image software (Ziostation2, Ziosoft, Inc., Minato-ku, Tokyo, Japan) on the axial CT plane through the center of the facet pre- and postoperatively. We evaluated lateral areas to a line through the anterior edge of the inferior articular process and vertical to a line through the anterior edges of the bilateral facets (Fig. 3).

Fig. 2
figure 2

Classification chart of facet fusion grades assessed by computed tomography

Fig. 3
figure 3

Axial computed tomography through the center of the facets of a nonunion case. Arrowheads indicate the anterior edge of the facet. Arrows indicate the anterior edge of the inferior articular process. Stars indicate areas to be measured

Clinical outcome measures

The therapeutic effectiveness of FF was assessed as a clinical outcome using the Japanese Orthopaedic Association Back Pain Evaluation Questionnaire (JOABPEQ) [9], the Roland-Morris Disability Questionnaire (RMDQ), the visual analogue scale (VAS) of LBP (LBP-VAS), and VAS of buttock and lower limb pain (BLP-VAS), with all tests being done both preoperatively and 1 year postoperatively. According to previous reports, a minimal clinically important difference was defined as a 20-point increase in each category score of the JOABPEQ [9], a 5-point decrease in the RMDQ [24], a 21-point decrease in the LBP-VAS, and a 28-point decrease in the BLP-VAS [27]. Clinical outcomes were compared between the Fusion, Delayed Union, and Nonunion groups. In addition, surgical complications and revision surgeries were reviewed.

Statistical analysis

ICC was calculated to determine both intra- and inter-observer reliability for judgments of fusion using R (The R Foundation for Statistical Computing, Vienna, Austria). The Kruskal-Wallis test was used to determine significant differences in the average preoperative ROM between the three groups. The nonparametric Mann-Whitney U test was used to detect differences in ROM between each pair within the three groups and between the Fusion plus Delayed Union groups and the Nonunion group. The Wilcoxon signed-rank test was used to test differences between pre- and postoperative ROM and joint space proportion in the facets in the Nonunion group and between the pre- and postoperative RMDQ and VAS values in each of the three groups. Fisher’s exact probability test was employed to assess differences in therapeutic effectiveness between the three groups, between the Fusion and Nonunion groups, and between the Fusion plus Delayed Union and Nonunion groups. The level of significance was set at P < 0.05.

Institutional ethics committee approval

This study was approved by the Medical Ethics Committee of our hospital.

Results

Radiological results

The reproducibility of the fusion judgments for each observer was quite high, and the ICC for intra-observer reliability was 0.833 for observer 1, 0.883 for observer 2, and 0.904 for observer 3. The ICC for inter-observer reliability was 0.763, which was above the statistically acceptable threshold of > 0.7. The fusion rate at the 1-year follow-up was 80.1% (109/136 cases) for unilateral facets and 76.5% (104/136 cases) for bilateral facets. The fusion rate at the final follow-up was 90.4% (123/136 cases) for unilateral facets and 86.8% (118/136 cases) for bilateral facets. That is, of the 136 patients, the Fusion group consisted of 109 cases; Delayed Union group, 14; and Nonunion group, 13 (Fig. 2) (Table 1). There was no patient for whom a judgment regarding fusion was changed during the follow-up period. We excluded seven patients from the ROM analysis because their bending radiographs were not taken from the precise lateral direction parallel to the disc space of the fusion segment. All Fusion group patients had ROM less than 2°, and no motion was recognized with the naked eye at the fused level on bending radiographs. In 14 delayed union patients, two had motion of 3.1° and 3.3° respectively at the fused level with slight PS loosening, and the others had no motion 1 year postoperatively. The average fusion period was 25.8 months (18–57 months) in nine of the 14 patients followed up with regular CT scans. Fusion was confirmed only at the final follow-up in the other five patients who did not have regular CT scans. In 13 nonunion patients, the average ROM significantly decreased from 13.0° preoperatively to 4.9° postoperatively (P < 0.01), with screw halos (range, slight to 4 mm). There was no significant change in the proportion of the facet joint space pre- and postoperatively.

Table 1 Preoperative group characteristics classified by facet fusion status (averages ± standard deviations)

The average preoperative ROM of the Fusion, Delayed Union, and Nonunion groups was 9.8°, 10.3°, and 13.0°, respectively, and there was a significant difference between the three groups (P < 0.01). There were also significant differences between the Fusion and Nonunion groups (P < 0.01), Delayed Union and Nonunion groups (P < 0.05), and Fusion plus Delayed Union and Nonunion groups (P < 0.01). That is, the larger the preoperative ROM, the fewer facets fused.

Clinical outcomes

JOABPEQ category scores demonstrated therapeutic effectiveness in 78.2% of the Fusion group, 72.7% of the Delayed Union group, and 90.9% of the Nonunion group for low back pain; 43.4%, 54.5%, and 10.0% of the respective groups for lumbar function; 88.4%, 100%, and 90.9% of the respective groups for walking ability; 64.2%, 63.6%, and 45.5% of the respective groups for Social life function; and 43.2%, 45.5%, and 9.1% of the respective groups for mental health (Fig. 4). There was no significant difference in therapeutic effectiveness between the three groups for any category. Moreover, there was no significant difference between the Fusion and Nonunion groups and between the Fusion plus Delayed Union and Nonunion groups for any category, with the exception of lumbar function (P < 0.05) and mental health (P < 0.05).

Fig. 4
figure 4

Therapeutic effectiveness of facet fusion with the Japanese Orthopaedic Association Back Pain Evaluation Questionnaire category scores. There is no significant difference in therapeutic effectiveness between the three groups for any category. The asterisks indicate significant differences between the Fusion and Nonunion groups and between the Fusion plus Delayed Union and Nonunion groups (P < 0.05). N.S. not significant

The average RMDQ value significantly decreased from 11.3 preoperatively to 4.1 postoperatively in the Fusion group (P < 0.001), from 14.0 preoperatively to 5.5 postoperatively in the Delayed Union group (P < 0.01), and from 9.8 preoperatively to 4.7 postoperatively in the Nonunion group (P < 0.01). Therapeutic effectiveness was 67.4% in the Fusion group, 90.9% in the Delayed Union group, and 50.0% in the Nonunion group. There was no significant difference between the three groups in RMDQ value changes before and after surgery and in therapeutic effectiveness (Fig. 5a).

Fig. 5
figure 5

The Roland-Morris Disability Questionnaire (a) and the visual analogue scale of low back pain (b) and buttock and lower limb pain (c) value changes before and after surgery. There is no significant difference between the three groups in any of categories

The average LBP-VAS value significantly decreased from 52.9 preoperatively to 17.1 postoperatively in the Fusion group (P < 0.001), from 52.5 preoperatively to 22.5 postoperatively in the Delayed Union group (P < 0.01), and from 53.9 preoperatively to 30.1 postoperatively in the Nonunion group (P < 0.01). Therapeutic effectiveness was 71.7% in the Fusion group, 60.0% in the Delayed Union group, and 63.6% in the Nonunion group. There was no significant difference between the three groups in LBP-VAS value changes before and after surgery and in therapeutic effectiveness (Fig. 5b).

The average BLP-VAS value significantly decreased from 68.6 preoperatively to 20.4 postoperatively in the Fusion group (P < 0.001), from 62.9 preoperatively to 26.2 postoperatively in the Delayed Union group (P < 0.05), and from 65.1 preoperatively to 23.5 postoperatively in the Nonunion group (P < 0.01). Therapeutic effectiveness was 78.2% in the Fusion group, 70.0% in the Delayed Union group, and 63.6% in the Nonunion group. There was no significant difference between the three groups in BLP-VAS value changes before and after surgery and in therapeutic effectiveness (Fig. 5c).

Complications and revision surgeries

There were no major complications such as implant failure, neural or vascular injury, or deep wound infection. Revision surgery for complications or poor clinical outcomes immediately after surgery was not required. Five patients required revision surgery for adjacent segment disease (ASD) between 1 and 5.5 years after the first surgery (revision surgery rate, 3.7%). Of these five patients, two were in the Fusion group, two in the Delayed Union group, and one in the Nonunion group.

Discussion

This study clearly demonstrated that nonunion cases after FF achieved good clinical outcomes, likely due to the significantly decreased postoperative ROM, indicating unstable spondylolisthesis stabilization. The fusion rate at the 1-year follow-up was 80%, and 10% of the patients achieved fusion thereafter. The larger the preoperative ROM, the fewer facets fused.

Radiographic assessment and fusion rate

The reported interbody fusion rates ranged from 77 to 100% [5]. However, definitive criteria for diagnosing a successful fusion are controversial, and CT is currently recommended as the most reliable assessment method [29]. Authors who reported high fusion rates used flexion-extension radiographs with a 5° angular motion cutoff [4, 13, 19, 23]. Santos et al. reported a fusion rate of 96% with a 5° cutoff using flexion-extension radiographs and 65% using CT [29]. Furthermore, they indicated that none of the nonunions seen on a thin-section helical CT scan was detected when the 5° cutoff was used. They also indicated that most of the levels assessed as having nonunion demonstrated movement of 2°, with 4° as the highest amount of movement measured. FF was assessed by CT, and a fusion rate of 90.4% was thought to be comparable with or higher than other fusion techniques assessed by CT.

Delayed union cases

In this study, 14 cases (10.3%) achieved fusion after the first postoperative year. Of these 14 cases, 12 had no motion at the fused level, and the other two had motion of 3.1° and 3.3° at the 1-year follow-up. Fay et al. reported that FF eventually occurred in 75.6% of patients after Dynesys dynamic stabilization for DLS [7]. Furthermore, Hoppe et al. reported an FF rate of 74.4% after the stabilization [11]. Among the delayed union cases, some without motion at the fused level just needed longer time for bony continuity to be confirmed on CT, and the others, with no fusion but “stabilized,” eventually achieved fusion as these reports.

Nonunion and preoperative intervertebral instability

This study indicated that preoperative intervertebral instability could result in nonunion. There are few reports regarding nonunion and preoperative intervertebral instability [1, 14]. Kornblum et al. reported, in their PLF study, that preoperative average angulation of 10.1° in the pseudarthrosis group was significantly larger than that of 6.6° in the fusion group [14]. They considered pseudarthrosis to be present if there was no continuity in the fusion mass between the transverse processes on the anteroposterior radiographs or if lateral flexion-extension radiographs demonstrated > 2° of angular motion or > 2 mm of sagittal motion at the spondylolisthesis level. In contrast, Agabegi et al. concluded that the translation degree, angular motion, and lateral disc space height could not predict fusion or development of pseudarthrosis [1]. However, they determined fusion if continuous bone bridging was present between the transverse processes, an angulation of ≤ 5°, and a translational movement of ≤ 3 mm on flexion-extension radiographs. In our study, all cases of fusion that were confirmed by CT displayed no motion at the fused level on a flexion-extension radiograph. Agabegi’s lenient fusion criteria may have included pseudarthroses as fusions.

Postoperative stabilization in the nonunion cases

Even if FF failed, unstable spondylolisthesis was stabilized postoperatively with the average ROM significantly decreasing from 13.0° to 4.9°. Postoperative stabilization was thought to be due to facet joint space change, restabilization, scar contracture, and impact of PS, even if it was loosened. Although facet joint orientation was reported to be significantly correlated to DLS [17], orientation-determining lines drawn through the two midpoints of the most anterior and posterior facet joint spaces did not change before and after FF, and we evaluated the joint space proportion. Although there was no significant change in proportion, we found obvious changes in joint space shape, which were thought to contribute to the stability (Fig. 6). Restabilization was occasionally observed in the natural course of DLS, and disc height loss, osteophyte formation, vertebral endplate sclerosis, and ligament ossification have been implicated in the process [30]. Minamide et al. reported that 34.9% of patients with preoperative instability had disc space collapse and restabilization during the 3-year follow-up period even after minimally invasive microendoscopic laminotomy [21].

Fig. 6
figure 6

Axial computed tomography through the center of the facets of a nonunion case. a Preoperative image. b One-year postoperative image. There is an obvious change of the joint space shape compared to the preoperative image

Clinical outcomes and revision surgery rate of the nonunion cases

Good clinical outcomes were noted even in nonunion cases after FF, likely due to the significantly decreased postoperative ROM, indicating unstable spondylolisthesis stabilization. Fay et al. reported that all patients experienced significant improvement in clinical evaluations after Dynesys dynamic stabilization regardless of whether FF occurred, and that there were no significant differences between patients with and without FF [7]. Ohtori et al. also reported that there were no significant differences between the union and nonunion groups 2 years after PLF [25]. Although the effect of fusion status on clinical outcomes has been controversial, poor outcomes of nonunion cases, even shortly after interbody fusion, have been reported [19, 23]. The poor outcomes may have been due to cage migration or cage-related pain. Although the therapeutic effectiveness of FF with JOABPEQ was low for lumbar function and mental health in the Nonunion group, the other clinical outcomes were as good as the Fusion and Delayed Union groups, and revision surgery for nonunion itself was not required. The revision surgery rate of FF was 3.7% for an average follow-up period of more than 6 years, and ASD was the cause for all revisions. The insertion of cages can result in complications, such as cage migration or cage-related pain, thereby increasing the revision surgery rate of the interbody fusion, which has been reported to be 12.1% [26] and 12.9% [10] at 5 years postoperatively.

Fusion indication for DLS

We found that nonunion cases achieved good clinical results, as well as fusion cases. This poses a clinical question regarding the indication of fusion for DLS. Blumenthal et al. reported that reoperation was performed in 15 (37.5%) of the 40 patients after laminectomy for grade I DLS, despite excluding patients with gross motion of more than 3 mm on flexion-extension radiographs [3]. Minamide et al. reported that19 (7.8%) of 242 patients underwent reoperation due to insufficient decompression and recurrent stenosis caused by disc bulging or instability, even after minimally invasive microendoscopic laminotomy for DLS [21]. We believe that fusion is necessary for unstable DLS to avoid reoperation.

Limitations

This study has several limitations. Firstly, it is not clear whether the 1-year postoperative period was appropriate to assess fusion and to classify groups. Secondly, bone quality was not assessed, which could have implications for fusion rates. Thirdly, the study evaluated mid-term clinical outcomes; further studies are required to clarify long-term outcomes after FF.

Conclusions

This study demonstrated that even nonunion cases after FF achieved good clinical outcomes as well as those after PLF. This is likely due to the significantly decreased postoperative ROM of the slip segment, indicating unstable spondylolisthesis stabilization. FF did not require revision surgery for nonunion itself, and the revision surgery rate of 3.7% was quite low for an average follow-up period of > 6 years. Therefore, FF is useful for DLS management.