Introduction

Several posterior decompression techniques are used for decompression of lumbar spinal stenosis (LSS), but there is neither evidence to recommend one particular technique nor to suggest how extensive the decompression should be [13]. To our knowledge, there are no objective scientific criteria that can guide the extent of decompression needed. Neither radiological parameters, neuromonitoring, nor epidural pressure monitoring have been useful in deciding surgical technique or the extent of decompression needed. Bilateral laminotomy has been compared to a full laminectomy in some trials. One study found superior clinical results after bilateral laminotomy compared to laminectomy [4], but these findings have not been confirmed by subsequent studies [58]. Unilateral laminotomy with crossover has also been presented as a minimally invasive surgical technique, but clinical results have been reported to be equal to bilateral laminotomy [911]. Both bilateral laminotomy and unilateral laminotomy with crossover have gained popularity as midline-preserving methods, and are considered to reduce the risk of iatrogenic post-operative instability, as compared to laminectomy, but scientific evidence is lacking.

Another midline-preserving method is spinal process osteotomy, which was first introduced in 1978 by Yong-Hing and Kirkaldy-Willis [12]. Theoretically, this method may have some advantages over other midline-preserving methods, including better visualization of the spinal canal and the lateral recesses, facilitating a more extensive decompression. Studies of spinal process osteotomy have shown good clinical results [1315], but the effectiveness has not been compared with the other midline-preserving methods. Takaso et al. reported in 2011 [16] that a modified version could be performed with a minimally invasive surgical technique with good results.

While the proposed advantages of posterior decompression techniques that limit the extent of bony decompression or avoid removal of posterior midline structures regarding iatrogenic instability and post-operative low back pain are plausible, the scientific evidence documenting these advantages is lacking. A recently published Cochrane review evaluated 10 trials and 733 patients, treated with different midline-preserving and conventional techniques, concluded that the studies were of very low quality, and that further research is necessary to compare the safety and effectiveness of these decompression techniques [3].

The aim of this study was to investigate the clinical outcome after spinal process osteotomy and compare them to bilateral laminotomy and unilateral laminotomy with crossover.

Methods

Study population

This cohort study is based on data from the Norwegian Registry for Spine Surgery (NORspine). During the given period 31 of 36 (86 %), hospitals in Norway were reporting to the NORspine register. Patients with lumbar spinal stenosis who had midline-retaining decompression in the period from 1/1-2009 to 11/3-2013 at 31 Norwegian hospitals were included. The NORspine register does not record spinal process osteotomy in the register forms. The bilateral laminotomy and unilateral laminotomy with crossover groups were identified from the register from 27 hospitals not performing spinal process osteotomy. In three other hospitals, spinal process osteotomy is performed occasionally, and patients from these hospitals were excluded. The spinal process osteotomy group was identified manually by reviewing the operative protocol for all patients who had surgery for LSS at one large hospital (the Coastal Hospital in Hagevik), performing this type of surgery on a regular basis. Patients missing exact data about the surgical method or who had a unilateral decompression were excluded. Informed consent was obtained from all participants. The registry protocol was approved by the Norwegian board of ethics, REC Central (2014/98).

Patient-reported outcome measures

The patient questionnaires were self-administered and identical at admission for surgery (baseline) and at 3 and 12 months follow-up; they contained established patient-reported outcome measures. At baseline, the forms include additional questions about demographics and lifestyle issues. During the hospital stay data, concerning comorbidity; radiological classifications; American Society of Anesthesiologists (ASA) grade; perioperative complications; operation method; duration of surgery; and hospital stay were recorded by the surgeon.

Primary outcome

Pain-related function was assessed by the Oswestry Disability Questionnaire, Norwegian version 2.0, (ODI) [17], and change in the ODI score was chosen as the primary outcome measure. ODI scores range from 0 (no disability) to 100 (worst possible).

Secondary outcomes

Secondary outcome measures were duration of surgery, length of stay, health related quality of life, measured by the EQ-5D (ranging from −0.59 to 1), EQ-VAS, leg pain and back pain numeric rating from 0 (no) to 10 (worst possible), and a seven point global perceived effect scale (1 = completely recovered, 2 = much improved, 3 = slightly improved, 4 = no change, 5 = slightly worse, 6 = much worse, and 7 = worse than ever).

The NRS pain scales, ODI, and EQ-5D have shown good validity and reliability, and the Norwegian versions of these instruments have shown good psychometric properties [1820].

We also analyzed differences between patients who were excluded and those included in the trial, both at baseline and during follow-up.

Statistics

Propensity score matching (PSM) [21] is frequently used in orthopedic research [22, 23] to reduce selection bias and confounding caused by a non-random treatment assignment. The patients are then divided into subgroups according to their propensity scores, and the statistical analysis is performed within these subgroups. The purpose is to balance prognostic factors in different patients in order. A propensity score is calculated by summarizing a score generated by logistic regression analysis from different baseline parameters to balance groups with respect to factors that may influence clinical outcome.

The Propensity Score was performed by three logistic regression models for each combination of treatment groups, using the following baseline variables: age, gender, smoking, education, hip/knee arthritis, ischemic heart disease, vascular claudication, osteoporosis, hypertension, diabetes, and the number of levels operated on. All variables, except gender, differed significantly between at least two groups in the unadjusted sample. The individuals from each comparison were then stratified into three or five subgroups (three when spinal process osteotomy was involved, due to the smaller number in this group) based on their Propensity Score. Statistical analysis was performed between the PSM subgroups with the two-way ANOVA. Unmatched comparisons between the three treatments groups were made by the one-way ANOVA for continuous variables, Kruskal–Wallis test for ordinal variables, and Pearson’s Chi-square test for categorical variables. A p value <0.05 was regarded statistically significant without adjustment for multiple comparisons.

Results

Of 2935 patients screened, 1531 patients fulfilled the inclusion criteria, and were included in the study. Follow-up response rates varied from 71.4 to 86.4 % (Fig. 1).

Fig. 1
figure 1

Of 2935 patients 1404 patients were excluded. In the included group, there were 103 in the spinous process osteotomy group, 966 patients in the bilateral laminotomy group, and 462 in the unilateral with crossover group

Baseline data

There were small differences in baseline data between patients who were excluded and those included in the study (Table 1). The mean age of the excluded patients [61.9 years (SD 12.0)] was almost 4 years younger (p < 0.05) than those included in the study [65.7 years (SD 10.7)]. Number of levels operated on were 1.3 (SD 0.6) for the excluded versus 1.4 (SD 0.5) for those included (p < 0.05). There were 27 % smokers in the excluded group and 24 % in the included group. Otherwise, there were no differences between excluded and included patients (Table 1).

Table 1 Demographic and baseline characteristics from the patients included vs the patients excluded and the three study groups

We identified 103 patients treated by spinal process osteotomy, 966 by bilateral laminotomy, and 462 by unilateral laminotomy with crossover. There were small differences in clinical scores between the groups at baseline. Patients who underwent bilateral laminotomy with crossover were younger (p < 0.05), and the spinal process osteotomy group had more patients with multilevel surgery (p < 0.05) (Table 1).

Primary outcome

Analysis of primary and secondary outcome measures after 12 months showed no differences between the patients excluded from the study versus the patients included.

The clinical improvement was most pronounced at 3 months, and then slightly reduced at 12 months after surgery for the whole cohort (Table 2; Fig. 2). The mean unadjusted improvement in ODI points at 12 months was 15.2 (SD 16.7) in the spinous process osteotomy group, 16.9 (SD 17.0) in the bilateral laminotomy group, and 16.7 (SD 16.9) in the unilateral laminotomy with crossover group (Table 2). Differences between the three groups were not statistically significant. The adjusted mean improvement in ODI in the propensity scores stratified subgroups was not significantly different.

Table 2 Clinical outcome 3 and 12 months after surgery, unmatched cohort
Fig. 2
figure 2

Mean ODI (95 % CI) at baseline, 3, and 12 months after surgery

Secondary outcomes

There were no significant differences in patient-reported outcome measures between the three surgical groups (Table 2).

The chosen operation method had no impact on self-assessed improvement, which was reported in 84.8 % in the spinous process osteotomy group, 80.8 % in the bilateral laminotomy group, and 80.1 % in the unilateral laminotomy with crossover group. A worse outcome was reported by 8.2, 10.2, and 9.5 %, respectively (Table 3).

Table 3 Global perceived change

When summarizing Table 3, 80.8 % of the included patients report that they are “slightly improved,” “much improved,” or “completely recovered”. If one uses a success criteria that is “much improved” or “completely recovered” 59.5 % of the included patients report clinical success.

A complete overview of the improvement in clinical scores (ODI, EQ-5D, NRS leg pain, NRS back pain, and EQ-VAS) within the global perceived effect scale subgroups is shown in Table 4.

Table 4 Clinical outcome scores 12 months after surgery within the global perceived effect score subgroups

The bilateral laminotomy procedure lasted shorter than unilateral laminotomy with crossover and spinal process osteotomy. Bilateral laminotomy lasted 85 min (SD 37), unilateral laminotomy with crossover 104 min (SD 40), and spinous process osteotomy 121 min (SD 41), whereas length of hospital stay was 2.1 days (SD 2.1) following bilateral laminotomy, 3.5 days (SD 2.4) following unilateral laminotomy with crossover, and 6.9 days (SD 4.1) days following spinal process osteotomy. These findings were significantly different between groups in both the analyses (Table 2).

Complications included accidental dural tears, nerve root injury, wrong level surgery, peri-operative bleeding requiring transfusion, respiratory, or cardiac complications. There were no significant differences in the number of reported complications (Table 2).

We also analyzed the clinical data for the three methods at the single institution, where the spinous process osteotomy group was operated. There were no significant differences in the baseline data or clinical outcome, complications, duration of surgery, or length of stay between the three groups at this particular hospital.

Discussion

In the present study, clinical outcomes were similar in patients treated with three posterior decompression techniques for lumbar spinal stenosis, both regarding improvement and worsening. In total, 80.8 % of the patients reported improvement, and few patients reported a worse outcome (9.8 %).

These findings are in accordance with a previous study from the Spine Tango register [23], in which no differences in the patient-reported outcome measures after 24 months were reported following four different surgical methods (laminectomy with fusion, laminectomy, hemi-laminectomy, and laminotomy). A recent publication from NORspine reported similar results when comparing laminectomy to micro-decompression [24]. Most of the studies evaluating different surgical techniques have a relatively short (less than 3 years) follow-up [48, 11, 23]. In this study, the clinical outcome deteriorated slightly from 3 to 12 months of follow-up. This suggests that longer time for follow-up is warranted. In addition, Overdevest et al. who recently published a Cochrane review on posterior decompression techniques concluded that studies with longer follow-up are warranted [3].

Baseline data regarding pain and function were similar in the three groups. The patients in the unilateral laminotomy with crossover group were younger, and more levels were operated in the spinal process osteotomy group. When adjusting for these differences by the propensity score matching method, we found no differences in primary outcome. Two recent publications concluded that smoking was associated to less improvement after surgery, while increased age was not [25, 26]. In this trial, these factors were incorporated in the propensity score matching, and would, therefore, not confound the comparison of treatment groups.

Although differences at baseline and in outcome measures between the included and excluded patients were small, (Table 1), the included patients were somewhat older, and were operated on at more levels than excluded patients. Both groups reported improved clinical scores after surgery (Table 3). Given these small differences between included and excluded patients, the trial has a high external validity.

When generally accepted definitions of minimally clinical important differences are used [2729], 80.8 % of the included patients reported a clinically meaningful improvement at follow-up. These results are in accordance with previous reports of minimally important clinical differences in spine surgery [27, 2932]. Of the patients, 9.8 % reported deterioration after the surgery; this number is also in correspondence with previous trials [33]. Recent year’s clinical success or substantial improvement has been considered as more important than minimal change by several authors [3336]. In the present study, 59.5 % of the patients altogether had a substantial improvement, when the success criteria “completely recovered” or “much improved” in the global perceived effect scale were applied. No major differences between the surgical groups were found in the global perceived effect scale (Table 3).

A limitation of the present study is that all the spinal process osteotomy procedures were performed at one single orthopedic educational institution. This is considered to be the only high-volume hospital, where orthopedic consultants and residents are performing spinous process osteotomy on a regular basis. This may lead to selection bias and reduce the validity for the findings concerning process osteotomy, even in the propensity score matched cohorts. This single institution had longer duration of surgery and hospital stay, also when performing bilateral laminotomy and unilateral laminotomy with crossover, probably due to local traditions. The effectiveness of all procedures should, therefore, preferably be assessed in a high-quality clinical trial. The NORspine register collects clinical data from follow-up at 3 and 12 months. Not having data for 24 months follow-up, in comparison to the Spine Tango-based paper by Munting et al. [23], is, therefore, another limitation of the present study.

Unilateral laminotomy with crossover is by some considered to be the most technically challenging surgical method with a longer learning curve, and one might expect a longer duration of the operation and a higher rate of complications. The present study only partly confirms these assumptions. Although unilateral laminotomy with crossover lasted longer than bilateral laminotomy, the three operations had similar complication rates. On the other hand, complications may have been underreported [37]. Since unilateral laminotomy with crossover is considered to be more technically challenging, possibly the most experienced surgeons perform this procedure. However, data on surgeon’s experience are not recorded in the NORspine register.

The bilateral laminotomy procedure had shorter duration of surgery compared to the unilateral laminotomy with crossover and spinous process osteotomy procedures. The patients who had a bilateral laminotomy procedure also had a shorter hospital stay than the unilateral laminotomy with crossover group and the spinous process osteotomy group. Despite the fact that all patients in the spinous process osteotomy group were treated in a single institution, which might have influenced the length of hospital stay, the large difference suggests that the spinous process osteotomy is more expensive.

The main goal of surgery for LSS is to reduce pain and disability by relieving the actual stenosis in the affected level of the spine. The amount of bony and ligamentous structures that is removed varies between the posterior decompression techniques applied in this study. It is, however, still unclear how much of the bony and ligamentous structures outside and inside the spinal canal needs to be removed to achieve satisfactory long-term results with minimal iatrogenic complications. To date, we are lacking objective criteria of how extensive the surgical procedure needs to be.

Conclusion

In this prospective study from the Norwegian Registry for Spine Surgery, no differences in clinical outcome 12 months after decompression of the spinal canal by spinal process osteotomy, bilateral laminotomy, or unilateral laminotomy with crossover were found. Bilateral laminotomy had the shortest duration of surgery and the shortest length of hospital stay. Surgical technique does not seem to affect clinical outcome after three different midline-retaining posterior decompression techniques. A randomized study is needed to assess the relative efficacy of these surgical procedures.