Introduction

The average life expectancy of the population is steadily increasing, and best estimates indicate that this will be accompanied by a marked increase in the incidence of degenerative disorders seen in clinical practice [7, 38]. Degenerative disorders of the lumbar spine are often characterised by a compression of the neural elements, resulting in radicular pain and neurogenic claudication, weakness, numbness/tingling and (often) pain in the lower back or buttocks [13, 37]. These complaints not only cause a great deal of discomfort, but also marked restrictions in mobility and function. Together with the inevitable age-associated decline in musculoskeletal function [6, 17], this results in impairment in the patient’s ability to perform activities of daily living and a threat to his/her independence. Since the symptoms are largely due to mechanical (structural) obstruction [39], in all but the most moderate of cases the problem is best treated surgically, with decompression of the affected segment(s) [3, 4, 41].

Long-term outcome studies are essential for all surgical procedures to ascertain whether results deteriorate (or improve) in the long-term and, if so, to identify factors that might influence such change over time. As far as decompression surgery is concerned, the literature reports a general decrease of about 10–20% in the proportion of good outcomes over time, from around 67–88% successes in the initial year after surgery to approximately 52–70% after 5–8 years [4, 19, 21, 33, 36]. Katz et al. [22] reported that after an average 8 years follow-up 33% of patients had severe back pain, 53% were unable to walk two blocks, and 25% were not satisfied with the results of surgery; 23% of them had undergone re-operation. The severity of the spine-related symptoms was a stronger correlate of physical functional status at the time of follow-up than age or non-spinal comorbid conditions. Similar outcomes after 4–12 years were reported by others [4, 8, 21]. These are not particularly impressive long-term results.

One of the problems confronting patients with such spinal disorders is that although decompression surgery addresses the immediate mechanical obstruction, it cannot halt the ongoing degenerative process. Degenerative changes of the spine are often accompanied by instability of the affected motion segment [12, 42] and this, coupled with any iatrogenic instability as a result of the decompression procedure itself, may lead to impaired mechanical integrity of the spine and the recurrence of symptoms [20, 32]. Recent years have seen a trend towards the use of “less destructive” techniques in lumbar spine decompression, in an attempt to avoid the (potential) instability associated with extensive bony resection [9]. However, the long-term impact on symptoms, function and re-operation rates has not been studied extensively.

The aim of the present study was to investigate the 5-year outcome of decompression without fusion in a large group of patients who had participated in a randomised controlled trial (RCT) comparing different post-operative programmes of rehabilitation (self-management versus two different types of active physiotherapy) [25]. Consistent with the findings of a comparable study of structured rehabilitation in a similar patient group [1], our 2-year follow-up results suggested no notable benefits of physiotherapy compared with self-management [25]. As such, for the purposes of the present investigation, the patients were considered as one group.

Methods

Study admission criteria

The inclusion criteria for the study were: age over 45 years; diagnosis of degenerative spinal disease with compression of the neural elements [in association with either hard tissue (osteophytes, calcification, etc.) or soft tissue (herniated disc material)] as ascertained from the medical history, clinical examination, conventional radiography and MRI/CT of the lumbar spine, with an indication for decompression surgery without fusion (if fusion was subsequently deemed necessary, intraoperatively, the patient was excluded from further analysis); failed conservative therapy; willingness to comply with any programme to which randomly assigned (for the original RCT), attendance for all necessary follow-ups and completion of postal questionnaires; a good understanding of written and spoken German.

The exclusion criteria were: previous spinal fusion (although other spinal surgery carried out more than 6 months previously was acceptable); disorders preventing active rehabilitation.

Recruitment of patients

Patients were recruited into the trial from the authors’ hospital, a non-profit making (foundation) orthopaedic clinic. The hospital operation list for the forthcoming 2 months was examined on a weekly basis, and consecutive patients who appeared to satisfy the main admission criteria were sent a letter inviting them to participate, along with an information sheet about the study and a questionnaire booklet (see later). All patients were contacted by phone a week later to enquire about their interest in participating, to check the inclusion/exclusion criteria, and to address any queries in connection with the study. Those who agreed to participate were invited to attend the research department on the day of admission to clarify any remaining issues and sign the informed consent form.

The study was approved by the local university ethics committee.

Surgical and post-operative procedures

Four experienced surgeons operated on the patients. Decompression surgery was carried out using a minimally destructive approach. Briefly, using a posterior midline approach to the laminae, the spinal canal was entered by removal of the ligamentum flavum. Undercutting laminotomy of the caudal and rostral laminae and, where necessary, partial medial facetectomy and removal of disc fragments were performed. Patients were allowed to leave bed the day after surgery. To promote wound healing, no specific treatment or rehabilitation was carried out in the first 6–8 weeks, but the patients were encouraged to walk and move around as normal. Between 2 and 5 months post-operatively, the patients completed a rehabilitation programme comprising either self-management (patients were simply instructed to “keep active”), physiotherapeutic “spine segmental stabilisation exercises” or mixed physiotherapy techniques. The results up to 2 years, indicating no significant between-group differences, have already been published [25]. Examination of the data at 5 years confirmed no significant benefit of physiotherapy compared with self-management, supporting an analysis of all patients together in the present report.

Administration of the outcome questionnaires

Questionnaires were completed before surgery, and at 2 months, 5 months, 1 year, 2 years and 5 years after surgery. The patients filled in the questionnaires at home, to achieve consistent conditions for completion, and returned them by mail. Questionnaires were always checked immediately and patients were requested to complete any missing information. Non-responders during follow-up were contacted by study personnel not involved in the patients’ care until the questionnaire was returned or the patient explicitly stated that no questionnaire would be returned.

Questionnaires

The questionnaire variables enquired about before surgery and at follow-up are shown in Table 1.

Table 1 Key variables enquired about in the patient questionnaires

For all patients, re-operations (co-interventions) were enquired about in the follow-up questionnaires up to 5 years post-operatively and also checked in the hospital information system (or by communication with the patient). At the longer-term follow-ups, in addition to completing the serial outcome instruments, patients rated the global effectiveness of the treatment they had received (“How much did the operation and post-operative rehabilitation help?”) on a 5-point Likert Scale; responses were then dichotomised into “good” (helped a lot, helped) and “poor” (helped only little, did not help, made things worse) [30].

Statistics

As most of the continuous variables under investigation were approximately normally distributed, parametric statistics were employed for the analyses. Changes in continuous variables were assessed using analysis of variance with repeated measures (time of assessment). Independent t tests were used to examine differences in 5-year outcome scores between those who did and those who did not undergo re-operation in the 5-year period. Relationships between variables were analysed using Pearson coefficients (parametric data) or Spearman rank correlation coefficients with correction for ties (non-parametric).

Statistical significance was accepted at the p < 0.05 level.

Results

Study sample

A total of 159 patients were included in the original trial [25]. Ten had died by the 5-year follow-up. Of those still alive, 143/149 (96%) returned a 5-year questionnaire and these comprised the group under study in the present investigation. Their mean ± SD age was 64 ± 11 years and 92/143 (64%) were men.

Re-operation rate

By the time of the 5-year follow-up, 34/143 patients (24%) had undergone re-operation, at an average of 29.1 ± 20.9 months (range 0.3–59.6 months) after the primary surgery (Table 2). In 30/34 (88%) re-operations, surgery involved the same segment as in the initial surgery; in 2 (6%) cases it involved the adjacent segment, and in 2 (6%) cases, a different segment. 8/143 (5.6%) underwent a third operation.

Table 2 Summary of further operations after the initial decompression surgery

Outcome in patients with no further surgery up to 5 years post-operatively

The changes in each of the outcome scores over time for the group that did not undergo further surgery are shown in Fig. 1.

Fig. 1
figure 1

a–d Changes in (a) leg pain, (b) low back pain, (c) Roland Morris disability and (d) general health throughout the course of the study, up to 5-years’ follow-up in patients who did not undergo any further surgery

Leg pain intensity

From pre-operative to 2 months post-operative, there was a significant reduction in mean leg pain of 3.6 ± 2.5 points (p < 0.0001), after which there was no significant further change up to 5 years follow-up (Fig. 1a).

Back pain intensity

The mean score for back pain intensity showed a significant reduction of 2.2 ± 2.5 points from before surgery to the first follow-up at 2 months (p < 0.0001) and a small, statistically significant (p = 0.02), but clinically non-relevant, increase of 0.5 ± 2.1 points from 2 months to 5 years post-operatively (Fig. 1b).

Roland Morris disability score

There was a significant decrease of 2.7 ± 5.2 points in the mean Roland Morris (RM) score from before surgery to 2 months post-operatively (p < 0.0001) and a further decrease of 2.5 ± 3.6 points up to 5 months post-operatively (p < 0.0001) (Fig. 1c). After this, no further changes were seen up to 5 years post-operatively.

General health

The general health score increased slightly, but significantly, by 0.7 ± 2.5 points from pre-operative to 2 months post-operative (p = 0.006) and then showed no further change up to 5 years (Fig. 1d).

For each of the above scores, there were highly significant correlations between the scores at the final follow-up after 5 years and each of the previous follow-ups (Table 3).

Table 3 Correlation coefficients describing the strength of the relationship between scores at 5 years postoperatively (the last follow-up) and at the previous follow-up time points (all coefficients significant, p < 0.0001)

Frequency of back/leg trouble

Pre-operatively, the frequency of back/leg trouble was 1%, never; 7%, occasional; 29%, often; and 63%, constant. Two months post-operatively, the frequency decreased markedly (p < 0.0001) to 10%, never; 48%, occasional; 34%, often; and 8%, constant. After 5 years, there were somewhat more patients in the two “extreme” categories (never and constant): 25%, never; 37%, occasional; 18%, often; and 20%, constant. There was a significant correlation between the responses at 2 months and 5 years follow-up (ρ = 0.39, p < 0.0001), with the frequency of back/leg trouble differing by no more than one category (i.e. was just one category higher or lower) in 86% patients.

Use of pain medication

Pre-operatively, the frequency of pain medication use was 23%, never; 31%, occasional; 17%, often; and 29%, constant. Medication use declined significantly after surgery (p < 0.0001) and was similar at both the short-term and long-term follow-ups: 2 months post-operatively, 66%, never; 18% occasional, 10% often, and 6% constant use; after 5-year, 60% never, 22% occasional, 9%, often; and 9%, constant.

The responses at the two time points showed a moderate, significant correlation (ρ = 0.49, p < 0.0001), and in a high proportion of patients (87%), medication use did not vary by more than one category between 2 months and 5 years.

Global outcome of treatment

When the global outcome measured on the 5-point Likert scale was dichotomised into “good” and “poor” (see “Methods”), the proportion of patients reporting a “good” global outcome reduced from 86.1% at 2 months to 78.7% at 5 years follow-up (p = 0.057). 87.0% patients gave exactly the same global outcome rating (i.e. either good or poor) at both 2 months and 5 years; 2.8% gave a better rating at 5 years (poor became good), and 10.2% a worse rating (good became poor).

Final outcomes in patients who underwent re-operation during the 5-year follow-up

Compared with the patients who were not re-operated, the patients who had undergone further surgery during the course of the study showed significantly less improvement in Roland Morris Disability and leg pain scores from baseline to the 5-year follow-up [reduction in Roland Morris score: no further surgery, 5.6 ± 6.2 points vs. further surgery, 1.8 ± 4.8 points (p = 0.0012); reduction in leg pain: no further surgery, 3.6 ± 2.9 points vs. further surgery, 2.2 ± 3.6 points (p = 0.018)] (Fig. 2). There were similar tendencies in relation to the degree of improvement in back pain and general health, but they failed to reach significance (p > 0.05).

Fig. 2
figure 2

(a) Leg pain and (b) Roland Morris disability scores before surgery and at 5 years follow-up in patients who did not undergo re-operation (“no reop”; black bars) and those who did undergo further surgery (“re-op”; grey bars) during the course of the study. For both outcomes, there was a significant difference between the groups (no re-op vs. re-op) for the reduction in score after 5 years (p < 0.05)

Good global outcomes were reported by 27/34 (79%) of the group after 2 months, and just 22/34 (64.7%) at the follow-up 5 years after the first surgery.

Discussion

Main findings and methodological considerations/limitations

The present study sought to quantify the long-term outcome of lumbar spinal decompression surgery in patients with degenerative spinal disorders. The main findings were that the mean levels of pain and disability reported at 2 months post-operatively remained fairly stable up to 5 years. However, almost one in four patients required further surgery on the lumbar spine in the 5-year study period and those that were re-operated had significantly worse final outcomes, in terms of leg pain and disability, than did those who had no further surgery.

The group under study did not include all eligible patients undergoing decompression within our spine unit, but instead comprised patients who had offered their voluntary participation in a RCT [25]. Nonetheless, adequate external validity of the findings was suggested by the fact that the medical history, clinical status and treatment outcomes of the study patients were broadly comparable to those of the typical patient undergoing decompression surgery described in the literature [10, 31] and of consecutive patients from our own centre [23], documented systematically in connection with the Spine Tango Spine Registry of the Spine Society of Europe [34]. The only exception to this was that the study attracted a predominance of male volunteers, with slightly lower baseline symptoms than the “typical patient”. This type of gender and symptom severity bias has frequently been reported for trials in other areas of medicine [5, 16, 24]. Whilst the reasons for it are not clear, they include suggestions that men and women may react differently to presentations of risk–benefit analysis or that women may feel less able to release themselves from their homemaking duties to participate in trials [18]. Studies involving elderly patients are notoriously difficult to carry out, and the present study was no exception. The repeat surgery and deaths during follow-up were just some of the problems that beset the current investigation. Nonetheless, a highly respectable follow-up rate (96% of those still alive) was still achieved up to 5 years post-surgery.

In any studies of surgical outcome, deciding on the most appropriate method of dealing (in the analysis) with patients who undergo re-operation is never straightforward. In the present study, when examining the course of change in the main outcomes, we decided to exclude the re-operated patients, on the basis that if their (presumably more severe) symptoms prior to the second intervention had been remedied by re-operation, they could have falsely elevated the outlook after 5 years in the whole group. In practice, however, when examined as a separate group, even after re-operation they displayed less improvement on their pre-operative status compared with the patients who were not re-operated (see later). We could not depict the whole course of change over the 5 years in the re-operated group, because the precise time of re-intervention differed for the individual patients, rendering it impossible to calculate meaningful point estimates of the group’s level of pain or disability at the fixed follow-ups. Hence, we elected to present only their results for the final follow-up, as compared with baseline values (i.e. before the first index operation). Interestingly, a secondary analysis including all patients together (re-operated and not re-operated; actual results not shown) did not substantially alter the overall course of change shown in Fig. 1.

Consistency of changes over time

On an individual basis, the outcomes recorded 5 years post-operatively showed a relatively high, significant correlation with those recorded at the earlier follow-ups. In other words, the early outcome was a reasonably good predictor of the longer-term outcome. This phenomenon has been reported before by ourselves [26, 29] and other groups [15, 31] for patients with degenerative disorders followed up for up to 2 years, and by Amundsen et al. [3] for patients followed up even longer (10 years). Since surgery for degenerative diseases typically serves a “mechanical” purpose, aiming to relieve pain by removing a physical obstruction (e.g. by decompression in the case of spinal stenosis), as far as the main symptoms are concerned, the success (or otherwise) of the operation should be evident relatively early [29]. It may be expected that a somewhat longer time would be required for the final improvements in disability or function to manifest themselves, but clearly even here the early results are able to herald the longer-term outcome. This suggests that we should pay close attention to the early failures and not simply expect time to heal. Timely and evidence-based re-interventions (either conservative or operative) should be implemented for unimproved patients in an attempt to forestall the development of chronic disability and its accompanying psychological sequelae.

Long-term outcome after decompression

In keeping with other reports in the literature, the proportion of overall “good” outcomes recorded in the present study declined slightly over the 5-year follow-up period, though to a lesser extent compared with earlier studies [4, 19, 21, 22, 33, 36] and still remaining fairly good after 5 years (79% with a “good” global outcome after 5 years). It is not possible to ascertain whether the less marked deterioration over time in the present study was the result of the modern-day, less “destructive” surgical techniques used or was due to unidentified differences between studies in the patient populations examined.

In view of the observed changes over time in the serially measured outcomes (Fig. 1), the slightly poorer global outcome ratings at 5 years may have been the result of the slight, but significant, increases in pain (especially LBP) over time, and the fact that almost 40% of the patients declared that they had back trouble either “often” or “constantly”. Since decompressive surgery serves only to eliminate the immediate obstruction and does not reverse or even halt the degenerative process per se, it is only to be expected that the degenerative changes will progress with time. These should not necessarily be considered as “failure” in relation to the index procedure, though they obviously need to be acknowledged in explaining the likely long-term outcome to the patient.

The need for re-operation was almost certainly a factor that rendered the global outcome ratings as less positive over time in the group that underwent further surgery. The re-operation rate in the present study was 24%, which was slightly higher than the 5-year re-operation rate reported by Jonsson et al. (18%; [21]) and comparable to the 8-year repeat surgery rate reported by Katz et al. (23%; [22]). Analogous to these other reports, the majority of repeat surgeries in the present study also occurred at the same level as the primary surgery and involved either further decompression or fusion of the segment. The average time of re-operation after the primary surgery was approximately two and a half years, which would certainly be commensurate with a gradual recurrence of further symptoms as a result of ongoing degenerative changes in the spine. However, the range of values was quite wide and fairly evenly spread across the 5-year follow-up period and, on an individual basis (results not shown), whilst some patients had a clear increase in symptoms in the period preceding their re-operation, others appeared to have simply waited with a relatively poor result from the outset. Either way, it was clear that re-operation—whether due to recurrence of symptoms or as an attempted remedy for an initially poor result—generally resulted in a less good outcome in the end. This concurs with previous reports that the probability of a good surgical outcome generally decreases with each successive surgery [40].

It was interesting to note that, at the final follow-up in the present study, 5 years after surgery, patients experienced (on average) moderate levels of disability in everyday activities, not dissimilar to the levels recorded using the same instruments in patients with non-specific chronic LBP [28]. Similar values have been reported in other long-term investigations of patients after decompression surgery [22, 31]. Hence, this may well represent the level of symptoms and function that these patients finally have to learn to cope with, if there is no clear indication for further surgery. The exploitation of this valuable information for the purposes of patient education in relation to the likely outcome of surgery should at least lead to better informed and potentially more satisfied patients in the long run [27], whilst pain management programmes, of the type commonly used in patients with non-specific chronic LBP [2], might be considered for those with the most severe, persistent problems.