Introduction

According to the recent studies [31, 34, 35], the degenerative disc disease (DDD) occurs even in asymptomatic patients, but for about 10% of the population it results in permanent chronic pain and disability. The frequency of lumbar disc herniation (LDH) amounts from 5.1 to 3.7% with the male and female population, respectively, with the majority of 90–97% displaying at L4–L5 and L5–S1 levels [1, 25, 48]. Therefore, long lasting pain and functional impairments undergone by the patients as well as the varied treatment opportunities have led to situation, when the operation of the ruptured lumbar intervertebral disc represents a special challenge.

Lumbar microdiscectomy (MD) and open-discectomy (OD) are commonly performed surgical procedures for that type of patients. Mixter et al. were the first to describe pitfalls of laminectomies [38] and later in 1978 Caspar and Williams [29, 56] initially reported the technique of lumbar MD, which slightly contributes to a relatively smaller incision, less soft tissue damage, therefore reduced postoperative pain, early discharge from hospital and return to work compared to OD.

Despite various underlying factors, the success rate depends commonly on the appropriate screening for those treatment modalities, because pain and disability may persist regardless of positive radiological changes, consequently it should not be restricted only to radiological diagnostic techniques [2]. The cost of unsatisfactory outcomes is crucial both for the patient in terms of health-related life quality and to providers of health service in terms of financial costs and further treatment [11, 18].

Along with the clinical examinations, computed tomography (CT) and magnetic resonance imaging (MRI), the paraclinical diagnostic techniques are frequently applied due to the additional standardized screening of patients for those surgical interventions [4, 16, 21]. The generic measurements are broadly applicable and therefore employed across patient population, as they could be cheaply and lightly used for the additional screening. However, the SF-36 [52, 58] is recommended for the assessment of general health status, and more spine-specific measures are further recommended. The accomplished study of the Short Form Health Survey (SF-36), Oswestry disability and visual analogue scale (VAS) for pain was performed due to their effectiveness in preoperative screening of the patients with LDH disease in comparison with data of the control group (CG) as well as in postoperative follow-up assessment with two operating techniques.

Patients and methods

Patient population

Within the prospective observational cohort study, there were two populations of the clinical subjects complaining of low back pain and sciatica at the Kaunas University of Medicine, Department of Neurosurgery reviewed. The study sample examination was done between June 2003 and May 2007, under permission from our local ethics committee (No.BE-2-31). The patients’ group predisposed to surgery was divided into two quantitatively equal subgroups (MD and OD). There were no special criteria for that sampling provided.

Inclusion and exclusion criteria

One hundred patients with the LDH disease were recruited for the present study guiding to the following criteria: (1) chronic pain, occurring daily, for at least 3 months, and at least 20 h a day, refractory to >6 weeks of conservative treatment; (2) chief complaint of pain and/or numbness in the lumbar spine, buttock, and/or lower extremity; (3) age >21 years and <76 years; (4) duration of current episode <16 days (judged from the patient’s self-report); (5) symptoms extending distal to the knee (judged from the pain diagram); (6) stiffness in the lumbar spine (judged from segmental mobility testing); (7) signs consistent with nerve root compression, including any one of the following: (a) reproduction of low back or leg pain with straight leg raise <45°; (b) muscle weakness involving a major muscle group of the lower extremity; (c) diminished lower extremity muscle stretch reflex (quadriceps and Achilles tendon); (d) diminished or absence of sensation to pinprick in any lower extremity dermatome; (8) MRI or CT demonstrating anatomical unilateral LDH correlating with the patient’s symptoms.

The next hundred of the CG’s subjects, the majority of which belonged to the Kaunas University of Medicine Hospital nursing personnel experiencing milder low back pain symptoms, were involved into the present study according to the following criteria: (1) chronic pain, occurring after physical chores, for at least 3 months, susceptible to conservative treatment; (2) chief complaint of pain and/or numbness in the lumbar spine; (3) age >20 years and <65 years; (4) no symptoms extending distal to the knee (judged from the pain diagram); (5) reproduction of low back and/or leg pain with straight leg raise >45°.

The patients were ineligible if they met further specification: prior lumbar surgery, segmental instability, vertebral fractures and spinal infections, other types of DDD, tumors, and pregnancy.

Outcome measures

Subjects were asked to complete a screening questionnaire that combined demographic characteristics with information about previous, present history and current medication. Physical examinations of the preoperative patients included motor, sensation, reflexes, degree of pain-onset by the straight leg raising test (Laseque symptom), and CT or MRI. The motor function was determined using a manual muscle test and results were classified as normal, good, fair, poor, trace and zero. Sensation was judged according to whether or not there were some hypaesthesic or hypalgesic changes. Consumption of analgesics was classified into several categories: simple analgesics, NSAIDs, mild opioids, major opioids, antidepressive, and antianxiety drugs.

Specific outcome tools at the baseline and follow-up examination periods

One of the most frequently applied questionnaires for the health-related quality of life evaluation in spinal pathology is SF-36 [47, 59]. The SF-36 questionnaire is a multipurpose, short-form health survey with 36 questions and it yields an 8-scale profile, representing functional health: physical health component (PCH) and psychological status: mental health component (MCH) equally.

The more specific measure—degree of disability, expressed as Oswestry Disability Index (ODI), was assessed guiding to the Oswestry Low Back Pain Disability Questionnaire [14, 15].

Intensity of low back pain and sciatica was scored on the visual analogue scale (VAS) and obtained in a standard fashion. Relevance, validity, and reliability of the VAS are commonly accepted in the area of low back and leg pain [5, 22].

The postoperative evaluation was conducted in nearly the same manner as the preoperative examination. In that part of the survey, patients were asked to rate their satisfaction as positive, satisfactory and negative. In these subsets were consumption of analgesics, specific health-related quality of life questionnaires for the disability and pain intensity measurement included.

Follow-up examinations on the MD and OD patients with one level LDH disease were performed at 1/2, 1, 2, 3, and 4 years after surgery. The imaging examination was carried out whether the patient had any pathological conditions agreeably to accurate functional and neurological evaluation. The essential data were collected by postal questionnaire. Non-responders received reminders, contrarily they were telephoned.

Surgical procedures

Fifty percent of patients underwent conventional OD [38]. The large majority of consecutive patients were surgical candidates for a limited posterior discectomy approach. Due to the lumbar disc localization type, the procedure has been done through the interlaminar space, whereas medial facetectomy was performed rarely, when the medial facet was clearly impinging on the nerve root.

Another fifty patients were scheduled for the MD surgical procedure following the Caspar’s technique [29, 56]. During the study period, only two spinal surgeons with adequate training and expertise in performing OD and MD participated and the preoperative diagnosis was confirmed in all cases.

Statistical methods

According to the one-way ANOVA test, normal and patient population groups concerning their age and sex were proved homogenous. Data were expressed as means ± SE. Statistical significance of the difference between the means was performed with Student’s independent test, and due to assumption of abnormal distribution of variables, there were applied non-parametric tests (Mann–Whitney). Significance accepted at p < 0.05. The statistical analysis was performed using SPSS version 10.0.

Results

At the baseline, the 200 participants consisted of 100 LDH cases and 100 of CG’s subjects. The age of patients was 43 ± 1 years (range 21–76), whereas in the control group it made up 41 ± 1 years (range 20–65). Due to our study patients with lumbar disc medial and lateral herniations were included. The lower two segments composed 94.2% of the majority (Table 1).

Table 1 Initial data of the patient population

The objective examination of the preoperative patients showed weak lower extremity muscle function. For example, the moderate muscle dysfunction as well as reduction of sensory and reflex function was typical for the vast majority (75–83% of patients).

A trend toward increased satisfaction levels in means of the entire patients (40%), combining MD (45%) and OD (36%) groups was observed. Less favorable mean results were reported by 50% of all, 45% in MD and 52% in OD group patients, negative by overall, and MD subjects rated 10% while OD group rated 12%.

Evaluating the return-to-work status: 64% of overall, 60% of MD and 64% of OD patients returned to work; 10% of all, 10% of MD and 6% OD patients were retired; 26% of all, 30% in OD, likewise in the MD group have changed their qualification or work.

Thus, the consumption of analgesics for all patients drew up 21%, 43% for the MD and 57% for the OD group, accordingly. The opioids were administered for the 5% of patients and slightly increased in the OD group.

By the first 2 years of follow-up period, the failed back surgery syndrome (FBSS) along with reherniation comprised 9%: 5% in adjacent and 4% in the same level as before the treatment alone in the OD group.

According to the SF-36 questionnaire the PHC varied as follows: in the CG it made up 35.23 ± 1, preoperative 12.8 ± 1 and postoperative patients 26.21 ± 1.54. Change among pre- and post-treatment groups was 51.2% (Fig. 1a). The results of MHC differed closely in the same manner. In the CG it made up 35.57 ± 1, preoperative 18 ± 1 and postoperative patients 25.42 ± 2. Change among pre- and post-treatment groups comprised 30%.

Fig. 1
figure 1

a Statistically significant differences (p < 0.02) in CG of PHC and MHC scores compared with pre- and postoperative assessments among all subgroups. b, c Bar graphs of norm-based PHC and MHC scores compared with pre- and postoperative assessments at ½, 1, 2, 3, 4 years of follow-up time points. Compared post- to preoperative scores—p < 0.05 with exception to ½ and 1 year assessment (p > 0.05). Compared postoperative to the CG—p > 0.05 at all follow-up time periods, with exception to PHC by the first and fourth year (p < 0.05), as well as to MHC by the second year of follow-up (p = 0.03). d The SF-36 PHC and MHC profiles (means ± SE) for both surgical treatment modalities (bars) compared with CG values and preoperative ones. Except among OD and MD patients (p > 0.05), the statistically significant differences were found in all subgroups (p < 0.05)

Further, the postoperative patients were grouped due to follow-up periods. A trend toward improvement in the PHC and MHC comprised the positive changes by all follow-up time points (Fig. 1b). The same positive mutual tendency was observed by the second year of study. However, the lower values of MHC were rated at ½ and 1-year periods (Fig. 1c).

Statistically insignificant results of two standard operating techniques with advantage to MD were defined (Fig. 1d). The PHC’s mean values among preoperative and patients after OD comprised positive change of 50% and after MD 53%. The MHC’s mean values among preoperative and patients after OD comprised positive mean change of 28.4% and after MD 30.2%.

Due to the Oswestry Disability Questionnaire in patients grouped by follow-up period, the greatest mend was noted by the second year (Fig. 2a). Comparing the statistically significant results of both operative methods with the preoperative, the positive changes made up 62% for the OD and 65% for MD (Fig. 2b).

Fig. 2
figure 2

a Statistically significant differences (p < 0.05) showing the ODI scores (means ± SE) in the CG compared to pre- and postoperative assessments at ½, 1, 2, 3, 4 years of follow-up periods, except among the postoperative and CG (p > 0.05) by the second year. b Comparison of postoperative statistically significant Oswestry results (means ± SE) in MD and OD versus preoperative and CG subjects (p < 0.05)

The preoperative pain according to the VAS self-reporting scale showed to be stronger in the leg (Fig. 3a, b) and conversely postoperatively. In addition, the pain intensity in both areas was tested for correlation between variables in the Oswestry and the SF-36 questionnaires. The moderate correlations were observed among ODI in the low back (R = 0.42, p = 0.022) and leg (R = 0.46, p = 0.015), likewise VAS pain intensity in the leg correlated moderately with PHC (R = 0.52, p = 0.015). A strong correlation was received from ODI and PHC and MHC at the second year of follow-up time (R = 0.98, R = 0.77, R = 0.72, p < 0.02). The poor correlation, however, statistically significant (p < 0.05) was obtained from the rest variables.

Fig. 3
figure 3

a, b The statistically significant (p < 0.05) VAS changes (means ± SE) in the low back and leg compared between the pre- and postoperative ones at all follow-up assessment time points. Statistically insignificant (p > 0.05) changes in both pain areas compared to the CG. c Bar graph delineating CG, pre- and postoperative (MD and OD methods) VAS pain outcomes (means ± SE) in both pain areas. Statistically significant differences were found in all variables (p < 0.05), except among MD and OD subgroups (p > 0.05)

Ultimately, the data of both operating methods were compared with preoperative and CG ones. The significant results among pre- and postoperative patients after OD involved the change of 50.2% in the low back and 68.8% in the leg. Slightly higher values were observed between pre- and postoperative patients after MD amounting 60.9% in the low back and 74% in the leg (Fig. 3c).

The minimum clinically important difference (MCID) for the PHC in the OD patients was 12.6 and MHC was 7.2 points. In the MD patients, it made up 14.4 for the PHC and 7.7 for the MHC. The MCID for the ODI in the OD group was 20 points and 23.4 in the MD group. Finally, the MCID for the VAS in the low back and leg in the OD patients composed 2.8 and 5.8, whereas in the MD subjects 3.5 and 6.3, respectively.

Discussion

Despite the multitude debates on the optimal treatment for LDH disease, the effectiveness of surgery varies from 49 to 90% [24, 42, 57], the results of which are observed mostly in short-term study samples with the lack of mid-term and long-term clinical data.

The aim of our study was twofold: firstly, to compare the pre- and postoperative results at mid-term follow-up periods along with data of the CG; secondly, we evaluated the effectiveness among OD and MD surgical subgroups, as it was expected the OD to be more muscle damaging.

Comparing some other randomized controlled studies [27, 28, 32, 45] dealing with the outcomes of patients underwent the standard OD and minimally invasive approaches with mid-term follow-up. The strong evidence of OD providing faster relief than conservative treatment for carefully selected LDH patients with sciatica should be amplified. However, it remained broadly similar in OD and MD subgroups.

However, design of the Weinstein’s SPORT [54] provides an opportunity to compare over time the randomized trial results to the simultaneously involved observational cohort, with small absolute differences. The latter was relatively more symptomatic and functionally impaired.

Analyzing the SF-36 and Oswestry data in the observational cohort compared to our surgical candidates, the physical function and ODI tend mostly to improve during the first 6 months postoperatively and the achieved mutual outgrowth stability during the next 4 years.

Judged by the secondary outcomes, the mutually analyzed satisfaction level and return-to-work status in the SPORT were ascendant in favor of surgery, which combined to our study exhibited almost the same tendency, with exception to return-to-work of 84.4% against our 64%.

In general, comparing the SF-36 data with other studies, almost similar results of the PHC (from 47 to 52% of positive change) were reported by Gerszten et al. [19, 55] in chronic low back pain patients undergoing intradiscal electrothermy procedure at 1–2-year follow-up time points. The total mean change of 42.2% at ½–1-year study periods amounted by Thomas et al. [50].

Application of more spine-specific tool—the Oswestry Disability Questionnaire—revealed more stable changes after desired ½, 3 and 4-year time periods, where most of means remained at a 1/2 lower than preoperatively with exception to 1 and 2 years of follow-up with a slightly better improvement of ODI means appending to the SF-36 and VAS outcomes. Though all the postoperative ODI means never reached the level of the CG population, for the majority of patients moderate disability was estimated. Our mid-term results suggest of substantial improvement in 1–2 years of observation attributed to studies [17, 37, 45], however, coincide with those short-term ones [6, 7, 41].

The goal of surgical treatment of LDH is sufficient decompression and minimization of pain intensity measured by the VAS. The measurements were performed pre- and postoperatively as described in the “Methods”. Accordingly to Zanoli et al. [59], the preoperative pain was intrinsic in population of the LDH analyzed, where the leg pain was the mostly prominent, likewise it reduced strikingly at all follow-up periods especially at the second year postoperatively and correlated highly with other questionnaires, therefore suggestive of effectiveness of the procedures performed.

A more or less remarkable postoperative pain was thought to be the consequence of peridural fibrosis, however, due to several findings [3, 30, 39, 40], the correct relation among radiological and clinical symptoms was not found. The free fat graft concluded as useful in reduction of scar tissue formation after surgery, did not result in clinically better outcomes [20].

Due to our study, pain alleviation over 50% was considered to be a positive response. In respective data, the obtained VAS scores indicated stability in change especially at 3 and 4 years of follow-up in both pain areas, thus highly correlating with the adequate ODI rates of other trials [10, 20]. However, the follow-up results by 3–4 years showed to be poorer than we expected and were not attributed to positive response. This could be explained by higher reherniation frequency, possible muscle injury during operation period or by insufficient and/or premature rehabilitation program, and/or by the scar tissue formation at these time periods [13].

Interpretation of clinical significance of changes in quality of life questionnaires becomes important in the evidence-based methods in terms of MCID, involvement of various instruments and underlying pathologies. Based on published SPORT study [54], it is reasonable to estimate MCID for the scales used: 10 points for the SF-36 subscales, 8–10 points for the Oswetry [23, 36]. The MCID in pain on VAS for lumbar surgery patients is considered to be 20 mm [23, 43].

The improved optical conditions serve for better differentiation of the anatomical structures with milder manipulation of the nerve root and dural sac [9]. It should be added that, by comparing the overall results of the questionnaires (Figs. 1d, 2a, 3b) among both golden standard surgical procedures, slightly better findings but statistically insignificant were in the MD group and maintained these unique surveys [33, 46, 51].

The reherniation rate along with other entities: segmental instability, facet joint disease, epidural fibrosis, incomplete or wrong level decompression, and poor patient selection could be attributed to the relatively frequent manifesting FBSS, comprising from 5 to 10% of lumbosacral spine surgery cases [16, 49]. In our series it did not overwhelm the results of these studies [12, 53]. Moreover, the complication rates reported by Waguespack and Ebeling were higher and could be explained by different referral patterns and categories provided.

Since LDH disease became a sore point of the working population, there is a demand to set the risk factors for work disability days after concrete surgery, where the most predicting ODI > 20, considerable degree of pain, poor motivation to work are among other confounding agents making the prognosis more complicated [44]. The early return to vigorous activities and daily work with the mean sick leave of 1.7 weeks seems likely not increasing the complication rate [8]. In present case, for 26% of rest population the disability level limited the return to job rate, which may contribute to increased consumption of analgesics.

Finally, patient satisfaction with both operating procedures was not as boosting, as it was claimed by Hersht et al. [26], where the overall experience was satisfactory for 90% of patients and several other factors such as adequate quality and amount of patient education, trust in one’s surgeon were taken into account. Based on our study, patients’ opinion on the effect of surgery associated with more subjective experience influenced by much more confusing factors compared to applied health-related life quality instruments have represented objective findings.

Conclusions

The results of present study suggest of predictably sufficient decompression in both operative techniques. Though quality of life parameters were defined as clinically improved, they had not reached preferred values of the CG. The remained minor low back pain should be held beneficial to demonstrate the effectiveness of the decompressive surgery alone. Despite slightly better values in the MD group, the technical superiority has not been proven, however, it could probably be the key to faster recovery and diminished complication rate. The secondary outcomes did not follow improvement in pain, function or satisfaction with treatment.

The provided methodology could be used in population-based studies or clinical samples that focus on specific impairments and seek to control for pain frequency, intensity or follow-up assessments testing the effectiveness of performed surgical procedures, and elicit the pathways leading to other impairments: functional limitations, disability, etc.