Introduction

Arachnoid cysts (ACs) are congenital malformations of the arachnoid layer covering the central nervous system (CNS) [2, 3, 34]. They are most often asymptomatic, but may cause neurological deficits such as seizures, or non-specific symptoms such as headache, impaired cognition, or dizziness [10, 13, 15, 22, 27, 28, 31,32,33].

Various surgical techniques have been used to fenestrate the wall of ACs, creating communication with the normal subarachnoid space [1, 4, 7, 8, 11, 12, 14, 21, 23, 26, 29, 30], but surgical treatment of these cysts has remained controversial [13, 14, 16, 25, 26].

In previous prospective and retrospective studies, our group has reported that surgical decompression of AC can be performed with a relatively low risk and that it yields significant improvement of complaints and patient-reported quality of life on a short-term basis [12,13,14, 17].

The aim of the present study was to determine whether this postoperative improvement in quality of life (Qol) is permanent and stable and if there had been any change in Qol from short-term to long-term follow-up. We therefore re-examined the patient cohort that was included in our previous prospective study [17], approximately 6 years after surgery.

Methods

This is a prospective long-term follow-up study of the patients included in our previous study [17]. The included patients, see below, were followed up 5 ± 2 (mean 6) years after surgery.

Inclusion

A total of 154 adult patients were referred to our department with a newly diagnosed intracranial AC. Of these, 58 patients had minor complaints and no clear indication for surgery. Ninety-six adult patients had significant symptoms and were offered surgery. Six of them declined surgery despite severe symptoms. Eight of these 90 patients received a cysto-subdural shunt, whereas 82 were operated with craniotomy and fenestration of the cyst; these were invited to participate in the present study, and 76 of these consented and were included in the study.

All the 76 patients enrolled in our previous study had responded to the preoperative questionnaires. Of these, nine were lost to follow-up and two died from unrelated causes during the study period, thus 67 and 65 Qol questionnaires (Short Form 36 [SF-36] and Glasgow Benefit Inventory [GBI]) were accessible for pre- and long-term comparative analysis, respectively. Sixty-four headache and dizziness VAS scores were accessible for pre- and long-term comparative analysis. A comparison between the 11 patients lost to follow-up and the remaining study cohort revealed no significant differences in baseline demographics, questionnaire scores, and cyst volumes (for a summary of patients and cyst characteristics, see our previous publication [17]).

Surgery

All patients underwent a craniotomy under general anaesthesia. During this procedure, the parietal cyst membrane covering the inside of the dura was peeled off the dura and then removed from its attachment to the surrounding cortex, for temporal cysts all the way to the tentorial slit. The membrane that covered the brain surface (the visceral membrane) was not removed. Small bridging veins running for some distance along the parietal membrane before they drained into the basal sinuses were coagulated and removed with the parietal membrane. Large bridging veins near the sphenoid ridge, running for only a very short distance along the membrane before draining were not extirpated. The medial, remaining membrane, covering the basal structures (the tentorial slit, the oculomotor nerve, the carotid artery, and the optic nerve) was fenestrated, thus creating communication to the basal cisterns and the posterior fossa. When accessible, the arachnoid covering the Sylvian fissure was also opened.

Ethical approval

Ethical approval was obtained from the Norwegian regional ethics committee of Health Region West, and written informed consent was given by all participants according to the Declaration of Helsinki.

Questionnaires

For details regarding the questionnaires that were used, see our previous article [17].

SF-36

The SF-36 is a generalized survey of patient health consisting of 36 questions yielding eight scaled subscores, each reflecting a dimension of health and is the weighted sum of specified questions within the questionnaire transformed to a scale ranging from 0 to 100, where a higher score indicates a better health status.

The subscores in SF-36 are as follows: physical function (pf), role physical (rp), bodily pain (bp), general health (gh), vitality (vt), social function (sf), role emotional (re), and mental health (mh).

VAS

TheVAS is a simple and sensitive method for estimating the intensity of various chronic pain conditions, including headache and has also been validated for use in visual vertigo or dizziness. As an adjunct to Qol questionnaires, we used VAS for quantifying pre- and postoperative intensity for headache and dizziness. In this study, VAS questionnaires comprised five (headache) and three (dizziness) symptom-directed questions.

Glasgow Benefit Inventory forms

At short-term and long-term follow-ups, the patients completed GBI questionnaires, which is a survey designed to measure the effect of an intervention on the patient’s health status. It consists of 18 questions; the response to each question is based on a 5-point Likert scale ranging from a large deterioration to a large improvement in health status. The GBI questionnaire is evaluated by calculating a total score (Tot), a general subscale score (Gss), a social support subscale score (Sss), and a physical health subscale score (Phs). Possible scores range from − 100 to + 100, indicating a negative or positive change.

Statistical analyses

Data was analyzed using Excel (Microsoft, Redmond, WA, USA) and SPSS 20/21 (IBM, Armonk, NY, USA). If not otherwise stated, data is reported as averages ± one standard deviation. For comparison of pre- and postoperative scores, we used a paired Student t test. The general significance level was set to 0.05.

To investigate the association between quality of life and improvement in headache, dizziness, or radiological improvement, we used a linear regression model in SPSS. We performed both multivariate and univariate analyses to determine existing correlations between predictors (headache, dizziness, and sex) and patient health status as measured by improvement in SF-36 scores and positive GBI scores. Baseline SF-36 scores were checked for correlation.

Results

A total of 67 patients answered the long-term questionnaires, 35 males (52.2%) and 32 females (47.8%). The mean age was 52.1 (range 20–88 years).

Preoperative findings

Prior to surgery, 73 of the 76 patients (96.1%) reported some kind of subjective complaints: 64 patients had headache and 53 patients reported dizziness (for preoperative characteristics of headache and dizziness, see Table 1).

Table 1 Preoperative, postoperative, and long-term headache and dizziness characteristics

Quality of life measures

SF-36 scores: comparison between preoperative and long-term follow-up scores

In the long-term follow-up, we found highly significant improvements from the preoperative scores across all norm-based SF-36 dimensions (all p values < 0.008, paired samples t test, n = 58–65, F 1), except for general health (gh, p = 0.117) (see Fig. 1).

Fig. 1
figure 1

Long-term postoperative improvement in Qol—SF-36. a Box plots showing long-term delta SF-36 scores (postoperative long-term follow-up scores minus preoperative scores) for all eight SF-36 dimensions. The mean standard score for postoperative improvement was as follows: pf from 75.0 to 87.6, rp from 34.7 to 65.1, bp from 47.0 to 63.8, gh from 58.2 to 62.7, vt from 35.7 to 48.3, sf from 57.2 to 74.6, re from 50.2 to 77.1, and mh from 63.7 to 71.3. b Box plots showing postoperative short-term delta SF-36 scores (postoperative short-term scores minus preoperative scores) for all eight SF-36 dimensions [17]. The mean standard score improvement after surgery is noted above each box plot and was as follows: pf from 75.0 to 84.5, rp from 34.7 to 54.0, bp from 47.0 to 61.6, gh from 58.2 to 67.3, vt from 35.7 to 51.1, sf from 57.2 to 76.2, re from 50.2 to 73.1, and mh from 63.7 to 72.9. a, b Boxes represent values from 25 to 75 percentiles and median values are marked as lines within the boxes. Error bars indicate 95% confidence intervals

SF-36: comparison to healthy control group

The short-term follow-up study indicated that the Qol after surgery matched that of an age-stratified population average for all SF-36 dimensions [17]. At long-term follow-up, we found that the Qol mathced that of the same population average in six out of eight SF-36 dimensions (rp, bp, gh, vt, sf, mh) (all p < 0.05, independent samples t test, n = 55–58).

SF-36: comparison between short-term and long-term follow-up scores

Comparing the short-term follow-up and long-term follow-up for all subscale scores of SF-36, we found highly significant improvements for all dimensions (all p < 0.05, paired samples t test, n = 48–52). Thus, there appeared to be a continuing Qol improvement after the short-term follow-up.

Glasgow Benefit Inventory scores at long-term follow-up

Long-term total GBI scores were positive with mean improvements ranging from + 0.14 to + 15.25, indicating a long-lasting improved patient health status. The mean scores were all significantly higher than 0 (Fig. 2), except for the physical health subscale (Phs) score (p < 0.001, one-sided t test).

Fig. 2
figure 2

Improvement in Qol following surgery—GBI. a Box plots showing GBI total and subscale scores at long-term follow-up. Mean scores were as follows: Tot + 12.4 ± 27.84, Gss + 15.2 ± 32.86, Sss + 10.6 ± 24.74, and Phs + 0.1 ± 32.2. b Box plots showing GBI total and subscale scores at short-term follow-up27. Mean scores were as follows: Tot + 15.8, Gss + 19.3, Sss + 13.5, and Phs + 4.2. a, b Boxes represent values from 25 to 75 percentiles and median values are marked as lines within the boxes. Error bars indicate 95% confidence intervals

GBI: comparison between short-term and long-term follow-up scores

We found no significant difference in GBI scores at long-term versus short-term follow-up (p > 0.05, paired samples t test, n = 61).

VAS scores

Headache

The mean preoperative headache VAS score was 45.6 ± 25.8. There was a significant reduction in headache scores at short-term follow-up after surgery to 25.7 ± 23.5 (n = 64). At long-term follow-up, the VAS score for headache was 24.8 ± 26.6 (n = 61) (Fig. 3). In 45 patients, the follow-up headache VAS scores were reduced (improved) by a mean of 31.8 when compared with their preoperative scores, and in eight patients, there was no change and the mean VAS scores increased by a mean of 10.0 in the remaining eight patients.

Fig. 3
figure 3

Permanent improvement in headache and dizziness and following surgery. a Headache VAS scores. Bar diagram showing the average of mean patient-reported preoperative (45.6 ± 25.8, n = 76), postoperative short-term (25.7 ± 23.5, n = 64) [17], and long-term follow-up (25.7 ± 23.5 (n = 64)) headache VAS scores. a, b Error bars signify one standard deviation, and asterisk symbol above postoperative bar signifies a statistically significant improvement (all p < 0.0001). b Dizziness VAS scores. Bar diagram showing average of patient-reported preoperative (35.2 ± 31.2, n = 76), postoperative short-term (12.2 ± 20.1, n = 64) [17], and long-term follow-up (13.9 ± 22.8, n = 59) dizziness VAS scores

Dizziness

The mean preoperative dizziness VAS score was 35.2 ± 31.2. There was a significant reduction in dizziness scores at short-term follow-up after surgery to 12.2 ± 20.1 (n = 64). At long-term follow-up, the mean dizziness VAS score was 13.9 ± 22.8, (n = 59) for dizziness (Fig. 3). In 39 patients, the follow-up dizziness VAS scores were reduced (improved) by a mean of 47.3 when compared with their preoperative scores, and in eight patients, there was no change and the mean VAS scores increased by a mean of 25.2 in the remaining 12 patients.

Correlation between improvement in Qol and improvement of complaints at long-term follow-up

SF-36 long-term follow-up scores

We analyzed the possible impact of improvement in headache and dizziness, patient’s age, and sex on the improvement as reflected in the SF-36 subscale scores. Linear regression was performed between mean headache and dizziness VAS change, patient’s sex, and age versus all postoperative delta SF-36 subscale scores using both multivariate and univariate analysis models (ANOVA). Results are summarized in Table 2.

Table 2 Linear regression analysis showing correlation between delta SF-36 scores and change in predictors

We found that a reduction in headache or dizziness is related to an improvement in the SF-36 rp and bp subscale scores using multivariate regression analysis (p = 0.041 to 0.026), and in all dimensions except from the re subscale using univariate regression analysis (p = 0.026 to 0.007). Gender and age did not correlate with the SF-36 scores.

At short-term follow-up, we found a correlation between a reduction in headache or dizziness in six out of eight SF-36 dimensions [17].

GBI long-term follow-up scores

A linear regression analysis was performed between mean headache and dizziness VAS change, patient sex, and age versus all GBI subscale scores using both multivariate and univariate models (ANOVA). Results are summarized in Table 3.

Table 3 Linear regression analysis showing correlation between GBI scores and change in predictors

We found a significant correlation between improvement in headache and the GBI Phs subscale score using both multivariate (p = 0.038) and univariate (p = 0.014) analyses. A reduction in headache was related to an increase in the GBI Phs subscale score. There was no other correlations between predictors and any of the other GBI subscale scores.

We checked all predictors (headache, dizziness, patient age, and sex) for signs of multicollinearity or significant covariance. The preoperative mean headache and dizziness VAS scores showed significant correlation with a Pearson correlation coefficient of 0.941 (p < 0.001), but there was no significant covariance between other preoperative predictors. Similarly, there was a significant correlation between reductions in the mean headache VAS score and mean dizziness VAS score following surgery with a Pearson’s correlation coefficient of 0.443 (p = 0.001), which may explain some of the difference in correlation coefficients between our multivariate and univariate analysis models. There was no other covariance of predictors in our material.

Patient contentment with surgery

On long-term follow-up, we added a form regarding patient contentment with surgical decompression of the cyst. Fifty-nine patients responded to this questionnaire. Fifty-four patients (91.5%) were satisfied with the surgery, while five patients (8.5%) were not satisfied with the choice.

Discussion

Key results

The present study is the first to report prospectively on long-term results after surgical decompression of AC in a relatively large cohort (76 patients) of adults.

At follow-up 5 ± 2 years after surgery, we found a significantly improved Qol as measured with the Short Form 36 (SF-36) and to a lesser degree with the Glasgow Benefit Inventory (GBI). Moreover, we have demonstrated a correlation between improvement in dizziness and headache and improved Qol.

A total of 82.0% of the patients improved from their headache, 9% worsened, and 9% remained unchanged (considered as less than 5-point change in average VAS scores). The corresponding long-term results for dizziness were that 67.9% improved, 19.6% worsened, and 12.5% of the patients remained unchanged. These findings are in accordance with our group’s previous findings [17], giving the patients a probability ratio of 3–4 for improvement rather than worsening after surgery.

Quality of life

The patients’ long-term postoperative improvement in Qol scores may be explained solely by the postoperative symptomatic relief in headache and dizziness. However, as discussed in our previous study on the short-term effect of cyst decompression, we cannot rule out other, more subtle effects of the cyst as causes for the preoperative Qol reduction and the postoperative improvement. We have previously shown that temporal cysts may impair cognition and that cyst decompression restores cognition [9, 15, 22, 27, 31, 33], and other studies have demonstrated reduced temporal lobe volume and reduced metabolism and perfusion in the cortex adjacent to the cyst, and that these changes normalize after cyst decompression [5, 24, 35], paralleling the postoperative cognitive improvements in the same patients [5, 24]. Thus, it appears reasonable to assume that the impaired, preoperative cognition and the postoperative recovery may have influenced both the preoperative impairment and the postoperative Qol improvement in our patients.

Placebo and long-term follow-up

As discussed in our previous report on the short-term effects [17], there is a risk of the results being affected by placebo effects. Cyst decompression is mostly undertaken to alleviate subjective symptoms; it is therefore possible that patients, seeking justification for their decision to undergo surgery, will exaggerate any positive effect an operation might have had. There is also a possible selection bias; patients opting for surgical treatment are likely to have a high degree of confidence in what they can obtain from the operation before treatment. Our department has relatively large experience with vestibular schwannoma patients [19, 20]. Like AC patients, vestibular schwannoma patients score preoperatively below average on the SF-36 questionnaire, mainly because of the vertigo and dizziness. The symptoms of vestibular schwannoma patients are also mainly subjective, but they did not show any postoperative improvement in SF-36 scores, whereas our AC patients experienced significant postoperative symptomatic relief and improvement in Qol. This difference between patient groups argues against bias from having gone through surgery and supports the notion that the improvement in fact is caused by the operation.

The fact that our patients reported an additional long-term improvement in Qol from the short-term follow-up makes it even less likely that placebo has played any major role, as it seems rather unlikely that placebo has such long-term effects [6, 18].

Limitations and strengths

As discussed in our short-term report, the most important limitation of our study is the lack of conservatively managed patients as controls. However, our conservatively managed patients were not suitable as controls, as their preoperative complaints were much less severe than those constituting the indication for surgery in the operated patients.

As cyst decompression is not a life-saving procedure, it is crucial that patients receive thorough and unbiased preoperative information about both risks and benefits of the procedure, in order to enable them to decide whether their symptoms are debilitating enough to accept the risks of undergoing this surgery. We also believe that patients with significantly increased surgical risks due to comorbidity should not be operated upon, except after careful consideration, and then perhaps with less invasive methods [11].

Conclusion

The long-lasting improvement of patient symptoms and Qol after surgical cyst decompression makes it less probable that the decrease in symptoms and increase in Qol could be a matter of placebo.

The data presented in this study suggest that surgical treatment surgery is associated with long-term improvement in quality of life. Ideally, a firm conclusion that the observed improvement is caused by the surgical decompression cannot be reached without including a control group that is left unoperated. This, however, poses an ethical problem; one would then have to select a subgroup of patients with the same symptom intensity and impairment as the operated patients and leave them unoperated for the follow-up time used in this study, i.e., many years.