Introduction

Fibromyalgia syndrome (FS) is a musculoskeletal disorder characterized by chronic widespread pain, tenderness at marked sites of the body called tender points, and fatigue. It is often accompanied by a broad array of subjective symptoms such as sleep disturbances, headache, irritable bowel syndrome, morning stiffness, paresthesia of the extremities, frequent psychological distress, and depressed mood [1]. The prevalence of FS ranges from 2.1 to 2.4% in women and 1.8% in men [2]. Patients with FS experience disability and poor quality of life (QoL) [3]. Various pharmacological treatments including analgesic or non-steroidal anti-inflammatory drugs, opioids, anticonvulsivants, serotonin and noradrenaline reuptake inhibitors, selective serotonin reuptake inhibitors, growth hormone, and muscle relaxants have been used to treat FS [4]. However, the efficacy of these different drugs has been modest or poor.

Optimal management of FS consists of a multidisciplinary approach with a combination of pharmacological modalities and non-pharmacological treatments, such as hot and cold applications, balneotherapy (BT), patient education, and physical exercises [4,5,6]. BT represents a possible non-pharmacological and complementary intervention into the therapeutic strategy for patients with FS.

BT traditionally involves immersion in mineral and/or thermal waters from natural springs and should not be confused with bathing in tap water (hydrotherapy).

Various randomized clinical trials (RCT), meta-analysis, and systematic review showed a significant improvement in pain and tender point counts (TPC) at the end of a cycle of BT lasting in the medium long-term follow-up (3–6 months) [7,8,9,10]. Furthermore, despite low tolerance of physical treatments by FS patients, BT seems to be well-tolerated and have a lower percentage of side effects, which also are less severe, than those associated with pharmacological treatments. However, some aspects of these studies are criticizable and could be a source of bias such as the lack of double blind experimental design and the small number of patients included in the trials.

The aim of the present randomized double-blind controlled study was to investigate the efficacy of BT with a highly mineralized sulfate water on global pain and Fibromyalgia Impact Questionnaire total score (FIQ-Total) as primary end points in patients with primary FS. Secondary objectives were to evaluate the effects on QoL, anxiety, and depressive symptoms and to analyze the safety profile of BT. The follow-up period was extended to 6 months following study start to determine maintenance of the efficacy.

Our hypothesis is that BT is safe and superior to bath in tap water in short- and long-term follow-up in patients with FS.

Materials and methods

Trial design

This 24-weeks prospective randomized, controlled, parallel group, double-blind trial was conducted between April 2015 and November 2015. All potential participants were recruited at Rheumatology Unit of Trento (Italy) by a rheumatologist; treatments and all following assessments were performed at Levico Terme Spa Center (Trento, Italy). All eligible patients provided a written informed consent.

The study protocol followed the Principles of the Declaration of Helsinki 1964 and later amendments and was approved by the Ethics Committee of Santa Chiara-Trento (Italy) (decision no. 2/2014, March 20, 2014) and registered on http://www.clinicaltrials.gov (NCT02548065).

Participants

Inclusion criteria

Patients of both sexes who met the 2010 American College of Rheumatology (ACR) criteria for primary FS [11] and aged between 18 and 65 years were eligible for this study. All subjects resided in the rural area near the spa of Levico Terme within a 30-km radius, continued to live at home, and carried out their daily routines during the study period. Furthermore, entry criteria included (1) Widespread Pain Index (WPI) ≥ 7 and Symptom Severity Scale Score (SS) ≥ 5 or WPI 3–6 and SS ≥ 9 at screening (7 days before enrollment) and at baseline visit (time of enrollment); (2) stable doses of drugs for at least 4 weeks; and (3) are able to understand and willing to follow the study protocol.

Exclusion criteria

Exclusion criteria from the study included (1) severe comorbidity of the heart, liver, kidney, systemic blood disease, and neoplasms in the last 5 years; (2) concurrent autoimmune or inflammatory diseases (rheumatoid arthritis, systemic lupus erythematous, inflammatory bowel disease, etc.); (3) severe psychiatric illnesses (as current schizophrenia, major depression with suicidal ideation) within 2 years; (4) routine daily use of narcotic analgesics or history of substance abuse; and (5) pregnancy and nursing.

Also, we excluded patients taking part in another interventional trial or who had spa treatments 6 months previously prior to the screening visit and patients unable to understand consent form and the questionnaires.

Treatments

After the screening, patients were randomly assigned to receive a cycle of BT with Vetriolo’s water (VW) (BT group) or with tap water (control group). VW is a highly mineralized (fixed residue at 180 °C 1702 mg/l) strongly acidic (pH 5.7) sulfate (1100 mg/l) water, rich in calcium (111.0 mg/l), magnesium (65.5 mg/l), and iron (315 mg/l). The patients in the BT group were treated with daily immersion in a bathtub containing 150 l of VW at 36 °C for 15 min. The control group was treated with 36 °C heated tap water. The color of the tap water was changed to resemble that of the VW through the addition of powdered red bush (rooibos) tea. An independent blinded physician supervised the treatment to detect possible adverse events.

All patients in both groups had 15 min of bed rest after each treatment and received BT six times a week, for 2 consecutive weeks in the same place (spa center of Levico Terme).

Patients in both groups continued their established pharmacological and non-pharmacological treatments (physical exercise and/or physical and cognitive behavioral therapy) started before the screening visit.

Washout of concomitant symptomatic drugs was required for a week before the assessment of basal time with the exception of acetaminophen (≤ 3 g/day) taken as rescue therapy. However, all symptomatic drugs should be stopped 24 h before each visit to avoid masking symptoms of pain. Patients were advised to take acetaminophen only if needed.

Outcomes

The primary outcome measures were global pain during the last 24 h prior to assessment on a 0–10-cm Visual Analogue Scale (VAS) with 0 = “no pain” and 10 = “the worst pain possible” and FIQ-Total. FIQ, an extensively validated patient self-report questionnaire, evaluates the impact of FS on daily life [12, 13]. In this work, we used the Italian version of the FIQ [14]. It ranges from 0 to 100, with higher scores indicating greater impact of FS on functioning [12,13,14].

The secondary outcome measures included WPI, SS, Short Form Health Survey (SF-12), State-Trait Anxiety Inventory (STAI), and Center for Epidemiologic Studies Depression Scale (CES-D). WPI identifies body areas where pain or tenderness was felt during the previous 7 days, with a total of 19 body areas identified; the maximum score is 19 [11]. The SS score includes two parts (SS score 2a and SS score 2b): The first one evaluates three major symptoms (fatigue, trouble thinking or remembering and waking up tired, which can be coded 0–3 (0 = no problem;1 = slight or mild problems; 2 = moderate, considerable problems; and 3 = severe, continuous, life-disturbing problems), and the second one investigates three additional symptoms (pain or cramps in lower abdomen, depression, headache), which can be coded as present (1) or absent (0) (total sub-score 0–3). These three items are surrogates for the somatic symptom burden item of the ACR 2010 criteria [11]. The total SS score ranges from 0 to 12. The 12-item SF-12 is a short version of SF-36 that is a widely used measure of health and well-being, validated in multiple countries, including Italy [15]. SF-12 comprises two main domains: the physical component score (PCS) and the mental component score (MCS), and eight scales for assessing eight dimensions: physical functioning, physical role, social role, emotional role, bodily pain, general health, vitality, and mental health. Scores range from “0 to 100” where “0” indicates the worst condition and “100” indicates the best possible condition [16].

STAI is a self-report questionnaire with two independent 20-item scales (STAI T-Anxiety Scale or Form X-2 and STAI S-Anxiety Scale or Form X-1) for measuring state-related or trait-related anxiety [17]. A high score on the STAI corresponds to a high level of anxiety symptoms. We used the Italian validated version [18]. The 20-item CES-D Scale is frequently used to estimate the prevalence of depressive symptomatology in the general population. Respondents rate the frequency with which they have experienced particular depressive symptoms during the past week. Responses to each item range from 0 (less than 1 day) to 3 (5–7 days) and are summed to compute a total score [19]. The Italian version measures one depression factor scoring from 0 to 60; scores of 16 or above are considered cases of depression [20].

Clinical assessments of the patients were performed at the screening visit, at basal time, after 2 weeks, and after 3 and 6 months following the beginning of the study.

All assessments were performed by two blinded investigators (ST and RB).

Safety parameters

All adverse events, whether reported spontaneously by the patients or observed by the physician, were reported in a diary, noting the severity and any possible correlations with the treatment. Serious adverse events were to be reported immediately to the University of Siena’s Rheumatology Unit and resulted in the patient’s exclusion from the treatment.

Sample size

The sample size was determined for the primary outcomes, VAS, and overall score of FIQ-Total. According to previous studies [21, 22], we considered a difference of 2.0 points on the VAS scale and a change of 20% in the FIQ-Total at 15 days as a clinically meaningful difference in pain intensity and function impairment. Assuming a maximum dropout rate of 15% with a standard deviation of 2 cm for the VAS score and of 20 points for the FIQ-Total, a sample size of 50 patients in each group was required to obtain a power analysis of 0.80 with α = 0.05.

Randomization and blinding

Patients fulfilled the screening criteria after signed the informed consent were randomized 1:1 and allocated to one of two groups using a computer-generated table of random numbers.

Randomization was performed immediately before the basal time visit.

The principal investigator (AF) was provided with individual envelopes, each containing patient codes, corresponding to the treatment assignment. The investigator who assigned patients to their randomized treatment (GP), the assessors who performed the patient evaluation, the medical team and the nurses of the spa center, and the statistician were blinded to the treatment allocation. The result of the randomization was known exclusively by the bath assistant who filled the bathtubs with the mineral water or tap water.

The patients did not know which treatment they received, and to verify if the blinding was successful, the bath assistant asked them, at the end of the course.

Statistical analysis

The main analysis for efficacy and safety was an intent-to-treat (ITT) analysis. Missing follow-up assessments were replaced using the last observation carried forward (LOCF) method.

Descriptive statistics for each group was performed by calculating mean and standard deviation for quantitative variables, frequency counts, and percentage for qualitative variables.

The Kolmogorov-Smirnov test was applied to verify the normality distribution of all quantitative variables. The Student’s t test was used to compare sample data for normally distributed quantitative variables. The corresponding non-parametric tests of Wilcoxon or Mann-Whitney were used for non-normal dependent or independent data, respectively.

A significance level of 95% was chosen for all statistical analyses executed using the SPSS statistical software, version 10.

Results

Between 1 April 2015 and 1 November 2015, 273 patients were screened and 100 met the inclusion criteria. These patients were enrolled in the clinical trial and included in the ITT analysis (50 patients in the BT group and 50 patients in the control group) (Fig. 1).

Fig. 1
figure 1

Flowchart of participants in the study

The baseline demographic and clinical characteristics of the study participants were similar between the two groups (Table 1). Most study patients were female (95%), slightly overweight, with a median BMI of 25.49 ± 4.70 and with a mean duration of FS of 15.55 ± 11.73.

Table 1 Patient demographic and clinical characteristics at baseline (data are expressed as mean ± SD)

As shown in Table 2, there were no significant differences at baseline between the two groups concerning the outcome measures, except for SF-12 PCS (P = 0.009).

Table 2 Results of the primary and secondary end points evaluated in both groups throughout the study period

The changes from baseline in patient’s assessment of global pain measured by VAS and in FIQ total score are reported in Fig. 2a, b. Concerning measurement of pain by VAS in the BT group, we observed a statistically significant (P < 0.0001) reduction at the end of treatment (2 weeks) that persisted at 3- and 6-month follow-up (Fig. 2a) (Table 3). In the control group, there were no significant differences at the end of treatment and during the entire follow-up period. The differences between the BT and control group were already significant at week 2 (P < 0.01) and were maintained after 3 (P < 0.0001) and 6 (P < 0.001) months. The FIQ total score in the BT group resulted significantly improved after 2 weeks (P < 0.001) and persisted significantly until 6 months (P < 0.0001) (Fig. 2b), while in the control group, no significant differences were observed. The difference between the two groups was significant at the end of the treatment (P < 0.05) and during the follow-up period (P < 0.001) (Fig. 2b) (Table 2).

Fig. 2
figure 2

Changes in Global pain using a Visual Analogue Scale (VAS) (a) and in Fibromyalgia Impact Questionnaire (FIQ) total score (b) during follow-up period by treatment group. Data are expressed as mean ± SD. ***P < 0.001; ****P < 0.0001 2 weeks vs baseline vs; 3 months vs baseline vs; 6 months vs baseline. °P < 0.05, °°P < 0.01; °°°P < 0.001; °°°°P < 0.0001 BT vs control groups

Table 3 Number of patients with side effects in either treatment group

WPI significantly decreased at the end of the treatment both in the BT and control group, and this reduction persisted during the follow-up (P < 0.0001 and P < 0.01, respectively). The decrease of WPI was more significant in BT in comparison to the control group after 2 weeks (P < 0.05) and until 6 months (P < 0.0001). For what concerns SS total score and the sub-scores 2a and 2b, they significantly reduced only in the BT group at the end of the BT cycle (P < 0.01 for SS 2a and SS total; P < 0.05 for SS 2b) and until 6 months (P < 0.0001 for SS 2a and SS total; P < 0.001 for SS 2b). SF-36 PCS resulted at baseline significantly higher in the BT group than in the control group (P < 0.01), and the difference between groups persisted during the follow-up (P < 0.0001). SF-12 MCS and CES-D significantly improved in the BT group at the end of the treatment (P < 0.01) and were sustained until 6 months of follow-up (P < 0.0001 and P < 0.05, respectively); the differences between groups were significant after 3 (P < 0.05) and 6 months (P < 0.0001 for SF-12 MCS; P < 0.05 for CES-D) from the treatment. No significant differences between the basal time and all other times were observed in both groups for STAI score (Table 2).

Generally, the treatment was well-tolerated; during the cycle of BT, adverse events occurred in 10 (20%) patients of the BT group and in 22 (44%) patients within the control group. However, in the BT group, the side effects observed were transient and of light intensity, and no patient discontinued the therapy, while in the control group, two patients interrupted the therapy for exacerbation of symptoms (after five BT sessions in one case and after seven sessions in the other) (Table 3).

Furthermore, we notified that during the follow-up period, no adverse events or comorbidities able to influence our results, in particular the indexes of QoL, occurred.

Discussion

This paper aimed to assess the efficacy and tolerability of a cycle of BT with highly mineralized sulfate water in patients affected by primary FS. The results obtained indicate that BT provides a rapid and persistent belief in controlling the global pain and improving the function; indeed, the decrease of VAS score and FIQ was already significant at the end of the BT cycle and persisted after 6 months. Significant improvements were also reported in the BT group for WPI, SS, SF-12 MCS, and CES-D.

Nowadays, the management of FS is still a challenge, and it is generally accepted that a multidisciplinary approach with a combination of non-pharmacological and pharmacological treatments should be preferred and shared with the patient. At this regard, an increasing number of health professionals consider the non-pharmacological therapy the first choice of treatment and encourage a patient-tailored approach. In this context, BT represents an interesting alternative, and it was included in the European League Against Rheumatology (EULAR) revised recommendations for the management of FS [4].

The results of the current study strongly supported this evidence with a superior methodological quality than the previous studies. Indeed, this is the first double-blind randomized clinical trial that investigates the effect of BT in FS patients. The double-blind design is particularly important, and it was considered the main limitation of the previous trials, considering that a treatment such as BT requires active participation in the study for the patients and medical staff [10].

However, in literature, many review or meta-analysis suggest the positive effect of BT in FS considering the pain severity, FIQ scores, and indexes of QoL [7,8,9,10, 23]. Our data are in agreement with previous RCTs, although a lot of them were methodologically flawed and had small sample sizes [24,25,26]. Neumann et al. [24] and Buskila et al. [25] reported in two different papers the beneficial effect of BT at the Dead Sea on severity and frequency of FS-related symptoms and the significant improvement of QoL indexes. Other authors referred, as observed in our trial, a decrease of FIQ at the end of BT cycle that persisted until 6 months [26, 27]. In 2005, Zijlstra et al. [28] experimented a combination treatment of thalassotherapy techniques (including physical therapy and hydrotherapy), exercise, and patient education in FS patients and reported a medium-term (3 months) improvement of SF-36 and FIQ. More recently, the beneficial effect of BT, alone or in association to patient education or exercise, was confirmed by others [29,30,31,32].

In the present study, BT resulted safe and well-accepted by the patients without the frequent side effects associated to the pharmacological therapy of FS.

The efficacy of BT can be partially explained by heat which increases the pain threshold in the nerve endings and reduces muscle spasm [33]; indeed, thermal stress stimulates the secretion of beta endorphin helping to induce analgesic and anti-spastic effects of BT [34]. In addition to the physical properties, the minerals dissolved in the water can have a key role in the mechanism of action of BT [33, 35]. This assumption could be substantiated by the significantly better and longer effects of BT on pain and function in comparison to tap water bath at the same temperature in different musculoskeletal disorders [36,37,38,39]. Indeed, the persistent clinical efficacy of BT after the treatment constitutes another indirect demonstration of the specific chemical effect of mineral water [35].

Various in vitro and in vivo studies demonstrated that the mineral waters affect the oxidant-antioxidant balance with an enhancement of total antioxidant status and a decrease of oxidant release [40, 41]. Oxidative stress seems to play an important role in the pathogenesis and in the clinical symptoms of FS patients [42, 43]; the effect on oxidant/antioxidant system could contribute to explain the therapeutic efficacy of BT in FS.

Furthermore, BT seems to induce a reduction in circulating levels of Interleukin (IL)-1, prostaglandin E2 (PGE2) and leukotriene B4 (LTB4), important mediators of inflammation and pain, in FS patients, contributing to explain the mechanism of clinical benefits of the mineral waters in this disorder [44].

Another possible mechanism of action of VW in FS could be attributed to the high concentration of magnesium in this water; indeed, several studies have reported a deficiency of magnesium in FS subjects and a significant improvement of FS symptoms after magnesium supplementation [45].

Finally, additional factors such as environmental change, pleasant scenery, and absence of work duties can contribute to induce the clinical improvement obtained by BT; however, in the present study, in order to remove these unspecific factors, all patients lived in the areas near the thermal center and continued their routine activities.

Furthermore, other physical or rehabilitation treatments were not included in the spa program and during the follow-up period for both studied groups to eliminate all potential confounding factors.

Others strengths of this paper are the long-term follow-up, the use of the consolidate standards of reporting trials (CONSORT), and the assessment of the end points (e.g., pain VAS, SF-36, and FIQ) suggested by outcome measure in rheumatology (OMERACT) [46].

In conclusion, the results obtained from our study support the evidence of tolerability and long-term effectiveness of BT in controlling pain and improving functionality in patients affected by FS. Considering the usual side effects often associated with pharmacological therapy, this conclusion underlines the practical utility and the growing importance of BT among other complementary and non-pharmacological modalities in the management of FS.