Introduction

Recent treatment regimens of rheumatoid arthritis (RA) include disease-modulating and symptomatic drug therapy as well as multimodal rehabilitative programmes aiming at long-term disease management with pain relief, preservation of remaining functions and development of compensatory functions. Together with psychological care, specific exercises, physical and occupational therapy [1] spa applications represent an integral part of physical therapy for RA [2] in many European countries.

The chemically inert natural radioactive gas radon has been applied in rheumatology since the beginning of the 20th century with a few European health resorts as vanguard in its therapeutic use.

Evidence from several newer randomised controlled trials (RCT) [310], clinical observational studies [1115] and empirical experience over decades are in keeping with sustained analgesic effects. Beyond pain relief, anti-inflammatory [16] and immune modulating effects [17] were reported from laboratory studies.

Within the last years, formerly restrictive health-care authorities and insurance companies became increasingly open-minded concerning radon therapy, not least because of the increasing evidence of beneficial mid/long-term effects, the very small radioactive doses [18] and the probably beneficial risk ratio.

The only RCT in RA patients [7, 9] reported superiority of radon spa therapy compared to reference treatment regarding pain and health-related quality of life at the end of the 6 months’ postintervention observation period. We designed a replication study aimed at comparing a series of baths of natural spring water containing radon plus carbon dioxide (Rn + CO2) versus radon-free (CO2) baths (control) within a complex rehabilitative programme, which is currently reimbursed by statutory health and pension insurance companies in Germany.

Methods

Study design

The study was a trial with two randomised parallel groups and patients, therapists, and investigator unaware of group allocation. It was performed in Bad Brambach, which is known for its natural springs containing both radon and carbon dioxide gas in therapeutically relevant concentrations. We hypothesised that the radon spa therapy is superior to the control treatment. Quarterly follow-ups had been performed until 1 year after the end of the intervention.

For the externally performed randomisation, a computer-generated random allocation sequence was provided by the responsible biometrician (AF). Randomisation was restricted to block lengths of four.

According to the list, a bar code card was prepared for each participant. An automated device activated by the patients’ cards guaranteed correct tub fillings according to group assignment [16]. The de-blinding was performed not before the last follow-up unless in case of emergency. The ethics committee of the University of Dresden approved the protocol. Written informed consent was obtained from all patients.

Patients

Patients with RA according to the 1987 revised ACR criteria for RA [19] and with sufficient knowledge of the German language were eligible. Exclusion criteria were current exacerbations of the inflammatory process or other systemic inflammatory diseases, concomitant musculoskeletal diseases possibly interfering with outcome measurement, pregnancy, breast feeding, disorders of the central nervous system, a known tendency toward thrombosis, malignant hypertension, coronary heart disease, heart failure, arrhythmia, severe disorders of lungs, kidneys or liver, advanced malignancies, abuse of alcohol or drugs, major skin lesions, severe fever or infections.

Setting

The study participants were inpatients routinely referred to the rehabilitation hospital, Bad Brambach, from widespread regions of Germany. Because only three statutory health-care/pension bodies agreed to the randomisation procedure, patients were only invited to participate if they met the inclusion/exclusion criteria and were members of one of these three funds. As proposed by Fielding et al. [20], a third parallel group of non-randomised consecutive patients was formed including patients who did not consent to randomisation or who came from other financing bodies. These patients received the usual 3 weeks in-house rehabilitation programme including radon baths like the randomised index group. However, this group is not part of this report.

Interventions

The rehabilitative treatment consisted of daily specific exercises/ physiotherapy, occupational therapy and galvanic baths thrice a week each, and Swedish massage twice a week. In case of need/request, psychological care was provided. Furthermore, patients assigned to Rn + CO2 took 15 baths in natural spring water (1.1 kBq per l Rn; 1.3 g per l CO2), whereas the control group received 15 plain CO2 baths (artificially produced, 1.6 g/l). The water temperature was 35°C.

Apart from Sundays, 20 min whole body baths were administered daily at the same time starting the fourth day after arrival. Additional offers (leisure time sport, relaxation therapy) were allowed in order to maximise the patients’ compliance. Criteria for treatment discontinuation were adverse reactions, withdrawal of informed consent, current inflammatory episodes or need of steroid medication.

Outcomes

Primary outcome criteria were self-assessed limitations in everyday life and private activities as well as limitations in the occupational context both measured on 100 mm visual analogue scales (VAS) anchored at ‘not at all’ and ‘dramatically restricted’. The mean of both scores was used as main outcome measure (MOM) for patients who were still on their jobs. The scale on everyday life restriction was used alone in pensioners/ unemployed persons. This pooled and rather global criterion met the interests of the cooperating health-care/pension bodies best. It was accompanied by pain intensity (PI), pain frequency (PF), morning stiffness (MS), functional capacity (FC) and drug consumption.

PI was measured on a VAS as well. All VAS were double measured at baseline and averaged to increase the reliability, although VAS are regarded as reliable, valid and sensitive to change [21, 22].

PF and MS were assessed on verbal scales. To compare the findings of the groups for both assessments, all changes were summed up over the course of time. This summarising measure was dichotomised to ‘at least two levels improved’ versus ‘less two levels improved’.

FC was both patient-rated by the Hanover Functional Capacity Questionnaire for RA [23, 24] and investigator-rated by Keitel’s functional test [25] (possible only before the start and after the end of treatment). Both proved to be valid and reliable [24, 26].

The drugs were categorised as (a) DMARD, (b) corticosteroids and (c) NSAIDs and/or analgesics. DMARD were assessed qualitatively, primarily to describe the groups. It was not expected to observe substantial changes in quantity or quality because of the rather short-termed study intervention (compared to the long-term duration of disease) and rehabilitative orientation of the treatment. For steroids, and NSAIDs and/or analgesics, we hypothesised a mid/long-term reduction after radon treatment. Therefore, all steroid intakes were transferred into prednisolone equivalents (in mg) according to guidelines [27, 28].

Operationalisation of NSAIDs and/or analgesics was more difficult because many patients used two or even three different drugs from a broad range of medications. We decided to use the percentage of recommended dose of every medication [29, 30] and summed up all intake percentages for further analyses. The ESR (Westergren method), and the serum concentration of CRP were measured before start of intervention to describe disease activity.

Data were gathered before the first and after the last treatment, as well as quarterly for 1 year thereafter by means of a postal questionnaire. Change scores built from start minus course differences were used for analyses.

Sample size

We assumed moderate between-group differences because of the intensive basic treatment in both groups. Given a two-sided type-I error probability of 5%, balanced groups and a test power of about 80%, 63 patients per treatment arm would be necessary to detect relevant between-group differences. Taking dropouts into account, about 140 patients should have to be included in the study. About 3 years’ time was estimated for the patients’ recruitment.

Statistical methods

Primary analysis was performed by intention-to-treat. All randomised patients were included. Missing values (MV) substitution was performed only for the MOM according to a predefined strategy. For MV at the end of intervention, the ‘last observation carried forward’ method was applied. Single missing follow-up (FU) values were handled by means of linear interpolation. For the few more MV, the mean change score of the respective group and time point [31, 32] was filled in. Repeated-measures analyses of covariance (RM-ANCOVA) were done using baseline scores as covariates. Hierarchically ordered hypotheses ensured the nominal significance level and allowed examining long-term effects [33, 34].

Based on previous evidence on treatment effects, until 6 months post intervention, the first RM-ANCOVA should be performed including data from the end of treatment, 3 and 6 months’ FU. Only if this analysis resulted in significant group main effects (GME) or significant group × course-interactions (G × C), the data from 9 months’ FU should be included into the respective second analysis. If the second analysis produced significant differences, a third analysis should be carried out including data from the 12 months’ FU. No interim analysis was done. Neither statistical adjustments regarding multiple testing nor replacement strategies for MV were applied for secondary analyses. Inferential statistics on these outcomes were interpreted descriptively. RM-ANCOVA, Fisher’s exact test and calculations of odds ratios with 95% CI for improvement rates were performed. Common descriptive statistics were presented in tables and figures.

Statistical analyses were done with SPSS Version 12 (SPSS Corp., Chicago, IL).

Results

Patient flow

The trial started in July 1998 and was completed (including the last patient’s 12 months’ FU) in May 2005. The recruitment period lasted that long because only three health-care/pension bodies agreed to the inclusion of their members, if they too consented. About 186 patients were screened of whom 35 did not fulfil the inclusion/exclusion criteria, and 14 refused randomisation because of possible exclusion from radon therapy. In total, 207 RA patients were enrolled of whom 134 were randomised (n = 67 per group) and are reported here. Details of participants’ flow are shown in Fig. 1.

Fig. 1
figure 1

Flow of participants through the study

Baseline characteristics

Table 1 summarises the characteristics of the enrolled sample. About two-thirds of the participants were women. Mean age was 56 years (SD 12 years) and mean duration of disease was reported as 11 years (SD 10 years).

Table 1 Patient characteristics of the study population

For approximately two-thirds of the patients, marked erosions of joints, cysts, lesions or partial dislocations were found in X-rays. Nearly 90% were treated with DMARD, thereof 60% with MTX alone or in combinations. More than 60% were treated with corticosteroids and NSAIDs/analgesics. Disease activity according to the laboratory measures was low but showed a remarkable variation in both groups. All in all, baseline characteristics and outcome measures were quite well balanced between groups.

Protocol compliance

Neither violations of inclusion and/or exclusion criteria occurred nor were there erroneous allocations to the false treatment. The mean number of baths was 15.8 (SD 2.6) in the radon group and 15.3 (SD 2.0) in the control group, with 65 and 68% of patients receiving 15 baths exactly. Less 14 baths were applied in 6/65 respectively 3/66 patients because of menorrhoea (4×) and long holiday periods (Easter/Christmas) or early discharge (4×). The remaining patient suffered from intercurrent illness so that the bath treatment had to be interrupted. No series was less than 11 baths. No relevant group discrepancies of treatment intensity were found (Table 2).

Table 2 Treatment programme within 21 days of rehabilitation

Guessing after end of treatment whether they believed having received radon therapy, 30/61 of the radon group and 37/60 of the control group were unsure. About 44% of the whole sample assumed their allocation correctly. Not less than 25% of the control group guessed as being treated with radon baths. De-blinding before the end of FU period happened in just one case after the 6 months’ FU on patient’s request.

Outcomes

Change scores of outcome measures and results of statistical analyses are shown in Table 3.

Table 3 Change scores and ANCOVA results of outcome criteria

Primary analysis

Both groups showed comparable treatment effects at discharge (Fig. 2) but between-group differences increased with longer follow-up periods until 9 months post intervention. Therefore, RM-ANCOVAs in all hierarchically ordered analyses (until 6, 9 and 12 months’ FU) resulted in significant G × C interactions. Regarding the GME, no statistical significance, although consistently low p values were observed (Table 3).

Fig. 2
figure 2

Change scores of limitations in everyday life/job (mean and 95% CI) for the treatment groups (main outcome measure)

Despite a decreased extent of effects after the 6 months’ FU, the radon group did better until at least 9 months’ postintervention as compared to baseline, whereas the control group had returned to values below the baseline level already at the 6 months’ FU (Fig. 2, Table 3).

Secondary analyses

Favourable longitudinal changes of pain relief were observed at least until the 9 months’ FU in the radon group. Between-group differences were most pronounced 9 months’ postintervention, resulting in RM-ANCOVA P-values for G × C until 9 and 12 months’ FU of borderline significances (Table 3). These PI results were observed in spite of reduced drug intakes of corticosteroids and NSAIDs and/or analgesics in the radon group (Figs. 3, 4).

Fig. 3
figure 3

Change scores of corticosteroids (mean and 95% CI) for the treatment groups

Fig. 4
figure 4

Change scores of NSAIDs and/or analgesics (mean and 95% CI) for the treatment groups

Regarding corticosteroids, an increasing dose reduction was found in the radon group during the entire FU period, whereas the dose remained essentially unchanged in the control group (Fig. 3). A borderline significant group main effect was found until the end of observation (Table 3).

For NSAIDs and/or analgesics, reduced intake was observed until the 9 months’ FU (Fig. 4), and the 12 months’ FU was still comparable to that of the pretreatment phase in the radon group. In contrast, in the control group an increased dose was observed 1 year postintervention. These differences proved to be significant (GME) for RM-ANCOVAs until the 9 and 12 months’ assessments (Table 3).

No significant group differences were found regarding PF, MS or functional capacity. Consistently to the FFbH questionnaire, (Table 3) the Keitel test provided small and non-significant between-group differences at the end of treatment. For PF and MS, the dichotomised summary parameter resulted in 19/64 versus 17/60 (OR = 1.07 [49; 2.32]) and 17/61 versus 16/60 (OR = 1.06 [48; 2.37]) improvements respectively.

During the course of study neither intervention-related adverse reactions nor serious adverse events occurred. Two intercurrent illnesses were observed with causality to both baths and other interventions judged unlikely.

Discussion

Our study is the second RCT comparing radon spa therapy to radon-free treatment for RA and demonstrates that Rn + CO2 baths are superior to plain CO2 baths within a multimodal rehabilitation in the long run. Secondary analyses support the results of the primary analysis. Longer lasting improvements in pain intensity, as well as reduced doses of corticosteroids and NSAIDs and/or analgesics were observed in favour of the radon treatment. Protocol adherence and tolerability could be judged as excellent.

Furthermore, reduced drug consumption in the radon group might correspond with a lower risk of known side effects, especially, as the combination of steroids plus NSAIDs is known for an even higher risk rate than NSAID alone [35].

Various years ago, the mortality due to gastro-intestinal complications following NSAID consumption was estimated to be about 2.000 per year in Germany [36] and 16.500 per year in the USA [37]. Therefore, additional use of gastro-protective agents or consumption of COXIB was recommended. Nevertheless, in 2004, still 1/400 NSAID consumers suffered from and 1/8,000 died because of ulcer complications in Germany [38]. Whilst a remarkable decrease in gastro-intestinal event rates was associated with development and launching of COXIB [39, 40], the focus regarding adverse reactions changed to cardiovascular complications [41] for both NSAID and COXIB.

Taking into account the annual frequency of 11 million NSAID prescriptions in Germany [38], there is consensus that even small reductions in medication may positively affect the individual long-term career of RA patients [4143].

Our results are, in general, in accordance with those of other trials in patients with inflammatory rheumatism ([9] in RA; [5, 10, 30] in AS). All trials showed more pronounced mid-term/long-term benefits in favour of radon therapy over various months of FU than that suggested by between-group differences at the end of treatment. In the AS trial of van Tubergen et al. increased functional capacity could be demonstrated (using BASFI), which could not be shown in our RA trial. Sensitivity to change of the FFbH-P self-assessment questionnaire may be limited; its results, however, were consistent with the Keitel test after the end of treatment. Similar to our findings, a reduced NSAID consumption was also reported by Lind-Albrecht [30] in AS patients. The consistency in the current and former results strengthens the evidence of benefits of radon treatment.

Various lines of evidence indicate a potentially causal relationship between radon spa therapy and the inhibition of inflammation and pain relief. Radon (²²²Rn) dissolved in bath water is incorporated via the skin and by inhalation, and is distributed via blood circulation all over the body. Retention time in the body is short, with 50% disappearing within 15–30 min [44], mainly through exhalation [45, 46] and also through excretion and diffusion through the skin after the bath. The decay rate is about 0.2% during the passage through the body [47], while small doses of high energetic alpha-particle radiation are being emitted. The short-living decay daughters of radon—218Po, 214Pb, 214Bi and 214Po—mainly deposited on the skin and in the lung [47], further fuel the applied dose of energy.

Due to its relatively high atomic mass, alpha radiation is absorbed immediately on contact with matter and is therefore only effective over very short distances [44]. The relatively large transfer of energy associated with this absorption causes a series of complicated reactions on the molecular and cellular level, resulting in cell apoptosis with high efficiency.

Evidence from physiological studies underpins the biological plausibility of radon as a therapeutically active substance. Alpha particles stimulate the release of anti-inflammatory cytokines (like TGF-β and IL-10) as a consequence of the phagocytosis of apoptotic cells by dendritic cells. These cytokines act as antagonists against the pro-inflammatory cytokines (like TNF-α, IL-12, IL-1β, etc.) and thus down-regulate the cellular response regarding the activity of macrophages and neutrocytes and limit the migration of leucocytes. This mode of action is already well known for UV-B and low-dose X-rays and is increasingly supported by experiments and physiological observations under low-dose alpha-particle irradiation. The intensified biological efficacy of low-dose alpha radiation compared to that of X-rays is explained by the enhanced linear energy transfer along the tracks of the alpha particles in tissue and by the so-called ‘bystander’ effect. It has been shown that not only those cells hit directly by alpha particles react in this manner but also the neighbouring ones [47, 48].

Furthermore, CO2 also dissolved in the spring water may contribute to enhanced effects because it forwards the blood circulation and intensifies the radon transfer and the incorporated concentrations within the body [46].

Currently, no evidence on anti-rheumatic effects of CO2 baths (exclusively) exists based on controlled clinical trials. The only available trial in musculoskeletal conditions was performed by Mucha [49] in stage-I patients with algodystrophy. Although it reported pain relief and functional improvement, the study suffered from various methodological flaws and was regarded insufficient to prove therapeutical effectiveness [50].

It could be assumed that the known CO2 effect of hyperaemisation of peripheral vessels contributes to a quicker and/or better removal of analgetic substances. But, to date, this is not supported by data. There is no hint that CO2 might be able to influence the pathogenesis of the rheumatic processes, although it possibly may have an impact on reactive concomitant symptoms.

Nevertheless, in applying CO2 to both treatment groups, the effect differences observed could only be explained by radon, which was the sole systematic distinction in the treatment of both groups. Therefore, the benefits we found in our trial are assignable to radon spa therapy, in general.

Considering the meanwhile established reproducibility of positive long-term clinical outcomes with a series of radon baths, future studies should address aspects of its cost-effectiveness as done in AS [51] and the underlying immunological processes regarding the long-term reactions in inflammatory rheumatism. Also, dose-finding studies seem to be reasonable to combine best benefit and least risk for the patients. Additionally, the role of radon spa therapy in osteoarthritis and other degenerative musculoskeletal disorders should be further researched.