Introduction

The combination of high-dose intravenous (iv) glucocorticoids (GC) (ivGC) and orbital radiotherapy (ORT) is a well-established treatment for active Graves’ orbitopathy (GO) [17], the most common extrathyroidal manifestation of Graves’ disease [8, 9]. The use of systemic glucocorticoids takes advantage from their immune suppressive and antiinflammatory actions, which results in an overall beneficial effect on GO, ranging from ~60 to ~85 % of patients in various studies [6]. Several investigations have shown that the intravenous route of administration is superior to the oral route, both in terms of GO outcome and GC side effects [6], the latter in spite of the observation of a few cases of severe liver damage in patients given ivGC [10, 11]. ORT is used in GO because of its general antiinflammatory properties and, more specifically, because of its possible ability of damaging lymphocytes infiltrating orbital tissues, thereby exerting a local immune suppressive action [7]. There have been debates on whether ORT is actually effective for GO and on whether it should be employed, although the majority of studies have shown a good response to treatment, especially if associated with GC [7].

Based on findings of previous studies [17], in our Center we use routinely the combination of ivGC and ORT in patients with moderate–severe, active GO, regardless of thyroid treatment [12]. In the present study, we took advantage of the fact that the majority of our GO patients undergo routine visits in our GO Clinic, thereby allowing us to investigate retrospectively the outcome of GO treated with ivGC and ORT after a relatively long period of time, with the aim of indentifying factors that may affect the response of GO to treatment, an information that could be useful when designing a therapeutic strategy for GO. Thus, although several prospective and retrospective studies have addressed the issue of GO response to ivGC and ORT [17], little is known on the factors that may help predict GO outcome.

Patients and methods

Patients

A retrospective cohort study was conducted. Patients who met the following criteria were included: (1) no treatments for GO before first observation with the exception of eye lubricants; (2) GO treatment with ivGC and ORT between first and last observation; and (3) no treatments for Graves’ hyperthyroidism before first observation, with the exception of anti-thyroid drugs [methimazole (MMI) in all cases]; (4) no additional treatments for GO after the initial treatments.

A total of 96 patients meeting the above-mentioned criteria were identified out of 787 consecutive patients who came to our GO Clinic for a follow-up visit between September 2010 and June 2013. Demographical data on these patients are reported in Table 1. Signed informed consent was obtained from all patients prior to using their data.

Table 1 Features of 96 patients with Graves’ ophthalmopathy (GO) treated with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy, at our first observation before treatment, and then at last observation

Clinical and serological evaluation

All patients underwent an ophthalmological assessment at first and last observation, which included: (1) exophthalmometry; (2) measurement of eyelid aperture; (3) evaluation of the Clinical Activity Score (CAS) according to Mourits [13, 14]; (4) assessment of diplopia; (5) measurement of visual acuity. The GO features of patients are summarized in Table 2.

Table 2 Features and outcome of Graves’ ophthalmopathy (GO) in 96 patients treated with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy, at our first observation before treatment and then at last observation

Serum anti-TSH receptor antibodies (TRAb, Brahms, Berlin, Germany) at first and last observation were available in 93 patients. Thyroid ultrasound was performed in all patients at first observation. Thyroid volume was calculated using the ellipsoid formula, as reported previously [15]. Smoking habits were recorded in all patients. The thyroid features of patients are summarized in Table 1.

Treatments

As reported in Table 1, after the first observation 69 patients underwent treatment with 131I (a single dose of 15 mCi), 11 underwent near-total thyroidectomy, and 16 were treated with a procedure named total thyroid ablation [1618], namely near-total thyroidectomy followed by 131I (a single dose of 30 mCi). As reported previously [16, 17], in the latter group 131I was given ~45 days after thyroidectomy, and between thyroidectomy and 131I patients were given T3 (40 μg/day) to maintain euthyroidism, which was withdrawn 2 weeks before 131I. All patients were given LT4 at replacement doses after thyroid treatment.

Approximately, 2 weeks after thyroid treatment all patients started ivGC and ORT for GO. Patients were given one iv infusion/week of methylprednisolone acetate for a total of 12 infusions. The dose was 15 mg/kg for the first four infusions (maximum dose allowed 1 g), and 7.5 mg/kg for the last eight infusions. The cumulative dose of GC ranged between ~6 and ~8 g. The reason why we used a relatively low dose of methylprednisolone, compared with that usually used by others [18], was to avoid adverse events associated with ivGC, especially hepatitis [10, 11]. On the other hand, this protocol was largely employed in previous studies [16, 17]. No major side effects of GC were observed. ORT was performed with a high-voltage linear accelerator. A cumulative radiation dose of 10 Gy was delivered to each eye in 10 fractionated doses over a period of 2 weeks. All patients were treated bilaterally.

End-points

The primary end-point of the study was the possible relation between the overall outcome of GO and the following individual variables: age; gender, smoking habits, thyroid volume, thyroid treatment, TRAb at first and last observation, individual GO features at first observation (exophthalmometry, eyelid aperture, CAS, visual acuity, diplopia) and time elapsed between first and last observation. The secondary end-point was to investigate the relation between the variables, if any, affecting the outcome of GO and the individual GO features.

The overall GO outcome was evaluated as follows. GO was considered improved when at least one of the following criteria was fulfilled, without worsening of the other criteria: (1) reduction in proptosis ≥2 mm in at least one eye, with no increase ≥2 mm in the other eye; (2) reduction of CAS ≥1/7 points; (3) reduction in eyelid aperture ≥2 mm in at least one eye, with no increase ≥2 mm in the other eye; (4) disappearance or improvement (change of degree from constant to inconstant or intermittent, or from inconstant to intermittent) of diplopia; (5) increase in visual acuity ≥2/10. GO was considered worsened when at least one of the following criteria was fulfilled: (1) increase of CAS ≥1/7 points; (2) increase in proptosis ≥2 mm; (3) increase in eyelid aperture ≥2 mm; (4) appearance or worsening (change of degree) of diplopia; (5) decrease in visual acuity ≥2/10.

Data presentation and statistical analyses

Descriptive data are presented as median and interquartile range. When appropriate and as indicated, the following tests were performed: (1) Wilcoxon; (2) Mann–Whitney; (3) Fisher exact test.

Results

Outcome of GO following ivGC and ORT

The general and thyroid features of patients are reported in Table 1. The features of GO at first and last observation are reported in Table 2. As mentioned above, all patients were treated with ivGC and ORT and underwent an ophthalmological evaluation after a median follow-up period of 55.5 months. As reported in Table 2, exophthalmometry, eyelid aperture, CAS and diplopia improved significantly after treatment, whereas visual acuity did not. Concerning the latter, the finding is not surprising, considering that the vast majority of patients had a normal visual acuity (10/10) at first observation. Thus, only eight patients had a reduced visual acuity due to optic neuropathy (median: 2/10; IQR 1–4.5, range 0–8), which improved significantly after treatment (median: 8.5/10; IQR 5–8.5, range 5–10; P = 0.007 by Wilcoxon). These eight patients, who clearly had a severe GO, were not excluded as the degree of GO was not among the inclusion and exclusion criteria. In agreement with the clear amelioration of virtually all GO features, we estimated that the majority (67.7 %) of patients had an overall GO improvement, 29.1 had remained stable, whereas only 4.5 % had worsened.

Effect of individual variables on GO outcome

As reported above, the primary end-point of the study was the relation between the outcome of GO and several individual variables. Because of the low number of patients who had worsened, namely three, we grouped them together with patients who had remained stable and considered them as non-responders, whereas patients who had improved were defined as responders. Age, smoking habits, thyroid volume, thyroid treatment, TRAb at first and last observation and individual GO features at first observation (exophthalmometry, eyelid aperture, CAS, visual acuity, diplopia) did not affect significantly the outcome of GO. In contrast and as detailed below, the overall GO outcome was affected significantly by gender and by the time elapsed between first and last observation.

Relation between gender and GO outcome

As shown in Fig. 1, the prevalence of responders was significantly higher in females (76.4 %) than in males (48 %, P = 0.02 by Fisher exact test). We analyzed the individual GO features according to gender. As shown in Table 3, eyelid aperture, CAS, visual acuity and diplopia before treatment did not differ between males and females. In contrast, males had a significantly higher degree of proptosis at first observation. Presumably, the latter reflected the known difference between genders concerning exophthalmometry [19], rather than a more severe GO in males, as suggested by the fact that all other GO parameters before treatment were similar regardless of gender. In confirmation of the better GO outcome in females, after GC and ORT several GO features became significantly different between genders. Thus, in addition to exophthalmometry, which after treatment was still higher in males but to a greater statistical extent, eyelid aperture after treatment was significantly greater in males, as was the extent of eye muscle involvement (degrees of diplopia).

Fig. 1
figure 1

Prevalence of responders (patients who had improved) and non-responders (patients who had remained stable or had worsened) according to gender, in 96 patients with Graves’ ophthalmopathy treated with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy, after a median time of 55.5 months. P = 0.02 by Fisher exact test

Table 3 Features of Graves’ ophthalmopathy (GO) according to gender in 96 patients after treatment with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy

Relation between time and GO outcome

As shown in Fig. 2a, the time elapsed between first and last observation was significantly greater in responders (P = 0.02 by Mann–Whitney). We grouped patients into seven categories according to the time between first and last observation. As shown in Fig. 2b, whereas the prevalence of responders and non-responders was similar up to 36 months, there was an increase in responders beginning between 37 and 48 months and reaching a peak of ~80 % between 61 and 72 months, to plateau thereafter. We analyzed the individual GO features in these seven time groups. As shown in Table 4, exophthalmometry and eyelid aperture were quite variable over time, and visual acuity remained overall stable, the latter presumably due to the low number of patients with optic neuropathy. In contrast, CAS and diplopia were affected by time, thereby reflecting the observations reported in Fig. 2b. Thus, CAS decreased between 13 and 24 months and further between 25 and 36 months, to plateau thereafter. Diplopia improved progressively, beginning at 13–24 months, to reach a peak of amelioration between 37 and 48 months, although its behavior was quite variable compared with that of CAS.

Fig. 2
figure 2

a Median (±IQR) follow-up time (time elapsed between first and last observation) in responders (patients who had improved) and non-responders (patients who had remained stable or had worsened), in 96 patients with Graves’ ophthalmopathy treated with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy. P = 0.02 by Mann–Whitney. b Prevalence of responders and non-responders after patient grouping according to the duration of follow-up

Table 4 Features of Graves’ ophthalmopathy (GO) at various time points since first observation, in 96 patients treated with intravenous high-dose glucocorticoid pulse therapy and orbital radiotherapy

Discussion

Although ivGC and ORT are well-established treatments for active GO, little is known on which factors may affect the long-term GO outcome, an information that could be quite useful when designing a therapeutic strategy for GO. Here, we investigated this issue in a retrospective study, performed in a relatively large series of patients with GO. After a median follow-up period of 55.5 months, GO was found to be improved in ~70 % of patients. This finding is in line with the observations of previous retrospective studies on the long-term GO outcome after ORT [20, 21], although comparisons are quite difficult, because of the different patient selection, because in those studies not all patients were given GC, and finally because of the different outcome criteria used. Comparisons with the previous randomized, prospective studies assessing the effectiveness of GC and ORT are also difficult, because of their shorter follow-up periods and because of the fact that in some of those studies patients were treated with oral GC, rather than with ivGC as the patients studied here [2224].

In any case, the main end-point of the present study was not to investigate the response of GO to treatment, but rather to identify individual variables that may have an influence on the GO outcome in the long term. In this regard, we did not compare the outcome of GO with that of patients treated with ivGC, but not with ORT, which, in any case, would not have been possible because of a selection bias. Thus, in our Center ORT is usually given only to patients with inconstant or constant diplopia, in the absence of which patients are usually treated only with ivGC.

We found that gender and the time elapsed after treatment play a major role. Thus, the prevalence of responders (patients whose GO had improved at the end of follow-up) was greater in females: GO ameliorated in ~75 % of female patients and only in ~50 % of males. The duration of follow-up, which, as reported above, also affected GO outcome, was not different between males and females (not shown), as were all the other parameters examined, with the exception of exophthalmometry at first observation, which was greater in males, presumably reflecting the difference in exophthalmometry between genders already under physiological conditions [19], as also suggested by the fact that all the other GO parameters before treatment did not different between males and females. In agreement with the more favorable GO outcome after treatment in females, at the end of individual follow-up women had a lower degree of proptosis, eyelid aperture and diplopia than men. The obvious consequence of these observations is that, when planning a GO treatment in a given patient, he/she should be made aware of the fact that his/her chances of having a good response in the long term are about 50 and 75 %, respectively. Given the limitations of a retrospective study, an issue discussed also later, in our opinion, together with the severity of GO before treatment and the estimate of risk for ivGC side effects [25], this information should be taken into account when planning the treatment, not the least in view of the patient’s preference once properly informed.

As mentioned above, the duration of follow-up, namely the time elapsed after treatment in individual patients, also affected the outcome of GO, as it was significantly longer in responders (~60 months) than in non-responders (~40 months). In agreement with this observation, the prevalence of responders increased with time, reaching a peak of ~80 % between 61 and 72 months, which is a relatively long period. In this regard, it is possible that this late improvement of GO reflects the natural history of the syndrome rather than treatment. Thus, it is well known that GO improves spontaneously with time [1, 2629], as we also observed recently in mild GO [29]. CAS and diplopia were the GO features affected by time to a greater extent.

To our surprise, CAS at first observation did not affect the GO outcome in the long term. Thus, GO activity, of which CAS is a well-established measure [14], is known to affect the response to immune suppressive/antiinflammatory treatments [16]. How could this observation be explained? Most likely, the lack of relation between CAS and GO outcome is due to the fact that the majority of patients had a quite long follow-up, therefore, as mentioned above, a GO behavior reflecting not only the response to treatment, but also the natural history of the syndrome, which should not be affected by the initial degree of CAS. The same explanation probably applies also to the lack of correlation between the long-term GO outcome and TRAb, and between GO outcome and smoking habits, two factors that also are known to affect the response of GO to treatment [17, 30]. On the other hand, before treatment patients had a median CAS value of 3/7 points, indicating an active, but not a very active GO. Thus, CAS, as well as TRAb and smoking that are related to CAS, might not have affected the GO outcome in this series of patients simply because GO was not too much active to begin with, an issue that requires further investigations.

One possible limitation of the present study is that patients were relatively heterogeneous in terms of GO features before treatment, even though the majority of them had a mild-to-moderate GO. Ideally, this issue should be investigated in prospective studies that are certainly needed. Another, maybe even more important, limitation, is the fact that treatment for GO was started 2 weeks after treated of hypothyroidism, meaning that some of the patients given radioiodine might have been hypothyroid, a condition that is known to affect GO in a negative manner [17]. On the other hand, thyroid treatment did not seem to affect the outcome of GO, although this conclusion awaits confirmation in view of the relatively low number of patients treated with thyroidectomy or with total thyroid ablation (thyroidectomy followed by radioiodine), considering that the latter procedure has been reported to affect the outcome of GO by several studies [1618].

Obviously, this study carries the limitations of retrospective investigations. On the other hand, it would be quite difficult to perform a prospective study on the follow-up of GO in the long term. Taking this aspect into account, our observations confirm that the combination of GC and ORT is effective in GO and show that female have greater chances than males to respond to treatment. The notorious tendency of GO to improve spontaneously with time most likely contribute the late amelioration of the eye syndrome. Clearly, further prospective, possibly randomized studies are needed to investigate this issue further.