Introduction

Thyroid eye disease (TED) is the most common extrathyroidal manifestation of Graves’ disease [1, 2]. Besides autoimmune processes oxygen-free radicals and cytokines play a pathogenic role in TED [3]. There is a spectrum of ocular involvement in Graves’ disease, from complete absence of symptoms and signs-to-sight-threatening conditions [4]. Dysthyroid optic neuropathy (DON) is a major sight-threatening complication affecting 4–8 % of patients with TED [47]. It is defined as neuropathy caused by compression or stretch of the optic nerve associated with edema, volume increase and inflammation of orbital tissue due to deposition of excess glycosaminoglycans. Impaired vision leads to quality of life impairment [8, 9] and might have socio-economic consequences [10]. DON of recent onset requires urgent medical treatment (high-dose intravenous steroid pulses [11, 12]) and/or orbital bony decompression to avoid permanent or progressive visual loss [1319]. Because these procedures carry the risk of serious complications [2024], it is essential that a timely and correct diagnosis is made [25]. Controversy still exists regarding the diagnostic features of DON [26, 27]. Also, recognition of DON might be delayed, especially if the presentation is atypical [28, 29]. Furthermore, in newly presenting patients, it can be particularly difficult to correctly evaluate whether DON has developed recently and thus warrants urgent therapy. Finally, alternative causes for visual impairment can be present in patients with TED [3033] highlighting the fact that serious efforts should be undertaken to improve diagnostics and accurate management of DON. Several studies have suggested that serum levels of the functional thyroid stimulating immunoglobulins (TSI), measured with a novel cell-based thyroid stimulating hormone (TSH) receptor (TSHR) bioassay, may correlate with symptoms and signs of TED [3439]. Analytical performance, validation, clinical utility and standardization of this FDA-cleared bioassay have been demonstrated [4044]. Since recognition of DON requires sensitive diagnostic tools and clinical assessment may fail to reliably evaluate the onset and acuteness of DON, we hypothesized a relationship between the serum TSI levels and the onset of DON.

Methods

Patient data and ophthalmic assessment

At an academic referral multidisciplinary orbital center, complete data was collected regarding the presence of DON and associated clinical features in 180 consecutive and unselected patients with TED. DON of recent onset occurred early and/or within the last 6 months at the latest. Those patients had been treated with high-dose intravenous steroids and/or bony decompression surgery during this period of time. All had clinical and/or radiological evidence of DON. In patients with DON, data on visual acuity, relative afferent pupillary deficit, vision-field testing, colour desaturation testing, and visual-evoked potentials were obtained. Visual-evoked potentials (VEPs) were performed to pattern stimulation. Measurements were made of the peak-to-peak amplitude and the absolute latency of P100. VEP was considered abnormal according to the following definition: P100 amplitude reductions which were either relative (an interocular difference of >25 %) or absolute (a value <4 µV, which is the normal standard in this unit). The latency was considered abnormal if an absolute delay of P100 was measured or in case of a relative delay (interocular difference >8 ms, being beyond the upper limit of normal). Color vision was tested with Ishihara color plates. As recommended by EUGOGO [13] color vision was defined to be abnormal if more than two errors were made. Visual-field testing was performed with Goldmann or static automated perimetry using the central 30-2 full threshold program. We defined a visual field to be pathologic if a central scotoma or an inferior altitudinal defect was seen or if an enlarged blind spot, a paracentral scotoma, a nerve fiber bundle defect, or a generalized constriction were noted.

Post-DON mirrored a condition where patients were treated medically or surgically more than 6 months ago. Further ophthalmic assessments (e.g. fluorescein angiography, etc.) were performed if the “standard” investigations alone were unable to ascertain that the eye abnormality was due to DON or to other concomitant comorbidities. If both eyes had DON, the values of the more severely affected eye were chosen for analyses. In all patients, clinical assessment of TED and DON was performed according to the Consensus Statement of the European Group on Graves Orbitopathy, EUGOGO [13, 27, 45]. The clinical activity score (CAS) is based on the classical signs of inflammation and consists of seven items. One point is added for each item that is present and includes: spontaneous pain behind the globe, pain on attempted up-, side- or down-gaze, redness and/or chemosis of the eyelids, and of the caruncle. The CAS ranges from 0 to 7. Per definition, TED is called “active” if CAS is ≥3. To gain clinical information about “DON activity” the rates of active and inactive disease were compared between subjects with recent-onset and post-DON. This information was also used to assess whether the associations between TSI and the acuteness of DON were independent from clinical signs of activity or not. Regarding clinical severity, we differentiated according to EUGOGO between mild, moderate-to-severe, and sight-threatening TED or DON.

Endocrine investigation and laboratory tests

We retrospectively reviewed the medical records of all 180 patients and collected data on thyroid function and therapy that had been assessed at the endocrine outpatient clinic of the multidisciplinary JGU center. The protocol was consistent with the principles of the Declaration of Helsinki. Written informed consent from each patient for blood sampling and complete data set was obtained. Patients that had participated in previous studies of the thyroid laboratory had agreed that blood samples stored at the bio-bank of the lab were used for further research. This procedure was approved by the local Ethics Committee. None of the individual-related data were passed to third parties. Sera that had been stored anonymously at the thyroid lab were used for TSI measurements. Serum TSI activity was measured with a functional cell-based TSHR bioassay (Thyretain, Quidel, CA, USA) according to the manufacturer’s instructions. Briefly, TSI levels were measured in triplicate with the Infinite M200 micro plate reader (Tecan, Crailsheim, Germany). All measured values were corrected for the plate internal auto-luminescence by reduction of the mean value in blank wells. The results were reported as percentage of specimen-to-reference ratio (SRR %). SRR % values were calculated according to the following formula: SRR % = Average TSI specimen relative light units (RLU)/average reference standard RLU × 100 as previously described [38, 39]. Patient serum was considered positive for the presence of TSI if the resultant SRR % measured ≥140 % over the reference control. Any specimen having a coefficient of variation (CV %) >15 % was excluded from the data set and re-tested. The thyroid-binding inhibitory immunoglobulin (TBII) activity was measured with the human TSHR autoantibody radioimmunoassay (TRAK RIA, Thermo Fisher Scientific, Hennigsdorf, Germany).

To test the specificity of the TSI-bioassay, serum TSI levels were also measured in a large control group of 302 “healthy” individuals. “Healthy” was defined as a negative personal and familial history of thyroidal, metabolic, autoimmune, infectious, and tumor diseases, normal ultrasound imaging of the thyroid, as well as normal thyroid-related serum hormone levels and autoantibodies.

Data analysis and statistics

The main outcome measures of this study were the TSI serum levels in patients with DON of recent onset vs. in patients having either a post-DON status or TED without DON. The secondary outcome measure was to compare the TSI and TBII serum levels in TED patients. Regarding the TSI measures, the acquisition file templates and the parameters of data analysis were defined using the Tecan instrument control and data analysis software (Magellan Tracker version 6.6). The records of all acquired data were stored in the format of the original acquisition source files and as data calculations in exported SPSS (Statistical Package for the Social Sciences, Version 22.0, Chicago, Illinois, USA) spread sheets. T test and Chi-square test were used for univariate analyses to compare DON of recent onset with post-DON or TED with DON versus TED without DON. Spearman´s correlation coefficient was calculated for association analyses between continuous variables. Due to the small subgroups, an explorative analysis was performed regarding the differences of TSI and TBII in unilateral versus bilateral DON. One eye only from each subject was used for the analyses. For the multivariate analysis a binary logistic regression model was done. Sensitivity and specificity values with respect to discrimination between DON of recent onset and post-DON were determined for several TSI cut-points, resulting in a receiver operating characteristic (ROC) curve. As an overall measure of discrimination the area under the ROC curve (AUC) was considered. Furthermore, the ROC curve was used for determining an optimal TSI cut-point. Specifically, a pair of sensitivity and specificity was chosen to optimize the Youden index. Illustrations were made with error bars showing the mean ±95 % confidence intervals. Clinical activity of TED, thyroid function and treatment status of thyroid disease were included in the multivariate analysis to detect potential confounders.

Results

Patient characteristics

Demographic and clinical data of the 180 patients with TED are summarized in Table 1. Overall 30 (16.7 %) patients had DON of recent onset or post-DON and in 150 (83.3 %) patients the optic nerve was not affected by TED. DON of recent onset and post-DON were present in 20 of 30 (66.7 %) and in ten (33.3 %) patients, respectively. All patients had Graves’ disease and associated TED.

Table 1 Demographic and ophthalmic data

Clinical signs pertaining to dysthyroid optic neuropathy

Five of 20 (25 %) and 3 of 10 (33 %) patients with DON of recent onset and post-DON showed bilateral eye disease, respectively (p = 0.70). Clinical assessment of these 25 eyes with DON of recent onset and 13 eyes with post-DON revealed that optic disk swelling was present in all eyes with DON of recent onset. In post-DON optic disk swelling improved in all patients but signs of a condition after DON were detectable in all of them (paleness/minor optic disk swelling). Visual-evoked potentials were pathologic in all eyes (25 of 25; 100 %) of patients with DON of recent onset but in one of 13 (7.7 %) patients with post-DON, only (p = 0.005). A relative afferent pupillary defect was present in 21 of 25 (84 %) eyes with recent onset (unilateral or asymmetric) DON and in one of 13 (7.7 %) eyes in post-DON (p = 0.287). Color vision abnormalities were present in 30 and 10 % of patients with recent DON vs. post-DON. Vision field tests were pathologic in 73 % of affected eyes of patients with recent-onset DON and in 50 % with post-DON (p = 0.496). Visual acuity (logMAR) in the DON-affected eyes was 0.37 ± 3.28 in patients with recent-onset DON and 0.60 ± 0.99 in patients with post-DON (p = 0.449).

TSI correlated with the clinical activity score of TED (R = 0.70, p < 0.001). Mean TSI levels in active TED were 485.1 ± 132.3 vs. 277.7 ± 143.7 in inactive disease (p < 0.001). TBII was associated with clinical activity as well: it was 48.9 ± 107.6 IU/l in active vs. 16.5 ± 39.6 IU/l in inactive TED (p < 0.001).

No significant differences of TSI and TBII in unilateral or bilateral DON were found: of patients with bilateral DON of recent onset 80 % were TSI-and TBII-positive versus 100 % TSI-positivity and 86.7 % TBII-positivity in unilateral DON of recent onset. In post-DON, 60 % with a condition after bilateral optic neuropathy were both TSI and TBII positive versus 60 and 40 % being TSI- and TBII positive patients with unilateral post-DON.

TSI versus TBII to identify recent-onset dysthyroid optic neuropathy

TSI were measured in 482 individuals, 180 consecutive and unselected patients with TED with and without DON and in 302 healthy euthyroid subjects (aged 28 ± 8 years, 155 females). All controls were TSI negative with a mean SRR % (SD) 53.17 (16.12) and a median (range) of 51.5 (15–114). Differences of the TSI absolute values in patients with recent onset DON vs. in patients with post-DON or without DON are illustrated in Fig. 1. To evaluate the association between TSI and DON of recent onset, a multivariate analysis was performed (Table 2). Thyroid dysfunction and intake of antithyroid drugs were not included into the multivariate analysis, as there were no cases for these two parameters in the group of subjects with post-DON. In the binary regression analysis—independent from the potential confounders’ radioiodine treatment, thyroid surgery, or clinical activity of TED—serum positivity of TSI was associated with DON of recent onset. Given that high levels of TSI were associated with recent-onset DON, a discriminatory cut-point for TSI was determined based on a ROC analysis (reference: recent-onset DON). The ROC curve indicated reasonable discriminatory performance (area under the curve, AUC: 0.855, p = 0.002, 95 % CI 0.683–1.0; Fig. 2). A cut-point of 377 SRR % (chosen according to an optimal Youden index of 0.75) corresponded to a sensitivity and specificity of 0.95 and 0.8, respectively. Summarized in Table 3 are the rates of TSI-and TBII-positivity in clinically active versus inactive DON of recent onset and post-DON. The serum TBII values in patients with recent onset DON vs. in patients without DON and post-DON are illustrated in Fig. 3. Of patients with recent onset and post-DON 13 of 20 (65 %) and 4 of 10 (40 %) had clinically active TED, respectively. All patients but one with recent onset DON was also TSI-positive (p < 0.001), however, six of 7 (85.7 %) patients with clinically inactive TED with recent onset DON were TSI-positive. The single TSI-negative patient had thyroidectomy 10 years prior to the occurrence of DON. Of patients with post-DON and without DON, 6 (60 %) and 143 (95.3 %) were TSI-positive (p < 0.001). In contrast, TBII was positive (>1 IU/l) in 16 (84.2 %), five (50 %), and 124 (83.2 %) patients with recent onset DON, post-DON, and without DON, respectively (p = 0.031). In the subgroup of patients without DON, all 103 patients with moderate-to-severe TED but 40 of 47 (85 %) only with mild TED were TSI positive (p < 0.001).

Fig. 1
figure 1

Thyroid stimulating immunoglobulins, TSI [in SRR %] in patients with recent-onset dysthyroid optic neuropathy (DON) versus in patients with a condition post-DON versus in subjects with thyroid eye disease without DON. Plotted are the means ±95 % confidence intervals. Mean TSI (±standard deviation) was SRR % 478.3 ± 98.0 in patients with DON of recent onset versus 397.8 ± 174.4 in post-DON (p < 0.001) versus 408.5 ± 170.0 in patients without DON (p = 0.074). p values according to t test

Table 2 Multivariate analysis (binary logistic regression analysis): association between positivity of thyroid stimulating immunoglobulins (TSI), treatment status of Graves’ disease and clinical activity of the thyroid eye disease (TED) and the presence of dysthyroid optic neuropathy (DON) of recent onset
Fig. 2
figure 2

Receiver operating characteristing (ROC)-curve: the true positive rate (sensitivity) is plotted in function of the false positive rate (100-Specificity) for different cut-off points. Each point on the ROC curve represents a sensitivity/specificity pair corresponding to a particular decision threshold (here: dysthyroid optic neuropathy of recent onset). Given that high levels of TSI were associated with recent-onset DON, a discriminatory cut-point for TSI was determined based on a ROC analysis (reference: recent-onset DON). The ROC curve indicated reasonable discriminatory performance (area under the curve, AUC: 0.855, p = 0.002, 95 % CI 0.683–1.0; Fig. 2). A cut-point of 377 SRR % (chosen according to an optimal Youden index of 0.75) corresponded to a sensitivity and specificity of 0.95 and 0.8, respectively

Table 3 Positivity of thyroid-stimulating immunoglobulins (TSI) and thyroid-binding inhibitory immunoglobulins (TBII) in active and inactive dysthyroid optic neuropathy (DON) of recent onset versus post-DON
Fig. 3
figure 3

Thyroid binding inhibitory immunoglobulins, TBII [in IU/l] in patients with recent-onset dysthyroid optic neuropathy (DON) versus in patients with a condition post-DON versus in subjects with thyroid eye disease without DON. Plotted are the means ±95 % confidence intervals. Mean TBII (±standard deviation) was 25.0 ± 30.3 IU/l in patients with recent onset DON versus 38.3 ± 94.5 IU/l in post-DON (p = 0.280) versus 40.1 ± 97.1 IU/l in patients without DON (p = 0.504). p values according to t test

Association with therapy status of DON

All patients had initially been treated with high-dose intravenous steroids (500 mg methylprednisolone 2–3 times per week for a maximum of 2 weeks. At time of inclusion 8 of 20 (40 %) with recent-onset DON and no patient with post-DON had intravenous steroids in the past 6 weeks (p = 0.020). Steroid therapy did not interfere with the rates of TSI-positivity in the present cohort: of patients having had intravenous steroids recently all were TSI-positive. If response to steroids was insufficient bony decompression surgery was performed as recommended by EUGOGO [13]. At time of inclusion 7 of 25 eyes of patients with recent onset DON and 11 of 13 eyes with post-DON had been decompressed (p = 0.165).

Thyroid function and serological parameters

Twenty-eight of 30 (93 %) patients with DON/post-DON were euthyroid, while two were mildly hyperthyroid. In patients without DON, euthyroidism and thyroid dysfunction were present in 87 (58 %) and 63 (42 %) cases, respectively. Median time since onset of thyroid disease was 36 (range 0–216) months in patients with recent onset DON and 24 (0–96) in patients with post-DON (p = 0.450). Median time since onset of TED was 12 months both in patients with recent onset and in post-DON (ranges 0–216 and 0–72), respectively (p = 0.821). Graves’ hyperthyroidism was treated with antithyroid drugs in 103 (68.7 %) patients without DON and in five (25 %) patients with recent-onset DON (p < 0.001). Radioiodine had been administered in three (2 %) and 15 (50 %) patients without vs. with DON (p < 0.001). Similar data are noted for thyroid surgery: three (2 %) and 15 (50 %) patients without vs. with DON had thyroidectomy.

Serum TSI levels were higher in patients with Graves’ hyperthyroidism vs. in euthyroidism (SRR % 457.4 ± 143.4 vs. 378.2 ± 177.0; p < 0.001) and in patients still requiring antithyroid drugs vs. in those being euthyroid in remission (459.8 ± 151.8 vs. 327.2 ± 164.7; p < 0.001). In patients with TED, thyroidectomy led to a slight decrease of TSI serum levels only (409.7 ± 168.3 vs. 380.0 ± 183.3; p = 0.487). To assess the relationship between radioiodine administration and thyroid surgery with DON, the time lag between the occurrence of DON and these specific thyroid treatments was assessed: in five of 11 (45 %) and in five of 25 (25 %) patients, respectively, radioiodine therapy and surgery were performed within 1-year prior to the onset of DON.

Discussion

To the best of our knowledge, this is the first study to demonstrate within a group of consecutive and unselected patients with TED that high serum TSI levels are associated with the presence of DON of recent onset. Despite the strong association between clinical activity and the levels of TSI [39], the vast majority of patients with inactive TED and DON of recent onset were TSI-positive. This study is the first to indicate that serum TSI levels may be a useful diagnostic tool to identify patients with early onset DON who require urgent specific therapy—even if signs for activity are missing.

Since the vast majority of the patients with DON and/or post-DON (93 %) was euthyroid, thyroid dysfunction i.e. Graves’ hyperthyroidism had no relevant impact on this condition. The multivariate analysis showed a significant correlation and a very high odds ratio between high serum levels of TSI and the occurrence/presence of DON of recent onset.

As previously shown [2729, 32], clinical features in patients with suspected DON were noted to be highly heterogeneous. In the DON-patients assessed within this study, based on clinical findings alone and without any information regarding medical history it would have been difficult to decide whether DON had developed recently and warranted urgent therapy or not. Due to irreversible nerve fiber loss clinical signs of DON and decreased visual acuity may persist subsequently to medical or surgical decompression of the optic nerve (post-DON). In the present study, TSI usefully differentiated between three subgroups: DON of recent onset, post-DON and no DON. Except in one case in which total thyroidectomy was performed 10 years prior to the onset of DON all patients with recent onset DON were highly TSI-positive. In contrast, there was a wide overlap of the serum TBII values between the three study groups. Furthermore, the specificity of the novel cell-based bioassay was confirmed by the analysis of serum TSI-levels in a large healthy control group as all of these individuals were TSI-negative.

Since it has been shown that TSI is associated with both clinical activity of TED [39, 4648] as well as with the treatment status of thyroid disease [38, 4953], these variables were also included in the multivariate analysis. Interestingly, the association between TSI and DON of recent onset were independent from the clinical activity of TED. The comparison of TSI-positivity in clinically active versus inactive DON of recent onset and post-DON revealed the highest positivity rates in both clinically active and inactive DON of recent onset. In contrast, only one quarter of patients with clinically active post-DON was TSI-positive. This is in line with the previously published EUGOGO-survey on DON which emphasized that both soft tissue signs and signs of clinical activity did not help confirm or refute the diagnosis of DON, as such features were absent in several patients [27]. TSI could therefore be used as a biomarker to identify patients with acute DON even in the absence of signs of acute inflammation.

In the present study, no significant differences of TSI and TBII in unilateral or bilateral DON were found. This emphasizes DON as being part of a systemic disease. Nevertheless, DON usually occurs bilaterally in the majority of patients. Nevertheless, in the present cohort we had a relatively high proportion of unilateral DON. This might be because only eyes with definite DON (diagnosis was considered certain) were included. Eyes with equivocal DON (diagnosis likely but not certain) were not included into analysis. This might have led to an underestimation of the proportion of bilateral DON. The fact that only eyes with definite DON were included into analysis is also the explanation for the frequencies of the clinical signs of DON: for example, optic disk swelling was found in all patients with recent onset DON in the present study, whereas it is missing in up to 50 % of cases in daily routine [25]. This would have not been like this if we had included eyes with a suspicion of DON also. This might have led to an underestimation of the total prevalence of DON.

In this study, there were patients that had received radioiodine treatment in the year prior to the onset of DON. Hence, potentially, radioiodine could have increased the risk for DON at least in some of these patients [54, 55]. In another study a post-radioiodine therapy induced increase of TSHR autoantibodies in 50 % of the patients with Graves’ disease [56]. Further, a prospective controlled trial in Europe showed a dramatic surge of the TSHR autoantibodies serum levels post-radioiodine therapy while the high serum levels were sustained 5 years after radioiodine treatment [57]. However, multivariate analysis revealed that the association of serum TSI-positivity with recent onset DON was independent of radioiodine therapy.

The following limitations of the present study have to be addressed. First, this study compared the clinical and laboratory data of patients with different stages of TED with and without DON only at one time point. As this study was retrospective, we were not able to collect data of “very acute DON” but only a subgroup of patients that had been treated during the past 6 months. A prospective study should systematically measure the serum values of TSI and TBII prior to the occurrence, within the active phase of the disease, during and after specific therapy. Second, the collective of patients with DON was relatively small. Therefore, the results need to be confirmed or disproved in larger series. Furthermore, the method used for TSI measurement is not widely available and therefore the conclusion of this study cannot be widely used in clinical practice. Luckily, a careful clinical assessment and imaging is able to identify patients requiring urgent therapy in the vast majority of cases. Despite these limitations, this study is the first to provide assess TSI as a biomarker for DON. Optimized diagnostics of DON are necessary to ensure that patients requiring sight-saving treatment receive it in a timely manner, whereas those who do not need it spared the unnecessary risk of adverse events. Diagnostic tools such as measuring serum TSI levels seem to be useful especially in patients with atypical presentation and when clinical signs of activity are missing. High-quality prospective studies are warranted to assess whether measuring TSI may be used for the development and evaluation of novel therapies for patients with sight-threatening TED.