Introduction

Gadolinium-based contrast agents (GBCAs) are a family of paramagnetic compounds consisting of gadolinium ion chelated with linear or macro-cyclic ligands to form a stable complex [1]. Largely used in diagnostic neuroradiology for their ability to shorten T1- and T2-relaxation times of tissues enhancing tissue characterization, GBCAs have been applied to a large range of central nervous system (CNS) diseases, especially in inflammatory or oncologic conditions requiring periodic follow-up [2].

Despite GBCA safety and the limited number of adverse reactions [3] and long-term complications [4], possible gadolinium retention in the brain of patients receiving multiple GBCAs doses has recently become a major public health concern [5], with possible implications for clinical practice [6].

The presence of spontaneous high-signal intensity on unenhanced T1-weighted images of the dentate nucleus (DN) and the globus pallidus (GP) in patients who had undergone multiple GBCA administration has been described by Kanda and colleagues in 2014, showing a direct correlation between increased relative signal intensity and the number of contrast-agent administrations [7]. Since this first report, several qualitative and quantitative MRI studies confirmed a T1 shortening, on unenhanced images, of the deep nuclei of subjects who had received multiple doses of GBCAs [8, 9], notwithstanding the presence of a conserved renal function, preserved hepatobiliary system, or intact blood–brain barrier [10,11,12]. Following these observations, both animal [13, 14] and human [11, 15, 16] ex vivo studies demonstrated that these regional signal changes correspond to gadolinium retention, occurring independently from the presence of renal dysfunction [15].

These findings exerted a great impact on the scientific community, as they have possible consistent implications for the daily clinical practice, with concerns about the potentially harmful effects of gadolinium deposits [10]. Although no clear relationship between GBCA administration and overall disability has been described neither in animal models nor in human [10], the possible clinical impact of gadolinium retention in the brain is still largely unknown. A recent study reported an association between increased DN signal intensity and lower verbal fluency in multiple sclerosis (MS) patients, although a causal relationship between the two could not be determined [17]. At the same time, other authors have proposed the existence of a “Gadolinium Deposition Disease” (GDD), a still controversial definition used to designate the onset of mild, non-specific, self-limiting, and, generally, self-reported systemic symptoms arising from hours to months after GBCA administration [18,19,20,21].

To expand the current knowledge about the clinical meaningfulness of GBCA retention, we have assessed the impact of repeated intravenous administration of GBCAs on the clinical evolution of patients with relapsing–remitting MS (RR-MS) by investigating the relationship between MRI signs suggestive of gadolinium retention and disability progression.

Material and methods

Participants

This retrospective analysis was conducted on the same group of 74 RR-MS patients enrolled in a previous cross-sectional study [22]. For each patient included in the analysis, longitudinal clinical data were obtained within one week from the MRI acquisition and after a mean follow-up time of 3.6 (± 0.6) years. All patients fulfilled the 2010 revision of the McDonald’s criteria for MS diagnosis [23]. Seventy three out of 74 patients were in treatment with a disease-modifying therapy (DMT) at baseline (fingolimod n = 17; natalizumab n = 28; interferon beta-1a n = 20; interferon beta-1b n = 7; glatiramer acetate n = 1), with 35 patients under a different agent at the follow-up examination (fingolimod n = 23; natalizumab n = 19; interferon beta-1a n = 9; dimethyl fumarate n = 7; interferon beta-1b n = 5; alemtuzumab n = 4; rituximab n = 1; glatiramer acetate n = 1).

Global clinical disability was assessed at baseline and follow-up with the expanded disability status scale (EDSS) score, and disease duration (DD) was estimated as the interval period between the first occurrence of symptoms referable to the disease onset and the baseline assessment. Despite its known limitations [24], EDSS was selected as the outcome measure not only because it is routinely administered in our MS center and was the clinical measure more consistently available for retrospective collection across subjects and time points but also because, to date, it is still the most widely used measure of clinical disability progression in MS trials [25]. For each subject, the change in the EDSS score (ΔEDSS, calculated as Follow-up EDSS − Baseline EDSS) was reported. According to the criteria first described by Weinshenker and colleagues [26] and commonly applied in research setting and clinical trials [27, 28], patients were considered having a clinical worsening when scoring a ΔEDSS ≥ 1 (if baseline EDSS was 5.5 or lower), while for subjects with a baseline EDSS above 5.5, a ΔEDSS ≥ 0.5 was used to define progression. Finally, the number of new relapses (NR) and the change to a secondary progressive (SP-MS) type of the disease were recorded at the follow-up examination, as additional indices of clinical status.

Standard protocol approvals, registrations, and patient consents

This study was formerly approved by the local Ethics Committee. Written informed consent was obtained from all patients prior to enrollment.

MRI data acquisition

All MRI data were acquired on the same 3T scanner (Trio, Siemens Medical Systems, Erlangen, Germany) after one year (12.3 ± 0.9 months) from the last contrast-enhanced MRI scan. The following sequences were included in the acquisition protocol: a 3D Magnetization Prepared Rapid Acquisition Gradient Echo (MPRAGE) T1-weighted sequence (TR = 2500 ms; TE = 2.8 ms; TI = 900 ms; FA = 9°; voxel size = 1 × 1 × 1 mm3; 176 sagittal slices) acquired before contrast administration, a 3D Fluid-Attenuated Inversion Recovery sequence (FLAIR; TR = 6000 ms; TE = 396 ms; TI = 2200 ms; flip angle = 120°; voxel size = 1 × 1 × 1 mm3, 160 sagittal slices) used for the estimation of lesions load (LL) volume, and two unenhanced 3D double-echo spoiled GRE sequences (TR = 28 ms; TE1 = 7.63 ms; TE2 = 22.14 ms; FA1 = 3°; FA2 = 20°; GRAPPA factor = 2; bandwidth = 190 Hz/pixel; voxel size = 0.65 × 0.65 × 1.3 mm3; 128 axial slices).

MRI data processing and analysis

Hyperintense lesions on FLAIR images were identified and segmented for all patients included in the study. This analysis was performed in consensus by three observers (CP, MDS, EV) with more than 4 years of experience in brain imaging using a semiautomatic approach (Jim 7, Xinapse Systems, Northants, UK), and LL volume was obtained for each subject.

Corresponding lesion masks were then coregistered using an affine registration to the T1-weighted volume for an in-painting procedure, carried out using FSL v. 5.0.10 (FMRIB’s Software Library, www.fmrib.ox.ac.uk/fsl), to correct for the impact of white matter lesions during the segmentation procedure [29]. Using SIENAX [30], gray matter volume (GMV) (normalized by multiplying for the V-scaling factor) was obtained for each subject, as an index of cortical atrophy.

Qualitative and quantitative evaluation of DN hyperintensity were respectively used as clinical and advanced MRI indices of GBCA retention.

Qualitative evaluation was performed on volumetric T1w images, visually inspected independently by three different readers (CP, MDS, EV). In case of dubious interpretation of MRI findings, a fourth expert reader (AB) with more than 25 years of experience in neuroradiology was asked to review the images, to reach a final decision. For each patient, readers were asked to evaluate whether a clear and defined bilateral hyperintensity affecting DN was present. DN hyperintensity was considered present when reported by at least 2 out of 3 of the readers.

Quantitative evaluation was achieved through calculation of longitudinal relaxation rate (R1) maps according to [31, 32]. For each patient, the R1 values of the DN were obtained by hand-drawing two irregular bilateral regions of interest (ROIs) on the axial slice of the GRE images where the best representation of the DN was available, and mean values were normalized with those obtained by placing two circular ROIs on the brainstem. A complete and detailed description of R1 image analysis is available in [22].

Statistical analysis

The Fleiss’ kappa coefficient was calculated to assess the agreement between readers in evaluating the presence of DN hyperintensity, with the following classification: ≤ 0.20 = poor agreement; 0.20–0.40 = fair agreement; 0.40–0.60 = moderate agreement; 0.60–0.80 = good agreement; ≥ 0.80 = excellent agreement.

An independent two-sample t test was used to evaluate possible differences in terms of age, DD, and number of contrast-enhanced MRI between MS patients with and without a significant clinical worsening at the follow-up examination, while differences in terms of sex and EDSS values were probed via chi-squared and Mann–Whitney tests, respectively.

To verify the reliability of visually detected DN hyperintensity as an actual index of objective relaxometric modifications (and, thus, of GBCA retention), an independent two-sample t test was used with the aim of demonstrating differences in terms of DN R1 values between MS patients with and without DN hyperintensity.

Furthermore, possible differences in terms of ΔEDSS between patients with and without DN hyperintensity, as well as the comparison of DN R1 values between stable patients and those showing a clinical worsening were probed via the general linear model, removing the effects of potential confounding factors, such as age, sex, MS type, GMV, LL, DMT, number of NR, and DD.

Similarly, the possible relationship between R1 values and ΔEDSS was tested via hierarchical multiple linear regression analysis, including age, sex, MS type, DMT, number of NR, and DD in the first block and MRI variables in the second.

Analyses were carried out using the Statistical Package for Social Science (SPSS) package (SPSS Inc., version 17.0, Chicago, IL), with a p = 0.05 set to indicate a statistically significant difference in the between-group comparison and regression analyses.

Data availability

Anonymized data will be shared by request from any qualified investigator.

Results

Of the 74 patients enrolled in the baseline evaluation, 4 subjects did not return for the follow-up clinical examination and were, therefore, dropped-out from the subsequent analyses, thus leaving a total number of 70 subjects included in this study. Among these, 6 patients (8.6%) converted to a progressive phenotype and 12 patients (17.1%) showed a significant clinical worsening at the follow-up examination, with a mean ΔEDSS of 1.3 (± 0.78, ranging from 0.5 to 3.5 points). A complete list of demographical and clinical data of the studied population is available in Tables 1 and 2.

Table 1 Demographic and clinical information of the patients with multiple sclerosis included in the study
Table 2 Demographic and clinical information of the patients with multiple sclerosis showing a significant clinical worsening

A good agreement between readers was achieved when evaluating the presence of DN hyperintensity (κ = 0.64), with consensus reached in 74.3% of the cases (52/70 subjects). DN hyperintensity was visible in 27 out of 70 (37.0%) of the examined MS patients. An example of a positive DN involvement is shown in Fig. 1.

Fig. 1
figure 1

Axial slice showing a T1-weighted DN hyperintensity in a 28-year-old woman who has received 14 gadolinium administration

As reported in the baseline work in which quantitative MRI data were evaluated, the normalized DN R1 values proved to be significantly correlated with the number of GBCA administrations (p < 0.001) [22]. This correlation remained significant after correction for MS-related variables and was mainly linked to linear GBCAs rather than macro-cyclic chelates [22].

When investigating the correspondence between qualitative and quantitative DN alterations, significantly higher DN R1 values were found in MS patients showing DN hyperintensity on unenhanced T1-weighted images compared to those without visible MRI changes (1.15 ± 0.01 vs 1.11 ± 0.01 s−1, p = 0.03).

When testing for possible differences in ΔEDSS between patients showing DN hyperintensity and patients without visible changes on unenhanced T1-weighted images, no significant difference emerged between the two groups (p = 0.32) (Fig. 2). Similarly, no difference was found in terms of DN R1 when subjects showing a significant clinical worsening were compared to stable patients (p = 0.54) (Fig. 3).

Fig. 2
figure 2

Box plot showing the distribution of ΔEDSS values when patients were divided based on the presence of a T1-weighted DN hyperintensity

Fig. 3
figure 3

Box plot showing the distribution of R1 DN values when patients were divided based on the presence of a significant clinical worsening

Finally, when possible relationships between DN R1 values and ΔEDSS were tested, the regression model explained 23.4% of the variance of ΔEDSS, without any significant effect of DN R1 values (p = 0.76). A scatter plot of the correlation between MRI data and clinical findings is reported in Fig. 4.

Fig. 4
figure 4

Scatter plot of the relation between R1 DN and ΔEDSS values

Discussion

In this study, we explored the possible impact of gadolinium retention in the brain on global motor disability in RR-MS patients, showing that qualitative and quantitative MRI features suggestive of GBCAs deposition are not associated with significant MS-related global disability progression.

Free gadolinium ion is known to show high toxicity, whereas chelated gadolinium has been historically considered relatively safe [33, 34]. Although animal models failed to demonstrate significant harm or behavioral changes in rats after repeated GBCA administrations [13, 14], some evidences of acute toxicity in vitro [35] or in animal models [36] have been reported. In human, besides some anecdotal reports of nephrotoxicity [37], neurotoxicity [38], and pancreatitis [39], very few data are available on the long-term effects of GBCA retention in the brain tissue.

In this work, we expanded the current knowledge about gadolinium deposition investigating, in a cohort of MS patients, the global motor disability evolution as function of the GBCA retention. We found no difference in terms of disability progression when comparing MS patients with and without DN hyperintensity on T1-weighted unenhanced images, suggesting a relative independence between the most recognizable indirect marker of gadolinium retention in the brain and the clinical course of the disease. Nevertheless, although in our analysis the readers reached a good agreement in evaluating this neuroradiological feature, visual assessment of DN T1-weighted hyperintensity shows the limits of any qualitative analysis, both in terms of reproducibility and reliability. These limitations can be overcome using quantitative measurements, which allow for an accurate and solid evaluation of tissue properties and provide information on microstructural changes not achievable using qualitative analyses [40]. Therefore, in the present study, the lack of association between gadolinium retention qualitatively estimated as DN hyperintensity and clinical progression has been confirmed by quantitative data, demonstrating no significant difference in terms of DN R1 values between patients with a stable disease and those who manifested a significant clinical worsening. Furthermore, when investigating the possible relationship between DN R1 values and disability progression rate, measured as ΔEDSS change between baseline and follow-up examination, no significant association was found. This result, in addition to the reported lack of effect of DD and EDSS on R1 values at the cross-sectional analysis [22], further supports the independence between DN relaxometry changes induced by gadolinium deposition and clinical status.

Our results mitigate against the presence of conditions, such as the GDD, in which unspecific symptoms, such as tightness, pain, and persistent headache with clouded mentation (“brain fog”), are reported usually within 1 month after GBCA administration [20, 21]. In contrast with these results, which were obtained collecting data via an online survey in which MS patients self-reported their symptoms without a control group, our findings suggest that clinical worsening, although evaluated using a rather rough but widely used scale, such as the EDSS [25], is independent from objective and quantitative MR measures, such as R1.

A study recently published has proposed an association between MRI signs of gadolinium retention and cognitive disability in MS [17]. In this light, our results substantiate these evidences, proving additional information about the possible clinical reflections of GBCA deposition in MS patients. Indeed, the reported relation between increased DN signal intensity and poorer performances at verbal fluency proved to be independent from EDSS scores, a result in line with the one we obtained, further advocating against a possible role of GBCA retention in global motor disability evolution. However, it has to be noted that EDSS is unable to capture subtle functional changes linked to the physiological roles of the DN (e.g., planning, initiation, and control of voluntary movements) [41]. For this reason, future prospective studies are warranted with the inclusion of a more focused neurological examination, to unravel the possible association between gadolinium retention in the brain and subtle neurological and cognitive deficits. Furthermore, the lack of follow-up MRI data has limited our ability to evaluate the relationship between changes in DN R1 and clinical worsening. In this regard, assuming a linear relation between gadolinium dose and DN R1, we have explored the correlation between number of gadolinium administrations over the follow-up period and delta EDSS in subjects showing DN hyperintensity on the baseline scan by means of a Spearman’s rank analysis (data not shown). The lack of correlation between the two metrics seems to further support the independence of clinical worsening from repeated gadolinium administrations.

Another limitation of the present study is that the specific molecule administered in our population was not always reported in scans previously performed in different centers [22], thus not allowing to investigate a possible different behavior between linear and macro-cyclic agents in determining significant clinical changes. However, it is known that linear agents are more prone to release gadolinium ions compared to macro-cyclic ones [13] and that they induce significant R1 changes [22]. As we assume a potential relation between clinical changes and gadolinium retention, it is unlikely that macro-cyclic GBCAs, being less prone to deposition, could impact more profoundly clinical scores compared to those that accumulate but (at least in our studied patient group) do not induce significant clinical changes. Nevertheless, this speculation needs to be tested in future longitudinal studies in which the class of GBCAs is known for all subjects.

Finally, our mean follow-up time was 3.5 years, thus limiting our observation to a relatively short period, not allowing to conclude whether gadolinium retention in the brain causes more delayed toxicity.

Nevertheless, our results indicate that MS patients showing DN hyperintensity on unenhanced T1-weighted images, a neuroradiological finding with a significant correlation with the number of previous GBCA administration, have similar EDSS changes over a period of 3.6 years compared to those without DN high-signal intensity. Furthermore, mean DN R1 values of patients with a significant clinical worsening were comparable to those obtained in more stable subjects, and were unrelated to global motor disability.

These findings, taken together, provide evidences that qualitative and quantitative signs of gadolinium retention in the brain of MS patients who underwent multiple GBCA-enhanced MRI examinations do not determine significant changes on global motor performances and, thus, do not affect the course of the disease.