FormalPara Key Summary Points

While high-efficacy disease-modifying treatments for MS provided a reduction in new brains lesions and relapses when compared to low-efficacy disease-modifying treatments, there was no evidence of a reduction in new spinal cord lesions.

This study should encourage further research by including spinal cord outcome measures in MS drug trials and in prospective observational cohorts, to show whether there is indeed a regional difference in the effect of DMTs.

1 Introduction

Multiple sclerosis (MS) is an immune-mediated inflammatory disease of the central nervous system involving both the brain and spinal cord. In addition to brain lesions, approximately 80% of people with MS develop lesions within the spinal cord [1, 2], which are an important contributor to disability [3]. Spinal cord lesions, symptomatic as well as asymptomatic, are associated with more disability accrual in the long term and give an increased risk of secondary progression [2, 4]. Hence, the presence of spinal cord lesions generally warrants a more aggressive treatment strategy [5].

Disease-modifying treatments (DMTs) aim at reducing disease activity by targeting neuroinflammation, where a distinction is made between low- and high-efficacy DMTs, usually defined based on the degree of relapse reduction [6]. Of the DMTs currently used in clinical practice, no trials exist that report spinal cord lesions as an outcome. Consequently, little is known about whether DMTs do inhibit lesion formation in the cord as is the case for brain lesions. One single-center retrospective study of relapse-onset MS patients compared a group on intermediate- and high-efficacy DMTs to a matched group treated with no DMT and/or low-efficacy DMTs. There were significantly fewer new spinal cord lesions in the group with intermediate- and high-efficacy DMTs. However, when only patients with low-efficacy DMTs during follow-up were considered controls, the study was underpowered to show whether high-efficacy DMTs give a larger reduction in formation of new cord lesions than low-efficacy DMTs [7].

Important differences exist between the brain and spinal cord in MS. This argues that brain imaging outcomes in DMT studies should not just be extrapolated to the cord: First, anatomical differences exist, for example in white/grey matter organization, volume and venous anatomy [8, 9]. Second, the blood-brain (BBB) and blood-spinal cord barrier (BSCB) are different with regard to composition, function and, subsequently, their permeability for various cytokines [10, 11]. Last, immunological differences seem to be at play as well. For example, there are indications that the balance between Th1 and Th17 helper cells (through interferon [IFN]γ and interleukin [IL]-17) influence the degree of brain versus spinal cord involvement [12, 13]. These differences underline the importance to evaluate spinal cord separately from brain outcome measures.

In the present study we compared the occurrence of new spinal cord lesions in patients who started with a high-efficacy DMT as initial treatment to matched patients starting a low-efficacy DMT using observational data from the international MSBase registry. The aim is to improve our understanding on the effect of DMTs on spinal cord pathology in MS.

2 Methods

Observational patient-level data were collected from the MSBase registry (WHO International Clinical Trials Registry Platform identifier: ACTRN12605000455662). The MSBase Registry is an observational international MS database established in July 2004. Data are entered into the registry in a standardized manner, using the iMed or the MSBase online data entry systems. The entered MRI data are based upon local radiology reports. All code for data preparation and analysis can be found on https://github.com/danielkreiter/spinalcord-msbase. Data for this study were extracted on the 1st of July, 2022 (i.e., patients included between the start of the registry, July 2004, and data extraction, July 2022, could be included). Ethical approval for the MSBase registry was granted by the Melbourne Health Human Research Ethics Committee and institutional review boards at each site (or exemptions were granted according to local regulations). All patients provided informed consent as per local institutional regulations for inclusion of their data in the registry for clinical and research purposes.

2.1 Study Population

Selected relapse-onset MS patients had at least (i) a spinal cord MRI (at minimum cervical cord coverage) within 6 months before or after initiation of a first DMT, (ii) brain MRI and an EDSS score within 3 months of baseline spinal cord MRI, (iii) another spinal cord MRI at minimum 6 months later, (iv) availability of a minimal baseline dataset (date of birth, sex, treating center, dates of disease onset, clinical definite diagnosis and relapses, treatment start/stop dates). Patients who received placebos, blinded trial medication or treatment not in the list of eligible treatments (see below) were excluded. For the procedures applied to identify any invalid or inconsistent entries, see supplement S1. For a visual overview of baseline and follow-up time window definitions, see Fig. 1.

Fig. 1
figure 1

Baseline and follow-up time window definitions [34].* Censor on last follow-up spinal cord MRI within timeframe where treatment group cut-offs remained satisfied DMT disease modifying treatment; EDSS Expanded Disability Status Scale

Alemtuzumab, cladribine, mitoxantrone, natalizumab, ocrelizumab, ofatumumab, rituximab were classified as hDMTs [14, 15]. Interferons (beta-1a, beta-1b, peginteferon), glatiramer acetate, teriflunomide, methotrexate, azathioprine and immunoglobulins were considered lDMTs. For a clear distinction between the hDMT and lDMT groups, the S1P receptor modulators (S1Ps; fingolimod, ponesimod, ozanimod, siponimod) and dimethyl fumarate (DMF) were excluded from the primary analysis. In the secondary analyses DMF and the S1P receptor modulators were included as hDMTs. Patients treated with either lDMTs or hDMTs for ≥90% of the spinal cord MRI follow-up duration were included, respectively, in the lDMT and hDMT group. Follow-up time was defined as the longest possible spinal cord MRI follow-up in which treatment-group cut-offs remained satisfied. Patients receiving treatment from either class <90% of follow-up were excluded from the primary and secondary analyses. Since in most cases a time interval exists between the baseline spinal cord MRI and initiating the DMT, a 90% of follow-up cut-off was chosen instead of 100%. To assess the impact of the chosen cut-off, sensitivity analyses with ≥95% and ≥80% cut-off were also performed.

To account for the differences in duration of the treatment effect between therapies, the assumed duration of treatment effect after the last dose was based on data on the duration of biological effectiveness and values used in earlier studies: 2 months for natalizumab [15]; 6 months for mitoxantrone, ocrelizumab, rituximab and ofatumumab [15,16,17]; 5 years for alemtuzumab [15]; 96 weeks for cladribine [18]. For the other therapies, the treatment effect was assumed to last until the last dose.

2.2 Outcome

The primary outcome event was new spinal cord lesions during follow-up (new T2 or T1-Gd+ lesions), where the first spinal cord MRI (within 6 months before or after DMT initiation) is considered the baseline. Relapses and new brain MRI lesions were secondary outcomes included as validation, given that for these outcome measures there is a known stronger protective effect of hDMTs over lDMTs. Patients could have had repeated events during follow-up.

2.3 Statistical Analysis

Patients from both treatment groups were matched using propensity scores based on clinical (age, sex, time since first clinical event, EDSS, number of relapses in the preceding 12 months, number of relapses in the preceding 24 months, country) and radiological parameters (baseline number of brain and cord MRI lesions) [19]. The covariates chosen for matching are based upon factors influencing the choice of treatment class in clinical practice and which are used in earlier MS registry studies [15, 19]. Nearest-neighbor matching with a 1:3 variable ratio, without replacement and caliper of 0.1 was performed using the MatchIt R-package [20]. The balance of baseline covariates was assessed using the standardized mean difference (SMD). An SMD of < 0.1 was considered a negligible difference in the mean of a covariate between groups. All subsequent analyses using matched data were weighted to take the variable matching ratio into account. The common follow-up for each matched pair was determined as the shorter of the pairs follow-up periods (pairwise censoring) in order to mitigate attrition bias.

Cox proportional hazards models were applied on matched data to estimate the risk ratio of new spinal lesions, new brain lesions and relapses between treatment groups. In case of repeated events (i.e., multiple relapses, new brain or cord lesions during follow-up) these were all included in the time-to-event analyses. Cumulative hazard plots were used to visualize time-to-event data. All statistical analyses were performed in R version 4.2.1.

A secondary analysis was carried out including S1Ps and one with both S1Ps and DMF as hDMT. To evaluate the robustness of the results and the influence of methodological choices, four sensitivity analyses were performed: (i) changing the group cut-off of ≥90% lDMT/hDMT treatment of follow-up duration to 95%; or (ii) 80% to assess the impact of the chosen cut-off. Additionally, to assess the influence of matching procedure parameters, sensitivity analyses (iii) using a broad caliper in the matching procedure (1:10 match, with broad caliper of 0.4) were performed. In the final sensitivity analysis, (iv) the broad caliper was combined with an 80% cut-off to assess the effect of using a parameter set that maximizes the included sample size.

All analyses were corrected for multiple testing using the Benjamini–Hochberg procedure. Adjusted p-values are reported unless stated otherwise. Where in the primary analysis no evidence of a significant difference was found between the compared groups, the minimum detectable effect (MDE) was calculated within the available cohort at 1 − β = 0.80 and α = 0.05 with 200 simulations per outcome [21].

3 Results

In total, 1842 patients were identified with a relapse-onset MS who underwent spinal cord MRI within a time window of 6 months around initiation of their first DMT and had at least one more spinal cord MRI at follow-up (minimal interval 6 months after baseline spinal cord MRI). Where 68.7% of spinal cord MRIs covered the whole cord, the rest covered only the cervical cord. In 82.5% of cord investigations post-gadolinium sequences were acquired. Figure 2 shows the flowchart of patient selection and exclusion. Ninety-four and 783 satisfied hDMT and lDMT group criteria, respectively. Seventy-seven hDMT patients were matched to 184 lDMT patients. Patient characteristics before and after matching are shown in Table 1. The two matched groups were well balanced on the measured characteristics at baseline (SMD < 0.1). For additional figures on the matching procedure, see Supplement S2. The list of countries of where patients were treated can be found in Supplement S3. Treatments used during follow-up in both treatment groups (total groups and the subgroups that developed new cord lesions) as proportion of the total follow-up time are shown in Fig. 3.

Fig. 2
figure 2

Flowchart of screening, inclusion and matching of study population. EDSS Expanded Disability Status Scale, hDMT high-efficacy disease-modifying treatment, lDMT low-efficacy disease-modifying treatment, SC spinal cord

Table 1 Patient characteristics of study groups before and after matching
Fig. 3
figure 3

Disease-modifying treatment usage during follow-up as proportion of follow-up time. ALE alemtuzumab, CLA cladribine, GLA glatiramer acetate, IFN interferons, IVIG immunoglobulins, MIT mitoxantrone, NAT natalizumab, OCR ocrelizumab, OFA ofatumumab, ONA ozanimod, SIP siponimod, TER teriflunomide. * Fingolimod (0.06%), glatiramer acetate (0.17%) for the hDMT group. In the control group: methotrexate (0.21%), dimethylfumarate (0.03%), natalizumab (0.13%), fingolimod (0.12%), siponimod (0.44%). In the subgroup with new cord lesions in the control group: methotrexate (0.67%), natalizumab (0.10%)

At baseline, 81.2% of matched patients had at least one spinal cord lesion (60.5% in unmatched cohort). In the hDMT group there were 12 (15.6%) patients who developed 12 new cord lesions (mean follow-up 2.6 years [SD 1.5 years]; 0.06 lesions per patient year; 55.6% gadolinium-enhancing). In the lDMT group, 51 (27.7%) patients developed 65 new cord lesions (mean follow-up 4.2 years [SD 3.0 years]; 0.08 lesions per patient year; 41.3% gadolinium-enhancing). After pairwise censoring, mean follow-up time was 2.1 years (SD 1.2, range 0.5–8.3 years). There was no statistically or clinically significant difference in the risk of new cord lesions between groups (HR 0.99, [95% CI 0.51, 1.92], p = 0.97), as can also been seen the survival curves in Fig. 4. This while there were fewer new brain lesions (HR 0.22, [95% CI 0.10, 0.49], p < 0.001) on MRI and relapses (HR 0.45, [95% CI 0.28, 0.72], p = 0.004) in the hDMT compared to the lDMT group. When S1Ps and DMF were included as hDMT, there was a similar pattern (see Table 2).

Fig. 4
figure 4

Cumulative hazard plots and risk tables for A new spinal cord lesions, B new brain lesions and C relapses in the hDMT versus lDMT group. CI confidence interval, hDMT high-efficacy disease-modifying treatment, HR hazard ratio, lDMT low-efficacy disease-modifying treatment

Table 2 Hazard ratios and respective confidence intervals and p-values for the primary, secondary and sensitivity analyses

Sensitivity analyses showed similar results, an overview of which is shown in Table 2. Here the differences in the occurrence of spinal cord lesions were somewhat larger (HRs between 0.58 and 0.70 across the different sensitivity analyses) but not statistically significant. Analysis of the minimum detectable effect size was performed for the primary analyses. This did not show statistically significant differences, which was only the case for the occurrence of new cord lesions. This analysis was sufficiently powered to detect a minimum difference of 55% in cumulative hazard for new spinal cord lesions.

4 Discussion

This observational propensity score-matched study of patients with relapse-onset MS aimed to show whether treatment with hDMTs results in a larger reduction of new spinal cord lesions when compared to lDMTs. The study found no evidence of a difference for the occurrence of new cord lesions between the hDMT and lDMT groups. In contrast, the study found a reduction of new brain lesions and relapses in the hDMT when compared with lDMT. Therefore, while earlier literature suggests that using a DMT in general reduces the formation of new spinal cord lesions [7], currently there is no evidence for a larger reduction of spinal cord disease activity when using hDMTs compared to lDMTs, as is the case for brain lesions and clinical relapses.

While the primary analysis convincingly shows no difference in new cord lesions between groups, the sensitivity analysis using a broad caliper combined with a loosened cut-off for the proportion of follow-up duration used for the definition of treatment groups (≥80% instead of ≥90%) showed some difference between groups for new cord lesions (HR 0.58, p = 0.05). While these liberal parameters led to a larger included sample, with still small differences in baseline group characteristics (SMD < 0.1, see Supplement S4), this result needs to be interpreted with caution, since 20% of time not on the group-defining medication could be of influence.

A possible reason for the discrepancy in the effect of hDMTs in preventing new cord lesions versus brain lesions lies in the differences between brain and spinal cord with regard to anatomical organization, BBB/BSCB composition and function [10], regional immunological differences [12, 22, 23] and maybe also in repair capabilities [24,25,26].

Exploratively comparing the medications being used in the group of patients that developed new cord lesions while under a hDMT to the group that did not (see Fig. 3), there are no striking differences with regard to which treatments were used. Therefore, currently, there are no indications that one subgroup of hDMTs would have a more potent effect on the spinal cord compartment of the CNS. This contrasts with a smaller recent local cohort study, where cord lesions were less common in patients being treated with B-cell–depleting therapies [7].

With the use of conventional MRI, as in the present study, we can only address focal inflammatory demyelinating activity for which focal lesions on MRI are a marker [27]. However, diffuse inflammation and axonal loss in the cord are other important components of cord pathology resulting in disability worsening [28, 29]. Therefore, we do not know whether these different groups of DMTs have a different effect on the relevant pathological processes other than focal demyelination. Currently, quantification of diffuse inflammation and demyelination in the CNS in vivo, and even more in the spinal cord, is still challenging and requires advanced imaging techniques, which currently are not easily applied in clinical practice [30, 31].

4.1 Limitations and Strengths

Inherent to the use of registry-based data in this study, there are some important limitations. A main limitation is the absence of systematic and comparable acquisition of radiological outcomes and the lack of information in the registry on acquisition methods. Since spinal cord MRI imaging is challenging in acquisition as well as the assessment, and artifact-prone, detection of lesions is very dependent on the quality of the investigation. Also, information on the field strength on which the investigations were performed was not available. But, in contrast to brain imaging, the difference in lesion sensitivity between 1.5 T and 3 T seems limited for spinal cord MRI [32]. Still, with regard to acquisition, it is not expected that there would be a structural difference between the two groups. One has to recognize that the incidence of new spinal cord lesions in this study is probably not representative of the true occurrence of new cord lesions. Spinal cord MRI is currently not an investigation performed routinely at follow-up [33] and is mostly performed at the discretion of the clinician. This means that the included patients (with >1 spinal cord MRI) were probably patients with either new cord-related symptoms, unexplained disability progression, who possibly switched therapies during follow-up or were followed-up with cord MRI due to earlier cord lesions. This could have resulted in patients with good therapy response to be underrepresented in both groups. Finally, given the minimum detectable difference of 55%, a small-to-moderate reduction of the risk of spinal cord activity when using hDMTs compared to lDMTs for new cord lesions cannot be fully ruled out.

With regard to the propensity scores, ideally, gadolinium-enhancing lesions on MRI and CSF markers (number of oligoclonal bands, IgG index) would also be part of the covariates used in the matching procedure, since these are also considered in treatment decisions in clinical practice. However, due to the degree of incomplete cases that this would lead to, these were not included.

Furthermore, a large number of patients from the registry could not be included because there was no spinal cord MRI close enough to treatment initiation (see Fig. 2). For a large part, this could be due to more limited registration of spinal cord MRIs in the registry compared to brain MRIs, while in reality these were probably performed since spinal-cord MRI is a routine part of the diagnostic workup in international guidelines. However, when patients initiate treatment—not after diagnosis but later at follow-up—it is likely that there are a certain number of cases where there was no rebaseline spinal cord MRI causing their exclusion from this study. However, this is unlikely to have caused a structural difference between groups since this applies to both groups. Also, there were differences in the median follow-up time, where the median follow-up time in the lDMT group was longer than in the hDMT group. A possible reason is that in the lDMT group relapses or progression occurred more often until longer after baseline leading to more spinal-cord MRIs being performed until later after baseline. To avoid attrition bias, this has been accounted for in the analyses by adapting pairwise censoring.

A strength of this study is the application of matching to address confounding by severity and the matching resulted in similar groups across the important determinants. But it is still possible that differences exist between the groups on unmeasured variables. Another strength of this registry-based study is that it is a multicentric study including patients from all over the world. Although, it is important to note that the main part of the sample consisted of patients from Italy followed by Turkey and Australia.

5 Conclusion

In conclusion, there is currently no evidence that hDMT usage results in a larger reduction of new cord lesions than lDMTs, as is the case for new brain lesions and relapses. Whilst, this is an interesting and important finding, results need to be interpreted with caution as the present registry study has important limitations as discussed in the limitations section. The current findings should be considered a plea to work towards more, and a higher level, of evidence to either confirm or reject the presently suggested discrepancy of the hDMT effect on brain lesions and relapses versus spinal cord lesions. In order to gain further insight into the effect of DMTs on spinal cord pathology in MS, it is necessary to (i) more often include spinal cord outcome measures in MS drug trials and prospective observational cohorts, (ii) encourage registration of spinal cord data in MS registries and (iii) study the interplay between therapies and the different MS pathological processes in the cord using advanced imaging techniques, which have increasingly become part of the research toolbox.