Introduction

Measures of central nervous system (CNS) atrophy are increasingly recognized as viable biomarkers of disease burden in multiple sclerosis (MS) [1, 2]. While the occurrence of cerebellar signs and symptoms in MS has been well-known [3], the exact contribution of cerebellar damage to disability in MS has still not been fully explored.

The cerebellum is known to play an important role in motor function, coordination and cognitive-behavioural processing [4, 5]. In MS patients, both cerebellar signs and symptoms are significant contributors to the development of disability and often progress despite disease-modifying treatment [6, 7]. Neuropathological studies have shown extensive demyelination in the cerebellar cortex mainly of progressive MS patients [8]. Further cross-sectional neuroimaging studies have confirmed cerebellar volume reductions in patients with MS when compared to healthy controls [9,10,11,12,13]. However, it is currently not clear to what extent atrophy in the cerebellum matters clinically. Also, data from previous MRI studies on cerebellar volume reductions, especially early in the disease course, provided inconsistent findings [10, 14], and the clinical correlations of volume abnormalities were limited. While correlation with the expanded disability status scale (EDSS) was at best modest [10] or non-existing [11, 15, 16], more convincing correlations were observed between cerebellar volumes and clinical measures, which directly reflect fine-motor skills, locomotion or cognition [9, 12,13,14, 17]. The high variability between studies may be explained by the application of different techniques (semi-automated versus automated segmentation, voxel based morphometry, whole versus lobule-wise analyses etc.), heterogeneity in patient cohorts (disease duration, disease types, number of patients) and the fact that mainly cross-sectional study designs were used.

Longer-term observational periods are necessary to better understand the dynamics of pathological changes within the cerebellum and their clinical consequences in MS patients. We therefore analysed clinical and MRI data of a large cohort of patients with relapse-onset MS with up to 11 years of follow-up in order to examine the relationship of cerebellar volume loss and disease progression, and determine whether cerebellar atrophy may serve as a potential biomarker for future disease progression. We applied our in-house developed ‘Rapid Automatic Segmentation of the human Cerebellum And its Lobules’ (RASCAL) [18], showing high accuracy in a previous segmentation challenge (Carass et al. 2018).

Methods

Participants and Study Design

Clinical and MRI data of 163 MS patients, 125 with relapsing–remitting (pwRRMS) and 38 with secondary progressive disease courses (pwSPMS), of an ongoing large-scale cohort study from a single centre (tertiary MS Centre, University Hospital, Basel) were analysed retrospectively (for details see Tables 1 and 2). Patients were followed annually including a clinical visit and MRI over a median of 4 years (here referred to as: period I) plus a clinical long-term follow-up of a median of 6 years (here referred to as: period II). The diagnosis of MS was made in accordance with international panel established criteria [19].

Table 1 Baseline demographics
Table 2 Baseline MRI volumes and annualized volume loss rates

Procedures

All patients received comprehensive assessment annually. This included a standardized neurological examination with Expanded Disability Status Scale (EDSS) by certified neurologists, timed 25-foot walk test (T25FWT), nine-hole peg test (9HPT) with the dominant and non-dominant hand, Paced Auditory Serial Addition Test (PASAT) and Symbol Digit Modality Test (SDMT).

All MRI scans were acquired on the same 1.5 T MR scanner (Magnetom Avanto, Siemens Healthineers, Erlangen, Germany). Morphological analyses were performed on 3D T1-weighted (T1w) magnetization-prepared rapid gradient-echo (MPRAGE) brain MRI scans acquired in sagittal orientation (TR/TI/TE = 2080/1100/3.0 ms; α = 15°, 160 slices, resolution: 0.98 × 0.98 × 1 mm3). Additionally, a double spin echo proton density (PD)/T2-weighted sequence was applied (TR/TE1/TE2 = 3980/14/108 ms; 40 slices 3 mm thick without gap with an in-plane resolution of 1 mm2). Each 3D T1w data set underwent automatic cerebellar and whole-brain segmentation. Total and regional cerebellar volumes were computed using the automated pipeline RASCAL18. The original pipeline was modified using an updated version of the MNI152 reference template with 30 mm larger coverage inferiorly and additionally a study-specific registration template created from 37 scans of the RASCAL library to improve non-linear registration around the cerebellum and brain stem [20]. For each data set, all individual cerebellar lobules, cerebellar peduncles and white matter core of both hemispheres were assessed. Based on the latter, the following volumes were generated: total cerebellar volume (TCV) as the sum of all lobules plus white matter core incl. peduncles; total cerebellar grey matter (CGV) as the sum of anterior lobes (I–V), posterior superior lobes (VI + Crus I) and posterior inferior lobes (Crus II-lobule X) of both hemispheres; and total cerebellar white matter (CWV) as the sum of cerebellar peduncles and white matter core of both hemispheres, including the deep cerebellar nuclei. Cerebellar volumes were normalized for head size and reported in the MNI152 stereotaxic space. Segmentations were visually inspected for quality and excluded from further statistical analysis in case of segmentation errors. White matter lesions were segmented on the PD/T2w images by trained expert MRI readers according to standard operating procedures used at the local institution for the analysis of clinical period II and period III trial as described before [21]. Lesion volumes (LV) were calculated according to their anatomical location with respect to the tentorium (so-called supratentorial and infratentorial LV). A second processing pipeline was set up to compute whole-brain parenchymal volume using the 3D T1w datasets (for details [22]). The cerebral volume contains the remaining tissue after subtracting the brainstem volume and TCV from the whole-brain parenchymal volume (extracted using BEaST [23]).

Statistical Analyses

All statistical analyses were performed using R Version 3.2.3 (https://www.r-project.org/).

The mean annual volume loss rate of each structure was calculated for every patient as the average of the annualized changes between all available time points1. In order to approximate a normal distribution, a logarithmic transformation for the EDSS and 9HPT, an inverse transformation for the T25FWT and cubic transformation for the PASAT were conducted. Further statistical analyses were performed using the transformed clinical scores.

Between-group comparisons of baseline demographic factors, clinical measurements and number of follow-ups were performed using the following tests (where appropriate): Welch’s t-test, Pearson’s chi-squared test with Yate’s continuity correction and Mann–Whitney U tests. Between-group differences for baseline MRI measures and annual rates of change were performed using analyses of covariance (ANCOVA), corrected for age, sex and disease duration.

Linear mixed-effects regression (LMER) analyses were deployed to explore the longitudinal evolution of cerebellar and cerebral volumes as well as the associations between MRI changes and clinical measures (period I data). Analyses were conducted in the whole cohort and for subgroups separately. For simplification, we report only the subgroup analyses within the “Results” section. Results for the whole cohort are reported within the supplementary material only. LMER models were performed in a forward stepwise fashion, using a “random intercept” and a “random slope” to allow for within-subject and between-subject variance. Each factor was tested both for its contribution to the fit’s intercept as well as to the fit’s slope. The fit’s intercept corresponds to the average of the dependent variable, whereas the fit’s slope to the change of the dependent variable over time. Independent variables were entered blockwise keeping the following sequence: first demographics and clinical factors, then cerebellar volumes and finally cerebral volume. All independent variables without statistical significance were excluded from the final model.

In a second step, the predictive capabilities of the cross-sectional cerebellar volumes of the last available MRI (baseline of period II) and the annualized cerebellar volume changes (MRI data from period I) regarding clinical disease progression in the future (clinical data from period II) were analysed with LMER models as described above with baseline volumes added before annualized changes in the model.

All results were corrected for multiple comparisons using the false discovery rate approach set at q < 0.05.

Cerebellar volumes included in the analyses were as follows: TCV, CGV, CWV, anterior lobes, posterior superior and inferior lobes in case of all clinical parameters tested. Further, selected individual lobes depending on the probability to be involved in the processing of the included clinical test (hypothesis driven) were included in the analyses mentioned above, namely lobules I–IV in case of analyses of the T25FWT [24, 25]; sum of lobules IV–VI + VIII in case of analyses of the 9HPT of the dominant (DH9HPT) and non-dominant hand (NDH9HPT) [24, 25] and peduncle volumes in case of analyses of the PASAT and SDMT [12, 26, 27]. In the analyses including the 9HPT as the dependent variable, only the ipsilateral volumes were considered. For simplification we still refer to the volumes as named above.

Results

Data of 125 pwRRMS and 38 pwSPMS were included in the analyses. All patients had a median follow-up of 4 years in period I (clinical visit and MRI) with a subgroup (n = 77) having a median clinical long-term follow-up of 6 additional years (period II). All cerebellar lobes showed volume reduction over time in both groups, with the highest atrophy rate in the anterior lobes. Detailed baseline data are reported in Tables 1 and 2.

Cerebellar Volume Changes — Period I Analyses

Cerebellar volumes (TCV, CGV, CWV, anterior lobe, posterior superior lobe, posterior inferior lobe — see Fig. 1) and their change over time were evaluated with respect to demographic and clinical metrics using LMER (Table 3). Analyses revealed that men, in general, had lower cerebellar volumes compared to women. In addition, in men, volume loss progressed faster compared to women in TCV, CGV, posterior superior lobe and posterior inferior lobe volume. Older age was associated with reduced cerebellar volumes on average and with faster volume loss over time for all cerebellar structures except for the anterior lobe. Disease duration was not associated with average cerebellar volumes or its loss over time. Further, disease subtype was not associated with cerebellar volumes on average, but pwSPMS exhibited faster posterior superior lobe volume loss over time compared to pwRRMS.

Fig. 1
figure 1

Upper row: individual volume trajectories over the follow-up years of RRMS (red) and SPMS (blue) of anterior lobes, posterior superior lobes and posterior inferior lobes. Lower row: individual volume trajectories over the follow-up years of RRMS (red) and SPMS (blue) of total grey matter and white matter as well as the cerebral volume for comparison. The thick blue lines represent the volume trajectories over the observation period for each group; confidence intervals are shown in grey

Table 3 Cerebellar volumes and their change over time with respect to demographic factors

Correcting for significant demographic factors, disease subtype and disease duration, we examined the association between cerebellar volumes, and infra- and supratentorial LV. Infratentorial LV were not associated with average cerebellar volumes or with cerebellar volume loss over time. Supratentorial LV was associated with all cerebellar volumes on average but the anterior lobe and the posterior superior lobe (TCV: B =  − 0.44 ± 0.14, p < 0.01; CGV: B =  − 0.28 ± 0.12, p < 0.05; CWV: B =  − 0.17 ± 0.03, p < 0.001; posterior inferior lobe: B =  − 0.17 ± 0.07, p < 0.05). Changes of supratentorial LV over time were also associated with changes of all cerebellar volumes over time but the anterior lobe (TCV: B =  − 0.04 ± 0.01, p < 0.01; CGV: B =  − 0.28 ± 0.12, p < 0.05; CWV: B =  − 0.17 ± 0.03, p < 0.001, posterior superior lobe: B =  − 0.01 ± 0.004, p < 0.01, posterior inferior lobe: B =  − 0.17 ± 0.07, p < 0.05).

We further examined the association between cerebellar volume and supratentorial cerebral volume, correcting for significant demographic factors, disease subtype and disease duration. Supratentorial cerebral volume was correlated with all cerebellar volumes on average (TCV: B = 0.007 ± 0.004, p < 0.01; CGV: B = 0.007 ± 0.003, p < 0.01; CWV: B = 0.003 ± 0.001, p < 0.01; anterior lobe: B = 0.002 ± 0.001, p < 0.001; posterior superior lobe: B = 0.001 ± 0.001, p < 0.05; posterior inferior lobe: B = 0.004 ± 0.002, p < 0.01). Changes of supratentorial cerebral volume were correlated with changes of all cerebellar volumes over time but CWV and anterior lobe (TCV: B = 0.002 ± 0.001, p < 0.05; CGV: B = 0.002 ± 0.001, p < 0.05; posterior superior lobe: B = 0.001 ± 0.0002, p < 0.001; posterior inferior lobe: B = 0.001 ± 0.0004, p < 0.05).

Cerebellar Volume Changes and Disability — Period I Analyses

LMER in the pwRRMS subgroup (Table 4) showed that all cerebellar volumes were inversely correlated with the average EDSS, T25FWT and directly with the average SDMT. Average D9HPT and ND9HPT were inversely associated with the total volume of the cerebellar hemisphere and the sum of lobules IV, VI and VIII on the respective ipsilateral side. Peduncle and anterior lobe volumes were by trend correlated with the average PASAT. No correlation was found between changes of any clinical score over time and cerebellar volumes.

Table 4 Correlations between cerebellar volumes and clinical scores in RRMS patients

LMER in the pwSPMS subgroup showed a trend to an inverse correlation between posterior superior lobe volume changes and EDSS changes over time (B =  − 10−6 ± 4 × 10−7, q = 0.085; final model: R2m = 67%, R2c = 92%). Cerebral volume did not contribute further in this model. No other associations between cerebellar volumes and clinical scores were found.

Cerebellar Volume Changes and Prediction of Future Disability — Period II Analyses

LMER in the pwRRMS subgroup (Table 5) showed that all baseline cerebellar volumes were inversely correlated with the future average T25FWT and all baseline cerebellar volumes but the posterior inferior lobe were inversely correlated with the future average EDSS and SDMT. Baseline cerebral volume also contributed in these models. However, no cerebellar metrics were associated with future EDSS, T25FWT and SDMT changes over time. Baseline TCV, baseline CGV, baseline posterior superior lobe volume (by trend) as well as the sum of cerebellar lobules IV, VI and VIII (baseline and by trend with annual changes) and annual CWV changes were associated with the average future D9HPT, whereas no cerebellar metrics were correlated with faster D9HPT worsening over time. Further, all baseline cerebellar volumes (but baseline CGV) as well as the annual volume changes of CGV, CWV, posterior inferior lobe and the summed cerebellar lobules IV, VI and VIII were associated with the average future ND9HPT, whereas higher annual atrophy of the summed volumes of cerebellar lobules IV, VI and VIII and by trend of the TCV were correlated to faster ND9HPT worsening over time. Cerebral volume also contributed in the 9HPT analyses. No cerebellar metrics were correlated with PASAT.

Table 5 Correlations between annual cerebellar volume change rates and future clinical scores in RRMS patients

LMER in the pwSPMS subgroup (Table 6) showed that the annual volume change rate of the anterior lobe was positively associated with changes of future ND9HPT over time. Average future PASAT was correlated with annual change rates of CGV, anterior lobe and posterior superior lobe volume, whereas annual volume change rates in the posterior superior lobe were correlated with future PASAT changes over time. No other associations between cerebellar volumes and clinical scores were found.

Table 6 Correlations between annual cerebellar volume change rates and future clinical scores in SPMS patients

Discussion

Previous studies have shown a relation between reduced cerebellar volumes in pwMS and sensory-motor dysfunction as well as cognitive-behavioural tasks [13, 17, 22, 28]; however, little is known on the temporal evolution of cerebellar volume loss and the corresponding deterioration of function. In this work, we analysed a large cohort over a long clinical observation period of up to 11 years. We could show significant volume reduction in the cerebellum and its substructures in relapse-onset MS patients and their relationship between regional cerebellar volume loss and motor and cognitive function over time.

In our cohort, pwSPMS exhibited faster posterior superior lobe volume loss over time compared to pwRRMS, a region known to be activated during various language, working memory and executive function tasks [29]. This is in line with earlier cross-sectional studies describing predominant cerebellar volume reduction in progressive patients over relapsing and benign MS forms [9, 15]. Average infratentorial LV or its change over time was neither associated with average cerebellar volume nor its changes over time, indicating at least partial independence between the formation of regional lesional burden (primarily inflammatory processes) and neurodegenerative processes leading to atrophy. The lack of association may on the other hand be explained by technical challenges in infratentorial space leading to limited lesion detection especially cortical lesions. In contrast, cerebral and cerebellar volumes correlated significantly. There is no preferential volume loss of cerebral over cerebellar tissue, as described before [30].

As to the clinical consequences of cerebellar volume reduction, we could confirm previous data in finding significant relations between average cerebellar volumes on a global as well as on a defined regional level and clinical test performance [10, 13, 17, 22]. However, for pwRRMS no associations were found when analysing the relation between regional volume loss over time and changes in the corresponding task performance over time. In contrast, for pwSPMS volume reduction in the posterior superior lobe (which was discriminative between both groups) over the study period was at least by trend related to increase of EDSS over time. While EDSS represents a measure for predominantly motor disability, the posterior superior lobe (including lobule VI and Crus I) — as mentioned before — has been predominantly related with higher order cognitive tasks (e.g. language, spatial tasks, executive function and affective processing). However, some studies have also shown involvement of this region with motor tasks and suggest relations of the posterior cerebellar regions with higher order motor planning [24, 26, 29, 31,32,33]. These and our findings suggest that not only direct damage of motor pathways may influence EDSS changes, and that neurodegeneration within associative centres affects motor performance in an indirect fashion. In addition, pwSPMS with longer disease duration and older age in general may have reduced adaptive and repair mechanisms to make up for loss of neuronal tissue, which again may translate more directly into dysfunction than in pwRRMS [34, 35].

Additionally, we were interested in determining whether cerebellar structural changes may predict future clinical disease progression. In pwRRMS, atrophy rates of cerebellar lobes and total volumes together with supratentorial cerebral volume were significant predictors of disease severity in terms of motor scores (e.g. average EDSS, T25FWT, 9HPT of both hands) and the SDMT, confirming previous results of cross-sectional analyses [13, 17, 22]. No relation was found between atrophy rates and future cognitive decline in this patient group, which may be partially explained by a more prominent learning effect for both tests (SDMT: 0.459 ± 0.667/year; PASAT2: 14.35 ± 18.65/year) as seen in other studies before [36, 37]. Another explanation may be the concept of “brain and cognitive reserve”, where cognitive performance may not correlate with brain atrophy due to “protective effects” of maximal life time brain volume (genetics) and premorbid intelligence and educational levels (environmental factors) [38].

Interestingly, we showed that lower baseline volumes and higher atrophy rates of ipsilateral hemispherical structures and in particular motor-associated lobules were significant predictors of faster future performance worsening on the 9HPT of the non-dominant hand in all pwMS. The cerebellum, in particular its anterior and superior posterior lobe, is activated as part of a large visuo-motor network including also fronto-parieto-occipital regions when dexterity tasks are performed by healthy controls. It shows even stronger activations when using the non-dominant over the dominant hand [32]. The 9HPT of the non-dominant hand further seems to better reflect real life upper limb disability [39] and may therefore be a more sensitive measure (than e.g. the EDSS) to volume changes even in less affected patients like RRMS. In pwSPMS, we showed that atrophy rates of the posterior superior lobe were a significant predictor of future PASAT performance deterioration. PASAT performance is known as a measure of processing speed, working memory and attention, activating a broad bilateral mainly fronto-parietal network but also including subcortical structures and the anterior and posterior superior cerebellum [40, 41]. Our results underline the importance of the cerebellar structures within this network, where neurodegeneration leads to faster deterioration of processing speed and attention.

General findings in this study were lower head-sized normalized cerebellar volumes in men than women, as previously described [18]. Further, men showed higher rates of volume loss over time especially within the posterior lobe grey matter, an area (amongst other functions) well-known to be part of cognitive-behavioural processing3. This might support the view of male sex being a risk factor for faster disability accumulation in relapse-onset patients [42]. Further, older age was associated with lower volumes and faster volume loss, but disease duration did not seem to be a relevant factor. Unfortunately, for this study no healthy matched control group was available, which could help in estimating the present cerebellar volume reduction in relation to healthy ageing. However, in general, the association between cerebellar volume loss and disability is not affected by this limitation.

The retrospective, longitudinal design is prone to potential bias due to dropout of patients. PwSPMS were smaller in number; however, they showed similar follow-up time than pwRRMS and therefor dropout is probably not due to disease progression. About 69% of patients were treated with disease-modifying therapy with dominance of first-line injectables, which potentially may bias volume analyses. This effect should, however, be negligible for the latter therapeutics [43]. Another limitation is the lack of data on atrophy of the deep cerebellar nuclei, which are an important relay station for motor and cognitive pathways from and to the cerebellum and might better reflect disability progression. Unfortunately, the quality of the T1w MRI data did not allow a reliable segmentation of the latter. High-field MRI data would be of advantage to look into the possible specific predictive value of the cerebellar nuclei for future disease progression.

Conclusion

Overall, we conclude that the cerebellum not only plays an important role for motor and cognitive function in MS, but also reflects decline in clinical motor and cognitive performance and may even serve as a predictor for future disability, especially when dexterity and processing speed performance is in focus.