Introduction

Cerebellar signs are common in multiple sclerosis (MS) patients [27, 44]. In addition to its well-established role in motor coordination, the cerebellum has an increasingly recognized contribution in cognitive performance [27, 44].

In the cerebellum of MS patients, pathological evidence showed extensive white matter (WM) and grey matter (GM) damage, especially in patients with progressive MS (PMS) [19, 21]. MRI studies confirmed higher cortical lesion number and volume in PMS compared to relapsing–remitting (RR) MS patients [7] and more prominent GM atrophy with increasing disability [30]. Regarding correlations with physical disability and cognitive impairment, while earlier studies focused on MRI measures of WM vs GM damage [2, 3], more recent studies investigated lobular patterns of cerebellar GM damage. Consistently with their functional specialization [37], damage to anterior lobes correlated with physical disability and involvement of posterior lobes correlated with cognitive impairment [11, 13, 26]. Functional plasticity mechanisms occurring in the cerebellum may contribute to compensate for MS structural damage [10, 35].

The cerebellum is a highly interconnected structure, whose functioning is critically dependent onto input and output pathways [37, 38]. Cortico-ponto-cerebellar (passing through middle cerebellar peduncle [MCP]) and cerebello-thalamo-cortical (through superior cerebellar peduncle [SCP]) pathways participate to motor control and cognitive functions. Spino-cerebellar (through inferior cerebellar peduncle [ICP]) and cerebello-spinal (through MCP) pathways provide rapid balance, posture and movement control [37]. Diffusion tensor (DT) MRI measures of microstructural damage of MCP and SCP better differentiated patients with and without cerebellar symptoms, compared with cerebellar T2-hyperintense lesion volume (T2-LV) and atrophy [30].

Our study rationale was twofold. First, we applied a multiparametric MRI approach (lesions, atrophy and microstructural abnormality quantification) evaluating cerebellum, brain and cervical spinal cord (CSC), to detect abnormalities of these structures and to identify predictors of physical disability and cognitive impairment in MS patients according to their clinical phenotype. Second, hypothesizing that damage to input and output pathways might partly explain cerebellar symptoms/signs in MS, we applied a data-driven clustering technique on previous MRI variables in patients with cerebellar disability, to identify the main pathophysiological mechanisms underlying clinically evident cerebellar dysfunction.

Materials and methods

Ethics committee approval

The local ethical standards committee on human experimentation approved this hypothesis-driven analysis of prospectively acquired data. Written informed consent was obtained from all participants.

Subjects and clinical assessment

From June 2017 to January 2020, we consecutively enrolled 164 MS patients [39] and 53 healthy controls (HC), as detailed in Fig. 1 and Supplemental methods. On the day of MRI acquisition, all subjects underwent neurological examination with assessment of clinical phenotype [23] and Expanded Disability Status Scale (EDSS) score, and neuropsychological evaluation, with the Brief Repeatable Battery of Neuropsychological tests (BRB-N) [31]. Domain specific and global cognition (BRB-N) z-scores were calculated as detailed in Supplemental Methods.

Fig. 1
figure 1

Study flow-chart. This figure illustrates enrollment of MS patients and healthy controls. MS multiple sclerosis, CSC cervical spinal cord, GM grey matter, PSIR phase-sensitive inversion recovery

MRI acquisition

Using a 3.0 Tesla Philips scanner, the following sequences of the brain were acquired: 3D T2-weighted fluid-attenuated inversion-recovery, T2-weighted turbo spin echo, T1-weighted turbo field echo, and diffusion-weighted imaging. During the same scanning session, the following CSC sequences were obtained: 3D T1-weighted turbo field echo, T2-weighted fast field echo, and 2D phase-sensitive inversion recovery (PSIR) at the C2-C3 intervertebral disk level. See Supplemental Methods for sequence geometry details.

MRI analysis

T2-LV was calculated using a fully automated approach using 3D FLAIR and 3D T1-weighted sequences [41]. For the whole brain, supratentorial brain, brainstem, cerebellum GM, and cerebellum WM, a “percentage T2-LV” (pT2-LV) was calculated dividing T2-LV by the volume of the same region (see below), indicating the percentage of tissue affected by lesions.

On 3D T1-weighted lesion-filled images [42], normalized brain, GM, cortical GM and WM volumes were calculated using SIENAX software. Automated segmentation of the thalamus and basal ganglia (caudate, putamen, pallidum and accumbens) was performed using FIRST software [28]. Volume of these structures was multiplied by the head-normalization factor derived from SIENAX, and summated between left and right sides. The thalamus was considered separately due to its role in explaining cognitive and motor deficits in MS [25] and in cerebello-thalamo-cortical connections [37].

Cerebellar volumes were calculated using the Spatially Unbiased Infratentorial Toolbox (SUIT), which is a dedicated tool for the accurate segmentation of posterior fossa structures [16]. Lobular volumes were computed as the sum of their right and left hemispheric portions. Anterior and posterior cerebellar volumes were calculated as the sum of lobules I–V and VI–X, respectively. Cerebellar vermis was considered separately.

Diffusion-weighted MRI analysis

Diffusion-weighted images were pre-processed for correcting movement and distortions, and fractional anisotropy (FA) and mean diffusivity (MD) maps were derived for each subjects. Then, an in-house WM atlas was used for extracting DT metrics in each cerebellar peduncle. Given the symmetry of EDSS cerebellar functional system score (EDSS-C), we averaged DT metrics for left and right tracts before statistical analysis (see Supplemental Methods for more details on analysis).

CSC MRI analysis

CSC MRI analysis was performed as previously described [6], obtaining pT2-LV between C1 and C5 vertebral levels for the whole CSC, the GM, and dorsal columns (which include spinocerebellar pathways); and cross-sectional area (below just “area”) of the global CSC and GM at C2–C3 vertebral level. Intra- and inter-rater reliability for manual GM area assessments showed intra-class correlation coefficients of 0.98 and 0.90 [6].

Statistical analysis

Continuous demographic and clinical variables were compared between groups using a two-sample t-test or Mann–Whitney test, according to variable distribution. The chi-square test was applied for categorical data.

All T2-LV measures underwent square-root transformation. Cerebellar volumes were head-size corrected by building an age- and sex-adjusted linear model between raw cerebellar volume and inverse of SIENAX head-normalization factor in HC. The estimated regression coefficient was used for adjusting cerebellar volume. The ratio between adjusted and raw cerebellar volume was used for scaling raw volumes of cerebellar lobules, preserving their additive properties. All MRI variables were compared using age- and sex-adjusted linear models, false-discovery rate (Benjamini–Hochberg procedure) corrected [5]. Given the longer disease duration in PMS patients, we retested the PMS vs RRMS contrast, including disease duration as additional covariate in a sensitivity analysis.

Random forest regression models were performed to identify MRI variables associated with physical EDSS score and cognition z-score in all MS, RRMS and PMS patients, including age, sex, disease duration and phenotype in the analyses for adjustment purposes. To gain more insight into MRI predictors of performance in each cognitive domain, we built similar models identifying MRI variables associated with the z-score of each cognitive domain in all MS patients. For each model, 10,000 regression trees were built on a random subset of covariates, with a bootstrap resampling of the observations. A permutation test (1000 permutations) of the outcome was applied to assess feature relevance, providing an unbiased measure of variable importance and significance p values for each predictor [1]. The goodness of fit of a new model, trained using only the selected predictors (p < 0.05), was expressed by the out-of-bag (OOB)-R2, the coefficient of multiple determination computed on the left-out observations.

To study the substrates of cerebellar disability, we ran a random forest classification model for identifying MRI variables associated with an EDSS-C ≥ 2, adjusting for age, sex, disease duration and phenotype. The model OOB-accuracy was reported. In accordance with a previous study [22], this cut-off was selected to include only patients with end-stage damage (i.e., disability) to the cerebellar system, to prevent MRI analysis from being contaminated by adaptive mechanisms (e.g., plasticity preserving GMV) [34], which may be prominent in patients with cerebellar signs but no disability (i.e., EDSS-C = 1). Furthermore, we applied a data-driven clustering technique on MS patients having EDSS-C ≥ 2. We ran a principal components analysis on all MRI variables, for dimensionality reduction. The first eight components, showing eigenvalues greater than one, according to Kaiser’s rule, and capturing 81% of the variance, were retained. These latent variables were finally used to perform an agglomerative hierarchical clustering analysis, using Ward linkage, in MS patients with cerebellar disability. The optimal number of clusters was determined by a consensus voting approach across 23 different indices using NBclust R package [9]. Differences among clusters in demographic, clinical and MRI variables were assessed by false-discovery rate corrected [5] Kruskal–Wallis and pairwise Wilcoxon rank-sum tests.

For all analyses, statistical significance was set at p < 0.05 (SAS Software, version 9.4, and R Software, version 4.0.3).

Data availability statement

The anonymized dataset used and analyzed during the current study is available from the corresponding author on reasonable request.

Results

Clinical characteristics

The study included 164 MS (89 RRMS, 75 PMS) patients (97 women aged 21–70 years [mean 44 years] and 67 men aged 20–71 years [mean 45 years]) and 53 HC (26 women aged 19–72 years [mean 43 years] and 27 men aged 19–71 years [mean 44 years]) (Fig. 1). Table 1 summarizes their main demographic and clinical characteristics.

Table 1 Main demographic and clinical characteristics of healthy controls and MS patients (as a whole and according to MS clinical phenotype)

MRI variables

Table 2 summarizes results of MRI analyses and between-group comparisons. Compared to HC, RRMS patients had reduced volume of all examined global brain, supratentorial and infratentorial structures (p ≤ 0.001), except for brain, cortical and cerebellar GMV. Compared to HC and RRMS, PMS patients had reduced volume of all global brain, supratentorial and infratentorial structures (p < 0.001).

Table 2 Main MRI metrics in healthy controls and MS patients and between-group comparisons

In the cerebellum, compared to HC, RRMS patients had reduced GMV of posterior lobe (p = 0.02), Crus II (p = 0.01), lobule VIIb (p = 0.02) and lobule X (p = 0.05), whereas PMS patients had reduced GMV of both lobes, vermis and all single lobules (p < 0.001). Compared to RRMS, PMS patients showed reduced GMV in all cerebellar compartments (p ranging from 0.005 to < 0.001). Compared to HC, RRMS and PMS patients had increased SCP, MCP and ICP MD (p ranging from < 0.001 to 0.04). PMS patients also had reduced SCP (p = 0.04), MCP (p = 0.03) and ICP (p < 0.001) FA. Compared to RRMS, PMS patients showed reduced ICP FA (p = 0.01) and increased ICP MD (p = 0.003).

Compared to HC, both groups of MS patients had reduced global and GM CSC areas (p ranging from 0.004 to < 0.001). Compared to RRMS, PMS patients had higher global, dorsal column and GM CSC pT2-LV and reduced global and GM CSC areas (p < 0.001).

After adjusting for disease duration, significant differences between PMS and RRMS patients survived in terms of GM atrophy in all explored areas, brainstem global CSC atrophy, infratentorial and CSC pT2-LV, and ICP microstructural damage, while WM atrophy and supratentorial pT2-LV became not significant (Table 2).

Informative predictors of physical disability and cognitive impairment

Table 3 and Fig. 2 summarize results of random forest analysis. In MS patients, predictors of higher EDSS score (OOB-R2 = 0.83) were (in decreasing order of importance): lower CSC GM area, higher CSC GM pT2-LV, lower NBV, lower brain GMV, higher brainstem pT2-LV, lower CSC global area, lower brain cortical GMV, lower cerebellum lobules I–IV and lower vermis GMV. In RRMS patients, predictors of higher EDSS score (OOB-R2 = 0.35) were: higher brainstem and CSC GM pT2-LV, lower CSC global area, higher MCP and cerebellum WM pT2-LV. In PMS patients, predictors of higher EDSS score (OOB-R2 = 0.31) were: lower CSC GM area, lower cerebellum lobules I-IV GMV, lower NBV and lower brain GMV.

Table 3 Informative predictors of physical disability and cognitive impairment in MS patients
Fig. 2
figure 2

Informative MRI predictors of physical disability and cognitive function. This figure illustrates informative MRI predictors (random forest analysis, p < 0.05) of: A EDSS score; B cognition z-score. Analysis was performed for all MS patients (left column), RRMS patients (middle column) and PMS patients (right column). Relative importance of MRI predictors is represented by the width on the x-axis of the bar corresponding to the predictor. Color of bars reflects the strength of association of the predictor with the dependent variable, as quantified by Spearman ρ (white to red for positive associations, white to blue for negative associations). MS multiple sclerosis, RRMS relapsing–remitting multiple sclerosis, PMS progressive multiple sclerosis, NBV normalized brain volume, GMV grey matter volume, pT2-LV percentage T2-hyperintense lesion volume, GM grey matter, WM white matter, MCP middle cerebellar peduncle, SCP superior cerebellar peduncle, CSC cervical spinal cord

In MS patients, predictors of lower cognition z-score (OOB-R2 = 0.25) were: higher supratentorial and SCP pT2-LV, lower NBV, lower thalamus volume, lower cerebellum lobule VIIIb GMV, lower basal ganglia volume, lower brain GMV and lower cerebellum Crus II GMV. In RRMS patients, predictors of lower cognition z-score (OOB-R2 = 0.18) were: lower thalamus volume, higher supratentorial and SCP pT2-LV, lower NBV, lower posterior cerebellum and cerebellum Crus II GMV, and higher cerebellum WM pT2-LV. In PMS patients, predictors of lower cognition z-score (OOB-R2 = 0.22) were: lower basal ganglia volume, lower brain GMV, lower cerebellum lobule VIIIb GMV, lower thalamus volume, higher supratentorial and lower cerebellum Crus II GMV.

MRI predictors of EDSS-C ≥ 2 and cognitive domain-specific z-scores were also investigated in MS patients, as reported in Table 4.

Table 4 Informative predictors of cerebellar disability (EDSS-C ≥ 2) and performance in each cognitive domain in MS patients

Characteristics of MS patients with cerebellar disability

Supplementary Table 1 summarizes clinical and MRI characteristics of MS patients with cerebellar disability. According to the optimal number of clusters criterion (as suggested by 13 out of 23 indices), 3 groups of MS patients with cerebellar disability were identified from data-driven hierarchical clustering of MRI variables. The first cluster, including 18 patients and the highest percentage of RRMS patients (n = 4, 22%), was named “lesions” due to highest supratentorial T2-LV and high T2-LV in infratentorial compartments. The second cluster, including 24 patients (2, 8%, RRMS), was named “cerebellum” due to marked cerebellum GMV reduction compared to the others. The third cluster, including 14 PMS and no RRMS patients, was named “cord” due to worst CSC damage, in terms of both highest CSC pT2-LV and lowest CSC GM area.

Figure 3 summarizes differences in demographic and clinical variables between groups. “Cerebellum” patients (mean age = 54, SD = 7 years) were older compared to “lesions” (mean age = 50, SD = 11 years; p = 0.05) and “cord” (mean age = 46, SD = 10 years; p = 0.02) patients. Sex, EDSS and EDSS-C had similar distributions among groups. “Cord” patients had shorter disease duration (median = 17, IQR = 10–20 years) compared to “cerebellum” patients (median = 23, IQR = 19–27 years; p = 0.02) but not to “lesions” patients (median = 20, IQR = 16–25 years; p = 0.21). “Lesions” and “cerebellum” patients had similar cognitive function compared to each other, while they had worse cognitive function compared to “cord” patients (p = 0.01 and p = 0.03, respectively).

Fig. 3
figure 3

Group comparisons of demographic and clinical variables among data-driven clusters of MS patients with cerebellar disability. This figure illustrates the distribution of demographic and clinical variables among the 3 clusters of MS patients with cerebellar disability (“lesions”, “cerebellum” and “cord” patients). Regarding sex distribution, females are represented in salmon pink and males in green–blue. For the other graphs, RRMS patients are represented as blue dots or bars, and PMS patients as red dots or bars. EDSS Expanded Disability Status Scale

Figure 4 summarizes between-group differences in MRI variables. NBV, brain GMV and WMV, thalamus volume and brain cortical GMV were similar in “lesions” and “cerebellum” patients, but reduced in “cord” patients (p ≤ 0.01). Supratentorial pT2-LV was higher in “lesions” compared to “cerebellum” patients (p = 0.01), and in the latter compared to “cord” patients (p = 0.01).

Fig. 4
figure 4

Group comparisons of MRI variables among data-driven clusters of MS patients with cerebellar disability. This figure illustrates the distribution of MRI variables among the 3 clusters of MS patients with cerebellar disability (“lesions”, “cerebellum” and “cord” patients). For illustrative purposes, MRI variables were grouped into an upper part, with the 3 clusters and MRI variables most significantly affected in each cluster (AC), and a lower part with other MRI variables (D). A MRI variables most significantly affected in “lesions” patients; B MRI variables most significantly affected in “cerebellum” patients; C) MRI variables most significantly affected in “cord” patients; D other global brain, supratentorial and infratentorial MRI variables. HC are only shown as a reference group for illustrative purposes. p values are derived from Kruskal–Wallis test among groups of MS patients, false discovery rate (FDR) corrected for multiple comparisons. Pairwise comparisons between the groups of patients are reported in the results section. HC are colored in green, RRMS patients in blue, PMS patients in red. All pT2-LV measures are displayed on square-root scale. Each cluster of MS patients is accompanied by an illustrative vignette underscoring the main damage mechanism. Small red arrows highlight MS lesions. HC healthy controls, NBV normalized brain volume, GMV grey matter volume, WMV white matter volume, pT2-LV ercentage T2-hyperintense lesion volume, GM grey matter, WM white matter, ICP inferior cerebellar peduncle, MCP middle cerebellar peduncle, SCP superior cerebellar peduncle, FA fractional anisotropy, MD mean diffusivity, CSC cervical spinal cord

Brainstem volume was lowest in “cerebellum” compared to “lesions” (p = 0.01) and “cord” (p < 0.001) patients. Likewise, cerebellum GMV volume was lowest in “cerebellum” compared to “lesions” and “cord” (p < 0.001 for both) patients. Cerebellum lobe and lobule GMV showed similar distributions among groups (data not shown). No differences were observed for cerebellum WMV. Brainstem and cerebellum WM pT2-LV were higher in “lesions” and “cerebellum” compared to “cord” patients (p < 0.05). Cerebellum GM pT2-LV was higher in “lesions” compared to “cerebellum” (p = 0.03) and “cord” (p = 0.005) patients. SCP pT2-LV was higher in “lesions” compared to “cerebellum” patients (p = 0.01), and in the latter compared to “cord” patients (p = 0.01). MCP and ICP pT2-LV were higher in “lesions” and “cerebellum” compared to “cord” patients (p < 0.05). Likewise, cerebellar peduncle MD were higher, and ICP FA was reduced, in “lesions” and “cerebellum” compared to “cord” patients (p ≤ 0.01).

CSC pT2-LV was higher in “cord” compared to “lesions” (p = 0.02) and “cerebellum” (p = 0.01) patients. CSC GM area was reduced in “cord” compared to “lesions” (p = 0.02) and “cerebellum” (p = 0.01) patients.

Discussion

This multiparametric MRI study aimed to evaluate in-vivo damage to the cerebellum and its role in explaining physical disability and cognitive impairment in MS. Differently from previous work [2, 3, 7, 11, 13, 26, 30], RRMS and PMS patients were assessed separately, given their known immunological, pathological and MRI differences [12, 19, 21]. Furthermore, we included an assessment of CSC damage, given the central role of spinocerebellar pathways for motor cerebellar functions [37].

Results of between-group comparisons of global brain T2-LV and atrophy measures were in line with existing literature [6, 33]. They confirmed more severe damage in PMS compared to RRMS patients [19, 33]. Cerebellum WMV—but not GMV—was reduced in RRMS patients compared to HC, likely reflecting earlier WM atrophy and an orderly process of GM atrophy in MS [17, 33]. Posterior cerebellum and lobules Crus II, VIIb, X were atrophied in RRMS compared to HC. Crus II and lobule VIIb belong to areas of the cerebellum with known cognitive functions, possibly reflecting the often subclinical nature of cognitive dysfunction in RRMS patients [8, 14]. Lobule X corresponds to the flocculonodular lobe, sitting in close proximity with the forth ventricle, making it susceptible to cerebrospinal fluid-mediated damage [15]. In line with global analysis, all cerebellar compartments were atrophic and showed higher pT2-LV in PPMS compared to RRMS patients. Cerebellar peduncles had abnormal DT-MRI indices in PMS patients vs HC, and increased MD in RRMS patients, indicating demyelination and axonal damage [19, 30]. Compared to RRMS, PMS patients showed more severe DT-MRI abnormalities in the ICP, possibly as a consequence of the myelopathy that characterizes PMS, affecting spinocerebellar pathways (the main component of the ICP). Interestingly, even after adjusting for disease duration, MRI variables investigating damage to brain GM, infratentorial and CSC compartments were significantly more affected in PMS vs RRMS patients, pointing towards a disproportional relevance of damage to these areas in determining the PMS phenotype.

Random forest analysis underscored the central role of CSC GM pT2-LV and atrophy in explaining physical disability in MS patients, in agreement with existing literature [6, 36]. Brain GMV and NBV reduction were also relevant, though their relative importance was lower compared to CSC metrics [33]. Finally, higher brainstem pT2-LV and reduced cerebellar GMV (especially cerebellum lobules I-IV and vermis) predicted worse physical disability, in line with a body of evidence implicating the anterior cerebellar lobe and the vermis—and their connections through the brainstem—in motor function [11, 13, 30, 37, 44]. Furthermore, CSC GM atrophy, SCP pT2-LV, cerebellum GMV, cerebellum lobule VIIa Crus II GMV and brainstem volume and pT2-LV emerged as informative predictors of cerebellar disability, pointing towards damage to infratentorial and CSC areas as determinant for cerebellar-type physical disability (see below for further discussion).

Random forest analysis also confirmed the centrality of supratentorial brain damage in explaining worse global cognitive function in MS patients. Supratentorial T2-LV was the most important predictor, reflecting the fundamental role of WM lesions for cognitive dysfunction in MS, possibly causing a disconnection syndrome [24, 32]. SCP pT2-LV emerged as the second most relevant predictor, supporting: (1) a relevant role of cerebellar damage in cognitive impairment in MS; (2) the central role of damage to philologically important, non-redundant, long WM pathways in determining clinical deficits in MS patients [24]. Other relevant predictors of cognitive functioning were atrophy of the whole brain and deep GM [4, 14], and posterior cerebellum GM [11, 13, 26]. Cerebellum lobule VIIa Crus II was the most consistently implicated lobule, in line with previous evidence on information processing speed in MS patients [26], and cognitive processing in HC [43]. Lobule VIIIb, previously implicated in attention and working memory [37], also emerged as a critical area. In domain-specific random forest analysis, SCP pT2-LV was mainly involved in visuospatial memory, attention and verbal fluency performance; cerebellum lobule VIIa Crus II in visuospatial memory and attention performance; and cerebellum lobule VIIIb in visuospatial memory and attention performance. MCP pT2-LV emerged as an additional predictor of verbal fluency performance in MS patients.

Interestingly, random forest OOB-R2 index, which reflects the proportion of the variance in the dependent (clinical) variable predictable from independent (MRI) variables, was higher for EDSS than cognition model. This result likely reflects the efficiency of structural MRI techniques assessing lesions, atrophy and WM microstructural damage in grasping MS-related damage to subcortical structures, which largely impacts on physical disability [19, 33]. Other mechanisms not investigated in this study, such as the presence and efficiency of functional plasticity, might contribute to explain cognitive function [32, 35].

The results of random forest analysis in RRMS and PMS patients showed that different mechanisms contribute to explain disease clinical manifestations in the different stages of the disease. Indeed, lesions played a major role in RRMS, whereas GM atrophy measures were more important in PMS, suggesting a transition from (mainly) a “disconnection syndrome” in RRMS to a neurodegenerative condition in PMS [19, 33].

The second part of the study aimed to shed light on the pathophysiology of cerebellar disability, through data-driven clustering, which is a promising novel tool for understanding MS pathophysiology [18, 40]. Three clusters of MS patients with cerebellar disability were identified: “lesions”, “cerebellum”, and “cord”. “Cord” patients had better cognitive function compared to the other two groups, which is somehow expected, given their less pronounced abnormalities of brain MRI measures. Despite no clinical differences in EDSS score, these clusters of MS patients showed some important differences in MRI metrics. In spite of lower supratentorial pT2-LV, “cerebellum” patients had similar NBV, brain GMV and thalamus volumes compared to “lesions” patients, likely indicating a predisposition to neurodegeneration, at least in part explained by the older age. Likewise, “cerebellum” patients showed lower cerebellum GMV and brainstem volume despite similar infratentorial pT2-LV compared to “lesions” patients, confirming the predisposition to neurodegeneration. Interestingly, “lesions” patients had the highest cerebellum GM pT2-LV, suggesting GM lesions and GM neurodegeneration are two distinct phenomena [20]. DT-MRI metrics of cerebellar peduncles showed similar distributions to brainstem and cerebellum WM pT2-LV, likely reflecting lesion-mediated damage to the normal-appearing WM. Damage to cerebellar peduncles (in terms of focal lesions and microstructural damage) and impaired cerebellum-cortex connectivity (due to supratentorial brain damage) are likely to represent the two pathophysiological mechanisms underlying cerebellar disability in the cluster of “lesions” patients. Instead, it is tempting to hypothesize the cluster of “cerebellum” patients may show a predisposition to neurodenegeration, leading to marked cerebellar atrophy, probably initiated by a significant infratentorial T2-LV, but possibly also mediated by cerebrospinal fluid -vehiculated toxic mediators [15, 19]. Indeed, the cerebellum is a plicated structure, with a high surface area in direct contact with the cerebrospinal fluid. Interestingly, a previous study [29] highlighted the key role of early cerebellar atrophy in predicting a poor prognosis in MS, and an association of cerebellar atrophy with cerebrospinal fluid beta-amyloid burden. On the other hand, “cord” patients had rather isolated CSC damage, with preserved brain, brainstem, and cerebellar volumes, and lower damage to cerebellar peduncles, compared to the other two clusters. This supports a relevant role of spinocerebellar and (indirect) cerebello-spinal pathways in determining cerebellar disability in a subset of MS patients.

This study has clinical relevance. First, damage to the cerebellum, cerebellar peduncles and brainstem predicted physical disability and worse cognition, underscoring the prognostic relevance of infratentorial lesion burden (easily assessable through conventional clinical MRI) for motor and cognitive outcomes. Second, this study delineates the highly heterogenous nature of MS, with complex interplays of different types of damage to several structures in explaining physical disability and cognitive dysfunction. Finally, it clearly separated a subset of MS patients with cerebellar disability and MRI evidence of GM neurodegeneration (namely, “cerebellum” patients), only partly explained by an older age, in agreement with a previous interesting study on beta-amyloid cerebrospinal fluid burden in MS [29]. These patients might benefit from the application of atrophy measures from a proper monitoring of their clinical manifestations.

The study is not without limitations. First, it was cross-sectional, and longitudinal associations between MRI variables and progression of physical disability and cognitive dysfunction remain to be determined. Second, despite pathophysiological similarities, there are also differences between PPMS and SPMS patients, but the small number of PPMS patients did not allow a separate analysis. Third, CSC analysis is still limited by technical issues, limitation to the upper CSC and difficulty in accurately delineating the GM border.

In conclusion, damage to cerebellum GM and connecting structures explains a significant proportion of physical disability and cognitive dysfunction in MS patients, underscoring clinical relevance of posterior fossa lesions and atrophy. Data-driven identification of three MRI-subtypes of patients with cerebellar disability (high brain lesion volumes, cerebellum grey matter atrophy and severe cord damage) might improve our knowledge of MRI-clinical correlations.