Introduction

Cognitive impairment (CI) is a common and debilitating feature of multiple sclerosis (MS). It occurs in 40%–70% of MS patients and severely impacts on quality of life, employment status and adherence to therapy(Rao et al. 1991b; Julian 2011; DeLuca et al. 2015). Cognitive domains most commonly impaired in MS are attention and processing speed (A-PS), executive functions, immediate and delayed recall or memory and verbal fluency (Chiaravalloti and DeLuca 2008; DeLuca et al. 2015). Heterogeneity in frequency and characteristics of cognitive deficits among patients with MS reflects the influence of many factors, including genetics, sex, intelligence, disease course, comorbid neuropsychiatric illness, and health behaviors (Benedict and Zivadinov 2011). CI starts early in the disease course, generally worsens over time and is not strictly linked to physical disability (Julian 2011). CI has also shown a prognostic value in predicting the conversion from clinically isolated syndrome to MS. For this reason, its detection should influence therapeutic decisions (Zipoli et al. 2010). The Symbol Digit Modalities Test (SDMT), which primarily assesses A-PS and sustained attention (Rao et al. 1991a; Smith 1982), is among the most reliable, sensitive (Parmenter et al. 2007) and clinically meaningful cognitive tests in MS, being also able to predict the vocational status of MS patients (Morrow et al. 2010; Strober et al. 2012).

SDMT performance have shown moderate-to-strong correlations with a series of brain magnetic resonance imaging (MRI) markers of tissue damage (Rao et al. 2014), such as T2 lesion volume (T2-LV), and global brain atrophy (Rao et al. 2014).

Nevertheless, in order to further explore the pathophysiology of A-PS deficits in MS as well as to ameliorate clinical-MRI correlations, regional structural-MRI studies have recently been proposed and implemented. One of the most feasible and promising approaches is represented by voxel based morphometry (VBM) (Good et al. 2001), an unbiased voxel-wise spatially-specific method of MRI analysis, which allows to assess regional gray matter (GM) volume and to compare it between groups of subjects, as well as to correlate it with clinical variables.

Some authors have already used this approach to explore the relationship between regional GM atrophy and SDMT performance (Sastre-Garriga et al. 2009; Nocentini et al. 2014). These studies, however, have showed conflicting results. In particular, Sastre Garriga et al. described no significant correlation between SDMT and regional brain volumes (Sastre-Garriga et al. 2009). On the contrary, Nocetini et al. found that performances in SDMT were associated with a reduction of GM volumes in several regions, such as the prefrontal cotex, the pre- and postcentral gyrus, and in the right temporal cortex (Nocentini et al. 2014). However, it is worth noting that both studies (Nocentini et al. 2014; Sastre-Garriga et al. 2009) were conducted on a small sample size - 15 MS patients in the Sastre-Garriga study; 18 MS patients and 18 healthy controls (HC) in the Nocentini study - and with VBM analysis not controlling for the potential detrimental impact of T1 WM lesions on brain segmentation (i.e. without a lesion-filling of 3D T1 images) (Ceccarelli et al. 2012; Battaglini et al. 2012). Furthermore, the Sastre-Garriga study only included cognitively impaired MS patients and was conducted using a 1.5 T MRI scanner (Sastre-Garriga et al. 2009).

In recent years, MRI studies have tried to move from group-based analysis, such as VBM, to individual measures, mainly through the implementation of automated methods of brain segmentation, such as the FIRST tool from the FMRIB Software Library (FSL) (Patenaude et al. 2011) or FreeSurfer (Dale et al. 1999). Using these approaches, A-PS performances have been found to relate to deep GM structures volumes, especially the thalamus and putamen (Batista et al. 2012; Schoonheim et al. 2012; Houtchens et al. 2007; Minagar et al. 2013; Bergsland et al. 2016; Benedict et al. 2013).

Based on this knolewdge, the objectives of the present study were: 1) to further explore, by using VBM in a large MS population, which GM regions subtend SDMT performance in MS, after controlling for the effect of focal WM lesions; 2) to assess, by usingVBM, which SDMT-correlated GM regions are atrophic in MS patient compared to HC; 3) to explore which of the SDMT-correlated and atrophic GM regions, identified by VBM and measured by FIRST, contributes to A-PS performance in MS patients.

Our working hypothesis was that A-PS impairment in MS patients might be subtended by atrophy of specific brain GM regions.

Methods

Study population

125 relapsing remitting (RR) MS patients (Polman et al. 2011) were consecutively enrolled at our MS center. 52 HC matched for sex, age and education were recruited from a large HC database available at our Institution (Table 1). Inclusion criteria were: 1) age between 18 and 65 years; 2) no current or past disorders, other than MS (for patients), which could affect cognitive performance at SDMT; 3) no concomitant treatments with psychoactive drugs; 4) absence of motor, sensory and/or visual deficits that might interfere with cognitive test performance; 5) no relapses and/or steroid treatments within 1 month before participating in the study (for patients); 6) normal MRI findings and preservation at SDMT (see the “Clinical and neuropsycological assessment” section) in HC.

Table 1 Main demographic, clinical and conventional magnetic resonance imaging characteristics of patients with multiple sclerosis (MS) and healthy controls (HC) enrolled in the study

The study was approved by the local Ethic Committee and a signed informed consent was obtained from all participants.

Clinical and neuropsycological assessment

On the same day of MRI acquisition all the subjects underwent: 1) a neurological examination including Expanded Disability Status Scale (EDSS) (Kurtzke 1983) (for MS patients); 2) SDMT administration (Rao et al. 1991b): all subjects were required to verbally pair numbers and symbols according to a fixed pattern. The final score was determined by the amount of pairings solved correctly within 90 s (Smith 1982); SDMT scores were converted into z scores using available normative data of the Italian population (Amato et al. 2006). HC with an SDMT Z-scores ≤ 1.5 (Amato et al. 2010) were excluded from the study.

MRI acquisition

Brain MRI scans were acquired on a 3 T GE Medical System (Milwaukee, WI) scanner equipped with an 8-channel parallel head coil. The following images were acquired: 1) DP/T2 weighted (dual-echo (DE) fast spin echo (FSE), repetition time [TR] = 3080 ms, echo time [TE] 1/TE2 = 24/127.5 ms, axial slices =44/44, matrix =256 X 384, field of view [FOV] = 240 mm, slice thickness = 3 mm, interslice gap =0 mm); 2) high-resolution 3D–T1 (magnetization prepared Fast Spoiled Gradient echo , TR = 6988 ms, TI = 650 ms, TE = 2,85 ms, matrix =256 × 256, slice number = 166, sagittal slices, flip angle =8°, voxel size =1 X 1 X 1.2 mm3, FOV = 256, sagittal).

Conventional MRI analysis

The identification of T2 hyperintense lesions in MS patients was conducted on DP/T2 images by a single experienced observer blinded to patients’ clinical characteristics. The MIPAV software (Medical Image Processing, Analysis and Visualization; version 4.2.2; http://mipav.cit.nih.gov) was used to contour lesions and to compute T2-LV of each patient. Normalized brain (NBV), WM (NWMV), GM (NGMV) and peripheral GM volume (NPGMV) volumes were measured on 3D–T1 images using the SIENAx software (Smith et al. 2002), after T1-hypointense lesion refilling (Battaglini et al. 2012).

VBM-analysis (Good et al. 2001)

VBM analysis was performed using SPM12 software (Wellcome Trust Centre for Neuroimaging, London, UK; http://www.fil.ion.ucl.ac.uk/spm) on 3D–T1 lesion-filled images (Battaglini et al. 2012). Images were bias-corrected, tissue-classified, and registered using linear (12-parameter affine) and non-linear transformations (warping) within a unified mode, with default parameters incorporating the DARTEL toolbox (Ashburner 2007). Subsequently, the warped GM segments were affine-transformed into Montreal Neurological Institute (MNI) space and were scaled by the Jacobian determinants of the deformations to account for the local compression and stretching that occurs as a consequence of the warping and affine transformation (modulated GM volumes). The modulated volumes were smoothed with a Gaussian kernel of 8-mm full-width at half maximum (FWHM).

Thalamic, putamen and cerebellar atrophy measures

Thalamus, Putamen and Cerebellum were segmented from the 3D T1-weighted images in native space using the FIRST tool from the FMRIB Software Library (Patenaude et al. 2011). The results of segmentation were all visually checked. Normalized volumes were calculated from the FIRST output using the SIENAx scaling factor (Smith 2002).

Statistical analysis

A Kolmogorov–Smirnov test was used to verify normal distribution of demographic, clinical and conventional MRI variables. Between-group comparisons were performed using a Student’s t-test for unpaired samples, the Mann-Whitney and Chi-square tests, as appropriate. A p < 0.05 was considered statistically significant. Correction for multiple comparisons was performed by applying the Bonferroni method (SPSS Statistics version 20.0). The GM volume maps were statistically analysed using the general linear model based on Gaussian random field theory. Correlation between GM volume and SDMT Z-scores was assesed both in MS and in HC group using multiple regression analysis with total intracranial volume (TIV), age and sex as covariates of no interest. Regional differences in GM volume between the two experimental groups (MS patients and HC) were also assessed with TIV, age and sex as covariates of no interest. Statistical inference was performed at the voxel level, with a family-wise error (FWE) correction for multiple comparisons. Clusters were considered significant at p < 0.05. Pearson product moment correlations were examined between thalamic, putamen, cerebellar and NPGV volumes and SDMT Z-scores with significance set at p < 0.05, using first age and then age and NPGMV as covariates. As the right and left structures were strongly correlated (data not shown), they were combined by calculating mean values in order to reduce the number of variables (Batista et al. 2012). Only the structures whose volume was significantly correlated with SDMT Z-scores were used in subsequent regression analyses. Stepwise linear regression models (entrance criterion p < 0.05 and exit criterion p = 0.10) were generated in order to determine the strongest predictors of SDMT performance. Age was entered as a covariate and retained in block 1, and the significant GM structures were entered in block 2 using the forward stepwise technique. Finally, each model was repeated with NPGMV added in block 1 (Batista et al. 2012) (SPSS Statistics version 20.0).

Results

Clinical-demographic, neuropsychological and conventional MRI data

Table 1 summarizes the main clinical-demographic, neuropsychological and conventional MRI data in the two studied groups. To better describe the distribution of SDMT scores in the studied populations, MS patients were stratified in three subgroups, based on SDMT z- scores, as follows: i) > −0.99 SD: 46/125, 37%, ii) between −1 and −1.99 SD: 33/125, 26%, iii) ≤ −2 SD: 46/125, 37%.

MS patients had a lower SDMT Z-scores, NBV, NWMV, NGMV and NPGMV than HC (Table 1).

VBM analysis

In the MS group, the correlation analysis showed a positive association between SDMT score and regional GM volume in the following brain regions (Fig. 1, Table 2): the posterior lobe of the cerebellum, bilaterally; the thalamus, bilaterally; the left lateral occipital cortex, the occipital pole, bilaterally; the right occipital fusiform gyrus and the left putamen. On the contrary, no correlations were found between SDMT and regional GM volume in the HC group.

Fig. 1
figure 1

Regional pattern of gray matter (GM) volumes positively associated with Symbol Digit Modalities Test (SDMT) Z-scores in multiple sclerosis (MS) patients, superimposed on the customized GM template. Legend: The regions showing significant correlations (p < 0.05) are shown in red–yellow (t-value 0–6). Abbreviations: A = anterior, P = posterior, L = left, R = right, S = superior, I = inferior

Table 2 Positive correlations between regional gray matter volumes and Symbol Digit Modalities Test Z-scores in multiple sclerosis patients

Compared to HC, MS patients showed significant regional GM loss in the following areas: the thalamus, bilaterally; the postcentral gyrus, bilaterally; the right precentral gyrus; the right inferior temporal gyrus; the insular cortex, bilaterally; the middle temporal gyrus, bilaterally; the left superior frontal gyrus; the right cerebellum; the left pallidum; the left cingulate gyrus; the right putamen; the right temporal pole and the left inferior frontal gyrus (Fig. 2, Table 3). No areas of regional GM volume reduction were found in HC when compared to MS patients.

Fig. 2
figure 2

Regional pattern of gray matter (GM) atrophy in patients with multiple sclerosis (MS), compared to healthy controls (HC), superimposed on the customized GM template. Legend: The regions of significant reduced GM volume in MS patients (p < 0.05) are shown in red-yellow (t-value 0–8). Abbreviations: A = anterior, P = posterior, L = left, R = right, S = superior, I = inferior

Table 3 Regions of significant gray matter atrophy in multiple sclerosis patients compared to healthy controls

Thalamic, putamen and cerebellar atrophy measures

In MS patients the mean thalamic volume was 10.1 mL (1.1 SD), the mean putamen volume was 4.5 mL (0.9 SD) and the mean cerebellar volume was 79.6 mL (4.6 SD).

Correlation analysis and linear regression analysis

Correlations analysis between NPGMV, thalamic, putamen and cerebellar volumes and SDMT Z-scores are presented in Table 4. All significant correlations were positive. SDMT Z-scores showed a significant correlation with: 1) thalamic volumes, putamen volumes and NPGMV; 2) thalamic and putamen volumes, controlling for age; 3) only thalamic volumes, controlling for age and NPGMV.

Table 4 Positive correlation between Thalamus, Putamen and Cerebellum volumes and Symbol Digit Modalities Test (SDMT) Z-scores in MS patients

In the linear regression model (including thalamus and putamen), the thalamus was the only independent predictor of SDMT Z-score, after controlling for age and for age + NPGMV (Table 5).

Table 5 Linear regression models predicting Symbol Digit Modalities Test (SDMT) performance in MS.

Discussion

Despite the high frequency and the negative impact of CI on patients’ daily life, neuropsychological (NP) assessment is not currently included in routine clinical evaluation of MS patients. The principal factors that limit its use are center-dependent (dedicated rooms, scheduling, etc.), rater-dependent (training, time of administration, etc.), test-dependent (screening vs comprehensive batteries) and patient-dependent (i.e. confounders such as depression, fatigue, physical impairment, etc.) (Chiaravalloti and DeLuca 2008).

For the abovementioned reasons, SDMT, a simple and short test which explores those cognitive domains frequently impaired in MS patients (Smith 1982), has been proposed as a screening test for routine clinical assessment of cognitive dysfunction in MS (Lopez-Gongora et al. 2015). Indeed, it has been shown that SDMT is able to correctly classify MS patients as cognitively impaired (Parmenter et al. 2007; Lopez-Gongora et al. 2015) and to discriminate between MS patients and HC (Benedict et al. 2008). Moreover, among various MS-related factors (e.g. disability, disease duration, MS course, other cognitive tests, and personality characteristics) SDMT is the most reliable predictor of employment status (Strober et al. 2012) and its decline over time predicts a transition from fulfilling employment to vocational disability (Morrow et al. 2010). Finally, SDMT is simple and fast to administer, lasting a total of five minutes (Drake et al. 2010). Based on these premises, in the present study VBM analysis was applied to a large population of MS patients to investigate the contribution of regional GM damage to A-PS deficits, as measured by SDMT. Subsequently, we tested if volumes of SDMT-related GM regions independently contribute to A-PS performances in MS patients.

Our results showed that lower performance in SDMT was associated with a reduction of GM volumes in the cerebellum, bilaterally (posterior lobe), the thalamus, bilaterally, the left putamen and the occipital cortex.

These results are in contrast with those of previous studies using SDMT and a VBM approach. In particular, Sastre-Garriga et al. reported no significant correlation between regional GM volumes and SDMT scores (Sastre-Garriga et al. 2009), while Nocetini et al. found a correlation between SDMT performance and GM atrophy of specific regions located in the frontal, parietal and temporal cortex (Nocentini et al. 2014). However, these discrepancies might reflect the small sample size (Nocentini et al. 2014; Sastre-Garriga et al. 2009), the characteristics of studied populations (e.g. Cognitive status of enrolled MS patients) (Sastre-Garriga et al. 2009) and/or methodological issues (Nocentini et al. 2014; Sastre-Garriga et al. 2009) (e.g. using VBM without lesion filling of 3D–T1 images) (Battaglini et al. 2012).

On the other hand, in agreement with other studies, using an individual based method (Batista et al. 2012; Schoonheim et al. 2012; Houtchens et al. 2007; Minagar et al. 2013; Bergsland et al. 2016; Benedict et al. 2013), we were also able to observe that global thalamic and putamen volumes are related to SDMT Z-scores. The absence of correlation of SDMT Z-scores with cerebellar and neocortical (as measured by NPGMV) volumes could depend on the fact that VBM results included only sub-regions of these relatively large structures. Furthermore, in the linear regression analysis, we were able to confirm that thalamic atrophy is an independent and strong contributor to MS-related A-PS deficit, also controlling for age and neocortical atrophy (Batista et al. 2012; Benedict et al. 2013).

The contribution of the thalamus to cognition is well documented and several previous MRI studies have shown that different thalamic MRI measures (e.g. volume, microstructural damage, etc.) strongly correlate with global and selective CI in MS (Minagar et al. 2013; Bisecco et al. 2015; Batista et al. 2012; Schoonheim et al. 2012; Houtchens et al. 2007; Bergsland et al. 2016; Benedict et al. 2013). Our findings, therefore, strongly confirm and reinforce the role of the thalamus as an independent and additional contributor in determining A-PS functions in MS patients.

With regards to the cerebellum, several studies have demonstrated its key role in cognition, particularly regarding language, executive function and visuo-spatial abilities (Tedesco et al. 2011). It has been hypothesized that the “cognitive” cerebellum stems from its strong anatomical connections with many associative cortical areas, including the prefrontal cortex (Middleton and Strick 2000). Moreover, it has been recently demonstrated that the cerebellum also plays a critical and specific role in MS-related cognitive dysfunction, mostly affecting those cognitive domains related to executive functions, A-PS and verbal fluency (Sarica et al. 2015). Poor performance in A-PS in MS has been related to a disconnection between posterior cerebellum and cortical areas involved in cognition (Rocca et al. 2016). Structural studies have also explored the relationship between A-PS and volume, LV and microstructural damage of the cerebellum, showing conflicting results (Sarica et al. 2015; Weier et al. 2014; Damasceno et al. 2014). Moreover, a lower performance at SDMT has been linked to GM atrophy in cortical areas strictly connected with the cerebellum, such as the dorsolateral prefrontal cortex (Cerasa et al. 2013). Finally, cognitive deficits observed in MS patients seem to be related to a fronto-cerebellar connectivity impairment (Bonnet et al. 2010) that might be partially overcome by cognitive rehabilitation (Sastre-Garriga et al. 2011).

To our knowledge, the present study is the first one that reported a significant correlation between cognitive performance and regional cerebellar atrophy in MS in a whole brain voxel-vise study, thus suggesting that cerebellar damage might be relevant to MS-related CI, and particularly to A-PS deficits. The absence of correlation with global cerebellar volume and the localization of areas SDMT-correlated in the posterior lobe of cerebellum support the hypothesis of a cerebellar sub-regional specialization for cognitive functions (Rocca et al. 2016; Sarica et al. 2015).

As for the abovementioned structures, putamen damage has also been previously associated with A-PS in MS (Batista et al. 2012). This nucleus is involved in voluntary fixational control and saccades, and contributes to planning tasks with a visual searching component (Batista et al. 2012). Here we confirm the role of the putamen in A-PS deficits, emphasizing the role of visual movements in cognitive tasks influenced by visual searching such as SDMT.

A similar interpretation might also be applied to the relationship between occipital cortex atrophy and SDMT performance, since it was demonstrated that occipital regions experience a significant functional MRI activation during SDMT execution in MS patients (Genova et al. 2009).

To assess whether GM regions showing correlations with SDMT performance are also atrophic in MS patients, a whole-brain VBM analysis comparing MS patients and HC was also performed. Our findings are in line with those of previous studies (Prinster et al. 2006; Lansley et al. 2013) reporting regional GM loss at the level of the thalamus, basal ganglia, pre/postcentral and cingulate gyri in MS patients (Lansley et al. 2013). Interestingly, in our study, all SDMT-correlated GM regions, except for occipital cortex, were atrophic in MS patients.

We recognize that generalization of our results might be limited by the lack of a global neuropsychological evaluation. Consequently, we cannot be confident that our MS patients are representative of a “common” MS population. Nevertheless, SDMT has been shown to be able to correctly classify MS patients as cognitively impaired (Parmenter et al. 2007; Lopez-Gongora et al. 2015). Accordingly, the percentage of our MS patients impaired (Amato et al. 2006) at SDMT (37%), was in line with the expected percentage of CI in MS general population (Chiaravalloti and DeLuca 2008). In line with the abovementioned statements, the strong relationship between SDMT performance and CI does not allow us to exclude that our findings reflect global CI (identified by lower scores on the SDMT) rather than just A-PS deficits.

In conclusion, this study suggests that A-PS deficits in MS are associated with GM atrophy of strategic brain structures, directly (such as the thalamus and the cerebellum) or indirectly (through the control of visual functions, such as the putamen and occipital cortex) involved in cognitive functions. Furthermore, thalamic atrophy plays a specific, independent and additional role in determining A-PS deficits in MS.