Introduction

Autosomal dominant cerebellar ataxias (ADCA), referred to as spinocerebellar ataxias (SCAs) in genetic nomenclature, are a clinically and genetically heterogeneous group of rare neurodegenerative disorders characterised by the slow degeneration of the Cerebellum and its connections, often accompanied by degenerative changes in the Brainstem and elsewhere (Dürr 2010; Schöls et al. 2004; Soong and Paulson 2007). Overall, SCAs have a prevalence of 1–3/100,000 in Europe (Dürr 2010). At present there are at least 29 subtypes of SCA identified (Dürr 2010; Schöls et al. 2004); SCA2 represents the most frequent genotype (24%) in Italian ADCA kindred (Brusco et al. 2004; Schöls et al. 2004). Its pathogenesis relies on CAG repeats expansion in the affected gene (12q24), clinically the phenotype is highly variable and comprises ataxia with slow saccades, peripheral neuropathy and less frequent extrapyramidal findings. The age at onset is most commonly in the third–fourth decade, it is inversely correlated with the repeat length, and it shows the anticipation phenomenon (Schöls et al. 2004). The disease has generally a progression to the loss of physical control, disability and death, with a relative retains of mental capacities, although the presence of cognitive alterations (especially verbal memory and executive dysfunction) is a common findings (Bürk et al. 1999; Bürk et al. 2003; Bürk 2007; Klinke et al. 2010; Le Pira et al. 2002) and dementia is reported from 15 to 25% depending on the stage of the disease (Bürk et al. 1999; Orsi et al. 2010).

Three neuropathological studies (a total of 15 patients, 11 from Cuba, 2 from Martinique, 1 from USA, 1 not specified) found extreme loss of pontine and olivary nuclei and Purkinje cells, with relative preservation of the dentate nuclei and cerebral cortex, the spinal cord was reported as severely affected (Dürr et al. 1995; Estrada et al. 1999; Pang et al. 2002). Two other neuropathological studies (five patients from Germany) reported also a thalamic nuclei involvement (Rüb et al. 2003a, b, 2005).

Voxel-based morphometry (VBM), introduced by Ashburner and Friston (2000), consisted, at its simplest version, in a voxel-wise comparison of the local concentration of Grey Matter (GM) between two groups of subjects. This procedure involved spatially normalising high-resolution images from all the subjects in the study into the same stereotactic space. This is followed by segmenting the GM from the spatially normalised images and smoothing the GM segments. Voxel-wise parametric statistical tests, which compare the smoothed GM images from the two groups, were performed to discover local differences in GM concentration. Good et al. (2001) proposed a change in the workflow processing, the so-called Optimised VBM. Separate Grey and White Matter (WM) templates were created from an initial simple VBM analysis and then the templates were used to optimally normalise the images that were segmented again before the parametric statistical tests. Moreover, in order to preserve the volume of a particular tissue (GM or WM) within a voxel, a further processing step was incorporated, the multiplying (or modulating) voxel values in the segmented images by the Jacobian determinants derived from the spatial normalisation step. Ashburner and Friston (2005) proposed the Unified Segmentation (implemented in SPM5 and in SPM8), a probabilistic framework that enables image registration, tissue classification, and bias correction to be combined within the same generative model, to resolve the inherent circularity of the Optimised VBM, in fact the steps, instead of being done in a iterative sequence, are components of a single model. Ashburner (2007) introduced Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL), which is a fast algorithm for diffeomorphic images registration; the idea, behind adding DARTEL in the VBM workflow, is to increase the accuracy of inter-subject alignment by modelling the shape of each brain using millions of parameters (three parameters for each voxel). DARTEL works by aligning GM among the images, while simultaneously aligning WM. This is achieved by generating its own increasingly crisp average template data, to which the data are iteratively aligned. DARTEL has demonstrated to improve the VBM results in the majority of studies where it has been employed and compared with previous methods (Bergouignan et al. 2009; Cuingnet et al. 2010; Klein et al. 2009; Pereira et al. 2010).

Another way to improve the inter-subjects alignment in Cerebellum VBM is the employment of the Spatially Unbiased Infratentorial Template (SUIT) of Diedrichsen (2006), Diedrichsen et al. (2009). SUIT is a new high-resolution atlas template of the human Cerebellum and Brainstem in the MNI stereotactic space. The analysis is more accurate, but is confined to the Cerebellum and Brainstem.

The Tower of London (ToL) is a popular neuropsychological test for the assessment of executive functions and Working Memory (Shallice 1982). Executive functions refer to the goal-directed planning abilities, which include decision-making, strategy formation, organisation and sequency of mental functions. Functional studies demonstrated the activation of a fronto-parietal–occipital network including the DorsoLateral PreFrontal cortex, and the Inferior Parietal Lobe during a ToL task and decision-making task (Schall et al. 2003; Andreasen et al. 1992; Morris et al. 1993; Baker et al. 1996; Dagher et al. 1999; Lazeron et al. 2000; Fincham et al. 2002; Newman et al. 2003; Van den Heuvel et al. 2003; Rasser et al. 2005). Boghi et al. (2006) showed that highly significant increases of activation in frontal, parietal, occipital cortical areas and in cerebellar hemispheres were associated with the performance of the task confirming previous SPECT, PET and fMRI studies.

The classical version of this test requires not only executive abilities but also visuospatial abilities and a fine coordinate visuomotor manipulation (ToL PEG). We realised a computerised version (ToL CPU) that can be solved without any complex hands or eyes motor control (Boghi et al. 2006), more correct for the investigation of patients with motor disorders. Neither of the two tests can be regarded as pure measure of executive functioning, as both tests involve different other domains in their task performance as Working Memory, Arithmetic Skill, Attention and Visuospatial Memory. Moreover, the load of the different components could be different between the two tests.

Aim of the study

In this study, first, we characterised the atrophy pattern of SCA2 patients, in comparison with healthy controls, to validate our SPM results respect the VBM studies taken from the scientific literature (Brenneis et al. 2003; Della Nave et al. 2008a, b). Then, we correlated the performances in the two versions of ToL with SCA2 atrophy to demonstrate a possible dissociation of the areas of atrophy associated with a prevalent coordinative (ToL PEG) or executive dysfunction (ToL CPU).

An additional aim was the comparison between the results obtained employing different methods: DARTEL and SUIT.

Methods

All statistical analyses, when not specified, were implemented on SPSS 13™ software statistical package (SPSS Inc., Chicago, USA; http://www.spss.com).

Subjects

Twelve right-handed patients (10 males, mean ± SD age = 48 ± 10 years, demographic and clinical data in Table 1) with genetically defined SCA2 (Nardacchione et al. 1999) were recruited at the Center for Spinocerebellar Ataxia Diseases—AOU S. Giovanni Battista, Turin, Italy and underwent a brain MRI. Cerebellar ataxia was scored by ICARS, an international validated clinical scale for ataxia (Schmitz-Hübsch et al. 2006; Trouillas et al. 1997).

Table 1 Demographics and clinical data

A group of 31 right-handed healthy controls of similar age (p = 0.49, 12 males, mean ± SD age = 45 ± 11 years, demographic data in Table 1), without history of alcohol or substance abuse and neurological or psychiatric disorder, were enrolled. At the time of the study, neither patients nor controls followed any kind of pharmacological treatment. The study protocol was approved by the Ethical Committee of the Turin AOU S. Giovanni Battista and all the subjects gave their written informed consent to the study and the further use of their anonymous data.

Neuropsychological assessment

A neuropsychologist administered MMSE (Folstein et al. 1975) to exclude subjects below the cut-off of 23.8 score corrected for subject’s age and education (Measso et al. 1993). Depression was scored using Zung (1965) Self-Rating Depression Scale (SDS) a 20-item self-report questionnaire that is widely used as a screening tool, covering affective, psychological and somatic symptoms associated with depression. Most people with depression score between 50 and 69, while a score of 70 and above indicates severe depression.

Then he administered the two versions of ToL. The PEG version uses a board with three pegs of different length (Fig. 1, upper left) and three coloured spheres. The test consists in solving, in a fixed number of moves, 12 puzzles, going from a start position (Fig. 1, lower left) to a goal position (e.g. Fig. 1, lower right), moving only one sphere at a time from one peg to another. The test is scored on the basis of the time of solution of the puzzles, 6 are easy problems with a maximum resolution time of 30 s and 6 are difficult problems with a maximum time of 60 s. The maximum score is 36. The CPU version (Fig. 1, upper right) uses a picture with coloured circles and three vertical lines representing spheres and pegs, the figure is split in two parts, the upper half shows the start position and the lower half the goal position. The test consists in answering what is the minimum number of moves to go from start to goal moving one circle at time from one line to another. The picture is static so the subjects need to mentally imagine the in-between moves. The test has 20 puzzles and is scored on the basis of the number of correct answers, 10 problems are easy with a maximum time of 15 s to answer and 10 are difficult with a maximum time of 30 s. The score for every problem depends on difficulty and it is proportional to the number of in-between moves needed to keep in mind. The maximum score is 41. All tests were administered during a single session in the day of the MRI scan. The paradigm followed this sequence: First, in a room free of noise and without interruption, we administered PEG version. A 10-min training preceded the administration of the CPU version. Parallel and equivalent forms were used for training session. We spend approximately 20 min between the administration of the PEG and CPU versions.

Fig. 1
figure 1

Tower of London test materials. Upper left tower of London pegs-board (ToL PEG), upper right difficult trial of the computerised Tower of London (ToL CPU), the answer is 6 moves to go from start to goal positions, lower left start position for all the ToL PEG trials, lower right an example of an easy ToL PEG trial (goal position for 3 moves)

MRI data acquisition

All the structural MRI scans of all participants were acquired on a 1.0 T Impact Magnetom Imager™ scanner (Siemens, Erlangen, Germany). Whole brain scans were obtained as high-resolution T1-weighted 3D MP-RAGE (Magnetisation Prepared Rapid Acquisition Gradient Echo), acquired in the sagittal plane (TR = 11.4 ms, TE = 4.4 ms, TI = 300 ms, FA = 15°, FOV = 250 mm × 250 mm, 256 × 256 in-plane matrix, 128 slices of 1.5 mm thickness).

VBM analysis

Images analysis was performed on a workstation using the Statistical Parametric Mapping package (SPM8, rel. lug-10, http://www.fil.ion.ucl.ac.uk/spm) in MATLAB 7.5™ environment (Math Works Inc., Natick, MA, USA; http://www.mathworks.com). We took the following 1–4 steps for both DARTEL and SUIT workflows:

  1. 1.

    Check for scanner artefacts and gross anatomical abnormalities for each subject.

  2. 2.

    Set image origin at the Anterior Commissure AC.

  3. 3.

    Use the Unified Segmentation toolbox to segment the images in GM and WM partitions (see Supplementary Fig. 1).

  4. 4.

    Reslice the images with a trilinear interpolation algorithm to an isotropic voxel of 1 mm × 1 mm × 1 mm.

Then we normalised, in a different way, for the two methods in the final steps:

DARTEL

 

  1. 5.

    Use DARTEL toolbox to have a high-dimensional normalisation protocol (see the resulting template in Supplementary Fig. 2). We followed John Ashburner’s chapter in its standard version including the MNI space transformation.

  2. 6.

    Check for homogeneity across sample and use the standard version of the smoothing (i.e. 8 mm FWHM isotropic Gaussian kernel, Supplementary Fig. 3 left).

SUIT

 

  1. 5.

    Use SUIT 2.5 an SPM extension (http://www.icn.ucl.ac.uk/motorcontrol/imaging/suit.htm) to isolate the Cerebellum and generate the segmentation maps.

  2. 6.

    Hand correct the isolated map if necessary, excluding any GM included outside the Cerebellum: exclusive mask were drawn with MRIcron (http://www.cabiatl.com/mricro/mricron/index.html) and SPM Image Calculator used to exclude the outside voxels.

  3. 7.

    Normalise the cropped anatomical images and reslice the segmentation map into SUIT atlas space (e.g. Supplementary Fig. 4).

  4. 8.

    Check for homogeneity across sample and use a lesser smoothing to preserve precision in the definition of cerebellar structures (i.e. 4 mm FWHM isotropic Gaussian kernel, Supplementary Fig. 3 right).

After this pre-processing we obtained smoothed modulated normalised data of 1 mm × 1 mm × 1 mm voxels for DARTEL (181 × 217 × 181 voxels) and SUIT (141 × 95 × 87 voxels) that we used for the statistical analysis.

Regions of interest analysis

We used MarsBar (Brett et al. 2002), an SPM extension for Regions of Interest (ROIs) analysis, to make statistical inferences on normalised unsmoothed data limited to some cortical and subcortical areas. Moreover we created masks from 25 GM (Anterior Cingulate Cortex, Brainstem, Caudate, Cerebellum, DorsoLateral PreFrontal Cortex, Inferior Parietal Lobule, lateral Orbito Frontal Cortex, Primary Motor Cortex, Middle Cingulate Cortex, Medulla, Medial Frontal Cortex, Midbrain, medial Orbito Frontal Cortex, Pallidum, Posterior Cingulate Cortex, Pons, Putamen, Red Nucleus, Primary Sensorial Cortex, Supplementary Motor Area, Superior Parietal Lobule, Superior Temporal Gyrus, Thalamus, Primary Visual Cortex, VentroLateral PreFrontal Cortex) and 22 WM ROIs (Body of Callosum, Brainstem, Callosum, Cingulum, Corticospinal Tract, Genu Callosum, Inferior Cerebellar Peduncles, Inferior Occipital Fascicle, High Corticospinal Tract, Low Corticospinal Tract, Middle Cerebellar Peduncles, Medium Corticospinal Tract, Medulla, Midbrain, Optic Radiation, Pons, Pontine Crossing Tract, Superior Cerebellar Peduncles, Superior Longitudinal Fascicle, Superior Occipital Fascicle, Splenium of Callosum, Uncinate Fascicle) using MarsBar to convert ROIs from marsbar-aal-0.2 (http://marsbar.sourceforge.net/download.htm), Fibers v. 15 Anatomy Toolbox (http://www.fz-juelich.de/inm/inm-1/spm_anatomy_toolbox) (Eickhoff et al. 2005) and ICBM labelled template (http://www.loni.ucla.edu/Atlases). We backprojected the masks (e.g. Supplementary Fig. 5) from MNI space into the native space of the subjects using the inverted deformation from standard space to subject space derived from Unified Segmentation toolbox and we extracted the partial GM, WM volumes from masks with the Ged Rigdway’s get_totals matlab function (http://www.cs.ucl.ac.uk/staff/g.ridgway/vbm/get_totals.m) that gives the total volume of voxels above a given relative threshold (we used 1%) inside a specified mask.

For the estimation of Total Intracranial Volume (TIV) of subjects we used a robust method (Keihaninejad et al. 2010) that backproject the GM + WM + CSF tissue probability maps (10% thresholded) into native space to extract the volume.

Groups comparisons

We used independent two tailed two samples t test (equal or not equal variance assumed on the basis of a Levene’s test) to compare means of demographic and neuropsychological scores of controls and SCA2. We also used an ANCOVA model with TIV, age and education as covariate and genotypes as factor to compare the Tower of London scores.

An ANCOVA model with age, sex, education, TIV as covariates and genotypes as two factor levels (SCA2, controls) was estimated with SPM8 for GM atrophy (no global normalisation and no grand mean scaling). We used an identical model to test WM atrophy. We tested two contrasts: controls > SCA2 and SCA2 > controls. We used these models on DARTEL (43 GM + 43 WM whole brain 8 mm smoothed modulated normalised images), SUIT (43 GM + 43 WM Cerebellum and Brainstem 4 mm smoothed modulated normalised images) and ROIs data (25 GM ROIs + 22 WM ROIs).

All the contrasts SPM maps were thresholded at an uncorrect p < 0.001. We make an additional request, to minimise false positives, to consider a voxel significant it should also belong to a cluster with a corrected cluster-level p corr ≤ 0.05, FWE corrected, by applying the Random Field Theory (Kiebel et al. 1999), for infratentorial clusters or FDR corrected (Benjamini et al. 2001) for supratentorial clusters. We used a more permissive cluster-level threshold only for extra-cerebellar voxels (as in Brenneis et al. 2003).

In the ROIs analysis we used the step-down Finner formula (FWE correction) to choice the significant group comparisons at ROIs level \( p_{\text{corr}} < 0.05:p_{\text{corr}} = 1 - (1 - p_{i} )^{{\frac{n}{i}}} \), where i is the ith smallest p value in the list and n are the number of considered comparisons (Finner 1990, 1993).

Correlations

Multiple regression models with age, sex, TIV and one variable of interest were estimated with SPM8 (no global normalisation and no grand mean scaling) for GM atrophy correlations with clinical, demographic, genetic and neuropsychological parameters. The considered variables of interest were: disease duration, ICARS total score, CAG repeats, ToL PEG and ToL CPU, we tested all these one by one on DARTEL (43 GM + 43 WM whole brain 8 mm smoothed modulated normalised images) and SUIT (43 GM + 43 WM Cerebellum and Brainstem 4 mm smoothed modulated normalised images) data. We considered as significant voxels with an uncorrected p < 0.001 belonging to a cluster p corr ≤ 0.05 cluster-level corrected (FWE corrected for infratentorial voxels and FDR corrected for supratentorial voxels).

We calculated with the Pearson’s coefficient r the table of correlations between ROIs volumes (TIV normalised), ToL PEG, ToL CPU, demographics, CAG repeats and clinical parameters (means and SDs reported in Table 1). We used the step-down Finner formula (FWE correction) (Finner 1990, 1993) to determinate the significant correlations. To confirm our data we also tested some correlation that was already consistently reported in literature.

Labelling

Statistically significant peaks were topographically labelled using MRIcron together with the AAL (Automated Anatomical Labelling) template (Tzourio-Mazoyer et al. 2002) for forebrain GM, the Schmahmann’s human cerebellar atlas (Schmahmann et al. 1999) for the Cerebellum GM and the ICBM-DTI-81 atlas (http://www.loni.ucla.edu/Atlases) (Mori and van Zijl 2007) for WM.

Choosing controls and covariates

For the choice of the controls group composition, we followed the suggestion of Pell et al. (2008). A difference in group sex composition and education was apparent (Table 1), but we preferred to include all the controls images in our possession to maximise the power of our analysis, instead of using a subset of better matched controls because, empirically, it could be better to increase the size of the controls group instead of matching perfectly the controls with the patients group. To verify that our choice is at least as good as having a more matched controls group we also performed all the analyses with two differently composed controls groups. We used also a sex matched controls group and an education matched controls group (see Supplementary Table 1).

For the covariates to be controlled in the modelling, we inserted sex and age as they are important confounding in volumetric studies; we also added TIV as suggested by Pell et al. (2008) and education that could have a potential impact on executive tests performance (i.e. Tower of London) and therefore could explain some of the variance.

Results

Table 1 shows SCA2 patients and controls clinical and demographic data. All the patients were over the MMSE cut-off and did not have a severe depressive condition or late stage of the disease.

The composition of controls group did not change the results of comparisons and correlations as we verified. For an example see Supplementary Fig. 6 and Supplementary Table 2 where we compared the overlapping of the three different groups comparisons for GM atrophy.

Groups comparisons

The two groups differed (p < 0.01, t test) for scores in ToL PEG and ToL CPU (when expressed in percent of maximum score the performances at PEG and CPU where very similar intra group, Fig. 3 Upper left and right). The difference was confirmed by the ANCOVA model (PEG, genotype p = 0.001 adjusted r 2 = 0.41; CPU, genotype p = 0.01 adjusted r 2 = 0.41).

The GM and WM atrophy patterns were described in Fig. 2, Supplementary Figs. 7–13; statistical peaks in Table 2 for GM and Table 3 for WM; clusters gyral description in Supplementary Table 3 for GM and fascicle involvement in Supplementary Table 4 for WM. DARTEL and SUIT gives a similar atrophy description with the whole Cerebellum, cerebellar peduncles and Brainstem involved (Supplementary Tables 3, 4).

Fig. 2
figure 2

VBM comparison between controls and SCA2 patients. Upper left SCA2 versus controls GM comparison for DARTEL normalised data, SPM maps projected onto a 3D render of the template created by DARTEL, upper right SCA2 versus controls GM comparison for SUIT normalised data, SPM maps projected onto a 3D render of the SUIT template, lower left SCA2 versus controls WM comparison for SUIT normalised data, SPM maps projected onto 3 orthogonal planes of the SUIT template, blue cross on the global maxima, lower right SCA2 versus controls WM comparison for DARTEL normalised data, projected onto 3 orthogonal planes of the Colin27 template, blue cross on the global maxima, detail of the Corticospinal Tracts involvement. For all the contrasts, the threshold was p < 0.001 uncorrected

Table 2 SCA and controls GM comparison
Table 3 SCA and controls WM comparison

The pattern included: Lobules III–VI, Crus I–II, VIIb, VIII–X, Vermis I–X, Inferior/Middle/Superior Cerebellar Peduncles, Medulla, Midbrain, Pons, Medial Lemniscus and Pontine Crossing Tract. The atrophy extended with a number of voxels over the threshold of nearly 50% in the left and 50% in the right hemisphere (for both DARTEL and SUIT, see Supplementary Table 3). The atrophy extended slightly more in the Anterior Cerebellum compared to the Posterior (for both DARTEL and SUIT, see Supplementary Table 3). The maximum GM peaks differed for DARTEL and SUIT, where the order from max to min was Brainstem, Bilateral Posterior Cerebellum and Right Anterior Cerebellum for DARTEL (Table 2, Supplementary Table 3) and Brainstem, Right Anterior Cerebellum, and Bilateral Posterior Cerebellum for SUIT (Table 2, Supplementary Table 3). Both methods gave the same peak for WM: Right Middle Cerebellar Peduncles (Table 3).

Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra showed more extended clusters of atrophy, especially in Posterior Cerebellum, but also more voxels outside the GM atlas mask (Supplementary Table 3).

Additionally only DARTEL can show significant voxels outside Cerebellum and, indeed, there were several: in the Left Inferior Parietal Lobule (see Table 2 and Supplementary Fig. 12) and along the Cortico Spinal Tracts (they agreeably followed the a priori known shape of the fascicle, see Table 3; Fig. 2, lower right and Supplementary Fig. 13).

Regions of Interest analysis (Supplementary Table 5) showed significant differences in GM (Finner step-down corrected) for: whole Cerebellum, whole Brainstem, Midbrain, Medulla, Pons and Thalamus. In WM for: whole Brainstem, Inferior/Middle/Superior Cerebellar Peduncles, Pontine Crossing Tract, Medulla, Pons, Inferior Occipital Fascicle, Superior Longitudinal Fascicle, Uncinate Fascicle, Corticospinal Tract (especially the Low part, as we divided the Corticospinal Tract in three, approximately equal, parts along the infero-superior direction: Low, Medium and High).

Correlations

We did not find any significant correlation with SPM8 for ICARS, CAG repeats and duration of the disease. We found a striking correlation with the two version of ToL (Fig. 3, Supplementary Fig. 14, 15, 16; Table 4, Supplementary Table 6). The ToL PEG correlated with a prevalently (83%) Anterior Cerebellum area of about 20,000 voxels with global maxima in x, y, z = 31, −33, −27 (Right Lobule IV–V near the boundary with Lobule VI). The clusters extended more in the Right Cerebellum (68%) and included many voxels in Lobules III–VI with small parts of the Right Parahippocampal Gyrus, Crus 1, 2 and Midbrain (Supplementary Table 6). The ToL CPU correlated with areas prevalently centred on the Posterior Cerebellum (71%) of about 25,000 voxels with global maxima in x, y, z = 38, −76, −36 (Right Crus 1 near the boundary with Crus 2). The clusters extended more in the Right Cerebellum (78%) and included Lobules III–VI, Crus 1, 2, VIII, Vermis I–V, Right Hippocampus and Parahippocampal Gyrus with small parts of Midbrain, Left Hippocampus and Parahippocampal Gyrus (Supplementary Table 6).

Fig. 3
figure 3

Correlation between Tower of London scores and atrophy patterns. Upper left Traditional Tower of London scores (ToL PEG) for controls (in blue, left histogram) and SCA2 (in green, right histogram), upper right Computerised Tower of London scores (ToL CPU) for controls (in blue, left histogram) and SCA2 (in green, right histogram). Error bars stand for SEM, *p < 0.05 two independent samples t test, ordinates units are in percent of maximum score. Lower left voxels volume correlated with ToL PEG, blue cross on the global maxima x, y, z = 31, −33, −27 mm; Lower right voxels volume correlated with ToL CPU, blue cross on the global maxima x, y, z = 38, −76, −36 mm. For all the contrasts the threshold was p < 0.001 uncorrected

Table 4 GM correlation with PEG and CPU

The correlations that survived to Finner step-down correction for multiple comparisons were reported in Supplementary Table 7. CAG repeats did not have significant corrected correlations, anyway, we reported the maximum that was r = −0.77 (NS) with the Inferior Cerebellar Peduncles. Age correlated significantly (|r| > 0.80) with some medial frontal areas, Superior Temporal Gyrus, total GM and ToL CPU (with ToL PEG the correlation was medium, r = −0.61, but NS). Education did not have significant correlations FDR corrected, but showed a medium r = 0.50 (NS) with both the ToL versions. ToL CPU and PEG showed a strong intercorrelation (r = 0.77), but this result would not survive to correction (NS). ICARS score; onset of disease and duration did not have any significant correlations.

We reported age and education correlations with ToL for their known effect over executive functions (Plumet et al. 2005) and the CAG repeats correlation with disease onset (−0.51, NS) for the anticipation phenomenon in SCA2 (Schöls et al. 2004).

Discussion

Comparison with previous works

Our results reproduced very well those of Della Nave et al. (2008a, b). All their and our GM and WM atrophy’s areas were the same with the exception of Corpus Callosum.

We had a partial agreement for the Corpus Callosum: we found a ROIs significance, but only with p uncorrected, and, what is more, also the authors cannot find it with VBM, emerging it only with an additional analysis with the tract-based spatial statistics (TBSS) method (Smith et al. 2006).

In their study, the R Thalamus would not be significant if they applied the threshold chosen in our work, however, this finding was interesting and in perfect agreement with Brenneis et al. (2003) and some previous neuropathological findings (Rüb et al. 2005; Rüb et al. 2003a, b, 2005). Our ROI analysis on Thalamus (Supplementary Table 5) revealed a significant lesser volume (Finner step-down corrected). This result, together with the Corticospinal Tracts and Parietal Cortex involvement, led us to assume a progressive involvement of this system and of the sensorial Thalamus with the progression of the disease. This progression could be also linked to the somatosensory and somatomotor symptoms in SCA2 (Kubis et al. 1999; Rüb et al. 2003a, b, 2005, 2007). In addition, Brenneis et al. (2003) hypothesised that the cortical and Thalamic degeneration could start as a secondary process. This would be confirmed by the fact that a subgroup of patients with a lesser disease duration in Brenneis’ work (2003) did not show any cortical atrophy (the main difference between our and Brenneis’ work was the absence of cortical atrophy in our results). This fact would be confirmed also by the work of Giuffrida et al. (1999) that found a diffused cortical atrophy in their sample, but as in Brenneis study (2003) the patients had a greater number of CAG repetitions and a worse ataxia score compared to our patients. Our patients did not have dementia or forebrain global GM reduction, a feature that could be associated with a cognitive impairment, more associated with a more severe or a later stage of the disease (Klockgether et al. 1998). Taking together these observations, we could suppose that our data, together with Della Nave data (Della Nave et al. 2008a, b), were as an earlier picture of the disease evolution compared to Brenneis data.

The second work of Della Nave et al. (2008a, b) was not comparable with the previous because they used a very high conservative threshold. If we would use the same threshold we would find similar results (not shown), in fact large portions of the Cerebellum hemispheres would not appear atrophic anymore.

In the ROI analysis we found that WM is reduced also in the Inferior Occipital Fascicle, Superior Longitudinal Fascicle and Uncinate Fascicle. The involvement of these WM tracts has never been described before and allows us to suppose a more widespread WM damage in this pathology.

DARTEL and SUIT comparison

Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra and SUIT gave similar results with some differences in the extension and t values of the posterior clusters. The difference could arise from different smoothing amount employed and a good agreement in subject’s alignment of the two methods. To confirm these hypothesis we estimated again the DARTEL GLM using a minor smoothing (4 mm FWHM), in fact we found an order of intensity of the peaks equal to SUIT (Right Anterior Cerebellum before Bilateral Posterior Cerebellum) and less extended clusters (data not shown).

SCA2 anatomic-clinical correlation

We did not find any correlations with ICARS, unlike previous studies (Brenneis et al. 2003; Della Nave et al. 2008a, b; Ying et al. 2006), and any correlation with disease duration or onset, like two previous studies (Brenneis et al. 2003; Della Nave et al. 2008a, b) and unlike another one (Della Nave et al. 2008a, b), both in ROIs and GLM analyses. This result, in part, depended from the fact that the correlations reported in previous works were mild and they did not survive to the applied Finner step-down correction.

SCA2 anatomic-genetic correlation

We did not find any correlations with CAG repeats, as in previous studies in both ROIs and GLM analyses. Although the correlations did not survive to the multiple comparison correction, we wanted to indicate the strong association between Inferior Cerebellar Peduncles volume and the triplet repetitions (r = −0.77, p = 0.003). In future works, if this data will be replicated, demonstrating enough sensitivity and reliability, the volumetry of this white matter bundle could be used as a possible biomarker of CAG repetitions in SCA2.

SCA2 anatomic-neuropsychological correlations

We observed a correlation between Anterior Cerebellum and ToL PEG performance and between Posterior Cerebellum and ToL CPU. The main differences of the two tasks were that in the CPU version the sensorimotor feedbacks and skills, needed for fine coordination, were absent and on the contrary the Working Memory load was increased.

Tower of London planning task increases blood flow activity, as attested consistently by many PET and fMRI studies (e.g. Beauchamp et al. 2003; Boghi et al. 2006; Schall et al. 2003), in a network mainly composed by bilateral dorsolateral prefrontal, inferior parietal cortex and Cerebellum. These evidences converge with other theoretical, anatomical, physiological, clinical, and functional considerations that led to the incorporation of the cerebellum into the distributed neural circuits subserving cognition and emotion (Schmahmann 2010). In a recent meta-analysis (Stoodley and Schmahmann 2009), confirmed by a multiple fMRI tasks intra-subjects study (Stoodley et al. 2010), sensorimotor tasks activated the anterior Lobule IV–V and the adjacent Lobule VI. The posterior lobe was involved in higher-level tasks: executive functions activated the Lobules VI, Crus I and VIIb. These are the same areas that we observed as associated to different dysfunctions in the SCA2 patients. The dissociation between the coordinative and executive impairment was reported, regard prominence and evolution, in SCA2 patients (Le Pira et al. 2007). We hypothesise that this dissociation is linked to different areas of atrophy in the Cerebellum and that it resemble the dissociation founded in patients with cerebellar damage in different cerebellar areas (Stoodley and Schmahmann 2010). In both ToL tasks, we observed an association of the Right Cerebellum atrophy with the performances. This result is in agreement with the hand dominance of subjects and with the involvement of the frontal left cortex in these tasks that, with Right Cerebellum is active during planning. The role of cerebellum in Working Memory is well recognised and in the recent meta-analysis (Stoodley and Schmahmann 2009) authors highlighted, as well as for executive functions, the strongest activation in the lobule VI/Crus I in both hemispheres, during verbal Working Memory tasks. The increased Working Memory load in the ToL CPU respect to the classical version, may have played a role in finding the correlation between the posterior cerebellum and the ToL CPU.

Although the two sets of areas associated with ToL versions were quite separable, the ToL CPU pattern included a part of the ToL PEG (Supplementary Fig. 16, Supplementary Table 5). At present, the scientific community is divided between embodied and disembodied theories believers, with motor representations being considered critical or merely ancillary for action and object recognition, imagery and even language understanding. The former predominantly are built on associations of deficits in these abilities whereas the latter rely on dissociations between deficits. To date, although no convincing theory is available to explain both sets of observations, cognitive neuroscientists have begun to explore an intermediate hypothesis space (Rumiati et al. 2010) that could fit well in our findings of partially overlapping and partially disjointed atrophy areas associated with different dysfunctions.

Conclusions

The correlations of different versions of the test suggested a dissociation between dysfunctions in SCA2: the Posterior Cerebellum linked to the executive dysfunction and Working Memory and the Anterior Cerebellum linked to the coordinative dysfunction. Our results could also have a clinical relevance, supporting the fact that neuropsychological classical tests, when require motor coordination, may not be adequate for the assessment of patients with movement disorders.