Introduction

The most salient characteristic of Alzheimer’s disease (AD) on magnetic resonance imaging (MRI) is atrophy of the medial temporal lobe, including the hippocampus [1]. However, atrophy may also occur in the posterior cortex and early posterior cortical involvement is emerging as an important aspect of AD [27]. Posterior cortical atrophy (PCA) is often evident from visual inspection on structural MRI. We recently proposed a visual rating scale for assessing the degree of PCA [8]. This scale was found to be robust, reproducible, and easily applicable in a clinical setting [7, 8].

To establish the validity of this visual PCA rating scale and thereby determine whether its use in clinical practice is justified, it should be compared against quantitative brain volumetry, as has been done previously for other visual rating scales [911]. This issue has partially been addressed by comparing PCA scores with the manually determined volumes of the posterior cingulate gyrus, a single structure within the posterior region [7]. The visual rating scale for PCA however covers a larger anatomical area and full validation against volumetry is lacking.

Therefore, in the current study, we provide a validation covering the entire anatomical region of interest. We assessed the mutual discriminatory value of the possible scores of the visual PCA rating scale at three different levels of anatomical detail: the entire parietal brain region, the individual anatomical subregions, and in much finer detail using voxel-based morphometry (VBM). Finally, we assessed which anatomical regions contribute most to the discrimination between PCA scores.

Materials and methods

Patients

The study was approved by the local institutional medical ethics committee. All patients gave written informed consent.

We included 398 patients with probable AD or subjective complaints from our memory clinic-based Amsterdam Dementia Cohort. All patients visited the Alzheimer Centre or the Department of Internal Medicine (COGA) of the VU University Medical Centre (VUmc) between August 2008 and January 2011 and underwent a standardised 1-day assessment including MRI. Diagnoses of probable AD were made in a multidisciplinary consensus meeting according to the NIA-AA criteria [12]. Patients with normal clinical investigations [i.e. not fulfilling criteria for mild cognitive impairment (MCI) [13] or any major psychiatric disorder] were labelled as having subjective memory complaints. For inclusion in the present study a three-dimensional T1-weighted sequence (3D T1) and a 3D fluid-attenuated inversion-recovery (FLAIR) sequence acquired during the same session were required. Exclusion criteria were (1) poor MR image quality and/or large image artefacts (n = 4), (2) an asymmetric PCA visual rating score (n = 34), and (3) gross abnormalities other than atrophy, including severe white matter hyperintensities (WMH) (n = 3) as defined below.

MRI acquisition and review

Magnetic resonance imaging was performed on a 3.0-T whole-body MRI system (SignaHDxt, GE Healthcare, Milwaukee, WI, USA) using an eight-channel head coil with foam padding to restrict head motion. Imaging included a whole-brain 3D T1 fast spoiled gradient echo sequence (FSPGR; TR 708 ms, TE 7 ms, flip angle 12°, 180 sagittal slices, field of view 250 mm, slice thickness 1 mm, voxel size 0.98 × 0.98 × 1 mm) used for VBM and for the visual rating. A 3D FLAIR sequence (TR 8,000 ms, TE 125 ms, 132 sagittal slices, field of view 250 mm, slice thickness 1.2 mm, TI = 2349 ms) was acquired for visual rating of WMH. MR images were reviewed for brain abnormalities other than atrophy by experienced neuroradiologists (FB, 20 years’ experience; MPW, 10 years’ experience). WMHs were rated on 3-mm axial reformats of the 3D FLAIR sequences (by FB and MPW) with the Fazekas scale [14], a four-point rating of the overall presence of WMH. Subjects with a maximum Fazekas scale score were excluded.

Visual rating of PCA

Details about rating of posterior cortical atrophy can be found in the original paper about the visual rating scale [8]. In short, PCA was rated with a 4-point scale with PCA-0 = no atrophy, PCA-1 = minimal atrophy, PCA-2 = moderate atrophy, and PCA-3 = severe atrophy based on multiplanar reconstructions (sagittal, coronal, axial) of the 3D T1 and 3D FLAIR images (Fig. 1). Rating of PCA takes approximately 2 min per patient for a skilled rater. All imaging was rated by the same rater (MB, 6 months’ experience). Each tenth acquisition was rated in consensus with an experienced neuroradiologist (MPW, 10 years’ experience). When there was doubt about appropriate scoring, imaging was discussed in a consensus meeting with MPW. Both raters were blinded to the subjects’ age, sex, and diagnosis. Intra-rater reliability was determined for 60 acquisitions from the data set and was very good with a weighted kappa (quadratic weights) of 0.95 for left and right scores. Inter-rater reliability between MB and MPW was also very good with a weighted kappa (quadratic weights) of 0.85 for left and right scores. Patients were categorised into groups based on the PCA scores of their imaging. MRIs were also rated in terms of medial temporal lobe atrophy (MTA) (according to the five-point visual rating scale) [9] and global cortical atrophy (GCA) (according to the four-point visual rating scale) [9, 15].

Fig. 1
figure 1

Scoring of the visual rating scale for posterior cortical atrophy (PCA). In sagittal orientation, widening of the posterior cingulate sulcus (PCS) and parieto-occipital sulcus (POS) and atrophy of the precuneus (PRE) were evaluated. In axial orientation, the widening of the posterior cingulate sulcus and sulcal dilation in the parietal lobes (PAR) were evaluated. In coronal orientation, the widening of the posterior cingulate sulcus and sulcal dilation in the parietal lobes were evaluated. From left to right sagittal 3D T1, axial 3D FLAIR, and coronal 3D T1 reconstructions with a: 0 = no atrophy (38-year-old man with subjective memory complaints); b: 1 = minimal atrophy (73-year-old woman with Alzheimer’s disease [AD]); c: 2 = moderate atrophy (68-year-old woman with AD); d: 3 = severe atrophy (62-year-old woman with AD)

Tissue type segmentation

DICOM images of the FSPGR sequence were corrected in three dimensions for gradient non-linearity distortions and converted to the Nifti format. The linear transformation matrix to the MNI space was calculated using FSL-FLIRT [16] and used to place the image coordinate origin (0,0,0) on the anterior commissure using the Nifti s-form.

The structural 3D T1 images were analysed using Statistical Parametric Mapping (SPM8; Functional Imaging Laboratory, University College London, London, UK) implemented in MATLAB 7.12 (MathWorks, Natick, MA, USA). First, SPM8 was used to automatically quantify probabilities of grey matter (GM), white matter (WM), and cerebrospinal fluid (CSF) for each voxel of each native image. Total intracranial volume (TIV) was calculated by summing up the native space volumes of GM, WM, and CSF segmentations.

Volumetry of regions of interest

We calculated the total GM volumes of six structures covered by the visual rating scale for PCA: bilateral posterior cingulate, postcentral gyrus, superior and inferior parietal gyrus, and angular gyrus and precuneus, as follows. Using the fully automated “Individual Brain Atlases as implemented in the Statistical Parametric Mapping” (IBASPM) toolbox [17], (http://www.thomaskoenig.ch/Lester/ibaspm.htm), based on SPM software (http://www.fil.ion.ucl.ac.uk/spm), for each participant, each voxel of the native GM density maps was assigned to one of the 84 predefined cerebral GM structures in the Automatic Anatomic Labelling (AAL) atlas by applying the inverse of the image transformation obtained in the VBM normalisation process (details below) [18]. Segmentation results were visually inspected for accuracy, and none had to be discarded. The GM volume (cm3) of each of the six structures covered by the PCA rating scale was estimated using the IBASPM volume statistic function. GM volumes for the left and right parts of each structure were summed and transferred to SPSS. The total GM volume of the whole parietal region of interest (ROI) was calculated as the sum of the volumes of the six structures.

Statistical analysis of volumes

Statistical analyses of clinical data were performed using SPSS 20.0 for Windows (IBM SPSS Statistics, Somers, NY, USA), using Student’s t-tests, Mann-Whitney U-tests, and Pearson’s chi-squared tests to compare groups where appropriate. To test if the total GM volume of the whole parietal ROI (dependent variable) differed among the four PCA scores (independent variable), we used analysis of variance (ANOVA) with TIV as a covariate. As the raters of the degree of PCA were blinded to the subjects’ age and sex, and therefore these did not influence the rating, these two variables were not included as covariates. Bonferroni post-hoc tests were conducted. Additionally, we investigated whether the GM volume of the six structures separately (dependent variables) differed among the four PCA scores (fixed factor) using a multivariate model (MANOVA) with TIV as a covariate. Partial eta squared (ηp 2) values were calculated to estimate the effect sizes. To assess which anatomical regions contributed most to a higher score on the visual PCA rating scale, we conducted two binary logistic regression analyses between PCA groups (dependent variable): first, between PCA-0 and PCA-1; second, between PCA-1 and combined PCA-2/PCA-3. For these logistic regression analyses, volumes of the six structures were transformed to z-scores and included as independent variables. PCA scores were coded such that odds ratios reflected the increased risk associated with a higher PCA-score per standard deviation smaller volume of a specific structure. We used the forward conditional method. The level of significance was set at P < 0.05.

Voxel-based morphometry

Voxel-based morphometry was performed using a modified pipeline in SPM8 (Functional Imaging Laboratory, University College London, London, UK). After tissue segmentation, images were rigidly aligned. Next, a “DARTEL” GM template of all acquisitions was created by non-linearly aligning the GM images of all participants to a common space [19]. DARTEL is an average group-specific template to increase the accuracy of inter-subject alignment. Native GM and WM segmentations were spatially normalised to the DARTEL template by applying the individual flow fields of all acquisitions, using modulation to compensate for volume changes resulting from compression and/or expansion. Images were smoothed using a 4-mm full-width at half maximum (FWHM) isotropic Gaussian kernel. Images were visually inspected at every processing step.

To localise GM differences, voxelwise statistical comparisons between groups were made using a full factorial design with PCA as a factor with independent levels, unequal variance, TIV as a covariate, an absolute threshold of 0.2, and implicit masking. The following post-hoc pairwise comparisons were made: PCA-0 was compared with PCA-1 and with PCA-2; PCA-1 was compared with PCA-2 and PCA-3 together (PCA-2/3). The statistical significance threshold was set to P < 0.05 with family-wise error correction (FWE) at the voxel level.

Results

Images of 357 patients (229 AD patients, 128 patients with subjective memory complaints) were available for analysis. Patients were categorised according to PCA rating scores, resulting in a study sample of 122 patients with PCA-0, 143 patients with PCA-1, 79 patients with PCA-2, and 13 patients with PCA-3.

Demographic data according to PCA groups are summarised in Table 1. AD patients were predominantly classified as PCA-1 (47 %), whereas most of the patients with subjective memory complaints (66 %) were in the group PCA-0. MTA and GCA scores were lowest in PCA-0 and highest in PCA-3 group (P < 0.001). Patients with any degree of PCA had lower total brain volume (TBV) than patients categorised in PCA-0 (P < 0.001).

Table 1 Demographic characteristics

Analysis of variance (ANOVA) showed that total GM volume of the whole parietal ROI differed among the four PCA groups (P < 0.001) with the largest volumes in PCA-0 and the smallest volumes in PCA-3 (Table 2; Fig. 2). MANOVA for the six posterior cortical structures revealed a significant main effect of the PCA rating scale: All structures except the posterior cingulate differed among the four PCA groups (posterior cingulate gyrus: P = 0.054, ηp 2 = 0.021; postcentral gyrus: P < 0.001, ηp 2 = 0.30; superior parietal gyrus: P < 0.001, ηp 2 = 0.29; inferior parietal gyrus: P < 0.001, ηp 2 = 0.35; angular gyrus: P < 0.001, ηp 2 = 0.25; precuneus: P < 0.001, ηp 2 = 0.30). Bonferroni-corrected post-hoc pairwise comparisons revealed that the volumes of these five structures (all except the posterior cingulate gyrus) differed among all PCA group pairs, except between PCA-2 and PCA-3 (Fig. 3, Table 2). The small group size of the PCA-3 group limits the statistical power of comparisons with this group.

Table 2 Grey matter (GM) volume of parietal structures
Fig. 2
figure 2

Boxplot of GM volume of the whole parietal ROI of the four different PCA groups. Volumes differed significantly among all PCA groups (P < 0.001)

Fig. 3
figure 3

Boxplots of GM volumes of the six different structures for each PCA group. Volumes of all structures differed among all PCA groups except between PCA-2 and PCA-3. The volumes of the posterior cingulate did not differ among any of the PCA groups. Coloured structures: red;, precuneus; green, superior parietal gyrus; dark blue, postcentral gyrus; yellow, inferior parietal gyrus; light blue, angular gyrus; copper, posterior cingulate gyrus

To investigate which structures contributed most to the discrimination between scoring levels of the PCA rating scale, we used binary logistic regression. Comparing PCA-0 and PCA-1, logistic regression (stepwise forward) revealed that the inferior parietal gyrus (OR = 3.7, 95 % CI = 1.5–9), the precuneus (OR = 2.2, 95 % CI = 1.2-4.1), and the angular gyrus (OR = 0.4, 95 % CI = 0.2–0.9) contributed independently to a higher score. Comparing PCA-1 and PCA-2/3, the inferior parietal gyrus (OR = 5.1, 95 % CI = 2.3–11.6) and the angular gyrus (OR = 0.5, 95 % CI = 0.2–0.99) contributed the most to the higher PCA score. Odds ratios higher than 1 reflect a negative relationship between volume of the area and the PCA score, i.e., the smaller the volume of an area, the higher the odds of a higher PCA score.

VBM results

Voxel-based morphometry analyses showed that the visual PCA rating scale discriminates well between atrophy (PCA-1, PCA-2) and no atrophy (PCA-0) in the parietal brain regions considered by the rating scale and that with increasing parietal atrophy more atrophy of the whole brain was detected. Compared with PCA-0, the PCA-1 group had less GM in the medial cingulate gyrus, middle occipital gyrus, insular cortex, hippocampus, inferior frontal gyrus, precentral gyrus, angular gyrus, and middle temporal gyrus (P < 0.05 FWE; Fig. 4). Compared with PCA-0, PCA-2 had less GM in the precuneus, postcentral gyrus, and cerebellum (P < 0.05 FWE). Compared with PCA-1, moderate to severe PCA (PCA-2/3) had less GM in the right precuneus, supplementary motor area, fusiform gyrus, and medial temporal lobe (P < 0.05 FWE; Fig. 4).

Fig. 4
figure 4

Voxel-wise comparisons between PCA-0 vs. PCA-1 showed that the visual PCA rating scale discriminates well between atrophy and no atrophy (above) as well as between minimal and moderate/severe atrophy (PCA-1 vs. PCA-2 and PCA-3, below). In particular, higher scores on the visual scale reflect specific atrophy of the posterior cortex. Brighter colours indicate higher t-values. Figures are displayed with a threshold of P < 0.001, uncorrected

Discussion

We showed with volumetric analysis and VBM that the visual PCA rating scale reliably reflects GM atrophy in parietal cortical regions. There was a clear separation between brains rated as having PCA and those rated as having no atrophy. Moreover, the different severity scores in the rating scale corresponded to different quantitative degrees of atrophy. Finally, the volume of the inferior parietal gyrus in particular affected the visual PCA scoring.

Previous studies showed that the presence of PCA may be a helpful additional imaging marker for AD, especially in patients with an early onset [3, 20], and in distinguishing AD from fronto-temporal dementia (FTD) [7, 21]. The recently developed visual rating scale for PCA detects a wide range of PCA and is a quick, reproducible, and easily applicable tool for the clinical setting [8]. However, it has not been quantitatively validated so far, which may have hampered its clinical applicability.

The current study quantitatively validated the PCA scale at three levels of anatomical detail. First, GM volumes of the entire parietal ROI covered by the scale were smaller for higher scores and differed significantly among all pairs of PCA groups, except between PCA-2 and PCA-3, which may be explained by insufficient power due to the small size of the PCA-3 group (13 patients). Second, the same behaviour was observed for five of the six individual structures. Only the volumes of the posterior cingulate gyrus did not differ among any of the PCA groups, although there was a trend, mainly driven by the subtle difference between PCA-0 and PCA-2. This limited correspondence between PCA rating scale scores and posterior cingulate gyrus volumes is in line with an earlier report where higher PCA scores only moderately corresponded with smaller volumes of the posterior cingulate gyrus [7]. The authors hypothesised that PCA might reflect more than posterior cingulate atrophy alone, which is confirmed by our current findings as the other five anatomical structures did show the expected relation with PCA scores. Another explanation may be that the anatomical proximity of the posterior cingulate gyrus, especially to the retrosplenial cortex and precuneus, makes it rather difficult to assess visually, although limiting its contribution to the PCA score determined by the rater. Finally, the posterior cingulate gyrus is a very small structure, making it more prone to registration errors than larger regions, possibly increasing variability among subjects.

Third, VBM analyses showed that the visual PCA rating scale discriminates well between atrophy and no atrophy in the parietal brain regions considered by the rating scale. The visual PCA rating scale also discriminates well between minimal atrophy and moderate/severe atrophy, reflecting that higher visual rating scale scores correspond to a specific pattern of atrophy in the posterior cortex. The VBM findings complement the volumetric measurements with more anatomical detail. Nevertheless, the volumetric measurements revealed differences among the minimal, moderate, and severe atrophy groups separately, whereas VBM only detected significant differences in the parietal brain regions when comparing PCA-1 with PCA-2 and PCA-3 together. This may be due to differences among atrophy patterns in individual patients: atrophic regions that do not overlap precisely may go unnoticed by VBM but can be detected when quantifying GM volumes in larger ROIs. Our three-level analysis suggests that higher PCA scores are related to smaller posterior GM volumes, with probably some (limited) variation between patients as regards the exact anatomical distribution of those atrophic changes.

Voxel-based morphometry analyses also showed that in this subject group, increasing PCA scores corresponded to more GM atrophy in other brain regions. This finding is driven by the composition of our study group, which is a representative sample of patients attending a memory clinic, and emphasises that isolated atrophy of the posterior cortex is an exception.

The discrimination between no and mild atrophy (PCA-0 vs. PCA-1), as well as the discrimination between mild and moderate/severe atrophy (PCA-1 vs. PCA-2/3), was driven mainly by the inferior parietal gyrus volume, which thus seems to play a major role in the visual scoring of PCA. Inferior parietal gyrus volume was related to cognitive status and predictive of future AD development in a previous study [5]. This is in line with our findings, as the higher the score, the more AD patients are categorised into the PCA groups, making the inferior parietal gyrus an import subregion of the PCA spectrum. In addition to the inferior parietal gyrus, the precuneus was found to contribute to the discrimination between rating scores 0 and 1, which is not surprising as it is one of the largest structures in the region. Finally, the angular gyrus contributed to the distinction between 0 and 1, and between 1 and 2. It should be noted that in both cases the effect was reversed because of collinearity.

A strength of this study was the comprehensive approach with three levels of anatomical detail, including a carefully applied VBM pipeline with visual checks at each step, allowing clear quantitative and visual validation of the PCA rating scale. A possible limitation of this study is that only one rater rated most of the MRI examinations. To ensure that our results can be generalised, our study protocol included an additional rating by another rater of 10 % of all acquisitions (each tenth consecutive acquisition), as well as of all acquisitions where there was doubt about the scoring. The inter-rater reliability between the two raters was very high with a weighted kappa (quadratic weights) of 0.85 for left and right scores, indicating the generalisability of our results. Another possible limitation is that, strictly speaking, the visual rating scale by definition also assesses the widening of the sulci in the posterior region. However, widening of the sulci is difficult to quantify reliably as the inner boundary of the skull is notoriously difficult to detect on 3D T1 images owing to the limited contrast between bone and CSF. Nevertheless, GM atrophy is supposed to underlie the widening of the sulci, which was reliably measured in this study. In order to limit the number of patient groups in our comparisons, and because most of the patients had equal PCA rating scores for their left and right hemispheres, we only included subjects with symmetric PCA rating scores. Nevertheless, there does not appear to be a plausible biological reason why the current validation of the PCA rating scale would not generalise to asymmetric scores. In spite of the large total number of patients (357), there were only 13 patients in the PCA-3 group, hampering the detection of any putative GM volume differences with the PCA-2 group because of low statistical power.

The findings of the current study have important implications. As PCA has been shown to be an additional marker for AD and helps distinguish AD from FTD, it is important to use the visual rating scale in daily practice. We demonstrated that the visual rating scale for PCA reliably reflects GM atrophy in posterior regions. Because alternative approaches involving quantitative image post-processing techniques are time-consuming, require sophisticated post-imaging analysis, and may be variable across pulse sequences, the simplicity of this visual rating scale has a great advantage for clinical practice, making it a useful tool in the daily radiological assessment of dementia.