Introduction

Diffusion tensor imaging (DTI) is useful for the diagnosis of Alzheimer’s disease (AD) at the prodromal stage (mild cognitive impairment; MCI) and at the stage of manifest dementia, as shown in monocentric [4, 7, 8, 12, 29, 37] and multicentric studies [9, 10, 38]. It may especially aid the diagnosis at an early stage, as the DTI parameters mean diffusivity (MD), fractional anisotropy (FA) and mode of anisotropy (MO) indicate neuronal dysfunction at a microstructural level that are assumed to precede macroscopic atrophic changes in AD pathogenesis [6, 8].

Recently, evidence has accumulated indicating that subjective cognitive decline (SCD) could represent the earliest symptomatic stage of AD [21, 26]. To date, only few monocentric DTI studies with varying acquisition parameters and analysis methods investigated microstructural changes in SCD. Compared to cognitively healthy control (CON) participants, these studies reported lower FA and higher MD mainly in the corpus callosum and cingulum, the white matter (WM) of the medial temporal lobe (including parahippocampal WM and WM underlying the entorhinal cortex) and of the medial parietal lobe (including the WM underlying the precuneus as well as the retrosplenial cortex) [19, 25, 30, 33, 40]. Another study found changes in widespread WM regions across the brain [25]. Some of the WM changes predicted subsequent hippocampus atrophy and memory decline [32]. Moreover, altered diffusivity in the posterior cingulum was associated with tau pathology in a sample of participants with SCD or MCI [35].

However, other studies did not detect deviant MD or FA in SCD [23, 40]. The different results could partly be due to the high variability of DTI across different scanners [38]. Obviously, a post hoc comparison of outcomes of different monocentric studies cannot explicitly assess a multicenter effect. In a previous multicentric sample with retrospectively collected data, center accounted for 56–75% of the FA and MD variance [39]. This emphasizes the relevance to investigate the performance of DTI in a multicentric SCD sample. So far, the usefulness of DTI for capturing WM alterations in a multicentric sample has only been addressed in people with MCI who were β-amyloid positive. The results of this multicentric study were promising, revealing a significantly higher accuracy of the DTI parameters compared to the accuracy of volume markers for detecting prodromal AD [9].

Besides the evaluation of group differences, the application of a future biomarker in a multicenter data set provides a more realistic estimate of its potential diagnostic use in the presence of multicenter variability than its evaluation in a monocenter study. Using imaging markers for risk stratification in future clinical trials will likely involve multiple scanners. Therefore, an estimate of the effect of multicenter variance on the achievable levels of accuracy is of high interest.

In the present study, we aimed to assess if microstructural neuronal integrity as measured by DTI is altered at the stage of subjectively impaired cognitive functioning, i.e., before cognitive symptoms are measurable. To investigate this, we used a prospective sample from a multicentric longitudinal study with scan parameters and procedures that were matched across the centers [22]. We hypothesized that DTI could differentiate the diagnostic groups SCD, MCI and AD dementia from CON participants. Secondly, we hypothesized that the variance due to center would be reduced compared to a previous multicentric study that included data that were obtained with varying acquisition parameters [39].

Methods

Participants

We used data from the interim baseline data set of the multicenter DZNE-longitudinal Cognitive Impairment and Dementia Study (DELCODE), conducted by the German Center for Neurodegenerative Diseases (DZNE) [22]. After excluding all cases with insufficient image quality or neurologic conditions, DTI data from 271 participants from nine centers were included (35 AD, 45 MCI, 98 SCD and 93 CON; n = 2–76 participants per center; Supplementary Table S2). The participants underwent a clinical assessment of their cognitive status, including the Mini Mental State Examination (MMSE) [13] and an extensive neuropsychological testing battery [22]. Depressive symptoms were assessed by means of the Geriatric Depression Scale (GDS) [15]. The DELCODE exclusion criteria ensured that no persons were included who had a current major depressive episode, past or present major psychiatric disorders, neurological diseases other than AD, or unstable medical conditions [22].

SCD was defined as a persistent self-perceived cognitive impairment in the absence of objective cognitive impairment, lasting at least 6 months and being unrelated to an acute event [21]. The MCI patients met the core clinical criteria for MCI according to NIA-AA workgroup guidelines [1]. The AD patients had a clinical diagnosis of probable AD dementia according to the National Institute on Aging-Alzheimer’s Association (NIA-AA) workgroups guidelines [27]. The CON participants had no objective cognitive impairment in cognitive tests, no history of neurological or psychiatric disease and did not report self-perceived cognitive decline. All participants or their representatives provided written informed consent. The study protocol was approved by the local institutional review boards and ethical committees of the participating centers. It was conducted in accord with the Helsinki Declaration of 1975.

Image acquisition and preprocessing

The data were acquired from nine Siemens 3.0 Tesla MRI scanners (four Verio, one Skyra, three TimTrio and one Prisma system) using identical acquisition parameters and harmonized instructions. To ensure high image quality throughout the acquisition phase, all scans had to pass a semiautomated quality check during the study conduction, so that protocol deviations could be reported to the study sites, and the acquisition at the respective site could be adjusted. An axial diffusion sequence was measured based on a single-shot echo planar imaging sequence (field of view 240 × 240 mm, matrix size 120 × 120, isotropic voxel size 2 mm, repetition time 12,100 ms, echo time 88 ms, flip angle 90°, number of gradients 60, b values 700 s/mm2 and 1000 s/mm2, number of slices 72, parallel imaging acceleration factor 2). High-resolution T1-weighted anatomical images were obtained using a sagittal magnetization-prepared rapid gradient echo (MPRAGE) sequence (field of view 256 × 256 mm, matrix size 256 × 256, isotropic voxel size 1 mm, echo time 4.37 ms, flip angle 7°, repetition time 2500 ms, number of slices 192, parallel imaging acceleration factor 2).

The T1-weighted anatomical images were segmented into gray matter (GM), WM, and cerebrospinal fluid (CSF) using the Statistical Parametric Mapping (SPM12) (Wellcome Trust Centre for Neuroimaging, London, UK, http://www.fil.ion.ucl.ac.uk/spm/) New Segment toolbox implemented in Matlab 2015a (Mathworks, Natwick). The T1-weighted GM and WM partitions were normalized to the Montreal Neurological Institute (MNI) reference coordinate system using the Diffeomorphic Anatomical Registration Through Exponentiated Lie algebra (DARTEL) algorithm [2] and the default brain template included in CAT12 [24] as target. We created a WM mask by averaging the normalized WM partitions of 45 randomly selected participants equally distributed across all diagnostic groups and applying a threshold of 50% WM density. The DTI scans were preprocessed using the FSL diffusion toolbox (Version 5.0.9, FMRIB, Oxford, UK, http://www.fmrib.ox.ac.uk/fsl/). The images were corrected for eddy currents and head motion. Skull stripping was performed using the Brain Extraction Tool. DTIfit was used to fit the diffusion tensors at each voxel and calculate the MD, FA and MO maps. The DTI maps were then co-registered to the T1-weighted anatomical images. Using the deformation fields that originated from the normalization of the T1-weighted images, the DTI maps were normalized to the MNI reference coordinate system. The normalization quality was checked visually and the scan homogeneity was determined using the standard deviation across the sample. As a last step, we smoothed the DTI maps with a Gaussian kernel of 8 mm. Within the WM mask, we calculated mean FA and MD and MO indices for each participant. Hippocampal GM volume was calculated for each participant by summing up the modulated grey matter voxel values within a mask that had been manually delineated in the reference space according to the international harmonized protocol for hippocampus segmentation [14]. The raw hippocampal volume was proportionally scaled to the total intracranial volume to adjust for head size. For the elastic-net penalized regression, regional DTI mean values were calculated for all 48 major WM tracts of the John Hopkins University-International Consortium of Brain Mapping (JHU-ICBM) DTI atlas [28].

Statistics

Statistical analyses were performed using SPM12 and SPSS (IBM; Version 23). Kruskal–Wallis tests were used to compare age, years of education, MMSE and GDS between the diagnostic groups. The distribution of sex was compared by means of the Chi-square test.

Two-sample t tests were used to compare the WM FA, MD and MO values voxel-wise across the diagnostic groups. The directions of the comparisons were hypothesis-driven (FA: SCD/MCI/AD < CON, MO: SCD/MCI/AD < CON, MD: SCD/MCI/AD > CON). Additional contrasts in the opposite directions were also tested (Supplementary Fig. S4). The variance explained by center (partial η2) was calculated using univariate analyses of variance, controlled for diagnosis, age, gender, years of education. For exploratory purposes, a liberal statistical significance level of p < 0.01 was used, uncorrected for multiple comparisons. Masks were derived that included the voxel clusters (k ≥ 20) that had reached significance (p < 0.01) in the AD vs. CON comparison of FA, MD or MO, respectively. The inclusive masks entered the univariate analyses of variance to assess the center effect within the regions of significant group effects.

We chose elastic net regression to account for multicollinearity among the predictor variables, which would inflate the variance of the estimated regression coefficients [42]. We used the R package “glmnet” to determine the group discrimination accuracy (http://cran.r-project.org/web/packages/glmnet/). Individual analyses were performed for each group of participants with cognitive deficits versus CON. The regional mean values for all three DTI measures FA, MD and MO within the 48 WM tracts of the JHU-ICBM DTI atlas were combined and entered in the analyses. The mixing parameter alpha was set to α = 0.5, specifying the balance of regularization between the ridge penalty and the least absolute shrinkage and selection operator (LASSO) penalty [43]. To define the strength of the regularization, the parameter lambda was determined as the mean of 100 iteratively determined values in a grid search with tenfold cross-validation. The average area under the receiver operating characteristic curve (AUC) was obtained from all runs of a tenfold cross-validation that was repeated 1000 times. Within the cross-validation procedure, the effect of the covariates age, gender and site was partialed out using a linear regression model. The residuals were then entered to estimate the penalized regression model for group separation. Confidence intervals for the AUC were estimated by determining the 2.5 and 97.5 percentiles across all iterations of the cross-validation. The diagnostic accuracy of DTI measures was compared to the accuracy of hippocampus volume as a reference. For obtaining p values, the proportion of iterations with AUC ≤ 0.5 (corresponding to chance level) was determined. For creating boxplots, a cubical region of interest (8 × 8 × 8 mm) was placed in the region that showed the most consistent results across the diagnostic groups and DTI parameters, i.e., the left part of the splenium of the corpus callosum (MNI: x =  − 20, y =  − 50, z = 13) (Supplementary Fig. S1). Analyses were repeated excluding individuals who showed outlier values in these boxplots. For subjects with CSF amyloid status available, we repeated the analyses including only amyloid-positive AD and MCI subjects and amyloid-negative controls. Amyloid-positivity was defined as Aß42/40 < 0.09.

Results

The groups differed significantly regarding age (the CON group being youngest on average), but they did not differ significantly with respect to sex and years of education (Table 1). The MMSE scores differed significantly across all diagnostic groups (p < 0.001) but not between the CON and SCD groups. The GDS scores differed significantly between groups, with the lowest mean score in the CON group, but did not reach a clinically relevant level in any group.

Table 1 Demographic characteristics

Group comparisons in whole brain WM

The voxel-based two-sample t tests revealed significantly higher MD in mainly anterior regions in the SCD group compared to the CON group (Cohen’s d: 0.4–0.5, p < 0.01, uncorrected) (Fig. 1; Supplementary Table S1). No clusters remained significant at a statistical threshold of p < 0.001 (uncorrected). In the AD as well as in the MCI group compared to the CON group, there was higher MD in several regions, including the genu, body and splenium of the corpus callosum, the anterior and superior corona radiata, posterior thalamic radiation, and sagittal stratum, which also remained significant at p < 0.05, FWE corrected.

Fig. 1
figure 1

Voxel-based analysis of MD (slices positioned at z = 28, 18, 9; p < 0.01, k ≥ 20; view from above)

FA was significantly lower in the SCD group compared to the CON group, mainly in the anterior corona radiata, inferior fronto-occipital fasciculus and splenium of the corpus callosum (Cohen’s d: 0.3–0.6, p < 0.01, uncorrected) (Fig. 2; Supplementary Table S1). At p < 0.001 (uncorrected), bilateral clusters in the inferior fronto-occipital fasciculus and splenium of the corpus callosum remained significant (Supplementary Table S1). In the AD group and in the MCI group compared to the CON group, FA was lower in a number of regions, including the genu and splenium of corpus callosum, the anterior corona radiata, and the posterior thalamic radiation, remaining statistically significant at p < 0.05, FWE corrected.

Fig. 2
figure 2

Voxel-based analysis of FA (slices positioned at z = 30, 12, 2; p < 0.01, k ≥ 20; view from above)

MO was significantly lower in the SCD group compared to the CON group in the anterior thalamic radiation, superior longitudinal fasciculus, cingulum and the splenium of the corpus callosum (Cohen’s d: 0.4–0.6, p < 0.01, uncorrected) (Fig. 3; Supplementary Table S1). Clusters in the anterior thalamic radiation, cingulum and splenium of the corpus callosum remained significant at p < 0.001 (uncorrected; Supplementary Table S1). In the AD group and in the MCI group compared to the CON group, MO was mainly lower in the genu and splenium of the corpus callosum and cingulum, remaining significant at p < 0.05, FWE corrected. For SCD vs. CON comparisons, no effects remained statistically significant when correcting for multiple comparisons. Supplementary Fig. S2 shows the distribution of MO across the diagnostic groups (ROI based).

Fig. 3
figure 3

Voxel-based analysis of MO (slices positioned at z = 18, 10, 8; p < 0.01, k ≥ 20; view from above)

Conducting analyses with contrasts in the non-hypothesized directions (i.e., MD: SCD < CON, FA: SCD > CON, MO: SCD > CON) did not result in significant group effects, except for small distributed clusters of significant MO differences, e.g., in the right inferior fronto-occipital fasciculus (T = 2.3, k = 75; Supplementary Fig. S4).

Group discrimination

For the discrimination of the diagnostic groups based on the DTI data, a cross-validated AUC of 67% was reached for separating SCD and CON (p < 0.01), 79% for MCI and CON (p < 0.01), and 93% for AD and CON (p < 0.01) (Fig. 4). When considering a subsample of CSF β-amyloid-positive SCD participants (n = 17) and β-amyloid-negative CON (n = 25), the AUC was 63% (p = 0.11). A subsample of CSF β-amyloid-positive MCI participants (n = 20) versus the β-amyloid-negative CON group reached an AUC of 91% (p < 0.01). A subsample of β-amyloid-positive AD participants resulted in an AUC of 88% (p = 0.01). For hippocampal volume, the AUC for SCD vs. HC was 57% (p = 0.08), 59% for CSF β-amyloid-positive SCD (p = 0.4), 77% for MCI (p = 0.01), 84% for β-amyloid-positive MCI (p < 0.01), 95% for AD (p < 0.01) and 94% for β-amyloid-positive AD (p < 0.01) (Fig. 4). A sensitivity analysis without outliers resulted in comparable AUC values (Supplementary Table S3, Supplementary Fig. S3).

Fig. 4
figure 4

Area under the curve (AUC) for elastic-net penalized logistic regression models

Multicentric variability

Within the whole-brain WM of all participants combined, center explained 10% of the FA variance, 15% of the MD variance and 7% of the MO variance. When restraining the regions of interest to the voxel clusters that were significantly different between the AD group and the CON group (p < 0.01, uncorrected), center explained 10% of the FA, 14% of the MD and 3% of the MO variance (Fig. 5).

Fig. 5
figure 5

Proportion of variance explained by diagnostic group and by center (all participants combined, within regions with a significant group effect)

Discussion

We investigated if DTI revealed deviant brain WM diffusivity in SCD. Compared to CON participants, we found subtle diffusivity differences that were lower than the diffusivity alterations in MCI patients. DTI distinguished SCD from CON participants with a classification accuracy of 67% (p < 0.01).

In addition to FA and MD changes, which have previously been reported in SCD [19, 25, 30, 33, 40], our analyses revealed MO differences. MO has previously been used to assess WM alterations in MCI and AD [7,8,9, 18, 36]. It reflects the diffusion of water molecules, which is restricted by myelinated axons of neurons [41]. It has the advantage of capturing subtle WM changes even in regions with more than one fiber direction [7]. For example, MO reflects linear anisotropy in the corpus callosum and more planar anisotropy in regions with crossing fibres such as the boundary between two major white matter tracts. The diffusion tensor in DTI analyses is a symmetric 3 × 3 matrix that can be described by its eigenvalues and eigenvectors, reflecting the magnitude and direction of diffusion [34]. From this, the scalar indices MO, FA, and MD are derived. MO specifies the shape of diffusion, from planar (MO =  − 1) to linear (MO = 1), so that a decrease in MO reflects a more planar shape of the diffusion tensor [11]. FA corresponds to the degree of directionality. Reduced FA is assumed to reflect a disrupted WM organization caused by microstructural damage, such as axonal degradation [34]. In contrast, MD is the mean of the three eigenvalues and informs about the diffusion rate. An increase in MD is assumed to reflect damage to the membrane integrity [34]. MD is also increased in regions affected by neurodegeneration, as this leads to an expansion of the regions filled with cerebrospinal fluid which are isotropic.

The AD group and the MCI group exhibited significantly higher MD and significantly lower FA and MO in extensive regions, including the corpus callosum and the anterior and superior corona radiata, as expected from previous monocentric [4, 7, 8, 12, 29, 37] and multicentric studies [9, 38].

Some of the monocentric studies on SCD reported slightly stronger diffusivity alterations than our study [19, 25, 30, 33]. In addition to methodical deviances such as using ROI-based approaches, differences may be due to the fact that the definition of the SCD group varied, as these studies were conducted before the conceptual framework for research on SCD was published [21]. The average MMSE scores of the SCD samples in these monocentric studies tended to be lower compared to our study, suggesting a stronger cognitive decline. Notably, the sample sizes of previous studies were considerably smaller than that of the present study (ranging from 16 to 28 SCD participants) and samples may have overlapped across reports from the same group [17, 32, 33, 35].

A multicentric design is necessary for evaluating the use of DTI across several settings. Because homogenized scanning procedures and parameters were applied, we expected stable DTI results across the centers. When we investigated the impact of the multicentric acquisition, we found that significant center effects were still present, yet markedly reduced compared to a previous multicentric study with a naturalistic design [3, 39]. The DTI parameter MO was least affected by center.

In the present study, we reported first results for the cross-sectional baseline data available in DELCODE. The results suggest that subtle white matter changes are present at the very early stage of SCD. These brain changes may be measurable earlier than hippocampal atrophy, which was shown to have a lower AUC for discriminating SCD and β-amyloid-positive SCD from CON. However, it must be kept in mind that the uncorrected significance levels of the voxel-based effects are prone to false-positive effects. The AUC of DTI for detecting SCD (67%, p < 0.01) was higher than that of hippocampal volume (57%, p = 0.08). Its discriminative accuracy was still below the benchmark for a useful biomarker, suggesting that DTI measures alone do not provide a useful early marker of preclinical AD in SCD cases. Subtle white matter differences could (for instance in combination with other markers) aid the identification of individuals with slight cognitive changes who have an increased risk for AD, but not as a stand-alone marker.

As a next step, analyses of the longitudinal data that are currently acquired will be performed to examine the value of DTI as a prognostic marker for the conversion of SCD to MCI and AD. Further studies should also address the relationship of DTI markers and the severity of self-reported cognitive impairment in addition to objectively measured memory change. As a limitation, the current sample size did not allow for matching age and gender across centers. This is especially relevant as age is related to white matter changes [16, 31] and was significantly different between the groups; on average, SCD participants were 2.8 years older than the CON participants. As matching was not possible, we controlled for gender and age in the statistical models. In the progress of the longitudinal DELCODE study, a future sample will allow for more homogeneous subsamples to be analyzed.

DTI can detect microstructural changes before substantial atrophy has taken place. It is a promising imaging marker for detecting early AD [5, 9] and has been shown to be highly sensitive to β-amyloid pathology [20]. In our study, we reported mild DTI WM deviations in SCD. Given the replication in larger longitudinal studies, these WM changes could—in combination with other markers—aid the identification of individuals with a higher risk for AD in clinical dementia studies.