Introduction

Formal neuropsychological assessment reveals mild cognitive impairment (MCI) in one out of every four non-demented patients with Parkinson’s disease (PD) (Aarsland & Kurz, 2010; Litvan et al., 2011), a diagnosis that strongly predicts subsequent dementia (PDD) (Pedersen et al., 2013; Hoogland et al., 2017; Pigott et al., 2015). An accurate identification of PD patients at high risk of rapid cognitive decline is paramount to ensure adequate clinical care, including reliable prognostic information for individual and populations, as well as appropriate sample selection for upcoming trials targeting cognitive impairment.

To progress in the validation of MCI in PD as a clinical marker, criteria for PD-MCI have been standardized by the Movement Disorder Society (MDS) (Hoogland et al., 2017; Litvan et al., 2012). Accordingly, PD-MCI can be diagnosed either as level-I, by using recommended global cognitive instruments with appropriate psychometric validation in PD (Skorvanek et al., 2018; Hoogland et al., 2018) or as level-II, with a comprehensive examination using two neuropsychological tests for each of five cognitive domains (Litvan et al., 2012). The validity of the diagnosis of PD-MCI as a predictor of conversion to PDD, according to level-II criteria, has been recently established (Hoogland et al., 2017). However, the consistency and comparability for detecting cognitive decline between level I and level II diagnosis, and between different recommended level-I instruments (Skorvanek et al., 2018), is less known (Hoogland et al., 2018).

Despite methodological heterogeneities, the clinical diagnosis of cognitive deterioration and PD-MCI has been consistently associated to early structural and functional neuroimaging changes (Pagonabarraga et al., 2013; González-Redondo et al., 2014; Mak et al., 2015; Gasca-Salas et al., 2016; Gee et al., 2017). Given that clinical classification is a necessary first step in any neuroimaging study regarding cognition in PD, there is a need to evaluate the association between clinical assessment and the neural correlates of PD-MCI. In the present study we aimed to explore neuroimaging changes in a prospective sample of non-demented PD patients with PD-MCI at accepted per-domain level II testing, as well as using recommended Level-I scales Montreal Cognitive Assessment Scale (MoCA) and Parkinson’s Disease-Cognitive Rating Scale (PD-CRS) (Skorvanek et al., 2018). We hypothesized that if level I and level II diagnosis of PD-MCI are equivalent, a common pattern of gray matter loss and network disruption should emerge.

Methods

Participants

Fifty-seven patients with early to mid-stage idiopathic PD regularly attending the Movement Disorders Outpatient Unit and who were willing to participate in this study were prospectively recruited. Inclusion criteria were diagnosis of PD according to the United Kingdom PD Society Brain Bank (Hughes et al., 1992) and exclusion criteria were: 1) presence of dementia, according to the MDS-PDD Criteria (Emre et al., 2007); Hoehn & Yahr scale >III; 2) presence of any other significant psychiatric, neurological or unstable systemic comorbidities; 3) pathological MRI findings beyond mild white matter hyperintensities, or not compatible with PD in MRI FLAIR sequence; and 4) presence of head motion or other MRI artifacts. Two patients were excluded because of dementia, and two were excluded after reviewing MRI images for quality control.

Cognitive assessment

Patients were assessed using a battery of neuropsychological tests to achieve level I and level II classification. For level I diagnosis we employed the MoCA and the PD-CRS, both endorsed instruments by the MDS (Skorvanek et al., 2018). For the PD-CRS we adopted the cut-off score of <83 points (Fernández de Bobadilla et al., 2013). For the MoCA we employed the recommended cut-off score of <26, which has shown the best trade-off between sensitivity and specificity as a screening tool as measured by the Youden index (Dalrymple-Alford et al., 2010). We also explored a lower cut-off score of <24 points for the MoCA, which represents the optimal diagnostic value in the same study at the cost of lower sensitivity. For level II classification we administered a comprehensive neuropsychological battery that included two recommended neuropsychological tests per cognitive domain. Attention and working memory were assessed using the Digit Span Backward and Trail Making Test-A; executive dysfunction was measured using the Trail Making Test B and phonemic fluency; visuospatial function was determined by the Rey-Osterrieth complex figure-copy- (ROCFT) and the Visual Object and Space Perception Battery (VOSP) (number location); language function was evaluated using the Boston Naming Test and the Token Test; and memory function was assessed using the Free and Cued Selective Reminding Test (FCSRT) and the ROCFT According to established MDS-II criteria (Litvan et al., 2012), patients were classified as PD-MCI when they were 1.5 standard deviations (SD) below the Spanish normative data (Peña-Casanova et al., 2009) in two tests within a single cognitive domain, or in two tests within two different cognitive domains. Although the MDS-Task Force proposes a cut-off that ranges between 1 and 2 SD, we chose 1.5 SD as recommended by the MDS Study group for the Validation of Mild Cognitive Impairment in Parkinson Disease, which considers that impairment beyond 1.5 SDs represents clinically meaningful decline (Hoogland et al., 2017), and because decline from this cut-off better detects the risk of conversion from PD-MCI to PDD (Wood et al., 2016). Finally, Depression and Anxiety were assessed using the Hospital Anxiety and Depression Scale (HADS).

MRI acquisition and preprocessing

All participants had available structural and resting-state functional MRI. T1-weighted scans were acquired in a 3 T Philips Achieva, and were performed using a dedicated axial T13D-magnetization-prepared rapid gradient-echo (T13D-MP-RAGE) MRI: Repetition time/Echo time (TR/TE) 500/50 milliseconds, flip-angle = 8°, field of view (FOV) = 23 cm, an in-plane resolution of 256 × 256 and 1-mm slice thickness. Resting-state BOLD images during 12 min were also obtained. (TR = 2000 ms, FOV 240 mm, voxel size 3 mm, TE = 30 ms, flip angle 78°).

MRI scan acquisition, preprocessing pipeline and analysis of voxel-based morphometry (VBM) and functional connectivity (FC) have been described in our previous work (Martinez-Horta et al., 2016; Aracil-Bolaños et al., 2019). Briefly, gray matter volume (GMV) imaging data were obtained from T1-weighted images using a voxel-based morphometry (VBM) approach in SPM12 (http://www.fil.ion.ucl.ac.uk/spm/). The original T1-MRI images were segmented to obtain their corresponding gray matter tissue probability maps. These maps were then spatially normalized into 2x2x2mm MNI space using DARTEL (Ashburner, 2007). Finally, an isotropic smoothing of 8 mm full-width at half maximum (FWHM) was applied.

Cortical thickness (Cth) analysis was performed using the FreeSurfer 6.0 software package (https://surfer.nmr.mgh.harvard.edu/). The procedure of cortical reconstruction of structural MRI images has been fully described elsewhere (Fischl & Dale, 2000). Briefly, white matter and pial surfaces were generated following image intensity gradients, from which a Cth measure was obtained for each vertex of the surface. On visual inspection, no major surface reconstruction errors were observed in our sample. Cth maps were transformed to fsaverage space and smoothed using a Gaussian kernel of 15 mm FWHM.

Functional imaging pre-processing was performed using CONN v14p software and its standard processing pipeline (described in depth in (Whitfield-Gabrieli & Nieto-Castanon, 2012)). Spatial preprocessing of functional volumes included realignment, normalization, and smoothing (8-mm FWHM Gaussian filter), using CONN default parameter choices.. Anatomical volumes were segmented into gray matter, white matter, and CSF areas, and the resulting masks were eroded (one voxel erosion, isotropic 4-mm voxel size) to minimize partial volume effects. The temporal time series characterizing the estimated subject motion (three-rotation and three translation parameters, plus another six parameters representing their first-order temporal derivatives), as well as the BOLD time series within the subject-specific white matter mask (ten PCA parameters) and CSF mask (ten PCA parameters), were used as temporal covariates and removed from the BOLD functional data using linear regression, and the resulting residual BOLD time series were band-pass filtered (0.01 Hz < f < 0.10 Hz). A random subset of 256 voxels (the same voxels across subjects and sessions) was used to compute the sample distribution of voxel-to voxel BOLD signal correlation values separately for each subject and session, before and after removal of the defined temporal covariates. We did not employ global signal regression, since it can introduce artifactual biases, remove potentially meaningful neural components, and introduce confounding effects across populations (for more details, see (Aarsland & Kurz, 2010)). The entire matrix of ROI-to-ROI functional connectivity values (bivariate correlation measure) was computed for each subject using the set of functional brain networks described in the Shirer atlas (Shirer et al., 2012) and available at https://findlab.stanford.edu/functional_ROIs.htmlwas introduced in CONN, focusing on the Default-Mode, Central Executive and Salience network regions of interest (ROI) as described in our previous work (Aracil-Bolaños et al., 2019) We calculated the between- and within-network connectivity of three large-scale cognitive networks-Default-Mode (DMN), Salience (SN) and Central Executive (CEN) networks- using built-in CONN scripts (conn_withinbetweenROItest) for each subject.

Finally, for graph theoretical analysis, network construction, graph metrics and statistical analysis were performed using Graphvar v2.01 (Kruschwitz et al., 2015). The adjacency matrices were constructed using a cost threshold of .15, which represented the most adequate trade-off between global and local efficiency in our sample and thus a good measure of small-worldness (see Supplementary material, Figure 4). We analized weighted node degree, betweenness centrality and local efficiency, widely accepted graph-theoretical metrics in brain networks (Filippi et al., 2013). Gephi v 0.92 was used for rendering purposes (Bastian et al., 2009).

Statistical analyses

For the VBM analysis, the resulting GMV images were entered into a voxelwise, second-level, two-sample t-test analysis between Parkinson’s disease normal cognition (PD-NC) and PD-MCI patient groups to determine GMV differences. This statistical model included age, sex, Unified Parkinson’s Disease Rating Scale part III (UPDRS III) and total intracranial volume (TIV) as covariates of no interest. Results were considered significant using a p < 0.005 uncorrected threshold commonly used in the structural neuroimaging literature (Watanabe et al., 2013; Goldman et al., 2014). The resulting statistical maps were represented in three-dimensional renders available in the MRIcron tool (Rorden et al., 2007). For Cth, Vertexwise data was introduced into a generalized linear model (GLM) to compare cognitive groups, using age, sex and UPDRS III as covariates. Results were considered significant using a p < 0.005 uncorrected threshold.

For the functional connectivity comparisons between the cognitive networks, we performed a two-sample T-test to compare the cognitive groups according to each scale using age, sex and UDPRS III scores as covariates of no interest. The results were considered significant using a p < 0.05 threshold. For graph theoretical metrics, the statistical analysis was performed with the built-in General Linear Model tool in Graphvar, using age, sex and UPDRS III as covariates of no interest, and permutation testing to control for multiple comparisons (n = 10.000) with a statistical threshold of p < 0.05.

Results

Clinic and sociodemographic data

Fifty-three patients with early-to-mid stages of PD were included (age 68.4 ± 7 years; disease duration 6.4 ± 3 years, MDS-UPDRS III 23.5 ± 9). Patients in our sample showed no significant anxiety or depression (HADS-A 3 ± 2.7, HADS-D 3 ± 2.8) Eighteen patients (33%) were classified as PD-MCI using MDS level II criteria. Using level I classification, 19 patients (35%) were classified as PD-MCI according to the PD-CRS and 32 patients (60%) according to the MoCA. Of note, an alternate cut-off score of <24 points for the MoCA classified 18 patients as PD-MCI (30%). Differences in age, sex, education, disease duration, disease severity, and levodopa equivalent daily dose (LEDD) are shown in Table 1.

Table 1 Demographic data of the population according to the three neuropsychological classifications

We also explored the agreement between the global scales and MDS level II criteria (Table 2). Data showed that while MoCA captured a large proportion of PD-MCI patients identified by level II criteria (77% sensitivity, versus 61% for the PD-CRS), it did so by accurately diagnosing less than 50% of PD-NC patients (48% specificity). The negative predictive values (NPV) of MoCA and PD-CRS compared to level II were similar (79% for the PD-CRS to 81% for the MoCA) but positive predictive value (PPV) was higher for the PD-CRS (57% to 43%). Using the alternate <24 cut-off for the MoCA, specificity increased at the expense of decreased sensitivity (80% and 56% respectively) and predictive values compared to gold-standard level II approached but did not improve on the PD-CRS (PPV 56%, NPV 75%). Table 3 shows the number of subjects in which the global scales agree with Level-II assessment, and subdomain analysis, in which multidomain involvement was shown to be predominant in both level I classifications.

Table 2 Sensitivity, specifity and predictive values of the Level-I classifications according to Level-II gold standard
Table 3 Agreement between Level-I and gold-standard Level-II classifications methods in our sample

Voxel-based morphometry

PD-CRS and Level-II criteria showed overlapping areas of gray matter decrease in the precuneus region of the DMN in PD-MCI patients. For this statistical level, the PD-CRS showed additional reductions in gray matter located in the anterior cingulate cortex and bilateral temporal cortices, while level-II highlighted a loss of gray matter in bilateral superior frontal cortices. The MoCA showed a more restricted reduction of gray matter in the anterior cingulate cortex and right inferior temporal cortex. None of these areas overlapped with those found with the other two classifications (Fig. 1). For the alternate cut-off score of <24 points, the MoCA showed gray matter reductions in bilateral fusiform gyrii and right inferior frontal gyrus (Supplementary material, Figure 2).

Fig. 1
figure 1

Voxel-based morphometry showing structural correlates of the three cognitive classifications

Cortical thickness

In cortical thickness maps (Fig. 2), level-II criteria showed isolated areas of cortical thinning in the left precuneus and right inferior frontal gyrus. The PD-CRS showed widespread and marked cortical thinning in relevant cognition related areas, with bilateral involvement of the temporo-parieto-occipital junction, posterior superior temporal gyrus, precuneus, orbitofrontal gyrus, and anterior, mid and posterior cingulate cortices. The MoCA showed less extensive cortical thinning, restricted to areas of the anterior and posterior cingulate cortex, and left temporo-occipito-parietal junction. Alternatively, a cut-off score of <24 points showed a similar pattern of cortical thinning highlighting more severe cortical thinning in anterior and posterior cingulate cortex, but also including the right precuneus (Supplementary material, Figure 3).

Fig. 2
figure 2

Cortical thinning in PD-MCI compared to PD-NC patients according to level II, PD-CRS and MoCA

Functional connectivity

Comparing functional connectivity within and between networks, PD-CRS showed a distinct pattern of higher salience network connectivity in PD-NC patients, whereas the rest of the large-scale networks showed no differences between cognition groups. Across networks, PD-MCI patients showed higher coupling between the DMN and both SN and CEN, indicating lower anticorrelations between these normally antagonistic networks. None of these changes were significant for the MoCA (<26,<24) or level-II classification (Fig. 3). Details about the connectivity figures can be found in Supplementary material, Table 1.

Fig. 3
figure 3

Functional connectivity findings across level I and level II assessment

Graph theory

In graph theoretical analysis, the three classifications showed a lower weighted node degree in key nodes of the SN for the PD-MCI group (Fig. 4). Number and weight of connections were lower in the right supramarginal gyrus, mid frontal gyrus and the right anterior insula, whereas the posterior insula showed higher values in PD-MCI patients. Local efficiency was also lower in bilateral anterior insulae, mid frontal gyrus and right supramarginal gyrus, while PD-MCI patients showed an higher values in the mid cingulate cortex.

Fig. 4
figure 4

Graph-theoretical findings across level I and level II assessment

Weighted betweenness centrality was also found to be lower in the right mid frontal gyrus, but of note, some of the changes in centrality followed an opposing trend, with key nodes of the DMN showing higher centrality in PD-MCI patients. A higher number of weighted shortest paths were found to cross the left and right precuneus in the PD-MCI group, thus showing higher centrality in these key nodes.

Detailed information about these metrics can be found in Supplementary material, Tables 2, 3 and 4.

Discussion

In this study, we aimed to explore the neural correlates associated with the different accepted methods for diagnosing PD-MCI. Although differences were found between the gold-standard level II and the global scales in level I regarding patient classification and proportion of PD-MCI patients in the sample -as shown in Tables 1 and 2, these were expected given the intrinsic differences between the classification methods (Skorvanek et al., 2018). Far from intending to outline which method produced the “best” correlates, we sought to investigate whether the diverse assessment tools converged in defining the structural and functional hallmarks of PD-MCI. To this effect, we have observed both commonalities and divergences in the neuroimaging findings associated with MDS recommended criteria.

On the structural analysis, both PD-CRS and MDS level II classifications showed overlapping loss of gray matter in the precuneus as a shared feature of PD-MCI, while the PD-CRS and the MoCA showed grey matter atrophy and cortical thinning in the anterior cingulate gyrus. While some studies have found no differences in cortical thickness in PD-MCI compared to PD-NC (Mak et al., 2014), our convergent findings replicate previous studies showing cortical thinning in the precuneus, supramarginal gyrus, and angular gyrus (Segura et al., 2014). The precuneus is a main component of the DMN (Greicius et al., 2003), one of the large-scale neurocognitive networks that has been shown to be a target of a number of degenerative diseases (Seeley et al., 2009). In our sample, these changes are consistent between structural modalities, as the precuneus also showed cortical thinning in both level-II and PD-CRS classifications. The cortical thickness analysis showed that both level-I assessments highlighted the anterior cingulate cortex as a key affected area in the early stages of cognitive impairment. The dorsal anterior cingulate cortex is part of the salience network (Seeley et al., 2007) and as such it plays a pivotal role in the cognitive control of motor selection and execution (Hoffstaedter et al., 2014). To sum up, the three scales showed varying degrees of frontal disruption (superior frontal gyrus for the Level-II classification, and anterior cingulate cortex for the Level-I classifications), but only the Level-I PD-CRS and the Level-II assessments captured both the anterior-executive and posterior cortical deficits that have been reported as neuropsychological hallmarks of the initial stages of PD-MCI (Kehagia et al., 2012; Pagonabarraga & Kulisevsky, 2012). For the MoCA, switching to a lower cut-off score might more closely resemble these findings, as gray matter loss was also found in posterior cortical regions such as the fusiform gyrus, and cortical thinning also involved the right precuneus. Of note, the structural changes disclosed by the PD-CRS were significantly more widespread, involving associative temporo-parietal areas previously reported as key cortical regions involved in the development of PD-MCI (Garcia-Diaz et al., 2014). These more extensive changes could be explained because although the PD-CRS and Level-II identify a similar proportion of PD-MCI cases in the sample, both assessments only concur in 61% of the patients.

Though tendencies emerge from the structural analyses, these findings suggest that there is still great heterogeneity among recommended cognitive scales and between different studies in PD cognition. This debate is part of an ongoing discussion in the neuroimaging community that ranges beyond PD (Darby et al., 2018), where similar diseases and symptoms seem to stem from the disruption of very different cortical areas. In our functional connectivity analysis, a decline in within-SN connectivity and decrease in normally-occurring anticorrelations between DMN and SN/CEN was observed when using the PD-CRS classification. This lower connectivity in the frontoparietal regions of the salience network in early PD-MCI, as well as an increase in PD-NC, has been previously reported by our group and others (Aracil-Bolaños et al., 2019; Gorges et al., 2015; Christopher et al., 2015) These changes were not observed by the MoCA-at either cut-off score- or level II criteria, which stresses the impact that clinical classifications may have on the definition of neural correlates of PD-MCI, as well as in the search for imaging biomarkers.

The lack of agreement in functional connectivity, where only the PD-CRS showed a consistent alteration in between-network connectivity, contrasted with the results in graph theoretical metrics. Two key heteromodal nodes, the right anterior insula and the right supramarginal gyrus, showed a lower number and weight of connections in PD-MCI patients. The preferential disruption of highly associative areas has been previously described as a feature of the PD subtypes with greater cognitive impairment (Lopes et al., 2016). In particular, the disruption of the salience network seems to constitute a stepping stone in the path to cognitive decline, as previous studies have shown that the reduction in dopamine receptor availability signals the onset of mild cognitive impairment in PD (Christopher et al., 2015). Analysis of local efficiency confirmed a decrease in insular, frontal and supramarginal cortices in the different PD-MCI classifications. Therefore, all the classification methods showed decreases in the node degree of salience network nodes, while changes in other large-scale networks were less apparent. Interestingly, previous works have found an increase in global efficiency and global clustering coefficient in PD-MCI patients associated with decreases in long-range connections (Baggio et al., 2014). Other studies, however, showed that PD-MCI progression contributed to a reduction in the functional segregation of cognitive networks (Lopes et al., 2016). In our model, local efficiency is preserved in those regions that show higher weighted node degrees, and this coincidence in connection strength and organization could reflect compensation mechanisms that wane as disease progresses.

Of note, we also found regions of cognitive networks that show possible functional gains with the onset of cognitive decline. PD-MCI patients showed increases in the betweenness centrality of the precuneus, one of the main nodes of the posterior DMN, that is featured prominently in previous analyses of cognition and behaviour in PD (Lucas-Jiménez et al., 2016; Tessitore et al., 2012; Bejr-kasem et al., 2019). As the structural analysis proves, these areas that show higher centrality in our PD-MCI group are those affected by structural disruptions.

To sum up, two mechanisms of possible compensation are captured by different assessment scales. One seems to affect the node degree of the salience network and is present while patients remain cognitively “normal”, and the other appears in PD-MCI patients as an increase in the centrality of the posterior hubs of the DMN in the face of advancing grey matter loss. However, differences in the obtained neural correlates still persist. Across the three endorsed methods, PD-CRS seems to show the most consistent outline of widespread cortical volume and thickness decreases, coupled with functional connectivity disruptions that extend into architectural network features. These findings are not replicated by the other Level-I instrument, the MoCA, although lowering the cut-off score to 24 points does somewhat bridge the gap between this classification and the two other endorsed methods. The Level-II classification, although showing similar key features in structural precuneus volume loss and node degree decreases in the salience network, does not exhibit the same degree of consistency in the PD-MCI correlates. We believe that the use of different tests at the discretion on the researchers might hamper the ability of Level-II assessment to capture consistent PD-MCI correlates.

We acknowledge several limitations in this study. First, the study could be underpowered to detect differences and similarities in the diverse cognitive scales. Nevertheless, the proportion of patients classified as having PD-MCI using level II and PD-CRS is consistent with data from the literature (Litvan et al., 2011; Svenningsson et al., 2012). Second, an uncorrected statistical threshold was applied throughout the imaging analysis in order to capture changes across different modalities; although this is a threshold commonly employed in neuroimaging of movement disorders (Watanabe et al., 2013; Goldman et al., 2014) results should be considered exploratory. Third, longitudinal studies will be required to properly cement these findings. Strengths of the present work include the multimodal imaging approach that provides complementary evidences of structural and functional disruptions and the comprehensive neuropsychological battery employed that allows for both level I and level II classification.

Conclussions

To sum up, neuroimaging traits that define PD-MCI emerge from both Level-I and Level-II with two relevant implications: first, our clinical classifications seem to reflect converging structural and functional changes in PD-MCI; and second, short assessment Level-I scales retain the ability to identify the neural correlates of the more time-consuming Level-II, although with some particularities.