Introduction

The correct identification of people at risk to develop Alzheimer’s disease dementia (ADD) is crucial in terms of early diagnosis (prodromal Alzheimer), prognosis, and treatment. Research efforts to improve the positive predictive value of progression from amnestic mild cognitive impairment (aMCI) to ADD are needed, since many trials on disease modifying drugs fail because of low progression rates. In this sense, efforts to correctly identify individuals with prodromal AD are of utmost importance (McGeer and McGeer 2013; Molinuevo et al. 2014). Even so, it is often difficult to identify who among those with aMCI will progress to ADD based solely on neuropsychological evaluations (Petersen et al. 2014).

Different biomarkers demonstrated to be useful diagnostic tools in vivo, and are now gaining attention in the clinical practice after being incorporated into research criteria for AD (Albert et al. 2011; Dubois et al. 2014; Morris et al. 2014). The combination of decreased concentrations of beta amyloid protein (Aβ1–42) and increased concentrations of hyperphosphorylated Tau protein (p-Tau181) in the CSF is related to AD pathology and may predict the progression to ADD (Jack et al. 2010, 2013).

Many studies have evaluated the value of CSF biomarkers in predict progression from MCI to ADD (Ferreira et al. 2014b; Prestia et al. 2015; Ritchie et al. 2014; Tondelli et al. 2015; Vos et al. 2013). However, no study has evaluated the relative risk to develop ADD in an exclusively amnestic MCI patient sample in a 3-year follow-up. This study consisted of measuring the concentrations of both CSF Aβ1–42 and p-Tau181 proteins in a sample of exclusive amnestic MCI subjects and analyzing to which extent these biomarkers could predict progression to ADD in a 3-year follow-up. The relationship and agreement of these biomarkers with neuropsychological tests were then verified.

Materials and Methods

Participants

Thirty-three subjects diagnosed with aMCI from a memory clinic in Southern Brazil were included in the study. All individuals were evaluated by a behavioral/geriatric neurologist and the National Institute on Aging-Alzheimer’s Association (NIA-AA) criteria for MCI (Albert et al. 2011), corroborated by the Clinical Dementia Rating scale (CDR) (Morris 1993). aMCI diagnoses were based on criteria by Petersen (2004) and Petersen et al. (2014). We excluded those who had diagnosis of dementia, stroke, Parkinson’s disease, depression or other neurological conditions that could possibly cause cognitive impairment. Furthermore, we also excluded individuals with normal cognition. A questionnaire was applied in order to obtain sociodemographic data.

Standard Protocol Approvals, Registrations, and Patient Consents

This study was approved by the local research ethics committee and is in accordance with the Declaration of Helsinki. All participants gave their written consent in order to participate in this study.

CSF Analyses

All participants underwent a fasted lumbar puncture to measure Aβ1–42 and p-Tau181 proteins in the CSF. CSF (5 mL) was collected in polypropylene tubes and immediately centrifuged (4000g × 10 min) at 4 °C and then stored at − 80 °C until the assay. Commercially available ELISA kits were used to determine the levels of Aβ1–42 (Innotest β-amyloid1–42, Fujirebio-Europe, Gent, Belgium) and p-Tau181 (Innotest Phospho-Tau181P, Fujirebio-Europe). Tests were performed in duplicate and the laboratory technician was blinded to any clinical and demographic information about the participants.

Neuropsychological Tests

At baseline, participants were submitted to a battery of neuropsychological evaluations performed by a certified neuropsychologist. We utilized the validated adaptation for Brazilian patients version of the verbal memory test subscore of the Consortium to Establish a Registry for Alzheimer’s Disease (CERADs) (Morris et al. 1989; Bertolucci et al. 1998). The final score was composed by a possible sum of 30 points for word list learning, 10 points for word list recall, and 10 points for word list recognition (total of 50 possible points) (Chandler et al. 2005). The Geriatric Depression Scale (GDS) 15-item version (Yesavage et al. 1982), which was translated and validated to Brazilian Portuguese (Almeida and Almeida 1999), was utilized to screen for depressive symptoms.

After 3 years of follow-up, subjects were reassessed in order to verify whether or not they progressed to ADD. Evaluation was performed blindly by the same behavioral neurologist and neuropsychologist, and included a structured clinical interview, a full neurological examination, and neuropsychological evaluation. At this time, they were analyzed by NIA-AA criteria for ADD (McKhann et al. 2011), corroborated by the CDR, and were classified in two groups: progressors (P), and non-progressors (NP) to ADD. The neuropsychological tests, besides those performed at baseline, included the Montreal Cognitive Assessment (MoCA) (Memória et al. 2013; Nasreddine et al. 2005), Boston Naming Test (BNT) (Kaplan et al. 1983; Miotto et al. 2010), verbal fluency test, animals category (VFT) (Brucki et al. 1997; Isaacs and Kennie 1973) and Constructional Praxis (CP) (Bertolucci et al. 2001; Rosen et al. 1984), all translated and validated to Brazilian Portuguese. Instead of MMSE, we preferred to utilize MoCA because it has been shown to be more accurate in detecting MCI than MMSE (Trzepacz et al. 2015).

Statistical Analyses

Statistical analyses were performed using the Statistical Package for Social Sciences (SPSS) version 20. To compare sex distribution, a Chi-square test was performed. Mann–Whitney tests were performed to compare clinical, neuropsychological, and biomarker data. Receiver operating characteristic (ROC) curve analyses were used to obtain the area under the curve (AUC) and other parameters. The optimal cutoff values were determined by calculating the maximal sensitivity and specificity [i.e., maximizing the Youden index (Youden 1950)]. The Generalized Estimating Equation Model with the Bonferrroni post hoc test was applied to verify the mean of CERADs score between groups, discriminated by time. Linear regressions were performed to evaluate the relationship among ΔCERADs and CSF biomarkers. Values were adjusted for age and sex. All results were showed as confidence intervals CI 95% and P values < 0.05 were considered significant.

Results

Thirty-one of 33 (93.9%) participants completed the follow-up. Of these, twelve (38.4%) progressed to ADD after 3 years. Mean age was 68.1 ± 5.1 years and 67.75% were female. No participants deceased during the follow-up. Table 1 lists the demographic characteristics and baseline neuropsychological scores between P and NP. There was no significant difference between groups when baseline characteristics were analyzed.

Table 1 Patients characteristics and baseline neuropsychological scores

Table 2 lists the neuropsychological scores after 3 years of follow-up and CSF biomarker features. Aβ1–42 concentrations in CSF were significantly different between groups (P < 0.001), as opposed to p-Tau181 concentrations (P = 0.101). The p-Tau181/Aβ1–42 ratio also displayed a significant difference between groups (P < 0.001). The CERADs and MoCA tests differed significantly at endpoint as well (P = 0.006 and P = 0.025, respectively).

Table 2 Relationship between baseline CSF Aβ1–42, p-Tau181, and neuropsychological scores at follow-up

In Table 3, cutoff values were determined by Youden’s index, which stretches the maximum potential effectiveness of a biomarker. For Aβ1–42, the threshold value was 618.5 pg/mL, with 91.7% sensitivity (CI 95% 64.6–98.5) and 94.7% specificity (CI 95% 75.4–99.0). At this cutoff level, 29 of 31 individuals (93.5%) were correctly classified in P or NP. The relative risk of developing ADD in those directly exposed (Aβ1–42 < 618.5 pg/mL) was 17.4 times higher than among those who were not exposed (CI 95% 2.5–118.2; P = 0.003). In other words, those who had values above this concentration had their risk of developing ADD reduced by 94.3%. The optimal cutoff value for the p-Tau181/Aβ1–42 ratio was 0.135, with 66.7% sensitivity (CI 95% 39.0–86.2) and 100% specificity (CI 95% 83.2–100.0). At this cutoff value, 27 of 31 individuals (87%) were correctly classified as P or NP. The relative risk of developing ADD in those whose p-Tau181/Aβ1–42 ratio was higher than 0.135 was 5.7 times greater when compared to those with a lower p-Tau181/Aβ1–42 ratio (CI 95% 2.3–14.0; P < 0.001).

Table 3 ROC curve parameters of CSF biomarkers in predicting progression from aMCI to ADD

As measure of experimental validity, Fig. 1 displays the mean and standard errors of the CERADs score for each group discriminated by the baseline and follow-up times in years (baseline in 2013; follow-up in 2016). When the time was set and the groups were compared, there was no difference in the CERADs score of the NP group (P = 0.437), but there was a significant difference in the P group (P < 0.001). When the groups were fixed and times were compared, there was no difference between groups in 2013 (P = 0.575), but, in 2016, a significant difference between P and NP was observed (P = 0.004).

Fig. 1
figure 1

Line chart of the means and standard errors of the CERADs score of each group. There was an association between variables (P < 0.001). When the time was set and the groups were compared, there was no difference in the NP group (P = 0.437) but there was a significant difference in the P group (P < 0.001). When the groups were fixed and times were compared in 2013 (baseline), there was no difference between groups (P = 0.575). When times were compared in 2016 (follow-up), there was a statistical difference between groups (P = 0.004). P progressors, NP non-progressors, CERADs Consortium to Establish a Registry for Alzheimer’s Disease. Generalized Estimating Equation Model with the Bonferroni post hoc test was used

ΔCERADs was defined as the difference between CERADs score from 2016 to 2013. ΔCERADs was statistically different between P and NP groups (P = 0.002) and correlated with Aβ1–42 levels (P = 0.011), p-Tau181 levels (P = 0.045), and p-Tau181/Aβ1–42 ratio (P = 0.010). Aβ1–42 and p-Tau181 levels explained 40.1% of ΔCERADs variance (B: − 6.2; SE: 13.7; P = 0.008). As demonstrated in Fig. 2, Aβ1–42 alone explained 26.8% of ΔCERADs variance (P = 0.036), whereas p-Tau181 alone accounted for 27.5% of ΔCERADs variance (P = 0.032).

Fig. 2
figure 2

Linear relationship between Aβ1–42 (pg/mL) and ΔCERADs (R2: 0.268; P = 0.036). Values were adjusted for age and sex. Distribution at P and NP. P progressors, NP non-progressors, CERADs Consortium to Establish a Registry for Alzheimer’s Disease

Discussion

In this study, 38.7% of aMCI individuals progressed to ADD, a rate which is consistent with previous studies (Schjønning Nielsen et al. 2016). CSF concentration of Aβ1–42 was significantly diminished in the P group (P < 0.001), whereas there was no difference in p-Tau181 levels between groups (P = 0.101). These results are in line with the model of dynamic pathophysiological biomarkers in AD (Jack et al. 2013), in which Aβ1–42 levels in the CSF becomes abnormal 5–10 years or before the diagnosis of dementia. A meta-analysis of fifty studies confirmed that Aβ1–42 is a good biomarker for discriminating ADD from MCI and other dementias (Ferreira et al. 2014a). Moreover, our group recently demonstrated that Aβ1–42, but not p-Tau181, is significantly associated with aMCI (Rizzi et al. 2018). Previous studies agree that p-Tau181 levels in the CSF become progressively abnormal, and have better predictive power only 0–5 years before progression to ADD (Buchhave et al. 2012). The p-Tau181/Aβ1–42 ratio also showed significant difference (P < 0.001) between patients that progressed to ADD versus those who did not. The literature supports the evidence that the combination of Aβ1–42 and p-Tau181 protein levels is a suitable measure for MCI prognosis (Ferreira et al. 2014a).

In the CSF, the Aβ1–42 concentration displayed a cutoff value of 618.5 pg/mL for aMCI cases, with greater sensitivity and specificity when compared to similar reports in the literature (Ferreira et al. 2014a; Ritchie et al. 2014). From this value, we can infer that the relative risk to develop ADD in those whose CSF Aβ1–42 levels were lower than 618.5 pg/mL was about 17.4 times higher than in those whose level was above this value. Other studies described different values associated with the relative risk of developing ADD from MCI. Michaud et al. (2015) found a hazard ratio of developing ADD among MCI patients with high-risk biomarker levels about 4 times greater than in MCI patients with low-risk biomarker values. Tondelli et al. (2015) established that Aβ1–42 was helpful to discriminate between MCI and dementia, with a relative risk of 1.01 in MCI patients. Here, we emphasize that our cutoff value was specific for amnestic MCI patients, even though our sample size was small. The interpretation of any comparative results should be made with caution due to the heterogeneity of the population that was evaluated in this study.

For the p-Tau181/Aβ1–42 ratio in CSF, the optimal cutoff value was 0.135, with less sensitivity than Aβ1–42 alone, but with 100% of specificity. The relative risk of developing ADD in those whose ratio was higher than 0.135 was 5.7 times greater than in those whose ration was lower than 0.135. Michaud and colleagues found that Aβ1–42 and p-Tau181 were the best combination among CSF biomarkers to predict the overall risk of developing ADD among MCI patients with an area under the curve of 0.77 (Michaud et al. 2015). In our study, Aβ1–42 alone (AUC: 0.961) performed better than the p-Tau181/Aβ1–42 ratio (AUC: 0.864) in predicting overall risk, but the combination of the two proteins showed a better positive predictive value (100%). Prestia et al. found that the best predictive accuracy was achieved by combinations of amyloidosis and neurodegeneration biomarkers (Prestia et al. 2015). Moreover, in a recent meta-review (Ferreira et al. 2014a), it was found that the p-Tau181/Aβ1–42 ratio was the most accurate CSF measure in predicting ADD progression from MCI. Our finding that Aβ1–42 levels alone is a superior, more accurate, measure when compared to the p-Tau181/Aβ1–42 ratio needs to be confirmed in further studies with larger samples sizes and a more heterogeneous population.

Although several different possible cutoff values were proposed, there is a lack of agreement on which would be the best discriminant value. This controversy might be due to the variability in CSF measurements between laboratories that utilize different techniques (Samtani et al. 2013; Tang et al. 2014). We decided to find the best cutoff values for our sample by utilizing the Youden test (1950), differently from other reports in which the threshold values were based on laboratory cutoff points that were based on different populations and methodologies (Ritchie et al. 2014). For instance, difficulties in predicting MCI progression to ADD could be influenced by the intrinsic heterogeneity of MCI individuals.

After 3 years of follow-up, only CERADs and MoCA tests reflected differences between the P and NP groups. These results agree with our baseline selection of exclusively aMCI cases, since episodic memory is the major cognitive domain impaired in this subtype of MCI patients (Gifford et al. 2015). There was no significant difference in the GDS test, although some patients who did not progress to ADD presented depressive symptoms after 3 years.

The CERADs test was performed at baseline and again after 3 years. As shown in Fig. 1, there was no difference between groups at baseline, but over time a significant decline in the episodic memory of those who progressed to ADD was observed. This pattern was not verified in the NP group, which performed similarly at baseline and follow-up. Additionally, ΔCERADs correlated with the two proteins alone, besides correlating with their combination ratio. This finding reinforces the early and specific involvement of episodic memory in the prodromal phase of AD. Haldenwanger et al. (2010) already emphasized the existence of a significant correlation between Aβ1–42 in the CSF and memory performance for aMCI patients, but not for non-amnesic MCI. Rami and collaborators also suggested that memory performance is first related with Aβ1–42 levels and then with t-Tau or p-Tau181 (Rami et al. 2011). Moreover, a recent study suggested that, in particular, Aβ1–42 protein is associated with a delayed memory performance CERAD (Haapalinna et al. 2016), agreeing with our results. Aβ1–42 and p-Tau181 levels explained 40.1% of ΔCERADs variance, better than each one alone.

Often, MCI represents an intermediate stage between normal cognition and dementia (Petersen et al. 2014). In 1 year, about 10–15% of MCI patients develop ADD. In 5 years, this number can raise up to 40–60% (Schjønning Nielsen et al. 2016; Tondelli et al. 2015). To our knowledge, this is the first study that evaluated the relative risk of both Aβ1–42 and p-Tau181/Aβ1–42 ratio in predicting progression from aMCI to ADD in an exclusively amnestic MCI sample, in a 3-year follow-up.

An important limitation of our study is the relatively small sample size. Another possible methodologic drawback is the absence of controls. Nonetheless, in this study, we did not aim to evaluate differences between individuals with aMCI and controls, but, instead, to analyze the role of CSF Aβ1–42 and p-Tau181 in predicting progression from aMCI to ADD. In another words, we tried to answer a common question that physicians find in everyday practice: what is the predictive value of analyzing Aβ1–42 and p-Tau181 in the CSF of individuals with aMCI?

Conclusion

Taken together, our results suggest that the CSF biomarkers analyzed herein provide predictive information about progression from aMCI to ADD. Aβ1–42 strongly predicted progression from aMCI to ADD, whereas p-Tau181 alone or its relation to Aβ1–42 was no better than Aβ1–42 alone as a progression to ADD predictor. However, the CSF p-Tau181/Aβ1–42 ratio did improve the specificity and negative predictive value as compared to CSF Aβ1–42 levels alone.