Introduction

It has been challenging to identify clinical cognitive markers that can differentiate patients with AD from those with behavioral variant frontotemporal dementia (bvFTD). There is considerable overlap in cognitive scores between these two conditions [1] and recent studies failed to find the expected executive function (bvFTD) and episodic memory (AD) asymmetry between these two dementia sub-types [2, 3], especially in the mild dementia stages [4]. Therefore, a cognitive test that could contribute to the differential diagnosis between AD and bvFTD would be valuable.

The short-term memory binding (STMB) test assesses the ability to integrate colors and shapes into unified representations and hold them temporarily during online performance [5]. Previous studies have shown that STMB is not affected by normal ageing. Relative to young adults, healthy older adults have shown no additional cost when remembering bindings as compared to remembering single features [5,6,7,8]. Moreover, STMB seems to be insensitive to the educational level of the individual [9]. Besides, the STMB is not affected by repeated testing or practice [10]. Finally, STMB has been shown to capture a specific deficit in AD patients. The test differentiated pre-clinical familial AD from controls [11], AD dementia from chronic depression in the elderly [12], and AD from non-AD dementias [13]. This evidence has led to the suggestion that the STMB may be a suitable cognitive marker for AD or pre-clinical AD [14].

There are different STMB paradigms and in clinical settings two versions have been used. One uses the change detection paradigm [6], in which participants are asked to recognize changes in colors, shapes or their combination across two consecutive screens. The other is a free recall version of the STMB test [13, 15] in which participants are required to verbally recall objects and colors individually or in combinations. The present study relied on the free recall version of the STMB test.

Parra and colleagues [15] demonstrated that, when compared with controls, AD patients showed a specific deficit in holding integrated features in verbal short-term memory. Della Sala et al. [13] reported that only AD patients showed significant deficits in recalling object-color bindings when compared to patients that suffered from other types of dementias. In these two previous studies, controls and patients performed tasks with different set sizes. This procedure was aimed at titrating the difficulty of the task to keep performance level on baseline conditions (i.e., single features) similar across groups. This procedure, however, may not be suitable to be used in clinical settings. Therefore, it remains to be investigated whether the free recall STMB test differentiates AD from controls and other dementias, when the same difficulty level is used for all groups.

The present study investigated whether free recall deficits during STMB differentiate patients with AD from patients with the bvFTD. Based on a previous study [13], we predicted worse scores among AD patients and that the free recall STMB would show high accuracy to differentiate AD from controls and bvFTD.

Methods

Participants

Patients were recruited from neurology outpatient units from the University of São Paulo (USP) and the Federal University of Minas Gerais (UFMG). We recruited 42 patients who met criteria for dementia due to probable AD based on the NIA-AA (National Institute on Aging/Alzheimer’s Association) [16]. Of these, seven were excluded: three presented moderate dementia (CDR = 2.0), one had visual deficits, one had object-naming problems, one was unable to complete the free recall test, and one received a diagnosis of Parkinson’s disease. For the bvFTD group, we recruited 30 patients who met the international diagnostic criteria for this type of dementia [17]. Of these, 11 were excluded: 8 presented moderate dementia (CDR = 2.0), 2 due to object-naming problems, and 1 patient was unable to complete the free recall test. For the control group, we recruited 39 older adults from senior centers and University of Third Age programs (10 from USP Ribeirão Preto; 22 from USP São Paulo; and 7 from the Paulista Institute of Geriatrics and Gerontology). Of these, seven were excluded: five due to low performance on cognitive tests, one participant was not fluent in Portuguese and one was using psychoactive medication with no stable doses. The final sample consisted of 35 AD patients, 18 patients with bvFTD, and 32 cognitively healthy older adults (controls). Control participants and caregivers of patients with dementia signed the informed consent form which was approved by the Ethics Committee from USP (protocol number 16627413.0.0000.0068) and UFMG (protocol number CAA 17850513.2.0000.5149).

Instruments and procedures

All patients were assessed by a neurologist and a neuropsychologist. In neurological care, patients underwent a clinical evaluation and screening tests for dementia (MMSE) [18, 19] and laboratory and neuroimaging exams. Patients completed a neuropsychological battery to assist in the dementia diagnosis. The diagnosis was made by neurologists involved in the project. After the diagnosis, patients were referred to perform the assessment with the STMB test. Controls completed the neuropsychological battery to ascertain normal cognitive status, and, in the same session, they were assessed with the STMB test.

Short-term memory binding

Of the free recall paradigm previously used to assess memory binding [5, 13] we selected two conditions, the unbound and bound features conditions. The rationale behind this selection was that the unbound condition represents a better baseline against which the binding cost could be assessed, than conditions assessing STMB for single features (i.e., color or object only). This is because the only difference between the unbound and bound condition is the need to remember the features together in the latter, that is, the binding. At the beginning of the task, participants were presented with two separate arrays—one consisting of 20 colors and the other consisting of 20 objects. These arrays consisted of the 11 colors and 11 objects used in the experiment and other 9 colors and 9 objects intermixed within the arrays as distractors. Participants were requested to name colors and objects to ensure that they had no problems naming the items used in the experiment (see section Participants above for the outcomes of this screening test).

Unbound features

In this condition, the study array consisted of three colors and three objects presented as separate features. Half of the items were colored squares and the other half were line drawings of common objects. The study array was presented for 9 s (1.5 s per feature). Participants were given the following instructions: ‘Now we will test your memory for colors and objects. You will see three colors and three objects on the screen. You should try to remember as many colors and objects as you can. After these colors and objects disappear, you will have to say aloud all the colors and objects that you have just seen’. The experimenter recorded responses using a scoring sheet.

Bound features

In this condition, the study array consisted of three objects filled with a different color each (i.e., colored objects), and was also presented for 9 s. These colored objects were constructed by randomly combining objects with colors from the two sets in a way that avoided prototypical color-object associations (e.g., red apple). During this condition, participants were asked to try to remember ‘as many colored objects as possible, that is, remember each object together with the color in which it was presented’. The participants should memorize the combination of colors and objects, for instance: “red-bed”, or “gray-shoe”. A correct response was considered only when the two features (color and object) were recalled together.

Each condition (bound and unbound) consisted of six trials with six features each (three colors and three objects). The bound and unbound conditions were counterbalanced. Figure 1 presents an illustration of this task.

Fig. 1
figure 1

Free recall short-term memory binding test in the unbound and bound conditions

Statistical analyses

To assess normality in the distribution of the data, the Shapiro–Wilk test was used. Only age followed a normal distribution in all groups. Thus, descriptive analyses comparing the clinical groups were carried out using the ANOVA test to compare age and the Kruskal–Wallis test was used for the other variables. To evaluate the effect of group, condition and their interaction, a 3 × 2 mixed model with a between-subject factor diagnostic Group (controls, AD and bvFTD) and a within-subject factor Condition (unbound versus bound) was used, and to this aim, we relied on the adjusted rank transform test [20], for nonparametric data. The effect size, as informed by partial eta-squared (ƞ 2), and power by Beta (β), were calculated in these mixed models as well. In addition, the binding cost was calculated as the percentage of loss in performance observed in the bound condition compared to the unbound condition [binding cost = 100 – 100 × (bound/unbound)]. Receiver-operating characteristic (ROC) analyses were used to examine the diagnostic accuracy of the bound STMB and binding cost measures to differentiate between the clinical groups. The area under the curve (AUC), specificity and sensitivity values were calculated. Bivariate correlations were calculated for STMB (bound condition) with age, education and MMSE variables. Significance level was set at 0.05.

Results

Sample characteristics and cognitive profiles are presented in Table 1. Comparisons showed that the three groups were equivalent in age and years of formal education. AD and bvFTD patients were in similar stages of dementia as informed by CDR. AD patients had worse cognitive performance when compared with controls (MMSE, unbound STMB, bound STMB and binding cost). Patients with bvFTD differed from controls in the unbound STMB. AD patients differed from bvFTD in the bound STMB and binding cost.

Table 1 Main demographics, functional measure and cognitive tests results statistics from the studied groups

The results of the adjusted rank transform test showed no significant main effect of test condition [F(1,82) = 0.403, p = 0.527, ƞ 2 = 0.005, β = 0.096] but there was a significant main effect of diagnostic group [F(1,82) = 27.867, p < 0.001, ƞ 2 = 0.405, β = 1.000]. In addition, there was a significant interaction between condition and group [F(1,82) = 3.366, p = 0.039, ƞ 2 = 0.076, β = 0.620]. When the three groups were compared (Fig. 2), there was a significant difference between controls and both dementia groups in the unbound condition. In the bound condition, however, there was a significant difference between controls and AD, bvFTD and AD, but no significant difference between controls and bvFTD patients.

Fig. 2
figure 2

Short-term memory binding test performance (unbound and bound conditions) with standard error bars. bvFTD behavioral variant frontotemporal dementia, AD Alzheimer’s disease. *p < 0.05 (error bars = SEM)

The results of the binding cost analyses indicated that the AD group showed a significantly higher percentage drop (26.23%) than the other groups (controls = 11.66% and bvFTD = 7.44%). There was a significant difference between controls and AD (p = 0.011) and bvFTD and AD (p = 0.009), but no difference between controls and bvFTD (p = 1.000).

ROC analyses using the bound STMB (Table 2) indicated that the highest diagnostic accuracy was obtained when the test contrasted the controls and the AD groups. Moderate accuracy was observed when the two dementia groups were contrasted. Low accuracy was observed when the STMB was used to differentiate controls from bvFTD. ROC analyses using the cost of binding variable indicated that the highest diagnostic accuracy was observed when the test contrasted AD and bvFTD groups, followed closely by the contrast of controls and AD, and it showed low accuracy when contrasting controls and bvFTD.

Table 2 Diagnostic accuracy for the bound condition and for binding cost in the STMB

The STMB test (bound condition) showed no significant correlation with age (p = 0.541) or education (p = 0.098), and showed a significant correlation coefficient of 0.454 (p < 0.001) with the MMSE, indicating it maintains a moderate association with general cognition.

Discussion

In this study, we aimed to compare cognitively healthy controls, patients with AD and bvFTD on the free recall modality of the STMB test. For the unbound condition, there was a significant difference between controls and both dementia groups (controls > bvFTD = AD). However, in the bound condition, AD patients showed significantly lower performance compared to bvFTD and controls, and there was no difference between controls and bvFTD (controls = bvFTD > AD). ROC analyses confirmed that the bound condition of the STMB test can be helpful in the differential diagnosis between AD and bvFTD. When we compared the groups in the binding cost (relative percentage drop in performance from the unbound to the bound condition), the results showed that the AD group presented the highest percentage drop when compared with the other groups. In ROC analyses, the binding cost yielded lower accuracy to distinguish the clinical groups when compared with the bound condition. Therefore, present results suggest that the condition of the SMB test with best diagnostic accuracy is that assessing free recall of bound features.

To discuss our results, we would like to consider these in the light of previous findings [13, 15]. Such earlier evidence may provide valuable insights to best interpret our current data. In Table 3 below the results from the current study were contrasted with those previously reported. These earlier studies used an easier version of the task, whereby AD patients were presented with screens of four features, whereas in the present study their screens presented six features. In addition, controls were exposed to a larger number features on the screen, in an attempt to equate task difficulty among groups. Of note, controls and AD patients in the earlier studies were similar in age to participants of the present study but they had fewer years of education. Despite methodological differences, present results are largely consistent with previous findings.

Table 3 Comparison between present results and previous studies

Compared to Parra et al. [15], our results were similar for both clinical groups, even with differences in education and with patients performing a task with more items. Compared to Della Sala et al. [13], the present study showed a smaller performance drop in AD patients from the unbound to the bound condition. This may be due to the higher difficulty of the present task and to the fact that Della Sala et al. [13] included patients in the moderate stage of AD dementia, while the present sample included only mild cases (CDR 0.5 or 1.0). Regarding FTD patients, Della Sala et al. [13] reported a performance of approximately 65% in the unbound condition and 80% in the bound condition, and, in present study, this clinical group performed approximately at 71 and 67%, respectively. That difference might be explained by the fact that Della Sala et al. [13] included the semantic variant of FTD in their group, whereas the current study included solely bvFTD. The semantic variant and bvFTD show different patterns of brain atrophy. While bvFTD patients show atrophy especially in areas of the frontal lobe, anterior cingulate and anterior insula [21, 22], semantic variant patients have anterior and inferior temporal lobe atrophy (in particular, the temporal pole) and perirhinal cortices [23,24,25].

In Della Sala et al. [13] and Parra et al. [15], a smaller set size was used for dementia patients to equate task difficulty across patients and healthy controls. It may be argued that in clinical settings this titration strategy is challenging to implement, as it is impossible to know a priori if someone is a patient or a control. To overcome this barrier, in the present study, the same set size was used for controls and patients, with six features per screen to avoid ceiling effects among controls. Increased task difficulty for patients with dementia may have led to an underestimation of the binding cost, as performance in the unbound condition may have shown a further drop due to the task difficulty, as shown in the comparison between the present study and Parra and colleagues [15]. Therefore, arrays of 4 features might be a more suitable set size if the classical dissociation (performance on unbound > performance on bound) is sought for diagnostic accuracy. The fact that increased task difficulty reduced binding drop (as performance in the unbound condition was already low) may have generated lower scores for the binding cost variable, as observed in Results.

The present findings are also in line with studies that used the change detection paradigm to assess STMB [9, 11]. Taken together, the results from these various studies indicate that short-term conjunctive memory is impaired specifically in AD, even in mild dementia stages, regardless of the nature of the stimuli used (meaningless shapes with non-nameable colors or common objects with common colors) or the retrieval function required (recognition or recall). These results have important clinical implications, as the test could be useful to differentiate AD from bvFTD in the early stages of the disease, which has proven to be quite challenging [3, 4, 26].

We acknowledge that recent studies have pursued similar aims using different memory binding paradigms. One particular type of memory binding, known as relational binding [8], refers to the recall of the association between two different items, for instance, when one recalls a name associated with a face, or information associated with a context, or even the semantic meaning of two words. In the present study, we have used a conjunctive memory binding paradigm, as the recalled feature conjunctions create unique representations (i.e., integrated objects) in memory. Relational and conjunctive memory binding are affected by AD. For instance, the Free and Cued Selective Reminding (FCSR) test [27] showed to be an accurate predictor of AD [28] and mild cognitive impairment (MCI) [29] and possibly fares better in AD and MCI diagnosis than traditional memory tests, such as the Rey Auditory Verbal Learning Test [30]. However, relational binding is affected by age [31] (but see [32]). Conjunctive binding, on the other hand, is not affected by age or education, as the correlation evidence in the present study also suggests, and showed higher diagnostic accuracy for AD when compared with the FCSR test [33]. This may be explained by the fact that relational binding is related to hippocampus activity [34,35,36], whereas conjunctive binding does not seem to be [37, 38]. In addition, hippocampal degeneration does not seem to be an ideal marker to differentiate AD from bvFTD [39] neither seems to be the earliest pathological change causing memory deficits in AD [40].

A few limitations of the study should be noted. Although greater than samples recruited for previous STBM studies, the samples in the current study were not large, restricting the generalization of the outcome. Moreover, we did not have biomarker evidence for the control group making it possible to have included in this group people with normal cognition but in a pre-clinical stage of the disease. This could have decreased the observed discrepancies between controls and the pathological groups.

In conclusion, our results indicate that the free recall version of the STMB test can be used for clinical purposes and may aid the early diagnosis of AD, differentiating this condition from other dementias and validating previous studies with this paradigm. Future studies should continue to explore the specificity of STMB deficits in AD versus other dementias and consider both conjunctive and relational paradigms [32, 41] of temporary binding. Future studies should also address the correlations between performance in STMB tests and biomarkers such as structural, functional or molecular neuroimaging, as well as CSF measures.