Introduction

Working memory maintenance and long-term retention

Researchers in the working memory (WM) field have shown increasing interest in the link between WM and long-term memory (LTM), in particular how one WM maintenance mechanism, attentional refreshing, contributes to LTM formation. Refreshing is defined as a domain-general maintenance mechanism that relies on attention to increase the activation level of information in WM and keep it accessible (Barrouillet & Camos, 2015; Camos et al., 2018; Johnson, 1992).

Attentional refreshing is mainly studied using complex span tasks. In these tasks, to-be-recalled items are presented sequentially and interspaced by a concurrent processing task (e.g., parity task, operation task). The Time-Based Resource-Sharing model (Barrouillet et al., 2004) proposes that refreshing availability is modulated by the cognitive load (CL) of the concurrent processing task, defined as the portion of the total time of the task during which attention is diverted from maintenance. CL can be manipulated by varying the number of distractors in the same period, the pace of the concurrent task, or its difficulty. Increasing CL reduces memory performance at immediate recall and, more critically, at delayed recall (Barrouillet et al., 2007; Camos & Portrat, 2015; Jarjat et al., 2018, 2020; Plancher & Barrouillet, 2013). Similarly, it has been observed that complex span tasks lead to better episodic memory than simple span tasks, supposedly by increasing the number of refreshing opportunities (the McCabe effect; Loaiza & McCabe, 2012; McCabe, 2008). Together, these results suggest that on top of short-term retention, refreshing also promotes LTM.

However, because the main purpose of WM is to allow short-term retention of information, it is not always relevant to remember this information at long term. Thus, we can wonder if intention to remember at long term modulates the effect of WM on LTM.

Role of recall relevance in working memory (WM) and long-term memory (LTM)

A meta-analysis on the effect of WM on LTM (Hartshorne & Makovski, 2019), highlighted that the effect of maintenance on long-term recall was stronger in studies that announced the upcoming delayed memory test, compared to studies using a surprise recall. This result suggests that intention to learn at long term increases the effect of WM on LTM. However, to our knowledge, the cause of this effect has not yet been investigated. One possibility could be that WM maintenance adapts to better promote LTM when aiming for long-term recall. Thus, we could expect that warning participants of a final delayed-recall test would lead to a change in the maintenance strategies deployed.

Another way to investigate intentions is to modulate participants’ motivation by manipulating the relevance of information through external rewards, such as monetary or abstract values associated with the to-be-remembered information. There is substantial evidence regarding the effect of reward on WM. Financially rewarding correct recall in visual WM increases WM capacity for high- compared to low-reward trials (Kawasaki & Yamaguchi, 2013), and varying reward value at the item-level improves recall accuracy for high reward items (Allen & Ueno, 2018; Hitch et al., 2018). Similar results were found for verbal memory presented visually (Sandry et al., 2014, 2020) or auditorily (Atkinson et al., 2021). Reward boosts in WM have been proposed to be driven, at least partially, by the use of refreshing (Atkinson et al., 2022; Sandry et al., 2014, 2020; Sandry & Ricker, 2020).

Some studies also investigated whether WM reward-driven prioritization effects transfer to subsequent LTM. In Sandry et al. (2020) study, WM trials contained high-value items presented in red that were worth more points than low-value items presented in black. A final free delayed recall indicated better long-term recall for prioritized items, similar to what was observed in WM. Jeanneret et al. (2022) investigated the long-term consequences of two retrospective prioritization procedures in WM: reward-based (i.e., retrospectively assigned abstract value) and retro-cue (i.e., an arrow pointing to the previous location of the to-be-tested item). Using object images, they found that the benefit of cued items transferred to long-term recognition, while reward effects were more ambiguous. Together, these two studies suggest that prioritization in WM can also affects the resulting LTM of the information. However, and critically, these studies did not reward explicitly long-term recall but only the WM task, and used surprise LTM tests. It is not clear, therefore, whether the effects would be the same if participants would prioritize LTM formation during WM maintenance.

The present study

In the present study, we aimed to better understand the role of intentions on the link between WM and LTM by investigating (1) if the effect of WM on LTM is modulated by intention to remember at long term and (2) whether and how WM maintenance adapts to fulfill these intentions.

In a first experiment, we manipulated intention to remember at long term by informing or not participants of the final delayed-recall test before the WM task. We manipulated the CL of the concurrent task to evaluate the use of attentional refreshing (Barrouillet et al., 2007; Camos & Portrat, 2015; Jarjat et al., 2018, 2020; Plancher & Barrouillet, 2013). As it has been proposed that the addition of distractors can modulate the effect of WM on LTM (Loaiza & McCabe, 2012; McCabe, 2008), we also included a simple span condition to be compared with complex span conditions. If higher intention to remember at long term leads to increased use of attentional refreshing, we should observe a stronger CL effect at immediate and delayed recall when participants were informed of the delayed test, as compared to when they were not (i.e., lower intention to remember at long term).

In a second experiment, we modulated intentions using monetary rewards. We varied the value of the items to create a within-trial prioritization (i.e., high and low rewards associated with the to-be-encoded material, as in Hitch et al., 2018; Sandry et al., 2020), and the CL of the concurrent task as in Experiment 1 (excluding the simple span condition). Additionally, we manipulated which type of recall tests (i.e., immediate or delayed) yielded rewards to induce prioritization of either short- or long-term retention. Following previous literature, we anticipated that high-value items would be better recalled than low-value items at both immediate and delayed recall. In line with the hypotheses of Experiment 1, we expected that rewarding delayed recalls would lead to a stronger CL effect on immediate and delayed recalls than rewarding immediate recalls. We also predicted that the reward effect would be stronger on the rewarded test, immediate or delayed recall, compared to the non-rewarded test.

Experiment 1

Method

Participants

Eighty young adults (58 females) between 18 and 30 years old (= 23.51, SD = 3.40 years) were recruited for this experiment. All participants had to be aged between 18 and 30 years old, be native French speakers, have normal or corrected-to-normal vision, and did not report any history of neurological or reading problems. All participants provided written informed consent before taking part in the study.

Material and design

The experiment was developed using jsPsych (de Leeuw, 2015) and was conducted online on a JATOS server (Lange et al., 2015). The experimental material consisted of 240 words that were selected from the Lexique3 database (New et al., 2001, 2004). Words were high frequency (= 138.21, SD = 126.50, in written occurrences per million) singular common nouns, four to eight letters, and one to two syllables long. The concurrent parity task consisted of 32 sequences of 15 digits ranging from 1 to 9, with a pseudorandom presentation order.

At the end of the experiment, participants were presented a strategies form. This form asked participants to indicate the percentage of use during the WM task among the proposed strategies, the total having to reach 100%. Each strategy was described by a short sentenceFootnote 1, and an additional fillable field (“other”) could be used to report the use of non-proposed strategies. The proposed strategies were verbal rehearsal, attentional refreshing, stories, mental images, places (method of loci), mental line, and visual scenes. Finally, participants filled a motivation evaluation scale (Intrinsic Motivation Inventory; Ryan, 1982) composed of four dimensions: perceived choice, perceived competence, interest/enjoyment, pressure/tension.

We manipulated the maintenance condition of the task (simple span vs. complex span low CL vs. complex span high CL) as a within-subject variable and the delayed recall awareness (aware vs. unaware) as a between-subject variable. Two versions of the experiment were used to counterbalance words, digits, and CL condition order.

Procedure

The experiment started with the initial instructions describing the various phases of the experiment and the WM task. An additional sentence indicating the presence of a final delayed recall was added for half the participants (aware group) but not the other half (unaware group), and this recall was reminded in the halfway break screen for aware participants only. The experimental session then started with a training phase that was followed by the experimental task, and ended with the strategies form, the motivation evaluation scale and a final form asking for participants’ personal information.

The experimental task was a simple or complex span task (Fig. 1). It consisted of 48 trials of five memoranda. Each trial started with a fixation cross for 1,000 ms. Words were presented sequentially for 1,500 ms each. In complex span trials, words were interspaced by a concurrent parity task consisting of three digits that participants had to judge as even or odd (“m” key for even and “q” for odd on an Azerty keyboard), that were presented for 600 ms (high CL) or 1,200 ms (low CL). In simple span trials, words were interspaced by a 3,000 ms black screen. Each word and digit was followed by an interstimulus interval of 150 ms. The time interval between two words in the simple span trials was selected to approximately match the available free time in the low CL condition. To estimate the time required to process a digit, we referred to previous literature that used a similar concurrent parity task within a complex span design. These studies reported presentation times ranging from 1,200 ms to 1,700 ms per digit, resulting in processing times of 621 ms and 615 ms in the two experiments conducted by Labaronne et al. (2023), and 672 ms and 625 ms for adults in the study by Rosselet-Jordan et al. (2022). Based on this information, we estimated the digit processing time to be approximately 650 ms. Considering this processing time, the free time between two words in the low CL condition was estimated to be 3,150 ms (subtracting three times 650 ms, to account for processing, to the total duration of 5,100 ms). Consequently, we used this duration for the simple span task (150 ms of interstimulus interval after the word, followed by a 3,000-ms blank screen). Participants were not asked to read the words or digits aloud.

Fig. 1
figure 1

Illustration of maintenance conditions in the task used in Experiment 1

At the end of a trial, a recall screen prompted participants to recall the five presented words in their original order by typing in the five response boxes, without time limit. The recall was confirmed by clicking on a submit button and was followed by a message asking participants to place again their fingers on the m and q keys to prepare for the coming trial. The next trial was started by pressing the spacebar. Following the last trial, a 1-min distracting task of multiplication problems (e.g., 4 × 5 = 20?) was presented. After this distracting task, participants performed a delayed recall, in which they were invited to recall as many words as possible in any order and without time limit using the keyboard. This recall was confirmed by clicking on a submit button, followed by the motivation evaluation scale and the maintenance strategies form. A final form asked participants for their personal information, after what the experiment ended.

The practice was constituted of three phases. The first phase comprised 54 practice trials of the parity task without time limit. Accuracy was calculated, and the task had to be performed again if it did not reach 70%. The second phase corresponded to ten examples of the arithmetic problems used in the distracting task before the delayed-recall test. The third phase was similar to the experimental task, consisting of six trials so that every condition of CL was presented twice along with two simple span trials. Performance on the secondary parity task was monitored during the entire experiment. If their correct-response rate reached the lower limit of 70%, participants were reminded the importance of paying attention to this task and were warned that they had to increase their performance to prevent being sent back to the practice phase. A new performance check was done three trials later, and participants were indeed required to complete again the parity practice phase if their performance had not increased above 70%.

Statistical analyses

To ensure active processing of the concurrent parity task, participants having less than 70% of correct responses were excluded from analyses (final n = 74) as done in some previous studies (e.g., Camos et al., 2011, 2019; Labaronne et al., 2023).

Analyses were conducted using R (R Core Team, 2019) and RStudio (RStudio Team, 2021) with BayesFactor (Morey & Rouder, 2015) and bayestestR (Makowski et al., 2019) packages. Bayesian analyses of variance were conducted on immediate serial and delayed recall scorings, using maintenance condition (simple span vs. complex span low CL vs. complex span high CL) and delayed recall awareness (aware vs. unaware) as predictors and subjects as a random factor. Bayesian models were compared to a null model including only a random effect of subjects. The likelihood of each effect was assessed using BFinclusion and BFexclusion calculated across matched models (Mathôt, 2017), reflecting the proofs in favor or against an effect. Resulting BFs were interpreted using the following classification (Lee & Wagenmakers, 2013, adapted from Jeffreys, 1961): BF at 1 shows no evidence, anecdotal evidence between 1 and 3, moderate evidence between 3 and 10, strong evidence between 10 and 30, very strong evidence between 30 and 100 and extreme evidence for BF > 100. Therefore, BFs below 3 were interpreted as inconclusive. Interactions supported by at least moderate evidence were decomposed using Bayesian t-tests.

By design, strategies values were interdependent, as response to a strategy conditioned possible values to the other strategies, which made it unsuited for statistical analyses. Thus, we observed descriptively if the strategies data suggested any difference between aware and unaware participants.

Additional exploratory analyses that were conducted are included as Appendices. First, to evaluate if the effect of intentions could depend on individuals' WM capacity, the median of the simple span performance was used to distinguish high- and low-performance participants but it did not affect the other factors (Appendix 1). To explore a possible role of motivation level, we tested if differences in the reported motivation could be found between the two groups or if the scores in some dimensions correlated with memory performance at short- or long-term. We mostly found evidence supporting the null or inconclusive results (Appendix 2).

Results and discussion

Main analysis

At immediate recall (Fig. 2), the likeliest model included only the main effect of maintenance condition (BF10 = 1.62e+15). There was extreme evidence in favor of the effect of maintenance condition (BFinclusion = 1.63e+15), with better recall in simple span trials (M = 88.26, SD = 18.36) than under low (M = 80.88, SD = 18.10) and high (M = 76.42, SD = 19.29) CL. Evidence for the main effect of awareness was inconclusive (BFexclusion = 1.44), and there was strong evidence against the maintenance condition × awareness interaction (BFexclusion = 10.63). Thus, we did not find evidence that awareness of the upcoming delayed-recall test affected overall immediate performance or interacted with maintenance condition at short-term. A complementary analysis conducted on a lenient immediate recall scoring, that did not consider items’ serial position, led to similar findings (Appendix 3).

Fig. 2
figure 2

Mean correct recall percentage in Experiment 1 by maintenance condition of the task (simple span vs. complex span low vs. complex span high) and delayed recall awareness group (aware vs. unaware). The error bars refer to the standard error

At delayed recall, the likeliest model included the main effects of maintenance condition and awareness, and the maintenance condition × awareness interaction (BF10 = 675.80). There was extreme evidence for the effect of maintenance condition (BFinclusion = 454.58), with better recall in simple span (M = 15.08, SD = 9.43) and low CL trials (M = 14.73, SD = 10.67) compared to high CL trials (M = 11.54, SD = 8.75). Evidence for the main effect of awareness (BFexclusion = 1.06) and the maintenance condition × awareness interaction (BFinclusion = 1.58) was inconclusive. Given it was included in the best model and its importance for our theoretical hypothesis, we conducted post hoc Bayesian t-tests on the maintenance condition × awareness interaction. In the unaware group, the difference between simple span and low CL was inconclusive (BF10 = 1.09). However, evidence supported a difference between simple span and high CL (BF10 = 3.12) and an absence of difference between low and high CL (BF01 = 3.47). In the aware group, evidence pointed against a difference between simple span and low CL (BF01 = 3.26) but supported a difference between simple span and high CL (BF10 = 10.65) and between low and high CL (BF10 = 362.66). We did not replicate the classical McCabe effect (Loaiza & McCabe, 2012; McCabe, 2008) by which items studied in complex span trials are better recalled at long term than those studied in simple span trials. This difference may be caused by the addition of free time between memoranda in simple span trials (e.g., Souza & Oberauer, 2017). As attention is not displaced by distractors in simple span, it may also be that this free time was used for short-term consolidation, which has been shown to benefit both WM and LTM and is independent from maintenance mechanisms (Cotton & Ricker, 2021; Labaronne et al., 2023). It is thus difficult to clearly conclude on the simple span results. Interestingly, we replicated the delayed CL effect in complex span tasks (Camos & Portrat, 2015; Jarjat et al., 2018, 2020) in the aware group (+5.23), but not in the unaware group (+0.88). We should note that we did not control for whether participants predicted the LTM test. This could explain the inconclusive evidence regarding awareness and the interaction between awareness and maintenance condition, however results of t-tests on the CL effect at delayed recall goes against this hypothesis. Future studies should examine this possibility more carefully.

Strategies

Descriptive observation of the data did not suggest a clearly distinct pattern of strategies between the two groups, at best only minor differences (Fig. 3).

Fig. 3
figure 3

Mean percentage of strategy use in Experiment 1 depending on delayed recall awareness. Error bars refer to the standard error

Results summary

Surprisingly, the results of this experiment suggest that the knowledge of an upcoming delayed recall does not visibly affect WM maintenance, neither in recall performance nor in maintenance strategies deployed, but still modulates the impact of complex span task CL on LTM.

Experiment 2

A second experiment was conducted to test our theoretical hypothesis using different manipulations. Because it has been shown that rewards in WM also affect subsequent LTM (Sandry et al., 2020), we investigated if the long-term reward effect could be modulated by reward-driven intentions regarding long-term retention. We manipulated monetary rewards within-trials using high- and low-value memoranda as in previous literature (e.g., Hitch et al., 2018; Sandry et al., 2020), and intentions were manipulated by varying whether rewards were given for performance in immediate or delayed memory tests.

Method

Participants

As participants were financially compensated, the sample size was reduced compared to Experiment 1 due to resources limitations. We recruited 60 new participants (43 females) aged between 18 and 28 years old (M = 21.92, SD = 2.23) for this experiment. The recruitment criteria were the same as in Experiment 1. Although participants believed that their reward would depend on their memory performance during the task, all those who completed the experiment received 10€ for their participation.

Material and design

We randomly selected 140 items from Experiment 1 material. Given the absence of effect of intentions on motivation levels when manipulating delayed recall awareness in Experiment 1 (Appendix 2) or rewards in previous literature (Sandry & Ricker, 2020), the motivation evaluation scale was removed for this experiment. Additionally, to prevent the response to one strategy from being dependent on the responses to the other strategies, so we could use statistical analyses, the strategies form now asked for a percentage of use for each proposed strategy independently.

Procedure

The same procedure as in Experiment 1 was used, excepted for the following points. The experiment consisted of two blocks of 14 trials, with a delayed recall at the end of each block. At the beginning of the experiment, all participants were informed of the presence of delayed-recall tests. Participants were split into two groups, who were told that their reward (i.e., the final amount of money won) depended on their memory performance in either immediate or delayed-recall tests. The second or third memoranda in each trial, depending on the block, was a high-value item that rewarded 0.16€ upon correct recall while the rest of the items were rewarded 0.05€. The high-value item was presented underlined to ensure correct identificationFootnote 2. The high-value item’s serial position in the first block was counterbalanced between participants (2 or 3) and changed between blocks. These instructions were reminded at the end of the training session and after the first block. After a rewarded test (immediate or delayed recall, depending on the group), a screen displayed the amount rewarded for this recall and the total gain. Given the conclusions of Experiment 1, the simple span condition was removed.

In summary, we manipulated the CL of the concurrent parity task (low vs. high) and the item reward value (high vs. low) as within-subject variables, and rewarded test (immediate recall vs. delayed recall) as a between-subject variable.

Statistical analyses

Analyses were conducted similarly to Experiment 1. Participants having less than 70% correct responses to the concurrent parity task were discarded from following analyses (final n = 52).

Bayesian analyses of variance (bANOVA) were conducted on mean correct recall percentage at both immediate and delayed recall, using CL (low vs. high), reward value (low vs. high) and rewarded test (immediate recall vs. delayed recall) as predictive variables, and subject as a random factor. Regarding the reward value factor, we computed the mean percentage of high-value items correctly recalled and the percentage of low-value correctly recalled, independently of their position in the trials.

For strategies data, the change made in this experiment removed the responses’ interdependence and allowed us to use statistical analyses. We conducted a Bayesian analysis of variance on the percentage of use of strategies, with the rewarded test and the strategies as predictors, and subject as a random variable.

Results and discussion

Main analysis

At immediate recall (Fig. 4), the likeliest model included the main effects of CL and reward value (BF10 = 2.98e+09). Extreme evidence supported an effect of CL (BFinclusion = 8.42e+04), with better recall under low (M = 88.85, SD = 12.26) than high (M = 83.21, SD = 14.14) CL. Additionally, we found extreme evidence for a main effect of reward value (BFinclusion = 3.31e+05), with a better recall for high-value items (M = 90.66, SD = 9.17) than for low-value items (M = 84.87, SD = 13.97), replicating the reward effect in verbal WM (Atkinson et al., 2021; Sandry et al., 2014, 2020). Evidence regarding the main effect of rewarded test was inconclusive (BFexclusion = 2.81). There was moderate evidence against the CL × reward value interaction (BFexclusion = 4.93). As the effect of rewards was not modulated by the CL, thought to determine refreshing availability, this does not support the previous proposition (Atkinson et al., 2022; Sandry et al., 2014, 2020; Sandry & Ricker, 2020) that the effect of reward can be attributed to refreshing. CL did not interact with rewarded test either (BFexclusion = 3.45), suggesting, consistent with Experiment 1, that intentions did not modulate the use of attentional refreshing. There was moderate evidence against the rewarded test × reward value interaction (BFexclusion = 3.28) and the three-way interaction (BFexclusion = 3.38). The analysis using lenient scoring led to similar findings (Appendix 3).

Fig. 4
figure 4

Mean correct recall percentage in Experiment 2 by cognitive load of the concurrent task (low vs. high), item reward value (low vs. high) and rewarded test (immediate recall vs. delayed recall). The error bars refer to the standard error

At delayed recall, the best model included the main effects of CL, reward value and rewarded test, and the rewarded test × reward value interaction (BF10 = 2.54e+12). There was extreme evidence for an effect of CL (BFinclusion = 5.08e+04), with better recall under low (M = 22.42, SD = 14.24) than high (M = 16.37, SD = 10.91) CL. Moderate evidence supported the main effect of rewarded test (BFinclusion = 3.15), with better recall performance when the delayed recall was rewarded (M = 22.50, SD = 12.73) than when immediate recall was rewarded (M = 16.29, SD = 10.38). Consistent with the study of Sandry et al. (2020), results indicated extreme evidence for the main effect of reward value (BFinclusion = 1.00e+07). As with immediate recall scoring, there was moderate evidence against the CL × reward value (BFexclusion = 4.65) and CL × rewarded test (BFexclusion = 3.66) interactions. Crucially, the effect of reward value was stronger in the group in which delayed recall was rewarded (+12.23) compared to the group in which immediate recall was rewarded (+4.02), as suggested by the very strong evidence in favor of the reward value × rewarded test interactionFootnote 3 (BFinclusion = 36.97). Post hoc comparisons indicated inconclusive evidence regarding the effect of rewards in the immediate rewards group (BF01 = 1.82), and moderate evidence in the delayed rewards group (BF10 = 5.64). Evidence regarding the three-way interaction was inconclusive (BFexclusion = 1.97).

Strategies analysis

The best model included only the main effect of strategies (BF10 = 2.95e+25). We found moderate evidence against an effect of the rewarded test (BFexclusion = 6.76) and very strong evidence against the strategy × rewarded test interaction (BFexclusion = 66.67). As can be seen in Fig. 5, these results do not support the use of distinct maintenance strategies between the two groups.

Fig. 5
figure 5

Mean percentage of strategy use in Experiment 2 depending on the rewarded test. Error bars refer to the standard error

Results summary

We found no evidence that manipulating the rewarded test (i.e., immediate or delayed recall) affected short-term recall. However, participants that were rewarded in the delayed-recall tests showed better recall performance and a stronger reward effect in delayed recalls. This experiment suggests that higher intention to remember at long term increases the long-term effect of rewards without modulating short-term recall.

General discussion

Our first objective was to investigate if intentions modulate the effect of WM on LTM. In Experiment 1, in line with the literature (Hartshorne & Makovski, 2019), we observed that awareness of the upcoming delayed recall increased the CL effect on delayed-recall performance, however the bANOVA gave inconclusive evidence and this effect was supported only by t-tests. In Experiment 2, we found that the long-term benefit of rewards was increased when rewarding delayed recall rather than immediate recall. Together, these first results suggest that intentions regarding long-term remembering could modulate the long-term recall of items presented in a WM task. Future studies will be needed to explore this question in more detail.

Our second objective was to observe if and how WM maintenance adapts when aiming for long-term recall. While we replicated the effect of CL on immediate recall (Barrouillet et al., 2007; Camos & Portrat, 2015; Jarjat et al., 2018, 2020; Plancher & Barrouillet, 2013) in both experiments, we failed to find evidence that this effect was modulated by intentions manipulation, would it be through delayed recall awareness or the rewarded test. Additionally, in both experiments, we did not find evidence that the maintenance strategies used differed. In sum, our results do not suggest that intention to remember at long term modulated the maintenance mechanisms used or their efficiency at short-term.

While our study provides new evidence that WM manipulations modulate subsequent LTM more strongly with higher intention to remember at long term, the cause of this effect remains to be identified. We anticipated that trying to learn at long term could lead to increased use of refreshing. The increased CL effect with awareness in the delayed recall of Experiment 1 could partly support this idea. However, refreshing is more sensitive to concurrent attentional demand than other low-attentional mechanisms such as verbal rehearsal (Camos, 2015; Camos et al., 2011; Camos & Barrouillet, 2014; Mora & Camos, 2013). Thus, intentions not modulating the effect of CL on immediate recall in both experiments makes this interpretation implausible. Alternatively, conditional use of elaborative strategies could be proposed to explain our results, as previous literature suggested that elaboration improves delayed recall without impacting immediate recall (Bartsch et al., 2018). However, this explanation requires to consider that their use is concurrent to other maintenance mechanisms affecting short-term recall to explain the short-term CL effect and, as we saw no change in the reported strategies, that participants were unable to explicitly report the use of such strategies. Similarly, it has been reported that a change in strategies was not able to explain the reward effect (Sandry & Ricker, 2020). Thus, the proposition of a change in the maintenance strategies used does not seem promising.

Considering our results and the literature, we propose the hypothesis that intentions induce a combined change in encoding and maintenance. In a long-term recognition task, a concurrent random number generation during encoding, contrary to an articulatory suppression task, removes the reward effect on delayed recognition (Elliott & Brewer, 2019). Coherently, imaging studies reported a preparatory phase before the presentation of items that can predict their successful encoding (Adcock et al., 2006; Addante et al., 2011, 2015), and that intention to remember modulates frontal-midline’s low beta band power during this phase (Schneider & Rose, 2016). Thus, it appears that intention to remember at long term could alter encoding, leading to a better long-term recall. However, evidence also suggest a role of WM maintenance. In Experiment 1, we found that CL, a manipulation constricted to WM maintenance, affected delayed recall differently depending on delayed recall awareness. In Experiment 2, in which all participants were aware of the delayed test, CL affected the delayed recall of all participants. Accordingly, oscillatory activity during maintenance can predict both immediate and delayed-recall performance (Khader et al., 2010). Therefore, one possible explanation could be that intention to remember at long term does not modulate the kind and amount of maintenance mechanisms engaged. Instead, it could lead to a change in encoding processes, for instance through the nature of the encoded representation, that would determine to what extent WM maintenance mechanisms affect long-term recall. Further studies are needed to clarify how intentions modulate the effect of WM on LTM, which could bring a better understanding of the links between WM and LTM.