Introduction

Cognitive training represents an interesting opportunity to enhance cognitive abilities for people of every age. The aim of cognitive training is to transfer cognitive gains to untrained abilities, with the ultimate goal of improving a person’s daily functioning. However, improvements were often limited to the trained task (learning effect). It was rare to see transfer to a different task measuring either the same cognitive process (near transfer) or another cognitive process (far transfer; Greenwood & Parasuraman, 2016; Simons et al., 2016). The lack of effectiveness of existing programs can be explained by the poor understanding of the mechanisms underlying far transfer (Greenwood & Parasuraman, 2016). Research on far transfer mechanisms became one of the top priorities in cognitive training domain, because it could allow the development of training programs that would promote long-term benefits of daily functioning for the population (Bherer, 2015; Foroughi et al., 2016).

Based on a review of the neuroscientific literature, Greenwood and Parasuraman (2016) proposed that the training of distraction suppression is central in producing transfer to fluid intelligence. Cowan (2001) affirmed that a good ability to suppress distractors facilitates the processing of working memory by protecting the relevant content from such irrelevant content. Greenwood and Parasuraman found a pattern of brain activation among several brain imaging studies that supports Cowan’s theory. Indeed, analysis of neuroscientific and cognitive data suggested that suppression of distractors has the potential to generate benefits on working memory and fluid intelligence tasks. Specifically, Greenwood and Parasuraman argued that a training task that solicitates distraction suppression would alter the activation patterns of the dorsal attention network (intraparietal sulcus, superior parietal lobule and frontal eye field) and its variable relationship with default mode network. The authors discussed that mastering the training task can result in a dynamic reconfiguration of coherence in activity between specific brain networks involved in attention to external stimuli. These modifications promote transfer to other interconnected networks that involve additional cognitive processes, like fluid intelligence.

Greenwood and Parasuraman (2016) suggested that N-back training should be effective by soliciting distraction suppression and by triggering the activation of the dorsal attention network (Conway et al., 2001; Engle, 2002; Halford et al., 2007). However, N-back training has been the most studied in the literature (Linares et al., 2019), and there is no convincing evidence that such training can produce far transfer (Colom et al., 2013; Dougherty et al., 2016). Indeed, the multiple recent meta-analyses and literature reviews focusing on N-back training obtained a small or null effect size for the transfer measured on fluid intelligence, control of attention and other executive functions (Au et al., 2015; Brehmer et al., 2012; Karbach & Verhaeghen, 2014; Melby-Lervåg & Hulme, 2013; Melby-Lervåg et al., 2016; Morrison & Chein, 2011; Schwaighofer et al., 2015; Soveri et al., 2017; Von Bastian & Oberauer, 2014). Consistently, several researchers expressed their skepticism for the potential of N-back training to generate far transfer (Dahlin et al., 2008; Harrison et al., 2013; Linares et al., 2018, 2019; Redick et al., 2013) and some argued that training with the N-back task cannot ensure far transfer, according to the data reviewed (Colom et al., 2013; Dougherty et al., 2016). The inconsistencies between Greenwood and Parasuraman’s hypothesis and empirical evidence on N-back training call into question the far transfer mechanism proposed by the authors and highlight the need to test empirically the role of distraction suppression in cognitive training effectiveness using another approach.

One type of training that had more success in producing far transfer is action video game training (AVG; Bavelier et al., 2012; Cohen et al., 2007). Murphy et al. (2016) used the load theory of attention (Lavie et al., 2004) to explain the benefits observed in AVG. They argued that perceptual capacity can be increased and that the load theory can be an interesting starting point for the development of an effective cognitive training. The load theory presents selective attention as arising from the involvement of two distinct distraction suppression mechanisms, which are engaged according to the level and type of information load involved in the task. When perceptual load is high, the perceptual capacity becomes saturated due to the processing of multiple stimuli. In such a context, a perceptual selection mechanism acts as a filter that excludes irrelevant stimuli based on physical distinctions such as location, colors, dimensions, or combination of features (Lavie et al., 2004). This is a rather passive mechanism since distractors are not treated simply because there is not enough capacity to process them. When perceptual load is low, it is rather a cognitive control mechanism that actively rejects distractors (e.g., by inhibiting them), preventing them from undergoing more advanced processing to the detriment of priority information. Since it requires some mental effort, this form of selectivity is considered to be active.

In light of the load theory, the effectiveness of AVG-based training could lie in its complex environment, which can provide sometimes low, sometimes high perceptual load, thus engaging both distraction suppression mechanisms (Murphy et al., 2016). These authors proposed that this type of training could increase early filtering of irrelevant stimuli and control of attention. In fact, they claimed that far transfer could be favored by a training program promoting both low- and high-load contexts, but also encouraged further investigations to better understand this possible impact. We designed the present study in that perspective. In fact, we consider that far transfer to working memory tasks would be possible, due to the fact that both mechanisms are crucial for the protection of information in this cognitive process (Gaspar et al., 2016; Greenwood & Parasuraman, 2016; Vissers et al., 2017). The ineffectiveness of the N-back training, which involves presenting a single stimulus at a time, could be attributable to the fact that it offers low perceptual load only, hence stimulating only one of the distraction suppression mechanisms, namely cognitive control.

The present study

The goal of the present study was to determine whether the two putative mechanisms of distraction suppression put forward by the load theory play a crucial role in far transfer. To do so, we drew on the response competition (RC) task used to test the load theory to implement a cognitive training paradigm that allows training both perceptual selection and cognitive control and compare its effectiveness in inducing far transfer to that of the classical N-back training that engages only cognitive control.

The RC task developed by Lavie and Cox (1997) was used as a training task. This task consisted of presenting a circle formed of six letters with a distractor at the periphery or at the centre of the screen, as shown in Fig. 1. The circle was made up of a target letter, which the participants must find and identify, and five non-target letters. To stimulate both distraction suppression mechanisms, perceptual load was manipulated in the training task to provide a context of either high or low load. Perceptual load was controlled by changing the shape of the non-target letters in the circle. Low perceptual load display was composed of non-target letters that are homogeneous and dissimilar to the target letter in the circle of letters (the letters O with a target letter that is either X or N). According to Lavie et al. (2004), such a context does not saturate perceptual capacity, which favors perceptual processing of all letters presented on the screen, including distractors. These must then be actively inhibited by the cognitive control mechanism to prevent them from undergoing further processing to the detriment of the target letter. High load display was composed of various non-target letters similar to the target letter (the letters V, W, H, K, and Z with a target letter that is either X or N). This context prevented the processing of several stimuli, because the perceptual capacity quickly became saturated. Non-target letters were then discarded at an early stage in information processing by the perceptual selection mechanism that excluded irrelevant stimuli based on physical distinctions (Lavie et al., 2004). To our knowledge, such a cognitive training program based on the RC task has never been implemented in the scientific literature. Nevertheless, this was not the first cognitive training to adaptively manipulate distractor interference. For instance, Mishra et al., (2014, 2016) found far transfer as well as sensory plasticity following a training procedure whose mechanics specifically challenged participants to discriminate targets among progressively more interfering distractors. Such findings are promising as they provide evidence that training approaches requiring the suppression of progressively more challenging distractors, such as the present RC training program, can successfully produce transfer effects.

Fig. 1
figure 1

Example of low-load and high-load displays in the response-competition task

The effectiveness of RC and N-bask trainings in producing far transfer was assessed relative to an active control training focusing on perceptual discrimination. Learning and transfer effects for each of the three training programs were evaluated by contrasting the same measures taken before (pretest) and after (posttest) the completion of the training sessions (see Fig. 2). These pretest–posttest experimental tasks were carefully selected to measure either selective attention or working memory. The two tasks measuring selective attention, the ability to process goal-relevant information while ignoring or inhibiting irrelevant information (Murphy et al., 2016), were the RC task and the auditory distraction task. The RC task involved searching for a target letter in the presence of visual distractors (e.g., Lavie et al., 2004), whereas the auditory distraction task required recalling the missing digit from a predefined series of digits in the presence of auditory distractors to ignore (e.g., Vachon et al., 2017). The two tasks assessing working memory, as the ability to “simultaneously store and manipulate information” (Greenwood & Parasuraman, 2016, p. 743), were the N-back task and the operation span (OSPAN) task. The N-back task consisted of recalling a target letter while inhibiting other target letters (e.g., Heinzel et al., 2014; Jaeggi et al., 2010), whereas the OSPAN task involved the serial recall of multiple target letters interspersed with a mathematical operation to be solved (e.g., Kane et al., 2007). As shown in Table 1, learning effects were evaluated using the task employed in the training phase, i.e. the RC task for RC training and the N-back task for N-back training. The auditory distraction task served as a measure of near transfer for RC training and far transfer for N-back training. Conversely, the OPSAN task was used to evaluate near transfer for N-back training and far transfer for RC training.

Fig. 2
figure 2

Schematic representation of the pretest–posttest design employed in the present study. Participants were assigned to one of the three training programs: response competition (RC), N-back, or control. Blue shapes represent selective-attention tasks whereas pink shapes represent working-memory tasks

Table 1 Targeted tasks to measure learning, near transfer, and far transfer effects for response-competition (RC) and N-back trainings

Our hypothesis that training both perceptual selection and cognitive control should promote far transfer better than training a single distraction suppression mechanism predicts that RC training should produce greater far transfer effects than N-back training. More specifically, it is expected that RC training will induce significant posttest improvements in performance on the two working-memory tasks (i.e., N-back and OSPAN) relative to control training. Conversely, N-back training should produce little, if any, posttest improvements in performance on the two selective-attention tasks (i.e., RC and auditory distraction tasks) relative to control training, consistent with the existing literature (Colom et al., 2013; Dougherty et al., 2016).

Method

Participants

Seventy-three volunteers between the age of 18 and 40 years were recruited through emails and posters on the campus of Université Laval. Each volunteer reported normal or corrected-to-normal vision, normal hearing and no history of neurological disorders. The study was approved by the ethics committee of Université Laval and all participants provided written consent prior to the experiment. The financial compensation offered was $60 ($5 for each of the eight training sessions and $10 for each of the pretest and posttest sessions). Additionally, a contest participation coupon was offered for each completed session for a chance to win a $100 gift card.

Seven participants dropped out of the training program due to lack of time and six others were excluded due to data recording problems. Sixty were, therefore, included in the analysis of the present study. The final sample was composed of 90.40% undergraduate, graduate or postgraduate students at Université Laval. A total of 20 subjects were randomly assigned to each of the three trainings (Control, RC and N-back). As shown in Table 2, each of the training groups included in the analyses presented similar age and gender characteristics. Given a sample size of n = 20, α = 0.05, a training effect (i.e. a pretest–posttest difference) of size η2p = 0.08 could be detected with a statistical power of 0.80 according to G-Power software (Faul et al., 2007, 2009).

Table 2 Sample characteristics (n = 20 for each group)

Design

The experimental design is illustrated in Fig. 2. The present study was conducted in laboratory and included a 60-min pretest phase followed by a training phase and a 60-min posttest phase. In both pretest and posttest, four experimental tasks were administered to assess selective attention (RC task and auditory distraction task; shown in pink in Fig. 2) and working memory (N-back task and OSPAN task; shown in blue in Fig. 2). During the training phase, participants performed one of the three training programs (N-back, RC, or Control). Training consisted in eight 30-min sessions that had to be completed within 4–8 weeks at a frequency of one or two sessions per week. Such flexibility in the training schedule was offered to promote study adherence and to avoid the loss of participants as much as possible due to time constraints.

Material

The presentation of instructions, stimuli and recording of responses for the N-back task, the N-back training, the RC task as well as the RC training were developed using Python. In addition, the presentation of instructions, stimuli and recording of responses for OSPAN task and auditory distraction task were programmed using E-Prime 2.0 software. For all tasks, visual stimuli were presented on a computer screen while auditory stimuli were presented through a headset. The experiment took place in an isolated dimly lit experimental room.

RC training

RC training was based on the RC task proposed in Lavie and Cox (1997) and was developed using different variations from different studies (Beck & Lavie, 2005; Forster & Lavie, 2008; Gupta & Srinivasan, 2010; Jenkins et al., 2005; Lavie & Cox, 1997; Lavie & De Fockert, 2003; Lavie et al., 2004; Yeshurun & Marciano, 2013). The RC task consisted of a visual search task where participants must indicate which of the two target letters (N or X) is present by pressing a key (respectively, 0 or 2) on the keyboard as quickly as possible. The stimulus set was composed of a target letter and five non-target letters arranged in a circle at an equal distance from the centre of the screen (visual angle of 0.48° horizontally and 0.67° vertically at a distance of 60 cm; see Fig. 1). The target letter had an equal chance of being an X or N and appearing in any of the six positions of the circle. In addition, a distractor was positioned in one of the three following positions: on the periphery to the left or to the right of the circle of letters (visual angle of 0.57° horizontally and 0° vertically at a distance of 60 cm) or exactly at the centre of the circle. All stimuli (i.e., the fixation cross and the letters) were presented in black Times New Roman font on a white background.

Each trial began with a fixation cross lasting 600 ms, followed by the stimulus set which was presented for 100 ms or 150 ms. The screen then turned blank and participants had a maximum of 3000 ms to respond. When participants answered correctly, no feedback was given. When participants made a mistake or did not respond within 3000 ms, a pure tone was presented for 95 ms. The subsequent trial started automatically. A training session consisted of 12 practice trials followed by 15 blocks of 48 trials. An optional 5-min break was offered to participants after completing seven blocks to avoid cognitive fatigue.

Perceptual load varied within a block following the sequence HLLHHLLH, where “H” indicates a high-load trial and “L” indicates a low-load trial. In a counterbalanced order, half of the participants started with a high-load trial and the other half started with a low-load trial. In low perceptual load, the non-target letters were all Os. In high-load trials, the non-target letters corresponded to the letters H, K, V, W and Z, arranged randomly at the 5 remaining positions of the circle. The distractor could be congruent (either X for a target letter X and N for a target letter N), incongruent (N for a target letter X and X for a target letter N) and neutral (T for a target letter X or N, a face for a target letter X or N, and a cartoon character for a target letter X or N). Combinations of target letter identity, target letter position, distractor identity, and distractor position were equiprobable in each block of trials.

RC training had 18 difficulty levels created from variations of the RC task. As shown in Table 3, task difficulty was manipulated by varying the presentation time of the stimulus set (100 or 150 ms), as well as the perceptual load provided by the number of angular non-target letters in the high-load trials (K, H, V, Z, W), the nature (letter, face or cartoon) and position (periphery or fixation) of the distractor, and the presence of a mask following the stimulus set. The mask replaced each stimulus (target letter, non-target letters and distractor) by a “#”, which reduced the visibility. The level of difficulty was adaptively increased depending on the performance of the participants within a given block. Each training session began at the last level successfully achieved by the participants in the previous session. The threshold for moving to the next difficulty level was set according to a pilot study and existing literature. To reach the next level, participants must achieve fewer than 5% of errors and a block’s average response time of less than 700 ms. When participants met these two criteria, they moved to a higher level. Otherwise, they stayed at the same level.

Table 3 Characteristics of the difficulty levels of RC training

N-back training

N-back training was inspired from the training presented by Jaeggi et al. (2010) and Heinzel et al. (2014). Participants indicated if the letter presented at the center of the screen matches a letter N trials before, where N can take the value 2, 3, 4, 5, 6, 7, 8 or 9. All stimuli were presented in 120-point black Times New Roman font on a white background. The letters were selected from a set of 10 letters, namely F, H, K, L, M, Q, R, S, X, and Z (Green & Bavelier, 2003). A trial consisted in the presentation of a fixation cross for 1000 or 1800 ms, followed by a letter for 500 ms (see Fig. 3). Participants pressed the space bar to start a trial. When participants decided that a given letter matches the one presented in N previous positions, they had 500 ms to press the space bar. If they judged the letter was different, no action was required. When participants answered correctly, no feedback was given. A 95-ms pure tone was presented following an error. Each training session was composed of 25 test blocks of 20 + N letters. Participants were offered an optional 5-min break after completing 12 blocks to avoid cognitive fatigue.

Fig. 3
figure 3

Illustration of the stimuli set shown in level N = 2 in the N-back task

Like RC training, the level of difficulty was adapted to the performance of the participants, to ensure maximum performance of the participants. N-back training consisted of 18 difficulty levels, as shown in Table 4. The value of N increased from 1 to 9 letters. Heinzel et al. (2014) established that the difficulty increases proportionally with the value N, because the participants had to retain more letters in working memory. The highest level was set at N = 9 to avoid ceiling effects during training; the highest level achieved in cognitive training literature is usually N = 6 (Jaeggi et al., 2010). In addition, the value of the interstimulus interval (ISI) is 1000 ms for even levels and 1800 ms for odd levels. The difficulty also increased proportionally as the presentation time between stimuli increased, because the participants must retain the stimuli for a longer time (Heinzel et al., 2014). Each training session began at the last level successfully achieved by the participants in the previous session. To reach the next difficulty level, participants had to reach an accuracy rate of 95% without making any false alarms within a block.

Table 4 Structure of N-back training (ISI = interstimulus interval)

Active control training

The active control training program was based on a perceptual discrimination task taken from Lacouture (1997). More specifically, perceptual training was made up of three versions of the same relative judgment task (see Fig. 4): participants had to compare two lines on their length, width, or orientation. The lines were presented horizontally 10 cm from the center of the screen.

Fig. 4
figure 4

Illustration of the stimulus set for the three relative judgment tasks of the control training: a length, b thickness, and c orientation

A trial consisted in the presentation of a black screen for 100 ms, followed by the stimulus set for 1500 ms, and then a white screen for 500 ms. Participants pressed the space bar to start a new trial. Participants had 2000 ms to indicate which line is the (a) longest, (b) thickest, or (c) closest to the vertical. Participants had to press the letter “C” when they decided the correct answer is the line on the left and the letter “M” when they considered the correct answer is the line on the right. When participants answered correctly, no feedback was given. A 95-ms pure tone was presented following an error.

A training session consisted of two practice trials and five consecutive blocks of 24 different trials for each of the three types of judgment. Participants underwent five blocks of one type of judgment at a time. The order of the three tasks was counterbalanced across participants and for the same participant across the eight sessions. An optional 5-min break was offered to participants after five completed blocks to avoid cognitive fatigue.

Each training session began at the last level successfully achieved by the participants in the previous session. The difficulty of each version of the task was manipulated by modifying the difference in length, width or angle from the vertical between the two lines. As shown in Table 5, the task has six levels of difficulty. Like N-back and RC trainings, the level of difficulty was adapted to the performance level of the participants. The first level begins with an 18% difference in length, width or angle from the vertical. Depending on the performance of the participants, the discriminability gradually decreases to 6%, which gradually increases the difficulty of the task (Matthews & Welch, 1997; Shiu & Pashler, 1992). The length, width or angle varies randomly with each type of judgment task, where the minimum and maximum value is 3.3 and 23.3 cm for the length, 1.5 and 10 cm for the width, and 10° and 170° for orientation. The greater magnitude is distributed randomly between the stimulus on the left and the one on the right. Participants reached a higher difficulty level in the block when they made two errors or less within a block.

Table 5 Structure of the active control training for the three judgment tasks

Transfer tasks

Working memory tasks

At both pretest and posttest, working memory was measured using the n-back and OSPAN tasks.

N-back task

This task was identical to the one used during N-back training, but it was restricted to levels N = 2 to N = 6. The higher level was set at N = 6 because it is usually the higher level reached in N-back training (Jaeggi et al., 2008). Participants completed two consecutive blocks of 24 trials for each of the five levels of the N-back task, which were presented from the lower level to the highest. The interstimulus interval was fixed at 1000 ms. The dependent variables were the percentage of correct responses and the response time for correct responses.

OSPAN task

The task was to recall a series of letters in the order in which they were presented. Each letter was separated by a distractive task which consisted in solving a simple mathematical operation (see Fig. 5). All stimuli were shown at the center of the screen in black Times New Roman type on a white background. The OSPAN task was taken from the study of Foster et al. (2015) and was translated in French. We only modified the presentation time of the letters to 1000 ms.

Fig. 5
figure 5

Example of OSPAN. Here, two problem-letter sequences are illustrated

Before performing the experimental phase, participants first familiarized with letter recall alone, mathematical problem solving alone (e.g., “(1 + 0) × 12 = ?”), and then with the complete problem-letter sequence. After this familiarization phase, participants performed 15 experimental trials. In each trial, participants first solved a mathematical problem, and then saw a letter, and then solved another problem, and saw another letter, and so on. Such an alternation of mathematical problem and letter was repeated from 3 to 7 times for each trial with an unpredictable length each time. Each sequence length was encountered three times. At the end of the problem-letter sequence, participants had to recall, in order, the preceding letters by clicking on the corresponding letters presented in a 3 × 4 matrix (see Fig. 5). The dependent variable was the partial score (see Turner & Engle, 1989), computed by summing the number of letters correctly recalled in the correct order.

Selective attention tasks

At both pretest and posttest, selective attention was measured using the RC task and an auditory distraction task.

RC task

Participants completed four versions of the RC task using Levels 6, 14, 16, and 18 from RC training. Participants completed one block of 48 trials for each version of the task, which were presented from the lowest level to the highest. Stimulus presentation was fixed at 100 ms. The dependent variables measured were the percentage of correct identification of the target letter and response time on correct trials. Because we had no a priori hypotheses about the potential influence of distractor compatibility on learning and transfer following RC training, we chose not to analyze distractor compatibility effects (e.g., Beck & Lavie, 2005) for the sake of brevity and clarity.

Auditory distraction task

The auditory distraction task was inspired by the irrelevant sound paradigm of Vachon et al. (2017) and consisted of identifying the missing item from a list of visual stimuli while ignoring a sequence of auditory distractors presented simultaneously (see Fig. 6). Each visual sequence was composed of nine digits taken without replacement from the digit set 0–9 and arranged in a quasi-random order, with the constraint that successive digits were not adjacent integers. Each to-be-remembered item was approximately 2.39° in height and presented sequentially in black Times New Roman font at the centre of a white background for 300 ms with an interstimulus interval of 350 ms. The standard irrelevant auditory sequence consisted of nine repetitions of a spoken letter randomly selected from the set A–B–C–G–J–K–L–M–Q–S. On deviant trials, the sixth letter was replaced by one of the remaining letters, chosen at random (e.g., AAAAABAAA). Insertion of such a deviant sound tends to capture attention and confuse identification of the missing item (e.g., Vachon et al., 2017). The letters were recorded in a female voice at an approximately even pitch and edited using SoundForge (Sony Creative Software) to last 250 ms. The interstimulus interval was 350 ms.

Fig. 6
figure 6

Schematic presentation of stimulus presentation in the auditory distraction task. The top illustrates the to-be-remembered visual digits of the missing item task. The bottom depicts the to-be-ignored irrelevant sound sequence presented simultaneously through headphones. In this example, the auditory sequence contains a deviant item (here, a ‘B’ embedded among ‘As’), indicated by the bold frame

On each trial, a to-be-remembered visual sequence and a to-be-ignored auditory sequence were presented concurrently. The auditory sequence began 125 ms before the beginning of the visual sequence. Following the disappearance of the last visual digit, the 10 digits appeared on the screen simultaneously in canonical order and the participants had to click on the digit that was not presented in the series (i.e., the missing item). Following two practice (control) trials, participants performed a single block of experimental trials containing 39 control trials and six deviant trials presented randomly with the constraint that deviant trials could not be adjacent. We computed a distractibility index as the dependent variable. It consists in the magnitude of the deviation effect, that is, the difference in the percentage of correct identification of the missing item between control and deviant trials. A positive score reflects the presence of distraction.

Statistical analyses

Analyses were carried out on the average performance of the four transfer tasks at both pretest and posttest for each training program. Therefore, a 2 × 3 mixed ANOVA with Session (pretest vs. posttest) as the within-subject factor, and Training (control, RC and N-back) as the between-subjects factor was performed on each dependent variable. For each effect, we also reported the probability that the data favored the alternative hypothesis over the null hypothesis [pBIC(H1|D)] as computed by a Bayes factor analysis using Masson’s (2011) method. When the interaction was significant, simple effects tests were performed: dependent-sample t tests compared performance between pretest and posttest for each type of training using Holm–Bonferroni method. Data, analysis code, and stimuli are available from the corresponding author upon reasonable request.

Results

N-back task

Figure 7A illustrates the mean accuracy rate and Fig. 7B illustrates the average response time at the N-back task at pretest and posttest for each type of training. Accuracy seemed to improve at posttest following RC training and N-back training.

Fig. 7
figure 7

Mean performance at pretest and posttest in each of the four transfer tasks (N-back, response competition [RC], OSPAN, and auditory distraction) for control (in grey), RC (in blue), and N-back (in red) training. Lighter colors represent pretest and darker colors represent posttest. Error bars represent the standard error of the means (color figure online)

Accuracy

The ANOVA performed on accuracy rate revealed main effects of Session, F(1, 57) = 25.72, p < 0.001, η2p = 0.311, pBIC(H1|D) > 0.999, and Training, F(2, 57) = 8.94, p < 0.001, η2p = 0.239, pBIC(H1|D) = 0.983. In addition, there was a significant two-way interaction, F(2, 57) = 20.28, p < 0.001, η2p = 0.416, pBIC(H1|D) > 0.999. Simple effects tests revealed that participants in the N-back training group, F(1, 19) = 52.70, p < 0.001, η2p = 0.735, pBIC(H1|D) > 0.999, and in the RC training group, F(1, 19) = 8.11, p = 0.010, η2p = 0.399, pBIC(H1|D) = 0.894, improved significantly at posttest while no improvement was observed for the control group, F(1, 19) = 1.29, p = 0.271, η2p = 0.063, pBIC(H1|D) = 0.299.

Response time

The ANOVA performed on response time revealed that neither the main effect of Session, F(1, 57) = 0.04, p = 0.849, η2p = 0.001, pBIC(H1|D) = 0.116, nor that of Training, F(2, 57) = 0.65, p = 0.525, η2p = 0.022, pBIC(H1|D) = 0.032, was significant. The interaction was also not significant, F(2, 57) = 2.08, p = 0.134, η2p = 0.068, pBIC(H1|D) = 0.121.

OSPAN task

Figure 7C illustrates the mean partial score of the OSPAN task at pretest and posttest for each type of training. A visual inspection suggests that performance improved at posttest, but only following RC training. The ANOVA performed on these data revealed a significant main effect of Session, F(1, 57) = 5.00, p = 0.029, η2p = 0.081, pBIC(H1|D) = 0.616, but no main effect of Training, F(2, 57) = 0.55, p = 0.580, η2p = 0.019, pBIC(H1|D) = 0.023. The Session × Training interaction, F(2, 57) = 2.21, p = 0.119, η2p = 0.072, pBIC(H1|D) = 0.135, was not significant, which seems to contradict the visual impression that OSPAN performance did improve at posttest in the RC training group (see Fig. 7C). To avoid making a Type II error, we did perform dependent-sample t tests to further investigate this issue. The tests revealed that partial score was indeed significantly higher at posttest following RC training group, F(1, 19) = 13.27, p = 0.002, η2p = 0.411, pBIC(H1|D) = 0.978. However, no pre-post difference was found for the N-back training group, F(1, 19) = 0.57, p = 0.460, η2p = 0.029, pBIC(H1|D) = 0.231, and the control group, F(1, 19) < 0.01, p = 0.953, η2p < 0.001, pBIC(H1|D) = 0.183. The fact that the Bayes factor for RC training provided strong evidence in favor of the alternative hypothesis suggests that there was a genuine improvement of OSPAN performance at posttest following RC training. The absence of significant interaction could be attributable to the relatively high variability found in the N-back and control groups. In fact, we plotted the pretest–posttest difference in partial score for each group (see Fig. 8) and noticed the presence of two extreme outliers (showing an increase of partial score over 20), one in control group and one in the N-back group. To reduce to potential influence of these outliers, we used the median of the pretest–posttest partial score difference instead of the mean. Whereas 15 participants for the RC training group were above the median, this was the case for only 5 and 8 participants in the control and N-back group, respectively. The median test contrasting the three groups was significant, χ2(2) = 10.58, p = 0.005, suggesting that the non-significant interaction likely arose from the presence of extreme outliers. Such results provided further evidence that OSPAN performance improved at posttest, but only following RC training.

Fig. 8
figure 8

Box plot of the pretest–posttest difference in OSPAN partial score for control, RC and N-back trainings

RC task

Performance on the RC task at pretest and posttest is plotted in Fig. 7D (mean accuracy) and Fig. 7E (mean response time) for each training. While response times seemed to improve following each type of training, accuracy appeared to increase only after RC training.

Accuracy

The ANOVA performed on accuracy rate revealed significant main effects of Session, F(1, 57) = 26.47, p < 0.001, η2p = 0.317, pBIC(H1|D) > 0.999, and Training, F(2, 57) = 4.82, p = 0.012, η2p = 0.145, pBIC(H1|D) = 0.641. In addition, there was a significant interaction, F(2, 57) = 10.10, p < 0.001, η2p = 0.262, pBIC(H1|D) = 0.994. Simple effects tests indicated that the RC training group significantly improved at posttest, F(1, 19) = 24.95, p < 0.001, η2p = 0.568, pBIC(H1|D) > 0.999, whereas no such improvement was found for the N-back training group, F(1, 19) = 2.66, p = 0.119, η2p = 0.123, pBIC(H1|D) = 0.445, and the control training group, F(1, 19) = 1.61, p = 0.219, η2p = 0.078, pBIC(H1|D) = 0.340.

Response time

The ANOVA performed on mean response time revealed significant main effects of Session, F(1, 57) = 88.86, p < 0.001, η2p = 0.609, pBIC(H1|D) > 0.999, and Training, F(2, 57) = 4.01, p = 0.023, η2p = 0.123, pBIC(H1|D) = 0.465. In addition, there was a significant interaction between Session and Training, F(2, 57) = 11.80, p < 0.001, η2p = 0.293, pBIC(H1|D) = 0.998. Simple effects tests revealed that participants were significantly faster at posttest in each training group: control training F(1, 19) = 38.44, p < 0.001, η2p = 0.669, pBIC(H1|D) > 0.999, RC training, F(1, 19) = 60.39, p < 0.001, η2p = 0.761, pBIC(H1|D) > 0.999, and N-back training, F(1, 19) = 6.48, p = 0.020, η2p = 0.254, pBIC(H1|D) = 0.808. Given that all groups improved at posttest with regard to response time, we contrasted the pretest–posttest difference between each group to verify whether some groups showed better improvement than others. The tests revealed a significantly larger improvement following RC training relative to both control training, F(1, 38) = 12.16, p = 0.001, η2p = 0.242, pBIC(H1|D) = 0.976, and N-back training, F(1, 38) = 17.07, p < 0.001, η2p = 0.318, pBIC(H1|D) = 0.997. However, N-back and control training groups showed similar improvement in response time on the RC task, F(1, 38) = 2.04, p = 0.161, η2p = 0.051, pBIC(H1|D) = 0.310.

Auditory distraction task

Figure 7F shows the size of the deviation effect (i.e. the amount of distraction) at pretest and posttest in each training condition. A visual inspection of the plot suggests that distraction did not seem to diminish at posttest, regardless of the type of training. This was confirmed by the ANOVA carried out on these data, which revealed no main effect of Session, F(1, 57) = 0.35, p = 0.557, η2p = 0.006, pBIC(H1|D) = 0.134, nor of Training, F(2, 57) = 1.91, p = 0.158, η2p = 0.063, pBIC(H1|D) = 0.104. The interaction was also not significant, F(2, 57) = 0.85, p = 0.431, η2p = 0.029, pBIC(H1|D) = 0.034.

Discussion

The general objective of the study was to contribute to the understanding of far transfer mechanisms in cognitive training by empirically testing the hypothesis developed by Greenwood and Parasuraman (2016). They suggested that distraction suppression plays a central role in far transfer. The present study tested this hypothesis from the load theory perspective by examining whether a training program that solicits both cognitive control and perceptual selection—the two distraction suppression mechanisms proposed by Lavie et al. (2004)—would promote far transfer. We developed a novel training program using the RC task and compared its efficacy to that of a classical N-back training, which stimulates essentially only the cognitive control mechanism. The results showed that, relative to a control (perceptual) training, the only training program that produced far transfer was the one soliciting two distraction suppression mechanisms, namely RC training.

Far transfer mechanisms

We assumed that RC training involved both perceptual selection and cognitive control in suppressing distraction while N-back training solicited only the latter. The observation here that far transfer was restricted to RC training suggests that training both distraction suppression mechanisms plays a key role in far transfer. Given the newness of the RC training program and the fact that it was not closely match to the N-back training program, one must remain cautious, however, about the exact contribution of these two suppression mechanisms to far transfer effects. For instance, one cannot exclude the possibility that the success of RC training in producing far transfer relied exclusively on its cognitive control component.

Nevertheless, the present findings were consistent with the distraction suppression hypothesis which postulates that cognitive training programs designed to increase the ability to ignore distractors should promote far transfer. In light of existing studies, training both distraction suppression mechanisms would result in more effective working memory processes (Corbetta et al., 2008; Cowan et al., 2005). First, training the perceptual selection mechanism could allow for a more effective suppression of distractors that appear concurrently to the target (see also Mishra et al., 2014, 2016). It would allow for a better selection of relevant information accessing working memory, thereby preventing the presence of irrelevant information that could interfere with it (Vissers et al., 2017). Moreover, various neuroimaging studies demonstrated the implication of perceptual selection in the efficacy of working memory (Gazzaley et al., 2005; Rees et al., 1997; Sreenivasan & Jha, 2007). For its part, training the cognitive control mechanism could allow for a more effective suppression of distractors that are actively retained in working memory (Corbetta et al., 2008). The effectiveness of such mechanism could be a determining factor for encoding and retention of information in working memory (Vissers et al., 2017). Moreover, Gaspar et al. (2016) showed that the ability to ignore distractors is linked to the capacity of working memory (see also Hughes et al., 2013; Sörqvist, 2010).

An unexpected result was the absence of near transfer from RC training to the auditory distraction task. Such an absence could be attributable to the inadequate calibration of the auditory distraction task. The results obtained indicated inconsistent and unexpected trends, limiting any possible conclusion for this task. First, contrary to what is usually found in this task (e.g., Hughes et al., 2013; Vachon et al., 2017), the deviation effect was absent from the pretest for the RC training group. This absence was surprising given that the deviation effect is considered a benchmark phenomenon for modelling short-term memory (Benchmark 6.3; Oberauer et al., 2018). It was therefore difficult to observe an improvement (i.e., a decrease in distraction) following the training program. In addition, the data appeared noisy, as suggested by the large error bars. This large variability in the data could be explained by the small number of deviant trials. There were only six deviant trials, which may not have been sufficient to provide a good signal-to-noise ratio. It is possible that a greater number of participants or trials would have been necessary to circumvent this limit. Hence, the parameters chosen in the auditory distraction task may have limited the possibility of observing a noticeable modulation of performance, calling into question the absence of near transfer. Another conceivable explanation for the lack of near transfer from RC training is the presence of a switch from visual to auditory distractors between the training task and the transfer task. In fact, there is evidence for the failure to observe transfer across sensory modalities (e.g., Schneiders et al., 2011, 2012, but see also Kattner et al., 2019). Even if this is a plausible hypothesis, it is important to mention that the literature on cross-modal transfer in cognitive training is inconsistent and further studies should be conducted to clarify this specific type of transfer (Taya et al., 2015).

Implications for Greenwood and Parasuraman’s hypothesis

Greenwood and Parasuraman’s (2016) hypothesis specified patterns of activation of brain regions, such as the activation of the dorsal attention network (see also Rueda et al., 2016), that characterize learning and far transfer. From this standpoint, the dorsal attention network, which involves the intraparietal sulcus, superior parietal lobule, and frontal eye fields, would be responsible for distraction suppression (see also Jerde et al., 2012). It is notably because N-back training is deemed to activate the dorsal network of attention that Greenwood and Parasuraman pointed to this form of training as a potentially effective way to induce far transfer. Yet the present study, consistent with previous research (De Simoni & Von Bastian, 2018; Melby-Lervåg & Hulme, 2013; Soveri et al., 2017), questioned this capacity to transfer N-back training to untrained abilities. In fact, the present findings suggest that an effective way to increase the ability to ignore distractions, hence to promote far transfer, is to train cognitive control (as in N-back training) along with perceptual selection. There is evidence that the stimulation of both distraction suppression mechanisms engages more cerebral regions than proposed by Greenwood and Parasuraman. Indeed, previous research found that the bilateral insula, left middle frontal gyrus and anterior cingulate cortex were activated during a task soliciting both cognitive control and perceptual selection (Torralbo et al., 2016; Wei et al., 2013). Moreover, both studies noticed that increasing perceptual load increase activity in the fronto-parietal regions (Torralbo et al., 2016; Wei et al., 2013). Wei et al. (2013) specified that this increased activation was located in the frontal eye fields, posterior parietal cortex, anterior insula and anterior cingulate cortex. Such expanded activation within the dorsal attention network pertaining to the solicitation of the perceptual selection mechanisms could be linked to the effectiveness of RC training relative to N-back training in transferring to untrained abilities observed in the present study. More research is required to corroborate this hypothesis.

Inefficacy of N-back training

The present study provided further evidence that N-back training fails to generate far transfer. The results were consistent with the existing literature showing that the benefits of N-back training are limited to the trained task (De Simoni & Von Bastian, 2018; Melby-Lervåg & Hulme, 2013; Soveri et al., 2017). It has been demonstrated that effective distraction suppression contributes to the effectiveness of working memory processes (Au et al., 2015; Karbach & Verhaeghen, 2014). We suggest that a training that solicit only the cognitive control mechanism would not be effective to produce significant improvement on selective attention and working memory tasks, because such training would not increase distraction suppression at an early stage of information processing (Corbetta et al., 2008; Vissers et al., 2017). Indeed, it has been shown that effective distraction suppression at a perceptual stage can improve the processing of relevant stimuli in working memory (Gazzaley et al., 2005; Rees et al., 1997; Sreenivasan & Jha, 2007). If this hypothesis is accurate, then N-back training could be modified to engage both cognitive control and perceptual selection by integrating high and low perceptual load contexts. If such version of the N-back training proved to be effective in promoting far transfer, then one could conclude with more confidence that the lack of transfer from N-back training to other working memory and selective attention tasks observed in the present study was indeed due to the fact that it does not stimulate the perceptual selection mechanism.

Action video games

Well studied in the literature, AVG-based training is deemed to stimulate both distractor suppression mechanisms (Murphy et al., 2016). Indeed, AVG expose learners to a variety of tasks that alternate between a predominantly high perceptual load (e.g., attempting to shoot multiple opponents that appear on the screen at the same time) and predominantly a low perceptual load (e.g., attempting to shoot a single opponent without any distractor; Murphy et al., 2016). There is growing evidence that AVG-based training can produce far transfer by modifying the behavior and organization of neural networks (Altarelli et al., 2020). Nonetheless, it is not possible to establish what makes AVG effective in producing better performance at the perceptual, attentional, and memory levels (Green & Bavelier, 2003; Oei & Patterson, 2013). Based on the present study, we suggest that the solicitation of both distraction suppression mechanisms (cognitive control and perceptual selection) is at the basis of the effectiveness of AVG in enhancing untrained abilities (e.g., Halo or Call of Duty; Green & Bavelier, 2003, 2012). A parallel can be drawn between the findings from AVG studies and the results of the present study. Indeed, Mishra et al. (2011) provided neural evidence for enhanced suppression of distracting irrelevant information in AVG players relative to non-players. Furthermore, far transfer to selective attention and working memory tasks has been documented following AVG-based training (Cohen et al., 2007; Dye et al., 2009; Murphy et al., 2016). Specifically, researchers have shown that experts—who have played numerous hours of AVG—are more accurate and quicker at the RC task than beginners—who have played at best a few hours of AVG (Cohen et al., 2007; Dye et al., 2009). Moreover, experts are faster (Boot et al., 2008; McDermott et al., 2014) and more accurate at the N-back task compared to beginners (Colzato et al., 2013). In fact, due to its success in producing transfer on selective attention and working memory tasks, AVG are being considered as an attractive therapy for the recovery of individuals with cognitive deficits in clinical neuropsychology (Nahum & Bavelier, 2020). Identifying the mechanisms underpinning the transfer capacities of AVG-based training to untrained abilities could help a better understanding of the outcomes from this new type of therapy.

Conclusion

The present research contributed to a better understanding of the mechanisms underlying far transfer in cognitive training, a priority in the cognitive enhancement field (Burgess et al., 2014; Greenwood & Parasuraman, 2016; Taya et al., 2015). The study provides support for the hypothesis that distraction suppression training fosters far transfer to working memory tasks in cognitive training. By borrowing the concept of cognitive control and perceptual selection to the load theory of attention (Lavie et al., 2004), the present study proposed that effective cognitive training—one that promotes far transfer—could benefit from engaging both distraction suppression mechanisms. The demonstrated capacity of RC training implemented in the present study to enhance performance to untrained tasks points to this novel training program as a promising tool that could be exploited by researchers in the field. Yet, more research is required not only to replicate the present findings, but also to investigate RC-training-related changes in neural activity.