Spreading the sparing: against a limited-capacity account of the attentional blink

Olivers, Christian N. L.; van der Stigchel, Stefan; Hulleman, Johan

doi:10.1007/s00426-005-0029-z

Spreading the sparing: against a limited-capacity account of the attentional blink

Original Article
Published: 08 December 2005

Volume 71, pages 126–139, (2007)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Psychological Research Aims and scope Submit manuscript

Spreading the sparing: against a limited-capacity account of the attentional blink

Download PDF

Christian N. L. Olivers¹,
Stefan van der Stigchel¹ &
Johan Hulleman²

1284 Accesses
147 Citations
4 Altmetric
Explore all metrics

Abstract

The identification of the second of two targets presented in close succession is often impaired—a phenomenon referred to as the attentional blink. Extending earlier work (Di Lollo, Kawahara, Ghorashi, and Enns, in Psychological Research 69:191–200, 2005), the present study shows that increasing the number of targets in the stream can lead to remarkable improvements as long as there are no intervening distractors. In addition, items may even recover from an already induced blink whenever they are preceded by another target. It is shown that limited memory resources contribute to overall performance, but independent of the attentional blink. The findings argue against a limited-capacity account of the blink and suggest a strong role for attentional control processes that may be overzealously applied.

The attentional blink: why does Lag-1 sparing occur when the dependent measure is accuracy, but Lag-1 deficit when it is RT?

Article 26 May 2018

Training and the attentional blink: Raising the ceiling does not remove the limits

Article 24 July 2017

Time to see the bigger picture: Individual differences in the attentional blink

Article Open access 17 November 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The human visual system appears limited in the amount of information it can process across time, as has become apparent from the rapid serial visual presentation (RSVP) paradigm, in which stimuli (typically alphanumerical characters) are shown in rapid succession to the observer. It is often found that the detection of the second of two targets (T2) within such a stimulus stream is impaired when presented within about 500 ms after the first target (T1, e.g., Chun and Potter, 1995; Raymond, Shapiro, and Arnell, 1992; see also Kahneman, Beatty, & Pollack, 1967). To characterize this phenomenon, Raymond et al. (1992) proposed the term attentional blink: it is as if attention is temporarily unavailable for new input when processing earlier relevant visual information.

One of the characteristics of the attentional blink is that it often does not occur for a second target when this target immediately follows the first target, i.e., at lag 1. This phenomenon is referred to as “lag-1 sparing” and it appears to occur as long as the transition from T1 to T2 does not involve a task switch or a location shift (Potter, Chun, Banks, & Muckenhoupt, 1998; Visser, Zuvic, Bischof, & Di Lollo, 1999b).

Prevalent explanations of the attentional blink and lag-1 sparing stress limited-capacity resources as their major cause. For instance, according to the bottleneck account, T1 needs to be consolidated in short-term memory (STM) for it to be available for conscious report (Chun & Potter, 1995; Jolicoeur & Dell’Acqua, 1998). This process of consolidation requires limited-capacity resources, which are then unavailable (or at least not sufficiently available) for T2, whose representation therefore remains vulnerable and becomes easily overwritten by the subsequent items in the stream (which are believed to act like masks, Brehaut, Enns, & diLollo, 1999; Dell’Acqua, 2003; Giesbrecht & DiLollo, 1998; Grandison, Ghirardelli, & Egeth, 1997; Seiffert & DiLollo, 1997). Lag-1 sparing may then be explained by assuming that when T1 and T2 occur in close succession, the exact temporal order information is often lost, and both targets may compete more or less equally for the same resources (Potter, Staub, & O’Conner, 2002). The interference account of Shapiro and Raymond and colleagues (Raymond, Shapiro, & Arnell, 1995; Shapiro & Raymond, 1994; Shapiro, Raymond, & Arnell, 1994) poses that rather than just a single item, multiple items may enter a STM stage. Typically this would involve the two targets, but also some of the intervening or subsequent distractor items. Within STM these items then compete for report. This competition is heavily biased by a number of factors, one of them being the order of entrance into STM. More attentional resources are assigned to T1 (by virtue of it being the first target) and often its immediate successor (because it so closely follows T1). This also explains the lag-1 sparing phenomenon. However, because the total attentional weight within STM is limited, the assignment of resources to these early items will be automatically at the expense of the somewhat later items, including T2, which may even fail to enter STM at all if resources are insufficient. Thus, even though multiple items are in theory allowed to enter the limited-capacity stage, the overall principle is the same as in the bottleneck theory: capacity is limited, and it is used up by T1.

First indications against a limited-capacity account

A number of recent findings argue against a T1-induced resource deficiency as a sufficient or even necessary explanation of the attentional blink. Olivers and Nieuwenhuis (2005a, b) have shown that the attentional blink is attenuated when observers adopt a more distributed attentional state, which may be induced by, for example, an additional cognitive task or by positive affect (Ashby, Isen, & Turken, 1999; Derryberry & Tucker, 1994). Performance for T2 improved when participants were instructed to concentrate less, simultaneously think of their holidays, listen to music, perform an additional memory task or watch positively laden emotional pictures. I. Arend, S. Johnston and K. Shapiro (unpublished results, as cited in Kessler et al., 2005) found improvements when the RSVP stream was embedded in a distracting visual display of random “starfield” motion. One may argue that these manipulations simply led to a redistribution of resources away from T1 and towards T2. Against this, however, Olivers and Nieuwenhuis found that T1 detection too improved under distracting conditions. To explain these findings, Olivers and Nieuwenhuis (2005b) proposed that the attentional blink is due to an overinvestment of attention in the RSVP stream rather than due to a lack of attentional resources. We will return to how this may work in the General discussion.

Findings by Di Lollo, Kawahara, Ghorashi, and Enns (2005; Kawahara, Enns, & Di Lollo, 2005) also argue against an explanation in terms of a T1-induced resource deficiency. Their study focused on triplets of items that were embedded in an RSVP stream of distractors. In one condition, the triplets consisted of a target, a distractor and a second target (i.e. T1 D T2). As expected, relative to T1, performance dropped substantially for T2, indicative of an attentional blink. In another condition, the triplets consisted of three successive targets (i.e. T1 T2 T3). Note that in this three-target condition, the last target (now T3) was in exactly the same temporal position relative to T1 as was the last target (T2) in the two-target condition. If the attentional blink is caused by a T1-induced resource deficiency, an attentional blink would again be expected for this last target, especially because the additional target in between is assumed to also require resources. However, Di Lollo et al. (2005) found that detection accuracy for T3 in the three-target triplets did not differ from that for T1. In other words, there was no attentional blink, but “lag-2 sparing” instead. They further found that performance for the middle target was best of all. Di Lollo et al. (2005) argued that, in principle, there are sufficient resources available to process multiple targets in close succession (at least more than one or two). They proposed that instead, the attentional blink is caused by a temporary disruption of endogenous attentional control settings. According to this temporary loss of control (TLC) account, observers seek to filter the information in the RSVP stream by setting up an attentional set that matches the target category (e.g., letters) and rejects the distractor category (e.g., digits). The maintenance of such an attentional set demands a certain amount of executive control. However, when T1 is presented, these same executive control functions are needed to process the target. The consequence is a TLC over the input filter. This loss of control is harmless as long as the incoming items are targets, but it becomes harmful when it allows for distractors to enter. According to the TLC account, a distractor will exogenously disrupt the now vulnerable input settings, affecting the selection of subsequent items. Given sufficient time, attentional control will be regained and the input filter will be reinstated. Thus, according to the TLC account, the attentional blink is not due to limited resources at the level of the individual targets. Instead, the limitations lie at a higher, executive level where only one task aspect (target identification, input control) can be actively handled at a time.

The present study

The present study sought to further investigate the roles of limited capacity and attentional control settings in the attentional blink, as well as the relationship between them. For this purpose we used RSVP streams containing up to four targets. Targets could be presented in immediate succession or with distractors inserted at various temporal positions (lags). According to the T1-induced resource deficiency accounts, the occurrence of the attentional blink should be tied to T1, regardless of following targets. If anything, additional targets are expected to aggravate the blink, since additional resources are required. According to the TLC account, T1 processing destabilizes the attentional input filter, allowing for distractors to disrupt it. This means that the attentional blink is not tied to T1, but to the occurrence of the first post-T1 distractor. As long as no such distractor is presented, the processing of additional targets should be unaffected.

Experiment 1 replicated and extended Di Lollo et al.’s (2005; see also Kawahara et al., 2005) work: sparing from the attentional blink is not limited to lag 1, but can be extended to lag 2 and even lag 3 as long as the intervening items are targets too. Performance for the fourth target was nevertheless affected, suggesting a remaining role for capacity limitations. Experiment 2 repeated the main manipulations, but controlled for differential masking effects between targets and distractors. Again sparing was found for T3 and T4. A new and exciting finding was that once a proper attentional blink had been induced (through an intervening distractor), subsequent targets could recover from this blink when preceded by another target. This indicates that control processes were still in place, responding dynamically to the changing input. Furthermore, the data suggested that multiple initial targets may induce a more profound blink for later targets than may just a single initial target, again pointing towards a residual role for capacity limitations. Experiment 3 served to explore both these effects further. It showed that across its entire time course, target items may escape from the attentional blink. Furthermore, multiple items did eventually induce a greater drop in performance, but this effect was additive with the effect of temporal position (lag). This indicates that limited target processing capacity and the attentional blink independently contribute to performance; the one is not caused by the other.

Experiment 1: sparing spreads to lag 2 and lag 3

Di Lollo et al. (2005; Kawahara et al., 2005) have shown that sparing from the blink is not limited to lag 1, but may spread to lag 2 as long as the intervening item is a target too. They concluded that the attentional blink is not due to a lack of limited-capacity resources. The question is if there remains no role for limited-capacity target processing resources whatsoever. The mainstream theories of the attentional blink assume that the conscious report of the targets in the RSVP stream requires some sort of (visual) STM (e.g., Chun & Potter, 1995; Jolicoeur & Dell’Acqua, 1998; Raymond et al., 1995). Others have suggested that the capacity of STM is functionally limited to about three to four items (see Cowan, 2000, for an extensive review). This would mean that limited-capacity resources may still play a role in the attentional blink if we tax STM a little further. Using stimuli that were highly similar to those used by Di Lollo et al. (2005), we presented quadruplets of items embedded in a stream of distractors. Table 1 shows the different possible sequences of targets and distractors. In the one-target (1-T) control condition, a single target was presented at any of the four positions within the quadruplet, while the rest was filled with distractors. In the two-target (2-T) standard attentional blink condition, T1 was presented on the first position, whereas T2 could follow on any of the remaining three positions (i.e. at lag 1, 2, or 3; again the rest was filled with distractors). An attentional blink was expected for T2. In the three-target (3-T) condition, T1 was presented on the first position, whereas the remaining three positions were filled with the various possible combinations of T2, T3 and a distractor. Finally, in the four-target (4-T) condition, the four positions were successively filled with T1, T2, T3 and T4. On the basis of the TLC account we expected sparing in the 4-T condition to spread to T4. However, if available STM capacity contributes to performance, we may expect a performance drop for the final T4 target (compared to the 3-T and 2-T conditions), since most of the resources have been used up by the first three targets.

Table 1 Possible target–distractor sequences for the one-target (1-T), two-target (2-T), three-target (3-T) and four-target (4-T) conditions of Experiments 1 and 2, and the predictions according to the resource deficiency and TLC accounts

Full size table

Further support for a limited-capacity account may come from comparing the 3-T condition to the 2-T condition. Notably, detection of the final target (T3) in the T1 T2 D T3 quadruplet (where “D” denotes a distractor) is expected to be worse than that of the final target (T2) in the T1 D D T2 quadruplet, because the initial two targets require more resources than an initial single target.

Finally, for exploratory purposes, we also included the T1 D T2 T3 quadruplet. Both the resource deficiency and TLC account predict the occurrence of an attentional blink for T2 here. But what will happen for T3? The resource deficiency account predicts a clear blink for this target too. However, the TLC account is less clear in its predictions. It states that the presentation of a distractor exogenously disrupts the input filter. But what does the presentation of a target do to the filter? A proper loss of control would suggest that the filter stays disrupted and we would therefore also expect an attentional blink for T3. However, if the control processes are more dynamic, and a proper input filter can be reinstated on presentation of a target (in this case T2), we might expect T3 to be spared.

As in Di Lollo et al.’s study (2005), participants were not required to report the targets in the order of presentation. Moreover, although targets were randomly chosen from a set, there was the restriction that all targets within a stream had to be different; thus, probabilities of occurrence were not independent. This meant that we felt the need to take into account the possibility of guessing—something Di Lollo et al. (2005) did not do. This is because the more targets are reported, the higher the chance that one gets at least one of them right merely by guessing. Note further that from hereon we will avoid the term “lag” as much as possible and use the term “temporal position” instead. This is because “lag” is defined relative to T1, whereas we were interested in effects of targets beyond T1.

Method

Participants

Twelve students of the Vrije Universiteit Amsterdam (nine males; two left-handed; aged 17–33 years; average 22 years) participated in return for monetary payment.

Stimuli, procedure and design

Stimulus generation and response recording were done using E-Prime (Psychology Software Tools, Inc., Pittsburgh, PA, USA). After a 1,000-ms blank period, a 0.5°×0.5° fixation cross was presented for 1,000 ms in the center of the display, and subsequently replaced by a rapid serial presentation of 19–22 letters, each measuring approximately 0.8×0.8°. The entire RSVP series (including the fixation cross) was presented in black on a gray (40 cd/m²) background. Each letter was randomly drawn (without replacement) from the alphabet and presented for 75 ms, followed by a 25 ms blank. “I”, “O”, “Q” and “S” were left out as they may resemble digits too much. On each trial, one to four letters were replaced with digits, randomly drawn (without replacement) from the set 0 to 9. The first target (T1) was presented randomly between positions 12 and 16 inclusive. Subsequent targets, when present, followed within the next three positions, which were otherwise filled with distractors. This way, the relevant items were all presented as a quadruplet embedded in a stream of distractors. The participant’s task was to identify all targets and an unspeeded response was made at the end of each trial by typing in the digits on a standard keyboard. Participants were instructed to guess whenever they failed to identify a digit. They were also asked to enter the targets in the order they perceived them, if possible, but it was made clear that this was not crucial. Correctly identified targets that were entered in the wrong order were counted as correct. Feedback on accuracy was given after each trial.

Table 1 summarizes all possible target–distractor sequences within the relevant quadruplets. In the 1-T control condition, a single target was presented in any of the four possible temporal positions (with equal probability). In the 2-T condition, T1 was presented on the first position, and T2 could then appear at positions 2, 3 or 4. In the 3-T condition, T1 was presented on position 1, T2 could appear on either of positions 2 and 3, whereas T3 could appear on either of positions 3 and 4 (depending on T2; see Table 1). In the 4-T condition, T1, T2, T3 and T4 appeared at positions 1, 2, 3 and 4, respectively.

The experiment started with 12 practice trials for each number of targets, followed by two sessions of four blocks each, with a short break in between. Within each session, there was one block for each number of targets, and block order was randomized. Temporal positions of the target(s) were randomly varied within a block. Each block contained 36 trials. The experiment lasted approximately 45 min.

Results

Proportions correct for each target were first corrected for guessing depending on the number of targets in the condition. In the 1-T condition, we assumed that the observed proportion correct $({P}_{\rm T1\_obs})$ consisted of a proportion really perceived targets $({P}_{\rm T1\_real})$ plus a proportion guessed targets $({P}_{\rm T1\_guess}).$ Since we used ten digits as possible targets, the latter component can be described as ${P}_{\rm T1\_guess} = (1 - {P}_{\rm T1\_real})\frac{{1}}{{10}},$ so that:

$${{P}}_{{\text{T1\_obs}}}\,=\,{{P}}_{{\text{T1\_real}}}\,+\,(1 - {{P}}_{{\text{T1\_real}}})\frac{{1}}{{10}}.$$

(1)

In the 2-T condition, observed performance for T1 $({P}_{\rm T1\_obs})$ depended not only on whether T1 was perceived $({P}_{\rm T1\_real})$ or guessed $({P}_{\rm T1\_guess})$ correctly, but also if T2 was perceived $({P}_{\rm T2\_real})$ correctly. This is because if neither T1 nor T2 was perceived correctly, there were two chances of guessing T1 correctly (since order of report did not matter). The same goes for T2, leading to the following set of equations:

$${{P}}_{{\text{T1\_obs}}}\,=\,{{P}}_{{\text{T1\_real}}}\,+\,(1 - {{P}}_{{\text{T1\_real}}})(1 - {{P}}_{{\text{T2\_real}}})\frac{{2}}{{10}} + {{P}}_{{\text{T2\_real}}} (1 - {{P}}_{{\text{T1\_real}}})\frac{{1}}{{9}},$$

(2)

$${{P}}_{{\text{T2\_obs}}}\,=\,{{P}}_{{\text{T2\_real}}}\,+\,(1 - {{P}}_{{\text{T1\_real}}})(1 - {{P}}_{{\text{T2\_real}}})\frac{{2}}{{10}} + {{P}}_{{\text{T1\_real}}} (1 - {{P}}_{{\text{T2\_real}}})\frac{{1}}{{9}}.$$

(3)

The same principle was applied to the 3-T and 4-T conditions, resulting in equivalent but increasingly complex equations. These equations were then numerically solved for ${{P}}_{{\text{T1\_real}},} \;{{P}}_{{\text{T2\_real}},} \;{{P}}_{{\text{T3\_real}}} \;{\text{and}}\;{{P}}_{{\text{T4\_real}}}. $

Figure 1 shows these real proportions correct target identification for the different numbers of targets after correction for guessing, as a function of temporal position. In the multiple target conditions (i.e. 2-T, 3-T and 4-T), accuracy for the post-T1 targets was contingent upon correct T1 identification. It deserves mentioning though that the same pattern of results held when analyzed independently of T1 accuracy. Figure 1 reveals a complex pattern of findings and we will discuss them step by step. A mnemonic may be of help in interpreting Fig. 1: the single line “_” symbol represents the single target (1-T) condition; the “X” (containing two lines) represents the 2-T condition; the triangle (three sides) represents the 2-T condition; and the square (four sides) represents the 4-T condition (see also the figure caption). Note further that not all conditions contained targets in all temporal positions, so an omnibus ANOVA was not possible. Therefore, separate comparisons were performed where appropriate.

In the 1-T control condition, overall accuracy was high (91%) and there was no effect of temporal position (1–4; F<1, P>0.5), indicating that the position within the RSVP stream per se did not contribute to performance. The pattern in the 2-T condition was different from that in the 1-T condition, as indicated by a number of targets (1-T vs. 2-T) × temporal position (1–4) interaction, F(3, 33)=27.07, MSe=0.009, P<0.001. Accuracy was quite high for T1 (temporal position 1) and for T2, when it immediately followed T1 (temporal position 2), reflecting the lag-1 sparing effect. In contrast, performance showed a steep drop relative to the single target control condition, when T2 was presented at temporal positions 3 and 4, t(11)=6.71, P<0.001 and t(11)=6.39, P<0.001, respectively. Thus, the 2-T condition reveals a standard attentional blink pattern.

We subsequently assessed if presenting more than two targets leads to deviations from this standard attentional blink pattern. In the version of the 3-T condition in which the three targets were presented in succession (i.e. the T1 T2 T3 D quadruplet), the pattern was indeed different from that in the 2-T condition, as confirmed by a number of targets (2-T vs. 3-T) × temporal position (1–3) interaction, F(2, 22)=14.56, MSe=0.008, P<0.001. Pair-wise comparisons revealed that accuracy on temporal position 3 was substantially improved in the 3-T condition (83%) relative to the 2-T condition (59%), t(11)=3.78, P<0.01. Similar improvements relative to the standard 2-T condition occurred in the 4-T condition (featuring the T1 T2 T3 T4 quadruplet); number of targets (2-T vs. 4-T) × temporal position (1 to 4), F(3, 33)=12.48, MSe=0.011, P<0.001. Pair-wise comparisons revealed that performance was worse on temporal position 1 in the 4-T relative to the 2-T condition, t(11)=2.85, P<0.05, but better on temporal positions 3 and 4, t(11)=3.63, P<0.01 and t(11)=2.36, P<0.05, respectively. Nevertheless, performance was somewhat deteriorated for the fourth target, relative to the first three targets in the 4-T condition, all ts≥3, all Ps<0.05. Thus, sparing spreads to the third and, to a lesser extent, the fourth temporal position, when targets immediately succeed each other.

The pattern in the T1 D T2 T3 quadruplet of the 3-T condition also differed remarkably from the standard attentional blink found in the 2-T condition, as indicated by a number of targets (2-T vs. 3-T) × temporal position (1, 3, 4) interaction, F(2, 22)=3.64, MSe=0.016, P<0.05. Pair-wise comparisons showed that accuracy in the 3-T condition was somewhat improved for temporal position 1 (93 vs. 88% in the 2-T condition), t(11)=2.31, P<0.05, did not differ for temporal position 3, but then remarkably improved again for position 4 (73 vs. 50%), t(11)=3.13, P=0.01. Thus, although a clear attentional blink had been induced (as indicated by T2 performance), T3 somehow managed to escape from this blink.

Taken together, these results suggest that the attentional blink is not induced by T1, but by the first post-T1 distractor. This is because an attentional blink occurred for the final targets in the T1 D T2 D, the T1 D D T2 and the T1 T2 D T3 quadruplets (as well as for the middle target in the T1 D T2 T3 quadruplet), whereas it was absent or considerably reduced in the T1 T2 D D, T1 T2 T3 D and T1 T2 T3 T4 quadruplets. The finding of relative sparing of a third and fourth target, in combination with a distractor-induced blink, replicates and extends findings by Di Lolllo et al. (2005) and provides direct support for the TLC account. However, there may still be an additional role for limited-capacity resources in target processing. The finding that accuracy for a fourth target, though relatively spared, was not as good as for earlier targets or as in the single target control condition, suggests that observers were running out of resources. The involvement of limited-capacity resources may also be suggested by the finding that processing multiple targets eventually led to a slightly deeper blink than when only a single target needed to be processed. Performance for the final target in the T1 T2 D T3 quadruplet of the 3-T condition was worse than for the final target in the quadruplet of the 2-T condition T1 D D T2 (43 vs. 50%). Although there was a trend, this difference failed to reach significance, t(11)=1.63, P=0.13. However, the same pattern also appeared in Experiments 2 and 3, and we will return to it later.

The finding that a target can be recovered even when a full blink has been induced (in the T1 D T2 T3 quadruplet) is quite exciting. Apparently, even though T2 itself was often not detected, it could nevertheless re-open the attentional “gate” for the subsequent T3. This effect too will be further investigated in later experiments. It is worth pointing out here though that this beneficial effect for the second of two targets also occurred before a blink was even induced, namely between T1 (temporal position 1, on average 83% correct) and T2 (temporal position 2, on average 93% correct), t(11)=6.38, P<0.001. This suggests that, more generally, detection of targets in an RSVP stream may improve from immediate repetition of the target category.

Experiment 2: controlling for masking

The general finding of Experiment 1 was that performance was relatively good as long as the target was preceded by another target. Although this finding is consistent with the idea that a loss of control over the input filter is not harmful as long as only targets are presented (as proposed by the TLC account), it may also be explained in terms of forward masking. In Experiment 1 the targets were digits, whereas the distractors were letters. It is possible that digits mask digits less well than letters mask digits, for example, because fewer features are shared or because of different pixel densities (cf. Maki, Bussard, Lopez, & Digby, 2003). To control for this we changed the stimuli in Experiment 2, so that targets were now letters, and distractors were taken from a set of “fantasy” characters. The stimuli are illustrated in Fig. 2. Across the set, the fantasy characters shared exactly the same line segments as the letters, in exactly the same quantities. This way, low-level visual forward masking effects should be equal within and across target and distractor categories, in terms of line features as well as pixel densities, and thus cannot explain potential sparing effects.