Introduction

The gap effect, a phenomenon first described by Saslow (1967), corresponds to a reduction in saccadic reaction time (SRT) when the fixation point is removed some time before the onset of the eccentric target, i.e., when there is a temporal “gap” between the removal of the fixation point and the onset of the eccentric target. The reaction time improvement relative to the overlap condition, where the fixation point remains the same before and after the onset of the eccentric target, typically peaks for gap values around 200 ms.

Several studies have shown that there are different components that can contribute to a reduction of the saccadic latency. Recent studies seem to agree that the dominant components are due to the attentional disengagement (or release) and the fixation offset effect (FOE).

The attentional disengagement theory is based on the attention model of Posner and Petersen (1990). According to this model, to switch between two visual targets, attention first needs to be disengaged from its current focus, then shifted to the new location, and finally must be re-engaged on the visual target. Fisher and Breitmeyer (1987) proposed that the offset of the fixation point produces a disengagement of attention that reduces the SRT. Subsequent studies have supported this theory (Jin and Reeves, 2009; Mackeben and Nakayama 1993; Pratt et al., 2006).

The fixation offset effect theory is corroborated by the study of Munoz and Wurtz (1992). From experiments carried out in monkeys, they showed that while fixating a visual stimulus, neurons in the superior colliculus discharge in a tonic manner and inhibit the execution of saccades. During a saccade, these neurons pause their activity permitting the saccadic movement. Several studies (Dorris and Munoz 1995; Kingstone and Klein 1993; Reuter-Lorenz et al. 1995) have proposed that the offset of the fixation point pauses the activity of the fixation neurons during the gap interval, producing an oculomotor release that facilitates the saccade and hence reduces the latency.

Typical gap effect experiments manipulate visual properties of the fixation point or of the eccentric target. For example, Reuter-Lorenz et al. (1991) studied the effect of two luminance levels for the eccentric target, one bright and one dim. They found that brighter eccentric targets produce shorter reactions times, and that this effect is additive with the SRT reduction due to the offset of the fixation point. Bell et al. (2006) also found that brighter targets produce shorter SRTs in monkeys, concluding that brighter targets are processed faster by the visual system.

Reulen (1984) measured SRTs for three different levels of the fixation point brightness (high, medium, and low), while keeping the intensity of the eccentric target constant (always high brightness). The results of Reulen’s experiments showed that the mean SRT was longer for lower brightness values of the fixation point. Reulen also concluded that only a significant decrease in the brightness of the fixation stimulus shortens the SRT, but the magnitude of the decrease was not defined.

Instead of brightness, Pratt et al. (2000) manipulated the area of the fixation point to estimate the contribution of visual warning and FOE to the gap effect. Their study included an auditory warning in half of the trials. The authors found a gap effect of 67 ms in the absence of the warning tone, and 40 ms when the warning tone was present, thus confirming that alertness modulates the gap effect.

Jin and Reeves (2009) evaluated several theories of the gap effect and concluded that attentional release has a major impact on the SRT reduction, while oculomotor release has a smaller but significant effect. The authors included in their experiments the dim gap trial, where the luminance of the fixation point was reduced, and the white gap trial, where the luminance was increased. Although the dim and white gap trials were selected to provide equally salient warning signals, SRTs were shorter in the dim gap trial than in the white gap trial. The authors assumed that this difference was because dimming the fixation point sufficiently is similar to turning it off, hence releasing attention, whereas brightening it would hold attention. They also ran a finger-press reaction time experiment to foveal probe dots and confirmed that attention was held after the fixation point was brightened, and released when it was removed.

These results can be explained in terms of the bottom-up account of visual attention, according to which attentional allocation is driven by external factors such as stimuli salience produced by, for example, movement, contrast, and loudness (Awh et al. 2012). An example of the bottom-up control of attention can be found in the cueing paradigm, where subjects have to initially fixate a central stimulus and then search for a target that is presented at one of several possible locations (Fuchs and Ansorge 2012). A task-irrelevant, exogenous cue presented before the target onset facilitates the search task if the cue’s location coincides with the future target location. This facilitation supports the idea that attention is attracted by the cue.

There is evidence that higher-contrast cues attract more attention than lower-contrast cues (Fuchs and Ansorge 2012; Fuller et al. 2009; Kean and Lambert 2003). Furthermore, Fuller et al. (2009) found that attention is modulated by cue contrast, concluding that although allocation of attention is automatic and unconscious, the attentional process is gradual rather than all-or-none.

In gap effect experiments where the task does not demand any specific conscious attentional behavior (Bell et al. 2006; Jin and Reeves 2009; Pratt et al. 2000; Reuter-Lorenz et al. 1991), it is reasonable to consider that attention is mainly driven by the stimuli salience.

Considering the role of attention in the gap effect and the evidence from the cueing paradigm that bottom-up allocation of attention is modulated by cue contrast, we hypothesized that the attentional engagement component of the gap effect can be modulated by perceptual changes in the fixation point.

If this hypothesis is correct, we expect that the saccadic reaction time can be expressed by psychophysical laws such as Weber–Fechner’s law (Fechner 1860), since it is a perceptual response to physical stimuli. The Weber–Fechner law has been applied in saccadic latency models to encode the difference in luminance or contrast of a visual target, in situations where the target is present or not (Carpenter 2004) or when its intensity changes from an initial to a final value (Taylor et al. 2006).

We developed a mathematical model that expresses the SRT as a logarithmic function of the brightness gain (the ratio between the final and initial brightness of the fixation point). The significance of such a model is twofold. In practical terms, it permits quantifying the probable effect of different stimuli manipulation on SRT, without having to try each one, and also to estimate if a desired effect on SRT is possible by manipulating stimuli. In theoretical terms it establishes a quantitative link between perception and action, based on Weber–Fechner’s law (Fechner 1860) which states that perceptual response is a logarithmic function of stimulus physical intensity.

The model is introduced in the next section, and is then tested in an experiment where the brightness of the fixation point was changed for several initial and final levels.

Mathematical model for saccadic reaction time

Let’s assume that the initial luminance of the fixation point is L i n i t and, some time (gap) before the onset of the eccentric target, it changes to a final value L f i n a l . Let’s assume also that the background where the stimuli are presented is black (luminance 0 cd/ m2), similar to previous gap effect experiments (Jin and Reeves, 2009; Pratt et al., 2000; Reuter-Lorenz et al., 1995, 1991), and that the ambient illumination is L a . The intensity of the light that reaches the observer’s eyes are L i n i t +L a and L f i n a l +L a for the initial and final values of the fixation point, respectively.

Weber–Fechner’s law (Fechner 1860) states that the subjective sensation of a stimulus is proportional to the logarithm of its physical magnitude. According to this law, the subjective sensation produced by the initial and final brightness of the fixation point would be log(L i n i t +L a ) and log(L f i n a l +L a ), respectively. Hence, the subjective sensation difference produced by the brightness change of the fixation point can be expressed as:

$$ \log \left(\frac{L_{final} + L_{a}}{L_{init} + L_{a}} \right) $$
(1)

It is noteworthy that the logarithm argument in Eq. 1 is independent of the units or representation used for the intensity of the fixation point. Because of that, we have named it the brightness gain (BG). Using Eq. 1, we propose that the saccadic reaction time can be expressed mathematically by:

$$ SRT = \alpha \cdot \textnormal{log} (BG) + \beta $$
(2)

where the values of α and β are numerical constants determined empirically. We define α as the attentional engagement factor and represents the slope of the lin-log graph curve of SRT by log(BG). A steep slope implies that the SRT is very sensitive to variations of the brightness gain, while a more gradual slope (flat) implies that the reaction time will vary little even for large brightness variations. We define β as the non-modulated SRT since it represents the saccadic reaction time measured without any visual manipulation of the fixation point (overlap condition). Observe that when the brightness gain is equal to 1, log(BG) is zero and the parameter β is equal to the SRT observed in the overlap condition.

When the brightness gain is lower than 1, log (B G) is negative and the attentional engagement is reduced, thus facilitating the saccade, i.e., the SRT will be shorter than β. On the other hand, when the brightness gain is greater than 1, log(BG) is positive and the attentional engagement is enhanced (the SRT will be longer than β).

In order to validate our quantitative model of saccadic reaction time, we designed an experiment with several levels of initial and final brightness of the fixation point and two gap intervals that is described in the next section.

Methods

Participants

Sixteen people from the University of São Paulo participated in the experiment (three female) with ages ranging from 21 to 50 years old, average 28.6 (± 8). All participants had normal or corrected to normal vision and seven of them had never used an eye tracker before. The research protocol was approved by the Ethics Board of the Institute of Biomedical Sciences of the University of São Paulo.

Apparatus

Eye movement data was collected using an SMI RED500 remote eye tracker with a sampling frequency of 500 Hz. The eye tracker was placed at the bottom of a 22” LCD monitor used to present the visual stimuli. The brightness and contrast of the monitor were initially set to their maximum values (100 %), and the brightness of the visual stimuli was controlled by software during the experiment. The experiment was conducted in a room with regular fluorescent illumination, common in office spaces, with no direct sunlight as recommended by Holmqvist et al. (2011).

The eye tracker and monitor were positioned 60 cm from the participant’s eyes as depicted in Fig. 1. A chin rest was used during the experiment to reduce head movements and keep the eyes at approximately the same height as the center of the screen. Disks with a diameter of 1 of visual angle were used as visual stimuli. A small dot was placed at the center of the disks to serve as target. The distance between the fixation point and the eccentric target corresponds to 10 of visual angle.

Fig. 1
figure 1

Experimental setup showing the position of the participant’s eyes, the monitor, and the eye tracker

Design

In order to study the effect of perceptual changes of the fixation point in attentional engagement, SRTs for several configurations with different values of brightness gain must be evaluated. The other independent variable in our experiment was the gap interval.

We defined three levels of initial brightness and four levels of final brightness of the fixation point. The initial brightness levels were high (100 % brightness of the monitor), medium (50 % of the maximum brightness of the monitor), and low (5 % of the maximum brightness of the monitor). The final brightness levels were the same high, medium, low, and zero (0 % of the maximum brightness of the monitor). The gap values were 0 and 200 ms.

The following four conditions were defined, each containing three manipulations of the fixation point. For the overlap condition the fixation point stays high (HH), medium (MM), or low (LL). Observe that the notation used in parenthesis corresponds to the manipulation of the fixation point, i.e., HH implies that the fixation point started at H and there was no change. For the offset condition (fixation point disappears), the fixation point goes to zero, from high (HZ), medium (MZ), or low (LZ) intensities. For the lower brightness condition, the fixation point intensity decreases in one of three possible ways: HL, ML, or HM. Finally, for the higher brightness condition, the fixation point intensity increases in one of three possible ways, which are the opposite of the lower condition: LH, LM, and MH.

A fully crossed design was adopted thus resulting in 4 (conditions) × 3 (fixation point manipulations per condition) × 2 (gap values) = 24 configurations. Each configuration shall be referred to by a number (0 for zero gap, 2 for 200-ms gap), and two letters, as above, that define the fixation point manipulation. Thus, 2HL refers to the configuration in which the fixation point changes for high to low intensity 200 ms before target onset.

The offset configurations served to validate our experiment by comparing with existing studies where the fixation point was removed. The overlap condition (brightness of the fixation point was constant) was used as the control. According to our model, all overlap configurations should elicit similar SRTs, since the brightness gain is always 1, regardless of the fixation point brightness. For the lower brightness condition shorter SRTs than in the control condition are to be expected, because of the attentional release produced by dimming the fixation point. Finally, for the higher brightness condition SRTs would be expected to be longer than in all other conditions, since brightening the fixation point engage more attention.

In the experimental conditions described above, with exception of the overlap condition, the brightness change of the fixation point occurs at the same time or 200 ms before the target onset. This change acts as a top-down visual warning that permits the subject to prepare to make the saccade (Trappenberg et al. 2001). Since we wanted to estimate the effect of perceptual changes on attention for all conditions, including the overlap condition, a 1-kHz auditory warning was played 100 ms before the visual manipulation, similar to (Craig et al., 1999; Pratt et al., 2006, 2000; Reuter-Lorenz et al., 1995, 1991), so as to introduce a warning signal in all conditions.

Because the luminance intensity varies among monitors, to make our experiment replicable, the monitor was calibrated with a J1803 Tektronix LumaColorTM Photometer. We manually annotated the photometer readings in cd/ m2 for equally spaced values of brightness levels at the center of the monitor, where the fixation point was shown during the experiment. The calibration showed that our monitor had a power luminance curve expressed by y=249.9x 2.26,R 2=.99, where x is the brightness percent (expressed as a fraction, i.e., for a 100 % brightness x equals 1) and y is the luminance in cd/ m2. The values of the levels high, medium, low, and zero were converted to cd/ m2, resulting in the values presented in Table 1.

Table 1 Conversion between brightness levels and luminance

The ambient luminance was computed by placing the photometer at approximately the same position of the participants’ eyes, pointing towards the center of the monitor, which was turned off during this procedure. The ambient luminance was obtained after the photometer reading stabilized at 4 cd/ m2.

Procedure

Each participant was asked to initially look at the fixation point at the center of the screen until an eccentric target appeared, and then look as fast as possible at the target, 10 to the left or right-hand side of the fixation point. Participants repeated 20 times every possible configuration. From the 20 repetitions, the eccentric target appeared ten times to the left and ten times to the right of the central fixation point in random order. Therefore, each participant performed 20 × 24 = 480 trials. The order of the trials was randomly selected for each participant.

Each trial was composed of five phases as shown in Fig. 2: an initial fixation phase, followed by an auditory warning, fixation point manipulation, and gap periods, and finally the phase that started with the onset of the eccentric target. Observe that in some conditions the duration of a phase could be 0 ms.

During the initial phase, the central stimulus was presented for a period uniformly distributed in the range 400 to 600 ms, to avoid any learning/prediction bias in the results. A 1-kHz warning tone that lasted 100 ms followed in the second phase. The visual manipulation of the fixation point occurred step-wise (i.e., in successive frames), at the end of the warning tone, so was a phase that lasted 0 ms. In this phase, the fixation point might disappear (offset condition), remain with constant brightness (overlap condition), or have its brightness decreased (lower brightness condition) or increased (higher brightness condition). The fourth phase was the gap period (0 or 200 ms), which preceded the final phase, in which the eccentric target appeared, always with brightness level high, at the left or right-hand side of the fixation point. Participants were asked to gaze at the eccentric target as fast as possible after it appeared, and to keep fixating until it disappeared.

Fig. 2
figure 2

Gap, overlap, lower brightness and higher brightness conditions used in our experiment

Figure 2 shows a graphic representation of the four conditions used in the experiment. The first row shows an example of how the fixation point changes during an offset condition trial where the brightness drops from high to zero, with a 0- or 200-ms gap. The second row shows the overlap condition, where the fixation point never changes (in this example stays at high) throughout the trial. Figure 2c shows an example of the lower brightness condition, where the final brightness of the fixation point is lower than its initial brightness. In this example, the fixation point goes from high to medium. The last row of Fig. 2 shows an example of the higher brightness condition, where the fixation point varies from medium to high.

Data collection and processing

Each participant visited our laboratory once and remained there for about 35 min. Data gathering was divided into four sessions, each one having a duration of approximately 5 min with at least 5 min break between sessions. Each session was composed of ten sets of 12 trials each, and each set lasted about 30 s. Participants were instructed to try not to blink during each set and they were free to rest their eyes between sets. Participants initiated each set by pressing a key. All participants received a brief introduction to the experiment and signed an informed consent form. After the introduction, the eye tracker was calibrated and participants were familiarized with the experiment through a short training session.

The software for controlling the experiment, i.e., manipulating the visual stimuli and recording the eye movements, was written using the Python programming language and the PsychoPy library (Peirce 2009). During the experiment, the software was designed to identify some problems and try to correct them. When the data quality was degraded above a certain threshold by calibration drifts of the eye tracker, the software interrupted the execution of the current set of saccades, asked the participant to re-calibrate the eye tracker, and repeated the interrupted set at most once. The software computed fixations in real time using a dispersion-based algorithm (Salvucci and Goldberg 2000) and was designed to detect situations where participants ended the fixation on the central stimulus before the onset of the eccentric target (anticipations). The software also detected blinks using information of the pupil diameter. When such anticipations or blinks were detected, the unsuccessful trial was appended to a list of trials that were repeated in a separate set (or sets) at the end of the session. To keep the sessions within reasonable duration, trials in the unsuccessful list were not repeated and, in case of a second problem, the trial was discarded.

Saccades were detected offline using a variation of the velocity-threshold fixation identification (I-VT) algorithm of Salvucci and Goldberg (2000). For each trial, the algorithm considered the midpoint of the left and right eye coordinates from the first 500 ms after the onset of the eccentric target. Before detecting saccades, eye tracker acquisitions were filtered to remove high-frequency noise using a moving average filter. Trials with more than three consecutive invalid acquisitions (as reported by the eye tracker) were discarded. Invalid acquisitions were substituted by the mean of a window of size 5 centered at the acquisition.

Results

We recorded a total of 7407 trials, including those that were repeated due to calibration drifts, blinks, and anticipations. In 0.4 % of the total number of recorded trials, the saccade detection algorithm did not detect saccades, in 5.7 % the saccade was not directed towards the eccentric target, 2.7 % had blinks or anticipations, and 1.7 % were discarded due to noise. The remaining 89.5 % of the trials contained valid saccades. Data from one participant were discarded because he tried to guess the location of the eccentric target most of the time, resulting in much higher error rate and a much shorter SRT. Hence, the results are based on the other 15 participants.

Results for the four conditions are shown in Fig. 3. Figure 3a shows the results for the zero-gap interval and Fig. 3b shows the results for the 200-ms gap interval. Each column shows the box plot of a particular combination of the independent parameters, computed using the data from the 15 participants. Each box plot shows the minimum and maximum SRT as the extremes of the dashed line, the lower and upper quartiles as the extremes of the shaded box, the median as a horizontal line within the box, and the mean as a circle (’o’). The values of the mean and one standard deviation are also presented above each box. Outliers are shown as ’*’. Below each box plot there is a three-letter code consistent with the notation introduced in Section “Design”, describing gap value and brightness change, defined hereinafter as “configuration”.

Fig. 3
figure 3

Experimental results for the overlap, offset, lower brightness and higher brightness conditions for 0- and 200-ms gap intervals. Each visual manipulation was coded using three letters. The first letter indicates a 0- or 200-ms gap interval, and the second and third letters indicate the brightness transition of the fixation point. Brightness values are coded as H (high), M (medium), L (low), or Z (zero). Each box plot shows the minimum and maximum SRT as the extremes of the dashed line, the lower and upper quartiles as the extremes of the shaded box, the median as a horizontal line within the box, and the mean as a circle (’o’). The values of the mean and one standard deviation are also presented above each box. Outliers are shown as ’*’.

The rest of this section describes the results of each of the four conditions in turn, and concludes with a global quantitative analysis of all the data based on the model proposed in Eq. 2. Every ANOVA test reported here was preceded by a Mauchly’s test of sphericity. In case of sphericity violation, the degrees of freedom and the p value were corrected using the Greenhouse–Geisser correction method, since the estimation of sphericity (𝜖) was lower than 0.75 in all cases of sphericity violation.

Overlap condition

In the overlap condition, the fixation point brightness remained high, medium, or low throughout each trial (Fig. 2b). The warning tone was played for 100 ms and ended at the same time (zero-gap) or 200 ms before the onset of the eccentric target.

A two-way repeated-measures ANOVA of brightness level (high, medium, low) × gap (0, 200) showed no significant effect of level on SRT, F(2,28)=1.56,p=.23,η p 2=0.1. Therefore, in the overlap condition, the brightness of the fixation point does not seem to influence the saccadic reaction time.

Significant changes in SRT were observed as a function of gap, F(1,14)=66.69,p<.001,η p 2=0.83, which can only be attributed to the presence of the warning tone, and was not associated with brightness changes of the fixation point. Using the grand mean of the three brightness levels, SRT went from 201 (±29) ms for the zero-gap configurations, to 169(±24) ms for the 200-ms gap configurations, making a difference of about 32 ms. There was no significant interaction between brightness and gap.

Offset condition

In the offset condition, the fixation point, with initial brightness levels high, medium, or low, was removed at the same time (zero-gap) or 200 ms (gap) before the onset of the eccentric target. This visual manipulation was synchronized with the warning tone, which was played for 100 ms and ended at the same time (zero-gap) or 200 ms before the onset of the eccentric target.

An inverse relationship between the initial brightness level and SRT was found: lower SRT’s were associated with higher fixation point levels. A two-way repeated-measures ANOVA of brightness (high, medium, low) × gap (0, 200) showed a significant main effect of brightness, F(1.39,19.5)=11.43,p=.0014,η p 2=0.45. A post hoc test with Bonferroni correction revealed that SRT for the low level brightness was significantly longer than for the high (p=.0093) and medium (p=.017) levels.

ANOVA also revealed a significant main effect of gap, F(1,14)=169.24,p<.001,η p 2=0.92. A longer grand mean SRT was found for the zero-gap configurations (190 ±22 ms) than for the 200-ms gap configurations (152 ±20 ms), a difference of 38 ms. There was no interaction between initial brightness level and gap.

To see whether there was a gap effect due specifically to the removal of the fixation point, SRTs in the offset and overlap conditions must be compared, as in Fig. 4.

Fig. 4
figure 4

Grand mean from 15 participants for configurations where the fixation point brightness was reduced for 0- and 200-ms gap intervals, grouped by the initial brightness value

Considering the high brightness offset (high to zero) and comparing with the control configuration (unchanging level high), for zero-gap there was a significant 20-ms reduction in SRT, t(14)=5.6,p<.001, Cohen’s d=1.45, while for the 200-ms gap, the reduction was 22 ms, also significant, t(14)=5.99,p<.001, Cohen’s d=1.55.

Considering the medium offset (medium to zero) and comparing with the control configuration (level medium), for zero-gap there was a significant 13-ms reduction in SRT, t(14)=4.25,p<.001, Cohen’s d=1.1, whereas for the 200-ms gap, the reduction was 19 ms, also significant, t(14)=5.52,p<.001, Cohen’s d=1.43.

When the initial brightness level was low, a significant SRT reduction was observed only for the 200-ms gap, of 11 ms, t(14)=3.11,p=.0077, Cohen’s d=0.8.

Thus, removing a brighter fixation point produced significant shorter reaction times than removing less-bright fixation points. For both gap intervals we observed a significant gap effect for the high and medium brightness levels, whereas for the low level, the only significant gap effect was for the 200-ms gap. Similar to the overlap condition, the warning tone produced a significant SRT reduction and did not interact with the initial brightness level.

Lower brightness condition

In the lower brightness condition, the brightness of the fixation point was reduced in the three possible ways: from high to low, medium to low, and high to medium levels at the same time (zero-gap) or 200 ms before the onset of the eccentric target. Once more, the warning tone was played for 100 ms and ended at the same time (zero-gap) or 200 ms before the onset of the eccentric target.

While the results for the high-to-low and medium-to-low configurations were very similar, the high-to-medium configuration presented a longer mean reaction time than the other two. A two-way repeated-measures ANOVA of brightness (high-to-low, medium-to-low, high-to-medium) × gap(0, 200) showed a significant main effect of brightness, F(1.33,18.6)=11.02,p=.002,η p 2=0.44. A post hoc test with Bonferroni correction showed that the high-to-medium configuration had a significant longer SRT than the high-to-low (p=.02) and medium-to-low (p=.004) configurations.

Analyzing the effect of gap on SRT in this condition, the grand mean for the zero-gap trials (191 ±25 ms) was found to be higher than for the 200-ms gap trials (156 ±23 ms), a difference of 35 ms. This difference was significant as revealed by ANOVA, F(1,14)=85.96,p<.001,η p 2=0.86. There was no significant interaction between brightness and gap.

To see whether dimming the fixation point might specifically reduce the SRT in a way comparable to when it is turned off altogether, it is necessary to compare the results of the lower brightness with the overlap condition.

As can be seen in Fig. 4, the high-to-low brightness showed a significant reduction of about 20 ms for zero-gap, t(14)=4.28,p<0.001, Cohen’s d=1.1, and about 17 ms, also significant, for the 200-ms gap, t(14)=4.26, p<.001, Cohen’s d=1.1.

Figure 4 also shows the SRT for the medium-to-low trials. For the zero-gap, there was a significant SRT reduction of about 16 ms, t(14)=4.26,p<.001, Cohen’s d=1.1, while for the 200-ms gap, the reduction was about 19 ms, also significant, t(14)=4.14,p=.001, Cohen’s d=1.06.

However, in the high-to-medium configuration, no significant SRT reduction was observed both for zero-gap, t(14)=0.04,p=.97, Cohen’s d=0.01, and for the 200-ms gap, t(14)=1.06, p=.31, Cohen’s d=0.27, as can be observed in Fig. 4.

Summarizing, configurations where the final brightness level was low had significant shorter SRTs than configuration with final level medium, and also produced significant gap effects for both the zero-gap and 200-ms gap, whereas trials where the final brightness was medium did not produce any significant gap effect. Similar to the overlap and offset conditions, the gap interval had a significant effect on SRT and did not interact with the brightness changes.

Higher brightness condition

In the higher brightness condition, the fixation point brightness was increased in the three possible configurations: low-to-high, low-to-medium, and medium-to-high. This manipulation occurred at the same time (zero-gap) or 200 ms before the onset of the eccentric target. As in the other three conditions, the warning tone was played for 100 ms and ended at the same time (zero-gap) or 200 ms before the onset of the eccentric target.

As can be observed in Fig. 3, SRTs were different among higher brightness configurations. A two-way repeated-measures ANOVA of brightness (low-to-high, low-to-medium, medium-to-high) × gap (0, 200) showed a significant main effect of brightness, F(1.38,19.3)=7.16, p=.009,η p 2=0.34. A post hoc test with Bonferroni correction showed that the medium-to-high configuration had a significant shorter SRT than the low-to-high (p=.006) and low-to-medium (p=.04) configurations.

Analyzing the effect of gap on SRT in this condition, the grand mean for the zero-gap trials (223 ±34 ms) was found to be higher than for the 200-ms gap trials (185 ±38 ms), a significant difference of 38 ms, F(1,14)=22.55, p<.001, η p 2=0.62. There was no significant interaction between brightness and gap.

To see whether brightening the fixation point might specifically increase the SRT in a way opposite to when it is dimmed or turned off altogether, it is necessary to compare the results of the higher brightness condition with the overlap condition.

As can be seen in Fig. 5, relative to the low-low overlap configuration, the low-to-high configuration showed a significant increase of about 33 ms for the zero-gap, t(14)=4.18, p<.001, Cohen’s d=1.08, and about 25 ms, also significant, for the 200-ms gap, t(14)=3.08, p=.008, Cohen’s d=0.8.

Fig. 5
figure 5

Grand mean from 15 participants for configurations where the fixation point brightness was increased, for 0- and 200-ms gap intervals, grouped by the initial brightness value

Figure 5 also shows the SRT for the low-to-medium trials. Relative to the low-low overlap configuration, for zero-gap there was a significant SRT increase of about 26 ms, t(14)=3.3, p=.005, Cohen’s d=0.85, while for 200 ms gap the increase was about 19 ms, also significant, t(14)=2.88, p=.01, Cohen’s d=0.73.

In the medium-to-high configuration, also shown in Fig. 5, relative to the medium-medium overlap configuration, a significant SRT increase of 14 ms was observed for zero-gap, t(14)=3.1, p=.008, Cohen’s d=0.43. For the 200-ms gap, a smaller SRT increase of 6 ms was observed, which was not significant, t(14)=1.97, p=.07, Cohen’s d=0.51.

Results of the higher brightness condition showed that increasing the brightness of the fixation point produces significantly longer reaction times compared with the overlap condition. SRTs were significantly longer in trials with initial brightness level low, compared to trials with initial brightness level medium. Similar to all the other conditions, the gap interval showed a significant effect on SRT’s and did not interact with the brightness changes.

Global analysis of the experimental saccadic reaction times

Recall that our model of saccadic reaction time presented in Eq. 2 has the form:

$$SRT = \alpha \cdot \log (BG) + \beta $$

where α is the attentional engagement factor and β is the non-modulated reaction time (observed without visual manipulation). Both α and β must be determined empirically from experimental data. BG is the brightness gain defined as the ratio of final and initial intensities of the fixation point.

Figure 6 shows the regression results for 0- and 200-ms gap configurations. The horizontal axis at the bottom represents the base 10 logarithm (log10) of the brightness gain, and the vertical axis represents the SRT in ms. Each point in the regression corresponds to the mean performance of 15 participants for each experimental configuration. Because for the overlap condition BG=1 for all configurations, the grand mean of the three levels (high, medium, and low) was used as a single point for the regression. Thus, the regression was computed using ten points for each gap interval as seen in Fig. 6.

Fig. 6
figure 6

Results of regression to compute the parameters of the model proposed in Eq. 2 for the 0- and 200-ms gap configurations. Note that the x axis has a logarithmic scale

For zero-gap, the proposed model significantly predicts the saccadic reaction time, F(1,8)=135.96, p<.001, and R 2=.94. The estimated value of α was 13.9 ms/log10(BG)and had a significant effect, t(8)=11.66, p<.001, while the estimated value of β was 205.1 ms.

For the 200-ms gap, the results also strongly supports the model, F(1,8)=66.67, p<.001,R 2=.89. The estimated value of α was 12.1 ms/log10(BG) and had a significant effect, t(8)=8.17, p<.001, while the estimated value of β was 167.6 ms.

Because both α values are positive, the curves have positive slopes. The value of β can be seen at the points with log 10(B G)=0, and corresponds to the SRT for a particular gap interval when there is no visual manipulation.

Discussion

We first compare our results with existing gap-effect studies in the literature and then discuss the mathematical model of saccadic reaction time based on perceptual brightness changes in the fixation point. Finally, we discuss how the model could be explained by the two current dominant alternative theories of attention disengagement and fixation offset effect.

Comparing with the literature

In a gap-effect experiment, presenting an auditory warning before the target onset produces an overall reduction in SRTs compared with setups where the auditory warning is not presented. This reduction varies among conditions, being more pronounced in overlap trials because of the lack of visual warning (Pratt et al. 2000; Reuter-Lorenz et al. 1995). Therefore, we must compare our results with other gap-effect studies that also included an auditory warning.

The largest SRT reduction was observed between the high-to-high and high-to-zero configurations for the 200-ms gap. This reduction, of about 22 ms, is comparable in magnitude with other gap effect experiments that included a warning tone for gap intervals of 200 ms, such as Pratt et al. (2000) and Reuter-Lorenz et al. (1995).

Since we had two gap intervals (0 and 200 ms), it is possible to compute a rough estimate of the gap effect we could have observed in our experiment without an auditory warning. Saccadic reaction times in the 200-ms gap trials were on average 36 ms shorter than in the 0-gap trials. This reduction due to the gap interval was independent of the brightness changes, since we did not observe any significant interaction between gap values and brightness changes in the ANOVA tests. Given that SRTs are shorter when the auditory warning is synchronized with the target onset than when there is no auditory warning (Ross and Ross 1981), we would expect to see a gap effect of at least 36 + 22 =58 ms for the 200-ms gap without an auditory warning, which is compatible with existing studies without an auditory warning, such as Forbes and Klein (1996), Pratt et al. (2000), and Reuter-Lorenz et al. (1995).

Although the experiments of Reulen (1984) did not include an auditory warning, Reulen found an inverse relationship between fixation point brightness and saccadic reaction time. We had three configurations where the fixation point was removed (high-to-zero, medium-to-zero, and low-to-zero) and found, similar to Reulen, an inverse relationship between the initial brightness level and SRT’s.

Jin and Reeves (2009) also did not include an auditory warning, however we can use the luminance values of their fixation stimulus to estimate the saccadic reaction time difference between the dim and white gap trials according to our model. In the dim gap trial, Jin and Reeves reduced the luminance of the fixation point from 80.1 cd/ m2 to 4.2 cd/ m2, whereas in the white gap trial the luminance was increased from 80.1 cd/ m2 to 117 cd/ m2. Similar to the present experiment, Jin and Reeves presented the stimuli over a black background. The brightness gain for the dim gap trial is (4.2+4)/ (80.1+4)=0.1, whereas for the white gap trial is (117+4)/(80.1+4)=1.44. According to our SRT function for a gap value of 200 ms, the predicted SRT in the dim gap trial is S R T d i m =12.1l o g 10(0.1)+167.6=155 ms, whereas the predicted SRT in the white gap trial is S R T w h i t e =12.1l o g 10(1.44)+167.6=170 ms. Therefore, the predicted difference between the dim and white gap trials is approximately 15 ms. This value is identical to the 15-ms difference found by Jin and Reeves in their randomized design, so our model’s prediction was relatively close to the real results, even though there are differences between the experimental setups, like the area and shape of the fixation point, the number of participants and session duration, and the use of an auditory warning.

Model discussion

Our results showed that saccadic reaction time can be expressed by the mathematical model introduced in Eq. 2:

$$SRT=\alpha \log_{10} (BG)+\beta $$

where BG is the brightness gain, defined as the ratio of final and initial brightness of the fixation point (including the ambient luminance), α is the attentional engagement factor, β is the non-modulated SRT, and the logarithm is base 10 (without loss of generality).

After fitting the experimental results to the model, we obtained positive values of α for the 0- and 200-ms gap intervals (13.9 and 12.1, respectively). This indicates that increasing the salience of the fixation point elicits longer saccadic reaction times, whereas reducing the salience shortens the saccadic reaction time in an essentially symmetric way.

In the light of our results, we can now answer the following question raised by Reulen (Reulen 1984): what should the magnitude of the brightness reduction be in order to produce a given gap effect? According to our model, the magnitude of the brightness reduction must be proportional to the initial brightness of the fixation point. More specifically, the final brightness must be ten-times smaller than the initial brightness in order to elicit an approximately 13-ms reduction in SRT. Conversely, to increase the SRT by approximately 13 ms, the final brightness must then be ten-times greater than the initial brightness.

Future studies could also include more than one level of background intensity and different eccentricities of the target, since those factors can influence the saccadic reaction time as well.

Attentional disengagement or FOE

As pointed out by the reviewers of this paper, this study does not explicitly manipulate attention. Therefore, though our model has been inspired by the thought that the attentional engagement component of the gap effect is modulated by perceptual changes in the fixation point, we acknowledge that the modulation effect reported here could have alternative explanations, such as low-level processes that affect saccadic reaction time. For example, perceptual changes of the fixation point could influence the oculomotor system that controls the execution of saccades in the superior colliculus, since it receives direct input from the retina and the collicular neurons are sensitive to stimulus intensity (Liversedge et al. 2011). If this alternative explanation was valid, the activity of the fixation cells in the rostral pole of the superior colliculus would be modulated by perceptual changes of the fixated stimulus. This hypothesis could be investigated by measuring the activity of the fixation neurons while manipulating the salience of the fixation point, to find out if such correlation exists.

In support of our hypothesis of attentional modulation, our model corroborates that the level of attentional engagement, rather than having two states (released or engaged), varies continuously with perceptual changes of the fixation stimulus. The model shows that the saccadic reaction time is modulated by perceived brightness changes of the fixation point, and can be expressed by the logarithmic model of Eq. 2. Different from Fuller et al. (2009), who found that attentional allocation is modulated by cue contrast, the modulation effect reported in this paper occurs at the fixation point, which the subject is already attending.

Our results could be interpreted as pointing to the existence of both top-down and bottom-up attentional processes that affect the saccadic reaction time. The auditory warning might activate an endogenous process that prepares the disengagement from the fixation point. This top-down disengagement seems to be independent of visual changes in the fixation point, given that the SRT reduction observed from the 0- to 200-ms gap interval was almost constant for all configurations (see Figs. 4 and 5). The bottom-up process is driven by perceptual changes in the fixation point by exogenously releasing or engaging attention accordingly. When the fixation point becomes more salient, attention becomes more engaged, hence delaying the saccadic response. On the other hand, when the fixation point salience is reduced, attention is released, resulting in shorter saccadic reaction times.

Despite possible explanations for the model, quantitative analysis of the gap effect provided by our model has potential value in clinical psychology, both for diagnostic purposes and for cognitive therapy. Other areas such as Human-Computer Interaction could also benefit from the model. For example, Tula et al. (2013) have manipulated the brightness of computer interface elements to improve visual feedback while providing attention disengagement, thus facilitating interaction by exploiting natural eye movements. In this case, quantitative models could be used to dynamically adjust interface parameters, such as contrast, to optimize user response.

Conclusions

We have presented a mathematical model of saccadic reaction time (SRT) based on the fixation point brightness gain. The model is based on the Weber–Fechner law, which states that perceived brightness is proportional to the logarithm of the stimulus intensity. It permits to quantity the effect of different brightness manipulations (increasing or reducing the brightness) of the fixation point. We have empirically validated the model using a typical gap-effect experiment with 15 participants, where the brightness gain was manipulated under several conditions, prior to the onset of the eccentric target. Although, in our experiments, we do not directly manipulate attention, we speculate that this modulation effect could be a consequence of attentional engagement being modulated by perceptual changes of the fixation point. Future experiments will be conducted to investigate if these results can actually be attributed to attentional modulation. Nonetheless, because the model can predict SRT changes due to brightness variations, it has potential value in clinical psychology, both for diagnostic purposes and for cognitive therapy.