Free choice tasks as random generation tasks: an investigation through working memory manipulations

Naefgen, Christoph; Janczyk, Markus

doi:10.1007/s00221-018-5295-2


Research Article
Published: 31 May 2018

Volume 236, pages 2263–2275, (2018)

Experimental Brain Research


Abstract

Free choice tasks are tasks in which two or more equally valid response options per stimulus exist from which participants can choose. In investigations of the putative difference between self-generated and externally triggered actions, they are often contrasted with forced choice tasks, in which only one response option is considered correct. Usually, responses in free choice tasks are slower when compared with forced choice task responses, which may point to a qualitative difference in response selection. It was, however, also suggested that free choice tasks are in fact random generation tasks. Here, we tested the prediction that in this case, randomness of the free choice responses depends on working memory (WM) load. In Experiment 1, participants were provided with varying levels of external WM support in the form of displayed previous choices. In Experiment 2, WM load was induced via a concurrent n-back task. The data generally confirm the prediction: in Experiment 1, WM support improved both randomness and speed of responses. In Experiment 2, randomness decreased and responses slowed down with increasing WM load. These results suggest that free choice tasks have much in common with random generation tasks.


Introduction

In everyday life we often have to make choices without having a clear criterion for which option is better: Choosing what to eat when we only care about whether we eat, which set of purpose-appropriate clothes to pick from our wardrobe, from which lane we want to take a shopping cart when they’re all equally far away and so on. Despite occasional assertions to the contrary,^{Footnote 1} we make such decisions with ease and swiftly. This type of choice devoid of almost all personal meaning, however, is also often used in laboratories when certain modes of action selection are investigated with so-called free choice tasks.

Free choice tasks

In these tasks, participants are instructed to freely choose one of two (or more) response options that are considered equally correct. For example, consider a task in which whenever an ‘H’ is displayed on a screen, participants are supposed to press either a button to their left or a button to their right. Often, the participants are instructed to avoid obvious patterns in their choices (like left-right-left-right, for example) and to give all response options in equal proportions. We will discuss potential issues with this type of instruction in the subsequent “Free choice and random generation tasks” section, after we have introduced the task and important observations in the following. The experiments reported in this paper address critical aspects following from such instructions.

Since Berlyne’s (1957) study, free choice tasks have often been contrasted with forced choice tasks, in which only one response to a stimulus is considered correct. One almost universal observation in the literature is that free choice response times (RTs) are longer than forced choice RTs (but see, e.g., Wirth et al. 2018 for an exception). This RT difference might be taken to indicate qualitative differences with regard to response selection. Accordingly, free choice tasks are often used to operationalize what has been termed self-generated (or intentional, internally generated, intention-based, voluntary, goal-directed) action, while forced choice tasks are often used to operationalize externally triggered (or stimulus-based) action (e.g., Brass and Haggard 2008; Herwig et al. 2007; Passingham et al. 2010; Keller et al. 2006; Waszak et al. 2005). In support of this, there is some evidence that associations between actions and their effects can only be learned in an intention-based action control mode as operationalized with free choice tasks (Herwig et al. 2007; see also Gaschler and Nattkemper 2012; Herwig and Waszak 2009, 2012; Pfister et al. 2010). However, Pfister et al. (2011) reported that these associations are also learned in forced choice tasks. In addition, there is ample evidence that action effects play a role even when using forced choice tasks (e.g., Gozli et al. 2016; Huffman et al. 2018; Janczyk et al. 2012a, b, 2014, 2017; Kühn et al. 2009, Exp. 3; Kunde 2001; Kunde et al. 2012; Pfister and Kunde 2013; Wolfensteller and Ruge 2011). In sum, it appears that the majority of evidence argues for the same role of action effects in forced and free choice tasks. This conclusion received additional support from other lines of research. For example, Janczyk et al. (2015a) compared both task types with regard to their susceptibility to dual-task interference. Although the RT difference was replicated in all experiments, no differences in dual-task costs between free and forced choice tasks were observed, again pointing to similar “action control mechanisms” involved in both tasks. In line with this, the RT difference was attributed to a perceptual source in a further study (Janczyk et al. 2015b). Coming from a different perspective, Bermeitinger and Hackländer (2018) observed that response priming effects induced by motion primes affected both free and forced choice tasks similarly.

If, then, both tasks do not differ regarding their response selection mechanisms, it appears helpful to identify further commonalities. As a step toward this, Naefgen et al. (2017) viewed the RT difference through a sequential sampling lens (e.g., Grice 1968). In such a framework, evidence for or against a response option (or more precisely in the context of that study: the desired goal state, that is, the depressing of a left or right response key) is noisily accumulated over time. Once the total amount of this evidence surpasses one of the thresholds, a response is emitted. This results in three theoretically relevant parameters for a choice type: The speed of evidence accumulation, the thresholds for making a choice, and the time not spent accumulating evidence (such as, e.g., time needed for the motor execution of the choice made). Within this framework, Naefgen et al. then asked whether the RT difference can be attributed to differences in the speed of evidence accumulation or to differences outside the accumulation process. To this end, the amount of catch-trials (e.g., Bausenhart et al. 2010) and time pressure (e.g., Dror et al. 1999) were used to manipulate decision thresholds. If differences in evidence accumulation were the reason, the RT difference should become smaller the lower the thresholds. As this was not observed, the cause is likely located in a process different from evidence accumulation, that is, in the non-accumulation time. The present study aims to address the nature of this process and focuses on the generation of random responses as one candidate.
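To make the three parameters concrete, the following is a minimal random-walk sketch of such an accumulation process. It is not the model used by Naefgen et al. (2017); the function name, the parameter names, the 1-ms step size, and the Gaussian noise are all illustrative assumptions.

```python
import random


def simulate_trial(drift, threshold, non_decision_ms, rng, noise_sd=1.0, step_ms=1):
    """Accumulate noisy evidence until +threshold ("right") or -threshold ("left")."""
    evidence = 0.0
    elapsed_ms = 0
    while abs(evidence) < threshold:
        # Each step adds the systematic drift plus Gaussian noise (assumed).
        evidence += drift + rng.gauss(0.0, noise_sd)
        elapsed_ms += step_ms
    choice = "right" if evidence > 0 else "left"
    # The RT is the accumulation time plus the time not spent accumulating.
    return choice, non_decision_ms + elapsed_ms


rng = random.Random(1)  # fixed seed so the sketch is reproducible
trials = [simulate_trial(drift=0.0, threshold=20.0, non_decision_ms=200, rng=rng)
          for _ in range(200)]
mean_rt = sum(rt for _, rt in trials) / len(trials)
```

In a sketch like this, changing `non_decision_ms` shifts every RT by the same constant regardless of `threshold`, whereas changing `drift` affects only the accumulation time, which shrinks as the threshold is lowered. This is the logic behind the threshold manipulations described above.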

Free choice and random generation tasks

Frith (2013) argued that in free choice tasks, “in essence, the experimenter is asking her subjects to try to be unpredictable and random” (p. 291). He based this argument both on psychological evidence that participants associate randomness with the perception of choices as free (Ebert and Wegner 2011) and on neuroimaging evidence that random choice tasks and free choice tasks activate similar brain regions (Jahanshahi et al. 2000; Jenkins et al. 2000). This becomes even more evident when looking at the similarities between the instructions for free choice tasks and random generation tasks. The former appear in three variants: (1) explicit instructions to choose responses at random, (2) instructions similar to random generation instructions (e.g., avoidance of patterns^{Footnote 2}), and (3) instructions emphasizing spontaneity or freedom of choice. Lastly, there are also studies in which no instruction as to the desired patterns was reported. Examples of these categories can be found in Table 1. Please note that this overview is meant as an illustration and is not exhaustive. One thing illustrated by Table 1 is the prevalence of instructions to avoid patterns in the free choice responses. One reason for such instructions is that, when they are not given, participants sometimes respond with only one, or almost only one, of the response options.

Table 1 Illustrative examples of different instructions for free choice tasks as well as random generation tasks


While this type of instruction could be argued to constrain the choices that participants can give, this is true of all tasks that could feasibly be observed in an experimental laboratory. However, free choice responses are still less constrained than forced choice responses. While free choice instructions and random generation instructions bear similarities, free choice instructions are used this way in the literature on self-generated action and are, as such, worthy of investigation. The next section discusses how random generation tasks are affected by working memory (WM) manipulations.

Random generation and working memory

Baddeley reported that random generation performance can be influenced by various factors such as time constraints (Baddeley 1962, as cited in Baddeley 1966) or concurrently performed tasks (Baddeley 1966), suggesting that the capacity to create random information is limited in some way. As such, it stands to reason that adding a secondary task that involves WM would interfere with the random generation task. For example, Cooper et al. (2012) used a dual-tasking paradigm in which a random digit (1–9) generation task was coupled either with a 2-back task or a go/no-go task. Indeed, performance in the random generation task as measured through RTs and different indices of randomness was worse when combined with the 2-back task.

Additional evidence for a relationship between WM functions and random generation can be derived from principal component analyses. In particular, Miyake et al. (2000) reported correlations between the executive functions of updating and inhibition with measures of randomness (equality of response usage and inhibition of prepotent associates, respectively) as described by Towse and Neil (1998).

In sum, the literature suggests that WM plays a critical role in random generation tasks. The assessment of randomness will be discussed in the next section.

Measuring randomness

A difference between the aforementioned random generation tasks and free choice tasks is that free choice tasks most often have only two response options, whereas the random generation tasks usually had nine. This renders several measures of the randomness of a choice sequence less informative. For example, with two options it cannot be measured, as it can be with nine digits, whether two subsequent responses have adjacent values.

As there is a plethora of different measures of randomness (Towse and Neil 1998 alone described 14 different measures in their review), it is necessary to choose which one(s) to use. For the purposes of the present paper, randomness will be measured through the local unevenness (LU) measure (see, e.g., Heuer et al. 2005, 2010). While earlier studies used a more general form of LU, the following description is specific to a two-response-options situation with left and right responses.

In essence, the LU is a measure of the deviation of empirical responses from an ideal random distribution of responses, as measured in running windows of predefined sizes. “Running window” here means that a sequence is divided into all possible sequential sub-sequences of a predefined length and the formula is applied to all of these sub-sequences. For an illustration of what this looks like, see Fig. 1. The formula for the LU in each segment is as follows:

$$\text{LU}_w = \sqrt{\frac{(p_{\text{left}} - 0.5)^2 + (p_{\text{right}} - 0.5)^2}{2}},$$

where p is the proportion of the respective response option within the respective window. Because in the case of only two options the two proportions are complementary, this formula can be further simplified to:

$$\text{LU}_w = \frac{\sqrt{(2 \cdot p_{\text{left}} - 1)^2}}{2} = \left| p_{\text{left}} - 0.5 \right|.$$

The range of values for the LU lies between 0 and 0.5, where 0 means that in the given window, the distribution is perfectly in line with the expected ratios (i.e., both choices are represented equally often, that is completely evenly) and 0.5 means that only one of the two choices is present in the given window (i.e., the sequence is as uneven as possible).
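The running-window computation can be sketched as follows. This is a minimal illustration that assumes responses are coded as the strings "L" and "R"; the function names and the coding are hypothetical and not taken from the original analysis.

```python
def lu_window(window):
    """LU of one window, using the simplified two-option formula."""
    p_left = window.count("L") / len(window)
    return abs(2 * p_left - 1) / 2


def running_lus(responses, size):
    """LU for every contiguous sub-sequence of the given length."""
    return [lu_window(responses[i:i + size])
            for i in range(len(responses) - size + 1)]


def mean_lu(responses, size):
    """Mean LU over all running windows of one size."""
    lus = running_lus(responses, size)
    return sum(lus) / len(lus)
```

For example, `running_lus(list("LRLL"), 2)` evaluates the windows LR, RL, and LL and yields 0, 0, and 0.5, respectively.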

To illustrate, Fig. 1 gives an example sequence of choices and the resulting LUs, for four different window sizes of 2, 4, 6, and 8, as well as the mean LU for the sequence.

For an infinitely long random sequence, the expected mean value of the LU is, however, not 0.0, as this would imply that in every single segment the options are represented equally often, without, for example, any runs of the same choice. Instead, it is the average of all the potential combinations of the options when taking the order of the options into account. Figure 2 illustrates the potential response option combinations when using a window of the size 4.

This results in an ideal LU of 0.1875, as all these potential sequences have the same chance to appear in a random sequence. The ideal values for the four window sizes mentioned above are 0.25, 0.1875, 0.15625, and 0.1367188 (for window size of 2, 4, 6, and 8, respectively). Mean LUs higher than those ideal values then mean that unbalanced segments were overrepresented in the whole sequence compared to what would be expected in a random sequence. Conversely, mean LUs below those ideal values imply that balanced segments were overrepresented. From this follows that the deviation from these ideal LU values in a sufficiently long sequence can be viewed as a deviation from (ideal) randomness.
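The ideal values can be checked with a short calculation. The sketch below assumes that each response is an independent choice with probability 0.5 per option, so that the number of left responses in a window follows a binomial distribution; the function name is illustrative.

```python
from math import comb


def ideal_lu(size):
    """Expected LU of one window if all 2**size ordered sequences are equally likely."""
    # comb(size, k) ordered sequences contain exactly k left responses.
    total = sum(comb(size, k) * abs(k / size - 0.5) for k in range(size + 1))
    return total / 2 ** size


ideal_values = {size: ideal_lu(size) for size in (2, 4, 6, 8)}
```

Under this assumption, the four entries of `ideal_values` equal 1/4, 3/16, 10/64, and 35/256, which match the values given above after rounding.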

The present study

Our prediction is that, if free choice tasks are random generation tasks, WM manipulations should influence randomness (and also response speed) accordingly. We chose a complementary approach of both lowering and increasing WM load. WM support should then increase randomness (and LUs should be closer to ideally random LUs) and decrease RTs, while experimentally induced WM load should have the opposite effects. To achieve a decrease and an increase in WM load, we either (1) displayed varying amounts of previous choices to reduce the need for participants to remember their choices (Experiment 1), or (2) introduced a concurrent n-back task of varying difficulty (Experiment 2). We then measured the (non-)randomness of the responses in a free choice task via the distance to the ideal LU and the speed of the responses. While analyses of LU are the theoretically most important ones, we also included the analysis of RTs to rule out potential trade-offs. For example, it might be the case that participants change from a focus on more random responses to a focus on faster responses (similar to speed-accuracy trade-offs, where faster responses come with committing more errors). Additionally analyzing RTs makes it possible to detect such phenomena.

Experiment 1

Experiment 1 used a paradigm in which the participants gave free choice responses while receiving different levels of WM support in the form of arrows that display previous choices (for a similar approach, see Hadland et al. 2001). We used WM support because one potential way WM influences the ease with which participants generate random responses is by providing information (i.e., previous responses) that is used to decide which response would look more ‘random’ if chosen next. We predict that with growing WM support the distance from ideal LU will decrease and the RTs will shorten.

Methods

Participants

Thirty people from the Tübingen area participated for monetary compensation (Mean age = 23 years, 26 female, 4 male). All participants reported normal or corrected-to-normal vision, were naïve regarding the underlying hypotheses, and provided written informed consent prior to data collection.

Apparatus and stimuli

Stimulus presentation and response collection were run on a PC connected to a 17-in. CRT monitor. Stimuli were a fixation circle in the middle of the screen as well as arrows, appearing within the fixation circle and, depending on block type, above it. Stimuli were white, presented against a black background. The manual responses were given with the left and right Ctrl keys on a QWERTZ keyboard.

Tasks and procedure

The task was to freely choose one of the two response options. The fixation circle was always visible during blocks slightly below the middle of the screen. After a response, an arrow indicating which response was given in the current trial appeared for 50 ms in the fixation circle. During these 50 ms, no new response could be given. In the two block types with WM support, the same arrow then appeared above the fixation circle, shifting all other already displayed arrows one slot upwards and, once three/seven responses were already given, displacing the oldest arrow at the top of the screen. This results in up to three or seven arrows indicating previous choices that are displayed above the fixation circle, as is illustrated in Fig. 3. The 50 ms in which no new response could be given were the only inter-trial interval. There was no time limit for responses.
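The updating of the displayed arrows can be sketched with a bounded queue. This is only an illustration of the described display logic; the original experiment software is not described here, and all names are hypothetical.

```python
from collections import deque


def make_support_display(n_support):
    """Bounded history of previous choices; the oldest entry drops out when full."""
    return deque(maxlen=n_support)


display = make_support_display(3)
for choice in ["left", "right", "right", "left"]:
    # The newest arrow takes the slot directly above the fixation circle,
    # shifting the older arrows one slot upwards.
    display.appendleft(choice)
```

With `n_support` set to 0, the queue never stores anything, which corresponds to the block type without WM support.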

Responses were collected in blocks of 500 trials with every participant performing all three block types twice, that is, in a total of six blocks. The order of the first three blocks was counterbalanced and the second set of three blocks was ordered in the reverse of the first three blocks. Participants were informed before each block how many of their previous choices would be displayed in this block.

Participants were instructed to give about equal amounts of left and right responses and to avoid patterns (e.g., alternating left and right responses or repeating sequences). There was one test session per participant which lasted about 45 min.

Design and analyses

The dependent variables were the distances from the ideally random LU (LUD) and the RTs. The independent variable was the level of WM support (0 vs. 3 vs. 7). For analyses of LUDs, however, we also analyzed four different window sizes (2 vs. 4 vs. 6 vs. 8). Accordingly, two main analyses were performed: LUDs were analyzed with a 3 × 4 analysis of variance (ANOVA) with WM support and window size as repeated-measures. RTs were analyzed with an ANOVA with WM support as a repeated-measure. Because we predicted decreasing RTs and LUD approaching zero with increasing WM support, we calculated Helmert contrasts on WM support (Contrast 1: no support vs. three and seven previous displayed choices; Contrast 2: three vs. seven displayed previous choices). In case of interactions between window size and the Helmert contrast, separate Helmert contrasts for each window size were calculated and are reported in the “Appendix” section.
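The two Helmert contrasts can be illustrated as weights applied to each participant's three condition means. The sketch below assumes that each contrast is tested with a one-sample t test on the per-participant contrast scores, which gives n − 1 degrees of freedom; the text does not state how the tests were computed, and the names are illustrative.

```python
from math import sqrt
from statistics import mean, stdev

# Helmert weights for the conditions (0, 3, 7 displayed choices).
CONTRAST_1 = (1.0, -0.5, -0.5)  # no support vs. mean of both support levels
CONTRAST_2 = (0.0, 1.0, -1.0)   # three vs. seven displayed choices


def contrast_t(condition_means, weights):
    """Return (t, df) for a one-sample t test on per-participant contrast scores.

    condition_means: one (no_support, three, seven) tuple per participant.
    """
    scores = [sum(w * x for w, x in zip(weights, row)) for row in condition_means]
    n = len(scores)
    return mean(scores) / (stdev(scores) / sqrt(n)), n - 1
```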

LUDs were calculated on the whole data set once sufficient responses were given for the respective window size. For the subsequent analyses, trials were excluded as outliers if their RTs deviated more than 2.5 SDs from the respective cell mean (calculated separately for each participant).
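The exclusion rule can be sketched as follows for the RTs of a single cell, that is, one participant in one condition. The sketch assumes a single, non-iterative pass and the sample standard deviation, neither of which is specified in the text; the function name is hypothetical.

```python
from statistics import mean, stdev


def exclude_outliers(rts, criterion=2.5):
    """Keep RTs that lie within `criterion` SDs of the mean of one cell."""
    cell_mean, cell_sd = mean(rts), stdev(rts)
    return [rt for rt in rts if abs(rt - cell_mean) <= criterion * cell_sd]
```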

Results

The LUDs and average RTs (1.79% outliers) are visualized in Fig. 4 and are summarized in Table 2. For LUDs, Contrast 1 was significant and indicated a difference between conditions with and without memory support, t(29) = 3.79, p = .001, without interacting with window size, t(29) = 1.70, p = .100. However, there was no significant difference between the two memory support conditions according to Contrast 2, t(29) = 0.36, p = .551. Although this contrast interacted with window size, t(29) = 2.68, p = .012, none of the contrasts was significant when tested separately for each window size, all ps ≥ .217 (for more details, see the “Appendix” section).

Table 2 Means (and SD) of RTs in ms and LUDs for Experiment 1 for each WM support condition


Responses were significantly slower in the condition without WM support compared with the two other conditions, Contrast 1: t(29) = 2.63, p = .013, but there was no significant difference between the two WM support conditions, Contrast 2: t(29) = −0.14, p = .886.

Discussion

In sum, response patterns were more random and RTs shortened with the presence of WM support. No such difference was detectable between the different levels of WM support. These results can be taken as first evidence that WM plays a similar role in free choice tasks as it does for random generation tasks.

There is one potential confound in this particular experimental design: The presence of the arrows employed as WM support can be interpreted as a type of action effect (or action outcome), which conceivably differs between the no-support and the two support conditions. Furthermore, the last presented arrow was always spatially compatible with the selected response. Importantly, RTs are shorter when the responses produce compatible action effects compared with incompatible ones (Kunde 2001; see also Janczyk and Lerche 2018; Janczyk et al. 2017; Koch and Kunde 2002). At first glance, this might have contributed to the shorter RTs in the two WM support conditions. However, we believe that this argument does not pose serious problems for several reasons. First, it is important to note that in all conditions an immediate and compatible arrow appeared in the center of the fixation circle. Second, in the two WM support conditions, multiple arrows were always present on the screen. Thus, most of the time (unless the participants repeated responses multiple times), a mixture of compatible and incompatible action effects would be present, which would weaken a potential impact on RTs. Third, the RT difference we observed (roughly 70 ms) is larger than the usual effects of action effect compatibility (e.g., between 20 and 50 ms in Kunde 2001). Hence, if this confound played a role in the RT results, it likely would account only for a part of the difference. Lastly, and potentially most important, it is not clear how the theoretically more important LUD results would be affected by compatible or incompatible action effects.

A further objection might be that the presence of the previous choices on the screen turned the free choice task into a “cue-dependent task”. Of course, we cannot exclude that participants used different strategies between conditions. It is the case, though, that the information about the previous choices was actually always available to the participants in the form of a memory trace. The presence of the WM support arrows merely made it more accessible.

To attain more and converging evidence from a different kind of experimental manipulation, we experimentally increased WM load through an n-back task in Experiment 2.^{Footnote 3}

Experiment 2

In Experiment 2, we paired a free choice task with a WM-intensive task to induce WM load. Specifically, we alternated a free choice task with an n-back task (Kirchner 1958) for this purpose. In all n-back conditions, participants had to react only under specific circumstances: For 0-back, whenever a stimulus (colored circles that were displayed left/right and above/below center on the screen) with a pre-specified color or location appeared, and for 1-, 2-, and 3-back whenever the stimulus color or location in a given trial matched that of the stimulus n trials earlier. The two relevant stimulus features (color vs. location) were chosen to generalize the results and counteract potential modality-specific influences. Furthermore, this experiment completely avoids the potential confound of compatible action effects from Experiment 1. In contrast to the previous experiment, we predict that with an increasing WM load, the LUDs should deviate more from zero and the RTs should increase.
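The target rule of the n-back conditions can be sketched for a single stimulus feature (e.g., a sequence of colors or of locations). The function and parameter names are hypothetical, and the sketch assumes that "n trials" counts n-back stimuli only, not the interleaved free choice trials.

```python
def nback_targets(stimuli, n, preset=None):
    """Flag the trials that require a reaction for one feature stream.

    n == 0: target whenever the stimulus equals the pre-specified value `preset`.
    n > 0:  target whenever the stimulus equals the one presented n trials earlier.
    """
    if n == 0:
        return [s == preset for s in stimuli]
    return [i >= n and stimuli[i] == stimuli[i - n] for i in range(len(stimuli))]
```

For the sequence red, blue, red, red, this rule marks only the third stimulus as a 2-back target and only the fourth as a 1-back target.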