Taking the easy way out? Increasing implementation effort reduces probability maximizing under cognitive load

Schulze, Christin; Newell, Ben R.

doi:10.3758/s13421-016-0595-x

Taking the easy way out? Increasing implementation effort reduces probability maximizing under cognitive load

Published: 16 February 2016

Volume 44, pages 806–818, (2016)
Cite this article

Download PDF

Memory & Cognition Aims and scope Submit manuscript

Taking the easy way out? Increasing implementation effort reduces probability maximizing under cognitive load

Download PDF

Christin Schulze^1,2 &
Ben R. Newell¹

2905 Accesses
8 Citations
1 Altmetric
Explore all metrics

Abstract

Cognitive load has previously been found to have a positive effect on strategy selection in repeated risky choice. Specifically, whereas inferior probability matching often prevails under single-task conditions, optimal probability maximizing sometimes dominates when a concurrent task competes for cognitive resources. We examined the extent to which this seemingly beneficial effect of increased task demands hinges on the effort required to implement each of the choice strategies. Probability maximizing typically involves a simple repeated response to a single option, whereas probability matching requires choice proportions to be tracked carefully throughout a sequential choice task. Here, we flipped this pattern by introducing a manipulation that made the implementation of maximizing more taxing and, at the same time, allowed decision makers to probability match via a simple repeated response to a single option. The results from two experiments showed that increasing the implementation effort of probability maximizing resulted in decreased adoption rates of this strategy. This was the case both when decision makers simultaneously learned about the outcome probabilities and responded to a dual task (Exp. 1) and when these two aspects were procedurally separated in two distinct stages (Exp. 2). We conclude that the effort involved in implementing a choice strategy is a key factor in shaping repeated choice under uncertainty. Moreover, highlighting the importance of implementation effort casts new light on the sometimes surprising and inconsistent effects of cognitive load that have previously been reported in the literature.

Choice under uncertainty and cognitive load

Article Open access 14 March 2024

Time pressure changes how people explore and respond to uncertainty

Article Open access 08 March 2022

The composition of the choice set modulates probability weighting in risky decisions

Article 26 January 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Solving complex problems requires effort, time, and resources. Conventional wisdom suggests that the more difficult the problem at hand, the more effort must be invested to achieve success: Proverbially, Rome was not built in a day, and there is no gain without pain. Paradoxically, however, people’s performance on simple choice tasks—which seem to be surprisingly challenging to begin with—can apparently be boosted by increasing the task demands.

Consider the simple task of choosing between two options that offer the same payoff with unequal odds (e.g., with p = .70 and 1 − p = .30). For example, imagine you are to choose between two casino slot machines: Slot machine A will pay out $10 with a probability of .70, and slot machine B will also pay out $10, but with a probability of .30. Which slot machine would you prefer to play? It is easy to see that a rational decision maker should select the alternative with the higher payoff probability (slot machine A) to maximize her chances of success. The same holds when she faces this choice repeatedly, provided that the outcome probabilities remain stationary and are serially independent. Thus, in repeated risky choice, a simple reward-maximizing strategy is to always select the more probable option—that is, to probability maximize. Yet many people faced with this task apply a more complicated strategy that involves aligning their choice frequencies to the relative probabilities of the outcomes. In the slot machine example, this means switching repeatedly between the options and playing machine A 70 % of the time and machine B 30 % of the time. The expected payoffs of this strategy—called probability matching—are much lower (for a review, see Vulkan, 2000).

Surprisingly, under some conditions, people maximize more (and match less) when the overall task difficulty is increased, for example, by the introduction of a concurrent verbal memory task that competes for cognitive resources (Wolford, Newman, Miller, & Wig, 2004). Yet this finding is surprising only if one assumes that people really believe the structure of the sequential choice task to be simple and the outcome sequence to be random. If they do not believe that the outcomes are statistically independent—which seems a reasonable assumption, in light of everyday experiences of repeated events (see, e.g., Ayton & Fischer, 2004)—they might attempt to outperform the static maximizing strategy by finding a predictable pattern in the outcome sequence (Gaissmaier & Schooler, 2008; Peterson & Ulehla, 1965). Because any predictable pattern must match the outcome frequencies, probability matching would occur as a by-product of such an elaborate search, rather than as a strategy per se. By this account, probability matching represents an ecologically rational response associated with the search for patterns. Wolford et al. (2004) argued that the assumption that people search for patterns in outcome sequences is in line with the reduced probability matching rates observed when cognitive resources are taxed. That is, occupying the cognitive resources needed for vigilant pattern search would undercut such normally occurring search behavior, and thus reduce probability matching.

Other findings also support the pattern search account of probability matching. Gaissmaier and Schooler (2008) demonstrated that participants who probability matched in the absence of a fixed pattern in the outcome sequence were more likely to detect patterns introduced at a later stage. Unturbe and Corominas (2007) found that the complexity of the rules that participants reported to have followed during a sequential choice task was inversely related to probability maximizing behavior (see also McMahon & Scheel, 2010). Moreover, allowing participants to conclusively infer that the outcome-generating process is random—and thereby encouraging them to accept the true absence of patterns in the outcome sequence—reduces probability matching (Morse & Runquist, 1960; Peterson & Ulehla, 1965).

However, not all researchers agree with this pattern search interpretation of probability matching. In fact, probability matching has classically been viewed as a simple mistake that violates the assumptions of traditional (as opposed to ecological) rational choice theory (Vulkan, 2000). Recent research adopting this view has argued that people make this mistake because cognitive constraints motivate them to fall back on cognitively simpler heuristic choice strategies (e.g., Koehler & James, 2009, 2014; Kogler & Kühberger, 2007; West & Stanovich, 2003) or because limitations associated with the task environment—for instance, lack of financial incentives or insufficient outcome feedback—prevent them from learning how best to respond (Newell & Rakow, 2007; Shanks, Tunney, & McCarthy, 2002). According to this account, probability matching is simply a cognitive error that arises from cognitive constraints of the decision maker and/or from structural inadequacies of the choice environment.

Findings from dual-task paradigms (e.g., Wolford et al., 2004) appear to be at odds with this “simple-mistake” interpretation of probability matching, however. If people match by mistake because their cognitive capacity is limited, why would they match less when cognitive resources are further taxed under dual-task conditions? One explanation for this apparent inconsistency relates to differences in the effort involved in implementing probability matching and maximizing (see Koehler & James, 2014). To implement a probability matching strategy, decision makers need to track their choice proportions throughout sequential choice tasks—an effort that can be assumed to involve cognitive mechanisms similar to those that would be required by a concurrent verbal memory task. Probability maximizing, on the other hand, involves only a simple repeated response to the same option and requires less effort to implement. Thus, a concurrent task might reduce the adoption of probability matching simply because it impedes people’s ability or willingness to track their own choices (rather than patterns in the outcome sequence), and consequently drives them to default to choosing the same option repeatedly (i.e., to probability maximize).

Thus, the observation of reduced probability matching under cognitive load does not necessarily discriminate between the “sophisticated pattern search” and “simple-mistake” accounts of probability matching. Moreover, the potentially pivotal role of strategy implementation effort has remained largely unexplored, which may explain the inconsistent findings on the effects of cognitive load in these settings. Specifically, some studies have demonstrated that probability matching decreases under cognitive load (Wolford et al., 2004), whereas others have failed to find differences in probability matching rates under dual- versus single-task conditions (Otto, Taylor, & Markman, 2011).

We aimed to close this gap in the literature by evaluating the role that strategy implementation effort plays in moderating the effects of cognitive load on sequential choice. To this end, we introduced a manipulation that reversed the typical experimental situation, by making the implementation of probability maximizing more taxing and, at the same time, allowing decision makers to probability match via a simple repeated response to a single option. Specifically, we manipulated the allocation of choice options to physical response options, so that repeatedly choosing the same physical option resulted in either probability matching or probability maximizing (see the Method section of Exp. 1 for details). By applying this manipulation in dual-task paradigms, we tested whether cognitive load causes people to repeatedly choose the same physical option rather than the maximizing option. Our two experiments explored the significance of strategy implementation effort when decision makers learn about outcome probabilities and respond to a dual task simultaneously (Exp. 1) or in two procedurally separated stages (Exp. 2).

Experiment 1

Method

Participants

One hundred (51 female, 49 male) undergraduate students from the University of New South Wales with a mean age of 19.63 years (SD = 2.56 years) participated in this experiment in exchange for course credit. These figures exclude one additional participant who strongly favored the infrequent event throughout (indicating misinterpretation of the task) and whose data were therefore disregarded. In addition to course credit, participants could earn a performance-based monetary payoff. Earnings ranged from AU$4.55 to AU$10.95 (AU$1 ≈ US$0.96 at the time of the experiment).

Design and procedure

All participants completed a computer-based repeated binary choice task over 500 trials. We factorially crossed two between-subjects factors: the effort involved in implementing a probability maximizing strategy during the choice task (high vs. low) and the presence of an interleaved concurrent working memory load task (present vs. absent). The concurrent task was a 3-back memory task that asked participants to memorize the three numbers last seen on the screen. We randomly assigned 25 participants to each of the four resulting experimental conditions. Participants could earn performance-based payoffs in both the choice and the memory task (if present), which were paid in cash at the end of the experiment. Participants were instructed to earn as much money as possible and to treat both tasks as equally important. After every block of 100 choice trials, participants received feedback on their accuracy and earnings in both tasks, and the main instructions were reiterated for the subsequent block. There was no practice period before the main task, and participants were not informed in advance about the total number of trials.

Following the choice (and concurrent memory) task(s), all participants were asked to complete a short questionnaire assessing their understanding of the underlying probability structure and their strategy use during the choice task (see the Appendix for details). Specifically, they were asked to estimate the outcome probabilities for each choice alternative and to consider two prediction strategies for ten hypothetical choice trials: (a) choosing the dominant color for all ten trials (i.e., probability maximizing) and (b) choosing the dominant color for seven out of ten trials (i.e., probability matching). Note that the labels “probability maximizing” and “probability matching” were not used in the questionnaire. Participants were then instructed to indicate which strategy, (a) or (b), their choices most closely resembled (1) early and (2) late in the experiment, and which strategy, (a) or (b), they (3) expected to earn them more money and (4) would use if they were to play the game again (see Koehler & James, 2010, for similar post-task strategy evaluation questions).^{Footnote 1}

Choice task

The choice task was adapted from Wolford et al. (2004) and involved repeated binary decisions over 500 choice trials. Each trial started with the presentation of either a fixation cross (working memory load absent) or a digit between 0 and 9 (working memory load present) in the center of the computer screen. Figure 1A illustrates the task screen shown to participants in the working memory load conditions. The presentation of the fixation cross/digit served as a cue for participants to predict which of two colored squares—either a red or a green square—would appear next. The green square appeared in 70 % of trials, and the red square in the remaining 30 % (randomized across participants for red and green majority outcomes). Participants were informed that the sequence of red and green squares was random. Each of the colored squares was mapped to a different location on the screen—either above or below the fixation cross/digit (again, randomized across participants)—and participants made their predictions by pressing the up or down arrow key on the computer keyboard. This color–key mapping was shown on the screen throughout the task, as is illustrated in Fig. 1A. Participants earned two cents for each correct color prediction, and they were encouraged to attempt to earn as much money as possible. Following each choice, either a red or a green square appeared on the screen (in the location indicated by the color–key mapping), participants received verbal feedback about the accuracy of their prediction, and earnings were updated on the screen (see Fig. 1A). The next trial then started with either the fixation cross or a new digit. The primary dependent measure was participants’ proportions of choices of the more probable—that is, dominant—color outcome.

The implementation effort of probability maximizing was manipulated by modifying the allocation of response keys to predicting a red or a green square, so that repeatedly choosing the same physical keyboard key resulted in either probability matching or maximizing. In the low maximizing effort conditions, the color–key allocation remained the same throughout the task, and solely pressing the key corresponding to the majority outcome resulted in probability maximizing, as is shown in Fig. 1B. For example, solely pressing the up arrow key to predict that a green square would appear implemented a probability maximizing strategy for green majority outcomes. In the high maximizing effort conditions, the color–key allocation remained the same on 70 % of trials, but flipped on the other 30 %, which matches the frequencies with which the two colors appeared. The switch was shown in the mapping illustration displayed on the screen. Now, solely pressing the key mostly corresponding to the majority outcome resulted in probability matching, as is shown in Fig. 1C. For example, a participant solely pressing the up arrow key throughout the task would predict a green square to appear on 70 % of trials and a red square to appear on 30 % of trials, and would thus implement a matching strategy for green majority outcomes. Consequently, implementation of probability maximizing was made more difficult. Throughout the article, we refer to these two conditions as fixed color–key mapping (in which probability maximizing was easy to implement) and varied color–key mapping (in which probability maximizing was difficult to implement).

Working memory task

The memory task, which was interwoven with the choice task, required participants in the working memory load conditions to remember the last three numbers shown on the screen. Numbers between 0 and 9 were randomly selected and displayed in the center of the screen at the start of each choice task trial, as is illustrated in Fig. 1A. Once participants had made a choice (there was no time limit), the number was replaced by a fixation cross, followed by feedback on the outcome and earnings. At the start of the next trial, a new digit appeared. Participants were asked to maintain the last three numbers in memory, updating the set of numbers remembered with the appearance of each new digit. At four times at random intervals during each block of 100 choice trials, participants were tested and asked to recall the last three numbers they had seen as accurately as possible. Each correctly recalled digit raised participants’ earnings from the choice task by 5 %.

Results

For all parametric inferential statistics, we conducted Bayesian analyses in addition to using conventional methods of hypothesis testing. On the basis of the default Bayesian analyses of variance (ANOVAs) suggested by Rouder, Morey, Speckman, and Province (2012) and the default Bayesian t tests suggested by Rouder, Speckman, Sun, Morey, and Iverson (2009), we report Bayes factors (BF) that quantify the strength of evidence in favor of the presence of an effect.^{Footnote 2}

Choice task performance

The proportions of participants’ dominant color choices for each block of 100 trials, shown in Fig. 2, were subjected to a 2 (working memory load) × 2 (mapping) × 5 (block) mixed model ANOVA. The main effect of learning across trial blocks was significant, F(2.65, 254.17) = 58.77, p < .001, η _p ² = .380, BF = 5.76 × 10³⁴, and is illustrated by the upward trajectory of all group lines in Fig. 2.^{Footnote 3} We found a main effect of mapping, F(1, 96) = 10.29, p = .002, η _p ² = .097, BF = 17.84; participants who experienced fixed color–key mappings chose the dominant color more frequently (M = .78 across blocks of 100 trials) than did those who experienced varied color–key mappings (M = .70 across blocks of 100 trials). Additionally, the mapping effect significantly interacted with the within-subjects Block factor, F(2.65, 254.17) = 3.43, p = .022, η _p ² = .034, BF = 2.49; the learning slopes of participants in the varied color–key mapping conditions remained flatter across blocks than did those of participants in the fixed color–key mapping conditions (see Fig. 2). Participants under working memory load had a slight tendency to select the dominant color less often (M = .72 across blocks of 100 trials) than did nonloaded participants (M = .76 across blocks of 100 trials); however, neither the main effect of working memory load, F(1, 96) = 2.40, p = .125, η _p ² = .024, BF = 0.73, nor any of the interactions with this factor reached statistical significance (all ps ≥ .184, all BFs ≤ 0.65).

Turning to the individual-level responses, Fig. 3 displays the full range of participants’ proportions of dominant color choices in each trial block and for all conditions. To assess strategy selection in individual participants toward the end of learning, we classified participants’ response proportions in the final trial block as either probability maximizing or probability matching. Participants who selected the dominant color on no less than 95 % of trials in the last block were defined as probability maximizers; participants who allocated their choices within 5 % of the average reward probability of the more probable option (.70 ± .05) were defined as probability matchers (see, e.g., Schulze, van Ravenzwaaij, & Newell, 2015). We carried out three-way chi-square tests to evaluate whether the adoption of probability maximizing and probability matching in the final trial block was associated with the color–key mapping manipulation, contingent on working memory load condition. Under working memory load, participants who experienced varied color–key mappings were 7.67 times less likely to probability maximize in the final trial block than were participants who experienced fixed color–key mappings, χ ²(1) = 7.02, p = .008. For nonloaded participants, we found no association between maximizing and color–key mapping condition during the final trial block; in fact, the same number of participants (seven out of 25) were classified as probability maximizers in both mapping conditions.^{Footnote 4} There was no relationship between the mapping manipulation and the use of probability matching in the final trial block, either for participants under working memory load, χ ²(1) = 0.10, p = .747, or for nonloaded participants, χ ²(1) = 0.14, p = .713.

Moreover, as Fig. 3 highlights, less than 30 % of the participants in each condition were characterized as probability matchers by the final trial block. Critically, this included participants who experienced varied color–key mappings under working memory load, and whose average response proportion converged on probability matching by Block 5. The full range of individual choice proportions shown in Fig. 3 indicates that, in this condition, the proportion of probability matchers (28 %) equaled the proportion of participants selecting colors at random (.50 ± .05 choices to either option) in the final trial block. Across all other conditions, only two other participants responded randomly in the last block.

Working memory task and questionnaire responses

The proportion of correctly remembered numbers in the working memory load task for each block of the choice task (see Table 1) was subjected to a 2 (mapping) × 5 (block) mixed model ANOVA. Memory task performance varied significantly across choice task blocks, with lower accuracy scores during both early and late blocks, although the Bayesian evidence was somewhat ambiguous, F(2.69, 129.24) = 3.09, p = .035, η _p ² = .060, BF = 1.53. Participants who experienced varied color–key mappings were slightly less accurate on the memory task (M = .88 across blocks) than were participants who experienced fixed color–key mappings (M = .92 across blocks). However, neither the main effect of color–key mapping nor the mapping by block interaction was statistically significant. In fact, the Bayesian analysis provided evidence in favor of the absence of an interaction; F(1, 48) = 1.41, p = .241, η _p ² = .029, BF = 0.49, for the main effect of mapping, and F(2.69, 129.24) = 0.24, p = .847, η _p ² = .005, BF = 0.04, for the mapping by block interaction.

Table 1 Performance on the memory task: Mean proportions (and standard deviations) of correctly remembered numbers in each block of 100 choice trials

Full size table

The data from the post-task questionnaire indicated that participants’ outcome probability estimates were least accurate when they experienced varied color–key mappings under working memory load (averaging at Ms = .62 and .37 for the two choice alternatives). By contrast, the mean probability estimates in all other conditions deviated no more than .03 points from the programmed outcome probabilities (.70 and .30). A 2 (working memory load) × 2 (mapping) ANOVA on the absolute distance between the probability estimates for both choice alternatives revealed a significant main effect of mapping, F(1, 96) = 8.38, p = .005, η _p ² = .080, BF = 8.52; the participants who experienced varied color–key mappings during the choice task discriminated less well between the two values (M _diff = .32) than did the participants who experienced fixed color–key mappings (M _diff = .42), judging them to be closer together than they actually were. This suggests that probability learning progressed less accurately when participants experienced varied color–key mappings. No other effects in this analysis were significant; in fact, the Bayesian analyses provided evidence in favor of the absence of these effects (all ps ≥ .169, all BFs ≤ 0.28).

A similar pattern of results was observed for participants’ strategy endorsements. We carried out three-way chi-square tests to evaluate the association between endorsements of maximizing on three survey items (which strategy was used late in the experiment, which was expected to yield the highest payoff, and which would be used again) and color–key mapping, contingent on working memory load condition, as is summarized in Table 2. Under working memory load, relative to participants who had experienced fixed color–key mappings, participants who had experienced varied color–key mappings were 4.33 times less likely to identify probability maximizing as the strategy that would earn them more money, and 3.69 times less likely to say they would use maximizing in future games. No such relationship was found for self-reported strategy use toward the end of learning (“late in the experiment”) or under single-task conditions (see Table 2).

Table 2 Mean proportions of endorsements of probability maximizing as the strategy used toward the end of the experiment, the strategy with the highest expected payoff, and the strategy that would be used again; associations between maximizing endorsements and color–key mapping, contingent on working memory (WM) load condition

Full size table

Discussion

Experiment 1 showed that increasing the effort involved in implementing probability maximizing led to decreased adoption rates of this strategy. This effect was most severe when participants’ cognitive resources were taxed by a concurrent working memory task. These findings suggest that the cognitive effort of attending to a concurrent task does not (paradoxically) cause people to choose the maximizing option more readily per se (as was suggested by Wolford et al., 2004). If that were the case, we would expect to have seen increased probability maximizing in the presence of a concurrent task, regardless of the mapping manipulation. The fact that we did not indicates that other factors, such as the effort involved in implementing a strategy, moderate people’s engagement in probability matching and maximizing under cognitive load. A potential caveat to this conclusion needs to be considered, however. The effect of implementation effort on strategy selection may have been confounded by impaired probability learning when color–key mappings varied under working memory load. Participants in this experimental condition estimated the outcome probabilities least accurately: Many responded at random during the choice task, and half failed to recognize probability maximizing as the superior strategy afterward. In Experiment 2 we addressed this issue and aimed to rule out impaired probability learning as a possible confound by separating probability learning and the imposition of cognitive load.

Increasing the implementation effort of probability maximizing by varying color–key mappings also simplified the implementation of probability matching. This is because the color–key allocation changed with probabilities that matched the outcome frequencies. Therefore, exclusively selecting the key mostly corresponding to the majority outcome would have resulted in “easy” probability matching. Unlike probability maximizing, however, matching can be implemented in various ways over a long sequence of choices, and it is unlikely that the programmed color–key mapping changes strictly corresponded to participants’ concepts of a probability matched outcome sequence. Thus, it is not surprising that the majority of participants who probability matched when experiencing varied color–key mappings under working memory load did so by implementing their own representations of this strategy, whereas only one participant implemented probability matching by pressing a single key repeatedly throughout multiple blocks.

Experiment 2

Because the results of Experiment 1 suggested a possible confound in the mapping manipulation—namely, impaired probability learning when color–key mappings varied under working memory load—we designed a second experiment to address this issue. Experiment 2 replicated the basic task design of Experiment 1, but separated learning about outcome probabilities from responding under cognitive load versus no load by dividing the task into two distinct parts. Through this procedural change, we aimed to isolate the effects of implementation effort on strategy selection from those on probability learning.